RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy14063
         (531 letters)



>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score =  135 bits (343), Expect = 1e-33
 Identities = 99/378 (26%), Positives = 177/378 (46%), Gaps = 11/378 (2%)

Query: 7   KYDEERTEKLTRVQLVETDLKALEPELRKAVNFLELENCVQRKHNEIYQYERYVNMKNLG 66
           +  EE  E    +  ++ +++ LE E+      LE E   + +  E    E    ++ L 
Sbjct: 278 ELREELEELQEELLELKEEIEELEGEISLLRERLE-ELENELEELEERLEELKEKIEALK 336

Query: 67  EHETKVQQMEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSVF-SAK 125
           E   + + + +EL   L  +++  +E+E   S L   E+  +    +  +        A+
Sbjct: 337 EELEERETLLEELEQLLAELEEAKEELEEKLSALL--EELEELFEALREELAELEAELAE 394

Query: 126 TRFECDIAKSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESE 185
            R E +  K E+E       + +++L DL+  L++++  L E +T  EEL   +  +E +
Sbjct: 395 IRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQ 454

Query: 186 IAESRARLADLTREEAKLLDQVEKLAREVSEKRESMQTSRSNNRLIDFVMQLK--SENRV 243
           + E R RL +L RE A+L +++++L +E+S     +    +  R    V  +    E+ +
Sbjct: 455 LEELRDRLKELERELAELQEELQRLEKELSSLEARLDRLEAEQRASQGVRAVLEALESGL 514

Query: 244 SGILGRLGDLGGIDQKYDIAVSTACGA-LNYIVTETVEAGEAVIAAVKRQNVGRVNVIPL 302
            G+ G + +L  + +KY+ A+  A G  L  +V E  E  +  I  +K    GR   +PL
Sbjct: 515 PGVYGPVAELIKVKEKYETALEAALGNRLQAVVVENEEVAKKAIEFLKENKAGRATFLPL 574

Query: 303 DKMQQYHSQCYDKYRTPENVPRLIDLIQVQDEKIRLAFYFATRETLVAQDLNQAKRIGYS 362
           D+++   S   D    P  +    DLI   D K   A  F   +TLV  DL QA+R+   
Sbjct: 575 DRIKPLRSLKSD--AAPGFLGLASDLIDF-DPKYEPAVRFVLGDTLVVDDLEQARRLARK 631

Query: 363 -GGGYRMVTLEGAIIEPS 379
               YR+VTL+G ++EPS
Sbjct: 632 LRIKYRIVTLDGDLVEPS 649



 Score = 58.2 bits (141), Expect = 7e-09
 Identities = 62/286 (21%), Positives = 124/286 (43%), Gaps = 9/286 (3%)

Query: 10  EERTEKLTRVQL----VETDLKALEPELRKAVNFLELENCVQRKHNEIYQYERYVNMKNL 65
           E   E L R++     +E  L+ LE +  KA  + EL+  ++     +   +     K L
Sbjct: 182 ERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELELALLLAKLKELRKEL 241

Query: 66  GEHETKVQQMEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSVFSAK 125
            E E ++ ++E+EL    E +++   E+E  KSEL+ + ++ +E      + +      +
Sbjct: 242 EELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEE--IEE 299

Query: 126 TRFECDIAKSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESE 185
              E  + +  LE   +   +  ++L +L+  +E ++  L ER+TL EEL   +  +E  
Sbjct: 300 LEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEA 359

Query: 186 IAESRARLADLTREEAKLLDQV-EKLAREVSEKRESMQTSRSNNRLIDFVMQLKSENRVS 244
             E   +L+ L  E  +L + + E+LA   +E  E         R I+ + +        
Sbjct: 360 KEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLS-E 418

Query: 245 GILGRLGDLGGIDQKYDIAVSTACGALNYIVTETVEAGEAVIAAVK 290
            +     +L  ++ + +  + T    LN  + E  E  E +   +K
Sbjct: 419 RLEDLKEELKELEAELE-ELQTELEELNEELEELEEQLEELRDRLK 463



 Score = 53.9 bits (130), Expect = 2e-07
 Identities = 36/96 (37%), Positives = 51/96 (53%), Gaps = 4/96 (4%)

Query: 415 TLVAQDLNQAKRIGYS-GGGYRMVTLEGAIIEPSGTMSGGGSNPIRGLMGRKATVSTD-- 471
           TLV  DL QA+R+       YR+VTL+G ++EPSG+++GG  N    L  ++     +  
Sbjct: 616 TLVVDDLEQARRLARKLRIKYRIVTLDGDLVEPSGSITGGSRNKRSSLAQKRELKELEEE 675

Query: 472 -TSLVKDLEVKEKRLASLETELRILSQQKMEVETQL 506
              L   LE  E+ L SL+ ELR L     E+  QL
Sbjct: 676 LAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQL 711



 Score = 45.1 bits (107), Expect = 9e-05
 Identities = 53/233 (22%), Positives = 112/233 (48%), Gaps = 10/233 (4%)

Query: 2   EVLYGKYDEERTEKLTRVQLVETDLKALEPELRKAVNFL-ELENCVQRKHNEIYQYERYV 60
           E L  +   +  E   +++ ++ +L ALE EL +  + L ELE  ++    E+ + +  +
Sbjct: 701 EDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERL 760

Query: 61  NMKNLGEHETKVQQMEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWS 120
                 E E +++ +E+ L    E I++  ++ ++ + EL+ +E++ +E+          
Sbjct: 761 E-----ELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERE 815

Query: 121 VFSAKTRFECDIAKSELEIFLSTQSKETK-KLADLEANLEKVQTTLTERKTLCEELTTRV 179
           + S + R E    + E+E  L  + +E + KL +LE  LE+++  L E K   EEL    
Sbjct: 816 LESLEQRRE--RLEQEIE-ELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEK 872

Query: 180 PAMESEIAESRARLADLTREEAKLLDQVEKLAREVSEKRESMQTSRSNNRLID 232
             +E E+ E      +L  E  +L  ++ +L  E+ + RE ++   +    ++
Sbjct: 873 EELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLE 925



 Score = 42.8 bits (101), Expect = 4e-04
 Identities = 43/226 (19%), Positives = 96/226 (42%), Gaps = 19/226 (8%)

Query: 8   YDEERTEKLTRVQLVETDLKALE---PELRKAVNFLELENCVQRKHNEIYQYERYVNMKN 64
            +EE  E    ++ ++  L+ LE     L +A+  L+ E        E  + +R    + 
Sbjct: 742 LEEELEELEEELEELQERLEELEEELESLEEALAKLKEEI-------EELEEKRQALQEE 794

Query: 65  LGEHETKVQQMEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSVFSA 124
           L E E ++++ E+ L A    ++      E  + E++ +E++ +E  +            
Sbjct: 795 LEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEE--LEEKL-------D 845

Query: 125 KTRFECDIAKSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMES 184
           +   E +  + ELE       +   +  +LE  L++++    E +    EL + +  ++ 
Sbjct: 846 ELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKE 905

Query: 185 EIAESRARLADLTREEAKLLDQVEKLAREVSEKRESMQTSRSNNRL 230
           EI + R RL +L  +  +L  ++ +L  E+ E+ E    +     +
Sbjct: 906 EIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETELEREI 951



 Score = 37.8 bits (88), Expect = 0.018
 Identities = 24/89 (26%), Positives = 47/89 (52%)

Query: 134 KSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAESRARL 193
           +++LE           +L  LE  LE+++  L E +   EEL   + A+E E+ + ++RL
Sbjct: 680 EAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRL 739

Query: 194 ADLTREEAKLLDQVEKLAREVSEKRESMQ 222
            +L  E  +L +++E+L   + E  E ++
Sbjct: 740 EELEEELEELEEELEELQERLEELEEELE 768



 Score = 36.2 bits (84), Expect = 0.054
 Identities = 28/156 (17%), Positives = 78/156 (50%), Gaps = 5/156 (3%)

Query: 67  EHETKVQQMEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSVFSAKT 126
           +++ + ++ E++L    E++++  D +E  + +L+ +E++ +++     +++  + +   
Sbjct: 169 KYKERKEEAERKLERTEENLERLEDLLEELEKQLEKLERQAEKAE----RYQ-ELKAELR 223

Query: 127 RFECDIAKSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEI 186
             E  +  ++L+       +  ++L+ LE  LE++Q  L E +   EEL + +  +  E+
Sbjct: 224 ELELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREEL 283

Query: 187 AESRARLADLTREEAKLLDQVEKLAREVSEKRESMQ 222
            E +  L +L  E  +L  ++  L   + E    ++
Sbjct: 284 EELQEELLELKEEIEELEGEISLLRERLEELENELE 319



 Score = 34.7 bits (80), Expect = 0.16
 Identities = 21/78 (26%), Positives = 40/78 (51%)

Query: 148 TKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAESRARLADLTREEAKLLDQV 207
             +L  LE  L+ ++  L   + L EEL  ++  +E ++ E +  LA L  E  +L  ++
Sbjct: 680 EAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRL 739

Query: 208 EKLAREVSEKRESMQTSR 225
           E+L  E+ E  E ++  +
Sbjct: 740 EELEEELEELEEELEELQ 757



 Score = 30.1 bits (68), Expect = 4.1
 Identities = 50/242 (20%), Positives = 97/242 (40%), Gaps = 36/242 (14%)

Query: 5    YGKYDEERTEKLTRVQLVETDLKALEPELRKAVNFLELENCVQRKHNEIYQ-YERYVNMK 63
              +  +   E+L  ++    + +     L + +  LE     +R   EI +  E    ++
Sbjct: 784  LEEKRQALQEELEELEEELEEAERRLDALERELESLEQR--RERLEQEIEELEEEIEELE 841

Query: 64   N-LGEHETKVQQMEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSVF 122
              L E E +++++E+EL    E +++   E E  + ELK +E++ +E      +      
Sbjct: 842  EKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELE---- 897

Query: 123  SAKTRFECDIAKSELEIFLSTQSKETKKLADLEANLEKVQTTLTER--KTLCEELTTRVP 180
            S     + +I K    +      +   KL  LE  L +++  L E    TL  EL   + 
Sbjct: 898  SELAELKEEIEKLRERL-----EELEAKLERLEVELPELEEELEEEYEDTLETELEREIE 952

Query: 181  AMESEIA---------------------ESRARLADLTREEAKLLDQVEKLAREVSEKRE 219
             +E EI                      E +++  DL   + KLL+ +E+L +E  E+ +
Sbjct: 953  RLEEEIEALGPVNLRAIEEYEEVEERYEELKSQREDLEEAKEKLLEVIEELDKEKRERFK 1012

Query: 220  SM 221
              
Sbjct: 1013 ET 1014


>gnl|CDD|214944 smart00968, SMC_hinge, SMC proteins Flexible Hinge Domain.  This
           entry represents the hinge region of the SMC (Structural
           Maintenance of Chromosomes) family of proteins. The
           hinge region is responsible for formation of the DNA
           interacting dimer. It is also possible that the precise
           structure of it is an essential determinant of the
           specificity of the DNA-protein interaction.
          Length = 120

 Score =  111 bits (279), Expect = 3e-29
 Identities = 39/121 (32%), Positives = 60/121 (49%), Gaps = 6/121 (4%)

Query: 244 SGILGRLGDLGGIDQKYDIAVSTACGA-LNYIVTETVEAGEAVIAAVKRQNVGRVNVIPL 302
            G+LGR+ DL  +D KY+ A+  A G  L  +V +T E  +  I  +K+  +GR   +PL
Sbjct: 1   PGVLGRVADLISVDPKYETALEAALGGRLQAVVVDTEETAKKAIEFLKKNRLGRATFLPL 60

Query: 303 DKMQQYHSQC----YDKYRTPENVPRLIDLIQVQDEKIRLAFYFATRETLVAQDLNQAKR 358
           DK++                P  V   IDL++  D ++R A  +    TLV  DL  A+R
Sbjct: 61  DKIKPRSPAGSKLREALLPEPGFVGPAIDLVEY-DPELRPALEYLLGNTLVVDDLETARR 119

Query: 359 I 359
           +
Sbjct: 120 L 120


>gnl|CDD|219044 pfam06470, SMC_hinge, SMC proteins Flexible Hinge Domain.  This
           family represents the hinge region of the SMC
           (Structural Maintenance of Chromosomes) family of
           proteins. The hinge region is responsible for formation
           of the DNA interacting dimer. It is also possible that
           the precise structure of it is an essential determinant
           of the specificity of the DNA-protein interaction.
          Length = 117

 Score =  109 bits (275), Expect = 1e-28
 Identities = 41/117 (35%), Positives = 61/117 (52%), Gaps = 2/117 (1%)

Query: 244 SGILGRLGDLGGIDQKYDIAVSTACG-ALNYIVTETVEAGEAVIAAVKRQNVGRVNVIPL 302
            G+ GRL DL  +D+ Y+ AV  A G  L  IV +  E  +A I  +K+ N+GR   +PL
Sbjct: 2   KGVYGRLADLIEVDEGYEKAVEAALGGRLQAIVVDDEETAKAAIEFLKKNNLGRATFLPL 61

Query: 303 DKMQQYHSQCYDKYRTPENVPRLIDLIQVQDEKIRLAFYFATRETLVAQDLNQAKRI 359
           D+++            P     LIDL++  DEK+R A  +    TLV  DL++A  +
Sbjct: 62  DRIKP-RPLDDSPREIPGGATPLIDLVEYPDEKLRKALRYLLGNTLVVDDLDEALEL 117


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score =  113 bits (285), Expect = 3e-26
 Identities = 90/370 (24%), Positives = 168/370 (45%), Gaps = 21/370 (5%)

Query: 22  VETDLKALEPELRKAV-NFLELENCVQRKHNEIYQYERYVNMKNLGEHE---TKVQQMEQ 77
           V+  +  LE E+     +  E E  ++     + + E  ++ K L E E    ++++  +
Sbjct: 292 VKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEID-KLLAEIEELEREIEEERK 350

Query: 78  ELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSVFSAKTRFECDIAKSEL 137
                 E   +  +E+E  ++EL+ V+K+  E+      +R  +   K + E +  K EL
Sbjct: 351 RRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKL--EKLKREINELKREL 408

Query: 138 EIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAESRA------ 191
           +       + +++LADL A +  ++  + E +   E+    +   E ++ +  A      
Sbjct: 409 DRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYE 468

Query: 192 -RLADLTREEAKLLDQVEKLAREVSEKRESMQTSRSNNRLIDFVMQLKSENRVSGILGRL 250
             L DL  E  ++  ++ KL RE++E     + S    R    V ++   + + G+ G +
Sbjct: 469 QELYDLKEEYDRVEKELSKLQRELAEAEAQARASEERVRGGRAVEEVLKAS-IQGVHGTV 527

Query: 251 GDLGGIDQKYDIAVSTACGA-LNYIVTETVEAGEAVIAAVKRQNVGRVNVIPLDKMQQYH 309
             LG + ++Y  A+  A G  LN +V E     +  I  +KR+  GR   +PL+KM+   
Sbjct: 528 AQLGSVGERYATAIEVAAGNRLNNVVVEDDAVAKEAIELLKRRKAGRATFLPLNKMRDER 587

Query: 310 SQCYDKYRTPENVPRLIDLIQVQDEKIRLAFYFATRETLVAQDLNQAKRIGYSGGGYRMV 369
                       +   +DL++  D K   AF +   +TLV +D+  A+R+    G YRMV
Sbjct: 588 RDL-SILSEDGVIGFAVDLVEF-DPKYEPAFKYVFGDTLVVEDIEAARRL---MGKYRMV 642

Query: 370 TLEGAIIEPS 379
           TLEG + E S
Sbjct: 643 TLEGELFEKS 652



 Score = 50.8 bits (122), Expect = 2e-06
 Identities = 34/93 (36%), Positives = 49/93 (52%), Gaps = 10/93 (10%)

Query: 415 TLVAQDLNQAKRIGYSGGGYRMVTLEGAIIEPSGTMSGGGSNPIRGLMGRKATVSTDTSL 474
           TLV +D+  A+R+    G YRMVTLEG + E SG M+ GGS   RG +          S 
Sbjct: 623 TLVVEDIEAARRLM---GKYRMVTLEGELFEKSGAMT-GGSRAPRGGILFSR------SE 672

Query: 475 VKDLEVKEKRLASLETELRILSQQKMEVETQLN 507
             +L+   +RL  L+ EL  L  +   +E +L+
Sbjct: 673 PAELQRLRERLEGLKRELSSLQSELRRIENRLD 705



 Score = 42.7 bits (101), Expect = 5e-04
 Identities = 50/221 (22%), Positives = 100/221 (45%), Gaps = 33/221 (14%)

Query: 10  EERTEKLTRVQLVE----TDLKALEPELRKAVNFLELENCVQRKHNEIYQYERYVNMKNL 65
           EE  E + R+ L+       L+ L  E  KA  +  L         E  +YE Y  +K  
Sbjct: 180 EEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLK-------EKREYEGYELLKEK 232

Query: 66  GEHETKVQQMEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSVFSAK 125
              E + + +E++L +  E ++K T+E+   +  L+ +E+  +E N              
Sbjct: 233 EALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELN-------------- 278

Query: 126 TRFECDIAKSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESE 185
            + +      E E     Q +  +K+ +LEA +  ++ ++ E++   E+   R+  +E+E
Sbjct: 279 KKIK---DLGEEE-----QLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAE 330

Query: 186 IAESRARLADLTREEAKLLDQVEKLAREVSEKRESMQTSRS 226
           I +  A + +L RE  +   + +KL  E +E +E ++  R+
Sbjct: 331 IDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRA 371



 Score = 39.7 bits (93), Expect = 0.004
 Identities = 31/144 (21%), Positives = 63/144 (43%), Gaps = 13/144 (9%)

Query: 76  EQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSVFSAKTRFECDIAKS 135
            Q L   LE +K+   E+ S +SEL+ +E +  E          S  S K        + 
Sbjct: 676 LQRLRERLEGLKR---ELSSLQSELRRIENRLDEL-----SQELSDASRKIG----EIEK 723

Query: 136 ELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAESRARLAD 195
           E+E     + K  ++L +LE +L  ++  +   K+  +EL  R+  +E ++ +    L D
Sbjct: 724 EIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALND 783

Query: 196 LTREEA-KLLDQVEKLAREVSEKR 218
           L    +   + +++    ++ E+ 
Sbjct: 784 LEARLSHSRIPEIQAELSKLEEEV 807



 Score = 37.7 bits (88), Expect = 0.017
 Identities = 54/258 (20%), Positives = 96/258 (37%), Gaps = 38/258 (14%)

Query: 18  RVQLVETDLKALEPELRKAVNFLELENCVQRKHNEIYQYERYVNMKNLGEHETKVQQMEQ 77
            +Q +   L+ L+ EL    + L+ E        +    E     + +GE E +++Q+EQ
Sbjct: 675 ELQRLRERLEGLKREL----SSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQ 730

Query: 78  ELTANLESIK-------KCTDEMESAKSELKTVEKK-----------GKESNVIGSKFRW 119
           E     E ++           E+E+ KSELK +E +            +  N + ++   
Sbjct: 731 EEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSH 790

Query: 120 SVFSAKTRFECDIAK--SELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLC----- 172
           S           + +  S +E  L    ++  +L   +  LEK    L E++        
Sbjct: 791 SRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIK 850

Query: 173 ------EELTTRVPAMESEIAESRARLADLTREEAKL---LDQVEKLAREVSEKRESMQT 223
                 E L  +   +E E+ E  A L DL      L    D++E   RE+  K E ++ 
Sbjct: 851 SIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEA 910

Query: 224 SRSNNRLIDFVMQLKSEN 241
                R     ++ K E 
Sbjct: 911 QIEKKRKRLSELKAKLEA 928


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 77.0 bits (190), Expect = 1e-14
 Identities = 115/575 (20%), Positives = 231/575 (40%), Gaps = 64/575 (11%)

Query: 7   KYDEERTEKLTRVQLVETDLKALEP---ELRKAVNFLELE-NCVQR---KHNEIYQYERY 59
           KY E R E   +++    +L  LE    EL + +  LE +    +R      E+ + E  
Sbjct: 169 KYKERRKETERKLERTRENLDRLEDILNELERQLKSLERQAEKAERYKELKAELRELELA 228

Query: 60  VNMKNLGEHETKVQQMEQELTANLESIKKCTDEMESAKSELKTVEKK----GKESNVIGS 115
           + +  L E   +++++++EL    E +++ T E++  + +L+ +  +     +E   +  
Sbjct: 229 LLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQK 288

Query: 116 KFRWSVFSAKTRFECDI--AKSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCE 173
           +  +++ +  +R E      +  L        +   +L +LE+ L+++   L E +   E
Sbjct: 289 EL-YALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLE 347

Query: 174 ELTTRVPAMESEIAESRARLADLTREEAKLLDQVEKLAREVSEKRESM------------ 221
           EL   + ++E+E+ E  A L +L     +L +Q+E L  +V++    +            
Sbjct: 348 ELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEA 407

Query: 222 ---QTSRSNNRLIDFVMQL---KSENRVSGILGRLGDLGGIDQKYDIAVSTACGALNYIV 275
              +      RL   + +L     E  +  +   L +L    ++    +     AL  + 
Sbjct: 408 RLERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELR 467

Query: 276 TETVEAGEAVIAAVKRQNVGRVNVIPLDKMQQYHSQCYDKYRTPEN--------VPRLID 327
            E  EA +A+ AA +     +  +  L+++Q+      +  +            +  L +
Sbjct: 468 EELEEAEQALDAAERELAQLQARLDSLERLQENLEGFSEGVKALLKNQSGLSGILGVLSE 527

Query: 328 LIQVQDEKIRLAFYFATRETL---VAQDLNQAKRIGYS---GGGYRMVTLEGAIIEPSET 381
           LI V DE    A   A    L   V ++LN AK+           R+  L    I+ +E 
Sbjct: 528 LISV-DEGYEAAIEAALGGRLQAVVVENLNAAKKAIAFLKQNELGRVTFLPLDSIKGTEI 586

Query: 382 LVAQDLNQAKRIGYSGGGYRMVTLEGAI---IEP--SETLVAQDLNQAKRI-GYSGGGYR 435
                       G+ G    +V  +  +   +       LV  DL+ A  +      GYR
Sbjct: 587 QGNDREILKNIEGFLGVAKDLVKFDPKLRKALSYLLGGVLVVDDLDNALELAKKLRPGYR 646

Query: 436 MVTLEGAIIEPSGTMSGGGSNPIRGLMGRKATVSTDTSLVKDLEVK-----------EKR 484
           +VTL+G ++ P G ++GG +     ++ R+  +      +++LE K            K 
Sbjct: 647 IVTLDGDLVRPGGVITGGSAKTNSSILERRREIEELEEKIEELEEKIAELEKALAELRKE 706

Query: 485 LASLETELRILSQQKMEVETQLNCTGNELKYKKQE 519
           L  LE EL  L ++  E+  Q++    +L   + E
Sbjct: 707 LEELEEELEQLRKELEELSRQISALRKDLARLEAE 741



 Score = 40.8 bits (96), Expect = 0.002
 Identities = 44/221 (19%), Positives = 97/221 (43%), Gaps = 8/221 (3%)

Query: 10  EERTEKLTRVQLVETDLKALEPELRKAVNFLELENCVQRKHNEIYQYERYVNM--KNLGE 67
           EE  EK+  ++    +L+    ELRK     ELE  +++   E+ +  R ++   K+L  
Sbjct: 680 EELEEKIEELEEKIAELEKALAELRKE--LEELEEELEQLRKELEELSRQISALRKDLAR 737

Query: 68  HETKVQQMEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSVFSAKTR 127
            E +V+Q+E+ +    + + +   E+E  +  L+  E++  E+     +    +   + +
Sbjct: 738 LEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQI--EQLK 795

Query: 128 FECDIAKSELEIFLSTQSKETK-KLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEI 186
            E    +  L+  L  +      + A+L   LE ++  +   +   E+L  ++  +  +I
Sbjct: 796 EELKALREALDE-LRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDI 854

Query: 187 AESRARLADLTREEAKLLDQVEKLAREVSEKRESMQTSRSN 227
               A + +L     +L  ++E L  E +   E++   RS 
Sbjct: 855 ESLAAEIEELEELIEELESELEALLNERASLEEALALLRSE 895


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 39.8 bits (94), Expect = 0.003
 Identities = 40/166 (24%), Positives = 70/166 (42%), Gaps = 29/166 (17%)

Query: 59  YVNMKNLGEHETKVQQMEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFR 118
           Y   K + E   K+++ E+E    LE  KK   E E+ K E   +E K +          
Sbjct: 22  YFVRKKIAE--AKIKEAEEEAKRILEEAKK---EAEAIKKE-ALLEAKEE---------- 65

Query: 119 WSVFSAKTRFECDIAKSELEIFL---------STQSKETKKLADLEANLEKVQTTLTERK 169
             +   +  FE ++ +   E+               ++ + L   E  LEK +  L +++
Sbjct: 66  --IHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQ 123

Query: 170 TLCEELTTRVPAMESEIAESRARLADLTREEAK--LLDQVEKLARE 213
              E+    +  +  E  +   R++ LT EEAK  LL++VE+ AR 
Sbjct: 124 QELEKKEEELEELIEEQLQELERISGLTAEEAKEILLEKVEEEARH 169


>gnl|CDD|226883 COG4477, EzrA, Negative regulator of septation ring formation [Cell
           division and chromosome partitioning].
          Length = 570

 Score = 38.9 bits (91), Expect = 0.007
 Identities = 42/219 (19%), Positives = 92/219 (42%), Gaps = 28/219 (12%)

Query: 8   YDEERTEKLTRVQLVETDLKALEPELRKAVNFLELENCVQRKHNEIYQYERYVNMKNLGE 67
           Y  E     +R++ ++  L      L +    LEL+   + +   I   E+  ++ +L E
Sbjct: 246 YHLEHVNIDSRLERLKEQLVENSELLTQ----LELDE-AEEELGLIQ--EKIESLYDLLE 298

Query: 68  HETKVQQMEQELTANL-ESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSVFSAKT 126
            E + + + +E    L + ++K  +  E  K E++ V++  + +       R      K 
Sbjct: 299 REVEAKNVVEENLPILPDYLEKAKENNEHLKEEIERVKESYRLAETELGSVR------KF 352

Query: 127 RFECDIAKSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEI 186
             E    +S L+  L     +    ++L+ NLE+++  LT+              +E E 
Sbjct: 353 EKELKELESVLDEILENIEAQEVAYSELQDNLEEIEKALTD--------------IEDEQ 398

Query: 187 AESRARLADLTREEAKLLDQVEKLAREVSEKRESMQTSR 225
            + +  L  L ++E +  + +E+L  ++ E +  M+ S 
Sbjct: 399 EKVQEHLTSLRKDELEARENLERLKSKLHEIKRYMEKSN 437


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 37.7 bits (88), Expect = 0.015
 Identities = 35/187 (18%), Positives = 65/187 (34%), Gaps = 19/187 (10%)

Query: 65  LGEHETKVQQMEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSVF-- 122
           L E     Q   +E  +  E      +  E  + E   +E + +E+       R  +   
Sbjct: 330 LEECRVAAQAHNEEAESLREDADDLEERAEELREEAAELESELEEAREAVEDRREEIEEL 389

Query: 123 -----SAKTRFE-----CDIAKSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLC 172
                  + RF         A+  LE     + +  ++ A+LEA L   +  + E + L 
Sbjct: 390 EEEIEELRERFGDAPVDLGNAEDFLEELREERDELREREAELEATLRTARERVEEAEALL 449

Query: 173 EE---LTTRVPAMESEIA----ESRARLADLTREEAKLLDQVEKLAREVSEKRESMQTSR 225
           E         P   S       E R R+ +L  E   L ++VE++   +    + ++   
Sbjct: 450 EAGKCPECGQPVEGSPHVETIEEDRERVEELEAELEDLEEEVEEVEERLERAEDLVEAED 509

Query: 226 SNNRLID 232
              RL +
Sbjct: 510 RIERLEE 516



 Score = 33.5 bits (77), Expect = 0.35
 Identities = 26/99 (26%), Positives = 40/99 (40%), Gaps = 4/99 (4%)

Query: 129 ECDIAKSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAE 188
           E      ELE  L    +E     ++E  LE+    L E +   E L  R   +E  IAE
Sbjct: 472 EDRERVEELEAELEDLEEEV---EEVEERLERA-EDLVEAEDRIERLEERREDLEELIAE 527

Query: 189 SRARLADLTREEAKLLDQVEKLAREVSEKRESMQTSRSN 227
            R  + +      +L ++  +L  E  EKRE+   +   
Sbjct: 528 RRETIEEKRERAEELRERAAELEAEAEEKREAAAEAEEE 566



 Score = 30.0 bits (68), Expect = 4.3
 Identities = 31/164 (18%), Positives = 63/164 (38%), Gaps = 12/164 (7%)

Query: 65  LGEHETKVQQ---MEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSV 121
           L EHE + ++   +E E+    E+I +   E E    E++ + ++ +E   +  +     
Sbjct: 243 LEEHEERREELETLEAEIEDLRETIAETEREREELAEEVRDLRERLEE---LEEE----- 294

Query: 122 FSAKTRFECDIAKSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPA 181
                  E  +  ++ E   + + +   +  +L   LE+ +          E L      
Sbjct: 295 -RDDLLAEAGLDDADAEAVEARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADD 353

Query: 182 MESEIAESRARLADLTREEAKLLDQVEKLAREVSEKRESMQTSR 225
           +E    E R   A+L  E  +  + VE    E+ E  E ++  R
Sbjct: 354 LEERAEELREEAAELESELEEAREAVEDRREEIEELEEEIEELR 397



 Score = 28.9 bits (65), Expect = 8.1
 Identities = 25/109 (22%), Positives = 45/109 (41%), Gaps = 14/109 (12%)

Query: 133 AKSELEIFLSTQSKETKKLADLEANLEKVQTTLT----ERKTLCEELTTRVPAMESEIAE 188
            + E +  L    +  ++L  LEA +E ++ T+     ER+ L EE+      +E    E
Sbjct: 235 TRDEADEVLEEHEERREELETLEAEIEDLRETIAETEREREELAEEVRDLRERLEELEEE 294

Query: 189 SRARLADLTREEA----------KLLDQVEKLAREVSEKRESMQTSRSN 227
               LA+   ++A          +L D+ E+L   + E R + Q     
Sbjct: 295 RDDLLAEAGLDDADAEAVEARREELEDRDEELRDRLEECRVAAQAHNEE 343


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
           protein is found in bacteria and eukaryotes. Proteins in
           this family are typically between 943 to 1234 amino
           acids in length. This family contains a P-loop motif
           suggesting it is a nucleotide binding protein. It may be
           involved in replication.
          Length = 1198

 Score = 37.4 bits (87), Expect = 0.022
 Identities = 48/240 (20%), Positives = 92/240 (38%), Gaps = 34/240 (14%)

Query: 2   EVLYGKYDEERTEKLTRVQLVETDLKALEPELRKAVNFLELENCVQRKHNEIYQYERYVN 61
           E L   + E RTE+L + Q+VE +L     +L  A+         Q K            
Sbjct: 706 EALKDDFRELRTERLAKWQVVEGELDNQLAQLSAAIEAAR----TQAKA----------- 750

Query: 62  MKNLGEHETKVQQMEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSV 121
              L E + +  +    L  +  ++K+   ++E  ++ ++ +  +  E         +  
Sbjct: 751 --RLKELKKQYDRELASLDVDPNTVKELKRQIEELETTIERIAVRRPEVR------EYRA 802

Query: 122 FSAKTRFECDIAKSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPA 181
           F  +T    D  + E              L +LE++ E++Q  LT    L ++   R   
Sbjct: 803 FMQETWLHRDSLREERPNLAIQ-------LRELESSAEELQQELT---RLIKDTKLRRKK 852

Query: 182 MESEIAESRARLADLTREEAKLLDQVEKLAREVSEKRESMQTSRSNNRLIDFVMQLKSEN 241
           +E E      +L  L      L D++ +LA E+ E   + Q   S +  +D + + K + 
Sbjct: 853 LEQERKALEKQLDQLDELLRGLRDEMRQLA-ELKEPANANQAEGSISERLDQLEEFKRKR 911



 Score = 30.4 bits (69), Expect = 3.0
 Identities = 28/178 (15%), Positives = 71/178 (39%), Gaps = 26/178 (14%)

Query: 68  HETKVQQMEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSK----FRWSVFS 123
            ET+++Q++ +L   LE  +   + ++    EL   E+  K   V G       + S   
Sbjct: 683 AETQLRQLDAQLKQLLEQQQAFLEALKDDFREL-RTERLAKWQVVEGELDNQLAQLSAAI 741

Query: 124 AKTRFECDIAKSELE-----------IFLSTQSKETKKLADLEANLEKV----------Q 162
              R +      EL+           +  +T  +  +++ +LE  +E++          +
Sbjct: 742 EAARTQAKARLKELKKQYDRELASLDVDPNTVKELKRQIEELETTIERIAVRRPEVREYR 801

Query: 163 TTLTERKTLCEELTTRVPAMESEIAESRARLADLTREEAKLLDQVEKLAREVSEKRES 220
             + E     + L    P +  ++ E  +   +L +E  +L+   +   +++ ++R++
Sbjct: 802 AFMQETWLHRDSLREERPNLAIQLRELESSAEELQQELTRLIKDTKLRRKKLEQERKA 859


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kind of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 36.9 bits (85), Expect = 0.029
 Identities = 43/230 (18%), Positives = 80/230 (34%), Gaps = 29/230 (12%)

Query: 10  EERTEKLTRVQ-------------------LVETDLKALEPELRKAVNFLELENCVQRKH 50
           +ER +KL                       L E   KALE    K    LE EN +   +
Sbjct: 173 KERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEENLLYLDY 232

Query: 51  NEIYQYERYVNMKNLGEHETKVQQMEQELTANLESIKKCTDEMESAKSELKTVEKKGKES 110
            ++ +    +  + L + + +++  +QEL    E + +   E +  + E K  E++ K  
Sbjct: 233 LKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLL 292

Query: 111 NVIGSKFRWSVFSAKTRFECDIAKSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKT 170
                    +    + + E    +           +  K+L  LE  L+K +  + E + 
Sbjct: 293 ---------AKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEK 343

Query: 171 LCEELTTRVPAMESEIAESRAR-LADLTREEAKLLDQVEKLAREVSEKRE 219
             +EL  +  A E E  +           EE  L  +  +  R  S  + 
Sbjct: 344 ELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESERLSSAAKL 393


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 36.1 bits (84), Expect = 0.044
 Identities = 27/78 (34%), Positives = 43/78 (55%), Gaps = 8/78 (10%)

Query: 146 KETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAESRARL---ADLTREEAK 202
           ++ + L   E NLEK +  L+ ++   + L  +   +E  IAE R  L   + LT+EEAK
Sbjct: 94  RKMESLDKKEENLEKKEKELSNKE---KNLDEKEEELEELIAEQREELERISGLTQEEAK 150

Query: 203 --LLDQVEKLAREVSEKR 218
             LL++VE+ AR  + K 
Sbjct: 151 EILLEEVEEEARHEAAKL 168


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 35.9 bits (83), Expect = 0.057
 Identities = 36/216 (16%), Positives = 83/216 (38%), Gaps = 8/216 (3%)

Query: 7   KYDEERTEKLTRVQLVETDLKALEPELRKAVNFLELENCVQRKHNEIYQYERYVNMKNLG 66
              EE  EKL +++ +  +L+ L+ +L+      EL     R        E    ++   
Sbjct: 526 ALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTRK 585

Query: 67  EHETKVQQMEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSVFSAKT 126
           E   ++++  +EL   L+ +++   ++E     L+  E + +       +    +     
Sbjct: 586 EELEELRERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAE-EELESELEKLNL 644

Query: 127 RFECDIAKSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEI 186
           + E +         L  + +E +     E    + +  L E+    E+L       E E+
Sbjct: 645 QAELEELLQAALEELEEKVEELEAEIRRELQRIENEEQLEEKLEELEQL-------EEEL 697

Query: 187 AESRARLADLTREEAKLLDQVEKLAREVSEKRESMQ 222
            + R  L +L ++  ++   +E+L    +E  E  +
Sbjct: 698 EQLREELEELLKKLGEIEQLIEELESRKAELEELKK 733



 Score = 31.7 bits (72), Expect = 1.3
 Identities = 39/235 (16%), Positives = 91/235 (38%), Gaps = 26/235 (11%)

Query: 7   KYDEERTEKLTRVQLVETDLKALEPELRKAVNFLELENCVQRKHNEIYQYERYVNMKNLG 66
               E  E L      E        E+++     ELE     +   +   ER   ++   
Sbjct: 195 SELLEDIEDLLEALEEELKELKKLEEIQEEQEEEELEQ----EIEALE--ERLAELEEEK 248

Query: 67  EHETKVQ-QMEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSVFSAK 125
           E   +++ ++ +  +  LE++K   +E+   +  L+ +E+K +                 
Sbjct: 249 ERLEELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLE-------------- 294

Query: 126 TRFECDIAKSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESE 185
              E +    ELE  L       ++L +L   L+ ++  L + +   E+L + +  +  E
Sbjct: 295 ---ELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEE 351

Query: 186 IAESRARLADLTREEAKLLDQVEKLAREVSEKRESMQTSRSNNRLIDFVMQLKSE 240
             E    L +  +E  + L+++EK   +  E+ + ++ +     L + + +L + 
Sbjct: 352 KNELAKLLEERLKELEERLEELEKELEKALERLKQLEEAIQ--ELKEELAELSAA 404


>gnl|CDD|113356 pfam04582, Reo_sigmaC, Reovirus sigma C capsid protein. 
          Length = 326

 Score = 35.3 bits (81), Expect = 0.071
 Identities = 16/48 (33%), Positives = 29/48 (60%)

Query: 150 KLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAESRARLADLT 197
           +L+DL A+L+ + T+L + K     L T V A+++ +    + LADL+
Sbjct: 57  RLSDLSADLQDMTTSLDDAKATLNGLITTVQALQTSVDSLSSELADLS 104


>gnl|CDD|223599 COG0525, ValS, Valyl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 877

 Score = 35.3 bits (82), Expect = 0.10
 Identities = 30/165 (18%), Positives = 58/165 (35%), Gaps = 34/165 (20%)

Query: 77  QELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSVFSAKTRFECDIAK-- 134
           +E     E +K+    + + ++E+        +  ++GS+            E DI    
Sbjct: 718 EEAEKEFELLKEIISAIRNLRAEMNLSPSAPLKVVLVGSEELEDRL---EANEDDIKGLA 774

Query: 135 -----------------------SELEIFLSTQSK--ETKKLADLEANLEKVQTTL--TE 167
                                     E+FL          +LA LE  LEK++  +   E
Sbjct: 775 NLEELEILSPDPEEPPVEVTAVVGGAELFLPLAGLIDLAAELARLEKELEKLEKEIDRIE 834

Query: 168 RKTLCEELTTRVPAMESEIAESRARLADLTREEAKLLDQVEKLAR 212
           +K   E    + P  E  + + + +LA+   + AKL +++  L  
Sbjct: 835 KKLSNEGFVAKAP--EEVVEKEKEKLAEYQVKLAKLEERLAVLKA 877


>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
           interacts with SMC proteins [Cell division and
           chromosome partitioning].
          Length = 622

 Score = 34.6 bits (79), Expect = 0.15
 Identities = 31/191 (16%), Positives = 67/191 (35%), Gaps = 11/191 (5%)

Query: 38  NFLELENCVQRKHNEIYQYERYVNMKNLGEHETKVQQMEQELTANLESIKKCTDEMESAK 97
           N+   E  ++    +        ++ NL      + +  QE     + IK     +    
Sbjct: 250 NYEPSEQELKLGFEKFVH-IINTDIANLKTQNDNLYEKIQEAMKISQKIK----TLREKW 304

Query: 98  SELKTVEKK--GKESNVIGSKFRWSVFSAKTRFECDIAKSELEIFLSTQSKETKKLADLE 155
             LK+   K     + +      W     K + E ++ + E++   S   +  K+L    
Sbjct: 305 RALKSDSNKYENYVNAMKQKSQEWPGKLEKLKSEIELKEEEIKALQSNIDELHKQLRKQG 364

Query: 156 ANLEKVQTTLTERKTLCEELTTRVPAMESEIAESRARLADLTREEAKLLDQVEKLAREVS 215
            + E+ +    ER    E+LT  +  +  +  +    +     E   +   +EK  R+  
Sbjct: 365 ISTEQFELMNQER----EKLTRELDKINIQSDKLTKSVKSRKLEAQGIFKSLEKTLRQYD 420

Query: 216 EKRESMQTSRS 226
              +++  SRS
Sbjct: 421 SLIQNITRSRS 431


>gnl|CDD|205039 pfam12718, Tropomyosin_1, Tropomyosin like.  This family is a set
           of eukaryotic tropomyosins. Within the yeast Tmp1 and
           Tmp2, biochemical and sequence analyses indicate that
           Tpm2p spans four actin monomers along a filament,
           whereas Tpmlp spans five. Despite its shorter length,
           Tpm2p can compete with Tpm1p for binding to F-actin.
           Over-expression of Tpm2p in vivo alters the axial
           budding of haploids to a bipolar pattern, and this can
           be partially suppressed by co-over-expression of Tpm1p.
           This suggests distinct functions for the two
           tropomyosins, and indicates that the ratio between them
           is important for correct morphogenesis. The family also
           contains higher eukaryote Tmp3 members.
          Length = 143

 Score = 32.6 bits (75), Expect = 0.18
 Identities = 27/88 (30%), Positives = 41/88 (46%), Gaps = 14/88 (15%)

Query: 148 TKKLADLEANLEKVQTTLTERKTLCEE----------LTTRVPAMESEIAESRARLADLT 197
            KK   LE  +EK++  L E K   EE          LT R+  +E E+ ES  RL + T
Sbjct: 41  QKKNQQLEEEVEKLEEQLKEAKEKLEESEKLATNAEALTRRIQLLEEELEESEKRLKETT 100

Query: 198 REEAKLLDQVEKLAREVSEKRESMQTSR 225
            +    L + +K A E   K ++++  R
Sbjct: 101 EK----LREADKKAEESERKVKALENER 124


>gnl|CDD|221303 pfam11902, DUF3422, Protein of unknown function (DUF3422).  This
           family of proteins are functionally uncharacterized.
           This protein is found in bacteria, archaea and
           eukaryotes. Proteins in this family are typically
           between 426 to 444 amino acids in length.
          Length = 419

 Score = 34.1 bits (79), Expect = 0.18
 Identities = 14/56 (25%), Positives = 28/56 (50%), Gaps = 5/56 (8%)

Query: 174 ELTTRVPAMESEIAESRARLADLTREEAKLLDQVEKLAREVSEKRESMQTSRSNNR 229
           EL+  +  +E  +A     +AD   ++ +LL ++ +LA EV    E++    +  R
Sbjct: 212 ELSPELAELERRLAALTQAMADSEADDEELLSELTELAAEV----EALAA-ATTFR 262


>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein.  This family consists
           of several eukaryotic mitotic checkpoint (Mitotic arrest
           deficient or MAD) proteins. The mitotic spindle
           checkpoint monitors proper attachment of the bipolar
           spindle to the kinetochores of aligned sister chromatids
           and causes a cell cycle arrest in prometaphase when
           failures occur. Multiple components of the mitotic
           spindle checkpoint have been identified in yeast and
           higher eukaryotes. In S.cerevisiae, the existence of a
           Mad1-dependent complex containing Mad2, Mad3, Bub3 and
           Cdc20 has been demonstrated.
          Length = 722

 Score = 33.7 bits (77), Expect = 0.29
 Identities = 41/205 (20%), Positives = 85/205 (41%), Gaps = 9/205 (4%)

Query: 18  RVQLVETDLKALEPELRKAVNFLELENCVQRKHNEIYQYERYVNMKNLGEHETKVQQMEQ 77
           R Q  +T+LK LE EL +    LE       +  +  Q        +  ++  K++ +E+
Sbjct: 179 RAQNADTELKLLESELEELREQLEECQKELAEAEKKLQ-SLTSEQASSADNSVKIKHLEE 237

Query: 78  ELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSVFSAKTRFECDIAKSEL 137
           EL    +      + ++S K +L  + +  +E   +  + R      +          +L
Sbjct: 238 ELKRYEQDA----EVVKSMKEQLLQIPELERELAALREENRKLRSMKEDNELLKEELEDL 293

Query: 138 EIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAESRARLADLT 197
           +  L    K  +KLADLE   EK++  L   K+L +++   +   +    +   R+  L 
Sbjct: 294 QSRLERFEKMREKLADLELEKEKLENELKSWKSLLQDIGLNLRTPD----DLSRRIVVLQ 349

Query: 198 REEAKLLDQVEKLAREVSEKRESMQ 222
            EE +L ++   ++    +   ++Q
Sbjct: 350 NEELQLKEKNGSISSSAKQLETTLQ 374



 Score = 29.9 bits (67), Expect = 4.4
 Identities = 49/229 (21%), Positives = 88/229 (38%), Gaps = 20/229 (8%)

Query: 1   MEVLYGKYDEERTEKL-TRVQL--VETDLKALEPELRKAVNFLELENCVQRKHNEI---Y 54
            +V       +R E + ++ +L  +E +L   E E ++A   LE +     ++ E     
Sbjct: 49  KQVEESMTLLQRAELIRSKSKLIQLENELMQKELEHKRAQIELERKASTLAENYERELDR 108

Query: 55  QYERYVNMKNLGEHETKVQQMEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIG 114
             E  V +K L E E K +    E     + +K   D  ES K + +  ++  +    I 
Sbjct: 109 NLELEVRLKALEELEKKAENEAAEAEEEAKLLKDKLDA-ESLKLQNEKEDQLKEAKESIS 167

Query: 115 ------SKFRWSVFSAKT-----RFECDIAKSELEIFLSTQSKETKKLADLEANLEKVQT 163
                 S+ +    +A T       E +  + +LE      ++  KKL  L +       
Sbjct: 168 RIKNDLSEMQCRAQNADTELKLLESELEELREQLEECQKELAEAEKKLQSLTSEQASSAD 227

Query: 164 TLTERKTLCEELTTRVP--AMESEIAESRARLADLTREEAKLLDQVEKL 210
              + K L EEL        +   + E   ++ +L RE A L ++  KL
Sbjct: 228 NSVKIKHLEEELKRYEQDAEVVKSMKEQLLQIPELERELAALREENRKL 276


>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD.  This
           model describes the DndB protein encoded by an operon
           associated with a sulfur-containing modification to DNA.
           The operon is sporadically distributed in bacteria, much
           like some restriction enzyme operons. DndD is described
           as a putative ATPase. The small number of examples known
           so far include species from among the Firmicutes,
           Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
           metabolism, Restriction/modification].
          Length = 650

 Score = 33.5 bits (77), Expect = 0.30
 Identities = 32/149 (21%), Positives = 63/149 (42%), Gaps = 14/149 (9%)

Query: 9   DEERTEKLTRVQLVETDLKALEPELRKAVNFLELE-NCVQRKHNEIYQYERYVNM-KNLG 66
             E T+    +Q V+ +L+  + +L K +  LE E   V +K + I   E+   + + LG
Sbjct: 372 GSELTQLEVLIQQVKRELQDAKSQLLKELRELEEELAEVDKKISTIPSEEQIAQLLEELG 431

Query: 67  EHETKVQQMEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSVFSAKT 126
           E + ++ + E E+   L  ++   + +E+ +  L    K+   +  +      +      
Sbjct: 432 EAQNELFRSEAEIEELLRQLETLKEAIEALRKTLDEKTKQKINAFELERAITIA------ 485

Query: 127 RFECDIAKSELEIFLSTQSKETKKLADLE 155
               D AK  L+ F     +  +KL  LE
Sbjct: 486 ----DKAKKTLKEFREKLLE--RKLQQLE 508


>gnl|CDD|219934 pfam08614, ATG16, Autophagy protein 16 (ATG16).  Autophagy is a
           ubiquitous intracellular degradation system for
           eukaryotic cells. During autophagy, cytoplasmic
           components are enclosed in autophagosomes and delivered
           to lysosomes/vacuoles. ATG16 (also known as Apg16) has
           been shown to be bind to Apg5 and is required for the
           function of the Apg12p-Apg5p conjugate in the yeast
           autophagy pathway.
          Length = 194

 Score = 32.5 bits (74), Expect = 0.37
 Identities = 25/92 (27%), Positives = 45/92 (48%), Gaps = 1/92 (1%)

Query: 151 LADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAESRARLADLTREEAKLLDQVEKL 210
           +A++E  L K++  LTE      EL  R+  +  E+ + R  +  L +  A+L  ++  L
Sbjct: 69  IAEMEQKLAKLREELTELHKKRGELAQRLLLLNDELEQLRREIQQLEKTIAELRSEITSL 128

Query: 211 AREVSEKRESMQT-SRSNNRLIDFVMQLKSEN 241
             E+ + RE +Q   + N  L D ++ L  E 
Sbjct: 129 ETEIRDLREELQEKEKDNETLQDELISLNIEL 160


>gnl|CDD|181676 PRK09165, PRK09165, replicative DNA helicase; Provisional.
          Length = 497

 Score = 33.3 bits (77), Expect = 0.37
 Identities = 11/44 (25%), Positives = 27/44 (61%), Gaps = 3/44 (6%)

Query: 173 EELTTRVPAMESEIAESRARLADLTREEAKLLDQVEKLAREVSE 216
           E+L TR+ + +SEI+ S+ R   ++ E+    +++   ++E+ +
Sbjct: 273 EQLATRILSEQSEISSSKIRRGKISEED---FEKLVDASQELQK 313


>gnl|CDD|236942 PRK11637, PRK11637, AmiB activator; Provisional.
          Length = 428

 Score = 33.1 bits (76), Expect = 0.40
 Identities = 27/96 (28%), Positives = 48/96 (50%), Gaps = 12/96 (12%)

Query: 125 KTRFECDIAKSELEIFLSTQSKETKKLADLEANLEKVQTTLTER-KTLCEELTTRVPAME 183
           +TR E    K+ELE     QS++   L + +A  +K++    ER KTL          +E
Sbjct: 177 QTREELAAQKAELE---EKQSQQKTLLYEQQAQQQKLEQARNERKKTLT--------GLE 225

Query: 184 SEIAESRARLADLTREEAKLLDQVEKLAREVSEKRE 219
           S + + + +L++L   E++L D + +  RE   + E
Sbjct: 226 SSLQKDQQQLSELRANESRLRDSIARAEREAKARAE 261


>gnl|CDD|236042 PRK07525, PRK07525, sulfoacetaldehyde acetyltransferase; Validated.
          Length = 588

 Score = 33.0 bits (76), Expect = 0.41
 Identities = 13/38 (34%), Positives = 19/38 (50%)

Query: 263 AVSTACGALNYIVTETVEAGEAVIAAVKRQNVGRVNVI 300
            ++ A GA   +V    E G A+  A+  QN G+  VI
Sbjct: 520 GIAEAMGAEGVVVDTQEELGPALKRAIDAQNEGKTTVI 557


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
           and chromosome partitioning].
          Length = 420

 Score = 32.4 bits (74), Expect = 0.58
 Identities = 24/141 (17%), Positives = 46/141 (32%), Gaps = 26/141 (18%)

Query: 73  QQMEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSVFSAKTRFECDI 132
            +    L A L+ +     E+ + ++EL T+  + +                        
Sbjct: 160 AERIDALKATLKQLAAVRAEIAAEQAELTTLLSEQRAQ---------------------- 197

Query: 133 AKSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAESRAR 192
            +++L        +  K LA L + L   Q  L E +     L   + + E+  A++R  
Sbjct: 198 -QAKLAQ---LLEERKKTLAQLNSELSADQKKLEELRANESRLKNEIASAEAAAAKAREA 253

Query: 193 LADLTREEAKLLDQVEKLARE 213
            A      A+      K   E
Sbjct: 254 AAAAEAAAARARAAEAKRTGE 274


>gnl|CDD|149160 pfam07926, TPR_MLP1_2, TPR/MLP1/MLP2-like protein.  The sequences
           featured in this family are similar to a region of human
           TPR protein and to yeast myosin-like proteins 1 (MLP1)
           and 2 (MLP2). These proteins share a number of features;
           for example, they all have coiled-coil regions and all
           three are associated with nuclear pores. TPR is thought
           to be a component of nuclear pore complex- attached
           intra-nuclear filaments, and is implicated in nuclear
           protein import. Moreover, its N-terminal region is
           involved in the activation of oncogenic kinases,
           possibly by mediating the dimerisation of kinase domains
           or by targeting these kinases to the nuclear pore
           complex. MLP1 and MLP2 are involved in the process of
           telomere length regulation, where they are thought to
           interact with proteins such as Tel1p and modulate their
           activity.
          Length = 132

 Score = 30.7 bits (70), Expect = 0.67
 Identities = 24/72 (33%), Positives = 36/72 (50%), Gaps = 4/72 (5%)

Query: 141 LSTQSKETKK-LADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAESRARLADLTRE 199
           L  Q  E KK +A L+A  E  Q  L+E +   EE       +E E++E   R+ +L  +
Sbjct: 64  LRKQLNELKKEIAQLKAEAESAQAELSEAEESWEEQKKM---LEDELSELEKRIEELNEQ 120

Query: 200 EAKLLDQVEKLA 211
              L DQ+E L+
Sbjct: 121 NKLLHDQIELLS 132


>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein.  Members of this
           protein family have a signal peptide, a strongly
           conserved SH3 domain, a variable region, and then a
           C-terminal hydrophobic transmembrane alpha helix region.
          Length = 198

 Score = 31.5 bits (72), Expect = 0.77
 Identities = 19/85 (22%), Positives = 42/85 (49%), Gaps = 3/85 (3%)

Query: 158 LEKVQTTLTERKTLCEELTTRVPAMESEIAESRARLADLTREEAKLLDQVEKLAREVSEK 217
           L ++Q  L E +    EL  ++  ++ E  E +  L+ L  E  +L  ++ ++ +++S  
Sbjct: 68  LPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARI-KQLSA- 125

Query: 218 RESMQTSRSNNRLIDFVMQLKSENR 242
             +++    N  L + + +LK EN 
Sbjct: 126 -NAIELDEENRELREELAELKQENE 149



 Score = 28.8 bits (65), Expect = 5.0
 Identities = 16/72 (22%), Positives = 36/72 (50%), Gaps = 4/72 (5%)

Query: 175 LTTRVPAMESEIAESRARLADLTREEAKL---LDQVEKLAREVSEKRESMQTSRSNNRLI 231
              R+P ++ E+AE +  LA+L  + A+L     ++++    +  + E +Q   +  + +
Sbjct: 64  ARERLPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQL 123

Query: 232 D-FVMQLKSENR 242
               ++L  ENR
Sbjct: 124 SANAIELDEENR 135


>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family.  This family of proteins includes
           MND1 from S. cerevisiae. The mnd1 protein forms a
           complex with hop2 to promote homologous chromosome
           pairing and meiotic double-strand break repair.
          Length = 188

 Score = 31.4 bits (72), Expect = 0.79
 Identities = 20/83 (24%), Positives = 45/83 (54%), Gaps = 8/83 (9%)

Query: 167 ERKTLCEELTTRVPAMESEIAESRARLADL------TREEAKLLDQVEKLAREVSEKRES 220
           + KT  E+L   +  ++  IAE +A++  L      T E  +LL+++++L +E+ + +  
Sbjct: 66  KLKTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLEELKQLEKELKKLKAE 125

Query: 221 MQTSRSNNRLIDFVMQLKSENRV 243
           ++    N+   + + +LK E +V
Sbjct: 126 LEKYEKND--PERIEKLKEETKV 146


>gnl|CDD|233065 TIGR00634, recN, DNA repair protein RecN.  All proteins in this
           family for which functions are known are ATP binding
           proteins involved in the initiation of recombination and
           recombinational repair [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 563

 Score = 32.0 bits (73), Expect = 0.82
 Identities = 19/99 (19%), Positives = 35/99 (35%), Gaps = 14/99 (14%)

Query: 158 LEKVQTTLTERKTLCEELTTRVPAMESEIAESRARLADLT---REEAKLLDQVEKLAREV 214
           L +++  L + K L  +    V  +     + +  L  L         L ++V+KL  E+
Sbjct: 303 LNEIEERLAQIKRLKRKYGASVEEVLEYAEKIKEELDQLDDSDESLEALEEEVDKLEEEL 362

Query: 215 SEKRESMQTSR--SNNRLIDFVMQLKSENRVSGILGRLG 251
            +   ++   R  +  RL           RV   L  L 
Sbjct: 363 DKAAVALSLIRRKAAERL---------AKRVEQELKALA 392


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 31.6 bits (72), Expect = 1.1
 Identities = 53/239 (22%), Positives = 97/239 (40%), Gaps = 32/239 (13%)

Query: 7   KYDEERTEKLTRVQLVETDLKALEPELRKAVNFLELENCVQRKH---NEIYQYERYVNMK 63
           +Y  E       ++ +E   + L  ELR+    L+ E+ + +      ++ + E  +   
Sbjct: 456 EYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLKELEEKLKKY 515

Query: 64  NLGEHETKVQQMEQ------ELTANLESIKKCTDEMESAKSELKTVEKKGKE-----SNV 112
           NL E E K ++ E+      +L   ++S+KK  +++E  K +L  +EKK  E     + +
Sbjct: 516 NLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAEL 575

Query: 113 IGSKFRWSVFSAKTRFECDI---------------AKSELEIFLSTQSKETKKLADLEAN 157
           +  +     F +    E  +               A+ ELE       K  ++L      
Sbjct: 576 LK-ELEELGFESVEELEERLKELEPFYNEYLELKDAEKELEREEKELKKLEEELDKAFEE 634

Query: 158 LEKVQTTLTERKTLCEELTTRVPAMESEIAESRARLADLTREEAKLLDQVEKLAREVSE 216
           L + +  L E +   EEL       E E  E R    +L+RE A L  ++E+L +   E
Sbjct: 635 LAETEKRLEELRKELEEL--EKKYSEEEYEELREEYLELSRELAGLRAELEELEKRREE 691


>gnl|CDD|223434 COG0357, GidB, Predicted S-adenosylmethionine-dependent
           methyltransferase involved in bacterial cell division
           [Cell envelope biogenesis, outer membrane].
          Length = 215

 Score = 31.1 bits (71), Expect = 1.2
 Identities = 15/54 (27%), Positives = 27/54 (50%), Gaps = 3/54 (5%)

Query: 226 SNNRLIDFVMQLKSE---NRVSGILGRLGDLGGIDQKYDIAVSTACGALNYIVT 276
           S  + I F+ ++K E     V  + GR  + G   ++YD+  S A  +LN ++ 
Sbjct: 99  SLGKKIAFLREVKKELGLENVEIVHGRAEEFGQEKKQYDVVTSRAVASLNVLLE 152


>gnl|CDD|192987 pfam12325, TMF_TATA_bd, TATA element modulatory factor 1 TATA
           binding.  This is the C-terminal conserved coiled coil
           region of a family of TATA element modulatory factor 1
           proteins conserved in eukaryotes. The proteins bind to
           the TATA element of some RNA polymerase II promoters and
           repress their activity. by competing with the binding of
           TATA binding protein. TMF1_TATA_bd is the most conserved
           part of the TMFs. TMFs are evolutionarily conserved
           golgins that bind Rab6, a ubiquitous ras-like
           GTP-binding Golgi protein, and contribute to Golgi
           organisation in animal and plant cells. The Rab6-binding
           domain appears to be the same region as this C-terminal
           family.
          Length = 121

 Score = 29.8 bits (68), Expect = 1.4
 Identities = 16/53 (30%), Positives = 26/53 (49%)

Query: 170 TLCEELTTRVPAMESEIAESRARLADLTREEAKLLDQVEKLAREVSEKRESMQ 222
            L E L++ +  +E E+A  +  LA L  E  +   ++ KL  E  E RE  +
Sbjct: 17  QLVERLSSTLRRLEGELASLKDELARLEAERDEARQEIVKLTEENEELRELKK 69


>gnl|CDD|236277 PRK08493, PRK08493, NADH dehydrogenase subunit G; Validated.
          Length = 819

 Score = 31.2 bits (71), Expect = 1.5
 Identities = 19/79 (24%), Positives = 28/79 (35%), Gaps = 2/79 (2%)

Query: 133 AKSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAE-SRA 191
               LE F  +  KE      LE   EKV     +     EE+   VP    +I E    
Sbjct: 448 ILKSLEEFKQSIVKEAALSI-LEEIREKVLEQAEQGCENQEEVKKEVPKKVKKIPEVDTY 506

Query: 192 RLADLTREEAKLLDQVEKL 210
            L +      +  +++E L
Sbjct: 507 LLLEELGINEETYEKLEAL 525


>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552).  This
           presumed domain is functionally uncharacterized. This
           domain is found in bacteria, archaea and eukaryotes.
           This domain is about 200 amino acids in length. This
           domain is found associated with pfam00013, pfam01966.
           This domain has a single completely conserved residue A
           that may be functionally important.
          Length = 201

 Score = 30.2 bits (69), Expect = 1.7
 Identities = 20/71 (28%), Positives = 35/71 (49%), Gaps = 2/71 (2%)

Query: 145 SKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAESRARLADLTREEAK-- 202
            ++ + L   E +LE+ +  L  R+   EE    +  +  E  +   R++ LT EEAK  
Sbjct: 95  DRKDESLEKKEESLEEKEKELAARQQQLEEKEEELEELIEEQQQELERISGLTAEEAKEI 154

Query: 203 LLDQVEKLARE 213
           LL++VE   + 
Sbjct: 155 LLEEVEAELKH 165


>gnl|CDD|183048 PRK11239, PRK11239, hypothetical protein; Provisional.
          Length = 215

 Score = 30.5 bits (69), Expect = 1.7
 Identities = 11/24 (45%), Positives = 15/24 (62%)

Query: 173 EELTTRVPAMESEIAESRARLADL 196
            +L  RV A+E E+AE + RL  L
Sbjct: 186 GDLQARVEALEIEVAELKQRLDSL 209


>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
           (CALCOCO1) like.  Proteins found in this family are
           similar to the coiled-coil transcriptional coactivator
           protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
           This protein binds to a highly conserved N-terminal
           domain of p160 coactivators, such as GRIP1, and thus
           enhances transcriptional activation by a number of
           nuclear receptors. CALCOCO1 has a central coiled-coil
           region with three leucine zipper motifs, which is
           required for its interaction with GRIP1 and may regulate
           the autonomous transcriptional activation activity of
           the C-terminal region.
          Length = 546

 Score = 30.7 bits (69), Expect = 2.1
 Identities = 44/244 (18%), Positives = 96/244 (39%), Gaps = 36/244 (14%)

Query: 9   DEERTEKLTRVQLVETDLKALEPELRKAVNFLELENCVQRKHNEIYQYERYVNMKNLGEH 68
           + + ++  +R++ +ET L+       K     E     Q     +   ER   +    E+
Sbjct: 163 EGDVSDMRSRIEQLETALRHST---EKTEELEEQHKEAQSSSESMSA-ERNALLAQRAEN 218

Query: 69  ETKVQQMEQELTA----------NLESIKKCTDEMESAKSELK------TVEKKGKESNV 112
           + ++ ++EQ++             LE  +    E+E  K ELK      T++++ +E+  
Sbjct: 219 QQRILELEQDIQTLTQKKQENDRVLEGTQDIEAELERMKGELKQRLKKMTIQRRDEETER 278

Query: 113 IGSKFRWSVFSAKTRFECDIAKSELEIFLSTQSKET---KKLADLEANLEKVQTTLTERK 169
           I  +          R       +  E   S+Q K     ++LA L +  +     L + +
Sbjct: 279 IDLQLENEQLHEDLR-------TLQERLESSQQKAGLLGEELASLGSLRDHTIAELHQSR 331

Query: 170 TLCEELTTRVPAMESEIAE-----SRARLADLTREEAKLLDQVEKLAREVSEKRESMQTS 224
               +++ ++  +   + E     ++ R       EA   D+++KL+ E+ +  E +Q  
Sbjct: 332 LESAQMSLQLSQLNLALKEGQSQWAQERETLRQSAEADK-DRIQKLSAELLKLEEWLQEE 390

Query: 225 RSNN 228
           RS  
Sbjct: 391 RSQR 394


>gnl|CDD|114591 pfam05873, Mt_ATP-synt_D, ATP synthase D chain, mitochondrial
           (ATP5H).  This family consists of several ATP synthase D
           chain, mitochondrial (ATP5H) proteins. Subunit d has no
           extensive hydrophobic sequences, and is not apparently
           related to any subunit described in the simpler ATP
           synthases in bacteria and chloroplasts.
          Length = 160

 Score = 29.8 bits (67), Expect = 2.2
 Identities = 15/68 (22%), Positives = 28/68 (41%), Gaps = 12/68 (17%)

Query: 33  LRKAVNFLELENCVQRKHNEI---YQYERY---------VNMKNLGEHETKVQQMEQELT 80
            +K V    L +  ++K+  +   Y  ++Y          ++K + E++TK     QE  
Sbjct: 56  YKKNVPKAGLVDSFEKKYEALKVPYPEDKYTAQVDAEEKEDVKRIAEYKTKSADRIQEYE 115

Query: 81  ANLESIKK 88
             LE  K 
Sbjct: 116 KELEKWKN 123


>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
            This is a family of proteins that form part of the CAZ
           (cytomatrix at the active zone) complex which is
           involved in determining the site of synaptic vesicle
           fusion. The C-terminus is a PDZ-binding motif that binds
           directly to RIM (a small G protein Rab-3A effector). The
           family also contains four coiled-coil domains.
          Length = 774

 Score = 30.8 bits (69), Expect = 2.4
 Identities = 41/222 (18%), Positives = 94/222 (42%), Gaps = 4/222 (1%)

Query: 7   KYDEERTEKLTRVQLVETDLKALEPELRKAVNFLELENC-VQRKHNEIYQYERYV-NMKN 64
           K  E+ T       +++T++ AL  EL +  N L  +   +Q    E   Y   + +M++
Sbjct: 326 KLKEDLTRAEQEKAILQTEVDALRYELERKHNTLTKKTASLQAAQEEKATYAGEIEDMRD 385

Query: 65  LGEH-ETKVQQMEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSVFS 123
             E  E K++ +++++    E+ ++    ++  K  L++++        +    +     
Sbjct: 386 RYEKTERKLRVLQKKIENLQETFRRKERRLKEEKERLRSLQTDTNTDTALEKLEKALAEK 445

Query: 124 AKTRFECDIAKSELEIF-LSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAM 182
            +        +   E +         K+  DL+  ++ +Q  L+ER+   E L   V  +
Sbjct: 446 ERIIERLKEQRDRDERYEQEEFETYKKEFEDLKEEVQNLQLKLSERELQLELLKEEVSKL 505

Query: 183 ESEIAESRARLADLTREEAKLLDQVEKLAREVSEKRESMQTS 224
            S   + R+ L     E  K+ ++ EKL +E+   R + +++
Sbjct: 506 ASNQLKQRSDLERAHIELEKIREKHEKLEKELKRLRANPESA 547


>gnl|CDD|178867 PRK00106, PRK00106, hypothetical protein; Provisional.
          Length = 535

 Score = 30.6 bits (69), Expect = 2.5
 Identities = 18/79 (22%), Positives = 40/79 (50%), Gaps = 3/79 (3%)

Query: 142 STQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAESRARLADLTREEA 201
           ++  ++ + L+  E  LE  + +LT++    +E   +V  +E +      R+A L++ EA
Sbjct: 111 TSLDRKDENLSSKEKTLESKEQSLTDKSKHIDEREEQVEKLEEQKKAELERVAALSQAEA 170

Query: 202 K---LLDQVEKLAREVSEK 217
           +   L +   KL  E++ +
Sbjct: 171 REIILAETENKLTHEIATR 189


>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
           potential nuclease [General function prediction only].
          Length = 290

 Score = 30.3 bits (68), Expect = 2.6
 Identities = 20/71 (28%), Positives = 39/71 (54%)

Query: 149 KKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAESRARLADLTREEAKLLDQVE 208
            +  DL+ + E+++  L E +   EEL   +  +E+E  E + RL  L  E ++L + ++
Sbjct: 128 PEYMDLKEDYEELKEKLEELQKEKEELLKELEELEAEYEEVQERLKRLEVENSRLEEMLK 187

Query: 209 KLAREVSEKRE 219
           KL  EV + ++
Sbjct: 188 KLPGEVYDLKK 198



 Score = 29.1 bits (65), Expect = 5.6
 Identities = 42/217 (19%), Positives = 89/217 (41%), Gaps = 22/217 (10%)

Query: 4   LYGKYDEERTEKLTRVQLVET----DLKALEPELRKAVNFLELENCVQRKHNEIYQYER- 58
           L  K+ EE    +  ++ ++     D++ ++ EL +     EL+N + R   +  Q    
Sbjct: 74  LAEKFFEELRGMVGHIERMKIPIGHDVEHIDVELVRK----ELKNALVRAGLKTLQRVPE 129

Query: 59  YVNMKNLGEH-ETKVQQMEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKF 117
           Y+++K   E  + K++++++E    L+ +++   E E  +  LK +E +      +  K 
Sbjct: 130 YMDLKEDYEELKEKLEELQKEKEELLKELEELEAEYEEVQERLKRLEVENSRLEEMLKKL 189

Query: 118 RWSVFSAKTRFECDIAKSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTT 177
              V+  K R++        E+    +  E + ++DL      +     E     +    
Sbjct: 190 PGEVYDLKKRWD--------ELEPGVELPEEELISDLVKETLNLAPKDIEG----QGYIY 237

Query: 178 RVPAMESEIAESRARLADLTREEAKLLDQVEKLAREV 214
                E EI      +A  +RE+A    ++ K A E 
Sbjct: 238 AEDEKEVEILLGTVYIAAPSREDAVEELEIIKEAIEE 274


>gnl|CDD|219780 pfam08286, Spc24, Spc24 subunit of Ndc80.  Spc24 is a component of
           the evolutionarily conserved kinetochore-associated
           Ndc80 complex and is involved in chromosome segregation.
          Length = 117

 Score = 28.8 bits (65), Expect = 2.7
 Identities = 13/27 (48%), Positives = 17/27 (62%)

Query: 149 KKLADLEANLEKVQTTLTERKTLCEEL 175
           K+L DLE+ L  +Q+ L E K   EEL
Sbjct: 13  KELNDLESELNNLQSELEELKERLEEL 39


>gnl|CDD|218079 pfam04420, CHD5, CHD5-like protein.  Members of this family are
           probably coiled-coil proteins that are similar to the
           CHD5 (Congenital heart disease 5) protein. In
           Saccharomyces cerevisiae this protein localises to the
           ER and is thought to play a homeostatic role.
          Length = 161

 Score = 29.6 bits (67), Expect = 2.7
 Identities = 13/41 (31%), Positives = 24/41 (58%)

Query: 191 ARLADLTREEAKLLDQVEKLAREVSEKRESMQTSRSNNRLI 231
           A+ A L R+  KL D++EKL +E++ ++   +      RL+
Sbjct: 66  AKWAKLNRKLDKLTDELEKLKKELTAQKSKFKWVLKKLRLL 106


>gnl|CDD|237625 PRK14147, PRK14147, heat shock protein GrpE; Provisional.
          Length = 172

 Score = 29.5 bits (66), Expect = 2.8
 Identities = 18/55 (32%), Positives = 30/55 (54%), Gaps = 3/55 (5%)

Query: 165 LTERKTLCEELTTRVPAMESEIAESRARLADLTREEAKLLDQVEKLAREVSEKRE 219
           L +     + L   V ++ SEIA  +A   D  RE A L +Q +++AR+V + R+
Sbjct: 13  LAQNPPETDPLKAEVESLRSEIALVKA---DALRERADLENQRKRIARDVEQARK 64


>gnl|CDD|221489 pfam12252, SidE, Dot/Icm substrate protein.  This family of proteins
            is found in bacteria. Proteins in this family are
            typically between 397 and 1543 amino acids in length.
            This family is the SidE protein in the Dot/Icm pathway of
            Legionella pneumophila bacteria. There is little
            literature describing the family.
          Length = 1443

 Score = 30.6 bits (69), Expect = 2.9
 Identities = 28/140 (20%), Positives = 55/140 (39%), Gaps = 34/140 (24%)

Query: 71   KVQQMEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSVFSAKTRFEC 130
            ++ ++E+   ANL+ +KK    + + + E+  +  +         K R    + K  F  
Sbjct: 1115 RITELEKAKLANLDVVKKAISNLNNLQQEVTLLRNE---------KIRMHTGTDKVDFS- 1164

Query: 131  DIAKSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAESR 190
            DI K                   LE  L+ + T L +      E+T ++ A+E E  ++ 
Sbjct: 1165 DIEK-------------------LEKQLQVIDTKLAD--AYLLEVTKQISALEKEKPKN- 1202

Query: 191  ARLADLTREEAKLLDQVEKL 210
               ++L  + AK  D    +
Sbjct: 1203 --QSELKSKIAKFFDTTADI 1220


>gnl|CDD|190201 pfam02058, Guanylin, Guanylin precursor. 
          Length = 87

 Score = 28.1 bits (63), Expect = 2.9
 Identities = 8/28 (28%), Positives = 12/28 (42%)

Query: 72 VQQMEQELTANLESIKKCTDEMESAKSE 99
          V   +   + +LES+KK  D  E     
Sbjct: 1  VTVQDGGFSFSLESVKKLKDLQEQWAQS 28


>gnl|CDD|181619 PRK09039, PRK09039, hypothetical protein; Validated.
          Length = 343

 Score = 29.9 bits (68), Expect = 3.5
 Identities = 23/96 (23%), Positives = 39/96 (40%), Gaps = 13/96 (13%)

Query: 151 LADLEANLEKVQTTLTERKTLCEELTTRVPAMESEI-------AESRARLADLTREEAKL 203
           L+  EA   ++Q  L E          R   +  E+       A + A++  L ++ A L
Sbjct: 90  LSAAEAERSRLQALLAELAGAGAAAEGRAGELAQELDSEKQVSARALAQVELLNQQIAAL 149

Query: 204 LDQVEKL--AREVSEKRESMQTSRSNNRLIDFVMQL 237
             Q+  L  A + SEKR+      S  ++ D   +L
Sbjct: 150 RRQLAALEAALDASEKRD----RESQAKIADLGRRL 181



 Score = 28.8 bits (65), Expect = 8.1
 Identities = 15/37 (40%), Positives = 21/37 (56%), Gaps = 1/37 (2%)

Query: 181 AMESEIAESRARLADL-TREEAKLLDQVEKLAREVSE 216
           A E    ES+A++ADL  R    L  +V++L R  SE
Sbjct: 162 ASEKRDRESQAKIADLGRRLNVALAQRVQELNRYRSE 198


>gnl|CDD|235582 PRK05729, valS, valyl-tRNA synthetase; Reviewed.
          Length = 874

 Score = 30.1 bits (69), Expect = 3.6
 Identities = 17/65 (26%), Positives = 33/65 (50%), Gaps = 7/65 (10%)

Query: 148 TKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAESRARLADLTREEAKLLDQV 207
            K+LA LE  +E+V+  L+    +      + P  E  + + R +LA+   + AKL +++
Sbjct: 817 EKELAKLEKEIERVEKKLSNEGFV-----AKAP--EEVVEKEREKLAEYEEKLAKLKERL 869

Query: 208 EKLAR 212
            +L  
Sbjct: 870 ARLKA 874


>gnl|CDD|132835 cd00147, cPLA2_like, Cytosolic phospholipase A2, catalytic domain;
           hydrolyses arachidonyl phospholipids.  Catalytic domain
           of cytosolic phospholipase A2 (PLA2; EC 3.1.1.4)
           hydrolyzes the sn-2-acyl ester bond of phospholipids to
           release arachidonic acid. At the active site, cPLA2
           contains a serine nucleophile through which the
           catalytic mechanism is initiated. The active site is
           partially covered by a solvent-accessible flexible lid.
           cPLA2 displays interfacial activation as it exists in
           both "closed lid" and "open lid" forms. Movement of the
           cPLA2 lid possibly exposes a greater hydrophobic surface
           and the active site. cPLA2 belongs to the alpha-beta
           hydrolase family which is identified by a characteristic
           nucleophile elbow with a consensus sequence of
           Sm-X-Nu-Sm (Sm = small residue, X = any residue and Nu =
           nucleophile). Calcium is required for cPLA2 to bind with
           membranes or phospholipids. Group IV cPLA2 includes six
           intercellular enzymes: cPLA2alpha, cPLA2beta,
           cPLA2gamma, cPLA2delta, cPLA2epsilon, and cPLA2zeta.
          Length = 438

 Score = 29.9 bits (68), Expect = 3.6
 Identities = 29/113 (25%), Positives = 46/113 (40%), Gaps = 22/113 (19%)

Query: 427 IGYSGGGYR-MVTLEGAIIEPSGTMSGG---GSNPIRGLMGRKATVST-----DTSLVKD 477
           I  SGGGYR M    GA+        GG       + GL G    +++     D S  KD
Sbjct: 46  ILGSGGGYRAMTGGAGAL---KALDEGGLLDCVTYLSGLSGSTWLMASLYSNPDWS-QKD 101

Query: 478 LEVKEKRLASLETELRILSQQKMEVET--QLNCTGNELKYKKQE-YDTCLIDV 527
           L+   + L       R + +  + + +  +L     EL+ KK+  ++  L D 
Sbjct: 102 LDEAIEWL------KRHVIKSPLLLFSPERLKYYAKELEEKKKAGFNVSLTDF 148


>gnl|CDD|240354 PTZ00310, PTZ00310, AMP deaminase; Provisional.
          Length = 1453

 Score = 30.2 bits (68), Expect = 3.7
 Identities = 36/154 (23%), Positives = 57/154 (37%), Gaps = 24/154 (15%)

Query: 163  TTLTERKTLCEE----------LTTRVPAME----------SEIAESRARLADLTREEAK 202
             TL E   +CE+          L+T   AME            I   RA++  L R++  
Sbjct: 1295 KTLEEELAICEKIVDVRAETVLLSTHDQAMEVMLREMETTNERIVHLRAQVDSLRRQQRS 1354

Query: 203  LLDQVEKLAREVSEKRESMQTSRSNNRLIDF-VMQLKSENRVSGILGRLGDLGGIDQKYD 261
            L   VE+L  E   ++E  +     N  I   V +L+   +  G+    G  G  +++  
Sbjct: 1355 L---VERLTEEGIRRQEVSEQVAGENNSIALRVNRLRERLQSEGLTVGGGLYGRSNEEES 1411

Query: 262  IAVSTACGALNYIVTETVEAGEAVIAAVKRQNVG 295
             A ST  G     +         V+ AV   +V 
Sbjct: 1412 EASSTQQGETVKRLLRWKPMPPDVMRAVTVASVP 1445


>gnl|CDD|226055 COG3524, KpsE, Capsule polysaccharide export protein [Cell envelope
           biogenesis, outer membrane].
          Length = 372

 Score = 29.8 bits (67), Expect = 3.9
 Identities = 25/121 (20%), Positives = 54/121 (44%), Gaps = 18/121 (14%)

Query: 76  EQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFR--WSVFSAKTRFECDIA 133
           + E   N  S +   D +  A+ E++  E++ K+++   + +R    VF  K       A
Sbjct: 164 QSEKLINQLSERARRDTVRFAEEEVQKAEERVKKASNDLTDYRIKNGVFDPK-------A 216

Query: 134 KSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAESRARL 193
           ++E+++ L         ++ LE  L  +Q  L   K++      ++P +++ I   R +L
Sbjct: 217 QAEVQMSL---------VSKLEDELIVIQAQLDTVKSVMNPENPQIPGLKARIESLRKQL 267

Query: 194 A 194
            
Sbjct: 268 L 268


>gnl|CDD|215217 PLN02387, PLN02387, long-chain-fatty-acid-CoA ligase family
           protein.
          Length = 696

 Score = 29.7 bits (67), Expect = 4.2
 Identities = 20/69 (28%), Positives = 31/69 (44%), Gaps = 15/69 (21%)

Query: 145 SKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAESRARLADLTREEAKLL 204
           SK+ KKL D+ + LE V+               RV  M+ E  +S + L+  +       
Sbjct: 186 SKQLKKLIDISSQLETVK---------------RVIYMDDEGVDSDSSLSGSSNWTVSSF 230

Query: 205 DQVEKLARE 213
            +VEKL +E
Sbjct: 231 SEVEKLGKE 239


>gnl|CDD|216919 pfam02181, FH2, Formin Homology 2 Domain. 
          Length = 372

 Score = 29.6 bits (67), Expect = 4.3
 Identities = 16/88 (18%), Positives = 32/88 (36%)

Query: 151 LADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAESRARLADLTREEAKLLDQVEKL 210
           + +LE  L+K++  L             V  M+  + E+  +L  L     + ++  ++L
Sbjct: 285 VKELEKGLKKLERELELSALDEHPDDKFVEKMKEFLEEAEEKLDKLESLLKEAMELFKEL 344

Query: 211 AREVSEKRESMQTSRSNNRLIDFVMQLK 238
                E  +          L DF+   K
Sbjct: 345 TEYFGEDPKETSPEEFFKILRDFLRMFK 372


>gnl|CDD|238103 cd00176, SPEC, Spectrin repeats, found in several proteins involved
           in cytoskeletal structure; family members include
           spectrin, alpha-actinin and dystrophin; the spectrin
           repeat forms a three helix bundle with the second helix
           interrupted by proline in some sequences; the repeats
           are independent folding units; tandem repeats are found
           in differing numbers and arrange in an antiparallel
           manner to form dimers; the repeats are defined by a
           characteristic tryptophan (W) residue in helix A and a
           leucine (L) at the carboxyl end of helix C and separated
           by a linker of 5 residues; two copies of the repeat are
           present here.
          Length = 213

 Score = 29.0 bits (65), Expect = 5.2
 Identities = 19/86 (22%), Positives = 36/86 (41%), Gaps = 5/86 (5%)

Query: 149 KKLADLEANLEKVQTTLTERKTL-----CEELTTRVPAMESEIAESRARLADLTREEAKL 203
           +   +LEA L + +  L+           E L  +  A+E+E+A    R+  L     +L
Sbjct: 7   RDADELEAWLSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHEERVEALNELGEQL 66

Query: 204 LDQVEKLAREVSEKRESMQTSRSNNR 229
           +++    A E+ E+ E +       R
Sbjct: 67  IEEGHPDAEEIQERLEELNQRWEELR 92


>gnl|CDD|119001 pfam10481, Cenp-F_N, Cenp-F N-terminal domain.  Mitosin or
           centromere-associated protein-F (Cenp-F) is found bound
           across the centromere as one of the proteins of the
           outer layer of the kinetochore. Most of the
           kinetochore/centromere functions appear to depend upon
           binding of the C-terminal par to f the molecule, whereas
           the N-terminal part, here, may be a cytoplasmic player
           in controlling the function of microtubules and dynein.
          Length = 288

 Score = 29.0 bits (64), Expect = 5.4
 Identities = 16/82 (19%), Positives = 39/82 (47%)

Query: 145 SKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAESRARLADLTREEAKLL 204
           ++  +K+ +LE  L+K++    +R+   + L   +   + ++   +   A+L RE  +L+
Sbjct: 14  TRALQKIQELEGQLDKLKKEKQQRQFQLDSLEAALQKQKQKVENEKTEGANLKRENQRLM 73

Query: 205 DQVEKLAREVSEKRESMQTSRS 226
           +  E L +   +    +Q   S
Sbjct: 74  EICEHLEKTKQKISHELQVKES 95


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 29.3 bits (66), Expect = 5.5
 Identities = 17/87 (19%), Positives = 40/87 (45%), Gaps = 10/87 (11%)

Query: 143 TQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAESRARLADLTREEAK 202
           T+ +E +++   E  ++K++ T+   +    EL   +  ++ EI +  + L    RE   
Sbjct: 409 TEEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKREIEKLESELERFRREVRD 468

Query: 203 ----------LLDQVEKLAREVSEKRE 219
                        ++E+L +E+ EK++
Sbjct: 469 KVRKDREIRARDRRIERLEKELEEKKK 495



 Score = 28.5 bits (64), Expect = 9.6
 Identities = 30/154 (19%), Positives = 60/154 (38%), Gaps = 20/154 (12%)

Query: 67  EHETKVQQMEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSVFSAKT 126
           E E   ++  +E+T   + IKK  + +E  + E   ++++ +E      K    +   + 
Sbjct: 405 EKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKREIEKLESELERFRR 464

Query: 127 RFECDIAKSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEI 186
                + K              +++   +  +E+++  L E+K   EEL       E ++
Sbjct: 465 EVRDKVRKD-------------REIRARDRRIERLEKELEEKKKRVEEL-------ERKL 504

Query: 187 AESRARLADLTREEAKLLDQVEKLAREVSEKRES 220
           AE R         +   +  VEKL  E  E+ E 
Sbjct: 505 AELRKMRKLELSGKGTPVKVVEKLTLEAIEEAEE 538


>gnl|CDD|203593 pfam07200, Mod_r, Modifier of rudimentary (Mod(r)) protein.  This
           family represents a conserved region approximately 150
           residues long within a number of eukaryotic proteins
           that show homology with Drosophila melanogaster Modifier
           of rudimentary (Mod(r)) proteins. The N-terminal half of
           Mod(r) proteins is acidic, whereas the C-terminal half
           is basic, and both of these regions are represented in
           this family. Members of this family include the Vps37
           subunit of the endosomal sorting complex ESCRT-I, a
           complex involved in recruiting transport machinery for
           protein sorting at the multivesicular body (MVB). The
           yeast ESCRT-I complex consists of three proteins (Vps23,
           Vps28 and Vps37). The mammalian homologue of Vps37
           interacts with Tsg101 (Pfam: PF05743) through its mod(r)
           domain and its function is essential for lysosomal
           sorting of EGF receptors.
          Length = 150

 Score = 28.4 bits (64), Expect = 5.8
 Identities = 13/63 (20%), Positives = 26/63 (41%)

Query: 158 LEKVQTTLTERKTLCEELTTRVPAMESEIAESRARLADLTREEAKLLDQVEKLAREVSEK 217
           ++ +           E L       E ++ E +A+L D   E  KL ++ E+  +++ E 
Sbjct: 29  VQDLNLEKDALLAQNESLAEENLEKEPQLEELKAQLRDKYEELKKLEEKYEEKKQQLDEL 88

Query: 218 RES 220
             S
Sbjct: 89  SSS 91


>gnl|CDD|227251 COG4914, COG4914, Predicted nucleotidyltransferase [General
           function prediction only].
          Length = 190

 Score = 28.6 bits (64), Expect = 5.9
 Identities = 18/88 (20%), Positives = 31/88 (35%), Gaps = 3/88 (3%)

Query: 309 HSQCYDKYRTPENVPRLIDLIQVQDEKIRLAFYFATRETLVAQD-LNQAKRIGYSGGGYR 367
               +D   T    PRL     V+ E +R+       +  V  + L  AK +   G   +
Sbjct: 66  EENGWDMGTTDHGTPRLE--AIVEGEAVRVDLLENILDFYVPPELLEDAKEVDIDGIKVK 123

Query: 368 MVTLEGAIIEPSETLVAQDLNQAKRIGY 395
            + LE  I   ++    +D    K +  
Sbjct: 124 SIGLEDLIALKAKAARKEDEEFLKEVAR 151


>gnl|CDD|185292 PRK15394, PRK15394,
           4-deoxy-4-formamido-L-arabinose-phosphoundecaprenol
           deformylase ArnD; Provisional.
          Length = 296

 Score = 29.1 bits (66), Expect = 6.2
 Identities = 11/30 (36%), Positives = 19/30 (63%), Gaps = 3/30 (10%)

Query: 314 DKYR-TPENVPRLIDLIQVQDEKIRLAFYF 342
           D +R T E VPRL++++      I+ +F+F
Sbjct: 11  DTFRGTREGVPRLLEILSKHG--IQASFFF 38


>gnl|CDD|204194 pfam09304, Cortex-I_coil, Cortexillin I, coiled coil.  Members of
           this family are predominantly found in the
           actin-bundling protein Cortexillin I from Dictyostelium
           discoideum. They adopt a structure consisting of an
           18-heptad-repeat alpha-helical coiled-coil, and are a
           prerequisite for the assembly of Cortexillin I.
          Length = 107

 Score = 27.6 bits (61), Expect = 6.3
 Identities = 26/89 (29%), Positives = 47/89 (52%), Gaps = 2/89 (2%)

Query: 134 KSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAESRARL 193
           K E     +++++   KLA LE +LE  + +  E     ++L + + ++E++ AE   RL
Sbjct: 1   KEEKARLEASKNELANKLAGLENSLESEKASQEELAKQKDQLRSLLASLEAQKAEREQRL 60

Query: 194 ADLTREEAKLLDQVE--KLAREVSEKRES 220
            +L  +  +LL  +E  KLA+   E R S
Sbjct: 61  RELQAKIDELLKNLELEKLAKMELESRLS 89


>gnl|CDD|200948 pfam00038, Filament, Intermediate filament protein. 
          Length = 312

 Score = 29.1 bits (66), Expect = 6.4
 Identities = 18/71 (25%), Positives = 36/71 (50%), Gaps = 1/71 (1%)

Query: 154 LEANLEKVQTTLTERKTLCEELTTRVPAM-ESEIAESRARLADLTREEAKLLDQVEKLAR 212
           LE   ++++  + E +       +R+ ++ E EI E R +L +LT E A+L  +++ L  
Sbjct: 23  LEQQNKELEAKIEELRQKKSAEPSRLYSLYEQEIRELRKQLDELTNERARLQLEIDNLRL 82

Query: 213 EVSEKRESMQT 223
              + RE  + 
Sbjct: 83  AAEDFREKYED 93


>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein.  This domain is found in
           cell division proteins which are required for
           kinetochore-spindle association.
          Length = 321

 Score = 28.8 bits (65), Expect = 6.7
 Identities = 27/147 (18%), Positives = 54/147 (36%), Gaps = 13/147 (8%)

Query: 65  LGEHETKVQQMEQELTANLESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSVFSA 124
           L E+   +++ E+ L  +L  I     ++      LK      ++             + 
Sbjct: 152 LEENLEGMKRDEELLNKDLNLINSIKPKLRKKLQALKEEIASLRQLA--------DELNL 203

Query: 125 KTRFECDIAKSELEIFLSTQSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMES 184
               E + A+ EL       S++ K+L +L+  L+++   +        EL   +   E 
Sbjct: 204 CDPLELEKARQELRSLSVKISEKRKQLEELQQELQELTIAIEALTNKKSELLEEIAEAEK 263

Query: 185 EIAESRARLADLTREE-AKLLDQVEKL 210
              E R      + +E +KL  +V  L
Sbjct: 264 IREECR----GWSAKEISKLKAKVSLL 286


>gnl|CDD|217325 pfam03028, Dynein_heavy, Dynein heavy chain and region D6 of dynein
           motor.  This family represents the C-terminal region of
           dynein heavy chain. The chain also contains ATPase
           activity and microtubule binding ability and acts as a
           motor for the movement of organelles and vesicles along
           microtubules. Dynein is also involved in cilia and
           flagella movement. The dynein subunit consists of at
           least two heavy chains and a number of intermediate and
           light chains. The 380 kDa motor unit of dynein belongs
           to the AAA class of chaperone-like ATPases. The core of
           the 380 kDa motor unit contains a concatenated chain of
           six AAA modules, of which four correspond to the ATP
           binding sites with P-loop signatures described
           previously, and two are modules in which the P loop has
           been lost in evolution. This C-terminal domain carries
           the D6 region of the dynein motor where the P-loop has
           been lost in evolution but the general structure of a
           potential ATP binding site appears to be retained.
          Length = 706

 Score = 29.2 bits (66), Expect = 7.3
 Identities = 9/23 (39%), Positives = 15/23 (65%), Gaps = 1/23 (4%)

Query: 283 EAVIAAVKRQNVGRVNVIPLDKM 305
           ++ + A+  Q+V R N  PLDK+
Sbjct: 573 QSFLTAI-MQSVARKNKWPLDKL 594


>gnl|CDD|233080 TIGR00665, DnaB, replicative DNA helicase.  This model describes
           the helicase DnaB, a homohexameric protein required for
           DNA replication. The homohexamer can form a ring around
           a single strand of DNA near a replication fork. An
           intein of > 400 residues is found at a conserved
           location in DnaB of Synechocystis PCC6803, Rhodothermus
           marinus (both experimentally confirmed), and
           Mycobacterium tuberculosis. The intein removes itself by
           a self-splicing reaction. The seed alignment contains
           inteins so that the model built from the seed alignment
           will model a low cost at common intein insertion sites
           [DNA metabolism, DNA replication, recombination, and
           repair].
          Length = 434

 Score = 28.9 bits (66), Expect = 8.3
 Identities = 12/41 (29%), Positives = 20/41 (48%), Gaps = 1/41 (2%)

Query: 173 EELTTRVPAMESEIAESRARLADLTREE-AKLLDQVEKLAR 212
           E+L  R+ + ES +   + R   L+ E+  KL     KL+ 
Sbjct: 237 EQLVMRMLSSESRVDSQKLRTGKLSDEDWEKLTSAAGKLSE 277


>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
          Length = 562

 Score = 28.8 bits (65), Expect = 8.3
 Identities = 16/89 (17%), Positives = 37/89 (41%), Gaps = 4/89 (4%)

Query: 131 DIAKSELEI---FLSTQSKETK-KLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEI 186
           D  + +++     +  Q K+    +A  +   +++       K   EELT  +  +  +I
Sbjct: 191 DHIQQQIKTYNKNIEEQRKKNGENIARKQNKYDELVEEAKTIKAEIEELTDELLNLVMDI 250

Query: 187 AESRARLADLTREEAKLLDQVEKLAREVS 215
            +  A L  L    AK+  ++E+  + + 
Sbjct: 251 EDPSAALNKLNTAAAKIKSKIEQFQKVIK 279


>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein.  This domain is found
           in cell division proteins which are required for
           kinetochore-spindle association.
          Length = 312

 Score = 28.4 bits (64), Expect = 8.3
 Identities = 15/83 (18%), Positives = 40/83 (48%), Gaps = 4/83 (4%)

Query: 144 QSKETKKLADLEANLEKVQTTLTERKTLCEELTTRVPAMESEIAESRARLADLTREEAKL 203
           +  +  +L   +  L+K+   +  +    EEL   +  +ES+I +   + ++L  E A+ 
Sbjct: 199 EDCDPTELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNTEIAEA 258

Query: 204 LDQVEKL----AREVSEKRESMQ 222
             ++E+      +E+ + +E ++
Sbjct: 259 EKKLEQCRGFTFKEIEKLKEQLK 281


>gnl|CDD|114219 pfam05483, SCP-1, Synaptonemal complex protein 1 (SCP-1).
           Synaptonemal complex protein 1 (SCP-1) is the major
           component of the transverse filaments of the
           synaptonemal complex. Synaptonemal complexes are
           structures that are formed between homologous
           chromosomes during meiotic prophase.
          Length = 787

 Score = 28.9 bits (64), Expect = 9.5
 Identities = 39/152 (25%), Positives = 70/152 (46%), Gaps = 13/152 (8%)

Query: 83  LESIKKCTDEMESAKSELKTVEKKGKESNVIGSKFRWSVFSAKTRFECDIAKSELEIFLS 142
           LE +KK   E E    E K  EK  +E   +  K +  +F  + R E +I   E+++   
Sbjct: 407 LEELKKILAEDEKLLDEKKQFEKIAEE---LKGKEQELIFLLQAR-EKEIHDLEIQLTAI 462

Query: 143 TQSKE--TKKLADLEANLEKVQTTLTERKTLC-------EELTTRVPAMESEIAESRARL 193
             S+E   K++ DL+  LEK +    E    C       +ELT     M  E+ + +  +
Sbjct: 463 KTSEEHYLKEVEDLKTELEKEKLKNIELTAHCDKLLLENKELTQEASDMTLELKKHQEDI 522

Query: 194 ADLTREEAKLLDQVEKLAREVSEKRESMQTSR 225
            +  ++E ++L Q+E L  +    R+ +++ R
Sbjct: 523 INCKKQEERMLKQIENLEEKEMNLRDELESVR 554


>gnl|CDD|197664 smart00338, BRLZ, basic region leucin zipper. 
          Length = 65

 Score = 26.0 bits (58), Expect = 9.5
 Identities = 13/42 (30%), Positives = 22/42 (52%)

Query: 168 RKTLCEELTTRVPAMESEIAESRARLADLTREEAKLLDQVEK 209
           +K   EEL  +V  +E+E    +  +  L RE  KL  ++E+
Sbjct: 24  KKAEIEELERKVEQLEAENERLKKEIERLRRELEKLKSELEE 65


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.312    0.130    0.348 

Gapped
Lambda     K      H
   0.267   0.0771    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 25,717,717
Number of extensions: 2520742
Number of successful extensions: 3441
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3091
Number of HSP's successfully gapped: 336
Length of query: 531
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 430
Effective length of database: 6,457,848
Effective search space: 2776874640
Effective search space used: 2776874640
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 61 (27.2 bits)