RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy2152
         (205 letters)



>gnl|CDD|150420 pfam09744, Jnk-SapK_ap_N, JNK_SAPK-associated protein-1.  This is
           the N-terminal 200 residues of a set of proteins
           conserved from yeasts to humans. Most of the proteins in
           this entry have an RhoGEF pfam00621 domain at their
           C-terminal end.
          Length = 158

 Score =  171 bits (436), Expect = 6e-55
 Identities = 106/158 (67%), Positives = 126/158 (79%)

Query: 23  VQSLAGSIYQEFEKMITKYDEDVVKDLMPLVVNVLESLDLAFTENQEHEVELELLREDNE 82
           V  LA SIYQEFE++I  Y E+VVK+LMPLV+ VLE+LD   +ENQE EVELELLREDNE
Sbjct: 1   VYDLANSIYQEFERLIDNYGEEVVKNLMPLVIRVLENLDSLASENQELEVELELLREDNE 60

Query: 83  QLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRMLELKSKNASDHVY 142
           +L TQYEREK+LRK A+QKL + ED LE +KKEL  KIE LE  VR LELK+KN SD V 
Sbjct: 61  RLSTQYEREKELRKQAEQKLLEFEDELEQEKKELQKKIEDLEENVRQLELKAKNLSDQVS 120

Query: 143 RLEEKEQELKKEYSILHERYTELLKTHMEYIERTKLMM 180
           RLEE+E ELKKEY+ LHERYT+LLK ++EY+ER K  +
Sbjct: 121 RLEERETELKKEYNSLHERYTKLLKNYVEYVERQKDEI 158


>gnl|CDD|234342 TIGR03752, conj_TIGR03752, integrating conjugative element protein,
           PFL_4705 family.  Members of this protein family are
           found occasionally on plasmids such as the Pseudomonas
           putida toluene catabolic TOL plasmid pWWO_p085. Usually,
           however, they are found on the bacterial main chromosome
           in regions flanked by markers of conjugative transfer
           and/or transposition [Mobile and extrachromosomal
           element functions, Plasmid functions].
          Length = 472

 Score = 41.2 bits (97), Expect = 2e-04
 Identities = 21/66 (31%), Positives = 37/66 (56%), Gaps = 3/66 (4%)

Query: 66  ENQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLES 125
           E +E    L  L  +NE L  + ER ++  ++ DQ   QI+ A++ + +EL  +IE L+S
Sbjct: 67  EVKELRKRLAKLISENEALKAENERLQKREQSIDQ---QIQQAVQSETQELTKEIEQLKS 123

Query: 126 IVRMLE 131
             + L+
Sbjct: 124 ERQQLQ 129


>gnl|CDD|223520 COG0443, DnaK, Molecular chaperone [Posttranslational modification,
           protein turnover, chaperones].
          Length = 579

 Score = 38.1 bits (89), Expect = 0.002
 Identities = 20/82 (24%), Positives = 41/82 (50%), Gaps = 1/82 (1%)

Query: 48  DLMPLVVNVLESLDLAFTENQEHEVELELLREDNEQLVTQYEREKQ-LRKAADQKLFQIE 106
           + M        +LD  F E  E   E E L    E+ + +  +  +  ++  ++ +  +E
Sbjct: 489 ERMVEDAEANAALDKKFRELVEARNEAESLIYSLEKALKEIVKVSEEEKEKIEEAITDLE 548

Query: 107 DALEDDKKELNSKIESLESIVR 128
           +ALE +K+E+ +KIE L+ + +
Sbjct: 549 EALEGEKEEIKAKIEELQEVTQ 570


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 37.5 bits (88), Expect = 0.003
 Identities = 28/110 (25%), Positives = 55/110 (50%), Gaps = 18/110 (16%)

Query: 66  ENQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLES 125
           E  + E  LE  +E+  +L  ++E+E + R+    +L ++E  L   ++ L+ K+E LE 
Sbjct: 52  EAIKKEALLEA-KEEIHKLRNEFEKELRERR---NELQKLEKRLLQKEENLDRKLELLE- 106

Query: 126 IVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTELLKTHMEYIER 175
                  K +        LE+KE+EL+++   L ++  EL +   E ++ 
Sbjct: 107 -------KREE------ELEKKEKELEQKQQELEKKEEELEELIEEQLQE 143



 Score = 27.0 bits (61), Expect = 8.3
 Identities = 20/112 (17%), Positives = 53/112 (47%), Gaps = 22/112 (19%)

Query: 71  EVELELLREDNEQLVTQYEREKQLRK-----AADQKLFQIEDALEDDKKELNSKIESLES 125
           E +++   E+ ++++ + ++E +  K      A +++ ++ +  E + +E  ++++ LE 
Sbjct: 30  EAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEK 89

Query: 126 IVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTELLKTHMEYIERTK 177
                            RL +KE+ L ++  +L +R  EL K   E  ++ +
Sbjct: 90  -----------------RLLQKEENLDRKLELLEKREEELEKKEKELEQKQQ 124


>gnl|CDD|223266 COG0188, GyrA, Type IIA topoisomerase (DNA gyrase/topo II,
           topoisomerase IV), A subunit [DNA replication,
           recombination, and repair].
          Length = 804

 Score = 36.9 bits (86), Expect = 0.005
 Identities = 30/104 (28%), Positives = 53/104 (50%), Gaps = 16/104 (15%)

Query: 82  EQLVTQYER-EKQLRKAADQKLFQI----EDALEDDKKELNSKIESLESIVRMLELKSKN 136
           E+L+ ++   EKQ     D +L ++    E+ +E + KEL  +I  LE I     L S+ 
Sbjct: 407 EELMARFGLSEKQAEAILDLRLRRLTGLEEEKIEKELKELEKEIADLEKI-----LASEE 461

Query: 137 ASDHVYRLEEKE-QELKKEYSILHERYTELLKTHMEYIERTKLM 179
               +  + +KE  E+KK++    ER TE+++   + IE   L+
Sbjct: 462 R---LLDIIKKELLEIKKKFG--DERRTEIVEEEEDEIEDEDLI 500


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 35.5 bits (82), Expect = 0.018
 Identities = 29/111 (26%), Positives = 55/111 (49%)

Query: 57  LESLDLAFTENQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKEL 116
           +E L+    E +E   ELE   E+ E+ + + + E +  +A  ++L      LE++K+EL
Sbjct: 830 IEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEEL 889

Query: 117 NSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTELLK 167
             ++  LES +  L+ + +   + +  LE K + L+ E   L E   E  +
Sbjct: 890 EEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYE 940



 Score = 33.1 bits (76), Expect = 0.084
 Identities = 30/122 (24%), Positives = 61/122 (50%), Gaps = 3/122 (2%)

Query: 47  KDLMPLVVNVLESLDLAFTENQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIE 106
           + L   +  + E L+    E +E +  LE L E+ E L     + K+  +  ++K   ++
Sbjct: 733 EQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQ 792

Query: 107 DALEDDKKEL---NSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYT 163
           + LE+ ++EL     ++++LE  +  LE + +     +  LEE+ +EL+++   L E   
Sbjct: 793 EELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELE 852

Query: 164 EL 165
           EL
Sbjct: 853 EL 854



 Score = 31.6 bits (72), Expect = 0.31
 Identities = 28/98 (28%), Positives = 51/98 (52%), Gaps = 4/98 (4%)

Query: 68  QEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIV 127
           +E E ELE      + L  + E  +Q R+  +Q++    + LE++ +EL  K++ LE  +
Sbjct: 796 EELEEELEEAERRLDALERELESLEQRRERLEQEI----EELEEEIEELEEKLDELEEEL 851

Query: 128 RMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTEL 165
             LE + +   + +  LE +++EL+ E   L E   EL
Sbjct: 852 EELEKELEELKEELEELEAEKEELEDELKELEEEKEEL 889



 Score = 31.2 bits (71), Expect = 0.36
 Identities = 32/103 (31%), Positives = 52/103 (50%), Gaps = 3/103 (2%)

Query: 66  ENQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALED---DKKELNSKIES 122
           E  E   ELE L+ + E L  + ER  +  +   ++L ++E  LE+   + +ELN ++E 
Sbjct: 391 ELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEE 450

Query: 123 LESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTEL 165
           LE  +  L  + K     +  L+E+ Q L+KE S L  R   L
Sbjct: 451 LEEQLEELRDRLKELERELAELQEELQRLEKELSSLEARLDRL 493



 Score = 30.5 bits (69), Expect = 0.70
 Identities = 28/123 (22%), Positives = 58/123 (47%), Gaps = 10/123 (8%)

Query: 68  QEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNS--------- 118
           +E   ELE L E+  +L  + E  ++  + A++++ +++  LE+ ++EL           
Sbjct: 235 KELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELK 294

Query: 119 -KIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTELLKTHMEYIERTK 177
            +IE LE  + +L  + +   + +  LEE+ +ELK++   L E   E      E  +   
Sbjct: 295 EEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLA 354

Query: 178 LMM 180
            + 
Sbjct: 355 ELE 357



 Score = 30.1 bits (68), Expect = 0.80
 Identities = 23/97 (23%), Positives = 45/97 (46%)

Query: 69  EHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVR 128
           + E EL+ L+ +   L    E  ++  +  +++L +++  L   ++EL      LE +  
Sbjct: 685 KLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEE 744

Query: 129 MLELKSKNASDHVYRLEEKEQELKKEYSILHERYTEL 165
            LE   +   +   RLEE E+EL+     L +   E+
Sbjct: 745 ELEELEEELEELQERLEELEEELESLEEALAKLKEEI 781



 Score = 29.7 bits (67), Expect = 1.2
 Identities = 26/118 (22%), Positives = 54/118 (45%)

Query: 55  NVLESLDLAFTENQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKK 114
             LE L+   +  +E   EL+   E+ E+ + + + E +  +   ++L +    L+++ +
Sbjct: 239 KELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIE 298

Query: 115 ELNSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTELLKTHMEY 172
           EL  +I  L   +  LE + +   + +  L+EK + LK+E         EL +   E 
Sbjct: 299 ELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAEL 356



 Score = 29.7 bits (67), Expect = 1.2
 Identities = 17/72 (23%), Positives = 35/72 (48%)

Query: 106 EDALEDDKKELNSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTEL 165
             A + + KEL  ++  LE+ +  LE + K+  + +  LE+  +EL+++   L  +  EL
Sbjct: 662 SLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEEL 721

Query: 166 LKTHMEYIERTK 177
            +      E  +
Sbjct: 722 KRELAALEEELE 733



 Score = 29.3 bits (66), Expect = 1.7
 Identities = 32/109 (29%), Positives = 55/109 (50%)

Query: 57  LESLDLAFTENQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKEL 116
           LESL+ A  + +E   ELE  R+  ++ + + E E +  +     L +  ++LE  ++ L
Sbjct: 767 LESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERL 826

Query: 117 NSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTEL 165
             +IE LE  +  LE K     + +  LE++ +ELK+E   L     EL
Sbjct: 827 EQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEEL 875



 Score = 28.9 bits (65), Expect = 2.3
 Identities = 27/109 (24%), Positives = 50/109 (45%)

Query: 57  LESLDLAFTENQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKEL 116
           LE L+      +     LE L E+  + + + ER+ +  K     L +  + L+   +EL
Sbjct: 683 LEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEEL 742

Query: 117 NSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTEL 165
             ++E LE  +  L+ + +   + +  LEE   +LK+E   L E+   L
Sbjct: 743 EEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQAL 791



 Score = 27.8 bits (62), Expect = 5.6
 Identities = 24/103 (23%), Positives = 52/103 (50%)

Query: 65  TENQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLE 124
            E +E E  LE L +   +L    E  ++   A  ++L ++ +AL ++  EL +++  + 
Sbjct: 337 EELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIR 396

Query: 125 SIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTELLK 167
           + +  L+ + ++  + + RL E+ ++LK+E   L     EL  
Sbjct: 397 NELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQT 439



 Score = 27.0 bits (60), Expect = 9.5
 Identities = 26/111 (23%), Positives = 57/111 (51%)

Query: 57  LESLDLAFTENQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKEL 116
           LE L+  F   +E   ELE    +    + + +RE +  +   ++L +  + L+++ KEL
Sbjct: 371 LEELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKEL 430

Query: 117 NSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTELLK 167
            +++E L++ +  L  + +   + +  L ++ +EL++E + L E    L K
Sbjct: 431 EAELEELQTELEELNEELEELEEQLEELRDRLKELERELAELQEELQRLEK 481


>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein.  Members of this
           protein family have a signal peptide, a strongly
           conserved SH3 domain, a variable region, and then a
           C-terminal hydrophobic transmembrane alpha helix region.
          Length = 198

 Score = 34.2 bits (79), Expect = 0.021
 Identities = 20/76 (26%), Positives = 36/76 (47%), Gaps = 8/76 (10%)

Query: 66  ENQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLES 125
           ENQE + EL  L  + E+L  +  R KQL   A          L+++ +EL  ++  L+ 
Sbjct: 95  ENQELKQELSTLEAELERLQKELARIKQLSANAI--------ELDEENRELREELAELKQ 146

Query: 126 IVRMLELKSKNASDHV 141
               LE +++   ++ 
Sbjct: 147 ENEALEAENERLQENE 162



 Score = 31.5 bits (72), Expect = 0.20
 Identities = 24/82 (29%), Positives = 38/82 (46%), Gaps = 8/82 (9%)

Query: 73  ELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRMLEL 132
           EL  L+E+  +L  Q    +Q  +   Q+L  +E  LE  +KEL    +   + +   EL
Sbjct: 74  ELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAI---EL 130

Query: 133 KSKNASDHVYRLEEKEQELKKE 154
             +N       L E+  ELK+E
Sbjct: 131 DEENR-----ELREELAELKQE 147



 Score = 26.5 bits (59), Expect = 8.6
 Identities = 21/106 (19%), Positives = 49/106 (46%), Gaps = 8/106 (7%)

Query: 77  LREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRMLEL---K 133
            RE   +L  +    ++      ++L      L+ + +EL  ++ +LE+ +  L+    +
Sbjct: 64  ARERLPELQQELAELQEELAELQEQL----AELQQENQELKQELSTLEAELERLQKELAR 119

Query: 134 SKNASDHVYRLEEKEQELKKEYSILHERYTELLKTHMEYIERTKLM 179
            K  S +   L+E+ +EL++E + L +   E L+   E ++  +  
Sbjct: 120 IKQLSANAIELDEENRELREELAELKQE-NEALEAENERLQENEQR 164


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 34.8 bits (81), Expect = 0.024
 Identities = 23/66 (34%), Positives = 40/66 (60%), Gaps = 3/66 (4%)

Query: 105 IEDA---LEDDKKELNSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHER 161
           IE+A   + +DK++LN  I SLE + R LE K++ A   +   E+ ++EL+++   L E 
Sbjct: 504 IEEAKKLIGEDKEKLNELIASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEE 563

Query: 162 YTELLK 167
             +LL+
Sbjct: 564 EDKLLE 569



 Score = 30.2 bits (69), Expect = 0.84
 Identities = 23/107 (21%), Positives = 47/107 (43%), Gaps = 6/107 (5%)

Query: 61  DLAFTENQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKI 120
                E    E E +L  E  E+     E E +L + A+++    + A+++ KKE +  I
Sbjct: 535 KAEEAEALLKEAE-KLKEELEEKKEKLQEEEDKLLEEAEKE---AQQAIKEAKKEADEII 590

Query: 121 ESLESIVRML--ELKSKNASDHVYRLEEKEQELKKEYSILHERYTEL 165
           + L  + +     +K+    +   RL +  ++ +K+     E+  EL
Sbjct: 591 KELRQLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEEL 637



 Score = 29.0 bits (66), Expect = 1.8
 Identities = 24/76 (31%), Positives = 35/76 (46%), Gaps = 15/76 (19%)

Query: 64  FTENQEHEVELELLREDNEQLVTQYEREKQLRKA----------ADQKLFQIEDALEDDK 113
           F E+ E E EL +L E   ++ T  E E+++             A +KL  I   L    
Sbjct: 105 FIEDLEEEEELPILEEWVAKIRTLPELEQEIHNCIDEEGEVKDSASEKLRGIRRQL---- 160

Query: 114 KELNSKI-ESLESIVR 128
           +   S+I E LESI+R
Sbjct: 161 RRKKSRIREKLESIIR 176



 Score = 27.1 bits (61), Expect = 8.8
 Identities = 14/67 (20%), Positives = 33/67 (49%), Gaps = 3/67 (4%)

Query: 109 LEDDKKELNSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTELLKT 168
           LE+ ++EL  K E  E++++  E   +   +   +L+E+E +L +E     +   + +K 
Sbjct: 525 LEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEE---AEKEAQQAIKE 581

Query: 169 HMEYIER 175
             +  + 
Sbjct: 582 AKKEADE 588


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 34.5 bits (80), Expect = 0.027
 Identities = 32/111 (28%), Positives = 51/111 (45%), Gaps = 15/111 (13%)

Query: 65  TENQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLE 124
            E  + E  LE  +E+  +L  + ERE + R+    +L ++E  L   ++ L+ K+ESL+
Sbjct: 45  AETLKKEALLEA-KEEVHKLRAELERELKERR---NELQRLERRLLQREETLDRKMESLD 100

Query: 125 SIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTELLKTHMEYIER 175
                LE K K        L  KE+ L +      E   EL+    E +ER
Sbjct: 101 KKEENLEKKEK-------ELSNKEKNLDE----KEEELEELIAEQREELER 140


>gnl|CDD|178867 PRK00106, PRK00106, hypothetical protein; Provisional.
          Length = 535

 Score = 34.5 bits (79), Expect = 0.029
 Identities = 26/79 (32%), Positives = 44/79 (55%)

Query: 75  ELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRMLELKS 134
           ELL E  E+     E  +Q  K+  Q+L QIE  L +    L+ K E+L S  + LE K 
Sbjct: 72  ELLLEAKEEARKYREEIEQEFKSERQELKQIESRLTERATSLDRKDENLSSKEKTLESKE 131

Query: 135 KNASDHVYRLEEKEQELKK 153
           ++ +D    ++E+E++++K
Sbjct: 132 QSLTDKSKHIDEREEQVEK 150


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 33.9 bits (78), Expect = 0.053
 Identities = 25/111 (22%), Positives = 46/111 (41%), Gaps = 4/111 (3%)

Query: 58  ESLDLAFTENQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELN 117
           E L+    E  E + EL+ L+E+ ++L  +           + K+ ++E+  ED   E+ 
Sbjct: 392 EKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIK 451

Query: 118 SKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTELLKT 168
            +   LE +   L          +Y L+E+   ++KE S L     E    
Sbjct: 452 KQEWKLEQLAADLS----KYEQELYDLKEEYDRVEKELSKLQRELAEAEAQ 498



 Score = 32.0 bits (73), Expect = 0.23
 Identities = 24/97 (24%), Positives = 43/97 (44%)

Query: 69  EHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVR 128
           E    +E    + EQ + +   EK+  +   Q+L +    L++  K +  +IE+L     
Sbjct: 805 EEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKE 864

Query: 129 MLELKSKNASDHVYRLEEKEQELKKEYSILHERYTEL 165
            LE + +     +  LE +  +LKKE   L  +  EL
Sbjct: 865 ELEEELEELEAALRDLESRLGDLKKERDELEAQLREL 901



 Score = 31.6 bits (72), Expect = 0.31
 Identities = 31/113 (27%), Positives = 50/113 (44%), Gaps = 4/113 (3%)

Query: 68  QEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIV 127
              E ELE L E+  +L  + E  +QL +  ++K   I+D  E+++  +  KI  LE+ +
Sbjct: 247 ASLEEELEKLTEEISELEKRLEEIEQLLEELNKK---IKDLGEEEQLRVKEKIGELEAEI 303

Query: 128 RMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTELLKTHMEY-IERTKLM 179
             LE         +   EE+  +L+ E   L     EL +   E    R KL 
Sbjct: 304 ASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLT 356



 Score = 29.3 bits (66), Expect = 1.7
 Identities = 31/152 (20%), Positives = 67/152 (44%), Gaps = 19/152 (12%)

Query: 24  QSLAGSIYQEFEKMITKYDEDVVKDLMPLVVNVLESLDLAFTENQEHEVELELLREDNEQ 83
             L+ S   E +  ++K +E+V +     +   L  ++    +     +E E L ++ ++
Sbjct: 786 ARLSHSRIPEIQAELSKLEEEVSR-----IEARLREIE---QKLNRLTLEKEYLEKEIQE 837

Query: 84  LVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRMLE-----LKSK--N 136
           L  +   + + +  + +   +  + L   K+EL  ++E LE+ +R LE     LK +   
Sbjct: 838 LQ-EQRIDLKEQIKSIE---KEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDE 893

Query: 137 ASDHVYRLEEKEQELKKEYSILHERYTELLKT 168
               +  LE K +EL+ +     +R +EL   
Sbjct: 894 LEAQLRELERKIEELEAQIEKKRKRLSELKAK 925


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 33.3 bits (76), Expect = 0.072
 Identities = 19/78 (24%), Positives = 38/78 (48%), Gaps = 4/78 (5%)

Query: 104 QIEDALEDDKKELNSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEY----SILH 159
           Q +    + K+E+N  IE L ++ + LE K+++    +   E+ ++EL++E         
Sbjct: 501 QAKTFYGEFKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERER 560

Query: 160 ERYTELLKTHMEYIERTK 177
            +  EL K   E ++  K
Sbjct: 561 NKKLELEKEAQEALKALK 578



 Score = 32.1 bits (73), Expect = 0.18
 Identities = 22/86 (25%), Positives = 39/86 (45%), Gaps = 3/86 (3%)

Query: 89  EREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRMLELKSKNASDHVYRLEEKE 148
           E+ K       +++  + + L   +KEL  K E LE +++  E   K     +  L+E+E
Sbjct: 500 EQAKTFYGEFKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERE 559

Query: 149 QELKKEYSILHERYTELLKTHMEYIE 174
           +  K E   L +   E LK   + +E
Sbjct: 560 RNKKLE---LEKEAQEALKALKKEVE 582


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 33.1 bits (76), Expect = 0.089
 Identities = 32/108 (29%), Positives = 54/108 (50%), Gaps = 4/108 (3%)

Query: 58  ESLDLAFTENQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELN 117
           + L+      Q H  E E LRED + L    ER ++LR+ A +   ++E+A  +  ++  
Sbjct: 328 DRLEECRVAAQAHNEEAESLREDADDL---EERAEELREEAAELESELEEA-REAVEDRR 383

Query: 118 SKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTEL 165
            +IE LE  +  L  +  +A   +   E+  +EL++E   L ER  EL
Sbjct: 384 EEIEELEEEIEELRERFGDAPVDLGNAEDFLEELREERDELREREAEL 431



 Score = 28.5 bits (64), Expect = 2.7
 Identities = 26/101 (25%), Positives = 44/101 (43%), Gaps = 6/101 (5%)

Query: 55  NVLESLDLAFTENQEHEVELELLREDNEQL----VTQYEREKQLRKAADQKLFQIEDALE 110
           ++ E +       +E     E LRE   +L      + E   +  + A++   ++   L 
Sbjct: 520 DLEELIAERRETIEEKRERAEELRERAAELEAEAEEKREAAAEAEEEAEEAREEV-AELN 578

Query: 111 DDKKELNSKIESLESIVRMLELKSKNASDHVYRLEEKEQEL 151
               EL  +IESLE I  +L   + +A D + RL EK + L
Sbjct: 579 SKLAELKERIESLERIRTLLAAIA-DAEDEIERLREKREAL 618



 Score = 27.7 bits (62), Expect = 5.3
 Identities = 23/82 (28%), Positives = 44/82 (53%), Gaps = 1/82 (1%)

Query: 69  EHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVR 128
           E E ELE L E+ E++  + ER + L +A D+ + ++E+  ED ++ +  + E++E    
Sbjct: 479 ELEAELEDLEEEVEEVEERLERAEDLVEAEDR-IERLEERREDLEELIAERRETIEEKRE 537

Query: 129 MLELKSKNASDHVYRLEEKEQE 150
             E   + A++     EEK + 
Sbjct: 538 RAEELRERAAELEAEAEEKREA 559


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 32.6 bits (74), Expect = 0.13
 Identities = 29/129 (22%), Positives = 55/129 (42%), Gaps = 14/129 (10%)

Query: 28  GSIYQEFEKMITKYDEDVV---KDLMPLVVNVLESLDLAFTENQEHEVELELLREDNEQL 84
           G + +E + + TK +  V    KD   +      S++       +  V  E      +QL
Sbjct: 451 GQLKKENDMLQTKLNSMVSAKQKDKQSMQ-----SMEKRLKSEADSRVNAE------KQL 499

Query: 85  VTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRMLELKSKNASDHVYRL 144
             + +R+K+  + A +   Q   + E+  + L    + LE  ++ LE   K   +    L
Sbjct: 500 AEEKKRKKEEEETAARAAAQAAASREECAESLKQAKQDLEMEIKKLEHDLKLKEEECRML 559

Query: 145 EEKEQELKK 153
           E++ QEL+K
Sbjct: 560 EKEAQELRK 568


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 32.0 bits (73), Expect = 0.20
 Identities = 25/117 (21%), Positives = 49/117 (41%), Gaps = 5/117 (4%)

Query: 74  LELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRMLELK 133
           +E L E+N +L  + E  K+  +  + +L +    + D  ++ + +I + +  +  LE +
Sbjct: 431 VERLEEENSELKRELEELKREIEKLESELERFRREVRDKVRK-DREIRARDRRIERLEKE 489

Query: 134 SKNASDHVYRLEEKEQELKKEYSILHERYTELLKT----HMEYIERTKLMMGSGSSD 186
            +     V  LE K  EL+K   +        +K      +E IE  +   G    D
Sbjct: 490 LEEKKKRVEELERKLAELRKMRKLELSGKGTPVKVVEKLTLEAIEEAEEEYGIKEGD 546



 Score = 28.1 bits (63), Expect = 3.6
 Identities = 32/149 (21%), Positives = 63/149 (42%), Gaps = 12/149 (8%)

Query: 29  SIYQEFEKMITKYDEDVVKDLMPLVVNVLESLDLAFTENQEHEVELELLREDNEQLVTQY 88
            + ++  ++    D + +K L+     + E+L     E +  E E     E  E  +T Y
Sbjct: 363 KVERKLPELGIWKDVERIKALVIRGYPLAEALSKVKEEERPREKEGTEEEERRE--ITVY 420

Query: 89  EREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRML----------ELKSKNAS 138
           E+  +  +   ++L +    L+ + +EL  +IE LES +             + + +   
Sbjct: 421 EKRIKKLEETVERLEEENSELKRELEELKREIEKLESELERFRREVRDKVRKDREIRARD 480

Query: 139 DHVYRLEEKEQELKKEYSILHERYTELLK 167
             + RLE++ +E KK    L  +  EL K
Sbjct: 481 RRIERLEKELEEKKKRVEELERKLAELRK 509


>gnl|CDD|240403 PTZ00400, PTZ00400, DnaK-type molecular chaperone; Provisional.
          Length = 663

 Score = 32.1 bits (73), Expect = 0.21
 Identities = 32/112 (28%), Positives = 46/112 (41%), Gaps = 29/112 (25%)

Query: 23  VQSLAGSIYQEFEKMITKYDEDVVKDLMPLVVNVLESLDLAFTENQEHEVELELLREDNE 82
           +QS  G   +E EKM+ + +E                    + E  E + EL   + + E
Sbjct: 535 IQSSGGLSDEEIEKMVKEAEE--------------------YKEQDEKKKELVDAKNEAE 574

Query: 83  QLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRMLELKS 134
            L+  Y  EKQL    D+    I DA   DK EL  KI  L S +   ++ S
Sbjct: 575 TLI--YSVEKQLSDLKDK----ISDA---DKDELKQKITKLRSTLSSEDVDS 617


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 31.2 bits (71), Expect = 0.36
 Identities = 36/163 (22%), Positives = 71/163 (43%), Gaps = 11/163 (6%)

Query: 11  THDDSHIVMSEKVQSLAGSIY------QEFEKMITKYDEDVVKDLMPLVVNVLESLDLAF 64
                   + E+++ L   +Y         E+      E +  +L   +  +   L+   
Sbjct: 271 ELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERL-ANLERQLEELEAQLEELE 329

Query: 65  TENQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLE 124
           ++  E   EL  L E  E+L  + E  +   +  + +L ++E  LE+ +++L    E+L 
Sbjct: 330 SKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQL----ETLR 385

Query: 125 SIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTELLK 167
           S V  LEL+  + ++ + RLE + + L+     L +   ELLK
Sbjct: 386 SKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLK 428



 Score = 30.0 bits (68), Expect = 1.0
 Identities = 23/81 (28%), Positives = 40/81 (49%), Gaps = 7/81 (8%)

Query: 69  EHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVR 128
           E E ELE L+E+ E+L    E  ++  + A+Q L    DA E +  +L ++++SLE   R
Sbjct: 444 ELEEELEELQEELERLEEALEELREELEEAEQAL----DAAERELAQLQARLDSLE---R 496

Query: 129 MLELKSKNASDHVYRLEEKEQ 149
           + E     +      L+ +  
Sbjct: 497 LQENLEGFSEGVKALLKNQSG 517



 Score = 28.9 bits (65), Expect = 2.2
 Identities = 20/69 (28%), Positives = 35/69 (50%), Gaps = 2/69 (2%)

Query: 89  EREKQLRKAADQKLFQIE--DALEDDKKELNSKIESLESIVRMLELKSKNASDHVYRLEE 146
            RE +L     +     E  + L+++ KE   ++E L + ++ LE K +     V  LEE
Sbjct: 222 LRELELALLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEE 281

Query: 147 KEQELKKEY 155
           + +EL+KE 
Sbjct: 282 EIEELQKEL 290



 Score = 28.5 bits (64), Expect = 2.9
 Identities = 26/101 (25%), Positives = 43/101 (42%), Gaps = 9/101 (8%)

Query: 55  NVLESLDLAFTENQEHEVELELLREDNEQLVTQYERE-KQLRKAADQKLFQIEDALEDDK 113
             +E L+    E +E   E E   E+ E  + Q + E K LR+A D         L  + 
Sbjct: 761 AEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALD--------ELRAEL 812

Query: 114 KELNSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKE 154
             LN +  +L   +  LE +       +  LEE+ +EL ++
Sbjct: 813 TLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSED 853



 Score = 28.1 bits (63), Expect = 3.9
 Identities = 30/112 (26%), Positives = 52/112 (46%), Gaps = 13/112 (11%)

Query: 65  TENQEHEVELELLR-EDN-EQLVTQYER-EKQLRKAADQKLFQIEDALED--------DK 113
           TE +       L R ED   +L  Q +  E+Q  KA  ++  +++  L +          
Sbjct: 177 TERKLERTRENLDRLEDILNELERQLKSLERQAEKA--ERYKELKAELRELELALLVLRL 234

Query: 114 KELNSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTEL 165
           +EL  ++E L+  ++  E + +  +  +  LEEK +EL+ E S L E   EL
Sbjct: 235 EELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEEL 286



 Score = 28.1 bits (63), Expect = 4.0
 Identities = 18/107 (16%), Positives = 40/107 (37%)

Query: 68  QEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIV 127
           +  E  +       E L  Q E   +  ++   ++ ++E+ +E+ + EL + +    S+ 
Sbjct: 827 ESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLE 886

Query: 128 RMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTELLKTHMEYIE 174
             L L      +    L E E +  +    L E   +L +  +    
Sbjct: 887 EALALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEG 933



 Score = 27.3 bits (61), Expect = 7.1
 Identities = 23/106 (21%), Positives = 48/106 (45%), Gaps = 3/106 (2%)

Query: 73  ELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALE---DDKKELNSKIESLESIVRM 129
           +L  L  + EQL  +  +  +     + ++ ++E+ LE   ++  E  ++IE LE+ +  
Sbjct: 734 DLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQ 793

Query: 130 LELKSKNASDHVYRLEEKEQELKKEYSILHERYTELLKTHMEYIER 175
           L+ + K   + +  L  +   L +E + L ER   L +       R
Sbjct: 794 LKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERR 839


>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein.  Hsp70 chaperones help to fold
           many proteins. Hsp70 assisted folding involves repeated
           cycles of substrate binding and release. Hsp70 activity
           is ATP dependent. Hsp70 proteins are made up of two
           regions: the amino terminus is the ATPase domain and the
           carboxyl terminus is the substrate binding region.
          Length = 598

 Score = 31.0 bits (71), Expect = 0.37
 Identities = 19/88 (21%), Positives = 41/88 (46%), Gaps = 23/88 (26%)

Query: 89  EREKQLRKAADQK------LFQIEDALED--------DKKELNSKIESLESIVRMLELKS 134
             +K+ ++  + K      ++ +E +L++        DKK++   IE L+      EL+ 
Sbjct: 518 AEDKKRKERIEAKNEAEEYVYSLEKSLKEEGDKLPEADKKKVEEAIEWLKE-----ELEG 572

Query: 135 KNASDHVYRLEEKEQELKKEYSILHERY 162
           ++  +    +E K +EL+K    + ER 
Sbjct: 573 EDKEE----IEAKTEELQKVVQPIGERM 596


>gnl|CDD|233067 TIGR00641, acid_CoA_mut_N, methylmalonyl-CoA mutase N-terminal
           domain.  Methylmalonyl-CoA mutase (EC 5.4.99.2)
           catalyzes a reversible isomerization between
           L-methylmalonyl-CoA and succinyl-CoA. The enzyme uses an
           adenosylcobalamin cofactor. It may be a homodimer, as in
           mitochondrion, or a heterodimer with partially
           homologous beta chain that does not bind the
           adenosylcobalamin cofactor, as in Propionibacterium
           freudenreichii. The most similar archaeal sequences are
           separate chains, such as AF2215 abd AF2219 of
           Archaeoglobus fulgidus, that correspond roughly to the
           first 500 and last 130 residues, respectively of known
           methylmalonyl-CoA mutases. This model describes the
           N-terminal domain subfamily. In a neighbor-joining tree,
           AF2215 branches with a bacterial isobutyryl-CoA mutase,
           which is also the same length. Scoring between the noise
           and trusted cutoffs are the non-catalytic, partially
           homologous beta chains from certain heterodimeric
           examples of 5.4.99.2.
          Length = 524

 Score = 30.9 bits (70), Expect = 0.47
 Identities = 13/37 (35%), Positives = 21/37 (56%), Gaps = 1/37 (2%)

Query: 69  EHEVELELLREDNEQLVT-QYEREKQLRKAADQKLFQ 104
           E E E+E+L+ DN  +   Q  + K+LR   DQ+  +
Sbjct: 437 EEEDEVEVLKVDNSSVREEQIAKLKKLRAERDQEKVE 473


>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
            This model family represents the major capsid protein
           component of the heads (capsids) of bacteriophage HK97,
           phi-105, P27, and related phage. This model represents
           one of several analogous families lacking detectable
           sequence similarity. The gene encoding this component is
           typically located in an operon encoding the small and
           large terminase subunits, the portal protein and the
           prohead or maturation protease [Mobile and
           extrachromosomal element functions, Prophage functions].
          Length = 384

 Score = 30.4 bits (69), Expect = 0.53
 Identities = 18/79 (22%), Positives = 39/79 (49%), Gaps = 1/79 (1%)

Query: 77  LREDNEQLVTQYEREK-QLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRMLELKSK 135
           LRE   +L  +  +     + A  ++     DAL+++  +L+++I+ LE ++  LE K  
Sbjct: 4   LREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPA 63

Query: 136 NASDHVYRLEEKEQELKKE 154
            + +     EE+E+   + 
Sbjct: 64  ASGEGGGGEEEEEEAKAEA 82


>gnl|CDD|234172 TIGR03345, VI_ClpV1, type VI secretion ATPase, ClpV1 family.
           Members of this protein family are homologs of ClpB, an
           ATPase associated with chaperone-related functions.
           These ClpB homologs, designated ClpV1, are a key
           component of the bacterial pathogenicity-associated type
           VI secretion system [Protein fate, Protein and peptide
           secretion and trafficking, Cellular processes,
           Pathogenesis].
          Length = 852

 Score = 30.7 bits (70), Expect = 0.56
 Identities = 17/60 (28%), Positives = 28/60 (46%), Gaps = 3/60 (5%)

Query: 68  QEHEVELELLREDNEQLVTQYEREKQLRKAAD---QKLFQIEDALEDDKKELNSKIESLE 124
            E   EL  L  +   L  ++++EK+L +A      +L    DA  DD   L +++  LE
Sbjct: 453 AELRAELAALEAELAALEARWQQEKELVEAILALRAELEADADAPADDDAALRAQLAELE 512


>gnl|CDD|179699 PRK03992, PRK03992, proteasome-activating nucleotidase;
           Provisional.
          Length = 389

 Score = 30.2 bits (69), Expect = 0.67
 Identities = 13/44 (29%), Positives = 22/44 (50%)

Query: 111 DDKKELNSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKE 154
           +  + L  +   LE  +R LELK ++      +LE + + LK E
Sbjct: 1   ERLEALEERNSELEEQIRQLELKLRDLEAENEKLERELERLKSE 44


>gnl|CDD|234348 TIGR03761, ICE_PFL4669, integrating conjugative element protein,
           PFL_4669 family.  Members of this protein family, such
           as PFL4669, are found in integrating conjugative
           elements (ICE) of the PFGI-1 class as in Pseudomonas
           fluorescens.
          Length = 216

 Score = 30.0 bits (68), Expect = 0.69
 Identities = 9/33 (27%), Positives = 20/33 (60%)

Query: 98  ADQKLFQIEDALEDDKKELNSKIESLESIVRML 130
           AD  L +IE+ L   ++E+ + ++ L+ ++  L
Sbjct: 58  ADWALLRIEEKLLSARQEMQALLQRLDDLLAQL 90


>gnl|CDD|224114 COG1193, COG1193, Mismatch repair ATPase (MutS family) [DNA
           replication, recombination, and repair].
          Length = 753

 Score = 30.4 bits (69), Expect = 0.74
 Identities = 13/70 (18%), Positives = 28/70 (40%), Gaps = 1/70 (1%)

Query: 104 QIEDALEDDKKELNSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEY-SILHERY 162
           + +    ++K+ L   IE LE + + LE + +     +  +E           S+L  + 
Sbjct: 494 EAKTEFGEEKELLEELIEKLEEVRKELEEELEEVEKLLDEVELLTGANSGGKTSLLELKA 553

Query: 163 TELLKTHMEY 172
             ++  HM  
Sbjct: 554 QIVVLAHMGL 563


>gnl|CDD|202685 pfam03564, DUF1759, Protein of unknown function (DUF1759).  This is
           a family of proteins of unknown function. Most of the
           members are gag-polyproteins.
          Length = 146

 Score = 29.3 bits (66), Expect = 0.79
 Identities = 14/58 (24%), Positives = 26/58 (44%)

Query: 82  EQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRMLELKSKNASD 139
           E L  +Y+  + + ++   KL ++     D   +L    +    I+R LE   +NA D
Sbjct: 61  EALKERYDNPRVIIRSLLNKLMKLPSTNNDSVSQLRRLYDEANEIIRQLEQLGENADD 118


>gnl|CDD|217457 pfam03255, ACCA, Acetyl co-enzyme A carboxylase carboxyltransferase
           alpha subunit.  Acetyl co-enzyme A carboxylase
           carboxyltransferase is composed of an alpha and beta
           subunit.
          Length = 145

 Score = 29.2 bits (66), Expect = 0.80
 Identities = 15/46 (32%), Positives = 25/46 (54%), Gaps = 3/46 (6%)

Query: 109 LEDDKKELNSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKE 154
            E    EL +KI+ L    ++      + SD ++RLE+K  +LK+E
Sbjct: 5   FEKPLAELEAKIDELR---KLARKNDVDVSDEIHRLEKKLDKLKRE 47


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. This family includes the
           PrrC protein that is thought to be the active component
           of the anticodon nuclease.
          Length = 713

 Score = 30.0 bits (68), Expect = 0.87
 Identities = 29/143 (20%), Positives = 58/143 (40%), Gaps = 23/143 (16%)

Query: 66  ENQEHEVELELLRED-----------NEQLVTQYEREKQL-----RKAADQKLFQIEDAL 109
           EN E E ++E L+++             ++  + E  ++       KA  +   + +  L
Sbjct: 89  ENIEIEAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDSNL 148

Query: 110 EDDKKELNSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYT--ELLK 167
            +  K LN K    E +++ L+    NAS  +  LEE + ++K  +S          L  
Sbjct: 149 SEALKGLNYKKNFKEKLLKELKSVILNAS-SLLSLEELKAKIKTLFSSNKPELALLTLSV 207

Query: 168 THMEYIERTKLM----MGSGSSD 186
              + IE+ +++    +GS    
Sbjct: 208 IDFDEIEQAEILEKSIIGSSDVP 230


>gnl|CDD|227250 COG4913, COG4913, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 1104

 Score = 30.0 bits (67), Expect = 0.87
 Identities = 19/128 (14%), Positives = 36/128 (28%), Gaps = 4/128 (3%)

Query: 68  QEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDD----KKELNSKIESL 123
           +E E  +     +  Q  ++ E  K     A +      D ++         L++K E  
Sbjct: 298 EEQETLVRQFTVEQTQAKSKVESAKIETDRAREMETLAHDNVKQIVGAQHGILSAKREGA 357

Query: 124 ESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTELLKTHMEYIERTKLMMGSG 183
               R +          V  L     E  +E   L+      +  +    E  +      
Sbjct: 358 VDKRRTISTARAGLDALVKGLGGAAPESAEELLELNNAARLTVDEYPAAREALESAGQRN 417

Query: 184 SSDRMDSR 191
             DR  + 
Sbjct: 418 VEDRTRAV 425


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 29.8 bits (67), Expect = 0.94
 Identities = 19/88 (21%), Positives = 36/88 (40%)

Query: 67  NQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESI 126
            Q+   E E L++  ++ +   E++KQ  +AA Q   + + A E   K   +     E+ 
Sbjct: 93  QQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAE 152

Query: 127 VRMLELKSKNASDHVYRLEEKEQELKKE 154
            +     +K A+    +  E E   K  
Sbjct: 153 AKRAAAAAKKAAAEAKKKAEAEAAKKAA 180


>gnl|CDD|235580 PRK05724, PRK05724, acetyl-CoA carboxylase carboxyltransferase
           subunit alpha; Validated.
          Length = 319

 Score = 29.7 bits (68), Expect = 0.95
 Identities = 14/41 (34%), Positives = 25/41 (60%), Gaps = 3/41 (7%)

Query: 114 KELNSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKE 154
            EL +KIE L ++    ++   + S+ + RLE+K +EL K+
Sbjct: 13  AELEAKIEELRAVAEDSDV---DLSEEIERLEKKLEELTKK 50


>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
           potential nuclease [General function prediction only].
          Length = 290

 Score = 29.5 bits (66), Expect = 1.1
 Identities = 24/100 (24%), Positives = 48/100 (48%), Gaps = 5/100 (5%)

Query: 73  ELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRMLEL 132
           E   L+ED E+L  + E  ++ ++   ++L ++E   E    E+  +++ LE     LE 
Sbjct: 129 EYMDLKEDYEELKEKLEELQKEKEELLKELEELEAEYE----EVQERLKRLEVENSRLEE 184

Query: 133 KSKNASDHVYRLEEKEQELKKEYSIL-HERYTELLKTHME 171
             K     VY L+++  EL+    +   E  ++L+K  + 
Sbjct: 185 MLKKLPGEVYDLKKRWDELEPGVELPEEELISDLVKETLN 224


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
           protein is found in bacteria and eukaryotes. Proteins in
           this family are typically between 943 to 1234 amino
           acids in length. This family contains a P-loop motif
           suggesting it is a nucleotide binding protein. It may be
           involved in replication.
          Length = 1198

 Score = 29.7 bits (67), Expect = 1.1
 Identities = 27/115 (23%), Positives = 51/115 (44%), Gaps = 11/115 (9%)

Query: 55  NVLESLDLAFTENQEHEVELELLREDNEQLVTQYEREKQLRKAA---DQKLFQIE---DA 108
             L++L+      Q+ E       E+  +L  +  R KQ   +A    ++L Q+E   +A
Sbjct: 420 EDLQALESQL--RQQLEAGKLEFNEEEYELELRLGRLKQRLDSATATPEELEQLEINDEA 477

Query: 109 LEDDKKELNS---KIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHE 160
           LE  ++E       +E L+S +R L  +   A + + R E +  +L++    L  
Sbjct: 478 LEKAQEEQEQAEANVEQLQSELRQLRKRRDEALEALQRAERRLLQLRQALDELEL 532


>gnl|CDD|192773 pfam11559, ADIP, Afadin- and alpha -actinin-Binding.  This family
           is found in mammals where it is localised at cell-cell
           adherens junctions, and in Sch. pombe and other fungi
           where it anchors spindle-pole bodies to spindle
           microtubules. It is a coiled-coil structure, and in
           pombe, it is required for anchoring the minus end of
           spindle microtubules to the centrosome equivalent, the
           spindle-pole body. The name ADIP derives from the family
           being composed of Afadin- and alpha -Actinin-Binding
           Proteins Localised at Cell-Cell Adherens Junctions.
          Length = 149

 Score = 28.8 bits (65), Expect = 1.2
 Identities = 28/111 (25%), Positives = 51/111 (45%), Gaps = 15/111 (13%)

Query: 50  MPLVVNVLESLDLAFTENQEHEVELEL----LREDNEQLVTQYEREKQLRKAADQKLFQI 105
           +  ++NV+ SL      + E    LE     L  + E+L    ER K   +  +++L   
Sbjct: 31  IVRIINVIYSLLQQRDRDLEFRESLEETLRKLEAEIERLQNTIERLKTQLEDLEREL--- 87

Query: 106 EDALEDDKKELNSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYS 156
              L+  +++L  K+++LE        K KN  + V RL+   Q+ K +Y+
Sbjct: 88  -ALLQAKERQLEKKLKTLEQ-------KLKNEKEEVQRLKNIIQQRKTQYN 130


>gnl|CDD|223944 COG1012, PutA, NAD-dependent aldehyde dehydrogenases [Energy
           production and conversion].
          Length = 472

 Score = 29.6 bits (67), Expect = 1.2
 Identities = 12/41 (29%), Positives = 20/41 (48%), Gaps = 9/41 (21%)

Query: 85  VTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLES 125
           ++  ER   LR+ A        D LE   +EL + + +LE+
Sbjct: 55  LSAEERAAILRRIA--------DLLEARAEEL-AALITLET 86


>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
           with the myosin-like domain [Function unknown].
          Length = 499

 Score = 29.6 bits (66), Expect = 1.3
 Identities = 22/99 (22%), Positives = 39/99 (39%), Gaps = 8/99 (8%)

Query: 67  NQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESI 126
            Q      + L    +Q      R K L     ++L     +L+  +K+L +    L+S 
Sbjct: 129 RQNLAKAQQELARLTKQAQDLQTRLKTL-AEQRRQLEAQAQSLQASQKQLQASATQLKSQ 187

Query: 127 VRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTEL 165
           V  L+L+S        ++E++ Q L    +    R  EL
Sbjct: 188 VLDLKLRSA-------QIEQEAQNLATRANAAQARTEEL 219



 Score = 27.3 bits (60), Expect = 7.0
 Identities = 11/90 (12%), Positives = 31/90 (34%)

Query: 85  VTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRMLELKSKNASDHVYRL 144
           + Q  +    R  A Q   +         ++    I+  ++ +     +    ++ +   
Sbjct: 198 IEQEAQNLATRANAAQARTEELARRAAAAQQTAQAIQQRDAQISQKAQQIAARAEQIRER 257

Query: 145 EEKEQELKKEYSILHERYTELLKTHMEYIE 174
           E + Q L+   + L +   +L   +  Y+ 
Sbjct: 258 ERQLQRLETAQARLEQEVAQLEAYYQAYVR 287


>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
          Length = 1463

 Score = 29.3 bits (66), Expect = 1.5
 Identities = 23/104 (22%), Positives = 44/104 (42%), Gaps = 8/104 (7%)

Query: 58  ESLDLAFTENQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELN 117
           E L   F  + + +    LL+++   L +    E   R+  + K         D K   +
Sbjct: 845 EVLIQKFGRSLKAKKRFSLLKKETIYLQSAQRVELAERQLQELK--------IDVKSISS 896

Query: 118 SKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHER 161
            K+ +LE    ++ELK   +SD +  LE K + + +   +L+  
Sbjct: 897 LKLVNLELESEIIELKKSLSSDLIENLEFKTELIARLKKLLNNI 940


>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812).  This
           family consists of several eukaryotic proteins of
           unknown function.
          Length = 536

 Score = 29.0 bits (65), Expect = 1.7
 Identities = 32/128 (25%), Positives = 55/128 (42%), Gaps = 9/128 (7%)

Query: 47  KDLMPLVVNVLESLDLAFTENQEHEVELEL------LREDNEQLVTQYEREKQLRKAADQ 100
           K L   +  V E L     EN+E E E ++      L  D E  V + +    L  A+ +
Sbjct: 294 KALASKIKQVNEELTTVRQENEELEEEYKIKKRTVELLPDAENNVAKLQA---LVVASSE 350

Query: 101 KLFQIEDALEDDKKELNSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHE 160
           +L ++    E  +  L  +  SL+   R  E +++   D + +L  K +EL+ E     +
Sbjct: 351 RLLELAQQWEAHRTPLIDEYRSLKEKNRNKEDETQRQLDEIKKLRNKIEELESELQTKEQ 410

Query: 161 RYTELLKT 168
            Y +LL  
Sbjct: 411 LYKQLLDE 418



 Score = 28.6 bits (64), Expect = 2.7
 Identities = 23/79 (29%), Positives = 43/79 (54%), Gaps = 5/79 (6%)

Query: 89  EREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRMLELKSKNASDHVYRLEEKE 148
           + E+   K  + +L    +AL++   EL S IE + S ++ L  K K  ++ +  + ++ 
Sbjct: 259 DTEEDRTKEREAEL----EALQEQIDELESSIEEVLSEIKALASKIKQVNEELTTVRQEN 314

Query: 149 QELKKEYSILHERYTELLK 167
           +EL++EY I  +R  ELL 
Sbjct: 315 EELEEEYKI-KKRTVELLP 332


>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain.  Axonemal
           dynein light chain proteins play a dynamic role in
           flagellar and cilia motility. Eukaryotic cilia and
           flagella are complex organelles consisting of a core
           structure, the axoneme, which is composed of nine
           microtubule doublets forming a cylinder that surrounds a
           pair of central singlet microtubules. This
           ultra-structural arrangement seems to be one of the most
           stable micro-tubular assemblies known and is responsible
           for the flagellar and ciliary movement of a large number
           of organisms ranging from protozoan to mammals. This
           light chain interacts directly with the N-terminal half
           of the heavy chains.
          Length = 189

 Score = 28.7 bits (65), Expect = 1.8
 Identities = 20/62 (32%), Positives = 30/62 (48%), Gaps = 5/62 (8%)

Query: 94  LRKAADQKLFQIEDALEDDKKELNSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKK 153
           +RKA   +  Q +  LE + K+L  + E LE  V  LE K +       R EE+ Q  +K
Sbjct: 112 MRKALQAE--QGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEK---REEEERQIEEK 166

Query: 154 EY 155
            +
Sbjct: 167 RH 168


>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
          Length = 646

 Score = 29.1 bits (66), Expect = 1.9
 Identities = 15/53 (28%), Positives = 29/53 (54%), Gaps = 3/53 (5%)

Query: 82  EQLVTQYEREKQLRKAADQKLFQIED---ALEDDKKELNSKIESLESIVRMLE 131
           E+     +  ++L K  +++L +IE     LE++  EL ++I+ LE  +  LE
Sbjct: 75  EKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISELENEIKELEQEIERLE 127



 Score = 28.7 bits (65), Expect = 2.4
 Identities = 20/79 (25%), Positives = 33/79 (41%), Gaps = 9/79 (11%)

Query: 47  KDLMPLVVNVLESLDLAFTENQEHEVELELLREDNEQLVTQYERE-KQLRKAADQKLFQI 105
           K  + +VV VL+            EVE EL +   E+L  + E    +L +   ++L +I
Sbjct: 176 KGYVYVVVVVLK--------ELSDEVEEELKKLGFERLELEEEGTPSELIREIKEELEEI 227

Query: 106 EDALEDDKKELNSKIESLE 124
           E   E   +EL    +   
Sbjct: 228 EKERESLLEELKELAKKYL 246



 Score = 28.7 bits (65), Expect = 2.6
 Identities = 18/101 (17%), Positives = 42/101 (41%), Gaps = 19/101 (18%)

Query: 84  LVTQYEREKQLRKAADQKLFQIEDALEDDKKE-----------LNSKIESLESIV----- 127
           +  +  +++ L    +  +  IED  E+   E           L+  ++ L S +     
Sbjct: 12  VTLKSYKDEVLEALHELGVVHIEDLKEELSNERLRKLRSLLTKLSEALDKLRSYLPKLNP 71

Query: 128 ---RMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTEL 165
                 ++  K+  + +  +EE+ ++++KE   L E  +EL
Sbjct: 72  LREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISEL 112


>gnl|CDD|220368 pfam09730, BicD, Microtubule-associated protein Bicaudal-D.  BicD
           proteins consist of three coiled-coiled domains and are
           involved in dynein-mediated minus end-directed transport
           from the Golgi apparatus to the endoplasmic reticulum
           (ER). For full functioning they bind with GSK-3beta
           pfam05350 to maintain the anchoring of microtubules to
           the centromere. It appears that amino-acid residues
           437-617 of BicD and the kinase activity of GSK-3 are
           necessary for the formation of a complex between BicD
           and GSK-3beta in intact cells.
          Length = 711

 Score = 29.0 bits (65), Expect = 2.0
 Identities = 20/78 (25%), Positives = 33/78 (42%)

Query: 63  AFTENQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIES 122
           A +E  E + EL+ L+    +L  +YE EK   +   Q L +   +LE    E   +   
Sbjct: 358 AVSEVGELKAELKALKARYNELQERYEDEKDRLRNEIQNLAEKLLSLEKSSHEDQERSGD 417

Query: 123 LESIVRMLELKSKNASDH 140
           LE  +R L   +  +   
Sbjct: 418 LEKELRKLTHVAGESQGS 435


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 29.0 bits (65), Expect = 2.1
 Identities = 25/100 (25%), Positives = 50/100 (50%)

Query: 66  ENQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLES 125
             +    ELE L E  + L  + E+ ++  +  + +L ++ +   +  K L  +++ LE 
Sbjct: 309 GLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEE 368

Query: 126 IVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTEL 165
            +  LE + + A + + +LEE  QELK+E + L     E+
Sbjct: 369 RLEELEKELEKALERLKQLEEAIQELKEELAELSAALEEI 408



 Score = 27.8 bits (62), Expect = 5.5
 Identities = 26/89 (29%), Positives = 39/89 (43%)

Query: 65  TENQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLE 124
              +E   ELE  +E  E+L  +    + L   A +   +    LE   +EL  KIE LE
Sbjct: 235 EALEERLAELEEEKERLEELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLE 294

Query: 125 SIVRMLELKSKNASDHVYRLEEKEQELKK 153
            + R +E   +        LEE E+ L+K
Sbjct: 295 ELEREIEELEEELEGLRALLEELEELLEK 323


>gnl|CDD|227506 COG5179, TAF1, Transcription initiation factor TFIID, subunit TAF1
           [Transcription].
          Length = 968

 Score = 28.9 bits (64), Expect = 2.2
 Identities = 30/167 (17%), Positives = 63/167 (37%), Gaps = 8/167 (4%)

Query: 12  HDDSHIVMSEKVQSLAGSIYQEFEKMITKYDEDVVKDLMPLVVNVLESLDLAFTENQEHE 71
           +D   +  S  V  +  +I  +F +   +   D V          ++S+DL    +    
Sbjct: 225 YDFDLVFNSNYVAPIKNNIIYDFSRAGGRMKRDTVTT------EFVDSIDLLIATDDWER 278

Query: 72  VELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRMLE 131
             +    +D +     + + K L          I DA   + K   +  +  E ++   +
Sbjct: 279 YVIVDEADDMKTPRKSFYKLKSLTDVDWLYDEDIIDAKLKESKAPLNLND--EKLLLKRQ 336

Query: 132 LKSKNASDHVYRLEEKEQELKKEYSILHERYTELLKTHMEYIERTKL 178
            K K     V+  ++   ELK +++I +++Y  +LK   +   R  L
Sbjct: 337 AKPKKKQQEVFSDDKPLNELKGKFNISNDKYYGILKKTHQSKVRNTL 383


>gnl|CDD|179877 PRK04778, PRK04778, septation ring formation regulator EzrA;
           Provisional.
          Length = 569

 Score = 28.6 bits (65), Expect = 2.4
 Identities = 30/164 (18%), Positives = 78/164 (47%), Gaps = 16/164 (9%)

Query: 19  MSEKVQSLAGSIYQEFEKMITKYD-EDVVKDLMPLVVNVLESLDLAFTENQEHEVELELL 77
             EK + +   I Q ++ +  +      V+        + + L+ A  +N+E + E++ +
Sbjct: 280 AEEKNEEIQERIDQLYDILEREVKARKYVEKNSD---TLPDFLEHAKEQNKELKEEIDRV 336

Query: 78  RE----DNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKK---ELNSKIESLESIVRML 130
           ++    +  +L +  + EKQL ++ +++  +I + + + +    EL  ++E +   +  +
Sbjct: 337 KQSYTLNESELESVRQLEKQL-ESLEKQYDEITERIAEQEIAYSELQEELEEILKQLEEI 395

Query: 131 ELKSKNASDHVYRLEEKEQELK---KEY-SILHERYTELLKTHM 170
           E + +  S+ +  L + E E +   + Y + LHE    L K+++
Sbjct: 396 EKEQEKLSEMLQGLRKDELEAREKLERYRNKLHEIKRYLEKSNL 439


>gnl|CDD|238288 cd00520, RRF, Ribosome recycling factor (RRF). Ribosome recycling
           factor dissociates the posttermination complex, composed
           of the ribosome, deacylated tRNA, and mRNA, after
           termination of translation.  Thus ribosomes are
           "recycled" and ready for another round of protein
           synthesis.  RRF is believed to bind the ribosome at the
           A-site in a manner that mimics tRNA, but the specific
           mechanisms remain unclear.  RRF is essential for
           bacterial growth.  It is not necessary for cell growth
           in archaea or eukaryotes, but is found in mitochondria
           or chloroplasts of some eukaryotic species.
          Length = 179

 Score = 28.0 bits (63), Expect = 2.4
 Identities = 20/67 (29%), Positives = 35/67 (52%), Gaps = 7/67 (10%)

Query: 89  EREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRMLELKSKNASDHVYRLEEKE 148
           ER K+L K A +   + + A+ + +++ N KI+ LE        + + + D V + EE  
Sbjct: 104 ERRKELVKDAKKIAEEAKVAIRNIRRDANDKIKKLEK-------EKEISEDEVKKAEEDL 156

Query: 149 QELKKEY 155
           Q+L  EY
Sbjct: 157 QKLTDEY 163


>gnl|CDD|225747 COG3206, GumC, Uncharacterized protein involved in
           exopolysaccharide biosynthesis [Cell envelope
           biogenesis, outer membrane].
          Length = 458

 Score = 28.6 bits (64), Expect = 2.4
 Identities = 21/94 (22%), Positives = 41/94 (43%), Gaps = 4/94 (4%)

Query: 75  ELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRMLELKS 134
             L   + QLV    +  +LR+    +L QI  +L ++   L  +  +LE  +  L+ + 
Sbjct: 306 TELGAKHPQLVALEAQLAELRQQIAAELRQILASLPNELALLEQQEAALEKELAQLKGRL 365

Query: 135 KNASDHVYRLEEKEQELKKEYSILHERYTELLKT 168
                   +L+ + +EL++E       Y  LL+ 
Sbjct: 366 SKLP----KLQVQLRELEREAEAARSLYETLLQR 395


>gnl|CDD|219342 pfam07227, DUF1423, Protein of unknown function (DUF1423).  This
           family represents a conserved region approximately 500
           residues long within a number of Arabidopsis thaliana
           proteins of unknown function.
          Length = 446

 Score = 28.6 bits (64), Expect = 2.4
 Identities = 32/131 (24%), Positives = 50/131 (38%), Gaps = 23/131 (17%)

Query: 81  NEQLVTQYEREKQLRKAADQKL------FQIEDALEDDKKELNSKIESLESIVR------ 128
            E+ +   ++ +    A D++L                KK    +I+ LESIVR      
Sbjct: 310 AEEKMRMLKKARSALDACDRELEEKAREVSELKMERQRKKP---QIDELESIVRLKQAEA 366

Query: 129 -MLELKSKNASDHVYRLEE----KEQELKKEY--SILHERYTELLKTHMEYIERTKLMM- 180
            M +LK+  A     RL+     K ++ ++EY    L  R +E  +      E  KL   
Sbjct: 367 DMFQLKADEARREAERLQRIALAKTEKSEEEYASKYLKLRLSEAEEERQYLFEELKLQEE 426

Query: 181 GSGSSDRMDSR 191
               SD M  R
Sbjct: 427 SHRDSDSMKMR 437


>gnl|CDD|116408 pfam07794, DUF1633, Protein of unknown function (DUF1633).  This
           family contains sequences derived from a group of
           hypothetical proteins expressed by Arabidopsis thaliana.
           These sequences are highly similar and the region
           concerned is about 100 residues long.
          Length = 698

 Score = 28.7 bits (63), Expect = 2.8
 Identities = 25/85 (29%), Positives = 41/85 (48%), Gaps = 8/85 (9%)

Query: 76  LLREDNEQLVTQYEREKQLRKAADQKLFQIE------DALEDDKKELNSKIESLESIVRM 129
            +RE++  L    +RE  +R  A+  +  IE         +D +K    KI  +E     
Sbjct: 459 AIREEDPHLGADQDRE--VRFGAEGIVPGIERLKIELSTSKDLEKGYAEKIGFMEMEFGG 516

Query: 130 LELKSKNASDHVYRLEEKEQELKKE 154
           LE   + A + ++RLEEK+ EL K+
Sbjct: 517 LEADKQMARNQIHRLEEKKDELSKK 541


>gnl|CDD|217933 pfam04156, IncA, IncA protein.  Chlamydia trachomatis is an
           obligate intracellular bacterium that develops within a
           parasitophorous vacuole termed an inclusion. The
           inclusion is non-fusogenic with lysosomes but intercepts
           lipids from a host cell exocytic pathway. Initiation of
           chlamydial development is concurrent with modification
           of the inclusion membrane by a set of C.
           trachomatis-encoded proteins collectively designated
           Incs. One of these Incs, IncA, is functionally
           associated with the homotypic fusion of inclusions. This
           family probably includes members of the wider Inc family
           rather than just IncA.
          Length = 186

 Score = 27.8 bits (62), Expect = 2.8
 Identities = 21/88 (23%), Positives = 40/88 (45%), Gaps = 4/88 (4%)

Query: 91  EKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRMLELKSKNASDHVYRLEEKEQE 150
            KQ      ++L  +E+ + + + EL    E L+ +  +L+       + +  LEE  +E
Sbjct: 84  LKQQLSELQEELEDLEERIAELESELEDLKEDLQLLRELLKSLE----ERLESLEESIKE 139

Query: 151 LKKEYSILHERYTELLKTHMEYIERTKL 178
           L KE   L +   E ++   E +ER + 
Sbjct: 140 LAKELRELRQDLREEVEELREELERLQE 167


>gnl|CDD|234715 PRK00290, dnaK, molecular chaperone DnaK; Provisional.
          Length = 627

 Score = 28.5 bits (65), Expect = 2.9
 Identities = 16/74 (21%), Positives = 33/74 (44%), Gaps = 18/74 (24%)

Query: 90  REK-QLRKAADQKLFQIEDAL--------EDDKKELNSKIESLESIVRMLELKSKNASDH 140
           +E  + R  AD  ++Q E  L         D+K+++ + I+ L+       LK ++    
Sbjct: 523 KELVEARNQADSLIYQTEKTLKELGDKVPADEKEKIEAAIKELKEA-----LKGEDKEA- 576

Query: 141 VYRLEEKEQELKKE 154
              ++ K +EL + 
Sbjct: 577 ---IKAKTEELTQA 587


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
           and chromosome partitioning].
          Length = 420

 Score = 28.5 bits (64), Expect = 2.9
 Identities = 14/79 (17%), Positives = 32/79 (40%)

Query: 82  EQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRMLELKSKNASDHV 141
           +Q+  +    ++  +    +  ++E  L+  + E+ S    L      L+   K  +D  
Sbjct: 41  KQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKLRKQIADLN 100

Query: 142 YRLEEKEQELKKEYSILHE 160
            RL   E + +++   L E
Sbjct: 101 ARLNALEVQEREQRRRLAE 119


>gnl|CDD|184695 PRK14471, PRK14471, F0F1 ATP synthase subunit B; Provisional.
          Length = 164

 Score = 27.8 bits (62), Expect = 3.2
 Identities = 34/118 (28%), Positives = 59/118 (50%), Gaps = 11/118 (9%)

Query: 47  KDLMPLVVNVLESLDLAFTENQEHEVELELLREDNEQLVTQ--YEREKQLRKAADQKLFQ 104
           K ++  V    +S+  A    +E   E++ L+ DNE+L+ +   ER+  L++A + K   
Sbjct: 31  KPILGAVKEREDSIKNALASAEEARKEMQNLQADNERLLKEARAERDAILKEAREIKEKM 90

Query: 105 IEDALEDDKKELNSKIES------LESIVRMLELKSKNASDHVYRLEEKEQELKKEYS 156
           I DA E+ + E +  IE        E    M E+K++ A+     +E  E+ L+KE S
Sbjct: 91  IADAKEEAQVEGDKMIEQAKASIESEKNAAMAEIKNQVAN---LSVEIAEKVLRKELS 145


>gnl|CDD|149160 pfam07926, TPR_MLP1_2, TPR/MLP1/MLP2-like protein.  The sequences
           featured in this family are similar to a region of human
           TPR protein and to yeast myosin-like proteins 1 (MLP1)
           and 2 (MLP2). These proteins share a number of features;
           for example, they all have coiled-coil regions and all
           three are associated with nuclear pores. TPR is thought
           to be a component of nuclear pore complex- attached
           intra-nuclear filaments, and is implicated in nuclear
           protein import. Moreover, its N-terminal region is
           involved in the activation of oncogenic kinases,
           possibly by mediating the dimerisation of kinase domains
           or by targeting these kinases to the nuclear pore
           complex. MLP1 and MLP2 are involved in the process of
           telomere length regulation, where they are thought to
           interact with proteins such as Tel1p and modulate their
           activity.
          Length = 132

 Score = 27.3 bits (61), Expect = 3.6
 Identities = 28/110 (25%), Positives = 48/110 (43%), Gaps = 14/110 (12%)

Query: 65  TENQEHEVELELLREDNEQLV-------TQYEREKQLRKAADQKLFQIEDALEDDKKELN 117
            + ++ E +++ L+ED E+          +YERE        ++L  +   L + KKE+ 
Sbjct: 17  EKKEDAEEKIQKLQEDLEEQAEIANEAQQKYERELVKHAEDIEELQALRKQLNELKKEIA 76

Query: 118 SKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTELLK 167
                 ES     EL     S      EE+++ L+ E S L +R  EL +
Sbjct: 77  QLKAEAESA--QAELSEAEES-----WEEQKKMLEDELSELEKRIEELNE 119


>gnl|CDD|217135 pfam02601, Exonuc_VII_L, Exonuclease VII, large subunit.  This
           family consist of exonuclease VII, large subunit
           EC:3.1.11.6 This enzyme catalyzes exonucleolytic
           cleavage in either 5'->3' or 3'->5' direction to yield
           5'-phosphomononucleotides. This exonuclease VII enzyme
           is composed of one large subunit and 4 small ones.
          Length = 295

 Score = 28.0 bits (63), Expect = 3.8
 Identities = 18/86 (20%), Positives = 36/86 (41%), Gaps = 11/86 (12%)

Query: 75  ELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIE--------SLESI 126
             L ++ ++L    ER   L+  + + L Q E+ L + +  L S I+         LE +
Sbjct: 165 NRLEKEQDRLNLLRER---LKSLSRRLLEQHEERLAELRDRLISSIQNLLSRKQSRLERL 221

Query: 127 VRMLELKSKNASDHVYRLEEKEQELK 152
           +   EL+  +  ++         +LK
Sbjct: 222 ILNRELEKNSLLENKLATANLTAQLK 247


>gnl|CDD|183759 PRK12804, PRK12804, flagellin; Provisional.
          Length = 301

 Score = 28.0 bits (62), Expect = 3.8
 Identities = 17/46 (36%), Positives = 27/46 (58%), Gaps = 1/46 (2%)

Query: 121 ESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTELL 166
           E +   +R LE+ SKNA D +  ++  E  L + +SIL +R  EL+
Sbjct: 50  EKMRGQIRGLEMASKNAQDGISLIQTAEGALTETHSIL-QRVRELV 94


>gnl|CDD|224202 COG1283, NptA, Na+/phosphate symporter [Inorganic ion transport and
           metabolism].
          Length = 533

 Score = 28.0 bits (63), Expect = 3.8
 Identities = 19/78 (24%), Positives = 34/78 (43%), Gaps = 16/78 (20%)

Query: 40  KYDEDVVKDLMPLVVNVLESLDLAFTENQEHEVELELLREDNE---QLVTQYER----EK 92
            + ED +++L  L    LE+L LA +          L+  D E   +LV + +R    E+
Sbjct: 435 AFSEDGLEELDALFALTLENLRLAIS---------VLVTGDLELARRLVERKKRVRRLER 485

Query: 93  QLRKAADQKLFQIEDALE 110
           +  K    +L     ++E
Sbjct: 486 RSSKRHLDRLRDGAASVE 503


>gnl|CDD|200948 pfam00038, Filament, Intermediate filament protein. 
          Length = 312

 Score = 27.9 bits (63), Expect = 3.8
 Identities = 21/67 (31%), Positives = 32/67 (47%), Gaps = 3/67 (4%)

Query: 60  LDLAFTENQEHEVELELLREDNEQLVTQYEREKQLRKAAD---QKLFQIEDALEDDKKEL 116
           LD    E    ++E++ LR   E    +YE E  LR++A+     L +  D     + +L
Sbjct: 63  LDELTNERARLQLEIDNLRLAAEDFREKYEDELNLRQSAENDIVGLRKDLDEATLARVDL 122

Query: 117 NSKIESL 123
             KIESL
Sbjct: 123 EMKIESL 129


>gnl|CDD|227318 COG4985, COG4985, ABC-type phosphate transport system, auxiliary
           component [Inorganic ion transport and metabolism].
          Length = 289

 Score = 28.0 bits (62), Expect = 3.9
 Identities = 19/89 (21%), Positives = 38/89 (42%), Gaps = 3/89 (3%)

Query: 80  DNEQLVTQYEREKQLRKA---ADQKLFQIEDALEDDKKELNSKIESLESIVRMLELKSKN 136
            N Q +     E++L+K     +    Q++  +E   + +NS++E L    R L+L  + 
Sbjct: 152 SNGQELDGDPLERELQKRLLEVETLRDQVDKMVEQQVRVINSQLERLRLEKRRLQLNGQL 211

Query: 137 ASDHVYRLEEKEQELKKEYSILHERYTEL 165
             +       ++ EL+K  + L      L
Sbjct: 212 DDEFQQHYVAEKSELQKRLAQLQTELDAL 240


>gnl|CDD|235871 PRK06835, PRK06835, DNA replication protein DnaC; Validated.
          Length = 329

 Score = 27.9 bits (63), Expect = 4.0
 Identities = 21/91 (23%), Positives = 46/91 (50%), Gaps = 9/91 (9%)

Query: 92  KQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRM-----LELKSKNASDHVYRLEE 146
            ++ K  +++  + E  L++ K+E+  KI  +  I        ++L S+    +  + EE
Sbjct: 7   SEILKEYEKRRDKEELELKNRKEEVYKKIPEIAEIDDEIAKLGIKL-SRAILKNPDKKEE 65

Query: 147 KEQELKKEYSILHERYTELLKTH---MEYIE 174
             +ELK++ + L  +  ELL ++    +Y+E
Sbjct: 66  TLKELKEKITDLRVKKAELLVSNGYPPDYLE 96


>gnl|CDD|224482 COG1566, EmrA, Multidrug resistance efflux pump [Defense
           mechanisms].
          Length = 352

 Score = 27.7 bits (62), Expect = 4.2
 Identities = 9/51 (17%), Positives = 20/51 (39%)

Query: 78  REDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVR 128
           RE+ ++     +  +    AA     Q    LE +     +++ S E+ + 
Sbjct: 150 REELDRARAALQAAEAALAAAQAAQKQNLALLESEVSGAQAQVASAEAALD 200


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 28.1 bits (63), Expect = 4.3
 Identities = 28/104 (26%), Positives = 53/104 (50%), Gaps = 6/104 (5%)

Query: 73  ELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRMLEL 132
           ELE L+E+ E+L  + E  +  ++  ++K+ ++E+ +E    EL  +IE LE  V+ L+ 
Sbjct: 232 ELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIE----ELKKEIEELEEKVKELKE 287

Query: 133 KSKNASDHVYRLEEKEQELKKEYSI--LHERYTELLKTHMEYIE 174
             + A +++   E  E+ L +   I     R  E +    E I+
Sbjct: 288 LKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIK 331



 Score = 27.0 bits (60), Expect = 9.4
 Identities = 29/107 (27%), Positives = 59/107 (55%), Gaps = 6/107 (5%)

Query: 68  QEHEVELELLREDNEQLVTQYEREKQLRKAADQ--KLFQIEDALEDDKKELNSKIESLES 125
           +E E  +E L+++ E+L  + +  K+L++ A++  KL +  +   D+ +E+  ++  LE 
Sbjct: 262 RELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEE 321

Query: 126 IVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTELLKTHMEY 172
            +  +E + K   +   RLEE +++LK+    L +R  EL + H  Y
Sbjct: 322 EINGIEERIKELEEKEERLEELKKKLKE----LEKRLEELEERHELY 364


>gnl|CDD|216035 pfam00632, HECT, HECT-domain (ubiquitin-transferase).  The name
           HECT comes from Homologous to the E6-AP Carboxyl
           Terminus.
          Length = 298

 Score = 27.7 bits (62), Expect = 5.1
 Identities = 20/73 (27%), Positives = 33/73 (45%), Gaps = 3/73 (4%)

Query: 41  YDEDVVKDLMPL--VVNVLESLDLAFTENQEHEVELELLREDNEQLVTQYEREKQLRKAA 98
            D ++ + L  L  +    E L L FT + E E  +EL+   +   VT   +E+ +    
Sbjct: 83  VDPELYRSLKSLLELDGDEEELCLTFTVSGEGEGTIELIPNGSNIPVTNENKEEYVNLYV 142

Query: 99  DQKLF-QIEDALE 110
           D +L   I+  LE
Sbjct: 143 DYRLNESIKKQLE 155


>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6.  In yeast, 15 Apg proteins
           coordinate the formation of autophagosomes. Autophagy is
           a bulk degradation process induced by starvation in
           eukaryotic cells. Apg6/Vps30p has two distinct functions
           in the autophagic process, either associated with the
           membrane or in a retrieval step of the carboxypeptidase
           Y sorting pathway.
          Length = 356

 Score = 27.5 bits (61), Expect = 5.3
 Identities = 20/82 (24%), Positives = 39/82 (47%), Gaps = 9/82 (10%)

Query: 82  EQLVTQY---EREKQLRKAADQKLFQ----IED--ALEDDKKELNSKIESLESIVRMLEL 132
           E+L ++    E+E+   K    KL      I +  AL+ +  EL  + E L   +  LE 
Sbjct: 32  ERLDSELRDAEKERDTYKQYLSKLESQNVEISNYEALDSELDELKKEEERLLDELEELEK 91

Query: 133 KSKNASDHVYRLEEKEQELKKE 154
           +  +    +  L+E++++L+ E
Sbjct: 92  EDDDLDGELVELQEEKEQLENE 113


>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD.  This
           model describes the DndB protein encoded by an operon
           associated with a sulfur-containing modification to DNA.
           The operon is sporadically distributed in bacteria, much
           like some restriction enzyme operons. DndD is described
           as a putative ATPase. The small number of examples known
           so far include species from among the Firmicutes,
           Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
           metabolism, Restriction/modification].
          Length = 650

 Score = 27.7 bits (62), Expect = 5.3
 Identities = 24/143 (16%), Positives = 57/143 (39%), Gaps = 17/143 (11%)

Query: 26  LAGSIYQEFEKMITKYDEDVVKDLMPL------VVNVLESLDLAFTENQEHEVELELLRE 79
           L+GS   + E +I +   ++      L      +   L  +D   +     E ++  L E
Sbjct: 370 LSGSELTQLEVLIQQVKRELQDAKSQLLKELRELEEELAEVDKKISTIPSEE-QIAQLLE 428

Query: 80  DNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRMLELKSKNASD 139
           +  +   +  R +   +   ++L  +++A+E  +K L+ K +         +  +    +
Sbjct: 429 ELGEAQNELFRSEAEIEELLRQLETLKEAIEALRKTLDEKTK---------QKINAFELE 479

Query: 140 HVYRLEEKEQELKKEY-SILHER 161
               + +K ++  KE+   L ER
Sbjct: 480 RAITIADKAKKTLKEFREKLLER 502


>gnl|CDD|143395 cd06534, ALDH-SF, NAD(P)+-dependent aldehyde dehydrogenase
           superfamily.  The aldehyde dehydrogenase superfamily
           (ALDH-SF) of  NAD(P)+-dependent enzymes, in general,
           oxidize a wide range of  endogenous and exogenous
           aliphatic and aromatic aldehydes to their corresponding
           carboxylic acids and play an  important role in
           detoxification. Besides aldehyde detoxification, many
           ALDH isozymes possess multiple additional catalytic and
           non-catalytic functions such as participating in
           metabolic pathways, or as binding proteins, or
           osmoregulants, to mention a few. The enzyme has three
           domains, a NAD(P)+ cofactor-binding domain, a catalytic
           domain, and a bridging domain; and the active enzyme is
           generally either homodimeric or homotetrameric. The
           catalytic mechanism is proposed to involve cofactor
           binding, resulting in a conformational change and
           activation of an invariant catalytic cysteine
           nucleophile. The cysteine and aldehyde substrate form an
           oxyanion thiohemiacetal intermediate resulting in
           hydride transfer to the cofactor and formation of a
           thioacylenzyme intermediate. Hydrolysis of the
           thioacylenzyme and release of the carboxylic acid
           product occurs, and in most cases, the reduced cofactor
           dissociates from the enzyme. The evolutionary
           phylogenetic tree of ALDHs appears to have an initial
           bifurcation between what has been characterized as the
           classical aldehyde dehydrogenases, the ALDH family
           (ALDH) and extended family members or aldehyde
           dehydrogenase-like (ALDH-L) proteins. The ALDH proteins
           are represented by enzymes which share a number of
           highly conserved residues necessary for catalysis and
           cofactor binding and they include such proteins as
           retinal dehydrogenase, 10-formyltetrahydrofolate
           dehydrogenase, non-phosphorylating glyceraldehyde
           3-phosphate dehydrogenase,
           delta(1)-pyrroline-5-carboxylate dehydrogenases,
           alpha-ketoglutaric semialdehyde dehydrogenase,
           alpha-aminoadipic semialdehyde dehydrogenase, coniferyl
           aldehyde dehydrogenase and succinate-semialdehyde
           dehydrogenase.  Included in this larger group are all
           human, Arabidopsis, Tortula, fungal, protozoan, and
           Drosophila ALDHs identified in families ALDH1 through
           ALDH22 with the exception of families ALDH18, ALDH19,
           and ALDH20 which are present in the ALDH-like group. The
           ALDH-like group is represented by such proteins as
           gamma-glutamyl phosphate reductase, LuxC-like acyl-CoA
           reductase, and coenzyme A acylating aldehyde
           dehydrogenase. All of these proteins have a conserved
           cysteine that aligns with the catalytic cysteine of the
           ALDH group.
          Length = 367

 Score = 27.6 bits (62), Expect = 5.4
 Identities = 14/40 (35%), Positives = 21/40 (52%), Gaps = 9/40 (22%)

Query: 86  TQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLES 125
              ER   LRK A        D LE+ ++EL + +E+LE+
Sbjct: 14  PPAERAAILRKIA--------DLLEERREEL-AALETLET 44


>gnl|CDD|219330 pfam07201, HrpJ, HrpJ-like domain.  This family represents a
           conserved region approximately 200 residues long within
           a number of bacterial hypersensitivity response
           secretion protein HrpJ and similar proteins. HrpJ forms
           part of a type III secretion system through which, in
           phytopathogenic bacterial species, virulence factors are
           thought to be delivered to plant cells. This family also
           includes the InvE invasion protein from Salmonella. This
           protein is involved in host parasite interactions and
           mutations in the InvE gene render Salmonella typhimurium
           non-invasive. InvE S. typhimurium mutants fail to elicit
           a rapid Ca2+ increase in cultured cells, an important
           event in the infection procedure and internalisation of
           S. typhimurium into epithelial cells. This family
           includes bacterial SepL and SsaL proteins. SepL plays an
           essential role in the infection process of
           enterohemorrhagic Escherichia coli and is thought to be
           responsible for the secretion of EspA, EspD, and EspB.
           SsaL of Salmonella typhimurium is thought to be a
           component of the type III secretion system.
          Length = 161

 Score = 26.9 bits (60), Expect = 5.5
 Identities = 15/66 (22%), Positives = 27/66 (40%)

Query: 106 EDALEDDKKELNSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTEL 165
           E+ L + ++ L S   SL  ++  LE    + SD    L    Q+ +     L E   + 
Sbjct: 41  EEKLRELRQLLKSGRSSLPQLLAYLESLFPDPSDQDLALRALLQQARPLKRALKELLEQA 100

Query: 166 LKTHME 171
           L+   +
Sbjct: 101 LEQLEQ 106


>gnl|CDD|183173 PRK11519, PRK11519, tyrosine kinase; Provisional.
          Length = 719

 Score = 27.4 bits (61), Expect = 5.9
 Identities = 20/60 (33%), Positives = 34/60 (56%), Gaps = 7/60 (11%)

Query: 98  ADQKLFQIEDALEDDKKELNSKIESL----ESIVRMLELKSKNASDHVY-RLEEKEQELK 152
           A + L +   ALED+K +LN ++ ++    + IVR+   +   +   VY +L  K+QELK
Sbjct: 340 AYRTLLEKRKALEDEKAKLNGRVTAMPKTQQEIVRL--TRDVESGQQVYMQLLNKQQELK 397


>gnl|CDD|129911 TIGR00831, a_cpa1, Na+/H+ antiporter, bacterial form.  The
           Monovalent Cation:Proton Antiporter-1 (CPA1) Family (TC
           2.A.36) The CPA1 family is a large family of proteins
           derived from Gram-positive and Gram-negative bacteria,
           blue green bacteria, yeast, plants and animals.
           Transporters from eukaryotes have been functionally
           characterized, and all of these catalyze Na+:H+
           exchange. Their primary physiological functions may be
           in (1) cytoplasmic pH regulation, extruding the H+
           generated during metabolism, and (2) salt tolerance (in
           plants), due to Na+ uptake into vacuoles. This model is
           specific for the bacterial members of this family
           [Transport and binding proteins, Cations and iron
           carrying compounds].
          Length = 525

 Score = 27.2 bits (60), Expect = 7.1
 Identities = 21/85 (24%), Positives = 35/85 (41%), Gaps = 9/85 (10%)

Query: 16  HIVMSEKVQSLAGSIYQEFEKMITKY-DEDVVKDLMPLVVNVLESL-DLAFTENQEHEVE 73
              +SE  +       +E E++I +Y      K  +   V  L  +  +A     E +  
Sbjct: 406 RKFVSEHSE-------RELEEIIARYIAARSAKFALMKAVEQLRIVEPVARELLPELDAR 458

Query: 74  LELLREDNEQLVTQYEREKQLRKAA 98
           +E LR D E+ +     EK LR+ A
Sbjct: 459 IEELRADGEEKIRSGMGEKNLRRRA 483


>gnl|CDD|119001 pfam10481, Cenp-F_N, Cenp-F N-terminal domain.  Mitosin or
           centromere-associated protein-F (Cenp-F) is found bound
           across the centromere as one of the proteins of the
           outer layer of the kinetochore. Most of the
           kinetochore/centromere functions appear to depend upon
           binding of the C-terminal par to f the molecule, whereas
           the N-terminal part, here, may be a cytoplasmic player
           in controlling the function of microtubules and dynein.
          Length = 288

 Score = 27.1 bits (59), Expect = 7.4
 Identities = 24/99 (24%), Positives = 49/99 (49%), Gaps = 3/99 (3%)

Query: 60  LDLAFTENQEHEVELELLR---EDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKEL 116
           LD    E Q+ + +L+ L    +  +Q V   + E    K  +Q+L +I + LE  K+++
Sbjct: 27  LDKLKKEKQQRQFQLDSLEAALQKQKQKVENEKTEGANLKRENQRLMEICEHLEKTKQKI 86

Query: 117 NSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEY 155
           + +++  ES V   E +  +    + +LE++ +  K E 
Sbjct: 87  SHELQVKESQVNFQEGQLNSCKKQIEKLEQELKRCKSEL 125


>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
          Length = 562

 Score = 27.3 bits (61), Expect = 7.4
 Identities = 29/111 (26%), Positives = 44/111 (39%), Gaps = 12/111 (10%)

Query: 57  LESLDLAFTENQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKEL 116
           LE LD A  E +E   E     +   +L  +    KQ       K  +++ A+E+ + E 
Sbjct: 315 LEKLDTAIDELEEIMDEFNEQSKKLLELKNKISTNKQSLITLVDKAKKVKAAIEELQAEF 374

Query: 117 NSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTELLK 167
               E L  +   L+   K  S+ V     KE   K    I+    T+LLK
Sbjct: 375 VDNAEELAKLQDELDKIVKTKSELV-----KE---KYHRGIV----TDLLK 413


>gnl|CDD|238352 cd00652, TBP_TLF, TATA box binding protein (TBP): Present in
           archaea and eukaryotes, TBPs are transcription factors
           that recognize promoters and initiate transcription. TBP
           has been shown to be an essential component of three
           different transcription initiation complexes: SL1, TFIID
           and TFIIIB, directing transcription by RNA polymerases
           I, II and III, respectively. TBP binds directly to the
           TATA box promoter element, where it nucleates polymerase
           assembly, thus defining the transcription start site.
           TBP's binding in the minor groove induces a dramatic DNA
           bending while its own structure barely changes. The
           conserved core domain of TBP, which binds to the TATA
           box, has a bipartite structure, with intramolecular
           symmetry generating a saddle-shaped structure that sits
           astride the DNA. New members of the TBP family, called
           TBP-like proteins (TBLP, TLF, TLP) or TBP-related
           factors (TRF1, TRF2,TRP), are similar to the core domain
           of TBPs, with identical or chemically similar amino
           acids at many equivalent positions, suggesting similar
           structure. However, TLFs contain distinct, conserved
           amino acids at several positions that distinguish them
           from TBP.
          Length = 174

 Score = 26.7 bits (60), Expect = 7.5
 Identities = 15/56 (26%), Positives = 23/56 (41%), Gaps = 7/56 (12%)

Query: 36  KMITKYDEDVVKDLMPLVVNVLESLDLAFTENQEHEVELELLREDNEQLVTQYERE 91
           +++ K    V K     V N++ S DL F       + LE L   + +    YE E
Sbjct: 78  RILQKLGFPVEKFPEFKVQNIVASCDLGF------PIRLEELALKHPE-NASYEPE 126


>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
          Length = 1486

 Score = 27.2 bits (61), Expect = 7.9
 Identities = 27/104 (25%), Positives = 40/104 (38%), Gaps = 21/104 (20%)

Query: 71   EVELELLRED---NEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESL--ES 125
            E  + +L+ D    EQL   Y++ +Q ++ A Q+ F    AL     E+  +      E 
Sbjct: 924  EPIVSVLQSDPEQFEQLKQDYQQAQQTQRDAKQQAF----AL----TEVVQRRAHFSYED 975

Query: 126  IVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTELLKTH 169
               ML   S        RLE+ EQE          R  E L+  
Sbjct: 976  AAEMLAKNSDLNEKLRQRLEQAEQER--------TRAREQLRQA 1011


>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
           Provisional.
          Length = 1123

 Score = 27.2 bits (61), Expect = 8.2
 Identities = 14/59 (23%), Positives = 30/59 (50%), Gaps = 3/59 (5%)

Query: 68  QEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIED---ALEDDKKELNSKIESL 123
              + E+  L++  E    +  + + L +A  Q+L  +E     LE+ ++EL +++E L
Sbjct: 145 HALQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELEAQLEQL 203


>gnl|CDD|215507 PLN02939, PLN02939, transferase, transferring glycosyl groups.
          Length = 977

 Score = 27.2 bits (60), Expect = 8.7
 Identities = 34/155 (21%), Positives = 65/155 (41%), Gaps = 13/155 (8%)

Query: 3   LDQETVYGTHDDSHIVMSEKVQSLAGSIYQEFEKMITKYDEDVVKDLMPLVVNVLESLDL 62
           L  E +     +  +   EK +SL  +  +E E       EDV K L PL  +       
Sbjct: 245 LKAELIEVAETEERVFKLEKERSLLDASLRELESKFIVAQEDVSK-LSPLQYDCW----W 299

Query: 63  AFTENQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDK--KELNSKI 120
              EN +    L+      E+     ++ + LR   D    ++E +L++    K  + K+
Sbjct: 300 EKVENLQD--LLDRATNQVEKAALVLDQNQDLRDKVD----KLEASLKEANVSKFSSYKV 353

Query: 121 ESLESIVRMLELKSKNASDHVYRLEEKEQELKKEY 155
           E L+  +++LE + + +   ++   +  QE  KE+
Sbjct: 354 ELLQQKLKLLEERLQASDHEIHSYIQLYQESIKEF 388


>gnl|CDD|220785 pfam10498, IFT57, Intra-flagellar transport protein 57.  Eukaryotic
           cilia and flagella are specialised organelles found at
           the periphery of cells of diverse organisms.
           Intra-flagellar transport (IFT) is required for the
           assembly and maintenance of eukaryotic cilia and
           flagella, and consists of the bidirectional movement of
           large protein particles between the base and the distal
           tip of the organelle. IFT particles contain multiple
           copies of two distinct protein complexes, A and B, which
           contain at least 6 and 11 protein subunits. IFT57 is
           part of complex B but is not, however, required for the
           core subunits to stay associated. This protein is known
           as Huntington-interacting protein-1 in humans.
          Length = 355

 Score = 27.0 bits (60), Expect = 8.9
 Identities = 32/141 (22%), Positives = 64/141 (45%), Gaps = 10/141 (7%)

Query: 56  VLESL-DLAFTENQEHEVELEL---LREDNEQLVTQYEREKQLRKAADQKLFQIEDALED 111
           VLESL D A  + +   V  +L   ++ D +   +  E+  QL+KA  + L + +  L+ 
Sbjct: 182 VLESLIDAAEWKLEVERVLPQLKVTIKTDAKDWRSHLEQMHQLKKAISKSLPETKQYLKK 241

Query: 112 DKKELNSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTEL---LKT 168
             ++L+  +E + S  + L  + +              +++++Y+   +  +EL   L  
Sbjct: 242 LSQDLSKALEKISSREKHLNNQLEQLVQEYREARRTLSQVQEKYNQASQGVSELTRELNE 301

Query: 169 HMEYIERTKLMM---GSGSSD 186
             E +E+ K  M   G+  SD
Sbjct: 302 ISEELEQVKQEMEERGASMSD 322


>gnl|CDD|227612 COG5293, COG5293, Predicted ATPase [General function prediction
           only].
          Length = 591

 Score = 26.8 bits (59), Expect = 9.0
 Identities = 26/104 (25%), Positives = 43/104 (41%), Gaps = 5/104 (4%)

Query: 76  LLREDNEQLVTQYEREKQLRKAADQK--LFQIEDALEDDK--KELNSKIESLESIVRMLE 131
           L+R+  E     +E    + K A  K   F +    ED K   E   KI+ LES  ++ E
Sbjct: 154 LIRDSQENFSKVFELSHSVLKHAKWKNYPFLLHLLGEDGKCAAEYYDKIQELESKKKLAE 213

Query: 132 LKSKNASDHVYRLEEKEQELKKEYSILHERYTELLKTHMEYIER 175
           L  K     +  LEE E    ++   ++++    L T   + + 
Sbjct: 214 LLRKTWIGSLDSLEEIETTELRKQDEVNKK-QATLNTFDFHAQD 256


>gnl|CDD|177669 PLN00035, PLN00035, histone H4; Provisional.
          Length = 103

 Score = 25.6 bits (56), Expect = 9.5
 Identities = 8/26 (30%), Positives = 18/26 (69%)

Query: 23 VQSLAGSIYQEFEKMITKYDEDVVKD 48
          V+ ++G IY+E   ++  + E+V++D
Sbjct: 44 VKRISGLIYEETRGVLKIFLENVIRD 69


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.310    0.129    0.337 

Gapped
Lambda     K      H
   0.267   0.0677    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 10,225,197
Number of extensions: 978732
Number of successful extensions: 2322
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2030
Number of HSP's successfully gapped: 576
Length of query: 205
Length of database: 10,937,602
Length adjustment: 92
Effective length of query: 113
Effective length of database: 6,857,034
Effective search space: 774844842
Effective search space used: 774844842
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 57 (25.8 bits)