RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy7417
         (372 letters)



>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
           protein is found in bacteria and eukaryotes. Proteins in
           this family are typically between 943 to 1234 amino
           acids in length. This family contains a P-loop motif
           suggesting it is a nucleotide binding protein. It may be
           involved in replication.
          Length = 1198

 Score = 45.5 bits (108), Expect = 4e-05
 Identities = 45/187 (24%), Positives = 87/187 (46%), Gaps = 13/187 (6%)

Query: 17  QEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEE 76
             AK    +EQL +   E   Q+  E E   R   KQA+  + R + EQ+  + K E   
Sbjct: 619 AVAKQKQAEEQLVQANAELEEQKRAEAE--ARTALKQARLDLQRLQNEQQSLKDKLELAI 676

Query: 77  RRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQ-----TNKSKGPNFTIQRREPG 131
             +++  E + R L+ + ++L E   ++QA + ALK+      T +           +  
Sbjct: 677 AERKQQAETQLRQLDAQLKQLLE---QQQAFLEALKDDFRELRTERLAKWQVVEGELDNQ 733

Query: 132 LTLSTAQLERNKT--KEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKLET 189
           L   +A +E  +T  K +++E KK   +  +  L+V+  ++ +L+ +  EL  TI ++  
Sbjct: 734 LAQLSAAIEAARTQAKARLKELKK-QYDRELASLDVDPNTVKELKRQIEELETTIERIAV 792

Query: 190 DKYDLEE 196
            + ++ E
Sbjct: 793 RRPEVRE 799



 Score = 33.9 bits (78), Expect = 0.14
 Identities = 41/208 (19%), Positives = 82/208 (39%), Gaps = 25/208 (12%)

Query: 12  EFVKRQEAKSSALDEQLKEYI--VEWRNQRAK----EEEDLKRLKEKQAKRKVMRAEEEQ 65
           +  ++ EA     +E+  E    +    QR        E+L++L+       + +A+EEQ
Sbjct: 428 QLRQQLEAGKLEFNEEEYELELRLGRLKQRLDSATATPEELEQLEIND--EALEKAQEEQ 485

Query: 66  KMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTI 125
           + A     E+ + +     K++ +  E  QR E    + +  +  L+ Q +   G     
Sbjct: 486 EQA-EANVEQLQSELRQLRKRRDEALEALQRAERRLLQLRQALDELELQLSPQAGSLLHF 544

Query: 126 QRREP-------GLTLSTAQLERN----KTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQ 174
            R E        G  +S   LER     +  E  + +    +++ ++ L+V  +      
Sbjct: 545 LRNEAPGWEESIGKVISPELLERTDLDPQLVEGSDSDTLYGVSLDLQRLDVPDY-----A 599

Query: 175 AKATELWETIVKLETDKYDLEERQKRQD 202
           A  TEL E + + E        +QK+ +
Sbjct: 600 ANETELRERLQQAEEALQSAVAKQKQAE 627



 Score = 33.9 bits (78), Expect = 0.17
 Identities = 27/124 (21%), Positives = 50/124 (40%), Gaps = 5/124 (4%)

Query: 34  EWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEK 93
             R++  + E  L  L  K   + V R  E  K   +++ E +  +        R  EEK
Sbjct: 356 SIRSELEEVEARLDALTGKH--QDVQRKYERLKQKIKEQLERDLEKNNERLAAIR--EEK 411

Query: 94  RQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIEEEKK 153
            ++    E+  QA+ + L++Q    K   F  +  E  L L   +   +      EE ++
Sbjct: 412 DRQKAAIEEDLQALESQLRQQLEAGK-LEFNEEEYELELRLGRLKQRLDSATATPEELEQ 470

Query: 154 IALN 157
           + +N
Sbjct: 471 LEIN 474


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 44.4 bits (106), Expect = 7e-05
 Identities = 33/142 (23%), Positives = 60/142 (42%), Gaps = 23/142 (16%)

Query: 28  LKEYIVEWRNQRAK-----EEEDLKRLKEK-QAKRKVMRAEEEQKMAQRKKEEEERRQRE 81
           L E I+E     AK     ++E L  L    +   +    E++ + A+   +E E+ + E
Sbjct: 499 LPENIIE----EAKKLIGEDKEKLNELIASLEELER--ELEQKAEEAEALLKEAEKLKEE 552

Query: 82  VEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLER 141
           +EEKK++  EE+ + LEEAEK+ Q  +   K++ ++       +Q+              
Sbjct: 553 LEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYA---------- 602

Query: 142 NKTKEQIEEEKKIALNIRIKPL 163
              K     E +  LN   +  
Sbjct: 603 -SVKAHELIEARKRLNKANEKK 623



 Score = 39.0 bits (92), Expect = 0.003
 Identities = 27/117 (23%), Positives = 49/117 (41%), Gaps = 13/117 (11%)

Query: 15  KRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRK---VMRAEEE--QKMAQ 69
           +  E K+   +  LKE          K +E+L+  KEK  + +   +  AE+E  Q + +
Sbjct: 530 RELEQKAEEAEALLKEA--------EKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKE 581

Query: 70  RKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQ 126
            KKE +E  +   + +K      K   L EA K+        +++  K K     ++
Sbjct: 582 AKKEADEIIKELRQLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELK 638


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 42.0 bits (99), Expect = 5e-04
 Identities = 37/297 (12%), Positives = 95/297 (31%), Gaps = 26/297 (8%)

Query: 64   EQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNF 123
              ++ +++KE E+ +    ++    DL++  + LEE E+  +        +  + K    
Sbjct: 1108 NAELEKKEKELEKLKNTTPKDMWLEDLDKFEEALEEQEEVEE----KEIAKEQRLKSKTK 1163

Query: 124  TIQR--REPGLTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELW 181
                  R+P L     + +++   +  +                +       +   +   
Sbjct: 1164 GKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGS 1223

Query: 182  ------ETIVKLETDKYDLEERQKRQDYDVSKIIINKKIMPSPGQGRPGFNPRYVQTPNL 235
                  E   K +       + +K      S+            +G+P   P+ V     
Sbjct: 1224 DQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQ- 1282

Query: 236  FNNATALTGKYPPKIQVASKYERRVDTRSYDDKKKLFEGGYLSLTKESLEKQYAEKVDQF 295
                       PP        E    ++     KK  +           +K+ +EK    
Sbjct: 1283 ---------YSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTAR 1333

Query: 296  GNRQKTKLPKWFGERPGKKKDAPESPEEEEVKRDDEEEENPIYLKISDDEEEKEEEN 352
              + KT++ +    +  +    P   + +       E+++   +  S+DE+++++E+
Sbjct: 1334 KKKSKTRVKQASASQSSRLLRRPRKKKSD----SSSEDDDDSEVDDSEDEDDEDDED 1386


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 39.5 bits (93), Expect = 0.002
 Identities = 25/72 (34%), Positives = 39/72 (54%), Gaps = 14/72 (19%)

Query: 36  RNQRAKEEEDLKRLKEKQAKRKVMRAEEEQ-KMAQRKKEEEERRQREVEEKKQRDLEEKR 94
              R +EEE + +  E++      R EE Q K  ++KKEE E +  ++  ++QR LEEK 
Sbjct: 262 DKTREEEEEKILKAAEEE------RQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEK- 314

Query: 95  QRLEEAEKKRQA 106
                 E+K+QA
Sbjct: 315 ------ERKKQA 320



 Score = 33.8 bits (78), Expect = 0.14
 Identities = 11/43 (25%), Positives = 32/43 (74%)

Query: 36  RNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERR 78
           R + A+E+++ K+ +E++AK   +  EE++K+ +++++++ R+
Sbjct: 280 RQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQARK 322


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 40.1 bits (95), Expect = 0.002
 Identities = 28/104 (26%), Positives = 55/104 (52%), Gaps = 11/104 (10%)

Query: 18  EAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVM--RAEEEQKMAQRKKE-- 73
           EAK    +E+ K  + E +    KE E +K+    +AK ++   R E E+++ +R+ E  
Sbjct: 30  EAKIKEAEEEAKRILEEAK----KEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQ 85

Query: 74  --EEERRQREVE-EKKQRDLEEKRQRLEEAEKKRQAMMAALKEQ 114
             E+   Q+E   ++K   LE++ + LE+ EK+ +     L+++
Sbjct: 86  KLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKK 129


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 39.8 bits (93), Expect = 0.002
 Identities = 20/82 (24%), Positives = 39/82 (47%), Gaps = 6/82 (7%)

Query: 38  QRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRL 97
           QRA E+   K L+++ A      AE+  K A++  ++ E +Q++ EE K +   E + + 
Sbjct: 83  QRAAEQARQKELEQRAA------AEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKA 136

Query: 98  EEAEKKRQAMMAALKEQTNKSK 119
           E   +K+    A  + +     
Sbjct: 137 EAEAEKKAKEEAKKQAEEEAKA 158



 Score = 35.2 bits (81), Expect = 0.045
 Identities = 20/81 (24%), Positives = 34/81 (41%)

Query: 37  NQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQR 96
            +   E E  K+ KE+  K+    A+ +     +KK  E +++ E E K + + + K + 
Sbjct: 132 AKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKA 191

Query: 97  LEEAEKKRQAMMAALKEQTNK 117
            E   K   A   A  E   K
Sbjct: 192 EEAKAKAEAAKAKAAAEAAAK 212



 Score = 31.0 bits (70), Expect = 0.89
 Identities = 20/83 (24%), Positives = 42/83 (50%)

Query: 37  NQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQR 96
               KE+E  K+L+++  + +  RA E+ +  + ++     +  +  E+  +  EEK+++
Sbjct: 61  PAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQ 120

Query: 97  LEEAEKKRQAMMAALKEQTNKSK 119
            EEA+ K+ A   A  E   + K
Sbjct: 121 AEEAKAKQAAEAKAKAEAEAEKK 143



 Score = 30.2 bits (68), Expect = 1.6
 Identities = 18/80 (22%), Positives = 34/80 (42%)

Query: 40  AKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEE 99
            K+ E+  + K     +K     +++  A+ K + E + + + EE K +    K +   E
Sbjct: 149 KKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAKAKAAAE 208

Query: 100 AEKKRQAMMAALKEQTNKSK 119
           A  K +A  AA      + K
Sbjct: 209 AAAKAEAEAAAAAAAEAERK 228



 Score = 30.2 bits (68), Expect = 1.7
 Identities = 26/90 (28%), Positives = 46/90 (51%), Gaps = 9/90 (10%)

Query: 38  QRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQ--RKKEEEERRQREVEEKKQRDLEEKRQ 95
           ++ K+ E+ K  +  +AK K   AE E+K  +  +K+ EEE + +   E K++  E K++
Sbjct: 116 EKQKQAEEAKAKQAAEAKAK-AEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKK 174

Query: 96  RL------EEAEKKRQAMMAALKEQTNKSK 119
                    EA+ K +A  A  K +  K+K
Sbjct: 175 AEAEAKAKAEAKAKAKAEEAKAKAEAAKAK 204



 Score = 29.8 bits (67), Expect = 2.0
 Identities = 17/65 (26%), Positives = 38/65 (58%)

Query: 50  KEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMA 109
           + +Q K+   + E+E++    ++ EE  +QR  E+ +Q++LE++    + A++  QA   
Sbjct: 54  RIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQ 113

Query: 110 ALKEQ 114
           A ++Q
Sbjct: 114 AEEKQ 118



 Score = 29.8 bits (67), Expect = 2.4
 Identities = 31/114 (27%), Positives = 55/114 (48%), Gaps = 6/114 (5%)

Query: 11  PEFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQK---- 66
            E  K++ A+ +   E  +    E   ++A++       K+KQA+    +   E K    
Sbjct: 78  EEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAE 137

Query: 67  -MAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSK 119
             A++K +EE ++Q E EE K +   E +++  EA+KK +A   A  E   K+K
Sbjct: 138 AEAEKKAKEEAKKQAE-EEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAK 190



 Score = 28.6 bits (64), Expect = 6.2
 Identities = 16/80 (20%), Positives = 45/80 (56%), Gaps = 7/80 (8%)

Query: 37  NQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKR-- 94
           N+  ++++   + ++++ K    + E++ + A++++  E+ RQ+E+E++   +   K+  
Sbjct: 53  NRIQQQKKPAAKKEQERQK----KLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAE 108

Query: 95  -QRLEEAEKKRQAMMAALKE 113
               +  EK++QA  A  K+
Sbjct: 109 QAAKQAEEKQKQAEEAKAKQ 128


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 39.5 bits (93), Expect = 0.002
 Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 7/84 (8%)

Query: 41  KEEEDLKRLKEKQAKRKVM--RAEEEQKMAQRKKEE--EERRQREVEE---KKQRDLEEK 93
           KE E LK+    +AK +V   RAE E+++ +R+ E    ERR  + EE   +K   L++K
Sbjct: 43  KEAETLKKEALLEAKEEVHKLRAELERELKERRNELQRLERRLLQREETLDRKMESLDKK 102

Query: 94  RQRLEEAEKKRQAMMAALKEQTNK 117
            + LE+ EK+       L E+  +
Sbjct: 103 EENLEKKEKELSNKEKNLDEKEEE 126


>gnl|CDD|219342 pfam07227, DUF1423, Protein of unknown function (DUF1423).  This
           family represents a conserved region approximately 500
           residues long within a number of Arabidopsis thaliana
           proteins of unknown function.
          Length = 446

 Score = 39.3 bits (92), Expect = 0.002
 Identities = 34/120 (28%), Positives = 51/120 (42%), Gaps = 21/120 (17%)

Query: 13  FVKRQEAKSSALDEQLKEYIVE-----WRNQRAKEE----EDLKRLKEKQAKRKVMRAEE 63
            +K+  +   A D +L+E   E        QR K +    E + RLK+ +       A+ 
Sbjct: 316 MLKKARSALDACDRELEEKAREVSELKMERQRKKPQIDELESIVRLKQAE-------ADM 368

Query: 64  EQKMAQRKKEEEERRQREVEEKKQRDLEE-----KRQRLEEAEKKRQAMMAALKEQTNKS 118
            Q  A   + E ER QR    K ++  EE      + RL EAE++RQ +   LK Q    
Sbjct: 369 FQLKADEARREAERLQRIALAKTEKSEEEYASKYLKLRLSEAEEERQYLFEELKLQEESH 428


>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
          Length = 508

 Score = 38.9 bits (91), Expect = 0.003
 Identities = 39/169 (23%), Positives = 72/169 (42%), Gaps = 16/169 (9%)

Query: 36  RNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKR- 94
           R + AKE E + +  +K+A+ K+  A  E K    + E  ++RQ    E+++   EE+R 
Sbjct: 28  RQRLAKEAERILQEAQKEAEEKLEAALLEAKELLLR-ERNQQRQEARREREELQREEERL 86

Query: 95  QRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIEEEKKI 154
            + EE    R   +  L+ Q  + +              LS  +LE  + ++Q++ E   
Sbjct: 87  VQKEEQLDARAEKLDNLENQLEEREK------------ALSARELELEELEKQLDNELYR 134

Query: 155 ALNIRIKPLNVEGWSIHKLQAKATELWETIVKLETDKYDLEERQKRQDY 203
              +  +        +  L A+  E     VK   ++ DLE  +K Q+ 
Sbjct: 135 VAGLTPEQARKL--LLKLLDAELEEEKAQRVKKIEEEADLEAERKAQNI 181



 Score = 35.1 bits (81), Expect = 0.052
 Identities = 35/152 (23%), Positives = 61/152 (40%), Gaps = 13/152 (8%)

Query: 27  QLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERR-------Q 79
           +L +       +  KE E+       +AK  ++R   +Q+   R++ EE +R       +
Sbjct: 30  RLAKEAERILQEAQKEAEEKLEAALLEAKELLLRERNQQRQEARREREELQREEERLVQK 89

Query: 80  REVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKG-----PNFT-IQRREPGLT 133
            E  + +   L+    +LEE EK   A    L+E   +           T  Q R+  L 
Sbjct: 90  EEQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDNELYRVAGLTPEQARKLLLK 149

Query: 134 LSTAQLERNKTKEQIEEEKKIALNIRIKPLNV 165
           L  A+LE  K +   + E++  L    K  N+
Sbjct: 150 LLDAELEEEKAQRVKKIEEEADLEAERKAQNI 181


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 39.3 bits (92), Expect = 0.003
 Identities = 41/191 (21%), Positives = 81/191 (42%), Gaps = 28/191 (14%)

Query: 12  EFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRK 71
           + ++ + A      E+L+  + E  ++  +  E+L  L+EK  + K      E+  +   
Sbjct: 305 QILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELK------EELESLEA 358

Query: 72  KEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPG 131
           + EE   + E  E +  +LEE+ + L     + +  +A+L  +          I+R E  
Sbjct: 359 ELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNE----------IERLEA- 407

Query: 132 LTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKLETDK 191
             L   +  R + +++IEE  K      +K          +LQA+  EL E + +L+ + 
Sbjct: 408 -RLERLEDRRERLQQEIEELLKKLEEAELK----------ELQAELEELEEELEELQEEL 456

Query: 192 YDLEERQKRQD 202
             LEE  +   
Sbjct: 457 ERLEEALEELR 467



 Score = 38.1 bits (89), Expect = 0.008
 Identities = 43/201 (21%), Positives = 86/201 (42%), Gaps = 18/201 (8%)

Query: 14  VKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQ-------- 65
           ++    +   L E+LKE   E     A+ +E  ++L+E + +   +  E E+        
Sbjct: 234 LEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYAL 293

Query: 66  KMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTI 125
                + E++++  RE     +R LEE   +LEE E K   +   L E   K       +
Sbjct: 294 ANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEK-------L 346

Query: 126 QRREPGLTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIV 185
           +  +  L    A+LE  + + +  E +   L  +++ L  +   + +L+ +   L   I 
Sbjct: 347 EELKEELESLEAELEELEAELEELESRLEELEEQLETLRSK---VAQLELQIASLNNEIE 403

Query: 186 KLETDKYDLEERQKRQDYDVS 206
           +LE     LE+R++R   ++ 
Sbjct: 404 RLEARLERLEDRRERLQQEIE 424



 Score = 32.3 bits (74), Expect = 0.52
 Identities = 22/95 (23%), Positives = 42/95 (44%), Gaps = 6/95 (6%)

Query: 26  EQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEK 85
           E+L E I     +  + EE ++ L+ +       RA  E+ +A  + E EE        +
Sbjct: 848 EELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEEL------SE 901

Query: 86  KQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKG 120
           + R+LE KR  L    ++ +  +A L+ +    + 
Sbjct: 902 ELRELESKRSELRRELEELREKLAQLELRLEGLEV 936



 Score = 31.2 bits (71), Expect = 1.0
 Identities = 41/181 (22%), Positives = 82/181 (45%), Gaps = 8/181 (4%)

Query: 26  EQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEK 85
           EQL++ + E   Q +   +DL RL+ +  + +   A+  +++ + + E EE  +R  E +
Sbjct: 715 EQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAE 774

Query: 86  KQRD-LEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNK- 143
           ++    E + + LE   ++ +  + AL+E  ++ +     +      L      LER   
Sbjct: 775 EELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIA 834

Query: 144 -TKEQIE--EEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKLETDKYDLEERQKR 200
            T+ ++E  EE+   L+  I+ L  E   I +L+    EL   +  L  ++  LEE    
Sbjct: 835 ATERRLEDLEEQIEELSEDIESLAAE---IEELEELIEELESELEALLNERASLEEALAL 891

Query: 201 Q 201
            
Sbjct: 892 L 892


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 38.9 bits (91), Expect = 0.005
 Identities = 41/214 (19%), Positives = 90/214 (42%), Gaps = 19/214 (8%)

Query: 12  EFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRK 71
           E ++ +  +     E+LKE I   + +  + E  L+ L++  A+ +  + E E+K++   
Sbjct: 312 EELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALL 371

Query: 72  KEEEERRQREVEE---------KKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPN 122
           +E EE  +   EE         + + +LEE ++ +E  E++ + +   L++   + K   
Sbjct: 372 EELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELE 431

Query: 123 FTIQRREPGLTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWE 182
              +  E    L     E  + +EQ+EE +     +  +        + +LQ +   L +
Sbjct: 432 --AELEELQTELEELNEELEELEEQLEELRDRLKELERE--------LAELQEELQRLEK 481

Query: 183 TIVKLETDKYDLEERQKRQDYDVSKIIINKKIMP 216
            +  LE     LE  Q+      + +   +  +P
Sbjct: 482 ELSSLEARLDRLEAEQRASQGVRAVLEALESGLP 515



 Score = 38.9 bits (91), Expect = 0.005
 Identities = 43/214 (20%), Positives = 96/214 (44%), Gaps = 36/214 (16%)

Query: 14  VKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKE 73
           ++ +  +     E+L+E + E + +  + EE+L+ L+E  AK K    E E+K    ++E
Sbjct: 735 LQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEE 794

Query: 74  EEERRQ-------------------REVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQ 114
            EE  +                    +  E+ ++++EE  + +EE E+K   +   L+E 
Sbjct: 795 LEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEEL 854

Query: 115 TNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQ 174
             +           E    L   + E+ + +++++E ++       + L  E   + +L+
Sbjct: 855 EKE---------LEELKEELEELEAEKEELEDELKELEE-----EKEELEEE---LRELE 897

Query: 175 AKATELWETIVKLETDKYDLEERQKRQDYDVSKI 208
           ++  EL E I KL     +LE + +R + ++ ++
Sbjct: 898 SELAELKEEIEKLRERLEELEAKLERLEVELPEL 931



 Score = 35.5 bits (82), Expect = 0.056
 Identities = 41/187 (21%), Positives = 87/187 (46%), Gaps = 6/187 (3%)

Query: 14  VKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKE 73
            + +  +    + + +   +E + +  K E      + +Q + ++   EEE +  + + E
Sbjct: 695 NELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELE 754

Query: 74  EEERRQREVEE---KKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREP 130
           E + R  E+EE     +  L + ++ +EE E+KRQA+   L+E   + +     +   E 
Sbjct: 755 ELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALER 814

Query: 131 GLTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKLETD 190
            L     + ER + + +  EE+   L  ++  L  E   + +L+ +  EL E + +LE +
Sbjct: 815 ELESLEQRRERLEQEIEELEEEIEELEEKLDELEEE---LEELEKELEELKEELEELEAE 871

Query: 191 KYDLEER 197
           K +LE+ 
Sbjct: 872 KEELEDE 878



 Score = 35.1 bits (81), Expect = 0.068
 Identities = 37/176 (21%), Positives = 78/176 (44%), Gaps = 5/176 (2%)

Query: 36  RNQRAKEEEDLKRLKEKQAKRKVMRAEE-EQKMAQRKK-EEEERRQREVEEKKQRDLEEK 93
             Q  K E   +   E +     +   + ++   + ++ EEE  R  E  E+ Q +LEE 
Sbjct: 206 ERQAEKAERYQELKAELRELELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEA 265

Query: 94  RQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIEEEKK 153
            + +EE + + + +   L+E   +       I+  E  ++L   +LE  + + +  EE+ 
Sbjct: 266 EKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERL 325

Query: 154 IALNIRIKPLNVEGWSIHKLQAKATELWETIVKLETDKYDLEERQKRQDYDVSKII 209
             L  +I+ L  E   + + +    EL + + +LE  K +LEE+      ++ ++ 
Sbjct: 326 EELKEKIEALKEE---LEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELF 378



 Score = 33.9 bits (78), Expect = 0.15
 Identities = 38/191 (19%), Positives = 87/191 (45%), Gaps = 15/191 (7%)

Query: 12  EFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRK 71
           E +KR+ A      EQL+  + E   +  + EE+L+ L+E+  +      EEE +  +  
Sbjct: 719 EELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEE-----LEEELESLEEA 773

Query: 72  KEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPG 131
             + +    E+EEK+Q  L+E+ + LEE  ++ +  + AL+ +    +      +R    
Sbjct: 774 LAKLKEEIEELEEKRQA-LQEELEELEEELEEAERRLDALERELESLE-----QRRERLE 827

Query: 132 LTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKLETDK 191
             +   + E  + +E+++E ++    +  +   ++     +L+    E  E   +L+  +
Sbjct: 828 QEIEELEEEIEELEEKLDELEEELEELEKELEELKE----ELEELEAEKEELEDELKELE 883

Query: 192 YDLEERQKRQD 202
            + EE ++   
Sbjct: 884 EEKEELEEELR 894



 Score = 33.5 bits (77), Expect = 0.19
 Identities = 35/190 (18%), Positives = 89/190 (46%), Gaps = 5/190 (2%)

Query: 26  EQLKEYIVEWRNQRAKEEEDLKRLKEK--QAKRKVMRAEEEQKMAQRKKEEEERRQREVE 83
            +L+E + E + +  + E++++ LK +  + + ++   +EE    + + EE E  +  + 
Sbjct: 249 SRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEG-EISLL 307

Query: 84  EKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNK 143
            ++  +LE + + LEE  ++ +  + ALKE+  + +     +++    L  +  +LE   
Sbjct: 308 RERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKL 367

Query: 144 TKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKLETDKYDLEERQKRQDY 203
               +EE +++   +R + L      + +++ +  EL   I  LE     L ER +    
Sbjct: 368 -SALLEELEELFEALREE-LAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKE 425

Query: 204 DVSKIIINKK 213
           ++ ++    +
Sbjct: 426 ELKELEAELE 435



 Score = 33.1 bits (76), Expect = 0.25
 Identities = 45/196 (22%), Positives = 89/196 (45%), Gaps = 15/196 (7%)

Query: 17  QEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEE 76
           Q+ +   L+E+L E       Q  K EE+LK LK +    + +  E  +++ + +++ EE
Sbjct: 665 QKRELKELEEELAEL----EAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEE 720

Query: 77  RR------QREVEEKKQR--DLEEKRQRLEEAEKKRQAMMAALKEQTNK--SKGPNFTIQ 126
            +      + E+E+ + R  +LEE+ + LEE  ++ Q  +  L+E+             +
Sbjct: 721 LKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEE 780

Query: 127 RREPGLTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVK 186
             E        Q E  + +E++EE ++    +  + L        +L+ +  EL E I +
Sbjct: 781 IEELEEKRQALQEELEELEEELEEAERRLDALE-RELESLEQRRERLEQEIEELEEEIEE 839

Query: 187 LETDKYDLEERQKRQD 202
           LE    +LEE  +  +
Sbjct: 840 LEEKLDELEEELEELE 855



 Score = 30.5 bits (69), Expect = 1.6
 Identities = 24/101 (23%), Positives = 50/101 (49%), Gaps = 1/101 (0%)

Query: 14   VKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKE 73
            +++   +   L+ +L+   VE      + EE+ +   E + +R++ R EEE +       
Sbjct: 907  IEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETELEREIERLEEEIEALGPVNL 966

Query: 74   EEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQ 114
                   EVEE+   +L+ +R+ LEEA++K   ++  L ++
Sbjct: 967  RAIEEYEEVEERY-EELKSQREDLEEAKEKLLEVIEELDKE 1006



 Score = 29.3 bits (66), Expect = 3.7
 Identities = 37/171 (21%), Positives = 75/171 (43%), Gaps = 17/171 (9%)

Query: 39  RAKEEEDLKRLKE----KQAKRKVMRAEEEQKMAQRKKEEEERRQREVEE-----KKQRD 89
            AK EE  K ++E     + K +   AE + +  +   E  E    E+E+     ++Q +
Sbjct: 151 NAKPEERRKLIEEAAGVSKYKERKEEAERKLERTEENLERLEDLLEELEKQLEKLERQAE 210

Query: 90  LEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIE 149
             E+ Q L+   ++ +  +   K +  + +     ++  E  L+    +LE  + + +  
Sbjct: 211 KAERYQELKAELRELELALLLAKLKELRKE-----LEELEEELSRLEEELEELQEELEEA 265

Query: 150 EEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKLETDKYDLEERQKR 200
           E++   L   ++ L  E   + +LQ +  EL E I +LE +   L ER + 
Sbjct: 266 EKEIEELKSELEELREE---LEELQEELLELKEEIEELEGEISLLRERLEE 313



 Score = 28.1 bits (63), Expect = 8.5
 Identities = 28/132 (21%), Positives = 65/132 (49%), Gaps = 5/132 (3%)

Query: 26  EQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEK 85
           E+ +  +     +    E+  +RL+++  + +    E E+K+ + ++E EE  +   E K
Sbjct: 803 EEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELK 862

Query: 86  KQRD-LEEKRQRLE----EAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLE 140
           ++ + LE +++ LE    E E++++ +   L+E  ++       I++    L    A+LE
Sbjct: 863 EELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLE 922

Query: 141 RNKTKEQIEEEK 152
           R + +    EE+
Sbjct: 923 RLEVELPELEEE 934


>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3.  This family
           consists of several radial spoke protein 3 (RSP3)
           sequences. Eukaryotic cilia and flagella present in
           diverse types of cells perform motile, sensory, and
           developmental functions in organisms from protists to
           humans. They are centred by precisely organised,
           microtubule-based structures, the axonemes. The axoneme
           consists of two central singlet microtubules, called the
           central pair, and nine outer doublet microtubules. These
           structures are well-conserved during evolution. The
           outer doublet microtubules, each composed of A and B
           sub-fibres, are connected to each other by nexin links,
           while the central pair is held at the centre of the
           axoneme by radial spokes. The radial spokes are T-shaped
           structures extending from the A-tubule of each outer
           doublet microtubule to the centre of the axoneme. Radial
           spoke protein 3 (RSP3), is present at the proximal end
           of the spoke stalk and helps in anchoring the radial
           spoke to the outer doublet. It is thought that radial
           spokes regulate the activity of inner arm dynein through
           protein phosphorylation and dephosphorylation.
          Length = 288

 Score = 37.0 bits (86), Expect = 0.010
 Identities = 22/65 (33%), Positives = 41/65 (63%), Gaps = 2/65 (3%)

Query: 42  EEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAE 101
           EEE+L  L+++Q + +  R  E      ++ EE ERR+RE +E++++  +E++QR +E  
Sbjct: 150 EEEELAELRQQQRQFEQRRNAE--LAETQRLEEAERRRREEKERRKKQDKERKQREKETA 207

Query: 102 KKRQA 106
           +K  A
Sbjct: 208 EKIAA 212



 Score = 30.0 bits (68), Expect = 1.8
 Identities = 18/61 (29%), Positives = 37/61 (60%)

Query: 53  QAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALK 112
           QA  +V+  EE  ++ Q++++ E+RR  E+ E ++ +  E+R+R E+  +K+Q      +
Sbjct: 143 QALLEVLEEEELAELRQQQRQFEQRRNAELAETQRLEEAERRRREEKERRKKQDKERKQR 202

Query: 113 E 113
           E
Sbjct: 203 E 203


>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
           autophagy-related subunit 14.  The Atg14 or Apg14
           proteins are hydrophilic proteins with a predicted
           molecular mass of 40.5 kDa, and have a coiled-coil motif
           at the N terminus region. Yeast cells with mutant Atg14
           are defective not only in autophagy but also in sorting
           of carboxypeptidase Y (CPY), a vacuolar-soluble
           hydrolase, to the vacuole. Subcellular fractionation
           indicate that Apg14p and Apg6p are peripherally
           associated with a membrane structure(s). Apg14p was
           co-immunoprecipitated with Apg6p, suggesting that they
           form a stable protein complex. These results imply that
           Apg6/Vps30p has two distinct functions: in the
           autophagic process and in the vacuolar protein sorting
           pathway. Apg14p may be a component specifically required
           for the function of Apg6/Vps30p through the autophagic
           pathway. There are 17 auto-phagosomal component proteins
           which are categorized into six functional units, one of
           which is the AS-PI3K complex (Vps30/Atg6 and Atg14). The
           AS-PI3K complex and the Atg2-Atg18 complex are essential
           for nucleation, and the specific function of the AS-PI3K
           apparently is to produce phosphatidylinositol
           3-phosphate (PtdIns(3)P) at the pre-autophagosomal
           structure (PAS). The localisation of this complex at the
           PAS is controlled by Atg14. Autophagy mediates the
           cellular response to nutrient deprivation, protein
           aggregation, and pathogen invasion in humans, and
           malfunction of autophagy has been implicated in multiple
           human diseases including cancer. This effect seems to be
           mediated through direct interaction of the human Atg14
           with Beclin 1 in the human phosphatidylinositol 3-kinase
           class III complex.
          Length = 307

 Score = 37.0 bits (86), Expect = 0.011
 Identities = 20/106 (18%), Positives = 49/106 (46%), Gaps = 5/106 (4%)

Query: 26  EQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEK 85
            + KE + + R + ++ +E++++ +E+  + K   A+    ++    + E+RR  ++ EK
Sbjct: 67  ARKKERLNQIRARISQLKEEIEQKRERIEELKRALAQRRSDLSSASYQLEKRRASQL-EK 125

Query: 86  KQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPG 131
            Q +++  R +L           + L  +  K     F ++R   G
Sbjct: 126 LQDEIKRTRSKLNALHSLLAEKRSFLCRELAKL----FPLRRVIRG 167


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 37.0 bits (85), Expect = 0.015
 Identities = 36/156 (23%), Positives = 78/156 (50%), Gaps = 4/156 (2%)

Query: 2    KTDVAGGGDPEFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRA 61
            K D A     E  K  EAK  A + + K    + + + AK+  +  + + + A  +   A
Sbjct: 1303 KADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAA 1362

Query: 62   EEEQKMAQRKKEEEERRQREVEEKKQ--RDLEEKRQRLEEAEKKRQAM--MAALKEQTNK 117
            EE+ + A++KKEE +++    ++K +  +  +E +++ EE +KK   +   AA K++ ++
Sbjct: 1363 EEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADE 1422

Query: 118  SKGPNFTIQRREPGLTLSTAQLERNKTKEQIEEEKK 153
            +K      ++ +     +    + ++ K++ EE KK
Sbjct: 1423 AKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKK 1458



 Score = 37.0 bits (85), Expect = 0.019
 Identities = 40/153 (26%), Positives = 75/153 (49%), Gaps = 1/153 (0%)

Query: 2    KTDVAGGGDPEFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKE-KQAKRKVMR 60
            K D A     E  K  EAK  A + +  +   +   +  K EE  K+ +E K+A     +
Sbjct: 1419 KADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKK 1478

Query: 61   AEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKG 120
            AEE +K  + KK+ EE +++  E KK  + ++K    ++AE+ ++A  A   E+  K+  
Sbjct: 1479 AEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADE 1538

Query: 121  PNFTIQRREPGLTLSTAQLERNKTKEQIEEEKK 153
                 ++++        +L++ + K++ EE KK
Sbjct: 1539 AKKAEEKKKADELKKAEELKKAEEKKKAEEAKK 1571



 Score = 36.3 bits (83), Expect = 0.030
 Identities = 42/162 (25%), Positives = 82/162 (50%), Gaps = 12/162 (7%)

Query: 16   RQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEE 75
            + +A+     E+ K+ + + + + A+E++  + LK+ + + K+  AEE +K  + KK+ E
Sbjct: 1619 KIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAE 1678

Query: 76   ERRQREVEEKK-----QRDLEEKRQ----RLEEAEKKRQAMMAALKEQTNKSKGPNFTIQ 126
            E ++ E +EKK     +++ EE ++    + +EAE+K++A      E+ NK K      +
Sbjct: 1679 EAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKE 1738

Query: 127  RREPGLTLSTAQL---ERNKTKEQIEEEKKIALNIRIKPLNV 165
              E       A+    E+ K     +EE+K A  IR +   V
Sbjct: 1739 AEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAV 1780



 Score = 36.3 bits (83), Expect = 0.034
 Identities = 29/106 (27%), Positives = 54/106 (50%), Gaps = 6/106 (5%)

Query: 15   KRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMR--AEEEQKMAQRKK 72
              +EAK +  D++  E   +      K  E LK+  E+  K + ++    EE+K A+  K
Sbjct: 1663 AAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELK 1722

Query: 73   EEEERRQREVEEKKQRDLEEKRQ----RLEEAEKKRQAMMAALKEQ 114
            + EE  + + EE K+   E+K++    + +E EKK+ A +   +E+
Sbjct: 1723 KAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEK 1768



 Score = 34.3 bits (78), Expect = 0.12
 Identities = 29/112 (25%), Positives = 59/112 (52%)

Query: 2    KTDVAGGGDPEFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRA 61
            K D A     E  K  EAK  A + + K    +   +  K+ ++ K+ +E +   +  +A
Sbjct: 1471 KADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKA 1530

Query: 62   EEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKE 113
            EE +K  + KK EE+++  E+++ ++    E++++ EEA+K  +    AL++
Sbjct: 1531 EEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRK 1582



 Score = 32.0 bits (72), Expect = 0.55
 Identities = 58/325 (17%), Positives = 122/325 (37%), Gaps = 35/325 (10%)

Query: 36   RNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQ 95
            + + AK+ E  ++ +E +   ++ +AE+ +K    +K EEER+  E  + +     E  +
Sbjct: 1171 KAEDAKKAEAARKAEEVRKAEELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVK 1230

Query: 96   RLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIE--EEKK 153
            + EEA+K  +    A +E+ N+        +        +  + E  +  ++++  EEKK
Sbjct: 1231 KAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKK 1290

Query: 154  IALNIRIKPLNVEGWSIHKLQAKATELWETIVKLETDKYDLEERQKRQDYDVSKIIINKK 213
             A   +      +     K   +A +  E   K E  K   +  +K+ +         K 
Sbjct: 1291 KADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKA 1350

Query: 214  IMPSPGQGRPGFNPRYVQTPNLFNNATALTGKYPPKIQVASKYERRVDTRSYDDKKKLFE 273
                                           +       A++ +     +  ++ KK  +
Sbjct: 1351 -----------------------------EAEAAADEAEAAEEKAEAAEKKKEEAKKKAD 1381

Query: 274  GGYLSLTKESLEKQYAEKVDQFGNRQKTKLPKWFGERPGKKKDAPESPEEEEVKRDDEEE 333
                +  K++ EK+ A++  +     K K  +       KKK      + EE K+ DE +
Sbjct: 1382 ----AAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAK 1437

Query: 334  ENPIYLKISDDEEEKEEENFMSNNA 358
            +     K +D+ ++K EE   +  A
Sbjct: 1438 KKAEEAKKADEAKKKAEEAKKAEEA 1462



 Score = 31.6 bits (71), Expect = 0.73
 Identities = 32/150 (21%), Positives = 67/150 (44%), Gaps = 5/150 (3%)

Query: 12   EFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRK---VMRAEEEQKMA 68
            E  K  E    A +E+  E I ++   R       +   + +  RK   + +AEE++K  
Sbjct: 1234 EAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKAD 1293

Query: 69   QRKKEEEERRQREVEEK--KQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQ 126
            + KK EE+++  E ++K  + +  +E +++ EEA+KK  A     +E    ++      +
Sbjct: 1294 EAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAE 1353

Query: 127  RREPGLTLSTAQLERNKTKEQIEEEKKIAL 156
                    +  + E  + K++  ++K  A 
Sbjct: 1354 AAADEAEAAEEKAEAAEKKKEEAKKKADAA 1383



 Score = 29.7 bits (66), Expect = 3.0
 Identities = 28/140 (20%), Positives = 65/140 (46%), Gaps = 1/140 (0%)

Query: 15   KRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQ-KMAQRKKE 73
             ++  +   +  +  E   +    R +E   L   ++K    +  +AEE + K  + KK 
Sbjct: 1569 AKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKA 1628

Query: 74   EEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLT 133
            EEE+++ E  +KK+ + ++K + L++AE++ +   A   ++  + K      ++ E    
Sbjct: 1629 EEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEK 1688

Query: 134  LSTAQLERNKTKEQIEEEKK 153
             +   L++   + +  EE K
Sbjct: 1689 KAAEALKKEAEEAKKAEELK 1708



 Score = 29.7 bits (66), Expect = 3.5
 Identities = 28/132 (21%), Positives = 61/132 (46%), Gaps = 8/132 (6%)

Query: 36   RNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVE--EKKQRDLEEK 93
            + + A++ ED K+ +  +   +V +AEE +K    +K E  R+  E    E+ ++  + K
Sbjct: 1165 KAEEARKAEDAKKAEAARKAEEVRKAEELRKAEDARKAEAARKAEEERKAEEARKAEDAK 1224

Query: 94   R----QRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIE 149
            +    ++ EEA+K  +    A +E+ N+        +        +  + E  +  ++++
Sbjct: 1225 KAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELK 1284

Query: 150  --EEKKIALNIR 159
              EEKK A   +
Sbjct: 1285 KAEEKKKADEAK 1296



 Score = 29.3 bits (65), Expect = 4.7
 Identities = 36/158 (22%), Positives = 73/158 (46%), Gaps = 7/158 (4%)

Query: 2    KTDVAGGGDPEFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMR- 60
            K D A     E  K +EAK  A + +  +   +   +  K +E  K+ +E + K    + 
Sbjct: 1445 KADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKK 1504

Query: 61   AEEEQKMAQRKKEEEERRQ----REVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTN 116
            A E +K A   K+ EE ++    ++ EE K+ D  +K +  ++A++ ++A      E+  
Sbjct: 1505 AAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKK 1564

Query: 117  KSKGPNFTIQRREPGLTLSTAQLERNKTKEQIEEEKKI 154
            K++      + +   + L  A+  +   + +IEE  K+
Sbjct: 1565 KAEEAKKAEEDKN--MALRKAEEAKKAEEARIEEVMKL 1600



 Score = 29.0 bits (64), Expect = 6.0
 Identities = 25/117 (21%), Positives = 52/117 (44%), Gaps = 3/117 (2%)

Query: 38   QRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRL 97
            + AK+ E  K  + ++A+    +AE+ +K  + +K E+ R+  E  + +     E  ++ 
Sbjct: 1101 EEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAEDARKAEEARKAEDAKRVEIARKA 1160

Query: 98   EEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIEEEKKI 154
            E+A K  +A  A   E   K++      + R+           + +   + EEE+K 
Sbjct: 1161 EDARKAEEARKA---EDAKKAEAARKAEEVRKAEELRKAEDARKAEAARKAEEERKA 1214



 Score = 29.0 bits (64), Expect = 6.3
 Identities = 25/132 (18%), Positives = 58/132 (43%), Gaps = 3/132 (2%)

Query: 26   EQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERR---QREV 82
            E+ K+       +  K EE  K+ ++ +   +  +AE+ +K  + +K E+ +R    R+ 
Sbjct: 1101 EEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAEDARKAEEARKAEDAKRVEIARKA 1160

Query: 83   EEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERN 142
            E+ ++ +   K +  ++AE  R+A      E+  K++        R+        +  + 
Sbjct: 1161 EDARKAEEARKAEDAKKAEAARKAEEVRKAEELRKAEDARKAEAARKAEEERKAEEARKA 1220

Query: 143  KTKEQIEEEKKI 154
            +  ++ E  KK 
Sbjct: 1221 EDAKKAEAVKKA 1232



 Score = 28.2 bits (62), Expect = 9.7
 Identities = 25/104 (24%), Positives = 53/104 (50%), Gaps = 4/104 (3%)

Query: 15   KRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQ--RKK 72
            K  E K +A  ++  +   +   ++ K +E  K+ +E +   +  +  EE K A+  +KK
Sbjct: 1406 KADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKK 1465

Query: 73   EEEERRQREVEEK--KQRDLEEKRQRLEEAEKKRQAMMAALKEQ 114
             EE ++  E ++K  + +  +E +++ EEA+KK      A + +
Sbjct: 1466 AEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAK 1509


>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
          Length = 1486

 Score = 36.9 bits (86), Expect = 0.017
 Identities = 26/119 (21%), Positives = 49/119 (41%), Gaps = 19/119 (15%)

Query: 19   AKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEK---------------QAKRKVMRAEE 63
            AK+S L+E+L++ + +   +R +  E L++ + +                AKR+ M  E 
Sbjct: 981  AKNSDLNEKLRQRLEQAEQERTRAREQLRQAQAQLAQYNQVLASLKSSYDAKRQ-MLQEL 1039

Query: 64   EQKMAQ---RKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSK 119
            +Q++           E R R   ++    L   R R  + EK+     A +   T K +
Sbjct: 1040 KQELQDLGVPADSGAEERARARRDELHARLSANRSRRNQLEKQLTFCEAEMDNLTKKLR 1098



 Score = 30.7 bits (70), Expect = 1.5
 Identities = 39/185 (21%), Positives = 66/185 (35%), Gaps = 44/185 (23%)

Query: 14  VKRQEAKSSALDEQLKEYIVEWRNQRAKEEE---------DLKRLKEKQAKRKVMRAE-- 62
           V R EA   A     +E +   R QR   E+         +L++   +Q + + + AE  
Sbjct: 489 VSRSEAWDVA-----RELLRRLREQRHLAEQLQQLRMRLSELEQRLRQQQRAERLLAEFC 543

Query: 63  -----------------EEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKR- 104
                            EE +       E     RE     ++ LE+ + R++    +  
Sbjct: 544 KRLGKNLDDEDELEQLQEELEARLESLSESVSEARERRMALRQQLEQLQARIQRLAARAP 603

Query: 105 -----QAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIE-EEKKIALNI 158
                Q  +A L+EQ+    G  F   +          + ER  T E+ E   +K AL+ 
Sbjct: 604 AWLAAQDALARLREQS----GEEFEDSQDVTEYMQQLLERERELTVERDELAARKQALDE 659

Query: 159 RIKPL 163
            I+ L
Sbjct: 660 EIERL 664



 Score = 30.3 bits (69), Expect = 2.2
 Identities = 24/93 (25%), Positives = 37/93 (39%), Gaps = 10/93 (10%)

Query: 36  RNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRK----KEEEERRQREVEEKKQRDLE 91
           R Q A E+  L  +  + A+     ++ EQ            +   RQ+E  E+ Q DLE
Sbjct: 299 RRQLAAEQYRLVEMARELAELNEAESDLEQDYQAASDHLNLVQTALRQQEKIERYQADLE 358

Query: 92  EKRQRLEEAE------KKRQAMMAALKEQTNKS 118
           E  +RLEE         ++Q    A  E   + 
Sbjct: 359 ELEERLEEQNEVVEEADEQQEENEARAEAAEEE 391


>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
           Provisional.
          Length = 695

 Score = 36.5 bits (85), Expect = 0.021
 Identities = 26/107 (24%), Positives = 45/107 (42%), Gaps = 4/107 (3%)

Query: 53  QAKRKVMRAEEEQKMAQRKKEE-EERRQREVEEKKQRDLEEKR---QRLEEAEKKRQAMM 108
           QAK ++   E+E+K A+  K   E R+ R   EK  R+   K+    R  + +    A +
Sbjct: 433 QAKAEIRAIEQEKKKAEEAKARFEARQARLEREKAAREARHKKAAEARAAKDKDAVAAAL 492

Query: 109 AALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIEEEKKIA 155
           A +K +   +  P        P  +   A  E  K + +  + +K A
Sbjct: 493 ARVKAKKAAATQPIVIKAGARPDNSAVIAAREARKAQARARQAEKQA 539


>gnl|CDD|221371 pfam12004, DUF3498, Domain of unknown function (DUF3498).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 433 to 538 amino acids in length. This domain is
           found associated with pfam00616, pfam00168. This domain
           has two conserved sequence motifs: DLQ and PLSFQNP.
          Length = 489

 Score = 36.2 bits (83), Expect = 0.022
 Identities = 22/83 (26%), Positives = 43/83 (51%), Gaps = 13/83 (15%)

Query: 21  SSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQR 80
              ++ + +E   E      K E+++ RLKE+             +++ R+ EE ERR  
Sbjct: 348 YEDIEYETEENREEGTQAE-KYEQEIARLKERL------------RVSVRRLEEYERRLL 394

Query: 81  EVEEKKQRDLEEKRQRLEEAEKK 103
             E++ Q+ L+E + RLE++E++
Sbjct: 395 GQEQQMQKLLQEYQARLEDSEER 417


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 36.6 bits (85), Expect = 0.023
 Identities = 26/134 (19%), Positives = 61/134 (45%), Gaps = 11/134 (8%)

Query: 26  EQLKEYIVEWRNQRA---KEEEDLK-RLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQRE 81
           ++L+E  ++ + Q     KE E+L  + +E + + + + A      ++    ++ER + E
Sbjct: 836 QELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELE 895

Query: 82  VE----EKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTA 137
            +    E+K  +LE + ++  +   + +A + AL+E+               P   LS  
Sbjct: 896 AQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEE-LSEIEDPKGEDEEIPEEELSLE 954

Query: 138 QLERNKTKEQIEEE 151
            ++     +++EEE
Sbjct: 955 DVQ--AELQRVEEE 966



 Score = 33.9 bits (78), Expect = 0.18
 Identities = 42/191 (21%), Positives = 81/191 (42%), Gaps = 11/191 (5%)

Query: 9   GDPEFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMA 68
            + E V+    +   + ++ ++ +   R +R K E     LKEK+        +E++ + 
Sbjct: 177 EELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYEGYELLKEKEALE 236

Query: 69  QRKKEEEERRQ--REVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQ 126
           ++K+  E +     E  EK   ++ E  +RLEE E+  + +   +K+     +     ++
Sbjct: 237 RQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKD---LGEEEQLRVK 293

Query: 127 RREPGLTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVK 186
            +   L    A LER+  +++ E E       R+  L  E   I KL A+  EL   I +
Sbjct: 294 EKIGELEAEIASLERSIAEKERELED---AEERLAKLEAE---IDKLLAEIEELEREIEE 347

Query: 187 LETDKYDLEER 197
               +  L E 
Sbjct: 348 ERKRRDKLTEE 358



 Score = 32.7 bits (75), Expect = 0.40
 Identities = 33/105 (31%), Positives = 50/105 (47%), Gaps = 8/105 (7%)

Query: 14   VKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDL---KRLKEKQAKRKVMRAEEEQKM-AQ 69
            +   +AK  AL+E+L E            EE+L       E Q   + +RA E   M A 
Sbjct: 919  LSELKAKLEALEEELSEIEDPKGEDEEIPEEELSLEDVQAELQRVEEEIRALEPVNMLAI 978

Query: 70   RKKEEEERRQREVEEKKQRDLEEKRQRLEEAE----KKRQAMMAA 110
            ++ EE  +R  E++EK+ +  EE++  LE  E    KKR+  M A
Sbjct: 979  QEYEEVLKRLDELKEKRAKLEEERKAILERIEEYEKKKREVFMEA 1023



 Score = 32.0 bits (73), Expect = 0.68
 Identities = 28/198 (14%), Positives = 80/198 (40%), Gaps = 9/198 (4%)

Query: 12  EFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRK 71
           E ++++E K     E+L+E +     +    + +LK L+ +  + +    + E+ +   +
Sbjct: 726 EQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLE 785

Query: 72  KEEEERRQREVEEKKQ------RDLEEKRQRLEEAEKKRQAMMAALKE--QTNKSKGPNF 123
                 R  E++ +          +E + + +E+   +       L++  Q  + +  + 
Sbjct: 786 ARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDL 845

Query: 124 TIQRREPGLTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWET 183
             Q +     +     ++ + +E++EE +    ++  +  +++     +L+A+  EL   
Sbjct: 846 KEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKK-ERDELEAQLRELERK 904

Query: 184 IVKLETDKYDLEERQKRQ 201
           I +LE       +R    
Sbjct: 905 IEELEAQIEKKRKRLSEL 922



 Score = 30.8 bits (70), Expect = 1.4
 Identities = 21/97 (21%), Positives = 44/97 (45%), Gaps = 5/97 (5%)

Query: 15  KRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLK----RLKEKQAKRKVMRAEEEQKMAQR 70
           KR+  +     ++L E + +     A  E  +       ++K  + K    + EQ  A  
Sbjct: 405 KRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADL 464

Query: 71  KKEEEERRQREVE-EKKQRDLEEKRQRLEEAEKKRQA 106
            K E+E    + E ++ +++L + ++ L EAE + +A
Sbjct: 465 SKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQARA 501



 Score = 30.0 bits (68), Expect = 2.4
 Identities = 47/219 (21%), Positives = 85/219 (38%), Gaps = 34/219 (15%)

Query: 10  DPEFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQ 69
           +P  ++R   +   L  +L     E R  R +   D    +   A RK+   E+E +  +
Sbjct: 672 EPAELQRLRERLEGLKRELSSLQSELR--RIENRLDELSQELSDASRKIGEIEKEIEQLE 729

Query: 70  RKKEEEERRQREVEEK---KQRDLEEKRQRLEEAEKKRQAMMAAL--------------- 111
           +++E+ + R  E+EE     ++++E  +  L+E E + + +   L               
Sbjct: 730 QEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLS 789

Query: 112 ----KEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIEEEKK-------IALNIRI 160
                E   +       + R E  L     +L R   +++  E++        I L  +I
Sbjct: 790 HSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQI 849

Query: 161 KPLNVEGWSIHKLQAKATELWETIVKLETDKYDLEERQK 199
           K +  E   I  L  K  EL E + +LE    DLE R  
Sbjct: 850 KSIEKE---IENLNGKKEELEEELEELEAALRDLESRLG 885


>gnl|CDD|222709 pfam14362, DUF4407, Domain of unknown function (DUF4407).  This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 366 and 597 amino
           acids in length. There is a single completely conserved
           residue R that may be functionally important.
          Length = 297

 Score = 36.1 bits (84), Expect = 0.023
 Identities = 29/150 (19%), Positives = 51/150 (34%), Gaps = 32/150 (21%)

Query: 26  EQLKEYIVEW-RNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVE- 83
           E+      +     R K       +   QA+    +AE      + + E E      V  
Sbjct: 120 EEQAAAQAQVAAGFRPKIAALTAEIAALQAEIDEAQAEVNAAYQEAQCEAEGTGGTGVAG 179

Query: 84  -----EKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQ 138
                ++K+  L+  + RLE  + +  A +A L+ Q                      A 
Sbjct: 180 KGPVYKEKREKLDAAQARLETLKARLDAAIAQLEAQ---------------------KAA 218

Query: 139 LERNKTKEQIEEEKKIA----LNIRIKPLN 164
           LERN+     E++ +I     L  R++ L 
Sbjct: 219 LERNRQAAVAEKQARIEANDGLLARLEALG 248


>gnl|CDD|178635 PLN03086, PLN03086, PRLI-interacting factor K; Provisional.
          Length = 567

 Score = 36.0 bits (83), Expect = 0.027
 Identities = 18/46 (39%), Positives = 32/46 (69%)

Query: 69  QRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQ 114
           +R +E+ ER QRE +++ +  LE +R+  EEA K+R+A+ AA + +
Sbjct: 6   RRAREKLEREQRERKQRAKLKLERERKAKEEAAKQREAIEAAQRSR 51



 Score = 33.3 bits (76), Expect = 0.23
 Identities = 20/63 (31%), Positives = 34/63 (53%), Gaps = 5/63 (7%)

Query: 52  KQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAAL 111
           ++A+ K+ R + E+K   + K E ER+ +E E  KQR+  E  QR     ++  A+ A +
Sbjct: 6   RRAREKLEREQRERKQRAKLKLERERKAKE-EAAKQREAIEAAQR----SRRLDAIEAQI 60

Query: 112 KEQ 114
           K  
Sbjct: 61  KAD 63



 Score = 32.2 bits (73), Expect = 0.49
 Identities = 22/78 (28%), Positives = 40/78 (51%), Gaps = 11/78 (14%)

Query: 34  EWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRD--LE 91
           E R  R K E + +  K++ AK   ++ E E     RK +EE  +QRE  E  QR   L+
Sbjct: 4   ELRRAREKLEREQRERKQR-AK---LKLERE-----RKAKEEAAKQREAIEAAQRSRRLD 54

Query: 92  EKRQRLEEAEKKRQAMMA 109
               +++  ++ ++++ A
Sbjct: 55  AIEAQIKADQQMQESLQA 72


>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 157

 Score = 34.5 bits (80), Expect = 0.029
 Identities = 22/110 (20%), Positives = 60/110 (54%), Gaps = 14/110 (12%)

Query: 10  DPEFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEK-QAKRKVMRAEEEQKMA 68
           D + V  +     A  +QL++   + + +  K+E++L++ ++K Q +   +  E  +   
Sbjct: 22  DVQKVLSESPAGKAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKA-- 79

Query: 69  QRKKEEEERRQREVEEKKQRDLEEKRQRLE-EAEKKRQAMMAALKEQTNK 117
                    +Q+E++ +KQ++L++K+Q  + E ++K+Q ++  + ++ +K
Sbjct: 80  ---------KQQELQ-QKQQELQQKQQAAQQELQQKQQELLQPIYDKIDK 119


>gnl|CDD|201540 pfam00992, Troponin, Troponin.  Troponin (Tn) contains three
           subunits, Ca2+ binding (TnC), inhibitory (TnI), and
           tropomyosin binding (TnT). this Pfam contains members of
           the TnT subunit. Troponin is a complex of three
           proteins, Ca2+ binding (TnC), inhibitory (TnI), and
           tropomyosin binding (TnT). The troponin complex
           regulates Ca++ induced muscle contraction. This family
           includes troponin T and troponin I. Troponin I binds to
           actin and troponin T binds to tropomyosin.
          Length = 131

 Score = 34.1 bits (79), Expect = 0.037
 Identities = 23/74 (31%), Positives = 39/74 (52%), Gaps = 1/74 (1%)

Query: 132 LTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKLETDK 191
           L L  A  E    + + EEEK+  L  R  PL +   S  +LQ    +L   I +L+ ++
Sbjct: 7   LLLLKAAEELEFEQRKKEEEKEKYLAERCPPLRLSL-SRAELQELCKKLHARIDRLDEER 65

Query: 192 YDLEERQKRQDYDV 205
           YD+EE+  ++D ++
Sbjct: 66  YDIEEKVAKKDKEI 79



 Score = 28.3 bits (64), Expect = 3.0
 Identities = 21/85 (24%), Positives = 36/85 (42%), Gaps = 11/85 (12%)

Query: 38  QRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRL 97
           QR KEEE  K L E+    ++  +  E    Q   ++   R   ++E+        R  +
Sbjct: 20  QRKKEEEKEKYLAERCPPLRLSLSRAEL---QELCKKLHARIDRLDEE--------RYDI 68

Query: 98  EEAEKKRQAMMAALKEQTNKSKGPN 122
           EE   K+   +  LK++ N  +G  
Sbjct: 69  EEKVAKKDKEIEDLKKKVNDLRGKF 93



 Score = 27.9 bits (63), Expect = 4.5
 Identities = 20/81 (24%), Positives = 42/81 (51%), Gaps = 4/81 (4%)

Query: 39  RAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLE 98
           RA+ +E  K+L  +  +    R + E+K+A++ KE E+ +++  + + +     K+  L+
Sbjct: 44  RAELQELCKKLHARIDRLDEERYDIEEKVAKKDKEIEDLKKKVNDLRGKF----KKPTLK 99

Query: 99  EAEKKRQAMMAALKEQTNKSK 119
           +  K   AM+ AL    +K  
Sbjct: 100 KVRKSADAMLKALLGSKHKVS 120


>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
          Length = 1036

 Score = 36.0 bits (83), Expect = 0.037
 Identities = 17/41 (41%), Positives = 26/41 (63%)

Query: 40  AKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQR 80
           AKEE + +R  E+Q +R+  +A  E   AQ K E E+RR++
Sbjct: 262 AKEEAERERQAEEQRRREEEKAAMEADRAQAKAEVEKRREK 302



 Score = 34.8 bits (80), Expect = 0.083
 Identities = 14/63 (22%), Positives = 37/63 (58%), Gaps = 6/63 (9%)

Query: 41  KEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEA 100
           ++  +L++L +++A+R+    E+ ++  ++   E +R Q + E      +E++R++L+  
Sbjct: 253 EKRRELEKLAKEEAERERQAEEQRRREEEKAAMEADRAQAKAE------VEKRREKLQNL 306

Query: 101 EKK 103
            KK
Sbjct: 307 LKK 309



 Score = 31.4 bits (71), Expect = 1.0
 Identities = 17/58 (29%), Positives = 34/58 (58%), Gaps = 6/58 (10%)

Query: 61  AEEEQKMAQRKKEEEERRQREVEEKKQRDLE------EKRQRLEEAEKKRQAMMAALK 112
            EE+++  ++  +EE  R+R+ EE+++R+ E      ++ Q   E EK+R+ +   LK
Sbjct: 251 LEEKRRELEKLAKEEAERERQAEEQRRREEEKAAMEADRAQAKAEVEKRREKLQNLLK 308


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 35.4 bits (81), Expect = 0.043
 Identities = 26/127 (20%), Positives = 54/127 (42%), Gaps = 11/127 (8%)

Query: 24  LDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVE 83
           + E  K  I + + +  K +E+  + K+ +A      ++  +K A+ K+ E ++++  V 
Sbjct: 275 VAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKREPVA 334

Query: 84  EKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNK 143
           E  Q+   +             A   +L E    S  P + ++  +P   LS   L   K
Sbjct: 335 EDLQKTKPQVE-----------AQPTSLNEDAIDSSNPVYGLKVVDPITNLSELVLIDLK 383

Query: 144 TKEQIEE 150
           T+ ++ E
Sbjct: 384 TEVRLRE 390



 Score = 33.4 bits (76), Expect = 0.18
 Identities = 18/84 (21%), Positives = 37/84 (44%), Gaps = 2/84 (2%)

Query: 38  QRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRL 97
           QR +  +  +  K           +E++++A+ +K E E+ Q E+++  +  L+ K  + 
Sbjct: 246 QRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKA 305

Query: 98  EEA--EKKRQAMMAALKEQTNKSK 119
            +   E K     A  KE   + K
Sbjct: 306 FDLKQESKASEKEAEDKELEAQKK 329



 Score = 31.5 bits (71), Expect = 0.69
 Identities = 13/84 (15%), Positives = 42/84 (50%), Gaps = 2/84 (2%)

Query: 25  DEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEE 84
               +  + + + + ++E+    +  +++  +K + A++ Q+ A   ++  ++++ EV +
Sbjct: 193 GVNFRRDMTDLKERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQ 252

Query: 85  KKQ--RDLEEKRQRLEEAEKKRQA 106
           K+Q  ++L +        E K+ A
Sbjct: 253 KQQEAKNLPKPADTSSPKEDKQVA 276



 Score = 29.6 bits (66), Expect = 2.7
 Identities = 21/138 (15%), Positives = 61/138 (44%), Gaps = 6/138 (4%)

Query: 22  SALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQRE 81
           S  D+++ E + E   +      D+  LKE++++    RA++ ++   +K+ + ++ Q++
Sbjct: 176 SISDKKVVEALREDNEKGVNFRRDMTDLKERESQEDAKRAQQLKEELDKKQIDADKAQQK 235

Query: 82  VE------EKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLS 135
            +      +K++ ++ +K+Q  +   K          +Q  +++       + E      
Sbjct: 236 ADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDE 295

Query: 136 TAQLERNKTKEQIEEEKK 153
            A   ++     +++E K
Sbjct: 296 EALKAKDHKAFDLKQESK 313


>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
           Transcription of the anti-viral guanylate-binding
           protein (GBP) is induced by interferon-gamma during
           macrophage induction. This family contains GBP1 and
           GPB2, both GTPases capable of binding GTP, GDP and GMP.
          Length = 297

 Score = 35.0 bits (81), Expect = 0.044
 Identities = 21/81 (25%), Positives = 40/81 (49%)

Query: 18  EAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEER 77
           EA+   L E+ KE       Q    +E +K+L EK    +     E+++M + K +E+E 
Sbjct: 217 EAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREKLLAEQERMLEHKLQEQEE 276

Query: 78  RQREVEEKKQRDLEEKRQRLE 98
             +E  + +   L+++ Q L+
Sbjct: 277 LLKEGFKTEAESLQKEIQDLK 297



 Score = 30.3 bits (69), Expect = 1.3
 Identities = 13/60 (21%), Positives = 35/60 (58%)

Query: 50  KEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMA 109
           KEK  + +  +AE  +   +  +E+++  ++ +E +++   E  +Q +E+ E +R+ ++A
Sbjct: 202 KEKAIEAERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREKLLA 261


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 35.0 bits (81), Expect = 0.059
 Identities = 14/48 (29%), Positives = 33/48 (68%), Gaps = 6/48 (12%)

Query: 41  KEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQR 88
            ++E+L+R KE++       A +EQK  ++ K++EE++++E+E+ ++ 
Sbjct: 548 DDKEELQREKEEK------EALKEQKRLRKLKKQEEKKKKELEKLEKA 589



 Score = 30.0 bits (68), Expect = 2.6
 Identities = 9/39 (23%), Positives = 25/39 (64%)

Query: 62  EEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEA 100
           EE Q+  + K+  +E+++    +K++   +++ ++LE+A
Sbjct: 551 EELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKA 589



 Score = 29.2 bits (66), Expect = 3.9
 Identities = 12/37 (32%), Positives = 23/37 (62%)

Query: 69  QRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQ 105
           QR+KEE+E  + +   +K +  EEK+++  E  +K +
Sbjct: 554 QREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAK 590



 Score = 29.2 bits (66), Expect = 4.1
 Identities = 13/46 (28%), Positives = 30/46 (65%), Gaps = 1/46 (2%)

Query: 52  KQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRL 97
           K   ++ ++ E+E+K A  K+++  R+ ++ EEKK+++LE+  +  
Sbjct: 546 KLDDKEELQREKEEKEAL-KEQKRLRKLKKQEEKKKKELEKLEKAK 590


>gnl|CDD|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein,
           HlyD family.  Type I secretion is an ABC transport
           process that exports proteins, without cleavage of any
           signal sequence, from the cytosol to extracellular
           medium across both inner and outer membranes. The
           secretion signal is found in the C-terminus of the
           transported protein. This model represents the adaptor
           protein between the ATP-binding cassette (ABC) protein
           of the inner membrane and the outer membrane protein,
           and is called the membrane fusion protein. This model
           selects a subfamily closely related to HlyD; it is
           defined narrowly and excludes, for example, colicin V
           secretion protein CvaA and multidrug efflux proteins
           [Protein fate, Protein and peptide secretion and
           trafficking].
          Length = 423

 Score = 35.0 bits (81), Expect = 0.064
 Identities = 25/104 (24%), Positives = 50/104 (48%), Gaps = 10/104 (9%)

Query: 14  VKRQEAKSSALDEQLKEYIVEWRNQR---AKEEEDLKRLKEKQAKRKVMRAEEEQKMAQR 70
           +K+ EA+ + L  QL+      R Q    ++E E  ++LKEK    ++   E E++  + 
Sbjct: 153 IKQLEAELAGLQAQLQAL----RQQLEVISEELEARRKLKEKGLVSRLELLELERE--RA 206

Query: 71  KKEEEERRQREVEEKKQRDLEEKRQRLEEAE-KKRQAMMAALKE 113
           + + E  R     E  +R ++E +   ++ E   R+ ++  L E
Sbjct: 207 EAQGELGRLEAELEVLKRQIDELQLERQQIEQTFREEVLEELTE 250



 Score = 28.4 bits (64), Expect = 7.3
 Identities = 37/161 (22%), Positives = 65/161 (40%), Gaps = 25/161 (15%)

Query: 53  QAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALK 112
           ++++  +RA+ E  +AQ K+ E E          Q  L+  RQ+LE   ++ +     LK
Sbjct: 136 ESRKSTLRAQLELILAQIKQLEAEL------AGLQAQLQALRQQLEVISEELE-ARRKLK 188

Query: 113 EQTNKSKGPNFTIQRREPGLTLSTAQLE--RNKTKEQIEEEKKIALNIR-IKPLNVEGWS 169
           E+   S+     ++R          +LE      K QI+E   + L  + I+    E   
Sbjct: 189 EKGLVSRLELLELERERAEAQGELGRLEAELEVLKRQIDE---LQLERQQIEQTFREEVL 245

Query: 170 IHKLQAKATELWETIVKLETDKYDLEERQKRQDYDVSKIII 210
                    EL E   +L     +L ER  +    + ++II
Sbjct: 246 --------EELTEAQARLA----ELRERLNKARDRLQRLII 274


>gnl|CDD|236861 PRK11147, PRK11147, ABC transporter ATPase component; Reviewed.
          Length = 635

 Score = 34.9 bits (81), Expect = 0.074
 Identities = 24/105 (22%), Positives = 44/105 (41%), Gaps = 14/105 (13%)

Query: 50  KEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEE---KKQRDLEEKRQRLEEAEKKRQA 106
           +++QA+   ++    +K  +    + E  +R  ++   K QR+LE+  Q LE+ E +   
Sbjct: 527 RQQQAQYLALKQPAVKKKEEAAAPKAETVKRSSKKLSYKLQRELEQLPQLLEDLEAE--- 583

Query: 107 MMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIEEE 151
            + AL+ Q       +F  Q  E     +   L      EQ  E 
Sbjct: 584 -IEALQAQVA---DADFFSQPHE----QTQKVLADLADAEQELEV 620


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 34.5 bits (80), Expect = 0.079
 Identities = 20/74 (27%), Positives = 42/74 (56%)

Query: 33  VEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEE 92
           +E+     K  + +K++ EK  K++    +E++K A   K++EE  + E E+K++   EE
Sbjct: 399 IEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEE 458

Query: 93  KRQRLEEAEKKRQA 106
           + +  EE E++ + 
Sbjct: 459 EEEAEEEKEEEEEK 472



 Score = 34.5 bits (80), Expect = 0.080
 Identities = 19/81 (23%), Positives = 45/81 (55%), Gaps = 2/81 (2%)

Query: 42  EEEDLKRL--KEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEE 99
            EE+++ L   +K  K+     E+ +K  + +K+E++++    ++K++ + EEK ++ EE
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454

Query: 100 AEKKRQAMMAALKEQTNKSKG 120
            E++ +      +E+  K K 
Sbjct: 455 KEEEEEEAEEEKEEEEEKKKK 475



 Score = 29.5 bits (67), Expect = 3.5
 Identities = 26/92 (28%), Positives = 43/92 (46%), Gaps = 4/92 (4%)

Query: 276 YLSLTKESLE----KQYAEKVDQFGNRQKTKLPKWFGERPGKKKDAPESPEEEEVKRDDE 331
           +L LT+E +E     + A K  +    +  K  +   +   KK  A +  EEEE +  ++
Sbjct: 391 FLELTEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEK 450

Query: 332 EEENPIYLKISDDEEEKEEENFMSNNATAEDF 363
           +EE     +   +EE++EEE      AT  DF
Sbjct: 451 KEEEKEEEEEEAEEEKEEEEEKKKKQATLFDF 482


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 34.8 bits (80), Expect = 0.083
 Identities = 31/167 (18%), Positives = 68/167 (40%), Gaps = 7/167 (4%)

Query: 57  KVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTN 116
                E++ +  ++  +E+E+ ++E+E++ +   E +R +  E EK+ Q  + ALK++  
Sbjct: 523 LEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKALKKEVE 582

Query: 117 KSKGPNFTIQRREPGLTLSTAQLERNKTKEQIEEEK-------KIALNIRIKPLNVEGWS 169
                    +  +     S   L + K  +Q   +K       KI   +RI+    +G  
Sbjct: 583 SIIRELKEKKIHKAKEIKSIEDLVKLKETKQKIPQKPTNFQADKIGDKVRIRYFGQKGKI 642

Query: 170 IHKLQAKATELWETIVKLETDKYDLEERQKRQDYDVSKIIINKKIMP 216
           +  L      +    ++++    +LE+  K       K+    K  P
Sbjct: 643 VQILGGNKWNVTVGGMRMKVHGSELEKINKAPPPKKFKVPKTTKPEP 689



 Score = 28.6 bits (64), Expect = 7.0
 Identities = 30/171 (17%), Positives = 65/171 (38%), Gaps = 25/171 (14%)

Query: 12  EFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRK 71
              K  E K+  L++ LKE       ++ K+E + +  + K+ +R              K
Sbjct: 522 ALEKELEQKNEHLEKLLKE------QEKLKKELEQEMEELKERERN-------------K 562

Query: 72  KEEEERRQREVEEKKQRDLEEK----RQRLEEAEKKRQAMMAALKEQTNKSKGPN--FTI 125
           K E E+  +E  +  ++++E      +++     K+ +++   +K +  K K P      
Sbjct: 563 KLELEKEAQEALKALKKEVESIIRELKEKKIHKAKEIKSIEDLVKLKETKQKIPQKPTNF 622

Query: 126 QRREPGLTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAK 176
           Q  + G  +      +     QI    K  + +    + V G  + K+   
Sbjct: 623 QADKIGDKVRIRYFGQKGKIVQILGGNKWNVTVGGMRMKVHGSELEKINKA 673


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 34.4 bits (79), Expect = 0.091
 Identities = 17/101 (16%), Positives = 50/101 (49%)

Query: 17  QEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEE 76
           Q+ KS+   E+ ++   + + +  ++++  ++ + KQ +++ + A+E++K A+   ++  
Sbjct: 69  QQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAA 128

Query: 77  RRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNK 117
            +Q++ EE   +     + + E   K+  A       +  K
Sbjct: 129 LKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKK 169



 Score = 33.2 bits (76), Expect = 0.19
 Identities = 23/82 (28%), Positives = 35/82 (42%)

Query: 38  QRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRL 97
             AK     K   E +AKR    A++    A++K E E  ++   E KK+ + E   +  
Sbjct: 137 AAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAA 196

Query: 98  EEAEKKRQAMMAALKEQTNKSK 119
            EA+KK +A          K K
Sbjct: 197 AEAKKKAEAEAKKKAAAEAKKK 218



 Score = 32.1 bits (73), Expect = 0.49
 Identities = 26/103 (25%), Positives = 44/103 (42%), Gaps = 3/103 (2%)

Query: 12  EFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQR- 70
           E +K+ E +  A  EQ K+   E   Q A +++  +    K A     +AE E K A   
Sbjct: 101 ERLKQLEKERLAAQEQKKQAE-EAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAA 159

Query: 71  -KKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALK 112
            KK   E +++   E  ++   E +++ E     + A  A  K
Sbjct: 160 AKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKK 202



 Score = 29.0 bits (65), Expect = 3.9
 Identities = 19/82 (23%), Positives = 32/82 (39%)

Query: 38  QRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRL 97
           ++A  E   K   E   K      ++ +  A  K   E +++ E E KK+   E K++  
Sbjct: 161 KKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAA 220

Query: 98  EEAEKKRQAMMAALKEQTNKSK 119
            EA+       A  K    K+ 
Sbjct: 221 AEAKAAAAKAAAEAKAAAEKAA 242


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
           envelope biogenesis, outer membrane].
          Length = 387

 Score = 33.8 bits (77), Expect = 0.12
 Identities = 26/103 (25%), Positives = 54/103 (52%), Gaps = 5/103 (4%)

Query: 15  KRQEAKSSALDEQLKEYIV--EWRNQRAKEEEDLKRLKEKQ---AKRKVMRAEEEQKMAQ 69
           ++  AK      + KE  V  E + ++A E+E LK+L++++    +++    E E++   
Sbjct: 70  QQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQL 129

Query: 70  RKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALK 112
            +K++EE+ ++   E+K++    K +   EA K + A  A  K
Sbjct: 130 EQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKK 172



 Score = 31.1 bits (70), Expect = 0.89
 Identities = 25/109 (22%), Positives = 56/109 (51%), Gaps = 4/109 (3%)

Query: 6   AGGGDPEFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQ 65
            GGG    +        A+ +Q      +  + +  E++  K+ ++   + K  +A E++
Sbjct: 42  GGGGGGSVIDAVMVDPGAVVQQYGRIQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQE 101

Query: 66  KMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQ 114
           ++ Q +KE    R +  E++KQ +  EK+ +LE+ +++ QA  AA +++
Sbjct: 102 RLKQLEKE----RLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQK 146



 Score = 29.5 bits (66), Expect = 3.2
 Identities = 29/107 (27%), Positives = 50/107 (46%), Gaps = 4/107 (3%)

Query: 12  EFVKRQEAKSSALDEQLKEYIVEWRNQRAKEE-EDLKRLKEKQAKRKVMRAEEEQKM--- 67
           E   +Q A+   L +  KE +     Q+  EE E   +L++KQ + +  +A  EQK    
Sbjct: 91  ELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAE 150

Query: 68  AQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQ 114
           A + K   E  + +   + ++  EE  +  EEA+ K +A  A  K +
Sbjct: 151 AAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAE 197



 Score = 28.4 bits (63), Expect = 7.1
 Identities = 17/80 (21%), Positives = 31/80 (38%), Gaps = 2/80 (2%)

Query: 38  QRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRL 97
           Q+ K E    +   + AK K   A E +K A+   +  E  + + E    +   E   + 
Sbjct: 145 QKKKAEAAKAKAAAEAAKLKA--AAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKA 202

Query: 98  EEAEKKRQAMMAALKEQTNK 117
              + K +A   A  E+  +
Sbjct: 203 AAEKAKAEAEAKAKAEKKAE 222


>gnl|CDD|184696 PRK14474, PRK14474, F0F1 ATP synthase subunit B; Provisional.
          Length = 250

 Score = 33.6 bits (77), Expect = 0.12
 Identities = 18/82 (21%), Positives = 45/82 (54%)

Query: 34  EWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEK 93
            W++   +++E  +  +  + K++ +  +    MAQ ++  +E+RQ  + E ++     +
Sbjct: 44  RWQDAEQRQQEAGQEAERYRQKQQSLEQQRASFMAQAQEAADEQRQHLLNEAREDVATAR 103

Query: 94  RQRLEEAEKKRQAMMAALKEQT 115
            + LE+ E+++Q    AL++QT
Sbjct: 104 DEWLEQLEREKQEFFKALQQQT 125


>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
           component YidC; Validated.
          Length = 429

 Score = 34.1 bits (78), Expect = 0.12
 Identities = 19/86 (22%), Positives = 36/86 (41%)

Query: 48  RLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAM 107
            L  + A+ K  R  E+ +   RKKE  ++R+    E  +   +E+   +  A  +R A+
Sbjct: 324 ELHAENAEIKKTRTAEKNEAKARKKEIAQKRRAAEREINREARQERAAAMARARARRAAV 383

Query: 108 MAALKEQTNKSKGPNFTIQRREPGLT 133
            A  K   + S   +   +  E   +
Sbjct: 384 KAKKKGLIDASPNEDTPSENEESKGS 409



 Score = 31.0 bits (70), Expect = 1.2
 Identities = 12/84 (14%), Positives = 31/84 (36%), Gaps = 1/84 (1%)

Query: 40  AKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRK-KEEEERRQREVEEKKQRDLEEKRQRLE 98
            K E   ++ +  Q +R   R    +   +R       R +R   + K++ L +     +
Sbjct: 339 EKNEAKARKKEIAQKRRAAEREINREARQERAAAMARARARRAAVKAKKKGLIDASPNED 398

Query: 99  EAEKKRQAMMAALKEQTNKSKGPN 122
              +  ++  +  + +   +  PN
Sbjct: 399 TPSENEESKGSPPQVEATTTAEPN 422


>gnl|CDD|216807 pfam01956, DUF106, Integral membrane protein DUF106.  This
           archaebacterial protein family has no known function.
           Members are predicted to be integral membrane proteins.
          Length = 168

 Score = 33.0 bits (76), Expect = 0.13
 Identities = 18/59 (30%), Positives = 36/59 (61%), Gaps = 5/59 (8%)

Query: 70  RKKEEEERRQREVEEKKQRDLEE--KRQRLEEAEKKRQAMMAALKE-QTNKSKGPNFTI 125
           RK E+ ++R++E+ +K+ R+L +   +   ++ EK+++ +M   KE   +  K PNF I
Sbjct: 40  RKMEKYQKREKEI-QKRARELRKNGDKLSPKKFEKRQEELMEDQKEMMMDMMK-PNFAI 96


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 33.9 bits (77), Expect = 0.14
 Identities = 20/55 (36%), Positives = 37/55 (67%), Gaps = 6/55 (10%)

Query: 52  KQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQA 106
           K AK++    EE  + A+R+ E++ R +RE E++K+++ E +R+R  EAE+  +A
Sbjct: 576 KLAKKR----EEAVEKAKREAEQKAREEREREKEKEKERERERER--EAERAAKA 624



 Score = 33.5 bits (76), Expect = 0.18
 Identities = 16/47 (34%), Positives = 35/47 (74%), Gaps = 2/47 (4%)

Query: 66  KMAQRKKEEEERRQREVEEK--KQRDLEEKRQRLEEAEKKRQAMMAA 110
           K+A++++E  E+ +RE E+K  ++R+ E+++++  E E++R+A  AA
Sbjct: 576 KLAKKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAERAA 622



 Score = 28.5 bits (63), Expect = 6.5
 Identities = 11/40 (27%), Positives = 26/40 (65%)

Query: 50  KEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRD 89
           K ++A  K  R  E++   +R++E+E+ ++RE E +++ +
Sbjct: 580 KREEAVEKAKREAEQKAREEREREKEKEKEREREREREAE 619


>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
          Length = 746

 Score = 34.1 bits (79), Expect = 0.14
 Identities = 25/142 (17%), Positives = 49/142 (34%), Gaps = 3/142 (2%)

Query: 12  EFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRK 71
           E  K     S  L E+LKE  +E ++  +  EE+  R +E + + +     E ++ A  +
Sbjct: 8   ELAKELGVSSKELLEKLKELGIEVKSHSSTVEEEEARKEEAKREAEEEAKAEAEEAAAAE 67

Query: 72  KEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPG 131
            EEE + +       +   E            R A   A +     ++ P      ++  
Sbjct: 68  AEEEAKAEAAAAAPAE---EAAEAAAAAEAAARPAEDEAARPAEAAARRPKAKKAAKKKK 124

Query: 132 LTLSTAQLERNKTKEQIEEEKK 153
                 +  + K     +  K 
Sbjct: 125 GPKPKKKKPKRKAARGGKRGKG 146


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 34.0 bits (78), Expect = 0.16
 Identities = 33/155 (21%), Positives = 70/155 (45%), Gaps = 5/155 (3%)

Query: 26  EQLKEYIVEWRNQRAKEEEDLKRLKEKQ-----AKRKVMRAEEEQKMAQRKKEEEERRQR 80
           E+L+E +   R    + EE L++LK  +      + K+ + E E +    +K E  +   
Sbjct: 301 EELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLE 360

Query: 81  EVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLE 140
           E  ++ +  LEE  + LE+A ++ + +  A++E   +    +  ++  +  L     +LE
Sbjct: 361 ERLKELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELE 420

Query: 141 RNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQA 175
             + + +  EE+   L  +I  L  +   I +L  
Sbjct: 421 ELERELEELEEEIKKLEEQINQLESKELMIAELAG 455



 Score = 33.6 bits (77), Expect = 0.20
 Identities = 40/203 (19%), Positives = 85/203 (41%), Gaps = 7/203 (3%)

Query: 14  VKRQEAKSSALDEQLKEYIVEWRNQRAKEEED-----LKRLKEKQAKRKVMRAEEEQKMA 68
           +K  E + S L+E L+   +       +E E+     L++L  +    ++++A  E+   
Sbjct: 602 LKELEERLSQLEELLQSLELSEAENELEEAEEELESELEKLNLQAELEELLQAALEELEE 661

Query: 69  QRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRR 128
           + ++ E E R+     + +  LEEK + LE+ E++ + +   L+E   K       I+  
Sbjct: 662 KVEELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLREELEELLKKLGEIEQLIEEL 721

Query: 129 EPGLTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKAT-ELWETIVKL 187
           E          +  +  E+  E  +      +    +    +  L A+   E  E + KL
Sbjct: 722 ESRKAELEELKKELEKLEKALELLEELREK-LGKAGLRADILRNLLAQIEAEANEILSKL 780

Query: 188 ETDKYDLEERQKRQDYDVSKIII 210
             ++YDL     R+D +   +++
Sbjct: 781 SLNRYDLRRLTIRKDGNGGLVVV 803



 Score = 30.9 bits (70), Expect = 1.3
 Identities = 43/189 (22%), Positives = 88/189 (46%), Gaps = 5/189 (2%)

Query: 12  EFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRK 71
           E +K  +AK   L+ QL E + +  +     EE+LK LK+ +  ++    E+E++  +++
Sbjct: 178 EVIKEAKAKIEELEGQLSELLEDIEDLLEALEEELKELKKLEEIQE----EQEEEELEQE 233

Query: 72  KEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPG 131
            E  E R  E+EE+K+R LEE + RL E E      +   +E+  + +     ++ +   
Sbjct: 234 IEALEERLAELEEEKER-LEELKARLLEIESLELEALKIREEELRELERLLEELEEKIER 292

Query: 132 LTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKLETDK 191
           L     ++E  + + +        L   ++ L      + KL+ K  +L   + +L  +K
Sbjct: 293 LEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEK 352

Query: 192 YDLEERQKR 200
            +L +  + 
Sbjct: 353 NELAKLLEE 361



 Score = 30.1 bits (68), Expect = 2.0
 Identities = 34/168 (20%), Positives = 77/168 (45%), Gaps = 5/168 (2%)

Query: 34  EWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEK 93
               +  +  ++    K ++ +  +   EE ++  Q ++ +EE RQ E   ++ ++L E+
Sbjct: 518 IELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEE 577

Query: 94  RQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIEEEKK 153
            + L   +++ + +   LKE   K K     ++ R   L      LE ++ + ++EE ++
Sbjct: 578 LRLLRTRKEELEELRERLKELKKKLK----ELEERLSQLEELLQSLELSEAENELEEAEE 633

Query: 154 IALNIRIKPLNVEGWSIHKLQAKATELWETIVKLETDKYDLEERQKRQ 201
              +   + LN++      LQA   EL E + +LE +     +R + +
Sbjct: 634 ELESEL-EKLNLQAELEELLQAALEELEEKVEELEAEIRRELQRIENE 680


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 33.5 bits (77), Expect = 0.18
 Identities = 18/81 (22%), Positives = 42/81 (51%), Gaps = 1/81 (1%)

Query: 26  EQLKEYIVEWRNQRAKEEEDLKRLKE-KQAKRKVMRAEEEQKMAQRKKEEEERRQREVEE 84
           E+L+E I   + Q  + ++ +   +     KRK+    E        + +E++++++ EE
Sbjct: 280 EKLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEE 339

Query: 85  KKQRDLEEKRQRLEEAEKKRQ 105
           KK++ +E   +R+E+ E +  
Sbjct: 340 KKKKQIERLEERIEKLEVQAT 360


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 33.7 bits (77), Expect = 0.20
 Identities = 25/97 (25%), Positives = 50/97 (51%), Gaps = 10/97 (10%)

Query: 15  KRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLK--EKQAKRKVMRAEEEQKMAQRKK 72
           +++      L+ + K      + ++AKE+E LK+LK  +K+AK K+   +        KK
Sbjct: 9   EKKILTEEELERKKK------KEEKAKEKE-LKKLKAAQKEAKAKLQAQQASDGTNVPKK 61

Query: 73  EEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMA 109
            E++ R+R+VE++   D  +      + +K+  + MA
Sbjct: 62  SEKKSRKRDVEDENPEDFIDPDTPFGQ-KKRLSSQMA 97



 Score = 31.0 bits (70), Expect = 1.1
 Identities = 15/67 (22%), Positives = 36/67 (53%), Gaps = 6/67 (8%)

Query: 10 DPEFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQ 69
            E ++R++ K    +E+ KE   E +  +A ++E   +L+ +QA       ++ +K ++
Sbjct: 14 TEEELERKKKK----EEKAKEK--ELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKKSR 67

Query: 70 RKKEEEE 76
          ++  E+E
Sbjct: 68 KRDVEDE 74



 Score = 29.1 bits (65), Expect = 5.3
 Identities = 10/52 (19%), Positives = 27/52 (51%)

Query: 66  KMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNK 117
              + + E++   + E+E KK+++ + K + L++ +  ++   A L+ Q   
Sbjct: 2   SRTESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQAS 53


>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
            This model family represents the major capsid protein
           component of the heads (capsids) of bacteriophage HK97,
           phi-105, P27, and related phage. This model represents
           one of several analogous families lacking detectable
           sequence similarity. The gene encoding this component is
           typically located in an operon encoding the small and
           large terminase subunits, the portal protein and the
           prohead or maturation protease [Mobile and
           extrachromosomal element functions, Prophage functions].
          Length = 384

 Score = 33.1 bits (76), Expect = 0.20
 Identities = 15/68 (22%), Positives = 27/68 (39%)

Query: 39  RAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLE 98
           R    E  K L++     K+  AEEE+      KEE ++   E++  ++   E + +   
Sbjct: 5   REALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAA 64

Query: 99  EAEKKRQA 106
             E     
Sbjct: 65  SGEGGGGE 72



 Score = 32.3 bits (74), Expect = 0.40
 Identities = 17/109 (15%), Positives = 39/109 (35%), Gaps = 5/109 (4%)

Query: 49  LKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMM 108
           LKE +     +  E  +     K  E E  + E +      L+E+  +L+    + + ++
Sbjct: 1   LKELREALAELAKELRKLTEDEKLAEAEEEKAEYDA-----LKEEIDKLDAEIDRLEELL 55

Query: 109 AALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIEEEKKIALN 157
             L+ +   S       +  E     +       +  +    E++ AL+
Sbjct: 56  DELEAKPAASGEGGGGEEEEEEAKAEAAEFRAYLRGGDDALAEERKALS 104



 Score = 28.9 bits (65), Expect = 5.4
 Identities = 22/87 (25%), Positives = 35/87 (40%), Gaps = 4/87 (4%)

Query: 23  ALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREV 82
            L E L E   E R     E+E L   +E++A+   ++ E ++     + +  E    E+
Sbjct: 3   ELREALAELAKELRK--LTEDEKLAEAEEEKAEYDALKEEIDK--LDAEIDRLEELLDEL 58

Query: 83  EEKKQRDLEEKRQRLEEAEKKRQAMMA 109
           E K     E      EE E K +A   
Sbjct: 59  EAKPAASGEGGGGEEEEEEAKAEAAEF 85


>gnl|CDD|224340 COG1422, COG1422, Predicted membrane protein [Function unknown].
          Length = 201

 Score = 32.7 bits (75), Expect = 0.20
 Identities = 14/53 (26%), Positives = 27/53 (50%), Gaps = 2/53 (3%)

Query: 60  RAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALK 112
           + +E QKM +  ++E    Q   + KK + L+EK+  +   + +R+ M    K
Sbjct: 73  KMKELQKMMKEFQKEFREAQESGDMKKLKKLQEKQMEM--MDDQRELMKMQFK 123



 Score = 28.5 bits (64), Expect = 4.9
 Identities = 15/43 (34%), Positives = 26/43 (60%), Gaps = 2/43 (4%)

Query: 12  EFVKRQEAKSSALDEQLKEYIVEWRN-QRAKEEEDLKRLKEKQ 53
           + +  QE K   L + +KE+  E+R  Q + + + LK+L+EKQ
Sbjct: 66  KLLIDQE-KMKELQKMMKEFQKEFREAQESGDMKKLKKLQEKQ 107


>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
            biogenesis [Translation, ribosomal structure and
            biogenesis].
          Length = 1077

 Score = 33.6 bits (76), Expect = 0.21
 Identities = 14/56 (25%), Positives = 32/56 (57%)

Query: 49   LKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKR 104
            +K++  K ++   EE+++M   ++ +EE   ++ +E++QR  +      +E  KKR
Sbjct: 1017 IKDRIVKERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEMAKKR 1072


>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family.  This family of
           proteins contain a band 4.1 domain (pfam00373), at their
           amino terminus. This family represents the rest of these
           proteins.
          Length = 244

 Score = 32.8 bits (75), Expect = 0.23
 Identities = 19/71 (26%), Positives = 41/71 (57%), Gaps = 1/71 (1%)

Query: 50  KEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMA 109
           ++++ + ++ + EE+ + AQ++ EE E    E+EEK ++  EE+ Q LE+   + +    
Sbjct: 6   EQQELEERMEQMEEDMRRAQKELEEYEETALELEEKLKQ-EEEEAQLLEKKADELEEENR 64

Query: 110 ALKEQTNKSKG 120
            L+E+   S+ 
Sbjct: 65  RLEEEAAASEE 75


>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
           Provisional.
          Length = 1123

 Score = 33.4 bits (77), Expect = 0.24
 Identities = 17/98 (17%), Positives = 48/98 (48%), Gaps = 6/98 (6%)

Query: 11  PEFV--KRQEAKSSALDEQ---LKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQ 65
             FV  +  E    AL ++   LK+ +     ++A+ +   +  +++    + + AE E+
Sbjct: 132 GPFVPPEDPENLLHALQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEE 191

Query: 66  KMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKK 103
           K  + + + E+ ++ +  E  Q   +++++  ++A K+
Sbjct: 192 KQQELEAQLEQLQE-KAAETSQERKQKRKEITDQAAKR 228



 Score = 30.7 bits (70), Expect = 1.8
 Identities = 20/85 (23%), Positives = 45/85 (52%), Gaps = 6/85 (7%)

Query: 12  EFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRK 71
           E   R++A+S AL E  ++ +V      A E E+  + +E +A+ + ++ +  +   +RK
Sbjct: 159 ELQAREKAQSQALAEAQQQELVAL-EGLAAELEE--KQQELEAQLEQLQEKAAETSQERK 215

Query: 72  KEEEERRQREVEEKKQRDLEEKRQR 96
              ++R++   +  K+ +L E+  R
Sbjct: 216 ---QKRKEITDQAAKRLELSEEETR 237


>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
          Length = 425

 Score = 33.1 bits (77), Expect = 0.24
 Identities = 18/82 (21%), Positives = 36/82 (43%), Gaps = 13/82 (15%)

Query: 42  EEEDLKRLKEKQAKRKVMRAEEEQKMAQR------------KKEEEERRQREVEEKKQRD 89
              D+  L E   +R+ ++ E E+  A+R            K E+ E    EV+E K+  
Sbjct: 23  FPLDVDELLELDEERRELQTELEELQAERNALSKEIGQAKRKGEDAEALIAEVKELKEE- 81

Query: 90  LEEKRQRLEEAEKKRQAMMAAL 111
           ++     L+E E + + ++  +
Sbjct: 82  IKALEAELDELEAELEELLLRI 103


>gnl|CDD|219580 pfam07793, DUF1631, Protein of unknown function (DUF1631).  The
           members of this family are sequences derived from a
           group of hypothetical proteins expressed by certain
           bacterial species. The region concerned is approximately
           440 amino acid residues in length.
          Length = 729

 Score = 33.0 bits (76), Expect = 0.28
 Identities = 17/42 (40%), Positives = 27/42 (64%), Gaps = 3/42 (7%)

Query: 74  EEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQT 115
           E+ERR+ E+ E++ R+ EE R RLE A   RQ +   L+++ 
Sbjct: 459 EQERRRSELVEQRTREAEEGRARLELA---RQQVQQELEQRL 497


>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
           partitioning / Cytoskeleton].
          Length = 373

 Score = 32.7 bits (75), Expect = 0.31
 Identities = 16/82 (19%), Positives = 39/82 (47%), Gaps = 5/82 (6%)

Query: 41  KEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEA 100
           K   +    +  +A+      E ++K  ++ +E+E+R      E+ +++L E+R+ L   
Sbjct: 296 KNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKR-----LEELEQNLIEERKELNSK 350

Query: 101 EKKRQAMMAALKEQTNKSKGPN 122
            ++ Q  +  L+++  K K   
Sbjct: 351 LEEIQKKLEDLEKRLEKLKSNK 372



 Score = 27.7 bits (62), Expect = 9.8
 Identities = 16/69 (23%), Positives = 36/69 (52%), Gaps = 1/69 (1%)

Query: 50  KEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEE-AEKKRQAMM 108
            EK +  K       +++ + +  EEER  ++   +K R+ E++ + LE+   ++R+ + 
Sbjct: 289 TEKLSGLKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELN 348

Query: 109 AALKEQTNK 117
           + L+E   K
Sbjct: 349 SKLEEIQKK 357


>gnl|CDD|234173 TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB.  Members
           of this protein family are the bacterial ATP-dependent
           chaperone ClpB. This protein belongs to the AAA family,
           ATPases associated with various cellular activities
           (pfam00004). This molecular chaperone does not act as a
           protease, but rather serves to disaggregate misfolded
           and aggregated proteins [Protein fate, Protein folding
           and stabilization].
          Length = 852

 Score = 33.0 bits (76), Expect = 0.31
 Identities = 19/70 (27%), Positives = 40/70 (57%), Gaps = 3/70 (4%)

Query: 53  QAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEE---AEKKRQAMMA 109
           +  R++++ E E++  +++K+E  + + E  EK+  +LEE+   LEE   AEK     + 
Sbjct: 409 ELDRRIIQLEIEREALKKEKDEASKERLEDLEKELAELEEEYADLEEQWKAEKAAIQGIQ 468

Query: 110 ALKEQTNKSK 119
            +KE+  + +
Sbjct: 469 QIKEEIEQVR 478


>gnl|CDD|223562 COG0488, Uup, ATPase components of ABC transporters with duplicated
           ATPase domains [General function prediction only].
          Length = 530

 Score = 32.6 bits (75), Expect = 0.32
 Identities = 21/92 (22%), Positives = 40/92 (43%), Gaps = 7/92 (7%)

Query: 49  LKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMM 108
           L++K  + +   A  E++  +  KE+E  R+ +    K +  + + +RLE+       + 
Sbjct: 239 LEQKAERLRQEAAAYEKQQKELAKEQEWIRRGKAAASKAKKAKSRIKRLEK-------LE 291

Query: 109 AALKEQTNKSKGPNFTIQRREPGLTLSTAQLE 140
           A L E+    +G     +   PG  L    LE
Sbjct: 292 ARLAEERPVEEGKPLAFRFPPPGKRLGKLVLE 323



 Score = 29.9 bits (68), Expect = 2.2
 Identities = 15/77 (19%), Positives = 27/77 (35%), Gaps = 12/77 (15%)

Query: 39  RAKEEEDLKRLKEKQAKRKVMRAEEE-----------QKMAQ-RKKEEEERRQREVEEKK 86
             K E   +     + ++K +  E+E            K A+ R K  E+   R  EE+ 
Sbjct: 240 EQKAERLRQEAAAYEKQQKELAKEQEWIRRGKAAASKAKKAKSRIKRLEKLEARLAEERP 299

Query: 87  QRDLEEKRQRLEEAEKK 103
             + +    R     K+
Sbjct: 300 VEEGKPLAFRFPPPGKR 316


>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain.  Axonemal
           dynein light chain proteins play a dynamic role in
           flagellar and cilia motility. Eukaryotic cilia and
           flagella are complex organelles consisting of a core
           structure, the axoneme, which is composed of nine
           microtubule doublets forming a cylinder that surrounds a
           pair of central singlet microtubules. This
           ultra-structural arrangement seems to be one of the most
           stable micro-tubular assemblies known and is responsible
           for the flagellar and ciliary movement of a large number
           of organisms ranging from protozoan to mammals. This
           light chain interacts directly with the N-terminal half
           of the heavy chains.
          Length = 189

 Score = 31.8 bits (73), Expect = 0.38
 Identities = 19/70 (27%), Positives = 39/70 (55%), Gaps = 2/70 (2%)

Query: 52  KQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKR--QRLEEAEKKRQAMMA 109
           K  + +  ++E EQ++ + ++E+EE  +R  E + + +  EKR  +  +  EK+    +A
Sbjct: 114 KALQAEQGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHADEIA 173

Query: 110 ALKEQTNKSK 119
            LK+Q  + K
Sbjct: 174 FLKKQNQQLK 183


>gnl|CDD|218661 pfam05622, HOOK, HOOK protein.  This family consists of several
           HOOK1, 2 and 3 proteins from different eukaryotic
           organisms. The different members of the human gene
           family are HOOK1, HOOK2 and HOOK3. Different domains
           have been identified in the three human HOOK proteins,
           and it was demonstrated that the highly conserved
           NH2-domain mediates attachment to microtubules, whereas
           the central coiled-coil motif mediates homodimerisation
           and the more divergent C-terminal domains are involved
           in binding to specific organelles (organelle-binding
           domains). It has been demonstrated that endogenous HOOK3
           binds to Golgi membranes, whereas both HOOK1 and HOOK2
           are localised to discrete but unidentified cellular
           structures. In mice the Hook1 gene is predominantly
           expressed in the testis. Hook1 function is necessary for
           the correct positioning of microtubular structures
           within the haploid germ cell. Disruption of Hook1
           function in mice causes abnormal sperm head shape and
           fragile attachment of the flagellum to the sperm head.
          Length = 713

 Score = 32.5 bits (74), Expect = 0.38
 Identities = 41/156 (26%), Positives = 67/156 (42%), Gaps = 26/156 (16%)

Query: 57  KVMRAEEEQKMAQRKKEEEERR-----QREVEEKKQR------DLEEKRQRLEEAEKKRQ 105
           K++R + E KM +  +E  E       Q+ +E+  +R       L    QR+ E +++ +
Sbjct: 460 KLIRLQHENKMLRLGQEGSENERITELQQLLEDANRRNNELETQLRLANQRILELQQQVE 519

Query: 106 AMMAALKEQTNKSKGPNFTIQRREPGL-TLSTAQLERNKTKEQIEEEKKIALNIRIKPLN 164
            +  AL+EQ +KS+  +    + E  L  L  A  E  K +EQIEE +            
Sbjct: 520 DLQKALQEQGSKSEDSSLLKSKLEEHLEQLHEANEELQKKREQIEELEPDQDQ------- 572

Query: 165 VEGWSIHKLQAKATELWETIVKLETDKYDLEERQKR 200
                   L  K  EL   + K + D   +EER K+
Sbjct: 573 -------NLSRKIAELEAALQKKDEDMRAMEERYKK 601


>gnl|CDD|215364 PLN02678, PLN02678, seryl-tRNA synthetase.
          Length = 448

 Score = 32.4 bits (74), Expect = 0.38
 Identities = 21/85 (24%), Positives = 47/85 (55%), Gaps = 6/85 (7%)

Query: 8  GGDPEFVKRQEAKSSALDEQLKEYIV---EWRNQRAKEEE---DLKRLKEKQAKRKVMRA 61
          GGDPE ++  + +  A  E + E I    EWR ++ + +    +  +L ++ AK K+ + 
Sbjct: 12 GGDPELIRESQRRRFASVELVDEVIALDKEWRQRQFELDSLRKEFNKLNKEVAKLKIAKE 71

Query: 62 EEEQKMAQRKKEEEERRQREVEEKK 86
          +  + +A+ K+ ++E  ++E E ++
Sbjct: 72 DATELIAETKELKKEITEKEAEVQE 96


>gnl|CDD|148617 pfam07111, HCR, Alpha helical coiled-coil rod protein (HCR).  This
           family consists of several mammalian alpha helical
           coiled-coil rod HCR proteins. The function of HCR is
           unknown but it has been implicated in psoriasis in
           humans and is thought to affect keratinocyte
           proliferation.
          Length = 739

 Score = 32.7 bits (74), Expect = 0.39
 Identities = 39/148 (26%), Positives = 68/148 (45%), Gaps = 8/148 (5%)

Query: 12  EFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRK 71
           E + RQ  +   L+E+++        Q+ + E   + L+      K  RAE E   A   
Sbjct: 66  ELISRQLQELRRLEEEVRSLRETSLQQKMRLEAQAEELEALAVAEKAGRAEAEGLRAALA 125

Query: 72  KEEEERRQREVEEKKQRDLEE-KRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQ---- 126
             EE R+   +EE  Q++LEE +R   E+     QA + AL    +K++G   ++Q    
Sbjct: 126 GAEEVRKN--LEEGGQQELEEAQRLHQEQLSSLTQAHLEALSSLRSKAEGLEKSLQSLET 183

Query: 127 -RREPGLTLSTAQLERNKTKEQIEEEKK 153
            R      L+ AQ E +  +EQ+ + ++
Sbjct: 184 RRAGEAKALAAAQAEADTLREQLSKTQE 211



 Score = 31.9 bits (72), Expect = 0.54
 Identities = 30/109 (27%), Positives = 53/109 (48%), Gaps = 1/109 (0%)

Query: 3   TDVAGGGDPEFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAE 62
           T+ A     E  ++QE    AL E++ E     R Q +  E  L   + + AK  V   +
Sbjct: 555 TEEAASLRQELTQQQEVYGQALQEKVAEVETRLREQLSDTERRLNEARREHAKAVVSLRQ 614

Query: 63  EEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAAL 111
            +++ AQ K+  +E R R  EE ++ + +   +RL+E E+ +  M+  L
Sbjct: 615 IQRQAAQEKERNQELR-RLQEEARKEEGQRLSRRLQELERDKNLMLQRL 662


>gnl|CDD|223024 PHA03252, PHA03252, DNA packaging tegument protein UL25;
           Provisional.
          Length = 589

 Score = 32.4 bits (74), Expect = 0.39
 Identities = 15/59 (25%), Positives = 32/59 (54%), Gaps = 1/59 (1%)

Query: 43  EEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAE 101
           EEDL+RL++  +  ++ R  E+    +  +        +++++ Q + E+ R R+ EAE
Sbjct: 28  EEDLRRLRD-DSALRLRRYREDLLRDRLLRRRLGEELDDLQKRLQTECEDLRSRVSEAE 85


>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
          Length = 1113

 Score = 32.2 bits (74), Expect = 0.48
 Identities = 30/113 (26%), Positives = 56/113 (49%), Gaps = 16/113 (14%)

Query: 43  EEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEK 102
           E D++   +   K+K+   E E K+ Q +  E+     +  ++++ + E+ +Q+L +A  
Sbjct: 38  EADVQAQLDALNKQKL--LEAEDKLVQ-QDLEQTLALLDKIDRQKEETEQLKQQLAQAPA 94

Query: 103 K-RQAM--MAALKEQTNKSKGPNFTIQRREPGLTLSTAQLER--NKTKEQIEE 150
           K RQA   + ALK+  ++          RE   TLS  QLE    +T +Q++ 
Sbjct: 95  KLRQAQAELEALKDDNDEET--------RETLSTLSLRQLESRLAQTLDQLQN 139


>gnl|CDD|113902 pfam05149, Flagellar_rod, Paraflagellar rod protein.  This family
           consists of several eukaryotic paraflagellar rod
           component proteins. The eukaryotic flagellum represents
           one of the most complex macromolecular structures found
           in any organism and contains more than 250 proteins. In
           addition to its locomotive role, the flagellum is
           probably involved in nutrient uptake since receptors for
           host low-density lipoproteins are localised on the
           flagellar membrane as well as on the flagellar pocket
           membrane.
          Length = 289

 Score = 31.9 bits (73), Expect = 0.48
 Identities = 20/85 (23%), Positives = 47/85 (55%), Gaps = 2/85 (2%)

Query: 25  DEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEE 84
            + +   I     Q++  EEDL+R KEK+ +     A++ ++ + +++E E+  Q+ VE+
Sbjct: 1   TKDISAVIEAKHRQKSACEEDLERCKEKREEEDAADAKQRKRYSAQRRESEKFLQQNVEQ 60

Query: 85  KKQ--RDLEEKRQRLEEAEKKRQAM 107
           ++   R ++E  + L +  ++R+  
Sbjct: 61  QQACWRAIQELERELRDLAEERREE 85


>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon. 
          Length = 431

 Score = 31.9 bits (72), Expect = 0.53
 Identities = 29/84 (34%), Positives = 51/84 (60%), Gaps = 3/84 (3%)

Query: 36  RNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQ 95
           + ++ +   +L+ LK+K+ +R+ +  EEEQ+   RK+EE +R+ RE EEK++   E +R+
Sbjct: 206 KQKQQEAALELEELKKKREERRKVLEEEEQR---RKQEEADRKSREEEEKRRLKEEIERR 262

Query: 96  RLEEAEKKRQAMMAALKEQTNKSK 119
           R E AEK+++     L E     K
Sbjct: 263 RAEAAEKRQKVPEDGLSEDKKPFK 286


>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
          (DUF2058).  This domain, found in various prokaryotic
          proteins, has no known function.
          Length = 177

 Score = 31.0 bits (71), Expect = 0.57
 Identities = 18/60 (30%), Positives = 34/60 (56%), Gaps = 1/60 (1%)

Query: 38 QRAKEEEDLKRLKEKQAKRKVMRAEEEQKM-AQRKKEEEERRQREVEEKKQRDLEEKRQR 96
          ++AK+ +  KR + KQA++     ++E K  A+  K E+  R RE+  ++Q + E+K   
Sbjct: 15 KKAKKAKKEKRKQRKQARKGADDGDDELKQAAEEAKAEKAERDRELNRQRQAEAEQKAIV 74


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
           approximately 300 residues, found in plants and
           vertebrates. They contain a highly conserved DDRGK
           motif.
          Length = 189

 Score = 31.2 bits (71), Expect = 0.63
 Identities = 22/65 (33%), Positives = 42/65 (64%)

Query: 42  EEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAE 101
             +   +L+EKQA+R+   AEEE++  ++K EE+   +R+ EE+ + + E+K++  E  E
Sbjct: 4   GAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKE 63

Query: 102 KKRQA 106
           ++ QA
Sbjct: 64  REEQA 68



 Score = 30.8 bits (70), Expect = 0.79
 Identities = 22/57 (38%), Positives = 38/57 (66%)

Query: 36 RNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEE 92
          R QR  EEE+ +  K+ + KR+  R EEE+   +R+K++EE  ++E EE+ +++ EE
Sbjct: 18 RQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEE 74


>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
          Length = 1068

 Score = 31.9 bits (73), Expect = 0.65
 Identities = 19/98 (19%), Positives = 46/98 (46%), Gaps = 6/98 (6%)

Query: 36  RNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKM---AQRKKEEEERRQREVEEKKQRDLEE 92
             +  +E  +  R   +QA+++     E Q+     + + ++E+++    E +++R+ +E
Sbjct: 628 TRREGRENREENRRNRRQAQQQTAETRESQQAEVTEKARTQDEQQQAPRRERQRRRN-DE 686

Query: 93  KRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREP 130
           KRQ  +EA  K   +     ++T + +       RR+ 
Sbjct: 687 KRQAQQEA--KALNVEEQSVQETEQEERVQQVQPRRKQ 722



 Score = 30.0 bits (68), Expect = 2.6
 Identities = 21/68 (30%), Positives = 35/68 (51%), Gaps = 6/68 (8%)

Query: 36  RNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQ 95
           R +R +   D KR  + Q + K +  EE+        E+EER Q+    +KQR L +K  
Sbjct: 676 RRERQRRRNDEKR--QAQQEAKALNVEEQSVQET---EQEERVQQVQPRRKQRQLNQK-V 729

Query: 96  RLEEAEKK 103
           R+E++  +
Sbjct: 730 RIEQSVAE 737


>gnl|CDD|181619 PRK09039, PRK09039, hypothetical protein; Validated.
          Length = 343

 Score = 31.5 bits (72), Expect = 0.66
 Identities = 22/127 (17%), Positives = 48/127 (37%), Gaps = 8/127 (6%)

Query: 17  QEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEE 76
           +   +  L + +       R   +  E +  RL+   A+     A  E +  +  +E + 
Sbjct: 72  ERQGNQDLQDSV----ANLRASLSAAEAERSRLQALLAELAGAGAAAEGRAGELAQELDS 127

Query: 77  RRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLST 136
            +Q  V  +    +E   Q++    ++  A+ AAL     + +     I   + G  L+ 
Sbjct: 128 EKQ--VSARALAQVELLNQQIAALRRQLAALEAALDASEKRDRESQAKIA--DLGRRLNV 183

Query: 137 AQLERNK 143
           A  +R +
Sbjct: 184 ALAQRVQ 190


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 31.9 bits (72), Expect = 0.69
 Identities = 18/45 (40%), Positives = 25/45 (55%), Gaps = 8/45 (17%)

Query: 308 GERPGKKKDAPESPEEEEVKRDDEEEENPIYLKISDDEEEKEEEN 352
           G       D+ E  EEEE + ++EEEE        ++EEE+EEEN
Sbjct: 854 GGGGSDGGDSEEEEEEEEEEEEEEEEE--------EEEEEEEEEN 890


>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
           protein TraG; Provisional.
          Length = 942

 Score = 31.6 bits (72), Expect = 0.71
 Identities = 20/126 (15%), Positives = 42/126 (33%), Gaps = 19/126 (15%)

Query: 5   VAGGGDPEFVKR----------QEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQA 54
           V  GG  + +            Q  + S +   +K  +    N   + E ++   +    
Sbjct: 826 VPSGGGSQDIIADHQGHQAIIEQRTQDSGIRNDVKHQV---DNMVTEYEGNIGDTQNS-- 880

Query: 55  KRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQ 114
               +R EE     Q  + +   +   + +  + + E+  Q         + +M   KE 
Sbjct: 881 ----IRGEENTVKGQYSELQNHHKTEALSQNNKYNEEKSAQERMPGADSPEELMKRAKEY 936

Query: 115 TNKSKG 120
            +K KG
Sbjct: 937 QDKHKG 942


>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 140

 Score = 30.2 bits (69), Expect = 0.74
 Identities = 10/78 (12%), Positives = 44/78 (56%)

Query: 40  AKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEE 99
            + ++    L++ + + + ++ + ++  A   +   E++++E+++K Q    ++++  ++
Sbjct: 25  KEFKKRQAELEKLEKELQKLKEKLQKDAATLSEAAREKKEKELQKKVQEFQRKQQKLQQD 84

Query: 100 AEKKRQAMMAALKEQTNK 117
            +K++Q  +  + ++ NK
Sbjct: 85  LQKRQQEELQKILDKINK 102


>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 429

 Score = 31.4 bits (72), Expect = 0.79
 Identities = 24/112 (21%), Positives = 47/112 (41%), Gaps = 26/112 (23%)

Query: 10  DPEFVKR----------QEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVM 59
           +P+ V+              K   LDE+ ++ + E              L+E QA+R  +
Sbjct: 10  NPDAVREKLKKRGGDALDVDKLLELDEERRKLLRE--------------LEELQAERNEL 55

Query: 60  RAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAAL 111
             E  + + + + + EE    EV+E K+  L+E    L+E E +   ++  +
Sbjct: 56  SKEIGRALKRGEDDAEELIA-EVKELKE-KLKELEAALDELEAELDTLLLTI 105


>gnl|CDD|233065 TIGR00634, recN, DNA repair protein RecN.  All proteins in this
           family for which functions are known are ATP binding
           proteins involved in the initiation of recombination and
           recombinational repair [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 563

 Score = 31.6 bits (72), Expect = 0.80
 Identities = 21/74 (28%), Positives = 37/74 (50%), Gaps = 7/74 (9%)

Query: 43  EEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRD-----LEEKRQRL 97
            E  +   + + + K  R ++EQ++AQR  +  + +  E+EE   +      LE ++QRL
Sbjct: 164 RELYQAWLKARQQLK-DRQQKEQELAQRL-DFLQFQLEELEEADLQPGEDEALEAEQQRL 221

Query: 98  EEAEKKRQAMMAAL 111
              EK R+    AL
Sbjct: 222 SNLEKLRELSQNAL 235


>gnl|CDD|151935 pfam11498, Activator_LAG-3, Transcriptional activator LAG-3.  The
           C.elegans Notch pathway, involved in the control of
           growth, differentiation and patterning in animal
           development, relies on either of the receptors GLP-1 or
           LIN-12. Both these receptors promote signalling by the
           recruitment of LAG-3 to target promoters, where it then
           acts as a transcriptional activator. LAG-3 works as a
           ternary complex together with the DNA binding protein,
           LAG-1.
          Length = 476

 Score = 31.1 bits (69), Expect = 0.91
 Identities = 21/109 (19%), Positives = 57/109 (52%), Gaps = 5/109 (4%)

Query: 47  KRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQA 106
           + + +   ++  MR  ++Q+M  ++ E++ ++Q   + ++Q+  E ++Q++     ++Q 
Sbjct: 320 QHIAQLAQQQNKMRLLQQQEMEMQRIEQQRQQQIMHQHQQQQQQEHQQQQM---LLQQQQ 376

Query: 107 MMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIEEEKKIA 155
            M  L++    + G  F  Q  +    L   Q++  + +EQI+ +++ A
Sbjct: 377 QMHQLQQHHQMNGGGQFATQAHQHAAYLQ--QMQHMRLQEQIQHQQQQA 423


>gnl|CDD|129911 TIGR00831, a_cpa1, Na+/H+ antiporter, bacterial form.  The
           Monovalent Cation:Proton Antiporter-1 (CPA1) Family (TC
           2.A.36) The CPA1 family is a large family of proteins
           derived from Gram-positive and Gram-negative bacteria,
           blue green bacteria, yeast, plants and animals.
           Transporters from eukaryotes have been functionally
           characterized, and all of these catalyze Na+:H+
           exchange. Their primary physiological functions may be
           in (1) cytoplasmic pH regulation, extruding the H+
           generated during metabolism, and (2) salt tolerance (in
           plants), due to Na+ uptake into vacuoles. This model is
           specific for the bacterial members of this family
           [Transport and binding proteins, Cations and iron
           carrying compounds].
          Length = 525

 Score = 31.4 bits (71), Expect = 0.92
 Identities = 25/123 (20%), Positives = 45/123 (36%), Gaps = 25/123 (20%)

Query: 38  QRAKEEEDLKRLKEKQ---------AKRKVMRAEEEQKMA-----QRKKEEEER---RQR 80
           +R    E  +R  E+          AK  +M+A E+ ++      +   E + R    + 
Sbjct: 405 KRKFVSEHSERELEEIIARYIAARSAKFALMKAVEQLRIVEPVARELLPELDARIEELRA 464

Query: 81  EVEEKKQRDLEE----KRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLST 136
           + EEK +  + E    +R RL   + KR A++                +      L L  
Sbjct: 465 DGEEKIRSGMGEKNLRRRARLYVLDAKRSAVVDLR----AGGLISQEVLLELMRELDLKE 520

Query: 137 AQL 139
           A+L
Sbjct: 521 AEL 523


>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family.  This family of proteins includes
           MND1 from S. cerevisiae. The mnd1 protein forms a
           complex with hop2 to promote homologous chromosome
           pairing and meiotic double-strand break repair.
          Length = 188

 Score = 30.3 bits (69), Expect = 1.0
 Identities = 20/81 (24%), Positives = 44/81 (54%)

Query: 39  RAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLE 98
           + + E+  K L+E + +   ++A+ E+    R++ EE     E  ++ +++L++ +  LE
Sbjct: 68  KTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLEELKQLEKELKKLKAELE 127

Query: 99  EAEKKRQAMMAALKEQTNKSK 119
           + EK     +  LKE+T  +K
Sbjct: 128 KYEKNDPERIEKLKEETKVAK 148


>gnl|CDD|114015 pfam05266, DUF724, Protein of unknown function (DUF724).  This
           family contains several uncharacterized proteins found
           in Arabidopsis thaliana and other plants. This region is
           often found associated with Agenet domains and may
           contain coiled-coil.
          Length = 190

 Score = 30.1 bits (68), Expect = 1.1
 Identities = 22/80 (27%), Positives = 49/80 (61%), Gaps = 2/80 (2%)

Query: 36  RNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMA--QRKKEEEERRQREVEEKKQRDLEEK 93
           ++ + K+ E+ K L+++ A++++ R E + ++A  +RK  E +R+   ++EKK+ + +E 
Sbjct: 102 KDDQTKKLEERKGLEKEIAEKEISRQELDSEIAELERKILELQRQAALLKEKKEAEDKEI 161

Query: 94  RQRLEEAEKKRQAMMAALKE 113
            +   EA K +Q ++ A  E
Sbjct: 162 ARLKSEASKIKQELVDAELE 181



 Score = 27.8 bits (62), Expect = 6.4
 Identities = 16/83 (19%), Positives = 34/83 (40%), Gaps = 6/83 (7%)

Query: 44  EDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKK 103
             L  LK+ Q K+   R   E+++A+++   +E       + +  +LE K   L+     
Sbjct: 96  NKLLSLKDDQTKKLEERKGLEKEIAEKEISRQEL------DSEIAELERKILELQRQAAL 149

Query: 104 RQAMMAALKEQTNKSKGPNFTIQ 126
            +    A  ++  + K     I+
Sbjct: 150 LKEKKEAEDKEIARLKSEASKIK 172


>gnl|CDD|132364 TIGR03321, alt_F1F0_F0_B, alternate F1F0 ATPase, F0 subunit B.  A
           small number of taxonomically diverse prokaryotic
           species, including Methanosarcina barkeri, have what
           appears to be a second ATP synthase, in addition to the
           normal F1F0 ATPase in bacteria and A1A0 ATPase in
           archaea. These enzymes use ion gradients to synthesize
           ATP, CC and in principle may run in either direction.
           This model represents the F0 subunit B of this apparent
           second ATP synthase.
          Length = 246

 Score = 30.4 bits (69), Expect = 1.3
 Identities = 20/93 (21%), Positives = 47/93 (50%)

Query: 23  ALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREV 82
           A+D + K+   E  +   K+ E  +  +E + K + +  + E  + + K+E +  RQR +
Sbjct: 33  AMDAREKKIAGELADADTKKREAEQERREYEEKNEELDQQREVLLTKAKEEAQAERQRLL 92

Query: 83  EEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQT 115
           +E ++   E + +  E   +++ A+   L+ +T
Sbjct: 93  DEAREEADEIREKWQEALRREQAALSDELRRRT 125


>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
          Length = 1021

 Score = 30.9 bits (69), Expect = 1.3
 Identities = 22/85 (25%), Positives = 50/85 (58%), Gaps = 3/85 (3%)

Query: 20  KSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQ 79
           K +A  + L+  I+E +     E E+ +RL+ ++ +R + R   E++  +R++ E +R +
Sbjct: 442 KENAHRKALEMKILEKKRIERLEREERERLERERMER-IERERLERERLERERLERDRLE 500

Query: 80  REVEEKKQRDLEEK--RQRLEEAEK 102
           R+  ++ +R+  ++  R RLE+A +
Sbjct: 501 RDRLDRLERERVDRLERDRLEKARR 525



 Score = 30.1 bits (67), Expect = 2.5
 Identities = 21/67 (31%), Positives = 41/67 (61%), Gaps = 3/67 (4%)

Query: 41  KEEEDLKRLKEKQAKRKV--MRAEEEQKMAQRKKEEEERRQREVEEKKQRD-LEEKRQRL 97
           K+  +  R++++ A RK   M+  E++++ + ++EE ER +RE  E+ +R+ LE +R   
Sbjct: 432 KDHAERARIEKENAHRKALEMKILEKKRIERLEREERERLERERMERIERERLERERLER 491

Query: 98  EEAEKKR 104
           E  E+ R
Sbjct: 492 ERLERDR 498


>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD.  This
           model describes the DndB protein encoded by an operon
           associated with a sulfur-containing modification to DNA.
           The operon is sporadically distributed in bacteria, much
           like some restriction enzyme operons. DndD is described
           as a putative ATPase. The small number of examples known
           so far include species from among the Firmicutes,
           Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
           metabolism, Restriction/modification].
          Length = 650

 Score = 30.8 bits (70), Expect = 1.3
 Identities = 37/148 (25%), Positives = 58/148 (39%), Gaps = 23/148 (15%)

Query: 9   GDPEFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMA 68
           GD   V R+  KS      L E            E     LKE+  K +     +E    
Sbjct: 189 GDLTNVLRRRKKSELPSSILSEI-----------EALEAELKEQSEKYED--LAQEIAHL 235

Query: 69  QRKKEEEERRQREVEEKKQR---DLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTI 125
           + + EE +R    +E+K +    DL E+R++LE   K+ +A   A + Q  +        
Sbjct: 236 RNELEEAQRSLESLEKKFRSEGGDLFEEREQLERQLKEIEAARKANRAQLRE------LA 289

Query: 126 QRREPGLTLSTAQLERNKTKEQIEEEKK 153
               P L L    L+  K + Q EE+ +
Sbjct: 290 ADPLP-LLLIPNLLDSTKAQLQKEEQSQ 316


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
           and chromosome partitioning].
          Length = 420

 Score = 30.8 bits (70), Expect = 1.3
 Identities = 24/112 (21%), Positives = 45/112 (40%), Gaps = 7/112 (6%)

Query: 14  VKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKE 73
           +    A+ +A   +L   + E R Q+AK  + L+  K+  A+     + +++K+ + +  
Sbjct: 173 LAAVRAEIAAEQAELTTLLSEQRAQQAKLAQLLEERKKTLAQLNSELSADQKKLEELRAN 232

Query: 74  EEERRQR----EVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGP 121
           E   +      E    K R   E     E A  + +A  A    +T K   P
Sbjct: 233 ESRLKNEIASAEAAAAKAR---EAAAAAEAAAARARAAEAKRTGETYKPTAP 281



 Score = 29.7 bits (67), Expect = 2.6
 Identities = 37/187 (19%), Positives = 66/187 (35%), Gaps = 22/187 (11%)

Query: 14  VKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKE 73
           +K  E + ++L+ QL E   + +  R +  +   RL           A E Q+  QR++ 
Sbjct: 68  LKSLETEIASLEAQLIETADDLKKLRKQIADLNARLN----------ALEVQEREQRRRL 117

Query: 74  EEE----RRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQR-- 127
            E+    +R             E  QR         A+  A  E+ +  K     +    
Sbjct: 118 AEQLAALQRSGRNPPPALLVSPEDAQRSVRLAIYYGALNPARAERIDALKATLKQLAAVR 177

Query: 128 ------REPGLTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELW 181
                 +    TL + Q  +     Q+ EE+K  L      L+ +   + +L+A  + L 
Sbjct: 178 AEIAAEQAELTTLLSEQRAQQAKLAQLLEERKKTLAQLNSELSADQKKLEELRANESRLK 237

Query: 182 ETIVKLE 188
             I   E
Sbjct: 238 NEIASAE 244



 Score = 29.3 bits (66), Expect = 3.1
 Identities = 9/67 (13%), Positives = 24/67 (35%), Gaps = 1/67 (1%)

Query: 87  QRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLER-NKTK 145
           Q+++    +++ E + +R  +   LK    +       +      L     Q+   N   
Sbjct: 44  QKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKLRKQIADLNARL 103

Query: 146 EQIEEEK 152
             +E ++
Sbjct: 104 NALEVQE 110


>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
           [Energy production and conversion].
          Length = 470

 Score = 30.6 bits (69), Expect = 1.3
 Identities = 23/89 (25%), Positives = 38/89 (42%), Gaps = 4/89 (4%)

Query: 22  SALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQRE 81
             L E++K+ I E    +   EED   + E    R+ ++ E+   + +R  EEEE  + E
Sbjct: 356 KELKERVKDAIPEDLYDKIATEEDATTIDEL---REFLK-EKGHPVVERWAEEEEEEEEE 411

Query: 82  VEEKKQRDLEEKRQRLEEAEKKRQAMMAA 110
            EE+         + +   E     M AA
Sbjct: 412 EEEEAAEAEAPMEEPVPGFEVPEMPMPAA 440


>gnl|CDD|233503 TIGR01642, U2AF_lg, U2 snRNP auxilliary factor, large subunit,
           splicing factor.  These splicing factors consist of an
           N-terminal arginine-rich low complexity domain followed
           by three tandem RNA recognition motifs (pfam00076). The
           well-characterized members of this family are auxilliary
           components of the U2 small nuclear ribonuclearprotein
           splicing factor (U2AF). These proteins are closely
           related to the CC1-like subfamily of splicing factors
           (TIGR01622). Members of this subfamily are found in
           plants, metazoa and fungi.
          Length = 509

 Score = 30.6 bits (69), Expect = 1.4
 Identities = 26/140 (18%), Positives = 51/140 (36%), Gaps = 9/140 (6%)

Query: 42  EEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAE 101
           EE D  R +EK   R   R+ E  +   R +     R R   E+  R+    R R     
Sbjct: 3   EEPD--REREKSRGRDRDRSSERPRRRSRDRSRFRDRHRRSRERSYREDSRPRDRRRYDS 60

Query: 102 KKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIEEEKKIALNIRIK 161
           +  +++         +S+      +RR   +        R + +    + +K      + 
Sbjct: 61  RSPRSLR---YSSVRRSRDR---PRRRSRSVRSIEQHRRRLRDRSPSNQWRKDDKKRSLW 114

Query: 162 PLNVEGW-SIHKLQAKATEL 180
            +   G+  +   QAKA+++
Sbjct: 115 DIKPPGYELVTADQAKASQV 134


>gnl|CDD|239286 cd02988, Phd_like_VIAF, Phosducin (Phd)-like family, Viral
           inhibitor of apoptosis (IAP)-associated factor (VIAF)
           subfamily; VIAF is a Phd-like protein that functions in
           caspase activation during apoptosis. It was identified
           as an IAP binding protein through a screen of a human
           B-cell library using a prototype IAP. VIAF lacks a
           consensus IAP binding motif and while it does not
           function as an IAP antagonist, it still plays a
           regulatory role in the complete activation of caspases.
           VIAF itself is a substrate for IAP-mediated
           ubiquitination, suggesting that it may be a target of
           IAPs in the prevention of cell death. The similarity of
           VIAF to Phd points to a potential role distinct from
           apoptosis regulation. Phd functions as a cytosolic
           regulator of G protein by specifically binding to G
           protein betagamma (Gbg)-subunits. The C-terminal domain
           of Phd adopts a thioredoxin fold, but it does not
           contain a CXXC motif. Phd interacts with G protein beta
           mostly through the N-terminal helical domain.
          Length = 192

 Score = 29.9 bits (68), Expect = 1.4
 Identities = 14/54 (25%), Positives = 23/54 (42%), Gaps = 4/54 (7%)

Query: 298 RQKTKLPKWFGERPGKKKDAPESPEEEEVKRDDEEEENPIYLKISDDEEEKEEE 351
           R+K  LP     +P   K+  E   E  ++   E       L   D+E ++EE+
Sbjct: 8   RKKGILP----PKPPSPKEEEEEALELAIQEAHENALEKKLLDELDEELDEEED 57


>gnl|CDD|215064 PLN00122, PLN00122, serine/threonine protein phosphatase 2A;
           Provisional.
          Length = 170

 Score = 29.7 bits (67), Expect = 1.5
 Identities = 16/32 (50%), Positives = 21/32 (65%), Gaps = 4/32 (12%)

Query: 69  QRKKEEEERRQREVEEKKQRDLEEKRQRLEEA 100
            RK EE+E + +EVEEK+    E   +RLEEA
Sbjct: 135 LRKFEEDEAKAKEVEEKR----EATWKRLEEA 162


>gnl|CDD|225177 COG2268, COG2268, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 548

 Score = 30.6 bits (69), Expect = 1.6
 Identities = 31/199 (15%), Positives = 69/199 (34%), Gaps = 16/199 (8%)

Query: 10  DPEFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQ 69
           D E  + +  K + +         +      +++   K  ++ +   K++ AE E +   
Sbjct: 222 DAEIAENEAEKETEIAIAEANRDAKLVELEVEQQPAGKTAEQTRE-VKIILAETEAE-VA 279

Query: 70  RKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRRE 129
             K E  R   + E   ++ ++E++ + E+  +  +A+ A                +++E
Sbjct: 280 AWKAETRREAEQAEILAEQAIQEEKAQAEQEVQHAKALEAREMRVGLIE-------RQKE 332

Query: 130 PGLTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKLET 189
             L         N  + Q +EE K A NI               +A      ET      
Sbjct: 333 TELEPQERSYFINAAQRQAQEEAKAAANIAEAIGAQA-------EAAVETARETEEAERA 385

Query: 190 DKYDLEERQKRQDYDVSKI 208
           ++  L    +  + +  +I
Sbjct: 386 EQAALVAAAEAAEQEQVEI 404



 Score = 28.6 bits (64), Expect = 6.2
 Identities = 23/150 (15%), Positives = 48/150 (32%), Gaps = 11/150 (7%)

Query: 11  PEFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQR 70
            +  K  EA+   +    ++   E   Q      +  + + ++  +      E       
Sbjct: 311 VQHAKALEAREMRVGLIERQKETELEPQERSYFINAAQRQAQEEAKAAANIAEAIGAQAE 370

Query: 71  KKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREP 130
              E  R   E E  +Q  L        E E+   A+ A   +   +++           
Sbjct: 371 AAVETARETEEAERAEQAALVAA-AEAAEQEQVEIAVRAEAAKAEAEAQ--------AAE 421

Query: 131 GLTLSTAQLERNKTKEQIEEEKKIALNIRI 160
                 A+  R K K + E ++ +A  I++
Sbjct: 422 IKAE--AEAIREKGKAEAEAKRALAEAIQV 449


>gnl|CDD|239571 cd03489, Topoisomer_IB_N_LdtopoI_like,
           Topoisomer_IB_N_LdtopoI_like: N-terminal DNA binding
           fragment found in eukaryotic DNA topoisomerase (topo) IB
           proteins similar to the heterodimeric topo I from
           Leishmania donvanni. Topo I enzymes are divided into:
           topo type IA (bacterial) and type IB (eukaryotic). Topo
           I relaxes superhelical tension in duplex DNA by creating
           a single-strand nick, the broken strand can then rotate
           around the unbroken strand to remove DNA supercoils and,
           the nick is religated, liberating topo I. These enzymes
           regulate the topological changes that accompany DNA
           replication, transcription and other nuclear processes. 
           Human topo I is the target of a diverse set of
           anticancer drugs including camptothecins (CPTs). CPTs
           bind to the topo I-DNA complex and inhibit re-ligation
           of the single-strand nick, resulting in the accumulation
           of topo I-DNA adducts. In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topo I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topo I play putative roles in
           organizing the kinetoplast DNA network unique to these
           parasites.  This family may represent more than one
           structural domain.
          Length = 212

 Score = 29.8 bits (67), Expect = 1.6
 Identities = 16/43 (37%), Positives = 22/43 (51%), Gaps = 4/43 (9%)

Query: 76  ERRQREVEEKKQRDLEE----KRQRLEEAEKKRQAMMAALKEQ 114
           E   RE E+KK R  EE    K ++ +EAE     +   +KEQ
Sbjct: 89  EWHLREKEKKKSRTKEEKKALKEEKDKEAEPYMWCVWDGVKEQ 131


>gnl|CDD|153328 cd07644, I-BAR_IMD_BAIAP2L2, Inverse (I)-BAR, also known as the
           IRSp53/MIM homology Domain (IMD), of Brain-specific
           Angiogenesis Inhibitor 1-Associated Protein 2-Like 2.
           The IMD domain, also called Inverse-Bin/Amphiphysin/Rvs
           (I-BAR) domain, is a dimerization and lipid-binding
           module that bends membranes and induces membrane
           protrusions. This group is composed of uncharacterized
           proteins known as BAIAP2L2 (Brain-specific Angiogenesis
           Inhibitor 1-Associated Protein 2-Like 2). They contain
           an N-terminal IMD, an SH3 domain, and a WASP homology 2
           (WH2) actin-binding motif at the C-terminus. The related
           proteins, BAIAP2L1 and IRSp53, function as regulators of
           membrane dynamics and the actin cytoskeleton. The IMD
           domain binds and bundles actin filaments, binds
           membranes and produces membrane protrusions, and
           interacts with the small GTPase Rac.
          Length = 215

 Score = 29.9 bits (67), Expect = 1.6
 Identities = 23/89 (25%), Positives = 41/89 (46%), Gaps = 11/89 (12%)

Query: 20  KSSALDEQL-----KEYIVEWRNQRAKEEEDLKRL--KEKQAKRKVMRAEEEQKMAQRKK 72
           K++ LD Q      + Y +E+R++ A  E+ +  L   E+Q  R V   +E     ++  
Sbjct: 101 KNTKLDMQFIEDSRRVYELEYRHRAANLEKCMSELWRMERQRDRNVREMKENVNRLRQSM 160

Query: 73  E----EEERRQREVEEKKQRDLEEKRQRL 97
           +    E +R     E+++ R L EK   L
Sbjct: 161 QAFLKESQRAAELEEKRRYRFLAEKHYLL 189


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 30.7 bits (69), Expect = 1.6
 Identities = 24/94 (25%), Positives = 42/94 (44%), Gaps = 7/94 (7%)

Query: 24  LDEQLKEYIVEWRNQRAKEEEDLKRLKEK----QAKRKVM---RAEEEQKMAQRKKEEEE 76
            + +L+  I    +     + DL +LK++    Q K   M   + +++Q M   +K  + 
Sbjct: 428 NESELRNQISLLTSLERSLKSDLGQLKKENDMLQTKLNSMVSAKQKDKQSMQSMEKRLKS 487

Query: 77  RRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAA 110
                V  +KQ   E+KR++ EE    R A  AA
Sbjct: 488 EADSRVNAEKQLAEEKKRKKEEEETAARAAAQAA 521


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 30.3 bits (69), Expect = 1.7
 Identities = 13/64 (20%), Positives = 31/64 (48%), Gaps = 4/64 (6%)

Query: 40  AKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEE 99
            +EE +   L   + K+      ++Q+ A++K E+EE  +   ++K +   ++ +    E
Sbjct: 397 EEEEGENGNLSPAERKK----LRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGE 452

Query: 100 AEKK 103
            +K 
Sbjct: 453 TKKV 456


>gnl|CDD|218038 pfam04350, PilO, Pilus assembly protein, PilO.  PilO proteins are
           involved in the assembly of pilin. However, the precise
           function of this family of proteins is not known.
          Length = 144

 Score = 29.4 bits (67), Expect = 1.7
 Identities = 13/49 (26%), Positives = 29/49 (59%), Gaps = 2/49 (4%)

Query: 69  QRKKEEEERRQREVEEKKQR--DLEEKRQRLEEAEKKRQAMMAALKEQT 115
           +R + +E   ++E EEK ++  +L   +++LEE E++  A++  L  + 
Sbjct: 2   ERAQAKEATLKQEFEEKARQAANLPAYKKQLEELEERFGALLKQLPSKK 50


>gnl|CDD|215212 PLN02372, PLN02372, violaxanthin de-epoxidase.
          Length = 455

 Score = 30.2 bits (68), Expect = 1.8
 Identities = 25/100 (25%), Positives = 43/100 (43%), Gaps = 5/100 (5%)

Query: 3   TDVAGGGDPEFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAE 62
           TD   G +P  ++R E      ++ + +   +   +  KE E L + +E   KR  +   
Sbjct: 352 TDNTCGPEPPLLERLEKDVEEGEKTIVKEARQIEEELEKEVEKLGKEEESLFKRVAL--- 408

Query: 63  EEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEK 102
           EE      + EE   ++   EEK+   LE+ +    E EK
Sbjct: 409 EEGLKELEQDEENFLKELSKEEKEL--LEKLKMEASEVEK 446


>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987).  A
           family of uncharacterized proteins found by clustering
           human gut metagenomic sequences.
          Length = 379

 Score = 30.3 bits (69), Expect = 1.8
 Identities = 14/59 (23%), Positives = 32/59 (54%), Gaps = 1/59 (1%)

Query: 56  RKVMRAEEEQKMAQRKKEEEE-RRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKE 113
            K +   EE+   + ++E +E   ++E+ E +++ LE+K ++  +  K  +A+   L E
Sbjct: 60  MKPLEEIEEELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEELLE 118



 Score = 29.9 bits (68), Expect = 2.3
 Identities = 22/101 (21%), Positives = 47/101 (46%), Gaps = 12/101 (11%)

Query: 22  SALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQRE 81
           SA+D+   + + E   +    EE  + LKE +A++++  AE++    + KK    ++ ++
Sbjct: 53  SAVDKLAMKPLEEI--EEELREEYEEELKEYEAEKEIWEAEKKGLEKKAKK--AIKKGKD 108

Query: 82  VEEKKQRDLEEKRQRLEEAEKKR--------QAMMAALKEQ 114
            E   +  LE + +  E   + R        +A++  L E 
Sbjct: 109 EEALAEELLELEAEEPEPPLRPRLIVNDATPEALLELLAEN 149


>gnl|CDD|223945 COG1013, PorB, Pyruvate:ferredoxin oxidoreductase and related
           2-oxoacid:ferredoxin oxidoreductases, beta subunit
           [Energy production and conversion].
          Length = 294

 Score = 30.0 bits (68), Expect = 1.8
 Identities = 13/68 (19%), Positives = 30/68 (44%), Gaps = 6/68 (8%)

Query: 32  IVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLE 91
           +  +   +A+EE      K K     +    EE    + + +   +   E+ +K Q+D++
Sbjct: 231 LYRYEPGKAEEE------KGKDIPIGIFYPVEEYLEYEERFKHLTKSNPELIQKLQKDVD 284

Query: 92  EKRQRLEE 99
           E+ + L+ 
Sbjct: 285 ERWEELKR 292


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
           Mitofilin controls mitochondrial cristae morphology.
           Mitofilin is enriched in the narrow space between the
           inner boundary and the outer membranes, where it forms a
           homotypic interaction and assembles into a large
           multimeric protein complex. The first 78 amino acids
           contain a typical amino-terminal-cleavable mitochondrial
           presequence rich in positive-charged and hydroxylated
           residues and a membrane anchor domain. In addition, it
           has three centrally located coiled coil domains.
          Length = 493

 Score = 30.4 bits (69), Expect = 1.8
 Identities = 20/87 (22%), Positives = 42/87 (48%), Gaps = 11/87 (12%)

Query: 15  KRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKR-KVMRAEEEQKMAQRKKE 73
           K+     +  +E+L+  +      + K EE L +L+E+   R +   A  E+++    + 
Sbjct: 177 KKLAELKAEEEEELERAL------KEKREELLSKLEEELLARLESKEAALEKQLRLEFER 230

Query: 74  EEERRQREVEEKKQRDLEEKRQRLEEA 100
           E+E    E+ +K +  L ++ +R  EA
Sbjct: 231 EKE----ELRKKYEEKLRQELERQAEA 253



 Score = 28.1 bits (63), Expect = 8.5
 Identities = 21/110 (19%), Positives = 49/110 (44%), Gaps = 10/110 (9%)

Query: 12  EFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRK 71
              K +E   + L+ +      + R +  +E+E+L         RK    +  Q++ ++ 
Sbjct: 201 LLSKLEEELLARLESKEAALEKQLRLEFEREKEEL---------RKKYEEKLRQELERQA 251

Query: 72  KEEEERRQREVEEKKQRDLEEKRQRLEEA-EKKRQAMMAALKEQTNKSKG 120
           +  E++ + E+  +      E  + ++E  E++R   +A L E  ++ KG
Sbjct: 252 EAHEQKLKNELALQAIELQREFNKEIKEKVEEERNGRLAKLAELNSRLKG 301


>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase.  This model represents the
           seryl-tRNA synthetase found in most organisms. This
           protein is a class II tRNA synthetase, and is recognized
           by the pfam model tRNA-synt_2b. The seryl-tRNA
           synthetases of two archaeal species, Methanococcus
           jannaschii and Methanobacterium thermoautotrophicum,
           differ considerably and are included in a different
           model [Protein synthesis, tRNA aminoacylation].
          Length = 418

 Score = 30.0 bits (68), Expect = 1.9
 Identities = 22/99 (22%), Positives = 46/99 (46%), Gaps = 9/99 (9%)

Query: 8   GGDPEFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKM 67
             +P+ VK          +   E ++   ++R K    L  ++E QAKR     E  +++
Sbjct: 8   RNNPDLVKESLKARGLSVDIDLEKLIALDDERKKL---LSEIEELQAKRN----ELSKQI 60

Query: 68  AQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQA 106
            + K +++   + E  +K+ ++L+E+   L  A K  +A
Sbjct: 61  GKAKGQKK--DKIEEIKKELKELKEELTELSAALKALEA 97


>gnl|CDD|163506 TIGR03794, NHLM_micro_HlyD, NHLM bacteriocin system secretion
           protein.  Members of this protein family are homologs of
           the HlyD membrane fusion protein of type I secretion
           systems. Their occurrence in prokaryotic genomes is
           associated with the occurrence of a novel class of
           microcin (small bacteriocins) with a leader peptide
           region related to nitrile hydratase. We designate the
           class of bacteriocin as Nitrile Hydratase Leader
           Microcin, or NHLM. This family, therefore, is designated
           as NHLM bacteriocin system secretion protein. Some but
           not all NHLM-class putative microcins belong to the TOMM
           (thiazole/oxazole modified microcin) class as assessed
           by the presence of the scaffolding protein and/or
           cyclodehydratase in the same gene clusters [Transport
           and binding proteins, Amino acids, peptides and amines,
           Cellular processes, Biosynthesis of natural products].
          Length = 421

 Score = 30.2 bits (68), Expect = 1.9
 Identities = 31/165 (18%), Positives = 61/165 (36%), Gaps = 26/165 (15%)

Query: 44  EDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKK 103
           E  +RL+E   K             Q + EE       ++E ++R  ++ ++ LEE   +
Sbjct: 93  ELRERLQESYQK---------LTQLQEQLEEVRNYTGRLKEGRERHFQKSKEALEETIGR 143

Query: 104 RQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIEEEKKIALNIRIKPL 163
            +  +AAL  +  K +G    + R            ++ + +EQ + +            
Sbjct: 144 LREELAALSREVGKQRG---LLSRGLATFKRDRILQQQWR-EEQEKYDAADKAR------ 193

Query: 164 NVEGWSIHKLQAKATE--LWETIVKLETDKYDLEERQKRQDYDVS 206
                +I+ LQ KA E  L   +  L    + L    +++   V 
Sbjct: 194 -----AIYALQTKADERNLETVLQSLSQADFQLAGVAEKELETVE 233


>gnl|CDD|236842 PRK11091, PRK11091, aerobic respiration control sensor protein
           ArcB; Provisional.
          Length = 779

 Score = 30.3 bits (69), Expect = 1.9
 Identities = 19/73 (26%), Positives = 36/73 (49%), Gaps = 20/73 (27%)

Query: 43  EEDLKRLKEKQAKRKVMRAEE--------------EQKMAQRKKEEEERRQ------REV 82
           EE  +RL    AK + MR  +               Q++A+R+K EE R++       E+
Sbjct: 81  EESRQRLSRLVAKLEEMRERDLELNVQLKDNIAQLNQEIAEREKAEEARQEAFEQLKNEI 140

Query: 83  EEKKQRDLEEKRQ 95
           +E+++  +E ++Q
Sbjct: 141 KEREETQIELEQQ 153


>gnl|CDD|220535 pfam10037, MRP-S27, Mitochondrial 28S ribosomal protein S27.
           Members of this family of small ribosomal proteins
           possess one of three conserved blocks of sequence found
           in proteins that stimulate the dissociation of guanine
           nucleotides from G-proteins, leaving open the
           possibility that MRP-S27 might be a functional partner
           of GTP-binding ribosomal proteins.
          Length = 417

 Score = 30.1 bits (68), Expect = 2.0
 Identities = 22/92 (23%), Positives = 37/92 (40%), Gaps = 7/92 (7%)

Query: 11  PEFVKRQEAKSSALDEQLKEYIVEWRNQ---RAKEEED--LKRLKEKQAKRKVMRAEEEQ 65
            E  K ++     L +  + +  EWR++    AK E    L   +      K      EQ
Sbjct: 323 EEVEKSKQKFEPLLAQYGESF-QEWRSKLQALAKVESRSLLSVEERLPTIEKEDLELYEQ 381

Query: 66  KMAQRKKEEEERRQREVEEKKQRDLEEKRQRL 97
           +      E  +  QR+ + ++Q D EE +QR 
Sbjct: 382 RQQLWFFENRKLWQRKKKLREQAD-EEYQQRH 412


>gnl|CDD|223041 PHA03321, PHA03321, tegument protein VP11/12; Provisional.
          Length = 694

 Score = 30.3 bits (68), Expect = 2.0
 Identities = 14/55 (25%), Positives = 23/55 (41%), Gaps = 14/55 (25%)

Query: 311 PGKKKDAPESPEEEEVKRDD--------------EEEENPIYLKISDDEEEKEEE 351
           P + +D    P+++   R D                +++PIY  +SD EE   EE
Sbjct: 534 PEQMEDPYLEPDDDRFDRRDGAAAAATSHPREAPAPDDDPIYEGVSDSEEPVYEE 588


>gnl|CDD|180240 PRK05759, PRK05759, F0F1 ATP synthase subunit B; Validated.
          Length = 156

 Score = 29.4 bits (67), Expect = 2.0
 Identities = 23/83 (27%), Positives = 40/83 (48%), Gaps = 9/83 (10%)

Query: 36  RNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEE--- 92
             +RAK+E     L+  QAK +   AE   + A+  ++ ++R  + +EE K     E   
Sbjct: 46  AAERAKKE-----LELAQAKYEAQLAEARAEAAEIIEQAKKRAAQIIEEAKAEAEAEAAR 100

Query: 93  -KRQRLEEAEKKRQAMMAALKEQ 114
            K Q   E E++R+     L++Q
Sbjct: 101 IKAQAQAEIEQERKRAREELRKQ 123


>gnl|CDD|234715 PRK00290, dnaK, molecular chaperone DnaK; Provisional.
          Length = 627

 Score = 30.1 bits (69), Expect = 2.0
 Identities = 19/65 (29%), Positives = 34/65 (52%), Gaps = 13/65 (20%)

Query: 62  EEEQKM---AQRKKEEEERRQREVEEKKQRD---------LEEKRQRLEEAEKKR-QAMM 108
           EE ++M   A+   EE+++R+  VE + Q D         L+E   ++   EK++ +A +
Sbjct: 503 EEIERMVKDAEANAEEDKKRKELVEARNQADSLIYQTEKTLKELGDKVPADEKEKIEAAI 562

Query: 109 AALKE 113
             LKE
Sbjct: 563 KELKE 567


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 30.1 bits (68), Expect = 2.0
 Identities = 21/81 (25%), Positives = 44/81 (54%)

Query: 14  VKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKE 73
           V+R E ++S L  +L+E   E     ++ E   + +++K  K + +RA + +     K+ 
Sbjct: 431 VERLEEENSELKRELEELKREIEKLESELERFRREVRDKVRKDREIRARDRRIERLEKEL 490

Query: 74  EEERRQREVEEKKQRDLEEKR 94
           EE++++ E  E+K  +L + R
Sbjct: 491 EEKKKRVEELERKLAELRKMR 511



 Score = 29.3 bits (66), Expect = 3.6
 Identities = 15/90 (16%), Positives = 42/90 (46%)

Query: 10  DPEFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQ 69
           +   +   E +   L+E ++    E    + + EE  + +++ +++ +  R E   K+ +
Sbjct: 413 ERREITVYEKRIKKLEETVERLEEENSELKRELEELKREIEKLESELERFRREVRDKVRK 472

Query: 70  RKKEEEERRQREVEEKKQRDLEEKRQRLEE 99
            ++     R+ E  EK+  + +++ + LE 
Sbjct: 473 DREIRARDRRIERLEKELEEKKKRVEELER 502


>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3.  This family of
           proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). SH3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of SH3, AAPs are retained in the ER.
          Length = 196

 Score = 29.6 bits (67), Expect = 2.0
 Identities = 14/35 (40%), Positives = 16/35 (45%)

Query: 52  KQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKK 86
           K AK     A EE + A   KEEE     + E KK
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKK 194


>gnl|CDD|223407 COG0330, HflC, Membrane protease subunits, stomatin/prohibitin
           homologs [Posttranslational modification, protein
           turnover, chaperones].
          Length = 291

 Score = 29.7 bits (67), Expect = 2.5
 Identities = 13/82 (15%), Positives = 23/82 (28%), Gaps = 4/82 (4%)

Query: 51  EKQAKRKVMRAEEEQ-KMAQR-KKEEEERRQREVEEKKQRDLEEK--RQRLEEAEKKRQA 106
            + A  K M AE ++       + E +    R   E +   +  +   +    A  +  A
Sbjct: 175 VQAAMEKQMAAERDKRAEILEAEGEAQAAILRAEGEAEAAIILAEAEAEAEVIARAEADA 234

Query: 107 MMAALKEQTNKSKGPNFTIQRR 128
                         P    QR 
Sbjct: 235 AKIIAAALREAPAAPQALAQRY 256


>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
          Length = 646

 Score = 29.9 bits (68), Expect = 2.8
 Identities = 22/88 (25%), Positives = 37/88 (42%), Gaps = 8/88 (9%)

Query: 30  EYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVE-----E 84
           E + + R+   K  E L +L+    K        E+K     K  EE  +   E     E
Sbjct: 43  ERLRKLRSLLTKLSEALDKLRSYLPKLNP---LREEKKKVSVKSLEELIKDVEEELEKIE 99

Query: 85  KKQRDLEEKRQRLEEAEKKRQAMMAALK 112
           K+ ++LEE+   LE   K+ +  +  L+
Sbjct: 100 KEIKELEEEISELENEIKELEQEIERLE 127


>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.
          This domain is found to the N-terminus of bacterial
          signal peptidases of the S49 family (pfam01343).
          Length = 154

 Score = 28.6 bits (65), Expect = 2.8
 Identities = 17/66 (25%), Positives = 32/66 (48%), Gaps = 4/66 (6%)

Query: 36 RNQRAKEEEDLKRLKE--KQAKRKVMRA--EEEQKMAQRKKEEEERRQREVEEKKQRDLE 91
          + +  K E ++  L E  K  K  +  A  ++++  A  K E++  + +   EKK+   E
Sbjct: 34 KKKGKKGELEITDLNEEYKDLKESLEAALLDKKELKAWEKAEKKAEKAKAKAEKKKAKKE 93

Query: 92 EKRQRL 97
          E + RL
Sbjct: 94 EPKPRL 99


>gnl|CDD|234604 PRK00058, PRK00058, methionine sulfoxide reductase A; Provisional.
          Length = 213

 Score = 29.1 bits (66), Expect = 2.9
 Identities = 11/47 (23%), Positives = 19/47 (40%), Gaps = 2/47 (4%)

Query: 208 IIINKKIMPSPGQGRPGFNPRYVQTPNL-FNNATALTGKYPPKIQVA 253
            + +KK + +  +  PG     +    L   N   LTG  P  ++ A
Sbjct: 3   SLFDKKHLVTAEEALPG-RATPMPVATLHAVNGHPLTGPVPEGMEQA 48


>gnl|CDD|215101 PLN00203, PLN00203, glutamyl-tRNA reductase.
          Length = 519

 Score = 29.7 bits (67), Expect = 2.9
 Identities = 19/63 (30%), Positives = 30/63 (47%), Gaps = 18/63 (28%)

Query: 164 NVEGW--------SIHKLQAKATELWETIVKLETDKY------DLEERQKRQDYDVSKII 209
           N E W        +I KL++ A    E I   E +K       DL ++Q++   D+S+ I
Sbjct: 420 NFEAWRDSLETVPTIKKLRSYA----ERIRAAELEKCLSKMGDDLTKKQRKAVEDLSRGI 475

Query: 210 INK 212
           +NK
Sbjct: 476 VNK 478


>gnl|CDD|233973 TIGR02680, TIGR02680, TIGR02680 family protein.  Members of this
           protein family belong to a conserved gene four-gene
           neighborhood found sporadically in a phylogenetically
           broad range of bacteria: Nocardia farcinica,
           Symbiobacterium thermophilum, and Streptomyces
           avermitilis (Actinobacteria), Geobacillus kaustophilus
           (Firmicutes), Azoarcus sp. EbN1 and Ralstonia
           solanacearum (Betaproteobacteria). Proteins in this
           family average over 1400 amino acids in length
           [Hypothetical proteins, Conserved].
          Length = 1353

 Score = 29.8 bits (67), Expect = 2.9
 Identities = 24/98 (24%), Positives = 38/98 (38%), Gaps = 3/98 (3%)

Query: 16  RQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEE 75
             E ++ AL  +L+        Q A+E E  +   E         A+  Q + + +   E
Sbjct: 308 ALEREADALRTRLEALQGSPAYQDAEELERARADAEALQAAA---ADARQAIREAESRLE 364

Query: 76  ERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKE 113
           E R+R  EE  + D  E+  R    +  R A  A L  
Sbjct: 365 EERRRLDEEAGRLDDAERELRAAREQLARAAERAGLSP 402



 Score = 29.0 bits (65), Expect = 5.7
 Identities = 17/67 (25%), Positives = 28/67 (41%)

Query: 40  AKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEE 99
           A      +R +  +A+     A E+   A+ + EE   R R +EE     ++E R RL E
Sbjct: 873 AATRAAEQRARAARAESDAREAAEDAAEARAEAEEASLRLRTLEESVGAMVDEIRARLAE 932

Query: 100 AEKKRQA 106
                 +
Sbjct: 933 TRAALAS 939


>gnl|CDD|235870 PRK06834, PRK06834, hypothetical protein; Provisional.
          Length = 488

 Score = 29.6 bits (67), Expect = 3.0
 Identities = 25/92 (27%), Positives = 41/92 (44%), Gaps = 9/92 (9%)

Query: 96  RLEEAEKKRQAMMAALKEQTNKSKG-PNFTIQRREPGLTLSTAQLERNKTKEQIEEEKKI 154
            ++E  K+  AMM+ L    +  +G P   + RR P L L TA   R +    +   + +
Sbjct: 355 GMDEPRKRIAAMMSGLDIHYDLGEGHP--LLGRRMPDLDLVTADGPR-RVFTLLHNARPV 411

Query: 155 ALNIRIKP--LNVEGWS--IHKLQAKATELWE 182
            LN+   P   ++  WS  +  + AK    WE
Sbjct: 412 LLNLG-APGAFDIAPWSDRVRLVDAKYAGPWE 442


>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 257 to 277 amino acids in length. This domain is
           found associated with pfam00004. This domain has a
           conserved LER sequence motif.
          Length = 276

 Score = 29.3 bits (66), Expect = 3.0
 Identities = 15/83 (18%), Positives = 44/83 (53%)

Query: 37  NQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQR 96
            +   E E+ ++  ++Q +++  RA+ + ++A+++ ++E  +QR   E+  +  EE   R
Sbjct: 85  ERARVEAEERRKTLQEQTQQEQQRAQYQDELARKRYQKELEQQRRQNEELLKMQEESVLR 144

Query: 97  LEEAEKKRQAMMAALKEQTNKSK 119
            E   +  +  +  ++ +T + +
Sbjct: 145 QEAMRRATEEEILEMRRETIEEE 167


>gnl|CDD|233879 TIGR02468, sucrsPsyn_pln, sucrose phosphate synthase/possible
           sucrose phosphate phosphatase, plant.  Members of this
           family are sucrose-phosphate synthases of plants. This
           enzyme is known to exist in multigene families in
           several species of both monocots and dicots. The
           N-terminal domain is the glucosyltransferase domain.
           Members of this family also have a variable linker
           region and a C-terminal domain that resembles sucrose
           phosphate phosphatase (SPP) (EC 3.1.3.24) (see
           TIGR01485), the next and final enzyme of sucrose
           biosynthesis. The SPP-like domain likely serves a
           binding and not a catalytic function, as the reported
           SPP is always encoded by a distinct protein.
          Length = 1050

 Score = 29.7 bits (67), Expect = 3.2
 Identities = 15/38 (39%), Positives = 21/38 (55%), Gaps = 1/38 (2%)

Query: 55  KRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEE 92
           K+K +  EE Q++A+R+ E E  R RE       DL E
Sbjct: 90  KKKQLEWEEAQRLAKRRLERERGR-REATADMSEDLSE 126


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 29.2 bits (66), Expect = 3.2
 Identities = 12/66 (18%), Positives = 36/66 (54%), Gaps = 3/66 (4%)

Query: 40  AKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEK---RQR 96
            +E+E+ +  K K A +   +   + K+ +++K + E+ ++ + E ++   E++   + R
Sbjct: 39  EEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLR 98

Query: 97  LEEAEK 102
           L + ++
Sbjct: 99  LRKLQE 104


>gnl|CDD|192773 pfam11559, ADIP, Afadin- and alpha -actinin-Binding.  This family
           is found in mammals where it is localised at cell-cell
           adherens junctions, and in Sch. pombe and other fungi
           where it anchors spindle-pole bodies to spindle
           microtubules. It is a coiled-coil structure, and in
           pombe, it is required for anchoring the minus end of
           spindle microtubules to the centrosome equivalent, the
           spindle-pole body. The name ADIP derives from the family
           being composed of Afadin- and alpha -Actinin-Binding
           Proteins Localised at Cell-Cell Adherens Junctions.
          Length = 149

 Score = 28.4 bits (64), Expect = 3.2
 Identities = 13/78 (16%), Positives = 41/78 (52%), Gaps = 5/78 (6%)

Query: 45  DLKRLKEK--QAKRKVMRAEEEQKMAQRKKEEEE---RRQREVEEKKQRDLEEKRQRLEE 99
            ++RLK +    +R++   + +++  ++K +  E   + ++E  ++ +  +++++ +   
Sbjct: 72  TIERLKTQLEDLERELALLQAKERQLEKKLKTLEQKLKNEKEEVQRLKNIIQQRKTQYNH 131

Query: 100 AEKKRQAMMAALKEQTNK 117
             KKR   +  LKE+  +
Sbjct: 132 ELKKRDREIEKLKERLLQ 149


>gnl|CDD|149180 pfam07960, CBP4, CBP4.  The CBP4 in S. cerevisiae is essential for
           the expression and activity of ubiquinol-cytochrome c
           reductase. This family appears to be fungal specific.
          Length = 128

 Score = 28.1 bits (63), Expect = 3.3
 Identities = 18/77 (23%), Positives = 39/77 (50%), Gaps = 9/77 (11%)

Query: 36  RNQRAKEEEDLKRLKEKQAKRK---------VMRAEEEQKMAQRKKEEEERRQREVEEKK 86
           R+ R +E ++L  + ++ +K               E+++ + + +K + E  Q+E  E+ 
Sbjct: 52  RDLRQQEFDELVTILKETSKSDKPIWKTGPIESPWEKKRNVREFQKTKAEEAQKEELERI 111

Query: 87  QRDLEEKRQRLEEAEKK 103
           + +LEE R + EE  K+
Sbjct: 112 REELEEARAQSEEMRKE 128


>gnl|CDD|218517 pfam05236, TAF4, Transcription initiation factor TFIID component
           TAF4 family.  This region of similarity is found in
           Transcription initiation factor TFIID component TAF4.
          Length = 255

 Score = 29.3 bits (66), Expect = 3.4
 Identities = 22/63 (34%), Positives = 30/63 (47%), Gaps = 11/63 (17%)

Query: 67  MAQRKKEEEERRQ-REVE---------EKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTN 116
           +AQ++KEEEERR  R  E           KQ+  EE++   EE  + R A   AL     
Sbjct: 118 LAQKQKEEEERRVERRRELGLEDPEQLRLKQKAKEEQKAESEET-RHRAANATALAAIGG 176

Query: 117 KSK 119
           + K
Sbjct: 177 RKK 179


>gnl|CDD|224188 COG1269, NtpI, Archaeal/vacuolar-type H+-ATPase subunit I [Energy
           production and conversion].
          Length = 660

 Score = 29.6 bits (67), Expect = 3.4
 Identities = 21/107 (19%), Positives = 41/107 (38%), Gaps = 16/107 (14%)

Query: 10  DPEFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQ 69
           D E  ++   +   L       I       ++  + L+ +K  + +  ++  E E+  A+
Sbjct: 35  DLEEGEKGLKELEKLKVAEVAQISLSSLL-SEVLDYLRSVKGLEGRLFILPEEVEKLEAE 93

Query: 70  RKKEEEERRQ------------REVEEK---KQRDLEEKRQRLEEAE 101
            K  EE  +             R++EE+      +LE+    LEE E
Sbjct: 94  LKSLEEVIKPAEKFSSEVEELTRKLEERLSELDEELEDLEDLLEELE 140


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. This family includes the
           PrrC protein that is thought to be the active component
           of the anticodon nuclease.
          Length = 713

 Score = 29.6 bits (67), Expect = 3.4
 Identities = 25/152 (16%), Positives = 62/152 (40%), Gaps = 16/152 (10%)

Query: 44  EDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKK 103
           E   +++E + + K +  + EQ  A+ +K+EEE    E  + K  D   K+   +     
Sbjct: 92  EIEAQIEELKKELKKLEEKIEQLEAEIEKKEEE---LEKAKNKFLDKAWKKLAKKYDSNL 148

Query: 104 RQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIEEEKKIALN---IRI 160
            +A+     ++  K K     ++  +  +  +++ L    + E+++ + K   +     +
Sbjct: 149 SEALKGLNYKKNFKEK----LLKELKSVILNASSLL----SLEELKAKIKTLFSSNKPEL 200

Query: 161 KPLNVEGWSIHKLQAKATELWETIVKLETDKY 192
             L +    I   + +  E+ E  +   +D  
Sbjct: 201 ALLTLS--VIDFDEIEQAEILEKSIIGSSDVP 230


>gnl|CDD|233119 TIGR00763, lon, ATP-dependent protease La.  This protein is induced
           by heat shock and other stresses in E. coli, B.
           subtilis, and other species. The yeast member,
           designated PIM1, is located in the mitochondrial matrix,
           required for mitochondrial function, and also induced by
           heat shock [Protein fate, Degradation of proteins,
           peptides, and glycopeptides].
          Length = 775

 Score = 29.6 bits (67), Expect = 3.5
 Identities = 15/74 (20%), Positives = 36/74 (48%), Gaps = 15/74 (20%)

Query: 55  KRKVMRAEEEQKMAQRKKEEEERRQRE---------------VEEKKQRDLEEKRQRLEE 99
           K+++   + + K+ ++ +E+ E+ QRE               +E+  + +LE+ +++LEE
Sbjct: 199 KKELELLKLQNKITKKVEEKMEKTQREYYLREQLKAIKKELGIEKDDKDELEKLKEKLEE 258

Query: 100 AEKKRQAMMAALKE 113
            +   +      KE
Sbjct: 259 LKLPEEVKKVIEKE 272


>gnl|CDD|227201 COG4864, COG4864, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 328

 Score = 29.1 bits (65), Expect = 3.5
 Identities = 20/61 (32%), Positives = 35/61 (57%), Gaps = 1/61 (1%)

Query: 54  AKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKK-RQAMMAALK 112
           AK +  +AE ++ +AQ K EE       +E++ +  +EE R ++ EAE +   A+  AL+
Sbjct: 231 AKLQTEQAEADKNIAQAKAEERRAMAVALEQEMRARVEEMRAKVVEAEAEVPLALSEALR 290

Query: 113 E 113
           E
Sbjct: 291 E 291


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 29.4 bits (66), Expect = 3.6
 Identities = 18/69 (26%), Positives = 38/69 (55%), Gaps = 3/69 (4%)

Query: 36  RNQRAKE-EEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKR 94
           R +  KE  + L+   +K  + K      E+  A+ K +EE+ +Q E E+K++   +E +
Sbjct: 367 RQELLKEYNKKLQDYTKKLGEVKDETDASEE--AEAKAKEEKLKQEENEKKQKEQADEDK 424

Query: 95  QRLEEAEKK 103
           ++ ++ E+K
Sbjct: 425 EKRQKDERK 433


>gnl|CDD|221431 pfam12127, YdfA_immunity, SigmaW regulon antibacterial.  This
           protein is found in bacteria. Proteins in this family
           are about 330 amino acids in length. The operon from
           which this protein is derived confers immunity for the
           host species to a broad range of antibacterial
           compounds, unlike the specific immunity proteins that
           are linked to and co-regulated with their
           antibiotic-synthesis proteins.
          Length = 318

 Score = 29.2 bits (66), Expect = 3.9
 Identities = 18/61 (29%), Positives = 35/61 (57%), Gaps = 1/61 (1%)

Query: 54  AKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKK-RQAMMAALK 112
           A+ +  +AE ++++AQ K EE        E++ +  ++E R ++ EAE +  +AM  A +
Sbjct: 228 ARLQTDQAEADKRIAQAKAEERRAMAVAREQEMKAKVQEMRAKVVEAEAEVPKAMAEAFR 287

Query: 113 E 113
           E
Sbjct: 288 E 288


>gnl|CDD|240439 PTZ00491, PTZ00491, major vault protein; Provisional.
          Length = 850

 Score = 29.2 bits (66), Expect = 3.9
 Identities = 18/80 (22%), Positives = 41/80 (51%), Gaps = 10/80 (12%)

Query: 39  RAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEE---ERRQREVEEKKQRDLEEKRQ 95
             + E ++++ + +    ++  AE E +  ++++E E   E+ Q E+E  K ++L +   
Sbjct: 729 LIEAEAEVEQAELRAKALRIE-AEAELEKLRKRQELELEYEQAQNELEIAKAKELADI-- 785

Query: 96  RLEEAEKKRQAMMAALKEQT 115
              EA K  + ++ AL  +T
Sbjct: 786 ---EATKFER-IVEALGRET 801


>gnl|CDD|221041 pfam11241, DUF3043, Protein of unknown function (DUF3043).  Some
           members in this family of proteins with unknown function
           are annotated as membrane proteins. This cannot be
           confirmed.
          Length = 168

 Score = 28.4 bits (64), Expect = 4.0
 Identities = 13/44 (29%), Positives = 21/44 (47%), Gaps = 7/44 (15%)

Query: 70  RKKEEEERRQR----EVEEKKQRDLEEKRQRLEEAEKKRQAMMA 109
           R++ E  RR+     + +  K+     +R+R   A   R AMMA
Sbjct: 9   RREAEAARRRPLVPEDRKAAKKAARAARRERRARA---RAAMMA 49


>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34.  This family represents
           herpes virus protein U79 and cytomegalovirus early
           phosphoprotein P34 (UL112).
          Length = 238

 Score = 28.7 bits (64), Expect = 4.0
 Identities = 15/45 (33%), Positives = 28/45 (62%), Gaps = 1/45 (2%)

Query: 61  AEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQ 105
            E+E +   RKK ++E R+R  ++K++R   E  Q+ +E  +K+Q
Sbjct: 143 FEKECRALSRKKSDDEHRKRSGKQKEKRR-VEDSQKHKEDRRKKQ 186


>gnl|CDD|116627 pfam08017, Fibrinogen_BP, Fibrinogen binding protein.  Proteins in
           this family bind to fibrinogen. Members of this family
           includes the fibrinogen receptor, FbsA, which mediates
           platelet aggregation.
          Length = 393

 Score = 29.1 bits (64), Expect = 4.2
 Identities = 25/111 (22%), Positives = 52/111 (46%)

Query: 44  EDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKK 103
           E  +R  E +++  V+   +     + +    ERRQR+VE K Q ++ E+RQR  E + +
Sbjct: 144 ERRQRDAENKSQGNVLERRQRDAENRSQGNVLERRQRDVENKSQGNVLERRQRDVENKSQ 203

Query: 104 RQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIEEEKKI 154
              +    ++  N+S+G     ++R+         LER +   + + +  +
Sbjct: 204 GNVLERRQRDAENRSQGNVLERRQRDVENKSQGNVLERRQRDVENKSQGNV 254



 Score = 28.7 bits (63), Expect = 6.3
 Identities = 21/86 (24%), Positives = 43/86 (50%)

Query: 44  EDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKK 103
           E  +R  E +++  V+   +     + +    ERRQR+ E + Q ++ E+RQR  E + +
Sbjct: 48  ERRQRDAENRSQGNVLERRQRDAENRSQGNVLERRQRDAENRSQGNVLERRQRDVENKSQ 107

Query: 104 RQAMMAALKEQTNKSKGPNFTIQRRE 129
              +    ++  NKS+G     ++R+
Sbjct: 108 GNVLERRQRDVENKSQGNVLERRQRD 133



 Score = 28.3 bits (62), Expect = 7.2
 Identities = 24/100 (24%), Positives = 47/100 (47%)

Query: 44  EDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKK 103
           E  +R  E +++  V+   +     + +    ERRQR+ E + Q ++ E+RQR  E + +
Sbjct: 128 ERRQRDAENRSQGNVLERRQRDAENKSQGNVLERRQRDAENRSQGNVLERRQRDVENKSQ 187

Query: 104 RQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNK 143
              +    ++  NKS+G     ++R+         LER +
Sbjct: 188 GNVLERRQRDVENKSQGNVLERRQRDAENRSQGNVLERRQ 227



 Score = 28.3 bits (62), Expect = 8.0
 Identities = 22/86 (25%), Positives = 44/86 (51%)

Query: 44  EDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKK 103
           E  +R  E +++  V+   +     + +    ERRQR+VE K Q ++ E+RQR  E + +
Sbjct: 64  ERRQRDAENRSQGNVLERRQRDAENRSQGNVLERRQRDVENKSQGNVLERRQRDVENKSQ 123

Query: 104 RQAMMAALKEQTNKSKGPNFTIQRRE 129
              +    ++  N+S+G     ++R+
Sbjct: 124 GNVLERRQRDAENRSQGNVLERRQRD 149



 Score = 27.9 bits (61), Expect = 9.3
 Identities = 22/83 (26%), Positives = 42/83 (50%)

Query: 47  KRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQA 106
           +R  E +++  V+   +     + +    ERRQR+VE K Q ++ E+RQR  E   +   
Sbjct: 83  QRDAENRSQGNVLERRQRDVENKSQGNVLERRQRDVENKSQGNVLERRQRDAENRSQGNV 142

Query: 107 MMAALKEQTNKSKGPNFTIQRRE 129
           +    ++  NKS+G     ++R+
Sbjct: 143 LERRQRDAENKSQGNVLERRQRD 165


>gnl|CDD|149048 pfam07768, PVL_ORF50, PVL ORF-50-like family.  This is a family
          of sequences found in both bacteria and bacteriophages.
          This region is approximately 130 residues long and in
          some cases is found as part of the PVL
          (Panton-Valentine leukocidin) group of genes, which
          encode a member of the leukocidin group of bacterial
          toxins that kill leukocytes by creation of pores in the
          cell membrane. PVL appears to be a virulence factor
          associated with a number of human diseases.
          Length = 118

 Score = 27.9 bits (62), Expect = 4.2
 Identities = 14/49 (28%), Positives = 31/49 (63%)

Query: 48 RLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQR 96
           L E       MR +E +++ + +  E+ER++RE+E K++++ E +R++
Sbjct: 38 ELSEALDAPYGMRLKEYREIKKSENIEQERKERELERKRRKEAELRRKK 86


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 29.1 bits (65), Expect = 4.2
 Identities = 22/83 (26%), Positives = 48/83 (57%)

Query: 40  AKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEE 99
            +EE++ +++KE++ K+K    EE +    +++ +E+R  +E E++K++ +EE R R EE
Sbjct: 104 KEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEE 163

Query: 100 AEKKRQAMMAALKEQTNKSKGPN 122
            +++R    +  K+   K     
Sbjct: 164 KKRERVRAKSRPKKPPKKKPPNK 186


>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein.  Rotavirus particles
           consist of three concentric proteinaceous capsid layers.
           The innermost capsid (core) is made of VP2. The genomic
           RNA and the two minor proteins VP1 and VP3 are
           encapsidated within this layer. The N-terminus of
           rotavirus VP2 is necessary for the encapsidation of VP1
           and VP3.
          Length = 887

 Score = 29.1 bits (65), Expect = 4.3
 Identities = 15/81 (18%), Positives = 34/81 (41%), Gaps = 6/81 (7%)

Query: 67  MAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQ 126
           MA R + E      +  ++K  + ++++ R+E  EK         K++   +   +  ++
Sbjct: 1   MAYRNRREANINNNDRMQEKDDEKQDQKNRMELKEKVLD------KKEEVVTDNVDSPVK 54

Query: 127 RREPGLTLSTAQLERNKTKEQ 147
            +     L  A   +  TKE+
Sbjct: 55  EQSSQENLKIADEVKKSTKEE 75


>gnl|CDD|224486 COG1570, XseA, Exonuclease VII, large subunit [DNA replication,
           recombination, and repair].
          Length = 440

 Score = 29.1 bits (66), Expect = 4.4
 Identities = 27/113 (23%), Positives = 57/113 (50%), Gaps = 6/113 (5%)

Query: 52  KQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKK-RQAMMA- 109
            Q +R++ RA   + + Q+K+  E   +R      +R L E++QRL+E   + R+A+   
Sbjct: 271 DQLQRRLHRALR-RLLDQKKQRLEHLARRLQFRSPERLLSEQQQRLDELAIRLRRALENQ 329

Query: 110 -ALKEQT--NKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIEEEKKIALNIR 159
            ALK+Q     ++  N  IQR++  L     +L++   ++   + +++   + 
Sbjct: 330 LALKKQRLERLTQRLNPQIQRQQQRLQQLERRLDKALRRQLKRKRERLEALVE 382


>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family.  This
           family consists of the 116kDa V-type ATPase (vacuolar
           (H+)-ATPases) subunits, as well as V-type ATP synthase
           subunit i. The V-type ATPases family are proton pumps
           that acidify intracellular compartments in eukaryotic
           cells for example yeast central vacuoles,
           clathrin-coated and synaptic vesicles. They have
           important roles in membrane trafficking processes. The
           116kDa subunit (subunit a) in the V-type ATPase is part
           of the V0 functional domain responsible for proton
           transport. The a subunit is a transmembrane glycoprotein
           with multiple putative transmembrane helices it has a
           hydrophilic amino terminal and a hydrophobic carboxy
           terminal. It has roles in proton transport and assembly
           of the V-type ATPase complex. This subunit is encoded by
           two homologous gene in yeast VPH1 and STV1.
          Length = 707

 Score = 29.3 bits (66), Expect = 4.5
 Identities = 15/57 (26%), Positives = 24/57 (42%), Gaps = 2/57 (3%)

Query: 60  RAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTN 116
             EE   +    KE EE     +E++   +LEE    L+E +      +  L E +N
Sbjct: 72  LEEEILDLEAEIKEVEEN-LESLEKEI-NELEEWLNVLDEEKSFLDENLEELSELSN 126


>gnl|CDD|237063 PRK12329, nusA, transcription elongation factor NusA; Provisional.
          Length = 449

 Score = 29.0 bits (65), Expect = 4.5
 Identities = 21/80 (26%), Positives = 40/80 (50%), Gaps = 4/80 (5%)

Query: 17  QEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEE 76
            E    A D ++ E I +   + A + E  +RL+ +QA+    RAEE+ ++ +     E+
Sbjct: 374 AEYDQEAEDAKVAELISQREEEEALQREAEERLEAEQAE----RAEEDARLRELYPLPED 429

Query: 77  RRQREVEEKKQRDLEEKRQR 96
             + E E ++ +  EE+  R
Sbjct: 430 EFEDEDELEEAQPEEEEEAR 449


>gnl|CDD|168632 PRK06633, PRK06633, acetyl-CoA acetyltransferase; Provisional.
          Length = 392

 Score = 28.8 bits (64), Expect = 4.5
 Identities = 19/74 (25%), Positives = 35/74 (47%), Gaps = 8/74 (10%)

Query: 237 NNATALTGKYPPKIQVASKYE--RRVDTRSYDDKKKLFEGGYLSLTKESLEKQY---AEK 291
           N +  + G Y   I+  +K+   + VD   YD    +F G ++ +T E++ KQ+    ++
Sbjct: 118 NMSLGMHGSY---IRAGAKFGDIKMVDLMQYDGLTDVFSGVFMGITAENISKQFNISRQE 174

Query: 292 VDQFGNRQKTKLPK 305
            D+F      K  K
Sbjct: 175 QDEFALSSHKKAAK 188


>gnl|CDD|224241 COG1322, COG1322, Predicted nuclease of restriction
           endonuclease-like fold, RmuC family [General function
           prediction only].
          Length = 448

 Score = 28.9 bits (65), Expect = 4.8
 Identities = 20/101 (19%), Positives = 41/101 (40%), Gaps = 7/101 (6%)

Query: 23  ALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREV 82
           +L     E I E    +A+ ++ L + +E+            + +AQ   E +E      
Sbjct: 63  SLQALNLELIQELNELKARLQQQLLQSREQLQ-------LLIESLAQLSSEFQELANEIF 115

Query: 83  EEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNF 123
           EE  +R  E  +Q L++  K  + ++   +EQ  +    + 
Sbjct: 116 EELNRRLAELNQQNLKQLLKPLREVLEKFREQLEQRIHESA 156


>gnl|CDD|235362 PRK05192, PRK05192, tRNA uridine 5-carboxymethylaminomethyl
           modification enzyme GidA; Validated.
          Length = 618

 Score = 28.9 bits (66), Expect = 4.9
 Identities = 23/84 (27%), Positives = 36/84 (42%), Gaps = 11/84 (13%)

Query: 128 REPGLTLSTAQLER-NKTKEQIEEEKKIALNIRIKP--LNVEGWSIHKLQAKATEL---- 180
            E GL +   +  R  + KE IEEE +   + R+ P  LN  G    K +    +L    
Sbjct: 449 YELGL-VDDERWARFEEKKEAIEEEIERLKSTRVTPDELNELGGDPLKREVSLLDLLRRP 507

Query: 181 ---WETIVKLETDKYDLEERQKRQ 201
              +E + KL+ +  DL+     Q
Sbjct: 508 EITYEDLAKLDPELADLDPEVAEQ 531


>gnl|CDD|184697 PRK14475, PRK14475, F0F1 ATP synthase subunit B; Provisional.
          Length = 167

 Score = 28.4 bits (63), Expect = 4.9
 Identities = 16/55 (29%), Positives = 27/55 (49%), Gaps = 6/55 (10%)

Query: 59  MRAEEEQKMAQRK--KEEEERRQREVEEKKQRDL----EEKRQRLEEAEKKRQAM 107
           +R E +  +A  K  +EE ER+   +    + D      E +++LEE  K+R  M
Sbjct: 56  LREEAQALLADVKAEREEAERQAAAMLAAAKADARRMEAEAKEKLEEQIKRRAEM 110


>gnl|CDD|223571 COG0497, RecN, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 557

 Score = 28.7 bits (65), Expect = 5.0
 Identities = 22/82 (26%), Positives = 43/82 (52%), Gaps = 10/82 (12%)

Query: 40  AKEEEDLKRLKE-----KQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRD----- 89
             EE   +  +E     KQA+R++   +E+++   ++ +  + +  E+EE   +      
Sbjct: 150 GLEELAQEAYQEAYQAWKQARRELEDLQEKERERAQRADLLQFQLEELEELNLQPGEDEE 209

Query: 90  LEEKRQRLEEAEKKRQAMMAAL 111
           LEE+R+RL  +EK  +A+  AL
Sbjct: 210 LEEERKRLSNSEKLAEAIQNAL 231


>gnl|CDD|221313 pfam11917, DUF3435, Protein of unknown function (DUF3435).  This
           family of proteins are functionally uncharacterized.
           This protein is found in eukaryotes. Proteins in this
           family are typically between 435 to 791 amino acids in
           length. This family is related to pfam00589 suggesting
           it may be an integrase enzyme.
          Length = 418

 Score = 28.9 bits (65), Expect = 5.1
 Identities = 22/75 (29%), Positives = 37/75 (49%), Gaps = 6/75 (8%)

Query: 38  QRAKEEED--LKRLKEKQAKRKVMRAEEEQKMAQRK----KEEEERRQREVEEKKQRDLE 91
           Q+A  EED  L+ L  K+   K        ++A+ K     E  E+R+REV  ++QR   
Sbjct: 280 QKASVEEDPELQELIRKRDHLKKEIIALYGQVAKAKGTPLYERLEKRRREVRNERQRLRR 339

Query: 92  EKRQRLEEAEKKRQA 106
           E ++++ E   + Q 
Sbjct: 340 ELKKKIREEFDEEQP 354


>gnl|CDD|148049 pfam06210, DUF1003, Protein of unknown function (DUF1003).  This
           family consists of several hypothetical bacterial
           proteins of unknown function.
          Length = 108

 Score = 27.5 bits (62), Expect = 5.2
 Identities = 12/49 (24%), Positives = 25/49 (51%), Gaps = 2/49 (4%)

Query: 53  QAKRKVMRAEEEQKMAQRKKEEEERRQREVEE--KKQRDLEEKRQRLEE 99
           QA R  +RAE + ++  + + E E   R+++   +   +L  + Q  +E
Sbjct: 60  QADRDRLRAELDYQINLKAELEIEALHRKLDALREALGELLTRDQERQE 108


>gnl|CDD|219111 pfam06625, DUF1151, Protein of unknown function (DUF1151).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 122

 Score = 27.4 bits (61), Expect = 5.3
 Identities = 16/66 (24%), Positives = 38/66 (57%), Gaps = 11/66 (16%)

Query: 42  EEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAE 101
           ++ +L+R+ EK+ + +V++ ++E++ A+R + E ER            L ++ Q+LE+ E
Sbjct: 49  QKPELQRVLEKRKRDQVLKEQKEEEEAKRLQSELERE-----------LMKRAQKLEQLE 97

Query: 102 KKRQAM 107
            ++   
Sbjct: 98  LEKAKE 103


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 29.2 bits (65), Expect = 5.4
 Identities = 10/43 (23%), Positives = 22/43 (51%), Gaps = 4/43 (9%)

Query: 314  KKDAPESPE--EEEVKRDDEEEENPIYL--KISDDEEEKEEEN 352
            K+D  ++ E  + + K D+EE  + + +  +I  D +E   + 
Sbjct: 3938 KEDDNKALEDKDRQEKEDEEEMSDDVGIDDEIQPDIQENNSQP 3980


>gnl|CDD|227250 COG4913, COG4913, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 1104

 Score = 28.8 bits (64), Expect = 5.7
 Identities = 19/103 (18%), Positives = 40/103 (38%), Gaps = 5/103 (4%)

Query: 15  KRQEAKSS-----ALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQ 69
           +R++A+ S     AL + L       + +  +       +++ QAK KV  A+ E   A+
Sbjct: 270 RREKAQQSKDHANALKKALPTVGNRIKKEEQETLVRQFTVEQTQAKSKVESAKIETDRAR 329

Query: 70  RKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALK 112
             +       +++   +   L  KR+   +  +      A L 
Sbjct: 330 EMETLAHDNVKQIVGAQHGILSAKREGAVDKRRTISTARAGLD 372


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
           family is found in eukaryotes; it has several conserved
           tryptophan residues. The function is not known.
          Length = 261

 Score = 28.5 bits (64), Expect = 5.7
 Identities = 26/128 (20%), Positives = 57/128 (44%), Gaps = 8/128 (6%)

Query: 50  KEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEA---EKKRQA 106
            ++Q K + +  E++++  ++++EE E RQR  +EK +    +K Q+  +    + K++A
Sbjct: 98  AQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEKYEEWCRQKAQQAAKQRTPKHKKEA 157

Query: 107 MMAALKEQTNKSKGPNFTIQ-----RREPGLTLSTAQLERNKTKEQIEEEKKIALNIRIK 161
             +A    +  +K      Q     R +        Q ++ + +E+ ++ KK       K
Sbjct: 158 AESASSSLSGSAKPERNVSQEEAKKRLQEWELKKLKQQQQKREEERRKQRKKQQEEEERK 217

Query: 162 PLNVEGWS 169
               E W 
Sbjct: 218 QKAEEAWQ 225


>gnl|CDD|227606 COG5281, COG5281, Phage-related minor tail protein [Function
           unknown].
          Length = 833

 Score = 28.8 bits (64), Expect = 6.0
 Identities = 16/79 (20%), Positives = 30/79 (37%), Gaps = 4/79 (5%)

Query: 29  KEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQR 88
           KE   + ++   K EE    LKE+  +R++   +       +K+  E   Q      +Q 
Sbjct: 508 KEKASDQKSLLWKAEEQYALLKEEAKQRQLQEQKALL--EHKKETLEYTSQLAELLDQQA 565

Query: 89  DLEEKRQR--LEEAEKKRQ 105
           D  E   +    + E+   
Sbjct: 566 DRFELSAQAAGSQKERGSD 584


>gnl|CDD|224796 COG1884, Sbm, Methylmalonyl-CoA mutase, N-terminal domain/subunit
           [Lipid metabolism].
          Length = 548

 Score = 28.5 bits (64), Expect = 6.3
 Identities = 16/57 (28%), Positives = 27/57 (47%), Gaps = 1/57 (1%)

Query: 31  YIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQ 87
           Y VEW      EE     ++E +A   +++A E+    +   E   RRQ  +EE ++
Sbjct: 397 YYVEWLTDEL-EEAAWAYIEEIEAMGGMLKAIEDGYPQREIAESAARRQARIEEGER 452


>gnl|CDD|214351 CHL00063, atpE, ATP synthase CF1 epsilon subunit.
          Length = 134

 Score = 27.5 bits (62), Expect = 6.8
 Identities = 11/29 (37%), Positives = 17/29 (58%)

Query: 84  EKKQRDLEEKRQRLEEAEKKRQAMMAALK 112
           ++ Q+ LE     LE+AE K+Q + A L 
Sbjct: 90  QEAQQTLEIAEANLEKAEGKKQKIEANLA 118


>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
           and conversion].
          Length = 161

 Score = 27.6 bits (62), Expect = 6.8
 Identities = 15/73 (20%), Positives = 34/73 (46%)

Query: 39  RAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLE 98
           +A   E  + L+E + +   +  + +++  Q  +E +   + E+E  K+    E     E
Sbjct: 57  QALLAEYEQELEEAREQASEIIEQAKKEAEQIAEEIKAEAEEELERIKEAAEAEIEAEKE 116

Query: 99  EAEKKRQAMMAAL 111
            A ++ +A +A L
Sbjct: 117 RALEELRAEVAEL 129


>gnl|CDD|227038 COG4694, COG4694, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 758

 Score = 28.4 bits (63), Expect = 6.9
 Identities = 20/100 (20%), Positives = 41/100 (41%), Gaps = 5/100 (5%)

Query: 26  EQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEK 85
           E  KE I +   ++ K E  L+   +++ K +     +  +     K+ EER +  +E  
Sbjct: 121 ESKKESINKKNEKKIKNEASLQVKTQREEKEEKDFIADCWR--NLYKKNEERFKEYLENF 178

Query: 86  KQRDLEEKRQRLEEAEKKRQAMMA---ALKEQTNKSKGPN 122
           K++  + K  +  E  K   + +     +KE+       N
Sbjct: 179 KRKKFKRKILKEFENAKINASEIVGLEKVKEKIEIVFRHN 218


>gnl|CDD|222374 pfam13779, DUF4175, Domain of unknown function (DUF4175). 
          Length = 820

 Score = 28.7 bits (65), Expect = 6.9
 Identities = 21/102 (20%), Positives = 39/102 (38%), Gaps = 23/102 (22%)

Query: 17  QEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKE--KQAKRKVMRAEEEQKMAQRKKEE 74
           QE  S AL+              A +EE + +L +  ++A    MR   EQ    ++  +
Sbjct: 478 QERLSEALE------------NGASDEE-IAKLMQELREALDDYMRQLAEQ---AQRNPQ 521

Query: 75  EERRQREVEEKK---QRDLEEKRQRLEEAEKKRQAMMAALKE 113
           +  +  +        Q+DL+    R+EE         A  ++
Sbjct: 522 QGDQPPDQGNSMELTQQDLQRMMDRIEEL--MESGRRAEAQQ 561


>gnl|CDD|227084 COG4741, COG4741, Predicted secreted endonuclease distantly
          related to archaeal Holliday junction resolvase
          [Nucleotide transport and metabolism].
          Length = 175

 Score = 27.8 bits (62), Expect = 6.9
 Identities = 15/44 (34%), Positives = 19/44 (43%), Gaps = 2/44 (4%)

Query: 37 NQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQ--RKKEEEERR 78
            R  EE   K  +E+       R EEE K+ +   KK EE R 
Sbjct: 34 KARELEETLQKAERERLVNEAQARKEEEWKLKEWIEKKIEEARE 77


>gnl|CDD|215458 PLN02853, PLN02853, Probable phenylalanyl-tRNA synthetase alpha
           chain.
          Length = 492

 Score = 28.5 bits (64), Expect = 7.0
 Identities = 25/102 (24%), Positives = 43/102 (42%), Gaps = 22/102 (21%)

Query: 50  KEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMA 109
            + Q  RKV   E+E K      +E +    EV++K    L+ KR++L            
Sbjct: 118 GKPQVSRKVQHVEDEVKELLLAIQEGK----EVDDKDIDALK-KRRKL------------ 160

Query: 110 ALKEQTNK----SKGPNFTIQRREPGLTLSTAQLERNKTKEQ 147
            +  +T K     KGPN+  +R++    L+   L+    K+ 
Sbjct: 161 -ITLETWKGYSIKKGPNYAPERKKAATDLTREMLQSGDWKDL 201


>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
          Length = 1463

 Score = 28.5 bits (64), Expect = 7.1
 Identities = 39/198 (19%), Positives = 79/198 (39%), Gaps = 15/198 (7%)

Query: 13  FVKRQEAKSSAL-DEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRK 71
           F+K Q   S     ++ + Y+        K ++ +KR K+ +   +V  + + + + Q+ 
Sbjct: 796 FIKLQPLLSLLGSRKEYRSYL----ACIIKLQKTIKREKKLRETEEVEFSLKAEVLIQKF 851

Query: 72  KEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPG 131
               + ++R    KK+    +  QR+E AE++ Q +   +K   +  K  N  ++     
Sbjct: 852 GRSLKAKKRFSLLKKETIYLQSAQRVELAERQLQELKIDVKS-ISSLKLVNLELESEIIE 910

Query: 132 LTLSTAQLERNKTK---EQIEEEKKIALNIRIK---PLNVEGWSIH-KLQAKATELWETI 184
           L  S +       +   E I   KK+  NI ++    +         KL    ++L ET 
Sbjct: 911 LKKSLSSDLIENLEFKTELIARLKKLLNNIDLEEGPSIEYVKLPELNKLHEVESKLKETS 970

Query: 185 VKLET--DKYDLEERQKR 200
            + E    K  +  R+  
Sbjct: 971 EEYEDLLKKSTILVREGN 988


>gnl|CDD|172358 PRK13831, PRK13831, conjugal transfer protein TrbI; Provisional.
          Length = 432

 Score = 28.2 bits (63), Expect = 7.2
 Identities = 17/67 (25%), Positives = 30/67 (44%), Gaps = 6/67 (8%)

Query: 50  KEKQAKRKVMRAEEEQKMAQRKKEEEE----RRQREVEEKKQRDLEEKRQRLEEAEKKRQ 105
           +E++A         +++  +   E EE    R +RE EE+  R  E +RQR+   +    
Sbjct: 100 QEEKAVNPFTPQPGQREERRPTLESEEEWRARLKREQEEQYLR--ERQRQRMARLQANAA 157

Query: 106 AMMAALK 112
           A  + L 
Sbjct: 158 AYDSPLA 164


>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342).  This
           family of bacterial proteins has no known function. The
           proteins are in the region of 500-600 amino acid
           residues in length.
          Length = 450

 Score = 28.4 bits (64), Expect = 7.2
 Identities = 20/93 (21%), Positives = 44/93 (47%), Gaps = 7/93 (7%)

Query: 23  ALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREV 82
            L E+LKE       +  + EE+L+++K+   K    +A  +    +R++ E+    +E 
Sbjct: 331 ELKEELKEL----EEELKELEEELEKIKKLLKKLP-KKARGQLPPEKREQLEKLLETKEK 385

Query: 83  EEKKQRDLEEKRQRLEE--AEKKRQAMMAALKE 113
             ++  +LEE+ + L+E       +  ++  K 
Sbjct: 386 LSEELEELEEELKELKEELESLYSEGKISVNKT 418


>gnl|CDD|236153 PRK08116, PRK08116, hypothetical protein; Validated.
          Length = 268

 Score = 28.1 bits (63), Expect = 7.3
 Identities = 11/37 (29%), Positives = 20/37 (54%)

Query: 60 RAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQR 96
            E E + A+ ++EE   +QR +E  K   L +++ R
Sbjct: 44 ECEREAEEAKEREEENREKQRRIERLKSNSLLDEKFR 80


>gnl|CDD|233161 TIGR00870, trp, transient-receptor-potential calcium channel
           protein.  The Transient Receptor Potential Ca2+ Channel
           (TRP-CC) Family (TC. 1.A.4)The TRP-CC family has also
           been called the store-operated calcium channel (SOC)
           family. The prototypical members include the Drosophila
           retinal proteinsTRP and TRPL (Montell and Rubin, 1989;
           Hardie and Minke, 1993). SOC members of the family
           mediate the entry of extracellular Ca2+ into cells in
           responseto depletion of intracellular Ca2+ stores
           (Clapham, 1996) and agonist stimulated production of
           inositol-1,4,5 trisphosphate (IP3). One member of the
           TRP-CCfamily, mammalian Htrp3, has been shown to form a
           tight complex with the IP3 receptor (TC #1.A.3.2.1).
           This interaction is apparently required for IP3
           tostimulate Ca2+ release via Htrp3. The vanilloid
           receptor subtype 1 (VR1), which is the receptor for
           capsaicin (the ?hot? ingredient in chili peppers) and
           servesas a heat-activated ion channel in the pain
           pathway (Caterina et al., 1997), is also a member of
           this family. The stretch-inhibitable non-selective
           cation channel(SIC) is identical to the vanilloid
           receptor throughout all of its first 700 residues, but
           it exhibits a different sequence in its last 100
           residues. VR1 and SICtransport monovalent cations as
           well as Ca2+. VR1 is about 10x more permeable to Ca2+
           than to monovalent ions. Ca2+ overload probably causes
           cell deathafter chronic exposure to capsaicin.
           (McCleskey and Gold, 1999) [Transport and binding
           proteins, Cations and iron carrying compounds].
          Length = 743

 Score = 28.5 bits (64), Expect = 7.8
 Identities = 22/95 (23%), Positives = 31/95 (32%), Gaps = 22/95 (23%)

Query: 33  VEWRNQRAKE----EEDLKRL----------KEKQAKRKVMRAEEEQKMAQRKKEEEERR 78
            EW+ QRAK     E +              K      K +   + +K         +R 
Sbjct: 624 EEWKFQRAKLWMSYEREGGTCPPPFNIIPGPKSFVGLFKRIEKHDGKKR--------QRW 675

Query: 79  QREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKE 113
            R VEE      E K + L E     Q +M  L +
Sbjct: 676 CRRVEEVNWTTWERKAETLIEDGLHYQRVMKRLIK 710


>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
          Length = 430

 Score = 28.4 bits (63), Expect = 7.8
 Identities = 30/114 (26%), Positives = 50/114 (43%), Gaps = 10/114 (8%)

Query: 42  EEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAE 101
            E+DL+ LK++ A+    RA+EE +     K   E  + E E +++     K   +   E
Sbjct: 3   PEKDLEDLKKEAAR----RAKEEARKRLVAKHGAEISKLEEENREKEKALPKNDDMTIEE 58

Query: 102 KKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIEEEKKIA 155
            KR+A  AA      K+K      Q+RE    ++  +  + K K     + K A
Sbjct: 59  AKRRAAAAA------KAKAAALAKQKREGTEEVTEEEKAKAKAKAAAAAKAKAA 106


>gnl|CDD|233149 TIGR00845, caca, sodium/calcium exchanger 1.  The Ca2+:Cation
           Antiporter (CaCA) Family (TC 2.A.19)Proteins of the CaCA
           family are found ubiquitously, having been identified in
           animals, plants, yeast, archaea and widely divergent
           bacteria.All of the characterized animal proteins
           catalyze Ca2+:Na+ exchange although some also transport
           K+. The NCX1 plasma membrane protein exchanges 3 Na+ for
           1 Ca2+. The E. coli ChaA protein catalyzes Ca2+:H+
           antiport but may also catalyze Na+:H+ antiport. All
           remaining well-characterized members of the family
           catalyze Ca2+:H+ exchange.This model is specific for the
           eukaryotic sodium ion/calcium ion exchangers of the Caca
           family [Transport and binding proteins, Other].
          Length = 928

 Score = 28.2 bits (63), Expect = 7.9
 Identities = 17/60 (28%), Positives = 29/60 (48%), Gaps = 4/60 (6%)

Query: 296 GNRQKTKLPKWFGERPGKKKDAPESPE--EEEVKRDDEEEENPIYLKISDDEEEKEEENF 353
           G R    +P    E  G  +   +  E    E++ +++E E  I +KI DDEE ++ + F
Sbjct: 555 GARGTVIVPYRTVE--GTARGGGKDFEDTCGELEFENDETEKTIRVKIVDDEEYEKNDTF 612


>gnl|CDD|219004 pfam06386, GvpL_GvpF, Gas vesicle synthesis protein GvpL/GvpF.
           This family consists of several bacterial and archaeal
           gas vesicle synthesis protein (GvpL/GvpF) sequences. The
           exact function of this family is unknown.
          Length = 246

 Score = 28.0 bits (63), Expect = 7.9
 Identities = 14/73 (19%), Positives = 30/73 (41%), Gaps = 5/73 (6%)

Query: 33  VEWR-----NQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQ 87
           VEW      ++ A  +E ++   E +A R  + +  E K    +    +  +  + E++Q
Sbjct: 111 VEWGVKVYWDEDAILKEVVEEDPELRALRAEIESAPEGKAYLLRIRLGQLIEEALRERRQ 170

Query: 88  RDLEEKRQRLEEA 100
              E+  + L   
Sbjct: 171 ELAEDILEALSPL 183


>gnl|CDD|189027 cd09857, PIN_EXO1, PIN domain of Exonuclease-1, a
           structure-specific, divalent-metal-ion dependent, 5'
           nuclease and homologs.  Exonuclease-1 (EXO1) is involved
           in multiple, eukaryotic DNA metabolic pathways,
           including DNA replication processes (5' flap DNA
           endonuclease activity and double stranded DNA
           5'-exonuclease activity), DNA repair processes (DNA
           mismatch repair (MMR) and post-replication repair
           (PRR)), recombination, and telomere integrity. EXO1
           functions in the MMS2 error-free branch of the PRR
           pathway in the maintenance and repair of stalled
           replication forks. Studies also suggest that EXO1 plays
           both structural and catalytic roles during MMR-mediated
           mutation avoidance. EXO1 belongs to the FEN1-EXO1-like
           family of structure-specific, 5' nucleases. These
           nucleases contain a PIN (PilT N terminus) domain with a
           helical arch/clamp region (I domain) of variable length
           (approximately 43 residues in EXO1 PIN domains) and a
           H3TH (helix-3-turn-helix) domain, an atypical
           helix-hairpin-helix-2-like region. Both the H3TH domain
           (not included here) and the helical arch/clamp region
           are involved in DNA binding. Nucleases within this group
           also have a carboxylate-rich active site that is
           involved in binding essential divalent metal ion
           cofactors (Mg2+/Mn2+). EXO1 nucleases also have
           C-terminal Mlh1- and Msh2-binding domains which allow
           interaction with MMR and PRR proteins, respectively.
          Length = 210

 Score = 27.8 bits (63), Expect = 8.0
 Identities = 12/39 (30%), Positives = 20/39 (51%), Gaps = 2/39 (5%)

Query: 71  KKEEEERRQREVEEKKQRDLE-EKRQRLEEAEKK-RQAM 107
           KK  EE R+   EE  ++ LE  +     EA +  ++A+
Sbjct: 85  KKGTEEERRERREENLEKALELLREGNRSEARECFQRAV 123


>gnl|CDD|215618 PLN03184, PLN03184, chloroplast Hsp70; Provisional.
          Length = 673

 Score = 28.3 bits (63), Expect = 8.0
 Identities = 13/36 (36%), Positives = 23/36 (63%)

Query: 32  IVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKM 67
           IV+W     K++E +  LK+KQA +++  A E+ K+
Sbjct: 268 IVDWLASNFKKDEGIDLLKDKQALQRLTEAAEKAKI 303


>gnl|CDD|189762 pfam00901, Orbi_VP5, Orbivirus outer capsid protein VP5.
           cryoelectron microscopy indicates that VP5 is a trimer
           implying that there are 360 copies of VP5 per virion.
          Length = 507

 Score = 28.1 bits (63), Expect = 8.1
 Identities = 21/81 (25%), Positives = 40/81 (49%), Gaps = 18/81 (22%)

Query: 51  EKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQ------RLEEAEKKR 104
           E+  +RK+   E EQK    ++E  E+  +++ EK   DLEE  +      ++EE E+K+
Sbjct: 86  EQGLQRKLKELEREQK----EEEVREKHNKKIIEKFGEDLEEVYKFMKGEAKVEEEEEKQ 141

Query: 105 --------QAMMAALKEQTNK 117
                   ++ +  +KE+   
Sbjct: 142 MEILEKALKSYLKIVKEENKS 162


>gnl|CDD|224114 COG1193, COG1193, Mismatch repair ATPase (MutS family) [DNA
           replication, recombination, and repair].
          Length = 753

 Score = 28.5 bits (64), Expect = 8.1
 Identities = 26/99 (26%), Positives = 38/99 (38%), Gaps = 2/99 (2%)

Query: 16  RQEAKSSALDEQLKEYIVE-WRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEE 74
           R  A   AL   L E I+E  + +  +E+E L+ L EK  +      EEE +  ++  +E
Sbjct: 475 RSNAFDIALRLGLPEPIIEEAKTEFGEEKELLEELIEK-LEEVRKELEEELEEVEKLLDE 533

Query: 75  EERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKE 113
            E         K   LE K Q +  A           K 
Sbjct: 534 VELLTGANSGGKTSLLELKAQIVVLAHMGLPVPAEEAKV 572


>gnl|CDD|226447 COG3937, COG3937, Uncharacterized conserved protein [Function
           unknown].
          Length = 108

 Score = 26.7 bits (59), Expect = 8.1
 Identities = 24/91 (26%), Positives = 45/91 (49%), Gaps = 3/91 (3%)

Query: 24  LDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVE 83
           + E L++  +      A+  E +++L ++  K+  + AEE ++       + +  Q E+E
Sbjct: 4   MGEGLRKLALIGAGLAAETAEKVQKLVDELVKKGELNAEEAKRFVDDLLRQAKEAQGELE 63

Query: 84  EKKQRDLEEKRQRLEEAEKKRQAMMAALKEQ 114
           EK  R +EE    L + E  RQ+ M  L E+
Sbjct: 64  EKIPRKIEEM---LSDLEVARQSEMDELTER 91


>gnl|CDD|233830 TIGR02350, prok_dnaK, chaperone protein DnaK.  Members of this
           family are the chaperone DnaK, of the DnaK-DnaJ-GrpE
           chaperone system. All members of the seed alignment were
           taken from completely sequenced bacterial or archaeal
           genomes and (except for Mycoplasma sequence) found
           clustered with other genes of this systems. This model
           excludes DnaK homologs that are not DnaK itself, such as
           the heat shock cognate protein HscA (TIGR01991).
           However, it is not designed to distinguish among DnaK
           paralogs in eukaryotes. Note that a number of dnaK genes
           have shadow ORFs in the same reverse (relative to dnaK)
           reading frame, a few of which have been assigned
           glutamate dehydrogenase activity. The significance of
           this observation is unclear; lengths of such shadow ORFs
           are highly variable as if the presumptive protein
           product is not conserved [Protein fate, Protein folding
           and stabilization].
          Length = 595

 Score = 28.0 bits (63), Expect = 8.5
 Identities = 25/96 (26%), Positives = 49/96 (51%), Gaps = 11/96 (11%)

Query: 17  QEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEE 76
           +EA+++A +++ ++  +E RN      + L    EK  K      E   K+   +KE+ E
Sbjct: 508 KEAEANAEEDKKRKEEIEARNN----ADSLAYQAEKTLK------EAGDKLPAEEKEKIE 557

Query: 77  RRQREVEEK-KQRDLEEKRQRLEEAEKKRQAMMAAL 111
           +   E++E  K  D+EE + + EE ++  Q +  A+
Sbjct: 558 KAVAELKEALKGEDVEEIKAKTEELQQALQKLAEAM 593


>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
           (TAF4) is one of several TAFs that bind TBP and is
           involved in forming Transcription Factor IID (TFIID)
           complex.  The TATA Binding Protein (TBP) Associated
           Factor 4 (TAF4) is one of several TAFs that bind TBP and
           are involved in forming the Transcription Factor IID
           (TFIID) complex. TFIID is one of seven General
           Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
           TFIIF, and TFIID) that are involved in accurate
           initiation of transcription by RNA polymerase II in
           eukaryote. TFIID plays an important role in the
           recognition of promoter DNA and assembly of the
           pre-initiation complex. TFIID complex is composed of the
           TBP and at least 13 TAFs. TAFs from various species were
           originally named by their predicted molecular weight or
           their electrophoretic mobility in polyacrylamide gels. A
           new, unified nomenclature for the pol II TAFs has been
           suggested to show the relationship between TAF orthologs
           and paralogs. Several hypotheses are proposed for TAFs
           functions such as serving as activator-binding sites,
           core-promoter recognition or a role in essential
           catalytic activity. Each TAF, with the help of a
           specific activator, is required only for the expression
           of subset of genes and is not universally involved for
           transcription as are GTFs. In yeast and human cells,
           TAFs have been found as components of other complexes
           besides TFIID.   Several TAFs interact via histone-fold
           (HFD) motifs; HFD is the interaction motif involved in
           heterodimerization of the core histones and their
           assembly into nucleosome octamers. The minimal HFD
           contains three alpha-helices linked by two loops and is
           found in core histones, TAFS and many other
           transcription factors. TFIID has a histone octamer-like
           substructure. TAF4 domain interacts with TAF12 and makes
           a novel histone-like heterodimer that binds DNA and has
           a core promoter function of a subset of genes.
          Length = 212

 Score = 27.7 bits (62), Expect = 8.5
 Identities = 27/89 (30%), Positives = 49/89 (55%), Gaps = 11/89 (12%)

Query: 21  SSALDEQLK---EYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEER 77
           S AL+E+L+   E ++E    R   E++ +R +     RK +R      + Q ++EEEE+
Sbjct: 75  SLALEERLRNLLEKLIEVSEHRVDSEKEDERYEITSDVRKQLRF-----LEQLEREEEEK 129

Query: 78  RQREVEEKKQRDLEEKRQRLEEAEKKRQA 106
           R    EE+++R L   + R E++  K++A
Sbjct: 130 R---DEEERERLLRAAKSRSEQSRLKQKA 155


>gnl|CDD|185616 PTZ00436, PTZ00436, 60S ribosomal protein L19-like protein;
           Provisional.
          Length = 357

 Score = 28.0 bits (61), Expect = 8.6
 Identities = 24/86 (27%), Positives = 43/86 (50%), Gaps = 4/86 (4%)

Query: 35  WRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKR 94
           +RN+R   E   K   EK+ +R++     EQ  A+R K+E+ R +   +E ++R+ + +R
Sbjct: 132 FRNKRNLMEHIHKVKNEKKKERQL----AEQLAAKRLKDEQHRHKARKQELRKREKDRER 187

Query: 95  QRLEEAEKKRQAMMAALKEQTNKSKG 120
            R E+A     A   A  ++     G
Sbjct: 188 ARREDAAAAAAAKQKAAAKKAAAPSG 213


>gnl|CDD|217020 pfam02403, Seryl_tRNA_N, Seryl-tRNA synthetase N-terminal domain.
           This domain is found associated with the Pfam tRNA
           synthetase class II domain (pfam00587) and represents
           the N-terminal domain of seryl-tRNA synthetase.
          Length = 108

 Score = 26.8 bits (60), Expect = 8.6
 Identities = 28/105 (26%), Positives = 46/105 (43%), Gaps = 13/105 (12%)

Query: 10  DPEFVKRQEAK---SSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQK 66
           +PE VK +  K        ++L E   E R       E    L+E QA+R  +  E  + 
Sbjct: 10  NPEAVKEKLRKRGVDVLDVDELLELDEERR-------ELQVELEELQAERNELSKEIGKA 62

Query: 67  MAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAAL 111
              +KKE+ E    EV+E K  +L+     L E E +   ++ ++
Sbjct: 63  K--KKKEDAEALIAEVKELKD-ELKALEAELRELEAELDKLLLSI 104


>gnl|CDD|130083 TIGR01010, BexC_CtrB_KpsE, polysaccharide export inner-membrane
           protein, BexC/CtrB/KpsE family.  This family contains
           gamma proteobacterial proteins involved in capsule
           polysaccharide export [Transport and binding proteins,
           Carbohydrates, organic alcohols, and acids].
          Length = 362

 Score = 28.1 bits (63), Expect = 8.7
 Identities = 20/60 (33%), Positives = 33/60 (55%), Gaps = 1/60 (1%)

Query: 61  AEEEQKMAQRKKEEEERRQREVEEKKQRD-LEEKRQRLEEAEKKRQAMMAALKEQTNKSK 119
           AEE QK+ QR  +E ER    + E+ ++D +      ++EAE++  A  A L +   K+K
Sbjct: 142 AEEAQKINQRLLKEGERLINRLNERARKDTIAFAENEVKEAEQRLNATKAELLKYQIKNK 201


>gnl|CDD|203462 pfam06476, DUF1090, Protein of unknown function (DUF1090).  This
           family consists of several bacterial proteins of unknown
           function and is known as YqjC in E. coli.
          Length = 115

 Score = 26.8 bits (60), Expect = 9.0
 Identities = 15/52 (28%), Positives = 34/52 (65%), Gaps = 5/52 (9%)

Query: 59  MRAEEEQKMAQRKKEEEERRQREVEEKKQR----DLEEKRQRLEEAEKKRQA 106
           +RAE + K+ + K++E   R++E++E + +     +E+K+++L EA+ + Q 
Sbjct: 64  LRAERQAKVEE-KEQEVAEREQELKEAQAKGDADKIEKKQRKLAEAQAELQE 114


>gnl|CDD|181442 PRK08476, PRK08476, F0F1 ATP synthase subunit B'; Validated.
          Length = 141

 Score = 27.0 bits (60), Expect = 9.1
 Identities = 15/72 (20%), Positives = 33/72 (45%)

Query: 43  EEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEK 102
           + DL+++K   +    +  E E  +   ++E  + RQ+ + + K+   ++   +  E E 
Sbjct: 44  KNDLEKVKTNSSDVSEIEHEIETILKNAREEANKIRQKAIAKAKEEAEKKIEAKKAELES 103

Query: 103 KRQAMMAALKEQ 114
           K +A    L  Q
Sbjct: 104 KYEAFAKQLANQ 115


>gnl|CDD|226654 COG4191, COG4191, Signal transduction histidine kinase regulating
           C4-dicarboxylate transport system [Signal transduction
           mechanisms].
          Length = 603

 Score = 28.1 bits (63), Expect = 9.2
 Identities = 13/57 (22%), Positives = 26/57 (45%), Gaps = 2/57 (3%)

Query: 35  WRNQRAKEEEDLKRLKEKQAKRKVM--RAEEEQKMAQRKKEEEERRQREVEEKKQRD 89
            R +RA+      +    + +R+V    A+  +  A+ + E  ER Q E   ++ +D
Sbjct: 317 RRRRRARLRLAELQEARAELERRVEERTADLTRANARLQAEIAEREQAEAALRRAQD 373


>gnl|CDD|234937 PRK01294, PRK01294, lipase chaperone; Provisional.
          Length = 336

 Score = 27.7 bits (62), Expect = 9.3
 Identities = 24/112 (21%), Positives = 51/112 (45%), Gaps = 17/112 (15%)

Query: 17  QEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKE--- 73
           + A+ +AL+ QL E +     +  +++  L++L + QA      A  ++    R +    
Sbjct: 208 KAARLAALEAQLPEDLRAALQESQRQQALLQQLAQLQAS----GASPQELRLMRAQLVGP 263

Query: 74  ------EEERRQREVEEKKQRDLEEKRQRLEEA----EKKRQAMMAALKEQT 115
                 E+  +QR   +++  D   +R ++  A     + RQA +A L++Q 
Sbjct: 264 EAAQRLEQLDQQRAAWQQRYDDYLAQRAQILNAAGLSPQDRQAQIAQLRQQR 315


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 28.1 bits (63), Expect = 9.4
 Identities = 41/187 (21%), Positives = 83/187 (44%), Gaps = 8/187 (4%)

Query: 15  KRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEE 74
           +++         +L+E I E   +  + +++++ L+EK  + K ++ + E+ +   +  E
Sbjct: 244 EKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYE 303

Query: 75  EERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTL 134
           E   +    EK+   LEE+   +EE  K+ +     L+E   K K     +++R   L  
Sbjct: 304 EYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKE----LEKRLEELEE 359

Query: 135 STAQLERNKTK-EQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKLETDKYD 193
                E  K K E++E  KK    +  + L  E   + +L+    E+ E I K+     +
Sbjct: 360 RHELYEEAKAKKEELERLKKRLTGLTPEKLEKE---LEELEKAKEEIEEEISKITARIGE 416

Query: 194 LEERQKR 200
           L++  K 
Sbjct: 417 LKKEIKE 423


>gnl|CDD|202204 pfam02321, OEP, Outer membrane efflux protein.  The OEP family
           (Outer membrane efflux protein) form trimeric channels
           that allow export of a variety of substrates in Gram
           negative bacteria. Each member of this family is
           composed of two repeats. The trimeric channel is
           composed of a 12 stranded all beta sheet barrel that
           spans the outer membrane, and a long all helical barrel
           that spans the periplasm.
          Length = 186

 Score = 27.5 bits (61), Expect = 9.6
 Identities = 26/105 (24%), Positives = 50/105 (47%), Gaps = 3/105 (2%)

Query: 14  VKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKE--KQAKRKVMRAEEEQKMAQRK 71
           VK  +A+  A + +L++   + R + A+   +L   KE  + AK+ +  AEE  ++A+ +
Sbjct: 83  VKAAKAQLEAAEAELEQARRQLRLEVAQAYFNLLAAKEQLELAKQALELAEEALELAEAR 142

Query: 72  KEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTN 116
            E       +V +  + +L E R  L  AE   +   A L++   
Sbjct: 143 YEAGLISLLDVLQA-EVELLEARLELLNAEADLELARAQLEQLLG 186


>gnl|CDD|233357 TIGR01322, scrB_fam, sucrose-6-phosphate hydrolase.  [Energy
           metabolism, Biosynthesis and degradation of
           polysaccharides].
          Length = 445

 Score = 28.1 bits (63), Expect = 9.6
 Identities = 30/117 (25%), Positives = 42/117 (35%), Gaps = 23/117 (19%)

Query: 57  KVMRAEEEQKM--AQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQ 114
           K +R EE   +   Q         + E+      DLE           K  A    L   
Sbjct: 328 KALRTEEHINVFGDQEHTLPGLNGEFELIL----DLE-----------KDSAFELGLAL- 371

Query: 115 TNKSKGPNFTIQRREPGLTL---STAQLERNKT--KEQIEEEKKIALNIRIKPLNVE 166
           TNK +    TI   E  +TL   S+  LE         +   KK++L+I I   +VE
Sbjct: 372 TNKGEETLLTIDADEGKVTLDRRSSGNLEDYGGTRSCPLPNTKKVSLHIFIDKSSVE 428


>gnl|CDD|180941 PRK07352, PRK07352, F0F1 ATP synthase subunit B; Validated.
          Length = 174

 Score = 27.2 bits (61), Expect = 9.7
 Identities = 21/76 (27%), Positives = 35/76 (46%), Gaps = 10/76 (13%)

Query: 39  RAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRL- 97
           R    + LK  +E+  +     AE +QK+AQ ++E E  R          D + + + + 
Sbjct: 52  REAILQALKEAEERLRQAAQALAEAQQKLAQAQQEAERIRA---------DAKARAEAIR 102

Query: 98  EEAEKKRQAMMAALKE 113
            E EK+    MA LK+
Sbjct: 103 AEIEKQAIEDMARLKQ 118


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 27.7 bits (62), Expect = 9.7
 Identities = 20/76 (26%), Positives = 38/76 (50%), Gaps = 4/76 (5%)

Query: 34  EWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERR----QREVEEKKQRD 89
           E  N+  + +   K  + K+ +RK +  E +++   +KK  +  R     +EV +K++  
Sbjct: 267 EPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQLARLKEIAKEVAQKEKAR 326

Query: 90  LEEKRQRLEEAEKKRQ 105
             +K QR E  EKK+ 
Sbjct: 327 ARKKEQRKERGEKKKL 342


>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
           fragment found in eukaryotic DNA topoisomerase (topo) IB
           proteins similar to the monomeric yeast and human topo I
           and heterodimeric topo I from Leishmania donvanni. Topo
           I enzymes are divided into:  topo type IA (bacterial)
           and type IB (eukaryotic). Topo I relaxes superhelical
           tension in duplex DNA by creating a single-strand nick,
           the broken strand can then rotate around the unbroken
           strand to remove DNA supercoils and, the nick is
           religated, liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit re-ligation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topo I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topos I play putative roles
           in organizing the kinetoplast DNA network unique to
           these parasites.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 27.6 bits (62), Expect = 10.0
 Identities = 16/62 (25%), Positives = 29/62 (46%), Gaps = 11/62 (17%)

Query: 71  KKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREP 130
           ++E+E+++    EEKK      K ++ +  E     ++   KE+       NF I+   P
Sbjct: 95  EEEKEKKKAMSKEEKK----AIKEEKEKLEEPYGYCLVDGHKEKVG-----NFRIEP--P 143

Query: 131 GL 132
           GL
Sbjct: 144 GL 145


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.307    0.127    0.344 

Gapped
Lambda     K      H
   0.267   0.0812    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 19,414,440
Number of extensions: 1971623
Number of successful extensions: 11900
Number of sequences better than 10.0: 1
Number of HSP's gapped: 7284
Number of HSP's successfully gapped: 1639
Length of query: 372
Length of database: 10,937,602
Length adjustment: 98
Effective length of query: 274
Effective length of database: 6,590,910
Effective search space: 1805909340
Effective search space used: 1805909340
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 60 (26.7 bits)