RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy3368
         (169 letters)



>gnl|CDD|215741 pfam00135, COesterase, Carboxylesterase family. 
          Length = 510

 Score =  117 bits (296), Expect = 1e-31
 Identities = 43/111 (38%), Positives = 60/111 (54%), Gaps = 21/111 (18%)

Query: 29  IIELKQGRLRGVVRSPVHNGNLNNIHMYLGIPYAAPPVGQLRFMAPQSPQPWPGLMIADS 88
           ++E   G++RG+    V  G    ++ +LGIPYA PPVG+LRF  PQ P+PW G++ A  
Sbjct: 2   VVETSYGKVRGLRV-TVDGGP---VYAFLGIPYAKPPVGELRFKKPQPPEPWTGVLDATK 57

Query: 89  FSPVCPQKLPNLDDDDFSKKMSKGRLQYYQALMPYLKNQSEDCLYLNIYTP 139
           + P CPQ      ++D   +M                  SEDCLYLN+YTP
Sbjct: 58  YGPACPQ------NNDLGSEMWNKNT-----------GMSEDCLYLNVYTP 91


>gnl|CDD|238191 cd00312, Esterase_lipase, Esterases and lipases (includes fungal
           lipases, cholinesterases, etc.)  These enzymes act on
           carboxylic esters (EC: 3.1.1.-). The catalytic apparatus
           involves three residues (catalytic triad): a serine, a
           glutamate or aspartate and a histidine.These catalytic
           residues are responsible for the nucleophilic attack on
           the carbonyl carbon atom of the ester bond. In contrast
           with other alpha/beta hydrolase fold family members,
           p-nitrobenzyl esterase and acetylcholine esterase have a
           Glu instead of Asp at the active site carboxylate.
          Length = 493

 Score = 98.6 bits (246), Expect = 1e-24
 Identities = 40/111 (36%), Positives = 53/111 (47%), Gaps = 25/111 (22%)

Query: 29  IIELKQGRLRGVVRSPVHNGNLNNIHMYLGIPYAAPPVGQLRFMAPQSPQPWPGLMIADS 88
           ++    G++RGV    V+         +LGIPYA PPVG LRF  PQ  +PW  ++ A S
Sbjct: 1   LVVTPNGKVRGVDEGGVY--------SFLGIPYAEPPVGDLRFKEPQPYEPWSDVLDATS 52

Query: 89  FSPVCPQKLPNLDDDDFSKKMSKGRLQYYQALMPYLKNQSEDCLYLNIYTP 139
           + P C Q       D     +   +L             SEDCLYLN+YTP
Sbjct: 53  YPPSCMQ------WDQLGGGLWNAKLP-----------GSEDCLYLNVYTP 86


>gnl|CDD|225181 COG2272, PnbA, Carboxylesterase type B [Lipid metabolism].
          Length = 491

 Score = 97.1 bits (242), Expect = 4e-24
 Identities = 45/120 (37%), Positives = 56/120 (46%), Gaps = 29/120 (24%)

Query: 29  IIELKQGRLRGVVRSPVHNGNLNNIHMYLGIPYAAPPVGQLRFMAPQSPQPWPGLMIADS 88
           + E   G++ G+          N +H +LGIPYAAPPVG+LRF  P  P+PW G+  A  
Sbjct: 4   VAETTTGKVEGITV--------NGVHSWLGIPYAAPPVGELRFRRPVPPEPWSGVRDATQ 55

Query: 89  FSPVCPQKLPNLDDDDFSKKMSKGRLQYYQALMPYLKNQSEDCLYLNIYTPLQEEEEEKK 148
           F P CPQ            +M  G               SEDCLYLNI+ P  E   EK 
Sbjct: 56  FGPACPQ---------PFNRMGSGEDF----------TGSEDCLYLNIWAP--EVPAEKL 94


>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
           includes archaeal L12p, the protein that is functionally
           equivalent to L7/L12 in bacteria and the P1 and P2
           proteins in eukaryotes. L12p is homologous to P1 and P2
           but is not homologous to bacterial L7/L12. It is located
           in the L12 stalk, with proteins L10, L11, and 23S rRNA.
           L12p is the only protein in the ribosome to occur as
           multimers, always appearing as sets of dimers. Recent
           data indicate that most archaeal species contain six
           copies of L12p (three homodimers), while eukaryotes have
           four copies (two heterodimers), and bacteria may have
           four or six copies (two or three homodimers), depending
           on the species. The organization of proteins within the
           stalk has been characterized primarily in bacteria,
           where L7/L12 forms either two or three homodimers and
           each homodimer binds to the extended C-terminal helix of
           L10. L7/L12 is attached to the ribosome through L10 and
           is the only ribosomal protein that does not directly
           interact with rRNA. Archaeal L12p is believed to
           function in a similar fashion. However, hybrid ribosomes
           containing the large subunit from E. coli with an
           archaeal stalk are able to bind archaeal and eukaryotic
           elongation factors but not bacterial elongation factors.
           In several mesophilic and thermophilic archaeal species,
           the binding of 23S rRNA to protein L11 and to the
           L10/L12p pentameric complex was found to be
           temperature-dependent and cooperative.
          Length = 106

 Score = 36.7 bits (85), Expect = 0.001
 Identities = 15/28 (53%), Positives = 20/28 (71%)

Query: 139 PLQEEEEEKKEEREEKKKEEKKKEEGEE 166
                EE+ +E+ EEKKKEE+K+EE EE
Sbjct: 70  AAAAAEEKAEEKEEEKKKEEEKEEEEEE 97



 Score = 30.9 bits (70), Expect = 0.10
 Identities = 12/23 (52%), Positives = 17/23 (73%)

Query: 145 EEKKEEREEKKKEEKKKEEGEEK 167
               EE+ E+K+EEKKKEE +E+
Sbjct: 71  AAAAEEKAEEKEEEKKKEEEKEE 93



 Score = 27.1 bits (60), Expect = 2.1
 Identities = 12/28 (42%), Positives = 18/28 (64%)

Query: 139 PLQEEEEEKKEEREEKKKEEKKKEEGEE 166
           P        +E+ EEK++E+KK+EE EE
Sbjct: 66  PAAAAAAAAEEKAEEKEEEKKKEEEKEE 93



 Score = 26.3 bits (58), Expect = 4.7
 Identities = 13/23 (56%), Positives = 19/23 (82%), Gaps = 1/23 (4%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEG 164
           E+EEEKK+E EEK++EE++   G
Sbjct: 80  EKEEEKKKE-EEKEEEEEEALAG 101



 Score = 25.5 bits (56), Expect = 7.7
 Identities = 8/24 (33%), Positives = 20/24 (83%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGE 165
           + EE+++E+++E++KEE+++E   
Sbjct: 77  KAEEKEEEKKKEEEKEEEEEEALA 100


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 36.1 bits (83), Expect = 0.005
 Identities = 15/26 (57%), Positives = 24/26 (92%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEE 166
           +EEEEE++EE EE+++EE+++EE EE
Sbjct: 867 EEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 35.7 bits (82), Expect = 0.009
 Identities = 15/29 (51%), Positives = 26/29 (89%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           +EEEEE++EE EE+++EE+++EE EE ++
Sbjct: 864 EEEEEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 32.3 bits (73), Expect = 0.12
 Identities = 13/28 (46%), Positives = 25/28 (89%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           + EEE++EE EE+++EE+++EE EE+++
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEE 889



 Score = 28.4 bits (63), Expect = 2.2
 Identities = 13/29 (44%), Positives = 20/29 (68%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           +EEEEE++EE EE++ EE    E  E ++
Sbjct: 875 EEEEEEEEEEEEEEENEEPLSLEWPETRQ 903



 Score = 27.3 bits (60), Expect = 4.7
 Identities = 13/29 (44%), Positives = 20/29 (68%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           +EEEEE++EE EE+ +E    E  E ++K
Sbjct: 876 EEEEEEEEEEEEEENEEPLSLEWPETRQK 904


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 35.9 bits (82), Expect = 0.007
 Identities = 15/30 (50%), Positives = 20/30 (66%)

Query: 140  LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            ++  EE KK E ++KK EE KK E +EKK 
Sbjct: 1661 IKAAEEAKKAEEDKKKAEEAKKAEEDEKKA 1690



 Score = 35.1 bits (80), Expect = 0.016
 Identities = 15/29 (51%), Positives = 19/29 (65%)

Query: 141  QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            ++ EE KK E +EKK  E  K+E EE KK
Sbjct: 1675 KKAEEAKKAEEDEKKAAEALKKEAEEAKK 1703



 Score = 33.6 bits (76), Expect = 0.050
 Identities = 14/26 (53%), Positives = 20/26 (76%)

Query: 143  EEEEKKEEREEKKKEEKKKEEGEEKK 168
            EE +K+ E ++KK EE KK+E E+KK
Sbjct: 1733 EEAKKEAEEDKKKAEEAKKDEEEKKK 1758



 Score = 32.8 bits (74), Expect = 0.092
 Identities = 17/29 (58%), Positives = 19/29 (65%)

Query: 141  QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            +  E  KKE  E KK EE KK+E EEKKK
Sbjct: 1689 KAAEALKKEAEEAKKAEELKKKEAEEKKK 1717



 Score = 30.5 bits (68), Expect = 0.46
 Identities = 17/26 (65%), Positives = 20/26 (76%)

Query: 144  EEEKKEEREEKKKEEKKKEEGEEKKK 169
            EE KK E E+KK E+ KK+E EEKKK
Sbjct: 1623 EELKKAEEEKKKVEQLKKKEAEEKKK 1648



 Score = 30.1 bits (67), Expect = 0.58
 Identities = 14/29 (48%), Positives = 19/29 (65%)

Query: 141  QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            + EEE K +  EE KK E+ K++ EE KK
Sbjct: 1654 KAEEENKIKAAEEAKKAEEDKKKAEEAKK 1682



 Score = 29.7 bits (66), Expect = 0.79
 Identities = 14/30 (46%), Positives = 19/30 (63%)

Query: 140  LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            L++    KK+  E KKK E+KK+  E KKK
Sbjct: 1410 LKKAAAAKKKADEAKKKAEEKKKADEAKKK 1439



 Score = 29.7 bits (66), Expect = 0.93
 Identities = 13/29 (44%), Positives = 18/29 (62%)

Query: 141  QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            ++ EE+KK+  E KK    KK+  E KKK
Sbjct: 1398 KKAEEDKKKADELKKAAAAKKKADEAKKK 1426



 Score = 29.3 bits (65), Expect = 1.0
 Identities = 16/44 (36%), Positives = 23/44 (52%)

Query: 125  KNQSEDCLYLNIYTPLQEEEEEKKEEREEKKKEEKKKEEGEEKK 168
            + + E+ + L       + EE KK E  + K EE KK E E+KK
Sbjct: 1591 EARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKK 1634



 Score = 29.3 bits (65), Expect = 1.1
 Identities = 14/34 (41%), Positives = 18/34 (52%), Gaps = 4/34 (11%)

Query: 140  LQEEEEEKKEEREEKKKEEKKKEE----GEEKKK 169
             ++  E KK+  E KK EE KK +     EE KK
Sbjct: 1502 AKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKK 1535



 Score = 29.3 bits (65), Expect = 1.1
 Identities = 15/30 (50%), Positives = 23/30 (76%)

Query: 140  LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            L++ EEEKK+  + KKKE ++K++ EE KK
Sbjct: 1625 LKKAEEEKKKVEQLKKKEAEEKKKAEELKK 1654



 Score = 29.3 bits (65), Expect = 1.2
 Identities = 15/30 (50%), Positives = 22/30 (73%)

Query: 140  LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            L++E EE K+  E KKKE ++K++ EE KK
Sbjct: 1694 LKKEAEEAKKAEELKKKEAEEKKKAEELKK 1723



 Score = 29.0 bits (64), Expect = 1.3
 Identities = 13/29 (44%), Positives = 20/29 (68%)

Query: 140  LQEEEEEKKEEREEKKKEEKKKEEGEEKK 168
             ++EE +KK +  +KK EEKKK +  +KK
Sbjct: 1371 KKKEEAKKKADAAKKKAEEKKKADEAKKK 1399



 Score = 29.0 bits (64), Expect = 1.4
 Identities = 15/27 (55%), Positives = 18/27 (66%)

Query: 143  EEEEKKEEREEKKKEEKKKEEGEEKKK 169
            EE +K EE  + K EE KKE  E+KKK
Sbjct: 1719 EELKKAEEENKIKAEEAKKEAEEDKKK 1745



 Score = 29.0 bits (64), Expect = 1.4
 Identities = 12/29 (41%), Positives = 19/29 (65%)

Query: 141  QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            ++ +E KK+  E KKK +  K++ EE KK
Sbjct: 1315 KKADEAKKKAEEAKKKADAAKKKAEEAKK 1343



 Score = 29.0 bits (64), Expect = 1.5
 Identities = 16/27 (59%), Positives = 18/27 (66%)

Query: 143  EEEEKKEEREEKKKEEKKKEEGEEKKK 169
            EE +KKE  E+KK EE KK E E K K
Sbjct: 1705 EELKKKEAEEKKKAEELKKAEEENKIK 1731



 Score = 28.6 bits (63), Expect = 2.1
 Identities = 15/27 (55%), Positives = 17/27 (62%)

Query: 143  EEEEKKEEREEKKKEEKKKEEGEEKKK 169
            E  E+K E  EKKKEE KK+    KKK
Sbjct: 1360 EAAEEKAEAAEKKKEEAKKKADAAKKK 1386



 Score = 28.6 bits (63), Expect = 2.3
 Identities = 12/29 (41%), Positives = 22/29 (75%)

Query: 141  QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            +++ +E K++ EEKKK ++ K++ EE KK
Sbjct: 1417 KKKADEAKKKAEEKKKADEAKKKAEEAKK 1445



 Score = 28.6 bits (63), Expect = 2.3
 Identities = 14/30 (46%), Positives = 20/30 (66%)

Query: 140  LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
             +++ EEKK+  E KKK E+ K+  E KKK
Sbjct: 1423 AKKKAEEKKKADEAKKKAEEAKKADEAKKK 1452



 Score = 28.6 bits (63), Expect = 2.3
 Identities = 14/27 (51%), Positives = 18/27 (66%)

Query: 142  EEEEEKKEEREEKKKEEKKKEEGEEKK 168
            E  EEK E  E+KK+E KKK +  +KK
Sbjct: 1360 EAAEEKAEAAEKKKEEAKKKADAAKKK 1386



 Score = 28.2 bits (62), Expect = 2.4
 Identities = 12/27 (44%), Positives = 17/27 (62%)

Query: 143  EEEEKKEEREEKKKEEKKKEEGEEKKK 169
            EE +K EE  + K  E+ K+  E+KKK
Sbjct: 1650 EELKKAEEENKIKAAEEAKKAEEDKKK 1676



 Score = 28.2 bits (62), Expect = 2.4
 Identities = 13/30 (43%), Positives = 19/30 (63%)

Query: 140  LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
             +++ EE K+  E KKK E+ K+  E KKK
Sbjct: 1462 AKKKAEEAKKADEAKKKAEEAKKADEAKKK 1491



 Score = 28.2 bits (62), Expect = 2.6
 Identities = 12/29 (41%), Positives = 19/29 (65%)

Query: 141  QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            ++ +E KK+  E KKK ++ K+  E KKK
Sbjct: 1483 KKADEAKKKAEEAKKKADEAKKAAEAKKK 1511



 Score = 28.2 bits (62), Expect = 2.7
 Identities = 11/26 (42%), Positives = 17/26 (65%)

Query: 143  EEEEKKEEREEKKKEEKKKEEGEEKK 168
            +E +KK E ++KK +E KK    +KK
Sbjct: 1394 DEAKKKAEEDKKKADELKKAAAAKKK 1419



 Score = 28.2 bits (62), Expect = 2.8
 Identities = 11/29 (37%), Positives = 19/29 (65%)

Query: 141  QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            ++ +E KK+  E+KKK ++ K+    KKK
Sbjct: 1391 KKADEAKKKAEEDKKKADELKKAAAAKKK 1419



 Score = 28.2 bits (62), Expect = 2.8
 Identities = 11/25 (44%), Positives = 18/25 (72%)

Query: 145  EEKKEEREEKKKEEKKKEEGEEKKK 169
            +E K++ EE KK ++ K++ EE KK
Sbjct: 1434 DEAKKKAEEAKKADEAKKKAEEAKK 1458



 Score = 28.2 bits (62), Expect = 3.0
 Identities = 13/30 (43%), Positives = 21/30 (70%)

Query: 140  LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
             +++ EEKK+  E KKK E+ K++ +E KK
Sbjct: 1383 AKKKAEEKKKADEAKKKAEEDKKKADELKK 1412



 Score = 28.2 bits (62), Expect = 3.1
 Identities = 12/30 (40%), Positives = 20/30 (66%)

Query: 140  LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
             +++ EE K+  E KKK E+ K++ +E KK
Sbjct: 1475 AKKKAEEAKKADEAKKKAEEAKKKADEAKK 1504



 Score = 27.8 bits (61), Expect = 3.4
 Identities = 11/29 (37%), Positives = 20/29 (68%)

Query: 141  QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            +++ +E K+    KKK ++ K++ EEKKK
Sbjct: 1404 KKKADELKKAAAAKKKADEAKKKAEEKKK 1432



 Score = 27.8 bits (61), Expect = 3.5
 Identities = 13/29 (44%), Positives = 19/29 (65%)

Query: 141  QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            + EEE K +  E KK+ E+ K++ EE KK
Sbjct: 1723 KAEEENKIKAEEAKKEAEEDKKKAEEAKK 1751



 Score = 27.8 bits (61), Expect = 3.6
 Identities = 13/29 (44%), Positives = 18/29 (62%)

Query: 141  QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            ++ EE KK +  +KK EE KK+    KKK
Sbjct: 1309 KKAEEAKKADEAKKKAEEAKKKADAAKKK 1337



 Score = 27.8 bits (61), Expect = 3.6
 Identities = 13/29 (44%), Positives = 19/29 (65%)

Query: 141  QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            ++ +E KK+  E KK +E KK+  E KKK
Sbjct: 1470 KKADEAKKKAEEAKKADEAKKKAEEAKKK 1498



 Score = 27.8 bits (61), Expect = 3.6
 Identities = 11/30 (36%), Positives = 19/30 (63%)

Query: 140  LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
             +++ EE K+  E KKK E+ K++ +  KK
Sbjct: 1307 AKKKAEEAKKADEAKKKAEEAKKKADAAKK 1336



 Score = 27.8 bits (61), Expect = 3.6
 Identities = 10/29 (34%), Positives = 16/29 (55%)

Query: 141  QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            + E    + E  E+K E  +K++ E KKK
Sbjct: 1351 EAEAAADEAEAAEEKAEAAEKKKEEAKKK 1379



 Score = 27.8 bits (61), Expect = 3.8
 Identities = 13/26 (50%), Positives = 17/26 (65%)

Query: 144  EEEKKEEREEKKKEEKKKEEGEEKKK 169
            +E KK+  E KK +E KK+  E KKK
Sbjct: 1305 DEAKKKAEEAKKADEAKKKAEEAKKK 1330



 Score = 27.8 bits (61), Expect = 4.0
 Identities = 13/28 (46%), Positives = 18/28 (64%)

Query: 141  QEEEEEKKEEREEKKKEEKKKEEGEEKK 168
            ++ EE KK +  +KK EE KK E  +KK
Sbjct: 1438 KKAEEAKKADEAKKKAEEAKKAEEAKKK 1465



 Score = 27.8 bits (61), Expect = 4.1
 Identities = 11/29 (37%), Positives = 16/29 (55%)

Query: 141  QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            ++ EE KK+    KKK E+ K+  E  K 
Sbjct: 1322 KKAEEAKKKADAAKKKAEEAKKAAEAAKA 1350



 Score = 27.4 bits (60), Expect = 4.4
 Identities = 13/30 (43%), Positives = 19/30 (63%)

Query: 140  LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
             +++ EE K+  E KKK E+ K+  E KKK
Sbjct: 1449 AKKKAEEAKKAEEAKKKAEEAKKADEAKKK 1478



 Score = 27.4 bits (60), Expect = 4.5
 Identities = 12/29 (41%), Positives = 19/29 (65%)

Query: 140  LQEEEEEKKEEREEKKKEEKKKEEGEEKK 168
             ++ EE+KK +  +KK EE KK +  +KK
Sbjct: 1295 AKKAEEKKKADEAKKKAEEAKKADEAKKK 1323



 Score = 27.4 bits (60), Expect = 4.5
 Identities = 12/29 (41%), Positives = 20/29 (68%)

Query: 141  QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            ++ +  KK+  E+KK +E KK+  E+KKK
Sbjct: 1378 KKADAAKKKAEEKKKADEAKKKAEEDKKK 1406



 Score = 27.4 bits (60), Expect = 4.6
 Identities = 12/25 (48%), Positives = 18/25 (72%)

Query: 145  EEKKEEREEKKKEEKKKEEGEEKKK 169
            EE K++ EE KK ++ K++ EE KK
Sbjct: 1460 EEAKKKAEEAKKADEAKKKAEEAKK 1484



 Score = 27.4 bits (60), Expect = 4.8
 Identities = 13/28 (46%), Positives = 17/28 (60%)

Query: 141  QEEEEEKKEEREEKKKEEKKKEEGEEKK 168
            ++ EE KK+  E KK  E KK+  E KK
Sbjct: 1490 KKAEEAKKKADEAKKAAEAKKKADEAKK 1517



 Score = 27.4 bits (60), Expect = 4.9
 Identities = 13/29 (44%), Positives = 18/29 (62%)

Query: 141  QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            ++ EE KK +  +KK EE KK+  E KK 
Sbjct: 1477 KKAEEAKKADEAKKKAEEAKKKADEAKKA 1505



 Score = 27.4 bits (60), Expect = 5.2
 Identities = 13/29 (44%), Positives = 19/29 (65%)

Query: 141  QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            ++ EE+KK +  +KK EE KK+  E KK 
Sbjct: 1385 KKAEEKKKADEAKKKAEEDKKKADELKKA 1413



 Score = 27.0 bits (59), Expect = 5.9
 Identities = 13/30 (43%), Positives = 19/30 (63%)

Query: 140  LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
             +++ EE K+  E KKK E+ K+  E KKK
Sbjct: 1436 AKKKAEEAKKADEAKKKAEEAKKAEEAKKK 1465



 Score = 27.0 bits (59), Expect = 6.0
 Identities = 9/29 (31%), Positives = 13/29 (44%)

Query: 141  QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            ++ +  KK+  E KK  E  K E E    
Sbjct: 1329 KKADAAKKKAEEAKKAAEAAKAEAEAAAD 1357



 Score = 27.0 bits (59), Expect = 6.0
 Identities = 11/27 (40%), Positives = 16/27 (59%)

Query: 143  EEEEKKEEREEKKKEEKKKEEGEEKKK 169
            +E +KK E  +KK +  KK+  E KK 
Sbjct: 1318 DEAKKKAEEAKKKADAAKKKAEEAKKA 1344



 Score = 27.0 bits (59), Expect = 6.4
 Identities = 11/27 (40%), Positives = 15/27 (55%)

Query: 143  EEEEKKEEREEKKKEEKKKEEGEEKKK 169
            EE +KK +  +KK EE KK     K +
Sbjct: 1325 EEAKKKADAAKKKAEEAKKAAEAAKAE 1351



 Score = 26.6 bits (58), Expect = 7.5
 Identities = 8/29 (27%), Positives = 14/29 (48%)

Query: 141  QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            ++ EE KK     K + E   +E E  ++
Sbjct: 1336 KKAEEAKKAAEAAKAEAEAAADEAEAAEE 1364



 Score = 26.6 bits (58), Expect = 8.1
 Identities = 10/34 (29%), Positives = 16/34 (47%), Gaps = 4/34 (11%)

Query: 140  LQEEEEEKKEEREEKKKEEKKK----EEGEEKKK 169
             +++ EE K+  E  K E +      E  EEK +
Sbjct: 1334 AKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAE 1367



 Score = 26.6 bits (58), Expect = 8.5
 Identities = 15/34 (44%), Positives = 23/34 (67%), Gaps = 4/34 (11%)

Query: 140  LQEEEEEKKEER----EEKKKEEKKKEEGEEKKK 169
            L++ EE+KK +     EEKKK ++ K++ EE KK
Sbjct: 1283 LKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKK 1316


>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P.  This model
           represents the L12P protein of the large (50S) subunit
           of the archaeal ribosome.
          Length = 105

 Score = 34.2 bits (79), Expect = 0.008
 Identities = 9/25 (36%), Positives = 17/25 (68%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEE 166
                ++EE EE+++EE+++E  EE
Sbjct: 71  AAAAAEEEEEEEEEEEEEEEESEEE 95



 Score = 31.9 bits (73), Expect = 0.042
 Identities = 9/27 (33%), Positives = 17/27 (62%)

Query: 139 PLQEEEEEKKEEREEKKKEEKKKEEGE 165
                  E++EE EE+++EE+++ E E
Sbjct: 69  AAAAAAAEEEEEEEEEEEEEEEESEEE 95



 Score = 31.2 bits (71), Expect = 0.086
 Identities = 11/23 (47%), Positives = 18/23 (78%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEG 164
             EEE++EE EE+++EE+ +EE 
Sbjct: 74  AAEEEEEEEEEEEEEEEESEEEA 96



 Score = 30.0 bits (68), Expect = 0.21
 Identities = 11/25 (44%), Positives = 18/25 (72%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGE 165
             EEEE++EE EE+++EE ++E   
Sbjct: 74  AAEEEEEEEEEEEEEEEESEEEAMA 98



 Score = 28.5 bits (64), Expect = 0.81
 Identities = 11/27 (40%), Positives = 19/27 (70%)

Query: 138 TPLQEEEEEKKEEREEKKKEEKKKEEG 164
              +EEEEE++EE EE+++ E++   G
Sbjct: 73  AAAEEEEEEEEEEEEEEEESEEEAMAG 99



 Score = 26.9 bits (60), Expect = 2.6
 Identities = 8/20 (40%), Positives = 15/20 (75%)

Query: 149 EEREEKKKEEKKKEEGEEKK 168
              EE+++EE+++EE EE+ 
Sbjct: 73  AAAEEEEEEEEEEEEEEEES 92



 Score = 26.5 bits (59), Expect = 4.2
 Identities = 7/21 (33%), Positives = 15/21 (71%)

Query: 149 EEREEKKKEEKKKEEGEEKKK 169
               E+++EE+++EE EE++ 
Sbjct: 72  AAAAEEEEEEEEEEEEEEEES 92


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 35.7 bits (83), Expect = 0.008
 Identities = 13/29 (44%), Positives = 23/29 (79%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           +++ EE+K+E+++K    KKKEE EE++K
Sbjct: 420 EKKREEEKKEKKKKAFAGKKKEEEEEEEK 448



 Score = 32.2 bits (74), Expect = 0.12
 Identities = 10/30 (33%), Positives = 24/30 (80%)

Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            +EEE+++K+++    K+++++EE E++KK
Sbjct: 422 KREEEKKEKKKKAFAGKKKEEEEEEEKEKK 451



 Score = 31.8 bits (73), Expect = 0.14
 Identities = 12/30 (40%), Positives = 23/30 (76%)

Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           +++  E+ +++REE+KKE+KKK    +KK+
Sbjct: 412 IKKIVEKAEKKREEEKKEKKKKAFAGKKKE 441



 Score = 31.4 bits (72), Expect = 0.20
 Identities = 13/30 (43%), Positives = 24/30 (80%)

Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            +EE++EKK++    KK+E+++EE +EKK+
Sbjct: 423 REEEKKEKKKKAFAGKKKEEEEEEEKEKKE 452



 Score = 31.4 bits (72), Expect = 0.20
 Identities = 9/30 (30%), Positives = 22/30 (73%)

Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            + E++ ++E++E+KKK    K++ EE+++
Sbjct: 417 EKAEKKREEEKKEKKKKAFAGKKKEEEEEE 446



 Score = 31.0 bits (71), Expect = 0.24
 Identities = 13/29 (44%), Positives = 23/29 (79%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           ++ EEEKKE++++    +KK+EE EE+K+
Sbjct: 421 KKREEEKKEKKKKAFAGKKKEEEEEEEKE 449



 Score = 30.7 bits (70), Expect = 0.34
 Identities = 14/28 (50%), Positives = 25/28 (89%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           EEEEEK+++ EEK++EE++ EE +E+++
Sbjct: 443 EEEEEKEKKEEEKEEEEEEAEEEKEEEE 470



 Score = 30.7 bits (70), Expect = 0.36
 Identities = 10/28 (35%), Positives = 20/28 (71%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
             E+ +K+  EEKK+++KK   G++K++
Sbjct: 415 IVEKAEKKREEEKKEKKKKAFAGKKKEE 442



 Score = 30.7 bits (70), Expect = 0.36
 Identities = 15/27 (55%), Positives = 24/27 (88%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKK 168
           E++EE+KEE EE+ +EEK++EE ++KK
Sbjct: 449 EKKEEEKEEEEEEAEEEKEEEEEKKKK 475



 Score = 30.7 bits (70), Expect = 0.41
 Identities = 13/28 (46%), Positives = 23/28 (82%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           E++E+KK+    KKKEE+++EE E+K++
Sbjct: 426 EKKEKKKKAFAGKKKEEEEEEEKEKKEE 453



 Score = 30.3 bits (69), Expect = 0.51
 Identities = 19/32 (59%), Positives = 26/32 (81%), Gaps = 3/32 (9%)

Query: 141 QEEEEEKKEER---EEKKKEEKKKEEGEEKKK 169
           +EEE+EKKEE    EE++ EE+K+EE E+KKK
Sbjct: 444 EEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475



 Score = 29.9 bits (68), Expect = 0.55
 Identities = 12/28 (42%), Positives = 22/28 (78%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
               +KKEE EE++KE+K++E+ EE+++
Sbjct: 434 AFAGKKKEEEEEEEKEKKEEEKEEEEEE 461



 Score = 29.9 bits (68), Expect = 0.56
 Identities = 8/29 (27%), Positives = 24/29 (82%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           +E++++    ++++++EE++KE+ EE+K+
Sbjct: 428 KEKKKKAFAGKKKEEEEEEEKEKKEEEKE 456



 Score = 29.9 bits (68), Expect = 0.67
 Identities = 9/28 (32%), Positives = 19/28 (67%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            E+ EKK E E+K+K++K     +++++
Sbjct: 416 VEKAEKKREEEKKEKKKKAFAGKKKEEE 443



 Score = 29.5 bits (67), Expect = 0.74
 Identities = 14/28 (50%), Positives = 26/28 (92%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           EEEEE+++E++E++KEE+++E  EEK++
Sbjct: 441 EEEEEEEKEKKEEEKEEEEEEAEEEKEE 468



 Score = 29.5 bits (67), Expect = 0.77
 Identities = 14/28 (50%), Positives = 26/28 (92%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           EEEEE+KE++EE+K+EE+++ E E++++
Sbjct: 442 EEEEEEKEKKEEEKEEEEEEAEEEKEEE 469



 Score = 28.7 bits (65), Expect = 1.4
 Identities = 22/74 (29%), Positives = 32/74 (43%), Gaps = 17/74 (22%)

Query: 109 MSKGRLQYYQALMPYLKNQSED----CLYLNIYTPLQEEEEE---------KKEEREEKK 155
             K R +    L    K+  E       +L     L EEE E         KK ++  +K
Sbjct: 363 KRKVRREVLPFLSIIFKHNPELAARLAAFLE----LTEEEIEFLTGSKKATKKIKKIVEK 418

Query: 156 KEEKKKEEGEEKKK 169
            E+K++EE +EKKK
Sbjct: 419 AEKKREEEKKEKKK 432



 Score = 28.7 bits (65), Expect = 1.4
 Identities = 12/28 (42%), Positives = 23/28 (82%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
             +++++EE EEK+K+E++KEE EE+ +
Sbjct: 436 AGKKKEEEEEEEKEKKEEEKEEEEEEAE 463



 Score = 28.7 bits (65), Expect = 1.5
 Identities = 8/28 (28%), Positives = 23/28 (82%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           ++E++KK    +KK+EE+++E+ +++++
Sbjct: 427 KKEKKKKAFAGKKKEEEEEEEKEKKEEE 454



 Score = 28.7 bits (65), Expect = 1.7
 Identities = 13/28 (46%), Positives = 25/28 (89%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           +++EE++EE +EKK+EEK++EE E +++
Sbjct: 438 KKKEEEEEEEKEKKEEEKEEEEEEAEEE 465



 Score = 27.2 bits (61), Expect = 4.5
 Identities = 9/28 (32%), Positives = 22/28 (78%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           ++    K++ EE+++E++KKEE +E+++
Sbjct: 432 KKAFAGKKKEEEEEEEKEKKEEEKEEEE 459



 Score = 27.2 bits (61), Expect = 5.3
 Identities = 8/28 (28%), Positives = 23/28 (82%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           +++    +++EE+++EEK+K+E E++++
Sbjct: 431 KKKAFAGKKKEEEEEEEKEKKEEEKEEE 458


>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
          Length = 106

 Score = 33.8 bits (78), Expect = 0.011
 Identities = 11/28 (39%), Positives = 17/28 (60%)

Query: 139 PLQEEEEEKKEEREEKKKEEKKKEEGEE 166
                 EEKKEE EE++++E+ +EE   
Sbjct: 72  AAAAAAEEKKEEEEEEEEKEESEEEAAA 99



 Score = 33.0 bits (76), Expect = 0.021
 Identities = 10/28 (35%), Positives = 17/28 (60%)

Query: 139 PLQEEEEEKKEEREEKKKEEKKKEEGEE 166
                     EE++E+++EE++KEE EE
Sbjct: 68  AAAAAAAAAAEEKKEEEEEEEEKEESEE 95



 Score = 29.2 bits (66), Expect = 0.45
 Identities = 10/23 (43%), Positives = 18/23 (78%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEG 164
           EE++E++EE EEK++ E++   G
Sbjct: 78  EEKKEEEEEEEEKEESEEEAAAG 100



 Score = 26.8 bits (60), Expect = 2.5
 Identities = 8/20 (40%), Positives = 14/20 (70%)

Query: 149 EEREEKKKEEKKKEEGEEKK 168
               E+KKEE+++EE +E+ 
Sbjct: 74  AAAAEEKKEEEEEEEEKEES 93


>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
           fragment found in eukaryotic DNA topoisomerase (topo) IB
           proteins similar to the monomeric yeast and human topo I
           and heterodimeric topo I from Leishmania donvanni. Topo
           I enzymes are divided into:  topo type IA (bacterial)
           and type IB (eukaryotic). Topo I relaxes superhelical
           tension in duplex DNA by creating a single-strand nick,
           the broken strand can then rotate around the unbroken
           strand to remove DNA supercoils and, the nick is
           religated, liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit re-ligation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topo I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topos I play putative roles
           in organizing the kinetoplast DNA network unique to
           these parasites.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 34.5 bits (80), Expect = 0.013
 Identities = 14/40 (35%), Positives = 24/40 (60%)

Query: 129 EDCLYLNIYTPLQEEEEEKKEEREEKKKEEKKKEEGEEKK 168
             C +  IY   +EE+E+KK   +E+KK  K+++E  E+ 
Sbjct: 83  SKCDFTPIYQYFEEEKEKKKAMSKEEKKAIKEEKEKLEEP 122



 Score = 30.7 bits (70), Expect = 0.26
 Identities = 13/33 (39%), Positives = 23/33 (69%)

Query: 137 YTPLQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           +TP+ +  EE+KE+++   KEEKK  + E++K 
Sbjct: 87  FTPIYQYFEEEKEKKKAMSKEEKKAIKEEKEKL 119


>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
           binding fragment.  Topoisomerase I promotes the
           relaxation of DNA superhelical tension by introducing a
           transient single-stranded break in duplex DNA and are
           vital for the processes of replication, transcription,
           and recombination. This family may be more than one
           structural domain.
          Length = 215

 Score = 33.7 bits (78), Expect = 0.025
 Identities = 13/40 (32%), Positives = 24/40 (60%)

Query: 129 EDCLYLNIYTPLQEEEEEKKEEREEKKKEEKKKEEGEEKK 168
           E C +  IY   + E+E+KK   +E+KK  K++++  E+ 
Sbjct: 84  EKCDFTPIYEYFEAEKEKKKAMSKEEKKAIKEEKDKLEEP 123


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 33.8 bits (78), Expect = 0.029
 Identities = 13/28 (46%), Positives = 20/28 (71%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
             EEE++EE +EKK+E+KK+E   +  K
Sbjct: 275 AAEEERQEEAQEKKEEKKKEEREAKLAK 302



 Score = 33.4 bits (77), Expect = 0.038
 Identities = 14/33 (42%), Positives = 24/33 (72%), Gaps = 4/33 (12%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEG----EEKKK 169
           +E +EE +E++EEKKKEE++ +      EE++K
Sbjct: 278 EERQEEAQEKKEEKKKEEREAKLAKLSPEEQRK 310



 Score = 29.9 bits (68), Expect = 0.51
 Identities = 16/36 (44%), Positives = 23/36 (63%), Gaps = 6/36 (16%)

Query: 140 LQEEEEEKKEEREEKK------KEEKKKEEGEEKKK 169
            QE++EEKK+E  E K      +E++K EE E KK+
Sbjct: 284 AQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQ 319



 Score = 28.4 bits (64), Expect = 1.7
 Identities = 14/30 (46%), Positives = 23/30 (76%)

Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            +EEEEEK  +  E++++E+ +E+ EEKKK
Sbjct: 264 TREEEEEKILKAAEEERQEEAQEKKEEKKK 293



 Score = 28.4 bits (64), Expect = 2.0
 Identities = 13/37 (35%), Positives = 23/37 (62%), Gaps = 9/37 (24%)

Query: 141 QEEEEEKKEEREEK---------KKEEKKKEEGEEKK 168
           +++EE+KKEERE K         +K E+K+ + + +K
Sbjct: 286 EKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQARK 322



 Score = 28.4 bits (64), Expect = 2.1
 Identities = 18/35 (51%), Positives = 23/35 (65%), Gaps = 6/35 (17%)

Query: 141 QEEEE---EKKEEREEK---KKEEKKKEEGEEKKK 169
           +EEE+     +EER+E+   KKEEKKKEE E K  
Sbjct: 267 EEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLA 301


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
           approximately 300 residues, found in plants and
           vertebrates. They contain a highly conserved DDRGK
           motif.
          Length = 189

 Score = 33.1 bits (76), Expect = 0.034
 Identities = 16/30 (53%), Positives = 24/30 (80%)

Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           L+EE E+KKEE E K++EE+ ++E EE +K
Sbjct: 48  LEEEREKKKEEEERKEREEQARKEQEEYEK 77



 Score = 30.1 bits (68), Expect = 0.37
 Identities = 16/28 (57%), Positives = 24/28 (85%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           +EEEE +EERE+KK+EE++KE  E+ +K
Sbjct: 43  KEEEELEEEREKKKEEEERKEREEQARK 70



 Score = 28.9 bits (65), Expect = 0.86
 Identities = 14/30 (46%), Positives = 23/30 (76%)

Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            +EE ++ +E+RE ++KEE++ EE  EKKK
Sbjct: 27  EREERKKLEEKREGERKEEEELEEEREKKK 56



 Score = 27.4 bits (61), Expect = 2.8
 Identities = 14/29 (48%), Positives = 22/29 (75%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           ++ E E+KEE E +++ EKKKEE E K++
Sbjct: 36  EKREGERKEEEELEEEREKKKEEEERKER 64



 Score = 27.0 bits (60), Expect = 5.0
 Identities = 10/44 (22%), Positives = 24/44 (54%)

Query: 125 KNQSEDCLYLNIYTPLQEEEEEKKEEREEKKKEEKKKEEGEEKK 168
             +  +          +E E++K+EE  ++++E+ +KE+ E +K
Sbjct: 34  LEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEEYEK 77



 Score = 26.6 bits (59), Expect = 6.4
 Identities = 13/30 (43%), Positives = 24/30 (80%)

Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           L+E+ E +++E EE ++E +KK+E EE+K+
Sbjct: 34  LEEKREGERKEEEELEEEREKKKEEEERKE 63



 Score = 26.6 bits (59), Expect = 6.5
 Identities = 12/30 (40%), Positives = 24/30 (80%)

Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            +EEE E++++ EEK++ E+K+EE  E+++
Sbjct: 23  AEEEEREERKKLEEKREGERKEEEELEEER 52


>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
          Length = 158

 Score = 32.8 bits (75), Expect = 0.041
 Identities = 12/31 (38%), Positives = 22/31 (70%)

Query: 139 PLQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
             ++E++E  +E EEK+  + KKE+ E+K+K
Sbjct: 94  ESEKEQKEVSKETEEKEAIKAKKEKKEKKEK 124



 Score = 27.8 bits (62), Expect = 2.0
 Identities = 9/29 (31%), Positives = 21/29 (72%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           +E++E  KE  E++  + KK+++ +++KK
Sbjct: 97  KEQKEVSKETEEKEAIKAKKEKKEKKEKK 125



 Score = 27.4 bits (61), Expect = 2.9
 Identities = 10/29 (34%), Positives = 21/29 (72%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           +E EE++  + +++KKE+K+K+  E+  K
Sbjct: 104 KETEEKEAIKAKKEKKEKKEKKVAEKLAK 132



 Score = 26.3 bits (58), Expect = 6.2
 Identities = 10/27 (37%), Positives = 19/27 (70%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKK 168
            +E E+KE  + KK++++KKE+   +K
Sbjct: 103 SKETEEKEAIKAKKEKKEKKEKKVAEK 129



 Score = 25.9 bits (57), Expect = 8.0
 Identities = 11/30 (36%), Positives = 21/30 (70%)

Query: 138 TPLQEEEEEKKEEREEKKKEEKKKEEGEEK 167
             + +E EEK+  + +K+K+EKK+++  EK
Sbjct: 100 KEVSKETEEKEAIKAKKEKKEKKEKKVAEK 129


>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are approximately 90 amino acids in length.
           There are two completely conserved L residues that may
           be functionally important.
          Length = 70

 Score = 31.2 bits (71), Expect = 0.044
 Identities = 12/24 (50%), Positives = 20/24 (83%)

Query: 140 LQEEEEEKKEEREEKKKEEKKKEE 163
           L+ EEE+++EE E +K+EE+K+ E
Sbjct: 29  LKAEEEKREEEEEARKREERKERE 52



 Score = 30.0 bits (68), Expect = 0.14
 Identities = 13/26 (50%), Positives = 19/26 (73%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEK 167
           + EEEK+EE EE +K E++KE  + K
Sbjct: 30  KAEEEKREEEEEARKREERKEREKNK 55


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 33.1 bits (76), Expect = 0.050
 Identities = 14/29 (48%), Positives = 22/29 (75%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           +E  +E+K  R+ KK+EEKKK+E E+ +K
Sbjct: 560 KEALKEQKRLRKLKKQEEKKKKELEKLEK 588



 Score = 31.2 bits (71), Expect = 0.28
 Identities = 12/26 (46%), Positives = 20/26 (76%)

Query: 144 EEEKKEEREEKKKEEKKKEEGEEKKK 169
           E+E+KE  +E+K+  K K++ E+KKK
Sbjct: 556 EKEEKEALKEQKRLRKLKKQEEKKKK 581



 Score = 30.8 bits (70), Expect = 0.30
 Identities = 9/29 (31%), Positives = 20/29 (68%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            +++EE + E+EEK+  +++K   + KK+
Sbjct: 547 LDDKEELQREKEEKEALKEQKRLRKLKKQ 575



 Score = 30.8 bits (70), Expect = 0.32
 Identities = 13/31 (41%), Positives = 21/31 (67%), Gaps = 2/31 (6%)

Query: 139 PLQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            LQ E+EEK+    +++K  +K ++ EEKKK
Sbjct: 552 ELQREKEEKEA--LKEQKRLRKLKKQEEKKK 580



 Score = 29.2 bits (66), Expect = 1.2
 Identities = 8/27 (29%), Positives = 19/27 (70%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEK 167
           ++E  ++++   + KK+E+KK++  EK
Sbjct: 559 EKEALKEQKRLRKLKKQEEKKKKELEK 585



 Score = 27.7 bits (62), Expect = 4.0
 Identities = 9/24 (37%), Positives = 17/24 (70%)

Query: 140 LQEEEEEKKEEREEKKKEEKKKEE 163
            +++   K +++EEKKK+E +K E
Sbjct: 564 KEQKRLRKLKKQEEKKKKELEKLE 587


>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.  This
           domain is found to the N-terminus of bacterial signal
           peptidases of the S49 family (pfam01343).
          Length = 154

 Score = 32.5 bits (75), Expect = 0.053
 Identities = 11/29 (37%), Positives = 18/29 (62%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
             E+ EKK E+ + K E+KK ++ E K +
Sbjct: 70  AWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98



 Score = 29.0 bits (66), Expect = 0.86
 Identities = 11/27 (40%), Positives = 16/27 (59%), Gaps = 1/27 (3%)

Query: 143 EEEEKKEEREEKKKE-EKKKEEGEEKK 168
           ++E K  E+ EKK E  K K E ++ K
Sbjct: 65  KKELKAWEKAEKKAEKAKAKAEKKKAK 91



 Score = 28.6 bits (65), Expect = 0.90
 Identities = 10/26 (38%), Positives = 16/26 (61%)

Query: 144 EEEKKEEREEKKKEEKKKEEGEEKKK 169
           ++E K   + +KK EK K + E+KK 
Sbjct: 65  KKELKAWEKAEKKAEKAKAKAEKKKA 90


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 33.3 bits (76), Expect = 0.056
 Identities = 13/31 (41%), Positives = 19/31 (61%)

Query: 138 TPLQEEEEEKKEEREEKKKEEKKKEEGEEKK 168
               E++   +EE E KKK+E+K +E E KK
Sbjct: 5   ESEAEKKILTEEELERKKKKEEKAKEKELKK 35



 Score = 27.9 bits (62), Expect = 3.6
 Identities = 14/33 (42%), Positives = 23/33 (69%), Gaps = 2/33 (6%)

Query: 139 PLQEEEEEKKEEREEK--KKEEKKKEEGEEKKK 169
            L EEE E+K+++EEK  +KE KK +  +++ K
Sbjct: 12  ILTEEELERKKKKEEKAKEKELKKLKAAQKEAK 44


>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
          Length = 1036

 Score = 32.9 bits (75), Expect = 0.065
 Identities = 21/80 (26%), Positives = 34/80 (42%), Gaps = 16/80 (20%)

Query: 94  PQKLPNLDDDDFSKKMSKGRLQYYQALMPYLKNQSEDCLYLN----IYTPLQEEEEEKKE 149
           P    N D++           Q + A++P   N  E+  ++     IY  LQEE   + E
Sbjct: 400 PGNARNYDNNGR---------QDFHAIVP--NNIPEELYWVEEEHQIYRKLQEERRLR-E 447

Query: 150 EREEKKKEEKKKEEGEEKKK 169
           E    K E+  + + E K+K
Sbjct: 448 EAIRAKAEKTARMKAEMKEK 467


>gnl|CDD|221756 pfam12757, DUF3812, Protein of unknown function (DUF3812).  This is
           a family of fungal proteins whose function is not known.
          Length = 126

 Score = 31.9 bits (73), Expect = 0.072
 Identities = 13/29 (44%), Positives = 19/29 (65%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
             +EE+K +E E K++ E+ KE   EKKK
Sbjct: 98  ARDEEKKLDEEEAKRQHEEAKEREREKKK 126



 Score = 25.7 bits (57), Expect = 8.8
 Identities = 8/29 (27%), Positives = 18/29 (62%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           Q   +E+K+  EE+ K + ++ +  E++K
Sbjct: 96  QRARDEEKKLDEEEAKRQHEEAKEREREK 124


>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
           [Translation, ribosomal structure and biogenesis].
          Length = 109

 Score = 31.2 bits (71), Expect = 0.081
 Identities = 9/25 (36%), Positives = 14/25 (56%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEE 166
               E   E +E ++EEK++E  EE
Sbjct: 74  AAGAEAAAEADEAEEEEKEEEAEEE 98



 Score = 28.5 bits (64), Expect = 0.72
 Identities = 9/26 (34%), Positives = 15/26 (57%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEE 166
            E   E  E  EE+K+EE ++E  ++
Sbjct: 77  AEAAAEADEAEEEEKEEEAEEESDDD 102



 Score = 26.6 bits (59), Expect = 3.8
 Identities = 7/23 (30%), Positives = 15/23 (65%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEG 164
            E +E +EE +E++ EE+  ++ 
Sbjct: 81  AEADEAEEEEKEEEAEEESDDDM 103


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 32.1 bits (73), Expect = 0.12
 Identities = 11/35 (31%), Positives = 18/35 (51%)

Query: 135 NIYTPLQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            +      E+  K+ E+  K+ EEK+K+  E K K
Sbjct: 93  ELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAK 127



 Score = 29.8 bits (67), Expect = 0.55
 Identities = 9/29 (31%), Positives = 18/29 (62%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           Q+E E++    +  K+ E+  ++ EEK+K
Sbjct: 91  QKELEQRAAAEKAAKQAEQAAKQAEEKQK 119



 Score = 29.4 bits (66), Expect = 0.98
 Identities = 16/32 (50%), Positives = 17/32 (53%), Gaps = 3/32 (9%)

Query: 141 QEEEEEKKEEREEKKK---EEKKKEEGEEKKK 169
           Q EEE K +   E KK   E KKK E E K K
Sbjct: 151 QAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAK 182



 Score = 28.3 bits (63), Expect = 2.2
 Identities = 8/29 (27%), Positives = 16/29 (55%)

Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKK 168
            +E E+    E+  K+ E+  K+  E++K
Sbjct: 91  QKELEQRAAAEKAAKQAEQAAKQAEEKQK 119



 Score = 28.3 bits (63), Expect = 2.4
 Identities = 13/32 (40%), Positives = 19/32 (59%), Gaps = 3/32 (9%)

Query: 141 QEEEEEKKEEREEKKK---EEKKKEEGEEKKK 169
           ++ EE++K+  E K K   E K K E E +KK
Sbjct: 112 KQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKK 143



 Score = 27.9 bits (62), Expect = 2.5
 Identities = 10/29 (34%), Positives = 21/29 (72%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           Q+++   K+E+E +KK E++ EE E+++ 
Sbjct: 57  QQKKPAAKKEQERQKKLEQQAEEAEKQRA 85



 Score = 27.9 bits (62), Expect = 2.8
 Identities = 7/29 (24%), Positives = 17/29 (58%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           ++ E+  K+  E++K+ E+ K +   + K
Sbjct: 105 KQAEQAAKQAEEKQKQAEEAKAKQAAEAK 133



 Score = 27.5 bits (61), Expect = 3.6
 Identities = 13/32 (40%), Positives = 16/32 (50%), Gaps = 3/32 (9%)

Query: 141 QEEEEEKKEEREEKKK---EEKKKEEGEEKKK 169
           +   E KK+  E KKK   E K K E + K K
Sbjct: 159 KAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAK 190



 Score = 27.5 bits (61), Expect = 4.3
 Identities = 10/29 (34%), Positives = 14/29 (48%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            E E + K E + K K E+ K + E  K 
Sbjct: 175 AEAEAKAKAEAKAKAKAEEAKAKAEAAKA 203



 Score = 27.1 bits (60), Expect = 4.7
 Identities = 14/31 (45%), Positives = 17/31 (54%), Gaps = 2/31 (6%)

Query: 141 QEEEEEKKEEREEKKK--EEKKKEEGEEKKK 169
            E E EKK + E KK+  EE K +   E KK
Sbjct: 136 AEAEAEKKAKEEAKKQAEEEAKAKAAAEAKK 166



 Score = 27.1 bits (60), Expect = 4.9
 Identities = 5/31 (16%), Positives = 19/31 (61%)

Query: 139 PLQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
             ++ E++ +E  +++  E+ +++E E++  
Sbjct: 69  RQKKLEQQAEEAEKQRAAEQARQKELEQRAA 99



 Score = 27.1 bits (60), Expect = 5.6
 Identities = 7/29 (24%), Positives = 17/29 (58%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           +  E+ +++E E++   EK  ++ E+  K
Sbjct: 84  RAAEQARQKELEQRAAAEKAAKQAEQAAK 112



 Score = 27.1 bits (60), Expect = 5.7
 Identities = 14/31 (45%), Positives = 17/31 (54%), Gaps = 2/31 (6%)

Query: 141 QEEEEEKKEE--REEKKKEEKKKEEGEEKKK 169
           Q  E + K E   E+K KEE KK+  EE K 
Sbjct: 128 QAAEAKAKAEAEAEKKAKEEAKKQAEEEAKA 158


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 32.0 bits (73), Expect = 0.14
 Identities = 19/37 (51%), Positives = 26/37 (70%), Gaps = 6/37 (16%)

Query: 139 PLQEEEEEKKEEREEKKKEEKK------KEEGEEKKK 169
           P +EEEEEK+E++EE++K   K      +EE EEKKK
Sbjct: 36  PDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKK 72



 Score = 31.3 bits (71), Expect = 0.25
 Identities = 14/28 (50%), Positives = 22/28 (78%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           +EEEEK  ++EE+  EE++KEE ++K K
Sbjct: 48  KEEEEKTTDKEEEVDEEEEKEEKKKKTK 75



 Score = 30.5 bits (69), Expect = 0.41
 Identities = 13/29 (44%), Positives = 20/29 (68%)

Query: 138 TPLQEEEEEKKEEREEKKKEEKKKEEGEE 166
           T  +EE +E++E+ E+KKK +K KE   E
Sbjct: 55  TDKEEEVDEEEEKEEKKKKTKKVKETTTE 83



 Score = 27.8 bits (62), Expect = 3.2
 Identities = 12/29 (41%), Positives = 22/29 (75%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           +EEEE+  ++ EE  +EE+K+E+ ++ KK
Sbjct: 48  KEEEEKTTDKEEEVDEEEEKEEKKKKTKK 76



 Score = 27.8 bits (62), Expect = 3.6
 Identities = 14/29 (48%), Positives = 21/29 (72%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           +E E+E  +E EE++KEEKK+EE +   K
Sbjct: 29  KEVEKEVPDEEEEEEKEEKKEEEEKTTDK 57



 Score = 27.0 bits (60), Expect = 6.0
 Identities = 11/28 (39%), Positives = 19/28 (67%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           +EEEEK+E++++ KK ++   E E   K
Sbjct: 62  DEEEEKEEKKKKTKKVKETTTEWELLNK 89


>gnl|CDD|239571 cd03489, Topoisomer_IB_N_LdtopoI_like,
           Topoisomer_IB_N_LdtopoI_like: N-terminal DNA binding
           fragment found in eukaryotic DNA topoisomerase (topo) IB
           proteins similar to the heterodimeric topo I from
           Leishmania donvanni. Topo I enzymes are divided into:
           topo type IA (bacterial) and type IB (eukaryotic). Topo
           I relaxes superhelical tension in duplex DNA by creating
           a single-strand nick, the broken strand can then rotate
           around the unbroken strand to remove DNA supercoils and,
           the nick is religated, liberating topo I. These enzymes
           regulate the topological changes that accompany DNA
           replication, transcription and other nuclear processes. 
           Human topo I is the target of a diverse set of
           anticancer drugs including camptothecins (CPTs). CPTs
           bind to the topo I-DNA complex and inhibit re-ligation
           of the single-strand nick, resulting in the accumulation
           of topo I-DNA adducts. In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topo I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topo I play putative roles in
           organizing the kinetoplast DNA network unique to these
           parasites.  This family may represent more than one
           structural domain.
          Length = 212

 Score = 31.4 bits (71), Expect = 0.14
 Identities = 13/40 (32%), Positives = 23/40 (57%)

Query: 129 EDCLYLNIYTPLQEEEEEKKEEREEKKKEEKKKEEGEEKK 168
           E C +  IY     E+E+KK   +E+KK  K++++ E + 
Sbjct: 80  ELCDFTPIYEWHLREKEKKKSRTKEEKKALKEEKDKEAEP 119



 Score = 27.1 bits (60), Expect = 4.7
 Identities = 16/44 (36%), Positives = 25/44 (56%), Gaps = 5/44 (11%)

Query: 131 CLYLNIYTPLQE---EEEEKKEER--EEKKKEEKKKEEGEEKKK 169
            L L  +TP+ E    E+EKK+ R  EEKK  +++K++  E   
Sbjct: 78  KLELCDFTPIYEWHLREKEKKKSRTKEEKKALKEEKDKEAEPYM 121


>gnl|CDD|240577 cd12950, RRP7_Rrp7p, RRP7 domain ribosomal RNA-processing protein 7
           (Rrp7p) and similar proteins.  This CD corresponds to
           the RRP7 domain of Rrp7p. Rrp7p is encoded by YCL031C
           gene from Saccharomyces cerevisiae. It is an essential
           yeast protein involved in pre-rRNA processing and
           ribosome assembly. Rrp7p contains an N-terminal RNA
           recognition motif (RRM), also termed RBD (RNA binding
           domain) or RNP (ribonucleoprotein domain), and a
           C-terminal RRP7 domain.
          Length = 128

 Score = 30.7 bits (70), Expect = 0.15
 Identities = 17/43 (39%), Positives = 22/43 (51%), Gaps = 10/43 (23%)

Query: 137 YTPLQEEEEEKKEEREEKKKEEKKKEEGE----------EKKK 169
             P   EE  K  E E+K+KE+KKK++ E          EKKK
Sbjct: 62  KGPAAGEEAGKAAEEEKKEKEKKKKKKKELEDFYRFQLREKKK 104


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 31.2 bits (71), Expect = 0.16
 Identities = 15/30 (50%), Positives = 22/30 (73%)

Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           L+E E+ KKE  EEK++EE++K   EEK +
Sbjct: 148 LRELEKIKKERAEEKEREEEEKAAEEEKAR 177



 Score = 30.8 bits (70), Expect = 0.26
 Identities = 13/32 (40%), Positives = 21/32 (65%), Gaps = 2/32 (6%)

Query: 140 LQEEEEEKKEEREEKK--KEEKKKEEGEEKKK 169
           L  E E+ K+ER E+K  +EE+K  E E+ ++
Sbjct: 147 LLRELEKIKKERAEEKEREEEEKAAEEEKARE 178



 Score = 30.1 bits (68), Expect = 0.45
 Identities = 10/21 (47%), Positives = 16/21 (76%)

Query: 143 EEEEKKEEREEKKKEEKKKEE 163
            EE+++EE E+  +EEK +EE
Sbjct: 159 AEEKEREEEEKAAEEEKAREE 179


>gnl|CDD|206063 pfam13892, DBINO, DNA-binding domain.  DBINO is a DNA-binding
           domain found on global transcription activator SNF2L1
           proteins and chromatin re-modelling proteins.
          Length = 140

 Score = 30.7 bits (70), Expect = 0.19
 Identities = 12/23 (52%), Positives = 16/23 (69%), Gaps = 2/23 (8%)

Query: 147 KKEEREEKKKEEKKKEEGEEKKK 169
           +KEERE +K+ E  KE  E+ KK
Sbjct: 82  EKEERELRKRAE--KEALEQAKK 102



 Score = 26.8 bits (60), Expect = 3.3
 Identities = 11/32 (34%), Positives = 19/32 (59%), Gaps = 3/32 (9%)

Query: 141 QEEEEEKKEEREE---KKKEEKKKEEGEEKKK 169
           +E E  K+ E+E     KKEE+ +E   +++K
Sbjct: 84  EERELRKRAEKEALEQAKKEEELREAKRQQRK 115


>gnl|CDD|220163 pfam09284, RhgB_N, Rhamnogalacturonase B, N-terminal.  Members of
           this family are found in prokaryotic Rhamnogalacturonase
           B, and adopt a structure consisting of a beta
           supersandwich, with eighteen strands in two beta-sheets.
           The exact function of the domain is unknown, but a
           putative role includes carbohydrate-binding.
          Length = 249

 Score = 30.9 bits (70), Expect = 0.24
 Identities = 12/29 (41%), Positives = 18/29 (62%), Gaps = 2/29 (6%)

Query: 45  VHNGNLNNIHMYLGIPYAAPPVGQLRFMA 73
             +G+ N I+M   I  A P +G+LRF+A
Sbjct: 81  ARSGD-NTIYMATYIT-AEPSIGELRFIA 107


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This
           family consists of several bovine specific leukaemia
           virus receptors which are thought to function as
           transmembrane proteins, although their exact function is
           unknown.
          Length = 561

 Score = 31.2 bits (70), Expect = 0.25
 Identities = 13/27 (48%), Positives = 18/27 (66%)

Query: 143 EEEEKKEEREEKKKEEKKKEEGEEKKK 169
           EEE +  +R EK K EKKK E E++ +
Sbjct: 85  EEERRHRQRLEKDKREKKKREKEKRGR 111



 Score = 27.0 bits (59), Expect = 6.3
 Identities = 15/28 (53%), Positives = 21/28 (75%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           E EE KK  + +KKK+ K+KEE ++KKK
Sbjct: 270 EAEETKKSPKHKKKKQRKEKEEKKKKKK 297


>gnl|CDD|217011 pfam02388, FemAB, FemAB family.  The femAB operon codes for two
           nearly identical approximately 50-kDa proteins involved
           in the formation of the Staphylococcal pentaglycine
           interpeptide bridge in peptidoglycan. These proteins are
           also considered as a factor influencing the level of
           methicillin resistance.
          Length = 408

 Score = 31.1 bits (71), Expect = 0.27
 Identities = 18/57 (31%), Positives = 31/57 (54%), Gaps = 1/57 (1%)

Query: 114 LQYYQALMPYLKNQSEDCL-YLNIYTPLQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           L YY+ L+    +++E  L YLN    L E +E+ ++  ++  K E+K E+    KK
Sbjct: 217 LDYYEKLLDTYGDKAEFMLAYLNFKEYLDELQEKLEKLEKDLAKLEEKLEKNPNSKK 273


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 31.1 bits (71), Expect = 0.27
 Identities = 11/25 (44%), Positives = 14/25 (56%)

Query: 145 EEKKEEREEKKKEEKKKEEGEEKKK 169
            EKK E+EE +K   KK+     KK
Sbjct: 421 AEKKAEKEEAEKAAAKKKAEAAAKK 445



 Score = 29.9 bits (68), Expect = 0.62
 Identities = 10/25 (40%), Positives = 18/25 (72%)

Query: 144 EEEKKEEREEKKKEEKKKEEGEEKK 168
             E+K+ R++++K EKK E+ E +K
Sbjct: 408 PAERKKLRKKQRKAEKKAEKEEAEK 432



 Score = 29.1 bits (66), Expect = 1.2
 Identities = 12/29 (41%), Positives = 16/29 (55%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           Q + E+K E+ E +K   KKK E   KK 
Sbjct: 418 QRKAEKKAEKEEAEKAAAKKKAEAAAKKA 446



 Score = 27.2 bits (61), Expect = 4.5
 Identities = 9/27 (33%), Positives = 15/27 (55%)

Query: 143 EEEEKKEEREEKKKEEKKKEEGEEKKK 169
            E++ ++E  EK   +KK E   +K K
Sbjct: 421 AEKKAEKEEAEKAAAKKKAEAAAKKAK 447



 Score = 27.2 bits (61), Expect = 4.8
 Identities = 9/27 (33%), Positives = 16/27 (59%)

Query: 143 EEEEKKEEREEKKKEEKKKEEGEEKKK 169
             E KK  ++++K E+K ++E  EK  
Sbjct: 408 PAERKKLRKKQRKAEKKAEKEEAEKAA 434



 Score = 26.8 bits (60), Expect = 6.0
 Identities = 7/30 (23%), Positives = 16/30 (53%)

Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            ++EE EK   +++ +   KK +  + + K
Sbjct: 425 AEKEEAEKAAAKKKAEAAAKKAKGPDGETK 454


>gnl|CDD|239570 cd03488, Topoisomer_IB_N_htopoI_like, Topoisomer_IB_N_htopoI_like :
           N-terminal DNA binding fragment found in eukaryotic DNA
           topoisomerase (topo) IB proteins similar to the
           monomeric yeast and human topo I.  Topo I enzymes are
           divided into:  topo type IA (bacterial) and type IB
           (eukaryotic). Topo I relaxes superhelical tension in
           duplex DNA by creating a single-strand nick, the broken
           strand can then rotate around the unbroken strand to
           remove DNA supercoils and, the nick is religated,
           liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit religation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 30.8 bits (70), Expect = 0.27
 Identities = 11/32 (34%), Positives = 20/32 (62%)

Query: 137 YTPLQEEEEEKKEEREEKKKEEKKKEEGEEKK 168
           +T +    + +KEE++   KEEKK  + E++K
Sbjct: 87  FTQMFAYFKAQKEEKKAMSKEEKKAIKAEKEK 118



 Score = 30.8 bits (70), Expect = 0.28
 Identities = 11/40 (27%), Positives = 23/40 (57%)

Query: 129 EDCLYLNIYTPLQEEEEEKKEEREEKKKEEKKKEEGEEKK 168
             C +  ++   + ++EEKK   +E+KK  K ++E  E++
Sbjct: 83  SKCDFTQMFAYFKAQKEEKKAMSKEEKKAIKAEKEKLEEE 122


>gnl|CDD|219978 pfam08701, GN3L_Grn1, GNL3L/Grn1 putative GTPase.  Grn1 (yeast) and
           GNL3L (human) are putative GTPases which are required
           for growth and play a role in processing of nucleolar
           pre-rRNA. This family contains a potential nuclear
           localisation signal.
          Length = 80

 Score = 29.1 bits (66), Expect = 0.33
 Identities = 12/28 (42%), Positives = 21/28 (75%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           EE++ K+EE +E++KE +K E  E +K+
Sbjct: 52  EEKKRKQEEEKERRKEARKAERAEARKR 79



 Score = 25.7 bits (57), Expect = 5.2
 Identities = 7/25 (28%), Positives = 15/25 (60%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGE 165
           +++EEEK+  +E +K E  +  +  
Sbjct: 56  RKQEEEKERRKEARKAERAEARKRG 80


>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
          Length = 330

 Score = 30.6 bits (70), Expect = 0.34
 Identities = 10/24 (41%), Positives = 16/24 (66%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEG 164
               EE++EE EE+++EE  +EE 
Sbjct: 298 AAAAEEEEEEEEEEEEEEPSEEEA 321



 Score = 29.8 bits (68), Expect = 0.61
 Identities = 9/26 (34%), Positives = 17/26 (65%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEE 166
           Q +    +EE EE+++EE+++   EE
Sbjct: 295 QAQAAAAEEEEEEEEEEEEEEPSEEE 320



 Score = 29.1 bits (66), Expect = 1.0
 Identities = 9/27 (33%), Positives = 17/27 (62%)

Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEE 166
           L  + +    E EE+++EE+++EE  E
Sbjct: 292 LSAQAQAAAAEEEEEEEEEEEEEEPSE 318



 Score = 28.7 bits (65), Expect = 1.5
 Identities = 8/28 (28%), Positives = 17/28 (60%)

Query: 138 TPLQEEEEEKKEEREEKKKEEKKKEEGE 165
           +   +    ++EE EE+++EE++  E E
Sbjct: 293 SAQAQAAAAEEEEEEEEEEEEEEPSEEE 320



 Score = 27.1 bits (61), Expect = 4.2
 Identities = 8/30 (26%), Positives = 18/30 (60%)

Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           L+E    + +    +++EE+++EE EE+  
Sbjct: 288 LKEVLSAQAQAAAAEEEEEEEEEEEEEEPS 317



 Score = 26.8 bits (60), Expect = 7.3
 Identities = 9/30 (30%), Positives = 18/30 (60%)

Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           L EE +E    + +    E+++EE EE+++
Sbjct: 284 LDEELKEVLSAQAQAAAAEEEEEEEEEEEE 313


>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3.  This family
           of proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). Shr3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of Shr3, AAPs are retained in the ER.
          Length = 196

 Score = 30.4 bits (69), Expect = 0.36
 Identities = 9/27 (33%), Positives = 13/27 (48%)

Query: 143 EEEEKKEEREEKKKEEKKKEEGEEKKK 169
             EE+KE      K+    ++ E KKK
Sbjct: 169 AAEERKEALAAAAKKSATPQKVETKKK 195



 Score = 27.0 bits (60), Expect = 4.6
 Identities = 9/28 (32%), Positives = 14/28 (50%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKK 168
           +++EE   EER+E      KK    +K 
Sbjct: 163 KQKEEFAAEERKEALAAAAKKSATPQKV 190


>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
          Length = 330

 Score = 30.2 bits (69), Expect = 0.43
 Identities = 17/64 (26%), Positives = 28/64 (43%), Gaps = 7/64 (10%)

Query: 107 KKMSKGRLQYYQALMPYLKNQSEDCLYLNIYTPLQEEEEEKKEEREEKKKE-EKKKEEGE 165
           KK  KG L+    L    K   E+     +   L +++E K   + +KKKE ++ K    
Sbjct: 33  KKSQKGELEV-TNLNEQYKEMKEE-----LKAALLDKKELKAWHKAQKKKEKQEAKAAKA 86

Query: 166 EKKK 169
           + K 
Sbjct: 87  KSKP 90


>gnl|CDD|111875 pfam03032, Brevenin, Brevenin/esculentin/gaegurin/rugosin family.
           This family contains a number of defence peptides
           secreted from the skin of amphibians, including the
           opiate-like dermorphins and deltorphins, and the
           antimicrobial dermoseptins and temporins. The alignment
           for this family includes the signal peptide.
          Length = 46

 Score = 27.7 bits (62), Expect = 0.45
 Identities = 11/25 (44%), Positives = 18/25 (72%)

Query: 145 EEKKEEREEKKKEEKKKEEGEEKKK 169
           EE+K E EE+ ++E++ EE  E K+
Sbjct: 22  EEEKREDEEENEDEEEGEEQSEVKR 46



 Score = 27.3 bits (61), Expect = 0.72
 Identities = 12/25 (48%), Positives = 19/25 (76%)

Query: 144 EEEKKEEREEKKKEEKKKEEGEEKK 168
           EEEK+E+ EE + EE+ +E+ E K+
Sbjct: 22  EEEKREDEEENEDEEEGEEQSEVKR 46


>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
           complex aNOP56 subunit; Provisional.
          Length = 414

 Score = 30.3 bits (69), Expect = 0.48
 Identities = 11/31 (35%), Positives = 24/31 (77%)

Query: 139 PLQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           P ++ EE+K ++R++KKK +KK ++ ++K +
Sbjct: 383 PKKKREEKKPQKRKKKKKRKKKGKKRKKKGR 413



 Score = 28.0 bits (63), Expect = 2.3
 Identities = 8/28 (28%), Positives = 22/28 (78%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKK 168
           +EE++ +K ++++K+K++ KK + + +K
Sbjct: 387 REEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 27.6 bits (62), Expect = 3.9
 Identities = 13/31 (41%), Positives = 22/31 (70%)

Query: 139 PLQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           P +++ EEKK ++ +KKK+ KKK +  +KK 
Sbjct: 382 PPKKKREEKKPQKRKKKKKRKKKGKKRKKKG 412


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 29.6 bits (67), Expect = 0.49
 Identities = 8/32 (25%), Positives = 17/32 (53%)

Query: 138 TPLQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
               E EE  +EE+ ++  E++  +   EK++
Sbjct: 106 EEEVEVEELDEEEQIDELLEKELAKLKREKRR 137



 Score = 29.2 bits (66), Expect = 0.59
 Identities = 6/28 (21%), Positives = 20/28 (71%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKK 168
           ++ +E  ++E  + K+E++++ E ++K+
Sbjct: 118 EQIDELLEKELAKLKREKRRENERKQKE 145


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 29.7 bits (67), Expect = 0.50
 Identities = 13/26 (50%), Positives = 18/26 (69%)

Query: 144 EEEKKEEREEKKKEEKKKEEGEEKKK 169
           +E KKE++E+K K+EK  E    KKK
Sbjct: 161 KEVKKEKKEKKDKKEKMVEPKGSKKK 186



 Score = 29.3 bits (66), Expect = 0.74
 Identities = 15/30 (50%), Positives = 24/30 (80%)

Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            +E E E++E++E+KKK+E KKE+ E+K K
Sbjct: 144 EKEAEVEEEEKKEKKKKKEVKKEKKEKKDK 173



 Score = 28.5 bits (64), Expect = 1.4
 Identities = 13/32 (40%), Positives = 25/32 (78%)

Query: 138 TPLQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
             +++E E ++EE++EKKK+++ K+E +EKK 
Sbjct: 141 AKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKD 172



 Score = 27.4 bits (61), Expect = 3.4
 Identities = 13/28 (46%), Positives = 25/28 (89%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           E EEE+K+E+++KK+ +K+K+E ++KK+
Sbjct: 148 EVEEEEKKEKKKKKEVKKEKKEKKDKKE 175



 Score = 26.6 bits (59), Expect = 5.7
 Identities = 13/32 (40%), Positives = 24/32 (75%)

Query: 138 TPLQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           T  + E+E + EE E+K+K++KK+ + E+K+K
Sbjct: 139 TTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEK 170


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 30.3 bits (68), Expect = 0.53
 Identities = 15/40 (37%), Positives = 21/40 (52%)

Query: 129 EDCLYLNIYTPLQEEEEEKKEEREEKKKEEKKKEEGEEKK 168
           ED L   I    + E++E  EE EE+K EE+     + KK
Sbjct: 301 EDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKK 340



 Score = 27.6 bits (61), Expect = 3.2
 Identities = 11/36 (30%), Positives = 17/36 (47%), Gaps = 5/36 (13%)

Query: 139 PLQEEEEEKK-----EEREEKKKEEKKKEEGEEKKK 169
            L  E   K      E+ EE ++E+ ++E G  KK 
Sbjct: 303 KLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKG 338



 Score = 26.5 bits (58), Expect = 8.0
 Identities = 11/28 (39%), Positives = 16/28 (57%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           EE EE+K E E    ++ KK +  + KK
Sbjct: 321 EESEEEKNEEEGGLSKKGKKLKKLKGKK 348



 Score = 26.5 bits (58), Expect = 8.4
 Identities = 9/28 (32%), Positives = 15/28 (53%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           EE E+K       K E ++ E+ EE ++
Sbjct: 298 EEREDKLSPEIPAKPEIEQDEDSEESEE 325


>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon. 
          Length = 431

 Score = 30.0 bits (67), Expect = 0.58
 Identities = 11/29 (37%), Positives = 18/29 (62%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           + E++    + EE +KEEK+ E  EE+K 
Sbjct: 134 KSEQKNDWRDAEECQKEEKEPEPEEEEKP 162


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 29.6 bits (67), Expect = 0.59
 Identities = 10/28 (35%), Positives = 17/28 (60%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           +EEE++++E E+ K   K K +   K K
Sbjct: 38  DEEEDEEKEEEKAKVAAKAKAKKALKAK 65



 Score = 27.7 bits (62), Expect = 2.7
 Identities = 10/28 (35%), Positives = 18/28 (64%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           EE+EEK+EE+ +   + K K+  + K +
Sbjct: 40  EEDEEKEEEKAKVAAKAKAKKALKAKIE 67


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 29.8 bits (67), Expect = 0.62
 Identities = 9/30 (30%), Positives = 16/30 (53%)

Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           L  +E++K+ E   K+   K+K+  E   K
Sbjct: 111 LAAQEQKKQAEEAAKQAALKQKQAEEAAAK 140



 Score = 29.4 bits (66), Expect = 0.98
 Identities = 9/30 (30%), Positives = 20/30 (66%)

Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           L++ E+E+   +E+KK+ E+  ++   K+K
Sbjct: 103 LKQLEKERLAAQEQKKQAEEAAKQAALKQK 132



 Score = 27.5 bits (61), Expect = 3.7
 Identities = 8/29 (27%), Positives = 13/29 (44%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           Q EE  K+   ++K+ EE   +     K 
Sbjct: 119 QAEEAAKQAALKQKQAEEAAAKAAAAAKA 147



 Score = 27.5 bits (61), Expect = 4.2
 Identities = 9/29 (31%), Positives = 20/29 (68%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           +E ++++  E+E  K+ EK++   +E+KK
Sbjct: 90  EELQQKQAAEQERLKQLEKERLAAQEQKK 118



 Score = 27.1 bits (60), Expect = 5.6
 Identities = 12/33 (36%), Positives = 14/33 (42%), Gaps = 4/33 (12%)

Query: 141 QEEEEEKKEEREEKKK----EEKKKEEGEEKKK 169
           +   E KK+   E  K    E KKK E E   K
Sbjct: 162 KAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAK 194



 Score = 26.7 bits (59), Expect = 7.1
 Identities = 15/32 (46%), Positives = 16/32 (50%), Gaps = 3/32 (9%)

Query: 141 QEEEEEKKEEREEKKK---EEKKKEEGEEKKK 169
              E +KK E E   K   E KKK E E KKK
Sbjct: 179 AAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKK 210


>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
           potential nuclease [General function prediction only].
          Length = 290

 Score = 29.5 bits (66), Expect = 0.68
 Identities = 12/38 (31%), Positives = 23/38 (60%)

Query: 132 LYLNIYTPLQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            Y  +   L+E ++EK+E  +E ++ E + EE +E+ K
Sbjct: 136 DYEELKEKLEELQKEKEELLKELEELEAEYEEVQERLK 173


>gnl|CDD|150531 pfam09871, DUF2098, Uncharacterized protein conserved in archaea
           (DUF2098).  This domain, found in various hypothetical
           prokaryotic proteins, has no known function.
          Length = 91

 Score = 28.4 bits (64), Expect = 0.75
 Identities = 16/41 (39%), Positives = 25/41 (60%)

Query: 128 SEDCLYLNIYTPLQEEEEEKKEEREEKKKEEKKKEEGEEKK 168
           S D  Y   Y  + ++ +EKKEEREE K+E  ++ + EE+ 
Sbjct: 36  STDLYYRTDYLEVTDKVKEKKEEREEDKEELIERIKKEEET 76


>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1).  This family
           consists of several rhoptry-associated protein 1 (RAP-1)
           sequences which appear to be specific to Plasmodium
           falciparum.
          Length = 790

 Score = 29.7 bits (66), Expect = 0.76
 Identities = 23/93 (24%), Positives = 45/93 (48%), Gaps = 11/93 (11%)

Query: 82  GLMIADSFSPVCPQKLPNLDDDDFSKKMSKGRLQYYQALMPYLKNQSEDCLYLNIYTPLQ 141
           G++ AD  +P  P+      ++ +   ++   L++     P  K + E  +  N    + 
Sbjct: 187 GIVGADEEAPPAPKNTLTPLEELYETNVNLFALKH-----PLEKLEEEIDILKNDGDKVA 241

Query: 142 EEEEEKKEEREEKKKEEKKKE------EGEEKK 168
           EEEE + +E  E+ +E+KK+       EG+E+K
Sbjct: 242 EEEEFELDEEHEEAEEDKKEALEKIGAEGDEEK 274


>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein.  This
           family includes archaebacterial L12, eukaryotic P0, P1
           and P2.
          Length = 88

 Score = 28.0 bits (63), Expect = 0.80
 Identities = 8/21 (38%), Positives = 14/21 (66%)

Query: 146 EKKEEREEKKKEEKKKEEGEE 166
                 EE+KKEE+++EE ++
Sbjct: 61  AAAAAAEEEKKEEEEEEEEDD 81



 Score = 25.3 bits (56), Expect = 8.0
 Identities = 9/21 (42%), Positives = 13/21 (61%), Gaps = 2/21 (9%)

Query: 143 EEEEKKEEREEKKKEEKKKEE 163
                 EE  EKK+EE+++EE
Sbjct: 61  AAAAAAEE--EKKEEEEEEEE 79


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 29.3 bits (66), Expect = 0.83
 Identities = 9/30 (30%), Positives = 21/30 (70%)

Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
             ++++ KK++ ++K K++ KK++  EKK 
Sbjct: 85  KWKKKKSKKKKDKDKDKKDDKKDDKSEKKD 114



 Score = 28.9 bits (65), Expect = 0.98
 Identities = 12/30 (40%), Positives = 24/30 (80%)

Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           +++E EEK++ + +KKK +KKK++ ++KK 
Sbjct: 74  VKKEYEEKQKWKWKKKKSKKKKDKDKDKKD 103



 Score = 28.1 bits (63), Expect = 1.5
 Identities = 9/30 (30%), Positives = 23/30 (76%)

Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            +E+++ K ++++ KKK++K K++ ++KK 
Sbjct: 78  YEEKQKWKWKKKKSKKKKDKDKDKKDDKKD 107



 Score = 27.7 bits (62), Expect = 2.5
 Identities = 8/28 (28%), Positives = 23/28 (82%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           +++++KK+++++ K E+K ++E E+K +
Sbjct: 96  DKDKDKKDDKKDDKSEKKDEKEAEDKLE 123



 Score = 27.3 bits (61), Expect = 2.8
 Identities = 4/29 (13%), Positives = 21/29 (72%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            +++++K +++++K +++ +KE  ++ + 
Sbjct: 96  DKDKDKKDDKKDDKSEKKDEKEAEDKLED 124



 Score = 26.6 bits (59), Expect = 5.1
 Identities = 8/28 (28%), Positives = 22/28 (78%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           +++++KK+++ EKK E++ +++ E+  K
Sbjct: 100 DKKDDKKDDKSEKKDEKEAEDKLEDLTK 127



 Score = 26.2 bits (58), Expect = 7.4
 Identities = 8/28 (28%), Positives = 23/28 (82%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           +++ +KK+++++ KK++KK ++ E+K +
Sbjct: 88  KKKSKKKKDKDKDKKDDKKDDKSEKKDE 115



 Score = 26.2 bits (58), Expect = 8.2
 Identities = 12/34 (35%), Positives = 21/34 (61%)

Query: 136 IYTPLQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           IY     E ++KK+E  E+ ++ KK+ E ++K K
Sbjct: 52  IYDAEYTEAKKKKKELAEEIEKVKKEYEEKQKWK 85



 Score = 26.2 bits (58), Expect = 8.3
 Identities = 14/31 (45%), Positives = 22/31 (70%)

Query: 139 PLQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            L EE E+ K+E EEK+K + KK++ ++KK 
Sbjct: 66  ELAEEIEKVKKEYEEKQKWKWKKKKSKKKKD 96


>gnl|CDD|239572 cd03490, Topoisomer_IB_N_1, Topoisomer_IB_N_1: A subgroup of the
           N-terminal DNA binding fragment found in eukaryotic DNA
           topoisomerase (topo) IB. Topo IB proteins include the
           monomeric yeast and human topo I and heterodimeric topo
           I from Leishmania donvanni. Topo I enzymes are divided
           into:  topo type IA (bacterial) and type IB
           (eukaryotic). Topo I relaxes superhelical tension in
           duplex DNA by creating a single-strand nick, the broken
           strand can then rotate around the unbroken strand to
           remove DNA supercoils and, the nick is religated,
           liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit religation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topos I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topos I have putative roles
           in organizing the kinetoplast DNA network unique to
           these parasites.  This family may represent more than
           one structural domain.
          Length = 217

 Score = 29.1 bits (65), Expect = 0.84
 Identities = 17/38 (44%), Positives = 23/38 (60%), Gaps = 4/38 (10%)

Query: 129 EDCLYLNIYTPLQEEEEEK----KEEREEKKKEEKKKE 162
            D  +  I   L+EE+E+K    KEE+E KKKE  K+E
Sbjct: 82  SDADFSLIKNHLEEEKEKKKNLNKEEKEAKKKERAKRE 119



 Score = 26.8 bits (59), Expect = 5.3
 Identities = 10/33 (30%), Positives = 22/33 (66%)

Query: 137 YTPLQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           ++ ++   EE+KE+++   KEEK+ ++ E  K+
Sbjct: 86  FSLIKNHLEEEKEKKKNLNKEEKEAKKKERAKR 118


>gnl|CDD|199907 cd10320, RGL4_N, N-terminal catalytic domain of rhamnogalacturonan
           lyase, a family 4 polysaccharide lyase.  The
           rhamnogalacturonan lyase of the polysaccharide lyase
           family 4 (RGL4) is involved in the degradation of RG
           (rhamnogalacturonan) type-I, an important pectic plant
           cell wall polysaccharide, by cleaving the alpha-1,4
           glycoside bond between L-rhamnose and D-galacturonic
           acids in the backbone of RG type-I through a
           beta-elimination reaction. RGL4 consists of three
           domains, an N-terminal catalytic domain, a middle domain
           with a FNIII type fold and a C-terminal domain with a
           jelly roll fold; the middle and C-terminal domains are
           both putative carbohydrate binding modules. There are
           two types of RG lyases, which both cleave the alpha-1,4
           bonds of the RG-I main chain (RG chain) through the
           beta-elimination reaction, but belong to two
           structurally unrelated polysaccharide lyase (PL)
           families, 4 and 11.
          Length = 265

 Score = 29.3 bits (66), Expect = 0.87
 Identities = 20/78 (25%), Positives = 34/78 (43%), Gaps = 12/78 (15%)

Query: 17  LQVTMSSKEYTNIIELK-----QGRLRGVVRSPVHNGNLNNIHMY--LGIPYAAPPVGQL 69
           L  T+S+ +  + I++      +  L  VVR          I+MY     P   P +G+L
Sbjct: 73  LGATVSATQSGDYIQISFSRTFETELHYVVRK-----GEPGIYMYTVATHPAPEPSLGEL 127

Query: 70  RFMAPQSPQPWPGLMIAD 87
           R +   +P  +P   I+D
Sbjct: 128 RTVFRLNPDLFPNGAISD 145


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 29.4 bits (66), Expect = 0.88
 Identities = 11/33 (33%), Positives = 24/33 (72%), Gaps = 3/33 (9%)

Query: 140 LQEEE---EEKKEEREEKKKEEKKKEEGEEKKK 169
            +EE+   EE +++++E+  E+K+K + +E+KK
Sbjct: 402 AKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434



 Score = 28.6 bits (64), Expect = 1.9
 Identities = 17/71 (23%), Positives = 30/71 (42%), Gaps = 9/71 (12%)

Query: 99  NLDDDDFSKKMSKGRLQYYQALMPYLKNQSEDCLYLNIYTPLQEEEEEKKEEREEKKKEE 158
           +L  D   + + +    Y + L  Y K   E             EE E K + E+ K+EE
Sbjct: 361 DLSGDKRQELLKE----YNKKLQDYTKKLGE-----VKDETDASEEAEAKAKEEKLKQEE 411

Query: 159 KKKEEGEEKKK 169
            +K++ E+  +
Sbjct: 412 NEKKQKEQADE 422


>gnl|CDD|185466 PTZ00130, PTZ00130, heat shock protein 90; Provisional.
          Length = 814

 Score = 29.6 bits (66), Expect = 0.90
 Identities = 28/111 (25%), Positives = 49/111 (44%), Gaps = 7/111 (6%)

Query: 59  IPYAAPPVGQLRFMAPQSPQPW-PGLMIADSFSPVCPQKLPNL----DDDDFSKKMSKGR 113
           IP  AP +    F    S + +   +++AD F    P+ +  +    D DD    +S+ +
Sbjct: 375 IPSRAPSINDHLFTKQNSIKLYVRRVLVADEFVEFLPRYMSFVKGVVDSDDLPLNVSREQ 434

Query: 114 LQYYQALMPYLKNQSEDCL--YLNIYTPLQEEEEEKKEEREEKKKEEKKKE 162
           LQ  + L    K      L  +  +Y   ++ +E  + E  ++  EEKKKE
Sbjct: 435 LQQNKILKAVSKRIVRKILDTFRTLYKEGKKNKETLRAELAKETDEEKKKE 485


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
           entry is characterized by proteins with alternating
           conserved and low-complexity regions. Bud13 together
           with Snu17p and a newly identified factor,
           Pml1p/Ylr016c, form a novel trimeric complex. called The
           RES complex, pre-mRNA retention and splicing complex.
           Subunits of this complex are not essential for viability
           of yeasts but they are required for efficient splicing
           in vitro and in vivo. Furthermore, inactivation of this
           complex causes pre-mRNA leakage from the nucleus. Bud13
           contains a unique, phylogenetically conserved C-terminal
           region of unknown function.
          Length = 141

 Score = 28.8 bits (65), Expect = 0.93
 Identities = 12/29 (41%), Positives = 23/29 (79%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           +EE+E +KEE+E K+++EK+  +G  +K+
Sbjct: 18  REEKEREKEEKERKEEKEKEWGKGLVQKE 46



 Score = 27.7 bits (62), Expect = 2.0
 Identities = 8/20 (40%), Positives = 18/20 (90%)

Query: 144 EEEKKEEREEKKKEEKKKEE 163
           EE+++E+  EK+++E+K+E+
Sbjct: 15  EEKREEKEREKEEKERKEEK 34



 Score = 27.3 bits (61), Expect = 2.5
 Identities = 12/23 (52%), Positives = 20/23 (86%)

Query: 147 KKEEREEKKKEEKKKEEGEEKKK 169
            +E+REEK++E+++KE  EEK+K
Sbjct: 14  IEEKREEKEREKEEKERKEEKEK 36



 Score = 25.7 bits (57), Expect = 9.3
 Identities = 8/22 (36%), Positives = 19/22 (86%)

Query: 140 LQEEEEEKKEEREEKKKEEKKK 161
            + EE+E+++E +E+K+E++K+
Sbjct: 16  EKREEKEREKEEKERKEEKEKE 37


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 29.5 bits (66), Expect = 0.94
 Identities = 10/28 (35%), Positives = 21/28 (75%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            +E + +  +EE+K++E+ KEE ++KK+
Sbjct: 95  AKEPKNESGKEEEKEKEQVKEEKKKKKE 122



 Score = 28.3 bits (63), Expect = 2.1
 Identities = 10/29 (34%), Positives = 21/29 (72%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           + + E  KEE +EK++ +++K++ +EK K
Sbjct: 97  EPKNESGKEEEKEKEQVKEEKKKKKEKPK 125



 Score = 27.9 bits (62), Expect = 2.6
 Identities = 11/31 (35%), Positives = 21/31 (67%)

Query: 139 PLQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           P +E ++ K +E  ++K+  K+KE+ +EKK 
Sbjct: 124 PKEEPKDRKPKEEAKEKRPPKEKEKEKEKKV 154



 Score = 27.9 bits (62), Expect = 2.9
 Identities = 13/29 (44%), Positives = 24/29 (82%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           +EEE+EK++ +EEKKK+++K +E  + +K
Sbjct: 104 KEEEKEKEQVKEEKKKKKEKPKEEPKDRK 132



 Score = 27.2 bits (60), Expect = 5.6
 Identities = 11/28 (39%), Positives = 21/28 (75%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           E+ +E+K++++EK KEE K  + +E+ K
Sbjct: 111 EQVKEEKKKKKEKPKEEPKDRKPKEEAK 138



 Score = 26.4 bits (58), Expect = 9.1
 Identities = 12/28 (42%), Positives = 21/28 (75%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           E+E+E+ +E ++KKKE+ K+E  + K K
Sbjct: 107 EKEKEQVKEEKKKKKEKPKEEPKDRKPK 134


>gnl|CDD|181941 PRK09546, zntB, zinc transporter; Reviewed.
          Length = 324

 Score = 29.2 bits (66), Expect = 0.95
 Identities = 16/52 (30%), Positives = 23/52 (44%), Gaps = 21/52 (40%)

Query: 64  PPVGQL-----------RFMAPQSPQPWPGLMIADSFSPVCPQKLPNLDDDD 104
           PP G+L           R+MAPQ           D F+ +  ++LP + DDD
Sbjct: 179 PPRGELALLRKQLIVMRRYMAPQR----------DVFARLASERLPWMSDDD 220


>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
           (TAF4) is one of several TAFs that bind TBP and is
           involved in forming Transcription Factor IID (TFIID)
           complex.  The TATA Binding Protein (TBP) Associated
           Factor 4 (TAF4) is one of several TAFs that bind TBP and
           are involved in forming the Transcription Factor IID
           (TFIID) complex. TFIID is one of seven General
           Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
           TFIIF, and TFIID) that are involved in accurate
           initiation of transcription by RNA polymerase II in
           eukaryote. TFIID plays an important role in the
           recognition of promoter DNA and assembly of the
           pre-initiation complex. TFIID complex is composed of the
           TBP and at least 13 TAFs. TAFs from various species were
           originally named by their predicted molecular weight or
           their electrophoretic mobility in polyacrylamide gels. A
           new, unified nomenclature for the pol II TAFs has been
           suggested to show the relationship between TAF orthologs
           and paralogs. Several hypotheses are proposed for TAFs
           functions such as serving as activator-binding sites,
           core-promoter recognition or a role in essential
           catalytic activity. Each TAF, with the help of a
           specific activator, is required only for the expression
           of subset of genes and is not universally involved for
           transcription as are GTFs. In yeast and human cells,
           TAFs have been found as components of other complexes
           besides TFIID.   Several TAFs interact via histone-fold
           (HFD) motifs; HFD is the interaction motif involved in
           heterodimerization of the core histones and their
           assembly into nucleosome octamers. The minimal HFD
           contains three alpha-helices linked by two loops and is
           found in core histones, TAFS and many other
           transcription factors. TFIID has a histone octamer-like
           substructure. TAF4 domain interacts with TAF12 and makes
           a novel histone-like heterodimer that binds DNA and has
           a core promoter function of a subset of genes.
          Length = 212

 Score = 28.8 bits (65), Expect = 0.99
 Identities = 10/30 (33%), Positives = 17/30 (56%)

Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           L+ EEEEK++E E ++     K   E+ + 
Sbjct: 122 LEREEEEKRDEEERERLLRAAKSRSEQSRL 151


>gnl|CDD|218517 pfam05236, TAF4, Transcription initiation factor TFIID component
           TAF4 family.  This region of similarity is found in
           Transcription initiation factor TFIID component TAF4.
          Length = 255

 Score = 28.9 bits (65), Expect = 1.2
 Identities = 11/31 (35%), Positives = 19/31 (61%), Gaps = 1/31 (3%)

Query: 140 LQEEEEEKKEERE-EKKKEEKKKEEGEEKKK 169
            Q+EEEE++ ER  E   E+ ++   ++K K
Sbjct: 121 KQKEEEERRVERRRELGLEDPEQLRLKQKAK 151


>gnl|CDD|240341 PTZ00272, PTZ00272, heat shock protein 83 kDa (Hsp83); Provisional.
          Length = 701

 Score = 29.3 bits (65), Expect = 1.2
 Identities = 13/28 (46%), Positives = 18/28 (64%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           EE+ +K +E  E+ K E+ KE  E KKK
Sbjct: 220 EEDTKKADEDGEEPKVEEVKEGDEGKKK 247


>gnl|CDD|177433 PHA02608, 67, prohead core protein; Provisional.
          Length = 80

 Score = 27.4 bits (61), Expect = 1.2
 Identities = 4/28 (14%), Positives = 15/28 (53%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           EE E+  ++ ++   ++K  ++ ++   
Sbjct: 48  EEPEDDDDDEDDDDDDDKDDKDDDDDDD 75



 Score = 25.5 bits (56), Expect = 6.3
 Identities = 4/25 (16%), Positives = 17/25 (68%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEE 166
           +E+++  +++++K  ++   +E +E
Sbjct: 56  DEDDDDDDDKDDKDDDDDDDDEDDE 80


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 29.3 bits (66), Expect = 1.3
 Identities = 12/25 (48%), Positives = 20/25 (80%)

Query: 139 PLQEEEEEKKEEREEKKKEEKKKEE 163
              EE+ +KK+++E+KK+EE K+EE
Sbjct: 739 DSVEEKTKKKKKKEKKKEEEYKREE 763



 Score = 27.8 bits (62), Expect = 3.4
 Identities = 8/28 (28%), Positives = 22/28 (78%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           +  EEK +++++K+K+++++ + EEK +
Sbjct: 739 DSVEEKTKKKKKKEKKKEEEYKREEKAR 766



 Score = 26.6 bits (59), Expect = 9.4
 Identities = 10/28 (35%), Positives = 20/28 (71%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKK 168
           ++ E +  +  EEK K++KKKE+ +E++
Sbjct: 731 KDAEFKISDSVEEKTKKKKKKEKKKEEE 758


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 28.6 bits (64), Expect = 1.3
 Identities = 10/25 (40%), Positives = 17/25 (68%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEE 166
            E++ K+ E E+  +E++K EE EE
Sbjct: 156 LEKKLKELEAEDVDEEDEKDEEEEE 180


>gnl|CDD|235123 PRK03381, PRK03381, PII uridylyl-transferase; Provisional.
          Length = 774

 Score = 28.8 bits (65), Expect = 1.4
 Identities = 11/24 (45%), Positives = 14/24 (58%)

Query: 58  GIPYAAPPVGQLRFMAPQSPQPWP 81
           G+P AA  + +L   AP  P PWP
Sbjct: 344 GLPIAAATLSRLAASAPPLPTPWP 367


>gnl|CDD|236278 PRK08506, PRK08506, replicative DNA helicase; Provisional.
          Length = 472

 Score = 28.8 bits (65), Expect = 1.5
 Identities = 12/20 (60%), Positives = 19/20 (95%)

Query: 150 EREEKKKEEKKKEEGEEKKK 169
           EREEK+KE+K K+EG+E+++
Sbjct: 392 EREEKEKEKKAKKEGKEERR 411


>gnl|CDD|224403 COG1486, CelF, Alpha-galactosidases/6-phospho-beta-glucosidases,
           family 4 of glycosyl hydrolases [Carbohydrate transport
           and metabolism].
          Length = 442

 Score = 28.8 bits (65), Expect = 1.5
 Identities = 17/57 (29%), Positives = 24/57 (42%), Gaps = 3/57 (5%)

Query: 113 RLQYYQALMPYLKNQSEDCLYLNIYTPLQEEEEEKKEEREEKKKEE-KKKEEGEEKK 168
             +YY    PY   + +        T   E  + +KE  E  KK E K+K E  EK+
Sbjct: 257 YHRYY--YYPYDMKEKKPDEIEKFGTRANEVMKREKELFELYKKPELKEKPEELEKR 311


>gnl|CDD|216292 pfam01086, Clathrin_lg_ch, Clathrin light chain. 
          Length = 225

 Score = 28.6 bits (64), Expect = 1.5
 Identities = 10/34 (29%), Positives = 19/34 (55%), Gaps = 1/34 (2%)

Query: 137 YTPLQEEEEEK-KEEREEKKKEEKKKEEGEEKKK 169
              ++ EE E  ++ RE +    ++++E  EKKK
Sbjct: 103 ADRVEGEEPESIRKWRERRDLRIEERDEASEKKK 136


>gnl|CDD|224667 COG1753, COG1753, Predicted antotoxin, copG family [Signal
           transduction mechanisms].
          Length = 74

 Score = 27.1 bits (60), Expect = 1.5
 Identities = 12/54 (22%), Positives = 26/54 (48%)

Query: 107 KKMSKGRLQYYQALMPYLKNQSEDCLYLNIYTPLQEEEEEKKEEREEKKKEEKK 160
            KM +G+  +   +   ++ +  +   L        EEE +K ++EEK+ E+ +
Sbjct: 17  VKMKRGKESFSDVIRELIEKKRGNLEVLMRAFGTLSEEEVEKIKKEEKEAEKMR 70


>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
          Length = 1627

 Score = 29.0 bits (65), Expect = 1.6
 Identities = 15/30 (50%), Positives = 23/30 (76%)

Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           L+E++EEK EE+ E+ K E+  EE EE++K
Sbjct: 279 LKEKKEEKDEEKSEEVKTEEVDEEFEEEEK 308


>gnl|CDD|185582 PTZ00373, PTZ00373, 60S Acidic ribosomal protein P2; Provisional.
          Length = 112

 Score = 27.6 bits (61), Expect = 1.6
 Identities = 11/22 (50%), Positives = 14/22 (63%)

Query: 146 EKKEEREEKKKEEKKKEEGEEK 167
                + E KKEEKK+EE EE+
Sbjct: 82  ATAGAKAEAKKEEKKEEEEEEE 103



 Score = 26.4 bits (58), Expect = 5.1
 Identities = 9/22 (40%), Positives = 15/22 (68%)

Query: 142 EEEEEKKEEREEKKKEEKKKEE 163
                K E ++E+KKEE+++EE
Sbjct: 82  ATAGAKAEAKKEEKKEEEEEEE 103


>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
           component YidC; Validated.
          Length = 429

 Score = 28.7 bits (64), Expect = 1.7
 Identities = 9/29 (31%), Positives = 16/29 (55%), Gaps = 1/29 (3%)

Query: 141 QEEEEEKKEEREEKKKEE-KKKEEGEEKK 168
            E  E KK    EK + + +KKE  ++++
Sbjct: 327 AENAEIKKTRTAEKNEAKARKKEIAQKRR 355


>gnl|CDD|100109 cd05831, Ribosomal_P1, Ribosomal protein P1. This subfamily
           represents the eukaryotic large ribosomal protein P1.
           Eukaryotic P1 and P2 are functionally equivalent to the
           bacterial protein L7/L12, but are not homologous to
           L7/L12. P1 is located in the L12 stalk, with proteins
           P2, P0, L11, and 28S rRNA. P1 and P2 are the only
           proteins in the ribosome to occur as multimers, always
           appearing as sets of heterodimers. Recent data indicate
           that eukaryotes have four copies (two heterodimers),
           while most archaeal species contain six copies of L12p
           (three homodimers) and bacteria may have four or six
           copies (two or three homodimers), depending on the
           species. Experiments using S. cerevisiae P1 and P2
           indicate that P1 proteins are positioned more internally
           with limited reactivity in the C-terminal domains, while
           P2 proteins seem to be more externally located and are
           more likely to interact with other cellular components.
           In lower eukaryotes, P1 and P2 are further subdivided
           into P1A, P1B, P2A, and P2B, which form P1A/P2B and
           P1B/P2A heterodimers. Some plant species have a third
           P-protein, called P3, which is not homologous to P1 and
           P2. In humans, P1 and P2 are strongly autoimmunogenic.
           They play a significant role in the etiology and
           pathogenesis of systemic lupus erythema (SLE). In
           addition, the ribosome-inactivating protein
           trichosanthin (TCS) interacts with human P0, P1, and P2,
           with its primary binding site located in the C-terminal
           region of P2. TCS inactivates the ribosome by
           depurinating a specific adenine in the sarcin-ricin loop
           of 28S rRNA.
          Length = 103

 Score = 27.3 bits (61), Expect = 1.8
 Identities = 11/15 (73%), Positives = 12/15 (80%)

Query: 152 EEKKKEEKKKEEGEE 166
            E KKEEKK+EE EE
Sbjct: 79  AEAKKEEKKEEEEEE 93



 Score = 25.7 bits (57), Expect = 7.6
 Identities = 7/16 (43%), Positives = 13/16 (81%)

Query: 148 KEEREEKKKEEKKKEE 163
            E ++E+KKEE+++E 
Sbjct: 79  AEAKKEEKKEEEEEES 94



 Score = 25.4 bits (56), Expect = 8.3
 Identities = 8/16 (50%), Positives = 12/16 (75%)

Query: 152 EEKKKEEKKKEEGEEK 167
             + K+E+KKEE EE+
Sbjct: 78  AAEAKKEEKKEEEEEE 93


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 28.4 bits (63), Expect = 1.8
 Identities = 7/29 (24%), Positives = 18/29 (62%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            +  + K+E +  +K+ E K+ E ++K++
Sbjct: 303 HKAFDLKQESKASEKEAEDKELEAQKKRE 331


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 28.2 bits (63), Expect = 1.9
 Identities = 11/29 (37%), Positives = 18/29 (62%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
             E EE KEE +E    E K+++G+ +K+
Sbjct: 59  TAEIEEVKEEEKEAANSEDKEDKGDAEKE 87



 Score = 28.2 bits (63), Expect = 2.2
 Identities = 11/29 (37%), Positives = 19/29 (65%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           +E+E    E++E+K   EK+ EE EE+ +
Sbjct: 68  EEKEAANSEDKEDKGDAEKEDEESEEENE 96



 Score = 27.8 bits (62), Expect = 2.2
 Identities = 11/29 (37%), Positives = 17/29 (58%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            E EE K+EE+E    E+K+ +   EK+ 
Sbjct: 60  AEIEEVKEEEKEAANSEDKEDKGDAEKED 88


>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain.  Axonemal
           dynein light chain proteins play a dynamic role in
           flagellar and cilia motility. Eukaryotic cilia and
           flagella are complex organelles consisting of a core
           structure, the axoneme, which is composed of nine
           microtubule doublets forming a cylinder that surrounds a
           pair of central singlet microtubules. This
           ultra-structural arrangement seems to be one of the most
           stable micro-tubular assemblies known and is responsible
           for the flagellar and ciliary movement of a large number
           of organisms ranging from protozoan to mammals. This
           light chain interacts directly with the N-terminal half
           of the heavy chains.
          Length = 189

 Score = 27.9 bits (63), Expect = 1.9
 Identities = 11/26 (42%), Positives = 17/26 (65%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEK 167
           ++ EE+KEE E++  E + K E  EK
Sbjct: 130 KKLEEEKEELEKRVAELEAKLEAIEK 155


>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355).  This
           family of proteins is found in bacteria and viruses.
           Proteins in this family are typically between 180 and
           214 amino acids in length.
          Length = 125

 Score = 27.6 bits (62), Expect = 1.9
 Identities = 10/29 (34%), Positives = 18/29 (62%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           QEE++ + E+  +   EEK + E E+ +K
Sbjct: 28  QEEKKSEAEKLAKMSAEEKAEYELEKLEK 56


>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM).  This
           family consists of several Plasmodium falciparum SPAM
           (secreted polymorphic antigen associated with
           merozoites) proteins. Variation among SPAM alleles is
           the result of deletions and amino acid substitutions in
           non-repetitive sequences within and flanking the alanine
           heptad-repeat domain. Heptad repeats in which the a and
           d position contain hydrophobic residues generate
           amphipathic alpha-helices which give rise to helical
           bundles or coiled-coil structures in proteins. SPAM is
           an example of a P. falciparum antigen in which a
           repetitive sequence has features characteristic of a
           well-defined structural element.
          Length = 164

 Score = 27.9 bits (62), Expect = 2.2
 Identities = 11/27 (40%), Positives = 16/27 (59%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKK 168
           EEE  + EE EE+ +E+    +  EKK
Sbjct: 71  EEEIVEDEEEEEEDEEDNVDLKDIEKK 97



 Score = 27.9 bits (62), Expect = 2.3
 Identities = 10/20 (50%), Positives = 14/20 (70%)

Query: 141 QEEEEEKKEEREEKKKEEKK 160
           +EEEEE +E+  + K  EKK
Sbjct: 78  EEEEEEDEEDNVDLKDIEKK 97


>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
           Provisional.
          Length = 695

 Score = 28.4 bits (64), Expect = 2.2
 Identities = 7/28 (25%), Positives = 14/28 (50%), Gaps = 2/28 (7%)

Query: 141 QEEEEEKKEEREEKKKE--EKKKEEGEE 166
           Q   E +K  RE + K+  E +  + ++
Sbjct: 459 QARLEREKAAREARHKKAAEARAAKDKD 486


>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
           This family consists of several Nucleopolyhedrovirus
           capsid protein P87 sequences. P87 is expressed late in
           infection and concentrated in infected cell nuclei.
          Length = 606

 Score = 28.3 bits (63), Expect = 2.3
 Identities = 13/97 (13%), Positives = 28/97 (28%), Gaps = 6/97 (6%)

Query: 72  MAPQSPQPWPGLMIADSFSPVCPQKLPNLDDDDFSKKMSKGRLQYYQALMPYLKNQSEDC 131
               S    P      ++  +  ++ P+      S   S                +    
Sbjct: 307 AMYSSDYQAPKPEPIYTWEELLRERFPSDLFAISSLPDSD------SEASDSGPTRKRKR 360

Query: 132 LYLNIYTPLQEEEEEKKEEREEKKKEEKKKEEGEEKK 168
             +        +E+E   + +E   E+++K   EE K
Sbjct: 361 RRVPPLPEYSSDEDEDDSDEDEVDYEKERKRRREEDK 397


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
           family is found in eukaryotes; it has several conserved
           tryptophan residues. The function is not known.
          Length = 261

 Score = 28.2 bits (63), Expect = 2.3
 Identities = 7/24 (29%), Positives = 23/24 (95%)

Query: 140 LQEEEEEKKEEREEKKKEEKKKEE 163
           L++++++++EER +++K+++++EE
Sbjct: 192 LKQQQQKREEERRKQRKKQQEEEE 215



 Score = 27.4 bits (61), Expect = 3.6
 Identities = 11/29 (37%), Positives = 24/29 (82%), Gaps = 2/29 (6%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            +++++K+EE  E++K+ KK++E EE+K+
Sbjct: 192 LKQQQQKREE--ERRKQRKKQQEEEERKQ 218



 Score = 27.0 bits (60), Expect = 4.4
 Identities = 8/30 (26%), Positives = 26/30 (86%)

Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
             ++ ++++++REE++++++KK++ EE++K
Sbjct: 188 ELKKLKQQQQKREEERRKQRKKQQEEEERK 217



 Score = 27.0 bits (60), Expect = 4.4
 Identities = 10/27 (37%), Positives = 24/27 (88%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEK 167
           Q+ EEE++++R+++++EE++K++ EE 
Sbjct: 197 QKREEERRKQRKKQQEEEERKQKAEEA 223


>gnl|CDD|169428 PRK08404, PRK08404, V-type ATP synthase subunit H; Validated.
          Length = 103

 Score = 27.1 bits (60), Expect = 2.3
 Identities = 17/28 (60%), Positives = 20/28 (71%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           EEE  KK E E +K  EKKK+EGEE+ K
Sbjct: 41  EEEIIKKAEEEAQKLIEKKKKEGEEEAK 68



 Score = 25.5 bits (56), Expect = 7.2
 Identities = 15/30 (50%), Positives = 20/30 (66%)

Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           ++EE  +K EE  +K  E+KKKE  EE KK
Sbjct: 40  IEEEIIKKAEEEAQKLIEKKKKEGEEEAKK 69


>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32.  This family
           consists of several mammalian specific proacrosin
           binding protein sp32 sequences. sp32 is a sperm specific
           protein which is known to bind with with 55- and 53-kDa
           proacrosins and the 49-kDa acrosin intermediate. The
           exact function of sp32 is unclear, it is thought however
           that the binding of sp32 to proacrosin may be involved
           in packaging the acrosin zymogen into the acrosomal
           matrix.
          Length = 243

 Score = 28.1 bits (62), Expect = 2.4
 Identities = 11/25 (44%), Positives = 20/25 (80%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGE 165
            EE++ ++E+ EE+ +EE K+EEG+
Sbjct: 215 TEEKQPQEEQEEEEVEEEAKQEEGQ 239


>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family.  The yeast member of this family
           (Kri1p) is found to be required for 40S ribosome
           biogenesis in the nucleolus.
          Length = 99

 Score = 26.8 bits (60), Expect = 2.4
 Identities = 11/27 (40%), Positives = 18/27 (66%), Gaps = 2/27 (7%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEK 167
           +EE+ +++EE +  K    K+EE EEK
Sbjct: 5   EEEKAQREEELKRLKNL--KREEIEEK 29


>gnl|CDD|218684 pfam05672, MAP7, MAP7 (E-MAP-115) family.  The organisation of
           microtubules varies with the cell type and is presumably
           controlled by tissue-specific microtubule-associated
           proteins (MAPs). The 115-kDa epithelial MAP
           (E-MAP-115/MAP7) has been identified as a
           microtubule-stabilising protein predominantly expressed
           in cell lines of epithelial origin. The binding of this
           microtubule associated protein is nucleotide
           independent.
          Length = 171

 Score = 27.8 bits (61), Expect = 2.5
 Identities = 13/28 (46%), Positives = 20/28 (71%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           EEE  ++EE   ++KEEK K + EE++K
Sbjct: 72  EEEARRQEEERAREKEEKAKRKAEEEEK 99


>gnl|CDD|221250 pfam11831, Myb_Cef, pre-mRNA splicing factor component.  This
           family is a region of the Myb-Related Cdc5p/Cef1
           proteins, in fungi, and is part of the pre-mRNA splicing
           factor complex.
          Length = 363

 Score = 28.1 bits (63), Expect = 2.6
 Identities = 9/28 (32%), Positives = 18/28 (64%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           EE EE   +R+ +K+  ++ +E EE ++
Sbjct: 162 EELEEDAADRDARKRAAEEAKEQEELRR 189


>gnl|CDD|197363 cd09846, DUF1312, N-Utilization Substance G (NusG) N terminal
          (NGN) insert and Lin0431 are part of DUF1312.  Domains
          of Unknown Function 1312 (DUF1312) are represented in
          at least 71 bacterial species with no functional
          annotation. Included in this family are N-Utilization
          Substance G (NusG) N terminal (NGN) insert and Lin0431,
          having similar structure and surface features that
          appear to be conserved across these domain families,
          suggesting similar function. NusG contains NGN at the
          N-terminus and Kyrpides Ouzounis and Woese (KOW)
          repeats at the C-terminus in bacteria and archaea, and
          this insert (often known as Domain II) is found in
          several bacteria. Lin0431 is similar to NGN-insert but
          does ot contain the disulphite bridge.
          Length = 81

 Score = 26.4 bits (59), Expect = 2.7
 Identities = 11/21 (52%), Positives = 15/21 (71%), Gaps = 1/21 (4%)

Query: 24 KEYTNIIELKQGRLRGVVRSP 44
          + YTNIIE++ GR+R V  S 
Sbjct: 30 EGYTNIIEVEDGRVR-VKESD 49


>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
           complex component).  RNases H are enzymes that
           specifically hydrolyse RNA when annealed to a
           complementary DNA and are present in all living
           organisms. In yeast RNase H2 is composed of a complex of
           three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
           family represents the homologues of Ydr279p. It is not
           known whether non yeast proteins in this family fulfil
           the same function.
          Length = 287

 Score = 27.7 bits (62), Expect = 2.7
 Identities = 10/31 (32%), Positives = 16/31 (51%)

Query: 139 PLQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
              E  E + E++ + K+E KKK+  E K  
Sbjct: 239 EDVEAAESRAEKKRKSKEEIKKKKPKESKGV 269


>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6.  The surfeit locus
           protein SURF-6 is shown to be a component of the
           nucleolar matrix and has a strong binding capacity for
           nucleic acids.
          Length = 206

 Score = 27.7 bits (62), Expect = 2.7
 Identities = 12/28 (42%), Positives = 19/28 (67%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKK 168
           Q +  +K++ +E KKKE+ +K E EE K
Sbjct: 17  QRKARKKQKRKEAKKKEDAQKSEAEEVK 44



 Score = 27.3 bits (61), Expect = 3.8
 Identities = 10/28 (35%), Positives = 17/28 (60%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           E   E++  + E++K  KK++  E KKK
Sbjct: 5   EALLEQRRRKREQRKARKKQKRKEAKKK 32



 Score = 26.9 bits (60), Expect = 4.3
 Identities = 9/26 (34%), Positives = 20/26 (76%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEE 166
           + + E++K  +++K+KE KKKE+ ++
Sbjct: 12  RRKREQRKARKKQKRKEAKKKEDAQK 37


>gnl|CDD|218328 pfam04921, XAP5, XAP5, circadian clock regulator.  This protein is
           found in a wide range of eukaryotes. It is a nuclear
           protein and is suggested to be DNA binding. In plants,
           this family is essential for correct circadian clock
           functioning by acting as a light-quality regulator
           coordinating the activities of blue and red light
           signalling pathways during plant growth - inhibiting
           growth in red light but promoting growth in blue light.
          Length = 233

 Score = 27.7 bits (62), Expect = 2.8
 Identities = 17/43 (39%), Positives = 24/43 (55%), Gaps = 2/43 (4%)

Query: 123 YLKNQSEDCLYLNIYTPLQEEEEEKKEEREE--KKKEEKKKEE 163
             KN S D  +L      ++E E ++E REE  KK+E  K+EE
Sbjct: 50  IGKNPSVDTSFLPDKAREEKEAELREELREEFLKKQEAVKEEE 92


>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
          Length = 746

 Score = 27.9 bits (63), Expect = 2.9
 Identities = 12/28 (42%), Positives = 16/28 (57%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           EEEE +KEE + + +EE K E  E    
Sbjct: 39  EEEEARKEEAKREAEEEAKAEAEEAAAA 66


>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 197

 Score = 27.3 bits (60), Expect = 3.1
 Identities = 10/22 (45%), Positives = 19/22 (86%)

Query: 142 EEEEEKKEEREEKKKEEKKKEE 163
           E  EEK +ERE+++++E+++EE
Sbjct: 145 ERREEKIDEREDQEEQEREREE 166



 Score = 26.6 bits (58), Expect = 6.1
 Identities = 12/30 (40%), Positives = 23/30 (76%), Gaps = 1/30 (3%)

Query: 141 QEEEEEKKE-EREEKKKEEKKKEEGEEKKK 169
           +EE EE+ E ER E+K +E++ +E +E+++
Sbjct: 135 REELEERMEWERREEKIDEREDQEEQERER 164


>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1.  All
           proteins in this family for which functions are known
           are cyclin dependent protein kinases that are components
           of TFIIH, a complex that is involved in nucleotide
           excision repair and transcription initiation. Also known
           as MAT1 (menage a trois 1). This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 309

 Score = 27.5 bits (61), Expect = 3.3
 Identities = 14/71 (19%), Positives = 32/71 (45%), Gaps = 6/71 (8%)

Query: 99  NLDDDDFSKKMSKGRLQYYQALMPYLKNQSEDCLYLNIYTPLQEEEEEKK------EERE 152
           N+D ++  KK+   + +    +    +  + +   L      ++EEEE++      EE E
Sbjct: 112 NIDLENTKKKIETYQKENKDVIQKNKEKSTREQEELEEALEFEKEEEEQRRLLLQKEEEE 171

Query: 153 EKKKEEKKKEE 163
           ++  + K K+ 
Sbjct: 172 QQMNKRKNKQA 182


>gnl|CDD|236082 PRK07737, fliD, flagellar capping protein; Validated.
          Length = 501

 Score = 27.8 bits (62), Expect = 3.3
 Identities = 10/25 (40%), Positives = 16/25 (64%)

Query: 137 YTPLQEEEEEKKEEREEKKKEEKKK 161
           Y PL +E++E   E++ +  EEK K
Sbjct: 303 YPPLTDEQKEAMTEKQIELWEEKAK 327


>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
          Length = 135

 Score = 27.0 bits (60), Expect = 3.3
 Identities = 12/30 (40%), Positives = 24/30 (80%)

Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           L+E+ ++KK+++++KKK++KK  +   KKK
Sbjct: 105 LKEKAKQKKQKKKKKKKKKKKTSKKAAKKK 134


>gnl|CDD|152814 pfam12379, DUF3655, Protein of unknown function (DUF3655).  This
           domain family is found in viruses, and is approximately
           70 amino acids in length. The family is found in
           association with pfam08716, pfam01661, pfam05409,
           pfam06471, pfam08717, pfam06478, pfam09401, pfam06460,
           pfam08715, pfam08710.
          Length = 70

 Score = 26.2 bits (57), Expect = 3.5
 Identities = 13/35 (37%), Positives = 21/35 (60%)

Query: 132 LYLNIYTPLQEEEEEKKEEREEKKKEEKKKEEGEE 166
           +Y + Y P +EE+ E+ E  EE+ +E  + E G E
Sbjct: 1   MYCSFYPPDEEEDCEEYECEEEEIEETCEHEYGTE 35


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 27.7 bits (61), Expect = 3.5
 Identities = 10/25 (40%), Positives = 17/25 (68%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEE 166
           + EEEK E++  K+++ KK +E  E
Sbjct: 127 DVEEEKTEKKVRKRRKVKKMDEDVE 151


>gnl|CDD|218734 pfam05758, Ycf1, Ycf1.  The chloroplast genomes of most higher
           plants contain two giant open reading frames designated
           ycf1 and ycf2. Although the function of Ycf1 is unknown,
           it is known to be an essential gene.
          Length = 832

 Score = 27.7 bits (62), Expect = 3.6
 Identities = 13/45 (28%), Positives = 22/45 (48%), Gaps = 8/45 (17%)

Query: 124 LKNQSEDCLYLNIYTPLQEEEEEKKEEREEKKKEEKKKEEGEEKK 168
           LK  SE        T  +EEE + + E   + K  K+++EG  ++
Sbjct: 226 LKETSE--------TEEREEETDVEIETTSETKGTKQEQEGSTEE 262


>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 187 and 201 amino
           acids in length. There is a single completely conserved
           residue Q that may be functionally important.
          Length = 158

 Score = 27.2 bits (61), Expect = 3.7
 Identities = 9/29 (31%), Positives = 15/29 (51%)

Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKK 168
           +   ++EK   R E K+ E K +E E + 
Sbjct: 76  IARYKKEKARYRSEAKELEAKAKEAEAES 104


>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein.  This
           family includes the radial spoke head proteins RSP4 and
           RSP6 from Chlamydomonas reinhardtii, and several
           eukaryotic homologues, including mammalian RSHL1, the
           protein product of a familial ciliary dyskinesia
           candidate gene.
          Length = 481

 Score = 27.7 bits (62), Expect = 3.7
 Identities = 14/45 (31%), Positives = 25/45 (55%), Gaps = 2/45 (4%)

Query: 122 PYLKNQSEDCLYLNIYTPLQEEEEEKKEEREEKKKEEKKKEEGEE 166
            ++  Q   C ++N      EEEE++ EE EE+++E ++ E  E 
Sbjct: 331 QHILPQGR-CTWVNP-EQKDEEEEQEDEEEEEEEEEPEEPEPEEG 373


>gnl|CDD|214818 smart00784, SPT2, SPT2 chromatin protein.  This entry includes the
           Saccharomyces cerevisiae protein SPT2 which is a
           chromatin protein involved in transcriptional
           regulation.
          Length = 106

 Score = 26.6 bits (59), Expect = 3.7
 Identities = 16/74 (21%), Positives = 34/74 (45%), Gaps = 9/74 (12%)

Query: 101 DDDDFSKKM-----SKGRLQYYQALMPYLKNQSEDCLYLNIYTPLQEEEEEKKEEREEKK 155
           ++DD+ +       +KGR +Y         +  +D +        +EE    +  R E +
Sbjct: 32  EEDDYDRDEIWAMFNKGRKRYAY----RDDDDDDDDMEAGGADIQEEERRSARLARLEDR 87

Query: 156 KEEKKKEEGEEKKK 169
           +EE+ ++E E +K+
Sbjct: 88  EEERLEKEEEREKR 101


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 27.0 bits (60), Expect = 3.8
 Identities = 11/28 (39%), Positives = 17/28 (60%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           EE EE+  +  E++   + K E +EKKK
Sbjct: 99  EETEEESTDETEQEDPPETKTESKEKKK 126



 Score = 27.0 bits (60), Expect = 3.9
 Identities = 8/32 (25%), Positives = 15/32 (46%)

Query: 138 TPLQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
               +E E++     + + +EKKK E  + K 
Sbjct: 103 EESTDETEQEDPPETKTESKEKKKREVPKPKT 134



 Score = 26.3 bits (58), Expect = 7.1
 Identities = 12/29 (41%), Positives = 18/29 (62%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           +E EEE  +E E++   E K E  E+KK+
Sbjct: 99  EETEEESTDETEQEDPPETKTESKEKKKR 127


>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin.  Trichoplein
           or mitostatin, was first defined as a meiosis-specific
           nuclear structural protein. It has since been linked
           with mitochondrial movement. It is associated with the
           mitochondrial outer membrane, and over-expression leads
           to reduction in mitochondrial motility whereas lack of
           it enhances mitochondrial movement. The activity appears
           to be mediated through binding the mitochondria to the
           actin intermediate filaments (IFs).
          Length = 349

 Score = 27.2 bits (61), Expect = 4.2
 Identities = 16/29 (55%), Positives = 22/29 (75%)

Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKK 168
           L+EE +E  EER E+K+EEK++E  EE K
Sbjct: 126 LREEIDEFNEERIERKEEEKEREREEELK 154


>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation
           domain.  The centromere protein B (CENP-B) dimerisation
           domain is composed of two alpha-helices, which are
           folded into an antiparallel configuration. Dimerisation
           of CENP-B is mediated by this domain, in which monomers
           dimerise to form a symmetrical, antiparallel, four-helix
           bundle structure with a large hydrophobic patch in which
           23 residues of one monomer form van der Waals contacts
           with the other monomer. This CENP-B dimer configuration
           may be suitable for capturing two distant CENP-B boxes
           during centromeric heterochromatin formation.
          Length = 101

 Score = 26.3 bits (57), Expect = 4.2
 Identities = 7/24 (29%), Positives = 15/24 (62%)

Query: 143 EEEEKKEEREEKKKEEKKKEEGEE 166
           +EEE  ++ +E+  +E   E+ +E
Sbjct: 17  DEEEDDDDEDEEDDDEDDDEDDDE 40


>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3.  This family of
           proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). SH3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of SH3, AAPs are retained in the ER.
          Length = 196

 Score = 26.9 bits (60), Expect = 4.3
 Identities = 12/32 (37%), Positives = 17/32 (53%), Gaps = 2/32 (6%)

Query: 140 LQEE--EEEKKEEREEKKKEEKKKEEGEEKKK 169
           L EE   EE +     K++E  + E+ E KKK
Sbjct: 164 LLEEFAAEEAEAAAAAKEEESAEGEKKESKKK 195


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 27.4 bits (61), Expect = 4.8
 Identities = 9/32 (28%), Positives = 18/32 (56%)

Query: 138 TPLQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
              ++++++KKE++E K + E K      KK 
Sbjct: 68  KKSEKKKKKKKEKKEPKSEGETKLGFKTPKKS 99



 Score = 27.0 bits (60), Expect = 5.9
 Identities = 15/29 (51%), Positives = 20/29 (68%), Gaps = 2/29 (6%)

Query: 143 EEEEKKEEREEKKKEEKK--KEEGEEKKK 169
            E +KK E+++KKK+EKK  K EGE K  
Sbjct: 64  NESKKKSEKKKKKKKEKKEPKSEGETKLG 92


>gnl|CDD|192481 pfam10197, Cir_N, N-terminal domain of CBF1 interacting
           co-repressor CIR.  This is a 45 residue conserved region
           at the N-terminal end of a family of proteins referred
           to as CIRs (CBF1-interacting co-repressors). CBF1
           (centromere-binding factor 1) acts as a transcription
           factor that causes repression by binding specifically to
           GTGGGAA motifs in responsive promoters, and it requires
           CIR as a co-repressor. CIR binds to histone deacetylase
           and to SAP30 and serves as a linker between CBF1 and the
           histone deacetylase complex.
          Length = 37

 Score = 24.5 bits (54), Expect = 5.1
 Identities = 12/23 (52%), Positives = 15/23 (65%)

Query: 144 EEEKKEEREEKKKEEKKKEEGEE 166
           E E+K   E+KK EE +KE  EE
Sbjct: 15  EAEQKALEEQKKIEELRKEIEEE 37


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
           envelope biogenesis, outer membrane].
          Length = 387

 Score = 27.2 bits (60), Expect = 5.2
 Identities = 11/28 (39%), Positives = 18/28 (64%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKK 168
            ++   KK E++ KKKEE+  EE + K+
Sbjct: 69  SQQSSAKKGEQQRKKKEEQVAEELKPKQ 96



 Score = 26.8 bits (59), Expect = 6.9
 Identities = 12/29 (41%), Positives = 21/29 (72%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           Q EE EK+ + E+K++EE+ ++   E+KK
Sbjct: 119 QAEEAEKQAQLEQKQQEEQARKAAAEQKK 147



 Score = 26.5 bits (58), Expect = 7.5
 Identities = 8/29 (27%), Positives = 19/29 (65%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
             E+E  K+  +E+ K ++++++ EE +K
Sbjct: 97  AAEQERLKQLEKERLKAQEQQKQAEEAEK 125



 Score = 26.5 bits (58), Expect = 8.2
 Identities = 12/29 (41%), Positives = 14/29 (48%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           +    +KK E E K   EK K E E K K
Sbjct: 188 EAAAAKKKAEAEAKAAAEKAKAEAEAKAK 216


>gnl|CDD|223542 COG0466, Lon, ATP-dependent Lon protease, bacterial type
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 782

 Score = 27.1 bits (61), Expect = 5.2
 Identities = 21/64 (32%), Positives = 32/64 (50%), Gaps = 13/64 (20%)

Query: 107 KKMSKGRLQYYQALMPYLKN-QSEDCLYLNIYTPLQEEEEEKKEEREEKKKEEKKKEEGE 165
           ++M K + +YY  L   LK  Q E          L E++++K E  E ++K EK K   E
Sbjct: 220 EQMEKSQREYY--LREQLKAIQKE----------LGEDDDDKDEVEELREKIEKLKLPKE 267

Query: 166 EKKK 169
            K+K
Sbjct: 268 AKEK 271


>gnl|CDD|116948 pfam08367, M16C_assoc, Peptidase M16C associated.  This domain
           appears in eukaryotes as well as bacteria and tends to
           be found near the C-terminus of the metalloprotease M16C
           (pfam05193).
          Length = 248

 Score = 26.8 bits (60), Expect = 5.3
 Identities = 13/28 (46%), Positives = 16/28 (57%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           E  EEK  E E +K E+KK    EE K+
Sbjct: 5   ETLEEKLNEEEREKLEKKKSSLSEEDKE 32


>gnl|CDD|188469 TIGR03954, integ_memb_HG, integral membrane protein.  This model
          describes a strictly bacterial integral membrane domain
          of about 85 residues in length. It occurs in proteins
          that on rare occasions are fused to transporter domains
          such as the major facilitator superfamily domain. Of
          three invariant residues, two occur as a His-Gly
          dipeptide in the middle of three predicted
          transmembrane helices [Unknown function, General].
          Length = 85

 Score = 25.9 bits (58), Expect = 5.3
 Identities = 9/16 (56%), Positives = 10/16 (62%)

Query: 6  ALFTGLLLLLLLQVTM 21
          A   G+ LLLLL V M
Sbjct: 5  AFAEGVSLLLLLFVAM 20


>gnl|CDD|224401 COG1484, DnaC, DNA replication protein [DNA replication,
           recombination, and repair].
          Length = 254

 Score = 27.0 bits (60), Expect = 5.6
 Identities = 12/41 (29%), Positives = 17/41 (41%)

Query: 129 EDCLYLNIYTPLQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            DC        L+   EE+K  RE +K E + +      KK
Sbjct: 34  ADCKEWGYAEFLEYLLEEEKLAREARKIERRLRSASFPAKK 74


>gnl|CDD|227238 COG4901, COG4901, Ribosomal protein S25 [Translation, ribosomal
           structure and biogenesis].
          Length = 107

 Score = 26.0 bits (57), Expect = 5.8
 Identities = 9/28 (32%), Positives = 17/28 (60%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKK 168
           Q  +E+K E+ +    ++KKK   ++KK
Sbjct: 6   QLSKEKKAEKAKAGTAKDKKKWSKKKKK 33



 Score = 25.6 bits (56), Expect = 7.7
 Identities = 10/31 (32%), Positives = 18/31 (58%)

Query: 139 PLQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           P  +  +EKK E+ +    + KK+  ++KKK
Sbjct: 3   PKSQLSKEKKAEKAKAGTAKDKKKWSKKKKK 33


>gnl|CDD|219761 pfam08243, SPT2, SPT2 chromatin protein.  This family includes the
           Saccharomyces cerevisiae protein SPT2 which is a
           chromatin protein involved in transcriptional
           regulation.
          Length = 116

 Score = 26.0 bits (57), Expect = 6.1
 Identities = 11/22 (50%), Positives = 17/22 (77%)

Query: 140 LQEEEEEKKEEREEKKKEEKKK 161
           L++E E  +EE EEK+K++KK 
Sbjct: 94  LEDERELAREEEEEKRKKKKKN 115



 Score = 26.0 bits (57), Expect = 6.7
 Identities = 10/30 (33%), Positives = 18/30 (60%)

Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            +EE    +  R E ++E  ++EE E++KK
Sbjct: 82  QKEERRSARMARLEDERELAREEEEEKRKK 111


>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 142

 Score = 26.2 bits (58), Expect = 6.2
 Identities = 13/31 (41%), Positives = 23/31 (74%), Gaps = 2/31 (6%)

Query: 141 QEEEEEKKEEREEKKKE--EKKKEEGEEKKK 169
           +EE++ K EE+  KK+   +KKK++ ++KKK
Sbjct: 72  REEKKRKDEEKTAKKRAKRQKKKQKKKKKKK 102


>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34.  This family represents
           herpes virus protein U79 and cytomegalovirus early
           phosphoprotein P34 (UL112).
          Length = 238

 Score = 26.8 bits (59), Expect = 6.3
 Identities = 8/26 (30%), Positives = 18/26 (69%)

Query: 144 EEEKKEEREEKKKEEKKKEEGEEKKK 169
           E+ +K + + +KK+E+K+   E+K+ 
Sbjct: 173 EDSQKHKEDRRKKQEEKRRNDEDKRP 198


>gnl|CDD|222117 pfam13422, DUF4110, Domain of unknown function (DUF4110).  This is
           a family that is found predominantly at the C-terminus
           of Kelch-containing proteins. However, the exact
           function of this region is not known.
          Length = 99

 Score = 25.8 bits (57), Expect = 6.3
 Identities = 9/27 (33%), Positives = 13/27 (48%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKK 168
           EEEE+   E      +EK    GE ++
Sbjct: 72  EEEEQAGIEEVVSLAKEKDWGGGEGRR 98


>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812).  This
           family consists of several eukaryotic proteins of
           unknown function.
          Length = 536

 Score = 27.0 bits (60), Expect = 6.4
 Identities = 8/23 (34%), Positives = 14/23 (60%)

Query: 140 LQEEEEEKKEEREEKKKEEKKKE 162
           + EE    ++E EE ++E K K+
Sbjct: 303 VNEELTTVRQENEELEEEYKIKK 325


>gnl|CDD|237867 PRK14953, PRK14953, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 486

 Score = 26.7 bits (59), Expect = 6.4
 Identities = 16/46 (34%), Positives = 23/46 (50%), Gaps = 5/46 (10%)

Query: 118 QALMPYLKNQSEDCLYLNI-----YTPLQEEEEEKKEEREEKKKEE 158
             +   ++   ED L L I     Y P  E EE KKE+ +EK++ E
Sbjct: 417 GKITIKVEKSEEDTLDLEIKSIKKYFPFIEFEEVKKEKEKEKERPE 462


>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2.  It is vital for
           effective cell-replication that replication is not
           stalled at any point by, for instance, damaged bases.
           Rtf2 stabilizes the replication fork stalled at the
           site-specific replication barrier RTS1 by preventing
           replication restart until completion of DNA synthesis by
           a converging replication fork initiated at a flanking
           origin. The RTS1 element terminates replication forks
           that are moving in the cen2-distal direction while
           allowing forks moving in the cen2-proximal direction to
           pass through the region. Rtf2 contains a C2HC2 motif
           related to the C3HC4 RING-finger motif, and would appear
           to fold up, creating a RING finger-like structure but
           forming only one functional Zn2+ ion-binding site.
          Length = 254

 Score = 26.5 bits (59), Expect = 6.4
 Identities = 11/28 (39%), Positives = 20/28 (71%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           EEE E  + R E+++ +KKK++ ++K K
Sbjct: 166 EEEVELLKARLEEERAKKKKKKKKKKTK 193


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 26.6 bits (59), Expect = 6.5
 Identities = 11/29 (37%), Positives = 19/29 (65%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
            EEE EK+ +REE+ K++K+ +    K+ 
Sbjct: 70  DEEEGEKELQREERLKKKKRVKTKAYKEP 98


>gnl|CDD|198151 smart01083, Cir_N, N-terminal domain of CBF1 interacting
           co-repressor CIR.  This is a 45 residue conserved region
           at the N-terminal end of a family of proteins referred
           to as CIRs (CBF1-interacting co-repressors). CBF1
           (centromere-binding factor 1) acts as a transcription
           factor that causes repression by binding specifically to
           GTGGGAA motifs in responsive promoters, and it requires
           CIR as a co-repressor. CIR binds to histone deacetylase
           and to SAP30 and serves as a linker between CBF1 and the
           histone deacetylase complex.
          Length = 37

 Score = 24.4 bits (54), Expect = 6.6
 Identities = 8/18 (44%), Positives = 13/18 (72%)

Query: 146 EKKEEREEKKKEEKKKEE 163
           ++  + E+K +EEKKK E
Sbjct: 11  KRVWKAEQKAEEEKKKIE 28


>gnl|CDD|235229 PRK04156, gltX, glutamyl-tRNA synthetase; Provisional.
          Length = 567

 Score = 26.7 bits (60), Expect = 6.8
 Identities = 9/27 (33%), Positives = 18/27 (66%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKK 168
           EE+ E+ EE   +  EE+++++ E+K 
Sbjct: 64  EEQRERLEELAPELLEEEEEKKEEKKG 90



 Score = 26.7 bits (60), Expect = 6.8
 Identities = 10/32 (31%), Positives = 16/32 (50%)

Query: 134 LNIYTPLQEEEEEKKEEREEKKKEEKKKEEGE 165
           L    P   EEEE+K+E ++        E+G+
Sbjct: 70  LEELAPELLEEEEEKKEEKKGLPPLPNAEKGK 101


>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.
           Prothymosin alpha and parathymosin are two ubiquitous
           small acidic nuclear proteins that are thought to be
           involved in cell cycle progression, proliferation, and
           cell differentiation.
          Length = 106

 Score = 25.7 bits (56), Expect = 6.9
 Identities = 12/25 (48%), Positives = 18/25 (72%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEE 166
           EEE ++ +E EE + EE++ EE EE
Sbjct: 54  EEEVDEDDEEEEGEGEEEEGEEEEE 78


>gnl|CDD|218258 pfam04774, HABP4_PAI-RBP1, Hyaluronan / mRNA binding family.  This
           family includes the HABP4 family of hyaluronan-binding
           proteins, and the PAI-1 mRNA-binding protein, PAI-RBP1.
           HABP4 has been observed to bind hyaluronan (a
           glucosaminoglycan), but it is not known whether this is
           its primary role in vivo. It has also been observed to
           bind RNA, but with a lower affinity than that for
           hyaluronan. PAI-1 mRNA-binding protein specifically
           binds the mRNA of type-1 plasminogen activator inhibitor
           (PAI-1), and is thought to be involved in regulation of
           mRNA stability. However, in both cases, the sequence
           motifs predicted to be important for ligand binding are
           not conserved throughout the family, so it is not known
           whether members of this family share a common function.
          Length = 106

 Score = 25.8 bits (57), Expect = 7.0
 Identities = 11/29 (37%), Positives = 17/29 (58%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           + E EEK+   EE  KE   +EE  E+++
Sbjct: 47  KNEVEEKQAVEEEANKEGVVEEEEVEEEE 75


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kind of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 26.9 bits (59), Expect = 7.1
 Identities = 9/45 (20%), Positives = 16/45 (35%)

Query: 125 KNQSEDCLYLNIYTPLQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
             +    L       L+ +E+ KK     + KE+ + EE      
Sbjct: 186 LAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEENLLYL 230


>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19. 
           Med19 represents a family of conserved proteins which
           are members of the multi-protein co-activator Mediator
           complex. Mediator is required for activation of RNA
           polymerase II transcription by DNA binding
           transactivators.
          Length = 178

 Score = 26.3 bits (58), Expect = 7.3
 Identities = 12/27 (44%), Positives = 22/27 (81%)

Query: 143 EEEEKKEEREEKKKEEKKKEEGEEKKK 169
           E++ KK++ E+ K+ +KKK+E ++KKK
Sbjct: 141 EKKHKKKKHEDDKERKKKKKEKKKKKK 167


>gnl|CDD|224308 COG1390, NtpE, Archaeal/vacuolar-type H+-ATPase subunit E [Energy
           production and conversion].
          Length = 194

 Score = 26.2 bits (58), Expect = 7.5
 Identities = 11/30 (36%), Positives = 19/30 (63%)

Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           +  E EE+ EE  E+ +EE +K + E K++
Sbjct: 11  ILREAEEEAEEILEEAREEAEKIKEEAKRE 40


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 26.7 bits (60), Expect = 7.5
 Identities = 18/30 (60%), Positives = 20/30 (66%), Gaps = 1/30 (3%)

Query: 140 LQEEEEEKKEEREEKKKE-EKKKEEGEEKK 168
           L+ E E+K EE E   KE EK KEE EEKK
Sbjct: 528 LERELEQKAEEAEALLKEAEKLKEELEEKK 557


>gnl|CDD|233042 TIGR00598, rad14, DNA repair protein.  All proteins in this family
           for which functions are known are used for the
           recognition of DNA damage as part of nucleotide excision
           repair. This family is based on the phylogenomic
           analysis of JA Eisen (1999, Ph.D. Thesis, Stanford
           University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 172

 Score = 26.3 bits (58), Expect = 7.6
 Identities = 11/21 (52%), Positives = 16/21 (76%)

Query: 141 QEEEEEKKEEREEKKKEEKKK 161
           +E  +E+KE REE K+E K+K
Sbjct: 93  EEALDEEKERREESKEEMKEK 113



 Score = 26.3 bits (58), Expect = 8.1
 Identities = 13/21 (61%), Positives = 15/21 (71%)

Query: 142 EEEEEKKEEREEKKKEEKKKE 162
           E  EE KEE +EKK E+K KE
Sbjct: 101 ERREESKEEMKEKKFEKKLKE 121


>gnl|CDD|212055 cd11486, SLC5sbd_SGLT1, Na(+)/glucose cotransporter SGLT1;solute
           binding domain.  Human SGLT1 (hSGLT1) is a
           high-affinity/low-capacity glucose transporter, which
           can also transport galactose. In the transport
           mechanism, two Na+ ions first bind to the extracellular
           side of the transporter and induce a conformational
           change in the glucose binding site. This results in an
           increased affinity for glucose. A second conformational
           change in the transporter follows, bringing the Na+ and
           glucose binding sites to the inner surface of the
           membrane. Glucose is then released, followed by the Na+
           ions. In the process, hSGLT1 is also able to transport
           water and urea and may be a major pathway for transport
           of these across the intestinal brush-border membrane.
           hSGLT1 is encoded by the SLC5A1 gene and expressed
           mostly in the intestine, but also in the trachea,
           kidney, heart, brain, testis, and prostate. The
           WHO/UNICEF oral rehydration solution (ORS) for the
           treatment of secretory diarrhea contains salt and
           glucose. The glucose, along with sodium ions, is
           transported by hSGLT1 and water is either co-transported
           along with these or follows by osmosis. Mutations in
           SGLT1 are associated with intestinal glucose galactose
           malabsorption (GGM). Up-regulation of intestinal SGLT1
           may protect against enteric infections. SGLT1 is
           expressed in colorectal, head and neck, and prostate
           tumors. Epidermal growth factor receptor (EGFR)
           functions in cell survival by stabilizing SGLT1, and
           thereby maintaining intracellular glucose levels. SGLT1
           is predicted to have 14 membrane-spanning regions. This
           subgroup belongs to the solute carrier 5
           (SLC5)transporter family.
          Length = 635

 Score = 26.8 bits (59), Expect = 7.7
 Identities = 7/29 (24%), Positives = 17/29 (58%)

Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKK 168
           L  ++  + E+  E + +E++K+ G  +K
Sbjct: 547 LDADDWTEDEDENEMETDEERKKPGCCRK 575


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 26.7 bits (60), Expect = 7.8
 Identities = 13/31 (41%), Positives = 21/31 (67%), Gaps = 2/31 (6%)

Query: 141 QEEEEEKKEEREEKKKE--EKKKEEGEEKKK 169
            E+   +KEE  ++K E  EK++EE E+K+K
Sbjct: 87  LEKRLLQKEENLDRKLELLEKREEELEKKEK 117


>gnl|CDD|205480 pfam13300, DUF4078, Domain of unknown function (DUF4078).  This
           family is found from fungi to humans, but its exact
           function is not known.
          Length = 88

 Score = 25.3 bits (56), Expect = 8.0
 Identities = 13/29 (44%), Positives = 23/29 (79%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           +E+ EE ++ REE ++E K++EE +EK+K
Sbjct: 46  KEQMEELEKAREETERERKEREERKEKRK 74



 Score = 24.9 bits (55), Expect = 9.9
 Identities = 9/21 (42%), Positives = 16/21 (76%)

Query: 141 QEEEEEKKEEREEKKKEEKKK 161
           ++E EE+KE+R+   +E +KK
Sbjct: 63  RKEREERKEKRKRAIEERRKK 83


>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
          Length = 211

 Score = 26.3 bits (58), Expect = 8.3
 Identities = 11/27 (40%), Positives = 16/27 (59%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKK 168
           E+ EE  E + E+ K+E  K + E KK
Sbjct: 46  EDSEESNEVKIEELKDENNKLKEENKK 72


>gnl|CDD|216833 pfam01991, vATP-synt_E, ATP synthase (E/31 kDa) subunit.  This
           family includes the vacuolar ATP synthase E subunit, as
           well as the archaebacterial ATP synthase E subunit.
          Length = 195

 Score = 26.1 bits (58), Expect = 8.4
 Identities = 12/30 (40%), Positives = 17/30 (56%)

Query: 140 LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
             E EEE + E+ E  +E +KK E   +KK
Sbjct: 14  RAEAEEEFEIEKAEAVEEAEKKIEEIYEKK 43


>gnl|CDD|151173 pfam10669, Phage_Gp23, Protein gp23 (Bacteriophage A118).  This is
           the highly conserved family of the major tail subunit
           protein.
          Length = 121

 Score = 25.8 bits (56), Expect = 8.9
 Identities = 8/24 (33%), Positives = 18/24 (75%)

Query: 140 LQEEEEEKKEEREEKKKEEKKKEE 163
           ++EE ++ + ERE++ KE K++ +
Sbjct: 48  MKEERDKMETEREKRDKESKEERD 71


>gnl|CDD|219882 pfam08524, rRNA_processing, rRNA processing.  This is a family of
           proteins that are involved in rRNA processing. In a
           localisation study they were found to localise to the
           nucleus and nucleolus. The family also includes other
           metazoa members from plants to mammals where the protein
           has been named BR22 and is associated with TTF-1,
           thyroid transcription factor 1. In the lungs, the family
           binds TTF-1 to form a complex which influences the
           expression of the key lung surfactant protein-B (SP-B)
           and -C (SP-C), the small hydrophobic surfactant proteins
           that maintain surface tension in alveoli.
          Length = 150

 Score = 26.0 bits (57), Expect = 9.3
 Identities = 9/30 (30%), Positives = 19/30 (63%)

Query: 139 PLQEEEEEKKEEREEKKKEEKKKEEGEEKK 168
            L E++E  K+ + E++++E  K + E +K
Sbjct: 73  KLDEKKEIAKQRKREQREKELAKRQKELEK 102


>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
           [Energy production and conversion].
          Length = 470

 Score = 26.3 bits (58), Expect = 9.3
 Identities = 12/26 (46%), Positives = 16/26 (61%)

Query: 141 QEEEEEKKEEREEKKKEEKKKEEGEE 166
           +EEEEE++EE EE  + E   EE   
Sbjct: 403 EEEEEEEEEEEEEAAEAEAPMEEPVP 428


>gnl|CDD|189762 pfam00901, Orbi_VP5, Orbivirus outer capsid protein VP5.
           cryoelectron microscopy indicates that VP5 is a trimer
           implying that there are 360 copies of VP5 per virion.
          Length = 507

 Score = 26.2 bits (58), Expect = 9.5
 Identities = 11/32 (34%), Positives = 20/32 (62%), Gaps = 1/32 (3%)

Query: 136 IYTPLQE-EEEEKKEEREEKKKEEKKKEEGEE 166
           +   L+E E E+K+EE  EK  ++  ++ GE+
Sbjct: 89  LQRKLKELEREQKEEEVREKHNKKIIEKFGED 120


>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
           eukaryotic snRNP [Transcription].
          Length = 564

 Score = 26.2 bits (58), Expect = 9.5
 Identities = 9/28 (32%), Positives = 16/28 (57%)

Query: 142 EEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           EE  E+  E    K ++KK+++ E  +K
Sbjct: 413 EEIREELIEEGLLKSKKKKRKKKEWFEK 440


>gnl|CDD|212464 cd01717, Sm_B, Sm protein B.  The eukaryotic Sm proteins (B/B', D1,
           D2, D3, E, F and G) assemble into a hetero-heptameric
           ring around the Sm site of the 2,2,7-trimethyl guanosine
           (m3G) capped U1, U2, U4 and U5 snRNAs (Sm snRNAs)
           forming the core of the snRNP particle. The snRNP
           particle, in turn, assembles with other components onto
           the pre-mRNA to form the spliceosome which is
           responsible for the excision of introns and the ligation
           of exons. Members of this family share a highly
           conserved Sm fold, containing an N-terminal helix
           followed by a strongly bent five-stranded antiparallel
           beta-sheet.
          Length = 80

 Score = 24.8 bits (55), Expect = 9.7
 Identities = 11/19 (57%), Positives = 14/19 (73%)

Query: 149 EEREEKKKEEKKKEEGEEK 167
           E R+ K K++KK EE EEK
Sbjct: 43  EFRKIKPKKKKKGEEREEK 61


>gnl|CDD|183534 PRK12451, PRK12451, arginyl-tRNA synthetase; Reviewed.
          Length = 562

 Score = 26.3 bits (58), Expect = 9.7
 Identities = 12/35 (34%), Positives = 19/35 (54%)

Query: 133 YLNIYTPLQEEEEEKKEEREEKKKEEKKKEEGEEK 167
              +Y    EE ++ +E  EE +   KK EEG+E+
Sbjct: 190 LFKLYVQFHEEVKDDEELEEEGRAWFKKLEEGDEE 224


>gnl|CDD|227708 COG5421, COG5421, Transposase [DNA replication, recombination, and
           repair].
          Length = 480

 Score = 26.2 bits (58), Expect = 9.7
 Identities = 11/39 (28%), Positives = 23/39 (58%), Gaps = 1/39 (2%)

Query: 132 LYLNIYTP-LQEEEEEKKEEREEKKKEEKKKEEGEEKKK 169
           L++ + +  LQ++EE+    R EK+ E+ +K   + K +
Sbjct: 269 LWVFLLSYKLQKKEEQTLRTRIEKELEKAEKSLEKLKGR 307


>gnl|CDD|149159 pfam07925, RdRP_5, Reovirus RNA-dependent RNA polymerase lambda 3. 
           The sequences in this family are similar to the reoviral
           minor core protein lambda 3, which functions as a
           RNA-dependent RNA polymerase within the protein capsid.
           It is organised into 3 domains. N- and C-terminal
           domains create a 'cage' that encloses a conserved
           central catalytic domain within a hollow centre; this
           catalytic domain is arranged to form 'fingers', 'palm'
           and 'thumb' subdomains. Unlike other RNA polymerases,
           like HIV reverse transcriptase and T7 RNA polymerase,
           lambda 3 protein binds template and substrate with only
           localised rearrangements, and catalytic activity can
           occur with little structural change. However, the
           structure of the catalytic complex is similar to that of
           other polymerase catalytic complexes with known
           structure.
          Length = 1271

 Score = 26.4 bits (58), Expect = 9.7
 Identities = 8/21 (38%), Positives = 13/21 (61%), Gaps = 1/21 (4%)

Query: 64  PPVGQLRFMAPQSPQPWPGLM 84
           PPVG+  +  P + + WP L+
Sbjct: 802 PPVGK-EYANPSTEEIWPSLL 821


>gnl|CDD|227577 COG5252, COG5252, Uncharacterized conserved protein, contains
           CCCH-type Zn-finger protein [General function prediction
           only].
          Length = 299

 Score = 26.2 bits (57), Expect = 9.8
 Identities = 11/45 (24%), Positives = 25/45 (55%)

Query: 124 LKNQSEDCLYLNIYTPLQEEEEEKKEEREEKKKEEKKKEEGEEKK 168
           LKN++       I   ++    +++ E++EK + E+K+ E E++ 
Sbjct: 32  LKNKNRSTKVQAIIKQIETLNLKEQLEKKEKMRMEEKRREPEKQV 76


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 26.4 bits (58), Expect = 9.8
 Identities = 13/30 (43%), Positives = 21/30 (70%)

Query: 138 TPLQEEEEEKKEEREEKKKEEKKKEEGEEK 167
              QE EEE+  E EE+++EE+++EE E +
Sbjct: 434 MASQESEEEESVEEEEEEEEEEEEEEQESE 463


>gnl|CDD|220376 pfam09745, DUF2040, Coiled-coil domain-containing protein 55
           (DUF2040).  This entry is a conserved domain of
           approximately 130 residues of proteins conserved from
           fungi to humans. The proteins do contain a coiled-coil
           domain, but the function is unknown.
          Length = 128

 Score = 25.4 bits (56), Expect = 10.0
 Identities = 12/26 (46%), Positives = 17/26 (65%)

Query: 144 EEEKKEEREEKKKEEKKKEEGEEKKK 169
           EE +K E EEK++EE ++E    K K
Sbjct: 100 EENRKLEEEEKEREELEEENDVTKGK 125


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.314    0.134    0.388 

Gapped
Lambda     K      H
   0.267   0.0812    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 8,723,369
Number of extensions: 825144
Number of successful extensions: 6520
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5111
Number of HSP's successfully gapped: 929
Length of query: 169
Length of database: 10,937,602
Length adjustment: 90
Effective length of query: 79
Effective length of database: 6,945,742
Effective search space: 548713618
Effective search space used: 548713618
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 55 (24.8 bits)