RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy16677
         (227 letters)



>gnl|CDD|215646 pfam00001, 7tm_1, 7 transmembrane receptor (rhodopsin family).
           This family contains, amongst other G-protein-coupled
           receptors (GCPRs), members of the opsin family, which
           have been considered to be typical members of the
           rhodopsin superfamily. They share several motifs, mainly
           the seven transmembrane helices, GCPRs of the rhodopsin
           superfamily. All opsins bind a chromophore, such as
           11-cis-retinal. The function of most opsins other than
           the photoisomerases is split into two steps: light
           absorption and G-protein activation. Photoisomerases, on
           the other hand, are not coupled to G-proteins - they are
           thought to generate and supply the chromophore that is
           used by visual opsins.
          Length = 251

 Score =  127 bits (322), Expect = 3e-36
 Identities = 47/137 (34%), Positives = 68/137 (49%), Gaps = 5/137 (3%)

Query: 56  VSVMRHRKLRIITNYFVVSLAFADMLVAMVAMTFNASVQVTGKWNFSFFMCDVWNSLDVY 115
           + ++R +KLR  TN F+++LA AD+L  +    +     V G W F   +C +   L V 
Sbjct: 1   LVILRTKKLRTPTNIFLLNLAVADLLFLLTLPPWALYYLVGGDWPFGDALCKLVGFLFVV 60

Query: 116 FSTASILHLCCISVDRYYAIVKPLKYPINMTKRVVAFMLLNVWVSPAVISFVPIMCGWYT 175
              ASIL L  IS+DRY AIV PL+Y    T R    ++L VWV   ++S  P++  W  
Sbjct: 61  NGYASILLLTAISIDRYLAIVHPLRYRRIRTPRRAKVLILVVWVLALLLSLPPLLFSWLR 120

Query: 176 TSEHLKYRYQHPNICDF 192
           T E       +   C  
Sbjct: 121 TVE-----EGNVTTCLI 132


>gnl|CDD|222976 PHA03087, PHA03087, G protein-coupled chemokine receptor-like
           protein; Provisional.
          Length = 335

 Score = 60.6 bits (147), Expect = 6e-11
 Identities = 43/185 (23%), Positives = 79/185 (42%), Gaps = 9/185 (4%)

Query: 1   MTDSSNLFDASMPGYNVTDSTSPDENVSSIIWMTLRASIMIFIIVASIGGNLLVIVSVMR 60
            T            Y   D     +  S+I+   +     IF     + GN++VI  V+ 
Sbjct: 13  NTTDYYYDTYYDEDYADCDLNIGYDTNSTIL---IVVYSTIFFF--GLVGNIIVIY-VLT 66

Query: 61  HRKLRIITNYFVVSLAFADMLVAMVAMTFNASVQVTGKWNFSFFMCDVWNSLDVYFSTAS 120
             K++   + ++++LA +D+L  M  + F     +  +W+F  F C + + L       S
Sbjct: 67  KTKIKTPMDIYLLNLAVSDLLFVMT-LPFQIYYYILFQWSFGEFACKIVSGLYYIGFYNS 125

Query: 121 ILHLCCISVDRYYAIVKPLKYPINMTKRVVAFMLLNVWVSPAVISFVPIMCGWYTTSEH- 179
           +  +  +SVDRY AIV P+K     T +    + L +W+  ++I   PI+  + T  +H 
Sbjct: 126 MNFITVMSVDRYIAIVHPVKSNKINTVKYGYIVSLVIWII-SIIETTPILFVYTTKKDHE 184

Query: 180 LKYRY 184
                
Sbjct: 185 TLICC 189


>gnl|CDD|165021 PHA02638, PHA02638, CC chemokine receptor-like protein;
           Provisional.
          Length = 417

 Score = 43.8 bits (103), Expect = 3e-05
 Identities = 38/139 (27%), Positives = 69/139 (49%), Gaps = 16/139 (11%)

Query: 39  IMIFIIVASIGGNL--LVIVSVMRHRKLRIITNYFVVSLAFADML--VAMVAMTFNASVQ 94
           I IF I+  I G      I+ ++  +K++ IT+ ++ +LA +D++  +    + +N   Q
Sbjct: 101 IKIFYIIIFILGLFGNAAIIMILFCKKIKTITDIYIFNLAISDLIFVIDFPFIIYNEFDQ 160

Query: 95  VTGKWNFSFFMCDVWNSLDVYFSTASILHLCCI-SVDRYYAIVKPLKYP------INMTK 147
               W F  FMC V  S   Y    S + L  + S+DRY+AI+ P+ +       I +  
Sbjct: 161 ----WIFGDFMCKVI-SASYYIGFFSNMFLITLMSIDRYFAILYPISFQKYRTFNIGIIL 215

Query: 148 RVVAFMLLNVWVSPAVISF 166
            +++++L  +  SPA   F
Sbjct: 216 CIISWILSLIITSPAYFIF 234


>gnl|CDD|220695 pfam10328, 7TM_GPCR_Srx, Serpentine type 7TM GPCR chemoreceptor
           Srx.  Chemoreception is mediated in Caenorhabditis
           elegans by members of the seven-transmembrane
           G-protein-coupled receptor class (7TM GPCRs) of proteins
           which are of the serpentine type. Srx is part of the Srg
           superfamily of chemoreceptors. Chemoperception is one of
           the central senses of soil nematodes like C. elegans
           which are otherwise 'blind' and 'deaf'.
          Length = 275

 Score = 41.4 bits (98), Expect = 1e-04
 Identities = 24/156 (15%), Positives = 58/156 (37%), Gaps = 7/156 (4%)

Query: 44  IVASIGG---NLLVIVSVMRHRKLRIITNYFVVSLAFADMLVAMVAMTFNASVQVTGKWN 100
            + S+ G   N L+    ++   L+        + A ++ ++  + + +   + +     
Sbjct: 1   FLISLFGIILNWLIFYIFLKLPSLKNSFGILCANKAISNAIICTIFLFYVVPMTLLDLSF 60

Query: 101 FSFFMCDVWNSLDVYFS-TASILHLCCISVDRYYAIVKPLKYPINMTKRVVAFMLLNVWV 159
               +      L +      S L    IS++R+ A+  P KY    + +    ++  +W+
Sbjct: 61  LPELLNSHIGGLILLGLYEISPLTHLLISLNRFCAVFFPFKYKKIFSIKNTKIIITFIWI 120

Query: 160 SPAVISFVPIM---CGWYTTSEHLKYRYQHPNICDF 192
              +IS +      C +Y + E L + +     C  
Sbjct: 121 IAIIISTLFYFPLGCHFYYSPESLTWSFDEDPPCAE 156


>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
           complex aNOP56 subunit; Provisional.
          Length = 414

 Score = 39.2 bits (92), Expect = 0.001
 Identities = 20/46 (43%), Positives = 27/46 (58%), Gaps = 5/46 (10%)

Query: 178 EHLKYRYQHPNICDFKKKKEEEKKEAKKKTKKKEEEKKKKTRMRGG 223
           E +K +Y  P      KKK EEKK  K+K KKK ++K KK + +G 
Sbjct: 373 EEIKEKYPKP-----PKKKREEKKPQKRKKKKKRKKKGKKRKKKGR 413



 Score = 29.6 bits (67), Expect = 1.3
 Identities = 13/33 (39%), Positives = 19/33 (57%)

Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKKTRMRGG 223
           +  K+ EE K++  K  KKK EEKK + R +  
Sbjct: 367 ELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKK 399


>gnl|CDD|219978 pfam08701, GN3L_Grn1, GNL3L/Grn1 putative GTPase.  Grn1 (yeast) and
           GNL3L (human) are putative GTPases which are required
           for growth and play a role in processing of nucleolar
           pre-rRNA. This family contains a potential nuclear
           localisation signal.
          Length = 80

 Score = 35.3 bits (82), Expect = 0.003
 Identities = 14/41 (34%), Positives = 22/41 (53%), Gaps = 10/41 (24%)

Query: 187 PNICDFK----------KKKEEEKKEAKKKTKKKEEEKKKK 217
           PN   FK          K+K+EE+KE +K+ +K E  + +K
Sbjct: 38  PNSFPFKEEILEEIEEKKRKQEEEKERRKEARKAERAEARK 78


>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19. 
           Med19 represents a family of conserved proteins which
           are members of the multi-protein co-activator Mediator
           complex. Mediator is required for activation of RNA
           polymerase II transcription by DNA binding
           transactivators.
          Length = 178

 Score = 36.7 bits (85), Expect = 0.004
 Identities = 13/29 (44%), Positives = 22/29 (75%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
           KK K+++ ++ K++ KKK+E+KKKK R  
Sbjct: 142 KKHKKKKHEDDKERKKKKKEKKKKKKRHS 170



 Score = 36.0 bits (83), Expect = 0.007
 Identities = 18/49 (36%), Positives = 27/49 (55%), Gaps = 10/49 (20%)

Query: 179 HLKYRYQHP----------NICDFKKKKEEEKKEAKKKTKKKEEEKKKK 217
           H K+R Q P           +   +KK +++K E  K+ KKK++EKKKK
Sbjct: 117 HKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKK 165


>gnl|CDD|192535 pfam10320, 7TM_GPCR_Srsx, Serpentine type 7TM GPCR chemoreceptor
           Srsx.  Chemoreception is mediated in Caenorhabditis
           elegans by members of the seven-transmembrane
           G-protein-coupled receptor class (7TM GPCRs) of proteins
           which are of the serpentine type. Srsx is a solo family
           amongst the superfamilies of chemoreceptors.
           Chemoperception is one of the central senses of soil
           nematodes like C. elegans which are otherwise 'blind'
           and 'deaf'.
          Length = 257

 Score = 36.0 bits (84), Expect = 0.009
 Identities = 24/103 (23%), Positives = 42/103 (40%), Gaps = 12/103 (11%)

Query: 44  IVASIGGNLLVIVSVMRHRKLRIITNYFVVSLAFADML-----VAMVAMTFNASVQVTGK 98
            V  I GN+++I+   + +KLR   +Y +     AD+L     +  V + F         
Sbjct: 1   SVIGIFGNVIMIILTFKKKKLRSKCSYLICVQCLADLLCLSGEIIFVVLLFT---GTQLT 57

Query: 99  WNFSFFMCDVWNSLDVYFSTASILHLCCISVDRYYAIVKPLKY 141
            N  F +        ++  TA    +  I +DR  A+  P+ Y
Sbjct: 58  RNECFLI----IIPYIFGQTAQSPLMLMIGIDRLIAVKFPIFY 96


>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355).  This
           family of proteins is found in bacteria and viruses.
           Proteins in this family are typically between 180 and
           214 amino acids in length.
          Length = 125

 Score = 34.5 bits (80), Expect = 0.010
 Identities = 12/25 (48%), Positives = 17/25 (68%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
           +KK+EE+K EA+K  K   EEK + 
Sbjct: 25  EKKQEEKKSEAEKLAKMSAEEKAEY 49



 Score = 30.3 bits (69), Expect = 0.34
 Identities = 9/30 (30%), Positives = 18/30 (60%)

Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKKTRM 220
           +  K   +EK + +KK ++K+ E +K  +M
Sbjct: 12  EVDKAIAKEKAKWEKKQEEKKSEAEKLAKM 41



 Score = 29.5 bits (67), Expect = 0.56
 Identities = 9/31 (29%), Positives = 14/31 (45%)

Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
              K   EEK E + +  +KE E+ +    R
Sbjct: 37  KLAKMSAEEKAEYELEKLEKELEELEAELAR 67



 Score = 28.8 bits (65), Expect = 1.0
 Identities = 8/27 (29%), Positives = 16/27 (59%)

Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKK 217
           + K   ++E  +A  K K K E+K+++
Sbjct: 4   EEKTFTDKEVDKAIAKEKAKWEKKQEE 30



 Score = 26.5 bits (59), Expect = 6.7
 Identities = 10/28 (35%), Positives = 17/28 (60%), Gaps = 3/28 (10%)

Query: 193 KKKKEEEKKEAK---KKTKKKEEEKKKK 217
           ++KK E +K AK   ++  + E EK +K
Sbjct: 29  EEKKSEAEKLAKMSAEEKAEYELEKLEK 56



 Score = 26.5 bits (59), Expect = 7.8
 Identities = 6/28 (21%), Positives = 16/28 (57%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRM 220
           ++K E E ++ +K+ ++ E E  ++   
Sbjct: 44  EEKAEYELEKLEKELEELEAELARRELK 71


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 35.9 bits (82), Expect = 0.014
 Identities = 18/51 (35%), Positives = 25/51 (49%)

Query: 177  SEHLKYRYQHPNICDFKKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
            +E  K + +     D  KKK EE K+A +  KK EE KKK    +   E +
Sbjct: 1459 AEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAK 1509



 Score = 35.5 bits (81), Expect = 0.019
 Identities = 15/51 (29%), Positives = 23/51 (45%)

Query: 177  SEHLKYRYQHPNICDFKKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
            ++  K + +     D  KKK EE K+   + KK  E KKK    +   E +
Sbjct: 1472 ADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAK 1522



 Score = 35.5 bits (81), Expect = 0.024
 Identities = 12/34 (35%), Positives = 20/34 (58%)

Query: 194  KKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
            KK EE++K+A +  KK+ EE KK   ++     +
Sbjct: 1681 KKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEE 1714



 Score = 34.0 bits (77), Expect = 0.061
 Identities = 15/40 (37%), Positives = 19/40 (47%)

Query: 178  EHLKYRYQHPNICDFKKKKEEEKKEAKKKTKKKEEEKKKK 217
            +  K + +     D  KKK EE K+     KKK EE KK 
Sbjct: 1305 DEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKA 1344



 Score = 34.0 bits (77), Expect = 0.061
 Identities = 13/28 (46%), Positives = 20/28 (71%)

Query: 194  KKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
            KK+ EE K+A++  KK+ EEKKK   ++
Sbjct: 1695 KKEAEEAKKAEELKKKEAEEKKKAEELK 1722



 Score = 33.2 bits (75), Expect = 0.11
 Identities = 13/40 (32%), Positives = 19/40 (47%)

Query: 177  SEHLKYRYQHPNICDFKKKKEEEKKEAKKKTKKKEEEKKK 216
            ++  K + +     D  KKK EE K+   + KK    KKK
Sbjct: 1380 ADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKK 1419



 Score = 33.2 bits (75), Expect = 0.11
 Identities = 15/28 (53%), Positives = 20/28 (71%)

Query: 194  KKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
            KKK EEKK+A +  KK EE+KKK   ++
Sbjct: 1384 KKKAEEKKKADEAKKKAEEDKKKADELK 1411



 Score = 33.2 bits (75), Expect = 0.13
 Identities = 12/29 (41%), Positives = 20/29 (68%)

Query: 193  KKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
             KK EE+KK+A++  K +E+EKK    ++
Sbjct: 1667 AKKAEEDKKKAEEAKKAEEDEKKAAEALK 1695



 Score = 32.8 bits (74), Expect = 0.18
 Identities = 14/28 (50%), Positives = 20/28 (71%)

Query: 194  KKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
            KK EEEKK+ ++  KK+ EEKKK   ++
Sbjct: 1626 KKAEEEKKKVEQLKKKEAEEKKKAEELK 1653



 Score = 32.4 bits (73), Expect = 0.18
 Identities = 15/39 (38%), Positives = 22/39 (56%)

Query: 177  SEHLKYRYQHPNICDFKKKKEEEKKEAKKKTKKKEEEKK 215
            ++  K + +     D  KKK EE K+A++  KK EE KK
Sbjct: 1433 ADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKK 1471



 Score = 32.4 bits (73), Expect = 0.24
 Identities = 14/23 (60%), Positives = 17/23 (73%)

Query: 193  KKKKEEEKKEAKKKTKKKEEEKK 215
            +KKKEE KK+A    KK EE+KK
Sbjct: 1370 EKKKEEAKKKADAAKKKAEEKKK 1392



 Score = 32.0 bits (72), Expect = 0.29
 Identities = 15/39 (38%), Positives = 21/39 (53%)

Query: 177  SEHLKYRYQHPNICDFKKKKEEEKKEAKKKTKKKEEEKK 215
            ++  K + +     D  KKK EE K+A +  KK EE KK
Sbjct: 1420 ADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKK 1458



 Score = 31.6 bits (71), Expect = 0.33
 Identities = 9/26 (34%), Positives = 13/26 (50%)

Query: 194  KKKEEEKKEAKKKTKKKEEEKKKKTR 219
            KKK EE K+A +  K + E    +  
Sbjct: 1335 KKKAEEAKKAAEAAKAEAEAAADEAE 1360



 Score = 31.3 bits (70), Expect = 0.45
 Identities = 14/34 (41%), Positives = 22/34 (64%)

Query: 194  KKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
            KK+ EE K+  ++ KK EEEKKK   ++   E++
Sbjct: 1736 KKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKK 1769



 Score = 31.3 bits (70), Expect = 0.47
 Identities = 21/46 (45%), Positives = 24/46 (52%), Gaps = 5/46 (10%)

Query: 177  SEHLKYRYQHPNICDFKKKKEEEKK---EAKKKT--KKKEEEKKKK 217
            ++ LK         D  KKK EEKK   EAKKK    KK +E KKK
Sbjct: 1407 ADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKK 1452



 Score = 31.3 bits (70), Expect = 0.54
 Identities = 13/24 (54%), Positives = 14/24 (58%)

Query: 194  KKKEEEKKEAKKKTKKKEEEKKKK 217
            KK EEE K    +  KK EE KKK
Sbjct: 1653 KKAEEENKIKAAEEAKKAEEDKKK 1676



 Score = 30.9 bits (69), Expect = 0.61
 Identities = 18/30 (60%), Positives = 20/30 (66%), Gaps = 5/30 (16%)

Query: 193  KKKKEEEKK--EAKKKT---KKKEEEKKKK 217
            KKK EE+KK  EAKKK    KKK +E KK 
Sbjct: 1384 KKKAEEKKKADEAKKKAEEDKKKADELKKA 1413



 Score = 30.9 bits (69), Expect = 0.66
 Identities = 13/24 (54%), Positives = 16/24 (66%)

Query: 194  KKKEEEKKEAKKKTKKKEEEKKKK 217
            KK EEE K   ++ KK+ EE KKK
Sbjct: 1722 KKAEEENKIKAEEAKKEAEEDKKK 1745



 Score = 30.9 bits (69), Expect = 0.67
 Identities = 8/25 (32%), Positives = 12/25 (48%)

Query: 193  KKKKEEEKKEAKKKTKKKEEEKKKK 217
            KK  E  K EA+    + E  ++K 
Sbjct: 1342 KKAAEAAKAEAEAAADEAEAAEEKA 1366



 Score = 30.9 bits (69), Expect = 0.67
 Identities = 9/26 (34%), Positives = 13/26 (50%)

Query: 193  KKKKEEEKKEAKKKTKKKEEEKKKKT 218
            K + E    EA+   +K E  +KKK 
Sbjct: 1349 KAEAEAAADEAEAAEEKAEAAEKKKE 1374



 Score = 30.9 bits (69), Expect = 0.74
 Identities = 6/26 (23%), Positives = 12/26 (46%)

Query: 193  KKKKEEEKKEAKKKTKKKEEEKKKKT 218
             K + E   +  +  ++K E  +KK 
Sbjct: 1348 AKAEAEAAADEAEAAEEKAEAAEKKK 1373



 Score = 30.5 bits (68), Expect = 0.84
 Identities = 12/26 (46%), Positives = 13/26 (50%), Gaps = 2/26 (7%)

Query: 193  KKKKEEEKKEAKKKT--KKKEEEKKK 216
                E E  E K +   KKKEE KKK
Sbjct: 1354 AAADEAEAAEEKAEAAEKKKEEAKKK 1379



 Score = 30.5 bits (68), Expect = 0.87
 Identities = 18/46 (39%), Positives = 24/46 (52%), Gaps = 5/46 (10%)

Query: 177  SEHLKYRYQHPNICDFKKKKEEEKK---EAKKKT--KKKEEEKKKK 217
            ++  K + +     +  KKK EE K   EAKKK    KK +E KKK
Sbjct: 1446 ADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKK 1491



 Score = 30.1 bits (67), Expect = 1.1
 Identities = 17/31 (54%), Positives = 20/31 (64%), Gaps = 4/31 (12%)

Query: 191  DFKKKKEEEKK--EAKKKT--KKKEEEKKKK 217
              KKK+E +KK   AKKK   KKK +E KKK
Sbjct: 1369 AEKKKEEAKKKADAAKKKAEEKKKADEAKKK 1399



 Score = 30.1 bits (67), Expect = 1.1
 Identities = 15/35 (42%), Positives = 17/35 (48%), Gaps = 6/35 (17%)

Query: 191  DFKKKKEEEKK---EAKKKT---KKKEEEKKKKTR 219
            + KKK EE KK    AKKK    KK  E  K +  
Sbjct: 1319 EAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAE 1353



 Score = 30.1 bits (67), Expect = 1.2
 Identities = 15/30 (50%), Positives = 19/30 (63%), Gaps = 5/30 (16%)

Query: 193  KKKKEEEKKEAKKK-----TKKKEEEKKKK 217
            KKK +  KK+A++K      KKK EE KKK
Sbjct: 1377 KKKADAAKKKAEEKKKADEAKKKAEEDKKK 1406



 Score = 30.1 bits (67), Expect = 1.2
 Identities = 12/32 (37%), Positives = 14/32 (43%), Gaps = 5/32 (15%)

Query: 191  DFKKKKEEEKK---EAKKK--TKKKEEEKKKK 217
            + KKK +  KK   EAKK     K E E    
Sbjct: 1326 EAKKKADAAKKKAEEAKKAAEAAKAEAEAAAD 1357



 Score = 30.1 bits (67), Expect = 1.3
 Identities = 13/33 (39%), Positives = 19/33 (57%)

Query: 195  KKEEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
            KK EE K   ++ KK EEEKKK  +++     +
Sbjct: 1613 KKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEE 1645



 Score = 29.7 bits (66), Expect = 1.4
 Identities = 9/25 (36%), Positives = 14/25 (56%)

Query: 193  KKKKEEEKKEAKKKTKKKEEEKKKK 217
               + E  +E  +  +KK+EE KKK
Sbjct: 1355 AADEAEAAEEKAEAAEKKKEEAKKK 1379



 Score = 29.7 bits (66), Expect = 1.4
 Identities = 16/38 (42%), Positives = 18/38 (47%), Gaps = 6/38 (15%)

Query: 194  KKKEEEKK---EAKKKT---KKKEEEKKKKTRMRGGIE 225
            KK    KK   EAKKK    KK +E KKK    +   E
Sbjct: 1411 KKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADE 1448



 Score = 29.7 bits (66), Expect = 1.4
 Identities = 18/36 (50%), Positives = 22/36 (61%), Gaps = 5/36 (13%)

Query: 191  DFKKKKEEEKK--EAKKKT---KKKEEEKKKKTRMR 221
            + KKK EE+KK  EAKKK    KK +E KKK    +
Sbjct: 1422 EAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAK 1457



 Score = 29.3 bits (65), Expect = 1.8
 Identities = 13/31 (41%), Positives = 20/31 (64%)

Query: 191  DFKKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
            + KK +EE K +A ++ KK EE+KKK    +
Sbjct: 1651 ELKKAEEENKIKAAEEAKKAEEDKKKAEEAK 1681



 Score = 29.3 bits (65), Expect = 1.9
 Identities = 13/34 (38%), Positives = 17/34 (50%), Gaps = 2/34 (5%)

Query: 194  KKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
            KK  E KK+A +   KK EE KK    +   E +
Sbjct: 1503 KKAAEAKKKADE--AKKAEEAKKADEAKKAEEAK 1534



 Score = 29.3 bits (65), Expect = 2.0
 Identities = 15/30 (50%), Positives = 19/30 (63%), Gaps = 5/30 (16%)

Query: 193  KKKKEEEKKEAKKK-----TKKKEEEKKKK 217
            KKK +E KK+A++       KKK EE KKK
Sbjct: 1301 KKKADEAKKKAEEAKKADEAKKKAEEAKKK 1330



 Score = 29.3 bits (65), Expect = 2.0
 Identities = 13/32 (40%), Positives = 17/32 (53%)

Query: 194  KKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIE 225
            KKK +E K+A    KK +E KKK    +   E
Sbjct: 1404 KKKADELKKAAAAKKKADEAKKKAEEKKKADE 1435



 Score = 29.3 bits (65), Expect = 2.2
 Identities = 12/31 (38%), Positives = 20/31 (64%)

Query: 191  DFKKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
            + KK +EE K +A++  K+ EE+KKK    +
Sbjct: 1720 ELKKAEEENKIKAEEAKKEAEEDKKKAEEAK 1750



 Score = 29.3 bits (65), Expect = 2.2
 Identities = 12/24 (50%), Positives = 16/24 (66%)

Query: 194  KKKEEEKKEAKKKTKKKEEEKKKK 217
            KKK EE K+A++  KK  E  KK+
Sbjct: 1674 KKKAEEAKKAEEDEKKAAEALKKE 1697



 Score = 29.0 bits (64), Expect = 2.4
 Identities = 18/40 (45%), Positives = 21/40 (52%), Gaps = 5/40 (12%)

Query: 191  DFKKKKEEEKK---EAKKKT--KKKEEEKKKKTRMRGGIE 225
            + KKK EE KK   EAKK    KKK +E KK    +   E
Sbjct: 1487 EAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADE 1526



 Score = 29.0 bits (64), Expect = 3.0
 Identities = 15/27 (55%), Positives = 20/27 (74%), Gaps = 1/27 (3%)

Query: 191  DFKKKKEEEKKEAKKKTKKKEEEKKKK 217
            + KKK+ EEKK+A++  KK EEE K K
Sbjct: 1706 ELKKKEAEEKKKAEE-LKKAEEENKIK 1731



 Score = 29.0 bits (64), Expect = 3.0
 Identities = 16/32 (50%), Positives = 19/32 (59%), Gaps = 5/32 (15%)

Query: 191  DFKKKKEEEKKEAKKKTK-----KKEEEKKKK 217
            D KK  E  KKEA++  K     KKE E+KKK
Sbjct: 1686 DEKKAAEALKKEAEEAKKAEELKKKEAEEKKK 1717



 Score = 29.0 bits (64), Expect = 3.0
 Identities = 17/28 (60%), Positives = 19/28 (67%), Gaps = 4/28 (14%)

Query: 194  KKKEEEKK--EAKKKT--KKKEEEKKKK 217
            KK EE+KK  EAKKK    KK +E KKK
Sbjct: 1296 KKAEEKKKADEAKKKAEEAKKADEAKKK 1323



 Score = 28.2 bits (62), Expect = 4.6
 Identities = 13/39 (33%), Positives = 19/39 (48%), Gaps = 2/39 (5%)

Query: 191  DFKKKKEEEKKEAKKKT--KKKEEEKKKKTRMRGGIERQ 227
            + KK +E +K E KKK    KK EE K     +    ++
Sbjct: 1550 ELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKK 1588



 Score = 27.8 bits (61), Expect = 5.6
 Identities = 16/38 (42%), Positives = 20/38 (52%), Gaps = 3/38 (7%)

Query: 193  KKKKEEEKK-EAKKKTK--KKEEEKKKKTRMRGGIERQ 227
            KKK +E KK E  KK    KK EE KK    +   E++
Sbjct: 1509 KKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKK 1546



 Score = 27.8 bits (61), Expect = 5.9
 Identities = 12/24 (50%), Positives = 14/24 (58%)

Query: 193  KKKKEEEKKEAKKKTKKKEEEKKK 216
            KKK EE KK  ++   K  EE KK
Sbjct: 1646 KKKAEELKKAEEENKIKAAEEAKK 1669



 Score = 27.8 bits (61), Expect = 6.7
 Identities = 15/31 (48%), Positives = 17/31 (54%), Gaps = 6/31 (19%)

Query: 193  KKKKEEEKK---EAKKKT---KKKEEEKKKK 217
            K K  EE K   E KKK    KK EE++KK 
Sbjct: 1660 KIKAAEEAKKAEEDKKKAEEAKKAEEDEKKA 1690



 Score = 27.8 bits (61), Expect = 7.1
 Identities = 6/25 (24%), Positives = 12/25 (48%)

Query: 193  KKKKEEEKKEAKKKTKKKEEEKKKK 217
            + KK  E  +A+ +    E E  ++
Sbjct: 1340 EAKKAAEAAKAEAEAAADEAEAAEE 1364



 Score = 27.4 bits (60), Expect = 7.9
 Identities = 9/33 (27%), Positives = 17/33 (51%)

Query: 193  KKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIE 225
            ++K E  +K+ ++  KK +  KKK    +   E
Sbjct: 1363 EEKAEAAEKKKEEAKKKADAAKKKAEEKKKADE 1395



 Score = 27.0 bits (59), Expect = 9.9
 Identities = 16/39 (41%), Positives = 19/39 (48%), Gaps = 4/39 (10%)

Query: 193  KKKKEEEKKEAKKKTK----KKEEEKKKKTRMRGGIERQ 227
            K K EE KK  + K K    KK EE+KKK       E +
Sbjct: 1606 KMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAE 1644


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 34.3 bits (79), Expect = 0.023
 Identities = 12/25 (48%), Positives = 19/25 (76%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
           +KK ++EK    K +KKK+++KKKK
Sbjct: 169 EKKDKKEKMVEPKGSKKKKKKKKKK 193



 Score = 33.9 bits (78), Expect = 0.037
 Identities = 14/26 (53%), Positives = 21/26 (80%)

Query: 194 KKKEEEKKEAKKKTKKKEEEKKKKTR 219
           + +EEEKKE KKK + K+E+K+KK +
Sbjct: 148 EVEEEEKKEKKKKKEVKKEKKEKKDK 173



 Score = 32.0 bits (73), Expect = 0.16
 Identities = 13/31 (41%), Positives = 23/31 (74%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMRGG 223
           ++KKE++KK+  KK KK++++KK+K     G
Sbjct: 152 EEKKEKKKKKEVKKEKKEKKDKKEKMVEPKG 182



 Score = 32.0 bits (73), Expect = 0.16
 Identities = 11/26 (42%), Positives = 20/26 (76%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
           K+KK++++K  + K  KK+++KKKK 
Sbjct: 168 KEKKDKKEKMVEPKGSKKKKKKKKKK 193



 Score = 31.6 bits (72), Expect = 0.18
 Identities = 14/28 (50%), Positives = 20/28 (71%), Gaps = 1/28 (3%)

Query: 193 KKKKEEEKKEAK-KKTKKKEEEKKKKTR 219
           KKKK+E KKE K KK KK++  + K ++
Sbjct: 157 KKKKKEVKKEKKEKKDKKEKMVEPKGSK 184



 Score = 31.6 bits (72), Expect = 0.21
 Identities = 13/29 (44%), Positives = 20/29 (68%)

Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKKTR 219
             K +KE E +E +KK KKK++E KK+ +
Sbjct: 140 TAKVEKEAEVEEEEKKEKKKKKEVKKEKK 168



 Score = 31.2 bits (71), Expect = 0.31
 Identities = 13/27 (48%), Positives = 21/27 (77%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
           +K+ E E++E K+K KKKE +K+KK +
Sbjct: 144 EKEAEVEEEEKKEKKKKKEVKKEKKEK 170



 Score = 30.1 bits (68), Expect = 0.59
 Identities = 11/25 (44%), Positives = 20/25 (80%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
           + ++EE+K++ KKK  KKE+++KK 
Sbjct: 148 EVEEEEKKEKKKKKEVKKEKKEKKD 172


>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
          Length = 135

 Score = 33.5 bits (77), Expect = 0.027
 Identities = 12/26 (46%), Positives = 19/26 (73%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
           K+KK+++KK+ KKK K  ++  KKK 
Sbjct: 110 KQKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 31.2 bits (71), Expect = 0.19
 Identities = 11/27 (40%), Positives = 19/27 (70%)

Query: 192 FKKKKEEEKKEAKKKTKKKEEEKKKKT 218
            K+K +++K++ KKK KKK++  KK  
Sbjct: 105 LKEKAKQKKQKKKKKKKKKKKTSKKAA 131



 Score = 31.2 bits (71), Expect = 0.20
 Identities = 12/27 (44%), Positives = 19/27 (70%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
           K K++++KK+ KKK KKK  +K  K +
Sbjct: 108 KAKQKKQKKKKKKKKKKKTSKKAAKKK 134



 Score = 28.9 bits (65), Expect = 1.0
 Identities = 13/25 (52%), Positives = 20/25 (80%)

Query: 194 KKKEEEKKEAKKKTKKKEEEKKKKT 218
           K+  +EK + KK+ KKK+++KKKKT
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKT 126



 Score = 27.7 bits (62), Expect = 2.6
 Identities = 10/25 (40%), Positives = 18/25 (72%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
           +  KE+ K++ +KK KKK+++KK  
Sbjct: 103 QVLKEKAKQKKQKKKKKKKKKKKTS 127



 Score = 27.7 bits (62), Expect = 2.6
 Identities = 11/25 (44%), Positives = 19/25 (76%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
           K+  +E+ K+ K+K KKK+++KKK 
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKT 126


>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are approximately 90 amino acids in length.
           There are two completely conserved L residues that may
           be functionally important.
          Length = 70

 Score = 32.3 bits (74), Expect = 0.028
 Identities = 10/22 (45%), Positives = 18/22 (81%)

Query: 195 KKEEEKKEAKKKTKKKEEEKKK 216
           K EEEK+E +++ +K+EE K++
Sbjct: 30  KAEEEKREEEEEARKREERKER 51



 Score = 30.4 bits (69), Expect = 0.15
 Identities = 8/26 (30%), Positives = 19/26 (73%), Gaps = 2/26 (7%)

Query: 194 KKKEEEKKEAKKKTKKKEEEKKKKTR 219
           K +EE+++E ++   +K EE+K++ +
Sbjct: 30  KAEEEKREEEEE--ARKREERKEREK 53



 Score = 28.8 bits (65), Expect = 0.48
 Identities = 7/21 (33%), Positives = 16/21 (76%)

Query: 194 KKKEEEKKEAKKKTKKKEEEK 214
           K++EEE+   +++ K++E+ K
Sbjct: 35  KREEEEEARKREERKEREKNK 55



 Score = 27.7 bits (62), Expect = 1.1
 Identities = 13/38 (34%), Positives = 21/38 (55%), Gaps = 1/38 (2%)

Query: 191 DFKKKKEEEKKEAKKKT-KKKEEEKKKKTRMRGGIERQ 227
           D   K + +KKE K +  K++EEE+ +K   R   E+ 
Sbjct: 17  DLLAKLKAKKKELKAEEEKREEEEEARKREERKEREKN 54



 Score = 26.9 bits (60), Expect = 2.3
 Identities = 8/21 (38%), Positives = 16/21 (76%)

Query: 195 KKEEEKKEAKKKTKKKEEEKK 215
           K+EEE++  K++ +K+ E+ K
Sbjct: 35  KREEEEEARKREERKEREKNK 55


>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 142

 Score = 33.5 bits (77), Expect = 0.029
 Identities = 12/37 (32%), Positives = 27/37 (72%), Gaps = 4/37 (10%)

Query: 191 DFKKKKEEEKKEAKKKTKKK----EEEKKKKTRMRGG 223
           +F++K+EE+K++ ++KT KK    +++K+KK + +  
Sbjct: 67  EFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKA 103



 Score = 30.8 bits (70), Expect = 0.30
 Identities = 12/36 (33%), Positives = 23/36 (63%)

Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIER 226
           + KK+K+EEK   K+  ++K+++KKKK +      +
Sbjct: 73  EEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNK 108



 Score = 28.9 bits (65), Expect = 1.2
 Identities = 12/25 (48%), Positives = 18/25 (72%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
           KK+ + +KK+ KKK KKK ++  KK
Sbjct: 85  KKRAKRQKKKQKKKKKKKAKKGNKK 109



 Score = 28.5 bits (64), Expect = 1.9
 Identities = 13/26 (50%), Positives = 21/26 (80%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
           KK+K+++KK+AKK  KK+E+E  K +
Sbjct: 93  KKQKKKKKKKAKKGNKKEEKEGSKSS 118



 Score = 27.3 bits (61), Expect = 4.3
 Identities = 13/28 (46%), Positives = 18/28 (64%), Gaps = 3/28 (10%)

Query: 195 KKEEEKKEAKKK---TKKKEEEKKKKTR 219
           KKE E +E ++K    K+K+EEK  K R
Sbjct: 60  KKETEDEEFQQKREEKKRKDEEKTAKKR 87


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 34.8 bits (80), Expect = 0.029
 Identities = 12/24 (50%), Positives = 15/24 (62%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKK 216
           KKK  E KK+A+ + K K E K K
Sbjct: 165 KKKAAEAKKKAEAEAKAKAEAKAK 188



 Score = 34.4 bits (79), Expect = 0.032
 Identities = 14/26 (53%), Positives = 17/26 (65%), Gaps = 1/26 (3%)

Query: 193 KKKKEEE-KKEAKKKTKKKEEEKKKK 217
           KK+ EEE K +A  + KKK  E KKK
Sbjct: 149 KKQAEEEAKAKAAAEAKKKAAEAKKK 174



 Score = 34.4 bits (79), Expect = 0.036
 Identities = 11/25 (44%), Positives = 13/25 (52%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
            K   E KK+A +  KK E E K K
Sbjct: 158 AKAAAEAKKKAAEAKKKAEAEAKAK 182



 Score = 33.3 bits (76), Expect = 0.075
 Identities = 11/25 (44%), Positives = 15/25 (60%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
           K + E EKK  ++  K+ EEE K K
Sbjct: 135 KAEAEAEKKAKEEAKKQAEEEAKAK 159



 Score = 32.9 bits (75), Expect = 0.12
 Identities = 9/25 (36%), Positives = 16/25 (64%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
            K+ EE++K+A++   K+  E K K
Sbjct: 111 AKQAEEKQKQAEEAKAKQAAEAKAK 135



 Score = 32.5 bits (74), Expect = 0.17
 Identities = 12/25 (48%), Positives = 15/25 (60%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
           KK + E K +A+ K K K EE K K
Sbjct: 173 KKAEAEAKAKAEAKAKAKAEEAKAK 197



 Score = 31.7 bits (72), Expect = 0.30
 Identities = 7/25 (28%), Positives = 16/25 (64%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
           ++K+ E++  A+K  K+ E+  K+ 
Sbjct: 90  RQKELEQRAAAEKAAKQAEQAAKQA 114



 Score = 31.3 bits (71), Expect = 0.31
 Identities = 11/27 (40%), Positives = 14/27 (51%), Gaps = 2/27 (7%)

Query: 193 KKKKEEEKKEAKKKT--KKKEEEKKKK 217
            K K E + E K K   KK+ EE+ K 
Sbjct: 132 AKAKAEAEAEKKAKEEAKKQAEEEAKA 158



 Score = 30.6 bits (69), Expect = 0.70
 Identities = 10/24 (41%), Positives = 16/24 (66%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKK 216
            K+ E+  K+A++K K+ EE K K
Sbjct: 104 AKQAEQAAKQAEEKQKQAEEAKAK 127



 Score = 30.2 bits (68), Expect = 0.74
 Identities = 14/26 (53%), Positives = 18/26 (69%), Gaps = 1/26 (3%)

Query: 193 KKKKEEEKKEAKKKTKKKE-EEKKKK 217
           KK KEE KK+A+++ K K   E KKK
Sbjct: 142 KKAKEEAKKQAEEEAKAKAAAEAKKK 167



 Score = 30.2 bits (68), Expect = 0.75
 Identities = 7/25 (28%), Positives = 14/25 (56%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
             K+ E+  +  ++ +K+ EE K K
Sbjct: 103 AAKQAEQAAKQAEEKQKQAEEAKAK 127



 Score = 29.8 bits (67), Expect = 1.0
 Identities = 8/25 (32%), Positives = 15/25 (60%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
           K+ +E++K+  + K K+  E K K 
Sbjct: 112 KQAEEKQKQAEEAKAKQAAEAKAKA 136



 Score = 29.4 bits (66), Expect = 1.3
 Identities = 11/27 (40%), Positives = 15/27 (55%), Gaps = 2/27 (7%)

Query: 193 KKKKEEEKKE--AKKKTKKKEEEKKKK 217
           K+K+ EE K   A +   K E E +KK
Sbjct: 117 KQKQAEEAKAKQAAEAKAKAEAEAEKK 143



 Score = 29.4 bits (66), Expect = 1.7
 Identities = 7/23 (30%), Positives = 15/23 (65%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKK 215
           +   E+  K+A++  K+ EE++K
Sbjct: 97  RAAAEKAAKQAEQAAKQAEEKQK 119



 Score = 28.3 bits (63), Expect = 3.1
 Identities = 9/25 (36%), Positives = 12/25 (48%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
           K K E + K   ++ K K E  K K
Sbjct: 180 KAKAEAKAKAKAEEAKAKAEAAKAK 204


>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
          Length = 434

 Score = 34.5 bits (80), Expect = 0.035
 Identities = 13/44 (29%), Positives = 26/44 (59%), Gaps = 2/44 (4%)

Query: 178 EHLKYRYQHPNICDFKKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
           + L+ + + P+  + K  K  +K  AK+  KK++E++K K + R
Sbjct: 374 DELRPKTKAPS--EKKTGKPSKKVLAKRAEKKEKEKEKPKVKKR 415



 Score = 33.8 bits (78), Expect = 0.061
 Identities = 9/27 (33%), Positives = 17/27 (62%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
            +KKE+EK++ K K + ++ +   K R
Sbjct: 400 AEKKEKEKEKPKVKKRHRDTKNIGKRR 426



 Score = 33.4 bits (77), Expect = 0.082
 Identities = 8/25 (32%), Positives = 15/25 (60%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
             K+ E+K++ K+K K K+  +  K
Sbjct: 396 LAKRAEKKEKEKEKPKVKKRHRDTK 420



 Score = 32.2 bits (74), Expect = 0.21
 Identities = 9/29 (31%), Positives = 18/29 (62%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
           K+ +++EK++ K K KK+  + K   + R
Sbjct: 398 KRAEKKEKEKEKPKVKKRHRDTKNIGKRR 426



 Score = 31.5 bits (72), Expect = 0.32
 Identities = 9/26 (34%), Positives = 17/26 (65%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
           KK+KE+EK + KK+ +  +   K++ 
Sbjct: 402 KKEKEKEKPKVKKRHRDTKNIGKRRK 427



 Score = 29.5 bits (67), Expect = 1.4
 Identities = 10/35 (28%), Positives = 16/35 (45%)

Query: 187 PNICDFKKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
           P+     K+ E+++KE +K   KK     K    R
Sbjct: 391 PSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKR 425


>gnl|CDD|219621 pfam07890, Rrp15p, Rrp15p.  Rrp15p is required for the formation of
           60S ribosomal subunits.
          Length = 132

 Score = 33.1 bits (76), Expect = 0.043
 Identities = 10/29 (34%), Positives = 17/29 (58%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
           KKK + EK E K K + + E+++   + R
Sbjct: 34  KKKLKSEKLEKKAKRQLRAEKRQALEKGR 62



 Score = 28.1 bits (63), Expect = 2.2
 Identities = 8/29 (27%), Positives = 20/29 (68%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
           K K E+ +K+AK++ + ++ +  +K R++
Sbjct: 36  KLKSEKLEKKAKRQLRAEKRQALEKGRVK 64



 Score = 26.6 bits (59), Expect = 7.9
 Identities = 8/30 (26%), Positives = 19/30 (63%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMRG 222
           K KK+ + ++ +KK K++   +K++   +G
Sbjct: 32  KAKKKLKSEKLEKKAKRQLRAEKRQALEKG 61


>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa).  Members of this
           family of proteins are part of the yeast nuclear pore
           complex-associated pre-60S ribosomal subunit. The family
           functions as a highly conserved exonuclease that is
           required for the 5'-end maturation of 5.8S and 25S
           rRNAs, demonstrating that 5'-end processing also has a
           redundant pathway. Nop25 binds late pre-60S ribosomes,
           accompanying them from the nucleolus to the nuclear
           periphery; and there is evidence for both physical and
           functional links between late 60S subunit processing and
           export.
          Length = 134

 Score = 32.7 bits (75), Expect = 0.060
 Identities = 11/30 (36%), Positives = 23/30 (76%)

Query: 192 FKKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
           F K+K++ +K+A+++ K+KE E++ + R R
Sbjct: 27  FHKRKQQRRKKAQEEAKEKEREERIEERKR 56


>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
           includes archaeal L12p, the protein that is functionally
           equivalent to L7/L12 in bacteria and the P1 and P2
           proteins in eukaryotes. L12p is homologous to P1 and P2
           but is not homologous to bacterial L7/L12. It is located
           in the L12 stalk, with proteins L10, L11, and 23S rRNA.
           L12p is the only protein in the ribosome to occur as
           multimers, always appearing as sets of dimers. Recent
           data indicate that most archaeal species contain six
           copies of L12p (three homodimers), while eukaryotes have
           four copies (two heterodimers), and bacteria may have
           four or six copies (two or three homodimers), depending
           on the species. The organization of proteins within the
           stalk has been characterized primarily in bacteria,
           where L7/L12 forms either two or three homodimers and
           each homodimer binds to the extended C-terminal helix of
           L10. L7/L12 is attached to the ribosome through L10 and
           is the only ribosomal protein that does not directly
           interact with rRNA. Archaeal L12p is believed to
           function in a similar fashion. However, hybrid ribosomes
           containing the large subunit from E. coli with an
           archaeal stalk are able to bind archaeal and eukaryotic
           elongation factors but not bacterial elongation factors.
           In several mesophilic and thermophilic archaeal species,
           the binding of 23S rRNA to protein L11 and to the
           L10/L12p pentameric complex was found to be
           temperature-dependent and cooperative.
          Length = 106

 Score = 32.1 bits (73), Expect = 0.073
 Identities = 12/23 (52%), Positives = 17/23 (73%)

Query: 196 KEEEKKEAKKKTKKKEEEKKKKT 218
             EEK E K++ KKKEEEK+++ 
Sbjct: 73  AAEEKAEEKEEEKKKEEEKEEEE 95



 Score = 30.5 bits (69), Expect = 0.22
 Identities = 10/21 (47%), Positives = 18/21 (85%)

Query: 193 KKKKEEEKKEAKKKTKKKEEE 213
           ++K EE+++E KK+ +K+EEE
Sbjct: 75  EEKAEEKEEEKKKEEEKEEEE 95



 Score = 28.2 bits (63), Expect = 1.3
 Identities = 12/21 (57%), Positives = 17/21 (80%)

Query: 193 KKKKEEEKKEAKKKTKKKEEE 213
            ++K EEK+E KKK ++KEEE
Sbjct: 74  AEEKAEEKEEEKKKEEEKEEE 94


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 32.7 bits (75), Expect = 0.074
 Identities = 10/32 (31%), Positives = 16/32 (50%)

Query: 188 NICDFKKKKEEEKKEAKKKTKKKEEEKKKKTR 219
                KKKK+++K +   K   K E+K +K  
Sbjct: 87  KKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEA 118



 Score = 29.7 bits (67), Expect = 1.0
 Identities = 9/26 (34%), Positives = 16/26 (61%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
           K KK+++K +  +K  +KE E K + 
Sbjct: 99  KDKKDDKKDDKSEKKDEKEAEDKLED 124



 Score = 29.3 bits (66), Expect = 1.1
 Identities = 10/26 (38%), Positives = 17/26 (65%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
           K KK++ KK+  K   KK+++K  K+
Sbjct: 85  KWKKKKSKKKKDKDKDKKDDKKDDKS 110



 Score = 29.3 bits (66), Expect = 1.1
 Identities = 11/26 (42%), Positives = 17/26 (65%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
           +K K ++KK  KKK K K+++  KK 
Sbjct: 82  QKWKWKKKKSKKKKDKDKDKKDDKKD 107



 Score = 29.3 bits (66), Expect = 1.4
 Identities = 9/28 (32%), Positives = 17/28 (60%)

Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKKT 218
           D  K K+++KK+ K + K ++E + K  
Sbjct: 96  DKDKDKKDDKKDDKSEKKDEKEAEDKLE 123



 Score = 28.9 bits (65), Expect = 1.4
 Identities = 12/30 (40%), Positives = 18/30 (60%)

Query: 188 NICDFKKKKEEEKKEAKKKTKKKEEEKKKK 217
            I   KK+ EE++K   KK K K+++ K K
Sbjct: 70  EIEKVKKEYEEKQKWKWKKKKSKKKKDKDK 99



 Score = 28.9 bits (65), Expect = 1.6
 Identities = 10/25 (40%), Positives = 17/25 (68%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
           +K+K + KK+  KK K K+++KK  
Sbjct: 80  EKQKWKWKKKKSKKKKDKDKDKKDD 104



 Score = 28.5 bits (64), Expect = 1.9
 Identities = 7/27 (25%), Positives = 17/27 (62%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
           K K +++K + K    +K++EK+ + +
Sbjct: 95  KDKDKDKKDDKKDDKSEKKDEKEAEDK 121



 Score = 28.5 bits (64), Expect = 2.4
 Identities = 8/27 (29%), Positives = 16/27 (59%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
           KK  +++ K  KK  K+ E++ +  T+
Sbjct: 101 KKDDKKDDKSEKKDEKEAEDKLEDLTK 127



 Score = 27.3 bits (61), Expect = 5.5
 Identities = 10/25 (40%), Positives = 18/25 (72%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
           K+K + +KK++KKK  K +++K  K
Sbjct: 81  KQKWKWKKKKSKKKKDKDKDKKDDK 105



 Score = 27.3 bits (61), Expect = 5.5
 Identities = 9/26 (34%), Positives = 16/26 (61%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
            K K+++ K+ K K K K+++KK   
Sbjct: 84  WKWKKKKSKKKKDKDKDKKDDKKDDK 109



 Score = 27.0 bits (60), Expect = 5.9
 Identities = 12/28 (42%), Positives = 19/28 (67%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRM 220
           KKKK+E  +E +K  K+ EE++K K + 
Sbjct: 61  KKKKKELAEEIEKVKKEYEEKQKWKWKK 88



 Score = 26.6 bits (59), Expect = 9.2
 Identities = 9/27 (33%), Positives = 17/27 (62%), Gaps = 1/27 (3%)

Query: 193 KKKKEEEKKE-AKKKTKKKEEEKKKKT 218
            ++ E+ KKE  +K+  K +++K KK 
Sbjct: 68  AEEIEKVKKEYEEKQKWKWKKKKSKKK 94


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 33.4 bits (77), Expect = 0.087
 Identities = 14/32 (43%), Positives = 18/32 (56%), Gaps = 2/32 (6%)

Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKKTRMRG 222
           D  K +EE  KE  ++ KK E E+    RMRG
Sbjct: 255 DRTKTEEELAKEEAERLKKLEAERL--RRMRG 284


>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.  This
           domain is found to the N-terminus of bacterial signal
           peptidases of the S49 family (pfam01343).
          Length = 154

 Score = 32.1 bits (74), Expect = 0.098
 Identities = 10/26 (38%), Positives = 17/26 (65%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
           K +K+ EK +AK + KK ++E+ K  
Sbjct: 73  KAEKKAEKAKAKAEKKKAKKEEPKPR 98



 Score = 30.2 bits (69), Expect = 0.47
 Identities = 9/28 (32%), Positives = 16/28 (57%)

Query: 192 FKKKKEEEKKEAKKKTKKKEEEKKKKTR 219
            K  ++ EKK  K K K ++++ KK+  
Sbjct: 68  LKAWEKAEKKAEKAKAKAEKKKAKKEEP 95



 Score = 29.0 bits (66), Expect = 1.2
 Identities = 10/31 (32%), Positives = 17/31 (54%)

Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
              K  E+ +K+A+K   K E++K KK   +
Sbjct: 66  KELKAWEKAEKKAEKAKAKAEKKKAKKEEPK 96



 Score = 28.6 bits (65), Expect = 1.5
 Identities = 9/31 (29%), Positives = 15/31 (48%)

Query: 197 EEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
           ++E K  +K  KK E+ K K  + +   E  
Sbjct: 65  KKELKAWEKAEKKAEKAKAKAEKKKAKKEEP 95



 Score = 28.3 bits (64), Expect = 2.2
 Identities = 16/41 (39%), Positives = 21/41 (51%), Gaps = 2/41 (4%)

Query: 178 EHLKYRYQHPNICDFKKKKEEEKKEAKK-KTKKKEEEKKKK 217
           + LK   +   + D K+ K  EK E K  K K K E+KK K
Sbjct: 52  KDLKESLEA-ALLDKKELKAWEKAEKKAEKAKAKAEKKKAK 91


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 32.9 bits (75), Expect = 0.12
 Identities = 11/24 (45%), Positives = 14/24 (58%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKK 216
           KK+ EE  K+A  K K+ EE   K
Sbjct: 117 KKQAEEAAKQAALKQKQAEEAAAK 140



 Score = 32.1 bits (73), Expect = 0.24
 Identities = 12/25 (48%), Positives = 12/25 (48%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
           K K E E K A    KK   E KKK
Sbjct: 146 KAKAEAEAKRAAAAAKKAAAEAKKK 170



 Score = 28.6 bits (64), Expect = 2.5
 Identities = 10/25 (40%), Positives = 14/25 (56%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
           KK   E KK+A+ +  KK   + KK
Sbjct: 161 KKAAAEAKKKAEAEAAKKAAAEAKK 185


>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
           homolog [Translation, ribosomal structure and
           biogenesis].
          Length = 395

 Score = 32.8 bits (75), Expect = 0.13
 Identities = 13/33 (39%), Positives = 20/33 (60%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIE 225
           K K E +KKE   + ++K++EKK K+  RG   
Sbjct: 363 KAKPERDKKERPGRYRRKKKEKKAKSERRGLQN 395


>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
           This is a family of fungal proteins whose function is
           unknown.
          Length = 130

 Score = 31.5 bits (72), Expect = 0.14
 Identities = 7/28 (25%), Positives = 18/28 (64%)

Query: 194 KKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
           KK+    +  +++TK++ E++K+  + R
Sbjct: 90  KKRRIALRLRRERTKERAEKEKRTRKNR 117



 Score = 30.7 bits (70), Expect = 0.29
 Identities = 8/27 (29%), Positives = 16/27 (59%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
            +++  +++  K+K  +K  EKK K R
Sbjct: 98  LRRERTKERAEKEKRTRKNREKKFKRR 124



 Score = 30.3 bits (69), Expect = 0.32
 Identities = 8/31 (25%), Positives = 16/31 (51%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMRGG 223
            + + E  KE  +K K+  + ++KK + R  
Sbjct: 96  LRLRRERTKERAEKEKRTRKNREKKFKRRQK 126



 Score = 27.6 bits (62), Expect = 3.0
 Identities = 9/30 (30%), Positives = 18/30 (60%), Gaps = 6/30 (20%)

Query: 193 KKKKEEE------KKEAKKKTKKKEEEKKK 216
           ++ KE        +K  +KK K++++EK+K
Sbjct: 101 ERTKERAEKEKRTRKNREKKFKRRQKEKEK 130


>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
           fragment found in eukaryotic DNA topoisomerase (topo) IB
           proteins similar to the monomeric yeast and human topo I
           and heterodimeric topo I from Leishmania donvanni. Topo
           I enzymes are divided into:  topo type IA (bacterial)
           and type IB (eukaryotic). Topo I relaxes superhelical
           tension in duplex DNA by creating a single-strand nick,
           the broken strand can then rotate around the unbroken
           strand to remove DNA supercoils and, the nick is
           religated, liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit re-ligation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topo I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topos I play putative roles
           in organizing the kinetoplast DNA network unique to
           these parasites.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 32.2 bits (74), Expect = 0.15
 Identities = 15/38 (39%), Positives = 19/38 (50%), Gaps = 4/38 (10%)

Query: 190 CDFKKKKE--EEKKEAKKKTKKKEEEKKKKTRMRGGIE 225
           CDF    +  EE+KE KK   K  EEKK     +  +E
Sbjct: 85  CDFTPIYQYFEEEKEKKKAMSK--EEKKAIKEEKEKLE 120


>gnl|CDD|215825 pfam00262, Calreticulin, Calreticulin family. 
          Length = 359

 Score = 32.3 bits (74), Expect = 0.16
 Identities = 22/104 (21%), Positives = 36/104 (34%), Gaps = 27/104 (25%)

Query: 131 RYYAIVKPLKYPINMTKR--VVAFML-------------------LNVWV----SPAVIS 165
           ++YAI      P +   +  VV + +                    +       +P  I 
Sbjct: 54  KFYAISAKFPKPFSNKGKTLVVQYEVKLQQGIDCGGAYIKLLSKDFDQKDFSGETPYTIM 113

Query: 166 FVPIMCGWYTTSEHLKYRYQHPNICDFKKKKEEEKKEAKKKTKK 209
           F P +CG  T   H  +R  H  I  +K++K   KK   +  K 
Sbjct: 114 FGPDICGSDTKKVHFIFR--HKPITGYKEEKHLIKKPISRCDKL 155


>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family.  The yeast member of this family
           (Kri1p) is found to be required for 40S ribosome
           biogenesis in the nucleolus.
          Length = 99

 Score = 30.7 bits (70), Expect = 0.18
 Identities = 8/25 (32%), Positives = 19/25 (76%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
           K++KEEEK + +++ K+ +  K+++
Sbjct: 1   KERKEEEKAQREEELKRLKNLKREE 25



 Score = 30.3 bits (69), Expect = 0.23
 Identities = 13/34 (38%), Positives = 20/34 (58%), Gaps = 2/34 (5%)

Query: 193 KKKKEEEKKEAKKKTKKKE-EEKKKKTRMRGGIE 225
           K ++EEE K   K  K++E EEK +K +   G+ 
Sbjct: 8   KAQREEELKR-LKNLKREEIEEKLEKIKKVAGLR 40



 Score = 26.1 bits (58), Expect = 7.3
 Identities = 6/24 (25%), Positives = 15/24 (62%)

Query: 198 EEKKEAKKKTKKKEEEKKKKTRMR 221
           +E+KE +K  +++E ++ K  +  
Sbjct: 1   KERKEEEKAQREEELKRLKNLKRE 24


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 31.8 bits (73), Expect = 0.21
 Identities = 13/32 (40%), Positives = 19/32 (59%), Gaps = 3/32 (9%)

Query: 192 FKKKKEEEKKEA---KKKTKKKEEEKKKKTRM 220
             K  EEE++E    KK+ KKKEE + K  ++
Sbjct: 272 ILKAAEEERQEEAQEKKEEKKKEEREAKLAKL 303



 Score = 30.3 bits (69), Expect = 0.72
 Identities = 13/37 (35%), Positives = 22/37 (59%), Gaps = 8/37 (21%)

Query: 193 KKKKEEEKKEAKKKTK--------KKEEEKKKKTRMR 221
           ++KKEE+KKE ++           +K EEK++K + R
Sbjct: 285 QEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQAR 321


>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
          Length = 330

 Score = 32.1 bits (74), Expect = 0.22
 Identities = 6/26 (23%), Positives = 14/26 (53%)

Query: 192 FKKKKEEEKKEAKKKTKKKEEEKKKK 217
            K   + +KK+ K++ K  + + K +
Sbjct: 66  LKAWHKAQKKKEKQEAKAAKAKSKPR 91



 Score = 27.9 bits (63), Expect = 4.1
 Identities = 6/26 (23%), Positives = 12/26 (46%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
            K   + +K+ +K+  K  + K K  
Sbjct: 66  LKAWHKAQKKKEKQEAKAAKAKSKPR 91



 Score = 27.5 bits (62), Expect = 5.8
 Identities = 12/44 (27%), Positives = 23/44 (52%), Gaps = 2/44 (4%)

Query: 178 EHLKYRYQHPNICDFKKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
           + +K   +   + D K+ K   K + KKK K++ +  K K++ R
Sbjct: 50  KEMKEELKA-ALLDKKELKAWHKAQ-KKKEKQEAKAAKAKSKPR 91


>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6.  The surfeit locus
           protein SURF-6 is shown to be a component of the
           nucleolar matrix and has a strong binding capacity for
           nucleic acids.
          Length = 206

 Score = 31.5 bits (72), Expect = 0.23
 Identities = 7/27 (25%), Positives = 19/27 (70%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
           +++K E++K  KK+ +K+ ++K+   +
Sbjct: 11  RRRKREQRKARKKQKRKEAKKKEDAQK 37



 Score = 29.6 bits (67), Expect = 1.0
 Identities = 12/28 (42%), Positives = 19/28 (67%), Gaps = 3/28 (10%)

Query: 194 KKKEE---EKKEAKKKTKKKEEEKKKKT 218
           KK+EE   ++K+ KK  KKK+ +KK + 
Sbjct: 178 KKREENLKKRKDDKKNKKKKKAKKKGRI 205



 Score = 28.0 bits (63), Expect = 3.5
 Identities = 12/28 (42%), Positives = 23/28 (82%)

Query: 194 KKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
           K+KE++KK++KK+ K+++E+ +KK   R
Sbjct: 149 KRKEKQKKKSKKEWKERKEKVEKKKAER 176



 Score = 27.3 bits (61), Expect = 5.1
 Identities = 9/23 (39%), Positives = 17/23 (73%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKK 215
           K +K++++KEAKKK   ++ E +
Sbjct: 19  KARKKQKRKEAKKKEDAQKSEAE 41



 Score = 27.3 bits (61), Expect = 5.7
 Identities = 13/25 (52%), Positives = 20/25 (80%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
           KKK ++E KE K+K +KK+ E++KK
Sbjct: 155 KKKSKKEWKERKEKVEKKKAERQKK 179



 Score = 26.5 bits (59), Expect = 9.4
 Identities = 11/29 (37%), Positives = 16/29 (55%), Gaps = 1/29 (3%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
           +KKK EE  E  K  + +E+EK  K   +
Sbjct: 104 RKKKLEELDE-DKAAEIEEKEKWTKALAK 131


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 32.2 bits (74), Expect = 0.23
 Identities = 10/25 (40%), Positives = 14/25 (56%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
           K +KEE +K A KK  +   +K K 
Sbjct: 424 KAEKEEAEKAAAKKKAEAAAKKAKG 448



 Score = 30.7 bits (70), Expect = 0.68
 Identities = 10/25 (40%), Positives = 15/25 (60%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
           KK ++EE ++A  K K +   KK K
Sbjct: 423 KKAEKEEAEKAAAKKKAEAAAKKAK 447



 Score = 30.7 bits (70), Expect = 0.68
 Identities = 11/25 (44%), Positives = 17/25 (68%), Gaps = 1/25 (4%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
           KK+++ EKK A+K+  +K   KKK 
Sbjct: 416 KKQRKAEKK-AEKEEAEKAAAKKKA 439



 Score = 29.1 bits (66), Expect = 2.2
 Identities = 9/25 (36%), Positives = 18/25 (72%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
           +KK  +++++A+KK +K+E EK   
Sbjct: 411 RKKLRKKQRKAEKKAEKEEAEKAAA 435



 Score = 28.0 bits (63), Expect = 4.6
 Identities = 8/25 (32%), Positives = 13/25 (52%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
           +K   ++K EA  K  K  + + KK
Sbjct: 431 EKAAAKKKAEAAAKKAKGPDGETKK 455



 Score = 27.6 bits (62), Expect = 5.8
 Identities = 13/36 (36%), Positives = 18/36 (50%), Gaps = 3/36 (8%)

Query: 193 KKKKEE---EKKEAKKKTKKKEEEKKKKTRMRGGIE 225
           K++K E   EK+EA+K   KK+ E   K       E
Sbjct: 417 KQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGE 452


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 32.2 bits (74), Expect = 0.23
 Identities = 12/29 (41%), Positives = 18/29 (62%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
           K KK  EK E K++ +KKE++KK     +
Sbjct: 411 KIKKIVEKAEKKREEEKKEKKKKAFAGKK 439



 Score = 31.4 bits (72), Expect = 0.36
 Identities = 10/34 (29%), Positives = 21/34 (61%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIER 226
           KK++EE+K++ KK    K++E++++       E 
Sbjct: 421 KKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454



 Score = 31.0 bits (71), Expect = 0.53
 Identities = 11/25 (44%), Positives = 17/25 (68%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
            +K E++++E KK+ KKK    KKK
Sbjct: 416 VEKAEKKREEEKKEKKKKAFAGKKK 440



 Score = 31.0 bits (71), Expect = 0.55
 Identities = 13/35 (37%), Positives = 23/35 (65%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
           KK+K+++    KKK +++EEEK+KK   +   E +
Sbjct: 427 KKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEE 461



 Score = 30.7 bits (70), Expect = 0.60
 Identities = 9/27 (33%), Positives = 17/27 (62%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
           +K +++ ++E K+K KK    KKK+  
Sbjct: 417 EKAEKKREEEKKEKKKKAFAGKKKEEE 443



 Score = 29.9 bits (68), Expect = 1.2
 Identities = 11/25 (44%), Positives = 20/25 (80%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
           +KK+EEEKKE KKK    +++++++
Sbjct: 420 EKKREEEKKEKKKKAFAGKKKEEEE 444



 Score = 29.5 bits (67), Expect = 1.4
 Identities = 9/26 (34%), Positives = 17/26 (65%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
            +KK EE+K+ KKK     ++K+++ 
Sbjct: 419 AEKKREEEKKEKKKKAFAGKKKEEEE 444



 Score = 28.7 bits (65), Expect = 2.5
 Identities = 11/26 (42%), Positives = 16/26 (61%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
             +K E+K+E +KK KKK+    KK 
Sbjct: 415 IVEKAEKKREEEKKEKKKKAFAGKKK 440



 Score = 28.0 bits (63), Expect = 5.0
 Identities = 10/30 (33%), Positives = 20/30 (66%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMRG 222
            KK ++  ++A+KK +++++EKKKK     
Sbjct: 409 TKKIKKIVEKAEKKREEEKKEKKKKAFAGK 438



 Score = 28.0 bits (63), Expect = 5.1
 Identities = 9/33 (27%), Positives = 19/33 (57%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIE 225
             KK ++  E  +K +++E+++KKK    G  +
Sbjct: 408 ATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKK 440


>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
           binding fragment.  Topoisomerase I promotes the
           relaxation of DNA superhelical tension by introducing a
           transient single-stranded break in duplex DNA and are
           vital for the processes of replication, transcription,
           and recombination. This family may be more than one
           structural domain.
          Length = 215

 Score = 31.4 bits (72), Expect = 0.26
 Identities = 15/50 (30%), Positives = 20/50 (40%), Gaps = 9/50 (18%)

Query: 183 RYQHPNI-----CDFKKKKE--EEKKEAKKKTKKKEEEKKKKTRMRGGIE 225
           +     I     CDF    E  E +KE KK     +EEKK     +  +E
Sbjct: 74  KEGGAIIKDLEKCDFTPIYEYFEAEKEKKKA--MSKEEKKAIKEEKDKLE 121


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
           entry is characterized by proteins with alternating
           conserved and low-complexity regions. Bud13 together
           with Snu17p and a newly identified factor,
           Pml1p/Ylr016c, form a novel trimeric complex. called The
           RES complex, pre-mRNA retention and splicing complex.
           Subunits of this complex are not essential for viability
           of yeasts but they are required for efficient splicing
           in vitro and in vivo. Furthermore, inactivation of this
           complex causes pre-mRNA leakage from the nucleus. Bud13
           contains a unique, phylogenetically conserved C-terminal
           region of unknown function.
          Length = 141

 Score = 30.7 bits (70), Expect = 0.26
 Identities = 13/34 (38%), Positives = 26/34 (76%)

Query: 182 YRYQHPNICDFKKKKEEEKKEAKKKTKKKEEEKK 215
           YR +   I D ++K+EE+++E ++K +K+E+EK+
Sbjct: 4   YRDKSGRIIDIEEKREEKEREKEEKERKEEKEKE 37



 Score = 27.7 bits (62), Expect = 3.3
 Identities = 7/28 (25%), Positives = 19/28 (67%)

Query: 196 KEEEKKEAKKKTKKKEEEKKKKTRMRGG 223
            EE+++E +++ ++KE +++K+     G
Sbjct: 14  IEEKREEKEREKEEKERKEEKEKEWGKG 41



 Score = 26.9 bits (60), Expect = 5.5
 Identities = 10/33 (30%), Positives = 20/33 (60%), Gaps = 4/33 (12%)

Query: 193 KKKKEEEKKEAKKKTKK----KEEEKKKKTRMR 221
           +K+++E K+E +K+  K    KEE +K+   + 
Sbjct: 24  EKEEKERKEEKEKEWGKGLVQKEEREKRLEELE 56


>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
           (DUF1754).  This is a eukaryotic protein family of
           unknown function.
          Length = 90

 Score = 30.1 bits (68), Expect = 0.27
 Identities = 12/35 (34%), Positives = 18/35 (51%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
           KKKK++ K + +  T+K+EEEK          E  
Sbjct: 24  KKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEED 58



 Score = 29.3 bits (66), Expect = 0.42
 Identities = 12/28 (42%), Positives = 17/28 (60%), Gaps = 2/28 (7%)

Query: 192 FKKKKEEEKKEAKKKTKKKEEEKKKKTR 219
            K KK + KK  KKK KKK+ + K++  
Sbjct: 12  LKGKKIDVKK--KKKKKKKKNKSKEEVV 37



 Score = 28.2 bits (63), Expect = 1.2
 Identities = 10/26 (38%), Positives = 19/26 (73%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
           KKKK+++KK   K+    E+E+++K+
Sbjct: 21  KKKKKKKKKNKSKEEVVTEKEEEEKS 46



 Score = 26.2 bits (58), Expect = 5.9
 Identities = 11/25 (44%), Positives = 15/25 (60%), Gaps = 3/25 (12%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
           KKKK   KK+ KK   K+E   +K+
Sbjct: 20  KKKK---KKKKKKNKSKEEVVTEKE 41


>gnl|CDD|165177 PHA02834, PHA02834, chemokine receptor-like protein; Provisional.
          Length = 323

 Score = 31.4 bits (71), Expect = 0.29
 Identities = 39/157 (24%), Positives = 77/157 (49%), Gaps = 18/157 (11%)

Query: 21  TSPDENVSSIIWMTLRASIMIFIIVASIGGNLLVIVSVMRHRKLRIITNYFVVSLAFADM 80
            + D N   I++  L   + IF ++    GN+LVI +V+  ++   + + ++ ++A +D+
Sbjct: 23  VNSDVNYFVIVFYIL---LFIFGLI----GNVLVI-AVLIVKRFMFVVDVYLFNIAMSDL 74

Query: 81  LVAMVAMTFNASVQV-TGKWNFSFFMCDVWNSLDVYFST--ASILHLCCISVDRYYAIVK 137
              M+  +F   +     +W F  FMC +   L VYF    +++  +  IS+DRY  +V 
Sbjct: 75  ---MLVFSFPFIIHNDLNEWIFGEFMCKL--VLGVYFVGFFSNMFFVTLISIDRYILVVN 129

Query: 138 PLKYPINMTKRVVAFMLLNVWVSPAVISFVPIMCGWY 174
             K   N +  +   + +  WV   ++S +P M  +Y
Sbjct: 130 ATKIK-NKSISLSVLLSVAAWVCSVILS-MPAMVLYY 164


>gnl|CDD|239570 cd03488, Topoisomer_IB_N_htopoI_like, Topoisomer_IB_N_htopoI_like :
           N-terminal DNA binding fragment found in eukaryotic DNA
           topoisomerase (topo) IB proteins similar to the
           monomeric yeast and human topo I.  Topo I enzymes are
           divided into:  topo type IA (bacterial) and type IB
           (eukaryotic). Topo I relaxes superhelical tension in
           duplex DNA by creating a single-strand nick, the broken
           strand can then rotate around the unbroken strand to
           remove DNA supercoils and, the nick is religated,
           liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit religation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 31.2 bits (71), Expect = 0.30
 Identities = 13/34 (38%), Positives = 19/34 (55%), Gaps = 6/34 (17%)

Query: 190 CDFK------KKKEEEKKEAKKKTKKKEEEKKKK 217
           CDF       K ++EEKK   K+ KK  + +K+K
Sbjct: 85  CDFTQMFAYFKAQKEEKKAMSKEEKKAIKAEKEK 118


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 31.8 bits (72), Expect = 0.31
 Identities = 9/35 (25%), Positives = 19/35 (54%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
           K K  +E K    K ++KE+E+ K+ + +   + +
Sbjct: 91  KTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPK 125



 Score = 30.6 bits (69), Expect = 0.59
 Identities = 14/27 (51%), Positives = 18/27 (66%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
           KKKK+E+ KE  K  K KEE K+K+  
Sbjct: 117 KKKKKEKPKEEPKDRKPKEEAKEKRPP 143



 Score = 30.6 bits (69), Expect = 0.67
 Identities = 12/26 (46%), Positives = 19/26 (73%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
             K+EE++KE  K+ KKK++EK K+ 
Sbjct: 102 SGKEEEKEKEQVKEEKKKKKEKPKEE 127



 Score = 30.6 bits (69), Expect = 0.74
 Identities = 11/27 (40%), Positives = 20/27 (74%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
           ++K++E+ KE KKK K+K +E+ K  +
Sbjct: 106 EEKEKEQVKEEKKKKKEKPKEEPKDRK 132



 Score = 29.9 bits (67), Expect = 1.1
 Identities = 11/29 (37%), Positives = 23/29 (79%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
            K++E+EK++ K++ KKK+E+ K++ + R
Sbjct: 103 GKEEEKEKEQVKEEKKKKKEKPKEEPKDR 131



 Score = 29.9 bits (67), Expect = 1.3
 Identities = 8/35 (22%), Positives = 21/35 (60%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
            K  +E K E+ K+ +K++E+ K++ + +    ++
Sbjct: 92  TKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKE 126



 Score = 29.5 bits (66), Expect = 1.5
 Identities = 9/26 (34%), Positives = 17/26 (65%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
            +K +EE KE +   +K++E++KK  
Sbjct: 130 DRKPKEEAKEKRPPKEKEKEKEKKVE 155



 Score = 29.1 bits (65), Expect = 2.0
 Identities = 10/25 (40%), Positives = 20/25 (80%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
           K++ +EEKK+ K+K K++ +++K K
Sbjct: 110 KEQVKEEKKKKKEKPKEEPKDRKPK 134



 Score = 29.1 bits (65), Expect = 2.0
 Identities = 12/27 (44%), Positives = 19/27 (70%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
           +K+KE+ K+E KKK +K +EE K +  
Sbjct: 107 EKEKEQVKEEKKKKKEKPKEEPKDRKP 133



 Score = 29.1 bits (65), Expect = 2.0
 Identities = 11/26 (42%), Positives = 19/26 (73%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
           K +K +E+ + K+  K+KE+EK+KK 
Sbjct: 129 KDRKPKEEAKEKRPPKEKEKEKEKKV 154



 Score = 29.1 bits (65), Expect = 2.3
 Identities = 10/28 (35%), Positives = 15/28 (53%)

Query: 194 KKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
            KK  +KK   KK +  EEEK+++    
Sbjct: 175 PKKPPKKKPPNKKKEPPEEEKQRQAARE 202



 Score = 28.7 bits (64), Expect = 2.4
 Identities = 11/29 (37%), Positives = 20/29 (68%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
             K++E++KE K +  +  EE+KK+ R+R
Sbjct: 142 PPKEKEKEKEKKVEEPRDREEEKKRERVR 170



 Score = 28.3 bits (63), Expect = 3.6
 Identities = 8/35 (22%), Positives = 20/35 (57%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
           KKK++ +++   +K K++ +EK+         E++
Sbjct: 119 KKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKK 153



 Score = 28.3 bits (63), Expect = 4.1
 Identities = 11/27 (40%), Positives = 19/27 (70%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
           +KKK++EK + + K +K +EE K+K  
Sbjct: 116 EKKKKKEKPKEEPKDRKPKEEAKEKRP 142



 Score = 27.9 bits (62), Expect = 4.5
 Identities = 10/27 (37%), Positives = 13/27 (48%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
           KK++    K   KK  KK+   KKK  
Sbjct: 164 KKRERVRAKSRPKKPPKKKPPNKKKEP 190



 Score = 27.9 bits (62), Expect = 4.7
 Identities = 7/29 (24%), Positives = 22/29 (75%)

Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKKTR 219
           + K +  +E+++ K++ K+++++KK+K +
Sbjct: 97  EPKNESGKEEEKEKEQVKEEKKKKKEKPK 125



 Score = 27.9 bits (62), Expect = 5.0
 Identities = 9/30 (30%), Positives = 15/30 (50%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMRG 222
            + K   KK  KKK   K++E  ++ + R 
Sbjct: 169 VRAKSRPKKPPKKKPPNKKKEPPEEEKQRQ 198



 Score = 27.5 bits (61), Expect = 6.2
 Identities = 9/27 (33%), Positives = 16/27 (59%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
            KK  ++K   KKK   +EE++++  R
Sbjct: 175 PKKPPKKKPPNKKKEPPEEEKQRQAAR 201


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 31.2 bits (71), Expect = 0.33
 Identities = 10/29 (34%), Positives = 14/29 (48%)

Query: 192 FKKKKEEEKKEAKKKTKKKEEEKKKKTRM 220
            KKKK+++   AK         KKK  R+
Sbjct: 99  TKKKKKKDPTAAKSPKAAAPRPKKKSERI 127


>gnl|CDD|206063 pfam13892, DBINO, DNA-binding domain.  DBINO is a DNA-binding
           domain found on global transcription activator SNF2L1
           proteins and chromatin re-modelling proteins.
          Length = 140

 Score = 30.3 bits (69), Expect = 0.34
 Identities = 11/26 (42%), Positives = 19/26 (73%)

Query: 192 FKKKKEEEKKEAKKKTKKKEEEKKKK 217
           F KK E+E++E +K+ +K+  E+ KK
Sbjct: 77  FWKKNEKEERELRKRAEKEALEQAKK 102


>gnl|CDD|235898 PRK06964, PRK06964, DNA polymerase III subunit delta'; Validated.
          Length = 342

 Score = 31.3 bits (71), Expect = 0.35
 Identities = 15/51 (29%), Positives = 19/51 (37%), Gaps = 6/51 (11%)

Query: 171 CGWYTTSEHLKYRYQHPNICD------FKKKKEEEKKEAKKKTKKKEEEKK 215
           C W+    H  YR   P            + KE +  E  KKTK   +E K
Sbjct: 63  CNWFAQGNHPDYRIVRPEALAAEAPGAADEAKEADADEGGKKTKAPSKEIK 113


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
           family is found in eukaryotes; it has several conserved
           tryptophan residues. The function is not known.
          Length = 261

 Score = 31.2 bits (71), Expect = 0.36
 Identities = 11/25 (44%), Positives = 23/25 (92%), Gaps = 1/25 (4%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
           ++K+EEE+++ +KK +++EEE+K+K
Sbjct: 196 QQKREEERRKQRKK-QQEEEERKQK 219


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 30.8 bits (70), Expect = 0.44
 Identities = 10/27 (37%), Positives = 17/27 (62%)

Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKK 217
           D +K++E+ K  AK K KK  + K ++
Sbjct: 42  DEEKEEEKAKVAAKAKAKKALKAKIEE 68



 Score = 30.4 bits (69), Expect = 0.63
 Identities = 13/27 (48%), Positives = 16/27 (59%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
           +K K   K +AKK  K K EEK+K  R
Sbjct: 48  EKAKVAAKAKAKKALKAKIEEKEKAKR 74



 Score = 30.4 bits (69), Expect = 0.69
 Identities = 11/31 (35%), Positives = 13/31 (41%), Gaps = 4/31 (12%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMRGG 223
           K K E+  K  KKK  K     K K  + G 
Sbjct: 194 KLKAEKAAKGGKKKKGK----AKAKLNVGGA 220



 Score = 30.0 bits (68), Expect = 0.91
 Identities = 9/27 (33%), Positives = 16/27 (59%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
            K K+  K + ++K K K E+++K  R
Sbjct: 56  AKAKKALKAKIEEKEKAKREKEEKGLR 82



 Score = 29.6 bits (67), Expect = 0.98
 Identities = 10/30 (33%), Positives = 15/30 (50%), Gaps = 3/30 (10%)

Query: 194 KKKEEEKKEAKKKTKKKEEEKKKKTRMRGG 223
            +K + +K AK   KKK    K K ++  G
Sbjct: 192 NEKLKAEKAAKGGKKKK---GKAKAKLNVG 218



 Score = 29.6 bits (67), Expect = 1.1
 Identities = 9/29 (31%), Positives = 17/29 (58%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
           ++K+EE+ K A K   KK  + K + + +
Sbjct: 43  EEKEEEKAKVAAKAKAKKALKAKIEEKEK 71



 Score = 28.9 bits (65), Expect = 1.9
 Identities = 10/27 (37%), Positives = 16/27 (59%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
            KK  + K E K+K K+++EEK  +  
Sbjct: 58  AKKALKAKIEEKEKAKREKEEKGLREL 84



 Score = 28.5 bits (64), Expect = 2.3
 Identities = 9/28 (32%), Positives = 16/28 (57%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRM 220
           K KK  + K  +K+  K+E+E+K    +
Sbjct: 57  KAKKALKAKIEEKEKAKREKEEKGLREL 84



 Score = 28.5 bits (64), Expect = 2.8
 Identities = 9/31 (29%), Positives = 17/31 (54%)

Query: 197 EEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
           EEE +E +++  K   + K K  ++  IE +
Sbjct: 39  EEEDEEKEEEKAKVAAKAKAKKALKAKIEEK 69



 Score = 28.1 bits (63), Expect = 3.3
 Identities = 10/34 (29%), Positives = 21/34 (61%), Gaps = 6/34 (17%)

Query: 194 KKKEEEKKEAKKKTKKK------EEEKKKKTRMR 221
           ++KE+ K+E ++K  ++      E+E  +K R+R
Sbjct: 67  EEKEKAKREKEEKGLRELEEDTPEDELAEKLRLR 100



 Score = 26.9 bits (60), Expect = 8.2
 Identities = 6/28 (21%), Positives = 15/28 (53%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRM 220
           K  K + +++ K K +K+E+  ++    
Sbjct: 60  KALKAKIEEKEKAKREKEEKGLRELEED 87


>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
          Length = 158

 Score = 30.5 bits (69), Expect = 0.45
 Identities = 13/25 (52%), Positives = 19/25 (76%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
           +  KE E+KEA K  K+K+E+K+KK
Sbjct: 101 EVSKETEEKEAIKAKKEKKEKKEKK 125



 Score = 30.1 bits (68), Expect = 0.49
 Identities = 8/27 (29%), Positives = 18/27 (66%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
           +K+  +E +E +    KKE+++KK+ +
Sbjct: 99  QKEVSKETEEKEAIKAKKEKKEKKEKK 125



 Score = 29.0 bits (65), Expect = 1.3
 Identities = 8/28 (28%), Positives = 19/28 (67%)

Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKKT 218
           + +K+++E  KE ++K   K +++KK+ 
Sbjct: 94  ESEKEQKEVSKETEEKEAIKAKKEKKEK 121



 Score = 28.2 bits (63), Expect = 2.6
 Identities = 12/29 (41%), Positives = 20/29 (68%)

Query: 192 FKKKKEEEKKEAKKKTKKKEEEKKKKTRM 220
           F ++ E+E+KE  K+T++KE  K KK + 
Sbjct: 91  FPEESEKEQKEVSKETEEKEAIKAKKEKK 119



 Score = 27.0 bits (60), Expect = 5.2
 Identities = 9/25 (36%), Positives = 18/25 (72%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
           ++K+  ++ E K+  K K+E+K+KK
Sbjct: 98  EQKEVSKETEEKEAIKAKKEKKEKK 122



 Score = 26.6 bits (59), Expect = 7.8
 Identities = 12/28 (42%), Positives = 17/28 (60%), Gaps = 2/28 (7%)

Query: 193 KKKKEEEKKEAK--KKTKKKEEEKKKKT 218
            KK+++EKKE K  +K  KK+  K  K 
Sbjct: 114 AKKEKKEKKEKKVAEKLAKKKSTKTTKN 141


>gnl|CDD|218435 pfam05104, Rib_recp_KP_reg, Ribosome receptor lysine/proline rich
           region.  This highly conserved region is found towards
           the C-terminus of the transmembrane domain. The function
           is unclear.
          Length = 151

 Score = 30.3 bits (68), Expect = 0.45
 Identities = 14/33 (42%), Positives = 18/33 (54%), Gaps = 3/33 (9%)

Query: 185 QHPNICDFKKKKEEEKKEAKKKTKKKEEEKKKK 217
           Q        KKK+E+  E K K+KKKEE+   K
Sbjct: 21  QSQK---SDKKKKEKVSEKKGKSKKKEEKPNGK 50



 Score = 28.8 bits (64), Expect = 1.3
 Identities = 13/32 (40%), Positives = 22/32 (68%), Gaps = 3/32 (9%)

Query: 193 KKKKEEEKKEAKKKTKKKEE---EKKKKTRMR 221
           K++KE  K +++K  KKK+E   EKK K++ +
Sbjct: 12  KQRKESGKTQSQKSDKKKKEKVSEKKGKSKKK 43


>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992).  This
           bacterial family of proteins has no known function.
           However, the cis-regulatory yjdF motif, just upstream
           from the gene encoding the proteins for this family, is
           a small non-coding RNA, Rfam:RF01764. The yjdF motif is
           found in many Firmicutes, including Bacillus subtilis.
           In most cases, it resides in potential 5' UTRs of
           homologues of the yjdF gene whose function is unknown.
           However, in Streptococcus thermophilus, a yjdF RNA motif
           is associated with an operon whose protein products
           synthesise nicotinamide adenine dinucleotide (NAD+).
           Also, the S. thermophilus yjdF RNA lacks typical yjdF
           motif consensus features downstream of and including the
           P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
           S. thermophilus RNAs might sense a distinct compound
           that structurally resembles the ligand bound by other
           yjdF RNAs. On the ohter hand, perhaps these RNAs have an
           alternative solution forming a similar binding site, as
           is observed with some SAM riboswitches.
          Length = 132

 Score = 29.9 bits (68), Expect = 0.49
 Identities = 14/30 (46%), Positives = 23/30 (76%), Gaps = 1/30 (3%)

Query: 193 KKKKEEEKKEAKKKTK-KKEEEKKKKTRMR 221
           K + E  K+E KK++K KKEEEK++K +++
Sbjct: 92  KLEHERNKQEKKKRSKEKKEEEKERKRQLK 121



 Score = 28.8 bits (65), Expect = 1.1
 Identities = 14/30 (46%), Positives = 24/30 (80%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMRG 222
           KKK+ +EKKE +K+ K++ +++KKK + RG
Sbjct: 102 KKKRSKEKKEEEKERKRQLKQQKKKAKHRG 131


>gnl|CDD|218337 pfam04939, RRS1, Ribosome biogenesis regulatory protein (RRS1).
           This family consists of several eukaryotic ribosome
           biogenesis regulatory (RRS1) proteins. RRS1 is a nuclear
           protein that is essential for the maturation of 25 S
           rRNA and the 60 S ribosomal subunit assembly in
           Saccharomyces cerevisiae.
          Length = 164

 Score = 30.3 bits (69), Expect = 0.54
 Identities = 12/26 (46%), Positives = 15/26 (57%)

Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKK 216
           D   KK  EKKE   K +K+E + KK
Sbjct: 139 DPFAKKRREKKERVAKNEKRELKNKK 164


>gnl|CDD|217502 pfam03343, SART-1, SART-1 family.  SART-1 is a protein involved in
           cell cycle arrest and pre-mRNA splicing. It has been
           shown to be a component of U4/U6 x U5 tri-snRNP complex
           in human, Schizosaccharomyces pombe and Saccharomyces
           cerevisiae. SART-1 is a known tumour antigen in a range
           of cancers recognised by T cells.
          Length = 603

 Score = 30.9 bits (70), Expect = 0.60
 Identities = 9/23 (39%), Positives = 13/23 (56%)

Query: 194 KKKEEEKKEAKKKTKKKEEEKKK 216
           K+K EE +E   K ++K E   K
Sbjct: 52  KRKREELREKIAKAREKRERNSK 74



 Score = 27.4 bits (61), Expect = 8.2
 Identities = 11/26 (42%), Positives = 18/26 (69%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
           K KK ++KKEA++K     +EK+K+ 
Sbjct: 97  KSKKRQKKKEAERKKALLLDEKEKER 122


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 30.5 bits (69), Expect = 0.61
 Identities = 12/26 (46%), Positives = 18/26 (69%), Gaps = 1/26 (3%)

Query: 193 KKKKE-EEKKEAKKKTKKKEEEKKKK 217
           K KKE  E+KE +++ K  EEEK ++
Sbjct: 153 KIKKERAEEKEREEEEKAAEEEKARE 178


>gnl|CDD|227008 COG4662, TupA, ABC-type tungstate transport system, periplasmic
           component [Coenzyme metabolism].
          Length = 227

 Score = 30.5 bits (69), Expect = 0.62
 Identities = 18/83 (21%), Positives = 34/83 (40%)

Query: 21  TSPDENVSSIIWMTLRASIMIFIIVASIGGNLLVIVSVMRHRKLRIITNYFVVSLAFADM 80
            S D  +  I+  TL  S++   + A +G  L  ++++   R  R I       L+   +
Sbjct: 15  FSLDPELIGIVATTLYVSLISIFLAALVGVPLAFVLALREFRGKRFIKMIINTLLSMPTV 74

Query: 81  LVAMVAMTFNASVQVTGKWNFSF 103
           LV ++     +     G +N  F
Sbjct: 75  LVGLLLYLLLSRSGPLGWFNLLF 97


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 30.9 bits (70), Expect = 0.62
 Identities = 14/31 (45%), Positives = 22/31 (70%), Gaps = 4/31 (12%)

Query: 193 KKKKEEEKKEAKK----KTKKKEEEKKKKTR 219
           ++KKEEE+K   K      ++++EEKKKKT+
Sbjct: 45  EEKKEEEEKTTDKEEEVDEEEEKEEKKKKTK 75



 Score = 29.4 bits (66), Expect = 1.6
 Identities = 12/19 (63%), Positives = 14/19 (73%)

Query: 197 EEEKKEAKKKTKKKEEEKK 215
           EEEK+E KKKTKK +E   
Sbjct: 64  EEEKEEKKKKTKKVKETTT 82


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 30.6 bits (69), Expect = 0.67
 Identities = 11/25 (44%), Positives = 15/25 (60%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
           K KK + KK+ KK  KK+ +E   K
Sbjct: 145 KAKKGKAKKKTKKSKKKEAKESSDK 169



 Score = 28.3 bits (63), Expect = 3.2
 Identities = 9/27 (33%), Positives = 15/27 (55%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
            K K+ + K+  KK+KKKE ++     
Sbjct: 144 TKAKKGKAKKKTKKSKKKEAKESSDKD 170



 Score = 27.1 bits (60), Expect = 8.3
 Identities = 8/27 (29%), Positives = 14/27 (51%)

Query: 192 FKKKKEEEKKEAKKKTKKKEEEKKKKT 218
            + K ++ K + K K  KK+E K+   
Sbjct: 142 IETKAKKGKAKKKTKKSKKKEAKESSD 168


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This
           family consists of several bovine specific leukaemia
           virus receptors which are thought to function as
           transmembrane proteins, although their exact function is
           unknown.
          Length = 561

 Score = 30.8 bits (69), Expect = 0.69
 Identities = 12/25 (48%), Positives = 21/25 (84%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
           KK K+ +KKE K+K K+++++KKK+
Sbjct: 197 KKSKKPKKKEKKEKEKERDKDKKKE 221



 Score = 30.4 bits (68), Expect = 0.79
 Identities = 12/30 (40%), Positives = 20/30 (66%)

Query: 194 KKKEEEKKEAKKKTKKKEEEKKKKTRMRGG 223
           +KK ++ K+ +KK K+KE +K KK  + G 
Sbjct: 196 EKKSKKPKKKEKKEKEKERDKDKKKEVEGF 225



 Score = 30.4 bits (68), Expect = 0.83
 Identities = 19/83 (22%), Positives = 31/83 (37%), Gaps = 2/83 (2%)

Query: 147 KRVVAFMLLNVWV--SPAVISFVPIMCGWYTTSEHLKYRYQHPNICDFKKKKEEEKKEAK 204
           K V  F  L + +  SPA  + V         +                 + EE KK  K
Sbjct: 220 KEVEGFKSLLLALDDSPASAASVAEADEASLANTVSGTAPDSEPDEPKDAEAEETKKSPK 279

Query: 205 KKTKKKEEEKKKKTRMRGGIERQ 227
            K KK+ +EK++K + +     +
Sbjct: 280 HKKKKQRKEKEEKKKKKKHHHHR 302



 Score = 29.3 bits (65), Expect = 2.2
 Identities = 10/32 (31%), Positives = 20/32 (62%)

Query: 196 KEEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
           K EE++  +++ +K + EKKK+ + + G  R 
Sbjct: 83  KLEEERRHRQRLEKDKREKKKREKEKRGRRRH 114



 Score = 28.9 bits (64), Expect = 2.4
 Identities = 11/28 (39%), Positives = 18/28 (64%)

Query: 195 KKEEEKKEAKKKTKKKEEEKKKKTRMRG 222
           K EEE++  ++  K K E+KK++   RG
Sbjct: 83  KLEEERRHRQRLEKDKREKKKREKEKRG 110



 Score = 28.5 bits (63), Expect = 3.1
 Identities = 8/27 (29%), Positives = 18/27 (66%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
           ++++  ++ E  K+ KKK E++K+  R
Sbjct: 86  EERRHRQRLEKDKREKKKREKEKRGRR 112


>gnl|CDD|147641 pfam05582, Peptidase_U57, YabG peptidase U57.  YabG is a protease
           involved in the proteolysis and maturation of SpoIVA and
           YrbA proteins, conserved with the cortex and/or coat
           assembly by Bacillus subtilis.
          Length = 287

 Score = 30.2 bits (69), Expect = 0.72
 Identities = 10/37 (27%), Positives = 16/37 (43%)

Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
           D +K    E    KK+ ++K  E  K  R    + R+
Sbjct: 49  DLEKVDASEVTRYKKRFEEKINESLKLIRQDRKLMRE 85


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 30.8 bits (70), Expect = 0.72
 Identities = 14/25 (56%), Positives = 19/25 (76%)

Query: 197 EEEKKEAKKKTKKKEEEKKKKTRMR 221
           EE+ K+ KKK KKKEEE K++ + R
Sbjct: 742 EEKTKKKKKKEKKKEEEYKREEKAR 766



 Score = 28.9 bits (65), Expect = 2.9
 Identities = 11/24 (45%), Positives = 18/24 (75%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKK 216
           +K K+++KKE KK+ + K EEK +
Sbjct: 743 EKTKKKKKKEKKKEEEYKREEKAR 766


>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
           complex component).  RNases H are enzymes that
           specifically hydrolyse RNA when annealed to a
           complementary DNA and are present in all living
           organisms. In yeast RNase H2 is composed of a complex of
           three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
           family represents the homologues of Ydr279p. It is not
           known whether non yeast proteins in this family fulfil
           the same function.
          Length = 287

 Score = 30.4 bits (69), Expect = 0.76
 Identities = 15/37 (40%), Positives = 19/37 (51%), Gaps = 6/37 (16%)

Query: 190 CDFKKKK------EEEKKEAKKKTKKKEEEKKKKTRM 220
              KKK+      E  +  A+KK K KEE KKKK + 
Sbjct: 229 ESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKE 265



 Score = 27.7 bits (62), Expect = 4.5
 Identities = 11/31 (35%), Positives = 16/31 (51%)

Query: 194 KKKEEEKKEAKKKTKKKEEEKKKKTRMRGGI 224
           K KEE KK+  K++K  +  KK   +    I
Sbjct: 253 KSKEEIKKKKPKESKGVKALKKVVAKGMKKI 283


>gnl|CDD|240577 cd12950, RRP7_Rrp7p, RRP7 domain ribosomal RNA-processing protein 7
           (Rrp7p) and similar proteins.  This CD corresponds to
           the RRP7 domain of Rrp7p. Rrp7p is encoded by YCL031C
           gene from Saccharomyces cerevisiae. It is an essential
           yeast protein involved in pre-rRNA processing and
           ribosome assembly. Rrp7p contains an N-terminal RNA
           recognition motif (RRM), also termed RBD (RNA binding
           domain) or RNP (ribonucleoprotein domain), and a
           C-terminal RRP7 domain.
          Length = 128

 Score = 29.2 bits (66), Expect = 0.77
 Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 10/37 (27%)

Query: 191 DFKKKKEEEKKEAKKKTKKKEE----------EKKKK 217
           +  K  EEEKKE +KK KKK+E          EKKK+
Sbjct: 69  EAGKAAEEEKKEKEKKKKKKKELEDFYRFQLREKKKE 105


>gnl|CDD|203848 pfam08079, Ribosomal_L30_N, Ribosomal L30 N-terminal domain.  This
           presumed domain is found at the N-terminus of Ribosomal
           L30 proteins and has been termed RL30NT or NUC018.
          Length = 71

 Score = 28.3 bits (64), Expect = 0.78
 Identities = 7/27 (25%), Positives = 16/27 (59%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
           K+K+ E+ +  + K +  ++  +KK R
Sbjct: 5   KRKRNEKLRAKRAKKRAAKKAARKKKR 31


>gnl|CDD|111875 pfam03032, Brevenin, Brevenin/esculentin/gaegurin/rugosin family.
           This family contains a number of defence peptides
           secreted from the skin of amphibians, including the
           opiate-like dermorphins and deltorphins, and the
           antimicrobial dermoseptins and temporins. The alignment
           for this family includes the signal peptide.
          Length = 46

 Score = 27.7 bits (62), Expect = 0.82
 Identities = 7/27 (25%), Positives = 21/27 (77%)

Query: 189 ICDFKKKKEEEKKEAKKKTKKKEEEKK 215
           +C+ +K+++EE+ E +++ +++ E K+
Sbjct: 20  LCEEEKREDEEENEDEEEGEEQSEVKR 46


>gnl|CDD|204122 pfam09072, TMA7, Translation machinery associated TMA7.  TMA7 plays
           a role in protein translation. Deletions of the TMA7
           gene results in altered protein synthesis rates.
          Length = 63

 Score = 27.9 bits (62), Expect = 0.83
 Identities = 11/25 (44%), Positives = 17/25 (68%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
           K+KKE ++++   K K+KEE K  K
Sbjct: 16  KQKKELDEEDLAFKQKQKEEAKALK 40


>gnl|CDD|215525 PLN02972, PLN02972, Histidyl-tRNA synthetase.
          Length = 763

 Score = 30.2 bits (68), Expect = 0.86
 Identities = 12/31 (38%), Positives = 18/31 (58%)

Query: 192 FKKKKEEEKKEAKKKTKKKEEEKKKKTRMRG 222
            +K + E + EA K  KK ++ KKKK   +G
Sbjct: 233 IEKPQAEAENEAGKSDKKDKKSKKKKVLGKG 263


>gnl|CDD|214697 smart00498, FH2, Formin Homology 2 Domain.  FH proteins control
           rearrangements of the actin cytoskeleton, especially in
           the context of cytokinesis and cell polarisation.
           Members of this family have been found to interact with
           Rho-GTPases, profilin and other actin-assoziated
           proteins. These interactions are mediated by the
           proline-rich FH1 domain, usually located in front of FH2
           (but not listed in SMART). Despite this cytosolic
           function, vertebrate formins have been assigned
           functions within the nucleus. A set of Formin-Binding
           Proteins (FBPs) has been shown to bind FH1 with their WW
           domain.
          Length = 392

 Score = 30.0 bits (68), Expect = 0.95
 Identities = 9/36 (25%), Positives = 19/36 (52%)

Query: 192 FKKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
           F K  EE  K+ +++ +++++  K+ T       RQ
Sbjct: 330 FSKAAEENIKKEEEEEERRKKLVKETTEYEQSSSRQ 365


>gnl|CDD|216941 pfam02233, PNTB, NAD(P) transhydrogenase beta subunit.  This family
           corresponds to the beta subunit of NADP transhydrogenase
           in prokaryotes, and either the protein N- or C terminal
           in eukaryotes. The domain is often found in conjunction
           with pfam01262. Pyridine nucleotide transhydrogenase
           catalyzes the reduction of NAD+ to NADPH. A complete
           loss of activity occurs upon mutation of Gly314 in E.
           coli.
          Length = 464

 Score = 30.0 bits (68), Expect = 0.96
 Identities = 12/41 (29%), Positives = 28/41 (68%), Gaps = 1/41 (2%)

Query: 20  STSPDENVSSIIWMTLRASIMIFIIVASIGG-NLLVIVSVM 59
           S SP   + +++ MT  + ++ + +VA+IGG ++ V++S++
Sbjct: 181 SESPGTGMLALLLMTAISLVLGWHLVAAIGGADMPVVISML 221


>gnl|CDD|221756 pfam12757, DUF3812, Protein of unknown function (DUF3812).  This is
           a family of fungal proteins whose function is not known.
          Length = 126

 Score = 28.8 bits (65), Expect = 1.0
 Identities = 12/24 (50%), Positives = 16/24 (66%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKK 216
           KK  EEE K   ++ K++E EKKK
Sbjct: 103 KKLDEEEAKRQHEEAKEREREKKK 126


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 29.2 bits (66), Expect = 1.0
 Identities = 6/29 (20%), Positives = 18/29 (62%), Gaps = 1/29 (3%)

Query: 194 KKKEEEKKEAKKKTKKKEEEKKKKT-RMR 221
           +  E+E  + K++ +++ E K+K+  + +
Sbjct: 122 ELLEKELAKLKREKRRENERKQKEILKEQ 150



 Score = 27.3 bits (61), Expect = 4.5
 Identities = 8/26 (30%), Positives = 16/26 (61%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
            K K E+++E ++K K+  +E+ K  
Sbjct: 129 AKLKREKRRENERKQKEILKEQMKML 154



 Score = 26.5 bits (59), Expect = 8.8
 Identities = 8/34 (23%), Positives = 19/34 (55%), Gaps = 3/34 (8%)

Query: 191 DFKKKKEEEKKE---AKKKTKKKEEEKKKKTRMR 221
           + ++  EEE+ +    K+  K K E++++  R +
Sbjct: 110 EVEELDEEEQIDELLEKELAKLKREKRRENERKQ 143


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 30.0 bits (68), Expect = 1.0
 Identities = 12/29 (41%), Positives = 20/29 (68%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
           K++KE  K++ + +  KK+EEKKKK   +
Sbjct: 557 KEEKEALKEQKRLRKLKKQEEKKKKELEK 585



 Score = 28.8 bits (65), Expect = 2.5
 Identities = 8/29 (27%), Positives = 20/29 (68%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
           +K+  +E+K  +K  K++E++KK+  ++ 
Sbjct: 559 EKEALKEQKRLRKLKKQEEKKKKELEKLE 587



 Score = 28.8 bits (65), Expect = 2.6
 Identities = 10/25 (40%), Positives = 17/25 (68%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
           K++K   K + +++ KKKE EK +K
Sbjct: 564 KEQKRLRKLKKQEEKKKKELEKLEK 588



 Score = 27.7 bits (62), Expect = 5.6
 Identities = 11/26 (42%), Positives = 19/26 (73%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
           +K+++E  KE K+  K K++E+KKK 
Sbjct: 556 EKEEKEALKEQKRLRKLKKQEEKKKK 581


>gnl|CDD|220818 pfam10595, UPF0564, Uncharacterized protein family UPF0564.  This
           family of proteins has no known function. However, one
           of the members is annotated as an EF-hand family
           protein.
          Length = 349

 Score = 29.7 bits (67), Expect = 1.1
 Identities = 10/40 (25%), Positives = 24/40 (60%)

Query: 188 NICDFKKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
           ++  F + + +EK+  +++ KK+ E++KKK +      +Q
Sbjct: 273 SLVKFLRTERKEKEAKEQQEKKELEQRKKKKKEMAPKVKQ 312



 Score = 27.4 bits (61), Expect = 5.7
 Identities = 10/26 (38%), Positives = 16/26 (61%)

Query: 192 FKKKKEEEKKEAKKKTKKKEEEKKKK 217
            + K EEE  + +KK ++ E +KK K
Sbjct: 25  RQSKLEEELNKLEKKEEEAECKKKFK 50


>gnl|CDD|234582 PRK00028, infC, translation initiation factor IF-3; Reviewed.
          Length = 177

 Score = 29.4 bits (67), Expect = 1.1
 Identities = 16/45 (35%), Positives = 24/45 (53%), Gaps = 13/45 (28%)

Query: 181 KYRYQHPNICDFKKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIE 225
           K++Y+       +KKK+   KEAKKK K  E    K+ + R  I+
Sbjct: 70  KFKYE-------QKKKQ---KEAKKKQKVIE---VKEIKFRPKID 101


>gnl|CDD|218771 pfam05835, Synaphin, Synaphin protein.  This family consists of
           several eukaryotic synaphin 1 and 2 proteins.
           Synaphin/complexin is a cytosolic protein that
           preferentially binds to syntaxin within the SNARE
           complex. Synaphin promotes SNAREs to form precomplexes
           that oligomerise into higher order structures. A peptide
           from the central, syntaxin binding domain of synaphin
           competitively inhibits these two proteins from
           interacting and prevents SNARE complexes from
           oligomerising. It is thought that oligomerisation of
           SNARE complexes into a higher order structure creates a
           SNARE scaffold for efficient, regulated fusion of
           synaptic vesicles. Synaphin promotes neuronal exocytosis
           by promoting interaction between the complementary
           syntaxin and synaptobrevin transmembrane regions that
           reside in opposing membranes prior to fusion.
          Length = 139

 Score = 29.1 bits (65), Expect = 1.1
 Identities = 10/36 (27%), Positives = 22/36 (61%), Gaps = 5/36 (13%)

Query: 194 KKKEEEKKEAKKKTKKKEEEKKKK-----TRMRGGI 224
           ++++EE +EA ++ +++ + K +K       MR GI
Sbjct: 33  EEEDEEIQEALREAEEERKAKHRKMEEEREVMRQGI 68


>gnl|CDD|204032 pfam08703, PLC-beta_C, PLC-beta C terminal.  This domain
           corresponds to the alpha helical C terminal domain of
           phospholipase C beta.
          Length = 181

 Score = 29.4 bits (66), Expect = 1.2
 Identities = 17/40 (42%), Positives = 23/40 (57%), Gaps = 5/40 (12%)

Query: 192 FKKKKE---EEKKEAKKKTKKKEEEK--KKKTRMRGGIER 226
            KK KE   +EKKE KKK  +K  E+  + KT+ +   ER
Sbjct: 54  LKKLKEISEKEKKELKKKLDRKRLERIQEAKTKEKHAQER 93


>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein.  This
           family includes archaebacterial L12, eukaryotic P0, P1
           and P2.
          Length = 88

 Score = 28.0 bits (63), Expect = 1.2
 Identities = 6/20 (30%), Positives = 11/20 (55%)

Query: 194 KKKEEEKKEAKKKTKKKEEE 213
                  +E KK+ +++EEE
Sbjct: 60  AAAAAAAEEEKKEEEEEEEE 79


>gnl|CDD|169428 PRK08404, PRK08404, V-type ATP synthase subunit H; Validated.
          Length = 103

 Score = 28.2 bits (63), Expect = 1.3
 Identities = 15/33 (45%), Positives = 19/33 (57%), Gaps = 3/33 (9%)

Query: 193 KKKKEEEKK---EAKKKTKKKEEEKKKKTRMRG 222
           +K KEE KK   +AK++ KK EEE  KK     
Sbjct: 20  EKAKEEAKKIIRKAKEEAKKIEEEIIKKAEEEA 52



 Score = 26.3 bits (58), Expect = 5.9
 Identities = 14/32 (43%), Positives = 19/32 (59%)

Query: 194 KKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIE 225
           KK EEE ++  +K KK+ EE+ KK    G  E
Sbjct: 46  KKAEEEAQKLIEKKKKEGEEEAKKILEEGEKE 77


>gnl|CDD|216985 pfam02349, MSG, Major surface glycoprotein.  This is a novel repeat
           in Pneumocystis carinii Major surface glycoprotein (MSG)
           some members of the alignment have up to nine repeats of
           this family, the repeats containing several conserved
           cysteines. The MSG of P. carinii is an important protein
           in host-pathogen interactions. Surface glycoprotein A
           from Pneumocystis carinii is a main target for the host
           immune system, this protein is implicated in the
           attachment of Pneumocystis carinii to the host alveolar
           epithelial cells, alveolar macrophages, host surfactant
           and possibly accounts in part for the hypoxia seen in
           Pneumocystis carinii pneumonia (PCP).
          Length = 81

 Score = 27.8 bits (62), Expect = 1.3
 Identities = 11/34 (32%), Positives = 16/34 (47%), Gaps = 4/34 (11%)

Query: 188 NICDFKKKKEEEK----KEAKKKTKKKEEEKKKK 217
             C   KK  + +    KE K + KKK++ KK  
Sbjct: 35  EKCYKLKKDLKLEELLLKELKGELKKKKKCKKAL 68


>gnl|CDD|222592 pfam14204, Ribosomal_L18_c, Ribosomal L18 C-terminal region.  This
           domain is the C-terminal end of ribosomal L18/L5
           proteins.
          Length = 93

 Score = 27.9 bits (63), Expect = 1.4
 Identities = 11/27 (40%), Positives = 18/27 (66%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
           KK+K+E K E+K+   KK   +++K R
Sbjct: 63  KKEKKEVKAESKRYNAKKLTLEQRKAR 89


>gnl|CDD|184861 PRK14859, tatA, twin arginine translocase protein A; Provisional.
          Length = 63

 Score = 27.4 bits (61), Expect = 1.4
 Identities = 13/24 (54%), Positives = 17/24 (70%)

Query: 194 KKKEEEKKEAKKKTKKKEEEKKKK 217
           KK   EK+E + K  KKE++KKKK
Sbjct: 40  KKATSEKEEIEIKPTKKEDKKKKK 63



 Score = 26.2 bits (58), Expect = 3.1
 Identities = 12/28 (42%), Positives = 20/28 (71%)

Query: 189 ICDFKKKKEEEKKEAKKKTKKKEEEKKK 216
           I +FKK   E+++   K TKK++++KKK
Sbjct: 36  IKNFKKATSEKEEIEIKPTKKEDKKKKK 63


>gnl|CDD|220624 pfam10187, Nefa_Nip30_N, N-terminal domain of NEFA-interacting
           nuclear protein NIP30.  This is a the N-terminal 100
           amino acids of a family of proteins conserved from
           plants to humans. The full-length protein has putatively
           been called NEFA-interacting nuclear protein NIP30,
           however no reference could be found to confirm this.
          Length = 99

 Score = 28.1 bits (63), Expect = 1.5
 Identities = 10/29 (34%), Positives = 20/29 (68%)

Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKKTR 219
           +F  + EE ++ A+KK K++E E+ ++ R
Sbjct: 71  EFLDEVEESRRAAEKKRKREEAEELEEFR 99


>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34.  This family represents
           herpes virus protein U79 and cytomegalovirus early
           phosphoprotein P34 (UL112).
          Length = 238

 Score = 29.1 bits (65), Expect = 1.5
 Identities = 9/28 (32%), Positives = 16/28 (57%)

Query: 196 KEEEKKEAKKKTKKKEEEKKKKTRMRGG 223
           K++EK+  +   K KE+ +KK+   R  
Sbjct: 165 KQKEKRRVEDSQKHKEDRRKKQEEKRRN 192


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
           envelope biogenesis, outer membrane].
          Length = 387

 Score = 29.5 bits (66), Expect = 1.5
 Identities = 9/24 (37%), Positives = 16/24 (66%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKK 216
           K+++E+ +K A ++ KK E  K K
Sbjct: 132 KQQEEQARKAAAEQKKKAEAAKAK 155


>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
          Length = 106

 Score = 28.0 bits (63), Expect = 1.6
 Identities = 7/20 (35%), Positives = 14/20 (70%)

Query: 194 KKKEEEKKEAKKKTKKKEEE 213
              EE+K+E +++ +K+E E
Sbjct: 75  AAAEEKKEEEEEEEEKEESE 94



 Score = 28.0 bits (63), Expect = 1.9
 Identities = 6/22 (27%), Positives = 14/22 (63%)

Query: 197 EEEKKEAKKKTKKKEEEKKKKT 218
                  +KK +++EEE+K+++
Sbjct: 72  AAAAAAEEKKEEEEEEEEKEES 93


>gnl|CDD|219913 pfam08576, DUF1764, Eukaryotic protein of unknown function
           (DUF1764).  This is a family of eukaryotic proteins of
           unknown function. This family contains many hypothetical
           proteins.
          Length = 98

 Score = 27.8 bits (62), Expect = 1.6
 Identities = 12/27 (44%), Positives = 16/27 (59%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
           KKKK+   K A+ K  KK ++K KK  
Sbjct: 25  KKKKKRTAKTARPKATKKGQKKDKKKD 51


>gnl|CDD|218270 pfam04795, PAPA-1, PAPA-1-like conserved region.  Family of
           proteins with a conserved region found in PAPA-1, a
           PAP-1 binding protein.
          Length = 89

 Score = 27.7 bits (62), Expect = 1.6
 Identities = 11/33 (33%), Positives = 16/33 (48%)

Query: 194 KKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIER 226
           K+ EEEK E   K  KK+  +KK+       + 
Sbjct: 19  KRAEEEKMETINKLLKKQARRKKREDKIKKGDE 51


>gnl|CDD|219882 pfam08524, rRNA_processing, rRNA processing.  This is a family of
           proteins that are involved in rRNA processing. In a
           localisation study they were found to localise to the
           nucleus and nucleolus. The family also includes other
           metazoa members from plants to mammals where the protein
           has been named BR22 and is associated with TTF-1,
           thyroid transcription factor 1. In the lungs, the family
           binds TTF-1 to form a complex which influences the
           expression of the key lung surfactant protein-B (SP-B)
           and -C (SP-C), the small hydrophobic surfactant proteins
           that maintain surface tension in alveoli.
          Length = 150

 Score = 28.7 bits (64), Expect = 1.7
 Identities = 15/36 (41%), Positives = 22/36 (61%), Gaps = 2/36 (5%)

Query: 193 KKKKEEEKKEAKKKTK--KKEEEKKKKTRMRGGIER 226
           KKKK +EKKE  K+ K  ++E+E  K+ +    IE 
Sbjct: 70  KKKKLDEKKEIAKQRKREQREKELAKRQKELEKIEL 105


>gnl|CDD|218148 pfam04557, tRNA_synt_1c_R2, Glutaminyl-tRNA synthetase,
           non-specific RNA binding region part 2.  This is a
           region found N terminal to the catalytic domain of
           glutaminyl-tRNA synthetase (EC 6.1.1.18) in eukaryotes
           but not in Escherichia coli. This region is thought to
           bind RNA in a non-specific manner, enhancing
           interactions between the tRNA and enzyme, but is not
           essential for enzyme function.
          Length = 83

 Score = 27.3 bits (61), Expect = 1.9
 Identities = 11/28 (39%), Positives = 13/28 (46%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRM 220
           KKKK+ E   A  K KK   E   +  M
Sbjct: 29  KKKKKAEDTAATAKAKKATAEDVSEGAM 56



 Score = 26.9 bits (60), Expect = 2.4
 Identities = 10/24 (41%), Positives = 13/24 (54%)

Query: 195 KKEEEKKEAKKKTKKKEEEKKKKT 218
           K E +  + KKK KKK+ E    T
Sbjct: 17  KTEADLVKKKKKKKKKKAEDTAAT 40



 Score = 26.6 bits (59), Expect = 4.2
 Identities = 11/28 (39%), Positives = 13/28 (46%), Gaps = 3/28 (10%)

Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKKT 218
           D  KKK   KK+ KKK +      K K 
Sbjct: 21  DLVKKK---KKKKKKKAEDTAATAKAKK 45


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 29.3 bits (66), Expect = 1.9
 Identities = 7/21 (33%), Positives = 14/21 (66%)

Query: 193 KKKKEEEKKEAKKKTKKKEEE 213
           K++KE E++  ++K  KK+  
Sbjct: 287 KRRKELEREAKEEKQLKKKLA 307



 Score = 28.5 bits (64), Expect = 3.2
 Identities = 8/30 (26%), Positives = 18/30 (60%), Gaps = 3/30 (10%)

Query: 193 KKKKEEEK---KEAKKKTKKKEEEKKKKTR 219
           K+K + ++   K  K+  ++ +EEK+ K +
Sbjct: 276 KRKTKAQRNKEKRRKELEREAKEEKQLKKK 305



 Score = 28.5 bits (64), Expect = 3.3
 Identities = 7/29 (24%), Positives = 19/29 (65%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
           +K K +  KE ++K  ++E +++K+ + +
Sbjct: 277 RKTKAQRNKEKRRKELEREAKEEKQLKKK 305



 Score = 28.1 bits (63), Expect = 4.1
 Identities = 12/29 (41%), Positives = 20/29 (68%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
           K+  ++EK  A+KK ++KE  +KKK + R
Sbjct: 317 KEVAQKEKARARKKEQRKERGEKKKLKRR 345


>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon. 
          Length = 431

 Score = 29.2 bits (65), Expect = 1.9
 Identities = 15/45 (33%), Positives = 28/45 (62%), Gaps = 3/45 (6%)

Query: 178 EHLKYRYQHPN--ICDFKKKKEEEKKEAKKKT-KKKEEEKKKKTR 219
           E LK + Q     + + KKK+EE +K  +++  ++K+EE  +K+R
Sbjct: 203 EKLKQKQQEAALELEELKKKREERRKVLEEEEQRRKQEEADRKSR 247


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 29.0 bits (65), Expect = 2.1
 Identities = 14/33 (42%), Positives = 19/33 (57%), Gaps = 3/33 (9%)

Query: 192 FKKKKEEEKK---EAKKKTKKKEEEKKKKTRMR 221
           F KKK +EKK   EA + T K+E E++   R  
Sbjct: 221 FFKKKTKEKKEKKEASESTVKEESEEESGKRDV 253


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
           approximately 300 residues, found in plants and
           vertebrates. They contain a highly conserved DDRGK
           motif.
          Length = 189

 Score = 28.5 bits (64), Expect = 2.2
 Identities = 12/39 (30%), Positives = 24/39 (61%), Gaps = 4/39 (10%)

Query: 193 KKKKEEEKKEAKKKTKKKE----EEKKKKTRMRGGIERQ 227
           KK+ + E+K+A+++ ++ E    EE+KK    R G  ++
Sbjct: 6   KKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKE 44



 Score = 27.7 bits (62), Expect = 3.6
 Identities = 10/25 (40%), Positives = 20/25 (80%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
           +KK EE+++  +K+ ++ EEE++KK
Sbjct: 31  RKKLEEKREGERKEEEELEEEREKK 55



 Score = 27.7 bits (62), Expect = 4.4
 Identities = 10/24 (41%), Positives = 20/24 (83%)

Query: 194 KKKEEEKKEAKKKTKKKEEEKKKK 217
           KKKEEE+++ +++  +KE+E+ +K
Sbjct: 54  KKKEEEERKEREEQARKEQEEYEK 77


>gnl|CDD|107372 cd06377, PBP1_iGluR_NMDA_NR3, N-terminal
           leucine/isoleucine/valine-binding protein (LIVBP)-like
           domain of the NR3 subunit of NMDA receptor family.
           N-terminal leucine/isoleucine/valine-binding protein
           (LIVBP)-like domain of the NR3 subunit of NMDA receptor
           family. The ionotropic N-methyl-d-asparate (NMDA)
           subtype of glutamate receptor serves critical functions
           in neuronal development, functioning, and degeneration
           in the mammalian central nervous system. The functional
           NMDA receptor is a heterotetramer composed of two NR1
           and two NR2 (A, B, C, and D) or of NR3 (A and B)
           subunits. The receptor controls a cation channel that is
           highly permeable to monovalent ions and calcium and
           exhibits voltage-dependent inhibition by magnesium. Dual
           agonists, glutamate and glycine, are required for
           efficient activation of the NMDA receptor. Among NMDA
           receptor subtypes, the NR2B subunit containing receptors
           appear particularly important for pain perception; thus
           NR2B-selective antagonists may be useful in the
           treatment of chronic pain.
          Length = 382

 Score = 29.0 bits (65), Expect = 2.3
 Identities = 12/56 (21%), Positives = 27/56 (48%), Gaps = 1/56 (1%)

Query: 55  IVSVMRHRKLRIITNYFVVSLAFADMLVAMVAMTFNASVQVTGKWNFSFFMCDVWN 110
           +VS++R    R   N F + +++A  L  ++ +   + +Q  G  + S  +C   +
Sbjct: 109 VVSIVRREFPRGSQNPFHLQMSWASPLSTLLDVLL-SVLQRNGWEDVSLVLCRERD 163


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 28.9 bits (65), Expect = 2.3
 Identities = 8/24 (33%), Positives = 16/24 (66%)

Query: 194 KKKEEEKKEAKKKTKKKEEEKKKK 217
           KK+++++K  ++   K E  KK+K
Sbjct: 627 KKRKKKRKRKRRFLTKIEGVKKEK 650


>gnl|CDD|227238 COG4901, COG4901, Ribosomal protein S25 [Translation, ribosomal
           structure and biogenesis].
          Length = 107

 Score = 27.6 bits (61), Expect = 2.3
 Identities = 15/35 (42%), Positives = 18/35 (51%)

Query: 187 PNICDFKKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
           P     K+KK E+ K    K KKK  +KKKK   R
Sbjct: 3   PKSQLSKEKKAEKAKAGTAKDKKKWSKKKKKEEAR 37


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 28.9 bits (65), Expect = 2.3
 Identities = 13/25 (52%), Positives = 22/25 (88%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
           K KKE++  E+KKK++KK+++KK+K
Sbjct: 56  KDKKEDKNNESKKKSEKKKKKKKEK 80



 Score = 28.5 bits (64), Expect = 3.1
 Identities = 12/26 (46%), Positives = 16/26 (61%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
           KKKK++EKKE K + + K   K  K 
Sbjct: 73  KKKKKKEKKEPKSEGETKLGFKTPKK 98



 Score = 27.8 bits (62), Expect = 5.9
 Identities = 12/26 (46%), Positives = 16/26 (61%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
            K  E +KK  KKK KKKE+++ K  
Sbjct: 61  DKNNESKKKSEKKKKKKKEKKEPKSE 86


>gnl|CDD|153288 cd07604, BAR_ASAPs, The Bin/Amphiphysin/Rvs (BAR) domain of ArfGAP
           with SH3 domain, ANK repeat and PH domain containing
           proteins.  BAR domains are dimerization, lipid binding
           and curvature sensing modules found in many different
           proteins with diverse functions. This subfamily is
           composed of ASAPs (ArfGAP with SH3 domain, ANK repeat
           and PH domain containing proteins), which are Arf GTPase
           activating proteins (GAPs) with similarity to ACAPs
           (ArfGAP with Coiled-coil, ANK repeat and PH domain
           containing proteins) in that they contain an N-terminal
           BAR domain, followed by a Pleckstrin homology (PH)
           domain, an Arf GAP domain, and ankyrin (ANK) repeats.
           However, ASAPs contain an additional C-terminal SH3
           domain. ASAPs function in regulating cell growth,
           migration, and invasion. Vertebrates contain at least
           three members, ASAP1, ASAP2, and ASAP3. ASAP1 and ASAP2
           shows GTPase activating protein (GAP) activity towards
           Arf1 and Arf5. They do not show GAP activity towards
           Arf6, but is able to mediate Arf6 signaling by binding
           stably to GTP-Arf6. ASAP3 is an Arf6-specific GAP. BAR
           domains form dimers that bind to membranes, induce
           membrane bending and curvature, and may also be involved
           in protein-protein interactions. The BAR domain of ASAP1
           mediates membrane bending, is essential for function,
           and autoinhibits GAP activity by interacting with the PH
           and/or Arf GAP domains.
          Length = 215

 Score = 28.5 bits (64), Expect = 2.3
 Identities = 12/34 (35%), Positives = 19/34 (55%)

Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGI 224
           D KK  ++  K+ + K  K E+EKK+  +  G I
Sbjct: 108 DLKKPFDKAWKDYETKASKIEKEKKQLAKEAGMI 141


>gnl|CDD|220838 pfam10659, Trypan_glycop_C, Trypanosome variant surface
           glycoprotein C-terminal domain.  The trypanosome
           parasite expresses these proteins to evade the immune
           response.
          Length = 98

 Score = 27.4 bits (61), Expect = 2.4
 Identities = 8/31 (25%), Positives = 15/31 (48%)

Query: 188 NICDFKKKKEEEKKEAKKKTKKKEEEKKKKT 218
             C + KK+++ K + K+   KK      +T
Sbjct: 14  KGCKWDKKEDDGKCKPKEGKAKKNGAPVTQT 44



 Score = 27.4 bits (61), Expect = 2.5
 Identities = 13/38 (34%), Positives = 16/38 (42%), Gaps = 9/38 (23%)

Query: 190 CDFKKKKEEE---------KKEAKKKTKKKEEEKKKKT 218
           C+ K K + E         KKE   K K KE + KK  
Sbjct: 1   CNKKNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKNG 38



 Score = 26.2 bits (58), Expect = 6.4
 Identities = 8/40 (20%), Positives = 15/40 (37%), Gaps = 11/40 (27%)

Query: 190 CDFKKKKEEEK-----------KEAKKKTKKKEEEKKKKT 218
           C  K+ K ++               +K   KK+++  KK 
Sbjct: 27  CKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKKDCKKG 66


>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200).  This
           family is found in eukaryotes. It is a coiled-coil
           domain of unknwon function.
          Length = 126

 Score = 28.0 bits (63), Expect = 2.4
 Identities = 12/41 (29%), Positives = 23/41 (56%), Gaps = 1/41 (2%)

Query: 188 NICDFKKKKEEEK-KEAKKKTKKKEEEKKKKTRMRGGIERQ 227
              + K+++ E+K +E KK  K+KEEE K+       ++ +
Sbjct: 56  KENEAKRRRAEKKAEEEKKLRKEKEEEIKELKAELEELKAE 96



 Score = 26.8 bits (60), Expect = 5.3
 Identities = 8/32 (25%), Positives = 20/32 (62%)

Query: 188 NICDFKKKKEEEKKEAKKKTKKKEEEKKKKTR 219
               F K+ E +++ A+KK +++++ +K+K  
Sbjct: 50  KFDKFLKENEAKRRRAEKKAEEEKKLRKEKEE 81


>gnl|CDD|239572 cd03490, Topoisomer_IB_N_1, Topoisomer_IB_N_1: A subgroup of the
           N-terminal DNA binding fragment found in eukaryotic DNA
           topoisomerase (topo) IB. Topo IB proteins include the
           monomeric yeast and human topo I and heterodimeric topo
           I from Leishmania donvanni. Topo I enzymes are divided
           into:  topo type IA (bacterial) and type IB
           (eukaryotic). Topo I relaxes superhelical tension in
           duplex DNA by creating a single-strand nick, the broken
           strand can then rotate around the unbroken strand to
           remove DNA supercoils and, the nick is religated,
           liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit religation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topos I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topos I have putative roles
           in organizing the kinetoplast DNA network unique to
           these parasites.  This family may represent more than
           one structural domain.
          Length = 217

 Score = 28.3 bits (63), Expect = 2.5
 Identities = 12/36 (33%), Positives = 18/36 (50%)

Query: 190 CDFKKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIE 225
            DF   K   ++E +KK    +EEK+ K + R   E
Sbjct: 84  ADFSLIKNHLEEEKEKKKNLNKEEKEAKKKERAKRE 119


>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P.  This model
           represents the L12P protein of the large (50S) subunit
           of the archaeal ribosome.
          Length = 105

 Score = 27.3 bits (61), Expect = 2.6
 Identities = 7/20 (35%), Positives = 17/20 (85%)

Query: 194 KKKEEEKKEAKKKTKKKEEE 213
           +++EEE++E +++ ++ EEE
Sbjct: 76  EEEEEEEEEEEEEEEESEEE 95


>gnl|CDD|185582 PTZ00373, PTZ00373, 60S Acidic ribosomal protein P2; Provisional.
          Length = 112

 Score = 27.6 bits (61), Expect = 2.6
 Identities = 10/19 (52%), Positives = 15/19 (78%)

Query: 198 EEKKEAKKKTKKKEEEKKK 216
             K EAKK+ KK+EEE+++
Sbjct: 85  GAKAEAKKEEKKEEEEEEE 103


>gnl|CDD|221857 pfam12923, RRP7, Ribosomal RNA-processing protein 7 (RRP7).  RRP7
           is an essential protein in yeast that is involved in
           pre-rRNA processing and ribosome assembly. It is
           speculated to be required for correct assembly of rpS27
           into the pre-ribosomal particle.
          Length = 131

 Score = 27.6 bits (62), Expect = 2.7
 Identities = 9/23 (39%), Positives = 17/23 (73%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKK 215
           + K  EE+++ K+K KKK++E +
Sbjct: 68  RNKAAEERRKLKEKKKKKKKELE 90



 Score = 27.6 bits (62), Expect = 3.0
 Identities = 15/40 (37%), Positives = 22/40 (55%), Gaps = 11/40 (27%)

Query: 193 KKKKEEE------KKEAKKKT-----KKKEEEKKKKTRMR 221
           KKKKE E       +E KK+      KK EE+KK+  +++
Sbjct: 84  KKKKELENFYRFQIREKKKEELAELRKKFEEDKKRIEQLK 123


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 28.6 bits (64), Expect = 2.7
 Identities = 8/23 (34%), Positives = 17/23 (73%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKK 215
            +KK++E+ +  K+ ++K+E KK
Sbjct: 412 NEKKQKEQADEDKEKRQKDERKK 434



 Score = 28.2 bits (63), Expect = 3.7
 Identities = 6/24 (25%), Positives = 19/24 (79%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKK 216
           + +K+++++  + K K++++E+KK
Sbjct: 411 ENEKKQKEQADEDKEKRQKDERKK 434



 Score = 27.8 bits (62), Expect = 5.0
 Identities = 6/24 (25%), Positives = 17/24 (70%)

Query: 194 KKKEEEKKEAKKKTKKKEEEKKKK 217
           ++ E+++KE   + K+K ++ ++K
Sbjct: 410 EENEKKQKEQADEDKEKRQKDERK 433


>gnl|CDD|164575 CHL00199, infC, translation initiation factor 3; Provisional.
          Length = 182

 Score = 28.3 bits (63), Expect = 2.7
 Identities = 14/37 (37%), Positives = 21/37 (56%)

Query: 189 ICDFKKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIE 225
           I D+ K K  ++K AK+  KK+     K+ +MR  IE
Sbjct: 70  IIDYGKYKFTQEKRAKEAKKKQHNSSIKEVKMRYKIE 106


>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor.  The
           spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
           U5) and proteins, catalyzes the excision of introns from
           pre-mRNAs in two successive trans-esterification
           reactions. Step 2 depends upon integral spliceosome
           constituents such as U5 snRNA and Prp8 and
           non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
           ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
           conformational change in the spliceosome that leads to
           protection of the 3'ss from targeted RNase H cleavage.
           This change, which probably reflects binding of the 3'ss
           PyAG in the catalytic centre of the spliceosome,
           requires the ordered recruitment of Slu7, Prp18, and
           Prp22 to the spliceosome. There is a close functional
           relationship between Prp8, Prp18, and Slu7, and Prp18
           interacts with Slu7, so that together they recruit Prp22
           to the spliceosome. Most members of the family carry a
           zinc-finger of the CCHC-type upstream of this domain.
          Length = 236

 Score = 28.6 bits (64), Expect = 2.8
 Identities = 11/25 (44%), Positives = 15/25 (60%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
            +KKE+EKKE  K  KK+   +K  
Sbjct: 161 LRKKEKEKKEQLKIQKKQSLLEKYG 185


>gnl|CDD|146486 pfam03879, Cgr1, Cgr1 family.  Members of this family are
           coiled-coil proteins that are involved in pre-rRNA
           processing.
          Length = 105

 Score = 27.4 bits (61), Expect = 2.9
 Identities = 8/30 (26%), Positives = 17/30 (56%)

Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKKTRM 220
           + K +KE E++   +  K++   K++K R 
Sbjct: 49  ELKDEKEAERQRRIQAIKERRAAKEEKERY 78


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 28.5 bits (64), Expect = 3.0
 Identities = 8/30 (26%), Positives = 17/30 (56%)

Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKKTRM 220
           D  ++  EE +E   + ++ E EK+K + +
Sbjct: 148 DSDEELSEEDEEEAAEEEEAEAEKEKASEL 177


>gnl|CDD|240578 cd12951, RRP7_Rrp7A, RRP7 domain ribosomal RNA-processing protein 7
           homolog A (Rrp7A) and similar proteins.  The family
           corresponds to the RRP7 domain of Rrp7A, also termed
           gastric cancer antigen Zg14, and similar proteins which
           are yeast ribosomal RNA-processing protein 7 (Rrp7p)
           homologs mainly found in Metazoans. The cellular
           function of Rrp7A remains unclear currently. Rrp7A
           harbors an N-terminal RNA recognition motif (RRM), also
           termed RBD (RNA binding domain) or RNP
           (ribonucleoprotein domain), and a C-terminal RRP7
           domain.
          Length = 129

 Score = 27.6 bits (62), Expect = 3.0
 Identities = 9/25 (36%), Positives = 15/25 (60%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
           ++ K   K+    K  +KE++KKKK
Sbjct: 60  RRPKTARKESVAAKAAEKEKKKKKK 84


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 28.9 bits (65), Expect = 3.0
 Identities = 11/33 (33%), Positives = 19/33 (57%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIE 225
            +K E+E +E +K  ++ EEE  K T   G ++
Sbjct: 386 PEKLEKELEELEKAKEEIEEEISKITARIGELK 418



 Score = 28.1 bits (63), Expect = 4.9
 Identities = 10/23 (43%), Positives = 17/23 (73%)

Query: 195 KKEEEKKEAKKKTKKKEEEKKKK 217
           K+ EEK+E  ++ KKK +E +K+
Sbjct: 331 KELEEKEERLEELKKKLKELEKR 353


>gnl|CDD|221466 pfam12220, U1snRNP70_N, U1 small nuclear ribonucleoprotein of 70kDa
           MW N terminal.  This domain is found in eukaryotes. This
           domain is about 90 amino acids in length. This domain is
           found associated with pfam00076. This domain is part of
           U1 snRNP, which is the pre-mRNA binding protein of the
           penta-snRNP spliceosome complex. It extends over a
           distance of 180 A from its RNA binding domain, wraps
           around the core domain of U1 snRNP consisting of the
           seven Sm proteins and finally contacts U1-C, which is
           crucial for 5'-splice-site recognition.
          Length = 94

 Score = 26.8 bits (60), Expect = 3.2
 Identities = 12/23 (52%), Positives = 18/23 (78%)

Query: 195 KKEEEKKEAKKKTKKKEEEKKKK 217
           K+E EK+E K+K +KK EE+ K+
Sbjct: 65  KREREKREKKEKLEKKLEEELKE 87


>gnl|CDD|135898 PRK06397, PRK06397, V-type ATP synthase subunit H; Validated.
          Length = 111

 Score = 27.2 bits (60), Expect = 3.3
 Identities = 15/30 (50%), Positives = 18/30 (60%), Gaps = 4/30 (13%)

Query: 189 ICDFKKKKEEEKKEAKKKTKKKEEEKKKKT 218
           I + K ++E E KEA    K K EEK KKT
Sbjct: 26  IANIKNEQENEIKEA----KSKYEEKAKKT 51


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 28.5 bits (64), Expect = 3.4
 Identities = 9/31 (29%), Positives = 20/31 (64%)

Query: 197 EEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
           +E K+EA++K ++ EE  ++   +   +E+Q
Sbjct: 171 KERKEEAERKLERTEENLERLEDLLEELEKQ 201



 Score = 27.4 bits (61), Expect = 9.3
 Identities = 11/35 (31%), Positives = 18/35 (51%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
            +K EEE K  K + +  E+  ++  R    +ERQ
Sbjct: 683 LEKLEEELKSLKNELRSLEDLLEELRRQLEELERQ 717


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 27.8 bits (62), Expect = 3.4
 Identities = 9/26 (34%), Positives = 15/26 (57%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
           K+KK+ E  + K + +K + E KK  
Sbjct: 122 KEKKKREVPKPKTEKEKPKTEPKKPK 147



 Score = 27.0 bits (60), Expect = 5.0
 Identities = 8/27 (29%), Positives = 14/27 (51%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
           +++   E K   K+ KK+E  K K  +
Sbjct: 110 EQEDPPETKTESKEKKKREVPKPKTEK 136


>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
            biogenesis [Translation, ribosomal structure and
            biogenesis].
          Length = 1077

 Score = 28.6 bits (63), Expect = 3.4
 Identities = 7/27 (25%), Positives = 19/27 (70%)

Query: 193  KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
            K++ E  ++  +++  KKE+E++++ R
Sbjct: 1032 KERMESLQRAKEEEIGKKEKEREQRIR 1058


>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
           component YidC; Validated.
          Length = 429

 Score = 28.3 bits (63), Expect = 3.6
 Identities = 11/34 (32%), Positives = 17/34 (50%), Gaps = 1/34 (2%)

Query: 186 HPNICDFKKKKEEEKKEAKKKTKKKEEEKKKKTR 219
           H    + KK +  EK EA K  KK+  +K++   
Sbjct: 326 HAENAEIKKTRTAEKNEA-KARKKEIAQKRRAAE 358


>gnl|CDD|223866 COG0795, COG0795, Predicted permeases [General function prediction
           only].
          Length = 364

 Score = 28.1 bits (63), Expect = 3.8
 Identities = 11/53 (20%), Positives = 17/53 (32%)

Query: 27  VSSIIWMTLRASIMIFIIVASIGGNLLVIVSVMRHRKLRIITNYFVVSLAFAD 79
           V    W       +IFI      GN L  V + R      ++  +    A  +
Sbjct: 147 VEGGFWAKDGNDSLIFIESIDRDGNKLQGVFIKRFDDNGRLSVVYAAEGALFE 199


>gnl|CDD|240292 PTZ00148, PTZ00148, 40S ribosomal protein S8; Provisional.
          Length = 205

 Score = 27.6 bits (62), Expect = 3.9
 Identities = 14/32 (43%), Positives = 18/32 (56%)

Query: 190 CDFKKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
            D  KKK+ +K+E  KK KKK E    K  +R
Sbjct: 119 IDLGKKKKYKKEEENKKAKKKSESLVAKLEVR 150


>gnl|CDD|221712 pfam12687, DUF3801, Protein of unknown function (DUF3801).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 158 and 187 amino
           acids in length. This family includes the PcfB protein.
          Length = 137

 Score = 27.2 bits (61), Expect = 4.0
 Identities = 8/27 (29%), Positives = 18/27 (66%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
           +KK ++ K+E   K K+++++ K K +
Sbjct: 111 RKKLKKFKEEQAAKKKERKDKVKNKKK 137


>gnl|CDD|234306 TIGR03674, fen_arch, flap structure-specific endonuclease.
           Endonuclease that cleaves the 5'-overhanging flap
           structure that is generated by displacement synthesis
           when DNA polymerase encounters the 5'-end of a
           downstream Okazaki fragment. Has 5'-endo-/exonuclease
           and 5'-pseudo-Y-endonuclease activities. Cleaves the
           junction between single and double-stranded regions of
           flap DNA.
          Length = 338

 Score = 28.0 bits (63), Expect = 4.0
 Identities = 8/27 (29%), Positives = 18/27 (66%)

Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKK 217
           + K +  EE++E +++ ++K EE  +K
Sbjct: 85  ELKAETLEERREIREEAEEKWEEALEK 111


>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
           membranes [Cell envelope biogenesis, outer membrane].
          Length = 244

 Score = 27.8 bits (62), Expect = 4.0
 Identities = 11/27 (40%), Positives = 14/27 (51%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
           K+K + EKK  K K K K + K K   
Sbjct: 85  KEKPKPEKKPKKPKPKPKPKPKPKPKV 111


>gnl|CDD|165026 PHA02644, PHA02644, hypothetical protein; Provisional.
          Length = 112

 Score = 27.3 bits (59), Expect = 4.1
 Identities = 16/43 (37%), Positives = 24/43 (55%), Gaps = 6/43 (13%)

Query: 175 TTSEHLKYRYQHPNICDFKKKKEEEKKEAKKKTKKKEEEKKKK 217
           TT EH+K   +       +KK E+ + E +KK +K E E +KK
Sbjct: 54  TTHEHIKKENED------EKKPEKPENEDEKKPEKPENEDEKK 90


>gnl|CDD|193409 pfam12936, Kri1_C, KRI1-like family C-terminal.  The yeast member
           of this family (Kri1p) is found to be required for 40S
           ribosome biogenesis in the nucleolus. This is the
           C-terminal domain of the family.
          Length = 93

 Score = 26.8 bits (60), Expect = 4.1
 Identities = 12/25 (48%), Positives = 14/25 (56%), Gaps = 1/25 (4%)

Query: 196 KEEEKKEAKKKTKKK-EEEKKKKTR 219
           KEE +K+ KK  KK    E KKK  
Sbjct: 69  KEERRKDKKKYGKKARLREWKKKVF 93


>gnl|CDD|151665 pfam11223, DUF3020, Protein of unknown function (DUF3020).  This
           family of fungal proteins is conserved towards the
           C-terminus of HMG domains. The function is not known.
          Length = 49

 Score = 25.6 bits (56), Expect = 4.2
 Identities = 10/30 (33%), Positives = 12/30 (40%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMRG 222
           KKK  E   E  K    +   KKK  +  G
Sbjct: 5   KKKWREANSERNKDNDLRSRVKKKAKKKFG 34


>gnl|CDD|191634 pfam06886, TPX2, Targeting protein for Xklp2 (TPX2).  This family
           represents a conserved region approximately 60 residues
           long within the eukaryotic targeting protein for Xklp2
           (TPX2). Xklp2 is a kinesin-like protein localised on
           centrosomes throughout the cell cycle and on spindle
           pole microtubules during metaphase. In Xenopus, it has
           been shown that Xklp2 protein is required for centrosome
           separation and maintenance of spindle bi-polarity. TPX2
           is a microtubule-associated protein that mediates the
           binding of the C-terminal domain of Xklp2 to
           microtubules. It is phosphorylated during mitosis in a
           microtubule-dependent way.
          Length = 57

 Score = 25.8 bits (57), Expect = 4.2
 Identities = 12/26 (46%), Positives = 18/26 (69%)

Query: 194 KKKEEEKKEAKKKTKKKEEEKKKKTR 219
           K  E EK+EA+ + K++EEE  K+ R
Sbjct: 22  KALEAEKEEAEARQKEEEEEAIKQLR 47


>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin.  Trichoplein
           or mitostatin, was first defined as a meiosis-specific
           nuclear structural protein. It has since been linked
           with mitochondrial movement. It is associated with the
           mitochondrial outer membrane, and over-expression leads
           to reduction in mitochondrial motility whereas lack of
           it enhances mitochondrial movement. The activity appears
           to be mediated through binding the mitochondria to the
           actin intermediate filaments (IFs).
          Length = 349

 Score = 28.0 bits (63), Expect = 4.3
 Identities = 8/26 (30%), Positives = 18/26 (69%), Gaps = 1/26 (3%)

Query: 197 EEEKKEAKKKTKKKEEEKKKKTRMRG 222
           EEE+ +A  + +++E ++K++ R  G
Sbjct: 48  EEERLKALAEEEERERKRKEE-RREG 72



 Score = 27.2 bits (61), Expect = 7.5
 Identities = 7/28 (25%), Positives = 22/28 (78%)

Query: 189 ICDFKKKKEEEKKEAKKKTKKKEEEKKK 216
           I +++++K E ++E + + ++++EEK++
Sbjct: 155 ILEYQREKAEREEEREAERRERKEEKER 182


>gnl|CDD|192292 pfam09429, Wbp11, WW domain binding protein 11.  The WW domain is a
           small protein module with a triple-stranded beta-sheet
           fold. This is a family of WW domain binding proteins.
          Length = 78

 Score = 26.1 bits (58), Expect = 4.7
 Identities = 8/23 (34%), Positives = 14/23 (60%)

Query: 197 EEEKKEAKKKTKKKEEEKKKKTR 219
           +  +KE KKK  KK + +++  R
Sbjct: 8   DAYRKEQKKKELKKNKAERQARR 30


>gnl|CDD|237582 PRK14011, PRK14011, prefoldin subunit alpha; Provisional.
          Length = 144

 Score = 27.0 bits (60), Expect = 4.8
 Identities = 17/39 (43%), Positives = 24/39 (61%)

Query: 189 ICDFKKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
           I DFKK  EE  K  K+  KK EE  K+ T++R  +E++
Sbjct: 90  IEDFKKSVEELDKTKKEGNKKIEELNKEITKLRKELEKR 128


>gnl|CDD|223367 COG0290, InfC, Translation initiation factor 3 (IF-3) [Translation,
           ribosomal structure and biogenesis].
          Length = 176

 Score = 27.2 bits (61), Expect = 4.8
 Identities = 15/45 (33%), Positives = 25/45 (55%), Gaps = 13/45 (28%)

Query: 181 KYRYQHPNICDFKKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIE 225
           K++Y+       +KKK    KEAKKK K+ +    K+ ++R  I+
Sbjct: 69  KFKYE-------QKKKA---KEAKKKQKQIQ---VKEIKLRPKID 100


>gnl|CDD|204055 pfam08764, Coagulase, Staphylococcus aureus coagulase.
           Staphylococcus aureus secretes a cofactor called
           coagulase. Coagulase is an extracellular protein that
           forms a complex with human prothrombin, and activates it
           without the usual proteolytic cleavages. The resulting
           complex directly initiates blood clotting.
          Length = 282

 Score = 27.8 bits (62), Expect = 4.8
 Identities = 12/38 (31%), Positives = 18/38 (47%), Gaps = 6/38 (15%)

Query: 187 PNICDFKKKKEEEK------KEAKKKTKKKEEEKKKKT 218
           P   D+K K E ++      KE ++   K +E  K KT
Sbjct: 240 PTKHDYKNKSENKENFDALVKETREAVAKADESWKTKT 277


>gnl|CDD|219783 pfam08297, U3_snoRNA_assoc, U3 snoRNA associated.  This family of
           proteins is associated with U3 snoRNA. U3 snoRNA is
           required for nucleolar processing of pre-18S ribosomal
           RNA.
          Length = 87

 Score = 26.2 bits (58), Expect = 4.9
 Identities = 10/26 (38%), Positives = 15/26 (57%)

Query: 192 FKKKKEEEKKEAKKKTKKKEEEKKKK 217
             +  EEE K+ KKK K +  +K +K
Sbjct: 20  PPESYEEELKKQKKKNKLRFLKKLEK 45


>gnl|CDD|234900 PRK01109, PRK01109, ATP-dependent DNA ligase; Provisional.
          Length = 590

 Score = 28.0 bits (63), Expect = 5.0
 Identities = 12/35 (34%), Positives = 19/35 (54%), Gaps = 1/35 (2%)

Query: 130 DRYYAIVKPLKYPINMTKRVVAFMLLNVWVSPAVI 164
           D    ++KP K   +   RVV+ M  +VWV P ++
Sbjct: 490 DELPKMLKPYKID-HKHPRVVSKMEPDVWVEPKLV 523


>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
          Length = 213

 Score = 27.6 bits (62), Expect = 5.1
 Identities = 5/27 (18%), Positives = 14/27 (51%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
            KK+E   ++ K   ++++ + K   +
Sbjct: 119 AKKREAAGEKEKAPRRERKPKPKAPRK 145



 Score = 26.8 bits (60), Expect = 8.6
 Identities = 5/27 (18%), Positives = 15/27 (55%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
           + +++ +K+EA  + +K    ++K   
Sbjct: 114 RAEQQAKKREAAGEKEKAPRRERKPKP 140


>gnl|CDD|219953 pfam08648, DUF1777, Protein of unknown function (DUF1777).  This is
           a family of eukaryotic proteins of unknown function.
           Some of the proteins in this family are putative nucleic
           acid binding proteins.
          Length = 158

 Score = 27.1 bits (60), Expect = 5.4
 Identities = 8/47 (17%), Positives = 25/47 (53%)

Query: 181 KYRYQHPNICDFKKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
           ++R +  +    + +K E  K+A++  K++ ++  K+  + G  + +
Sbjct: 58  RHRSRSRSPSRRRDRKRERDKDAREPKKRERQKLIKEEDLEGKSDEE 104


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 27.9 bits (62), Expect = 5.7
 Identities = 13/34 (38%), Positives = 18/34 (52%), Gaps = 3/34 (8%)

Query: 193 KKKKEE---EKKEAKKKTKKKEEEKKKKTRMRGG 223
           KKKKEE   EK+  K K  +KE + K + +    
Sbjct: 21  KKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASD 54



 Score = 27.2 bits (60), Expect = 9.6
 Identities = 11/29 (37%), Positives = 20/29 (68%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
           +KKK+EEK + K+  K K  +K+ K +++
Sbjct: 20  RKKKKEEKAKEKELKKLKAAQKEAKAKLQ 48


>gnl|CDD|216137 pfam00822, PMP22_Claudin, PMP-22/EMP/MP20/Claudin family. 
          Length = 161

 Score = 26.9 bits (60), Expect = 5.7
 Identities = 16/94 (17%), Positives = 32/94 (34%), Gaps = 10/94 (10%)

Query: 36  RASIMIFIIVASIGGNLLVIVSVMRHRKLRIITNYFVVSLAFADMLVAMVAMTFNASVQV 95
              +++  IV+ I   +L+ V+         I + + VS        A +      +   
Sbjct: 1   MLVLLLGFIVSHIAWVILLFVAT--------IPDQWKVSRYVGAAASAGLWRNC-TTQSC 51

Query: 96  TGKWNFSFFMCDVWNSLDVYFSTASILHLCCISV 129
           TG+ +      +            SI+ L  IS+
Sbjct: 52  TGQISCKVLELNDALQAVQALMILSII-LGIISL 84


>gnl|CDD|235185 PRK03980, PRK03980, flap endonuclease-1; Provisional.
          Length = 292

 Score = 27.5 bits (62), Expect = 5.8
 Identities = 8/27 (29%), Positives = 20/27 (74%)

Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKK 217
           + K ++ EE++E +++ ++K EE K++
Sbjct: 38  ELKAEEIEERREVREEAEEKYEEAKEE 64


>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe.  JAK1 is a
           tyrosine kinase protein essential in signaling type I
           and type II cytokines. It interacts with the gamma chain
           of type I cytokine receptors to elicit signals from the
           IL-2 receptor family, the IL-4 receptor family, the
           gp130 receptor family, ciliary neurotrophic factor
           receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
           Leptin-R). It also is involved in transducing a signal
           by type I (IFN-alpha/beta) and type II (IFN-gamma)
           interferons, and members of the IL-10 family via type II
           cytokine receptors. JAK (also called Just Another
           Kinase) is a family of intracellular, non-receptor
           tyrosine kinases that transduce cytokine-mediated
           signals via the JAK-STAT pathway. The JAK family in
           mammals consists of 4 members: JAK1, JAK2, JAK3 and
           TYK2. JAKs are composed of seven JAK homology (JH)
           domains (JH1-JH7) . The C-terminal JH1 domain is the
           main catalytic domain, followed by JH2, which is often
           referred to as a pseudokinase domain, followed by
           JH3-JH4 which is homologous to the SH2 domain, and
           lastly JH5-JH7 which is a FERM domain.  Named after
           Janus, the two-faced Roman god of doorways, JAKs possess
           two near-identical phosphate-transferring domains; one
           which displays the kinase activity (JH1), while the
           other negatively regulates the kinase activity of the
           first (JH2). The FERM domain has a cloverleaf tripart
           structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
           C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
           C-lobe/F3 within the FERM domain is part of the PH
           domain family. The FERM domain is found in the
           cytoskeletal-associated proteins such as ezrin, moesin,
           radixin, 4.1R, and merlin. These proteins provide a link
           between the membrane and cytoskeleton and are involved
           in signal transduction pathways. The FERM domain is also
           found in protein tyrosine phosphatases (PTPs) , the
           tyrosine kinases FAK and JAK, in addition to other
           proteins involved in signaling. This domain is
           structurally similar to the PH and PTB domains and
           consequently is capable of binding to both peptides and
           phospholipids at different sites.
          Length = 198

 Score = 27.1 bits (60), Expect = 5.8
 Identities = 11/24 (45%), Positives = 16/24 (66%)

Query: 196 KEEEKKEAKKKTKKKEEEKKKKTR 219
           K+++ K  K K K K++E KKK R
Sbjct: 91  KKKKGKSKKNKLKGKKDEDKKKAR 114



 Score = 26.7 bits (59), Expect = 8.5
 Identities = 12/31 (38%), Positives = 19/31 (61%)

Query: 187 PNICDFKKKKEEEKKEAKKKTKKKEEEKKKK 217
           P     +KKK+ + K+ K K KK E++KK +
Sbjct: 84  PATTAVEKKKKGKSKKNKLKGKKDEDKKKAR 114


>gnl|CDD|240576 cd12932, RRP7_like, RRP7 domain ribosomal RNA-processing protein 7
           (Rrp7p), ribosomal RNA-processing protein 7 homolog A
           (Rrp7A), and similar proteins.  This CD corresponds to
           the RRP7 domain of Rrp7p and Rrp7A. Rrp7p is encoded by
           YCL031C gene from Saccharomyces cerevisiae. It is an
           essential yeast protein involved in pre-rRNA processing
           and ribosome assembly, and is speculated to be required
           for correct assembly of rpS27 into the pre-ribosomal
           particle. Rrp7A, also termed gastric cancer antigen
           Zg14, is the Rrp7p homolog mainly found in Metazoans.
           The cellular function of Rrp7A remains unclear
           currently. Both Rrp7p and Rrp7A harbor an N-terminal RNA
           recognition motif (RRM), also termed RBD (RNA binding
           domain) or RNP (ribonucleoprotein domain), and a
           C-terminal RRP7 domain.
          Length = 118

 Score = 26.4 bits (59), Expect = 5.9
 Identities = 10/25 (40%), Positives = 17/25 (68%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
           +K K   ++  + K K+KE++KKKK
Sbjct: 61  RKGKTAREEAVEAKAKEKEKKKKKK 85



 Score = 26.1 bits (58), Expect = 9.6
 Identities = 13/35 (37%), Positives = 19/35 (54%), Gaps = 10/35 (28%)

Query: 193 KKKKEEEKKEAKKKTKKKEE----------EKKKK 217
           ++  E + KE +KK KKK+E          EKKK+
Sbjct: 68  EEAVEAKAKEKEKKKKKKKELEDFYRFQIREKKKE 102


>gnl|CDD|211393 cd11381, NSA2, pre-ribosomal protein NSA2 (Nop seven-associated 2).
            NSA2 appears to be a protein required for the
           maturation of 27S pre-rRNA in yeast; it has been
           characterized in mammalian cells as a nucleolar protein
           that might play a role in the regulation of the cell
           cycle and in cell proliferation.
          Length = 257

 Score = 27.2 bits (61), Expect = 6.0
 Identities = 10/27 (37%), Positives = 15/27 (55%)

Query: 192 FKKKKEEEKKEAKKKTKKKEEEKKKKT 218
           + KK+ +EK + KK  K  EE   K+ 
Sbjct: 47  YNKKRYKEKIQMKKTIKMHEERNVKQK 73


>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
          Length = 246

 Score = 27.3 bits (61), Expect = 6.1
 Identities = 9/24 (37%), Positives = 14/24 (58%)

Query: 194 KKKEEEKKEAKKKTKKKEEEKKKK 217
           + K + K + K K  KK EE+ K+
Sbjct: 96  EPKPKPKPKPKPKPVKKVEEQPKR 119


>gnl|CDD|220397 pfam09781, NDUF_B5, NADH:ubiquinone oxidoreductase, NDUFB5/SGDH
           subunit.  Members of this family mediate the transfer of
           electrons from NADH to the respiratory chain. The
           immediate electron acceptor for the enzyme is believed
           to be ubiquinone, the reaction that occurs being: NADH +
           ubiquinone = NAD(+) + ubiquinol.
          Length = 182

 Score = 27.1 bits (60), Expect = 6.1
 Identities = 27/89 (30%), Positives = 42/89 (47%), Gaps = 21/89 (23%)

Query: 149 VVAFMLL-NVWVSPAVISFVPIMCGWYTTSEHLKYRYQHP-------NICDFKKKKEEEK 200
           V AF+   NV++ PA ++ +P   G+    EH +Y Y+HP        + D   +KE EK
Sbjct: 75  VAAFITYVNVFIGPAELAEIPE--GY--EPEHWEY-YKHPITRWIARYVYD-SPQKEYEK 128

Query: 201 KEA-------KKKTKKKEEEKKKKTRMRG 222
             A       K + +  E E ++  R RG
Sbjct: 129 MLALIQEENEKAEIRLLELEVRRLMRERG 157


>gnl|CDD|227474 COG5145, RAD14, DNA excision repair protein [DNA replication,
           recombination, and repair].
          Length = 292

 Score = 27.3 bits (60), Expect = 6.2
 Identities = 12/28 (42%), Positives = 21/28 (75%), Gaps = 1/28 (3%)

Query: 193 KKKKEEEKKEAK-KKTKKKEEEKKKKTR 219
           K+++E+ K + K KK +KK +E ++KTR
Sbjct: 218 KQRREKMKDDRKEKKLEKKIKELRRKTR 245


>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
           domain.  This domain is found in a number of different
           types of plant proteins including NAM-like proteins.
          Length = 147

 Score = 27.0 bits (60), Expect = 6.2
 Identities = 11/30 (36%), Positives = 20/30 (66%), Gaps = 1/30 (3%)

Query: 193 KKKKEEEKKE-AKKKTKKKEEEKKKKTRMR 221
           KK KE+ +++  K K ++ E+EK+K+ R  
Sbjct: 66  KKAKEKLRRDKLKAKKEEAEKEKEKEERFM 95


>gnl|CDD|226640 COG4171, SapC, ABC-type antimicrobial peptide transport system,
           permease component [Defense mechanisms].
          Length = 296

 Score = 27.4 bits (61), Expect = 6.2
 Identities = 8/56 (14%), Positives = 25/56 (44%)

Query: 34  TLRASIMIFIIVASIGGNLLVIVSVMRHRKLRIITNYFVVSLAFADMLVAMVAMTF 89
           T+  ++++ +     GG L +        +  ++ +     L+   +L+A++ + F
Sbjct: 98  TVGGALLVTLAATICGGVLGIFAGATHGLRSAVLNHILDTLLSIPSLLLAIIVVAF 153


>gnl|CDD|234707 PRK00270, rpsU, 30S ribosomal protein S21; Reviewed.
          Length = 64

 Score = 25.6 bits (57), Expect = 6.3
 Identities = 7/31 (22%), Positives = 18/31 (58%)

Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
           + ++++  EK   K+K KK    K+++ ++ 
Sbjct: 31  ELRRREFYEKPSEKRKRKKAAARKRRRKKLA 61


>gnl|CDD|211392 cd11380, Ribosomal_S8e_like, Eukaryotic/archaeal ribosomal protein
           S8e and similar proteins.  This family contains the
           eukaryotic/archaeal ribosomal protein S8, a component of
           the small ribosomal subunits, as well as the NSA2 gene
           product.
          Length = 138

 Score = 26.7 bits (60), Expect = 6.3
 Identities = 12/46 (26%), Positives = 19/46 (41%), Gaps = 12/46 (26%)

Query: 193 KKKKEEEKKEAKKKTKKKE------------EEKKKKTRMRGGIER 226
           K+K    K +  +K +K E            E   +K R+RGG  +
Sbjct: 6   KRKATGGKFKVVRKKRKYELGRKPANTKLGGERFTRKVRVRGGNIK 51


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 27.6 bits (62), Expect = 6.3
 Identities = 13/37 (35%), Positives = 21/37 (56%), Gaps = 1/37 (2%)

Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKK-TRMRGGIER 226
             K+  EE KKEA+   K+   E K++  ++R  +ER
Sbjct: 33  LAKRIIEEAKKEAETLKKEALLEAKEEVHKLRAELER 69


>gnl|CDD|234533 TIGR04285, nucleoid_noc, nucleoid occlusion protein.  This model
           describes nucleoid occlusion protein, a close homolog to
           ParB chromosome partitioning proteins including Spo0J in
           Bacillus subtilis. Its gene often is located near the
           gene for the Spo0J ortholog. This protein bind a
           specific DNA sequence and blocks cytokinesis from
           happening until chromosome segregation is complete.
          Length = 255

 Score = 27.5 bits (62), Expect = 6.4
 Identities = 9/26 (34%), Positives = 14/26 (53%)

Query: 196 KEEEKKEAKKKTKKKEEEKKKKTRMR 221
           K+ E+   K   K ++ +KKKK R  
Sbjct: 185 KQTEELIKKLLEKPEKPKKKKKRRKG 210


>gnl|CDD|227822 COG5535, RAD4, DNA repair protein RAD4 [DNA replication,
           recombination, and repair].
          Length = 650

 Score = 27.5 bits (61), Expect = 6.4
 Identities = 13/53 (24%), Positives = 20/53 (37%), Gaps = 3/53 (5%)

Query: 115 YFSTASILHLCCISVDRYYAIVKPLKYPINMTKRVVAFMLLNVWVSPAVISFV 167
            FS  S L    +  +     +  L+YP    + V  F    V+V P  +  V
Sbjct: 278 VFSNNSDLD---VLSEGLLEYIDSLEYPGFWGEVVDKFEKKWVFVDPVRLYIV 327


>gnl|CDD|221654 pfam12589, WBS_methylT, Methyltransferase involved in
           Williams-Beuren syndrome.  This domain family is found
           in eukaryotes, and is typically between 72 and 83 amino
           acids in length. The family is found in association with
           pfam08241. This family is made up of
           S-adenosylmethionine-dependent methyltransferases. The
           proteins are deleted in Williams-Beuren syndrome (WBS),
           a complex developmental disorder with multisystemic
           manifestations including supravalvular aortic stenosis
           (SVAS) and a specific cognitive phenotype.
          Length = 85

 Score = 25.7 bits (57), Expect = 6.7
 Identities = 13/41 (31%), Positives = 20/41 (48%), Gaps = 7/41 (17%)

Query: 188 NICDFKKKKEEEKKEAKKKTKKKEEEKK-------KKTRMR 221
              D  K +   ++  +KK KKK+ +KK       KK +MR
Sbjct: 25  EQIDASKVRRISQRNRRKKKKKKKLKKKSKEWILRKKEQMR 65


>gnl|CDD|219160 pfam06749, DUF1218, Protein of unknown function (DUF1218).  This
           family contains hypothetical plant proteins of unknown
           function. Family members contain a number of conserved
           cysteine residues.
          Length = 97

 Score = 26.1 bits (58), Expect = 6.9
 Identities = 14/55 (25%), Positives = 25/55 (45%), Gaps = 2/55 (3%)

Query: 142 PINMTKRVVAFMLLNV-WVSPAVISFVPIMCGWYTTSEHLKYRYQHPNICDFKKK 195
           P +  KR +A +   + W++  VI+F  ++ G    + H +Y       C   KK
Sbjct: 28  PKSGRKRALAVVCFVLSWIA-FVIAFALLLAGAARNAYHTRYNVHFRPSCYTLKK 81


>gnl|CDD|224977 COG2066, GlsA, Glutaminase [Amino acid transport and metabolism].
          Length = 309

 Score = 27.2 bits (61), Expect = 6.9
 Identities = 22/98 (22%), Positives = 30/98 (30%), Gaps = 35/98 (35%)

Query: 100 NFSFFMCDVWNSLDVYFSTASILHLCCI--SVDRYYAIVKPL----KYPIN--------M 145
           +F     DV  +LDVYF      H C I  +      +   L      P+          
Sbjct: 178 SFGNLEHDVEEALDVYF------HQCAIEMNCVDLARLGLFLANGGVNPLTGERVVPADQ 231

Query: 146 TKRVVAFMLLNVWVSPAVISFVPIMCGWYTTSEHLKYR 183
            +++ A ML                CG Y  S    YR
Sbjct: 232 ARQINALML---------------TCGMYDASGEFAYR 254


>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
           Validated.
          Length = 860

 Score = 27.5 bits (61), Expect = 7.1
 Identities = 10/26 (38%), Positives = 12/26 (46%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKT 218
            KKK    K    KT KK+ + K K 
Sbjct: 753 AKKKSASTKGKAAKTVKKKSKAKSKK 778


>gnl|CDD|116948 pfam08367, M16C_assoc, Peptidase M16C associated.  This domain
           appears in eukaryotes as well as bacteria and tends to
           be found near the C-terminus of the metalloprotease M16C
           (pfam05193).
          Length = 248

 Score = 27.2 bits (61), Expect = 7.3
 Identities = 12/30 (40%), Positives = 18/30 (60%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMRG 222
           +K  EEE+++ +KK     EE K+K   RG
Sbjct: 9   EKLNEEEREKLEKKKSSLSEEDKEKIIERG 38


>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
           factor.  Replicative DNA polymerases are capable of
           polymerising tens of thousands of nucleotides without
           dissociating from their DNA templates. The high
           processivity of these polymerases is dependent upon
           accessory proteins that bind to the catalytic subunit of
           the polymerase or to the substrate. The Epstein-Barr
           virus (EBV) BMRF1 protein is an essential component of
           the viral DNA polymerase and is absolutely required for
           lytic virus replication. BMRF1 is also a transactivator.
           This family is predicted to have a UL42 like structure.
          Length = 381

 Score = 27.3 bits (61), Expect = 7.4
 Identities = 11/24 (45%), Positives = 16/24 (66%)

Query: 187 PNICDFKKKKEEEKKEAKKKTKKK 210
           P     K+K+E +KK+ KKK+KK 
Sbjct: 350 PLALSPKRKREGDKKQKKKKSKKL 373


>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
           Genome duplication is precisely regulated by
           cyclin-dependent kinases CDKs, which bring about the
           onset of S phase by activating replication origins and
           then prevent relicensing of origins until mitosis is
           completed. The optimum sequence motif for CDK
           phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
           to have at least 11 potential phosphorylation sites.
           Drc1 is required for DNA synthesis and S-M replication
           checkpoint control. Drc1 associates with Cdc2 and is
           phosphorylated at the onset of S phase when Cdc2 is
           activated. Thus Cdc2 promotes DNA replication by
           phosphorylating Drc1 and regulating its association with
           Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
           substrates required for DNA replication.
          Length = 397

 Score = 27.5 bits (61), Expect = 7.6
 Identities = 15/43 (34%), Positives = 23/43 (53%), Gaps = 1/43 (2%)

Query: 177 SEHLKYRYQHPNICDFKKKKEEEKKEAKKKTKKKEEEKKKKTR 219
           SE L Y     +  D  +  EE K+E +KK K K++ +K+K  
Sbjct: 329 SEFLGYM-GGIDEDDEDEDDEESKEEVEKKQKVKKKPRKRKVN 370


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 27.3 bits (61), Expect = 7.7
 Identities = 14/34 (41%), Positives = 23/34 (67%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIER 226
            +K + E KE KK+ KK+E++KK+  R+   IE+
Sbjct: 321 NEKLDAEVKEKKKEKKKEEKKKKQIERLEERIEK 354


>gnl|CDD|178415 PLN02820, PLN02820, 3-methylcrotonyl-CoA carboxylase, beta chain.
          Length = 569

 Score = 27.5 bits (61), Expect = 7.8
 Identities = 10/31 (32%), Positives = 15/31 (48%)

Query: 197 EEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
           E E K+ +     KEEE+  K +     ER+
Sbjct: 492 ERENKKRQGIQWSKEEEEAFKAKTVEAYERE 522


>gnl|CDD|218140 pfam04547, Anoctamin, Calcium-activated chloride channel.  The
           family carries eight putative transmembrane domains,
           and, although it has no similarity to other known
           channel proteins, it is clearly a calcium-activated
           ionic channel. It is expressed in various secretory
           epithelia, the retina and sensory neurons, and mediates
           receptor-activated chloride currents in diverse
           physiological processes.
          Length = 449

 Score = 27.2 bits (61), Expect = 8.1
 Identities = 13/79 (16%), Positives = 29/79 (36%), Gaps = 8/79 (10%)

Query: 59  MRHRKLRIITNYFVVSLAFADMLVAMVAMTFNASVQVTGKWNFSFFMCDVWNSLDVYFST 118
              R LR + +  VV L    ++  ++ +       +    +  +       SL+ +   
Sbjct: 114 PWKRLLRQLLSIPVVLLFIGLVIGIIIGI-----FILRIFLSEVYSGPSFKQSLESF--L 166

Query: 119 ASILHLCCISV-DRYYAIV 136
            +IL+   I + +  Y  V
Sbjct: 167 PAILNSVIILILNFIYEKV 185


>gnl|CDD|220376 pfam09745, DUF2040, Coiled-coil domain-containing protein 55
           (DUF2040).  This entry is a conserved domain of
           approximately 130 residues of proteins conserved from
           fungi to humans. The proteins do contain a coiled-coil
           domain, but the function is unknown.
          Length = 128

 Score = 26.2 bits (58), Expect = 8.1
 Identities = 8/30 (26%), Positives = 20/30 (66%)

Query: 194 KKKEEEKKEAKKKTKKKEEEKKKKTRMRGG 223
           KK+ EE ++ +++ K++EE +++    +G 
Sbjct: 96  KKQLEENRKLEEEEKEREELEEENDVTKGK 125


>gnl|CDD|215456 PLN02850, PLN02850, aspartate-tRNA ligase.
          Length = 530

 Score = 27.4 bits (61), Expect = 8.4
 Identities = 8/25 (32%), Positives = 12/25 (48%)

Query: 200 KKEAKKKTKKKEEEKKKKTRMRGGI 224
           KK AKK   K E+ +++ T      
Sbjct: 15  KKAAKKAAAKAEKLRREATAKAAAA 39


>gnl|CDD|201966 pfam01777, Ribosomal_L27e, Ribosomal L27e protein family.  The
           N-terminal region of the eukaryotic ribosomal L27 has
           the KOW motif. C-terminal region is represented by this
           family.
          Length = 85

 Score = 25.7 bits (57), Expect = 8.6
 Identities = 11/22 (50%), Positives = 16/22 (72%)

Query: 200 KKEAKKKTKKKEEEKKKKTRMR 221
           +KEA+K+ KKK EE+ K  + R
Sbjct: 56  RKEARKEVKKKFEERYKTGKNR 77


>gnl|CDD|180505 PRK06278, PRK06278, cobyrinic acid a,c-diamide synthase; Validated.
          Length = 476

 Score = 27.3 bits (61), Expect = 8.6
 Identities = 12/30 (40%), Positives = 18/30 (60%)

Query: 188 NICDFKKKKEEEKKEAKKKTKKKEEEKKKK 217
           N   + K KE+EK+E  KK K  +E+ K +
Sbjct: 194 NFLKYLKIKEDEKEEIFKKNKILKEKLKSR 223


>gnl|CDD|217935 pfam04158, Sof1, Sof1-like domain.  Sof1 is essential for cell
           growth and is a component of the nucleolar rRNA
           processing machinery.
          Length = 88

 Score = 25.7 bits (57), Expect = 8.8
 Identities = 10/27 (37%), Positives = 16/27 (59%), Gaps = 1/27 (3%)

Query: 193 KKKKEE-EKKEAKKKTKKKEEEKKKKT 218
           KK+KEE  +K +K  +   + E+KK  
Sbjct: 61  KKRKEENRRKHSKPGSVPPKPERKKHV 87



 Score = 25.7 bits (57), Expect = 9.0
 Identities = 17/49 (34%), Positives = 26/49 (53%), Gaps = 10/49 (20%)

Query: 178 EHLKYRYQH-PNICDFK---------KKKEEEKKEAKKKTKKKEEEKKK 216
           E LK +Y+H P I             KK ++ K+E K+  K+KEE ++K
Sbjct: 22  EALKEKYKHMPEIKRIARHRHVPKAIKKAQKIKREMKEAKKRKEENRRK 70


>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6.  RPL6
           contains KOW motif that has an extra ribosomal role as
           an oncogenic. KOW domain is known as an RNA-binding
           motif that is shared so far among some families of
           ribosomal proteins, the essential bacterial
           transcriptional elongation factor NusG, the eukaryotic
           chromatin elongation factor Spt5, the higher eukaryotic
           KIN17 proteins and Mtr4. .
          Length = 152

 Score = 26.4 bits (59), Expect = 8.9
 Identities = 13/25 (52%), Positives = 15/25 (60%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKK 217
           KKKKE E  E KKK     EE+K+ 
Sbjct: 89  KKKKEGEFFEEKKKKYVVSEERKED 113


>gnl|CDD|219547 pfam07741, BRF1, Brf1-like TBP-binding domain.  This region covers
           both the Brf homology II and III regions. This region is
           involved in binding TATA binding protein.
          Length = 95

 Score = 25.7 bits (57), Expect = 9.0
 Identities = 7/29 (24%), Positives = 18/29 (62%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
           +++KE ++K  +     K+++K+K  + R
Sbjct: 32  QEEKELKQKADEGNNSGKKKKKRKAKKKR 60


>gnl|CDD|178635 PLN03086, PLN03086, PRLI-interacting factor K; Provisional.
          Length = 567

 Score = 27.1 bits (60), Expect = 9.1
 Identities = 14/37 (37%), Positives = 22/37 (59%), Gaps = 4/37 (10%)

Query: 193 KKKKEEEKKEAKKKTKKK-EEEKKKK---TRMRGGIE 225
           ++K E E++E K++ K K E E+K K    + R  IE
Sbjct: 9   REKLEREQRERKQRAKLKLERERKAKEEAAKQREAIE 45


>gnl|CDD|153326 cd07642, BAR_ASAP2, The Bin/Amphiphysin/Rvs (BAR) domain of ArfGAP
           with SH3 domain, ANK repeat and PH domain containing
           protein 2.  BAR domains are dimerization, lipid binding
           and curvature sensing modules found in many different
           proteins with diverse functions. ASAP2 (ArfGAP with SH3
           domain, ANK repeat and PH domain containing protein 2)
           is also known as DDEF2 (Development and Differentiation
           Enhancing Factor 2), AMAP2, centaurin beta-3, or PAG3.
           ASAP2 mediates the functions of Arf GTPases vial dual
           mechanisms: it exhibits GTPase activating protein (GAP)
           activity towards class I (Arf1) and II (Arf5) Arfs; and
           binds class III Arfs (GTP-Arf6) stably without GAP
           activity. It binds paxillin and is implicated in Fcgamma
           receptor-mediated phagocytosis in macrophages and in
           cell migration. ASAP2 contains an N-terminal BAR domain,
           followed by a Pleckstrin homology (PH) domain, an Arf
           GAP domain, ankyrin (ANK) repeats, and a C-terminal SH3
           domain. BAR domains form dimers that bind to membranes,
           induce membrane bending and curvature, and may also be
           involved in protein-protein interactions. The BAR domain
           of the related protein ASAP1 mediates membrane bending,
           is essential for function, and autoinhibits GAP activity
           by interacting with the PH and/or Arf GAP domains.
          Length = 215

 Score = 26.9 bits (59), Expect = 9.1
 Identities = 13/37 (35%), Positives = 21/37 (56%)

Query: 191 DFKKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
           D KK  ++  K+ + K  K E+EKK+  +M G I  +
Sbjct: 108 DLKKPFDKAWKDYETKVTKIEKEKKEHAKMHGMIRTE 144


>gnl|CDD|216833 pfam01991, vATP-synt_E, ATP synthase (E/31 kDa) subunit.  This
           family includes the vacuolar ATP synthase E subunit, as
           well as the archaebacterial ATP synthase E subunit.
          Length = 195

 Score = 26.5 bits (59), Expect = 9.2
 Identities = 12/35 (34%), Positives = 21/35 (60%)

Query: 192 FKKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIER 226
           F+ +K E  +EA+KK ++  E+K+K+  M   I  
Sbjct: 21  FEIEKAEAVEEAEKKIEEIYEKKEKQAEMEKQIII 55


>gnl|CDD|167649 PRK03963, PRK03963, V-type ATP synthase subunit E; Provisional.
          Length = 198

 Score = 26.6 bits (59), Expect = 9.7
 Identities = 11/35 (31%), Positives = 23/35 (65%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
           +K+ E+ K+EA+K+ + K E   +K + +  +E+Q
Sbjct: 27  QKEAEKIKEEARKRAESKAEWILRKAKTQAELEKQ 61


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
            found at the N terminus of SMC proteins. The SMC
            (structural maintenance of chromosomes) superfamily
            proteins have ATP-binding domains at the N- and
            C-termini, and two extended coiled-coil domains separated
            by a hinge in the middle. The eukaryotic SMC proteins
            form two kind of heterodimers: the SMC1/SMC3 and the
            SMC2/SMC4 types. These heterodimers constitute an
            essential part of higher order complexes, which are
            involved in chromatin and DNA dynamics. This family also
            includes the RecF and RecN proteins that are involved in
            DNA metabolism and recombination.
          Length = 1162

 Score = 27.2 bits (60), Expect = 9.7
 Identities = 8/27 (29%), Positives = 17/27 (62%)

Query: 193  KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
            K++ EEEKKE  ++  ++  ++ K+  
Sbjct: 998  KERLEEEKKELLREIIEETCQRFKEFL 1024



 Score = 27.2 bits (60), Expect = 9.8
 Identities = 7/27 (25%), Positives = 14/27 (51%)

Query: 193 KKKKEEEKKEAKKKTKKKEEEKKKKTR 219
            K KE+ KK  +    K++ E +++  
Sbjct: 201 LKLKEQAKKALEYYQLKEKLELEEENL 227


>gnl|CDD|240396 PTZ00388, PTZ00388, 40S ribosomal protein S8-like; Provisional.
          Length = 223

 Score = 26.6 bits (59), Expect = 9.8
 Identities = 12/27 (44%), Positives = 15/27 (55%)

Query: 192 FKKKKEEEKKEAKKKTKKKEEEKKKKT 218
           F KK+  EK E KK  K  EE+  K+ 
Sbjct: 13  FNKKRYAEKAEMKKTIKAHEEKDVKEK 39


>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
           [Function unknown].
          Length = 294

 Score = 27.0 bits (60), Expect = 9.8
 Identities = 11/34 (32%), Positives = 21/34 (61%)

Query: 188 NICDFKKKKEEEKKEAKKKTKKKEEEKKKKTRMR 221
            I + K+K++E +KEA +  +K++E   K   +R
Sbjct: 21  EIEELKEKRDELRKEASELAEKRDELNAKVRELR 54


>gnl|CDD|217834 pfam03998, Utp11, Utp11 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 239

 Score = 26.6 bits (59), Expect = 10.0
 Identities = 15/45 (33%), Positives = 21/45 (46%)

Query: 175 TTSEHLKYRYQHPNICDFKKKKEEEKKEAKKKTKKKEEEKKKKTR 219
           TT E L  R   P I   +K    ++K+ KK  KKK +  K+   
Sbjct: 145 TTPELLDRRENRPRISQLEKTSLVDEKQKKKSAKKKRKLYKELKE 189


>gnl|CDD|147685 pfam05663, DUF809, Protein of unknown function (DUF809).  This
           family consists of several proteins of unknown function
           Raphanus sativus (Radish) and Brassica napus (Rape).
          Length = 138

 Score = 26.3 bits (57), Expect = 10.0
 Identities = 13/31 (41%), Positives = 21/31 (67%)

Query: 197 EEEKKEAKKKTKKKEEEKKKKTRMRGGIERQ 227
           +EEKKE K + + KEE+K+ K  + G  E++
Sbjct: 100 KEEKKEGKGEIEGKEEKKEGKGEIEGKEEKK 130


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.324    0.134    0.406 

Gapped
Lambda     K      H
   0.267   0.0694    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 11,406,173
Number of extensions: 1078761
Number of successful extensions: 6879
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5969
Number of HSP's successfully gapped: 576
Length of query: 227
Length of database: 10,937,602
Length adjustment: 93
Effective length of query: 134
Effective length of database: 6,812,680
Effective search space: 912899120
Effective search space used: 912899120
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 57 (25.8 bits)