RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy9531
         (409 letters)



>gnl|CDD|192352 pfam09732, CactinC_cactus, Cactus-binding C-terminus of cactin
           protein.  CactinC_cactus is the C-terminal 200 residues
           of the cactin protein which are necessary for the
           association of cactin with IkappaB-cactus as one of the
           intracellular members of the Rel complex. The Rel
           (NF-kappaB) pathway is conserved in invertebrates and
           vertebrates. In mammals, it controls the activities of
           the immune and inflammatory response genes as well as
           viral genes, and is critical for cell growth and
           survival. In Drosophila, the Rel pathway functions in
           the innate cellular and humoral immune response, in
           muscle development, and in the establishment of
           dorsal-ventral polarity in the early embryo. Most
           members of the family also have a Cactin_mid domain
           pfam10312 further upstream.
          Length = 125

 Score =  226 bits (578), Expect = 5e-74
 Identities = 82/105 (78%), Positives = 89/105 (84%)

Query: 285 WSDKYRPRKPRYFNRVHTGFEWNKYNQTHYDMDNPPPKIVQGYKFNIFYPDLIDKNSTPQ 344
           WSDKYRPRKPRYFNRV TGF+WNKYNQTHYD DNPPPK VQGYKFNIFYPDLIDK   P+
Sbjct: 1   WSDKYRPRKPRYFNRVKTGFDWNKYNQTHYDSDNPPPKTVQGYKFNIFYPDLIDKGKAPR 60

Query: 345 YFLNTCADNQDFAILRFHAGPPYEDIAFKIVNREWEYSYKRGFRC 389
           Y +  C D  D  +LRFHAGPPYEDIAF+IVN+EW+YS KRGFR 
Sbjct: 61  YRIEPCEDPDDTCLLRFHAGPPYEDIAFRIVNKEWDYSRKRGFRS 105


>gnl|CDD|214996 smart01050, CactinC_cactus, Cactus-binding C-terminus of cactin
           protein.  CactinC_cactus is the C-terminal 200 residues
           of the cactin protein which are necessary for the
           association of cactin with IkappaB-cactus as one of the
           intracellular members of the Rel complex. The Rel
           (NF-kappaB) pathway is conserved in invertebrates and
           vertebrates. In mammals, it controls the activities of
           the immune and inflammatory response genes as well as
           viral genes, and is critical for cell growth and
           survival. In Drosophila, the Rel pathway functions in
           the innate cellular and humoral immune response, in
           muscle development, and in the establishment of
           dorsal-ventral polarity in the early embryo. Most
           members of the family also have a Cactin_mid domain
           further upstream.
          Length = 129

 Score =  192 bits (491), Expect = 6e-61
 Identities = 76/110 (69%), Positives = 87/110 (79%), Gaps = 4/110 (3%)

Query: 285 WSDKYRPRKPRYFNRVHTGFEWNKYNQTHYDMDNPPPKIVQGYKFNIFYPDLIDKNSTPQ 344
           W DKYR RKPRYFNRV TG++WNKYNQTHYD DNPPPKIVQGYKFNIFYPDLIDK   P+
Sbjct: 1   WEDKYRLRKPRYFNRVKTGYDWNKYNQTHYDNDNPPPKIVQGYKFNIFYPDLIDKGKAPR 60

Query: 345 YFLNTC----ADNQDFAILRFHAGPPYEDIAFKIVNREWEYSYKRGFRCH 390
           Y + +       + D  +LRF AGPPYEDIAFKIVNREW+YS++ GF+  
Sbjct: 61  YRIESSFEIEKPDDDTCLLRFSAGPPYEDIAFKIVNREWDYSHRGGFKSS 110


>gnl|CDD|220686 pfam10312, Cactin_mid, Conserved mid region of cactin.  This is the
           conserved middle region of a family of proteins referred
           to as cactins. The region contains two of three
           predicted coiled-coil domains. Most members of this
           family have a CactinC_cactus pfam09732 domain at the
           C-terminal end. Upstream of Mid_cactin in Drosophila
           members are a serine-rich region, some non-typical RD
           motifs and three predicted bipartite nuclear
           localisation signals, none of which are well-conserved.
           Cactin associates with IkappaB-cactus as one of the
           intracellular members of the Rel (NF-kappaB) pathway
           which is conserved in invertebrates and vertebrates. In
           mammals, this pathway controls the activities of the
           immune and inflammatory response genes as well as viral
           genes, and is critical for cell growth and survival. In
           Drosophila, the Rel pathway functions in the innate
           cellular and humoral immune response, in muscle
           development, and in the establishment of dorsal-ventral
           polarity in the early embryo.
          Length = 190

 Score =  173 bits (441), Expect = 1e-52
 Identities = 83/170 (48%), Positives = 112/170 (65%), Gaps = 8/170 (4%)

Query: 83  QRGKEAAQFEEWQRQEDQFHLEQARLRSTIRIQEGRAKPIDLLAKYI---GSEAEVDAVE 139
           +R +E   F+EW+ +ED+FHL+QA+ R+ IRI+EGRAKPIDLLA+ +     E +V+  E
Sbjct: 1   RRLREDEAFKEWEAKEDEFHLKQAKKRAEIRIKEGRAKPIDLLARNLRFIDPEKDVE-FE 59

Query: 140 MHEPYTYLTGLAIKDLEDLLEDIKVYMELEKG-ENEAYWNDISIIVEDELHGLRKLEKQG 198
           +HEPYT L GL   +LE+L EDIKVY+ELEK  +N  YWN + II EDE   L KLEK+ 
Sbjct: 60  LHEPYTVLEGLTEDELEELEEDIKVYLELEKNPKNREYWNALLIICEDE---LEKLEKKE 116

Query: 199 HSDLTRREGIHESVAQDVASVFKGKTTAQLEALQTQIELKISSKTENIDN 248
                 R G+H SVA DV  +  GK+  +LE L+ QI+ K+ S    +D 
Sbjct: 117 AEGRADRSGLHSSVAADVKKILSGKSYEELEELEGQIKAKLRSGEAKVDT 166


>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
          Length = 434

 Score = 46.1 bits (110), Expect = 2e-05
 Identities = 15/34 (44%), Positives = 24/34 (70%)

Query: 33  KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQ 66
           KV A++ +KK+K+K+K K KK+     N+ KRR+
Sbjct: 394 KVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRK 427



 Score = 33.0 bits (76), Expect = 0.27
 Identities = 11/32 (34%), Positives = 17/32 (53%)

Query: 37  RKKKKKKKKKKKKKKKKKKNLCANVKKRRQER 68
             K  KK   K+ +KK+K+     VKKR ++ 
Sbjct: 388 TGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDT 419



 Score = 30.7 bits (70), Expect = 1.6
 Identities = 8/36 (22%), Positives = 16/36 (44%)

Query: 21  VYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKN 56
           V A+   +    K + + KK+ +  K   K++K   
Sbjct: 395 VLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSG 430



 Score = 28.4 bits (64), Expect = 8.4
 Identities = 9/35 (25%), Positives = 15/35 (42%)

Query: 38  KKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAER 72
           K   +KK  K  KK   K      K++ + +  +R
Sbjct: 381 KAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKR 415



 Score = 28.4 bits (64), Expect = 8.5
 Identities = 5/24 (20%), Positives = 12/24 (50%)

Query: 33  KVEARKKKKKKKKKKKKKKKKKKN 56
           K + +K+ +  K   K++K    +
Sbjct: 409 KPKVKKRHRDTKNIGKRRKPSGTS 432


>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
          (DUF2058).  This domain, found in various prokaryotic
          proteins, has no known function.
          Length = 177

 Score = 38.7 bits (91), Expect = 0.002
 Identities = 23/62 (37%), Positives = 39/62 (62%), Gaps = 1/62 (1%)

Query: 29 GTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEA 88
          G   K +A+K KK+K+K++K+ +K   +    +K+  +E +AE+AER+ E+   QR  EA
Sbjct: 10 GLVDKKKAKKAKKEKRKQRKQARKGADDGDDELKQAAEEAKAEKAERDRELNR-QRQAEA 68

Query: 89 AQ 90
           Q
Sbjct: 69 EQ 70


>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
           This is a family of fungal proteins whose function is
           unknown.
          Length = 130

 Score = 36.9 bits (86), Expect = 0.005
 Identities = 19/50 (38%), Positives = 31/50 (62%), Gaps = 3/50 (6%)

Query: 32  SKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMAL 81
           +KVE    ++K+KKKK+K+  KK+ +     + R+ER  ERAE+E+    
Sbjct: 69  AKVEKELLREKEKKKKRKRPGKKRRI---ALRLRRERTKERAEKEKRTRK 115



 Score = 29.5 bits (67), Expect = 1.4
 Identities = 10/33 (30%), Positives = 24/33 (72%)

Query: 37  RKKKKKKKKKKKKKKKKKKNLCANVKKRRQERE 69
           R ++++ K++ +K+K+ +KN     K+R++E+E
Sbjct: 97  RLRRERTKERAEKEKRTRKNREKKFKRRQKEKE 129



 Score = 28.4 bits (64), Expect = 2.9
 Identities = 8/22 (36%), Positives = 16/22 (72%)

Query: 35  EARKKKKKKKKKKKKKKKKKKN 56
           E  K+ +K ++KK K+++K+K 
Sbjct: 108 EKEKRTRKNREKKFKRRQKEKE 129


>gnl|CDD|218148 pfam04557, tRNA_synt_1c_R2, Glutaminyl-tRNA synthetase,
          non-specific RNA binding region part 2.  This is a
          region found N terminal to the catalytic domain of
          glutaminyl-tRNA synthetase (EC 6.1.1.18) in eukaryotes
          but not in Escherichia coli. This region is thought to
          bind RNA in a non-specific manner, enhancing
          interactions between the tRNA and enzyme, but is not
          essential for enzyme function.
          Length = 83

 Score = 34.3 bits (79), Expect = 0.013
 Identities = 21/48 (43%), Positives = 23/48 (47%), Gaps = 2/48 (4%)

Query: 31 GSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEE 78
          G K EA   KKKKKKKKKK +       A  KK   E  +E A   E 
Sbjct: 15 GPKTEADLVKKKKKKKKKKAEDTAAT--AKAKKATAEDVSEGAMFMEG 60


>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
          Length = 135

 Score = 35.4 bits (82), Expect = 0.013
 Identities = 17/38 (44%), Positives = 24/38 (63%)

Query: 18  QRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKK 55
           Q++V   ++ +    K + +KKKKKKKK  KK  KKKK
Sbjct: 98  QKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 32.7 bits (75), Expect = 0.13
 Identities = 16/24 (66%), Positives = 19/24 (79%)

Query: 33  KVEARKKKKKKKKKKKKKKKKKKN 56
           +V   K K+KK+KKKKKKKKKKK 
Sbjct: 103 QVLKEKAKQKKQKKKKKKKKKKKT 126



 Score = 32.4 bits (74), Expect = 0.15
 Identities = 17/30 (56%), Positives = 23/30 (76%)

Query: 34  VEARKKKKKKKKKKKKKKKKKKNLCANVKK 63
           ++ + K+KK+KKKKKKKKKKK +  A  KK
Sbjct: 105 LKEKAKQKKQKKKKKKKKKKKTSKKAAKKK 134



 Score = 31.2 bits (71), Expect = 0.35
 Identities = 15/31 (48%), Positives = 22/31 (70%)

Query: 36  ARKKKKKKKKKKKKKKKKKKNLCANVKKRRQ 66
            ++K K+KK+KKKKKKKKKK       K+++
Sbjct: 105 LKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 29.3 bits (66), Expect = 1.6
 Identities = 15/31 (48%), Positives = 20/31 (64%)

Query: 33  KVEARKKKKKKKKKKKKKKKKKKNLCANVKK 63
           K+  +  K+K K+KK+KKKKKKK      KK
Sbjct: 99  KIVKQVLKEKAKQKKQKKKKKKKKKKKTSKK 129


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 36.8 bits (86), Expect = 0.014
 Identities = 11/43 (25%), Positives = 27/43 (62%), Gaps = 1/43 (2%)

Query: 33  KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAER 75
           + E  ++KK++KK K++++ K   L    +++ +E+E ++  R
Sbjct: 280 RQEEAQEKKEEKK-KEEREAKLAKLSPEEQRKLEEKERKKQAR 321



 Score = 29.1 bits (66), Expect = 3.7
 Identities = 12/58 (20%), Positives = 28/58 (48%), Gaps = 4/58 (6%)

Query: 35  EARKK----KKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEA 88
           E  +K    ++++++K  K  ++++   A  KK  +++E   A+  +     QR  E 
Sbjct: 256 EVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEE 313


>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
          (DUF1754).  This is a eukaryotic protein family of
          unknown function.
          Length = 90

 Score = 33.5 bits (77), Expect = 0.025
 Identities = 24/73 (32%), Positives = 32/73 (43%), Gaps = 6/73 (8%)

Query: 31 GSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAERE----EEMALTQRGK 86
          G K++ +KKKKKKKKK K K++         K   +    E  E E     E        
Sbjct: 14 GKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDEDNEKIEQEEDGMNL 73

Query: 87 EAAQ--FEEWQRQ 97
            A+  FEE QR+
Sbjct: 74 TEAERAFEEAQRK 86



 Score = 29.7 bits (67), Expect = 0.62
 Identities = 25/68 (36%), Positives = 35/68 (51%), Gaps = 4/68 (5%)

Query: 39  KKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFEEWQRQE 98
           K KK   KKKKKKKKKK    N  K     E E  E+    +  + G+E    E+ +++E
Sbjct: 13  KGKKIDVKKKKKKKKKK----NKSKEEVVTEKEEEEKSSAESDLKEGEEDEDNEKIEQEE 68

Query: 99  DQFHLEQA 106
           D  +L +A
Sbjct: 69  DGMNLTEA 76


>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.
          This domain is found to the N-terminus of bacterial
          signal peptidases of the S49 family (pfam01343).
          Length = 154

 Score = 34.8 bits (81), Expect = 0.032
 Identities = 12/29 (41%), Positives = 16/29 (55%)

Query: 36 ARKKKKKKKKKKKKKKKKKKNLCANVKKR 64
          A +K +KK +K K K +KKK      K R
Sbjct: 70 AWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98



 Score = 32.5 bits (75), Expect = 0.16
 Identities = 10/24 (41%), Positives = 14/24 (58%)

Query: 33 KVEARKKKKKKKKKKKKKKKKKKN 56
          K   + +KK +K K K +KKK K 
Sbjct: 69 KAWEKAEKKAEKAKAKAEKKKAKK 92



 Score = 32.5 bits (75), Expect = 0.17
 Identities = 9/33 (27%), Positives = 16/33 (48%)

Query: 32 SKVEARKKKKKKKKKKKKKKKKKKNLCANVKKR 64
           K E +  +K +KK +K K K +K      + +
Sbjct: 64 DKKELKAWEKAEKKAEKAKAKAEKKKAKKEEPK 96



 Score = 32.1 bits (74), Expect = 0.21
 Identities = 11/33 (33%), Positives = 15/33 (45%), Gaps = 2/33 (6%)

Query: 35 EARKK--KKKKKKKKKKKKKKKKNLCANVKKRR 65
          +  K   K +KK +K K K +KK       K R
Sbjct: 66 KELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98



 Score = 27.9 bits (63), Expect = 6.6
 Identities = 11/22 (50%), Positives = 14/22 (63%)

Query: 36 ARKKKKKKKKKKKKKKKKKKNL 57
          A K K K +KKK KK++ K  L
Sbjct: 78 AEKAKAKAEKKKAKKEEPKPRL 99


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 35.9 bits (82), Expect = 0.050
 Identities = 24/84 (28%), Positives = 44/84 (52%), Gaps = 3/84 (3%)

Query: 30   TGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAA 89
            TG   EARK ++ KKK +  +K ++     +    R+  EA +AE  + + + ++ ++A 
Sbjct: 1108 TGKAEEARKAEEAKKKAEDARKAEEARKAEDA---RKAEEARKAEDAKRVEIARKAEDAR 1164

Query: 90   QFEEWQRQEDQFHLEQARLRSTIR 113
            + EE ++ ED    E AR    +R
Sbjct: 1165 KAEEARKAEDAKKAEAARKAEEVR 1188



 Score = 33.6 bits (76), Expect = 0.26
 Identities = 20/88 (22%), Positives = 46/88 (52%)

Query: 33   KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFE 92
            K E  KK  + KKK ++ KKK        + +++  EA++AE  ++    ++ +EA + +
Sbjct: 1478 KAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKAD 1537

Query: 93   EWQRQEDQFHLEQARLRSTIRIQEGRAK 120
            E ++ E++   ++ +    ++  E + K
Sbjct: 1538 EAKKAEEKKKADELKKAEELKKAEEKKK 1565



 Score = 32.8 bits (74), Expect = 0.37
 Identities = 10/68 (14%), Positives = 31/68 (45%)

Query: 33   KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFE 92
            K +A + KK  +  K + +       A  +K     + +   +++  A  ++ +E  + +
Sbjct: 1335 KKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKAD 1394

Query: 93   EWQRQEDQ 100
            E +++ ++
Sbjct: 1395 EAKKKAEE 1402



 Score = 32.0 bits (72), Expect = 0.78
 Identities = 22/77 (28%), Positives = 41/77 (53%), Gaps = 2/77 (2%)

Query: 33   KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEE--MALTQRGKEAAQ 90
            K E  KK  + KK ++KKK  +      +KK  ++++AE A++ EE      ++ +EA +
Sbjct: 1529 KAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKK 1588

Query: 91   FEEWQRQEDQFHLEQAR 107
             EE + +E     E+ +
Sbjct: 1589 AEEARIEEVMKLYEEEK 1605



 Score = 31.3 bits (70), Expect = 1.1
 Identities = 19/93 (20%), Positives = 47/93 (50%), Gaps = 3/93 (3%)

Query: 33   KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFE 92
            K +  KK ++ KK  + KK ++       KK  ++++A+  ++ EE+   ++ +E  + E
Sbjct: 1511 KADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEEL---KKAEEKKKAE 1567

Query: 93   EWQRQEDQFHLEQARLRSTIRIQEGRAKPIDLL 125
            E ++ E+  ++   +     + +E R + +  L
Sbjct: 1568 EAKKAEEDKNMALRKAEEAKKAEEARIEEVMKL 1600



 Score = 31.3 bits (70), Expect = 1.2
 Identities = 12/65 (18%), Positives = 31/65 (47%), Gaps = 1/65 (1%)

Query: 14   EDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERA 73
            E+  ++   A+ + +      EA K + +    + +  ++K    A  KK   +++A+ A
Sbjct: 1325 EEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAE-AAEKKKEEAKKKADAA 1383

Query: 74   EREEE 78
            +++ E
Sbjct: 1384 KKKAE 1388



 Score = 31.3 bits (70), Expect = 1.3
 Identities = 20/87 (22%), Positives = 47/87 (54%), Gaps = 3/87 (3%)

Query: 35   EARKKKKKKKKK---KKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQF 91
            EA+K  + KKK    KK ++ KK +     ++ ++  EA++AE +++    ++ +E  + 
Sbjct: 1501 EAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKA 1560

Query: 92   EEWQRQEDQFHLEQARLRSTIRIQEGR 118
            EE ++ E+    E+ +  +  + +E +
Sbjct: 1561 EEKKKAEEAKKAEEDKNMALRKAEEAK 1587



 Score = 31.3 bits (70), Expect = 1.3
 Identities = 20/75 (26%), Positives = 37/75 (49%)

Query: 33   KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFE 92
            K E  KK  + KKK ++ KKK        ++ ++  EA +AE E      +  +E A+  
Sbjct: 1310 KAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAA 1369

Query: 93   EWQRQEDQFHLEQAR 107
            E +++E +   + A+
Sbjct: 1370 EKKKEEAKKKADAAK 1384



 Score = 31.3 bits (70), Expect = 1.4
 Identities = 16/62 (25%), Positives = 30/62 (48%)

Query: 39   KKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFEEWQRQE 98
            KKK    KKK ++ KK    A  +      EAE AE + E A  ++ +   + +  +++ 
Sbjct: 1328 KKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKA 1387

Query: 99   DQ 100
            ++
Sbjct: 1388 EE 1389



 Score = 30.9 bits (69), Expect = 1.8
 Identities = 14/47 (29%), Positives = 29/47 (61%)

Query: 32   SKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEE 78
            +  EA   ++K +  +KKK++ KK   A  KK  ++++A+ A+++ E
Sbjct: 1355 AADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAE 1401



 Score = 30.5 bits (68), Expect = 2.0
 Identities = 21/86 (24%), Positives = 44/86 (51%), Gaps = 4/86 (4%)

Query: 23   ARMQVQGTGSKVEARKKKKKKKKKKKK-KKKKKKNLCANVKKRRQEREAERAEREEEMAL 81
            A+ + +    K +  KK  + KKK  + KK ++       KK  + ++A+ A++ EE   
Sbjct: 1488 AKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEK-- 1545

Query: 82   TQRGKEAAQFEEWQRQEDQFHLEQAR 107
             ++  E  + EE ++ E++   E+A+
Sbjct: 1546 -KKADELKKAEELKKAEEKKKAEEAK 1570



 Score = 30.5 bits (68), Expect = 2.2
 Identities = 21/91 (23%), Positives = 47/91 (51%), Gaps = 2/91 (2%)

Query: 10   VYSEEDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQERE 69
            +   E++ +++   + +      K E  KK +++ K K  ++ KK     + KK  + ++
Sbjct: 1625 LKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAE--EDKKKAEEAKK 1682

Query: 70   AERAEREEEMALTQRGKEAAQFEEWQRQEDQ 100
            AE  E++   AL +  +EA + EE +++E +
Sbjct: 1683 AEEDEKKAAEALKKEAEEAKKAEELKKKEAE 1713



 Score = 30.5 bits (68), Expect = 2.3
 Identities = 28/109 (25%), Positives = 55/109 (50%), Gaps = 6/109 (5%)

Query: 13   EEDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKK---KKKKKKKKNLCANVKKRRQER- 68
             E+D ++   A+   +      EA KK+ ++ KK    KKK+ ++K     +KK  +E  
Sbjct: 1670 AEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENK 1729

Query: 69   -EAERAEREEEMALTQRGKEAAQFEEWQRQEDQFHLEQARLRSTIRIQE 116
             +AE A++E E    ++ +EA + EE +++      E+ +    IR ++
Sbjct: 1730 IKAEEAKKEAEED-KKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEK 1777



 Score = 30.1 bits (67), Expect = 2.5
 Identities = 17/73 (23%), Positives = 40/73 (54%)

Query: 35   EARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFEEW 94
            EA+K  + KKK ++ KK  +    A   K++ +   + AE +++    ++ +EA + +E 
Sbjct: 1468 EAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEA 1527

Query: 95   QRQEDQFHLEQAR 107
            ++ E+    ++A+
Sbjct: 1528 KKAEEAKKADEAK 1540



 Score = 30.1 bits (67), Expect = 2.6
 Identities = 26/132 (19%), Positives = 60/132 (45%), Gaps = 4/132 (3%)

Query: 35   EARKKKKKKKKKKK--KKKKKKKNLCANVKKRRQE--REAERAEREEEMALTQRGKEAAQ 90
            EA+K ++ +KK  +  KK+ ++      +KK+  E  ++AE  ++ EE    +  +   +
Sbjct: 1679 EAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKE 1738

Query: 91   FEEWQRQEDQFHLEQARLRSTIRIQEGRAKPIDLLAKYIGSEAEVDAVEMHEPYTYLTGL 150
             EE +++ ++   ++   +    +++   K  + + K   +  E +  E  E        
Sbjct: 1739 AEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDK 1798

Query: 151  AIKDLEDLLEDI 162
             IKD+ D   +I
Sbjct: 1799 KIKDIFDNFANI 1810



 Score = 29.7 bits (66), Expect = 3.6
 Identities = 27/136 (19%), Positives = 57/136 (41%)

Query: 39   KKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFEEWQRQE 98
            KKK  + KK  + KKK +     ++ ++  EA++AE  ++    ++ +E  + +E ++ E
Sbjct: 1496 KKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAE 1555

Query: 99   DQFHLEQARLRSTIRIQEGRAKPIDLLAKYIGSEAEVDAVEMHEPYTYLTGLAIKDLEDL 158
            +    E+ +     +  E         A+      E    E+ + Y     +  ++ +  
Sbjct: 1556 ELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKA 1615

Query: 159  LEDIKVYMELEKGENE 174
             E      EL+K E E
Sbjct: 1616 EEAKIKAEELKKAEEE 1631



 Score = 29.7 bits (66), Expect = 3.7
 Identities = 17/71 (23%), Positives = 40/71 (56%)

Query: 35   EARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFEEW 94
            +A + K K ++ KK +++KKK      K+  ++++AE  ++ EE    +  +EA + EE 
Sbjct: 1614 KAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEED 1673

Query: 95   QRQEDQFHLEQ 105
            +++ ++    +
Sbjct: 1674 KKKAEEAKKAE 1684



 Score = 28.6 bits (63), Expect = 7.5
 Identities = 18/70 (25%), Positives = 36/70 (51%), Gaps = 2/70 (2%)

Query: 33   KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMA--LTQRGKEAAQ 90
            K E  KKK  + KK    KKK        +++++  EA++   E + A    ++ +EA +
Sbjct: 1399 KAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKK 1458

Query: 91   FEEWQRQEDQ 100
             EE +++ ++
Sbjct: 1459 AEEAKKKAEE 1468



 Score = 28.6 bits (63), Expect = 8.9
 Identities = 18/84 (21%), Positives = 41/84 (48%)

Query: 37   RKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFEEWQR 96
            +K ++ KK  + KKK ++       KK+ +E + +  E ++     ++  EA + EE ++
Sbjct: 1464 KKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKK 1523

Query: 97   QEDQFHLEQARLRSTIRIQEGRAK 120
             ++    E+A+     +  E + K
Sbjct: 1524 ADEAKKAEEAKKADEAKKAEEKKK 1547


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 34.3 bits (79), Expect = 0.069
 Identities = 15/38 (39%), Positives = 27/38 (71%)

Query: 32  SKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQERE 69
            K E +KKK+ KK+KK+KK KK+K +     K++++++
Sbjct: 153 EKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKK 190



 Score = 31.6 bits (72), Expect = 0.47
 Identities = 14/36 (38%), Positives = 24/36 (66%)

Query: 37  RKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAER 72
            KKKKK+ KK+KK+KK KK      K  +++++ ++
Sbjct: 156 EKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKK 191



 Score = 30.1 bits (68), Expect = 1.7
 Identities = 13/33 (39%), Positives = 21/33 (63%)

Query: 33  KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRR 65
           K E +K+KK+KK KK+K  + K +     KK++
Sbjct: 160 KKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKK 192



 Score = 28.5 bits (64), Expect = 4.8
 Identities = 14/46 (30%), Positives = 25/46 (54%), Gaps = 2/46 (4%)

Query: 30  TGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAER 75
           T    +  + ++++KK+KKKKK+ KK      KK ++E+  E    
Sbjct: 140 TAKVEKEAEVEEEEKKEKKKKKEVKKEK--KEKKDKKEKMVEPKGS 183



 Score = 28.5 bits (64), Expect = 5.0
 Identities = 15/25 (60%), Positives = 16/25 (64%), Gaps = 6/25 (24%)

Query: 37  RKKKKKKK------KKKKKKKKKKK 55
            KK KK+K       KKKKKKKKKK
Sbjct: 169 EKKDKKEKMVEPKGSKKKKKKKKKK 193



 Score = 27.8 bits (62), Expect = 8.1
 Identities = 12/37 (32%), Positives = 26/37 (70%)

Query: 36  ARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAER 72
            +KKKK+ KK+KK+KK KK+ +      ++++++ ++
Sbjct: 156 EKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKK 192


>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
          Length = 813

 Score = 35.1 bits (81), Expect = 0.075
 Identities = 14/42 (33%), Positives = 22/42 (52%)

Query: 37  RKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEE 78
           R +KK K K  KK  +K K   A  +K     +A+RA +++ 
Sbjct: 770 RGEKKAKPKAAKKDARKAKKPSAKTQKIAAATKAKRAAKKKV 811



 Score = 30.1 bits (68), Expect = 2.6
 Identities = 12/43 (27%), Positives = 18/43 (41%)

Query: 31  GSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERA 73
           G K    K  KK  +K KK   K + + A  K +R  ++    
Sbjct: 771 GEKKAKPKAAKKDARKAKKPSAKTQKIAAATKAKRAAKKKVAE 813


>gnl|CDD|149438 pfam08374, Protocadherin, Protocadherin.  The structure of
           protocadherins is similar to that of classic cadherins
           (pfam00028), but particularly on the cytoplasmic domains
           they also have some unique features. They are expressed
           in a variety of organisms and are found in high
           concentrations in the brain where they seem to be
           localised mainly at cell-cell contact sites. Their
           expression seems to be developmentally regulated.
          Length = 223

 Score = 34.1 bits (78), Expect = 0.081
 Identities = 18/52 (34%), Positives = 27/52 (51%), Gaps = 1/52 (1%)

Query: 28  QGTGSKVEARKKKKKKKKKKKKKKKKKKNLCAN-VKKRRQEREAERAEREEE 78
           + T       ++ K+KKKKK KKKK  K+L  N V     + + E  E++ E
Sbjct: 77  KETEDWFSPNQENKQKKKKKDKKKKSPKSLLLNFVTVEESKPDDEVHEQKSE 128


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 34.2 bits (79), Expect = 0.091
 Identities = 19/65 (29%), Positives = 33/65 (50%), Gaps = 4/65 (6%)

Query: 26  QVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRG 85
           + +   +KV A+ K KK  K K ++K+K K      K+ +  RE E    E+E+A   R 
Sbjct: 44  EKEEEKAKVAAKAKAKKALKAKIEEKEKAKRE----KEEKGLRELEEDTPEDELAEKLRL 99

Query: 86  KEAAQ 90
           ++  +
Sbjct: 100 RKLQE 104



 Score = 33.1 bits (76), Expect = 0.17
 Identities = 15/51 (29%), Positives = 19/51 (37%), Gaps = 5/51 (9%)

Query: 32 SKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALT 82
           + E  K K   K K KK  K K       +K + +RE E     E    T
Sbjct: 43 EEKEEEKAKVAAKAKAKKALKAKIE-----EKEKAKREKEEKGLRELEEDT 88


>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
           complex aNOP56 subunit; Provisional.
          Length = 414

 Score = 34.6 bits (80), Expect = 0.092
 Identities = 12/36 (33%), Positives = 26/36 (72%), Gaps = 6/36 (16%)

Query: 37  RKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAER 72
           +K+++KK +K+KKKKK+KK      K ++++++  +
Sbjct: 385 KKREEKKPQKRKKKKKRKK------KGKKRKKKGRK 414



 Score = 34.2 bits (79), Expect = 0.11
 Identities = 14/23 (60%), Positives = 19/23 (82%)

Query: 33  KVEARKKKKKKKKKKKKKKKKKK 55
           K E +K +K+KKKKK+KKK KK+
Sbjct: 386 KREEKKPQKRKKKKKRKKKGKKR 408



 Score = 31.5 bits (72), Expect = 0.85
 Identities = 14/36 (38%), Positives = 23/36 (63%), Gaps = 5/36 (13%)

Query: 33  KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQER 68
           K +  +KK +K+KKKKK+KKK K      +K++  +
Sbjct: 384 KKKREEKKPQKRKKKKKRKKKGKK-----RKKKGRK 414



 Score = 31.5 bits (72), Expect = 0.92
 Identities = 14/35 (40%), Positives = 24/35 (68%), Gaps = 5/35 (14%)

Query: 38  KKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAER 72
            KKK+++KK +K+KKKKK      KK+ ++R+ + 
Sbjct: 383 PKKKREEKKPQKRKKKKKR-----KKKGKKRKKKG 412



 Score = 28.4 bits (64), Expect = 6.9
 Identities = 11/40 (27%), Positives = 25/40 (62%), Gaps = 5/40 (12%)

Query: 36  ARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAER 75
            +  KKK+++KK +K+KKKK      +K++ ++  ++  +
Sbjct: 380 PKPPKKKREEKKPQKRKKKKK-----RKKKGKKRKKKGRK 414


>gnl|CDD|240254 PTZ00069, PTZ00069, 60S ribosomal protein L5; Provisional.
          Length = 300

 Score = 34.3 bits (79), Expect = 0.093
 Identities = 20/42 (47%), Positives = 25/42 (59%), Gaps = 1/42 (2%)

Query: 35  EARKKKKKKKKKKKKKKKKKKNLCANVKKRR-QEREAERAER 75
              KKKKKKKKK   KK K K L    +K R + ++A+R ER
Sbjct: 253 SKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRRER 294


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 34.6 bits (80), Expect = 0.10
 Identities = 24/120 (20%), Positives = 50/120 (41%), Gaps = 2/120 (1%)

Query: 11   YSEEDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKK--RRQER 68
              E ++ +     R++ +  G   + RK K KKK+KKKKK    K+  A+V    +R + 
Sbjct: 1144 QEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDS 1203

Query: 69   EAERAEREEEMALTQRGKEAAQFEEWQRQEDQFHLEQARLRSTIRIQEGRAKPIDLLAKY 128
            + +R   ++          + Q ++ +++         RL+S        ++  D  +  
Sbjct: 1204 DEKRKLDDKPDNKKSNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSD 1263



 Score = 28.5 bits (64), Expect = 9.2
 Identities = 19/64 (29%), Positives = 31/64 (48%)

Query: 14   EDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERA 73
            E+  +       + Q   SK + +  K +K K KKK+KKKKK+     KK      ++R 
Sbjct: 1142 EEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRV 1201

Query: 74   EREE 77
            + +E
Sbjct: 1202 DSDE 1205


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 34.5 bits (79), Expect = 0.11
 Identities = 18/72 (25%), Positives = 40/72 (55%), Gaps = 6/72 (8%)

Query: 28  QGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKE 87
            G   + E  + K++KKKKK+K K++ K+     +K ++E + +R  +E+E    ++  E
Sbjct: 102 SGKEEEKEKEQVKEEKKKKKEKPKEEPKD-----RKPKEEAKEKRPPKEKEKE-KEKKVE 155

Query: 88  AAQFEEWQRQED 99
             +  E +++ +
Sbjct: 156 EPRDREEEKKRE 167



 Score = 31.4 bits (71), Expect = 0.91
 Identities = 21/70 (30%), Positives = 39/70 (55%), Gaps = 2/70 (2%)

Query: 32  SKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQ-EREAERAEREEEMALTQRGKEAAQ 90
            K EA++K+  K+K+K+K+KK ++      +K+R+  R   R ++  +     + KE  +
Sbjct: 133 PKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPE 192

Query: 91  FEEWQRQEDQ 100
            EE QRQ  +
Sbjct: 193 -EEKQRQAAR 201



 Score = 28.7 bits (64), Expect = 7.0
 Identities = 15/79 (18%), Positives = 42/79 (53%), Gaps = 9/79 (11%)

Query: 26  QVQGTGSKVEA----RKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMAL 81
           +V+  GSK  A      K+ K +  K+++K+K++      K+ +++++ +  E  ++   
Sbjct: 79  RVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQV-----KEEKKKKKEKPKEEPKDRKP 133

Query: 82  TQRGKEAAQFEEWQRQEDQ 100
            +  KE    +E ++++++
Sbjct: 134 KEEAKEKRPPKEKEKEKEK 152


>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2.  It is vital for
           effective cell-replication that replication is not
           stalled at any point by, for instance, damaged bases.
           Rtf2 stabilizes the replication fork stalled at the
           site-specific replication barrier RTS1 by preventing
           replication restart until completion of DNA synthesis by
           a converging replication fork initiated at a flanking
           origin. The RTS1 element terminates replication forks
           that are moving in the cen2-distal direction while
           allowing forks moving in the cen2-proximal direction to
           pass through the region. Rtf2 contains a C2HC2 motif
           related to the C3HC4 RING-finger motif, and would appear
           to fold up, creating a RING finger-like structure but
           forming only one functional Zn2+ ion-binding site.
          Length = 254

 Score = 33.9 bits (78), Expect = 0.12
 Identities = 21/82 (25%), Positives = 33/82 (40%), Gaps = 8/82 (9%)

Query: 11  YSEEDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREA 70
           YSEED         + +  T  +VE  K + ++++ KKKKKKKKK    N          
Sbjct: 154 YSEED--------VVPLNPTEEEVELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEAT 205

Query: 71  ERAEREEEMALTQRGKEAAQFE 92
             +    E++        A+  
Sbjct: 206 VSSAVPTELSSGAGQVGEAKKL 227


>gnl|CDD|177283 PHA00451, PHA00451, protein kinase.
          Length = 362

 Score = 34.0 bits (78), Expect = 0.14
 Identities = 16/90 (17%), Positives = 34/90 (37%), Gaps = 3/90 (3%)

Query: 37  RKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFEEWQ- 95
             +K   K++K+ +K + +N       R +  +  R E    M +  +    A       
Sbjct: 264 VARKAAMKRRKRNRKLRARNAKELAAMRMEANQIRRNEPRARMLMGDKHWHNAWLHNNNL 323

Query: 96  --RQEDQFHLEQARLRSTIRIQEGRAKPID 123
              + ++      ++   + IQ G+  PID
Sbjct: 324 DFAKLEERAALAWQMADGLAIQWGKPLPID 353


>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
           domain.  This domain is found in a number of different
           types of plant proteins including NAM-like proteins.
          Length = 147

 Score = 32.7 bits (75), Expect = 0.14
 Identities = 21/66 (31%), Positives = 37/66 (56%), Gaps = 6/66 (9%)

Query: 35  EARKKKKKKKKKKKKKKKKKKNLCANVKKRRQER-EAERAEREEEMALTQRGKEAAQFEE 93
           E RKK K+K ++ K K KK++   A  +K ++ER     AE E+E A  +  K+ A+ + 
Sbjct: 63  EGRKKAKEKLRRDKLKAKKEE---AEKEKEKEERFMKALAEAEKERAELE--KKKAEAKL 117

Query: 94  WQRQED 99
            + ++ 
Sbjct: 118 MKEEKK 123


>gnl|CDD|233352 TIGR01310, L7, 60S ribosomal protein L7, eukaryotic.  This model
          describes the eukaryotic 60S (cytosolic) ribosomal
          protein L7 and paralogs that may or may not also be L7.
          Human, Drosophila, and Arabidopsis all have both a
          typical L7 and an L7-related paralog. This family is
          designated subfamily rather than equivalog to reflect
          these uncharacterized paralogs. Members of this family
          average ~ 250 residues in length, somewhat longer than
          the archaeal L30P/L7E homolog (~ 155 residues) and much
          longer than the related bacterial/organellar form (~ 60
          residues).
          Length = 235

 Score = 33.1 bits (76), Expect = 0.17
 Identities = 18/48 (37%), Positives = 27/48 (56%), Gaps = 1/48 (2%)

Query: 32 SKVEARKKKKKKKKKKKKKKKKKKNLCANVKK-RRQEREAERAEREEE 78
          +   A++ K KKK  KKK+K   K   + VK+ R+ ERE  R +R  +
Sbjct: 14 AVQVAKQAKAKKKANKKKRKIYFKRAESFVKEYRKAEREIIRLKRLAK 61



 Score = 28.1 bits (63), Expect = 7.5
 Identities = 15/48 (31%), Positives = 20/48 (41%), Gaps = 5/48 (10%)

Query: 32 SKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQER---EAERAERE 76
           K       +  K+ K KKK  KK     +  +R E    E  +AERE
Sbjct: 7  RKARQELAVQVAKQAKAKKKANKKK--RKIYFKRAESFVKEYRKAERE 52



 Score = 28.1 bits (63), Expect = 8.0
 Identities = 15/82 (18%), Positives = 38/82 (46%), Gaps = 2/82 (2%)

Query: 35  EARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFEEW 94
           E   KK+K +++   +  K+        K++++   +RAE   +       +E  + +  
Sbjct: 1   ELLLKKRKARQELAVQVAKQAKAKKKANKKKRKIYFKRAESFVKEYRKAE-REIIRLKRL 59

Query: 95  QRQEDQFHLE-QARLRSTIRIQ 115
            ++  +F++  + +L   IRI+
Sbjct: 60  AKKPGKFYVPAEHKLVFVIRIK 81


>gnl|CDD|219564 pfam07771, TSGP1, Tick salivary peptide group 1.  This contains a
           group of peptides derived from a salivary gland cDNA
           library of the tick Ixodes scapularis. Also present are
           peptides from a related tick species, Ixodes ricinus.
           They are characterized by a putative signal peptide
           indicative of secretion and conserved cysteine residues.
          Length = 120

 Score = 31.8 bits (72), Expect = 0.19
 Identities = 14/29 (48%), Positives = 14/29 (48%)

Query: 27  VQGTGSKVEARKKKKKKKKKKKKKKKKKK 55
                   E  KKKKKK KK KK KK  K
Sbjct: 90  DDHPPEPTEKPKKKKKKSKKTKKPKKSSK 118



 Score = 31.0 bits (70), Expect = 0.36
 Identities = 13/20 (65%), Positives = 15/20 (75%)

Query: 37  RKKKKKKKKKKKKKKKKKKN 56
           +KKKKK KK KK KK  KK+
Sbjct: 101 KKKKKKSKKTKKPKKSSKKD 120



 Score = 30.2 bits (68), Expect = 0.65
 Identities = 14/22 (63%), Positives = 16/22 (72%)

Query: 35  EARKKKKKKKKKKKKKKKKKKN 56
           E  +K KKKKKK KK KK KK+
Sbjct: 95  EPTEKPKKKKKKSKKTKKPKKS 116



 Score = 30.2 bits (68), Expect = 0.68
 Identities = 12/19 (63%), Positives = 12/19 (63%)

Query: 39  KKKKKKKKKKKKKKKKKNL 57
           KKKKKK KK KK KK    
Sbjct: 101 KKKKKKSKKTKKPKKSSKK 119



 Score = 29.8 bits (67), Expect = 1.0
 Identities = 12/22 (54%), Positives = 14/22 (63%)

Query: 33  KVEARKKKKKKKKKKKKKKKKK 54
           K + +KKK KK KK KK  KK 
Sbjct: 99  KPKKKKKKSKKTKKPKKSSKKD 120



 Score = 26.8 bits (59), Expect = 9.6
 Identities = 12/26 (46%), Positives = 13/26 (50%)

Query: 30  TGSKVEARKKKKKKKKKKKKKKKKKK 55
                    +K KKKKKK KK KK K
Sbjct: 89  PDDHPPEPTEKPKKKKKKSKKTKKPK 114


>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa).  Members of this
          family of proteins are part of the yeast nuclear pore
          complex-associated pre-60S ribosomal subunit. The
          family functions as a highly conserved exonuclease that
          is required for the 5'-end maturation of 5.8S and 25S
          rRNAs, demonstrating that 5'-end processing also has a
          redundant pathway. Nop25 binds late pre-60S ribosomes,
          accompanying them from the nucleolus to the nuclear
          periphery; and there is evidence for both physical and
          functional links between late 60S subunit processing
          and export.
          Length = 134

 Score = 31.9 bits (73), Expect = 0.19
 Identities = 16/61 (26%), Positives = 35/61 (57%), Gaps = 4/61 (6%)

Query: 37 RKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAE----RAEREEEMALTQRGKEAAQFE 92
          RK++++KK +++ K+K+++      K+ R+ER+ E      ER+E + L +   +  +  
Sbjct: 30 RKQQRRKKAQEEAKEKEREERIEERKRIREERKQELEKQLKERKEALKLLEEENDDEEDA 89

Query: 93 E 93
          E
Sbjct: 90 E 90


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 33.4 bits (77), Expect = 0.21
 Identities = 18/52 (34%), Positives = 29/52 (55%)

Query: 27  VQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEE 78
           ++    K E +++++KK+KKKK    KKK      +K ++E E E  E E E
Sbjct: 412 IKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAE 463



 Score = 33.0 bits (76), Expect = 0.26
 Identities = 21/70 (30%), Positives = 39/70 (55%), Gaps = 6/70 (8%)

Query: 30  TGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAA 89
           TGSK   +K KK  +K +KK++++KK      +K+++    ++ E EEE    ++ +E  
Sbjct: 403 TGSKKATKKIKKIVEKAEKKREEEKK------EKKKKAFAGKKKEEEEEEEKEKKEEEKE 456

Query: 90  QFEEWQRQED 99
           + EE   +E 
Sbjct: 457 EEEEEAEEEK 466



 Score = 30.7 bits (70), Expect = 1.4
 Identities = 21/56 (37%), Positives = 32/56 (57%)

Query: 33  KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEA 88
           K E  KK+KKKK    KKK++++      K+  +E E E AE E+E    ++ K+A
Sbjct: 422 KREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKKQA 477



 Score = 30.7 bits (70), Expect = 1.5
 Identities = 16/70 (22%), Positives = 35/70 (50%)

Query: 32  SKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQF 91
            K+  + +KK++++KK+KKKK         ++  ++ + E  + EEE    +  +E  + 
Sbjct: 413 KKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEK 472

Query: 92  EEWQRQEDQF 101
           ++ Q     F
Sbjct: 473 KKKQATLFDF 482


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
           approximately 300 residues, found in plants and
           vertebrates. They contain a highly conserved DDRGK
           motif.
          Length = 189

 Score = 32.7 bits (75), Expect = 0.21
 Identities = 16/85 (18%), Positives = 40/85 (47%), Gaps = 12/85 (14%)

Query: 33  KVEARKKKKKKKKKKKK------------KKKKKKNLCANVKKRRQEREAERAEREEEMA 80
           K+ A+K+ K ++K+ ++            +KK ++      K+  +  E    ++EEE  
Sbjct: 2   KIGAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEER 61

Query: 81  LTQRGKEAAQFEEWQRQEDQFHLEQ 105
             +  +   + EE+++ +  F +E+
Sbjct: 62  KEREEQARKEQEEYEKLKSSFVVEE 86



 Score = 30.4 bits (69), Expect = 1.2
 Identities = 23/69 (33%), Positives = 36/69 (52%), Gaps = 7/69 (10%)

Query: 42  KKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREE-EMALTQRGKEAAQFEEWQRQEDQ 100
           KK   KK+ K ++K      + RRQ+REAE  EREE +    +R  E  + EE + + ++
Sbjct: 1   KKIGAKKRAKLEEK------QARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREK 54

Query: 101 FHLEQARLR 109
              E+ R  
Sbjct: 55  KKEEEERKE 63


>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
          Length = 186

 Score = 32.4 bits (74), Expect = 0.21
 Identities = 22/59 (37%), Positives = 27/59 (45%), Gaps = 1/59 (1%)

Query: 35  EARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFEE 93
           E +   KK K  K KK  KK  L A  KK  + R    AE++   A     +EAA  EE
Sbjct: 111 EGKVNAKKDKLSKAKKAAKKAALEAE-KKVNEARAEAVAEKKAAEAAAVAAEEAAAAEE 168



 Score = 30.5 bits (69), Expect = 1.2
 Identities = 18/62 (29%), Positives = 23/62 (37%)

Query: 32  SKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQF 91
            KV A+K K  K KK  KK   +     N  +     E + AE     A      E  + 
Sbjct: 112 GKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAAEEAAAAEEEEA 171

Query: 92  EE 93
           EE
Sbjct: 172 EE 173


>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19. 
           Med19 represents a family of conserved proteins which
           are members of the multi-protein co-activator Mediator
           complex. Mediator is required for activation of RNA
           polymerase II transcription by DNA binding
           transactivators.
          Length = 178

 Score = 32.5 bits (74), Expect = 0.22
 Identities = 16/40 (40%), Positives = 25/40 (62%), Gaps = 5/40 (12%)

Query: 31  GSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREA 70
           G K   +K KKKK +  K++KKKKK      KK++++R +
Sbjct: 136 GLKGHEKKHKKKKHEDDKERKKKKKE-----KKKKKKRHS 170



 Score = 29.8 bits (67), Expect = 1.7
 Identities = 18/40 (45%), Positives = 23/40 (57%), Gaps = 9/40 (22%)

Query: 35  EARKKKKKK---KKKKKKKKKKKKNLCANVKKRRQEREAE 71
           E + KKKK    K++KKKKK+KKK      KK+R   E  
Sbjct: 141 EKKHKKKKHEDDKERKKKKKEKKK------KKKRHSPEHP 174


>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain.  Axonemal
           dynein light chain proteins play a dynamic role in
           flagellar and cilia motility. Eukaryotic cilia and
           flagella are complex organelles consisting of a core
           structure, the axoneme, which is composed of nine
           microtubule doublets forming a cylinder that surrounds a
           pair of central singlet microtubules. This
           ultra-structural arrangement seems to be one of the most
           stable micro-tubular assemblies known and is responsible
           for the flagellar and ciliary movement of a large number
           of organisms ranging from protozoan to mammals. This
           light chain interacts directly with the N-terminal half
           of the heavy chains.
          Length = 189

 Score = 32.2 bits (74), Expect = 0.26
 Identities = 16/61 (26%), Positives = 32/61 (52%), Gaps = 4/61 (6%)

Query: 24  RMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNL---CANVKKRRQE-REAERAEREEEM 79
           R  +Q    K E  ++ KK +++K++ +K+   L      ++KR +E R+ E     +E+
Sbjct: 113 RKALQAEQGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHADEI 172

Query: 80  A 80
           A
Sbjct: 173 A 173


>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 142

 Score = 31.6 bits (72), Expect = 0.28
 Identities = 17/43 (39%), Positives = 24/43 (55%)

Query: 36  ARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEE 78
           A+++KKK+KKKKKKK KK          +  E  ++  E  EE
Sbjct: 88  AKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDEEEEGEE 130



 Score = 30.8 bits (70), Expect = 0.62
 Identities = 18/46 (39%), Positives = 27/46 (58%)

Query: 33  KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEE 78
           K   ++ K++KKK+KKKKKKK K      +K   +   E ++ EEE
Sbjct: 82  KTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDEEEE 127



 Score = 28.5 bits (64), Expect = 3.4
 Identities = 18/44 (40%), Positives = 24/44 (54%)

Query: 35  EARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEE 78
           E   KK+ K++KKK+KKKKKK      KK  +E      E  +E
Sbjct: 81  EKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDE 124


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 32.9 bits (75), Expect = 0.29
 Identities = 23/118 (19%), Positives = 52/118 (44%), Gaps = 8/118 (6%)

Query: 22  YARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMAL 81
           Y R Q Q   +K    ++KKK++++ ++ ++K+      +K+  +ER A + ++++    
Sbjct: 64  YNRQQQQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEA 123

Query: 82  TQRGKEAAQFEEWQRQEDQFHLEQARLRSTIRIQEGRAKPIDLLAKYIGSEAEVDAVE 139
            ++     +  E    +       A+ ++     E  AK     AK   +EA+  A  
Sbjct: 124 AKQAALKQKQAEEAAAKA---AAAAKAKA-----EAEAKRAAAAAKKAAAEAKKKAEA 173



 Score = 28.6 bits (64), Expect = 5.9
 Identities = 18/69 (26%), Positives = 26/69 (37%)

Query: 33  KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFE 92
           K  A  KKK + + KKK   + K   A   K    + A  A+   E A   +  E A   
Sbjct: 194 KAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAA 253

Query: 93  EWQRQEDQF 101
           +   + D  
Sbjct: 254 KAAAEVDDL 262


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 32.8 bits (75), Expect = 0.32
 Identities = 19/93 (20%), Positives = 43/93 (46%), Gaps = 12/93 (12%)

Query: 28  QGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKE 87
           +G  S+ E   K  + K+K K ++ K+       K+R++     + E++ +    ++  +
Sbjct: 260 EGFESEYEPINKPVRPKRKTKAQRNKE-------KRRKELEREAKEEKQLK----KKLAQ 308

Query: 88  AAQFEEWQRQEDQFHLEQARLRSTIRIQEGRAK 120
            A+ +E  ++  Q    +AR +   R + G  K
Sbjct: 309 LARLKEIAKEVAQKEKARARKKE-QRKERGEKK 340


>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
          Length = 213

 Score = 31.8 bits (73), Expect = 0.40
 Identities = 9/40 (22%), Positives = 22/40 (55%)

Query: 33  KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAER 72
           KV+A++ +++ KK++   +K+K        K +  R+  +
Sbjct: 109 KVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRK 148



 Score = 31.8 bits (73), Expect = 0.42
 Identities = 11/65 (16%), Positives = 30/65 (46%)

Query: 23  ARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALT 82
           A++Q Q    + + R+   +K+K  ++++K K       +K R ++   +     +++  
Sbjct: 108 AKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISEL 167

Query: 83  QRGKE 87
             G+ 
Sbjct: 168 TVGQA 172


>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
           eukaryotic RNase H complex which cleaves RNA-DNA
           hybrids.  Ribonuclease H2B is one of the three proteins
           of eukaryotic RNase H2 complex that is required for
           nucleic acid binding and hydrolysis. RNase H is
           classified into two families, type I (prokaryotic RNase
           HI, eukaryotic RNase H1 and viral RNase H) and type II
           (prokaryotic RNase HII and HIII, and eukaryotic RNase
           H2/HII). RNase H endonucleolytically hydrolyzes an RNA
           strand when it is annealed to a complementary DNA strand
           in the presence of divalent cations, in DNA replication
           and repair. The enzyme can be found in bacteria,
           archaea, and eukaryotes. Most prokaryotic and eukaryotic
           genomes contain multiple RNase H genes. Despite a lack
           of evidence for homology from sequence comparisons, type
           I and type II RNase H share a common fold and similar
           steric configurations of the four acidic active-site
           residues, suggesting identical or very similar catalytic
           mechanisms. Eukaryotic RNase HII is active during
           replication and is believed to play a role in removal of
           Okazaki fragment primers and single ribonucleotides in
           DNA-DNA duplexes. Eukaryotic RNase HII is functional
           when it forms a complex with RNase H2B and RNase H2C
           proteins. It is speculated that the two accessory
           subunits are required for correct folding of the
           catalytic subunit of RNase HII. Mutations in the three
           subunits of human RNase HII cause neurological disorder.
          Length = 211

 Score = 31.9 bits (73), Expect = 0.44
 Identities = 14/35 (40%), Positives = 18/35 (51%)

Query: 21  VYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKK 55
             A    + + +  E +KKKKKK  KKKK KK   
Sbjct: 166 AEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAA 200


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kind of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 32.6 bits (74), Expect = 0.45
 Identities = 25/123 (20%), Positives = 50/123 (40%), Gaps = 4/123 (3%)

Query: 13  EEDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAER 72
           EE+    L+    +      K++  +K+ KK +K+ KK+K++        K  + +    
Sbjct: 296 EEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKREAE 355

Query: 73  AEREEEMALTQRGKEAAQFEEWQRQEDQFHLEQARLRSTIRIQEGRAKPIDLLAKYIGSE 132
            E EE++   Q   E  +    +    +  LE  RL S  +++E   +  +   K     
Sbjct: 356 EEEEEQLEKLQEKLEQLE----EELLAKKKLESERLSSAAKLKEEELELKNEEEKEAKLL 411

Query: 133 AEV 135
            E+
Sbjct: 412 LEL 414



 Score = 31.5 bits (71), Expect = 0.90
 Identities = 40/246 (16%), Positives = 94/246 (38%), Gaps = 26/246 (10%)

Query: 31  GSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQ 90
           GS+ + +KK++ KK  ++ +   +  +     K ++ +  E+A++  E    +   E   
Sbjct: 165 GSREKRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELE- 223

Query: 91  FEEWQRQEDQFHLEQARLRSTIRIQEGRAKPIDLLAKYIGSEAEVDAVEMHEPYTYLTGL 150
            EE     D   L + R+     +     + I+   + +  E E+               
Sbjct: 224 -EENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEI--------------- 267

Query: 151 AIKDLEDLLEDIKVYMELEKGENEAYWNDISIIVEDELHGLRKLEKQGHSDLTRREGIHE 210
               L  +L++ K   + +K + E    ++ ++ ++E     +L K     +   E + E
Sbjct: 268 ----LAQVLKENKEEEKEKKLQEE----ELKLLAKEEEELKSELLKLERRKVDDEEKLKE 319

Query: 211 SVAQDVASVFKGKTTAQLEALQTQIELKISSKTENIDNVITAEEKALQREARKGMGGDEA 270
           S  +++  + K     + E  + + ELK        +     + + LQ +  +      A
Sbjct: 320 SE-KELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLA 378

Query: 271 EFSVES 276
           +  +ES
Sbjct: 379 KKKLES 384



 Score = 30.3 bits (68), Expect = 2.4
 Identities = 35/228 (15%), Positives = 76/228 (33%), Gaps = 9/228 (3%)

Query: 33  KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFE 92
                + KK  +  + K+K + +          +  E      +E +   Q   E+++ E
Sbjct: 201 LKLKEQAKKALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQE 260

Query: 93  EWQRQEDQFHLEQARLRSTIRIQEGRAKPIDLLAKYIGSEAEVDAVEMHEPYTYLTGLAI 152
             + +E       A++    + +E   K  +   K +  E E    E+      L    +
Sbjct: 261 LEKEEE-----ILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSEL----LKLERRKV 311

Query: 153 KDLEDLLEDIKVYMELEKGENEAYWNDISIIVEDELHGLRKLEKQGHSDLTRREGIHESV 212
            D E L E  K   +LEK   +       +  E +   +++  ++   +   +       
Sbjct: 312 DDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQ 371

Query: 213 AQDVASVFKGKTTAQLEALQTQIELKISSKTENIDNVITAEEKALQRE 260
            ++     K   + +L +     E ++  K E         E + Q E
Sbjct: 372 LEEELLAKKKLESERLSSAAKLKEEELELKNEEEKEAKLLLELSEQEE 419


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 32.4 bits (73), Expect = 0.54
 Identities = 13/41 (31%), Positives = 29/41 (70%)

Query: 35  EARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAER 75
           EA +K K++ ++K +++++++      ++R +EREAERA +
Sbjct: 583 EAVEKAKREAEQKAREEREREKEKEKEREREREREAERAAK 623



 Score = 30.8 bits (69), Expect = 1.6
 Identities = 18/50 (36%), Positives = 32/50 (64%), Gaps = 2/50 (4%)

Query: 31  GSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMA 80
            SK+ A+K+++  +K K++ ++K +      K++ +ERE ER ERE E A
Sbjct: 574 SSKL-AKKREEAVEKAKREAEQKAREEREREKEKEKERERER-EREAERA 621


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 32.3 bits (74), Expect = 0.57
 Identities = 35/193 (18%), Positives = 77/193 (39%), Gaps = 22/193 (11%)

Query: 12  SEEDDNQRLVYARMQVQGTGSKVEARKKKKKK-KKKKKKKKKKKKNLCANVKKRRQEREA 70
           S  +  + L  A  ++    ++++    + ++ +++ ++++K++  L     + ++E E 
Sbjct: 309 SIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELED 368

Query: 71  ERAEREEEMALTQRGKEA-----AQFEEWQRQEDQFHLEQARL-----RSTIRIQEGRAK 120
            RAE EE        ++       + E+ +R+ ++   E  RL     R +  + +  A 
Sbjct: 369 LRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAA 428

Query: 121 PIDLLAKYIGSEAEVDAVEMHEPYTYLTGLAIKDLEDLLEDIKVYMELEKGENEAYWNDI 180
              + AK    E E +             L IK  E  LE +   +   + E      + 
Sbjct: 429 IAGIEAKINELEEEKEDK----------ALEIKKQEWKLEQLAADLSKYEQELYDLKEEY 478

Query: 181 SIIVEDELHGLRK 193
              VE EL  L++
Sbjct: 479 D-RVEKELSKLQR 490



 Score = 29.7 bits (67), Expect = 4.1
 Identities = 16/99 (16%), Positives = 41/99 (41%), Gaps = 5/99 (5%)

Query: 41  KKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFEEWQRQEDQ 100
             KK++ +++ ++ +    +++ R  + + ER E E ++   +R  E     E Q ++ +
Sbjct: 860 NGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEEL---EAQIEKKR 916

Query: 101 FHLEQARLRSTIRIQEGRAKPIDLLAKYIGSEAEVDAVE 139
             L +  L++ +   E     I+          E +   
Sbjct: 917 KRLSE--LKAKLEALEEELSEIEDPKGEDEEIPEEELSL 953


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
           entry is characterized by proteins with alternating
           conserved and low-complexity regions. Bud13 together
           with Snu17p and a newly identified factor,
           Pml1p/Ylr016c, form a novel trimeric complex. called The
           RES complex, pre-mRNA retention and splicing complex.
           Subunits of this complex are not essential for viability
           of yeasts but they are required for efficient splicing
           in vitro and in vivo. Furthermore, inactivation of this
           complex causes pre-mRNA leakage from the nucleus. Bud13
           contains a unique, phylogenetically conserved C-terminal
           region of unknown function.
          Length = 141

 Score = 30.7 bits (70), Expect = 0.58
 Identities = 13/71 (18%), Positives = 41/71 (57%), Gaps = 2/71 (2%)

Query: 32  SKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQERE-AERAEREEEMALTQRGKEAAQ 90
              E R++K+++K++K++K++K+K     + ++ +  +  E  E+ +   L  R  +   
Sbjct: 13  DIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPL-ARYADDED 71

Query: 91  FEEWQRQEDQF 101
           ++E  ++++++
Sbjct: 72  YDEELKEQERW 82


>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
          Length = 746

 Score = 32.1 bits (74), Expect = 0.58
 Identities = 19/104 (18%), Positives = 40/104 (38%)

Query: 18  QRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREE 77
           ++L    ++V+   S VE  + +K++ K++ +++ K +   A   +  +E +AE A    
Sbjct: 22  EKLKELGIEVKSHSSTVEEEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAP 81

Query: 78  EMALTQRGKEAAQFEEWQRQEDQFHLEQARLRSTIRIQEGRAKP 121
                +    A         E     E A  R   +    + K 
Sbjct: 82  AEEAAEAAAAAEAAARPAEDEAARPAEAAARRPKAKKAAKKKKG 125


>gnl|CDD|203848 pfam08079, Ribosomal_L30_N, Ribosomal L30 N-terminal domain.
          This presumed domain is found at the N-terminus of
          Ribosomal L30 proteins and has been termed RL30NT or
          NUC018.
          Length = 71

 Score = 29.1 bits (66), Expect = 0.64
 Identities = 17/45 (37%), Positives = 26/45 (57%), Gaps = 5/45 (11%)

Query: 35 EARKKKKKKKKKKKKKKKKKKNLCANVKKRRQ--EREAERAEREE 77
          + R K+ KK+  KK  +KKK+ L   + KR +   +E   AER+E
Sbjct: 11 KLRAKRAKKRAAKKAARKKKRKL---IFKRAEKYVKEYRAAERDE 52



 Score = 27.6 bits (62), Expect = 2.4
 Identities = 17/46 (36%), Positives = 25/46 (54%), Gaps = 2/46 (4%)

Query: 33 KVEARKKKKKKKKKKKKKKKK-KKNLCANVKK-RRQEREAERAERE 76
          + +  KK+  KK  +KKK+K   K     VK+ R  ER+  R +RE
Sbjct: 13 RAKRAKKRAAKKAARKKKRKLIFKRAEKYVKEYRAAERDEIRLKRE 58


>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase;
          Provisional.
          Length = 330

 Score = 31.7 bits (73), Expect = 0.64
 Identities = 12/43 (27%), Positives = 23/43 (53%)

Query: 22 YARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKR 64
          Y  M+ +   + ++ ++ K   K +KKK+K++ K   A  K R
Sbjct: 49 YKEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKSKPR 91



 Score = 29.4 bits (67), Expect = 3.4
 Identities = 9/20 (45%), Positives = 13/20 (65%)

Query: 38 KKKKKKKKKKKKKKKKKKNL 57
          +KKK+K++ K  K K K  L
Sbjct: 73 QKKKEKQEAKAAKAKSKPRL 92


>gnl|CDD|185616 PTZ00436, PTZ00436, 60S ribosomal protein L19-like protein;
           Provisional.
          Length = 357

 Score = 31.8 bits (71), Expect = 0.64
 Identities = 21/84 (25%), Positives = 43/84 (51%), Gaps = 1/84 (1%)

Query: 11  YSEEDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREA 70
           Y EE    R +Y  + V+  G+    ++   +   K K +KKK++ L   +  +R + E 
Sbjct: 109 YREEKKIDRHIYRELYVKAKGNVFRNKRNLMEHIHKVKNEKKKERQLAEQLAAKRLKDEQ 168

Query: 71  ERAE-REEEMALTQRGKEAAQFEE 93
            R + R++E+   ++ +E A+ E+
Sbjct: 169 HRHKARKQELRKREKDRERARRED 192


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 32.0 bits (73), Expect = 0.81
 Identities = 35/190 (18%), Positives = 73/190 (38%), Gaps = 12/190 (6%)

Query: 35  EARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFEEW 94
              ++ + K  +  ++  + +     +K+  +  EAE  E E E+   +   E  + E+ 
Sbjct: 323 AQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELE-EQL 381

Query: 95  QRQEDQFH---LEQARLRSTIRIQEGRAKPID-----LLAKYIGSEAEVDAVEMHEPYTY 146
           +    +     L+ A L + I   E R + ++     L  +      +++  E+ E    
Sbjct: 382 ETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQAE 441

Query: 147 LTGL--AIKDLEDLLEDIKVYMELEKGENEAYWNDISIIVEDELHGLRKLEKQGHSDLTR 204
           L  L   +++L++ LE ++  +E  + E E     +    E EL  L+            
Sbjct: 442 LEELEEELEELQEELERLEEALEELREELEEAEQALD-AAERELAQLQARLDSLERLQEN 500

Query: 205 REGIHESVAQ 214
            EG  E V  
Sbjct: 501 LEGFSEGVKA 510



 Score = 31.6 bits (72), Expect = 1.0
 Identities = 53/271 (19%), Positives = 109/271 (40%), Gaps = 31/271 (11%)

Query: 2   SRLEPGTLVYSEEDDNQRLVYAR-MQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCAN 60
            +L PG  + + + D   LV    +   G+     +  +++++ ++ ++K ++ +   A 
Sbjct: 639 KKLRPGYRIVTLDGD---LVRPGGVITGGSAKTNSSILERRREIEELEEKIEELEEKIAE 695

Query: 61  VKKRRQEREAERAEREEEMALTQRGKEAAQFEEWQRQEDQFHLEQARLRSTIRIQEGRAK 120
           ++K   E   E  E EEE+   ++        E +    Q    +  L       E   +
Sbjct: 696 LEKALAELRKELEELEEELEQLRK--------ELEELSRQISALRKDLARLEAEVEQLEE 747

Query: 121 PIDLLAKYIGSEAEVDAVEMHEPYTYLTGL---AIKDLEDLLEDIKVYMEL------EKG 171
            I  L+K + +E E +  E+ E           A  ++E+L   I+   E          
Sbjct: 748 RIAQLSKEL-TELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALD 806

Query: 172 ENEAYWNDISIIVEDELHGLRKLEKQGHS---DLTRREGIHESVAQDVASVFKGKTTAQL 228
           E  A    ++    +    L  LE++  +    L   E   E +++D+ S+      A++
Sbjct: 807 ELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESL-----AAEI 861

Query: 229 EALQTQIELKISSKTENIDNVITAEEKALQR 259
           E L+  IE ++ S+ E + N   + E+AL  
Sbjct: 862 EELEELIE-ELESELEALLNERASLEEALAL 891


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 30.8 bits (70), Expect = 0.82
 Identities = 19/87 (21%), Positives = 40/87 (45%), Gaps = 1/87 (1%)

Query: 33  KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFE 92
           + E ++K K KKKK KKKK K K+   + K  + E++ +  E E+++    +        
Sbjct: 77  EYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKK-DEKEAEDKLEDLTKSYSETLST 135

Query: 93  EWQRQEDQFHLEQARLRSTIRIQEGRA 119
             + +  ++ L +   +S +  +    
Sbjct: 136 LSELKPRKYALHKDIYQSRLDRKRRAE 162


>gnl|CDD|222592 pfam14204, Ribosomal_L18_c, Ribosomal L18 C-terminal region.
          This domain is the C-terminal end of ribosomal L18/L5
          proteins.
          Length = 93

 Score = 29.4 bits (67), Expect = 0.82
 Identities = 12/30 (40%), Positives = 18/30 (60%)

Query: 36 ARKKKKKKKKKKKKKKKKKKNLCANVKKRR 65
          +RKKK+KK+ K + K+   K L    +K R
Sbjct: 60 SRKKKEKKEVKAESKRYNAKKLTLEQRKAR 89


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 31.4 bits (72), Expect = 0.83
 Identities = 13/43 (30%), Positives = 24/43 (55%)

Query: 37  RKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEM 79
           RKK +KK++K +KK +K++   A  KK+ +    +    + E 
Sbjct: 411 RKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGET 453



 Score = 30.7 bits (70), Expect = 1.4
 Identities = 13/41 (31%), Positives = 24/41 (58%), Gaps = 4/41 (9%)

Query: 35  EARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAER 75
            A +KK +KK++K +KK +K       +K   +++AE A +
Sbjct: 408 PAERKKLRKKQRKAEKKAEK----EEAEKAAAKKKAEAAAK 444



 Score = 29.9 bits (68), Expect = 3.0
 Identities = 13/63 (20%), Positives = 24/63 (38%), Gaps = 12/63 (19%)

Query: 38  KKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFEEWQRQ 97
             ++KK +KK++K +KK            E+E       ++ A     K      E  ++
Sbjct: 408 PAERKKLRKKQRKAEKKA-----------EKEEAEKAAAKKKAEAAAKKAKGPDGE-TKK 455

Query: 98  EDQ 100
            D 
Sbjct: 456 VDP 458



 Score = 29.5 bits (67), Expect = 3.3
 Identities = 11/42 (26%), Positives = 19/42 (45%)

Query: 36  ARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREE 77
            RK +KK +K++ +K   KK   A  KK +      +    +
Sbjct: 418 QRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDPD 459


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 31.8 bits (72), Expect = 0.91
 Identities = 17/65 (26%), Positives = 30/65 (46%)

Query: 35 EARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFEEW 94
          E  +KKKK++K K+K+ KK K      K + Q ++A       + +  +  K   + E  
Sbjct: 17 ELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKKSRKRDVEDENP 76

Query: 95 QRQED 99
          +   D
Sbjct: 77 EDFID 81


>gnl|CDD|227515 COG5188, PRP9, Splicing factor 3a, subunit 3 [RNA processing and
           modification].
          Length = 470

 Score = 31.5 bits (71), Expect = 0.92
 Identities = 29/186 (15%), Positives = 69/186 (37%), Gaps = 5/186 (2%)

Query: 1   MSRLEPGTLVYSEEDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCAN 60
           M+ LE    +  E +  +  +  R+Q      + E R +++ +  +  ++   +  L  +
Sbjct: 1   MNLLETRRSLLEEMEIIENAIAERIQRNPKLPRDELRLERQIRIFENMERISNQIWLVEH 60

Query: 61  -VKKRRQEREAERAEREEEMALTQRGKEAAQFEEWQRQEDQFHLEQARLRSTIRIQEGRA 119
                 +++  +R  R  +  L    K+        + + +  ++  R          +A
Sbjct: 61  ERPTGLKDQMMKRINRSIDRDLYGFYKKLGALNVEGKLDGEIEVKGLRDLGYYESSAPKA 120

Query: 120 KPIDLLAKYIGSEAEVDAVEMHEPYTYLTGLAIKDLEDLLEDIKVYMEL----EKGENEA 175
           + +D+ A + GSE   D +E+   +     + +      LE +K         E  +N  
Sbjct: 121 RDLDVEAAFKGSELYGDGMELERIFRKYANVHLCSDCSYLEFLKKLERFDLTTEPSKNFR 180

Query: 176 YWNDIS 181
           Y   +S
Sbjct: 181 YLEYLS 186


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
           Mitofilin controls mitochondrial cristae morphology.
           Mitofilin is enriched in the narrow space between the
           inner boundary and the outer membranes, where it forms a
           homotypic interaction and assembles into a large
           multimeric protein complex. The first 78 amino acids
           contain a typical amino-terminal-cleavable mitochondrial
           presequence rich in positive-charged and hydroxylated
           residues and a membrane anchor domain. In addition, it
           has three centrally located coiled coil domains.
          Length = 493

 Score = 31.2 bits (71), Expect = 1.0
 Identities = 14/99 (14%), Positives = 45/99 (45%), Gaps = 3/99 (3%)

Query: 14  EDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERA 73
           +D    +  A+ ++     K+   K +++++ ++  K+K+++ L    ++     E++ A
Sbjct: 159 DDLESLIASAKEELDQLSKKLAELKAEEEEELERALKEKREELLSKLEEELLARLESKEA 218

Query: 74  EREEEMALTQRGKEAAQFEEWQRQEDQFHLEQARLRSTI 112
             E+++ L    ++    +++   E++   E  R     
Sbjct: 219 ALEKQLRLEFEREKEELRKKY---EEKLRQELERQAEAH 254


>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6.  The surfeit locus
          protein SURF-6 is shown to be a component of the
          nucleolar matrix and has a strong binding capacity for
          nucleic acids.
          Length = 206

 Score = 30.7 bits (70), Expect = 1.0
 Identities = 16/45 (35%), Positives = 30/45 (66%), Gaps = 4/45 (8%)

Query: 34 VEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEE 78
          +E R++K++++K +KK+K+K+    A  K+  Q+ EAE  + EE 
Sbjct: 8  LEQRRRKREQRKARKKQKRKE----AKKKEDAQKSEAEEVKNEEN 48



 Score = 29.2 bits (66), Expect = 2.6
 Identities = 16/38 (42%), Positives = 26/38 (68%)

Query: 37  RKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAE 74
           +K  K+K+K+KKK KK+ K     V+K++ ER+ +R E
Sbjct: 145 KKALKRKEKQKKKSKKEWKERKEKVEKKKAERQKKREE 182



 Score = 28.0 bits (63), Expect = 7.6
 Identities = 12/19 (63%), Positives = 13/19 (68%)

Query: 36  ARKKKKKKKKKKKKKKKKK 54
            RK  KK KKKKK KKK +
Sbjct: 186 KRKDDKKNKKKKKAKKKGR 204


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 31.1 bits (70), Expect = 1.0
 Identities = 33/156 (21%), Positives = 59/156 (37%), Gaps = 13/156 (8%)

Query: 13  EEDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKK----KKKKKKNLCANVKKR--RQ 66
            + D  +      Q      + E R+K+++ K   K       K+ K +  N K+   + 
Sbjct: 227 IDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKA 286

Query: 67  EREAERAEREEEMA-------LTQRGKEAAQFEEWQRQEDQFHLEQARLRSTIRIQEGRA 119
           + E ++ + E   A       L Q  K + +  E +  E Q   E           +  A
Sbjct: 287 QIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKREPVAEDLQKTKPQVEA 346

Query: 120 KPIDLLAKYIGSEAEVDAVEMHEPYTYLTGLAIKDL 155
           +P  L    I S   V  +++ +P T L+ L + DL
Sbjct: 347 QPTSLNEDAIDSSNPVYGLKVVDPITNLSELVLIDL 382



 Score = 29.2 bits (65), Expect = 5.1
 Identities = 18/67 (26%), Positives = 31/67 (46%), Gaps = 4/67 (5%)

Query: 33  KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAERE-EEMALTQRGKEAAQF 91
           K E  KK+    K ++K    + N     K+R + R+ ++  +   + A T   KE  Q 
Sbjct: 219 KEELDKKQIDADKAQQKADFAQDNA---DKQRDEVRQKQQEAKNLPKPADTSSPKEDKQV 275

Query: 92  EEWQRQE 98
            E Q++E
Sbjct: 276 AENQKRE 282


>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
           complex component).  RNases H are enzymes that
           specifically hydrolyse RNA when annealed to a
           complementary DNA and are present in all living
           organisms. In yeast RNase H2 is composed of a complex of
           three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
           family represents the homologues of Ydr279p. It is not
           known whether non yeast proteins in this family fulfil
           the same function.
          Length = 287

 Score = 30.8 bits (70), Expect = 1.1
 Identities = 12/37 (32%), Positives = 14/37 (37%)

Query: 20  LVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKN 56
            V A         K +   KKKK K+ K  K  KK  
Sbjct: 240 DVEAAESRAEKKRKSKEEIKKKKPKESKGVKALKKVV 276



 Score = 30.0 bits (68), Expect = 2.0
 Identities = 13/27 (48%), Positives = 16/27 (59%)

Query: 37  RKKKKKKKKKKKKKKKKKKNLCANVKK 63
           RK K++ KKKK K+ K  K L   V K
Sbjct: 252 RKSKEEIKKKKPKESKGVKALKKVVAK 278


>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3.  This family
           of proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). Shr3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of Shr3, AAPs are retained in the ER.
          Length = 196

 Score = 30.4 bits (69), Expect = 1.1
 Identities = 12/46 (26%), Positives = 19/46 (41%), Gaps = 1/46 (2%)

Query: 7   GTLVYSEEDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKK 52
             L Y+E  D ++        +   +   A KK    +K + KKKK
Sbjct: 152 AGLWYAERKDAKQKE-EFAAEERKEALAAAAKKSATPQKVETKKKK 196


>gnl|CDD|220818 pfam10595, UPF0564, Uncharacterized protein family UPF0564.  This
           family of proteins has no known function. However, one
           of the members is annotated as an EF-hand family
           protein.
          Length = 349

 Score = 30.9 bits (70), Expect = 1.1
 Identities = 16/64 (25%), Positives = 35/64 (54%), Gaps = 3/64 (4%)

Query: 37  RKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAE---RAEREEEMALTQRGKEAAQFEE 93
           ++ K++++KK+ +++KKKK   A   K+R E         E  +E     R +E  + +E
Sbjct: 285 KEAKEQQEKKELEQRKKKKKEMAPKVKQRFEANDPAQKLQEERKEQLAKLRKEEKEREKE 344

Query: 94  WQRQ 97
           ++++
Sbjct: 345 YEQE 348


>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992).  This
           bacterial family of proteins has no known function.
           However, the cis-regulatory yjdF motif, just upstream
           from the gene encoding the proteins for this family, is
           a small non-coding RNA, Rfam:RF01764. The yjdF motif is
           found in many Firmicutes, including Bacillus subtilis.
           In most cases, it resides in potential 5' UTRs of
           homologues of the yjdF gene whose function is unknown.
           However, in Streptococcus thermophilus, a yjdF RNA motif
           is associated with an operon whose protein products
           synthesise nicotinamide adenine dinucleotide (NAD+).
           Also, the S. thermophilus yjdF RNA lacks typical yjdF
           motif consensus features downstream of and including the
           P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
           S. thermophilus RNAs might sense a distinct compound
           that structurally resembles the ligand bound by other
           yjdF RNAs. On the ohter hand, perhaps these RNAs have an
           alternative solution forming a similar binding site, as
           is observed with some SAM riboswitches.
          Length = 132

 Score = 29.9 bits (68), Expect = 1.1
 Identities = 14/59 (23%), Positives = 30/59 (50%), Gaps = 8/59 (13%)

Query: 18  QRLVYARMQVQGTGSKV--------EARKKKKKKKKKKKKKKKKKKNLCANVKKRRQER 68
           QR     ++  G  +K         E  K++KKK+ K+KK+++K++      +K++ + 
Sbjct: 71  QRQAAKEVKKPGISTKAQQALKLEHERNKQEKKKRSKEKKEEEKERKRQLKQQKKKAKH 129


>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
          Length = 1094

 Score = 31.4 bits (71), Expect = 1.2
 Identities = 20/61 (32%), Positives = 36/61 (59%), Gaps = 5/61 (8%)

Query: 36  ARKKKKKKKKKKKKKKKKK---KNLCA--NVKKRRQEREAERAEREEEMALTQRGKEAAQ 90
           +RKKKKKKK K+KKKKK +   K + A  N+++  +    E++    ++ L ++  ++  
Sbjct: 683 SRKKKKKKKSKEKKKKKNREASKQIHALENIEEGIEGSNNEKSSETPQLKLEKKFGQSPV 742

Query: 91  F 91
           F
Sbjct: 743 F 743



 Score = 29.5 bits (66), Expect = 3.8
 Identities = 20/56 (35%), Positives = 28/56 (50%), Gaps = 5/56 (8%)

Query: 37  RKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFE 92
             +KKKKKKK K+KKKKK     N +  +Q    E  E   E +  ++  E  Q +
Sbjct: 682 GSRKKKKKKKSKEKKKKK-----NREASKQIHALENIEEGIEGSNNEKSSETPQLK 732


>gnl|CDD|237700 PRK14397, PRK14397, membrane protein; Provisional.
          Length = 222

 Score = 30.6 bits (69), Expect = 1.2
 Identities = 11/41 (26%), Positives = 16/41 (39%)

Query: 241 SKTENIDNVITAEEKALQREARKGMGGDEAEFSVESLLDPQ 281
           S  ENI  +   EEK  Q++      G  A  +  +  D  
Sbjct: 175 SHRENIGRLARGEEKPWQKKHHDAAQGTAAGAAPTANADAA 215


>gnl|CDD|217502 pfam03343, SART-1, SART-1 family.  SART-1 is a protein involved in
           cell cycle arrest and pre-mRNA splicing. It has been
           shown to be a component of U4/U6 x U5 tri-snRNP complex
           in human, Schizosaccharomyces pombe and Saccharomyces
           cerevisiae. SART-1 is a known tumour antigen in a range
           of cancers recognised by T cells.
          Length = 603

 Score = 31.3 bits (71), Expect = 1.2
 Identities = 14/24 (58%), Positives = 16/24 (66%)

Query: 32  SKVEARKKKKKKKKKKKKKKKKKK 55
             V    K KK KKKKKKKKK++K
Sbjct: 267 YDVSEMVKFKKPKKKKKKKKKRRK 290


>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
           with the myosin-like domain [Function unknown].
          Length = 499

 Score = 31.1 bits (70), Expect = 1.2
 Identities = 21/107 (19%), Positives = 45/107 (42%), Gaps = 9/107 (8%)

Query: 31  GSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQE--REAERAEREEEMALTQRGKEA 88
            +  E R++ + + +  +  +K+ +     +K +  +    + + E+E +   T+    A
Sbjct: 154 KTLAEQRRQLEAQAQSLQASQKQLQASATQLKSQVLDLKLRSAQIEQEAQNLATRA--NA 211

Query: 89  AQFEEWQRQEDQFHLEQARLRSTIRIQEGRAKPIDLLAKYIGSEAEV 135
           AQ     R E+      A  ++   IQ+ R   I   A+ I + AE 
Sbjct: 212 AQ----ARTEELARRAAAAQQTAQAIQQ-RDAQISQKAQQIAARAEQ 253



 Score = 28.8 bits (64), Expect = 6.6
 Identities = 32/199 (16%), Positives = 72/199 (36%), Gaps = 28/199 (14%)

Query: 61  VKKRRQEREAERAEREEEMALTQRGKEAAQFEEWQRQEDQFHLEQARLRSTIRIQEGRAK 120
            +  ++  E ER     E+   ++ +EA + E    +++    +Q   R T + Q+ + +
Sbjct: 93  AQGEKRAAETEREAARSELQKARQEREAVRQELAAARQNLAKAQQELARLTKQAQDLQTR 152

Query: 121 PIDLLAKYIGSEAEVDAVEMHEPYTYLTGLAIKDLEDLLEDIKVYMELEKGENEAYWNDI 180
              L  +    EA+  +++  +     +   +K      + + + +   + E E      
Sbjct: 153 LKTLAEQRRQLEAQAQSLQASQKQLQASATQLK-----SQVLDLKLRSAQIEQE------ 201

Query: 181 SIIVEDELHGLRKLEKQGHSDLTRREGIHESVAQDVASVFKGKTTAQLEALQTQIELKIS 240
                  L       +    +L RR    +  AQ +          Q +A  +Q   +I+
Sbjct: 202 ----AQNLATRANAAQARTEELARRAAAAQQTAQAI---------QQRDAQISQKAQQIA 248

Query: 241 SKTENIDNVITAEEKALQR 259
           ++       I   E+ LQR
Sbjct: 249 AR----AEQIRERERQLQR 263


>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
           component YidC; Validated.
          Length = 429

 Score = 31.0 bits (70), Expect = 1.2
 Identities = 15/64 (23%), Positives = 25/64 (39%), Gaps = 9/64 (14%)

Query: 23  ARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALT 82
             +  +    K     +K + K +KK+  +         K+R  ERE  R  R+E  A  
Sbjct: 323 PELHAENAEIKKTRTAEKNEAKARKKEIAQ---------KRRAAEREINREARQERAAAM 373

Query: 83  QRGK 86
            R +
Sbjct: 374 ARAR 377



 Score = 29.4 bits (66), Expect = 3.8
 Identities = 20/114 (17%), Positives = 43/114 (37%), Gaps = 11/114 (9%)

Query: 37  RKKKKKKKKKKKKKKKKKKNLCANV-------KKRRQEREAERAEREEEMALTQRGKEAA 89
            + ++K+K+KK      ++N    +       +   +  E ++    E+     R KE A
Sbjct: 292 AQYREKQKEKKAFLWTLRRNRLRMIITPWRAPELHAENAEIKKTRTAEKNEAKARKKEIA 351

Query: 90  Q-FEEWQRQEDQFHLE---QARLRSTIRIQEGRAKPIDLLAKYIGSEAEVDAVE 139
           Q     +R+ ++   +    A  R+  R    +AK   L+      +   +  E
Sbjct: 352 QKRRAAEREINREARQERAAAMARARARRAAVKAKKKGLIDASPNEDTPSENEE 405


>gnl|CDD|149515 pfam08492, SRP72, SRP72 RNA-binding domain.  This region has been
          identified as the binding site of the SRP72 protein to
          SRP RNA.
          Length = 57

 Score = 28.2 bits (63), Expect = 1.2
 Identities = 12/32 (37%), Positives = 17/32 (53%)

Query: 26 QVQGTGSKVEARKKKKKKKKKKKKKKKKKKNL 57
          + +    K    ++KKKK  KKK+K K  KN 
Sbjct: 4  KPKKIAKKKPGTEQKKKKSAKKKRKPKLPKNF 35


>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
           homolog [Translation, ribosomal structure and
           biogenesis].
          Length = 395

 Score = 30.8 bits (70), Expect = 1.3
 Identities = 14/67 (20%), Positives = 30/67 (44%), Gaps = 8/67 (11%)

Query: 11  YSEEDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREA 70
           +S E D   L   R +++    K++ +  K   K K ++ KK++       + RR+++E 
Sbjct: 333 FSGEPDGISL---REELEKRIEKLKEKPPKPPTKAKPERDKKERPG-----RYRRKKKEK 384

Query: 71  ERAEREE 77
           +      
Sbjct: 385 KAKSERR 391


>gnl|CDD|204792 pfam11947, DUF3464, Protein of unknown function (DUF3464).  This
          family of proteins are functionally uncharacterized.
          This protein is found in bacteria and eukaryotes.
          Proteins in this family are typically between 137 to
          196 amino acids in length.
          Length = 149

 Score = 30.0 bits (68), Expect = 1.3
 Identities = 9/47 (19%), Positives = 21/47 (44%)

Query: 35 EARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMAL 81
           +RK+   + K+ +KK  K     A  K + ++++ +      E  +
Sbjct: 3  SSRKRLPFEPKRSRKKGVKALRKAAVAKSKDKQKKPKSKRAASESGI 49


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. This family includes the
           PrrC protein that is thought to be the active component
           of the anticodon nuclease.
          Length = 713

 Score = 31.1 bits (71), Expect = 1.3
 Identities = 11/45 (24%), Positives = 27/45 (60%)

Query: 34  VEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEE 78
           +E   + ++ KK+ KK ++K + L A ++K+ +E E  + +  ++
Sbjct: 91  IEIEAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDK 135


>gnl|CDD|176075 cd08693, C2_PI3K_class_I_beta_delta, C2 domain present in class I
           beta and delta phosphatidylinositol 3-kinases (PI3Ks).
           PI3Ks (AKA phosphatidylinositol (PtdIns) 3-kinases)
           regulate cell processes such as cell growth,
           differentiation, proliferation, and motility.  PI3Ks
           work on phosphorylation of phosphatidylinositol,
           phosphatidylinositide (4)P (PtdIns (4)P),2 or
           PtdIns(4,5)P2. Specifically they phosphorylate the D3
           hydroxyl group of phosphoinositol lipids on the inositol
           ring. There are 3 classes of PI3Ks based on structure,
           regulation, and specificity. All classes contain a C2
           domain, a PIK domain, and a kinase catalytic domain.
           The members here are class I, beta and delta isoforms of
           PI3Ks and contain both a Ras-binding domain and a
           p85-binding domain.  Class II PI3Ks contain both of
           these as well as a PX domain, and a C-terminal C2 domain
           containing a nuclear localization signal.  C2 domains
           fold into an 8-standed beta-sandwich that can adopt 2
           structural arrangements: Type I and Type II,
           distinguished by a circular permutation involving their
           N- and C-terminal beta strands. Many C2 domains are
           Ca2+-dependent membrane-targeting modules that bind a
           wide variety of substances including bind phospholipids,
           inositol polyphosphates, and intracellular proteins.
           Most C2 domain proteins are either signal transduction
           enzymes that contain a single C2 domain, such as protein
           kinase C, or membrane trafficking proteins which contain
           at least two C2 domains, such as synaptotagmin 1.
           However, there are a few exceptions to this including
           RIM isoforms and some splice variants of piccolo/aczonin
           and intersectin which only have a single C2 domain.  C2
           domains with a calcium binding region have negatively
           charged residues, primarily aspartates, that serve as
           ligands for calcium ions.  Members have a type-I
           topology.
          Length = 173

 Score = 30.0 bits (68), Expect = 1.4
 Identities = 10/22 (45%), Positives = 15/22 (68%)

Query: 37  RKKKKKKKKKKKKKKKKKKNLC 58
           +K K K+ +K + KKKKKK+  
Sbjct: 87  KKAKGKRSRKNQTKKKKKKDDN 108



 Score = 27.7 bits (62), Expect = 8.0
 Identities = 11/21 (52%), Positives = 14/21 (66%)

Query: 35  EARKKKKKKKKKKKKKKKKKK 55
           E  KK K K+ +K + KKKKK
Sbjct: 84  EVSKKAKGKRSRKNQTKKKKK 104


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
           family is found in eukaryotes; it has several conserved
           tryptophan residues. The function is not known.
          Length = 261

 Score = 30.5 bits (69), Expect = 1.4
 Identities = 22/73 (30%), Positives = 37/73 (50%), Gaps = 8/73 (10%)

Query: 45  KKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFEEWQRQEDQFHLE 104
              K+ +++KK      +K++QERE ER E E    L +      ++EEW RQ+ Q   +
Sbjct: 93  LSAKQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKE-----KYEEWCRQKAQ---Q 144

Query: 105 QARLRSTIRIQEG 117
            A+ R+    +E 
Sbjct: 145 AAKQRTPKHKKEA 157


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 30.4 bits (69), Expect = 1.5
 Identities = 18/78 (23%), Positives = 39/78 (50%), Gaps = 7/78 (8%)

Query: 23  ARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALT 82
           +R            R K+++ ++KK + K +K+      K++++E+E  + ER  E   T
Sbjct: 142 SRSSTVQNKEATHERLKEREIRRKKIQAKARKR------KEKKKEKELTQEERLAEAKET 195

Query: 83  QRGKEAAQFEEWQRQEDQ 100
           +R    +  E ++ QE++
Sbjct: 196 ERINLKS-LERYEEQEEE 212


>gnl|CDD|225651 COG3109, ProQ, Activator of osmoprotectant transporter ProP [Signal
           transduction mechanisms].
          Length = 208

 Score = 30.2 bits (68), Expect = 1.6
 Identities = 10/76 (13%), Positives = 34/76 (44%), Gaps = 4/76 (5%)

Query: 16  DNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKK----KKKKKNLCANVKKRRQEREAE 71
           + + + +AR Q+    ++V+A++ +++ KK+++      K   +      + + +    E
Sbjct: 92  EEEHVEHARKQLAEAKARVQAQRAEQQAKKREEAPAAGEKPTAERPATAARPKPKAPREE 151

Query: 72  RAEREEEMALTQRGKE 87
                 +++    G+ 
Sbjct: 152 VHTPVSDISALTVGQA 167


>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
          Length = 859

 Score = 30.5 bits (70), Expect = 1.7
 Identities = 6/26 (23%), Positives = 11/26 (42%)

Query: 30  TGSKVEARKKKKKKKKKKKKKKKKKK 55
            G K   +  +K K++    +K  K 
Sbjct: 833 AGKKAAVKFSRKTKQQYVASEKDGKA 858



 Score = 29.0 bits (66), Expect = 6.0
 Identities = 7/21 (33%), Positives = 9/21 (42%)

Query: 35  EARKKKKKKKKKKKKKKKKKK 55
           +A  K    KK   K  +K K
Sbjct: 826 DAPAKDPAGKKAAVKFSRKTK 846


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 30.2 bits (68), Expect = 1.7
 Identities = 15/71 (21%), Positives = 30/71 (42%), Gaps = 6/71 (8%)

Query: 35  EARKKKKKKKKKKKKKKKKKKNLCANVKKRRQER------EAERAEREEEMALTQRGKEA 88
           EA+ K+  + K K + + +KK      K+  +E       EA++   E +       K  
Sbjct: 123 EAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAK 182

Query: 89  AQFEEWQRQED 99
           A+ +   + E+
Sbjct: 183 AEAKAKAKAEE 193



 Score = 30.2 bits (68), Expect = 2.3
 Identities = 20/71 (28%), Positives = 27/71 (38%), Gaps = 1/71 (1%)

Query: 23  ARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALT 82
           A+ +      K EA  K K + K K K ++ K    A   K   E  A +AE E   A  
Sbjct: 164 AKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAKAKAAAE-AAAKAEAEAAAAAA 222

Query: 83  QRGKEAAQFEE 93
              +  A   E
Sbjct: 223 AEAERKADEAE 233



 Score = 29.0 bits (65), Expect = 4.0
 Identities = 10/71 (14%), Positives = 34/71 (47%)

Query: 35  EARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFEEW 94
           +  ++ ++ +K++  ++ ++K L       +  ++AE+A ++ E    Q  +  A+    
Sbjct: 72  KLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAE 131

Query: 95  QRQEDQFHLEQ 105
            + + +   E+
Sbjct: 132 AKAKAEAEAEK 142



 Score = 28.6 bits (64), Expect = 5.4
 Identities = 19/78 (24%), Positives = 33/78 (42%), Gaps = 4/78 (5%)

Query: 33  KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREA-ERAEREEEMALTQRGKEAAQF 91
           K EA+K+ +++ K K   + KKK   A   K++ E EA  +AE + +    +   +A   
Sbjct: 145 KEEAKKQAEEEAKAKAAAEAKKK---AAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAEAA 201

Query: 92  EEWQRQEDQFHLEQARLR 109
           +     E     E     
Sbjct: 202 KAKAAAEAAAKAEAEAAA 219



 Score = 28.6 bits (64), Expect = 5.6
 Identities = 12/65 (18%), Positives = 31/65 (47%)

Query: 33  KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFE 92
           ++E R   +K  K+ ++  K+ +      ++ + ++ AE   + E  A  +  +EA +  
Sbjct: 93  ELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQA 152

Query: 93  EWQRQ 97
           E + +
Sbjct: 153 EEEAK 157



 Score = 28.3 bits (63), Expect = 7.6
 Identities = 18/81 (22%), Positives = 24/81 (29%)

Query: 33  KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFE 92
           K EA+ K K ++ K K +  K K       K   E  A  A   E  A      +     
Sbjct: 182 KAEAKAKAKAEEAKAKAEAAKAKAAAEAAAKAEAEAAAAAAAEAERKADEAELGDIFGLA 241

Query: 93  EWQRQEDQFHLEQARLRSTIR 113
                E Q         S + 
Sbjct: 242 SGSNAEKQGGARGNAAGSEVD 262


>gnl|CDD|236311 PRK08609, PRK08609, hypothetical protein; Provisional.
          Length = 570

 Score = 30.7 bits (70), Expect = 1.7
 Identities = 26/87 (29%), Positives = 42/87 (48%), Gaps = 11/87 (12%)

Query: 153 KDLEDLLEDIKVYMELEKGEN----EAYWNDISIIVEDELHGLRKLEKQGHSDLTRREGI 208
           KD+  LLE I  YMEL KGEN     A+      +  DE    R L +    D T+ +GI
Sbjct: 4   KDVIKLLETIATYMEL-KGENPFKISAFRKAAQALELDE----RSLSEI--DDFTKLKGI 56

Query: 209 HESVAQDVASVFKGKTTAQLEALQTQI 235
            +  A+ +    +   ++ L+ L+ ++
Sbjct: 57  GKGTAEVIQEYRETGESSVLQELKKEV 83


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 30.6 bits (69), Expect = 1.8
 Identities = 14/45 (31%), Positives = 23/45 (51%)

Query: 34  VEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEE 78
           +E + KK K KKK KK KKK+    ++     +    + ++ EE 
Sbjct: 142 IETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEES 186


>gnl|CDD|215732 pfam00121, TIM, Triosephosphate isomerase. 
          Length = 242

 Score = 29.9 bits (68), Expect = 1.8
 Identities = 19/75 (25%), Positives = 31/75 (41%), Gaps = 20/75 (26%)

Query: 198 GHSDLTRREGIHES---VAQDVASVFK---------------GKTTAQLEALQTQIELKI 239
           GHS+  RR+   E+   VA+ V +  K                +    LE L+ Q++  +
Sbjct: 91  GHSE--RRQYFGETDEIVAKKVKAALKNGLTPVLCVGETLEEREAGKTLEVLKRQLKAIL 148

Query: 240 SSKTENIDNVITAEE 254
           +   E   N+I A E
Sbjct: 149 AGLDEEWKNLIIAYE 163


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This
           family consists of several bovine specific leukaemia
           virus receptors which are thought to function as
           transmembrane proteins, although their exact function is
           unknown.
          Length = 561

 Score = 30.4 bits (68), Expect = 1.8
 Identities = 14/50 (28%), Positives = 29/50 (58%), Gaps = 5/50 (10%)

Query: 35  EARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQR 84
           E R+ +++ +K K++KKK++K      K+ R+   +   E +E++A  Q 
Sbjct: 86  EERRHRQRLEKDKREKKKREKE-----KRGRRRHHSLGTESDEDIAPAQM 130



 Score = 28.5 bits (63), Expect = 7.0
 Identities = 18/69 (26%), Positives = 33/69 (47%)

Query: 10  VYSEEDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQERE 69
           V     D++       + + T    + +KKK++K+K++KKKKKK  +   +      E+ 
Sbjct: 254 VSGTAPDSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHRCHHSDGGAEQP 313

Query: 70  AERAEREEE 78
            +    EEE
Sbjct: 314 VQNGAVEEE 322


>gnl|CDD|220121 pfam09135, Alb1, Alb1.  Alb1 is a nuclear shuttling factor
          involved in ribosome biogenesis.
          Length = 112

 Score = 29.0 bits (65), Expect = 1.9
 Identities = 19/57 (33%), Positives = 30/57 (52%), Gaps = 8/57 (14%)

Query: 30 TGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGK 86
            SK E   KKK++KKK K+  +K        +++RQ++  ERAE   +    +R K
Sbjct: 39 AASKNEGVSKKKQRKKKPKRLTRK--------QRKRQKKGLERAEIVTDTLSKKRAK 87


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 30.5 bits (69), Expect = 2.0
 Identities = 39/231 (16%), Positives = 96/231 (41%), Gaps = 13/231 (5%)

Query: 35  EARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFEEW 94
           +  ++ +++ ++ +++ ++ K   A +++  ++ ++   E EEE+   +   E  Q    
Sbjct: 702 DLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLE 761

Query: 95  QRQEDQFHLEQARLRSTIRIQEGRAKPIDLLAKYIGSEAEVDAVEMHEPYTYLTGLAIKD 154
           + +E+   LE+A  +    I+E   K   L  +    E E++  E            +  
Sbjct: 762 ELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAE----------RRLDA 811

Query: 155 LEDLLEDIKVYMELEKGENEAYWNDISIIVEDELHGLRKLEKQGHSDLTRREGIHESVAQ 214
           LE  LE ++   E  + E E    +I   +E++L  L +  ++   +L   +   E +  
Sbjct: 812 LERELESLEQRRERLEQEIEELEEEI-EELEEKLDELEEELEELEKELEELKEELEELEA 870

Query: 215 DVASVFK--GKTTAQLEALQTQIELKISSKTENIDNVITAEEKALQREARK 263
           +   +     +   + E L+ ++    S   E  + +    E+  + EA+ 
Sbjct: 871 EKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKL 921



 Score = 29.7 bits (67), Expect = 3.8
 Identities = 19/128 (14%), Positives = 54/128 (42%), Gaps = 1/128 (0%)

Query: 13  EEDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAER 72
            E+  +RL   + +++    ++E R+   ++ ++   + ++ K           E   E 
Sbjct: 318 LEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEEL 377

Query: 73  -AEREEEMALTQRGKEAAQFEEWQRQEDQFHLEQARLRSTIRIQEGRAKPIDLLAKYIGS 131
                EE+A  +      + E  + + +   LE+   R + R+++ + +  +L A+    
Sbjct: 378 FEALREELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEEL 437

Query: 132 EAEVDAVE 139
           + E++ + 
Sbjct: 438 QTELEELN 445


>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62. 
          Length = 217

 Score = 29.8 bits (67), Expect = 2.0
 Identities = 13/23 (56%), Positives = 15/23 (65%)

Query: 29  GTGSKVEARKKKKKKKKKKKKKK 51
             G K  A+K KK KKKKKKK+ 
Sbjct: 195 YKGDKSSAKKDKKSKKKKKKKRS 217



 Score = 27.8 bits (62), Expect = 9.9
 Identities = 12/18 (66%), Positives = 12/18 (66%)

Query: 39  KKKKKKKKKKKKKKKKKN 56
           K   KK KK KKKKKKK 
Sbjct: 199 KSSAKKDKKSKKKKKKKR 216


>gnl|CDD|217611 pfam03552, Cellulose_synt, Cellulose synthase.  Cellulose, an
           aggregate of unbranched polymers of beta-1,4-linked
           glucose residues, is the major component of wood and
           thus paper, and is synthesised by plants, most algae,
           some bacteria and fungi, and even some animals. The
           genes that synthesise cellulose in higher plants differ
           greatly from the well-characterized genes found in
           Acetobacter and Agrobacterium sp. More correctly
           designated as 'cellulose synthase catalytic subunits',
           plant cellulose synthase (CesA) proteins are integral
           membrane proteins, approximately 1,000 amino acids in
           length. There are a number of highly conserved residues,
           including several motifs shown to be necessary for
           processive glycosyltransferase activity.
          Length = 716

 Score = 30.5 bits (69), Expect = 2.0
 Identities = 14/47 (29%), Positives = 20/47 (42%)

Query: 37  RKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQ 83
           R+KK K   K KKK  K+K   A +       E +  E+   M+   
Sbjct: 312 RRKKPKSASKAKKKDSKRKESEAPIFNLEDIDEGDEDEKSSLMSQLS 358


>gnl|CDD|180007 PRK05309, PRK05309, 30S ribosomal protein S11; Validated.
          Length = 128

 Score = 29.0 bits (66), Expect = 2.1
 Identities = 10/14 (71%), Positives = 10/14 (71%)

Query: 43 KKKKKKKKKKKKKN 56
           KKKK  KKK KKN
Sbjct: 1  AKKKKTVKKKVKKN 14


>gnl|CDD|220207 pfam09368, Sas10_Utp3_C, Sas10 C-terminal domain.  This family
          contains a C-terminal presumed domain in Sas10 which
          hash been identified as a regulator of chromatin
          silencing.
          Length = 76

 Score = 28.0 bits (63), Expect = 2.2
 Identities = 11/40 (27%), Positives = 19/40 (47%)

Query: 33 KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAER 72
          K + R  + KK+KK +K  K+ K+    V++       E 
Sbjct: 24 KKDRRNPRVKKRKKYEKAVKRLKSQRRVVREEDGGYGGET 63


>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
          Length = 1036

 Score = 30.2 bits (68), Expect = 2.2
 Identities = 17/88 (19%), Positives = 41/88 (46%), Gaps = 8/88 (9%)

Query: 7   GTLVYSEEDDNQRLVYARMQVQGTGSKVEAR----KKKKKKKKKKKKKKKKKKNLCANVK 62
           G  VY   D     V    +++G   +        ++K+++ +K  K++ +++      +
Sbjct: 221 GQNVYDNNDHKDFCV----EIEGGMDEHSFEDFLLEEKRRELEKLAKEEAERERQAEEQR 276

Query: 63  KRRQEREAERAEREEEMALTQRGKEAAQ 90
           +R +E+ A  A+R +  A  ++ +E  Q
Sbjct: 277 RREEEKAAMEADRAQAKAEVEKRREKLQ 304


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 30.4 bits (69), Expect = 2.3
 Identities = 43/224 (19%), Positives = 86/224 (38%), Gaps = 41/224 (18%)

Query: 13  EEDDNQRLVYARMQVQGTGSKVE-ARKKKKKKKKKKKKKKKKKKNL---CANVKKRRQER 68
           +E+   RL   R+  Q    + E  R+     +++ ++ +++   L       ++  ++R
Sbjct: 323 DEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEELREEAAELESELEEAREAVEDR 382

Query: 69  EAERAEREEEM---------ALTQRGKEAAQFEEWQRQEDQFHLEQARLRSTIR-IQEGR 118
             E  E EEE+         A    G      EE + + D+    +A L +T+R  +E  
Sbjct: 383 REEIEELEEEIEELRERFGDAPVDLGNAEDFLEELREERDELREREAELEATLRTARERV 442

Query: 119 AKPIDLLA--------KYIGSEAEVDAVEMHEPYTYLTGLAIKDLEDLLEDIKVYMELEK 170
            +   LL         + +     V+ +E            +++LE  LED++   E+E+
Sbjct: 443 EEAEALLEAGKCPECGQPVEGSPHVETIEEDRE-------RVEELEAELEDLEE--EVEE 493

Query: 171 GENEAYWNDISIIVEDELHGLRKLEKQGHSDLTRREGIHESVAQ 214
            E             +    L + E +      RRE + E +A+
Sbjct: 494 VEERL----------ERAEDLVEAEDRIERLEERREDLEELIAE 527


>gnl|CDD|216205 pfam00937, Corona_nucleoca, Coronavirus nucleocapsid protein. 
          Length = 346

 Score = 30.1 bits (68), Expect = 2.3
 Identities = 8/37 (21%), Positives = 14/37 (37%)

Query: 32  SKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQER 68
                ++K   KK  +  KK   +     + K R +R
Sbjct: 201 KLGFGKQKSSSKKPSRVTKKSAAEAAKKQLNKPRWKR 237


>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family.  This
           family includes gbp a protein from Soybean that binds to
           GAGA element dinucleotide repeat DNA. It seems likely
           that the this domain mediates DNA binding. This putative
           domain contains several conserved cysteines and a
           histidine suggesting this may be a zinc-binding DNA
           interaction domain.
          Length = 301

 Score = 29.9 bits (67), Expect = 2.3
 Identities = 13/63 (20%), Positives = 23/63 (36%), Gaps = 6/63 (9%)

Query: 27  VQGTGSKVEARKKKK------KKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMA 80
                   E +K KK       K  K KK KKK      +VK    +  ++   + +++ 
Sbjct: 138 STAPPEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDPRSKPDWKSQDLV 197

Query: 81  LTQ 83
           +  
Sbjct: 198 ING 200


>gnl|CDD|221654 pfam12589, WBS_methylT, Methyltransferase involved in
          Williams-Beuren syndrome.  This domain family is found
          in eukaryotes, and is typically between 72 and 83 amino
          acids in length. The family is found in association
          with pfam08241. This family is made up of
          S-adenosylmethionine-dependent methyltransferases. The
          proteins are deleted in Williams-Beuren syndrome (WBS),
          a complex developmental disorder with multisystemic
          manifestations including supravalvular aortic stenosis
          (SVAS) and a specific cognitive phenotype.
          Length = 85

 Score = 28.0 bits (63), Expect = 2.3
 Identities = 11/32 (34%), Positives = 18/32 (56%)

Query: 25 MQVQGTGSKVEARKKKKKKKKKKKKKKKKKKN 56
             +   +    R  ++ ++KKKKKKK KKK+
Sbjct: 22 EDDEQIDASKVRRISQRNRRKKKKKKKLKKKS 53



 Score = 27.3 bits (61), Expect = 3.8
 Identities = 12/42 (28%), Positives = 27/42 (64%)

Query: 13 EEDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKK 54
           E+D++++  ++++     ++ + +KKKK KKK K+   +KK
Sbjct: 20 GEEDDEQIDASKVRRISQRNRRKKKKKKKLKKKSKEWILRKK 61


>gnl|CDD|236643 PRK10044, PRK10044, ferrichrome outer membrane transporter;
           Provisional.
          Length = 727

 Score = 30.1 bits (68), Expect = 2.4
 Identities = 16/53 (30%), Positives = 22/53 (41%), Gaps = 5/53 (9%)

Query: 107 RLRSTIRIQEGRAKPIDLLAKYIGSEAEVDAVEMHEPYTYL-----TGLAIKD 154
           R+R+ I    G A  + LL  Y     + D      PY  L     TGL ++D
Sbjct: 399 RMRNDINAWFGYADSVPLLNLYGPVNTDFDFNANSGPYQILNKQKQTGLYVQD 451


>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO.  Bacillus
           spores are protected by a protein shell consisting of
           over 50 different polypeptides, known as the coat. This
           family of proteins has an important morphogenetic role
           in coat assembly, it is involved in the assembly of at
           least 5 different coat proteins including CotB, CotG,
           CotS, CotSA and CotW. It is likely to act at a late
           stage of coat assembly.
          Length = 185

 Score = 29.4 bits (66), Expect = 2.6
 Identities = 22/81 (27%), Positives = 38/81 (46%), Gaps = 1/81 (1%)

Query: 42  KKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEA-AQFEEWQRQEDQ 100
            KK  +K+++K+  +     K+  Q+ E E AE+E+E    +R   A  Q +E   QE++
Sbjct: 33  IKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEE 92

Query: 101 FHLEQARLRSTIRIQEGRAKP 121
              E   ++        R KP
Sbjct: 93  KEEEAEDVKQQEVFSFKRKKP 113


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 29.7 bits (68), Expect = 2.7
 Identities = 14/50 (28%), Positives = 25/50 (50%), Gaps = 6/50 (12%)

Query: 35 EARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQR 84
          EA+K+ +  KK+   + K++      + K R E E E  ER  E+   ++
Sbjct: 46 EAKKEAEAIKKEALLEAKEE------IHKLRNEFEKELRERRNELQKLEK 89



 Score = 29.0 bits (66), Expect = 5.0
 Identities = 18/85 (21%), Positives = 47/85 (55%), Gaps = 3/85 (3%)

Query: 33  KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEA-AQF 91
            +EA+++  K + + +K+ ++++N    ++KR  ++E E  +R+ E  L +R +E   + 
Sbjct: 59  LLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKE-ENLDRKLE-LLEKREEELEKKE 116

Query: 92  EEWQRQEDQFHLEQARLRSTIRIQE 116
           +E ++++ +   ++  L   I  Q 
Sbjct: 117 KELEQKQQELEKKEEELEELIEEQL 141


>gnl|CDD|192292 pfam09429, Wbp11, WW domain binding protein 11.  The WW domain is
          a small protein module with a triple-stranded
          beta-sheet fold. This is a family of WW domain binding
          proteins.
          Length = 78

 Score = 27.6 bits (62), Expect = 2.8
 Identities = 15/57 (26%), Positives = 33/57 (57%), Gaps = 3/57 (5%)

Query: 34 VEARKKKKKKKKKKKKKKKKKKNLCANVKKR---RQEREAERAEREEEMALTQRGKE 87
           +A +K++KKK+ KK K +++    A + K+   R +RE ++ + +EE    +  ++
Sbjct: 7  TDAYRKEQKKKELKKNKAERQARREAKLAKKNPDRLQRELDKLKAQEESGKLRPVEK 63


>gnl|CDD|237276 PRK13024, PRK13024, bifunctional preprotein translocase subunit
           SecD/SecF; Reviewed.
          Length = 755

 Score = 29.8 bits (68), Expect = 2.9
 Identities = 12/22 (54%), Positives = 16/22 (72%)

Query: 35  EARKKKKKKKKKKKKKKKKKKN 56
           E R+ KKKKK+KK KK + +K 
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEKI 754


>gnl|CDD|234032 TIGR02855, spore_yabG, sporulation peptidase YabG.  Members of this
           family are the protein YabG, demonstrated for Bacillus
           subtilis to be an endopeptidase able to release
           N-terminal peptides from a number of sporulation
           proteins, including CotT, CotF, and SpoIVA. It appears
           to be expressed under control of sigma-K [Cellular
           processes, Sporulation and germination].
          Length = 283

 Score = 29.6 bits (67), Expect = 2.9
 Identities = 10/57 (17%), Positives = 26/57 (45%), Gaps = 2/57 (3%)

Query: 32  SKVEARKKKKKKKKKKKKKKKKKKNLC--ANVKKRRQEREAERAEREEEMALTQRGK 86
             V+  ++KK+KK+ K++ +   +       + K ++E  A    +  ++     G+
Sbjct: 50  ELVDESERKKRKKRFKERMETSYRLFRQDYKLMKEKKEYRATGGYKNNQLYFGMPGR 106


>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
          Length = 508

 Score = 29.7 bits (67), Expect = 3.2
 Identities = 26/128 (20%), Positives = 48/128 (37%), Gaps = 8/128 (6%)

Query: 11  YSEEDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREA 70
               + NQ+   AR + +    + E   +K+++   + +K    +N     +K    RE 
Sbjct: 60  LLLRERNQQRQEARREREELQREEERLVQKEEQLDARAEKLDNLENQLEEREKALSAREL 119

Query: 71  ERAEREEEMALTQRGKEAAQFEEWQRQEDQFHLEQARLRSTIRIQEGRAKPIDLLAKYIG 130
           E  E E++  L       A     Q ++    L  A L       E +A+ +  + +   
Sbjct: 120 ELEELEKQ--LDNELYRVAGLTPEQARKLLLKLLDAELE------EEKAQRVKKIEEEAD 171

Query: 131 SEAEVDAV 138
            EAE  A 
Sbjct: 172 LEAERKAQ 179


>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
          Length = 158

 Score = 29.0 bits (65), Expect = 3.2
 Identities = 12/56 (21%), Positives = 29/56 (51%)

Query: 33  KVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEA 88
             +  ++K+  K KK+KK+KK+K +   + K++  +  +   ++     T + +E 
Sbjct: 102 VSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTKKEEG 157


>gnl|CDD|218550 pfam05311, Baculo_PP31, Baculovirus 33KDa late protein (PP31).
           Autographa californica nuclear polyhedrosis virus
           (AcMNPV) pp31 is a nuclear phosphoprotein that
           accumulates in the virogenic stroma, which is the viral
           replication centre in the infected-cell nucleus, binds
           to DNA, and serves as a late expression factor.
          Length = 267

 Score = 29.3 bits (66), Expect = 3.4
 Identities = 11/31 (35%), Positives = 14/31 (45%)

Query: 26  QVQGTGSKVEARKKKKKKKKKKKKKKKKKKN 56
            V    S   +RK+K  KK  KK    KK+ 
Sbjct: 214 VVPTPQSSSPSRKRKAPKKVAKKVAAAKKRK 244


>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
          Length = 470

 Score = 29.7 bits (67), Expect = 3.6
 Identities = 18/49 (36%), Positives = 26/49 (53%)

Query: 12 SEEDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCAN 60
            +DD+  L     + Q   +K   +KKKKKKKKKKKK   +  +L  +
Sbjct: 32 DVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAYD 80



 Score = 28.9 bits (65), Expect = 5.6
 Identities = 21/63 (33%), Positives = 29/63 (46%)

Query: 35  EARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFEEW 94
           EA  K+  KKKKKKKKKKKKKNL               +   ++ +  ++     + E  
Sbjct: 49  EAENKQNNKKKKKKKKKKKKKNLGEAYDLAYDLPVVWSSAAFQDNSHIRKLGNWPEQEWK 108

Query: 95  QRQ 97
           Q Q
Sbjct: 109 QTQ 111


>gnl|CDD|220611 pfam10169, Laps, Learning-associated protein.  This is a family of
           121-amino acid secretory proteins. Laps functions in the
           regulation of neuronal cell adhesion and/or movement and
           synapse attachment. Laps binds to the ApC/EBP (Aplysia
           CCAAT/enhancer binding protein) promoter and activates
           the transcription of ApC/EBP mRNA.
          Length = 124

 Score = 28.3 bits (63), Expect = 3.7
 Identities = 15/22 (68%), Positives = 17/22 (77%)

Query: 36  ARKKKKKKKKKKKKKKKKKKNL 57
             KK KKK++KKKKKKK  KNL
Sbjct: 102 QAKKLKKKREKKKKKKKGAKNL 123



 Score = 27.1 bits (60), Expect = 8.9
 Identities = 13/20 (65%), Positives = 16/20 (80%)

Query: 36  ARKKKKKKKKKKKKKKKKKK 55
            R+ KK KKK++KKKKKKK 
Sbjct: 100 QRQAKKLKKKREKKKKKKKG 119


>gnl|CDD|233973 TIGR02680, TIGR02680, TIGR02680 family protein.  Members of this
           protein family belong to a conserved gene four-gene
           neighborhood found sporadically in a phylogenetically
           broad range of bacteria: Nocardia farcinica,
           Symbiobacterium thermophilum, and Streptomyces
           avermitilis (Actinobacteria), Geobacillus kaustophilus
           (Firmicutes), Azoarcus sp. EbN1 and Ralstonia
           solanacearum (Betaproteobacteria). Proteins in this
           family average over 1400 amino acids in length
           [Hypothetical proteins, Conserved].
          Length = 1353

 Score = 29.8 bits (67), Expect = 3.7
 Identities = 17/49 (34%), Positives = 23/49 (46%), Gaps = 4/49 (8%)

Query: 64  RRQEREAERAEREEEMALTQRGK---EAAQFEEWQRQEDQ-FHLEQARL 108
           RR++  A+RAE     A  +R +   EA Q        D+  H E ARL
Sbjct: 445 RRRDDVADRAEATHAAARARRDELDEEAEQAAARAELADEAVHREGARL 493


>gnl|CDD|219589 pfam07808, RED_N, RED-like protein N-terminal region.  This
          family contains sequences that are similar to the
          N-terminal region of Red protein. This and related
          proteins contain a RED repeat which consists of a
          number of RE and RD sequence elements. The region in
          question has several conserved NLS sequences and a
          putative trimeric coiled-coil region, suggesting that
          these proteins are expressed in the nucleus. The
          function of Red protein is unknown, but efficient
          sequestration to nuclear bodies suggests that its
          expression may be tightly regulated of that the protein
          self-aggregates extremely efficiently.
          Length = 238

 Score = 29.0 bits (65), Expect = 3.9
 Identities = 12/62 (19%), Positives = 25/62 (40%)

Query: 38 KKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFEEWQRQ 97
          KKKKK    +K+++  +K +    + R +ER     +  +  +L      A   +     
Sbjct: 1  KKKKKYAYLRKQEENAEKEINPKYRDRARERRKGINKDYDPSSLAAYRAVAPDAKSDIDA 60

Query: 98 ED 99
           +
Sbjct: 61 AE 62


>gnl|CDD|220376 pfam09745, DUF2040, Coiled-coil domain-containing protein 55
          (DUF2040).  This entry is a conserved domain of
          approximately 130 residues of proteins conserved from
          fungi to humans. The proteins do contain a coiled-coil
          domain, but the function is unknown.
          Length = 128

 Score = 28.1 bits (63), Expect = 4.0
 Identities = 16/60 (26%), Positives = 32/60 (53%)

Query: 32 SKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQF 91
            ++A K++KK+ K  ++K+ +K     ++ +  + R+ ER   EE     +R KE  +F
Sbjct: 25 DSIKAAKEEKKQAKLSERKENRKPKYIGSLLEAAERRKREREIAEERKLQKEREKEGDEF 84


>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 187 and 201 amino
           acids in length. There is a single completely conserved
           residue Q that may be functionally important.
          Length = 158

 Score = 28.3 bits (64), Expect = 4.2
 Identities = 15/58 (25%), Positives = 28/58 (48%), Gaps = 9/58 (15%)

Query: 26  QVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQ 83
           +++    + E + K  + KK+K + + + K L         E +A+ AE E + AL Q
Sbjct: 62  ELEAAAPRAELQAKIARYKKEKARYRSEAKEL---------EAKAKEAEAESDHALHQ 110


>gnl|CDD|219978 pfam08701, GN3L_Grn1, GNL3L/Grn1 putative GTPase.  Grn1 (yeast)
          and GNL3L (human) are putative GTPases which are
          required for growth and play a role in processing of
          nucleolar pre-rRNA. This family contains a potential
          nuclear localisation signal.
          Length = 80

 Score = 27.2 bits (61), Expect = 4.2
 Identities = 15/54 (27%), Positives = 27/54 (50%), Gaps = 12/54 (22%)

Query: 37 RKKKKKKKKKKKKKKKKKK-----NLC-------ANVKKRRQEREAERAEREEE 78
          RK +K+ KK    K KKKK     N           ++++++++E E+  R+E 
Sbjct: 15 RKLRKEAKKNPTWKSKKKKDPGIPNSFPFKEEILEEIEEKKRKQEEEKERRKEA 68


>gnl|CDD|220357 pfam09716, ETRAMP, Malarial early transcribed membrane protein
          (ETRAMP).  These sequences represent a family of
          proteins from the malaria parasite Plasmodium
          falciparum, several of which have been shown to be
          expressed specifically in the ring stage as well as the
          rodent parasite Plasmodium yoelii. A homologue from
          Plasmodium chabaudi was localized to the
          parasitophorous vacuole membrane. Members have an
          initial hydrophobic, Phe/Tyr-rich, stretch long enough
          to span the membrane, a highly charged region rich in
          Lys, a second putative transmembrane region and a
          second highly charged, low complexity sequence region.
          Some members have up to 100 residues of additional
          C-terminal sequence. These genes have been shown to be
          found in the sub-telomeric regions of both Plasmodium
          falciparum and P. yoelii chromosomes.
          Length = 82

 Score = 27.2 bits (61), Expect = 4.5
 Identities = 12/23 (52%), Positives = 14/23 (60%)

Query: 34 VEARKKKKKKKKKKKKKKKKKKN 56
          V  +KK KK KK     +KKKKN
Sbjct: 28 VAKKKKLKKLKKIDDDLEKKKKN 50



 Score = 26.8 bits (60), Expect = 5.1
 Identities = 16/35 (45%), Positives = 20/35 (57%), Gaps = 4/35 (11%)

Query: 27 VQGTGSKVEARKKKKKKKKKK----KKKKKKKKNL 57
            G  + V A+KKK KK KK     +KKKK KK +
Sbjct: 20 APGLNNNVVAKKKKLKKLKKIDDDLEKKKKNKKKI 54


>gnl|CDD|218611 pfam05499, DMAP1, DNA methyltransferase 1-associated protein 1
          (DMAP1).  DNA methylation can contribute to
          transcriptional silencing through several
          transcriptionally repressive complexes, which include
          methyl-CpG binding domain proteins (MBDs) and histone
          deacetylases (HDACs). The chief enzyme that maintains
          mammalian DNA methylation, DNMT1, can also establish a
          repressive transcription complex. The non-catalytic
          amino terminus of DNMT1 binds to HDAC2 and DMAP1 (for
          DNMT1 associated protein), and can mediate
          transcriptional repression. DMAP1 has intrinsic
          transcription repressive activity, and binds to the
          transcriptional co-repressor TSG101. DMAP1 is targeted
          to replication foci through interaction with the far N
          terminus of DNMT1 throughout S phase, whereas HDAC2
          joins DNMT1 and DMAP1 only during late S phase,
          providing a platform for how histones may become
          deacetylated in heterochromatin following replication.
          Length = 175

 Score = 28.7 bits (64), Expect = 4.5
 Identities = 16/38 (42%), Positives = 21/38 (55%)

Query: 16 DNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKK 53
          D Q+L+ A  Q     S  EARK++KK  KKK  +K  
Sbjct: 25 DLQKLITAADQQAAGFSTAEARKREKKLPKKKIPQKPG 62


>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 197

 Score = 28.5 bits (63), Expect = 4.5
 Identities = 24/78 (30%), Positives = 39/78 (50%), Gaps = 3/78 (3%)

Query: 30  TGSKVEARKKKKKKKKKKKKKKKKKKNLC--ANVKKRRQEREAERAEREEEMALTQRGKE 87
           TG  +  R + + +  KK   KKKK  +     ++K RQERE E  ER E     ++  E
Sbjct: 95  TGVPLAIRIRDQLRSVKKDDIKKKKSLIIRQEQIEKARQERE-ELEERMEWERREEKIDE 153

Query: 88  AAQFEEWQRQEDQFHLEQ 105
               EE +R+ ++  +E+
Sbjct: 154 REDQEEQEREREEQTIEE 171


>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
          Length = 1068

 Score = 29.2 bits (66), Expect = 4.5
 Identities = 21/118 (17%), Positives = 41/118 (34%), Gaps = 18/118 (15%)

Query: 19  RLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREE- 77
           R   A   +   G + + +++   K + K ++++ ++    N ++ R ER   R  R   
Sbjct: 571 RFFGALKALFSGGEETKPQEQPAPKAEAKPERQQDRRKPRQNNRRDRNERRDTRDNRTRR 630

Query: 78  -----------------EMALTQRGKEAAQFEEWQRQEDQFHLEQARLRSTIRIQEGR 118
                            +     R  + A+  E  R +D+      R R   R  E R
Sbjct: 631 EGRENREENRRNRRQAQQQTAETRESQQAEVTEKARTQDEQQQAPRRERQRRRNDEKR 688


>gnl|CDD|234084 TIGR03007, pepcterm_ChnLen, polysaccharide chain length determinant
           protein, PEP-CTERM locus subfamily.  Members of this
           protein family belong to the family of polysaccharide
           chain length determinant proteins (pfam02706). All are
           found in species that encode the PEP-CTERM/exosortase
           system predicted to act in protein sorting in a number
           of Gram-negative bacteria, and are found near the epsH
           homolog that is the putative exosortase gene [Cell
           envelope, Biosynthesis and degradation of surface
           polysaccharides and lipopolysaccharides].
          Length = 498

 Score = 29.2 bits (66), Expect = 4.8
 Identities = 15/42 (35%), Positives = 21/42 (50%)

Query: 210 ESVAQDVASVFKGKTTAQLEALQTQIELKISSKTENIDNVIT 251
           E V + +      K+ AQLEAL T++   IS      DN+ T
Sbjct: 84  EKVIRMLDLDLGAKSPAQLEALITKLRKNISISLAGRDNLFT 125


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 28.9 bits (65), Expect = 4.8
 Identities = 10/52 (19%), Positives = 25/52 (48%)

Query: 27 VQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEE 78
          V     ++ +   +++ K      KK KK    N  K++ E++ ++ + ++E
Sbjct: 31 VLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKE 82


>gnl|CDD|216581 pfam01577, Peptidase_S30, Potyvirus P1 protease.  The potyviridae
          family positive stand RNA viruses with genome encoding
          a polyprotein. members include zucchini yellow mosaic
          virus, and turnip mosaic viruses which cause
          considerable losses of crops worldwide. This family
          consists of a C terminus region from various plant
          potyvirus P1 proteins (found at the N terminus of the
          polyprotein). The C terminus of P1 is a serine-type
          protease responsible for autocatalytic cleavage between
          P1 and the helper component protease pfam00851. The
          entire P1 protein may be involved in virus-host
          interactions.
          Length = 245

 Score = 28.8 bits (65), Expect = 4.9
 Identities = 14/56 (25%), Positives = 25/56 (44%)

Query: 28 QGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQ 83
              +KV  R  +K+  K K++KK +      +  +  ++RE  + E  EE    Q
Sbjct: 1  ADLEAKVAERLLRKEMSKIKQEKKGRIILRKLSPAQVAKKREKLKREEREERQFLQ 56


>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
           factor.  Replicative DNA polymerases are capable of
           polymerising tens of thousands of nucleotides without
           dissociating from their DNA templates. The high
           processivity of these polymerases is dependent upon
           accessory proteins that bind to the catalytic subunit of
           the polymerase or to the substrate. The Epstein-Barr
           virus (EBV) BMRF1 protein is an essential component of
           the viral DNA polymerase and is absolutely required for
           lytic virus replication. BMRF1 is also a transactivator.
           This family is predicted to have a UL42 like structure.
          Length = 381

 Score = 28.9 bits (65), Expect = 5.1
 Identities = 11/20 (55%), Positives = 14/20 (70%)

Query: 35  EARKKKKKKKKKKKKKKKKK 54
             RK++  KK+KKKK KK K
Sbjct: 355 PKRKREGDKKQKKKKSKKLK 374


>gnl|CDD|215124 PLN02195, PLN02195, cellulose synthase A.
          Length = 977

 Score = 29.2 bits (65), Expect = 5.3
 Identities = 13/24 (54%), Positives = 17/24 (70%)

Query: 32  SKVEARKKKKKKKKKKKKKKKKKK 55
           ++VE+ K KK KKKK  KKK+  K
Sbjct: 116 NRVESWKDKKNKKKKSAKKKEAHK 139


>gnl|CDD|197876 smart00792, Agouti, Agouti protein.  The agouti protein regulates
          pigmentation in the mouse hair follicle producing a
          black hair with a subapical yellow band. A highly
          homologous protein agouti signal protein (ASIP) is
          present in humans and is expressed at highest levels in
          adipose tissue where it may play a role in energy
          homeostasis and possibly human pigmentation.
          Length = 124

 Score = 27.9 bits (62), Expect = 5.3
 Identities = 8/30 (26%), Positives = 15/30 (50%)

Query: 25 MQVQGTGSKVEARKKKKKKKKKKKKKKKKK 54
          + +     K+ A + +KK  +KK+KK    
Sbjct: 48 VGLNKKPKKISAEEAEKKLLQKKEKKALTN 77


>gnl|CDD|218684 pfam05672, MAP7, MAP7 (E-MAP-115) family.  The organisation of
           microtubules varies with the cell type and is presumably
           controlled by tissue-specific microtubule-associated
           proteins (MAPs). The 115-kDa epithelial MAP
           (E-MAP-115/MAP7) has been identified as a
           microtubule-stabilising protein predominantly expressed
           in cell lines of epithelial origin. The binding of this
           microtubule associated protein is nucleotide
           independent.
          Length = 171

 Score = 28.1 bits (62), Expect = 5.3
 Identities = 21/104 (20%), Positives = 60/104 (57%), Gaps = 1/104 (0%)

Query: 13  EEDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAER 72
           E+++  RL    ++ +    ++   ++ +++++++ ++K++K    A  ++++++ E ER
Sbjct: 48  EQEEQDRLEREELKRRAAEERLRREEEARRQEEERAREKEEKAKRKAEEEEKQEQEEQER 107

Query: 73  AEREEEMALTQRGKEAAQFEEWQRQEDQFHLEQARLRSTIRIQE 116
            ++++E A   R +E A+    +R++    +EQ RL    R++E
Sbjct: 108 IQKQKEEAEA-RAREEAERMRLEREKHFQQIEQERLERKKRLEE 150


>gnl|CDD|219913 pfam08576, DUF1764, Eukaryotic protein of unknown function
          (DUF1764).  This is a family of eukaryotic proteins of
          unknown function. This family contains many
          hypothetical proteins.
          Length = 98

 Score = 27.4 bits (61), Expect = 5.5
 Identities = 15/70 (21%), Positives = 28/70 (40%)

Query: 12 SEEDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAE 71
           + D +      + + +      +  + K  KK +KK KKK +        KRR+  +  
Sbjct: 10 DKRDIDDIFSNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEESKRRRTEDGL 69

Query: 72 RAEREEEMAL 81
              EEE+ +
Sbjct: 70 PIYTEEELKI 79


>gnl|CDD|240576 cd12932, RRP7_like, RRP7 domain ribosomal RNA-processing protein
          7 (Rrp7p), ribosomal RNA-processing protein 7 homolog A
          (Rrp7A), and similar proteins.  This CD corresponds to
          the RRP7 domain of Rrp7p and Rrp7A. Rrp7p is encoded by
          YCL031C gene from Saccharomyces cerevisiae. It is an
          essential yeast protein involved in pre-rRNA processing
          and ribosome assembly, and is speculated to be required
          for correct assembly of rpS27 into the pre-ribosomal
          particle. Rrp7A, also termed gastric cancer antigen
          Zg14, is the Rrp7p homolog mainly found in Metazoans.
          The cellular function of Rrp7A remains unclear
          currently. Both Rrp7p and Rrp7A harbor an N-terminal
          RNA recognition motif (RRM), also termed RBD (RNA
          binding domain) or RNP (ribonucleoprotein domain), and
          a C-terminal RRP7 domain.
          Length = 118

 Score = 27.6 bits (62), Expect = 5.7
 Identities = 12/22 (54%), Positives = 18/22 (81%)

Query: 35 EARKKKKKKKKKKKKKKKKKKN 56
          EA + K K+K+KKKKKKK+ ++
Sbjct: 69 EAVEAKAKEKEKKKKKKKELED 90


>gnl|CDD|189762 pfam00901, Orbi_VP5, Orbivirus outer capsid protein VP5.
           cryoelectron microscopy indicates that VP5 is a trimer
           implying that there are 360 copies of VP5 per virion.
          Length = 507

 Score = 28.9 bits (65), Expect = 5.8
 Identities = 18/55 (32%), Positives = 24/55 (43%), Gaps = 7/55 (12%)

Query: 229 EALQTQIELK-ISSKTENID---NVITAEEKALQREARK---GMGGDEAEFSVES 276
              +TQ E K I    + ID   N I  E++ LQ EA +    M  D  E + E 
Sbjct: 174 SEERTQDETKMIEEYRDKIDALKNAIEVEKEGLQEEAIQEIADMSADILEHAAEE 228


>gnl|CDD|220129 pfam09159, Ydc2-catalyt, Mitochondrial resolvase Ydc2 / RNA
           splicing MRS1.  Members of this family adopt a secondary
           structure consisting of two beta sheets and one alpha
           helix, arranged as a beta-alpha-beta motif. Each beta
           sheet has five strands, arranged in a 32145 order, with
           the second strand being antiparallel to the rest.
           Mitochondrial resolvase Ydc2 is capable of resolving
           Holliday junctions and cleaves DNA after 5'-CT-3' and
           5'-TT-3' sequences. This family also contains the
           mitochondrial RNA-splicing protein MRS1 which is
           involved in the excision of group I introns.
          Length = 254

 Score = 28.5 bits (64), Expect = 5.9
 Identities = 9/21 (42%), Positives = 12/21 (57%)

Query: 38  KKKKKKKKKKKKKKKKKKNLC 58
           KK + KKK  K  KK + +L 
Sbjct: 156 KKPRSKKKSSKNSKKLRIDLV 176



 Score = 28.5 bits (64), Expect = 6.7
 Identities = 10/22 (45%), Positives = 13/22 (59%)

Query: 34  VEARKKKKKKKKKKKKKKKKKK 55
           + A KKK + KKK  K  KK +
Sbjct: 151 ILAEKKKPRSKKKSSKNSKKLR 172


>gnl|CDD|214659 smart00433, TOP2c, TopoisomeraseII.  Eukaryotic DNA topoisomerase
           II, GyrB, ParE.
          Length = 594

 Score = 28.7 bits (65), Expect = 6.0
 Identities = 8/24 (33%), Positives = 13/24 (54%)

Query: 34  VEARKKKKKKKKKKKKKKKKKKNL 57
            +AR   KK ++  +KKK    +L
Sbjct: 344 AKARAAAKKARELTRKKKLSSISL 367


>gnl|CDD|114629 pfam05917, DUF874, Helicobacter pylori protein of unknown function
           (DUF874).  This family consists of several hypothetical
           proteins specific to Helicobacter pylori. The function
           of this family is unknown.
          Length = 417

 Score = 28.7 bits (63), Expect = 6.1
 Identities = 22/103 (21%), Positives = 56/103 (54%), Gaps = 1/103 (0%)

Query: 14  EDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERA 73
           E + Q+     +++  +  K E  K+K +++K+K +++K+K +  AN      E+E ++ 
Sbjct: 159 EQEKQKTEKEGIELANSQIKAEQEKQKTEQEKQKTEQEKQKTSNIANKNAIELEQEKQKT 218

Query: 74  EREEEMALTQRGKEAAQFEE-WQRQEDQFHLEQARLRSTIRIQ 115
           E E++  + ++     + E+  Q   +QF +++  +++ I I+
Sbjct: 219 ENEKQDLIKEQKDFIKEAEQNCQENHNQFFIKKLGIKAGIAIE 261


>gnl|CDD|221857 pfam12923, RRP7, Ribosomal RNA-processing protein 7 (RRP7).  RRP7
           is an essential protein in yeast that is involved in
           pre-rRNA processing and ribosome assembly. It is
           speculated to be required for correct assembly of rpS27
           into the pre-ribosomal particle.
          Length = 131

 Score = 27.6 bits (62), Expect = 6.2
 Identities = 18/46 (39%), Positives = 29/46 (63%), Gaps = 5/46 (10%)

Query: 32  SKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREE 77
           S+ +A ++++K K+KKKKKKK+ +N       R Q RE ++ E  E
Sbjct: 67  SRNKAAEERRKLKEKKKKKKKELENF-----YRFQIREKKKEELAE 107


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 28.8 bits (65), Expect = 6.3
 Identities = 7/31 (22%), Positives = 18/31 (58%)

Query: 36  ARKKKKKKKKKKKKKKKKKKNLCANVKKRRQ 66
             K+  K++K+ +K KK+++     ++K  +
Sbjct: 558 EEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588


>gnl|CDD|221161 pfam11661, DUF2986, Protein of unknown function (DUF2986).  This
          family of proteins has no known function.
          Length = 44

 Score = 25.7 bits (57), Expect = 6.3
 Identities = 18/42 (42%), Positives = 22/42 (52%), Gaps = 2/42 (4%)

Query: 37 RKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEE 78
          RKKK  +K KKK KK   K       K +   +AERA+ E E
Sbjct: 3  RKKKINQKLKKKAKKANAKL--HPSNKPKYISKAERAKLEAE 42


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
           and chromosome partitioning].
          Length = 420

 Score = 28.5 bits (64), Expect = 6.3
 Identities = 26/144 (18%), Positives = 52/144 (36%), Gaps = 10/144 (6%)

Query: 1   MSRLEPGTLVYSEED--DNQRL-VYARMQVQG----TGSKVEARKKKKKKKKKKKKKKKK 53
             R  P  L+ S ED   + RL +Y             +     K+    + +   ++ +
Sbjct: 127 SGRNPPPALLVSPEDAQRSVRLAIYYGALNPARAERIDALKATLKQLAAVRAEIAAEQAE 186

Query: 54  KKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQ--FEEWQRQEDQFHLEQARLRST 111
              L +  + ++ +      ER++ +A       A Q   EE +  E +   E A   + 
Sbjct: 187 LTTLLSEQRAQQAKLAQLLEERKKTLAQLNSELSADQKKLEELRANESRLKNEIASAEAA 246

Query: 112 I-RIQEGRAKPIDLLAKYIGSEAE 134
             + +E  A      A+   +EA+
Sbjct: 247 AAKAREAAAAAEAAAARARAAEAK 270


>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
          Length = 1465

 Score = 29.1 bits (65), Expect = 6.4
 Identities = 10/46 (21%), Positives = 21/46 (45%), Gaps = 2/46 (4%)

Query: 35   EARKKKKKKKKKKKKK--KKKKKNLCANVKKRRQEREAERAEREEE 78
            EAR+K ++   + +    KK  +        ++  ++A  +E  EE
Sbjct: 1177 EAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEE 1222


>gnl|CDD|148523 pfam06952, PsiA, PsiA protein.  This family consists of several
           Enterobacterial PsiA proteins. The function of PsiA is
           unknown although it is thought that it may affect the
           generation of an SOS signal in Escherichia coli.
          Length = 238

 Score = 28.2 bits (63), Expect = 6.4
 Identities = 8/34 (23%), Positives = 19/34 (55%)

Query: 50  KKKKKKNLCANVKKRRQEREAERAEREEEMALTQ 83
           ++++K+++    K R++ RE E+     +  L Q
Sbjct: 117 RRRQKQDIAFTRKMRQEAREVEQRWLLYQNLLAQ 150


>gnl|CDD|235825 PRK06547, PRK06547, hypothetical protein; Provisional.
          Length = 172

 Score = 28.2 bits (63), Expect = 6.5
 Identities = 12/33 (36%), Positives = 16/33 (48%), Gaps = 4/33 (12%)

Query: 69  EAERAEREEEMALTQRGKEAAQFEEWQRQEDQF 101
           EA R ER    AL +    A  +E W  QE++ 
Sbjct: 128 EALRKER----ALARDPDYAPHWEMWAAQEERH 156


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
           envelope biogenesis, outer membrane].
          Length = 387

 Score = 28.8 bits (64), Expect = 6.6
 Identities = 21/79 (26%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 22  YARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMAL 81
           Y R+Q Q +     A+K ++++KKK+++  ++ K   A  ++R ++ E ER + +E+   
Sbjct: 64  YGRIQSQQS----SAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQ 119

Query: 82  TQRGKEAAQFEEWQRQEDQ 100
            +  ++ AQ E+ Q++E  
Sbjct: 120 AEEAEKQAQLEQKQQEEQA 138


>gnl|CDD|236798 PRK10929, PRK10929, putative mechanosensitive channel protein;
           Provisional.
          Length = 1109

 Score = 28.9 bits (65), Expect = 6.6
 Identities = 13/32 (40%), Positives = 18/32 (56%)

Query: 62  KKRRQEREAERAEREEEMALTQRGKEAAQFEE 93
           K+RR E  A+RA  EEE   +   + A + EE
Sbjct: 737 KQRRAEILAQRARGEEEAHHSSSPEGAIEVEE 768


>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe.  JAK1 is a
           tyrosine kinase protein essential in signaling type I
           and type II cytokines. It interacts with the gamma chain
           of type I cytokine receptors to elicit signals from the
           IL-2 receptor family, the IL-4 receptor family, the
           gp130 receptor family, ciliary neurotrophic factor
           receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
           Leptin-R). It also is involved in transducing a signal
           by type I (IFN-alpha/beta) and type II (IFN-gamma)
           interferons, and members of the IL-10 family via type II
           cytokine receptors. JAK (also called Just Another
           Kinase) is a family of intracellular, non-receptor
           tyrosine kinases that transduce cytokine-mediated
           signals via the JAK-STAT pathway. The JAK family in
           mammals consists of 4 members: JAK1, JAK2, JAK3 and
           TYK2. JAKs are composed of seven JAK homology (JH)
           domains (JH1-JH7) . The C-terminal JH1 domain is the
           main catalytic domain, followed by JH2, which is often
           referred to as a pseudokinase domain, followed by
           JH3-JH4 which is homologous to the SH2 domain, and
           lastly JH5-JH7 which is a FERM domain.  Named after
           Janus, the two-faced Roman god of doorways, JAKs possess
           two near-identical phosphate-transferring domains; one
           which displays the kinase activity (JH1), while the
           other negatively regulates the kinase activity of the
           first (JH2). The FERM domain has a cloverleaf tripart
           structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
           C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
           C-lobe/F3 within the FERM domain is part of the PH
           domain family. The FERM domain is found in the
           cytoskeletal-associated proteins such as ezrin, moesin,
           radixin, 4.1R, and merlin. These proteins provide a link
           between the membrane and cytoskeleton and are involved
           in signal transduction pathways. The FERM domain is also
           found in protein tyrosine phosphatases (PTPs) , the
           tyrosine kinases FAK and JAK, in addition to other
           proteins involved in signaling. This domain is
           structurally similar to the PH and PTB domains and
           consequently is capable of binding to both peptides and
           phospholipids at different sites.
          Length = 198

 Score = 28.3 bits (63), Expect = 6.6
 Identities = 12/23 (52%), Positives = 14/23 (60%)

Query: 34  VEARKKKKKKKKKKKKKKKKKKN 56
           VE +KK K KK K K KK + K 
Sbjct: 89  VEKKKKGKSKKNKLKGKKDEDKK 111


>gnl|CDD|218734 pfam05758, Ycf1, Ycf1.  The chloroplast genomes of most higher
           plants contain two giant open reading frames designated
           ycf1 and ycf2. Although the function of Ycf1 is unknown,
           it is known to be an essential gene.
          Length = 832

 Score = 28.8 bits (65), Expect = 7.0
 Identities = 9/21 (42%), Positives = 13/21 (61%)

Query: 38  KKKKKKKKKKKKKKKKKKNLC 58
           K +  ++ K KKK  KKK+ C
Sbjct: 785 KLRSSQRDKLKKKLLKKKDFC 805


>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 926

 Score = 28.8 bits (64), Expect = 7.2
 Identities = 19/110 (17%), Positives = 39/110 (35%), Gaps = 9/110 (8%)

Query: 22  YARMQVQGTGSKVEAR---------KKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAER 72
           Y RM  + +G  V            K   + K+K K+K    K +         E E   
Sbjct: 767 YPRMYEEASGKSVRKVNTAVLSTTIKAAARAKQKPKEKGPNDKEIKIESPSVETEGERCT 826

Query: 73  AEREEEMALTQRGKEAAQFEEWQRQEDQFHLEQARLRSTIRIQEGRAKPI 122
            ++ EE  +        + ++  + ++   +   + R    I++ R  P+
Sbjct: 827 IKQREEKGIDAPAILNVKKKKPYKVDNMTRILPQQSRYISFIKDDRFVPV 876


>gnl|CDD|236413 PRK09210, PRK09210, RNA polymerase sigma factor RpoD; Validated.
          Length = 367

 Score = 28.4 bits (64), Expect = 7.4
 Identities = 14/43 (32%), Positives = 18/43 (41%)

Query: 40 KKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALT 82
          KK KKK+    ++  K L    KKR      E AE+     L 
Sbjct: 1  KKTKKKETTTLEQVVKELIEKGKKRGTLTYDEIAEKLIPFELD 43


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 28.6 bits (64), Expect = 7.5
 Identities = 22/93 (23%), Positives = 42/93 (45%), Gaps = 8/93 (8%)

Query: 15  DDNQRLVYA----RMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREA 70
           DDN   + A      +V+     +   K+++  K+  KK +   K L     +     EA
Sbjct: 340 DDNDLTLLALTKLYEEVKSNT-DLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEA 398

Query: 71  ERAEREEEMALTQ---RGKEAAQFEEWQRQEDQ 100
           E   +EE++   +   + KE A  ++ +RQ+D+
Sbjct: 399 EAKAKEEKLKQEENEKKQKEQADEDKEKRQKDE 431


>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4;
          Provisional.
          Length = 191

 Score = 27.9 bits (62), Expect = 7.6
 Identities = 15/39 (38%), Positives = 18/39 (46%)

Query: 40 KKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEE 78
          KK  KKKK   +KKK       KK+ +E   E A  E  
Sbjct: 48 KKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELS 86



 Score = 27.9 bits (62), Expect = 9.1
 Identities = 16/45 (35%), Positives = 20/45 (44%)

Query: 34 VEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEE 78
          VE   KK  KKKK   +KKK        KK ++E     AE   +
Sbjct: 43 VEIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSD 87


>gnl|CDD|113514 pfam04747, DUF612, Protein of unknown function, DUF612.  This
           family includes several uncharacterized proteins from
           Caenorhabditis elegans.
          Length = 517

 Score = 28.5 bits (62), Expect = 7.7
 Identities = 29/100 (29%), Positives = 51/100 (51%), Gaps = 7/100 (7%)

Query: 15  DDNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCANVKK------RRQER 68
           +D ++  +A +++     +VE  KK +KKK +K+  K  +     N KK      RR E 
Sbjct: 48  NDQRKEAFASLELTEQPQQVEKVKKSEKKKAQKQIAKDHEAEQKVNAKKAAEKEARRAEA 107

Query: 69  EA-ERAEREEEMALTQRGKEAAQFEEWQRQEDQFHLEQAR 107
           EA +RA +EEE    +  +E  Q E+ +++ D   L+  +
Sbjct: 108 EAKKRAAQEEEHKQWKAEQERIQKEQEKKEADLKKLQAEK 147


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
          RNase Y, an endoribonuclease. The member from Bacillus
          subtilis, YmdA, has been shown to be involved in
          turnover of yitJ riboswitch [Transcription, Degradation
          of RNA].
          Length = 514

 Score = 28.4 bits (64), Expect = 7.7
 Identities = 20/78 (25%), Positives = 31/78 (39%), Gaps = 5/78 (6%)

Query: 19 RLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCA---NVKKRRQEREAERAER 75
            +  +   +      E   K+  ++ KK+ +  KK+ L      V K R E E E  ER
Sbjct: 15 GYLLRKRIAEKKLGSAEELAKRIIEEAKKEAETLKKEALLEAKEEVHKLRAELERELKER 74

Query: 76 EEEMALTQRGKEAAQFEE 93
            E  L +  +   Q EE
Sbjct: 75 RNE--LQRLERRLLQREE 90


>gnl|CDD|177779 PLN00185, PLN00185, 60S ribosomal protein L4-1; Provisional.
          Length = 405

 Score = 28.2 bits (63), Expect = 7.9
 Identities = 12/30 (40%), Positives = 17/30 (56%)

Query: 35  EARKKKKKKKKKKKKKKKKKKNLCANVKKR 64
           EA + K KK+K  KK+K+  K   A +K  
Sbjct: 348 EAARVKAKKEKLAKKRKQVSKEEAAAIKAA 377


>gnl|CDD|225664 COG3122, COG3122, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 215

 Score = 28.0 bits (62), Expect = 8.3
 Identities = 17/60 (28%), Positives = 35/60 (58%), Gaps = 2/60 (3%)

Query: 41  KKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRGKEAAQFEEWQRQEDQ 100
           KK+ K ++  KK + +     +   +E +AE+ ER+++++  Q  K+AA  +E++ Q  Q
Sbjct: 58  KKRAKVQRLAKKSRVDAREAYRIAAEENKAEQLERDKQLSEQQ--KQAALAKEYKAQVKQ 115


>gnl|CDD|219547 pfam07741, BRF1, Brf1-like TBP-binding domain.  This region
          covers both the Brf homology II and III regions. This
          region is involved in binding TATA binding protein.
          Length = 95

 Score = 26.8 bits (60), Expect = 8.4
 Identities = 15/54 (27%), Positives = 22/54 (40%), Gaps = 6/54 (11%)

Query: 32 SKVEARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERAEREEEMALTQRG 85
           K   +K  +     KKKKK+K K      KKR +   A  A    +  L ++ 
Sbjct: 34 EKELKQKADEGNNSGKKKKKRKAK------KKRDEAGPASTAAEAVKKMLQKKK 81


>gnl|CDD|218437 pfam05107, DUF694, Family of unknown function (DUF694).  Family of
           hypothetical bacterial proteins.
          Length = 278

 Score = 28.0 bits (63), Expect = 8.5
 Identities = 15/51 (29%), Positives = 27/51 (52%), Gaps = 1/51 (1%)

Query: 10  VYSEEDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKKKKKKKKKKNLCAN 60
           ++ +E +     +     +      E  K+KKKKKKKKKKK++ ++ +C  
Sbjct: 60  IFVQEKEKLNDEHGE-LYERADRAEELLKEKKKKKKKKKKKEELREWMCEK 109


>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 265

 Score = 28.1 bits (63), Expect = 8.6
 Identities = 15/86 (17%), Positives = 37/86 (43%), Gaps = 1/86 (1%)

Query: 9   LVYSEEDDNQRLVYARMQVQGTGSKVEARKKKKKKKKKK-KKKKKKKKNLCANVKKRRQE 67
           ++  +++D + L   +  ++     + A + + + +      +K +K  L A +  +   
Sbjct: 149 ILEQQKEDKKSLEEKQAALEDKLETLVALQNELETQLNSLNSQKAEKNALIAALAAKEAS 208

Query: 68  REAERAEREEEMALTQRGKEAAQFEE 93
              E+A  EE+ AL +     A  +E
Sbjct: 209 ALGEKAALEEQKALAEAAAAEAAKQE 234


>gnl|CDD|222790 PHA00430, PHA00430, tail fiber protein.
          Length = 568

 Score = 28.3 bits (63), Expect = 8.8
 Identities = 12/46 (26%), Positives = 21/46 (45%)

Query: 65  RQEREAERAEREEEMALTQRGKEAAQFEEWQRQEDQFHLEQARLRS 110
           R   EA+RA  + E    + G  A   ++W+ + D  + E  R + 
Sbjct: 170 RSRNEADRARNQAERFNNESGASATNTKQWRSEADGSNSEANRFKG 215


>gnl|CDD|240292 PTZ00148, PTZ00148, 40S ribosomal protein S8; Provisional.
          Length = 205

 Score = 27.6 bits (62), Expect = 9.0
 Identities = 14/38 (36%), Positives = 27/38 (71%)

Query: 36  ARKKKKKKKKKKKKKKKKKKNLCANVKKRRQEREAERA 73
            +KKK KK+++ KK KKK ++L A ++ R++  + ++A
Sbjct: 122 GKKKKYKKEEENKKAKKKSESLVAKLEVRKKNHKIDKA 159


>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
           Validated.
          Length = 860

 Score = 28.2 bits (63), Expect = 9.7
 Identities = 12/29 (41%), Positives = 16/29 (55%)

Query: 28  QGTGSKVEARKKKKKKKKKKKKKKKKKKN 56
           +   +K +A K  KKK K K KK  KK+ 
Sbjct: 756 KSASTKGKAAKTVKKKSKAKSKKTTKKRA 784


>gnl|CDD|218413 pfam05069, Phage_tail_S, Phage virion morphogenesis family.
          Protein S of phage P2 is thought to be involved in tail
          completion and stable head joining.
          Length = 145

 Score = 27.3 bits (61), Expect = 9.8
 Identities = 8/30 (26%), Positives = 12/30 (40%)

Query: 24 RMQVQGTGSKVEARKKKKKKKKKKKKKKKK 53
            Q    GS    RK   + KK + ++  K
Sbjct: 42 AAQQAPDGSPWAPRKPTLRAKKGRIRRGGK 71


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.314    0.132    0.378 

Gapped
Lambda     K      H
   0.267   0.0708    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 21,182,744
Number of extensions: 2125889
Number of successful extensions: 10595
Number of sequences better than 10.0: 1
Number of HSP's gapped: 7197
Number of HSP's successfully gapped: 795
Length of query: 409
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 310
Effective length of database: 6,546,556
Effective search space: 2029432360
Effective search space used: 2029432360
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 60 (26.9 bits)