RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy16862
         (303 letters)



>gnl|CDD|215646 pfam00001, 7tm_1, 7 transmembrane receptor (rhodopsin family).
           This family contains, amongst other G-protein-coupled
           receptors (GCPRs), members of the opsin family, which
           have been considered to be typical members of the
           rhodopsin superfamily. They share several motifs, mainly
           the seven transmembrane helices, GCPRs of the rhodopsin
           superfamily. All opsins bind a chromophore, such as
           11-cis-retinal. The function of most opsins other than
           the photoisomerases is split into two steps: light
           absorption and G-protein activation. Photoisomerases, on
           the other hand, are not coupled to G-proteins - they are
           thought to generate and supply the chromophore that is
           used by visual opsins.
          Length = 251

 Score =  145 bits (368), Expect = 4e-42
 Identities = 53/190 (27%), Positives = 91/190 (47%), Gaps = 11/190 (5%)

Query: 66  VIKHRKLRIITNYFVVSLAFADLLVALCVMPFNAIVSLTDEWYFGYFMCDVWNSFDVYFS 125
           +++ +KLR  TN F+++LA ADLL  L + P+     +  +W FG  +C +     V   
Sbjct: 3   ILRTKKLRTPTNIFLLNLAVADLLFLLTLPPWALYYLVGGDWPFGDALCKLVGFLFVVNG 62

Query: 126 TASILHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLSNVWILPGIISFTPIMLGWYTTP 185
            ASIL L  IS+DRY AIV PL+Y  I       +++  VW+L  ++S  P++  W  T 
Sbjct: 63  YASILLLTAISIDRYLAIVHPLRYRRIRTPRRAKVLILVVWVLALLLSLPPLLFSWLRTV 122

Query: 186 DHKQYRKHHPNVCIFVVNVYYALISSSISFWIPGFVM-----------KKKKKRKKKKKK 234
           +         +         Y L+S+ + F +P  V+           +K+ +    + +
Sbjct: 123 EEGNVTTCLIDFPEESTKRSYTLLSTLLGFVLPLLVILVCYTLILRTLRKRARSGASQAR 182

Query: 235 KKKKKRKKKK 244
            K+   K++K
Sbjct: 183 AKRSSSKERK 192


>gnl|CDD|222976 PHA03087, PHA03087, G protein-coupled chemokine receptor-like
           protein; Provisional.
          Length = 335

 Score = 71.7 bits (176), Expect = 3e-14
 Identities = 50/188 (26%), Positives = 87/188 (46%), Gaps = 15/188 (7%)

Query: 3   NTIYNSLNYTYNINYTSTNFS---NTSLFPTNSTIHIHIFPFIIKGMLMCFIIITAILGN 59
           NT   +    Y   Y   +++       + TNSTI I         ++   I    ++GN
Sbjct: 8   NTTIENTTDYYYDTYYDEDYADCDLNIGYDTNSTILI---------VVYSTIFFFGLVGN 58

Query: 60  LLVIISVIKHRKLRIITNYFVVSLAFADLLVALCVMPFNAIVSLTDEWYFGYFMCDVWNS 119
           ++VI  V+   K++   + ++++LA +DLL  +  +PF     +  +W FG F C + + 
Sbjct: 59  IIVIY-VLTKTKIKTPMDIYLLNLAVSDLLFVM-TLPFQIYYYILFQWSFGEFACKIVSG 116

Query: 120 FDVYFSTASILHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLSNVWILPGIISFTPIML 179
                   S+  +  +SVDRY AIV P++   I       ++   +WI+  II  TPI+ 
Sbjct: 117 LYYIGFYNSMNFITVMSVDRYIAIVHPVKSNKINTVKYGYIVSLVIWII-SIIETTPILF 175

Query: 180 GWYTTPDH 187
            + T  DH
Sbjct: 176 VYTTKKDH 183


>gnl|CDD|165021 PHA02638, PHA02638, CC chemokine receptor-like protein;
           Provisional.
          Length = 417

 Score = 67.0 bits (163), Expect = 1e-12
 Identities = 52/165 (31%), Positives = 87/165 (52%), Gaps = 16/165 (9%)

Query: 11  YTYNINYTSTNFSNTSLFPTNSTIHIHIFPFIIKGMLMCFIIITAILGNLLVIISVIKHR 70
           Y  NI Y   N  N  ++P+ S  +I IF  II         I  + GN   II ++  +
Sbjct: 77  YENNITYELINIKNKCMYPSISE-YIKIFYIII--------FILGLFGNAA-IIMILFCK 126

Query: 71  KLRIITNYFVVSLAFADLLVALCVMPFNAIVSLTDEWYFGYFMCDVWNS--FDVYFSTAS 128
           K++ IT+ ++ +LA +DL+  +   PF  I +  D+W FG FMC V ++  +  +FS   
Sbjct: 127 KIKTITDIYIFNLAISDLIFVI-DFPF-IIYNEFDQWIFGDFMCKVISASYYIGFFSNMF 184

Query: 129 ILHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLSNVWILPGIIS 173
           ++ L  +S+DRY+AI+ P+ +      N  +++    WIL  II+
Sbjct: 185 LITL--MSIDRYFAILYPISFQKYRTFNIGIILCIISWILSLIIT 227


>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
          Length = 434

 Score = 53.8 bits (130), Expect = 4e-08
 Identities = 19/67 (28%), Positives = 26/67 (38%), Gaps = 19/67 (28%)

Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
            +K+  K  KK   KR +KK                   EK+K+K K KK+    K   K
Sbjct: 384 SEKKTGKPSKKVLAKRAEKK-------------------EKEKEKPKVKKRHRDTKNIGK 424

Query: 285 KRKKKKN 291
           +RK    
Sbjct: 425 RRKPSGT 431



 Score = 52.3 bits (126), Expect = 1e-07
 Identities = 20/70 (28%), Positives = 29/70 (41%), Gaps = 16/70 (22%)

Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
           K K   +KK  K  KK   K              R E++E++K+K K KK+   +  K  
Sbjct: 379 KTKAPSEKKTGKPSKKVLAK--------------RAEKKEKEKEKPKVKKRH--RDTKNI 422

Query: 284 KKRKKKKNQH 293
            KR+K     
Sbjct: 423 GKRRKPSGTS 432



 Score = 48.8 bits (117), Expect = 1e-06
 Identities = 17/66 (25%), Positives = 26/66 (39%), Gaps = 20/66 (30%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           KK  K  KK   K+ +K++K+K                    +K K KK+ + T    KR
Sbjct: 386 KKTGKPSKKVLAKRAEKKEKEK--------------------EKPKVKKRHRDTKNIGKR 425

Query: 283 KKKRKK 288
           +K    
Sbjct: 426 RKPSGT 431



 Score = 40.7 bits (96), Expect = 6e-04
 Identities = 14/37 (37%), Positives = 18/37 (48%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRD 296
              E+K  K  KK      +KK K+K K K  + HRD
Sbjct: 382 APSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRD 418



 Score = 35.7 bits (83), Expect = 0.024
 Identities = 7/25 (28%), Positives = 12/25 (48%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQH 247
           +K K +K+ +  K   KR+K     
Sbjct: 408 EKPKVKKRHRDTKNIGKRRKPSGTS 432



 Score = 34.5 bits (80), Expect = 0.052
 Identities = 8/23 (34%), Positives = 14/23 (60%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKK 244
            K+K+K K KK+ +  K   K++
Sbjct: 404 EKEKEKPKVKKRHRDTKNIGKRR 426



 Score = 31.5 bits (72), Expect = 0.59
 Identities = 11/30 (36%), Positives = 14/30 (46%)

Query: 261 EEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
           +E   K K   +KK     KK   KR +KK
Sbjct: 374 DELRPKTKAPSEKKTGKPSKKVLAKRAEKK 403



 Score = 31.1 bits (71), Expect = 0.68
 Identities = 6/24 (25%), Positives = 12/24 (50%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQ 246
            K KKR +  K   K+++    ++
Sbjct: 410 PKVKKRHRDTKNIGKRRKPSGTSE 433



 Score = 29.9 bits (68), Expect = 1.6
 Identities = 6/24 (25%), Positives = 11/24 (45%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQ 246
           K KK+ +  K   K++K      +
Sbjct: 411 KVKKRHRDTKNIGKRRKPSGTSEE 434


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 53.4 bits (128), Expect = 6e-08
 Identities = 19/70 (27%), Positives = 37/70 (52%), Gaps = 2/70 (2%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKK--KKKKKKTTKKK 280
           + K+KR  K+K+K+K+K+ ++      +    R       ++  KKK   KKK+   ++K
Sbjct: 136 EAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEK 195

Query: 281 KRKKKRKKKK 290
           +R+  R+  K
Sbjct: 196 QRQAAREAVK 205



 Score = 53.0 bits (127), Expect = 7e-08
 Identities = 22/79 (27%), Positives = 44/79 (55%), Gaps = 11/79 (13%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           K +  ++++K+K++ K+ KKKK             + +EE + +K K++ K+K   K+K 
Sbjct: 99  KNESGKEEEKEKEQVKEEKKKKK-----------EKPKEEPKDRKPKEEAKEKRPPKEKE 147

Query: 283 KKKRKKKKNQHHRDVWQKK 301
           K+K KK +    R+  +K+
Sbjct: 148 KEKEKKVEEPRDREEEKKR 166



 Score = 52.6 bits (126), Expect = 1e-07
 Identities = 19/70 (27%), Positives = 34/70 (48%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           K++ K K+  K+K+K+K KK +    R+    R     +   KK  KKK   K  +  + 
Sbjct: 134 KEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEE 193

Query: 283 KKKRKKKKNQ 292
           +K+R+  +  
Sbjct: 194 EKQRQAAREA 203



 Score = 50.3 bits (120), Expect = 6e-07
 Identities = 25/78 (32%), Positives = 45/78 (57%), Gaps = 9/78 (11%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKK---------KK 273
           ++K+K + K++KKKKK++ K++ +  +     +  R  +E+EK+K+KK          KK
Sbjct: 106 EEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKK 165

Query: 274 KKTTKKKKRKKKRKKKKN 291
           ++  + K R KK  KKK 
Sbjct: 166 RERVRAKSRPKKPPKKKP 183



 Score = 49.9 bits (119), Expect = 9e-07
 Identities = 21/74 (28%), Positives = 46/74 (62%), Gaps = 5/74 (6%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
            K+++++K++ K++KKK+K+K  +  +D         EE +EK+  K+K+K+K  K ++ 
Sbjct: 103 GKEEEKEKEQVKEEKKKKKEKPKEEPKD-----RKPKEEAKEKRPPKEKEKEKEKKVEEP 157

Query: 283 KKKRKKKKNQHHRD 296
           + + ++KK +  R 
Sbjct: 158 RDREEEKKRERVRA 171



 Score = 38.3 bits (89), Expect = 0.004
 Identities = 13/47 (27%), Positives = 28/47 (59%)

Query: 255 RTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQKK 301
           +T   +E + +  K+++K+K+  K++K+KKK K K+    R   ++ 
Sbjct: 91  KTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEA 137



 Score = 38.3 bits (89), Expect = 0.004
 Identities = 14/74 (18%), Positives = 30/74 (40%), Gaps = 2/74 (2%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
            + K R KK  KKK   +KK+  +  +     R +   + EE    ++++K++     K 
Sbjct: 169 VRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEE--DDGKD 226

Query: 283 KKKRKKKKNQHHRD 296
           ++       +    
Sbjct: 227 RETTTSPMEEDESR 240



 Score = 36.8 bits (85), Expect = 0.011
 Identities = 16/75 (21%), Positives = 35/75 (46%), Gaps = 14/75 (18%)

Query: 227 KRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKR 286
           KR +K   K    + K                 E + E  K+++K+K++  ++KK+KK++
Sbjct: 78  KRVEKGGSKGPAAKTKPAK--------------EPKNESGKEEEKEKEQVKEEKKKKKEK 123

Query: 287 KKKKNQHHRDVWQKK 301
            K++ +  +   + K
Sbjct: 124 PKEEPKDRKPKEEAK 138



 Score = 34.5 bits (79), Expect = 0.067
 Identities = 15/77 (19%), Positives = 33/77 (42%), Gaps = 13/77 (16%)

Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
             +++ + K + KK  KKK   KK +               EEEK+++  ++  K   ++
Sbjct: 164 KKRERVRAKSRPKKPPKKKPPNKKKE-------------PPEEEKQRQAAREAVKGKPEE 210

Query: 281 KRKKKRKKKKNQHHRDV 297
               + ++K+    +D 
Sbjct: 211 PDVNEEREKEEDDGKDR 227



 Score = 33.7 bits (77), Expect = 0.10
 Identities = 14/76 (18%), Positives = 31/76 (40%), Gaps = 15/76 (19%)

Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
           + + + + KK  KKK   KKK               E  EE+K+++  ++    K ++  
Sbjct: 168 RVRAKSRPKKPPKKKPPNKKK---------------EPPEEEKQRQAAREAVKGKPEEPD 212

Query: 284 KKRKKKKNQHHRDVWQ 299
              +++K +      +
Sbjct: 213 VNEEREKEEDDGKDRE 228


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 52.4 bits (125), Expect = 1e-07
 Identities = 20/82 (24%), Positives = 39/82 (47%), Gaps = 10/82 (12%)

Query: 219 GFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKK----- 273
           G V KKK  R  K+  KK      +  +   ++ +N  +  ++E +K  ++ ++K     
Sbjct: 66  GAVSKKKPTRSVKRATKKTVVEISEPLEEGSELVVNEDAALDKESKKTPRRTRRKAAAAS 125

Query: 274 -----KKTTKKKKRKKKRKKKK 290
                +KT KK ++++K KK  
Sbjct: 126 SDVEEEKTEKKVRKRRKVKKMD 147



 Score = 30.4 bits (68), Expect = 1.2
 Identities = 15/73 (20%), Positives = 26/73 (35%), Gaps = 11/73 (15%)

Query: 219 GFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTK 278
           GF   + ++ + + K       +KK             S    +EE  +     KKK T+
Sbjct: 27  GFNKFRGERWRLQNKFLLAAGSRKK-----------IESALAVDEEPDENGAVSKKKPTR 75

Query: 279 KKKRKKKRKKKKN 291
             KR  K+   + 
Sbjct: 76  SVKRATKKTVVEI 88


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 50.5 bits (121), Expect = 2e-07
 Identities = 24/69 (34%), Positives = 38/69 (55%), Gaps = 15/69 (21%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
               K  K+ + ++++KK KKKK               +E +++KK+KK KK+K  + K 
Sbjct: 138 ETTAKVEKEAEVEEEEKKEKKKK---------------KEVKKEKKEKKDKKEKMVEPKG 182

Query: 282 RKKKRKKKK 290
            KKK+KKKK
Sbjct: 183 SKKKKKKKK 191



 Score = 48.2 bits (115), Expect = 1e-06
 Identities = 20/66 (30%), Positives = 39/66 (59%), Gaps = 9/66 (13%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           K+   + +K+ + +++++K+KK +             +E++EKK KK+K  +    KKK+
Sbjct: 137 KETTAKVEKEAEVEEEEKKEKKKKKEVK---------KEKKEKKDKKEKMVEPKGSKKKK 187

Query: 283 KKKRKK 288
           KKK+KK
Sbjct: 188 KKKKKK 193



 Score = 41.6 bits (98), Expect = 2e-04
 Identities = 22/63 (34%), Positives = 35/63 (55%), Gaps = 18/63 (28%)

Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
           V +++KK KKKKK+ KK+K++KK                  ++++K  + K  KK  KKK
Sbjct: 149 VEEEEKKEKKKKKEVKKEKKEKK------------------DKKEKMVEPKGSKKKKKKK 190

Query: 281 KRK 283
           K+K
Sbjct: 191 KKK 193



 Score = 40.9 bits (96), Expect = 2e-04
 Identities = 20/44 (45%), Positives = 30/44 (68%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQKKYT 303
           E E E+++KK+KKKKK  KK+K++KK KK+K    +   +KK  
Sbjct: 146 EAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKK 189


>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
           This is a family of fungal proteins whose function is
           unknown.
          Length = 130

 Score = 47.7 bits (114), Expect = 5e-07
 Identities = 24/67 (35%), Positives = 43/67 (64%), Gaps = 7/67 (10%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
           ++K+  R+K+KKKK+K+  KK+     R     R  R +E  EK+K+ +K ++K  K K+
Sbjct: 71  VEKELLREKEKKKKRKRPGKKR-----RIALRLRRERTKERAEKEKRTRKNREK--KFKR 123

Query: 282 RKKKRKK 288
           R+K+++K
Sbjct: 124 RQKEKEK 130



 Score = 42.3 bits (100), Expect = 4e-05
 Identities = 16/69 (23%), Positives = 40/69 (57%), Gaps = 13/69 (18%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
           + ++K++KKK+K+  KK+R   +                E  +++ +K+K+ +K  +KK 
Sbjct: 75  LLREKEKKKKRKRPGKKRRIALR-------------LRRERTKERAEKEKRTRKNREKKF 121

Query: 282 RKKKRKKKK 290
           ++++++K+K
Sbjct: 122 KRRQKEKEK 130



 Score = 30.7 bits (70), Expect = 0.41
 Identities = 14/52 (26%), Positives = 28/52 (53%), Gaps = 12/52 (23%)

Query: 253 INRTSRGEEEEEKKKKKKKKK------------KKTTKKKKRKKKRKKKKNQ 292
           + +    E+E++KK+K+  KK            K+  +K+KR +K ++KK +
Sbjct: 71  VEKELLREKEKKKKRKRPGKKRRIALRLRRERTKERAEKEKRTRKNREKKFK 122



 Score = 29.9 bits (68), Expect = 0.72
 Identities = 12/38 (31%), Positives = 24/38 (63%)

Query: 253 INRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
           + + +   E+E  ++K+KKKK+K   KK+R   R +++
Sbjct: 64  LKKHNAKVEKELLREKEKKKKRKRPGKKRRIALRLRRE 101



 Score = 28.0 bits (63), Expect = 3.6
 Identities = 12/39 (30%), Positives = 26/39 (66%), Gaps = 2/39 (5%)

Query: 260 EEEEEKKKKKKKKKKKTTKKK--KRKKKRKKKKNQHHRD 296
           E+E  ++K+KKKK+K+  KK+    + +R++ K +  ++
Sbjct: 72  EKELLREKEKKKKRKRPGKKRRIALRLRRERTKERAEKE 110


>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19. 
           Med19 represents a family of conserved proteins which
           are members of the multi-protein co-activator Mediator
           complex. Mediator is required for activation of RNA
           polymerase II transcription by DNA binding
           transactivators.
          Length = 178

 Score = 48.7 bits (116), Expect = 5e-07
 Identities = 25/71 (35%), Positives = 36/71 (50%), Gaps = 5/71 (7%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
           M  +  +KK K K KK + +    +          S G +  EKK KKKK +    +KKK
Sbjct: 104 MHIQPPKKKHKHKHKKHRTQDPLPEETPS-----DSEGLKGHEKKHKKKKHEDDKERKKK 158

Query: 282 RKKKRKKKKNQ 292
           +K+K+KKKK  
Sbjct: 159 KKEKKKKKKRH 169



 Score = 46.7 bits (111), Expect = 2e-06
 Identities = 29/72 (40%), Positives = 37/72 (51%), Gaps = 2/72 (2%)

Query: 228 RKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKK--KKTTKKKKRKKK 285
             +  KKK K K KK + Q          S G +  EKK KKKK +  K+  KKKK KKK
Sbjct: 105 HIQPPKKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKK 164

Query: 286 RKKKKNQHHRDV 297
           +KK+ +  H  V
Sbjct: 165 KKKRHSPEHPGV 176



 Score = 39.0 bits (91), Expect = 0.001
 Identities = 15/27 (55%), Positives = 19/27 (70%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHR 249
            KKKK +  K++KKKKK KKKK + H 
Sbjct: 144 HKKKKHEDDKERKKKKKEKKKKKKRHS 170



 Score = 30.2 bits (68), Expect = 0.80
 Identities = 21/66 (31%), Positives = 28/66 (42%), Gaps = 25/66 (37%)

Query: 218 PGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTT 277
                 +  K  +KK KKKK                        E++K++KKKKK+KK  
Sbjct: 129 ETPSDSEGLKGHEKKHKKKKH-----------------------EDDKERKKKKKEKK-- 163

Query: 278 KKKKRK 283
           KKKKR 
Sbjct: 164 KKKKRH 169


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 49.9 bits (120), Expect = 7e-07
 Identities = 20/72 (27%), Positives = 46/72 (63%), Gaps = 2/72 (2%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK- 281
            KK  +K KK  +K +K+++++ +  +        + EEEEE+K+KK+++K++  ++ + 
Sbjct: 405 SKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEE 464

Query: 282 -RKKKRKKKKNQ 292
            ++++ +KKK Q
Sbjct: 465 EKEEEEEKKKKQ 476



 Score = 45.7 bits (109), Expect = 1e-05
 Identities = 23/78 (29%), Positives = 41/78 (52%), Gaps = 15/78 (19%)

Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
             KK  KK KK  +K  KK++               EE++EKKKK    KKK  ++++ K
Sbjct: 404 GSKKATKKIKKIVEKAEKKRE---------------EEKKEKKKKAFAGKKKEEEEEEEK 448

Query: 284 KKRKKKKNQHHRDVWQKK 301
           +K++++K +   +  ++K
Sbjct: 449 EKKEEEKEEEEEEAEEEK 466



 Score = 38.7 bits (91), Expect = 0.003
 Identities = 17/57 (29%), Positives = 36/57 (63%), Gaps = 1/57 (1%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTS-RGEEEEEKKKKKKKKKKKTT 277
            ++++K++KKKK    KK+++++ +        +     E EEEK+++++KKKK+ T
Sbjct: 422 KREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKKQAT 478



 Score = 28.7 bits (65), Expect = 4.6
 Identities = 12/36 (33%), Positives = 19/36 (52%), Gaps = 3/36 (8%)

Query: 261 EEEEKK---KKKKKKKKKTTKKKKRKKKRKKKKNQH 293
            EEE +     KK  KK     +K +KKR+++K + 
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEK 430



 Score = 28.0 bits (63), Expect = 7.7
 Identities = 10/37 (27%), Positives = 20/37 (54%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRD 296
           EEE E     KK  KK  K  ++ +K+++++ +  + 
Sbjct: 396 EEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKK 432


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This
           family consists of several bovine specific leukaemia
           virus receptors which are thought to function as
           transmembrane proteins, although their exact function is
           unknown.
          Length = 561

 Score = 48.9 bits (116), Expect = 1e-06
 Identities = 31/107 (28%), Positives = 45/107 (42%), Gaps = 33/107 (30%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHR-DVWINRT------------------------- 256
           K KKK KK+K+K++ K +KK+        + ++ +                         
Sbjct: 201 KPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDSPASAASVAEADEASLANTVSGTAPD 260

Query: 257 -----SRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKK--KKNQHHRD 296
                 +  E EE KK  K KKKK  K+K+ KKK+KK      HH D
Sbjct: 261 SEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHRCHHSD 307



 Score = 39.7 bits (92), Expect = 0.001
 Identities = 21/52 (40%), Positives = 29/52 (55%), Gaps = 8/52 (15%)

Query: 246 QHHRDVWINRTSRGEEEE-----EKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
           Q HR+     TS+  E+      EKK KK KKK+K  K+K+R K +KK+   
Sbjct: 176 QKHRNA---ETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEG 224



 Score = 35.0 bits (80), Expect = 0.044
 Identities = 12/35 (34%), Positives = 23/35 (65%), Gaps = 3/35 (8%)

Query: 262 EEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRD 296
           EEE++ +++ +K K  KKK+ K+KR +++   H  
Sbjct: 85  EEERRHRQRLEKDKREKKKREKEKRGRRR---HHS 116



 Score = 34.7 bits (79), Expect = 0.051
 Identities = 15/61 (24%), Positives = 32/61 (52%), Gaps = 5/61 (8%)

Query: 209 ISSSISFWIPGFVM-----KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEE 263
           I  S+   +PG  M     K +++R+ +++ +K K+ KKK+ +  R    + +   E +E
Sbjct: 64  IDLSVPLKVPGLPMSDQYVKLEEERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDE 123

Query: 264 E 264
           +
Sbjct: 124 D 124



 Score = 34.3 bits (78), Expect = 0.073
 Identities = 8/34 (23%), Positives = 23/34 (67%)

Query: 261 EEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHH 294
           EEE + +++ +K K+  KK++++K+ +++ +   
Sbjct: 85  EEERRHRQRLEKDKREKKKREKEKRGRRRHHSLG 118



 Score = 33.9 bits (77), Expect = 0.094
 Identities = 9/34 (26%), Positives = 21/34 (61%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQH 293
           EEE   +++ +K K++  K++K K+ R++  +  
Sbjct: 85  EEERRHRQRLEKDKREKKKREKEKRGRRRHHSLG 118



 Score = 33.1 bits (75), Expect = 0.18
 Identities = 12/32 (37%), Positives = 23/32 (71%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKN 291
           E++ +K KKK+KK+K+  + K +KK+ +  K+
Sbjct: 196 EKKSKKPKKKEKKEKEKERDKDKKKEVEGFKS 227



 Score = 32.7 bits (74), Expect = 0.21
 Identities = 18/42 (42%), Positives = 23/42 (54%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEE 264
           KKKK+RK+K++KKKKKK    +  H           G  EEE
Sbjct: 281 KKKKQRKEKEEKKKKKKHHHHRCHHSDGGAEQPVQNGAVEEE 322



 Score = 31.6 bits (71), Expect = 0.54
 Identities = 26/127 (20%), Positives = 48/127 (37%), Gaps = 47/127 (37%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKK------------------------------------N 245
           ++K K+ KKK++K+K+ +R+                                       N
Sbjct: 94  LEKDKREKKKREKEKRGRRRHHSLGTESDEDIAPAQMVDIVTEEMPENALPSDEDDKDPN 153

Query: 246 QHHR--DVWINRTSRGEEE---------EEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHH 294
             +R  D+ +++     E+         E  K  +K       KK K+ KK++KK+ +  
Sbjct: 154 DPYRALDIDLDKPLADSEKLPVQKHRNAETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKE 213

Query: 295 RDVWQKK 301
           RD  +KK
Sbjct: 214 RDKDKKK 220



 Score = 30.8 bits (69), Expect = 1.1
 Identities = 9/27 (33%), Positives = 17/27 (62%)

Query: 258 RGEEEEEKKKKKKKKKKKTTKKKKRKK 284
           R  +  EK K++KKK++K  + ++R  
Sbjct: 89  RHRQRLEKDKREKKKREKEKRGRRRHH 115



 Score = 30.0 bits (67), Expect = 1.6
 Identities = 10/31 (32%), Positives = 21/31 (67%)

Query: 265 KKKKKKKKKKKTTKKKKRKKKRKKKKNQHHR 295
           K +++++ +++  K K+ KKKR+K+K    R
Sbjct: 83  KLEEERRHRQRLEKDKREKKKREKEKRGRRR 113



 Score = 29.6 bits (66), Expect = 2.3
 Identities = 9/28 (32%), Positives = 19/28 (67%)

Query: 255 RTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           R  R   E++K++KKK++K+K  +++  
Sbjct: 88  RRHRQRLEKDKREKKKREKEKRGRRRHH 115



 Score = 29.6 bits (66), Expect = 2.4
 Identities = 8/28 (28%), Positives = 17/28 (60%)

Query: 258 RGEEEEEKKKKKKKKKKKTTKKKKRKKK 285
           R   +  +K K++KKK++  K+ +R+  
Sbjct: 88  RRHRQRLEKDKREKKKREKEKRGRRRHH 115



 Score = 29.6 bits (66), Expect = 2.5
 Identities = 17/59 (28%), Positives = 25/59 (42%), Gaps = 16/59 (27%)

Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           K  + ++ KK  K KK+K++K                E+EEKKKKKK    +       
Sbjct: 267 KDAEAEETKKSPKHKKKKQRK----------------EKEEKKKKKKHHHHRCHHSDGG 309


>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 142

 Score = 46.6 bits (111), Expect = 1e-06
 Identities = 21/58 (36%), Positives = 35/58 (60%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
           KK+ KR+KKK+KKKKKK+ KK N+         +    +EEE+ ++ K+++     +K
Sbjct: 85  KKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDEEEEGEEDKQEEPVEIMEK 142



 Score = 38.5 bits (90), Expect = 0.001
 Identities = 23/70 (32%), Positives = 39/70 (55%), Gaps = 19/70 (27%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           +K KK  + ++ ++K++ KK+K                 +EEK  KK+ K++K  KK+K+
Sbjct: 57  EKWKKETEDEEFQQKREEKKRK-----------------DEEKTAKKRAKRQK--KKQKK 97

Query: 283 KKKRKKKKNQ 292
           KKK+K KK  
Sbjct: 98  KKKKKAKKGN 107



 Score = 36.9 bits (86), Expect = 0.003
 Identities = 22/68 (32%), Positives = 36/68 (52%), Gaps = 20/68 (29%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           +K++++K+K ++K  KKR K+                    +KKK+KKKKKKK  K  K+
Sbjct: 70  QKREEKKRKDEEKTAKKRAKR--------------------QKKKQKKKKKKKAKKGNKK 109

Query: 283 KKKRKKKK 290
           ++K   K 
Sbjct: 110 EEKEGSKS 117



 Score = 35.8 bits (83), Expect = 0.007
 Identities = 22/72 (30%), Positives = 35/72 (48%), Gaps = 20/72 (27%)

Query: 220 FVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK 279
           F  K+++K++K ++K  KK+ K++K                    KK+KKKKKKK     
Sbjct: 68  FQQKREEKKRKDEEKTAKKRAKRQK--------------------KKQKKKKKKKAKKGN 107

Query: 280 KKRKKKRKKKKN 291
           KK +K+  K   
Sbjct: 108 KKEEKEGSKSSE 119



 Score = 35.4 bits (82), Expect = 0.012
 Identities = 18/63 (28%), Positives = 32/63 (50%), Gaps = 2/63 (3%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           K  KKR K++KKK+KK  KKKK     +    +      EE   ++++ ++ K  +  + 
Sbjct: 82  KTAKKRAKRQKKKQKK--KKKKKAKKGNKKEEKEGSKSSEESSDEEEEGEEDKQEEPVEI 139

Query: 283 KKK 285
            +K
Sbjct: 140 MEK 142



 Score = 34.3 bits (79), Expect = 0.024
 Identities = 19/72 (26%), Positives = 34/72 (47%), Gaps = 18/72 (25%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
            K ++K  KK+ K++KKK+KKKK +                  K KK  KK++K   K  
Sbjct: 77  RKDEEKTAKKRAKRQKKKQKKKKKK------------------KAKKGNKKEEKEGSKSS 118

Query: 282 RKKKRKKKKNQH 293
            +   ++++ + 
Sbjct: 119 EESSDEEEEGEE 130



 Score = 33.5 bits (77), Expect = 0.046
 Identities = 13/37 (35%), Positives = 28/37 (75%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRD 296
           EE ++K+++KK+K ++ T KK+ K+++KK+K +  + 
Sbjct: 66  EEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKK 102



 Score = 29.2 bits (66), Expect = 1.5
 Identities = 13/45 (28%), Positives = 28/45 (62%), Gaps = 12/45 (26%)

Query: 258 RGEEEE----------EKKKKKKKKK--KKTTKKKKRKKKRKKKK 290
              +E+          ++K+++KK+K  +KT KK+ +++K+K+KK
Sbjct: 53  ELMDEKWKKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKK 97



 Score = 27.3 bits (61), Expect = 6.1
 Identities = 13/41 (31%), Positives = 27/41 (65%), Gaps = 2/41 (4%)

Query: 253 INRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRK--KKKN 291
           ++   + E E+E+ ++K+++KK+  ++K  KK+ K  KKK 
Sbjct: 55  MDEKWKKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQ 95


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 45.8 bits (109), Expect = 4e-06
 Identities = 19/62 (30%), Positives = 34/62 (54%)

Query: 229 KKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKK 288
           KKKKK+  ++  K KK    +  W  +  + +++++K K KK  KK    +KK +K+ + 
Sbjct: 61  KKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAED 120

Query: 289 KK 290
           K 
Sbjct: 121 KL 122



 Score = 42.0 bits (99), Expect = 1e-04
 Identities = 20/68 (29%), Positives = 41/68 (60%), Gaps = 4/68 (5%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           KKKKK   ++ +K KK+ ++K+    +  W  + S+ +++++K KK  KK  K+ KK ++
Sbjct: 61  KKKKKELAEEIEKVKKEYEEKQ----KWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEK 116

Query: 283 KKKRKKKK 290
           + + K + 
Sbjct: 117 EAEDKLED 124



 Score = 40.1 bits (94), Expect = 4e-04
 Identities = 19/72 (26%), Positives = 32/72 (44%), Gaps = 15/72 (20%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
            ++K+K K KKKK KKKK K K                +++++K  K +KK +K  + K 
Sbjct: 78  YEEKQKWKWKKKKSKKKKDKDKD---------------KKDDKKDDKSEKKDEKEAEDKL 122

Query: 282 RKKKRKKKKNQH 293
               +   +   
Sbjct: 123 EDLTKSYSETLS 134



 Score = 38.1 bits (89), Expect = 0.002
 Identities = 22/84 (26%), Positives = 35/84 (41%), Gaps = 15/84 (17%)

Query: 216 WIPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKK 275
           W       KKKK K K KK  KK  K +K               ++E+E + K +   K 
Sbjct: 84  WKWKKKKSKKKKDKDKDKKDDKKDDKSEK---------------KDEKEAEDKLEDLTKS 128

Query: 276 TTKKKKRKKKRKKKKNQHHRDVWQ 299
            ++      + K +K   H+D++Q
Sbjct: 129 YSETLSTLSELKPRKYALHKDIYQ 152



 Score = 35.8 bits (83), Expect = 0.011
 Identities = 18/72 (25%), Positives = 37/72 (51%), Gaps = 14/72 (19%)

Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
           V K+ ++++K K KKKK K+KK K+              +++++ KK  K +KK   + +
Sbjct: 74  VKKEYEEKQKWKWKKKKSKKKKDKD--------------KDKKDDKKDDKSEKKDEKEAE 119

Query: 281 KRKKKRKKKKNQ 292
            + +   K  ++
Sbjct: 120 DKLEDLTKSYSE 131



 Score = 27.0 bits (60), Expect = 9.6
 Identities = 17/39 (43%), Positives = 21/39 (53%), Gaps = 7/39 (17%)

Query: 260 EEEEEKKKKK-------KKKKKKTTKKKKRKKKRKKKKN 291
           E  E KKKKK       K KK+   K+K + KK+K KK 
Sbjct: 56  EYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKK 94


>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
          Length = 813

 Score = 47.8 bits (114), Expect = 5e-06
 Identities = 24/93 (25%), Positives = 39/93 (41%), Gaps = 3/93 (3%)

Query: 203 NVYYALISSSISFWIPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEE 262
            + ++LISS  +    G   ++K K+    KK  K+++  KK     D       + + +
Sbjct: 719 KIDFSLISSERAPRNVGKTAREKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPK 778

Query: 263 EEKKKKKKKKKKKTTKKKKR---KKKRKKKKNQ 292
             KK  +K KK     +K     K KR  KK  
Sbjct: 779 AAKKDARKAKKPSAKTQKIAAATKAKRAAKKKV 811



 Score = 32.0 bits (73), Expect = 0.44
 Identities = 16/63 (25%), Positives = 21/63 (33%), Gaps = 20/63 (31%)

Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKK 285
           +  KK K K  KK  +K K                    K   K +K    TK K+  KK
Sbjct: 770 RGEKKAKPKAAKKDARKAK--------------------KPSAKTQKIAAATKAKRAAKK 809

Query: 286 RKK 288
           +  
Sbjct: 810 KVA 812


>gnl|CDD|217502 pfam03343, SART-1, SART-1 family.  SART-1 is a protein involved in
           cell cycle arrest and pre-mRNA splicing. It has been
           shown to be a component of U4/U6 x U5 tri-snRNP complex
           in human, Schizosaccharomyces pombe and Saccharomyces
           cerevisiae. SART-1 is a known tumour antigen in a range
           of cancers recognised by T cells.
          Length = 603

 Score = 47.1 bits (112), Expect = 6e-06
 Identities = 23/96 (23%), Positives = 42/96 (43%), Gaps = 19/96 (19%)

Query: 220 FVMKKKKKRKKKKKKKKKKKRKK----KKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKK 275
             M K KK KKKKKKKKK+++       + +       +  SR + EEE  + +   KK+
Sbjct: 270 SEMVKFKKPKKKKKKKKKRRKDLDEDELEPEAEGLGSSDSGSRKDVEEENARLEDSPKKR 329

Query: 276 TTKKKK---------------RKKKRKKKKNQHHRD 296
             +++                ++++  +KK +  R 
Sbjct: 330 KEEQEDDDFVEDDDDLQASLAKQRRLAQKKRKKLRP 365



 Score = 41.3 bits (97), Expect = 5e-04
 Identities = 21/70 (30%), Positives = 30/70 (42%), Gaps = 3/70 (4%)

Query: 219 GFVMKKKKKRKKKKKKKKKKKRKKKKNQ---HHRDVWINRTSRGEEEEEKKKKKKKKKKK 275
            F +        + +KK+++ +KK K        D      S   +  E  K KK KKKK
Sbjct: 223 LFTLDSGGSTDDEAEKKRQEVKKKLKINNVSLDDDSTETPASDYYDVSEMVKFKKPKKKK 282

Query: 276 TTKKKKRKKK 285
             KKK+RK  
Sbjct: 283 KKKKKRRKDL 292



 Score = 39.7 bits (93), Expect = 0.001
 Identities = 19/57 (33%), Positives = 28/57 (49%)

Query: 230 KKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKR 286
            +KK+++ KK+ K  N    D      +    +  +  K KK KKK  KKKKR+K  
Sbjct: 236 AEKKRQEVKKKLKINNVSLDDDSTETPASDYYDVSEMVKFKKPKKKKKKKKKRRKDL 292



 Score = 38.6 bits (90), Expect = 0.004
 Identities = 18/84 (21%), Positives = 40/84 (47%), Gaps = 17/84 (20%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEE---------KKKKKKKKK 273
           K+++ R+K  K ++K++R  K             + GE++++         K KK++KKK
Sbjct: 54  KREELREKIAKAREKRERNSKLGG--------IKTLGEDDDDDDDTKAWLKKSKKRQKKK 105

Query: 274 KKTTKKKKRKKKRKKKKNQHHRDV 297
           +   KK     +++K++   +   
Sbjct: 106 EAERKKALLLDEKEKERAAEYTSE 129



 Score = 37.0 bits (86), Expect = 0.010
 Identities = 22/77 (28%), Positives = 35/77 (45%), Gaps = 12/77 (15%)

Query: 228 RKKKKKKKKKKKRKKKKNQHHRDVWI--NRTSRGEEEEEKKKKKKKKKKKTT-------- 277
            +K+K KK  + +KKK +    D      R+   + +EE + KKKK     T        
Sbjct: 174 VEKEKDKKNLELKKKKPDYDPDDDDKFNKRSILSKYDEEIEGKKKKSDNLFTLDSGGSTD 233

Query: 278 --KKKKRKKKRKKKKNQ 292
              +KKR++ +KK K  
Sbjct: 234 DEAEKKRQEVKKKLKIN 250



 Score = 32.8 bits (75), Expect = 0.21
 Identities = 28/121 (23%), Positives = 45/121 (37%), Gaps = 44/121 (36%)

Query: 222 MKKKKKRKKKKKKKKKKK--------RKKKK------------NQHHRDVWINRTSRG-- 259
           + +K+K KK  + KKKK         +  K+             +   D      S G  
Sbjct: 173 LVEKEKDKKNLELKKKKPDYDPDDDDKFNKRSILSKYDEEIEGKKKKSDNLFTLDSGGST 232

Query: 260 -EEEEEKKKKKKKKKKKT---------------------TKKKKRKKKRKKKKNQHHRDV 297
            +E E+K+++ KKK K                         K K+ KK+KKKK +  +D+
Sbjct: 233 DDEAEKKRQEVKKKLKINNVSLDDDSTETPASDYYDVSEMVKFKKPKKKKKKKKKRRKDL 292

Query: 298 W 298
            
Sbjct: 293 D 293



 Score = 28.6 bits (64), Expect = 5.0
 Identities = 9/33 (27%), Positives = 20/33 (60%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
           EE E K+K+++ ++K    ++KR++  K    +
Sbjct: 47  EEAEAKRKREELREKIAKAREKRERNSKLGGIK 79



 Score = 27.8 bits (62), Expect = 7.8
 Identities = 20/94 (21%), Positives = 34/94 (36%), Gaps = 36/94 (38%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRG---------------------- 259
            KK++K+K+ ++KK     +K+K +       +    G                      
Sbjct: 98  SKKRQKKKEAERKKALLLDEKEKERAAEYT--SEDLAGLKVGHKVEEFEEGEDVILTLKD 155

Query: 260 ----EEEEEK--------KKKKKKKKKKTTKKKK 281
               E+E+E          +K+K KK    KKKK
Sbjct: 156 TGVLEDEDEGDELENVELVEKEKDKKNLELKKKK 189


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 46.4 bits (110), Expect = 8e-06
 Identities = 18/68 (26%), Positives = 31/68 (45%), Gaps = 7/68 (10%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           + K K+  + K K + + +KK  +  +        + EEE + K   + KKK    KKK 
Sbjct: 123 EAKAKQAAEAKAKAEAEAEKKAKEEAK-------KQAEEEAKAKAAAEAKKKAAEAKKKA 175

Query: 283 KKKRKKKK 290
           + + K K 
Sbjct: 176 EAEAKAKA 183



 Score = 43.7 bits (103), Expect = 7e-05
 Identities = 14/68 (20%), Positives = 30/68 (44%), Gaps = 8/68 (11%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           +K+ +++   +K  K+  +  K             + + EE K K+  + K K   + ++
Sbjct: 91  QKELEQRAAAEKAAKQAEQAAKQA--------EEKQKQAEEAKAKQAAEAKAKAEAEAEK 142

Query: 283 KKKRKKKK 290
           K K + KK
Sbjct: 143 KAKEEAKK 150



 Score = 41.7 bits (98), Expect = 2e-04
 Identities = 11/68 (16%), Positives = 33/68 (48%), Gaps = 10/68 (14%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           + +K+R  ++ ++K+ +++    +  +          + E+  K+ ++K+K+    K K+
Sbjct: 79  EAEKQRAAEQARQKELEQRAAAEKAAK----------QAEQAAKQAEEKQKQAEEAKAKQ 128

Query: 283 KKKRKKKK 290
             + K K 
Sbjct: 129 AAEAKAKA 136



 Score = 41.4 bits (97), Expect = 4e-04
 Identities = 15/70 (21%), Positives = 25/70 (35%), Gaps = 9/70 (12%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           + K K   + KKK  + +KK + +              E + K K ++ K K    K K 
Sbjct: 155 EAKAKAAAEAKKKAAEAKKKAEAE---------AKAKAEAKAKAKAEEAKAKAEAAKAKA 205

Query: 283 KKKRKKKKNQ 292
             +   K   
Sbjct: 206 AAEAAAKAEA 215



 Score = 40.6 bits (95), Expect = 6e-04
 Identities = 19/78 (24%), Positives = 39/78 (50%), Gaps = 12/78 (15%)

Query: 223 KKKKKRKKKKKKKKKKKRK---KKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK 279
           +K  K+ ++  K+ ++K+K   + K +   +      ++ E E EKK K++ KK+   + 
Sbjct: 101 EKAAKQAEQAAKQAEEKQKQAEEAKAKQAAE----AKAKAEAEAEKKAKEEAKKQAEEEA 156

Query: 280 KKR-----KKKRKKKKNQ 292
           K +     KKK  + K +
Sbjct: 157 KAKAAAEAKKKAAEAKKK 174



 Score = 37.9 bits (88), Expect = 0.004
 Identities = 17/69 (24%), Positives = 29/69 (42%), Gaps = 12/69 (17%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           KK K+  KK+ +++ K +   + +             +  E KKK + + K K   K K 
Sbjct: 142 KKAKEEAKKQAEEEAKAKAAAEAKK------------KAAEAKKKAEAEAKAKAEAKAKA 189

Query: 283 KKKRKKKKN 291
           K +  K K 
Sbjct: 190 KAEEAKAKA 198



 Score = 37.5 bits (87), Expect = 0.005
 Identities = 13/71 (18%), Positives = 33/71 (46%), Gaps = 2/71 (2%)

Query: 223 KKKKKRKKKKKKKKKK-KRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
           K+ ++R   +K  K+  +  K+  +  +     + ++   E + K + + +KK   + KK
Sbjct: 92  KELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAK-AKQAAEAKAKAEAEAEKKAKEEAKK 150

Query: 282 RKKKRKKKKNQ 292
           + ++  K K  
Sbjct: 151 QAEEEAKAKAA 161



 Score = 35.2 bits (81), Expect = 0.030
 Identities = 14/68 (20%), Positives = 23/68 (33%), Gaps = 15/68 (22%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           K  + +KK + + K K   K K +                EE K K +  K K   +   
Sbjct: 167 KAAEAKKKAEAEAKAKAEAKAKAK---------------AEEAKAKAEAAKAKAAAEAAA 211

Query: 283 KKKRKKKK 290
           K + +   
Sbjct: 212 KAEAEAAA 219



 Score = 35.2 bits (81), Expect = 0.036
 Identities = 13/70 (18%), Positives = 33/70 (47%), Gaps = 11/70 (15%)

Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK---K 280
           +KK   KK+++++KK  ++ +             R  E+  +K+ +++   +   K   +
Sbjct: 58  QKKPAAKKEQERQKKLEQQAEEA--------EKQRAAEQARQKELEQRAAAEKAAKQAEQ 109

Query: 281 KRKKKRKKKK 290
             K+  +K+K
Sbjct: 110 AAKQAEEKQK 119



 Score = 34.8 bits (80), Expect = 0.042
 Identities = 9/69 (13%), Positives = 32/69 (46%), Gaps = 13/69 (18%)

Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
            KK ++++KK +++  + +K +               E+ ++K+ +++       K+ ++
Sbjct: 63  AKKEQERQKKLEQQAEEAEKQR-------------AAEQARQKELEQRAAAEKAAKQAEQ 109

Query: 285 KRKKKKNQH 293
             K+ + + 
Sbjct: 110 AAKQAEEKQ 118



 Score = 34.4 bits (79), Expect = 0.051
 Identities = 11/67 (16%), Positives = 24/67 (35%), Gaps = 12/67 (17%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           K + + K K + K K K ++ K            ++ E  + K   +   K +       
Sbjct: 174 KAEAEAKAKAEAKAKAKAEEAK------------AKAEAAKAKAAAEAAAKAEAEAAAAA 221

Query: 283 KKKRKKK 289
             + ++K
Sbjct: 222 AAEAERK 228


>gnl|CDD|220695 pfam10328, 7TM_GPCR_Srx, Serpentine type 7TM GPCR chemoreceptor
           Srx.  Chemoreception is mediated in Caenorhabditis
           elegans by members of the seven-transmembrane
           G-protein-coupled receptor class (7TM GPCRs) of proteins
           which are of the serpentine type. Srx is part of the Srg
           superfamily of chemoreceptors. Chemoperception is one of
           the central senses of soil nematodes like C. elegans
           which are otherwise 'blind' and 'deaf'.
          Length = 275

 Score = 45.6 bits (109), Expect = 1e-05
 Identities = 37/208 (17%), Positives = 83/208 (39%), Gaps = 17/208 (8%)

Query: 52  IITAILG---NLLVIISVIKHRKLRIITNYFVVSLAFADLLVALCVMPFNAIVSLTDEWY 108
            + ++ G   N L+    +K   L+        + A ++ ++    + +   ++L D  +
Sbjct: 1   FLISLFGIILNWLIFYIFLKLPSLKNSFGILCANKAISNAIICTIFLFYVVPMTLLDLSF 60

Query: 109 FGYFMCDVWNSFDVYFS-TASILHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLSNVWI 167
               +        +      S L    IS++R+ A+  P +Y  I +     ++++ +WI
Sbjct: 61  LPELLNSHIGGLILLGLYEISPLTHLLISLNRFCAVFFPFKYKKIFSIKNTKIIITFIWI 120

Query: 168 LPGIISFTP-IMLGWYTT--PDHKQYRKHHPNVC-------IFVVNVYYALISSSISFWI 217
           +  IIS      LG +    P+   +       C        F+ N+   +I++ I+   
Sbjct: 121 IAIIISTLFYFPLGCHFYYSPESLTWSFDEDPPCAEIGWYGDFLKNLVLVIITNIINVIT 180

Query: 218 PGFVMKKKKKRKKKKKKKKKKKRKKKKN 245
               +K +K  KK     ++ K+++KK 
Sbjct: 181 ---FIKLRKFSKKSSLSSEESKKRRKKE 205


>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6.  The surfeit locus
           protein SURF-6 is shown to be a component of the
           nucleolar matrix and has a strong binding capacity for
           nucleic acids.
          Length = 206

 Score = 44.2 bits (105), Expect = 2e-05
 Identities = 28/71 (39%), Positives = 42/71 (59%), Gaps = 15/71 (21%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKK---KKKKTTKK 279
           KK  KRK+K+KKK KK+            W  R  + E+++ +++KK++   KK+K  KK
Sbjct: 145 KKALKRKEKQKKKSKKE------------WKERKEKVEKKKAERQKKREENLKKRKDDKK 192

Query: 280 KKRKKKRKKKK 290
            K+KKK KKK 
Sbjct: 193 NKKKKKAKKKG 203



 Score = 41.5 bits (98), Expect = 2e-04
 Identities = 26/68 (38%), Positives = 39/68 (57%), Gaps = 11/68 (16%)

Query: 223 KKKKKRKKKKKKK-KKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
           K+K+K+KKK KK+ K++K K +K +  R           ++ E+  KK+K  KK  KKKK
Sbjct: 149 KRKEKQKKKSKKEWKERKEKVEKKKAERQ----------KKREENLKKRKDDKKNKKKKK 198

Query: 282 RKKKRKKK 289
            KKK +  
Sbjct: 199 AKKKGRIL 206



 Score = 37.7 bits (88), Expect = 0.003
 Identities = 22/85 (25%), Positives = 39/85 (45%), Gaps = 20/85 (23%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKK--------- 273
           +K+++RK +KK+K+K+ +KK+  Q           + EE + KKK    +          
Sbjct: 13  RKREQRKARKKQKRKEAKKKEDAQKSEA----EEVKNEENKSKKKAAPIENAEGNIVFSK 68

Query: 274 -------KKTTKKKKRKKKRKKKKN 291
                  +     K +KKK+KKK +
Sbjct: 69  VEFADGEQAKKDLKLKKKKKKKKTD 93



 Score = 36.5 bits (85), Expect = 0.007
 Identities = 23/80 (28%), Positives = 42/80 (52%), Gaps = 13/80 (16%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEK----------KKKKKKK 272
           KK + RKKK ++  + K  + +    ++ W    ++ E  + K          K+K+K+K
Sbjct: 99  KKLEARKKKLEELDEDKAAEIEE---KEKWTKALAKAEGVKVKDDEKLLKKALKRKEKQK 155

Query: 273 KKKTTKKKKRKKKRKKKKNQ 292
           KK   + K+RK+K +KKK +
Sbjct: 156 KKSKKEWKERKEKVEKKKAE 175



 Score = 35.4 bits (82), Expect = 0.020
 Identities = 11/32 (34%), Positives = 24/32 (75%)

Query: 261 EEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
           E   +++++K++++K  KK+KRK+ +KK+  Q
Sbjct: 5   EALLEQRRRKREQRKARKKQKRKEAKKKEDAQ 36



 Score = 35.4 bits (82), Expect = 0.022
 Identities = 20/79 (25%), Positives = 36/79 (45%), Gaps = 11/79 (13%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKK-----------KKK 270
            K +KK+K+K+ KKK+  +K +  +   +   ++      E  +             ++ 
Sbjct: 18  RKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAEGNIVFSKVEFADGEQA 77

Query: 271 KKKKKTTKKKKRKKKRKKK 289
           KK  K  KKKK+KK   K+
Sbjct: 78  KKDLKLKKKKKKKKTDYKQ 96



 Score = 34.2 bits (79), Expect = 0.046
 Identities = 26/68 (38%), Positives = 37/68 (54%), Gaps = 14/68 (20%)

Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
             KKK K++ K++K+K +K+K ++ Q  R           EE  KK+K  KK KK  KKK
Sbjct: 153 KQKKKSKKEWKERKEKVEKKKAER-QKKR-----------EENLKKRKDDKKNKK--KKK 198

Query: 281 KRKKKRKK 288
            +KK R  
Sbjct: 199 AKKKGRIL 206



 Score = 31.5 bits (72), Expect = 0.33
 Identities = 26/91 (28%), Positives = 41/91 (45%), Gaps = 22/91 (24%)

Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTK-- 278
               ++ +K  K KKKKKK+K    Q  + +   R  + EE +E K  + ++K+K TK  
Sbjct: 71  FADGEQAKKDLKLKKKKKKKKTDYKQLLKKL-EARKKKLEELDEDKAAEIEEKEKWTKAL 129

Query: 279 -------------------KKKRKKKRKKKK 290
                              K+K K+K+K KK
Sbjct: 130 AKAEGVKVKDDEKLLKKALKRKEKQKKKSKK 160


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 45.9 bits (108), Expect = 2e-05
 Identities = 16/79 (20%), Positives = 38/79 (48%)

Query: 223  KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
            + KK  +  K + +    + +  +   +    +    +++ +  KKK ++KKK  + KK+
Sbjct: 1340 EAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKK 1399

Query: 283  KKKRKKKKNQHHRDVWQKK 301
             ++ KKK ++  +    KK
Sbjct: 1400 AEEDKKKADELKKAAAAKK 1418



 Score = 42.8 bits (100), Expect = 2e-04
 Identities = 20/80 (25%), Positives = 38/80 (47%), Gaps = 7/80 (8%)

Query: 222  MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
             KKK +  KK  +  K + +   ++          +  + E  +KKK++ KKK    KKK
Sbjct: 1334 AKKKAEEAKKAAEAAKAEAEAAADEA-------EAAEEKAEAAEKKKEEAKKKADAAKKK 1386

Query: 282  RKKKRKKKKNQHHRDVWQKK 301
             ++K+K  + +   +  +KK
Sbjct: 1387 AEEKKKADEAKKKAEEDKKK 1406



 Score = 42.4 bits (99), Expect = 2e-04
 Identities = 15/66 (22%), Positives = 33/66 (50%)

Query: 223  KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
                + +  ++K +  ++KK++ +   D    +    ++ +E KKK ++ KKK  + KK 
Sbjct: 1354 AAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKA 1413

Query: 283  KKKRKK 288
               +KK
Sbjct: 1414 AAAKKK 1419



 Score = 42.4 bits (99), Expect = 3e-04
 Identities = 16/74 (21%), Positives = 30/74 (40%), Gaps = 3/74 (4%)

Query: 222  MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
             KKK +  KK  + KKK  + KK     D    +    ++  E  K + +      +  +
Sbjct: 1307 AKKKAEEAKKADEAKKKAEEAKKKA---DAAKKKAEEAKKAAEAAKAEAEAAADEAEAAE 1363

Query: 282  RKKKRKKKKNQHHR 295
             K +  +KK +  +
Sbjct: 1364 EKAEAAEKKKEEAK 1377



 Score = 41.7 bits (97), Expect = 4e-04
 Identities = 24/83 (28%), Positives = 41/83 (49%), Gaps = 4/83 (4%)

Query: 223  KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK- 281
            +KK     KK+ ++ KK ++ K +   +       +  EEE K K ++ KK+    KKK 
Sbjct: 1687 EKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKA 1746

Query: 282  ---RKKKRKKKKNQHHRDVWQKK 301
               +K + +KKK  H +   +KK
Sbjct: 1747 EEAKKDEEEKKKIAHLKKEEEKK 1769



 Score = 41.3 bits (96), Expect = 6e-04
 Identities = 21/76 (27%), Positives = 41/76 (53%), Gaps = 4/76 (5%)

Query: 221  VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKK----KKKKKKKKKT 276
              KK ++ KKK+ ++KKK  + KK +    +     ++  EE++KK    KK ++ +KK 
Sbjct: 1631 EKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKA 1690

Query: 277  TKKKKRKKKRKKKKNQ 292
             +  K++ +  KK  +
Sbjct: 1691 AEALKKEAEEAKKAEE 1706



 Score = 40.9 bits (95), Expect = 7e-04
 Identities = 17/67 (25%), Positives = 31/67 (46%), Gaps = 7/67 (10%)

Query: 222  MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
             KKK +  KKK    KKK ++ K            ++ E E    + +  ++K    +KK
Sbjct: 1320 AKKKAEEAKKKADAAKKKAEEAKKA-------AEAAKAEAEAAADEAEAAEEKAEAAEKK 1372

Query: 282  RKKKRKK 288
            +++ +KK
Sbjct: 1373 KEEAKKK 1379



 Score = 40.5 bits (94), Expect = 0.001
 Identities = 15/79 (18%), Positives = 37/79 (46%)

Query: 223  KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
            +      +  ++K +   KKK+    +     + +  +++ ++ KKK ++ KK   + K+
Sbjct: 1353 EAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKK 1412

Query: 283  KKKRKKKKNQHHRDVWQKK 301
                KKK ++  +   +KK
Sbjct: 1413 AAAAKKKADEAKKKAEEKK 1431



 Score = 40.1 bits (93), Expect = 0.001
 Identities = 24/78 (30%), Positives = 39/78 (50%), Gaps = 8/78 (10%)

Query: 222  MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
            +KK    KKK  + KKK  +KKK    +     +    ++ +E KKK ++ KK    KKK
Sbjct: 1410 LKKAAAAKKKADEAKKKAEEKKKADEAK----KKAEEAKKADEAKKKAEEAKKAEEAKKK 1465

Query: 282  RKKKRK----KKKNQHHR 295
             ++ +K    KKK +  +
Sbjct: 1466 AEEAKKADEAKKKAEEAK 1483



 Score = 40.1 bits (93), Expect = 0.001
 Identities = 23/73 (31%), Positives = 41/73 (56%)

Query: 223  KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
            KK ++ KKK+ ++KKK  + KK +    +      +  EE++KK ++ KK ++  KK   
Sbjct: 1702 KKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAH 1761

Query: 283  KKKRKKKKNQHHR 295
             KK ++KK +  R
Sbjct: 1762 LKKEEEKKAEEIR 1774



 Score = 39.7 bits (92), Expect = 0.002
 Identities = 24/82 (29%), Positives = 41/82 (50%), Gaps = 9/82 (10%)

Query: 222  MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEE--EKKKKKKKKKKKTTKK 279
             KKK +  KK  + KKK  + KK +           + EE +  ++ KKK ++ KK  + 
Sbjct: 1436 AKKKAEEAKKADEAKKKAEEAKKAE-------EAKKKAEEAKKADEAKKKAEEAKKADEA 1488

Query: 280  KKRKKKRKKKKNQHHRDVWQKK 301
            KK+ ++ KKK ++  +    KK
Sbjct: 1489 KKKAEEAKKKADEAKKAAEAKK 1510



 Score = 39.7 bits (92), Expect = 0.002
 Identities = 15/70 (21%), Positives = 32/70 (45%), Gaps = 4/70 (5%)

Query: 223  KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
            KK  + KKK ++ KK    KKK +  +     +    +++ E+ KK  +  K   +    
Sbjct: 1302 KKADEAKKKAEEAKKADEAKKKAEEAK----KKADAAKKKAEEAKKAAEAAKAEAEAAAD 1357

Query: 283  KKKRKKKKNQ 292
            + +  ++K +
Sbjct: 1358 EAEAAEEKAE 1367



 Score = 39.4 bits (91), Expect = 0.002
 Identities = 24/81 (29%), Positives = 36/81 (44%), Gaps = 11/81 (13%)

Query: 223  KKKKKRKKKK--KKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
            KKK    KKK  + KK  +  K + +   D         E E  ++K +  +KKK   KK
Sbjct: 1328 KKKADAAKKKAEEAKKAAEAAKAEAEAAAD---------EAEAAEEKAEAAEKKKEEAKK 1378

Query: 281  KRKKKRKKKKNQHHRDVWQKK 301
            K    +KK + +   D  +KK
Sbjct: 1379 KADAAKKKAEEKKKADEAKKK 1399



 Score = 39.4 bits (91), Expect = 0.002
 Identities = 24/80 (30%), Positives = 40/80 (50%), Gaps = 4/80 (5%)

Query: 222  MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
             KKK + KKK  + KKK  + KK    +     +    ++ EE KKK ++ KK    KKK
Sbjct: 1423 AKKKAEEKKKADEAKKKAEEAKKADEAK----KKAEEAKKAEEAKKKAEEAKKADEAKKK 1478

Query: 282  RKKKRKKKKNQHHRDVWQKK 301
             ++ +K  + +   +  +KK
Sbjct: 1479 AEEAKKADEAKKKAEEAKKK 1498



 Score = 39.4 bits (91), Expect = 0.002
 Identities = 17/71 (23%), Positives = 31/71 (43%), Gaps = 3/71 (4%)

Query: 223  KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
            KK ++ KK  + KKK +  KKK    +          E  + + +    + +   +K + 
Sbjct: 1309 KKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEA 1368

Query: 283  KKKRK---KKK 290
             +K+K   KKK
Sbjct: 1369 AEKKKEEAKKK 1379



 Score = 39.4 bits (91), Expect = 0.002
 Identities = 21/84 (25%), Positives = 44/84 (52%), Gaps = 3/84 (3%)

Query: 221  VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEE---EEKKKKKKKKKKKTT 277
            + K ++++KK ++ KKK+  +KKK +  +        +  EE    E+ KKK ++ KK  
Sbjct: 1625 LKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAE 1684

Query: 278  KKKKRKKKRKKKKNQHHRDVWQKK 301
            + +K+  +  KK+ +  +   + K
Sbjct: 1685 EDEKKAAEALKKEAEEAKKAEELK 1708



 Score = 39.0 bits (90), Expect = 0.003
 Identities = 20/81 (24%), Positives = 40/81 (49%), Gaps = 1/81 (1%)

Query: 222  MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEE-EEKKKKKKKKKKKTTKKK 280
             +  +++ +  +KKK++ +KK      +     +    +++ EE KKK  + KK    KK
Sbjct: 1359 AEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKK 1418

Query: 281  KRKKKRKKKKNQHHRDVWQKK 301
            K  + +KK + +   D  +KK
Sbjct: 1419 KADEAKKKAEEKKKADEAKKK 1439



 Score = 38.6 bits (89), Expect = 0.004
 Identities = 25/81 (30%), Positives = 38/81 (46%), Gaps = 8/81 (9%)

Query: 223  KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEE----EEKKKKKKKKKKKTTK 278
            KKK    KKK ++KKK  + KK         +   +        +E KKK ++KKK    
Sbjct: 1377 KKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEA 1436

Query: 279  KKKRKKKRK----KKKNQHHR 295
            KKK ++ +K    KKK +  +
Sbjct: 1437 KKKAEEAKKADEAKKKAEEAK 1457



 Score = 38.6 bits (89), Expect = 0.004
 Identities = 23/79 (29%), Positives = 38/79 (48%), Gaps = 5/79 (6%)

Query: 223  KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
            KK  + KK    KKK    KKK +  +     +    +++ E+ KK  + KKK  + KK 
Sbjct: 1405 KKADELKKAAAAKKKADEAKKKAEEKK-----KADEAKKKAEEAKKADEAKKKAEEAKKA 1459

Query: 283  KKKRKKKKNQHHRDVWQKK 301
            ++ +KK +     D  +KK
Sbjct: 1460 EEAKKKAEEAKKADEAKKK 1478



 Score = 38.6 bits (89), Expect = 0.004
 Identities = 21/72 (29%), Positives = 34/72 (47%), Gaps = 6/72 (8%)

Query: 223  KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKK----KKKKKKKKKTTK 278
            KKK    KK ++KKK    KKK +  +    +   +  EE +KK    KKK ++ KK  +
Sbjct: 1289 KKKADEAKKAEEKKKADEAKKKAEEAKKA--DEAKKKAEEAKKKADAAKKKAEEAKKAAE 1346

Query: 279  KKKRKKKRKKKK 290
              K + +    +
Sbjct: 1347 AAKAEAEAAADE 1358



 Score = 38.6 bits (89), Expect = 0.004
 Identities = 22/74 (29%), Positives = 36/74 (48%)

Query: 222  MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
             KK  + KKK  + KK +  KK ++  +     +    ++ EEKKK  + KK +  KK +
Sbjct: 1502 AKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAE 1561

Query: 282  RKKKRKKKKNQHHR 295
             KKK ++ K     
Sbjct: 1562 EKKKAEEAKKAEED 1575



 Score = 38.2 bits (88), Expect = 0.005
 Identities = 21/67 (31%), Positives = 35/67 (52%), Gaps = 4/67 (5%)

Query: 222  MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
             KKK +  KK ++ KKK  + KK    +     +    ++ +E KKK ++ KKK  + KK
Sbjct: 1449 AKKKAEEAKKAEEAKKKAEEAKKADEAK----KKAEEAKKADEAKKKAEEAKKKADEAKK 1504

Query: 282  RKKKRKK 288
              + +KK
Sbjct: 1505 AAEAKKK 1511



 Score = 37.8 bits (87), Expect = 0.006
 Identities = 25/82 (30%), Positives = 41/82 (50%), Gaps = 2/82 (2%)

Query: 222  MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGE--EEEEKKKKKKKKKKKTTKK 279
             KK++ +KK    KKK + KKK ++  +    ++    E  +    KKK  + KKK  +K
Sbjct: 1371 KKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEK 1430

Query: 280  KKRKKKRKKKKNQHHRDVWQKK 301
            KK  + +KK +     D  +KK
Sbjct: 1431 KKADEAKKKAEEAKKADEAKKK 1452



 Score = 37.8 bits (87), Expect = 0.007
 Identities = 23/77 (29%), Positives = 38/77 (49%), Gaps = 8/77 (10%)

Query: 223  KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
            KK ++ KKK  + KK    KKK    +     +    ++ +E KKK ++ KK    KKK 
Sbjct: 1398 KKAEEDKKKADELKKAAAAKKKADEAK----KKAEEKKKADEAKKKAEEAKKADEAKKKA 1453

Query: 283  KKKRK----KKKNQHHR 295
            ++ +K    KKK +  +
Sbjct: 1454 EEAKKAEEAKKKAEEAK 1470



 Score = 37.4 bits (86), Expect = 0.009
 Identities = 24/83 (28%), Positives = 42/83 (50%), Gaps = 8/83 (9%)

Query: 223  KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK-- 280
             +  ++KK++ KKK    KKK  +  +       ++ + EE+KKK  + KK    KKK  
Sbjct: 1366 AEAAEKKKEEAKKKADAAKKKAEEKKK----ADEAKKKAEEDKKKADELKKAAAAKKKAD 1421

Query: 281  --KRKKKRKKKKNQHHRDVWQKK 301
              K+K + KKK ++  +   + K
Sbjct: 1422 EAKKKAEEKKKADEAKKKAEEAK 1444



 Score = 37.0 bits (85), Expect = 0.011
 Identities = 21/75 (28%), Positives = 39/75 (52%), Gaps = 4/75 (5%)

Query: 222  MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK- 280
            +K ++ RK  + KK ++K+K  + +   +      ++ + EE KK  + KKK +  KKK 
Sbjct: 1272 IKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKA 1331

Query: 281  ---KRKKKRKKKKNQ 292
               K+K +  KK  +
Sbjct: 1332 DAAKKKAEEAKKAAE 1346



 Score = 36.7 bits (84), Expect = 0.017
 Identities = 21/77 (27%), Positives = 38/77 (49%), Gaps = 4/77 (5%)

Query: 223  KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK-- 280
            KKK +  KK ++ +KK  +  K +           + E EE+KK ++ KK ++  K K  
Sbjct: 1674 KKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAE 1733

Query: 281  --KRKKKRKKKKNQHHR 295
              K++ +  KKK +  +
Sbjct: 1734 EAKKEAEEDKKKAEEAK 1750



 Score = 36.7 bits (84), Expect = 0.018
 Identities = 22/77 (28%), Positives = 39/77 (50%), Gaps = 4/77 (5%)

Query: 223  KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK-- 280
            KKK +  KK +++ K K  ++  +   D      ++  EE+EKK  +  KK+    KK  
Sbjct: 1646 KKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAE 1705

Query: 281  --KRKKKRKKKKNQHHR 295
              K+K+  +KKK +  +
Sbjct: 1706 ELKKKEAEEKKKAEELK 1722



 Score = 36.3 bits (83), Expect = 0.021
 Identities = 25/84 (29%), Positives = 40/84 (47%), Gaps = 8/84 (9%)

Query: 223  KKKKKRKKKKKKKKKKKRK--KKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
            + KK  + KKK ++ KK +  KKK +  +       ++ + EE KK  + KKK +  KKK
Sbjct: 1442 EAKKADEAKKKAEEAKKAEEAKKKAEEAKKA---DEAKKKAEEAKKADEAKKKAEEAKKK 1498

Query: 281  K---RKKKRKKKKNQHHRDVWQKK 301
                +K    KKK    +   + K
Sbjct: 1499 ADEAKKAAEAKKKADEAKKAEEAK 1522



 Score = 35.9 bits (82), Expect = 0.028
 Identities = 18/66 (27%), Positives = 33/66 (50%), Gaps = 4/66 (6%)

Query: 222  MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
             KKK +  KK  + KKK  + KK    +     +    +++ ++ KK  + KKK  + KK
Sbjct: 1462 AKKKAEEAKKADEAKKKAEEAKKADEAK----KKAEEAKKKADEAKKAAEAKKKADEAKK 1517

Query: 282  RKKKRK 287
             ++ +K
Sbjct: 1518 AEEAKK 1523



 Score = 35.5 bits (81), Expect = 0.035
 Identities = 15/78 (19%), Positives = 34/78 (43%), Gaps = 6/78 (7%)

Query: 222  MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
             KK +++KK  + KKK +  KK ++  +       ++ + +  KKK ++ KK     K +
Sbjct: 1295 AKKAEEKKKADEAKKKAEEAKKADEAKKKA---EEAKKKADAAKKKAEEAKKAAEAAKAE 1351

Query: 282  ---RKKKRKKKKNQHHRD 296
                  + +  + +    
Sbjct: 1352 AEAAADEAEAAEEKAEAA 1369



 Score = 35.5 bits (81), Expect = 0.039
 Identities = 20/69 (28%), Positives = 39/69 (56%), Gaps = 3/69 (4%)

Query: 222  MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
            +K  ++ KK ++ KKK +  KK  +  +       +  +E EE KK ++ KKK+  +KKK
Sbjct: 1661 IKAAEEAKKAEEDKKKAEEAKKAEEDEKK---AAEALKKEAEEAKKAEELKKKEAEEKKK 1717

Query: 282  RKKKRKKKK 290
             ++ +K ++
Sbjct: 1718 AEELKKAEE 1726



 Score = 35.5 bits (81), Expect = 0.040
 Identities = 23/86 (26%), Positives = 38/86 (44%), Gaps = 12/86 (13%)

Query: 223  KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
            KK ++ KKK  + KK    KKK    +     +    ++ +E KK ++ KK    KK + 
Sbjct: 1490 KKAEEAKKKADEAKKAAEAKKKADEAK-----KAEEAKKADEAKKAEEAKKADEAKKAEE 1544

Query: 283  KKKRKK-------KKNQHHRDVWQKK 301
            KKK  +       KK +  +   + K
Sbjct: 1545 KKKADELKKAEELKKAEEKKKAEEAK 1570



 Score = 34.7 bits (79), Expect = 0.064
 Identities = 21/73 (28%), Positives = 35/73 (47%)

Query: 223  KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
            KKK    KK ++ KK    KK  +  +     +    ++ +E KK ++ KK +  KK + 
Sbjct: 1509 KKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEE 1568

Query: 283  KKKRKKKKNQHHR 295
             KK ++ KN   R
Sbjct: 1569 AKKAEEDKNMALR 1581



 Score = 34.3 bits (78), Expect = 0.075
 Identities = 20/77 (25%), Positives = 38/77 (49%), Gaps = 3/77 (3%)

Query: 222  MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEE---EEEKKKKKKKKKKKTTK 278
             KK ++ K K ++ KK + +KKK +  +        + EE    EE+ K K  ++ K  +
Sbjct: 1612 AKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAE 1671

Query: 279  KKKRKKKRKKKKNQHHR 295
            + K+K +  KK  +  +
Sbjct: 1672 EDKKKAEEAKKAEEDEK 1688



 Score = 34.3 bits (78), Expect = 0.080
 Identities = 21/79 (26%), Positives = 34/79 (43%), Gaps = 4/79 (5%)

Query: 223  KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
            + +K  + KK ++KKK  + KK +  +          ++ EE KK  + KKK    KKK 
Sbjct: 1276 EARKADELKKAEEKKKADEAKKAEEKKKA----DEAKKKAEEAKKADEAKKKAEEAKKKA 1331

Query: 283  KKKRKKKKNQHHRDVWQKK 301
               +KK +         K 
Sbjct: 1332 DAAKKKAEEAKKAAEAAKA 1350



 Score = 34.3 bits (78), Expect = 0.094
 Identities = 23/81 (28%), Positives = 39/81 (48%), Gaps = 6/81 (7%)

Query: 223  KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK-- 280
            K ++ +KK ++ KK  + KKK  +  +       ++ + EE KKK  + KK    KKK  
Sbjct: 1458 KAEEAKKKAEEAKKADEAKKKAEEAKK----ADEAKKKAEEAKKKADEAKKAAEAKKKAD 1513

Query: 281  KRKKKRKKKKNQHHRDVWQKK 301
            + KK  + KK    +   + K
Sbjct: 1514 EAKKAEEAKKADEAKKAEEAK 1534



 Score = 34.3 bits (78), Expect = 0.095
 Identities = 21/76 (27%), Positives = 36/76 (47%), Gaps = 2/76 (2%)

Query: 223  KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK- 281
            KK  + KK  + KKK    KK  +  +     +    ++ +E KK ++KKK    KK + 
Sbjct: 1497 KKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEE 1556

Query: 282  -RKKKRKKKKNQHHRD 296
             +K + KKK  +  + 
Sbjct: 1557 LKKAEEKKKAEEAKKA 1572



 Score = 34.3 bits (78), Expect = 0.097
 Identities = 23/84 (27%), Positives = 40/84 (47%), Gaps = 10/84 (11%)

Query: 222  MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEE----EEKKKKKKKKKKKTT 277
              KK +  KKK ++ KK  + +K             +  EE    EE KKK+ ++KKK  
Sbjct: 1666 EAKKAEEDKKKAEEAKKAEEDEKKAAE------ALKKEAEEAKKAEELKKKEAEEKKKAE 1719

Query: 278  KKKKRKKKRKKKKNQHHRDVWQKK 301
            + KK +++ K K  +  ++  + K
Sbjct: 1720 ELKKAEEENKIKAEEAKKEAEEDK 1743



 Score = 34.0 bits (77), Expect = 0.13
 Identities = 18/70 (25%), Positives = 38/70 (54%)

Query: 223  KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
            KK K  + KK ++ K K ++ K        + +  + E EE+KK ++ KK ++  K K  
Sbjct: 1605 KKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAA 1664

Query: 283  KKKRKKKKNQ 292
            ++ +K ++++
Sbjct: 1665 EEAKKAEEDK 1674



 Score = 32.4 bits (73), Expect = 0.35
 Identities = 19/78 (24%), Positives = 37/78 (47%)

Query: 224  KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
            KK +  KK  + KK +  KK ++  +     +    ++ EE KK ++KKK +  KK +  
Sbjct: 1516 KKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEED 1575

Query: 284  KKRKKKKNQHHRDVWQKK 301
            K    +K +  +   + +
Sbjct: 1576 KNMALRKAEEAKKAEEAR 1593



 Score = 32.4 bits (73), Expect = 0.36
 Identities = 18/67 (26%), Positives = 32/67 (47%)

Query: 224  KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
            KK   + KK ++++   + +K +  R     R     + EE +K  + KK +  KK    
Sbjct: 1236 KKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEA 1295

Query: 284  KKRKKKK 290
            KK ++KK
Sbjct: 1296 KKAEEKK 1302



 Score = 32.4 bits (73), Expect = 0.39
 Identities = 16/66 (24%), Positives = 37/66 (56%)

Query: 225  KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
            KK  + KK ++KKK  + KK +  +++ + +    ++ EE + ++  K  +  KK K ++
Sbjct: 1552 KKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEE 1611

Query: 285  KRKKKK 290
             +K ++
Sbjct: 1612 AKKAEE 1617



 Score = 31.6 bits (71), Expect = 0.58
 Identities = 26/84 (30%), Positives = 44/84 (52%), Gaps = 8/84 (9%)

Query: 226  KKRKKKKKKKKKKKRKKKKNQHHRDVWINRT---SRGEE-----EEEKKKKKKKKKKKTT 277
            KK ++KKK ++ KK ++ KN   R     +    +R EE     EEEKK K ++ KK   
Sbjct: 1558 KKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEE 1617

Query: 278  KKKKRKKKRKKKKNQHHRDVWQKK 301
             K K ++ +K ++ +   +  +KK
Sbjct: 1618 AKIKAEELKKAEEEKKKVEQLKKK 1641



 Score = 31.3 bits (70), Expect = 0.76
 Identities = 20/73 (27%), Positives = 34/73 (46%)

Query: 223  KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
             KK    KK ++ KK    KK  +  +   + +    ++ EEKKK ++ KK +  K    
Sbjct: 1521 AKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMAL 1580

Query: 283  KKKRKKKKNQHHR 295
            +K  + KK +  R
Sbjct: 1581 RKAEEAKKAEEAR 1593



 Score = 31.3 bits (70), Expect = 0.85
 Identities = 19/79 (24%), Positives = 42/79 (53%)

Query: 223  KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
            ++ +  +  K  +++KK K ++ +   +  I      + EEEKKK ++ KKK+  +KKK 
Sbjct: 1590 EEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKA 1649

Query: 283  KKKRKKKKNQHHRDVWQKK 301
            ++ +K ++    +   + K
Sbjct: 1650 EELKKAEEENKIKAAEEAK 1668



 Score = 30.5 bits (68), Expect = 1.5
 Identities = 17/70 (24%), Positives = 37/70 (52%)

Query: 223  KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
            K    RK ++ KK ++ R ++  + + +    +    ++ EE K K ++ KK   +KKK 
Sbjct: 1576 KNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKV 1635

Query: 283  KKKRKKKKNQ 292
            ++ +KK+  +
Sbjct: 1636 EQLKKKEAEE 1645



 Score = 30.1 bits (67), Expect = 1.7
 Identities = 20/79 (25%), Positives = 39/79 (49%)

Query: 223  KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
            ++KKK ++ KK ++ K    +K +  +     R     +  E++KK K ++ K  ++ K 
Sbjct: 1561 EEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKI 1620

Query: 283  KKKRKKKKNQHHRDVWQKK 301
            K +  KK  +  + V Q K
Sbjct: 1621 KAEELKKAEEEKKKVEQLK 1639



 Score = 30.1 bits (67), Expect = 1.7
 Identities = 19/79 (24%), Positives = 36/79 (45%), Gaps = 1/79 (1%)

Query: 224  KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
            KK +  KK  ++ KK  +++ N+  R     R +     +   K ++ +K    KK + K
Sbjct: 1230 KKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEK 1289

Query: 284  KKRKK-KKNQHHRDVWQKK 301
            KK  + KK +  +   + K
Sbjct: 1290 KKADEAKKAEEKKKADEAK 1308



 Score = 30.1 bits (67), Expect = 1.8
 Identities = 22/85 (25%), Positives = 43/85 (50%), Gaps = 6/85 (7%)

Query: 223  KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
            KKK +  KK +++ K K ++ K +   D      ++ +EEE+KK    KK+++   ++ R
Sbjct: 1715 KKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIR 1774

Query: 283  KKKRK------KKKNQHHRDVWQKK 301
            K+K         ++++  R    KK
Sbjct: 1775 KEKEAVIEEELDEEDEKRRMEVDKK 1799



 Score = 30.1 bits (67), Expect = 2.1
 Identities = 23/87 (26%), Positives = 43/87 (49%), Gaps = 12/87 (13%)

Query: 221  VMKKKKKRKKKKKKKKKKKRKK----KKNQHHRDVWINRTSRGEEEEEKKK----KKKKK 272
            V K ++ +K  ++ KK ++ +     +K +  R     R     + EE +K    KK ++
Sbjct: 1229 VKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEE 1288

Query: 273  KKKTTKKKKRKKKRK----KKKNQHHR 295
            KKK  + KK ++K+K    KKK +  +
Sbjct: 1289 KKKADEAKKAEEKKKADEAKKKAEEAK 1315



 Score = 29.7 bits (66), Expect = 2.2
 Identities = 19/74 (25%), Positives = 39/74 (52%), Gaps = 4/74 (5%)

Query: 222  MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
            MK ++ +K ++ K K ++ KK + +  +   + +    +E EEKKK ++ KK +   K K
Sbjct: 1607 MKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKK----KEAEEKKKAEELKKAEEENKIK 1662

Query: 282  RKKKRKKKKNQHHR 295
              ++ KK +    +
Sbjct: 1663 AAEEAKKAEEDKKK 1676



 Score = 29.7 bits (66), Expect = 2.4
 Identities = 18/75 (24%), Positives = 43/75 (57%), Gaps = 11/75 (14%)

Query: 223  KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
            K+ ++ KKK ++ KK + +KKK  H +           +EEEKK ++ +K+K+   +++ 
Sbjct: 1737 KEAEEDKKKAEEAKKDEEEKKKIAHLK-----------KEEEKKAEEIRKEKEAVIEEEL 1785

Query: 283  KKKRKKKKNQHHRDV 297
             ++ +K++ +  + +
Sbjct: 1786 DEEDEKRRMEVDKKI 1800



 Score = 29.3 bits (65), Expect = 2.9
 Identities = 18/59 (30%), Positives = 31/59 (52%)

Query: 223  KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
            KKK +  KK +++KKK    KK +  +   I +      EEE  ++ +K++ +  KK K
Sbjct: 1743 KKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIK 1801



 Score = 29.3 bits (65), Expect = 3.0
 Identities = 18/78 (23%), Positives = 38/78 (48%)

Query: 224  KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
            KK +  KK  + KK + KKK ++  +   + +    ++ EE KK ++ K     K ++ K
Sbjct: 1528 KKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAK 1587

Query: 284  KKRKKKKNQHHRDVWQKK 301
            K  + +  +  +   ++K
Sbjct: 1588 KAEEARIEEVMKLYEEEK 1605



 Score = 29.0 bits (64), Expect = 4.5
 Identities = 16/70 (22%), Positives = 37/70 (52%), Gaps = 2/70 (2%)

Query: 226  KKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKK 285
            KK ++ KK ++KKK ++ K          R  + EE ++ ++ + ++  K  +++K+ K 
Sbjct: 1552 KKAEELKKAEEKKKAEEAKKAEEDKNMALR--KAEEAKKAEEARIEEVMKLYEEEKKMKA 1609

Query: 286  RKKKKNQHHR 295
             + KK +  +
Sbjct: 1610 EEAKKAEEAK 1619



 Score = 29.0 bits (64), Expect = 4.6
 Identities = 18/68 (26%), Positives = 37/68 (54%)

Query: 223  KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
            +KKK  + KK ++ KK  +KKK +  +    ++     + EE KK ++ + ++  K  + 
Sbjct: 1544 EKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEE 1603

Query: 283  KKKRKKKK 290
            +KK K ++
Sbjct: 1604 EKKMKAEE 1611


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 44.3 bits (105), Expect = 3e-05
 Identities = 24/85 (28%), Positives = 40/85 (47%), Gaps = 16/85 (18%)

Query: 220 FVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVW--------INRTSRG------EEEEEK 265
              KKKK     K  K    R KKK++  R  W          ++SR       E   E+
Sbjct: 99  TKKKKKKDPTAAKSPKAAAPRPKKKSE--RISWAPTLLDSPRRKSSRSSTVQNKEATHER 156

Query: 266 KKKKKKKKKKTTKKKKRKKKRKKKK 290
            K+++ ++KK   K +++K++KK+K
Sbjct: 157 LKEREIRRKKIQAKARKRKEKKKEK 181



 Score = 43.5 bits (103), Expect = 5e-05
 Identities = 23/109 (21%), Positives = 39/109 (35%), Gaps = 35/109 (32%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKK--------- 272
           +KKKK+ K K  K+  KK+KKK                +    + KKK ++         
Sbjct: 84  LKKKKRVKTKAYKEPTKKKKKKDPT--------AAKSPKAAAPRPKKKSERISWAPTLLD 135

Query: 273 ------KKKTTKKKK------------RKKKRKKKKNQHHRDVWQKKYT 303
                  + +T + K            R+KK + K  +      +K+ T
Sbjct: 136 SPRRKSSRSSTVQNKEATHERLKEREIRRKKIQAKARKRKEKKKEKELT 184



 Score = 42.4 bits (100), Expect = 1e-04
 Identities = 17/71 (23%), Positives = 27/71 (38%), Gaps = 20/71 (28%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
            ++++R KKKK+ K K  K+                      KKKKKK      + K   
Sbjct: 78  LQREERLKKKKRVKTKAYKEPT--------------------KKKKKKDPTAAKSPKAAA 117

Query: 283 KKKRKKKKNQH 293
            + +KK +   
Sbjct: 118 PRPKKKSERIS 128



 Score = 41.2 bits (97), Expect = 3e-04
 Identities = 22/68 (32%), Positives = 29/68 (42%), Gaps = 19/68 (27%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           +K+ +R+++ KKKK+ K K  K                   E  KKKKKK     K  K 
Sbjct: 75  EKELQREERLKKKKRVKTKAYK-------------------EPTKKKKKKDPTAAKSPKA 115

Query: 283 KKKRKKKK 290
              R KKK
Sbjct: 116 AAPRPKKK 123



 Score = 39.3 bits (92), Expect = 0.001
 Identities = 15/32 (46%), Positives = 23/32 (71%)

Query: 259 GEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
           GE+E +++++ KKKK+  TK  K   K+KKKK
Sbjct: 74  GEKELQREERLKKKKRVKTKAYKEPTKKKKKK 105



 Score = 39.3 bits (92), Expect = 0.001
 Identities = 24/90 (26%), Positives = 40/90 (44%), Gaps = 15/90 (16%)

Query: 221 VMKKKKK-------RKKKKKKKKKKKRKKKKNQHHRDVWINRTSRG------EEEEEKKK 267
             KKKKK        K    + KKK  +        D    ++SR       E   E+ K
Sbjct: 99  TKKKKKKDPTAAKSPKAAAPRPKKKSERISWAPTLLDSPRRKSSRSSTVQNKEATHERLK 158

Query: 268 KKKKKKKKTTKK--KKRKKKRKKKKNQHHR 295
           +++ ++KK   K  K+++KK++K+  Q  R
Sbjct: 159 EREIRRKKIQAKARKRKEKKKEKELTQEER 188



 Score = 38.1 bits (89), Expect = 0.003
 Identities = 14/31 (45%), Positives = 21/31 (67%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
           E E+E +++++ KKKK  K K  K+  KKKK
Sbjct: 73  EGEKELQREERLKKKKRVKTKAYKEPTKKKK 103



 Score = 37.0 bits (86), Expect = 0.007
 Identities = 17/38 (44%), Positives = 22/38 (57%), Gaps = 5/38 (13%)

Query: 260 EEEEEK-----KKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
           EEE EK     ++ KKKK+ KT   K+  KK+KKK   
Sbjct: 71  EEEGEKELQREERLKKKKRVKTKAYKEPTKKKKKKDPT 108



 Score = 35.4 bits (82), Expect = 0.023
 Identities = 15/62 (24%), Positives = 27/62 (43%), Gaps = 11/62 (17%)

Query: 229 KKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKK 288
            +K+ +++++ +KKK+ +               +E  KKKKKK        K    + KK
Sbjct: 74  GEKELQREERLKKKKRVK-----------TKAYKEPTKKKKKKDPTAAKSPKAAAPRPKK 122

Query: 289 KK 290
           K 
Sbjct: 123 KS 124



 Score = 34.7 bits (80), Expect = 0.040
 Identities = 22/77 (28%), Positives = 40/77 (51%), Gaps = 17/77 (22%)

Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDV-------------WINRTS--RGEEEEEK 265
             ++ K+R+ ++KK + K RK+K+ +  +++              IN  S  R EE+EE+
Sbjct: 153 THERLKEREIRRKKIQAKARKRKEKKKEKELTQEERLAEAKETERINLKSLERYEEQEEE 212

Query: 266 KKKKKKK--KKKTTKKK 280
           KKK K +  KK+   + 
Sbjct: 213 KKKAKIQALKKRRLYEG 229


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kind of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 45.3 bits (107), Expect = 3e-05
 Identities = 12/79 (15%), Positives = 35/79 (44%)

Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
           K+KK+++ KK  ++ +   +      ++ +      E+ ++  +  + K+K   +++   
Sbjct: 169 KRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEENLL 228

Query: 284 KKRKKKKNQHHRDVWQKKY 302
                K N+   D+ Q+  
Sbjct: 229 YLDYLKLNEERIDLLQELL 247



 Score = 37.6 bits (87), Expect = 0.007
 Identities = 15/82 (18%), Positives = 38/82 (46%), Gaps = 5/82 (6%)

Query: 224  KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
            K+K+   K++++++ KR     +   +V +   +  EE+EE+  K + KK++  ++KK  
Sbjct: 949  KEKEEDNKEEEEERNKRLLLAKEELGNVNLMAIAEFEEKEERYNKDELKKERLEEEKKEL 1008

Query: 284  KK-----RKKKKNQHHRDVWQK 300
             +       ++  +        
Sbjct: 1009 LREIIEETCQRFKEFLELFVSI 1030



 Score = 36.9 bits (85), Expect = 0.014
 Identities = 18/81 (22%), Positives = 33/81 (40%), Gaps = 9/81 (11%)

Query: 222 MKKKKKRKKKKKKKKKK-------KRKKKKNQHHRDVWINRTSRGEEEEEKKKKK--KKK 272
             KK     + K+K +            K N+   D+        +EE E  K++  K++
Sbjct: 206 QAKKALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEE 265

Query: 273 KKKTTKKKKRKKKRKKKKNQH 293
           +      K+ K++ K+KK Q 
Sbjct: 266 EILAQVLKENKEEEKEKKLQE 286



 Score = 33.8 bits (77), Expect = 0.12
 Identities = 16/78 (20%), Positives = 44/78 (56%)

Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
           +K+ +K +K+ KK+K+  ++  +  +++ I R +  EEEE+ +K ++K ++   +   +K
Sbjct: 321 EKELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKK 380

Query: 284 KKRKKKKNQHHRDVWQKK 301
           K   ++ +   +   ++ 
Sbjct: 381 KLESERLSSAAKLKEEEL 398



 Score = 33.4 bits (76), Expect = 0.17
 Identities = 15/71 (21%), Positives = 41/71 (57%), Gaps = 3/71 (4%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           +K K+++++++K + KK ++++ +    +         EEEEK +K K +++K  K K +
Sbjct: 744 QKIKEKEEEEEKSRLKKEEEEEEKSELSLKEKEL---AEEEEKTEKLKVEEEKEEKLKAQ 800

Query: 283 KKKRKKKKNQH 293
           +++ +  + + 
Sbjct: 801 EEELRALEEEL 811



 Score = 31.9 bits (72), Expect = 0.45
 Identities = 17/72 (23%), Positives = 37/72 (51%), Gaps = 4/72 (5%)

Query: 220 FVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKK-KKKKKKKKTTK 278
             ++K+++   +  K+ K++ K+KK Q      +   ++ EEE + +  K +++K    +
Sbjct: 259 QELEKEEEILAQVLKENKEEEKEKKLQE---EELKLLAKEEEELKSELLKLERRKVDDEE 315

Query: 279 KKKRKKKRKKKK 290
           K K  +K  KK 
Sbjct: 316 KLKESEKELKKL 327



 Score = 31.9 bits (72), Expect = 0.49
 Identities = 9/72 (12%), Positives = 38/72 (52%), Gaps = 1/72 (1%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRD-VWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
            ++K K ++++ +  +++ K++      + + I +  + +EEE ++   + K+++  +K 
Sbjct: 793 KEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKL 852

Query: 281 KRKKKRKKKKNQ 292
             ++  + ++  
Sbjct: 853 AEEELERLEEEI 864



 Score = 30.3 bits (68), Expect = 1.5
 Identities = 12/71 (16%), Positives = 36/71 (50%), Gaps = 2/71 (2%)

Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTK--KKKRK 283
           ++ + +  K++ +K ++   Q  ++       +  +EEE K   K++++  ++  K +R+
Sbjct: 250 EQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERR 309

Query: 284 KKRKKKKNQHH 294
           K   ++K +  
Sbjct: 310 KVDDEEKLKES 320



 Score = 29.9 bits (67), Expect = 2.0
 Identities = 22/89 (24%), Positives = 37/89 (41%), Gaps = 18/89 (20%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKK------- 274
             KK K +K++    K +  + K      +   +    EEEEEK + KK++++       
Sbjct: 712 ELKKLKLEKEELLADKVQEAQDKINEELKLLEQKIKEKEEEEEKSRLKKEEEEEEKSELS 771

Query: 275 -----------KTTKKKKRKKKRKKKKNQ 292
                      KT K K  ++K +K K Q
Sbjct: 772 LKEKELAEEEEKTEKLKVEEEKEEKLKAQ 800



 Score = 28.4 bits (63), Expect = 6.0
 Identities = 11/70 (15%), Positives = 30/70 (42%), Gaps = 2/70 (2%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKK--KKKKKKKKTTKKK 280
           ++ + + +++K+ K      ++ +           +  EE E+    K+ K  ++  + +
Sbjct: 396 EELELKNEEEKEAKLLLELSEQEEDLLKEEKKEELKIVEELEESLETKQGKLTEEKEELE 455

Query: 281 KRKKKRKKKK 290
           K+  K  K K
Sbjct: 456 KQALKLLKDK 465



 Score = 28.4 bits (63), Expect = 6.2
 Identities = 9/77 (11%), Positives = 31/77 (40%), Gaps = 3/77 (3%)

Query: 223  KKKKKRKKKKKKKKKKKRKKKKN---QHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK 279
            +++++R K+    K++          +             ++E  +++KK+  ++   + 
Sbjct: 957  EEEEERNKRLLLAKEELGNVNLMAIAEFEEKEERYNKDELKKERLEEEKKELLREIIEET 1016

Query: 280  KKRKKKRKKKKNQHHRD 296
             +R K+  +     +R 
Sbjct: 1017 CQRFKEFLELFVSINRG 1033


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 45.0 bits (107), Expect = 3e-05
 Identities = 17/66 (25%), Positives = 26/66 (39%)

Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
           K++ K+  + KKK   +  +        +  T    E +  KKK K   K    K   KK
Sbjct: 35  KEEIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKK 94

Query: 285 KRKKKK 290
           K K + 
Sbjct: 95  KLKDEL 100



 Score = 40.7 bits (96), Expect = 7e-04
 Identities = 17/68 (25%), Positives = 28/68 (41%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
            K K    K++ K+  + KKK  +    V I  +   ++ ++  +    KKK  T  K  
Sbjct: 27  SKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAA 86

Query: 283 KKKRKKKK 290
             K   KK
Sbjct: 87  AAKAPAKK 94



 Score = 36.5 bits (85), Expect = 0.016
 Identities = 12/71 (16%), Positives = 24/71 (33%)

Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
           +  K K +    K++ K+  + KK    +   +     G  ++     +    KK TK  
Sbjct: 23  LAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTA 82

Query: 281 KRKKKRKKKKN 291
            +    K    
Sbjct: 83  AKAAAAKAPAK 93



 Score = 36.1 bits (84), Expect = 0.017
 Identities = 17/81 (20%), Positives = 28/81 (34%), Gaps = 9/81 (11%)

Query: 219 GFVMKKKKKRKKKKKKKKKKKRKK---------KKNQHHRDVWINRTSRGEEEEEKKKKK 269
           GF+ K++ K   + KKK  ++  +         K      +  I +       +    K 
Sbjct: 31  GFITKEEIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKA 90

Query: 270 KKKKKKTTKKKKRKKKRKKKK 290
             KKK   +    KK  KK  
Sbjct: 91  PAKKKLKDELDSSKKAEKKNA 111



 Score = 30.0 bits (68), Expect = 1.6
 Identities = 14/79 (17%), Positives = 23/79 (29%), Gaps = 21/79 (26%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
            +    KKK K   K    K                        KKK K +  ++KK ++
Sbjct: 70  TESDIPKKKTKTAAKAAAAKAPA---------------------KKKLKDELDSSKKAEK 108

Query: 283 KKKRKKKKNQHHRDVWQKK 301
           K    K  + ++       
Sbjct: 109 KNALDKDDDLNYVKDIDVL 127



 Score = 29.6 bits (67), Expect = 2.6
 Identities = 13/68 (19%), Positives = 21/68 (30%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           K KK   K K K    K + K+    +     +  +         K      ++   KK+
Sbjct: 19  KLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKK 78

Query: 283 KKKRKKKK 290
            K   K  
Sbjct: 79  TKTAAKAA 86



 Score = 29.2 bits (66), Expect = 2.8
 Identities = 11/78 (14%), Positives = 24/78 (30%), Gaps = 14/78 (17%)

Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
                K +   +++ K+K KK                 + + K    K++ K   + K+K
Sbjct: 2   TTASTKAELAAEEEAKKKLKKLA--------------AKSKSKGFITKEEIKEALESKKK 47

Query: 284 KKRKKKKNQHHRDVWQKK 301
              +  +         K 
Sbjct: 48  TPEQIDQVLIFLSGMVKD 65


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 45.0 bits (106), Expect = 3e-05
 Identities = 19/71 (26%), Positives = 32/71 (45%), Gaps = 8/71 (11%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWI--NRTSRGEEEEEKKKKKKKKKKKTTKK 279
             + +K    + K  K K  +       D     + + +G E+ +++K KKKKKK    K
Sbjct: 199 ETEGEKGGGGRGKDLKIKDLEG------DDEDDGDESDKGGEDGDEEKSKKKKKKLAKNK 252

Query: 280 KKRKKKRKKKK 290
           KK    +K K+
Sbjct: 253 KKLDDDKKGKR 263



 Score = 36.1 bits (83), Expect = 0.018
 Identities = 28/114 (24%), Positives = 41/114 (35%), Gaps = 40/114 (35%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEE-------------------- 262
            KKKK+K  K KKK    KK K     D     +  G++E                    
Sbjct: 241 SKKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDSDDGDDEGREEDYISDSSASGNDPEER 300

Query: 263 --------------------EEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRD 296
                               EE +++K +++   +KK K+ KK K KKN   +D
Sbjct: 301 EDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKD 354



 Score = 31.9 bits (72), Expect = 0.41
 Identities = 24/114 (21%), Positives = 42/114 (36%), Gaps = 40/114 (35%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKK---------------NQHHRDVWINRTS---------- 257
           KKKKK+  K KKK    +K K+               ++   + +I+ +S          
Sbjct: 242 KKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDSDDGDDEGREEDYISDSSASGNDPEERE 301

Query: 258 ---------------RGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRD 296
                            + EE +++K +++   + K KK KK + KK      D
Sbjct: 302 DKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDD 355



 Score = 31.1 bits (70), Expect = 0.70
 Identities = 17/67 (25%), Positives = 33/67 (49%), Gaps = 4/67 (5%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           K +++    KK KK KK K KKN   +D     +  G++ ++     +      T KK++
Sbjct: 327 KNEEEGGLSKKGKKLKKLKGKKNGLDKD----DSDSGDDSDDSDIDGEDSVSLVTAKKQK 382

Query: 283 KKKRKKK 289
           + K+++ 
Sbjct: 383 EPKKEEP 389


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 44.9 bits (107), Expect = 3e-05
 Identities = 19/68 (27%), Positives = 29/68 (42%), Gaps = 17/68 (25%)

Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
             +RKK +KK++K ++K +K                 EE +K   KKK +   KK K   
Sbjct: 408 PAERKKLRKKQRKAEKKAEK-----------------EEAEKAAAKKKAEAAAKKAKGPD 450

Query: 285 KRKKKKNQ 292
              KK + 
Sbjct: 451 GETKKVDP 458



 Score = 34.5 bits (80), Expect = 0.061
 Identities = 10/43 (23%), Positives = 21/43 (48%)

Query: 259 GEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQKK 301
           G     ++KK +KK++K  KK ++++  K    +      +K 
Sbjct: 404 GNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKA 446



 Score = 34.1 bits (79), Expect = 0.090
 Identities = 15/69 (21%), Positives = 28/69 (40%), Gaps = 20/69 (28%)

Query: 229 KKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKK 288
             ++KK +KK+RK                     E+K +K++ +K    KK +   K+ K
Sbjct: 408 PAERKKLRKKQRK--------------------AEKKAEKEEAEKAAAKKKAEAAAKKAK 447

Query: 289 KKNQHHRDV 297
             +   + V
Sbjct: 448 GPDGETKKV 456



 Score = 30.3 bits (69), Expect = 1.3
 Identities = 10/46 (21%), Positives = 19/46 (41%)

Query: 256 TSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQKK 301
              GE       ++KK +KK  K +K+ +K + +K    +      
Sbjct: 398 EEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAA 443



 Score = 29.9 bits (68), Expect = 1.9
 Identities = 9/47 (19%), Positives = 19/47 (40%), Gaps = 8/47 (17%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKK 268
            +K +K+ +K++ +K   +KK +                 + E KK 
Sbjct: 418 QRKAEKKAEKEEAEKAAAKKKAEAAA--------KKAKGPDGETKKV 456


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 44.7 bits (106), Expect = 3e-05
 Identities = 20/75 (26%), Positives = 41/75 (54%), Gaps = 9/75 (12%)

Query: 222 MKKKKKRKKKKKKKKKKKR------KKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKK 275
            K+K K ++ K+K++K+        K+ K +  +   + R     +E  +K+K + +KK+
Sbjct: 275 PKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQ---LARLKEIAKEVAQKEKARARKKE 331

Query: 276 TTKKKKRKKKRKKKK 290
             K++  KKK K++K
Sbjct: 332 QRKERGEKKKLKRRK 346



 Score = 40.8 bits (96), Expect = 5e-04
 Identities = 23/92 (25%), Positives = 43/92 (46%), Gaps = 29/92 (31%)

Query: 220 FVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKK--------- 270
            + K  + ++K K ++ K+KR+K+                E +EEK+ KKK         
Sbjct: 268 PINKPVRPKRKTKAQRNKEKRRKEL-------------EREAKEEKQLKKKLAQLARLKE 314

Query: 271 -------KKKKKTTKKKKRKKKRKKKKNQHHR 295
                  K+K +  KK++RK++ +KKK +  +
Sbjct: 315 IAKEVAQKEKARARKKEQRKERGEKKKLKRRK 346



 Score = 39.3 bits (92), Expect = 0.002
 Identities = 20/101 (19%), Positives = 43/101 (42%), Gaps = 22/101 (21%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRG----------------------E 260
           +K+ K +KK+++ ++ + KK +        ++  S G                      E
Sbjct: 204 EKEVKAEKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFE 263

Query: 261 EEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQKK 301
            E E   K  + K+KT  ++ ++K+RK+ + +   +   KK
Sbjct: 264 SEYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKK 304



 Score = 34.3 bits (79), Expect = 0.068
 Identities = 23/97 (23%), Positives = 38/97 (39%), Gaps = 20/97 (20%)

Query: 220 FVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTT-- 277
           FV+         KKK++KK +K  K+    D  +   S+      KK+  KKK KK    
Sbjct: 43  FVVDTTGDEATIKKKRRKKLKKPLKS----DEILENRSKVPAPSVKKRHVKKKSKKKKSK 98

Query: 278 --------------KKKKRKKKRKKKKNQHHRDVWQK 300
                         + KK  K +    ++ + D+W +
Sbjct: 99  QVSRKELLRLKKLARGKKGGKAKLLNASKDNYDLWGE 135



 Score = 32.4 bits (74), Expect = 0.28
 Identities = 19/106 (17%), Positives = 39/106 (36%), Gaps = 27/106 (25%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHR----------------------DVWINRTSRGE 260
           K +KKR++ ++ ++KK  K                                W    S  E
Sbjct: 208 KAEKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESEYE 267

Query: 261 EEE-----EKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQKK 301
                   ++K K ++ K+K  K+ +R+ K +K+  +    + + K
Sbjct: 268 PINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQLARLK 313



 Score = 31.2 bits (71), Expect = 0.62
 Identities = 16/66 (24%), Positives = 27/66 (40%), Gaps = 6/66 (9%)

Query: 220 FVMKKKKKRKKK--KKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKK----KKKKKKK 273
            V +K+  R KK  + KK  K +    ++ + D+W         +  KK       ++  
Sbjct: 99  QVSRKELLRLKKLARGKKGGKAKLLNASKDNYDLWGEEEPPDTPKWPKKDLAEHTLEQVA 158

Query: 274 KKTTKK 279
           KKT K 
Sbjct: 159 KKTVKT 164


>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
           complex aNOP56 subunit; Provisional.
          Length = 414

 Score = 43.8 bits (104), Expect = 5e-05
 Identities = 24/68 (35%), Positives = 36/68 (52%), Gaps = 18/68 (26%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
            ++  KR ++ K+K  K  KKK+                  EEKK +K+KKKKK  KK K
Sbjct: 365 KEELNKRIEEIKEKYPKPPKKKR------------------EEKKPQKRKKKKKRKKKGK 406

Query: 282 RKKKRKKK 289
           ++KK+ +K
Sbjct: 407 KRKKKGRK 414



 Score = 40.0 bits (94), Expect = 0.001
 Identities = 17/28 (60%), Positives = 23/28 (82%)

Query: 265 KKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
           KKK+++KK +K  KKKKRKKK KK+K +
Sbjct: 384 KKKREEKKPQKRKKKKKRKKKGKKRKKK 411



 Score = 38.0 bits (89), Expect = 0.005
 Identities = 18/38 (47%), Positives = 27/38 (71%), Gaps = 5/38 (13%)

Query: 258 RGEEEEEK-----KKKKKKKKKKTTKKKKRKKKRKKKK 290
           R EE +EK     KKK+++KK +  KKKK++KK+ KK+
Sbjct: 371 RIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKR 408



 Score = 37.3 bits (87), Expect = 0.008
 Identities = 17/35 (48%), Positives = 25/35 (71%), Gaps = 5/35 (14%)

Query: 262 EEEKKK-----KKKKKKKKTTKKKKRKKKRKKKKN 291
           EE K+K     KKK+++KK  K+KK+KK++KK K 
Sbjct: 373 EEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKK 407



 Score = 30.3 bits (69), Expect = 1.3
 Identities = 13/37 (35%), Positives = 23/37 (62%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRD 296
           EE  ++ ++ K+K  K  KKK+ +KK +K+K +  R 
Sbjct: 366 EELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRK 402



 Score = 28.4 bits (64), Expect = 4.7
 Identities = 13/38 (34%), Positives = 22/38 (57%), Gaps = 2/38 (5%)

Query: 261 EEEEKKKK--KKKKKKKTTKKKKRKKKRKKKKNQHHRD 296
           EE  K+ +  K+K  K   KK++ KK +K+KK +  + 
Sbjct: 366 EELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKK 403


>gnl|CDD|235370 PRK05244, PRK05244, Der GTPase activator; Provisional.
          Length = 177

 Score = 42.2 bits (100), Expect = 8e-05
 Identities = 21/70 (30%), Positives = 30/70 (42%), Gaps = 14/70 (20%)

Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKK------KTTKK 279
           KK+K   K+ K   + KKK +   D         E  E K+KKK K  K      +   +
Sbjct: 1   KKKKSSPKRSKGMAKSKKKTREELDA--------EARERKRKKKHKGLKSGSRHNEGNTQ 52

Query: 280 KKRKKKRKKK 289
            K K + +KK
Sbjct: 53  SKGKGQAQKK 62



 Score = 40.7 bits (96), Expect = 3e-04
 Identities = 27/93 (29%), Positives = 38/93 (40%), Gaps = 25/93 (26%)

Query: 222 MKKKKKRKKKKKKKKKKK-----------RKKKKNQHHRDVWINRTSRGEEEEEKKKKKK 270
            KK   ++ K   K KKK           RK+KK +H      +R + G  + + K + +
Sbjct: 2   KKKSSPKRSKGMAKSKKKTREELDAEARERKRKK-KHKGLKSGSRHNEGNTQSKGKGQAQ 60

Query: 271 KK------KKK-------TTKKKKRKKKRKKKK 290
           KK      KK          K KK+K K KK K
Sbjct: 61  KKDPRIGSKKPIPLGVEEKVKPKKKKPKSKKPK 93



 Score = 27.2 bits (61), Expect = 7.3
 Identities = 20/72 (27%), Positives = 29/72 (40%), Gaps = 23/72 (31%)

Query: 222 MKKKKKRKKKK---------KKKKKKKRKKKKNQHHRDVWINRTSR-------GEEEEEK 265
            K+KKK K  K          + K K + +KK+           S+        E+ + K
Sbjct: 31  RKRKKKHKGLKSGSRHNEGNTQSKGKGQAQKKDPRI-------GSKKPIPLGVEEKVKPK 83

Query: 266 KKKKKKKKKKTT 277
           KKK K KK K +
Sbjct: 84  KKKPKSKKPKLS 95


>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
           (DUF2058).  This domain, found in various prokaryotic
           proteins, has no known function.
          Length = 177

 Score = 42.2 bits (100), Expect = 8e-05
 Identities = 17/68 (25%), Positives = 37/68 (54%), Gaps = 5/68 (7%)

Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
           KKK KK KK+K+K++++ +K     D         +  EE K +K ++ ++  ++++ + 
Sbjct: 14  KKKAKKAKKEKRKQRKQARKGADDGD-----DELKQAAEEAKAEKAERDRELNRQRQAEA 68

Query: 285 KRKKKKNQ 292
           ++K    Q
Sbjct: 69  EQKAIVAQ 76


>gnl|CDD|220600 pfam10147, CR6_interact, Growth arrest and DNA-damage-inducible
           proteins-interacting protein 1.  Members of this family
           of proteins act as negative regulators of G1 to S cell
           cycle phase progression by inhibiting cyclin-dependent
           kinases. Inhibitory effects are additive with GADD45
           proteins but occur also in the absence of GADD45
           proteins. Furthermore, they act as a repressor of the
           orphan nuclear receptor NR4A1 by inhibiting AB
           domain-mediated transcriptional activity.
          Length = 217

 Score = 42.5 bits (100), Expect = 1e-04
 Identities = 21/72 (29%), Positives = 46/72 (63%), Gaps = 1/72 (1%)

Query: 216 WIPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWIN-RTSRGEEEEEKKKKKKKKKK 274
            I  +  +K+K+ +K +  K++K+R   + + H   W++ R  R +E  ++K+K++KKK 
Sbjct: 136 MIADWRAQKRKREQKARAAKERKERLVAEAREHFGYWVDPRDPRFQEMLQQKEKEEKKKV 195

Query: 275 KTTKKKKRKKKR 286
           K  K++++++KR
Sbjct: 196 KEAKRREKEEKR 207



 Score = 40.2 bits (94), Expect = 5e-04
 Identities = 20/68 (29%), Positives = 43/68 (63%), Gaps = 5/68 (7%)

Query: 224 KKKKRKKKKKKKKKKKRKKK---KNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
           + +KRK+++K +  K+RK++   + + H   W++   R    +E  ++K+K++KK  K+ 
Sbjct: 141 RAQKRKREQKARAAKERKERLVAEAREHFGYWVD--PRDPRFQEMLQQKEKEEKKKVKEA 198

Query: 281 KRKKKRKK 288
           KR++K +K
Sbjct: 199 KRREKEEK 206



 Score = 30.5 bits (69), Expect = 0.73
 Identities = 14/82 (17%), Positives = 34/82 (41%), Gaps = 20/82 (24%)

Query: 216 WIPGF-VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKK 274
           W P    M ++ + ++K+K+ +++ R                   E E  K   K  +  
Sbjct: 97  WYPSLNQMLEENREQQKEKEARRQAR-------------------EAEIAKNMAKMPQMI 137

Query: 275 KTTKKKKRKKKRKKKKNQHHRD 296
              + +KRK+++K +  +  ++
Sbjct: 138 ADWRAQKRKREQKARAAKERKE 159


>gnl|CDD|218684 pfam05672, MAP7, MAP7 (E-MAP-115) family.  The organisation of
           microtubules varies with the cell type and is presumably
           controlled by tissue-specific microtubule-associated
           proteins (MAPs). The 115-kDa epithelial MAP
           (E-MAP-115/MAP7) has been identified as a
           microtubule-stabilising protein predominantly expressed
           in cell lines of epithelial origin. The binding of this
           microtubule associated protein is nucleotide
           independent.
          Length = 171

 Score = 41.6 bits (97), Expect = 1e-04
 Identities = 16/81 (19%), Positives = 55/81 (67%), Gaps = 2/81 (2%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTS--RGEEEEEKKKKKKKKKKKTTKKK 280
           K+++ R+++++++++++ ++++++  R+    R +  R   EEE +++++++ ++  +K 
Sbjct: 31  KRRQAREQREQEEQERREQEEQDRLEREELKRRAAEERLRREEEARRQEEERAREKEEKA 90

Query: 281 KRKKKRKKKKNQHHRDVWQKK 301
           KRK + ++K+ Q  ++  QK+
Sbjct: 91  KRKAEEEEKQEQEEQERIQKQ 111



 Score = 33.9 bits (77), Expect = 0.043
 Identities = 11/79 (13%), Positives = 48/79 (60%), Gaps = 8/79 (10%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           ++++ R++++++ ++K+ K K+     +         +E+EE+++ +K+K++   + ++ 
Sbjct: 71  REEEARRQEEERAREKEEKAKRKAEEEEK--------QEQEEQERIQKQKEEAEARAREE 122

Query: 283 KKKRKKKKNQHHRDVWQKK 301
            ++ + ++ +H + + Q++
Sbjct: 123 AERMRLEREKHFQQIEQER 141



 Score = 33.1 bits (75), Expect = 0.094
 Identities = 12/81 (14%), Positives = 50/81 (61%), Gaps = 11/81 (13%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEE-----------EEEKKKKKKK 271
           +++++++++++++++++ + ++ +  R     R  R EE           +EEK K+K +
Sbjct: 36  REQREQEEQERREQEEQDRLEREELKRRAAEERLRREEEARRQEEERAREKEEKAKRKAE 95

Query: 272 KKKKTTKKKKRKKKRKKKKNQ 292
           +++K  ++++ + +++K++ +
Sbjct: 96  EEEKQEQEEQERIQKQKEEAE 116



 Score = 30.8 bits (69), Expect = 0.47
 Identities = 9/73 (12%), Positives = 42/73 (57%)

Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
           K+++ ++++ +K+K++ + +  +    + + R    ++ E+++ ++KK+ ++  K+ ++ 
Sbjct: 99  KQEQEEQERIQKQKEEAEARAREEAERMRLEREKHFQQIEQERLERKKRLEEIMKRTRKS 158

Query: 284 KKRKKKKNQHHRD 296
           +   + K +  + 
Sbjct: 159 EVSPQVKKEDPKV 171


>gnl|CDD|218797 pfam05890, Ebp2, Eukaryotic rRNA processing protein EBP2.  This
           family consists of several Eukaryotic rRNA processing
           protein EBP2 sequences. Ebp2p is required for the
           maturation of 25S rRNA and 60S subunit assembly. Ebp2p
           may be one of the target proteins of Rrs1p for executing
           the signal to regulate ribosome biogenesis. This family
           also plays a role in chromosome segregation.
          Length = 270

 Score = 42.3 bits (100), Expect = 1e-04
 Identities = 22/73 (30%), Positives = 31/73 (42%), Gaps = 14/73 (19%)

Query: 224 KKKKRKKKKKK---KKKKKRKKKKNQHHR----DVWINRTSRGEEEEEKKKKKKKKKKKT 276
           K ++R K+KK    K KK +KK+K         D          ++  K   KKK +K  
Sbjct: 154 KLQERAKEKKDMLEKIKKLKKKRKGGGDEGDLFDF-------LVDDAAKGGSKKKGRKGG 206

Query: 277 TKKKKRKKKRKKK 289
             + K   KRK K
Sbjct: 207 AARGKPNAKRKAK 219



 Score = 34.2 bits (79), Expect = 0.058
 Identities = 18/73 (24%), Positives = 22/73 (30%), Gaps = 12/73 (16%)

Query: 219 GFVMKKKKKRKKKKKKKK----KKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKK 274
           G   + K   K+K K  K     KKR  K N           S  +      KK K    
Sbjct: 205 GGAARGKPNAKRKAKDAKFGFGGKKRGSKSN--------TAESSNDLSGFSVKKMKGGGG 256

Query: 275 KTTKKKKRKKKRK 287
                 KR  K +
Sbjct: 257 AGKGGNKRPGKSR 269



 Score = 31.9 bits (73), Expect = 0.30
 Identities = 19/70 (27%), Positives = 34/70 (48%), Gaps = 19/70 (27%)

Query: 222 MKKKKKRKKKKKKKKKKKRKK--KKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK 279
           + +   +K  ++ +K+++ KK  K+ Q                 EK +++ K+KK   +K
Sbjct: 125 LAEAAAKKASEEARKQRELKKFGKQVQV----------------EKLQERAKEKKDMLEK 168

Query: 280 -KKRKKKRKK 288
            KK KKKRK 
Sbjct: 169 IKKLKKKRKG 178


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 42.5 bits (100), Expect = 2e-04
 Identities = 19/78 (24%), Positives = 34/78 (43%), Gaps = 9/78 (11%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDV---------WINRTSRGEEEEEKKKKKKKKK 273
           KKK K KK+KK+  +   K++  +                      E+E+E K   ++  
Sbjct: 223 KKKTKEKKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSD 282

Query: 274 KKTTKKKKRKKKRKKKKN 291
            +   ++K K+KRK+ K 
Sbjct: 283 SEEETEEKEKEKRKRLKK 300



 Score = 34.8 bits (80), Expect = 0.047
 Identities = 21/89 (23%), Positives = 44/89 (49%), Gaps = 14/89 (15%)

Query: 218 PGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRD----VWINRTSRGEEEEEKKK------ 267
           P    ++    ++ ++K+K+K+++ KK     D    + I   S  EEEE ++       
Sbjct: 274 PKPSGERSDSEEETEEKEKEKRKRLKKMMEDEDEDEEMEIVPESPVEEEESEEPEPPPLP 333

Query: 268 KKKKKKKKTTKK----KKRKKKRKKKKNQ 292
           KK+++K++ T      ++R ++R  KK  
Sbjct: 334 KKEEEKEEVTVSPDGGRRRGRRRVMKKKT 362



 Score = 33.7 bits (77), Expect = 0.11
 Identities = 20/73 (27%), Positives = 27/73 (36%), Gaps = 13/73 (17%)

Query: 228 RKKKKKKKKKKKRKK--------KKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK 279
           R++ +++  KKK  K        KK       W +  S  E E    K K K        
Sbjct: 350 RRRGRRRVMKKKTFKDEEGYLVTKKVY----EWES-FSEDEAEPPPTKPKPKVSTPAVPA 404

Query: 280 KKRKKKRKKKKNQ 292
             +K K  KKK Q
Sbjct: 405 AAKKPKAPKKKKQ 417



 Score = 32.5 bits (74), Expect = 0.23
 Identities = 16/72 (22%), Positives = 25/72 (34%), Gaps = 2/72 (2%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK--K 280
              K  K     + K K  KK     ++    +T      +    K+   KK        
Sbjct: 163 PSSKPPKSIMSPEVKVKSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPPKKSNIMSSFF 222

Query: 281 KRKKKRKKKKNQ 292
           K+K K KK+K +
Sbjct: 223 KKKTKEKKEKKE 234



 Score = 30.6 bits (69), Expect = 1.1
 Identities = 19/82 (23%), Positives = 31/82 (37%), Gaps = 3/82 (3%)

Query: 221 VMKKKKKRKKKKK-KKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK 279
           +M  + K K  KK +   K+   +K +    V      R   +  K        KK TK+
Sbjct: 171 IMSPEVKVKSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPPK--KSNIMSSFFKKKTKE 228

Query: 280 KKRKKKRKKKKNQHHRDVWQKK 301
           KK KK+  +   +   +    K
Sbjct: 229 KKEKKEASESTVKEESEEESGK 250



 Score = 30.6 bits (69), Expect = 1.1
 Identities = 16/68 (23%), Positives = 31/68 (45%), Gaps = 7/68 (10%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
              +K   K   K    KR   K         N  S   +++ K+KK+KK+  ++T K++
Sbjct: 191 TTTEKTEGKTSVKAASLKRNPPKKS-------NIMSSFFKKKTKEKKEKKEASESTVKEE 243

Query: 282 RKKKRKKK 289
            +++  K+
Sbjct: 244 SEEESGKR 251


>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin.  Trichoplein
           or mitostatin, was first defined as a meiosis-specific
           nuclear structural protein. It has since been linked
           with mitochondrial movement. It is associated with the
           mitochondrial outer membrane, and over-expression leads
           to reduction in mitochondrial motility whereas lack of
           it enhances mitochondrial movement. The activity appears
           to be mediated through binding the mitochondria to the
           actin intermediate filaments (IFs).
          Length = 349

 Score = 42.2 bits (100), Expect = 2e-04
 Identities = 19/81 (23%), Positives = 48/81 (59%), Gaps = 6/81 (7%)

Query: 223 KKKKKRKKKKKKKKKKKRKKK------KNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKT 276
            ++K+R+K+K++ +K++R+K+      + Q        +  R EEE E+++  +K+ +  
Sbjct: 213 YERKERQKEKEEAEKRRRQKQELQRAREEQIEEKEERLQEERAEEEAERERMLEKQAEDE 272

Query: 277 TKKKKRKKKRKKKKNQHHRDV 297
             +++  +KR+ K+ +H R++
Sbjct: 273 ELEQENAEKRRMKRLEHRREL 293



 Score = 39.9 bits (94), Expect = 0.001
 Identities = 15/88 (17%), Positives = 47/88 (53%), Gaps = 17/88 (19%)

Query: 222 MKKKKKRKKKKK--------KKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKK 273
            K+K++R++ +         ++K+++ + ++    R+       R +EE+E + ++K++K
Sbjct: 63  RKRKEERREGRAVLQEQIEEREKRRQEEYEERLQEREQMDEIIERIQEEDEAEAQEKREK 122

Query: 274 KKTTK---------KKKRKKKRKKKKNQ 292
           +K  +         + +RK++ K+++ +
Sbjct: 123 QKKLREEIDEFNEERIERKEEEKERERE 150



 Score = 39.1 bits (92), Expect = 0.002
 Identities = 14/83 (16%), Positives = 45/83 (54%), Gaps = 5/83 (6%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWI----NRTSRGEEEEEKKKKKKKKKKKTT 277
           M +K+   ++ +++  +KR+ K+ +H R++          R  E EE+ ++ ++ +++  
Sbjct: 264 MLEKQAEDEELEQENAEKRRMKRLEHRRELEQQIEEKEERRAAEREEELEEGERLREEEA 323

Query: 278 KKKKR-KKKRKKKKNQHHRDVWQ 299
           +++ R +++R++   +H   +  
Sbjct: 324 ERQARIEEERQRLLKEHAEALLG 346



 Score = 38.7 bits (91), Expect = 0.003
 Identities = 14/72 (19%), Positives = 39/72 (54%), Gaps = 4/72 (5%)

Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKK----KKKKKKKKKTTKKK 280
           + + ++K++K+KK + +  +    R          E EEE K    +++K ++++  + +
Sbjct: 113 EAEAQEKREKQKKLREEIDEFNEERIERKEEEKEREREEELKILEYQREKAEREEEREAE 172

Query: 281 KRKKKRKKKKNQ 292
           +R++K +K++  
Sbjct: 173 RRERKEEKEREV 184



 Score = 36.0 bits (84), Expect = 0.018
 Identities = 13/82 (15%), Positives = 44/82 (53%), Gaps = 10/82 (12%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWI----NRTSRGEEEEEKKKKKKKKKKKTT 277
           M++++ +   +++++++KRK+++ +    +          R EE EE+ +++++  +   
Sbjct: 47  MEEERLKALAEEEERERKRKEERREGRAVLQEQIEEREKRRQEEYEERLQEREQMDEIIE 106

Query: 278 K------KKKRKKKRKKKKNQH 293
           +       + ++K+ K+KK + 
Sbjct: 107 RIQEEDEAEAQEKREKQKKLRE 128



 Score = 36.0 bits (84), Expect = 0.019
 Identities = 14/88 (15%), Positives = 48/88 (54%), Gaps = 8/88 (9%)

Query: 223 KKKKKRK--------KKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKK 274
           K+KK R+        + ++K+++K+R++++     +    +  R EE E +++++K++K+
Sbjct: 122 KQKKLREEIDEFNEERIERKEEEKEREREEELKILEYQREKAEREEEREAERRERKEEKE 181

Query: 275 KTTKKKKRKKKRKKKKNQHHRDVWQKKY 302
           +   + + +++  + + +   ++    Y
Sbjct: 182 REVARLRAQQEEAEDEREELDELRADLY 209



 Score = 34.5 bits (80), Expect = 0.052
 Identities = 13/81 (16%), Positives = 45/81 (55%), Gaps = 20/81 (24%)

Query: 222 MKKKKKRKKKKKKKKK-------KKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKK 274
           +++KK+ K ++K++++       ++R K   +             EEE E+K+K+++++ 
Sbjct: 26  IEEKKRIKAEEKEEERRIDEMMEEERLKALAE-------------EEERERKRKEERREG 72

Query: 275 KTTKKKKRKKKRKKKKNQHHR 295
           +   +++ +++ K+++ ++  
Sbjct: 73  RAVLQEQIEEREKRRQEEYEE 93



 Score = 33.3 bits (77), Expect = 0.13
 Identities = 10/79 (12%), Positives = 44/79 (55%), Gaps = 7/79 (8%)

Query: 222 MKKKKKRKKKKKKK-----KKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKT 276
            + +++ +K++K++     + ++ + +  +   D         +EE E+K+++K+K++  
Sbjct: 169 REAERRERKEEKEREVARLRAQQEEAEDEREELDELRAD--LYQEEYERKERQKEKEEAE 226

Query: 277 TKKKKRKKKRKKKKNQHHR 295
            +++++++ ++ ++ Q   
Sbjct: 227 KRRRQKQELQRAREEQIEE 245


>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
          Length = 470

 Score = 42.0 bits (99), Expect = 2e-04
 Identities = 24/68 (35%), Positives = 36/68 (52%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
             K++K++ K+K  KKK +K KK+    D ++       +E E K+  KKKKKK  KKKK
Sbjct: 9   EVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKKK 68

Query: 282 RKKKRKKK 289
           +       
Sbjct: 69  KNLGEAYD 76



 Score = 39.7 bits (93), Expect = 0.001
 Identities = 23/69 (33%), Positives = 35/69 (50%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           + K+++++ K+K  KKK KK K     D         E +E + K+  KKKKK  KKKK+
Sbjct: 9   EVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKKK 68

Query: 283 KKKRKKKKN 291
           K   +    
Sbjct: 69  KNLGEAYDL 77



 Score = 37.0 bits (86), Expect = 0.011
 Identities = 21/74 (28%), Positives = 37/74 (50%), Gaps = 9/74 (12%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWI---------NRTSRGEEEEEKKKKKKKKK 273
           + + K++K++ K+K  K+K KK++   D            N+ +  ++  +KKKKKKKKK
Sbjct: 7   ENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKK 66

Query: 274 KKTTKKKKRKKKRK 287
           KK    +       
Sbjct: 67  KKKNLGEAYDLAYD 80


>gnl|CDD|192535 pfam10320, 7TM_GPCR_Srsx, Serpentine type 7TM GPCR chemoreceptor
           Srsx.  Chemoreception is mediated in Caenorhabditis
           elegans by members of the seven-transmembrane
           G-protein-coupled receptor class (7TM GPCRs) of proteins
           which are of the serpentine type. Srsx is a solo family
           amongst the superfamilies of chemoreceptors.
           Chemoperception is one of the central senses of soil
           nematodes like C. elegans which are otherwise 'blind'
           and 'deaf'.
          Length = 257

 Score = 41.4 bits (98), Expect = 2e-04
 Identities = 41/196 (20%), Positives = 75/196 (38%), Gaps = 21/196 (10%)

Query: 52  IITAILGNLLVIISVIKHRKLRIITNYFVVSLAFADLLVALCVMPFNAIVSLTDEWYFGY 111
            +  I GN+++II   K +KLR   +Y +     ADLL     + F  ++  T       
Sbjct: 1   SVIGIFGNVIMIILTFKKKKLRSKCSYLICVQCLADLLCLSGEIIFVVLL-FTGTQLTRN 59

Query: 112 FMCDVWNSFDVYFSTASILHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLSNVWILPGI 171
             C +     ++  TA    +  I +DR  A+  P+ Y ++ +   + + L    I P I
Sbjct: 60  E-CFLIIIPYIFGQTAQSPLMLMIGIDRLIAVKFPIFYRLLSSSKYLFIQL----IFPVI 114

Query: 172 ISFTPIMLGWYTTPDHKQYRKHHPNVC---------IFVVNVYYALISSSISFWIPGFVM 222
            S   ++ G+    D          VC          F +    ++I + I   +   ++
Sbjct: 115 YSSFILVYGFLERDDETII------VCAPPLALNGTAFRIFTLSSVIINVIVLIVYIILI 168

Query: 223 KKKKKRKKKKKKKKKK 238
              KK+ +      KK
Sbjct: 169 IIFKKKGQTTNSDSKK 184


>gnl|CDD|219547 pfam07741, BRF1, Brf1-like TBP-binding domain.  This region covers
           both the Brf homology II and III regions. This region is
           involved in binding TATA binding protein.
          Length = 95

 Score = 39.2 bits (92), Expect = 3e-04
 Identities = 15/33 (45%), Positives = 25/33 (75%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
           EE+EEK+ K+K  +   + KKK+K+K KKK+++
Sbjct: 30  EEQEEKELKQKADEGNNSGKKKKKRKAKKKRDE 62



 Score = 35.3 bits (82), Expect = 0.005
 Identities = 13/33 (39%), Positives = 21/33 (63%)

Query: 261 EEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQH 293
           EE+E+K+ K+K  +     KK+KK++ KKK   
Sbjct: 30  EEQEEKELKQKADEGNNSGKKKKKRKAKKKRDE 62



 Score = 34.9 bits (81), Expect = 0.007
 Identities = 13/33 (39%), Positives = 21/33 (63%)

Query: 262 EEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHH 294
           EE+++K+ K+K  +     K+KKKRK KK +  
Sbjct: 30  EEQEEKELKQKADEGNNSGKKKKKRKAKKKRDE 62



 Score = 29.9 bits (68), Expect = 0.42
 Identities = 16/54 (29%), Positives = 28/54 (51%)

Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKK 274
             +K+ K+K  +     KK+KK+K +  RD     ++  E  ++  +KKK  KK
Sbjct: 32  QEEKELKQKADEGNNSGKKKKKRKAKKKRDEAGPASTAAEAVKKMLQKKKFSKK 85



 Score = 28.0 bits (63), Expect = 2.0
 Identities = 12/32 (37%), Positives = 18/32 (56%), Gaps = 6/32 (18%)

Query: 223 KKKKKRKKKKK------KKKKKKRKKKKNQHH 248
           ++++K  K+K        KKKKKRK KK +  
Sbjct: 31  EQEEKELKQKADEGNNSGKKKKKRKAKKKRDE 62



 Score = 26.1 bits (58), Expect = 8.4
 Identities = 9/25 (36%), Positives = 13/25 (52%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKK 284
           + +E     KKKKK+K  KK+    
Sbjct: 40  KADEGNNSGKKKKKRKAKKKRDEAG 64


>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2.  It is vital for
           effective cell-replication that replication is not
           stalled at any point by, for instance, damaged bases.
           Rtf2 stabilizes the replication fork stalled at the
           site-specific replication barrier RTS1 by preventing
           replication restart until completion of DNA synthesis by
           a converging replication fork initiated at a flanking
           origin. The RTS1 element terminates replication forks
           that are moving in the cen2-distal direction while
           allowing forks moving in the cen2-proximal direction to
           pass through the region. Rtf2 contains a C2HC2 motif
           related to the C3HC4 RING-finger motif, and would appear
           to fold up, creating a RING finger-like structure but
           forming only one functional Zn2+ ion-binding site.
          Length = 254

 Score = 41.2 bits (97), Expect = 3e-04
 Identities = 20/74 (27%), Positives = 31/74 (41%), Gaps = 10/74 (13%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKK---KKKKKKKKKTTK 278
           +K + + ++ KKKKKKKK+K KKN           S  E         +      +    
Sbjct: 172 LKARLEEERAKKKKKKKKKKTKKNN-------ATGSSAEATVSSAVPTELSSGAGQVGEA 224

Query: 279 KKKRKKKRKKKKNQ 292
           KK +KK+     N+
Sbjct: 225 KKLKKKRSIAPDNE 238



 Score = 38.1 bits (89), Expect = 0.003
 Identities = 17/64 (26%), Positives = 26/64 (40%)

Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
           KKK+KKKKKK KK        +      +         +  + KK KKK+      ++ +
Sbjct: 182 KKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAKKLKKKRSIAPDNEKSE 241

Query: 285 KRKK 288
             K 
Sbjct: 242 VYKS 245



 Score = 33.9 bits (78), Expect = 0.080
 Identities = 15/31 (48%), Positives = 22/31 (70%)

Query: 262 EEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
           EEE +  K + +++  KKKK+KKK+K KKN 
Sbjct: 166 EEEVELLKARLEEERAKKKKKKKKKKTKKNN 196



 Score = 32.7 bits (75), Expect = 0.15
 Identities = 18/69 (26%), Positives = 30/69 (43%), Gaps = 5/69 (7%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
           ++  K R ++++ KKKKK+KKKK         N T    E         +      +  +
Sbjct: 169 VELLKARLEEERAKKKKKKKKKK-----TKKNNATGSSAEATVSSAVPTELSSGAGQVGE 223

Query: 282 RKKKRKKKK 290
            KK +KK+ 
Sbjct: 224 AKKLKKKRS 232



 Score = 32.3 bits (74), Expect = 0.26
 Identities = 11/35 (31%), Positives = 14/35 (40%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHH 294
           E  ++KKKKKKKK KK        +          
Sbjct: 179 ERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTE 213



 Score = 31.5 bits (72), Expect = 0.42
 Identities = 14/33 (42%), Positives = 22/33 (66%)

Query: 259 GEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKN 291
             EEE +  K + ++++  KKKK+KKK+ KK N
Sbjct: 164 PTEEEVELLKARLEEERAKKKKKKKKKKTKKNN 196



 Score = 30.0 bits (68), Expect = 1.4
 Identities = 12/34 (35%), Positives = 19/34 (55%)

Query: 258 RGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKN 291
             EE E  K + ++++ K  KKKK+KK +K    
Sbjct: 165 TEEEVELLKARLEEERAKKKKKKKKKKTKKNNAT 198



 Score = 29.2 bits (66), Expect = 2.2
 Identities = 14/68 (20%), Positives = 20/68 (29%), Gaps = 12/68 (17%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           KKKKK+ KK                         S    E      +  + KK  KK+  
Sbjct: 186 KKKKKKTKKNNATGSSAEATVS------------SAVPTELSSGAGQVGEAKKLKKKRSI 233

Query: 283 KKKRKKKK 290
               +K +
Sbjct: 234 APDNEKSE 241


>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355).  This
           family of proteins is found in bacteria and viruses.
           Proteins in this family are typically between 180 and
           214 amino acids in length.
          Length = 125

 Score = 39.5 bits (93), Expect = 3e-04
 Identities = 16/81 (19%), Positives = 33/81 (40%), Gaps = 10/81 (12%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           +K    K+  K   K+K K +K Q             + E EK  K   ++K   + +K 
Sbjct: 5   EKTFTDKEVDKAIAKEKAKWEKKQE----------EKKSEAEKLAKMSAEEKAEYELEKL 54

Query: 283 KKKRKKKKNQHHRDVWQKKYT 303
           +K+ ++ + +  R   + +  
Sbjct: 55  EKELEELEAELARRELKAEAK 75



 Score = 38.0 bits (89), Expect = 0.001
 Identities = 15/71 (21%), Positives = 33/71 (46%), Gaps = 13/71 (18%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
              K+K K +KK+++KK   +K  +                EEK + + +K +K  ++ +
Sbjct: 16  AIAKEKAKWEKKQEEKKSEAEKLAK-------------MSAEEKAEYELEKLEKELEELE 62

Query: 282 RKKKRKKKKNQ 292
            +  R++ K +
Sbjct: 63  AELARRELKAE 73



 Score = 36.9 bits (86), Expect = 0.003
 Identities = 15/68 (22%), Positives = 35/68 (51%), Gaps = 7/68 (10%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
           + K+K + +KK+++KK + +K           +   + E E EK +K+ ++ +    +++
Sbjct: 17  IAKEKAKWEKKQEEKKSEAEKLAKM-------SAEEKAEYELEKLEKELEELEAELARRE 69

Query: 282 RKKKRKKK 289
            K + KK 
Sbjct: 70  LKAEAKKM 77


>gnl|CDD|240576 cd12932, RRP7_like, RRP7 domain ribosomal RNA-processing protein 7
           (Rrp7p), ribosomal RNA-processing protein 7 homolog A
           (Rrp7A), and similar proteins.  This CD corresponds to
           the RRP7 domain of Rrp7p and Rrp7A. Rrp7p is encoded by
           YCL031C gene from Saccharomyces cerevisiae. It is an
           essential yeast protein involved in pre-rRNA processing
           and ribosome assembly, and is speculated to be required
           for correct assembly of rpS27 into the pre-ribosomal
           particle. Rrp7A, also termed gastric cancer antigen
           Zg14, is the Rrp7p homolog mainly found in Metazoans.
           The cellular function of Rrp7A remains unclear
           currently. Both Rrp7p and Rrp7A harbor an N-terminal RNA
           recognition motif (RRM), also termed RBD (RNA binding
           domain) or RNP (ribonucleoprotein domain), and a
           C-terminal RRP7 domain.
          Length = 118

 Score = 39.2 bits (92), Expect = 3e-04
 Identities = 22/72 (30%), Positives = 43/72 (59%), Gaps = 10/72 (13%)

Query: 231 KKKKKKKKKRKKKKNQHHRDVWI--------NRTSRGEEEEEKKKKKKKKKKKTTKKKK- 281
           K+++++K++ K+ +N+   D ++         +T+R E  E K K+K+KKKKK  + +  
Sbjct: 32  KREEEEKEEAKEARNEPDEDGFVTVTRGGRKGKTAREEAVEAKAKEKEKKKKKKKELEDF 91

Query: 282 -RKKKRKKKKNQ 292
            R + R+KKK +
Sbjct: 92  YRFQIREKKKEE 103



 Score = 32.2 bits (74), Expect = 0.099
 Identities = 18/62 (29%), Positives = 32/62 (51%), Gaps = 11/62 (17%)

Query: 218 PGFVMKKKKKRKKKKKKKKKKKRKKKKN----QHHRDVWINRTSRGEEEEEKKKKKKKKK 273
                ++  + K K+K+KKKKK+K+ ++    Q        R  + EE  E +KK ++ K
Sbjct: 63  GKTAREEAVEAKAKEKEKKKKKKKELEDFYRFQI-------REKKKEELAELRKKFEEDK 115

Query: 274 KK 275
           K+
Sbjct: 116 KR 117


>gnl|CDD|218899 pfam06102, DUF947, Domain of unknown function (DUF947).  Family of
           eukaryotic proteins with unknown function.
          Length = 168

 Score = 40.0 bits (94), Expect = 4e-04
 Identities = 20/70 (28%), Positives = 38/70 (54%), Gaps = 2/70 (2%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTK--KK 280
            K K R+++  K+ KK+ K+   +  +  ++ ++   +   +KK  + KK K+  K  +K
Sbjct: 98  LKNKDREREILKEHKKQEKELIKEGKKPYYLKKSEIKKLVLKKKFDELKKSKQLDKALEK 157

Query: 281 KRKKKRKKKK 290
           KRKK   K+K
Sbjct: 158 KRKKNAGKEK 167



 Score = 32.6 bits (75), Expect = 0.13
 Identities = 20/72 (27%), Positives = 35/72 (48%), Gaps = 1/72 (1%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEE-EEKKKKKKKKKKKTTKKKK 281
           KK K  ++K++ K+  +  K + +  ++    R    E + +EK+  K+ KK    KK +
Sbjct: 73  KKTKDSEEKEELKRTLQSMKSRLKTLKNKDREREILKEHKKQEKELIKEGKKPYYLKKSE 132

Query: 282 RKKKRKKKKNQH 293
            KK   KKK   
Sbjct: 133 IKKLVLKKKFDE 144



 Score = 28.8 bits (65), Expect = 2.3
 Identities = 10/23 (43%), Positives = 15/23 (65%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKK 244
           +KK K+  K  +KK+KK   K+K
Sbjct: 145 LKKSKQLDKALEKKRKKNAGKEK 167



 Score = 28.4 bits (64), Expect = 3.0
 Identities = 14/78 (17%), Positives = 35/78 (44%), Gaps = 7/78 (8%)

Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
            ++  K  KK K  +++++ K             +   +  K K ++++  K  KK++++
Sbjct: 65  IEELEKALKKTKDSEEKEELKRTLQS-------MKSRLKTLKNKDREREILKEHKKQEKE 117

Query: 284 KKRKKKKNQHHRDVWQKK 301
             ++ KK  + +    KK
Sbjct: 118 LIKEGKKPYYLKKSEIKK 135



 Score = 28.4 bits (64), Expect = 3.3
 Identities = 16/62 (25%), Positives = 26/62 (41%), Gaps = 19/62 (30%)

Query: 220 FVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK 279
           + +KK + +K   KKK  + +K K+                     K  +KK+KK   K+
Sbjct: 126 YYLKKSEIKKLVLKKKFDELKKSKQ-------------------LDKALEKKRKKNAGKE 166

Query: 280 KK 281
           KK
Sbjct: 167 KK 168



 Score = 27.2 bits (61), Expect = 8.2
 Identities = 11/63 (17%), Positives = 31/63 (49%)

Query: 230 KKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKK 289
           +K+ ++ +K  KK K+   ++         +   +  K K ++++   + KK++K+  K+
Sbjct: 62  EKEIEELEKALKKTKDSEEKEELKRTLQSMKSRLKTLKNKDREREILKEHKKQEKELIKE 121

Query: 290 KNQ 292
             +
Sbjct: 122 GKK 124


>gnl|CDD|165177 PHA02834, PHA02834, chemokine receptor-like protein; Provisional.
          Length = 323

 Score = 41.0 bits (96), Expect = 5e-04
 Identities = 46/157 (29%), Positives = 77/157 (49%), Gaps = 15/157 (9%)

Query: 46  MLMCFIIITAILGNLLVIISVIKHRKLRIITNYFVVSLAFADLLVALCVMPFNAIVSLTD 105
           +    + I  ++GN+LVI  +I  R + ++  Y + ++A +DL++     PF  I +  +
Sbjct: 33  VFYILLFIFGLIGNVLVIAVLIVKRFMFVVDVY-LFNIAMSDLMLVFS-FPF-IIHNDLN 89

Query: 106 EWYFGYFMCDVWNS--FDVYFSTASILHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLS 163
           EW FG FMC +     F  +FS    + L  IS+DRY  +V   +        +VLL ++
Sbjct: 90  EWIFGEFMCKLVLGVYFVGFFSNMFFVTL--ISIDRYILVVNATKIKNKSISLSVLLSVA 147

Query: 164 NVWILPGIISFTPIMLGWYTTPDHKQYRKHHPNVCIF 200
             W+   I+S  P M+ +Y   D+    K     CIF
Sbjct: 148 -AWVCSVILS-MPAMVLYYV--DNTDNLKQ----CIF 176


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 40.9 bits (96), Expect = 5e-04
 Identities = 19/78 (24%), Positives = 35/78 (44%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
              ++  K      KK +K+ KN   +     +  + +E++E K + + K    T KK +
Sbjct: 41  TFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSK 100

Query: 283 KKKRKKKKNQHHRDVWQK 300
           K K+K  K + + DV   
Sbjct: 101 KTKKKPPKPKPNEDVDNA 118



 Score = 37.8 bits (88), Expect = 0.005
 Identities = 17/65 (26%), Positives = 30/65 (46%), Gaps = 7/65 (10%)

Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKK 285
               +++ K      KK K         ++ +  +++ EKKKKKKK+KK+   + + K  
Sbjct: 40  STFSEEENKVATTSTKKDKK-------EDKNNESKKKSEKKKKKKKEKKEPKSEGETKLG 92

Query: 286 RKKKK 290
            K  K
Sbjct: 93  FKTPK 97



 Score = 33.9 bits (78), Expect = 0.084
 Identities = 27/78 (34%), Positives = 33/78 (42%), Gaps = 13/78 (16%)

Query: 221 VMKKKKKRKK----KKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKT 276
             KK KK  K    KKK +KKKK+KK+K            S GE +   K  KK KK K 
Sbjct: 53  STKKDKKEDKNNESKKKSEKKKKKKKEKK--------EPKSEGETKLGFKTPKKSKKTKK 104

Query: 277 TK-KKKRKKKRKKKKNQH 293
              K K  +      N+ 
Sbjct: 105 KPPKPKPNEDVDNAFNKI 122



 Score = 32.4 bits (74), Expect = 0.27
 Identities = 19/77 (24%), Positives = 34/77 (44%), Gaps = 14/77 (18%)

Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
           +++ K      KK K++ K N              E +++ +KKKKKKK+K   K + + 
Sbjct: 44  EEENKVATTSTKKDKKEDKNN--------------ESKKKSEKKKKKKKEKKEPKSEGET 89

Query: 285 KRKKKKNQHHRDVWQKK 301
           K   K  +  +   +K 
Sbjct: 90  KLGFKTPKKSKKTKKKP 106



 Score = 29.7 bits (67), Expect = 2.1
 Identities = 18/70 (25%), Positives = 30/70 (42%), Gaps = 8/70 (11%)

Query: 231 KKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKK--------KKKTTKKKKR 282
           +K K +                ++  S  E +      KK K        KKK+ KKKK+
Sbjct: 17  QKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKK 76

Query: 283 KKKRKKKKNQ 292
           KK++K+ K++
Sbjct: 77  KKEKKEPKSE 86



 Score = 28.1 bits (63), Expect = 6.8
 Identities = 17/69 (24%), Positives = 31/69 (44%)

Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
           +K K +             K+         N+ +    +++KK+ K  + KK ++KKK+K
Sbjct: 17  QKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKK 76

Query: 284 KKRKKKKNQ 292
           KK KK+   
Sbjct: 77  KKEKKEPKS 85


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
           family is found in eukaryotes; it has several conserved
           tryptophan residues. The function is not known.
          Length = 261

 Score = 40.1 bits (94), Expect = 7e-04
 Identities = 15/86 (17%), Positives = 46/86 (53%), Gaps = 7/86 (8%)

Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKK-------KKKKKKKKT 276
           ++K ++  K++  K K++  ++         +  R   +EE KK       KK K++++ 
Sbjct: 139 RQKAQQAAKQRTPKHKKEAAESASSSLSGSAKPERNVSQEEAKKRLQEWELKKLKQQQQK 198

Query: 277 TKKKKRKKKRKKKKNQHHRDVWQKKY 302
            ++++RK+++K+++ +  +   ++ +
Sbjct: 199 REEERRKQRKKQQEEEERKQKAEEAW 224



 Score = 34.7 bits (80), Expect = 0.044
 Identities = 18/64 (28%), Positives = 39/64 (60%), Gaps = 16/64 (25%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
           +KK K++++K++++++K+RKK++                EEEE+K+K ++  +K  K  K
Sbjct: 189 LKKLKQQQQKREEERRKQRKKQQ----------------EEEERKQKAEEAWQKWMKNVK 232

Query: 282 RKKK 285
           ++ K
Sbjct: 233 QRPK 236



 Score = 33.5 bits (77), Expect = 0.10
 Identities = 9/83 (10%), Positives = 40/83 (48%), Gaps = 4/83 (4%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKK----KKKKKTTK 278
            ++K++++++K++++ + +++  +   + W  + ++   ++   K KK          + 
Sbjct: 108 LEEKQKQEREKEREEAELRQRLAKEKYEEWCRQKAQQAAKQRTPKHKKEAAESASSSLSG 167

Query: 279 KKKRKKKRKKKKNQHHRDVWQKK 301
             K ++   +++ +     W+ K
Sbjct: 168 SAKPERNVSQEEAKKRLQEWELK 190



 Score = 33.2 bits (76), Expect = 0.13
 Identities = 17/79 (21%), Positives = 41/79 (51%), Gaps = 17/79 (21%)

Query: 216 WIPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKK 275
           W+     +++KK +K  ++K+K++R+K++               EE E +++  K+K ++
Sbjct: 92  WLSAKQAQRQKKLQKLLEEKQKQEREKER---------------EEAELRQRLAKEKYEE 136

Query: 276 TTKKKKR--KKKRKKKKNQ 292
             ++K +   K+R  K  +
Sbjct: 137 WCRQKAQQAAKQRTPKHKK 155



 Score = 28.2 bits (63), Expect = 5.3
 Identities = 11/69 (15%), Positives = 37/69 (53%), Gaps = 1/69 (1%)

Query: 234 KKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKK-KKNQ 292
           +  K+ + +++ Q   + W++      +++ +K  ++K+K++  K+++  + R++  K +
Sbjct: 74  RSLKEVKLERQAQEAYENWLSAKQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEK 133

Query: 293 HHRDVWQKK 301
           +     QK 
Sbjct: 134 YEEWCRQKA 142


>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
          Length = 333

 Score = 39.9 bits (93), Expect = 0.001
 Identities = 19/68 (27%), Positives = 25/68 (36%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
            K   +K   K K   K+  KK      V   + ++   +  KK KK  KK      K  
Sbjct: 208 AKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAV 267

Query: 283 KKKRKKKK 290
           KK  KK  
Sbjct: 268 KKAAKKAA 275



 Score = 37.2 bits (86), Expect = 0.007
 Identities = 19/69 (27%), Positives = 22/69 (31%), Gaps = 18/69 (26%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           K   K+   K K   KK  KKK                    K     KK  KT  K  +
Sbjct: 209 KSGAKKASAKAKSAAKKVSKKKA------------------AKTAVSAKKAAKTAAKAAK 250

Query: 283 KKKRKKKKN 291
           K K+  KK 
Sbjct: 251 KAKKTAKKA 259



 Score = 36.0 bits (83), Expect = 0.018
 Identities = 21/68 (30%), Positives = 26/68 (38%), Gaps = 3/68 (4%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           K  K  KK  KK  K   K  K            ++ +++  KK     K K T K  KR
Sbjct: 262 KAAKAVKKAAKKAAKAAAKAAKGAA---KATKGKAKAKKKAGKKAAAGSKAKATAKAPKR 318

Query: 283 KKKRKKKK 290
             K KK K
Sbjct: 319 GAKGKKAK 326



 Score = 35.2 bits (81), Expect = 0.031
 Identities = 20/70 (28%), Positives = 25/70 (35%), Gaps = 10/70 (14%)

Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
             K KK  KK  KK  K  +K  K          + ++   +  K   K  K K   KKK
Sbjct: 249 AKKAKKTAKKALKKAAKAVKKAAK----------KAAKAAAKAAKGAAKATKGKAKAKKK 298

Query: 281 KRKKKRKKKK 290
             KK     K
Sbjct: 299 AGKKAAAGSK 308



 Score = 35.2 bits (81), Expect = 0.032
 Identities = 15/58 (25%), Positives = 24/58 (41%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
            K   +  K   K  K + K K +  +       ++   +  K+  K KK KK TKK+
Sbjct: 275 AKAAAKAAKGAAKATKGKAKAKKKAGKKAAAGSKAKATAKAPKRGAKGKKAKKVTKKR 332



 Score = 35.2 bits (81), Expect = 0.033
 Identities = 17/73 (23%), Positives = 25/73 (34%), Gaps = 8/73 (10%)

Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
            KK  K   K  KK K+  KK          + +    ++  KK  K   K      K  
Sbjct: 238 AKKAAKTAAKAAKKAKKTAKKAL--------KKAAKAVKKAAKKAAKAAAKAAKGAAKAT 289

Query: 284 KKRKKKKNQHHRD 296
           K + K K +  + 
Sbjct: 290 KGKAKAKKKAGKK 302



 Score = 34.1 bits (78), Expect = 0.078
 Identities = 16/61 (26%), Positives = 23/61 (37%), Gaps = 2/61 (3%)

Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK--KK 281
           KK  +   K  K   K  K K +  +       +  + +   K  K+  K K  KK  KK
Sbjct: 272 KKAAKAAAKAAKGAAKATKGKAKAKKKAGKKAAAGSKAKATAKAPKRGAKGKKAKKVTKK 331

Query: 282 R 282
           R
Sbjct: 332 R 332



 Score = 32.6 bits (74), Expect = 0.26
 Identities = 20/70 (28%), Positives = 27/70 (38%), Gaps = 8/70 (11%)

Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
           V  KK  +   K  KK KK  KK         + + ++  ++  KK  K   K      K
Sbjct: 236 VSAKKAAKTAAKAAKKAKKTAKKA--------LKKAAKAVKKAAKKAAKAAAKAAKGAAK 287

Query: 281 KRKKKRKKKK 290
             K K K KK
Sbjct: 288 ATKGKAKAKK 297


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 38.4 bits (90), Expect = 0.001
 Identities = 12/73 (16%), Positives = 37/73 (50%), Gaps = 12/73 (16%)

Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
           +   KK+++++++++ + +   ++ Q             +E  EK+  K K++K+   ++
Sbjct: 94  LGLDKKEKEEEEEEEVEVEELDEEEQI------------DELLEKELAKLKREKRRENER 141

Query: 281 KRKKKRKKKKNQH 293
           K+K+  K++    
Sbjct: 142 KQKEILKEQMKML 154



 Score = 32.3 bits (74), Expect = 0.16
 Identities = 17/76 (22%), Positives = 37/76 (48%), Gaps = 15/76 (19%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKK--KKKKKTTKK 279
           +K +KK +K     KK+K ++++ +              E EE  ++++  +  +K   K
Sbjct: 84  LKWRKKVRKLLGLDKKEKEEEEEEE-------------VEVEELDEEEQIDELLEKELAK 130

Query: 280 KKRKKKRKKKKNQHHR 295
            KR+K+R+ ++ Q   
Sbjct: 131 LKREKRRENERKQKEI 146


>gnl|CDD|221857 pfam12923, RRP7, Ribosomal RNA-processing protein 7 (RRP7).  RRP7
           is an essential protein in yeast that is involved in
           pre-rRNA processing and ribosome assembly. It is
           speculated to be required for correct assembly of rpS27
           into the pre-ribosomal particle.
          Length = 131

 Score = 38.0 bits (89), Expect = 0.001
 Identities = 22/71 (30%), Positives = 36/71 (50%), Gaps = 11/71 (15%)

Query: 218 PGFVMKKKKKRKKKKKKKKKKKRKKKKN----QHHRDVWINRTSRGEEEEEKKKKKKKKK 273
            G    K  + ++K K+KKKKK+K+ +N    Q        R  + EE  E +KK ++ K
Sbjct: 64  AGASRNKAAEERRKLKEKKKKKKKELENFYRFQI-------REKKKEELAELRKKFEEDK 116

Query: 274 KKTTKKKKRKK 284
           K+  + K  +K
Sbjct: 117 KRIEQLKAARK 127



 Score = 33.3 bits (77), Expect = 0.046
 Identities = 15/60 (25%), Positives = 34/60 (56%)

Query: 232 KKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKN 291
           K++++ K+  K +++   D W   T  G + +    + K  +++   K+K+KKK+K+ +N
Sbjct: 32  KREEEAKEEAKARSEPDEDGWTTVTRGGRKRKAGASRNKAAEERRKLKEKKKKKKKELEN 91


>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
          Length = 213

 Score = 39.1 bits (92), Expect = 0.001
 Identities = 13/64 (20%), Positives = 29/64 (45%), Gaps = 14/64 (21%)

Query: 229 KKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKK 288
           K K + ++ +++ KK              R    E++K  ++++K K    +K++K R +
Sbjct: 107 KAKVQAQRAEQQAKK--------------REAAGEKEKAPRRERKPKPKAPRKKRKPRAQ 152

Query: 289 KKNQ 292
           K   
Sbjct: 153 KPEP 156



 Score = 38.8 bits (91), Expect = 0.001
 Identities = 17/73 (23%), Positives = 33/73 (45%), Gaps = 17/73 (23%)

Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
           K K + ++ +++ KKR+                    E+EK  ++++K K    +KKRK 
Sbjct: 107 KAKVQAQRAEQQAKKREA-----------------AGEKEKAPRRERKPKPKAPRKKRKP 149

Query: 285 KRKKKKNQHHRDV 297
           + +K + QH    
Sbjct: 150 RAQKPEPQHTPVS 162



 Score = 37.2 bits (87), Expect = 0.005
 Identities = 12/68 (17%), Positives = 28/68 (41%), Gaps = 15/68 (22%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           K + +R +++ KK++   +K+K                   E+K K K  +KK   + ++
Sbjct: 109 KVQAQRAEQQAKKREAAGEKEK---------------APRRERKPKPKAPRKKRKPRAQK 153

Query: 283 KKKRKKKK 290
            + +    
Sbjct: 154 PEPQHTPV 161



 Score = 36.4 bits (85), Expect = 0.009
 Identities = 11/71 (15%), Positives = 28/71 (39%), Gaps = 17/71 (23%)

Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
           V  ++ +++ KK++   +K K  + +                  K K K  +KK+  + +
Sbjct: 110 VQAQRAEQQAKKREAAGEKEKAPRRER-----------------KPKPKAPRKKRKPRAQ 152

Query: 281 KRKKKRKKKKN 291
           K + +     +
Sbjct: 153 KPEPQHTPVSD 163



 Score = 34.9 bits (81), Expect = 0.029
 Identities = 11/59 (18%), Positives = 22/59 (37%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
           + KK+    +K+K  ++ +K K +  R     R  + E +        +       K K
Sbjct: 118 QAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISELTVGQAVKVK 176



 Score = 31.8 bits (73), Expect = 0.28
 Identities = 7/36 (19%), Positives = 18/36 (50%)

Query: 265 KKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQK 300
            +++ KK++    K+K  +++RK K     +    +
Sbjct: 115 AEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPR 150



 Score = 31.4 bits (72), Expect = 0.43
 Identities = 6/37 (16%), Positives = 19/37 (51%)

Query: 265 KKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQKK 301
           +++ KK++     +K  R++++ K K    +   + +
Sbjct: 116 EQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQ 152



 Score = 28.0 bits (63), Expect = 6.0
 Identities = 7/34 (20%), Positives = 19/34 (55%)

Query: 262 EEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHR 295
           EE K K + ++ ++  KK++   +++K   +  +
Sbjct: 104 EEAKAKVQAQRAEQQAKKREAAGEKEKAPRRERK 137


>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
          Length = 742

 Score = 39.4 bits (92), Expect = 0.002
 Identities = 20/68 (29%), Positives = 26/68 (38%)

Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
           KK + K+KKK KKK        +  D   N             +K K K +  KKK    
Sbjct: 73  KKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVN 132

Query: 285 KRKKKKNQ 292
           K   KK +
Sbjct: 133 KSTNKKKK 140



 Score = 38.7 bits (90), Expect = 0.003
 Identities = 20/69 (28%), Positives = 26/69 (37%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           KK K ++KKK KKK        N         + +        +K K K +K   K    
Sbjct: 73  KKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVN 132

Query: 283 KKKRKKKKN 291
           K   KKKK 
Sbjct: 133 KSTNKKKKK 141



 Score = 37.5 bits (87), Expect = 0.009
 Identities = 22/69 (31%), Positives = 30/69 (43%), Gaps = 2/69 (2%)

Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSR--GEEEEEKKKKKKKKKKKTTKKKKRK 283
           KK K K+KKK KKK     +  +     N + +  G       +K K K +K  KK    
Sbjct: 73  KKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVN 132

Query: 284 KKRKKKKNQ 292
           K   KKK +
Sbjct: 133 KSTNKKKKK 141



 Score = 36.0 bits (83), Expect = 0.025
 Identities = 25/75 (33%), Positives = 32/75 (42%), Gaps = 10/75 (13%)

Query: 224 KKKKRKKKKKKKKK----------KKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKK 273
           KK K K+KKK KKK             K    Q    + I+   + + + EK KKK    
Sbjct: 73  KKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVN 132

Query: 274 KKTTKKKKRKKKRKK 288
           K T KKKK+    K 
Sbjct: 133 KSTNKKKKKVLSSKD 147



 Score = 28.6 bits (64), Expect = 5.5
 Identities = 21/70 (30%), Positives = 33/70 (47%), Gaps = 2/70 (2%)

Query: 227 KRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKR 286
           K  K    K + +   K +  + D  IN+    + +++ KK  K    K +KK K K+K+
Sbjct: 24  KNLKHSSYKIRLESDIKDSLLNLD--INKKLHEKLDKKNKKFNKTDDLKDSKKTKLKQKK 81

Query: 287 KKKKNQHHRD 296
           K KK  H  D
Sbjct: 82  KIKKKLHIDD 91


>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
          Length = 746

 Score = 39.4 bits (93), Expect = 0.002
 Identities = 15/75 (20%), Positives = 41/75 (54%), Gaps = 2/75 (2%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
            +  +   ++ K KK  +KKK  +  +     + +RG +  +  K +++++ +  +++++
Sbjct: 104 ARPAEAAARRPKAKKAAKKKKGPKPKKKKPKRKAARGGKRGKGGKGRRRRRGR--RRRRK 161

Query: 283 KKKRKKKKNQHHRDV 297
           KKK++K   +  R+V
Sbjct: 162 KKKKQKPTEKIPREV 176



 Score = 31.0 bits (71), Expect = 0.84
 Identities = 14/73 (19%), Positives = 31/73 (42%), Gaps = 5/73 (6%)

Query: 222 MKKKKKRKKKKKKKKKKKR--KKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK 279
           M K +  +  K+     K   +K K      + +   S   EEEE +K++ K++ +   K
Sbjct: 1   MSKVRVYELAKELGVSSKELLEKLKEL---GIEVKSHSSTVEEEEARKEEAKREAEEEAK 57

Query: 280 KKRKKKRKKKKNQ 292
            + ++    +  +
Sbjct: 58  AEAEEAAAAEAEE 70



 Score = 28.3 bits (64), Expect = 5.6
 Identities = 11/34 (32%), Positives = 24/34 (70%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRT 256
           K +++R+ +++++KKKK++K   +  R+V I  T
Sbjct: 148 KGRRRRRGRRRRRKKKKKQKPTEKIPREVVIPET 181


>gnl|CDD|220267 pfam09494, Slx4, Slx4 endonuclease.  The Slx4 protein is a
           heteromeric structure-specific endonuclease found in
           fungi. Slx4 with Slx1 acts as a nuclease on branched DNA
           substrates, particularly simple-Y, 5'-flap, or
           replication fork structures by cleaving the strand
           bearing the 5' non-homologous arm at the branch junction
           and thus generating ligatable nicked products from
           5'-flap or replication fork substrates.
          Length = 627

 Score = 38.8 bits (90), Expect = 0.003
 Identities = 15/75 (20%), Positives = 31/75 (41%), Gaps = 5/75 (6%)

Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKK 285
           ++  ++ + ++   R        +    N +      +   KKKK KK K  K+ KRK K
Sbjct: 37  EEENQRLQLRQVISRFTSFESDDQANSGNVSG-----KRVPKKKKIKKPKLRKRTKRKNK 91

Query: 286 RKKKKNQHHRDVWQK 300
           + K     + + ++ 
Sbjct: 92  KIKSLTAFNEENFET 106



 Score = 31.5 bits (71), Expect = 0.63
 Identities = 20/63 (31%), Positives = 31/63 (49%), Gaps = 4/63 (6%)

Query: 219 GFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTK 278
           G V  K+  +KKK KK K +KR K+KN+  +    + T+  EE  E  +        + K
Sbjct: 64  GNVSGKRVPKKKKIKKPKLRKRTKRKNKKIK----SLTAFNEENFETDRAPSLLSYLSGK 119

Query: 279 KKK 281
           + K
Sbjct: 120 QSK 122


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 38.1 bits (89), Expect = 0.003
 Identities = 17/71 (23%), Positives = 35/71 (49%), Gaps = 17/71 (23%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKK----TTK 278
           +K++++ K   K K KK  K K +             E+E+ K++K++K  ++    T +
Sbjct: 44  EKEEEKAKVAAKAKAKKALKAKIE-------------EKEKAKREKEEKGLRELEEDTPE 90

Query: 279 KKKRKKKRKKK 289
            +  +K R +K
Sbjct: 91  DELAEKLRLRK 101



 Score = 37.3 bits (87), Expect = 0.005
 Identities = 13/40 (32%), Positives = 24/40 (60%), Gaps = 6/40 (15%)

Query: 260 EEEEEKKKKKK------KKKKKTTKKKKRKKKRKKKKNQH 293
           EEE+E+K+++K       K KK  K K  +K++ K++ + 
Sbjct: 39  EEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEE 78



 Score = 35.4 bits (82), Expect = 0.021
 Identities = 10/39 (25%), Positives = 20/39 (51%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVW 298
            +   K K KK  K K  +K+K K+++++K  +   +  
Sbjct: 50  AKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDT 88



 Score = 35.0 bits (81), Expect = 0.031
 Identities = 9/31 (29%), Positives = 18/31 (58%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
           + +   K K KK  K   ++K++ K+ K++K
Sbjct: 49  KAKVAAKAKAKKALKAKIEEKEKAKREKEEK 79



 Score = 34.6 bits (80), Expect = 0.040
 Identities = 11/31 (35%), Positives = 19/31 (61%)

Query: 262 EEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
           +EE+ ++K+++K K   K K KK  K K  +
Sbjct: 38  DEEEDEEKEEEKAKVAAKAKAKKALKAKIEE 68



 Score = 31.6 bits (72), Expect = 0.36
 Identities = 9/32 (28%), Positives = 11/32 (34%)

Query: 261 EEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
            + EK  K  KKKK   K K          + 
Sbjct: 195 LKAEKAAKGGKKKKGKAKAKLNVGGANDDDDD 226



 Score = 31.6 bits (72), Expect = 0.37
 Identities = 10/38 (26%), Positives = 16/38 (42%)

Query: 255 RTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
                   + K KK  K K +  +K KR+K+ K  +  
Sbjct: 47  EEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLREL 84



 Score = 30.8 bits (70), Expect = 0.62
 Identities = 9/36 (25%), Positives = 11/36 (30%)

Query: 261 EEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRD 296
            E+  K  KKKK K   K               + D
Sbjct: 197 AEKAAKGGKKKKGKAKAKLNVGGANDDDDDYDGYDD 232



 Score = 30.8 bits (70), Expect = 0.72
 Identities = 12/65 (18%), Positives = 27/65 (41%), Gaps = 21/65 (32%)

Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
           + ++++++K K   K + KK                       K K ++K+K  ++K+ K
Sbjct: 41  EDEEKEEEKAKVAAKAKAKK---------------------ALKAKIEEKEKAKREKEEK 79

Query: 284 KKRKK 288
             R+ 
Sbjct: 80  GLREL 84



 Score = 29.6 bits (67), Expect = 1.6
 Identities = 10/36 (27%), Positives = 14/36 (38%)

Query: 250 DVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKK 285
           +V IN   + E+  +  KKKK K K           
Sbjct: 188 NVLINEKLKAEKAAKGGKKKKGKAKAKLNVGGANDD 223



 Score = 29.2 bits (66), Expect = 2.3
 Identities = 11/23 (47%), Positives = 13/23 (56%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKN 245
           K K ++  K  KKKK K K K N
Sbjct: 194 KLKAEKAAKGGKKKKGKAKAKLN 216



 Score = 29.2 bits (66), Expect = 2.5
 Identities = 10/25 (40%), Positives = 10/25 (40%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQH 247
           K  K  KKKK K K K      N  
Sbjct: 199 KAAKGGKKKKGKAKAKLNVGGANDD 223



 Score = 28.9 bits (65), Expect = 3.0
 Identities = 12/52 (23%), Positives = 24/52 (46%), Gaps = 9/52 (17%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKK 273
            K  K + ++K+K K++K +K   +   D           E+E  +K + +K
Sbjct: 59  KKALKAKIEEKEKAKREKEEKGLRELEED---------TPEDELAEKLRLRK 101



 Score = 28.9 bits (65), Expect = 3.1
 Identities = 11/40 (27%), Positives = 13/40 (32%)

Query: 264 EKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQKKYT 303
           EK K +K  K    KK K K K          D +     
Sbjct: 193 EKLKAEKAAKGGKKKKGKAKAKLNVGGANDDDDDYDGYDD 232



 Score = 28.9 bits (65), Expect = 3.1
 Identities = 8/28 (28%), Positives = 9/28 (32%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRK 287
            E+  K  KKKK K K            
Sbjct: 197 AEKAAKGGKKKKGKAKAKLNVGGANDDD 224



 Score = 28.1 bits (63), Expect = 5.8
 Identities = 10/27 (37%), Positives = 15/27 (55%)

Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQH 247
           ++ +K K +K  K  KKKK K K   +
Sbjct: 190 LINEKLKAEKAAKGGKKKKGKAKAKLN 216



 Score = 27.7 bits (62), Expect = 6.8
 Identities = 8/31 (25%), Positives = 13/31 (41%)

Query: 261 EEEEKKKKKKKKKKKTTKKKKRKKKRKKKKN 291
           E+ + +K  K  KKK  K K +        +
Sbjct: 193 EKLKAEKAAKGGKKKKGKAKAKLNVGGANDD 223


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
           approximately 300 residues, found in plants and
           vertebrates. They contain a highly conserved DDRGK
           motif.
          Length = 189

 Score = 37.4 bits (87), Expect = 0.003
 Identities = 17/65 (26%), Positives = 40/65 (61%), Gaps = 9/65 (13%)

Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKK 285
           KK   KK+ K ++K+ +++ +   +         EE EE+KK ++K++ +  ++++ +++
Sbjct: 1   KKIGAKKRAKLEEKQARRQQREAEE---------EEREERKKLEEKREGERKEEEELEEE 51

Query: 286 RKKKK 290
           R+KKK
Sbjct: 52  REKKK 56



 Score = 37.4 bits (87), Expect = 0.004
 Identities = 15/69 (21%), Positives = 48/69 (69%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           KK+ K ++K+ ++++++ ++++ +  + +   R    +EEEE +++++KKK++  +K++ 
Sbjct: 6   KKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKERE 65

Query: 283 KKKRKKKKN 291
           ++ RK+++ 
Sbjct: 66  EQARKEQEE 74



 Score = 35.8 bits (83), Expect = 0.013
 Identities = 15/75 (20%), Positives = 43/75 (57%), Gaps = 2/75 (2%)

Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK--R 282
           KK   KK+ K ++K+ ++++ +   +    R    E+ E ++K++++ +++  KKK+   
Sbjct: 1   KKIGAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEE 60

Query: 283 KKKRKKKKNQHHRDV 297
           +K+R+++  +   + 
Sbjct: 61  RKEREEQARKEQEEY 75



 Score = 35.4 bits (82), Expect = 0.014
 Identities = 16/69 (23%), Positives = 38/69 (55%), Gaps = 10/69 (14%)

Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
           KK   KK+ K ++K+ R++++               E EE KK ++K++ ++  +++  +
Sbjct: 1   KKIGAKKRAKLEEKQARRQQREAEEE----------EREERKKLEEKREGERKEEEELEE 50

Query: 284 KKRKKKKNQ 292
           ++ KKK+ +
Sbjct: 51  EREKKKEEE 59



 Score = 32.7 bits (75), Expect = 0.15
 Identities = 19/79 (24%), Positives = 46/79 (58%), Gaps = 8/79 (10%)

Query: 222 MKKKKKRKKKKKKKKKKKR--------KKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKK 273
           +  KK+ K ++K+ ++++R        ++KK +  R+         EEE EKKK+++++K
Sbjct: 3   IGAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERK 62

Query: 274 KKTTKKKKRKKKRKKKKNQ 292
           ++  + +K +++ +K K+ 
Sbjct: 63  EREEQARKEQEEYEKLKSS 81


>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
           domain.  This domain is found in a number of different
           types of plant proteins including NAM-like proteins.
          Length = 147

 Score = 37.0 bits (86), Expect = 0.003
 Identities = 20/79 (25%), Positives = 37/79 (46%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
             + KK  KKKKK+         N+ + D     T+  +  E +KK K+K ++   K KK
Sbjct: 22  RSELKKASKKKKKRSNSSPGSTSNEENEDEDDESTAESKRPEGRKKAKEKLRRDKLKAKK 81

Query: 282 RKKKRKKKKNQHHRDVWQK 300
            + +++K+K +       +
Sbjct: 82  EEAEKEKEKEERFMKALAE 100



 Score = 32.7 bits (75), Expect = 0.089
 Identities = 19/63 (30%), Positives = 33/63 (52%), Gaps = 8/63 (12%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           K+K +R K K KK++ +++K+K +               E EK++ + +KKK   K  K 
Sbjct: 69  KEKLRRDKLKAKKEEAEKEKEKEERFMKAL--------AEAEKERAELEKKKAEAKLMKE 120

Query: 283 KKK 285
           +KK
Sbjct: 121 EKK 123



 Score = 32.3 bits (74), Expect = 0.12
 Identities = 19/63 (30%), Positives = 33/63 (52%), Gaps = 15/63 (23%)

Query: 228 RKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRK 287
           RKK K+K ++ K K KK               EE E++K+K+++  K   + +K + + +
Sbjct: 65  RKKAKEKLRRDKLKAKK---------------EEAEKEKEKEERFMKALAEAEKERAELE 109

Query: 288 KKK 290
           KKK
Sbjct: 110 KKK 112



 Score = 30.0 bits (68), Expect = 0.84
 Identities = 18/67 (26%), Positives = 33/67 (49%), Gaps = 13/67 (19%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           K K+K ++ K K KK++ +K+K +             EE   K   + +K++   +KKK 
Sbjct: 67  KAKEKLRRDKLKAKKEEAEKEKEK-------------EERFMKALAEAEKERAELEKKKA 113

Query: 283 KKKRKKK 289
           + K  K+
Sbjct: 114 EAKLMKE 120



 Score = 29.3 bits (66), Expect = 1.6
 Identities = 19/65 (29%), Positives = 34/65 (52%), Gaps = 10/65 (15%)

Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
           + KR + +KK K+K R+ K           +  + E E+EK+K+++  K     +K+R +
Sbjct: 58  ESKRPEGRKKAKEKLRRDKL----------KAKKEEAEKEKEKEERFMKALAEAEKERAE 107

Query: 285 KRKKK 289
             KKK
Sbjct: 108 LEKKK 112



 Score = 28.9 bits (65), Expect = 1.9
 Identities = 19/72 (26%), Positives = 34/72 (47%), Gaps = 19/72 (26%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKK----KKKKKTTK 278
           KK K++ ++ K K KK+  +K+               +E+EE+  K      K++ +  K
Sbjct: 66  KKAKEKLRRDKLKAKKEEAEKE---------------KEKEERFMKALAEAEKERAELEK 110

Query: 279 KKKRKKKRKKKK 290
           KK   K  K++K
Sbjct: 111 KKAEAKLMKEEK 122


>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.  This
           domain is found to the N-terminus of bacterial signal
           peptidases of the S49 family (pfam01343).
          Length = 154

 Score = 37.1 bits (87), Expect = 0.004
 Identities = 19/67 (28%), Positives = 34/67 (50%), Gaps = 5/67 (7%)

Query: 229 KKKKKKKKKKKRKKKKNQHHRDVWINRTSRGE---EEEEKKKKKKKKKKKTTKKKKRKKK 285
            +KKK KK +      N+ ++D  +  +       ++E K  +K +KK +  K K  KKK
Sbjct: 32  AQKKKGKKGELEITDLNEEYKD--LKESLEAALLDKKELKAWEKAEKKAEKAKAKAEKKK 89

Query: 286 RKKKKNQ 292
            KK++ +
Sbjct: 90  AKKEEPK 96



 Score = 32.9 bits (76), Expect = 0.079
 Identities = 14/63 (22%), Positives = 22/63 (34%), Gaps = 25/63 (39%)

Query: 220 FVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK 279
            ++ KK+ +  +K +KK +K K K                           KKK K  + 
Sbjct: 61  ALLDKKELKAWEKAEKKAEKAKAKAE-------------------------KKKAKKEEP 95

Query: 280 KKR 282
           K R
Sbjct: 96  KPR 98



 Score = 30.6 bits (70), Expect = 0.46
 Identities = 11/27 (40%), Positives = 16/27 (59%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKR 286
           E+ E+K +K K K +K   KK+  K R
Sbjct: 72  EKAEKKAEKAKAKAEKKKAKKEEPKPR 98


>gnl|CDD|219572 pfam07780, Spb1_C, Spb1 C-terminal domain.  This presumed domain is
           found at the C-terminus of a family of FtsJ-like
           methyltransferases. Members of this family are involved
           in 60S ribosomal biogenesis.
          Length = 212

 Score = 37.7 bits (88), Expect = 0.004
 Identities = 25/71 (35%), Positives = 37/71 (52%), Gaps = 6/71 (8%)

Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEE---KKKKKKKKKKKT---TKK 279
           K RKK +  K+ +K KKK      D  I+   + ++ ++   K KKKKKK+KKT    KK
Sbjct: 117 KARKKMRALKRLEKAKKKAEAIAEDEDISEREKAKQIKKLYKKAKKKKKKRKKTYVVAKK 176

Query: 280 KKRKKKRKKKK 290
             + K  + K 
Sbjct: 177 GNKGKAGRPKG 187



 Score = 33.4 bits (77), Expect = 0.087
 Identities = 24/77 (31%), Positives = 29/77 (37%), Gaps = 27/77 (35%)

Query: 221 VMKKKKKRKKKKKKK----KKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKK-- 274
           + KK KK+KKK+KK     KK  + K                      + K  K K K  
Sbjct: 156 LYKKAKKKKKKRKKTYVVAKKGNKGKAG--------------------RPKGVKGKYKVV 195

Query: 275 -KTTKKKKRKKKRKKKK 290
               KK  R  KRK KK
Sbjct: 196 DPRMKKDLRALKRKAKK 212


>gnl|CDD|240317 PTZ00217, PTZ00217, flap endonuclease-1; Provisional.
          Length = 393

 Score = 38.1 bits (89), Expect = 0.004
 Identities = 19/67 (28%), Positives = 31/67 (46%), Gaps = 1/67 (1%)

Query: 232 KKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK-KKRKKKRKKKK 290
           K  ++ KK K KK Q   D +   T +  ++   K K KKKKKK       + +  ++ K
Sbjct: 325 KYIERLKKAKTKKTQTRLDSFFTATKKPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAK 384

Query: 291 NQHHRDV 297
           +   + V
Sbjct: 385 SSGKKKV 391



 Score = 35.4 bits (82), Expect = 0.032
 Identities = 19/55 (34%), Positives = 22/55 (40%), Gaps = 5/55 (9%)

Query: 219 GFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKK 273
            F    KK  KK   K K KK+KKK             S   +E +   KKK KK
Sbjct: 344 SFFTATKKPIKKSNSKAKLKKKKKKAGASAVP-----KSETSQEAKSSGKKKVKK 393


>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
           N-terminal.  SRP is a complex of six distinct
           polypeptides and a 7S RNA that is essential for
           transferring nascent polypeptide chains that are
           destined for export from the cell to the translocation
           apparatus of the endoplasmic reticulum (ER) membrane.
           SRP binds hydrophobic signal sequences as they emerge
           from the ribosome, and arrests translation.
          Length = 272

 Score = 37.8 bits (88), Expect = 0.004
 Identities = 15/84 (17%), Positives = 27/84 (32%), Gaps = 11/84 (13%)

Query: 217 IPGFVMKKKKKRKKKKKKKKKKKRKKKK-----------NQHHRDVWINRTSRGEEEEEK 265
           I     +   KRK++KK ++     +               H        + R ++  + 
Sbjct: 128 IERKPKEPGLKRKQRKKAQESATSPESSPSSTPNSSRPSTPHLLKAKEGPSRRAKKAAKL 187

Query: 266 KKKKKKKKKKTTKKKKRKKKRKKK 289
                   +K+ K K   KK  KK
Sbjct: 188 SSTASSGDEKSPKSKAAPKKAGKK 211



 Score = 30.5 bits (69), Expect = 0.85
 Identities = 17/89 (19%), Positives = 27/89 (30%), Gaps = 23/89 (25%)

Query: 218 PGFVMKKKKKRKKKKKKKKKKKRKK-----------------------KKNQHHRDVWIN 254
                K K+   K+K++KK ++                          K+    R     
Sbjct: 126 SMIERKPKEPGLKRKQRKKAQESATSPESSPSSTPNSSRPSTPHLLKAKEGPSRRAKKAA 185

Query: 255 RTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
           + S      ++K  K K   K   KK RK
Sbjct: 186 KLSSTASSGDEKSPKSKAAPKKAGKKMRK 214


>gnl|CDD|221733 pfam12720, DUF3807, Protein of unknown function (DUF3807).  This is
           a family of conserved fungal proteins of unknown
           function.
          Length = 169

 Score = 37.0 bits (86), Expect = 0.005
 Identities = 7/59 (11%), Positives = 24/59 (40%)

Query: 238 KKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRD 296
           ++  K++ +   +  ++ +        +    +K+ ++    +KRK  R K++      
Sbjct: 68  ERELKEEAEAEEEGEVDASPDAGAVAGESSADRKEAEQQGAAQKRKSCRDKERKSAKDP 126


>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
          Length = 859

 Score = 38.2 bits (90), Expect = 0.005
 Identities = 10/62 (16%), Positives = 16/62 (25%), Gaps = 19/62 (30%)

Query: 228 RKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRK 287
            K +K +       K                       KK   K  +KT ++    +K  
Sbjct: 816 EKPEKLRYLADAPAKDP-------------------AGKKAAVKFSRKTKQQYVASEKDG 856

Query: 288 KK 289
           K 
Sbjct: 857 KA 858



 Score = 37.1 bits (87), Expect = 0.011
 Identities = 11/62 (17%), Positives = 16/62 (25%), Gaps = 20/62 (32%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           K +K R       K    KK                       K  +K K++    +K  
Sbjct: 817 KPEKLRYLADAPAKDPAGKKAA--------------------VKFSRKTKQQYVASEKDG 856

Query: 283 KK 284
           K 
Sbjct: 857 KA 858



 Score = 35.5 bits (83), Expect = 0.030
 Identities = 7/26 (26%), Positives = 9/26 (34%)

Query: 265 KKKKKKKKKKKTTKKKKRKKKRKKKK 290
               K    KK   K  RK K++   
Sbjct: 826 DAPAKDPAGKKAAVKFSRKTKQQYVA 851



 Score = 34.8 bits (81), Expect = 0.050
 Identities = 7/26 (26%), Positives = 12/26 (46%)

Query: 265 KKKKKKKKKKKTTKKKKRKKKRKKKK 290
           K    KK   K ++K K++    +K 
Sbjct: 830 KDPAGKKAAVKFSRKTKQQYVASEKD 855



 Score = 34.4 bits (80), Expect = 0.066
 Identities = 7/54 (12%), Positives = 13/54 (24%), Gaps = 17/54 (31%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKT 276
                  K    KK   +  +K +                 ++    +K  K T
Sbjct: 823 YLADAPAKDPAGKKAAVKFSRKTK-----------------QQYVASEKDGKAT 859



 Score = 33.6 bits (78), Expect = 0.12
 Identities = 9/33 (27%), Positives = 13/33 (39%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
           E+ E+ +       K    KK   K  +K K Q
Sbjct: 816 EKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQ 848



 Score = 32.1 bits (74), Expect = 0.35
 Identities = 9/34 (26%), Positives = 12/34 (35%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQH 293
            E+ EK +       K    KK   K  +K  Q 
Sbjct: 815 AEKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQ 848



 Score = 30.5 bits (70), Expect = 1.3
 Identities = 5/27 (18%), Positives = 10/27 (37%)

Query: 264 EKKKKKKKKKKKTTKKKKRKKKRKKKK 290
           +   K    KK   K  ++ K++    
Sbjct: 826 DAPAKDPAGKKAAVKFSRKTKQQYVAS 852



 Score = 29.4 bits (67), Expect = 2.5
 Identities = 4/26 (15%), Positives = 8/26 (30%)

Query: 265 KKKKKKKKKKKTTKKKKRKKKRKKKK 290
                K    K    K  +K +++  
Sbjct: 825 ADAPAKDPAGKKAAVKFSRKTKQQYV 850


>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
           component YidC; Validated.
          Length = 429

 Score = 37.9 bits (88), Expect = 0.005
 Identities = 16/71 (22%), Positives = 30/71 (42%), Gaps = 2/71 (2%)

Query: 215 FWIPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKK 274
                  +KK +  +K + K +KK+  +K+    R+  INR +R E      + + ++  
Sbjct: 325 LHAENAEIKKTRTAEKNEAKARKKEIAQKRRAAERE--INREARQERAAAMARARARRAA 382

Query: 275 KTTKKKKRKKK 285
              KKK     
Sbjct: 383 VKAKKKGLIDA 393



 Score = 37.1 bits (86), Expect = 0.009
 Identities = 16/96 (16%), Positives = 45/96 (46%), Gaps = 14/96 (14%)

Query: 220 FVMKKKKKRKKKKKKKKKKKRKKKKNQ-------------HHRDVWINRTSRGEEEEEKK 266
              +++ + ++K+K+KK      ++N+             H  +  I +T   E+ E K 
Sbjct: 286 HHAEQRAQYREKQKEKKAFLWTLRRNRLRMIITPWRAPELHAENAEIKKTRTAEKNEAKA 345

Query: 267 KKKK-KKKKKTTKKKKRKKKRKKKKNQHHRDVWQKK 301
           +KK+  +K++  +++  ++ R+++     R   ++ 
Sbjct: 346 RKKEIAQKRRAAEREINREARQERAAAMARARARRA 381


>gnl|CDD|233352 TIGR01310, L7, 60S ribosomal protein L7, eukaryotic.  This model
           describes the eukaryotic 60S (cytosolic) ribosomal
           protein L7 and paralogs that may or may not also be L7.
           Human, Drosophila, and Arabidopsis all have both a
           typical L7 and an L7-related paralog. This family is
           designated subfamily rather than equivalog to reflect
           these uncharacterized paralogs. Members of this family
           average ~ 250 residues in length, somewhat longer than
           the archaeal L30P/L7E homolog (~ 155 residues) and much
           longer than the related bacterial/organellar form (~ 60
           residues).
          Length = 235

 Score = 37.0 bits (86), Expect = 0.006
 Identities = 17/62 (27%), Positives = 33/62 (53%), Gaps = 1/62 (1%)

Query: 221 VMKKKKKRKKKKKK-KKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK 279
           ++KK+K R++   +  K+ K KKK N+  R ++  R     +E  K +++  + K+  KK
Sbjct: 3   LLKKRKARQELAVQVAKQAKAKKKANKKKRKIYFKRAESFVKEYRKAEREIIRLKRLAKK 62

Query: 280 KK 281
             
Sbjct: 63  PG 64



 Score = 30.8 bits (70), Expect = 0.63
 Identities = 8/30 (26%), Positives = 16/30 (53%)

Query: 265 KKKKKKKKKKKTTKKKKRKKKRKKKKNQHH 294
           KK+K +++      K+ + KK+  KK +  
Sbjct: 5   KKRKARQELAVQVAKQAKAKKKANKKKRKI 34



 Score = 30.4 bits (69), Expect = 0.89
 Identities = 10/31 (32%), Positives = 18/31 (58%)

Query: 265 KKKKKKKKKKKTTKKKKRKKKRKKKKNQHHR 295
           K+K +++   +  K+ K KKK  KKK + + 
Sbjct: 6   KRKARQELAVQVAKQAKAKKKANKKKRKIYF 36


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 37.3 bits (86), Expect = 0.007
 Identities = 16/79 (20%), Positives = 38/79 (48%), Gaps = 6/79 (7%)

Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKK------TT 277
           +K++ +K + + KK   +  K + H+   + + S+  E+E + K+ + +KK+        
Sbjct: 279 QKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKREPVAEDLQ 338

Query: 278 KKKKRKKKRKKKKNQHHRD 296
           K K + + +    N+   D
Sbjct: 339 KTKPQVEAQPTSLNEDAID 357



 Score = 31.1 bits (70), Expect = 0.81
 Identities = 16/82 (19%), Positives = 37/82 (45%), Gaps = 7/82 (8%)

Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKK------KKKKKKTTK 278
           K++  KK+    K ++K    Q + D       R +++E K   K       K+ K+  +
Sbjct: 219 KEELDKKQIDADKAQQKADFAQDNADK-QRDEVRQKQQEAKNLPKPADTSSPKEDKQVAE 277

Query: 279 KKKRKKKRKKKKNQHHRDVWQK 300
            +KR+ ++ + + + + +   K
Sbjct: 278 NQKREIEKAQIEIKKNDEEALK 299



 Score = 30.4 bits (68), Expect = 1.2
 Identities = 12/81 (14%), Positives = 37/81 (45%), Gaps = 2/81 (2%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTT--KKK 280
           K+ ++ K++  KK+    K ++         ++      +++++ K   K   T+  K+ 
Sbjct: 213 KRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKED 272

Query: 281 KRKKKRKKKKNQHHRDVWQKK 301
           K+  + +K++ +  +   +K 
Sbjct: 273 KQVAENQKREIEKAQIEIKKN 293



 Score = 28.0 bits (62), Expect = 7.4
 Identities = 11/75 (14%), Positives = 33/75 (44%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
            ++K    +    K++ + ++K+ +         TS  +E+++  + +K++ +K   + K
Sbjct: 232 AQQKADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIK 291

Query: 282 RKKKRKKKKNQHHRD 296
           +  +   K   H   
Sbjct: 292 KNDEEALKAKDHKAF 306


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
           envelope biogenesis, outer membrane].
          Length = 387

 Score = 37.2 bits (86), Expect = 0.008
 Identities = 15/67 (22%), Positives = 24/67 (35%)

Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
           K K      KKK + + K    +   +      +  + E   ++K   +KKK   K K  
Sbjct: 184 KAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAEAAAEEKAAAEKKKAAAKAKAD 243

Query: 284 KKRKKKK 290
           K     K
Sbjct: 244 KAAAAAK 250



 Score = 35.7 bits (82), Expect = 0.025
 Identities = 19/90 (21%), Positives = 51/90 (56%), Gaps = 6/90 (6%)

Query: 218 PGFVMK--KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGE----EEEEKKKKKKK 271
           PG V++   + + ++   KK +++RKKK+ Q   ++   + +  E     E+E+ K +++
Sbjct: 57  PGAVVQQYGRIQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQ 116

Query: 272 KKKKTTKKKKRKKKRKKKKNQHHRDVWQKK 301
           +K+    +K+ + ++K+++ Q  +   ++K
Sbjct: 117 QKQAEEAEKQAQLEQKQQEEQARKAAAEQK 146



 Score = 35.7 bits (82), Expect = 0.026
 Identities = 17/84 (20%), Positives = 31/84 (36%), Gaps = 5/84 (5%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGE-----EEEEKKKKKKKKKKKTT 277
           +KKK    K K   +  + K   +  +       +  E     E    KKK + + K   
Sbjct: 145 QKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAA 204

Query: 278 KKKKRKKKRKKKKNQHHRDVWQKK 301
           +K K + + K K  +      ++K
Sbjct: 205 EKAKAEAEAKAKAEKKAEAAAEEK 228



 Score = 34.5 bits (79), Expect = 0.052
 Identities = 15/68 (22%), Positives = 28/68 (41%), Gaps = 2/68 (2%)

Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
           KKK + + K   +K + + + +   +      +  EE+   +KKK   K K  K     K
Sbjct: 193 KKKAEAEAKAAAEKAKAEAEAKAKAEKKAEAAA--EEKAAAEKKKAAAKAKADKAAAAAK 250

Query: 285 KRKKKKNQ 292
             ++K   
Sbjct: 251 AAERKAAA 258



 Score = 34.2 bits (78), Expect = 0.080
 Identities = 15/73 (20%), Positives = 26/73 (35%), Gaps = 7/73 (9%)

Query: 223 KKKK-----KRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTT 277
            K K     K+K ++  K  ++ K K             +  +   EK K + + K K  
Sbjct: 161 AKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAK--KKAEAEAKAAAEKAKAEAEAKAKAE 218

Query: 278 KKKKRKKKRKKKK 290
           KK +   + K   
Sbjct: 219 KKAEAAAEEKAAA 231



 Score = 33.8 bits (77), Expect = 0.096
 Identities = 15/79 (18%), Positives = 44/79 (55%), Gaps = 7/79 (8%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           K +++RKKK+++  ++ + K+  +  R        +  E+E  K ++++K+ +  +K+ +
Sbjct: 76  KGEQQRKKKEEQVAEELKPKQAAEQER-------LKQLEKERLKAQEQQKQAEEAEKQAQ 128

Query: 283 KKKRKKKKNQHHRDVWQKK 301
            +++++++        QKK
Sbjct: 129 LEQKQQEEQARKAAAEQKK 147



 Score = 33.4 bits (76), Expect = 0.12
 Identities = 11/67 (16%), Positives = 34/67 (50%), Gaps = 1/67 (1%)

Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
           +++K+ ++ +K+ + ++K+++ Q  +     +  + E  + K   +  K K   + KK+ 
Sbjct: 115 EQQKQAEEAEKQAQLEQKQQEEQARKAA-AEQKKKAEAAKAKAAAEAAKLKAAAEAKKKA 173

Query: 284 KKRKKKK 290
           ++  K  
Sbjct: 174 EEAAKAA 180



 Score = 31.8 bits (72), Expect = 0.41
 Identities = 16/69 (23%), Positives = 32/69 (46%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
           +K+ +K + K ++++K+  + +K            +R    E+KKK +  K K   +  K
Sbjct: 103 LKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAK 162

Query: 282 RKKKRKKKK 290
            K   + KK
Sbjct: 163 LKAAAEAKK 171



 Score = 31.5 bits (71), Expect = 0.59
 Identities = 14/74 (18%), Positives = 33/74 (44%), Gaps = 8/74 (10%)

Query: 225 KKKRKKKKKKKKKKKRK---KKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
           +K+ + ++K+++++ RK   ++K +          ++   E  K K   + KKK  +  K
Sbjct: 124 EKQAQLEQKQQEEQARKAAAEQKKKAEAAK-----AKAAAEAAKLKAAAEAKKKAEEAAK 178

Query: 282 RKKKRKKKKNQHHR 295
             ++ K K      
Sbjct: 179 AAEEAKAKAEAAAA 192



 Score = 30.7 bits (69), Expect = 0.93
 Identities = 9/73 (12%), Positives = 37/73 (50%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           ++++ ++ +K++ K ++++K+  +  +   + +  + E+  +   ++KKK +    K   
Sbjct: 99  EQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAA 158

Query: 283 KKKRKKKKNQHHR 295
           +  + K   +  +
Sbjct: 159 EAAKLKAAAEAKK 171


>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
           (DUF1754).  This is a eukaryotic protein family of
           unknown function.
          Length = 90

 Score = 34.7 bits (80), Expect = 0.008
 Identities = 18/74 (24%), Positives = 39/74 (52%), Gaps = 7/74 (9%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKK----KKTTK 278
           KKKKK+KKKK K K++   +K+ +         +   E EE++  +K +++      T  
Sbjct: 20  KKKKKKKKKKNKSKEEVVTEKEEEEKSS---AESDLKEGEEDEDNEKIEQEEDGMNLTEA 76

Query: 279 KKKRKKKRKKKKNQ 292
           ++  ++ ++K+  +
Sbjct: 77  ERAFEEAQRKRLKE 90



 Score = 31.2 bits (71), Expect = 0.12
 Identities = 18/64 (28%), Positives = 31/64 (48%), Gaps = 14/64 (21%)

Query: 220 FVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK 279
            V+  K K K KK   KKKK+KKKK                + +E+   +K++++K++ +
Sbjct: 4   NVVGGKLKLKGKKIDVKKKKKKKKKK--------------NKSKEEVVTEKEEEEKSSAE 49

Query: 280 KKRK 283
              K
Sbjct: 50  SDLK 53



 Score = 28.2 bits (63), Expect = 1.5
 Identities = 14/25 (56%), Positives = 16/25 (64%)

Query: 266 KKKKKKKKKKTTKKKKRKKKRKKKK 290
             K K K KK   KKK+KKK+KK K
Sbjct: 7   GGKLKLKGKKIDVKKKKKKKKKKNK 31



 Score = 28.2 bits (63), Expect = 1.9
 Identities = 11/29 (37%), Positives = 16/29 (55%)

Query: 266 KKKKKKKKKKTTKKKKRKKKRKKKKNQHH 294
           K KK   KKK  KKKK+ K +++   +  
Sbjct: 13  KGKKIDVKKKKKKKKKKNKSKEEVVTEKE 41



 Score = 27.8 bits (62), Expect = 2.3
 Identities = 11/31 (35%), Positives = 18/31 (58%)

Query: 263 EEKKKKKKKKKKKTTKKKKRKKKRKKKKNQH 293
           ++KKKKKKKK K   +    K++ +K   + 
Sbjct: 20  KKKKKKKKKKNKSKEEVVTEKEEEEKSSAES 50



 Score = 26.2 bits (58), Expect = 8.8
 Identities = 15/30 (50%), Positives = 17/30 (56%), Gaps = 2/30 (6%)

Query: 265 KKKKKKKKKKKTTKKKKRKKKRKKKKNQHH 294
           K K KK   KK  KKKK+KKK K K+    
Sbjct: 11  KLKGKKIDVKK--KKKKKKKKNKSKEEVVT 38


>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
           membranes [Cell envelope biogenesis, outer membrane].
          Length = 244

 Score = 36.7 bits (85), Expect = 0.008
 Identities = 15/81 (18%), Positives = 28/81 (34%), Gaps = 5/81 (6%)

Query: 213 ISFWIPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKK 272
                P    + + +  +++ K   +     +             + + E++ KK K K 
Sbjct: 47  KVLEAPTEEPQPEPEPPEEQPKPPTEPETPPE-----PTPPKPKEKPKPEKKPKKPKPKP 101

Query: 273 KKKTTKKKKRKKKRKKKKNQH 293
           K K   K K K + K KK   
Sbjct: 102 KPKPKPKPKVKPQPKPKKPPS 122



 Score = 32.5 bits (74), Expect = 0.18
 Identities = 18/70 (25%), Positives = 24/70 (34%), Gaps = 17/70 (24%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
            +    K K+K K +KK KK K +                  K K K K K K   K K+
Sbjct: 77  PEPTPPKPKEKPKPEKKPKKPKPKP-----------------KPKPKPKPKVKPQPKPKK 119

Query: 283 KKKRKKKKNQ 292
              +   K  
Sbjct: 120 PPSKTAAKAP 129


>gnl|CDD|113514 pfam04747, DUF612, Protein of unknown function, DUF612.  This
           family includes several uncharacterized proteins from
           Caenorhabditis elegans.
          Length = 517

 Score = 37.3 bits (85), Expect = 0.008
 Identities = 22/74 (29%), Positives = 42/74 (56%), Gaps = 8/74 (10%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKK------KKKKKKKKT 276
           KK  +++ ++ + + KKR  ++ +H +  W     R ++E+EKK+      + +KKK+K 
Sbjct: 95  KKAAEKEARRAEAEAKKRAAQEEEHKQ--WKAEQERIQKEQEKKEADLKKLQAEKKKEKA 152

Query: 277 TKKKKRKKKRKKKK 290
            K +K +K  K KK
Sbjct: 153 VKAEKAEKAEKTKK 166



 Score = 33.5 bits (75), Expect = 0.14
 Identities = 19/73 (26%), Positives = 39/73 (53%)

Query: 218 PGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTT 277
           P  V K KK  KKK +K+  K  + ++  + +        R E E +K+  ++++ K+  
Sbjct: 64  PQQVEKVKKSEKKKAQKQIAKDHEAEQKVNAKKAAEKEARRAEAEAKKRAAQEEEHKQWK 123

Query: 278 KKKKRKKKRKKKK 290
            +++R +K ++KK
Sbjct: 124 AEQERIQKEQEKK 136



 Score = 30.8 bits (68), Expect = 0.91
 Identities = 19/83 (22%), Positives = 43/83 (51%), Gaps = 2/83 (2%)

Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
           + ++ ++ +K KK +KKK +K+    H  +  +N     E+E  + + + KK+    ++ 
Sbjct: 60  LTEQPQQVEKVKKSEKKKAQKQIAKDHEAEQKVNAKKAAEKEARRAEAEAKKRAAQEEEH 119

Query: 281 KRKKKRKKK--KNQHHRDVWQKK 301
           K+ K  +++  K Q  ++   KK
Sbjct: 120 KQWKAEQERIQKEQEKKEADLKK 142


>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
          Length = 135

 Score = 35.8 bits (83), Expect = 0.008
 Identities = 16/30 (53%), Positives = 22/30 (73%)

Query: 263 EEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
           +EK K+KK+KKKK  KKKK+  K+  KK +
Sbjct: 106 KEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 34.3 bits (79), Expect = 0.022
 Identities = 15/25 (60%), Positives = 19/25 (76%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQ 246
            KK+KK+KKKKKKKK  K+  KK +
Sbjct: 111 QKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 34.3 bits (79), Expect = 0.026
 Identities = 19/30 (63%), Positives = 23/30 (76%)

Query: 261 EEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
           +E+ K+KK+KKKKKK  KKK  KK  KKKK
Sbjct: 106 KEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 33.9 bits (78), Expect = 0.029
 Identities = 17/32 (53%), Positives = 23/32 (71%), Gaps = 2/32 (6%)

Query: 263 EEKK--KKKKKKKKKTTKKKKRKKKRKKKKNQ 292
           EE+K  K+  K+K K  K+KK+KKK+KKKK  
Sbjct: 96  EEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTS 127



 Score = 32.7 bits (75), Expect = 0.074
 Identities = 16/29 (55%), Positives = 22/29 (75%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKK 288
           E+ ++KK+KKKKKKKK  K  K+  K+KK
Sbjct: 107 EKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 32.7 bits (75), Expect = 0.078
 Identities = 16/26 (61%), Positives = 22/26 (84%)

Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQ 246
           V+K+K K+KK+KKKKKKKK+KK   +
Sbjct: 104 VLKEKAKQKKQKKKKKKKKKKKTSKK 129



 Score = 32.4 bits (74), Expect = 0.11
 Identities = 15/26 (57%), Positives = 19/26 (73%)

Query: 265 KKKKKKKKKKKTTKKKKRKKKRKKKK 290
            K+K K+KK+K  KKKK+KKK  KK 
Sbjct: 105 LKEKAKQKKQKKKKKKKKKKKTSKKA 130



 Score = 31.6 bits (72), Expect = 0.18
 Identities = 16/30 (53%), Positives = 23/30 (76%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKK 289
           +E+ ++KK+KKKKKKK  KK  +K  +KKK
Sbjct: 106 KEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 31.6 bits (72), Expect = 0.20
 Identities = 21/61 (34%), Positives = 32/61 (52%), Gaps = 18/61 (29%)

Query: 220 FVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK 279
           F+++++K  K+  K+K K+K++KKK +                  KKKKKK  KK   KK
Sbjct: 93  FLVEEQKIVKQVLKEKAKQKKQKKKKK------------------KKKKKKTSKKAAKKK 134

Query: 280 K 280
           K
Sbjct: 135 K 135


>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992).  This
           bacterial family of proteins has no known function.
           However, the cis-regulatory yjdF motif, just upstream
           from the gene encoding the proteins for this family, is
           a small non-coding RNA, Rfam:RF01764. The yjdF motif is
           found in many Firmicutes, including Bacillus subtilis.
           In most cases, it resides in potential 5' UTRs of
           homologues of the yjdF gene whose function is unknown.
           However, in Streptococcus thermophilus, a yjdF RNA motif
           is associated with an operon whose protein products
           synthesise nicotinamide adenine dinucleotide (NAD+).
           Also, the S. thermophilus yjdF RNA lacks typical yjdF
           motif consensus features downstream of and including the
           P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
           S. thermophilus RNAs might sense a distinct compound
           that structurally resembles the ligand bound by other
           yjdF RNAs. On the ohter hand, perhaps these RNAs have an
           alternative solution forming a similar binding site, as
           is observed with some SAM riboswitches.
          Length = 132

 Score = 35.7 bits (83), Expect = 0.008
 Identities = 23/75 (30%), Positives = 40/75 (53%), Gaps = 4/75 (5%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKK----KKKTTK 278
            K   +K+KK   K+ +R+  K      +        + E E+ K++KKK    KK+  K
Sbjct: 55  VKVSIKKQKKINPKRLQRQAAKEVKKPGISTKAQQALKLEHERNKQEKKKRSKEKKEEEK 114

Query: 279 KKKRKKKRKKKKNQH 293
           ++KR+ K++KKK +H
Sbjct: 115 ERKRQLKQQKKKAKH 129



 Score = 31.5 bits (72), Expect = 0.23
 Identities = 14/55 (25%), Positives = 30/55 (54%), Gaps = 14/55 (25%)

Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKK 275
            +K + +R K++KKK+ K++K+++ +  R              + K++KKK K +
Sbjct: 90  ALKLEHERNKQEKKKRSKEKKEEEKERKR--------------QLKQQKKKAKHR 130



 Score = 27.2 bits (61), Expect = 6.2
 Identities = 23/94 (24%), Positives = 37/94 (39%), Gaps = 25/94 (26%)

Query: 220 FVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTK- 278
           F++K     K  +  K   K++KK N         R  R      +  K+ KK   +TK 
Sbjct: 41  FILKHLSTLKFTEAVKVSIKKQKKIN-------PKRLQR------QAAKEVKKPGISTKA 87

Query: 279 -----------KKKRKKKRKKKKNQHHRDVWQKK 301
                      K+++KK+ K+KK +      Q K
Sbjct: 88  QQALKLEHERNKQEKKKRSKEKKEEEKERKRQLK 121


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 37.1 bits (86), Expect = 0.010
 Identities = 15/68 (22%), Positives = 39/68 (57%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           K+++  K+  KK +   +K  + +   D      ++ +EE+ K+++ +KK+K+   + K 
Sbjct: 366 KRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKE 425

Query: 283 KKKRKKKK 290
           K+++ ++K
Sbjct: 426 KRQKDERK 433



 Score = 33.2 bits (76), Expect = 0.14
 Identities = 15/63 (23%), Positives = 31/63 (49%), Gaps = 10/63 (15%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
            KK  + K +    ++ + K K+ +           + EE E+K+K++  + K+  +K +
Sbjct: 382 TKKLGEVKDETDASEEAEAKAKEEK----------LKQEENEKKQKEQADEDKEKRQKDE 431

Query: 282 RKK 284
           RKK
Sbjct: 432 RKK 434



 Score = 31.3 bits (71), Expect = 0.69
 Identities = 20/74 (27%), Positives = 34/74 (45%), Gaps = 6/74 (8%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
            K+++  K+  KK +   KK           + T   EE E K K++K K+++  KK+K 
Sbjct: 365 DKRQELLKEYNKKLQDYTKKLGEVK------DETDASEEAEAKAKEEKLKQEENEKKQKE 418

Query: 283 KKKRKKKKNQHHRD 296
           +    K+K Q    
Sbjct: 419 QADEDKEKRQKDER 432


>gnl|CDD|217970 pfam04220, YihI, Der GTPase activator (YihI).  YihI activates the
           GTPase activity of Der, a 50S ribosomal subunit
           stability factor. The stimulation is specific to Der as
           YihI does not stimulate the GTPase activity of Era or
           ObgE. The interaction of YihI with Der requires only the
           C-terminal 78 amino acids of YihI. A yihI deletion
           mutant is viable and shows a shorter lag period, but the
           same post-lag growth rate as a wild-type strain. yihI is
           expressed during the lag period. Overexpression of yihI
           inhibits cell growth and biogenesis of the 50S ribosomal
           subunit. YihI is an unusual, highly hydrophilic protein
           with an uneven distribution of charged residues,
           resulting in an N-terminal region with high pI and a
           C-terminal region with low pI.
          Length = 169

 Score = 35.8 bits (83), Expect = 0.011
 Identities = 20/69 (28%), Positives = 31/69 (44%), Gaps = 16/69 (23%)

Query: 228 RKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKT-------TKKK 280
           RK  K   K   + KKK          R    +E  E+K+KKK+K  K+       ++ +
Sbjct: 1   RKSGKNGPKLAPKGKKKT---------RYELDQEARERKRKKKRKGLKSGSRHNEESESQ 51

Query: 281 KRKKKRKKK 289
           K+K   +KK
Sbjct: 52  KQKGAAQKK 60



 Score = 30.8 bits (70), Expect = 0.45
 Identities = 28/97 (28%), Positives = 34/97 (35%), Gaps = 32/97 (32%)

Query: 222 MKKKKKRKKKKKKKKKK------------KRKKKKNQHHRDVWINRTSRGEEEEEKKKKK 269
            K  K   K   K KKK            KRKKK+            SR  EE E +K+K
Sbjct: 1   RKSGKNGPKLAPKGKKKTRYELDQEARERKRKKKRKGL------KSGSRHNEESESQKQK 54

Query: 270 KKKKKK--------------TTKKKKRKKKRKKKKNQ 292
              +KK                K K  K+  KKKK +
Sbjct: 55  GAAQKKDPRIGSKKPVPLIVEFKVKPEKQAPKKKKVK 91



 Score = 29.2 bits (66), Expect = 1.8
 Identities = 19/75 (25%), Positives = 30/75 (40%), Gaps = 26/75 (34%)

Query: 223 KKKKKRKKKK-------KKKKKKKRKKKKNQHHR---------DVWINRTSRGEEEEEKK 266
           K+KKKRK  K       + + +K++   + +  R          V          E + K
Sbjct: 30  KRKKKRKGLKSGSRHNEESESQKQKGAAQKKDPRIGSKKPVPLIV----------EFKVK 79

Query: 267 KKKKKKKKKTTKKKK 281
            +K+  KKK  KK K
Sbjct: 80  PEKQAPKKKKVKKPK 94


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 36.7 bits (85), Expect = 0.012
 Identities = 20/73 (27%), Positives = 41/73 (56%), Gaps = 6/73 (8%)

Query: 210 SSSISFWIPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKK 269
           S  I++ I  +V K+ +K    ++++++K+ KK++ +   D         E +EE++K++
Sbjct: 16  SEFINYPIYLWVEKEVEKEVPDEEEEEEKEEKKEEEEKTTDK------EEEVDEEEEKEE 69

Query: 270 KKKKKKTTKKKKR 282
           KKKK K  K+   
Sbjct: 70  KKKKTKKVKETTT 82


>gnl|CDD|219564 pfam07771, TSGP1, Tick salivary peptide group 1.  This contains a
           group of peptides derived from a salivary gland cDNA
           library of the tick Ixodes scapularis. Also present are
           peptides from a related tick species, Ixodes ricinus.
           They are characterized by a putative signal peptide
           indicative of secretion and conserved cysteine residues.
          Length = 120

 Score = 34.9 bits (80), Expect = 0.013
 Identities = 17/26 (65%), Positives = 17/26 (65%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKK 285
            E  EK KKKKKK KKT K KK  KK
Sbjct: 94  PEPTEKPKKKKKKSKKTKKPKKSSKK 119



 Score = 33.7 bits (77), Expect = 0.035
 Identities = 14/27 (51%), Positives = 17/27 (62%)

Query: 261 EEEEKKKKKKKKKKKTTKKKKRKKKRK 287
            E  +K KKKKKK K TKK K+  K+ 
Sbjct: 94  PEPTEKPKKKKKKSKKTKKPKKSSKKD 120



 Score = 32.9 bits (75), Expect = 0.050
 Identities = 13/22 (59%), Positives = 16/22 (72%)

Query: 224 KKKKRKKKKKKKKKKKRKKKKN 245
           K KK+KKK KK KK K+  KK+
Sbjct: 99  KPKKKKKKSKKTKKPKKSSKKD 120



 Score = 32.2 bits (73), Expect = 0.093
 Identities = 14/22 (63%), Positives = 15/22 (68%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKK 244
           K KKK+KK KK KK KK  KK 
Sbjct: 99  KPKKKKKKSKKTKKPKKSSKKD 120



 Score = 31.8 bits (72), Expect = 0.15
 Identities = 15/27 (55%), Positives = 17/27 (62%)

Query: 263 EEKKKKKKKKKKKTTKKKKRKKKRKKK 289
            E  +K KKKKKK+ K KK KK  KK 
Sbjct: 94  PEPTEKPKKKKKKSKKTKKPKKSSKKD 120



 Score = 29.5 bits (66), Expect = 0.80
 Identities = 13/22 (59%), Positives = 14/22 (63%)

Query: 229 KKKKKKKKKKKRKKKKNQHHRD 250
           K KKKKKK KK KK K    +D
Sbjct: 99  KPKKKKKKSKKTKKPKKSSKKD 120



 Score = 29.5 bits (66), Expect = 0.88
 Identities = 12/27 (44%), Positives = 15/27 (55%)

Query: 262 EEEKKKKKKKKKKKTTKKKKRKKKRKK 288
            E  +K KKKKKK    KK +K  +K 
Sbjct: 94  PEPTEKPKKKKKKSKKTKKPKKSSKKD 120



 Score = 29.5 bits (66), Expect = 1.0
 Identities = 12/21 (57%), Positives = 14/21 (66%)

Query: 226 KKRKKKKKKKKKKKRKKKKNQ 246
           K +KKKKK KK KK KK   +
Sbjct: 99  KPKKKKKKSKKTKKPKKSSKK 119



 Score = 27.5 bits (61), Expect = 4.6
 Identities = 11/21 (52%), Positives = 11/21 (52%)

Query: 218 PGFVMKKKKKRKKKKKKKKKK 238
           P    KK KK KK KK  KK 
Sbjct: 100 PKKKKKKSKKTKKPKKSSKKD 120


>gnl|CDD|218337 pfam04939, RRS1, Ribosome biogenesis regulatory protein (RRS1).
           This family consists of several eukaryotic ribosome
           biogenesis regulatory (RRS1) proteins. RRS1 is a nuclear
           protein that is essential for the maturation of 25 S
           rRNA and the 60 S ribosomal subunit assembly in
           Saccharomyces cerevisiae.
          Length = 164

 Score = 35.3 bits (82), Expect = 0.014
 Identities = 21/80 (26%), Positives = 29/80 (36%), Gaps = 26/80 (32%)

Query: 225 KKKRKKKKKKKKK--------------KKRKKKKNQHHRDVWI---------NRTSRGEE 261
           KKK  KK+KK K                KR  K  +     WI               ++
Sbjct: 88  KKKGIKKRKKSKLVYDEATGEWVPRWGYKRANKDLE---KDWIIEVKPNDDPKEDPFAKK 144

Query: 262 EEEKKKKKKKKKKKTTKKKK 281
             EKK++  K +K+  K KK
Sbjct: 145 RREKKERVAKNEKRELKNKK 164



 Score = 29.6 bits (67), Expect = 1.4
 Identities = 21/69 (30%), Positives = 31/69 (44%), Gaps = 7/69 (10%)

Query: 231 KKKKKKKKKRKKKKNQHHRDVWINR--TSRGEEEEEKK-----KKKKKKKKKTTKKKKRK 283
           KKK  KK+K+ K         W+ R    R  ++ EK      K     K+    KK+R+
Sbjct: 88  KKKGIKKRKKSKLVYDEATGEWVPRWGYKRANKDLEKDWIIEVKPNDDPKEDPFAKKRRE 147

Query: 284 KKRKKKKNQ 292
           KK +  KN+
Sbjct: 148 KKERVAKNE 156


>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34.  This family represents
           herpes virus protein U79 and cytomegalovirus early
           phosphoprotein P34 (UL112).
          Length = 238

 Score = 36.0 bits (83), Expect = 0.015
 Identities = 15/74 (20%), Positives = 34/74 (45%), Gaps = 10/74 (13%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           K+K+KR+ +  +K K+ R+KK+ +           R  +E+++           +    +
Sbjct: 165 KQKEKRRVEDSQKHKEDRRKKQEE----------KRRNDEDKRPGGGGGSSGGQSGLSTK 214

Query: 283 KKKRKKKKNQHHRD 296
            +  K+K+ +HH  
Sbjct: 215 DEPPKEKRQKHHDP 228



 Score = 27.9 bits (62), Expect = 6.4
 Identities = 13/43 (30%), Positives = 28/43 (65%), Gaps = 2/43 (4%)

Query: 256 TSRGEEEEEKKKK--KKKKKKKTTKKKKRKKKRKKKKNQHHRD 296
            SR + ++E +K+  K+K+K++    +K K+ R+KK+ +  R+
Sbjct: 150 LSRKKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRN 192


>gnl|CDD|220129 pfam09159, Ydc2-catalyt, Mitochondrial resolvase Ydc2 / RNA
           splicing MRS1.  Members of this family adopt a secondary
           structure consisting of two beta sheets and one alpha
           helix, arranged as a beta-alpha-beta motif. Each beta
           sheet has five strands, arranged in a 32145 order, with
           the second strand being antiparallel to the rest.
           Mitochondrial resolvase Ydc2 is capable of resolving
           Holliday junctions and cleaves DNA after 5'-CT-3' and
           5'-TT-3' sequences. This family also contains the
           mitochondrial RNA-splicing protein MRS1 which is
           involved in the excision of group I introns.
          Length = 254

 Score = 35.8 bits (83), Expect = 0.016
 Identities = 16/89 (17%), Positives = 28/89 (31%), Gaps = 11/89 (12%)

Query: 213 ISFWIPGFVMKKKKKRKKKKKKKKKKKRKKKKN-----------QHHRDVWINRTSRGEE 261
            ++W     +  +KK+ + KKK  K  +K + +                           
Sbjct: 142 ANYWCERTEILAEKKKPRSKKKSSKNSKKLRIDLVGEWLSTAISPPDSSPVSLDAREKLR 201

Query: 262 EEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
               K + K K KK+ K+ K   K     
Sbjct: 202 AYLPKWEGKTKNKKSLKEYKDLIKLLDSG 230


>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa).  Members of this
           family of proteins are part of the yeast nuclear pore
           complex-associated pre-60S ribosomal subunit. The family
           functions as a highly conserved exonuclease that is
           required for the 5'-end maturation of 5.8S and 25S
           rRNAs, demonstrating that 5'-end processing also has a
           redundant pathway. Nop25 binds late pre-60S ribosomes,
           accompanying them from the nucleolus to the nuclear
           periphery; and there is evidence for both physical and
           functional links between late 60S subunit processing and
           export.
          Length = 134

 Score = 34.6 bits (80), Expect = 0.017
 Identities = 19/79 (24%), Positives = 43/79 (54%), Gaps = 22/79 (27%)

Query: 225 KKKRKKKKKKKK----KKKRK-------KKKNQHHRDVWINRTSRGEEEEEKKKKKKKKK 273
           KKK+ KKK  ++    K+ RK       K+K Q           R ++ +E+ K+K++++
Sbjct: 1   KKKKAKKKGVEEVVFDKEARKEYLTGFHKRKQQ-----------RRKKAQEEAKEKEREE 49

Query: 274 KKTTKKKKRKKKRKKKKNQ 292
           +   +K+ R++++++ + Q
Sbjct: 50  RIEERKRIREERKQELEKQ 68



 Score = 30.4 bits (69), Expect = 0.46
 Identities = 14/63 (22%), Positives = 37/63 (58%), Gaps = 17/63 (26%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           K+K++R+KK +++ K+K ++++                  EE+K+ ++++K++  K+ K 
Sbjct: 29  KRKQQRRKKAQEEAKEKEREER-----------------IEERKRIREERKQELEKQLKE 71

Query: 283 KKK 285
           +K+
Sbjct: 72  RKE 74


>gnl|CDD|215565 PLN03083, PLN03083, E3 UFM1-protein ligase 1 homolog; Provisional.
          Length = 803

 Score = 36.3 bits (84), Expect = 0.017
 Identities = 17/64 (26%), Positives = 24/64 (37%), Gaps = 8/64 (12%)

Query: 236 KKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR--------KKKRK 287
            K  +KKK            +   +EE+  KK KK +KK   K  K         KK+  
Sbjct: 417 DKGSKKKKGKSTSTKGGTAESIPDDEEDAPKKGKKNQKKGRDKSSKVPSDSKAGGKKESV 476

Query: 288 KKKN 291
           K + 
Sbjct: 477 KSQE 480


>gnl|CDD|217834 pfam03998, Utp11, Utp11 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 239

 Score = 35.4 bits (82), Expect = 0.019
 Identities = 28/72 (38%), Positives = 38/72 (52%), Gaps = 6/72 (8%)

Query: 221 VMKKKKKRKKKKKKKKKK--KRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTK 278
           V +K+KK+  KKK+K  K  K +K++ +  + V      R E + E  KK K KKKK  K
Sbjct: 168 VDEKQKKKSAKKKRKLYKELKERKEREKKLKKV----EQRLELQRELMKKGKGKKKKIVK 223

Query: 279 KKKRKKKRKKKK 290
            K  K   K KK
Sbjct: 224 DKDGKVVYKWKK 235



 Score = 34.7 bits (80), Expect = 0.035
 Identities = 26/73 (35%), Positives = 39/73 (53%), Gaps = 8/73 (10%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
            +KKK  KKK+K  K+ K +K++ +  + V      R E + E  KK K KKKK  K K 
Sbjct: 171 KQKKKSAKKKRKLYKELKERKEREKKLKKV----EQRLELQRELMKKGKGKKKKIVKDKD 226

Query: 282 R----KKKRKKKK 290
                K K+++K+
Sbjct: 227 GKVVYKWKKERKR 239



 Score = 33.1 bits (76), Expect = 0.11
 Identities = 18/79 (22%), Positives = 39/79 (49%), Gaps = 6/79 (7%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
               +++KKK  KKK+K  K+  +        R  + ++ E++ + +++  KK  K KK+
Sbjct: 166 SLVDEKQKKKSAKKKRKLYKELKER-----KEREKKLKKVEQRLELQRELMKKG-KGKKK 219

Query: 283 KKKRKKKKNQHHRDVWQKK 301
           K  + K     ++   ++K
Sbjct: 220 KIVKDKDGKVVYKWKKERK 238



 Score = 33.1 bits (76), Expect = 0.12
 Identities = 21/76 (27%), Positives = 32/76 (42%), Gaps = 10/76 (13%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEK------KKKKKKKKKKT 276
                 K+KKK  KKK++  K+ +        R  + ++ E++        KK K KKK 
Sbjct: 165 TSLVDEKQKKKSAKKKRKLYKELKER----KEREKKLKKVEQRLELQRELMKKGKGKKKK 220

Query: 277 TKKKKRKKKRKKKKNQ 292
             K K  K   K K +
Sbjct: 221 IVKDKDGKVVYKWKKE 236



 Score = 27.3 bits (61), Expect = 9.1
 Identities = 27/97 (27%), Positives = 37/97 (38%), Gaps = 21/97 (21%)

Query: 219 GFVMKKK--KKRKKKKKKKKKK-KRKKKK-----------NQHHRDVWINRTSRGEEEEE 264
           G + KKK  K R K   KK+K  K  K+K                           EE+E
Sbjct: 17  GLLEKKKDYKLRAKDYHKKEKTLKALKEKAAERNPDEFYFGMISSKTDDGVHILKREEKE 76

Query: 265 KKKKKKKKKKKT-------TKKKKRKKKRKKKKNQHH 294
              K++ K  KT       TK++   KK +K K + H
Sbjct: 77  VLSKEQVKLLKTQDLNYVRTKRQAEAKKIEKLKEELH 113


>gnl|CDD|223939 COG1007, NuoN, NADH:ubiquinone oxidoreductase subunit 2 (chain N)
           [Energy production and conversion].
          Length = 475

 Score = 36.1 bits (84), Expect = 0.021
 Identities = 23/114 (20%), Positives = 37/114 (32%), Gaps = 17/114 (14%)

Query: 50  FIIITAILGNLLVIISVIKHRKLRIITNYFVVSLAFADLLVALCVMPFNAIVSLTDEWYF 109
            + ITA++  LLV + + + R         +  LA   LL AL  + F A        + 
Sbjct: 13  ILAITALVL-LLVGLFIRRSRS-------LLAYLAVLGLLAALASLLFIAYRGS---AFA 61

Query: 110 GYFMCDVWNSFDVYFSTASILHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLS 163
           G F+ D            + L    +S    Y                 LL+ +
Sbjct: 62  GAFIVDALALLMKIVVLLASLAALILS--LPYLERAGTFKGEFY----ALLLFA 109


>gnl|CDD|220121 pfam09135, Alb1, Alb1.  Alb1 is a nuclear shuttling factor involved
           in ribosome biogenesis.
          Length = 112

 Score = 34.0 bits (78), Expect = 0.021
 Identities = 17/55 (30%), Positives = 28/55 (50%), Gaps = 1/55 (1%)

Query: 232 KKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKR 286
            K       + K+    R   +   S+ E   +KK++KKK K+ T K++KR+KK 
Sbjct: 17  DKSSLTSLPRAKRTDRKRPGILA-ASKNEGVSKKKQRKKKPKRLTRKQRKRQKKG 70



 Score = 31.7 bits (72), Expect = 0.13
 Identities = 19/60 (31%), Positives = 32/60 (53%), Gaps = 8/60 (13%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
            KK++K+K K+  +K++KR+KK  +    V  +  S       KK+ K   + K  KK+K
Sbjct: 48  KKKQRKKKPKRLTRKQRKRQKKGLERAEIV-TDTLS-------KKRAKSIGRAKYVKKRK 99



 Score = 31.3 bits (71), Expect = 0.20
 Identities = 14/64 (21%), Positives = 25/64 (39%)

Query: 227 KRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKR 286
           +R         K       +  R            + E   KKK++KKK  +  ++++KR
Sbjct: 7   RRAPSPSIDLDKSSLTSLPRAKRTDRKRPGILAASKNEGVSKKKQRKKKPKRLTRKQRKR 66

Query: 287 KKKK 290
           +KK 
Sbjct: 67  QKKG 70


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 36.2 bits (84), Expect = 0.021
 Identities = 19/72 (26%), Positives = 37/72 (51%)

Query: 222  MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
             K  K RK K KKK+KKK+K   ++  +   +  + R + +E++K   K   KK+     
Sbjct: 1164 GKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGS 1223

Query: 282  RKKKRKKKKNQH 293
             ++  +++K + 
Sbjct: 1224 DQEDDEEQKTKP 1235



 Score = 33.9 bits (78), Expect = 0.11
 Identities = 19/75 (25%), Positives = 33/75 (44%)

Query: 217  IPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKT 276
                   K +K K KKK+KKKKK    K++    V  ++    +E+ +   K   KK  +
Sbjct: 1161 KTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNS 1220

Query: 277  TKKKKRKKKRKKKKN 291
            +   +   + +K K 
Sbjct: 1221 SGSDQEDDEEQKTKP 1235



 Score = 32.3 bits (74), Expect = 0.32
 Identities = 23/77 (29%), Positives = 42/77 (54%), Gaps = 7/77 (9%)

Query: 223  KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSR-------GEEEEEKKKKKKKKKKK 275
            K+K ++   + +KK+K+ +K KN   +D+W+    +        EE EEK+  K+++ K 
Sbjct: 1101 KEKVEKLNAELEKKEKELEKLKNTTPKDMWLEDLDKFEEALEEQEEVEEKEIAKEQRLKS 1160

Query: 276  TTKKKKRKKKRKKKKNQ 292
             TK K  K ++ K K +
Sbjct: 1161 KTKGKASKLRKPKLKKK 1177



 Score = 31.9 bits (73), Expect = 0.53
 Identities = 15/75 (20%), Positives = 30/75 (40%), Gaps = 14/75 (18%)

Query: 218  PGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTT 277
            P     KK+        KKKKK +KK                 +++ K + K+    +++
Sbjct: 1305 PTKKKVKKRLEGSLAALKKKKKSEKK--------------TARKKKSKTRVKQASASQSS 1350

Query: 278  KKKKRKKKRKKKKNQ 292
            +  +R +K+K   + 
Sbjct: 1351 RLLRRPRKKKSDSSS 1365



 Score = 31.2 bits (71), Expect = 0.86
 Identities = 16/70 (22%), Positives = 32/70 (45%)

Query: 223  KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
            K KKK KKKKK    K +K     + + V  +   + +++ + KK       +   ++++
Sbjct: 1173 KLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDDEEQK 1232

Query: 283  KKKRKKKKNQ 292
             K +K    +
Sbjct: 1233 TKPKKSSVKR 1242



 Score = 30.8 bits (70), Expect = 0.99
 Identities = 12/61 (19%), Positives = 29/61 (47%)

Query: 230  KKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKK 289
            + ++K+  K+++ K     +   + +    ++E++KKK    K KK +     K+    +
Sbjct: 1146 EVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDE 1205

Query: 290  K 290
            K
Sbjct: 1206 K 1206



 Score = 29.6 bits (67), Expect = 2.8
 Identities = 18/66 (27%), Positives = 25/66 (37%), Gaps = 1/66 (1%)

Query: 223  KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
             K+ K K   K+    +          D   N  S+      KKK KK+ +      KK+
Sbjct: 1266 SKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGSK-PSSPTKKKVKKRLEGSLAALKKK 1324

Query: 283  KKKRKK 288
            KK  KK
Sbjct: 1325 KKSEKK 1330



 Score = 29.2 bits (66), Expect = 3.2
 Identities = 16/77 (20%), Positives = 26/77 (33%), Gaps = 14/77 (18%)

Query: 220  FVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK 279
               +  K+++ K K K K  + +K                 +++EKKKKK    K     
Sbjct: 1148 EEKEIAKEQRLKSKTKGKASKLRKPKL--------------KKKEKKKKKSSADKSKKAS 1193

Query: 280  KKRKKKRKKKKNQHHRD 296
                 KR     +   D
Sbjct: 1194 VVGNSKRVDSDEKRKLD 1210



 Score = 29.2 bits (66), Expect = 3.2
 Identities = 19/78 (24%), Positives = 29/78 (37%), Gaps = 15/78 (19%)

Query: 224  KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
            K+++ K K K K  K RK K                +++E+KKKK    K K        
Sbjct: 1154 KEQRLKSKTKGKASKLRKPKL---------------KKKEKKKKKSSADKSKKASVVGNS 1198

Query: 284  KKRKKKKNQHHRDVWQKK 301
            K+    + +   D    K
Sbjct: 1199 KRVDSDEKRKLDDKPDNK 1216



 Score = 27.7 bits (62), Expect = 9.6
 Identities = 26/83 (31%), Positives = 33/83 (39%), Gaps = 14/83 (16%)

Query: 223  KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRT--------------SRGEEEEEKKKK 268
            KKK+K+KKK    K KK     N    D    R               S  E++EE+K K
Sbjct: 1175 KKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDDEEQKTK 1234

Query: 269  KKKKKKKTTKKKKRKKKRKKKKN 291
             KK   K  K KK    +  + N
Sbjct: 1235 PKKSSVKRLKSKKNNSSKSSEDN 1257


>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family.  This
           family includes gbp a protein from Soybean that binds to
           GAGA element dinucleotide repeat DNA. It seems likely
           that the this domain mediates DNA binding. This putative
           domain contains several conserved cysteines and a
           histidine suggesting this may be a zinc-binding DNA
           interaction domain.
          Length = 301

 Score = 35.6 bits (82), Expect = 0.022
 Identities = 15/47 (31%), Positives = 22/47 (46%)

Query: 246 QHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
            HH +V    T+  E +E KK KK +  K     K +K K+K   + 
Sbjct: 129 MHHLEVPPISTAPPEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSN 175



 Score = 30.2 bits (68), Expect = 1.2
 Identities = 11/53 (20%), Positives = 18/53 (33%)

Query: 217 IPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKK 269
            P    + KK +K +  K  K  + KK  +       +    G +   K   K
Sbjct: 140 APPEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDPRSKPDWK 192



 Score = 28.3 bits (63), Expect = 5.0
 Identities = 18/70 (25%), Positives = 22/70 (31%), Gaps = 21/70 (30%)

Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
            K+ KK KK +  K  K  K                    KK KKK      + K     
Sbjct: 144 AKEVKKPKKGQSPKVPKAPK-------------------PKKPKKKGSVSNRSVKMPGID 184

Query: 285 KRKK--KKNQ 292
            R K   K+Q
Sbjct: 185 PRSKPDWKSQ 194


>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
           complex component).  RNases H are enzymes that
           specifically hydrolyse RNA when annealed to a
           complementary DNA and are present in all living
           organisms. In yeast RNase H2 is composed of a complex of
           three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
           family represents the homologues of Ydr279p. It is not
           known whether non yeast proteins in this family fulfil
           the same function.
          Length = 287

 Score = 35.8 bits (83), Expect = 0.022
 Identities = 22/86 (25%), Positives = 33/86 (38%), Gaps = 13/86 (15%)

Query: 203 NVYYALISSSISFWIPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEE 262
            +  +L+              KKKKR+ ++  +  + R +KK                +E
Sbjct: 210 ELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKR-------------KSKE 256

Query: 263 EEKKKKKKKKKKKTTKKKKRKKKRKK 288
           E KKKK K+ K     KK   K  KK
Sbjct: 257 EIKKKKPKESKGVKALKKVVAKGMKK 282



 Score = 35.4 bits (82), Expect = 0.029
 Identities = 22/80 (27%), Positives = 36/80 (45%), Gaps = 3/80 (3%)

Query: 213 ISFWIPGFVMKKKKKRKKKKKKKK---KKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKK 269
           +  ++P  + K+  K     + K      K  KKK +   +      SR E++ + K++ 
Sbjct: 199 LCSYLPPDLYKELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEI 258

Query: 270 KKKKKKTTKKKKRKKKRKKK 289
           KKKK K +K  K  KK   K
Sbjct: 259 KKKKPKESKGVKALKKVVAK 278


>gnl|CDD|216581 pfam01577, Peptidase_S30, Potyvirus P1 protease.  The potyviridae
           family positive stand RNA viruses with genome encoding a
           polyprotein. members include zucchini yellow mosaic
           virus, and turnip mosaic viruses which cause
           considerable losses of crops worldwide. This family
           consists of a C terminus region from various plant
           potyvirus P1 proteins (found at the N terminus of the
           polyprotein). The C terminus of P1 is a serine-type
           protease responsible for autocatalytic cleavage between
           P1 and the helper component protease pfam00851. The
           entire P1 protein may be involved in virus-host
           interactions.
          Length = 245

 Score = 35.4 bits (82), Expect = 0.023
 Identities = 18/77 (23%), Positives = 34/77 (44%), Gaps = 5/77 (6%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKK-----KKKKKKKKT 276
                +  KK++K K+++R++++        I         ++  K      +    K+T
Sbjct: 31  KLSPAQVAKKREKLKREEREERQFLQGAYASIVSKITPIGTDKVSKTESVSFRTPYYKRT 90

Query: 277 TKKKKRKKKRKKKKNQH 293
           TKK K+KKK+KK     
Sbjct: 91  TKKMKKKKKKKKVVMSD 107


>gnl|CDD|240578 cd12951, RRP7_Rrp7A, RRP7 domain ribosomal RNA-processing protein 7
           homolog A (Rrp7A) and similar proteins.  The family
           corresponds to the RRP7 domain of Rrp7A, also termed
           gastric cancer antigen Zg14, and similar proteins which
           are yeast ribosomal RNA-processing protein 7 (Rrp7p)
           homologs mainly found in Metazoans. The cellular
           function of Rrp7A remains unclear currently. Rrp7A
           harbors an N-terminal RNA recognition motif (RRM), also
           termed RBD (RNA binding domain) or RNP
           (ribonucleoprotein domain), and a C-terminal RRP7
           domain.
          Length = 129

 Score = 34.2 bits (79), Expect = 0.024
 Identities = 17/63 (26%), Positives = 35/63 (55%), Gaps = 6/63 (9%)

Query: 232 KKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKN 291
           K+++++K+ K+K+ +   D W+  T +G      ++ K  +K+    K   K+K+KKKK 
Sbjct: 32  KEEEEEKEEKEKEAEPDEDGWVTVTKKG------RRPKTARKESVAAKAAEKEKKKKKKK 85

Query: 292 QHH 294
           +  
Sbjct: 86  ELK 88



 Score = 34.2 bits (79), Expect = 0.027
 Identities = 25/73 (34%), Positives = 35/73 (47%), Gaps = 12/73 (16%)

Query: 218 PGFVMKKKKKRKKKKKKKKKKKRKKKKN----QHHRDVWINRTSRGEEEEEKKKKKKKKK 273
           P    K+    K  +K+KKKKK+K+ KN    Q        R S+  E  E +KK ++ K
Sbjct: 62  PKTARKESVAAKAAEKEKKKKKKKELKNFYRFQI-------RESKRNELAELRKKFEEDK 114

Query: 274 KKTTK-KKKRKKK 285
           K+    K  RK K
Sbjct: 115 KRIALLKAARKFK 127



 Score = 32.6 bits (75), Expect = 0.076
 Identities = 20/70 (28%), Positives = 39/70 (55%), Gaps = 7/70 (10%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
           M++  K ++++     K+ K+K+ +   D W+  T +G     K  +K+    K  +K+K
Sbjct: 27  MEEYDKEEEEE-----KEEKEKEAEPDEDGWVTVTKKG--RRPKTARKESVAAKAAEKEK 79

Query: 282 RKKKRKKKKN 291
           +KKK+K+ KN
Sbjct: 80  KKKKKKELKN 89


>gnl|CDD|218148 pfam04557, tRNA_synt_1c_R2, Glutaminyl-tRNA synthetase,
           non-specific RNA binding region part 2.  This is a
           region found N terminal to the catalytic domain of
           glutaminyl-tRNA synthetase (EC 6.1.1.18) in eukaryotes
           but not in Escherichia coli. This region is thought to
           bind RNA in a non-specific manner, enhancing
           interactions between the tRNA and enzyme, but is not
           essential for enzyme function.
          Length = 83

 Score = 33.1 bits (76), Expect = 0.026
 Identities = 12/34 (35%), Positives = 15/34 (44%)

Query: 259 GEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
           G + E    KKKKKKKK   +      + KK   
Sbjct: 15  GPKTEADLVKKKKKKKKKKAEDTAATAKAKKATA 48



 Score = 31.9 bits (73), Expect = 0.057
 Identities = 13/32 (40%), Positives = 15/32 (46%)

Query: 261 EEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
           E +  KKKKKKKKKK        K +K     
Sbjct: 19  EADLVKKKKKKKKKKAEDTAATAKAKKATAED 50



 Score = 30.4 bits (69), Expect = 0.21
 Identities = 12/22 (54%), Positives = 14/22 (63%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKK 244
           KKKKK+KKKK +      K KK
Sbjct: 24  KKKKKKKKKKAEDTAATAKAKK 45



 Score = 30.0 bits (68), Expect = 0.34
 Identities = 11/23 (47%), Positives = 14/23 (60%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKK 244
           + KKKK+KKKKK +      K K
Sbjct: 22  LVKKKKKKKKKKAEDTAATAKAK 44



 Score = 28.9 bits (65), Expect = 0.71
 Identities = 10/24 (41%), Positives = 13/24 (54%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQ 246
           KKKKK+KKK +      + KK   
Sbjct: 25  KKKKKKKKKAEDTAATAKAKKATA 48



 Score = 28.9 bits (65), Expect = 0.76
 Identities = 10/23 (43%), Positives = 13/23 (56%)

Query: 224 KKKKRKKKKKKKKKKKRKKKKNQ 246
             KK+KKKKKKK +      K +
Sbjct: 22  LVKKKKKKKKKKAEDTAATAKAK 44



 Score = 27.7 bits (62), Expect = 2.1
 Identities = 13/59 (22%), Positives = 17/59 (28%), Gaps = 20/59 (33%)

Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
           K +    KKKKKKKK+K +                          K KK       +  
Sbjct: 17  KTEADLVKKKKKKKKKKAED--------------------TAATAKAKKATAEDVSEGA 55



 Score = 26.2 bits (58), Expect = 7.5
 Identities = 8/24 (33%), Positives = 11/24 (45%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQ 246
           KKKKK+K +      K +K     
Sbjct: 27  KKKKKKKAEDTAATAKAKKATAED 50



 Score = 26.2 bits (58), Expect = 7.8
 Identities = 10/31 (32%), Positives = 16/31 (51%)

Query: 255 RTSRGEEEEEKKKKKKKKKKKTTKKKKRKKK 285
           +T     +++KKKKKKK +      K +K  
Sbjct: 17  KTEADLVKKKKKKKKKKAEDTAATAKAKKAT 47


>gnl|CDD|221654 pfam12589, WBS_methylT, Methyltransferase involved in
           Williams-Beuren syndrome.  This domain family is found
           in eukaryotes, and is typically between 72 and 83 amino
           acids in length. The family is found in association with
           pfam08241. This family is made up of
           S-adenosylmethionine-dependent methyltransferases. The
           proteins are deleted in Williams-Beuren syndrome (WBS),
           a complex developmental disorder with multisystemic
           manifestations including supravalvular aortic stenosis
           (SVAS) and a specific cognitive phenotype.
          Length = 85

 Score = 33.0 bits (76), Expect = 0.027
 Identities = 20/66 (30%), Positives = 38/66 (57%), Gaps = 12/66 (18%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
           +++  +R ++KKKKKKK +KK K       WI R      ++E+ +++ K+ K  +K   
Sbjct: 32  VRRISQRNRRKKKKKKKLKKKSKE------WILR------KKEQMRRRGKEVKPDSKYTG 79

Query: 282 RKKKRK 287
           RK++ +
Sbjct: 80  RKRRPR 85



 Score = 29.2 bits (66), Expect = 0.64
 Identities = 15/41 (36%), Positives = 25/41 (60%), Gaps = 5/41 (12%)

Query: 254 NRTSRGEEEEEKKKKKKKKKKKTTK-----KKKRKKKRKKK 289
           ++  R  +   +KKKKKKK KK +K     KK++ ++R K+
Sbjct: 30  SKVRRISQRNRRKKKKKKKLKKKSKEWILRKKEQMRRRGKE 70



 Score = 28.0 bits (63), Expect = 1.8
 Identities = 13/41 (31%), Positives = 23/41 (56%), Gaps = 3/41 (7%)

Query: 253 INRTSRGEEEEEKKKKKKKKKKKTTKKK---KRKKKRKKKK 290
           I+ +      +  ++KKKKKKK   K K    RKK++ +++
Sbjct: 27  IDASKVRRISQRNRRKKKKKKKLKKKSKEWILRKKEQMRRR 67



 Score = 27.6 bits (62), Expect = 2.3
 Identities = 11/32 (34%), Positives = 20/32 (62%)

Query: 259 GEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
            +E+ +  K ++  ++   KKKK+KK +KK K
Sbjct: 23  DDEQIDASKVRRISQRNRRKKKKKKKLKKKSK 54



 Score = 26.5 bits (59), Expect = 5.0
 Identities = 13/42 (30%), Positives = 21/42 (50%)

Query: 254 NRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHR 295
            +    +     ++ ++KKKKK   KKK K+   +KK Q  R
Sbjct: 25  EQIDASKVRRISQRNRRKKKKKKKLKKKSKEWILRKKEQMRR 66



 Score = 26.5 bits (59), Expect = 5.0
 Identities = 11/32 (34%), Positives = 22/32 (68%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKN 291
           E++E+    K ++  +  ++KK+KKK+ KKK+
Sbjct: 22  EDDEQIDASKVRRISQRNRRKKKKKKKLKKKS 53


>gnl|CDD|203848 pfam08079, Ribosomal_L30_N, Ribosomal L30 N-terminal domain.  This
           presumed domain is found at the N-terminus of Ribosomal
           L30 proteins and has been termed RL30NT or NUC018.
          Length = 71

 Score = 32.6 bits (75), Expect = 0.027
 Identities = 15/58 (25%), Positives = 29/58 (50%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
           K+K+  K + K+ KK+  KK   +  R +   R  +  +E    ++ + + K+  KKK
Sbjct: 5   KRKRNEKLRAKRAKKRAAKKAARKKKRKLIFKRAEKYVKEYRAAERDEIRLKREAKKK 62



 Score = 28.3 bits (64), Expect = 0.92
 Identities = 9/39 (23%), Positives = 21/39 (53%)

Query: 262 EEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQK 300
           E+ + K+ KK+  K   +KK++K   K+  ++ ++    
Sbjct: 10  EKLRAKRAKKRAAKKAARKKKRKLIFKRAEKYVKEYRAA 48



 Score = 27.6 bits (62), Expect = 1.8
 Identities = 10/36 (27%), Positives = 19/36 (52%)

Query: 266 KKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQKK 301
             KK+K+ +K   K+ +K+  KK   +  R +  K+
Sbjct: 2   LLKKRKRNEKLRAKRAKKRAAKKAARKKKRKLIFKR 37


>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
           homolog [Translation, ribosomal structure and
           biogenesis].
          Length = 395

 Score = 35.5 bits (82), Expect = 0.028
 Identities = 17/70 (24%), Positives = 31/70 (44%), Gaps = 17/70 (24%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
            ++ +KR +K K+K  K   K K                  E  KK++  + ++  K+KK
Sbjct: 343 REELEKRIEKLKEKPPKPPTKAK-----------------PERDKKERPGRYRRKKKEKK 385

Query: 282 RKKKRKKKKN 291
            K +R+  +N
Sbjct: 386 AKSERRGLQN 395



 Score = 32.8 bits (75), Expect = 0.21
 Identities = 10/41 (24%), Positives = 19/41 (46%)

Query: 255 RTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHR 295
              + +  +   K K ++ KK    + R+KK++KK     R
Sbjct: 351 EKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERR 391


>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
           Genome duplication is precisely regulated by
           cyclin-dependent kinases CDKs, which bring about the
           onset of S phase by activating replication origins and
           then prevent relicensing of origins until mitosis is
           completed. The optimum sequence motif for CDK
           phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
           to have at least 11 potential phosphorylation sites.
           Drc1 is required for DNA synthesis and S-M replication
           checkpoint control. Drc1 associates with Cdc2 and is
           phosphorylated at the onset of S phase when Cdc2 is
           activated. Thus Cdc2 promotes DNA replication by
           phosphorylating Drc1 and regulating its association with
           Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
           substrates required for DNA replication.
          Length = 397

 Score = 35.5 bits (82), Expect = 0.030
 Identities = 21/89 (23%), Positives = 41/89 (46%), Gaps = 16/89 (17%)

Query: 218 PGFVMKKK-KKRKKKK-------KKKKKKKRKKKKNQHHRDVWINRTSR--------GEE 261
           P  V KKK +KR  ++        K   +    + + H     ++  S         G +
Sbjct: 280 PRRVFKKKGQKRTTRRVKMRPVRAKPSDEPSLPESDIHEEIPKLDEKSLSEFLGYMGGID 339

Query: 262 EEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
           E+++ +  ++ K++  KK+K KKK +K+K
Sbjct: 340 EDDEDEDDEESKEEVEKKQKVKKKPRKRK 368



 Score = 28.2 bits (63), Expect = 6.6
 Identities = 18/71 (25%), Positives = 25/71 (35%), Gaps = 10/71 (14%)

Query: 232 KKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK----------K 281
            K  +K   +        D         E +EE +KK+K KKK   +K           K
Sbjct: 321 PKLDEKSLSEFLGYMGGIDEDDEDEDDEESKEEVEKKQKVKKKPRKRKVNPVSNNFRRLK 380

Query: 282 RKKKRKKKKNQ 292
            + KRKK    
Sbjct: 381 LRNKRKKGFKG 391


>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
           subunit [Translation, ribosomal structure and
           biogenesis].
          Length = 591

 Score = 35.4 bits (81), Expect = 0.032
 Identities = 18/63 (28%), Positives = 35/63 (55%)

Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
           K KKRK  +++++KK +    +   + ++        ++EE+ +  KKKKK+  K+KK  
Sbjct: 529 KNKKRKVDEEEEEKKLKMIMMSNKQKKLYKKMKYSNAKKEEQAENLKKKKKQIAKQKKLD 588

Query: 284 KKR 286
            K+
Sbjct: 589 SKK 591



 Score = 31.2 bits (70), Expect = 0.81
 Identities = 19/64 (29%), Positives = 32/64 (50%), Gaps = 3/64 (4%)

Query: 230 KKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKR--- 286
           K   K K KKRK  + +  + + +   S  +++  KK K    KK+   +  +KKK+   
Sbjct: 523 KDVNKSKNKKRKVDEEEEEKKLKMIMMSNKQKKLYKKMKYSNAKKEEQAENLKKKKKQIA 582

Query: 287 KKKK 290
           K+KK
Sbjct: 583 KQKK 586



 Score = 29.7 bits (66), Expect = 2.1
 Identities = 15/64 (23%), Positives = 35/64 (54%)

Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKK 285
           K   K K KK+K   ++++ +    +  N+  +  ++ +    KK+++ +  KKKK++  
Sbjct: 523 KDVNKSKNKKRKVDEEEEEKKLKMIMMSNKQKKLYKKMKYSNAKKEEQAENLKKKKKQIA 582

Query: 286 RKKK 289
           ++KK
Sbjct: 583 KQKK 586


>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
          Length = 158

 Score = 34.4 bits (79), Expect = 0.033
 Identities = 21/68 (30%), Positives = 33/68 (48%), Gaps = 9/68 (13%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           K++K+  K+ ++K+  K KK+K +             +  E+  KKK  K  K T KK  
Sbjct: 97  KEQKEVSKETEEKEAIKAKKEKKE---------KKEKKVAEKLAKKKSTKTTKNTTKKAT 147

Query: 283 KKKRKKKK 290
           KK   KK+
Sbjct: 148 KKTTTKKE 155



 Score = 31.3 bits (71), Expect = 0.32
 Identities = 20/71 (28%), Positives = 34/71 (47%), Gaps = 1/71 (1%)

Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRD-VWINRTSRGEEEEEKKKKKKKKKKKTTKK 279
           +    KK K+  +        KK      +   IN      E+E+K+  K+ ++K+  K 
Sbjct: 55  IFNVDKKPKRLGRFPGFTNSYKKAYVTLAEGYSINLFPEESEKEQKEVSKETEEKEAIKA 114

Query: 280 KKRKKKRKKKK 290
           KK KK++K+KK
Sbjct: 115 KKEKKEKKEKK 125



 Score = 30.9 bits (70), Expect = 0.48
 Identities = 18/72 (25%), Positives = 33/72 (45%), Gaps = 17/72 (23%)

Query: 229 KKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKK 288
           +K++K+  K+  +K+                   + KK+KK+KK+KK  +K  +KK  K 
Sbjct: 96  EKEQKEVSKETEEKEA-----------------IKAKKEKKEKKEKKVAEKLAKKKSTKT 138

Query: 289 KKNQHHRDVWQK 300
            KN   +   + 
Sbjct: 139 TKNTTKKATKKT 150



 Score = 30.1 bits (68), Expect = 0.74
 Identities = 13/44 (29%), Positives = 21/44 (47%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQKKYT 303
            +E E+K+  K KK+K  KK+K+  ++  KK          K  
Sbjct: 103 SKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKA 146



 Score = 29.7 bits (67), Expect = 0.98
 Identities = 19/62 (30%), Positives = 31/62 (50%), Gaps = 10/62 (16%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           ++K+  K KK+KK+KK++K  +    +          +  +  K   KK  KKTT KK+ 
Sbjct: 107 EEKEAIKAKKEKKEKKEKKVAEKLAKK----------KSTKTTKNTTKKATKKTTTKKEE 156

Query: 283 KK 284
            K
Sbjct: 157 GK 158


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
           entry is characterized by proteins with alternating
           conserved and low-complexity regions. Bud13 together
           with Snu17p and a newly identified factor,
           Pml1p/Ylr016c, form a novel trimeric complex. called The
           RES complex, pre-mRNA retention and splicing complex.
           Subunits of this complex are not essential for viability
           of yeasts but they are required for efficient splicing
           in vitro and in vivo. Furthermore, inactivation of this
           complex causes pre-mRNA leakage from the nucleus. Bud13
           contains a unique, phylogenetically conserved C-terminal
           region of unknown function.
          Length = 141

 Score = 33.8 bits (78), Expect = 0.033
 Identities = 16/90 (17%), Positives = 46/90 (51%), Gaps = 17/90 (18%)

Query: 223 KKKKKRKKKKKKK---------KKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKK- 272
           +K++K +K++K+K         ++++++ ++ +  ++  + R +  E+ +E+ K++++  
Sbjct: 24  EKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLARYADDEDYDEELKEQERWD 83

Query: 273 -------KKKTTKKKKRKKKRKKKKNQHHR 295
                  +KK  K  K+ K   K     +R
Sbjct: 84  DPMAQFLRKKKEKTDKKGKPLYKGPFPPNR 113



 Score = 33.4 bits (77), Expect = 0.051
 Identities = 14/62 (22%), Positives = 38/62 (61%), Gaps = 8/62 (12%)

Query: 219 GFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTK 278
           G ++  ++KR++K+++K++K+RK++K             +G  ++E+++K+ ++ +K   
Sbjct: 9   GRIIDIEEKREEKEREKEEKERKEEK--------EKEWGKGLVQKEEREKRLEELEKAKN 60

Query: 279 KK 280
           K 
Sbjct: 61  KP 62



 Score = 30.7 bits (70), Expect = 0.37
 Identities = 17/77 (22%), Positives = 37/77 (48%), Gaps = 18/77 (23%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKK---KKKKTTKK 279
           K  +    ++K+++K++ K++K               E +EEK+K+  K   +K++  K+
Sbjct: 7   KSGRIIDIEEKREEKEREKEEK---------------ERKEEKEKEWGKGLVQKEEREKR 51

Query: 280 KKRKKKRKKKKNQHHRD 296
            +  +K K K    + D
Sbjct: 52  LEELEKAKNKPLARYAD 68



 Score = 30.0 bits (68), Expect = 0.86
 Identities = 12/54 (22%), Positives = 30/54 (55%), Gaps = 8/54 (14%)

Query: 248 HRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK--------KRKKKRKKKKNQH 293
           +RD         E+ EEK+++K++K++K  K+K        K +++++ ++ + 
Sbjct: 4   YRDKSGRIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEK 57


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 35.2 bits (81), Expect = 0.033
 Identities = 18/74 (24%), Positives = 29/74 (39%), Gaps = 8/74 (10%)

Query: 229 KKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKK 288
              KK K+KK             +      G E+E+   +     +   K  KRK +R +
Sbjct: 258 PPSKKPKEKKTSSTFLPSLMGGYF-----SGSEDEDDDDEDIDPDQVVKKPVKRKNRRGQ 312

Query: 289 KKNQHHRDVWQKKY 302
           +  Q    +W+KKY
Sbjct: 313 RARQ---AIWEKKY 323



 Score = 35.2 bits (81), Expect = 0.037
 Identities = 16/42 (38%), Positives = 22/42 (52%)

Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEE 262
           ++  + K KK K KKK KK KKK+ +   D      S  E+E
Sbjct: 139 ILGIETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDE 180



 Score = 33.3 bits (76), Expect = 0.17
 Identities = 17/55 (30%), Positives = 23/55 (41%), Gaps = 1/55 (1%)

Query: 239 KRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQH 293
             K  KN     +       G E +  KK K KKK K +KKK+ K+   K   + 
Sbjct: 121 NSKPVKNVLESIMEGLDKILGIETK-AKKGKAKKKTKKSKKKEAKESSDKDDEEE 174



 Score = 31.7 bits (72), Expect = 0.50
 Identities = 14/43 (32%), Positives = 21/43 (48%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEK 265
            KK K KKK KK KKK+ K+  ++   +   +      EE  +
Sbjct: 146 AKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAE 188



 Score = 29.8 bits (67), Expect = 1.7
 Identities = 17/67 (25%), Positives = 33/67 (49%), Gaps = 1/67 (1%)

Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
           KK     K  KK+++K +K++     + W  R ++ E  + K  +  +     T++K  +
Sbjct: 321 KKYGSGAKHVKKEREKEQKEREGRQSE-WEARQAKREGGDAKAGRAAEPTGSRTQQKGDR 379

Query: 284 KKRKKKK 290
            KR +KK
Sbjct: 380 PKRGEKK 386



 Score = 28.6 bits (64), Expect = 4.5
 Identities = 13/41 (31%), Positives = 18/41 (43%)

Query: 228 RKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKK 268
             K KK K KKK KK K +  ++          E E++ K 
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKS 183



 Score = 28.6 bits (64), Expect = 4.8
 Identities = 13/53 (24%), Positives = 24/53 (45%), Gaps = 5/53 (9%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKK 274
           + K    + K KK K KK+ KK  +          S  +++EE+ + + + K 
Sbjct: 136 LDKILGIETKAKKGKAKKKTKKSKKKEAK-----ESSDKDDEEESESEDESKS 183



 Score = 27.5 bits (61), Expect = 9.6
 Identities = 17/74 (22%), Positives = 28/74 (37%), Gaps = 17/74 (22%)

Query: 218 PGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKK----------- 266
             +  ++ K+     K  +  +    + Q   D    R  RGE+++ KK           
Sbjct: 346 SEWEARQAKREGGDAKAGRAAEPTGSRTQQKGD----RPKRGEKKKPKKPSVDKPLHPSW 401

Query: 267 --KKKKKKKKKTTK 278
             KKK K+KK   K
Sbjct: 402 EAKKKAKEKKANAK 415


>gnl|CDD|219947 pfam08639, SLD3, DNA replication regulator SLD3.  The SLD3 DNA
           replication regulator is required for loading and
           maintenance of Cdc45 on chromatin during DNA
           replication.
          Length = 437

 Score = 35.2 bits (81), Expect = 0.036
 Identities = 12/36 (33%), Positives = 22/36 (61%)

Query: 262 EEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDV 297
           E+E KKK  K+ +   + K+RK++++KK  +    V
Sbjct: 127 EKEYKKKLLKRSQNLDRSKRRKRRKRKKNKKQDLPV 162



 Score = 31.3 bits (71), Expect = 0.55
 Identities = 14/37 (37%), Positives = 23/37 (62%)

Query: 255 RTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKN 291
              R  + E++ KKK  K+ +   + KR+K+RK+KKN
Sbjct: 119 ELDRFTKFEKEYKKKLLKRSQNLDRSKRRKRRKRKKN 155



 Score = 29.8 bits (67), Expect = 1.9
 Identities = 13/38 (34%), Positives = 21/38 (55%)

Query: 256 TSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQH 293
                E+E KKK  K+ +     K+++++KRKK K Q 
Sbjct: 122 RFTKFEKEYKKKLLKRSQNLDRSKRRKRRKRKKNKKQD 159


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 35.4 bits (82), Expect = 0.037
 Identities = 14/59 (23%), Positives = 22/59 (37%), Gaps = 7/59 (11%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
            K      K  KK+  K+K            ++ +    +   K KKKKKK+K+     
Sbjct: 497 LKTTSSVGKSAKKQDSKKKSSSKL-------DKAANKISKAAVKVKKKKKKEKSIDLDD 548



 Score = 34.3 bits (79), Expect = 0.080
 Identities = 19/51 (37%), Positives = 27/51 (52%), Gaps = 14/51 (27%)

Query: 221 VMKKKKKRKKKKKKK--KKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKK 269
             K+K+KR+   K +  KK+KRK KK ++   V IN         EK+ KK
Sbjct: 630 KKKRKRKRRFLTKIEGVKKEKRKDKKLKN---VIIN---------EKRNKK 668



 Score = 32.7 bits (75), Expect = 0.22
 Identities = 13/32 (40%), Positives = 23/32 (71%), Gaps = 2/32 (6%)

Query: 264 EKKKKKKKKKKKTTKKKKR--KKKRKKKKNQH 293
           +K+KKK+K+K++   K +   K+KRK KK ++
Sbjct: 627 KKRKKKRKRKRRFLTKIEGVKKEKRKDKKLKN 658



 Score = 32.7 bits (75), Expect = 0.26
 Identities = 22/81 (27%), Positives = 37/81 (45%), Gaps = 16/81 (19%)

Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKK-------- 272
           V  K  KR +K +K+++++   ++N      W+  TS    +  KK+  KKK        
Sbjct: 470 VANKLLKRSEKAQKEEEEEELDEENP-----WLKTTSS-VGKSAKKQDSKKKSSSKLDKA 523

Query: 273 --KKKTTKKKKRKKKRKKKKN 291
             K      K +KKK+K+K  
Sbjct: 524 ANKISKAAVKVKKKKKKEKSI 544



 Score = 31.6 bits (72), Expect = 0.51
 Identities = 21/79 (26%), Positives = 34/79 (43%), Gaps = 4/79 (5%)

Query: 224 KKKKRKKKKKKKK----KKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK 279
            K KR KK K KK     KK K K+     +  +        EE +K ++++ +++ T K
Sbjct: 218 AKAKRIKKIKSKKYHRVHKKEKLKEELKEFEELVKADPEAALEELEKLERRRAEERMTLK 277

Query: 280 KKRKKKRKKKKNQHHRDVW 298
            K   K  K   +  R  +
Sbjct: 278 HKNTSKWAKSMLKTGRAKY 296



 Score = 31.6 bits (72), Expect = 0.62
 Identities = 19/100 (19%), Positives = 36/100 (36%), Gaps = 21/100 (21%)

Query: 218 PGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKK--- 274
           P    K+ + +K KK+ K + K KK+ ++             +   +  K+ +K +K   
Sbjct: 427 PENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEE 486

Query: 275 ------------------KTTKKKKRKKKRKKKKNQHHRD 296
                             K+ KK+  KKK   K ++    
Sbjct: 487 EEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANK 526



 Score = 31.2 bits (71), Expect = 0.74
 Identities = 12/32 (37%), Positives = 21/32 (65%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKN 291
           ++ ++K+K+K++   K    KK K+K KK KN
Sbjct: 627 KKRKKKRKRKRRFLTKIEGVKKEKRKDKKLKN 658



 Score = 30.0 bits (68), Expect = 1.7
 Identities = 20/60 (33%), Positives = 28/60 (46%), Gaps = 18/60 (30%)

Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKK 285
           KKRKKK+K+K++   K                   E  +K+K+K KK K     +KR KK
Sbjct: 627 KKRKKKRKRKRRFLTKI------------------EGVKKEKRKDKKLKNVIINEKRNKK 668



 Score = 29.3 bits (66), Expect = 3.0
 Identities = 25/124 (20%), Positives = 36/124 (29%), Gaps = 55/124 (44%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEE------------------- 263
             K   K KKKKKK+K      +    +  I      EE+E                   
Sbjct: 527 ISKAAVKVKKKKKKEKSIDLDDDLIDEEDSIKLDVDDEEDEDDEELPFLFKQKDLIKEAF 586

Query: 264 -----------EKKK-------------------------KKKKKKKKTTKKKKRKKKRK 287
                      EKK+                         KK+KKK+K  ++   K +  
Sbjct: 587 AGDDVVAEFEKEKKEVIEEEDPKEIDLTLPGWGSWAGDGIKKRKKKRKRKRRFLTKIEGV 646

Query: 288 KKKN 291
           KK+ 
Sbjct: 647 KKEK 650



 Score = 28.9 bits (65), Expect = 4.6
 Identities = 17/77 (22%), Positives = 38/77 (49%)

Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
             K  ++ + +KK++   +  +  R++     S  EE EE  KK   ++K   +  +++ 
Sbjct: 375 SMKFMQRAEARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEA 434

Query: 285 KRKKKKNQHHRDVWQKK 301
           + KK K ++  +  +KK
Sbjct: 435 ESKKLKKENKNEFKEKK 451



 Score = 27.7 bits (62), Expect = 9.0
 Identities = 11/27 (40%), Positives = 16/27 (59%)

Query: 262 EEEKKKKKKKKKKKTTKKKKRKKKRKK 288
           EE K K+ KK K K   +  +K+K K+
Sbjct: 216 EEAKAKRIKKIKSKKYHRVHKKEKLKE 242


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 35.3 bits (81), Expect = 0.037
 Identities = 19/66 (28%), Positives = 28/66 (42%), Gaps = 17/66 (25%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           KKKK+ K K+K+ KK K  +K                 E + K + ++        KK  
Sbjct: 21  KKKKEEKAKEKELKKLKAAQK-----------------EAKAKLQAQQASDGTNVPKKSE 63

Query: 283 KKKRKK 288
           KK RK+
Sbjct: 64  KKSRKR 69



 Score = 34.1 bits (78), Expect = 0.091
 Identities = 19/77 (24%), Positives = 36/77 (46%), Gaps = 16/77 (20%)

Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK- 284
            + +KK   +++ +RKKKK +           + +E+E KK K  +K+ K   + ++   
Sbjct: 6   SEAEKKILTEEELERKKKKEE-----------KAKEKELKKLKAAQKEAKAKLQAQQASD 54

Query: 285 ----KRKKKKNQHHRDV 297
                +K +K    RDV
Sbjct: 55  GTNVPKKSEKKSRKRDV 71



 Score = 29.5 bits (66), Expect = 2.5
 Identities = 16/67 (23%), Positives = 35/67 (52%), Gaps = 7/67 (10%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           KK    ++ ++KKKK+++ K+K        + +    ++E + K + ++    T   KK 
Sbjct: 10  KKILTEEELERKKKKEEKAKEKE-------LKKLKAAQKEAKAKLQAQQASDGTNVPKKS 62

Query: 283 KKKRKKK 289
           +KK +K+
Sbjct: 63  EKKSRKR 69


>gnl|CDD|220818 pfam10595, UPF0564, Uncharacterized protein family UPF0564.  This
           family of proteins has no known function. However, one
           of the members is annotated as an EF-hand family
           protein.
          Length = 349

 Score = 35.1 bits (81), Expect = 0.039
 Identities = 12/76 (15%), Positives = 43/76 (56%), Gaps = 1/76 (1%)

Query: 217 IPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKT 276
           +  F+  ++K+++ K++++KK+  ++KK +        +      +  +K ++++K++  
Sbjct: 274 LVKFLRTERKEKEAKEQQEKKELEQRKKKKKEMAP-KVKQRFEANDPAQKLQEERKEQLA 332

Query: 277 TKKKKRKKKRKKKKNQ 292
             +K+ K++ K+ + +
Sbjct: 333 KLRKEEKEREKEYEQE 348



 Score = 30.5 bits (69), Expect = 1.0
 Identities = 16/91 (17%), Positives = 38/91 (41%), Gaps = 17/91 (18%)

Query: 220 FVMKKKKKRKKKKKK-----KKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKK 274
           F  ++++K+    +K      +++ +K K     + ++I       +EEE K+K + + +
Sbjct: 94  FYEREEQKKAILPRKLRSSTSEREPKKFKAKPVPKSIYIPLLKDKMQEEELKRKIRVQMR 153

Query: 275 K------------TTKKKKRKKKRKKKKNQH 293
                          K + +++  KKKK   
Sbjct: 154 AQELLQSSRLPPRMAKHEAQERLTKKKKRGQ 184



 Score = 30.5 bits (69), Expect = 1.0
 Identities = 11/62 (17%), Positives = 33/62 (53%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
             ++  +  ++K     K+ +   +  R     + ++ ++E+++ +++KKKKK+   K K
Sbjct: 252 AGEENLKPTRRKLPPSTKKWESLVKFLRTERKEKEAKEQQEKKELEQRKKKKKEMAPKVK 311

Query: 282 RK 283
           ++
Sbjct: 312 QR 313



 Score = 29.7 bits (67), Expect = 1.9
 Identities = 15/63 (23%), Positives = 33/63 (52%)

Query: 227 KRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKR 286
              ++  K  ++K      +    V   RT R E+E +++++KK+ +++  KKK+   K 
Sbjct: 251 SAGEENLKPTRRKLPPSTKKWESLVKFLRTERKEKEAKEQQEKKELEQRKKKKKEMAPKV 310

Query: 287 KKK 289
           K++
Sbjct: 311 KQR 313



 Score = 29.0 bits (65), Expect = 3.0
 Identities = 20/74 (27%), Positives = 35/74 (47%), Gaps = 10/74 (13%)

Query: 217 IPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKK------ 270
           IP    K +++  K+K + + + ++  ++         R ++ E +E   KKKK      
Sbjct: 132 IPLLKDKMQEEELKRKIRVQMRAQELLQSSR----LPPRMAKHEAQERLTKKKKRGQKKS 187

Query: 271 KKKKKTTKKKKRKK 284
           K KKKT K K+ K 
Sbjct: 188 KYKKKTFKPKRAKS 201



 Score = 27.8 bits (62), Expect = 7.1
 Identities = 13/72 (18%), Positives = 33/72 (45%)

Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
             +    ++     ++  K  ++K       W +       E ++K+ K++++KK  +++
Sbjct: 240 SSRTYLDQENISAGEENLKPTRRKLPPSTKKWESLVKFLRTERKEKEAKEQQEKKELEQR 299

Query: 281 KRKKKRKKKKNQ 292
           K+KKK    K +
Sbjct: 300 KKKKKEMAPKVK 311


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 34.0 bits (78), Expect = 0.041
 Identities = 9/33 (27%), Positives = 17/33 (51%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
            +E E++   + K +   KKK+   K K +K +
Sbjct: 106 TDETEQEDPPETKTESKEKKKREVPKPKTEKEK 138



 Score = 33.6 bits (77), Expect = 0.050
 Identities = 9/38 (23%), Positives = 20/38 (52%)

Query: 256 TSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQH 293
               EE  ++ +++   + KT  K+K+K++  K K + 
Sbjct: 99  EETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEK 136



 Score = 32.8 bits (75), Expect = 0.084
 Identities = 12/39 (30%), Positives = 22/39 (56%)

Query: 254 NRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
           + T + +  E K + K+KKK++  K K  K+K K +  +
Sbjct: 107 DETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKK 145



 Score = 31.6 bits (72), Expect = 0.21
 Identities = 16/63 (25%), Positives = 26/63 (41%), Gaps = 18/63 (28%)

Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
           + +++   + K + K KKK+                  E  K K +K+K KT  KK +  
Sbjct: 108 ETEQEDPPETKTESKEKKKR------------------EVPKPKTEKEKPKTEPKKPKPS 149

Query: 285 KRK 287
           K K
Sbjct: 150 KPK 152



 Score = 31.3 bits (71), Expect = 0.28
 Identities = 13/51 (25%), Positives = 21/51 (41%), Gaps = 8/51 (15%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKK 273
           +++   + K + K+KKKR+  K                + E KK K  K K
Sbjct: 110 EQEDPPETKTESKEKKKREVPK--------PKTEKEKPKTEPKKPKPSKPK 152



 Score = 30.1 bits (68), Expect = 0.87
 Identities = 13/31 (41%), Positives = 20/31 (64%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
            + E K+KKK++  K  T+K+K K + KK K
Sbjct: 117 TKTESKEKKKREVPKPKTEKEKPKTEPKKPK 147



 Score = 29.3 bits (66), Expect = 1.4
 Identities = 10/39 (25%), Positives = 21/39 (53%)

Query: 254 NRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
             +S  EEEE +++   + +++   + K + K KKK+  
Sbjct: 91  LSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREV 129



 Score = 29.0 bits (65), Expect = 2.1
 Identities = 11/31 (35%), Positives = 20/31 (64%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
           + E ++KKK++  K KT K+K + + +K K 
Sbjct: 118 KTESKEKKKREVPKPKTEKEKPKTEPKKPKP 148


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 35.1 bits (81), Expect = 0.044
 Identities = 13/61 (21%), Positives = 27/61 (44%)

Query: 223  KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
            + + + +K+   + + +  K+K  +   V  N+    EE+  +   KK+K KK  K    
Sbjct: 1521 ENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESDIKKRKNKKQYKSNTE 1580

Query: 283  K 283
             
Sbjct: 1581 A 1581



 Score = 32.4 bits (74), Expect = 0.40
 Identities = 16/74 (21%), Positives = 30/74 (40%), Gaps = 5/74 (6%)

Query: 223  KKKKKRKKKKKK-----KKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTT 277
            KKK      +KK      + ++ K+   Q   +    +    E     ++K  ++    +
Sbjct: 1505 KKKGFIPSNEKKSIEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAES 1564

Query: 278  KKKKRKKKRKKKKN 291
              KKRK K++ K N
Sbjct: 1565 DIKKRKNKKQYKSN 1578



 Score = 30.1 bits (68), Expect = 1.6
 Identities = 13/72 (18%), Positives = 30/72 (41%)

Query: 219  GFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTK 278
            GF+   +KK  + + + +++K    + +   D             ++K  ++   +   K
Sbjct: 1508 GFIPSNEKKSIEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESDIK 1567

Query: 279  KKKRKKKRKKKK 290
            K+K KK+ K   
Sbjct: 1568 KRKNKKQYKSNT 1579



 Score = 28.1 bits (63), Expect = 7.7
 Identities = 14/28 (50%), Positives = 21/28 (75%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRK 287
           +  EEK KKKKKK+KK  ++ KR++K +
Sbjct: 739 DSVEEKTKKKKKKEKKKEEEYKREEKAR 766


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 34.8 bits (80), Expect = 0.045
 Identities = 15/72 (20%), Positives = 23/72 (31%), Gaps = 11/72 (15%)

Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
           KKK + + KKK   + KKK             +  +    K   + K   +     K  +
Sbjct: 200 KKKAEAEAKKKAAAEAKKKAA-----------AEAKAAAAKAAAEAKAAAEKAAAAKAAE 248

Query: 285 KRKKKKNQHHRD 296
           K    K     D
Sbjct: 249 KAAAAKAAAEVD 260



 Score = 34.4 bits (79), Expect = 0.057
 Identities = 14/79 (17%), Positives = 43/79 (54%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           ++K  ++ ++++KKK++++ ++ Q  +     R  + E+E    +++KK+ ++  K+   
Sbjct: 70  QQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAAL 129

Query: 283 KKKRKKKKNQHHRDVWQKK 301
           K+K+ ++         + K
Sbjct: 130 KQKQAEEAAAKAAAAAKAK 148



 Score = 33.2 bits (76), Expect = 0.14
 Identities = 18/74 (24%), Positives = 23/74 (31%), Gaps = 19/74 (25%)

Query: 222 MKKKKKRKKKKK-----KKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKT 276
             KK   + KKK       K     KKK               E E +KK   + KKK  
Sbjct: 175 AAKKAAAEAKKKAEAEAAAKAAAEAKKK--------------AEAEAKKKAAAEAKKKAA 220

Query: 277 TKKKKRKKKRKKKK 290
            + K    K   + 
Sbjct: 221 AEAKAAAAKAAAEA 234



 Score = 30.2 bits (68), Expect = 1.6
 Identities = 15/67 (22%), Positives = 24/67 (35%), Gaps = 8/67 (11%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
             K+K+ ++   K     K K     +        R     +K   + KKK +    KK 
Sbjct: 128 ALKQKQAEEAAAKAAAAAKAKAEAEAK--------RAAAAAKKAAAEAKKKAEAEAAKKA 179

Query: 283 KKKRKKK 289
             + KKK
Sbjct: 180 AAEAKKK 186


>gnl|CDD|240577 cd12950, RRP7_Rrp7p, RRP7 domain ribosomal RNA-processing protein 7
           (Rrp7p) and similar proteins.  This CD corresponds to
           the RRP7 domain of Rrp7p. Rrp7p is encoded by YCL031C
           gene from Saccharomyces cerevisiae. It is an essential
           yeast protein involved in pre-rRNA processing and
           ribosome assembly. Rrp7p contains an N-terminal RNA
           recognition motif (RRM), also termed RBD (RNA binding
           domain) or RNP (ribonucleoprotein domain), and a
           C-terminal RRP7 domain.
          Length = 128

 Score = 33.4 bits (77), Expect = 0.047
 Identities = 21/75 (28%), Positives = 41/75 (54%), Gaps = 13/75 (17%)

Query: 231 KKKKKKKKKRKKKKNQHHRDVWI-----NRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKK 285
            +++++K++RK+ +N+   D ++      R      EE  K  +++KK+K  KKKK+K+ 
Sbjct: 32  AREEERKRERKRLRNEPDEDGFVTVVRGGRKGPAAGEEAGKAAEEEKKEKEKKKKKKKEL 91

Query: 286 --------RKKKKNQ 292
                   R+KKK +
Sbjct: 92  EDFYRFQLREKKKEE 106



 Score = 32.6 bits (75), Expect = 0.090
 Identities = 24/76 (31%), Positives = 38/76 (50%), Gaps = 26/76 (34%)

Query: 219 GFVMKKKKKRKKKKKKKKKKK--------RKKKKNQHHRDVWINRTSRGEEEEEKKKKKK 270
           G   +++KK K+KKKKKKK+         R+KKK               EE+ +  KK +
Sbjct: 71  GKAAEEEKKEKEKKKKKKKELEDFYRFQLREKKK---------------EEQADLLKKFE 115

Query: 271 KKKKKTTKKKKRKKKR 286
           + K++    +K K+KR
Sbjct: 116 EDKERV---EKMKEKR 128


>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
          Length = 330

 Score = 34.8 bits (81), Expect = 0.047
 Identities = 15/84 (17%), Positives = 37/84 (44%), Gaps = 15/84 (17%)

Query: 212 SISFWIPGFVMKKKKKRKKKKKKK-----KKKKRKKKKNQHHRDVWINRTSRGEEEEEKK 266
           +I+  +   V   ++K+ +K + +     ++ K  K++ +              +++E K
Sbjct: 18  AIAAVVALIVSLAQRKKSQKGELEVTNLNEQYKEMKEELKAAL----------LDKKELK 67

Query: 267 KKKKKKKKKTTKKKKRKKKRKKKK 290
              K +KKK  ++ K  K + K +
Sbjct: 68  AWHKAQKKKEKQEAKAAKAKSKPR 91



 Score = 33.7 bits (78), Expect = 0.097
 Identities = 10/36 (27%), Positives = 20/36 (55%), Gaps = 2/36 (5%)

Query: 220 FVMKKK--KKRKKKKKKKKKKKRKKKKNQHHRDVWI 253
            ++ KK  K   K +KKK+K++ K  K +    +++
Sbjct: 59  ALLDKKELKAWHKAQKKKEKQEAKAAKAKSKPRLFV 94



 Score = 32.9 bits (76), Expect = 0.18
 Identities = 14/70 (20%), Positives = 32/70 (45%), Gaps = 6/70 (8%)

Query: 229 KKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKK 288
            ++KK +K +      N+ ++++      + E +     KK+ K     +KKK K++ K 
Sbjct: 30  AQRKKSQKGELEVTNLNEQYKEM------KEELKAALLDKKELKAWHKAQKKKEKQEAKA 83

Query: 289 KKNQHHRDVW 298
            K +    ++
Sbjct: 84  AKAKSKPRLF 93


>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family.  The yeast member of this family
           (Kri1p) is found to be required for 40S ribosome
           biogenesis in the nucleolus.
          Length = 99

 Score = 32.6 bits (75), Expect = 0.049
 Identities = 9/31 (29%), Positives = 23/31 (74%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
           +EEE+ +++++ K+ K  K+++ ++K +K K
Sbjct: 4   KEEEKAQREEELKRLKNLKREEIEEKLEKIK 34



 Score = 29.5 bits (67), Expect = 0.57
 Identities = 10/33 (30%), Positives = 21/33 (63%)

Query: 258 RGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
           R EEE+ +++++ K+ K   +++  +K  K KK
Sbjct: 3   RKEEEKAQREEELKRLKNLKREEIEEKLEKIKK 35



 Score = 28.0 bits (63), Expect = 2.1
 Identities = 7/20 (35%), Positives = 17/20 (85%)

Query: 226 KKRKKKKKKKKKKKRKKKKN 245
           K+RK+++K +++++ K+ KN
Sbjct: 1   KERKEEEKAQREEELKRLKN 20



 Score = 27.6 bits (62), Expect = 3.3
 Identities = 6/25 (24%), Positives = 18/25 (72%)

Query: 263 EEKKKKKKKKKKKTTKKKKRKKKRK 287
           +E+K+++K ++++  K+ K  K+ +
Sbjct: 1   KERKEEEKAQREEELKRLKNLKREE 25



 Score = 27.6 bits (62), Expect = 3.3
 Identities = 5/25 (20%), Positives = 18/25 (72%)

Query: 264 EKKKKKKKKKKKTTKKKKRKKKRKK 288
           +++K+++K +++   K+ +  KR++
Sbjct: 1   KERKEEEKAQREEELKRLKNLKREE 25



 Score = 27.2 bits (61), Expect = 3.8
 Identities = 4/23 (17%), Positives = 16/23 (69%)

Query: 224 KKKKRKKKKKKKKKKKRKKKKNQ 246
           +K++ K +++++ K+ +  K+ +
Sbjct: 3   RKEEEKAQREEELKRLKNLKREE 25



 Score = 27.2 bits (61), Expect = 4.2
 Identities = 6/21 (28%), Positives = 15/21 (71%)

Query: 222 MKKKKKRKKKKKKKKKKKRKK 242
            ++K +R+++ K+ K  KR++
Sbjct: 5   EEEKAQREEELKRLKNLKREE 25



 Score = 26.5 bits (59), Expect = 6.4
 Identities = 5/22 (22%), Positives = 17/22 (77%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKK 244
           K+++K +++++ K+ K  K+++
Sbjct: 4   KEEEKAQREEELKRLKNLKREE 25



 Score = 26.5 bits (59), Expect = 7.8
 Identities = 6/21 (28%), Positives = 15/21 (71%)

Query: 224 KKKKRKKKKKKKKKKKRKKKK 244
           K++K ++K +++++ KR K  
Sbjct: 1   KERKEEEKAQREEELKRLKNL 21



 Score = 26.1 bits (58), Expect = 9.1
 Identities = 6/28 (21%), Positives = 18/28 (64%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRD 250
           K++K+ +K +++++ K+ K  K +   +
Sbjct: 1   KERKEEEKAQREEELKRLKNLKREEIEE 28


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 34.5 bits (80), Expect = 0.050
 Identities = 16/75 (21%), Positives = 46/75 (61%), Gaps = 9/75 (12%)

Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
           V++K  K ++++++K  K  ++++ +          ++ ++EE+KK++++ K  K + ++
Sbjct: 257 VLRKVDKTREEEEEKILKAAEEERQE---------EAQEKKEEKKKEEREAKLAKLSPEE 307

Query: 281 KRKKKRKKKKNQHHR 295
           +RK + K++K Q  +
Sbjct: 308 QRKLEEKERKKQARK 322



 Score = 31.1 bits (71), Expect = 0.69
 Identities = 12/60 (20%), Positives = 34/60 (56%), Gaps = 9/60 (15%)

Query: 220 FVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK 279
            +   +++R+++ ++KK++K+K+++                  EE++K ++K++KK  +K
Sbjct: 272 ILKAAEEERQEEAQEKKEEKKKEER---------EAKLAKLSPEEQRKLEEKERKKQARK 322


>gnl|CDD|184900 PRK14907, rplD, 50S ribosomal protein L4; Provisional.
          Length = 295

 Score = 34.2 bits (78), Expect = 0.058
 Identities = 20/69 (28%), Positives = 28/69 (40%)

Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
             K  KK+  ++KK   KK    K           TS    ++  K KK K  K TTKK 
Sbjct: 4   TKKTTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKV 63

Query: 281 KRKKKRKKK 289
             K ++ + 
Sbjct: 64  TVKFEKTES 72



 Score = 33.4 bits (76), Expect = 0.12
 Identities = 19/69 (27%), Positives = 31/69 (44%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
            KK  K+K  ++KK   K+     +  +     +T+  +  ++  K KK K  KTT KK 
Sbjct: 4   TKKTTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKV 63

Query: 282 RKKKRKKKK 290
             K  K + 
Sbjct: 64  TVKFEKTES 72



 Score = 31.5 bits (71), Expect = 0.51
 Identities = 18/80 (22%), Positives = 30/80 (37%), Gaps = 9/80 (11%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
            KK   +K    K+  K +K  K           +++  ++  K KK K  K  T K   
Sbjct: 15  EKKPAAKKATTSKETAKTKKTAKTT---------STKAAKKAAKVKKTKSVKTTTKKVTV 65

Query: 282 RKKKRKKKKNQHHRDVWQKK 301
           + +K +  K +       KK
Sbjct: 66  KFEKTESVKKESVAKKTVKK 85



 Score = 29.9 bits (67), Expect = 1.7
 Identities = 16/67 (23%), Positives = 23/67 (34%)

Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
             K    +  KK  K KK +  K       V   +T   ++E   KK  KK+       +
Sbjct: 35  TAKTTSTKAAKKAAKVKKTKSVKTTTKKVTVKFEKTESVKKESVAKKTVKKEAVSAEVFE 94

Query: 281 KRKKKRK 287
              K  K
Sbjct: 95  ASNKLFK 101



 Score = 28.4 bits (63), Expect = 4.7
 Identities = 17/71 (23%), Positives = 26/71 (36%), Gaps = 4/71 (5%)

Query: 223 KKKKKRK----KKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTK 278
           K KK  K    K  KK  K K+ K      + V +        ++E   KK  KK+  + 
Sbjct: 31  KTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKVTVKFEKTESVKKESVAKKTVKKEAVSA 90

Query: 279 KKKRKKKRKKK 289
           +      +  K
Sbjct: 91  EVFEASNKLFK 101


>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor.  The
           spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
           U5) and proteins, catalyzes the excision of introns from
           pre-mRNAs in two successive trans-esterification
           reactions. Step 2 depends upon integral spliceosome
           constituents such as U5 snRNA and Prp8 and
           non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
           ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
           conformational change in the spliceosome that leads to
           protection of the 3'ss from targeted RNase H cleavage.
           This change, which probably reflects binding of the 3'ss
           PyAG in the catalytic centre of the spliceosome,
           requires the ordered recruitment of Slu7, Prp18, and
           Prp22 to the spliceosome. There is a close functional
           relationship between Prp8, Prp18, and Slu7, and Prp18
           interacts with Slu7, so that together they recruit Prp22
           to the spliceosome. Most members of the family carry a
           zinc-finger of the CCHC-type upstream of this domain.
          Length = 236

 Score = 33.9 bits (78), Expect = 0.064
 Identities = 16/62 (25%), Positives = 31/62 (50%), Gaps = 9/62 (14%)

Query: 222 MKKKKKRKKKKKKKKKKKRK----KKKNQHH-----RDVWINRTSRGEEEEEKKKKKKKK 272
              +KK K+KK++ K +K++    K   + H     +++ + ++    E +   KKKK K
Sbjct: 159 ELLRKKEKEKKEQLKIQKKQSLLEKYGGEEHLDKPPKELLLGQSEDYVEYDRAGKKKKAK 218

Query: 273 KK 274
            K
Sbjct: 219 SK 220



 Score = 30.9 bits (70), Expect = 0.64
 Identities = 12/40 (30%), Positives = 22/40 (55%)

Query: 264 EKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQKKYT 303
           E  +KK+K+KK+  K +K++   +K   + H D   K+  
Sbjct: 159 ELLRKKEKEKKEQLKIQKKQSLLEKYGGEEHLDKPPKELL 198



 Score = 30.9 bits (70), Expect = 0.69
 Identities = 13/66 (19%), Positives = 32/66 (48%)

Query: 232 KKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKN 291
             +KK+K++K++     +   + +    E  ++  K+    + +   +  R  K+KK K+
Sbjct: 160 LLRKKEKEKKEQLKIQKKQSLLEKYGGEEHLDKPPKELLLGQSEDYVEYDRAGKKKKAKS 219

Query: 292 QHHRDV 297
           ++  DV
Sbjct: 220 KYEEDV 225


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 34.4 bits (80), Expect = 0.064
 Identities = 20/109 (18%), Positives = 49/109 (44%), Gaps = 15/109 (13%)

Query: 199 IFVVNVYYALISSSISFWIPGFVMKKKKKRKKKKKKKKKKK-----------RKKKKNQH 247
           + ++ +  AL++  +   I  FV KK  + K K+ +++ K+            KK+    
Sbjct: 2   MLLIIILIALVALVVGAVIGYFVRKKIAEAKIKEAEEEAKRILEEAKKEAEAIKKEALLE 61

Query: 248 HRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK----KRKKKRKKKKNQ 292
            ++      +  E+E  +++ + +K +K   +K     RK +  +K+ +
Sbjct: 62  AKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREE 110



 Score = 34.0 bits (79), Expect = 0.085
 Identities = 18/91 (19%), Positives = 50/91 (54%), Gaps = 14/91 (15%)

Query: 222 MKKKKKRKKKKKKKK-----KKKRKKKKNQHHRDVWINRTSRGE-EEEEKKKKKK----- 270
           + ++ K++ +  KK+     K++  K +N+  +++   R  R E ++ EK+  +K     
Sbjct: 43  ILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKEL---RERRNELQKLEKRLLQKEENLD 99

Query: 271 KKKKKTTKKKKRKKKRKKKKNQHHRDVWQKK 301
           +K +   K+++  +K++K+  Q  +++ +K+
Sbjct: 100 RKLELLEKREEELEKKEKELEQKQQELEKKE 130



 Score = 32.1 bits (74), Expect = 0.43
 Identities = 17/91 (18%), Positives = 41/91 (45%), Gaps = 23/91 (25%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTS-----------------------RG 259
           K+ +  KK+   + K++  K +N+  +++   R                         + 
Sbjct: 49  KEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKR 108

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
           EEE EKK+K+ ++K++  +KK+ + +   ++
Sbjct: 109 EEELEKKEKELEQKQQELEKKEEELEELIEE 139


>gnl|CDD|220231 pfam09420, Nop16, Ribosome biogenesis protein Nop16.  Nop16 is a
           protein involved in ribosome biogenesis.
          Length = 173

 Score = 33.6 bits (77), Expect = 0.072
 Identities = 10/33 (30%), Positives = 17/33 (51%)

Query: 264 EKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRD 296
            +KKKK +       +K+ K+K +KKK     +
Sbjct: 1   VRKKKKNRSSNYKVNRKRLKRKDRKKKINIIGN 33



 Score = 28.6 bits (64), Expect = 2.7
 Identities = 12/31 (38%), Positives = 17/31 (54%), Gaps = 3/31 (9%)

Query: 223 KKKKKRK---KKKKKKKKKKRKKKKNQHHRD 250
           KKKK R    K  +K+ K+K +KKK     +
Sbjct: 3   KKKKNRSSNYKVNRKRLKRKDRKKKINIIGN 33



 Score = 28.6 bits (64), Expect = 3.2
 Identities = 8/25 (32%), Positives = 14/25 (56%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQH 247
           +KKKK +    K  +K+ K+K  + 
Sbjct: 2   RKKKKNRSSNYKVNRKRLKRKDRKK 26



 Score = 27.8 bits (62), Expect = 4.6
 Identities = 8/27 (29%), Positives = 16/27 (59%)

Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRD 250
           +KKK+ +    K  +KR K+K++  + 
Sbjct: 2   RKKKKNRSSNYKVNRKRLKRKDRKKKI 28



 Score = 27.0 bits (60), Expect = 8.2
 Identities = 5/27 (18%), Positives = 14/27 (51%)

Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDV 251
           +KK+K +    K  +++ K+    + +
Sbjct: 2   RKKKKNRSSNYKVNRKRLKRKDRKKKI 28



 Score = 27.0 bits (60), Expect = 8.6
 Identities = 15/71 (21%), Positives = 23/71 (32%), Gaps = 18/71 (25%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVW------------------INRTSRGEEEEE 264
             K  RK+ K+K +KKK     N      W                   N  +R +   +
Sbjct: 11  NYKVNRKRLKRKDRKKKINIIGNPIIAKNWDKKKTLAQNYKRLGLAADPNTAARTDLLIQ 70

Query: 265 KKKKKKKKKKK 275
            K   + KK +
Sbjct: 71  DKPIPELKKGE 81


>gnl|CDD|146486 pfam03879, Cgr1, Cgr1 family.  Members of this family are
           coiled-coil proteins that are involved in pre-rRNA
           processing.
          Length = 105

 Score = 32.4 bits (74), Expect = 0.073
 Identities = 18/69 (26%), Positives = 36/69 (52%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           K+ +KR +++  K ++K  K + +  R   I         +E+K++ +K   K   KK  
Sbjct: 32  KRMEKRLEQQAIKAREKELKDEKEAERQRRIQAIKERRAAKEEKERYEKMAAKMHAKKVE 91

Query: 283 KKKRKKKKN 291
           + KR++K+N
Sbjct: 92  RLKRREKRN 100


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. This family includes the
           PrrC protein that is thought to be the active component
           of the anticodon nuclease.
          Length = 713

 Score = 34.2 bits (79), Expect = 0.074
 Identities = 19/77 (24%), Positives = 35/77 (45%), Gaps = 10/77 (12%)

Query: 217 IPG-FVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKK 275
           I G F + ++    + + ++ KK+ KK + +           + E E EKK+++ +K K 
Sbjct: 80  IKGVFTLGEENIEIEAQIEELKKELKKLEEKI---------EQLEAEIEKKEEELEKAKN 130

Query: 276 TTKKKKRKKKRKKKKNQ 292
               K  KK  KK  + 
Sbjct: 131 KFLDKAWKKLAKKYDSN 147



 Score = 31.9 bits (73), Expect = 0.39
 Identities = 14/74 (18%), Positives = 39/74 (52%), Gaps = 3/74 (4%)

Query: 231 KKKKKKKKKRKKKKNQHHRDVWINRTSRGEEE---EEKKKKKKKKKKKTTKKKKRKKKRK 287
           ++  +K    KK+KN+  + +W++  +  +E+    +K+KK  +K   + +K+ ++ + +
Sbjct: 378 REHNEKIDNLKKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAE 437

Query: 288 KKKNQHHRDVWQKK 301
            K  +      +K+
Sbjct: 438 IKALEKEIKELEKQ 451



 Score = 27.7 bits (62), Expect = 9.9
 Identities = 11/44 (25%), Positives = 26/44 (59%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQKKYT 303
           E ++E KK ++K ++   + +K++++ +K KN+     W+K   
Sbjct: 99  ELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAK 142


>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
          Length = 1465

 Score = 34.5 bits (79), Expect = 0.077
 Identities = 18/63 (28%), Positives = 24/63 (38%), Gaps = 4/63 (6%)

Query: 225  KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEE----EEEKKKKKKKKKKKTTKKK 280
            KK  ++  KK   KK  KK ++         +S  E     E  K K +   KKK     
Sbjct: 1195 KKVSRQAPKKPAPKKTTKKASESETTEETYGSSAMETENVAEVVKPKGRAGAKKKAPAAA 1254

Query: 281  KRK 283
            K K
Sbjct: 1255 KEK 1257



 Score = 33.7 bits (77), Expect = 0.14
 Identities = 13/66 (19%), Positives = 21/66 (31%)

Query: 219  GFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTK 278
            G   K  ++  KK   KK  K+  +          +        E  K K +   KK   
Sbjct: 1192 GAAKKVSRQAPKKPAPKKTTKKASESETTEETYGSSAMETENVAEVVKPKGRAGAKKKAP 1251

Query: 279  KKKRKK 284
               ++K
Sbjct: 1252 AAAKEK 1257



 Score = 32.1 bits (73), Expect = 0.47
 Identities = 6/31 (19%), Positives = 16/31 (51%)

Query: 260  EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
            + EE ++K ++   +  +   K+  ++  KK
Sbjct: 1174 KAEEAREKLQRAAARGESGAAKKVSRQAPKK 1204



 Score = 28.3 bits (63), Expect = 6.6
 Identities = 14/60 (23%), Positives = 27/60 (45%), Gaps = 10/60 (16%)

Query: 221  VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
            + K  K+  K ++ ++K +R   + +          S   ++  ++  KK   KKTTKK 
Sbjct: 1165 LDKLDKEDAKAEEAREKLQRAAARGE----------SGAAKKVSRQAPKKPAPKKTTKKA 1214



 Score = 27.9 bits (62), Expect = 7.6
 Identities = 8/41 (19%), Positives = 18/41 (43%), Gaps = 1/41 (2%)

Query: 257  SRGEEEEEKKKKKKKKKK-KTTKKKKRKKKRKKKKNQHHRD 296
            ++ EE  EK ++   + +    KK  R+  +K    +  + 
Sbjct: 1173 AKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKK 1213


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 33.9 bits (78), Expect = 0.079
 Identities = 23/94 (24%), Positives = 36/94 (38%), Gaps = 25/94 (26%)

Query: 222 MKKKKKRKKKKKKKKK--------------KKRKKKKNQHHRDVWINRTSRGEEEEEKKK 267
           +  K KR+ K   ++               KK+K+ K +        R  R +    K K
Sbjct: 204 LGGKLKRRDKDAPERHSDELVDADDIEGPAKKKKQTKEERIATAKEGREDREKFGSRKGK 263

Query: 268 KKKKKKKKTTKKKKRKK-----------KRKKKK 290
           K K+ K  T K+K RKK           + K+K+
Sbjct: 264 KDKEGKSTTNKEKARKKNFMMTLHKKRVRGKQKR 297



 Score = 33.5 bits (77), Expect = 0.10
 Identities = 22/84 (26%), Positives = 32/84 (38%), Gaps = 20/84 (23%)

Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWIN--------------RTSR------GEEEEEK 265
           +K   K    K K+R K   + H D  ++              +  R      G E+ EK
Sbjct: 197 EKGVDKALGGKLKRRDKDAPERHSDELVDADDIEGPAKKKKQTKEERIATAKEGREDREK 256

Query: 266 KKKKKKKKKKTTKKKKRKKKRKKK 289
              +K KK K  K    K+K +KK
Sbjct: 257 FGSRKGKKDKEGKSTTNKEKARKK 280



 Score = 32.7 bits (75), Expect = 0.19
 Identities = 17/69 (24%), Positives = 32/69 (46%), Gaps = 15/69 (21%)

Query: 224 KKKKRKKKKKKKKKKKRKKKKN----QHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK 279
           +K K+ K+ K    K++ +KKN     H + V      RG     K+K+  + K+K  + 
Sbjct: 260 RKGKKDKEGKSTTNKEKARKKNFMMTLHKKRV------RG-----KQKRSLRDKQKVLRA 308

Query: 280 KKRKKKRKK 288
              ++K+  
Sbjct: 309 HILRQKKGG 317



 Score = 31.9 bits (73), Expect = 0.40
 Identities = 20/85 (23%), Positives = 34/85 (40%), Gaps = 20/85 (23%)

Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKK----KKKKKT 276
             + ++ R+K   +K      KK  +           +    +EK +KK       KK+ 
Sbjct: 247 AKEGREDREKFGSRK-----GKKDKEG----------KSTTNKEKARKKNFMMTLHKKRV 291

Query: 277 TKKKKRKKKRKKKKNQHHRDVWQKK 301
             K+KR  + K+K  + H    QKK
Sbjct: 292 RGKQKRSLRDKQKVLRAHILR-QKK 315



 Score = 30.4 bits (69), Expect = 1.2
 Identities = 21/96 (21%), Positives = 40/96 (41%), Gaps = 30/96 (31%)

Query: 217 IPGFVMKKKKKRK---------KKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKK 267
           I G   KKK+ ++         ++ ++K   ++ KK  +          S   +E+ +KK
Sbjct: 229 IEGPAKKKKQTKEERIATAKEGREDREKFGSRKGKKDKEGK--------STTNKEKARKK 280

Query: 268 -------KKK---KKKKKTTKKKKRKKK---RKKKK 290
                  KK+   K+K+    K+K  +    R+KK 
Sbjct: 281 NFMMTLHKKRVRGKQKRSLRDKQKVLRAHILRQKKG 316


>gnl|CDD|219882 pfam08524, rRNA_processing, rRNA processing.  This is a family of
           proteins that are involved in rRNA processing. In a
           localisation study they were found to localise to the
           nucleus and nucleolus. The family also includes other
           metazoa members from plants to mammals where the protein
           has been named BR22 and is associated with TTF-1,
           thyroid transcription factor 1. In the lungs, the family
           binds TTF-1 to form a complex which influences the
           expression of the key lung surfactant protein-B (SP-B)
           and -C (SP-C), the small hydrophobic surfactant proteins
           that maintain surface tension in alveoli.
          Length = 150

 Score = 32.9 bits (75), Expect = 0.082
 Identities = 20/68 (29%), Positives = 43/68 (63%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           K+  +++ K  K+ +K  KKKK    +++   R     E+E  K++K+ +K + +KKK++
Sbjct: 52  KESAEKQVKSSKEDRKFEKKKKLDEKKEIAKQRKREQREKELAKRQKELEKIELSKKKQK 111

Query: 283 KKKRKKKK 290
           +++R++KK
Sbjct: 112 ERERRRKK 119



 Score = 30.6 bits (69), Expect = 0.48
 Identities = 15/61 (24%), Positives = 37/61 (60%), Gaps = 8/61 (13%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
           + +KK+  K++K+++++K   K+ +           + E  ++K+K++++++KK TKK K
Sbjct: 74  LDEKKEIAKQRKREQREKELAKRQK--------ELEKIELSKKKQKERERRRKKLTKKTK 125

Query: 282 R 282
            
Sbjct: 126 S 126



 Score = 30.6 bits (69), Expect = 0.58
 Identities = 20/70 (28%), Positives = 39/70 (55%), Gaps = 11/70 (15%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           KKKK  +KK+  K++K+ +++K    R           ++E +K +  KKK+K  +++++
Sbjct: 70  KKKKLDEKKEIAKQRKREQREKELAKR-----------QKELEKIELSKKKQKERERRRK 118

Query: 283 KKKRKKKKNQ 292
           K  +K K  Q
Sbjct: 119 KLTKKTKSGQ 128



 Score = 30.2 bits (68), Expect = 0.75
 Identities = 20/66 (30%), Positives = 38/66 (57%), Gaps = 5/66 (7%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           +K +K+KK  +KK+  K++K++ +           R +E E+ +  KKK+K++  ++KK 
Sbjct: 66  RKFEKKKKLDEKKEIAKQRKREQREKEL-----AKRQKELEKIELSKKKQKERERRRKKL 120

Query: 283 KKKRKK 288
            KK K 
Sbjct: 121 TKKTKS 126



 Score = 29.1 bits (65), Expect = 1.7
 Identities = 19/78 (24%), Positives = 42/78 (53%), Gaps = 8/78 (10%)

Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKK 285
           K+  +K+ K  K+ RK +K +   +         +E  +++K+++++K+   ++K+ +K 
Sbjct: 52  KESAEKQVKSSKEDRKFEKKKKLDEK--------KEIAKQRKREQREKELAKRQKELEKI 103

Query: 286 RKKKKNQHHRDVWQKKYT 303
              KK Q  R+  +KK T
Sbjct: 104 ELSKKKQKERERRRKKLT 121



 Score = 27.5 bits (61), Expect = 6.0
 Identities = 20/91 (21%), Positives = 42/91 (46%), Gaps = 12/91 (13%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHR---------DVWINRTSRGEEEEEKKKKKK-KK 272
            + +K+  KK  ++ K ++ ++N   +          +     +  E+E  +K+ K  K+
Sbjct: 5   DQNQKKNGKKFTREYKVKEIQRNLTKKARLKKEYLKLLEKEGYAVPEKESAEKQVKSSKE 64

Query: 273 KKKTTKKKK--RKKKRKKKKNQHHRDVWQKK 301
            +K  KKKK   KK+  K++ +  R+    K
Sbjct: 65  DRKFEKKKKLDEKKEIAKQRKREQREKELAK 95


>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
          Length = 191

 Score = 33.2 bits (76), Expect = 0.087
 Identities = 21/64 (32%), Positives = 28/64 (43%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           KKKK   +KKK  KK K+KKK+ +   ++     S  EE EE  KK   +  K       
Sbjct: 52  KKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDYELPKVQNTAAE 111

Query: 283 KKKR 286
               
Sbjct: 112 VNHE 115



 Score = 31.3 bits (71), Expect = 0.36
 Identities = 18/35 (51%), Positives = 21/35 (60%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHH 294
           E+ E   KK  KKKK T +KKK  KK KKKK +  
Sbjct: 41  EDVEIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKE 75



 Score = 27.1 bits (60), Expect = 9.3
 Identities = 13/26 (50%), Positives = 19/26 (73%)

Query: 265 KKKKKKKKKKKTTKKKKRKKKRKKKK 290
            KKKK   +KK T KK +KKK++K++
Sbjct: 51  SKKKKTTPRKKKTTKKTKKKKKEKEE 76


>gnl|CDD|218435 pfam05104, Rib_recp_KP_reg, Ribosome receptor lysine/proline rich
           region.  This highly conserved region is found towards
           the C-terminus of the transmembrane domain. The function
           is unclear.
          Length = 151

 Score = 32.6 bits (74), Expect = 0.093
 Identities = 12/31 (38%), Positives = 22/31 (70%)

Query: 261 EEEEKKKKKKKKKKKTTKKKKRKKKRKKKKN 291
            + + +K  KKKK+K ++KK + KK+++K N
Sbjct: 18  GKTQSQKSDKKKKEKVSEKKGKSKKKEEKPN 48



 Score = 30.3 bits (68), Expect = 0.71
 Identities = 13/33 (39%), Positives = 21/33 (63%)

Query: 258 RGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
           R E  + + +K  KKKK+   +KK K K+K++K
Sbjct: 14  RKESGKTQSQKSDKKKKEKVSEKKGKSKKKEEK 46



 Score = 29.9 bits (67), Expect = 0.87
 Identities = 12/22 (54%), Positives = 16/22 (72%)

Query: 224 KKKKRKKKKKKKKKKKRKKKKN 245
           KKKK K  +KK K KK+++K N
Sbjct: 27  KKKKEKVSEKKGKSKKKEEKPN 48



 Score = 29.6 bits (66), Expect = 1.0
 Identities = 15/52 (28%), Positives = 24/52 (46%), Gaps = 13/52 (25%)

Query: 237 KKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKK 288
           K++++  K Q  +              +KKKK+K  +KK   KKK +K   K
Sbjct: 12  KQRKESGKTQSQK-------------SDKKKKEKVSEKKGKSKKKEEKPNGK 50



 Score = 28.4 bits (63), Expect = 2.7
 Identities = 18/50 (36%), Positives = 25/50 (50%), Gaps = 15/50 (30%)

Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKK 275
           K+RK+  K + +K  KKKK               E+  EKK K KKK++K
Sbjct: 12  KQRKESGKTQSQKSDKKKK---------------EKVSEKKGKSKKKEEK 46



 Score = 26.9 bits (59), Expect = 9.6
 Identities = 11/20 (55%), Positives = 15/20 (75%)

Query: 223 KKKKKRKKKKKKKKKKKRKK 242
           KKKK++  +KK K KKK +K
Sbjct: 27  KKKKEKVSEKKGKSKKKEEK 46


>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
            biogenesis [Translation, ribosomal structure and
            biogenesis].
          Length = 1077

 Score = 33.9 bits (77), Expect = 0.098
 Identities = 14/80 (17%), Positives = 36/80 (45%), Gaps = 6/80 (7%)

Query: 217  IPGFVMKKKKKRKKKKKKKKKK------KRKKKKNQHHRDVWINRTSRGEEEEEKKKKKK 270
            +P  +  K    K+      ++         ++K++    +   R    EE+E  +  ++
Sbjct: 981  LPREIESKLPLDKRSIAVVSRRIELPVPPECREKHEIKDRIVKERIKDQEEKERMESLQR 1040

Query: 271  KKKKKTTKKKKRKKKRKKKK 290
             K+++  KK+K +++R +K 
Sbjct: 1041 AKEEEIGKKEKEREQRIRKT 1060



 Score = 33.2 bits (75), Expect = 0.17
 Identities = 14/63 (22%), Positives = 30/63 (47%), Gaps = 3/63 (4%)

Query: 225  KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
            K +  K++ K +++K + +  Q  +        + E+E E++ +K          KKR K
Sbjct: 1018 KDRIVKERIKDQEEKERMESLQRAK---EEEIGKKEKEREQRIRKTIHDNYKEMAKKRLK 1074

Query: 285  KRK 287
            K++
Sbjct: 1075 KKR 1077



 Score = 32.0 bits (72), Expect = 0.46
 Identities = 19/80 (23%), Positives = 37/80 (46%), Gaps = 14/80 (17%)

Query: 217  IPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKT 276
            +P     ++K   K +  K++ K +++K +          S    +EE+  KK+K++++ 
Sbjct: 1005 LPVPPECREKHEIKDRIVKERIKDQEEKER--------MESLQRAKEEEIGKKEKEREQR 1056

Query: 277  TKKKKRK------KKRKKKK 290
             +K          KKR KKK
Sbjct: 1057 IRKTIHDNYKEMAKKRLKKK 1076



 Score = 31.2 bits (70), Expect = 0.88
 Identities = 10/72 (13%), Positives = 37/72 (51%)

Query: 222  MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
            + K+      ++ +     + ++    +D  +    + +EE+E+ +  ++ K++   KK+
Sbjct: 991  LDKRSIAVVSRRIELPVPPECREKHEIKDRIVKERIKDQEEKERMESLQRAKEEEIGKKE 1050

Query: 282  RKKKRKKKKNQH 293
            ++++++ +K  H
Sbjct: 1051 KEREQRIRKTIH 1062


>gnl|CDD|219978 pfam08701, GN3L_Grn1, GNL3L/Grn1 putative GTPase.  Grn1 (yeast) and
           GNL3L (human) are putative GTPases which are required
           for growth and play a role in processing of nucleolar
           pre-rRNA. This family contains a potential nuclear
           localisation signal.
          Length = 80

 Score = 31.5 bits (72), Expect = 0.10
 Identities = 18/69 (26%), Positives = 34/69 (49%), Gaps = 12/69 (17%)

Query: 223 KKKKKRKKKKKKKKKKKRKKK----------KNQHHRDVWINRTSRGEEEEEKKKKKKKK 272
             +K RK+ KK    K +KKK          K +   +  I    R +EEE++++K+ +K
Sbjct: 13  HHRKLRKEAKKNPTWKSKKKKDPGIPNSFPFKEEILEE--IEEKKRKQEEEKERRKEARK 70

Query: 273 KKKTTKKKK 281
            ++   +K+
Sbjct: 71  AERAEARKR 79



 Score = 28.0 bits (63), Expect = 1.4
 Identities = 20/79 (25%), Positives = 39/79 (49%), Gaps = 16/79 (20%)

Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRG--------------EEEEEKKKKKKK 271
           + R K +KK  +  RK +K       W ++  +               EE EEKK+K+++
Sbjct: 1   RLRYKIEKKVAEHHRKLRKEAKKNPTWKSKKKKDPGIPNSFPFKEEILEEIEEKKRKQEE 60

Query: 272 KKKKTTKKKKRKKKRKKKK 290
           +K++  +K+ RK +R + +
Sbjct: 61  EKER--RKEARKAERAEAR 77



 Score = 27.2 bits (61), Expect = 3.2
 Identities = 14/79 (17%), Positives = 36/79 (45%), Gaps = 17/79 (21%)

Query: 223 KKKKKRKKKKKKKKKKKRK--------KKKNQH---------HRDVWINRTSRGEEEEEK 265
           + + K +KK  +  +K RK        K K +           ++  +      + ++E+
Sbjct: 1   RLRYKIEKKVAEHHRKLRKEAKKNPTWKSKKKKDPGIPNSFPFKEEILEEIEEKKRKQEE 60

Query: 266 KKKKKKKKKKTTKKKKRKK 284
           +K+++K+ +K  + + RK+
Sbjct: 61  EKERRKEARKAERAEARKR 79


>gnl|CDD|152713 pfam12278, SDP_N, Sex determination protein N terminal.  This
           family of proteins is found in eukaryotes. Proteins in
           this family are typically between 168 and 410 amino
           acids in length. This family is the N terminal end of
           the sex determination protein of many different animals.
           It plays a role in the gender determination of around
           20% of all animals.
          Length = 168

 Score = 32.7 bits (74), Expect = 0.11
 Identities = 19/72 (26%), Positives = 35/72 (48%), Gaps = 8/72 (11%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKK--------KKKKKK 274
           K K+ R +  +   + + + ++ QH R+ W  +  R  E E+ KKK        +K++ +
Sbjct: 14  KFKQLRSEDSESGLRSRTEDERLQHRREEWFIQQEREREHEKLKKKMILEYELRRKREIE 73

Query: 275 KTTKKKKRKKKR 286
           K   K+ R K R
Sbjct: 74  KLLSKRSRTKSR 85


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 33.5 bits (77), Expect = 0.11
 Identities = 20/72 (27%), Positives = 39/72 (54%)

Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
           + K ++K K  K + K+ K+     +   D+     S+ E + EK   + K+KKK  KK+
Sbjct: 279 MEKLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKE 338

Query: 281 KRKKKRKKKKNQ 292
           ++KKK+ ++  +
Sbjct: 339 EKKKKQIERLEE 350



 Score = 32.7 bits (75), Expect = 0.22
 Identities = 20/70 (28%), Positives = 34/70 (48%), Gaps = 2/70 (2%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKK--KKKKKTTKKK 280
           K + KR KK     +     K+    +    N     E +E+KK+KKK  KKKK+  + +
Sbjct: 290 KYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLE 349

Query: 281 KRKKKRKKKK 290
           +R +K + + 
Sbjct: 350 ERIEKLEVQA 359



 Score = 30.8 bits (70), Expect = 0.84
 Identities = 13/59 (22%), Positives = 31/59 (52%)

Query: 229 KKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRK 287
           +     K+K K K +++    D  +    + +++EEKKKK+ ++ ++  +K + +   K
Sbjct: 304 EMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERIEKLEVQATDK 362



 Score = 28.9 bits (65), Expect = 3.4
 Identities = 16/68 (23%), Positives = 34/68 (50%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
            K+ K+     +     ++K K++  RD         E+++EKKK++KKKK+    +++ 
Sbjct: 293 LKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERI 352

Query: 283 KKKRKKKK 290
           +K   +  
Sbjct: 353 EKLEVQAT 360



 Score = 28.1 bits (63), Expect = 6.7
 Identities = 18/73 (24%), Positives = 41/73 (56%), Gaps = 4/73 (5%)

Query: 222 MKKKKKRKKKKKKKKKKKRKK----KKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTT 277
           M+K +++ K  K + K+ +K     +     +    ++  R  E+ + + K+KKK+KK  
Sbjct: 279 MEKLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKE 338

Query: 278 KKKKRKKKRKKKK 290
           +KKK++ +R +++
Sbjct: 339 EKKKKQIERLEER 351


>gnl|CDD|237855 PRK14900, valS, valyl-tRNA synthetase; Provisional.
          Length = 1052

 Score = 33.8 bits (77), Expect = 0.11
 Identities = 21/68 (30%), Positives = 27/68 (39%), Gaps = 7/68 (10%)

Query: 222  MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
            MKKK  +K   KK   KK   KK          +    ++   KK  +K   KK  KK  
Sbjct: 992  MKKKVAKKAPAKKAAAKKAAAKKA-------AAKKKVAKKAPAKKVARKPAAKKAAKKPA 1044

Query: 282  RKKKRKKK 289
            RK   +K 
Sbjct: 1045 RKAAGRKA 1052


>gnl|CDD|233294 TIGR01159, DRP1, density-regulated protein DRP1.  This protein
           family shows weak but suggestive similarity to
           translation initiation factor SUI1 and its prokaryotic
           homologs.
          Length = 173

 Score = 32.9 bits (75), Expect = 0.11
 Identities = 14/68 (20%), Positives = 25/68 (36%), Gaps = 16/68 (23%)

Query: 219 GFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTK 278
              +   + +++K +K                         E  E  KKKKK  +K T K
Sbjct: 51  AEAVTIGEAQEEKGEKDLLGI----------------QKAQEAREGGKKKKKLPQKVTIK 94

Query: 279 KKKRKKKR 286
           ++ R K++
Sbjct: 95  REPRTKRK 102


>gnl|CDD|223058 PHA03366, PHA03366, FGAM-synthase; Provisional.
          Length = 1304

 Score = 33.8 bits (78), Expect = 0.12
 Identities = 23/118 (19%), Positives = 36/118 (30%), Gaps = 21/118 (17%)

Query: 72  LRIITNYFVVSLAFADLLVALCVMPFNAIVSLTDEWYFGYFM-CDVWNSFDVYFSTASI- 129
           L  + N  +  L      + + +   + +   T  +Y      C    S  +     S+ 
Sbjct: 195 LDKVCNGVMGDLPLGGDALPVAIQASSPLPGYTVAYYLSLTPTCTSVRSNYLTRGAHSVE 254

Query: 130 ---LHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLSNVWILPGIISFTPIMLGWYTT 184
              LH               +Q      Q T LL   +  +LPG        LGWY T
Sbjct: 255 TGFLHTHG-----------SIQTMG--PQLTGLLFQPS--LLPGG-LPRASALGWYVT 296


>gnl|CDD|240709 cd12263, RRM_ABT1_like, RNA recognition motif found in activator of
           basal transcription 1 (ABT1) and similar proteins.  This
           subfamily corresponds to the RRM of novel nuclear
           proteins termed ABT1 and its homologous counterpart,
           pre-rRNA-processing protein ESF2 (eighteen S factor 2),
           from yeast. ABT1 associates with the TATA-binding
           protein (TBP) and enhances basal transcription activity
           of class II promoters. Meanwhile, ABT1 could be a
           transcription cofactor that can bind to DNA in a
           sequence-independent manner. The yeast ABT1 homolog,
           ESF2, is a component of 90S preribosomes and 5'
           ETS-based RNPs. It is previously identified as a
           putative partner of the TATA-element binding protein.
           However, it is primarily localized to the nucleolus and
           physically associates with pre-rRNA processing factors.
           ESF2 may play a role in ribosome biogenesis. It is
           required for normal pre-rRNA processing, as well as for
           SSU processome assembly and function. Both ABT1 and ESF2
           contain an RNA recognition motif (RRM), also termed RBD
           (RNA binding domain) or RNP (ribonucleoprotein domain).
           .
          Length = 98

 Score = 31.4 bits (72), Expect = 0.12
 Identities = 16/76 (21%), Positives = 34/76 (44%), Gaps = 12/76 (15%)

Query: 227 KRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKR 286
           + +   K+K++KK+   K +   + W+         E + KK  K+  ++          
Sbjct: 33  QPEDPAKRKRRKKKGGNKKKKFTEGWV---------EFEDKKVAKRVAESLNNTP---IG 80

Query: 287 KKKKNQHHRDVWQKKY 302
            KK++++  D+W  KY
Sbjct: 81  GKKRSRYRDDLWNLKY 96


>gnl|CDD|227931 COG5644, COG5644, Uncharacterized conserved protein [Function
           unknown].
          Length = 869

 Score = 33.9 bits (77), Expect = 0.13
 Identities = 19/91 (20%), Positives = 39/91 (42%), Gaps = 9/91 (9%)

Query: 215 FWIPGFVMKKKKKRKKKKKKKKKKKRKKKK--NQHHRDV--WINRTSRGEEEEEKKKKKK 270
           F  P   ++K   R    +KK+ +     +   +   +   W++R  R       K+ KK
Sbjct: 595 FVTPSTSLEKSMDRILHGQKKRAEGAVVFEKPLEATENFNPWLDRKMR-----RIKRIKK 649

Query: 271 KKKKKTTKKKKRKKKRKKKKNQHHRDVWQKK 301
           K  ++  + K+ KKK  +++N     +  +K
Sbjct: 650 KAYRRIRRDKRLKKKMPEEENTQENHLGSEK 680



 Score = 33.1 bits (75), Expect = 0.21
 Identities = 16/74 (21%), Positives = 39/74 (52%), Gaps = 2/74 (2%)

Query: 220 FVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK 279
            + ++++K K+  K K K  RK +KN+  +++ +   S  E+ E +K ++ +  ++ T++
Sbjct: 384 LMFREERKAKRVAKIKSKTYRKIRKNRKEKEMALIPKS--EDLENEKSEEARALERMTQR 441

Query: 280 KKRKKKRKKKKNQH 293
            K      +K  + 
Sbjct: 442 HKNTSSWTRKMLER 455



 Score = 28.5 bits (63), Expect = 5.2
 Identities = 21/57 (36%), Positives = 22/57 (38%), Gaps = 14/57 (24%)

Query: 218 PGFVMKKKKKRKKKKKKK--KKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKK 272
           PG  + KKK    KK     K  KRK     H   V IN         EK  KK KK
Sbjct: 768 PGLKVIKKKSNTVKKVVGGIKYSKRKDFNLSH---VIIN---------EKVNKKDKK 812


>gnl|CDD|234707 PRK00270, rpsU, 30S ribosomal protein S21; Reviewed.
          Length = 64

 Score = 30.6 bits (70), Expect = 0.13
 Identities = 15/61 (24%), Positives = 30/61 (49%), Gaps = 15/61 (24%)

Query: 227 KRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKR 286
           +R K+K +K    R+ ++ + +               EK  +K+K+KK   +K++RKK  
Sbjct: 17  RRFKRKVEKAGILRELRRREFY---------------EKPSEKRKRKKAAARKRRRKKLA 61

Query: 287 K 287
           +
Sbjct: 62  R 62



 Score = 27.1 bits (61), Expect = 2.4
 Identities = 10/21 (47%), Positives = 15/21 (71%)

Query: 223 KKKKKRKKKKKKKKKKKRKKK 243
           K  +KRK+KK   +K++RKK 
Sbjct: 40  KPSEKRKRKKAAARKRRRKKL 60



 Score = 26.4 bits (59), Expect = 4.7
 Identities = 7/21 (33%), Positives = 15/21 (71%)

Query: 225 KKKRKKKKKKKKKKKRKKKKN 245
           K+KRKK   +K+++K+  ++ 
Sbjct: 44  KRKRKKAAARKRRRKKLAREE 64



 Score = 26.0 bits (58), Expect = 5.7
 Identities = 9/22 (40%), Positives = 14/22 (63%)

Query: 225 KKKRKKKKKKKKKKKRKKKKNQ 246
           K   K+K+KK   +KR++KK  
Sbjct: 40  KPSEKRKRKKAAARKRRRKKLA 61


>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein.  This family includes proteins
           related to Mpp10 (M phase phosphoprotein 10). The U3
           small nucleolar ribonucleoprotein (snoRNP) is required
           for three cleavage events that generate the mature 18S
           rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
           depletion of Mpp10, a U3 snoRNP-specific protein, halts
           18S rRNA production and impairs cleavage at the three U3
           snoRNP-dependent sites.
          Length = 613

 Score = 33.4 bits (76), Expect = 0.14
 Identities = 19/79 (24%), Positives = 30/79 (37%), Gaps = 19/79 (24%)

Query: 218 PGFVMKKKKKRKKKKKKK----KKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKK 273
           P  + KK K  +K K +     K ++R+KKK +                   KK   K K
Sbjct: 537 PEEIYKKMKAIEKSKTELDRTDKNRERRKKKRKR---------------RAAKKAVTKAK 581

Query: 274 KKTTKKKKRKKKRKKKKNQ 292
           K+    K++     KK + 
Sbjct: 582 KERKIGKEKVDGVAKKSSS 600



 Score = 32.7 bits (74), Expect = 0.23
 Identities = 13/57 (22%), Positives = 26/57 (45%), Gaps = 3/57 (5%)

Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
           K ++R+KKK+K++  K+   K +  R +      + +   +K       K + TK  
Sbjct: 559 KNRERRKKKRKRRAAKKAVTKAKKERKI---GKEKVDGVAKKSSSAFFAKLQETKAV 612



 Score = 31.9 bits (72), Expect = 0.40
 Identities = 14/60 (23%), Positives = 25/60 (41%), Gaps = 7/60 (11%)

Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKK 285
           K R+++KKK+K++  KK   +        +  R   +E+     KK       K +  K 
Sbjct: 559 KNRERRKKKRKRRAAKKAVTK-------AKKERKIGKEKVDGVAKKSSSAFFAKLQETKA 611



 Score = 28.4 bits (63), Expect = 4.9
 Identities = 13/36 (36%), Positives = 20/36 (55%), Gaps = 2/36 (5%)

Query: 262 EEEKKKKKKKKKKKT--TKKKKRKKKRKKKKNQHHR 295
           EE  KK K  +K KT   +  K +++RKKK+ +   
Sbjct: 538 EEIYKKMKAIEKSKTELDRTDKNRERRKKKRKRRAA 573


>gnl|CDD|220838 pfam10659, Trypan_glycop_C, Trypanosome variant surface
           glycoprotein C-terminal domain.  The trypanosome
           parasite expresses these proteins to evade the immune
           response.
          Length = 98

 Score = 31.3 bits (71), Expect = 0.15
 Identities = 17/63 (26%), Positives = 23/63 (36%), Gaps = 9/63 (14%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
            KK+   K K K+ K K+                + G E   +K K KK KK   K  K 
Sbjct: 19  DKKEDDGKCKPKEGKAKKNGAPVT---------QTAGTETTTEKCKGKKDKKDCKKGCKW 69

Query: 283 KKK 285
           +  
Sbjct: 70  EGN 72



 Score = 30.9 bits (70), Expect = 0.20
 Identities = 18/63 (28%), Positives = 25/63 (39%), Gaps = 7/63 (11%)

Query: 229 KKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKK 288
           K+   K K K+ K KKN           ++    E   +K K KK K   KK  K +   
Sbjct: 21  KEDDGKCKPKEGKAKKNG-------APVTQTAGTETTTEKCKGKKDKKDCKKGCKWEGNT 73

Query: 289 KKN 291
            K+
Sbjct: 74  CKD 76



 Score = 27.4 bits (61), Expect = 3.7
 Identities = 19/69 (27%), Positives = 27/69 (39%), Gaps = 19/69 (27%)

Query: 231 KKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKK---------TTKKKK 281
            KK K K + K+K  +             +E++ K K K+ K KK          T+   
Sbjct: 2   NKKNKTKTECKEKGCKWD----------KKEDDGKCKPKEGKAKKNGAPVTQTAGTETTT 51

Query: 282 RKKKRKKKK 290
            K K KK K
Sbjct: 52  EKCKGKKDK 60


>gnl|CDD|221821 pfam12871, PRP38_assoc, Pre-mRNA-splicing factor 38-associated
           hydrophilic C-term.  This domain is a hydrophilic region
           found at the C-terminus of plant and metazoan
           pre-mRNA-splicing factor 38 proteins. The function is
           not known.
          Length = 97

 Score = 31.3 bits (71), Expect = 0.16
 Identities = 8/37 (21%), Positives = 20/37 (54%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRD 296
           E EEE+  ++ ++K +    + R+  R++ + +  R 
Sbjct: 15  ESEEEEDDEEIRRKAERDVDRGRRSPRRRTRRRSRRR 51


>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are approximately 90 amino acids in length.
           There are two completely conserved L residues that may
           be functionally important.
          Length = 70

 Score = 30.4 bits (69), Expect = 0.18
 Identities = 8/26 (30%), Positives = 20/26 (76%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKK 285
           + EEEK++++++ +K+  +K++ K K
Sbjct: 30  KAEEEKREEEEEARKREERKEREKNK 55


>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
           eukaryotic RNase H complex which cleaves RNA-DNA
           hybrids.  Ribonuclease H2B is one of the three proteins
           of eukaryotic RNase H2 complex that is required for
           nucleic acid binding and hydrolysis. RNase H is
           classified into two families, type I (prokaryotic RNase
           HI, eukaryotic RNase H1 and viral RNase H) and type II
           (prokaryotic RNase HII and HIII, and eukaryotic RNase
           H2/HII). RNase H endonucleolytically hydrolyzes an RNA
           strand when it is annealed to a complementary DNA strand
           in the presence of divalent cations, in DNA replication
           and repair. The enzyme can be found in bacteria,
           archaea, and eukaryotes. Most prokaryotic and eukaryotic
           genomes contain multiple RNase H genes. Despite a lack
           of evidence for homology from sequence comparisons, type
           I and type II RNase H share a common fold and similar
           steric configurations of the four acidic active-site
           residues, suggesting identical or very similar catalytic
           mechanisms. Eukaryotic RNase HII is active during
           replication and is believed to play a role in removal of
           Okazaki fragment primers and single ribonucleotides in
           DNA-DNA duplexes. Eukaryotic RNase HII is functional
           when it forms a complex with RNase H2B and RNase H2C
           proteins. It is speculated that the two accessory
           subunits are required for correct folding of the
           catalytic subunit of RNase HII. Mutations in the three
           subunits of human RNase HII cause neurological disorder.
          Length = 211

 Score = 32.3 bits (74), Expect = 0.18
 Identities = 26/69 (37%), Positives = 33/69 (47%), Gaps = 15/69 (21%)

Query: 222 MKKKKKRKKKKK---KKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTK 278
           + KK +R KKK+   K+ +  R KKK            S   EEE+KKKKKK  KKK  K
Sbjct: 149 LLKKVERLKKKELDIKEAEAARDKKK------------SNNAEEEKKKKKKKSAKKKKLK 196

Query: 279 KKKRKKKRK 287
           K      + 
Sbjct: 197 KVAAVGMKA 205



 Score = 29.2 bits (66), Expect = 2.0
 Identities = 18/57 (31%), Positives = 27/57 (47%), Gaps = 7/57 (12%)

Query: 235 KKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKN 291
            KK +R KKK        ++        ++KK    +++KK  KKK  KKK+ KK  
Sbjct: 150 LKKVERLKKKE-------LDIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVA 199



 Score = 27.3 bits (61), Expect = 10.0
 Identities = 13/24 (54%), Positives = 16/24 (66%)

Query: 221 VMKKKKKRKKKKKKKKKKKRKKKK 244
             KK    +++KKKKKKK  KKKK
Sbjct: 171 DKKKSNNAEEEKKKKKKKSAKKKK 194


>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
           Validated.
          Length = 860

 Score = 33.3 bits (76), Expect = 0.20
 Identities = 14/36 (38%), Positives = 18/36 (50%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHR 295
           E++ + KKK    K K  K  K+K K K KK    R
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKR 783



 Score = 30.6 bits (69), Expect = 1.3
 Identities = 13/35 (37%), Positives = 18/35 (51%)

Query: 267 KKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQKK 301
           +KK K KKK+   K +  K  KKK++       KK
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKK 782



 Score = 29.0 bits (65), Expect = 4.1
 Identities = 22/84 (26%), Positives = 28/84 (33%), Gaps = 24/84 (28%)

Query: 198 CIFVVNVYYALISSSISFWIPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTS 257
           C  + N   A+I+         +  K K K+K    K K  K                  
Sbjct: 725 CSVIGNSIDAVITKYAGTPKTPYEKKTKAKKKSASTKGKAAK------------------ 766

Query: 258 RGEEEEEKKKKKKKKKKKTTKKKK 281
                   KKK K K KKTTKK+ 
Sbjct: 767 ------TVKKKSKAKSKKTTKKRA 784



 Score = 28.6 bits (64), Expect = 4.7
 Identities = 9/25 (36%), Positives = 11/25 (44%)

Query: 267 KKKKKKKKKTTKKKKRKKKRKKKKN 291
            K K  K    K K + KK  KK+ 
Sbjct: 760 TKGKAAKTVKKKSKAKSKKTTKKRA 784



 Score = 27.9 bits (62), Expect = 8.7
 Identities = 15/37 (40%), Positives = 16/37 (43%), Gaps = 8/37 (21%)

Query: 264 EKKKKKKKK--------KKKTTKKKKRKKKRKKKKNQ 292
           EKK K KKK         K   KK K K K+  KK  
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRA 784


>gnl|CDD|218612 pfam05501, DUF755, Domain of unknown function (DUF755).  This
           family is predominated by ORFs from Circoviridae. The
           function of this family remains to be determined.
          Length = 122

 Score = 31.2 bits (71), Expect = 0.22
 Identities = 14/60 (23%), Positives = 26/60 (43%), Gaps = 19/60 (31%)

Query: 233 KKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
           + ++K+K ++K +Q H                   KKK+KK +    +  KK   KK+ +
Sbjct: 63  QPQRKRKEQQKSSQTH-------------------KKKRKKSRHVSSRSAKKISAKKRRR 103



 Score = 29.3 bits (66), Expect = 1.0
 Identities = 14/56 (25%), Positives = 25/56 (44%), Gaps = 15/56 (26%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTK 278
           ++K+K ++K  +  KKKRKK ++   R                 KK   KK++ + 
Sbjct: 65  QRKRKEQQKSSQTHKKKRKKSRHVSSR---------------SAKKISAKKRRRSS 105



 Score = 27.4 bits (61), Expect = 5.3
 Identities = 14/55 (25%), Positives = 28/55 (50%), Gaps = 11/55 (20%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTT 277
           + ++KRK+++K  +  K+K+KK    R V             KK   KK+++ ++
Sbjct: 63  QPQRKRKEQQKSSQTHKKKRKK---SRHVSSRSA--------KKISAKKRRRSSS 106


>gnl|CDD|227577 COG5252, COG5252, Uncharacterized conserved protein, contains
           CCCH-type Zn-finger protein [General function prediction
           only].
          Length = 299

 Score = 32.4 bits (73), Expect = 0.22
 Identities = 16/74 (21%), Positives = 37/74 (50%), Gaps = 4/74 (5%)

Query: 218 PGFVMKKKKKRKKKKKKKKKKKRKKK----KNQHHRDVWINRTSRGEEEEEKKKKKKKKK 273
           P  + KK+++  KK  +  +K+ + K    KN++          + E    K++ +KK+K
Sbjct: 3   PKKMAKKQQESGKKATRDMRKELEDKTFGLKNKNRSTKVQAIIKQIETLNLKEQLEKKEK 62

Query: 274 KKTTKKKKRKKKRK 287
            +  +K++  +K+ 
Sbjct: 63  MRMEEKRREPEKQV 76



 Score = 31.6 bits (71), Expect = 0.49
 Identities = 17/76 (22%), Positives = 35/76 (46%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
           M  KK  KK+++  KK  R  +K    +   +   +R  + +   K+ +    K   +KK
Sbjct: 1   MPPKKMAKKQQESGKKATRDMRKELEDKTFGLKNKNRSTKVQAIIKQIETLNLKEQLEKK 60

Query: 282 RKKKRKKKKNQHHRDV 297
            K + ++K+ +  + V
Sbjct: 61  EKMRMEEKRREPEKQV 76


>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 197

 Score = 32.0 bits (72), Expect = 0.23
 Identities = 12/57 (21%), Positives = 38/57 (66%)

Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
           KKKK    ++++ +K R++++    R  W  R  + +E E++++++++++++T +++
Sbjct: 116 KKKKSLIIRQEQIEKARQEREELEERMEWERREEKIDEREDQEEQEREREEQTIEEQ 172


>gnl|CDD|219621 pfam07890, Rrp15p, Rrp15p.  Rrp15p is required for the formation of
           60S ribosomal subunits.
          Length = 132

 Score = 31.6 bits (72), Expect = 0.23
 Identities = 8/34 (23%), Positives = 19/34 (55%)

Query: 254 NRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRK 287
           ++     +++ K +K +KK K+  + +KR+   K
Sbjct: 27  SKKLLKAKKKLKSEKLEKKAKRQLRAEKRQALEK 60



 Score = 31.2 bits (71), Expect = 0.24
 Identities = 17/71 (23%), Positives = 30/71 (42%), Gaps = 18/71 (25%)

Query: 221 VMKKKKKRKKKKKK---KKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTT 277
           ++  K    K+K     + KK  K KK               + + EK +KK K++ +  
Sbjct: 9   ILASKLPASKRKDPILSRSKKLLKAKK---------------KLKSEKLEKKAKRQLRAE 53

Query: 278 KKKKRKKKRKK 288
           K++  +K R K
Sbjct: 54  KRQALEKGRVK 64



 Score = 28.1 bits (63), Expect = 2.8
 Identities = 8/38 (21%), Positives = 21/38 (55%)

Query: 257 SRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHH 294
           SR ++  + KKK K +K +   K++ + ++++   +  
Sbjct: 25  SRSKKLLKAKKKLKSEKLEKKAKRQLRAEKRQALEKGR 62



 Score = 26.9 bits (60), Expect = 7.7
 Identities = 11/35 (31%), Positives = 20/35 (57%)

Query: 263 EEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDV 297
           + KKK K +K +K  K++ R +KR+  +    + V
Sbjct: 32  KAKKKLKSEKLEKKAKRQLRAEKRQALEKGRVKPV 66


>gnl|CDD|149438 pfam08374, Protocadherin, Protocadherin.  The structure of
           protocadherins is similar to that of classic cadherins
           (pfam00028), but particularly on the cytoplasmic domains
           they also have some unique features. They are expressed
           in a variety of organisms and are found in high
           concentrations in the brain where they seem to be
           localised mainly at cell-cell contact sites. Their
           expression seems to be developmentally regulated.
          Length = 223

 Score = 32.1 bits (73), Expect = 0.24
 Identities = 14/40 (35%), Positives = 22/40 (55%)

Query: 230 KKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKK 269
           K+KKKKK KK+K  K+     V +  +   +E  E+K + 
Sbjct: 90  KQKKKKKDKKKKSPKSLLLNFVTVEESKPDDEVHEQKSET 129


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 32.7 bits (75), Expect = 0.25
 Identities = 13/66 (19%), Positives = 32/66 (48%), Gaps = 1/66 (1%)

Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
            +   +   + ++ +KR  +  +    +   R    EE+EE+ ++ KKK K+  K+ +  
Sbjct: 299 SEFYEEYLDELREIEKRLSRLEEEINGI-EERIKELEEKEERLEELKKKLKELEKRLEEL 357

Query: 284 KKRKKK 289
           ++R + 
Sbjct: 358 EERHEL 363



 Score = 30.4 bits (69), Expect = 1.5
 Identities = 13/72 (18%), Positives = 34/72 (47%), Gaps = 11/72 (15%)

Query: 219 GFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTK 278
            + +++ +K+ ++ +K K+K  K K              +  ++E +K ++ KKK    +
Sbjct: 514 KYNLEELEKKAEEYEKLKEKLIKLKGEI-----------KSLKKELEKLEELKKKLAELE 562

Query: 279 KKKRKKKRKKKK 290
           KK  + + +  +
Sbjct: 563 KKLDELEEELAE 574



 Score = 30.0 bits (68), Expect = 1.9
 Identities = 14/68 (20%), Positives = 33/68 (48%), Gaps = 3/68 (4%)

Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSR---GEEEEEKKKKKKKKKKKTTKKKK 281
           +K+ + ++ KKK K+ +K+  +      +   ++    E E  KK+      +K  K+ +
Sbjct: 335 EKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELE 394

Query: 282 RKKKRKKK 289
             +K K++
Sbjct: 395 ELEKAKEE 402



 Score = 29.6 bits (67), Expect = 2.2
 Identities = 14/84 (16%), Positives = 43/84 (51%), Gaps = 4/84 (4%)

Query: 222 MKKKKKR----KKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTT 277
           +K+K +      +  ++   + R+ +K     +  IN      +E E+K+++ ++ KK  
Sbjct: 288 LKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKL 347

Query: 278 KKKKRKKKRKKKKNQHHRDVWQKK 301
           K+ +++ +  +++++ + +   KK
Sbjct: 348 KELEKRLEELEERHELYEEAKAKK 371



 Score = 29.6 bits (67), Expect = 2.3
 Identities = 14/69 (20%), Positives = 40/69 (57%), Gaps = 3/69 (4%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK-KK 281
           + +K+ ++ +KK  +++ ++ + ++         +    E E+ +K++++ KKT +K K+
Sbjct: 644 ELRKELEELEKKYSEEEYEELREEYLE--LSRELAGLRAELEELEKRREEIKKTLEKLKE 701

Query: 282 RKKKRKKKK 290
             ++R+K K
Sbjct: 702 ELEEREKAK 710



 Score = 29.6 bits (67), Expect = 2.3
 Identities = 10/72 (13%), Positives = 33/72 (45%), Gaps = 8/72 (11%)

Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
           ++  ++K+K+ ++  +   + +    ++         EE EK +K+ K+ ++  ++ +  
Sbjct: 192 EELIKEKEKELEEVLREINEISSELPEL--------REELEKLEKEVKELEELKEEIEEL 243

Query: 284 KKRKKKKNQHHR 295
           +K  +      R
Sbjct: 244 EKELESLEGSKR 255



 Score = 29.3 bits (66), Expect = 3.1
 Identities = 14/72 (19%), Positives = 31/72 (43%), Gaps = 7/72 (9%)

Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKK----KKKKKKTTKK 279
           + +K  +  +  K+K  +K +    R   I    +  EE E+K K+    K+K ++  K 
Sbjct: 242 ELEKELESLEGSKRKLEEKIRELEER---IEELKKEIEELEEKVKELKELKEKAEEYIKL 298

Query: 280 KKRKKKRKKKKN 291
            +  ++   +  
Sbjct: 299 SEFYEEYLDELR 310



 Score = 28.9 bits (65), Expect = 3.7
 Identities = 13/81 (16%), Positives = 37/81 (45%), Gaps = 3/81 (3%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
           +++ K+  ++ +K+ +     K+    +   I       EE +K+ ++ ++K K  K+ K
Sbjct: 233 LEELKEEIEELEKELESLEGSKRKLEEK---IRELEERIEELKKEIEELEEKVKELKELK 289

Query: 282 RKKKRKKKKNQHHRDVWQKKY 302
            K +   K ++ + +   +  
Sbjct: 290 EKAEEYIKLSEFYEEYLDELR 310



 Score = 28.1 bits (63), Expect = 7.3
 Identities = 7/34 (20%), Positives = 16/34 (47%)

Query: 262 EEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHR 295
           ++E +K ++ KKK    +KK  +  ++       
Sbjct: 545 KKELEKLEELKKKLAELEKKLDELEEELAELLKE 578



 Score = 28.1 bits (63), Expect = 7.3
 Identities = 14/73 (19%), Positives = 37/73 (50%), Gaps = 4/73 (5%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           + +++  +  ++    + + ++ +  R+       + +EE E+++K KK+ +K  K  +R
Sbjct: 663 ELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELEKLEKALER 722

Query: 283 ----KKKRKKKKN 291
               ++K KK K 
Sbjct: 723 VEELREKVKKYKA 735



 Score = 28.1 bits (63), Expect = 7.9
 Identities = 12/65 (18%), Positives = 26/65 (40%)

Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKK 285
           KK+  + +KK  +  ++          +   S  E EE  K+ +    +    K   K+ 
Sbjct: 555 KKKLAELEKKLDELEEELAELLKELEELGFESVEELEERLKELEPFYNEYLELKDAEKEL 614

Query: 286 RKKKK 290
            +++K
Sbjct: 615 EREEK 619



 Score = 27.7 bits (62), Expect = 9.6
 Identities = 16/70 (22%), Positives = 36/70 (51%), Gaps = 7/70 (10%)

Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
           + ++ K+ ++K+++ ++ K+K K+ +        R    EE  E  ++ K KK++  + K
Sbjct: 326 IEERIKELEEKEERLEELKKKLKELE-------KRLEELEERHELYEEAKAKKEELERLK 378

Query: 281 KRKKKRKKKK 290
           KR      +K
Sbjct: 379 KRLTGLTPEK 388


>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3.  This family
           consists of several radial spoke protein 3 (RSP3)
           sequences. Eukaryotic cilia and flagella present in
           diverse types of cells perform motile, sensory, and
           developmental functions in organisms from protists to
           humans. They are centred by precisely organised,
           microtubule-based structures, the axonemes. The axoneme
           consists of two central singlet microtubules, called the
           central pair, and nine outer doublet microtubules. These
           structures are well-conserved during evolution. The
           outer doublet microtubules, each composed of A and B
           sub-fibres, are connected to each other by nexin links,
           while the central pair is held at the centre of the
           axoneme by radial spokes. The radial spokes are T-shaped
           structures extending from the A-tubule of each outer
           doublet microtubule to the centre of the axoneme. Radial
           spoke protein 3 (RSP3), is present at the proximal end
           of the spoke stalk and helps in anchoring the radial
           spoke to the outer doublet. It is thought that radial
           spokes regulate the activity of inner arm dynein through
           protein phosphorylation and dephosphorylation.
          Length = 288

 Score = 32.3 bits (74), Expect = 0.25
 Identities = 13/52 (25%), Positives = 36/52 (69%)

Query: 239 KRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
           ++++++ +  R+  +  T R EE E +++++K+++KK  K++K+++K   +K
Sbjct: 158 RQQQRQFEQRRNAELAETQRLEEAERRRREEKERRKKQDKERKQREKETAEK 209


>gnl|CDD|185768 cd09245, BRO1_UmRIM23-like, Protein-interacting, Bro1-like domain
           of Ustilago maydis Rim23 (PalC), and related domains.
           This family contains the Bro1-like domain of Ustilago
           maydis Rim23 (also known as PalC), and related proteins.
           It belongs to the BRO1_Alix_like superfamily which
           includes the Bro1-like domains of mammalian Alix
           (apoptosis-linked gene-2 interacting protein X),
           His-Domain type N23 protein tyrosine phosphatase
           (HD-PTP, also known as PTPN23), RhoA-binding proteins
           Rhophilin-1 and -2, Brox, Bro1 and Rim20 (also known as
           PalA) from Saccharomyces cerevisiae, and related
           domains. Alix, HD-PTP, Brox, Bro1, Rim20, and Rim23
           interact with the ESCRT (Endosomal Sorting Complexes
           Required for Transport) system. Rim20 and Rim23
           participate in the response to the external pH via the
           Rim101 pathway. Through its Bro1-like domain, Rim23
           allows the interaction between the endosomal and plasma
           membrane complexes. Bro1-like domains are
           boomerang-shape, and part of the domain is a
           tetratricopeptide repeat (TPR)-like structure.
           Intermediates in the Rim101 pathway may play roles in
           the pathogenesis of fungal corneal infection during
           Candida albicans keratitis. This family lacks the
           V-shaped (V) domain found in many members of the
           BRO1_Alix_like superfamily.
          Length = 413

 Score = 32.4 bits (74), Expect = 0.26
 Identities = 14/58 (24%), Positives = 24/58 (41%), Gaps = 8/58 (13%)

Query: 252 WINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHR-------DVWQKKY 302
           W+ R ++ E E+ K       K K  K  K K++ +K +            ++  KKY
Sbjct: 318 WL-RAAKKELEDLKSPSGVASKAKLKKSWKEKREDRKVEKGAGVEEELRTLEMLLKKY 374


>gnl|CDD|224668 COG1754, COG1754, Uncharacterized C-terminal domain of
           topoisomerase IA [General function prediction only].
          Length = 298

 Score = 32.4 bits (74), Expect = 0.26
 Identities = 15/60 (25%), Positives = 19/60 (31%), Gaps = 18/60 (30%)

Query: 231 KKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
           ++ K    K+  KK                      K KK   KK   KK  K K+  KK
Sbjct: 255 RRAKGGPGKKPAKKA------------------TAAKAKKTTAKKAAAKKAAKTKKAAKK 296



 Score = 30.8 bits (70), Expect = 0.74
 Identities = 13/28 (46%), Positives = 13/28 (46%)

Query: 265 KKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
            KK    K KKTT KK   KK  K K  
Sbjct: 266 AKKATAAKAKKTTAKKAAAKKAAKTKKA 293



 Score = 28.5 bits (64), Expect = 4.5
 Identities = 11/29 (37%), Positives = 12/29 (41%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKK 288
            E   K    KK  KK T  K +K   KK
Sbjct: 253 AERRAKGGPGKKPAKKATAAKAKKTTAKK 281



 Score = 28.1 bits (63), Expect = 5.1
 Identities = 12/33 (36%), Positives = 14/33 (42%), Gaps = 2/33 (6%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKR--KKKRKKKK 290
           E   +    KK  KK T  K K+   KK   KK
Sbjct: 254 ERRAKGGPGKKPAKKATAAKAKKTTAKKAAAKK 286


>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO.  Bacillus
           spores are protected by a protein shell consisting of
           over 50 different polypeptides, known as the coat. This
           family of proteins has an important morphogenetic role
           in coat assembly, it is involved in the assembly of at
           least 5 different coat proteins including CotB, CotG,
           CotS, CotSA and CotW. It is likely to act at a late
           stage of coat assembly.
          Length = 185

 Score = 31.7 bits (72), Expect = 0.28
 Identities = 13/82 (15%), Positives = 43/82 (52%)

Query: 220 FVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK 279
           F++KK  +++++K+   +  + K++ Q        +     E E +   ++++K++  ++
Sbjct: 31  FIIKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQE 90

Query: 280 KKRKKKRKKKKNQHHRDVWQKK 301
           ++++++ +  K Q      +KK
Sbjct: 91  EEKEEEAEDVKQQEVFSFKRKK 112



 Score = 29.0 bits (65), Expect = 1.9
 Identities = 15/84 (17%), Positives = 35/84 (41%), Gaps = 13/84 (15%)

Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGE-------------EEEEKKK 267
               KK  +K+++K+   +  K K +  +  +       E             E+EE  +
Sbjct: 30  TFIIKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQ 89

Query: 268 KKKKKKKKTTKKKKRKKKRKKKKN 291
           +++K+++    K++     K+KK 
Sbjct: 90  EEEKEEEAEDVKQQEVFSFKRKKP 113


>gnl|CDD|151665 pfam11223, DUF3020, Protein of unknown function (DUF3020).  This
           family of fungal proteins is conserved towards the
           C-terminus of HMG domains. The function is not known.
          Length = 49

 Score = 29.5 bits (66), Expect = 0.28
 Identities = 11/38 (28%), Positives = 19/38 (50%)

Query: 254 NRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKN 291
           NR  + +  E   ++ K    ++  KKK KKK  K+ +
Sbjct: 1   NRERKKKWREANSERNKDNDLRSRVKKKAKKKFGKEDS 38


>gnl|CDD|219913 pfam08576, DUF1764, Eukaryotic protein of unknown function
           (DUF1764).  This is a family of eukaryotic proteins of
           unknown function. This family contains many hypothetical
           proteins.
          Length = 98

 Score = 30.5 bits (69), Expect = 0.29
 Identities = 13/28 (46%), Positives = 18/28 (64%)

Query: 265 KKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
           KK+KKKKK+   T + K  KK +KK  +
Sbjct: 22  KKRKKKKKRTAKTARPKATKKGQKKDKK 49



 Score = 30.1 bits (68), Expect = 0.45
 Identities = 12/61 (19%), Positives = 27/61 (44%)

Query: 231 KKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
           K++KK +K  K+  +    ++   +  +    +  + K  KK +K  KKK    +  ++ 
Sbjct: 1   KEEKKNEKTDKRDIDDIFSNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEES 60

Query: 291 N 291
            
Sbjct: 61  K 61



 Score = 29.0 bits (65), Expect = 1.1
 Identities = 13/56 (23%), Positives = 28/56 (50%), Gaps = 11/56 (19%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTK 278
           KK+KK+KK+  K  + K  KK              + ++++E  +  ++ K++ T+
Sbjct: 22  KKRKKKKKRTAKTARPKATKKG-----------QKKDKKKDEFPEFPEESKRRRTE 66



 Score = 28.2 bits (63), Expect = 1.9
 Identities = 18/69 (26%), Positives = 35/69 (50%), Gaps = 4/69 (5%)

Query: 235 KKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHH 294
           K++KK +K   +   D++ N   R    ++KKK+  K  +    KK +KK +KK +    
Sbjct: 1   KEEKKNEKTDKRDIDDIFSNIKKR----KKKKKRTAKTARPKATKKGQKKDKKKDEFPEF 56

Query: 295 RDVWQKKYT 303
            +  +++ T
Sbjct: 57  PEESKRRRT 65



 Score = 27.0 bits (60), Expect = 4.4
 Identities = 18/72 (25%), Positives = 27/72 (37%), Gaps = 21/72 (29%)

Query: 217 IPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKT 276
           I       KK++KKKK+  K  + K                       KK +KK KKK  
Sbjct: 14  IDDIFSNIKKRKKKKKRTAKTARPKAT---------------------KKGQKKDKKKDE 52

Query: 277 TKKKKRKKKRKK 288
             +   + KR++
Sbjct: 53  FPEFPEESKRRR 64



 Score = 26.7 bits (59), Expect = 6.5
 Identities = 10/40 (25%), Positives = 17/40 (42%), Gaps = 1/40 (2%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINR-TSRGEE 261
           KK+  +  + K  KK ++K KK     +        R E+
Sbjct: 28  KKRTAKTARPKATKKGQKKDKKKDEFPEFPEESKRRRTED 67


>gnl|CDD|227476 COG5147, REB1, Myb superfamily proteins, including transcription
           factors and mRNA splicing factors [Transcription / RNA
           processing and modification / Cell division and
           chromosome partitioning].
          Length = 512

 Score = 32.5 bits (74), Expect = 0.30
 Identities = 14/79 (17%), Positives = 26/79 (32%), Gaps = 6/79 (7%)

Query: 224 KKKKRK-KKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
            + + K      KK   R KK    H     N     +  +  K+++++  K   +    
Sbjct: 255 DRDEDKFWPNIYKKLPYRDKKSIYKHLRRKYNIF--EQRGKWTKEEEQELAKLVVEHGGS 312

Query: 283 KKKRKK---KKNQHHRDVW 298
             +  K   +     RD W
Sbjct: 313 WTEIGKLLGRMPNDCRDRW 331


>gnl|CDD|185611 PTZ00428, PTZ00428, 60S ribosomal protein L4; Provisional.
          Length = 381

 Score = 32.3 bits (74), Expect = 0.30
 Identities = 17/67 (25%), Positives = 31/67 (46%), Gaps = 3/67 (4%)

Query: 230 KKKKKKKKKKRKKKKN---QHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKR 286
           +  KKK  K+R +KKN        + +N  ++   +     ++K   K   K K R+ ++
Sbjct: 302 RPAKKKPAKRRVQKKNPLKNRRVRLRLNPAAKKLRKLAVLAQEKATAKGAQKVKNRRARK 361

Query: 287 KKKKNQH 293
           K KK + 
Sbjct: 362 KAKKARL 368



 Score = 29.7 bits (67), Expect = 1.9
 Identities = 16/70 (22%), Positives = 30/70 (42%), Gaps = 1/70 (1%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
            K  K+R +KK   K ++ + + N   + +        E+   K  +K  K ++  KK K
Sbjct: 306 KKPAKRRVQKKNPLKNRRVRLRLNPAAKKLRKLAVLAQEKATAKGAQK-VKNRRARKKAK 364

Query: 282 RKKKRKKKKN 291
           + +  K  K 
Sbjct: 365 KARLAKVAKA 374



 Score = 27.7 bits (62), Expect = 8.2
 Identities = 8/44 (18%), Positives = 19/44 (43%), Gaps = 3/44 (6%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKK 266
             ++K   K  +K K +R +KK +      + + ++   +  K 
Sbjct: 341 LAQEKATAKGAQKVKNRRARKKAKK---ARLAKVAKALYKSIKA 381


>gnl|CDD|119392 cd06174, MFS, The Major Facilitator Superfamily (MFS) is a large
           and diverse group of secondary transporters that
           includes uniporters, symporters, and antiporters. MFS
           proteins facilitate the transport across cytoplasmic or
           internal membranes of a variety of substrates including
           ions, sugar phosphates, drugs, neurotransmitters,
           nucleosides, amino acids, and peptides. They do so using
           the electrochemical potential of the transported
           substrates. Uniporters transport a single substrate,
           while symporters and antiporters transport two
           substrates in the same or in opposite directions,
           respectively, across membranes. MFS proteins are
           typically 400 to 600 amino acids in length, and the
           majority contain 12 transmembrane alpha helices (TMs)
           connected by hydrophilic loops. The N- and C-terminal
           halves of these proteins display weak similarity and may
           be the result of a gene duplication/fusion event. Based
           on kinetic studies and the structures of a few bacterial
           superfamily members, GlpT (glycerol-3-phosphate
           transporter), LacY (lactose permease), and EmrD
           (multidrug transporter), MFS proteins are thought to
           function through a single substrate binding site,
           alternating-access mechanism involving a rocker-switch
           type of movement. Bacterial members function primarily
           for nutrient uptake, and as drug-efflux pumps to confer
           antibiotic resistance. Some MFS proteins have medical
           significance in humans such as the glucose transporter
           Glut4, which is impaired in type II diabetes, and
           glucose-6-phosphate transporter (G6PT), which causes
           glycogen storage disease when mutated.
          Length = 352

 Score = 32.3 bits (74), Expect = 0.30
 Identities = 18/97 (18%), Positives = 35/97 (36%), Gaps = 6/97 (6%)

Query: 11  YTYNINYTSTNFSNTSLFPTNSTIHIHIFPFIIK--GMLMCFIIITAILGNLLVIISV-- 66
               + +   +F    L          +        G+L+    +  ILG LL  +    
Sbjct: 178 LLLALAFFLLSFGYYGLLTYLPLYLQEVLGLSAAEAGLLLSLFGLGGILGALLGGLLSDR 237

Query: 67  IKHRKLRIITNYFVVSLAFADLLVALCVMPFNAIVSL 103
           +  R+L ++    + +L    LL+AL       +V+L
Sbjct: 238 LGRRRLLLLIGLLLAALGL--LLLALAPSLALLLVAL 272



 Score = 30.0 bits (68), Expect = 1.5
 Identities = 15/107 (14%), Positives = 34/107 (31%), Gaps = 7/107 (6%)

Query: 22  FSNTSLFPTNSTIHIHIFPFIIKGMLMCFIIITAILGNLL--VIISVIKHRKLRIITNYF 79
               +L+P  + +    FP   +G  +        LG LL  ++  ++            
Sbjct: 99  LGGGALYPAAAALIAEWFPPKERGRALGLFSAGFGLGALLGPLLGGLLAESLGWRWLFLI 158

Query: 80  VVSLAFADLLVALCVMPFNAIVSLTDEWYFGYFMCDVWNSFDVYFST 126
           +  L     L+ L ++    +++L    +F       +     Y   
Sbjct: 159 LAILGLLLALLLLFLLRLLLLLALA---FFLLSF--GYYGLLTYLPL 200


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 32.4 bits (74), Expect = 0.30
 Identities = 7/79 (8%), Positives = 35/79 (44%), Gaps = 1/79 (1%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           + ++  ++ +  +   +  ++  +  +     R  + EE+ EK + + ++  +   +  +
Sbjct: 299 EIEELEEELEGLRALLEELEELLEKLKS-LEERLEKLEEKLEKLESELEELAEEKNELAK 357

Query: 283 KKKRKKKKNQHHRDVWQKK 301
             + + K+ +   +  +K+
Sbjct: 358 LLEERLKELEERLEELEKE 376


>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain.  Axonemal
           dynein light chain proteins play a dynamic role in
           flagellar and cilia motility. Eukaryotic cilia and
           flagella are complex organelles consisting of a core
           structure, the axoneme, which is composed of nine
           microtubule doublets forming a cylinder that surrounds a
           pair of central singlet microtubules. This
           ultra-structural arrangement seems to be one of the most
           stable micro-tubular assemblies known and is responsible
           for the flagellar and ciliary movement of a large number
           of organisms ranging from protozoan to mammals. This
           light chain interacts directly with the N-terminal half
           of the heavy chains.
          Length = 189

 Score = 31.4 bits (72), Expect = 0.32
 Identities = 19/94 (20%), Positives = 42/94 (44%), Gaps = 17/94 (18%)

Query: 205 YYALISSSISFWIPGFVMKKKKKRKKKK----KKKKKKKRKKKKNQHHRDVWINR----T 256
           Y  L  SS+      F M+K  + ++ K    ++ KK + +K++ +        +     
Sbjct: 100 YQTLYESSV-----AFGMRKALQAEQGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIE 154

Query: 257 SRGEEEEEKKKKKKKKK----KKTTKKKKRKKKR 286
            R EEE + ++K+   +    KK  ++ K + ++
Sbjct: 155 KREEEERQIEEKRHADEIAFLKKQNQQLKSQLEQ 188


>gnl|CDD|192292 pfam09429, Wbp11, WW domain binding protein 11.  The WW domain is a
           small protein module with a triple-stranded beta-sheet
           fold. This is a family of WW domain binding proteins.
          Length = 78

 Score = 29.9 bits (68), Expect = 0.32
 Identities = 15/66 (22%), Positives = 36/66 (54%), Gaps = 5/66 (7%)

Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
           +K++KKK+ KK K +R+ +     R+  + + +    + E  K K +++    +  +++K
Sbjct: 11  RKEQKKKELKKNKAERQAR-----REAKLAKKNPDRLQRELDKLKAQEESGKLRPVEKQK 65

Query: 285 KRKKKK 290
            +K +K
Sbjct: 66  LKKLEK 71



 Score = 27.6 bits (62), Expect = 1.9
 Identities = 16/66 (24%), Positives = 34/66 (51%), Gaps = 10/66 (15%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           KK+ K+ K +++ +++ +  KKN            R + E +K K +++  K    +K++
Sbjct: 16  KKELKKNKAERQARREAKLAKKN----------PDRLQRELDKLKAQEESGKLRPVEKQK 65

Query: 283 KKKRKK 288
            KK +K
Sbjct: 66  LKKLEK 71



 Score = 26.5 bits (59), Expect = 5.2
 Identities = 10/31 (32%), Positives = 20/31 (64%)

Query: 261 EEEEKKKKKKKKKKKTTKKKKRKKKRKKKKN 291
           +   K++KKK+ KK   +++ R++ +  KKN
Sbjct: 8   DAYRKEQKKKELKKNKAERQARREAKLAKKN 38


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 32.3 bits (74), Expect = 0.32
 Identities = 10/31 (32%), Positives = 21/31 (67%)

Query: 262 EEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
           E+E+K+  K++K+    KK+ +KK+K+ +  
Sbjct: 556 EKEEKEALKEQKRLRKLKKQEEKKKKELEKL 586



 Score = 31.2 bits (71), Expect = 0.76
 Identities = 14/51 (27%), Positives = 28/51 (54%), Gaps = 16/51 (31%)

Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKK 275
           ++++++K+  K++K+ +K K Q                EEKKKK+ +K +K
Sbjct: 554 QREKEEKEALKEQKRLRKLKKQ----------------EEKKKKELEKLEK 588



 Score = 30.8 bits (70), Expect = 0.95
 Identities = 12/30 (40%), Positives = 23/30 (76%)

Query: 261 EEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
           E+EEK+  K++K+ +  KK++ KKK++ +K
Sbjct: 556 EKEEKEALKEQKRLRKLKKQEEKKKKELEK 585



 Score = 30.8 bits (70), Expect = 1.0
 Identities = 11/30 (36%), Positives = 23/30 (76%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKK 289
           E+EE++  K++K+ +K  K++++KKK  +K
Sbjct: 556 EKEEKEALKEQKRLRKLKKQEEKKKKELEK 585



 Score = 30.4 bits (69), Expect = 1.2
 Identities = 12/31 (38%), Positives = 21/31 (67%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
           +E  +++K+ +K KK+  KKKK  +K +K K
Sbjct: 560 KEALKEQKRLRKLKKQEEKKKKELEKLEKAK 590



 Score = 30.4 bits (69), Expect = 1.3
 Identities = 11/31 (35%), Positives = 23/31 (74%)

Query: 258 RGEEEEEKKKKKKKKKKKTTKKKKRKKKRKK 288
           R +EE+E  K++K+ +K   +++K+KK+ +K
Sbjct: 555 REKEEKEALKEQKRLRKLKKQEEKKKKELEK 585



 Score = 30.4 bits (69), Expect = 1.5
 Identities = 12/31 (38%), Positives = 22/31 (70%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
           EE+E  K++K+ +K K  ++KK+K+  K +K
Sbjct: 558 EEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588



 Score = 29.6 bits (67), Expect = 2.5
 Identities = 11/32 (34%), Positives = 23/32 (71%)

Query: 258 RGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKK 289
           + E+E  K++K+ +K KK  +KKK++ ++ +K
Sbjct: 557 KEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588



 Score = 28.5 bits (64), Expect = 5.8
 Identities = 10/41 (24%), Positives = 28/41 (68%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQK 300
           +++EE +++K++K+    +K+ RK K++++K +   +  +K
Sbjct: 548 DDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588



 Score = 28.1 bits (63), Expect = 7.1
 Identities = 19/90 (21%), Positives = 36/90 (40%), Gaps = 22/90 (24%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWIN----------------------RTSRGE 260
           +  K++K+ +K KK++++KKK+ +      I                         + GE
Sbjct: 561 EALKEQKRLRKLKKQEEKKKKELEKLEKAKIPPAEFFKRQEDKYSAFDETGLPTHDADGE 620

Query: 261 EEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
           E  +K++KK  K+     K   +   K  K
Sbjct: 621 EISKKERKKLSKEYDKQAKLHEEYLAKGGK 650



 Score = 27.7 bits (62), Expect = 9.9
 Identities = 18/81 (22%), Positives = 33/81 (40%), Gaps = 13/81 (16%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEE---EEEKKKKK--------KK 271
           +K+  +++K+ +K KK+ +KKK +  +           E    +E K             
Sbjct: 559 EKEALKEQKRLRKLKKQEEKKKKELEK--LEKAKIPPAEFFKRQEDKYSAFDETGLPTHD 616

Query: 272 KKKKTTKKKKRKKKRKKKKNQ 292
              +   KK+RKK  K+   Q
Sbjct: 617 ADGEEISKKERKKLSKEYDKQ 637


>gnl|CDD|240254 PTZ00069, PTZ00069, 60S ribosomal protein L5; Provisional.
          Length = 300

 Score = 32.0 bits (73), Expect = 0.33
 Identities = 17/59 (28%), Positives = 29/59 (49%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
           KK     +    K KKK+KKKK   H+     + +  + +   K KK +++++  KK K
Sbjct: 242 KKAHAAIRANPSKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRRERLQKKIK 300



 Score = 29.7 bits (67), Expect = 1.7
 Identities = 19/44 (43%), Positives = 25/44 (56%), Gaps = 3/44 (6%)

Query: 261 EEEEKKKKKKKKKKKTTKKK---KRKKKRKKKKNQHHRDVWQKK 301
            +++KKKKKK   KK   KK   K++K R K K    R+  QKK
Sbjct: 255 VKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRRERLQKK 298



 Score = 29.7 bits (67), Expect = 2.1
 Identities = 15/27 (55%), Positives = 17/27 (62%)

Query: 264 EKKKKKKKKKKKTTKKKKRKKKRKKKK 290
            K KKKKKKKKK   KK + KK   K+
Sbjct: 253 SKVKKKKKKKKKVVHKKYKTKKLTGKQ 279



 Score = 28.5 bits (64), Expect = 4.2
 Identities = 16/66 (24%), Positives = 31/66 (46%), Gaps = 7/66 (10%)

Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
           KK     +    K K+KKKK +           +  + ++   K++K + K  K ++R++
Sbjct: 242 KKAHAAIRANPSKVKKKKKKKKK-------VVHKKYKTKKLTGKQRKARVKAKKAQRRER 294

Query: 285 KRKKKK 290
            +KK K
Sbjct: 295 LQKKIK 300


>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
          Length = 430

 Score = 31.9 bits (72), Expect = 0.37
 Identities = 17/67 (25%), Positives = 22/67 (32%)

Query: 230 KKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKK 289
           K   K K     K K           T    EEEE+  K+K K K     K +     K+
Sbjct: 124 KAAAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDKEKAKAKAAAAAKAKAAALAKQ 183

Query: 290 KNQHHRD 296
           K     +
Sbjct: 184 KAAEAGE 190



 Score = 30.7 bits (69), Expect = 0.86
 Identities = 18/61 (29%), Positives = 21/61 (34%)

Query: 229 KKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKK 288
           K   K K     K K     +          EEEEE  K+K K K     K K     K+
Sbjct: 124 KAAAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDKEKAKAKAAAAAKAKAAALAKQ 183

Query: 289 K 289
           K
Sbjct: 184 K 184



 Score = 30.3 bits (68), Expect = 1.3
 Identities = 14/63 (22%), Positives = 17/63 (26%)

Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
           K   K K     K K      Q             E ++EK K K     K       K+
Sbjct: 124 KAAAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDKEKAKAKAAAAAKAKAAALAKQ 183

Query: 285 KRK 287
           K  
Sbjct: 184 KAA 186



 Score = 28.4 bits (63), Expect = 4.5
 Identities = 15/67 (22%), Positives = 23/67 (34%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
            +++K   K K     K K       +       +  EEE +K+K K K       K   
Sbjct: 120 TEEEKAAAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDKEKAKAKAAAAAKAKAAA 179

Query: 283 KKKRKKK 289
             K+K  
Sbjct: 180 LAKQKAA 186


>gnl|CDD|216782 pfam01914, MarC, MarC family integral membrane protein.  Integral
           membrane protein family that includes the antibiotic
           resistance protein MarC. These proteins may be
           transporters.
          Length = 203

 Score = 31.5 bits (72), Expect = 0.39
 Identities = 19/76 (25%), Positives = 28/76 (36%), Gaps = 13/76 (17%)

Query: 39  FPFIIKGMLMCFIIITAILGNLLVIISVIKH--RKLRIITNYFVVSLAFADLLVALCVMP 96
             FI    L  F II  I GN+ V IS+ ++   + R         +AF  LL+ L    
Sbjct: 1   LKFIFSAFLSLFAIINPI-GNVPVFISLTENYPAEERKRIILRASIIAFLILLLFL---- 55

Query: 97  FNAIVSLTDEWYFGYF 112
                 +  +     F
Sbjct: 56  ------VFGKLILKLF 65


>gnl|CDD|177433 PHA02608, 67, prohead core protein; Provisional.
          Length = 80

 Score = 29.8 bits (67), Expect = 0.39
 Identities = 7/44 (15%), Positives = 11/44 (25%)

Query: 247 HHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
               V I R+   E EE +     +       K  +        
Sbjct: 33  EEEKVEIARSVMIEGEEPEDDDDDEDDDDDDDKDDKDDDDDDDD 76


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 32.0 bits (73), Expect = 0.41
 Identities = 11/71 (15%), Positives = 38/71 (53%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
           +K++ +  K++ +K + + ++ + +    V  +R  R  +   ++ +K+ ++KK   ++ 
Sbjct: 441 LKRELEELKREIEKLESELERFRREVRDKVRKDREIRARDRRIERLEKELEEKKKRVEEL 500

Query: 282 RKKKRKKKKNQ 292
            +K  + +K +
Sbjct: 501 ERKLAELRKMR 511


>gnl|CDD|235040 PRK02463, PRK02463, OxaA-like protein precursor; Provisional.
          Length = 307

 Score = 31.6 bits (72), Expect = 0.41
 Identities = 15/66 (22%), Positives = 30/66 (45%), Gaps = 4/66 (6%)

Query: 217 IPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKT 276
           I  +++K + +++  ++  K   +  K     +DV    T    +      KKKKK +  
Sbjct: 246 ITTYILKPRLRKQIAEEFAKNPPKAYKSPSGRKDV----TPSQNQAIASLPKKKKKNRNA 301

Query: 277 TKKKKR 282
            K++KR
Sbjct: 302 GKQRKR 307


>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
          Length = 1094

 Score = 32.1 bits (73), Expect = 0.41
 Identities = 15/26 (57%), Positives = 19/26 (73%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQH 247
            +KKKK+KK K+KKKKK R+  K  H
Sbjct: 683 SRKKKKKKKSKEKKKKKNREASKQIH 708



 Score = 31.4 bits (71), Expect = 0.74
 Identities = 14/25 (56%), Positives = 19/25 (76%)

Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHH 248
           +KKK+KKK K+KKKKK ++   Q H
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQIH 708



 Score = 30.6 bits (69), Expect = 1.2
 Identities = 14/25 (56%), Positives = 19/25 (76%)

Query: 269 KKKKKKKTTKKKKRKKKRKKKKNQH 293
           +KKKKKK +K+KK+KK R+  K  H
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQIH 708



 Score = 29.1 bits (65), Expect = 4.2
 Identities = 12/25 (48%), Positives = 17/25 (68%)

Query: 270 KKKKKKTTKKKKRKKKRKKKKNQHH 294
           +KKKKK   K+K+KKK ++   Q H
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQIH 708



 Score = 28.3 bits (63), Expect = 6.6
 Identities = 12/23 (52%), Positives = 18/23 (78%)

Query: 268 KKKKKKKKTTKKKKRKKKRKKKK 290
           +KKKKKKK+ +KKK+K +   K+
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQ 706



 Score = 28.3 bits (63), Expect = 7.0
 Identities = 12/23 (52%), Positives = 17/23 (73%)

Query: 267 KKKKKKKKKTTKKKKRKKKRKKK 289
           +KKKKKKK   KKKK+ ++  K+
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQ 706



 Score = 28.3 bits (63), Expect = 7.4
 Identities = 13/23 (56%), Positives = 17/23 (73%)

Query: 266 KKKKKKKKKKTTKKKKRKKKRKK 288
           +KKKKKKK K  KKKK ++  K+
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQ 706


>gnl|CDD|223683 COG0610, COG0610, Type I site-specific restriction-modification
           system, R (restriction) subunit and related helicases
           [Defense mechanisms].
          Length = 962

 Score = 32.0 bits (73), Expect = 0.42
 Identities = 11/71 (15%), Positives = 25/71 (35%), Gaps = 3/71 (4%)

Query: 229 KKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEK---KKKKKKKKKKTTKKKKRKKK 285
           +    K    +  K+  +     WI    + ++  E+      + + KK  T +K  +  
Sbjct: 823 QTFNDKNGAYESLKELIERIIKEWIEDLRQKKKLIERLIEAINQYRAKKLDTAEKLEELY 882

Query: 286 RKKKKNQHHRD 296
              KK +  + 
Sbjct: 883 ILAKKEEEFKQ 893


>gnl|CDD|221673 pfam12626, PolyA_pol_arg_C, Polymerase A arginine-rich C-terminus. 
           The C-terminus of polymerase A in E coli is
           arginine-rich and is necessary for full functioning of
           the enzyme.
          Length = 123

 Score = 30.2 bits (69), Expect = 0.45
 Identities = 7/31 (22%), Positives = 20/31 (64%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
            +EE+++   +  + +   K++R++ R++KK
Sbjct: 93  ADEEQRRAMIEALQGREGGKRRRRRPRRRKK 123


>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein. 
          Length = 198

 Score = 31.3 bits (71), Expect = 0.46
 Identities = 11/24 (45%), Positives = 15/24 (62%)

Query: 267 KKKKKKKKKTTKKKKRKKKRKKKK 290
            K+ KKK+ TT +K    K+ KKK
Sbjct: 48  PKQPKKKRPTTPRKPATTKKSKKK 71



 Score = 29.0 bits (65), Expect = 2.5
 Identities = 11/24 (45%), Positives = 14/24 (58%)

Query: 266 KKKKKKKKKKTTKKKKRKKKRKKK 289
           KKK+    +K    KK KKK K+K
Sbjct: 52  KKKRPTTPRKPATTKKSKKKDKEK 75



 Score = 27.8 bits (62), Expect = 6.0
 Identities = 10/24 (41%), Positives = 15/24 (62%)

Query: 265 KKKKKKKKKKKTTKKKKRKKKRKK 288
           KKK+    +K  T KK +KK ++K
Sbjct: 52  KKKRPTTPRKPATTKKSKKKDKEK 75



 Score = 27.8 bits (62), Expect = 6.4
 Identities = 11/24 (45%), Positives = 15/24 (62%)

Query: 264 EKKKKKKKKKKKTTKKKKRKKKRK 287
           +KK+    +K  TTKK K+K K K
Sbjct: 52  KKKRPTTPRKPATTKKSKKKDKEK 75


>gnl|CDD|177120 MTH00045, ND6, NADH dehydrogenase subunit 6; Validated.
          Length = 162

 Score = 30.6 bits (70), Expect = 0.48
 Identities = 31/138 (22%), Positives = 57/138 (41%), Gaps = 44/138 (31%)

Query: 46  MLMCFIIITAILGNLLVIISVIKHRKLRIITNYF------VVSLAFADLLVALCVMPFNA 99
           M+    +I  + G+ LV  S         ++ Y+      +VSL+   LL++L  + F A
Sbjct: 1   MVFYLFLILMLFGSTLVFYS---------LSPYYSALGLVLVSLSGC-LLLSLLGLSFLA 50

Query: 100 IVSLTDEWYFGYFMCDVWNSFDVY-FSTASILHLCCISVDRYYAIVKPLQYPIIMNQN-- 156
           ++ L    Y G  +        V+ +S+A       +S +R         YP + N    
Sbjct: 51  LLLLL--IYMGGMLV-------VFVYSSA-------LSAER---------YPSVSNLGEV 85

Query: 157 TVLLMLSNVWILPGIISF 174
            VL +L ++W+     +F
Sbjct: 86  LVLSLLLSLWVFISFDNF 103


>gnl|CDD|225620 COG3078, COG3078, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 169

 Score = 31.0 bits (70), Expect = 0.50
 Identities = 17/71 (23%), Positives = 30/71 (42%), Gaps = 11/71 (15%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
           MK+ KK ++ + K  KK R+K            R     E  ++K++KK+K   +  +  
Sbjct: 1   MKRSKKTRRPRSKADKKARRK-----------TREELDAEARDRKRQKKRKGLASGSRHS 49

Query: 282 RKKKRKKKKNQ 292
              +    K Q
Sbjct: 50  GGNENSGNKQQ 60



 Score = 27.5 bits (61), Expect = 6.5
 Identities = 9/36 (25%), Positives = 22/36 (61%)

Query: 266 KKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQKK 301
           K+ KK ++ ++   KK ++K +++ +   RD  ++K
Sbjct: 2   KRSKKTRRPRSKADKKARRKTREELDAEARDRKRQK 37


>gnl|CDD|219111 pfam06625, DUF1151, Protein of unknown function (DUF1151).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 122

 Score = 30.1 bits (68), Expect = 0.51
 Identities = 12/56 (21%), Positives = 39/56 (69%), Gaps = 1/56 (1%)

Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKT 276
           V++K+K+ +  K++K++++ K+ +++  R++ + R  + E+ E +K K+ +++++ 
Sbjct: 56  VLEKRKRDQVLKEQKEEEEAKRLQSELEREL-MKRAQKLEQLELEKAKEDEEQEQI 110


>gnl|CDD|216985 pfam02349, MSG, Major surface glycoprotein.  This is a novel repeat
           in Pneumocystis carinii Major surface glycoprotein (MSG)
           some members of the alignment have up to nine repeats of
           this family, the repeats containing several conserved
           cysteines. The MSG of P. carinii is an important protein
           in host-pathogen interactions. Surface glycoprotein A
           from Pneumocystis carinii is a main target for the host
           immune system, this protein is implicated in the
           attachment of Pneumocystis carinii to the host alveolar
           epithelial cells, alveolar macrophages, host surfactant
           and possibly accounts in part for the hypoxia seen in
           Pneumocystis carinii pneumonia (PCP).
          Length = 81

 Score = 29.3 bits (66), Expect = 0.53
 Identities = 18/62 (29%), Positives = 33/62 (53%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           ++     K+K +K K+K  K K     +  + +  +GE +++KK KK  K+K T  KK+ 
Sbjct: 20  EEACPLLKEKCEKLKEKCYKLKKDLKLEELLLKELKGELKKKKKCKKALKEKCTKLKKES 79

Query: 283 KK 284
            +
Sbjct: 80  NE 81


>gnl|CDD|224748 COG1835, COG1835, Predicted acyltransferases [Lipid metabolism].
          Length = 386

 Score = 31.3 bits (71), Expect = 0.54
 Identities = 34/177 (19%), Positives = 75/177 (42%), Gaps = 12/177 (6%)

Query: 38  IFPFIIKGMLMCFIIITAILGNLLVIISVIKHRKLRIITNYFVVSLAFADLLVALCVMPF 97
           +F F++ G L+   ++ +    ++ + + +  R  RI+    VV L  A   + L ++P 
Sbjct: 55  LF-FVLSGFLITRSLLRSAAAPVISLAAFLARRLRRILPALLVVLLLTA--ALLLALLPP 111

Query: 98  NAIVSLTDEWYFGYFMCDVWNSFDV--YFSTASILHLCC----ISVDRYYAIVKPLQYPI 151
            A  +L  +    +++ ++  +  V  YF+ + +L        +SV+  + ++ PL   +
Sbjct: 112 GASAALWADLAALFYLSNLLLTLAVGAYFAASGLLSPLLHLWSLSVEEQFYLLFPLLLLL 171

Query: 152 IMNQNTVLLMLSNVWILPGIISFTPIMLGWYTTPDHKQYRKHHPNVCIFVVNVYYAL 208
           +      L +L  +  L     F  + LG + T     Y      +  F++    AL
Sbjct: 172 LRRVRFALALLLLLLAL--ASFFLLLALGPFATGAALYYGTATR-LWEFLLGALAAL 225


>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe.  JAK1 is a
           tyrosine kinase protein essential in signaling type I
           and type II cytokines. It interacts with the gamma chain
           of type I cytokine receptors to elicit signals from the
           IL-2 receptor family, the IL-4 receptor family, the
           gp130 receptor family, ciliary neurotrophic factor
           receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
           Leptin-R). It also is involved in transducing a signal
           by type I (IFN-alpha/beta) and type II (IFN-gamma)
           interferons, and members of the IL-10 family via type II
           cytokine receptors. JAK (also called Just Another
           Kinase) is a family of intracellular, non-receptor
           tyrosine kinases that transduce cytokine-mediated
           signals via the JAK-STAT pathway. The JAK family in
           mammals consists of 4 members: JAK1, JAK2, JAK3 and
           TYK2. JAKs are composed of seven JAK homology (JH)
           domains (JH1-JH7) . The C-terminal JH1 domain is the
           main catalytic domain, followed by JH2, which is often
           referred to as a pseudokinase domain, followed by
           JH3-JH4 which is homologous to the SH2 domain, and
           lastly JH5-JH7 which is a FERM domain.  Named after
           Janus, the two-faced Roman god of doorways, JAKs possess
           two near-identical phosphate-transferring domains; one
           which displays the kinase activity (JH1), while the
           other negatively regulates the kinase activity of the
           first (JH2). The FERM domain has a cloverleaf tripart
           structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
           C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
           C-lobe/F3 within the FERM domain is part of the PH
           domain family. The FERM domain is found in the
           cytoskeletal-associated proteins such as ezrin, moesin,
           radixin, 4.1R, and merlin. These proteins provide a link
           between the membrane and cytoskeleton and are involved
           in signal transduction pathways. The FERM domain is also
           found in protein tyrosine phosphatases (PTPs) , the
           tyrosine kinases FAK and JAK, in addition to other
           proteins involved in signaling. This domain is
           structurally similar to the PH and PTB domains and
           consequently is capable of binding to both peptides and
           phospholipids at different sites.
          Length = 198

 Score = 31.0 bits (70), Expect = 0.55
 Identities = 17/34 (50%), Positives = 19/34 (55%), Gaps = 3/34 (8%)

Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWIN 254
           V KKKK + KK K K KK   KKK    R+ W N
Sbjct: 89  VEKKKKGKSKKNKLKGKKDEDKKKA---REGWNN 119



 Score = 29.4 bits (66), Expect = 1.6
 Identities = 13/25 (52%), Positives = 15/25 (60%)

Query: 264 EKKKKKKKKKKKTTKKKKRKKKRKK 288
           EKKKK K KK K   KK   KK+ +
Sbjct: 90  EKKKKGKSKKNKLKGKKDEDKKKAR 114



 Score = 28.6 bits (64), Expect = 2.7
 Identities = 13/35 (37%), Positives = 19/35 (54%)

Query: 252 WINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKR 286
           W  + +    E++KK K KK K K  K + +KK R
Sbjct: 80  WRRKPATTAVEKKKKGKSKKNKLKGKKDEDKKKAR 114



 Score = 27.9 bits (62), Expect = 4.9
 Identities = 11/28 (39%), Positives = 16/28 (57%)

Query: 267 KKKKKKKKKTTKKKKRKKKRKKKKNQHH 294
           KKKK K KK   K K+ + +KK +   +
Sbjct: 91  KKKKGKSKKNKLKGKKDEDKKKAREGWN 118



 Score = 27.9 bits (62), Expect = 6.0
 Identities = 12/24 (50%), Positives = 15/24 (62%)

Query: 266 KKKKKKKKKKTTKKKKRKKKRKKK 289
           KKKK K KK   K KK + K+K +
Sbjct: 91  KKKKGKSKKNKLKGKKDEDKKKAR 114



 Score = 27.1 bits (60), Expect = 9.3
 Identities = 13/25 (52%), Positives = 16/25 (64%)

Query: 263 EEKKKKKKKKKKKTTKKKKRKKKRK 287
           E+KKK K KK K   KK + KKK +
Sbjct: 90  EKKKKGKSKKNKLKGKKDEDKKKAR 114


>gnl|CDD|222592 pfam14204, Ribosomal_L18_c, Ribosomal L18 C-terminal region.  This
           domain is the C-terminal end of ribosomal L18/L5
           proteins.
          Length = 93

 Score = 29.4 bits (67), Expect = 0.58
 Identities = 9/21 (42%), Positives = 14/21 (66%)

Query: 264 EKKKKKKKKKKKTTKKKKRKK 284
            +KKK+KK+ K  +K+   KK
Sbjct: 60  SRKKKEKKEVKAESKRYNAKK 80



 Score = 28.7 bits (65), Expect = 1.0
 Identities = 8/20 (40%), Positives = 14/20 (70%)

Query: 223 KKKKKRKKKKKKKKKKKRKK 242
           +KKK++K+ K + K+   KK
Sbjct: 61  RKKKEKKEVKAESKRYNAKK 80



 Score = 27.5 bits (62), Expect = 2.5
 Identities = 9/20 (45%), Positives = 13/20 (65%)

Query: 224 KKKKRKKKKKKKKKKKRKKK 243
           +KKK KK+ K + K+   KK
Sbjct: 61  RKKKEKKEVKAESKRYNAKK 80



 Score = 27.1 bits (61), Expect = 3.7
 Identities = 9/24 (37%), Positives = 14/24 (58%)

Query: 266 KKKKKKKKKKTTKKKKRKKKRKKK 289
           +KKK+KK+ K   K+   KK   +
Sbjct: 61  RKKKEKKEVKAESKRYNAKKLTLE 84



 Score = 26.4 bits (59), Expect = 6.8
 Identities = 9/25 (36%), Positives = 14/25 (56%)

Query: 268 KKKKKKKKTTKKKKRKKKRKKKKNQ 292
           +KKK+KK+   + KR   +K    Q
Sbjct: 61  RKKKEKKEVKAESKRYNAKKLTLEQ 85



 Score = 26.4 bits (59), Expect = 8.4
 Identities = 8/24 (33%), Positives = 14/24 (58%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQ 246
           KKK+K++ K + K+   +K    Q
Sbjct: 62  KKKEKKEVKAESKRYNAKKLTLEQ 85


>gnl|CDD|219640 pfam07916, TraG_N, TraG-like protein, N-terminal region.  The
           bacterial sequences found in this family are similar to
           the N-terminal region of the TraG protein. This is a
           membrane-spanning protein, with three predicted
           transmembrane segments and two periplasmic regions. TraG
           protein is known to be essential for DNA transfer in the
           process of conjugation, with the N-terminal portion
           being required for F pilus assembly. The protein is
           thought to interact with the periplasmic domain of TraN
           to stabilise mating-cell interactions.
          Length = 462

 Score = 31.5 bits (72), Expect = 0.58
 Identities = 20/61 (32%), Positives = 28/61 (45%), Gaps = 5/61 (8%)

Query: 50  FIIITAILGNLLVIISVIKHRKLRIITNYFVVSLAFADLLVALCVMPFNAIVSLTDEWYF 109
            I I A LG L   I       LR I N+ +V+L    L+V   ++P    V +TD  + 
Sbjct: 30  LIRIAAWLGVLAEGIDEGNKGDLRRIENWLLVAL----LVVMFLLVPKTT-VQITDRTFS 84

Query: 110 G 110
           G
Sbjct: 85  G 85


>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon. 
          Length = 431

 Score = 31.5 bits (71), Expect = 0.59
 Identities = 15/60 (25%), Positives = 39/60 (65%), Gaps = 7/60 (11%)

Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
           KKKR++++K  +++++++K+ +  R          EEEE+++ K++ ++++    +KR+K
Sbjct: 220 KKKREERRKVLEEEEQRRKQEEADRKS-------REEEEKRRLKEEIERRRAEAAEKRQK 272


>gnl|CDD|225664 COG3122, COG3122, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 215

 Score = 31.1 bits (70), Expect = 0.60
 Identities = 13/55 (23%), Positives = 27/55 (49%)

Query: 238 KKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
           KKR K +    +     R +     EE K ++ ++ K+ ++++K+    K+ K Q
Sbjct: 58  KKRAKVQRLAKKSRVDAREAYRIAAEENKAEQLERDKQLSEQQKQAALAKEYKAQ 112


>gnl|CDD|177779 PLN00185, PLN00185, 60S ribosomal protein L4-1; Provisional.
          Length = 405

 Score = 31.3 bits (71), Expect = 0.63
 Identities = 17/66 (25%), Positives = 21/66 (31%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           KK  KR   KK   K      K   +           E    K KK+K  KK+    K+ 
Sbjct: 311 KKDVKRAPLKKNPLKNLNAMLKLNPYAKTARRMALLAEAARVKAKKEKLAKKRKQVSKEE 370

Query: 283 KKKRKK 288
               K 
Sbjct: 371 AAAIKA 376



 Score = 30.2 bits (68), Expect = 1.6
 Identities = 17/67 (25%), Positives = 24/67 (35%), Gaps = 2/67 (2%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           K  K+   KK   K      K N + +     R +   E    K KK+K  KK  +  K 
Sbjct: 312 KDVKRAPLKKNPLKNLNAMLKLNPYAKTA--RRMALLAEAARVKAKKEKLAKKRKQVSKE 369

Query: 283 KKKRKKK 289
           +    K 
Sbjct: 370 EAAAIKA 376


>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family.  This family of
           proteins contain a band 4.1 domain (pfam00373), at their
           amino terminus. This family represents the rest of these
           proteins.
          Length = 244

 Score = 30.9 bits (70), Expect = 0.69
 Identities = 8/78 (10%), Positives = 36/78 (46%), Gaps = 4/78 (5%)

Query: 221 VMKKKK----KRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKT 276
             ++++    +R ++ ++  ++ +K+ +      + +    + EEEE +  +KK  + + 
Sbjct: 2   EAEREQQELEERMEQMEEDMRRAQKELEEYEETALELEEKLKQEEEEAQLLEKKADELEE 61

Query: 277 TKKKKRKKKRKKKKNQHH 294
             ++  ++    ++ +  
Sbjct: 62  ENRRLEEEAAASEEERER 79


>gnl|CDD|226159 COG3633, SstT, Na+/serine symporter [Amino acid transport and
          metabolism].
          Length = 407

 Score = 31.2 bits (71), Expect = 0.70
 Identities = 21/59 (35%), Positives = 33/59 (55%), Gaps = 1/59 (1%)

Query: 38 IFPFIIKGMLMCFIIITAILGNLLVIISVIKHRKLRIITNYFVVSL-AFADLLVALCVM 95
          +F  + +G L+  I+I  +LG LL IIS      + ++   FV +L A A +LV + VM
Sbjct: 7  LFRLLARGSLVKQILIGLVLGILLAIISPPAAEAVGLLGTLFVGALKAVAPVLVFILVM 65


>gnl|CDD|179580 PRK03449, PRK03449, putative inner membrane protein translocase
           component YidC; Provisional.
          Length = 304

 Score = 30.8 bits (70), Expect = 0.70
 Identities = 13/30 (43%), Positives = 17/30 (56%)

Query: 261 EEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
           EEE KK+ K +++     K   K KR KKK
Sbjct: 273 EEEAKKQAKAERRAANAPKPGAKPKRSKKK 302



 Score = 28.5 bits (64), Expect = 4.9
 Identities = 11/34 (32%), Positives = 18/34 (52%), Gaps = 5/34 (14%)

Query: 260 EEEEEKKKKKKKKKKKTT-----KKKKRKKKRKK 288
           ++EEE KK+ K +++         K KR KK+  
Sbjct: 271 DKEEEAKKQAKAERRAANAPKPGAKPKRSKKKAP 304



 Score = 27.7 bits (62), Expect = 6.8
 Identities = 10/31 (32%), Positives = 17/31 (54%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
           EEE +K+ K +++     K   + K+ KKK 
Sbjct: 273 EEEAKKQAKAERRAANAPKPGAKPKRSKKKA 303


>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
           eukaryotic snRNP [Transcription].
          Length = 564

 Score = 31.2 bits (71), Expect = 0.71
 Identities = 15/60 (25%), Positives = 28/60 (46%)

Query: 220 FVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK 279
            + ++  + K+     +  K   +K +  + +   R    EE   K KKKK+KKK+  +K
Sbjct: 381 NLDRQLSELKEAIAYYESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKEWFEK 440



 Score = 29.3 bits (66), Expect = 2.6
 Identities = 18/74 (24%), Positives = 32/74 (43%), Gaps = 6/74 (8%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRD----VW--INRTSRGEEEEEKKKKKKKKKKKT 276
           +  KK KK K  K    R+  + +            + +    +  EE +++  ++    
Sbjct: 367 RYFKKYKKLKGAKVNLDRQLSELKEAIAYYESAKTALEKAEGKKAIEEIREELIEEGLLK 426

Query: 277 TKKKKRKKKRKKKK 290
           +KKKKRKKK   +K
Sbjct: 427 SKKKKRKKKEWFEK 440


>gnl|CDD|227891 COG5604, COG5604, Uncharacterized conserved protein [Function
           unknown].
          Length = 523

 Score = 31.0 bits (70), Expect = 0.71
 Identities = 17/61 (27%), Positives = 22/61 (36%), Gaps = 1/61 (1%)

Query: 219 GFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKK-KKKKKKKKKTT 277
           G   K  KK  K   K    +RK+           NR S  E + E       K K+K +
Sbjct: 2   GKASKATKKFTKNHLKNTIDRRKQLARSKKVYGTKNRNSHTENKMESGTNDNNKNKEKLS 61

Query: 278 K 278
           K
Sbjct: 62  K 62


>gnl|CDD|206228 pfam14058, PcfK, PcfK-like protein.  The PcfK-like protein family
           includes the Enterococcus faecalis PcfK protein, which
           is functionally uncharacterized. This family of proteins
           is found in bacteria and viruses. Proteins in this
           family are typically between 137 and 257 amino acids in
           length. There are two completely conserved residues (D
           and L) that may be functionally important.
          Length = 136

 Score = 30.0 bits (68), Expect = 0.71
 Identities = 14/65 (21%), Positives = 25/65 (38%), Gaps = 5/65 (7%)

Query: 228 RKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRK 287
            +   K  K    +   N       +  T   + E  K+  K  ++++  K +KR KK K
Sbjct: 68  DEDDIKVGKPINCRVTVNHT-----VELTEEEKAEARKEALKAYQQEELRKIQKRSKKSK 122

Query: 288 KKKNQ 292
           K +  
Sbjct: 123 KAEPV 127


>gnl|CDD|204792 pfam11947, DUF3464, Protein of unknown function (DUF3464).  This
           family of proteins are functionally uncharacterized.
           This protein is found in bacteria and eukaryotes.
           Proteins in this family are typically between 137 to 196
           amino acids in length.
          Length = 149

 Score = 30.0 bits (68), Expect = 0.76
 Identities = 9/51 (17%), Positives = 20/51 (39%), Gaps = 6/51 (11%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKK 272
           M   +KR   + K+ +KK  K   +             +++++ K K+   
Sbjct: 1   MASSRKRLPFEPKRSRKKGVKALRK------AAVAKSKDKQKKPKSKRAAS 45



 Score = 28.1 bits (63), Expect = 4.1
 Identities = 13/39 (33%), Positives = 18/39 (46%)

Query: 254 NRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
             +SR     E K+ +KK  K   K    K K K+KK +
Sbjct: 1   MASSRKRLPFEPKRSRKKGVKALRKAAVAKSKDKQKKPK 39



 Score = 26.9 bits (60), Expect = 9.1
 Identities = 10/31 (32%), Positives = 17/31 (54%)

Query: 262 EEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
           E ++ +KK  K  +     K K K+KK K++
Sbjct: 11  EPKRSRKKGVKALRKAAVAKSKDKQKKPKSK 41


>gnl|CDD|215822 pfam00257, Dehydrin, Dehydrin. 
          Length = 137

 Score = 29.8 bits (67), Expect = 0.76
 Identities = 15/59 (25%), Positives = 21/59 (35%), Gaps = 6/59 (10%)

Query: 223 KKKKKRKKK--KKKKKKK----KRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKK 275
             +   +KK  K+K K+K       K +                  EEKK K  K K+K
Sbjct: 72  DGQGGERKKGIKEKIKEKLPGGHGGKGQATTTTTTGTGGGYGQHGHEEKKGKMDKIKEK 130


>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62. 
          Length = 217

 Score = 30.5 bits (69), Expect = 0.77
 Identities = 21/77 (27%), Positives = 30/77 (38%), Gaps = 20/77 (25%)

Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKK-------KK 273
              + +K K  K K   +   K  +Q             ++  EK K   K       KK
Sbjct: 14  RALESEKYKANKDKGNPEIYNKINSQ-------------DKAIEKFKLLIKAQMAERVKK 60

Query: 274 KKTTKKKKRKKKRKKKK 290
             + +KK+ KKK KKKK
Sbjct: 61  LHSQEKKEEKKKPKKKK 77



 Score = 29.4 bits (66), Expect = 2.0
 Identities = 17/78 (21%), Positives = 28/78 (35%), Gaps = 4/78 (5%)

Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
           +K +  K K   +   K        + +         E  KK   ++KK++  K KK+K 
Sbjct: 19  EKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKKKV 78

Query: 285 KRKKKKNQHHRD----VW 298
             +    Q   D    VW
Sbjct: 79  PLQVNPAQLFVDDEYYVW 96


>gnl|CDD|240377 PTZ00352, PTZ00352, 60S ribosomal protein L13; Provisional.
          Length = 212

 Score = 30.5 bits (69), Expect = 0.81
 Identities = 13/72 (18%), Positives = 26/72 (36%)

Query: 219 GFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTK 278
           GF        K++      K+ K  +    +       +R   +EE+     +  ++   
Sbjct: 134 GFAGIPADTSKEEVVALPVKQNKNSEVIPFQRTPKREKARVITKEERAFNAYRTLRQAKL 193

Query: 279 KKKRKKKRKKKK 290
             K   K++KKK
Sbjct: 194 NAKFVGKKEKKK 205


>gnl|CDD|129521 TIGR00427, TIGR00427, membrane protein, MarC family.  MarC is a
           protein that spans the plasma membrane multiple times
           and once was thought to be a multiple antibiotic
           resistance protein. The function for this family is
           unknown [Unknown function, General].
          Length = 201

 Score = 30.4 bits (69), Expect = 0.81
 Identities = 17/79 (21%), Positives = 28/79 (35%), Gaps = 13/79 (16%)

Query: 36  IHIFPFIIKGMLMCFIIITAILGNLLVIISVIKH--RKLRIITNYFVVSLAFADLLVALC 93
           + I  F  K  +  F II  I GN+ + IS+ ++     R          +F  LL+ L 
Sbjct: 1   MDILSFYFKFFISLFAIINPI-GNIPIFISLTEYYTAAERNKIAKKANISSFIILLIFL- 58

Query: 94  VMPFNAIVSLTDEWYFGYF 112
                    +  +     F
Sbjct: 59  ---------VFGDTILKLF 68


>gnl|CDD|240274 PTZ00112, PTZ00112, origin recognition complex 1 protein;
           Provisional.
          Length = 1164

 Score = 31.1 bits (70), Expect = 0.82
 Identities = 17/73 (23%), Positives = 26/73 (35%)

Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
           + K    K+ KK+                  TS  +     K K+K K+K    KK R  
Sbjct: 180 RNKEISNKQLKKQTNVTHTTCYDKMRRSPRNTSTIKNNTNDKNKEKNKEKDKNIKKDRDG 239

Query: 285 KRKKKKNQHHRDV 297
            ++ K+N     V
Sbjct: 240 DKQTKRNSEKSKV 252



 Score = 29.6 bits (66), Expect = 2.9
 Identities = 18/65 (27%), Positives = 29/65 (44%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           K  KK +   K+ K+   K K    H DV I R+   E ++++K      +     K+K 
Sbjct: 231 KNIKKDRDGDKQTKRNSEKSKVQNSHFDVRILRSYTKENKKDEKNVVSGIRSSVLLKRKS 290

Query: 283 KKKRK 287
           +  RK
Sbjct: 291 QCLRK 295


>gnl|CDD|240235 PTZ00032, PTZ00032, 60S ribosomal protein L18; Provisional.
          Length = 211

 Score = 30.6 bits (69), Expect = 0.84
 Identities = 17/70 (24%), Positives = 22/70 (31%)

Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
             K +    K+ K K   K +  H    W        E EE     K   K   K K+R 
Sbjct: 47  TGKNKLLLTKRSKLKGIPKPRKLHKHGFWAEIFEEKVEREELGNPCKDLDKDIIKGKRRP 106

Query: 284 KKRKKKKNQH 293
           +   K  N  
Sbjct: 107 RLTLKNTNNQ 116


>gnl|CDD|237276 PRK13024, PRK13024, bifunctional preprotein translocase subunit
           SecD/SecF; Reviewed.
          Length = 755

 Score = 31.0 bits (71), Expect = 0.84
 Identities = 15/36 (41%), Positives = 23/36 (63%)

Query: 210 SSSISFWIPGFVMKKKKKRKKKKKKKKKKKRKKKKN 245
            SSI    P ++  +K++ KKKKK+KK KK + +K 
Sbjct: 719 YSSIFIAAPLWLDLEKRRLKKKKKRKKVKKWEVEKI 754



 Score = 27.9 bits (63), Expect = 9.2
 Identities = 10/22 (45%), Positives = 16/22 (72%)

Query: 225 KKKRKKKKKKKKKKKRKKKKNQ 246
           +K+R KKKKK+KK K+ + +  
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEKI 754


>gnl|CDD|234941 PRK01315, PRK01315, putative inner membrane protein translocase
           component YidC; Provisional.
          Length = 329

 Score = 30.5 bits (69), Expect = 0.99
 Identities = 11/59 (18%), Positives = 27/59 (45%), Gaps = 5/59 (8%)

Query: 237 KKKRKKKKNQHHRD-----VWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
           +++R  KK + H +     V          E  ++ K++  +++  K++ R K++K   
Sbjct: 269 REERLAKKGKDHGESEGKVVAPEGAVAQTTEVREQTKRQTVQRQQPKRQSRAKRQKGGA 327


>gnl|CDD|216298 pfam01101, HMG14_17, HMG14 and HMG17. 
          Length = 93

 Score = 29.0 bits (65), Expect = 1.00
 Identities = 14/30 (46%), Positives = 16/30 (53%)

Query: 263 EEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
           E K KK   KKKK   K K+  K KK+  Q
Sbjct: 35  EPKPKKAAAKKKKPAVKGKKGAKGKKETKQ 64



 Score = 28.6 bits (64), Expect = 1.2
 Identities = 21/73 (28%), Positives = 33/73 (45%), Gaps = 12/73 (16%)

Query: 231 KKKKKKKKKRKKKKNQHHRDVWINRTSR-------GEEEEEKKKKKKKKKKKTTKKKKRK 283
           K+K + K   K  K +  R     R++R        + E + KK   KKKK   K KK  
Sbjct: 2   KRKGEAKDAAKVTKQEPKR-----RSARLSAKPAPPKPEPKPKKAAAKKKKPAVKGKKGA 56

Query: 284 KKRKKKKNQHHRD 296
           K +K+ K +  ++
Sbjct: 57  KGKKETKQEEAKE 69


>gnl|CDD|219783 pfam08297, U3_snoRNA_assoc, U3 snoRNA associated.  This family of
           proteins is associated with U3 snoRNA. U3 snoRNA is
           required for nucleolar processing of pre-18S ribosomal
           RNA.
          Length = 87

 Score = 28.5 bits (64), Expect = 1.1
 Identities = 14/29 (48%), Positives = 19/29 (65%)

Query: 261 EEEEKKKKKKKKKKKTTKKKKRKKKRKKK 289
            EEE KK+KKK K +  KK +++ K  KK
Sbjct: 24  YEEELKKQKKKNKLRFLKKLEKRPKDVKK 52



 Score = 27.4 bits (61), Expect = 2.6
 Identities = 15/35 (42%), Positives = 18/35 (51%)

Query: 254 NRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKK 288
                  EEE KK+KKK K +   K +KR K  KK
Sbjct: 18  TPPPESYEEELKKQKKKNKLRFLKKLEKRPKDVKK 52



 Score = 27.4 bits (61), Expect = 3.3
 Identities = 14/31 (45%), Positives = 18/31 (58%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
           E  EE+ KK+KKK K    KK  K+ +  KK
Sbjct: 22  ESYEEELKKQKKKNKLRFLKKLEKRPKDVKK 52


>gnl|CDD|114629 pfam05917, DUF874, Helicobacter pylori protein of unknown function
           (DUF874).  This family consists of several hypothetical
           proteins specific to Helicobacter pylori. The function
           of this family is unknown.
          Length = 417

 Score = 30.6 bits (68), Expect = 1.1
 Identities = 16/83 (19%), Positives = 45/83 (54%)

Query: 219 GFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTK 278
           G  +   + + +++K+K +++++K + +  +   I   +  E E+EK+K + +K+    +
Sbjct: 169 GIELANSQIKAEQEKQKTEQEKQKTEQEKQKTSNIANKNAIELEQEKQKTENEKQDLIKE 228

Query: 279 KKKRKKKRKKKKNQHHRDVWQKK 301
           +K   K+ ++   ++H   + KK
Sbjct: 229 QKDFIKEAEQNCQENHNQFFIKK 251


>gnl|CDD|222892 PHA02586, 68, prohead core protein; Provisional.
          Length = 140

 Score = 29.3 bits (66), Expect = 1.1
 Identities = 19/90 (21%), Positives = 42/90 (46%), Gaps = 4/90 (4%)

Query: 202 VNVYYALISSSISFWIPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEE 261
            ++  A++S + +  +   ++K    R +  + K +K R++   Q      +++  R + 
Sbjct: 50  PDLALAMLSINENMSLNEAIVKHVSSRGEVTRTKDRKTRERNAYQ---TTGLSKAKRRQI 106

Query: 262 EEEKKKKKKKKKKKTTK-KKKRKKKRKKKK 290
             +  K K+       K ++KRKK  KK+K
Sbjct: 107 ARKAAKTKRANPSIQRKAQRKRKKALKKRK 136


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 30.1 bits (68), Expect = 1.2
 Identities = 16/69 (23%), Positives = 26/69 (37%), Gaps = 11/69 (15%)

Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
           VM   KK  KK    K K++               T   E+ +EK    +KK K+   + 
Sbjct: 119 VMGINKKAGKKLALSKFKRKVGLF-----------TEEEEDIDEKLSMLEKKLKELEAED 167

Query: 281 KRKKKRKKK 289
             ++  K +
Sbjct: 168 VDEEDEKDE 176


>gnl|CDD|223562 COG0488, Uup, ATPase components of ABC transporters with duplicated
           ATPase domains [General function prediction only].
          Length = 530

 Score = 30.3 bits (69), Expect = 1.2
 Identities = 16/70 (22%), Positives = 31/70 (44%), Gaps = 15/70 (21%)

Query: 222 MKKKKKRKKKKKKKKKKKRKK-KKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
           +++K +R +++    +K++K+  K Q     WI R           K    K KK   + 
Sbjct: 239 LEQKAERLRQEAAAYEKQQKELAKEQE----WIRRG----------KAAASKAKKAKSRI 284

Query: 281 KRKKKRKKKK 290
           KR +K + + 
Sbjct: 285 KRLEKLEARL 294


>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
          Length = 246

 Score = 30.0 bits (68), Expect = 1.2
 Identities = 9/28 (32%), Positives = 15/28 (53%)

Query: 263 EEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
           + K K K K  KK  ++ KR+ K  + +
Sbjct: 100 KPKPKPKPKPVKKVEEQPKREVKPVEPR 127



 Score = 28.9 bits (65), Expect = 2.8
 Identities = 12/31 (38%), Positives = 16/31 (51%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
            + E K K K K K K  KK + + KR+ K 
Sbjct: 93  PKPEPKPKPKPKPKPKPVKKVEEQPKREVKP 123



 Score = 28.1 bits (63), Expect = 4.8
 Identities = 11/28 (39%), Positives = 14/28 (50%)

Query: 217 IPGFVMKKKKKRKKKKKKKKKKKRKKKK 244
           IP    K K K K K K  KK + + K+
Sbjct: 92  IPKPEPKPKPKPKPKPKPVKKVEEQPKR 119



 Score = 28.1 bits (63), Expect = 5.3
 Identities = 9/22 (40%), Positives = 13/22 (59%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKK 244
           K K K K  KK +++ KR+ K 
Sbjct: 102 KPKPKPKPVKKVEEQPKREVKP 123



 Score = 27.3 bits (61), Expect = 9.2
 Identities = 12/31 (38%), Positives = 17/31 (54%)

Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDV 251
           V+ K + + K K K K K  KK + Q  R+V
Sbjct: 91  VIPKPEPKPKPKPKPKPKPVKKVEEQPKREV 121


>gnl|CDD|185575 PTZ00361, PTZ00361, 26 proteosome regulatory subunit 4-like
           protein; Provisional.
          Length = 438

 Score = 30.5 bits (69), Expect = 1.2
 Identities = 14/39 (35%), Positives = 24/39 (61%)

Query: 254 NRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
           N   +G  +++K KKK+KKKK++       K++KK+K  
Sbjct: 3   NAQGQGNNQKDKNKKKEKKKKESPPPPHEIKRKKKRKGP 41


>gnl|CDD|151203 pfam10710, DUF2512, Protein of unknown function (DUF2512).
           Proteins in this family are predicted to be integral
           membrane proteins, and many of them are annotated as
           being YndM protein. They are all found in Firmicutes.
           The true function is not known.
          Length = 136

 Score = 29.2 bits (66), Expect = 1.3
 Identities = 26/123 (21%), Positives = 47/123 (38%), Gaps = 39/123 (31%)

Query: 37  HIFPFIIKGMLMCFIIITAILGNL-------LVIISVIKHRKLRIITNYFVVSLAFADLL 89
           H+   +IK + M   ++  +LG         +++IS+       ++T   +VS    DL 
Sbjct: 3   HVKALLIKFI-MTLAVLYIVLGLFFGVSFGDVLLISL-------VLT---LVSYLLGDLF 51

Query: 90  VALCVMPF--NAIVSLTD-------EWYFGYFMCDVWNSFDVYFS--TASILHLCCISVD 138
               V+P   N   ++ D        W  G  +       D Y     AS++    I+V 
Sbjct: 52  ----VLPRTGNTTATIADFGLAFLVVWLMGRIL------TDNYLPIAWASLISALVIAVG 101

Query: 139 RYY 141
            Y+
Sbjct: 102 EYF 104


>gnl|CDD|220477 pfam09933, DUF2165, Predicted small integral membrane protein
           (DUF2165).  This domain, found in various hypothetical
           prokaryotic proteins, has no known function.
          Length = 160

 Score = 29.6 bits (67), Expect = 1.3
 Identities = 27/119 (22%), Positives = 44/119 (36%), Gaps = 7/119 (5%)

Query: 7   NSLNYTYNINYTSTNFSNTSLFPTNSTIHIHIFPFIIKGMLMCFIIITAILGNLLVIISV 66
           N  +Y  N  +     S  + FP N+ ++  I    +       II+  +L  LL  I  
Sbjct: 25  NITDYDTNFQFVQHVLSMDTTFPDNALMYRAITSPWLHHAAYALIIVLELLTGLLCWIGA 84

Query: 67  I---KHRKLRIITNYFVVSLAFADL-LVALCVMPFNAIVSLTDEWYFGYFMCDVWNSFD 121
               +          F  + A+A L L    ++ F   + +  EW FG +    WN  D
Sbjct: 85  WRLLRALNASAAA--FNRAKAWAILGLTLGFLVWFVGFMVVGGEW-FGMWQSKEWNGQD 140


>gnl|CDD|153271 cd07307, BAR, The Bin/Amphiphysin/Rvs (BAR) domain, a dimerization
           module that binds membranes and detects membrane
           curvature.  BAR domains are dimerization, lipid binding
           and curvature sensing modules found in many different
           proteins with diverse functions including organelle
           biogenesis, membrane trafficking or remodeling, and cell
           division and migration. Mutations in BAR containing
           proteins have been linked to diseases and their
           inactivation in cells leads to altered membrane
           dynamics. A BAR domain with an additional N-terminal
           amphipathic helix (an N-BAR) can drive membrane
           curvature. These N-BAR domains are found in amphiphysins
           and endophilins, among others. BAR domains are also
           frequently found alongside domains that determine lipid
           specificity, such as the Pleckstrin Homology (PH) and
           Phox Homology (PX) domains which are present in beta
           centaurins (ACAPs and ASAPs) and sorting nexins,
           respectively. A FES-CIP4 Homology (FCH) domain together
           with a coiled coil region is called the F-BAR domain and
           is present in Pombe/Cdc15 homology (PCH) family
           proteins, which include Fes/Fes tyrosine kinases, PACSIN
           or syndapin, CIP4-like proteins, and srGAPs, among
           others. The Inverse (I)-BAR or IRSp53/MIM homology
           Domain (IMD) is found in multi-domain proteins, such as
           IRSp53 and MIM, that act as scaffolding proteins and
           transducers of a variety of signaling pathways that link
           membrane dynamics and the underlying actin cytoskeleton.
           BAR domains form dimers that bind to membranes, induce
           membrane bending and curvature, and may also be involved
           in protein-protein interactions. The I-BAR domain
           induces membrane protrusions in the opposite direction
           compared to classical BAR and F-BAR domains, which
           produce membrane invaginations. BAR domains that also
           serve as protein interaction domains include those of
           arfaptin and OPHN1-like proteins, among others, which
           bind to Rac and Rho GAP domains, respectively.
          Length = 194

 Score = 29.7 bits (67), Expect = 1.3
 Identities = 15/62 (24%), Positives = 31/62 (50%), Gaps = 7/62 (11%)

Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKK 285
            K+  K+ KK++KK  K +  +         +R + ++ +KKKK   K    +++ ++ K
Sbjct: 89  LKKDLKEIKKRRKKLDKARLDYDA-------AREKLKKLRKKKKDSSKLAEAEEELQEAK 141

Query: 286 RK 287
            K
Sbjct: 142 EK 143


>gnl|CDD|222791 PHA00431, PHA00431, internal virion protein C.
          Length = 746

 Score = 30.5 bits (69), Expect = 1.3
 Identities = 7/50 (14%), Positives = 27/50 (54%), Gaps = 2/50 (4%)

Query: 229 KKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTK 278
            K  ++ ++K  ++ + +  +D   N+ +      + +++ ++K+K+T+ 
Sbjct: 691 SKMYQENQQKLAEEAREKALKDA--NKRAPIVAATQAREEVREKRKQTSG 738


>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein.  This family
           consists of various plant development proteins which are
           homologues of floricaula (FLO) and Leafy (LFY) proteins
           which are floral meristem identity proteins. Mutations
           in the sequences of these proteins affect flower and
           leaf development.
          Length = 382

 Score = 30.4 bits (69), Expect = 1.3
 Identities = 13/49 (26%), Positives = 27/49 (55%)

Query: 217 IPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEK 265
           +PG     +KK+++KK+++K+ K  ++ +    D   +    GE  EE+
Sbjct: 174 VPGHSSDSEKKKQRKKQRRKRSKELREDDDDDEDEDDDGEGGGEGGEER 222



 Score = 28.8 bits (65), Expect = 4.1
 Identities = 13/51 (25%), Positives = 20/51 (39%)

Query: 201 VVNVYYALISSSISFWIPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDV 251
           V     A  S        G V         +KKK++KK+R+K+  +   D 
Sbjct: 152 VQQEKEAAGSGGGGGGTWGLVGVPGHSSDSEKKKQRKKQRRKRSKELREDD 202


>gnl|CDD|218391 pfam05029, TIMELESS_C, Timeless protein C terminal region.  The
           timeless (tim) gene is essential for circadian function
           in Drosophila. Putative homologues of Drosophila tim
           have been identified in both mice and humans (mTim and
           hTIM, respectively). Mammalian TIM is not the true
           orthologue of Drosophila TIM, but is the likely
           orthologue of a fly gene, timeout (also called tim-2).
           mTim has been shown to be essential for embryonic
           development, but does not have substantiated circadian
           function. Some family members contain a SANT domain in
           this region.
          Length = 507

 Score = 30.1 bits (67), Expect = 1.5
 Identities = 16/57 (28%), Positives = 30/57 (52%), Gaps = 3/57 (5%)

Query: 219 GFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKK 275
           G V +K++ R  KK++KK K ++    +   D +       EEEE+  + K ++ +K
Sbjct: 215 GLVSEKRELRGGKKRRKKLKPKQPNGEESGEDDF---QEDPEEEEQLPESKPEETEK 268


>gnl|CDD|148820 pfam07432, Hc1, Histone H1-like protein Hc1.  This family consists
           of several bacterial histone H1-like Hc1 proteins. In
           Chlamydia, Hc1 is expressed in the late stages of the
           life cycle, concomitant with the reorganisation of
           chlamydial reticulate bodies into elementary bodies.
           This suggests that Hc1 protein plays a role in the
           condensation of chromatin during intracellular
           differentiation.
          Length = 123

 Score = 28.9 bits (64), Expect = 1.5
 Identities = 20/68 (29%), Positives = 25/68 (36%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           +K    KKK  KK  K  K  K            +  ++ +  K K   KK    K KK 
Sbjct: 56  EKSGLMKKKPAKKAPKAAKAPKKAAKAPAKKAAAAPAKKAKAVKAKPATKKTAAKKVKKM 115

Query: 283 KKKRKKKK 290
            KKR   K
Sbjct: 116 SKKRSTAK 123


>gnl|CDD|205480 pfam13300, DUF4078, Domain of unknown function (DUF4078).  This
           family is found from fungi to humans, but its exact
           function is not known.
          Length = 88

 Score = 28.0 bits (63), Expect = 1.5
 Identities = 16/70 (22%), Positives = 38/70 (54%), Gaps = 11/70 (15%)

Query: 234 KKKKKKRKKKKNQHHRDVWINRTSRG-------EEEEEKKKKK---KKKKKKTTKKKKRK 283
           +K+ +        H+      RT RG       ++EEE+K++    +K +++T +++K +
Sbjct: 8   RKRDRSPTPPPEVHYDADAEVRT-RGTGFYAFSKDEEERKEQMEELEKAREETERERKER 66

Query: 284 KKRKKKKNQH 293
           ++RK+K+ + 
Sbjct: 67  EERKEKRKRA 76


>gnl|CDD|219084 pfam06550, DUF1119, Protein of unknown function (DUF1119).  This
           family consists of several hypothetical archaeal
           proteins of unknown function.
          Length = 283

 Score = 29.8 bits (67), Expect = 1.5
 Identities = 34/158 (21%), Positives = 71/158 (44%), Gaps = 31/158 (19%)

Query: 50  FIIITAILGNLLVIISVIKHRKLRI--ITNYFVVSLAFADLLVALCVMPFNA----IVSL 103
           F I+ A++  L V+I + +++K  I  +    VV++ F   L+ L +   NA    I S+
Sbjct: 37  FYIVAALVFTLFVLIYIKRNKKWIIFLVIYSAVVAVIFYVFLILLSIFLVNAGAINIASI 96

Query: 104 T------------DEWYF----GYFM----CDVWN-SFDVYFSTASILHLCCISVDRYYA 142
                         EWY     G+F+      +W  SF V+    +++ L  ++V    +
Sbjct: 97  LLAIGLFYLLYFYPEWYVIDLAGFFLAVGISALWGISFGVW---PAVVLLIALAVYDAIS 153

Query: 143 IVKPLQYPIIMNQNTVLLMLSNVWILPGIISFTPIMLG 180
           + K  ++ I + +  + L L  ++++P  + ++ I   
Sbjct: 154 VYKT-KHMITLARGVMDLDLPILFVIPENLDYSFIEES 190


>gnl|CDD|176075 cd08693, C2_PI3K_class_I_beta_delta, C2 domain present in class I
           beta and delta phosphatidylinositol 3-kinases (PI3Ks).
           PI3Ks (AKA phosphatidylinositol (PtdIns) 3-kinases)
           regulate cell processes such as cell growth,
           differentiation, proliferation, and motility.  PI3Ks
           work on phosphorylation of phosphatidylinositol,
           phosphatidylinositide (4)P (PtdIns (4)P),2 or
           PtdIns(4,5)P2. Specifically they phosphorylate the D3
           hydroxyl group of phosphoinositol lipids on the inositol
           ring. There are 3 classes of PI3Ks based on structure,
           regulation, and specificity. All classes contain a C2
           domain, a PIK domain, and a kinase catalytic domain.
           The members here are class I, beta and delta isoforms of
           PI3Ks and contain both a Ras-binding domain and a
           p85-binding domain.  Class II PI3Ks contain both of
           these as well as a PX domain, and a C-terminal C2 domain
           containing a nuclear localization signal.  C2 domains
           fold into an 8-standed beta-sandwich that can adopt 2
           structural arrangements: Type I and Type II,
           distinguished by a circular permutation involving their
           N- and C-terminal beta strands. Many C2 domains are
           Ca2+-dependent membrane-targeting modules that bind a
           wide variety of substances including bind phospholipids,
           inositol polyphosphates, and intracellular proteins.
           Most C2 domain proteins are either signal transduction
           enzymes that contain a single C2 domain, such as protein
           kinase C, or membrane trafficking proteins which contain
           at least two C2 domains, such as synaptotagmin 1.
           However, there are a few exceptions to this including
           RIM isoforms and some splice variants of piccolo/aczonin
           and intersectin which only have a single C2 domain.  C2
           domains with a calcium binding region have negatively
           charged residues, primarily aspartates, that serve as
           ligands for calcium ions.  Members have a type-I
           topology.
          Length = 173

 Score = 29.2 bits (66), Expect = 1.6
 Identities = 10/21 (47%), Positives = 15/21 (71%)

Query: 221 VMKKKKKRKKKKKKKKKKKRK 241
           V KK K ++ +K + KKKK+K
Sbjct: 85  VSKKAKGKRSRKNQTKKKKKK 105



 Score = 27.7 bits (62), Expect = 4.9
 Identities = 9/21 (42%), Positives = 14/21 (66%)

Query: 267 KKKKKKKKKTTKKKKRKKKRK 287
             KK K K++ K + +KKK+K
Sbjct: 85  VSKKAKGKRSRKNQTKKKKKK 105


>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 926

 Score = 30.3 bits (68), Expect = 1.6
 Identities = 16/65 (24%), Positives = 22/65 (33%), Gaps = 6/65 (9%)

Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKK------KKKKKKKKKTT 277
           K   R K+K K+K    K+ K +             ++ EEK          KKKK    
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKKPYKV 851

Query: 278 KKKKR 282
               R
Sbjct: 852 DNMTR 856



 Score = 29.5 bits (66), Expect = 2.4
 Identities = 14/59 (23%), Positives = 27/59 (45%)

Query: 232 KKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
           K   + K++ K+K  + +++ I   S   E E    K++++K          KK+K  K
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKKPYK 850



 Score = 28.8 bits (64), Expect = 4.4
 Identities = 15/64 (23%), Positives = 25/64 (39%), Gaps = 8/64 (12%)

Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
           K   + K+K K+K    K+       V        E E E+   K++++K          
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIESPSV--------ETEGERCTIKQREEKGIDAPAILNV 843

Query: 285 KRKK 288
           K+KK
Sbjct: 844 KKKK 847


>gnl|CDD|233034 TIGR00584, mug, mismatch-specific thymine-DNA glycosylate (mug).
           All proteins in this family for whcih functions are
           known are G-T or G-U mismatch glycosylases that function
           in base excision repair. This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University). Used 2pf model [DNA metabolism,
           DNA replication, recombination, and repair].
          Length = 328

 Score = 30.0 bits (67), Expect = 1.6
 Identities = 16/78 (20%), Positives = 34/78 (43%), Gaps = 3/78 (3%)

Query: 218 PGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHR--DVWINRTSRGEEEEEKKKKKKKK-KK 274
              V  K   ++  ++  K +KRK + N+ +R  +      S+   +  K K+K++K   
Sbjct: 40  TSGVPAKAPTQEPSEEAPKFRKRKPRSNEPYRPVEPKKPSDSKKSGKSTKSKEKQEKITD 99

Query: 275 KTTKKKKRKKKRKKKKNQ 292
           K   K+K  +     + +
Sbjct: 100 KFKVKRKVDRFNGVSEAE 117


>gnl|CDD|226138 COG3611, DnaB, Replication initiation/membrane attachment protein
           [DNA replication, recombination, and repair].
          Length = 417

 Score = 30.1 bits (68), Expect = 1.7
 Identities = 15/65 (23%), Positives = 30/65 (46%), Gaps = 1/65 (1%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVW-INRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
             K+ RK+K+    KK+    K + +   W      +   +EE+ K +  K ++  + + 
Sbjct: 353 AIKENRKRKELALAKKQSSTTKRKENLPKWSNPEYKKTTSQEEQAKLEFIKLEELKRLEN 412

Query: 282 RKKKR 286
            K+KR
Sbjct: 413 GKQKR 417


>gnl|CDD|220376 pfam09745, DUF2040, Coiled-coil domain-containing protein 55
           (DUF2040).  This entry is a conserved domain of
           approximately 130 residues of proteins conserved from
           fungi to humans. The proteins do contain a coiled-coil
           domain, but the function is unknown.
          Length = 128

 Score = 28.9 bits (65), Expect = 1.7
 Identities = 18/78 (23%), Positives = 39/78 (50%), Gaps = 11/78 (14%)

Query: 222 MKKKKKRKKKKKKKKKKKRKK-----------KKNQHHRDVWINRTSRGEEEEEKKKKKK 270
           +K  K+ KK+ K  ++K+ +K           ++ +  R++   R  + E E+E  +   
Sbjct: 27  IKAAKEEKKQAKLSERKENRKPKYIGSLLEAAERRKREREIAEERKLQKEREKEGDEFAD 86

Query: 271 KKKKKTTKKKKRKKKRKK 288
           K+K  T+  KK+ ++ +K
Sbjct: 87  KEKFVTSAYKKQLEENRK 104


>gnl|CDD|220611 pfam10169, Laps, Learning-associated protein.  This is a family of
           121-amino acid secretory proteins. Laps functions in the
           regulation of neuronal cell adhesion and/or movement and
           synapse attachment. Laps binds to the ApC/EBP (Aplysia
           CCAAT/enhancer binding protein) promoter and activates
           the transcription of ApC/EBP mRNA.
          Length = 124

 Score = 28.7 bits (64), Expect = 1.7
 Identities = 13/28 (46%), Positives = 22/28 (78%)

Query: 218 PGFVMKKKKKRKKKKKKKKKKKRKKKKN 245
           P ++ +++ K+ KKK++KKKKK+K  KN
Sbjct: 95  PAWMNQRQAKKLKKKREKKKKKKKGAKN 122



 Score = 28.3 bits (63), Expect = 2.9
 Identities = 17/60 (28%), Positives = 27/60 (45%), Gaps = 12/60 (20%)

Query: 231 KKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
            K   + KK  + ++  +   W+N           +++ KK KKK  KKKK+KK  K   
Sbjct: 77  NKAAVRNKKTLRDQHGQY-PAWMN-----------QRQAKKLKKKREKKKKKKKGAKNLA 124



 Score = 27.5 bits (61), Expect = 4.6
 Identities = 24/100 (24%), Positives = 40/100 (40%), Gaps = 22/100 (22%)

Query: 207 ALISSSISFWIPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKK 266
           A +  ++       +MK  K     K  ++ K++KKK+           +   +E+ E  
Sbjct: 25  ARLKKALGLIDGNVLMKDVKDIATVKPAEEVKEKKKKEGT--------ESEEDDEKMEVD 76

Query: 267 KKKKKKKKKT--------------TKKKKRKKKRKKKKNQ 292
            K   + KKT               + KK KKKR+KKK +
Sbjct: 77  NKAAVRNKKTLRDQHGQYPAWMNQRQAKKLKKKREKKKKK 116



 Score = 26.7 bits (59), Expect = 7.7
 Identities = 13/20 (65%), Positives = 14/20 (70%)

Query: 223 KKKKKRKKKKKKKKKKKRKK 242
           K KKKR+KKKKKKK  K   
Sbjct: 105 KLKKKREKKKKKKKGAKNLA 124



 Score = 26.7 bits (59), Expect = 8.0
 Identities = 12/22 (54%), Positives = 16/22 (72%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKK 244
           + KK +KK++KKKKKKK  K  
Sbjct: 102 QAKKLKKKREKKKKKKKGAKNL 123



 Score = 26.7 bits (59), Expect = 8.2
 Identities = 11/24 (45%), Positives = 16/24 (66%)

Query: 226 KKRKKKKKKKKKKKRKKKKNQHHR 249
            +R+ KK KKK++K+KKKK     
Sbjct: 99  NQRQAKKLKKKREKKKKKKKGAKN 122


>gnl|CDD|223898 COG0828, RpsU, Ribosomal protein S21 [Translation, ribosomal
           structure and biogenesis].
          Length = 67

 Score = 27.7 bits (62), Expect = 1.7
 Identities = 11/26 (42%), Positives = 19/26 (73%)

Query: 264 EKKKKKKKKKKKTTKKKKRKKKRKKK 289
           EK  +K+K+KK   +K+K K+ RK++
Sbjct: 39  EKPSEKRKRKKAAARKRKFKRLRKEQ 64



 Score = 26.9 bits (60), Expect = 2.7
 Identities = 11/22 (50%), Positives = 16/22 (72%)

Query: 225 KKKRKKKKKKKKKKKRKKKKNQ 246
           K+KRKK   +K+K KR +K+ Q
Sbjct: 44  KRKRKKAAARKRKFKRLRKEQQ 65


>gnl|CDD|236078 PRK07726, PRK07726, DNA topoisomerase III; Provisional.
          Length = 658

 Score = 30.1 bits (69), Expect = 1.7
 Identities = 15/73 (20%), Positives = 20/73 (27%), Gaps = 12/73 (16%)

Query: 210 SSSISFWIPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKK 269
            SS  +        K     K   K K K  K    Q              E  ++K   
Sbjct: 598 QSSEKYKFDNLTGPKCPDCGKPMLKVKGKNGKMLVCQ------------DRECGKRKNVS 645

Query: 270 KKKKKKTTKKKKR 282
           KK   +    KK+
Sbjct: 646 KKTNARCPNCKKK 658


>gnl|CDD|235971 PRK07219, PRK07219, DNA topoisomerase I; Validated.
          Length = 822

 Score = 30.0 bits (68), Expect = 1.7
 Identities = 9/64 (14%), Positives = 17/64 (26%), Gaps = 15/64 (23%)

Query: 227 KRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKR 286
             K+KK+K+ K + +  K                   +  +K K    +T          
Sbjct: 765 TEKQKKEKESKSELEALKG---------------VGAKTAEKLKDAGVETVTDLTAADPD 809

Query: 287 KKKK 290
               
Sbjct: 810 AVAA 813



 Score = 28.0 bits (63), Expect = 7.5
 Identities = 7/63 (11%), Positives = 11/63 (17%), Gaps = 13/63 (20%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           + K + +  K    K   K K               G E                     
Sbjct: 773 ESKSELEALKGVGAKTAEKLK-------------DAGVETVTDLTAADPDAVAAKVDGVS 819

Query: 283 KKK 285
             +
Sbjct: 820 ADR 822


>gnl|CDD|203922 pfam08377, MAP2_projctn, MAP2/Tau projection domain.  This domain is
            found in the MAP2/Tau family of proteins which includes
            MAP2, MAP4, Tau, and their homologs. All isoforms contain
            a conserved C-terminal domain containing tubulin-binding
            repeats (pfam00418), and a N-terminal projection domain
            of varying size. This domain has a net negative charge
            and exerts a long-range repulsive force. This provides a
            mechanism that can regulate microtubule spacing which
            might facilitate efficient organelle transport.
          Length = 1134

 Score = 30.1 bits (67), Expect = 1.7
 Identities = 15/59 (25%), Positives = 31/59 (52%)

Query: 232  KKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
            K+++ +K+ ++   + HR     +T RG     ++K  KK+    ++ + R+KK   KK
Sbjct: 1038 KEERAEKEARRPSLEKHRKEKPFKTGRGRISTPERKVAKKEPSTVSRDEVRRKKAVYKK 1096


>gnl|CDD|227594 COG5269, ZUO1, Ribosome-associated chaperone zuotin [Translation,
           ribosomal structure and biogenesis / Posttranslational
           modification, protein turnover, chaperones].
          Length = 379

 Score = 30.0 bits (67), Expect = 1.8
 Identities = 18/75 (24%), Positives = 26/75 (34%), Gaps = 20/75 (26%)

Query: 223 KKKKKRKK--------KKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKK 274
           K+ KK +K         K     K + + KN            + E E E        KK
Sbjct: 240 KEMKKIRKWEREAGARLKALAALKGKAEAKN------------KAEIEAEALASATAVKK 287

Query: 275 KTTKKKKRKKKRKKK 289
           K  +  K+  K +KK
Sbjct: 288 KAKEVMKKALKMEKK 302



 Score = 29.2 bits (65), Expect = 2.6
 Identities = 13/62 (20%), Positives = 29/62 (46%)

Query: 231 KKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
           K+++K+ KK +K + +    +      +G+ E + K + + +   +    K+K K   KK
Sbjct: 236 KEQEKEMKKIRKWEREAGARLKALAALKGKAEAKNKAEIEAEALASATAVKKKAKEVMKK 295

Query: 291 NQ 292
             
Sbjct: 296 AL 297



 Score = 28.8 bits (64), Expect = 4.0
 Identities = 19/77 (24%), Positives = 36/77 (46%), Gaps = 9/77 (11%)

Query: 228 RKKKKKKKKKKKRKKKKNQHHR------DVWINRTSR---GEEEEEKKKKKKKKKKKTTK 278
           RK+  + K ++KR K KNQ +        +   R  R    +E+E++ KK +K +++   
Sbjct: 195 RKRYSEAKNREKRAKLKNQDNARLKRLVQIAKKRDPRIKSFKEQEKEMKKIRKWEREAGA 254

Query: 279 KKKRKKKRKKKKNQHHR 295
           + K     K K    ++
Sbjct: 255 RLKALAALKGKAEAKNK 271



 Score = 28.5 bits (63), Expect = 4.6
 Identities = 14/69 (20%), Positives = 26/69 (37%), Gaps = 8/69 (11%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
           MKK +K +++   + K     K     ++          E E +        KK  K+  
Sbjct: 242 MKKIRKWEREAGARLKALAALKGKAEAKNKA--------EIEAEALASATAVKKKAKEVM 293

Query: 282 RKKKRKKKK 290
           +K  + +KK
Sbjct: 294 KKALKMEKK 302


>gnl|CDD|219555 pfam07753, DUF1609, Protein of unknown function (DUF1609).  This
           region is found in a number of hypothetical proteins
           thought to be expressed by the eukaryote Encephalitozoon
           cuniculi, an obligate intracellular microsporidial
           parasite. It is approximately 200 residues long.
          Length = 230

 Score = 29.4 bits (66), Expect = 1.8
 Identities = 19/65 (29%), Positives = 31/65 (47%), Gaps = 2/65 (3%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDV--WINRTSRGEEEEEKKKKKKKKKKKTTKK 279
           M     R KKK  KKK K  +   + H+ V  W     + +EE +K  ++K K +   + 
Sbjct: 15  MAVGGARAKKKGGKKKSKGGRHCYKIHKRVLRWRKSPEKIKEELDKGSEEKWKGRSIEEI 74

Query: 280 KKRKK 284
           K++K 
Sbjct: 75  KEQKV 79


>gnl|CDD|221144 pfam11595, DUF3245, Protein of unknown function (DUF3245).  This is
           a family of proteins conserved in fungi. The function is
           not known, and there is no S. cerevisiae member.
          Length = 145

 Score = 28.8 bits (64), Expect = 1.9
 Identities = 14/62 (22%), Positives = 29/62 (46%), Gaps = 4/62 (6%)

Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
           VM +K+K +    K+K     K+  Q  + V        ++++E + +     KK + KK
Sbjct: 88  VMAEKEKAEGGPVKRKAAVVAKEAKQSSKGV----GDDDDDDDEDESRSAAFGKKGSNKK 143

Query: 281 KR 282
           ++
Sbjct: 144 RK 145


>gnl|CDD|222466 pfam13945, NST1, Splicing factor, salt tolerance regulator.  NST1
           is a family of proteins that seem to be involved,
           directly or indirectly, in the salt sensitivity of some
           cellular functions in yeast. These proteins also
           interact with the splicing factor Msl1p.
          Length = 189

 Score = 29.1 bits (65), Expect = 1.9
 Identities = 9/21 (42%), Positives = 11/21 (52%)

Query: 231 KKKKKKKKKRKKKKNQHHRDV 251
           K KKKKKK+ K     H+   
Sbjct: 32  KSKKKKKKRSKATSPSHNASD 52



 Score = 28.7 bits (64), Expect = 3.1
 Identities = 10/25 (40%), Positives = 12/25 (48%)

Query: 254 NRTSRGEEEEEKKKKKKKKKKKTTK 278
           N     +    K KKKKKK+ K T 
Sbjct: 21  NTVIHNDSSSSKSKKKKKKRSKATS 45


>gnl|CDD|236295 PRK08570, rpl19e, 50S ribosomal protein L19e; Reviewed.
          Length = 150

 Score = 29.1 bits (66), Expect = 1.9
 Identities = 13/49 (26%), Positives = 22/49 (44%), Gaps = 5/49 (10%)

Query: 253 INRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQKK 301
               SRG   E  +K+KK +++    +K +K  R  KK     + W  +
Sbjct: 55  KKGISRGRARERHEKRKKGRRRGPGSRKGKKGARTPKK-----ERWINR 98


>gnl|CDD|130559 TIGR01495, ETRAMP, Plasmodium ring stage membrane protein ETRAMP.
           This model describes a family of proteins from the
           malaria parasite Plasmodium falciparum, several of which
           have been shown to be expressed specifically in the ring
           stage as well as the rident parasite Plasmodium yoelii.
           A homolog from Plasmodium chabaudi was localized to the
           parasitophorous vacuole membrane. Members have an
           initial hydrophobic, Phe/Tyr-rich stretch long enough to
           span the membrane, a highly charged region rich in Lys,
           a second putative transmembrane region, and a second
           highly charged, low complexity sequence region. Some
           members have up to 100 residues of additional C-terminal
           sequence. These genes have been shown to be found in the
           sub-telomeric regions of both P. falciparum and P.
           yoelii chromosomes.
          Length = 85

 Score = 27.7 bits (62), Expect = 1.9
 Identities = 17/43 (39%), Positives = 23/43 (53%)

Query: 204 VYYALISSSISFWIPGFVMKKKKKRKKKKKKKKKKKRKKKKNQ 246
           +Y+     +I+F  PG+    KK  KK   KK  K  KKKKN+
Sbjct: 7   LYFFAALLAINFIAPGYCNNGKKVNKKGNLKKLTKAEKKKKNK 49


>gnl|CDD|129694 TIGR00606, rad50, rad50.  All proteins in this family for which
           functions are known are involvedin recombination,
           recombinational repair, and/or non-homologous end
           joining.They are components of an exonuclease complex
           with MRE11 homologs. This family is distantly related to
           the SbcC family of bacterial proteins.This family is
           based on the phylogenomic analysis of JA Eisen (1999,
           Ph.D. Thesis, Stanford University).
          Length = 1311

 Score = 30.0 bits (67), Expect = 2.0
 Identities = 14/65 (21%), Positives = 29/65 (44%), Gaps = 3/65 (4%)

Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
              + +K ++ + + K  K+ K+K    RD     TS+  + E  ++  K  + +    K
Sbjct: 195 RQTQGQKVQEHQMELKYLKQYKEKACEIRD---QITSKEAQLESSREIVKSYENELDPLK 251

Query: 281 KRKKK 285
            R K+
Sbjct: 252 NRLKE 256


>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
           (TAF4) is one of several TAFs that bind TBP and is
           involved in forming Transcription Factor IID (TFIID)
           complex.  The TATA Binding Protein (TBP) Associated
           Factor 4 (TAF4) is one of several TAFs that bind TBP and
           are involved in forming the Transcription Factor IID
           (TFIID) complex. TFIID is one of seven General
           Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
           TFIIF, and TFIID) that are involved in accurate
           initiation of transcription by RNA polymerase II in
           eukaryote. TFIID plays an important role in the
           recognition of promoter DNA and assembly of the
           pre-initiation complex. TFIID complex is composed of the
           TBP and at least 13 TAFs. TAFs from various species were
           originally named by their predicted molecular weight or
           their electrophoretic mobility in polyacrylamide gels. A
           new, unified nomenclature for the pol II TAFs has been
           suggested to show the relationship between TAF orthologs
           and paralogs. Several hypotheses are proposed for TAFs
           functions such as serving as activator-binding sites,
           core-promoter recognition or a role in essential
           catalytic activity. Each TAF, with the help of a
           specific activator, is required only for the expression
           of subset of genes and is not universally involved for
           transcription as are GTFs. In yeast and human cells,
           TAFs have been found as components of other complexes
           besides TFIID.   Several TAFs interact via histone-fold
           (HFD) motifs; HFD is the interaction motif involved in
           heterodimerization of the core histones and their
           assembly into nucleosome octamers. The minimal HFD
           contains three alpha-helices linked by two loops and is
           found in core histones, TAFS and many other
           transcription factors. TFIID has a histone octamer-like
           substructure. TAF4 domain interacts with TAF12 and makes
           a novel histone-like heterodimer that binds DNA and has
           a core promoter function of a subset of genes.
          Length = 212

 Score = 29.2 bits (66), Expect = 2.0
 Identities = 15/82 (18%), Positives = 42/82 (51%), Gaps = 16/82 (19%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
           ++K+ +  ++ ++++++KR +++ +        R ++   E+ + K+K K+ +K   ++ 
Sbjct: 112 VRKQLRFLEQLEREEEEKRDEEERERLL-----RAAKSRSEQSRLKQKAKEMQKEEDEEM 166

Query: 282 RKK-----------KRKKKKNQ 292
           R +            RKKKK +
Sbjct: 167 RHRAANATALAAIGGRKKKKRR 188


>gnl|CDD|236698 PRK10475, PRK10475, 23S rRNA pseudouridine synthase F; Provisional.
          Length = 290

 Score = 29.3 bits (66), Expect = 2.1
 Identities = 14/58 (24%), Positives = 21/58 (36%), Gaps = 11/58 (18%)

Query: 229 KKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKR 286
           K K K K K    K+              + E+  EK  +     K+ T   ++KK R
Sbjct: 244 KPKAKAKPKTAGIKRP-----------VVKMEKTAEKGGRPASNGKRFTSPGRKKKGR 290


>gnl|CDD|211393 cd11381, NSA2, pre-ribosomal protein NSA2 (Nop seven-associated 2).
            NSA2 appears to be a protein required for the
           maturation of 27S pre-rRNA in yeast; it has been
           characterized in mammalian cells as a nucleolar protein
           that might play a role in the regulation of the cell
           cycle and in cell proliferation.
          Length = 257

 Score = 29.5 bits (67), Expect = 2.1
 Identities = 18/73 (24%), Positives = 38/73 (52%), Gaps = 3/73 (4%)

Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKK---KKTTKKKKR 282
           +KR  ++   +++KRKK+  + H+     +  RG + +   KK+ K+K   KKT K  + 
Sbjct: 8   RKRHGRRLDHEERKRKKEAREPHKRSKKAQKLRGLKAKLYNKKRYKEKIQMKKTIKMHEE 67

Query: 283 KKKRKKKKNQHHR 295
           +  ++K  ++   
Sbjct: 68  RNVKQKVDDKVPE 80


>gnl|CDD|234229 TIGR03490, Mycoplas_LppA, mycoides cluster lipoprotein, LppA/P72
           family.  Members of this protein family occur in
           Mycoplasma mycoides, Mycoplasma hyopneumoniae, and
           related Mycoplasmas in small paralogous families that
           may also include truncated forms and/or pseudogenes.
           Members are predicted lipoproteins with a conserved
           signal peptidase II processing and lipid attachment
           site. Note that the name for certain characterized
           members, p72, reflects an anomalous apparent molecular
           weight, given a theoretical MW of about 61 kDa.
          Length = 541

 Score = 29.8 bits (67), Expect = 2.1
 Identities = 15/69 (21%), Positives = 31/69 (44%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           + +KK + K  +   K  KK  N+   +   N+++   ++EE       +KK    K K 
Sbjct: 33  QPEKKPEIKPNENTPKIPKKPDNKEPSENNNNKSNNENKDEENPSSTNPEKKPDPSKNKE 92

Query: 283 KKKRKKKKN 291
           + ++ K + 
Sbjct: 93  EIEKPKDEP 101


>gnl|CDD|220207 pfam09368, Sas10_Utp3_C, Sas10 C-terminal domain.  This family
           contains a C-terminal presumed domain in Sas10 which
           hash been identified as a regulator of chromatin
           silencing.
          Length = 76

 Score = 27.6 bits (62), Expect = 2.2
 Identities = 17/59 (28%), Positives = 24/59 (40%), Gaps = 2/59 (3%)

Query: 219 GFVMKKKKKRK--KKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKK 275
           G   K+KK R+  + KK+KK +K  K+     R V       G E    K    +  K 
Sbjct: 18  GLTRKRKKDRRNPRVKKRKKYEKAVKRLKSQRRVVREEDGGYGGETTGIKTNVVRSVKL 76



 Score = 26.1 bits (58), Expect = 7.5
 Identities = 10/29 (34%), Positives = 19/29 (65%)

Query: 265 KKKKKKKKKKKTTKKKKRKKKRKKKKNQH 293
           +K+KK ++  +  K+KK +K  K+ K+Q 
Sbjct: 21  RKRKKDRRNPRVKKRKKYEKAVKRLKSQR 49


>gnl|CDD|225924 COG3389, COG3389, Uncharacterized protein conserved in archaea
           [Function unknown].
          Length = 277

 Score = 29.4 bits (66), Expect = 2.2
 Identities = 38/201 (18%), Positives = 80/201 (39%), Gaps = 36/201 (17%)

Query: 45  GMLMCFIIITAILGNLLVIISVIKHRKLRIITNYFVVSLAFADLLVALCVMPFNA----I 100
             +  F+ I      +L+ I + +    R I +  V S+ F   L+ L +         I
Sbjct: 34  NSVYYFVYILVATLFILLAIKLGRKWIFRGIYSVAVASVLFYVFLILLSIFLVLVYAINI 93

Query: 101 VSLT------------DEWYF----GYFM----CDVWN-SFDVYFSTASILHLCCISVDR 139
            S+              EWY     G+F+      ++  SF V     +++ L  ++V  
Sbjct: 94  ASIGLAIGLVYLLYKYPEWYVIDLAGFFLAVGIAAIFGISFGVL---PAVVLLIALAVYD 150

Query: 140 YYAIVKPLQYPIIMNQNTVLLMLSNVWILPGIISFTPIMLGWYTTPDHKQYRKH-----H 194
             ++ K  ++ I + +  + L L  ++++P  ++++ +   +    D   Y         
Sbjct: 151 AISVYKT-RHMISLAEGVMDLDLPMLFVIPENLAYSFVEDAFENRGDGNAYMIGLGDAVM 209

Query: 195 PNVCIFVVNVYYALISSSISF 215
           P+  I VV+  + LIS  ++F
Sbjct: 210 PS--ILVVSAAFFLISPILAF 228


>gnl|CDD|218738 pfam05766, NinG, Bacteriophage Lambda NinG protein.  NinG or Rap is
           involved in recombination. Rap (recombination adept with
           plasmid) increases lambda-by-plasmid recombination
           catalyzed by Escherichia coli's RecBCD pathway.
          Length = 188

 Score = 28.8 bits (65), Expect = 2.2
 Identities = 11/36 (30%), Positives = 23/36 (63%), Gaps = 1/36 (2%)

Query: 265 KKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQK 300
           ++K ++KK+K   + ++R+ K +K+K +   D W K
Sbjct: 36  REKAQEKKRKAEAQAERRELKARKEKLKTRSD-WLK 70



 Score = 27.7 bits (62), Expect = 5.6
 Identities = 7/26 (26%), Positives = 18/26 (69%)

Query: 219 GFVMKKKKKRKKKKKKKKKKKRKKKK 244
              +K++K ++KK+K + + +R++ K
Sbjct: 31  ALALKREKAQEKKRKAEAQAERRELK 56


>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein.  This family consists of
           AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental
           retardation syndrome) nuclear proteins. These proteins
           have been linked to human diseases such as acute
           lymphoblastic leukaemia and mental retardation. The
           family also contains a Drosophila AF4 protein homologue
           Lilliputian which contains an AT-hook domain.
           Lilliputian represents a novel pair-rule gene that acts
           in cytoskeleton regulation, segmentation and
           morphogenesis in Drosophila.
          Length = 1154

 Score = 29.9 bits (67), Expect = 2.3
 Identities = 14/70 (20%), Positives = 29/70 (41%), Gaps = 3/70 (4%)

Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKK---KKKKKKTTKKK 280
           KK+  K   +K   K ++K KN    D   ++  R EE+                + K+ 
Sbjct: 733 KKQTSKTASEKSSSKGKRKHKNDEEADKIESKKQRLEEKSSSCSPSSSSSHHHSSSNKES 792

Query: 281 KRKKKRKKKK 290
           ++  + K+++
Sbjct: 793 RKSSRNKEEE 802


>gnl|CDD|233605 TIGR01865, cas_Csn1, CRISPR-associated protein Cas9/Csn1, subtype
           II/NMEMI.  CRISPR loci appear to be mobile elements with
           a wide host range. This model represents a protein found
           only in CRISPR-containing species, near other
           CRISPR-associated proteins (cas), as part of the NMENI
           subtype of CRISPR/Cas locus. The species range so far
           for this protein is animal pathogens and commensals only
           [Mobile and extrachromosomal element functions, Other].
          Length = 805

 Score = 29.7 bits (67), Expect = 2.3
 Identities = 10/51 (19%), Positives = 18/51 (35%), Gaps = 4/51 (7%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKK 273
                ++  K++ KK + K K+        + +    EE  E   K   K 
Sbjct: 528 GTNFGKRNSKERYKKNEDKIKE----FASALGKEILKEEPTENSSKNILKL 574


>gnl|CDD|240253 PTZ00068, PTZ00068, 60S ribosomal protein L13a; Provisional.
          Length = 202

 Score = 28.9 bits (65), Expect = 2.3
 Identities = 14/43 (32%), Positives = 21/43 (48%), Gaps = 3/43 (6%)

Query: 261 EEEEKKKKKKKKKKKTTKKKKRKKKRKK---KKNQHHRDVWQK 300
           EE+ K++     KKK   +K  K+ RKK   K  +    V +K
Sbjct: 156 EEKRKERAAAYYKKKVKLRKAWKEARKKALAKLPKAIVAVLKK 198


>gnl|CDD|240341 PTZ00272, PTZ00272, heat shock protein 83 kDa (Hsp83); Provisional.
          Length = 701

 Score = 29.6 bits (66), Expect = 2.3
 Identities = 20/77 (25%), Positives = 40/77 (51%), Gaps = 3/77 (3%)

Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKK 285
           +K  +K+   + ++  KK ++   +  +     G+E    KKKK KK K+ TK+ + + K
Sbjct: 208 EKTTEKEVTDEDEEDTKKADEDGEEPKVEEVKEGDEG---KKKKTKKVKEVTKEYEVQNK 264

Query: 286 RKKKKNQHHRDVWQKKY 302
            K    +  +DV +++Y
Sbjct: 265 HKPLWTRDPKDVTKEEY 281


>gnl|CDD|204935 pfam12474, PKK, Polo kinase kinase.  This domain family is found in
           eukaryotes, and is approximately 140 amino acids in
           length. The family is found in association with
           pfam00069. Polo-like kinase 1 (Plx1) is essential during
           mitosis for the activation of Cdc25C, for spindle
           assembly, and for cyclin B degradation. This family is
           Polo kinase kinase (PKK) which phosphorylates Polo
           kinase and Polo-like kinase to activate them. PKK is a
           serine/threonine kinase.
          Length = 142

 Score = 28.4 bits (64), Expect = 2.4
 Identities = 13/68 (19%), Positives = 35/68 (51%), Gaps = 5/68 (7%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
           ++ ++  K+ + ++K + K  K               ++E EK  + ++++KK  K +K 
Sbjct: 48  QELRRLPKRIRAEQKTRLKMFKESLK-----IEKKELKQEVEKLPRFQEQEKKRMKAEKE 102

Query: 283 KKKRKKKK 290
           ++++K +K
Sbjct: 103 EQEQKHQK 110


>gnl|CDD|185507 PTZ00191, PTZ00191, 60S ribosomal protein L23a; Provisional.
          Length = 145

 Score = 28.5 bits (64), Expect = 2.4
 Identities = 13/31 (41%), Positives = 16/31 (51%)

Query: 265 KKKKKKKKKKKTTKKKKRKKKRKKKKNQHHR 295
           K KKK KK  K  KK  + KKRK + +    
Sbjct: 6   KAKKKAKKAAKAAKKGVKVKKRKVRTSVRFF 36



 Score = 28.1 bits (63), Expect = 3.2
 Identities = 11/31 (35%), Positives = 15/31 (48%)

Query: 265 KKKKKKKKKKKTTKKKKRKKKRKKKKNQHHR 295
            KKK KK  K   K  K KK++ +   +  R
Sbjct: 7   AKKKAKKAAKAAKKGVKVKKRKVRTSVRFFR 37


>gnl|CDD|148491 pfam06898, YqfD, Putative stage IV sporulation protein YqfD.  This
           family consists of several putative bacterial stage IV
           sporulation (SpoIV) proteins. YqfD of Bacillus subtilis
           is known to be essential for efficient sporulation
           although its exact function is unknown.
          Length = 383

 Score = 29.3 bits (66), Expect = 2.5
 Identities = 16/72 (22%), Positives = 34/72 (47%)

Query: 217 IPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKT 276
           I G  +  K ++ + +K  K++  K    +    + I R +  E +++  K  K++  + 
Sbjct: 277 IGGKKLPLKFRKNEFEKYDKEEDEKPLFLKWKLPISIVRETYYEVQDKVVKLTKEEAVEK 336

Query: 277 TKKKKRKKKRKK 288
            KK  +KK  K+
Sbjct: 337 GKKLAKKKLEKE 348


>gnl|CDD|217010 pfam02387, IncFII_repA, IncFII RepA protein family.  This protein
           is plasmid encoded and found to be essential for plasmid
           replication.
          Length = 279

 Score = 29.2 bits (66), Expect = 2.6
 Identities = 18/72 (25%), Positives = 28/72 (38%), Gaps = 7/72 (9%)

Query: 228 RKKKKKKKKKKKRKKKKNQ-----HHRDVWINRTSRGEEEEEKKKKKKKKKKKTTK--KK 280
           RK++ + + KK +KK           R        R      ++K   KK+++  K  K 
Sbjct: 172 RKQQLEWENKKLKKKGLIPLTLDEARRRAKEFHIERAFSYRTERKAFGKKRRRARKLAKL 231

Query: 281 KRKKKRKKKKNQ 292
             K  RKK  N 
Sbjct: 232 DEKDIRKKILNA 243


>gnl|CDD|217315 pfam03006, HlyIII, Haemolysin-III related.  Members of this family
           are integral membrane proteins. This family includes a
           protein with hemolytic activity from Bacillus cereus. It
           has been proposed that YOL002c encodes a Saccharomyces
           cerevisiae protein that plays a key role in metabolic
           pathways that regulate lipid and phosphate metabolism.
           In eukaryotes, members are seven-transmembrane pass
           molecules found to encode functional receptors with a
           broad range of apparent ligand specificities, including
           progestin and adipoQ receptors, and hence have been
           named PAQR proteins. The mammalian members include
           progesterone binding proteins. Unlike the case with GPCR
           receptor proteins, the evolutionary ancestry of the
           members of this family can be traced back to the
           Archaea.
          Length = 207

 Score = 28.8 bits (65), Expect = 2.8
 Identities = 22/95 (23%), Positives = 40/95 (42%), Gaps = 7/95 (7%)

Query: 47  LMCFIIITAILGNLLVIISVIKHRKLRI---ITNYFVVSLAFADLLVALCVMPFNAIVSL 103
           L+ FI   A+LG LL +  + + R LR    +   ++  +    L++AL       +V L
Sbjct: 107 LLIFIWGLALLGILLKLFWLKRFRWLRTVLYLLMGWLGIIPIKHLILALGGGGLVLLV-L 165

Query: 104 TDEWYF---GYFMCDVWNSFDVYFSTASILHLCCI 135
               Y     ++       FD++  +  I HL  +
Sbjct: 166 GGVLYTLGAIFYALRFPGPFDIWGHSHQIFHLFVV 200


>gnl|CDD|238706 cd01418, Ribosomal_L19e_A, Ribosomal protein L19e, archaeal.  L19e
           is found in the large ribosomal subunit of eukaryotes
           and archaea. L19e is distinct from the ribosomal subunit
           L19, which is found in prokaryotes. It consists of two
           small globular domains connected by an extended segment.
           It is located toward the surface of the large subunit,
           with one exposed end involved in forming the
           intersubunit bridge with the small subunit.  The other
           exposed end is involved in forming the translocon
           binding site, along with L22, L23, L24, L29, and L31e
           subunits.
          Length = 145

 Score = 28.4 bits (64), Expect = 2.8
 Identities = 22/90 (24%), Positives = 37/90 (41%), Gaps = 14/90 (15%)

Query: 219 GFVMKKKKK-----RKKKKKKKKKKKRKKK---------KNQHHRDVWINRTSRGEEEEE 264
           G +  K KK     R K++ +K+KK R++               ++ WI          +
Sbjct: 45  GVIKAKPKKGISRGRLKERHEKRKKGRRRGPGSRKGKKGARTPKKERWIKTIRALRRYLK 104

Query: 265 KKKKKKKKKKKTTKKKKRKKKRKKKKNQHH 294
           + + K K  KKT +K  RK K    ++  H
Sbjct: 105 ELRDKGKIDKKTYRKLYRKAKGGSFRSLSH 134


>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
          Length = 746

 Score = 29.3 bits (66), Expect = 2.8
 Identities = 15/68 (22%), Positives = 29/68 (42%)

Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
           ++K K+    K++K          N+++++         E E+      +  K+K  K K
Sbjct: 210 LIKSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEKLEKIK 269

Query: 281 KRKKKRKK 288
             K+K KK
Sbjct: 270 DLKEKAKK 277


>gnl|CDD|235202 PRK04023, PRK04023, DNA polymerase II large subunit; Validated.
          Length = 1121

 Score = 29.5 bits (67), Expect = 2.9
 Identities = 17/44 (38%), Positives = 20/44 (45%), Gaps = 6/44 (13%)

Query: 236 KKKKRKKKKNQHHRDVW------INRTSRGEEEEEKKKKKKKKK 273
           K  K  K  ++   D W      I+     EEEEEKKK K K K
Sbjct: 252 KAPKILKYVDKLKIDGWEWLEELISGKKEKEEEEEKKKIKPKDK 295


>gnl|CDD|200340 TIGR03927, T7SS_EssA_Firm, type VII secretion protein EssA.
           Members of this family are associated with type VII
           secretion of WXG100 family targets in the Firmicutes,
           but not in the Actinobacteria. This highly divergent
           protein family consists largely of a central region of
           highly polar low-complexity sequence containing
           occasional LF motifs in weak repeats about 17 residues
           in length, flanked by hydrophobic N- and C-terminal
           regions [Protein fate, Protein and peptide secretion and
           trafficking].
          Length = 150

 Score = 28.5 bits (64), Expect = 2.9
 Identities = 10/55 (18%), Positives = 25/55 (45%), Gaps = 9/55 (16%)

Query: 238 KKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
           KK  +    + ++         E E +K+    +++KK T +K ++K  +++   
Sbjct: 33  KKDIEINTDYLQE---------ETELDKELFTPEEQKKITFQKHKEKPEQEELKN 78


>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
           [Function unknown].
          Length = 294

 Score = 28.9 bits (65), Expect = 2.9
 Identities = 12/77 (15%), Positives = 33/77 (42%)

Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
           +   +  K  ++  + +K+ ++ H +         E  EE +  + + ++   K K  + 
Sbjct: 190 EYHEEMIKLFEEADELRKEADELHEEFVELSKKIDELHEEFRNLQNELRELEKKIKALRA 249

Query: 285 KRKKKKNQHHRDVWQKK 301
           K K  K +  R+  +++
Sbjct: 250 KEKAAKRREKREELKER 266


>gnl|CDD|203489 pfam06644, ATP11, ATP11 protein.  This family consists of several
           eukaryotic ATP11 proteins. In Saccharomyces cerevisiae,
           expression of functional F1-ATPase requires two proteins
           encoded by the ATP11 and ATP12 genes. Atp11p is a
           molecular chaperone of the mitochondrial matrix that
           participates in the biogenesis pathway to form F1, the
           catalytic unit of the ATP synthase.
          Length = 250

 Score = 28.9 bits (65), Expect = 2.9
 Identities = 14/60 (23%), Positives = 21/60 (35%), Gaps = 10/60 (16%)

Query: 234 KKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQH 293
           K + K  +K K      +          E  KK  K K +KK    KK      K+ ++ 
Sbjct: 3   KYRSKLLQKAKESGLEFI----------ERLKKALKDKIEKKEFSAKKPPTGPSKQASKF 52


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 29.4 bits (67), Expect = 3.0
 Identities = 14/71 (19%), Positives = 36/71 (50%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
            K  +  +K+ ++  K+ KK+ ++  +++   +       +  +  + +K+     +KK 
Sbjct: 565 DKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHELIEARKRLNKANEKKE 624

Query: 283 KKKRKKKKNQH 293
           KKK+K+K+ Q 
Sbjct: 625 KKKKKQKEKQE 635



 Score = 28.3 bits (64), Expect = 5.7
 Identities = 26/87 (29%), Positives = 42/87 (48%), Gaps = 14/87 (16%)

Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWI-NRTSRGE-----EEEE--------KKKKKK 270
            K  +KK+KKKKK+K K+++ +   +V   +   +GE     +++E        K K   
Sbjct: 617 NKANEKKEKKKKKQKEKQEELKVGDEVKYLSLGQKGEVLSIPDDKEAIVQAGIMKMKVPL 676

Query: 271 KKKKKTTKKKKRKKKRKKKKNQHHRDV 297
              +K  K KK+KKK+ K      R V
Sbjct: 677 SDLEKIQKPKKKKKKKPKTVKPKPRTV 703



 Score = 27.9 bits (63), Expect = 8.0
 Identities = 18/54 (33%), Positives = 27/54 (50%), Gaps = 4/54 (7%)

Query: 213 ISFWIPGFVMKKKKKRKKKKKKKKKKKRKKKKN-QHHRDVWINRTSRGEEEEEK 265
           +   +P   ++K +K KKKKKKK K  + K +      D+   R  R EE  E+
Sbjct: 670 MKMKVPLSDLEKIQKPKKKKKKKPKTVKPKPRTVSLELDL---RGMRYEEALER 720


>gnl|CDD|201540 pfam00992, Troponin, Troponin.  Troponin (Tn) contains three
           subunits, Ca2+ binding (TnC), inhibitory (TnI), and
           tropomyosin binding (TnT). this Pfam contains members of
           the TnT subunit. Troponin is a complex of three
           proteins, Ca2+ binding (TnC), inhibitory (TnI), and
           tropomyosin binding (TnT). The troponin complex
           regulates Ca++ induced muscle contraction. This family
           includes troponin T and troponin I. Troponin I binds to
           actin and troponin T binds to tropomyosin.
          Length = 131

 Score = 27.9 bits (63), Expect = 3.0
 Identities = 15/40 (37%), Positives = 17/40 (42%), Gaps = 6/40 (15%)

Query: 261 EEEEKKKK---KKKKKKKTTKKKKRKKK---RKKKKNQHH 294
           E E+ KKK    + K KK T KK RK      K      H
Sbjct: 78  EIEDLKKKVNDLRGKFKKPTLKKVRKSADAMLKALLGSKH 117


>gnl|CDD|203462 pfam06476, DUF1090, Protein of unknown function (DUF1090).  This
           family consists of several bacterial proteins of unknown
           function and is known as YqjC in E. coli.
          Length = 115

 Score = 28.0 bits (63), Expect = 3.0
 Identities = 10/68 (14%), Positives = 30/68 (44%), Gaps = 20/68 (29%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
           ++ +++ K ++K+++  +R                    E+E K+ + K    K  KK++
Sbjct: 64  LRAERQAKVEEKEQEVAER--------------------EQELKEAQAKGDADKIEKKQR 103

Query: 282 RKKKRKKK 289
           +  + + +
Sbjct: 104 KLAEAQAE 111


>gnl|CDD|227925 COG5638, COG5638, Uncharacterized conserved protein [Function
           unknown].
          Length = 622

 Score = 29.4 bits (65), Expect = 3.0
 Identities = 17/68 (25%), Positives = 31/68 (45%), Gaps = 12/68 (17%)

Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
           K  RK +++++ +K+R+ KK +   DV            + K KK+   KK  K K   +
Sbjct: 436 KLDRKLRERQENRKERQLKKTKDDSDV------------DLKDKKESINKKNKKGKHAIE 483

Query: 285 KRKKKKNQ 292
           +    K +
Sbjct: 484 RTAASKEE 491



 Score = 28.6 bits (63), Expect = 5.6
 Identities = 18/66 (27%), Positives = 29/66 (43%), Gaps = 8/66 (12%)

Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
           KK    KK   +K+KR K          I +T  G+ E + KK+K +K  +  +  +  K
Sbjct: 564 KKTGGMKKIMDEKRKRLKNN--------IEQTQDGKPELKIKKRKAEKGDQRQELDRIVK 615

Query: 285 KRKKKK 290
             K+  
Sbjct: 616 SIKRSG 621


>gnl|CDD|227935 COG5648, NHP6B, Chromatin-associated proteins containing the HMG
           domain [Chromatin structure and dynamics].
          Length = 211

 Score = 28.7 bits (64), Expect = 3.0
 Identities = 18/71 (25%), Positives = 27/71 (38%), Gaps = 1/71 (1%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHR-DVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
           ++K++  KK   K       +     R  V      +   EE K   K   +   +KKKK
Sbjct: 133 REKEEYNKKLPNKAPIGPFIENEPKIRPKVEGPSPDKALVEETKIISKAWSELDESKKKK 192

Query: 282 RKKKRKKKKNQ 292
              K KK K +
Sbjct: 193 YIDKYKKLKEE 203


>gnl|CDD|219953 pfam08648, DUF1777, Protein of unknown function (DUF1777).  This is
           a family of eukaryotic proteins of unknown function.
           Some of the proteins in this family are putative nucleic
           acid binding proteins.
          Length = 158

 Score = 28.3 bits (63), Expect = 3.1
 Identities = 13/73 (17%), Positives = 41/73 (56%), Gaps = 6/73 (8%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWI-----NRTSRGEEEEEKKKKKKKKKKKTT 277
           +++++R++ + +++ ++R+ +    HR         +R SR      ++ +K+++ K   
Sbjct: 23  RRERRRERSRSRERDRRRRSRSRSPHRSRRSRSPRRHR-SRSRSPSRRRDRKRERDKDAR 81

Query: 278 KKKKRKKKRKKKK 290
           + KKR++++  K+
Sbjct: 82  EPKKRERQKLIKE 94


>gnl|CDD|219761 pfam08243, SPT2, SPT2 chromatin protein.  This family includes the
           Saccharomyces cerevisiae protein SPT2 which is a
           chromatin protein involved in transcriptional
           regulation.
          Length = 116

 Score = 27.9 bits (62), Expect = 3.1
 Identities = 13/19 (68%), Positives = 14/19 (73%)

Query: 255 RTSRGEEEEEKKKKKKKKK 273
           R    EEEEEK+KKKKK K
Sbjct: 98  RELAREEEEEKRKKKKKNK 116



 Score = 27.5 bits (61), Expect = 3.8
 Identities = 15/62 (24%), Positives = 29/62 (46%), Gaps = 8/62 (12%)

Query: 236 KKKKRKKKKNQHHRDVWINRTSRGEE--EEEKKKKK------KKKKKKTTKKKKRKKKRK 287
           K +KR         D   N  +   E  +EE++  +      +++  +  +++KRKKK+K
Sbjct: 55  KGRKRSYYDRYDEDDALDNMEATFMEIQKEERRSARMARLEDERELAREEEEEKRKKKKK 114

Query: 288 KK 289
            K
Sbjct: 115 NK 116


>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
          Length = 186

 Score = 28.6 bits (64), Expect = 3.1
 Identities = 13/49 (26%), Positives = 21/49 (42%), Gaps = 4/49 (8%)

Query: 247 HHRDVWINRTSRGEEEEEKK----KKKKKKKKKTTKKKKRKKKRKKKKN 291
            H    + + +  E + E K    K++K+ K    K K  K K+  KK 
Sbjct: 83  KHLLGGVAKGAFTEAQAEAKFEAWKEEKEGKVNAKKDKLSKAKKAAKKA 131



 Score = 27.8 bits (62), Expect = 4.8
 Identities = 16/58 (27%), Positives = 20/58 (34%), Gaps = 15/58 (25%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
           K+ K   KK K  K KK  KK                  E EKK  + + +    KK 
Sbjct: 110 KEGKVNAKKDKLSKAKKAAKKAAL---------------EAEKKVNEARAEAVAEKKA 152


>gnl|CDD|223587 COG0513, SrmB, Superfamily II DNA and RNA helicases [DNA
           replication, recombination, and repair / Transcription /
           Translation, ribosomal structure and biogenesis].
          Length = 513

 Score = 29.0 bits (65), Expect = 3.2
 Identities = 6/88 (6%), Positives = 17/88 (19%), Gaps = 8/88 (9%)

Query: 211 SSISFWIPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKK---- 266
             +    P      K  R   +++       KK     + +      R    +       
Sbjct: 401 VLLPLDEPEDAKLLKTTRPGLEEESDISDEIKKLKSSKKALLRGLGVRFTLSKLLANLGK 460

Query: 267 ----KKKKKKKKKTTKKKKRKKKRKKKK 290
                          +++        + 
Sbjct: 461 EIPGAGDAVTIDPELERRSPNSADDIEY 488


>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region
           [General function prediction only].
          Length = 303

 Score = 28.9 bits (64), Expect = 3.2
 Identities = 9/37 (24%), Positives = 19/37 (51%)

Query: 257 SRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQH 293
           S  +E+E+ K K +K+K    KK ++     + + + 
Sbjct: 256 SDEDEDEDNKGKIRKRKTDDAKKSRKPHIHIEYEQER 292



 Score = 28.5 bits (63), Expect = 4.6
 Identities = 12/35 (34%), Positives = 17/35 (48%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHH 294
           E+E+E  K K +K+K    KK RK     +  Q  
Sbjct: 258 EDEDEDNKGKIRKRKTDDAKKSRKPHIHIEYEQER 292


>gnl|CDD|233390 TIGR01385, TFSII, transcription elongation factor S-II.  This model
           represents eukaryotic transcription elongation factor
           S-II. This protein allows stalled RNA transcription
           complexes to perform a cleavage of the nascent RNA and
           restart at the newly generated 3-prime end.
          Length = 299

 Score = 29.0 bits (65), Expect = 3.2
 Identities = 18/66 (27%), Positives = 27/66 (40%)

Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKK 285
            K  KK  K  KK   K K+ H      ++T+ GE     K++ K +  K  + K     
Sbjct: 61  SKLAKKIIKSWKKVVDKNKSDHPGGNPEDKTTVGESVNSVKQEAKSQSDKIEQPKYVSSS 120

Query: 286 RKKKKN 291
            +  KN
Sbjct: 121 PRNAKN 126


>gnl|CDD|185616 PTZ00436, PTZ00436, 60S ribosomal protein L19-like protein;
           Provisional.
          Length = 357

 Score = 28.8 bits (63), Expect = 3.4
 Identities = 14/70 (20%), Positives = 37/70 (52%), Gaps = 4/70 (5%)

Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
           + K K ++KK+++  ++   K+ K++ HR    ++  + E  + +K +++ +++      
Sbjct: 142 IHKVKNEKKKERQLAEQLAAKRLKDEQHR----HKARKQELRKREKDRERARREDAAAAA 197

Query: 281 KRKKKRKKKK 290
             K+K   KK
Sbjct: 198 AAKQKAAAKK 207


>gnl|CDD|204122 pfam09072, TMA7, Translation machinery associated TMA7.  TMA7 plays
           a role in protein translation. Deletions of the TMA7
           gene results in altered protein synthesis rates.
          Length = 63

 Score = 26.7 bits (59), Expect = 3.4
 Identities = 18/61 (29%), Positives = 26/61 (42%)

Query: 228 RKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRK 287
           R+  KKK  K  +K+KK     D+   +  + E +  K+   K K K        KK  K
Sbjct: 3   RQGGKKKPLKAPKKQKKELDEEDLAFKQKQKEEAKALKELAAKAKGKGPLGGGGIKKSGK 62

Query: 288 K 288
           K
Sbjct: 63  K 63


>gnl|CDD|224977 COG2066, GlsA, Glutaminase [Amino acid transport and metabolism].
          Length = 309

 Score = 28.8 bits (65), Expect = 3.5
 Identities = 11/29 (37%), Positives = 13/29 (44%), Gaps = 6/29 (20%)

Query: 109 FGYFMCDVWNSFDVYFSTASILHLCCISV 137
           FG    DV  + DVYF      H C I +
Sbjct: 179 FGNLEHDVEEALDVYF------HQCAIEM 201


>gnl|CDD|234043 TIGR02871, spore_ylbJ, sporulation integral membrane protein YlbJ. 
           Members of this protein family are found exclusively in
           Firmicutes (low-GC Gram-positive bacterial) and are
           known from studies in Bacillus subtilis to be part of
           the sigma-E regulon. Mutation leads to a sporulation
           defect, confirming that members of this protein family,
           YlbJ, are sporulation proteins. This protein appears to
           be universal among endospore-forming bacteria, but is
           encoded by a pair ORFs distant from eash other in
           Symbiobacterium thermophilum IAM14863 [Cellular
           processes, Sporulation and germination].
          Length = 362

 Score = 28.8 bits (65), Expect = 3.5
 Identities = 14/34 (41%), Positives = 20/34 (58%)

Query: 205 YYALISSSISFWIPGFVMKKKKKRKKKKKKKKKK 238
           Y  LIS  +S    G +++  K+RK+K   KKKK
Sbjct: 142 YILLISHYLSCITMGLILRFYKRRKRKILSKKKK 175


>gnl|CDD|219461 pfam07543, PGA2, Protein trafficking PGA2.  A Saccharomyces
           cerevisiae member of this family (PGA2) is an ER protein
           which has been implicated in protein trafficking.
          Length = 139

 Score = 27.9 bits (62), Expect = 3.6
 Identities = 22/130 (16%), Positives = 48/130 (36%), Gaps = 33/130 (25%)

Query: 200 FVVNVYYALISSSISFWI-------------PGFVMKKKKKRKKKKKKKKKKKRKKKKNQ 246
           F  N+Y +    S   WI             P F+    K ++K+ +K++ ++ + ++ +
Sbjct: 1   FRNNLYGSFAQMSAKDWIRIIVIVGGYLLIRPYFIKLGAKAQEKEHEKERAEREEAREKK 60

Query: 247 HHRDVWINRTSRG----------EEEEEKK----------KKKKKKKKKTTKKKKRKKKR 286
                   R              EE+EE            KK +K+++K  +K    +++
Sbjct: 61  AKISPNALRGGATAGHGEEDTDDEEDEEDFATPSAVPQWGKKARKRQRKVIRKLLEAEEQ 120

Query: 287 KKKKNQHHRD 296
            ++      D
Sbjct: 121 LREDQYDDED 130


>gnl|CDD|219392 pfam07382, HC2, Histone H1-like nucleoprotein HC2.  This family
           contains the bacterial histone H1-like nucleoprotein HC2
           (approximately 200 residues long), which seems to be
           found mostly in Chlamydia. HC2 functions in DNA
           condensation, although it has been suggested that it
           also has other roles.
          Length = 187

 Score = 28.6 bits (63), Expect = 3.6
 Identities = 22/77 (28%), Positives = 29/77 (37%)

Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
           KK    KK   KK  K+   K    +     +    +   +K   KK   KKT  KK   
Sbjct: 51  KKTVAAKKPAAKKAAKKAVAKKVVAKKPVAKKAVAKKATAKKVAAKKVVAKKTVAKKAAA 110

Query: 284 KKRKKKKNQHHRDVWQK 300
           KK   KK    + V +K
Sbjct: 111 KKPAAKKAVAKKAVARK 127


>gnl|CDD|216919 pfam02181, FH2, Formin Homology 2 Domain. 
          Length = 372

 Score = 28.8 bits (65), Expect = 3.6
 Identities = 25/76 (32%), Positives = 34/76 (44%), Gaps = 11/76 (14%)

Query: 228 RKKKKKKKKKKKR----KKKKNQHHRDVW--INRTSRGEEE-----EEKKKKKKKKKKKT 276
           +KK+ K KKK K     K    Q    VW  ++  S  ++      EE    K KKKK  
Sbjct: 1   KKKEPKPKKKLKPLHWDKVNPAQDRGTVWDELDEESFEKDLDLSELEELFSAKAKKKKSK 60

Query: 277 TKKKKRKKKRKKKKNQ 292
             +KK   K+KKK+  
Sbjct: 61  KSEKKSSSKKKKKEIS 76



 Score = 28.4 bits (64), Expect = 5.6
 Identities = 13/28 (46%), Positives = 17/28 (60%)

Query: 219 GFVMKKKKKRKKKKKKKKKKKRKKKKNQ 246
            F  K KKK+ KK +KK   K+KKK+  
Sbjct: 49  LFSAKAKKKKSKKSEKKSSSKKKKKEIS 76


>gnl|CDD|146145 pfam03357, Snf7, Snf7.  This family of proteins are involved in
           protein sorting and transport from the endosome to the
           vacuole/lysosome in eukaryotic cells. Vacuoles/lysosomes
           play an important role in the degradation of both lipids
           and cellular proteins. In order to perform this
           degradative function, vacuoles/lysosomes contain
           numerous hydrolases which have been transported in the
           form of inactive precursors via the biosynthetic pathway
           and are proteolytically activated upon delivery to the
           vacuole/lysosome. The delivery of transmembrane
           proteins, such as activated cell surface receptors to
           the lumen of the vacuole/lysosome, either for
           degradation/downregulation, or in the case of
           hydrolases, for proper localisation, requires the
           formation of multivesicular bodies (MVBs). These late
           endosomal structures are formed by invaginating and
           budding of the limiting membrane into the lumen of the
           compartment. During this process, a subset of the
           endosomal membrane proteins is sorted into the forming
           vesicles. Mature MVBs fuse with the vacuole/lysosome,
           thereby releasing cargo containing vesicles into its
           hydrolytic lumen for degradation. Endosomal proteins
           that are not sorted into the intralumenal MVB vesicles
           are either recycled back to the plasma membrane or Golgi
           complex, or remain in the limiting membrane of the MVB
           and are thereby transported to the limiting membrane of
           the vacuole/lysosome as a consequence of fusion.
           Therefore, the MVB sorting pathway plays a critical role
           in the decision between recycling and degradation of
           membrane proteins. A few archaeal sequences are also
           present within this family.
          Length = 169

 Score = 28.4 bits (64), Expect = 3.6
 Identities = 9/45 (20%), Positives = 16/45 (35%)

Query: 248 HRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
                  +    E++ +K + + KK  K   K       K+KK  
Sbjct: 9   AIRELDKKQESLEKKIKKLEAEIKKLAKKGNKDAALILLKQKKRY 53



 Score = 27.2 bits (61), Expect = 8.8
 Identities = 13/51 (25%), Positives = 23/51 (45%), Gaps = 3/51 (5%)

Query: 253 INRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQKKYT 303
           + +  R  +++++  +KK KK +   KK  KK  K       +   QKK  
Sbjct: 6   LRKAIRELDKKQESLEKKIKKLEAEIKKLAKKGNKDAALILLK---QKKRY 53


>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 140

 Score = 27.9 bits (63), Expect = 3.6
 Identities = 18/81 (22%), Positives = 47/81 (58%), Gaps = 6/81 (7%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
           ++K+ K+++ + +K +K+ +K K +  +D      +  E   EKK+K+ +KK +  ++K+
Sbjct: 23  LEKEFKKRQAELEKLEKELQKLKEKLQKD----AATLSEAAREKKEKELQKKVQEFQRKQ 78

Query: 282 RKKKRK--KKKNQHHRDVWQK 300
           +K ++   K++ +  + +  K
Sbjct: 79  QKLQQDLQKRQQEELQKILDK 99


>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 265

 Score = 28.5 bits (64), Expect = 3.7
 Identities = 12/73 (16%), Positives = 25/73 (34%), Gaps = 11/73 (15%)

Query: 223 KKKKKR-KKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKK-----KKKKKT 276
           K+ KK  ++K+   + K       Q+        T       +K +K         K+ +
Sbjct: 154 KEDKKSLEEKQAALEDKLETLVALQNE-----LETQLNSLNSQKAEKNALIAALAAKEAS 208

Query: 277 TKKKKRKKKRKKK 289
              +K   + +K 
Sbjct: 209 ALGEKAALEEQKA 221


>gnl|CDD|227615 COG5296, COG5296, Transcription factor involved in TATA site
           selection and in elongation by RNA polymerase II
           [Transcription].
          Length = 521

 Score = 28.9 bits (64), Expect = 3.8
 Identities = 10/56 (17%), Positives = 29/56 (51%), Gaps = 1/56 (1%)

Query: 230 KKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKK 285
           KKKK  + KK ++++ + + +  I    R ++ +E ++ ++  +++ T     +  
Sbjct: 144 KKKKLLELKKTREREERLYSERHIEL-QRFKDYKELEESEQGLQEEYTPSYAEEAV 198


>gnl|CDD|223836 COG0765, HisM, ABC-type amino acid transport system, permease
          component [Amino acid transport and metabolism].
          Length = 222

 Score = 28.6 bits (65), Expect = 3.8
 Identities = 13/49 (26%), Positives = 27/49 (55%), Gaps = 6/49 (12%)

Query: 38 IFPFIIKGMLM------CFIIITAILGNLLVIISVIKHRKLRIITNYFV 80
            PF++KG+L+        I++  +LG LL ++ +  ++ LR +   +V
Sbjct: 14 YLPFLLKGLLVTLLLTLLSIVLGLVLGLLLALMRLSGNKPLRWLARAYV 62


>gnl|CDD|235132 PRK03577, PRK03577, acid shock protein precursor; Provisional.
          Length = 102

 Score = 27.3 bits (60), Expect = 3.8
 Identities = 14/54 (25%), Positives = 24/54 (44%)

Query: 241 KKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHH 294
              K  HH      +    + +  KK  K KK++K  ++K +  K+  KK+ H 
Sbjct: 38  APAKTTHHHKKQHKKAPEQKAQAAKKHHKNKKEQKAPEQKAQAAKKHAKKHSHK 91



 Score = 26.5 bits (58), Expect = 7.1
 Identities = 13/56 (23%), Positives = 24/56 (42%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTK 278
            K     KK+ KK  +++ +   +HH++    +    + +  KK  KK   K   K
Sbjct: 40  AKTTHHHKKQHKKAPEQKAQAAKKHHKNKKEQKAPEQKAQAAKKHAKKHSHKTAAK 95


>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3.  This family of
           proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). SH3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of SH3, AAPs are retained in the ER.
          Length = 196

 Score = 28.5 bits (64), Expect = 3.9
 Identities = 9/26 (34%), Positives = 14/26 (53%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKK 285
            EE E     K+++    +KK+ KKK
Sbjct: 170 AEEAEAAAAAKEEESAEGEKKESKKK 195



 Score = 27.3 bits (61), Expect = 7.7
 Identities = 8/27 (29%), Positives = 14/27 (51%)

Query: 261 EEEEKKKKKKKKKKKTTKKKKRKKKRK 287
            EE +     K+++    +KK  KK+K
Sbjct: 170 AEEAEAAAAAKEEESAEGEKKESKKKK 196


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 28.9 bits (65), Expect = 4.0
 Identities = 15/91 (16%), Positives = 33/91 (36%), Gaps = 19/91 (20%)

Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINR----TSRGEE---------------EEEK 265
           + + K  ++K +K KR+  + +   D         +    +               EEEK
Sbjct: 384 RDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEK 443

Query: 266 KKKKKKKKKKTTKKKKRKKKRKKKKNQHHRD 296
           + K  + KK+  K ++      K + + +  
Sbjct: 444 EDKALEIKKQEWKLEQLAADLSKYEQELYDL 474


>gnl|CDD|227509 COG5182, CUS1, Splicing factor 3b, subunit 2 [RNA processing and
           modification].
          Length = 429

 Score = 28.8 bits (64), Expect = 4.1
 Identities = 16/77 (20%), Positives = 26/77 (33%), Gaps = 12/77 (15%)

Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
           K +KR    + K       K                   + ++ ++KKK   T  K K  
Sbjct: 5   KSRKRSGNNQNKNASVVNNKAEIAAMI------------DARRLEQKKKGGVTNSKGKTN 52

Query: 284 KKRKKKKNQHHRDVWQK 300
           K    K  +  +DV Q+
Sbjct: 53  KVVDAKLEKEFKDVLQR 69


>gnl|CDD|149515 pfam08492, SRP72, SRP72 RNA-binding domain.  This region has been
           identified as the binding site of the SRP72 protein to
           SRP RNA.
          Length = 57

 Score = 26.3 bits (58), Expect = 4.1
 Identities = 13/21 (61%), Positives = 15/21 (71%)

Query: 223 KKKKKRKKKKKKKKKKKRKKK 243
           KKK   ++KKKK  KKKRK K
Sbjct: 10  KKKPGTEQKKKKSAKKKRKPK 30



 Score = 25.9 bits (57), Expect = 6.2
 Identities = 14/29 (48%), Positives = 19/29 (65%)

Query: 262 EEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
           E +K KK  KKK  T +KKK+  K+K+K 
Sbjct: 1   EAKKPKKIAKKKPGTEQKKKKSAKKKRKP 29



 Score = 25.1 bits (55), Expect = 9.1
 Identities = 14/25 (56%), Positives = 15/25 (60%)

Query: 265 KKKKKKKKKKKTTKKKKRKKKRKKK 289
           KK  KKK   +  KKK  KKKRK K
Sbjct: 6   KKIAKKKPGTEQKKKKSAKKKRKPK 30


>gnl|CDD|193409 pfam12936, Kri1_C, KRI1-like family C-terminal.  The yeast member
           of this family (Kri1p) is found to be required for 40S
           ribosome biogenesis in the nucleolus. This is the
           C-terminal domain of the family.
          Length = 93

 Score = 27.2 bits (61), Expect = 4.2
 Identities = 10/26 (38%), Positives = 16/26 (61%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKK 285
           ++EE +K KKK  KK   ++ K+K  
Sbjct: 68  DKEERRKDKKKYGKKARLREWKKKVF 93



 Score = 26.8 bits (60), Expect = 5.0
 Identities = 9/22 (40%), Positives = 16/22 (72%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKK 244
           K+++++ KKK  KK + R+ KK
Sbjct: 69  KEERRKDKKKYGKKARLREWKK 90



 Score = 26.8 bits (60), Expect = 5.1
 Identities = 10/26 (38%), Positives = 17/26 (65%)

Query: 261 EEEEKKKKKKKKKKKTTKKKKRKKKR 286
           ++EE++K KKK  KK   ++ +KK  
Sbjct: 68  DKEERRKDKKKYGKKARLREWKKKVF 93



 Score = 26.8 bits (60), Expect = 5.9
 Identities = 11/27 (40%), Positives = 15/27 (55%)

Query: 258 RGEEEEEKKKKKKKKKKKTTKKKKRKK 284
           R +EE  K KKK  KK +  + KK+  
Sbjct: 67  RDKEERRKDKKKYGKKARLREWKKKVF 93



 Score = 26.4 bits (59), Expect = 7.0
 Identities = 9/23 (39%), Positives = 15/23 (65%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKN 245
           ++++K KKK  KK + +  KKK 
Sbjct: 70  EERRKDKKKYGKKARLREWKKKV 92



 Score = 26.0 bits (58), Expect = 8.9
 Identities = 9/25 (36%), Positives = 17/25 (68%)

Query: 266 KKKKKKKKKKTTKKKKRKKKRKKKK 290
           K++++K KKK  KK + ++ +KK  
Sbjct: 69  KEERRKDKKKYGKKARLREWKKKVF 93


>gnl|CDD|220654 pfam10254, Pacs-1, PACS-1 cytosolic sorting protein.  PACS-1 is a
           cytosolic sorting protein that directs the localisation
           of membrane proteins in the trans-Golgi network
           (TGN)/endosomal system. PACS-1 connects the clathrin
           adaptor AP-1 to acidic cluster sorting motifs contained
           in the cytoplasmic domain of cargo proteins such as
           furin, the cation-independent mannose-6-phosphate
           receptor and in viral proteins such as human
           immunodeficiency virus type 1 Nef.
          Length = 413

 Score = 28.6 bits (64), Expect = 4.3
 Identities = 17/73 (23%), Positives = 30/73 (41%), Gaps = 12/73 (16%)

Query: 232 KKKKKKKKRKKKKNQ---HHRDVWINRTSRGEEEEEK---------KKKKKKKKKKTTKK 279
           ++K+  K+    KN      R + ++R     +E +           K+K KK       
Sbjct: 285 RRKEGSKRDASGKNTLKSTFRSLQVSRLPSSGQEAQMTNTMSMTVVTKEKNKKVPVMFLG 344

Query: 280 KKRKKKRKKKKNQ 292
           KK K+K  + K+Q
Sbjct: 345 KKPKEKEVESKSQ 357


>gnl|CDD|149048 pfam07768, PVL_ORF50, PVL ORF-50-like family.  This is a family of
           sequences found in both bacteria and bacteriophages.
           This region is approximately 130 residues long and in
           some cases is found as part of the PVL (Panton-Valentine
           leukocidin) group of genes, which encode a member of the
           leukocidin group of bacterial toxins that kill
           leukocytes by creation of pores in the cell membrane.
           PVL appears to be a virulence factor associated with a
           number of human diseases.
          Length = 118

 Score = 27.5 bits (61), Expect = 4.4
 Identities = 12/47 (25%), Positives = 28/47 (59%), Gaps = 3/47 (6%)

Query: 257 SRGEEEEEKKKKKKKKKKKTTKKKKRKKKRK---KKKNQHHRDVWQK 300
            R +E  E KK +  ++++  ++ +RK++++   ++K  H  +V QK
Sbjct: 49  MRLKEYREIKKSENIEQERKERELERKRRKEAELRRKKPHLFNVPQK 95


>gnl|CDD|218611 pfam05499, DMAP1, DNA methyltransferase 1-associated protein 1
           (DMAP1).  DNA methylation can contribute to
           transcriptional silencing through several
           transcriptionally repressive complexes, which include
           methyl-CpG binding domain proteins (MBDs) and histone
           deacetylases (HDACs). The chief enzyme that maintains
           mammalian DNA methylation, DNMT1, can also establish a
           repressive transcription complex. The non-catalytic
           amino terminus of DNMT1 binds to HDAC2 and DMAP1 (for
           DNMT1 associated protein), and can mediate
           transcriptional repression. DMAP1 has intrinsic
           transcription repressive activity, and binds to the
           transcriptional co-repressor TSG101. DMAP1 is targeted
           to replication foci through interaction with the far N
           terminus of DNMT1 throughout S phase, whereas HDAC2
           joins DNMT1 and DMAP1 only during late S phase,
           providing a platform for how histones may become
           deacetylated in heterochromatin following replication.
          Length = 175

 Score = 27.9 bits (62), Expect = 4.4
 Identities = 15/67 (22%), Positives = 29/67 (43%), Gaps = 10/67 (14%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
           ++K + RKK+++KK +  +K           I    +        + +K++KK   KK  
Sbjct: 9   LRKIEARKKEREKKAQDLQK----------LITAADQQAAGFSTAEARKREKKLPKKKIP 58

Query: 282 RKKKRKK 288
           +K    K
Sbjct: 59  QKPGPSK 65



 Score = 27.5 bits (61), Expect = 6.6
 Identities = 17/65 (26%), Positives = 32/65 (49%), Gaps = 5/65 (7%)

Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
           +K + +KK+++KK +  +K      +      T+     E +K++KK  KKK  +K    
Sbjct: 10  RKIEARKKEREKKAQDLQKLITAADQQAAGFSTA-----EARKREKKLPKKKIPQKPGPS 64

Query: 284 KKRKK 288
           K+  K
Sbjct: 65  KEDSK 69


>gnl|CDD|222911 PHA02616, PHA02616, VP2/VP3; Provisional.
          Length = 259

 Score = 28.5 bits (63), Expect = 4.4
 Identities = 16/73 (21%), Positives = 30/73 (41%), Gaps = 10/73 (13%)

Query: 217 IPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKT 276
           IP +++   ++  K   K   +  K+K+++ H               +K    KK K   
Sbjct: 184 IPDWILYVLEELNKDIYKIPTQAVKRKQDELH----------PVSPTKKAALSKKSKWTG 233

Query: 277 TKKKKRKKKRKKK 289
           TK  +  KKR+ +
Sbjct: 234 TKSSQSSKKRRGR 246


>gnl|CDD|215059 PLN00111, PLN00111, accumulation of photosystem one; Provisional.
          Length = 399

 Score = 28.4 bits (64), Expect = 4.5
 Identities = 8/27 (29%), Positives = 15/27 (55%)

Query: 266 KKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
            K++KK   +  K  +R+ + KKK  +
Sbjct: 48  SKREKKPYPRPMKLLRREAREKKKLRK 74


>gnl|CDD|233873 TIGR02454, CbiQ_TIGR, cobalt ABC transporter, permease protein
           CbiQ.  This model represents the permease component of
           the cobalt-specific ABC transporter. This model finds
           permeases which are generally next to the other subunits
           of the complex (CbiN and CbiO) or the cobalamin
           biosynthesis protein CbiM which is a transmembrane
           protein which likely interacts with the complex in some
           manner. In genomes which possess all of these subunits
           the ATPase is most likely running in the direction of
           import (for the biosynthesis of coenzyme B12). In other
           genomes, this subunit may be involved in the export of
           cobalt and/or other closely related heavy metals
           [Transport and binding proteins, Cations and iron
           carrying compounds].
          Length = 198

 Score = 28.0 bits (63), Expect = 4.5
 Identities = 11/74 (14%), Positives = 27/74 (36%), Gaps = 7/74 (9%)

Query: 46  MLMCFIIITAILGN-----LLVIISVIKHRKLRI-ITNYFV-VSLAFADLLVALCVMPFN 98
             +  +++  +L N     +L I +++      I +      +++    LLV L ++ F+
Sbjct: 9   SALALLVLAVLLPNPIALLILAIFALLLVGLAGIPLRILLKRLAVPIGFLLVGLLLLLFS 68

Query: 99  AIVSLTDEWYFGYF 112
                      G  
Sbjct: 69  NGTPGEPLLVLGPL 82


>gnl|CDD|218747 pfam05786, Cnd2, Condensin complex subunit 2.  This family consists
           of several Barren protein homologues from several
           eukaryotic organisms. In Drosophila Barren (barr) is
           required for sister-chromatid segregation in mitosis.
           barr encodes a novel protein that is present in
           proliferating cells and has homologues in yeast and
           human. Mitotic defects in barr embryos become apparent
           during cycle 16, resulting in a loss of PNS and CNS
           neurons. Centromeres move apart at the
           metaphase-anaphase transition and Cyclin B is degraded,
           but sister chromatids remain connected, resulting in
           chromatin bridging. Barren protein localises to
           chromatin throughout mitosis. Colocalisation and
           biochemical experiments indicate that Barren associates
           with Topoisomerase II throughout mitosis and alters the
           activity of Topoisomerase II. It has been suggested that
           this association is required for proper chromosomal
           segregation by facilitating the decatenation of
           chromatids at anaphase. This family forms one of the
           three non-structural maintenance of chromosomes (SMC)
           subunits of the mitotic condensation complex along with
           Cnd1 and Cnd3.
          Length = 719

 Score = 28.6 bits (64), Expect = 4.5
 Identities = 15/65 (23%), Positives = 25/65 (38%), Gaps = 6/65 (9%)

Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKK 285
                K   +K+KKR K+         I+  +  EE + +   +  K   T  K K +KK
Sbjct: 435 AAADTKSAAEKQKKRAKE------PFEIDFGAPLEEIDFEVIFQPPKANSTLSKSKLQKK 488

Query: 286 RKKKK 290
              + 
Sbjct: 489 WDTRT 493


>gnl|CDD|71622 pfam08188, Protamine_3, Spermatozal protamine family.  This family
           consists of the spermatozal protamines. Spermatozal
           protamines play an important role in remodelling of the
           sperm chromatin during mammalian spermiogenesis. Nuclear
           elongation and chromatin condensation are concomitant
           with modifications in the basic protein complement
           associated with DNA. Somatic histones are initially
           replaced by testis -specific histone variants, then by
           transitional proteins, and ultimately by protamines.
          Length = 48

 Score = 26.0 bits (56), Expect = 4.6
 Identities = 10/38 (26%), Positives = 23/38 (60%)

Query: 265 KKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQKKY 302
           +++   KKK+K+ +++K +K ++K+KN   R      +
Sbjct: 2   RRRHSMKKKRKSVRRRKTRKNQRKRKNSLGRSFKAHGF 39


>gnl|CDD|235222 PRK04125, PRK04125, murein hydrolase regulator LrgA; Provisional.
          Length = 141

 Score = 27.7 bits (62), Expect = 4.6
 Identities = 9/27 (33%), Positives = 18/27 (66%)

Query: 220 FVMKKKKKRKKKKKKKKKKKRKKKKNQ 246
           F++ K +K K+ KK+KK K + ++  +
Sbjct: 115 FILGKTEKEKEDKKEKKVKIKGRQHGK 141


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 28.8 bits (65), Expect = 4.8
 Identities = 12/23 (52%), Positives = 18/23 (78%)

Query: 259 GEEEEEKKKKKKKKKKKTTKKKK 281
            EEEEE+K+KKKKK  ++T+ + 
Sbjct: 369 EEEEEEEKEKKKKKSAESTRSEL 391


>gnl|CDD|233042 TIGR00598, rad14, DNA repair protein.  All proteins in this family
           for which functions are known are used for the
           recognition of DNA damage as part of nucleotide excision
           repair. This family is based on the phylogenomic
           analysis of JA Eisen (1999, Ph.D. Thesis, Stanford
           University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 172

 Score = 27.8 bits (62), Expect = 4.9
 Identities = 13/41 (31%), Positives = 23/41 (56%), Gaps = 4/41 (9%)

Query: 258 RGEEEEEKKKKKKKKKKKTTKKKKR----KKKRKKKKNQHH 294
           R E +EE K+KK +KK K  ++  R      K++ + ++H 
Sbjct: 103 REESKEEMKEKKFEKKLKELRRAVRSSEYTNKKEGRVHEHE 143


>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family.  This family of proteins includes
           MND1 from S. cerevisiae. The mnd1 protein forms a
           complex with hop2 to promote homologous chromosome
           pairing and meiotic double-strand break repair.
          Length = 188

 Score = 28.0 bits (63), Expect = 5.0
 Identities = 18/71 (25%), Positives = 37/71 (52%), Gaps = 2/71 (2%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
           +KK+ +  K++  + + + +K K    R+    RT   EE ++ +K+ KK K +  K +K
Sbjct: 74  LKKELEELKQRIAELQAQIEKLKKG--REETEERTELLEELKQLEKELKKLKAELEKYEK 131

Query: 282 RKKKRKKKKNQ 292
              +R +K  +
Sbjct: 132 NDPERIEKLKE 142


>gnl|CDD|219589 pfam07808, RED_N, RED-like protein N-terminal region.  This family
           contains sequences that are similar to the N-terminal
           region of Red protein. This and related proteins contain
           a RED repeat which consists of a number of RE and RD
           sequence elements. The region in question has several
           conserved NLS sequences and a putative trimeric
           coiled-coil region, suggesting that these proteins are
           expressed in the nucleus. The function of Red protein is
           unknown, but efficient sequestration to nuclear bodies
           suggests that its expression may be tightly regulated of
           that the protein self-aggregates extremely efficiently.
          Length = 238

 Score = 28.3 bits (63), Expect = 5.0
 Identities = 9/26 (34%), Positives = 17/26 (65%)

Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRD 250
           KKK+K    +K+++  +K+ N  +RD
Sbjct: 1   KKKKKYAYLRKQEENAEKEINPKYRD 26


>gnl|CDD|153128 cd02437, CCC1_like_1, CCC1-related protein family.  CCC1_like_1:
           This is a protein family closely related to CCC1, a
           family of proteins involved in iron and manganese
           transport. Yeast CCC1 is a vacuole transmembrane protein
           responsible for the iron and manganese accumulation in
           vacuole.   .
          Length = 175

 Score = 27.8 bits (62), Expect = 5.2
 Identities = 10/62 (16%), Positives = 24/62 (38%), Gaps = 1/62 (1%)

Query: 36  IHIFPFIIKGMLMCFIIITAILGNLLVIISVIKHRKLRIITNYFVVSLAFADLLV-ALCV 94
           + + PF+   + +   I  AI+  +L I+ ++  +  +I      V      +   +L  
Sbjct: 110 LPLLPFLPFDLWLAAGIAVAIVLAILFILGLVIGKISKINVAISFVWAVGMVITGISLVF 169

Query: 95  MP 96
             
Sbjct: 170 SV 171



 Score = 27.0 bits (60), Expect = 8.3
 Identities = 6/33 (18%), Positives = 9/33 (27%)

Query: 204 VYYALISSSISFWIPGFVMKKKKKRKKKKKKKK 236
              A+       +I G V+ K  K         
Sbjct: 124 AGIAVAIVLAILFILGLVIGKISKINVAISFVW 156


>gnl|CDD|236172 PRK08173, PRK08173, DNA topoisomerase III; Validated.
          Length = 862

 Score = 28.5 bits (64), Expect = 5.2
 Identities = 16/76 (21%), Positives = 29/76 (38%), Gaps = 7/76 (9%)

Query: 217 IPGFVMKKKKK-------RKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKK 269
           +PGF   +  +       ++   K   + + ++ K    +      T+    + EK   K
Sbjct: 785 LPGFKSSRTGRNFKAYLVKQPDGKIGFEFEPREPKAAAAKKTAAKATAAAATKAEKAAAK 844

Query: 270 KKKKKKTTKKKKRKKK 285
           K   KKT  KK   +K
Sbjct: 845 KAPAKKTAAKKTAARK 860


>gnl|CDD|219137 pfam06687, SUR7, SUR7/PalI family.  This family consists of several
           fungal-specific SUR7 proteins. Its activity regulates
           expression of RVS161, a homologue of human endophilin,
           suggesting a function for both in endocytosis. The
           protein carries four transmembrane domains and is thus
           likely to act as an anchoring protein for the eisosome
           to the plasma membrane. Eisosomes are the immobile
           protein complexes, that include the proteins Pil1 and
           Lsp1, which co-localise with sites of protein and lipid
           endocytosis at the plasma membrane. SUR7 protein may
           play a role in sporulation. This family also includes
           PalI which is part of a pH signal transduction cascade.
           Based on the similarity of PalI to the yeast Rim9
           meiotic signal transduction component it has been
           suggested that PalI might be a membrane sensor for
           ambient pH.
          Length = 205

 Score = 28.1 bits (63), Expect = 5.3
 Identities = 13/90 (14%), Positives = 33/90 (36%), Gaps = 8/90 (8%)

Query: 13  YNINYTS---TNFSNTSLFPTNSTIHIHIFPFIIKGMLMCFIIITAILGNLLVIISVI-- 67
           Y  +      T        P++   +++ + ++ + M      I      + +I++ I  
Sbjct: 76  YPFDPVDNFGTESGENLNLPSSFRDNLNTYYYLSRFMF-IVHPIALFFTVIALILAGILA 134

Query: 68  --KHRKLRIITNYFVVSLAFADLLVALCVM 95
                +   + N+ +  LAF   L+A  + 
Sbjct: 135 HFSSPRRGSLVNFLLSLLAFLFTLLAAALD 164


>gnl|CDD|218550 pfam05311, Baculo_PP31, Baculovirus 33KDa late protein (PP31).
           Autographa californica nuclear polyhedrosis virus
           (AcMNPV) pp31 is a nuclear phosphoprotein that
           accumulates in the virogenic stroma, which is the viral
           replication centre in the infected-cell nucleus, binds
           to DNA, and serves as a late expression factor.
          Length = 267

 Score = 28.1 bits (63), Expect = 5.3
 Identities = 9/38 (23%), Positives = 15/38 (39%)

Query: 256 TSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQH 293
            S     +     +K+K  K   KK    K++K+K   
Sbjct: 212 ASVVPTPQSSSPSRKRKAPKKVAKKVAAAKKRKQKRSP 249


>gnl|CDD|237748 PRK14537, PRK14537, 50S ribosomal protein L20/unknown domain fusion
           protein; Provisional.
          Length = 230

 Score = 28.0 bits (62), Expect = 5.3
 Identities = 17/93 (18%), Positives = 35/93 (37%), Gaps = 8/93 (8%)

Query: 193 HHPNVCIFVVNVYYALISSSISFWIPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVW 252
             P V       Y  L    +S +    + K KK    +  ++K   + ++K     +  
Sbjct: 101 LQPEV----FVQYVELAKKFLSQFQSTTITKPKKVLINEVLQEKTINQNEEKTSLQNE-- 154

Query: 253 INRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKK 285
             +    E  EEK   + + + + T+ K++K  
Sbjct: 155 --KVLSPELSEEKSDSELETQPQKTQLKEKKPS 185


>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413).  This is
           a family of proteins conserved in fungi. The function is
           not known.
          Length = 436

 Score = 28.2 bits (63), Expect = 5.4
 Identities = 15/64 (23%), Positives = 23/64 (35%), Gaps = 7/64 (10%)

Query: 229 KKKKKKKKKKKRKKKKNQHHRDVWINRTSRGE-EEEEKKKKKKKKKKKTTKKKKRKKKRK 287
           K   KK KK    K   +   D+          +E E+ +K K  KK     +    +  
Sbjct: 12  KAPTKKPKKGDASKDSTEDDEDIL------EFLDELEQSEKAKPPKKPKEASRPGTPRNP 65

Query: 288 KKKN 291
           KK +
Sbjct: 66  KKSS 69


>gnl|CDD|215329 PLN02610, PLN02610, probable methionyl-tRNA synthetase.
          Length = 801

 Score = 28.6 bits (64), Expect = 5.4
 Identities = 15/45 (33%), Positives = 21/45 (46%), Gaps = 3/45 (6%)

Query: 255 RTSRGEEEEEKK---KKKKKKKKKTTKKKKRKKKRKKKKNQHHRD 296
           R +R E  E KK   + KKK      KKK+ KK     K++   +
Sbjct: 591 RAARAEAAEAKKLAKQLKKKALSDGGKKKQGKKAGGGGKSKAAAE 635


>gnl|CDD|220505 pfam09988, DUF2227, Uncharacterized metal-binding protein
           (DUF2227).  Members of this family of hypothetical
           bacterial proteins possess metal binding properties;
           however, their exact function has not, as yet, been
           determined.
          Length = 169

 Score = 27.6 bits (62), Expect = 5.4
 Identities = 17/79 (21%), Positives = 35/79 (44%), Gaps = 8/79 (10%)

Query: 158 VLLMLSNVWILPGIISFTPIMLGWYTTPDHKQ-------YRKHHPNVCIFVVNVYYALIS 210
           +L +L  V +L  ++++   +LG    P  +Q        R+H   +  F++ +      
Sbjct: 92  LLYLLLWVLLLALLLAWILGLLGGLNWPTPEQAPWLQQLLRQHPAELLAFLIGLELGSWL 151

Query: 211 SSISFWIPGFVMKKKKKRK 229
             I+  +P    K+ +KRK
Sbjct: 152 HLIADGVPSPF-KRSRKRK 169


>gnl|CDD|148288 pfam06589, CRA, Circumsporozoite-related antigen (CRA).  This
           family consists of several circumsporozoite-related
           antigen (CRA) or exported protein-1 (EXP1) sequences
           found specifically in Plasmodium species. The function
           of this family is unknown.
          Length = 157

 Score = 27.6 bits (61), Expect = 5.4
 Identities = 20/56 (35%), Positives = 22/56 (39%), Gaps = 2/56 (3%)

Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDV--WINRTSRGEEEEEKKKKKKKKKKKTT 277
           K K  K       KKK KK   +   DV   I+   R EEE  K  K K   K  T
Sbjct: 22  KNKNGKYGSGNVIKKKNKKGSGEPLIDVHDLISDMVRKEEELVKLTKNKSSYKLAT 77


>gnl|CDD|223596 COG0522, RpsD, Ribosomal protein S4 and related proteins
           [Translation, ribosomal structure and biogenesis].
          Length = 205

 Score = 28.0 bits (63), Expect = 5.5
 Identities = 15/53 (28%), Positives = 26/53 (49%), Gaps = 6/53 (11%)

Query: 223 KKKKKRK------KKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKK 269
           KKK  R+       K++  K++K   K  QH ++ W   +  G +  EK+K +
Sbjct: 8   KKKLSRRLGFNPWLKERLCKERKLPYKPGQHGQERWKKLSDYGLQLREKQKLR 60


>gnl|CDD|221643 pfam12572, DUF3752, Protein of unknown function (DUF3752).  This
           domain family is found in eukaryotes, and is typically
           between 140 and 163 amino acids in length.
          Length = 148

 Score = 27.4 bits (61), Expect = 5.6
 Identities = 18/74 (24%), Positives = 35/74 (47%), Gaps = 8/74 (10%)

Query: 223 KKKKKRKKKKKKKKKKKRKKK-KNQHHRDVWINRTSRGEEEEEKKKKKK---KKKKKTTK 278
           K+ K+  +K K+  +KKR K    QH +     +  + +EEE     ++   ++K     
Sbjct: 69  KRDKEMARKVKEYNEKKRGKSLVEQHQKK----QKKKKKEEENDDPSRRPFDREKDLAVG 124

Query: 279 KKKRKKKRKKKKNQ 292
            K  K +R++  N+
Sbjct: 125 GKISKAQRRELINK 138


>gnl|CDD|163538 TIGR03826, YvyF, flagellar operon protein TIGR03826.  This gene is
           found in flagellar operons of Bacillus-related
           organisms. Its function has not been determined and an
           official gene symbol has not been assigned, although the
           gene is designated yvyF in B. subtilus. A tentative
           assignment as a regulator is suggested in the NCBI
           record GI:16080597.
          Length = 137

 Score = 27.3 bits (61), Expect = 5.6
 Identities = 13/29 (44%), Positives = 19/29 (65%)

Query: 257 SRGEEEEEKKKKKKKKKKKTTKKKKRKKK 285
           S GE+E+ KKK+K+ +KK T   +  K K
Sbjct: 109 SAGEQEKTKKKEKETEKKDTYYIQDTKNK 137


>gnl|CDD|218215 pfam04696, Pinin_SDK_memA, pinin/SDK/memA/ protein conserved
           region.  Members of this family have very varied
           localisations within the eukaryotic cell. pinin is known
           to localise at the desmosomes and is implicated in
           anchoring intermediate filaments to the desmosomal
           plaque. SDK2/3 is a dynamically localised nuclear
           protein thought to be involved in modulation of
           alternative pre-mRNA splicing. memA is a tumour marker
           preferentially expressed in human melanoma cell lines. A
           common feature of the members of this family is that
           they may all participate in regulating protein-protein
           interactions.
          Length = 131

 Score = 27.4 bits (61), Expect = 5.6
 Identities = 11/64 (17%), Positives = 36/64 (56%), Gaps = 7/64 (10%)

Query: 229 KKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKK 288
           K  +++ +   ++K++ +      I +    +E++E+++ +K+K++    +++RK+   +
Sbjct: 22  KFSQEESRLTSKEKRRAE------IEQKLEEQEKQEREELRKEKRELFE-ERRRKQLELR 74

Query: 289 KKNQ 292
           K  Q
Sbjct: 75  KLEQ 78


>gnl|CDD|218078 pfam04419, 4F5, 4F5 protein family.  Members of this family are
           short proteins that are rich in aspartate, glutamate,
           lysine and arginine. Although the function of these
           proteins is unknown, they are found to be ubiquitously
           expressed.
          Length = 38

 Score = 25.2 bits (56), Expect = 5.7
 Identities = 9/24 (37%), Positives = 11/24 (45%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKN 245
            KK  K+KKK  K + K   K   
Sbjct: 15  AKKAAKKKKKGAKSQLKAAAKALE 38


>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 157

 Score = 27.6 bits (62), Expect = 5.8
 Identities = 25/105 (23%), Positives = 47/105 (44%), Gaps = 26/105 (24%)

Query: 199 IFVVNVYYALISSSISFWIPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSR 258
           I VV+V   L  S           K  +K+ +K+ KK + + +KK               
Sbjct: 18  IGVVDVQKVLSESP--------AGKAAQKQLEKEFKKLQAELQKK--------------- 54

Query: 259 GEEEEEKKKKKKKKKKKTTKKKKRKKKRKK--KKNQHHRDVWQKK 301
            E+E +K+++K +K+  T  ++ RK K+++  +K Q  +   Q  
Sbjct: 55  -EKELQKEEQKLQKQAATLSEEARKAKQQELQQKQQELQQKQQAA 98


>gnl|CDD|233048 TIGR00605, rad4, DNA repair protein rad4.  All proteins in this
           family for which functions are known are involved in
           targeting nucleotide excision repair to specific regions
           of the genome.This family is based on the phylogenomic
           analysis of JA Eisen (1999, Ph.D. Thesis, Stanford
           University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 713

 Score = 28.3 bits (63), Expect = 5.8
 Identities = 9/33 (27%), Positives = 19/33 (57%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
           E E++ K +++K +++     +R+KKR K    
Sbjct: 21  EAEKQPKSRRRKVRRENEPSLRRRKKRFKTGLN 53



 Score = 28.3 bits (63), Expect = 6.5
 Identities = 7/34 (20%), Positives = 21/34 (61%)

Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQH 293
           +E E++ K +++K ++  +   R++K++ K   +
Sbjct: 20  KEAEKQPKSRRRKVRRENEPSLRRRKKRFKTGLN 53


>gnl|CDD|241262 cd01228, PH_BCR-related, Breakpoint Cluster Region-related
           pleckstrin homology (PH) domain.  The BCR gene is one of
           the two genes in the BCR-ABL complex, which is
           associated with the Philadelphia chromosome, a product
           of a reciprocal translocation between chromosomes 22 and
           9. BCR is a GTPase-activating protein (GAP) for RAC1
           (primarily) and CDC42. The Dbl region of BCR has the
           most RhoGEF activity for Cdc42, and less activity
           towards Rac and Rho. Since BCR possesses both GAP and
           GEF activities, it may function to temporally regulate
           the activity of these GTPases. It also displays
           serine/threonine kinase activity. The BCR protein
           contains multiple domains including an N-terminal kinase
           domain, a RhoGEF domain, a PH domain, a C1 domain, a C2
           domain, and a C-terminal RhoGAP domain. ABR, a related
           smaller protein, is structurally similar to BCR, but
           lacks the N-terminal kinase domain and has GAP activity
           for both Rac and Cdc42. PH domains have diverse
           functions, but in general are involved in targeting
           proteins to the appropriate cellular location or in the
           interaction with a binding partner. They share little
           sequence conservation, but all have a common fold, which
           is electrostatically polarized. Less than 10% of PH
           domains bind phosphoinositide phosphates (PIPs) with
           high affinity and specificity. PH domains are
           distinguished from other PIP-binding domains by their
           specific high-affinity binding to PIPs with two vicinal
           phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
           PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 179

 Score = 27.7 bits (62), Expect = 5.9
 Identities = 15/53 (28%), Positives = 23/53 (43%), Gaps = 3/53 (5%)

Query: 217 IPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKK 269
            P  V K  ++  + KK     K+K K++   R    +  SR     EK +KK
Sbjct: 80  APDVVPKSSEELVQMKKAISALKQKIKRDSAERKKAASSGSRA---IEKLRKK 129


>gnl|CDD|218437 pfam05107, DUF694, Family of unknown function (DUF694).  Family of
           hypothetical bacterial proteins.
          Length = 278

 Score = 28.0 bits (63), Expect = 6.0
 Identities = 18/43 (41%), Positives = 28/43 (65%), Gaps = 1/43 (2%)

Query: 239 KRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
           + K+K N  H +++  R  R EE  ++KKKKKKKKKK  + ++
Sbjct: 63  QEKEKLNDEHGELY-ERADRAEELLKEKKKKKKKKKKKEELRE 104



 Score = 27.6 bits (62), Expect = 7.5
 Identities = 11/43 (25%), Positives = 27/43 (62%)

Query: 250 DVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
           ++++    +  +E  +  ++  + ++  K+KK+KKK+KKKK +
Sbjct: 59  EIFVQEKEKLNDEHGELYERADRAEELLKEKKKKKKKKKKKEE 101


>gnl|CDD|215278 PLN02502, PLN02502, lysyl-tRNA synthetase.
          Length = 553

 Score = 28.0 bits (63), Expect = 6.2
 Identities = 8/32 (25%), Positives = 14/32 (43%)

Query: 255 RTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKR 286
           R    + EEEK  K++ K        K + ++
Sbjct: 16  RLKAKQAEEEKAAKEEAKAAAAAAAAKGRSRK 47


>gnl|CDD|198139 smart01071, CDC37_N, Cdc37 N terminal kinase binding.  Cdc37 is a
           molecular chaperone required for the activity of
           numerous eukaryotic protein kinases. This domain
           corresponds to the N terminal domain which binds
           predominantly to protein kinases.and is found N terminal
           to the Hsp (Heat shocked protein) 90-binding domain.
           Expression of a construct consisting of only the
           N-terminal domain of Saccharomyces pombe Cdc37 results
           in cellular viability. This indicates that interactions
           with the cochaperone Hsp90 may not be essential for
           Cdc37 function.
          Length = 154

 Score = 27.4 bits (61), Expect = 6.4
 Identities = 8/50 (16%), Positives = 25/50 (50%)

Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKK 273
           + + +K+ ++     +   ++ + HRD          ++ ++ +K++KKK
Sbjct: 92  QDQLKKELEEANGDSEGLLEELKKHRDKLKKEQKELRKKLDELEKEEKKK 141


>gnl|CDD|227474 COG5145, RAD14, DNA excision repair protein [DNA replication,
           recombination, and repair].
          Length = 292

 Score = 28.0 bits (62), Expect = 6.4
 Identities = 15/49 (30%), Positives = 30/49 (61%), Gaps = 6/49 (12%)

Query: 259 GEEEEEKKKKKKKKKKKTTKKKKRKKK----RKKKKNQHH--RDVWQKK 301
             EE +++K++++K K   K+KK +KK    R+K +  ++   DV +K+
Sbjct: 210 SLEELDREKQRREKMKDDRKEKKLEKKIKELRRKTRTSNYSRMDVREKE 258



 Score = 27.7 bits (61), Expect = 7.5
 Identities = 19/69 (27%), Positives = 32/69 (46%), Gaps = 19/69 (27%)

Query: 228 RKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRK 287
           R+K++++K K  RK+KK                    +KK K+ ++K  T    R   R+
Sbjct: 216 REKQRREKMKDDRKEKK-------------------LEKKIKELRRKTRTSNYSRMDVRE 256

Query: 288 KKKNQHHRD 296
           K+K+ H  D
Sbjct: 257 KEKHVHVFD 265


>gnl|CDD|235501 PRK05559, PRK05559, DNA topoisomerase IV subunit B; Reviewed.
          Length = 631

 Score = 28.1 bits (64), Expect = 6.5
 Identities = 7/24 (29%), Positives = 12/24 (50%)

Query: 220 FVMKKKKKRKKKKKKKKKKKRKKK 243
             +K  + R +  KK K+KK+   
Sbjct: 375 KAIKAAQARLRAAKKVKRKKKTSG 398


>gnl|CDD|215525 PLN02972, PLN02972, Histidyl-tRNA synthetase.
          Length = 763

 Score = 28.3 bits (63), Expect = 6.5
 Identities = 12/26 (46%), Positives = 13/26 (50%), Gaps = 4/26 (15%)

Query: 258 RGEEEEEK----KKKKKKKKKKTTKK 279
           + E E E     KK KK KKKK   K
Sbjct: 237 QAEAENEAGKSDKKDKKSKKKKVLGK 262


>gnl|CDD|225743 COG3202, COG3202, ATP/ADP translocase [Energy production and
           conversion].
          Length = 509

 Score = 28.0 bits (63), Expect = 6.5
 Identities = 37/200 (18%), Positives = 67/200 (33%), Gaps = 34/200 (17%)

Query: 50  FIIITAILGNLLVIISVIKHRKLRIITNYFVVSLAFADLLVALCVMPFNAIVSLTDEWYF 109
           F II         + + +    +    +       F+  L A   M     + +  EW +
Sbjct: 95  FYIILGFFLGFFALFAFV----IYPYKDILHPDPEFSRDLFADLPMFLKWFILIVGEWSY 150

Query: 110 G--YFMCDVWNSFDV---YFSTASILHLCCIS-VDRYYAIVKPLQYPIIMNQNTVLLMLS 163
              Y M ++W S  +   ++  A+   +  I    R+Y           +  N  LL+  
Sbjct: 151 SLFYIMAELWGSLVLSLLFWQFAN--EITTIEEAKRFYP-------LFGLGANISLLLSG 201

Query: 164 NVWILPGIISFTPIMLGWYTTPDHKQYRKHHPNVC--IFVVNVYYALISSSISFWIPGFV 221
            V               W +        +    +   +F+V V   +I+  +  +I   V
Sbjct: 202 EV-------------TSWLSKHRDALASEFKDLLLQILFIVIVILGIIAILLYRYINRNV 248

Query: 222 MKKKKKRKKKKKKKKKKKRK 241
           +       + KKKKKKKK K
Sbjct: 249 LTDPLFYLRAKKKKKKKKLK 268


>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
           factor.  Replicative DNA polymerases are capable of
           polymerising tens of thousands of nucleotides without
           dissociating from their DNA templates. The high
           processivity of these polymerases is dependent upon
           accessory proteins that bind to the catalytic subunit of
           the polymerase or to the substrate. The Epstein-Barr
           virus (EBV) BMRF1 protein is an essential component of
           the viral DNA polymerase and is absolutely required for
           lytic virus replication. BMRF1 is also a transactivator.
           This family is predicted to have a UL42 like structure.
          Length = 381

 Score = 28.1 bits (63), Expect = 6.6
 Identities = 10/19 (52%), Positives = 14/19 (73%)

Query: 224 KKKKRKKKKKKKKKKKRKK 242
           K+K+   KK+KKKK K+ K
Sbjct: 356 KRKREGDKKQKKKKSKKLK 374



 Score = 27.7 bits (62), Expect = 8.1
 Identities = 10/20 (50%), Positives = 15/20 (75%)

Query: 226 KKRKKKKKKKKKKKRKKKKN 245
           K++++  KK+KKKK KK K 
Sbjct: 356 KRKREGDKKQKKKKSKKLKL 375


>gnl|CDD|226814 COG4379, COG4379, Mu-like prophage tail protein gpP [General
           function prediction only].
          Length = 386

 Score = 27.9 bits (62), Expect = 6.6
 Identities = 9/42 (21%), Positives = 17/42 (40%), Gaps = 1/42 (2%)

Query: 248 HRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKK 289
               +       + E   +K+K  +K  + K KK  KK + +
Sbjct: 341 EDGAFDVDAYPKKAEA-ARKRKGDRKGVSHKAKKGGKKNQAE 381


>gnl|CDD|197664 smart00338, BRLZ, basic region leucin zipper. 
          Length = 65

 Score = 26.0 bits (58), Expect = 6.7
 Identities = 6/33 (18%), Positives = 24/33 (72%)

Query: 261 EEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQH 293
           EE+EK+++++++ ++  ++ + +KK + ++ + 
Sbjct: 1   EEDEKRRRRRERNREAARRSRERKKAEIEELER 33



 Score = 25.6 bits (57), Expect = 7.9
 Identities = 5/34 (14%), Positives = 23/34 (67%)

Query: 262 EEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHR 295
           EE++K+++++++ +   ++ R++K+ + +    +
Sbjct: 1   EEDEKRRRRRERNREAARRSRERKKAEIEELERK 34


>gnl|CDD|147574 pfam05462, Dicty_CAR, Slime mold cyclic AMP receptor.  This family
           consists of cyclic AMP receptor (CAR) proteins from
           slime molds. CAR proteins are responsible for
           controlling development in Dictyostelium discoideum.
          Length = 305

 Score = 27.8 bits (62), Expect = 6.8
 Identities = 27/98 (27%), Positives = 46/98 (46%), Gaps = 14/98 (14%)

Query: 53  ITAILGNLLVIISVIKHRKLR-----IITNYFVVSLAFADLLVALCVMPFNAIVSLTDEW 107
            T+I+G  LV+I   + + LR     +I+ + + SL   DL+        + I++LT+  
Sbjct: 19  FTSIIGCFLVLIGFWRLKLLRNHITKVISCFCLTSL-LKDLI--------STILTLTNSA 69

Query: 108 YFGYFMCDVWNSFDVYFSTASILHLCCISVDRYYAIVK 145
            +  F C ++     Y S A  L   C++   Y  IVK
Sbjct: 70  QYNGFPCYLYAIVITYGSLACWLWTLCLAFSIYNLIVK 107


>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 438

 Score = 27.9 bits (62), Expect = 6.8
 Identities = 13/74 (17%), Positives = 31/74 (41%), Gaps = 4/74 (5%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKT----TK 278
           K++K    ++ K+ +   + +  +      +N     E E  +K+  +  K+        
Sbjct: 78  KEQKNLFNEQIKQFELALQDEIAKLEALELLNLEKDKELELLEKELDELSKELQKQLQNT 137

Query: 279 KKKRKKKRKKKKNQ 292
            +  +KKR+  KN+
Sbjct: 138 AEIIEKKRENNKNE 151


>gnl|CDD|215590 PLN03123, PLN03123, poly [ADP-ribose] polymerase; Provisional.
          Length = 981

 Score = 28.2 bits (63), Expect = 6.8
 Identities = 15/77 (19%), Positives = 36/77 (46%), Gaps = 11/77 (14%)

Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
           K+++ +++K++ KK  K+KK+             G+++ +K K  +     T   +K+  
Sbjct: 191 KEEKAEERKQESKKGAKRKKDAS-----------GDDKSKKAKTDRDVSTSTAASQKKSS 239

Query: 285 KRKKKKNQHHRDVWQKK 301
             + K     +++W  K
Sbjct: 240 DLESKLEAQSKELWSLK 256


>gnl|CDD|217785 pfam03901, Glyco_transf_22, Alg9-like mannosyltransferase family.
           Members of this family are mannosyltransferase enzymes.
           At least some members are localised in endoplasmic
           reticulum and involved in GPI anchor biosynthesis.
          Length = 412

 Score = 28.1 bits (63), Expect = 7.0
 Identities = 35/207 (16%), Positives = 70/207 (33%), Gaps = 13/207 (6%)

Query: 12  TYNINYTSTNFSNTSLFPTNSTIHIHIFPFIIKGMLMCFIIIT-AILGNLLVIISVIKHR 70
               +            P  S  + ++   ++   L      T A+L   LV+  +++  
Sbjct: 139 MILTSIALYYLLEYGSVPNTSVSYKYLKAVLLIAALAILGRPTSALLWLPLVLYHLLRLP 198

Query: 71  KLRIITNYFVVSLAFADLLVALCVMPFNAIVS---LTDEWYFGYFMCDVWNSFDVYFSTA 127
             R+          F  L ++L ++    ++    L D +++G F+    N F  Y   +
Sbjct: 199 GKRLKL--------FLFLAISLGLLVLLLVLGAVILIDSYFYGRFVFTPLN-FLKYNVLS 249

Query: 128 SILHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLSNVWILPGIISFTPIMLGWYTTPDH 187
           +   L       +Y                +L  L  V  L  +++   I L  Y+   H
Sbjct: 250 NGSSLYGTHPWYWYFSNGLPNILGPFLLAFILGPLVLVSRLLLLLAPILIWLFVYSLQPH 309

Query: 188 KQYRKHHPNVCIFVVNVYYALISSSIS 214
           K+ R  +P   +  ++   AL   S  
Sbjct: 310 KEERFLYPIYPLICLSAAIALDRLSRK 336


>gnl|CDD|222928 PHA02778, PHA02778, major capsid L1 protein; Provisional.
          Length = 503

 Score = 27.9 bits (63), Expect = 7.1
 Identities = 5/25 (20%), Positives = 17/25 (68%)

Query: 265 KKKKKKKKKKKTTKKKKRKKKRKKK 289
           + +++K+K   ++  +  K++R+K+
Sbjct: 479 RTRRRKRKSTSSSSSRSTKRRRRKR 503



 Score = 27.9 bits (63), Expect = 8.0
 Identities = 5/25 (20%), Positives = 17/25 (68%)

Query: 266 KKKKKKKKKKTTKKKKRKKKRKKKK 290
           + +++K+K  ++   +  K+R++K+
Sbjct: 479 RTRRRKRKSTSSSSSRSTKRRRRKR 503



 Score = 27.5 bits (62), Expect = 9.1
 Identities = 7/25 (28%), Positives = 16/25 (64%)

Query: 267 KKKKKKKKKTTKKKKRKKKRKKKKN 291
           + +++K+K T+    R  KR+++K 
Sbjct: 479 RTRRRKRKSTSSSSSRSTKRRRRKR 503


>gnl|CDD|201707 pfam01280, Ribosomal_L19e, Ribosomal protein L19e. 
          Length = 148

 Score = 27.1 bits (61), Expect = 7.2
 Identities = 12/46 (26%), Positives = 23/46 (50%), Gaps = 5/46 (10%)

Query: 256 TSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQKK 301
            SRG   + K+K++K + +    +K  K  R  KK     ++W ++
Sbjct: 57  HSRGRARKRKEKRRKGRHRGPGSRKGTKGARMPKK-----ELWIRR 97



 Score = 26.7 bits (60), Expect = 8.8
 Identities = 23/90 (25%), Positives = 38/90 (42%), Gaps = 22/90 (24%)

Query: 219 GFVMKKKKK-----RKKKKKKKKKK-------KRKKKKN--QHHRDVWIN--RTSRGEEE 262
           G ++KK KK     R +K+K+K++K        RK  K      +++WI   R  R    
Sbjct: 47  GLIIKKPKKGHSRGRARKRKEKRRKGRHRGPGSRKGTKGARMPKKELWIRRIRALR---- 102

Query: 263 EEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
             +  ++ +   K  K   R+  RK K   
Sbjct: 103 --RLLRELRDAGKIDKHTYRELYRKAKGGV 130


>gnl|CDD|234402 TIGR03928, T7_EssCb_Firm, type VII secretion protein EssC,
           C-terminal domain.  This model describes the C-terminal
           domain, or longer subunit, of the Firmicutes type VII
           secretion protein EssC. This protein (homologous to EccC
           in Actinobacteria) and the WXG100 target proteins are
           the only homologous parts of type VII secretion between
           Firmicutes and Actinobacteria [Protein fate, Protein and
           peptide secretion and trafficking].
          Length = 1296

 Score = 28.0 bits (63), Expect = 7.2
 Identities = 15/53 (28%), Positives = 25/53 (47%), Gaps = 12/53 (22%)

Query: 207 ALISSSISFWIPGFVMKKKKKRKKKKKKKKK-------KKRKK----KKNQHH 248
           A+   +I F    +  ++KKK KK  +K+ +       KKRK+     + Q H
Sbjct: 69  AMSLVTIIFSTTTY-FREKKKYKKDVEKRNRSYRLYLDKKRKELQALSEKQRH 120


>gnl|CDD|233223 TIGR00990, 3a0801s09, mitochondrial precursor proteins import
           receptor (72 kDa mitochondrial outermembrane protein)
           (mitochondrial import receptor for the ADP/ATP carrier)
           (translocase of outermembrane tom70).  [Transport and
           binding proteins, Amino acids, peptides and amines].
          Length = 615

 Score = 28.0 bits (62), Expect = 7.4
 Identities = 22/99 (22%), Positives = 39/99 (39%), Gaps = 8/99 (8%)

Query: 204 VYYALISSSISFWIPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEE 263
           VYY           P    K+++KRK+ +K+ + K  +KK                E +E
Sbjct: 53  VYYQRKGQQQRESKPKISKKERRKRKQAEKETEGKTEEKKSTAPKNAPVEPADELPEIDE 112

Query: 264 EKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQKKY 302
                  ++     ++KK   K K+K N+ +R    K +
Sbjct: 113 SSVANLSEE-----ERKKYAAKLKEKGNKAYR---NKDF 143


>gnl|CDD|223390 COG0313, COG0313, Predicted methyltransferases [General function
           prediction only].
          Length = 275

 Score = 27.6 bits (62), Expect = 7.4
 Identities = 11/37 (29%), Positives = 20/37 (54%), Gaps = 3/37 (8%)

Query: 207 ALISSSI---SFWIPGFVMKKKKKRKKKKKKKKKKKR 240
           AL +S +    F   GF+ +K K+R+K+ +    + R
Sbjct: 121 ALSASGLPSQRFLFEGFLPRKSKERRKRLEALANEPR 157


>gnl|CDD|241400 cd13246, PH_Scd1, Shape and Conjugation Deficiency 1 Pleckstrin
           homology (PH) domain.  Fission yeast Scd1 is an exchange
           factor for Cdc42 and an effector of Ras1, the homolog of
           the human H-Ras. Scd2/Bem1 mediates Cdc42 activation by
           binding to Scd1/Cdc24 and to Cdc42. Ras1 regulates
           Scd1/Cdc24/Ral1, which is a putative guanine nucleotide
           exchange factor for Cdc42, a member of the Rho family of
           Ras-like proteins. Cdc42 then activates the Shk1/Orb2
           protein kinase. Scd1 interacts with Klp5 and Klp6
           kinesins to mediate cytokinesis. PH domains have diverse
           functions, but in general are involved in targeting
           proteins to the appropriate cellular location or in the
           interaction with a binding partner. They share little
           sequence conservation, but all have a common fold, which
           is electrostatically polarized. Less than 10% of PH
           domains bind phosphoinositide phosphates (PIPs) with
           high affinity and specificity. PH domains are
           distinguished from other PIP-binding domains by their
           specific high-affinity binding to PIPs with two vicinal
           phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
           PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 148

 Score = 27.2 bits (61), Expect = 7.4
 Identities = 12/25 (48%), Positives = 15/25 (60%)

Query: 263 EEKKKKKKKKKKKTTKKKKRKKKRK 287
           E KKKKKK   K  +    +KKK+K
Sbjct: 62  EVKKKKKKSSLKSKSSSSSKKKKKK 86


>gnl|CDD|119394 cd06261, TM_PBP2, Transmembrane subunit (TM) found in Periplasmic
           Binding Protein (PBP)-dependent ATP-Binding Cassette
           (ABC) transporters which generally bind type 2 PBPs.
           These types of transporters consist of a PBP, two TMs,
           and two cytoplasmic ABC ATPase subunits, and are mainly
           involved in importing solutes from the environment. The
           solute is captured by the PBP which delivers it to a
           gated translocation pathway formed by the two TMs. The
           two ABCs bind and hydrolyze ATP and drive the transport
           reaction. For these transporters the ABCs and TMs are on
           independent polypeptide chains. These systems transport
           a diverse range of substrates. Most are specific for a
           single substrate or a group of related substrates;
           however some transporters are more promiscuous,
           transporting structurally diverse substrates such as the
           histidine/lysine and arginine transporter in
           Enterobacteriaceae. In the latter case, this is achieved
           through binding different PBPs with different
           specificities to the TMs. For other promiscuous
           transporters such as the multiple-sugar transporter Msm
           of Streptococcus mutans, the PBP has a wide substrate
           specificity. These transporters include the
           maltose-maltodextrin, phosphate and sulfate
           transporters, among others.
          Length = 190

 Score = 27.2 bits (61), Expect = 7.6
 Identities = 13/74 (17%), Positives = 23/74 (31%), Gaps = 3/74 (4%)

Query: 33  TIHIHIFPFIIKGMLMCFIIITAILGNLLVIISVI---KHRKLRIITNYFVVSLAFADLL 89
                I P  +  +L   ++  A       ++S +   +              L   DL 
Sbjct: 117 IFRRIILPLALPPILTGLVLAFARALGEFALVSFLGGGEAPGPGTGLLLIFAILFPGDLG 176

Query: 90  VALCVMPFNAIVSL 103
           VA  V     ++SL
Sbjct: 177 VAAAVALILLLLSL 190


>gnl|CDD|220400 pfam09785, Prp31_C, Prp31 C terminal domain.  This is the C
           terminal domain of the pre-mRNA processing factor Prp31.
           Prp31 is required for U4/U6.U5 tri-snRNP formation. In
           humans this protein has been linked to autosomal
           dominant retinitis pigmentosa.
          Length = 124

 Score = 26.9 bits (60), Expect = 7.6
 Identities = 21/90 (23%), Positives = 31/90 (34%), Gaps = 30/90 (33%)

Query: 223 KKKKKRKKKKKKKKKKK------RKKKKNQHHRDVWINRTSRGEEEEE------------ 264
           K KKKR  ++ +K K++      RK            NR   G+EEEE            
Sbjct: 14  KPKKKRGGRRFRKMKERFAMTELRKA----------QNRMEFGKEEEEVGYDFDEGVGLG 63

Query: 265 --KKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
                   + +      K + K  KK K +
Sbjct: 64  MLGSGGGGRIRLSQKDAKTKAKLSKKMKKR 93


>gnl|CDD|183377 PRK11910, PRK11910, amidase; Provisional.
          Length = 615

 Score = 28.1 bits (62), Expect = 7.6
 Identities = 19/74 (25%), Positives = 32/74 (43%), Gaps = 4/74 (5%)

Query: 221 VMKKKKKRKKKKKKKKKKKRKK----KKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKT 276
           V KKKK   KK+K K+          K+N H    + N+  + E   ++  +   K +  
Sbjct: 39  VSKKKKPTVKKEKPKQSSNNLTLGKNKENFHLEKGFGNKQLQVERIIDRIFQSSLKNRTE 98

Query: 277 TKKKKRKKKRKKKK 290
            K K +   +KK+ 
Sbjct: 99  IKVKPKNNPQKKQN 112



 Score = 27.7 bits (61), Expect = 8.4
 Identities = 13/35 (37%), Positives = 16/35 (45%)

Query: 256 TSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
           TS+     E K    KKKK T KK+K K+      
Sbjct: 26  TSQQTTTSENKSAVSKKKKPTVKKEKPKQSSNNLT 60


>gnl|CDD|150406 pfam09727, CortBP2, Cortactin-binding protein-2.  This entry is the
           first approximately 250 residues of cortactin-binding
           protein 2. In addition to being a positional candidate
           for autism this protein is expressed at highest levels
           in the brain in humans. The human protein has six
           associated ankyrin repeat domains pfam00023 towards the
           C-terminus which act as protein-protein interaction
           domains.
          Length = 193

 Score = 27.5 bits (61), Expect = 7.6
 Identities = 14/72 (19%), Positives = 35/72 (48%), Gaps = 4/72 (5%)

Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWIN----RTSRGEEEEEKKKKKKKKKKKTTKKKK 281
           K+++K   + +++KRK  +     D + N       R ++  E++K ++ KK++  +K  
Sbjct: 110 KRQRKTVLELEEEKRKHIRYMKKSDDFTNLLEQERERLKKLLEQEKSQQAKKEQEHRKLL 169

Query: 282 RKKKRKKKKNQH 293
              + +  K + 
Sbjct: 170 ATLEEELGKLKS 181


>gnl|CDD|180007 PRK05309, PRK05309, 30S ribosomal protein S11; Validated.
          Length = 128

 Score = 26.7 bits (60), Expect = 7.6
 Identities = 9/14 (64%), Positives = 10/14 (71%)

Query: 232 KKKKKKKKRKKKKN 245
            KKKK  K+K KKN
Sbjct: 1   AKKKKTVKKKVKKN 14


>gnl|CDD|219668 pfam07964, Red1, Rec10 / Red1.  Rec10 / Red1 is involved in meiotic
           recombination and chromosome segregation during
           homologous chromosome formation. This protein localises
           to the synaptonemal complex in S. cerevisiae and the
           analogous structures (linear elements) in S. pombe. This
           family is currently only found in fungi.
          Length = 706

 Score = 27.9 bits (62), Expect = 7.9
 Identities = 18/67 (26%), Positives = 28/67 (41%)

Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
            +K     K  K  K K  K++  RD+ +  T       ++ +KK  K K+T  K  +  
Sbjct: 421 SEKINNPVKVVKVSKYKGNKSEKKRDINVLDTIFASPVSKELRKKVGKSKQTKLKNFKPV 480

Query: 285 KRKKKKN 291
             K KK 
Sbjct: 481 PNKSKKQ 487


>gnl|CDD|218581 pfam05416, Peptidase_C37, Southampton virus-type processing
           peptidase.  Corresponds to Merops family C37.
           Norwalk-like viruses (NLVs), including the Southampton
           virus, cause acute non-bacterial gastroenteritis in
           humans. The NLV genome encodes three open reading frames
           (ORFs). ORF1 encodes a polyprotein, which is processed
           by the viral protease into six proteins.
          Length = 535

 Score = 27.9 bits (62), Expect = 7.9
 Identities = 18/57 (31%), Positives = 28/57 (49%), Gaps = 5/57 (8%)

Query: 219 GFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKK 275
           G+ ++ +    + KK K KK R KK N   R     R    EE +E KK ++++  K
Sbjct: 220 GWYIEPQDATPEGKKGKNKKGRGKKHNAFSR-----RGLSDEEYDEYKKIREERGGK 271


>gnl|CDD|215871 pfam00346, Complex1_49kDa, Respiratory-chain NADH dehydrogenase, 49
           Kd subunit. 
          Length = 272

 Score = 27.7 bits (62), Expect = 8.0
 Identities = 12/49 (24%), Positives = 23/49 (46%), Gaps = 8/49 (16%)

Query: 23  SNTSLFP------TNSTIHIHIFPFIIKGMLMCFIIITAILGNLLVIIS 65
           S+ S FP           H+ I   + +G L+  ++  AILG++ ++  
Sbjct: 222 SDGSTFPYRWKIRAPGFAHLQILDHLSRGHLLADLV--AILGSIDIVFG 268


>gnl|CDD|153342 cd07658, F-BAR_NOSTRIN, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Nitric Oxide Synthase
           TRaffic INducer (NOSTRIN).  F-BAR domains are
           dimerization modules that bind and bend membranes and
           are found in proteins involved in membrane dynamics and
           actin reorganization. Nitric Oxide Synthase TRaffic
           INducer (NOSTRIN) is expressed in endothelial and
           epithelial cells and is involved in the regulation,
           trafficking and targeting of endothelial NOS (eNOS).
           NOSTRIN facilitates the endocytosis of eNOS by
           coordinating the functions of dynamin and the
           Wiskott-Aldrich syndrome protein (WASP). Increased
           expression of NOSTRIN may be correlated to preeclampsia.
           NOSTRIN contains an N-terminal F-BAR domain and a
           C-terminal SH3 domain. F-BAR domains form banana-shaped
           dimers with a positively-charged concave surface that
           binds to negatively-charged lipid membranes. They can
           induce membrane deformation in the form of long tubules.
           The F-BAR domain of NOSTRIN is necessary and sufficient
           for its membrane association and is responsible for its
           subcellular localization.
          Length = 239

 Score = 27.3 bits (61), Expect = 8.2
 Identities = 20/74 (27%), Positives = 31/74 (41%), Gaps = 7/74 (9%)

Query: 230 KKKKKKKKKKRKKKKNQHH-RDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKK 288
           K KKK     R+ +K Q    D   + T       +K   K KK  +   K+  K + K+
Sbjct: 127 KVKKKLHGLARENEKLQDQVEDNKQSCTK------QKMLNKLKKSAEVQDKEDEKLEAKR 180

Query: 289 KKNQHHRDVWQKKY 302
           KK +  R   + +Y
Sbjct: 181 KKGEESRLKAENEY 194


>gnl|CDD|201461 pfam00829, Ribosomal_L21p, Ribosomal prokaryotic L21 protein. 
          Length = 96

 Score = 26.3 bits (59), Expect = 8.3
 Identities = 11/29 (37%), Positives = 15/29 (51%), Gaps = 2/29 (6%)

Query: 267 KKKKKKKKKTTKKKKRKKKRKKKKNQHHR 295
            +  + KK    K KR+K  +KK  Q HR
Sbjct: 64  LEHGRGKKVIVFKYKRRKNYRKK--QGHR 90


>gnl|CDD|225816 COG3277, GAR1, RNA-binding protein involved in rRNA processing
           [Translation, ribosomal structure and biogenesis].
          Length = 98

 Score = 26.2 bits (58), Expect = 8.4
 Identities = 10/25 (40%), Positives = 16/25 (64%)

Query: 220 FVMKKKKKRKKKKKKKKKKKRKKKK 244
           ++   K  RKK+K  +KK++ KK K
Sbjct: 74  YIPPDKLIRKKRKLPRKKRRPKKPK 98



 Score = 26.2 bits (58), Expect = 8.7
 Identities = 10/37 (27%), Positives = 16/37 (43%), Gaps = 2/37 (5%)

Query: 253 INRTSRGEEEE--EKKKKKKKKKKKTTKKKKRKKKRK 287
             +      +       K  +KK+K  +KK+R KK K
Sbjct: 62  DVKLESLVGDTLYIPPDKLIRKKRKLPRKKRRPKKPK 98


>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
           [General function prediction only].
          Length = 239

 Score = 27.3 bits (61), Expect = 8.4
 Identities = 13/69 (18%), Positives = 29/69 (42%), Gaps = 6/69 (8%)

Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHR--DVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
            +K R + + K+ +K  KK K +               E E E  + +  + +   ++ +
Sbjct: 17  LEKDRLEPRIKEIRKALKKAKAELEALNKALEAL----EIELEDLENQVSQLESEIQEIR 72

Query: 282 RKKKRKKKK 290
            + KR ++K
Sbjct: 73  ERIKRAEEK 81


>gnl|CDD|216205 pfam00937, Corona_nucleoca, Coronavirus nucleocapsid protein. 
          Length = 346

 Score = 27.7 bits (62), Expect = 8.5
 Identities = 9/28 (32%), Positives = 13/28 (46%)

Query: 266 KKKKKKKKKKTTKKKKRKKKRKKKKNQH 293
           K+K   KK     KK   +  KK+ N+ 
Sbjct: 206 KQKSSSKKPSRVTKKSAAEAAKKQLNKP 233


>gnl|CDD|223334 COG0256, RplR, Ribosomal protein L18 [Translation, ribosomal
           structure and biogenesis].
          Length = 125

 Score = 26.8 bits (60), Expect = 8.6
 Identities = 7/21 (33%), Positives = 13/21 (61%)

Query: 222 MKKKKKRKKKKKKKKKKKRKK 242
             K K R++++ K+  + RKK
Sbjct: 2   RYKVKFRRRRRGKRAYRIRKK 22


>gnl|CDD|234533 TIGR04285, nucleoid_noc, nucleoid occlusion protein.  This model
           describes nucleoid occlusion protein, a close homolog to
           ParB chromosome partitioning proteins including Spo0J in
           Bacillus subtilis. Its gene often is located near the
           gene for the Spo0J ortholog. This protein bind a
           specific DNA sequence and blocks cytokinesis from
           happening until chromosome segregation is complete.
          Length = 255

 Score = 27.5 bits (62), Expect = 8.6
 Identities = 11/19 (57%), Positives = 13/19 (68%)

Query: 224 KKKKRKKKKKKKKKKKRKK 242
           KK   K +K KKKKK+RK 
Sbjct: 192 KKLLEKPEKPKKKKKRRKG 210


>gnl|CDD|224596 COG1682, TagG, ABC-type polysaccharide/polyol phosphate export
           systems, permease component [Carbohydrate transport and
           metabolism / Cell envelope biogenesis, outer membrane].
          Length = 263

 Score = 27.5 bits (62), Expect = 8.7
 Identities = 13/83 (15%), Positives = 36/83 (43%), Gaps = 5/83 (6%)

Query: 36  IHIFPFIIKGMLMCFIIITAILGNLLVII---SVIKHRKLR--IITNYFVVSLAFADLLV 90
           ++   +++ G+++ F    AI      ++   ++IK       I+     +S  F  L+ 
Sbjct: 66  LNFLAYLLAGLILWFFFSEAISEGAGSVVANAALIKKINFPPLILPVARTLSRLFNFLIH 125

Query: 91  ALCVMPFNAIVSLTDEWYFGYFM 113
            + ++ F  I+ +   W++   +
Sbjct: 126 LIIILIFLIILGVEPSWHWLLLL 148


>gnl|CDD|225381 COG2825, HlpA, Outer membrane protein [Cell envelope biogenesis,
           outer membrane].
          Length = 170

 Score = 27.0 bits (60), Expect = 9.0
 Identities = 16/74 (21%), Positives = 40/74 (54%), Gaps = 3/74 (4%)

Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVW---INRTSRGEEEEEKKKKKKKKKKKTTK 278
           ++ + K+++K+ +K +K+ K K+ +   D     ++  ++ E E +K+K      KK  +
Sbjct: 49  LESEFKKRQKELQKMQKELKAKEAKLQDDGKMEALSDRAKAEAEIKKEKLVNAFNKKQQE 108

Query: 279 KKKRKKKRKKKKNQ 292
            +K   +R+ ++ Q
Sbjct: 109 YEKDLNRREAEEEQ 122


>gnl|CDD|227360 COG5027, SAS2, Histone acetyltransferase (MYST family) [Chromatin
           structure and dynamics].
          Length = 395

 Score = 27.4 bits (61), Expect = 9.1
 Identities = 17/91 (18%), Positives = 36/91 (39%), Gaps = 3/91 (3%)

Query: 204 VYYALISSSISFWIPGFVMKKKKKR---KKKKKKKKKKKRKKKKNQHHRDVWINRTSRGE 260
           V+Y  ++  +  WI   ++         K+KK+ +K KK KK K     D+         
Sbjct: 39  VHYVELNRRLDEWITADLINLGAAISIPKRKKQTEKGKKEKKPKVSDRMDLDNENVQLEM 98

Query: 261 EEEEKKKKKKKKKKKTTKKKKRKKKRKKKKN 291
                 +++ ++ +    K +   +  + KN
Sbjct: 99  LYSISNEREIRQLRFGGSKVQNPHEGARVKN 129


>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1.  All
           proteins in this family for which functions are known
           are cyclin dependent protein kinases that are components
           of TFIIH, a complex that is involved in nucleotide
           excision repair and transcription initiation. Also known
           as MAT1 (menage a trois 1). This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 309

 Score = 27.5 bits (61), Expect = 9.1
 Identities = 16/63 (25%), Positives = 34/63 (53%)

Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
           KKK    +K+ K   ++ K+K+   ++         +EEEE+++   +K+++  +  KRK
Sbjct: 119 KKKIETYQKENKDVIQKNKEKSTREQEELEEALEFEKEEEEQRRLLLQKEEEEQQMNKRK 178

Query: 284 KKR 286
            K+
Sbjct: 179 NKQ 181


>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6.  RPL6
           contains KOW motif that has an extra ribosomal role as
           an oncogenic. KOW domain is known as an RNA-binding
           motif that is shared so far among some families of
           ribosomal proteins, the essential bacterial
           transcriptional elongation factor NusG, the eukaryotic
           chromatin elongation factor Spt5, the higher eukaryotic
           KIN17 proteins and Mtr4. .
          Length = 152

 Score = 26.7 bits (60), Expect = 9.4
 Identities = 16/33 (48%), Positives = 21/33 (63%), Gaps = 5/33 (15%)

Query: 265 KKKKKKKKKKKT-----TKKKKRKKKRKKKKNQ 292
           K+KKKKKKKKK       KKKK     ++K++Q
Sbjct: 82  KRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQ 114


>gnl|CDD|215343 PLN02638, PLN02638, cellulose synthase A (UDP-forming), catalytic
           subunit.
          Length = 1079

 Score = 27.6 bits (61), Expect = 9.6
 Identities = 13/33 (39%), Positives = 16/33 (48%)

Query: 218 PGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRD 250
           PGF+       +KK  K  KK   KKK+  H D
Sbjct: 652 PGFLSSLCGGSRKKSSKSSKKGSDKKKSGKHVD 684


>gnl|CDD|225058 COG2147, RPL19A, Ribosomal protein L19E [Translation, ribosomal
           structure and biogenesis].
          Length = 150

 Score = 26.9 bits (60), Expect = 9.6
 Identities = 20/74 (27%), Positives = 32/74 (43%), Gaps = 2/74 (2%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKN--QHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
           +K+  +KKK +++    RK  K      ++ WI R      E  K +   K  K T +K 
Sbjct: 64  RKRHAQKKKGRRRGPGSRKGTKGARMPSKERWIKRIRALRRELRKLRDDGKIDKHTYRKL 123

Query: 281 KRKKKRKKKKNQHH 294
            R  K    K++ H
Sbjct: 124 YRMAKGGAFKSKSH 137


>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain.  This
           family represents the C-terminus (approximately 300
           residues) of proteins that are involved as binding
           partners for Prp19 as part of the nuclear pore complex.
           The family in Drosophila is necessary for pre-mRNA
           splicing, and the human protein has been found in
           purifications of the spliceosome. In the past this
           family was thought, erroneously, to be associated with
           microfibrillin.
          Length = 277

 Score = 27.2 bits (60), Expect = 9.6
 Identities = 17/65 (26%), Positives = 34/65 (52%), Gaps = 1/65 (1%)

Query: 229 KKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKK 288
           K ++ K+ K+ R++++        I +  R   EEE++ + +K  K +T K K+ K +  
Sbjct: 121 KLRELKRIKRDREEREEMEREKAEIEKM-RNMTEEERRAELRKNPKVSTNKAKKGKYKFL 179

Query: 289 KKNQH 293
           +K  H
Sbjct: 180 QKYYH 184


>gnl|CDD|220417 pfam09812, MRP-L28, Mitochondrial ribosomal protein L28.  Members
           of this family are components of the mitochondrial large
           ribosomal subunit. Mature mitochondrial ribosomes
           consist of a small (37S) and a large (54S) subunit. The
           37S subunit contains at least 33 different proteins and
           1 molecule of RNA (15S). The 54S subunit contains at
           least 45 different proteins and 1 molecule of RNA (21S).
          Length = 146

 Score = 26.6 bits (59), Expect = 9.7
 Identities = 12/32 (37%), Positives = 17/32 (53%), Gaps = 7/32 (21%)

Query: 222 MKKKKK-------RKKKKKKKKKKKRKKKKNQ 246
           +KKKKK       RK+K+ +K  +KR K    
Sbjct: 6   LKKKKKVDPQREQRKRKRLEKFIRKRTKSARG 37


>gnl|CDD|215124 PLN02195, PLN02195, cellulose synthase A.
          Length = 977

 Score = 27.6 bits (61), Expect = 9.8
 Identities = 13/30 (43%), Positives = 16/30 (53%), Gaps = 3/30 (10%)

Query: 251 VWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
           +W NR    E  ++KK KKKK  KK    K
Sbjct: 113 IWKNRV---ESWKDKKNKKKKSAKKKEAHK 139


>gnl|CDD|221012 pfam11169, DUF2956, Protein of unknown function (DUF2956).  This
           family of proteins with unknown function appears to be
           restricted to Gammaproteobacteria.
          Length = 103

 Score = 26.1 bits (58), Expect = 9.9
 Identities = 8/26 (30%), Positives = 16/26 (61%)

Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHH 248
           KK++K K ++  K +K++ K K +  
Sbjct: 40  KKQQKAKAREADKARKQQLKAKQRQA 65


>gnl|CDD|192718 pfam11176, DUF2962, Protein of unknown function (DUF2962).  This
           eukaryotic family of proteins has no known function.
          Length = 155

 Score = 26.8 bits (60), Expect = 10.0
 Identities = 11/42 (26%), Positives = 14/42 (33%), Gaps = 11/42 (26%)

Query: 265 KKKKKKKKKKKT-----------TKKKKRKKKRKKKKNQHHR 295
           K  K   KKKK             +   R  K +KKK +   
Sbjct: 1   KVTKNIAKKKKVLHPKSRKAKQLNRATLRDDKLEKKKAERAE 42


>gnl|CDD|224023 COG1098, VacB, Predicted RNA binding protein (contains ribosomal
           protein S1 domain) [Translation, ribosomal structure and
           biogenesis].
          Length = 129

 Score = 26.5 bits (59), Expect = 10.0
 Identities = 13/62 (20%), Positives = 29/62 (46%), Gaps = 6/62 (9%)

Query: 230 KKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKK 289
           +K +++ +K+ +K +         +R   G E    +  K  ++  +T K++ KKKR  +
Sbjct: 74  RKLEEEPEKQHRKPRFSK------SRPKEGFETLLSRLLKWIEESLSTLKRQTKKKRGGR 127

Query: 290 KN 291
             
Sbjct: 128 GA 129


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.324    0.136    0.424 

Gapped
Lambda     K      H
   0.267   0.0677    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 15,607,133
Number of extensions: 1581873
Number of successful extensions: 23841
Number of sequences better than 10.0: 1
Number of HSP's gapped: 14804
Number of HSP's successfully gapped: 1652
Length of query: 303
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 207
Effective length of database: 6,679,618
Effective search space: 1382680926
Effective search space used: 1382680926
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 59 (26.6 bits)