RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy16250
         (485 letters)



>gnl|CDD|224155 COG1234, ElaC, Metal-dependent hydrolases of the beta-lactamase
           superfamily III [General function prediction only].
          Length = 292

 Score =  165 bits (420), Expect = 2e-47
 Identities = 83/289 (28%), Positives = 117/289 (40%), Gaps = 30/289 (10%)

Query: 114 SGFPLLDKGKQVLAIRGPGPDTCILLDCGEGTYSQLVRLYGSAVDTLLSQLSAVYISHLH 173
              P  D+    + +R  G     L DCGEGT  QL+R           ++ A++I+HLH
Sbjct: 11  GAVPTKDRNVSSILLRLEGEK--FLFDCGEGTQHQLLRAGLP-----PRKIDAIFITHLH 63

Query: 174 ADHHLGLFSVIKAWSRVKPECKLTLLAPRQIITW------LSVYAARFESVGHLYRLVPL 227
            DH  GL  ++ + S       L +  P  I  +      LS     +E +GH       
Sbjct: 64  GDHIAGLPGLLVSRSFRGRREPLKIYGPPGIKEFVETSLRLSYSKLTYEIIGHEIEEDAF 123

Query: 228 SLFNTKGLIEGTEQHG----QNRPALDPDTVQILSSLGLESMTTCLVRHCPN------AF 277
                  L  G    G    +       D  ++        +T     H         A 
Sbjct: 124 -EVEALELDHGVPALGYRIEEPDRPGRFDAEKLKGLPPGPLITALKAGHPVEERVITPAD 182

Query: 278 GVTMVTKSGHKITYSGDTMPCDALVSIGKNSDLLIHEATHEDELEKEAALKMHSTVSQAI 337
            +    + G  + YSGDT PCD L+ + K +DLLIHEAT ED+LE  A    HST  +A 
Sbjct: 183 RIGE-PRKGKSVVYSGDTRPCDELIDLAKGADLLIHEATFEDDLEDLANEGGHSTAEEAA 241

Query: 338 RIGREMRAKFVLLTHFSQRYAK-LPRLNKDLSE----NVGIAFDNMRFP 381
            I +E   K ++LTHFS RY K    L K+          +A D + F 
Sbjct: 242 EIAKEAGVKKLILTHFSPRYPKDDEELLKEARAIFPGETIVARDGLVFE 290


>gnl|CDD|234602 PRK00055, PRK00055, ribonuclease Z; Reviewed.
          Length = 270

 Score =  131 bits (332), Expect = 4e-35
 Identities = 76/264 (28%), Positives = 109/264 (41%), Gaps = 49/264 (18%)

Query: 137 ILLDCGEGTYSQLVRLYGSAVDTLLSQLSAVYISHLHADHHLGLFSVIK---AWSRVKPE 193
            L DCGEGT  QL++           ++  ++I+HLH DH  GL  ++       R +P 
Sbjct: 32  FLFDCGEGTQRQLLKTGIKP-----RKIDKIFITHLHGDHIFGLPGLLSTRSLSGRTEP- 85

Query: 194 CKLTLLAPRQIITWLSVYAARFESVGHLYRLV--------PLSLFNTKGLIEGTE----Q 241
             LT+  P+ I  ++        S+G  YR+                 G+  G      +
Sbjct: 86  --LTIYGPKGIKEFVETLLRASGSLG--YRIAEKDKPGKLDAEKLKALGVPPGPLFGKLK 141

Query: 242 HGQNRPALDPDTVQILSSLGLESMTTCLVRHCPNAFGVTMVTKSGHKITYSGDTMPCDAL 301
            G++    D   +                        V    + G K+ Y GDT PC+AL
Sbjct: 142 RGEDVTLEDGRIINPAD--------------------VLGPPRKGRKVAYCGDTRPCEAL 181

Query: 302 VSIGKNSDLLIHEATHEDELEKEAALKMHSTVSQAIRIGREMRAKFVLLTHFSQRYAKLP 361
           V + K +DLL+HEAT  DE E+ A    HST  QA  I +E   K ++LTHFS RY   P
Sbjct: 182 VELAKGADLLVHEATFGDEDEELAKEYGHSTARQAAEIAKEAGVKRLILTHFSPRYTGDP 241

Query: 362 RLNKDLS----ENVGIAFDNMRFP 381
                 +     N  +A D MR  
Sbjct: 242 EELLKEAREIFPNTELAEDLMRVE 265


>gnl|CDD|221727 pfam12706, Lactamase_B_2, Beta-lactamase superfamily domain.  This
           family is part of the beta-lactamase superfamily and is
           related to pfam00753.
          Length = 197

 Score =  116 bits (292), Expect = 3e-30
 Identities = 56/222 (25%), Positives = 90/222 (40%), Gaps = 33/222 (14%)

Query: 137 ILLDCGEGTYSQLVRLYGSAVDTLLSQLSAVYISHLHADHHLGLFSVIKAWSRVKPECKL 196
           IL+D G GT     +    A   LL ++ AV +SH HADH LGL ++ +   R+    +L
Sbjct: 3   ILIDPGPGTRDLRQQPL--AAPALLPRIDAVLLSHAHADHILGLLTLRELIKRLPK--RL 58

Query: 197 TLLAPRQIITWLSVYAARFESVGHLYRLVPLSLFNTKGLIEGTEQHGQNRPALDPDTVQI 256
            +  P  +   L         +  ++                          +D      
Sbjct: 59  PVYGPPGVAEDLRAPIFGVLPLFPVFD--------------------IEIFEIDIGEDVE 98

Query: 257 LSSLGLESMTTCLVRHCPNAFGVT---MVTKSGHKITYSGDTMPCDALVSIGKNSDLLIH 313
           +  L +   T   V+H    + +     +   G KI YSGDT     L++  K +DLLI 
Sbjct: 99  VGGLTI---TAFPVQHGSGRYLLREGFRIETPGKKIYYSGDTGYIPDLLAEIKGADLLIL 155

Query: 314 EATHEDELE---KEAALKMHSTVSQAIRIGREMRAKFVLLTH 352
           +AT  D+ E   K A    H T  +A+   +++ AK ++L H
Sbjct: 156 DATFYDDDEEPRKTARRMGHMTPEEALEAAKDLGAKKLILIH 197


>gnl|CDD|233963 TIGR02651, RNase_Z, ribonuclease Z.  Processing of the 3-prime end
           of tRNA precursors may be the result of endonuclease or
           exonuclease activity, and differs in different species.
           Member of this family are ribonuclease Z, a tRNA 3-prime
           endonuclease that processes tRNAs to prepare for
           addition of CCA. In species where all tRNA sequences
           already have the CCA tail, such as E. coli, the need for
           such an enzyme is unclear. Protein similar to the E.
           coli enzyme, matched by TIGR02649, are designated
           ribonuclease BN [Transcription, RNA processing].
          Length = 299

 Score =  108 bits (271), Expect = 3e-26
 Identities = 43/102 (42%), Positives = 57/102 (55%), Gaps = 3/102 (2%)

Query: 283 TKSGHKITYSGDTMPCDALVSIGKNSDLLIHEATHEDELEKEAALKMHSTVSQAIRIGRE 342
            + G KI Y+GDT PC+ ++   KN+DLLIHEAT  DE +K A    HST +QA  I +E
Sbjct: 197 PRKGRKIAYTGDTRPCEEVIEFAKNADLLIHEATFLDEDKKLAKEYGHSTAAQAAEIAKE 256

Query: 343 MRAKFVLLTHFSQRYAKLPRLNKDLS---ENVGIAFDNMRFP 381
              K ++LTH S RY+    L ++      N  IA D M   
Sbjct: 257 ANVKRLILTHISPRYSDEEELLEEAKKIFPNTYIAEDFMEIE 298



 Score = 54.1 bits (131), Expect = 5e-08
 Identities = 25/72 (34%), Positives = 36/72 (50%), Gaps = 11/72 (15%)

Query: 137 ILLDCGEGTYSQLVRLYGSAVDTLLSQLSAVYISHLHADHHLGLFSVIKAWS---RVKPE 193
            L DCGEGT  Q++R           ++  ++I+HLH DH LGL  ++   S   R +P 
Sbjct: 30  WLFDCGEGTQRQMLRS-----GISPMKIDRIFITHLHGDHILGLPGLLSTMSFQGRKEP- 83

Query: 194 CKLTLLAPRQII 205
             LT+  P  I 
Sbjct: 84  --LTIYGPPGIK 93


>gnl|CDD|131697 TIGR02649, true_RNase_BN, ribonuclease BN.  Members of this protein
           family are ribonuclease BN of Escherichia coli K-12 and
           closely related proteins believed to be equivalent in
           function. Note that E. coli appears to lack RNase Z per
           se, and this protein of E. coli appears orthologous to
           (but not functionally equivalent to) RNase Z of Bacillus
           subtilis and various other species. Meanwhile, the yihY
           gene product of E. coli previously was incorrectly
           identified as RNase BN [Transcription, RNA processing].
          Length = 303

 Score = 72.7 bits (178), Expect = 4e-14
 Identities = 68/259 (26%), Positives = 105/259 (40%), Gaps = 32/259 (12%)

Query: 125 VLAIRGPGPDTCILLDCGEGTYSQLVRLYGSAVDTLLSQLSAVYISHLHADHHLGLFSVI 184
           +L ++ P      L DCGEGT  QL+    +A +    +L  ++ISHLH DH  GL  ++
Sbjct: 21  LLNLQHPTQSGLWLFDCGEGTQHQLLH---TAFNP--GKLDKIFISHLHGDHLFGLPGLL 75

Query: 185 KAWSRVKPECKLTLLAPRQIITWLSV----------YAARFESVGH------------LY 222
            + S       LT+  P+ I  ++            Y      +G              Y
Sbjct: 76  CSRSMSGIIQPLTIYGPQGIREFVETALRISGSWTDYPLEIVEIGAGEILDDGLRKVTAY 135

Query: 223 RLV-PLSLFNTKGLIEGTEQHGQNRPALDPDTVQ---ILSSLGLESMTTCLVRHCPNAFG 278
            L  PL  +  + + E  +    N  AL    V    +   L      T       N   
Sbjct: 136 PLEHPLECYGYR-IEEHDKPGALNAQALKAAGVPPGPLFQELKAGKTITLEDGRQINGAD 194

Query: 279 VTMVTKSGHKITYSGDTMPCDALVSIGKNSDLLIHEATHEDELEKEAALKMHSTVSQAIR 338
                  G  +   GDT PCDA + + K  D+++HEAT +  +E +A  + HS+  QA  
Sbjct: 195 YLAAPVPGKALAIFGDTGPCDAALDLAKGVDVMVHEATLDITMEAKANSRGHSSTRQAAT 254

Query: 339 IGREMRAKFVLLTHFSQRY 357
           + RE     +++TH S RY
Sbjct: 255 LAREAGVGKLIITHVSSRY 273


>gnl|CDD|235006 PRK02126, PRK02126, ribonuclease Z; Provisional.
          Length = 334

 Score = 58.0 bits (141), Expect = 3e-09
 Identities = 29/82 (35%), Positives = 35/82 (42%), Gaps = 4/82 (4%)

Query: 286 GHKITYSGDTMP----CDALVSIGKNSDLLIHEATHEDELEKEAALKMHSTVSQAIRIGR 341
           G KI Y  D          +V +    DLL  EA   DE  ++A  K H T  QA R+ R
Sbjct: 242 GQKIGYVTDIGYTEENLARIVELAAGVDLLFIEAVFLDEDAEKARRKNHLTARQAGRLAR 301

Query: 342 EMRAKFVLLTHFSQRYAKLPRL 363
           E   K +L  HFS RY      
Sbjct: 302 EAGVKRLLPFHFSPRYQGRGAE 323



 Score = 34.1 bits (79), Expect = 0.12
 Identities = 14/50 (28%), Positives = 23/50 (46%), Gaps = 9/50 (18%)

Query: 137 ILLDCGEGTYSQLVRLYGSAVDTLLSQLSAVYISHLHADHHLGLFSVIKA 186
           +L D G      L  L       LL  +S +++SH H DH +G   +++ 
Sbjct: 30  LLFDLG-----DLHHL---PPRELLR-ISHIFVSHTHMDHFIGFDRLLRH 70


>gnl|CDD|216099 pfam00753, Lactamase_B, Metallo-beta-lactamase superfamily. 
          Length = 148

 Score = 43.9 bits (103), Expect = 3e-05
 Identities = 18/72 (25%), Positives = 29/72 (40%), Gaps = 2/72 (2%)

Query: 132 GPDTCILLDCGEGTYSQLVRLYGSAVDTLLSQLSAVYISHLHADHHLGLFSVIKAWSRVK 191
           G    +L+D G G    L+ L    +D     + A+ ++H HADH  GL  + +A     
Sbjct: 13  GDGGAVLIDTGLGADDALLLLALLGLD--PKDIDAIILTHAHADHIGGLPELKEATGAPV 70

Query: 192 PECKLTLLAPRQ 203
                   A  +
Sbjct: 71  VAAPEDAAALLR 82


>gnl|CDD|214854 smart00849, Lactamase_B, Metallo-beta-lactamase superfamily.  Apart
           from the beta-lactamases a number of other proteins
           contain this domain. These proteins include
           thiolesterases, members of the glyoxalase II family,
           that catalyse the hydrolysis of S-D-lactoyl-glutathione
           to form glutathione and D-lactic acid and a competence
           protein that is essential for natural transformation in
           Neisseria gonorrhoeae and could be a transporter
           involved in DNA uptake. Except for the competence
           protein these proteins bind two zinc ions per molecule
           as cofactor.
          Length = 177

 Score = 42.9 bits (101), Expect = 1e-04
 Identities = 36/197 (18%), Positives = 63/197 (31%), Gaps = 37/197 (18%)

Query: 132 GPDTCILLDCGEGTYSQLVRLYGSAVDTLLSQLSAVYISHLHADHHLGLFSVIKAWSRVK 191
                IL+D G G    L+            ++ A+ ++H H DH  GL  +++A     
Sbjct: 7   DDGGAILIDTGPGEAEDLLAELKKL---GPKKIDAIILTHGHPDHIGGLPELLEAP---- 59

Query: 192 PECKLTLLAPRQIITWLSVYAARFESVGHLYRLVPLSLFNTKGLIEGTEQHGQNRPALDP 251
                 + AP      L    A    +G      P      + L +G E        L  
Sbjct: 60  ---GAPVYAPEGTAELLKDLLALLGELGAEAEPAP----PDRTLKDGDELD------LGG 106

Query: 252 DTVQILSSLGLESMTTCLVRHCPNAFGVTMVTKSGHKITYSGDTMPCDALVSIGKNSDLL 311
             ++++ + G          H P +  + +      KI ++GD +             + 
Sbjct: 107 GELEVIHTPG----------HTPGSIVLYL---PEGKILFTGDLLFAGGDGRT----LVD 149

Query: 312 IHEATHEDELEKEAALK 328
             +A   D LE    L 
Sbjct: 150 GGDAAASDALESLLKLL 166


>gnl|CDD|224156 COG1235, PhnP, Metal-dependent hydrolases of the beta-lactamase
           superfamily I [General function prediction only].
          Length = 269

 Score = 42.4 bits (100), Expect = 2e-04
 Identities = 50/254 (19%), Positives = 89/254 (35%), Gaps = 39/254 (15%)

Query: 137 ILLDCGEGTYSQLVRLYGSAVDTLLSQLSAVYISHLHADHHLGLFSVIKAWSRVKPECKL 196
           +L+D G     Q +RL  S +D       A+ ++H H+DH  GL  + +A         L
Sbjct: 43  LLIDAGPDLRDQGLRLGVSDLD-------AILLTHEHSDHIQGLDDLRRA-------YTL 88

Query: 197 TLLAPRQIITWLSVYAARFESVGHLYRLVPLSLFNTKGLIEGTEQH-GQNRPALDPDTVQ 255
            +       T  +  + R           P   F+   +             A++P    
Sbjct: 89  PI--YVNPGTLRASTSDRLLGGFPYLFRHPFPPFSLPAIGGLEVTPFPVPHDAIEPVGFV 146

Query: 256 ILSSLGLESMTTCLVRHCPNAFGVTMVTKSGHKITYSGDTM--PCDALVSIGKNS--DLL 311
           I+ +       T +       +G   +      + Y  DT   P +  V +  N    L 
Sbjct: 147 IIRTGRKLHGGTDI------GYG---LEWRIGDVAYLTDTELFPSNHDVELLDNGLYPLD 197

Query: 312 IHEATHEDELEKEAALKMHSTVSQAIRIGREMRAKFVLLTHFSQR--YAKLPRLNKDLSE 369
           I +    D       L  H +  +A+ +  +++ K ++LTH S +    +LP L    +E
Sbjct: 198 IKDRILPD----PGHLSNHLSAEEALELIEKLKPKRLVLTHLSHKNDDEELPEL---EAE 250

Query: 370 NVGIAFDNMRFPEK 383
            V  A+  M     
Sbjct: 251 EVEPAYGGMGIDVG 264


>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
           complex aNOP56 subunit; Provisional.
          Length = 414

 Score = 41.9 bits (99), Expect = 6e-04
 Identities = 18/24 (75%), Positives = 21/24 (87%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
            EKK +K+KKKKKRKKK KK+KKK
Sbjct: 388 EEKKPQKRKKKKKRKKKGKKRKKK 411



 Score = 41.5 bits (98), Expect = 7e-04
 Identities = 16/24 (66%), Positives = 21/24 (87%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
           P KKK+++KK +KRKKKKK+KKK 
Sbjct: 382 PPKKKREEKKPQKRKKKKKRKKKG 405



 Score = 41.5 bits (98), Expect = 8e-04
 Identities = 14/26 (53%), Positives = 23/26 (88%)

Query: 379 RFPEKKKKKKKKKKKRKKKKKKKKKK 404
           ++P+  KKK+++KK +K+KKKKK+KK
Sbjct: 378 KYPKPPKKKREEKKPQKRKKKKKRKK 403



 Score = 40.7 bits (96), Expect = 0.001
 Identities = 14/24 (58%), Positives = 20/24 (83%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
             +K+KKKKK+KK+ KK+KKK +K
Sbjct: 391 KPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 40.7 bits (96), Expect = 0.001
 Identities = 14/24 (58%), Positives = 22/24 (91%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
           P+KK+++KK +K++KKKK+KKK K
Sbjct: 383 PKKKREEKKPQKRKKKKKRKKKGK 406



 Score = 39.6 bits (93), Expect = 0.003
 Identities = 15/26 (57%), Positives = 21/26 (80%)

Query: 379 RFPEKKKKKKKKKKKRKKKKKKKKKK 404
           R  +K +K+KKKKK++KK KK+KKK 
Sbjct: 387 REEKKPQKRKKKKKRKKKGKKRKKKG 412



 Score = 39.6 bits (93), Expect = 0.003
 Identities = 15/26 (57%), Positives = 22/26 (84%)

Query: 379 RFPEKKKKKKKKKKKRKKKKKKKKKK 404
           +  E+KK +K+KKKK++KKK KK+KK
Sbjct: 385 KKREEKKPQKRKKKKKRKKKGKKRKK 410



 Score = 38.0 bits (89), Expect = 0.009
 Identities = 13/24 (54%), Positives = 21/24 (87%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
            +K+++KK +K+K+KKK+KKK KK
Sbjct: 384 KKKREEKKPQKRKKKKKRKKKGKK 407



 Score = 36.9 bits (86), Expect = 0.023
 Identities = 13/24 (54%), Positives = 20/24 (83%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
            + +K+KKKKK+K+K KK+KKK +
Sbjct: 390 KKPQKRKKKKKRKKKGKKRKKKGR 413



 Score = 35.7 bits (83), Expect = 0.044
 Identities = 13/24 (54%), Positives = 20/24 (83%)

Query: 379 RFPEKKKKKKKKKKKRKKKKKKKK 402
           +  ++KKKKK+KKK +K+KKK +K
Sbjct: 391 KPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 32.2 bits (74), Expect = 0.59
 Identities = 13/23 (56%), Positives = 17/23 (73%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
           EK  K  KKK++ KK +K+KKKK
Sbjct: 377 EKYPKPPKKKREEKKPQKRKKKK 399



 Score = 31.1 bits (71), Expect = 1.5
 Identities = 12/22 (54%), Positives = 17/22 (77%)

Query: 383 KKKKKKKKKKKRKKKKKKKKKK 404
           K+K  K  KKKR++KK +K+KK
Sbjct: 376 KEKYPKPPKKKREEKKPQKRKK 397


>gnl|CDD|220231 pfam09420, Nop16, Ribosome biogenesis protein Nop16.  Nop16 is a
           protein involved in ribosome biogenesis.
          Length = 173

 Score = 40.1 bits (94), Expect = 7e-04
 Identities = 25/99 (25%), Positives = 39/99 (39%), Gaps = 3/99 (3%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKKISLANLPKLKLFYPAL 441
            +KKKK +    +  +K+ K+K     I I   P I + +D KK    N  +L L   A 
Sbjct: 1   VRKKKKNRSSNYKVNRKRLKRKDRKKKINIIGNPIIAKNWDKKKTLAQNYKRLGL---AA 57

Query: 442 KAMFAEYQDEIENRAIRRNLKQESTYLARYPSENDKEKP 480
               A   D +        LK+    + R    N  ++P
Sbjct: 58  DPNTAARTDLLIQDKPIPELKKGEARIERDDKGNPLDEP 96


>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
           (DUF2058).  This domain, found in various prokaryotic
           proteins, has no known function.
          Length = 177

 Score = 39.9 bits (94), Expect = 0.001
 Identities = 22/82 (26%), Positives = 34/82 (41%), Gaps = 15/82 (18%)

Query: 383 KKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKKISLANLPKLKLFYPALK 442
           KKK KK KK+KRK++K+ +K   +               D  K +       K      +
Sbjct: 14  KKKAKKAKKEKRKQRKQARKGADDGD-------------DELKQAAEEAKAEKA--ERDR 58

Query: 443 AMFAEYQDEIENRAIRRNLKQE 464
            +  + Q E E +AI   +KQ 
Sbjct: 59  ELNRQRQAEAEQKAIVAQIKQL 80


>gnl|CDD|217502 pfam03343, SART-1, SART-1 family.  SART-1 is a protein involved in
           cell cycle arrest and pre-mRNA splicing. It has been
           shown to be a component of U4/U6 x U5 tri-snRNP complex
           in human, Schizosaccharomyces pombe and Saccharomyces
           cerevisiae. SART-1 is a known tumour antigen in a range
           of cancers recognised by T cells.
          Length = 603

 Score = 40.1 bits (94), Expect = 0.002
 Identities = 30/140 (21%), Positives = 47/140 (33%), Gaps = 13/140 (9%)

Query: 358 AKLPRLNKDLSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKK---------VNNS 408
            K+  ++ D       A D     E  K KK KKKK+KKKK++K               S
Sbjct: 247 LKINNVSLDDDSTETPASDYYDVSEMVKFKKPKKKKKKKKKRRKDLDEDELEPEAEGLGS 306

Query: 409 IQISSRPEICREFDSKKISLANLPKLKLFYPALKAMFAEYQDEIE---NRAIRRNLKQES 465
               SR ++  E    + S     K +           + Q  +      A ++  K   
Sbjct: 307 SDSGSRKDVEEENARLEDS-PKKRKEEQEDDDFVEDDDDLQASLAKQRRLAQKKRKKLRP 365

Query: 466 TYLARYPSENDKEKPGSLDD 485
             +AR  +E   E      +
Sbjct: 366 EDIARQIAEERSEDEEEAAE 385



 Score = 29.0 bits (65), Expect = 7.6
 Identities = 16/93 (17%), Positives = 35/93 (37%), Gaps = 9/93 (9%)

Query: 319 DELEKEAALKMHSTVSQAIRIGREMRAKFVLLTHFSQRYAKLPRLNKDLSENVGIA--FD 376
              E+  A +    + + I   RE R +         +   +  L +D  ++        
Sbjct: 44  KRQEEAEAKRKREELREKIAKAREKRERN-------SKLGGIKTLGEDDDDDDDTKAWLK 96

Query: 377 NMRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSI 409
             +  +KKK+ ++KK     +K+K++     S 
Sbjct: 97  KSKKRQKKKEAERKKALLLDEKEKERAAEYTSE 129


>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
          Length = 434

 Score = 39.9 bits (94), Expect = 0.002
 Identities = 9/27 (33%), Positives = 17/27 (62%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNS 408
           ++K+K K KK+ R  K   K++K + +
Sbjct: 405 KEKEKPKVKKRHRDTKNIGKRRKPSGT 431



 Score = 39.5 bits (93), Expect = 0.003
 Identities = 8/23 (34%), Positives = 16/23 (69%)

Query: 383 KKKKKKKKKKKRKKKKKKKKKKV 405
            +KK+K+K+K + KK+ +  K +
Sbjct: 400 AEKKEKEKEKPKVKKRHRDTKNI 422



 Score = 39.5 bits (93), Expect = 0.003
 Identities = 17/51 (33%), Positives = 24/51 (47%), Gaps = 12/51 (23%)

Query: 379 RFPEKKKKKKKKKK---KRKKKKKKKKKKVNNSIQISSRPEICREFDSKKI 426
           + P +KK  K  KK   KR +KK+K+K+K              R  D+K I
Sbjct: 381 KAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKK---------RHRDTKNI 422



 Score = 39.2 bits (92), Expect = 0.004
 Identities = 10/30 (33%), Positives = 16/30 (53%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVNNSIQ 410
            EK+K K KK+ +  K   K++K    S +
Sbjct: 405 KEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434



 Score = 38.8 bits (91), Expect = 0.006
 Identities = 9/26 (34%), Positives = 16/26 (61%)

Query: 379 RFPEKKKKKKKKKKKRKKKKKKKKKK 404
           R  +K+K+K+K K K++ +  K   K
Sbjct: 399 RAEKKEKEKEKPKVKKRHRDTKNIGK 424



 Score = 38.0 bits (89), Expect = 0.009
 Identities = 9/27 (33%), Positives = 16/27 (59%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNS 408
           EK+K+K K KK+ +  K   K++  + 
Sbjct: 404 EKEKEKPKVKKRHRDTKNIGKRRKPSG 430



 Score = 36.1 bits (84), Expect = 0.041
 Identities = 11/32 (34%), Positives = 20/32 (62%)

Query: 383 KKKKKKKKKKKRKKKKKKKKKKVNNSIQISSR 414
           K   K+ +KK+++K+K K KK+  ++  I  R
Sbjct: 394 KVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKR 425


>gnl|CDD|219564 pfam07771, TSGP1, Tick salivary peptide group 1.  This contains a
           group of peptides derived from a salivary gland cDNA
           library of the tick Ixodes scapularis. Also present are
           peptides from a related tick species, Ixodes ricinus.
           They are characterized by a putative signal peptide
           indicative of secretion and conserved cysteine residues.
          Length = 120

 Score = 37.6 bits (87), Expect = 0.002
 Identities = 16/24 (66%), Positives = 16/24 (66%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
            EK KKKKKK KK KK KK  KK 
Sbjct: 97  TEKPKKKKKKSKKTKKPKKSSKKD 120



 Score = 35.6 bits (82), Expect = 0.012
 Identities = 16/29 (55%), Positives = 19/29 (65%)

Query: 376 DNMRFPEKKKKKKKKKKKRKKKKKKKKKK 404
           D+   P +K KKKKKK K+ KK KK  KK
Sbjct: 91  DHPPEPTEKPKKKKKKSKKTKKPKKSSKK 119



 Score = 31.4 bits (71), Expect = 0.33
 Identities = 15/38 (39%), Positives = 20/38 (52%)

Query: 367 LSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKK 404
           L+ + G        P +  +K KKKKK+ KK KK KK 
Sbjct: 79  LTTSSGEPSHPDDHPPEPTEKPKKKKKKSKKTKKPKKS 116


>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19. 
           Med19 represents a family of conserved proteins which
           are members of the multi-protein co-activator Mediator
           complex. Mediator is required for activation of RNA
           polymerase II transcription by DNA binding
           transactivators.
          Length = 178

 Score = 37.9 bits (88), Expect = 0.004
 Identities = 15/27 (55%), Positives = 19/27 (70%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVNN 407
            + KKKK +  K+RKKKKK+KKKK   
Sbjct: 142 KKHKKKKHEDDKERKKKKKEKKKKKKR 168



 Score = 31.7 bits (72), Expect = 0.55
 Identities = 17/36 (47%), Positives = 21/36 (58%), Gaps = 7/36 (19%)

Query: 382 EKKKKKKKKKKKR-------KKKKKKKKKKVNNSIQ 410
            K  +KK KKKK        KKKK+KKKKK  +S +
Sbjct: 137 LKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRHSPE 172



 Score = 27.9 bits (62), Expect = 7.9
 Identities = 11/17 (64%), Positives = 15/17 (88%)

Query: 381 PEKKKKKKKKKKKRKKK 397
            E+KKKKK+KKKK+K+ 
Sbjct: 153 KERKKKKKEKKKKKKRH 169


>gnl|CDD|223565 COG0491, GloB, Zn-dependent hydrolases, including glyoxylases
           [General function prediction only].
          Length = 252

 Score = 38.6 bits (89), Expect = 0.004
 Identities = 14/60 (23%), Positives = 24/60 (40%), Gaps = 2/60 (3%)

Query: 132 GPDTCILLDCGEGTYSQLVRLYGSAVDTLLSQLSAVYISHLHADHHLGLFSVIKAWSRVK 191
           G    +L+D G G           A+  L   + A+ ++H H DH  G   + +A+    
Sbjct: 33  GEGGAVLIDTGLGDA--DAEALLEALAALGLDVDAILLTHGHFDHIGGAAVLKEAFGAAP 90


>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
          Length = 135

 Score = 37.0 bits (86), Expect = 0.005
 Identities = 15/27 (55%), Positives = 18/27 (66%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNS 408
            K+KK+KKKKKK+KKKK  KK      
Sbjct: 109 AKQKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 37.0 bits (86), Expect = 0.005
 Identities = 17/24 (70%), Positives = 20/24 (83%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKV 405
           EK K+KK+KKKK+KKKKKK  KK 
Sbjct: 107 EKAKQKKQKKKKKKKKKKKTSKKA 130



 Score = 35.4 bits (82), Expect = 0.018
 Identities = 15/22 (68%), Positives = 19/22 (86%)

Query: 383 KKKKKKKKKKKRKKKKKKKKKK 404
           K+K K+KK+KK+KKKKKKKK  
Sbjct: 106 KEKAKQKKQKKKKKKKKKKKTS 127



 Score = 35.0 bits (81), Expect = 0.022
 Identities = 15/24 (62%), Positives = 19/24 (79%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
            ++KK+KKKKKKK+KKK  KK  K
Sbjct: 109 AKQKKQKKKKKKKKKKKTSKKAAK 132



 Score = 35.0 bits (81), Expect = 0.027
 Identities = 15/23 (65%), Positives = 20/23 (86%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
           ++K K+KK+KKK+KKKKKKK  K
Sbjct: 106 KEKAKQKKQKKKKKKKKKKKTSK 128



 Score = 33.9 bits (78), Expect = 0.056
 Identities = 15/26 (57%), Positives = 20/26 (76%)

Query: 383 KKKKKKKKKKKRKKKKKKKKKKVNNS 408
           +  K+K K+KK+KKKKKKKKKK  + 
Sbjct: 103 QVLKEKAKQKKQKKKKKKKKKKKTSK 128



 Score = 33.5 bits (77), Expect = 0.075
 Identities = 15/23 (65%), Positives = 20/23 (86%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
           +  K+K K+KK++KKKKKKKKKK
Sbjct: 103 QVLKEKAKQKKQKKKKKKKKKKK 125



 Score = 32.4 bits (74), Expect = 0.21
 Identities = 14/22 (63%), Positives = 19/22 (86%)

Query: 383 KKKKKKKKKKKRKKKKKKKKKK 404
           K+  K+K K+K++KKKKKKKKK
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKK 123



 Score = 29.3 bits (66), Expect = 2.1
 Identities = 15/26 (57%), Positives = 20/26 (76%), Gaps = 2/26 (7%)

Query: 382 EKK--KKKKKKKKKRKKKKKKKKKKV 405
           E+K  K+  K+K K+KK+KKKKKKK 
Sbjct: 97  EQKIVKQVLKEKAKQKKQKKKKKKKK 122



 Score = 28.1 bits (63), Expect = 5.8
 Identities = 11/28 (39%), Positives = 16/28 (57%)

Query: 386 KKKKKKKKRKKKKKKKKKKVNNSIQISS 413
           K+  K+K ++KK+KKKKKK         
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKK 129


>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 142

 Score = 36.9 bits (86), Expect = 0.007
 Identities = 16/30 (53%), Positives = 21/30 (70%)

Query: 384 KKKKKKKKKKRKKKKKKKKKKVNNSIQISS 413
           KK+ K++KKK+KKKKKKK KK N   +   
Sbjct: 85  KKRAKRQKKKQKKKKKKKAKKGNKKEEKEG 114



 Score = 34.6 bits (80), Expect = 0.040
 Identities = 15/29 (51%), Positives = 22/29 (75%)

Query: 376 DNMRFPEKKKKKKKKKKKRKKKKKKKKKK 404
           +  R  E+K  KK+ K+++KK+KKKKKKK
Sbjct: 74  EKKRKDEEKTAKKRAKRQKKKQKKKKKKK 102



 Score = 33.5 bits (77), Expect = 0.083
 Identities = 18/33 (54%), Positives = 25/33 (75%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSR 414
           +KKK+KKKKKKK KK  KK++K+ + S + SS 
Sbjct: 91  QKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSD 123



 Score = 33.5 bits (77), Expect = 0.083
 Identities = 15/34 (44%), Positives = 21/34 (61%)

Query: 383 KKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPE 416
           K+ K++KKK+K+KKKKK KK       + S   E
Sbjct: 86  KRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSE 119



 Score = 32.7 bits (75), Expect = 0.16
 Identities = 13/29 (44%), Positives = 20/29 (68%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNSIQ 410
           +++K  KK+ K++KKK+KKKKKK      
Sbjct: 79  DEEKTAKKRAKRQKKKQKKKKKKKAKKGN 107


>gnl|CDD|188239 TIGR02650, RNase_Z_T_toga, ribonuclease Z, Thermotoga type.
           Members of this protein family are ribonuclease Z as
           found in the genus Thermotoga, where the enzyme cleaves
           after the CCA, in contrast to the activities
           characterized for other enzymes also designated
           ribonuclease Z. In other systems, cleavage occurs
           5-prime to the location of the CCA sequence, and CCA is
           added subsequently. A species may lack ribonuclease Z if
           all tRNA genes encode the CCA sequence, or if the CCA is
           exposed by exonuclease activity rather than endonuclease
           activity. Note that members of this sequence family
           differ considerably from the majority of RNase Z
           sequences [Transcription, RNA processing].
          Length = 277

 Score = 38.1 bits (88), Expect = 0.008
 Identities = 27/84 (32%), Positives = 42/84 (50%), Gaps = 6/84 (7%)

Query: 278 GVTMVTKSGHK--ITYSGDTMPCDALVSIGKNSDLLIHEATHEDELEKEAALKMHSTVSQ 335
           G   VT+  HK  +T SGD++  D     G  ++LLIHE T  D  ++    K H+ + +
Sbjct: 171 GRDFVTEEYHKKILTISGDSLALDPEEIEG--TELLIHECTFLDARDRR--YKNHAAIDE 226

Query: 336 AIRIGREMRAKFVLLTHFSQRYAK 359
            +   +    K V+L H S RY +
Sbjct: 227 VMESVKAAGVKKVILYHISTRYIR 250



 Score = 28.8 bits (64), Expect = 6.3
 Identities = 18/48 (37%), Positives = 24/48 (50%), Gaps = 8/48 (16%)

Query: 137 ILLDCGEGTYSQLVRLYGSAVDTLLSQLSAVYISHLHADHHLGLFSVI 184
           IL D GEG  + L    GS V         V+++H H DH  GL+ V+
Sbjct: 21  ILFDAGEGVSTTL----GSKV----YAFKYVFLTHGHVDHIAGLWGVV 60


>gnl|CDD|218148 pfam04557, tRNA_synt_1c_R2, Glutaminyl-tRNA synthetase,
           non-specific RNA binding region part 2.  This is a
           region found N terminal to the catalytic domain of
           glutaminyl-tRNA synthetase (EC 6.1.1.18) in eukaryotes
           but not in Escherichia coli. This region is thought to
           bind RNA in a non-specific manner, enhancing
           interactions between the tRNA and enzyme, but is not
           essential for enzyme function.
          Length = 83

 Score = 34.6 bits (80), Expect = 0.012
 Identities = 13/26 (50%), Positives = 13/26 (50%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVN 406
             KKKKKKKKKK        K KK  
Sbjct: 22  LVKKKKKKKKKKAEDTAATAKAKKAT 47



 Score = 33.1 bits (76), Expect = 0.045
 Identities = 13/34 (38%), Positives = 16/34 (47%)

Query: 383 KKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPE 416
            KKKKKKKKKK +      K K   +  +S    
Sbjct: 23  VKKKKKKKKKKAEDTAATAKAKKATAEDVSEGAM 56



 Score = 31.2 bits (71), Expect = 0.18
 Identities = 11/33 (33%), Positives = 16/33 (48%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISS 413
            +KKKKKKKK +      K KK    +  + + 
Sbjct: 24  KKKKKKKKKKAEDTAATAKAKKATAEDVSEGAM 56



 Score = 30.0 bits (68), Expect = 0.59
 Identities = 10/25 (40%), Positives = 13/25 (52%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKV 405
            +  KKKKKKKKK+ +      K  
Sbjct: 20  ADLVKKKKKKKKKKAEDTAATAKAK 44



 Score = 29.3 bits (66), Expect = 1.0
 Identities = 9/25 (36%), Positives = 11/25 (44%)

Query: 386 KKKKKKKKRKKKKKKKKKKVNNSIQ 410
             KKKKKK+KKK +           
Sbjct: 22  LVKKKKKKKKKKAEDTAATAKAKKA 46



 Score = 28.5 bits (64), Expect = 1.6
 Identities = 10/28 (35%), Positives = 15/28 (53%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVNNS 408
           P+ +    KKKKK+KKKK +       +
Sbjct: 16  PKTEADLVKKKKKKKKKKAEDTAATAKA 43



 Score = 28.1 bits (63), Expect = 2.6
 Identities = 11/36 (30%), Positives = 13/36 (36%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEI 417
           E    KKKKKKK+KK +                 E 
Sbjct: 19  EADLVKKKKKKKKKKAEDTAATAKAKKATAEDVSEG 54


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 36.2 bits (84), Expect = 0.016
 Identities = 15/36 (41%), Positives = 23/36 (63%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPE 416
            +K+KKKKK+ KK KK+KK KK+K+        + +
Sbjct: 153 EKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKK 188



 Score = 36.2 bits (84), Expect = 0.017
 Identities = 17/31 (54%), Positives = 19/31 (61%), Gaps = 2/31 (6%)

Query: 376 DNMRFPEKKKKKKK--KKKKRKKKKKKKKKK 404
                 EKK KK+K  + K  KKKKKKKKKK
Sbjct: 163 VKKEKKEKKDKKEKMVEPKGSKKKKKKKKKK 193



 Score = 35.1 bits (81), Expect = 0.045
 Identities = 15/24 (62%), Positives = 21/24 (87%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
            EKK+KKKKK+ K++KK+KK KK+
Sbjct: 152 EEKKEKKKKKEVKKEKKEKKDKKE 175



 Score = 33.5 bits (77), Expect = 0.13
 Identities = 16/34 (47%), Positives = 23/34 (67%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRP 415
           E+KK+KKKKK+ +K+KK+KK KK        S+ 
Sbjct: 152 EEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKK 185



 Score = 33.2 bits (76), Expect = 0.20
 Identities = 15/23 (65%), Positives = 21/23 (91%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
           E++KK+KKKKK+ KK+KK+KK K
Sbjct: 151 EEEKKEKKKKKEVKKEKKEKKDK 173



 Score = 29.7 bits (67), Expect = 2.3
 Identities = 11/28 (39%), Positives = 20/28 (71%)

Query: 378 MRFPEKKKKKKKKKKKRKKKKKKKKKKV 405
               E + ++++KK+K+KKK+ KK+KK 
Sbjct: 142 KVEKEAEVEEEEKKEKKKKKEVKKEKKE 169



 Score = 29.3 bits (66), Expect = 3.6
 Identities = 12/23 (52%), Positives = 21/23 (91%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
           E ++++KK+KKK+K+ KK+KK+K
Sbjct: 148 EVEEEEKKEKKKKKEVKKEKKEK 170


>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
           (DUF1754).  This is a eukaryotic protein family of
           unknown function.
          Length = 90

 Score = 34.3 bits (79), Expect = 0.017
 Identities = 13/31 (41%), Positives = 17/31 (54%)

Query: 384 KKKKKKKKKKRKKKKKKKKKKVNNSIQISSR 414
             KKKKKKKK+K K K++        + SS 
Sbjct: 18  DVKKKKKKKKKKNKSKEEVVTEKEEEEKSSA 48



 Score = 33.2 bits (76), Expect = 0.053
 Identities = 14/22 (63%), Positives = 16/22 (72%)

Query: 383 KKKKKKKKKKKRKKKKKKKKKK 404
           K KK   KKKK+KKKKK K K+
Sbjct: 13  KGKKIDVKKKKKKKKKKNKSKE 34



 Score = 32.8 bits (75), Expect = 0.076
 Identities = 14/33 (42%), Positives = 18/33 (54%)

Query: 383 KKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRP 415
           KK   KKKKKK+KKK K K++ V    +     
Sbjct: 15  KKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSS 47



 Score = 32.0 bits (73), Expect = 0.14
 Identities = 14/23 (60%), Positives = 17/23 (73%)

Query: 383 KKKKKKKKKKKRKKKKKKKKKKV 405
            KK   KKKKK+KKKK K K++V
Sbjct: 14  GKKIDVKKKKKKKKKKNKSKEEV 36



 Score = 31.2 bits (71), Expect = 0.24
 Identities = 14/25 (56%), Positives = 14/25 (56%)

Query: 384 KKKKKKKKKKRKKKKKKKKKKVNNS 408
             K K K KK   KKKKKKKK  N 
Sbjct: 7   GGKLKLKGKKIDVKKKKKKKKKKNK 31



 Score = 31.2 bits (71), Expect = 0.26
 Identities = 14/22 (63%), Positives = 15/22 (68%)

Query: 383 KKKKKKKKKKKRKKKKKKKKKK 404
           K K KK   KK+KKKKKKK K 
Sbjct: 11  KLKGKKIDVKKKKKKKKKKNKS 32



 Score = 30.9 bits (70), Expect = 0.29
 Identities = 14/52 (26%), Positives = 26/52 (50%)

Query: 383 KKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKKISLANLPKL 434
           KKKKKKKK K +++   +K+++  +S +   +     E + K     +   L
Sbjct: 22  KKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDEDNEKIEQEEDGMNL 73



 Score = 30.5 bits (69), Expect = 0.44
 Identities = 14/25 (56%), Positives = 15/25 (60%)

Query: 384 KKKKKKKKKKRKKKKKKKKKKVNNS 408
           K K KK   K+KKKKKKKK K    
Sbjct: 11  KLKGKKIDVKKKKKKKKKKNKSKEE 35



 Score = 29.7 bits (67), Expect = 0.77
 Identities = 12/23 (52%), Positives = 18/23 (78%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
           +KKKKKKKKK K K++   +K++
Sbjct: 20  KKKKKKKKKKNKSKEEVVTEKEE 42



 Score = 29.7 bits (67), Expect = 0.89
 Identities = 16/26 (61%), Positives = 16/26 (61%)

Query: 383 KKKKKKKKKKKRKKKKKKKKKKVNNS 408
             K K K KK   KKKKKKKKK N S
Sbjct: 7   GGKLKLKGKKIDVKKKKKKKKKKNKS 32


>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.  This
           domain is found to the N-terminus of bacterial signal
           peptidases of the S49 family (pfam01343).
          Length = 154

 Score = 35.6 bits (83), Expect = 0.021
 Identities = 11/23 (47%), Positives = 14/23 (60%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
           E K  +K +KK  K K K +KKK
Sbjct: 67  ELKAWEKAEKKAEKAKAKAEKKK 89



 Score = 32.5 bits (75), Expect = 0.20
 Identities = 11/23 (47%), Positives = 15/23 (65%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
            +KK +K K K  KKK KK++ K
Sbjct: 74  AEKKAEKAKAKAEKKKAKKEEPK 96



 Score = 31.3 bits (72), Expect = 0.54
 Identities = 11/23 (47%), Positives = 16/23 (69%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
           EKK +K K K ++KK KK++ K 
Sbjct: 75  EKKAEKAKAKAEKKKAKKEEPKP 97



 Score = 31.3 bits (72), Expect = 0.58
 Identities = 10/19 (52%), Positives = 13/19 (68%)

Query: 384 KKKKKKKKKKRKKKKKKKK 402
           K +KK +K K K +KKK K
Sbjct: 73  KAEKKAEKAKAKAEKKKAK 91



 Score = 31.0 bits (71), Expect = 0.62
 Identities = 9/22 (40%), Positives = 14/22 (63%)

Query: 383 KKKKKKKKKKKRKKKKKKKKKK 404
           K  +K +KK ++ K K +KKK 
Sbjct: 69  KAWEKAEKKAEKAKAKAEKKKA 90



 Score = 30.6 bits (70), Expect = 0.86
 Identities = 9/23 (39%), Positives = 16/23 (69%)

Query: 383 KKKKKKKKKKKRKKKKKKKKKKV 405
           K +K K K +K+K KK++ K ++
Sbjct: 77  KAEKAKAKAEKKKAKKEEPKPRL 99



 Score = 27.9 bits (63), Expect = 7.3
 Identities = 9/21 (42%), Positives = 14/21 (66%)

Query: 382 EKKKKKKKKKKKRKKKKKKKK 402
            +K K K +KKK KK++ K +
Sbjct: 78  AEKAKAKAEKKKAKKEEPKPR 98


>gnl|CDD|234547 TIGR04330, cas_Cpf1, CRISPR-associated protein Cpf1, subtype
           PREFRAN.  This family is the long protein of a novel
           CRISPR subtype, PREFRAN, which is most common in
           Prevotella and Francisella, although widely distributed.
           The PREFRAN type has Cas1, Cas2, and Cas4, but lacks the
           helicase Cas3 and endonuclease Cas3-HD.
          Length = 1287

 Score = 36.7 bits (85), Expect = 0.033
 Identities = 26/100 (26%), Positives = 39/100 (39%), Gaps = 15/100 (15%)

Query: 349 LLTHFSQRYAKLPRLNKDLSE-----------NVGIAFDN----MRFPEKKKKKKKKKKK 393
           LL     R      L K + +           NV ++FD          K KK KK KK 
Sbjct: 33  LLEQDKARAESYKELKKIIDKYHKDFIEETLSNVELSFDLLEEYFDLYNKLKKFKKDKKD 92

Query: 394 RKKKKKKKKKKVNNSIQISSRPEICREFDSKKISLANLPK 433
             K++   +K++  S+Q +   EI +   SKK+    L  
Sbjct: 93  LAKEQDTLRKQIVKSLQDNLPEEIFKNIFSKKLLKGVLED 132


>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
          Length = 742

 Score = 36.0 bits (83), Expect = 0.047
 Identities = 19/58 (32%), Positives = 24/58 (41%), Gaps = 6/58 (10%)

Query: 352 HFSQRYAKLPRLNKDLSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSI 409
           +F        +    L      A   MR P+ K +K KKK    K   KKKKKV +S 
Sbjct: 95  NFFDSKNNSKQFAGPL------AISLMRKPKPKTEKLKKKITVNKSTNKKKKKVLSSK 146



 Score = 28.6 bits (64), Expect = 9.7
 Identities = 14/41 (34%), Positives = 21/41 (51%)

Query: 363 LNKDLSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKK 403
           L+ +   +  +   N +F +    K  KK K K+KKK KKK
Sbjct: 46  LDINKKLHEKLDKKNKKFNKTDDLKDSKKTKLKQKKKIKKK 86


>gnl|CDD|220611 pfam10169, Laps, Learning-associated protein.  This is a family of
           121-amino acid secretory proteins. Laps functions in the
           regulation of neuronal cell adhesion and/or movement and
           synapse attachment. Laps binds to the ApC/EBP (Aplysia
           CCAAT/enhancer binding protein) promoter and activates
           the transcription of ApC/EBP mRNA.
          Length = 124

 Score = 34.1 bits (78), Expect = 0.049
 Identities = 15/30 (50%), Positives = 21/30 (70%), Gaps = 2/30 (6%)

Query: 379 RFPE--KKKKKKKKKKKRKKKKKKKKKKVN 406
           ++P    +++ KK KKKR+KKKKKKK   N
Sbjct: 93  QYPAWMNQRQAKKLKKKREKKKKKKKGAKN 122



 Score = 30.6 bits (69), Expect = 0.75
 Identities = 12/26 (46%), Positives = 19/26 (73%)

Query: 380 FPEKKKKKKKKKKKRKKKKKKKKKKV 405
             +++ KK KKK+++KKKKKK  K +
Sbjct: 98  MNQRQAKKLKKKREKKKKKKKGAKNL 123



 Score = 28.7 bits (64), Expect = 3.5
 Identities = 12/22 (54%), Positives = 15/22 (68%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKK 403
            KK KKK++KKK+KKK  K   
Sbjct: 103 AKKLKKKREKKKKKKKGAKNLA 124



 Score = 28.3 bits (63), Expect = 4.7
 Identities = 12/24 (50%), Positives = 18/24 (75%)

Query: 384 KKKKKKKKKKRKKKKKKKKKKVNN 407
            +++ KK KK+++KKKKKKK   N
Sbjct: 99  NQRQAKKLKKKREKKKKKKKGAKN 122



 Score = 27.9 bits (62), Expect = 5.1
 Identities = 12/23 (52%), Positives = 16/23 (69%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
           + KK KKK++KK+KKKK  K   
Sbjct: 102 QAKKLKKKREKKKKKKKGAKNLA 124


>gnl|CDD|222592 pfam14204, Ribosomal_L18_c, Ribosomal L18 C-terminal region.  This
           domain is the C-terminal end of ribosomal L18/L5
           proteins.
          Length = 93

 Score = 32.9 bits (76), Expect = 0.053
 Identities = 10/22 (45%), Positives = 14/22 (63%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKK 402
           P +KKK+KK+ K   K+   KK
Sbjct: 59  PSRKKKEKKEVKAESKRYNAKK 80



 Score = 29.8 bits (68), Expect = 0.70
 Identities = 11/30 (36%), Positives = 17/30 (56%)

Query: 374 AFDNMRFPEKKKKKKKKKKKRKKKKKKKKK 403
           A   +R    +KKK+KK+ K + K+   KK
Sbjct: 51  AHAAIRADPSRKKKEKKEVKAESKRYNAKK 80



 Score = 28.7 bits (65), Expect = 1.7
 Identities = 8/20 (40%), Positives = 14/20 (70%)

Query: 385 KKKKKKKKKRKKKKKKKKKK 404
           +KKK+KK+ + + K+   KK
Sbjct: 61  RKKKEKKEVKAESKRYNAKK 80



 Score = 27.1 bits (61), Expect = 6.2
 Identities = 9/26 (34%), Positives = 16/26 (61%), Gaps = 2/26 (7%)

Query: 382 EKKKKKKKKKKKRKKK--KKKKKKKV 405
           EKK+ K + K+   KK   +++K +V
Sbjct: 65  EKKEVKAESKRYNAKKLTLEQRKARV 90


>gnl|CDD|163212 TIGR03307, PhnP, phosphonate metabolism protein PhnP.  This family
           of proteins found in operons encoding phosphonate C-P
           lyase systems as is observed in E. coli and is a member
           of the metallo-beta-lactamase superfamily (pfam00753).
           As defined by this model, all instances of this protein
           are associated with the C-P lyase, but not all genomes
           containing the C-P lyase system contain phnP.
          Length = 238

 Score = 35.1 bits (81), Expect = 0.058
 Identities = 26/96 (27%), Positives = 47/96 (48%), Gaps = 6/96 (6%)

Query: 286 GHKITYSGDT--MPCDALVSIGKNS-DLLIHEATHEDELEKEAALKMHSTVSQAIRIGRE 342
           G ++ Y  DT  +P D    +  +  D+LI + +H     +  A + H+ +++A+ I  +
Sbjct: 144 GQRVAYLTDTAGLPPDTEAFLKNHPLDVLILDCSHP---PQSDAPRNHNDLTRALAINEQ 200

Query: 343 MRAKFVLLTHFSQRYAKLPRLNKDLSENVGIAFDNM 378
           +R K V+LTH S +       N DL   V + +D  
Sbjct: 201 LRPKQVILTHISHQLDAWLMENPDLPSGVAVGYDGQ 236


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 35.0 bits (81), Expect = 0.062
 Identities = 31/103 (30%), Positives = 44/103 (42%), Gaps = 7/103 (6%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNSIQI----SSRPEICREFDSKKISLANLPKLKLF 437
             KKKK+ K K  K+  KKKKKK   + +     + RP+   E  S   +L + P+ K  
Sbjct: 83  RLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSERISWAPTLLDSPRRKSS 142

Query: 438 YPALKAMFAEYQDEIENRAIRRNLKQESTYLARYPSENDKEKP 480
             +         + ++ R IRR   Q     AR   E  KEK 
Sbjct: 143 RSSTVQNKEATHERLKEREIRRKKIQAK---ARKRKEKKKEKE 182



 Score = 31.6 bits (72), Expect = 0.72
 Identities = 13/44 (29%), Positives = 20/44 (45%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKK 425
            +++ KKKK+ K K  K+  KKK       +  P+       KK
Sbjct: 80  REERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKK 123



 Score = 28.9 bits (65), Expect = 5.3
 Identities = 11/27 (40%), Positives = 20/27 (74%)

Query: 379 RFPEKKKKKKKKKKKRKKKKKKKKKKV 405
           R  E++ ++KK + K +K+K+KKK+K 
Sbjct: 156 RLKEREIRRKKIQAKARKRKEKKKEKE 182


>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
           This is a family of fungal proteins whose function is
           unknown.
          Length = 130

 Score = 33.4 bits (77), Expect = 0.080
 Identities = 9/26 (34%), Positives = 19/26 (73%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNN 407
           E+ +K+K+ +K R+KK K+++K+   
Sbjct: 105 ERAEKEKRTRKNREKKFKRRQKEKEK 130



 Score = 31.9 bits (73), Expect = 0.23
 Identities = 8/23 (34%), Positives = 17/23 (73%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
             K++ +K+K+ RK ++KK K++
Sbjct: 102 RTKERAEKEKRTRKNREKKFKRR 124



 Score = 31.9 bits (73), Expect = 0.24
 Identities = 9/28 (32%), Positives = 19/28 (67%)

Query: 377 NMRFPEKKKKKKKKKKKRKKKKKKKKKK 404
            +R   ++ K++ +K+KR +K ++KK K
Sbjct: 95  ALRLRRERTKERAEKEKRTRKNREKKFK 122



 Score = 29.5 bits (67), Expect = 1.5
 Identities = 13/41 (31%), Positives = 25/41 (60%), Gaps = 1/41 (2%)

Query: 366 DLSE-NVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKKV 405
           DL + N  +  + +R  EKKKK+K+  KKR+   + ++++ 
Sbjct: 63  DLKKHNAKVEKELLREKEKKKKRKRPGKKRRIALRLRRERT 103



 Score = 29.5 bits (67), Expect = 1.7
 Identities = 6/24 (25%), Positives = 17/24 (70%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKV 405
            ++++ K++ +K K+ +K ++KK 
Sbjct: 98  LRRERTKERAEKEKRTRKNREKKF 121



 Score = 29.2 bits (66), Expect = 1.9
 Identities = 7/23 (30%), Positives = 19/23 (82%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
           +++ +K+K+ +K ++KK K+++K
Sbjct: 104 KERAEKEKRTRKNREKKFKRRQK 126



 Score = 29.2 bits (66), Expect = 2.0
 Identities = 8/23 (34%), Positives = 18/23 (78%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
           E+ K++ +K+K+ +K ++KK K+
Sbjct: 101 ERTKERAEKEKRTRKNREKKFKR 123



 Score = 28.4 bits (64), Expect = 3.7
 Identities = 5/22 (22%), Positives = 15/22 (68%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKK 403
              + ++++ K+R +K+K+ +K
Sbjct: 94  IALRLRRERTKERAEKEKRTRK 115



 Score = 27.6 bits (62), Expect = 6.7
 Identities = 8/54 (14%), Positives = 22/54 (40%), Gaps = 10/54 (18%)

Query: 351 THFSQRYAKLPRLNKDLSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKK 404
               +  AK+ +      E            +K+K+  KK++   + ++++ K+
Sbjct: 62  IDLKKHNAKVEKELLREKEK----------KKKRKRPGKKRRIALRLRRERTKE 105


>gnl|CDD|203848 pfam08079, Ribosomal_L30_N, Ribosomal L30 N-terminal domain.  This
           presumed domain is found at the N-terminus of Ribosomal
           L30 proteins and has been termed RL30NT or NUC018.
          Length = 71

 Score = 31.8 bits (73), Expect = 0.086
 Identities = 11/25 (44%), Positives = 17/25 (68%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVN 406
           EK + K+ KK+  KK  +KKK+K+ 
Sbjct: 10  EKLRAKRAKKRAAKKAARKKKRKLI 34



 Score = 31.4 bits (72), Expect = 0.13
 Identities = 9/27 (33%), Positives = 17/27 (62%)

Query: 379 RFPEKKKKKKKKKKKRKKKKKKKKKKV 405
           R  + + K+ KK+  +K  +KKK+K +
Sbjct: 8   RNEKLRAKRAKKRAAKKAARKKKRKLI 34



 Score = 30.3 bits (69), Expect = 0.31
 Identities = 10/23 (43%), Positives = 15/23 (65%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
           ++ KK+  KK  RKKK+K   K+
Sbjct: 15  KRAKKRAAKKAARKKKRKLIFKR 37



 Score = 30.3 bits (69), Expect = 0.37
 Identities = 10/23 (43%), Positives = 14/23 (60%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
            K+ KK+  KK  +KKK+K   K
Sbjct: 14  AKRAKKRAAKKAARKKKRKLIFK 36



 Score = 29.5 bits (67), Expect = 0.60
 Identities = 8/24 (33%), Positives = 15/24 (62%)

Query: 383 KKKKKKKKKKKRKKKKKKKKKKVN 406
           KK+K+ +K + ++ KK+  KK   
Sbjct: 4   KKRKRNEKLRAKRAKKRAAKKAAR 27



 Score = 27.2 bits (61), Expect = 3.9
 Identities = 9/24 (37%), Positives = 15/24 (62%)

Query: 384 KKKKKKKKKKRKKKKKKKKKKVNN 407
           KK+K+ +K + K+ KK+  KK   
Sbjct: 4   KKRKRNEKLRAKRAKKRAAKKAAR 27


>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe.  JAK1 is a
           tyrosine kinase protein essential in signaling type I
           and type II cytokines. It interacts with the gamma chain
           of type I cytokine receptors to elicit signals from the
           IL-2 receptor family, the IL-4 receptor family, the
           gp130 receptor family, ciliary neurotrophic factor
           receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
           Leptin-R). It also is involved in transducing a signal
           by type I (IFN-alpha/beta) and type II (IFN-gamma)
           interferons, and members of the IL-10 family via type II
           cytokine receptors. JAK (also called Just Another
           Kinase) is a family of intracellular, non-receptor
           tyrosine kinases that transduce cytokine-mediated
           signals via the JAK-STAT pathway. The JAK family in
           mammals consists of 4 members: JAK1, JAK2, JAK3 and
           TYK2. JAKs are composed of seven JAK homology (JH)
           domains (JH1-JH7) . The C-terminal JH1 domain is the
           main catalytic domain, followed by JH2, which is often
           referred to as a pseudokinase domain, followed by
           JH3-JH4 which is homologous to the SH2 domain, and
           lastly JH5-JH7 which is a FERM domain.  Named after
           Janus, the two-faced Roman god of doorways, JAKs possess
           two near-identical phosphate-transferring domains; one
           which displays the kinase activity (JH1), while the
           other negatively regulates the kinase activity of the
           first (JH2). The FERM domain has a cloverleaf tripart
           structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
           C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
           C-lobe/F3 within the FERM domain is part of the PH
           domain family. The FERM domain is found in the
           cytoskeletal-associated proteins such as ezrin, moesin,
           radixin, 4.1R, and merlin. These proteins provide a link
           between the membrane and cytoskeleton and are involved
           in signal transduction pathways. The FERM domain is also
           found in protein tyrosine phosphatases (PTPs) , the
           tyrosine kinases FAK and JAK, in addition to other
           proteins involved in signaling. This domain is
           structurally similar to the PH and PTB domains and
           consequently is capable of binding to both peptides and
           phospholipids at different sites.
          Length = 198

 Score = 34.0 bits (78), Expect = 0.089
 Identities = 28/81 (34%), Positives = 32/81 (39%), Gaps = 18/81 (22%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNS-IQISSRPEICR-----------EFDSKKISLA 429
           EKKKK K KK K K KK + KKK        S  PEI               D+KK+   
Sbjct: 90  EKKKKGKSKKNKLKGKKDEDKKKAREGWNNFSYFPEITHIVIKESTVTINRQDNKKM--- 146

Query: 430 NLPKLKLFYPALKAMFAEYQD 450
              +LKL        FA   D
Sbjct: 147 ---ELKLSSRDEALSFAALVD 164


>gnl|CDD|221654 pfam12589, WBS_methylT, Methyltransferase involved in
           Williams-Beuren syndrome.  This domain family is found
           in eukaryotes, and is typically between 72 and 83 amino
           acids in length. The family is found in association with
           pfam08241. This family is made up of
           S-adenosylmethionine-dependent methyltransferases. The
           proteins are deleted in Williams-Beuren syndrome (WBS),
           a complex developmental disorder with multisystemic
           manifestations including supravalvular aortic stenosis
           (SVAS) and a specific cognitive phenotype.
          Length = 85

 Score = 31.9 bits (73), Expect = 0.11
 Identities = 13/40 (32%), Positives = 20/40 (50%), Gaps = 4/40 (10%)

Query: 366 DLSENVGIAFDNMRFPEKKKKKKKKKKKRKKKK----KKK 401
           +  E +  +         ++KKKKKKK +KK K    +KK
Sbjct: 22  EDDEQIDASKVRRISQRNRRKKKKKKKLKKKSKEWILRKK 61



 Score = 30.3 bits (69), Expect = 0.47
 Identities = 12/45 (26%), Positives = 20/45 (44%), Gaps = 1/45 (2%)

Query: 359 KLPRLNKDLSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKK 403
           +LP               +     +  ++ ++KKK+KKK KKK K
Sbjct: 11  QLPNG-LGEEGEEDDEQIDASKVRRISQRNRRKKKKKKKLKKKSK 54



 Score = 29.6 bits (67), Expect = 0.83
 Identities = 10/20 (50%), Positives = 15/20 (75%)

Query: 385 KKKKKKKKKRKKKKKKKKKK 404
             ++ ++KK+KKKK KKK K
Sbjct: 35  ISQRNRRKKKKKKKLKKKSK 54



 Score = 28.8 bits (65), Expect = 1.3
 Identities = 17/44 (38%), Positives = 25/44 (56%), Gaps = 4/44 (9%)

Query: 366 DLSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKK----KKKKKV 405
           D S+   I+  N R  +KKKK KKK K+   +KK    ++ K+V
Sbjct: 28  DASKVRRISQRNRRKKKKKKKLKKKSKEWILRKKEQMRRRGKEV 71



 Score = 28.0 bits (63), Expect = 2.9
 Identities = 9/29 (31%), Positives = 19/29 (65%)

Query: 376 DNMRFPEKKKKKKKKKKKRKKKKKKKKKK 404
           + +   + ++  ++ ++K+KKKKK KKK 
Sbjct: 25  EQIDASKVRRISQRNRRKKKKKKKLKKKS 53


>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
           membranes [Cell envelope biogenesis, outer membrane].
          Length = 244

 Score = 34.0 bits (78), Expect = 0.12
 Identities = 25/94 (26%), Positives = 33/94 (35%), Gaps = 11/94 (11%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKKISLANLPKLKLFYPA 440
           PEKK KK K K K K K K K K      +  S+          + +             
Sbjct: 90  PEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPSKTAAKAPAAPNQPARPPSAASASGAAT 149

Query: 441 LKAMFAEYQDEIENRAIRRNLKQESTYLARYPSE 474
             +  A Y   +  RAIRR          RYP++
Sbjct: 150 GPS--ASYLSGL-RRAIRRA--------PRYPAQ 172



 Score = 30.1 bits (68), Expect = 2.1
 Identities = 12/26 (46%), Positives = 15/26 (57%)

Query: 380 FPEKKKKKKKKKKKRKKKKKKKKKKV 405
            P++K K +KK KK K K K K K  
Sbjct: 83  KPKEKPKPEKKPKKPKPKPKPKPKPK 108



 Score = 30.1 bits (68), Expect = 2.1
 Identities = 11/30 (36%), Positives = 15/30 (50%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVNNSIQ 410
           P+ K+K K +KK +K K K K K       
Sbjct: 82  PKPKEKPKPEKKPKKPKPKPKPKPKPKPKV 111



 Score = 29.4 bits (66), Expect = 3.4
 Identities = 13/27 (48%), Positives = 15/27 (55%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVNN 407
           P K K+K K +KK KK K K K K   
Sbjct: 81  PPKPKEKPKPEKKPKKPKPKPKPKPKP 107


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 33.5 bits (77), Expect = 0.12
 Identities = 27/90 (30%), Positives = 37/90 (41%), Gaps = 5/90 (5%)

Query: 379 RFPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKKISLANLPKLKLFY 438
           +   K KKKK KKKK K K KK  KK + S +   +    +  D  K     L  L    
Sbjct: 81  KQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSELK 140

Query: 439 P---ALKAMFAEYQDEIENRAIRRNLKQES 465
           P   AL      YQ  ++ +      K+E+
Sbjct: 141 PRKYALHKDI--YQSRLDRKRRAEVAKKEA 168


>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6.  The surfeit locus
           protein SURF-6 is shown to be a component of the
           nucleolar matrix and has a strong binding capacity for
           nucleic acids.
          Length = 206

 Score = 33.8 bits (78), Expect = 0.12
 Identities = 12/21 (57%), Positives = 16/21 (76%)

Query: 385 KKKKKKKKKRKKKKKKKKKKV 405
           KK+K  KK +KKKK KKK ++
Sbjct: 185 KKRKDDKKNKKKKKAKKKGRI 205



 Score = 33.0 bits (76), Expect = 0.22
 Identities = 12/22 (54%), Positives = 16/22 (72%)

Query: 384 KKKKKKKKKKRKKKKKKKKKKV 405
           KK+K  KK K+KKK KKK + +
Sbjct: 185 KKRKDDKKNKKKKKAKKKGRIL 206



 Score = 31.5 bits (72), Expect = 0.58
 Identities = 12/22 (54%), Positives = 15/22 (68%)

Query: 383 KKKKKKKKKKKRKKKKKKKKKK 404
           KK+K  KK KK+KK KKK +  
Sbjct: 185 KKRKDDKKNKKKKKAKKKGRIL 206



 Score = 31.5 bits (72), Expect = 0.61
 Identities = 10/24 (41%), Positives = 21/24 (87%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
            E++++K++++K RKK+K+K+ KK
Sbjct: 8   LEQRRRKREQRKARKKQKRKEAKK 31



 Score = 31.5 bits (72), Expect = 0.68
 Identities = 12/26 (46%), Positives = 19/26 (73%)

Query: 385 KKKKKKKKKRKKKKKKKKKKVNNSIQ 410
           K+K+K+KKK KK+ K++K+KV     
Sbjct: 149 KRKEKQKKKSKKEWKERKEKVEKKKA 174



 Score = 31.5 bits (72), Expect = 0.71
 Identities = 11/34 (32%), Positives = 22/34 (64%)

Query: 383 KKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPE 416
           +K++++K +KK+K+K+ KKK+    S     + E
Sbjct: 13  RKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNE 46



 Score = 31.1 bits (71), Expect = 0.93
 Identities = 13/30 (43%), Positives = 18/30 (60%)

Query: 372 GIAFDNMRFPEKKKKKKKKKKKRKKKKKKK 401
            I F  + F + ++ KK  K K+KKKKKK 
Sbjct: 63  NIVFSKVEFADGEQAKKDLKLKKKKKKKKT 92



 Score = 30.0 bits (68), Expect = 1.9
 Identities = 8/23 (34%), Positives = 21/23 (91%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
           ++++K++++K ++K+K+K+ KKK
Sbjct: 10  QRRRKREQRKARKKQKRKEAKKK 32



 Score = 30.0 bits (68), Expect = 2.1
 Identities = 10/21 (47%), Positives = 14/21 (66%)

Query: 382 EKKKKKKKKKKKRKKKKKKKK 402
           ++K  KK KKKK+ KKK +  
Sbjct: 186 KRKDDKKNKKKKKAKKKGRIL 206



 Score = 29.6 bits (67), Expect = 3.0
 Identities = 16/28 (57%), Positives = 19/28 (67%), Gaps = 5/28 (17%)

Query: 382 EKKKKKKKKK-----KKRKKKKKKKKKK 404
           EKKK +++KK     KKRK  KK KKKK
Sbjct: 170 EKKKAERQKKREENLKKRKDDKKNKKKK 197



 Score = 28.4 bits (64), Expect = 6.0
 Identities = 9/50 (18%), Positives = 26/50 (52%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKKISLANL 431
            +++K +KK+K+++ KKK+  +K       +   +  ++    + +  N+
Sbjct: 15  REQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAEGNI 64



 Score = 28.0 bits (63), Expect = 10.0
 Identities = 13/28 (46%), Positives = 19/28 (67%), Gaps = 5/28 (17%)

Query: 382 EKKKKKKKK-----KKKRKKKKKKKKKK 404
           +KK +++KK     KK++  KK KKKKK
Sbjct: 171 KKKAERQKKREENLKKRKDDKKNKKKKK 198


>gnl|CDD|240254 PTZ00069, PTZ00069, 60S ribosomal protein L5; Provisional.
          Length = 300

 Score = 34.3 bits (79), Expect = 0.13
 Identities = 23/53 (43%), Positives = 26/53 (49%), Gaps = 2/53 (3%)

Query: 352 HFSQRYAKLPRLNKDLSENVGIAFDNMRF-PEKKKKKKKKKKKRKKKKKKKKK 403
            FS+ Y K       L +    A   +R  P K KKKKKKKKK   KK K KK
Sbjct: 223 QFSK-YIKAGVGPDSLEDMYKKAHAAIRANPSKVKKKKKKKKKVVHKKYKTKK 274


>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
          Length = 330

 Score = 34.0 bits (79), Expect = 0.14
 Identities = 9/22 (40%), Positives = 15/22 (68%)

Query: 384 KKKKKKKKKKRKKKKKKKKKKV 405
           K +KKK+K++ K  K K K ++
Sbjct: 71  KAQKKKEKQEAKAAKAKSKPRL 92



 Score = 33.7 bits (78), Expect = 0.19
 Identities = 15/56 (26%), Positives = 27/56 (48%), Gaps = 9/56 (16%)

Query: 350 LTHFSQRYAKLPRLNKDLSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKKV 405
           +T+ +++Y       K++ E +  A   +   E K   K +KKK K++ K  K K 
Sbjct: 42  VTNLNEQY-------KEMKEELKAAL--LDKKELKAWHKAQKKKEKQEAKAAKAKS 88



 Score = 31.7 bits (73), Expect = 0.89
 Identities = 8/22 (36%), Positives = 14/22 (63%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKK 403
            K +KKK+K++ +  K K K +
Sbjct: 70  HKAQKKKEKQEAKAAKAKSKPR 91



 Score = 29.8 bits (68), Expect = 3.3
 Identities = 9/26 (34%), Positives = 15/26 (57%)

Query: 383 KKKKKKKKKKKRKKKKKKKKKKVNNS 408
           KK+ K   K ++KK+K++ K     S
Sbjct: 63  KKELKAWHKAQKKKEKQEAKAAKAKS 88


>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
           complex component).  RNases H are enzymes that
           specifically hydrolyse RNA when annealed to a
           complementary DNA and are present in all living
           organisms. In yeast RNase H2 is composed of a complex of
           three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
           family represents the homologues of Ydr279p. It is not
           known whether non yeast proteins in this family fulfil
           the same function.
          Length = 287

 Score = 33.8 bits (78), Expect = 0.17
 Identities = 12/37 (32%), Positives = 17/37 (45%)

Query: 377 NMRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISS 413
             +   K++ KKKK K+ K  K  KK       +I S
Sbjct: 249 EKKRKSKEEIKKKKPKESKGVKALKKVVAKGMKKIDS 285



 Score = 32.7 bits (75), Expect = 0.32
 Identities = 17/68 (25%), Positives = 25/68 (36%)

Query: 337 IRIGREMRAKFVLLTHFSQRYAKLPRLNKDLSENVGIAFDNMRFPEKKKKKKKKKKKRKK 396
                +   K +L+  F      L    K   E            EKK+K K++ KK+K 
Sbjct: 204 PPDLYKELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKP 263

Query: 397 KKKKKKKK 404
           K+ K  K 
Sbjct: 264 KESKGVKA 271



 Score = 30.8 bits (70), Expect = 1.5
 Identities = 10/37 (27%), Positives = 15/37 (40%)

Query: 368 SENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKK 404
           +E+        +   KKKK K+ K  +  KK   K  
Sbjct: 244 AESRAEKKRKSKEEIKKKKPKESKGVKALKKVVAKGM 280



 Score = 28.8 bits (65), Expect = 5.2
 Identities = 13/40 (32%), Positives = 18/40 (45%)

Query: 365 KDLSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKK 404
           +D+      A    +  E+ KKKK K+ K  K  KK   K
Sbjct: 239 EDVEAAESRAEKKRKSKEEIKKKKPKESKGVKALKKVVAK 278


>gnl|CDD|227360 COG5027, SAS2, Histone acetyltransferase (MYST family) [Chromatin
           structure and dynamics].
          Length = 395

 Score = 34.0 bits (78), Expect = 0.18
 Identities = 27/122 (22%), Positives = 47/122 (38%), Gaps = 12/122 (9%)

Query: 357 YAKLPRLNKDLSENVGIAFDNMR----FPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQIS 412
           Y     LN+ L E +     N+      P++KK+ +K KK++K K   +    N ++Q+ 
Sbjct: 38  YVHYVELNRRLDEWITADLINLGAAISIPKRKKQTEKGKKEKKPKVSDRMDLDNENVQLE 97

Query: 413 SRPEICREFDSKKISLANLPKLKLFYPALKAMFAEYQDEIENRAIRRNLKQESTYLARYP 472
               I  E + +++        K+  P              N     N + E  Y + YP
Sbjct: 98  MLYSISNEREIRQL---RFGGSKVQNP--HEGARVKNI---NEIKLGNYEIEPWYFSPYP 149

Query: 473 SE 474
            E
Sbjct: 150 EE 151


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 34.3 bits (79), Expect = 0.19
 Identities = 20/62 (32%), Positives = 27/62 (43%), Gaps = 3/62 (4%)

Query: 378  MRFPEKKK---KKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKKISLANLPKL 434
             R   K K    K +K K +KK+KKKKK   + S + S      R    +K  L + P  
Sbjct: 1156 QRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDN 1215

Query: 435  KL 436
            K 
Sbjct: 1216 KK 1217



 Score = 30.4 bits (69), Expect = 2.4
 Identities = 16/58 (27%), Positives = 25/58 (43%), Gaps = 1/58 (1%)

Query: 381  PEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKK-ISLANLPKLKLF 437
            P KKK KK+ +      KKKKK +   + +  S+  + +   S+    L    K K  
Sbjct: 1305 PTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKSD 1362


>gnl|CDD|224157 COG1236, YSH1, Predicted exonuclease of the beta-lactamase fold
           involved in RNA processing [Translation, ribosomal
           structure and biogenesis].
          Length = 427

 Score = 33.9 bits (78), Expect = 0.20
 Identities = 15/49 (30%), Positives = 18/49 (36%), Gaps = 5/49 (10%)

Query: 133 PDTCILLDCG-EGTYSQLVRLYGSAVDTLLSQLSAVYISHLHADHHLGL 180
             T ILLDCG         R        +     AV ++H H DH   L
Sbjct: 22  GGTRILLDCGLFPGDPSPERPLLPPFPKV----DAVLLTHAHLDHIGAL 66


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 33.9 bits (78), Expect = 0.21
 Identities = 18/53 (33%), Positives = 30/53 (56%), Gaps = 4/53 (7%)

Query: 379 RFPEKKKKKKKKKKKRKKKKKKKKKKV---NNSIQISSRPEICREFDSKKISL 428
           R  EK   + K+KKK KKK++KKKK++      I+        +E ++K ++L
Sbjct: 319 RDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERIEKLEVQATDKE-ENKTVAL 370


>gnl|CDD|219913 pfam08576, DUF1764, Eukaryotic protein of unknown function
           (DUF1764).  This is a family of eukaryotic proteins of
           unknown function. This family contains many hypothetical
           proteins.
          Length = 98

 Score = 31.7 bits (72), Expect = 0.22
 Identities = 13/47 (27%), Positives = 23/47 (48%)

Query: 358 AKLPRLNKDLSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKK 404
            +  +  K    ++   F N++  +KKKK+  K  + K  KK +KK 
Sbjct: 1   KEEKKNEKTDKRDIDDIFSNIKKRKKKKKRTAKTARPKATKKGQKKD 47


>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
          Length = 213

 Score = 33.0 bits (76), Expect = 0.22
 Identities = 11/56 (19%), Positives = 23/56 (41%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKKISLANLPKLKL 436
            E   +K+K  ++ +K K K  +K          P+     D  ++++    K+K 
Sbjct: 122 REAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISELTVGQAVKVKA 177



 Score = 31.0 bits (71), Expect = 0.84
 Identities = 8/36 (22%), Positives = 18/36 (50%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPE 416
            +++   +K+K  R+++K K K         + +PE
Sbjct: 120 KKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPE 155



 Score = 30.7 bits (70), Expect = 1.3
 Identities = 7/25 (28%), Positives = 16/25 (64%)

Query: 380 FPEKKKKKKKKKKKRKKKKKKKKKK 404
             ++K  ++++K K K  +KK+K +
Sbjct: 126 GEKEKAPRRERKPKPKAPRKKRKPR 150


>gnl|CDD|219547 pfam07741, BRF1, Brf1-like TBP-binding domain.  This region covers
           both the Brf homology II and III regions. This region is
           involved in binding TATA binding protein.
          Length = 95

 Score = 31.5 bits (72), Expect = 0.23
 Identities = 21/65 (32%), Positives = 33/65 (50%), Gaps = 7/65 (10%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPE-ICREFDSKKISLANLPKLKLFYPA 440
            K+K  +     +KKKK+K KKK + +   S+  E + +    KK S       K+ Y A
Sbjct: 37  LKQKADEGNNSGKKKKKRKAKKKRDEAGPASTAAEAVKKMLQKKKFS------KKINYDA 90

Query: 441 LKAMF 445
           LK++F
Sbjct: 91  LKSLF 95


>gnl|CDD|234707 PRK00270, rpsU, 30S ribosomal protein S21; Reviewed.
          Length = 64

 Score = 30.2 bits (69), Expect = 0.25
 Identities = 10/25 (40%), Positives = 17/25 (68%)

Query: 379 RFPEKKKKKKKKKKKRKKKKKKKKK 403
            F EK  +K+K+KK   +K+++KK 
Sbjct: 36  EFYEKPSEKRKRKKAAARKRRRKKL 60



 Score = 29.1 bits (66), Expect = 0.68
 Identities = 9/20 (45%), Positives = 15/20 (75%)

Query: 382 EKKKKKKKKKKKRKKKKKKK 401
           EK+K+KK   +KR++KK  +
Sbjct: 43  EKRKRKKAAARKRRRKKLAR 62


>gnl|CDD|225324 COG2604, COG2604, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 594

 Score = 33.5 bits (77), Expect = 0.26
 Identities = 24/123 (19%), Positives = 37/123 (30%), Gaps = 9/123 (7%)

Query: 358 AKLPRLNKDLSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEI 417
                L+KD +      F  +       ++K   K  +K KK                EI
Sbjct: 447 VCEKLLDKDKN----KPFIKLILLSNNSQEKAVLKLLQKIKKNNDFIKEFETDALKLQEI 502

Query: 418 CREFDSKKISLANL-PKLKLFYPALKAMFAEYQDEIENRAIRRNLKQESTYLARYPSEND 476
             + DSK   L  +  K+                EI    ++  L      +AR    N 
Sbjct: 503 LEKVDSKSEKLEKISAKIDNIKELFDESKMSILQEI----LQPALHHIELNIARIYKLNI 558

Query: 477 KEK 479
           K+K
Sbjct: 559 KDK 561


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 32.3 bits (74), Expect = 0.28
 Identities = 8/21 (38%), Positives = 17/21 (80%)

Query: 382 EKKKKKKKKKKKRKKKKKKKK 402
           EK+  K K++K+R+ ++K+K+
Sbjct: 125 EKELAKLKREKRRENERKQKE 145



 Score = 31.1 bits (71), Expect = 0.58
 Identities = 7/24 (29%), Positives = 17/24 (70%)

Query: 384 KKKKKKKKKKRKKKKKKKKKKVNN 407
           K+  K K++KR++ ++K+K+ +  
Sbjct: 126 KELAKLKREKRRENERKQKEILKE 149



 Score = 29.9 bits (68), Expect = 1.5
 Identities = 5/23 (21%), Positives = 17/23 (73%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
             +K+  K K++++++ ++K+K+
Sbjct: 123 LLEKELAKLKREKRRENERKQKE 145



 Score = 28.4 bits (64), Expect = 5.5
 Identities = 6/23 (26%), Positives = 16/23 (69%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
           ++  K K++K++  ++K+K+  K
Sbjct: 126 KELAKLKREKRRENERKQKEILK 148



 Score = 28.4 bits (64), Expect = 5.6
 Identities = 6/23 (26%), Positives = 17/23 (73%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
           ++  +K+  K KR+K+++ ++K+
Sbjct: 121 DELLEKELAKLKREKRRENERKQ 143



 Score = 27.6 bits (62), Expect = 9.5
 Identities = 7/22 (31%), Positives = 17/22 (77%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKK 403
            K K++K+++ +RK+K+  K++
Sbjct: 129 AKLKREKRRENERKQKEILKEQ 150


>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
          Length = 191

 Score = 32.5 bits (74), Expect = 0.30
 Identities = 17/45 (37%), Positives = 20/45 (44%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKK 425
           P KK  KKKK   RKKK  KK KK     +           DS++
Sbjct: 46  PSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEE 90



 Score = 30.2 bits (68), Expect = 1.9
 Identities = 18/63 (28%), Positives = 31/63 (49%), Gaps = 1/63 (1%)

Query: 365 KDLSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSK 424
           K  +E+V    D++ FPE  +   KK  K+KK   +KKK    + +     E   E  ++
Sbjct: 25  KSTTESVDKVNDDI-FPEDVEIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAE 83

Query: 425 KIS 427
           ++S
Sbjct: 84  ELS 86


>gnl|CDD|220121 pfam09135, Alb1, Alb1.  Alb1 is a nuclear shuttling factor involved
           in ribosome biogenesis.
          Length = 112

 Score = 31.3 bits (71), Expect = 0.37
 Identities = 17/59 (28%), Positives = 32/59 (54%), Gaps = 2/59 (3%)

Query: 358 AKLPRLNKDLSENVGI--AFDNMRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSR 414
             LPR  +   +  GI  A  N    +KK++KKK K+  +K++K++KK +  +  ++  
Sbjct: 22  TSLPRAKRTDRKRPGILAASKNEGVSKKKQRKKKPKRLTRKQRKRQKKGLERAEIVTDT 80


>gnl|CDD|188365 TIGR03675, arCOG00543, arCOG00543 universal archaeal
           KH-domain/beta-lactamase-domain protein.  This family of
           proteins is universal in the archaea and consistsof an
           N-terminal type-1 KH-domain (pfam00013) a central
           beta-lactamase-domain (pfam00753) with a C-terminal
           motif associated with RNA metabolism (pfam07521).
           KH-domains are associated with RNA-binding, so taken
           together, this protein is a likely metal-dependent
           RNAase. This family was defined in as arCOG01782.
          Length = 630

 Score = 33.0 bits (76), Expect = 0.40
 Identities = 16/48 (33%), Positives = 23/48 (47%), Gaps = 6/48 (12%)

Query: 133 PDTCILLDCGEG----TYSQLVRLYGSAVDTLLSQLSAVYISHLHADH 176
           P++ +LLDCG        +    L     +  L +L AV I+H H DH
Sbjct: 196 PESRVLLDCGVNVGADGDNAYPYLD--VPEFQLDELDAVVITHAHLDH 241


>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992).  This
           bacterial family of proteins has no known function.
           However, the cis-regulatory yjdF motif, just upstream
           from the gene encoding the proteins for this family, is
           a small non-coding RNA, Rfam:RF01764. The yjdF motif is
           found in many Firmicutes, including Bacillus subtilis.
           In most cases, it resides in potential 5' UTRs of
           homologues of the yjdF gene whose function is unknown.
           However, in Streptococcus thermophilus, a yjdF RNA motif
           is associated with an operon whose protein products
           synthesise nicotinamide adenine dinucleotide (NAD+).
           Also, the S. thermophilus yjdF RNA lacks typical yjdF
           motif consensus features downstream of and including the
           P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
           S. thermophilus RNAs might sense a distinct compound
           that structurally resembles the ligand bound by other
           yjdF RNAs. On the ohter hand, perhaps these RNAs have an
           alternative solution forming a similar binding site, as
           is observed with some SAM riboswitches.
          Length = 132

 Score = 31.5 bits (72), Expect = 0.41
 Identities = 11/22 (50%), Positives = 20/22 (90%)

Query: 383 KKKKKKKKKKKRKKKKKKKKKK 404
           K+KK+++K++KR+ K++KKK K
Sbjct: 107 KEKKEEEKERKRQLKQQKKKAK 128



 Score = 31.1 bits (71), Expect = 0.47
 Identities = 10/22 (45%), Positives = 19/22 (86%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKK 403
           E+ K++KKK+ K KK+++K++K
Sbjct: 96  ERNKQEKKKRSKEKKEEEKERK 117



 Score = 28.8 bits (65), Expect = 3.6
 Identities = 8/22 (36%), Positives = 20/22 (90%)

Query: 383 KKKKKKKKKKKRKKKKKKKKKK 404
           ++ K++KKK+ ++KK+++K++K
Sbjct: 96  ERNKQEKKKRSKEKKEEEKERK 117



 Score = 28.0 bits (63), Expect = 5.7
 Identities = 9/24 (37%), Positives = 20/24 (83%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKV 405
           E ++ K++KKK+ K+KK+++K++ 
Sbjct: 94  EHERNKQEKKKRSKEKKEEEKERK 117



 Score = 27.6 bits (62), Expect = 7.9
 Identities = 9/23 (39%), Positives = 19/23 (82%)

Query: 383 KKKKKKKKKKKRKKKKKKKKKKV 405
           ++KKK+ K+KK ++K++K++ K 
Sbjct: 100 QEKKKRSKEKKEEEKERKRQLKQ 122


>gnl|CDD|176075 cd08693, C2_PI3K_class_I_beta_delta, C2 domain present in class I
           beta and delta phosphatidylinositol 3-kinases (PI3Ks).
           PI3Ks (AKA phosphatidylinositol (PtdIns) 3-kinases)
           regulate cell processes such as cell growth,
           differentiation, proliferation, and motility.  PI3Ks
           work on phosphorylation of phosphatidylinositol,
           phosphatidylinositide (4)P (PtdIns (4)P),2 or
           PtdIns(4,5)P2. Specifically they phosphorylate the D3
           hydroxyl group of phosphoinositol lipids on the inositol
           ring. There are 3 classes of PI3Ks based on structure,
           regulation, and specificity. All classes contain a C2
           domain, a PIK domain, and a kinase catalytic domain.
           The members here are class I, beta and delta isoforms of
           PI3Ks and contain both a Ras-binding domain and a
           p85-binding domain.  Class II PI3Ks contain both of
           these as well as a PX domain, and a C-terminal C2 domain
           containing a nuclear localization signal.  C2 domains
           fold into an 8-standed beta-sandwich that can adopt 2
           structural arrangements: Type I and Type II,
           distinguished by a circular permutation involving their
           N- and C-terminal beta strands. Many C2 domains are
           Ca2+-dependent membrane-targeting modules that bind a
           wide variety of substances including bind phospholipids,
           inositol polyphosphates, and intracellular proteins.
           Most C2 domain proteins are either signal transduction
           enzymes that contain a single C2 domain, such as protein
           kinase C, or membrane trafficking proteins which contain
           at least two C2 domains, such as synaptotagmin 1.
           However, there are a few exceptions to this including
           RIM isoforms and some splice variants of piccolo/aczonin
           and intersectin which only have a single C2 domain.  C2
           domains with a calcium binding region have negatively
           charged residues, primarily aspartates, that serve as
           ligands for calcium ions.  Members have a type-I
           topology.
          Length = 173

 Score = 31.9 bits (73), Expect = 0.44
 Identities = 14/26 (53%), Positives = 17/26 (65%)

Query: 384 KKKKKKKKKKRKKKKKKKKKKVNNSI 409
             KK K K+ RK + KKKKKK +N I
Sbjct: 85  VSKKAKGKRSRKNQTKKKKKKDDNPI 110



 Score = 30.0 bits (68), Expect = 1.8
 Identities = 12/25 (48%), Positives = 16/25 (64%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVN 406
           E  KK K K+ ++ + KKKKKK  N
Sbjct: 84  EVSKKAKGKRSRKNQTKKKKKKDDN 108


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 32.3 bits (74), Expect = 0.45
 Identities = 9/24 (37%), Positives = 15/24 (62%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
            + K KK  K K  +K+K K++K+
Sbjct: 54  AKAKAKKALKAKIEEKEKAKREKE 77



 Score = 31.6 bits (72), Expect = 0.80
 Identities = 10/24 (41%), Positives = 15/24 (62%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
             K K KK  K K ++K+K K++K
Sbjct: 53  AAKAKAKKALKAKIEEKEKAKREK 76



 Score = 31.2 bits (71), Expect = 0.97
 Identities = 12/23 (52%), Positives = 14/23 (60%)

Query: 384 KKKKKKKKKKRKKKKKKKKKKVN 406
           K K +K  K  KKKK K K K+N
Sbjct: 194 KLKAEKAAKGGKKKKGKAKAKLN 216



 Score = 31.2 bits (71), Expect = 1.1
 Identities = 8/24 (33%), Positives = 17/24 (70%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
            + KK  K K ++++K K++K++K
Sbjct: 56  AKAKKALKAKIEEKEKAKREKEEK 79



 Score = 30.8 bits (70), Expect = 1.4
 Identities = 9/24 (37%), Positives = 15/24 (62%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
             K KK  K K + K+K K++K++
Sbjct: 55  KAKAKKALKAKIEEKEKAKREKEE 78



 Score = 30.0 bits (68), Expect = 2.3
 Identities = 8/24 (33%), Positives = 14/24 (58%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
              K K KK  K + ++K+K K++
Sbjct: 52  VAAKAKAKKALKAKIEEKEKAKRE 75



 Score = 30.0 bits (68), Expect = 2.6
 Identities = 9/24 (37%), Positives = 13/24 (54%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
            + K   K K KK  K K ++K+K
Sbjct: 48  EKAKVAAKAKAKKALKAKIEEKEK 71



 Score = 29.6 bits (67), Expect = 2.9
 Identities = 9/25 (36%), Positives = 15/25 (60%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVN 406
           E+K+++K K   + K KK  K K+ 
Sbjct: 43  EEKEEEKAKVAAKAKAKKALKAKIE 67



 Score = 29.2 bits (66), Expect = 3.8
 Identities = 9/24 (37%), Positives = 13/24 (54%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
            EK K   K K K+  K K ++K+
Sbjct: 47  EEKAKVAAKAKAKKALKAKIEEKE 70



 Score = 29.2 bits (66), Expect = 4.4
 Identities = 9/23 (39%), Positives = 12/23 (52%)

Query: 384 KKKKKKKKKKRKKKKKKKKKKVN 406
            +K K +K  +  KKKK K K  
Sbjct: 192 NEKLKAEKAAKGGKKKKGKAKAK 214



 Score = 28.5 bits (64), Expect = 7.1
 Identities = 8/24 (33%), Positives = 13/24 (54%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
            +   K K KK  + K ++K+K K
Sbjct: 50  AKVAAKAKAKKALKAKIEEKEKAK 73



 Score = 28.1 bits (63), Expect = 9.6
 Identities = 9/22 (40%), Positives = 13/22 (59%)

Query: 385 KKKKKKKKKRKKKKKKKKKKVN 406
           K K KK  K K ++K+K K+  
Sbjct: 55  KAKAKKALKAKIEEKEKAKREK 76



 Score = 28.1 bits (63), Expect = 9.7
 Identities = 9/23 (39%), Positives = 14/23 (60%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
           E ++K+++K K   K K KK  K
Sbjct: 41  EDEEKEEEKAKVAAKAKAKKALK 63



 Score = 28.1 bits (63), Expect = 9.8
 Identities = 8/24 (33%), Positives = 14/24 (58%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
            +++K+++K K   K K KK  K 
Sbjct: 41  EDEEKEEEKAKVAAKAKAKKALKA 64


>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
           N-terminal.  SRP is a complex of six distinct
           polypeptides and a 7S RNA that is essential for
           transferring nascent polypeptide chains that are
           destined for export from the cell to the translocation
           apparatus of the endoplasmic reticulum (ER) membrane.
           SRP binds hydrophobic signal sequences as they emerge
           from the ribosome, and arrests translation.
          Length = 272

 Score = 32.4 bits (74), Expect = 0.45
 Identities = 7/31 (22%), Positives = 10/31 (32%)

Query: 375 FDNMRFPEKKKKKKKKKKKRKKKKKKKKKKV 405
              +        +K  K K   KK  KK + 
Sbjct: 184 AAKLSSTASSGDEKSPKSKAAPKKAGKKMRK 214



 Score = 28.2 bits (63), Expect = 9.6
 Identities = 5/23 (21%), Positives = 7/23 (30%)

Query: 384 KKKKKKKKKKRKKKKKKKKKKVN 406
                 +K  + K   KK  K  
Sbjct: 190 TASSGDEKSPKSKAAPKKAGKKM 212


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 32.9 bits (76), Expect = 0.46
 Identities = 18/50 (36%), Positives = 26/50 (52%), Gaps = 6/50 (12%)

Query: 364 NKDLSENVGIAFDNMRFP----EKKKKKKKKKKKRKKKKKKKKKKVNNSI 409
           +K+     GI    M+ P    EK +K KKKKKK+ K  K K + V+  +
Sbjct: 660 DKEAIVQAGIM--KMKVPLSDLEKIQKPKKKKKKKPKTVKPKPRTVSLEL 707


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 32.8 bits (75), Expect = 0.48
 Identities = 24/73 (32%), Positives = 36/73 (49%), Gaps = 2/73 (2%)

Query: 353 FSQRYAKLPRLNKDLSENVGIAFDN--MRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQ 410
           F  R  KL   + D+S  +   F N   +  E K     ++K +KKKKK+KKK+     +
Sbjct: 703 FLDRIDKLFFFSFDISGLMKKIFRNWNGKDAEFKISDSVEEKTKKKKKKEKKKEEEYKRE 762

Query: 411 ISSRPEICREFDS 423
             +R EI   +DS
Sbjct: 763 EKARIEIAEAWDS 775



 Score = 30.8 bits (70), Expect = 1.9
 Identities = 19/70 (27%), Positives = 33/70 (47%), Gaps = 7/70 (10%)

Query: 361  PRLNKDLSENVGIAFDNMRFPEKKKK----KKKKKKKRKKKKKKKKKKVNNSIQISSRPE 416
              +++   +   + + N    EKK K    + K  + + +K  K+KKK+ N+I IS    
Sbjct: 1003 QMIHESSVQIRSMEWTNSSLTEKKIKDLTDRTKTIRNQIEKITKEKKKITNTINISPNK- 1061

Query: 417  ICREFDSKKI 426
                +DSK I
Sbjct: 1062 --TSYDSKII 1069


>gnl|CDD|237972 PRK15483, PRK15483, type III restriction-modification system StyLTI
           enzyme res; Provisional.
          Length = 986

 Score = 32.7 bits (75), Expect = 0.52
 Identities = 27/126 (21%), Positives = 46/126 (36%), Gaps = 18/126 (14%)

Query: 372 GIAFDNMRFPEKKKKKKKKKKKRKKKK---KKKKKK----VNNSI-QISSRPE--ICREF 421
           G+       PE++    K       KK   ++ KK     V  ++  I    E       
Sbjct: 275 GVDIFYPEMPEEQANNVKIVDSVTAKKLILRRGKKTFELAVGENLADIDPGFEGLTIEYA 334

Query: 422 DSKKISLAN---LPKLKLFYPALKAMFAE-YQDEIENRAIRRNLKQESTYLARYPSENDK 477
               + L+N   L K     P     +++ YQ+ +  +AI ++ + E     R  +E  K
Sbjct: 335 GKTSVVLSNGLELEKGMALVPG---TYSQSYQELMIQQAIDKHFETERANFLR-RNEPPK 390

Query: 478 EKPGSL 483
            K  SL
Sbjct: 391 IKTLSL 396


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 32.6 bits (75), Expect = 0.53
 Identities = 17/51 (33%), Positives = 27/51 (52%), Gaps = 5/51 (9%)

Query: 357 YAKL---PRLNKDLSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKK 404
           Y KL   P L +   E       N+   E+KK +KK++K  KK +K++ +K
Sbjct: 384 YLKLHDKPLLAEGEEEE--GENGNLSPAERKKLRKKQRKAEKKAEKEEAEK 432



 Score = 29.5 bits (67), Expect = 4.7
 Identities = 8/22 (36%), Positives = 16/22 (72%)

Query: 383 KKKKKKKKKKKRKKKKKKKKKK 404
           KK++K +KK ++++ +K   KK
Sbjct: 416 KKQRKAEKKAEKEEAEKAAAKK 437



 Score = 29.1 bits (66), Expect = 6.5
 Identities = 9/24 (37%), Positives = 16/24 (66%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKV 405
           +K++K +KK +K + +K   KKK 
Sbjct: 416 KKQRKAEKKAEKEEAEKAAAKKKA 439


>gnl|CDD|235245 PRK04183, PRK04183, glutamyl-tRNA(Gln) amidotransferase subunit D;
           Validated.
          Length = 419

 Score = 32.5 bits (75), Expect = 0.54
 Identities = 10/39 (25%), Positives = 18/39 (46%), Gaps = 3/39 (7%)

Query: 370 NVGIAFDN---MRFPEKKKKKKKKKKKRKKKKKKKKKKV 405
           N+GI  D    +   EK +K K++   ++ +K      V
Sbjct: 40  NIGIDIDKIAEIELLEKGEKPKQEPPPKEIEKDPGLPNV 78


>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
          Length = 859

 Score = 32.5 bits (75), Expect = 0.55
 Identities = 17/67 (25%), Positives = 27/67 (40%), Gaps = 12/67 (17%)

Query: 340 GREMRAKFV--LLTHFSQRYAKLPRLNKDLSENVGIAFDNMRFPEKKKKKKKKKKKRKKK 397
           GRE RA  V   L   +++  KL  L    +++          P  KK   K  +K K++
Sbjct: 799 GRETRAPTVEEALELLAEKPEKLRYLADAPAKD----------PAGKKAAVKFSRKTKQQ 848

Query: 398 KKKKKKK 404
               +K 
Sbjct: 849 YVASEKD 855



 Score = 29.8 bits (68), Expect = 4.3
 Identities = 7/24 (29%), Positives = 9/24 (37%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKV 405
           EK +K +       K    KK  V
Sbjct: 816 EKPEKLRYLADAPAKDPAGKKAAV 839


>gnl|CDD|185611 PTZ00428, PTZ00428, 60S ribosomal protein L4; Provisional.
          Length = 381

 Score = 32.3 bits (74), Expect = 0.57
 Identities = 10/28 (35%), Positives = 16/28 (57%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNSI 409
            K  +K K ++ RKK KK +  KV  ++
Sbjct: 348 AKGAQKVKNRRARKKAKKARLAKVAKAL 375



 Score = 28.9 bits (65), Expect = 6.8
 Identities = 8/27 (29%), Positives = 14/27 (51%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNS 408
           +   K  +K K R+ +KK KK ++   
Sbjct: 345 KATAKGAQKVKNRRARKKAKKARLAKV 371


>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6.  RPL6
           contains KOW motif that has an extra ribosomal role as
           an oncogenic. KOW domain is known as an RNA-binding
           motif that is shared so far among some families of
           ribosomal proteins, the essential bacterial
           transcriptional elongation factor NusG, the eukaryotic
           chromatin elongation factor Spt5, the higher eukaryotic
           KIN17 proteins and Mtr4. .
          Length = 152

 Score = 31.4 bits (72), Expect = 0.58
 Identities = 28/83 (33%), Positives = 40/83 (48%), Gaps = 13/83 (15%)

Query: 366 DLSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKK 425
           D+S       ++  F  KKKKKKKKK+    ++KKKK  V  S +        R+ D K 
Sbjct: 67  DISGVKVPKLNDAYFKRKKKKKKKKKEGEFFEEKKKKYVV--SEE--------RKEDQKA 116

Query: 426 IS---LANLPKLKLFYPALKAMF 445
           +    LA + K+ L    LK+ F
Sbjct: 117 VDAALLAAIKKVPLLKGYLKSRF 139


>gnl|CDD|185507 PTZ00191, PTZ00191, 60S ribosomal protein L23a; Provisional.
          Length = 145

 Score = 30.8 bits (70), Expect = 0.69
 Identities = 14/28 (50%), Positives = 17/28 (60%)

Query: 383 KKKKKKKKKKKRKKKKKKKKKKVNNSIQ 410
            KKK KK  K  KK  K KK+KV  S++
Sbjct: 7   AKKKAKKAAKAAKKGVKVKKRKVRTSVR 34



 Score = 28.5 bits (64), Expect = 4.8
 Identities = 10/26 (38%), Positives = 15/26 (57%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNN 407
           +KK KK  K  K+  K KK+K + + 
Sbjct: 8   KKKAKKAAKAAKKGVKVKKRKVRTSV 33



 Score = 28.1 bits (63), Expect = 6.8
 Identities = 11/23 (47%), Positives = 13/23 (56%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
            KKK KK  K  +K  K KK+K 
Sbjct: 7   AKKKAKKAAKAAKKGVKVKKRKV 29


>gnl|CDD|193409 pfam12936, Kri1_C, KRI1-like family C-terminal.  The yeast member
           of this family (Kri1p) is found to be required for 40S
           ribosome biogenesis in the nucleolus. This is the
           C-terminal domain of the family.
          Length = 93

 Score = 29.9 bits (68), Expect = 0.81
 Identities = 9/25 (36%), Positives = 18/25 (72%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKV 405
            +++++K KKK  +K + ++ KKKV
Sbjct: 68  DKEERRKDKKKYGKKARLREWKKKV 92



 Score = 28.7 bits (65), Expect = 1.8
 Identities = 10/25 (40%), Positives = 17/25 (68%)

Query: 379 RFPEKKKKKKKKKKKRKKKKKKKKK 403
           R  E+++K KKK  K+ + ++ KKK
Sbjct: 67  RDKEERRKDKKKYGKKARLREWKKK 91



 Score = 27.6 bits (62), Expect = 5.0
 Identities = 10/23 (43%), Positives = 13/23 (56%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKK 403
             +K KKK  KK R ++ KKK  
Sbjct: 71  ERRKDKKKYGKKARLREWKKKVF 93


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 31.8 bits (73), Expect = 0.82
 Identities = 23/90 (25%), Positives = 37/90 (41%), Gaps = 25/90 (27%)

Query: 341 REMR----AKFVLLTHFSQRYAKL-------------PRLNKDLSENVGIAFDNMRF--- 380
           RE R     K     H S+R  +              P L   L+  + +  + + F   
Sbjct: 345 RETRDSIAKKIAEKLHTSKRKVRREVLPFLSIIFKHNPELAARLAAFLELTEEEIEFLTG 404

Query: 381 -----PEKKKKKKKKKKKRKKKKKKKKKKV 405
                 + KK  +K +KKR+++KK+KKKK 
Sbjct: 405 SKKATKKIKKIVEKAEKKREEEKKEKKKKA 434



 Score = 31.4 bits (72), Expect = 1.3
 Identities = 12/35 (34%), Positives = 19/35 (54%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPE 416
           EK +KK++++KK KKKK    KK     +     +
Sbjct: 417 EKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKK 451



 Score = 30.7 bits (70), Expect = 2.2
 Identities = 12/44 (27%), Positives = 25/44 (56%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKK 425
           +KK    KKK++ ++++K+KK++     +  +  E   E + KK
Sbjct: 431 KKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKK 474



 Score = 30.3 bits (69), Expect = 2.3
 Identities = 8/23 (34%), Positives = 16/23 (69%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
             +K +KK+++++K+KKKK    
Sbjct: 415 IVEKAEKKREEEKKEKKKKAFAG 437



 Score = 30.3 bits (69), Expect = 2.7
 Identities = 23/128 (17%), Positives = 51/128 (39%), Gaps = 14/128 (10%)

Query: 319 DELEKEAALKMHSTVSQAIR-----------IGREMRAKFVLLTHFSQRYAKLPRLNKDL 367
           D + K+ A K+H++  +  R              E+ A+       ++   +    +K  
Sbjct: 349 DSIAKKIAEKLHTSKRKVRREVLPFLSIIFKHNPELAARLAAFLELTEEEIEFLTGSKKA 408

Query: 368 SENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKKIS 427
           ++ +    +     EKK++++KK+KK+K    KKK++     +     E   E +  +  
Sbjct: 409 TKKIKKIVEKA---EKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEE 465

Query: 428 LANLPKLK 435
                + K
Sbjct: 466 KEEEEEKK 473


>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
          Length = 158

 Score = 30.9 bits (70), Expect = 0.83
 Identities = 11/24 (45%), Positives = 19/24 (79%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKV 405
            K+ ++K+  K +K+KK+KK+KKV
Sbjct: 103 SKETEEKEAIKAKKEKKEKKEKKV 126



 Score = 30.9 bits (70), Expect = 0.91
 Identities = 10/38 (26%), Positives = 21/38 (55%)

Query: 367 LSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKK 404
           L+E   I        +++K+  K+ ++++  K KK+KK
Sbjct: 82  LAEGYSINLFPEESEKEQKEVSKETEEKEAIKAKKEKK 119



 Score = 30.5 bits (69), Expect = 1.2
 Identities = 9/25 (36%), Positives = 16/25 (64%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKV 405
            EK++K+  K+ + K+  K KK+K 
Sbjct: 95  SEKEQKEVSKETEEKEAIKAKKEKK 119



 Score = 29.3 bits (66), Expect = 2.5
 Identities = 10/24 (41%), Positives = 17/24 (70%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
              K+ ++K+  K KK+KK+KK+K
Sbjct: 101 EVSKETEEKEAIKAKKEKKEKKEK 124



 Score = 29.3 bits (66), Expect = 2.8
 Identities = 9/25 (36%), Positives = 17/25 (68%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVN 406
           +K+  K+ ++K+  K KK+KK+K  
Sbjct: 99  QKEVSKETEEKEAIKAKKEKKEKKE 123



 Score = 28.6 bits (64), Expect = 4.2
 Identities = 9/24 (37%), Positives = 17/24 (70%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
            ++  K+ ++K+  K KK+KK+KK
Sbjct: 99  QKEVSKETEEKEAIKAKKEKKEKK 122



 Score = 28.2 bits (63), Expect = 5.4
 Identities = 9/23 (39%), Positives = 16/23 (69%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
           E+K+  K+ ++K   K KK+KK+
Sbjct: 98  EQKEVSKETEEKEAIKAKKEKKE 120



 Score = 28.2 bits (63), Expect = 7.0
 Identities = 11/24 (45%), Positives = 19/24 (79%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKV 405
           E+K+  K KK+K++KK+KK  +K+
Sbjct: 107 EEKEAIKAKKEKKEKKEKKVAEKL 130


>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
          Length = 1094

 Score = 32.1 bits (73), Expect = 0.84
 Identities = 21/52 (40%), Positives = 32/52 (61%), Gaps = 3/52 (5%)

Query: 386 KKKKKKKKRKKKKKKKKKKVNNSIQ-ISSRPEICREFDSKKISLANLPKLKL 436
           +KKKKKKK K+KKKKK ++ +  I  + +  E     +++K S    P+LKL
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQIHALENIEEGIEGSNNEKSSET--PQLKL 733



 Score = 29.5 bits (66), Expect = 5.9
 Identities = 13/23 (56%), Positives = 19/23 (82%)

Query: 383 KKKKKKKKKKKRKKKKKKKKKKV 405
           KKKKKKK K+K+KKK ++  K++
Sbjct: 685 KKKKKKKSKEKKKKKNREASKQI 707



 Score = 29.1 bits (65), Expect = 6.4
 Identities = 13/22 (59%), Positives = 18/22 (81%)

Query: 383 KKKKKKKKKKKRKKKKKKKKKK 404
           +KKKKKKK K++KKKK ++  K
Sbjct: 684 RKKKKKKKSKEKKKKKNREASK 705


>gnl|CDD|225816 COG3277, GAR1, RNA-binding protein involved in rRNA processing
           [Translation, ribosomal structure and biogenesis].
          Length = 98

 Score = 29.6 bits (67), Expect = 0.92
 Identities = 11/23 (47%), Positives = 15/23 (65%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKK 403
           P  K  +KK+K  RKK++ KK K
Sbjct: 76  PPDKLIRKKRKLPRKKRRPKKPK 98



 Score = 29.6 bits (67), Expect = 0.96
 Identities = 9/21 (42%), Positives = 14/21 (66%)

Query: 384 KKKKKKKKKKRKKKKKKKKKK 404
            K  +KK+K  +KK++ KK K
Sbjct: 78  DKLIRKKRKLPRKKRRPKKPK 98


>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family.  The yeast member of this family
           (Kri1p) is found to be required for 40S ribosome
           biogenesis in the nucleolus.
          Length = 99

 Score = 29.5 bits (67), Expect = 0.93
 Identities = 5/24 (20%), Positives = 18/24 (75%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKV 405
            K+++K +++++ K+ K  K++++
Sbjct: 3   RKEEEKAQREEELKRLKNLKREEI 26



 Score = 28.8 bits (65), Expect = 2.1
 Identities = 6/23 (26%), Positives = 17/23 (73%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
           E+K+++K ++++  K+ K  K++
Sbjct: 2   ERKEEEKAQREEELKRLKNLKRE 24



 Score = 28.4 bits (64), Expect = 2.8
 Identities = 5/25 (20%), Positives = 18/25 (72%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVN 406
           EK +++++ K+ +  K+++ ++K+ 
Sbjct: 7   EKAQREEELKRLKNLKREEIEEKLE 31



 Score = 28.4 bits (64), Expect = 2.9
 Identities = 6/24 (25%), Positives = 16/24 (66%)

Query: 383 KKKKKKKKKKKRKKKKKKKKKKVN 406
           K++K+++K ++ ++ K+ K  K  
Sbjct: 1   KERKEEEKAQREEELKRLKNLKRE 24


>gnl|CDD|240709 cd12263, RRM_ABT1_like, RNA recognition motif found in activator of
           basal transcription 1 (ABT1) and similar proteins.  This
           subfamily corresponds to the RRM of novel nuclear
           proteins termed ABT1 and its homologous counterpart,
           pre-rRNA-processing protein ESF2 (eighteen S factor 2),
           from yeast. ABT1 associates with the TATA-binding
           protein (TBP) and enhances basal transcription activity
           of class II promoters. Meanwhile, ABT1 could be a
           transcription cofactor that can bind to DNA in a
           sequence-independent manner. The yeast ABT1 homolog,
           ESF2, is a component of 90S preribosomes and 5'
           ETS-based RNPs. It is previously identified as a
           putative partner of the TATA-element binding protein.
           However, it is primarily localized to the nucleolus and
           physically associates with pre-rRNA processing factors.
           ESF2 may play a role in ribosome biogenesis. It is
           required for normal pre-rRNA processing, as well as for
           SSU processome assembly and function. Both ABT1 and ESF2
           contain an RNA recognition motif (RRM), also termed RBD
           (RNA binding domain) or RNP (ribonucleoprotein domain).
           .
          Length = 98

 Score = 29.5 bits (67), Expect = 1.1
 Identities = 14/33 (42%), Positives = 18/33 (54%), Gaps = 8/33 (24%)

Query: 381 PEKKKKKKKKKKKRKKKKKK--------KKKKV 405
           PE   K+K++KKK   KKKK        + KKV
Sbjct: 34  PEDPAKRKRRKKKGGNKKKKFTEGWVEFEDKKV 66


>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
           Genome duplication is precisely regulated by
           cyclin-dependent kinases CDKs, which bring about the
           onset of S phase by activating replication origins and
           then prevent relicensing of origins until mitosis is
           completed. The optimum sequence motif for CDK
           phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
           to have at least 11 potential phosphorylation sites.
           Drc1 is required for DNA synthesis and S-M replication
           checkpoint control. Drc1 associates with Cdc2 and is
           phosphorylated at the onset of S phase when Cdc2 is
           activated. Thus Cdc2 promotes DNA replication by
           phosphorylating Drc1 and regulating its association with
           Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
           substrates required for DNA replication.
          Length = 397

 Score = 31.3 bits (71), Expect = 1.1
 Identities = 16/49 (32%), Positives = 28/49 (57%), Gaps = 1/49 (2%)

Query: 358 AKLPRLNKDLSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKKVN 406
                L++ L    GI  D+    + ++ K++ +KK+K KKK +K+KVN
Sbjct: 323 LDEKSLSEFLGYMGGIDEDDEDE-DDEESKEEVEKKQKVKKKPRKRKVN 370


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 30.5 bits (69), Expect = 1.1
 Identities = 12/54 (22%), Positives = 23/54 (42%), Gaps = 7/54 (12%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKKISLANLPKL 434
            E +++   + K   K+KKK++     + +   + E  +   SK       PKL
Sbjct: 107 DETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSK-------PKL 153


>gnl|CDD|219978 pfam08701, GN3L_Grn1, GNL3L/Grn1 putative GTPase.  Grn1 (yeast) and
           GNL3L (human) are putative GTPases which are required
           for growth and play a role in processing of nucleolar
           pre-rRNA. This family contains a potential nuclear
           localisation signal.
          Length = 80

 Score = 29.1 bits (66), Expect = 1.1
 Identities = 10/22 (45%), Positives = 13/22 (59%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKK 403
           E  +K +K+ KK    K KKKK
Sbjct: 12  EHHRKLRKEAKKNPTWKSKKKK 33



 Score = 26.8 bits (60), Expect = 6.1
 Identities = 12/32 (37%), Positives = 18/32 (56%), Gaps = 4/32 (12%)

Query: 377 NMRFPEKKKKKKKKKKKRKKKKK----KKKKK 404
            +R+  +KK  +  +K RK+ KK    K KKK
Sbjct: 1   RLRYKIEKKVAEHHRKLRKEAKKNPTWKSKKK 32


>gnl|CDD|233160 TIGR00869, sec62, protein translocation protein, Sec62 family.
           Members of the NSCC2 family have been sequenced from
           various yeast, fungal and animals species including
           Saccharomyces cerevisiae, Drosophila melanogaster and
           Homo sapiens. These proteins are the Sec62 proteins,
           believed to be associated with the Sec61 and Sec63
           constituents of the general protein secretary systems of
           yeast microsomes. They are also the non-selective cation
           (NS) channels of the mammalian cytoplasmic membrane. The
           yeast Sec62 protein has been shown to be essential for
           cell growth. The mammalian NS channel proteins has been
           implicated in platelet derived growth factor(PGDF)
           dependent single channel current in fibroblasts. These
           channels are essentially closed in serum deprived
           tissue-culture cells and are specifically opened by
           exposure to PDGF. These channels are reported to exhibit
           equal selectivity for Na+, K+ and Cs+ with low
           permeability to Ca2+, and no permeability to anions
           [Transport and binding proteins, Amino acids, peptides
           and amines].
          Length = 232

 Score = 31.0 bits (70), Expect = 1.1
 Identities = 14/28 (50%), Positives = 17/28 (60%)

Query: 380 FPEKKKKKKKKKKKRKKKKKKKKKKVNN 407
           + EK K   KKK K KK KKK+ K+  N
Sbjct: 205 WHEKDKYSYKKKLKSKKLKKKQAKREAN 232


>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
           homolog [Translation, ribosomal structure and
           biogenesis].
          Length = 395

 Score = 31.2 bits (71), Expect = 1.1
 Identities = 11/25 (44%), Positives = 18/25 (72%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKV 405
           PE+ KK++  + +RKKK+KK K + 
Sbjct: 366 PERDKKERPGRYRRKKKEKKAKSER 390



 Score = 31.2 bits (71), Expect = 1.2
 Identities = 9/26 (34%), Positives = 16/26 (61%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVN 406
           P  K K ++ KK+R  + ++KKK+  
Sbjct: 360 PPTKAKPERDKKERPGRYRRKKKEKK 385



 Score = 29.7 bits (67), Expect = 3.8
 Identities = 9/24 (37%), Positives = 16/24 (66%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
             ++ KK++  + R+KKK+KK K 
Sbjct: 365 KPERDKKERPGRYRRKKKEKKAKS 388



 Score = 29.7 bits (67), Expect = 3.9
 Identities = 7/26 (26%), Positives = 18/26 (69%)

Query: 379 RFPEKKKKKKKKKKKRKKKKKKKKKK 404
              +KK++  + ++K+K+KK K +++
Sbjct: 366 PERDKKERPGRYRRKKKEKKAKSERR 391



 Score = 29.3 bits (66), Expect = 5.4
 Identities = 8/28 (28%), Positives = 17/28 (60%)

Query: 379 RFPEKKKKKKKKKKKRKKKKKKKKKKVN 406
           R  +++  + ++KKK KK K +++   N
Sbjct: 368 RDKKERPGRYRRKKKEKKAKSERRGLQN 395



 Score = 28.5 bits (64), Expect = 8.1
 Identities = 6/26 (23%), Positives = 16/26 (61%)

Query: 379 RFPEKKKKKKKKKKKRKKKKKKKKKK 404
           + P+   K K ++ K+++  + ++KK
Sbjct: 356 KPPKPPTKAKPERDKKERPGRYRRKK 381


>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2.  It is vital for
           effective cell-replication that replication is not
           stalled at any point by, for instance, damaged bases.
           Rtf2 stabilizes the replication fork stalled at the
           site-specific replication barrier RTS1 by preventing
           replication restart until completion of DNA synthesis by
           a converging replication fork initiated at a flanking
           origin. The RTS1 element terminates replication forks
           that are moving in the cen2-distal direction while
           allowing forks moving in the cen2-proximal direction to
           pass through the region. Rtf2 contains a C2HC2 motif
           related to the C3HC4 RING-finger motif, and would appear
           to fold up, creating a RING finger-like structure but
           forming only one functional Zn2+ ion-binding site.
          Length = 254

 Score = 30.8 bits (70), Expect = 1.2
 Identities = 13/26 (50%), Positives = 20/26 (76%)

Query: 383 KKKKKKKKKKKRKKKKKKKKKKVNNS 408
           K + ++++ KK+KKKKKKK KK N +
Sbjct: 173 KARLEEERAKKKKKKKKKKTKKNNAT 198



 Score = 30.0 bits (68), Expect = 2.3
 Identities = 13/27 (48%), Positives = 20/27 (74%)

Query: 383 KKKKKKKKKKKRKKKKKKKKKKVNNSI 409
            K + ++++ K+KKKKKKKK K NN+ 
Sbjct: 172 LKARLEEERAKKKKKKKKKKTKKNNAT 198



 Score = 30.0 bits (68), Expect = 2.6
 Identities = 19/57 (33%), Positives = 26/57 (45%), Gaps = 2/57 (3%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKKISLANLPKLKLFY 438
           E+++ KKKKKKK+KK KK      +    +SS      E  S    +    KLK   
Sbjct: 177 EEERAKKKKKKKKKKTKKNNATGSSAEATVSSAV--PTELSSGAGQVGEAKKLKKKR 231



 Score = 29.2 bits (66), Expect = 4.6
 Identities = 11/26 (42%), Positives = 18/26 (69%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNN 407
           + + ++++ KKK+KKKKKK KK    
Sbjct: 173 KARLEEERAKKKKKKKKKKTKKNNAT 198


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This
           family consists of several bovine specific leukaemia
           virus receptors which are thought to function as
           transmembrane proteins, although their exact function is
           unknown.
          Length = 561

 Score = 31.2 bits (70), Expect = 1.3
 Identities = 13/24 (54%), Positives = 18/24 (75%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKV 405
           +K KK KKK+KK K+K++ K KK 
Sbjct: 197 KKSKKPKKKEKKEKEKERDKDKKK 220



 Score = 31.2 bits (70), Expect = 1.6
 Identities = 13/47 (27%), Positives = 26/47 (55%)

Query: 369 ENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRP 415
            +V       + P+KK+KK+K+K++ K KKK+ +   +  + +   P
Sbjct: 190 GDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDSP 236



 Score = 30.4 bits (68), Expect = 2.4
 Identities = 9/26 (34%), Positives = 20/26 (76%)

Query: 379 RFPEKKKKKKKKKKKRKKKKKKKKKK 404
           R  ++ +K K++KKKR+K+K+ +++ 
Sbjct: 89  RHRQRLEKDKREKKKREKEKRGRRRH 114



 Score = 30.0 bits (67), Expect = 3.5
 Identities = 13/26 (50%), Positives = 18/26 (69%)

Query: 379 RFPEKKKKKKKKKKKRKKKKKKKKKK 404
           +   K KKKK++K+K +KKKKKK   
Sbjct: 275 KKSPKHKKKKQRKEKEEKKKKKKHHH 300



 Score = 30.0 bits (67), Expect = 3.6
 Identities = 9/23 (39%), Positives = 20/23 (86%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
           E++ +++ +K KR+KKK++K+K+
Sbjct: 87  ERRHRQRLEKDKREKKKREKEKR 109



 Score = 29.3 bits (65), Expect = 4.8
 Identities = 13/33 (39%), Positives = 24/33 (72%)

Query: 383 KKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRP 415
           KKKK++K+K+++KKKKK    + ++S   + +P
Sbjct: 281 KKKKQRKEKEEKKKKKKHHHHRCHHSDGGAEQP 313



 Score = 29.3 bits (65), Expect = 5.9
 Identities = 13/24 (54%), Positives = 18/24 (75%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
           P+ KKKK++K+K+ KKKKKK    
Sbjct: 278 PKHKKKKQRKEKEEKKKKKKHHHH 301



 Score = 28.9 bits (64), Expect = 7.6
 Identities = 8/21 (38%), Positives = 16/21 (76%)

Query: 382 EKKKKKKKKKKKRKKKKKKKK 402
           EK K++KKK++K K+ +++  
Sbjct: 95  EKDKREKKKREKEKRGRRRHH 115



 Score = 28.5 bits (63), Expect = 9.0
 Identities = 9/39 (23%), Positives = 24/39 (61%)

Query: 378 MRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPE 416
           ++  E+++ +++ +K +++KKK++K+K       S   E
Sbjct: 82  VKLEEERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTE 120


>gnl|CDD|99903 cd06080, MUM1_like, Mutated melanoma-associated antigen 1 (MUM-1)
           is a melanoma-associated antigen (MAA).  MUM-1 belongs
           to the mutated or aberrantly expressed type of MAAs,
           along with antigens such as CDK4, beta-catenin,
           gp100-in4, p15, and N-acetylglucosaminyltransferase V.
           It is highly expressed in several types of human
           cancers.  The PWWP domain, named for a conserved
           Pro-Trp-Trp-Pro motif, is a small domain consisting of
           100-150 amino acids. The PWWP domain is found in
           numerous proteins that are involved in cell division,
           growth and differentiation. Most PWWP-domain proteins
           seem to be nuclear, often DNA-binding, proteins that
           function as transcription factors regulating a variety
           of developmental processes.
          Length = 80

 Score = 28.9 bits (65), Expect = 1.3
 Identities = 14/54 (25%), Positives = 21/54 (38%), Gaps = 6/54 (11%)

Query: 364 NKDLSENVGIAFDNMRFPEKKKKKKKKKKKR------KKKKKKKKKKVNNSIQI 411
            K     V    DNM+  +K  +  K+  K       +K   K K+    +IQI
Sbjct: 26  RKKQKARVNFIGDNMQSEKKGIRVVKRWLKHFDCTEKQKLTNKAKESYEQAIQI 79


>gnl|CDD|234368 TIGR03835, termin_org_DnaJ, terminal organelle assembly protein
           TopJ.  This model describes TopJ (MG_200, CbpA), a DnaJ
           homolog and probable assembly protein of the Mycoplasma
           terminal organelle. The terminal organelle is involved
           in both cytadherence and gliding motility [Cellular
           processes, Chemotaxis and motility].
          Length = 871

 Score = 31.3 bits (70), Expect = 1.4
 Identities = 28/120 (23%), Positives = 43/120 (35%), Gaps = 22/120 (18%)

Query: 374 AFDNMRFPEKKKKKKKKKKKRK---KKKKKKKKKVNNSIQISSRPEICREFDSKKISLAN 430
            FDN+   +   KKKKKKKK+     K K++ K +              E      S+  
Sbjct: 96  EFDNITDDDTPSKKKKKKKKKGWFWAKSKQESKTIETE-----------EIIDVGASVNQ 144

Query: 431 LPKLKLFYPALKAMFAE-----YQDEIENRAIRRNLKQESTYLARYPSENDKEKPGSLDD 485
             + +LF   L     E       D+ E      +  +E ++ A    E   E P   +D
Sbjct: 145 ANETRLFDDTLDDQLEESVSTQSTDDGE---RLFDQNKEPSFTAGLDDEVQSEVPQPEED 201


>gnl|CDD|233605 TIGR01865, cas_Csn1, CRISPR-associated protein Cas9/Csn1, subtype
           II/NMEMI.  CRISPR loci appear to be mobile elements with
           a wide host range. This model represents a protein found
           only in CRISPR-containing species, near other
           CRISPR-associated proteins (cas), as part of the NMENI
           subtype of CRISPR/Cas locus. The species range so far
           for this protein is animal pathogens and commensals only
           [Mobile and extrachromosomal element functions, Other].
          Length = 805

 Score = 31.2 bits (71), Expect = 1.4
 Identities = 17/60 (28%), Positives = 30/60 (50%), Gaps = 5/60 (8%)

Query: 379 RFPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKKISLANLPKLKLFY 438
           R  +     K+  K+R KK + K K+  +++      EI +E  ++  S   L KL+L+Y
Sbjct: 524 REEQGTNFGKRNSKERYKKNEDKIKEFASALGK----EILKEEPTENSSKNIL-KLRLYY 578


>gnl|CDD|220357 pfam09716, ETRAMP, Malarial early transcribed membrane protein
           (ETRAMP).  These sequences represent a family of
           proteins from the malaria parasite Plasmodium
           falciparum, several of which have been shown to be
           expressed specifically in the ring stage as well as the
           rodent parasite Plasmodium yoelii. A homologue from
           Plasmodium chabaudi was localized to the parasitophorous
           vacuole membrane. Members have an initial hydrophobic,
           Phe/Tyr-rich, stretch long enough to span the membrane,
           a highly charged region rich in Lys, a second putative
           transmembrane region and a second highly charged, low
           complexity sequence region. Some members have up to 100
           residues of additional C-terminal sequence. These genes
           have been shown to be found in the sub-telomeric regions
           of both Plasmodium falciparum and P. yoelii chromosomes.
          Length = 82

 Score = 28.8 bits (65), Expect = 1.5
 Identities = 17/33 (51%), Positives = 18/33 (54%), Gaps = 2/33 (6%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISS 413
             KKKK KK KK     +KKKK K    I ISS
Sbjct: 28  VAKKKKLKKLKKIDDDLEKKKKNK--KKIIISS 58


>gnl|CDD|237276 PRK13024, PRK13024, bifunctional preprotein translocase subunit
           SecD/SecF; Reviewed.
          Length = 755

 Score = 31.0 bits (71), Expect = 1.5
 Identities = 12/22 (54%), Positives = 18/22 (81%)

Query: 384 KKKKKKKKKKRKKKKKKKKKKV 405
           +K++ KKKKKRKK KK + +K+
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEKI 754



 Score = 29.8 bits (68), Expect = 3.7
 Identities = 11/21 (52%), Positives = 17/21 (80%)

Query: 382 EKKKKKKKKKKKRKKKKKKKK 402
           EK++ KKKKK+K+ KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753



 Score = 28.7 bits (65), Expect = 9.1
 Identities = 9/22 (40%), Positives = 16/22 (72%)

Query: 385 KKKKKKKKKRKKKKKKKKKKVN 406
           +K++ KKKK++KK KK + +  
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEKI 754


>gnl|CDD|183052 PRK11244, phnP, carbon-phosphorus lyase complex accessory protein;
           Provisional.
          Length = 250

 Score = 30.3 bits (69), Expect = 1.6
 Identities = 28/100 (28%), Positives = 47/100 (47%), Gaps = 6/100 (6%)

Query: 285 SGHKITYSGDT--MPCDALVSIGKNS-DLLIHEATHEDELEKEAALKMHSTVSQAIRIGR 341
           +  ++ Y  DT  +P D L  +  N  DLL+ + +H     +E A + H+ ++ A+ I  
Sbjct: 153 AHSRVAYLTDTVGLPEDTLKFLRNNQPDLLVLDCSHP---PQEDAPRNHNDLTTALAIIE 209

Query: 342 EMRAKFVLLTHFSQRYAKLPRLNKDLSENVGIAFDNMRFP 381
            +R   V+LTH S +       N  L   V +A+D M   
Sbjct: 210 VLRPPRVILTHISHQLDAWLMENAALPSGVEVAYDGMEIG 249


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 31.0 bits (70), Expect = 1.7
 Identities = 15/42 (35%), Positives = 16/42 (38%)

Query: 386 KKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKKIS 427
           + K KK K KKK KK KKK           E     D  K  
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSE 184



 Score = 29.4 bits (66), Expect = 5.1
 Identities = 12/31 (38%), Positives = 18/31 (58%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNSIQIS 412
           + KK K KKK K+ KKK+ K+    +  + S
Sbjct: 145 KAKKGKAKKKTKKSKKKEAKESSDKDDEEES 175



 Score = 28.6 bits (64), Expect = 8.4
 Identities = 11/24 (45%), Positives = 15/24 (62%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
            +K K KKK KK +KK+ K+   K
Sbjct: 146 AKKGKAKKKTKKSKKKEAKESSDK 169



 Score = 28.6 bits (64), Expect = 8.8
 Identities = 10/26 (38%), Positives = 13/26 (50%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVN 406
             K KKK KK KK++ K+   K    
Sbjct: 148 KGKAKKKTKKSKKKEAKESSDKDDEE 173


>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are approximately 90 amino acids in length.
           There are two completely conserved L residues that may
           be functionally important.
          Length = 70

 Score = 28.1 bits (63), Expect = 1.8
 Identities = 11/43 (25%), Positives = 28/43 (65%)

Query: 362 RLNKDLSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKK 404
            LN DL   +      ++  E+K++++++ +KR+++K+++K K
Sbjct: 13  MLNADLLAKLKAKKKELKAEEEKREEEEEARKREERKEREKNK 55


>gnl|CDD|232831 TIGR00115, tig, trigger factor.  Trigger factor is a
           ribosome-associated molecular chaperone and is the first
           chaperone to interact with nascent polypeptide. Trigger
           factor can bind at the same time as the signal
           recognition particle (SRP), but is excluded by the SRP
           receptor (FtsY). The central domain of trigger factor
           has peptidyl-prolyl cis/trans isomerase activity. This
           protein is found in a single copy in virtually every
           bacterial genome [Protein fate, Protein folding and
           stabilization].
          Length = 408

 Score = 30.6 bits (70), Expect = 1.8
 Identities = 24/111 (21%), Positives = 49/111 (44%), Gaps = 13/111 (11%)

Query: 359 KLPRLNKDLSENVGIAFDNM-RFPEKKKKKKKKKKKRKKKKKKKKKKV-----NNSIQIS 412
           +LP L+ + ++++G  F+ +       +K  +++KK + K K K++ +     NN  ++ 
Sbjct: 233 ELPELDDEFAKSLGEEFETLEELKADIRKNLEREKKERAKNKLKEQLLDKLVENNEFEL- 291

Query: 413 SRPE--ICREFDSKKISLANLPKLKLFYP--ALKAMFAEYQDEIENRAIRR 459
             PE  + +E D          + +       LK    E ++E    A RR
Sbjct: 292 --PESLVEQEIDRLLEQALQQLQQQGIDLEEYLKDTEEELREEFREEAERR 340


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 30.6 bits (69), Expect = 1.9
 Identities = 9/24 (37%), Positives = 17/24 (70%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
            E K +  K+++K K++ K++KKK
Sbjct: 96  KEPKNESGKEEEKEKEQVKEEKKK 119



 Score = 30.2 bits (68), Expect = 3.0
 Identities = 10/25 (40%), Positives = 19/25 (76%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKV 405
           P+ +  K+++K+K + K++KKKKK 
Sbjct: 98  PKNESGKEEEKEKEQVKEEKKKKKE 122



 Score = 29.5 bits (66), Expect = 4.3
 Identities = 13/44 (29%), Positives = 24/44 (54%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKK 425
           +++ K++KKKKK K K++ K +K     +    P+   +   KK
Sbjct: 110 KEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKK 153



 Score = 29.5 bits (66), Expect = 5.1
 Identities = 10/23 (43%), Positives = 20/23 (86%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKK 403
            E++K+K++ K+++KKKK+K K+
Sbjct: 104 KEEEKEKEQVKEEKKKKKEKPKE 126



 Score = 29.5 bits (66), Expect = 5.1
 Identities = 10/45 (22%), Positives = 25/45 (55%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKK 425
              K+K+K+K+KK ++ + ++++K    ++  SRP+   +     
Sbjct: 141 RPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPN 185



 Score = 29.1 bits (65), Expect = 6.1
 Identities = 11/24 (45%), Positives = 20/24 (83%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
            E  K+++K+K++ K++KKKKK+K
Sbjct: 100 NESGKEEEKEKEQVKEEKKKKKEK 123



 Score = 29.1 bits (65), Expect = 7.0
 Identities = 14/45 (31%), Positives = 26/45 (57%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKK 425
            EK++ K++KKKK++K K++ K +         RP   +E + +K
Sbjct: 108 KEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEK 152


>gnl|CDD|221012 pfam11169, DUF2956, Protein of unknown function (DUF2956).  This
           family of proteins with unknown function appears to be
           restricted to Gammaproteobacteria.
          Length = 103

 Score = 28.8 bits (65), Expect = 1.9
 Identities = 11/39 (28%), Positives = 18/39 (46%), Gaps = 7/39 (17%)

Query: 372 GIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQ 410
           GIA       E KK++K K ++  K +K++ K       
Sbjct: 35  GIA-------EYKKQQKAKAREADKARKQQLKAKQRQAA 66


>gnl|CDD|218612 pfam05501, DUF755, Domain of unknown function (DUF755).  This
           family is predominated by ORFs from Circoviridae. The
           function of this family remains to be determined.
          Length = 122

 Score = 29.3 bits (66), Expect = 2.0
 Identities = 18/64 (28%), Positives = 29/64 (45%), Gaps = 7/64 (10%)

Query: 350 LTHFSQRYAKLPRLNKDLSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSI 409
           L H SQ+  +  R  K+  ++        +  +KK+KK +    R  KK   KK+  +S 
Sbjct: 54  LIHLSQQLQQPQRKRKEQQKSS-------QTHKKKRKKSRHVSSRSAKKISAKKRRRSSS 106

Query: 410 QISS 413
             SS
Sbjct: 107 SSSS 110


>gnl|CDD|221857 pfam12923, RRP7, Ribosomal RNA-processing protein 7 (RRP7).  RRP7
           is an essential protein in yeast that is involved in
           pre-rRNA processing and ribosome assembly. It is
           speculated to be required for correct assembly of rpS27
           into the pre-ribosomal particle.
          Length = 131

 Score = 29.1 bits (66), Expect = 2.0
 Identities = 10/25 (40%), Positives = 18/25 (72%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVN 406
            + K  ++++K ++KKKKKKK+  N
Sbjct: 67  SRNKAAEERRKLKEKKKKKKKELEN 91



 Score = 28.0 bits (63), Expect = 5.1
 Identities = 10/25 (40%), Positives = 16/25 (64%)

Query: 380 FPEKKKKKKKKKKKRKKKKKKKKKK 404
               K  ++++K K KKKKKKK+ +
Sbjct: 66  ASRNKAAEERRKLKEKKKKKKKELE 90


>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
           eukaryotic RNase H complex which cleaves RNA-DNA
           hybrids.  Ribonuclease H2B is one of the three proteins
           of eukaryotic RNase H2 complex that is required for
           nucleic acid binding and hydrolysis. RNase H is
           classified into two families, type I (prokaryotic RNase
           HI, eukaryotic RNase H1 and viral RNase H) and type II
           (prokaryotic RNase HII and HIII, and eukaryotic RNase
           H2/HII). RNase H endonucleolytically hydrolyzes an RNA
           strand when it is annealed to a complementary DNA strand
           in the presence of divalent cations, in DNA replication
           and repair. The enzyme can be found in bacteria,
           archaea, and eukaryotes. Most prokaryotic and eukaryotic
           genomes contain multiple RNase H genes. Despite a lack
           of evidence for homology from sequence comparisons, type
           I and type II RNase H share a common fold and similar
           steric configurations of the four acidic active-site
           residues, suggesting identical or very similar catalytic
           mechanisms. Eukaryotic RNase HII is active during
           replication and is believed to play a role in removal of
           Okazaki fragment primers and single ribonucleotides in
           DNA-DNA duplexes. Eukaryotic RNase HII is functional
           when it forms a complex with RNase H2B and RNase H2C
           proteins. It is speculated that the two accessory
           subunits are required for correct folding of the
           catalytic subunit of RNase HII. Mutations in the three
           subunits of human RNase HII cause neurological disorder.
          Length = 211

 Score = 30.0 bits (68), Expect = 2.0
 Identities = 17/33 (51%), Positives = 20/33 (60%), Gaps = 1/33 (3%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISS 413
             +++KKKKKKK  KKKK KK   V     ISS
Sbjct: 177 NAEEEKKKKKKKSAKKKKLKKVAAVGMK-AISS 208



 Score = 28.8 bits (65), Expect = 5.3
 Identities = 14/26 (53%), Positives = 15/26 (57%)

Query: 377 NMRFPEKKKKKKKKKKKRKKKKKKKK 402
           N    EKKKKKKK  KK+K KK    
Sbjct: 176 NNAEEEKKKKKKKSAKKKKLKKVAAV 201



 Score = 28.8 bits (65), Expect = 5.7
 Identities = 12/29 (41%), Positives = 18/29 (62%)

Query: 377 NMRFPEKKKKKKKKKKKRKKKKKKKKKKV 405
                +KK    +++KK+KKKK  KKKK+
Sbjct: 167 EAARDKKKSNNAEEEKKKKKKKSAKKKKL 195



 Score = 28.0 bits (63), Expect = 8.8
 Identities = 14/36 (38%), Positives = 19/36 (52%), Gaps = 1/36 (2%)

Query: 374 AFDNMRFPEKKKKKKKKKKKRKKKKKKKK-KKVNNS 408
           A       +    +++KKKK+KK  KKKK KKV   
Sbjct: 166 AEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAV 201


>gnl|CDD|151885 pfam11446, DUF2897, Protein of unknown function (DUF2897).  This is
           a bacterial family of uncharacterized proteins.
          Length = 55

 Score = 27.6 bits (62), Expect = 2.1
 Identities = 10/27 (37%), Positives = 16/27 (59%)

Query: 377 NMRFPEKKKKKKKKKKKRKKKKKKKKK 403
            M+FP+  K+KKKK K+  ++  K   
Sbjct: 28  KMKFPQFDKQKKKKAKRNAEQDAKNSA 54


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 30.8 bits (70), Expect = 2.2
 Identities = 11/23 (47%), Positives = 19/23 (82%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
           E+K+  K++K+ RK KK+++KKK
Sbjct: 558 EEKEALKEQKRLRKLKKQEEKKK 580



 Score = 30.0 bits (68), Expect = 3.2
 Identities = 10/23 (43%), Positives = 19/23 (82%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
           EK+  K++K+ ++ KK+++KKKK
Sbjct: 559 EKEALKEQKRLRKLKKQEEKKKK 581



 Score = 29.6 bits (67), Expect = 4.9
 Identities = 10/23 (43%), Positives = 19/23 (82%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
           EK++K+  K++KR +K KK+++K
Sbjct: 556 EKEEKEALKEQKRLRKLKKQEEK 578


>gnl|CDD|222366 pfam13764, E3_UbLigase_R4, E3 ubiquitin-protein ligase UBR4.  This
           is a family of E## ubiquitin ligase enzymes.
          Length = 794

 Score = 30.8 bits (70), Expect = 2.3
 Identities = 11/56 (19%), Positives = 25/56 (44%)

Query: 368 SENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDS 423
            E V    D +R   + +K++     R+K+ +    + ++  QI + P +    +S
Sbjct: 403 KEGVAKKIDEVRDETRAEKRRLAMAMREKQLQALGMRTSSGGQIVASPRLLEGIES 458


>gnl|CDD|217023 pfam02414, Borrelia_orfA, Borrelia ORF-A.  This protein is encoded
           by an open reading frame in plasmid borne DNA repeats of
           Borrelia species. This protein is known as ORF-A. The
           function of this putative protein is unknown.
          Length = 285

 Score = 30.0 bits (68), Expect = 2.3
 Identities = 15/56 (26%), Positives = 27/56 (48%), Gaps = 2/56 (3%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVN--NSIQISSRPEICREFDSKKISLANLPKLK 435
           +++ KKK  K   K K KK  KK N   +++   + ++ +    ++  L    KLK
Sbjct: 144 KEESKKKNSKNIEKLKIKKYFKKCNFKTNLKKDIKIKLLKIIKIEENFLIKNKKLK 199


>gnl|CDD|224158 COG1237, COG1237, Metal-dependent hydrolases of the beta-lactamase
           superfamily II [General function prediction only].
          Length = 259

 Score = 30.1 bits (68), Expect = 2.3
 Identities = 16/50 (32%), Positives = 24/50 (48%), Gaps = 2/50 (4%)

Query: 137 ILLDCGEGTYSQLVRLYGSAVDTLLSQLSAVYISHLHADHHLGLFSVIKA 186
           IL D G  T S ++      +   L  + AV +SH H DH  GL  +++ 
Sbjct: 34  ILFDTG--TDSDVLLHNARLLGVDLRDIDAVVLSHGHYDHTGGLPYLLEE 81


>gnl|CDD|240576 cd12932, RRP7_like, RRP7 domain ribosomal RNA-processing protein 7
           (Rrp7p), ribosomal RNA-processing protein 7 homolog A
           (Rrp7A), and similar proteins.  This CD corresponds to
           the RRP7 domain of Rrp7p and Rrp7A. Rrp7p is encoded by
           YCL031C gene from Saccharomyces cerevisiae. It is an
           essential yeast protein involved in pre-rRNA processing
           and ribosome assembly, and is speculated to be required
           for correct assembly of rpS27 into the pre-ribosomal
           particle. Rrp7A, also termed gastric cancer antigen
           Zg14, is the Rrp7p homolog mainly found in Metazoans.
           The cellular function of Rrp7A remains unclear
           currently. Both Rrp7p and Rrp7A harbor an N-terminal RNA
           recognition motif (RRM), also termed RBD (RNA binding
           domain) or RNP (ribonucleoprotein domain), and a
           C-terminal RRP7 domain.
          Length = 118

 Score = 28.8 bits (65), Expect = 2.3
 Identities = 10/26 (38%), Positives = 17/26 (65%)

Query: 379 RFPEKKKKKKKKKKKRKKKKKKKKKK 404
           +   ++  + K K+K KKKKKKK+ +
Sbjct: 64  KTAREEAVEAKAKEKEKKKKKKKELE 89



 Score = 27.2 bits (61), Expect = 9.9
 Identities = 9/23 (39%), Positives = 17/23 (73%)

Query: 384 KKKKKKKKKKRKKKKKKKKKKVN 406
           ++  + K K+++KKKKKKK+  +
Sbjct: 68  EEAVEAKAKEKEKKKKKKKELED 90


>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
           Validated.
          Length = 860

 Score = 30.6 bits (69), Expect = 2.4
 Identities = 14/35 (40%), Positives = 17/35 (48%), Gaps = 3/35 (8%)

Query: 379 RFPEKKKKKKKKK---KKRKKKKKKKKKKVNNSIQ 410
           + P +KK K KKK    K K  K  KKK    S +
Sbjct: 744 KTPYEKKTKAKKKSASTKGKAAKTVKKKSKAKSKK 778



 Score = 29.0 bits (65), Expect = 6.7
 Identities = 11/35 (31%), Positives = 14/35 (40%)

Query: 383 KKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEI 417
            K K  K  KK+ K K KK  K       +  P +
Sbjct: 760 TKGKAAKTVKKKSKAKSKKTTKKRAGPLYTPSPAL 794



 Score = 29.0 bits (65), Expect = 7.5
 Identities = 11/24 (45%), Positives = 12/24 (50%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKV 405
            K  K  KKK K K KK  KK+  
Sbjct: 762 GKAAKTVKKKSKAKSKKTTKKRAG 785


>gnl|CDD|220600 pfam10147, CR6_interact, Growth arrest and DNA-damage-inducible
           proteins-interacting protein 1.  Members of this family
           of proteins act as negative regulators of G1 to S cell
           cycle phase progression by inhibiting cyclin-dependent
           kinases. Inhibitory effects are additive with GADD45
           proteins but occur also in the absence of GADD45
           proteins. Furthermore, they act as a repressor of the
           orphan nuclear receptor NR4A1 by inhibiting AB
           domain-mediated transcriptional activity.
          Length = 217

 Score = 29.8 bits (67), Expect = 2.4
 Identities = 26/92 (28%), Positives = 44/92 (47%), Gaps = 13/92 (14%)

Query: 318 EDELEKEAALKMHSTVSQAIRIGREMRAKFVLLTHFSQRYAK--LPRLNKDLSENVGIAF 375
           E E+ K  A KM   ++      R+   K         R AK    RL  +  E+ G   
Sbjct: 123 EAEIAKNMA-KMPQMIADWRAQKRKREQK--------ARAAKERKERLVAEAREHFGYWV 173

Query: 376 D--NMRFPEKKKKKKKKKKKRKKKKKKKKKKV 405
           D  + RF E  ++K+K++KK+ K+ K+++K+ 
Sbjct: 174 DPRDPRFQEMLQQKEKEEKKKVKEAKRREKEE 205


>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
           eukaryotic snRNP [Transcription].
          Length = 564

 Score = 30.4 bits (69), Expect = 2.5
 Identities = 38/165 (23%), Positives = 60/165 (36%), Gaps = 19/165 (11%)

Query: 318 EDELEK--EAALKMHSTVSQAIRIGREMRAKFVLLTHFSQRYAKLPRLNKDLSENVGIAF 375
           E EL+K      K    + +  +   E+R K  LL      YA L  + + L       F
Sbjct: 292 EKELKKLENKLEKQEDELEELEKAAEELRQKGELL------YANLQLIEEGLKSVRLADF 345

Query: 376 DNMRFPEKKKKKKKKK-------KKRKKKKKKKKKKVNNSIQISSRPEICREFDSKKISL 428
                 E K +  K K       +  KK KK K  KVN   Q+S   E    ++S K +L
Sbjct: 346 YGN--EEIKIELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQLSELKEAIAYYESAKTAL 403

Query: 429 ANLPKLKLFYPALKAMFAEYQDEIENRAIRRNLKQESTYLARYPS 473
                 K      + +  E    ++++  +R  K+       + S
Sbjct: 404 EKAEGKKAIEEIREELIEE--GLLKSKKKKRKKKEWFEKFRWFVS 446


>gnl|CDD|219882 pfam08524, rRNA_processing, rRNA processing.  This is a family of
           proteins that are involved in rRNA processing. In a
           localisation study they were found to localise to the
           nucleus and nucleolus. The family also includes other
           metazoa members from plants to mammals where the protein
           has been named BR22 and is associated with TTF-1,
           thyroid transcription factor 1. In the lungs, the family
           binds TTF-1 to form a complex which influences the
           expression of the key lung surfactant protein-B (SP-B)
           and -C (SP-C), the small hydrophobic surfactant proteins
           that maintain surface tension in alveoli.
          Length = 150

 Score = 29.1 bits (65), Expect = 2.7
 Identities = 14/46 (30%), Positives = 31/46 (67%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKKIS 427
           EKK+  K++K+++++K+  K++K    I++S + +  RE   KK++
Sbjct: 76  EKKEIAKQRKREQREKELAKRQKELEKIELSKKKQKERERRRKKLT 121



 Score = 27.9 bits (62), Expect = 7.1
 Identities = 15/47 (31%), Positives = 27/47 (57%), Gaps = 1/47 (2%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISS-RPEICREFDSKKI 426
           PEK+  +K+ K  ++ +K +KKKK++   +I+  R    RE +  K 
Sbjct: 50  PEKESAEKQVKSSKEDRKFEKKKKLDEKKEIAKQRKREQREKELAKR 96


>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family.  This
           family includes gbp a protein from Soybean that binds to
           GAGA element dinucleotide repeat DNA. It seems likely
           that the this domain mediates DNA binding. This putative
           domain contains several conserved cysteines and a
           histidine suggesting this may be a zinc-binding DNA
           interaction domain.
          Length = 301

 Score = 29.9 bits (67), Expect = 2.8
 Identities = 14/44 (31%), Positives = 18/44 (40%), Gaps = 4/44 (9%)

Query: 381 PEKKKKKKKKKKKR----KKKKKKKKKKVNNSIQISSRPEICRE 420
           PE K+ KK KK +     K  K KK KK  +    S +      
Sbjct: 142 PEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDP 185



 Score = 28.7 bits (64), Expect = 6.5
 Identities = 10/34 (29%), Positives = 14/34 (41%)

Query: 383 KKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPE 416
           K  K  K KK +KK     +      I   S+P+
Sbjct: 157 KVPKAPKPKKPKKKGSVSNRSVKMPGIDPRSKPD 190


>gnl|CDD|216337 pfam01159, Ribosomal_L6e, Ribosomal protein L6e. 
          Length = 108

 Score = 28.4 bits (64), Expect = 3.2
 Identities = 25/85 (29%), Positives = 39/85 (45%), Gaps = 14/85 (16%)

Query: 366 DLSE-NVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSK 424
           D+S   V    ++  F  +KKKKKKK +     +KK+KK+V+            R+ D K
Sbjct: 22  DISGVKVPEKINDEYFKREKKKKKKKSEGEFFAEKKEKKEVSEQ----------RKADQK 71

Query: 425 KISLANLPKLK---LFYPALKAMFA 446
            +  A L  +K        LK+ F+
Sbjct: 72  AVDAALLAAIKKHPDLKQYLKSRFS 96


>gnl|CDD|179371 PRK02113, PRK02113, putative hydrolase; Provisional.
          Length = 252

 Score = 29.8 bits (67), Expect = 3.2
 Identities = 17/44 (38%), Positives = 23/44 (52%), Gaps = 7/44 (15%)

Query: 137 ILLDCGEGTYSQLVRLYGSAVDTLLSQLSAVYISHLHADHHLGL 180
           IL+DCG     Q++RL          ++ AV I+H H DH  GL
Sbjct: 47  ILIDCGPDFREQMLRLP-------FGKIDAVLITHEHYDHVGGL 83


>gnl|CDD|129141 TIGR00030, S21p, ribosomal protein S21.  This model describes
           bacterial ribosomal protein S21 and most mitochondrial
           and chloroplast equivalents [Protein synthesis,
           Ribosomal proteins: synthesis and modification].
          Length = 58

 Score = 27.2 bits (61), Expect = 3.3
 Identities = 11/44 (25%), Positives = 23/44 (52%), Gaps = 1/44 (2%)

Query: 360 LPRLNKDLSENVGIA-FDNMRFPEKKKKKKKKKKKRKKKKKKKK 402
           L R  + L +   +      R+ EK  +++++K+K   K+ +KK
Sbjct: 15  LRRFKRKLEKEGILRELKKRRYYEKPSERRRRKEKAAAKRIRKK 58


>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
           fragment found in eukaryotic DNA topoisomerase (topo) IB
           proteins similar to the monomeric yeast and human topo I
           and heterodimeric topo I from Leishmania donvanni. Topo
           I enzymes are divided into:  topo type IA (bacterial)
           and type IB (eukaryotic). Topo I relaxes superhelical
           tension in duplex DNA by creating a single-strand nick,
           the broken strand can then rotate around the unbroken
           strand to remove DNA supercoils and, the nick is
           religated, liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit re-ligation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topo I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topos I play putative roles
           in organizing the kinetoplast DNA network unique to
           these parasites.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 29.5 bits (67), Expect = 3.3
 Identities = 12/25 (48%), Positives = 18/25 (72%)

Query: 380 FPEKKKKKKKKKKKRKKKKKKKKKK 404
           F E+K+KKK   K+ KK  K++K+K
Sbjct: 94  FEEEKEKKKAMSKEEKKAIKEEKEK 118


>gnl|CDD|234533 TIGR04285, nucleoid_noc, nucleoid occlusion protein.  This model
           describes nucleoid occlusion protein, a close homolog to
           ParB chromosome partitioning proteins including Spo0J in
           Bacillus subtilis. Its gene often is located near the
           gene for the Spo0J ortholog. This protein bind a
           specific DNA sequence and blocks cytokinesis from
           happening until chromosome segregation is complete.
          Length = 255

 Score = 29.4 bits (67), Expect = 3.3
 Identities = 11/30 (36%), Positives = 20/30 (66%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNSIQI 411
           E+  KK  +K ++ KKKKK++K  +  ++I
Sbjct: 188 EELIKKLLEKPEKPKKKKKRRKGFSKDVRI 217


>gnl|CDD|213392 cd12149, Flavi_E_C, Immunoglobulin-like domain III (C-terminal
           domain) of Flavivirus envelope glycoprotein E.  The
           C-terminal domain (domain III) of Flavivirus
           glycoprotein E appears to be involved in low-affinity
           interactions with negatively charged glycoaminoglycans
           on the host cell surface. Domain III may also play a
           role in interactions with alpha-v-beta-3 integrins in
           West Nile virus, Japanese encephalitis virus, and Dengue
           virus. The interface between domain I and domain III
           appears to be destabilized by the low-pH environment of
           the endosome, and domain III may play a vital role in
           the conformational changes of envelope glycoprotein E
           that follow the clathrin-mediated endocytosis of viral
           particles and are a prerequisite to membrane fusion.
          Length = 91

 Score = 28.0 bits (63), Expect = 3.3
 Identities = 8/27 (29%), Positives = 12/27 (44%), Gaps = 5/27 (18%)

Query: 283 TKSGH-----KITYSGDTMPCDALVSI 304
             +GH     ++ YSG   PC   V +
Sbjct: 9   ADTGHGTVVMEVKYSGTDAPCRIPVRV 35


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 29.7 bits (67), Expect = 3.3
 Identities = 17/55 (30%), Positives = 25/55 (45%), Gaps = 8/55 (14%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKKISLANLPKLKL 436
           E KKK +KKKKK+K+KK+ K +        + +          K +    PK K 
Sbjct: 65  ESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKK--------SKKTKKKPPKPKP 111


>gnl|CDD|218435 pfam05104, Rib_recp_KP_reg, Ribosome receptor lysine/proline rich
           region.  This highly conserved region is found towards
           the C-terminus of the transmembrane domain. The function
           is unclear.
          Length = 151

 Score = 28.8 bits (64), Expect = 3.7
 Identities = 13/29 (44%), Positives = 19/29 (65%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVNNSI 409
            +  KKKK+K  ++K K KKK++K N  I
Sbjct: 23  QKSDKKKKEKVSEKKGKSKKKEEKPNGKI 51


>gnl|CDD|241400 cd13246, PH_Scd1, Shape and Conjugation Deficiency 1 Pleckstrin
           homology (PH) domain.  Fission yeast Scd1 is an exchange
           factor for Cdc42 and an effector of Ras1, the homolog of
           the human H-Ras. Scd2/Bem1 mediates Cdc42 activation by
           binding to Scd1/Cdc24 and to Cdc42. Ras1 regulates
           Scd1/Cdc24/Ral1, which is a putative guanine nucleotide
           exchange factor for Cdc42, a member of the Rho family of
           Ras-like proteins. Cdc42 then activates the Shk1/Orb2
           protein kinase. Scd1 interacts with Klp5 and Klp6
           kinesins to mediate cytokinesis. PH domains have diverse
           functions, but in general are involved in targeting
           proteins to the appropriate cellular location or in the
           interaction with a binding partner. They share little
           sequence conservation, but all have a common fold, which
           is electrostatically polarized. Less than 10% of PH
           domains bind phosphoinositide phosphates (PIPs) with
           high affinity and specificity. PH domains are
           distinguished from other PIP-binding domains by their
           specific high-affinity binding to PIPs with two vicinal
           phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
           PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 148

 Score = 28.8 bits (65), Expect = 3.9
 Identities = 16/42 (38%), Positives = 20/42 (47%), Gaps = 2/42 (4%)

Query: 375 FDNMR--FPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSR 414
           F+ +     E KKKKKK   K K     KKKK    +Q+  R
Sbjct: 53  FERILLCCKEVKKKKKKSSLKSKSSSSSKKKKKKGPLQLKGR 94


>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62. 
          Length = 217

 Score = 29.4 bits (66), Expect = 3.9
 Identities = 12/19 (63%), Positives = 13/19 (68%)

Query: 385 KKKKKKKKKRKKKKKKKKK 403
           K   KK KK KKKKKKK+ 
Sbjct: 199 KSSAKKDKKSKKKKKKKRS 217



 Score = 28.6 bits (64), Expect = 6.6
 Identities = 17/43 (39%), Positives = 25/43 (58%), Gaps = 5/43 (11%)

Query: 366 DLSENVGIAFDNMR----FPEKKKKKKKKKKKRKKKKKKKKKK 404
           +L+E+VG   D+ +    +  K  K   KK K+ KKKKKKK+ 
Sbjct: 176 NLTEDVGF-LDSFKPLYTWHYKGDKSSAKKDKKSKKKKKKKRS 217


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
           entry is characterized by proteins with alternating
           conserved and low-complexity regions. Bud13 together
           with Snu17p and a newly identified factor,
           Pml1p/Ylr016c, form a novel trimeric complex. called The
           RES complex, pre-mRNA retention and splicing complex.
           Subunits of this complex are not essential for viability
           of yeasts but they are required for efficient splicing
           in vitro and in vivo. Furthermore, inactivation of this
           complex causes pre-mRNA leakage from the nucleus. Bud13
           contains a unique, phylogenetically conserved C-terminal
           region of unknown function.
          Length = 141

 Score = 28.4 bits (64), Expect = 3.9
 Identities = 8/25 (32%), Positives = 22/25 (88%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVN 406
           E+K+++K+++K+ K++K++K+K+  
Sbjct: 15  EEKREEKEREKEEKERKEEKEKEWG 39


>gnl|CDD|216919 pfam02181, FH2, Formin Homology 2 Domain. 
          Length = 372

 Score = 29.6 bits (67), Expect = 4.0
 Identities = 24/76 (31%), Positives = 32/76 (42%), Gaps = 16/76 (21%)

Query: 380 FPEKKKKKKKKKKKRKKKKKKKKKKV---------NNSIQISS----RPEICR---EFDS 423
           F  K KKKK KK ++K   KKKKK++         N +I +        EI +   E D 
Sbjct: 50  FSAKAKKKKSKKSEKKSSSKKKKKEISVLDPKRSQNIAILLRKLKLPPEEIVQAILEMDE 109

Query: 424 KKISLANLPKLKLFYP 439
             + L  L  L    P
Sbjct: 110 SVLGLELLENLLKMAP 125


>gnl|CDD|177060 CHL00138, rps5, ribosomal protein S5; Validated.
          Length = 143

 Score = 28.4 bits (64), Expect = 4.1
 Identities = 11/31 (35%), Positives = 13/31 (41%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNSIQIS 412
               KK   KKK +K   K+ K     IQI 
Sbjct: 2   LFLLKKMYNKKKNRKSNIKENKWEERVIQIK 32


>gnl|CDD|218899 pfam06102, DUF947, Domain of unknown function (DUF947).  Family of
           eukaryotic proteins with unknown function.
          Length = 168

 Score = 28.8 bits (65), Expect = 4.3
 Identities = 12/29 (41%), Positives = 18/29 (62%), Gaps = 2/29 (6%)

Query: 375 FDNMRFPEKKKKKKKKKKKRKKKKKKKKK 403
           FD ++  + K+  K  +KKRKK   K+KK
Sbjct: 142 FDELK--KSKQLDKALEKKRKKNAGKEKK 168



 Score = 28.4 bits (64), Expect = 6.3
 Identities = 10/26 (38%), Positives = 15/26 (57%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNN 407
           +KK  + KK K+  K  +KK+KK   
Sbjct: 139 KKKFDELKKSKQLDKALEKKRKKNAG 164


>gnl|CDD|187770 cd09639, Cas3_I, CRISPR/Cas system-associated protein Cas3.  CRISPR
           (Clustered Regularly Interspaced Short Palindromic
           Repeats) and associated Cas proteins comprise a system
           for heritable host defense by prokaryotic cells against
           phage and other foreign DNA; DEAD/DEAH box helicase DNA
           helicase cas3'; Often but not always is fused to HD
           nuclease domain; signature gene for Type I.
          Length = 353

 Score = 29.3 bits (66), Expect = 4.7
 Identities = 19/84 (22%), Positives = 33/84 (39%), Gaps = 14/84 (16%)

Query: 300 ALVSIGKNSDLLIHEATHEDELEKEAALKMHSTVSQAIRIGREMRAKFVLLTHFSQRYAK 359
            L SI  NS L+  E    DE      L +       + + ++     +L++      A 
Sbjct: 118 TLASIA-NSLLIFDEVHFYDEYTLALILAV-------LEVLKDNDVPILLMS------AT 163

Query: 360 LPRLNKDLSENVGIAFDNMRFPEK 383
           LP+  K+ +E +G   +N     K
Sbjct: 164 LPKFLKEYAEKIGYVEENEPLDLK 187


>gnl|CDD|219621 pfam07890, Rrp15p, Rrp15p.  Rrp15p is required for the formation of
           60S ribosomal subunits.
          Length = 132

 Score = 28.1 bits (63), Expect = 4.7
 Identities = 8/28 (28%), Positives = 18/28 (64%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNSI 409
           EK +KK K++ + +K++  +K +V   +
Sbjct: 40  EKLEKKAKRQLRAEKRQALEKGRVKPVL 67


>gnl|CDD|219783 pfam08297, U3_snoRNA_assoc, U3 snoRNA associated.  This family of
           proteins is associated with U3 snoRNA. U3 snoRNA is
           required for nucleolar processing of pre-18S ribosomal
           RNA.
          Length = 87

 Score = 27.4 bits (61), Expect = 4.8
 Identities = 13/44 (29%), Positives = 18/44 (40%)

Query: 363 LNKDLSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKKVN 406
           L     +            E KK+KKK K +  KK +K+ K V 
Sbjct: 8   LEAIDEDRPPTPPPESYEEELKKQKKKNKLRFLKKLEKRPKDVK 51



 Score = 26.6 bits (59), Expect = 9.3
 Identities = 10/29 (34%), Positives = 16/29 (55%)

Query: 375 FDNMRFPEKKKKKKKKKKKRKKKKKKKKK 403
           ++     +KKK K +  KK +K+ K  KK
Sbjct: 24  YEEELKKQKKKNKLRFLKKLEKRPKDVKK 52


>gnl|CDD|222466 pfam13945, NST1, Splicing factor, salt tolerance regulator.  NST1
           is a family of proteins that seem to be involved,
           directly or indirectly, in the salt sensitivity of some
           cellular functions in yeast. These proteins also
           interact with the splicing factor Msl1p.
          Length = 189

 Score = 28.7 bits (64), Expect = 4.9
 Identities = 10/20 (50%), Positives = 12/20 (60%)

Query: 393 KRKKKKKKKKKKVNNSIQIS 412
           K KKKKKK+ K  + S   S
Sbjct: 32  KSKKKKKKRSKATSPSHNAS 51


>gnl|CDD|223898 COG0828, RpsU, Ribosomal protein S21 [Translation, ribosomal
           structure and biogenesis].
          Length = 67

 Score = 26.9 bits (60), Expect = 4.9
 Identities = 10/22 (45%), Positives = 17/22 (77%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKK 403
           EK+K+KK   +KRK K+ +K++
Sbjct: 43  EKRKRKKAAARKRKFKRLRKEQ 64



 Score = 26.5 bits (59), Expect = 6.2
 Identities = 10/24 (41%), Positives = 16/24 (66%)

Query: 379 RFPEKKKKKKKKKKKRKKKKKKKK 402
            F EK  +K+K+KK   +K+K K+
Sbjct: 36  EFYEKPSEKRKRKKAAARKRKFKR 59


>gnl|CDD|220763 pfam10453, NUFIP1, Nuclear fragile X mental retardation-interacting
           protein 1 (NUFIP1).  Proteins in this family have been
           implicated in the assembly of the large subunit of the
           ribosome and in telomere maintenance. Some proteins in
           this family contain a CCCH zinc finger. This family
           contains a protein called human fragile X mental
           retardation-interacting protein 1, which is known to
           bind RNA and is phosphorylated upon DNA damage.
          Length = 57

 Score = 26.6 bits (59), Expect = 5.1
 Identities = 7/25 (28%), Positives = 14/25 (56%)

Query: 378 MRFPEKKKKKKKKKKKRKKKKKKKK 402
             +P K   +KKKK + ++ K+ + 
Sbjct: 33  KNYPTKANIEKKKKAEEERIKRGEL 57


>gnl|CDD|220267 pfam09494, Slx4, Slx4 endonuclease.  The Slx4 protein is a
           heteromeric structure-specific endonuclease found in
           fungi. Slx4 with Slx1 acts as a nuclease on branched DNA
           substrates, particularly simple-Y, 5'-flap, or
           replication fork structures by cleaving the strand
           bearing the 5' non-homologous arm at the branch junction
           and thus generating ligatable nicked products from
           5'-flap or replication fork substrates.
          Length = 627

 Score = 29.6 bits (66), Expect = 5.1
 Identities = 17/58 (29%), Positives = 25/58 (43%), Gaps = 4/58 (6%)

Query: 353 FSQRYAKLPRLNKDLSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQ 410
             Q  ++      D   N G    N+      KKKK KK K +K+ K+K KK+ +   
Sbjct: 45  LRQVISRFTSFESDDQANSG----NVSGKRVPKKKKIKKPKLRKRTKRKNKKIKSLTA 98


>gnl|CDD|178714 PLN03170, PLN03170, chalcone synthase; Provisional.
          Length = 401

 Score = 29.3 bits (65), Expect = 5.1
 Identities = 16/58 (27%), Positives = 29/58 (50%), Gaps = 11/58 (18%)

Query: 11  CSSVTVLDC--PSEDYLDSLLGESIFSQHQSCAMNEDDKAALVAHFSPHHIMSHPRYK 66
           CS +T +    PSE +LDS++G+++F          D  AA++    P   +  P ++
Sbjct: 194 CSEITAVTFRGPSESHLDSMVGQALFG---------DGAAAVIVGADPDERVERPLFQ 242


>gnl|CDD|240256 PTZ00071, PTZ00071, 40S ribosomal protein S24; Provisional.
          Length = 132

 Score = 28.1 bits (63), Expect = 5.1
 Identities = 10/25 (40%), Positives = 16/25 (64%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKV 405
              +K +K+ K +RKK + K+K KV
Sbjct: 103 KAGRKARKELKNRRKKVRGKEKTKV 127


>gnl|CDD|240578 cd12951, RRP7_Rrp7A, RRP7 domain ribosomal RNA-processing protein 7
           homolog A (Rrp7A) and similar proteins.  The family
           corresponds to the RRP7 domain of Rrp7A, also termed
           gastric cancer antigen Zg14, and similar proteins which
           are yeast ribosomal RNA-processing protein 7 (Rrp7p)
           homologs mainly found in Metazoans. The cellular
           function of Rrp7A remains unclear currently. Rrp7A
           harbors an N-terminal RNA recognition motif (RRM), also
           termed RBD (RNA binding domain) or RNP
           (ribonucleoprotein domain), and a C-terminal RRP7
           domain.
          Length = 129

 Score = 28.0 bits (63), Expect = 5.2
 Identities = 10/28 (35%), Positives = 16/28 (57%)

Query: 379 RFPEKKKKKKKKKKKRKKKKKKKKKKVN 406
               +K+    K  +++KKKKKKK+  N
Sbjct: 62  PKTARKESVAAKAAEKEKKKKKKKELKN 89


>gnl|CDD|204122 pfam09072, TMA7, Translation machinery associated TMA7.  TMA7 plays
           a role in protein translation. Deletions of the TMA7
           gene results in altered protein synthesis rates.
          Length = 63

 Score = 26.7 bits (59), Expect = 5.3
 Identities = 13/51 (25%), Positives = 24/51 (47%), Gaps = 6/51 (11%)

Query: 364 NKDLSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSR 414
            K+L E      +++ F +K+K++ K  K+   K K K       I+ S +
Sbjct: 18  KKELDE------EDLAFKQKQKEEAKALKELAAKAKGKGPLGGGGIKKSGK 62


>gnl|CDD|191968 pfam08213, DUF1713, Mitochondrial domain of unknown function
           (DUF1713).  This domain is found at the C terminal end
           of mitochondrial proteins of unknown function.
          Length = 34

 Score = 25.8 bits (57), Expect = 5.3
 Identities = 9/32 (28%), Positives = 20/32 (62%)

Query: 376 DNMRFPEKKKKKKKKKKKRKKKKKKKKKKVNN 407
           D++    ++K KK K KK +K+ +  ++K++ 
Sbjct: 3   DSVLRKRRRKMKKHKYKKLRKRTRALRRKLDK 34


>gnl|CDD|179323 PRK01710, murD, UDP-N-acetylmuramoyl-L-alanyl-D-glutamate
           synthetase; Provisional.
          Length = 458

 Score = 29.3 bits (66), Expect = 5.3
 Identities = 18/59 (30%), Positives = 29/59 (49%), Gaps = 6/59 (10%)

Query: 35  SQHQSCAMNEDDKAALVAHFSPHHIMSHPRYKEFMS------KFPSTTQHLVLNESNEC 87
           S  Q   M+   + A+V + SP+H+  H   +E++       K+ S    LVLN+ NE 
Sbjct: 174 SSFQLMTMDVSPEVAVVTNLSPNHLDVHKDMEEYIDAKKNIFKYQSENDLLVLNKDNEI 232


>gnl|CDD|199206 cd08962, GatD, GatD subunit of archaeal Glu-tRNA amidotransferase. 
           GatD is involved in the alternative synthesis of
           Gln-tRNA(Gln) in archaea via the transamidation of
           incorrectly charged Glu-tRNA(Gln). GatD is active as a
           dimer, and it provides the amino group required for this
           reaction. GatD is related to bacterial L-asparaginases
           (amidohydrolases), which catalyze the hydrolysis of
           asparagine to aspartic acid and ammonia. This CD spans
           both the L-asparaginase_like domain and an N-terminal
           supplementary domain.
          Length = 402

 Score = 29.1 bits (66), Expect = 5.5
 Identities = 14/50 (28%), Positives = 22/50 (44%), Gaps = 6/50 (12%)

Query: 370 NVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSI-----QISSR 414
           N+GI        E  +K +K K +  ++ +KK      SI      I+SR
Sbjct: 37  NIGIDISIEEI-ELIEKGEKPKPELGEEIEKKPGLPKVSIISTGGTIASR 85


>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
          Length = 1437

 Score = 29.4 bits (67), Expect = 5.5
 Identities = 13/53 (24%), Positives = 26/53 (49%)

Query: 374 AFDNMRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKKI 426
            F+  +  E +K  K+  +  KK + +KKK+  N        +I ++ D ++I
Sbjct: 174 KFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQIGKKIDKEEI 226


>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
          Length = 246

 Score = 28.9 bits (65), Expect = 5.5
 Identities = 8/24 (33%), Positives = 15/24 (62%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
           P+ K K K K  K+ +++ K++ K
Sbjct: 99  PKPKPKPKPKPVKKVEEQPKREVK 122



 Score = 28.1 bits (63), Expect = 9.5
 Identities = 10/24 (41%), Positives = 14/24 (58%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
           PE K K K K K +  KK +++ K
Sbjct: 95  PEPKPKPKPKPKPKPVKKVEEQPK 118


>gnl|CDD|197876 smart00792, Agouti, Agouti protein.  The agouti protein regulates
           pigmentation in the mouse hair follicle producing a
           black hair with a subapical yellow band. A highly
           homologous protein agouti signal protein (ASIP) is
           present in humans and is expressed at highest levels in
           adipose tissue where it may play a role in energy
           homeostasis and possibly human pigmentation.
          Length = 124

 Score = 27.9 bits (62), Expect = 5.6
 Identities = 16/56 (28%), Positives = 25/56 (44%), Gaps = 1/56 (1%)

Query: 360 LPRLNKDLSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRP 415
              LN+    +V I   N + P+K   ++ +KK  +KK+KK    V      S R 
Sbjct: 34  NSSLNQADFPSVSIVGLNKK-PKKISAEEAEKKLLQKKEKKALTNVLRPEPRSPRR 88


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 29.1 bits (66), Expect = 5.6
 Identities = 10/33 (30%), Positives = 21/33 (63%), Gaps = 10/33 (30%)

Query: 382 EKKKKKKKKKKKR----------KKKKKKKKKK 404
           EKK++KKK++++           +K ++K++KK
Sbjct: 286 EKKEEKKKEEREAKLAKLSPEEQRKLEEKERKK 318


>gnl|CDD|227513 COG5186, PAP1, Poly(A) polymerase [RNA processing and
           modification].
          Length = 552

 Score = 29.1 bits (65), Expect = 5.6
 Identities = 11/34 (32%), Positives = 16/34 (47%)

Query: 398 KKKKKKKVNNSIQISSRPEICREFDSKKISLANL 431
             K  KKV+    +    E+CRE+D    S  N+
Sbjct: 471 PVKPGKKVDIEQPVKEFIELCREYDEGDASGMNI 504


>gnl|CDD|220129 pfam09159, Ydc2-catalyt, Mitochondrial resolvase Ydc2 / RNA
           splicing MRS1.  Members of this family adopt a secondary
           structure consisting of two beta sheets and one alpha
           helix, arranged as a beta-alpha-beta motif. Each beta
           sheet has five strands, arranged in a 32145 order, with
           the second strand being antiparallel to the rest.
           Mitochondrial resolvase Ydc2 is capable of resolving
           Holliday junctions and cleaves DNA after 5'-CT-3' and
           5'-TT-3' sequences. This family also contains the
           mitochondrial RNA-splicing protein MRS1 which is
           involved in the excision of group I introns.
          Length = 254

 Score = 28.9 bits (65), Expect = 5.7
 Identities = 9/26 (34%), Positives = 13/26 (50%)

Query: 379 RFPEKKKKKKKKKKKRKKKKKKKKKK 404
           R     +KKK + KK+  K  KK + 
Sbjct: 148 RTEILAEKKKPRSKKKSSKNSKKLRI 173



 Score = 28.5 bits (64), Expect = 6.8
 Identities = 9/25 (36%), Positives = 15/25 (60%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVN 406
           E   +KKK + K+K  K  KK +++
Sbjct: 150 EILAEKKKPRSKKKSSKNSKKLRID 174



 Score = 28.5 bits (64), Expect = 7.8
 Identities = 9/22 (40%), Positives = 12/22 (54%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKK 402
            EKKK + KKK  +  KK +  
Sbjct: 153 AEKKKPRSKKKSSKNSKKLRID 174


>gnl|CDD|203444 pfam06424, PRP1_N, PRP1 splicing factor, N-terminal.  This domain
           is specific to the N-terminal part of the prp1 splicing
           factor, which is involved in mRNA splicing (and possibly
           also poly(A)+ RNA nuclear export and cell cycle
           progression). This domain is specific to the N terminus
           of the RNA splicing factor encoded by prp1. It is
           involved in mRNA splicing and possibly also poly(A)and
           RNA nuclear export and cell cycle progression.
          Length = 131

 Score = 28.0 bits (63), Expect = 5.9
 Identities = 12/48 (25%), Positives = 29/48 (60%), Gaps = 5/48 (10%)

Query: 384 KKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKKISLANL 431
           ++  +++KK+R++K+K++ +K          P+I ++F   K +LA +
Sbjct: 72  ERMDERRKKRREQKEKEEIEKYRE-----ENPKIQQQFADLKRNLATV 114


>gnl|CDD|217787 pfam03910, Adeno_PV, Adenovirus minor core protein PV. 
          Length = 336

 Score = 29.0 bits (65), Expect = 6.1
 Identities = 10/24 (41%), Positives = 16/24 (66%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
              K + K +K KR KK+KK+++K
Sbjct: 22  RPVKDEAKPRKIKRVKKRKKREEK 45


>gnl|CDD|185509 PTZ00196, PTZ00196, 60S ribosomal protein L36; Provisional.
          Length = 98

 Score = 27.3 bits (61), Expect = 6.2
 Identities = 13/40 (32%), Positives = 19/40 (47%), Gaps = 1/40 (2%)

Query: 372 GIAFD-NMRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQ 410
           GIA   N      K+ +K    KRK    K+K+ V + I+
Sbjct: 4   GIAVGLNKGHIVTKRARKPSPSKRKGLLSKRKRLVKDVIR 43


>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein.  This family
           consists of various plant development proteins which are
           homologues of floricaula (FLO) and Leafy (LFY) proteins
           which are floral meristem identity proteins. Mutations
           in the sequences of these proteins affect flower and
           leaf development.
          Length = 382

 Score = 28.8 bits (65), Expect = 6.2
 Identities = 7/27 (25%), Positives = 17/27 (62%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVNN 407
           P      +KKK+++K+++K+ K+   +
Sbjct: 175 PGHSSDSEKKKQRKKQRRKRSKELRED 201


>gnl|CDD|234350 TIGR03766, TIGR03766, conserved hypothetical integral membrane
           protein.  Models TIGR03110, TIGR03111, and TIGR03112
           describe a three-gene system found in several
           Gram-positive bacteria, where TIGR03110 (XrtG) is
           distantly related to a putative transpeptidase,
           exosortase (TIGR02602). This model describes a small
           clade that correlates by both gene clustering and
           phyletic pattern, although imperfectly, to the three
           gene system. Both this narrow clade, and the larger set
           of full-length homologous integral membrane proteins,
           have an especially well-conserved region near the
           C-terminus with an invariant tyrosine. The function is
           unknown.
          Length = 483

 Score = 29.2 bits (66), Expect = 6.4
 Identities = 9/32 (28%), Positives = 15/32 (46%)

Query: 370 NVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKK 401
           N       +  P K +K K   K+ K++ K+K
Sbjct: 319 NAEDVLATLSLPTKAEKNKYSIKEIKRRLKEK 350


>gnl|CDD|221161 pfam11661, DUF2986, Protein of unknown function (DUF2986).  This
           family of proteins has no known function.
          Length = 44

 Score = 25.7 bits (57), Expect = 6.5
 Identities = 12/26 (46%), Positives = 17/26 (65%)

Query: 390 KKKKRKKKKKKKKKKVNNSIQISSRP 415
           +KKK  +K KKK KK N  +  S++P
Sbjct: 3   RKKKINQKLKKKAKKANAKLHPSNKP 28


>gnl|CDD|237869 PRK14962, PRK14962, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 472

 Score = 29.0 bits (65), Expect = 6.6
 Identities = 39/168 (23%), Positives = 72/168 (42%), Gaps = 24/168 (14%)

Query: 305 GKNSDLLIHEATHE--DELEKEAALKMHSTVSQAIRIGREMR---AKFVLLTHFSQRYAK 359
           GK+ ++LI +A  +  ++LE+E A  +     Q + I RE++    K ++    S   A 
Sbjct: 273 GKDYEVLIQQAIEDLVEDLERERANDIIQVSRQLLNILREIKFAEEKRLVCKLGSASIAT 332

Query: 360 LPRLNKDLSENVGIAFDNMRFP------------EKKKKKKKKKKKRKKKKKKKKKKVNN 407
                     +V    DN                E+K++  + +K+ K+  ++ K+K + 
Sbjct: 333 RFSSPNVQENDVEEKNDNSNVQQKEKKKEESKAKEEKQEDIEFEKRFKELMEELKEKGDL 392

Query: 408 SIQIS-SRPEICREFDSKKISLANLPKLKLFYPALKAMFAEYQDEIEN 454
           SI ++ S  E+  + D  KIS        + Y  +K    E   E+EN
Sbjct: 393 SIFVALSLSEVEEDGDKVKISF--DSSKAMHYELMK----EKLPELEN 434


>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein. 
          Length = 198

 Score = 28.2 bits (63), Expect = 6.7
 Identities = 10/25 (40%), Positives = 12/25 (48%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKV 405
             K+ KKK+    RK    KK KK 
Sbjct: 47  SPKQPKKKRPTTPRKPATTKKSKKK 71


>gnl|CDD|202819 pfam03939, Ribosomal_L23eN, Ribosomal protein L23, N-terminal
           domain.  The N-terminal domain appears to be specific to
           the eukaryotic ribosomal proteins L25, L23, and L23a.
          Length = 52

 Score = 26.1 bits (58), Expect = 6.7
 Identities = 11/28 (39%), Positives = 14/28 (50%)

Query: 383 KKKKKKKKKKKRKKKKKKKKKKVNNSIQ 410
           K K  K KK   K    KKK+K+  S+ 
Sbjct: 2   KAKALKAKKAVLKGVHGKKKRKIRTSVT 29


>gnl|CDD|240317 PTZ00217, PTZ00217, flap endonuclease-1; Provisional.
          Length = 393

 Score = 28.8 bits (65), Expect = 6.9
 Identities = 14/45 (31%), Positives = 18/45 (40%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKK 425
            +K  KK   K K KKKKKK         + S   +   +   KK
Sbjct: 349 TKKPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAKSSGKKKVKK 393


>gnl|CDD|224696 COG1782, COG1782, Predicted metal-dependent RNase, consists of a
           metallo-beta-lactamase domain and an RNA-binding KH
           domain [General function prediction only].
          Length = 637

 Score = 28.8 bits (65), Expect = 6.9
 Identities = 15/48 (31%), Positives = 21/48 (43%), Gaps = 6/48 (12%)

Query: 133 PDTCILLDCG----EGTYSQLVRLYGSAVDTLLSQLSAVYISHLHADH 176
           P++ +LLDCG             L     +    +L AV I+H H DH
Sbjct: 202 PESRVLLDCGVNVAGNGEDAFPYLD--VPEFQPDELDAVIITHAHLDH 247


>gnl|CDD|149438 pfam08374, Protocadherin, Protocadherin.  The structure of
           protocadherins is similar to that of classic cadherins
           (pfam00028), but particularly on the cytoplasmic domains
           they also have some unique features. They are expressed
           in a variety of organisms and are found in high
           concentrations in the brain where they seem to be
           localised mainly at cell-cell contact sites. Their
           expression seems to be developmentally regulated.
          Length = 223

 Score = 28.3 bits (63), Expect = 7.3
 Identities = 11/18 (61%), Positives = 13/18 (72%)

Query: 381 PEKKKKKKKKKKKRKKKK 398
            E K+KKKKK KK+K  K
Sbjct: 87  QENKQKKKKKDKKKKSPK 104


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 28.6 bits (64), Expect = 7.4
 Identities = 21/105 (20%), Positives = 41/105 (39%), Gaps = 7/105 (6%)

Query: 359 KLPRLNKDLSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEIC 418
           K   L ++  +   I   +        KKK K+KK KK+  +   K  +  +   R  I 
Sbjct: 203 KAASLKRNPPKKSNI-MSSF------FKKKTKEKKEKKEASESTVKEESEEESGKRDVIL 255

Query: 419 REFDSKKISLANLPKLKLFYPALKAMFAEYQDEIENRAIRRNLKQ 463
            +  ++   L          P+ +   +E + E + +  R+ LK+
Sbjct: 256 EDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKK 300


>gnl|CDD|217479 pfam03297, Ribosomal_S25, S25 ribosomal protein. 
          Length = 105

 Score = 27.3 bits (61), Expect = 7.5
 Identities = 16/39 (41%), Positives = 22/39 (56%), Gaps = 10/39 (25%)

Query: 381 PEKKKKKKKKK--------KKRKKK--KKKKKKKVNNSI 409
           P KK++KK  K        K +KKK  K K K K+NN++
Sbjct: 2   PPKKQQKKAAKAAAASAGGKAKKKKWSKGKVKDKLNNAV 40


>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
          Length = 813

 Score = 28.9 bits (65), Expect = 7.6
 Identities = 12/28 (42%), Positives = 14/28 (50%)

Query: 376 DNMRFPEKKKKKKKKKKKRKKKKKKKKK 403
           D+    EKK K K  KK  +K KK   K
Sbjct: 766 DSAFRGEKKAKPKAAKKDARKAKKPSAK 793


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 29.1 bits (65), Expect = 7.7
 Identities = 11/23 (47%), Positives = 18/23 (78%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
           E+KKKK++K K+++ KK K  +K
Sbjct: 19  ERKKKKEEKAKEKELKKLKAAQK 41


>gnl|CDD|225212 COG2333, ComEC, Predicted hydrolase (metallo-beta-lactamase
           superfamily) [General function prediction only].
          Length = 293

 Score = 28.5 bits (64), Expect = 7.9
 Identities = 20/83 (24%), Positives = 34/83 (40%), Gaps = 19/83 (22%)

Query: 122 GKQVLAIRGPGPDTCILLDCGEGTYSQLVR------LY--GSAVDTL----------LSQ 163
           G  +L           +LD G+G  + L+R      LY  G+++             + +
Sbjct: 32  GDSILIQAPSNGWKVHMLDVGQG-LATLIRSEGKTILYDTGNSMGQDVIIPYLKSLGVRK 90

Query: 164 LSAVYISHLHADHHLGLFSVIKA 186
           L  + ++H  ADH  GL  V+K 
Sbjct: 91  LDQLILTHPDADHIGGLDEVLKT 113


>gnl|CDD|149515 pfam08492, SRP72, SRP72 RNA-binding domain.  This region has been
           identified as the binding site of the SRP72 protein to
           SRP RNA.
          Length = 57

 Score = 25.9 bits (57), Expect = 8.0
 Identities = 10/18 (55%), Positives = 12/18 (66%)

Query: 380 FPEKKKKKKKKKKKRKKK 397
              ++KKKK  KKKRK K
Sbjct: 13  PGTEQKKKKSAKKKRKPK 30


>gnl|CDD|225382 COG2826, Tra8, Transposase and inactivated derivatives, IS30 family
           [DNA replication, recombination, and repair].
          Length = 318

 Score = 28.5 bits (64), Expect = 8.1
 Identities = 9/28 (32%), Positives = 17/28 (60%)

Query: 390 KKKKRKKKKKKKKKKVNNSIQISSRPEI 417
           + K+RK++K+  + KV   + I  RP+ 
Sbjct: 130 RHKRRKRRKRPTRGKVPFGVSIEQRPKF 157


>gnl|CDD|218550 pfam05311, Baculo_PP31, Baculovirus 33KDa late protein (PP31).
           Autographa californica nuclear polyhedrosis virus
           (AcMNPV) pp31 is a nuclear phosphoprotein that
           accumulates in the virogenic stroma, which is the viral
           replication centre in the infected-cell nucleus, binds
           to DNA, and serves as a late expression factor.
          Length = 267

 Score = 28.5 bits (64), Expect = 8.2
 Identities = 10/24 (41%), Positives = 15/24 (62%)

Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
           P +K+K  KK  K+    KK+K+K
Sbjct: 223 PSRKRKAPKKVAKKVAAAKKRKQK 246


>gnl|CDD|217301 pfam02956, TT_ORF1, TT viral orf 1.  TT virus (TTV), isolated
           initially from a Japanese patient with hepatitis of
           unknown aetiology, has since been found to infect both
           healthy and diseased individuals and numerous prevalence
           studies have raised questions about its role in
           unexplained hepatitis. ORF1 is a large 750 residue
           protein. The N-terminal half of this protein corresponds
           to the capsid protein.
          Length = 525

 Score = 28.8 bits (65), Expect = 8.5
 Identities = 9/23 (39%), Positives = 10/23 (43%)

Query: 383 KKKKKKKKKKKRKKKKKKKKKKV 405
           K K      K R K KK  K K+
Sbjct: 189 KHKILIPSLKTRPKGKKYVKIKI 211


>gnl|CDD|192292 pfam09429, Wbp11, WW domain binding protein 11.  The WW domain is a
           small protein module with a triple-stranded beta-sheet
           fold. This is a family of WW domain binding proteins.
          Length = 78

 Score = 26.5 bits (59), Expect = 8.7
 Identities = 10/25 (40%), Positives = 17/25 (68%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVN 406
           +KKK+ KK K +R+ +++ K  K N
Sbjct: 14  QKKKELKKNKAERQARREAKLAKKN 38


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 28.6 bits (64), Expect = 8.8
 Identities = 8/23 (34%), Positives = 18/23 (78%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
           E +KK+K++  + K+K++K ++K
Sbjct: 411 ENEKKQKEQADEDKEKRQKDERK 433


>gnl|CDD|225919 COG3384, COG3384, Aromatic ring-opening dioxygenase, catalytic LigB
           subunit related    enzyme [Amino acid transport and
           metabolism].
          Length = 268

 Score = 28.4 bits (64), Expect = 9.0
 Identities = 13/63 (20%), Positives = 27/63 (42%), Gaps = 6/63 (9%)

Query: 160 LLSQLSAVYISHLHADHHLGLFSVIKAWSRVKPECKLTLLAPRQIITWLSVYAARFESVG 219
            ++ + A+++SH      L   +  +    +  E     L P  II    V++A +E+ G
Sbjct: 5   KITMMPALFLSHGSPMLALEDNAATRGLRELGRELP--ELRPDTII----VFSAHWETRG 58

Query: 220 HLY 222
             +
Sbjct: 59  AYH 61


>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
           factor.  Replicative DNA polymerases are capable of
           polymerising tens of thousands of nucleotides without
           dissociating from their DNA templates. The high
           processivity of these polymerases is dependent upon
           accessory proteins that bind to the catalytic subunit of
           the polymerase or to the substrate. The Epstein-Barr
           virus (EBV) BMRF1 protein is an essential component of
           the viral DNA polymerase and is absolutely required for
           lytic virus replication. BMRF1 is also a transactivator.
           This family is predicted to have a UL42 like structure.
          Length = 381

 Score = 28.5 bits (64), Expect = 9.3
 Identities = 10/21 (47%), Positives = 15/21 (71%)

Query: 381 PEKKKKKKKKKKKRKKKKKKK 401
             K+K++  KK+K+KK KK K
Sbjct: 354 SPKRKREGDKKQKKKKSKKLK 374


>gnl|CDD|221569 pfam12420, DUF3671, Protein of unknown function.  This domain
           family is found in eukaryotes, and is typically between
           96 and 116 amino acids in length.
          Length = 106

 Score = 26.9 bits (60), Expect = 9.4
 Identities = 11/29 (37%), Positives = 15/29 (51%)

Query: 375 FDNMRFPEKKKKKKKKKKKRKKKKKKKKK 403
           FD + + +K KK     KK+ KK   KK 
Sbjct: 17  FDKIDYIDKLKKDPNIDKKKFKKIIFKKY 45


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 28.5 bits (64), Expect = 9.4
 Identities = 17/83 (20%), Positives = 30/83 (36%), Gaps = 10/83 (12%)

Query: 378 MRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKKISLANLPKLKLF 437
                K  + K+K K ++ K+K++K+    +          +E    K  LA L +LK  
Sbjct: 266 YEPINKPVRPKRKTKAQRNKEKRRKELEREA----------KEEKQLKKKLAQLARLKEI 315

Query: 438 YPALKAMFAEYQDEIENRAIRRN 460
              +         + E R  R  
Sbjct: 316 AKEVAQKEKARARKKEQRKERGE 338


>gnl|CDD|220207 pfam09368, Sas10_Utp3_C, Sas10 C-terminal domain.  This family
           contains a C-terminal presumed domain in Sas10 which
           hash been identified as a regulator of chromatin
           silencing.
          Length = 76

 Score = 26.4 bits (59), Expect = 9.5
 Identities = 9/22 (40%), Positives = 14/22 (63%)

Query: 383 KKKKKKKKKKKRKKKKKKKKKK 404
           KK ++  + KKRKK +K  K+ 
Sbjct: 24  KKDRRNPRVKKRKKYEKAVKRL 45


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 28.4 bits (64), Expect = 9.6
 Identities = 13/93 (13%), Positives = 24/93 (25%), Gaps = 2/93 (2%)

Query: 315 ATHEDELEKEAALKMHSTVSQAI--RIGREMRAKFVLLTHFSQRYAKLPRLNKDLSENVG 372
           A  E + + +       +        I   + +K        Q    L  + KD  +   
Sbjct: 12  AEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATE 71

Query: 373 IAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKKV 405
                 +     K    K   +KK K +     
Sbjct: 72  SDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSK 104


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 28.4 bits (63), Expect = 9.8
 Identities = 14/23 (60%), Positives = 14/23 (60%)

Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
           EK KKKKKK  K KKK    KK 
Sbjct: 239 EKSKKKKKKLAKNKKKLDDDKKG 261


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.317    0.132    0.383 

Gapped
Lambda     K      H
   0.267   0.0616    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 24,004,425
Number of extensions: 2319991
Number of successful extensions: 11772
Number of sequences better than 10.0: 1
Number of HSP's gapped: 9103
Number of HSP's successfully gapped: 655
Length of query: 485
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 384
Effective length of database: 6,457,848
Effective search space: 2479813632
Effective search space used: 2479813632
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 61 (27.5 bits)