RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy8050
         (311 letters)



>gnl|CDD|222150 pfam13465, zf-H2C2_2, Zinc-finger double domain. 
          Length = 26

 Score = 41.2 bits (97), Expect = 1e-05
 Identities = 15/26 (57%), Positives = 21/26 (80%)

Query: 158 NLTAHVRTHSGEKPFRCPICDRRFSQ 183
           NL  H+RTH+GEKP++CP+C + FS 
Sbjct: 1   NLRRHMRTHTGEKPYKCPVCGKSFSS 26



 Score = 40.5 bits (95), Expect = 2e-05
 Identities = 16/25 (64%), Positives = 20/25 (80%)

Query: 131 LKTHLRTHSGEKPYRCEDCNKSFSQ 155
           L+ H+RTH+GEKPY+C  C KSFS 
Sbjct: 2   LRRHMRTHTGEKPYKCPVCGKSFSS 26



 Score = 37.0 bits (86), Expect = 3e-04
 Identities = 14/25 (56%), Positives = 21/25 (84%)

Query: 186 SVTTHMRTHSGERPYRCRLCKKAFS 210
           ++  HMRTH+GE+PY+C +C K+FS
Sbjct: 1   NLRRHMRTHTGEKPYKCPVCGKSFS 25



 Score = 36.2 bits (84), Expect = 6e-04
 Identities = 13/25 (52%), Positives = 18/25 (72%)

Query: 215 LTKHLRIHSGEKPYQCKLCLLRFSQ 239
           L +H+R H+GEKPY+C +C   FS 
Sbjct: 2   LRRHMRTHTGEKPYKCPVCGKSFSS 26


>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 142

 Score = 40.0 bits (94), Expect = 3e-04
 Identities = 15/40 (37%), Positives = 23/40 (57%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECP 40
           K+ K++KKK+KKKKKKK KK          +  ++ S+  
Sbjct: 86  KRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDEE 125



 Score = 30.8 bits (70), Expect = 0.38
 Identities = 20/47 (42%), Positives = 27/47 (57%), Gaps = 1/47 (2%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEK-IQDRSECPGSGMGK 46
           KKKKKKK KK  KK++K   K    +S  +E+  +D+ E P   M K
Sbjct: 96  KKKKKKKAKKGNKKEEKEGSKSSEESSDEEEEGEEDKQEEPVEIMEK 142



 Score = 27.3 bits (61), Expect = 6.4
 Identities = 11/36 (30%), Positives = 18/36 (50%)

Query: 4   KKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSEC 39
           +K  KK+ K++KKK+KKK         +K +     
Sbjct: 81  EKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSK 116


>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
          Length = 434

 Score = 41.1 bits (97), Expect = 4e-04
 Identities = 11/23 (47%), Positives = 16/23 (69%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGI 23
            +KK+K+K+K K KK+ R  K I
Sbjct: 400 AEKKEKEKEKPKVKKRHRDTKNI 422



 Score = 39.2 bits (92), Expect = 0.002
 Identities = 10/22 (45%), Positives = 15/22 (68%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKG 22
           KK+K+K+K K KK+ +  K  G
Sbjct: 402 KKEKEKEKPKVKKRHRDTKNIG 423



 Score = 38.8 bits (91), Expect = 0.003
 Identities = 10/21 (47%), Positives = 16/21 (76%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
           K+ +KK+K+K+K K KKR + 
Sbjct: 398 KRAEKKEKEKEKPKVKKRHRD 418



 Score = 36.5 bits (85), Expect = 0.016
 Identities = 8/21 (38%), Positives = 12/21 (57%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
           +K K KK+ +  K   KR+K 
Sbjct: 408 EKPKVKKRHRDTKNIGKRRKP 428



 Score = 36.1 bits (84), Expect = 0.017
 Identities = 10/31 (32%), Positives = 18/31 (58%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKE 31
           +K+K+K K KK+ +  K   K    + +S+E
Sbjct: 404 EKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434



 Score = 35.3 bits (82), Expect = 0.032
 Identities = 8/21 (38%), Positives = 16/21 (76%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
            K+ +KK+K+K+K K K++ +
Sbjct: 397 AKRAEKKEKEKEKPKVKKRHR 417



 Score = 35.3 bits (82), Expect = 0.037
 Identities = 9/21 (42%), Positives = 15/21 (71%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
           K   K+ +KK+K+K+K + KK
Sbjct: 394 KVLAKRAEKKEKEKEKPKVKK 414



 Score = 34.9 bits (81), Expect = 0.040
 Identities = 11/32 (34%), Positives = 19/32 (59%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
           KK   K+ +KK+K+K+K K K    ++ +  K
Sbjct: 393 KKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGK 424



 Score = 34.9 bits (81), Expect = 0.043
 Identities = 12/38 (31%), Positives = 20/38 (52%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
             K+ +KK+K+K+K K +K+     N   + K    SE
Sbjct: 396 LAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSE 433



 Score = 34.5 bits (80), Expect = 0.054
 Identities = 10/36 (27%), Positives = 18/36 (50%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDR 36
           K  KK   K+ +KK+K+++K  +       + I  R
Sbjct: 390 KPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKR 425



 Score = 33.0 bits (76), Expect = 0.21
 Identities = 9/21 (42%), Positives = 13/21 (61%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
            K  KK   K+ +KK+K K+K
Sbjct: 389 GKPSKKVLAKRAEKKEKEKEK 409



 Score = 32.2 bits (74), Expect = 0.37
 Identities = 7/21 (33%), Positives = 13/21 (61%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
             K  KK   K+ +KK+++K+
Sbjct: 388 TGKPSKKVLAKRAEKKEKEKE 408



 Score = 31.5 bits (72), Expect = 0.55
 Identities = 9/21 (42%), Positives = 12/21 (57%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
           K  K  KK   K+ +KK K+K
Sbjct: 387 KTGKPSKKVLAKRAEKKEKEK 407



 Score = 29.9 bits (68), Expect = 1.8
 Identities = 10/37 (27%), Positives = 16/37 (43%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRS 37
            +KK  K  KK   K+  KK+        K++ +D  
Sbjct: 384 SEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTK 420



 Score = 28.4 bits (64), Expect = 5.7
 Identities = 11/31 (35%), Positives = 12/31 (38%)

Query: 2   KKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
           K K   +KK  K  KK   K        KEK
Sbjct: 379 KTKAPSEKKTGKPSKKVLAKRAEKKEKEKEK 409


>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
           complex aNOP56 subunit; Provisional.
          Length = 414

 Score = 41.1 bits (97), Expect = 4e-04
 Identities = 16/22 (72%), Positives = 21/22 (95%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKG 22
           KKK+++KK +K+KKKKKRKKKG
Sbjct: 384 KKKREEKKPQKRKKKKKRKKKG 405



 Score = 40.3 bits (95), Expect = 9e-04
 Identities = 17/22 (77%), Positives = 20/22 (90%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKG 22
           K +K+KKKKK+KKK KKRKKKG
Sbjct: 391 KPQKRKKKKKRKKKGKKRKKKG 412



 Score = 39.2 bits (92), Expect = 0.002
 Identities = 14/21 (66%), Positives = 19/21 (90%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
           K+KKKKK+KKK KK+KK+ +K
Sbjct: 394 KRKKKKKRKKKGKKRKKKGRK 414



 Score = 38.8 bits (91), Expect = 0.003
 Identities = 14/22 (63%), Positives = 19/22 (86%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKG 22
           KK +K+KKKKK+KKK K++KK 
Sbjct: 390 KKPQKRKKKKKRKKKGKKRKKK 411



 Score = 38.4 bits (90), Expect = 0.003
 Identities = 12/22 (54%), Positives = 19/22 (86%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKG 22
           KK+++KK +K+KKKKK++KK  
Sbjct: 385 KKREEKKPQKRKKKKKRKKKGK 406



 Score = 38.4 bits (90), Expect = 0.004
 Identities = 13/21 (61%), Positives = 19/21 (90%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
           +K+KKKKK+KKK KK+K+K +
Sbjct: 393 QKRKKKKKRKKKGKKRKKKGR 413



 Score = 38.0 bits (89), Expect = 0.004
 Identities = 12/22 (54%), Positives = 19/22 (86%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKG 22
           +++KK +K+KKKKK+KK+ KK 
Sbjct: 387 REEKKPQKRKKKKKRKKKGKKR 408



 Score = 37.3 bits (87), Expect = 0.007
 Identities = 13/22 (59%), Positives = 19/22 (86%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKG 22
           +KK +K+KKKKK+KKK +K+K 
Sbjct: 389 EKKPQKRKKKKKRKKKGKKRKK 410



 Score = 37.3 bits (87), Expect = 0.008
 Identities = 13/22 (59%), Positives = 18/22 (81%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKG 22
            +K+KKKKK+KKK KK++KK  
Sbjct: 392 PQKRKKKKKRKKKGKKRKKKGR 413



 Score = 37.3 bits (87), Expect = 0.008
 Identities = 12/22 (54%), Positives = 19/22 (86%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKG 22
            KKK+++KK +K+KKKK++KK 
Sbjct: 383 PKKKREEKKPQKRKKKKKRKKK 404



 Score = 35.7 bits (83), Expect = 0.027
 Identities = 13/22 (59%), Positives = 19/22 (86%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKG 22
           ++KK +K+KKKKK+KKK KK+ 
Sbjct: 388 EEKKPQKRKKKKKRKKKGKKRK 409



 Score = 33.8 bits (78), Expect = 0.091
 Identities = 11/22 (50%), Positives = 18/22 (81%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKG 22
           K  KKK+++KK +K+KK+KK+ 
Sbjct: 381 KPPKKKREEKKPQKRKKKKKRK 402



 Score = 31.1 bits (71), Expect = 0.79
 Identities = 13/24 (54%), Positives = 18/24 (75%), Gaps = 2/24 (8%)

Query: 1   KKK--KKKKKKKKKKKKKKRKKKG 22
           K+K  K  KKK+++KK +KRKKK 
Sbjct: 376 KEKYPKPPKKKREEKKPQKRKKKK 399


>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
          (DUF1754).  This is a eukaryotic protein family of
          unknown function.
          Length = 90

 Score = 37.8 bits (88), Expect = 8e-04
 Identities = 15/52 (28%), Positives = 17/52 (32%)

Query: 2  KKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSGMGKPPHDVAD 53
            KKKKKKKKKK K K +               D  E       +      D
Sbjct: 18 DVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDEDNEKIEQEED 69



 Score = 31.2 bits (71), Expect = 0.14
 Identities = 18/41 (43%), Positives = 23/41 (56%), Gaps = 5/41 (12%)

Query: 2  KKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGS 42
          K K KK   KKKKKKK+KK     N S +E + ++ E   S
Sbjct: 11 KLKGKKIDVKKKKKKKKKK-----NKSKEEVVTEKEEEEKS 46



 Score = 29.7 bits (67), Expect = 0.56
 Identities = 14/52 (26%), Positives = 23/52 (44%), Gaps = 6/52 (11%)

Query: 1  KKKKKKKKKKKKKKKKKRKKKGI------LANSSSKEKIQDRSECPGSGMGK 46
          KKKKKKKK K K++    K++              +++  ++ E    GM  
Sbjct: 22 KKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDEDNEKIEQEEDGMNL 73


>gnl|CDD|177301 PHA00733, PHA00733, hypothetical protein.
          Length = 128

 Score = 37.9 bits (88), Expect = 0.001
 Identities = 16/49 (32%), Positives = 25/49 (51%), Gaps = 2/49 (4%)

Query: 171 PFRCPICDRRFSQSSSVTTHMRTHSGERPYRCRLCKKAFSDSSTLTKHL 219
           P+ CP+C   FS S S+  H+R    E    C +C K F ++ +   H+
Sbjct: 73  PYVCPLCLMPFSSSVSLKQHIRY--TEHSKVCPVCGKEFRNTDSTLDHV 119



 Score = 31.4 bits (71), Expect = 0.25
 Identities = 16/49 (32%), Positives = 23/49 (46%), Gaps = 2/49 (4%)

Query: 143 PYRCEDCNKSFSQAANLTAHVRTHSGEKPFRCPICDRRFSQSSSVTTHM 191
           PY C  C   FS + +L  H+R     K   CP+C + F  + S   H+
Sbjct: 73  PYVCPLCLMPFSSSVSLKQHIRYTEHSK--VCPVCGKEFRNTDSTLDHV 119



 Score = 30.2 bits (68), Expect = 0.61
 Identities = 17/39 (43%), Positives = 23/39 (58%), Gaps = 1/39 (2%)

Query: 210 SDSSTLTKHLRIHSGEKPYQCKLCLLRFSQSGNLNRHMR 248
            +SS L K L   +   PY C LCL+ FS S +L +H+R
Sbjct: 57  DESSYLYKLLTSKA-VSPYVCPLCLMPFSSSVSLKQHIR 94



 Score = 27.5 bits (61), Expect = 5.5
 Identities = 16/49 (32%), Positives = 23/49 (46%), Gaps = 2/49 (4%)

Query: 199 PYRCRLCKKAFSDSSTLTKHLRIHSGEKPYQCKLCLLRFSQSGNLNRHM 247
           PY C LC   FS S +L +H+R     K   C +C   F  + +   H+
Sbjct: 73  PYVCPLCLMPFSSSVSLKQHIRYTEHSK--VCPVCGKEFRNTDSTLDHV 119


>gnl|CDD|227381 COG5048, COG5048, FOG: Zn-finger [General function prediction
           only].
          Length = 467

 Score = 39.3 bits (91), Expect = 0.002
 Identities = 33/102 (32%), Positives = 50/102 (49%), Gaps = 7/102 (6%)

Query: 158 NLTAHVRTHSGEKPFRCPICDRRFSQSSSVTTHMRTHSGER-PYRCRLCKKAFSDSSTLT 216
           +L++   + S  +  R  +       SS   +   +  G   P + + C  +FS SS LT
Sbjct: 247 SLSSSDSSSSASESPRSSLPTASSQSSSPNESDSSSEKGFSLPIKSKQCNISFSRSSPLT 306

Query: 217 KHLR--IHSGE--KPYQC--KLCLLRFSQSGNLNRHMRVHGS 252
           +HLR   HSGE  KP+ C   LC   FS++  L RH+ +H S
Sbjct: 307 RHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTS 348



 Score = 38.5 bits (89), Expect = 0.004
 Identities = 31/115 (26%), Positives = 51/115 (44%), Gaps = 10/115 (8%)

Query: 142 KPYRCEDCNKSFSQAANLTAHVRT--HSGE--KPFRCPI--CDRRFSQSSSVTTHMRTHS 195
            P + + CN SFS+++ LT H+R+  HSGE  KPF CP   C + FS++ ++  H+  H+
Sbjct: 288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHT 347

Query: 196 GERPYRCRL--CKKAFSDSST--LTKHLRIHSGEKPYQCKLCLLRFSQSGNLNRH 246
              P + +L      FS        + L+ +   K  +    L            
Sbjct: 348 SISPAKEKLLNSSSKFSPLLNNEPPQSLQQYKDLKNDKKSETLSNSCIRNFKRDS 402



 Score = 38.1 bits (88), Expect = 0.004
 Identities = 38/145 (26%), Positives = 61/145 (42%), Gaps = 17/145 (11%)

Query: 123 KTYARPSTLKTHLRT--HSGE--KPYRC--EDCNKSFSQAANLTAHVRTHSGEKPFRCPI 176
            +++R S L  HLR+  HSGE  KP+ C    C K FS+   L  H+  H+   P +  +
Sbjct: 297 ISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEKL 356

Query: 177 CDRRFSQSSSVTT-------HMRTHSGERPYRC--RLCKKAFSDSSTLTKHLRIHSGEKP 227
            +     S  +           +    ++        C + F   S L+ H+  H   +P
Sbjct: 357 LNSSSKFSPLLNNEPPQSLQQYKDLKNDKKSETLSNSCIRNFKRDSNLSLHIITHLSFRP 416

Query: 228 YQCK--LCLLRFSQSGNLNRHMRVH 250
           Y CK   C   F++  NL  H ++H
Sbjct: 417 YNCKNPPCSKSFNRHYNLIPHKKIH 441



 Score = 37.8 bits (87), Expect = 0.005
 Identities = 23/60 (38%), Positives = 34/60 (56%), Gaps = 2/60 (3%)

Query: 170 KPFRCPICDRRFSQSSSVTTHMRTHSGERPYRCRL--CKKAFSDSSTLTKHLRIHSGEKP 227
           +P  CP C   FS+   +T H+R+H+GE+P +C    C K+FS    L++HLR H     
Sbjct: 32  RPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPLELSRHLRTHHNNPS 91



 Score = 37.0 bits (85), Expect = 0.011
 Identities = 24/63 (38%), Positives = 37/63 (58%), Gaps = 2/63 (3%)

Query: 142 KPYRCEDCNKSFSQAANLTAHVRTHSGEKPFRC--PICDRRFSQSSSVTTHMRTHSGERP 199
           +P  C +C  SFS+  +LT H+R+H+GEKP +C    CD+ FS+   ++ H+RTH     
Sbjct: 32  RPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPLELSRHLRTHHNNPS 91

Query: 200 YRC 202
              
Sbjct: 92  DLN 94



 Score = 36.6 bits (84), Expect = 0.015
 Identities = 26/63 (41%), Positives = 35/63 (55%), Gaps = 2/63 (3%)

Query: 198 RPYRCRLCKKAFSDSSTLTKHLRIHSGEKPYQC--KLCLLRFSQSGNLNRHMRVHGSQYN 255
           RP  C  C  +FS    LT+H+R H+GEKP QC    C   FS+   L+RH+R H +  +
Sbjct: 32  RPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPLELSRHLRTHHNNPS 91

Query: 256 MLP 258
            L 
Sbjct: 92  DLN 94



 Score = 35.1 bits (80), Expect = 0.038
 Identities = 36/139 (25%), Positives = 59/139 (42%), Gaps = 12/139 (8%)

Query: 119 FIPGKTYARPSTLKTHLRTHSGEKPYRC--EDCNKSFSQAAN-----LTAHVRTHSGEKP 171
            + GK ++R   LK H+  H+   P +    + +  FS   N          +    +K 
Sbjct: 327 SLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLLNNEPPQSLQQYKDLKNDKK 386

Query: 172 FRC--PICDRRFSQSSSVTTHMRTHSGERPYRCR--LCKKAFSDSSTLTKHLRIHSGEKP 227
                  C R F + S+++ H+ TH   RPY C+   C K+F+    L  H +IH+   P
Sbjct: 387 SETLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIHTNHAP 446

Query: 228 YQCKLCLLRFSQSGNLNRH 246
             C   L  F +  +L+ H
Sbjct: 447 LLC-SILKSFRRDLDLSNH 464



 Score = 30.0 bits (67), Expect = 1.6
 Identities = 21/62 (33%), Positives = 31/62 (50%), Gaps = 2/62 (3%)

Query: 127 RPSTLKTHLRTHSGEKPYRCED--CNKSFSQAANLTAHVRTHSGEKPFRCPICDRRFSQS 184
           R   L  H+R+H+GEKP +C    C+KSFS+   L+ H+RTH               +  
Sbjct: 45  RLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPLELSRHLRTHHNNPSDLNSKSLPLSNSK 104

Query: 185 SS 186
           +S
Sbjct: 105 AS 106


>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
          (DUF2058).  This domain, found in various prokaryotic
          proteins, has no known function.
          Length = 177

 Score = 38.4 bits (90), Expect = 0.002
 Identities = 12/35 (34%), Positives = 22/35 (62%)

Query: 1  KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQD 35
          KKK KK KK+K+K++K+ +K     +   K+  ++
Sbjct: 14 KKKAKKAKKEKRKQRKQARKGADDGDDELKQAAEE 48


>gnl|CDD|217502 pfam03343, SART-1, SART-1 family.  SART-1 is a protein involved in
           cell cycle arrest and pre-mRNA splicing. It has been
           shown to be a component of U4/U6 x U5 tri-snRNP complex
           in human, Schizosaccharomyces pombe and Saccharomyces
           cerevisiae. SART-1 is a known tumour antigen in a range
           of cancers recognised by T cells.
          Length = 603

 Score = 39.0 bits (91), Expect = 0.003
 Identities = 15/34 (44%), Positives = 17/34 (50%)

Query: 2   KKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQD 35
            K KK KKKKKKKKK+RK           E +  
Sbjct: 273 VKFKKPKKKKKKKKKRRKDLDEDELEPEAEGLGS 306



 Score = 37.4 bits (87), Expect = 0.009
 Identities = 12/29 (41%), Positives = 16/29 (55%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSS 29
            K KK KKKKKKKKK+++          +
Sbjct: 273 VKFKKPKKKKKKKKKRRKDLDEDELEPEA 301



 Score = 35.9 bits (83), Expect = 0.022
 Identities = 13/38 (34%), Positives = 18/38 (47%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
            KK KKKKKKKKK++K   +  +   +         S 
Sbjct: 275 FKKPKKKKKKKKKRRKDLDEDELEPEAEGLGSSDSGSR 312



 Score = 35.5 bits (82), Expect = 0.036
 Identities = 12/21 (57%), Positives = 16/21 (76%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
            +  K KK KKKKKKKK+++K
Sbjct: 270 SEMVKFKKPKKKKKKKKKRRK 290



 Score = 32.8 bits (75), Expect = 0.24
 Identities = 13/38 (34%), Positives = 17/38 (44%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
           K KKKKKKKKK++K     +    A          R +
Sbjct: 277 KPKKKKKKKKKRRKDLDEDELEPEAEGLGSSDSGSRKD 314



 Score = 27.8 bits (62), Expect = 9.5
 Identities = 12/32 (37%), Positives = 16/32 (50%), Gaps = 7/32 (21%)

Query: 1   KKKKKKKKKKKKKKKK-------KRKKKGILA 25
           K+K KK  + KKKK         K  K+ IL+
Sbjct: 176 KEKDKKNLELKKKKPDYDPDDDDKFNKRSILS 207


>gnl|CDD|200998 pfam00096, zf-C2H2, Zinc finger, C2H2 type.  The C2H2 zinc finger
           is the classical zinc finger domain. The two conserved
           cysteines and histidines co-ordinate a zinc ion. The
           following pattern describes the zinc finger.
           #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can
           be any amino acid, and numbers in brackets indicate the
           number of residues. The positions marked # are those
           that are important for the stable fold of the zinc
           finger. The final position can be either his or cys. The
           C2H2 zinc finger is composed of two short beta strands
           followed by an alpha helix. The amino terminal part of
           the helix binds the major groove in DNA binding zinc
           fingers. The accepted consensus binding sequence for Sp1
           is usually defined by the asymmetric hexanucleotide core
           GGGCGG but this sequence does not include, among others,
           the GAG (=CTC) repeat that constitutes a high-affinity
           site for Sp1 binding to the wt1 promoter.
          Length = 22

 Score = 34.2 bits (79), Expect = 0.003
 Identities = 13/22 (59%), Positives = 17/22 (77%)

Query: 145 RCEDCNKSFSQAANLTAHVRTH 166
           +C DC KSFS+ +NL  H+RTH
Sbjct: 1   KCPDCGKSFSRKSNLKRHLRTH 22



 Score = 33.5 bits (77), Expect = 0.005
 Identities = 10/22 (45%), Positives = 16/22 (72%)

Query: 173 RCPICDRRFSQSSSVTTHMRTH 194
           +CP C + FS+ S++  H+RTH
Sbjct: 1   KCPDCGKSFSRKSNLKRHLRTH 22



 Score = 32.7 bits (75), Expect = 0.011
 Identities = 11/22 (50%), Positives = 14/22 (63%)

Query: 201 RCRLCKKAFSDSSTLTKHLRIH 222
           +C  C K+FS  S L +HLR H
Sbjct: 1   KCPDCGKSFSRKSNLKRHLRTH 22



 Score = 31.5 bits (72), Expect = 0.030
 Identities = 10/22 (45%), Positives = 13/22 (59%)

Query: 229 QCKLCLLRFSQSGNLNRHMRVH 250
           +C  C   FS+  NL RH+R H
Sbjct: 1   KCPDCGKSFSRKSNLKRHLRTH 22



 Score = 25.4 bits (56), Expect = 3.8
 Identities = 11/17 (64%), Positives = 14/17 (82%)

Query: 122 GKTYARPSTLKTHLRTH 138
           GK+++R S LK HLRTH
Sbjct: 6   GKSFSRKSNLKRHLRTH 22


>gnl|CDD|197676 smart00355, ZnF_C2H2, zinc finger. 
          Length = 23

 Score = 33.6 bits (77), Expect = 0.005
 Identities = 11/23 (47%), Positives = 15/23 (65%)

Query: 172 FRCPICDRRFSQSSSVTTHMRTH 194
           +RCP C + F   S++  HMRTH
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 32.1 bits (73), Expect = 0.017
 Identities = 11/23 (47%), Positives = 14/23 (60%)

Query: 144 YRCEDCNKSFSQAANLTAHVRTH 166
           YRC +C K F   + L  H+RTH
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 32.1 bits (73), Expect = 0.019
 Identities = 11/23 (47%), Positives = 13/23 (56%)

Query: 200 YRCRLCKKAFSDSSTLTKHLRIH 222
           YRC  C K F   S L +H+R H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 31.3 bits (71), Expect = 0.036
 Identities = 9/23 (39%), Positives = 10/23 (43%)

Query: 228 YQCKLCLLRFSQSGNLNRHMRVH 250
           Y+C  C   F     L  HMR H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23


>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6.  The surfeit locus
          protein SURF-6 is shown to be a component of the
          nucleolar matrix and has a strong binding capacity for
          nucleic acids.
          Length = 206

 Score = 36.1 bits (84), Expect = 0.010
 Identities = 13/68 (19%), Positives = 34/68 (50%)

Query: 1  KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSGMGKPPHDVADVLLSLKH 60
          +++++K++++K +KK+KRK+     ++   E  + ++E   S     P + A+  +    
Sbjct: 9  EQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAEGNIVFSK 68

Query: 61 AVVHPGQM 68
               G+ 
Sbjct: 69 VEFADGEQ 76



 Score = 33.8 bits (78), Expect = 0.062
 Identities = 14/22 (63%), Positives = 16/22 (72%)

Query: 3   KKKKKKKKKKKKKKKRKKKGIL 24
           KK+K  KK KKKKK +KK  IL
Sbjct: 185 KKRKDDKKNKKKKKAKKKGRIL 206



 Score = 32.7 bits (75), Expect = 0.16
 Identities = 12/22 (54%), Positives = 16/22 (72%)

Query: 2   KKKKKKKKKKKKKKKKRKKKGI 23
           KK+K  KK KKKKK K+K + +
Sbjct: 185 KKRKDDKKNKKKKKAKKKGRIL 206



 Score = 32.3 bits (74), Expect = 0.19
 Identities = 12/21 (57%), Positives = 14/21 (66%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
           K+K  KK KKKKK KKK +  
Sbjct: 186 KRKDDKKNKKKKKAKKKGRIL 206



 Score = 31.1 bits (71), Expect = 0.49
 Identities = 12/20 (60%), Positives = 15/20 (75%)

Query: 1   KKKKKKKKKKKKKKKKKRKK 20
           KK+K  KK KKKKK KK+ +
Sbjct: 185 KKRKDDKKNKKKKKAKKKGR 204



 Score = 30.0 bits (68), Expect = 1.1
 Identities = 12/20 (60%), Positives = 18/20 (90%)

Query: 2   KKKKKKKKKKKKKKKKRKKK 21
           K+K+K+KKK KK+ K+RK+K
Sbjct: 149 KRKEKQKKKSKKEWKERKEK 168



 Score = 27.7 bits (62), Expect = 6.4
 Identities = 10/17 (58%), Positives = 14/17 (82%)

Query: 5   KKKKKKKKKKKKKRKKK 21
           KK  K+K+K+KKK KK+
Sbjct: 145 KKALKRKEKQKKKSKKE 161


>gnl|CDD|222592 pfam14204, Ribosomal_L18_c, Ribosomal L18 C-terminal region.
          This domain is the C-terminal end of ribosomal L18/L5
          proteins.
          Length = 93

 Score = 34.1 bits (79), Expect = 0.017
 Identities = 11/36 (30%), Positives = 19/36 (52%), Gaps = 3/36 (8%)

Query: 1  KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDR 36
          +KKK+KK+ K + K+   KK   L     K ++  +
Sbjct: 61 RKKKEKKEVKAESKRYNAKK---LTLEQRKARVAQK 93


>gnl|CDD|227516 COG5189, SFP1, Putative transcriptional repressor regulating G2/M
           transition [Transcription / Cell division and chromosome
           partitioning].
          Length = 423

 Score = 35.8 bits (82), Expect = 0.021
 Identities = 21/78 (26%), Positives = 31/78 (39%), Gaps = 24/78 (30%)

Query: 140 GEKPYRCE--DCNKSFSQAANLTAHVRTH--------------------SGEKPFRCPIC 177
             KPY+C    CNK +     L  H   H                    + +KP+RC +C
Sbjct: 346 DGKPYKCPVEGCNKKYKNQNGLKYH-MLHGHQNQKLHENPSPEKMNIFSAKDKPYRCEVC 404

Query: 178 DRRFSQSSSVTTHMRTHS 195
           D+R+   + +  H R HS
Sbjct: 405 DKRYKNLNGLKYH-RKHS 421



 Score = 32.4 bits (73), Expect = 0.31
 Identities = 16/81 (19%), Positives = 35/81 (43%), Gaps = 11/81 (13%)

Query: 168 GEKPFRCPI--CDRRFSQSSSVTTHMRTHSGERPYRCRLCKKAFSDSSTLTKHLRIHSGE 225
             KP++CP+  C++++   + +  HM               +   ++ +  K     + +
Sbjct: 346 DGKPYKCPVEGCNKKYKNQNGLKYHMLHGH---------QNQKLHENPSPEKMNIFSAKD 396

Query: 226 KPYQCKLCLLRFSQSGNLNRH 246
           KPY+C++C  R+     L  H
Sbjct: 397 KPYRCEVCDKRYKNLNGLKYH 417


>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.
          This domain is found to the N-terminus of bacterial
          signal peptidases of the S49 family (pfam01343).
          Length = 154

 Score = 34.0 bits (79), Expect = 0.036
 Identities = 10/21 (47%), Positives = 13/21 (61%)

Query: 1  KKKKKKKKKKKKKKKKKRKKK 21
          K  +K +KK +K K K  KKK
Sbjct: 69 KAWEKAEKKAEKAKAKAEKKK 89



 Score = 33.7 bits (78), Expect = 0.049
 Identities = 9/19 (47%), Positives = 13/19 (68%)

Query: 2  KKKKKKKKKKKKKKKKRKK 20
          K +KK +K K K +KK+ K
Sbjct: 73 KAEKKAEKAKAKAEKKKAK 91



 Score = 33.7 bits (78), Expect = 0.050
 Identities = 10/19 (52%), Positives = 13/19 (68%)

Query: 1  KKKKKKKKKKKKKKKKKRK 19
          K +KK +K K K +KKK K
Sbjct: 73 KAEKKAEKAKAKAEKKKAK 91



 Score = 32.9 bits (76), Expect = 0.098
 Identities = 12/24 (50%), Positives = 15/24 (62%)

Query: 1  KKKKKKKKKKKKKKKKKRKKKGIL 24
          KK +K K K +KKK KK + K  L
Sbjct: 76 KKAEKAKAKAEKKKAKKEEPKPRL 99



 Score = 32.5 bits (75), Expect = 0.12
 Identities = 9/21 (42%), Positives = 13/21 (61%)

Query: 1  KKKKKKKKKKKKKKKKKRKKK 21
           K  +K +KK +K K K +KK
Sbjct: 68 LKAWEKAEKKAEKAKAKAEKK 88



 Score = 32.1 bits (74), Expect = 0.16
 Identities = 9/19 (47%), Positives = 13/19 (68%)

Query: 3  KKKKKKKKKKKKKKKRKKK 21
          K +KK +K K K +K+K K
Sbjct: 73 KAEKKAEKAKAKAEKKKAK 91



 Score = 31.0 bits (71), Expect = 0.41
 Identities = 8/32 (25%), Positives = 15/32 (46%)

Query: 2  KKKKKKKKKKKKKKKKRKKKGILANSSSKEKI 33
           K  +K +KK +K K + +K        K ++
Sbjct: 68 LKAWEKAEKKAEKAKAKAEKKKAKKEEPKPRL 99



 Score = 30.6 bits (70), Expect = 0.49
 Identities = 8/21 (38%), Positives = 13/21 (61%)

Query: 1  KKKKKKKKKKKKKKKKKRKKK 21
          KK+ K  +K +KK +K + K 
Sbjct: 65 KKELKAWEKAEKKAEKAKAKA 85



 Score = 30.6 bits (70), Expect = 0.57
 Identities = 9/21 (42%), Positives = 15/21 (71%)

Query: 1  KKKKKKKKKKKKKKKKKRKKK 21
          +KK +K K K +KKK K+++ 
Sbjct: 75 EKKAEKAKAKAEKKKAKKEEP 95



 Score = 29.4 bits (67), Expect = 1.5
 Identities = 8/21 (38%), Positives = 13/21 (61%)

Query: 1  KKKKKKKKKKKKKKKKKRKKK 21
            +K +KK +K K K ++KK 
Sbjct: 70 AWEKAEKKAEKAKAKAEKKKA 90



 Score = 29.0 bits (66), Expect = 1.6
 Identities = 9/21 (42%), Positives = 15/21 (71%)

Query: 1  KKKKKKKKKKKKKKKKKRKKK 21
           +KK +K K K +KKK +K++
Sbjct: 74 AEKKAEKAKAKAEKKKAKKEE 94



 Score = 29.0 bits (66), Expect = 1.7
 Identities = 9/32 (28%), Positives = 15/32 (46%)

Query: 1  KKKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
          K+ K  +K +KK +K K K +   A     + 
Sbjct: 66 KELKAWEKAEKKAEKAKAKAEKKKAKKEEPKP 97


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 34.6 bits (80), Expect = 0.041
 Identities = 11/30 (36%), Positives = 15/30 (50%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSK 30
           K K +K  K  KKKK K K K  +  ++  
Sbjct: 194 KLKAEKAAKGGKKKKGKAKAKLNVGGANDD 223



 Score = 33.9 bits (78), Expect = 0.077
 Identities = 10/38 (26%), Positives = 19/38 (50%)

Query: 1  KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
          +K+++K K   K K KK  K  I     +K + +++  
Sbjct: 44 EKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGL 81



 Score = 30.4 bits (69), Expect = 0.94
 Identities = 9/33 (27%), Positives = 15/33 (45%)

Query: 2   KKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQ 34
           K K +K  K  KKKK + K  +    ++ +   
Sbjct: 194 KLKAEKAAKGGKKKKGKAKAKLNVGGANDDDDD 226



 Score = 30.4 bits (69), Expect = 1.1
 Identities = 7/38 (18%), Positives = 16/38 (42%)

Query: 1  KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
             K K KK  K K ++++K           ++++ + 
Sbjct: 52 VAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTP 89



 Score = 30.0 bits (68), Expect = 1.5
 Identities = 10/38 (26%), Positives = 17/38 (44%)

Query: 1  KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
          K K   K K KK  K K ++K        ++ +++  E
Sbjct: 49 KAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEE 86



 Score = 28.9 bits (65), Expect = 2.8
 Identities = 10/22 (45%), Positives = 12/22 (54%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKG 22
            +K K +K  K  KKKK K K 
Sbjct: 192 NEKLKAEKAAKGGKKKKGKAKA 213



 Score = 28.9 bits (65), Expect = 3.1
 Identities = 8/23 (34%), Positives = 8/23 (34%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGI 23
           K  KKKK K K K          
Sbjct: 202 KGGKKKKGKAKAKLNVGGANDDD 224



 Score = 28.9 bits (65), Expect = 3.4
 Identities = 9/37 (24%), Positives = 21/37 (56%), Gaps = 6/37 (16%)

Query: 2  KKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
          + ++K+++K K   K + KK      + K KI+++ +
Sbjct: 41 EDEEKEEEKAKVAAKAKAKK------ALKAKIEEKEK 71



 Score = 28.1 bits (63), Expect = 5.2
 Identities = 8/36 (22%), Positives = 21/36 (58%)

Query: 1  KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDR 36
            K K ++K+K K++K+ K    L   + ++++ ++
Sbjct: 61 ALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEK 96


>gnl|CDD|218148 pfam04557, tRNA_synt_1c_R2, Glutaminyl-tRNA synthetase,
          non-specific RNA binding region part 2.  This is a
          region found N terminal to the catalytic domain of
          glutaminyl-tRNA synthetase (EC 6.1.1.18) in eukaryotes
          but not in Escherichia coli. This region is thought to
          bind RNA in a non-specific manner, enhancing
          interactions between the tRNA and enzyme, but is not
          essential for enzyme function.
          Length = 83

 Score = 32.3 bits (74), Expect = 0.046
 Identities = 15/53 (28%), Positives = 21/53 (39%), Gaps = 3/53 (5%)

Query: 1  KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSGMGKPPHDVAD 53
            KKKKKKKKKK +      K   A +    +     E     +G+  H   +
Sbjct: 22 LVKKKKKKKKKKAEDTAATAKAKKATAEDVSEGAMFMEGF---LGEKFHKPGE 71



 Score = 26.9 bits (60), Expect = 4.3
 Identities = 14/37 (37%), Positives = 19/37 (51%)

Query: 2  KKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
          K +    KKKKKKKKK+ +       + K   +D SE
Sbjct: 17 KTEADLVKKKKKKKKKKAEDTAATAKAKKATAEDVSE 53



 Score = 26.6 bits (59), Expect = 6.4
 Identities = 13/40 (32%), Positives = 19/40 (47%)

Query: 5  KKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSGM 44
          K +    KKKKKK+KKK     +++K K     +     M
Sbjct: 17 KTEADLVKKKKKKKKKKAEDTAATAKAKKATAEDVSEGAM 56


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 33.9 bits (78), Expect = 0.053
 Identities = 16/38 (42%), Positives = 21/38 (55%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
           K K KKKK KKKK K K KK     + S K+  ++  +
Sbjct: 83  KWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAED 120



 Score = 32.4 bits (74), Expect = 0.15
 Identities = 16/32 (50%), Positives = 17/32 (53%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
           K+K K KKKK KKKK K K K         EK
Sbjct: 81  KQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEK 112



 Score = 30.8 bits (70), Expect = 0.64
 Identities = 15/32 (46%), Positives = 20/32 (62%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
           +K K KKKK KKKK K + KK    +  S++K
Sbjct: 82  QKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKK 113



 Score = 28.9 bits (65), Expect = 2.6
 Identities = 13/34 (38%), Positives = 15/34 (44%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQ 34
           KKK K K KK  KK  K +KK         E + 
Sbjct: 93  KKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLT 126


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 33.9 bits (78), Expect = 0.055
 Identities = 17/38 (44%), Positives = 28/38 (73%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
           K+KKKKK+ KK+KK+KK KK+ ++    SK+K + + +
Sbjct: 155 KEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKK 192



 Score = 31.6 bits (72), Expect = 0.32
 Identities = 13/21 (61%), Positives = 16/21 (76%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
           KK+K  + K  KKKKKK+KKK
Sbjct: 173 KKEKMVEPKGSKKKKKKKKKK 193



 Score = 30.5 bits (69), Expect = 0.80
 Identities = 13/21 (61%), Positives = 18/21 (85%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
           ++KK+KKKKK+ KK+KK KK 
Sbjct: 152 EEKKEKKKKKEVKKEKKEKKD 172



 Score = 29.7 bits (67), Expect = 1.4
 Identities = 12/21 (57%), Positives = 20/21 (95%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
           +++KK+KKKKK+ KK+K++KK
Sbjct: 151 EEEKKEKKKKKEVKKEKKEKK 171



 Score = 27.4 bits (61), Expect = 9.1
 Identities = 11/22 (50%), Positives = 18/22 (81%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKG 22
            ++++KK+KKKKK+ KK KK+ 
Sbjct: 149 VEEEEKKEKKKKKEVKKEKKEK 170


>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase;
          Provisional.
          Length = 330

 Score = 34.4 bits (80), Expect = 0.061
 Identities = 9/21 (42%), Positives = 14/21 (66%)

Query: 1  KKKKKKKKKKKKKKKKKRKKK 21
          K +KKK+K++ K  K K K +
Sbjct: 71 KAQKKKEKQEAKAAKAKSKPR 91



 Score = 32.9 bits (76), Expect = 0.21
 Identities = 9/22 (40%), Positives = 13/22 (59%)

Query: 1  KKKKKKKKKKKKKKKKKRKKKG 22
          K   K +KKK+K++ K  K K 
Sbjct: 67 KAWHKAQKKKEKQEAKAAKAKS 88



 Score = 32.1 bits (74), Expect = 0.33
 Identities = 8/21 (38%), Positives = 12/21 (57%)

Query: 1  KKKKKKKKKKKKKKKKKRKKK 21
           K   K +KKK+K++ K  K 
Sbjct: 66 LKAWHKAQKKKEKQEAKAAKA 86



 Score = 30.6 bits (70), Expect = 1.1
 Identities = 9/21 (42%), Positives = 13/21 (61%)

Query: 1  KKKKKKKKKKKKKKKKKRKKK 21
          KK+ K   K +KKK+K+  K 
Sbjct: 63 KKELKAWHKAQKKKEKQEAKA 83



 Score = 29.8 bits (68), Expect = 1.8
 Identities = 8/21 (38%), Positives = 13/21 (61%)

Query: 1  KKKKKKKKKKKKKKKKKRKKK 21
          K+ K   K +KKK+K++ K  
Sbjct: 64 KELKAWHKAQKKKEKQEAKAA 84


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 33.9 bits (78), Expect = 0.066
 Identities = 15/37 (40%), Positives = 18/37 (48%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRS 37
           KKKK+ K K  K+  KK+KKK   A  S K       
Sbjct: 85  KKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPK 121



 Score = 28.9 bits (65), Expect = 3.1
 Identities = 11/38 (28%), Positives = 20/38 (52%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
           K+ K K  K+  KKKKK+      +  ++  + + +SE
Sbjct: 88  KRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSE 125



 Score = 28.1 bits (63), Expect = 5.5
 Identities = 17/46 (36%), Positives = 20/46 (43%), Gaps = 6/46 (13%)

Query: 1   KKKKKKKK------KKKKKKKKKRKKKGILANSSSKEKIQDRSECP 40
           KKK+ K K      KKKKKK     K    A    K+K +  S  P
Sbjct: 86  KKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSERISWAP 131



 Score = 27.7 bits (62), Expect = 6.7
 Identities = 7/21 (33%), Positives = 14/21 (66%)

Query: 1  KKKKKKKKKKKKKKKKKRKKK 21
           +++++ KKKK+ K K  K+ 
Sbjct: 78 LQREERLKKKKRVKTKAYKEP 98



 Score = 27.7 bits (62), Expect = 7.8
 Identities = 8/21 (38%), Positives = 17/21 (80%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
           + ++KK + K +K+K+K+K+K
Sbjct: 161 EIRRKKIQAKARKRKEKKKEK 181


>gnl|CDD|206065 pfam13894, zf-C2H2_4, C2H2-type zinc finger.  This family contains
           a number of divergent C2H2 type zinc fingers.
          Length = 24

 Score = 30.3 bits (68), Expect = 0.074
 Identities = 9/23 (39%), Positives = 15/23 (65%)

Query: 172 FRCPICDRRFSQSSSVTTHMRTH 194
           F+CP+C + FS   ++  H+R H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKH 23



 Score = 28.8 bits (64), Expect = 0.30
 Identities = 10/23 (43%), Positives = 13/23 (56%)

Query: 144 YRCEDCNKSFSQAANLTAHVRTH 166
           ++C  C KSFS    L  H+R H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKH 23



 Score = 28.4 bits (63), Expect = 0.40
 Identities = 11/23 (47%), Positives = 15/23 (65%)

Query: 200 YRCRLCKKAFSDSSTLTKHLRIH 222
           ++C LC K+FS    L +HLR H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKH 23



 Score = 27.2 bits (60), Expect = 1.0
 Identities = 10/23 (43%), Positives = 13/23 (56%)

Query: 228 YQCKLCLLRFSQSGNLNRHMRVH 250
           ++C LC   FS    L RH+R H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKH 23


>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2.  It is vital for
           effective cell-replication that replication is not
           stalled at any point by, for instance, damaged bases.
           Rtf2 stabilizes the replication fork stalled at the
           site-specific replication barrier RTS1 by preventing
           replication restart until completion of DNA synthesis by
           a converging replication fork initiated at a flanking
           origin. The RTS1 element terminates replication forks
           that are moving in the cen2-distal direction while
           allowing forks moving in the cen2-proximal direction to
           pass through the region. Rtf2 contains a C2HC2 motif
           related to the C3HC4 RING-finger motif, and would appear
           to fold up, creating a RING finger-like structure but
           forming only one functional Zn2+ ion-binding site.
          Length = 254

 Score = 33.5 bits (77), Expect = 0.095
 Identities = 18/54 (33%), Positives = 26/54 (48%), Gaps = 2/54 (3%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSEC--PGSGMGKPPHDVA 52
           K + ++++ KKKKKKKK+K K   A  SS E     +      SG G+      
Sbjct: 173 KARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAKK 226


>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19. 
           Med19 represents a family of conserved proteins which
           are members of the multi-protein co-activator Mediator
           complex. Mediator is required for activation of RNA
           polymerase II transcription by DNA binding
           transactivators.
          Length = 178

 Score = 32.9 bits (75), Expect = 0.10
 Identities = 18/43 (41%), Positives = 25/43 (58%), Gaps = 7/43 (16%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSG 43
           KK KKKK +  K++KKK+K+K        K+K +   E PG G
Sbjct: 142 KKHKKKKHEDDKERKKKKKEKK-------KKKKRHSPEHPGVG 177


>gnl|CDD|219564 pfam07771, TSGP1, Tick salivary peptide group 1.  This contains a
           group of peptides derived from a salivary gland cDNA
           library of the tick Ixodes scapularis. Also present are
           peptides from a related tick species, Ixodes ricinus.
           They are characterized by a putative signal peptide
           indicative of secretion and conserved cysteine residues.
          Length = 120

 Score = 32.2 bits (73), Expect = 0.11
 Identities = 15/21 (71%), Positives = 15/21 (71%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
           K KKKKKK KK KK KK  KK
Sbjct: 99  KPKKKKKKSKKTKKPKKSSKK 119



 Score = 31.8 bits (72), Expect = 0.16
 Identities = 14/20 (70%), Positives = 14/20 (70%)

Query: 2   KKKKKKKKKKKKKKKKRKKK 21
           KKKKKK KK KK KK  KK 
Sbjct: 101 KKKKKKSKKTKKPKKSSKKD 120



 Score = 31.8 bits (72), Expect = 0.17
 Identities = 13/20 (65%), Positives = 14/20 (70%)

Query: 1   KKKKKKKKKKKKKKKKKRKK 20
           KKKKKK KK KK KK  +K 
Sbjct: 101 KKKKKKSKKTKKPKKSSKKD 120


>gnl|CDD|224508 COG1592, COG1592, Rubrerythrin [Energy production and conversion].
          Length = 166

 Score = 32.7 bits (75), Expect = 0.14
 Identities = 12/39 (30%), Positives = 13/39 (33%), Gaps = 13/39 (33%)

Query: 140 GEKPYRCEDCNKSFSQAANLTAHVRTHSGEKPFRCPICD 178
             K + C  C               TH GE P  CPIC 
Sbjct: 131 EGKVWVCPVCG-------------YTHEGEAPEVCPICG 156


>gnl|CDD|220611 pfam10169, Laps, Learning-associated protein.  This is a family of
           121-amino acid secretory proteins. Laps functions in the
           regulation of neuronal cell adhesion and/or movement and
           synapse attachment. Laps binds to the ApC/EBP (Aplysia
           CCAAT/enhancer binding protein) promoter and activates
           the transcription of ApC/EBP mRNA.
          Length = 124

 Score = 31.7 bits (72), Expect = 0.15
 Identities = 12/23 (52%), Positives = 18/23 (78%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGI 23
           ++ KK KKK++KKKKKK+  K +
Sbjct: 101 RQAKKLKKKREKKKKKKKGAKNL 123



 Score = 28.7 bits (64), Expect = 1.7
 Identities = 12/21 (57%), Positives = 18/21 (85%)

Query: 2   KKKKKKKKKKKKKKKKRKKKG 22
            +++ KK KKK++KKK+KKKG
Sbjct: 99  NQRQAKKLKKKREKKKKKKKG 119



 Score = 27.5 bits (61), Expect = 4.1
 Identities = 12/21 (57%), Positives = 15/21 (71%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
           KK KKK++KKKKKKK  +   
Sbjct: 104 KKLKKKREKKKKKKKGAKNLA 124


>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
           N-terminal.  SRP is a complex of six distinct
           polypeptides and a 7S RNA that is essential for
           transferring nascent polypeptide chains that are
           destined for export from the cell to the translocation
           apparatus of the endoplasmic reticulum (ER) membrane.
           SRP binds hydrophobic signal sequences as they emerge
           from the ribosome, and arrests translation.
          Length = 272

 Score = 33.2 bits (76), Expect = 0.15
 Identities = 14/58 (24%), Positives = 19/58 (32%), Gaps = 4/58 (6%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKG---ILANSSSKEKIQDRSECPGSGM-GKPPHDVADV 54
                +K  K K   KK  KK     L      + + D S    +      P DV +V
Sbjct: 191 ASSGDEKSPKSKAAPKKAGKKMRKWDLDGDEDDDAVLDYSAPDANDENADAPEDVEEV 248


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 32.3 bits (74), Expect = 0.16
 Identities = 6/20 (30%), Positives = 16/20 (80%)

Query: 1   KKKKKKKKKKKKKKKKKRKK 20
           K+  K K++K+++ ++K+K+
Sbjct: 126 KELAKLKREKRRENERKQKE 145



 Score = 31.9 bits (73), Expect = 0.21
 Identities = 9/22 (40%), Positives = 17/22 (77%)

Query: 3   KKKKKKKKKKKKKKKRKKKGIL 24
           K+  K K++K+++ +RK+K IL
Sbjct: 126 KELAKLKREKRRENERKQKEIL 147



 Score = 29.6 bits (67), Expect = 1.3
 Identities = 6/21 (28%), Positives = 17/21 (80%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
           K K++K+++ ++K+K+  K++
Sbjct: 130 KLKREKRRENERKQKEILKEQ 150



 Score = 29.2 bits (66), Expect = 1.4
 Identities = 6/20 (30%), Positives = 15/20 (75%)

Query: 1   KKKKKKKKKKKKKKKKKRKK 20
            K K++K+++ ++K+K+  K
Sbjct: 129 AKLKREKRRENERKQKEILK 148



 Score = 29.2 bits (66), Expect = 1.8
 Identities = 5/20 (25%), Positives = 16/20 (80%)

Query: 2   KKKKKKKKKKKKKKKKRKKK 21
           K+  K K++K+++ ++++K+
Sbjct: 126 KELAKLKREKRRENERKQKE 145



 Score = 28.4 bits (64), Expect = 2.6
 Identities = 6/21 (28%), Positives = 14/21 (66%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
             K K++K+++ ++K+K   K
Sbjct: 128 LAKLKREKRRENERKQKEILK 148



 Score = 26.9 bits (60), Expect = 9.4
 Identities = 6/21 (28%), Positives = 16/21 (76%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
           K++K+++ ++K+K+  K + K
Sbjct: 132 KREKRRENERKQKEILKEQMK 152


>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
          Length = 135

 Score = 31.6 bits (72), Expect = 0.20
 Identities = 16/21 (76%), Positives = 18/21 (85%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
           K+KK+KKKKKKKKKKK  KK 
Sbjct: 110 KQKKQKKKKKKKKKKKTSKKA 130



 Score = 31.6 bits (72), Expect = 0.20
 Identities = 15/23 (65%), Positives = 19/23 (82%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGI 23
            K+K K+KK+KKKKKKK+KKK  
Sbjct: 105 LKEKAKQKKQKKKKKKKKKKKTS 127



 Score = 30.8 bits (70), Expect = 0.33
 Identities = 16/21 (76%), Positives = 17/21 (80%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
           K+KKKKKKKKKKK  KK  KK
Sbjct: 113 KQKKKKKKKKKKKTSKKAAKK 133



 Score = 30.8 bits (70), Expect = 0.37
 Identities = 16/21 (76%), Positives = 18/21 (85%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
            K+KK+KKKKKKKKKKK  KK
Sbjct: 109 AKQKKQKKKKKKKKKKKTSKK 129



 Score = 30.8 bits (70), Expect = 0.44
 Identities = 15/22 (68%), Positives = 17/22 (77%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKG 22
           KK+KKKKKKKKKKK  K+  K 
Sbjct: 112 KKQKKKKKKKKKKKTSKKAAKK 133



 Score = 30.4 bits (69), Expect = 0.54
 Identities = 15/21 (71%), Positives = 18/21 (85%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
           K K+KK+KKKKKKKKKK+  K
Sbjct: 108 KAKQKKQKKKKKKKKKKKTSK 128



 Score = 30.4 bits (69), Expect = 0.56
 Identities = 16/21 (76%), Positives = 17/21 (80%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
           KKKKKKKKKKK  KK  +KKK
Sbjct: 115 KKKKKKKKKKKTSKKAAKKKK 135



 Score = 30.0 bits (68), Expect = 0.63
 Identities = 16/21 (76%), Positives = 17/21 (80%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
           +KKKKKKKKKKK  KK  KKK
Sbjct: 114 QKKKKKKKKKKKTSKKAAKKK 134



 Score = 30.0 bits (68), Expect = 0.68
 Identities = 14/21 (66%), Positives = 17/21 (80%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
           +KK+KKKKKKKKKKK  +K  
Sbjct: 111 QKKQKKKKKKKKKKKTSKKAA 131



 Score = 27.7 bits (62), Expect = 3.8
 Identities = 13/21 (61%), Positives = 18/21 (85%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
           K+  K+K K+KK+KKKK+KKK
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKK 122



 Score = 27.7 bits (62), Expect = 4.5
 Identities = 15/32 (46%), Positives = 24/32 (75%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
           +  K+K K+KK+KKKKK+KKK   +  ++K+K
Sbjct: 103 QVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKK 134



 Score = 27.3 bits (61), Expect = 5.3
 Identities = 14/31 (45%), Positives = 19/31 (61%)

Query: 2   KKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
           K+  K+K K+KK+KKK+KKK     S    K
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKKAAK 132


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 32.9 bits (75), Expect = 0.20
 Identities = 16/43 (37%), Positives = 22/43 (51%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSG 43
           K KK K KKK KK KKK  K+    +   + + +D S+   S 
Sbjct: 145 KAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESA 187



 Score = 30.2 bits (68), Expect = 1.5
 Identities = 13/41 (31%), Positives = 21/41 (51%)

Query: 2   KKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGS 42
           + K KK K KKK KK +KK+   ++    E+  +  +   S
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKS 183



 Score = 29.0 bits (65), Expect = 3.6
 Identities = 16/38 (42%), Positives = 22/38 (57%), Gaps = 4/38 (10%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
           + K KK K KKK KK K+K+    A  SS +  ++ SE
Sbjct: 143 ETKAKKGKAKKKTKKSKKKE----AKESSDKDDEEESE 176


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 33.0 bits (76), Expect = 0.21
 Identities = 12/38 (31%), Positives = 25/38 (65%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
           KK KK  +K +KK+++++K+K   A +  K++ ++  E
Sbjct: 410 KKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEE 447



 Score = 32.6 bits (75), Expect = 0.24
 Identities = 10/38 (26%), Positives = 22/38 (57%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
           K +KK++++KK+KKKK    K        +++ ++  +
Sbjct: 418 KAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEK 455



 Score = 32.2 bits (74), Expect = 0.30
 Identities = 12/50 (24%), Positives = 19/50 (38%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSGMGKPPHD 50
           KK  KK KK  +K +KKR+++       +    +   E       K    
Sbjct: 406 KKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEK 455



 Score = 32.2 bits (74), Expect = 0.35
 Identities = 10/48 (20%), Positives = 18/48 (37%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSGMGKPP 48
            KK  KK KK  +K +K++++            + + E       K  
Sbjct: 405 SKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKE 452



 Score = 31.0 bits (71), Expect = 0.79
 Identities = 14/38 (36%), Positives = 22/38 (57%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
            +KK++++KK+KKKK    KK        KEK ++  E
Sbjct: 419 AEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKE 456



 Score = 30.7 bits (70), Expect = 0.96
 Identities = 12/38 (31%), Positives = 22/38 (57%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
           +++KK+KKKK    KKK +++        +EK ++  E
Sbjct: 424 EEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEE 461



 Score = 28.7 bits (65), Expect = 4.7
 Identities = 12/38 (31%), Positives = 23/38 (60%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
           +KK+KKKK    KKK++ +++        KE+ ++ +E
Sbjct: 426 EKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAE 463



 Score = 28.3 bits (64), Expect = 6.2
 Identities = 10/37 (27%), Positives = 15/37 (40%)

Query: 2   KKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
             KK  KK KK  +K  KK+        K+    + +
Sbjct: 404 GSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKK 440



 Score = 28.0 bits (63), Expect = 7.2
 Identities = 10/47 (21%), Positives = 23/47 (48%), Gaps = 2/47 (4%)

Query: 1   KKKKKKKKKKKKKKKKKRKK--KGILANSSSKEKIQDRSECPGSGMG 45
           KK    KKK+++++++K KK  +       ++E+ ++  E       
Sbjct: 432 KKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKKQAT 478


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 32.6 bits (75), Expect = 0.27
 Identities = 13/50 (26%), Positives = 23/50 (46%), Gaps = 3/50 (6%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSGMGKPPHD 50
           KK +KK++K +KK +K+  +K   A +  K +   +      G  K    
Sbjct: 412 KKLRKKQRKAEKKAEKEEAEK---AAAKKKAEAAAKKAKGPDGETKKVDP 458



 Score = 28.4 bits (64), Expect = 5.5
 Identities = 12/57 (21%), Positives = 25/57 (43%)

Query: 3   KKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSGMGKPPHDVADVLLSLK 59
             ++KK +KK++K ++K +   A  ++ +K  + +     G       V    L  K
Sbjct: 408 PAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDPDPLGEK 464


>gnl|CDD|203848 pfam08079, Ribosomal_L30_N, Ribosomal L30 N-terminal domain.
          This presumed domain is found at the N-terminus of
          Ribosomal L30 proteins and has been termed RL30NT or
          NUC018.
          Length = 71

 Score = 29.5 bits (67), Expect = 0.33
 Identities = 9/24 (37%), Positives = 15/24 (62%)

Query: 2  KKKKKKKKKKKKKKKKRKKKGILA 25
          + K+ KK+  KK  +K+K+K I  
Sbjct: 13 RAKRAKKRAAKKAARKKKRKLIFK 36



 Score = 28.7 bits (65), Expect = 0.83
 Identities = 8/21 (38%), Positives = 14/21 (66%)

Query: 1  KKKKKKKKKKKKKKKKKRKKK 21
          + K+ KK+  KK  +KK++K 
Sbjct: 13 RAKRAKKRAAKKAARKKKRKL 33



 Score = 28.3 bits (64), Expect = 1.1
 Identities = 10/20 (50%), Positives = 13/20 (65%)

Query: 2  KKKKKKKKKKKKKKKKRKKK 21
          K + K+ KK+  KK  RKKK
Sbjct: 11 KLRAKRAKKRAAKKAARKKK 30



 Score = 28.3 bits (64), Expect = 1.1
 Identities = 10/38 (26%), Positives = 20/38 (52%)

Query: 1  KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
            KK+K+ +K + K+ K++     A    ++ I  R+E
Sbjct: 2  LLKKRKRNEKLRAKRAKKRAAKKAARKKKRKLIFKRAE 39



 Score = 26.8 bits (60), Expect = 3.6
 Identities = 9/21 (42%), Positives = 13/21 (61%)

Query: 1  KKKKKKKKKKKKKKKKKRKKK 21
          +  KK  +KKK+K   KR +K
Sbjct: 20 RAAKKAARKKKRKLIFKRAEK 40


>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
          Length = 859

 Score = 32.1 bits (74), Expect = 0.40
 Identities = 9/32 (28%), Positives = 9/32 (28%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
               K    KK   K  RK K     S    K
Sbjct: 826 DAPAKDPAGKKAAVKFSRKTKQQYVASEKDGK 857



 Score = 30.1 bits (69), Expect = 1.8
 Identities = 7/32 (21%), Positives = 10/32 (31%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
                K    KK   K  +K      +S K+ 
Sbjct: 825 ADAPAKDPAGKKAAVKFSRKTKQQYVASEKDG 856



 Score = 27.8 bits (63), Expect = 9.7
 Identities = 5/21 (23%), Positives = 9/21 (42%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
             K  +K K++    +K  K 
Sbjct: 838 AVKFSRKTKQQYVASEKDGKA 858


>gnl|CDD|165133 PHA02768, PHA02768, hypothetical protein; Provisional.
          Length = 55

 Score = 29.2 bits (65), Expect = 0.40
 Identities = 9/24 (37%), Positives = 15/24 (62%)

Query: 172 FRCPICDRRFSQSSSVTTHMRTHS 195
           + CPIC   + +  S+ TH+R H+
Sbjct: 6   YECPICGEIYIKRKSMITHLRKHN 29


>gnl|CDD|218591 pfam05443, ROS_MUCR, ROS/MUCR transcriptional regulator protein.
           This family consists of several ROS/MUCR transcriptional
           regulator proteins. The ros chromosomal gene is present
           in octopine and nopaline strains of Agrobacterium
           tumefaciens as well as in Rhizobium meliloti. This gene
           encodes a 15.5-kDa protein that specifically represses
           the virC and virD operons in the virulence region of the
           Ti plasmid and is necessary for succinoglycan
           production. Sinorhizobium meliloti can produce two types
           of acidic exopolysaccharides, succinoglycan and
           galactoglucan, that are interchangeable for infection of
           alfalfa nodules. MucR from Sinorhizobium meliloti acts
           as a transcriptional repressor that blocks the
           expression of the exp genes responsible for
           galactoglucan production therefore allowing the
           exclusive production of succinoglycan.
          Length = 130

 Score = 30.6 bits (70), Expect = 0.43
 Identities = 21/74 (28%), Positives = 26/74 (35%), Gaps = 19/74 (25%)

Query: 74  GDPGAPYQVGAPQQFPTASLSYTVHPHQSSRGRPLNSGIRLILIQFIPGKTYARPSTLKT 133
           G P A       +  P   +  +V P                +I    GK +    TLK 
Sbjct: 43  GAPAAAPAAEEEKPTPAVPIKKSVTPDY--------------IICLEDGKKF---KTLKR 85

Query: 134 HLRTHSGEKP--YR 145
           HLRTH G  P  YR
Sbjct: 86  HLRTHHGLTPEEYR 99


>gnl|CDD|221654 pfam12589, WBS_methylT, Methyltransferase involved in
          Williams-Beuren syndrome.  This domain family is found
          in eukaryotes, and is typically between 72 and 83 amino
          acids in length. The family is found in association
          with pfam08241. This family is made up of
          S-adenosylmethionine-dependent methyltransferases. The
          proteins are deleted in Williams-Beuren syndrome (WBS),
          a complex developmental disorder with multisystemic
          manifestations including supravalvular aortic stenosis
          (SVAS) and a specific cognitive phenotype.
          Length = 85

 Score = 29.6 bits (67), Expect = 0.44
 Identities = 13/26 (50%), Positives = 19/26 (73%), Gaps = 4/26 (15%)

Query: 1  KKKKKKKKKKKK----KKKKKRKKKG 22
          KKKKKK KKK K    +KK++ +++G
Sbjct: 43 KKKKKKLKKKSKEWILRKKEQMRRRG 68



 Score = 27.6 bits (62), Expect = 2.4
 Identities = 11/19 (57%), Positives = 15/19 (78%)

Query: 1  KKKKKKKKKKKKKKKKKRK 19
           ++ ++KKKKKKK KKK K
Sbjct: 36 SQRNRRKKKKKKKLKKKSK 54



 Score = 26.1 bits (58), Expect = 8.8
 Identities = 12/24 (50%), Positives = 18/24 (75%)

Query: 1  KKKKKKKKKKKKKKKKKRKKKGIL 24
            ++ ++KKKKKKK KK+ K+ IL
Sbjct: 35 ISQRNRRKKKKKKKLKKKSKEWIL 58


>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
          Length = 1094

 Score = 32.1 bits (73), Expect = 0.46
 Identities = 18/34 (52%), Positives = 23/34 (67%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQ 34
           KKKKKKK K+KKKKK +   K I A  + +E I+
Sbjct: 685 KKKKKKKSKEKKKKKNREASKQIHALENIEEGIE 718


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This
          nucleolar family of proteins are involved in 60S
          ribosomal biogenesis. They are specifically involved in
          the processing beyond the 27S stage of 25S rRNA
          maturation. This family contains sequences that bear
          similarity to the glioma tumour suppressor candidate
          region gene 2 protein (p60). This protein has been
          found to interact with herpes simplex type 1 regulatory
          proteins.
          Length = 387

 Score = 31.6 bits (72), Expect = 0.49
 Identities = 12/36 (33%), Positives = 16/36 (44%)

Query: 2  KKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRS 37
           +   KKK++KK KK  K   IL N S       + 
Sbjct: 50 DEATIKKKRRKKLKKPLKSDEILENRSKVPAPSVKK 85



 Score = 28.5 bits (64), Expect = 5.2
 Identities = 9/34 (26%), Positives = 18/34 (52%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQ 34
             + K+K K ++ K+K+RK+    A    + K +
Sbjct: 272 PVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKK 305



 Score = 27.7 bits (62), Expect = 9.2
 Identities = 8/21 (38%), Positives = 15/21 (71%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
           K  + K+K K ++ K+KR+K+
Sbjct: 271 KPVRPKRKTKAQRNKEKRRKE 291


>gnl|CDD|128804 smart00531, TFIIE, Transcription initiation factor IIE. 
          Length = 147

 Score = 30.4 bits (69), Expect = 0.51
 Identities = 11/48 (22%), Positives = 19/48 (39%), Gaps = 4/48 (8%)

Query: 131 LKTHLRTHSGEKPYRCEDCNKSFSQAANLTAHVRTHSGEKPFRCPICD 178
           L+  L   +    Y+C +C   ++    L A  +    +  F CP C 
Sbjct: 87  LEDKLEDETNNAYYKCPNCQSKYTF---LEA-NQLLDMDGTFTCPRCG 130


>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
          Length = 158

 Score = 30.5 bits (69), Expect = 0.54
 Identities = 11/32 (34%), Positives = 20/32 (62%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
           +K+  K KK+KK+KK+K+  + +    S+K  
Sbjct: 108 EKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTT 139



 Score = 30.5 bits (69), Expect = 0.60
 Identities = 10/32 (31%), Positives = 21/32 (65%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
           K+ ++K+  K KK+KK++K+K +    + K+ 
Sbjct: 104 KETEEKEAIKAKKEKKEKKEKKVAEKLAKKKS 135



 Score = 28.2 bits (63), Expect = 3.8
 Identities = 13/32 (40%), Positives = 19/32 (59%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
           K KK+KK+KK+KK  +K  KK     + +  K
Sbjct: 113 KAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTK 144



 Score = 28.2 bits (63), Expect = 4.1
 Identities = 7/21 (33%), Positives = 15/21 (71%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
           K++K+  K+ ++K+  K KK+
Sbjct: 97  KEQKEVSKETEEKEAIKAKKE 117



 Score = 27.8 bits (62), Expect = 5.2
 Identities = 12/31 (38%), Positives = 20/31 (64%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKE 31
           ++K+  K KK+KK+KK++K    LA   S +
Sbjct: 107 EEKEAIKAKKEKKEKKEKKVAEKLAKKKSTK 137



 Score = 27.8 bits (62), Expect = 5.2
 Identities = 7/21 (33%), Positives = 15/21 (71%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
           +K+  K+ ++K+  K K++KK
Sbjct: 99  QKEVSKETEEKEAIKAKKEKK 119



 Score = 27.0 bits (60), Expect = 8.3
 Identities = 9/30 (30%), Positives = 16/30 (53%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSK 30
           K+  K+ ++K+  K KK KK+      + K
Sbjct: 100 KEVSKETEEKEAIKAKKEKKEKKEKKVAEK 129


>gnl|CDD|220357 pfam09716, ETRAMP, Malarial early transcribed membrane protein
          (ETRAMP).  These sequences represent a family of
          proteins from the malaria parasite Plasmodium
          falciparum, several of which have been shown to be
          expressed specifically in the ring stage as well as the
          rodent parasite Plasmodium yoelii. A homologue from
          Plasmodium chabaudi was localized to the
          parasitophorous vacuole membrane. Members have an
          initial hydrophobic, Phe/Tyr-rich, stretch long enough
          to span the membrane, a highly charged region rich in
          Lys, a second putative transmembrane region and a
          second highly charged, low complexity sequence region.
          Some members have up to 100 residues of additional
          C-terminal sequence. These genes have been shown to be
          found in the sub-telomeric regions of both Plasmodium
          falciparum and P. yoelii chromosomes.
          Length = 82

 Score = 29.1 bits (66), Expect = 0.58
 Identities = 14/29 (48%), Positives = 19/29 (65%)

Query: 1  KKKKKKKKKKKKKKKKKRKKKGILANSSS 29
          K KK KK     +KKKK KKK I+++ +S
Sbjct: 33 KLKKLKKIDDDLEKKKKNKKKIIISSIAS 61



 Score = 27.6 bits (62), Expect = 2.6
 Identities = 12/24 (50%), Positives = 15/24 (62%)

Query: 1  KKKKKKKKKKKKKKKKKRKKKGIL 24
          KK KK KK     +KKK+ KK I+
Sbjct: 32 KKLKKLKKIDDDLEKKKKNKKKII 55


>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family.  The yeast member of this
          family (Kri1p) is found to be required for 40S ribosome
          biogenesis in the nucleolus.
          Length = 99

 Score = 29.5 bits (67), Expect = 0.67
 Identities = 8/35 (22%), Positives = 23/35 (65%), Gaps = 6/35 (17%)

Query: 2  KKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDR 36
          K++K+++K +++++ KR K         +E+I+++
Sbjct: 1  KERKEEEKAQREEELKRLKN------LKREEIEEK 29



 Score = 28.8 bits (65), Expect = 1.2
 Identities = 8/34 (23%), Positives = 22/34 (64%), Gaps = 6/34 (17%)

Query: 1  KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQ 34
          K+++K +++++ K+ K  K++ I      +EK++
Sbjct: 4  KEEEKAQREEELKRLKNLKREEI------EEKLE 31


>gnl|CDD|236691 PRK10426, PRK10426, alpha-glucosidase; Provisional.
          Length = 635

 Score = 31.1 bits (71), Expect = 0.90
 Identities = 22/98 (22%), Positives = 35/98 (35%), Gaps = 21/98 (21%)

Query: 160 TAHVRTHSGEKPFRCPICD------RRFSQSSSVTTHMRTHSGERPYRCRLCKKAFSDSS 213
           T  +RTH G +P      D        F++ + V T ++      PY   L  +A     
Sbjct: 486 TPVMRTHEGNRPGDNWQFDSDAETIAHFARMTRVFTTLK------PYLKELVAEAAKTGL 539

Query: 214 TLTKHLRIHSGEKP--YQCKLCLLRFSQSGNLNRHMRV 249
            + + L +H  +    Y  K   L       L R + V
Sbjct: 540 PVMRPLFLHYEDDAATYTLKYQYL-------LGRDLLV 570



 Score = 30.0 bits (68), Expect = 1.9
 Identities = 18/53 (33%), Positives = 26/53 (49%), Gaps = 8/53 (15%)

Query: 180 RFSQSSSVTTHMRTHSGERPYRCRLCKKAFSDSSTLTKHL----RIHSGEKPY 228
           R+ + S+ T  MRTH G RP       +  SD+ T+  H     R+ +  KPY
Sbjct: 478 RWCEFSAFTPVMRTHEGNRP---GDNWQFDSDAETI-AHFARMTRVFTTLKPY 526


>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355).  This
          family of proteins is found in bacteria and viruses.
          Proteins in this family are typically between 180 and
          214 amino acids in length.
          Length = 125

 Score = 29.5 bits (67), Expect = 0.92
 Identities = 13/32 (40%), Positives = 22/32 (68%), Gaps = 3/32 (9%)

Query: 1  KKKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
           K+K K +KK+++KK + +K   LA  S++EK
Sbjct: 18 AKEKAKWEKKQEEKKSEAEK---LAKMSAEEK 46


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 31.2 bits (71), Expect = 0.96
 Identities = 12/49 (24%), Positives = 18/49 (36%)

Query: 1    KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSGMGKPPH 49
             KK+ +      KKKKK +KK      S     Q  +      + +P  
Sbjct: 1310 VKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRK 1358



 Score = 30.0 bits (68), Expect = 2.1
 Identities = 13/52 (25%), Positives = 20/52 (38%)

Query: 2    KKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSGMGKPPHDVAD 53
            K+++ K K K K  K RK K        K+   D+S+           D  +
Sbjct: 1154 KEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDE 1205



 Score = 29.2 bits (66), Expect = 3.6
 Identities = 11/37 (29%), Positives = 18/37 (48%)

Query: 1    KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRS 37
            K KK+ +      KKKK+ +K       SK +++  S
Sbjct: 1309 KVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQAS 1345



 Score = 28.9 bits (65), Expect = 4.9
 Identities = 16/42 (38%), Positives = 22/42 (52%)

Query: 1    KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGS 42
             K K  K +K K KKK++KKK   A+ S K  +   S+   S
Sbjct: 1162 TKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDS 1203



 Score = 28.1 bits (63), Expect = 8.1
 Identities = 15/47 (31%), Positives = 19/47 (40%)

Query: 1    KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSGMGKP 47
             K +K K KKK+KKKKK         S      +  S+       KP
Sbjct: 1167 SKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKP 1213



 Score = 27.7 bits (62), Expect = 9.7
 Identities = 14/37 (37%), Positives = 18/37 (48%)

Query: 1    KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRS 37
            + K K K K  K +K K KKK      SS +K +  S
Sbjct: 1157 RLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKAS 1193


>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
           complex component).  RNases H are enzymes that
           specifically hydrolyse RNA when annealed to a
           complementary DNA and are present in all living
           organisms. In yeast RNase H2 is composed of a complex of
           three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
           family represents the homologues of Ydr279p. It is not
           known whether non yeast proteins in this family fulfil
           the same function.
          Length = 287

 Score = 30.4 bits (69), Expect = 1.0
 Identities = 11/22 (50%), Positives = 12/22 (54%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKG 22
            KKKK K+ K  K  KK   KG
Sbjct: 258 IKKKKPKESKGVKALKKVVAKG 279



 Score = 30.4 bits (69), Expect = 1.2
 Identities = 10/21 (47%), Positives = 13/21 (61%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
            K++ KKKK K+ K  K  KK
Sbjct: 254 SKEEIKKKKPKESKGVKALKK 274



 Score = 29.6 bits (67), Expect = 1.7
 Identities = 9/21 (42%), Positives = 13/21 (61%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
           K K++ KKKK K+ K  +  K
Sbjct: 253 KSKEEIKKKKPKESKGVKALK 273



 Score = 29.2 bits (66), Expect = 2.2
 Identities = 10/21 (47%), Positives = 15/21 (71%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
           KK+K K++ KKKK K+ +  K
Sbjct: 250 KKRKSKEEIKKKKPKESKGVK 270


>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
          Length = 213

 Score = 30.3 bits (69), Expect = 1.0
 Identities = 7/32 (21%), Positives = 16/32 (50%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
            +K+K  ++++K K K  +KK        + +
Sbjct: 126 GEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQ 157



 Score = 27.6 bits (62), Expect = 8.0
 Identities = 7/21 (33%), Positives = 14/21 (66%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
            KK++   +K+K  +++RK K
Sbjct: 119 AKKREAAGEKEKAPRRERKPK 139


>gnl|CDD|226535 COG4049, COG4049, Uncharacterized protein containing archaeal-type
           C2H2 Zn-finger [General function prediction only].
          Length = 65

 Score = 28.0 bits (62), Expect = 1.0
 Identities = 10/29 (34%), Positives = 13/29 (44%)

Query: 163 VRTHSGEKPFRCPICDRRFSQSSSVTTHM 191
           VR   GE+  RCP C   F +      H+
Sbjct: 9   VRDRDGEEFLRCPRCGMVFRRRKDYIRHV 37


>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
           This is a family of fungal proteins whose function is
           unknown.
          Length = 130

 Score = 29.5 bits (67), Expect = 1.1
 Identities = 8/21 (38%), Positives = 18/21 (85%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
           +K+K+ +K ++KK K+++K+K
Sbjct: 108 EKEKRTRKNREKKFKRRQKEK 128



 Score = 29.2 bits (66), Expect = 1.2
 Identities = 7/22 (31%), Positives = 17/22 (77%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKG 22
           K++ +K+K+ +K ++KK K++ 
Sbjct: 104 KERAEKEKRTRKNREKKFKRRQ 125



 Score = 29.2 bits (66), Expect = 1.5
 Identities = 8/21 (38%), Positives = 17/21 (80%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
            +K+K+ +K ++KK K+R+K+
Sbjct: 107 AEKEKRTRKNREKKFKRRQKE 127



 Score = 28.8 bits (65), Expect = 1.6
 Identities = 7/21 (33%), Positives = 17/21 (80%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
           +++ K++ +K+K+ +K R+KK
Sbjct: 100 RERTKERAEKEKRTRKNREKK 120



 Score = 28.8 bits (65), Expect = 2.0
 Identities = 7/20 (35%), Positives = 17/20 (85%)

Query: 1   KKKKKKKKKKKKKKKKKRKK 20
           K+ +K ++KK K+++K+++K
Sbjct: 111 KRTRKNREKKFKRRQKEKEK 130



 Score = 28.4 bits (64), Expect = 2.3
 Identities = 8/22 (36%), Positives = 17/22 (77%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKG 22
           + +K+K+ +K ++KK KR++K 
Sbjct: 106 RAEKEKRTRKNREKKFKRRQKE 127



 Score = 28.4 bits (64), Expect = 2.7
 Identities = 8/21 (38%), Positives = 17/21 (80%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
           +K+ +K ++KK K+++K K+K
Sbjct: 110 EKRTRKNREKKFKRRQKEKEK 130



 Score = 26.9 bits (60), Expect = 9.3
 Identities = 5/21 (23%), Positives = 15/21 (71%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
             + ++++ K++ +K+KR +K
Sbjct: 95  ALRLRRERTKERAEKEKRTRK 115


>gnl|CDD|226202 COG3677, COG3677, Transposase and inactivated derivatives [DNA
           replication, recombination, and repair].
          Length = 129

 Score = 29.3 bits (66), Expect = 1.1
 Identities = 21/90 (23%), Positives = 32/90 (35%), Gaps = 17/90 (18%)

Query: 154 SQAANLTAHVRTHSGEKPFRCPICDRRFSQSSSVTTHMRTHSGERPYRCRLCKKAFSD-- 211
              A+    +R    +    CP C      SS+V        G + Y+C+ C   F+   
Sbjct: 15  IALADAAYAIRMQITKV--NCPRCK-----SSNVVKIGGIRRGHQRYKCKSCGSTFTVET 67

Query: 212 SSTLTKHLRIHSGEKPYQCKLCLLRFSQSG 241
            S L+K L        Y+ KL  +     G
Sbjct: 68  GSPLSKAL--------YKIKLQAVTLYMLG 89


>gnl|CDD|179886 PRK04860, PRK04860, hypothetical protein; Provisional.
          Length = 160

 Score = 29.9 bits (68), Expect = 1.1
 Identities = 13/36 (36%), Positives = 17/36 (47%), Gaps = 4/36 (11%)

Query: 171 PFRCPICDRRFSQSSSVTTHMRTHSGERPYRCRLCK 206
           P+RC   + + +    V  H R   GE  YRCR C 
Sbjct: 119 PYRCKCQEHQLT----VRRHNRVVRGEAVYRCRRCG 150



 Score = 27.2 bits (61), Expect = 8.7
 Identities = 16/50 (32%), Positives = 24/50 (48%), Gaps = 12/50 (24%)

Query: 192 RTHS-------GER-PYRCRLCKKAFSDSSTLTKHLRIHSGEKPYQCKLC 233
           RTH        G+  PYRC+  +       T+ +H R+  GE  Y+C+ C
Sbjct: 104 RTHQFEVQSVRGKTFPYRCKCQEHQL----TVRRHNRVVRGEAVYRCRRC 149


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 30.9 bits (69), Expect = 1.2
 Identities = 3/32 (9%), Positives = 13/32 (40%)

Query: 1    KKKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
             K + +    + +  +++ +        +K+K
Sbjct: 1348 AKAEAEAAADEAEAAEEKAEAAEKKKEEAKKK 1379



 Score = 29.0 bits (64), Expect = 4.5
 Identities = 7/36 (19%), Positives = 16/36 (44%)

Query: 1    KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDR 36
            KKK ++ KK  +  K + +     A ++ ++     
Sbjct: 1335 KKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAE 1370



 Score = 27.8 bits (61), Expect = 9.4
 Identities = 4/34 (11%), Positives = 11/34 (32%)

Query: 2    KKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQD 35
             K + +    + +  + K +         +K  D
Sbjct: 1348 AKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKAD 1381



 Score = 27.8 bits (61), Expect = 9.6
 Identities = 10/31 (32%), Positives = 19/31 (61%)

Query: 2    KKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
            +  ++K +  +KKK++ KKK   A   ++EK
Sbjct: 1360 EAAEEKAEAAEKKKEEAKKKADAAKKKAEEK 1390


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 30.3 bits (69), Expect = 1.2
 Identities = 12/59 (20%), Positives = 17/59 (28%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSGMGKPPHDVADVLLSLK 59
               K   KKK K +    KK    N+  K+   +  +           D  D    L 
Sbjct: 85  AAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLD 143



 Score = 28.8 bits (65), Expect = 4.0
 Identities = 12/38 (31%), Positives = 15/38 (39%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
           KK K   K    K   K+K K  L +S   EK     +
Sbjct: 77  KKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDK 114



 Score = 28.8 bits (65), Expect = 4.5
 Identities = 11/38 (28%), Positives = 17/38 (44%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
           KKK K   K    K   +KK     +SS K + ++  +
Sbjct: 76  KKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALD 113



 Score = 28.4 bits (64), Expect = 6.0
 Identities = 10/32 (31%), Positives = 12/32 (37%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
             KKK K   K    K   KK +     S +K
Sbjct: 74  IPKKKTKTAAKAAAAKAPAKKKLKDELDSSKK 105


>gnl|CDD|222466 pfam13945, NST1, Splicing factor, salt tolerance regulator.  NST1
          is a family of proteins that seem to be involved,
          directly or indirectly, in the salt sensitivity of some
          cellular functions in yeast. These proteins also
          interact with the splicing factor Msl1p.
          Length = 189

 Score = 29.9 bits (67), Expect = 1.2
 Identities = 10/28 (35%), Positives = 13/28 (46%)

Query: 7  KKKKKKKKKKKRKKKGILANSSSKEKIQ 34
          K KKKKKK+ K       A+  S   + 
Sbjct: 32 KSKKKKKKRSKATSPSHNASDQSTNNVM 59


>gnl|CDD|116890 pfam08307, Glyco_hydro_98C, Glycosyl hydrolase family 98 C-terminal
           domain.  This putative domain is found at the C-terminus
           of glycosyl hydrolase family 98 proteins. This domain is
           not expected to form part of the catalytic activity.
          Length = 231

 Score = 30.0 bits (67), Expect = 1.2
 Identities = 17/80 (21%), Positives = 27/80 (33%), Gaps = 2/80 (2%)

Query: 90  TASLSYTVHPHQSSRGRPLNSGIRLILIQFIPGKTYARPSTLKTHLRTHSGEKPYRCEDC 149
             SL+ T+ PH           + + L  +   K   R  T+   L   S    Y     
Sbjct: 138 CKSLNMTLGPHTYGVVNENADALSIYLNNYRTEKGPLRTDTIT--LSGASSTPTYSNVFG 195

Query: 150 NKSFSQAANLTAHVRTHSGE 169
           ++   QA+ +TA   T    
Sbjct: 196 DRGEYQASTVTATPSTDGLT 215


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 30.5 bits (69), Expect = 1.3
 Identities = 13/21 (61%), Positives = 19/21 (90%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
           K KKKKKK+KKK+++ KR++K
Sbjct: 744 KTKKKKKKEKKKEEEYKREEK 764



 Score = 28.1 bits (63), Expect = 7.8
 Identities = 12/23 (52%), Positives = 18/23 (78%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGI 23
            KKKKKK+KKK+++ K+ +K  I
Sbjct: 745 TKKKKKKEKKKEEEYKREEKARI 767



 Score = 27.8 bits (62), Expect = 8.9
 Identities = 11/21 (52%), Positives = 19/21 (90%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
           ++K KKKKKK+KKK+++ K++
Sbjct: 742 EEKTKKKKKKEKKKEEEYKRE 762


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 30.2 bits (68), Expect = 1.3
 Identities = 12/38 (31%), Positives = 25/38 (65%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
           +++K+K++ K++KKKKK K K    +   KE+ +++  
Sbjct: 105 EEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRP 142



 Score = 29.5 bits (66), Expect = 2.5
 Identities = 13/38 (34%), Positives = 21/38 (55%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
           K++KKKKK+K K++ K RK K         ++ +   E
Sbjct: 114 KEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKE 151



 Score = 29.1 bits (65), Expect = 3.5
 Identities = 8/32 (25%), Positives = 20/32 (62%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
            +  K+++K+K++ K+++KKK        K++
Sbjct: 100 NESGKEEEKEKEQVKEEKKKKKEKPKEEPKDR 131



 Score = 28.7 bits (64), Expect = 4.1
 Identities = 11/38 (28%), Positives = 23/38 (60%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
           ++K+K++ K++KKKKK++ K+        +E  + R  
Sbjct: 106 EEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPP 143


>gnl|CDD|179595 PRK03564, PRK03564, formate dehydrogenase accessory protein FdhE;
           Provisional.
          Length = 309

 Score = 30.0 bits (68), Expect = 1.3
 Identities = 15/50 (30%), Positives = 19/50 (38%), Gaps = 3/50 (6%)

Query: 156 AANLTAHVRTHSGEKPFRCPICDRRFSQSSSVTTHMRTHSGERPYRCRLC 205
           A  +    R   GE+   CP+C    S   S    + T  G R   C LC
Sbjct: 172 AQQIPGKARAEYGEQRQFCPVCG---SMPVSSVVQIGTTQGLRYLHCNLC 218


>gnl|CDD|220121 pfam09135, Alb1, Alb1.  Alb1 is a nuclear shuttling factor
          involved in ribosome biogenesis.
          Length = 112

 Score = 29.0 bits (65), Expect = 1.4
 Identities = 12/22 (54%), Positives = 19/22 (86%)

Query: 1  KKKKKKKKKKKKKKKKKRKKKG 22
          KK++KKK K+  +K++KR+KKG
Sbjct: 49 KKQRKKKPKRLTRKQRKRQKKG 70



 Score = 26.3 bits (58), Expect = 9.1
 Identities = 10/35 (28%), Positives = 18/35 (51%)

Query: 2  KKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDR 36
            KKK++KKK K+  ++++K         E + D 
Sbjct: 46 VSKKKQRKKKPKRLTRKQRKRQKKGLERAEIVTDT 80


>gnl|CDD|220661 pfam10263, SprT-like, SprT-like family.  This family represents a
           domain found in eukaryotes and prokaryotes. The domain
           contains a characteristic motif of the zinc
           metallopeptidases. This family includes the bacterial
           SprT protein.
          Length = 153

 Score = 29.3 bits (66), Expect = 1.4
 Identities = 8/37 (21%), Positives = 12/37 (32%), Gaps = 8/37 (21%)

Query: 173 RCPICDRRFSQSSSVTTHMRTHSGERPYRCRLCKKAF 209
           RC  C + + +   +  H         YRC  C    
Sbjct: 121 RCGSCGQLYPRKRRIRRHK--------YRCGRCGGKL 149


>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992).  This
           bacterial family of proteins has no known function.
           However, the cis-regulatory yjdF motif, just upstream
           from the gene encoding the proteins for this family, is
           a small non-coding RNA, Rfam:RF01764. The yjdF motif is
           found in many Firmicutes, including Bacillus subtilis.
           In most cases, it resides in potential 5' UTRs of
           homologues of the yjdF gene whose function is unknown.
           However, in Streptococcus thermophilus, a yjdF RNA motif
           is associated with an operon whose protein products
           synthesise nicotinamide adenine dinucleotide (NAD+).
           Also, the S. thermophilus yjdF RNA lacks typical yjdF
           motif consensus features downstream of and including the
           P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
           S. thermophilus RNAs might sense a distinct compound
           that structurally resembles the ligand bound by other
           yjdF RNAs. On the ohter hand, perhaps these RNAs have an
           alternative solution forming a similar binding site, as
           is observed with some SAM riboswitches.
          Length = 132

 Score = 29.1 bits (66), Expect = 1.4
 Identities = 9/22 (40%), Positives = 19/22 (86%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKG 22
           K+++K++K++ K++KKK K +G
Sbjct: 110 KEEEKERKRQLKQQKKKAKHRG 131


>gnl|CDD|237276 PRK13024, PRK13024, bifunctional preprotein translocase subunit
           SecD/SecF; Reviewed.
          Length = 755

 Score = 30.2 bits (69), Expect = 1.4
 Identities = 11/21 (52%), Positives = 16/21 (76%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
           +K++ KKKKK+KK KK   +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753



 Score = 29.0 bits (66), Expect = 3.4
 Identities = 11/22 (50%), Positives = 16/22 (72%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKG 22
           K++ KKKKK+KK KK + +K  
Sbjct: 734 KRRLKKKKKRKKVKKWEVEKIE 755


>gnl|CDD|219547 pfam07741, BRF1, Brf1-like TBP-binding domain.  This region
          covers both the Brf homology II and III regions. This
          region is involved in binding TATA binding protein.
          Length = 95

 Score = 28.4 bits (64), Expect = 1.5
 Identities = 9/21 (42%), Positives = 13/21 (61%)

Query: 1  KKKKKKKKKKKKKKKKKRKKK 21
          K  +     KKKKK+K +KK+
Sbjct: 40 KADEGNNSGKKKKKRKAKKKR 60



 Score = 28.4 bits (64), Expect = 1.8
 Identities = 9/22 (40%), Positives = 13/22 (59%)

Query: 1  KKKKKKKKKKKKKKKKKRKKKG 22
          K+K  +     KKKKK++ KK 
Sbjct: 38 KQKADEGNNSGKKKKKRKAKKK 59



 Score = 28.0 bits (63), Expect = 1.9
 Identities = 12/29 (41%), Positives = 16/29 (55%)

Query: 1  KKKKKKKKKKKKKKKKKRKKKGILANSSS 29
          +K  +     KKKKK+K KKK   A  +S
Sbjct: 39 QKADEGNNSGKKKKKRKAKKKRDEAGPAS 67



 Score = 28.0 bits (63), Expect = 2.0
 Identities = 10/21 (47%), Positives = 12/21 (57%)

Query: 1  KKKKKKKKKKKKKKKKKRKKK 21
           K+K  +     KKKKKRK K
Sbjct: 37 LKQKADEGNNSGKKKKKRKAK 57



 Score = 27.2 bits (61), Expect = 3.7
 Identities = 8/21 (38%), Positives = 13/21 (61%)

Query: 1  KKKKKKKKKKKKKKKKKRKKK 21
          K+ K+K  +     KKK+K+K
Sbjct: 35 KELKQKADEGNNSGKKKKKRK 55



 Score = 26.4 bits (59), Expect = 8.2
 Identities = 10/34 (29%), Positives = 18/34 (52%)

Query: 1  KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQ 34
            +     KKKKK+K K+K+      S++ E ++
Sbjct: 41 ADEGNNSGKKKKKRKAKKKRDEAGPASTAAEAVK 74


>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
          Length = 813

 Score = 30.5 bits (69), Expect = 1.5
 Identities = 11/30 (36%), Positives = 15/30 (50%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSK 30
            K K  KK  +K KK   K + I A + +K
Sbjct: 775 AKPKAAKKDARKAKKPSAKTQKIAAATKAK 804



 Score = 28.6 bits (64), Expect = 5.5
 Identities = 12/33 (36%), Positives = 17/33 (51%), Gaps = 5/33 (15%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKI 33
           + +KK K K  KK  +K KK      S+  +KI
Sbjct: 770 RGEKKAKPKAAKKDARKAKK-----PSAKTQKI 797


>gnl|CDD|213534 TIGR00486, YbgI_SA1388, dinuclear metal center protein, YbgI/SA1388
           family.  The characterization of this family of
           uncharacterized proteins as orthologous is tentative.
           Members are found in all three domains of life. Several
           members (from Bacillus subtilis, Listeria monocytogenes,
           and Mycobacterium tuberculosis - all classified as
           Firmicutes within the Eubacteria) share a long insert
           relative to other members [Unknown function, General].
          Length = 249

 Score = 29.7 bits (67), Expect = 1.5
 Identities = 12/62 (19%), Positives = 23/62 (37%), Gaps = 5/62 (8%)

Query: 12  KKKKKKRKKKGILANSSSKEKIQDRSECPGSGMGKPPHDV---ADVLLS--LKHAVVHPG 66
            + KK    K +L   +  E ++  +   GSG+      +    D  ++  L H   H  
Sbjct: 146 LEIKKVLNVKPLLVVKNGPEYVKKVAVVSGSGLSFIMKALREGVDAYITGDLSHHTAHLA 205

Query: 67  QM 68
           + 
Sbjct: 206 RE 207


>gnl|CDD|193409 pfam12936, Kri1_C, KRI1-like family C-terminal.  The yeast member
          of this family (Kri1p) is found to be required for 40S
          ribosome biogenesis in the nucleolus. This is the
          C-terminal domain of the family.
          Length = 93

 Score = 28.3 bits (64), Expect = 1.6
 Identities = 8/24 (33%), Positives = 15/24 (62%)

Query: 1  KKKKKKKKKKKKKKKKKRKKKGIL 24
          ++++K KKK  KK + +  KK + 
Sbjct: 70 EERRKDKKKYGKKARLREWKKKVF 93



 Score = 27.2 bits (61), Expect = 4.1
 Identities = 9/21 (42%), Positives = 15/21 (71%)

Query: 1  KKKKKKKKKKKKKKKKKRKKK 21
          K++++K KKK  KK + R+ K
Sbjct: 69 KEERRKDKKKYGKKARLREWK 89


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 29.9 bits (68), Expect = 1.6
 Identities = 9/31 (29%), Positives = 24/31 (77%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKE 31
           +++++++ ++KK++KKK +++  LA  S +E
Sbjct: 277 EEERQEEAQEKKEEKKKEEREAKLAKLSPEE 307


>gnl|CDD|220129 pfam09159, Ydc2-catalyt, Mitochondrial resolvase Ydc2 / RNA
           splicing MRS1.  Members of this family adopt a secondary
           structure consisting of two beta sheets and one alpha
           helix, arranged as a beta-alpha-beta motif. Each beta
           sheet has five strands, arranged in a 32145 order, with
           the second strand being antiparallel to the rest.
           Mitochondrial resolvase Ydc2 is capable of resolving
           Holliday junctions and cleaves DNA after 5'-CT-3' and
           5'-TT-3' sequences. This family also contains the
           mitochondrial RNA-splicing protein MRS1 which is
           involved in the excision of group I introns.
          Length = 254

 Score = 29.6 bits (67), Expect = 1.7
 Identities = 12/31 (38%), Positives = 15/31 (48%), Gaps = 2/31 (6%)

Query: 1   KKKKKKKKKKKKKKKKKRKK--KGILANSSS 29
           KKK + KKK  K  KK R       L+ + S
Sbjct: 155 KKKPRSKKKSSKNSKKLRIDLVGEWLSTAIS 185


>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
           eukaryotic RNase H complex which cleaves RNA-DNA
           hybrids.  Ribonuclease H2B is one of the three proteins
           of eukaryotic RNase H2 complex that is required for
           nucleic acid binding and hydrolysis. RNase H is
           classified into two families, type I (prokaryotic RNase
           HI, eukaryotic RNase H1 and viral RNase H) and type II
           (prokaryotic RNase HII and HIII, and eukaryotic RNase
           H2/HII). RNase H endonucleolytically hydrolyzes an RNA
           strand when it is annealed to a complementary DNA strand
           in the presence of divalent cations, in DNA replication
           and repair. The enzyme can be found in bacteria,
           archaea, and eukaryotes. Most prokaryotic and eukaryotic
           genomes contain multiple RNase H genes. Despite a lack
           of evidence for homology from sequence comparisons, type
           I and type II RNase H share a common fold and similar
           steric configurations of the four acidic active-site
           residues, suggesting identical or very similar catalytic
           mechanisms. Eukaryotic RNase HII is active during
           replication and is believed to play a role in removal of
           Okazaki fragment primers and single ribonucleotides in
           DNA-DNA duplexes. Eukaryotic RNase HII is functional
           when it forms a complex with RNase H2B and RNase H2C
           proteins. It is speculated that the two accessory
           subunits are required for correct folding of the
           catalytic subunit of RNase HII. Mutations in the three
           subunits of human RNase HII cause neurological disorder.
          Length = 211

 Score = 29.6 bits (67), Expect = 1.7
 Identities = 14/22 (63%), Positives = 15/22 (68%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKG 22
           +KKKKKKK  KKKK KK    G
Sbjct: 181 EKKKKKKKSAKKKKLKKVAAVG 202



 Score = 28.0 bits (63), Expect = 5.6
 Identities = 11/21 (52%), Positives = 11/21 (52%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
           KKKK  KKKK KK      K 
Sbjct: 185 KKKKSAKKKKLKKVAAVGMKA 205



 Score = 27.3 bits (61), Expect = 8.8
 Identities = 13/21 (61%), Positives = 16/21 (76%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
            +++KKKKKKK  KKKK KK 
Sbjct: 178 AEEEKKKKKKKSAKKKKLKKV 198


>gnl|CDD|219978 pfam08701, GN3L_Grn1, GNL3L/Grn1 putative GTPase.  Grn1 (yeast)
          and GNL3L (human) are putative GTPases which are
          required for growth and play a role in processing of
          nucleolar pre-rRNA. This family contains a potential
          nuclear localisation signal.
          Length = 80

 Score = 28.0 bits (63), Expect = 1.8
 Identities = 13/35 (37%), Positives = 19/35 (54%), Gaps = 2/35 (5%)

Query: 1  KKKKKKKKKKKKKKKKKRKKK--GILANSSSKEKI 33
            +K +K+ KK    K +KKK  GI  +   KE+I
Sbjct: 13 HHRKLRKEAKKNPTWKSKKKKDPGIPNSFPFKEEI 47


>gnl|CDD|214659 smart00433, TOP2c, TopoisomeraseII.  Eukaryotic DNA topoisomerase
           II, GyrB, ParE.
          Length = 594

 Score = 29.8 bits (68), Expect = 1.8
 Identities = 10/40 (25%), Positives = 17/40 (42%), Gaps = 6/40 (15%)

Query: 2   KKKKKKKKKKKKKKKKRKKKGI--LANSSSKEKIQDRSEC 39
           +   KK ++  +KKK         LA++SS        +C
Sbjct: 347 RAAAKKARELTRKKKLSSISLPGKLADASSA----GPKKC 382


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This
           family consists of several bovine specific leukaemia
           virus receptors which are thought to function as
           transmembrane proteins, although their exact function is
           unknown.
          Length = 561

 Score = 30.0 bits (67), Expect = 1.8
 Identities = 13/21 (61%), Positives = 18/21 (85%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
           KK KKK+KK+K+K++ K KKK
Sbjct: 200 KKPKKKEKKEKEKERDKDKKK 220



 Score = 28.9 bits (64), Expect = 3.7
 Identities = 22/71 (30%), Positives = 30/71 (42%), Gaps = 4/71 (5%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSGMGKPPHDVADVLLSLKH 60
           KK KK KKK+KK+K+K+R K         K  +    + P S       D A    SL +
Sbjct: 197 KKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDSPASAASVAEADEA----SLAN 252

Query: 61  AVVHPGQMTPP 71
            V      + P
Sbjct: 253 TVSGTAPDSEP 263



 Score = 28.5 bits (63), Expect = 6.1
 Identities = 8/21 (38%), Positives = 17/21 (80%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
           +K K++KKK++K+K+ +R+  
Sbjct: 95  EKDKREKKKREKEKRGRRRHH 115



 Score = 28.1 bits (62), Expect = 7.4
 Identities = 8/29 (27%), Positives = 21/29 (72%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSS 29
           + +++ +K K++KKK++++K+G   + S 
Sbjct: 89  RHRQRLEKDKREKKKREKEKRGRRRHHSL 117



 Score = 27.7 bits (61), Expect = 9.0
 Identities = 10/30 (33%), Positives = 21/30 (70%)

Query: 2   KKKKKKKKKKKKKKKKRKKKGILANSSSKE 31
           +K K++KKK++K+K+ R++   L   S ++
Sbjct: 95  EKDKREKKKREKEKRGRRRHHSLGTESDED 124


>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe.  JAK1 is a
           tyrosine kinase protein essential in signaling type I
           and type II cytokines. It interacts with the gamma chain
           of type I cytokine receptors to elicit signals from the
           IL-2 receptor family, the IL-4 receptor family, the
           gp130 receptor family, ciliary neurotrophic factor
           receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
           Leptin-R). It also is involved in transducing a signal
           by type I (IFN-alpha/beta) and type II (IFN-gamma)
           interferons, and members of the IL-10 family via type II
           cytokine receptors. JAK (also called Just Another
           Kinase) is a family of intracellular, non-receptor
           tyrosine kinases that transduce cytokine-mediated
           signals via the JAK-STAT pathway. The JAK family in
           mammals consists of 4 members: JAK1, JAK2, JAK3 and
           TYK2. JAKs are composed of seven JAK homology (JH)
           domains (JH1-JH7) . The C-terminal JH1 domain is the
           main catalytic domain, followed by JH2, which is often
           referred to as a pseudokinase domain, followed by
           JH3-JH4 which is homologous to the SH2 domain, and
           lastly JH5-JH7 which is a FERM domain.  Named after
           Janus, the two-faced Roman god of doorways, JAKs possess
           two near-identical phosphate-transferring domains; one
           which displays the kinase activity (JH1), while the
           other negatively regulates the kinase activity of the
           first (JH2). The FERM domain has a cloverleaf tripart
           structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
           C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
           C-lobe/F3 within the FERM domain is part of the PH
           domain family. The FERM domain is found in the
           cytoskeletal-associated proteins such as ezrin, moesin,
           radixin, 4.1R, and merlin. These proteins provide a link
           between the membrane and cytoskeleton and are involved
           in signal transduction pathways. The FERM domain is also
           found in protein tyrosine phosphatases (PTPs) , the
           tyrosine kinases FAK and JAK, in addition to other
           proteins involved in signaling. This domain is
           structurally similar to the PH and PTB domains and
           consequently is capable of binding to both peptides and
           phospholipids at different sites.
          Length = 198

 Score = 29.4 bits (66), Expect = 1.9
 Identities = 13/21 (61%), Positives = 13/21 (61%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
           KKKK K KK K K KK   KK
Sbjct: 91  KKKKGKSKKNKLKGKKDEDKK 111



 Score = 29.0 bits (65), Expect = 2.7
 Identities = 13/21 (61%), Positives = 14/21 (66%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
           KKK K KK K K KK + KKK
Sbjct: 92  KKKGKSKKNKLKGKKDEDKKK 112



 Score = 28.3 bits (63), Expect = 4.2
 Identities = 11/21 (52%), Positives = 13/21 (61%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
           KK K KK K K KK + +KK 
Sbjct: 93  KKGKSKKNKLKGKKDEDKKKA 113



 Score = 27.9 bits (62), Expect = 6.0
 Identities = 12/21 (57%), Positives = 13/21 (61%)

Query: 2   KKKKKKKKKKKKKKKKRKKKG 22
           KKKK K KK K K KK + K 
Sbjct: 91  KKKKGKSKKNKLKGKKDEDKK 111


>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62. 
          Length = 217

 Score = 29.4 bits (66), Expect = 2.0
 Identities = 13/19 (68%), Positives = 13/19 (68%)

Query: 1   KKKKKKKKKKKKKKKKKRK 19
           K   KK KK KKKKKKKR 
Sbjct: 199 KSSAKKDKKSKKKKKKKRS 217


>gnl|CDD|220231 pfam09420, Nop16, Ribosome biogenesis protein Nop16.  Nop16 is a
          protein involved in ribosome biogenesis.
          Length = 173

 Score = 28.9 bits (65), Expect = 2.1
 Identities = 8/22 (36%), Positives = 13/22 (59%)

Query: 1  KKKKKKKKKKKKKKKKKRKKKG 22
          +KKKK +    K  +K+ K+K 
Sbjct: 2  RKKKKNRSSNYKVNRKRLKRKD 23



 Score = 28.2 bits (63), Expect = 4.1
 Identities = 8/23 (34%), Positives = 13/23 (56%)

Query: 1  KKKKKKKKKKKKKKKKKRKKKGI 23
          K +    K  +K+ K+K +KK I
Sbjct: 6  KNRSSNYKVNRKRLKRKDRKKKI 28



 Score = 27.0 bits (60), Expect = 9.5
 Identities = 8/21 (38%), Positives = 12/21 (57%)

Query: 1  KKKKKKKKKKKKKKKKKRKKK 21
          KKK +    K  +K+ KRK +
Sbjct: 4  KKKNRSSNYKVNRKRLKRKDR 24


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 29.6 bits (67), Expect = 2.1
 Identities = 13/34 (38%), Positives = 22/34 (64%), Gaps = 5/34 (14%)

Query: 4   KKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRS 37
           KK+KKK+K+K++   K +G+      KEK +D+ 
Sbjct: 627 KKRKKKRKRKRRFLTKIEGVK-----KEKRKDKK 655



 Score = 28.5 bits (64), Expect = 5.5
 Identities = 9/21 (42%), Positives = 15/21 (71%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
           K+KKK+K+K++   K +  KK
Sbjct: 628 KRKKKRKRKRRFLTKIEGVKK 648



 Score = 28.1 bits (63), Expect = 7.2
 Identities = 13/38 (34%), Positives = 16/38 (42%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
           K   K KKKKKK+K        I    S K  + D  +
Sbjct: 529 KAAVKVKKKKKKEKSIDLDDDLIDEEDSIKLDVDDEED 566



 Score = 27.7 bits (62), Expect = 9.2
 Identities = 13/28 (46%), Positives = 19/28 (67%), Gaps = 7/28 (25%)

Query: 1   KKKKKKKKKKKKK-------KKKKRKKK 21
           KK+KKK+K+K++        KK+KRK K
Sbjct: 627 KKRKKKRKRKRRFLTKIEGVKKEKRKDK 654


>gnl|CDD|205121 pfam12874, zf-met, Zinc-finger of C2H2 type.  This is a zinc-finger
           domain with the CxxCx(12)Hx(6)H motif, found in multiple
           copies in a wide range of proteins from plants to
           metazoans. Some member proteins, particularly those from
           plants, are annotated as being RNA-binding.
          Length = 25

 Score = 26.3 bits (59), Expect = 2.2
 Identities = 9/21 (42%), Positives = 11/21 (52%)

Query: 200 YRCRLCKKAFSDSSTLTKHLR 220
           + C LC   F+  S L  HLR
Sbjct: 1   FYCELCNVTFTSESQLKSHLR 21


>gnl|CDD|184535 PRK14144, PRK14144, heat shock protein GrpE; Provisional.
          Length = 199

 Score = 29.3 bits (65), Expect = 2.2
 Identities = 15/60 (25%), Positives = 28/60 (46%)

Query: 5  KKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSGMGKPPHDVADVLLSLKHAVVH 64
          K+ KK  KK K + K++  + N   +E+  + S+     +G P +   +  L+L     H
Sbjct: 3  KQNKKDWKKFKDEHKEEHKVENEILEEETDEESQHQEPALGHPSYTALEEQLTLAEQKAH 62


>gnl|CDD|238428 cd00832, CLF, Chain-length factor (CLF) is a factor required for
           polyketide chain initiation of aromatic
           antibiotic-producing polyketide synthases (PKSs) of
           filamentous bacteria. CLFs have been shown to have
           decarboxylase activity towards malonyl-acyl carrier
           protein (ACP). CLFs are similar to other elongation
           ketosynthase domains, but their active site cysteine is
           replaced by a conserved glutamine.
          Length = 399

 Score = 29.6 bits (67), Expect = 2.2
 Identities = 12/33 (36%), Positives = 16/33 (48%), Gaps = 1/33 (3%)

Query: 45  GKPPHDVADVLLSLKHAVVHP-GQMTPPPYGDP 76
           G  P DVA  LL+L+  V+ P   +T  P    
Sbjct: 335 GGAPLDVATALLALRDGVIPPTVNVTDVPPAYG 367


>gnl|CDD|221049 pfam11262, Tho2, Transcription factor/nuclear export subunit
          protein 2.  THO and TREX form a eukaryotic complex
          which functions in messenger ribonucleoprotein
          metabolism and plays a role in preventing the
          transcription-associated genetic instability. Tho2,
          along with four other subunits forms THO.
          Length = 296

 Score = 29.6 bits (67), Expect = 2.2
 Identities = 13/38 (34%), Positives = 22/38 (57%)

Query: 1  KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
          KKK+KK+ K   KK ++  KK I  N  +K+++ +   
Sbjct: 53 KKKEKKRLKSLIKKLEEELKKHIEHNEKTKKRLSEEKS 90


>gnl|CDD|219913 pfam08576, DUF1764, Eukaryotic protein of unknown function
          (DUF1764).  This is a family of eukaryotic proteins of
          unknown function. This family contains many
          hypothetical proteins.
          Length = 98

 Score = 27.8 bits (62), Expect = 2.3
 Identities = 13/32 (40%), Positives = 17/32 (53%)

Query: 1  KKKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
          KK+KKKKK+  K  + K  KKG   +    E 
Sbjct: 22 KKRKKKKKRTAKTARPKATKKGQKKDKKKDEF 53


>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
          Length = 742

 Score = 29.8 bits (67), Expect = 2.3
 Identities = 12/37 (32%), Positives = 17/37 (45%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRS 37
           K K +K KKK    K   KKK  + +S  +    D +
Sbjct: 119 KPKTEKLKKKITVNKSTNKKKKKVLSSKDELIKYDNN 155



 Score = 29.4 bits (66), Expect = 2.6
 Identities = 8/39 (20%), Positives = 11/39 (28%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSEC 39
               K   KKKKK    + +     N+  K         
Sbjct: 129 ITVNKSTNKKKKKVLSSKDELIKYDNNKPKSISIHSPLT 167


>gnl|CDD|216447 pfam01346, FKBP_N, Domain amino terminal to FKBP-type
           peptidyl-prolyl isomerase.  This family is only found at
           the amino terminus of pfam00254. This domain is of
           unknown function.
          Length = 124

 Score = 28.3 bits (64), Expect = 2.3
 Identities = 12/36 (33%), Positives = 22/36 (61%), Gaps = 2/36 (5%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGI--LANSSSKEKIQ 34
           KK + K++ K +KK ++ K  G   LA ++ KE ++
Sbjct: 79  KKLQAKQQAKAEKKAEENKAAGEAFLAENAKKEGVK 114


>gnl|CDD|219783 pfam08297, U3_snoRNA_assoc, U3 snoRNA associated.  This family of
          proteins is associated with U3 snoRNA. U3 snoRNA is
          required for nucleolar processing of pre-18S ribosomal
          RNA.
          Length = 87

 Score = 27.8 bits (62), Expect = 2.4
 Identities = 11/22 (50%), Positives = 14/22 (63%)

Query: 1  KKKKKKKKKKKKKKKKKRKKKG 22
          KKK K +  KK +K+ K  KKG
Sbjct: 32 KKKNKLRFLKKLEKRPKDVKKG 53


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 29.7 bits (67), Expect = 2.4
 Identities = 14/32 (43%), Positives = 19/32 (59%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
           +KKKKKKK+KK+ K +   K G      SK+ 
Sbjct: 71  EKKKKKKKEKKEPKSEGETKLGFKTPKKSKKT 102



 Score = 28.9 bits (65), Expect = 4.2
 Identities = 14/33 (42%), Positives = 18/33 (54%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKI 33
           KKKKKKK+KK+ K + + K        S K K 
Sbjct: 72  KKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKK 104


>gnl|CDD|240254 PTZ00069, PTZ00069, 60S ribosomal protein L5; Provisional.
          Length = 300

 Score = 29.3 bits (66), Expect = 2.4
 Identities = 18/40 (45%), Positives = 20/40 (50%), Gaps = 2/40 (5%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILAN--SSSKEKIQDRSE 38
           K KKKKKKKKK   KK + KK       +  K K   R E
Sbjct: 254 KVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRRE 293


>gnl|CDD|234707 PRK00270, rpsU, 30S ribosomal protein S21; Reviewed.
          Length = 64

 Score = 27.1 bits (61), Expect = 2.4
 Identities = 7/19 (36%), Positives = 14/19 (73%)

Query: 1  KKKKKKKKKKKKKKKKKRK 19
          K+K+KK   +K+++KK  +
Sbjct: 44 KRKRKKAAARKRRRKKLAR 62



 Score = 26.8 bits (60), Expect = 2.9
 Identities = 9/21 (42%), Positives = 15/21 (71%)

Query: 1  KKKKKKKKKKKKKKKKKRKKK 21
          K  +K+K+KK   +K++RKK 
Sbjct: 40 KPSEKRKRKKAAARKRRRKKL 60



 Score = 26.4 bits (59), Expect = 4.4
 Identities = 7/19 (36%), Positives = 14/19 (73%)

Query: 2  KKKKKKKKKKKKKKKKRKK 20
          K+K+KK   +K+++KK  +
Sbjct: 44 KRKRKKAAARKRRRKKLAR 62



 Score = 26.0 bits (58), Expect = 6.1
 Identities = 6/19 (31%), Positives = 14/19 (73%)

Query: 3  KKKKKKKKKKKKKKKRKKK 21
          K+K+KK   +K+++K+  +
Sbjct: 44 KRKRKKAAARKRRRKKLAR 62


>gnl|CDD|240317 PTZ00217, PTZ00217, flap endonuclease-1; Provisional.
          Length = 393

 Score = 29.6 bits (67), Expect = 2.5
 Identities = 12/32 (37%), Positives = 15/32 (46%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
            KK   K K KKKKKK        + +S + K
Sbjct: 353 IKKSNSKAKLKKKKKKAGASAVPKSETSQEAK 384



 Score = 28.4 bits (64), Expect = 5.5
 Identities = 16/43 (37%), Positives = 20/43 (46%), Gaps = 5/43 (11%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSG 43
           KK  KK   K K KKKK+K       +S+  K +   E   SG
Sbjct: 350 KKPIKKSNSKAKLKKKKKKA-----GASAVPKSETSQEAKSSG 387


>gnl|CDD|190760 pfam03811, Zn_Tnp_IS1, InsA N-terminal domain.  This appears to be
           a short zinc binding domain found in IS1 InsA family
           protein. It is found at the N-terminus of the protein
           and may be a DNA-binding domain.
          Length = 36

 Score = 26.3 bits (58), Expect = 2.5
 Identities = 12/33 (36%), Positives = 18/33 (54%), Gaps = 4/33 (12%)

Query: 174 CPICDRRFSQSSSVTTHMRTHSGERPYRCRLCK 206
           CP C    S +  V  H ++ +G + YRCR C+
Sbjct: 8   CPRC----SSTEGVYRHGQSTAGHQRYRCRHCR 36


>gnl|CDD|197903 smart00834, CxxC_CXXC_SSSS, Putative regulatory protein.
           CxxC_CXXC_SSSS represents a region of about 41 amino
           acids found in a number of small proteins in a wide
           range of bacteria. The region usually begins with the
           initiator Met and contains two CxxC motifs separated by
           17 amino acids. One protein in this entry has been noted
           as a putative regulatory protein, designated FmdB. Most
           proteins in this entry have a C-terminal region
           containing highly degenerate sequence.
          Length = 41

 Score = 26.4 bits (59), Expect = 2.7
 Identities = 10/34 (29%), Positives = 12/34 (35%), Gaps = 7/34 (20%)

Query: 144 YRCEDCNKSFSQAANLTAHVRTHSGEKPFRCPIC 177
           YRCEDC  +F          +         CP C
Sbjct: 6   YRCEDCGHTF------EVLQKISDDPLTT-CPEC 32


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 29.5 bits (66), Expect = 2.8
 Identities = 22/80 (27%), Positives = 33/80 (41%), Gaps = 14/80 (17%)

Query: 1  KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSGMGKPPHDVADVLLSLKH 60
          KKKK++K K+K+ KK K  +K   A   +++   D +  P     K          S K 
Sbjct: 21 KKKKEEKAKEKELKKLKAAQKEAKAKLQAQQA-SDGTNVPKKSEKK----------SRKR 69

Query: 61 AVVHPGQMTPPPYGDPGAPY 80
           V       P  + DP  P+
Sbjct: 70 DV---EDENPEDFIDPDTPF 86



 Score = 28.3 bits (63), Expect = 5.7
 Identities = 15/45 (33%), Positives = 27/45 (60%)

Query: 2  KKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSGMGK 46
          ++ ++KKKK++K K+K  KK   A   +K K+Q +    G+ + K
Sbjct: 16 EELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPK 60


>gnl|CDD|206083 pfam13912, zf-C2H2_6, C2H2-type zinc finger. 
          Length = 27

 Score = 26.0 bits (58), Expect = 2.9
 Identities = 8/23 (34%), Positives = 12/23 (52%)

Query: 144 YRCEDCNKSFSQAANLTAHVRTH 166
           + C  C K+FS    L  H ++H
Sbjct: 2   HTCGVCGKTFSSLQALGGHKKSH 24



 Score = 24.9 bits (55), Expect = 7.5
 Identities = 8/23 (34%), Positives = 11/23 (47%)

Query: 200 YRCRLCKKAFSDSSTLTKHLRIH 222
           + C +C K FS    L  H + H
Sbjct: 2   HTCGVCGKTFSSLQALGGHKKSH 24



 Score = 24.5 bits (54), Expect = 8.6
 Identities = 6/23 (26%), Positives = 12/23 (52%)

Query: 172 FRCPICDRRFSQSSSVTTHMRTH 194
             C +C + FS   ++  H ++H
Sbjct: 2   HTCGVCGKTFSSLQALGGHKKSH 24


>gnl|CDD|173534 PTZ00341, PTZ00341, Ring-infected erythrocyte surface antigen;
           Provisional.
          Length = 1136

 Score = 29.4 bits (65), Expect = 2.9
 Identities = 19/58 (32%), Positives = 33/58 (56%), Gaps = 2/58 (3%)

Query: 7   KKKKKKKKKKKRKKKGIL-ANSSSKEKIQDRSECPGSGMGKPP-HDVADVLLSLKHAV 62
           KK+ +K KKKK K++G L       E ++ + E P   +G+   ++  D+L SLK ++
Sbjct: 327 KKRAEKPKKKKSKRRGWLCCGGGDIETVEPQQEEPVQDVGEHQINEYGDILPSLKASI 384


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 29.0 bits (65), Expect = 3.1
 Identities = 7/22 (31%), Positives = 17/22 (77%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKG 22
           +KK+K++  + K+K++K ++K 
Sbjct: 413 EKKQKEQADEDKEKRQKDERKK 434


>gnl|CDD|189563 pfam00475, IGPD, Imidazoleglycerol-phosphate dehydratase. 
          Length = 145

 Score = 28.2 bits (64), Expect = 3.3
 Identities = 8/19 (42%), Positives = 11/19 (57%)

Query: 151 KSFSQAANLTAHVRTHSGE 169
           +SF+  A +T HVR   G 
Sbjct: 109 RSFAFNAGITLHVRVLYGR 127


>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein.  This family includes proteins
           related to Mpp10 (M phase phosphoprotein 10). The U3
           small nucleolar ribonucleoprotein (snoRNP) is required
           for three cleavage events that generate the mature 18S
           rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
           depletion of Mpp10, a U3 snoRNP-specific protein, halts
           18S rRNA production and impairs cleavage at the three U3
           snoRNP-dependent sites.
          Length = 613

 Score = 29.2 bits (65), Expect = 3.3
 Identities = 11/32 (34%), Positives = 18/32 (56%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
           K ++++KKK+K++  KK   K        KEK
Sbjct: 559 KNRERRKKKRKRRAAKKAVTKAKKERKIGKEK 590


>gnl|CDD|240576 cd12932, RRP7_like, RRP7 domain ribosomal RNA-processing protein 7
           (Rrp7p), ribosomal RNA-processing protein 7 homolog A
           (Rrp7A), and similar proteins.  This CD corresponds to
           the RRP7 domain of Rrp7p and Rrp7A. Rrp7p is encoded by
           YCL031C gene from Saccharomyces cerevisiae. It is an
           essential yeast protein involved in pre-rRNA processing
           and ribosome assembly, and is speculated to be required
           for correct assembly of rpS27 into the pre-ribosomal
           particle. Rrp7A, also termed gastric cancer antigen
           Zg14, is the Rrp7p homolog mainly found in Metazoans.
           The cellular function of Rrp7A remains unclear
           currently. Both Rrp7p and Rrp7A harbor an N-terminal RNA
           recognition motif (RRM), also termed RBD (RNA binding
           domain) or RNP (ribonucleoprotein domain), and a
           C-terminal RRP7 domain.
          Length = 118

 Score = 27.6 bits (62), Expect = 3.4
 Identities = 15/42 (35%), Positives = 23/42 (54%), Gaps = 15/42 (35%)

Query: 1   KKKKKKKKKKKKKKKKK-------RKKKGILANSSSKEKIQD 35
            K K+K+KKKKKKK+ +       R+KK        KE++ +
Sbjct: 73  AKAKEKEKKKKKKKELEDFYRFQIREKK--------KEELAE 106



 Score = 27.2 bits (61), Expect = 6.0
 Identities = 11/33 (33%), Positives = 18/33 (54%), Gaps = 7/33 (21%)

Query: 1  KKKKKKK-------KKKKKKKKKKRKKKGILAN 26
          +  +K K       + K K+K+KK+KKK  L +
Sbjct: 58 RGGRKGKTAREEAVEAKAKEKEKKKKKKKELED 90


>gnl|CDD|217553 pfam03430, TATR, Trans-activating transcriptional regulator.  This
           family of trans-activating transcriptional regulator
           (TATR), also known as intermediate early protein 1, are
           common to the Nucleopolyhedroviruses.
          Length = 571

 Score = 29.1 bits (65), Expect = 3.4
 Identities = 13/41 (31%), Positives = 19/41 (46%)

Query: 4   KKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSGM 44
           K KK   K KK++K KK  I + +S  E+    +E      
Sbjct: 143 KGKKLVNKPKKRQKYKKATIQSPTSLTEEENYNTEICTVAP 183


>gnl|CDD|219621 pfam07890, Rrp15p, Rrp15p.  Rrp15p is required for the formation
          of 60S ribosomal subunits.
          Length = 132

 Score = 28.1 bits (63), Expect = 3.5
 Identities = 11/25 (44%), Positives = 16/25 (64%)

Query: 1  KKKKKKKKKKKKKKKKKRKKKGILA 25
          KK  K KKK K +K +K+ K+ + A
Sbjct: 28 KKLLKAKKKLKSEKLEKKAKRQLRA 52



 Score = 26.6 bits (59), Expect = 9.2
 Identities = 10/35 (28%), Positives = 20/35 (57%), Gaps = 4/35 (11%)

Query: 2  KKKKKKKKKKKKKKKKRKKKG----ILANSSSKEK 32
          K +KK K++ + +K++  +KG    +L     KE+
Sbjct: 41 KLEKKAKRQLRAEKRQALEKGRVKPVLPEDLEKER 75


>gnl|CDD|176075 cd08693, C2_PI3K_class_I_beta_delta, C2 domain present in class I
           beta and delta phosphatidylinositol 3-kinases (PI3Ks).
           PI3Ks (AKA phosphatidylinositol (PtdIns) 3-kinases)
           regulate cell processes such as cell growth,
           differentiation, proliferation, and motility.  PI3Ks
           work on phosphorylation of phosphatidylinositol,
           phosphatidylinositide (4)P (PtdIns (4)P),2 or
           PtdIns(4,5)P2. Specifically they phosphorylate the D3
           hydroxyl group of phosphoinositol lipids on the inositol
           ring. There are 3 classes of PI3Ks based on structure,
           regulation, and specificity. All classes contain a C2
           domain, a PIK domain, and a kinase catalytic domain.
           The members here are class I, beta and delta isoforms of
           PI3Ks and contain both a Ras-binding domain and a
           p85-binding domain.  Class II PI3Ks contain both of
           these as well as a PX domain, and a C-terminal C2 domain
           containing a nuclear localization signal.  C2 domains
           fold into an 8-standed beta-sandwich that can adopt 2
           structural arrangements: Type I and Type II,
           distinguished by a circular permutation involving their
           N- and C-terminal beta strands. Many C2 domains are
           Ca2+-dependent membrane-targeting modules that bind a
           wide variety of substances including bind phospholipids,
           inositol polyphosphates, and intracellular proteins.
           Most C2 domain proteins are either signal transduction
           enzymes that contain a single C2 domain, such as protein
           kinase C, or membrane trafficking proteins which contain
           at least two C2 domains, such as synaptotagmin 1.
           However, there are a few exceptions to this including
           RIM isoforms and some splice variants of piccolo/aczonin
           and intersectin which only have a single C2 domain.  C2
           domains with a calcium binding region have negatively
           charged residues, primarily aspartates, that serve as
           ligands for calcium ions.  Members have a type-I
           topology.
          Length = 173

 Score = 28.4 bits (64), Expect = 3.5
 Identities = 10/19 (52%), Positives = 14/19 (73%)

Query: 1   KKKKKKKKKKKKKKKKKRK 19
           KK K K+ +K + KKKK+K
Sbjct: 87  KKAKGKRSRKNQTKKKKKK 105



 Score = 27.7 bits (62), Expect = 6.8
 Identities = 10/23 (43%), Positives = 14/23 (60%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGI 23
             KK K K+ +K + KK+KKK  
Sbjct: 85  VSKKAKGKRSRKNQTKKKKKKDD 107


>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4;
          Provisional.
          Length = 191

 Score = 28.6 bits (64), Expect = 3.5
 Identities = 12/33 (36%), Positives = 14/33 (42%)

Query: 1  KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKI 33
           KKKK   +KKK  KK +KKK            
Sbjct: 51 SKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAE 83


>gnl|CDD|192292 pfam09429, Wbp11, WW domain binding protein 11.  The WW domain is
          a small protein module with a triple-stranded
          beta-sheet fold. This is a family of WW domain binding
          proteins.
          Length = 78

 Score = 26.9 bits (60), Expect = 3.6
 Identities = 8/37 (21%), Positives = 20/37 (54%), Gaps = 2/37 (5%)

Query: 1  KKKKKKKKKKKKKKKKKRKKKGILAN--SSSKEKIQD 35
          K++KKK+ KK K +++ R++  +        + ++  
Sbjct: 12 KEQKKKELKKNKAERQARREAKLAKKNPDRLQRELDK 48



 Score = 26.1 bits (58), Expect = 8.2
 Identities = 8/18 (44%), Positives = 14/18 (77%)

Query: 3  KKKKKKKKKKKKKKKRKK 20
          +K++KKK+ KK K +R+ 
Sbjct: 11 RKEQKKKELKKNKAERQA 28


>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
           Validated.
          Length = 860

 Score = 29.0 bits (65), Expect = 3.7
 Identities = 11/30 (36%), Positives = 15/30 (50%)

Query: 3   KKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
           +KK K KKK    K +  K +   S +K K
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSK 777



 Score = 28.6 bits (64), Expect = 4.8
 Identities = 11/31 (35%), Positives = 14/31 (45%)

Query: 2   KKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
           K KKK    K K  K  KKK    +  + +K
Sbjct: 752 KAKKKSASTKGKAAKTVKKKSKAKSKKTTKK 782



 Score = 27.9 bits (62), Expect = 8.8
 Identities = 10/32 (31%), Positives = 16/32 (50%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
           K KKK    K K  K  +KK    +  ++K++
Sbjct: 752 KAKKKSASTKGKAAKTVKKKSKAKSKKTTKKR 783



 Score = 27.9 bits (62), Expect = 9.2
 Identities = 9/31 (29%), Positives = 13/31 (41%)

Query: 4   KKKKKKKKKKKKKKRKKKGILANSSSKEKIQ 34
           +KK K KKK    K K    +   S  +  +
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSKK 778


>gnl|CDD|221569 pfam12420, DUF3671, Protein of unknown function.  This domain
          family is found in eukaryotes, and is typically between
          96 and 116 amino acids in length.
          Length = 106

 Score = 27.7 bits (62), Expect = 3.8
 Identities = 11/24 (45%), Positives = 13/24 (54%)

Query: 1  KKKKKKKKKKKKKKKKKRKKKGIL 24
          K KK     KKK KK   KK G++
Sbjct: 25 KLKKDPNIDKKKFKKIIFKKYGLI 48


>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
           homolog [Translation, ribosomal structure and
           biogenesis].
          Length = 395

 Score = 28.9 bits (65), Expect = 3.9
 Identities = 8/22 (36%), Positives = 17/22 (77%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKG 22
           K++  + ++KKK+KK K +++G
Sbjct: 371 KERPGRYRRKKKEKKAKSERRG 392



 Score = 28.1 bits (63), Expect = 6.3
 Identities = 7/21 (33%), Positives = 15/21 (71%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
           + KK++  + ++KKK+K+ K 
Sbjct: 368 RDKKERPGRYRRKKKEKKAKS 388



 Score = 27.7 bits (62), Expect = 7.4
 Identities = 8/22 (36%), Positives = 14/22 (63%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKG 22
            K K ++ KK++  + +RKKK 
Sbjct: 362 TKAKPERDKKERPGRYRRKKKE 383



 Score = 27.7 bits (62), Expect = 7.8
 Identities = 6/22 (27%), Positives = 14/22 (63%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKG 22
           ++  + ++KKK+KK K  ++  
Sbjct: 372 ERPGRYRRKKKEKKAKSERRGL 393



 Score = 27.7 bits (62), Expect = 8.0
 Identities = 7/21 (33%), Positives = 16/21 (76%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
           KK++  + ++KKK+KK + ++
Sbjct: 370 KKERPGRYRRKKKEKKAKSER 390



 Score = 27.7 bits (62), Expect = 9.3
 Identities = 8/21 (38%), Positives = 15/21 (71%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
            KK++  + ++KKK+KK K +
Sbjct: 369 DKKERPGRYRRKKKEKKAKSE 389


>gnl|CDD|234533 TIGR04285, nucleoid_noc, nucleoid occlusion protein.  This model
           describes nucleoid occlusion protein, a close homolog to
           ParB chromosome partitioning proteins including Spo0J in
           Bacillus subtilis. Its gene often is located near the
           gene for the Spo0J ortholog. This protein bind a
           specific DNA sequence and blocks cytokinesis from
           happening until chromosome segregation is complete.
          Length = 255

 Score = 28.6 bits (65), Expect = 3.9
 Identities = 11/21 (52%), Positives = 14/21 (66%)

Query: 3   KKKKKKKKKKKKKKKRKKKGI 23
           KK  +K +K KKKKKR+K   
Sbjct: 192 KKLLEKPEKPKKKKKRRKGFS 212



 Score = 27.9 bits (63), Expect = 6.1
 Identities = 11/19 (57%), Positives = 14/19 (73%)

Query: 2   KKKKKKKKKKKKKKKKRKK 20
           KK  +K +K KKKKK+RK 
Sbjct: 192 KKLLEKPEKPKKKKKRRKG 210


>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor.  The
           spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
           U5) and proteins, catalyzes the excision of introns from
           pre-mRNAs in two successive trans-esterification
           reactions. Step 2 depends upon integral spliceosome
           constituents such as U5 snRNA and Prp8 and
           non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
           ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
           conformational change in the spliceosome that leads to
           protection of the 3'ss from targeted RNase H cleavage.
           This change, which probably reflects binding of the 3'ss
           PyAG in the catalytic centre of the spliceosome,
           requires the ordered recruitment of Slu7, Prp18, and
           Prp22 to the spliceosome. There is a close functional
           relationship between Prp8, Prp18, and Slu7, and Prp18
           interacts with Slu7, so that together they recruit Prp22
           to the spliceosome. Most members of the family carry a
           zinc-finger of the CCHC-type upstream of this domain.
          Length = 236

 Score = 28.6 bits (64), Expect = 4.0
 Identities = 9/23 (39%), Positives = 17/23 (73%)

Query: 3   KKKKKKKKKKKKKKKRKKKGILA 25
             +KK+K+KK++ K +KK+ +L 
Sbjct: 160 LLRKKEKEKKEQLKIQKKQSLLE 182


>gnl|CDD|130324 TIGR01257, rim_protein, retinal-specific rim ABC transporter.  This
            model describes the photoreceptor protein (rim protein)
            in eukaryotes. It is the member of ABC transporter
            superfamily. Rim protein is a membrane glycoprotein which
            is localized in the photoreceptor outer segment discs.
            Mutation/s in its genetic loci is implicated in the
            recessive Stargardt's disease [Transport and binding
            proteins, Other].
          Length = 2272

 Score = 29.2 bits (65), Expect = 4.1
 Identities = 22/89 (24%), Positives = 25/89 (28%), Gaps = 32/89 (35%)

Query: 30   KEKIQDRSEC--PGSGMGKPPHDVADVLLSLKHAVVHPGQMTPPPYGDPGAPYQVGAPQQ 87
            +E    R  C  P    G+ P                PGQ    P G P           
Sbjct: 1294 RENANLRHPCSGPTEKAGQTPQA---------SHTCSPGQPAAHPEGQPP---------- 1334

Query: 88   FPTASLSYTVHPHQSSRGRPLNSGIRLIL 116
                       P     G PLN+G RLIL
Sbjct: 1335 -----------PEPEDPGVPLNTGARLIL 1352


>gnl|CDD|212564 cd11674, lambda-1, inner capsid protein lambda-1 or VP3.  The
           reovirus inner capsid protein lambda-1 displays
           nucleoside triphosphate phosphohydrolase (NTPase),
           RNA-5'-triphosphatase (RTPase), and RNA helicase
           activity and may play a role in the transcription of the
           virus genome, the unwinding or reannealing of
           double-stranded RNA during RNA synthesis. The RTPase
           activity constitutes the first step in the capping of
           RNA, resulting in a 5'-diphosphorylated RNA plus-strand.
           lambda1 is an Orthoreovirus core protein, VP3 is the
           homologous core protein in Aquareoviruses.
          Length = 1166

 Score = 29.0 bits (65), Expect = 4.6
 Identities = 9/25 (36%), Positives = 12/25 (48%), Gaps = 1/25 (4%)

Query: 172 FRCPICDRRFSQSSSVTTHMRT-HS 195
           + C +C   FS    +  H RT HS
Sbjct: 74  YVCNVCMAEFSSMDQLAEHQRTTHS 98



 Score = 28.6 bits (64), Expect = 6.1
 Identities = 11/25 (44%), Positives = 11/25 (44%), Gaps = 1/25 (4%)

Query: 144 YRCEDCNKSFSQAANLTAHVRT-HS 167
           Y C  C   FS    L  H RT HS
Sbjct: 74  YVCNVCMAEFSSMDQLAEHQRTTHS 98


>gnl|CDD|222379 pfam13786, DUF4179, Domain of unknown function (DUF4179). 
          Length = 93

 Score = 26.9 bits (60), Expect = 4.8
 Identities = 10/15 (66%), Positives = 12/15 (80%)

Query: 11 KKKKKKKRKKKGILA 25
          KK KKK++KKK I A
Sbjct: 1  KKVKKKRKKKKKIAA 15


>gnl|CDD|221755 pfam12756, zf-C2H2_2, C2H2 type zinc-finger (2 copies).  This
           family contains two copies of a C2H2-like zinc finger
           domain.
          Length = 100

 Score = 27.2 bits (61), Expect = 4.8
 Identities = 6/21 (28%), Positives = 10/21 (47%)

Query: 172 FRCPICDRRFSQSSSVTTHMR 192
             C  C ++F    ++  HMR
Sbjct: 51  NECLYCGKQFKSLEALRQHMR 71



 Score = 26.8 bits (60), Expect = 5.4
 Identities = 23/84 (27%), Positives = 29/84 (34%), Gaps = 17/84 (20%)

Query: 174 CPICDRRFSQSSSVTTHMR-THS---GERPYRCRLCKKAFSDSSTLTKHLR--IHSGEKP 227
           C  C+           HM  +H     ER Y          D   L  +LR  IH G   
Sbjct: 2   CLFCNHTSDTVEENLEHMFKSHGFFIPEREY--------LVDLEGLLNYLREKIHEG--- 50

Query: 228 YQCKLCLLRFSQSGNLNRHMRVHG 251
            +C  C  +F     L +HMR  G
Sbjct: 51  NECLYCGKQFKSLEALRQHMRDKG 74


>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1.  Members
           of this family are necessary for accurate chromosome
           transmission during cell division.
          Length = 804

 Score = 28.6 bits (64), Expect = 5.0
 Identities = 11/59 (18%), Positives = 14/59 (23%), Gaps = 10/59 (16%)

Query: 59  KHAVVHPGQMTPPPYGDPGAPYQVGAPQQFPTASLSYT---------VHPHQSSRGRPL 108
               V P  M P     P        P  +P     +            P Q+    PL
Sbjct: 175 PPQQVLPQGMPPRQAAFPQQGPP-EQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPL 232


>gnl|CDD|234402 TIGR03928, T7_EssCb_Firm, type VII secretion protein EssC,
           C-terminal domain.  This model describes the C-terminal
           domain, or longer subunit, of the Firmicutes type VII
           secretion protein EssC. This protein (homologous to EccC
           in Actinobacteria) and the WXG100 target proteins are
           the only homologous parts of type VII secretion between
           Firmicutes and Actinobacteria [Protein fate, Protein and
           peptide secretion and trafficking].
          Length = 1296

 Score = 28.4 bits (64), Expect = 5.4
 Identities = 11/35 (31%), Positives = 19/35 (54%), Gaps = 2/35 (5%)

Query: 4   KKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
           ++KKK KK  +K+ R  +  L      +++Q  SE
Sbjct: 84  REKKKYKKDVEKRNRSYRLYLDKKR--KELQALSE 116


>gnl|CDD|153419 cd07914, IGPD, Imidazoleglycerol-phosphate dehydratase.
           Imidazoleglycerol-phosphate dehydratase (IGPD; EC
           4.2.1.19) catalyzes the dehydration of imidazole
           glycerol phosphate to imidazole acetol phosphate, the
           sixth step of histidine biosynthesis in plants and
           microorganisms where the histidine is synthesized de
           novo. There is an internal repeat in the protein domain
           that is related by pseudo-dyad symmetry, perhaps as a
           result of an ancient gene duplication. The apo-form of
           IGPD exists as a catalytically inactive trimer which, in
           the presence of specific divalent metal cations such as
           manganese (Mn2+), cobalt (Co2+), cadmium (Cd2+), nickel
           (Ni2+), iron (Fe2+) and zinc (Zn2+), assembles to form a
           biologically active high molecular weight metalloenzyme;
           a 24-mer with 4-3-2 symmetry. Each 24-mer has 24 active
           sites, and contains around 1.5 metal ions per monomer,
           each monomer contributing residues to three separate
           active sites. IGPD enzymes are monofunctional in fungi,
           plants, archaea and some eubacteria while they are
           encoded as bifunctional enzymes in other eubacteria,
           such that the enzyme is fused to histidinol-phosphate
           phosphatase, the penultimate enzyme of the histidine
           biosynthesis pathway. The histidine biosynthesis pathway
           is a potential target for development of herbicides, and
           IGPD is a target for the triazole phosphonate
           herbicides.
          Length = 190

 Score = 27.7 bits (63), Expect = 5.5
 Identities = 7/19 (36%), Positives = 11/19 (57%)

Query: 151 KSFSQAANLTAHVRTHSGE 169
           +SF+  A +T H+R   G 
Sbjct: 136 RSFANNAGITLHIRVLYGR 154


>gnl|CDD|219572 pfam07780, Spb1_C, Spb1 C-terminal domain.  This presumed domain is
           found at the C-terminus of a family of FtsJ-like
           methyltransferases. Members of this family are involved
           in 60S ribosomal biogenesis.
          Length = 212

 Score = 28.0 bits (63), Expect = 5.5
 Identities = 17/40 (42%), Positives = 20/40 (50%), Gaps = 4/40 (10%)

Query: 1   KKKKKKKKKKKKK----KKKKRKKKGILANSSSKEKIQDR 36
           KK KKKKKK+KK     KK  + K G       K K+ D 
Sbjct: 158 KKAKKKKKKRKKTYVVAKKGNKGKAGRPKGVKGKYKVVDP 197


>gnl|CDD|204122 pfam09072, TMA7, Translation machinery associated TMA7.  TMA7
          plays a role in protein translation. Deletions of the
          TMA7 gene results in altered protein synthesis rates.
          Length = 63

 Score = 26.0 bits (57), Expect = 5.6
 Identities = 11/30 (36%), Positives = 16/30 (53%)

Query: 1  KKKKKKKKKKKKKKKKKRKKKGILANSSSK 30
          K+K+K++ K  K+   K K KG L     K
Sbjct: 29 KQKQKEEAKALKELAAKAKGKGPLGGGGIK 58


>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family.  This
           family includes gbp a protein from Soybean that binds to
           GAGA element dinucleotide repeat DNA. It seems likely
           that the this domain mediates DNA binding. This putative
           domain contains several conserved cysteines and a
           histidine suggesting this may be a zinc-binding DNA
           interaction domain.
          Length = 301

 Score = 28.3 bits (63), Expect = 5.8
 Identities = 13/37 (35%), Positives = 16/37 (43%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRS 37
           KK +  K  K  K KK +KK  +   S     I  RS
Sbjct: 151 KKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDPRS 187


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 28.5 bits (64), Expect = 5.9
 Identities = 8/21 (38%), Positives = 16/21 (76%)

Query: 1   KKKKKKKKKKKKKKKKKRKKK 21
           +K+  K++K+ +K KK+ +KK
Sbjct: 559 EKEALKEQKRLRKLKKQEEKK 579


>gnl|CDD|204841 pfam12171, zf-C2H2_jaz, Zinc-finger double-stranded RNA-binding.
           This domain family is found in archaea and eukaryotes,
           and is approximately 30 amino acids in length. The
           mammalian members of this group occur multiple times
           along the protein, joined by flexible linkers, and are
           referred to as JAZ - dsRNA-binding ZF protein -
           zinc-fingers. The JAZ proteins are expressed in all
           tissues tested and localise in the nucleus, particularly
           the nucleolus. JAZ preferentially binds to
           double-stranded (ds) RNA or RNA/DNA hybrids rather than
           DNA. In addition to binding double-stranded RNA, these
           zinc-fingers are required for nucleolar localisation.
          Length = 27

 Score = 24.8 bits (55), Expect = 6.0
 Identities = 6/23 (26%), Positives = 13/23 (56%)

Query: 171 PFRCPICDRRFSQSSSVTTHMRT 193
            F C  CD+ F   +++  H+++
Sbjct: 1   QFYCVACDKYFKSENALENHLKS 23



 Score = 24.5 bits (54), Expect = 8.7
 Identities = 6/23 (26%), Positives = 11/23 (47%)

Query: 143 PYRCEDCNKSFSQAANLTAHVRT 165
            + C  C+K F     L  H+++
Sbjct: 1   QFYCVACDKYFKSENALENHLKS 23


>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
           component YidC; Validated.
          Length = 429

 Score = 28.3 bits (63), Expect = 6.3
 Identities = 12/52 (23%), Positives = 23/52 (44%), Gaps = 4/52 (7%)

Query: 2   KKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSGMGKPPHDVAD 53
           +     + + ++   K KKKG++  S +++   +  E  GS    PP   A 
Sbjct: 369 RAAAMARARARRAAVKAKKKGLIDASPNEDTPSENEESKGS----PPQVEAT 416


>gnl|CDD|201461 pfam00829, Ribosomal_L21p, Ribosomal prokaryotic L21 protein. 
          Length = 96

 Score = 26.7 bits (60), Expect = 6.4
 Identities = 9/21 (42%), Positives = 13/21 (61%)

Query: 2  KKKKKKKKKKKKKKKKRKKKG 22
          + KK    K K++K  RKK+G
Sbjct: 68 RGKKVIVFKYKRRKNYRKKQG 88


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
          entry is characterized by proteins with alternating
          conserved and low-complexity regions. Bud13 together
          with Snu17p and a newly identified factor,
          Pml1p/Ylr016c, form a novel trimeric complex. called
          The RES complex, pre-mRNA retention and splicing
          complex. Subunits of this complex are not essential for
          viability of yeasts but they are required for efficient
          splicing in vitro and in vivo. Furthermore,
          inactivation of this complex causes pre-mRNA leakage
          from the nucleus. Bud13 contains a unique,
          phylogenetically conserved C-terminal region of unknown
          function.
          Length = 141

 Score = 27.3 bits (61), Expect = 6.9
 Identities = 7/21 (33%), Positives = 20/21 (95%)

Query: 1  KKKKKKKKKKKKKKKKKRKKK 21
          +K+++K+++K++K++K+ K+K
Sbjct: 16 EKREEKEREKEEKERKEEKEK 36


>gnl|CDD|236413 PRK09210, PRK09210, RNA polymerase sigma factor RpoD; Validated.
          Length = 367

 Score = 28.0 bits (63), Expect = 6.9
 Identities = 13/39 (33%), Positives = 18/39 (46%), Gaps = 4/39 (10%)

Query: 1  KKKKKKKKKKKKKKKK----KRKKKGILANSSSKEKIQD 35
          KK KKK+    ++  K    K KK+G L      EK+  
Sbjct: 1  KKTKKKETTTLEQVVKELIEKGKKRGTLTYDEIAEKLIP 39


>gnl|CDD|185507 PTZ00191, PTZ00191, 60S ribosomal protein L23a; Provisional.
          Length = 145

 Score = 27.4 bits (61), Expect = 7.0
 Identities = 11/21 (52%), Positives = 13/21 (61%)

Query: 1  KKKKKKKKKKKKKKKKKRKKK 21
          KKK KK  K  KK  K +K+K
Sbjct: 8  KKKAKKAAKAAKKGVKVKKRK 28


>gnl|CDD|240380 PTZ00362, PTZ00362, hypothetical protein; Provisional.
          Length = 479

 Score = 28.2 bits (63), Expect = 7.1
 Identities = 15/34 (44%), Positives = 19/34 (55%), Gaps = 2/34 (5%)

Query: 1  KKKKKKKKKKKKKKK--KKRKKKGILANSSSKEK 32
          KKKKKKKKK  K+     +  KK I   + S+ K
Sbjct: 9  KKKKKKKKKASKESSNGSQNSKKTIFMYNLSRRK 42


>gnl|CDD|221857 pfam12923, RRP7, Ribosomal RNA-processing protein 7 (RRP7).  RRP7
          is an essential protein in yeast that is involved in
          pre-rRNA processing and ribosome assembly. It is
          speculated to be required for correct assembly of rpS27
          into the pre-ribosomal particle.
          Length = 131

 Score = 26.8 bits (60), Expect = 7.7
 Identities = 9/23 (39%), Positives = 17/23 (73%)

Query: 2  KKKKKKKKKKKKKKKKRKKKGIL 24
          + K  ++++K K+KKK+KKK + 
Sbjct: 68 RNKAAEERRKLKEKKKKKKKELE 90


>gnl|CDD|181614 PRK09027, PRK09027, cytidine deaminase; Provisional.
          Length = 295

 Score = 27.9 bits (63), Expect = 7.7
 Identities = 15/45 (33%), Positives = 20/45 (44%), Gaps = 9/45 (20%)

Query: 83  GAPQQFPTASLSYTVHPHQSS------RGRPLNSGIRLILIQFIP 121
           GA  +F  A+L  TVH  QS+      RG      I  I + + P
Sbjct: 87  GANMEFAGAALQQTVHAEQSAISHAWLRGE---KAIADITVNYTP 128


>gnl|CDD|218550 pfam05311, Baculo_PP31, Baculovirus 33KDa late protein (PP31).
           Autographa californica nuclear polyhedrosis virus
           (AcMNPV) pp31 is a nuclear phosphoprotein that
           accumulates in the virogenic stroma, which is the viral
           replication centre in the infected-cell nucleus, binds
           to DNA, and serves as a late expression factor.
          Length = 267

 Score = 27.7 bits (62), Expect = 7.9
 Identities = 11/27 (40%), Positives = 13/27 (48%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANS 27
           K+K  KK  KK    KKRK+K      
Sbjct: 226 KRKAPKKVAKKVAAAKKRKQKRSPPPP 252


>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein.  This family
           consists of various plant development proteins which are
           homologues of floricaula (FLO) and Leafy (LFY) proteins
           which are floral meristem identity proteins. Mutations
           in the sequences of these proteins affect flower and
           leaf development.
          Length = 382

 Score = 27.7 bits (62), Expect = 8.0
 Identities = 11/42 (26%), Positives = 21/42 (50%)

Query: 2   KKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSG 43
                +KKK++KK+++++ K +  +    E   D  E  G G
Sbjct: 177 HSSDSEKKKQRKKQRRKRSKELREDDDDDEDEDDDGEGGGEG 218


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 28.0 bits (62), Expect = 8.1
 Identities = 18/107 (16%), Positives = 31/107 (28%), Gaps = 11/107 (10%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSGMGKPPHDVADVLLSLKH 60
              K+     +K++  +             ++ +  +  P S     P  V         
Sbjct: 373 HHPKRASLPTRKRRSARHAATPFARGPGGDDQTRPAAPVPASVPTPAPTPVP-------- 424

Query: 61  AVVHPGQMTPPPYGDPG---APYQVGAPQQFPTASLSYTVHPHQSSR 104
           A   P   TP P  +PG    P      Q    A+      P  ++R
Sbjct: 425 ASAPPPPATPLPSAEPGSDDGPAPPPERQPPAPATEPAPDDPDDATR 471


>gnl|CDD|224119 COG1198, PriA, Primosomal protein N' (replication factor Y) -
           superfamily II helicase [DNA replication, recombination,
           and repair].
          Length = 730

 Score = 28.1 bits (63), Expect = 8.1
 Identities = 14/62 (22%), Positives = 16/62 (25%), Gaps = 25/62 (40%)

Query: 172 FRCPICDRRFSQSSSVTTHMRTHSGERPYRCRLCKKAFSDSSTLTKHLRIHSGEKPYQCK 231
             CP CD      S +T H  T       RC  C                +    P  C 
Sbjct: 445 AECPNCD------SPLTLHKATG----QLRCHYCG---------------YQEPIPQSCP 479

Query: 232 LC 233
            C
Sbjct: 480 EC 481


>gnl|CDD|233352 TIGR01310, L7, 60S ribosomal protein L7, eukaryotic.  This model
          describes the eukaryotic 60S (cytosolic) ribosomal
          protein L7 and paralogs that may or may not also be L7.
          Human, Drosophila, and Arabidopsis all have both a
          typical L7 and an L7-related paralog. This family is
          designated subfamily rather than equivalog to reflect
          these uncharacterized paralogs. Members of this family
          average ~ 250 residues in length, somewhat longer than
          the archaeal L30P/L7E homolog (~ 155 residues) and much
          longer than the related bacterial/organellar form (~ 60
          residues).
          Length = 235

 Score = 27.3 bits (61), Expect = 8.2
 Identities = 6/27 (22%), Positives = 15/27 (55%)

Query: 6  KKKKKKKKKKKKRKKKGILANSSSKEK 32
          KK+K +++   +  K+      ++K+K
Sbjct: 5  KKRKARQELAVQVAKQAKAKKKANKKK 31


>gnl|CDD|133125 cd06594, GH31_glucosidase_YihQ, YihQ is a bacterial
           alpha-glucosidase with a conserved glycosyl hydrolase
           family 31 (GH31) domain that catalyzes the release of an
           alpha-glucosyl residue from the non-reducing end of
           alpha-glucoside substrates such as alpha-glucosyl
           fluoride. Orthologs of YihQ that have not yet been
           functionally characterized are present in plants and
           fungi. YihQ has sequence similarity to other GH31
           enzymes such as CtsZ, a 6-alpha-glucosyltransferase from
           Bacillus globisporus, and YicI, an alpha-xylosidase from
           Echerichia coli. In bacteria, YihQ (along with YihO) is
           important for bacterial O-antigen capsule assembly and
           translocation.
          Length = 317

 Score = 27.7 bits (62), Expect = 8.2
 Identities = 9/23 (39%), Positives = 14/23 (60%)

Query: 179 RRFSQSSSVTTHMRTHSGERPYR 201
            R+++ ++ T  MRTH G RP  
Sbjct: 280 LRWAEMAAFTPVMRTHEGNRPDD 302


>gnl|CDD|227562 COG5237, PER1, Predicted membrane protein [Function unknown].
          Length = 319

 Score = 27.5 bits (61), Expect = 8.2
 Identities = 18/70 (25%), Positives = 27/70 (38%), Gaps = 9/70 (12%)

Query: 200 YRC---RLCKKAFSDSSTL---TKHLRIHSGEK-PYQCKLCLLRFSQSGNLNRHMRVHGS 252
            +C     C    +D++      K        K  Y C L  L+F+ SGN+  + R HG 
Sbjct: 32  GKCFENSRCNLNKTDTNMFPLVDKLFGWDCDSKCGYMCHLLCLKFTNSGNIKIYQR-HGK 90

Query: 253 QYNMLPVIYM 262
                 V+ M
Sbjct: 91  WG-FQRVLGM 99


>gnl|CDD|220376 pfam09745, DUF2040, Coiled-coil domain-containing protein 55
          (DUF2040).  This entry is a conserved domain of
          approximately 130 residues of proteins conserved from
          fungi to humans. The proteins do contain a coiled-coil
          domain, but the function is unknown.
          Length = 128

 Score = 26.6 bits (59), Expect = 8.7
 Identities = 10/41 (24%), Positives = 24/41 (58%), Gaps = 3/41 (7%)

Query: 1  KKKKKKKKKKKKKKKKKRKKK---GILANSSSKEKIQDRSE 38
            K++KK+ K  ++K+ RK K    +L  +  +++ ++ +E
Sbjct: 29 AAKEEKKQAKLSERKENRKPKYIGSLLEAAERRKREREIAE 69


>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
           eukaryotic snRNP [Transcription].
          Length = 564

 Score = 27.7 bits (62), Expect = 9.2
 Identities = 10/18 (55%), Positives = 13/18 (72%)

Query: 1   KKKKKKKKKKKKKKKKKR 18
            K KKKK+KKK+  +K R
Sbjct: 425 LKSKKKKRKKKEWFEKFR 442


>gnl|CDD|220708 pfam10349, WWbp, WW-domain ligand protein.  The WWbp domain is
           characterized by several short PY and PT-like motifs of
           the PPPPY form. These appear to bind directly to the WW
           domains of WWP1 and WWP2 and other such diverse proteins
           as dystrophin and YAP (Yes-associated protein). This is
           the WW-domain binding protein WWbp via PY and PY_like
           motifs. The presence of a phosphotyrosine residue in the
           pWBP-1 peptide abolishes WW domain binding which
           suggests a potential regulatory role for tyrosine
           phosphorylation in modulating WW domain-ligand
           interactions. Given the likelihood that WWP1 and WWP2
           function as E3 ubiquitin-protein ligases, it is possible
           that initial substrate-specific recognition occurs via
           WW domain-substrate protein interaction followed by
           ubiquitin transfer and subsequent proteolysis. This
           domain lies just downstream of the GRAM (pfam02893) in
           many members.
          Length = 111

 Score = 26.6 bits (59), Expect = 9.3
 Identities = 14/45 (31%), Positives = 16/45 (35%), Gaps = 5/45 (11%)

Query: 61  AVVHPGQMTPPPYGDPGAP--YQVGAPQQFPTASLSYTVHPHQSS 103
           A VH      P YG   AP  Y    P  +P     Y   P  +S
Sbjct: 52  AYVHLEP--LPAYGQYAAPPPYGP-PPPYYPAPPGVYPTPPPPNS 93


>gnl|CDD|219748 pfam08214, KAT11, Histone acetylation protein.  Histone acetylation
           is required in many cellular processes including
           transcription, DNA repair, and chromatin assembly. This
           family contains the fungal KAT11 protein (previously
           known as RTT109) which is required for H3K56
           acetylation. Loss of KAT11 results in the loss of H3K56
           acetylation, both on bulk histone and on chromatin.
           KAT11 and H3K56 acetylation appear to correlate with
           actively transcribed genes and associate with the
           elongating form of Pol II in yeast. This family also
           incorporates the p300/CBP histone acetyltransferase
           domain which has different catalytic properties and
           cofactor regulation to KAT11.
          Length = 341

 Score = 27.4 bits (61), Expect = 9.5
 Identities = 12/57 (21%), Positives = 21/57 (36%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSGMGKPPHDVADVLLS 57
             K    KK+ K+ KK    +     S +   +Q+  +     M K   D   + L+
Sbjct: 274 PPKLVVSKKQYKRAKKAITGEEYDTLSGAASLVQNLMDKLYETMLKKKEDFIVIRLN 330


>gnl|CDD|221712 pfam12687, DUF3801, Protein of unknown function (DUF3801).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 158 and 187 amino
           acids in length. This family includes the PcfB protein.
          Length = 137

 Score = 26.8 bits (60), Expect = 9.5
 Identities = 10/20 (50%), Positives = 15/20 (75%)

Query: 2   KKKKKKKKKKKKKKKKRKKK 21
           K+++  KKK++K K K KKK
Sbjct: 118 KEEQAAKKKERKDKVKNKKK 137


>gnl|CDD|240578 cd12951, RRP7_Rrp7A, RRP7 domain ribosomal RNA-processing protein
          7 homolog A (Rrp7A) and similar proteins.  The family
          corresponds to the RRP7 domain of Rrp7A, also termed
          gastric cancer antigen Zg14, and similar proteins which
          are yeast ribosomal RNA-processing protein 7 (Rrp7p)
          homologs mainly found in Metazoans. The cellular
          function of Rrp7A remains unclear currently. Rrp7A
          harbors an N-terminal RNA recognition motif (RRM), also
          termed RBD (RNA binding domain) or RNP
          (ribonucleoprotein domain), and a C-terminal RRP7
          domain.
          Length = 129

 Score = 26.5 bits (59), Expect = 9.6
 Identities = 9/21 (42%), Positives = 14/21 (66%)

Query: 1  KKKKKKKKKKKKKKKKKRKKK 21
           +K+    K  +K+KKK+KKK
Sbjct: 65 ARKESVAAKAAEKEKKKKKKK 85


>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
          Length = 470

 Score = 27.8 bits (62), Expect = 9.7
 Identities = 13/29 (44%), Positives = 16/29 (55%)

Query: 1  KKKKKKKKKKKKKKKKKRKKKGILANSSS 29
           K+  KKKKKKKKKKKK+        +  
Sbjct: 52 NKQNNKKKKKKKKKKKKKNLGEAYDLAYD 80


>gnl|CDD|234471 TIGR04104, cxxc_20_cxxc, cxxc_20_cxxc protein.  This small,
           uncommon, poorly conserved protein is found primarily in
           the Firmicutes. It features are pair of CxxC motifs
           separated by about 20 amino acids, followed by a highly
           hydrophobic region of about 45 amino acids. It has no
           conserved gene neighborhood, and its function is
           unknown.
          Length = 94

 Score = 26.2 bits (58), Expect = 9.8
 Identities = 11/33 (33%), Positives = 17/33 (51%), Gaps = 3/33 (9%)

Query: 145 RCEDCNKSFSQAANLTAHVRTHSGEKPFRCPIC 177
            C++CN+ FS    L +     S  +P +CP C
Sbjct: 2   ICKNCNEKFSYKELLKS---LFSLYRPIKCPNC 31


>gnl|CDD|188353 TIGR03608, L_ocin_972_ABC, putative bacteriocin export ABC
           transporter, lactococcin 972 group.  A gene pair with a
           fairly wide distribution consists of a polypeptide
           related to the lactococcin 972 (see TIGR01653) and
           multiple-membrane-spanning putative immunity protein
           (see TIGR01654). This model represents a small clade
           within the ABC transporters that regularly are found
           adjacent to these bacteriocin system gene pairs and are
           likely serve as export proteins [Cellular processes,
           Toxin production and resistance, Transport and binding
           proteins, Unknown substrate].
          Length = 206

 Score = 27.2 bits (61), Expect = 10.0
 Identities = 15/33 (45%), Positives = 22/33 (66%), Gaps = 2/33 (6%)

Query: 1   KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKI 33
           K KK  KK+K++KKK+  +K G+  N   K+KI
Sbjct: 102 KYKKLSKKEKREKKKEALEKVGL--NLKLKQKI 132


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.318    0.132    0.406 

Gapped
Lambda     K      H
   0.267   0.0804    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 15,079,797
Number of extensions: 1364776
Number of successful extensions: 8957
Number of sequences better than 10.0: 1
Number of HSP's gapped: 7077
Number of HSP's successfully gapped: 617
Length of query: 311
Length of database: 10,937,602
Length adjustment: 97
Effective length of query: 214
Effective length of database: 6,635,264
Effective search space: 1419946496
Effective search space used: 1419946496
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 59 (26.4 bits)