RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy8050
(311 letters)
>gnl|CDD|222150 pfam13465, zf-H2C2_2, Zinc-finger double domain.
Length = 26
Score = 41.2 bits (97), Expect = 1e-05
Identities = 15/26 (57%), Positives = 21/26 (80%)
Query: 158 NLTAHVRTHSGEKPFRCPICDRRFSQ 183
NL H+RTH+GEKP++CP+C + FS
Sbjct: 1 NLRRHMRTHTGEKPYKCPVCGKSFSS 26
Score = 40.5 bits (95), Expect = 2e-05
Identities = 16/25 (64%), Positives = 20/25 (80%)
Query: 131 LKTHLRTHSGEKPYRCEDCNKSFSQ 155
L+ H+RTH+GEKPY+C C KSFS
Sbjct: 2 LRRHMRTHTGEKPYKCPVCGKSFSS 26
Score = 37.0 bits (86), Expect = 3e-04
Identities = 14/25 (56%), Positives = 21/25 (84%)
Query: 186 SVTTHMRTHSGERPYRCRLCKKAFS 210
++ HMRTH+GE+PY+C +C K+FS
Sbjct: 1 NLRRHMRTHTGEKPYKCPVCGKSFS 25
Score = 36.2 bits (84), Expect = 6e-04
Identities = 13/25 (52%), Positives = 18/25 (72%)
Query: 215 LTKHLRIHSGEKPYQCKLCLLRFSQ 239
L +H+R H+GEKPY+C +C FS
Sbjct: 2 LRRHMRTHTGEKPYKCPVCGKSFSS 26
>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 142
Score = 40.0 bits (94), Expect = 3e-04
Identities = 15/40 (37%), Positives = 23/40 (57%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECP 40
K+ K++KKK+KKKKKKK KK + ++ S+
Sbjct: 86 KRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDEE 125
Score = 30.8 bits (70), Expect = 0.38
Identities = 20/47 (42%), Positives = 27/47 (57%), Gaps = 1/47 (2%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEK-IQDRSECPGSGMGK 46
KKKKKKK KK KK++K K +S +E+ +D+ E P M K
Sbjct: 96 KKKKKKKAKKGNKKEEKEGSKSSEESSDEEEEGEEDKQEEPVEIMEK 142
Score = 27.3 bits (61), Expect = 6.4
Identities = 11/36 (30%), Positives = 18/36 (50%)
Query: 4 KKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSEC 39
+K KK+ K++KKK+KKK +K +
Sbjct: 81 EKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSK 116
>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
Length = 434
Score = 41.1 bits (97), Expect = 4e-04
Identities = 11/23 (47%), Positives = 16/23 (69%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGI 23
+KK+K+K+K K KK+ R K I
Sbjct: 400 AEKKEKEKEKPKVKKRHRDTKNI 422
Score = 39.2 bits (92), Expect = 0.002
Identities = 10/22 (45%), Positives = 15/22 (68%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKG 22
KK+K+K+K K KK+ + K G
Sbjct: 402 KKEKEKEKPKVKKRHRDTKNIG 423
Score = 38.8 bits (91), Expect = 0.003
Identities = 10/21 (47%), Positives = 16/21 (76%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
K+ +KK+K+K+K K KKR +
Sbjct: 398 KRAEKKEKEKEKPKVKKRHRD 418
Score = 36.5 bits (85), Expect = 0.016
Identities = 8/21 (38%), Positives = 12/21 (57%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
+K K KK+ + K KR+K
Sbjct: 408 EKPKVKKRHRDTKNIGKRRKP 428
Score = 36.1 bits (84), Expect = 0.017
Identities = 10/31 (32%), Positives = 18/31 (58%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKE 31
+K+K+K K KK+ + K K + +S+E
Sbjct: 404 EKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434
Score = 35.3 bits (82), Expect = 0.032
Identities = 8/21 (38%), Positives = 16/21 (76%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
K+ +KK+K+K+K K K++ +
Sbjct: 397 AKRAEKKEKEKEKPKVKKRHR 417
Score = 35.3 bits (82), Expect = 0.037
Identities = 9/21 (42%), Positives = 15/21 (71%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
K K+ +KK+K+K+K + KK
Sbjct: 394 KVLAKRAEKKEKEKEKPKVKK 414
Score = 34.9 bits (81), Expect = 0.040
Identities = 11/32 (34%), Positives = 19/32 (59%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
KK K+ +KK+K+K+K K K ++ + K
Sbjct: 393 KKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGK 424
Score = 34.9 bits (81), Expect = 0.043
Identities = 12/38 (31%), Positives = 20/38 (52%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
K+ +KK+K+K+K K +K+ N + K SE
Sbjct: 396 LAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSE 433
Score = 34.5 bits (80), Expect = 0.054
Identities = 10/36 (27%), Positives = 18/36 (50%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDR 36
K KK K+ +KK+K+++K + + I R
Sbjct: 390 KPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKR 425
Score = 33.0 bits (76), Expect = 0.21
Identities = 9/21 (42%), Positives = 13/21 (61%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
K KK K+ +KK+K K+K
Sbjct: 389 GKPSKKVLAKRAEKKEKEKEK 409
Score = 32.2 bits (74), Expect = 0.37
Identities = 7/21 (33%), Positives = 13/21 (61%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
K KK K+ +KK+++K+
Sbjct: 388 TGKPSKKVLAKRAEKKEKEKE 408
Score = 31.5 bits (72), Expect = 0.55
Identities = 9/21 (42%), Positives = 12/21 (57%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
K K KK K+ +KK K+K
Sbjct: 387 KTGKPSKKVLAKRAEKKEKEK 407
Score = 29.9 bits (68), Expect = 1.8
Identities = 10/37 (27%), Positives = 16/37 (43%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRS 37
+KK K KK K+ KK+ K++ +D
Sbjct: 384 SEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTK 420
Score = 28.4 bits (64), Expect = 5.7
Identities = 11/31 (35%), Positives = 12/31 (38%)
Query: 2 KKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
K K +KK K KK K KEK
Sbjct: 379 KTKAPSEKKTGKPSKKVLAKRAEKKEKEKEK 409
>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
complex aNOP56 subunit; Provisional.
Length = 414
Score = 41.1 bits (97), Expect = 4e-04
Identities = 16/22 (72%), Positives = 21/22 (95%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKG 22
KKK+++KK +K+KKKKKRKKKG
Sbjct: 384 KKKREEKKPQKRKKKKKRKKKG 405
Score = 40.3 bits (95), Expect = 9e-04
Identities = 17/22 (77%), Positives = 20/22 (90%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKG 22
K +K+KKKKK+KKK KKRKKKG
Sbjct: 391 KPQKRKKKKKRKKKGKKRKKKG 412
Score = 39.2 bits (92), Expect = 0.002
Identities = 14/21 (66%), Positives = 19/21 (90%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
K+KKKKK+KKK KK+KK+ +K
Sbjct: 394 KRKKKKKRKKKGKKRKKKGRK 414
Score = 38.8 bits (91), Expect = 0.003
Identities = 14/22 (63%), Positives = 19/22 (86%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKG 22
KK +K+KKKKK+KKK K++KK
Sbjct: 390 KKPQKRKKKKKRKKKGKKRKKK 411
Score = 38.4 bits (90), Expect = 0.003
Identities = 12/22 (54%), Positives = 19/22 (86%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKG 22
KK+++KK +K+KKKKK++KK
Sbjct: 385 KKREEKKPQKRKKKKKRKKKGK 406
Score = 38.4 bits (90), Expect = 0.004
Identities = 13/21 (61%), Positives = 19/21 (90%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
+K+KKKKK+KKK KK+K+K +
Sbjct: 393 QKRKKKKKRKKKGKKRKKKGR 413
Score = 38.0 bits (89), Expect = 0.004
Identities = 12/22 (54%), Positives = 19/22 (86%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKG 22
+++KK +K+KKKKK+KK+ KK
Sbjct: 387 REEKKPQKRKKKKKRKKKGKKR 408
Score = 37.3 bits (87), Expect = 0.007
Identities = 13/22 (59%), Positives = 19/22 (86%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKG 22
+KK +K+KKKKK+KKK +K+K
Sbjct: 389 EKKPQKRKKKKKRKKKGKKRKK 410
Score = 37.3 bits (87), Expect = 0.008
Identities = 13/22 (59%), Positives = 18/22 (81%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKG 22
+K+KKKKK+KKK KK++KK
Sbjct: 392 PQKRKKKKKRKKKGKKRKKKGR 413
Score = 37.3 bits (87), Expect = 0.008
Identities = 12/22 (54%), Positives = 19/22 (86%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKG 22
KKK+++KK +K+KKKK++KK
Sbjct: 383 PKKKREEKKPQKRKKKKKRKKK 404
Score = 35.7 bits (83), Expect = 0.027
Identities = 13/22 (59%), Positives = 19/22 (86%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKG 22
++KK +K+KKKKK+KKK KK+
Sbjct: 388 EEKKPQKRKKKKKRKKKGKKRK 409
Score = 33.8 bits (78), Expect = 0.091
Identities = 11/22 (50%), Positives = 18/22 (81%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKG 22
K KKK+++KK +K+KK+KK+
Sbjct: 381 KPPKKKREEKKPQKRKKKKKRK 402
Score = 31.1 bits (71), Expect = 0.79
Identities = 13/24 (54%), Positives = 18/24 (75%), Gaps = 2/24 (8%)
Query: 1 KKK--KKKKKKKKKKKKKKRKKKG 22
K+K K KKK+++KK +KRKKK
Sbjct: 376 KEKYPKPPKKKREEKKPQKRKKKK 399
>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
(DUF1754). This is a eukaryotic protein family of
unknown function.
Length = 90
Score = 37.8 bits (88), Expect = 8e-04
Identities = 15/52 (28%), Positives = 17/52 (32%)
Query: 2 KKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSGMGKPPHDVAD 53
KKKKKKKKKK K K + D E + D
Sbjct: 18 DVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDEDNEKIEQEED 69
Score = 31.2 bits (71), Expect = 0.14
Identities = 18/41 (43%), Positives = 23/41 (56%), Gaps = 5/41 (12%)
Query: 2 KKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGS 42
K K KK KKKKKKK+KK N S +E + ++ E S
Sbjct: 11 KLKGKKIDVKKKKKKKKKK-----NKSKEEVVTEKEEEEKS 46
Score = 29.7 bits (67), Expect = 0.56
Identities = 14/52 (26%), Positives = 23/52 (44%), Gaps = 6/52 (11%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGI------LANSSSKEKIQDRSECPGSGMGK 46
KKKKKKKK K K++ K++ +++ ++ E GM
Sbjct: 22 KKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDEDNEKIEQEEDGMNL 73
>gnl|CDD|177301 PHA00733, PHA00733, hypothetical protein.
Length = 128
Score = 37.9 bits (88), Expect = 0.001
Identities = 16/49 (32%), Positives = 25/49 (51%), Gaps = 2/49 (4%)
Query: 171 PFRCPICDRRFSQSSSVTTHMRTHSGERPYRCRLCKKAFSDSSTLTKHL 219
P+ CP+C FS S S+ H+R E C +C K F ++ + H+
Sbjct: 73 PYVCPLCLMPFSSSVSLKQHIRY--TEHSKVCPVCGKEFRNTDSTLDHV 119
Score = 31.4 bits (71), Expect = 0.25
Identities = 16/49 (32%), Positives = 23/49 (46%), Gaps = 2/49 (4%)
Query: 143 PYRCEDCNKSFSQAANLTAHVRTHSGEKPFRCPICDRRFSQSSSVTTHM 191
PY C C FS + +L H+R K CP+C + F + S H+
Sbjct: 73 PYVCPLCLMPFSSSVSLKQHIRYTEHSK--VCPVCGKEFRNTDSTLDHV 119
Score = 30.2 bits (68), Expect = 0.61
Identities = 17/39 (43%), Positives = 23/39 (58%), Gaps = 1/39 (2%)
Query: 210 SDSSTLTKHLRIHSGEKPYQCKLCLLRFSQSGNLNRHMR 248
+SS L K L + PY C LCL+ FS S +L +H+R
Sbjct: 57 DESSYLYKLLTSKA-VSPYVCPLCLMPFSSSVSLKQHIR 94
Score = 27.5 bits (61), Expect = 5.5
Identities = 16/49 (32%), Positives = 23/49 (46%), Gaps = 2/49 (4%)
Query: 199 PYRCRLCKKAFSDSSTLTKHLRIHSGEKPYQCKLCLLRFSQSGNLNRHM 247
PY C LC FS S +L +H+R K C +C F + + H+
Sbjct: 73 PYVCPLCLMPFSSSVSLKQHIRYTEHSK--VCPVCGKEFRNTDSTLDHV 119
>gnl|CDD|227381 COG5048, COG5048, FOG: Zn-finger [General function prediction
only].
Length = 467
Score = 39.3 bits (91), Expect = 0.002
Identities = 33/102 (32%), Positives = 50/102 (49%), Gaps = 7/102 (6%)
Query: 158 NLTAHVRTHSGEKPFRCPICDRRFSQSSSVTTHMRTHSGER-PYRCRLCKKAFSDSSTLT 216
+L++ + S + R + SS + + G P + + C +FS SS LT
Sbjct: 247 SLSSSDSSSSASESPRSSLPTASSQSSSPNESDSSSEKGFSLPIKSKQCNISFSRSSPLT 306
Query: 217 KHLR--IHSGE--KPYQC--KLCLLRFSQSGNLNRHMRVHGS 252
+HLR HSGE KP+ C LC FS++ L RH+ +H S
Sbjct: 307 RHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTS 348
Score = 38.5 bits (89), Expect = 0.004
Identities = 31/115 (26%), Positives = 51/115 (44%), Gaps = 10/115 (8%)
Query: 142 KPYRCEDCNKSFSQAANLTAHVRT--HSGE--KPFRCPI--CDRRFSQSSSVTTHMRTHS 195
P + + CN SFS+++ LT H+R+ HSGE KPF CP C + FS++ ++ H+ H+
Sbjct: 288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHT 347
Query: 196 GERPYRCRL--CKKAFSDSST--LTKHLRIHSGEKPYQCKLCLLRFSQSGNLNRH 246
P + +L FS + L+ + K + L
Sbjct: 348 SISPAKEKLLNSSSKFSPLLNNEPPQSLQQYKDLKNDKKSETLSNSCIRNFKRDS 402
Score = 38.1 bits (88), Expect = 0.004
Identities = 38/145 (26%), Positives = 61/145 (42%), Gaps = 17/145 (11%)
Query: 123 KTYARPSTLKTHLRT--HSGE--KPYRC--EDCNKSFSQAANLTAHVRTHSGEKPFRCPI 176
+++R S L HLR+ HSGE KP+ C C K FS+ L H+ H+ P + +
Sbjct: 297 ISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEKL 356
Query: 177 CDRRFSQSSSVTT-------HMRTHSGERPYRC--RLCKKAFSDSSTLTKHLRIHSGEKP 227
+ S + + ++ C + F S L+ H+ H +P
Sbjct: 357 LNSSSKFSPLLNNEPPQSLQQYKDLKNDKKSETLSNSCIRNFKRDSNLSLHIITHLSFRP 416
Query: 228 YQCK--LCLLRFSQSGNLNRHMRVH 250
Y CK C F++ NL H ++H
Sbjct: 417 YNCKNPPCSKSFNRHYNLIPHKKIH 441
Score = 37.8 bits (87), Expect = 0.005
Identities = 23/60 (38%), Positives = 34/60 (56%), Gaps = 2/60 (3%)
Query: 170 KPFRCPICDRRFSQSSSVTTHMRTHSGERPYRCRL--CKKAFSDSSTLTKHLRIHSGEKP 227
+P CP C FS+ +T H+R+H+GE+P +C C K+FS L++HLR H
Sbjct: 32 RPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPLELSRHLRTHHNNPS 91
Score = 37.0 bits (85), Expect = 0.011
Identities = 24/63 (38%), Positives = 37/63 (58%), Gaps = 2/63 (3%)
Query: 142 KPYRCEDCNKSFSQAANLTAHVRTHSGEKPFRC--PICDRRFSQSSSVTTHMRTHSGERP 199
+P C +C SFS+ +LT H+R+H+GEKP +C CD+ FS+ ++ H+RTH
Sbjct: 32 RPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPLELSRHLRTHHNNPS 91
Query: 200 YRC 202
Sbjct: 92 DLN 94
Score = 36.6 bits (84), Expect = 0.015
Identities = 26/63 (41%), Positives = 35/63 (55%), Gaps = 2/63 (3%)
Query: 198 RPYRCRLCKKAFSDSSTLTKHLRIHSGEKPYQC--KLCLLRFSQSGNLNRHMRVHGSQYN 255
RP C C +FS LT+H+R H+GEKP QC C FS+ L+RH+R H + +
Sbjct: 32 RPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPLELSRHLRTHHNNPS 91
Query: 256 MLP 258
L
Sbjct: 92 DLN 94
Score = 35.1 bits (80), Expect = 0.038
Identities = 36/139 (25%), Positives = 59/139 (42%), Gaps = 12/139 (8%)
Query: 119 FIPGKTYARPSTLKTHLRTHSGEKPYRC--EDCNKSFSQAAN-----LTAHVRTHSGEKP 171
+ GK ++R LK H+ H+ P + + + FS N + +K
Sbjct: 327 SLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLLNNEPPQSLQQYKDLKNDKK 386
Query: 172 FRC--PICDRRFSQSSSVTTHMRTHSGERPYRCR--LCKKAFSDSSTLTKHLRIHSGEKP 227
C R F + S+++ H+ TH RPY C+ C K+F+ L H +IH+ P
Sbjct: 387 SETLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIHTNHAP 446
Query: 228 YQCKLCLLRFSQSGNLNRH 246
C L F + +L+ H
Sbjct: 447 LLC-SILKSFRRDLDLSNH 464
Score = 30.0 bits (67), Expect = 1.6
Identities = 21/62 (33%), Positives = 31/62 (50%), Gaps = 2/62 (3%)
Query: 127 RPSTLKTHLRTHSGEKPYRCED--CNKSFSQAANLTAHVRTHSGEKPFRCPICDRRFSQS 184
R L H+R+H+GEKP +C C+KSFS+ L+ H+RTH +
Sbjct: 45 RLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPLELSRHLRTHHNNPSDLNSKSLPLSNSK 104
Query: 185 SS 186
+S
Sbjct: 105 AS 106
>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
(DUF2058). This domain, found in various prokaryotic
proteins, has no known function.
Length = 177
Score = 38.4 bits (90), Expect = 0.002
Identities = 12/35 (34%), Positives = 22/35 (62%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQD 35
KKK KK KK+K+K++K+ +K + K+ ++
Sbjct: 14 KKKAKKAKKEKRKQRKQARKGADDGDDELKQAAEE 48
>gnl|CDD|217502 pfam03343, SART-1, SART-1 family. SART-1 is a protein involved in
cell cycle arrest and pre-mRNA splicing. It has been
shown to be a component of U4/U6 x U5 tri-snRNP complex
in human, Schizosaccharomyces pombe and Saccharomyces
cerevisiae. SART-1 is a known tumour antigen in a range
of cancers recognised by T cells.
Length = 603
Score = 39.0 bits (91), Expect = 0.003
Identities = 15/34 (44%), Positives = 17/34 (50%)
Query: 2 KKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQD 35
K KK KKKKKKKKK+RK E +
Sbjct: 273 VKFKKPKKKKKKKKKRRKDLDEDELEPEAEGLGS 306
Score = 37.4 bits (87), Expect = 0.009
Identities = 12/29 (41%), Positives = 16/29 (55%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSS 29
K KK KKKKKKKKK+++ +
Sbjct: 273 VKFKKPKKKKKKKKKRRKDLDEDELEPEA 301
Score = 35.9 bits (83), Expect = 0.022
Identities = 13/38 (34%), Positives = 18/38 (47%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
KK KKKKKKKKK++K + + + S
Sbjct: 275 FKKPKKKKKKKKKRRKDLDEDELEPEAEGLGSSDSGSR 312
Score = 35.5 bits (82), Expect = 0.036
Identities = 12/21 (57%), Positives = 16/21 (76%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
+ K KK KKKKKKKK+++K
Sbjct: 270 SEMVKFKKPKKKKKKKKKRRK 290
Score = 32.8 bits (75), Expect = 0.24
Identities = 13/38 (34%), Positives = 17/38 (44%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
K KKKKKKKKK++K + A R +
Sbjct: 277 KPKKKKKKKKKRRKDLDEDELEPEAEGLGSSDSGSRKD 314
Score = 27.8 bits (62), Expect = 9.5
Identities = 12/32 (37%), Positives = 16/32 (50%), Gaps = 7/32 (21%)
Query: 1 KKKKKKKKKKKKKKKK-------KRKKKGILA 25
K+K KK + KKKK K K+ IL+
Sbjct: 176 KEKDKKNLELKKKKPDYDPDDDDKFNKRSILS 207
>gnl|CDD|200998 pfam00096, zf-C2H2, Zinc finger, C2H2 type. The C2H2 zinc finger
is the classical zinc finger domain. The two conserved
cysteines and histidines co-ordinate a zinc ion. The
following pattern describes the zinc finger.
#-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can
be any amino acid, and numbers in brackets indicate the
number of residues. The positions marked # are those
that are important for the stable fold of the zinc
finger. The final position can be either his or cys. The
C2H2 zinc finger is composed of two short beta strands
followed by an alpha helix. The amino terminal part of
the helix binds the major groove in DNA binding zinc
fingers. The accepted consensus binding sequence for Sp1
is usually defined by the asymmetric hexanucleotide core
GGGCGG but this sequence does not include, among others,
the GAG (=CTC) repeat that constitutes a high-affinity
site for Sp1 binding to the wt1 promoter.
Length = 22
Score = 34.2 bits (79), Expect = 0.003
Identities = 13/22 (59%), Positives = 17/22 (77%)
Query: 145 RCEDCNKSFSQAANLTAHVRTH 166
+C DC KSFS+ +NL H+RTH
Sbjct: 1 KCPDCGKSFSRKSNLKRHLRTH 22
Score = 33.5 bits (77), Expect = 0.005
Identities = 10/22 (45%), Positives = 16/22 (72%)
Query: 173 RCPICDRRFSQSSSVTTHMRTH 194
+CP C + FS+ S++ H+RTH
Sbjct: 1 KCPDCGKSFSRKSNLKRHLRTH 22
Score = 32.7 bits (75), Expect = 0.011
Identities = 11/22 (50%), Positives = 14/22 (63%)
Query: 201 RCRLCKKAFSDSSTLTKHLRIH 222
+C C K+FS S L +HLR H
Sbjct: 1 KCPDCGKSFSRKSNLKRHLRTH 22
Score = 31.5 bits (72), Expect = 0.030
Identities = 10/22 (45%), Positives = 13/22 (59%)
Query: 229 QCKLCLLRFSQSGNLNRHMRVH 250
+C C FS+ NL RH+R H
Sbjct: 1 KCPDCGKSFSRKSNLKRHLRTH 22
Score = 25.4 bits (56), Expect = 3.8
Identities = 11/17 (64%), Positives = 14/17 (82%)
Query: 122 GKTYARPSTLKTHLRTH 138
GK+++R S LK HLRTH
Sbjct: 6 GKSFSRKSNLKRHLRTH 22
>gnl|CDD|197676 smart00355, ZnF_C2H2, zinc finger.
Length = 23
Score = 33.6 bits (77), Expect = 0.005
Identities = 11/23 (47%), Positives = 15/23 (65%)
Query: 172 FRCPICDRRFSQSSSVTTHMRTH 194
+RCP C + F S++ HMRTH
Sbjct: 1 YRCPECGKVFKSKSALREHMRTH 23
Score = 32.1 bits (73), Expect = 0.017
Identities = 11/23 (47%), Positives = 14/23 (60%)
Query: 144 YRCEDCNKSFSQAANLTAHVRTH 166
YRC +C K F + L H+RTH
Sbjct: 1 YRCPECGKVFKSKSALREHMRTH 23
Score = 32.1 bits (73), Expect = 0.019
Identities = 11/23 (47%), Positives = 13/23 (56%)
Query: 200 YRCRLCKKAFSDSSTLTKHLRIH 222
YRC C K F S L +H+R H
Sbjct: 1 YRCPECGKVFKSKSALREHMRTH 23
Score = 31.3 bits (71), Expect = 0.036
Identities = 9/23 (39%), Positives = 10/23 (43%)
Query: 228 YQCKLCLLRFSQSGNLNRHMRVH 250
Y+C C F L HMR H
Sbjct: 1 YRCPECGKVFKSKSALREHMRTH 23
>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6. The surfeit locus
protein SURF-6 is shown to be a component of the
nucleolar matrix and has a strong binding capacity for
nucleic acids.
Length = 206
Score = 36.1 bits (84), Expect = 0.010
Identities = 13/68 (19%), Positives = 34/68 (50%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSGMGKPPHDVADVLLSLKH 60
+++++K++++K +KK+KRK+ ++ E + ++E S P + A+ +
Sbjct: 9 EQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAEGNIVFSK 68
Query: 61 AVVHPGQM 68
G+
Sbjct: 69 VEFADGEQ 76
Score = 33.8 bits (78), Expect = 0.062
Identities = 14/22 (63%), Positives = 16/22 (72%)
Query: 3 KKKKKKKKKKKKKKKRKKKGIL 24
KK+K KK KKKKK +KK IL
Sbjct: 185 KKRKDDKKNKKKKKAKKKGRIL 206
Score = 32.7 bits (75), Expect = 0.16
Identities = 12/22 (54%), Positives = 16/22 (72%)
Query: 2 KKKKKKKKKKKKKKKKRKKKGI 23
KK+K KK KKKKK K+K + +
Sbjct: 185 KKRKDDKKNKKKKKAKKKGRIL 206
Score = 32.3 bits (74), Expect = 0.19
Identities = 12/21 (57%), Positives = 14/21 (66%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
K+K KK KKKKK KKK +
Sbjct: 186 KRKDDKKNKKKKKAKKKGRIL 206
Score = 31.1 bits (71), Expect = 0.49
Identities = 12/20 (60%), Positives = 15/20 (75%)
Query: 1 KKKKKKKKKKKKKKKKKRKK 20
KK+K KK KKKKK KK+ +
Sbjct: 185 KKRKDDKKNKKKKKAKKKGR 204
Score = 30.0 bits (68), Expect = 1.1
Identities = 12/20 (60%), Positives = 18/20 (90%)
Query: 2 KKKKKKKKKKKKKKKKRKKK 21
K+K+K+KKK KK+ K+RK+K
Sbjct: 149 KRKEKQKKKSKKEWKERKEK 168
Score = 27.7 bits (62), Expect = 6.4
Identities = 10/17 (58%), Positives = 14/17 (82%)
Query: 5 KKKKKKKKKKKKKRKKK 21
KK K+K+K+KKK KK+
Sbjct: 145 KKALKRKEKQKKKSKKE 161
>gnl|CDD|222592 pfam14204, Ribosomal_L18_c, Ribosomal L18 C-terminal region.
This domain is the C-terminal end of ribosomal L18/L5
proteins.
Length = 93
Score = 34.1 bits (79), Expect = 0.017
Identities = 11/36 (30%), Positives = 19/36 (52%), Gaps = 3/36 (8%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDR 36
+KKK+KK+ K + K+ KK L K ++ +
Sbjct: 61 RKKKEKKEVKAESKRYNAKK---LTLEQRKARVAQK 93
>gnl|CDD|227516 COG5189, SFP1, Putative transcriptional repressor regulating G2/M
transition [Transcription / Cell division and chromosome
partitioning].
Length = 423
Score = 35.8 bits (82), Expect = 0.021
Identities = 21/78 (26%), Positives = 31/78 (39%), Gaps = 24/78 (30%)
Query: 140 GEKPYRCE--DCNKSFSQAANLTAHVRTH--------------------SGEKPFRCPIC 177
KPY+C CNK + L H H + +KP+RC +C
Sbjct: 346 DGKPYKCPVEGCNKKYKNQNGLKYH-MLHGHQNQKLHENPSPEKMNIFSAKDKPYRCEVC 404
Query: 178 DRRFSQSSSVTTHMRTHS 195
D+R+ + + H R HS
Sbjct: 405 DKRYKNLNGLKYH-RKHS 421
Score = 32.4 bits (73), Expect = 0.31
Identities = 16/81 (19%), Positives = 35/81 (43%), Gaps = 11/81 (13%)
Query: 168 GEKPFRCPI--CDRRFSQSSSVTTHMRTHSGERPYRCRLCKKAFSDSSTLTKHLRIHSGE 225
KP++CP+ C++++ + + HM + ++ + K + +
Sbjct: 346 DGKPYKCPVEGCNKKYKNQNGLKYHMLHGH---------QNQKLHENPSPEKMNIFSAKD 396
Query: 226 KPYQCKLCLLRFSQSGNLNRH 246
KPY+C++C R+ L H
Sbjct: 397 KPYRCEVCDKRYKNLNGLKYH 417
>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.
This domain is found to the N-terminus of bacterial
signal peptidases of the S49 family (pfam01343).
Length = 154
Score = 34.0 bits (79), Expect = 0.036
Identities = 10/21 (47%), Positives = 13/21 (61%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
K +K +KK +K K K KKK
Sbjct: 69 KAWEKAEKKAEKAKAKAEKKK 89
Score = 33.7 bits (78), Expect = 0.049
Identities = 9/19 (47%), Positives = 13/19 (68%)
Query: 2 KKKKKKKKKKKKKKKKRKK 20
K +KK +K K K +KK+ K
Sbjct: 73 KAEKKAEKAKAKAEKKKAK 91
Score = 33.7 bits (78), Expect = 0.050
Identities = 10/19 (52%), Positives = 13/19 (68%)
Query: 1 KKKKKKKKKKKKKKKKKRK 19
K +KK +K K K +KKK K
Sbjct: 73 KAEKKAEKAKAKAEKKKAK 91
Score = 32.9 bits (76), Expect = 0.098
Identities = 12/24 (50%), Positives = 15/24 (62%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGIL 24
KK +K K K +KKK KK + K L
Sbjct: 76 KKAEKAKAKAEKKKAKKEEPKPRL 99
Score = 32.5 bits (75), Expect = 0.12
Identities = 9/21 (42%), Positives = 13/21 (61%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
K +K +KK +K K K +KK
Sbjct: 68 LKAWEKAEKKAEKAKAKAEKK 88
Score = 32.1 bits (74), Expect = 0.16
Identities = 9/19 (47%), Positives = 13/19 (68%)
Query: 3 KKKKKKKKKKKKKKKRKKK 21
K +KK +K K K +K+K K
Sbjct: 73 KAEKKAEKAKAKAEKKKAK 91
Score = 31.0 bits (71), Expect = 0.41
Identities = 8/32 (25%), Positives = 15/32 (46%)
Query: 2 KKKKKKKKKKKKKKKKRKKKGILANSSSKEKI 33
K +K +KK +K K + +K K ++
Sbjct: 68 LKAWEKAEKKAEKAKAKAEKKKAKKEEPKPRL 99
Score = 30.6 bits (70), Expect = 0.49
Identities = 8/21 (38%), Positives = 13/21 (61%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
KK+ K +K +KK +K + K
Sbjct: 65 KKELKAWEKAEKKAEKAKAKA 85
Score = 30.6 bits (70), Expect = 0.57
Identities = 9/21 (42%), Positives = 15/21 (71%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
+KK +K K K +KKK K+++
Sbjct: 75 EKKAEKAKAKAEKKKAKKEEP 95
Score = 29.4 bits (67), Expect = 1.5
Identities = 8/21 (38%), Positives = 13/21 (61%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
+K +KK +K K K ++KK
Sbjct: 70 AWEKAEKKAEKAKAKAEKKKA 90
Score = 29.0 bits (66), Expect = 1.6
Identities = 9/21 (42%), Positives = 15/21 (71%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
+KK +K K K +KKK +K++
Sbjct: 74 AEKKAEKAKAKAEKKKAKKEE 94
Score = 29.0 bits (66), Expect = 1.7
Identities = 9/32 (28%), Positives = 15/32 (46%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
K+ K +K +KK +K K K + A +
Sbjct: 66 KELKAWEKAEKKAEKAKAKAEKKKAKKEEPKP 97
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 34.6 bits (80), Expect = 0.041
Identities = 11/30 (36%), Positives = 15/30 (50%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSK 30
K K +K K KKKK K K K + ++
Sbjct: 194 KLKAEKAAKGGKKKKGKAKAKLNVGGANDD 223
Score = 33.9 bits (78), Expect = 0.077
Identities = 10/38 (26%), Positives = 19/38 (50%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
+K+++K K K K KK K I +K + +++
Sbjct: 44 EKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGL 81
Score = 30.4 bits (69), Expect = 0.94
Identities = 9/33 (27%), Positives = 15/33 (45%)
Query: 2 KKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQ 34
K K +K K KKKK + K + ++ +
Sbjct: 194 KLKAEKAAKGGKKKKGKAKAKLNVGGANDDDDD 226
Score = 30.4 bits (69), Expect = 1.1
Identities = 7/38 (18%), Positives = 16/38 (42%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
K K KK K K ++++K ++++ +
Sbjct: 52 VAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTP 89
Score = 30.0 bits (68), Expect = 1.5
Identities = 10/38 (26%), Positives = 17/38 (44%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
K K K K KK K K ++K ++ +++ E
Sbjct: 49 KAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEE 86
Score = 28.9 bits (65), Expect = 2.8
Identities = 10/22 (45%), Positives = 12/22 (54%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKG 22
+K K +K K KKKK K K
Sbjct: 192 NEKLKAEKAAKGGKKKKGKAKA 213
Score = 28.9 bits (65), Expect = 3.1
Identities = 8/23 (34%), Positives = 8/23 (34%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGI 23
K KKKK K K K
Sbjct: 202 KGGKKKKGKAKAKLNVGGANDDD 224
Score = 28.9 bits (65), Expect = 3.4
Identities = 9/37 (24%), Positives = 21/37 (56%), Gaps = 6/37 (16%)
Query: 2 KKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
+ ++K+++K K K + KK + K KI+++ +
Sbjct: 41 EDEEKEEEKAKVAAKAKAKK------ALKAKIEEKEK 71
Score = 28.1 bits (63), Expect = 5.2
Identities = 8/36 (22%), Positives = 21/36 (58%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDR 36
K K ++K+K K++K+ K L + ++++ ++
Sbjct: 61 ALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEK 96
>gnl|CDD|218148 pfam04557, tRNA_synt_1c_R2, Glutaminyl-tRNA synthetase,
non-specific RNA binding region part 2. This is a
region found N terminal to the catalytic domain of
glutaminyl-tRNA synthetase (EC 6.1.1.18) in eukaryotes
but not in Escherichia coli. This region is thought to
bind RNA in a non-specific manner, enhancing
interactions between the tRNA and enzyme, but is not
essential for enzyme function.
Length = 83
Score = 32.3 bits (74), Expect = 0.046
Identities = 15/53 (28%), Positives = 21/53 (39%), Gaps = 3/53 (5%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSGMGKPPHDVAD 53
KKKKKKKKKK + K A + + E +G+ H +
Sbjct: 22 LVKKKKKKKKKKAEDTAATAKAKKATAEDVSEGAMFMEGF---LGEKFHKPGE 71
Score = 26.9 bits (60), Expect = 4.3
Identities = 14/37 (37%), Positives = 19/37 (51%)
Query: 2 KKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
K + KKKKKKKKK+ + + K +D SE
Sbjct: 17 KTEADLVKKKKKKKKKKAEDTAATAKAKKATAEDVSE 53
Score = 26.6 bits (59), Expect = 6.4
Identities = 13/40 (32%), Positives = 19/40 (47%)
Query: 5 KKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSGM 44
K + KKKKKK+KKK +++K K + M
Sbjct: 17 KTEADLVKKKKKKKKKKAEDTAATAKAKKATAEDVSEGAM 56
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 33.9 bits (78), Expect = 0.053
Identities = 16/38 (42%), Positives = 21/38 (55%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
K K KKKK KKKK K K KK + S K+ ++ +
Sbjct: 83 KWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAED 120
Score = 32.4 bits (74), Expect = 0.15
Identities = 16/32 (50%), Positives = 17/32 (53%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
K+K K KKKK KKKK K K K EK
Sbjct: 81 KQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEK 112
Score = 30.8 bits (70), Expect = 0.64
Identities = 15/32 (46%), Positives = 20/32 (62%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
+K K KKKK KKKK K + KK + S++K
Sbjct: 82 QKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKK 113
Score = 28.9 bits (65), Expect = 2.6
Identities = 13/34 (38%), Positives = 15/34 (44%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQ 34
KKK K K KK KK K +KK E +
Sbjct: 93 KKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLT 126
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 33.9 bits (78), Expect = 0.055
Identities = 17/38 (44%), Positives = 28/38 (73%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
K+KKKKK+ KK+KK+KK KK+ ++ SK+K + + +
Sbjct: 155 KEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKK 192
Score = 31.6 bits (72), Expect = 0.32
Identities = 13/21 (61%), Positives = 16/21 (76%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
KK+K + K KKKKKK+KKK
Sbjct: 173 KKEKMVEPKGSKKKKKKKKKK 193
Score = 30.5 bits (69), Expect = 0.80
Identities = 13/21 (61%), Positives = 18/21 (85%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
++KK+KKKKK+ KK+KK KK
Sbjct: 152 EEKKEKKKKKEVKKEKKEKKD 172
Score = 29.7 bits (67), Expect = 1.4
Identities = 12/21 (57%), Positives = 20/21 (95%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
+++KK+KKKKK+ KK+K++KK
Sbjct: 151 EEEKKEKKKKKEVKKEKKEKK 171
Score = 27.4 bits (61), Expect = 9.1
Identities = 11/22 (50%), Positives = 18/22 (81%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKG 22
++++KK+KKKKK+ KK KK+
Sbjct: 149 VEEEEKKEKKKKKEVKKEKKEK 170
>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase;
Provisional.
Length = 330
Score = 34.4 bits (80), Expect = 0.061
Identities = 9/21 (42%), Positives = 14/21 (66%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
K +KKK+K++ K K K K +
Sbjct: 71 KAQKKKEKQEAKAAKAKSKPR 91
Score = 32.9 bits (76), Expect = 0.21
Identities = 9/22 (40%), Positives = 13/22 (59%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKG 22
K K +KKK+K++ K K K
Sbjct: 67 KAWHKAQKKKEKQEAKAAKAKS 88
Score = 32.1 bits (74), Expect = 0.33
Identities = 8/21 (38%), Positives = 12/21 (57%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
K K +KKK+K++ K K
Sbjct: 66 LKAWHKAQKKKEKQEAKAAKA 86
Score = 30.6 bits (70), Expect = 1.1
Identities = 9/21 (42%), Positives = 13/21 (61%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
KK+ K K +KKK+K+ K
Sbjct: 63 KKELKAWHKAQKKKEKQEAKA 83
Score = 29.8 bits (68), Expect = 1.8
Identities = 8/21 (38%), Positives = 13/21 (61%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
K+ K K +KKK+K++ K
Sbjct: 64 KELKAWHKAQKKKEKQEAKAA 84
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 33.9 bits (78), Expect = 0.066
Identities = 15/37 (40%), Positives = 18/37 (48%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRS 37
KKKK+ K K K+ KK+KKK A S K
Sbjct: 85 KKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPK 121
Score = 28.9 bits (65), Expect = 3.1
Identities = 11/38 (28%), Positives = 20/38 (52%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
K+ K K K+ KKKKK+ + ++ + + +SE
Sbjct: 88 KRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSE 125
Score = 28.1 bits (63), Expect = 5.5
Identities = 17/46 (36%), Positives = 20/46 (43%), Gaps = 6/46 (13%)
Query: 1 KKKKKKKK------KKKKKKKKKRKKKGILANSSSKEKIQDRSECP 40
KKK+ K K KKKKKK K A K+K + S P
Sbjct: 86 KKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSERISWAP 131
Score = 27.7 bits (62), Expect = 6.7
Identities = 7/21 (33%), Positives = 14/21 (66%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
+++++ KKKK+ K K K+
Sbjct: 78 LQREERLKKKKRVKTKAYKEP 98
Score = 27.7 bits (62), Expect = 7.8
Identities = 8/21 (38%), Positives = 17/21 (80%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
+ ++KK + K +K+K+K+K+K
Sbjct: 161 EIRRKKIQAKARKRKEKKKEK 181
>gnl|CDD|206065 pfam13894, zf-C2H2_4, C2H2-type zinc finger. This family contains
a number of divergent C2H2 type zinc fingers.
Length = 24
Score = 30.3 bits (68), Expect = 0.074
Identities = 9/23 (39%), Positives = 15/23 (65%)
Query: 172 FRCPICDRRFSQSSSVTTHMRTH 194
F+CP+C + FS ++ H+R H
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKH 23
Score = 28.8 bits (64), Expect = 0.30
Identities = 10/23 (43%), Positives = 13/23 (56%)
Query: 144 YRCEDCNKSFSQAANLTAHVRTH 166
++C C KSFS L H+R H
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKH 23
Score = 28.4 bits (63), Expect = 0.40
Identities = 11/23 (47%), Positives = 15/23 (65%)
Query: 200 YRCRLCKKAFSDSSTLTKHLRIH 222
++C LC K+FS L +HLR H
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKH 23
Score = 27.2 bits (60), Expect = 1.0
Identities = 10/23 (43%), Positives = 13/23 (56%)
Query: 228 YQCKLCLLRFSQSGNLNRHMRVH 250
++C LC FS L RH+R H
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKH 23
>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2. It is vital for
effective cell-replication that replication is not
stalled at any point by, for instance, damaged bases.
Rtf2 stabilizes the replication fork stalled at the
site-specific replication barrier RTS1 by preventing
replication restart until completion of DNA synthesis by
a converging replication fork initiated at a flanking
origin. The RTS1 element terminates replication forks
that are moving in the cen2-distal direction while
allowing forks moving in the cen2-proximal direction to
pass through the region. Rtf2 contains a C2HC2 motif
related to the C3HC4 RING-finger motif, and would appear
to fold up, creating a RING finger-like structure but
forming only one functional Zn2+ ion-binding site.
Length = 254
Score = 33.5 bits (77), Expect = 0.095
Identities = 18/54 (33%), Positives = 26/54 (48%), Gaps = 2/54 (3%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSEC--PGSGMGKPPHDVA 52
K + ++++ KKKKKKKK+K K A SS E + SG G+
Sbjct: 173 KARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAKK 226
>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19.
Med19 represents a family of conserved proteins which
are members of the multi-protein co-activator Mediator
complex. Mediator is required for activation of RNA
polymerase II transcription by DNA binding
transactivators.
Length = 178
Score = 32.9 bits (75), Expect = 0.10
Identities = 18/43 (41%), Positives = 25/43 (58%), Gaps = 7/43 (16%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSG 43
KK KKKK + K++KKK+K+K K+K + E PG G
Sbjct: 142 KKHKKKKHEDDKERKKKKKEKK-------KKKKRHSPEHPGVG 177
>gnl|CDD|219564 pfam07771, TSGP1, Tick salivary peptide group 1. This contains a
group of peptides derived from a salivary gland cDNA
library of the tick Ixodes scapularis. Also present are
peptides from a related tick species, Ixodes ricinus.
They are characterized by a putative signal peptide
indicative of secretion and conserved cysteine residues.
Length = 120
Score = 32.2 bits (73), Expect = 0.11
Identities = 15/21 (71%), Positives = 15/21 (71%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
K KKKKKK KK KK KK KK
Sbjct: 99 KPKKKKKKSKKTKKPKKSSKK 119
Score = 31.8 bits (72), Expect = 0.16
Identities = 14/20 (70%), Positives = 14/20 (70%)
Query: 2 KKKKKKKKKKKKKKKKRKKK 21
KKKKKK KK KK KK KK
Sbjct: 101 KKKKKKSKKTKKPKKSSKKD 120
Score = 31.8 bits (72), Expect = 0.17
Identities = 13/20 (65%), Positives = 14/20 (70%)
Query: 1 KKKKKKKKKKKKKKKKKRKK 20
KKKKKK KK KK KK +K
Sbjct: 101 KKKKKKSKKTKKPKKSSKKD 120
>gnl|CDD|224508 COG1592, COG1592, Rubrerythrin [Energy production and conversion].
Length = 166
Score = 32.7 bits (75), Expect = 0.14
Identities = 12/39 (30%), Positives = 13/39 (33%), Gaps = 13/39 (33%)
Query: 140 GEKPYRCEDCNKSFSQAANLTAHVRTHSGEKPFRCPICD 178
K + C C TH GE P CPIC
Sbjct: 131 EGKVWVCPVCG-------------YTHEGEAPEVCPICG 156
>gnl|CDD|220611 pfam10169, Laps, Learning-associated protein. This is a family of
121-amino acid secretory proteins. Laps functions in the
regulation of neuronal cell adhesion and/or movement and
synapse attachment. Laps binds to the ApC/EBP (Aplysia
CCAAT/enhancer binding protein) promoter and activates
the transcription of ApC/EBP mRNA.
Length = 124
Score = 31.7 bits (72), Expect = 0.15
Identities = 12/23 (52%), Positives = 18/23 (78%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGI 23
++ KK KKK++KKKKKK+ K +
Sbjct: 101 RQAKKLKKKREKKKKKKKGAKNL 123
Score = 28.7 bits (64), Expect = 1.7
Identities = 12/21 (57%), Positives = 18/21 (85%)
Query: 2 KKKKKKKKKKKKKKKKRKKKG 22
+++ KK KKK++KKK+KKKG
Sbjct: 99 NQRQAKKLKKKREKKKKKKKG 119
Score = 27.5 bits (61), Expect = 4.1
Identities = 12/21 (57%), Positives = 15/21 (71%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
KK KKK++KKKKKKK +
Sbjct: 104 KKLKKKREKKKKKKKGAKNLA 124
>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
N-terminal. SRP is a complex of six distinct
polypeptides and a 7S RNA that is essential for
transferring nascent polypeptide chains that are
destined for export from the cell to the translocation
apparatus of the endoplasmic reticulum (ER) membrane.
SRP binds hydrophobic signal sequences as they emerge
from the ribosome, and arrests translation.
Length = 272
Score = 33.2 bits (76), Expect = 0.15
Identities = 14/58 (24%), Positives = 19/58 (32%), Gaps = 4/58 (6%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKG---ILANSSSKEKIQDRSECPGSGM-GKPPHDVADV 54
+K K K KK KK L + + D S + P DV +V
Sbjct: 191 ASSGDEKSPKSKAAPKKAGKKMRKWDLDGDEDDDAVLDYSAPDANDENADAPEDVEEV 248
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 32.3 bits (74), Expect = 0.16
Identities = 6/20 (30%), Positives = 16/20 (80%)
Query: 1 KKKKKKKKKKKKKKKKKRKK 20
K+ K K++K+++ ++K+K+
Sbjct: 126 KELAKLKREKRRENERKQKE 145
Score = 31.9 bits (73), Expect = 0.21
Identities = 9/22 (40%), Positives = 17/22 (77%)
Query: 3 KKKKKKKKKKKKKKKRKKKGIL 24
K+ K K++K+++ +RK+K IL
Sbjct: 126 KELAKLKREKRRENERKQKEIL 147
Score = 29.6 bits (67), Expect = 1.3
Identities = 6/21 (28%), Positives = 17/21 (80%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
K K++K+++ ++K+K+ K++
Sbjct: 130 KLKREKRRENERKQKEILKEQ 150
Score = 29.2 bits (66), Expect = 1.4
Identities = 6/20 (30%), Positives = 15/20 (75%)
Query: 1 KKKKKKKKKKKKKKKKKRKK 20
K K++K+++ ++K+K+ K
Sbjct: 129 AKLKREKRRENERKQKEILK 148
Score = 29.2 bits (66), Expect = 1.8
Identities = 5/20 (25%), Positives = 16/20 (80%)
Query: 2 KKKKKKKKKKKKKKKKRKKK 21
K+ K K++K+++ ++++K+
Sbjct: 126 KELAKLKREKRRENERKQKE 145
Score = 28.4 bits (64), Expect = 2.6
Identities = 6/21 (28%), Positives = 14/21 (66%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
K K++K+++ ++K+K K
Sbjct: 128 LAKLKREKRRENERKQKEILK 148
Score = 26.9 bits (60), Expect = 9.4
Identities = 6/21 (28%), Positives = 16/21 (76%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
K++K+++ ++K+K+ K + K
Sbjct: 132 KREKRRENERKQKEILKEQMK 152
>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
Length = 135
Score = 31.6 bits (72), Expect = 0.20
Identities = 16/21 (76%), Positives = 18/21 (85%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
K+KK+KKKKKKKKKKK KK
Sbjct: 110 KQKKQKKKKKKKKKKKTSKKA 130
Score = 31.6 bits (72), Expect = 0.20
Identities = 15/23 (65%), Positives = 19/23 (82%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGI 23
K+K K+KK+KKKKKKK+KKK
Sbjct: 105 LKEKAKQKKQKKKKKKKKKKKTS 127
Score = 30.8 bits (70), Expect = 0.33
Identities = 16/21 (76%), Positives = 17/21 (80%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
K+KKKKKKKKKKK KK KK
Sbjct: 113 KQKKKKKKKKKKKTSKKAAKK 133
Score = 30.8 bits (70), Expect = 0.37
Identities = 16/21 (76%), Positives = 18/21 (85%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
K+KK+KKKKKKKKKKK KK
Sbjct: 109 AKQKKQKKKKKKKKKKKTSKK 129
Score = 30.8 bits (70), Expect = 0.44
Identities = 15/22 (68%), Positives = 17/22 (77%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKG 22
KK+KKKKKKKKKKK K+ K
Sbjct: 112 KKQKKKKKKKKKKKTSKKAAKK 133
Score = 30.4 bits (69), Expect = 0.54
Identities = 15/21 (71%), Positives = 18/21 (85%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
K K+KK+KKKKKKKKKK+ K
Sbjct: 108 KAKQKKQKKKKKKKKKKKTSK 128
Score = 30.4 bits (69), Expect = 0.56
Identities = 16/21 (76%), Positives = 17/21 (80%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
KKKKKKKKKKK KK +KKK
Sbjct: 115 KKKKKKKKKKKTSKKAAKKKK 135
Score = 30.0 bits (68), Expect = 0.63
Identities = 16/21 (76%), Positives = 17/21 (80%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
+KKKKKKKKKKK KK KKK
Sbjct: 114 QKKKKKKKKKKKTSKKAAKKK 134
Score = 30.0 bits (68), Expect = 0.68
Identities = 14/21 (66%), Positives = 17/21 (80%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
+KK+KKKKKKKKKKK +K
Sbjct: 111 QKKQKKKKKKKKKKKTSKKAA 131
Score = 27.7 bits (62), Expect = 3.8
Identities = 13/21 (61%), Positives = 18/21 (85%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
K+ K+K K+KK+KKKK+KKK
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKK 122
Score = 27.7 bits (62), Expect = 4.5
Identities = 15/32 (46%), Positives = 24/32 (75%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
+ K+K K+KK+KKKKK+KKK + ++K+K
Sbjct: 103 QVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKK 134
Score = 27.3 bits (61), Expect = 5.3
Identities = 14/31 (45%), Positives = 19/31 (61%)
Query: 2 KKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
K+ K+K K+KK+KKK+KKK S K
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKKAAK 132
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 32.9 bits (75), Expect = 0.20
Identities = 16/43 (37%), Positives = 22/43 (51%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSG 43
K KK K KKK KK KKK K+ + + + +D S+ S
Sbjct: 145 KAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESA 187
Score = 30.2 bits (68), Expect = 1.5
Identities = 13/41 (31%), Positives = 21/41 (51%)
Query: 2 KKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGS 42
+ K KK K KKK KK +KK+ ++ E+ + + S
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKS 183
Score = 29.0 bits (65), Expect = 3.6
Identities = 16/38 (42%), Positives = 22/38 (57%), Gaps = 4/38 (10%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
+ K KK K KKK KK K+K+ A SS + ++ SE
Sbjct: 143 ETKAKKGKAKKKTKKSKKKE----AKESSDKDDEEESE 176
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 33.0 bits (76), Expect = 0.21
Identities = 12/38 (31%), Positives = 25/38 (65%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
KK KK +K +KK+++++K+K A + K++ ++ E
Sbjct: 410 KKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEE 447
Score = 32.6 bits (75), Expect = 0.24
Identities = 10/38 (26%), Positives = 22/38 (57%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
K +KK++++KK+KKKK K +++ ++ +
Sbjct: 418 KAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEK 455
Score = 32.2 bits (74), Expect = 0.30
Identities = 12/50 (24%), Positives = 19/50 (38%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSGMGKPPHD 50
KK KK KK +K +KKR+++ + + E K
Sbjct: 406 KKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEK 455
Score = 32.2 bits (74), Expect = 0.35
Identities = 10/48 (20%), Positives = 18/48 (37%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSGMGKPP 48
KK KK KK +K +K++++ + + E K
Sbjct: 405 SKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKE 452
Score = 31.0 bits (71), Expect = 0.79
Identities = 14/38 (36%), Positives = 22/38 (57%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
+KK++++KK+KKKK KK KEK ++ E
Sbjct: 419 AEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKE 456
Score = 30.7 bits (70), Expect = 0.96
Identities = 12/38 (31%), Positives = 22/38 (57%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
+++KK+KKKK KKK +++ +EK ++ E
Sbjct: 424 EEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEE 461
Score = 28.7 bits (65), Expect = 4.7
Identities = 12/38 (31%), Positives = 23/38 (60%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
+KK+KKKK KKK++ +++ KE+ ++ +E
Sbjct: 426 EKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAE 463
Score = 28.3 bits (64), Expect = 6.2
Identities = 10/37 (27%), Positives = 15/37 (40%)
Query: 2 KKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
KK KK KK +K KK+ K+ + +
Sbjct: 404 GSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKK 440
Score = 28.0 bits (63), Expect = 7.2
Identities = 10/47 (21%), Positives = 23/47 (48%), Gaps = 2/47 (4%)
Query: 1 KKKKKKKKKKKKKKKKKRKK--KGILANSSSKEKIQDRSECPGSGMG 45
KK KKK+++++++K KK + ++E+ ++ E
Sbjct: 432 KKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKKQAT 478
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 32.6 bits (75), Expect = 0.27
Identities = 13/50 (26%), Positives = 23/50 (46%), Gaps = 3/50 (6%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSGMGKPPHD 50
KK +KK++K +KK +K+ +K A + K + + G K
Sbjct: 412 KKLRKKQRKAEKKAEKEEAEK---AAAKKKAEAAAKKAKGPDGETKKVDP 458
Score = 28.4 bits (64), Expect = 5.5
Identities = 12/57 (21%), Positives = 25/57 (43%)
Query: 3 KKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSGMGKPPHDVADVLLSLK 59
++KK +KK++K ++K + A ++ +K + + G V L K
Sbjct: 408 PAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDPDPLGEK 464
>gnl|CDD|203848 pfam08079, Ribosomal_L30_N, Ribosomal L30 N-terminal domain.
This presumed domain is found at the N-terminus of
Ribosomal L30 proteins and has been termed RL30NT or
NUC018.
Length = 71
Score = 29.5 bits (67), Expect = 0.33
Identities = 9/24 (37%), Positives = 15/24 (62%)
Query: 2 KKKKKKKKKKKKKKKKRKKKGILA 25
+ K+ KK+ KK +K+K+K I
Sbjct: 13 RAKRAKKRAAKKAARKKKRKLIFK 36
Score = 28.7 bits (65), Expect = 0.83
Identities = 8/21 (38%), Positives = 14/21 (66%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
+ K+ KK+ KK +KK++K
Sbjct: 13 RAKRAKKRAAKKAARKKKRKL 33
Score = 28.3 bits (64), Expect = 1.1
Identities = 10/20 (50%), Positives = 13/20 (65%)
Query: 2 KKKKKKKKKKKKKKKKRKKK 21
K + K+ KK+ KK RKKK
Sbjct: 11 KLRAKRAKKRAAKKAARKKK 30
Score = 28.3 bits (64), Expect = 1.1
Identities = 10/38 (26%), Positives = 20/38 (52%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
KK+K+ +K + K+ K++ A ++ I R+E
Sbjct: 2 LLKKRKRNEKLRAKRAKKRAAKKAARKKKRKLIFKRAE 39
Score = 26.8 bits (60), Expect = 3.6
Identities = 9/21 (42%), Positives = 13/21 (61%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
+ KK +KKK+K KR +K
Sbjct: 20 RAAKKAARKKKRKLIFKRAEK 40
>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
Length = 859
Score = 32.1 bits (74), Expect = 0.40
Identities = 9/32 (28%), Positives = 9/32 (28%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
K KK K RK K S K
Sbjct: 826 DAPAKDPAGKKAAVKFSRKTKQQYVASEKDGK 857
Score = 30.1 bits (69), Expect = 1.8
Identities = 7/32 (21%), Positives = 10/32 (31%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
K KK K +K +S K+
Sbjct: 825 ADAPAKDPAGKKAAVKFSRKTKQQYVASEKDG 856
Score = 27.8 bits (63), Expect = 9.7
Identities = 5/21 (23%), Positives = 9/21 (42%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
K +K K++ +K K
Sbjct: 838 AVKFSRKTKQQYVASEKDGKA 858
>gnl|CDD|165133 PHA02768, PHA02768, hypothetical protein; Provisional.
Length = 55
Score = 29.2 bits (65), Expect = 0.40
Identities = 9/24 (37%), Positives = 15/24 (62%)
Query: 172 FRCPICDRRFSQSSSVTTHMRTHS 195
+ CPIC + + S+ TH+R H+
Sbjct: 6 YECPICGEIYIKRKSMITHLRKHN 29
>gnl|CDD|218591 pfam05443, ROS_MUCR, ROS/MUCR transcriptional regulator protein.
This family consists of several ROS/MUCR transcriptional
regulator proteins. The ros chromosomal gene is present
in octopine and nopaline strains of Agrobacterium
tumefaciens as well as in Rhizobium meliloti. This gene
encodes a 15.5-kDa protein that specifically represses
the virC and virD operons in the virulence region of the
Ti plasmid and is necessary for succinoglycan
production. Sinorhizobium meliloti can produce two types
of acidic exopolysaccharides, succinoglycan and
galactoglucan, that are interchangeable for infection of
alfalfa nodules. MucR from Sinorhizobium meliloti acts
as a transcriptional repressor that blocks the
expression of the exp genes responsible for
galactoglucan production therefore allowing the
exclusive production of succinoglycan.
Length = 130
Score = 30.6 bits (70), Expect = 0.43
Identities = 21/74 (28%), Positives = 26/74 (35%), Gaps = 19/74 (25%)
Query: 74 GDPGAPYQVGAPQQFPTASLSYTVHPHQSSRGRPLNSGIRLILIQFIPGKTYARPSTLKT 133
G P A + P + +V P +I GK + TLK
Sbjct: 43 GAPAAAPAAEEEKPTPAVPIKKSVTPDY--------------IICLEDGKKF---KTLKR 85
Query: 134 HLRTHSGEKP--YR 145
HLRTH G P YR
Sbjct: 86 HLRTHHGLTPEEYR 99
>gnl|CDD|221654 pfam12589, WBS_methylT, Methyltransferase involved in
Williams-Beuren syndrome. This domain family is found
in eukaryotes, and is typically between 72 and 83 amino
acids in length. The family is found in association
with pfam08241. This family is made up of
S-adenosylmethionine-dependent methyltransferases. The
proteins are deleted in Williams-Beuren syndrome (WBS),
a complex developmental disorder with multisystemic
manifestations including supravalvular aortic stenosis
(SVAS) and a specific cognitive phenotype.
Length = 85
Score = 29.6 bits (67), Expect = 0.44
Identities = 13/26 (50%), Positives = 19/26 (73%), Gaps = 4/26 (15%)
Query: 1 KKKKKKKKKKKK----KKKKKRKKKG 22
KKKKKK KKK K +KK++ +++G
Sbjct: 43 KKKKKKLKKKSKEWILRKKEQMRRRG 68
Score = 27.6 bits (62), Expect = 2.4
Identities = 11/19 (57%), Positives = 15/19 (78%)
Query: 1 KKKKKKKKKKKKKKKKKRK 19
++ ++KKKKKKK KKK K
Sbjct: 36 SQRNRRKKKKKKKLKKKSK 54
Score = 26.1 bits (58), Expect = 8.8
Identities = 12/24 (50%), Positives = 18/24 (75%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGIL 24
++ ++KKKKKKK KK+ K+ IL
Sbjct: 35 ISQRNRRKKKKKKKLKKKSKEWIL 58
>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
Length = 1094
Score = 32.1 bits (73), Expect = 0.46
Identities = 18/34 (52%), Positives = 23/34 (67%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQ 34
KKKKKKK K+KKKKK + K I A + +E I+
Sbjct: 685 KKKKKKKSKEKKKKKNREASKQIHALENIEEGIE 718
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This
nucleolar family of proteins are involved in 60S
ribosomal biogenesis. They are specifically involved in
the processing beyond the 27S stage of 25S rRNA
maturation. This family contains sequences that bear
similarity to the glioma tumour suppressor candidate
region gene 2 protein (p60). This protein has been
found to interact with herpes simplex type 1 regulatory
proteins.
Length = 387
Score = 31.6 bits (72), Expect = 0.49
Identities = 12/36 (33%), Positives = 16/36 (44%)
Query: 2 KKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRS 37
+ KKK++KK KK K IL N S +
Sbjct: 50 DEATIKKKRRKKLKKPLKSDEILENRSKVPAPSVKK 85
Score = 28.5 bits (64), Expect = 5.2
Identities = 9/34 (26%), Positives = 18/34 (52%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQ 34
+ K+K K ++ K+K+RK+ A + K +
Sbjct: 272 PVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKK 305
Score = 27.7 bits (62), Expect = 9.2
Identities = 8/21 (38%), Positives = 15/21 (71%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
K + K+K K ++ K+KR+K+
Sbjct: 271 KPVRPKRKTKAQRNKEKRRKE 291
>gnl|CDD|128804 smart00531, TFIIE, Transcription initiation factor IIE.
Length = 147
Score = 30.4 bits (69), Expect = 0.51
Identities = 11/48 (22%), Positives = 19/48 (39%), Gaps = 4/48 (8%)
Query: 131 LKTHLRTHSGEKPYRCEDCNKSFSQAANLTAHVRTHSGEKPFRCPICD 178
L+ L + Y+C +C ++ L A + + F CP C
Sbjct: 87 LEDKLEDETNNAYYKCPNCQSKYTF---LEA-NQLLDMDGTFTCPRCG 130
>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
Length = 158
Score = 30.5 bits (69), Expect = 0.54
Identities = 11/32 (34%), Positives = 20/32 (62%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
+K+ K KK+KK+KK+K+ + + S+K
Sbjct: 108 EKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTT 139
Score = 30.5 bits (69), Expect = 0.60
Identities = 10/32 (31%), Positives = 21/32 (65%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
K+ ++K+ K KK+KK++K+K + + K+
Sbjct: 104 KETEEKEAIKAKKEKKEKKEKKVAEKLAKKKS 135
Score = 28.2 bits (63), Expect = 3.8
Identities = 13/32 (40%), Positives = 19/32 (59%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
K KK+KK+KK+KK +K KK + + K
Sbjct: 113 KAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTK 144
Score = 28.2 bits (63), Expect = 4.1
Identities = 7/21 (33%), Positives = 15/21 (71%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
K++K+ K+ ++K+ K KK+
Sbjct: 97 KEQKEVSKETEEKEAIKAKKE 117
Score = 27.8 bits (62), Expect = 5.2
Identities = 12/31 (38%), Positives = 20/31 (64%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKE 31
++K+ K KK+KK+KK++K LA S +
Sbjct: 107 EEKEAIKAKKEKKEKKEKKVAEKLAKKKSTK 137
Score = 27.8 bits (62), Expect = 5.2
Identities = 7/21 (33%), Positives = 15/21 (71%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
+K+ K+ ++K+ K K++KK
Sbjct: 99 QKEVSKETEEKEAIKAKKEKK 119
Score = 27.0 bits (60), Expect = 8.3
Identities = 9/30 (30%), Positives = 16/30 (53%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSK 30
K+ K+ ++K+ K KK KK+ + K
Sbjct: 100 KEVSKETEEKEAIKAKKEKKEKKEKKVAEK 129
>gnl|CDD|220357 pfam09716, ETRAMP, Malarial early transcribed membrane protein
(ETRAMP). These sequences represent a family of
proteins from the malaria parasite Plasmodium
falciparum, several of which have been shown to be
expressed specifically in the ring stage as well as the
rodent parasite Plasmodium yoelii. A homologue from
Plasmodium chabaudi was localized to the
parasitophorous vacuole membrane. Members have an
initial hydrophobic, Phe/Tyr-rich, stretch long enough
to span the membrane, a highly charged region rich in
Lys, a second putative transmembrane region and a
second highly charged, low complexity sequence region.
Some members have up to 100 residues of additional
C-terminal sequence. These genes have been shown to be
found in the sub-telomeric regions of both Plasmodium
falciparum and P. yoelii chromosomes.
Length = 82
Score = 29.1 bits (66), Expect = 0.58
Identities = 14/29 (48%), Positives = 19/29 (65%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSS 29
K KK KK +KKKK KKK I+++ +S
Sbjct: 33 KLKKLKKIDDDLEKKKKNKKKIIISSIAS 61
Score = 27.6 bits (62), Expect = 2.6
Identities = 12/24 (50%), Positives = 15/24 (62%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGIL 24
KK KK KK +KKK+ KK I+
Sbjct: 32 KKLKKLKKIDDDLEKKKKNKKKII 55
>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family. The yeast member of this
family (Kri1p) is found to be required for 40S ribosome
biogenesis in the nucleolus.
Length = 99
Score = 29.5 bits (67), Expect = 0.67
Identities = 8/35 (22%), Positives = 23/35 (65%), Gaps = 6/35 (17%)
Query: 2 KKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDR 36
K++K+++K +++++ KR K +E+I+++
Sbjct: 1 KERKEEEKAQREEELKRLKN------LKREEIEEK 29
Score = 28.8 bits (65), Expect = 1.2
Identities = 8/34 (23%), Positives = 22/34 (64%), Gaps = 6/34 (17%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQ 34
K+++K +++++ K+ K K++ I +EK++
Sbjct: 4 KEEEKAQREEELKRLKNLKREEI------EEKLE 31
>gnl|CDD|236691 PRK10426, PRK10426, alpha-glucosidase; Provisional.
Length = 635
Score = 31.1 bits (71), Expect = 0.90
Identities = 22/98 (22%), Positives = 35/98 (35%), Gaps = 21/98 (21%)
Query: 160 TAHVRTHSGEKPFRCPICD------RRFSQSSSVTTHMRTHSGERPYRCRLCKKAFSDSS 213
T +RTH G +P D F++ + V T ++ PY L +A
Sbjct: 486 TPVMRTHEGNRPGDNWQFDSDAETIAHFARMTRVFTTLK------PYLKELVAEAAKTGL 539
Query: 214 TLTKHLRIHSGEKP--YQCKLCLLRFSQSGNLNRHMRV 249
+ + L +H + Y K L L R + V
Sbjct: 540 PVMRPLFLHYEDDAATYTLKYQYL-------LGRDLLV 570
Score = 30.0 bits (68), Expect = 1.9
Identities = 18/53 (33%), Positives = 26/53 (49%), Gaps = 8/53 (15%)
Query: 180 RFSQSSSVTTHMRTHSGERPYRCRLCKKAFSDSSTLTKHL----RIHSGEKPY 228
R+ + S+ T MRTH G RP + SD+ T+ H R+ + KPY
Sbjct: 478 RWCEFSAFTPVMRTHEGNRP---GDNWQFDSDAETI-AHFARMTRVFTTLKPY 526
>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355). This
family of proteins is found in bacteria and viruses.
Proteins in this family are typically between 180 and
214 amino acids in length.
Length = 125
Score = 29.5 bits (67), Expect = 0.92
Identities = 13/32 (40%), Positives = 22/32 (68%), Gaps = 3/32 (9%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
K+K K +KK+++KK + +K LA S++EK
Sbjct: 18 AKEKAKWEKKQEEKKSEAEK---LAKMSAEEK 46
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 31.2 bits (71), Expect = 0.96
Identities = 12/49 (24%), Positives = 18/49 (36%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSGMGKPPH 49
KK+ + KKKKK +KK S Q + + +P
Sbjct: 1310 VKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRK 1358
Score = 30.0 bits (68), Expect = 2.1
Identities = 13/52 (25%), Positives = 20/52 (38%)
Query: 2 KKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSGMGKPPHDVAD 53
K+++ K K K K K RK K K+ D+S+ D +
Sbjct: 1154 KEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDE 1205
Score = 29.2 bits (66), Expect = 3.6
Identities = 11/37 (29%), Positives = 18/37 (48%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRS 37
K KK+ + KKKK+ +K SK +++ S
Sbjct: 1309 KVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQAS 1345
Score = 28.9 bits (65), Expect = 4.9
Identities = 16/42 (38%), Positives = 22/42 (52%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGS 42
K K K +K K KKK++KKK A+ S K + S+ S
Sbjct: 1162 TKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDS 1203
Score = 28.1 bits (63), Expect = 8.1
Identities = 15/47 (31%), Positives = 19/47 (40%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSGMGKP 47
K +K K KKK+KKKKK S + S+ KP
Sbjct: 1167 SKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKP 1213
Score = 27.7 bits (62), Expect = 9.7
Identities = 14/37 (37%), Positives = 18/37 (48%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRS 37
+ K K K K K +K K KKK SS +K + S
Sbjct: 1157 RLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKAS 1193
>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
complex component). RNases H are enzymes that
specifically hydrolyse RNA when annealed to a
complementary DNA and are present in all living
organisms. In yeast RNase H2 is composed of a complex of
three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
family represents the homologues of Ydr279p. It is not
known whether non yeast proteins in this family fulfil
the same function.
Length = 287
Score = 30.4 bits (69), Expect = 1.0
Identities = 11/22 (50%), Positives = 12/22 (54%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKG 22
KKKK K+ K K KK KG
Sbjct: 258 IKKKKPKESKGVKALKKVVAKG 279
Score = 30.4 bits (69), Expect = 1.2
Identities = 10/21 (47%), Positives = 13/21 (61%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
K++ KKKK K+ K K KK
Sbjct: 254 SKEEIKKKKPKESKGVKALKK 274
Score = 29.6 bits (67), Expect = 1.7
Identities = 9/21 (42%), Positives = 13/21 (61%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
K K++ KKKK K+ K + K
Sbjct: 253 KSKEEIKKKKPKESKGVKALK 273
Score = 29.2 bits (66), Expect = 2.2
Identities = 10/21 (47%), Positives = 15/21 (71%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
KK+K K++ KKKK K+ + K
Sbjct: 250 KKRKSKEEIKKKKPKESKGVK 270
>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
Length = 213
Score = 30.3 bits (69), Expect = 1.0
Identities = 7/32 (21%), Positives = 16/32 (50%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
+K+K ++++K K K +KK + +
Sbjct: 126 GEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQ 157
Score = 27.6 bits (62), Expect = 8.0
Identities = 7/21 (33%), Positives = 14/21 (66%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
KK++ +K+K +++RK K
Sbjct: 119 AKKREAAGEKEKAPRRERKPK 139
>gnl|CDD|226535 COG4049, COG4049, Uncharacterized protein containing archaeal-type
C2H2 Zn-finger [General function prediction only].
Length = 65
Score = 28.0 bits (62), Expect = 1.0
Identities = 10/29 (34%), Positives = 13/29 (44%)
Query: 163 VRTHSGEKPFRCPICDRRFSQSSSVTTHM 191
VR GE+ RCP C F + H+
Sbjct: 9 VRDRDGEEFLRCPRCGMVFRRRKDYIRHV 37
>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
This is a family of fungal proteins whose function is
unknown.
Length = 130
Score = 29.5 bits (67), Expect = 1.1
Identities = 8/21 (38%), Positives = 18/21 (85%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
+K+K+ +K ++KK K+++K+K
Sbjct: 108 EKEKRTRKNREKKFKRRQKEK 128
Score = 29.2 bits (66), Expect = 1.2
Identities = 7/22 (31%), Positives = 17/22 (77%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKG 22
K++ +K+K+ +K ++KK K++
Sbjct: 104 KERAEKEKRTRKNREKKFKRRQ 125
Score = 29.2 bits (66), Expect = 1.5
Identities = 8/21 (38%), Positives = 17/21 (80%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
+K+K+ +K ++KK K+R+K+
Sbjct: 107 AEKEKRTRKNREKKFKRRQKE 127
Score = 28.8 bits (65), Expect = 1.6
Identities = 7/21 (33%), Positives = 17/21 (80%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
+++ K++ +K+K+ +K R+KK
Sbjct: 100 RERTKERAEKEKRTRKNREKK 120
Score = 28.8 bits (65), Expect = 2.0
Identities = 7/20 (35%), Positives = 17/20 (85%)
Query: 1 KKKKKKKKKKKKKKKKKRKK 20
K+ +K ++KK K+++K+++K
Sbjct: 111 KRTRKNREKKFKRRQKEKEK 130
Score = 28.4 bits (64), Expect = 2.3
Identities = 8/22 (36%), Positives = 17/22 (77%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKG 22
+ +K+K+ +K ++KK KR++K
Sbjct: 106 RAEKEKRTRKNREKKFKRRQKE 127
Score = 28.4 bits (64), Expect = 2.7
Identities = 8/21 (38%), Positives = 17/21 (80%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
+K+ +K ++KK K+++K K+K
Sbjct: 110 EKRTRKNREKKFKRRQKEKEK 130
Score = 26.9 bits (60), Expect = 9.3
Identities = 5/21 (23%), Positives = 15/21 (71%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
+ ++++ K++ +K+KR +K
Sbjct: 95 ALRLRRERTKERAEKEKRTRK 115
>gnl|CDD|226202 COG3677, COG3677, Transposase and inactivated derivatives [DNA
replication, recombination, and repair].
Length = 129
Score = 29.3 bits (66), Expect = 1.1
Identities = 21/90 (23%), Positives = 32/90 (35%), Gaps = 17/90 (18%)
Query: 154 SQAANLTAHVRTHSGEKPFRCPICDRRFSQSSSVTTHMRTHSGERPYRCRLCKKAFSD-- 211
A+ +R + CP C SS+V G + Y+C+ C F+
Sbjct: 15 IALADAAYAIRMQITKV--NCPRCK-----SSNVVKIGGIRRGHQRYKCKSCGSTFTVET 67
Query: 212 SSTLTKHLRIHSGEKPYQCKLCLLRFSQSG 241
S L+K L Y+ KL + G
Sbjct: 68 GSPLSKAL--------YKIKLQAVTLYMLG 89
>gnl|CDD|179886 PRK04860, PRK04860, hypothetical protein; Provisional.
Length = 160
Score = 29.9 bits (68), Expect = 1.1
Identities = 13/36 (36%), Positives = 17/36 (47%), Gaps = 4/36 (11%)
Query: 171 PFRCPICDRRFSQSSSVTTHMRTHSGERPYRCRLCK 206
P+RC + + + V H R GE YRCR C
Sbjct: 119 PYRCKCQEHQLT----VRRHNRVVRGEAVYRCRRCG 150
Score = 27.2 bits (61), Expect = 8.7
Identities = 16/50 (32%), Positives = 24/50 (48%), Gaps = 12/50 (24%)
Query: 192 RTHS-------GER-PYRCRLCKKAFSDSSTLTKHLRIHSGEKPYQCKLC 233
RTH G+ PYRC+ + T+ +H R+ GE Y+C+ C
Sbjct: 104 RTHQFEVQSVRGKTFPYRCKCQEHQL----TVRRHNRVVRGEAVYRCRRC 149
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 30.9 bits (69), Expect = 1.2
Identities = 3/32 (9%), Positives = 13/32 (40%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
K + + + + +++ + +K+K
Sbjct: 1348 AKAEAEAAADEAEAAEEKAEAAEKKKEEAKKK 1379
Score = 29.0 bits (64), Expect = 4.5
Identities = 7/36 (19%), Positives = 16/36 (44%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDR 36
KKK ++ KK + K + + A ++ ++
Sbjct: 1335 KKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAE 1370
Score = 27.8 bits (61), Expect = 9.4
Identities = 4/34 (11%), Positives = 11/34 (32%)
Query: 2 KKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQD 35
K + + + + + K + +K D
Sbjct: 1348 AKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKAD 1381
Score = 27.8 bits (61), Expect = 9.6
Identities = 10/31 (32%), Positives = 19/31 (61%)
Query: 2 KKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
+ ++K + +KKK++ KKK A ++EK
Sbjct: 1360 EAAEEKAEAAEKKKEEAKKKADAAKKKAEEK 1390
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 30.3 bits (69), Expect = 1.2
Identities = 12/59 (20%), Positives = 17/59 (28%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSGMGKPPHDVADVLLSLK 59
K KKK K + KK N+ K+ + + D D L
Sbjct: 85 AAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLD 143
Score = 28.8 bits (65), Expect = 4.0
Identities = 12/38 (31%), Positives = 15/38 (39%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
KK K K K K+K K L +S EK +
Sbjct: 77 KKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDK 114
Score = 28.8 bits (65), Expect = 4.5
Identities = 11/38 (28%), Positives = 17/38 (44%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
KKK K K K +KK +SS K + ++ +
Sbjct: 76 KKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALD 113
Score = 28.4 bits (64), Expect = 6.0
Identities = 10/32 (31%), Positives = 12/32 (37%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
KKK K K K KK + S +K
Sbjct: 74 IPKKKTKTAAKAAAAKAPAKKKLKDELDSSKK 105
>gnl|CDD|222466 pfam13945, NST1, Splicing factor, salt tolerance regulator. NST1
is a family of proteins that seem to be involved,
directly or indirectly, in the salt sensitivity of some
cellular functions in yeast. These proteins also
interact with the splicing factor Msl1p.
Length = 189
Score = 29.9 bits (67), Expect = 1.2
Identities = 10/28 (35%), Positives = 13/28 (46%)
Query: 7 KKKKKKKKKKKRKKKGILANSSSKEKIQ 34
K KKKKKK+ K A+ S +
Sbjct: 32 KSKKKKKKRSKATSPSHNASDQSTNNVM 59
>gnl|CDD|116890 pfam08307, Glyco_hydro_98C, Glycosyl hydrolase family 98 C-terminal
domain. This putative domain is found at the C-terminus
of glycosyl hydrolase family 98 proteins. This domain is
not expected to form part of the catalytic activity.
Length = 231
Score = 30.0 bits (67), Expect = 1.2
Identities = 17/80 (21%), Positives = 27/80 (33%), Gaps = 2/80 (2%)
Query: 90 TASLSYTVHPHQSSRGRPLNSGIRLILIQFIPGKTYARPSTLKTHLRTHSGEKPYRCEDC 149
SL+ T+ PH + + L + K R T+ L S Y
Sbjct: 138 CKSLNMTLGPHTYGVVNENADALSIYLNNYRTEKGPLRTDTIT--LSGASSTPTYSNVFG 195
Query: 150 NKSFSQAANLTAHVRTHSGE 169
++ QA+ +TA T
Sbjct: 196 DRGEYQASTVTATPSTDGLT 215
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 30.5 bits (69), Expect = 1.3
Identities = 13/21 (61%), Positives = 19/21 (90%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
K KKKKKK+KKK+++ KR++K
Sbjct: 744 KTKKKKKKEKKKEEEYKREEK 764
Score = 28.1 bits (63), Expect = 7.8
Identities = 12/23 (52%), Positives = 18/23 (78%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGI 23
KKKKKK+KKK+++ K+ +K I
Sbjct: 745 TKKKKKKEKKKEEEYKREEKARI 767
Score = 27.8 bits (62), Expect = 8.9
Identities = 11/21 (52%), Positives = 19/21 (90%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
++K KKKKKK+KKK+++ K++
Sbjct: 742 EEKTKKKKKKEKKKEEEYKRE 762
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 30.2 bits (68), Expect = 1.3
Identities = 12/38 (31%), Positives = 25/38 (65%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
+++K+K++ K++KKKKK K K + KE+ +++
Sbjct: 105 EEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRP 142
Score = 29.5 bits (66), Expect = 2.5
Identities = 13/38 (34%), Positives = 21/38 (55%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
K++KKKKK+K K++ K RK K ++ + E
Sbjct: 114 KEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKE 151
Score = 29.1 bits (65), Expect = 3.5
Identities = 8/32 (25%), Positives = 20/32 (62%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
+ K+++K+K++ K+++KKK K++
Sbjct: 100 NESGKEEEKEKEQVKEEKKKKKEKPKEEPKDR 131
Score = 28.7 bits (64), Expect = 4.1
Identities = 11/38 (28%), Positives = 23/38 (60%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
++K+K++ K++KKKKK++ K+ +E + R
Sbjct: 106 EEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPP 143
>gnl|CDD|179595 PRK03564, PRK03564, formate dehydrogenase accessory protein FdhE;
Provisional.
Length = 309
Score = 30.0 bits (68), Expect = 1.3
Identities = 15/50 (30%), Positives = 19/50 (38%), Gaps = 3/50 (6%)
Query: 156 AANLTAHVRTHSGEKPFRCPICDRRFSQSSSVTTHMRTHSGERPYRCRLC 205
A + R GE+ CP+C S S + T G R C LC
Sbjct: 172 AQQIPGKARAEYGEQRQFCPVCG---SMPVSSVVQIGTTQGLRYLHCNLC 218
>gnl|CDD|220121 pfam09135, Alb1, Alb1. Alb1 is a nuclear shuttling factor
involved in ribosome biogenesis.
Length = 112
Score = 29.0 bits (65), Expect = 1.4
Identities = 12/22 (54%), Positives = 19/22 (86%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKG 22
KK++KKK K+ +K++KR+KKG
Sbjct: 49 KKQRKKKPKRLTRKQRKRQKKG 70
Score = 26.3 bits (58), Expect = 9.1
Identities = 10/35 (28%), Positives = 18/35 (51%)
Query: 2 KKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDR 36
KKK++KKK K+ ++++K E + D
Sbjct: 46 VSKKKQRKKKPKRLTRKQRKRQKKGLERAEIVTDT 80
>gnl|CDD|220661 pfam10263, SprT-like, SprT-like family. This family represents a
domain found in eukaryotes and prokaryotes. The domain
contains a characteristic motif of the zinc
metallopeptidases. This family includes the bacterial
SprT protein.
Length = 153
Score = 29.3 bits (66), Expect = 1.4
Identities = 8/37 (21%), Positives = 12/37 (32%), Gaps = 8/37 (21%)
Query: 173 RCPICDRRFSQSSSVTTHMRTHSGERPYRCRLCKKAF 209
RC C + + + + H YRC C
Sbjct: 121 RCGSCGQLYPRKRRIRRHK--------YRCGRCGGKL 149
>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992). This
bacterial family of proteins has no known function.
However, the cis-regulatory yjdF motif, just upstream
from the gene encoding the proteins for this family, is
a small non-coding RNA, Rfam:RF01764. The yjdF motif is
found in many Firmicutes, including Bacillus subtilis.
In most cases, it resides in potential 5' UTRs of
homologues of the yjdF gene whose function is unknown.
However, in Streptococcus thermophilus, a yjdF RNA motif
is associated with an operon whose protein products
synthesise nicotinamide adenine dinucleotide (NAD+).
Also, the S. thermophilus yjdF RNA lacks typical yjdF
motif consensus features downstream of and including the
P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
S. thermophilus RNAs might sense a distinct compound
that structurally resembles the ligand bound by other
yjdF RNAs. On the ohter hand, perhaps these RNAs have an
alternative solution forming a similar binding site, as
is observed with some SAM riboswitches.
Length = 132
Score = 29.1 bits (66), Expect = 1.4
Identities = 9/22 (40%), Positives = 19/22 (86%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKG 22
K+++K++K++ K++KKK K +G
Sbjct: 110 KEEEKERKRQLKQQKKKAKHRG 131
>gnl|CDD|237276 PRK13024, PRK13024, bifunctional preprotein translocase subunit
SecD/SecF; Reviewed.
Length = 755
Score = 30.2 bits (69), Expect = 1.4
Identities = 11/21 (52%), Positives = 16/21 (76%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
+K++ KKKKK+KK KK +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753
Score = 29.0 bits (66), Expect = 3.4
Identities = 11/22 (50%), Positives = 16/22 (72%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKG 22
K++ KKKKK+KK KK + +K
Sbjct: 734 KRRLKKKKKRKKVKKWEVEKIE 755
>gnl|CDD|219547 pfam07741, BRF1, Brf1-like TBP-binding domain. This region
covers both the Brf homology II and III regions. This
region is involved in binding TATA binding protein.
Length = 95
Score = 28.4 bits (64), Expect = 1.5
Identities = 9/21 (42%), Positives = 13/21 (61%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
K + KKKKK+K +KK+
Sbjct: 40 KADEGNNSGKKKKKRKAKKKR 60
Score = 28.4 bits (64), Expect = 1.8
Identities = 9/22 (40%), Positives = 13/22 (59%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKG 22
K+K + KKKKK++ KK
Sbjct: 38 KQKADEGNNSGKKKKKRKAKKK 59
Score = 28.0 bits (63), Expect = 1.9
Identities = 12/29 (41%), Positives = 16/29 (55%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSS 29
+K + KKKKK+K KKK A +S
Sbjct: 39 QKADEGNNSGKKKKKRKAKKKRDEAGPAS 67
Score = 28.0 bits (63), Expect = 2.0
Identities = 10/21 (47%), Positives = 12/21 (57%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
K+K + KKKKKRK K
Sbjct: 37 LKQKADEGNNSGKKKKKRKAK 57
Score = 27.2 bits (61), Expect = 3.7
Identities = 8/21 (38%), Positives = 13/21 (61%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
K+ K+K + KKK+K+K
Sbjct: 35 KELKQKADEGNNSGKKKKKRK 55
Score = 26.4 bits (59), Expect = 8.2
Identities = 10/34 (29%), Positives = 18/34 (52%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQ 34
+ KKKKK+K K+K+ S++ E ++
Sbjct: 41 ADEGNNSGKKKKKRKAKKKRDEAGPASTAAEAVK 74
>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
Length = 813
Score = 30.5 bits (69), Expect = 1.5
Identities = 11/30 (36%), Positives = 15/30 (50%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSK 30
K K KK +K KK K + I A + +K
Sbjct: 775 AKPKAAKKDARKAKKPSAKTQKIAAATKAK 804
Score = 28.6 bits (64), Expect = 5.5
Identities = 12/33 (36%), Positives = 17/33 (51%), Gaps = 5/33 (15%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKI 33
+ +KK K K KK +K KK S+ +KI
Sbjct: 770 RGEKKAKPKAAKKDARKAKK-----PSAKTQKI 797
>gnl|CDD|213534 TIGR00486, YbgI_SA1388, dinuclear metal center protein, YbgI/SA1388
family. The characterization of this family of
uncharacterized proteins as orthologous is tentative.
Members are found in all three domains of life. Several
members (from Bacillus subtilis, Listeria monocytogenes,
and Mycobacterium tuberculosis - all classified as
Firmicutes within the Eubacteria) share a long insert
relative to other members [Unknown function, General].
Length = 249
Score = 29.7 bits (67), Expect = 1.5
Identities = 12/62 (19%), Positives = 23/62 (37%), Gaps = 5/62 (8%)
Query: 12 KKKKKKRKKKGILANSSSKEKIQDRSECPGSGMGKPPHDV---ADVLLS--LKHAVVHPG 66
+ KK K +L + E ++ + GSG+ + D ++ L H H
Sbjct: 146 LEIKKVLNVKPLLVVKNGPEYVKKVAVVSGSGLSFIMKALREGVDAYITGDLSHHTAHLA 205
Query: 67 QM 68
+
Sbjct: 206 RE 207
>gnl|CDD|193409 pfam12936, Kri1_C, KRI1-like family C-terminal. The yeast member
of this family (Kri1p) is found to be required for 40S
ribosome biogenesis in the nucleolus. This is the
C-terminal domain of the family.
Length = 93
Score = 28.3 bits (64), Expect = 1.6
Identities = 8/24 (33%), Positives = 15/24 (62%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGIL 24
++++K KKK KK + + KK +
Sbjct: 70 EERRKDKKKYGKKARLREWKKKVF 93
Score = 27.2 bits (61), Expect = 4.1
Identities = 9/21 (42%), Positives = 15/21 (71%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
K++++K KKK KK + R+ K
Sbjct: 69 KEERRKDKKKYGKKARLREWK 89
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 29.9 bits (68), Expect = 1.6
Identities = 9/31 (29%), Positives = 24/31 (77%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKE 31
+++++++ ++KK++KKK +++ LA S +E
Sbjct: 277 EEERQEEAQEKKEEKKKEEREAKLAKLSPEE 307
>gnl|CDD|220129 pfam09159, Ydc2-catalyt, Mitochondrial resolvase Ydc2 / RNA
splicing MRS1. Members of this family adopt a secondary
structure consisting of two beta sheets and one alpha
helix, arranged as a beta-alpha-beta motif. Each beta
sheet has five strands, arranged in a 32145 order, with
the second strand being antiparallel to the rest.
Mitochondrial resolvase Ydc2 is capable of resolving
Holliday junctions and cleaves DNA after 5'-CT-3' and
5'-TT-3' sequences. This family also contains the
mitochondrial RNA-splicing protein MRS1 which is
involved in the excision of group I introns.
Length = 254
Score = 29.6 bits (67), Expect = 1.7
Identities = 12/31 (38%), Positives = 15/31 (48%), Gaps = 2/31 (6%)
Query: 1 KKKKKKKKKKKKKKKKKRKK--KGILANSSS 29
KKK + KKK K KK R L+ + S
Sbjct: 155 KKKPRSKKKSSKNSKKLRIDLVGEWLSTAIS 185
>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
eukaryotic RNase H complex which cleaves RNA-DNA
hybrids. Ribonuclease H2B is one of the three proteins
of eukaryotic RNase H2 complex that is required for
nucleic acid binding and hydrolysis. RNase H is
classified into two families, type I (prokaryotic RNase
HI, eukaryotic RNase H1 and viral RNase H) and type II
(prokaryotic RNase HII and HIII, and eukaryotic RNase
H2/HII). RNase H endonucleolytically hydrolyzes an RNA
strand when it is annealed to a complementary DNA strand
in the presence of divalent cations, in DNA replication
and repair. The enzyme can be found in bacteria,
archaea, and eukaryotes. Most prokaryotic and eukaryotic
genomes contain multiple RNase H genes. Despite a lack
of evidence for homology from sequence comparisons, type
I and type II RNase H share a common fold and similar
steric configurations of the four acidic active-site
residues, suggesting identical or very similar catalytic
mechanisms. Eukaryotic RNase HII is active during
replication and is believed to play a role in removal of
Okazaki fragment primers and single ribonucleotides in
DNA-DNA duplexes. Eukaryotic RNase HII is functional
when it forms a complex with RNase H2B and RNase H2C
proteins. It is speculated that the two accessory
subunits are required for correct folding of the
catalytic subunit of RNase HII. Mutations in the three
subunits of human RNase HII cause neurological disorder.
Length = 211
Score = 29.6 bits (67), Expect = 1.7
Identities = 14/22 (63%), Positives = 15/22 (68%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKG 22
+KKKKKKK KKKK KK G
Sbjct: 181 EKKKKKKKSAKKKKLKKVAAVG 202
Score = 28.0 bits (63), Expect = 5.6
Identities = 11/21 (52%), Positives = 11/21 (52%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
KKKK KKKK KK K
Sbjct: 185 KKKKSAKKKKLKKVAAVGMKA 205
Score = 27.3 bits (61), Expect = 8.8
Identities = 13/21 (61%), Positives = 16/21 (76%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
+++KKKKKKK KKKK KK
Sbjct: 178 AEEEKKKKKKKSAKKKKLKKV 198
>gnl|CDD|219978 pfam08701, GN3L_Grn1, GNL3L/Grn1 putative GTPase. Grn1 (yeast)
and GNL3L (human) are putative GTPases which are
required for growth and play a role in processing of
nucleolar pre-rRNA. This family contains a potential
nuclear localisation signal.
Length = 80
Score = 28.0 bits (63), Expect = 1.8
Identities = 13/35 (37%), Positives = 19/35 (54%), Gaps = 2/35 (5%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK--GILANSSSKEKI 33
+K +K+ KK K +KKK GI + KE+I
Sbjct: 13 HHRKLRKEAKKNPTWKSKKKKDPGIPNSFPFKEEI 47
>gnl|CDD|214659 smart00433, TOP2c, TopoisomeraseII. Eukaryotic DNA topoisomerase
II, GyrB, ParE.
Length = 594
Score = 29.8 bits (68), Expect = 1.8
Identities = 10/40 (25%), Positives = 17/40 (42%), Gaps = 6/40 (15%)
Query: 2 KKKKKKKKKKKKKKKKRKKKGI--LANSSSKEKIQDRSEC 39
+ KK ++ +KKK LA++SS +C
Sbjct: 347 RAAAKKARELTRKKKLSSISLPGKLADASSA----GPKKC 382
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This
family consists of several bovine specific leukaemia
virus receptors which are thought to function as
transmembrane proteins, although their exact function is
unknown.
Length = 561
Score = 30.0 bits (67), Expect = 1.8
Identities = 13/21 (61%), Positives = 18/21 (85%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
KK KKK+KK+K+K++ K KKK
Sbjct: 200 KKPKKKEKKEKEKERDKDKKK 220
Score = 28.9 bits (64), Expect = 3.7
Identities = 22/71 (30%), Positives = 30/71 (42%), Gaps = 4/71 (5%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSGMGKPPHDVADVLLSLKH 60
KK KK KKK+KK+K+K+R K K + + P S D A SL +
Sbjct: 197 KKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDSPASAASVAEADEA----SLAN 252
Query: 61 AVVHPGQMTPP 71
V + P
Sbjct: 253 TVSGTAPDSEP 263
Score = 28.5 bits (63), Expect = 6.1
Identities = 8/21 (38%), Positives = 17/21 (80%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
+K K++KKK++K+K+ +R+
Sbjct: 95 EKDKREKKKREKEKRGRRRHH 115
Score = 28.1 bits (62), Expect = 7.4
Identities = 8/29 (27%), Positives = 21/29 (72%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSS 29
+ +++ +K K++KKK++++K+G + S
Sbjct: 89 RHRQRLEKDKREKKKREKEKRGRRRHHSL 117
Score = 27.7 bits (61), Expect = 9.0
Identities = 10/30 (33%), Positives = 21/30 (70%)
Query: 2 KKKKKKKKKKKKKKKKRKKKGILANSSSKE 31
+K K++KKK++K+K+ R++ L S ++
Sbjct: 95 EKDKREKKKREKEKRGRRRHHSLGTESDED 124
>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe. JAK1 is a
tyrosine kinase protein essential in signaling type I
and type II cytokines. It interacts with the gamma chain
of type I cytokine receptors to elicit signals from the
IL-2 receptor family, the IL-4 receptor family, the
gp130 receptor family, ciliary neurotrophic factor
receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
Leptin-R). It also is involved in transducing a signal
by type I (IFN-alpha/beta) and type II (IFN-gamma)
interferons, and members of the IL-10 family via type II
cytokine receptors. JAK (also called Just Another
Kinase) is a family of intracellular, non-receptor
tyrosine kinases that transduce cytokine-mediated
signals via the JAK-STAT pathway. The JAK family in
mammals consists of 4 members: JAK1, JAK2, JAK3 and
TYK2. JAKs are composed of seven JAK homology (JH)
domains (JH1-JH7) . The C-terminal JH1 domain is the
main catalytic domain, followed by JH2, which is often
referred to as a pseudokinase domain, followed by
JH3-JH4 which is homologous to the SH2 domain, and
lastly JH5-JH7 which is a FERM domain. Named after
Janus, the two-faced Roman god of doorways, JAKs possess
two near-identical phosphate-transferring domains; one
which displays the kinase activity (JH1), while the
other negatively regulates the kinase activity of the
first (JH2). The FERM domain has a cloverleaf tripart
structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
C-lobe/F3 within the FERM domain is part of the PH
domain family. The FERM domain is found in the
cytoskeletal-associated proteins such as ezrin, moesin,
radixin, 4.1R, and merlin. These proteins provide a link
between the membrane and cytoskeleton and are involved
in signal transduction pathways. The FERM domain is also
found in protein tyrosine phosphatases (PTPs) , the
tyrosine kinases FAK and JAK, in addition to other
proteins involved in signaling. This domain is
structurally similar to the PH and PTB domains and
consequently is capable of binding to both peptides and
phospholipids at different sites.
Length = 198
Score = 29.4 bits (66), Expect = 1.9
Identities = 13/21 (61%), Positives = 13/21 (61%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
KKKK K KK K K KK KK
Sbjct: 91 KKKKGKSKKNKLKGKKDEDKK 111
Score = 29.0 bits (65), Expect = 2.7
Identities = 13/21 (61%), Positives = 14/21 (66%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
KKK K KK K K KK + KKK
Sbjct: 92 KKKGKSKKNKLKGKKDEDKKK 112
Score = 28.3 bits (63), Expect = 4.2
Identities = 11/21 (52%), Positives = 13/21 (61%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
KK K KK K K KK + +KK
Sbjct: 93 KKGKSKKNKLKGKKDEDKKKA 113
Score = 27.9 bits (62), Expect = 6.0
Identities = 12/21 (57%), Positives = 13/21 (61%)
Query: 2 KKKKKKKKKKKKKKKKRKKKG 22
KKKK K KK K K KK + K
Sbjct: 91 KKKKGKSKKNKLKGKKDEDKK 111
>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62.
Length = 217
Score = 29.4 bits (66), Expect = 2.0
Identities = 13/19 (68%), Positives = 13/19 (68%)
Query: 1 KKKKKKKKKKKKKKKKKRK 19
K KK KK KKKKKKKR
Sbjct: 199 KSSAKKDKKSKKKKKKKRS 217
>gnl|CDD|220231 pfam09420, Nop16, Ribosome biogenesis protein Nop16. Nop16 is a
protein involved in ribosome biogenesis.
Length = 173
Score = 28.9 bits (65), Expect = 2.1
Identities = 8/22 (36%), Positives = 13/22 (59%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKG 22
+KKKK + K +K+ K+K
Sbjct: 2 RKKKKNRSSNYKVNRKRLKRKD 23
Score = 28.2 bits (63), Expect = 4.1
Identities = 8/23 (34%), Positives = 13/23 (56%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGI 23
K + K +K+ K+K +KK I
Sbjct: 6 KNRSSNYKVNRKRLKRKDRKKKI 28
Score = 27.0 bits (60), Expect = 9.5
Identities = 8/21 (38%), Positives = 12/21 (57%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
KKK + K +K+ KRK +
Sbjct: 4 KKKNRSSNYKVNRKRLKRKDR 24
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 29.6 bits (67), Expect = 2.1
Identities = 13/34 (38%), Positives = 22/34 (64%), Gaps = 5/34 (14%)
Query: 4 KKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRS 37
KK+KKK+K+K++ K +G+ KEK +D+
Sbjct: 627 KKRKKKRKRKRRFLTKIEGVK-----KEKRKDKK 655
Score = 28.5 bits (64), Expect = 5.5
Identities = 9/21 (42%), Positives = 15/21 (71%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
K+KKK+K+K++ K + KK
Sbjct: 628 KRKKKRKRKRRFLTKIEGVKK 648
Score = 28.1 bits (63), Expect = 7.2
Identities = 13/38 (34%), Positives = 16/38 (42%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
K K KKKKKK+K I S K + D +
Sbjct: 529 KAAVKVKKKKKKEKSIDLDDDLIDEEDSIKLDVDDEED 566
Score = 27.7 bits (62), Expect = 9.2
Identities = 13/28 (46%), Positives = 19/28 (67%), Gaps = 7/28 (25%)
Query: 1 KKKKKKKKKKKKK-------KKKKRKKK 21
KK+KKK+K+K++ KK+KRK K
Sbjct: 627 KKRKKKRKRKRRFLTKIEGVKKEKRKDK 654
>gnl|CDD|205121 pfam12874, zf-met, Zinc-finger of C2H2 type. This is a zinc-finger
domain with the CxxCx(12)Hx(6)H motif, found in multiple
copies in a wide range of proteins from plants to
metazoans. Some member proteins, particularly those from
plants, are annotated as being RNA-binding.
Length = 25
Score = 26.3 bits (59), Expect = 2.2
Identities = 9/21 (42%), Positives = 11/21 (52%)
Query: 200 YRCRLCKKAFSDSSTLTKHLR 220
+ C LC F+ S L HLR
Sbjct: 1 FYCELCNVTFTSESQLKSHLR 21
>gnl|CDD|184535 PRK14144, PRK14144, heat shock protein GrpE; Provisional.
Length = 199
Score = 29.3 bits (65), Expect = 2.2
Identities = 15/60 (25%), Positives = 28/60 (46%)
Query: 5 KKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSGMGKPPHDVADVLLSLKHAVVH 64
K+ KK KK K + K++ + N +E+ + S+ +G P + + L+L H
Sbjct: 3 KQNKKDWKKFKDEHKEEHKVENEILEEETDEESQHQEPALGHPSYTALEEQLTLAEQKAH 62
>gnl|CDD|238428 cd00832, CLF, Chain-length factor (CLF) is a factor required for
polyketide chain initiation of aromatic
antibiotic-producing polyketide synthases (PKSs) of
filamentous bacteria. CLFs have been shown to have
decarboxylase activity towards malonyl-acyl carrier
protein (ACP). CLFs are similar to other elongation
ketosynthase domains, but their active site cysteine is
replaced by a conserved glutamine.
Length = 399
Score = 29.6 bits (67), Expect = 2.2
Identities = 12/33 (36%), Positives = 16/33 (48%), Gaps = 1/33 (3%)
Query: 45 GKPPHDVADVLLSLKHAVVHP-GQMTPPPYGDP 76
G P DVA LL+L+ V+ P +T P
Sbjct: 335 GGAPLDVATALLALRDGVIPPTVNVTDVPPAYG 367
>gnl|CDD|221049 pfam11262, Tho2, Transcription factor/nuclear export subunit
protein 2. THO and TREX form a eukaryotic complex
which functions in messenger ribonucleoprotein
metabolism and plays a role in preventing the
transcription-associated genetic instability. Tho2,
along with four other subunits forms THO.
Length = 296
Score = 29.6 bits (67), Expect = 2.2
Identities = 13/38 (34%), Positives = 22/38 (57%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
KKK+KK+ K KK ++ KK I N +K+++ +
Sbjct: 53 KKKEKKRLKSLIKKLEEELKKHIEHNEKTKKRLSEEKS 90
>gnl|CDD|219913 pfam08576, DUF1764, Eukaryotic protein of unknown function
(DUF1764). This is a family of eukaryotic proteins of
unknown function. This family contains many
hypothetical proteins.
Length = 98
Score = 27.8 bits (62), Expect = 2.3
Identities = 13/32 (40%), Positives = 17/32 (53%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
KK+KKKKK+ K + K KKG + E
Sbjct: 22 KKRKKKKKRTAKTARPKATKKGQKKDKKKDEF 53
>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
Length = 742
Score = 29.8 bits (67), Expect = 2.3
Identities = 12/37 (32%), Positives = 17/37 (45%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRS 37
K K +K KKK K KKK + +S + D +
Sbjct: 119 KPKTEKLKKKITVNKSTNKKKKKVLSSKDELIKYDNN 155
Score = 29.4 bits (66), Expect = 2.6
Identities = 8/39 (20%), Positives = 11/39 (28%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSEC 39
K KKKKK + + N+ K
Sbjct: 129 ITVNKSTNKKKKKVLSSKDELIKYDNNKPKSISIHSPLT 167
>gnl|CDD|216447 pfam01346, FKBP_N, Domain amino terminal to FKBP-type
peptidyl-prolyl isomerase. This family is only found at
the amino terminus of pfam00254. This domain is of
unknown function.
Length = 124
Score = 28.3 bits (64), Expect = 2.3
Identities = 12/36 (33%), Positives = 22/36 (61%), Gaps = 2/36 (5%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGI--LANSSSKEKIQ 34
KK + K++ K +KK ++ K G LA ++ KE ++
Sbjct: 79 KKLQAKQQAKAEKKAEENKAAGEAFLAENAKKEGVK 114
>gnl|CDD|219783 pfam08297, U3_snoRNA_assoc, U3 snoRNA associated. This family of
proteins is associated with U3 snoRNA. U3 snoRNA is
required for nucleolar processing of pre-18S ribosomal
RNA.
Length = 87
Score = 27.8 bits (62), Expect = 2.4
Identities = 11/22 (50%), Positives = 14/22 (63%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKG 22
KKK K + KK +K+ K KKG
Sbjct: 32 KKKNKLRFLKKLEKRPKDVKKG 53
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 29.7 bits (67), Expect = 2.4
Identities = 14/32 (43%), Positives = 19/32 (59%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
+KKKKKKK+KK+ K + K G SK+
Sbjct: 71 EKKKKKKKEKKEPKSEGETKLGFKTPKKSKKT 102
Score = 28.9 bits (65), Expect = 4.2
Identities = 14/33 (42%), Positives = 18/33 (54%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKI 33
KKKKKKK+KK+ K + + K S K K
Sbjct: 72 KKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKK 104
>gnl|CDD|240254 PTZ00069, PTZ00069, 60S ribosomal protein L5; Provisional.
Length = 300
Score = 29.3 bits (66), Expect = 2.4
Identities = 18/40 (45%), Positives = 20/40 (50%), Gaps = 2/40 (5%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILAN--SSSKEKIQDRSE 38
K KKKKKKKKK KK + KK + K K R E
Sbjct: 254 KVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRRE 293
>gnl|CDD|234707 PRK00270, rpsU, 30S ribosomal protein S21; Reviewed.
Length = 64
Score = 27.1 bits (61), Expect = 2.4
Identities = 7/19 (36%), Positives = 14/19 (73%)
Query: 1 KKKKKKKKKKKKKKKKKRK 19
K+K+KK +K+++KK +
Sbjct: 44 KRKRKKAAARKRRRKKLAR 62
Score = 26.8 bits (60), Expect = 2.9
Identities = 9/21 (42%), Positives = 15/21 (71%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
K +K+K+KK +K++RKK
Sbjct: 40 KPSEKRKRKKAAARKRRRKKL 60
Score = 26.4 bits (59), Expect = 4.4
Identities = 7/19 (36%), Positives = 14/19 (73%)
Query: 2 KKKKKKKKKKKKKKKKRKK 20
K+K+KK +K+++KK +
Sbjct: 44 KRKRKKAAARKRRRKKLAR 62
Score = 26.0 bits (58), Expect = 6.1
Identities = 6/19 (31%), Positives = 14/19 (73%)
Query: 3 KKKKKKKKKKKKKKKRKKK 21
K+K+KK +K+++K+ +
Sbjct: 44 KRKRKKAAARKRRRKKLAR 62
>gnl|CDD|240317 PTZ00217, PTZ00217, flap endonuclease-1; Provisional.
Length = 393
Score = 29.6 bits (67), Expect = 2.5
Identities = 12/32 (37%), Positives = 15/32 (46%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
KK K K KKKKKK + +S + K
Sbjct: 353 IKKSNSKAKLKKKKKKAGASAVPKSETSQEAK 384
Score = 28.4 bits (64), Expect = 5.5
Identities = 16/43 (37%), Positives = 20/43 (46%), Gaps = 5/43 (11%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSG 43
KK KK K K KKKK+K +S+ K + E SG
Sbjct: 350 KKPIKKSNSKAKLKKKKKKA-----GASAVPKSETSQEAKSSG 387
>gnl|CDD|190760 pfam03811, Zn_Tnp_IS1, InsA N-terminal domain. This appears to be
a short zinc binding domain found in IS1 InsA family
protein. It is found at the N-terminus of the protein
and may be a DNA-binding domain.
Length = 36
Score = 26.3 bits (58), Expect = 2.5
Identities = 12/33 (36%), Positives = 18/33 (54%), Gaps = 4/33 (12%)
Query: 174 CPICDRRFSQSSSVTTHMRTHSGERPYRCRLCK 206
CP C S + V H ++ +G + YRCR C+
Sbjct: 8 CPRC----SSTEGVYRHGQSTAGHQRYRCRHCR 36
>gnl|CDD|197903 smart00834, CxxC_CXXC_SSSS, Putative regulatory protein.
CxxC_CXXC_SSSS represents a region of about 41 amino
acids found in a number of small proteins in a wide
range of bacteria. The region usually begins with the
initiator Met and contains two CxxC motifs separated by
17 amino acids. One protein in this entry has been noted
as a putative regulatory protein, designated FmdB. Most
proteins in this entry have a C-terminal region
containing highly degenerate sequence.
Length = 41
Score = 26.4 bits (59), Expect = 2.7
Identities = 10/34 (29%), Positives = 12/34 (35%), Gaps = 7/34 (20%)
Query: 144 YRCEDCNKSFSQAANLTAHVRTHSGEKPFRCPIC 177
YRCEDC +F + CP C
Sbjct: 6 YRCEDCGHTF------EVLQKISDDPLTT-CPEC 32
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 29.5 bits (66), Expect = 2.8
Identities = 22/80 (27%), Positives = 33/80 (41%), Gaps = 14/80 (17%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSGMGKPPHDVADVLLSLKH 60
KKKK++K K+K+ KK K +K A +++ D + P K S K
Sbjct: 21 KKKKEEKAKEKELKKLKAAQKEAKAKLQAQQA-SDGTNVPKKSEKK----------SRKR 69
Query: 61 AVVHPGQMTPPPYGDPGAPY 80
V P + DP P+
Sbjct: 70 DV---EDENPEDFIDPDTPF 86
Score = 28.3 bits (63), Expect = 5.7
Identities = 15/45 (33%), Positives = 27/45 (60%)
Query: 2 KKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSGMGK 46
++ ++KKKK++K K+K KK A +K K+Q + G+ + K
Sbjct: 16 EELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPK 60
>gnl|CDD|206083 pfam13912, zf-C2H2_6, C2H2-type zinc finger.
Length = 27
Score = 26.0 bits (58), Expect = 2.9
Identities = 8/23 (34%), Positives = 12/23 (52%)
Query: 144 YRCEDCNKSFSQAANLTAHVRTH 166
+ C C K+FS L H ++H
Sbjct: 2 HTCGVCGKTFSSLQALGGHKKSH 24
Score = 24.9 bits (55), Expect = 7.5
Identities = 8/23 (34%), Positives = 11/23 (47%)
Query: 200 YRCRLCKKAFSDSSTLTKHLRIH 222
+ C +C K FS L H + H
Sbjct: 2 HTCGVCGKTFSSLQALGGHKKSH 24
Score = 24.5 bits (54), Expect = 8.6
Identities = 6/23 (26%), Positives = 12/23 (52%)
Query: 172 FRCPICDRRFSQSSSVTTHMRTH 194
C +C + FS ++ H ++H
Sbjct: 2 HTCGVCGKTFSSLQALGGHKKSH 24
>gnl|CDD|173534 PTZ00341, PTZ00341, Ring-infected erythrocyte surface antigen;
Provisional.
Length = 1136
Score = 29.4 bits (65), Expect = 2.9
Identities = 19/58 (32%), Positives = 33/58 (56%), Gaps = 2/58 (3%)
Query: 7 KKKKKKKKKKKRKKKGIL-ANSSSKEKIQDRSECPGSGMGKPP-HDVADVLLSLKHAV 62
KK+ +K KKKK K++G L E ++ + E P +G+ ++ D+L SLK ++
Sbjct: 327 KKRAEKPKKKKSKRRGWLCCGGGDIETVEPQQEEPVQDVGEHQINEYGDILPSLKASI 384
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 29.0 bits (65), Expect = 3.1
Identities = 7/22 (31%), Positives = 17/22 (77%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKG 22
+KK+K++ + K+K++K ++K
Sbjct: 413 EKKQKEQADEDKEKRQKDERKK 434
>gnl|CDD|189563 pfam00475, IGPD, Imidazoleglycerol-phosphate dehydratase.
Length = 145
Score = 28.2 bits (64), Expect = 3.3
Identities = 8/19 (42%), Positives = 11/19 (57%)
Query: 151 KSFSQAANLTAHVRTHSGE 169
+SF+ A +T HVR G
Sbjct: 109 RSFAFNAGITLHVRVLYGR 127
>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein. This family includes proteins
related to Mpp10 (M phase phosphoprotein 10). The U3
small nucleolar ribonucleoprotein (snoRNP) is required
for three cleavage events that generate the mature 18S
rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
depletion of Mpp10, a U3 snoRNP-specific protein, halts
18S rRNA production and impairs cleavage at the three U3
snoRNP-dependent sites.
Length = 613
Score = 29.2 bits (65), Expect = 3.3
Identities = 11/32 (34%), Positives = 18/32 (56%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
K ++++KKK+K++ KK K KEK
Sbjct: 559 KNRERRKKKRKRRAAKKAVTKAKKERKIGKEK 590
>gnl|CDD|240576 cd12932, RRP7_like, RRP7 domain ribosomal RNA-processing protein 7
(Rrp7p), ribosomal RNA-processing protein 7 homolog A
(Rrp7A), and similar proteins. This CD corresponds to
the RRP7 domain of Rrp7p and Rrp7A. Rrp7p is encoded by
YCL031C gene from Saccharomyces cerevisiae. It is an
essential yeast protein involved in pre-rRNA processing
and ribosome assembly, and is speculated to be required
for correct assembly of rpS27 into the pre-ribosomal
particle. Rrp7A, also termed gastric cancer antigen
Zg14, is the Rrp7p homolog mainly found in Metazoans.
The cellular function of Rrp7A remains unclear
currently. Both Rrp7p and Rrp7A harbor an N-terminal RNA
recognition motif (RRM), also termed RBD (RNA binding
domain) or RNP (ribonucleoprotein domain), and a
C-terminal RRP7 domain.
Length = 118
Score = 27.6 bits (62), Expect = 3.4
Identities = 15/42 (35%), Positives = 23/42 (54%), Gaps = 15/42 (35%)
Query: 1 KKKKKKKKKKKKKKKKK-------RKKKGILANSSSKEKIQD 35
K K+K+KKKKKKK+ + R+KK KE++ +
Sbjct: 73 AKAKEKEKKKKKKKELEDFYRFQIREKK--------KEELAE 106
Score = 27.2 bits (61), Expect = 6.0
Identities = 11/33 (33%), Positives = 18/33 (54%), Gaps = 7/33 (21%)
Query: 1 KKKKKKK-------KKKKKKKKKKRKKKGILAN 26
+ +K K + K K+K+KK+KKK L +
Sbjct: 58 RGGRKGKTAREEAVEAKAKEKEKKKKKKKELED 90
>gnl|CDD|217553 pfam03430, TATR, Trans-activating transcriptional regulator. This
family of trans-activating transcriptional regulator
(TATR), also known as intermediate early protein 1, are
common to the Nucleopolyhedroviruses.
Length = 571
Score = 29.1 bits (65), Expect = 3.4
Identities = 13/41 (31%), Positives = 19/41 (46%)
Query: 4 KKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSGM 44
K KK K KK++K KK I + +S E+ +E
Sbjct: 143 KGKKLVNKPKKRQKYKKATIQSPTSLTEEENYNTEICTVAP 183
>gnl|CDD|219621 pfam07890, Rrp15p, Rrp15p. Rrp15p is required for the formation
of 60S ribosomal subunits.
Length = 132
Score = 28.1 bits (63), Expect = 3.5
Identities = 11/25 (44%), Positives = 16/25 (64%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILA 25
KK K KKK K +K +K+ K+ + A
Sbjct: 28 KKLLKAKKKLKSEKLEKKAKRQLRA 52
Score = 26.6 bits (59), Expect = 9.2
Identities = 10/35 (28%), Positives = 20/35 (57%), Gaps = 4/35 (11%)
Query: 2 KKKKKKKKKKKKKKKKRKKKG----ILANSSSKEK 32
K +KK K++ + +K++ +KG +L KE+
Sbjct: 41 KLEKKAKRQLRAEKRQALEKGRVKPVLPEDLEKER 75
>gnl|CDD|176075 cd08693, C2_PI3K_class_I_beta_delta, C2 domain present in class I
beta and delta phosphatidylinositol 3-kinases (PI3Ks).
PI3Ks (AKA phosphatidylinositol (PtdIns) 3-kinases)
regulate cell processes such as cell growth,
differentiation, proliferation, and motility. PI3Ks
work on phosphorylation of phosphatidylinositol,
phosphatidylinositide (4)P (PtdIns (4)P),2 or
PtdIns(4,5)P2. Specifically they phosphorylate the D3
hydroxyl group of phosphoinositol lipids on the inositol
ring. There are 3 classes of PI3Ks based on structure,
regulation, and specificity. All classes contain a C2
domain, a PIK domain, and a kinase catalytic domain.
The members here are class I, beta and delta isoforms of
PI3Ks and contain both a Ras-binding domain and a
p85-binding domain. Class II PI3Ks contain both of
these as well as a PX domain, and a C-terminal C2 domain
containing a nuclear localization signal. C2 domains
fold into an 8-standed beta-sandwich that can adopt 2
structural arrangements: Type I and Type II,
distinguished by a circular permutation involving their
N- and C-terminal beta strands. Many C2 domains are
Ca2+-dependent membrane-targeting modules that bind a
wide variety of substances including bind phospholipids,
inositol polyphosphates, and intracellular proteins.
Most C2 domain proteins are either signal transduction
enzymes that contain a single C2 domain, such as protein
kinase C, or membrane trafficking proteins which contain
at least two C2 domains, such as synaptotagmin 1.
However, there are a few exceptions to this including
RIM isoforms and some splice variants of piccolo/aczonin
and intersectin which only have a single C2 domain. C2
domains with a calcium binding region have negatively
charged residues, primarily aspartates, that serve as
ligands for calcium ions. Members have a type-I
topology.
Length = 173
Score = 28.4 bits (64), Expect = 3.5
Identities = 10/19 (52%), Positives = 14/19 (73%)
Query: 1 KKKKKKKKKKKKKKKKKRK 19
KK K K+ +K + KKKK+K
Sbjct: 87 KKAKGKRSRKNQTKKKKKK 105
Score = 27.7 bits (62), Expect = 6.8
Identities = 10/23 (43%), Positives = 14/23 (60%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGI 23
KK K K+ +K + KK+KKK
Sbjct: 85 VSKKAKGKRSRKNQTKKKKKKDD 107
>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4;
Provisional.
Length = 191
Score = 28.6 bits (64), Expect = 3.5
Identities = 12/33 (36%), Positives = 14/33 (42%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKI 33
KKKK +KKK KK +KKK
Sbjct: 51 SKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAE 83
>gnl|CDD|192292 pfam09429, Wbp11, WW domain binding protein 11. The WW domain is
a small protein module with a triple-stranded
beta-sheet fold. This is a family of WW domain binding
proteins.
Length = 78
Score = 26.9 bits (60), Expect = 3.6
Identities = 8/37 (21%), Positives = 20/37 (54%), Gaps = 2/37 (5%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILAN--SSSKEKIQD 35
K++KKK+ KK K +++ R++ + + ++
Sbjct: 12 KEQKKKELKKNKAERQARREAKLAKKNPDRLQRELDK 48
Score = 26.1 bits (58), Expect = 8.2
Identities = 8/18 (44%), Positives = 14/18 (77%)
Query: 3 KKKKKKKKKKKKKKKRKK 20
+K++KKK+ KK K +R+
Sbjct: 11 RKEQKKKELKKNKAERQA 28
>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
Validated.
Length = 860
Score = 29.0 bits (65), Expect = 3.7
Identities = 11/30 (36%), Positives = 15/30 (50%)
Query: 3 KKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
+KK K KKK K + K + S +K K
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSK 777
Score = 28.6 bits (64), Expect = 4.8
Identities = 11/31 (35%), Positives = 14/31 (45%)
Query: 2 KKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
K KKK K K K KKK + + +K
Sbjct: 752 KAKKKSASTKGKAAKTVKKKSKAKSKKTTKK 782
Score = 27.9 bits (62), Expect = 8.8
Identities = 10/32 (31%), Positives = 16/32 (50%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEK 32
K KKK K K K +KK + ++K++
Sbjct: 752 KAKKKSASTKGKAAKTVKKKSKAKSKKTTKKR 783
Score = 27.9 bits (62), Expect = 9.2
Identities = 9/31 (29%), Positives = 13/31 (41%)
Query: 4 KKKKKKKKKKKKKKRKKKGILANSSSKEKIQ 34
+KK K KKK K K + S + +
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSKK 778
>gnl|CDD|221569 pfam12420, DUF3671, Protein of unknown function. This domain
family is found in eukaryotes, and is typically between
96 and 116 amino acids in length.
Length = 106
Score = 27.7 bits (62), Expect = 3.8
Identities = 11/24 (45%), Positives = 13/24 (54%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGIL 24
K KK KKK KK KK G++
Sbjct: 25 KLKKDPNIDKKKFKKIIFKKYGLI 48
>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
homolog [Translation, ribosomal structure and
biogenesis].
Length = 395
Score = 28.9 bits (65), Expect = 3.9
Identities = 8/22 (36%), Positives = 17/22 (77%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKG 22
K++ + ++KKK+KK K +++G
Sbjct: 371 KERPGRYRRKKKEKKAKSERRG 392
Score = 28.1 bits (63), Expect = 6.3
Identities = 7/21 (33%), Positives = 15/21 (71%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
+ KK++ + ++KKK+K+ K
Sbjct: 368 RDKKERPGRYRRKKKEKKAKS 388
Score = 27.7 bits (62), Expect = 7.4
Identities = 8/22 (36%), Positives = 14/22 (63%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKG 22
K K ++ KK++ + +RKKK
Sbjct: 362 TKAKPERDKKERPGRYRRKKKE 383
Score = 27.7 bits (62), Expect = 7.8
Identities = 6/22 (27%), Positives = 14/22 (63%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKG 22
++ + ++KKK+KK K ++
Sbjct: 372 ERPGRYRRKKKEKKAKSERRGL 393
Score = 27.7 bits (62), Expect = 8.0
Identities = 7/21 (33%), Positives = 16/21 (76%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
KK++ + ++KKK+KK + ++
Sbjct: 370 KKERPGRYRRKKKEKKAKSER 390
Score = 27.7 bits (62), Expect = 9.3
Identities = 8/21 (38%), Positives = 15/21 (71%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
KK++ + ++KKK+KK K +
Sbjct: 369 DKKERPGRYRRKKKEKKAKSE 389
>gnl|CDD|234533 TIGR04285, nucleoid_noc, nucleoid occlusion protein. This model
describes nucleoid occlusion protein, a close homolog to
ParB chromosome partitioning proteins including Spo0J in
Bacillus subtilis. Its gene often is located near the
gene for the Spo0J ortholog. This protein bind a
specific DNA sequence and blocks cytokinesis from
happening until chromosome segregation is complete.
Length = 255
Score = 28.6 bits (65), Expect = 3.9
Identities = 11/21 (52%), Positives = 14/21 (66%)
Query: 3 KKKKKKKKKKKKKKKRKKKGI 23
KK +K +K KKKKKR+K
Sbjct: 192 KKLLEKPEKPKKKKKRRKGFS 212
Score = 27.9 bits (63), Expect = 6.1
Identities = 11/19 (57%), Positives = 14/19 (73%)
Query: 2 KKKKKKKKKKKKKKKKRKK 20
KK +K +K KKKKK+RK
Sbjct: 192 KKLLEKPEKPKKKKKRRKG 210
>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor. The
spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
U5) and proteins, catalyzes the excision of introns from
pre-mRNAs in two successive trans-esterification
reactions. Step 2 depends upon integral spliceosome
constituents such as U5 snRNA and Prp8 and
non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
conformational change in the spliceosome that leads to
protection of the 3'ss from targeted RNase H cleavage.
This change, which probably reflects binding of the 3'ss
PyAG in the catalytic centre of the spliceosome,
requires the ordered recruitment of Slu7, Prp18, and
Prp22 to the spliceosome. There is a close functional
relationship between Prp8, Prp18, and Slu7, and Prp18
interacts with Slu7, so that together they recruit Prp22
to the spliceosome. Most members of the family carry a
zinc-finger of the CCHC-type upstream of this domain.
Length = 236
Score = 28.6 bits (64), Expect = 4.0
Identities = 9/23 (39%), Positives = 17/23 (73%)
Query: 3 KKKKKKKKKKKKKKKRKKKGILA 25
+KK+K+KK++ K +KK+ +L
Sbjct: 160 LLRKKEKEKKEQLKIQKKQSLLE 182
>gnl|CDD|130324 TIGR01257, rim_protein, retinal-specific rim ABC transporter. This
model describes the photoreceptor protein (rim protein)
in eukaryotes. It is the member of ABC transporter
superfamily. Rim protein is a membrane glycoprotein which
is localized in the photoreceptor outer segment discs.
Mutation/s in its genetic loci is implicated in the
recessive Stargardt's disease [Transport and binding
proteins, Other].
Length = 2272
Score = 29.2 bits (65), Expect = 4.1
Identities = 22/89 (24%), Positives = 25/89 (28%), Gaps = 32/89 (35%)
Query: 30 KEKIQDRSEC--PGSGMGKPPHDVADVLLSLKHAVVHPGQMTPPPYGDPGAPYQVGAPQQ 87
+E R C P G+ P PGQ P G P
Sbjct: 1294 RENANLRHPCSGPTEKAGQTPQA---------SHTCSPGQPAAHPEGQPP---------- 1334
Query: 88 FPTASLSYTVHPHQSSRGRPLNSGIRLIL 116
P G PLN+G RLIL
Sbjct: 1335 -----------PEPEDPGVPLNTGARLIL 1352
>gnl|CDD|212564 cd11674, lambda-1, inner capsid protein lambda-1 or VP3. The
reovirus inner capsid protein lambda-1 displays
nucleoside triphosphate phosphohydrolase (NTPase),
RNA-5'-triphosphatase (RTPase), and RNA helicase
activity and may play a role in the transcription of the
virus genome, the unwinding or reannealing of
double-stranded RNA during RNA synthesis. The RTPase
activity constitutes the first step in the capping of
RNA, resulting in a 5'-diphosphorylated RNA plus-strand.
lambda1 is an Orthoreovirus core protein, VP3 is the
homologous core protein in Aquareoviruses.
Length = 1166
Score = 29.0 bits (65), Expect = 4.6
Identities = 9/25 (36%), Positives = 12/25 (48%), Gaps = 1/25 (4%)
Query: 172 FRCPICDRRFSQSSSVTTHMRT-HS 195
+ C +C FS + H RT HS
Sbjct: 74 YVCNVCMAEFSSMDQLAEHQRTTHS 98
Score = 28.6 bits (64), Expect = 6.1
Identities = 11/25 (44%), Positives = 11/25 (44%), Gaps = 1/25 (4%)
Query: 144 YRCEDCNKSFSQAANLTAHVRT-HS 167
Y C C FS L H RT HS
Sbjct: 74 YVCNVCMAEFSSMDQLAEHQRTTHS 98
>gnl|CDD|222379 pfam13786, DUF4179, Domain of unknown function (DUF4179).
Length = 93
Score = 26.9 bits (60), Expect = 4.8
Identities = 10/15 (66%), Positives = 12/15 (80%)
Query: 11 KKKKKKKRKKKGILA 25
KK KKK++KKK I A
Sbjct: 1 KKVKKKRKKKKKIAA 15
>gnl|CDD|221755 pfam12756, zf-C2H2_2, C2H2 type zinc-finger (2 copies). This
family contains two copies of a C2H2-like zinc finger
domain.
Length = 100
Score = 27.2 bits (61), Expect = 4.8
Identities = 6/21 (28%), Positives = 10/21 (47%)
Query: 172 FRCPICDRRFSQSSSVTTHMR 192
C C ++F ++ HMR
Sbjct: 51 NECLYCGKQFKSLEALRQHMR 71
Score = 26.8 bits (60), Expect = 5.4
Identities = 23/84 (27%), Positives = 29/84 (34%), Gaps = 17/84 (20%)
Query: 174 CPICDRRFSQSSSVTTHMR-THS---GERPYRCRLCKKAFSDSSTLTKHLR--IHSGEKP 227
C C+ HM +H ER Y D L +LR IH G
Sbjct: 2 CLFCNHTSDTVEENLEHMFKSHGFFIPEREY--------LVDLEGLLNYLREKIHEG--- 50
Query: 228 YQCKLCLLRFSQSGNLNRHMRVHG 251
+C C +F L +HMR G
Sbjct: 51 NECLYCGKQFKSLEALRQHMRDKG 74
>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1. Members
of this family are necessary for accurate chromosome
transmission during cell division.
Length = 804
Score = 28.6 bits (64), Expect = 5.0
Identities = 11/59 (18%), Positives = 14/59 (23%), Gaps = 10/59 (16%)
Query: 59 KHAVVHPGQMTPPPYGDPGAPYQVGAPQQFPTASLSYT---------VHPHQSSRGRPL 108
V P M P P P +P + P Q+ PL
Sbjct: 175 PPQQVLPQGMPPRQAAFPQQGPP-EQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPL 232
>gnl|CDD|234402 TIGR03928, T7_EssCb_Firm, type VII secretion protein EssC,
C-terminal domain. This model describes the C-terminal
domain, or longer subunit, of the Firmicutes type VII
secretion protein EssC. This protein (homologous to EccC
in Actinobacteria) and the WXG100 target proteins are
the only homologous parts of type VII secretion between
Firmicutes and Actinobacteria [Protein fate, Protein and
peptide secretion and trafficking].
Length = 1296
Score = 28.4 bits (64), Expect = 5.4
Identities = 11/35 (31%), Positives = 19/35 (54%), Gaps = 2/35 (5%)
Query: 4 KKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSE 38
++KKK KK +K+ R + L +++Q SE
Sbjct: 84 REKKKYKKDVEKRNRSYRLYLDKKR--KELQALSE 116
>gnl|CDD|153419 cd07914, IGPD, Imidazoleglycerol-phosphate dehydratase.
Imidazoleglycerol-phosphate dehydratase (IGPD; EC
4.2.1.19) catalyzes the dehydration of imidazole
glycerol phosphate to imidazole acetol phosphate, the
sixth step of histidine biosynthesis in plants and
microorganisms where the histidine is synthesized de
novo. There is an internal repeat in the protein domain
that is related by pseudo-dyad symmetry, perhaps as a
result of an ancient gene duplication. The apo-form of
IGPD exists as a catalytically inactive trimer which, in
the presence of specific divalent metal cations such as
manganese (Mn2+), cobalt (Co2+), cadmium (Cd2+), nickel
(Ni2+), iron (Fe2+) and zinc (Zn2+), assembles to form a
biologically active high molecular weight metalloenzyme;
a 24-mer with 4-3-2 symmetry. Each 24-mer has 24 active
sites, and contains around 1.5 metal ions per monomer,
each monomer contributing residues to three separate
active sites. IGPD enzymes are monofunctional in fungi,
plants, archaea and some eubacteria while they are
encoded as bifunctional enzymes in other eubacteria,
such that the enzyme is fused to histidinol-phosphate
phosphatase, the penultimate enzyme of the histidine
biosynthesis pathway. The histidine biosynthesis pathway
is a potential target for development of herbicides, and
IGPD is a target for the triazole phosphonate
herbicides.
Length = 190
Score = 27.7 bits (63), Expect = 5.5
Identities = 7/19 (36%), Positives = 11/19 (57%)
Query: 151 KSFSQAANLTAHVRTHSGE 169
+SF+ A +T H+R G
Sbjct: 136 RSFANNAGITLHIRVLYGR 154
>gnl|CDD|219572 pfam07780, Spb1_C, Spb1 C-terminal domain. This presumed domain is
found at the C-terminus of a family of FtsJ-like
methyltransferases. Members of this family are involved
in 60S ribosomal biogenesis.
Length = 212
Score = 28.0 bits (63), Expect = 5.5
Identities = 17/40 (42%), Positives = 20/40 (50%), Gaps = 4/40 (10%)
Query: 1 KKKKKKKKKKKKK----KKKKRKKKGILANSSSKEKIQDR 36
KK KKKKKK+KK KK + K G K K+ D
Sbjct: 158 KKAKKKKKKRKKTYVVAKKGNKGKAGRPKGVKGKYKVVDP 197
>gnl|CDD|204122 pfam09072, TMA7, Translation machinery associated TMA7. TMA7
plays a role in protein translation. Deletions of the
TMA7 gene results in altered protein synthesis rates.
Length = 63
Score = 26.0 bits (57), Expect = 5.6
Identities = 11/30 (36%), Positives = 16/30 (53%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSK 30
K+K+K++ K K+ K K KG L K
Sbjct: 29 KQKQKEEAKALKELAAKAKGKGPLGGGGIK 58
>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family. This
family includes gbp a protein from Soybean that binds to
GAGA element dinucleotide repeat DNA. It seems likely
that the this domain mediates DNA binding. This putative
domain contains several conserved cysteines and a
histidine suggesting this may be a zinc-binding DNA
interaction domain.
Length = 301
Score = 28.3 bits (63), Expect = 5.8
Identities = 13/37 (35%), Positives = 16/37 (43%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRS 37
KK + K K K KK +KK + S I RS
Sbjct: 151 KKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDPRS 187
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 28.5 bits (64), Expect = 5.9
Identities = 8/21 (38%), Positives = 16/21 (76%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
+K+ K++K+ +K KK+ +KK
Sbjct: 559 EKEALKEQKRLRKLKKQEEKK 579
>gnl|CDD|204841 pfam12171, zf-C2H2_jaz, Zinc-finger double-stranded RNA-binding.
This domain family is found in archaea and eukaryotes,
and is approximately 30 amino acids in length. The
mammalian members of this group occur multiple times
along the protein, joined by flexible linkers, and are
referred to as JAZ - dsRNA-binding ZF protein -
zinc-fingers. The JAZ proteins are expressed in all
tissues tested and localise in the nucleus, particularly
the nucleolus. JAZ preferentially binds to
double-stranded (ds) RNA or RNA/DNA hybrids rather than
DNA. In addition to binding double-stranded RNA, these
zinc-fingers are required for nucleolar localisation.
Length = 27
Score = 24.8 bits (55), Expect = 6.0
Identities = 6/23 (26%), Positives = 13/23 (56%)
Query: 171 PFRCPICDRRFSQSSSVTTHMRT 193
F C CD+ F +++ H+++
Sbjct: 1 QFYCVACDKYFKSENALENHLKS 23
Score = 24.5 bits (54), Expect = 8.7
Identities = 6/23 (26%), Positives = 11/23 (47%)
Query: 143 PYRCEDCNKSFSQAANLTAHVRT 165
+ C C+K F L H+++
Sbjct: 1 QFYCVACDKYFKSENALENHLKS 23
>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
component YidC; Validated.
Length = 429
Score = 28.3 bits (63), Expect = 6.3
Identities = 12/52 (23%), Positives = 23/52 (44%), Gaps = 4/52 (7%)
Query: 2 KKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSGMGKPPHDVAD 53
+ + + ++ K KKKG++ S +++ + E GS PP A
Sbjct: 369 RAAAMARARARRAAVKAKKKGLIDASPNEDTPSENEESKGS----PPQVEAT 416
>gnl|CDD|201461 pfam00829, Ribosomal_L21p, Ribosomal prokaryotic L21 protein.
Length = 96
Score = 26.7 bits (60), Expect = 6.4
Identities = 9/21 (42%), Positives = 13/21 (61%)
Query: 2 KKKKKKKKKKKKKKKKRKKKG 22
+ KK K K++K RKK+G
Sbjct: 68 RGKKVIVFKYKRRKNYRKKQG 88
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called
The RES complex, pre-mRNA retention and splicing
complex. Subunits of this complex are not essential for
viability of yeasts but they are required for efficient
splicing in vitro and in vivo. Furthermore,
inactivation of this complex causes pre-mRNA leakage
from the nucleus. Bud13 contains a unique,
phylogenetically conserved C-terminal region of unknown
function.
Length = 141
Score = 27.3 bits (61), Expect = 6.9
Identities = 7/21 (33%), Positives = 20/21 (95%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
+K+++K+++K++K++K+ K+K
Sbjct: 16 EKREEKEREKEEKERKEEKEK 36
>gnl|CDD|236413 PRK09210, PRK09210, RNA polymerase sigma factor RpoD; Validated.
Length = 367
Score = 28.0 bits (63), Expect = 6.9
Identities = 13/39 (33%), Positives = 18/39 (46%), Gaps = 4/39 (10%)
Query: 1 KKKKKKKKKKKKKKKK----KRKKKGILANSSSKEKIQD 35
KK KKK+ ++ K K KK+G L EK+
Sbjct: 1 KKTKKKETTTLEQVVKELIEKGKKRGTLTYDEIAEKLIP 39
>gnl|CDD|185507 PTZ00191, PTZ00191, 60S ribosomal protein L23a; Provisional.
Length = 145
Score = 27.4 bits (61), Expect = 7.0
Identities = 11/21 (52%), Positives = 13/21 (61%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
KKK KK K KK K +K+K
Sbjct: 8 KKKAKKAAKAAKKGVKVKKRK 28
>gnl|CDD|240380 PTZ00362, PTZ00362, hypothetical protein; Provisional.
Length = 479
Score = 28.2 bits (63), Expect = 7.1
Identities = 15/34 (44%), Positives = 19/34 (55%), Gaps = 2/34 (5%)
Query: 1 KKKKKKKKKKKKKKK--KKRKKKGILANSSSKEK 32
KKKKKKKKK K+ + KK I + S+ K
Sbjct: 9 KKKKKKKKKASKESSNGSQNSKKTIFMYNLSRRK 42
>gnl|CDD|221857 pfam12923, RRP7, Ribosomal RNA-processing protein 7 (RRP7). RRP7
is an essential protein in yeast that is involved in
pre-rRNA processing and ribosome assembly. It is
speculated to be required for correct assembly of rpS27
into the pre-ribosomal particle.
Length = 131
Score = 26.8 bits (60), Expect = 7.7
Identities = 9/23 (39%), Positives = 17/23 (73%)
Query: 2 KKKKKKKKKKKKKKKKRKKKGIL 24
+ K ++++K K+KKK+KKK +
Sbjct: 68 RNKAAEERRKLKEKKKKKKKELE 90
>gnl|CDD|181614 PRK09027, PRK09027, cytidine deaminase; Provisional.
Length = 295
Score = 27.9 bits (63), Expect = 7.7
Identities = 15/45 (33%), Positives = 20/45 (44%), Gaps = 9/45 (20%)
Query: 83 GAPQQFPTASLSYTVHPHQSS------RGRPLNSGIRLILIQFIP 121
GA +F A+L TVH QS+ RG I I + + P
Sbjct: 87 GANMEFAGAALQQTVHAEQSAISHAWLRGE---KAIADITVNYTP 128
>gnl|CDD|218550 pfam05311, Baculo_PP31, Baculovirus 33KDa late protein (PP31).
Autographa californica nuclear polyhedrosis virus
(AcMNPV) pp31 is a nuclear phosphoprotein that
accumulates in the virogenic stroma, which is the viral
replication centre in the infected-cell nucleus, binds
to DNA, and serves as a late expression factor.
Length = 267
Score = 27.7 bits (62), Expect = 7.9
Identities = 11/27 (40%), Positives = 13/27 (48%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANS 27
K+K KK KK KKRK+K
Sbjct: 226 KRKAPKKVAKKVAAAKKRKQKRSPPPP 252
>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein. This family
consists of various plant development proteins which are
homologues of floricaula (FLO) and Leafy (LFY) proteins
which are floral meristem identity proteins. Mutations
in the sequences of these proteins affect flower and
leaf development.
Length = 382
Score = 27.7 bits (62), Expect = 8.0
Identities = 11/42 (26%), Positives = 21/42 (50%)
Query: 2 KKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSG 43
+KKK++KK+++++ K + + E D E G G
Sbjct: 177 HSSDSEKKKQRKKQRRKRSKELREDDDDDEDEDDDGEGGGEG 218
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 28.0 bits (62), Expect = 8.1
Identities = 18/107 (16%), Positives = 31/107 (28%), Gaps = 11/107 (10%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSGMGKPPHDVADVLLSLKH 60
K+ +K++ + ++ + + P S P V
Sbjct: 373 HHPKRASLPTRKRRSARHAATPFARGPGGDDQTRPAAPVPASVPTPAPTPVP-------- 424
Query: 61 AVVHPGQMTPPPYGDPG---APYQVGAPQQFPTASLSYTVHPHQSSR 104
A P TP P +PG P Q A+ P ++R
Sbjct: 425 ASAPPPPATPLPSAEPGSDDGPAPPPERQPPAPATEPAPDDPDDATR 471
>gnl|CDD|224119 COG1198, PriA, Primosomal protein N' (replication factor Y) -
superfamily II helicase [DNA replication, recombination,
and repair].
Length = 730
Score = 28.1 bits (63), Expect = 8.1
Identities = 14/62 (22%), Positives = 16/62 (25%), Gaps = 25/62 (40%)
Query: 172 FRCPICDRRFSQSSSVTTHMRTHSGERPYRCRLCKKAFSDSSTLTKHLRIHSGEKPYQCK 231
CP CD S +T H T RC C + P C
Sbjct: 445 AECPNCD------SPLTLHKATG----QLRCHYCG---------------YQEPIPQSCP 479
Query: 232 LC 233
C
Sbjct: 480 EC 481
>gnl|CDD|233352 TIGR01310, L7, 60S ribosomal protein L7, eukaryotic. This model
describes the eukaryotic 60S (cytosolic) ribosomal
protein L7 and paralogs that may or may not also be L7.
Human, Drosophila, and Arabidopsis all have both a
typical L7 and an L7-related paralog. This family is
designated subfamily rather than equivalog to reflect
these uncharacterized paralogs. Members of this family
average ~ 250 residues in length, somewhat longer than
the archaeal L30P/L7E homolog (~ 155 residues) and much
longer than the related bacterial/organellar form (~ 60
residues).
Length = 235
Score = 27.3 bits (61), Expect = 8.2
Identities = 6/27 (22%), Positives = 15/27 (55%)
Query: 6 KKKKKKKKKKKKRKKKGILANSSSKEK 32
KK+K +++ + K+ ++K+K
Sbjct: 5 KKRKARQELAVQVAKQAKAKKKANKKK 31
>gnl|CDD|133125 cd06594, GH31_glucosidase_YihQ, YihQ is a bacterial
alpha-glucosidase with a conserved glycosyl hydrolase
family 31 (GH31) domain that catalyzes the release of an
alpha-glucosyl residue from the non-reducing end of
alpha-glucoside substrates such as alpha-glucosyl
fluoride. Orthologs of YihQ that have not yet been
functionally characterized are present in plants and
fungi. YihQ has sequence similarity to other GH31
enzymes such as CtsZ, a 6-alpha-glucosyltransferase from
Bacillus globisporus, and YicI, an alpha-xylosidase from
Echerichia coli. In bacteria, YihQ (along with YihO) is
important for bacterial O-antigen capsule assembly and
translocation.
Length = 317
Score = 27.7 bits (62), Expect = 8.2
Identities = 9/23 (39%), Positives = 14/23 (60%)
Query: 179 RRFSQSSSVTTHMRTHSGERPYR 201
R+++ ++ T MRTH G RP
Sbjct: 280 LRWAEMAAFTPVMRTHEGNRPDD 302
>gnl|CDD|227562 COG5237, PER1, Predicted membrane protein [Function unknown].
Length = 319
Score = 27.5 bits (61), Expect = 8.2
Identities = 18/70 (25%), Positives = 27/70 (38%), Gaps = 9/70 (12%)
Query: 200 YRC---RLCKKAFSDSSTL---TKHLRIHSGEK-PYQCKLCLLRFSQSGNLNRHMRVHGS 252
+C C +D++ K K Y C L L+F+ SGN+ + R HG
Sbjct: 32 GKCFENSRCNLNKTDTNMFPLVDKLFGWDCDSKCGYMCHLLCLKFTNSGNIKIYQR-HGK 90
Query: 253 QYNMLPVIYM 262
V+ M
Sbjct: 91 WG-FQRVLGM 99
>gnl|CDD|220376 pfam09745, DUF2040, Coiled-coil domain-containing protein 55
(DUF2040). This entry is a conserved domain of
approximately 130 residues of proteins conserved from
fungi to humans. The proteins do contain a coiled-coil
domain, but the function is unknown.
Length = 128
Score = 26.6 bits (59), Expect = 8.7
Identities = 10/41 (24%), Positives = 24/41 (58%), Gaps = 3/41 (7%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK---GILANSSSKEKIQDRSE 38
K++KK+ K ++K+ RK K +L + +++ ++ +E
Sbjct: 29 AAKEEKKQAKLSERKENRKPKYIGSLLEAAERRKREREIAE 69
>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
eukaryotic snRNP [Transcription].
Length = 564
Score = 27.7 bits (62), Expect = 9.2
Identities = 10/18 (55%), Positives = 13/18 (72%)
Query: 1 KKKKKKKKKKKKKKKKKR 18
K KKKK+KKK+ +K R
Sbjct: 425 LKSKKKKRKKKEWFEKFR 442
>gnl|CDD|220708 pfam10349, WWbp, WW-domain ligand protein. The WWbp domain is
characterized by several short PY and PT-like motifs of
the PPPPY form. These appear to bind directly to the WW
domains of WWP1 and WWP2 and other such diverse proteins
as dystrophin and YAP (Yes-associated protein). This is
the WW-domain binding protein WWbp via PY and PY_like
motifs. The presence of a phosphotyrosine residue in the
pWBP-1 peptide abolishes WW domain binding which
suggests a potential regulatory role for tyrosine
phosphorylation in modulating WW domain-ligand
interactions. Given the likelihood that WWP1 and WWP2
function as E3 ubiquitin-protein ligases, it is possible
that initial substrate-specific recognition occurs via
WW domain-substrate protein interaction followed by
ubiquitin transfer and subsequent proteolysis. This
domain lies just downstream of the GRAM (pfam02893) in
many members.
Length = 111
Score = 26.6 bits (59), Expect = 9.3
Identities = 14/45 (31%), Positives = 16/45 (35%), Gaps = 5/45 (11%)
Query: 61 AVVHPGQMTPPPYGDPGAP--YQVGAPQQFPTASLSYTVHPHQSS 103
A VH P YG AP Y P +P Y P +S
Sbjct: 52 AYVHLEP--LPAYGQYAAPPPYGP-PPPYYPAPPGVYPTPPPPNS 93
>gnl|CDD|219748 pfam08214, KAT11, Histone acetylation protein. Histone acetylation
is required in many cellular processes including
transcription, DNA repair, and chromatin assembly. This
family contains the fungal KAT11 protein (previously
known as RTT109) which is required for H3K56
acetylation. Loss of KAT11 results in the loss of H3K56
acetylation, both on bulk histone and on chromatin.
KAT11 and H3K56 acetylation appear to correlate with
actively transcribed genes and associate with the
elongating form of Pol II in yeast. This family also
incorporates the p300/CBP histone acetyltransferase
domain which has different catalytic properties and
cofactor regulation to KAT11.
Length = 341
Score = 27.4 bits (61), Expect = 9.5
Identities = 12/57 (21%), Positives = 21/57 (36%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSGMGKPPHDVADVLLS 57
K KK+ K+ KK + S + +Q+ + M K D + L+
Sbjct: 274 PPKLVVSKKQYKRAKKAITGEEYDTLSGAASLVQNLMDKLYETMLKKKEDFIVIRLN 330
>gnl|CDD|221712 pfam12687, DUF3801, Protein of unknown function (DUF3801). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are typically between 158 and 187 amino
acids in length. This family includes the PcfB protein.
Length = 137
Score = 26.8 bits (60), Expect = 9.5
Identities = 10/20 (50%), Positives = 15/20 (75%)
Query: 2 KKKKKKKKKKKKKKKKRKKK 21
K+++ KKK++K K K KKK
Sbjct: 118 KEEQAAKKKERKDKVKNKKK 137
>gnl|CDD|240578 cd12951, RRP7_Rrp7A, RRP7 domain ribosomal RNA-processing protein
7 homolog A (Rrp7A) and similar proteins. The family
corresponds to the RRP7 domain of Rrp7A, also termed
gastric cancer antigen Zg14, and similar proteins which
are yeast ribosomal RNA-processing protein 7 (Rrp7p)
homologs mainly found in Metazoans. The cellular
function of Rrp7A remains unclear currently. Rrp7A
harbors an N-terminal RNA recognition motif (RRM), also
termed RBD (RNA binding domain) or RNP
(ribonucleoprotein domain), and a C-terminal RRP7
domain.
Length = 129
Score = 26.5 bits (59), Expect = 9.6
Identities = 9/21 (42%), Positives = 14/21 (66%)
Query: 1 KKKKKKKKKKKKKKKKKRKKK 21
+K+ K +K+KKK+KKK
Sbjct: 65 ARKESVAAKAAEKEKKKKKKK 85
>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
Length = 470
Score = 27.8 bits (62), Expect = 9.7
Identities = 13/29 (44%), Positives = 16/29 (55%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSS 29
K+ KKKKKKKKKKKK+ +
Sbjct: 52 NKQNNKKKKKKKKKKKKKNLGEAYDLAYD 80
>gnl|CDD|234471 TIGR04104, cxxc_20_cxxc, cxxc_20_cxxc protein. This small,
uncommon, poorly conserved protein is found primarily in
the Firmicutes. It features are pair of CxxC motifs
separated by about 20 amino acids, followed by a highly
hydrophobic region of about 45 amino acids. It has no
conserved gene neighborhood, and its function is
unknown.
Length = 94
Score = 26.2 bits (58), Expect = 9.8
Identities = 11/33 (33%), Positives = 17/33 (51%), Gaps = 3/33 (9%)
Query: 145 RCEDCNKSFSQAANLTAHVRTHSGEKPFRCPIC 177
C++CN+ FS L + S +P +CP C
Sbjct: 2 ICKNCNEKFSYKELLKS---LFSLYRPIKCPNC 31
>gnl|CDD|188353 TIGR03608, L_ocin_972_ABC, putative bacteriocin export ABC
transporter, lactococcin 972 group. A gene pair with a
fairly wide distribution consists of a polypeptide
related to the lactococcin 972 (see TIGR01653) and
multiple-membrane-spanning putative immunity protein
(see TIGR01654). This model represents a small clade
within the ABC transporters that regularly are found
adjacent to these bacteriocin system gene pairs and are
likely serve as export proteins [Cellular processes,
Toxin production and resistance, Transport and binding
proteins, Unknown substrate].
Length = 206
Score = 27.2 bits (61), Expect = 10.0
Identities = 15/33 (45%), Positives = 22/33 (66%), Gaps = 2/33 (6%)
Query: 1 KKKKKKKKKKKKKKKKKRKKKGILANSSSKEKI 33
K KK KK+K++KKK+ +K G+ N K+KI
Sbjct: 102 KYKKLSKKEKREKKKEALEKVGL--NLKLKQKI 132
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.318 0.132 0.406
Gapped
Lambda K H
0.267 0.0804 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 15,079,797
Number of extensions: 1364776
Number of successful extensions: 8957
Number of sequences better than 10.0: 1
Number of HSP's gapped: 7077
Number of HSP's successfully gapped: 617
Length of query: 311
Length of database: 10,937,602
Length adjustment: 97
Effective length of query: 214
Effective length of database: 6,635,264
Effective search space: 1419946496
Effective search space used: 1419946496
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 59 (26.4 bits)