RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy16250
(485 letters)
>gnl|CDD|224155 COG1234, ElaC, Metal-dependent hydrolases of the beta-lactamase
superfamily III [General function prediction only].
Length = 292
Score = 165 bits (420), Expect = 2e-47
Identities = 83/289 (28%), Positives = 117/289 (40%), Gaps = 30/289 (10%)
Query: 114 SGFPLLDKGKQVLAIRGPGPDTCILLDCGEGTYSQLVRLYGSAVDTLLSQLSAVYISHLH 173
P D+ + +R G L DCGEGT QL+R ++ A++I+HLH
Sbjct: 11 GAVPTKDRNVSSILLRLEGEK--FLFDCGEGTQHQLLRAGLP-----PRKIDAIFITHLH 63
Query: 174 ADHHLGLFSVIKAWSRVKPECKLTLLAPRQIITW------LSVYAARFESVGHLYRLVPL 227
DH GL ++ + S L + P I + LS +E +GH
Sbjct: 64 GDHIAGLPGLLVSRSFRGRREPLKIYGPPGIKEFVETSLRLSYSKLTYEIIGHEIEEDAF 123
Query: 228 SLFNTKGLIEGTEQHG----QNRPALDPDTVQILSSLGLESMTTCLVRHCPN------AF 277
L G G + D ++ +T H A
Sbjct: 124 -EVEALELDHGVPALGYRIEEPDRPGRFDAEKLKGLPPGPLITALKAGHPVEERVITPAD 182
Query: 278 GVTMVTKSGHKITYSGDTMPCDALVSIGKNSDLLIHEATHEDELEKEAALKMHSTVSQAI 337
+ + G + YSGDT PCD L+ + K +DLLIHEAT ED+LE A HST +A
Sbjct: 183 RIGE-PRKGKSVVYSGDTRPCDELIDLAKGADLLIHEATFEDDLEDLANEGGHSTAEEAA 241
Query: 338 RIGREMRAKFVLLTHFSQRYAK-LPRLNKDLSE----NVGIAFDNMRFP 381
I +E K ++LTHFS RY K L K+ +A D + F
Sbjct: 242 EIAKEAGVKKLILTHFSPRYPKDDEELLKEARAIFPGETIVARDGLVFE 290
>gnl|CDD|234602 PRK00055, PRK00055, ribonuclease Z; Reviewed.
Length = 270
Score = 131 bits (332), Expect = 4e-35
Identities = 76/264 (28%), Positives = 109/264 (41%), Gaps = 49/264 (18%)
Query: 137 ILLDCGEGTYSQLVRLYGSAVDTLLSQLSAVYISHLHADHHLGLFSVIK---AWSRVKPE 193
L DCGEGT QL++ ++ ++I+HLH DH GL ++ R +P
Sbjct: 32 FLFDCGEGTQRQLLKTGIKP-----RKIDKIFITHLHGDHIFGLPGLLSTRSLSGRTEP- 85
Query: 194 CKLTLLAPRQIITWLSVYAARFESVGHLYRLV--------PLSLFNTKGLIEGTE----Q 241
LT+ P+ I ++ S+G YR+ G+ G +
Sbjct: 86 --LTIYGPKGIKEFVETLLRASGSLG--YRIAEKDKPGKLDAEKLKALGVPPGPLFGKLK 141
Query: 242 HGQNRPALDPDTVQILSSLGLESMTTCLVRHCPNAFGVTMVTKSGHKITYSGDTMPCDAL 301
G++ D + V + G K+ Y GDT PC+AL
Sbjct: 142 RGEDVTLEDGRIINPAD--------------------VLGPPRKGRKVAYCGDTRPCEAL 181
Query: 302 VSIGKNSDLLIHEATHEDELEKEAALKMHSTVSQAIRIGREMRAKFVLLTHFSQRYAKLP 361
V + K +DLL+HEAT DE E+ A HST QA I +E K ++LTHFS RY P
Sbjct: 182 VELAKGADLLVHEATFGDEDEELAKEYGHSTARQAAEIAKEAGVKRLILTHFSPRYTGDP 241
Query: 362 RLNKDLS----ENVGIAFDNMRFP 381
+ N +A D MR
Sbjct: 242 EELLKEAREIFPNTELAEDLMRVE 265
>gnl|CDD|221727 pfam12706, Lactamase_B_2, Beta-lactamase superfamily domain. This
family is part of the beta-lactamase superfamily and is
related to pfam00753.
Length = 197
Score = 116 bits (292), Expect = 3e-30
Identities = 56/222 (25%), Positives = 90/222 (40%), Gaps = 33/222 (14%)
Query: 137 ILLDCGEGTYSQLVRLYGSAVDTLLSQLSAVYISHLHADHHLGLFSVIKAWSRVKPECKL 196
IL+D G GT + A LL ++ AV +SH HADH LGL ++ + R+ +L
Sbjct: 3 ILIDPGPGTRDLRQQPL--AAPALLPRIDAVLLSHAHADHILGLLTLRELIKRLPK--RL 58
Query: 197 TLLAPRQIITWLSVYAARFESVGHLYRLVPLSLFNTKGLIEGTEQHGQNRPALDPDTVQI 256
+ P + L + ++ +D
Sbjct: 59 PVYGPPGVAEDLRAPIFGVLPLFPVFD--------------------IEIFEIDIGEDVE 98
Query: 257 LSSLGLESMTTCLVRHCPNAFGVT---MVTKSGHKITYSGDTMPCDALVSIGKNSDLLIH 313
+ L + T V+H + + + G KI YSGDT L++ K +DLLI
Sbjct: 99 VGGLTI---TAFPVQHGSGRYLLREGFRIETPGKKIYYSGDTGYIPDLLAEIKGADLLIL 155
Query: 314 EATHEDELE---KEAALKMHSTVSQAIRIGREMRAKFVLLTH 352
+AT D+ E K A H T +A+ +++ AK ++L H
Sbjct: 156 DATFYDDDEEPRKTARRMGHMTPEEALEAAKDLGAKKLILIH 197
>gnl|CDD|233963 TIGR02651, RNase_Z, ribonuclease Z. Processing of the 3-prime end
of tRNA precursors may be the result of endonuclease or
exonuclease activity, and differs in different species.
Member of this family are ribonuclease Z, a tRNA 3-prime
endonuclease that processes tRNAs to prepare for
addition of CCA. In species where all tRNA sequences
already have the CCA tail, such as E. coli, the need for
such an enzyme is unclear. Protein similar to the E.
coli enzyme, matched by TIGR02649, are designated
ribonuclease BN [Transcription, RNA processing].
Length = 299
Score = 108 bits (271), Expect = 3e-26
Identities = 43/102 (42%), Positives = 57/102 (55%), Gaps = 3/102 (2%)
Query: 283 TKSGHKITYSGDTMPCDALVSIGKNSDLLIHEATHEDELEKEAALKMHSTVSQAIRIGRE 342
+ G KI Y+GDT PC+ ++ KN+DLLIHEAT DE +K A HST +QA I +E
Sbjct: 197 PRKGRKIAYTGDTRPCEEVIEFAKNADLLIHEATFLDEDKKLAKEYGHSTAAQAAEIAKE 256
Query: 343 MRAKFVLLTHFSQRYAKLPRLNKDLS---ENVGIAFDNMRFP 381
K ++LTH S RY+ L ++ N IA D M
Sbjct: 257 ANVKRLILTHISPRYSDEEELLEEAKKIFPNTYIAEDFMEIE 298
Score = 54.1 bits (131), Expect = 5e-08
Identities = 25/72 (34%), Positives = 36/72 (50%), Gaps = 11/72 (15%)
Query: 137 ILLDCGEGTYSQLVRLYGSAVDTLLSQLSAVYISHLHADHHLGLFSVIKAWS---RVKPE 193
L DCGEGT Q++R ++ ++I+HLH DH LGL ++ S R +P
Sbjct: 30 WLFDCGEGTQRQMLRS-----GISPMKIDRIFITHLHGDHILGLPGLLSTMSFQGRKEP- 83
Query: 194 CKLTLLAPRQII 205
LT+ P I
Sbjct: 84 --LTIYGPPGIK 93
>gnl|CDD|131697 TIGR02649, true_RNase_BN, ribonuclease BN. Members of this protein
family are ribonuclease BN of Escherichia coli K-12 and
closely related proteins believed to be equivalent in
function. Note that E. coli appears to lack RNase Z per
se, and this protein of E. coli appears orthologous to
(but not functionally equivalent to) RNase Z of Bacillus
subtilis and various other species. Meanwhile, the yihY
gene product of E. coli previously was incorrectly
identified as RNase BN [Transcription, RNA processing].
Length = 303
Score = 72.7 bits (178), Expect = 4e-14
Identities = 68/259 (26%), Positives = 105/259 (40%), Gaps = 32/259 (12%)
Query: 125 VLAIRGPGPDTCILLDCGEGTYSQLVRLYGSAVDTLLSQLSAVYISHLHADHHLGLFSVI 184
+L ++ P L DCGEGT QL+ +A + +L ++ISHLH DH GL ++
Sbjct: 21 LLNLQHPTQSGLWLFDCGEGTQHQLLH---TAFNP--GKLDKIFISHLHGDHLFGLPGLL 75
Query: 185 KAWSRVKPECKLTLLAPRQIITWLSV----------YAARFESVGH------------LY 222
+ S LT+ P+ I ++ Y +G Y
Sbjct: 76 CSRSMSGIIQPLTIYGPQGIREFVETALRISGSWTDYPLEIVEIGAGEILDDGLRKVTAY 135
Query: 223 RLV-PLSLFNTKGLIEGTEQHGQNRPALDPDTVQ---ILSSLGLESMTTCLVRHCPNAFG 278
L PL + + + E + N AL V + L T N
Sbjct: 136 PLEHPLECYGYR-IEEHDKPGALNAQALKAAGVPPGPLFQELKAGKTITLEDGRQINGAD 194
Query: 279 VTMVTKSGHKITYSGDTMPCDALVSIGKNSDLLIHEATHEDELEKEAALKMHSTVSQAIR 338
G + GDT PCDA + + K D+++HEAT + +E +A + HS+ QA
Sbjct: 195 YLAAPVPGKALAIFGDTGPCDAALDLAKGVDVMVHEATLDITMEAKANSRGHSSTRQAAT 254
Query: 339 IGREMRAKFVLLTHFSQRY 357
+ RE +++TH S RY
Sbjct: 255 LAREAGVGKLIITHVSSRY 273
>gnl|CDD|235006 PRK02126, PRK02126, ribonuclease Z; Provisional.
Length = 334
Score = 58.0 bits (141), Expect = 3e-09
Identities = 29/82 (35%), Positives = 35/82 (42%), Gaps = 4/82 (4%)
Query: 286 GHKITYSGDTMP----CDALVSIGKNSDLLIHEATHEDELEKEAALKMHSTVSQAIRIGR 341
G KI Y D +V + DLL EA DE ++A K H T QA R+ R
Sbjct: 242 GQKIGYVTDIGYTEENLARIVELAAGVDLLFIEAVFLDEDAEKARRKNHLTARQAGRLAR 301
Query: 342 EMRAKFVLLTHFSQRYAKLPRL 363
E K +L HFS RY
Sbjct: 302 EAGVKRLLPFHFSPRYQGRGAE 323
Score = 34.1 bits (79), Expect = 0.12
Identities = 14/50 (28%), Positives = 23/50 (46%), Gaps = 9/50 (18%)
Query: 137 ILLDCGEGTYSQLVRLYGSAVDTLLSQLSAVYISHLHADHHLGLFSVIKA 186
+L D G L L LL +S +++SH H DH +G +++
Sbjct: 30 LLFDLG-----DLHHL---PPRELLR-ISHIFVSHTHMDHFIGFDRLLRH 70
>gnl|CDD|216099 pfam00753, Lactamase_B, Metallo-beta-lactamase superfamily.
Length = 148
Score = 43.9 bits (103), Expect = 3e-05
Identities = 18/72 (25%), Positives = 29/72 (40%), Gaps = 2/72 (2%)
Query: 132 GPDTCILLDCGEGTYSQLVRLYGSAVDTLLSQLSAVYISHLHADHHLGLFSVIKAWSRVK 191
G +L+D G G L+ L +D + A+ ++H HADH GL + +A
Sbjct: 13 GDGGAVLIDTGLGADDALLLLALLGLD--PKDIDAIILTHAHADHIGGLPELKEATGAPV 70
Query: 192 PECKLTLLAPRQ 203
A +
Sbjct: 71 VAAPEDAAALLR 82
>gnl|CDD|214854 smart00849, Lactamase_B, Metallo-beta-lactamase superfamily. Apart
from the beta-lactamases a number of other proteins
contain this domain. These proteins include
thiolesterases, members of the glyoxalase II family,
that catalyse the hydrolysis of S-D-lactoyl-glutathione
to form glutathione and D-lactic acid and a competence
protein that is essential for natural transformation in
Neisseria gonorrhoeae and could be a transporter
involved in DNA uptake. Except for the competence
protein these proteins bind two zinc ions per molecule
as cofactor.
Length = 177
Score = 42.9 bits (101), Expect = 1e-04
Identities = 36/197 (18%), Positives = 63/197 (31%), Gaps = 37/197 (18%)
Query: 132 GPDTCILLDCGEGTYSQLVRLYGSAVDTLLSQLSAVYISHLHADHHLGLFSVIKAWSRVK 191
IL+D G G L+ ++ A+ ++H H DH GL +++A
Sbjct: 7 DDGGAILIDTGPGEAEDLLAELKKL---GPKKIDAIILTHGHPDHIGGLPELLEAP---- 59
Query: 192 PECKLTLLAPRQIITWLSVYAARFESVGHLYRLVPLSLFNTKGLIEGTEQHGQNRPALDP 251
+ AP L A +G P + L +G E L
Sbjct: 60 ---GAPVYAPEGTAELLKDLLALLGELGAEAEPAP----PDRTLKDGDELD------LGG 106
Query: 252 DTVQILSSLGLESMTTCLVRHCPNAFGVTMVTKSGHKITYSGDTMPCDALVSIGKNSDLL 311
++++ + G H P + + + KI ++GD + +
Sbjct: 107 GELEVIHTPG----------HTPGSIVLYL---PEGKILFTGDLLFAGGDGRT----LVD 149
Query: 312 IHEATHEDELEKEAALK 328
+A D LE L
Sbjct: 150 GGDAAASDALESLLKLL 166
>gnl|CDD|224156 COG1235, PhnP, Metal-dependent hydrolases of the beta-lactamase
superfamily I [General function prediction only].
Length = 269
Score = 42.4 bits (100), Expect = 2e-04
Identities = 50/254 (19%), Positives = 89/254 (35%), Gaps = 39/254 (15%)
Query: 137 ILLDCGEGTYSQLVRLYGSAVDTLLSQLSAVYISHLHADHHLGLFSVIKAWSRVKPECKL 196
+L+D G Q +RL S +D A+ ++H H+DH GL + +A L
Sbjct: 43 LLIDAGPDLRDQGLRLGVSDLD-------AILLTHEHSDHIQGLDDLRRA-------YTL 88
Query: 197 TLLAPRQIITWLSVYAARFESVGHLYRLVPLSLFNTKGLIEGTEQH-GQNRPALDPDTVQ 255
+ T + + R P F+ + A++P
Sbjct: 89 PI--YVNPGTLRASTSDRLLGGFPYLFRHPFPPFSLPAIGGLEVTPFPVPHDAIEPVGFV 146
Query: 256 ILSSLGLESMTTCLVRHCPNAFGVTMVTKSGHKITYSGDTM--PCDALVSIGKNS--DLL 311
I+ + T + +G + + Y DT P + V + N L
Sbjct: 147 IIRTGRKLHGGTDI------GYG---LEWRIGDVAYLTDTELFPSNHDVELLDNGLYPLD 197
Query: 312 IHEATHEDELEKEAALKMHSTVSQAIRIGREMRAKFVLLTHFSQR--YAKLPRLNKDLSE 369
I + D L H + +A+ + +++ K ++LTH S + +LP L +E
Sbjct: 198 IKDRILPD----PGHLSNHLSAEEALELIEKLKPKRLVLTHLSHKNDDEELPEL---EAE 250
Query: 370 NVGIAFDNMRFPEK 383
V A+ M
Sbjct: 251 EVEPAYGGMGIDVG 264
>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
complex aNOP56 subunit; Provisional.
Length = 414
Score = 41.9 bits (99), Expect = 6e-04
Identities = 18/24 (75%), Positives = 21/24 (87%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
EKK +K+KKKKKRKKK KK+KKK
Sbjct: 388 EEKKPQKRKKKKKRKKKGKKRKKK 411
Score = 41.5 bits (98), Expect = 7e-04
Identities = 16/24 (66%), Positives = 21/24 (87%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
P KKK+++KK +KRKKKKK+KKK
Sbjct: 382 PPKKKREEKKPQKRKKKKKRKKKG 405
Score = 41.5 bits (98), Expect = 8e-04
Identities = 14/26 (53%), Positives = 23/26 (88%)
Query: 379 RFPEKKKKKKKKKKKRKKKKKKKKKK 404
++P+ KKK+++KK +K+KKKKK+KK
Sbjct: 378 KYPKPPKKKREEKKPQKRKKKKKRKK 403
Score = 40.7 bits (96), Expect = 0.001
Identities = 14/24 (58%), Positives = 20/24 (83%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
+K+KKKKK+KK+ KK+KKK +K
Sbjct: 391 KPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 40.7 bits (96), Expect = 0.001
Identities = 14/24 (58%), Positives = 22/24 (91%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
P+KK+++KK +K++KKKK+KKK K
Sbjct: 383 PKKKREEKKPQKRKKKKKRKKKGK 406
Score = 39.6 bits (93), Expect = 0.003
Identities = 15/26 (57%), Positives = 21/26 (80%)
Query: 379 RFPEKKKKKKKKKKKRKKKKKKKKKK 404
R +K +K+KKKKK++KK KK+KKK
Sbjct: 387 REEKKPQKRKKKKKRKKKGKKRKKKG 412
Score = 39.6 bits (93), Expect = 0.003
Identities = 15/26 (57%), Positives = 22/26 (84%)
Query: 379 RFPEKKKKKKKKKKKRKKKKKKKKKK 404
+ E+KK +K+KKKK++KKK KK+KK
Sbjct: 385 KKREEKKPQKRKKKKKRKKKGKKRKK 410
Score = 38.0 bits (89), Expect = 0.009
Identities = 13/24 (54%), Positives = 21/24 (87%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
+K+++KK +K+K+KKK+KKK KK
Sbjct: 384 KKKREEKKPQKRKKKKKRKKKGKK 407
Score = 36.9 bits (86), Expect = 0.023
Identities = 13/24 (54%), Positives = 20/24 (83%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
+ +K+KKKKK+K+K KK+KKK +
Sbjct: 390 KKPQKRKKKKKRKKKGKKRKKKGR 413
Score = 35.7 bits (83), Expect = 0.044
Identities = 13/24 (54%), Positives = 20/24 (83%)
Query: 379 RFPEKKKKKKKKKKKRKKKKKKKK 402
+ ++KKKKK+KKK +K+KKK +K
Sbjct: 391 KPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 32.2 bits (74), Expect = 0.59
Identities = 13/23 (56%), Positives = 17/23 (73%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
EK K KKK++ KK +K+KKKK
Sbjct: 377 EKYPKPPKKKREEKKPQKRKKKK 399
Score = 31.1 bits (71), Expect = 1.5
Identities = 12/22 (54%), Positives = 17/22 (77%)
Query: 383 KKKKKKKKKKKRKKKKKKKKKK 404
K+K K KKKR++KK +K+KK
Sbjct: 376 KEKYPKPPKKKREEKKPQKRKK 397
>gnl|CDD|220231 pfam09420, Nop16, Ribosome biogenesis protein Nop16. Nop16 is a
protein involved in ribosome biogenesis.
Length = 173
Score = 40.1 bits (94), Expect = 7e-04
Identities = 25/99 (25%), Positives = 39/99 (39%), Gaps = 3/99 (3%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKKISLANLPKLKLFYPAL 441
+KKKK + + +K+ K+K I I P I + +D KK N +L L A
Sbjct: 1 VRKKKKNRSSNYKVNRKRLKRKDRKKKINIIGNPIIAKNWDKKKTLAQNYKRLGL---AA 57
Query: 442 KAMFAEYQDEIENRAIRRNLKQESTYLARYPSENDKEKP 480
A D + LK+ + R N ++P
Sbjct: 58 DPNTAARTDLLIQDKPIPELKKGEARIERDDKGNPLDEP 96
>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
(DUF2058). This domain, found in various prokaryotic
proteins, has no known function.
Length = 177
Score = 39.9 bits (94), Expect = 0.001
Identities = 22/82 (26%), Positives = 34/82 (41%), Gaps = 15/82 (18%)
Query: 383 KKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKKISLANLPKLKLFYPALK 442
KKK KK KK+KRK++K+ +K + D K + K +
Sbjct: 14 KKKAKKAKKEKRKQRKQARKGADDGD-------------DELKQAAEEAKAEKA--ERDR 58
Query: 443 AMFAEYQDEIENRAIRRNLKQE 464
+ + Q E E +AI +KQ
Sbjct: 59 ELNRQRQAEAEQKAIVAQIKQL 80
>gnl|CDD|217502 pfam03343, SART-1, SART-1 family. SART-1 is a protein involved in
cell cycle arrest and pre-mRNA splicing. It has been
shown to be a component of U4/U6 x U5 tri-snRNP complex
in human, Schizosaccharomyces pombe and Saccharomyces
cerevisiae. SART-1 is a known tumour antigen in a range
of cancers recognised by T cells.
Length = 603
Score = 40.1 bits (94), Expect = 0.002
Identities = 30/140 (21%), Positives = 47/140 (33%), Gaps = 13/140 (9%)
Query: 358 AKLPRLNKDLSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKK---------VNNS 408
K+ ++ D A D E K KK KKKK+KKKK++K S
Sbjct: 247 LKINNVSLDDDSTETPASDYYDVSEMVKFKKPKKKKKKKKKRRKDLDEDELEPEAEGLGS 306
Query: 409 IQISSRPEICREFDSKKISLANLPKLKLFYPALKAMFAEYQDEIE---NRAIRRNLKQES 465
SR ++ E + S K + + Q + A ++ K
Sbjct: 307 SDSGSRKDVEEENARLEDS-PKKRKEEQEDDDFVEDDDDLQASLAKQRRLAQKKRKKLRP 365
Query: 466 TYLARYPSENDKEKPGSLDD 485
+AR +E E +
Sbjct: 366 EDIARQIAEERSEDEEEAAE 385
Score = 29.0 bits (65), Expect = 7.6
Identities = 16/93 (17%), Positives = 35/93 (37%), Gaps = 9/93 (9%)
Query: 319 DELEKEAALKMHSTVSQAIRIGREMRAKFVLLTHFSQRYAKLPRLNKDLSENVGIA--FD 376
E+ A + + + I RE R + + + L +D ++
Sbjct: 44 KRQEEAEAKRKREELREKIAKAREKRERN-------SKLGGIKTLGEDDDDDDDTKAWLK 96
Query: 377 NMRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSI 409
+ +KKK+ ++KK +K+K++ S
Sbjct: 97 KSKKRQKKKEAERKKALLLDEKEKERAAEYTSE 129
>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
Length = 434
Score = 39.9 bits (94), Expect = 0.002
Identities = 9/27 (33%), Positives = 17/27 (62%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNS 408
++K+K K KK+ R K K++K + +
Sbjct: 405 KEKEKPKVKKRHRDTKNIGKRRKPSGT 431
Score = 39.5 bits (93), Expect = 0.003
Identities = 8/23 (34%), Positives = 16/23 (69%)
Query: 383 KKKKKKKKKKKRKKKKKKKKKKV 405
+KK+K+K+K + KK+ + K +
Sbjct: 400 AEKKEKEKEKPKVKKRHRDTKNI 422
Score = 39.5 bits (93), Expect = 0.003
Identities = 17/51 (33%), Positives = 24/51 (47%), Gaps = 12/51 (23%)
Query: 379 RFPEKKKKKKKKKK---KRKKKKKKKKKKVNNSIQISSRPEICREFDSKKI 426
+ P +KK K KK KR +KK+K+K+K R D+K I
Sbjct: 381 KAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKK---------RHRDTKNI 422
Score = 39.2 bits (92), Expect = 0.004
Identities = 10/30 (33%), Positives = 16/30 (53%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVNNSIQ 410
EK+K K KK+ + K K++K S +
Sbjct: 405 KEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434
Score = 38.8 bits (91), Expect = 0.006
Identities = 9/26 (34%), Positives = 16/26 (61%)
Query: 379 RFPEKKKKKKKKKKKRKKKKKKKKKK 404
R +K+K+K+K K K++ + K K
Sbjct: 399 RAEKKEKEKEKPKVKKRHRDTKNIGK 424
Score = 38.0 bits (89), Expect = 0.009
Identities = 9/27 (33%), Positives = 16/27 (59%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNS 408
EK+K+K K KK+ + K K++ +
Sbjct: 404 EKEKEKPKVKKRHRDTKNIGKRRKPSG 430
Score = 36.1 bits (84), Expect = 0.041
Identities = 11/32 (34%), Positives = 20/32 (62%)
Query: 383 KKKKKKKKKKKRKKKKKKKKKKVNNSIQISSR 414
K K+ +KK+++K+K K KK+ ++ I R
Sbjct: 394 KVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKR 425
>gnl|CDD|219564 pfam07771, TSGP1, Tick salivary peptide group 1. This contains a
group of peptides derived from a salivary gland cDNA
library of the tick Ixodes scapularis. Also present are
peptides from a related tick species, Ixodes ricinus.
They are characterized by a putative signal peptide
indicative of secretion and conserved cysteine residues.
Length = 120
Score = 37.6 bits (87), Expect = 0.002
Identities = 16/24 (66%), Positives = 16/24 (66%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
EK KKKKKK KK KK KK KK
Sbjct: 97 TEKPKKKKKKSKKTKKPKKSSKKD 120
Score = 35.6 bits (82), Expect = 0.012
Identities = 16/29 (55%), Positives = 19/29 (65%)
Query: 376 DNMRFPEKKKKKKKKKKKRKKKKKKKKKK 404
D+ P +K KKKKKK K+ KK KK KK
Sbjct: 91 DHPPEPTEKPKKKKKKSKKTKKPKKSSKK 119
Score = 31.4 bits (71), Expect = 0.33
Identities = 15/38 (39%), Positives = 20/38 (52%)
Query: 367 LSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKK 404
L+ + G P + +K KKKKK+ KK KK KK
Sbjct: 79 LTTSSGEPSHPDDHPPEPTEKPKKKKKKSKKTKKPKKS 116
>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19.
Med19 represents a family of conserved proteins which
are members of the multi-protein co-activator Mediator
complex. Mediator is required for activation of RNA
polymerase II transcription by DNA binding
transactivators.
Length = 178
Score = 37.9 bits (88), Expect = 0.004
Identities = 15/27 (55%), Positives = 19/27 (70%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVNN 407
+ KKKK + K+RKKKKK+KKKK
Sbjct: 142 KKHKKKKHEDDKERKKKKKEKKKKKKR 168
Score = 31.7 bits (72), Expect = 0.55
Identities = 17/36 (47%), Positives = 21/36 (58%), Gaps = 7/36 (19%)
Query: 382 EKKKKKKKKKKKR-------KKKKKKKKKKVNNSIQ 410
K +KK KKKK KKKK+KKKKK +S +
Sbjct: 137 LKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRHSPE 172
Score = 27.9 bits (62), Expect = 7.9
Identities = 11/17 (64%), Positives = 15/17 (88%)
Query: 381 PEKKKKKKKKKKKRKKK 397
E+KKKKK+KKKK+K+
Sbjct: 153 KERKKKKKEKKKKKKRH 169
>gnl|CDD|223565 COG0491, GloB, Zn-dependent hydrolases, including glyoxylases
[General function prediction only].
Length = 252
Score = 38.6 bits (89), Expect = 0.004
Identities = 14/60 (23%), Positives = 24/60 (40%), Gaps = 2/60 (3%)
Query: 132 GPDTCILLDCGEGTYSQLVRLYGSAVDTLLSQLSAVYISHLHADHHLGLFSVIKAWSRVK 191
G +L+D G G A+ L + A+ ++H H DH G + +A+
Sbjct: 33 GEGGAVLIDTGLGDA--DAEALLEALAALGLDVDAILLTHGHFDHIGGAAVLKEAFGAAP 90
>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
Length = 135
Score = 37.0 bits (86), Expect = 0.005
Identities = 15/27 (55%), Positives = 18/27 (66%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNS 408
K+KK+KKKKKK+KKKK KK
Sbjct: 109 AKQKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 37.0 bits (86), Expect = 0.005
Identities = 17/24 (70%), Positives = 20/24 (83%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKV 405
EK K+KK+KKKK+KKKKKK KK
Sbjct: 107 EKAKQKKQKKKKKKKKKKKTSKKA 130
Score = 35.4 bits (82), Expect = 0.018
Identities = 15/22 (68%), Positives = 19/22 (86%)
Query: 383 KKKKKKKKKKKRKKKKKKKKKK 404
K+K K+KK+KK+KKKKKKKK
Sbjct: 106 KEKAKQKKQKKKKKKKKKKKTS 127
Score = 35.0 bits (81), Expect = 0.022
Identities = 15/24 (62%), Positives = 19/24 (79%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
++KK+KKKKKKK+KKK KK K
Sbjct: 109 AKQKKQKKKKKKKKKKKTSKKAAK 132
Score = 35.0 bits (81), Expect = 0.027
Identities = 15/23 (65%), Positives = 20/23 (86%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
++K K+KK+KKK+KKKKKKK K
Sbjct: 106 KEKAKQKKQKKKKKKKKKKKTSK 128
Score = 33.9 bits (78), Expect = 0.056
Identities = 15/26 (57%), Positives = 20/26 (76%)
Query: 383 KKKKKKKKKKKRKKKKKKKKKKVNNS 408
+ K+K K+KK+KKKKKKKKKK +
Sbjct: 103 QVLKEKAKQKKQKKKKKKKKKKKTSK 128
Score = 33.5 bits (77), Expect = 0.075
Identities = 15/23 (65%), Positives = 20/23 (86%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
+ K+K K+KK++KKKKKKKKKK
Sbjct: 103 QVLKEKAKQKKQKKKKKKKKKKK 125
Score = 32.4 bits (74), Expect = 0.21
Identities = 14/22 (63%), Positives = 19/22 (86%)
Query: 383 KKKKKKKKKKKRKKKKKKKKKK 404
K+ K+K K+K++KKKKKKKKK
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKK 123
Score = 29.3 bits (66), Expect = 2.1
Identities = 15/26 (57%), Positives = 20/26 (76%), Gaps = 2/26 (7%)
Query: 382 EKK--KKKKKKKKKRKKKKKKKKKKV 405
E+K K+ K+K K+KK+KKKKKKK
Sbjct: 97 EQKIVKQVLKEKAKQKKQKKKKKKKK 122
Score = 28.1 bits (63), Expect = 5.8
Identities = 11/28 (39%), Positives = 16/28 (57%)
Query: 386 KKKKKKKKRKKKKKKKKKKVNNSIQISS 413
K+ K+K ++KK+KKKKKK
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKK 129
>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 142
Score = 36.9 bits (86), Expect = 0.007
Identities = 16/30 (53%), Positives = 21/30 (70%)
Query: 384 KKKKKKKKKKRKKKKKKKKKKVNNSIQISS 413
KK+ K++KKK+KKKKKKK KK N +
Sbjct: 85 KKRAKRQKKKQKKKKKKKAKKGNKKEEKEG 114
Score = 34.6 bits (80), Expect = 0.040
Identities = 15/29 (51%), Positives = 22/29 (75%)
Query: 376 DNMRFPEKKKKKKKKKKKRKKKKKKKKKK 404
+ R E+K KK+ K+++KK+KKKKKKK
Sbjct: 74 EKKRKDEEKTAKKRAKRQKKKQKKKKKKK 102
Score = 33.5 bits (77), Expect = 0.083
Identities = 18/33 (54%), Positives = 25/33 (75%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSR 414
+KKK+KKKKKKK KK KK++K+ + S + SS
Sbjct: 91 QKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSD 123
Score = 33.5 bits (77), Expect = 0.083
Identities = 15/34 (44%), Positives = 21/34 (61%)
Query: 383 KKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPE 416
K+ K++KKK+K+KKKKK KK + S E
Sbjct: 86 KRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSE 119
Score = 32.7 bits (75), Expect = 0.16
Identities = 13/29 (44%), Positives = 20/29 (68%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNSIQ 410
+++K KK+ K++KKK+KKKKKK
Sbjct: 79 DEEKTAKKRAKRQKKKQKKKKKKKAKKGN 107
>gnl|CDD|188239 TIGR02650, RNase_Z_T_toga, ribonuclease Z, Thermotoga type.
Members of this protein family are ribonuclease Z as
found in the genus Thermotoga, where the enzyme cleaves
after the CCA, in contrast to the activities
characterized for other enzymes also designated
ribonuclease Z. In other systems, cleavage occurs
5-prime to the location of the CCA sequence, and CCA is
added subsequently. A species may lack ribonuclease Z if
all tRNA genes encode the CCA sequence, or if the CCA is
exposed by exonuclease activity rather than endonuclease
activity. Note that members of this sequence family
differ considerably from the majority of RNase Z
sequences [Transcription, RNA processing].
Length = 277
Score = 38.1 bits (88), Expect = 0.008
Identities = 27/84 (32%), Positives = 42/84 (50%), Gaps = 6/84 (7%)
Query: 278 GVTMVTKSGHK--ITYSGDTMPCDALVSIGKNSDLLIHEATHEDELEKEAALKMHSTVSQ 335
G VT+ HK +T SGD++ D G ++LLIHE T D ++ K H+ + +
Sbjct: 171 GRDFVTEEYHKKILTISGDSLALDPEEIEG--TELLIHECTFLDARDRR--YKNHAAIDE 226
Query: 336 AIRIGREMRAKFVLLTHFSQRYAK 359
+ + K V+L H S RY +
Sbjct: 227 VMESVKAAGVKKVILYHISTRYIR 250
Score = 28.8 bits (64), Expect = 6.3
Identities = 18/48 (37%), Positives = 24/48 (50%), Gaps = 8/48 (16%)
Query: 137 ILLDCGEGTYSQLVRLYGSAVDTLLSQLSAVYISHLHADHHLGLFSVI 184
IL D GEG + L GS V V+++H H DH GL+ V+
Sbjct: 21 ILFDAGEGVSTTL----GSKV----YAFKYVFLTHGHVDHIAGLWGVV 60
>gnl|CDD|218148 pfam04557, tRNA_synt_1c_R2, Glutaminyl-tRNA synthetase,
non-specific RNA binding region part 2. This is a
region found N terminal to the catalytic domain of
glutaminyl-tRNA synthetase (EC 6.1.1.18) in eukaryotes
but not in Escherichia coli. This region is thought to
bind RNA in a non-specific manner, enhancing
interactions between the tRNA and enzyme, but is not
essential for enzyme function.
Length = 83
Score = 34.6 bits (80), Expect = 0.012
Identities = 13/26 (50%), Positives = 13/26 (50%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVN 406
KKKKKKKKKK K KK
Sbjct: 22 LVKKKKKKKKKKAEDTAATAKAKKAT 47
Score = 33.1 bits (76), Expect = 0.045
Identities = 13/34 (38%), Positives = 16/34 (47%)
Query: 383 KKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPE 416
KKKKKKKKKK + K K + +S
Sbjct: 23 VKKKKKKKKKKAEDTAATAKAKKATAEDVSEGAM 56
Score = 31.2 bits (71), Expect = 0.18
Identities = 11/33 (33%), Positives = 16/33 (48%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISS 413
+KKKKKKKK + K KK + + +
Sbjct: 24 KKKKKKKKKKAEDTAATAKAKKATAEDVSEGAM 56
Score = 30.0 bits (68), Expect = 0.59
Identities = 10/25 (40%), Positives = 13/25 (52%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKV 405
+ KKKKKKKKK+ + K
Sbjct: 20 ADLVKKKKKKKKKKAEDTAATAKAK 44
Score = 29.3 bits (66), Expect = 1.0
Identities = 9/25 (36%), Positives = 11/25 (44%)
Query: 386 KKKKKKKKRKKKKKKKKKKVNNSIQ 410
KKKKKK+KKK +
Sbjct: 22 LVKKKKKKKKKKAEDTAATAKAKKA 46
Score = 28.5 bits (64), Expect = 1.6
Identities = 10/28 (35%), Positives = 15/28 (53%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVNNS 408
P+ + KKKKK+KKKK + +
Sbjct: 16 PKTEADLVKKKKKKKKKKAEDTAATAKA 43
Score = 28.1 bits (63), Expect = 2.6
Identities = 11/36 (30%), Positives = 13/36 (36%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEI 417
E KKKKKKK+KK + E
Sbjct: 19 EADLVKKKKKKKKKKAEDTAATAKAKKATAEDVSEG 54
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 36.2 bits (84), Expect = 0.016
Identities = 15/36 (41%), Positives = 23/36 (63%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPE 416
+K+KKKKK+ KK KK+KK KK+K+ + +
Sbjct: 153 EKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKK 188
Score = 36.2 bits (84), Expect = 0.017
Identities = 17/31 (54%), Positives = 19/31 (61%), Gaps = 2/31 (6%)
Query: 376 DNMRFPEKKKKKKK--KKKKRKKKKKKKKKK 404
EKK KK+K + K KKKKKKKKKK
Sbjct: 163 VKKEKKEKKDKKEKMVEPKGSKKKKKKKKKK 193
Score = 35.1 bits (81), Expect = 0.045
Identities = 15/24 (62%), Positives = 21/24 (87%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
EKK+KKKKK+ K++KK+KK KK+
Sbjct: 152 EEKKEKKKKKEVKKEKKEKKDKKE 175
Score = 33.5 bits (77), Expect = 0.13
Identities = 16/34 (47%), Positives = 23/34 (67%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRP 415
E+KK+KKKKK+ +K+KK+KK KK S+
Sbjct: 152 EEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKK 185
Score = 33.2 bits (76), Expect = 0.20
Identities = 15/23 (65%), Positives = 21/23 (91%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
E++KK+KKKKK+ KK+KK+KK K
Sbjct: 151 EEEKKEKKKKKEVKKEKKEKKDK 173
Score = 29.7 bits (67), Expect = 2.3
Identities = 11/28 (39%), Positives = 20/28 (71%)
Query: 378 MRFPEKKKKKKKKKKKRKKKKKKKKKKV 405
E + ++++KK+K+KKK+ KK+KK
Sbjct: 142 KVEKEAEVEEEEKKEKKKKKEVKKEKKE 169
Score = 29.3 bits (66), Expect = 3.6
Identities = 12/23 (52%), Positives = 21/23 (91%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
E ++++KK+KKK+K+ KK+KK+K
Sbjct: 148 EVEEEEKKEKKKKKEVKKEKKEK 170
>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
(DUF1754). This is a eukaryotic protein family of
unknown function.
Length = 90
Score = 34.3 bits (79), Expect = 0.017
Identities = 13/31 (41%), Positives = 17/31 (54%)
Query: 384 KKKKKKKKKKRKKKKKKKKKKVNNSIQISSR 414
KKKKKKKK+K K K++ + SS
Sbjct: 18 DVKKKKKKKKKKNKSKEEVVTEKEEEEKSSA 48
Score = 33.2 bits (76), Expect = 0.053
Identities = 14/22 (63%), Positives = 16/22 (72%)
Query: 383 KKKKKKKKKKKRKKKKKKKKKK 404
K KK KKKK+KKKKK K K+
Sbjct: 13 KGKKIDVKKKKKKKKKKNKSKE 34
Score = 32.8 bits (75), Expect = 0.076
Identities = 14/33 (42%), Positives = 18/33 (54%)
Query: 383 KKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRP 415
KK KKKKKK+KKK K K++ V +
Sbjct: 15 KKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSS 47
Score = 32.0 bits (73), Expect = 0.14
Identities = 14/23 (60%), Positives = 17/23 (73%)
Query: 383 KKKKKKKKKKKRKKKKKKKKKKV 405
KK KKKKK+KKKK K K++V
Sbjct: 14 GKKIDVKKKKKKKKKKNKSKEEV 36
Score = 31.2 bits (71), Expect = 0.24
Identities = 14/25 (56%), Positives = 14/25 (56%)
Query: 384 KKKKKKKKKKRKKKKKKKKKKVNNS 408
K K K KK KKKKKKKK N
Sbjct: 7 GGKLKLKGKKIDVKKKKKKKKKKNK 31
Score = 31.2 bits (71), Expect = 0.26
Identities = 14/22 (63%), Positives = 15/22 (68%)
Query: 383 KKKKKKKKKKKRKKKKKKKKKK 404
K K KK KK+KKKKKKK K
Sbjct: 11 KLKGKKIDVKKKKKKKKKKNKS 32
Score = 30.9 bits (70), Expect = 0.29
Identities = 14/52 (26%), Positives = 26/52 (50%)
Query: 383 KKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKKISLANLPKL 434
KKKKKKKK K +++ +K+++ +S + + E + K + L
Sbjct: 22 KKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDEDNEKIEQEEDGMNL 73
Score = 30.5 bits (69), Expect = 0.44
Identities = 14/25 (56%), Positives = 15/25 (60%)
Query: 384 KKKKKKKKKKRKKKKKKKKKKVNNS 408
K K KK K+KKKKKKKK K
Sbjct: 11 KLKGKKIDVKKKKKKKKKKNKSKEE 35
Score = 29.7 bits (67), Expect = 0.77
Identities = 12/23 (52%), Positives = 18/23 (78%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
+KKKKKKKKK K K++ +K++
Sbjct: 20 KKKKKKKKKKNKSKEEVVTEKEE 42
Score = 29.7 bits (67), Expect = 0.89
Identities = 16/26 (61%), Positives = 16/26 (61%)
Query: 383 KKKKKKKKKKKRKKKKKKKKKKVNNS 408
K K K KK KKKKKKKKK N S
Sbjct: 7 GGKLKLKGKKIDVKKKKKKKKKKNKS 32
>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal. This
domain is found to the N-terminus of bacterial signal
peptidases of the S49 family (pfam01343).
Length = 154
Score = 35.6 bits (83), Expect = 0.021
Identities = 11/23 (47%), Positives = 14/23 (60%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
E K +K +KK K K K +KKK
Sbjct: 67 ELKAWEKAEKKAEKAKAKAEKKK 89
Score = 32.5 bits (75), Expect = 0.20
Identities = 11/23 (47%), Positives = 15/23 (65%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
+KK +K K K KKK KK++ K
Sbjct: 74 AEKKAEKAKAKAEKKKAKKEEPK 96
Score = 31.3 bits (72), Expect = 0.54
Identities = 11/23 (47%), Positives = 16/23 (69%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
EKK +K K K ++KK KK++ K
Sbjct: 75 EKKAEKAKAKAEKKKAKKEEPKP 97
Score = 31.3 bits (72), Expect = 0.58
Identities = 10/19 (52%), Positives = 13/19 (68%)
Query: 384 KKKKKKKKKKRKKKKKKKK 402
K +KK +K K K +KKK K
Sbjct: 73 KAEKKAEKAKAKAEKKKAK 91
Score = 31.0 bits (71), Expect = 0.62
Identities = 9/22 (40%), Positives = 14/22 (63%)
Query: 383 KKKKKKKKKKKRKKKKKKKKKK 404
K +K +KK ++ K K +KKK
Sbjct: 69 KAWEKAEKKAEKAKAKAEKKKA 90
Score = 30.6 bits (70), Expect = 0.86
Identities = 9/23 (39%), Positives = 16/23 (69%)
Query: 383 KKKKKKKKKKKRKKKKKKKKKKV 405
K +K K K +K+K KK++ K ++
Sbjct: 77 KAEKAKAKAEKKKAKKEEPKPRL 99
Score = 27.9 bits (63), Expect = 7.3
Identities = 9/21 (42%), Positives = 14/21 (66%)
Query: 382 EKKKKKKKKKKKRKKKKKKKK 402
+K K K +KKK KK++ K +
Sbjct: 78 AEKAKAKAEKKKAKKEEPKPR 98
>gnl|CDD|234547 TIGR04330, cas_Cpf1, CRISPR-associated protein Cpf1, subtype
PREFRAN. This family is the long protein of a novel
CRISPR subtype, PREFRAN, which is most common in
Prevotella and Francisella, although widely distributed.
The PREFRAN type has Cas1, Cas2, and Cas4, but lacks the
helicase Cas3 and endonuclease Cas3-HD.
Length = 1287
Score = 36.7 bits (85), Expect = 0.033
Identities = 26/100 (26%), Positives = 39/100 (39%), Gaps = 15/100 (15%)
Query: 349 LLTHFSQRYAKLPRLNKDLSE-----------NVGIAFDN----MRFPEKKKKKKKKKKK 393
LL R L K + + NV ++FD K KK KK KK
Sbjct: 33 LLEQDKARAESYKELKKIIDKYHKDFIEETLSNVELSFDLLEEYFDLYNKLKKFKKDKKD 92
Query: 394 RKKKKKKKKKKVNNSIQISSRPEICREFDSKKISLANLPK 433
K++ +K++ S+Q + EI + SKK+ L
Sbjct: 93 LAKEQDTLRKQIVKSLQDNLPEEIFKNIFSKKLLKGVLED 132
>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
Length = 742
Score = 36.0 bits (83), Expect = 0.047
Identities = 19/58 (32%), Positives = 24/58 (41%), Gaps = 6/58 (10%)
Query: 352 HFSQRYAKLPRLNKDLSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSI 409
+F + L A MR P+ K +K KKK K KKKKKV +S
Sbjct: 95 NFFDSKNNSKQFAGPL------AISLMRKPKPKTEKLKKKITVNKSTNKKKKKVLSSK 146
Score = 28.6 bits (64), Expect = 9.7
Identities = 14/41 (34%), Positives = 21/41 (51%)
Query: 363 LNKDLSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKK 403
L+ + + + N +F + K KK K K+KKK KKK
Sbjct: 46 LDINKKLHEKLDKKNKKFNKTDDLKDSKKTKLKQKKKIKKK 86
>gnl|CDD|220611 pfam10169, Laps, Learning-associated protein. This is a family of
121-amino acid secretory proteins. Laps functions in the
regulation of neuronal cell adhesion and/or movement and
synapse attachment. Laps binds to the ApC/EBP (Aplysia
CCAAT/enhancer binding protein) promoter and activates
the transcription of ApC/EBP mRNA.
Length = 124
Score = 34.1 bits (78), Expect = 0.049
Identities = 15/30 (50%), Positives = 21/30 (70%), Gaps = 2/30 (6%)
Query: 379 RFPE--KKKKKKKKKKKRKKKKKKKKKKVN 406
++P +++ KK KKKR+KKKKKKK N
Sbjct: 93 QYPAWMNQRQAKKLKKKREKKKKKKKGAKN 122
Score = 30.6 bits (69), Expect = 0.75
Identities = 12/26 (46%), Positives = 19/26 (73%)
Query: 380 FPEKKKKKKKKKKKRKKKKKKKKKKV 405
+++ KK KKK+++KKKKKK K +
Sbjct: 98 MNQRQAKKLKKKREKKKKKKKGAKNL 123
Score = 28.7 bits (64), Expect = 3.5
Identities = 12/22 (54%), Positives = 15/22 (68%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKK 403
KK KKK++KKK+KKK K
Sbjct: 103 AKKLKKKREKKKKKKKGAKNLA 124
Score = 28.3 bits (63), Expect = 4.7
Identities = 12/24 (50%), Positives = 18/24 (75%)
Query: 384 KKKKKKKKKKRKKKKKKKKKKVNN 407
+++ KK KK+++KKKKKKK N
Sbjct: 99 NQRQAKKLKKKREKKKKKKKGAKN 122
Score = 27.9 bits (62), Expect = 5.1
Identities = 12/23 (52%), Positives = 16/23 (69%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
+ KK KKK++KK+KKKK K
Sbjct: 102 QAKKLKKKREKKKKKKKGAKNLA 124
>gnl|CDD|222592 pfam14204, Ribosomal_L18_c, Ribosomal L18 C-terminal region. This
domain is the C-terminal end of ribosomal L18/L5
proteins.
Length = 93
Score = 32.9 bits (76), Expect = 0.053
Identities = 10/22 (45%), Positives = 14/22 (63%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKK 402
P +KKK+KK+ K K+ KK
Sbjct: 59 PSRKKKEKKEVKAESKRYNAKK 80
Score = 29.8 bits (68), Expect = 0.70
Identities = 11/30 (36%), Positives = 17/30 (56%)
Query: 374 AFDNMRFPEKKKKKKKKKKKRKKKKKKKKK 403
A +R +KKK+KK+ K + K+ KK
Sbjct: 51 AHAAIRADPSRKKKEKKEVKAESKRYNAKK 80
Score = 28.7 bits (65), Expect = 1.7
Identities = 8/20 (40%), Positives = 14/20 (70%)
Query: 385 KKKKKKKKKRKKKKKKKKKK 404
+KKK+KK+ + + K+ KK
Sbjct: 61 RKKKEKKEVKAESKRYNAKK 80
Score = 27.1 bits (61), Expect = 6.2
Identities = 9/26 (34%), Positives = 16/26 (61%), Gaps = 2/26 (7%)
Query: 382 EKKKKKKKKKKKRKKK--KKKKKKKV 405
EKK+ K + K+ KK +++K +V
Sbjct: 65 EKKEVKAESKRYNAKKLTLEQRKARV 90
>gnl|CDD|163212 TIGR03307, PhnP, phosphonate metabolism protein PhnP. This family
of proteins found in operons encoding phosphonate C-P
lyase systems as is observed in E. coli and is a member
of the metallo-beta-lactamase superfamily (pfam00753).
As defined by this model, all instances of this protein
are associated with the C-P lyase, but not all genomes
containing the C-P lyase system contain phnP.
Length = 238
Score = 35.1 bits (81), Expect = 0.058
Identities = 26/96 (27%), Positives = 47/96 (48%), Gaps = 6/96 (6%)
Query: 286 GHKITYSGDT--MPCDALVSIGKNS-DLLIHEATHEDELEKEAALKMHSTVSQAIRIGRE 342
G ++ Y DT +P D + + D+LI + +H + A + H+ +++A+ I +
Sbjct: 144 GQRVAYLTDTAGLPPDTEAFLKNHPLDVLILDCSHP---PQSDAPRNHNDLTRALAINEQ 200
Query: 343 MRAKFVLLTHFSQRYAKLPRLNKDLSENVGIAFDNM 378
+R K V+LTH S + N DL V + +D
Sbjct: 201 LRPKQVILTHISHQLDAWLMENPDLPSGVAVGYDGQ 236
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 35.0 bits (81), Expect = 0.062
Identities = 31/103 (30%), Positives = 44/103 (42%), Gaps = 7/103 (6%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNSIQI----SSRPEICREFDSKKISLANLPKLKLF 437
KKKK+ K K K+ KKKKKK + + + RP+ E S +L + P+ K
Sbjct: 83 RLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSERISWAPTLLDSPRRKSS 142
Query: 438 YPALKAMFAEYQDEIENRAIRRNLKQESTYLARYPSENDKEKP 480
+ + ++ R IRR Q AR E KEK
Sbjct: 143 RSSTVQNKEATHERLKEREIRRKKIQAK---ARKRKEKKKEKE 182
Score = 31.6 bits (72), Expect = 0.72
Identities = 13/44 (29%), Positives = 20/44 (45%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKK 425
+++ KKKK+ K K K+ KKK + P+ KK
Sbjct: 80 REERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKK 123
Score = 28.9 bits (65), Expect = 5.3
Identities = 11/27 (40%), Positives = 20/27 (74%)
Query: 379 RFPEKKKKKKKKKKKRKKKKKKKKKKV 405
R E++ ++KK + K +K+K+KKK+K
Sbjct: 156 RLKEREIRRKKIQAKARKRKEKKKEKE 182
>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
This is a family of fungal proteins whose function is
unknown.
Length = 130
Score = 33.4 bits (77), Expect = 0.080
Identities = 9/26 (34%), Positives = 19/26 (73%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNN 407
E+ +K+K+ +K R+KK K+++K+
Sbjct: 105 ERAEKEKRTRKNREKKFKRRQKEKEK 130
Score = 31.9 bits (73), Expect = 0.23
Identities = 8/23 (34%), Positives = 17/23 (73%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
K++ +K+K+ RK ++KK K++
Sbjct: 102 RTKERAEKEKRTRKNREKKFKRR 124
Score = 31.9 bits (73), Expect = 0.24
Identities = 9/28 (32%), Positives = 19/28 (67%)
Query: 377 NMRFPEKKKKKKKKKKKRKKKKKKKKKK 404
+R ++ K++ +K+KR +K ++KK K
Sbjct: 95 ALRLRRERTKERAEKEKRTRKNREKKFK 122
Score = 29.5 bits (67), Expect = 1.5
Identities = 13/41 (31%), Positives = 25/41 (60%), Gaps = 1/41 (2%)
Query: 366 DLSE-NVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKKV 405
DL + N + + +R EKKKK+K+ KKR+ + ++++
Sbjct: 63 DLKKHNAKVEKELLREKEKKKKRKRPGKKRRIALRLRRERT 103
Score = 29.5 bits (67), Expect = 1.7
Identities = 6/24 (25%), Positives = 17/24 (70%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKV 405
++++ K++ +K K+ +K ++KK
Sbjct: 98 LRRERTKERAEKEKRTRKNREKKF 121
Score = 29.2 bits (66), Expect = 1.9
Identities = 7/23 (30%), Positives = 19/23 (82%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
+++ +K+K+ +K ++KK K+++K
Sbjct: 104 KERAEKEKRTRKNREKKFKRRQK 126
Score = 29.2 bits (66), Expect = 2.0
Identities = 8/23 (34%), Positives = 18/23 (78%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
E+ K++ +K+K+ +K ++KK K+
Sbjct: 101 ERTKERAEKEKRTRKNREKKFKR 123
Score = 28.4 bits (64), Expect = 3.7
Identities = 5/22 (22%), Positives = 15/22 (68%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKK 403
+ ++++ K+R +K+K+ +K
Sbjct: 94 IALRLRRERTKERAEKEKRTRK 115
Score = 27.6 bits (62), Expect = 6.7
Identities = 8/54 (14%), Positives = 22/54 (40%), Gaps = 10/54 (18%)
Query: 351 THFSQRYAKLPRLNKDLSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKK 404
+ AK+ + E +K+K+ KK++ + ++++ K+
Sbjct: 62 IDLKKHNAKVEKELLREKEK----------KKKRKRPGKKRRIALRLRRERTKE 105
>gnl|CDD|203848 pfam08079, Ribosomal_L30_N, Ribosomal L30 N-terminal domain. This
presumed domain is found at the N-terminus of Ribosomal
L30 proteins and has been termed RL30NT or NUC018.
Length = 71
Score = 31.8 bits (73), Expect = 0.086
Identities = 11/25 (44%), Positives = 17/25 (68%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVN 406
EK + K+ KK+ KK +KKK+K+
Sbjct: 10 EKLRAKRAKKRAAKKAARKKKRKLI 34
Score = 31.4 bits (72), Expect = 0.13
Identities = 9/27 (33%), Positives = 17/27 (62%)
Query: 379 RFPEKKKKKKKKKKKRKKKKKKKKKKV 405
R + + K+ KK+ +K +KKK+K +
Sbjct: 8 RNEKLRAKRAKKRAAKKAARKKKRKLI 34
Score = 30.3 bits (69), Expect = 0.31
Identities = 10/23 (43%), Positives = 15/23 (65%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
++ KK+ KK RKKK+K K+
Sbjct: 15 KRAKKRAAKKAARKKKRKLIFKR 37
Score = 30.3 bits (69), Expect = 0.37
Identities = 10/23 (43%), Positives = 14/23 (60%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
K+ KK+ KK +KKK+K K
Sbjct: 14 AKRAKKRAAKKAARKKKRKLIFK 36
Score = 29.5 bits (67), Expect = 0.60
Identities = 8/24 (33%), Positives = 15/24 (62%)
Query: 383 KKKKKKKKKKKRKKKKKKKKKKVN 406
KK+K+ +K + ++ KK+ KK
Sbjct: 4 KKRKRNEKLRAKRAKKRAAKKAAR 27
Score = 27.2 bits (61), Expect = 3.9
Identities = 9/24 (37%), Positives = 15/24 (62%)
Query: 384 KKKKKKKKKKRKKKKKKKKKKVNN 407
KK+K+ +K + K+ KK+ KK
Sbjct: 4 KKRKRNEKLRAKRAKKRAAKKAAR 27
>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe. JAK1 is a
tyrosine kinase protein essential in signaling type I
and type II cytokines. It interacts with the gamma chain
of type I cytokine receptors to elicit signals from the
IL-2 receptor family, the IL-4 receptor family, the
gp130 receptor family, ciliary neurotrophic factor
receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
Leptin-R). It also is involved in transducing a signal
by type I (IFN-alpha/beta) and type II (IFN-gamma)
interferons, and members of the IL-10 family via type II
cytokine receptors. JAK (also called Just Another
Kinase) is a family of intracellular, non-receptor
tyrosine kinases that transduce cytokine-mediated
signals via the JAK-STAT pathway. The JAK family in
mammals consists of 4 members: JAK1, JAK2, JAK3 and
TYK2. JAKs are composed of seven JAK homology (JH)
domains (JH1-JH7) . The C-terminal JH1 domain is the
main catalytic domain, followed by JH2, which is often
referred to as a pseudokinase domain, followed by
JH3-JH4 which is homologous to the SH2 domain, and
lastly JH5-JH7 which is a FERM domain. Named after
Janus, the two-faced Roman god of doorways, JAKs possess
two near-identical phosphate-transferring domains; one
which displays the kinase activity (JH1), while the
other negatively regulates the kinase activity of the
first (JH2). The FERM domain has a cloverleaf tripart
structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
C-lobe/F3 within the FERM domain is part of the PH
domain family. The FERM domain is found in the
cytoskeletal-associated proteins such as ezrin, moesin,
radixin, 4.1R, and merlin. These proteins provide a link
between the membrane and cytoskeleton and are involved
in signal transduction pathways. The FERM domain is also
found in protein tyrosine phosphatases (PTPs) , the
tyrosine kinases FAK and JAK, in addition to other
proteins involved in signaling. This domain is
structurally similar to the PH and PTB domains and
consequently is capable of binding to both peptides and
phospholipids at different sites.
Length = 198
Score = 34.0 bits (78), Expect = 0.089
Identities = 28/81 (34%), Positives = 32/81 (39%), Gaps = 18/81 (22%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNS-IQISSRPEICR-----------EFDSKKISLA 429
EKKKK K KK K K KK + KKK S PEI D+KK+
Sbjct: 90 EKKKKGKSKKNKLKGKKDEDKKKAREGWNNFSYFPEITHIVIKESTVTINRQDNKKM--- 146
Query: 430 NLPKLKLFYPALKAMFAEYQD 450
+LKL FA D
Sbjct: 147 ---ELKLSSRDEALSFAALVD 164
>gnl|CDD|221654 pfam12589, WBS_methylT, Methyltransferase involved in
Williams-Beuren syndrome. This domain family is found
in eukaryotes, and is typically between 72 and 83 amino
acids in length. The family is found in association with
pfam08241. This family is made up of
S-adenosylmethionine-dependent methyltransferases. The
proteins are deleted in Williams-Beuren syndrome (WBS),
a complex developmental disorder with multisystemic
manifestations including supravalvular aortic stenosis
(SVAS) and a specific cognitive phenotype.
Length = 85
Score = 31.9 bits (73), Expect = 0.11
Identities = 13/40 (32%), Positives = 20/40 (50%), Gaps = 4/40 (10%)
Query: 366 DLSENVGIAFDNMRFPEKKKKKKKKKKKRKKKK----KKK 401
+ E + + ++KKKKKKK +KK K +KK
Sbjct: 22 EDDEQIDASKVRRISQRNRRKKKKKKKLKKKSKEWILRKK 61
Score = 30.3 bits (69), Expect = 0.47
Identities = 12/45 (26%), Positives = 20/45 (44%), Gaps = 1/45 (2%)
Query: 359 KLPRLNKDLSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKK 403
+LP + + ++ ++KKK+KKK KKK K
Sbjct: 11 QLPNG-LGEEGEEDDEQIDASKVRRISQRNRRKKKKKKKLKKKSK 54
Score = 29.6 bits (67), Expect = 0.83
Identities = 10/20 (50%), Positives = 15/20 (75%)
Query: 385 KKKKKKKKKRKKKKKKKKKK 404
++ ++KK+KKKK KKK K
Sbjct: 35 ISQRNRRKKKKKKKLKKKSK 54
Score = 28.8 bits (65), Expect = 1.3
Identities = 17/44 (38%), Positives = 25/44 (56%), Gaps = 4/44 (9%)
Query: 366 DLSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKK----KKKKKV 405
D S+ I+ N R +KKKK KKK K+ +KK ++ K+V
Sbjct: 28 DASKVRRISQRNRRKKKKKKKLKKKSKEWILRKKEQMRRRGKEV 71
Score = 28.0 bits (63), Expect = 2.9
Identities = 9/29 (31%), Positives = 19/29 (65%)
Query: 376 DNMRFPEKKKKKKKKKKKRKKKKKKKKKK 404
+ + + ++ ++ ++K+KKKKK KKK
Sbjct: 25 EQIDASKVRRISQRNRRKKKKKKKLKKKS 53
>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
membranes [Cell envelope biogenesis, outer membrane].
Length = 244
Score = 34.0 bits (78), Expect = 0.12
Identities = 25/94 (26%), Positives = 33/94 (35%), Gaps = 11/94 (11%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKKISLANLPKLKLFYPA 440
PEKK KK K K K K K K K K + S+ + +
Sbjct: 90 PEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPSKTAAKAPAAPNQPARPPSAASASGAAT 149
Query: 441 LKAMFAEYQDEIENRAIRRNLKQESTYLARYPSE 474
+ A Y + RAIRR RYP++
Sbjct: 150 GPS--ASYLSGL-RRAIRRA--------PRYPAQ 172
Score = 30.1 bits (68), Expect = 2.1
Identities = 12/26 (46%), Positives = 15/26 (57%)
Query: 380 FPEKKKKKKKKKKKRKKKKKKKKKKV 405
P++K K +KK KK K K K K K
Sbjct: 83 KPKEKPKPEKKPKKPKPKPKPKPKPK 108
Score = 30.1 bits (68), Expect = 2.1
Identities = 11/30 (36%), Positives = 15/30 (50%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVNNSIQ 410
P+ K+K K +KK +K K K K K
Sbjct: 82 PKPKEKPKPEKKPKKPKPKPKPKPKPKPKV 111
Score = 29.4 bits (66), Expect = 3.4
Identities = 13/27 (48%), Positives = 15/27 (55%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVNN 407
P K K+K K +KK KK K K K K
Sbjct: 81 PPKPKEKPKPEKKPKKPKPKPKPKPKP 107
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 33.5 bits (77), Expect = 0.12
Identities = 27/90 (30%), Positives = 37/90 (41%), Gaps = 5/90 (5%)
Query: 379 RFPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKKISLANLPKLKLFY 438
+ K KKKK KKKK K K KK KK + S + + + D K L L
Sbjct: 81 KQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSELK 140
Query: 439 P---ALKAMFAEYQDEIENRAIRRNLKQES 465
P AL YQ ++ + K+E+
Sbjct: 141 PRKYALHKDI--YQSRLDRKRRAEVAKKEA 168
>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6. The surfeit locus
protein SURF-6 is shown to be a component of the
nucleolar matrix and has a strong binding capacity for
nucleic acids.
Length = 206
Score = 33.8 bits (78), Expect = 0.12
Identities = 12/21 (57%), Positives = 16/21 (76%)
Query: 385 KKKKKKKKKRKKKKKKKKKKV 405
KK+K KK +KKKK KKK ++
Sbjct: 185 KKRKDDKKNKKKKKAKKKGRI 205
Score = 33.0 bits (76), Expect = 0.22
Identities = 12/22 (54%), Positives = 16/22 (72%)
Query: 384 KKKKKKKKKKRKKKKKKKKKKV 405
KK+K KK K+KKK KKK + +
Sbjct: 185 KKRKDDKKNKKKKKAKKKGRIL 206
Score = 31.5 bits (72), Expect = 0.58
Identities = 12/22 (54%), Positives = 15/22 (68%)
Query: 383 KKKKKKKKKKKRKKKKKKKKKK 404
KK+K KK KK+KK KKK +
Sbjct: 185 KKRKDDKKNKKKKKAKKKGRIL 206
Score = 31.5 bits (72), Expect = 0.61
Identities = 10/24 (41%), Positives = 21/24 (87%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
E++++K++++K RKK+K+K+ KK
Sbjct: 8 LEQRRRKREQRKARKKQKRKEAKK 31
Score = 31.5 bits (72), Expect = 0.68
Identities = 12/26 (46%), Positives = 19/26 (73%)
Query: 385 KKKKKKKKKRKKKKKKKKKKVNNSIQ 410
K+K+K+KKK KK+ K++K+KV
Sbjct: 149 KRKEKQKKKSKKEWKERKEKVEKKKA 174
Score = 31.5 bits (72), Expect = 0.71
Identities = 11/34 (32%), Positives = 22/34 (64%)
Query: 383 KKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPE 416
+K++++K +KK+K+K+ KKK+ S + E
Sbjct: 13 RKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNE 46
Score = 31.1 bits (71), Expect = 0.93
Identities = 13/30 (43%), Positives = 18/30 (60%)
Query: 372 GIAFDNMRFPEKKKKKKKKKKKRKKKKKKK 401
I F + F + ++ KK K K+KKKKKK
Sbjct: 63 NIVFSKVEFADGEQAKKDLKLKKKKKKKKT 92
Score = 30.0 bits (68), Expect = 1.9
Identities = 8/23 (34%), Positives = 21/23 (91%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
++++K++++K ++K+K+K+ KKK
Sbjct: 10 QRRRKREQRKARKKQKRKEAKKK 32
Score = 30.0 bits (68), Expect = 2.1
Identities = 10/21 (47%), Positives = 14/21 (66%)
Query: 382 EKKKKKKKKKKKRKKKKKKKK 402
++K KK KKKK+ KKK +
Sbjct: 186 KRKDDKKNKKKKKAKKKGRIL 206
Score = 29.6 bits (67), Expect = 3.0
Identities = 16/28 (57%), Positives = 19/28 (67%), Gaps = 5/28 (17%)
Query: 382 EKKKKKKKKK-----KKRKKKKKKKKKK 404
EKKK +++KK KKRK KK KKKK
Sbjct: 170 EKKKAERQKKREENLKKRKDDKKNKKKK 197
Score = 28.4 bits (64), Expect = 6.0
Identities = 9/50 (18%), Positives = 26/50 (52%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKKISLANL 431
+++K +KK+K+++ KKK+ +K + + ++ + + N+
Sbjct: 15 REQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAEGNI 64
Score = 28.0 bits (63), Expect = 10.0
Identities = 13/28 (46%), Positives = 19/28 (67%), Gaps = 5/28 (17%)
Query: 382 EKKKKKKKK-----KKKRKKKKKKKKKK 404
+KK +++KK KK++ KK KKKKK
Sbjct: 171 KKKAERQKKREENLKKRKDDKKNKKKKK 198
>gnl|CDD|240254 PTZ00069, PTZ00069, 60S ribosomal protein L5; Provisional.
Length = 300
Score = 34.3 bits (79), Expect = 0.13
Identities = 23/53 (43%), Positives = 26/53 (49%), Gaps = 2/53 (3%)
Query: 352 HFSQRYAKLPRLNKDLSENVGIAFDNMRF-PEKKKKKKKKKKKRKKKKKKKKK 403
FS+ Y K L + A +R P K KKKKKKKKK KK K KK
Sbjct: 223 QFSK-YIKAGVGPDSLEDMYKKAHAAIRANPSKVKKKKKKKKKVVHKKYKTKK 274
>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
Length = 330
Score = 34.0 bits (79), Expect = 0.14
Identities = 9/22 (40%), Positives = 15/22 (68%)
Query: 384 KKKKKKKKKKRKKKKKKKKKKV 405
K +KKK+K++ K K K K ++
Sbjct: 71 KAQKKKEKQEAKAAKAKSKPRL 92
Score = 33.7 bits (78), Expect = 0.19
Identities = 15/56 (26%), Positives = 27/56 (48%), Gaps = 9/56 (16%)
Query: 350 LTHFSQRYAKLPRLNKDLSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKKV 405
+T+ +++Y K++ E + A + E K K +KKK K++ K K K
Sbjct: 42 VTNLNEQY-------KEMKEELKAAL--LDKKELKAWHKAQKKKEKQEAKAAKAKS 88
Score = 31.7 bits (73), Expect = 0.89
Identities = 8/22 (36%), Positives = 14/22 (63%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKK 403
K +KKK+K++ + K K K +
Sbjct: 70 HKAQKKKEKQEAKAAKAKSKPR 91
Score = 29.8 bits (68), Expect = 3.3
Identities = 9/26 (34%), Positives = 15/26 (57%)
Query: 383 KKKKKKKKKKKRKKKKKKKKKKVNNS 408
KK+ K K ++KK+K++ K S
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAAKAKS 88
>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
complex component). RNases H are enzymes that
specifically hydrolyse RNA when annealed to a
complementary DNA and are present in all living
organisms. In yeast RNase H2 is composed of a complex of
three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
family represents the homologues of Ydr279p. It is not
known whether non yeast proteins in this family fulfil
the same function.
Length = 287
Score = 33.8 bits (78), Expect = 0.17
Identities = 12/37 (32%), Positives = 17/37 (45%)
Query: 377 NMRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISS 413
+ K++ KKKK K+ K K KK +I S
Sbjct: 249 EKKRKSKEEIKKKKPKESKGVKALKKVVAKGMKKIDS 285
Score = 32.7 bits (75), Expect = 0.32
Identities = 17/68 (25%), Positives = 25/68 (36%)
Query: 337 IRIGREMRAKFVLLTHFSQRYAKLPRLNKDLSENVGIAFDNMRFPEKKKKKKKKKKKRKK 396
+ K +L+ F L K E EKK+K K++ KK+K
Sbjct: 204 PPDLYKELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKP 263
Query: 397 KKKKKKKK 404
K+ K K
Sbjct: 264 KESKGVKA 271
Score = 30.8 bits (70), Expect = 1.5
Identities = 10/37 (27%), Positives = 15/37 (40%)
Query: 368 SENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKK 404
+E+ + KKKK K+ K + KK K
Sbjct: 244 AESRAEKKRKSKEEIKKKKPKESKGVKALKKVVAKGM 280
Score = 28.8 bits (65), Expect = 5.2
Identities = 13/40 (32%), Positives = 18/40 (45%)
Query: 365 KDLSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKK 404
+D+ A + E+ KKKK K+ K K KK K
Sbjct: 239 EDVEAAESRAEKKRKSKEEIKKKKPKESKGVKALKKVVAK 278
>gnl|CDD|227360 COG5027, SAS2, Histone acetyltransferase (MYST family) [Chromatin
structure and dynamics].
Length = 395
Score = 34.0 bits (78), Expect = 0.18
Identities = 27/122 (22%), Positives = 47/122 (38%), Gaps = 12/122 (9%)
Query: 357 YAKLPRLNKDLSENVGIAFDNMR----FPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQIS 412
Y LN+ L E + N+ P++KK+ +K KK++K K + N ++Q+
Sbjct: 38 YVHYVELNRRLDEWITADLINLGAAISIPKRKKQTEKGKKEKKPKVSDRMDLDNENVQLE 97
Query: 413 SRPEICREFDSKKISLANLPKLKLFYPALKAMFAEYQDEIENRAIRRNLKQESTYLARYP 472
I E + +++ K+ P N N + E Y + YP
Sbjct: 98 MLYSISNEREIRQL---RFGGSKVQNP--HEGARVKNI---NEIKLGNYEIEPWYFSPYP 149
Query: 473 SE 474
E
Sbjct: 150 EE 151
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 34.3 bits (79), Expect = 0.19
Identities = 20/62 (32%), Positives = 27/62 (43%), Gaps = 3/62 (4%)
Query: 378 MRFPEKKK---KKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKKISLANLPKL 434
R K K K +K K +KK+KKKKK + S + S R +K L + P
Sbjct: 1156 QRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDN 1215
Query: 435 KL 436
K
Sbjct: 1216 KK 1217
Score = 30.4 bits (69), Expect = 2.4
Identities = 16/58 (27%), Positives = 25/58 (43%), Gaps = 1/58 (1%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKK-ISLANLPKLKLF 437
P KKK KK+ + KKKKK + + + S+ + + S+ L K K
Sbjct: 1305 PTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKSD 1362
>gnl|CDD|224157 COG1236, YSH1, Predicted exonuclease of the beta-lactamase fold
involved in RNA processing [Translation, ribosomal
structure and biogenesis].
Length = 427
Score = 33.9 bits (78), Expect = 0.20
Identities = 15/49 (30%), Positives = 18/49 (36%), Gaps = 5/49 (10%)
Query: 133 PDTCILLDCG-EGTYSQLVRLYGSAVDTLLSQLSAVYISHLHADHHLGL 180
T ILLDCG R + AV ++H H DH L
Sbjct: 22 GGTRILLDCGLFPGDPSPERPLLPPFPKV----DAVLLTHAHLDHIGAL 66
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 33.9 bits (78), Expect = 0.21
Identities = 18/53 (33%), Positives = 30/53 (56%), Gaps = 4/53 (7%)
Query: 379 RFPEKKKKKKKKKKKRKKKKKKKKKKV---NNSIQISSRPEICREFDSKKISL 428
R EK + K+KKK KKK++KKKK++ I+ +E ++K ++L
Sbjct: 319 RDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERIEKLEVQATDKE-ENKTVAL 370
>gnl|CDD|219913 pfam08576, DUF1764, Eukaryotic protein of unknown function
(DUF1764). This is a family of eukaryotic proteins of
unknown function. This family contains many hypothetical
proteins.
Length = 98
Score = 31.7 bits (72), Expect = 0.22
Identities = 13/47 (27%), Positives = 23/47 (48%)
Query: 358 AKLPRLNKDLSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKK 404
+ + K ++ F N++ +KKKK+ K + K KK +KK
Sbjct: 1 KEEKKNEKTDKRDIDDIFSNIKKRKKKKKRTAKTARPKATKKGQKKD 47
>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
Length = 213
Score = 33.0 bits (76), Expect = 0.22
Identities = 11/56 (19%), Positives = 23/56 (41%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKKISLANLPKLKL 436
E +K+K ++ +K K K +K P+ D ++++ K+K
Sbjct: 122 REAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISELTVGQAVKVKA 177
Score = 31.0 bits (71), Expect = 0.84
Identities = 8/36 (22%), Positives = 18/36 (50%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPE 416
+++ +K+K R+++K K K + +PE
Sbjct: 120 KKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPE 155
Score = 30.7 bits (70), Expect = 1.3
Identities = 7/25 (28%), Positives = 16/25 (64%)
Query: 380 FPEKKKKKKKKKKKRKKKKKKKKKK 404
++K ++++K K K +KK+K +
Sbjct: 126 GEKEKAPRRERKPKPKAPRKKRKPR 150
>gnl|CDD|219547 pfam07741, BRF1, Brf1-like TBP-binding domain. This region covers
both the Brf homology II and III regions. This region is
involved in binding TATA binding protein.
Length = 95
Score = 31.5 bits (72), Expect = 0.23
Identities = 21/65 (32%), Positives = 33/65 (50%), Gaps = 7/65 (10%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPE-ICREFDSKKISLANLPKLKLFYPA 440
K+K + +KKKK+K KKK + + S+ E + + KK S K+ Y A
Sbjct: 37 LKQKADEGNNSGKKKKKRKAKKKRDEAGPASTAAEAVKKMLQKKKFS------KKINYDA 90
Query: 441 LKAMF 445
LK++F
Sbjct: 91 LKSLF 95
>gnl|CDD|234707 PRK00270, rpsU, 30S ribosomal protein S21; Reviewed.
Length = 64
Score = 30.2 bits (69), Expect = 0.25
Identities = 10/25 (40%), Positives = 17/25 (68%)
Query: 379 RFPEKKKKKKKKKKKRKKKKKKKKK 403
F EK +K+K+KK +K+++KK
Sbjct: 36 EFYEKPSEKRKRKKAAARKRRRKKL 60
Score = 29.1 bits (66), Expect = 0.68
Identities = 9/20 (45%), Positives = 15/20 (75%)
Query: 382 EKKKKKKKKKKKRKKKKKKK 401
EK+K+KK +KR++KK +
Sbjct: 43 EKRKRKKAAARKRRRKKLAR 62
>gnl|CDD|225324 COG2604, COG2604, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 594
Score = 33.5 bits (77), Expect = 0.26
Identities = 24/123 (19%), Positives = 37/123 (30%), Gaps = 9/123 (7%)
Query: 358 AKLPRLNKDLSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEI 417
L+KD + F + ++K K +K KK EI
Sbjct: 447 VCEKLLDKDKN----KPFIKLILLSNNSQEKAVLKLLQKIKKNNDFIKEFETDALKLQEI 502
Query: 418 CREFDSKKISLANL-PKLKLFYPALKAMFAEYQDEIENRAIRRNLKQESTYLARYPSEND 476
+ DSK L + K+ EI ++ L +AR N
Sbjct: 503 LEKVDSKSEKLEKISAKIDNIKELFDESKMSILQEI----LQPALHHIELNIARIYKLNI 558
Query: 477 KEK 479
K+K
Sbjct: 559 KDK 561
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 32.3 bits (74), Expect = 0.28
Identities = 8/21 (38%), Positives = 17/21 (80%)
Query: 382 EKKKKKKKKKKKRKKKKKKKK 402
EK+ K K++K+R+ ++K+K+
Sbjct: 125 EKELAKLKREKRRENERKQKE 145
Score = 31.1 bits (71), Expect = 0.58
Identities = 7/24 (29%), Positives = 17/24 (70%)
Query: 384 KKKKKKKKKKRKKKKKKKKKKVNN 407
K+ K K++KR++ ++K+K+ +
Sbjct: 126 KELAKLKREKRRENERKQKEILKE 149
Score = 29.9 bits (68), Expect = 1.5
Identities = 5/23 (21%), Positives = 17/23 (73%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
+K+ K K++++++ ++K+K+
Sbjct: 123 LLEKELAKLKREKRRENERKQKE 145
Score = 28.4 bits (64), Expect = 5.5
Identities = 6/23 (26%), Positives = 16/23 (69%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
++ K K++K++ ++K+K+ K
Sbjct: 126 KELAKLKREKRRENERKQKEILK 148
Score = 28.4 bits (64), Expect = 5.6
Identities = 6/23 (26%), Positives = 17/23 (73%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
++ +K+ K KR+K+++ ++K+
Sbjct: 121 DELLEKELAKLKREKRRENERKQ 143
Score = 27.6 bits (62), Expect = 9.5
Identities = 7/22 (31%), Positives = 17/22 (77%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKK 403
K K++K+++ +RK+K+ K++
Sbjct: 129 AKLKREKRRENERKQKEILKEQ 150
>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
Length = 191
Score = 32.5 bits (74), Expect = 0.30
Identities = 17/45 (37%), Positives = 20/45 (44%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKK 425
P KK KKKK RKKK KK KK + DS++
Sbjct: 46 PSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEE 90
Score = 30.2 bits (68), Expect = 1.9
Identities = 18/63 (28%), Positives = 31/63 (49%), Gaps = 1/63 (1%)
Query: 365 KDLSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSK 424
K +E+V D++ FPE + KK K+KK +KKK + + E E ++
Sbjct: 25 KSTTESVDKVNDDI-FPEDVEIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAE 83
Query: 425 KIS 427
++S
Sbjct: 84 ELS 86
>gnl|CDD|220121 pfam09135, Alb1, Alb1. Alb1 is a nuclear shuttling factor involved
in ribosome biogenesis.
Length = 112
Score = 31.3 bits (71), Expect = 0.37
Identities = 17/59 (28%), Positives = 32/59 (54%), Gaps = 2/59 (3%)
Query: 358 AKLPRLNKDLSENVGI--AFDNMRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSR 414
LPR + + GI A N +KK++KKK K+ +K++K++KK + + ++
Sbjct: 22 TSLPRAKRTDRKRPGILAASKNEGVSKKKQRKKKPKRLTRKQRKRQKKGLERAEIVTDT 80
>gnl|CDD|188365 TIGR03675, arCOG00543, arCOG00543 universal archaeal
KH-domain/beta-lactamase-domain protein. This family of
proteins is universal in the archaea and consistsof an
N-terminal type-1 KH-domain (pfam00013) a central
beta-lactamase-domain (pfam00753) with a C-terminal
motif associated with RNA metabolism (pfam07521).
KH-domains are associated with RNA-binding, so taken
together, this protein is a likely metal-dependent
RNAase. This family was defined in as arCOG01782.
Length = 630
Score = 33.0 bits (76), Expect = 0.40
Identities = 16/48 (33%), Positives = 23/48 (47%), Gaps = 6/48 (12%)
Query: 133 PDTCILLDCGEG----TYSQLVRLYGSAVDTLLSQLSAVYISHLHADH 176
P++ +LLDCG + L + L +L AV I+H H DH
Sbjct: 196 PESRVLLDCGVNVGADGDNAYPYLD--VPEFQLDELDAVVITHAHLDH 241
>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992). This
bacterial family of proteins has no known function.
However, the cis-regulatory yjdF motif, just upstream
from the gene encoding the proteins for this family, is
a small non-coding RNA, Rfam:RF01764. The yjdF motif is
found in many Firmicutes, including Bacillus subtilis.
In most cases, it resides in potential 5' UTRs of
homologues of the yjdF gene whose function is unknown.
However, in Streptococcus thermophilus, a yjdF RNA motif
is associated with an operon whose protein products
synthesise nicotinamide adenine dinucleotide (NAD+).
Also, the S. thermophilus yjdF RNA lacks typical yjdF
motif consensus features downstream of and including the
P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
S. thermophilus RNAs might sense a distinct compound
that structurally resembles the ligand bound by other
yjdF RNAs. On the ohter hand, perhaps these RNAs have an
alternative solution forming a similar binding site, as
is observed with some SAM riboswitches.
Length = 132
Score = 31.5 bits (72), Expect = 0.41
Identities = 11/22 (50%), Positives = 20/22 (90%)
Query: 383 KKKKKKKKKKKRKKKKKKKKKK 404
K+KK+++K++KR+ K++KKK K
Sbjct: 107 KEKKEEEKERKRQLKQQKKKAK 128
Score = 31.1 bits (71), Expect = 0.47
Identities = 10/22 (45%), Positives = 19/22 (86%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKK 403
E+ K++KKK+ K KK+++K++K
Sbjct: 96 ERNKQEKKKRSKEKKEEEKERK 117
Score = 28.8 bits (65), Expect = 3.6
Identities = 8/22 (36%), Positives = 20/22 (90%)
Query: 383 KKKKKKKKKKKRKKKKKKKKKK 404
++ K++KKK+ ++KK+++K++K
Sbjct: 96 ERNKQEKKKRSKEKKEEEKERK 117
Score = 28.0 bits (63), Expect = 5.7
Identities = 9/24 (37%), Positives = 20/24 (83%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKV 405
E ++ K++KKK+ K+KK+++K++
Sbjct: 94 EHERNKQEKKKRSKEKKEEEKERK 117
Score = 27.6 bits (62), Expect = 7.9
Identities = 9/23 (39%), Positives = 19/23 (82%)
Query: 383 KKKKKKKKKKKRKKKKKKKKKKV 405
++KKK+ K+KK ++K++K++ K
Sbjct: 100 QEKKKRSKEKKEEEKERKRQLKQ 122
>gnl|CDD|176075 cd08693, C2_PI3K_class_I_beta_delta, C2 domain present in class I
beta and delta phosphatidylinositol 3-kinases (PI3Ks).
PI3Ks (AKA phosphatidylinositol (PtdIns) 3-kinases)
regulate cell processes such as cell growth,
differentiation, proliferation, and motility. PI3Ks
work on phosphorylation of phosphatidylinositol,
phosphatidylinositide (4)P (PtdIns (4)P),2 or
PtdIns(4,5)P2. Specifically they phosphorylate the D3
hydroxyl group of phosphoinositol lipids on the inositol
ring. There are 3 classes of PI3Ks based on structure,
regulation, and specificity. All classes contain a C2
domain, a PIK domain, and a kinase catalytic domain.
The members here are class I, beta and delta isoforms of
PI3Ks and contain both a Ras-binding domain and a
p85-binding domain. Class II PI3Ks contain both of
these as well as a PX domain, and a C-terminal C2 domain
containing a nuclear localization signal. C2 domains
fold into an 8-standed beta-sandwich that can adopt 2
structural arrangements: Type I and Type II,
distinguished by a circular permutation involving their
N- and C-terminal beta strands. Many C2 domains are
Ca2+-dependent membrane-targeting modules that bind a
wide variety of substances including bind phospholipids,
inositol polyphosphates, and intracellular proteins.
Most C2 domain proteins are either signal transduction
enzymes that contain a single C2 domain, such as protein
kinase C, or membrane trafficking proteins which contain
at least two C2 domains, such as synaptotagmin 1.
However, there are a few exceptions to this including
RIM isoforms and some splice variants of piccolo/aczonin
and intersectin which only have a single C2 domain. C2
domains with a calcium binding region have negatively
charged residues, primarily aspartates, that serve as
ligands for calcium ions. Members have a type-I
topology.
Length = 173
Score = 31.9 bits (73), Expect = 0.44
Identities = 14/26 (53%), Positives = 17/26 (65%)
Query: 384 KKKKKKKKKKRKKKKKKKKKKVNNSI 409
KK K K+ RK + KKKKKK +N I
Sbjct: 85 VSKKAKGKRSRKNQTKKKKKKDDNPI 110
Score = 30.0 bits (68), Expect = 1.8
Identities = 12/25 (48%), Positives = 16/25 (64%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVN 406
E KK K K+ ++ + KKKKKK N
Sbjct: 84 EVSKKAKGKRSRKNQTKKKKKKDDN 108
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 32.3 bits (74), Expect = 0.45
Identities = 9/24 (37%), Positives = 15/24 (62%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
+ K KK K K +K+K K++K+
Sbjct: 54 AKAKAKKALKAKIEEKEKAKREKE 77
Score = 31.6 bits (72), Expect = 0.80
Identities = 10/24 (41%), Positives = 15/24 (62%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
K K KK K K ++K+K K++K
Sbjct: 53 AAKAKAKKALKAKIEEKEKAKREK 76
Score = 31.2 bits (71), Expect = 0.97
Identities = 12/23 (52%), Positives = 14/23 (60%)
Query: 384 KKKKKKKKKKRKKKKKKKKKKVN 406
K K +K K KKKK K K K+N
Sbjct: 194 KLKAEKAAKGGKKKKGKAKAKLN 216
Score = 31.2 bits (71), Expect = 1.1
Identities = 8/24 (33%), Positives = 17/24 (70%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
+ KK K K ++++K K++K++K
Sbjct: 56 AKAKKALKAKIEEKEKAKREKEEK 79
Score = 30.8 bits (70), Expect = 1.4
Identities = 9/24 (37%), Positives = 15/24 (62%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
K KK K K + K+K K++K++
Sbjct: 55 KAKAKKALKAKIEEKEKAKREKEE 78
Score = 30.0 bits (68), Expect = 2.3
Identities = 8/24 (33%), Positives = 14/24 (58%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
K K KK K + ++K+K K++
Sbjct: 52 VAAKAKAKKALKAKIEEKEKAKRE 75
Score = 30.0 bits (68), Expect = 2.6
Identities = 9/24 (37%), Positives = 13/24 (54%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
+ K K K KK K K ++K+K
Sbjct: 48 EKAKVAAKAKAKKALKAKIEEKEK 71
Score = 29.6 bits (67), Expect = 2.9
Identities = 9/25 (36%), Positives = 15/25 (60%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVN 406
E+K+++K K + K KK K K+
Sbjct: 43 EEKEEEKAKVAAKAKAKKALKAKIE 67
Score = 29.2 bits (66), Expect = 3.8
Identities = 9/24 (37%), Positives = 13/24 (54%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
EK K K K K+ K K ++K+
Sbjct: 47 EEKAKVAAKAKAKKALKAKIEEKE 70
Score = 29.2 bits (66), Expect = 4.4
Identities = 9/23 (39%), Positives = 12/23 (52%)
Query: 384 KKKKKKKKKKRKKKKKKKKKKVN 406
+K K +K + KKKK K K
Sbjct: 192 NEKLKAEKAAKGGKKKKGKAKAK 214
Score = 28.5 bits (64), Expect = 7.1
Identities = 8/24 (33%), Positives = 13/24 (54%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
+ K K KK + K ++K+K K
Sbjct: 50 AKVAAKAKAKKALKAKIEEKEKAK 73
Score = 28.1 bits (63), Expect = 9.6
Identities = 9/22 (40%), Positives = 13/22 (59%)
Query: 385 KKKKKKKKKRKKKKKKKKKKVN 406
K K KK K K ++K+K K+
Sbjct: 55 KAKAKKALKAKIEEKEKAKREK 76
Score = 28.1 bits (63), Expect = 9.7
Identities = 9/23 (39%), Positives = 14/23 (60%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
E ++K+++K K K K KK K
Sbjct: 41 EDEEKEEEKAKVAAKAKAKKALK 63
Score = 28.1 bits (63), Expect = 9.8
Identities = 8/24 (33%), Positives = 14/24 (58%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
+++K+++K K K K KK K
Sbjct: 41 EDEEKEEEKAKVAAKAKAKKALKA 64
>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
N-terminal. SRP is a complex of six distinct
polypeptides and a 7S RNA that is essential for
transferring nascent polypeptide chains that are
destined for export from the cell to the translocation
apparatus of the endoplasmic reticulum (ER) membrane.
SRP binds hydrophobic signal sequences as they emerge
from the ribosome, and arrests translation.
Length = 272
Score = 32.4 bits (74), Expect = 0.45
Identities = 7/31 (22%), Positives = 10/31 (32%)
Query: 375 FDNMRFPEKKKKKKKKKKKRKKKKKKKKKKV 405
+ +K K K KK KK +
Sbjct: 184 AAKLSSTASSGDEKSPKSKAAPKKAGKKMRK 214
Score = 28.2 bits (63), Expect = 9.6
Identities = 5/23 (21%), Positives = 7/23 (30%)
Query: 384 KKKKKKKKKKRKKKKKKKKKKVN 406
+K + K KK K
Sbjct: 190 TASSGDEKSPKSKAAPKKAGKKM 212
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 32.9 bits (76), Expect = 0.46
Identities = 18/50 (36%), Positives = 26/50 (52%), Gaps = 6/50 (12%)
Query: 364 NKDLSENVGIAFDNMRFP----EKKKKKKKKKKKRKKKKKKKKKKVNNSI 409
+K+ GI M+ P EK +K KKKKKK+ K K K + V+ +
Sbjct: 660 DKEAIVQAGIM--KMKVPLSDLEKIQKPKKKKKKKPKTVKPKPRTVSLEL 707
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 32.8 bits (75), Expect = 0.48
Identities = 24/73 (32%), Positives = 36/73 (49%), Gaps = 2/73 (2%)
Query: 353 FSQRYAKLPRLNKDLSENVGIAFDN--MRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQ 410
F R KL + D+S + F N + E K ++K +KKKKK+KKK+ +
Sbjct: 703 FLDRIDKLFFFSFDISGLMKKIFRNWNGKDAEFKISDSVEEKTKKKKKKEKKKEEEYKRE 762
Query: 411 ISSRPEICREFDS 423
+R EI +DS
Sbjct: 763 EKARIEIAEAWDS 775
Score = 30.8 bits (70), Expect = 1.9
Identities = 19/70 (27%), Positives = 33/70 (47%), Gaps = 7/70 (10%)
Query: 361 PRLNKDLSENVGIAFDNMRFPEKKKK----KKKKKKKRKKKKKKKKKKVNNSIQISSRPE 416
+++ + + + N EKK K + K + + +K K+KKK+ N+I IS
Sbjct: 1003 QMIHESSVQIRSMEWTNSSLTEKKIKDLTDRTKTIRNQIEKITKEKKKITNTINISPNK- 1061
Query: 417 ICREFDSKKI 426
+DSK I
Sbjct: 1062 --TSYDSKII 1069
>gnl|CDD|237972 PRK15483, PRK15483, type III restriction-modification system StyLTI
enzyme res; Provisional.
Length = 986
Score = 32.7 bits (75), Expect = 0.52
Identities = 27/126 (21%), Positives = 46/126 (36%), Gaps = 18/126 (14%)
Query: 372 GIAFDNMRFPEKKKKKKKKKKKRKKKK---KKKKKK----VNNSI-QISSRPE--ICREF 421
G+ PE++ K KK ++ KK V ++ I E
Sbjct: 275 GVDIFYPEMPEEQANNVKIVDSVTAKKLILRRGKKTFELAVGENLADIDPGFEGLTIEYA 334
Query: 422 DSKKISLAN---LPKLKLFYPALKAMFAE-YQDEIENRAIRRNLKQESTYLARYPSENDK 477
+ L+N L K P +++ YQ+ + +AI ++ + E R +E K
Sbjct: 335 GKTSVVLSNGLELEKGMALVPG---TYSQSYQELMIQQAIDKHFETERANFLR-RNEPPK 390
Query: 478 EKPGSL 483
K SL
Sbjct: 391 IKTLSL 396
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 32.6 bits (75), Expect = 0.53
Identities = 17/51 (33%), Positives = 27/51 (52%), Gaps = 5/51 (9%)
Query: 357 YAKL---PRLNKDLSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKK 404
Y KL P L + E N+ E+KK +KK++K KK +K++ +K
Sbjct: 384 YLKLHDKPLLAEGEEEE--GENGNLSPAERKKLRKKQRKAEKKAEKEEAEK 432
Score = 29.5 bits (67), Expect = 4.7
Identities = 8/22 (36%), Positives = 16/22 (72%)
Query: 383 KKKKKKKKKKKRKKKKKKKKKK 404
KK++K +KK ++++ +K KK
Sbjct: 416 KKQRKAEKKAEKEEAEKAAAKK 437
Score = 29.1 bits (66), Expect = 6.5
Identities = 9/24 (37%), Positives = 16/24 (66%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKV 405
+K++K +KK +K + +K KKK
Sbjct: 416 KKQRKAEKKAEKEEAEKAAAKKKA 439
>gnl|CDD|235245 PRK04183, PRK04183, glutamyl-tRNA(Gln) amidotransferase subunit D;
Validated.
Length = 419
Score = 32.5 bits (75), Expect = 0.54
Identities = 10/39 (25%), Positives = 18/39 (46%), Gaps = 3/39 (7%)
Query: 370 NVGIAFDN---MRFPEKKKKKKKKKKKRKKKKKKKKKKV 405
N+GI D + EK +K K++ ++ +K V
Sbjct: 40 NIGIDIDKIAEIELLEKGEKPKQEPPPKEIEKDPGLPNV 78
>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
Length = 859
Score = 32.5 bits (75), Expect = 0.55
Identities = 17/67 (25%), Positives = 27/67 (40%), Gaps = 12/67 (17%)
Query: 340 GREMRAKFV--LLTHFSQRYAKLPRLNKDLSENVGIAFDNMRFPEKKKKKKKKKKKRKKK 397
GRE RA V L +++ KL L +++ P KK K +K K++
Sbjct: 799 GRETRAPTVEEALELLAEKPEKLRYLADAPAKD----------PAGKKAAVKFSRKTKQQ 848
Query: 398 KKKKKKK 404
+K
Sbjct: 849 YVASEKD 855
Score = 29.8 bits (68), Expect = 4.3
Identities = 7/24 (29%), Positives = 9/24 (37%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKV 405
EK +K + K KK V
Sbjct: 816 EKPEKLRYLADAPAKDPAGKKAAV 839
>gnl|CDD|185611 PTZ00428, PTZ00428, 60S ribosomal protein L4; Provisional.
Length = 381
Score = 32.3 bits (74), Expect = 0.57
Identities = 10/28 (35%), Positives = 16/28 (57%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNSI 409
K +K K ++ RKK KK + KV ++
Sbjct: 348 AKGAQKVKNRRARKKAKKARLAKVAKAL 375
Score = 28.9 bits (65), Expect = 6.8
Identities = 8/27 (29%), Positives = 14/27 (51%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNS 408
+ K +K K R+ +KK KK ++
Sbjct: 345 KATAKGAQKVKNRRARKKAKKARLAKV 371
>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6. RPL6
contains KOW motif that has an extra ribosomal role as
an oncogenic. KOW domain is known as an RNA-binding
motif that is shared so far among some families of
ribosomal proteins, the essential bacterial
transcriptional elongation factor NusG, the eukaryotic
chromatin elongation factor Spt5, the higher eukaryotic
KIN17 proteins and Mtr4. .
Length = 152
Score = 31.4 bits (72), Expect = 0.58
Identities = 28/83 (33%), Positives = 40/83 (48%), Gaps = 13/83 (15%)
Query: 366 DLSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKK 425
D+S ++ F KKKKKKKKK+ ++KKKK V S + R+ D K
Sbjct: 67 DISGVKVPKLNDAYFKRKKKKKKKKKEGEFFEEKKKKYVV--SEE--------RKEDQKA 116
Query: 426 IS---LANLPKLKLFYPALKAMF 445
+ LA + K+ L LK+ F
Sbjct: 117 VDAALLAAIKKVPLLKGYLKSRF 139
>gnl|CDD|185507 PTZ00191, PTZ00191, 60S ribosomal protein L23a; Provisional.
Length = 145
Score = 30.8 bits (70), Expect = 0.69
Identities = 14/28 (50%), Positives = 17/28 (60%)
Query: 383 KKKKKKKKKKKRKKKKKKKKKKVNNSIQ 410
KKK KK K KK K KK+KV S++
Sbjct: 7 AKKKAKKAAKAAKKGVKVKKRKVRTSVR 34
Score = 28.5 bits (64), Expect = 4.8
Identities = 10/26 (38%), Positives = 15/26 (57%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNN 407
+KK KK K K+ K KK+K + +
Sbjct: 8 KKKAKKAAKAAKKGVKVKKRKVRTSV 33
Score = 28.1 bits (63), Expect = 6.8
Identities = 11/23 (47%), Positives = 13/23 (56%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
KKK KK K +K K KK+K
Sbjct: 7 AKKKAKKAAKAAKKGVKVKKRKV 29
>gnl|CDD|193409 pfam12936, Kri1_C, KRI1-like family C-terminal. The yeast member
of this family (Kri1p) is found to be required for 40S
ribosome biogenesis in the nucleolus. This is the
C-terminal domain of the family.
Length = 93
Score = 29.9 bits (68), Expect = 0.81
Identities = 9/25 (36%), Positives = 18/25 (72%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKV 405
+++++K KKK +K + ++ KKKV
Sbjct: 68 DKEERRKDKKKYGKKARLREWKKKV 92
Score = 28.7 bits (65), Expect = 1.8
Identities = 10/25 (40%), Positives = 17/25 (68%)
Query: 379 RFPEKKKKKKKKKKKRKKKKKKKKK 403
R E+++K KKK K+ + ++ KKK
Sbjct: 67 RDKEERRKDKKKYGKKARLREWKKK 91
Score = 27.6 bits (62), Expect = 5.0
Identities = 10/23 (43%), Positives = 13/23 (56%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKK 403
+K KKK KK R ++ KKK
Sbjct: 71 ERRKDKKKYGKKARLREWKKKVF 93
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 31.8 bits (73), Expect = 0.82
Identities = 23/90 (25%), Positives = 37/90 (41%), Gaps = 25/90 (27%)
Query: 341 REMR----AKFVLLTHFSQRYAKL-------------PRLNKDLSENVGIAFDNMRF--- 380
RE R K H S+R + P L L+ + + + + F
Sbjct: 345 RETRDSIAKKIAEKLHTSKRKVRREVLPFLSIIFKHNPELAARLAAFLELTEEEIEFLTG 404
Query: 381 -----PEKKKKKKKKKKKRKKKKKKKKKKV 405
+ KK +K +KKR+++KK+KKKK
Sbjct: 405 SKKATKKIKKIVEKAEKKREEEKKEKKKKA 434
Score = 31.4 bits (72), Expect = 1.3
Identities = 12/35 (34%), Positives = 19/35 (54%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPE 416
EK +KK++++KK KKKK KK + +
Sbjct: 417 EKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKK 451
Score = 30.7 bits (70), Expect = 2.2
Identities = 12/44 (27%), Positives = 25/44 (56%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKK 425
+KK KKK++ ++++K+KK++ + + E E + KK
Sbjct: 431 KKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKK 474
Score = 30.3 bits (69), Expect = 2.3
Identities = 8/23 (34%), Positives = 16/23 (69%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
+K +KK+++++K+KKKK
Sbjct: 415 IVEKAEKKREEEKKEKKKKAFAG 437
Score = 30.3 bits (69), Expect = 2.7
Identities = 23/128 (17%), Positives = 51/128 (39%), Gaps = 14/128 (10%)
Query: 319 DELEKEAALKMHSTVSQAIR-----------IGREMRAKFVLLTHFSQRYAKLPRLNKDL 367
D + K+ A K+H++ + R E+ A+ ++ + +K
Sbjct: 349 DSIAKKIAEKLHTSKRKVRREVLPFLSIIFKHNPELAARLAAFLELTEEEIEFLTGSKKA 408
Query: 368 SENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKKIS 427
++ + + EKK++++KK+KK+K KKK++ + E E + +
Sbjct: 409 TKKIKKIVEKA---EKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEE 465
Query: 428 LANLPKLK 435
+ K
Sbjct: 466 KEEEEEKK 473
>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
Length = 158
Score = 30.9 bits (70), Expect = 0.83
Identities = 11/24 (45%), Positives = 19/24 (79%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKV 405
K+ ++K+ K +K+KK+KK+KKV
Sbjct: 103 SKETEEKEAIKAKKEKKEKKEKKV 126
Score = 30.9 bits (70), Expect = 0.91
Identities = 10/38 (26%), Positives = 21/38 (55%)
Query: 367 LSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKK 404
L+E I +++K+ K+ ++++ K KK+KK
Sbjct: 82 LAEGYSINLFPEESEKEQKEVSKETEEKEAIKAKKEKK 119
Score = 30.5 bits (69), Expect = 1.2
Identities = 9/25 (36%), Positives = 16/25 (64%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKV 405
EK++K+ K+ + K+ K KK+K
Sbjct: 95 SEKEQKEVSKETEEKEAIKAKKEKK 119
Score = 29.3 bits (66), Expect = 2.5
Identities = 10/24 (41%), Positives = 17/24 (70%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
K+ ++K+ K KK+KK+KK+K
Sbjct: 101 EVSKETEEKEAIKAKKEKKEKKEK 124
Score = 29.3 bits (66), Expect = 2.8
Identities = 9/25 (36%), Positives = 17/25 (68%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVN 406
+K+ K+ ++K+ K KK+KK+K
Sbjct: 99 QKEVSKETEEKEAIKAKKEKKEKKE 123
Score = 28.6 bits (64), Expect = 4.2
Identities = 9/24 (37%), Positives = 17/24 (70%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
++ K+ ++K+ K KK+KK+KK
Sbjct: 99 QKEVSKETEEKEAIKAKKEKKEKK 122
Score = 28.2 bits (63), Expect = 5.4
Identities = 9/23 (39%), Positives = 16/23 (69%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
E+K+ K+ ++K K KK+KK+
Sbjct: 98 EQKEVSKETEEKEAIKAKKEKKE 120
Score = 28.2 bits (63), Expect = 7.0
Identities = 11/24 (45%), Positives = 19/24 (79%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKV 405
E+K+ K KK+K++KK+KK +K+
Sbjct: 107 EEKEAIKAKKEKKEKKEKKVAEKL 130
>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
Length = 1094
Score = 32.1 bits (73), Expect = 0.84
Identities = 21/52 (40%), Positives = 32/52 (61%), Gaps = 3/52 (5%)
Query: 386 KKKKKKKKRKKKKKKKKKKVNNSIQ-ISSRPEICREFDSKKISLANLPKLKL 436
+KKKKKKK K+KKKKK ++ + I + + E +++K S P+LKL
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQIHALENIEEGIEGSNNEKSSET--PQLKL 733
Score = 29.5 bits (66), Expect = 5.9
Identities = 13/23 (56%), Positives = 19/23 (82%)
Query: 383 KKKKKKKKKKKRKKKKKKKKKKV 405
KKKKKKK K+K+KKK ++ K++
Sbjct: 685 KKKKKKKSKEKKKKKNREASKQI 707
Score = 29.1 bits (65), Expect = 6.4
Identities = 13/22 (59%), Positives = 18/22 (81%)
Query: 383 KKKKKKKKKKKRKKKKKKKKKK 404
+KKKKKKK K++KKKK ++ K
Sbjct: 684 RKKKKKKKSKEKKKKKNREASK 705
>gnl|CDD|225816 COG3277, GAR1, RNA-binding protein involved in rRNA processing
[Translation, ribosomal structure and biogenesis].
Length = 98
Score = 29.6 bits (67), Expect = 0.92
Identities = 11/23 (47%), Positives = 15/23 (65%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKK 403
P K +KK+K RKK++ KK K
Sbjct: 76 PPDKLIRKKRKLPRKKRRPKKPK 98
Score = 29.6 bits (67), Expect = 0.96
Identities = 9/21 (42%), Positives = 14/21 (66%)
Query: 384 KKKKKKKKKKRKKKKKKKKKK 404
K +KK+K +KK++ KK K
Sbjct: 78 DKLIRKKRKLPRKKRRPKKPK 98
>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family. The yeast member of this family
(Kri1p) is found to be required for 40S ribosome
biogenesis in the nucleolus.
Length = 99
Score = 29.5 bits (67), Expect = 0.93
Identities = 5/24 (20%), Positives = 18/24 (75%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKV 405
K+++K +++++ K+ K K++++
Sbjct: 3 RKEEEKAQREEELKRLKNLKREEI 26
Score = 28.8 bits (65), Expect = 2.1
Identities = 6/23 (26%), Positives = 17/23 (73%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
E+K+++K ++++ K+ K K++
Sbjct: 2 ERKEEEKAQREEELKRLKNLKRE 24
Score = 28.4 bits (64), Expect = 2.8
Identities = 5/25 (20%), Positives = 18/25 (72%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVN 406
EK +++++ K+ + K+++ ++K+
Sbjct: 7 EKAQREEELKRLKNLKREEIEEKLE 31
Score = 28.4 bits (64), Expect = 2.9
Identities = 6/24 (25%), Positives = 16/24 (66%)
Query: 383 KKKKKKKKKKKRKKKKKKKKKKVN 406
K++K+++K ++ ++ K+ K K
Sbjct: 1 KERKEEEKAQREEELKRLKNLKRE 24
>gnl|CDD|240709 cd12263, RRM_ABT1_like, RNA recognition motif found in activator of
basal transcription 1 (ABT1) and similar proteins. This
subfamily corresponds to the RRM of novel nuclear
proteins termed ABT1 and its homologous counterpart,
pre-rRNA-processing protein ESF2 (eighteen S factor 2),
from yeast. ABT1 associates with the TATA-binding
protein (TBP) and enhances basal transcription activity
of class II promoters. Meanwhile, ABT1 could be a
transcription cofactor that can bind to DNA in a
sequence-independent manner. The yeast ABT1 homolog,
ESF2, is a component of 90S preribosomes and 5'
ETS-based RNPs. It is previously identified as a
putative partner of the TATA-element binding protein.
However, it is primarily localized to the nucleolus and
physically associates with pre-rRNA processing factors.
ESF2 may play a role in ribosome biogenesis. It is
required for normal pre-rRNA processing, as well as for
SSU processome assembly and function. Both ABT1 and ESF2
contain an RNA recognition motif (RRM), also termed RBD
(RNA binding domain) or RNP (ribonucleoprotein domain).
.
Length = 98
Score = 29.5 bits (67), Expect = 1.1
Identities = 14/33 (42%), Positives = 18/33 (54%), Gaps = 8/33 (24%)
Query: 381 PEKKKKKKKKKKKRKKKKKK--------KKKKV 405
PE K+K++KKK KKKK + KKV
Sbjct: 34 PEDPAKRKRRKKKGGNKKKKFTEGWVEFEDKKV 66
>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
Genome duplication is precisely regulated by
cyclin-dependent kinases CDKs, which bring about the
onset of S phase by activating replication origins and
then prevent relicensing of origins until mitosis is
completed. The optimum sequence motif for CDK
phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
to have at least 11 potential phosphorylation sites.
Drc1 is required for DNA synthesis and S-M replication
checkpoint control. Drc1 associates with Cdc2 and is
phosphorylated at the onset of S phase when Cdc2 is
activated. Thus Cdc2 promotes DNA replication by
phosphorylating Drc1 and regulating its association with
Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
substrates required for DNA replication.
Length = 397
Score = 31.3 bits (71), Expect = 1.1
Identities = 16/49 (32%), Positives = 28/49 (57%), Gaps = 1/49 (2%)
Query: 358 AKLPRLNKDLSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKKVN 406
L++ L GI D+ + ++ K++ +KK+K KKK +K+KVN
Sbjct: 323 LDEKSLSEFLGYMGGIDEDDEDE-DDEESKEEVEKKQKVKKKPRKRKVN 370
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 30.5 bits (69), Expect = 1.1
Identities = 12/54 (22%), Positives = 23/54 (42%), Gaps = 7/54 (12%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKKISLANLPKL 434
E +++ + K K+KKK++ + + + E + SK PKL
Sbjct: 107 DETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSK-------PKL 153
>gnl|CDD|219978 pfam08701, GN3L_Grn1, GNL3L/Grn1 putative GTPase. Grn1 (yeast) and
GNL3L (human) are putative GTPases which are required
for growth and play a role in processing of nucleolar
pre-rRNA. This family contains a potential nuclear
localisation signal.
Length = 80
Score = 29.1 bits (66), Expect = 1.1
Identities = 10/22 (45%), Positives = 13/22 (59%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKK 403
E +K +K+ KK K KKKK
Sbjct: 12 EHHRKLRKEAKKNPTWKSKKKK 33
Score = 26.8 bits (60), Expect = 6.1
Identities = 12/32 (37%), Positives = 18/32 (56%), Gaps = 4/32 (12%)
Query: 377 NMRFPEKKKKKKKKKKKRKKKKK----KKKKK 404
+R+ +KK + +K RK+ KK K KKK
Sbjct: 1 RLRYKIEKKVAEHHRKLRKEAKKNPTWKSKKK 32
>gnl|CDD|233160 TIGR00869, sec62, protein translocation protein, Sec62 family.
Members of the NSCC2 family have been sequenced from
various yeast, fungal and animals species including
Saccharomyces cerevisiae, Drosophila melanogaster and
Homo sapiens. These proteins are the Sec62 proteins,
believed to be associated with the Sec61 and Sec63
constituents of the general protein secretary systems of
yeast microsomes. They are also the non-selective cation
(NS) channels of the mammalian cytoplasmic membrane. The
yeast Sec62 protein has been shown to be essential for
cell growth. The mammalian NS channel proteins has been
implicated in platelet derived growth factor(PGDF)
dependent single channel current in fibroblasts. These
channels are essentially closed in serum deprived
tissue-culture cells and are specifically opened by
exposure to PDGF. These channels are reported to exhibit
equal selectivity for Na+, K+ and Cs+ with low
permeability to Ca2+, and no permeability to anions
[Transport and binding proteins, Amino acids, peptides
and amines].
Length = 232
Score = 31.0 bits (70), Expect = 1.1
Identities = 14/28 (50%), Positives = 17/28 (60%)
Query: 380 FPEKKKKKKKKKKKRKKKKKKKKKKVNN 407
+ EK K KKK K KK KKK+ K+ N
Sbjct: 205 WHEKDKYSYKKKLKSKKLKKKQAKREAN 232
>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
homolog [Translation, ribosomal structure and
biogenesis].
Length = 395
Score = 31.2 bits (71), Expect = 1.1
Identities = 11/25 (44%), Positives = 18/25 (72%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKV 405
PE+ KK++ + +RKKK+KK K +
Sbjct: 366 PERDKKERPGRYRRKKKEKKAKSER 390
Score = 31.2 bits (71), Expect = 1.2
Identities = 9/26 (34%), Positives = 16/26 (61%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVN 406
P K K ++ KK+R + ++KKK+
Sbjct: 360 PPTKAKPERDKKERPGRYRRKKKEKK 385
Score = 29.7 bits (67), Expect = 3.8
Identities = 9/24 (37%), Positives = 16/24 (66%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
++ KK++ + R+KKK+KK K
Sbjct: 365 KPERDKKERPGRYRRKKKEKKAKS 388
Score = 29.7 bits (67), Expect = 3.9
Identities = 7/26 (26%), Positives = 18/26 (69%)
Query: 379 RFPEKKKKKKKKKKKRKKKKKKKKKK 404
+KK++ + ++K+K+KK K +++
Sbjct: 366 PERDKKERPGRYRRKKKEKKAKSERR 391
Score = 29.3 bits (66), Expect = 5.4
Identities = 8/28 (28%), Positives = 17/28 (60%)
Query: 379 RFPEKKKKKKKKKKKRKKKKKKKKKKVN 406
R +++ + ++KKK KK K +++ N
Sbjct: 368 RDKKERPGRYRRKKKEKKAKSERRGLQN 395
Score = 28.5 bits (64), Expect = 8.1
Identities = 6/26 (23%), Positives = 16/26 (61%)
Query: 379 RFPEKKKKKKKKKKKRKKKKKKKKKK 404
+ P+ K K ++ K+++ + ++KK
Sbjct: 356 KPPKPPTKAKPERDKKERPGRYRRKK 381
>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2. It is vital for
effective cell-replication that replication is not
stalled at any point by, for instance, damaged bases.
Rtf2 stabilizes the replication fork stalled at the
site-specific replication barrier RTS1 by preventing
replication restart until completion of DNA synthesis by
a converging replication fork initiated at a flanking
origin. The RTS1 element terminates replication forks
that are moving in the cen2-distal direction while
allowing forks moving in the cen2-proximal direction to
pass through the region. Rtf2 contains a C2HC2 motif
related to the C3HC4 RING-finger motif, and would appear
to fold up, creating a RING finger-like structure but
forming only one functional Zn2+ ion-binding site.
Length = 254
Score = 30.8 bits (70), Expect = 1.2
Identities = 13/26 (50%), Positives = 20/26 (76%)
Query: 383 KKKKKKKKKKKRKKKKKKKKKKVNNS 408
K + ++++ KK+KKKKKKK KK N +
Sbjct: 173 KARLEEERAKKKKKKKKKKTKKNNAT 198
Score = 30.0 bits (68), Expect = 2.3
Identities = 13/27 (48%), Positives = 20/27 (74%)
Query: 383 KKKKKKKKKKKRKKKKKKKKKKVNNSI 409
K + ++++ K+KKKKKKKK K NN+
Sbjct: 172 LKARLEEERAKKKKKKKKKKTKKNNAT 198
Score = 30.0 bits (68), Expect = 2.6
Identities = 19/57 (33%), Positives = 26/57 (45%), Gaps = 2/57 (3%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKKISLANLPKLKLFY 438
E+++ KKKKKKK+KK KK + +SS E S + KLK
Sbjct: 177 EEERAKKKKKKKKKKTKKNNATGSSAEATVSSAV--PTELSSGAGQVGEAKKLKKKR 231
Score = 29.2 bits (66), Expect = 4.6
Identities = 11/26 (42%), Positives = 18/26 (69%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNN 407
+ + ++++ KKK+KKKKKK KK
Sbjct: 173 KARLEEERAKKKKKKKKKKTKKNNAT 198
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This
family consists of several bovine specific leukaemia
virus receptors which are thought to function as
transmembrane proteins, although their exact function is
unknown.
Length = 561
Score = 31.2 bits (70), Expect = 1.3
Identities = 13/24 (54%), Positives = 18/24 (75%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKV 405
+K KK KKK+KK K+K++ K KK
Sbjct: 197 KKSKKPKKKEKKEKEKERDKDKKK 220
Score = 31.2 bits (70), Expect = 1.6
Identities = 13/47 (27%), Positives = 26/47 (55%)
Query: 369 ENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRP 415
+V + P+KK+KK+K+K++ K KKK+ + + + + P
Sbjct: 190 GDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDSP 236
Score = 30.4 bits (68), Expect = 2.4
Identities = 9/26 (34%), Positives = 20/26 (76%)
Query: 379 RFPEKKKKKKKKKKKRKKKKKKKKKK 404
R ++ +K K++KKKR+K+K+ +++
Sbjct: 89 RHRQRLEKDKREKKKREKEKRGRRRH 114
Score = 30.0 bits (67), Expect = 3.5
Identities = 13/26 (50%), Positives = 18/26 (69%)
Query: 379 RFPEKKKKKKKKKKKRKKKKKKKKKK 404
+ K KKKK++K+K +KKKKKK
Sbjct: 275 KKSPKHKKKKQRKEKEEKKKKKKHHH 300
Score = 30.0 bits (67), Expect = 3.6
Identities = 9/23 (39%), Positives = 20/23 (86%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
E++ +++ +K KR+KKK++K+K+
Sbjct: 87 ERRHRQRLEKDKREKKKREKEKR 109
Score = 29.3 bits (65), Expect = 4.8
Identities = 13/33 (39%), Positives = 24/33 (72%)
Query: 383 KKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRP 415
KKKK++K+K+++KKKKK + ++S + +P
Sbjct: 281 KKKKQRKEKEEKKKKKKHHHHRCHHSDGGAEQP 313
Score = 29.3 bits (65), Expect = 5.9
Identities = 13/24 (54%), Positives = 18/24 (75%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
P+ KKKK++K+K+ KKKKKK
Sbjct: 278 PKHKKKKQRKEKEEKKKKKKHHHH 301
Score = 28.9 bits (64), Expect = 7.6
Identities = 8/21 (38%), Positives = 16/21 (76%)
Query: 382 EKKKKKKKKKKKRKKKKKKKK 402
EK K++KKK++K K+ +++
Sbjct: 95 EKDKREKKKREKEKRGRRRHH 115
Score = 28.5 bits (63), Expect = 9.0
Identities = 9/39 (23%), Positives = 24/39 (61%)
Query: 378 MRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPE 416
++ E+++ +++ +K +++KKK++K+K S E
Sbjct: 82 VKLEEERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTE 120
>gnl|CDD|99903 cd06080, MUM1_like, Mutated melanoma-associated antigen 1 (MUM-1)
is a melanoma-associated antigen (MAA). MUM-1 belongs
to the mutated or aberrantly expressed type of MAAs,
along with antigens such as CDK4, beta-catenin,
gp100-in4, p15, and N-acetylglucosaminyltransferase V.
It is highly expressed in several types of human
cancers. The PWWP domain, named for a conserved
Pro-Trp-Trp-Pro motif, is a small domain consisting of
100-150 amino acids. The PWWP domain is found in
numerous proteins that are involved in cell division,
growth and differentiation. Most PWWP-domain proteins
seem to be nuclear, often DNA-binding, proteins that
function as transcription factors regulating a variety
of developmental processes.
Length = 80
Score = 28.9 bits (65), Expect = 1.3
Identities = 14/54 (25%), Positives = 21/54 (38%), Gaps = 6/54 (11%)
Query: 364 NKDLSENVGIAFDNMRFPEKKKKKKKKKKKR------KKKKKKKKKKVNNSIQI 411
K V DNM+ +K + K+ K +K K K+ +IQI
Sbjct: 26 RKKQKARVNFIGDNMQSEKKGIRVVKRWLKHFDCTEKQKLTNKAKESYEQAIQI 79
>gnl|CDD|234368 TIGR03835, termin_org_DnaJ, terminal organelle assembly protein
TopJ. This model describes TopJ (MG_200, CbpA), a DnaJ
homolog and probable assembly protein of the Mycoplasma
terminal organelle. The terminal organelle is involved
in both cytadherence and gliding motility [Cellular
processes, Chemotaxis and motility].
Length = 871
Score = 31.3 bits (70), Expect = 1.4
Identities = 28/120 (23%), Positives = 43/120 (35%), Gaps = 22/120 (18%)
Query: 374 AFDNMRFPEKKKKKKKKKKKRK---KKKKKKKKKVNNSIQISSRPEICREFDSKKISLAN 430
FDN+ + KKKKKKKK+ K K++ K + E S+
Sbjct: 96 EFDNITDDDTPSKKKKKKKKKGWFWAKSKQESKTIETE-----------EIIDVGASVNQ 144
Query: 431 LPKLKLFYPALKAMFAE-----YQDEIENRAIRRNLKQESTYLARYPSENDKEKPGSLDD 485
+ +LF L E D+ E + +E ++ A E E P +D
Sbjct: 145 ANETRLFDDTLDDQLEESVSTQSTDDGE---RLFDQNKEPSFTAGLDDEVQSEVPQPEED 201
>gnl|CDD|233605 TIGR01865, cas_Csn1, CRISPR-associated protein Cas9/Csn1, subtype
II/NMEMI. CRISPR loci appear to be mobile elements with
a wide host range. This model represents a protein found
only in CRISPR-containing species, near other
CRISPR-associated proteins (cas), as part of the NMENI
subtype of CRISPR/Cas locus. The species range so far
for this protein is animal pathogens and commensals only
[Mobile and extrachromosomal element functions, Other].
Length = 805
Score = 31.2 bits (71), Expect = 1.4
Identities = 17/60 (28%), Positives = 30/60 (50%), Gaps = 5/60 (8%)
Query: 379 RFPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKKISLANLPKLKLFY 438
R + K+ K+R KK + K K+ +++ EI +E ++ S L KL+L+Y
Sbjct: 524 REEQGTNFGKRNSKERYKKNEDKIKEFASALGK----EILKEEPTENSSKNIL-KLRLYY 578
>gnl|CDD|220357 pfam09716, ETRAMP, Malarial early transcribed membrane protein
(ETRAMP). These sequences represent a family of
proteins from the malaria parasite Plasmodium
falciparum, several of which have been shown to be
expressed specifically in the ring stage as well as the
rodent parasite Plasmodium yoelii. A homologue from
Plasmodium chabaudi was localized to the parasitophorous
vacuole membrane. Members have an initial hydrophobic,
Phe/Tyr-rich, stretch long enough to span the membrane,
a highly charged region rich in Lys, a second putative
transmembrane region and a second highly charged, low
complexity sequence region. Some members have up to 100
residues of additional C-terminal sequence. These genes
have been shown to be found in the sub-telomeric regions
of both Plasmodium falciparum and P. yoelii chromosomes.
Length = 82
Score = 28.8 bits (65), Expect = 1.5
Identities = 17/33 (51%), Positives = 18/33 (54%), Gaps = 2/33 (6%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISS 413
KKKK KK KK +KKKK K I ISS
Sbjct: 28 VAKKKKLKKLKKIDDDLEKKKKNK--KKIIISS 58
>gnl|CDD|237276 PRK13024, PRK13024, bifunctional preprotein translocase subunit
SecD/SecF; Reviewed.
Length = 755
Score = 31.0 bits (71), Expect = 1.5
Identities = 12/22 (54%), Positives = 18/22 (81%)
Query: 384 KKKKKKKKKKRKKKKKKKKKKV 405
+K++ KKKKKRKK KK + +K+
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEKI 754
Score = 29.8 bits (68), Expect = 3.7
Identities = 11/21 (52%), Positives = 17/21 (80%)
Query: 382 EKKKKKKKKKKKRKKKKKKKK 402
EK++ KKKKK+K+ KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753
Score = 28.7 bits (65), Expect = 9.1
Identities = 9/22 (40%), Positives = 16/22 (72%)
Query: 385 KKKKKKKKKRKKKKKKKKKKVN 406
+K++ KKKK++KK KK + +
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEKI 754
>gnl|CDD|183052 PRK11244, phnP, carbon-phosphorus lyase complex accessory protein;
Provisional.
Length = 250
Score = 30.3 bits (69), Expect = 1.6
Identities = 28/100 (28%), Positives = 47/100 (47%), Gaps = 6/100 (6%)
Query: 285 SGHKITYSGDT--MPCDALVSIGKNS-DLLIHEATHEDELEKEAALKMHSTVSQAIRIGR 341
+ ++ Y DT +P D L + N DLL+ + +H +E A + H+ ++ A+ I
Sbjct: 153 AHSRVAYLTDTVGLPEDTLKFLRNNQPDLLVLDCSHP---PQEDAPRNHNDLTTALAIIE 209
Query: 342 EMRAKFVLLTHFSQRYAKLPRLNKDLSENVGIAFDNMRFP 381
+R V+LTH S + N L V +A+D M
Sbjct: 210 VLRPPRVILTHISHQLDAWLMENAALPSGVEVAYDGMEIG 249
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 31.0 bits (70), Expect = 1.7
Identities = 15/42 (35%), Positives = 16/42 (38%)
Query: 386 KKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKKIS 427
+ K KK K KKK KK KKK E D K
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSE 184
Score = 29.4 bits (66), Expect = 5.1
Identities = 12/31 (38%), Positives = 18/31 (58%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNSIQIS 412
+ KK K KKK K+ KKK+ K+ + + S
Sbjct: 145 KAKKGKAKKKTKKSKKKEAKESSDKDDEEES 175
Score = 28.6 bits (64), Expect = 8.4
Identities = 11/24 (45%), Positives = 15/24 (62%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
+K K KKK KK +KK+ K+ K
Sbjct: 146 AKKGKAKKKTKKSKKKEAKESSDK 169
Score = 28.6 bits (64), Expect = 8.8
Identities = 10/26 (38%), Positives = 13/26 (50%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVN 406
K KKK KK KK++ K+ K
Sbjct: 148 KGKAKKKTKKSKKKEAKESSDKDDEE 173
>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are approximately 90 amino acids in length.
There are two completely conserved L residues that may
be functionally important.
Length = 70
Score = 28.1 bits (63), Expect = 1.8
Identities = 11/43 (25%), Positives = 28/43 (65%)
Query: 362 RLNKDLSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKK 404
LN DL + ++ E+K++++++ +KR+++K+++K K
Sbjct: 13 MLNADLLAKLKAKKKELKAEEEKREEEEEARKREERKEREKNK 55
>gnl|CDD|232831 TIGR00115, tig, trigger factor. Trigger factor is a
ribosome-associated molecular chaperone and is the first
chaperone to interact with nascent polypeptide. Trigger
factor can bind at the same time as the signal
recognition particle (SRP), but is excluded by the SRP
receptor (FtsY). The central domain of trigger factor
has peptidyl-prolyl cis/trans isomerase activity. This
protein is found in a single copy in virtually every
bacterial genome [Protein fate, Protein folding and
stabilization].
Length = 408
Score = 30.6 bits (70), Expect = 1.8
Identities = 24/111 (21%), Positives = 49/111 (44%), Gaps = 13/111 (11%)
Query: 359 KLPRLNKDLSENVGIAFDNM-RFPEKKKKKKKKKKKRKKKKKKKKKKV-----NNSIQIS 412
+LP L+ + ++++G F+ + +K +++KK + K K K++ + NN ++
Sbjct: 233 ELPELDDEFAKSLGEEFETLEELKADIRKNLEREKKERAKNKLKEQLLDKLVENNEFEL- 291
Query: 413 SRPE--ICREFDSKKISLANLPKLKLFYP--ALKAMFAEYQDEIENRAIRR 459
PE + +E D + + LK E ++E A RR
Sbjct: 292 --PESLVEQEIDRLLEQALQQLQQQGIDLEEYLKDTEEELREEFREEAERR 340
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 30.6 bits (69), Expect = 1.9
Identities = 9/24 (37%), Positives = 17/24 (70%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
E K + K+++K K++ K++KKK
Sbjct: 96 KEPKNESGKEEEKEKEQVKEEKKK 119
Score = 30.2 bits (68), Expect = 3.0
Identities = 10/25 (40%), Positives = 19/25 (76%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKV 405
P+ + K+++K+K + K++KKKKK
Sbjct: 98 PKNESGKEEEKEKEQVKEEKKKKKE 122
Score = 29.5 bits (66), Expect = 4.3
Identities = 13/44 (29%), Positives = 24/44 (54%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKK 425
+++ K++KKKKK K K++ K +K + P+ + KK
Sbjct: 110 KEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKK 153
Score = 29.5 bits (66), Expect = 5.1
Identities = 10/23 (43%), Positives = 20/23 (86%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKK 403
E++K+K++ K+++KKKK+K K+
Sbjct: 104 KEEEKEKEQVKEEKKKKKEKPKE 126
Score = 29.5 bits (66), Expect = 5.1
Identities = 10/45 (22%), Positives = 25/45 (55%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKK 425
K+K+K+K+KK ++ + ++++K ++ SRP+ +
Sbjct: 141 RPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPN 185
Score = 29.1 bits (65), Expect = 6.1
Identities = 11/24 (45%), Positives = 20/24 (83%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
E K+++K+K++ K++KKKKK+K
Sbjct: 100 NESGKEEEKEKEQVKEEKKKKKEK 123
Score = 29.1 bits (65), Expect = 7.0
Identities = 14/45 (31%), Positives = 26/45 (57%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKK 425
EK++ K++KKKK++K K++ K + RP +E + +K
Sbjct: 108 KEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEK 152
>gnl|CDD|221012 pfam11169, DUF2956, Protein of unknown function (DUF2956). This
family of proteins with unknown function appears to be
restricted to Gammaproteobacteria.
Length = 103
Score = 28.8 bits (65), Expect = 1.9
Identities = 11/39 (28%), Positives = 18/39 (46%), Gaps = 7/39 (17%)
Query: 372 GIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQ 410
GIA E KK++K K ++ K +K++ K
Sbjct: 35 GIA-------EYKKQQKAKAREADKARKQQLKAKQRQAA 66
>gnl|CDD|218612 pfam05501, DUF755, Domain of unknown function (DUF755). This
family is predominated by ORFs from Circoviridae. The
function of this family remains to be determined.
Length = 122
Score = 29.3 bits (66), Expect = 2.0
Identities = 18/64 (28%), Positives = 29/64 (45%), Gaps = 7/64 (10%)
Query: 350 LTHFSQRYAKLPRLNKDLSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSI 409
L H SQ+ + R K+ ++ + +KK+KK + R KK KK+ +S
Sbjct: 54 LIHLSQQLQQPQRKRKEQQKSS-------QTHKKKRKKSRHVSSRSAKKISAKKRRRSSS 106
Query: 410 QISS 413
SS
Sbjct: 107 SSSS 110
>gnl|CDD|221857 pfam12923, RRP7, Ribosomal RNA-processing protein 7 (RRP7). RRP7
is an essential protein in yeast that is involved in
pre-rRNA processing and ribosome assembly. It is
speculated to be required for correct assembly of rpS27
into the pre-ribosomal particle.
Length = 131
Score = 29.1 bits (66), Expect = 2.0
Identities = 10/25 (40%), Positives = 18/25 (72%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVN 406
+ K ++++K ++KKKKKKK+ N
Sbjct: 67 SRNKAAEERRKLKEKKKKKKKELEN 91
Score = 28.0 bits (63), Expect = 5.1
Identities = 10/25 (40%), Positives = 16/25 (64%)
Query: 380 FPEKKKKKKKKKKKRKKKKKKKKKK 404
K ++++K K KKKKKKK+ +
Sbjct: 66 ASRNKAAEERRKLKEKKKKKKKELE 90
>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
eukaryotic RNase H complex which cleaves RNA-DNA
hybrids. Ribonuclease H2B is one of the three proteins
of eukaryotic RNase H2 complex that is required for
nucleic acid binding and hydrolysis. RNase H is
classified into two families, type I (prokaryotic RNase
HI, eukaryotic RNase H1 and viral RNase H) and type II
(prokaryotic RNase HII and HIII, and eukaryotic RNase
H2/HII). RNase H endonucleolytically hydrolyzes an RNA
strand when it is annealed to a complementary DNA strand
in the presence of divalent cations, in DNA replication
and repair. The enzyme can be found in bacteria,
archaea, and eukaryotes. Most prokaryotic and eukaryotic
genomes contain multiple RNase H genes. Despite a lack
of evidence for homology from sequence comparisons, type
I and type II RNase H share a common fold and similar
steric configurations of the four acidic active-site
residues, suggesting identical or very similar catalytic
mechanisms. Eukaryotic RNase HII is active during
replication and is believed to play a role in removal of
Okazaki fragment primers and single ribonucleotides in
DNA-DNA duplexes. Eukaryotic RNase HII is functional
when it forms a complex with RNase H2B and RNase H2C
proteins. It is speculated that the two accessory
subunits are required for correct folding of the
catalytic subunit of RNase HII. Mutations in the three
subunits of human RNase HII cause neurological disorder.
Length = 211
Score = 30.0 bits (68), Expect = 2.0
Identities = 17/33 (51%), Positives = 20/33 (60%), Gaps = 1/33 (3%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISS 413
+++KKKKKKK KKKK KK V ISS
Sbjct: 177 NAEEEKKKKKKKSAKKKKLKKVAAVGMK-AISS 208
Score = 28.8 bits (65), Expect = 5.3
Identities = 14/26 (53%), Positives = 15/26 (57%)
Query: 377 NMRFPEKKKKKKKKKKKRKKKKKKKK 402
N EKKKKKKK KK+K KK
Sbjct: 176 NNAEEEKKKKKKKSAKKKKLKKVAAV 201
Score = 28.8 bits (65), Expect = 5.7
Identities = 12/29 (41%), Positives = 18/29 (62%)
Query: 377 NMRFPEKKKKKKKKKKKRKKKKKKKKKKV 405
+KK +++KK+KKKK KKKK+
Sbjct: 167 EAARDKKKSNNAEEEKKKKKKKSAKKKKL 195
Score = 28.0 bits (63), Expect = 8.8
Identities = 14/36 (38%), Positives = 19/36 (52%), Gaps = 1/36 (2%)
Query: 374 AFDNMRFPEKKKKKKKKKKKRKKKKKKKK-KKVNNS 408
A + +++KKKK+KK KKKK KKV
Sbjct: 166 AEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAV 201
>gnl|CDD|151885 pfam11446, DUF2897, Protein of unknown function (DUF2897). This is
a bacterial family of uncharacterized proteins.
Length = 55
Score = 27.6 bits (62), Expect = 2.1
Identities = 10/27 (37%), Positives = 16/27 (59%)
Query: 377 NMRFPEKKKKKKKKKKKRKKKKKKKKK 403
M+FP+ K+KKKK K+ ++ K
Sbjct: 28 KMKFPQFDKQKKKKAKRNAEQDAKNSA 54
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 30.8 bits (70), Expect = 2.2
Identities = 11/23 (47%), Positives = 19/23 (82%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
E+K+ K++K+ RK KK+++KKK
Sbjct: 558 EEKEALKEQKRLRKLKKQEEKKK 580
Score = 30.0 bits (68), Expect = 3.2
Identities = 10/23 (43%), Positives = 19/23 (82%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
EK+ K++K+ ++ KK+++KKKK
Sbjct: 559 EKEALKEQKRLRKLKKQEEKKKK 581
Score = 29.6 bits (67), Expect = 4.9
Identities = 10/23 (43%), Positives = 19/23 (82%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
EK++K+ K++KR +K KK+++K
Sbjct: 556 EKEEKEALKEQKRLRKLKKQEEK 578
>gnl|CDD|222366 pfam13764, E3_UbLigase_R4, E3 ubiquitin-protein ligase UBR4. This
is a family of E## ubiquitin ligase enzymes.
Length = 794
Score = 30.8 bits (70), Expect = 2.3
Identities = 11/56 (19%), Positives = 25/56 (44%)
Query: 368 SENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDS 423
E V D +R + +K++ R+K+ + + ++ QI + P + +S
Sbjct: 403 KEGVAKKIDEVRDETRAEKRRLAMAMREKQLQALGMRTSSGGQIVASPRLLEGIES 458
>gnl|CDD|217023 pfam02414, Borrelia_orfA, Borrelia ORF-A. This protein is encoded
by an open reading frame in plasmid borne DNA repeats of
Borrelia species. This protein is known as ORF-A. The
function of this putative protein is unknown.
Length = 285
Score = 30.0 bits (68), Expect = 2.3
Identities = 15/56 (26%), Positives = 27/56 (48%), Gaps = 2/56 (3%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVN--NSIQISSRPEICREFDSKKISLANLPKLK 435
+++ KKK K K K KK KK N +++ + ++ + ++ L KLK
Sbjct: 144 KEESKKKNSKNIEKLKIKKYFKKCNFKTNLKKDIKIKLLKIIKIEENFLIKNKKLK 199
>gnl|CDD|224158 COG1237, COG1237, Metal-dependent hydrolases of the beta-lactamase
superfamily II [General function prediction only].
Length = 259
Score = 30.1 bits (68), Expect = 2.3
Identities = 16/50 (32%), Positives = 24/50 (48%), Gaps = 2/50 (4%)
Query: 137 ILLDCGEGTYSQLVRLYGSAVDTLLSQLSAVYISHLHADHHLGLFSVIKA 186
IL D G T S ++ + L + AV +SH H DH GL +++
Sbjct: 34 ILFDTG--TDSDVLLHNARLLGVDLRDIDAVVLSHGHYDHTGGLPYLLEE 81
>gnl|CDD|240576 cd12932, RRP7_like, RRP7 domain ribosomal RNA-processing protein 7
(Rrp7p), ribosomal RNA-processing protein 7 homolog A
(Rrp7A), and similar proteins. This CD corresponds to
the RRP7 domain of Rrp7p and Rrp7A. Rrp7p is encoded by
YCL031C gene from Saccharomyces cerevisiae. It is an
essential yeast protein involved in pre-rRNA processing
and ribosome assembly, and is speculated to be required
for correct assembly of rpS27 into the pre-ribosomal
particle. Rrp7A, also termed gastric cancer antigen
Zg14, is the Rrp7p homolog mainly found in Metazoans.
The cellular function of Rrp7A remains unclear
currently. Both Rrp7p and Rrp7A harbor an N-terminal RNA
recognition motif (RRM), also termed RBD (RNA binding
domain) or RNP (ribonucleoprotein domain), and a
C-terminal RRP7 domain.
Length = 118
Score = 28.8 bits (65), Expect = 2.3
Identities = 10/26 (38%), Positives = 17/26 (65%)
Query: 379 RFPEKKKKKKKKKKKRKKKKKKKKKK 404
+ ++ + K K+K KKKKKKK+ +
Sbjct: 64 KTAREEAVEAKAKEKEKKKKKKKELE 89
Score = 27.2 bits (61), Expect = 9.9
Identities = 9/23 (39%), Positives = 17/23 (73%)
Query: 384 KKKKKKKKKKRKKKKKKKKKKVN 406
++ + K K+++KKKKKKK+ +
Sbjct: 68 EEAVEAKAKEKEKKKKKKKELED 90
>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
Validated.
Length = 860
Score = 30.6 bits (69), Expect = 2.4
Identities = 14/35 (40%), Positives = 17/35 (48%), Gaps = 3/35 (8%)
Query: 379 RFPEKKKKKKKKK---KKRKKKKKKKKKKVNNSIQ 410
+ P +KK K KKK K K K KKK S +
Sbjct: 744 KTPYEKKTKAKKKSASTKGKAAKTVKKKSKAKSKK 778
Score = 29.0 bits (65), Expect = 6.7
Identities = 11/35 (31%), Positives = 14/35 (40%)
Query: 383 KKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEI 417
K K K KK+ K K KK K + P +
Sbjct: 760 TKGKAAKTVKKKSKAKSKKTTKKRAGPLYTPSPAL 794
Score = 29.0 bits (65), Expect = 7.5
Identities = 11/24 (45%), Positives = 12/24 (50%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKV 405
K K KKK K K KK KK+
Sbjct: 762 GKAAKTVKKKSKAKSKKTTKKRAG 785
>gnl|CDD|220600 pfam10147, CR6_interact, Growth arrest and DNA-damage-inducible
proteins-interacting protein 1. Members of this family
of proteins act as negative regulators of G1 to S cell
cycle phase progression by inhibiting cyclin-dependent
kinases. Inhibitory effects are additive with GADD45
proteins but occur also in the absence of GADD45
proteins. Furthermore, they act as a repressor of the
orphan nuclear receptor NR4A1 by inhibiting AB
domain-mediated transcriptional activity.
Length = 217
Score = 29.8 bits (67), Expect = 2.4
Identities = 26/92 (28%), Positives = 44/92 (47%), Gaps = 13/92 (14%)
Query: 318 EDELEKEAALKMHSTVSQAIRIGREMRAKFVLLTHFSQRYAK--LPRLNKDLSENVGIAF 375
E E+ K A KM ++ R+ K R AK RL + E+ G
Sbjct: 123 EAEIAKNMA-KMPQMIADWRAQKRKREQK--------ARAAKERKERLVAEAREHFGYWV 173
Query: 376 D--NMRFPEKKKKKKKKKKKRKKKKKKKKKKV 405
D + RF E ++K+K++KK+ K+ K+++K+
Sbjct: 174 DPRDPRFQEMLQQKEKEEKKKVKEAKRREKEE 205
>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
eukaryotic snRNP [Transcription].
Length = 564
Score = 30.4 bits (69), Expect = 2.5
Identities = 38/165 (23%), Positives = 60/165 (36%), Gaps = 19/165 (11%)
Query: 318 EDELEK--EAALKMHSTVSQAIRIGREMRAKFVLLTHFSQRYAKLPRLNKDLSENVGIAF 375
E EL+K K + + + E+R K LL YA L + + L F
Sbjct: 292 EKELKKLENKLEKQEDELEELEKAAEELRQKGELL------YANLQLIEEGLKSVRLADF 345
Query: 376 DNMRFPEKKKKKKKKK-------KKRKKKKKKKKKKVNNSIQISSRPEICREFDSKKISL 428
E K + K K + KK KK K KVN Q+S E ++S K +L
Sbjct: 346 YGN--EEIKIELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQLSELKEAIAYYESAKTAL 403
Query: 429 ANLPKLKLFYPALKAMFAEYQDEIENRAIRRNLKQESTYLARYPS 473
K + + E ++++ +R K+ + S
Sbjct: 404 EKAEGKKAIEEIREELIEE--GLLKSKKKKRKKKEWFEKFRWFVS 446
>gnl|CDD|219882 pfam08524, rRNA_processing, rRNA processing. This is a family of
proteins that are involved in rRNA processing. In a
localisation study they were found to localise to the
nucleus and nucleolus. The family also includes other
metazoa members from plants to mammals where the protein
has been named BR22 and is associated with TTF-1,
thyroid transcription factor 1. In the lungs, the family
binds TTF-1 to form a complex which influences the
expression of the key lung surfactant protein-B (SP-B)
and -C (SP-C), the small hydrophobic surfactant proteins
that maintain surface tension in alveoli.
Length = 150
Score = 29.1 bits (65), Expect = 2.7
Identities = 14/46 (30%), Positives = 31/46 (67%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKKIS 427
EKK+ K++K+++++K+ K++K I++S + + RE KK++
Sbjct: 76 EKKEIAKQRKREQREKELAKRQKELEKIELSKKKQKERERRRKKLT 121
Score = 27.9 bits (62), Expect = 7.1
Identities = 15/47 (31%), Positives = 27/47 (57%), Gaps = 1/47 (2%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISS-RPEICREFDSKKI 426
PEK+ +K+ K ++ +K +KKKK++ +I+ R RE + K
Sbjct: 50 PEKESAEKQVKSSKEDRKFEKKKKLDEKKEIAKQRKREQREKELAKR 96
>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family. This
family includes gbp a protein from Soybean that binds to
GAGA element dinucleotide repeat DNA. It seems likely
that the this domain mediates DNA binding. This putative
domain contains several conserved cysteines and a
histidine suggesting this may be a zinc-binding DNA
interaction domain.
Length = 301
Score = 29.9 bits (67), Expect = 2.8
Identities = 14/44 (31%), Positives = 18/44 (40%), Gaps = 4/44 (9%)
Query: 381 PEKKKKKKKKKKKR----KKKKKKKKKKVNNSIQISSRPEICRE 420
PE K+ KK KK + K K KK KK + S +
Sbjct: 142 PEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDP 185
Score = 28.7 bits (64), Expect = 6.5
Identities = 10/34 (29%), Positives = 14/34 (41%)
Query: 383 KKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPE 416
K K K KK +KK + I S+P+
Sbjct: 157 KVPKAPKPKKPKKKGSVSNRSVKMPGIDPRSKPD 190
>gnl|CDD|216337 pfam01159, Ribosomal_L6e, Ribosomal protein L6e.
Length = 108
Score = 28.4 bits (64), Expect = 3.2
Identities = 25/85 (29%), Positives = 39/85 (45%), Gaps = 14/85 (16%)
Query: 366 DLSE-NVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSK 424
D+S V ++ F +KKKKKKK + +KK+KK+V+ R+ D K
Sbjct: 22 DISGVKVPEKINDEYFKREKKKKKKKSEGEFFAEKKEKKEVSEQ----------RKADQK 71
Query: 425 KISLANLPKLK---LFYPALKAMFA 446
+ A L +K LK+ F+
Sbjct: 72 AVDAALLAAIKKHPDLKQYLKSRFS 96
>gnl|CDD|179371 PRK02113, PRK02113, putative hydrolase; Provisional.
Length = 252
Score = 29.8 bits (67), Expect = 3.2
Identities = 17/44 (38%), Positives = 23/44 (52%), Gaps = 7/44 (15%)
Query: 137 ILLDCGEGTYSQLVRLYGSAVDTLLSQLSAVYISHLHADHHLGL 180
IL+DCG Q++RL ++ AV I+H H DH GL
Sbjct: 47 ILIDCGPDFREQMLRLP-------FGKIDAVLITHEHYDHVGGL 83
>gnl|CDD|129141 TIGR00030, S21p, ribosomal protein S21. This model describes
bacterial ribosomal protein S21 and most mitochondrial
and chloroplast equivalents [Protein synthesis,
Ribosomal proteins: synthesis and modification].
Length = 58
Score = 27.2 bits (61), Expect = 3.3
Identities = 11/44 (25%), Positives = 23/44 (52%), Gaps = 1/44 (2%)
Query: 360 LPRLNKDLSENVGIA-FDNMRFPEKKKKKKKKKKKRKKKKKKKK 402
L R + L + + R+ EK +++++K+K K+ +KK
Sbjct: 15 LRRFKRKLEKEGILRELKKRRYYEKPSERRRRKEKAAAKRIRKK 58
>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
fragment found in eukaryotic DNA topoisomerase (topo) IB
proteins similar to the monomeric yeast and human topo I
and heterodimeric topo I from Leishmania donvanni. Topo
I enzymes are divided into: topo type IA (bacterial)
and type IB (eukaryotic). Topo I relaxes superhelical
tension in duplex DNA by creating a single-strand nick,
the broken strand can then rotate around the unbroken
strand to remove DNA supercoils and, the nick is
religated, liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit re-ligation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topo I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I play putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 215
Score = 29.5 bits (67), Expect = 3.3
Identities = 12/25 (48%), Positives = 18/25 (72%)
Query: 380 FPEKKKKKKKKKKKRKKKKKKKKKK 404
F E+K+KKK K+ KK K++K+K
Sbjct: 94 FEEEKEKKKAMSKEEKKAIKEEKEK 118
>gnl|CDD|234533 TIGR04285, nucleoid_noc, nucleoid occlusion protein. This model
describes nucleoid occlusion protein, a close homolog to
ParB chromosome partitioning proteins including Spo0J in
Bacillus subtilis. Its gene often is located near the
gene for the Spo0J ortholog. This protein bind a
specific DNA sequence and blocks cytokinesis from
happening until chromosome segregation is complete.
Length = 255
Score = 29.4 bits (67), Expect = 3.3
Identities = 11/30 (36%), Positives = 20/30 (66%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNSIQI 411
E+ KK +K ++ KKKKK++K + ++I
Sbjct: 188 EELIKKLLEKPEKPKKKKKRRKGFSKDVRI 217
>gnl|CDD|213392 cd12149, Flavi_E_C, Immunoglobulin-like domain III (C-terminal
domain) of Flavivirus envelope glycoprotein E. The
C-terminal domain (domain III) of Flavivirus
glycoprotein E appears to be involved in low-affinity
interactions with negatively charged glycoaminoglycans
on the host cell surface. Domain III may also play a
role in interactions with alpha-v-beta-3 integrins in
West Nile virus, Japanese encephalitis virus, and Dengue
virus. The interface between domain I and domain III
appears to be destabilized by the low-pH environment of
the endosome, and domain III may play a vital role in
the conformational changes of envelope glycoprotein E
that follow the clathrin-mediated endocytosis of viral
particles and are a prerequisite to membrane fusion.
Length = 91
Score = 28.0 bits (63), Expect = 3.3
Identities = 8/27 (29%), Positives = 12/27 (44%), Gaps = 5/27 (18%)
Query: 283 TKSGH-----KITYSGDTMPCDALVSI 304
+GH ++ YSG PC V +
Sbjct: 9 ADTGHGTVVMEVKYSGTDAPCRIPVRV 35
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 29.7 bits (67), Expect = 3.3
Identities = 17/55 (30%), Positives = 25/55 (45%), Gaps = 8/55 (14%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKKISLANLPKLKL 436
E KKK +KKKKK+K+KK+ K + + + K + PK K
Sbjct: 65 ESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKK--------SKKTKKKPPKPKP 111
>gnl|CDD|218435 pfam05104, Rib_recp_KP_reg, Ribosome receptor lysine/proline rich
region. This highly conserved region is found towards
the C-terminus of the transmembrane domain. The function
is unclear.
Length = 151
Score = 28.8 bits (64), Expect = 3.7
Identities = 13/29 (44%), Positives = 19/29 (65%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVNNSI 409
+ KKKK+K ++K K KKK++K N I
Sbjct: 23 QKSDKKKKEKVSEKKGKSKKKEEKPNGKI 51
>gnl|CDD|241400 cd13246, PH_Scd1, Shape and Conjugation Deficiency 1 Pleckstrin
homology (PH) domain. Fission yeast Scd1 is an exchange
factor for Cdc42 and an effector of Ras1, the homolog of
the human H-Ras. Scd2/Bem1 mediates Cdc42 activation by
binding to Scd1/Cdc24 and to Cdc42. Ras1 regulates
Scd1/Cdc24/Ral1, which is a putative guanine nucleotide
exchange factor for Cdc42, a member of the Rho family of
Ras-like proteins. Cdc42 then activates the Shk1/Orb2
protein kinase. Scd1 interacts with Klp5 and Klp6
kinesins to mediate cytokinesis. PH domains have diverse
functions, but in general are involved in targeting
proteins to the appropriate cellular location or in the
interaction with a binding partner. They share little
sequence conservation, but all have a common fold, which
is electrostatically polarized. Less than 10% of PH
domains bind phosphoinositide phosphates (PIPs) with
high affinity and specificity. PH domains are
distinguished from other PIP-binding domains by their
specific high-affinity binding to PIPs with two vicinal
phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
PtdIns(3,4,5)P3 which results in targeting some PH
domain proteins to the plasma membrane. A few display
strong specificity in lipid binding. Any specificity is
usually determined by loop regions or insertions in the
N-terminus of the domain, which are not conserved across
all PH domains. PH domains are found in cellular
signaling proteins such as serine/threonine kinase,
tyrosine kinases, regulators of G-proteins, endocytotic
GTPases, adaptors, as well as cytoskeletal associated
molecules and in lipid associated enzymes.
Length = 148
Score = 28.8 bits (65), Expect = 3.9
Identities = 16/42 (38%), Positives = 20/42 (47%), Gaps = 2/42 (4%)
Query: 375 FDNMR--FPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSR 414
F+ + E KKKKKK K K KKKK +Q+ R
Sbjct: 53 FERILLCCKEVKKKKKKSSLKSKSSSSSKKKKKKGPLQLKGR 94
>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62.
Length = 217
Score = 29.4 bits (66), Expect = 3.9
Identities = 12/19 (63%), Positives = 13/19 (68%)
Query: 385 KKKKKKKKKRKKKKKKKKK 403
K KK KK KKKKKKK+
Sbjct: 199 KSSAKKDKKSKKKKKKKRS 217
Score = 28.6 bits (64), Expect = 6.6
Identities = 17/43 (39%), Positives = 25/43 (58%), Gaps = 5/43 (11%)
Query: 366 DLSENVGIAFDNMR----FPEKKKKKKKKKKKRKKKKKKKKKK 404
+L+E+VG D+ + + K K KK K+ KKKKKKK+
Sbjct: 176 NLTEDVGF-LDSFKPLYTWHYKGDKSSAKKDKKSKKKKKKKRS 217
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called The
RES complex, pre-mRNA retention and splicing complex.
Subunits of this complex are not essential for viability
of yeasts but they are required for efficient splicing
in vitro and in vivo. Furthermore, inactivation of this
complex causes pre-mRNA leakage from the nucleus. Bud13
contains a unique, phylogenetically conserved C-terminal
region of unknown function.
Length = 141
Score = 28.4 bits (64), Expect = 3.9
Identities = 8/25 (32%), Positives = 22/25 (88%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVN 406
E+K+++K+++K+ K++K++K+K+
Sbjct: 15 EEKREEKEREKEEKERKEEKEKEWG 39
>gnl|CDD|216919 pfam02181, FH2, Formin Homology 2 Domain.
Length = 372
Score = 29.6 bits (67), Expect = 4.0
Identities = 24/76 (31%), Positives = 32/76 (42%), Gaps = 16/76 (21%)
Query: 380 FPEKKKKKKKKKKKRKKKKKKKKKKV---------NNSIQISS----RPEICR---EFDS 423
F K KKKK KK ++K KKKKK++ N +I + EI + E D
Sbjct: 50 FSAKAKKKKSKKSEKKSSSKKKKKEISVLDPKRSQNIAILLRKLKLPPEEIVQAILEMDE 109
Query: 424 KKISLANLPKLKLFYP 439
+ L L L P
Sbjct: 110 SVLGLELLENLLKMAP 125
>gnl|CDD|177060 CHL00138, rps5, ribosomal protein S5; Validated.
Length = 143
Score = 28.4 bits (64), Expect = 4.1
Identities = 11/31 (35%), Positives = 13/31 (41%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNSIQIS 412
KK KKK +K K+ K IQI
Sbjct: 2 LFLLKKMYNKKKNRKSNIKENKWEERVIQIK 32
>gnl|CDD|218899 pfam06102, DUF947, Domain of unknown function (DUF947). Family of
eukaryotic proteins with unknown function.
Length = 168
Score = 28.8 bits (65), Expect = 4.3
Identities = 12/29 (41%), Positives = 18/29 (62%), Gaps = 2/29 (6%)
Query: 375 FDNMRFPEKKKKKKKKKKKRKKKKKKKKK 403
FD ++ + K+ K +KKRKK K+KK
Sbjct: 142 FDELK--KSKQLDKALEKKRKKNAGKEKK 168
Score = 28.4 bits (64), Expect = 6.3
Identities = 10/26 (38%), Positives = 15/26 (57%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNN 407
+KK + KK K+ K +KK+KK
Sbjct: 139 KKKFDELKKSKQLDKALEKKRKKNAG 164
>gnl|CDD|187770 cd09639, Cas3_I, CRISPR/Cas system-associated protein Cas3. CRISPR
(Clustered Regularly Interspaced Short Palindromic
Repeats) and associated Cas proteins comprise a system
for heritable host defense by prokaryotic cells against
phage and other foreign DNA; DEAD/DEAH box helicase DNA
helicase cas3'; Often but not always is fused to HD
nuclease domain; signature gene for Type I.
Length = 353
Score = 29.3 bits (66), Expect = 4.7
Identities = 19/84 (22%), Positives = 33/84 (39%), Gaps = 14/84 (16%)
Query: 300 ALVSIGKNSDLLIHEATHEDELEKEAALKMHSTVSQAIRIGREMRAKFVLLTHFSQRYAK 359
L SI NS L+ E DE L + + + ++ +L++ A
Sbjct: 118 TLASIA-NSLLIFDEVHFYDEYTLALILAV-------LEVLKDNDVPILLMS------AT 163
Query: 360 LPRLNKDLSENVGIAFDNMRFPEK 383
LP+ K+ +E +G +N K
Sbjct: 164 LPKFLKEYAEKIGYVEENEPLDLK 187
>gnl|CDD|219621 pfam07890, Rrp15p, Rrp15p. Rrp15p is required for the formation of
60S ribosomal subunits.
Length = 132
Score = 28.1 bits (63), Expect = 4.7
Identities = 8/28 (28%), Positives = 18/28 (64%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVNNSI 409
EK +KK K++ + +K++ +K +V +
Sbjct: 40 EKLEKKAKRQLRAEKRQALEKGRVKPVL 67
>gnl|CDD|219783 pfam08297, U3_snoRNA_assoc, U3 snoRNA associated. This family of
proteins is associated with U3 snoRNA. U3 snoRNA is
required for nucleolar processing of pre-18S ribosomal
RNA.
Length = 87
Score = 27.4 bits (61), Expect = 4.8
Identities = 13/44 (29%), Positives = 18/44 (40%)
Query: 363 LNKDLSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKKVN 406
L + E KK+KKK K + KK +K+ K V
Sbjct: 8 LEAIDEDRPPTPPPESYEEELKKQKKKNKLRFLKKLEKRPKDVK 51
Score = 26.6 bits (59), Expect = 9.3
Identities = 10/29 (34%), Positives = 16/29 (55%)
Query: 375 FDNMRFPEKKKKKKKKKKKRKKKKKKKKK 403
++ +KKK K + KK +K+ K KK
Sbjct: 24 YEEELKKQKKKNKLRFLKKLEKRPKDVKK 52
>gnl|CDD|222466 pfam13945, NST1, Splicing factor, salt tolerance regulator. NST1
is a family of proteins that seem to be involved,
directly or indirectly, in the salt sensitivity of some
cellular functions in yeast. These proteins also
interact with the splicing factor Msl1p.
Length = 189
Score = 28.7 bits (64), Expect = 4.9
Identities = 10/20 (50%), Positives = 12/20 (60%)
Query: 393 KRKKKKKKKKKKVNNSIQIS 412
K KKKKKK+ K + S S
Sbjct: 32 KSKKKKKKRSKATSPSHNAS 51
>gnl|CDD|223898 COG0828, RpsU, Ribosomal protein S21 [Translation, ribosomal
structure and biogenesis].
Length = 67
Score = 26.9 bits (60), Expect = 4.9
Identities = 10/22 (45%), Positives = 17/22 (77%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKK 403
EK+K+KK +KRK K+ +K++
Sbjct: 43 EKRKRKKAAARKRKFKRLRKEQ 64
Score = 26.5 bits (59), Expect = 6.2
Identities = 10/24 (41%), Positives = 16/24 (66%)
Query: 379 RFPEKKKKKKKKKKKRKKKKKKKK 402
F EK +K+K+KK +K+K K+
Sbjct: 36 EFYEKPSEKRKRKKAAARKRKFKR 59
>gnl|CDD|220763 pfam10453, NUFIP1, Nuclear fragile X mental retardation-interacting
protein 1 (NUFIP1). Proteins in this family have been
implicated in the assembly of the large subunit of the
ribosome and in telomere maintenance. Some proteins in
this family contain a CCCH zinc finger. This family
contains a protein called human fragile X mental
retardation-interacting protein 1, which is known to
bind RNA and is phosphorylated upon DNA damage.
Length = 57
Score = 26.6 bits (59), Expect = 5.1
Identities = 7/25 (28%), Positives = 14/25 (56%)
Query: 378 MRFPEKKKKKKKKKKKRKKKKKKKK 402
+P K +KKKK + ++ K+ +
Sbjct: 33 KNYPTKANIEKKKKAEEERIKRGEL 57
>gnl|CDD|220267 pfam09494, Slx4, Slx4 endonuclease. The Slx4 protein is a
heteromeric structure-specific endonuclease found in
fungi. Slx4 with Slx1 acts as a nuclease on branched DNA
substrates, particularly simple-Y, 5'-flap, or
replication fork structures by cleaving the strand
bearing the 5' non-homologous arm at the branch junction
and thus generating ligatable nicked products from
5'-flap or replication fork substrates.
Length = 627
Score = 29.6 bits (66), Expect = 5.1
Identities = 17/58 (29%), Positives = 25/58 (43%), Gaps = 4/58 (6%)
Query: 353 FSQRYAKLPRLNKDLSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQ 410
Q ++ D N G N+ KKKK KK K +K+ K+K KK+ +
Sbjct: 45 LRQVISRFTSFESDDQANSG----NVSGKRVPKKKKIKKPKLRKRTKRKNKKIKSLTA 98
>gnl|CDD|178714 PLN03170, PLN03170, chalcone synthase; Provisional.
Length = 401
Score = 29.3 bits (65), Expect = 5.1
Identities = 16/58 (27%), Positives = 29/58 (50%), Gaps = 11/58 (18%)
Query: 11 CSSVTVLDC--PSEDYLDSLLGESIFSQHQSCAMNEDDKAALVAHFSPHHIMSHPRYK 66
CS +T + PSE +LDS++G+++F D AA++ P + P ++
Sbjct: 194 CSEITAVTFRGPSESHLDSMVGQALFG---------DGAAAVIVGADPDERVERPLFQ 242
>gnl|CDD|240256 PTZ00071, PTZ00071, 40S ribosomal protein S24; Provisional.
Length = 132
Score = 28.1 bits (63), Expect = 5.1
Identities = 10/25 (40%), Positives = 16/25 (64%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKV 405
+K +K+ K +RKK + K+K KV
Sbjct: 103 KAGRKARKELKNRRKKVRGKEKTKV 127
>gnl|CDD|240578 cd12951, RRP7_Rrp7A, RRP7 domain ribosomal RNA-processing protein 7
homolog A (Rrp7A) and similar proteins. The family
corresponds to the RRP7 domain of Rrp7A, also termed
gastric cancer antigen Zg14, and similar proteins which
are yeast ribosomal RNA-processing protein 7 (Rrp7p)
homologs mainly found in Metazoans. The cellular
function of Rrp7A remains unclear currently. Rrp7A
harbors an N-terminal RNA recognition motif (RRM), also
termed RBD (RNA binding domain) or RNP
(ribonucleoprotein domain), and a C-terminal RRP7
domain.
Length = 129
Score = 28.0 bits (63), Expect = 5.2
Identities = 10/28 (35%), Positives = 16/28 (57%)
Query: 379 RFPEKKKKKKKKKKKRKKKKKKKKKKVN 406
+K+ K +++KKKKKKK+ N
Sbjct: 62 PKTARKESVAAKAAEKEKKKKKKKELKN 89
>gnl|CDD|204122 pfam09072, TMA7, Translation machinery associated TMA7. TMA7 plays
a role in protein translation. Deletions of the TMA7
gene results in altered protein synthesis rates.
Length = 63
Score = 26.7 bits (59), Expect = 5.3
Identities = 13/51 (25%), Positives = 24/51 (47%), Gaps = 6/51 (11%)
Query: 364 NKDLSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSR 414
K+L E +++ F +K+K++ K K+ K K K I+ S +
Sbjct: 18 KKELDE------EDLAFKQKQKEEAKALKELAAKAKGKGPLGGGGIKKSGK 62
>gnl|CDD|191968 pfam08213, DUF1713, Mitochondrial domain of unknown function
(DUF1713). This domain is found at the C terminal end
of mitochondrial proteins of unknown function.
Length = 34
Score = 25.8 bits (57), Expect = 5.3
Identities = 9/32 (28%), Positives = 20/32 (62%)
Query: 376 DNMRFPEKKKKKKKKKKKRKKKKKKKKKKVNN 407
D++ ++K KK K KK +K+ + ++K++
Sbjct: 3 DSVLRKRRRKMKKHKYKKLRKRTRALRRKLDK 34
>gnl|CDD|179323 PRK01710, murD, UDP-N-acetylmuramoyl-L-alanyl-D-glutamate
synthetase; Provisional.
Length = 458
Score = 29.3 bits (66), Expect = 5.3
Identities = 18/59 (30%), Positives = 29/59 (49%), Gaps = 6/59 (10%)
Query: 35 SQHQSCAMNEDDKAALVAHFSPHHIMSHPRYKEFMS------KFPSTTQHLVLNESNEC 87
S Q M+ + A+V + SP+H+ H +E++ K+ S LVLN+ NE
Sbjct: 174 SSFQLMTMDVSPEVAVVTNLSPNHLDVHKDMEEYIDAKKNIFKYQSENDLLVLNKDNEI 232
>gnl|CDD|199206 cd08962, GatD, GatD subunit of archaeal Glu-tRNA amidotransferase.
GatD is involved in the alternative synthesis of
Gln-tRNA(Gln) in archaea via the transamidation of
incorrectly charged Glu-tRNA(Gln). GatD is active as a
dimer, and it provides the amino group required for this
reaction. GatD is related to bacterial L-asparaginases
(amidohydrolases), which catalyze the hydrolysis of
asparagine to aspartic acid and ammonia. This CD spans
both the L-asparaginase_like domain and an N-terminal
supplementary domain.
Length = 402
Score = 29.1 bits (66), Expect = 5.5
Identities = 14/50 (28%), Positives = 22/50 (44%), Gaps = 6/50 (12%)
Query: 370 NVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSI-----QISSR 414
N+GI E +K +K K + ++ +KK SI I+SR
Sbjct: 37 NIGIDISIEEI-ELIEKGEKPKPELGEEIEKKPGLPKVSIISTGGTIASR 85
>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
Length = 1437
Score = 29.4 bits (67), Expect = 5.5
Identities = 13/53 (24%), Positives = 26/53 (49%)
Query: 374 AFDNMRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKKI 426
F+ + E +K K+ + KK + +KKK+ N +I ++ D ++I
Sbjct: 174 KFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQIGKKIDKEEI 226
>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
Length = 246
Score = 28.9 bits (65), Expect = 5.5
Identities = 8/24 (33%), Positives = 15/24 (62%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
P+ K K K K K+ +++ K++ K
Sbjct: 99 PKPKPKPKPKPVKKVEEQPKREVK 122
Score = 28.1 bits (63), Expect = 9.5
Identities = 10/24 (41%), Positives = 14/24 (58%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
PE K K K K K + KK +++ K
Sbjct: 95 PEPKPKPKPKPKPKPVKKVEEQPK 118
>gnl|CDD|197876 smart00792, Agouti, Agouti protein. The agouti protein regulates
pigmentation in the mouse hair follicle producing a
black hair with a subapical yellow band. A highly
homologous protein agouti signal protein (ASIP) is
present in humans and is expressed at highest levels in
adipose tissue where it may play a role in energy
homeostasis and possibly human pigmentation.
Length = 124
Score = 27.9 bits (62), Expect = 5.6
Identities = 16/56 (28%), Positives = 25/56 (44%), Gaps = 1/56 (1%)
Query: 360 LPRLNKDLSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRP 415
LN+ +V I N + P+K ++ +KK +KK+KK V S R
Sbjct: 34 NSSLNQADFPSVSIVGLNKK-PKKISAEEAEKKLLQKKEKKALTNVLRPEPRSPRR 88
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 29.1 bits (66), Expect = 5.6
Identities = 10/33 (30%), Positives = 21/33 (63%), Gaps = 10/33 (30%)
Query: 382 EKKKKKKKKKKKR----------KKKKKKKKKK 404
EKK++KKK++++ +K ++K++KK
Sbjct: 286 EKKEEKKKEEREAKLAKLSPEEQRKLEEKERKK 318
>gnl|CDD|227513 COG5186, PAP1, Poly(A) polymerase [RNA processing and
modification].
Length = 552
Score = 29.1 bits (65), Expect = 5.6
Identities = 11/34 (32%), Positives = 16/34 (47%)
Query: 398 KKKKKKKVNNSIQISSRPEICREFDSKKISLANL 431
K KKV+ + E+CRE+D S N+
Sbjct: 471 PVKPGKKVDIEQPVKEFIELCREYDEGDASGMNI 504
>gnl|CDD|220129 pfam09159, Ydc2-catalyt, Mitochondrial resolvase Ydc2 / RNA
splicing MRS1. Members of this family adopt a secondary
structure consisting of two beta sheets and one alpha
helix, arranged as a beta-alpha-beta motif. Each beta
sheet has five strands, arranged in a 32145 order, with
the second strand being antiparallel to the rest.
Mitochondrial resolvase Ydc2 is capable of resolving
Holliday junctions and cleaves DNA after 5'-CT-3' and
5'-TT-3' sequences. This family also contains the
mitochondrial RNA-splicing protein MRS1 which is
involved in the excision of group I introns.
Length = 254
Score = 28.9 bits (65), Expect = 5.7
Identities = 9/26 (34%), Positives = 13/26 (50%)
Query: 379 RFPEKKKKKKKKKKKRKKKKKKKKKK 404
R +KKK + KK+ K KK +
Sbjct: 148 RTEILAEKKKPRSKKKSSKNSKKLRI 173
Score = 28.5 bits (64), Expect = 6.8
Identities = 9/25 (36%), Positives = 15/25 (60%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVN 406
E +KKK + K+K K KK +++
Sbjct: 150 EILAEKKKPRSKKKSSKNSKKLRID 174
Score = 28.5 bits (64), Expect = 7.8
Identities = 9/22 (40%), Positives = 12/22 (54%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKK 402
EKKK + KKK + KK +
Sbjct: 153 AEKKKPRSKKKSSKNSKKLRID 174
>gnl|CDD|203444 pfam06424, PRP1_N, PRP1 splicing factor, N-terminal. This domain
is specific to the N-terminal part of the prp1 splicing
factor, which is involved in mRNA splicing (and possibly
also poly(A)+ RNA nuclear export and cell cycle
progression). This domain is specific to the N terminus
of the RNA splicing factor encoded by prp1. It is
involved in mRNA splicing and possibly also poly(A)and
RNA nuclear export and cell cycle progression.
Length = 131
Score = 28.0 bits (63), Expect = 5.9
Identities = 12/48 (25%), Positives = 29/48 (60%), Gaps = 5/48 (10%)
Query: 384 KKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKKISLANL 431
++ +++KK+R++K+K++ +K P+I ++F K +LA +
Sbjct: 72 ERMDERRKKRREQKEKEEIEKYRE-----ENPKIQQQFADLKRNLATV 114
>gnl|CDD|217787 pfam03910, Adeno_PV, Adenovirus minor core protein PV.
Length = 336
Score = 29.0 bits (65), Expect = 6.1
Identities = 10/24 (41%), Positives = 16/24 (66%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
K + K +K KR KK+KK+++K
Sbjct: 22 RPVKDEAKPRKIKRVKKRKKREEK 45
>gnl|CDD|185509 PTZ00196, PTZ00196, 60S ribosomal protein L36; Provisional.
Length = 98
Score = 27.3 bits (61), Expect = 6.2
Identities = 13/40 (32%), Positives = 19/40 (47%), Gaps = 1/40 (2%)
Query: 372 GIAFD-NMRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQ 410
GIA N K+ +K KRK K+K+ V + I+
Sbjct: 4 GIAVGLNKGHIVTKRARKPSPSKRKGLLSKRKRLVKDVIR 43
>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein. This family
consists of various plant development proteins which are
homologues of floricaula (FLO) and Leafy (LFY) proteins
which are floral meristem identity proteins. Mutations
in the sequences of these proteins affect flower and
leaf development.
Length = 382
Score = 28.8 bits (65), Expect = 6.2
Identities = 7/27 (25%), Positives = 17/27 (62%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVNN 407
P +KKK+++K+++K+ K+ +
Sbjct: 175 PGHSSDSEKKKQRKKQRRKRSKELRED 201
>gnl|CDD|234350 TIGR03766, TIGR03766, conserved hypothetical integral membrane
protein. Models TIGR03110, TIGR03111, and TIGR03112
describe a three-gene system found in several
Gram-positive bacteria, where TIGR03110 (XrtG) is
distantly related to a putative transpeptidase,
exosortase (TIGR02602). This model describes a small
clade that correlates by both gene clustering and
phyletic pattern, although imperfectly, to the three
gene system. Both this narrow clade, and the larger set
of full-length homologous integral membrane proteins,
have an especially well-conserved region near the
C-terminus with an invariant tyrosine. The function is
unknown.
Length = 483
Score = 29.2 bits (66), Expect = 6.4
Identities = 9/32 (28%), Positives = 15/32 (46%)
Query: 370 NVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKK 401
N + P K +K K K+ K++ K+K
Sbjct: 319 NAEDVLATLSLPTKAEKNKYSIKEIKRRLKEK 350
>gnl|CDD|221161 pfam11661, DUF2986, Protein of unknown function (DUF2986). This
family of proteins has no known function.
Length = 44
Score = 25.7 bits (57), Expect = 6.5
Identities = 12/26 (46%), Positives = 17/26 (65%)
Query: 390 KKKKRKKKKKKKKKKVNNSIQISSRP 415
+KKK +K KKK KK N + S++P
Sbjct: 3 RKKKINQKLKKKAKKANAKLHPSNKP 28
>gnl|CDD|237869 PRK14962, PRK14962, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 472
Score = 29.0 bits (65), Expect = 6.6
Identities = 39/168 (23%), Positives = 72/168 (42%), Gaps = 24/168 (14%)
Query: 305 GKNSDLLIHEATHE--DELEKEAALKMHSTVSQAIRIGREMR---AKFVLLTHFSQRYAK 359
GK+ ++LI +A + ++LE+E A + Q + I RE++ K ++ S A
Sbjct: 273 GKDYEVLIQQAIEDLVEDLERERANDIIQVSRQLLNILREIKFAEEKRLVCKLGSASIAT 332
Query: 360 LPRLNKDLSENVGIAFDNMRFP------------EKKKKKKKKKKKRKKKKKKKKKKVNN 407
+V DN E+K++ + +K+ K+ ++ K+K +
Sbjct: 333 RFSSPNVQENDVEEKNDNSNVQQKEKKKEESKAKEEKQEDIEFEKRFKELMEELKEKGDL 392
Query: 408 SIQIS-SRPEICREFDSKKISLANLPKLKLFYPALKAMFAEYQDEIEN 454
SI ++ S E+ + D KIS + Y +K E E+EN
Sbjct: 393 SIFVALSLSEVEEDGDKVKISF--DSSKAMHYELMK----EKLPELEN 434
>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein.
Length = 198
Score = 28.2 bits (63), Expect = 6.7
Identities = 10/25 (40%), Positives = 12/25 (48%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKV 405
K+ KKK+ RK KK KK
Sbjct: 47 SPKQPKKKRPTTPRKPATTKKSKKK 71
>gnl|CDD|202819 pfam03939, Ribosomal_L23eN, Ribosomal protein L23, N-terminal
domain. The N-terminal domain appears to be specific to
the eukaryotic ribosomal proteins L25, L23, and L23a.
Length = 52
Score = 26.1 bits (58), Expect = 6.7
Identities = 11/28 (39%), Positives = 14/28 (50%)
Query: 383 KKKKKKKKKKKRKKKKKKKKKKVNNSIQ 410
K K K KK K KKK+K+ S+
Sbjct: 2 KAKALKAKKAVLKGVHGKKKRKIRTSVT 29
>gnl|CDD|240317 PTZ00217, PTZ00217, flap endonuclease-1; Provisional.
Length = 393
Score = 28.8 bits (65), Expect = 6.9
Identities = 14/45 (31%), Positives = 18/45 (40%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKK 425
+K KK K K KKKKKK + S + + KK
Sbjct: 349 TKKPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAKSSGKKKVKK 393
>gnl|CDD|224696 COG1782, COG1782, Predicted metal-dependent RNase, consists of a
metallo-beta-lactamase domain and an RNA-binding KH
domain [General function prediction only].
Length = 637
Score = 28.8 bits (65), Expect = 6.9
Identities = 15/48 (31%), Positives = 21/48 (43%), Gaps = 6/48 (12%)
Query: 133 PDTCILLDCG----EGTYSQLVRLYGSAVDTLLSQLSAVYISHLHADH 176
P++ +LLDCG L + +L AV I+H H DH
Sbjct: 202 PESRVLLDCGVNVAGNGEDAFPYLD--VPEFQPDELDAVIITHAHLDH 247
>gnl|CDD|149438 pfam08374, Protocadherin, Protocadherin. The structure of
protocadherins is similar to that of classic cadherins
(pfam00028), but particularly on the cytoplasmic domains
they also have some unique features. They are expressed
in a variety of organisms and are found in high
concentrations in the brain where they seem to be
localised mainly at cell-cell contact sites. Their
expression seems to be developmentally regulated.
Length = 223
Score = 28.3 bits (63), Expect = 7.3
Identities = 11/18 (61%), Positives = 13/18 (72%)
Query: 381 PEKKKKKKKKKKKRKKKK 398
E K+KKKKK KK+K K
Sbjct: 87 QENKQKKKKKDKKKKSPK 104
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 28.6 bits (64), Expect = 7.4
Identities = 21/105 (20%), Positives = 41/105 (39%), Gaps = 7/105 (6%)
Query: 359 KLPRLNKDLSENVGIAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEIC 418
K L ++ + I + KKK K+KK KK+ + K + + R I
Sbjct: 203 KAASLKRNPPKKSNI-MSSF------FKKKTKEKKEKKEASESTVKEESEEESGKRDVIL 255
Query: 419 REFDSKKISLANLPKLKLFYPALKAMFAEYQDEIENRAIRRNLKQ 463
+ ++ L P+ + +E + E + + R+ LK+
Sbjct: 256 EDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKK 300
>gnl|CDD|217479 pfam03297, Ribosomal_S25, S25 ribosomal protein.
Length = 105
Score = 27.3 bits (61), Expect = 7.5
Identities = 16/39 (41%), Positives = 22/39 (56%), Gaps = 10/39 (25%)
Query: 381 PEKKKKKKKKK--------KKRKKK--KKKKKKKVNNSI 409
P KK++KK K K +KKK K K K K+NN++
Sbjct: 2 PPKKQQKKAAKAAAASAGGKAKKKKWSKGKVKDKLNNAV 40
>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
Length = 813
Score = 28.9 bits (65), Expect = 7.6
Identities = 12/28 (42%), Positives = 14/28 (50%)
Query: 376 DNMRFPEKKKKKKKKKKKRKKKKKKKKK 403
D+ EKK K K KK +K KK K
Sbjct: 766 DSAFRGEKKAKPKAAKKDARKAKKPSAK 793
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 29.1 bits (65), Expect = 7.7
Identities = 11/23 (47%), Positives = 18/23 (78%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
E+KKKK++K K+++ KK K +K
Sbjct: 19 ERKKKKEEKAKEKELKKLKAAQK 41
>gnl|CDD|225212 COG2333, ComEC, Predicted hydrolase (metallo-beta-lactamase
superfamily) [General function prediction only].
Length = 293
Score = 28.5 bits (64), Expect = 7.9
Identities = 20/83 (24%), Positives = 34/83 (40%), Gaps = 19/83 (22%)
Query: 122 GKQVLAIRGPGPDTCILLDCGEGTYSQLVR------LY--GSAVDTL----------LSQ 163
G +L +LD G+G + L+R LY G+++ + +
Sbjct: 32 GDSILIQAPSNGWKVHMLDVGQG-LATLIRSEGKTILYDTGNSMGQDVIIPYLKSLGVRK 90
Query: 164 LSAVYISHLHADHHLGLFSVIKA 186
L + ++H ADH GL V+K
Sbjct: 91 LDQLILTHPDADHIGGLDEVLKT 113
>gnl|CDD|149515 pfam08492, SRP72, SRP72 RNA-binding domain. This region has been
identified as the binding site of the SRP72 protein to
SRP RNA.
Length = 57
Score = 25.9 bits (57), Expect = 8.0
Identities = 10/18 (55%), Positives = 12/18 (66%)
Query: 380 FPEKKKKKKKKKKKRKKK 397
++KKKK KKKRK K
Sbjct: 13 PGTEQKKKKSAKKKRKPK 30
>gnl|CDD|225382 COG2826, Tra8, Transposase and inactivated derivatives, IS30 family
[DNA replication, recombination, and repair].
Length = 318
Score = 28.5 bits (64), Expect = 8.1
Identities = 9/28 (32%), Positives = 17/28 (60%)
Query: 390 KKKKRKKKKKKKKKKVNNSIQISSRPEI 417
+ K+RK++K+ + KV + I RP+
Sbjct: 130 RHKRRKRRKRPTRGKVPFGVSIEQRPKF 157
>gnl|CDD|218550 pfam05311, Baculo_PP31, Baculovirus 33KDa late protein (PP31).
Autographa californica nuclear polyhedrosis virus
(AcMNPV) pp31 is a nuclear phosphoprotein that
accumulates in the virogenic stroma, which is the viral
replication centre in the infected-cell nucleus, binds
to DNA, and serves as a late expression factor.
Length = 267
Score = 28.5 bits (64), Expect = 8.2
Identities = 10/24 (41%), Positives = 15/24 (62%)
Query: 381 PEKKKKKKKKKKKRKKKKKKKKKK 404
P +K+K KK K+ KK+K+K
Sbjct: 223 PSRKRKAPKKVAKKVAAAKKRKQK 246
>gnl|CDD|217301 pfam02956, TT_ORF1, TT viral orf 1. TT virus (TTV), isolated
initially from a Japanese patient with hepatitis of
unknown aetiology, has since been found to infect both
healthy and diseased individuals and numerous prevalence
studies have raised questions about its role in
unexplained hepatitis. ORF1 is a large 750 residue
protein. The N-terminal half of this protein corresponds
to the capsid protein.
Length = 525
Score = 28.8 bits (65), Expect = 8.5
Identities = 9/23 (39%), Positives = 10/23 (43%)
Query: 383 KKKKKKKKKKKRKKKKKKKKKKV 405
K K K R K KK K K+
Sbjct: 189 KHKILIPSLKTRPKGKKYVKIKI 211
>gnl|CDD|192292 pfam09429, Wbp11, WW domain binding protein 11. The WW domain is a
small protein module with a triple-stranded beta-sheet
fold. This is a family of WW domain binding proteins.
Length = 78
Score = 26.5 bits (59), Expect = 8.7
Identities = 10/25 (40%), Positives = 17/25 (68%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKKVN 406
+KKK+ KK K +R+ +++ K K N
Sbjct: 14 QKKKELKKNKAERQARREAKLAKKN 38
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 28.6 bits (64), Expect = 8.8
Identities = 8/23 (34%), Positives = 18/23 (78%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
E +KK+K++ + K+K++K ++K
Sbjct: 411 ENEKKQKEQADEDKEKRQKDERK 433
>gnl|CDD|225919 COG3384, COG3384, Aromatic ring-opening dioxygenase, catalytic LigB
subunit related enzyme [Amino acid transport and
metabolism].
Length = 268
Score = 28.4 bits (64), Expect = 9.0
Identities = 13/63 (20%), Positives = 27/63 (42%), Gaps = 6/63 (9%)
Query: 160 LLSQLSAVYISHLHADHHLGLFSVIKAWSRVKPECKLTLLAPRQIITWLSVYAARFESVG 219
++ + A+++SH L + + + E L P II V++A +E+ G
Sbjct: 5 KITMMPALFLSHGSPMLALEDNAATRGLRELGRELP--ELRPDTII----VFSAHWETRG 58
Query: 220 HLY 222
+
Sbjct: 59 AYH 61
>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
factor. Replicative DNA polymerases are capable of
polymerising tens of thousands of nucleotides without
dissociating from their DNA templates. The high
processivity of these polymerases is dependent upon
accessory proteins that bind to the catalytic subunit of
the polymerase or to the substrate. The Epstein-Barr
virus (EBV) BMRF1 protein is an essential component of
the viral DNA polymerase and is absolutely required for
lytic virus replication. BMRF1 is also a transactivator.
This family is predicted to have a UL42 like structure.
Length = 381
Score = 28.5 bits (64), Expect = 9.3
Identities = 10/21 (47%), Positives = 15/21 (71%)
Query: 381 PEKKKKKKKKKKKRKKKKKKK 401
K+K++ KK+K+KK KK K
Sbjct: 354 SPKRKREGDKKQKKKKSKKLK 374
>gnl|CDD|221569 pfam12420, DUF3671, Protein of unknown function. This domain
family is found in eukaryotes, and is typically between
96 and 116 amino acids in length.
Length = 106
Score = 26.9 bits (60), Expect = 9.4
Identities = 11/29 (37%), Positives = 15/29 (51%)
Query: 375 FDNMRFPEKKKKKKKKKKKRKKKKKKKKK 403
FD + + +K KK KK+ KK KK
Sbjct: 17 FDKIDYIDKLKKDPNIDKKKFKKIIFKKY 45
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 28.5 bits (64), Expect = 9.4
Identities = 17/83 (20%), Positives = 30/83 (36%), Gaps = 10/83 (12%)
Query: 378 MRFPEKKKKKKKKKKKRKKKKKKKKKKVNNSIQISSRPEICREFDSKKISLANLPKLKLF 437
K + K+K K ++ K+K++K+ + +E K LA L +LK
Sbjct: 266 YEPINKPVRPKRKTKAQRNKEKRRKELEREA----------KEEKQLKKKLAQLARLKEI 315
Query: 438 YPALKAMFAEYQDEIENRAIRRN 460
+ + E R R
Sbjct: 316 AKEVAQKEKARARKKEQRKERGE 338
>gnl|CDD|220207 pfam09368, Sas10_Utp3_C, Sas10 C-terminal domain. This family
contains a C-terminal presumed domain in Sas10 which
hash been identified as a regulator of chromatin
silencing.
Length = 76
Score = 26.4 bits (59), Expect = 9.5
Identities = 9/22 (40%), Positives = 14/22 (63%)
Query: 383 KKKKKKKKKKKRKKKKKKKKKK 404
KK ++ + KKRKK +K K+
Sbjct: 24 KKDRRNPRVKKRKKYEKAVKRL 45
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 28.4 bits (64), Expect = 9.6
Identities = 13/93 (13%), Positives = 24/93 (25%), Gaps = 2/93 (2%)
Query: 315 ATHEDELEKEAALKMHSTVSQAI--RIGREMRAKFVLLTHFSQRYAKLPRLNKDLSENVG 372
A E + + + + I + +K Q L + KD +
Sbjct: 12 AEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATE 71
Query: 373 IAFDNMRFPEKKKKKKKKKKKRKKKKKKKKKKV 405
+ K K +KK K +
Sbjct: 72 SDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSK 104
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 28.4 bits (63), Expect = 9.8
Identities = 14/23 (60%), Positives = 14/23 (60%)
Query: 382 EKKKKKKKKKKKRKKKKKKKKKK 404
EK KKKKKK K KKK KK
Sbjct: 239 EKSKKKKKKLAKNKKKLDDDKKG 261
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.317 0.132 0.383
Gapped
Lambda K H
0.267 0.0616 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 24,004,425
Number of extensions: 2319991
Number of successful extensions: 11772
Number of sequences better than 10.0: 1
Number of HSP's gapped: 9103
Number of HSP's successfully gapped: 655
Length of query: 485
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 384
Effective length of database: 6,457,848
Effective search space: 2479813632
Effective search space used: 2479813632
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 61 (27.5 bits)