RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy1088
(905 letters)
>gnl|CDD|225661 COG3119, AslA, Arylsulfatase A and related enzymes [Inorganic ion
transport and metabolism].
Length = 475
Score = 207 bits (528), Expect = 3e-58
Identities = 107/425 (25%), Positives = 168/425 (39%), Gaps = 48/425 (11%)
Query: 56 SSGPPHIIFILADDLGWNDVG-FHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSA 113
+ P+I+ I+ADDLG+ D+G + G PTPNID LA G+ N YT C PSR+A
Sbjct: 1 PAKRPNILIIMADDLGYGDLGAYGGPVVGPTPNIDRLAAEGVRFTNAYTTSPCCGPSRAA 60
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
++TG++P TG+ N GGLP L + LKE GY T + GKWHLG ++
Sbjct: 61 LLTGRYPFRTGVGGNAEPPGYPGGLPDEVPTLAELLKEAGYYTALFGKWHLGEKDEDPAG 120
Query: 174 TFRGFESHLGYWTGHQD---------YFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDV 224
GF+ G+ G D E D + + DL ++
Sbjct: 121 GDHGFDEFYGFLGGLTDEWYPELVDVPPPGDVPEFDQEEGDPYVAGKDSADLADRFRRQA 180
Query: 225 FTAE------AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAP---------DHYLNI 269
+ +P+ + + L P H
Sbjct: 181 KEDAPDKPPFLLTAPFAPPAPDPVDFEWPDEYRGFYDRAKRLDRPFFLYLAPPDPHLSRR 240
Query: 270 HRHIEDFK----------RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA 319
+ + +AA + LD+ +G++++AL++ +L N+I+VF SD+G
Sbjct: 241 LPAADGLPAEEEEDGARLMTVYAACVRYLDDQIGRLLDALKELGLLDNTIVVFTSDHGAW 300
Query: 320 AAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP-LLESRGIVAEQYVHVSDWLPTLLS 378
A P RG K TL+EGG R +I P ++ G V + V + D LPTLL
Sbjct: 301 L------GAHGTPFRGYKGTLYEGGTRVPLIIRWPGGIKPGGRVVDALVSLIDLLPTLLD 354
Query: 379 AANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYE 438
AA + ++ + + + E+ + Y E
Sbjct: 355 AAGVPPPKDLDGQSLPPKLQKPGLDGVPL----LEWLAGGKAAVREEYGYF-GELFGLRA 409
Query: 439 NRYEN 443
R ++
Sbjct: 410 IRRDD 414
Score = 30.5 bits (69), Expect = 4.6
Identities = 12/31 (38%), Positives = 14/31 (45%), Gaps = 1/31 (3%)
Query: 2 QHNVLYGCERGGLPLSEKILPQYLKELGYRT 32
+ G GGLP L + LKE GY T
Sbjct: 74 GNAEPPGYP-GGLPDEVPTLAELLKEAGYYT 103
>gnl|CDD|216172 pfam00884, Sulfatase, Sulfatase.
Length = 332
Score = 187 bits (476), Expect = 8e-53
Identities = 93/330 (28%), Positives = 145/330 (43%), Gaps = 41/330 (12%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYY-TVQLCTPSRSAIMTGK 118
P+++ +L + L D+G +G + TP +D LA G++ N+Y L PSR A++TG
Sbjct: 1 PNVVLVLGESLRAPDLGLYGYPRPTTPFLDRLAEEGLLFSNFYSGGTLTAPSRFALLTGL 60
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
P + G + GLP +E LP LK GY T +GKWHL +Y ++ GF
Sbjct: 61 PPHNFGSGVSTP-----IGLPRTEPSLPDLLKRAGYNTGAIGKWHLSWYNRQSVYKNLGF 115
Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST 238
+ G TG +D + + G S + EA++ + N+
Sbjct: 116 DKFFGRNTG-EDLYKD-------------PEDVGYNCSGGGVSDEALLDEALEFLDNN-- 159
Query: 239 DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK-------RSKFAAILHKLDES 291
D+P FL L +H P PD Y + + + + L D++
Sbjct: 160 DKPFFLVLHTMGSHG-----PPYYPDRYPEKYATFKPTSTCSEEQLLNSYDNSLLYTDDA 214
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
+G+V+E LE L N+++V+ SD+G + G + G + EGG R LI
Sbjct: 215 IGRVLEKLENGL-LDNTLVVYTSDHGPSLGG------GGYLHGGKTDNAPEGGYRVPLLI 267
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
WSP + +G E V D PT+L A
Sbjct: 268 WSPGGKPKGQKNEALVSHVDLFPTILDLAG 297
>gnl|CDD|237491 PRK13759, PRK13759, arylsulfatase; Provisional.
Length = 485
Score = 123 bits (311), Expect = 1e-29
Identities = 100/399 (25%), Positives = 163/399 (40%), Gaps = 93/399 (23%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSA 113
+ P+II I+ D + + +G +G + TPN+D LA G +N Y+ V CTP+R+A
Sbjct: 2 VQTKKPNIILIMVDQMRGDCLGCNGNKAVETPNLDMLASEGYNFENAYSAVPSCTPARAA 61
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWH-------LGF 166
++TG H G G + LPQ ++ GY T+ +GK H LGF
Sbjct: 62 LLTGLSQWHHGR-----VGYGDVVPWNYKNTLPQEFRDAGYYTQCIGKMHVFPQRNLLGF 116
Query: 167 YK-----------KEYTPTFRGFESHLGYWTGHQ------DYFDHSAEEMKMWGLDMRRD 209
+ + + F S W + D D G D
Sbjct: 117 HNVLLHDGYLHSGRNEDKSQFDFVSDYLAWLREKAPGKDPDLTDI--------GWDCNSW 168
Query: 210 LEPAWDLHGKYSTDVFTA-EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLN 268
+ WDL + + E+++ + +P FL ++ A HS PY+P P Y +
Sbjct: 169 VARPWDLEERLHPTNWVGSESIEFLRRRDPTKPFFLKMSFARPHS--PYDP---PKRYFD 223
Query: 269 IHRHI--------------------------------EDFKRSK---FAAILHKLDESVG 293
+++ E +R++ + I H +D +G
Sbjct: 224 MYKDADIPDPHIGDWEYAEDQDPEGGSIDALRGNLGEEYARRARAAYYGLITH-IDHQIG 282
Query: 294 KVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWS 353
+ ++AL++ +L N+II+FVSD+G L R K +EG +I+
Sbjct: 283 RFLQALKEFGLLDNTIILFVSDHGDMLGDHYL-------FR--KGYPYEGSAHIPFIIYD 333
Query: 354 P---LLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYV 389
P L +RG V +Q V + D +PTLL A + IP+ V
Sbjct: 334 PGGLLAGNRGTVIDQVVELRDIMPTLLDLAGGT-IPDDV 371
>gnl|CDD|234202 TIGR03417, chol_sulfatase, choline-sulfatase.
Length = 500
Score = 87.1 bits (216), Expect = 1e-17
Identities = 92/369 (24%), Positives = 143/369 (38%), Gaps = 75/369 (20%)
Query: 60 PHIIFILADDLGWNDVGFHG-LDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTG 117
P+I+ I+AD L + +G + PN+ LA ++ N Y LC PSR++ M+G
Sbjct: 3 PNILIIMADQLNGTLLPDYGPARWLHAPNLKRLAARSVVFDNAYCASPLCAPSRASFMSG 62
Query: 118 KHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHL-------GF---- 166
+ P TG N YL+ GYRT + GK H GF
Sbjct: 63 QLPSRTGAYDNA------AEFASDIPTYAHYLRRAGYRTALSGKMHFCGPDQLHGFEERL 116
Query: 167 ----YKKEY--TPTFRGFESHLGYW--------------TGHQDYFDHSA--EEMKMWGL 204
Y ++ TP +R + ++ T DY D A K++ L
Sbjct: 117 TTDIYPADFGWTPDWRKPGERIDWYHNMGSVTGAGPCERTNQLDYDDEVAFHARQKLYDL 176
Query: 205 DMRRDLEPAWDL-------HGKYSTDVFTAEAVDIIHNHSTDEP-LFLYLAHAATHS--- 253
+D P + L H Y V + D+ + P + + A HS
Sbjct: 177 ARGKDDRP-FCLTVSFTHPHDPY---VIRRKYWDLYEDCEILMPEVAIPYAEQDPHSQRL 232
Query: 254 --ANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIV 311
A D I +R+ F AI + +D+ +G++++ LE+ R ++IIV
Sbjct: 233 LDACDLRNFPITD------EQIRRARRAYFGAISY-IDDKIGELLDTLEETRQADDTIIV 285
Query: 312 FVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSD 371
F SD+G L W K + +EG R +I +P + G V + D
Sbjct: 286 FTSDHGDM-----LGERGLW----YKMSFFEGSARVPLMIAAPGRFAPGRVDAPVSTL-D 335
Query: 372 WLPTLLSAA 380
LPTL+ A
Sbjct: 336 LLPTLVDLA 344
Score = 36.2 bits (84), Expect = 0.088
Identities = 16/41 (39%), Positives = 20/41 (48%), Gaps = 3/41 (7%)
Query: 658 SIQCGPVKEVPCEPQIAPC-LFDIKNDPCEKNNLADRSEDQ 697
I+ G K V CE P L+D++ DP E NLAD
Sbjct: 388 MIRRGRYKFVYCE--ADPDQLYDLEADPHELTNLADDPAHA 426
Score = 34.7 bits (80), Expect = 0.23
Identities = 16/39 (41%), Positives = 20/39 (51%), Gaps = 3/39 (7%)
Query: 764 SIQCGPVKEVPCEPQIAPC-LFDIKNDPCEKNNLADRSE 801
I+ G K V CE P L+D++ DP E NLAD
Sbjct: 388 MIRRGRYKFVYCE--ADPDQLYDLEADPHELTNLADDPA 424
>gnl|CDD|224287 COG1368, MdoB, Phosphoglycerol transferase and related proteins,
alkaline phosphatase superfamily [Cell envelope
biogenesis, outer membrane].
Length = 650
Score = 70.5 bits (173), Expect = 2e-12
Identities = 55/301 (18%), Positives = 106/301 (35%), Gaps = 43/301 (14%)
Query: 145 LPQYLKELGYRTRIVGKWHLGFYK-KEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWG 203
LP LK+ GY+T + F+ K + F GF+ + FD +A+ WG
Sbjct: 345 LPAILKQQGYKTAALHGGDGSFWNRKSFYKIF-GFDDFFD-----LESFDGNADSEIGWG 398
Query: 204 LDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAP 263
L S E++ ++ +P F ++ + H P+E +
Sbjct: 399 L----------------SDKDLFKESLPLLKKL--KKPFFSFVITLSNHG--PFELPEGK 438
Query: 264 DHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGF 323
+ L + + +H DE++G+ ++ L++ + NS+IV D+ G +
Sbjct: 439 RNELLEEPLSASTALANYLQAVHYADEALGQFIDKLKKSGLYKNSVIVLYGDHYGISGNQ 498
Query: 324 NLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
NL G K+ + R LI +P ++++ + + D PT+L S
Sbjct: 499 NLAMPK---FLG-KSYDIDMLQRVPLLIHAPGIKNKKKIDTVGGQL-DIAPTILGLLGIS 553
Query: 384 ---------DIPNYVNSTVENIIPRYENSILRYENGTHEY--NSPRIENSNTRYENGTHE 432
D+ V + Y G + E + + + +
Sbjct: 554 TKSYAFFGRDLLGDEPYKVPFRNGSFGTDAKGYSVGDNRMYDTRTGNEILDLQLDKVKPK 613
Query: 433 Y 433
Sbjct: 614 K 614
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This
family consists of several bovine specific leukaemia
virus receptors which are thought to function as
transmembrane proteins, although their exact function is
unknown.
Length = 561
Score = 56.6 bits (136), Expect = 4e-08
Identities = 43/154 (27%), Positives = 75/154 (48%), Gaps = 15/154 (9%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCE 776
K EEE + +++ +K K++KKK++K+K+ +++ +S E + A + +E+P
Sbjct: 83 KLEEERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDEDIAPAQMVDI-VTEEMP-- 139
Query: 777 PQIAPCLFDIK--NDPCEKNN------LADRSEVQRINHYTTEVGYLDPKQRFNQIAYLD 828
P D K NDP + LAD ++ H E K
Sbjct: 140 ENALPSDEDDKDPNDPYRALDIDLDKPLADSEKLPVQKHRNAET----SKSPEKGDVPAV 195
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVL 862
++K KK KKK+KK+K+K++ K KK +G+ +L
Sbjct: 196 EKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLL 229
Score = 54.7 bits (131), Expect = 2e-07
Identities = 35/138 (25%), Positives = 58/138 (42%), Gaps = 22/138 (15%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVP 774
P+K + +KK KK KKK+KK+K+K++ K KKK + + D + V E
Sbjct: 187 PEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDSPASAASVAEAD 246
Query: 775 CEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKK 834
+ ++ A+ E ++ K KKKK
Sbjct: 247 EASLANTVSGTAPDSEPDEPKDAEAEETKKSP----------------------KHKKKK 284
Query: 835 KKKKKKKKKKKKKKKKKK 852
++K+K++KKKKKK +
Sbjct: 285 QRKEKEEKKKKKKHHHHR 302
Score = 50.1 bits (119), Expect = 4e-06
Identities = 38/142 (26%), Positives = 63/142 (44%), Gaps = 23/142 (16%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVP 774
E+ + +KK KK KKK+KK+K+K++ K E EG + L
Sbjct: 184 SKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSL--------------- 228
Query: 775 CEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKK 834
L + + P ++A+ E N + +P + + A K+ K
Sbjct: 229 --------LLALDDSPASAASVAEADEASLANTVSGTAPDSEPDEPKDAEAEETKKSPKH 280
Query: 835 KKKKKKKKKKKKKKKKKKMMKK 856
KKKK++K+K++KKKKKK +
Sbjct: 281 KKKKQRKEKEEKKKKKKHHHHR 302
Score = 38.9 bits (90), Expect = 0.013
Identities = 30/102 (29%), Positives = 44/102 (43%), Gaps = 3/102 (2%)
Query: 677 LFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKK 736
L + + P ++A+ E N + D E EE KK K KKKK++K+
Sbjct: 229 LLALDDSPASAASVAEADEASLANTVSGTAPDSEPDEPKDAEAEETKKSPKHKKKKQRKE 288
Query: 737 KKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCEPQ 778
K++KKKKKK + +Q G V+E P P
Sbjct: 289 KEEKKKKKKHHHHRCHH---SDGGAEQPVQNGAVEEEPLPPM 327
Score = 33.1 bits (75), Expect = 0.72
Identities = 10/35 (28%), Positives = 27/35 (77%)
Query: 822 NQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
+Q L++E++ +++ +K K++KKK++K+K+ ++
Sbjct: 79 DQYVKLEEERRHRQRLEKDKREKKKREKEKRGRRR 113
Score = 31.2 bits (70), Expect = 3.1
Identities = 9/42 (21%), Positives = 32/42 (76%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELA 870
+E+++ +++ +K K++KKK++K+K ++ + + ++ ++++A
Sbjct: 85 EEERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDEDIA 126
>gnl|CDD|216635 pfam01663, Phosphodiest, Type I phosphodiesterase / nucleotide
pyrophosphatase. This family consists of
phosphodiesterases, including human plasma-cell membrane
glycoprotein PC-1 / alkaline phosphodiesterase i /
nucleotide pyrophosphatase (nppase). These enzymes
catalyze the cleavage of phosphodiester and
phosphosulfate bonds in NAD, deoxynucleotides and
nucleotide sugars. Also in this family is ATX an
autotaxin, tumour cell motility-stimulating protein
which exhibits type I phosphodiesterases activity. The
alignment encompasses the active site. Also present with
in this family is 60-kDa Ca2+-ATPase form F. odoratum.
Length = 342
Score = 51.6 bits (124), Expect = 9e-07
Identities = 52/267 (19%), Positives = 95/267 (35%), Gaps = 51/267 (19%)
Query: 63 IFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV--QLCTPSRSAIMTGKHP 120
+ I D + + TPN+ ALA G+ V L P+ I+TG +P
Sbjct: 2 LVISLDGFRADYLDRLAGL---TPNLAALAKEGVSAPYLTPVFPTLTFPNHYTIVTGLYP 58
Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE---YTPTFRG 177
G+ N Y P+ LKE + ++ G + T +G
Sbjct: 59 GSHGIVGNTWYD-------------PKRLKESTFWDQLP---ESGDGDPKPIWVTAKKQG 102
Query: 178 FESH-LGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNH 236
++ L + H + ++ + D+ + D + +
Sbjct: 103 LKAAALFWPGSHAAKPGYGPLVSELPDRLDAYNGSVPLDVITEPRVDTVLTDWLK--LLL 160
Query: 237 STDEP--LFLYLAH--AATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
+ P L +YL H P P +ED L ++D ++
Sbjct: 161 DAERPDLLLVYLEEPDRVGHKYGPDSP------------EVEDA--------LRRVDRAI 200
Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGA 319
G+++EAL++R +L N+ ++ VSD+G
Sbjct: 201 GRLLEALDERGLLENTNVIVVSDHGMT 227
>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
Length = 434
Score = 50.3 bits (121), Expect = 3e-06
Identities = 14/38 (36%), Positives = 22/38 (57%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
+ EKK+K+K+K K KK+ + K K++K S E
Sbjct: 396 LAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSE 433
Score = 49.9 bits (120), Expect = 4e-06
Identities = 13/35 (37%), Positives = 21/35 (60%)
Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
E+K+K+K+K K KK+ + K K++K EE
Sbjct: 400 AEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434
Score = 49.2 bits (118), Expect = 7e-06
Identities = 13/37 (35%), Positives = 24/37 (64%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE 753
K E+++K+K+K K KK+ + K K++K ++EE
Sbjct: 398 KRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434
Score = 47.2 bits (113), Expect = 3e-05
Identities = 12/34 (35%), Positives = 21/34 (61%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
P K+ ++ +KK+K+K+K K KK+ + K K
Sbjct: 391 PSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGK 424
Score = 45.7 bits (109), Expect = 8e-05
Identities = 11/33 (33%), Positives = 20/33 (60%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
K ++ K+ +KK+K+K+K K KK+ + K
Sbjct: 390 KPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNI 422
Score = 45.7 bits (109), Expect = 8e-05
Identities = 11/32 (34%), Positives = 19/32 (59%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
K + +KK+K+K+K K KK+ + K K+
Sbjct: 394 KVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKR 425
Score = 45.7 bits (109), Expect = 9e-05
Identities = 13/36 (36%), Positives = 23/36 (63%)
Query: 713 AYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
A +K+ + KK K+ +KK+K+K+K K KK+ +
Sbjct: 382 APSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHR 417
Score = 45.3 bits (108), Expect = 1e-04
Identities = 13/33 (39%), Positives = 21/33 (63%)
Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYP 859
L K +KK+K+K+K K KK+ + K + K+ P
Sbjct: 396 LAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKP 428
Score = 44.9 bits (107), Expect = 1e-04
Identities = 15/52 (28%), Positives = 28/52 (53%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQ 766
P +++ + KK K+ +KK+K+K+K K KK+ + G R+ S +
Sbjct: 383 PSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434
Score = 44.9 bits (107), Expect = 1e-04
Identities = 14/34 (41%), Positives = 20/34 (58%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
P + EKK K KK K+ +KK+K+K+K K
Sbjct: 378 PKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPK 411
Score = 44.5 bits (106), Expect = 2e-04
Identities = 13/32 (40%), Positives = 19/32 (59%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
K E+K K KK K+ +KK+K+K+K K
Sbjct: 381 KAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKV 412
Score = 44.5 bits (106), Expect = 2e-04
Identities = 11/37 (29%), Positives = 21/37 (56%)
Query: 711 QIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKK 747
+A +++E+EK+K K KK+ + K K++K
Sbjct: 395 VLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGT 431
Score = 42.6 bits (101), Expect = 7e-04
Identities = 11/34 (32%), Positives = 19/34 (55%)
Query: 823 QIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
+A ++K+K+K+K K KK+ + K K K
Sbjct: 395 VLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKP 428
Score = 42.2 bits (100), Expect = 0.001
Identities = 17/61 (27%), Positives = 27/61 (44%), Gaps = 6/61 (9%)
Query: 802 VQRINHYTTE------VGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMK 855
+ +I Y E + L PK + K KK K+ +KK+K+K+K K K
Sbjct: 357 LGKIERYIEEPLKARVIDELRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRH 416
Query: 856 K 856
+
Sbjct: 417 R 417
Score = 41.9 bits (99), Expect = 0.001
Identities = 11/28 (39%), Positives = 18/28 (64%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
K K+ +KK+K+K+K K KK+ + K
Sbjct: 394 KVLAKRAEKKEKEKEKPKVKKRHRDTKN 421
Score = 40.7 bits (96), Expect = 0.003
Identities = 10/29 (34%), Positives = 17/29 (58%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKG 857
K++K+K+K K KK+ + K K+ G
Sbjct: 402 KKEKEKEKPKVKKRHRDTKNIGKRRKPSG 430
Score = 34.9 bits (81), Expect = 0.17
Identities = 14/42 (33%), Positives = 20/42 (47%)
Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLR 760
+E K K +KK K KK K+ +KK +E+ K R
Sbjct: 374 DELRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKR 415
Score = 34.5 bits (80), Expect = 0.22
Identities = 11/31 (35%), Positives = 16/31 (51%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
+E K K +KK K KK K+ +KK+
Sbjct: 374 DELRPKTKAPSEKKTGKPSKKVLAKRAEKKE 404
Score = 33.4 bits (77), Expect = 0.52
Identities = 6/24 (25%), Positives = 12/24 (50%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKK 852
++ K KK+ + K K++K
Sbjct: 408 EKPKVKKRHRDTKNIGKRRKPSGT 431
Score = 32.2 bits (74), Expect = 1.2
Identities = 7/24 (29%), Positives = 12/24 (50%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKK 851
+K K KK+ + K K++K
Sbjct: 408 EKPKVKKRHRDTKNIGKRRKPSGT 431
>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
complex aNOP56 subunit; Provisional.
Length = 414
Score = 50.4 bits (121), Expect = 3e-06
Identities = 21/33 (63%), Positives = 28/33 (84%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKK 747
P K++ EEKK +K+KKKKK+KKK KK+KKK +K
Sbjct: 382 PPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 46.1 bits (110), Expect = 7e-05
Identities = 18/29 (62%), Positives = 25/29 (86%)
Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
+ E+KK +K+KKKKK+KKK KK+KKK +K
Sbjct: 386 KREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 45.7 bits (109), Expect = 7e-05
Identities = 18/35 (51%), Positives = 28/35 (80%)
Query: 714 YPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
YP +++ ++KK +K+KKKKK+KKK KK+KKK +
Sbjct: 379 YPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGR 413
Score = 43.8 bits (104), Expect = 3e-04
Identities = 20/46 (43%), Positives = 32/46 (69%), Gaps = 4/46 (8%)
Query: 716 DKEEEEEKKK----KKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMR 757
+K EE K+K KKK+++KK +K+KKKKK+KKK +++G +
Sbjct: 369 NKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 43.4 bits (103), Expect = 4e-04
Identities = 18/41 (43%), Positives = 28/41 (68%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
EE +EK K KKK+++KK +K+KKKKK+KK + + +
Sbjct: 373 EEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGR 413
Score = 41.9 bits (99), Expect = 0.001
Identities = 20/42 (47%), Positives = 29/42 (69%), Gaps = 4/42 (9%)
Query: 721 EEKKKK----KKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
EE K+K KKK+++KK +K+KKKKK+KKK ++ RK
Sbjct: 373 EEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 41.5 bits (98), Expect = 0.002
Identities = 18/29 (62%), Positives = 25/29 (86%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKG 857
K+K+++KK +K+KKKKK+KKK KK KKG
Sbjct: 384 KKKREEKKPQKRKKKKKRKKKGKKRKKKG 412
Score = 41.1 bits (97), Expect = 0.002
Identities = 16/24 (66%), Positives = 21/24 (87%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKK 852
K +K+KKKKK+KKK KK+KKK +K
Sbjct: 391 KPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 40.7 bits (96), Expect = 0.003
Identities = 17/28 (60%), Positives = 24/28 (85%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
+E+KK +K+KKKKK+KKK KK+KK +K
Sbjct: 387 REEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 40.3 bits (95), Expect = 0.003
Identities = 15/28 (53%), Positives = 22/28 (78%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
K ++KK +K+KKKKK+KKK KK+K +
Sbjct: 386 KREEKKPQKRKKKKKRKKKGKKRKKKGR 413
Score = 39.2 bits (92), Expect = 0.008
Identities = 15/29 (51%), Positives = 23/29 (79%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKG 857
K++++KK +K+KKKKK+KKK KK+ K
Sbjct: 385 KKREEKKPQKRKKKKKRKKKGKKRKKKGR 413
Score = 38.4 bits (90), Expect = 0.015
Identities = 17/40 (42%), Positives = 26/40 (65%)
Query: 818 KQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKG 857
+R +I + KKK+++KK +K+KKKKK+KK KK
Sbjct: 369 NKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKR 408
Score = 36.9 bits (86), Expect = 0.046
Identities = 14/23 (60%), Positives = 20/23 (86%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKK 850
+++KKKKK+KKK KK+KKK +K
Sbjct: 392 PQKRKKKKKRKKKGKKRKKKGRK 414
Score = 34.9 bits (81), Expect = 0.16
Identities = 20/59 (33%), Positives = 36/59 (61%), Gaps = 8/59 (13%)
Query: 707 GRFNQIAYPDKEEEEEKKK----KKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRD 761
GR+ I D+ +EE K+ K+K K KKK+++KK +K+KKK +++G ++ +
Sbjct: 357 GRY--IG--DELKEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKK 411
>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19.
Med19 represents a family of conserved proteins which
are members of the multi-protein co-activator Mediator
complex. Mediator is required for activation of RNA
polymerase II transcription by DNA binding
transactivators.
Length = 178
Score = 47.9 bits (114), Expect = 4e-06
Identities = 21/36 (58%), Positives = 29/36 (80%)
Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGM 756
E+K KKKK + K++KKKKK+KKKKKK++S E G+
Sbjct: 141 EKKHKKKKHEDDKERKKKKKEKKKKKKRHSPEHPGV 176
Score = 37.9 bits (88), Expect = 0.008
Identities = 17/27 (62%), Positives = 21/27 (77%)
Query: 830 EKKKKKKKKKKKKKKKKKKKKKKMMKK 856
EKK KKKK + K++KKKKK+KK KK
Sbjct: 141 EKKHKKKKHEDDKERKKKKKEKKKKKK 167
Score = 37.9 bits (88), Expect = 0.009
Identities = 15/25 (60%), Positives = 21/25 (84%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKK 852
+K+ KKKK + K++KKKKK+KKKK
Sbjct: 141 EKKHKKKKHEDDKERKKKKKEKKKK 165
Score = 37.5 bits (87), Expect = 0.012
Identities = 16/24 (66%), Positives = 20/24 (83%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKK 852
K KKKK + K++KKKKK+KKKKK
Sbjct: 143 KHKKKKHEDDKERKKKKKEKKKKK 166
Score = 36.7 bits (85), Expect = 0.019
Identities = 16/30 (53%), Positives = 22/30 (73%)
Query: 831 KKKKKKKKKKKKKKKKKKKKKKMMKKGYPD 860
K KKKK + K++KKKKK+KKK K+ P+
Sbjct: 143 KHKKKKHEDDKERKKKKKEKKKKKKRHSPE 172
Score = 36.7 bits (85), Expect = 0.023
Identities = 17/40 (42%), Positives = 25/40 (62%)
Query: 817 PKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
P++ + L +KK KKKK + K++KKKKK+K KK
Sbjct: 127 PEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKK 166
Score = 32.9 bits (75), Expect = 0.44
Identities = 14/23 (60%), Positives = 21/23 (91%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKK 739
K+ E++K++KKKKK+KKKKKK+
Sbjct: 147 KKHEDDKERKKKKKEKKKKKKRH 169
Score = 32.9 bits (75), Expect = 0.45
Identities = 14/25 (56%), Positives = 21/25 (84%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKK 741
K+++ E K++KKKKK+KKKKKK+
Sbjct: 145 KKKKHEDDKERKKKKKEKKKKKKRH 169
Score = 32.1 bits (73), Expect = 0.79
Identities = 17/43 (39%), Positives = 22/43 (51%), Gaps = 3/43 (6%)
Query: 817 PKQRFNQIAYLDKEKKKKKKKKKKKKKK---KKKKKKKKMMKK 856
+ + D E K +KK KKKK K++KKKKK KK
Sbjct: 122 TQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKK 164
Score = 31.7 bits (72), Expect = 0.94
Identities = 14/18 (77%), Positives = 17/18 (94%)
Query: 828 DKEKKKKKKKKKKKKKKK 845
DKE+KKKKK+KKKKKK+
Sbjct: 152 DKERKKKKKEKKKKKKRH 169
Score = 31.0 bits (70), Expect = 1.6
Identities = 12/27 (44%), Positives = 20/27 (74%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMM 854
+ +K++KKKKK+KKKKKK+ + +
Sbjct: 150 EDDKERKKKKKEKKKKKKRHSPEHPGV 176
Score = 30.6 bits (69), Expect = 2.2
Identities = 15/54 (27%), Positives = 25/54 (46%)
Query: 710 NQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAA 763
+ D EE + K +KK KKKK + K++K +E+ +K R +
Sbjct: 118 KKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRHSP 171
Score = 30.2 bits (68), Expect = 3.5
Identities = 12/27 (44%), Positives = 19/27 (70%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMM 854
D +++KKKKK+KKKKKK+ + +
Sbjct: 151 DDKERKKKKKEKKKKKKRHSPEHPGVG 177
>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 142
Score = 47.0 bits (112), Expect = 4e-06
Identities = 22/48 (45%), Positives = 33/48 (68%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS 764
K ++EEK KK+ K++KKK+KKKKKKK KK E+EG + +++
Sbjct: 76 KRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSD 123
Score = 44.7 bits (106), Expect = 2e-05
Identities = 20/40 (50%), Positives = 30/40 (75%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
+K+ ++E+K KK+ K++KKK+KKKKKKK KK EE+
Sbjct: 74 EKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKE 113
Score = 39.6 bits (93), Expect = 0.001
Identities = 18/43 (41%), Positives = 29/43 (67%)
Query: 713 AYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
+ K EE+++K ++K KK+ K++KKK+KKKKKKK +
Sbjct: 67 EFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKK 109
Score = 38.1 bits (89), Expect = 0.005
Identities = 17/35 (48%), Positives = 24/35 (68%)
Query: 818 KQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKK 852
K+ + +K KK+ K++KKK+KKKKKKK KK
Sbjct: 71 KREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKK 105
Score = 37.7 bits (88), Expect = 0.006
Identities = 17/39 (43%), Positives = 28/39 (71%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
+EE++ K ++K KK+ K++KKK+KKKKKKK ++
Sbjct: 72 REEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKE 110
Score = 35.8 bits (83), Expect = 0.027
Identities = 17/39 (43%), Positives = 24/39 (61%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
K + +KKK+KKKKKKK KK KK++K+ K +
Sbjct: 86 KRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDE 124
Score = 34.3 bits (79), Expect = 0.11
Identities = 16/27 (59%), Positives = 21/27 (77%)
Query: 830 EKKKKKKKKKKKKKKKKKKKKKKMMKK 856
E+K KK+ K++KKK+KKKKKKK K
Sbjct: 80 EEKTAKKRAKRQKKKQKKKKKKKAKKG 106
Score = 33.5 bits (77), Expect = 0.20
Identities = 15/23 (65%), Positives = 19/23 (82%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKK 851
+ K++KKK+KKKKKKK KK KK
Sbjct: 87 RAKRQKKKQKKKKKKKAKKGNKK 109
Score = 32.7 bits (75), Expect = 0.35
Identities = 14/30 (46%), Positives = 23/30 (76%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKKG 857
+K++K ++K KK+ K++KKK+KKK KK
Sbjct: 74 EKKRKDEEKTAKKRAKRQKKKQKKKKKKKA 103
Score = 32.7 bits (75), Expect = 0.37
Identities = 19/48 (39%), Positives = 34/48 (70%), Gaps = 5/48 (10%)
Query: 716 DKEEEEEKKKKKKKKKK-----KKKKKKKKKKKKKKKKYSNEEEGMRK 758
+ E+EE ++K+++KK+K KK+ K++KKK+KKKK ++G +K
Sbjct: 62 ETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKK 109
Score = 32.3 bits (74), Expect = 0.41
Identities = 18/40 (45%), Positives = 25/40 (62%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKE 868
K +KKK+KKKKKKK KK KK++K K + + E+
Sbjct: 89 KRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDEEEEG 128
Score = 31.6 bits (72), Expect = 0.82
Identities = 20/51 (39%), Positives = 33/51 (64%), Gaps = 13/51 (25%)
Query: 717 KEEEEEKKKKKKKKKKKKK-------------KKKKKKKKKKKKKYSNEEE 754
K+E E+++ ++K+++KK+K KKK+KKKKKKK K N++E
Sbjct: 60 KKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKE 110
Score = 30.8 bits (70), Expect = 1.3
Identities = 17/35 (48%), Positives = 27/35 (77%)
Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
+++K+KKKKKKK KK KK++K+ K + S++EE
Sbjct: 91 QKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDEE 125
Score = 28.9 bits (65), Expect = 6.3
Identities = 17/38 (44%), Positives = 27/38 (71%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
+++++KKKKKKK KK KK++K+ K ++ EEEG
Sbjct: 91 QKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDEEEEG 128
Score = 28.9 bits (65), Expect = 7.1
Identities = 18/77 (23%), Positives = 37/77 (48%), Gaps = 9/77 (11%)
Query: 818 KQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINRTAV 877
K+ ++ + K ++KK+K ++K KK+ K++KKK KK +K+ N+
Sbjct: 61 KETEDEE-FQQKREEKKRKDEEKTAKKRAKRQKKKQKKK--------KKKKAKKGNKKEE 111
Query: 878 APINKPFDKGGDPKNFD 894
+K ++ D +
Sbjct: 112 KEGSKSSEESSDEEEEG 128
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 47.0 bits (112), Expect = 8e-06
Identities = 20/46 (43%), Positives = 30/46 (65%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRD 761
E+E E ++++KK+KKKKK+ KK+KK+KK KK E K +
Sbjct: 142 KVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKK 187
Score = 44.7 bits (106), Expect = 5e-05
Identities = 20/32 (62%), Positives = 25/32 (78%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
KE ++EKK+KK KK+K + K KKKKKKKKK
Sbjct: 161 KEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKK 192
Score = 44.7 bits (106), Expect = 6e-05
Identities = 19/31 (61%), Positives = 24/31 (77%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKK 746
K+E++EKK KK+K + K KKKKKKKKKK
Sbjct: 163 VKKEKKEKKDKKEKMVEPKGSKKKKKKKKKK 193
Score = 44.3 bits (105), Expect = 7e-05
Identities = 20/39 (51%), Positives = 27/39 (69%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE 753
+ E EEE+KK+KKKKK+ KK+KK+KK KK+K
Sbjct: 145 KEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGS 183
Score = 43.6 bits (103), Expect = 1e-04
Identities = 18/31 (58%), Positives = 23/31 (74%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKK 745
KE++E+K KK+K + K KKKKKKKKKK
Sbjct: 163 VKKEKKEKKDKKEKMVEPKGSKKKKKKKKKK 193
Score = 43.2 bits (102), Expect = 2e-04
Identities = 22/36 (61%), Positives = 28/36 (77%), Gaps = 4/36 (11%)
Query: 717 KEEEEEKKKKKKKKKKKKK----KKKKKKKKKKKKK 748
K+++E KK+KK+KK KK+K K KKKKKKKKKK
Sbjct: 158 KKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKKK 193
Score = 43.2 bits (102), Expect = 2e-04
Identities = 18/33 (54%), Positives = 25/33 (75%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
K+E +++KK+KK KK+K + K KKKKKKKK
Sbjct: 159 KKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKK 191
Score = 42.4 bits (100), Expect = 3e-04
Identities = 20/48 (41%), Positives = 34/48 (70%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS 764
++E E ++++KK+KKKKK+ KK+KK+KK KK+ E +G +K +
Sbjct: 144 EKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKK 191
Score = 40.9 bits (96), Expect = 0.001
Identities = 20/44 (45%), Positives = 33/44 (75%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
+EEE+++KKKKK+ KK+KK+KK KK+K + K S +++ +K
Sbjct: 148 EVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKK 191
Score = 38.9 bits (91), Expect = 0.005
Identities = 17/28 (60%), Positives = 21/28 (75%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
KEKKKKK+ KK+KK+KK KK+K K
Sbjct: 155 KEKKKKKEVKKEKKEKKDKKEKMVEPKG 182
Score = 38.2 bits (89), Expect = 0.008
Identities = 17/29 (58%), Positives = 25/29 (86%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
++EKK+KKKKK+ KK+KK+KK KK+ M +
Sbjct: 151 EEEKKEKKKKKEVKKEKKEKKDKKEKMVE 179
Score = 37.8 bits (88), Expect = 0.012
Identities = 15/25 (60%), Positives = 19/25 (76%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKK 852
+K+ KK+K + K KKKKKKKKKK
Sbjct: 169 EKKDKKEKMVEPKGSKKKKKKKKKK 193
Score = 37.4 bits (87), Expect = 0.016
Identities = 16/25 (64%), Positives = 23/25 (92%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKK 852
++E+KK+KKKKK+ KK+KK+KK KK
Sbjct: 150 EEEEKKEKKKKKEVKKEKKEKKDKK 174
Score = 37.0 bits (86), Expect = 0.018
Identities = 14/28 (50%), Positives = 20/28 (71%)
Query: 713 AYPDKEEEEEKKKKKKKKKKKKKKKKKK 740
+K++++EK + K KKKKKKKKKK
Sbjct: 166 EKKEKKDKKEKMVEPKGSKKKKKKKKKK 193
Score = 37.0 bits (86), Expect = 0.019
Identities = 16/29 (55%), Positives = 26/29 (89%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
+++K+KKKKK+ KK+KK+KK KK+KM++
Sbjct: 152 EEKKEKKKKKEVKKEKKEKKDKKEKMVEP 180
Score = 36.2 bits (84), Expect = 0.033
Identities = 14/29 (48%), Positives = 21/29 (72%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
+K+KKK+ KK+KK+KK KK+K + K
Sbjct: 156 EKKKKKEVKKEKKEKKDKKEKMVEPKGSK 184
Score = 35.8 bits (83), Expect = 0.049
Identities = 12/34 (35%), Positives = 22/34 (64%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
E E +K+ K +K+ + ++++KK+KKKKK
Sbjct: 128 LGSESETSEKETTAKVEKEAEVEEEEKKEKKKKK 161
Score = 34.7 bits (80), Expect = 0.12
Identities = 12/33 (36%), Positives = 23/33 (69%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
+ E EK+ K +K+ + ++++KK+KKKKK+
Sbjct: 130 SESETSEKETTAKVEKEAEVEEEEKKEKKKKKE 162
Score = 33.9 bits (78), Expect = 0.19
Identities = 14/26 (53%), Positives = 22/26 (84%)
Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKK 852
+ E+++KK+KKKKK+ KK+KK+KK
Sbjct: 147 AEVEEEEKKEKKKKKEVKKEKKEKKD 172
Score = 33.9 bits (78), Expect = 0.23
Identities = 18/31 (58%), Positives = 22/31 (70%), Gaps = 7/31 (22%)
Query: 829 KEKKKKKKKKKKKKK-------KKKKKKKKK 852
+ KK+KK+KK KK+K KKKKKKKKK
Sbjct: 162 EVKKEKKEKKDKKEKMVEPKGSKKKKKKKKK 192
Score = 33.5 bits (77), Expect = 0.28
Identities = 15/29 (51%), Positives = 24/29 (82%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
+ E ++++KK+KKKKK+ KK+KK+K KK
Sbjct: 146 EAEVEEEEKKEKKKKKEVKKEKKEKKDKK 174
Score = 32.8 bits (75), Expect = 0.50
Identities = 11/33 (33%), Positives = 22/33 (66%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPD 860
+ K +K+ + ++++KK+KKKKK +KK +
Sbjct: 137 KETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKE 169
Score = 30.8 bits (70), Expect = 1.9
Identities = 12/30 (40%), Positives = 22/30 (73%)
Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
EK+ + ++++KK+KKKKK+ KK+ +K
Sbjct: 141 AKVEKEAEVEEEEKKEKKKKKEVKKEKKEK 170
Score = 30.8 bits (70), Expect = 1.9
Identities = 14/28 (50%), Positives = 22/28 (78%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
KE + ++++KK+KKKKK+ KK+KK K
Sbjct: 145 KEAEVEEEEKKEKKKKKEVKKEKKEKKD 172
Score = 30.5 bits (69), Expect = 2.9
Identities = 12/28 (42%), Positives = 23/28 (82%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
+++ + ++++KK+KKKKK+ KK+K KK
Sbjct: 144 EKEAEVEEEEKKEKKKKKEVKKEKKEKK 171
Score = 29.7 bits (67), Expect = 4.7
Identities = 12/29 (41%), Positives = 22/29 (75%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKG 857
K +K+ + ++++KK+KKKKK+ KK K+
Sbjct: 142 KVEKEAEVEEEEKKEKKKKKEVKKEKKEK 170
Score = 29.3 bits (66), Expect = 7.5
Identities = 9/34 (26%), Positives = 21/34 (61%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
P + E + +K+ K +K+ + ++++KK+KK
Sbjct: 125 PSELGSESETSEKETTAKVEKEAEVEEEEKKEKK 158
>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
Length = 135
Score = 45.1 bits (107), Expect = 2e-05
Identities = 21/30 (70%), Positives = 25/30 (83%)
Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
+E+ K+KK+KKKKKKKKKKK KK KKKK
Sbjct: 106 KEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 44.3 bits (105), Expect = 3e-05
Identities = 21/30 (70%), Positives = 25/30 (83%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKK 746
KE+ ++KK+KKKKKKKKKKK KK KKKK
Sbjct: 106 KEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 43.9 bits (104), Expect = 4e-05
Identities = 21/32 (65%), Positives = 25/32 (78%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
K+ +EK K+KK+KKKKKKKKKKK KK KK
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKK 133
Score = 43.5 bits (103), Expect = 5e-05
Identities = 20/31 (64%), Positives = 25/31 (80%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKK 747
+E+ ++KK+KKKKKKKKKKK KK KKKK
Sbjct: 105 LKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 41.6 bits (98), Expect = 3e-04
Identities = 21/28 (75%), Positives = 23/28 (82%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
KEK K+KK+KKKKKKKKKKK KK KK
Sbjct: 106 KEKAKQKKQKKKKKKKKKKKTSKKAAKK 133
Score = 41.2 bits (97), Expect = 4e-04
Identities = 19/29 (65%), Positives = 22/29 (75%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
+K K+KK+KKKKKKKKKKK KK KK
Sbjct: 107 EKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 40.4 bits (95), Expect = 7e-04
Identities = 20/31 (64%), Positives = 25/31 (80%), Gaps = 2/31 (6%)
Query: 720 EEEK--KKKKKKKKKKKKKKKKKKKKKKKKK 748
EE+K K+ K+K K+KK+KKKKKKKKKKK
Sbjct: 96 EEQKIVKQVLKEKAKQKKQKKKKKKKKKKKT 126
Score = 40.1 bits (94), Expect = 8e-04
Identities = 18/29 (62%), Positives = 24/29 (82%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKK 744
+K +++++KKKKKKKKKKK KK KKKK
Sbjct: 107 EKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 39.7 bits (93), Expect = 0.001
Identities = 19/28 (67%), Positives = 23/28 (82%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
K+ K+K K+KK+KKKKKKKKKKK KK
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKK 129
Score = 38.5 bits (90), Expect = 0.003
Identities = 21/34 (61%), Positives = 23/34 (67%)
Query: 818 KQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKK 851
KQ + A K+KKKKKKKKKKK KK KKKK
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 38.1 bits (89), Expect = 0.004
Identities = 19/30 (63%), Positives = 23/30 (76%)
Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
L ++ K+KK+KKKKKKKKKKK KK KK
Sbjct: 105 LKEKAKQKKQKKKKKKKKKKKTSKKAAKKK 134
Score = 37.4 bits (87), Expect = 0.007
Identities = 17/29 (58%), Positives = 21/29 (72%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKG 857
++K K+KK+KKKKKKKKKKK K K
Sbjct: 105 LKEKAKQKKQKKKKKKKKKKKTSKKAAKK 133
Score = 35.8 bits (83), Expect = 0.029
Identities = 19/35 (54%), Positives = 26/35 (74%), Gaps = 2/35 (5%)
Query: 825 AYLDKEKK--KKKKKKKKKKKKKKKKKKKKMMKKG 857
A+L +E+K K+ K+K K+KK+KKKKKKK KK
Sbjct: 92 AFLVEEQKIVKQVLKEKAKQKKQKKKKKKKKKKKT 126
Score = 34.7 bits (80), Expect = 0.056
Identities = 16/32 (50%), Positives = 22/32 (68%)
Query: 722 EKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE 753
E++K K+ K+K K+KK+KKKKKKKK
Sbjct: 96 EEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTS 127
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 46.2 bits (110), Expect = 3e-05
Identities = 36/140 (25%), Positives = 63/140 (45%), Gaps = 28/140 (20%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVP 774
+ E+E +++++ KKKK+ K K K+ KKKKKK K AA+ + E
Sbjct: 72 EEGEKELQREERLKKKKRVKTKAYKEPTKKKKKKDPTA----AKSPKAAAPRPKKKSERI 127
Query: 775 CEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKK 834
+ + P K++ RS + T +R KE++ +
Sbjct: 128 SW------APTLLDSPRRKSS---RSSTVQNKEAT--------HERL-------KEREIR 163
Query: 835 KKKKKKKKKKKKKKKKKKMM 854
+KK + K +K+K+KKK+K +
Sbjct: 164 RKKIQAKARKRKEKKKEKEL 183
Score = 32.7 bits (75), Expect = 0.71
Identities = 17/47 (36%), Positives = 31/47 (65%), Gaps = 3/47 (6%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAA 763
KE E+ K+++ ++KK + K +K+K+KKK+K +EE +L +A
Sbjct: 150 KEATHERLKEREIRRKKIQAKARKRKEKKKEKELTQEE---RLAEAK 193
Score = 31.2 bits (71), Expect = 1.9
Identities = 11/30 (36%), Positives = 20/30 (66%), Gaps = 1/30 (3%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKK-KKKMMKK 856
+ EK+ +++++ KKKK+ K K K+ KK
Sbjct: 73 EGEKELQREERLKKKKRVKTKAYKEPTKKK 102
Score = 31.2 bits (71), Expect = 2.2
Identities = 11/29 (37%), Positives = 19/29 (65%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
+ +++++ KKKK+ K K K+ KK KK
Sbjct: 77 ELQREERLKKKKRVKTKAYKEPTKKKKKK 105
Score = 31.2 bits (71), Expect = 2.2
Identities = 14/37 (37%), Positives = 18/37 (48%)
Query: 820 RFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
+ Q K+KK+ K K K+ KKKKKK K
Sbjct: 76 KELQREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKS 112
Score = 29.7 bits (67), Expect = 6.8
Identities = 9/29 (31%), Positives = 20/29 (68%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
D+E+ +K+ +++++ KKKK+ K K +
Sbjct: 70 DEEEGEKELQREERLKKKKRVKTKAYKEP 98
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 46.2 bits (110), Expect = 3e-05
Identities = 22/49 (44%), Positives = 34/49 (69%), Gaps = 3/49 (6%)
Query: 716 DKEEEEEKKKKKKK---KKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRD 761
D+EE+EEK+++K K K K KK K K ++K+K K EE+G+R+L +
Sbjct: 38 DEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEE 86
Score = 42.3 bits (100), Expect = 6e-04
Identities = 16/45 (35%), Positives = 24/45 (53%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDA 762
EEE+E+K+++K K K K KK K K ++K + E K
Sbjct: 39 EEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRE 83
Score = 35.8 bits (83), Expect = 0.064
Identities = 13/44 (29%), Positives = 25/44 (56%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
K+ +E++ ++K+++K K K K KK K K +E+ R+
Sbjct: 32 DVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKRE 75
Score = 35.4 bits (82), Expect = 0.095
Identities = 11/36 (30%), Positives = 13/36 (36%), Gaps = 1/36 (2%)
Query: 716 DKEEEE-EKKKKKKKKKKKKKKKKKKKKKKKKKKYS 750
+K + E K KKKK K K K Y
Sbjct: 193 EKLKAEKAAKGGKKKKGKAKAKLNVGGANDDDDDYD 228
Score = 35.0 bits (81), Expect = 0.11
Identities = 11/33 (33%), Positives = 13/33 (39%), Gaps = 2/33 (6%)
Query: 826 YLDKEKKKKKKKKKKKKKKKKKKK--KKKMMKK 856
+K K +K K KKKK K K K
Sbjct: 191 INEKLKAEKAAKGGKKKKGKAKAKLNVGGANDD 223
Score = 34.6 bits (80), Expect = 0.18
Identities = 10/33 (30%), Positives = 13/33 (39%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
E+ + +K K KKKK K K K
Sbjct: 192 NEKLKAEKAAKGGKKKKGKAKAKLNVGGANDDD 224
Score = 34.2 bits (79), Expect = 0.23
Identities = 13/25 (52%), Positives = 14/25 (56%)
Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKK 851
L EK K +K K KKKK K K K
Sbjct: 190 LINEKLKAEKAAKGGKKKKGKAKAK 214
Score = 34.2 bits (79), Expect = 0.23
Identities = 11/42 (26%), Positives = 24/42 (57%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELA 870
K KK K K ++K+K K++K++K + + ++ ++L
Sbjct: 57 KAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLR 98
Score = 33.1 bits (76), Expect = 0.48
Identities = 11/29 (37%), Positives = 18/29 (62%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
+ E+K+++K K K K KK K K+ +K
Sbjct: 41 EDEEKEEEKAKVAAKAKAKKALKAKIEEK 69
Score = 32.3 bits (74), Expect = 0.80
Identities = 13/36 (36%), Positives = 16/36 (44%)
Query: 832 KKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEK 867
K K +K K KKKK K K K+ G D +
Sbjct: 194 KLKAEKAAKGGKKKKGKAKAKLNVGGANDDDDDYDG 229
Score = 32.3 bits (74), Expect = 0.84
Identities = 10/28 (35%), Positives = 10/28 (35%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
EK K KKKK K K K
Sbjct: 197 AEKAAKGGKKKKGKAKAKLNVGGANDDD 224
Score = 32.3 bits (74), Expect = 0.96
Identities = 9/25 (36%), Positives = 16/25 (64%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKK 852
+ K KK K K ++K+K K++K+
Sbjct: 53 AAKAKAKKALKAKIEEKEKAKREKE 77
Score = 32.3 bits (74), Expect = 0.98
Identities = 10/24 (41%), Positives = 17/24 (70%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKK 852
K K KK K K ++K+K K++K++
Sbjct: 55 KAKAKKALKAKIEEKEKAKREKEE 78
Score = 31.9 bits (73), Expect = 1.2
Identities = 11/29 (37%), Positives = 17/29 (58%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
++EK K K K KK K K ++K+ K+
Sbjct: 46 EEEKAKVAAKAKAKKALKAKIEEKEKAKR 74
Score = 31.9 bits (73), Expect = 1.2
Identities = 12/40 (30%), Positives = 19/40 (47%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKE 868
K K K KK K K ++K+K K+ ++ L + E
Sbjct: 51 KVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPE 90
Score = 31.6 bits (72), Expect = 1.6
Identities = 11/28 (39%), Positives = 15/28 (53%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
K K K K KK K K ++K+K +K
Sbjct: 49 KAKVAAKAKAKKALKAKIEEKEKAKREK 76
Score = 31.2 bits (71), Expect = 2.0
Identities = 12/34 (35%), Positives = 21/34 (61%), Gaps = 5/34 (14%)
Query: 828 DKEKKKKK-----KKKKKKKKKKKKKKKKKMMKK 856
D+EK+++K K K KK K K ++K+K ++
Sbjct: 42 DEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKRE 75
Score = 30.4 bits (69), Expect = 3.2
Identities = 8/29 (27%), Positives = 19/29 (65%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
+++++K+++K K K K KK K +++
Sbjct: 40 EEDEEKEEEKAKVAAKAKAKKALKAKIEE 68
Score = 29.6 bits (67), Expect = 7.0
Identities = 11/27 (40%), Positives = 14/27 (51%)
Query: 831 KKKKKKKKKKKKKKKKKKKKKKMMKKG 857
+K K +K K KKKK K K + G
Sbjct: 192 NEKLKAEKAAKGGKKKKGKAKAKLNVG 218
Score = 29.2 bits (66), Expect = 9.0
Identities = 14/54 (25%), Positives = 27/54 (50%), Gaps = 5/54 (9%)
Query: 716 DKEEEEEKKK-----KKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS 764
D E+E++ K ++ ++K+++K K K K KK + EE + R+
Sbjct: 25 DDEDEDDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEE 78
Score = 29.2 bits (66), Expect = 9.5
Identities = 12/35 (34%), Positives = 14/35 (40%)
Query: 727 KKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRD 761
+K K +K K KKKK K K N D
Sbjct: 192 NEKLKAEKAAKGGKKKKGKAKAKLNVGGANDDDDD 226
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 45.1 bits (107), Expect = 3e-05
Identities = 23/48 (47%), Positives = 27/48 (56%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS 764
K+E EEK+K K KKKK KKKK K K KK KK E+ K +
Sbjct: 75 KKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKL 122
Score = 44.3 bits (105), Expect = 6e-05
Identities = 22/48 (45%), Positives = 29/48 (60%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS 764
KE EE++K K KKKK KKKK K K KK KK +E++ ++ D
Sbjct: 76 KEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLE 123
Score = 42.0 bits (99), Expect = 4e-04
Identities = 19/49 (38%), Positives = 26/49 (53%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS 764
K + ++KK KKKK K K KK KK K +KK E+ + L + S
Sbjct: 82 QKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYS 130
Score = 38.5 bits (90), Expect = 0.005
Identities = 15/34 (44%), Positives = 21/34 (61%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
K ++++ KKKK K K KK KK K +KK +K
Sbjct: 83 KWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEK 116
Score = 38.1 bits (89), Expect = 0.008
Identities = 12/41 (29%), Positives = 22/41 (53%)
Query: 714 YPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
K+++++KK KK K +KK +K+ + K + S E
Sbjct: 92 KKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSET 132
Score = 36.2 bits (84), Expect = 0.033
Identities = 21/65 (32%), Positives = 26/65 (40%), Gaps = 5/65 (7%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINRTAVAPINKPFDKG 887
K K KKKK KKKK K K KK KK K + EKE + ++
Sbjct: 82 QKWKWKKKKSKKKKDKDKDKKDDKKDDKSE-----KKDEKEAEDKLEDLTKSYSETLSTL 136
Query: 888 GDPKN 892
+ K
Sbjct: 137 SELKP 141
Score = 35.8 bits (83), Expect = 0.042
Identities = 19/44 (43%), Positives = 25/44 (56%), Gaps = 4/44 (9%)
Query: 716 DKEEEEEKK----KKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
+E E+ KK K+K K KKKK KKKK K K KK +++
Sbjct: 68 AEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSE 111
Score = 35.1 bits (81), Expect = 0.084
Identities = 19/44 (43%), Positives = 28/44 (63%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
KE EE +K KK+ ++K+K K KKKK KKKK +++ +K
Sbjct: 63 KKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKK 106
Score = 34.7 bits (80), Expect = 0.11
Identities = 10/38 (26%), Positives = 21/38 (55%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNE 752
DK+++++KK K +KK +K+ + K + K +
Sbjct: 97 KDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLS 134
Score = 34.3 bits (79), Expect = 0.16
Identities = 18/56 (32%), Positives = 25/56 (44%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINRTAVAPINKPF 884
K+ KKKK K K KK KK K +K +K D L + K + T + +
Sbjct: 89 KKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSELKPRKY 144
Score = 33.9 bits (78), Expect = 0.19
Identities = 15/72 (20%), Positives = 35/72 (48%)
Query: 690 LADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
LA+ E + + + ++ + K+++++ KK KK K +KK +K+ + K +
Sbjct: 67 LAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLT 126
Query: 750 SNEEEGMRKLRD 761
+ E + L +
Sbjct: 127 KSYSETLSTLSE 138
Score = 33.9 bits (78), Expect = 0.20
Identities = 21/60 (35%), Positives = 27/60 (45%), Gaps = 15/60 (25%)
Query: 710 NQIAYPDKEEEEEKKKKKKK---------------KKKKKKKKKKKKKKKKKKKYSNEEE 754
A P + E + KKKKK K+K K KKKK KKKK K K +++
Sbjct: 46 RHFATPIYDAEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDK 105
>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
(TAF4) is one of several TAFs that bind TBP and is
involved in forming Transcription Factor IID (TFIID)
complex. The TATA Binding Protein (TBP) Associated
Factor 4 (TAF4) is one of several TAFs that bind TBP and
are involved in forming the Transcription Factor IID
(TFIID) complex. TFIID is one of seven General
Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
TFIIF, and TFIID) that are involved in accurate
initiation of transcription by RNA polymerase II in
eukaryote. TFIID plays an important role in the
recognition of promoter DNA and assembly of the
pre-initiation complex. TFIID complex is composed of the
TBP and at least 13 TAFs. TAFs from various species were
originally named by their predicted molecular weight or
their electrophoretic mobility in polyacrylamide gels. A
new, unified nomenclature for the pol II TAFs has been
suggested to show the relationship between TAF orthologs
and paralogs. Several hypotheses are proposed for TAFs
functions such as serving as activator-binding sites,
core-promoter recognition or a role in essential
catalytic activity. Each TAF, with the help of a
specific activator, is required only for the expression
of subset of genes and is not universally involved for
transcription as are GTFs. In yeast and human cells,
TAFs have been found as components of other complexes
besides TFIID. Several TAFs interact via histone-fold
(HFD) motifs; HFD is the interaction motif involved in
heterodimerization of the core histones and their
assembly into nucleosome octamers. The minimal HFD
contains three alpha-helices linked by two loops and is
found in core histones, TAFS and many other
transcription factors. TFIID has a histone octamer-like
substructure. TAF4 domain interacts with TAF12 and makes
a novel histone-like heterodimer that binds DNA and has
a core promoter function of a subset of genes.
Length = 212
Score = 45.0 bits (107), Expect = 5e-05
Identities = 13/55 (23%), Positives = 31/55 (56%), Gaps = 1/55 (1%)
Query: 708 RFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDA 762
Q+ ++E+ +E+++++ + K + ++ + K+K K+ E+E MR R A
Sbjct: 118 FLEQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEEMRH-RAA 171
Score = 31.5 bits (72), Expect = 1.3
Identities = 13/66 (19%), Positives = 31/66 (46%), Gaps = 14/66 (21%)
Query: 816 DPKQRFNQIAYLDKEKKKKK----------KKKKKKKKKKKKKKKKKMMKKGYPDVLSQM 865
D +++ + L++E+++K+ K + ++ + K+K K+M K+ +M
Sbjct: 111 DVRKQLRFLEQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEE----DEEM 166
Query: 866 EKELAN 871
AN
Sbjct: 167 RHRAAN 172
>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
Length = 746
Score = 46.4 bits (111), Expect = 7e-05
Identities = 28/174 (16%), Positives = 60/174 (34%), Gaps = 11/174 (6%)
Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCEPQ 778
EEEE +K++ K++ +++ K + ++ + + A
Sbjct: 39 EEEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPAEEAAEAAAAAEAAARP 98
Query: 779 IAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKKKK 838
A R + ++ K + + K K +++
Sbjct: 99 AED------EAARPAEAAARRPKAKKAAKKKKGPKPKKKKPKRKAARGGKRGKGGKGRRR 152
Query: 839 KKKKKKKKKKKKKKMMKKGYPDVLSQME----KELAN-INRTAVAPINKPFDKG 887
++ +++++KKKKK+ + P + E ELA + A I K F G
Sbjct: 153 RRGRRRRRKKKKKQKPTEKIPREVVIPETITVAELAEKMAVKAAEVIKKLFKLG 206
Score = 29.8 bits (68), Expect = 7.5
Identities = 10/52 (19%), Positives = 28/52 (53%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQ 766
K K+ K K +++++ +++++KKKKK+K + + + + ++
Sbjct: 134 KRKAARGGKRGKGGKGRRRRRGRRRRRKKKKKQKPTEKIPREVVIPETITVA 185
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 45.3 bits (107), Expect = 1e-04
Identities = 33/156 (21%), Positives = 60/156 (38%), Gaps = 34/156 (21%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCE 776
K +E K + K+++K+K++ K++KKKKK+K ++ K +
Sbjct: 93 KPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPK----------EEAKEKRP 142
Query: 777 PQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKK 836
P+ K EK R + K+R A +K KKK
Sbjct: 143 PK-------EKEKEKEKKVEEPRDREE-------------EKKRERVRAKSRPKKPPKKK 182
Query: 837 KKKKKKKK----KKKKKKKKMMKKGYPDVLSQMEKE 868
KKK+ K+++ ++ +K + E+E
Sbjct: 183 PPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEERE 218
Score = 40.6 bits (95), Expect = 0.003
Identities = 25/142 (17%), Positives = 58/142 (40%), Gaps = 17/142 (11%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS--------IQ 766
KEE +++K K++ K+K+ K+K+K+K+KK ++ + EE ++ R A +
Sbjct: 123 KPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKK 182
Query: 767 CGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAY 826
K+ P E + + + + D K R +
Sbjct: 183 PPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREK---------EEDDGKDRETTTSP 233
Query: 827 LDKEKKKKKKKKKKKKKKKKKK 848
+++++ ++ + ++ KK
Sbjct: 234 MEEDESRQSSEISRRSSSSLKK 255
Score = 34.9 bits (80), Expect = 0.20
Identities = 23/71 (32%), Positives = 33/71 (46%), Gaps = 3/71 (4%)
Query: 704 TEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAA 763
E + + + + EE+KK+++ + K + KK KKK KKK EEE K R AA
Sbjct: 144 KEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEE---KQRQAA 200
Query: 764 SIQCGPVKEVP 774
E P
Sbjct: 201 REAVKGKPEEP 211
>gnl|CDD|220818 pfam10595, UPF0564, Uncharacterized protein family UPF0564. This
family of proteins has no known function. However, one
of the members is annotated as an EF-hand family
protein.
Length = 349
Score = 44.8 bits (106), Expect = 2e-04
Identities = 34/145 (23%), Positives = 64/145 (44%), Gaps = 10/145 (6%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCEP 777
E +E KKKK+ +KK K KKK K K+ K + E+ K + + + + EP
Sbjct: 171 EAQERLTKKKKRGQKKSKYKKKTFKPKRAKSIPDFEKLHEKFQKQLAEKKKSKRPTVPEP 230
Query: 778 ---QIAPCLF--DIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKK 832
Q + + + +R +T+ K + +++K
Sbjct: 231 FNFQESHKSSSRTYLDQENISAGEENLKPTRRKLPPSTKKWESLVKFLRTE-----RKEK 285
Query: 833 KKKKKKKKKKKKKKKKKKKKMMKKG 857
+ K++++KK+ +++KKKKK+M K
Sbjct: 286 EAKEQQEKKELEQRKKKKKEMAPKV 310
Score = 35.1 bits (81), Expect = 0.16
Identities = 29/152 (19%), Positives = 59/152 (38%), Gaps = 27/152 (17%)
Query: 722 EKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCEPQIAP 781
K++KKK++KK ++ K +++ K +K E E +K + VP ++ P
Sbjct: 12 TKREKKKREKKSIRQSKLEEELNKLEKKEEEAECKKKFK---------ANPVPASVKL-P 61
Query: 782 CLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKKKKKK- 840
+I E+ + ++ ++E++KK +K
Sbjct: 62 LYEEIMEQNEERREEVREKSKAIL------------LSSQKPFSFYEREEQKKAILPRKL 109
Query: 841 ----KKKKKKKKKKKKMMKKGYPDVLSQMEKE 868
+++ KK K K + K Y +L +E
Sbjct: 110 RSSTSEREPKKFKAKPVPKSIYIPLLKDKMQE 141
Score = 29.7 bits (67), Expect = 6.7
Identities = 12/36 (33%), Positives = 28/36 (77%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSN 751
+++E+E K++++KK+ +++KKKKK+ K K+++
Sbjct: 281 ERKEKEAKEQQEKKELEQRKKKKKEMAPKVKQRFEA 316
Score = 29.4 bits (66), Expect = 9.9
Identities = 10/47 (21%), Positives = 27/47 (57%)
Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQ 766
+ + ++K+K+ K++++KK+ +++KKKK + ++ Q
Sbjct: 275 VKFLRTERKEKEAKEQQEKKELEQRKKKKKEMAPKVKQRFEANDPAQ 321
>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
Length = 158
Score = 42.4 bits (100), Expect = 2e-04
Identities = 15/35 (42%), Positives = 28/35 (80%)
Query: 714 YPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
+P++ E+E+K+ K+ ++K+ K KK+KK+KK+KK
Sbjct: 91 FPEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKK 125
Score = 37.0 bits (86), Expect = 0.015
Identities = 13/44 (29%), Positives = 27/44 (61%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
+++E K+ ++K+ K KK+KK+KK+KK +K + ++
Sbjct: 96 EKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTT 139
Score = 34.4 bits (79), Expect = 0.11
Identities = 13/39 (33%), Positives = 25/39 (64%)
Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKL 759
E+ +K++K+ K+ ++K+ K KK+KK E++ KL
Sbjct: 92 PEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKL 130
Score = 33.6 bits (77), Expect = 0.19
Identities = 12/29 (41%), Positives = 22/29 (75%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
+ K+ ++K+ K KK+KK+KK+KK+ +K
Sbjct: 101 EVSKETEEKEAIKAKKEKKEKKEKKVAEK 129
Score = 32.8 bits (75), Expect = 0.34
Identities = 11/29 (37%), Positives = 20/29 (68%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
+ EK++K+ K+ ++K+ K KK+K KK
Sbjct: 94 ESEKEQKEVSKETEEKEAIKAKKEKKEKK 122
Score = 32.0 bits (73), Expect = 0.65
Identities = 14/44 (31%), Positives = 27/44 (61%)
Query: 814 YLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKG 857
+ + ++ + + E+K+ K KK+KK+KK+KK +K+ KK
Sbjct: 91 FPEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKK 134
Score = 32.0 bits (73), Expect = 0.65
Identities = 15/51 (29%), Positives = 28/51 (54%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINRTAVAP 879
KE+K+ K+ ++K+ K KK+KK+ +K + L++ + N T A
Sbjct: 97 KEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKAT 147
Score = 31.3 bits (71), Expect = 1.1
Identities = 12/34 (35%), Positives = 19/34 (55%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYS 750
+ +++KK+KK+KK +K KKK K K
Sbjct: 111 AIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTK 144
Score = 31.3 bits (71), Expect = 1.1
Identities = 19/44 (43%), Positives = 25/44 (56%), Gaps = 3/44 (6%)
Query: 715 PDKEEEEEKKKK---KKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
KE++E+K+KK K KKK K K KK KK + +EEG
Sbjct: 114 AKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTKKEEG 157
Score = 29.0 bits (65), Expect = 7.8
Identities = 14/29 (48%), Positives = 18/29 (62%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
K KK+KK+KK+KK +K KKK K
Sbjct: 112 IKAKKEKKEKKEKKVAEKLAKKKSTKTTK 140
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 45.1 bits (106), Expect = 2e-04
Identities = 31/139 (22%), Positives = 57/139 (41%), Gaps = 18/139 (12%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNE--EEGMRKLRDAASIQCGPVKEVP 774
K++ EE KK + K + + + + ++K + E +E +K DAA + K
Sbjct: 1335 KKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKK--- 1391
Query: 775 CEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDP-KQRFNQIAYLDKEKKK 833
K D +K D+ + + D K++ + D+ KKK
Sbjct: 1392 ------------KADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKK 1439
Query: 834 KKKKKKKKKKKKKKKKKKK 852
++ KK + KKK ++ KK
Sbjct: 1440 AEEAKKADEAKKKAEEAKK 1458
Score = 39.0 bits (90), Expect = 0.016
Identities = 32/134 (23%), Positives = 61/134 (45%), Gaps = 11/134 (8%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCE 776
K EE+EKK + KK+ ++ KK ++ KKK+ + + E ++K + I+ K+ E
Sbjct: 1682 KAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEE 1741
Query: 777 PQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKK 836
+ + K D EK +I H E + R + A +++E ++ +
Sbjct: 1742 DKKKA--EEAKKDEEEKK---------KIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDE 1790
Query: 837 KKKKKKKKKKKKKK 850
K++ + KK K
Sbjct: 1791 KRRMEVDKKIKDIF 1804
Score = 37.8 bits (87), Expect = 0.033
Identities = 38/190 (20%), Positives = 85/190 (44%), Gaps = 2/190 (1%)
Query: 681 KNDPCEKNNLADRSEDQRINHYTTEVGRFNQI-AYPDKEEEEEKKKKKKKKKKKKKKKKK 739
KN K A ++E+ RI ++ A K+ EE K K ++ KK +++KKK
Sbjct: 1576 KNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKV 1635
Query: 740 KKKKKKKKKYSNEEEGMRKLRDAASIQCGPV-KEVPCEPQIAPCLFDIKNDPCEKNNLAD 798
++ KKK+ + + E ++K + I+ K+ + + A + D +
Sbjct: 1636 EQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALK 1695
Query: 799 RSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGY 858
+ + + + K++ ++ ++E K K ++ KK+ ++ KKK ++ +
Sbjct: 1696 KEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEE 1755
Query: 859 PDVLSQMEKE 868
++ ++KE
Sbjct: 1756 KKKIAHLKKE 1765
Score = 37.8 bits (87), Expect = 0.033
Identities = 33/152 (21%), Positives = 63/152 (41%), Gaps = 7/152 (4%)
Query: 705 EVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS 764
E + A D+ E E+K + +KKK++ KKK KKK ++ +E +K +
Sbjct: 1346 EAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKK 1405
Query: 765 IQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQI 824
+ +K+ + A + + + ++E + + K +
Sbjct: 1406 -KADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAK------KADEAKKKAEEAKK 1458
Query: 825 AYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
A K+K ++ KK + KKK ++ KK KK
Sbjct: 1459 AEEAKKKAEEAKKADEAKKKAEEAKKADEAKK 1490
Score = 36.7 bits (84), Expect = 0.079
Identities = 34/144 (23%), Positives = 58/144 (40%), Gaps = 23/144 (15%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCE 776
K+ +E KKK ++ KK + KKK ++ KKK + E +K +AA + +
Sbjct: 1302 KKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAAD---- 1357
Query: 777 PQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKK 836
+ + E+ A + + K++ + D+ KKK ++
Sbjct: 1358 ----------EAEAAEEKAEAAEKKKEEAKKKADAA-----KKKAEEKKKADEAKKKAEE 1402
Query: 837 KKKK----KKKKKKKKKKKKMMKK 856
KKK KK KKK + KK
Sbjct: 1403 DKKKADELKKAAAAKKKADEAKKK 1426
Score = 36.3 bits (83), Expect = 0.11
Identities = 34/137 (24%), Positives = 65/137 (47%), Gaps = 9/137 (6%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPC 775
+K++EE KKK KKK ++KKK + KKK ++ + + ++K AA + K+
Sbjct: 1370 EKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKK-AAAAKKKADEAKKKAE 1428
Query: 776 EPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKK 835
E + A + + + ++E + + K++ + D+ KKK +
Sbjct: 1429 EKKKADEAKKKAEEAKKADEAKKKAEEAKKAE--------EAKKKAEEAKKADEAKKKAE 1480
Query: 836 KKKKKKKKKKKKKKKKK 852
+ KK + KKK ++ KK
Sbjct: 1481 EAKKADEAKKKAEEAKK 1497
Score = 35.5 bits (81), Expect = 0.16
Identities = 37/164 (22%), Positives = 70/164 (42%), Gaps = 9/164 (5%)
Query: 713 AYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKE 772
A DK++ +E KK KKK + KKK ++KKK + + E +K +A K+
Sbjct: 1400 AEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEA--------KK 1451
Query: 773 VPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKK 832
E + A + + + ++E + + + K++ ++ + KK
Sbjct: 1452 KAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAE-EAKKKADEAKKAAEAKK 1510
Query: 833 KKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINRTA 876
K + KK ++ KK + KK K + EK+ A+ + A
Sbjct: 1511 KADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKA 1554
Score = 35.1 bits (80), Expect = 0.20
Identities = 33/152 (21%), Positives = 63/152 (41%), Gaps = 15/152 (9%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPC 775
+ ++ EE KKK ++ KK + KKK ++ KK + + E +K D A K
Sbjct: 1455 EAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKK---- 1510
Query: 776 EPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKK 835
K D +K A +++ + + + + L K ++ KK
Sbjct: 1511 -----------KADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKK 1559
Query: 836 KKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEK 867
++KKK ++ KK ++ K M + + E+
Sbjct: 1560 AEEKKKAEEAKKAEEDKNMALRKAEEAKKAEE 1591
Score = 35.1 bits (80), Expect = 0.21
Identities = 36/152 (23%), Positives = 60/152 (39%), Gaps = 18/152 (11%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCE 776
K+ EE KK + KKK ++ KKK KKK ++ E + +AA+ + +E
Sbjct: 1309 KKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEE---- 1364
Query: 777 PQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKK 836
K + EK + + E D ++ + + +KK +
Sbjct: 1365 ----------KAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAE----EDKKKADEL 1410
Query: 837 KKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKE 868
KK KKK + KKK +K D + +E
Sbjct: 1411 KKAAAAKKKADEAKKKAEEKKKADEAKKKAEE 1442
Score = 33.6 bits (76), Expect = 0.67
Identities = 33/139 (23%), Positives = 64/139 (46%), Gaps = 2/139 (1%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNE--EEGMRKLRDAASIQCGPVKEV 773
+K++ +E KKK ++ KK + KKK ++ KKK + + EE + A + E
Sbjct: 1300 EKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEA 1359
Query: 774 PCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKK 833
+ A K + +K + A + ++ + + K++ +++ KKK
Sbjct: 1360 EAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKK 1419
Query: 834 KKKKKKKKKKKKKKKKKKK 852
+ KKK ++KKK + KK
Sbjct: 1420 ADEAKKKAEEKKKADEAKK 1438
Score = 32.8 bits (74), Expect = 1.1
Identities = 36/165 (21%), Positives = 71/165 (43%), Gaps = 12/165 (7%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASI-------QCG 768
+ ++ +E KKK ++ KKK + KK + KKK + + E +K +A +
Sbjct: 1481 EAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAK 1540
Query: 769 PVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQR-INHYTTEVGYLDPKQRFNQIAYL 827
+E ++ K + +K A ++E + + E + R ++ L
Sbjct: 1541 KAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKL 1600
Query: 828 DKEKKKKKKKKKKKKKKKKKK----KKKKMMKKGYPDVLSQMEKE 868
+E+KK K ++ KK ++ K K KK + KK + + +E
Sbjct: 1601 YEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEE 1645
Score = 32.4 bits (73), Expect = 1.3
Identities = 40/157 (25%), Positives = 74/157 (47%), Gaps = 20/157 (12%)
Query: 713 AYPDKEEEEEKKKKKKKKKK----KKKKKKKKKKKKKKKKYSNEEEGMRK---------L 759
A K++ +E KK + KKK KK ++ KK + KK + + + + +K L
Sbjct: 1492 AEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADEL 1551
Query: 760 RDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQ 819
+ A ++ K+ E + A + KN K A ++E RI EV L ++
Sbjct: 1552 KKAEELKKAEEKKKAEEAKKAE---EDKNMALRKAEEAKKAEEARIE----EVMKLYEEE 1604
Query: 820 RFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
+ + K ++ K K ++ KK +++KKK +++ KK
Sbjct: 1605 KKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKK 1641
Score = 31.3 bits (70), Expect = 3.7
Identities = 35/183 (19%), Positives = 65/183 (35%), Gaps = 40/183 (21%)
Query: 691 ADRSEDQRINHYT-----TEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKK 745
A ++E++R N + F + K EE K + KK ++KKK + KK ++K
Sbjct: 1242 AKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEK 1301
Query: 746 KKKYSNEEEGMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRI 805
KK +E +K +A K D +K + +
Sbjct: 1302 KKA----DEAKKKAEEAK-----------------------KADEAKKKAEEAKKKADAA 1334
Query: 806 NHYTTEVGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQM 865
E ++ + A E + + + ++K + +KKK K D +
Sbjct: 1335 KKKAEE------AKKAAEAA--KAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKK 1386
Query: 866 EKE 868
+E
Sbjct: 1387 AEE 1389
Score = 29.7 bits (66), Expect = 8.6
Identities = 37/156 (23%), Positives = 60/156 (38%), Gaps = 41/156 (26%)
Query: 717 KEEEEEKKKKKKKKKKKKKK----KKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKE 772
K+ EE+KK + KK ++KKK KKK ++ KK + + E +K DAA + K+
Sbjct: 1284 KKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKK 1343
Query: 773 VPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKK 832
+ + A E + +EK
Sbjct: 1344 --------------AAEAAKAEAEAAADEAEA-----------------------AEEKA 1366
Query: 833 KKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKE 868
+ +KKK++ KKK KKK +K D + +E
Sbjct: 1367 EAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEE 1402
>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
(DUF1754). This is a eukaryotic protein family of
unknown function.
Length = 90
Score = 40.1 bits (94), Expect = 4e-04
Identities = 18/32 (56%), Positives = 20/32 (62%)
Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
K KK KKKKKKKKKK K K++ EEE
Sbjct: 13 KGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEE 44
Score = 38.6 bits (90), Expect = 0.001
Identities = 18/32 (56%), Positives = 22/32 (68%)
Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
K K KK KKKKKKKKKK K K++ + +EE
Sbjct: 11 KLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEE 42
Score = 37.8 bits (88), Expect = 0.002
Identities = 16/36 (44%), Positives = 21/36 (58%)
Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
KK KKKKKKKKKK K K++ + EE+ +
Sbjct: 14 GKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAE 49
Score = 36.2 bits (84), Expect = 0.009
Identities = 16/52 (30%), Positives = 25/52 (48%)
Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVP 774
KKKKKKKKKK K K++ +K+++ + E K + +E
Sbjct: 18 DVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDEDNEKIEQEED 69
Score = 35.5 bits (82), Expect = 0.013
Identities = 16/43 (37%), Positives = 20/43 (46%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELAN 871
K K KK KKKKKKKKKK K K+ + + +
Sbjct: 11 KLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLK 53
Score = 34.7 bits (80), Expect = 0.029
Identities = 15/33 (45%), Positives = 24/33 (72%)
Query: 722 EKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
+ KKKKKKKKKK K K++ +K++++ S+ E
Sbjct: 18 DVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAES 50
Score = 33.9 bits (78), Expect = 0.045
Identities = 16/26 (61%), Positives = 16/26 (61%)
Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKK 748
K K K KK KKKKKKKKKK K
Sbjct: 7 GGKLKLKGKKIDVKKKKKKKKKKNKS 32
Score = 33.9 bits (78), Expect = 0.057
Identities = 17/32 (53%), Positives = 17/32 (53%)
Query: 724 KKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
K K K KK KKKKKKKKKK K E
Sbjct: 7 GGKLKLKGKKIDVKKKKKKKKKKNKSKEEVVT 38
Score = 33.5 bits (77), Expect = 0.059
Identities = 13/25 (52%), Positives = 16/25 (64%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKK 852
K KKKKKKKKKK K K++ +
Sbjct: 15 KKIDVKKKKKKKKKKNKSKEEVVTE 39
Score = 32.8 bits (75), Expect = 0.13
Identities = 13/27 (48%), Positives = 17/27 (62%)
Query: 826 YLDKEKKKKKKKKKKKKKKKKKKKKKK 852
KKKKKKKKKK K K++ +K+
Sbjct: 15 KKIDVKKKKKKKKKKNKSKEEVVTEKE 41
Score = 32.8 bits (75), Expect = 0.14
Identities = 13/26 (50%), Positives = 20/26 (76%)
Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKK 852
+D +KKKKKKKKK K K++ +K++
Sbjct: 17 IDVKKKKKKKKKKNKSKEEVVTEKEE 42
Score = 32.8 bits (75), Expect = 0.14
Identities = 15/32 (46%), Positives = 19/32 (59%)
Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGY 858
L +K KKKKKKKKKK K K++ K+
Sbjct: 12 LKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEE 43
Score = 32.0 bits (73), Expect = 0.21
Identities = 15/32 (46%), Positives = 18/32 (56%)
Query: 727 KKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
K K K KK KKKKKKKKK + +E +
Sbjct: 7 GGKLKLKGKKIDVKKKKKKKKKKNKSKEEVVT 38
Score = 31.2 bits (71), Expect = 0.41
Identities = 15/51 (29%), Positives = 30/51 (58%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQ 766
K + ++KKKKKKKK K K++ +K++++K ++ + + D I+
Sbjct: 15 KKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDEDNEKIE 65
Score = 30.1 bits (68), Expect = 0.98
Identities = 15/26 (57%), Positives = 15/26 (57%)
Query: 831 KKKKKKKKKKKKKKKKKKKKKKMMKK 856
K K K KK KKKKKKKKK K
Sbjct: 7 GGKLKLKGKKIDVKKKKKKKKKKNKS 32
Score = 27.8 bits (62), Expect = 6.7
Identities = 11/25 (44%), Positives = 19/25 (76%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKK 852
K+KKKKKKK K K++ +K++++
Sbjct: 20 KKKKKKKKKKNKSKEEVVTEKEEEE 44
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 43.3 bits (102), Expect = 4e-04
Identities = 35/159 (22%), Positives = 65/159 (40%), Gaps = 22/159 (13%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKY----SNEEEGMRKLRDAASIQCGPV 770
+ E+E+E K ++ +++ ++K+K+K+K+ K +E+E M + ++ PV
Sbjct: 266 DEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKKMMEDEDEDEEMEIVPES------PV 319
Query: 771 KEVPCEPQIAPCLFDIKNDPCEKNNLADRSEV---QRINHYTTEV---GYLDPKQRFNQI 824
+E E P L + + E D +R+ T GYL K+ +
Sbjct: 320 EEEESEEPEPPPLPKKEEEKEEVTVSPDGGRRRGRRRVMKKKTFKDEEGYLVTKKVYEWE 379
Query: 825 AYLDKE------KKKKKKKKKKKKKKKKKKKKKKMMKKG 857
++ + E K K K KK K K K+
Sbjct: 380 SFSEDEAEPPPTKPKPKVSTPAVPAAAKKPKAPKKKKQS 418
Score = 36.7 bits (85), Expect = 0.047
Identities = 28/139 (20%), Positives = 49/139 (35%), Gaps = 24/139 (17%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCE 776
KE EK + K K K+ KK + ++ +K + AS VKE E
Sbjct: 189 KETTTEKTEGKTSVKAASLKRNPPKKSNIMSSFFKKKTKEKKEKKEASES--TVKEESEE 246
Query: 777 PQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKK 836
I D + D E + ++ +
Sbjct: 247 ES--GKRDVILEDESAEPTGLDEDE---------------DEDEP-----KPSGERSDSE 284
Query: 837 KKKKKKKKKKKKKKKKMMK 855
++ ++K+K+K+K+ KKMM+
Sbjct: 285 EETEEKEKEKRKRLKKMME 303
Score = 34.4 bits (79), Expect = 0.26
Identities = 13/37 (35%), Positives = 15/37 (40%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSN 751
D+ E K K K KK K KKKK S+
Sbjct: 383 EDEAEPPPTKPKPKVSTPAVPAAAKKPKAPKKKKQSS 419
Score = 32.5 bits (74), Expect = 1.1
Identities = 28/147 (19%), Positives = 59/147 (40%), Gaps = 15/147 (10%)
Query: 726 KKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPC-EPQIAPCLF 784
KKK K+KK+KK+ + K++ + L D ++ G ++ EP+ +
Sbjct: 222 FKKKTKEKKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERS 281
Query: 785 DIKNDPCEKN-----------NLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKK 833
D + + EK D E I E + + + L K++++
Sbjct: 282 DSEEETEEKEKEKRKRLKKMMEDEDEDEEMEI---VPESPVEEEESEEPEPPPLPKKEEE 338
Query: 834 KKKKKKKKKKKKKKKKKKKMMKKGYPD 860
K++ +++ +++ M KK + D
Sbjct: 339 KEEVTVSPDGGRRRGRRRVMKKKTFKD 365
Score = 31.0 bits (70), Expect = 2.9
Identities = 11/35 (31%), Positives = 13/35 (37%)
Query: 714 YPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
+ + E E K K K KK K KKK
Sbjct: 381 FSEDEAEPPPTKPKPKVSTPAVPAAAKKPKAPKKK 415
>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6. The surfeit locus
protein SURF-6 is shown to be a component of the
nucleolar matrix and has a strong binding capacity for
nucleic acids.
Length = 206
Score = 42.3 bits (100), Expect = 4e-04
Identities = 31/143 (21%), Positives = 61/143 (42%), Gaps = 21/143 (14%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCEP 777
E++++K++++K +KK+K+K+ KKK+ + S EE K P E
Sbjct: 6 ALLEQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEE------VKNEENKSKKKAAPIEN 59
Query: 778 QIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKK- 836
+F K AD + ++ + K++ + L K + +KKK
Sbjct: 60 AEGNIVFS-------KVEFADGEQAKKDLKLKKK----KKKKKTDYKQLLKKLEARKKKL 108
Query: 837 ---KKKKKKKKKKKKKKKKMMKK 856
+ K + ++K+K K + K
Sbjct: 109 EELDEDKAAEIEEKEKWTKALAK 131
Score = 37.3 bits (87), Expect = 0.017
Identities = 20/37 (54%), Positives = 26/37 (70%), Gaps = 5/37 (13%)
Query: 717 KEEEEEKKKKKKKK-----KKKKKKKKKKKKKKKKKK 748
KE+ E+KK +++KK KK+K KK KKKKK KKK
Sbjct: 166 KEKVEKKKAERQKKREENLKKRKDDKKNKKKKKAKKK 202
Score = 35.7 bits (83), Expect = 0.053
Identities = 16/29 (55%), Positives = 18/29 (62%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKK 744
K EE KK+K KK KKKKK KKK +
Sbjct: 178 KKREENLKKRKDDKKNKKKKKAKKKGRIL 206
Score = 35.4 bits (82), Expect = 0.082
Identities = 44/172 (25%), Positives = 69/172 (40%), Gaps = 57/172 (33%)
Query: 709 FNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCG 768
F+++ + D E+ ++ K KKKKKKKK K+ KK + +K + E + + + A
Sbjct: 66 FSKVEFADGEQAKKDLKLKKKKKKKKTDYKQLLKKLEARK--KKLEELDEDKAA------ 117
Query: 769 PVKEVPCEPQIAPCLFDIKNDPCEK-NNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYL 827
+I+ EK ++E ++ D K +
Sbjct: 118 ----------------EIEEK--EKWTKALAKAEGVKVKD--------DEKLLKKALKRK 151
Query: 828 DKEKKKKKKK----------------KK------KKKKKKKKKKKKKMMKKG 857
+K+KKK KK+ KK K+K KK KKKKK KKG
Sbjct: 152 EKQKKKSKKEWKERKEKVEKKKAERQKKREENLKKRKDDKKNKKKKKAKKKG 203
Score = 35.0 bits (81), Expect = 0.12
Identities = 16/29 (55%), Positives = 19/29 (65%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKK 745
K+ EE KK+K KK KKKKK KKK +
Sbjct: 178 KKREENLKKRKDDKKNKKKKKAKKKGRIL 206
Score = 34.2 bits (79), Expect = 0.18
Identities = 16/37 (43%), Positives = 24/37 (64%), Gaps = 5/37 (13%)
Query: 717 KEEEEEKKKK-----KKKKKKKKKKKKKKKKKKKKKK 748
++++ E++KK KK+K KK KKKKK KKK +
Sbjct: 170 EKKKAERQKKREENLKKRKDDKKNKKKKKAKKKGRIL 206
Score = 33.0 bits (76), Expect = 0.39
Identities = 14/32 (43%), Positives = 22/32 (68%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKK 747
+++++ E+ KK+K KK KKKKK KKK +
Sbjct: 175 ERQKKREENLKKRKDDKKNKKKKKAKKKGRIL 206
Score = 33.0 bits (76), Expect = 0.46
Identities = 13/22 (59%), Positives = 15/22 (68%)
Query: 831 KKKKKKKKKKKKKKKKKKKKKK 852
KK+K KK KKKKK KKK +
Sbjct: 185 KKRKDDKKNKKKKKAKKKGRIL 206
Score = 32.7 bits (75), Expect = 0.58
Identities = 13/31 (41%), Positives = 20/31 (64%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKKGY 858
+++KK+++ KK+K KK KKKKK K
Sbjct: 175 ERQKKREENLKKRKDDKKNKKKKKAKKKGRI 205
Score = 31.9 bits (73), Expect = 0.91
Identities = 12/22 (54%), Positives = 15/22 (68%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKK 850
K++K KK KKKKK KKK +
Sbjct: 185 KKRKDDKKNKKKKKAKKKGRIL 206
Score = 31.9 bits (73), Expect = 1.2
Identities = 12/35 (34%), Positives = 28/35 (80%)
Query: 822 NQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
++ A L++ ++K++++K +KK+K+K+ KKK+ +K
Sbjct: 3 SREALLEQRRRKREQRKARKKQKRKEAKKKEDAQK 37
Score = 31.5 bits (72), Expect = 1.5
Identities = 13/23 (56%), Positives = 14/23 (60%)
Query: 827 LDKEKKKKKKKKKKKKKKKKKKK 849
L K K KK KKKKK KKK +
Sbjct: 184 LKKRKDDKKNKKKKKAKKKGRIL 206
Score = 30.4 bits (69), Expect = 3.5
Identities = 10/39 (25%), Positives = 24/39 (61%)
Query: 830 EKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKE 868
E++++K++++K +KK+K+K+ KK + +E
Sbjct: 9 EQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEE 47
>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal. This
domain is found to the N-terminus of bacterial signal
peptidases of the S49 family (pfam01343).
Length = 154
Score = 41.4 bits (98), Expect = 4e-04
Identities = 17/42 (40%), Positives = 24/42 (57%), Gaps = 1/42 (2%)
Query: 814 YLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMK 855
Y D K+ A LDK++ K +K +KK +K K K +KK K
Sbjct: 51 YKDLKESLEA-ALLDKKELKAWEKAEKKAEKAKAKAEKKKAK 91
Score = 41.4 bits (98), Expect = 5e-04
Identities = 12/27 (44%), Positives = 18/27 (66%)
Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKKK 746
++E K +K +KK +K K K +KKK K
Sbjct: 65 KKELKAWEKAEKKAEKAKAKAEKKKAK 91
Score = 41.4 bits (98), Expect = 5e-04
Identities = 16/35 (45%), Positives = 23/35 (65%), Gaps = 2/35 (5%)
Query: 716 DKEE--EEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
DK+E EK +KK +K K K +KKK KK++ K +
Sbjct: 64 DKKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98
Score = 39.0 bits (92), Expect = 0.003
Identities = 14/34 (41%), Positives = 21/34 (61%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
KE + +K +KK +K K K +KKK KK++ K
Sbjct: 64 DKKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKP 97
Score = 38.3 bits (90), Expect = 0.005
Identities = 14/31 (45%), Positives = 19/31 (61%)
Query: 724 KKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
KK+ K +K +KK +K K K +KKK EE
Sbjct: 65 KKELKAWEKAEKKAEKAKAKAEKKKAKKEEP 95
Score = 37.5 bits (88), Expect = 0.010
Identities = 12/26 (46%), Positives = 18/26 (69%)
Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKK 852
+K +KK +K K K +KKK KK++ K
Sbjct: 71 WEKAEKKAEKAKAKAEKKKAKKEEPK 96
Score = 36.4 bits (85), Expect = 0.020
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 825 AYLDKEKKKKKKKKKKKKKKKKKKKKKK 852
A+ EKK +K K K +KKK KK++ K
Sbjct: 70 AWEKAEKKAEKAKAKAEKKKAKKEEPKP 97
Score = 34.8 bits (81), Expect = 0.074
Identities = 11/33 (33%), Positives = 19/33 (57%)
Query: 726 KKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
KK+ K +K +KK +K K K +K ++E +
Sbjct: 65 KKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKP 97
Score = 32.9 bits (76), Expect = 0.35
Identities = 14/44 (31%), Positives = 21/44 (47%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLR 760
KE E KK+ K +K +KK +K K K + ++ K R
Sbjct: 55 KESLEAALLDKKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98
Score = 32.1 bits (74), Expect = 0.50
Identities = 10/26 (38%), Positives = 15/26 (57%)
Query: 831 KKKKKKKKKKKKKKKKKKKKKKMMKK 856
KK+ K +K +KK +K K K + K
Sbjct: 65 KKELKAWEKAEKKAEKAKAKAEKKKA 90
Score = 31.3 bits (72), Expect = 0.95
Identities = 10/23 (43%), Positives = 16/23 (69%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKK 851
K+ +K K K +KKK KK++ K +
Sbjct: 76 KKAEKAKAKAEKKKAKKEEPKPR 98
Score = 29.4 bits (67), Expect = 5.2
Identities = 10/30 (33%), Positives = 15/30 (50%)
Query: 818 KQRFNQIAYLDKEKKKKKKKKKKKKKKKKK 847
K +K K K +KKK KK++ K +
Sbjct: 69 KAWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98
>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
complex component). RNases H are enzymes that
specifically hydrolyse RNA when annealed to a
complementary DNA and are present in all living
organisms. In yeast RNase H2 is composed of a complex of
three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
family represents the homologues of Ydr279p. It is not
known whether non yeast proteins in this family fulfil
the same function.
Length = 287
Score = 42.7 bits (101), Expect = 5e-04
Identities = 16/66 (24%), Positives = 28/66 (42%), Gaps = 1/66 (1%)
Query: 694 SEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE 753
+ + + + E +KK+K K++ KKKK K+ K K K
Sbjct: 219 EFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKGVKALKKVV-A 277
Query: 754 EGMRKL 759
+GM+K+
Sbjct: 278 KGMKKI 283
Score = 38.5 bits (90), Expect = 0.012
Identities = 15/29 (51%), Positives = 17/29 (58%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
KE+ KKKK K+ K K KK K MKK
Sbjct: 254 SKEEIKKKKPKESKGVKALKKVVAKGMKK 282
Score = 36.2 bits (84), Expect = 0.069
Identities = 14/30 (46%), Positives = 19/30 (63%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKKG 857
K K K++ KKKK K+ K K KK++ KG
Sbjct: 250 KKRKSKEEIKKKKPKESKGVKALKKVVAKG 279
Score = 35.0 bits (81), Expect = 0.13
Identities = 13/30 (43%), Positives = 20/30 (66%)
Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
+ +KK+K K++ KKKK K+ K K +KK
Sbjct: 245 ESRAEKKRKSKEEIKKKKPKESKGVKALKK 274
Score = 33.8 bits (78), Expect = 0.32
Identities = 13/27 (48%), Positives = 18/27 (66%)
Query: 830 EKKKKKKKKKKKKKKKKKKKKKKMMKK 856
EKK+K K++ KKKK K+ K K + K
Sbjct: 249 EKKRKSKEEIKKKKPKESKGVKALKKV 275
Score = 33.5 bits (77), Expect = 0.46
Identities = 11/25 (44%), Positives = 14/25 (56%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKM 853
K+KK K+ K K KK K KK+
Sbjct: 259 KKKKPKESKGVKALKKVVAKGMKKI 283
Score = 32.3 bits (74), Expect = 0.92
Identities = 11/30 (36%), Positives = 18/30 (60%)
Query: 823 QIAYLDKEKKKKKKKKKKKKKKKKKKKKKK 852
+ +K++K K++ KKKK K+ K K K
Sbjct: 244 AESRAEKKRKSKEEIKKKKPKESKGVKALK 273
Score = 31.9 bits (73), Expect = 1.2
Identities = 12/44 (27%), Positives = 23/44 (52%), Gaps = 2/44 (4%)
Query: 814 YLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKG 857
+ K++ + + + + +KK+K K++ KKKK K KG
Sbjct: 227 LKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPK--ESKG 268
Score = 29.2 bits (66), Expect = 9.3
Identities = 18/62 (29%), Positives = 27/62 (43%), Gaps = 12/62 (19%)
Query: 807 HYTTEVGYLDPKQRFNQIAYLDKEKKKKKK------------KKKKKKKKKKKKKKKKMM 854
Y + L + YL + KKKK++ +KK+K K++ KKKK K
Sbjct: 207 LYKELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKES 266
Query: 855 KK 856
K
Sbjct: 267 KG 268
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 43.4 bits (102), Expect = 6e-04
Identities = 28/158 (17%), Positives = 61/158 (38%), Gaps = 10/158 (6%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVP 774
K +E+ KK + + K+K + +++ NEE Q
Sbjct: 200 ELKLKEQAKKALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELLRDEQ-------- 251
Query: 775 CEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKK 834
E + + + + + + ++ E + E+ L ++ + L E++K
Sbjct: 252 EEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKV 311
Query: 835 KKKKKKK--KKKKKKKKKKKMMKKGYPDVLSQMEKELA 870
++K K +K+ KK +K+ +K + L + KEL
Sbjct: 312 DDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELE 349
Score = 40.3 bits (94), Expect = 0.005
Identities = 29/178 (16%), Positives = 63/178 (35%), Gaps = 3/178 (1%)
Query: 711 QIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPV 770
+ +E EEE++++ +K ++K ++ +++ KKK + + + ++
Sbjct: 346 KELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKNEEE 405
Query: 771 KEVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKE 830
KE +++ D+ + EK E + T+ G L ++ + L
Sbjct: 406 KEAKLLLELSEQEEDLLKE--EKKEELKIVEELEESL-ETKQGKLTEEKEELEKQALKLL 462
Query: 831 KKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINRTAVAPINKPFDKGG 888
K K + KK + K+ K K + E + K G
Sbjct: 463 KDKLELKKSEDLLKETKLVKLLEQLELLLLRQKLEEASQKESKAREGLAVLLALIKDG 520
Score = 37.3 bits (86), Expect = 0.046
Identities = 27/149 (18%), Positives = 70/149 (46%), Gaps = 9/149 (6%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCE 776
+E EE+K K + + K++K+K++KK+ +++ +K + EE ++ + + + + + E
Sbjct: 878 EELEEQKLKDELESKEEKEKEEKKELEEESQKDNLLEEKENEIEERIAEEAIILLKYESE 937
Query: 777 PQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRF----NQIAYLDKEKK 832
P+ + + E N + +R+ E+G ++ + Y E K
Sbjct: 938 PEELLLEEADEKEKEEDNKEEEEERNKRLLLAKEELGNVNLMAIAEFEEKEERYNKDELK 997
Query: 833 K-----KKKKKKKKKKKKKKKKKKKMMKK 856
K +KK+ ++ ++ ++ K+ ++
Sbjct: 998 KERLEEEKKELLREIIEETCQRFKEFLEL 1026
Score = 36.1 bits (83), Expect = 0.11
Identities = 30/156 (19%), Positives = 72/156 (46%), Gaps = 2/156 (1%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCEP 777
EEEEK +K K +++K++K K ++++ + + +EE + I+ +
Sbjct: 778 AEEEEKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELE 837
Query: 778 QIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKKK 837
++A L + + EK + ++ + L K+ + L E + K++K
Sbjct: 838 ELALELKEEQKL--EKLAEEELERLEEEITKEELLQELLLKEEELEEQKLKDELESKEEK 895
Query: 838 KKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANIN 873
+K++KK+ +++ +K + + + + + E A I
Sbjct: 896 EKEEKKELEEESQKDNLLEEKENEIEERIAEEAIIL 931
Score = 34.2 bits (78), Expect = 0.38
Identities = 32/150 (21%), Positives = 67/150 (44%), Gaps = 12/150 (8%)
Query: 714 YPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE----GMRKLRDAASIQCGP 769
+ E+EE+ + K+ K+++K+KK ++++ K + EEE + KL
Sbjct: 257 SKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEK 316
Query: 770 VKEVPCEPQIAPCLFDIKNDPCEKNNLA---DRSEVQRINHYTTEVGYLDPKQRFNQIAY 826
+KE E ++ ++K + E L E++R E +++ Q
Sbjct: 317 LKE--SEKELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQ--- 371
Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
L++E KKK + ++ K K++++ K
Sbjct: 372 LEEELLAKKKLESERLSSAAKLKEEELELK 401
Score = 31.9 bits (72), Expect = 2.3
Identities = 17/78 (21%), Positives = 39/78 (50%), Gaps = 3/78 (3%)
Query: 679 DIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKK 738
+ + E N + ++R+ E+G N +A + EE+EE+ K + KK++ +++K
Sbjct: 946 ADEKEKEEDNKEEEEERNKRLLLAKEELGNVNLMAIAEFEEKEERYNKDELKKERLEEEK 1005
Query: 739 KKKKK---KKKKKYSNEE 753
K+ + ++ + E
Sbjct: 1006 KELLREIIEETCQRFKEF 1023
Score = 29.9 bits (67), Expect = 6.9
Identities = 9/48 (18%), Positives = 25/48 (52%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAA 763
E EEK+++ K + KK++ +++KK+ ++ + ++ +
Sbjct: 980 AIAEFEEKEERYNKDELKKERLEEEKKELLREIIEETCQRFKEFLELF 1027
>gnl|CDD|217502 pfam03343, SART-1, SART-1 family. SART-1 is a protein involved in
cell cycle arrest and pre-mRNA splicing. It has been
shown to be a component of U4/U6 x U5 tri-snRNP complex
in human, Schizosaccharomyces pombe and Saccharomyces
cerevisiae. SART-1 is a known tumour antigen in a range
of cancers recognised by T cells.
Length = 603
Score = 42.8 bits (101), Expect = 7e-04
Identities = 40/141 (28%), Positives = 55/141 (39%), Gaps = 33/141 (23%)
Query: 722 EKKKKKKKKKKKKKKKK-------KKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVP 774
EK+K KK + KKKK K K+ KY E EG +K D
Sbjct: 175 EKEKDKKNLELKKKKPDYDPDDDDKFNKRSILSKYDEEIEGKKKKSDNLFT--------- 225
Query: 775 CEPQIAPCLFDIK---NDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEK 831
D +D EK + ++ +IN+ + + D Y D +
Sbjct: 226 ---------LDSGGSTDDEAEKKRQEVKKKL-KINNVSLD----DDSTETPASDYYDVSE 271
Query: 832 KKKKKKKKKKKKKKKKKKKKK 852
K KK KKKKKKKKK++K
Sbjct: 272 MVKFKKPKKKKKKKKKRRKDL 292
Score = 37.0 bits (86), Expect = 0.047
Identities = 20/90 (22%), Positives = 34/90 (37%)
Query: 698 RINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMR 757
+IN+ + + A + E K KK KKKKKKKKK++K + + + E G
Sbjct: 248 KINNVSLDDDSTETPASDYYDVSEMVKFKKPKKKKKKKKKRRKDLDEDELEPEAEGLGSS 307
Query: 758 KLRDAASIQCGPVKEVPCEPQIAPCLFDIK 787
++ + + D
Sbjct: 308 DSGSRKDVEEENARLEDSPKKRKEEQEDDD 337
Score = 33.6 bits (77), Expect = 0.49
Identities = 12/45 (26%), Positives = 21/45 (46%)
Query: 832 KKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINRTA 876
K+K KK + KKKK K +LS+ ++E+ + +
Sbjct: 176 KEKDKKNLELKKKKPDYDPDDDDKFNKRSILSKYDEEIEGKKKKS 220
Score = 32.4 bits (74), Expect = 1.1
Identities = 22/63 (34%), Positives = 30/63 (47%), Gaps = 27/63 (42%)
Query: 719 EEEEKKKKKKKKKKK---------------------------KKKKKKKKKKKKKKKYSN 751
+E EKK+++ KKK K KK KKKKKKKKK++K +
Sbjct: 234 DEAEKKRQEVKKKLKINNVSLDDDSTETPASDYYDVSEMVKFKKPKKKKKKKKKRRKDLD 293
Query: 752 EEE 754
E+E
Sbjct: 294 EDE 296
Score = 30.1 bits (68), Expect = 6.1
Identities = 9/46 (19%), Positives = 24/46 (52%)
Query: 710 NQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
N ++ E + K+++ ++K K ++K+++ K K E++
Sbjct: 41 NWKKRQEEAEAKRKREELREKIAKAREKRERNSKLGGIKTLGEDDD 86
>gnl|CDD|203848 pfam08079, Ribosomal_L30_N, Ribosomal L30 N-terminal domain. This
presumed domain is found at the N-terminus of Ribosomal
L30 proteins and has been termed RL30NT or NUC018.
Length = 71
Score = 38.4 bits (90), Expect = 8e-04
Identities = 14/42 (33%), Positives = 22/42 (52%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
E+ K+ KK+ KK +KKK+K K+ +KY E +
Sbjct: 9 NEKLRAKRAKKRAAKKAARKKKRKLIFKRAEKYVKEYRAAER 50
Score = 34.1 bits (79), Expect = 0.030
Identities = 12/28 (42%), Positives = 19/28 (67%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
EK + K+ KK+ KK +KKK+K++ K
Sbjct: 9 NEKLRAKRAKKRAAKKAARKKKRKLIFK 36
Score = 32.2 bits (74), Expect = 0.15
Identities = 11/26 (42%), Positives = 18/26 (69%)
Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKK 748
K+K+ +K + K+ KK+ KK +KKK
Sbjct: 5 KRKRNEKLRAKRAKKRAAKKAARKKK 30
Score = 31.8 bits (73), Expect = 0.17
Identities = 10/29 (34%), Positives = 20/29 (68%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
+++ + K+ KK+ KK +KKK+K + K+
Sbjct: 9 NEKLRAKRAKKRAAKKAARKKKRKLIFKR 37
Score = 29.5 bits (67), Expect = 1.1
Identities = 9/26 (34%), Positives = 16/26 (61%)
Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKK 748
KK+K+ +K + K+ KK+ KK +
Sbjct: 2 LLKKRKRNEKLRAKRAKKRAAKKAAR 27
Score = 28.3 bits (64), Expect = 3.5
Identities = 11/26 (42%), Positives = 17/26 (65%)
Query: 831 KKKKKKKKKKKKKKKKKKKKKKMMKK 856
KK+K+ +K + K+ KK+ KK KK
Sbjct: 4 KKRKRNEKLRAKRAKKRAAKKAARKK 29
Score = 28.0 bits (63), Expect = 3.7
Identities = 11/32 (34%), Positives = 18/32 (56%)
Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGY 858
L K K+ +K + K+ KK+ KK +K +K
Sbjct: 3 LKKRKRNEKLRAKRAKKRAAKKAARKKKRKLI 34
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 42.8 bits (101), Expect = 8e-04
Identities = 22/56 (39%), Positives = 37/56 (66%), Gaps = 10/56 (17%)
Query: 715 PDKEEEEEKKKKKKKKKKK----------KKKKKKKKKKKKKKKYSNEEEGMRKLR 760
PD+EEEEEK++KK++++K ++K++KKKK KK K+ + E E + K +
Sbjct: 36 PDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLNKTK 91
Score = 30.5 bits (69), Expect = 4.5
Identities = 11/32 (34%), Positives = 21/32 (65%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYP 859
E+++K++KKKK KK K+ + +++ K P
Sbjct: 61 VDEEEEKEEKKKKTKKVKETTTEWELLNKTKP 92
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 42.6 bits (101), Expect = 9e-04
Identities = 18/54 (33%), Positives = 31/54 (57%)
Query: 705 EVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
E+ K+ ++ +K +KK++++KK+KKKK KKK+ EEE +K
Sbjct: 398 EIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKK 451
Score = 40.3 bits (95), Expect = 0.004
Identities = 16/38 (42%), Positives = 30/38 (78%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
+++ EE+KK+KKKK KKK+++++++K+KK +EE
Sbjct: 420 EKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEE 457
Score = 39.9 bits (94), Expect = 0.005
Identities = 19/38 (50%), Positives = 30/38 (78%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
K+ EEEKK+KKKK KKK+++++++K+KK+ EEE
Sbjct: 421 KKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEE 458
Score = 39.9 bits (94), Expect = 0.005
Identities = 16/46 (34%), Positives = 34/46 (73%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDA 762
+EEE+++KKKK KKK+++++++K+KK+++ EEE + ++
Sbjct: 423 REEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEE 468
Score = 39.9 bits (94), Expect = 0.006
Identities = 18/48 (37%), Positives = 31/48 (64%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS 764
K EEE+K+KKKK KKK+++++++K+KK++ EEE +
Sbjct: 422 KREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEE 469
Score = 39.9 bits (94), Expect = 0.006
Identities = 15/40 (37%), Positives = 29/40 (72%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
E++ E++KK+KKKK KKK+++++++K+K E+E
Sbjct: 418 KAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEE 457
Score = 39.5 bits (93), Expect = 0.008
Identities = 16/50 (32%), Positives = 27/50 (54%), Gaps = 4/50 (8%)
Query: 716 DKEEEE----EKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRD 761
+EE E KK KK KK +K +KK++++KK+KK ++ +
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEE 444
Score = 39.1 bits (92), Expect = 0.009
Identities = 14/45 (31%), Positives = 31/45 (68%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRD 761
E+ EKK++++KK+KKKK KKK++++++ ++E ++ +
Sbjct: 415 IVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEE 459
Score = 37.6 bits (88), Expect = 0.031
Identities = 14/39 (35%), Positives = 30/39 (76%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
+K E++ +++KK+KKKK KKK+++++++K+ EE+
Sbjct: 417 EKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEK 455
Score = 36.8 bits (86), Expect = 0.049
Identities = 14/39 (35%), Positives = 31/39 (79%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
++E++E+KKK KKK+++++++K+KK+++K+ EE
Sbjct: 424 EEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEA 462
Score = 36.4 bits (85), Expect = 0.062
Identities = 15/64 (23%), Positives = 29/64 (45%), Gaps = 1/64 (1%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINRTAVAPINKPFDKGG 888
K+ +K +KK++++KK+KKKK KK + + +KE + ++
Sbjct: 413 KKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKE-EEKEEEEEEAEEEKEEEEE 471
Query: 889 DPKN 892
K
Sbjct: 472 KKKK 475
Score = 35.7 bits (83), Expect = 0.10
Identities = 12/40 (30%), Positives = 31/40 (77%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
++++E++KK KKK+++++++K+KK+++K++ E E
Sbjct: 424 EEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAE 463
Score = 35.7 bits (83), Expect = 0.10
Identities = 15/45 (33%), Positives = 28/45 (62%)
Query: 710 NQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
+I + ++ KK KK +K +KK++++KK+KKKK ++E
Sbjct: 397 EEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKE 441
Score = 34.9 bits (81), Expect = 0.21
Identities = 12/30 (40%), Positives = 20/30 (66%)
Query: 823 QIAYLDKEKKKKKKKKKKKKKKKKKKKKKK 852
+ EKK++++KK+KKKK KKK++
Sbjct: 413 KKIVEKAEKKREEEKKEKKKKAFAGKKKEE 442
Score = 34.9 bits (81), Expect = 0.21
Identities = 14/42 (33%), Positives = 25/42 (59%)
Query: 811 EVGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKK 852
E+ +L ++ + EK +KK++++KK+KKKK KK
Sbjct: 398 EIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKK 439
Score = 34.9 bits (81), Expect = 0.22
Identities = 14/36 (38%), Positives = 22/36 (61%)
Query: 822 NQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKG 857
+I +L KK KK KK +K +KK++++K KK
Sbjct: 397 EEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKK 432
Score = 34.1 bits (79), Expect = 0.33
Identities = 12/33 (36%), Positives = 32/33 (96%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
++EEE+EKK+++K++++++ +++K+++++KKKK
Sbjct: 443 EEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475
Score = 33.4 bits (77), Expect = 0.65
Identities = 11/34 (32%), Positives = 31/34 (91%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
KEEEEE++K+KK+++K++++++ +++K+++++
Sbjct: 438 KKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEE 471
Score = 33.0 bits (76), Expect = 0.80
Identities = 12/30 (40%), Positives = 23/30 (76%)
Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
++EKK+KKKK KKK+++++++K K+
Sbjct: 423 REEEKKEKKKKAFAGKKKEEEEEEEKEKKE 452
Score = 33.0 bits (76), Expect = 0.82
Identities = 12/33 (36%), Positives = 32/33 (96%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
++EEEEEK+KK+++K++++++ +++K+++++KK
Sbjct: 441 EEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKK 473
Score = 32.6 bits (75), Expect = 1.1
Identities = 15/41 (36%), Positives = 25/41 (60%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKE 868
K+ KK +K +KK++++KK+KKKK + + EKE
Sbjct: 409 TKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKE 449
Score = 32.2 bits (74), Expect = 1.2
Identities = 11/34 (32%), Positives = 32/34 (94%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
++EEEE++KK+++K++++++ +++K+++++KKK
Sbjct: 441 EEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKK 474
Score = 31.4 bits (72), Expect = 2.1
Identities = 10/30 (33%), Positives = 28/30 (93%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKK 744
+KE++EE+K++++++ +++K+++++KKKK
Sbjct: 446 EEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475
Score = 30.7 bits (70), Expect = 4.5
Identities = 11/41 (26%), Positives = 28/41 (68%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKE 868
+++K+KKKK KKK+++++++K+ ++ + + E+E
Sbjct: 425 EEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEE 465
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 42.6 bits (100), Expect = 0.001
Identities = 39/186 (20%), Positives = 61/186 (32%), Gaps = 41/186 (22%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRD--------------AA 763
E+ +++K KKKKKK K KKK KK K+ +++ D +A
Sbjct: 233 GEDGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDSDDGDDEGREEDYISDSSA 292
Query: 764 SIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQ 823
S +E P+I +++ E++ E
Sbjct: 293 SGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEE---------------------- 330
Query: 824 IAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINRTAVAPINKP 883
+ KK KK KK K KK K G S ++ E + TA
Sbjct: 331 -----EGGLSKKGKKLKKLKGKKNGLDKDDSDSGDDSDDSDIDGEDSVSLVTAKKQKEPK 385
Query: 884 FDKGGD 889
++ D
Sbjct: 386 KEEPVD 391
Score = 37.6 bits (87), Expect = 0.030
Identities = 15/40 (37%), Positives = 21/40 (52%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
+E EEEK +++ KK KK KK K KK + + G
Sbjct: 320 SEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSG 359
Score = 36.1 bits (83), Expect = 0.078
Identities = 18/67 (26%), Positives = 32/67 (47%), Gaps = 4/67 (5%)
Query: 692 DRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKK----KKKKKKKKKKKKKKKKKK 747
++R + + E+ +I + EE E++K +++ KK KK KK K KK K
Sbjct: 294 GNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDK 353
Query: 748 KYSNEEE 754
S+ +
Sbjct: 354 DDSDSGD 360
>gnl|CDD|206228 pfam14058, PcfK, PcfK-like protein. The PcfK-like protein family
includes the Enterococcus faecalis PcfK protein, which
is functionally uncharacterized. This family of proteins
is found in bacteria and viruses. Proteins in this
family are typically between 137 and 257 amino acids in
length. There are two completely conserved residues (D
and L) that may be functionally important.
Length = 136
Score = 39.7 bits (93), Expect = 0.001
Identities = 12/47 (25%), Positives = 25/47 (53%)
Query: 710 NQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGM 756
N +EE+ E +K+ K ++++ +K +K+ KK KK + +
Sbjct: 85 NHTVELTEEEKAEARKEALKAYQQEELRKIQKRSKKSKKAEPVQGQL 131
Score = 31.6 bits (72), Expect = 0.75
Identities = 16/77 (20%), Positives = 29/77 (37%), Gaps = 13/77 (16%)
Query: 714 YPDKEEEEEKKKKK----KKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGP 769
Y + + + K + +++K + +K+ K Y EE +RK++
Sbjct: 67 YDEDDIKVGKPINCRVTVNHTVELTEEEKAEARKEALKAYQQEE--LRKIQKR------S 118
Query: 770 VKEVPCEPQIA-PCLFD 785
K EP LFD
Sbjct: 119 KKSKKAEPVQGQLSLFD 135
>gnl|CDD|218148 pfam04557, tRNA_synt_1c_R2, Glutaminyl-tRNA synthetase,
non-specific RNA binding region part 2. This is a
region found N terminal to the catalytic domain of
glutaminyl-tRNA synthetase (EC 6.1.1.18) in eukaryotes
but not in Escherichia coli. This region is thought to
bind RNA in a non-specific manner, enhancing
interactions between the tRNA and enzyme, but is not
essential for enzyme function.
Length = 83
Score = 38.5 bits (90), Expect = 0.001
Identities = 17/45 (37%), Positives = 20/45 (44%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANI 872
D KKKKKKKKKK + K KK + + ME L
Sbjct: 21 DLVKKKKKKKKKKAEDTAATAKAKKATAEDVSEGAMFMEGFLGEK 65
Score = 38.1 bits (89), Expect = 0.001
Identities = 17/48 (35%), Positives = 21/48 (43%), Gaps = 4/48 (8%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKK----KKYSNEEEGMRK 758
P E + KKKKKKKKKK + K KK + + EG
Sbjct: 16 PKTEADLVKKKKKKKKKKAEDTAATAKAKKATAEDVSEGAMFMEGFLG 63
Score = 31.6 bits (72), Expect = 0.25
Identities = 11/25 (44%), Positives = 14/25 (56%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKK 852
+ + KKKKKKKKKK + K
Sbjct: 19 EADLVKKKKKKKKKKAEDTAATAKA 43
Score = 31.2 bits (71), Expect = 0.39
Identities = 13/27 (48%), Positives = 15/27 (55%)
Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKKM 853
L K+KKKKKKKK + K KK
Sbjct: 22 LVKKKKKKKKKKAEDTAATAKAKKATA 48
Score = 30.8 bits (70), Expect = 0.47
Identities = 11/25 (44%), Positives = 13/25 (52%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKK 852
+ KKKKKKKKKK + K
Sbjct: 18 TEADLVKKKKKKKKKKAEDTAATAK 42
Score = 30.4 bits (69), Expect = 0.79
Identities = 20/64 (31%), Positives = 27/64 (42%), Gaps = 5/64 (7%)
Query: 725 KKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEV---PCE--PQI 779
K + KKKKKKKKKK + ++ + + A G + E P E PQ
Sbjct: 17 KTEADLVKKKKKKKKKKAEDTAATAKAKKATAEDVSEGAMFMEGFLGEKFHKPGENYPQT 76
Query: 780 APCL 783
P L
Sbjct: 77 YPLL 80
Score = 27.3 bits (61), Expect = 8.4
Identities = 12/26 (46%), Positives = 14/26 (53%)
Query: 831 KKKKKKKKKKKKKKKKKKKKKKMMKK 856
K + KKKKKKKKKK + K
Sbjct: 17 KTEADLVKKKKKKKKKKAEDTAATAK 42
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 42.3 bits (100), Expect = 0.001
Identities = 35/176 (19%), Positives = 59/176 (33%), Gaps = 9/176 (5%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCE 776
KE +E++ K K K K K +K K KKK+KKK + + +K + + E
Sbjct: 1150 KEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKR-- 1207
Query: 777 PQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKK 836
D K D + N+ E + + + + E +
Sbjct: 1208 ------KLDDKPDNKKSNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFS 1261
Query: 837 KKKKKKKKKKKKKKKKMMKKGY-PDVLSQMEKELANINRTAVAPINKPFDKGGDPK 891
K+ K K K++ Y P S+ +N +P K K +
Sbjct: 1262 SDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGS 1317
Score = 34.3 bits (79), Expect = 0.38
Identities = 37/192 (19%), Positives = 56/192 (29%), Gaps = 27/192 (14%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAA----------- 763
K + K K KKK+KKKKK K KK S + K +
Sbjct: 1163 KGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSG 1222
Query: 764 --SIQCGPVKEVPCEPQIAPCLFDIKN--------DPCEKNNLADRSEVQRINHYTTEVG 813
K P + + N D ++L+ + + + V
Sbjct: 1223 SDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQ 1282
Query: 814 YLDPK------QRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEK 867
Y P N + KKK KK+ + KKKKK K K
Sbjct: 1283 YSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVK 1342
Query: 868 ELANINRTAVAP 879
+ + + +
Sbjct: 1343 QASASQSSRLLR 1354
Score = 31.9 bits (73), Expect = 2.1
Identities = 17/64 (26%), Positives = 26/64 (40%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINRTAVAPINKPFDKGG 888
KE++ K K K K K +K K KKK KK + + ++ + + D
Sbjct: 1154 KEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKP 1213
Query: 889 DPKN 892
D K
Sbjct: 1214 DNKK 1217
Score = 29.6 bits (67), Expect = 9.8
Identities = 16/46 (34%), Positives = 21/46 (45%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLR 760
K E KKKKK +KK +KKK K + K+ S + R
Sbjct: 1310 VKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRR 1355
>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
N-terminal. SRP is a complex of six distinct
polypeptides and a 7S RNA that is essential for
transferring nascent polypeptide chains that are
destined for export from the cell to the translocation
apparatus of the endoplasmic reticulum (ER) membrane.
SRP binds hydrophobic signal sequences as they emerge
from the ribosome, and arrests translation.
Length = 272
Score = 41.3 bits (97), Expect = 0.001
Identities = 32/172 (18%), Positives = 51/172 (29%), Gaps = 25/172 (14%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCEP 777
E+E +K+ K K + ++ KK KK +E K + Q E
Sbjct: 98 EKESKKQAKSPKAMRTFEESKKSKKTVDSMIERKPKEPGLKRKQRKKAQESATSP---ES 154
Query: 778 QIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKKK 837
+ + +L E R + A +
Sbjct: 155 SPSS---TPNSSRPSTPHLLKAKEGPS---------------RRAKKAAKLSSTASSGDE 196
Query: 838 KKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINRTAVAPINKPFDKGGD 889
K K K KK KKM K L E + A ++ +A ++ D D
Sbjct: 197 KSPKSKAAPKKAGKKMRKWD----LDGDEDDDAVLDYSAPDANDENADAPED 244
Score = 29.7 bits (67), Expect = 5.8
Identities = 8/39 (20%), Positives = 12/39 (30%)
Query: 710 NQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
P + ++ K +K K K KK K
Sbjct: 172 KAKEGPSRRAKKAAKLSSTASSGDEKSPKSKAAPKKAGK 210
>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2. It is vital for
effective cell-replication that replication is not
stalled at any point by, for instance, damaged bases.
Rtf2 stabilizes the replication fork stalled at the
site-specific replication barrier RTS1 by preventing
replication restart until completion of DNA synthesis by
a converging replication fork initiated at a flanking
origin. The RTS1 element terminates replication forks
that are moving in the cen2-distal direction while
allowing forks moving in the cen2-proximal direction to
pass through the region. Rtf2 contains a C2HC2 motif
related to the C3HC4 RING-finger motif, and would appear
to fold up, creating a RING finger-like structure but
forming only one functional Zn2+ ion-binding site.
Length = 254
Score = 40.8 bits (96), Expect = 0.002
Identities = 17/34 (50%), Positives = 23/34 (67%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
P +EE E K + ++++ KKKKKKKKKK KK
Sbjct: 164 PTEEEVELLKARLEEERAKKKKKKKKKKTKKNNA 197
Score = 40.0 bits (94), Expect = 0.003
Identities = 16/38 (42%), Positives = 23/38 (60%)
Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGM 756
EEE + K + ++++ KKKKKKKKKK KK +
Sbjct: 166 EEEVELLKARLEEERAKKKKKKKKKKTKKNNATGSSAE 203
Score = 38.9 bits (91), Expect = 0.007
Identities = 14/48 (29%), Positives = 19/48 (39%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS 764
+EE +KKKKKKKKK KK + E + +
Sbjct: 177 EEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEA 224
Score = 35.4 bits (82), Expect = 0.11
Identities = 12/40 (30%), Positives = 17/40 (42%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
++E ++KKKKKKKK KK + E
Sbjct: 177 EEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSS 216
Score = 33.9 bits (78), Expect = 0.32
Identities = 14/30 (46%), Positives = 17/30 (56%)
Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
L++E+ KKKKKKKKKK KK
Sbjct: 176 LEEERAKKKKKKKKKKTKKNNATGSSAEAT 205
Score = 32.3 bits (74), Expect = 0.88
Identities = 10/43 (23%), Positives = 18/43 (41%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
++ +++KKKKKKK KK + + G +
Sbjct: 178 EERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQ 220
Score = 31.9 bits (73), Expect = 1.1
Identities = 15/32 (46%), Positives = 20/32 (62%), Gaps = 2/32 (6%)
Query: 828 DKEKKKKK--KKKKKKKKKKKKKKKKKMMKKG 857
+ E K + +++ KKKKKKKKKK KK G
Sbjct: 168 EVELLKARLEEERAKKKKKKKKKKTKKNNATG 199
Score = 31.2 bits (71), Expect = 2.0
Identities = 17/57 (29%), Positives = 26/57 (45%)
Query: 823 QIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINRTAVAP 879
++ L ++++ KKKKKKKKKK KK V S + EL++
Sbjct: 168 EVELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEA 224
Score = 31.2 bits (71), Expect = 2.3
Identities = 18/71 (25%), Positives = 26/71 (36%)
Query: 830 EKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINRTAVAPINKPFDKGGD 889
E+++ KKKKKKKKKK KK + E + K D
Sbjct: 177 EEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAKKLKKKRSIAPD 236
Query: 890 PKNFDHAWSIF 900
+ + S+F
Sbjct: 237 NEKSEVYKSLF 247
Score = 29.2 bits (66), Expect = 9.6
Identities = 13/29 (44%), Positives = 21/29 (72%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
++E + K + ++++ KKKKKKKKK KK
Sbjct: 166 EEEVELLKARLEEERAKKKKKKKKKKTKK 194
>gnl|CDD|218612 pfam05501, DUF755, Domain of unknown function (DUF755). This
family is predominated by ORFs from Circoviridae. The
function of this family remains to be determined.
Length = 122
Score = 38.9 bits (91), Expect = 0.002
Identities = 14/47 (29%), Positives = 25/47 (53%), Gaps = 5/47 (10%)
Query: 709 FNQIAYPDKEEEEEKKKKKKKKKKKKKKK-----KKKKKKKKKKKYS 750
Q+ P ++ +E++K + KKK+KK + KK KK++ S
Sbjct: 58 SQQLQQPQRKRKEQQKSSQTHKKKRKKSRHVSSRSAKKISAKKRRRS 104
Score = 29.3 bits (66), Expect = 3.4
Identities = 8/35 (22%), Positives = 18/35 (51%)
Query: 818 KQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKK 852
++ + + K+K+KK + + KK KK++
Sbjct: 68 RKEQQKSSQTHKKKRKKSRHVSSRSAKKISAKKRR 102
Score = 28.9 bits (65), Expect = 5.3
Identities = 11/49 (22%), Positives = 20/49 (40%), Gaps = 14/49 (28%)
Query: 818 KQRFNQIAYLDKEKK---------KKKKKKKK-----KKKKKKKKKKKK 852
Q+ +++K KKK+KK + KK KK+++
Sbjct: 55 IHLSQQLQQPQRKRKEQQKSSQTHKKKRKKSRHVSSRSAKKISAKKRRR 103
>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
This is a family of fungal proteins whose function is
unknown.
Length = 130
Score = 39.2 bits (92), Expect = 0.002
Identities = 10/30 (33%), Positives = 23/30 (76%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKK 746
+ +E +K+K+ +K ++KK K+++K+K+K
Sbjct: 101 ERTKERAEKEKRTRKNREKKFKRRQKEKEK 130
Score = 38.8 bits (91), Expect = 0.002
Identities = 11/31 (35%), Positives = 25/31 (80%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKK 747
+E +E+ +K+K+ +K ++KK K+++K+K+K
Sbjct: 100 RERTKERAEKEKRTRKNREKKFKRRQKEKEK 130
Score = 37.6 bits (88), Expect = 0.005
Identities = 10/32 (31%), Positives = 25/32 (78%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
+ E +++ +K+K+ +K ++KK K+++K+K+K
Sbjct: 99 RRERTKERAEKEKRTRKNREKKFKRRQKEKEK 130
Score = 37.3 bits (87), Expect = 0.008
Identities = 10/32 (31%), Positives = 23/32 (71%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
+ E K++ +K+K+ +K ++KK K+++K+K
Sbjct: 97 RLRRERTKERAEKEKRTRKNREKKFKRRQKEK 128
Score = 36.1 bits (84), Expect = 0.017
Identities = 13/45 (28%), Positives = 27/45 (60%)
Query: 714 YPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
+ K E+E ++K+KKKK+K+ KK++ + +++ + E K
Sbjct: 67 HNAKVEKELLREKEKKKKRKRPGKKRRIALRLRRERTKERAEKEK 111
Score = 34.2 bits (79), Expect = 0.079
Identities = 9/43 (20%), Positives = 26/43 (60%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
+E+E++KK+K+ KK++ + ++++ K++ + R+
Sbjct: 76 LREKEKKKKRKRPGKKRRIALRLRRERTKERAEKEKRTRKNRE 118
Score = 33.8 bits (78), Expect = 0.099
Identities = 9/34 (26%), Positives = 22/34 (64%)
Query: 818 KQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKK 851
K+R ++ K++ +K+K+ +K ++KK K++
Sbjct: 91 KRRIALRLRRERTKERAEKEKRTRKNREKKFKRR 124
Score = 33.8 bits (78), Expect = 0.13
Identities = 6/29 (20%), Positives = 21/29 (72%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKG 857
+ ++++ K++ +K+K+ +K ++KK ++
Sbjct: 97 RLRRERTKERAEKEKRTRKNREKKFKRRQ 125
Score = 33.4 bits (77), Expect = 0.17
Identities = 10/24 (41%), Positives = 20/24 (83%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKK 852
EK+K+ +K ++KK K+++K+K+K
Sbjct: 107 AEKEKRTRKNREKKFKRRQKEKEK 130
Score = 33.4 bits (77), Expect = 0.18
Identities = 6/28 (21%), Positives = 20/28 (71%)
Query: 722 EKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
+ ++++ K++ +K+K+ +K ++KK+
Sbjct: 94 IALRLRRERTKERAEKEKRTRKNREKKF 121
Score = 33.0 bits (76), Expect = 0.22
Identities = 10/23 (43%), Positives = 20/23 (86%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKK 850
+KEK+ +K ++KK K+++K+K+K
Sbjct: 108 EKEKRTRKNREKKFKRRQKEKEK 130
Score = 32.6 bits (75), Expect = 0.27
Identities = 8/24 (33%), Positives = 19/24 (79%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKK 852
+ +K+K+ +K ++KK K+++K+K
Sbjct: 105 ERAEKEKRTRKNREKKFKRRQKEK 128
Score = 32.6 bits (75), Expect = 0.31
Identities = 9/24 (37%), Positives = 20/24 (83%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKK 852
KE+ +K+K+ +K ++KK K+++K+
Sbjct: 104 KERAEKEKRTRKNREKKFKRRQKE 127
Score = 32.6 bits (75), Expect = 0.33
Identities = 9/32 (28%), Positives = 23/32 (71%)
Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
++++ K++ +K+K+ +K ++KK K+ E+E
Sbjct: 98 LRRERTKERAEKEKRTRKNREKKFKRRQKEKE 129
Score = 32.2 bits (74), Expect = 0.38
Identities = 8/33 (24%), Positives = 22/33 (66%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKK 747
P K+ + ++++ K++ +K+K+ +K ++KK
Sbjct: 88 PGKKRRIALRLRRERTKERAEKEKRTRKNREKK 120
Score = 31.5 bits (72), Expect = 0.78
Identities = 5/25 (20%), Positives = 17/25 (68%)
Query: 832 KKKKKKKKKKKKKKKKKKKKKMMKK 856
+ ++++ K++ +K+K+ +K +K
Sbjct: 95 ALRLRRERTKERAEKEKRTRKNREK 119
Score = 30.7 bits (70), Expect = 1.5
Identities = 13/45 (28%), Positives = 27/45 (60%), Gaps = 8/45 (17%)
Query: 719 EEEEKKKKKKKKKKK---KKKKKKKKKKKKKKKYSNEEEGMRKLR 760
E+E+KKK+K+ KK+ + ++++ K++ +K+ RK R
Sbjct: 78 EKEKKKKRKRPGKKRRIALRLRRERTKERAEKEKRT-----RKNR 117
Score = 30.3 bits (69), Expect = 2.1
Identities = 8/24 (33%), Positives = 20/24 (83%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKK 851
+++K+ +K ++KK K+++K+K+K
Sbjct: 107 AEKEKRTRKNREKKFKRRQKEKEK 130
Score = 28.8 bits (65), Expect = 5.5
Identities = 6/28 (21%), Positives = 18/28 (64%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
+ + ++++ K++ +K+K+ +K KK
Sbjct: 93 RIALRLRRERTKERAEKEKRTRKNREKK 120
>gnl|CDD|220611 pfam10169, Laps, Learning-associated protein. This is a family of
121-amino acid secretory proteins. Laps functions in the
regulation of neuronal cell adhesion and/or movement and
synapse attachment. Laps binds to the ApC/EBP (Aplysia
CCAAT/enhancer binding protein) promoter and activates
the transcription of ApC/EBP mRNA.
Length = 124
Score = 38.7 bits (90), Expect = 0.002
Identities = 31/131 (23%), Positives = 60/131 (45%), Gaps = 15/131 (11%)
Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCEPQIAPC 782
++K + K+KK K+ + KK + M+ ++D A+++ P +EV +
Sbjct: 7 WRRKMRAIKRKKNAPKELARLKKALGLIDGNVLMKDVKDIATVK--PAEEVKEK------ 58
Query: 783 LFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQI-AYLDKEKKKKKKKKKKK 841
K + E + E +++ + + Q A++++ + KK KKK++K
Sbjct: 59 ---KKKEGTES---EEDDEKMEVDNKAAVRNKKTLRDQHGQYPAWMNQRQAKKLKKKREK 112
Query: 842 KKKKKKKKKKK 852
KKKKKK K
Sbjct: 113 KKKKKKGAKNL 123
Score = 32.1 bits (73), Expect = 0.36
Identities = 13/26 (50%), Positives = 18/26 (69%)
Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKK 748
+++ KK KKK++KKKKKKK K
Sbjct: 99 NQRQAKKLKKKREKKKKKKKGAKNLA 124
Score = 29.8 bits (67), Expect = 2.8
Identities = 15/31 (48%), Positives = 19/31 (61%)
Query: 714 YPDKEEEEEKKKKKKKKKKKKKKKKKKKKKK 744
YP + + KK KKK++KKKKKKK K
Sbjct: 94 YPAWMNQRQAKKLKKKREKKKKKKKGAKNLA 124
>gnl|CDD|219564 pfam07771, TSGP1, Tick salivary peptide group 1. This contains a
group of peptides derived from a salivary gland cDNA
library of the tick Ixodes scapularis. Also present are
peptides from a related tick species, Ixodes ricinus.
They are characterized by a putative signal peptide
indicative of secretion and conserved cysteine residues.
Length = 120
Score = 38.7 bits (90), Expect = 0.002
Identities = 25/64 (39%), Positives = 28/64 (43%), Gaps = 3/64 (4%)
Query: 683 DPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKK 742
+ C N D H TT G + PD E +K KKKKKK KK KK KK
Sbjct: 60 ETCFYNTGNDGVCQNGECHLTTSSGEPSH---PDDHPPEPTEKPKKKKKKSKKTKKPKKS 116
Query: 743 KKKK 746
KK
Sbjct: 117 SKKD 120
Score = 35.6 bits (82), Expect = 0.025
Identities = 16/27 (59%), Positives = 17/27 (62%)
Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKK 747
E +K KKKKKK KK KK KK KK
Sbjct: 94 PEPTEKPKKKKKKSKKTKKPKKSSKKD 120
Score = 35.6 bits (82), Expect = 0.026
Identities = 16/23 (69%), Positives = 16/23 (69%)
Query: 830 EKKKKKKKKKKKKKKKKKKKKKK 852
EK KKKKKK KK KK KK KK
Sbjct: 98 EKPKKKKKKSKKTKKPKKSSKKD 120
Score = 34.9 bits (80), Expect = 0.044
Identities = 15/23 (65%), Positives = 16/23 (69%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKK 850
+K KKKKKK KK KK KK KK
Sbjct: 98 EKPKKKKKKSKKTKKPKKSSKKD 120
Score = 32.2 bits (73), Expect = 0.34
Identities = 14/27 (51%), Positives = 17/27 (62%)
Query: 826 YLDKEKKKKKKKKKKKKKKKKKKKKKK 852
+ + +K KKKKKK KK KK KK K
Sbjct: 92 HPPEPTEKPKKKKKKSKKTKKPKKSSK 118
Score = 31.4 bits (71), Expect = 0.64
Identities = 14/20 (70%), Positives = 14/20 (70%)
Query: 729 KKKKKKKKKKKKKKKKKKKK 748
K KKKKKK KK KK KK K
Sbjct: 99 KPKKKKKKSKKTKKPKKSSK 118
Score = 30.6 bits (69), Expect = 1.1
Identities = 14/22 (63%), Positives = 16/22 (72%)
Query: 732 KKKKKKKKKKKKKKKKKYSNEE 753
K KKKKKK KK KK KK S ++
Sbjct: 99 KPKKKKKKSKKTKKPKKSSKKD 120
Score = 29.8 bits (67), Expect = 2.3
Identities = 14/22 (63%), Positives = 14/22 (63%)
Query: 835 KKKKKKKKKKKKKKKKKKMMKK 856
K KKKKKK KK KK KK K
Sbjct: 99 KPKKKKKKSKKTKKPKKSSKKD 120
Score = 28.7 bits (64), Expect = 5.5
Identities = 13/22 (59%), Positives = 17/22 (77%)
Query: 733 KKKKKKKKKKKKKKKKYSNEEE 754
K KKKKKK KK KK K S++++
Sbjct: 99 KPKKKKKKSKKTKKPKKSSKKD 120
>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
Length = 330
Score = 41.0 bits (97), Expect = 0.002
Identities = 11/31 (35%), Positives = 19/31 (61%)
Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
++E K K +KKK+K++ K K K K + +
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAAKAKSKPRLF 93
Score = 40.2 bits (95), Expect = 0.003
Identities = 13/34 (38%), Positives = 20/34 (58%)
Query: 823 QIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
+ A LDK++ K K +KKK+K++ K K K
Sbjct: 57 KAALLDKKELKAWHKAQKKKEKQEAKAAKAKSKP 90
Score = 37.5 bits (88), Expect = 0.027
Identities = 12/30 (40%), Positives = 19/30 (63%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKK 745
DK+E + K +KKK+K++ K K K K +
Sbjct: 62 DKKELKAWHKAQKKKEKQEAKAAKAKSKPR 91
Score = 37.1 bits (87), Expect = 0.032
Identities = 12/32 (37%), Positives = 18/32 (56%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
K +KK+ K K +KKK+K++ K K K
Sbjct: 57 KAALLDKKELKAWHKAQKKKEKQEAKAAKAKS 88
Score = 34.8 bits (81), Expect = 0.17
Identities = 9/23 (39%), Positives = 14/23 (60%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKK 851
K +KKK+K++ K K K K +
Sbjct: 69 WHKAQKKKEKQEAKAAKAKSKPR 91
Score = 34.4 bits (80), Expect = 0.21
Identities = 9/23 (39%), Positives = 14/23 (60%)
Query: 827 LDKEKKKKKKKKKKKKKKKKKKK 849
K +KKK+K++ K K K K +
Sbjct: 69 WHKAQKKKEKQEAKAAKAKSKPR 91
Score = 34.0 bits (79), Expect = 0.28
Identities = 12/34 (35%), Positives = 19/34 (55%), Gaps = 3/34 (8%)
Query: 815 LDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKK 848
LD K+ A+ +KKK+K++ K K K K +
Sbjct: 61 LDKKE---LKAWHKAQKKKEKQEAKAAKAKSKPR 91
Score = 34.0 bits (79), Expect = 0.35
Identities = 11/30 (36%), Positives = 17/30 (56%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKK 744
KE + K +KKK+K++ K K K K +
Sbjct: 62 DKKELKAWHKAQKKKEKQEAKAAKAKSKPR 91
Score = 32.9 bits (76), Expect = 0.64
Identities = 12/32 (37%), Positives = 18/32 (56%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
KEE + KK+ K K +KKK+K++ K
Sbjct: 53 KEELKAALLDKKELKAWHKAQKKKEKQEAKAA 84
Score = 31.7 bits (73), Expect = 1.6
Identities = 12/41 (29%), Positives = 22/41 (53%)
Query: 714 YPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
Y + +EE + KK+ K K +KKK+K++ K + +
Sbjct: 49 YKEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKSK 89
Score = 31.0 bits (71), Expect = 2.6
Identities = 11/28 (39%), Positives = 16/28 (57%)
Query: 832 KKKKKKKKKKKKKKKKKKKKKMMKKGYP 859
KK+ K K +KKK+K++ K K P
Sbjct: 63 KKELKAWHKAQKKKEKQEAKAAKAKSKP 90
Score = 29.4 bits (67), Expect = 8.3
Identities = 14/43 (32%), Positives = 24/43 (55%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKL 759
KE +EE K KK+ K K +KKK+K++ + + + +L
Sbjct: 50 KEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKSKPRL 92
>gnl|CDD|219547 pfam07741, BRF1, Brf1-like TBP-binding domain. This region covers
both the Brf homology II and III regions. This region is
involved in binding TATA binding protein.
Length = 95
Score = 36.9 bits (86), Expect = 0.005
Identities = 16/31 (51%), Positives = 22/31 (70%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
EE+EEK+ K+K + KKKKK+K KKK+
Sbjct: 30 EEQEEKELKQKADEGNNSGKKKKKRKAKKKR 60
Score = 36.9 bits (86), Expect = 0.005
Identities = 15/34 (44%), Positives = 21/34 (61%)
Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNE 752
EE+E+K+ K+K + KKKKK+K KKK
Sbjct: 30 EEQEEKELKQKADEGNNSGKKKKKRKAKKKRDEA 63
Score = 35.3 bits (82), Expect = 0.020
Identities = 14/37 (37%), Positives = 22/37 (59%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE 753
++EE+E K+K + KKKKK+K KKK+ +
Sbjct: 31 EQEEKELKQKADEGNNSGKKKKKRKAKKKRDEAGPAS 67
Score = 34.9 bits (81), Expect = 0.026
Identities = 14/34 (41%), Positives = 22/34 (64%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
++EE+E K+K + KKKKK+K KKK+ +
Sbjct: 30 EEQEEKELKQKADEGNNSGKKKKKRKAKKKRDEA 63
Score = 34.5 bits (80), Expect = 0.036
Identities = 14/34 (41%), Positives = 22/34 (64%)
Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE 753
EE+++K+ K+K + KKKKK+K KK +E
Sbjct: 30 EEQEEKELKQKADEGNNSGKKKKKRKAKKKRDEA 63
Score = 33.4 bits (77), Expect = 0.081
Identities = 15/39 (38%), Positives = 24/39 (61%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
K+ EE+++K+ K+K + KKKKK+K K +E G
Sbjct: 26 KDYLEEQEEKELKQKADEGNNSGKKKKKRKAKKKRDEAG 64
Score = 30.3 bits (69), Expect = 0.92
Identities = 13/44 (29%), Positives = 23/44 (52%), Gaps = 3/44 (6%)
Query: 826 YLDKEKKKKKKKKK---KKKKKKKKKKKKKMMKKGYPDVLSQME 866
YL+++++K+ K+K KKKKK+K K + + E
Sbjct: 28 YLEEQEEKELKQKADEGNNSGKKKKKRKAKKKRDEAGPASTAAE 71
Score = 29.5 bits (67), Expect = 1.8
Identities = 16/53 (30%), Positives = 25/53 (47%), Gaps = 9/53 (16%)
Query: 813 GYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKK---------KKMMKK 856
YL+ ++ D+ KKKKK+K KKK+ + KKM++K
Sbjct: 27 DYLEEQEEKELKQKADEGNNSGKKKKKRKAKKKRDEAGPASTAAEAVKKMLQK 79
>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992). This
bacterial family of proteins has no known function.
However, the cis-regulatory yjdF motif, just upstream
from the gene encoding the proteins for this family, is
a small non-coding RNA, Rfam:RF01764. The yjdF motif is
found in many Firmicutes, including Bacillus subtilis.
In most cases, it resides in potential 5' UTRs of
homologues of the yjdF gene whose function is unknown.
However, in Streptococcus thermophilus, a yjdF RNA motif
is associated with an operon whose protein products
synthesise nicotinamide adenine dinucleotide (NAD+).
Also, the S. thermophilus yjdF RNA lacks typical yjdF
motif consensus features downstream of and including the
P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
S. thermophilus RNAs might sense a distinct compound
that structurally resembles the ligand bound by other
yjdF RNAs. On the ohter hand, perhaps these RNAs have an
alternative solution forming a similar binding site, as
is observed with some SAM riboswitches.
Length = 132
Score = 37.6 bits (88), Expect = 0.005
Identities = 16/38 (42%), Positives = 25/38 (65%), Gaps = 6/38 (15%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKK------KKKKKK 748
K E E K++KKK+ K+KK+++K++K KKK K
Sbjct: 92 KLEHERNKQEKKKRSKEKKEEEKERKRQLKQQKKKAKH 129
Score = 36.1 bits (84), Expect = 0.022
Identities = 12/30 (40%), Positives = 26/30 (86%)
Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
L+ E+ K++KKK+ K+KK+++K++K+ +K+
Sbjct: 93 LEHERNKQEKKKRSKEKKEEEKERKRQLKQ 122
Score = 32.6 bits (75), Expect = 0.35
Identities = 11/24 (45%), Positives = 21/24 (87%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKK 852
KEKK+++K++K++ K++KKK K +
Sbjct: 107 KEKKEEEKERKRQLKQQKKKAKHR 130
Score = 31.5 bits (72), Expect = 0.70
Identities = 10/27 (37%), Positives = 22/27 (81%)
Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKY 749
K + ++ K++KKK+ K+KK+++K++K
Sbjct: 92 KLEHERNKQEKKKRSKEKKEEEKERKR 118
Score = 31.1 bits (71), Expect = 1.2
Identities = 14/32 (43%), Positives = 25/32 (78%), Gaps = 3/32 (9%)
Query: 829 KEKKKKKKKKKKKKKKKK---KKKKKKMMKKG 857
+EKKK+ K+KK+++K++K K++KKK +G
Sbjct: 100 QEKKKRSKEKKEEEKERKRQLKQQKKKAKHRG 131
Score = 30.7 bits (70), Expect = 1.2
Identities = 11/32 (34%), Positives = 22/32 (68%)
Query: 818 KQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKK 849
KQ + + KE++K++K++ K++KKK K +
Sbjct: 99 KQEKKKRSKEKKEEEKERKRQLKQQKKKAKHR 130
Score = 30.3 bits (69), Expect = 1.7
Identities = 11/22 (50%), Positives = 18/22 (81%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKK 737
KEEE+E+K++ K++KKK K +
Sbjct: 109 KKEEEKERKRQLKQQKKKAKHR 130
Score = 30.3 bits (69), Expect = 1.7
Identities = 11/24 (45%), Positives = 21/24 (87%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKK 740
KE++EE+K++K++ K++KKK K +
Sbjct: 107 KEKKEEEKERKRQLKQQKKKAKHR 130
Score = 30.3 bits (69), Expect = 1.9
Identities = 12/28 (42%), Positives = 24/28 (85%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
K++KKK+ K+KK+++K++K++ K+ KK
Sbjct: 99 KQEKKKRSKEKKEEEKERKRQLKQQKKK 126
Score = 30.3 bits (69), Expect = 1.9
Identities = 11/28 (39%), Positives = 23/28 (82%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
+++KKK+ K+KK+++K++K++ K KK
Sbjct: 98 NKQEKKKRSKEKKEEEKERKRQLKQQKK 125
Score = 29.1 bits (66), Expect = 5.2
Identities = 10/23 (43%), Positives = 20/23 (86%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKK 738
+K+EEE+++K++ K++KKK K +
Sbjct: 108 EKKEEEKERKRQLKQQKKKAKHR 130
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 38.0 bits (89), Expect = 0.006
Identities = 9/33 (27%), Positives = 22/33 (66%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKK 747
EEE+ + +K+ K K++K+++ ++K+K+
Sbjct: 113 ELDEEEQIDELLEKELAKLKREKRRENERKQKE 145
Score = 36.9 bits (86), Expect = 0.014
Identities = 10/44 (22%), Positives = 27/44 (61%)
Query: 705 EVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
E ++ + +EEE+ + +K+ K K++K+++ ++K+K+
Sbjct: 102 EEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKE 145
Score = 36.5 bits (85), Expect = 0.020
Identities = 12/47 (25%), Positives = 30/47 (63%), Gaps = 6/47 (12%)
Query: 809 TTEVGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMK 855
EV LD +++ +D+ +K+ K K++K+++ ++K+K+++K
Sbjct: 108 EVEVEELDEEEQ------IDELLEKELAKLKREKRRENERKQKEILK 148
Score = 35.3 bits (82), Expect = 0.044
Identities = 10/32 (31%), Positives = 23/32 (71%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKK 747
D+ E+E K K++K+++ ++K+K+ K++ K
Sbjct: 121 DELLEKELAKLKREKRRENERKQKEILKEQMK 152
Score = 35.3 bits (82), Expect = 0.046
Identities = 9/33 (27%), Positives = 24/33 (72%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
++E+ +E +K+ K K++K+++ ++K+K+ K
Sbjct: 116 EEEQIDELLEKELAKLKREKRRENERKQKEILK 148
Score = 33.0 bits (76), Expect = 0.25
Identities = 8/33 (24%), Positives = 24/33 (72%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
++ +E +K+ K K++K+++ ++K+K+ K++
Sbjct: 118 EQIDELLEKELAKLKREKRRENERKQKEILKEQ 150
Score = 29.6 bits (67), Expect = 4.1
Identities = 13/51 (25%), Positives = 29/51 (56%), Gaps = 4/51 (7%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQ 766
++EEE E ++ ++++ + +K+ K K++K+ NE RK ++ Q
Sbjct: 104 EEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENE----RKQKEILKEQ 150
>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
Length = 213
Score = 38.8 bits (91), Expect = 0.007
Identities = 12/53 (22%), Positives = 26/53 (49%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCG 768
++ E +K+K ++++K K K +KK+K + E + D + + G
Sbjct: 118 QAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISELTVG 170
Score = 36.4 bits (85), Expect = 0.039
Identities = 12/41 (29%), Positives = 23/41 (56%)
Query: 708 RFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
R Q A + E++K ++++K K K +KK+K + +K
Sbjct: 114 RAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKP 154
Score = 34.5 bits (80), Expect = 0.16
Identities = 8/32 (25%), Positives = 20/32 (62%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
+ E++ KK++ +K+K ++++K K K +
Sbjct: 113 QRAEQQAKKREAAGEKEKAPRRERKPKPKAPR 144
Score = 34.1 bits (79), Expect = 0.19
Identities = 10/34 (29%), Positives = 21/34 (61%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
+ E+EK ++++K K K +KK+K + +K +
Sbjct: 122 REAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPE 155
Score = 33.7 bits (78), Expect = 0.29
Identities = 11/43 (25%), Positives = 20/43 (46%)
Query: 819 QRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDV 861
QR Q A + +K+K ++++K K K +KK +
Sbjct: 113 QRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPE 155
Score = 33.4 bits (77), Expect = 0.37
Identities = 9/50 (18%), Positives = 27/50 (54%), Gaps = 3/50 (6%)
Query: 717 KEEEEEKKKKKKKKKKK---KKKKKKKKKKKKKKKYSNEEEGMRKLRDAA 763
K + + ++ +++ KK++ +K+K ++++K K K ++ R +
Sbjct: 107 KAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEP 156
Score = 33.0 bits (76), Expect = 0.47
Identities = 11/38 (28%), Positives = 20/38 (52%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQME 866
EK+K ++++K K K +KK+K +K P +
Sbjct: 126 GEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSD 163
Score = 33.0 bits (76), Expect = 0.53
Identities = 9/51 (17%), Positives = 22/51 (43%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINRTAVA 878
+K++ +K+K ++++K K K KK P + + + +
Sbjct: 118 QAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISELT 168
Score = 31.8 bits (73), Expect = 0.98
Identities = 9/34 (26%), Positives = 20/34 (58%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
E+E+ ++++K K K +KK+K + +K +
Sbjct: 123 EAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEP 156
Score = 31.8 bits (73), Expect = 1.0
Identities = 7/31 (22%), Positives = 20/31 (64%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
EE + K + ++ +++ KK++ +K+K ++
Sbjct: 104 EEAKAKVQAQRAEQQAKKREAAGEKEKAPRR 134
Score = 31.0 bits (71), Expect = 2.3
Identities = 12/62 (19%), Positives = 26/62 (41%), Gaps = 2/62 (3%)
Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK--LRDAASIQCGPVKEVPCEPQ 778
EE K K + ++ +++ KK++ +K+K E + K + P +
Sbjct: 104 EEAKAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSD 163
Query: 779 IA 780
I+
Sbjct: 164 IS 165
Score = 31.0 bits (71), Expect = 2.3
Identities = 8/35 (22%), Positives = 21/35 (60%)
Query: 818 KQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKK 852
+Q+ + +++K ++++K K K +KK+K +
Sbjct: 116 EQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPR 150
Score = 30.3 bits (69), Expect = 3.1
Identities = 10/36 (27%), Positives = 21/36 (58%)
Query: 713 AYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
A +KE+ +++K K K +KK+K + +K + +
Sbjct: 124 AAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHT 159
Score = 30.3 bits (69), Expect = 3.6
Identities = 9/35 (25%), Positives = 18/35 (51%)
Query: 818 KQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKK 852
K+R ++++K K K +KK+K + +K
Sbjct: 120 KKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKP 154
Score = 29.5 bits (67), Expect = 5.7
Identities = 10/42 (23%), Positives = 21/42 (50%)
Query: 713 AYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
A ++ ++K K K +KK+K + +K + + S+ E
Sbjct: 125 AGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISE 166
>gnl|CDD|219621 pfam07890, Rrp15p, Rrp15p. Rrp15p is required for the formation of
60S ribosomal subunits.
Length = 132
Score = 37.3 bits (87), Expect = 0.007
Identities = 33/153 (21%), Positives = 60/153 (39%), Gaps = 39/153 (25%)
Query: 713 AYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKE 772
A K+ + KK K KKK K +K +KK K++ + + + G VK
Sbjct: 16 ASKRKDPILSRSKKLLKAKKKLKSEKLEKKAKRQLRAEKRQALEK----------GRVKP 65
Query: 773 VPCEPQIAPCLFDIKNDPCEKN--NLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKE 830
V P+ E+ +A + V + FN + K+
Sbjct: 66 VL--PED---------LEKERRLRKVAQKGVV----------------KLFNAVRAAQKK 98
Query: 831 KKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLS 863
+K K+ KK + K+++ K++ K+ + D+L
Sbjct: 99 TEKAVKEAGKKARVKREELLKEVSKEKFLDLLR 131
Score = 34.3 bits (79), Expect = 0.086
Identities = 13/48 (27%), Positives = 26/48 (54%), Gaps = 3/48 (6%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQM---EKELANI 872
K+K K +K +KK K++ + +K++ + K VL + E+ L +
Sbjct: 33 AKKKLKSEKLEKKAKRQLRAEKRQALEKGRVKPVLPEDLEKERRLRKV 80
Score = 32.3 bits (74), Expect = 0.35
Identities = 15/44 (34%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGY--PDVLSQMEKE 868
L +KK K +K +KK K++ + +K++ ++KG P + +EKE
Sbjct: 31 LKAKKKLKSEKLEKKAKRQLRAEKRQALEKGRVKPVLPEDLEKE 74
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 39.2 bits (92), Expect = 0.010
Identities = 14/33 (42%), Positives = 26/33 (78%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
+ E+EEK+ K++K+ +K KK+++KKKK+ +K
Sbjct: 554 QREKEEKEALKEQKRLRKLKKQEEKKKKELEKL 586
Score = 38.9 bits (91), Expect = 0.012
Identities = 15/34 (44%), Positives = 26/34 (76%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYS 750
KEE+E K++K+ +K KK+++KKKK+ +K +K
Sbjct: 557 KEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAK 590
Score = 35.8 bits (83), Expect = 0.11
Identities = 18/45 (40%), Positives = 32/45 (71%), Gaps = 3/45 (6%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLR 760
DKEE + +K++K+ K++K+ +K KK+++KKKK E E + K +
Sbjct: 549 DKEELQREKEEKEALKEQKRLRKLKKQEEKKKK---ELEKLEKAK 590
Score = 35.0 bits (81), Expect = 0.18
Identities = 13/37 (35%), Positives = 26/37 (70%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE 753
++E +++K+ +K KK+++KKKK+ +K +K K E
Sbjct: 559 EKEALKEQKRLRKLKKQEEKKKKELEKLEKAKIPPAE 595
Score = 34.2 bits (79), Expect = 0.36
Identities = 14/37 (37%), Positives = 25/37 (67%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
E+E K++K+ +K KK+++KKKK+ +K +K E
Sbjct: 559 EKEALKEQKRLRKLKKQEEKKKKELEKLEKAKIPPAE 595
Score = 33.8 bits (78), Expect = 0.47
Identities = 14/39 (35%), Positives = 29/39 (74%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE 753
D +EE +++K++K+ K++K+ +K KK+++KKK E+
Sbjct: 547 LDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEK 585
Score = 32.3 bits (74), Expect = 1.4
Identities = 13/48 (27%), Positives = 27/48 (56%), Gaps = 4/48 (8%)
Query: 809 TTEVGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
+ D ++ + EK+ K++K+ +K KK+++KKKK ++K
Sbjct: 542 PSVWKLDDKEELQREKE----EKEALKEQKRLRKLKKQEEKKKKELEK 585
Score = 31.5 bits (72), Expect = 2.2
Identities = 11/44 (25%), Positives = 30/44 (68%)
Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS 764
++K++ +++K++K+ K++K+ +K KK+ +++ + KL A
Sbjct: 548 DDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAKI 591
Score = 30.0 bits (68), Expect = 6.1
Identities = 10/38 (26%), Positives = 24/38 (63%)
Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLR 760
K K++ +++K++K+ K++K+ +K +EE +K
Sbjct: 546 KLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKEL 583
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 38.8 bits (91), Expect = 0.011
Identities = 13/40 (32%), Positives = 22/40 (55%)
Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
E KK +KK++K +KK +K++ +K KK + K
Sbjct: 408 PAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAK 447
Score = 38.4 bits (90), Expect = 0.019
Identities = 12/38 (31%), Positives = 21/38 (55%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE 753
E+KK +KK++K +KK +K++ +K K E
Sbjct: 404 GNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEA 441
Score = 37.2 bits (87), Expect = 0.043
Identities = 13/33 (39%), Positives = 17/33 (51%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
K E+EE +K KKK + KK K + KK
Sbjct: 424 KAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKV 456
Score = 36.1 bits (84), Expect = 0.085
Identities = 16/65 (24%), Positives = 27/65 (41%)
Query: 714 YPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEV 773
P E EE++ + ++KK +KK++K +KK EE + A K
Sbjct: 390 KPLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGP 449
Query: 774 PCEPQ 778
E +
Sbjct: 450 DGETK 454
Score = 34.1 bits (79), Expect = 0.39
Identities = 10/25 (40%), Positives = 19/25 (76%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKK 852
E+KK +KK++K +KK +K++ +K
Sbjct: 408 PAERKKLRKKQRKAEKKAEKEEAEK 432
Score = 33.8 bits (78), Expect = 0.40
Identities = 14/47 (29%), Positives = 24/47 (51%), Gaps = 7/47 (14%)
Query: 811 EVGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKG 857
E G L P +R K +KK++K +KK +K++ +K K +
Sbjct: 402 ENGNLSPAER-------KKLRKKQRKAEKKAEKEEAEKAAAKKKAEA 441
Score = 31.8 bits (73), Expect = 1.7
Identities = 12/31 (38%), Positives = 14/31 (45%)
Query: 830 EKKKKKKKKKKKKKKKKKKKKKKMMKKGYPD 860
E +K KKK + KK K KK PD
Sbjct: 429 EAEKAAAKKKAEAAAKKAKGPDGETKKVDPD 459
Score = 30.7 bits (70), Expect = 3.9
Identities = 11/36 (30%), Positives = 19/36 (52%)
Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
K +KK +K++ +K KKK + KK + +K
Sbjct: 420 KAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKK 455
>gnl|CDD|221654 pfam12589, WBS_methylT, Methyltransferase involved in
Williams-Beuren syndrome. This domain family is found
in eukaryotes, and is typically between 72 and 83 amino
acids in length. The family is found in association with
pfam08241. This family is made up of
S-adenosylmethionine-dependent methyltransferases. The
proteins are deleted in Williams-Beuren syndrome (WBS),
a complex developmental disorder with multisystemic
manifestations including supravalvular aortic stenosis
(SVAS) and a specific cognitive phenotype.
Length = 85
Score = 35.4 bits (82), Expect = 0.013
Identities = 17/52 (32%), Positives = 30/52 (57%), Gaps = 3/52 (5%)
Query: 809 TTEVGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKK---KKKKKKMMKKG 857
E D + +++ + + ++KKKKKKK KKK K +KK++M ++G
Sbjct: 17 GEEGEEDDEQIDASKVRRISQRNRRKKKKKKKLKKKSKEWILRKKEQMRRRG 68
Score = 33.0 bits (76), Expect = 0.079
Identities = 14/32 (43%), Positives = 22/32 (68%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKK 747
D E+ + K ++ ++ ++KKKKKKK KKK K
Sbjct: 23 DDEQIDASKVRRISQRNRRKKKKKKKLKKKSK 54
Score = 32.7 bits (75), Expect = 0.11
Identities = 13/34 (38%), Positives = 23/34 (67%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
+ +E+ + K ++ ++ ++KKKKKKK KKK K
Sbjct: 21 EEDDEQIDASKVRRISQRNRRKKKKKKKLKKKSK 54
Score = 31.1 bits (71), Expect = 0.38
Identities = 15/50 (30%), Positives = 25/50 (50%), Gaps = 4/50 (8%)
Query: 703 TTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKK----KKKKKKKKK 748
QI ++ ++KKKKKKK KKK K +KK++ +++
Sbjct: 18 EEGEEDDEQIDASKVRRISQRNRRKKKKKKKLKKKSKEWILRKKEQMRRR 67
Score = 28.0 bits (63), Expect = 5.0
Identities = 14/35 (40%), Positives = 24/35 (68%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYS 750
+ EE++E+ K ++ ++ ++KKKKKKK KK S
Sbjct: 19 EGEEDDEQIDASKVRRISQRNRRKKKKKKKLKKKS 53
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 38.9 bits (91), Expect = 0.014
Identities = 37/150 (24%), Positives = 53/150 (35%), Gaps = 25/150 (16%)
Query: 716 DKEEEEEKKKKKKKKKK-----------KKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS 764
D+EE EE KK ++K KK KK+ K + K+KK S+EEE + +A
Sbjct: 408 DEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKV 467
Query: 765 IQCGPVKEVPCEPQI--APCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFN 822
+ E + +P K + +
Sbjct: 468 EKVANKLLKRSEKAQKEEEEEELDEENPWLKTT-------SSVGKSAK-----KQDSKKK 515
Query: 823 QIAYLDKEKKKKKKKKKKKKKKKKKKKKKK 852
+ LDK K K K KKKKKK+K
Sbjct: 516 SSSKLDKAANKISKAAVKVKKKKKKEKSID 545
Score = 32.0 bits (73), Expect = 1.7
Identities = 12/26 (46%), Positives = 20/26 (76%)
Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKK 748
KK+KKK+K+K++ K + KK+K+K
Sbjct: 627 KKRKKKRKRKRRFLTKIEGVKKEKRK 652
Score = 31.6 bits (72), Expect = 2.3
Identities = 13/32 (40%), Positives = 22/32 (68%)
Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSN 751
++ KKK+K+K++ K + KK+K+K KK N
Sbjct: 627 KKRKKKRKRKRRFLTKIEGVKKEKRKDKKLKN 658
Score = 31.6 bits (72), Expect = 2.5
Identities = 17/43 (39%), Positives = 25/43 (58%)
Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAA 763
EE K K+ KK K KK + KK+K K++ EE ++ +AA
Sbjct: 216 EEAKAKRIKKIKSKKYHRVHKKEKLKEELKEFEELVKADPEAA 258
Score = 30.8 bits (70), Expect = 4.1
Identities = 11/31 (35%), Positives = 22/31 (70%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKK 747
K+ ++++K+K++ K + KK+K+K KK K
Sbjct: 627 KKRKKKRKRKRRFLTKIEGVKKEKRKDKKLK 657
Score = 30.4 bits (69), Expect = 5.9
Identities = 11/30 (36%), Positives = 21/30 (70%)
Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
++ +KK+K+K++ K + KK+K+K KK
Sbjct: 627 KKRKKKRKRKRRFLTKIEGVKKEKRKDKKL 656
Score = 30.0 bits (68), Expect = 7.8
Identities = 11/31 (35%), Positives = 22/31 (70%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
++ ++K+K+K++ K + KK+K+K KK K
Sbjct: 627 KKRKKKRKRKRRFLTKIEGVKKEKRKDKKLK 657
Score = 29.6 bits (67), Expect = 8.8
Identities = 15/34 (44%), Positives = 20/34 (58%)
Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE 753
EE K K+ KK K KK + KK+K K++ EE
Sbjct: 216 EEAKAKRIKKIKSKKYHRVHKKEKLKEELKEFEE 249
>gnl|CDD|225625 COG3083, COG3083, Predicted hydrolase of alkaline phosphatase
superfamily [General function prediction only].
Length = 600
Score = 38.6 bits (90), Expect = 0.018
Identities = 21/94 (22%), Positives = 43/94 (45%), Gaps = 12/94 (12%)
Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
D P F YL+ ++H+ + QA ++++ L ++D +G+V+E
Sbjct: 391 EDNPWFSYLSLNSSHANDDPSSNQAK---------ARPPFKNRYQNALREVDSQIGRVLE 441
Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNW 331
L +L N++++ +D+G FN + W
Sbjct: 442 QLRNSGLLDNTVVIITADHG---EEFNEEEQNYW 472
>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
fragment found in eukaryotic DNA topoisomerase (topo) IB
proteins similar to the monomeric yeast and human topo I
and heterodimeric topo I from Leishmania donvanni. Topo
I enzymes are divided into: topo type IA (bacterial)
and type IB (eukaryotic). Topo I relaxes superhelical
tension in duplex DNA by creating a single-strand nick,
the broken strand can then rotate around the unbroken
strand to remove DNA supercoils and, the nick is
religated, liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit re-ligation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topo I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I play putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 215
Score = 37.2 bits (87), Expect = 0.018
Identities = 16/39 (41%), Positives = 24/39 (61%), Gaps = 1/39 (2%)
Query: 821 FNQI-AYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGY 858
F I Y ++EK+KKK K++KK K++K+K GY
Sbjct: 87 FTPIYQYFEEEKEKKKAMSKEEKKAIKEEKEKLEEPYGY 125
Score = 33.4 bits (77), Expect = 0.31
Identities = 13/29 (44%), Positives = 22/29 (75%)
Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
EE+K+KKK K++KK K++K+K ++ Y
Sbjct: 95 EEEKEKKKAMSKEEKKAIKEEKEKLEEPY 123
Score = 31.1 bits (71), Expect = 1.7
Identities = 13/31 (41%), Positives = 22/31 (70%)
Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
EEE++KKK K++KK K++K+K ++ Y
Sbjct: 95 EEEKEKKKAMSKEEKKAIKEEKEKLEEPYGY 125
>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
membranes [Cell envelope biogenesis, outer membrane].
Length = 244
Score = 37.5 bits (87), Expect = 0.018
Identities = 18/52 (34%), Positives = 23/52 (44%), Gaps = 1/52 (1%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQ 766
P ++ + EKK KK K K K K K K K K + K + AA Q
Sbjct: 84 PKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPSK-TAAKAPAAPNQ 134
Score = 37.1 bits (86), Expect = 0.029
Identities = 15/35 (42%), Positives = 19/35 (54%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
P+ + K+K K +KK KK K K K K K K K
Sbjct: 77 PEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKV 111
Score = 36.7 bits (85), Expect = 0.031
Identities = 14/39 (35%), Positives = 19/39 (48%)
Query: 710 NQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
+ P + + K+K K +KK KK K K K K K K
Sbjct: 71 TEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKP 109
Score = 36.3 bits (84), Expect = 0.053
Identities = 16/41 (39%), Positives = 21/41 (51%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
P + +EK K +KK KK K K K K K K K K + +
Sbjct: 79 PTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKK 119
Score = 35.9 bits (83), Expect = 0.060
Identities = 14/35 (40%), Positives = 18/35 (51%)
Query: 714 YPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
P E++ +K K K K K K K K K + K KK
Sbjct: 87 KPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPP 121
Score = 33.6 bits (77), Expect = 0.34
Identities = 14/40 (35%), Positives = 17/40 (42%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
E K K+K K +KK KK K K K K K +
Sbjct: 74 ETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKP 113
Score = 31.7 bits (72), Expect = 1.5
Identities = 13/34 (38%), Positives = 16/34 (47%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
P E E + K K+K K +KK KK K K
Sbjct: 67 PKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPK 100
Score = 30.5 bits (69), Expect = 3.2
Identities = 14/24 (58%), Positives = 15/24 (62%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKK 852
KEK K +KK KK K K K K K K
Sbjct: 85 KEKPKPEKKPKKPKPKPKPKPKPK 108
Score = 30.1 bits (68), Expect = 4.8
Identities = 14/34 (41%), Positives = 17/34 (50%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
P E E + K K+K K +KK KK K K K
Sbjct: 69 PPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPK 102
Score = 29.8 bits (67), Expect = 6.1
Identities = 13/24 (54%), Positives = 13/24 (54%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKK 852
EKK KK K K K K K K K K
Sbjct: 90 PEKKPKKPKPKPKPKPKPKPKVKP 113
Score = 29.8 bits (67), Expect = 6.6
Identities = 13/24 (54%), Positives = 14/24 (58%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKK 852
K K +KK KK K K K K K K K
Sbjct: 87 KPKPEKKPKKPKPKPKPKPKPKPK 110
Score = 29.4 bits (66), Expect = 6.9
Identities = 13/34 (38%), Positives = 17/34 (50%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
P + E + K K+K K +KK KK K K K
Sbjct: 70 PTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKP 103
Score = 29.4 bits (66), Expect = 7.5
Identities = 12/28 (42%), Positives = 16/28 (57%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
++ K +KK KK K K K K K K +K
Sbjct: 86 EKPKPEKKPKKPKPKPKPKPKPKPKVKP 113
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 36.7 bits (85), Expect = 0.020
Identities = 9/43 (20%), Positives = 23/43 (53%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
++ +E +++ + K + K+KKK++ K K + + +K
Sbjct: 103 EESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKK 145
Score = 34.7 bits (80), Expect = 0.080
Identities = 9/39 (23%), Positives = 22/39 (56%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
++E +E +++ + K + K+KKK++ K K + +
Sbjct: 102 EEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPK 140
Score = 34.7 bits (80), Expect = 0.084
Identities = 12/42 (28%), Positives = 22/42 (52%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
+E E++ + K + K+KKK++ K K +K+ E K
Sbjct: 106 TDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPK 147
Score = 34.0 bits (78), Expect = 0.16
Identities = 10/39 (25%), Positives = 22/39 (56%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE 753
+ EEE + +++ + K + K+KKK++ K + +E
Sbjct: 99 EETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKE 137
Score = 33.6 bits (77), Expect = 0.22
Identities = 12/54 (22%), Positives = 24/54 (44%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGP 769
++E EEE + +++ + K + K+KKK++ E+ K P
Sbjct: 98 EEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSKP 151
Score = 33.2 bits (76), Expect = 0.27
Identities = 15/36 (41%), Positives = 23/36 (63%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYS 750
P + + E K+KKK++ K K +K+K K + KK K S
Sbjct: 114 PPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPS 149
Score = 32.8 bits (75), Expect = 0.32
Identities = 12/34 (35%), Positives = 22/34 (64%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
+ E+E+ + K + K+KKK++ K K +K+K K
Sbjct: 107 DETEQEDPPETKTESKEKKKREVPKPKTEKEKPK 140
Score = 32.4 bits (74), Expect = 0.48
Identities = 13/39 (33%), Positives = 20/39 (51%)
Query: 710 NQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
E +E+KK++ K K +K+K K + KK K K
Sbjct: 112 EDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSK 150
Score = 30.1 bits (68), Expect = 2.8
Identities = 13/33 (39%), Positives = 20/33 (60%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
KE+++ + K K +K+K K + KK K K K Y
Sbjct: 122 KEKKKREVPKPKTEKEKPKTEPKKPKPSKPKLY 154
Score = 29.7 bits (67), Expect = 3.3
Identities = 11/40 (27%), Positives = 25/40 (62%)
Query: 709 FNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
F +++ D EEEE +++ + +++ + K + K+KKK+
Sbjct: 88 FRELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKR 127
>gnl|CDD|177779 PLN00185, PLN00185, 60S ribosomal protein L4-1; Provisional.
Length = 405
Score = 37.9 bits (88), Expect = 0.020
Identities = 22/83 (26%), Positives = 35/83 (42%), Gaps = 5/83 (6%)
Query: 679 DIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKK 738
D+K P +KN L + + ++N Y R +A E K KK+K KK+K+
Sbjct: 313 DVKRAPLKKNPLKNLNAMLKLNPYAKTARRMALLA-----EAARVKAKKEKLAKKRKQVS 367
Query: 739 KKKKKKKKKKYSNEEEGMRKLRD 761
K++ K + M D
Sbjct: 368 KEEAAAIKAAGKAWYKTMISDSD 390
Score = 34.0 bits (78), Expect = 0.40
Identities = 22/81 (27%), Positives = 39/81 (48%), Gaps = 8/81 (9%)
Query: 785 DIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKK--KKKKKK 842
D+K P +KN L + + + ++N Y +R +A + K KK+K KK+K+
Sbjct: 313 DVKRAPLKKNPLKNLNAMLKLNPYAKT------ARRMALLAEAARVKAKKEKLAKKRKQV 366
Query: 843 KKKKKKKKKKMMKKGYPDVLS 863
K++ K K Y ++S
Sbjct: 367 SKEEAAAIKAAGKAWYKTMIS 387
>gnl|CDD|224441 COG1524, COG1524, Uncharacterized proteins of the AP superfamily
[General function prediction only].
Length = 450
Score = 37.9 bits (88), Expect = 0.021
Identities = 46/303 (15%), Positives = 99/303 (32%), Gaps = 52/303 (17%)
Query: 34 IMAFAVLPLAF----TLSMVFVDLVASSGPPHIIFILADDLG---WNDVGFHGLDQIPTP 86
+ F++L L F + + ++ I D L + P
Sbjct: 9 VFLFSILDLFFAADVPILSQSYQAATPAPKKKLVLISIDGLRADVLDRKAGI------LP 62
Query: 87 NIDALAYSG----IILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLY----GCERGGL 138
+ +LA +G ++ + T P + ++TG +P G+ N+LY G
Sbjct: 63 FLSSLAENGVHVAELISVFPTTTR--PRHTTLITGSYPDEHGIVGNILYDPETGDSVLQF 120
Query: 139 PLSEKILPQYLKELGYRTRI-VGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAE 197
L + + + K L Y
Sbjct: 121 LLDNPTILFPGDDEAGMDVAAPFESLTVSDKLRVAVDLVWDVPILHY----LHIGGPDHI 176
Query: 198 EM--KMWGLDMRRDLEPAWDLHGKYSTDVF-TAEAVDIIHNHSTDEPLFLYLAHAATHSA 254
M + D +++P +D ++ D ++V ++ D L +YL +
Sbjct: 177 TMRRFLI--DEDDNIKPGYDYISEHFLDSLLFLDSVLLLDRADPD-LLLVYLPN------ 227
Query: 255 NPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVS 314
+ A H ++A + ++D +G+++E L++R + +++ S
Sbjct: 228 -----IDAIGHKYGPDSP-------EYAEAVREVDSLLGELLELLKKRGLYEEYLVIITS 275
Query: 315 DNG 317
D+G
Sbjct: 276 DHG 278
>gnl|CDD|221569 pfam12420, DUF3671, Protein of unknown function. This domain
family is found in eukaryotes, and is typically between
96 and 116 amino acids in length.
Length = 106
Score = 35.4 bits (82), Expect = 0.021
Identities = 14/33 (42%), Positives = 17/33 (51%)
Query: 821 FNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKM 853
F++I Y+DK KK KKK KK KK
Sbjct: 17 FDKIDYIDKLKKDPNIDKKKFKKIIFKKYGLIF 49
>gnl|CDD|193409 pfam12936, Kri1_C, KRI1-like family C-terminal. The yeast member
of this family (Kri1p) is found to be required for 40S
ribosome biogenesis in the nucleolus. This is the
C-terminal domain of the family.
Length = 93
Score = 34.9 bits (81), Expect = 0.025
Identities = 13/27 (48%), Positives = 19/27 (70%)
Query: 825 AYLDKEKKKKKKKKKKKKKKKKKKKKK 851
Y DKE+++K KKK KK + ++ KKK
Sbjct: 65 PYRDKEERRKDKKKYGKKARLREWKKK 91
Score = 29.9 bits (68), Expect = 1.6
Identities = 10/26 (38%), Positives = 18/26 (69%)
Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKK 745
++E+++K KKK KK + ++ KKK
Sbjct: 68 DKEERRKDKKKYGKKARLREWKKKVF 93
Score = 29.5 bits (67), Expect = 1.7
Identities = 11/26 (42%), Positives = 18/26 (69%)
Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKK 744
++EE++K KKK KK + ++ KKK
Sbjct: 68 DKEERRKDKKKYGKKARLREWKKKVF 93
Score = 29.5 bits (67), Expect = 2.0
Identities = 11/26 (42%), Positives = 17/26 (65%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKK 743
++EE +K KKK KK + ++ KKK
Sbjct: 68 DKEERRKDKKKYGKKARLREWKKKVF 93
Score = 29.1 bits (66), Expect = 2.7
Identities = 13/29 (44%), Positives = 17/29 (58%)
Query: 713 AYPDKEEEEEKKKKKKKKKKKKKKKKKKK 741
Y DKEE + KKK KK + ++ KKK
Sbjct: 65 PYRDKEERRKDKKKYGKKARLREWKKKVF 93
Score = 28.7 bits (65), Expect = 3.8
Identities = 10/25 (40%), Positives = 18/25 (72%)
Query: 725 KKKKKKKKKKKKKKKKKKKKKKKKY 749
K++++K KKK KK + ++ KKK +
Sbjct: 69 KEERRKDKKKYGKKARLREWKKKVF 93
Score = 28.7 bits (65), Expect = 3.9
Identities = 10/26 (38%), Positives = 18/26 (69%)
Query: 722 EKKKKKKKKKKKKKKKKKKKKKKKKK 747
+K++++K KKK KK + ++ KKK
Sbjct: 68 DKEERRKDKKKYGKKARLREWKKKVF 93
Score = 28.3 bits (64), Expect = 5.4
Identities = 9/26 (34%), Positives = 18/26 (69%)
Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKK 746
++++++K KKK KK + ++ KKK
Sbjct: 68 DKEERRKDKKKYGKKARLREWKKKVF 93
Score = 27.9 bits (63), Expect = 6.8
Identities = 10/25 (40%), Positives = 17/25 (68%)
Query: 724 KKKKKKKKKKKKKKKKKKKKKKKKK 748
K++++K KKK KK + ++ KKK
Sbjct: 69 KEERRKDKKKYGKKARLREWKKKVF 93
>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
Length = 859
Score = 37.9 bits (89), Expect = 0.027
Identities = 9/34 (26%), Positives = 14/34 (41%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
+K E+ K KK K +K K++Y
Sbjct: 816 EKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQY 849
Score = 37.1 bits (87), Expect = 0.053
Identities = 8/32 (25%), Positives = 11/32 (34%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
E+ EK + K KK K +K
Sbjct: 814 LAEKPEKLRYLADAPAKDPAGKKAAVKFSRKT 845
Score = 36.7 bits (86), Expect = 0.063
Identities = 8/32 (25%), Positives = 13/32 (40%)
Query: 825 AYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
YL K KK K +K K++ + +
Sbjct: 822 RYLADAPAKDPAGKKAAVKFSRKTKQQYVASE 853
Score = 35.9 bits (84), Expect = 0.11
Identities = 8/33 (24%), Positives = 15/33 (45%)
Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
K KK K +K K++ ++E++G
Sbjct: 824 LADAPAKDPAGKKAAVKFSRKTKQQYVASEKDG 856
Score = 35.5 bits (83), Expect = 0.13
Identities = 7/26 (26%), Positives = 11/26 (42%)
Query: 722 EKKKKKKKKKKKKKKKKKKKKKKKKK 747
KK K +K K++ +K K
Sbjct: 833 AGKKAAVKFSRKTKQQYVASEKDGKA 858
Score = 35.5 bits (83), Expect = 0.14
Identities = 8/36 (22%), Positives = 14/36 (38%)
Query: 713 AYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
Y ++ KK K +K K++ +K K
Sbjct: 822 RYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKDGK 857
Score = 35.5 bits (83), Expect = 0.14
Identities = 7/26 (26%), Positives = 11/26 (42%)
Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKK 748
KK K +K K++ +K K
Sbjct: 833 AGKKAAVKFSRKTKQQYVASEKDGKA 858
Score = 34.8 bits (81), Expect = 0.22
Identities = 8/34 (23%), Positives = 12/34 (35%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
P+K K KK K +K K++
Sbjct: 818 PEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVA 851
Score = 34.8 bits (81), Expect = 0.23
Identities = 8/31 (25%), Positives = 14/31 (45%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKK 745
P K+ +K K +K K++ +K K
Sbjct: 828 PAKDPAGKKAAVKFSRKTKQQYVASEKDGKA 858
Score = 34.4 bits (80), Expect = 0.30
Identities = 7/30 (23%), Positives = 13/30 (43%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKK 746
++ KK K +K K++ +K K
Sbjct: 829 AKDPAGKKAAVKFSRKTKQQYVASEKDGKA 858
Score = 34.4 bits (80), Expect = 0.32
Identities = 7/37 (18%), Positives = 14/37 (37%)
Query: 712 IAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
+ ++ E+ + K KK K +K K+
Sbjct: 811 LELLAEKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQ 847
Score = 34.4 bits (80), Expect = 0.36
Identities = 8/29 (27%), Positives = 12/29 (41%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
D K KK K +K K++ +K
Sbjct: 826 DAPAKDPAGKKAAVKFSRKTKQQYVASEK 854
Score = 34.0 bits (79), Expect = 0.40
Identities = 7/25 (28%), Positives = 11/25 (44%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKK 852
D KK K +K K++ +K
Sbjct: 831 DPAGKKAAVKFSRKTKQQYVASEKD 855
Score = 32.5 bits (75), Expect = 1.3
Identities = 8/27 (29%), Positives = 11/27 (40%)
Query: 831 KKKKKKKKKKKKKKKKKKKKKKMMKKG 857
K KK K +K K++ K G
Sbjct: 830 KDPAGKKAAVKFSRKTKQQYVASEKDG 856
Score = 32.5 bits (75), Expect = 1.3
Identities = 9/38 (23%), Positives = 14/38 (36%)
Query: 815 LDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKK 852
K R+ A KK K +K K++ +K
Sbjct: 817 KPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEK 854
Score = 32.1 bits (74), Expect = 1.5
Identities = 7/26 (26%), Positives = 10/26 (38%)
Query: 725 KKKKKKKKKKKKKKKKKKKKKKKKYS 750
K +K + K KK K+S
Sbjct: 817 KPEKLRYLADAPAKDPAGKKAAVKFS 842
Score = 31.7 bits (73), Expect = 2.2
Identities = 8/34 (23%), Positives = 12/34 (35%), Gaps = 1/34 (2%)
Query: 824 IAYLD-KEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
+ L K +K + K KK K +K
Sbjct: 811 LELLAEKPEKLRYLADAPAKDPAGKKAAVKFSRK 844
Score = 31.3 bits (72), Expect = 2.6
Identities = 7/28 (25%), Positives = 12/28 (42%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
K+ KK K +K K++ + K
Sbjct: 830 KDPAGKKAAVKFSRKTKQQYVASEKDGK 857
Score = 30.5 bits (70), Expect = 5.2
Identities = 6/25 (24%), Positives = 10/25 (40%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKK 852
KK K +K K++ +K
Sbjct: 832 PAGKKAAVKFSRKTKQQYVASEKDG 856
Score = 30.5 bits (70), Expect = 5.6
Identities = 6/23 (26%), Positives = 10/23 (43%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKK 851
K K +K K++ +K K
Sbjct: 836 KAAVKFSRKTKQQYVASEKDGKA 858
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 37.2 bits (87), Expect = 0.029
Identities = 14/43 (32%), Positives = 29/43 (67%), Gaps = 3/43 (6%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
++ +EE ++KK++KKK++++ K K ++++K EE RK
Sbjct: 278 EERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKL---EEKERK 317
Score = 37.2 bits (87), Expect = 0.030
Identities = 14/37 (37%), Positives = 26/37 (70%), Gaps = 6/37 (16%)
Query: 718 EEEEEKKKKKKKKKKKKKKKK------KKKKKKKKKK 748
EE +EKK++KKK++++ K K +K ++K++KK
Sbjct: 282 EEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKK 318
Score = 36.8 bits (86), Expect = 0.042
Identities = 13/38 (34%), Positives = 27/38 (71%), Gaps = 6/38 (15%)
Query: 717 KEEEEEKKKKKKKKKKKK------KKKKKKKKKKKKKK 748
+E++EEKKK++++ K K +K ++K++KK+ +K
Sbjct: 285 QEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQARK 322
Score = 36.5 bits (85), Expect = 0.054
Identities = 18/48 (37%), Positives = 32/48 (66%), Gaps = 7/48 (14%)
Query: 717 KEEEEEKKKKKKKKK-----KKKKKKKKKKKKKKKKKYSNEEEGMRKL 759
+EEEE+ K ++++ +KK++KKK++++ K K S EE+ RKL
Sbjct: 266 EEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQ--RKL 311
Score = 34.9 bits (81), Expect = 0.15
Identities = 11/38 (28%), Positives = 24/38 (63%), Gaps = 10/38 (26%)
Query: 829 KEKKKKKKKKKKKK----------KKKKKKKKKKMMKK 856
+EKK++KKK++++ +K ++K++KK +K
Sbjct: 285 QEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQARK 322
Score = 33.0 bits (76), Expect = 0.57
Identities = 15/44 (34%), Positives = 30/44 (68%), Gaps = 2/44 (4%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKL 759
DK EEE++K K +++++++ ++KK++KKK+ E + KL
Sbjct: 262 DKTREEEEEKILKAAEEERQEEAQEKKEEKKKE--EREAKLAKL 303
>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
eukaryotic RNase H complex which cleaves RNA-DNA
hybrids. Ribonuclease H2B is one of the three proteins
of eukaryotic RNase H2 complex that is required for
nucleic acid binding and hydrolysis. RNase H is
classified into two families, type I (prokaryotic RNase
HI, eukaryotic RNase H1 and viral RNase H) and type II
(prokaryotic RNase HII and HIII, and eukaryotic RNase
H2/HII). RNase H endonucleolytically hydrolyzes an RNA
strand when it is annealed to a complementary DNA strand
in the presence of divalent cations, in DNA replication
and repair. The enzyme can be found in bacteria,
archaea, and eukaryotes. Most prokaryotic and eukaryotic
genomes contain multiple RNase H genes. Despite a lack
of evidence for homology from sequence comparisons, type
I and type II RNase H share a common fold and similar
steric configurations of the four acidic active-site
residues, suggesting identical or very similar catalytic
mechanisms. Eukaryotic RNase HII is active during
replication and is believed to play a role in removal of
Okazaki fragment primers and single ribonucleotides in
DNA-DNA duplexes. Eukaryotic RNase HII is functional
when it forms a complex with RNase H2B and RNase H2C
proteins. It is speculated that the two accessory
subunits are required for correct folding of the
catalytic subunit of RNase HII. Mutations in the three
subunits of human RNase HII cause neurological disorder.
Length = 211
Score = 36.5 bits (85), Expect = 0.029
Identities = 19/44 (43%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKL 759
+ E +KKK +++KKKKKKK KKKK KK + GM+ +
Sbjct: 165 EAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVA--AVGMKAI 206
Score = 34.6 bits (80), Expect = 0.15
Identities = 16/35 (45%), Positives = 18/35 (51%)
Query: 711 QIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKK 745
+ EEE+KKKKKK KKKK KK K
Sbjct: 171 DKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGMKA 205
Score = 34.6 bits (80), Expect = 0.16
Identities = 16/28 (57%), Positives = 18/28 (64%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
E++KKKKKKK KKKK KK MK
Sbjct: 178 AEEEKKKKKKKSAKKKKLKKVAAVGMKA 205
Score = 32.7 bits (75), Expect = 0.61
Identities = 16/36 (44%), Positives = 20/36 (55%)
Query: 713 AYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
A K+ +++KKKKKKK KKKK KK K
Sbjct: 169 ARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGMK 204
Score = 30.7 bits (70), Expect = 2.2
Identities = 16/37 (43%), Positives = 21/37 (56%), Gaps = 1/37 (2%)
Query: 817 PKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKM 853
K++ N A +K+KKKKK KKKK KK K +
Sbjct: 171 DKKKSNN-AEEEKKKKKKKSAKKKKLKKVAAVGMKAI 206
Score = 29.6 bits (67), Expect = 6.7
Identities = 14/36 (38%), Positives = 20/36 (55%)
Query: 708 RFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKK 743
R + + +EE+++KKKK KKKK KK K
Sbjct: 170 RDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGMKA 205
>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
Length = 1094
Score = 37.5 bits (87), Expect = 0.037
Identities = 20/49 (40%), Positives = 29/49 (59%)
Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVK 771
+KKKKKKK K+KKKKK ++ K+ N EEG+ + S + +K
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQIHALENIEEGIEGSNNEKSSETPQLK 732
Score = 29.8 bits (67), Expect = 7.7
Identities = 14/22 (63%), Positives = 18/22 (81%)
Query: 831 KKKKKKKKKKKKKKKKKKKKKK 852
+KKKKKKK K+KKKKK ++ K
Sbjct: 684 RKKKKKKKSKEKKKKKNREASK 705
Score = 29.8 bits (67), Expect = 8.6
Identities = 17/40 (42%), Positives = 26/40 (65%), Gaps = 2/40 (5%)
Query: 834 KKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANIN 873
+KKKKKKK K+KKKKK ++ K+ + L +E+ + N
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQIH--ALENIEEGIEGSN 721
Score = 29.5 bits (66), Expect = 9.6
Identities = 14/23 (60%), Positives = 19/23 (82%)
Query: 831 KKKKKKKKKKKKKKKKKKKKKKM 853
KKKKKKK K+KKKKK ++ K++
Sbjct: 685 KKKKKKKSKEKKKKKNREASKQI 707
>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
homolog [Translation, ribosomal structure and
biogenesis].
Length = 395
Score = 37.0 bits (86), Expect = 0.039
Identities = 13/42 (30%), Positives = 26/42 (61%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
KE+ + K K ++ KK++ + ++KKK+KK +E G++
Sbjct: 354 KEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQN 395
Score = 31.2 bits (71), Expect = 2.9
Identities = 11/55 (20%), Positives = 27/55 (49%), Gaps = 6/55 (10%)
Query: 804 RINHYTTEVG----YLDPKQRFNQI--AYLDKEKKKKKKKKKKKKKKKKKKKKKK 852
RI+ ++ E + ++R ++ K K ++ KK++ + ++KKK+
Sbjct: 329 RIDAFSGEPDGISLREELEKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKE 383
Score = 30.1 bits (68), Expect = 5.7
Identities = 9/28 (32%), Positives = 17/28 (60%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
E+ KK++ + ++KKK+KK K +
Sbjct: 366 PERDKKERPGRYRRKKKEKKAKSERRGL 393
Score = 30.1 bits (68), Expect = 6.5
Identities = 8/25 (32%), Positives = 18/25 (72%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKK 852
DK+++ + ++KKK+KK K +++
Sbjct: 369 DKKERPGRYRRKKKEKKAKSERRGL 393
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 37.0 bits (86), Expect = 0.041
Identities = 14/36 (38%), Positives = 22/36 (61%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYS 750
K E+++KKKK+KK+ K + + K K KK K +
Sbjct: 67 KKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKT 102
Score = 37.0 bits (86), Expect = 0.047
Identities = 18/53 (33%), Positives = 25/53 (47%), Gaps = 2/53 (3%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE--EGMRKLRDAASI 765
KE++E K + + K K KK KK KKK K NE+ K+ + A
Sbjct: 76 KKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNKIAELAEK 128
Score = 35.5 bits (82), Expect = 0.13
Identities = 22/52 (42%), Positives = 25/52 (48%), Gaps = 8/52 (15%)
Query: 716 DKEEEEEKKKKKKKKKKKK--------KKKKKKKKKKKKKKYSNEEEGMRKL 759
EEE K KK K KKK +KKKKKKK+K + EG KL
Sbjct: 40 STFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKL 91
Score = 32.4 bits (74), Expect = 1.3
Identities = 16/38 (42%), Positives = 26/38 (68%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
K+++E+K + KKK +KKKKKKK+KK+ K + +
Sbjct: 56 KDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGF 93
Score = 32.0 bits (73), Expect = 1.6
Identities = 16/31 (51%), Positives = 21/31 (67%)
Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKKMMKKG 857
+ E KKK +KKKKKKK+KK+ K + K G
Sbjct: 62 KNNESKKKSEKKKKKKKEKKEPKSEGETKLG 92
Score = 30.8 bits (70), Expect = 3.7
Identities = 14/40 (35%), Positives = 24/40 (60%)
Query: 714 YPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE 753
++E++ + KKK +KKKKKKK+KK+ K + +
Sbjct: 54 TKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGF 93
Score = 29.7 bits (67), Expect = 8.4
Identities = 12/39 (30%), Positives = 17/39 (43%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEK 867
K+K+KK+ K + + K K KK K P E
Sbjct: 76 KKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNED 114
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 36.7 bits (85), Expect = 0.043
Identities = 26/162 (16%), Positives = 58/162 (35%), Gaps = 19/162 (11%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVP 774
+ E+ ++K+ +++ +K K+ ++ K+ + ++ K + AA + E
Sbjct: 82 KQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAK--AKAEAE 139
Query: 775 CEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKK 834
E + + K E+ +E + K+ + E K K
Sbjct: 140 AEKKAKE---EAKKQAEEEAKAKAAAEAK--------------KKAAEAKKKAEAEAKAK 182
Query: 835 KKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINRTA 876
+ K K K ++ K K + K + ++ E E A
Sbjct: 183 AEAKAKAKAEEAKAKAEAAKAKAAAEAAAKAEAEAAAAAAAE 224
Score = 36.0 bits (83), Expect = 0.071
Identities = 18/53 (33%), Positives = 26/53 (49%), Gaps = 1/53 (1%)
Query: 711 QIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAA 763
Q K+ EE K K+ + K K + + +KK K++ KK EEE K A
Sbjct: 113 QAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKK-QAEEEAKAKAAAEA 164
Score = 32.9 bits (75), Expect = 0.79
Identities = 13/54 (24%), Positives = 26/54 (48%)
Query: 710 NQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAA 763
Q ++ E K K + + +KK K++ KK+ +++ K E +K +A
Sbjct: 119 KQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAK 172
Score = 32.1 bits (73), Expect = 1.5
Identities = 14/41 (34%), Positives = 24/41 (58%)
Query: 713 AYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE 753
A +K+ +EE KK+ +++ K K + KKK + KK + E
Sbjct: 138 AEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAE 178
Score = 30.2 bits (68), Expect = 4.7
Identities = 22/138 (15%), Positives = 47/138 (34%), Gaps = 37/138 (26%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPC 775
EE E+++ ++ ++K+ +++ +K K+ E K + Q K
Sbjct: 76 QAEEAEKQRAAEQARQKELEQRAAAEKAAKQ------AEQAAKQAEEKQKQAEEAKAKQA 129
Query: 776 EPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKK 835
A ++E + +++ + A E++ K
Sbjct: 130 AEAKA------------------KAEAEA-------------EKKAKEEAKKQAEEEAKA 158
Query: 836 KKKKKKKKKKKKKKKKKM 853
K + KKK + KKK
Sbjct: 159 KAAAEAKKKAAEAKKKAE 176
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 37.1 bits (87), Expect = 0.044
Identities = 35/132 (26%), Positives = 51/132 (38%), Gaps = 37/132 (28%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMR-KLRDAASIQCGPVKEVP 774
E E +K+ K +KK+KKKKK+K+K+++ K G K Q G V +P
Sbjct: 606 AHELIEARKRLNKANEKKEKKKKKQKEKQEELK-----VGDEVKYLSLG--QKGEVLSIP 658
Query: 775 CEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKK 834
D E + G + K + + + K KKKK
Sbjct: 659 -----------------------DDKEAI------VQAGIMKMKVPLSDLEKIQKPKKKK 689
Query: 835 KKKKKKKKKKKK 846
KKK K K K +
Sbjct: 690 KKKPKTVKPKPR 701
Score = 36.0 bits (84), Expect = 0.12
Identities = 26/135 (19%), Positives = 49/135 (36%), Gaps = 38/135 (28%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCEP 777
EE E K+ +K K++ ++KK+K ++++ K E+ ++ A
Sbjct: 537 EEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEA-------------- 582
Query: 778 QIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKKK 837
+ E I ++ E +K+ K
Sbjct: 583 ---------------------KKEADEIIKELRQLQKGGYASVKAHEL---IEARKRLNK 618
Query: 838 KKKKKKKKKKKKKKK 852
+KK+KKKKK+K+K
Sbjct: 619 ANEKKEKKKKKQKEK 633
>gnl|CDD|185507 PTZ00191, PTZ00191, 60S ribosomal protein L23a; Provisional.
Length = 145
Score = 35.4 bits (82), Expect = 0.045
Identities = 14/29 (48%), Positives = 17/29 (58%)
Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSN 751
KK KKK KK K KK K KK+K ++
Sbjct: 4 AKKAKKKAKKAAKAAKKGVKVKKRKVRTS 32
Score = 34.7 bits (80), Expect = 0.069
Identities = 15/30 (50%), Positives = 17/30 (56%)
Query: 722 EKKKKKKKKKKKKKKKKKKKKKKKKKKYSN 751
KK KKK KK K KK K KK+K + S
Sbjct: 4 AKKAKKKAKKAAKAAKKGVKVKKRKVRTSV 33
Score = 30.4 bits (69), Expect = 1.7
Identities = 13/27 (48%), Positives = 16/27 (59%)
Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKKM 853
K+ KKK KK K KK K KK+K+
Sbjct: 3 PAKKAKKKAKKAAKAAKKGVKVKKRKV 29
Score = 30.4 bits (69), Expect = 2.1
Identities = 14/27 (51%), Positives = 14/27 (51%)
Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKY 749
K KK KKK KK K KK K KK
Sbjct: 1 KAPAKKAKKKAKKAAKAAKKGVKVKKR 27
Score = 30.1 bits (68), Expect = 2.8
Identities = 13/30 (43%), Positives = 14/30 (46%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKGY 858
K KK KKK KK K KK K +K
Sbjct: 1 KAPAKKAKKKAKKAAKAAKKGVKVKKRKVR 30
Score = 29.7 bits (67), Expect = 3.7
Identities = 13/31 (41%), Positives = 17/31 (54%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKK 747
K ++ KKK KK K KK K KK+K +
Sbjct: 1 KAPAKKAKKKAKKAAKAAKKGVKVKKRKVRT 31
Score = 29.7 bits (67), Expect = 3.9
Identities = 11/25 (44%), Positives = 14/25 (56%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKK 852
+KK KK K KK K KK+K +
Sbjct: 6 KAKKKAKKAAKAAKKGVKVKKRKVR 30
>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
Length = 742
Score = 36.7 bits (85), Expect = 0.054
Identities = 19/61 (31%), Positives = 26/61 (42%)
Query: 694 SEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE 753
D +++ Q A P K K K +K KKK K KKKKK S+++
Sbjct: 88 HIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVNKSTNKKKKKVLSSKD 147
Query: 754 E 754
E
Sbjct: 148 E 148
Score = 34.0 bits (78), Expect = 0.41
Identities = 15/37 (40%), Positives = 18/37 (48%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSN 751
P + E+ KKK K KKKKK K + KY N
Sbjct: 118 PKPKTEKLKKKITVNKSTNKKKKKVLSSKDELIKYDN 154
Score = 32.9 bits (75), Expect = 1.0
Identities = 33/148 (22%), Positives = 46/148 (31%), Gaps = 52/148 (35%)
Query: 716 DKEEEEEKKKKKKKKKK-------KKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCG 768
+K+ E+ KK KK K KK K K+KKK KKK ++ + ++ + G
Sbjct: 49 NKKLHEKLDKKNKKFNKTDDLKDSKKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAG 108
Query: 769 PVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLD 828
P+ + P K
Sbjct: 109 PLA------------ISLMRKPKPKTE--------------------------------- 123
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
K KKK K KKKKK K + K
Sbjct: 124 KLKKKITVNKSTNKKKKKVLSSKDELIK 151
Score = 31.3 bits (71), Expect = 2.6
Identities = 14/60 (23%), Positives = 19/60 (31%)
Query: 686 EKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKK 745
K L + +F + + K +K KKK K KKKKK
Sbjct: 83 IKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVNKSTNKKKKKV 142
>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
(DUF2058). This domain, found in various prokaryotic
proteins, has no known function.
Length = 177
Score = 35.3 bits (82), Expect = 0.060
Identities = 12/28 (42%), Positives = 20/28 (71%)
Query: 727 KKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
KKK KK KK+K+K++K+ +K ++E
Sbjct: 14 KKKAKKAKKEKRKQRKQARKGADDGDDE 41
Score = 33.0 bits (76), Expect = 0.33
Identities = 12/30 (40%), Positives = 20/30 (66%)
Query: 725 KKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
KKK KK KK+K+K++K+ +K +E +
Sbjct: 14 KKKAKKAKKEKRKQRKQARKGADDGDDELK 43
Score = 33.0 bits (76), Expect = 0.43
Identities = 11/22 (50%), Positives = 17/22 (77%)
Query: 723 KKKKKKKKKKKKKKKKKKKKKK 744
KKK KK KK+K+K++K+ +K
Sbjct: 14 KKKAKKAKKEKRKQRKQARKGA 35
Score = 33.0 bits (76), Expect = 0.43
Identities = 11/22 (50%), Positives = 17/22 (77%)
Query: 724 KKKKKKKKKKKKKKKKKKKKKK 745
KKK KK KK+K+K++K+ +K
Sbjct: 14 KKKAKKAKKEKRKQRKQARKGA 35
Score = 33.0 bits (76), Expect = 0.43
Identities = 11/22 (50%), Positives = 17/22 (77%)
Query: 831 KKKKKKKKKKKKKKKKKKKKKK 852
KKK KK KK+K+K++K+ +K
Sbjct: 14 KKKAKKAKKEKRKQRKQARKGA 35
Score = 32.6 bits (75), Expect = 0.52
Identities = 11/31 (35%), Positives = 23/31 (74%)
Query: 728 KKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
KKK KK KK+K+K++K+ +K + ++ +++
Sbjct: 14 KKKAKKAKKEKRKQRKQARKGADDGDDELKQ 44
Score = 32.6 bits (75), Expect = 0.53
Identities = 11/23 (47%), Positives = 18/23 (78%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKK 850
DK+K KK KK+K+K++K+ +K
Sbjct: 13 DKKKAKKAKKEKRKQRKQARKGA 35
Score = 32.6 bits (75), Expect = 0.53
Identities = 13/34 (38%), Positives = 22/34 (64%)
Query: 837 KKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELA 870
KKK KK KK+K+K++K +KG D ++++
Sbjct: 14 KKKAKKAKKEKRKQRKQARKGADDGDDELKQAAE 47
Score = 32.2 bits (74), Expect = 0.62
Identities = 12/27 (44%), Positives = 19/27 (70%)
Query: 825 AYLDKEKKKKKKKKKKKKKKKKKKKKK 851
A L +KK KK KK+K+K++K+ +K
Sbjct: 9 AGLVDKKKAKKAKKEKRKQRKQARKGA 35
Score = 32.2 bits (74), Expect = 0.76
Identities = 10/22 (45%), Positives = 17/22 (77%)
Query: 722 EKKKKKKKKKKKKKKKKKKKKK 743
+KK KK KK+K+K++K+ +K
Sbjct: 14 KKKAKKAKKEKRKQRKQARKGA 35
Score = 31.8 bits (73), Expect = 0.83
Identities = 14/33 (42%), Positives = 23/33 (69%)
Query: 731 KKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAA 763
KKK KK KK+K+K++K+ ++G +L+ AA
Sbjct: 14 KKKAKKAKKEKRKQRKQARKGADDGDDELKQAA 46
Score = 29.5 bits (67), Expect = 4.8
Identities = 9/22 (40%), Positives = 17/22 (77%)
Query: 721 EEKKKKKKKKKKKKKKKKKKKK 742
++K KK KK+K+K++K+ +K
Sbjct: 14 KKKAKKAKKEKRKQRKQARKGA 35
>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
Length = 1465
Score = 36.8 bits (85), Expect = 0.060
Identities = 14/48 (29%), Positives = 21/48 (43%), Gaps = 9/48 (18%)
Query: 716 DKEEEEEKKKKKKK---------KKKKKKKKKKKKKKKKKKKYSNEEE 754
D + EE ++K ++ KK ++ KK KK KK S E
Sbjct: 1172 DAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESET 1219
>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
Length = 191
Score = 35.6 bits (82), Expect = 0.061
Identities = 22/52 (42%), Positives = 27/52 (51%)
Query: 703 TTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
T V + N +P+ E KK KKKK +KKK KK KKKKK+ E
Sbjct: 28 TESVDKVNDDIFPEDVEIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPE 79
Score = 35.2 bits (81), Expect = 0.077
Identities = 21/46 (45%), Positives = 25/46 (54%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRD 761
D E +K KKKK +KKK KK KKKKK+K E +L D
Sbjct: 42 DVEIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSD 87
Score = 32.5 bits (74), Expect = 0.62
Identities = 18/45 (40%), Positives = 24/45 (53%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKL 759
D E+ + KK KKKK +KKK KK KK E+E + +L
Sbjct: 36 DDIFPEDVEIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPEL 80
Score = 30.9 bits (70), Expect = 1.9
Identities = 17/40 (42%), Positives = 26/40 (65%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
P K+ ++KK +KKK KK KKKKK+K++ + + EE
Sbjct: 46 PSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEEL 85
Score = 30.9 bits (70), Expect = 2.2
Identities = 17/44 (38%), Positives = 22/44 (50%)
Query: 832 KKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINRT 875
KK KKKK +KKK KK K KK +V +EL++
Sbjct: 48 KKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEEN 91
Score = 30.5 bits (69), Expect = 2.5
Identities = 15/24 (62%), Positives = 16/24 (66%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKK 852
K KKKK +KKK KK KKKKK
Sbjct: 49 KTSKKKKTTPRKKKTTKKTKKKKK 72
Score = 29.8 bits (67), Expect = 4.3
Identities = 16/35 (45%), Positives = 22/35 (62%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLS 863
K+ KKKK +KKK KK KKKK K+ P++ +
Sbjct: 48 KKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAA 82
>gnl|CDD|197876 smart00792, Agouti, Agouti protein. The agouti protein regulates
pigmentation in the mouse hair follicle producing a
black hair with a subapical yellow band. A highly
homologous protein agouti signal protein (ASIP) is
present in humans and is expressed at highest levels in
adipose tissue where it may play a role in energy
homeostasis and possibly human pigmentation.
Length = 124
Score = 34.5 bits (79), Expect = 0.062
Identities = 17/69 (24%), Positives = 30/69 (43%), Gaps = 13/69 (18%)
Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCEPQIAPC 782
KK KK ++ +KK +KK+KK + + + +C +++ C+ PC
Sbjct: 52 KKPKKISAEEAEKKLLQKKEKKALTN-------VLRPEPRSPRRCVRLRD-SCKGPQPPC 103
Query: 783 LFDIKNDPC 791
DPC
Sbjct: 104 C-----DPC 107
>gnl|CDD|222592 pfam14204, Ribosomal_L18_c, Ribosomal L18 C-terminal region. This
domain is the C-terminal end of ribosomal L18/L5
proteins.
Length = 93
Score = 33.7 bits (78), Expect = 0.062
Identities = 13/32 (40%), Positives = 20/32 (62%), Gaps = 3/32 (9%)
Query: 729 KKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLR 760
+KKK+KK+ K + K+ KK + E+ RK R
Sbjct: 61 RKKKEKKEVKAESKRYNAKKLTLEQ---RKAR 89
Score = 31.0 bits (71), Expect = 0.50
Identities = 10/23 (43%), Positives = 15/23 (65%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKK 850
D +KKK+KK+ K + K+ KK
Sbjct: 58 DPSRKKKEKKEVKAESKRYNAKK 80
Score = 31.0 bits (71), Expect = 0.66
Identities = 9/21 (42%), Positives = 15/21 (71%)
Query: 833 KKKKKKKKKKKKKKKKKKKKM 853
+KKK+KK+ K + K+ KK+
Sbjct: 61 RKKKEKKEVKAESKRYNAKKL 81
Score = 30.2 bits (69), Expect = 1.2
Identities = 9/21 (42%), Positives = 14/21 (66%)
Query: 722 EKKKKKKKKKKKKKKKKKKKK 742
+KKK+KK+ K + K+ KK
Sbjct: 60 SRKKKEKKEVKAESKRYNAKK 80
Score = 29.8 bits (68), Expect = 1.3
Identities = 10/26 (38%), Positives = 17/26 (65%)
Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKK 748
KKK+KK+ K + K+ KK +++K
Sbjct: 62 KKKEKKEVKAESKRYNAKKLTLEQRK 87
Score = 29.4 bits (67), Expect = 1.9
Identities = 9/20 (45%), Positives = 14/20 (70%)
Query: 724 KKKKKKKKKKKKKKKKKKKK 743
+KKK+KK+ K + K+ KK
Sbjct: 61 RKKKEKKEVKAESKRYNAKK 80
Score = 29.4 bits (67), Expect = 1.9
Identities = 9/20 (45%), Positives = 14/20 (70%)
Query: 725 KKKKKKKKKKKKKKKKKKKK 744
+KKK+KK+ K + K+ KK
Sbjct: 61 RKKKEKKEVKAESKRYNAKK 80
Score = 29.4 bits (67), Expect = 1.9
Identities = 9/20 (45%), Positives = 14/20 (70%)
Query: 726 KKKKKKKKKKKKKKKKKKKK 745
+KKK+KK+ K + K+ KK
Sbjct: 61 RKKKEKKEVKAESKRYNAKK 80
Score = 29.4 bits (67), Expect = 1.9
Identities = 9/20 (45%), Positives = 14/20 (70%)
Query: 832 KKKKKKKKKKKKKKKKKKKK 851
+KKK+KK+ K + K+ KK
Sbjct: 61 RKKKEKKEVKAESKRYNAKK 80
Score = 29.1 bits (66), Expect = 3.1
Identities = 9/27 (33%), Positives = 17/27 (62%)
Query: 722 EKKKKKKKKKKKKKKKKKKKKKKKKKK 748
+KK+KK+ K + K+ KK +++K
Sbjct: 62 KKKEKKEVKAESKRYNAKKLTLEQRKA 88
Score = 27.5 bits (62), Expect = 8.2
Identities = 9/33 (27%), Positives = 19/33 (57%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKK 747
K+E++E K + K+ KK +++K + +K
Sbjct: 61 RKKKEKKEVKAESKRYNAKKLTLEQRKARVAQK 93
Score = 27.5 bits (62), Expect = 9.0
Identities = 8/28 (28%), Positives = 18/28 (64%)
Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
++K+KK+ K + K+ KK +++K +
Sbjct: 62 KKKEKKEVKAESKRYNAKKLTLEQRKAR 89
>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe. JAK1 is a
tyrosine kinase protein essential in signaling type I
and type II cytokines. It interacts with the gamma chain
of type I cytokine receptors to elicit signals from the
IL-2 receptor family, the IL-4 receptor family, the
gp130 receptor family, ciliary neurotrophic factor
receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
Leptin-R). It also is involved in transducing a signal
by type I (IFN-alpha/beta) and type II (IFN-gamma)
interferons, and members of the IL-10 family via type II
cytokine receptors. JAK (also called Just Another
Kinase) is a family of intracellular, non-receptor
tyrosine kinases that transduce cytokine-mediated
signals via the JAK-STAT pathway. The JAK family in
mammals consists of 4 members: JAK1, JAK2, JAK3 and
TYK2. JAKs are composed of seven JAK homology (JH)
domains (JH1-JH7) . The C-terminal JH1 domain is the
main catalytic domain, followed by JH2, which is often
referred to as a pseudokinase domain, followed by
JH3-JH4 which is homologous to the SH2 domain, and
lastly JH5-JH7 which is a FERM domain. Named after
Janus, the two-faced Roman god of doorways, JAKs possess
two near-identical phosphate-transferring domains; one
which displays the kinase activity (JH1), while the
other negatively regulates the kinase activity of the
first (JH2). The FERM domain has a cloverleaf tripart
structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
C-lobe/F3 within the FERM domain is part of the PH
domain family. The FERM domain is found in the
cytoskeletal-associated proteins such as ezrin, moesin,
radixin, 4.1R, and merlin. These proteins provide a link
between the membrane and cytoskeleton and are involved
in signal transduction pathways. The FERM domain is also
found in protein tyrosine phosphatases (PTPs) , the
tyrosine kinases FAK and JAK, in addition to other
proteins involved in signaling. This domain is
structurally similar to the PH and PTB domains and
consequently is capable of binding to both peptides and
phospholipids at different sites.
Length = 198
Score = 35.6 bits (82), Expect = 0.066
Identities = 16/30 (53%), Positives = 21/30 (70%)
Query: 722 EKKKKKKKKKKKKKKKKKKKKKKKKKKYSN 751
EKKKK K KK K K KK + KKK ++ ++N
Sbjct: 90 EKKKKGKSKKNKLKGKKDEDKKKAREGWNN 119
Score = 33.7 bits (77), Expect = 0.24
Identities = 18/50 (36%), Positives = 24/50 (48%)
Query: 699 INHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
+ Y V I++ K +KKKK K KK K K KK + KKK +
Sbjct: 65 SHDYEVLVTGNTGISWRRKPATTAVEKKKKGKSKKNKLKGKKDEDKKKAR 114
Score = 32.5 bits (74), Expect = 0.61
Identities = 14/26 (53%), Positives = 16/26 (61%)
Query: 728 KKKKKKKKKKKKKKKKKKKKKYSNEE 753
KKKK K KK K K KK + KK + E
Sbjct: 91 KKKKGKSKKNKLKGKKDEDKKKAREG 116
Score = 32.5 bits (74), Expect = 0.64
Identities = 15/23 (65%), Positives = 16/23 (69%)
Query: 830 EKKKKKKKKKKKKKKKKKKKKKK 852
EKKKK K KK K K KK + KKK
Sbjct: 90 EKKKKGKSKKNKLKGKKDEDKKK 112
Score = 32.1 bits (73), Expect = 0.80
Identities = 13/26 (50%), Positives = 18/26 (69%)
Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKK 852
++K+KK K KK K K KK + KKK +
Sbjct: 89 VEKKKKGKSKKNKLKGKKDEDKKKAR 114
Score = 30.6 bits (69), Expect = 2.9
Identities = 13/26 (50%), Positives = 14/26 (53%)
Query: 729 KKKKKKKKKKKKKKKKKKKKYSNEEE 754
KKKK K KK K K KK + K E
Sbjct: 91 KKKKGKSKKNKLKGKKDEDKKKAREG 116
Score = 30.2 bits (68), Expect = 3.2
Identities = 14/27 (51%), Positives = 18/27 (66%)
Query: 832 KKKKKKKKKKKKKKKKKKKKKMMKKGY 858
KKKK K KK K K KK + KK ++G+
Sbjct: 91 KKKKGKSKKNKLKGKKDEDKKKAREGW 117
>gnl|CDD|219783 pfam08297, U3_snoRNA_assoc, U3 snoRNA associated. This family of
proteins is associated with U3 snoRNA. U3 snoRNA is
required for nucleolar processing of pre-18S ribosomal
RNA.
Length = 87
Score = 33.5 bits (77), Expect = 0.068
Identities = 16/35 (45%), Positives = 21/35 (60%)
Query: 712 IAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKK 746
P+ EEE KK+KKK K + KK +K+ K KK
Sbjct: 18 TPPPESYEEELKKQKKKNKLRFLKKLEKRPKDVKK 52
Score = 32.0 bits (73), Expect = 0.24
Identities = 16/33 (48%), Positives = 21/33 (63%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKK 747
P + EEE KK+KKK K + KK +K+ K KK
Sbjct: 20 PPESYEEELKKQKKKNKLRFLKKLEKRPKDVKK 52
Score = 31.6 bits (72), Expect = 0.29
Identities = 16/34 (47%), Positives = 21/34 (61%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
P E EE+ KK+KKK K + KK +K+ K KK
Sbjct: 19 PPPESYEEELKKQKKKNKLRFLKKLEKRPKDVKK 52
Score = 30.1 bits (68), Expect = 0.98
Identities = 14/30 (46%), Positives = 21/30 (70%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKKG 857
+E+ KK+KKK K + KK +K+ K +KKG
Sbjct: 24 YEEELKKQKKKNKLRFLKKLEKRPKDVKKG 53
Score = 29.7 bits (67), Expect = 1.7
Identities = 12/36 (33%), Positives = 17/36 (47%)
Query: 817 PKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKK 852
P K++KKK K + KK +K+ K KK
Sbjct: 17 PTPPPESYEEELKKQKKKNKLRFLKKLEKRPKDVKK 52
>gnl|CDD|219913 pfam08576, DUF1764, Eukaryotic protein of unknown function
(DUF1764). This is a family of eukaryotic proteins of
unknown function. This family contains many hypothetical
proteins.
Length = 98
Score = 33.6 bits (77), Expect = 0.069
Identities = 16/39 (41%), Positives = 20/39 (51%)
Query: 722 EKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLR 760
+KKKK+ K + K KK +KK KKK E K R
Sbjct: 25 KKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEESKRR 63
Score = 33.6 bits (77), Expect = 0.083
Identities = 16/49 (32%), Positives = 26/49 (53%)
Query: 710 NQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
+ I K+ +++KK+ K + K KK +KK KKK + EE R+
Sbjct: 15 DDIFSNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEESKRR 63
Score = 31.3 bits (71), Expect = 0.54
Identities = 14/53 (26%), Positives = 26/53 (49%), Gaps = 4/53 (7%)
Query: 709 FNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNE----EEGMR 757
F+ I K+++ K + K KK +KK KKK + + ++ E+G+
Sbjct: 18 FSNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEESKRRRTEDGLP 70
Score = 29.7 bits (67), Expect = 1.5
Identities = 15/45 (33%), Positives = 24/45 (53%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRD 761
K ++++K+ K + K KK +KK KKK + EE R+ D
Sbjct: 23 KRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEESKRRRTED 67
Score = 28.6 bits (64), Expect = 4.0
Identities = 12/32 (37%), Positives = 15/32 (46%)
Query: 825 AYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
KK+KKKKK+ K + K KK K
Sbjct: 16 DIFSNIKKRKKKKKRTAKTARPKATKKGQKKD 47
Score = 28.2 bits (63), Expect = 6.2
Identities = 12/33 (36%), Positives = 19/33 (57%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDV 861
++KKKK+ K + K KK +KK K +P+
Sbjct: 24 RKKKKKRTAKTARPKATKKGQKKDKKKDEFPEF 56
Score = 28.2 bits (63), Expect = 6.8
Identities = 13/46 (28%), Positives = 23/46 (50%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDA 762
K E+ +K+ KK+KKKKK+ K + ++G +K +
Sbjct: 5 KNEKTDKRDIDDIFSNIKKRKKKKKRTAKTARPKATKKGQKKDKKK 50
>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355). This
family of proteins is found in bacteria and viruses.
Proteins in this family are typically between 180 and
214 amino acids in length.
Length = 125
Score = 34.2 bits (79), Expect = 0.071
Identities = 11/45 (24%), Positives = 26/45 (57%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLR 760
KE+ + +KK+++KK + +K K ++K + + E+ + +L
Sbjct: 18 AKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEELE 62
Score = 33.8 bits (78), Expect = 0.10
Identities = 14/37 (37%), Positives = 25/37 (67%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
++E +K K+K K +KK+++KK + +K K S EE+
Sbjct: 10 DKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEK 46
Score = 32.2 bits (74), Expect = 0.37
Identities = 14/44 (31%), Positives = 24/44 (54%)
Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDA 762
E EE+K K+ K K+K K +KK+++ +E E + K+
Sbjct: 1 EPEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAE 44
Score = 32.2 bits (74), Expect = 0.38
Identities = 8/43 (18%), Positives = 22/43 (51%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
+K+ E EK K ++K + + +K +K+ ++ + ++
Sbjct: 30 EKKSEAEKLAKMSAEEKAEYELEKLEKELEELEAELARRELKA 72
Score = 32.2 bits (74), Expect = 0.43
Identities = 10/39 (25%), Positives = 23/39 (58%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
+K++EE+K + +K K ++K + + +K +K+ E
Sbjct: 25 EKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEELEA 63
Score = 31.8 bits (73), Expect = 0.43
Identities = 11/47 (23%), Positives = 26/47 (55%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAA 763
+EE++ + +K K ++K + + +K +K+ + E R+L+ A
Sbjct: 28 QEEKKSEAEKLAKMSAEEKAEYELEKLEKELEELEAELARRELKAEA 74
Score = 31.5 bits (72), Expect = 0.70
Identities = 9/42 (21%), Positives = 23/42 (54%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
+ E+++++KK + +K K ++K + + +K E E +
Sbjct: 22 AKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEELEA 63
Score = 30.3 bits (69), Expect = 1.5
Identities = 15/43 (34%), Positives = 25/43 (58%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
DK +EK K +KK+++KK + +K K ++K E E + K
Sbjct: 14 DKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEK 56
Score = 29.9 bits (68), Expect = 2.2
Identities = 13/63 (20%), Positives = 30/63 (47%), Gaps = 10/63 (15%)
Query: 686 EKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKK 745
EK +SE +++ + E +K E E +K +K+ ++ + + +++ K +
Sbjct: 25 EKKQEEKKSEAEKLAKMSAE----------EKAEYELEKLEKELEELEAELARRELKAEA 74
Query: 746 KKK 748
KK
Sbjct: 75 KKM 77
Score = 29.5 bits (67), Expect = 3.0
Identities = 16/50 (32%), Positives = 28/50 (56%), Gaps = 1/50 (2%)
Query: 825 AYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINR 874
+ KEK K +KK+++KK + +K K +K + L ++EKEL +
Sbjct: 15 KAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYE-LEKLEKELEELEA 63
Score = 28.4 bits (64), Expect = 6.9
Identities = 12/40 (30%), Positives = 20/40 (50%)
Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDA 762
K+K K +KK+++KK + +K K E + KL
Sbjct: 18 AKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKE 57
>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family. The yeast member of this family
(Kri1p) is found to be required for 40S ribosome
biogenesis in the nucleolus.
Length = 99
Score = 33.4 bits (77), Expect = 0.083
Identities = 12/56 (21%), Positives = 31/56 (55%), Gaps = 10/56 (17%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKE 772
KE +EE+K +++++ K+ K K+++ ++K + K++ A ++ + E
Sbjct: 1 KERKEEEKAQREEELKRLKNLKREEIEEK----------LEKIKKVAGLRGADLSE 46
Score = 31.1 bits (71), Expect = 0.55
Identities = 7/28 (25%), Positives = 21/28 (75%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMK 855
++EK +++++ K+ K K+++ ++K+ K
Sbjct: 5 EEEKAQREEELKRLKNLKREEIEEKLEK 32
Score = 31.1 bits (71), Expect = 0.63
Identities = 8/29 (27%), Positives = 22/29 (75%), Gaps = 1/29 (3%)
Query: 829 KEKKKK-KKKKKKKKKKKKKKKKKKMMKK 856
KE+K++ K +++++ K+ K K++++ +K
Sbjct: 1 KERKEEEKAQREEELKRLKNLKREEIEEK 29
>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor. The
spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
U5) and proteins, catalyzes the excision of introns from
pre-mRNAs in two successive trans-esterification
reactions. Step 2 depends upon integral spliceosome
constituents such as U5 snRNA and Prp8 and
non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
conformational change in the spliceosome that leads to
protection of the 3'ss from targeted RNase H cleavage.
This change, which probably reflects binding of the 3'ss
PyAG in the catalytic centre of the spliceosome,
requires the ordered recruitment of Slu7, Prp18, and
Prp22 to the spliceosome. There is a close functional
relationship between Prp8, Prp18, and Slu7, and Prp18
interacts with Slu7, so that together they recruit Prp22
to the spliceosome. Most members of the family carry a
zinc-finger of the CCHC-type upstream of this domain.
Length = 236
Score = 35.5 bits (82), Expect = 0.090
Identities = 13/42 (30%), Positives = 24/42 (57%)
Query: 725 KKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQ 766
+KK+K+KK++ K +KK+ +KY EE + ++ Q
Sbjct: 160 LLRKKEKEKKEQLKIQKKQSLLEKYGGEEHLDKPPKELLLGQ 201
>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
eukaryotic snRNP [Transcription].
Length = 564
Score = 35.8 bits (83), Expect = 0.10
Identities = 32/171 (18%), Positives = 68/171 (39%), Gaps = 25/171 (14%)
Query: 705 EVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKK----------KYSNEEE 754
E + Q+A +++ E++ KK + K +K++ + ++ +K ++ EE
Sbjct: 276 ERDKIKQLASELEKKLEKELKKLENKLEKQEDELEELEKAAEELRQKGELLYANLQLIEE 335
Query: 755 GMRKLRDAASIQCGPVKEVPCEPQIAPCLFD-------IKNDPCEKNNLADRSEVQRINH 807
G++ +R A +K + + P K + N SE++
Sbjct: 336 GLKSVRLADFYGNEEIK-IELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQLSELKEAIA 394
Query: 808 YTTEVGYLDPKQRFNQIAYLD--KEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
Y K + +E +++ ++ K KKKK+KKK+ +K
Sbjct: 395 YYESA-----KTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKEWFEK 440
Score = 33.5 bits (77), Expect = 0.49
Identities = 15/65 (23%), Positives = 27/65 (41%), Gaps = 1/65 (1%)
Query: 692 DRSEDQRINHYTTEVGRF-NQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYS 750
SE + Y + EE ++ ++ K KKKK+KKK+ +K ++
Sbjct: 385 QLSELKEAIAYYESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKEWFEKFRWF 444
Query: 751 NEEEG 755
+G
Sbjct: 445 VSSDG 449
Score = 32.0 bits (73), Expect = 1.9
Identities = 18/90 (20%), Positives = 28/90 (31%), Gaps = 24/90 (26%)
Query: 680 IKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEE------EKKKKKKKKKKK 733
I+ D + QR ++ E + E K +K +
Sbjct: 353 IELDK----SKTPSENAQRYFKKYKKLKGAKVNLDRQLSELKEAIAYYESAKTALEKAEG 408
Query: 734 KK--------------KKKKKKKKKKKKKY 749
KK K KKKK+KKK+ +
Sbjct: 409 KKAIEEIREELIEEGLLKSKKKKRKKKEWF 438
>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
binding fragment. Topoisomerase I promotes the
relaxation of DNA superhelical tension by introducing a
transient single-stranded break in duplex DNA and are
vital for the processes of replication, transcription,
and recombination. This family may be more than one
structural domain.
Length = 215
Score = 34.9 bits (81), Expect = 0.11
Identities = 16/39 (41%), Positives = 22/39 (56%), Gaps = 1/39 (2%)
Query: 821 FNQI-AYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGY 858
F I Y + EK+KKK K++KK K++K K GY
Sbjct: 88 FTPIYEYFEAEKEKKKAMSKEEKKAIKEEKDKLEEPYGY 126
Score = 32.9 bits (76), Expect = 0.53
Identities = 12/29 (41%), Positives = 20/29 (68%)
Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
E +K+KKK K++KK K++K K ++ Y
Sbjct: 96 EAEKEKKKAMSKEEKKAIKEEKDKLEEPY 124
Score = 30.2 bits (69), Expect = 3.3
Identities = 12/28 (42%), Positives = 19/28 (67%)
Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKKKK 747
E EK+KKK K++KK K++K K ++
Sbjct: 96 EAEKEKKKAMSKEEKKAIKEEKDKLEEP 123
Score = 30.2 bits (69), Expect = 3.4
Identities = 12/28 (42%), Positives = 19/28 (67%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKK 745
E E+EKKK K++KK K++K K ++
Sbjct: 96 EAEKEKKKAMSKEEKKAIKEEKDKLEEP 123
Score = 29.9 bits (68), Expect = 4.9
Identities = 12/31 (38%), Positives = 20/31 (64%)
Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
E E++KKK K++KK K++K K ++ Y
Sbjct: 96 EAEKEKKKAMSKEEKKAIKEEKDKLEEPYGY 126
Score = 29.1 bits (66), Expect = 8.6
Identities = 10/28 (35%), Positives = 19/28 (67%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKK 744
+ E+E+KK K++KK K++K K ++
Sbjct: 96 EAEKEKKKAMSKEEKKAIKEEKDKLEEP 123
>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
Validated.
Length = 860
Score = 36.0 bits (83), Expect = 0.11
Identities = 14/35 (40%), Positives = 16/35 (45%)
Query: 714 YPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
Y + +KK K KKK K K K KKK K
Sbjct: 739 YAGTPKTPYEKKTKAKKKSASTKGKAAKTVKKKSK 773
Score = 35.6 bits (82), Expect = 0.16
Identities = 14/27 (51%), Positives = 14/27 (51%)
Query: 722 EKKKKKKKKKKKKKKKKKKKKKKKKKK 748
EKK K KKK K K K KKK K
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKA 774
Score = 34.8 bits (80), Expect = 0.25
Identities = 14/38 (36%), Positives = 19/38 (50%)
Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGM 756
E++ K KKK K K K KKK K K K + ++
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRAG 785
Score = 34.8 bits (80), Expect = 0.25
Identities = 15/45 (33%), Positives = 18/45 (40%)
Query: 725 KKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGP 769
+KK K KKK K K K KKK + + K R P
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRAGPLYTPSP 792
Score = 34.0 bits (78), Expect = 0.48
Identities = 17/53 (32%), Positives = 21/53 (39%), Gaps = 4/53 (7%)
Query: 696 DQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
D I Y + Y K + ++K K K K KKK K K KK K
Sbjct: 733 DAVITKYAGT----PKTPYEKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTK 781
Score = 33.6 bits (77), Expect = 0.62
Identities = 18/54 (33%), Positives = 20/54 (37%), Gaps = 4/54 (7%)
Query: 823 QIAYLDKEKKKK----KKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANI 872
+ Y K K KK K K K KKK K K K K L LA +
Sbjct: 744 KTPYEKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRAGPLYTPSPALAAM 797
Score = 33.6 bits (77), Expect = 0.62
Identities = 15/38 (39%), Positives = 20/38 (52%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
E++ + KKK K K K KKK K K KK + + G
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRAG 785
Score = 32.1 bits (73), Expect = 1.5
Identities = 13/33 (39%), Positives = 16/33 (48%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKK 747
K++ K K K KKK K K KK KK+
Sbjct: 752 KAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRA 784
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 35.1 bits (81), Expect = 0.12
Identities = 6/35 (17%), Positives = 30/35 (85%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSN 751
+++++K++++++K++KK+++++++K+K ++ +
Sbjct: 192 LKQQQQKREEERRKQRKKQQEEEERKQKAEEAWQK 226
Score = 32.8 bits (75), Expect = 0.67
Identities = 24/142 (16%), Positives = 74/142 (52%), Gaps = 25/142 (17%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNE--EEGMR-KLRDAASIQCGPVKEVP 774
+ ++K +K ++K+K++++K++++ + +++ + E EE R K + AA +
Sbjct: 98 AQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEKYEEWCRQKAQQAAKQRTPK----- 152
Query: 775 CEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKK 834
K + E + + + + + E + K+R + + K+ K++
Sbjct: 153 -----------HKKEAAESASSSLSGSAKPERNVSQE----EAKKRLQE--WELKKLKQQ 195
Query: 835 KKKKKKKKKKKKKKKKKKMMKK 856
++K++++++K++KK++++ +K
Sbjct: 196 QQKREEERRKQRKKQQEEEERK 217
Score = 30.9 bits (70), Expect = 2.4
Identities = 24/141 (17%), Positives = 58/141 (41%), Gaps = 42/141 (29%)
Query: 716 DKEEEEEKKKKKKKKKKKK----KKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVK 771
+KE EE + +++ K+K + +K ++ K++ K E A+S G K
Sbjct: 117 EKEREEAELRQRLAKEKYEEWCRQKAQQAAKQRTPKHKKEAAES------ASSSLSGSAK 170
Query: 772 EVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEK 831
Q K +Q + K + ++K
Sbjct: 171 PERNVSQEE-----AKK------------RLQ---------EWELKKLK------QQQQK 198
Query: 832 KKKKKKKKKKKKKKKKKKKKK 852
++++++K++KK+++++++K+K
Sbjct: 199 REEERRKQRKKQQEEEERKQK 219
Score = 29.7 bits (67), Expect = 7.2
Identities = 13/47 (27%), Positives = 33/47 (70%), Gaps = 4/47 (8%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAA 763
K+ +E + KK K++++K++++++K++KK++ EEE ++ + A
Sbjct: 181 KKRLQEWELKKLKQQQQKREEERRKQRKKQQ----EEEERKQKAEEA 223
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 35.5 bits (82), Expect = 0.12
Identities = 14/51 (27%), Positives = 30/51 (58%), Gaps = 1/51 (1%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMR-KLRDAASI 765
+ E E K + K+K K ++ K+K++K+ ++ + EE+ ++ KL A +
Sbjct: 262 FESEYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQLARL 312
Score = 33.5 bits (77), Expect = 0.54
Identities = 11/45 (24%), Positives = 25/45 (55%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLR 760
+ E E + K + K+K K ++ K+K++K+ E + ++L+
Sbjct: 259 WEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLK 303
Score = 32.4 bits (74), Expect = 1.00
Identities = 12/48 (25%), Positives = 25/48 (52%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINRT 875
+ E K + K+K K ++ K+K++ + Q++K+LA + R
Sbjct: 265 EYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQLARL 312
Score = 32.4 bits (74), Expect = 1.3
Identities = 11/51 (21%), Positives = 25/51 (49%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASI 765
+ E E + K + K+K K ++ K+K++K+ +E + + A +
Sbjct: 259 WEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQL 309
Score = 32.0 bits (73), Expect = 1.6
Identities = 8/35 (22%), Positives = 23/35 (65%)
Query: 714 YPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
+K ++K K ++ K+K++K+ +++ K++K+
Sbjct: 268 PINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQL 302
Score = 31.6 bits (72), Expect = 2.1
Identities = 34/161 (21%), Positives = 75/161 (46%), Gaps = 22/161 (13%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCE 776
+EE EK+ K +KK+++ ++ ++KK +K + S +E L + + +
Sbjct: 199 LQEEYEKEVKAEKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESA 258
Query: 777 PQIAPCLFDIKNDPCE--------KNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLD 828
+ ++ N P +N R E++R ++ K++ Q+A L
Sbjct: 259 WEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQL-----KKKLAQLARLK 313
Query: 829 K---------EKKKKKKKKKKKKKKKKKKKKKKMMKKGYPD 860
+ + + +KK+++K++ +KKK K++K+ K YP+
Sbjct: 314 EIAKEVAQKEKARARKKEQRKERGEKKKLKRRKLGKHKYPE 354
Score = 31.2 bits (71), Expect = 2.3
Identities = 12/66 (18%), Positives = 35/66 (53%), Gaps = 10/66 (15%)
Query: 710 NQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKK----------KKKYSNEEEGMRKL 759
N+ P ++ + ++ K+K++K+ +++ K++K+ KKK K+ + +E+ +
Sbjct: 270 NKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQLARLKEIAKEVAQKEKARARK 329
Query: 760 RDAASI 765
++
Sbjct: 330 KEQRKE 335
Score = 31.2 bits (71), Expect = 2.5
Identities = 26/136 (19%), Positives = 51/136 (37%), Gaps = 47/136 (34%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPC 775
E E + + K + K+K K ++ K+K++K E E K
Sbjct: 258 AWEGFESEYEPINKPVRPKRKTKAQRNKEKRRK---ELEREAKEEKQ------------- 301
Query: 776 EPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKK 835
K LA + ++ I ++A +K + +KK
Sbjct: 302 ----------------LKKKLAQLARLKEIAK---------------EVAQKEKARARKK 330
Query: 836 KKKKKKKKKKKKKKKK 851
+++K++ +KKK K++K
Sbjct: 331 EQRKERGEKKKLKRRK 346
Score = 30.8 bits (70), Expect = 3.8
Identities = 18/59 (30%), Positives = 27/59 (45%), Gaps = 16/59 (27%)
Query: 715 PDKEEEEEKKKKKKKKK----------------KKKKKKKKKKKKKKKKKYSNEEEGMR 757
D+ ++K++KK KK KK+ KKK KKKK K + +E +R
Sbjct: 49 GDEATIKKKRRKKLKKPLKSDEILENRSKVPAPSVKKRHVKKKSKKKKSKQVSRKELLR 107
Score = 29.7 bits (67), Expect = 6.7
Identities = 15/45 (33%), Positives = 31/45 (68%)
Query: 705 EVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
++ R +IA ++E+ + +KK+++K++ +KKK K++K K KY
Sbjct: 308 QLARLKEIAKEVAQKEKARARKKEQRKERGEKKKLKRRKLGKHKY 352
Score = 29.7 bits (67), Expect = 8.3
Identities = 9/36 (25%), Positives = 22/36 (61%)
Query: 713 AYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
Y + K+K K ++ K+K++K+ +++ K++K
Sbjct: 265 EYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEK 300
>gnl|CDD|239570 cd03488, Topoisomer_IB_N_htopoI_like, Topoisomer_IB_N_htopoI_like :
N-terminal DNA binding fragment found in eukaryotic DNA
topoisomerase (topo) IB proteins similar to the
monomeric yeast and human topo I. Topo I enzymes are
divided into: topo type IA (bacterial) and type IB
(eukaryotic). Topo I relaxes superhelical tension in
duplex DNA by creating a single-strand nick, the broken
strand can then rotate around the unbroken strand to
remove DNA supercoils and, the nick is religated,
liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit religation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. This family may represent more than
one structural domain.
Length = 215
Score = 35.0 bits (81), Expect = 0.12
Identities = 14/39 (35%), Positives = 24/39 (61%), Gaps = 1/39 (2%)
Query: 821 FNQI-AYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGY 858
F Q+ AY +K++KK K++KK K +K+K + G+
Sbjct: 87 FTQMFAYFKAQKEEKKAMSKEEKKAIKAEKEKLEEEYGF 125
Score = 30.8 bits (70), Expect = 2.2
Identities = 10/29 (34%), Positives = 21/29 (72%)
Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
+ +K++KK K++KK K +K+K +++Y
Sbjct: 95 KAQKEEKKAMSKEEKKAIKAEKEKLEEEY 123
Score = 29.6 bits (67), Expect = 5.8
Identities = 13/38 (34%), Positives = 23/38 (60%)
Query: 709 FNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKK 746
F Q+ K ++EEKK K++KK K +K+K +++
Sbjct: 87 FTQMFAYFKAQKEEKKAMSKEEKKAIKAEKEKLEEEYG 124
>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family. This
family includes gbp a protein from Soybean that binds to
GAGA element dinucleotide repeat DNA. It seems likely
that the this domain mediates DNA binding. This putative
domain contains several conserved cysteines and a
histidine suggesting this may be a zinc-binding DNA
interaction domain.
Length = 301
Score = 35.2 bits (81), Expect = 0.12
Identities = 14/36 (38%), Positives = 16/36 (44%)
Query: 713 AYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
P E K+ KK KK + K K K KK KK
Sbjct: 134 VPPISTAPPEAKEVKKPKKGQSPKVPKAPKPKKPKK 169
Score = 34.9 bits (80), Expect = 0.15
Identities = 17/38 (44%), Positives = 19/38 (50%)
Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGM 756
E +E KK KK + K K K KK KKK SN M
Sbjct: 143 EAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKM 180
Score = 32.5 bits (74), Expect = 0.86
Identities = 15/33 (45%), Positives = 17/33 (51%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYS 750
E K+ KK KK + K K K KK KKK S
Sbjct: 140 APPEAKEVKKPKKGQSPKVPKAPKPKKPKKKGS 172
Score = 31.0 bits (70), Expect = 2.4
Identities = 13/51 (25%), Positives = 21/51 (41%), Gaps = 4/51 (7%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKK----KKKKKYSNEEEGMRKLRD 761
P+ +E ++ KK + K K K KK KKK + K + +
Sbjct: 142 PEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDPRSKPDWK 192
>gnl|CDD|153327 cd07643, I-BAR_IMD_MIM, Inverse (I)-BAR, also known as the
IRSp53/MIM homology Domain (IMD), of Missing In
Metastasis. The IMD domain, also called
Inverse-Bin/Amphiphysin/Rvs (I-BAR) domain, is a
dimerization and lipid-binding module that bends
membranes and induces membrane protrusions. Members of
this subfamily include missing in metastasis (MIM) or
metastasis suppressor 1 (MTSS1), metastasis suppressor
1-like (MTSSL) or ABBA (Actin-Bundling protein with
BAIAP2 homology), and similar proteins. They contain an
N-terminal IMD and a WASP homology 2 (WH2) actin-binding
motif at the C-terminus. MIM was originally identified
as a missing transcript from metastatic bladder and
prostate cancer cells. It is a scaffold protein that
functions in a signaling pathway between the PDGF
receptor, Src kinases, and actin assembly. It may also
function as a cofactor of the Sonic hedgehog (Shh)
transcriptional pathway and may participate in tumor
development and progression via this pathway. ABBA
regulates actin and plasma membrane dynamics to promote
the extension of radial glia, which is important in
neuronal migration, axon guidance and neurogenesis. The
IMD domain of MIM binds and bundles actin filaments,
binds membranes, and interacts with the small GTPase
Rac.
Length = 231
Score = 34.7 bits (80), Expect = 0.12
Identities = 15/51 (29%), Positives = 29/51 (56%), Gaps = 2/51 (3%)
Query: 825 AYLDKEKKKKKKKKKKKKKKKKKK--KKKKMMKKGYPDVLSQMEKELANIN 873
LDK+ K+ KK +++ KKK + +K +KG D+ Q++ + ++N
Sbjct: 114 NQLDKDHAKEYKKARQEIKKKSSDTIRLQKKARKGKGDLQPQLDSAMQDVN 164
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 35.4 bits (82), Expect = 0.13
Identities = 22/153 (14%), Positives = 48/153 (31%), Gaps = 25/153 (16%)
Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCEPQ 778
K + +++ KKK KK K K K +E +++ ++ + +V
Sbjct: 1 MTTASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIF-- 58
Query: 779 IAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKKKK 838
+ + ++ + ++ + K KKK
Sbjct: 59 -------LSGMVKDTDDATESDIPKKKTKTAAK----------------AAAAKAPAKKK 95
Query: 839 KKKKKKKKKKKKKKMMKKGYPDVLSQMEKELAN 871
K + KK +KK D+ + ++ N
Sbjct: 96 LKDELDSSKKAEKKNALDKDDDLNYVKDIDVLN 128
Score = 35.4 bits (82), Expect = 0.15
Identities = 14/68 (20%), Positives = 22/68 (32%)
Query: 681 KNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKK 740
+ E DQ + + V + D +++ K K K KKK K
Sbjct: 38 IKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLK 97
Query: 741 KKKKKKKK 748
+ KK
Sbjct: 98 DELDSSKK 105
Score = 33.4 bits (77), Expect = 0.55
Identities = 11/40 (27%), Positives = 18/40 (45%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
P K+ + K K KKK K + KK +K + ++
Sbjct: 75 PKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDK 114
Score = 32.3 bits (74), Expect = 1.4
Identities = 9/36 (25%), Positives = 13/36 (36%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYS 750
+ +KK K + KK +KK K Y
Sbjct: 86 AAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYV 121
Score = 31.5 bits (72), Expect = 2.2
Identities = 11/44 (25%), Positives = 19/44 (43%)
Query: 712 IAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
++ K+ ++ + KKK K K K KKK +E +
Sbjct: 59 LSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDS 102
Score = 31.5 bits (72), Expect = 2.4
Identities = 17/78 (21%), Positives = 22/78 (28%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINRTAVAPINKPFDKG 887
+ + KKK K K K KK D L +K +N D
Sbjct: 66 TDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDID 125
Query: 888 GDPKNFDHAWSIFGDDLK 905
+ D DDL
Sbjct: 126 VLNQADDDDDDDDDDDLD 143
Score = 31.1 bits (71), Expect = 2.6
Identities = 10/38 (26%), Positives = 15/38 (39%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE 753
K + K KKK K + KK +KK ++
Sbjct: 79 TKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDD 116
Score = 31.1 bits (71), Expect = 3.3
Identities = 10/49 (20%), Positives = 21/49 (42%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS 764
+ + K KKK K + KK +KK ++++ + ++D
Sbjct: 78 KTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDV 126
Score = 30.7 bits (70), Expect = 3.7
Identities = 12/50 (24%), Positives = 17/50 (34%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS 764
+ KKK K + KK +KK K N + + L A
Sbjct: 83 AKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADD 132
>gnl|CDD|179580 PRK03449, PRK03449, putative inner membrane protein translocase
component YidC; Provisional.
Length = 304
Score = 35.0 bits (81), Expect = 0.13
Identities = 15/40 (37%), Positives = 22/40 (55%), Gaps = 3/40 (7%)
Query: 709 FNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
F +I DKEEE +K+ K +++ K K K+ KKK
Sbjct: 267 FGKI---DKEEEAKKQAKAERRAANAPKPGAKPKRSKKKA 303
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 35.6 bits (82), Expect = 0.13
Identities = 19/39 (48%), Positives = 21/39 (53%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
DK E K KK K KKK KK KKK+ K+ K EE
Sbjct: 137 DKILGIETKAKKGKAKKKTKKSKKKEAKESSDKDDEEES 175
Score = 34.8 bits (80), Expect = 0.19
Identities = 17/35 (48%), Positives = 19/35 (54%)
Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
E K KK K KKK KK KKK+ K+ K E E
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESES 177
Score = 33.7 bits (77), Expect = 0.51
Identities = 15/40 (37%), Positives = 22/40 (55%)
Query: 725 KKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS 764
+ K KK K KKK KK KKK+ K+ ++E + D +
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESK 182
Score = 32.5 bits (74), Expect = 0.99
Identities = 32/156 (20%), Positives = 56/156 (35%), Gaps = 28/156 (17%)
Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRD-----------AASIQCGPVK 771
KK K +++K ++ K +KK K K + EE + L+ + +
Sbjct: 14 KKAKGFERQKLIRRIKNAEKKSKNDKLARIEEEIEALKTLDLDQLAKKYLFSKLIKVITS 73
Query: 772 EVPCEPQIAPCLFDIKND-------------PCEKNNLADR--SEVQRINHYTTEVGYLD 816
+ +P I ND EK + S + + +
Sbjct: 74 KPTKAWAESPEFRQIINDKSHKANPSRSEAEVVEKLKSLNNVVSRLLNSKPVKNVLESIM 133
Query: 817 PKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKK 852
++I ++ + KK K KKK KK KKK+ K+
Sbjct: 134 E--GLDKILGIETKAKKGKAKKKTKKSKKKEAKESS 167
Score = 32.5 bits (74), Expect = 1.1
Identities = 15/38 (39%), Positives = 22/38 (57%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
E + +K K KKK KK KKK+ K+ K ++ S E+
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDE 180
Score = 32.5 bits (74), Expect = 1.1
Identities = 14/46 (30%), Positives = 26/46 (56%)
Query: 708 RFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE 753
++I + + ++ K KKK KK KKK+ K+ K +++ S +E
Sbjct: 135 GLDKILGIETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDE 180
Score = 31.0 bits (70), Expect = 3.4
Identities = 18/46 (39%), Positives = 25/46 (54%)
Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS 764
E + KK K KKK KK KKK+ K+ K + +E E K ++A
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAE 188
Score = 30.2 bits (68), Expect = 5.2
Identities = 13/39 (33%), Positives = 24/39 (61%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
+ + ++ K KKK KK KKK+ K+ K +++ +E+E
Sbjct: 142 IETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDE 180
Score = 30.2 bits (68), Expect = 5.3
Identities = 12/25 (48%), Positives = 15/25 (60%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKK 852
+K K KKK KK KKK+ K+ K
Sbjct: 145 KAKKGKAKKKTKKSKKKEAKESSDK 169
>gnl|CDD|185611 PTZ00428, PTZ00428, 60S ribosomal protein L4; Provisional.
Length = 381
Score = 35.4 bits (82), Expect = 0.14
Identities = 17/66 (25%), Positives = 30/66 (45%), Gaps = 3/66 (4%)
Query: 687 KNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKK 746
KN L +R R+N ++ + +A +E+ K +K K ++ +KK KK + K
Sbjct: 316 KNPLKNRRVRLRLNPAAKKLRKLAVLA---QEKATAKGAQKVKNRRARKKAKKARLAKVA 372
Query: 747 KKYSNE 752
K
Sbjct: 373 KALYKS 378
Score = 35.0 bits (81), Expect = 0.18
Identities = 16/60 (26%), Positives = 25/60 (41%), Gaps = 7/60 (11%)
Query: 793 KNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKK 852
KN L +R R+N K R + +K K +K K ++ +KK KK +
Sbjct: 316 KNPLKNRRVRLRLNPAA-------KKLRKLAVLAQEKATAKGAQKVKNRRARKKAKKARL 368
>gnl|CDD|236275 PRK08486, PRK08486, single-stranded DNA-binding protein;
Provisional.
Length = 182
Score = 34.4 bits (79), Expect = 0.14
Identities = 13/67 (19%), Positives = 27/67 (40%)
Query: 419 IENSNTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENG 478
+++ + + + N +EN N YN ++ Y N YN N + +
Sbjct: 105 LDSKSDNPQANAMQDNSFHENFNNNYPGNYNNPSQDPYMNQAQSYNQNAYAKENQQAQPP 164
Query: 479 THEYNIP 485
+E ++P
Sbjct: 165 KYENSVP 171
>gnl|CDD|227238 COG4901, COG4901, Ribosomal protein S25 [Translation, ribosomal
structure and biogenesis].
Length = 107
Score = 33.0 bits (75), Expect = 0.14
Identities = 22/67 (32%), Positives = 30/67 (44%), Gaps = 6/67 (8%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVP 774
K + ++KK +K K K KKK KKKKK++ +L D KEVP
Sbjct: 2 APKSQLSKEKKAEKAKAGTAKDKKKWSKKKKKEEARRAVTVDEELLDKIR------KEVP 55
Query: 775 CEPQIAP 781
E + P
Sbjct: 56 RERVVTP 62
Score = 29.5 bits (66), Expect = 2.6
Identities = 13/35 (37%), Positives = 20/35 (57%)
Query: 817 PKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKK 851
PK + ++ +K K K KKK KKKKK++ +
Sbjct: 3 PKSQLSKEKKAEKAKAGTAKDKKKWSKKKKKEEAR 37
Score = 28.7 bits (64), Expect = 5.0
Identities = 15/38 (39%), Positives = 19/38 (50%)
Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQ 864
L KEKK +K K K KKK KKKK + + +
Sbjct: 7 LSKEKKAEKAKAGTAKDKKKWSKKKKKEEARRAVTVDE 44
Score = 28.3 bits (63), Expect = 6.3
Identities = 12/38 (31%), Positives = 19/38 (50%)
Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLR 760
K + K+KK +K K K KKK+S +++ R
Sbjct: 1 MAPKSQLSKEKKAEKAKAGTAKDKKKWSKKKKKEEARR 38
>gnl|CDD|220357 pfam09716, ETRAMP, Malarial early transcribed membrane protein
(ETRAMP). These sequences represent a family of
proteins from the malaria parasite Plasmodium
falciparum, several of which have been shown to be
expressed specifically in the ring stage as well as the
rodent parasite Plasmodium yoelii. A homologue from
Plasmodium chabaudi was localized to the parasitophorous
vacuole membrane. Members have an initial hydrophobic,
Phe/Tyr-rich, stretch long enough to span the membrane,
a highly charged region rich in Lys, a second putative
transmembrane region and a second highly charged, low
complexity sequence region. Some members have up to 100
residues of additional C-terminal sequence. These genes
have been shown to be found in the sub-telomeric regions
of both Plasmodium falciparum and P. yoelii chromosomes.
Length = 82
Score = 32.2 bits (74), Expect = 0.16
Identities = 16/30 (53%), Positives = 17/30 (56%)
Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKKKYS 750
KKKK KK KK +KKKK KKK S
Sbjct: 28 VAKKKKLKKLKKIDDDLEKKKKNKKKIIIS 57
Score = 31.8 bits (73), Expect = 0.26
Identities = 14/26 (53%), Positives = 18/26 (69%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMM 854
K+KK KK KK +KKKK KKK++
Sbjct: 30 KKKKLKKLKKIDDDLEKKKKNKKKII 55
Score = 30.3 bits (69), Expect = 0.70
Identities = 14/26 (53%), Positives = 16/26 (61%)
Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKK 745
+KKK KK KK +KKKK KKK
Sbjct: 28 VAKKKKLKKLKKIDDDLEKKKKNKKK 53
Score = 29.1 bits (66), Expect = 1.8
Identities = 13/29 (44%), Positives = 17/29 (58%)
Query: 826 YLDKEKKKKKKKKKKKKKKKKKKKKKKMM 854
+ +KKK KK KK +KKKK KK +
Sbjct: 26 NVVAKKKKLKKLKKIDDDLEKKKKNKKKI 54
Score = 28.8 bits (65), Expect = 2.6
Identities = 13/26 (50%), Positives = 16/26 (61%)
Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKK 744
++KK KK KK +KKKK KKK
Sbjct: 28 VAKKKKLKKLKKIDDDLEKKKKNKKK 53
Score = 28.8 bits (65), Expect = 3.0
Identities = 14/34 (41%), Positives = 18/34 (52%)
Query: 821 FNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMM 854
N K+K KK KK +KKKK KKK ++
Sbjct: 23 LNNNVVAKKKKLKKLKKIDDDLEKKKKNKKKIII 56
Score = 27.6 bits (62), Expect = 6.7
Identities = 12/26 (46%), Positives = 16/26 (61%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKK 743
+++K KK KK +KKKK KKK
Sbjct: 28 VAKKKKLKKLKKIDDDLEKKKKNKKK 53
>gnl|CDD|235370 PRK05244, PRK05244, Der GTPase activator; Provisional.
Length = 177
Score = 34.1 bits (79), Expect = 0.16
Identities = 19/52 (36%), Positives = 23/52 (44%), Gaps = 15/52 (28%)
Query: 718 EEEEEKKKKKKKKKK------KKK---------KKKKKKKKKKKKKYSNEEE 754
E + K K + +KK KK K KKKK K KK K S E+E
Sbjct: 48 EGNTQSKGKGQAQKKDPRIGSKKPIPLGVEEKVKPKKKKPKSKKPKLSPEQE 99
>gnl|CDD|220207 pfam09368, Sas10_Utp3_C, Sas10 C-terminal domain. This family
contains a C-terminal presumed domain in Sas10 which
hash been identified as a regulator of chromatin
silencing.
Length = 76
Score = 31.8 bits (73), Expect = 0.18
Identities = 10/36 (27%), Positives = 21/36 (58%)
Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
KK+KK +K K+ K +++ +++ Y E G++
Sbjct: 33 KKRKKYEKAVKRLKSQRRVVREEDGGYGGETTGIKT 68
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 35.2 bits (81), Expect = 0.18
Identities = 17/69 (24%), Positives = 36/69 (52%), Gaps = 3/69 (4%)
Query: 686 EKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKK 745
++ L + + YT ++G D EE E K K++K K+++ +KK+K++ +
Sbjct: 366 KRQELLKEYNKK-LQDYTKKLGEVKD--ETDASEEAEAKAKEEKLKQEENEKKQKEQADE 422
Query: 746 KKKYSNEEE 754
K+ ++E
Sbjct: 423 DKEKRQKDE 431
Score = 32.5 bits (74), Expect = 1.2
Identities = 12/27 (44%), Positives = 22/27 (81%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKK 743
K+EE EKK+K++ + K+K++K ++KK
Sbjct: 408 KQEENEKKQKEQADEDKEKRQKDERKK 434
Score = 32.5 bits (74), Expect = 1.2
Identities = 10/27 (37%), Positives = 21/27 (77%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKK 742
+EE E+K+K++ + K+K++K ++KK
Sbjct: 408 KQEENEKKQKEQADEDKEKRQKDERKK 434
>gnl|CDD|240578 cd12951, RRP7_Rrp7A, RRP7 domain ribosomal RNA-processing protein 7
homolog A (Rrp7A) and similar proteins. The family
corresponds to the RRP7 domain of Rrp7A, also termed
gastric cancer antigen Zg14, and similar proteins which
are yeast ribosomal RNA-processing protein 7 (Rrp7p)
homologs mainly found in Metazoans. The cellular
function of Rrp7A remains unclear currently. Rrp7A
harbors an N-terminal RNA recognition motif (RRM), also
termed RBD (RNA binding domain) or RNP
(ribonucleoprotein domain), and a C-terminal RRP7
domain.
Length = 129
Score = 33.0 bits (76), Expect = 0.19
Identities = 22/55 (40%), Positives = 29/55 (52%), Gaps = 22/55 (40%)
Query: 716 DKEEEEEKKKKKK-------------KKKKKKK---------KKKKKKKKKKKKK 748
DKEEEEEK++K+K KK ++ K K +K+KKKKKKK
Sbjct: 31 DKEEEEEKEEKEKEAEPDEDGWVTVTKKGRRPKTARKESVAAKAAEKEKKKKKKK 85
Score = 30.3 bits (69), Expect = 1.8
Identities = 20/61 (32%), Positives = 30/61 (49%), Gaps = 22/61 (36%)
Query: 713 AYPDKEEEEEKKKKK-------------KKKKKKK---------KKKKKKKKKKKKKKYS 750
Y +EEEE+++K+K KK ++ K K +K+KKKKKKK+
Sbjct: 29 EYDKEEEEEKEEKEKEAEPDEDGWVTVTKKGRRPKTARKESVAAKAAEKEKKKKKKKELK 88
Query: 751 N 751
N
Sbjct: 89 N 89
>gnl|CDD|219572 pfam07780, Spb1_C, Spb1 C-terminal domain. This presumed domain is
found at the C-terminus of a family of FtsJ-like
methyltransferases. Members of this family are involved
in 60S ribosomal biogenesis.
Length = 212
Score = 34.2 bits (79), Expect = 0.19
Identities = 20/47 (42%), Positives = 26/47 (55%), Gaps = 4/47 (8%)
Query: 827 LDKEKKKKKKKKKK----KKKKKKKKKKKKMMKKGYPDVLSQMEKEL 869
K KKKKKK+KK KK K K + K +K Y V +M+K+L
Sbjct: 157 YKKAKKKKKKRKKTYVVAKKGNKGKAGRPKGVKGKYKVVDPRMKKDL 203
Score = 28.8 bits (65), Expect = 9.2
Identities = 15/31 (48%), Positives = 17/31 (54%), Gaps = 4/31 (12%)
Query: 723 KKKKKKKKK----KKKKKKKKKKKKKKKKKY 749
KKKKK+KK KK K K + K K KY
Sbjct: 162 KKKKKRKKTYVVAKKGNKGKAGRPKGVKGKY 192
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 35.3 bits (81), Expect = 0.21
Identities = 16/30 (53%), Positives = 23/30 (76%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKK 747
EEE E+KKKK++K K+K+ KK K +K+ K
Sbjct: 15 EEELERKKKKEEKAKEKELKKLKAAQKEAK 44
Score = 34.9 bits (80), Expect = 0.23
Identities = 13/34 (38%), Positives = 24/34 (70%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
+K+ E++ ++KKKK++K K+K+ KK K +K
Sbjct: 8 AEKKILTEEELERKKKKEEKAKEKELKKLKAAQK 41
Score = 34.9 bits (80), Expect = 0.24
Identities = 16/33 (48%), Positives = 23/33 (69%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
+EE E KKKK++K K+K+ KK K +K+ K K
Sbjct: 14 TEEELERKKKKEEKAKEKELKKLKAAQKEAKAK 46
Score = 32.9 bits (75), Expect = 0.81
Identities = 15/36 (41%), Positives = 25/36 (69%)
Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
EEE ++KKKK++K K+K+ KK K +K+ K + +
Sbjct: 15 EEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQ 50
Score = 31.4 bits (71), Expect = 3.1
Identities = 15/47 (31%), Positives = 30/47 (63%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS 764
E E+K +++ ++KKKK++K K+K+ KK + ++E KL+ +
Sbjct: 6 SEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQA 52
Score = 29.9 bits (67), Expect = 7.4
Identities = 15/48 (31%), Positives = 29/48 (60%)
Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINR 874
L +E+ ++KKKK++K K+K+ KK K K+ + +Q + N+ +
Sbjct: 13 LTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPK 60
>gnl|CDD|233352 TIGR01310, L7, 60S ribosomal protein L7, eukaryotic. This model
describes the eukaryotic 60S (cytosolic) ribosomal
protein L7 and paralogs that may or may not also be L7.
Human, Drosophila, and Arabidopsis all have both a
typical L7 and an L7-related paralog. This family is
designated subfamily rather than equivalog to reflect
these uncharacterized paralogs. Members of this family
average ~ 250 residues in length, somewhat longer than
the archaeal L30P/L7E homolog (~ 155 residues) and much
longer than the related bacterial/organellar form (~ 60
residues).
Length = 235
Score = 33.9 bits (78), Expect = 0.24
Identities = 19/68 (27%), Positives = 31/68 (45%), Gaps = 4/68 (5%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKE---- 772
K +E + K+ K KKK KKK+K K+ S +E + R+ ++ K
Sbjct: 8 KARQELAVQVAKQAKAKKKANKKKRKIYFKRAESFVKEYRKAEREIIRLKRLAKKPGKFY 67
Query: 773 VPCEPQIA 780
VP E ++
Sbjct: 68 VPAEHKLV 75
Score = 30.4 bits (69), Expect = 3.7
Identities = 19/62 (30%), Positives = 30/62 (48%), Gaps = 4/62 (6%)
Query: 818 KQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINRTAV 877
K++ Q + K+ K KKK KKK+K K+ + K Y + E+E+ + R A
Sbjct: 6 KRKARQELAVQVAKQAKAKKKANKKKRKIYFKRAESFVKEY----RKAEREIIRLKRLAK 61
Query: 878 AP 879
P
Sbjct: 62 KP 63
Score = 29.6 bits (67), Expect = 5.5
Identities = 11/32 (34%), Positives = 17/32 (53%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
K + ++ + K+ K KKK KKK+K K
Sbjct: 6 KRKARQELAVQVAKQAKAKKKANKKKRKIYFK 37
>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
Length = 813
Score = 34.7 bits (80), Expect = 0.25
Identities = 14/32 (43%), Positives = 16/32 (50%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKK 746
PD EKK K K KK +K KK K +K
Sbjct: 765 PDSAFRGEKKAKPKAAKKDARKAKKPSAKTQK 796
Score = 34.3 bits (79), Expect = 0.30
Identities = 17/68 (25%), Positives = 27/68 (39%)
Query: 681 KNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKK 740
K K R +++N R + A P +++ +K KK K +K K
Sbjct: 743 KKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKKPSAKTQKIAAATK 802
Query: 741 KKKKKKKK 748
K+ KKK
Sbjct: 803 AKRAAKKK 810
Score = 31.3 bits (71), Expect = 3.1
Identities = 10/26 (38%), Positives = 15/26 (57%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKM 853
+K+ K K KK +K KK K +K+
Sbjct: 772 EKKAKPKAAKKDARKAKKPSAKTQKI 797
Score = 30.5 bits (69), Expect = 5.5
Identities = 10/38 (26%), Positives = 15/38 (39%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
K + +K +K KK K +K K K+ E
Sbjct: 776 KPKAAKKDARKAKKPSAKTQKIAAATKAKRAAKKKVAE 813
Score = 30.5 bits (69), Expect = 5.6
Identities = 13/34 (38%), Positives = 15/34 (44%)
Query: 818 KQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKK 851
K F + EKK K K KK +K KK K
Sbjct: 760 KVNFEPDSAFRGEKKAKPKAAKKDARKAKKPSAK 793
Score = 30.1 bits (68), Expect = 7.6
Identities = 11/38 (28%), Positives = 17/38 (44%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNE 752
K +++ +K KK K +K K K+ KK E
Sbjct: 776 KPKAAKKDARKAKKPSAKTQKIAAATKAKRAAKKKVAE 813
>gnl|CDD|218899 pfam06102, DUF947, Domain of unknown function (DUF947). Family of
eukaryotic proteins with unknown function.
Length = 168
Score = 33.4 bits (77), Expect = 0.25
Identities = 27/141 (19%), Positives = 51/141 (36%), Gaps = 41/141 (29%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPC 775
D E+E ++ +K KK K ++K++ K+ + + + ++ I K+
Sbjct: 58 DDYREKEIEELEKALKKTKDSEEKEELKRTLQSMKSRLKTLKNKDREREILKEHKKQ--- 114
Query: 776 EPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKK 835
E + I + K+ + YL K + KK
Sbjct: 115 -------------------------EKELIK---------EGKKPY----YLKKSEIKKL 136
Query: 836 KKKKKKKKKKKKKKKKKMMKK 856
KKK + KK K+ K ++K
Sbjct: 137 VLKKKFDELKKSKQLDKALEK 157
Score = 30.3 bits (69), Expect = 2.4
Identities = 12/35 (34%), Positives = 20/35 (57%)
Query: 710 NQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKK 744
++ K +E +K K+ K +KK+KK K+KK
Sbjct: 134 KKLVLKKKFDELKKSKQLDKALEKKRKKNAGKEKK 168
Score = 29.2 bits (66), Expect = 6.0
Identities = 13/34 (38%), Positives = 19/34 (55%), Gaps = 2/34 (5%)
Query: 717 KEEEEEKKK--KKKKKKKKKKKKKKKKKKKKKKK 748
K ++K KK K+ K +KK+KK K+KK
Sbjct: 135 KLVLKKKFDELKKSKQLDKALEKKRKKNAGKEKK 168
Score = 28.8 bits (65), Expect = 7.3
Identities = 12/32 (37%), Positives = 18/32 (56%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
K+ +KK + KK K+ K +KK+KK K
Sbjct: 134 KKLVLKKKFDELKKSKQLDKALEKKRKKNAGK 165
Score = 28.8 bits (65), Expect = 8.7
Identities = 13/40 (32%), Positives = 23/40 (57%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEK 867
D+E++ K+ KK++K+ K+ KK +KK L +K
Sbjct: 102 DREREILKEHKKQEKELIKEGKKPYYLKKSEIKKLVLKKK 141
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 34.4 bits (79), Expect = 0.26
Identities = 28/151 (18%), Positives = 46/151 (30%), Gaps = 2/151 (1%)
Query: 711 QIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPV 770
A K++ EE K+ K+K+ ++ K K K E E R A
Sbjct: 111 LAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAK--AEAEAKRAAAAAKKAAAEAK 168
Query: 771 KEVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKE 830
K+ E K E A ++ + + K++ A
Sbjct: 169 KKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAA 228
Query: 831 KKKKKKKKKKKKKKKKKKKKKKMMKKGYPDV 861
K + K +K K +K K +V
Sbjct: 229 KAAAEAKAAAEKAAAAKAAEKAAAAKAAAEV 259
Score = 33.6 bits (77), Expect = 0.42
Identities = 8/46 (17%), Positives = 16/46 (34%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRD 761
K+ E K K + K +K K + + + ++ D
Sbjct: 216 KKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKAAAEVDD 261
>gnl|CDD|236861 PRK11147, PRK11147, ABC transporter ATPase component; Reviewed.
Length = 635
Score = 34.5 bits (80), Expect = 0.29
Identities = 20/82 (24%), Positives = 33/82 (40%), Gaps = 8/82 (9%)
Query: 804 RINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKK------G 857
+I Y GY D +Q+ Q L + KKK++ K + K+ KK+ K
Sbjct: 515 KIGRYVG--GYHDARQQQAQYLALKQPAVKKKEEAAAPKAETVKRSSKKLSYKLQRELEQ 572
Query: 858 YPDVLSQMEKELANINRTAVAP 879
P +L +E E+ +
Sbjct: 573 LPQLLEDLEAEIEALQAQVADA 594
>gnl|CDD|220231 pfam09420, Nop16, Ribosome biogenesis protein Nop16. Nop16 is a
protein involved in ribosome biogenesis.
Length = 173
Score = 33.2 bits (76), Expect = 0.30
Identities = 11/27 (40%), Positives = 17/27 (62%)
Query: 722 EKKKKKKKKKKKKKKKKKKKKKKKKKK 748
+KKKK + K +K+ K+K +KKK
Sbjct: 1 VRKKKKNRSSNYKVNRKRLKRKDRKKK 27
Score = 32.4 bits (74), Expect = 0.58
Identities = 10/31 (32%), Positives = 15/31 (48%)
Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE 753
KK + K +K+ K+K +KKK N
Sbjct: 5 KKNRSSNYKVNRKRLKRKDRKKKINIIGNPI 35
Score = 32.0 bits (73), Expect = 0.80
Identities = 11/26 (42%), Positives = 16/26 (61%)
Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKK 748
KKKK + K +K+ K+K +KKK
Sbjct: 3 KKKKNRSSNYKVNRKRLKRKDRKKKI 28
Score = 29.7 bits (67), Expect = 4.3
Identities = 10/25 (40%), Positives = 15/25 (60%)
Query: 832 KKKKKKKKKKKKKKKKKKKKKMMKK 856
+KKKK + K +K+ K+K KK
Sbjct: 2 RKKKKNRSSNYKVNRKRLKRKDRKK 26
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 34.2 bits (79), Expect = 0.30
Identities = 12/44 (27%), Positives = 22/44 (50%)
Query: 711 QIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
+ E + K+K K K ++ +K + K+KKK+ EE+
Sbjct: 297 KKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEK 340
Score = 33.1 bits (76), Expect = 0.61
Identities = 13/43 (30%), Positives = 21/43 (48%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
E K+K K K ++ +K + K+KKK +EE +K
Sbjct: 301 LLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKK 343
Score = 32.3 bits (74), Expect = 1.1
Identities = 13/60 (21%), Positives = 30/60 (50%)
Query: 696 DQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
++I ++ R ++ + + K+K K K ++ +K + K+KKK+K E++
Sbjct: 283 QEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKK 342
Score = 30.4 bits (69), Expect = 4.8
Identities = 14/25 (56%), Positives = 20/25 (80%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKK 852
D EK + K+KKK+KKK++KKKK+
Sbjct: 320 DNEKLDAEVKEKKKEKKKEEKKKKQ 344
Score = 30.4 bits (69), Expect = 5.1
Identities = 21/102 (20%), Positives = 40/102 (39%), Gaps = 7/102 (6%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCE 776
K + + K+ KK + K+K K K + NE+ + KE +
Sbjct: 287 KALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKK----KEEKKK 342
Query: 777 PQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPK 818
QI I+ + D+ E + + T+++ Y+DP+
Sbjct: 343 KQIERLEERIEK---LEVQATDKEENKTVALGTSKINYIDPR 381
>gnl|CDD|227509 COG5182, CUS1, Splicing factor 3b, subunit 2 [RNA processing and
modification].
Length = 429
Score = 34.2 bits (78), Expect = 0.31
Identities = 25/153 (16%), Positives = 53/153 (34%), Gaps = 17/153 (11%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVP 774
+ + K + ++ ++KKK +N + K+ DA K+V
Sbjct: 10 SGNNQNKNASVVNNKAEIAAMIDARRLEQKKKGGVTNSKGKTNKVVDAKL--EKEFKDVL 67
Query: 775 CEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKK 834
Q+ ND K D + R+ P + A +D E +
Sbjct: 68 QRFQVQE------NDT-PKEITKDEKNMSRM------KSRKAPTI--HMEAEIDDEYGFE 112
Query: 835 KKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEK 867
+ K +++ ++KK+K ++ Y + +
Sbjct: 113 DRMKPYREESLSRQKKRKALQHRYEQLKLVVPY 145
>gnl|CDD|240576 cd12932, RRP7_like, RRP7 domain ribosomal RNA-processing protein 7
(Rrp7p), ribosomal RNA-processing protein 7 homolog A
(Rrp7A), and similar proteins. This CD corresponds to
the RRP7 domain of Rrp7p and Rrp7A. Rrp7p is encoded by
YCL031C gene from Saccharomyces cerevisiae. It is an
essential yeast protein involved in pre-rRNA processing
and ribosome assembly, and is speculated to be required
for correct assembly of rpS27 into the pre-ribosomal
particle. Rrp7A, also termed gastric cancer antigen
Zg14, is the Rrp7p homolog mainly found in Metazoans.
The cellular function of Rrp7A remains unclear
currently. Both Rrp7p and Rrp7A harbor an N-terminal RNA
recognition motif (RRM), also termed RBD (RNA binding
domain) or RNP (ribonucleoprotein domain), and a
C-terminal RRP7 domain.
Length = 118
Score = 32.2 bits (74), Expect = 0.31
Identities = 15/38 (39%), Positives = 24/38 (63%), Gaps = 7/38 (18%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKK-------KKKKKKK 745
+E E K K+K+KKKKKKK+ + ++KKK++
Sbjct: 66 AREEAVEAKAKEKEKKKKKKKELEDFYRFQIREKKKEE 103
Score = 32.2 bits (74), Expect = 0.35
Identities = 15/42 (35%), Positives = 24/42 (57%), Gaps = 7/42 (16%)
Query: 713 AYPDKEEEEEKKKKKKKKKKKKKKKKKK-------KKKKKKK 747
EE + K K+K+KKKKKKK+ + ++KKK++
Sbjct: 62 KGKTAREEAVEAKAKEKEKKKKKKKELEDFYRFQIREKKKEE 103
Score = 31.1 bits (71), Expect = 0.91
Identities = 19/60 (31%), Positives = 28/60 (46%), Gaps = 24/60 (40%)
Query: 716 DKEEEEEKKKKKKK-------------KKKKK-----------KKKKKKKKKKKKKKYSN 751
DK EEEEK++ K+ + +K K K+K+KKKKKKK+ +
Sbjct: 31 DKREEEEKEEAKEARNEPDEDGFVTVTRGGRKGKTAREEAVEAKAKEKEKKKKKKKELED 90
Score = 30.3 bits (69), Expect = 1.7
Identities = 16/44 (36%), Positives = 27/44 (61%), Gaps = 7/44 (15%)
Query: 707 GRFNQIAYPDKEEEEEKKKKKKKKKKKKKK-------KKKKKKK 743
GR + A + E + K+K+KKKKKKK+ + ++KKK++
Sbjct: 60 GRKGKTAREEAVEAKAKEKEKKKKKKKELEDFYRFQIREKKKEE 103
Score = 29.1 bits (66), Expect = 4.0
Identities = 11/23 (47%), Positives = 17/23 (73%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKK 850
+E + K K+K+KKKKKKK+ +
Sbjct: 67 REEAVEAKAKEKEKKKKKKKELE 89
>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
Genome duplication is precisely regulated by
cyclin-dependent kinases CDKs, which bring about the
onset of S phase by activating replication origins and
then prevent relicensing of origins until mitosis is
completed. The optimum sequence motif for CDK
phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
to have at least 11 potential phosphorylation sites.
Drc1 is required for DNA synthesis and S-M replication
checkpoint control. Drc1 associates with Cdc2 and is
phosphorylated at the onset of S phase when Cdc2 is
activated. Thus Cdc2 promotes DNA replication by
phosphorylating Drc1 and regulating its association with
Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
substrates required for DNA replication.
Length = 397
Score = 34.0 bits (78), Expect = 0.32
Identities = 14/36 (38%), Positives = 26/36 (72%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSN 751
D+++E+E ++ K++ +KK+K KKK +K+K SN
Sbjct: 339 DEDDEDEDDEESKEEVEKKQKVKKKPRKRKVNPVSN 374
Score = 32.8 bits (75), Expect = 0.70
Identities = 11/38 (28%), Positives = 25/38 (65%)
Query: 712 IAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
+ Y +E+++ + ++ K++ +KK+K KKK +K+K
Sbjct: 332 LGYMGGIDEDDEDEDDEESKEEVEKKQKVKKKPRKRKV 369
Score = 31.3 bits (71), Expect = 2.4
Identities = 14/39 (35%), Positives = 26/39 (66%)
Query: 706 VGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKK 744
+G I D++E++E+ K++ +KK+K KKK +K+K
Sbjct: 332 LGYMGGIDEDDEDEDDEESKEEVEKKQKVKKKPRKRKVN 370
Score = 29.4 bits (66), Expect = 8.4
Identities = 11/25 (44%), Positives = 18/25 (72%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKK 852
D E+ K++ +KK+K KKK +K+K
Sbjct: 346 DDEESKEEVEKKQKVKKKPRKRKVN 370
>gnl|CDD|219978 pfam08701, GN3L_Grn1, GNL3L/Grn1 putative GTPase. Grn1 (yeast) and
GNL3L (human) are putative GTPases which are required
for growth and play a role in processing of nucleolar
pre-rRNA. This family contains a potential nuclear
localisation signal.
Length = 80
Score = 31.1 bits (71), Expect = 0.35
Identities = 12/27 (44%), Positives = 16/27 (59%)
Query: 722 EKKKKKKKKKKKKKKKKKKKKKKKKKK 748
EKK + +K +K+ KK K KKKK
Sbjct: 7 EKKVAEHHRKLRKEAKKNPTWKSKKKK 33
Score = 29.5 bits (67), Expect = 1.6
Identities = 11/31 (35%), Positives = 26/31 (83%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
EE EEKK+K++++K+++K+ +K ++ + +K+
Sbjct: 49 EEIEEKKRKQEEEKERRKEARKAERAEARKR 79
Score = 29.1 bits (66), Expect = 2.1
Identities = 10/23 (43%), Positives = 13/23 (56%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKK 851
E +K +K+ KK K KKKK
Sbjct: 11 AEHHRKLRKEAKKNPTWKSKKKK 33
>gnl|CDD|220628 pfam10198, Ada3, Histone acetyltransferases subunit 3. Ada3 is a
family of proteins conserved from yeasts to humans. It
is an essential component of the Ada transcriptional
coactivator (alteration/deficiency in activation)
complex. Ada3 plays a key role in linking histone
acetyltransferase-containing complexes to p53 (tumour
suppressor protein) thereby regulating p53 acetylation,
stability and transcriptional activation following DNA
damage.
Length = 127
Score = 32.3 bits (74), Expect = 0.36
Identities = 17/61 (27%), Positives = 29/61 (47%), Gaps = 11/61 (18%)
Query: 818 KQRFNQIAY------LDKE-----KKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQME 866
K+ + Y LDK+ K+ + + KKKKK+K+ +K +G + S +E
Sbjct: 67 KEEMARQEYKTILDDLDKQVDQAYLKRIRARGKKKKKEKRPGAAQKAANEGQESLKSLLE 126
Query: 867 K 867
K
Sbjct: 127 K 127
Score = 29.2 bits (66), Expect = 4.8
Identities = 10/33 (30%), Positives = 16/33 (48%)
Query: 729 KKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRD 761
K+ + + KKKKK+K+ EG L+
Sbjct: 91 LKRIRARGKKKKKEKRPGAAQKAANEGQESLKS 123
>gnl|CDD|220600 pfam10147, CR6_interact, Growth arrest and DNA-damage-inducible
proteins-interacting protein 1. Members of this family
of proteins act as negative regulators of G1 to S cell
cycle phase progression by inhibiting cyclin-dependent
kinases. Inhibitory effects are additive with GADD45
proteins but occur also in the absence of GADD45
proteins. Furthermore, they act as a repressor of the
orphan nuclear receptor NR4A1 by inhibiting AB
domain-mediated transcriptional activity.
Length = 217
Score = 33.2 bits (76), Expect = 0.38
Identities = 30/141 (21%), Positives = 64/141 (45%), Gaps = 20/141 (14%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCEP 777
EEE E + ++ ++++K+K+ +++ + + + M K+ P
Sbjct: 92 EEEREWYPSLNQMLEENREQQKEKEARRQAREAEIAKNMAKM-----------------P 134
Query: 778 QIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGY-LDPK-QRFNQIAYLDKEKKKKK 835
Q+ K +K A + + + GY +DP+ RF Q KEK++KK
Sbjct: 135 QMIADWRAQKRKREQKARAAKERKERLVAEAREHFGYWVDPRDPRF-QEMLQQKEKEEKK 193
Query: 836 KKKKKKKKKKKKKKKKKMMKK 856
K K+ K+++K++K+ ++
Sbjct: 194 KVKEAKRREKEEKRMAALVAA 214
Score = 29.4 bits (66), Expect = 6.3
Identities = 10/29 (34%), Positives = 19/29 (65%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKK 744
KE+EE+KK K+ K+++K++K+
Sbjct: 186 QKEKEEKKKVKEAKRREKEEKRMAALVAA 214
>gnl|CDD|218396 pfam05039, Agouti, Agouti protein. The agouti protein regulates
pigmentation in the mouse hair follicle producing a
black hair with a subapical yellow band. A highly
homologous protein agouti signal protein (ASIP)is
present in humans and is expressed at highest levels in
adipose tissue where it may play a role in energy
homeostasis and possibly human pigmentation.
Length = 90
Score = 31.4 bits (71), Expect = 0.38
Identities = 18/64 (28%), Positives = 24/64 (37%), Gaps = 6/64 (9%)
Query: 728 KKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIK 787
K KK K + ++KKK K E + R +C + E C PC
Sbjct: 16 LNKGLKKFNKNEAEEKKKSSKKEAEALTEAQGRPPPPRRCVRLHE-SCLGPQTPCC---- 70
Query: 788 NDPC 791
DPC
Sbjct: 71 -DPC 73
>gnl|CDD|234707 PRK00270, rpsU, 30S ribosomal protein S21; Reviewed.
Length = 64
Score = 30.6 bits (70), Expect = 0.39
Identities = 9/26 (34%), Positives = 17/26 (65%)
Query: 830 EKKKKKKKKKKKKKKKKKKKKKKMMK 855
EK +K+K+KK +K+++KK +
Sbjct: 39 EKPSEKRKRKKAAARKRRRKKLAREE 64
Score = 29.8 bits (68), Expect = 0.76
Identities = 9/24 (37%), Positives = 17/24 (70%)
Query: 722 EKKKKKKKKKKKKKKKKKKKKKKK 745
EK +K+K+KK +K+++KK +
Sbjct: 39 EKPSEKRKRKKAAARKRRRKKLAR 62
Score = 27.5 bits (62), Expect = 4.9
Identities = 8/22 (36%), Positives = 15/22 (68%)
Query: 728 KKKKKKKKKKKKKKKKKKKKKY 749
K +K+K+KK +K+++KK
Sbjct: 40 KPSEKRKRKKAAARKRRRKKLA 61
Score = 27.5 bits (62), Expect = 5.2
Identities = 10/25 (40%), Positives = 18/25 (72%)
Query: 729 KKKKKKKKKKKKKKKKKKKKYSNEE 753
K +K+K+KK +K+++KK + EE
Sbjct: 40 KPSEKRKRKKAAARKRRRKKLAREE 64
Score = 26.8 bits (60), Expect = 9.8
Identities = 7/20 (35%), Positives = 15/20 (75%)
Query: 721 EEKKKKKKKKKKKKKKKKKK 740
E++K+KK +K+++KK +
Sbjct: 43 EKRKRKKAAARKRRRKKLAR 62
>gnl|CDD|223625 COG0551, TopA, Zn-finger domain associated with topoisomerase type
I [DNA replication, recombination, and repair].
Length = 140
Score = 32.1 bits (73), Expect = 0.46
Identities = 28/149 (18%), Positives = 43/149 (28%), Gaps = 21/149 (14%)
Query: 714 YPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEV 773
++E + K K + K K KK K + P +
Sbjct: 2 IAEEEVKVVLKDLKTGQICPKCGKNMVKKFGKYGIF-------------LGCSNYPKCDY 48
Query: 774 PCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVG--YLDPKQRFNQIAYLDKEK 831
+ + + C K V + + PK RF KEK
Sbjct: 49 YEPEKA---IAEKTGVKCPKCGKGL--LVLKKGRFGKNFLGCSNYPKCRFT-EKPKPKEK 102
Query: 832 KKKKKKKKKKKKKKKKKKKKKMMKKGYPD 860
K K +K +KK+KKK + P
Sbjct: 103 KCPKCGSRKLVEKKEKKKGVFLECSNKPC 131
Score = 31.4 bits (71), Expect = 0.99
Identities = 18/79 (22%), Positives = 30/79 (37%), Gaps = 7/79 (8%)
Query: 684 PCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKK 743
C K + + + YP E+ K K+KK K +K +KK+
Sbjct: 62 KCPKCGKGL--LVLKKGRFGKNFLGCSN--YPKCRFTEKPKPKEKKCPKCGSRKLVEKKE 117
Query: 744 KKKKKY---SNEEEGMRKL 759
KKK + SN+ ++
Sbjct: 118 KKKGVFLECSNKPCCLKVE 136
>gnl|CDD|218435 pfam05104, Rib_recp_KP_reg, Ribosome receptor lysine/proline rich
region. This highly conserved region is found towards
the C-terminus of the transmembrane domain. The function
is unclear.
Length = 151
Score = 32.3 bits (73), Expect = 0.47
Identities = 15/32 (46%), Positives = 22/32 (68%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
KE + + +K KKKK+K +KK K KKK++K
Sbjct: 15 KESGKTQSQKSDKKKKEKVSEKKGKSKKKEEK 46
Score = 31.5 bits (71), Expect = 0.84
Identities = 23/72 (31%), Positives = 32/72 (44%), Gaps = 13/72 (18%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQ---------- 766
K+ +E K + +K KKKK+K +KK K KKK EE+ K+ + Q
Sbjct: 12 KQRKESGKTQSQKSDKKKKEKVSEKKGKSKKK---EEKPNGKIPEHEPNQEVTEVEVIIE 68
Query: 767 CGPVKEVPCEPQ 778
PV V P
Sbjct: 69 KEPVPAVAVAPV 80
Score = 29.6 bits (66), Expect = 4.7
Identities = 14/32 (43%), Positives = 20/32 (62%)
Query: 822 NQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKM 853
Q DK+KK+K +KK K KKK++K K+
Sbjct: 20 TQSQKSDKKKKEKVSEKKGKSKKKEEKPNGKI 51
>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6. RPL6
contains KOW motif that has an extra ribosomal role as
an oncogenic. KOW domain is known as an RNA-binding
motif that is shared so far among some families of
ribosomal proteins, the essential bacterial
transcriptional elongation factor NusG, the eukaryotic
chromatin elongation factor Spt5, the higher eukaryotic
KIN17 proteins and Mtr4. .
Length = 152
Score = 32.1 bits (74), Expect = 0.49
Identities = 21/51 (41%), Positives = 32/51 (62%), Gaps = 3/51 (5%)
Query: 724 KKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVP 774
K+KKKKKKKKK+ + ++KKKK +E + + DAA + +K+VP
Sbjct: 82 KRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKAV-DAALLAA--IKKVP 129
Score = 31.0 bits (71), Expect = 1.3
Identities = 16/38 (42%), Positives = 23/38 (60%), Gaps = 6/38 (15%)
Query: 825 AYLDKEKKKKKKKK------KKKKKKKKKKKKKKMMKK 856
AY ++KKKKKKKK +KKKK +++K+ K
Sbjct: 79 AYFKRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKA 116
Score = 28.3 bits (64), Expect = 9.7
Identities = 11/32 (34%), Positives = 23/32 (71%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
K ++++KKKKK+ + ++KKKK +++K+
Sbjct: 82 KRKKKKKKKKKEGEFFEEKKKKYVVSEERKED 113
>gnl|CDD|218517 pfam05236, TAF4, Transcription initiation factor TFIID component
TAF4 family. This region of similarity is found in
Transcription initiation factor TFIID component TAF4.
Length = 255
Score = 33.1 bits (76), Expect = 0.49
Identities = 19/73 (26%), Positives = 36/73 (49%), Gaps = 6/73 (8%)
Query: 703 TTEVGR-FNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRD 761
++V R +A KEEEE ++ +++++ + ++ + K+K K++ E E R R
Sbjct: 108 RSDVRRQLRFLAQKQKEEEE-RRVERRRELGLEDPEQLRLKQKAKEEQKAESEETRH-RA 165
Query: 762 A---ASIQCGPVK 771
A A G K
Sbjct: 166 ANATALAAIGGRK 178
>gnl|CDD|240577 cd12950, RRP7_Rrp7p, RRP7 domain ribosomal RNA-processing protein 7
(Rrp7p) and similar proteins. This CD corresponds to
the RRP7 domain of Rrp7p. Rrp7p is encoded by YCL031C
gene from Saccharomyces cerevisiae. It is an essential
yeast protein involved in pre-rRNA processing and
ribosome assembly. Rrp7p contains an N-terminal RNA
recognition motif (RRM), also termed RBD (RNA binding
domain) or RNP (ribonucleoprotein domain), and a
C-terminal RRP7 domain.
Length = 128
Score = 31.9 bits (73), Expect = 0.50
Identities = 17/45 (37%), Positives = 26/45 (57%), Gaps = 9/45 (20%)
Query: 713 AYPDKEEEEEKKKKKKKKKKKKKKKKKKKKK---------KKKKK 748
P EE K +++KK+K+KKKKKKK+ + KKK++
Sbjct: 62 KGPAAGEEAGKAAEEEKKEKEKKKKKKKELEDFYRFQLREKKKEE 106
>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
Length = 246
Score = 33.1 bits (76), Expect = 0.51
Identities = 11/44 (25%), Positives = 22/44 (50%)
Query: 711 QIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
I P+ + + + K K K KK +++ K++ K + + S E
Sbjct: 91 VIPKPEPKPKPKPKPKPKPVKKVEEQPKREVKPVEPRPASPFEN 134
Score = 32.3 bits (74), Expect = 0.93
Identities = 13/54 (24%), Positives = 19/54 (35%)
Query: 830 EKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINRTAVAPINKP 883
E K K K K K K KK +++ K K + + A T+
Sbjct: 96 EPKPKPKPKPKPKPVKKVEEQPKREVKPVEPRPASPFENTAPARPTSSTATAAA 149
Score = 32.0 bits (73), Expect = 1.0
Identities = 12/34 (35%), Positives = 18/34 (52%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
P + E K K K K K K KK +++ K++ K
Sbjct: 89 PVVIPKPEPKPKPKPKPKPKPVKKVEEQPKREVK 122
Score = 31.2 bits (71), Expect = 2.2
Identities = 10/37 (27%), Positives = 19/37 (51%)
Query: 712 IAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
+ P E + + K K K K KK +++ K++ K +
Sbjct: 90 VVIPKPEPKPKPKPKPKPKPVKKVEEQPKREVKPVEP 126
Score = 30.4 bits (69), Expect = 3.6
Identities = 11/44 (25%), Positives = 22/44 (50%)
Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS 764
+ K K K K KK +++ K++ K + + ++ E R +S
Sbjct: 100 KPKPKPKPKPVKKVEEQPKREVKPVEPRPASPFENTAPARPTSS 143
Score = 29.7 bits (67), Expect = 7.1
Identities = 11/34 (32%), Positives = 16/34 (47%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
P + K + K K K K K K KK +++ K
Sbjct: 85 PKEAPVVIPKPEPKPKPKPKPKPKPVKKVEEQPK 118
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 32.7 bits (75), Expect = 0.52
Identities = 16/54 (29%), Positives = 34/54 (62%), Gaps = 3/54 (5%)
Query: 708 RFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKY---SNEEEGMRK 758
R Q ++E EE KK ++K++ ++K++++ +++++KKK+ EE RK
Sbjct: 17 RRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARK 70
Score = 31.2 bits (71), Expect = 1.5
Identities = 13/44 (29%), Positives = 34/44 (77%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLR 760
+E EEE+++++KK ++K++ ++K++++ ++++ +EE RK R
Sbjct: 21 REAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKER 64
Score = 30.8 bits (70), Expect = 2.1
Identities = 13/46 (28%), Positives = 32/46 (69%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAA 763
EEEE +++KK ++K++ ++K++++ +++++K EEE + A
Sbjct: 24 EEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQAR 69
Score = 30.4 bits (69), Expect = 2.5
Identities = 17/46 (36%), Positives = 33/46 (71%), Gaps = 2/46 (4%)
Query: 716 DKEEEEEKKKK--KKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKL 759
+ EEE EKKK+ ++K+++++ +K++++ +K K + EEEG KL
Sbjct: 47 ELEEEREKKKEEEERKEREEQARKEQEEYEKLKSSFVVEEEGTDKL 92
>gnl|CDD|220121 pfam09135, Alb1, Alb1. Alb1 is a nuclear shuttling factor involved
in ribosome biogenesis.
Length = 112
Score = 31.7 bits (72), Expect = 0.52
Identities = 12/40 (30%), Positives = 24/40 (60%), Gaps = 2/40 (5%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEK 867
+KK++KKK K+ +K++K++KK + + V + K
Sbjct: 46 VSKKKQRKKKPKRLTRKQRKRQKKGLERAE--IVTDTLSK 83
Score = 30.9 bits (70), Expect = 0.77
Identities = 14/38 (36%), Positives = 22/38 (57%)
Query: 722 EKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKL 759
E KKK++KKK K+ +K++K++KK E L
Sbjct: 44 EGVSKKKQRKKKPKRLTRKQRKRQKKGLERAEIVTDTL 81
Score = 30.5 bits (69), Expect = 1.1
Identities = 12/27 (44%), Positives = 20/27 (74%)
Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKK 747
E KKK++KKK K+ +K++K++KK
Sbjct: 44 EGVSKKKQRKKKPKRLTRKQRKRQKKG 70
Score = 30.2 bits (68), Expect = 1.6
Identities = 12/27 (44%), Positives = 20/27 (74%)
Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKKK 746
E KKK++KKK K+ +K++K++KK
Sbjct: 44 EGVSKKKQRKKKPKRLTRKQRKRQKKG 70
Score = 30.2 bits (68), Expect = 1.7
Identities = 13/29 (44%), Positives = 21/29 (72%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKK 745
K E KKK++KKK K+ +K++K++KK
Sbjct: 42 KNEGVSKKKQRKKKPKRLTRKQRKRQKKG 70
Score = 28.6 bits (64), Expect = 5.2
Identities = 10/25 (40%), Positives = 20/25 (80%)
Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKK 851
+ K+K++KKK K+ +K++K++KK
Sbjct: 46 VSKKKQRKKKPKRLTRKQRKRQKKG 70
>gnl|CDD|223831 COG0760, SurA, Parvulin-like peptidyl-prolyl isomerase
[Posttranslational modification, protein turnover,
chaperones].
Length = 320
Score = 33.1 bits (75), Expect = 0.53
Identities = 23/149 (15%), Positives = 43/149 (28%), Gaps = 25/149 (16%)
Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCEPQ 778
+ E K K+ KK ++ K + KK S + G + + Q
Sbjct: 177 KAEAKAKEALALLKKGVREAKADFAELAKKQSEDPSSKNG---------GGLLGWNKKGQ 227
Query: 779 IAPCLFDIKNDPCEKNNLA---------------DRSEVQRINHYTTEVGYLDPKQRFNQ 823
+ P + ++ + + + V + KQ Q
Sbjct: 228 LVPEFRKAAFI-LKVGEVSAPVKTSFGYHIIKVEKKRDAKLDFPEVKAVKEISLKQELLQ 286
Query: 824 IAYLDKEKKKKKKKKKKKKKKKKKKKKKK 852
A L+K K ++K K K+
Sbjct: 287 KAALEKLLKGEQKSFDKLLKEAAIDILSA 315
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 33.5 bits (76), Expect = 0.53
Identities = 27/171 (15%), Positives = 56/171 (32%), Gaps = 43/171 (25%)
Query: 700 NHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKL 759
N + G +I +EE + KKK + K+ KK + EEG +
Sbjct: 40 NKFLLAAGSRKKIESALAVDEEPDENGAVSKKKPTRSVKRATKKTVVEISEPLEEGSELV 99
Query: 760 RDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQ 819
+ + D+ + P++
Sbjct: 100 VNEDAA-------------------------------LDKESKKT------------PRR 116
Query: 820 RFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELA 870
+ A + +++K +KK +K++K KK + + +G +S +E+
Sbjct: 117 TRRKAAAASSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSESEVSDVEESEF 167
>gnl|CDD|217834 pfam03998, Utp11, Utp11 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 239
Score = 33.1 bits (76), Expect = 0.56
Identities = 13/60 (21%), Positives = 24/60 (40%)
Query: 690 LADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
+ + + + R ++ ++ E +++ KK K KKKK K K K K
Sbjct: 175 KSAKKKRKLYKELKERKEREKKLKKVEQRLELQRELMKKGKGKKKKIVKDKDGKVVYKWK 234
Score = 32.3 bits (74), Expect = 0.89
Identities = 13/35 (37%), Positives = 18/35 (51%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNE 752
+ E +++ KK K KKKK K K K K+ E
Sbjct: 202 QRLELQRELMKKGKGKKKKIVKDKDGKVVYKWKKE 236
Score = 29.6 bits (67), Expect = 6.0
Identities = 13/53 (24%), Positives = 25/53 (47%)
Query: 714 YPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQ 766
D+ E + + +K +K+KKK KKK+K Y +E + + ++
Sbjct: 149 LLDRRENRPRISQLEKTSLVDEKQKKKSAKKKRKLYKELKERKEREKKLKKVE 201
Score = 29.6 bits (67), Expect = 7.0
Identities = 31/171 (18%), Positives = 64/171 (37%), Gaps = 30/171 (17%)
Query: 712 IAYPDKEEEEEKKKKKKK------------KKKKKKKKKKKKKK-----------KKKKK 748
+ +EE+E K++ K K++ + KK +K K+ K
Sbjct: 67 VHILKREEKEVLSKEQVKLLKTQDLNYVRTKRQAEAKKIEKLKEELHLTDSGSAGPKHIV 126
Query: 749 YSNEEEGMRKLRDAASIQCGP--VKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRIN 806
+ ++EE + A P + P+I+ ++ +S ++
Sbjct: 127 FVDDEEEQKSFDPAEYFDTTPELLDRRENRPRIS----QLEKTSLVDEKQKKKSAKKKRK 182
Query: 807 HYTTEVGYLDPKQRFNQI-AYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
Y + +++ ++ L+ +++ KK K KKKK K K K + K
Sbjct: 183 LYKELKERKEREKKLKKVEQRLELQRELMKKGKGKKKKIVKDKDGKVVYKW 233
>gnl|CDD|153340 cd07656, F-BAR_srGAP, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Slit-Robo GTPase
Activating Proteins. F-BAR domains are dimerization
modules that bind and bend membranes and are found in
proteins involved in membrane dynamics and actin
reorganization. Slit-Robo GTPase Activating Proteins
(srGAPs) are Rho GAPs that interact with Robo1, the
transmembrane receptor of Slit proteins. Slit proteins
are secreted proteins that control axon guidance and the
migration of neurons and leukocytes. Vertebrates contain
three isoforms of srGAPs, all of which are expressed
during embryonic and early development in the nervous
system but with different localization and timing.
srGAPs contain an N-terminal F-BAR domain, a Rho GAP
domain, and a C-terminal SH3 domain. F-BAR domains form
banana-shaped dimers with a positively-charged concave
surface that binds to negatively-charged lipid
membranes. They can induce membrane deformation in the
form of long tubules.
Length = 241
Score = 32.7 bits (75), Expect = 0.62
Identities = 12/36 (33%), Positives = 25/36 (69%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSN 751
+EE++E+ +KK ++ + KK +K+ +K++ KYS
Sbjct: 159 KQEEKQEQSPEKKLERSRSSKKIEKEVEKRQAKYSE 194
>gnl|CDD|218797 pfam05890, Ebp2, Eukaryotic rRNA processing protein EBP2. This
family consists of several Eukaryotic rRNA processing
protein EBP2 sequences. Ebp2p is required for the
maturation of 25S rRNA and 60S subunit assembly. Ebp2p
may be one of the target proteins of Rrs1p for executing
the signal to regulate ribosome biogenesis. This family
also plays a role in chromosome segregation.
Length = 270
Score = 32.7 bits (75), Expect = 0.63
Identities = 10/33 (30%), Positives = 15/33 (45%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
D ++ K KKK +K + K K+K K
Sbjct: 187 DFLVDDAAKGGSKKKGRKGGAARGKPNAKRKAK 219
Score = 32.7 bits (75), Expect = 0.69
Identities = 9/32 (28%), Positives = 15/32 (46%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
+ + KKK +K + K K+K K K+
Sbjct: 192 DAAKGGSKKKGRKGGAARGKPNAKRKAKDAKF 223
Score = 30.0 bits (68), Expect = 4.5
Identities = 10/32 (31%), Positives = 14/32 (43%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKK 747
D + KKK +K + K K+K K K
Sbjct: 191 DDAAKGGSKKKGRKGGAARGKPNAKRKAKDAK 222
Score = 30.0 bits (68), Expect = 5.1
Identities = 9/46 (19%), Positives = 16/46 (34%)
Query: 710 NQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
+ + K KKK +K + K K+K + + G
Sbjct: 179 GGDEGDLFDFLVDDAAKGGSKKKGRKGGAARGKPNAKRKAKDAKFG 224
>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
Length = 1437
Score = 33.7 bits (78), Expect = 0.64
Identities = 14/50 (28%), Positives = 23/50 (46%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQ 766
KEEE+EK K+ + KK + +KKK+ K + +K+
Sbjct: 179 KEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQIGKKIDKEEITP 228
Score = 32.5 bits (75), Expect = 1.4
Identities = 13/70 (18%), Positives = 28/70 (40%), Gaps = 1/70 (1%)
Query: 805 INHYTTEVGYLDPKQRFNQ-IAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLS 863
+ D K+ + A ++E +K K+ + KK + +KKK+ +
Sbjct: 156 FGILKIDFEIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPV 215
Query: 864 QMEKELANIN 873
Q+ K++
Sbjct: 216 QIGKKIDKEE 225
Score = 32.1 bits (74), Expect = 2.0
Identities = 14/53 (26%), Positives = 25/53 (47%)
Query: 713 AYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASI 765
+ ++EEE++K K+ + KK + +KKK+ K + K D I
Sbjct: 174 KFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQIGKKIDKEEI 226
Score = 29.8 bits (68), Expect = 8.9
Identities = 11/39 (28%), Positives = 23/39 (58%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
+ E+ E +K+++ +K K+ + KK + +KKK S +
Sbjct: 171 ELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFD 209
>gnl|CDD|240256 PTZ00071, PTZ00071, 40S ribosomal protein S24; Provisional.
Length = 132
Score = 31.5 bits (72), Expect = 0.66
Identities = 10/26 (38%), Positives = 16/26 (61%)
Query: 831 KKKKKKKKKKKKKKKKKKKKKKMMKK 856
K +K+ K ++KK + K+K K KK
Sbjct: 107 KARKELKNRRKKVRGKEKTKVSGKKK 132
Score = 30.4 bits (69), Expect = 1.9
Identities = 11/23 (47%), Positives = 15/23 (65%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKK 851
KE K ++KK + K+K K KKK
Sbjct: 110 KELKNRRKKVRGKEKTKVSGKKK 132
Score = 29.6 bits (67), Expect = 3.0
Identities = 11/26 (42%), Positives = 17/26 (65%)
Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKK 748
K +K+ K ++KK + K+K K KKK
Sbjct: 107 KARKELKNRRKKVRGKEKTKVSGKKK 132
Score = 29.6 bits (67), Expect = 3.5
Identities = 10/29 (34%), Positives = 18/29 (62%)
Query: 722 EKKKKKKKKKKKKKKKKKKKKKKKKKKYS 750
+KK +K +K+ K ++KK + K+K K
Sbjct: 101 KKKAGRKARKELKNRRKKVRGKEKTKVSG 129
Score = 28.5 bits (64), Expect = 9.0
Identities = 11/26 (42%), Positives = 18/26 (69%)
Query: 724 KKKKKKKKKKKKKKKKKKKKKKKKKY 749
KKKK +K +K+ K ++KK + K+K
Sbjct: 100 KKKKAGRKARKELKNRRKKVRGKEKT 125
>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
biogenesis [Translation, ribosomal structure and
biogenesis].
Length = 1077
Score = 33.6 bits (76), Expect = 0.68
Identities = 11/47 (23%), Positives = 31/47 (65%), Gaps = 2/47 (4%)
Query: 716 DKEEEEEKKKKKKKKKKK--KKKKKKKKKKKKKKKYSNEEEGMRKLR 760
D+EE+E + ++ K+++ KK+K+++++ +K + +E ++L+
Sbjct: 1028 DQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEMAKKRLK 1074
>gnl|CDD|221012 pfam11169, DUF2956, Protein of unknown function (DUF2956). This
family of proteins with unknown function appears to be
restricted to Gammaproteobacteria.
Length = 103
Score = 31.1 bits (71), Expect = 0.69
Identities = 11/34 (32%), Positives = 24/34 (70%)
Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
E KK++K K ++ K +K++ K K+++ +N++E
Sbjct: 37 AEYKKQQKAKAREADKARKQQLKAKQRQAANDDE 70
Score = 30.0 bits (68), Expect = 1.8
Identities = 11/36 (30%), Positives = 24/36 (66%)
Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
E KK++K K ++ K +K++ K K+++ +++EE
Sbjct: 37 AEYKKQQKAKAREADKARKQQLKAKQRQAANDDEED 72
>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3. This family of
proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). SH3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of SH3, AAPs are retained in the ER.
Length = 196
Score = 32.3 bits (74), Expect = 0.72
Identities = 10/28 (35%), Positives = 16/28 (57%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKK 743
EE E K+++ + +KK+ KKKK
Sbjct: 169 AAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 32.3 bits (74), Expect = 0.80
Identities = 10/27 (37%), Positives = 16/27 (59%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKK 744
EE E K+++ + +KK+ KKKK
Sbjct: 170 AEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 31.5 bits (72), Expect = 1.4
Identities = 9/30 (30%), Positives = 17/30 (56%)
Query: 713 AYPDKEEEEEKKKKKKKKKKKKKKKKKKKK 742
+ +E E K+++ + +KK+ KKKK
Sbjct: 167 EFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 31.1 bits (71), Expect = 1.7
Identities = 9/27 (33%), Positives = 16/27 (59%)
Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKK 745
EE + K+++ + +KK+ KKKK
Sbjct: 170 AEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 31.1 bits (71), Expect = 1.9
Identities = 9/33 (27%), Positives = 18/33 (54%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
EE ++ + K+++ + +KK+ KKKK
Sbjct: 164 LLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
>gnl|CDD|198301 cd03192, GST_C_Sigma_like, C-terminal, alpha helical domain of
Class Sigma-like Glutathione S-transferases.
Glutathione S-transferase (GST) C-terminal domain
family, Class Sigma_like; composed of GSTs belonging to
class Sigma and similar proteins, including GSTs from
class Mu, Pi, and Alpha. GSTs are cytosolic dimeric
proteins involved in cellular detoxification by
catalyzing the conjugation of glutathione (GSH) with a
wide range of endogenous and xenobiotic alkylating
agents, including carcinogens, therapeutic drugs,
environmental toxins, and products of oxidative stress.
The GST fold contains an N-terminal thioredoxin-fold
domain and a C-terminal alpha helical domain, with an
active site located in a cleft between the two domains.
GSH binds to the N-terminal domain while the hydrophobic
substrate occupies a pocket in the C-terminal domain.
Vertebrate class Sigma GSTs are characterized as
GSH-dependent hematopoietic prostaglandin (PG) D
synthases and are responsible for the production of PGD2
by catalyzing the isomerization of PGH2. The functions
of PGD2 include the maintenance of body temperature,
inhibition of platelet aggregation, bronchoconstriction,
vasodilation, and mediation of allergy and inflammation.
Other class Sigma-like members include the class II
insect GSTs, S-crystallins from cephalopods,
nematode-specific GSTs, and 28-kDa GSTs from parasitic
flatworms. Drosophila GST2 is associated with indirect
flight muscle and exhibits preference for catalyzing GSH
conjugation to lipid peroxidation products, indicating
an anti-oxidant role. S-crystallin constitutes the major
lens protein in cephalopod eyes and is responsible for
lens transparency and proper refractive index. The
28-kDa GST from Schistosoma is a multifunctional enzyme,
exhibiting GSH transferase, GSH peroxidase, and PGD2
synthase activities, and may play an important role in
host-parasite interactions. Members also include novel
GSTs from the fungus Cunninghamella elegans, designated
as class Gamma, and from the protozoan Blepharisma
japonicum, described as a light-inducible GST.
Length = 104
Score = 31.1 bits (71), Expect = 0.73
Identities = 10/42 (23%), Positives = 20/42 (47%)
Query: 833 KKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINR 874
+ + ++KK+KKK+ +++ P L + EK L
Sbjct: 19 EFAPYFYEPDGEEKKEKKKEFLEEALPKFLGKFEKILKKSGG 60
>gnl|CDD|221161 pfam11661, DUF2986, Protein of unknown function (DUF2986). This
family of proteins has no known function.
Length = 44
Score = 29.2 bits (66), Expect = 0.75
Identities = 13/27 (48%), Positives = 14/27 (51%)
Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKY 749
KKK +K KKK KK K K KY
Sbjct: 4 KKKINQKLKKKAKKANAKLHPSNKPKY 30
>gnl|CDD|225816 COG3277, GAR1, RNA-binding protein involved in rRNA processing
[Translation, ribosomal structure and biogenesis].
Length = 98
Score = 30.8 bits (70), Expect = 0.78
Identities = 9/40 (22%), Positives = 18/40 (45%)
Query: 709 FNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
++ + + K +KK+K +KK++ KK K
Sbjct: 59 DDRDVKLESLVGDTLYIPPDKLIRKKRKLPRKKRRPKKPK 98
Score = 30.0 bits (68), Expect = 1.6
Identities = 10/29 (34%), Positives = 17/29 (58%)
Query: 822 NQIAYLDKEKKKKKKKKKKKKKKKKKKKK 850
Y+ +K +KK+K +KK++ KK K
Sbjct: 70 GDTLYIPPDKLIRKKRKLPRKKRRPKKPK 98
>gnl|CDD|200340 TIGR03927, T7SS_EssA_Firm, type VII secretion protein EssA.
Members of this family are associated with type VII
secretion of WXG100 family targets in the Firmicutes,
but not in the Actinobacteria. This highly divergent
protein family consists largely of a central region of
highly polar low-complexity sequence containing
occasional LF motifs in weak repeats about 17 residues
in length, flanked by hydrophobic N- and C-terminal
regions [Protein fate, Protein and peptide secretion and
trafficking].
Length = 150
Score = 31.6 bits (72), Expect = 0.78
Identities = 10/36 (27%), Positives = 21/36 (58%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSN 751
+EE E K+ +++KK +K K+K ++++ N
Sbjct: 43 LQEETELDKELFTPEEQKKITFQKHKEKPEQEELKN 78
Score = 30.0 bits (68), Expect = 2.9
Identities = 11/57 (19%), Positives = 25/57 (43%), Gaps = 1/57 (1%)
Query: 699 INHY-TTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
N Y ++ + E ++E +++KK +K K+K ++++ K E
Sbjct: 28 PNRYEKKDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKNQLFSEN 84
Score = 29.3 bits (66), Expect = 5.2
Identities = 18/98 (18%), Positives = 41/98 (41%), Gaps = 20/98 (20%)
Query: 679 DIKNDPCEKNNLADRSEDQRI-NHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKK 737
+K + EK ++ ++ + E+ F EE+++ +K K+K ++++
Sbjct: 25 QVKPNRYEKKDIEINTDYLQEETELDKEL--FT------PEEQKKITFQKHKEKPEQEEL 76
Query: 738 KKK-----------KKKKKKKKYSNEEEGMRKLRDAAS 764
K + K KK+ +S+E E ++ S
Sbjct: 77 KNQLFSENATENNTVKATKKQLFSSEYEQTSSSSESTS 114
>gnl|CDD|217479 pfam03297, Ribosomal_S25, S25 ribosomal protein.
Length = 105
Score = 30.8 bits (70), Expect = 0.78
Identities = 11/25 (44%), Positives = 14/25 (56%)
Query: 726 KKKKKKKKKKKKKKKKKKKKKKKYS 750
KK++KK K K KKKK+S
Sbjct: 4 KKQQKKAAKAAAASAGGKAKKKKWS 28
Score = 30.0 bits (68), Expect = 1.6
Identities = 13/37 (35%), Positives = 19/37 (51%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSN 751
P K+++++ K K KKKK K K K K +N
Sbjct: 2 PPKKQQKKAAKAAAASAGGKAKKKKWSKGKVKDKLNN 38
Score = 28.5 bits (64), Expect = 6.2
Identities = 11/27 (40%), Positives = 13/27 (48%)
Query: 831 KKKKKKKKKKKKKKKKKKKKKKMMKKG 857
KK++KK K K KKK KG
Sbjct: 4 KKQQKKAAKAAAASAGGKAKKKKWSKG 30
>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
Length = 895
Score = 33.3 bits (76), Expect = 0.79
Identities = 40/190 (21%), Positives = 75/190 (39%), Gaps = 32/190 (16%)
Query: 689 NLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
L + + INHY + R + E E + K +K KK+K+ + ++ K
Sbjct: 461 TLGEEKSNHIINHYNEKKSRLEE----KIREIEIEVKDIDEKIVDLKKRKEYLESEEINK 516
Query: 749 YSNEEEGMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHY 808
NE + R L DIK E + D+ E + N Y
Sbjct: 517 SINEYNKIESARAD--------------------LEDIKIKINELKDKHDKYE-EIKNRY 555
Query: 809 TT-EVGYLDPKQR-----FNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVL 862
+ ++ LD K+ I+ +D E + + + KK+ + + +++ + G+PD
Sbjct: 556 KSLKLEDLDSKRTSWLNALAVISLIDIETNRSRSNEIKKQLNDLESRLQEI-EIGFPDDK 614
Query: 863 SQMEKELANI 872
S ++K + I
Sbjct: 615 SYIDKSIREI 624
>gnl|CDD|190559 pfam03195, DUF260, Protein of unknown function DUF260.
Length = 101
Score = 30.6 bits (70), Expect = 0.82
Identities = 14/55 (25%), Positives = 21/55 (38%), Gaps = 12/55 (21%)
Query: 256 PYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSII 310
PY P + P + N+H+ F S +L L EQR S++
Sbjct: 20 PYFPAEQPARFANVHKL---FGASNVTKLLKALPP---------EQRDDAMRSLL 62
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 33.1 bits (75), Expect = 0.85
Identities = 29/151 (19%), Positives = 63/151 (41%), Gaps = 15/151 (9%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCE 776
KE E ++ K+ ++ K++ KK+ K ++K + D A Q V++ E
Sbjct: 204 KERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQ-------DNADKQRDEVRQKQQE 256
Query: 777 PQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKK 836
+ P P + ++ + +V E ++ K+ ++ A K+ K
Sbjct: 257 AKNLP-------KPADTSSPKEDKQVAENQKREIEKAQIEIKKN-DEEALKAKDHKAFDL 308
Query: 837 KKKKKKKKKKKKKKKKMMKKGYPDVLSQMEK 867
K++ K +K+ + K+ +K V ++K
Sbjct: 309 KQESKASEKEAEDKELEAQKKREPVAEDLQK 339
>gnl|CDD|216298 pfam01101, HMG14_17, HMG14 and HMG17.
Length = 93
Score = 30.6 bits (69), Expect = 0.86
Identities = 18/40 (45%), Positives = 22/40 (55%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
P K E + KK KKKK K KK K KK+ K+ +EE
Sbjct: 31 PPKPEPKPKKAAAKKKKPAVKGKKGAKGKKETKQEEAKEE 70
>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein.
Length = 198
Score = 32.0 bits (73), Expect = 0.89
Identities = 14/33 (42%), Positives = 18/33 (54%)
Query: 722 EKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
K+ KKK+ +K KK KKK K+ EEE
Sbjct: 48 PKQPKKKRPTTPRKPATTKKSKKKDKEKLTEEE 80
Score = 31.7 bits (72), Expect = 1.3
Identities = 12/45 (26%), Positives = 21/45 (46%)
Query: 710 NQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
I P ++ ++K+ +K KK KKK K+K ++ E
Sbjct: 41 EDIEIPSPKQPKKKRPTTPRKPATTKKSKKKDKEKLTEEEKKPES 85
Score = 30.1 bits (68), Expect = 4.0
Identities = 15/41 (36%), Positives = 20/41 (48%)
Query: 714 YPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
+P+ E K+ KKK+ +K KK KKK K EE
Sbjct: 39 FPEDIEIPSPKQPKKKRPTTPRKPATTKKSKKKDKEKLTEE 79
Score = 29.3 bits (66), Expect = 7.2
Identities = 13/44 (29%), Positives = 20/44 (45%)
Query: 830 EKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANIN 873
+KK+ +K KK KKK K+ + + S +K N N
Sbjct: 52 KKKRPTTPRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENEN 95
>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
Length = 470
Score = 32.8 bits (75), Expect = 0.91
Identities = 19/66 (28%), Positives = 30/66 (45%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCEP 777
E ++ + K+ KKKKKKKKKKKKK + Y + AA ++++ P
Sbjct: 44 ISENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAYDLPVVWSSAAFQDNSHIRKLGNWP 103
Query: 778 QIAPCL 783
+
Sbjct: 104 EQEWKQ 109
Score = 32.4 bits (74), Expect = 1.1
Identities = 17/35 (48%), Positives = 21/35 (60%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYS 750
+ +E E K+ KKKKKKKKKKKKK + Y
Sbjct: 46 ENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAYD 80
Score = 32.4 bits (74), Expect = 1.2
Identities = 16/32 (50%), Positives = 20/32 (62%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
E +E + K+ KKKKKKKKKKKKK +
Sbjct: 45 SENQEAENKQNNKKKKKKKKKKKKKNLGEAYD 76
Score = 31.6 bits (72), Expect = 1.9
Identities = 17/48 (35%), Positives = 23/48 (47%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAA 763
D E + ++ + K+ KKKKKKKKKKKKK E +
Sbjct: 37 DAFLAELISENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAYDLPVV 84
Score = 31.2 bits (71), Expect = 2.7
Identities = 19/66 (28%), Positives = 27/66 (40%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPC 775
++E E ++ KKKKKKKKKKKKK + +D + I+
Sbjct: 47 NQEAENKQNNKKKKKKKKKKKKKNLGEAYDLAYDLPVVWSSAAFQDNSHIRKLGNWPEQE 106
Query: 776 EPQIAP 781
Q P
Sbjct: 107 WKQTQP 112
Score = 30.8 bits (70), Expect = 3.3
Identities = 24/103 (23%), Positives = 35/103 (33%), Gaps = 25/103 (24%)
Query: 710 NQIAYPDKEEEEEKKKKKKKKKKKKKKKK----------------------KKKKKKKKK 747
K+++++ K+K KKK KK KK KKKKKKK
Sbjct: 4 ENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKK 63
Query: 748 KYSNEEEGMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDP 790
K ++ A + PV Q + + N P
Sbjct: 64 KKKKKKNLGEAYDLAYDL---PVVWSSAAFQDNSHIRKLGNWP 103
Score = 30.8 bits (70), Expect = 3.7
Identities = 12/32 (37%), Positives = 19/32 (59%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
E + + K++K++ K+K KKK KK KK
Sbjct: 1 AMNENGENEVKQQKQQNKQKGTKKKNKKSKKD 32
Score = 30.1 bits (68), Expect = 5.5
Identities = 16/42 (38%), Positives = 22/42 (52%)
Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKE 868
+ ++ KKKKKKKKKKKKK + + P V S +
Sbjct: 50 AENKQNNKKKKKKKKKKKKKNLGEAYDLAYDLPVVWSSAAFQ 91
Score = 29.7 bits (67), Expect = 8.7
Identities = 12/39 (30%), Positives = 23/39 (58%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGM 756
E + + K++K++ K+K KKK KK KK ++++
Sbjct: 1 AMNENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAF 39
>gnl|CDD|219947 pfam08639, SLD3, DNA replication regulator SLD3. The SLD3 DNA
replication regulator is required for loading and
maintenance of Cdc45 on chromatin during DNA
replication.
Length = 437
Score = 32.5 bits (74), Expect = 0.96
Identities = 14/40 (35%), Positives = 24/40 (60%), Gaps = 2/40 (5%)
Query: 715 PDKEEEEEKKKKKKKKK--KKKKKKKKKKKKKKKKKYSNE 752
K E+E KKK K+ + + K++K++K+KK KK+
Sbjct: 123 FTKFEKEYKKKLLKRSQNLDRSKRRKRRKRKKNKKQDLPV 162
Score = 29.4 bits (66), Expect = 9.0
Identities = 13/37 (35%), Positives = 22/37 (59%), Gaps = 6/37 (16%)
Query: 822 NQIAYLDKEKKKKKKK------KKKKKKKKKKKKKKK 852
++ +KE KKK K + K++K++K+KK KK
Sbjct: 121 DRFTKFEKEYKKKLLKRSQNLDRSKRRKRRKRKKNKK 157
>gnl|CDD|216134 pfam00816, Histone_HNS, H-NS histone family.
Length = 92
Score = 30.3 bits (69), Expect = 0.96
Identities = 13/59 (22%), Positives = 21/59 (35%), Gaps = 19/59 (32%)
Query: 716 DKEEEEEKKKKKKKKKKK-------------------KKKKKKKKKKKKKKKYSNEEEG 755
+KE E +K ++ + KKK KKK+ KY + + G
Sbjct: 11 EKEIEAARKAERAAALAEIRELAAEYGLTLEELLGGAAKKKAKKKRAPVPAKYRDPDTG 69
>gnl|CDD|216807 pfam01956, DUF106, Integral membrane protein DUF106. This
archaebacterial protein family has no known function.
Members are predicted to be integral membrane proteins.
Length = 168
Score = 31.5 bits (72), Expect = 1.1
Identities = 12/64 (18%), Positives = 34/64 (53%), Gaps = 9/64 (14%)
Query: 699 INHYTTEVGRFNQIAYPDKEEEEEKKKKK------KKKKKKKKKKKKKKKKKKKKKYSNE 752
+ +E+ ++ I D++ E+ +K++K ++ +K K KK +K++++ +
Sbjct: 25 LRGLLSELLQWLLI---DRKMEKYQKREKEIQKRARELRKNGDKLSPKKFEKRQEELMED 81
Query: 753 EEGM 756
++ M
Sbjct: 82 QKEM 85
>gnl|CDD|201540 pfam00992, Troponin, Troponin. Troponin (Tn) contains three
subunits, Ca2+ binding (TnC), inhibitory (TnI), and
tropomyosin binding (TnT). this Pfam contains members of
the TnT subunit. Troponin is a complex of three
proteins, Ca2+ binding (TnC), inhibitory (TnI), and
tropomyosin binding (TnT). The troponin complex
regulates Ca++ induced muscle contraction. This family
includes troponin T and troponin I. Troponin I binds to
actin and troponin T binds to tropomyosin.
Length = 131
Score = 31.0 bits (71), Expect = 1.1
Identities = 16/48 (33%), Positives = 25/48 (52%), Gaps = 6/48 (12%)
Query: 716 DKEEEEEKKKKKKKKKKKK----KKKKKKKKKKKKKKYSNEEEGMRKL 759
D EE+ KK K+ + KKK + K KK KK +K ++ ++ L
Sbjct: 67 DIEEKVAKKDKEIEDLKKKVNDLRGKFKKPTLKKVRKSADAM--LKAL 112
>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are approximately 90 amino acids in length.
There are two completely conserved L residues that may
be functionally important.
Length = 70
Score = 29.6 bits (67), Expect = 1.1
Identities = 8/26 (30%), Positives = 23/26 (88%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKK 742
K EEE+++++++ +K++++K+++K K
Sbjct: 30 KAEEEKREEEEEARKREERKEREKNK 55
Score = 29.2 bits (66), Expect = 1.4
Identities = 8/26 (30%), Positives = 23/26 (88%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKK 743
+ EEEK++++++ +K++++K+++K K
Sbjct: 30 KAEEEKREEEEEARKREERKEREKNK 55
Score = 27.7 bits (62), Expect = 4.4
Identities = 10/38 (26%), Positives = 30/38 (78%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
K +++E K +++K++++++ +K++++K+++K S EE
Sbjct: 23 KAKKKELKAEEEKREEEEEARKREERKEREKNKSFEEL 60
>gnl|CDD|219882 pfam08524, rRNA_processing, rRNA processing. This is a family of
proteins that are involved in rRNA processing. In a
localisation study they were found to localise to the
nucleus and nucleolus. The family also includes other
metazoa members from plants to mammals where the protein
has been named BR22 and is associated with TTF-1,
thyroid transcription factor 1. In the lungs, the family
binds TTF-1 to form a complex which influences the
expression of the key lung surfactant protein-B (SP-B)
and -C (SP-C), the small hydrophobic surfactant proteins
that maintain surface tension in alveoli.
Length = 150
Score = 31.0 bits (70), Expect = 1.2
Identities = 20/65 (30%), Positives = 37/65 (56%), Gaps = 3/65 (4%)
Query: 712 IAYPDKEEEEEKKKKKKKKKKKKKKKK---KKKKKKKKKKYSNEEEGMRKLRDAASIQCG 768
A P+KE E++ K K+ +K +KKKK KK+ K++K+ E+E ++ ++ I+
Sbjct: 47 YAVPEKESAEKQVKSSKEDRKFEKKKKLDEKKEIAKQRKREQREKELAKRQKELEKIELS 106
Query: 769 PVKEV 773
K+
Sbjct: 107 KKKQK 111
Score = 28.7 bits (64), Expect = 7.0
Identities = 10/40 (25%), Positives = 27/40 (67%)
Query: 709 FNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
+ K++ +EKK+ K++K+++++K+ K++K+ +K
Sbjct: 63 KEDRKFEKKKKLDEKKEIAKQRKREQREKELAKRQKELEK 102
>gnl|CDD|192292 pfam09429, Wbp11, WW domain binding protein 11. The WW domain is a
small protein module with a triple-stranded beta-sheet
fold. This is a family of WW domain binding proteins.
Length = 78
Score = 29.5 bits (67), Expect = 1.2
Identities = 13/41 (31%), Positives = 27/41 (65%), Gaps = 1/41 (2%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVL-SQMEK 867
D +K++KKK+ KK K +++ +++ + K PD L +++K
Sbjct: 8 DAYRKEQKKKELKKNKAERQARREAKLAKKNPDRLQRELDK 48
Score = 28.8 bits (65), Expect = 2.7
Identities = 11/41 (26%), Positives = 24/41 (58%)
Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRD 761
+ +K++KKK+ KK K +++ +++ K N + R+L
Sbjct: 8 DAYRKEQKKKELKKNKAERQARREAKLAKKNPDRLQRELDK 48
Score = 28.4 bits (64), Expect = 3.2
Identities = 12/49 (24%), Positives = 26/49 (53%), Gaps = 1/49 (2%)
Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNE-EEGMRKLRDAASIQ 766
+ K++KKK+ KK K +++ +++ K KK + + + KL+
Sbjct: 8 DAYRKEQKKKELKKNKAERQARREAKLAKKNPDRLQRELDKLKAQEESG 56
Score = 27.2 bits (61), Expect = 8.9
Identities = 11/30 (36%), Positives = 20/30 (66%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKK 745
D +E+KKK+ KK K +++ +++ K KK
Sbjct: 8 DAYRKEQKKKELKKNKAERQARREAKLAKK 37
>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
domain. This domain is found in a number of different
types of plant proteins including NAM-like proteins.
Length = 147
Score = 31.2 bits (71), Expect = 1.2
Identities = 12/32 (37%), Positives = 21/32 (65%), Gaps = 3/32 (9%)
Query: 828 DKEKKKKKKKKKKKKKK---KKKKKKKKMMKK 856
K K+K ++ K K KK+ K+K+K+++ MK
Sbjct: 66 KKAKEKLRRDKLKAKKEEAEKEKEKEERFMKA 97
Score = 30.0 bits (68), Expect = 2.7
Identities = 15/38 (39%), Positives = 24/38 (63%), Gaps = 4/38 (10%)
Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
E +KK K+K ++ K K KK++ +K+K+K EE K
Sbjct: 63 EGRKKAKEKLRRDKLKAKKEEAEKEKEK----EERFMK 96
Score = 29.6 bits (67), Expect = 3.5
Identities = 11/32 (34%), Positives = 21/32 (65%)
Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
KK K+K ++ K K KK++ +K+K+K+ +
Sbjct: 66 KKAKEKLRRDKLKAKKEEAEKEKEKEERFMKA 97
Score = 29.3 bits (66), Expect = 5.3
Identities = 12/44 (27%), Positives = 28/44 (63%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLR 760
K+ +E+ ++ K K KK++ +K+K+K+++ K + E+ +L
Sbjct: 66 KKAKEKLRRDKLKAKKEEAEKEKEKEERFMKALAEAEKERAELE 109
Score = 28.9 bits (65), Expect = 6.2
Identities = 8/38 (21%), Positives = 26/38 (68%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
+++ + KK++ +K+K+K+++ K + +K++ E++
Sbjct: 74 RDKLKAKKEEAEKEKEKEERFMKALAEAEKERAELEKK 111
Score = 28.9 bits (65), Expect = 6.3
Identities = 11/37 (29%), Positives = 24/37 (64%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE 753
KEE E++K+K+++ K + +K++ + +KKK +
Sbjct: 81 KEEAEKEKEKEERFMKALAEAEKERAELEKKKAEAKL 117
>gnl|CDD|227577 COG5252, COG5252, Uncharacterized conserved protein, contains
CCCH-type Zn-finger protein [General function prediction
only].
Length = 299
Score = 32.0 bits (72), Expect = 1.2
Identities = 14/71 (19%), Positives = 29/71 (40%), Gaps = 10/71 (14%)
Query: 723 KKKKKKKKKK---------KKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEV 773
K + K + K++ +KK+K + ++K+ E++ +R D ++ C
Sbjct: 35 KNRSTKVQAIIKQIETLNLKEQLEKKEKMRMEEKRREPEKQVIRAGVDPKTVVCALFLNK 94
Query: 774 PCEPQIAPCLF 784
C C F
Sbjct: 95 TCAKG-DACKF 104
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 32.8 bits (75), Expect = 1.2
Identities = 15/35 (42%), Positives = 25/35 (71%)
Query: 818 KQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKK 852
K +I+ +EK KKKKKK+KKK+++ K+++K
Sbjct: 731 KDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKA 765
Score = 32.4 bits (74), Expect = 1.4
Identities = 14/28 (50%), Positives = 23/28 (82%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKK 745
+ EEK KKKKKK+KKK+++ K+++K +
Sbjct: 739 DSVEEKTKKKKKKEKKKEEEYKREEKAR 766
Score = 32.4 bits (74), Expect = 1.5
Identities = 10/35 (28%), Positives = 17/35 (48%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSN 751
+E+ E KK+K KK+ K + + K+Y
Sbjct: 1558 EEDYAESDIKKRKNKKQYKSNTEAELDFFLKRYLR 1592
Score = 32.0 bits (73), Expect = 2.1
Identities = 12/36 (33%), Positives = 25/36 (69%)
Query: 818 KQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKM 853
F ++++ KKKKKK+KKK+++ K+++K ++
Sbjct: 732 DAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKARI 767
Score = 30.8 bits (70), Expect = 4.2
Identities = 14/28 (50%), Positives = 23/28 (82%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKK 743
D EE+ KKKKKK+KKK+++ K+++K +
Sbjct: 739 DSVEEKTKKKKKKEKKKEEEYKREEKAR 766
Score = 30.8 bits (70), Expect = 4.2
Identities = 13/28 (46%), Positives = 23/28 (82%)
Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKK 746
+ E+K KKKKKK+KKK+++ K+++K +
Sbjct: 739 DSVEEKTKKKKKKEKKKEEEYKREEKAR 766
Score = 30.1 bits (68), Expect = 7.3
Identities = 12/34 (35%), Positives = 25/34 (73%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
+ + + ++K KKKKKK+KKK+++ K+++K +
Sbjct: 733 AEFKISDSVEEKTKKKKKKEKKKEEEYKREEKAR 766
Score = 30.1 bits (68), Expect = 7.5
Identities = 13/33 (39%), Positives = 23/33 (69%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
K+ E + ++K KKKKKK+KKK+++ K++
Sbjct: 730 GKDAEFKISDSVEEKTKKKKKKEKKKEEEYKRE 762
Score = 30.1 bits (68), Expect = 8.0
Identities = 12/29 (41%), Positives = 22/29 (75%)
Query: 714 YPDKEEEEEKKKKKKKKKKKKKKKKKKKK 742
EE+ +KKKKK+KKK+++ K+++K +
Sbjct: 738 SDSVEEKTKKKKKKEKKKEEEYKREEKAR 766
>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin. Trichoplein
or mitostatin, was first defined as a meiosis-specific
nuclear structural protein. It has since been linked
with mitochondrial movement. It is associated with the
mitochondrial outer membrane, and over-expression leads
to reduction in mitochondrial motility whereas lack of
it enhances mitochondrial movement. The activity appears
to be mediated through binding the mitochondria to the
actin intermediate filaments (IFs).
Length = 349
Score = 32.2 bits (74), Expect = 1.3
Identities = 13/56 (23%), Positives = 34/56 (60%), Gaps = 9/56 (16%)
Query: 718 EEEEEKKKKKKKKKKKKKKK-------KKKKKKKKKKKYSNEEEGMRKLRDAASIQ 766
+EE+E + ++K++K+KK ++ ++ ++K+++K+ EEE K+ + +
Sbjct: 109 QEEDEAEAQEKREKQKKLREEIDEFNEERIERKEEEKEREREEE--LKILEYQREK 162
Score = 31.4 bits (72), Expect = 2.1
Identities = 16/75 (21%), Positives = 42/75 (56%), Gaps = 11/75 (14%)
Query: 686 EKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKK 745
E+ +R E+ +I Y E K E EE+++ +++++K++K+++ + + +
Sbjct: 142 EEEKEREREEELKILEYQRE-----------KAEREEEREAERRERKEEKEREVARLRAQ 190
Query: 746 KKKYSNEEEGMRKLR 760
+++ +E E + +LR
Sbjct: 191 QEEAEDEREELDELR 205
>gnl|CDD|215565 PLN03083, PLN03083, E3 UFM1-protein ligase 1 homolog; Provisional.
Length = 803
Score = 32.5 bits (74), Expect = 1.3
Identities = 17/44 (38%), Positives = 22/44 (50%), Gaps = 4/44 (9%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKK----KKKKKKKKKKYSNEEE 754
PD EE+ KK KK +KK + K K K KK+ S E+
Sbjct: 439 PDDEEDAPKKGKKNQKKGRDKSSKVPSDSKAGGKKESVKSQEDN 482
Score = 31.3 bits (71), Expect = 2.5
Identities = 13/51 (25%), Positives = 18/51 (35%)
Query: 706 VGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGM 756
A ++EE+ KK KK +KK + K K K E
Sbjct: 428 TSTKGGTAESIPDDEEDAPKKGKKNQKKGRDKSSKVPSDSKAGGKKESVKS 478
>gnl|CDD|240253 PTZ00068, PTZ00068, 60S ribosomal protein L13a; Provisional.
Length = 202
Score = 31.6 bits (72), Expect = 1.3
Identities = 11/35 (31%), Positives = 20/35 (57%)
Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
EE++K++ KKK K +K K+ +KK + +
Sbjct: 156 EEKRKERAAAYYKKKVKLRKAWKEARKKALAKLPK 190
Score = 30.0 bits (68), Expect = 3.8
Identities = 14/42 (33%), Positives = 20/42 (47%), Gaps = 5/42 (11%)
Query: 717 KEEEEEKKK-----KKKKKKKKKKKKKKKKKKKKKKKYSNEE 753
K EE+ K++ KKK K +K K+ +KK K K
Sbjct: 154 KLEEKRKERAAAYYKKKVKLRKAWKEARKKALAKLPKAIVAV 195
Score = 28.9 bits (65), Expect = 9.2
Identities = 12/39 (30%), Positives = 22/39 (56%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEK 867
KE+ KKK K +K K+ +KK + K +++ ++K
Sbjct: 160 KERAAAYYKKKVKLRKAWKEARKKALAKLPKAIVAVLKK 198
>gnl|CDD|219668 pfam07964, Red1, Rec10 / Red1. Rec10 / Red1 is involved in meiotic
recombination and chromosome segregation during
homologous chromosome formation. This protein localises
to the synaptonemal complex in S. cerevisiae and the
analogous structures (linear elements) in S. pombe. This
family is currently only found in fungi.
Length = 706
Score = 32.6 bits (74), Expect = 1.3
Identities = 40/203 (19%), Positives = 77/203 (37%), Gaps = 30/203 (14%)
Query: 689 NLADRSEDQRINHYTTEVGRFNQIAYPDKEEE---EEKKKKKKKKKKKKKKKKKKKKKKK 745
+L + ++ +I T + + P+ EE++ + +K+K + K +
Sbjct: 306 DLKEDQKESKIKSGTNLKLSSEKESIPETSYVNVLEEEQSPLVRLQKRKLARSTSKTLES 365
Query: 746 KKKYSNEEEGMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRI 805
K+ ++ K + A S + + E P P + DP K+ +
Sbjct: 366 LKEVFEDQASSVKNKQAQSEE--NLNESP----KTPIA--VTGDPHLKDPTIIAGKKLM- 416
Query: 806 NHYTTEVGYLDPKQRFNQIAYLDKEKKKKKKKKKK-------------KKKKKKKKKKKK 852
N T+E K + + K K K +KK+ K+ +KK K K+
Sbjct: 417 NKLTSEKINNPVK-----VVKVSKYKGNKSEKKRDINVLDTIFASPVSKELRKKVGKSKQ 471
Query: 853 MMKKGYPDVLSQMEKELANINRT 875
K + V ++ +K+LAN N
Sbjct: 472 TKLKNFKPVPNKSKKQLANNNSQ 494
>gnl|CDD|237276 PRK13024, PRK13024, bifunctional preprotein translocase subunit
SecD/SecF; Reviewed.
Length = 755
Score = 32.5 bits (75), Expect = 1.3
Identities = 10/28 (35%), Positives = 18/28 (64%)
Query: 822 NQIAYLDKEKKKKKKKKKKKKKKKKKKK 849
+ ++++ KKKKK+KK KK + +K
Sbjct: 726 APLWLDLEKRRLKKKKKRKKVKKWEVEK 753
Score = 30.6 bits (70), Expect = 4.4
Identities = 11/23 (47%), Positives = 17/23 (73%)
Query: 727 KKKKKKKKKKKKKKKKKKKKKKY 749
+K++ KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEKIE 755
Score = 30.2 bits (69), Expect = 5.6
Identities = 12/21 (57%), Positives = 17/21 (80%)
Query: 722 EKKKKKKKKKKKKKKKKKKKK 742
EK++ KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753
Score = 30.2 bits (69), Expect = 6.8
Identities = 11/21 (52%), Positives = 17/21 (80%)
Query: 723 KKKKKKKKKKKKKKKKKKKKK 743
+K++ KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753
Score = 30.2 bits (69), Expect = 6.8
Identities = 11/21 (52%), Positives = 17/21 (80%)
Query: 724 KKKKKKKKKKKKKKKKKKKKK 744
+K++ KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753
Score = 30.2 bits (69), Expect = 6.8
Identities = 11/21 (52%), Positives = 17/21 (80%)
Query: 725 KKKKKKKKKKKKKKKKKKKKK 745
+K++ KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753
Score = 30.2 bits (69), Expect = 6.8
Identities = 11/21 (52%), Positives = 17/21 (80%)
Query: 726 KKKKKKKKKKKKKKKKKKKKK 746
+K++ KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753
Score = 30.2 bits (69), Expect = 6.8
Identities = 11/21 (52%), Positives = 17/21 (80%)
Query: 831 KKKKKKKKKKKKKKKKKKKKK 851
+K++ KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753
Score = 30.2 bits (69), Expect = 6.8
Identities = 11/21 (52%), Positives = 17/21 (80%)
Query: 832 KKKKKKKKKKKKKKKKKKKKK 852
+K++ KKKKK+KK KK + +K
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEK 753
>gnl|CDD|203444 pfam06424, PRP1_N, PRP1 splicing factor, N-terminal. This domain
is specific to the N-terminal part of the prp1 splicing
factor, which is involved in mRNA splicing (and possibly
also poly(A)+ RNA nuclear export and cell cycle
progression). This domain is specific to the N terminus
of the RNA splicing factor encoded by prp1. It is
involved in mRNA splicing and possibly also poly(A)and
RNA nuclear export and cell cycle progression.
Length = 131
Score = 30.7 bits (70), Expect = 1.3
Identities = 16/65 (24%), Positives = 35/65 (53%), Gaps = 2/65 (3%)
Query: 692 DRSEDQRINHYTT--EVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
D ED+ Y G F+ Y D++EE ++ + ++ +++KK++++K+K++
Sbjct: 32 DDEEDEDPKRYQDGDNEGLFSDGKYDDEDEEADRIYESIDERMDERRKKRREQKEKEEIE 91
Query: 750 SNEEE 754
EE
Sbjct: 92 KYREE 96
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 31.9 bits (73), Expect = 1.4
Identities = 10/44 (22%), Positives = 27/44 (61%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKL 759
D+EE++E KK K+ ++ ++ +++ ++++ E+E +L
Sbjct: 134 DEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKASEL 177
>gnl|CDD|221857 pfam12923, RRP7, Ribosomal RNA-processing protein 7 (RRP7). RRP7
is an essential protein in yeast that is involved in
pre-rRNA processing and ribosome assembly. It is
speculated to be required for correct assembly of rpS27
into the pre-ribosomal particle.
Length = 131
Score = 30.7 bits (70), Expect = 1.4
Identities = 19/65 (29%), Positives = 38/65 (58%), Gaps = 11/65 (16%)
Query: 707 GRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKK-------KKKKKKK----KKKYSNEEEG 755
GR + + EE++K K+KKKKKKK+ + ++KKK++ +KK+ +++
Sbjct: 59 GRKRKAGASRNKAAEERRKLKEKKKKKKKELENFYRFQIREKKKEELAELRKKFEEDKKR 118
Query: 756 MRKLR 760
+ +L+
Sbjct: 119 IEQLK 123
Score = 29.5 bits (67), Expect = 3.7
Identities = 17/57 (29%), Positives = 25/57 (43%), Gaps = 24/57 (42%)
Query: 716 DKEEEEEKKKKKK------------------------KKKKKKKKKKKKKKKKKKKK 748
DK EEE K++ K + K ++++K K+KKKKKKK
Sbjct: 31 DKREEEAKEEAKARSEPDEDGWTTVTRGGRKRKAGASRNKAAEERRKLKEKKKKKKK 87
Score = 28.3 bits (64), Expect = 8.6
Identities = 11/27 (40%), Positives = 19/27 (70%)
Query: 825 AYLDKEKKKKKKKKKKKKKKKKKKKKK 851
A + K ++++K K+KKKKKKK+ +
Sbjct: 64 AGASRNKAAEERRKLKEKKKKKKKELE 90
>gnl|CDD|226814 COG4379, COG4379, Mu-like prophage tail protein gpP [General
function prediction only].
Length = 386
Score = 32.1 bits (73), Expect = 1.4
Identities = 16/41 (39%), Positives = 21/41 (51%)
Query: 705 EVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKK 745
E G F+ AYP K E K+K +K K KK KK + +
Sbjct: 341 EDGAFDVDAYPKKAEAARKRKGDRKGVSHKAKKGGKKNQAE 381
Score = 31.0 bits (70), Expect = 3.5
Identities = 12/35 (34%), Positives = 16/35 (45%)
Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
KK + +K+K +K K KK KK E G
Sbjct: 350 YPKKAEAARKRKGDRKGVSHKAKKGGKKNQAETAG 384
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called The
RES complex, pre-mRNA retention and splicing complex.
Subunits of this complex are not essential for viability
of yeasts but they are required for efficient splicing
in vitro and in vivo. Furthermore, inactivation of this
complex causes pre-mRNA leakage from the nucleus. Bud13
contains a unique, phylogenetically conserved C-terminal
region of unknown function.
Length = 141
Score = 30.7 bits (70), Expect = 1.4
Identities = 14/44 (31%), Positives = 32/44 (72%), Gaps = 5/44 (11%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKK-----KKKKKKKKYSNEEE 754
D EE+ E+K+++K++K++K++K+K+ +K++++K E E
Sbjct: 13 DIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELE 56
Score = 30.4 bits (69), Expect = 1.9
Identities = 12/41 (29%), Positives = 30/41 (73%), Gaps = 3/41 (7%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKE 868
D E+K+++K+++K++K++K++K+K+ KG V + ++
Sbjct: 13 DIEEKREEKEREKEEKERKEEKEKE-WGKG--LVQKEEREK 50
Score = 28.8 bits (65), Expect = 5.8
Identities = 11/43 (25%), Positives = 32/43 (74%), Gaps = 4/43 (9%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKK----KKKKKKKYSNEEE 754
+++ EE++++K++K++K++K+K+ K K++++K+ E+
Sbjct: 15 EEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEK 57
>gnl|CDD|187861 cd09730, Cas8a1_I-A, CRISPR/Cas system-associated protein Cas8a1.
CRISPR (Clustered Regularly Interspaced Short
Palindromic Repeats) and associated Cas proteins
comprise a system for heritable host defense by
prokaryotic cells against phage and other foreign DNA;
Large proteins, some contain Zn-finger domain; signature
gene for I-A subtype; also known as TM1802 family.
Length = 579
Score = 32.3 bits (74), Expect = 1.4
Identities = 17/57 (29%), Positives = 25/57 (43%), Gaps = 5/57 (8%)
Query: 710 NQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKK-----KKKKKKKYSNEEEGMRKLRD 761
N+I Y K E KK KK +K + KK K K N E+ ++ ++D
Sbjct: 84 NKITYLLKTTEPLNKKIKKWFEKSYLLLESKKEILKLKNIKDVLEKNIEKILKDIQD 140
>gnl|CDD|216723 pfam01823, MACPF, MAC/Perforin domain. The membrane-attack complex
(MAC) of the complement system forms transmembrane
channels. These channels disrupt the phospholipid
bilayer of target cells, leading to cell lysis and
death. A number of proteins participate in the assembly
of the MAC. Freshly activated C5b binds to C6 to form a
C5b-6 complex, then to C7 forming the C5b-7 complex. The
C5b-7 complex binds to C8, which is composed of three
chains (alpha, beta, and gamma), thus forming the C5b-8
complex. C5b-8 subsequently binds to C9 and acts as a
catalyst in the polymerisation of C9. Active MAC has a
subunit composition of C5b-C6-C7-C8-C9{n}. Perforin is a
protein found in cytolytic T-cell and killer cells. In
the presence of calcium, perforin polymerises into
transmembrane tubules and is capable of lysing,
non-specifically, a variety of target cells. There are a
number of regions of similarity in the sequences of
complement components C6, C7, C8-alpha, C8-beta, C9 and
perforin. The X-ray crystal structure of a MACPF domain
reveals that it shares a common fold with bacterial
cholesterol dependent cytolysins (pfam01289) such as
perfringolysin O. Three key pieces of evidence suggests
that MACPF domains and CDCs are homologous: Functional
similarity (pore formation), conservation of three
glycine residues at a hinge in both families and
conservation of a complex core fold.
Length = 209
Score = 31.6 bits (72), Expect = 1.4
Identities = 15/68 (22%), Positives = 23/68 (33%), Gaps = 14/68 (20%)
Query: 701 HYTTEV---GRFNQIAYPDKEEEEEKKKKKKK-----------KKKKKKKKKKKKKKKKK 746
HY T V G+ + D + ++ K K K K +K+K
Sbjct: 78 HYITSVTLGGKIVYVLKLDSSQVKKLKLTGSDCVSAEAGASFGKVSSKGCKSTSSSQKEK 137
Query: 747 KKYSNEEE 754
K S+ E
Sbjct: 138 KSSSSSIE 145
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 31.8 bits (72), Expect = 1.4
Identities = 14/47 (29%), Positives = 22/47 (46%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAA 763
K + E+K + ++K +KKK K K K + + RK AA
Sbjct: 214 KAKAEKKAEAAAEEKAAAEKKKAAAKAKADKAAAAAKAAERKAAAAA 260
>gnl|CDD|240254 PTZ00069, PTZ00069, 60S ribosomal protein L5; Provisional.
Length = 300
Score = 32.0 bits (73), Expect = 1.4
Identities = 15/39 (38%), Positives = 23/39 (58%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE 753
P K ++++KKKKK KK K KK K++K + K +
Sbjct: 252 PSKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQ 290
Score = 31.2 bits (71), Expect = 2.4
Identities = 16/32 (50%), Positives = 19/32 (59%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
+ KKKKKKKKK KK K KK K++K
Sbjct: 250 ANPSKVKKKKKKKKKVVHKKYKTKKLTGKQRK 281
Score = 29.3 bits (66), Expect = 8.5
Identities = 16/27 (59%), Positives = 17/27 (62%)
Query: 832 KKKKKKKKKKKKKKKKKKKKKMMKKGY 858
K KKKKKKKKK KK K KK+ K
Sbjct: 254 KVKKKKKKKKKVVHKKYKTKKLTGKQR 280
Score = 29.3 bits (66), Expect = 9.2
Identities = 15/24 (62%), Positives = 16/24 (66%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKK 851
+ K KKKKKKKKK KK K KK
Sbjct: 251 NPSKVKKKKKKKKKVVHKKYKTKK 274
>gnl|CDD|237028 PRK12267, PRK12267, methionyl-tRNA synthetase; Reviewed.
Length = 648
Score = 32.1 bits (74), Expect = 1.5
Identities = 10/31 (32%), Positives = 18/31 (58%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKK 746
D EEE K++ + K+ ++K+KK +K
Sbjct: 511 DVEEEIAYIKEQMEGSAPKEPEEKEKKPEKP 541
Score = 31.7 bits (73), Expect = 2.0
Identities = 10/35 (28%), Positives = 20/35 (57%)
Query: 714 YPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
+P + EEE K++ + K+ ++K+KK +K
Sbjct: 507 FPRIDVEEEIAYIKEQMEGSAPKEPEEKEKKPEKP 541
>gnl|CDD|220838 pfam10659, Trypan_glycop_C, Trypanosome variant surface
glycoprotein C-terminal domain. The trypanosome
parasite expresses these proteins to evade the immune
response.
Length = 98
Score = 29.7 bits (67), Expect = 1.6
Identities = 7/34 (20%), Positives = 14/34 (41%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
P + + ++ + + +K K KK KK
Sbjct: 29 PKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKKD 62
Score = 29.3 bits (66), Expect = 2.6
Identities = 10/41 (24%), Positives = 16/41 (39%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
K ++ + + +K K KK KK K + EG
Sbjct: 31 EGKAKKNGAPVTQTAGTETTTEKCKGKKDKKDCKKGCKWEG 71
Score = 28.6 bits (64), Expect = 4.7
Identities = 15/43 (34%), Positives = 20/43 (46%), Gaps = 10/43 (23%)
Query: 716 DKEEEEEK-KKKKKKKKKKKK---------KKKKKKKKKKKKK 748
DK+E++ K K K+ K KK +K K KK KK
Sbjct: 19 DKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKK 61
>gnl|CDD|146145 pfam03357, Snf7, Snf7. This family of proteins are involved in
protein sorting and transport from the endosome to the
vacuole/lysosome in eukaryotic cells. Vacuoles/lysosomes
play an important role in the degradation of both lipids
and cellular proteins. In order to perform this
degradative function, vacuoles/lysosomes contain
numerous hydrolases which have been transported in the
form of inactive precursors via the biosynthetic pathway
and are proteolytically activated upon delivery to the
vacuole/lysosome. The delivery of transmembrane
proteins, such as activated cell surface receptors to
the lumen of the vacuole/lysosome, either for
degradation/downregulation, or in the case of
hydrolases, for proper localisation, requires the
formation of multivesicular bodies (MVBs). These late
endosomal structures are formed by invaginating and
budding of the limiting membrane into the lumen of the
compartment. During this process, a subset of the
endosomal membrane proteins is sorted into the forming
vesicles. Mature MVBs fuse with the vacuole/lysosome,
thereby releasing cargo containing vesicles into its
hydrolytic lumen for degradation. Endosomal proteins
that are not sorted into the intralumenal MVB vesicles
are either recycled back to the plasma membrane or Golgi
complex, or remain in the limiting membrane of the MVB
and are thereby transported to the limiting membrane of
the vacuole/lysosome as a consequence of fusion.
Therefore, the MVB sorting pathway plays a critical role
in the decision between recycling and degradation of
membrane proteins. A few archaeal sequences are also
present within this family.
Length = 169
Score = 31.1 bits (71), Expect = 1.6
Identities = 18/57 (31%), Positives = 29/57 (50%), Gaps = 10/57 (17%)
Query: 823 QIAYLDKEKKKKKKKKKKKKKKKKKK------KKKKMMKKGYPDVLSQMEKELANIN 873
+ L+K+ KK + + KK KK K K+KK +K L Q++ +LAN+
Sbjct: 16 KQESLEKKIKKLEAEIKKLAKKGNKDAALILLKQKKRYEK----QLDQLDGQLANLE 68
Score = 29.9 bits (68), Expect = 3.7
Identities = 18/57 (31%), Positives = 27/57 (47%), Gaps = 6/57 (10%)
Query: 815 LDPKQRF--NQIAYLDKEKKKKKKKKKKKKKK---KKKKKKKKMMKKGYPDVLSQME 866
LD KQ +I L+ E KK KK K K+KK+ +K + + L+ +E
Sbjct: 13 LDKKQESLEKKIKKLEAEIKKLAKKGNKDAALILLKQKKRYEKQLDQLD-GQLANLE 68
Score = 28.8 bits (65), Expect = 8.1
Identities = 10/36 (27%), Positives = 18/36 (50%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE 753
E++ +K + + KK KK K K+KK ++
Sbjct: 21 EKKIKKLEAEIKKLAKKGNKDAALILLKQKKRYEKQ 56
>gnl|CDD|149515 pfam08492, SRP72, SRP72 RNA-binding domain. This region has been
identified as the binding site of the SRP72 protein to
SRP RNA.
Length = 57
Score = 28.6 bits (64), Expect = 1.6
Identities = 14/26 (53%), Positives = 17/26 (65%)
Query: 724 KKKKKKKKKKKKKKKKKKKKKKKKKY 749
KK KKK ++KKKK KKK+K K
Sbjct: 6 KKIAKKKPGTEQKKKKSAKKKRKPKL 31
Score = 27.8 bits (62), Expect = 3.5
Identities = 16/28 (57%), Positives = 19/28 (67%)
Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
E KK KK KKK ++KKKK KKK+K
Sbjct: 1 EAKKPKKIAKKKPGTEQKKKKSAKKKRK 28
Score = 27.4 bits (61), Expect = 4.5
Identities = 14/25 (56%), Positives = 17/25 (68%)
Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKK 747
KK KKK ++KKKK KKK+K K
Sbjct: 6 KKIAKKKPGTEQKKKKSAKKKRKPK 30
Score = 27.0 bits (60), Expect = 7.3
Identities = 15/29 (51%), Positives = 19/29 (65%)
Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
E +K KK KKK ++KKKK KKK+K
Sbjct: 1 EAKKPKKIAKKKPGTEQKKKKSAKKKRKP 29
>gnl|CDD|185594 PTZ00395, PTZ00395, Sec24-related protein; Provisional.
Length = 1560
Score = 32.4 bits (73), Expect = 1.7
Identities = 27/110 (24%), Positives = 42/110 (38%), Gaps = 12/110 (10%)
Query: 422 SNTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPKNENT---NPRYENG 478
SN Y N + NP Y N + T NP N Y+ P N N N Y N
Sbjct: 410 SNAGYSNPGNS-NPGYNNAPNSNTPYNNPPNSNT------PYSNPPNSNPPYSNLPYSNT 462
Query: 479 THEYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRN 528
Y+ L N+ + + + +YQ+ ++ + N+P+ N
Sbjct: 463 P--YSNAPLSNAPPSSAKDHHSAYHAAYQHRAANQPAANLPTANQPAANN 510
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 32.0 bits (73), Expect = 1.7
Identities = 15/49 (30%), Positives = 30/49 (61%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS 764
+ EE E+ KK+ +K +K ++ ++ ++K KK K +E + K+ + AS
Sbjct: 702 ELEEREKAKKELEKLEKALERVEELREKVKKYKALLKERALSKVGEIAS 750
Score = 32.0 bits (73), Expect = 1.8
Identities = 30/158 (18%), Positives = 71/158 (44%), Gaps = 18/158 (11%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCE 776
KEE EE +K+ + + K+K ++K ++ +++ ++E + +L + VKE+
Sbjct: 237 KEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKE-IEELEEK-------VKELK-- 286
Query: 777 PQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKK 836
++K E L++ E E ++ N I KE ++K++
Sbjct: 287 --------ELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEE 338
Query: 837 KKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKELANINR 874
+ ++ KKK K+ +K+ + ++ + + + + R
Sbjct: 339 RLEELKKKLKELEKRLEELEERHELYEEAKAKKEELER 376
Score = 29.6 bits (67), Expect = 8.6
Identities = 28/142 (19%), Positives = 67/142 (47%), Gaps = 13/142 (9%)
Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCEPQI 779
E K+ +K+ K+ ++K++K +K+ ++ +K +E + KL++ A +KE+ +
Sbjct: 459 AELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKLKELAE----QLKELEEK--- 511
Query: 780 APCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKKKKK 839
L + EK +++ E+ L K+ ++ L K K + +KK
Sbjct: 512 ---LKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSL--KKELEKLEELKK-KLAELEKKL 565
Query: 840 KKKKKKKKKKKKKMMKKGYPDV 861
+ +++ + K++ + G+ V
Sbjct: 566 DELEEELAELLKELEELGFESV 587
Score = 29.6 bits (67), Expect = 8.7
Identities = 24/144 (16%), Positives = 65/144 (45%), Gaps = 8/144 (5%)
Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCEPQ 778
EE ++ KK+ ++ ++K K+ K+ K+K ++Y E + D ++ + E +
Sbjct: 265 EERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDE--LREIEKRLSRLEEE 322
Query: 779 IAPC---LFDIKNDPCEKNNLADR-SEVQR-INHYTTEV-GYLDPKQRFNQIAYLDKEKK 832
I + +++ L + E+++ + Y + K + ++ L K
Sbjct: 323 INGIEERIKELEEKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLT 382
Query: 833 KKKKKKKKKKKKKKKKKKKKMMKK 856
+K +K+ ++ +K K+++ ++
Sbjct: 383 GLTPEKLEKELEELEKAKEEIEEE 406
Score = 29.6 bits (67), Expect = 9.8
Identities = 32/162 (19%), Positives = 67/162 (41%), Gaps = 31/162 (19%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCE 776
+ E E+ K+K+K+ ++ ++ + + + E E + K VKE
Sbjct: 187 RTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKE----------VKE---- 232
Query: 777 PQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKK 836
L ++K E L E + E ++ +I L KE ++ ++
Sbjct: 233 ------LEELKE---EIEELEKELESLEGSKRKLEE---KIRELEERIEELKKEIEELEE 280
Query: 837 KKKKKKKKKKKKKK----KKMMKKGYPDVLSQMEKELANINR 874
K K+ K+ K+K ++ + ++ Y D L ++EK L+ +
Sbjct: 281 KVKELKELKEKAEEYIKLSEFYEE-YLDELREIEKRLSRLEE 321
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 31.9 bits (73), Expect = 1.7
Identities = 13/74 (17%), Positives = 34/74 (45%), Gaps = 2/74 (2%)
Query: 687 KNNLADRSEDQRINHYTTEVGRFNQ--IAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKK 744
K + + I E+ + ++ +EEEE ++ K+K+++ K +++ +
Sbjct: 153 KEDNLKDDLESLIASAKEELDQLSKKLAELKAEEEEELERALKEKREELLSKLEEELLAR 212
Query: 745 KKKKYSNEEEGMRK 758
+ K + E+ +R
Sbjct: 213 LESKEAALEKQLRL 226
Score = 31.5 bits (72), Expect = 2.4
Identities = 24/187 (12%), Positives = 68/187 (36%), Gaps = 17/187 (9%)
Query: 692 DRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSN 751
D + + V + ++ + ++ + ++ ++ + + K + +
Sbjct: 54 DPTLRDPQISVVSSVTKLSKGPKDEPKQVKIPRQSGLSEEVAEAEAKATSVAAEATTPKS 113
Query: 752 EEEGMRKLRDAASIQCGPVKEVPCEPQIA------PCLFDIKNDPCEKNNLADRSEVQRI 805
+E + L + +KE +P + L D + NL D E I
Sbjct: 114 IQELVEALEELLEE---LLKETASDPVVQELVSIFNDLIDSIKED----NLKDDLE-SLI 165
Query: 806 NHYTTEVGYLDPKQRFNQI-AYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQ 864
E+ L ++ ++ A ++E ++ K+K+++ K +++ ++ + Q
Sbjct: 166 ASAKEELDQL--SKKLAELKAEEEEELERALKEKREELLSKLEEELLARLESKEAALEKQ 223
Query: 865 MEKELAN 871
+ E
Sbjct: 224 LRLEFER 230
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 31.9 bits (73), Expect = 1.8
Identities = 24/99 (24%), Positives = 46/99 (46%), Gaps = 12/99 (12%)
Query: 788 NDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKE----KKKKKKKKKKKKK 843
N+ ++N ++I++ E K + +A L ++ +K+KK +K
Sbjct: 374 NELIREHN-------EKIDNLKKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINS 426
Query: 844 KKKKKKKKKMMKKGYPDVLSQMEKELANINRTAVAPINK 882
+K+ K+ + K + ++EK+L NI TA INK
Sbjct: 427 LEKEIKQLEAEIKALEKEIKELEKQLTNIEPTADE-INK 464
Score = 31.1 bits (71), Expect = 3.3
Identities = 13/45 (28%), Positives = 25/45 (55%)
Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAA 763
E E + ++ KK+ KK ++K ++ + + +KK E+ K D A
Sbjct: 92 EIEAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKA 136
Score = 30.7 bits (70), Expect = 4.4
Identities = 13/73 (17%), Positives = 33/73 (45%), Gaps = 2/73 (2%)
Query: 682 NDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKK 741
N+ ++N + + N ++ + KE+ + +K+KK +K +K+
Sbjct: 374 NELIREHNEKIDNLKKEKNKAKKKLWLH--LVAELKEDIDAYQKEKKGLEKAINSLEKEI 431
Query: 742 KKKKKKKYSNEEE 754
K+ + + + E+E
Sbjct: 432 KQLEAEIKALEKE 444
>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
factor. Replicative DNA polymerases are capable of
polymerising tens of thousands of nucleotides without
dissociating from their DNA templates. The high
processivity of these polymerases is dependent upon
accessory proteins that bind to the catalytic subunit of
the polymerase or to the substrate. The Epstein-Barr
virus (EBV) BMRF1 protein is an essential component of
the viral DNA polymerase and is absolutely required for
lytic virus replication. BMRF1 is also a transactivator.
This family is predicted to have a UL42 like structure.
Length = 381
Score = 31.6 bits (72), Expect = 1.9
Identities = 18/92 (19%), Positives = 27/92 (29%), Gaps = 18/92 (19%)
Query: 657 ASIQCGPVKEVPCEPQIAPCLFDIKNDPC--EKNNLADRSEDQRINHYTTEVGRFNQIAY 714
G V + P D P ++ + +
Sbjct: 299 EPEPTGSVSDRPRHLSS-----DSSPSPPDTSDSDPSTETPPPASL-----------SHS 342
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKK 746
P E K+K++ KK+KKKK KK K
Sbjct: 343 PPAAFERPLALSPKRKREGDKKQKKKKSKKLK 374
Score = 29.3 bits (66), Expect = 9.0
Identities = 12/23 (52%), Positives = 16/23 (69%)
Query: 729 KKKKKKKKKKKKKKKKKKKKYSN 751
K+K++ KK+KKKK KK K N
Sbjct: 356 KRKREGDKKQKKKKSKKLKLTFN 378
>gnl|CDD|214998 smart01054, CaM_binding, Plant calmodulin-binding domain. The
sequences featured in this family are found repeated in
a number of plant calmodulin-binding proteins, and are
thought to constitute the calmodulin-binding domains..
Binding of the proteins to calmodulin depends on the
presence of calcium ions.. These proteins are thought to
be involved in various processes, such as plant defence
responses.and stolonisation or tuberization.
Length = 115
Score = 30.1 bits (68), Expect = 2.0
Identities = 12/55 (21%), Positives = 22/55 (40%)
Query: 708 RFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDA 762
F + D + E E+ +K K KK K+ K + K++ + E +
Sbjct: 6 EFRRGKIEDPKPEGERPRKWKFLKKVILLKRFVKSLEDKRRLNPREPNVLPTEVE 60
>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
protein TraG; Provisional.
Length = 942
Score = 32.0 bits (73), Expect = 2.0
Identities = 15/70 (21%), Positives = 28/70 (40%), Gaps = 3/70 (4%)
Query: 679 DIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKK 738
D +N + N + NH+ TE N Y EE+ +++ ++ K+
Sbjct: 876 DTQNSIRGEENTVKGQYSELQNHHKTEALSQN-NKY--NEEKSAQERMPGADSPEELMKR 932
Query: 739 KKKKKKKKKK 748
K+ + K K
Sbjct: 933 AKEYQDKHKG 942
>gnl|CDD|241262 cd01228, PH_BCR-related, Breakpoint Cluster Region-related
pleckstrin homology (PH) domain. The BCR gene is one of
the two genes in the BCR-ABL complex, which is
associated with the Philadelphia chromosome, a product
of a reciprocal translocation between chromosomes 22 and
9. BCR is a GTPase-activating protein (GAP) for RAC1
(primarily) and CDC42. The Dbl region of BCR has the
most RhoGEF activity for Cdc42, and less activity
towards Rac and Rho. Since BCR possesses both GAP and
GEF activities, it may function to temporally regulate
the activity of these GTPases. It also displays
serine/threonine kinase activity. The BCR protein
contains multiple domains including an N-terminal kinase
domain, a RhoGEF domain, a PH domain, a C1 domain, a C2
domain, and a C-terminal RhoGAP domain. ABR, a related
smaller protein, is structurally similar to BCR, but
lacks the N-terminal kinase domain and has GAP activity
for both Rac and Cdc42. PH domains have diverse
functions, but in general are involved in targeting
proteins to the appropriate cellular location or in the
interaction with a binding partner. They share little
sequence conservation, but all have a common fold, which
is electrostatically polarized. Less than 10% of PH
domains bind phosphoinositide phosphates (PIPs) with
high affinity and specificity. PH domains are
distinguished from other PIP-binding domains by their
specific high-affinity binding to PIPs with two vicinal
phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
PtdIns(3,4,5)P3 which results in targeting some PH
domain proteins to the plasma membrane. A few display
strong specificity in lipid binding. Any specificity is
usually determined by loop regions or insertions in the
N-terminus of the domain, which are not conserved across
all PH domains. PH domains are found in cellular
signaling proteins such as serine/threonine kinase,
tyrosine kinases, regulators of G-proteins, endocytotic
GTPases, adaptors, as well as cytoskeletal associated
molecules and in lipid associated enzymes.
Length = 179
Score = 30.8 bits (70), Expect = 2.0
Identities = 8/45 (17%), Positives = 19/45 (42%)
Query: 712 IAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGM 756
++K K+ ++KK + +K +KK + +E +
Sbjct: 93 QMKKAISALKQKIKRDSAERKKAASSGSRAIEKLRKKLAEQEAAL 137
Score = 30.4 bits (69), Expect = 3.1
Identities = 14/44 (31%), Positives = 20/44 (45%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
P EE + KK K+K K+ ++KK S E +RK
Sbjct: 85 PKSSEELVQMKKAISALKQKIKRDSAERKKAASSGSRAIEKLRK 128
Score = 29.2 bits (66), Expect = 6.6
Identities = 7/37 (18%), Positives = 16/37 (43%)
Query: 818 KQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMM 854
K+ + + K ++KK + +K +KK+
Sbjct: 95 KKAISALKQKIKRDSAERKKAASSGSRAIEKLRKKLA 131
Score = 29.2 bits (66), Expect = 7.0
Identities = 12/40 (30%), Positives = 20/40 (50%)
Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLR 760
E+ + KK K+K K+ ++KK S+ + KLR
Sbjct: 88 SEELVQMKKAISALKQKIKRDSAERKKAASSGSRAIEKLR 127
>gnl|CDD|221122 pfam11490, DNA_pol3_alph_N, DNA polymerase III polC-type
N-terminus. This is an N-terminal domain of DNA
polymerase III polC subunit A that is found only in
Firmicutes. DNA polymerase polC-type III enzyme
functions as the 'replicase' in low G + C Gram-positive
bacteria. Purine asymmetry is a characteristic of
organisms with a heterodimeric DNA polymerase III
alpha-subunit constituted by polC which probably plays a
direct role in the maintenance of strand-biased gene
distribution; since, among prokaryotic genomes, the
distribution of genes on the leading and lagging strands
of the replication fork is known to be biased. The
domain is associated with DNA_pol3_alpha pfam07733.
Length = 180
Score = 30.8 bits (70), Expect = 2.1
Identities = 14/29 (48%), Positives = 20/29 (68%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKK 744
KEEEE K ++ + KKK+ +KKKK+K
Sbjct: 151 QKEEEEAKLAEEALEALKKKEAEKKKKEK 179
>gnl|CDD|235971 PRK07219, PRK07219, DNA topoisomerase I; Validated.
Length = 822
Score = 31.9 bits (73), Expect = 2.1
Identities = 10/35 (28%), Positives = 17/35 (48%)
Query: 714 YPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
P E++KK+K+ K + + K K +K K
Sbjct: 759 NPKCNYTEKQKKEKESKSELEALKGVGAKTAEKLK 793
Score = 31.5 bits (72), Expect = 2.6
Identities = 12/44 (27%), Positives = 19/44 (43%)
Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDA 762
EK+KK+K+ K + + K K +K E + L A
Sbjct: 763 NYTEKQKKEKESKSELEALKGVGAKTAEKLKDAGVETVTDLTAA 806
>gnl|CDD|225054 COG2143, COG2143, Thioredoxin-related protein [Posttranslational
modification, protein turnover, chaperones].
Length = 182
Score = 30.6 bits (69), Expect = 2.2
Identities = 13/40 (32%), Positives = 18/40 (45%)
Query: 813 GYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKK 852
GY+ P+Q + Y+ K K K + KK K K K
Sbjct: 134 GYMPPEQFLAVLKYVADGKYKDTKTDEDFTKKLKAYIKYK 173
>gnl|CDD|227935 COG5648, NHP6B, Chromatin-associated proteins containing the HMG
domain [Chromatin structure and dynamics].
Length = 211
Score = 31.0 bits (70), Expect = 2.2
Identities = 24/130 (18%), Positives = 51/130 (39%), Gaps = 21/130 (16%)
Query: 727 KKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCEPQIAPCLFDI 786
K +K K+ ++K+ K+ +++ E ++ ++ + P AP I
Sbjct: 102 GKLLSEKWKELTDEEKEPYYKEANSDRERYQREKE---------EYNKKLPNKAPIGPFI 152
Query: 787 KNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKK 846
+N+P R +V+ + + ++ I+ E + KKKK K KK
Sbjct: 153 ENEP------KIRPKVEGPS---PDKALVEE---TKIISKAWSELDESKKKKYIDKYKKL 200
Query: 847 KKKKKKMMKK 856
K++ +
Sbjct: 201 KEEYDSFYPE 210
Score = 30.6 bits (69), Expect = 3.0
Identities = 15/62 (24%), Positives = 24/62 (38%)
Query: 683 DPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKK 742
P +N R + + + V I+ E +E KKKK K KK K++
Sbjct: 149 GPFIENEPKIRPKVEGPSPDKALVEETKIISKAWSELDESKKKKYIDKYKKLKEEYDSFY 208
Query: 743 KK 744
+
Sbjct: 209 PE 210
Score = 29.1 bits (65), Expect = 8.1
Identities = 14/44 (31%), Positives = 18/44 (40%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
PDK EE K K + + KKKK K KK + +
Sbjct: 167 PDKALVEETKIISKAWSELDESKKKKYIDKYKKLKEEYDSFYPE 210
>gnl|CDD|202045 pfam01920, Prefoldin_2, Prefoldin subunit. This family includes
prefoldin subunits that are not detected by pfam02996.
Length = 106
Score = 29.5 bits (67), Expect = 2.2
Identities = 14/44 (31%), Positives = 29/44 (65%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKL 759
DKEE +E+ +++K+ +K+ K +K+ +K +K+ +E + KL
Sbjct: 59 DKEEVKEELEERKETLEKEIKTLEKQLEKLEKELEELKEELYKL 102
>gnl|CDD|180161 PRK05609, nusG, transcription antitermination protein NusG;
Validated.
Length = 181
Score = 30.5 bits (70), Expect = 2.3
Identities = 15/52 (28%), Positives = 22/52 (42%), Gaps = 9/52 (17%)
Query: 685 CEKN---NLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKK 733
EK NL +R E + ++ P +E E K KKKK ++K
Sbjct: 17 YEKKVKENLENRIETLGMED------LIGEVLVPTEEVVEVKNGKKKKVERK 62
>gnl|CDD|227474 COG5145, RAD14, DNA excision repair protein [DNA replication,
recombination, and repair].
Length = 292
Score = 31.1 bits (70), Expect = 2.3
Identities = 15/40 (37%), Positives = 28/40 (70%)
Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQME 866
LD+EK++++K K +K+KK +KK K++ +K S+M+
Sbjct: 214 LDREKQRREKMKDDRKEKKLEKKIKELRRKTRTSNYSRMD 253
Score = 30.7 bits (69), Expect = 3.2
Identities = 12/42 (28%), Positives = 25/42 (59%)
Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRD 761
+ EK++++K K +K+KK +KK K+ ++K +R+
Sbjct: 215 DREKQRREKMKDDRKEKKLEKKIKELRRKTRTSNYSRMDVRE 256
>gnl|CDD|216337 pfam01159, Ribosomal_L6e, Ribosomal protein L6e.
Length = 108
Score = 29.5 bits (67), Expect = 2.3
Identities = 15/38 (39%), Positives = 22/38 (57%), Gaps = 6/38 (15%)
Query: 825 AYLDKEKKKKKKKKK------KKKKKKKKKKKKKMMKK 856
Y +EKKKKKKK + KK+KK+ +++K K
Sbjct: 35 EYFKREKKKKKKKSEGEFFAEKKEKKEVSEQRKADQKA 72
Score = 29.1 bits (66), Expect = 3.6
Identities = 16/40 (40%), Positives = 26/40 (65%), Gaps = 6/40 (15%)
Query: 715 PDKEEEEEKKKKKKKKKKKKK------KKKKKKKKKKKKK 748
P+K +E K++KKKKKKK + KK+KK+ +++K
Sbjct: 29 PEKINDEYFKREKKKKKKKSEGEFFAEKKEKKEVSEQRKA 68
Score = 28.0 bits (63), Expect = 9.5
Identities = 14/34 (41%), Positives = 22/34 (64%), Gaps = 8/34 (23%)
Query: 723 KKKKKKKKKK--------KKKKKKKKKKKKKKKK 748
K++KKKKKKK KK+KK+ +++K +K
Sbjct: 38 KREKKKKKKKSEGEFFAEKKEKKEVSEQRKADQK 71
>gnl|CDD|240377 PTZ00352, PTZ00352, 60S ribosomal protein L13; Provisional.
Length = 212
Score = 30.8 bits (70), Expect = 2.3
Identities = 14/33 (42%), Positives = 19/33 (57%)
Query: 819 QRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKK 851
+ FN L + K K KK+KKK+ K+KKK
Sbjct: 180 RAFNAYRTLRQAKLNAKFVGKKEKKKQAKEKKK 212
>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
Length = 259
Score = 30.9 bits (70), Expect = 2.4
Identities = 14/36 (38%), Positives = 23/36 (63%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNE 752
E E+ KKK++ + KKKK+ ++ K +KK +Y E
Sbjct: 85 LEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKE 120
Score = 30.9 bits (70), Expect = 2.7
Identities = 10/32 (31%), Positives = 20/32 (62%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKK 746
P+ E+E + ++ KKK++ + KKKK+ +
Sbjct: 74 PEDEKELSASSLEAEQAKKKEEAEAKKKKEME 105
Score = 30.5 bits (69), Expect = 3.5
Identities = 11/34 (32%), Positives = 20/34 (58%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
E ++ KKK++ + KKKK+ ++ K +KK
Sbjct: 80 LSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKK 113
Score = 29.7 bits (67), Expect = 7.0
Identities = 9/33 (27%), Positives = 20/33 (60%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKK 747
++E+E + ++ KKK++ + KKKK+ +
Sbjct: 73 KPEDEKELSASSLEAEQAKKKEEAEAKKKKEME 105
Score = 29.3 bits (66), Expect = 9.4
Identities = 11/45 (24%), Positives = 26/45 (57%)
Query: 714 YPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
+ ++ E++K+ + ++ KKK++ + KKKK E + ++K
Sbjct: 68 FLSDQKPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQK 112
>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
Length = 925
Score = 31.6 bits (72), Expect = 2.5
Identities = 25/157 (15%), Positives = 59/157 (37%), Gaps = 23/157 (14%)
Query: 694 SEDQRINHYTTEVG--RFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSN 751
+ + R ++ E EEE+ K + ++++ + ++ + +++ +
Sbjct: 787 ITEILSDGINNSDINDRPQELIDQIIESEEERLKALRIQREEMLMRPEELELINEEQKNL 846
Query: 752 EEEGMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTE 811
++E +L + + +E+ + + KNDP + N L +V
Sbjct: 847 KQEIKLELSEIQEAE----EEIQNINENKNEFVEFKNDPKKLNKLIIAKDVL-------- 894
Query: 812 VGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKK 848
+ D+ K+ +K KKKKK +K
Sbjct: 895 ---------IKLVISSDEIKQDEKTTKKKKKDLEKTD 922
>gnl|CDD|220129 pfam09159, Ydc2-catalyt, Mitochondrial resolvase Ydc2 / RNA
splicing MRS1. Members of this family adopt a secondary
structure consisting of two beta sheets and one alpha
helix, arranged as a beta-alpha-beta motif. Each beta
sheet has five strands, arranged in a 32145 order, with
the second strand being antiparallel to the rest.
Mitochondrial resolvase Ydc2 is capable of resolving
Holliday junctions and cleaves DNA after 5'-CT-3' and
5'-TT-3' sequences. This family also contains the
mitochondrial RNA-splicing protein MRS1 which is
involved in the excision of group I introns.
Length = 254
Score = 30.8 bits (70), Expect = 2.5
Identities = 9/26 (34%), Positives = 14/26 (53%)
Query: 830 EKKKKKKKKKKKKKKKKKKKKKKMMK 855
+KKK + KKK K KK + ++
Sbjct: 152 LAEKKKPRSKKKSSKNSKKLRIDLVG 177
Score = 30.8 bits (70), Expect = 2.9
Identities = 10/28 (35%), Positives = 15/28 (53%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKK 743
++ E +KKK + KKK K KK +
Sbjct: 147 ERTEILAEKKKPRSKKKSSKNSKKLRID 174
Score = 30.0 bits (68), Expect = 4.4
Identities = 11/28 (39%), Positives = 14/28 (50%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKK 745
E E +KKK + KKK K KK +
Sbjct: 147 ERTEILAEKKKPRSKKKSSKNSKKLRID 174
Score = 30.0 bits (68), Expect = 4.6
Identities = 11/28 (39%), Positives = 14/28 (50%)
Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKK 746
E E +KKK + KKK K KK +
Sbjct: 147 ERTEILAEKKKPRSKKKSSKNSKKLRID 174
>gnl|CDD|218738 pfam05766, NinG, Bacteriophage Lambda NinG protein. NinG or Rap is
involved in recombination. Rap (recombination adept with
plasmid) increases lambda-by-plasmid recombination
catalyzed by Escherichia coli's RecBCD pathway.
Length = 188
Score = 30.4 bits (69), Expect = 2.6
Identities = 17/59 (28%), Positives = 32/59 (54%), Gaps = 7/59 (11%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE-----EGMRKLRDAAS--IQCG 768
K E+ ++KK+K + + ++++ K +K+K K + +E +LRDA I CG
Sbjct: 35 KREKAQEKKRKAEAQAERRELKARKEKLKTRSDWLKEAQAAVNKYIRLRDAGLPCISCG 93
>gnl|CDD|149438 pfam08374, Protocadherin, Protocadherin. The structure of
protocadherins is similar to that of classic cadherins
(pfam00028), but particularly on the cytoplasmic domains
they also have some unique features. They are expressed
in a variety of organisms and are found in high
concentrations in the brain where they seem to be
localised mainly at cell-cell contact sites. Their
expression seems to be developmentally regulated.
Length = 223
Score = 30.6 bits (69), Expect = 2.6
Identities = 15/39 (38%), Positives = 20/39 (51%)
Query: 713 AYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSN 751
Y ++E E ++ K+KKKKK KKKK K N
Sbjct: 71 GYQAGKKETEDWFSPNQENKQKKKKKDKKKKSPKSLLLN 109
>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
Length = 333
Score = 31.0 bits (70), Expect = 2.6
Identities = 31/164 (18%), Positives = 48/164 (29%), Gaps = 18/164 (10%)
Query: 699 INHYTTEVGR-FNQIAYPD-----KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNE 752
I+ T E+GR + + K + KK K K KK KKK K S +
Sbjct: 180 IDEVTREIGRVLAAVGAANAKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAK 239
Query: 753 EEGMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEV 812
+ + A + K + + K ++
Sbjct: 240 KAAKTAAKAAKKAKKTAKKALKKAAKAV------------KKAAKKAAKAAAKAAKGAAK 287
Query: 813 GYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
+ + K K K K+ K KK KK+ KK
Sbjct: 288 ATKGKAKAKKKAGKKAAAGSKAKATAKAPKRGAKGKKAKKVTKK 331
>gnl|CDD|218550 pfam05311, Baculo_PP31, Baculovirus 33KDa late protein (PP31).
Autographa californica nuclear polyhedrosis virus
(AcMNPV) pp31 is a nuclear phosphoprotein that
accumulates in the virogenic stroma, which is the viral
replication centre in the infected-cell nucleus, binds
to DNA, and serves as a late expression factor.
Length = 267
Score = 30.8 bits (70), Expect = 2.6
Identities = 10/31 (32%), Positives = 16/31 (51%)
Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKKKKKYS 750
+ +K+K KK KK KK+K+K+
Sbjct: 219 QSSSPSRKRKAPKKVAKKVAAAKKRKQKRSP 249
Score = 30.4 bits (69), Expect = 3.6
Identities = 11/33 (33%), Positives = 16/33 (48%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYS 750
+ +K+K KK KK KK+K+K S
Sbjct: 216 PTPQSSSPSRKRKAPKKVAKKVAAAKKRKQKRS 248
Score = 30.0 bits (68), Expect = 5.7
Identities = 10/34 (29%), Positives = 16/34 (47%)
Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNE 752
+ +K+K KK KK KK+K+K+
Sbjct: 219 QSSSPSRKRKAPKKVAKKVAAAKKRKQKRSPPPP 252
Score = 29.7 bits (67), Expect = 6.1
Identities = 10/38 (26%), Positives = 16/38 (42%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNE 752
+ +K+K KK KK KK+K+K+
Sbjct: 217 TPQSSSPSRKRKAPKKVAKKVAAAKKRKQKRSPPPPFN 254
>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
Length = 186
Score = 30.5 bits (69), Expect = 2.6
Identities = 11/61 (18%), Positives = 24/61 (39%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPC 775
K ++ KK + +KK + + + +KK + + + + P +E P
Sbjct: 122 SKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAAEEAAAAEEEEAEEAPAEEAPA 181
Query: 776 E 776
E
Sbjct: 182 E 182
Score = 29.0 bits (65), Expect = 7.5
Identities = 15/32 (46%), Positives = 18/32 (56%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
KEE+E K KK K K KK KK + +KK
Sbjct: 107 KEEKEGKVNAKKDKLSKAKKAAKKAALEAEKK 138
>gnl|CDD|240309 PTZ00189, PTZ00189, 60S ribosomal protein L21; Provisional.
Length = 160
Score = 30.4 bits (69), Expect = 2.7
Identities = 12/63 (19%), Positives = 23/63 (36%), Gaps = 7/63 (11%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVP 774
+ E+ K+ K + KKK + K++ + R A ++ P + V
Sbjct: 98 KSRCREDFLKRVAANDAIKAEAKKKGELPSTKRQPAQP-------RPAGFVKPKPTEVVT 150
Query: 775 CEP 777
P
Sbjct: 151 VRP 153
>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
subunit [Translation, ribosomal structure and
biogenesis].
Length = 591
Score = 31.2 bits (70), Expect = 2.8
Identities = 11/27 (40%), Positives = 19/27 (70%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMM 854
DK+ K K KK+K +++++KK K +M
Sbjct: 522 DKDVNKSKNKKRKVDEEEEEKKLKMIM 548
Score = 30.0 bits (67), Expect = 6.6
Identities = 19/47 (40%), Positives = 22/47 (46%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAA 763
EEEEEKK K K+KK KK K KK E +K + A
Sbjct: 536 DEEEEEKKLKMIMMSNKQKKLYKKMKYSNAKKEEQAENLKKKKKQIA 582
Score = 29.7 bits (66), Expect = 9.1
Identities = 13/39 (33%), Positives = 24/39 (61%)
Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
E +K K K KK+K +++++KK K SN+++ + K
Sbjct: 520 EADKDVNKSKNKKRKVDEEEEEKKLKMIMMSNKQKKLYK 558
>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
Validated.
Length = 559
Score = 31.4 bits (72), Expect = 2.8
Identities = 33/176 (18%), Positives = 71/176 (40%), Gaps = 24/176 (13%)
Query: 697 QRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE--- 753
QR+ E+ + E++EKKK+KKK KKKK K + K K K+ + ++
Sbjct: 376 QRVEQLEQELKQLKAQPVGVAPEQKEKKKEKKKNKKKKYKVPRGKIYKVLKEATRQDLEL 435
Query: 754 ---------EGMRKLR--DAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADR--- 799
E ++ R A + + V F+ EK L
Sbjct: 436 LKNVWGEILESLKAQRKSLRALLVNS--EPVAASEDTVVLAFE-YEIHFEKAMLDKELND 492
Query: 800 ---SEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKK 852
+ + ++ + E+ + P+ + +I +K K ++++++++++ + K
Sbjct: 493 TIENILSKLLGKSVEIVAV-PEDEWQKIREEFLQKHKNEEEEEEEEEELPLIPEAK 547
Score = 30.2 bits (69), Expect = 6.2
Identities = 18/51 (35%), Positives = 27/51 (52%), Gaps = 1/51 (1%)
Query: 818 KQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEKE 868
+Q Q+ ++K+KKK+KKK KKKK K K Y VL + ++
Sbjct: 382 EQELKQLKAQPVGVAPEQKEKKKEKKKNKKKKYKVPRGKIYK-VLKEATRQ 431
>gnl|CDD|183703 PRK12724, PRK12724, flagellar biosynthesis regulator FlhF;
Provisional.
Length = 432
Score = 31.1 bits (70), Expect = 2.9
Identities = 31/166 (18%), Positives = 72/166 (43%), Gaps = 28/166 (16%)
Query: 711 QIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPV 770
QI P+K+ EK ++K + K+ K+K + ++KK + +L +
Sbjct: 53 QIGIPEKQASREKIERKLQDLKELLKQKSYTEPERKKTLQTLKPLSERLEE--------- 103
Query: 771 KEVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKE 830
KE + L ++ +P +R VG K+ F + ++L+ E
Sbjct: 104 KESAIYEIESFELEEVITEP-------ERP-----------VGLSFEKELFEKNSFLESE 145
Query: 831 KKKKKKKKKKKKKKKKKKKKKKMMKKGY-PDVLSQMEKELANINRT 875
+K+K ++ ++ ++ M + Y ++ S++E+ L+ +++
Sbjct: 146 TTIVRKEKDSPLQRLGERLVREGMSQSYVEEMASKLEERLSPVDQG 191
>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62.
Length = 217
Score = 30.5 bits (69), Expect = 2.9
Identities = 10/42 (23%), Positives = 15/42 (35%)
Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRD 761
E EK K K K + K + K +K+ + R
Sbjct: 17 ESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERV 58
Score = 30.5 bits (69), Expect = 3.3
Identities = 9/44 (20%), Positives = 18/44 (40%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKL 759
++ + K K + K + K +K K + E ++KL
Sbjct: 18 SEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKL 61
Score = 29.8 bits (67), Expect = 4.8
Identities = 13/24 (54%), Positives = 14/24 (58%)
Query: 721 EEKKKKKKKKKKKKKKKKKKKKKK 744
K K KK KK KKKKKKK+
Sbjct: 194 HYKGDKSSAKKDKKSKKKKKKKRS 217
Score = 29.4 bits (66), Expect = 6.6
Identities = 12/24 (50%), Positives = 15/24 (62%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKK 741
+ +K KK KK KKKKKKK+
Sbjct: 194 HYKGDKSSAKKDKKSKKKKKKKRS 217
Score = 29.0 bits (65), Expect = 9.4
Identities = 14/44 (31%), Positives = 22/44 (50%), Gaps = 11/44 (25%)
Query: 716 DKEEEEEKKKKKKKKKKK-----------KKKKKKKKKKKKKKK 748
+K ++K +K K K ++KK++KKK KKKK
Sbjct: 34 NKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKKK 77
Score = 29.0 bits (65), Expect = 9.9
Identities = 19/44 (43%), Positives = 23/44 (52%), Gaps = 4/44 (9%)
Query: 809 TTEVGYLDPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKK 852
T +VG+LD + Y K K KK KK KKKKKKK+
Sbjct: 178 TEDVGFLDSFKPL----YTWHYKGDKSSAKKDKKSKKKKKKKRS 217
>gnl|CDD|99975 cd03802, GT1_AviGT4_like, This family is most closely related to
the GT1 family of glycosyltransferases. aviGT4 in
Streptomyces viridochromogenes has been shown to be
involved in biosynthesis of oligosaccharide antibiotic
avilamycin A. Inactivation of aviGT4 resulted in a
mutant that accumulated a novel avilamycin derivative
lacking the terminal eurekanate residue.
Length = 335
Score = 31.0 bits (71), Expect = 2.9
Identities = 12/41 (29%), Positives = 13/41 (31%), Gaps = 4/41 (9%)
Query: 227 AEAVDIIHNHSTDEPLFLYLAHA----ATHSANPYEPLQAP 263
A DI+HNHS PL T P L
Sbjct: 85 AGDFDIVHNHSLHLPLPFARPLPVPVVTTLHGPPDPELLKL 125
>gnl|CDD|191968 pfam08213, DUF1713, Mitochondrial domain of unknown function
(DUF1713). This domain is found at the C terminal end
of mitochondrial proteins of unknown function.
Length = 34
Score = 27.3 bits (61), Expect = 3.0
Identities = 9/29 (31%), Positives = 18/29 (62%)
Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSN 751
+K+++K KK K KK +K+ + ++K
Sbjct: 6 LRKRRRKMKKHKYKKLRKRTRALRRKLDK 34
Score = 26.2 bits (58), Expect = 8.4
Identities = 11/29 (37%), Positives = 19/29 (65%)
Query: 827 LDKEKKKKKKKKKKKKKKKKKKKKKKMMK 855
L K ++K KK K KK +K+ + ++K+ K
Sbjct: 6 LRKRRRKMKKHKYKKLRKRTRALRRKLDK 34
>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein. This family includes proteins
related to Mpp10 (M phase phosphoprotein 10). The U3
small nucleolar ribonucleoprotein (snoRNP) is required
for three cleavage events that generate the mature 18S
rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
depletion of Mpp10, a U3 snoRNP-specific protein, halts
18S rRNA production and impairs cleavage at the three U3
snoRNP-dependent sites.
Length = 613
Score = 31.1 bits (70), Expect = 3.0
Identities = 14/55 (25%), Positives = 29/55 (52%), Gaps = 2/55 (3%)
Query: 705 EVGRFNQIAYPDKEE--EEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMR 757
E+ + + K E +K ++++KKK+K++ KK K KK++ +E+
Sbjct: 539 EIYKKMKAIEKSKTELDRTDKNRERRKKKRKRRAAKKAVTKAKKERKIGKEKVDG 593
Score = 30.0 bits (67), Expect = 8.0
Identities = 13/48 (27%), Positives = 26/48 (54%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASI 765
E + K ++++KKK+K++ KK K KK+ +E + + +S
Sbjct: 553 ELDRTDKNRERRKKKRKRRAAKKAVTKAKKERKIGKEKVDGVAKKSSS 600
>gnl|CDD|171438 PRK12363, PRK12363, phosphoglycerol transferase I; Provisional.
Length = 703
Score = 31.0 bits (70), Expect = 3.0
Identities = 64/272 (23%), Positives = 98/272 (36%), Gaps = 49/272 (18%)
Query: 59 PPHIIFILADDLG---WNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
+I++I + L +++ F GL PN+ LA + ++N + T +
Sbjct: 156 RKNIVWIYGESLERTYFDEDVFPGL----MPNLTRLATEAVDVRNLAS----TEGSGWTI 207
Query: 116 TGKHPIHTGMQHNVLYGCE----RGGLPLSE-KILPQYLKELGYRTRIVGKWHLGFYKKE 170
G G+ G E R G L E + L YLK+ GY VG F K
Sbjct: 208 AGMVASMCGVPLTTAQGDENSMDRMGHFLPEARCLGDYLKDQGYTNHYVGGADASFAGKG 267
Query: 171 YTPTFRGFESHLGYWTGHQ-DYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
+ SH G+ H +YF H + AW +H DV +A
Sbjct: 268 -----KFLSSH-GFDEVHDVNYFLHDK-------GVAPKHFS-AWGVH----DDVLLDDA 309
Query: 230 VDIIHNHS-TDEPLFLYLAHAATHSANPYEPLQAPDH-YLNIHRHIEDFKRSKFAAILHK 287
D S +P L TH + P Y + I +LH
Sbjct: 310 YDEFETLSRAGQPFMLTTLTMDTHHPAGHLPSACKGQRYDSPLGDI---------GMLHA 360
Query: 288 L---DESVGKVVEALEQRRMLSNSIIVFVSDN 316
+ D +G++V+ + R N+IIV SD+
Sbjct: 361 IKCSDRLIGQLVDRIRNSRYGKNTIIVIASDH 392
>gnl|CDD|191868 pfam07839, CaM_binding, Plant calmodulin-binding domain. The
sequences featured in this family are found repeated in
a number of plant calmodulin-binding proteins, and are
thought to constitute the calmodulin-binding domains.
Binding of the proteins to calmodulin depends on the
presence of calcium ions. These proteins are thought to
be involved in various processes, such as plant defence
responses and stolonisation or tuberization.
Length = 115
Score = 29.3 bits (66), Expect = 3.0
Identities = 15/44 (34%), Positives = 21/44 (47%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLR 760
KEE KK+ K +K+K K KK K++ E +RK
Sbjct: 5 KEERRSKKEDPKPEKEKPKSWSNLKKVILLKRFVKSLEKVRKFN 48
>gnl|CDD|219748 pfam08214, KAT11, Histone acetylation protein. Histone acetylation
is required in many cellular processes including
transcription, DNA repair, and chromatin assembly. This
family contains the fungal KAT11 protein (previously
known as RTT109) which is required for H3K56
acetylation. Loss of KAT11 results in the loss of H3K56
acetylation, both on bulk histone and on chromatin.
KAT11 and H3K56 acetylation appear to correlate with
actively transcribed genes and associate with the
elongating form of Pol II in yeast. This family also
incorporates the p300/CBP histone acetyltransferase
domain which has different catalytic properties and
cofactor regulation to KAT11.
Length = 341
Score = 30.8 bits (70), Expect = 3.1
Identities = 11/53 (20%), Positives = 17/53 (32%)
Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEV 773
K KK+ K+ KK ++ S ++ L D KE
Sbjct: 271 VINPPKLVVSKKQYKRAKKAITGEEYDTLSGAASLVQNLMDKLYETMLKKKED 323
>gnl|CDD|233955 TIGR02627, rhamnulo_kin, rhamnulokinase. This model describes
rhamnulokinase, an enzyme that catalyzes the second step
in rhamnose catabolism.
Length = 454
Score = 30.9 bits (70), Expect = 3.1
Identities = 18/79 (22%), Positives = 29/79 (36%), Gaps = 16/79 (20%)
Query: 440 RYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSEN 499
YEN + + +R++NG +NG ++I LE I N
Sbjct: 16 SYENECQKLTLEEIHRFKNGLV-------------SQNGHECWDIDALEQEIR---LGLN 59
Query: 500 RSNDNSYQNEIDGIDVWSV 518
+ + + GID W V
Sbjct: 60 KVDAEGIAPDSIGIDTWGV 78
>gnl|CDD|240709 cd12263, RRM_ABT1_like, RNA recognition motif found in activator of
basal transcription 1 (ABT1) and similar proteins. This
subfamily corresponds to the RRM of novel nuclear
proteins termed ABT1 and its homologous counterpart,
pre-rRNA-processing protein ESF2 (eighteen S factor 2),
from yeast. ABT1 associates with the TATA-binding
protein (TBP) and enhances basal transcription activity
of class II promoters. Meanwhile, ABT1 could be a
transcription cofactor that can bind to DNA in a
sequence-independent manner. The yeast ABT1 homolog,
ESF2, is a component of 90S preribosomes and 5'
ETS-based RNPs. It is previously identified as a
putative partner of the TATA-element binding protein.
However, it is primarily localized to the nucleolus and
physically associates with pre-rRNA processing factors.
ESF2 may play a role in ribosome biogenesis. It is
required for normal pre-rRNA processing, as well as for
SSU processome assembly and function. Both ABT1 and ESF2
contain an RNA recognition motif (RRM), also termed RBD
(RNA binding domain) or RNP (ribonucleoprotein domain).
.
Length = 98
Score = 29.1 bits (66), Expect = 3.2
Identities = 15/39 (38%), Positives = 21/39 (53%), Gaps = 8/39 (20%)
Query: 826 YLDKEKKKKKKKKKKKKKKKKKK--------KKKKMMKK 856
YL E K+K++KKK KKKK + KK+ K+
Sbjct: 31 YLQPEDPAKRKRRKKKGGNKKKKFTEGWVEFEDKKVAKR 69
Score = 27.6 bits (62), Expect = 8.8
Identities = 9/26 (34%), Positives = 16/26 (61%)
Query: 833 KKKKKKKKKKKKKKKKKKKKMMKKGY 858
+ + K+K++KKK KKK +G+
Sbjct: 33 QPEDPAKRKRRKKKGGNKKKKFTEGW 58
>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987). A
family of uncharacterized proteins found by clustering
human gut metagenomic sequences.
Length = 379
Score = 30.7 bits (70), Expect = 3.4
Identities = 15/64 (23%), Positives = 30/64 (46%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPC 775
+ EE E++ K+ + +K+ + +KK +KK KK + + L + E P
Sbjct: 69 ELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEELLELEAEEPEPPL 128
Query: 776 EPQI 779
P++
Sbjct: 129 RPRL 132
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 31.1 bits (71), Expect = 3.4
Identities = 17/62 (27%), Positives = 31/62 (50%), Gaps = 12/62 (19%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKK----------KKKKKKKKKKYSNEEEGMRKLRDAAS 764
PD+E E K K + KK++K + ++K K+KK+K + + M+++
Sbjct: 744 PDRERAELNKLKAQLKKERKGAMRELRKDNRFIAREKLKEKKEKDAEYHKKMKRI--VNE 801
Query: 765 IQ 766
IQ
Sbjct: 802 IQ 803
>gnl|CDD|233160 TIGR00869, sec62, protein translocation protein, Sec62 family.
Members of the NSCC2 family have been sequenced from
various yeast, fungal and animals species including
Saccharomyces cerevisiae, Drosophila melanogaster and
Homo sapiens. These proteins are the Sec62 proteins,
believed to be associated with the Sec61 and Sec63
constituents of the general protein secretary systems of
yeast microsomes. They are also the non-selective cation
(NS) channels of the mammalian cytoplasmic membrane. The
yeast Sec62 protein has been shown to be essential for
cell growth. The mammalian NS channel proteins has been
implicated in platelet derived growth factor(PGDF)
dependent single channel current in fibroblasts. These
channels are essentially closed in serum deprived
tissue-culture cells and are specifically opened by
exposure to PDGF. These channels are reported to exhibit
equal selectivity for Na+, K+ and Cs+ with low
permeability to Ca2+, and no permeability to anions
[Transport and binding proteins, Amino acids, peptides
and amines].
Length = 232
Score = 30.3 bits (68), Expect = 3.5
Identities = 13/25 (52%), Positives = 16/25 (64%)
Query: 727 KKKKKKKKKKKKKKKKKKKKKKYSN 751
K K KKK K KK KKK+ K+ +N
Sbjct: 208 KDKYSYKKKLKSKKLKKKQAKREAN 232
Score = 29.1 bits (65), Expect = 8.2
Identities = 13/23 (56%), Positives = 15/23 (65%)
Query: 722 EKKKKKKKKKKKKKKKKKKKKKK 744
EK K KKK K KK KKK+ K+
Sbjct: 207 EKDKYSYKKKLKSKKLKKKQAKR 229
Score = 29.1 bits (65), Expect = 8.2
Identities = 13/23 (56%), Positives = 15/23 (65%)
Query: 830 EKKKKKKKKKKKKKKKKKKKKKK 852
EK K KKK K KK KKK+ K+
Sbjct: 207 EKDKYSYKKKLKSKKLKKKQAKR 229
>gnl|CDD|221712 pfam12687, DUF3801, Protein of unknown function (DUF3801). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are typically between 158 and 187 amino
acids in length. This family includes the PcfB protein.
Length = 137
Score = 29.5 bits (67), Expect = 3.5
Identities = 14/26 (53%), Positives = 19/26 (73%)
Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKK 748
KK KK K+++ KKK++K K K KKK
Sbjct: 112 KKLKKFKEEQAAKKKERKDKVKNKKK 137
Score = 29.1 bits (66), Expect = 4.6
Identities = 12/24 (50%), Positives = 17/24 (70%)
Query: 827 LDKEKKKKKKKKKKKKKKKKKKKK 850
L K K+++ KKK++K K K KKK
Sbjct: 114 LKKFKEEQAAKKKERKDKVKNKKK 137
>gnl|CDD|234533 TIGR04285, nucleoid_noc, nucleoid occlusion protein. This model
describes nucleoid occlusion protein, a close homolog to
ParB chromosome partitioning proteins including Spo0J in
Bacillus subtilis. Its gene often is located near the
gene for the Spo0J ortholog. This protein bind a
specific DNA sequence and blocks cytokinesis from
happening until chromosome segregation is complete.
Length = 255
Score = 30.6 bits (70), Expect = 3.5
Identities = 13/26 (50%), Positives = 18/26 (69%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKK 742
K+ EE KK +K +K KKKKK++K
Sbjct: 185 KQTEELIKKLLEKPEKPKKKKKRRKG 210
Score = 30.2 bits (69), Expect = 4.4
Identities = 12/26 (46%), Positives = 17/26 (65%)
Query: 830 EKKKKKKKKKKKKKKKKKKKKKKMMK 855
E+ KK +K +K KKKKK++K K
Sbjct: 188 EELIKKLLEKPEKPKKKKKRRKGFSK 213
Score = 29.4 bits (67), Expect = 8.5
Identities = 12/25 (48%), Positives = 17/25 (68%)
Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKK 743
+ EE KK +K +K KKKKK++K
Sbjct: 186 QTEELIKKLLEKPEKPKKKKKRRKG 210
>gnl|CDD|176075 cd08693, C2_PI3K_class_I_beta_delta, C2 domain present in class I
beta and delta phosphatidylinositol 3-kinases (PI3Ks).
PI3Ks (AKA phosphatidylinositol (PtdIns) 3-kinases)
regulate cell processes such as cell growth,
differentiation, proliferation, and motility. PI3Ks
work on phosphorylation of phosphatidylinositol,
phosphatidylinositide (4)P (PtdIns (4)P),2 or
PtdIns(4,5)P2. Specifically they phosphorylate the D3
hydroxyl group of phosphoinositol lipids on the inositol
ring. There are 3 classes of PI3Ks based on structure,
regulation, and specificity. All classes contain a C2
domain, a PIK domain, and a kinase catalytic domain.
The members here are class I, beta and delta isoforms of
PI3Ks and contain both a Ras-binding domain and a
p85-binding domain. Class II PI3Ks contain both of
these as well as a PX domain, and a C-terminal C2 domain
containing a nuclear localization signal. C2 domains
fold into an 8-standed beta-sandwich that can adopt 2
structural arrangements: Type I and Type II,
distinguished by a circular permutation involving their
N- and C-terminal beta strands. Many C2 domains are
Ca2+-dependent membrane-targeting modules that bind a
wide variety of substances including bind phospholipids,
inositol polyphosphates, and intracellular proteins.
Most C2 domain proteins are either signal transduction
enzymes that contain a single C2 domain, such as protein
kinase C, or membrane trafficking proteins which contain
at least two C2 domains, such as synaptotagmin 1.
However, there are a few exceptions to this including
RIM isoforms and some splice variants of piccolo/aczonin
and intersectin which only have a single C2 domain. C2
domains with a calcium binding region have negatively
charged residues, primarily aspartates, that serve as
ligands for calcium ions. Members have a type-I
topology.
Length = 173
Score = 30.0 bits (68), Expect = 3.5
Identities = 12/24 (50%), Positives = 15/24 (62%)
Query: 830 EKKKKKKKKKKKKKKKKKKKKKKM 853
E KK K K+ +K + KKKKKK
Sbjct: 84 EVSKKAKGKRSRKNQTKKKKKKDD 107
Score = 30.0 bits (68), Expect = 3.5
Identities = 11/23 (47%), Positives = 14/23 (60%)
Query: 832 KKKKKKKKKKKKKKKKKKKKKMM 854
KK K K+ +K + KKKKKK
Sbjct: 85 VSKKAKGKRSRKNQTKKKKKKDD 107
Score = 30.0 bits (68), Expect = 3.6
Identities = 12/22 (54%), Positives = 15/22 (68%)
Query: 722 EKKKKKKKKKKKKKKKKKKKKK 743
E KK K K+ +K + KKKKKK
Sbjct: 84 EVSKKAKGKRSRKNQTKKKKKK 105
Score = 29.2 bits (66), Expect = 6.0
Identities = 12/24 (50%), Positives = 15/24 (62%)
Query: 728 KKKKKKKKKKKKKKKKKKKKKYSN 751
KK K K+ +K + KKKKKK N
Sbjct: 85 VSKKAKGKRSRKNQTKKKKKKDDN 108
>gnl|CDD|116948 pfam08367, M16C_assoc, Peptidase M16C associated. This domain
appears in eukaryotes as well as bacteria and tends to
be found near the C-terminus of the metalloprotease M16C
(pfam05193).
Length = 248
Score = 30.3 bits (69), Expect = 3.6
Identities = 12/48 (25%), Positives = 26/48 (54%), Gaps = 2/48 (4%)
Query: 714 YPDKEEEEEKKKKKKKKKKKKKKKKKK--KKKKKKKKYSNEEEGMRKL 759
+K EEE++K +KKK ++ K+K ++ + ++ + E + L
Sbjct: 7 LEEKLNEEEREKLEKKKSSLSEEDKEKIIERGLELEELQSTPEDLSCL 54
>gnl|CDD|221466 pfam12220, U1snRNP70_N, U1 small nuclear ribonucleoprotein of 70kDa
MW N terminal. This domain is found in eukaryotes. This
domain is about 90 amino acids in length. This domain is
found associated with pfam00076. This domain is part of
U1 snRNP, which is the pre-mRNA binding protein of the
penta-snRNP spliceosome complex. It extends over a
distance of 180 A from its RNA binding domain, wraps
around the core domain of U1 snRNP consisting of the
seven Sm proteins and finally contacts U1-C, which is
crucial for 5'-splice-site recognition.
Length = 94
Score = 28.8 bits (65), Expect = 3.6
Identities = 11/35 (31%), Positives = 25/35 (71%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
P E E +K++++K++KK+K +KK +++ K++
Sbjct: 54 PPPEPTETWLEKREREKREKKEKLEKKLEEELKEW 88
>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 926
Score = 31.1 bits (70), Expect = 3.6
Identities = 17/60 (28%), Positives = 22/60 (36%), Gaps = 14/60 (23%)
Query: 712 IAYPDKEEEEEKK--------------KKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMR 757
YP EE K K + K+K K+K K+ K + S E EG R
Sbjct: 765 FEYPRMYEEASGKSVRKVNTAVLSTTIKAAARAKQKPKEKGPNDKEIKIESPSVETEGER 824
>gnl|CDD|233605 TIGR01865, cas_Csn1, CRISPR-associated protein Cas9/Csn1, subtype
II/NMEMI. CRISPR loci appear to be mobile elements with
a wide host range. This model represents a protein found
only in CRISPR-containing species, near other
CRISPR-associated proteins (cas), as part of the NMENI
subtype of CRISPR/Cas locus. The species range so far
for this protein is animal pathogens and commensals only
[Mobile and extrachromosomal element functions, Other].
Length = 805
Score = 30.9 bits (70), Expect = 3.7
Identities = 9/35 (25%), Positives = 16/35 (45%)
Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
EE+ K+ K++ KK + K K+ +E
Sbjct: 525 EEQGTNFGKRNSKERYKKNEDKIKEFASALGKEIL 559
>gnl|CDD|218437 pfam05107, DUF694, Family of unknown function (DUF694). Family of
hypothetical bacterial proteins.
Length = 278
Score = 30.3 bits (69), Expect = 3.8
Identities = 17/33 (51%), Positives = 24/33 (72%), Gaps = 1/33 (3%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDV 861
E+ K+KKKKKKKKKKK++ ++ M +K DV
Sbjct: 82 AEELLKEKKKKKKKKKKKEELREWMCEKFI-DV 113
>gnl|CDD|218078 pfam04419, 4F5, 4F5 protein family. Members of this family are
short proteins that are rich in aspartate, glutamate,
lysine and arginine. Although the function of these
proteins is unknown, they are found to be ubiquitously
expressed.
Length = 38
Score = 27.2 bits (61), Expect = 4.0
Identities = 11/26 (42%), Positives = 13/26 (50%)
Query: 722 EKKKKKKKKKKKKKKKKKKKKKKKKK 747
E+ KK KKKKK K + K K
Sbjct: 12 ERNAKKAAKKKKKGAKSQLKAAAKAL 37
Score = 26.4 bits (59), Expect = 6.6
Identities = 11/29 (37%), Positives = 15/29 (51%), Gaps = 2/29 (6%)
Query: 827 LDKEK--KKKKKKKKKKKKKKKKKKKKKM 853
+E+ KK KKKKK K + K K +
Sbjct: 9 QARERNAKKAAKKKKKGAKSQLKAAAKAL 37
>gnl|CDD|223898 COG0828, RpsU, Ribosomal protein S21 [Translation, ribosomal
structure and biogenesis].
Length = 67
Score = 28.1 bits (63), Expect = 4.0
Identities = 10/28 (35%), Positives = 20/28 (71%)
Query: 722 EKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
EK +K+K+KK +K+K K+ +K+++
Sbjct: 39 EKPSEKRKRKKAAARKRKFKRLRKEQQR 66
Score = 27.3 bits (61), Expect = 7.0
Identities = 9/30 (30%), Positives = 20/30 (66%)
Query: 826 YLDKEKKKKKKKKKKKKKKKKKKKKKKMMK 855
+ +K +K+K+KK +K+K K+ +K+ +
Sbjct: 37 FYEKPSEKRKRKKAAARKRKFKRLRKEQQR 66
>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34. This family represents
herpes virus protein U79 and cytomegalovirus early
phosphoprotein P34 (UL112).
Length = 238
Score = 30.2 bits (68), Expect = 4.2
Identities = 10/55 (18%), Positives = 27/55 (49%)
Query: 707 GRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRD 761
R + + E+ +K K+ ++KK+++K++ + K+ S+ + +D
Sbjct: 161 KRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRNDEDKRPGGGGGSSGGQSGLSTKD 215
>gnl|CDD|151665 pfam11223, DUF3020, Protein of unknown function (DUF3020). This
family of fungal proteins is conserved towards the
C-terminus of HMG domains. The function is not known.
Length = 49
Score = 27.2 bits (60), Expect = 4.3
Identities = 12/38 (31%), Positives = 18/38 (47%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
E K + + KKK KKK K+ K+K ++E
Sbjct: 11 ANSERNKDNDLRSRVKKKAKKKFGKEDSKEKDAYIDDE 48
>gnl|CDD|215856 pfam00318, Ribosomal_S2, Ribosomal protein S2.
Length = 205
Score = 29.8 bits (68), Expect = 4.3
Identities = 16/50 (32%), Positives = 27/50 (54%), Gaps = 1/50 (2%)
Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKE 772
K K+ ++ KK+ K K++ +K +KY + M+KL D + P KE
Sbjct: 101 KSIKRLEELSKKEALKLKRELEKLEKYLGGIKNMKKLPDLVIV-VDPNKE 149
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 30.8 bits (69), Expect = 4.4
Identities = 13/48 (27%), Positives = 33/48 (68%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS 764
K EE +K K++ ++K +++++++K+K+K+++ E E R + ++S
Sbjct: 580 KREEAVEKAKREAEQKAREEREREKEKEKEREREREREAERAAKASSS 627
>gnl|CDD|197773 smart00527, HMG17, domain in high mobilty group proteins HMG14 and
HMG 17.
Length = 88
Score = 28.4 bits (63), Expect = 4.5
Identities = 15/40 (37%), Positives = 21/40 (52%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
P K E + KK KKK + K KK K K+ + + +EE
Sbjct: 29 PPKPEPKPKKAAAKKKSENVAAKGKKGAKGKQTEEAGKEE 68
>gnl|CDD|218660 pfam05620, DUF788, Protein of unknown function (DUF788). This
family consists of several eukaryotic proteins of
unknown function.
Length = 166
Score = 29.6 bits (67), Expect = 4.5
Identities = 6/26 (23%), Positives = 14/26 (53%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKK 740
+ E + K K+++K +K+ +K
Sbjct: 141 LPSSQGAETNETKSKRQEKLEKRGEK 166
>gnl|CDD|237619 PRK14135, recX, recombination regulator RecX; Provisional.
Length = 263
Score = 30.2 bits (69), Expect = 4.5
Identities = 9/35 (25%), Positives = 18/35 (51%)
Query: 831 KKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQM 865
+K KK +K K K+K + ++ KG+ + +
Sbjct: 163 EKLLKKYQKLPFKALKQKIIQSLLTKGFSYEVIKA 197
>gnl|CDD|178262 PLN02656, PLN02656, tyrosine transaminase.
Length = 409
Score = 30.3 bits (68), Expect = 4.6
Identities = 17/73 (23%), Positives = 36/73 (49%), Gaps = 6/73 (8%)
Query: 603 WQISALTKGKWKLVKVVKVMRYQVDLTGGPD---QVYLSGL---SDREWLALAMRKLRDA 656
W ++ G ++ K+V+ ++ D+ GGP Q + + +D + + L+ +
Sbjct: 252 WFVTTDPSGSFRDPKIVERIKKYFDILGGPATFIQAAVPTILEQTDESFFKKTINILKQS 311
Query: 657 ASIQCGPVKEVPC 669
+ I C +KE+PC
Sbjct: 312 SDICCDRIKEIPC 324
>gnl|CDD|218684 pfam05672, MAP7, MAP7 (E-MAP-115) family. The organisation of
microtubules varies with the cell type and is presumably
controlled by tissue-specific microtubule-associated
proteins (MAPs). The 115-kDa epithelial MAP
(E-MAP-115/MAP7) has been identified as a
microtubule-stabilising protein predominantly expressed
in cell lines of epithelial origin. The binding of this
microtubule associated protein is nucleotide
independent.
Length = 171
Score = 29.7 bits (66), Expect = 4.7
Identities = 12/47 (25%), Positives = 34/47 (72%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAA 763
++EEE ++K++K K+K ++++K++++++++ +EE + R+ A
Sbjct: 77 RQEEERAREKEEKAKRKAEEEEKQEQEEQERIQKQKEEAEARAREEA 123
>gnl|CDD|240292 PTZ00148, PTZ00148, 40S ribosomal protein S8; Provisional.
Length = 205
Score = 30.0 bits (68), Expect = 4.7
Identities = 20/74 (27%), Positives = 31/74 (41%), Gaps = 12/74 (16%)
Query: 700 NHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKL 759
HY ++G K+ ++E++ KK KKK + K + +KK K E +
Sbjct: 115 KHYGIDLG-------KKKKYKKEEENKKAKKKSESLVAKLEVRKKNHKIDKALLEQFQSG 167
Query: 760 RDAASI-----QCG 768
R A I Q G
Sbjct: 168 RLLARISSRPGQSG 181
>gnl|CDD|222649 pfam14284, PcfJ, PcfJ-like protein. The PcfJ-like protein family
includes the E. faecalis PcfJ protein, which is
functionally uncharacterized. It is found in bacteria
and viruses, and is typically between 159 and 170 amino
acids in length. There is a conserved HCV sequence
motif.
Length = 169
Score = 29.6 bits (67), Expect = 4.7
Identities = 8/38 (21%), Positives = 24/38 (63%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEE 753
+ ++EK+K +K +K+ + +K ++ K K ++++ +
Sbjct: 37 KYQAKKEKEKAEKMRKRYPELEKIFRELKSKFEFTDGD 74
>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein. This family
consists of various plant development proteins which are
homologues of floricaula (FLO) and Leafy (LFY) proteins
which are floral meristem identity proteins. Mutations
in the sequences of these proteins affect flower and
leaf development.
Length = 382
Score = 30.4 bits (69), Expect = 4.8
Identities = 9/35 (25%), Positives = 21/35 (60%)
Query: 722 EKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGM 756
+KKK++KK+++K+ K+ ++ +E+E
Sbjct: 176 GHSSDSEKKKQRKKQRRKRSKELREDDDDDEDEDD 210
>gnl|CDD|218337 pfam04939, RRS1, Ribosome biogenesis regulatory protein (RRS1).
This family consists of several eukaryotic ribosome
biogenesis regulatory (RRS1) proteins. RRS1 is a nuclear
protein that is essential for the maturation of 25 S
rRNA and the 60 S ribosomal subunit assembly in
Saccharomyces cerevisiae.
Length = 164
Score = 29.6 bits (67), Expect = 4.8
Identities = 11/37 (29%), Positives = 22/37 (59%)
Query: 712 IAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
I ++ +E KK+++KK++ K +K++ K KK
Sbjct: 128 IEVKPNDDPKEDPFAKKRREKKERVAKNEKRELKNKK 164
>gnl|CDD|223945 COG1013, PorB, Pyruvate:ferredoxin oxidoreductase and related
2-oxoacid:ferredoxin oxidoreductases, beta subunit
[Energy production and conversion].
Length = 294
Score = 30.0 bits (68), Expect = 4.8
Identities = 15/77 (19%), Positives = 29/77 (37%)
Query: 684 PCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKK 743
PC N + + E G + Y + EEEK K ++ + ++
Sbjct: 204 PCPTGNGRNTMKTIEEAKLAVETGYWPLYRYEPGKAEEEKGKDIPIGIFYPVEEYLEYEE 263
Query: 744 KKKKKYSNEEEGMRKLR 760
+ K + E ++KL+
Sbjct: 264 RFKHLTKSNPELIQKLQ 280
>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
Provisional.
Length = 695
Score = 30.7 bits (70), Expect = 4.8
Identities = 11/34 (32%), Positives = 18/34 (52%), Gaps = 1/34 (2%)
Query: 818 KQRFNQ-IAYLDKEKKKKKKKKKKKKKKKKKKKK 850
K RF A L++EK ++ + KK + + K K
Sbjct: 452 KARFEARQARLEREKAAREARHKKAAEARAAKDK 485
>gnl|CDD|203462 pfam06476, DUF1090, Protein of unknown function (DUF1090). This
family consists of several bacterial proteins of unknown
function and is known as YqjC in E. coli.
Length = 115
Score = 28.8 bits (65), Expect = 4.8
Identities = 12/54 (22%), Positives = 27/54 (50%), Gaps = 11/54 (20%)
Query: 717 KEEEEEKKKKKKKK--------KKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDA 762
+ E + K ++K+++ K+ + K K +KK++K E +L++A
Sbjct: 65 RAERQAKVEEKEQEVAEREQELKEAQAKGDADKIEKKQRKL---AEAQAELQEA 115
>gnl|CDD|204122 pfam09072, TMA7, Translation machinery associated TMA7. TMA7 plays
a role in protein translation. Deletions of the TMA7
gene results in altered protein synthesis rates.
Length = 63
Score = 27.5 bits (61), Expect = 5.0
Identities = 12/31 (38%), Positives = 18/31 (58%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKK 747
KE +EE K+K+K++ K K+ K K K
Sbjct: 19 KELDEEDLAFKQKQKEEAKALKELAAKAKGK 49
>gnl|CDD|235620 PRK05841, flgE, flagellar hook protein FlgE; Validated.
Length = 603
Score = 30.6 bits (69), Expect = 5.1
Identities = 42/186 (22%), Positives = 74/186 (39%), Gaps = 43/186 (23%)
Query: 479 THEYNIPRLENSINGNGTSENRSNDNSYQ----NEIDGIDVWSVL----------SRNEP 524
TH++ + L N+I T RSN ++ + +D ++ SV+ S N
Sbjct: 14 THQFGLDSLSNNIANVNTLGYRSNIPEFKTLFSSHLDALNAKSVVANDRNYGVTGSSNAI 73
Query: 525 SKRNTILHNIDDEWQISALTRGKWKLVKEN-----SINGNGTSENRSN----DNSYQNEI 575
S ++ D E+ ++ +G W +V N +IN +G S+ + N D S+ +
Sbjct: 74 SNKDGEYKPSDGEFHMAYQGKG-WFVVGPNKNGSMTINKDGFSKKQDNYFTRDGSFARDA 132
Query: 576 DG------------IDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGKWKLVKVVKVMR 623
DG ID+ + K T+ DE I L +++ + +
Sbjct: 133 DGYLVNPEGYYVYGIDLGKI-------KDGTLNSTARDEADIEKLHGNTLSPLQIPQDLT 185
Query: 624 YQVDLT 629
YQ LT
Sbjct: 186 YQPVLT 191
>gnl|CDD|227594 COG5269, ZUO1, Ribosome-associated chaperone zuotin [Translation,
ribosomal structure and biogenesis / Posttranslational
modification, protein turnover, chaperones].
Length = 379
Score = 30.4 bits (68), Expect = 5.2
Identities = 15/47 (31%), Positives = 18/47 (38%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS 764
E E E KKK K+ KK K +KK N + DA
Sbjct: 273 EIEAEALASATAVKKKAKEVMKKALKMEKKAIKNAAKDADYFGDADK 319
Score = 30.0 bits (67), Expect = 6.3
Identities = 12/32 (37%), Positives = 15/32 (46%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
K E E + KKK K+ KK K +KK
Sbjct: 271 KAEIEAEALASATAVKKKAKEVMKKALKMEKK 302
>gnl|CDD|241400 cd13246, PH_Scd1, Shape and Conjugation Deficiency 1 Pleckstrin
homology (PH) domain. Fission yeast Scd1 is an exchange
factor for Cdc42 and an effector of Ras1, the homolog of
the human H-Ras. Scd2/Bem1 mediates Cdc42 activation by
binding to Scd1/Cdc24 and to Cdc42. Ras1 regulates
Scd1/Cdc24/Ral1, which is a putative guanine nucleotide
exchange factor for Cdc42, a member of the Rho family of
Ras-like proteins. Cdc42 then activates the Shk1/Orb2
protein kinase. Scd1 interacts with Klp5 and Klp6
kinesins to mediate cytokinesis. PH domains have diverse
functions, but in general are involved in targeting
proteins to the appropriate cellular location or in the
interaction with a binding partner. They share little
sequence conservation, but all have a common fold, which
is electrostatically polarized. Less than 10% of PH
domains bind phosphoinositide phosphates (PIPs) with
high affinity and specificity. PH domains are
distinguished from other PIP-binding domains by their
specific high-affinity binding to PIPs with two vicinal
phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
PtdIns(3,4,5)P3 which results in targeting some PH
domain proteins to the plasma membrane. A few display
strong specificity in lipid binding. Any specificity is
usually determined by loop regions or insertions in the
N-terminus of the domain, which are not conserved across
all PH domains. PH domains are found in cellular
signaling proteins such as serine/threonine kinase,
tyrosine kinases, regulators of G-proteins, endocytotic
GTPases, adaptors, as well as cytoskeletal associated
molecules and in lipid associated enzymes.
Length = 148
Score = 29.2 bits (66), Expect = 5.2
Identities = 15/25 (60%), Positives = 15/25 (60%)
Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKK 745
E KKKKKK K K KKKKKK
Sbjct: 62 EVKKKKKKSSLKSKSSSSSKKKKKK 86
Score = 28.4 bits (64), Expect = 8.4
Identities = 15/29 (51%), Positives = 15/29 (51%)
Query: 722 EKKKKKKKKKKKKKKKKKKKKKKKKKKYS 750
E KKKKKK K K KKKKKK
Sbjct: 62 EVKKKKKKSSLKSKSSSSSKKKKKKGPLQ 90
>gnl|CDD|227818 COG5531, COG5531, SWIB-domain-containing proteins implicated in
chromatin remodeling [Chromatin structure and dynamics].
Length = 237
Score = 29.7 bits (67), Expect = 5.3
Identities = 10/43 (23%), Positives = 18/43 (41%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLR 760
+EE+ K+ + + K+ KKKK S K++
Sbjct: 84 KEEDSSLPSSKETENGDTEGKETDKKKKSSTISKNSPSGEKVK 126
Score = 29.3 bits (66), Expect = 8.0
Identities = 11/39 (28%), Positives = 16/39 (41%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEG 755
KEE+ K+ + + K+ KKKK N G
Sbjct: 84 KEEDSSLPSSKETENGDTEGKETDKKKKSSTISKNSPSG 122
>gnl|CDD|216985 pfam02349, MSG, Major surface glycoprotein. This is a novel repeat
in Pneumocystis carinii Major surface glycoprotein (MSG)
some members of the alignment have up to nine repeats of
this family, the repeats containing several conserved
cysteines. The MSG of P. carinii is an important protein
in host-pathogen interactions. Surface glycoprotein A
from Pneumocystis carinii is a main target for the host
immune system, this protein is implicated in the
attachment of Pneumocystis carinii to the host alveolar
epithelial cells, alveolar macrophages, host surfactant
and possibly accounts in part for the hypoxia seen in
Pneumocystis carinii pneumonia (PCP).
Length = 81
Score = 27.8 bits (62), Expect = 5.3
Identities = 19/35 (54%), Positives = 22/35 (62%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNE 752
EE K+ K + KKKKK KK K+K K KK SNE
Sbjct: 47 EELLLKELKGELKKKKKCKKALKEKCTKLKKESNE 81
Score = 27.4 bits (61), Expect = 7.9
Identities = 14/47 (29%), Positives = 25/47 (53%)
Query: 713 AYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKL 759
A P +E+ EK K+K K KK K ++ K+ K + +++ + L
Sbjct: 22 ACPLLKEKCEKLKEKCYKLKKDLKLEELLLKELKGELKKKKKCKKAL 68
>gnl|CDD|220714 pfam10357, Kin17_mid, Domain of Kin17 curved DNA-binding protein.
Kin17_mid is the conserved central 169 residue region of
a family of Kin17 proteins. Towards the N-terminal end
there is a zinc-finger domain, and in human and mouse
members there is a RecA-like domain further downstream.
The Kin17 protein in humans forms intra-nuclear foci
during cell proliferation and is re-distributed in the
nucleoplasm during the cell cycle.
Length = 127
Score = 28.8 bits (65), Expect = 5.4
Identities = 11/41 (26%), Positives = 25/41 (60%), Gaps = 2/41 (4%)
Query: 816 DPKQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
K F I Y+D+ + K++++ +KK+K++K ++ +K
Sbjct: 82 TEKGWF--ITYIDRSPEALKRQEELRKKEKQEKTDEEREQK 120
Score = 28.4 bits (64), Expect = 8.7
Identities = 14/47 (29%), Positives = 30/47 (63%), Gaps = 2/47 (4%)
Query: 702 YTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
TE G F I Y D+ E K++++ +KK+K++K +++++K ++
Sbjct: 80 EETEKGWF--ITYIDRSPEALKRQEELRKKEKQEKTDEEREQKLLEE 124
>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain. This
family represents the C-terminus (approximately 300
residues) of proteins that are involved as binding
partners for Prp19 as part of the nuclear pore complex.
The family in Drosophila is necessary for pre-mRNA
splicing, and the human protein has been found in
purifications of the spliceosome. In the past this
family was thought, erroneously, to be associated with
microfibrillin.
Length = 277
Score = 29.9 bits (67), Expect = 5.5
Identities = 32/154 (20%), Positives = 65/154 (42%), Gaps = 23/154 (14%)
Query: 710 NQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQC-- 767
I ++E +EK +++ K+K +++K++ K +++ E R A+I
Sbjct: 48 ITIQEREREAAKEKALEEEAKRKAEERKRETLKIVEEEVKKELELKKRNTLLEANIDDVD 107
Query: 768 --GPVKEVPCEPQIAPCLFDIKNDPCEKNNLA-DRSEVQRINHYTTEVGYLDPKQRFNQI 824
+E E L IK D E+ + +++E++++ + T E
Sbjct: 108 TDDENEEEEYEAWKLRELKRIKRDREEREEMEREKAEIEKMRNMTEE------------- 154
Query: 825 AYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKKGY 858
E++ + +K K K KK K K ++K Y
Sbjct: 155 -----ERRAELRKNPKVSTNKAKKGKYKFLQKYY 183
>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
Length = 430
Score = 30.3 bits (68), Expect = 5.7
Identities = 12/51 (23%), Positives = 21/51 (41%)
Query: 713 AYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAA 763
++EE +++K K K K K K+K + EE + + A
Sbjct: 153 VTEEEEETDKEKAKAKAAAAAKAKAAALAKQKAAEAGEGTEEVTEEEKAKA 203
Score = 29.6 bits (66), Expect = 9.6
Identities = 13/50 (26%), Positives = 22/50 (44%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS 764
++EEE +K+K K K K K K+K + E + + A +
Sbjct: 154 TEEEEETDKEKAKAKAAAAAKAKAAALAKQKAAEAGEGTEEVTEEEKAKA 203
>gnl|CDD|218845 pfam05991, NYN_YacP, YacP-like NYN domain. This family consists of
bacterial proteins related to YacP. This family is
uncharacterized functionally, but it has been suggested
that these proteins are nucleases due to them containing
a NYN domain. NYN (for N4BP1, YacP-like Nuclease)
domains were discovered by Anantharaman and Aravind.
Based on gene neighborhoods it was suggested that the
bacterial YacP proteins interact with the Ribonuclease
III and TrmH methylase in a processome complex that
catalyzes the maturation of rRNA and tRNA.
Length = 165
Score = 29.1 bits (66), Expect = 5.7
Identities = 16/45 (35%), Positives = 26/45 (57%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRD 761
+E EE K+ +KK +KK +K++K +KK + E + KLR
Sbjct: 121 RELAEEVKRAEKKIRKKAEKRRKSRKKYLDRLSDEVLEKLEKLRR 165
Score = 29.1 bits (66), Expect = 5.7
Identities = 13/35 (37%), Positives = 25/35 (71%)
Query: 833 KKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEK 867
K+ +KK +KK +K++K +KK + + +VL ++EK
Sbjct: 128 KRAEKKIRKKAEKRRKSRKKYLDRLSDEVLEKLEK 162
>gnl|CDD|153341 cd07657, F-BAR_Fes_Fer, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Fes (feline sarcoma) and
Fer (Fes related) tyrosine kinases. F-BAR domains are
dimerization modules that bind and bend membranes and
are found in proteins involved in membrane dynamics and
actin reorganization. Fes (feline sarcoma), also called
Fps (Fujinami poultry sarcoma), and Fer (Fes related)
are cytoplasmic (or nonreceptor) tyrosine kinases that
play roles in haematopoiesis, inflammation and immunity,
growth factor signaling, cytoskeletal regulation, cell
migration and adhesion, and the regulation of cell-cell
interactions. Although Fes and Fer show redundancy in
their biological functions, they show differences in
their expression patterns. Fer is ubiquitously expressed
while Fes is expressed predominantly in myeloid and
endothelial cells. Fes and Fer contain an N-terminal
F-BAR domain, an SH2 domain, and a C-terminal catalytic
kinase domain. F-BAR domains form banana-shaped dimers
with a positively-charged concave surface that binds to
negatively-charged lipid membranes. They can induce
membrane deformation in the form of long tubules. The
F-BAR domain of Fes is critical in its role in
microtubule nucleation and bundling.
Length = 237
Score = 29.7 bits (67), Expect = 5.8
Identities = 13/54 (24%), Positives = 24/54 (44%), Gaps = 1/54 (1%)
Query: 696 DQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKY 749
D++ T EV + + Y E+ + K K ++ K + +K K + KY
Sbjct: 118 DEQYKKLTDEVEKL-KSEYQKLLEDYKAAKSKFEEAVVKGGRGGRKLDKARDKY 170
>gnl|CDD|204792 pfam11947, DUF3464, Protein of unknown function (DUF3464). This
family of proteins are functionally uncharacterized.
This protein is found in bacteria and eukaryotes.
Proteins in this family are typically between 137 to 196
amino acids in length.
Length = 149
Score = 29.2 bits (66), Expect = 5.8
Identities = 11/37 (29%), Positives = 16/37 (43%)
Query: 720 EEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGM 756
E ++ +KK K +K K K K+KK K
Sbjct: 11 EPKRSRKKGVKALRKAAVAKSKDKQKKPKSKRAASES 47
>gnl|CDD|217754 pfam03836, RasGAP_C, RasGAP C-terminus.
Length = 139
Score = 28.8 bits (65), Expect = 5.8
Identities = 18/41 (43%), Positives = 23/41 (56%), Gaps = 6/41 (14%)
Query: 722 EKKKKKKKKKKKKKKKKKKKKKKKKK---KYSN---EEEGM 756
+KKKK K+K KK +KK KK K KYS E+G+
Sbjct: 89 QKKKKFSKQKFSVKKLQKKHGKKPKFGSYKYSAKQLYEKGV 129
>gnl|CDD|184900 PRK14907, rplD, 50S ribosomal protein L4; Provisional.
Length = 295
Score = 29.9 bits (67), Expect = 5.8
Identities = 15/40 (37%), Positives = 17/40 (42%)
Query: 828 DKEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEK 867
K KK K K KK K KK K +K V + EK
Sbjct: 30 AKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKVTVKFEK 69
Score = 29.5 bits (66), Expect = 8.4
Identities = 13/42 (30%), Positives = 17/42 (40%)
Query: 713 AYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
K ++ K K KK K KK K K KK + + E
Sbjct: 27 KETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKVTVKFE 68
Score = 29.5 bits (66), Expect = 8.7
Identities = 15/59 (25%), Positives = 22/59 (37%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEV 773
+E K KK K K KK K KK K + ++ K S++ V +
Sbjct: 23 ATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKVTVKFEKTESVKKESVAKK 81
Score = 29.1 bits (65), Expect = 9.7
Identities = 11/50 (22%), Positives = 19/50 (38%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAAS 764
++ K KK K K KK K +K + K + + K ++
Sbjct: 41 TKAAKKAAKVKKTKSVKTTTKKVTVKFEKTESVKKESVAKKTVKKEAVSA 90
>gnl|CDD|148491 pfam06898, YqfD, Putative stage IV sporulation protein YqfD. This
family consists of several putative bacterial stage IV
sporulation (SpoIV) proteins. YqfD of Bacillus subtilis
is known to be essential for efficient sporulation
although its exact function is unknown.
Length = 383
Score = 30.0 bits (68), Expect = 5.9
Identities = 19/45 (42%), Positives = 24/45 (53%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLR 760
KEE EK KK KKK +K+ K K K +K + E G +LR
Sbjct: 329 TKEEAVEKGKKLAKKKLEKEIDKDAKIKDEKVLHERVENGKVRLR 373
>gnl|CDD|224668 COG1754, COG1754, Uncharacterized C-terminal domain of
topoisomerase IA [General function prediction only].
Length = 298
Score = 30.1 bits (68), Expect = 6.0
Identities = 11/33 (33%), Positives = 14/33 (42%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
++ + KK KK K KK KK KK
Sbjct: 254 ERRAKGGPGKKPAKKATAAKAKKTTAKKAAAKK 286
>gnl|CDD|216581 pfam01577, Peptidase_S30, Potyvirus P1 protease. The potyviridae
family positive stand RNA viruses with genome encoding a
polyprotein. members include zucchini yellow mosaic
virus, and turnip mosaic viruses which cause
considerable losses of crops worldwide. This family
consists of a C terminus region from various plant
potyvirus P1 proteins (found at the N terminus of the
polyprotein). The C terminus of P1 is a serine-type
protease responsible for autocatalytic cleavage between
P1 and the helper component protease pfam00851. The
entire P1 protein may be involved in virus-host
interactions.
Length = 245
Score = 29.6 bits (67), Expect = 6.0
Identities = 18/56 (32%), Positives = 24/56 (42%), Gaps = 2/56 (3%)
Query: 715 PDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPV 770
K E + K+ KK KKKKKKKK N +R++ A + PV
Sbjct: 74 VSKTESVSFRTPYYKRTTKKMKKKKKKKKVVMSDKINYL--IRQVLKIAKKKGKPV 127
>gnl|CDD|178645 PLN03097, FHY3, Protein FAR-RED ELONGATED HYPOCOTYL 3; Provisional.
Length = 846
Score = 30.3 bits (68), Expect = 6.0
Identities = 12/34 (35%), Positives = 18/34 (52%)
Query: 834 KKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQMEK 867
K KKK KK+K ++++ D L QM+K
Sbjct: 711 KTNKKKNPTKKRKVNSEQEVTTVAAQDSLQQMDK 744
>gnl|CDD|130673 TIGR01612, 235kDa-fam, reticulocyte binding/rhoptry protein. This
model represents a group of paralogous families in
plasmodium species alternately annotated as reticulocyte
binding protein, 235-kDa family protein and rhoptry
protein. Rhoptry protein is localized on the cell surface
and is extremely large (although apparently lacking in
repeat structure) and is important for the process of
invasion of the RBCs by the parasite. These proteins are
found in P. falciparum, P. vivax and P. yoelii.
Length = 2757
Score = 30.4 bits (68), Expect = 6.0
Identities = 45/216 (20%), Positives = 89/216 (41%), Gaps = 30/216 (13%)
Query: 381 NKSDIPNYVNS--TVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYE 438
N +D+ N+ + T+EN + + + + +N HE S +I T+Y N H + +
Sbjct: 2469 NANDVKNHFSKDLTIENELIQIQKRLEDIKNAAHEIRSEQI----TKYTNAIHNHIEEQF 2524
Query: 439 NRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYE-----NGTHEYN--IPRLENSI 491
+ EN +++ N +N + N N P + + +E+ IP ++N I
Sbjct: 2525 KKIENNSNKDEVYKINEIDNIIEKI---INYNKEPEVKLHAIIDNKNEFASIIPDIKNLI 2581
Query: 492 NGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQ-ISALTRGKWKL 550
+ +N+ SY+ I + + + + +N + H I + I L K +
Sbjct: 2582 ALIESEYGNNNNISYKVAIKHEEDANNIILDLNKSQNILNHLIHKNKKIIEDLGNIKHGI 2641
Query: 551 VKENSING-------------NGTSENRSNDNSYQN 573
N+++ N TS N ++D Y+N
Sbjct: 2642 HNNNNLHTINKHEEISKIKHPNNTSHNNNDDAKYKN 2677
>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
[Function unknown].
Length = 294
Score = 29.6 bits (67), Expect = 6.1
Identities = 35/155 (22%), Positives = 68/155 (43%), Gaps = 16/155 (10%)
Query: 717 KEEEEEKKKKKKK--KKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVP 774
KEE EE K+K+ + K+ + +K+ + K ++ + + +R+ RD + + +KE
Sbjct: 19 KEEIEELKEKRDELRKEASELAEKRDELNAKVRELREKAQELREERDEINEEVQELKEKR 78
Query: 775 CEP-QIAPCLFDIKNDPCEKNNLAD---------RSEVQRINHYTTEVGYLDP---KQRF 821
E L + EK N + E++R+ + L P ++
Sbjct: 79 DEINAKLQELRKEYRELKEKRNEFNLGGRSIKSLEREIERLE-KKQQTSVLTPEEERELV 137
Query: 822 NQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMKK 856
+I L KE + KK ++ +K K+ K + +KK
Sbjct: 138 QKIKELRKELEDAKKALEENEKLKELKAEIDELKK 172
>gnl|CDD|225208 COG2327, WcaK, Polysaccharide pyruvyl transferase family protein
[Cell wall/membrane/envelope biogenesis].
Length = 385
Score = 30.1 bits (68), Expect = 6.2
Identities = 8/22 (36%), Positives = 13/22 (59%)
Query: 195 SAEEMKMWGLDMRRDLEPAWDL 216
S E +K G++ R +PA+ L
Sbjct: 163 SYELLKQLGINARLVTDPAFLL 184
>gnl|CDD|217787 pfam03910, Adeno_PV, Adenovirus minor core protein PV.
Length = 336
Score = 29.8 bits (67), Expect = 6.2
Identities = 9/30 (30%), Positives = 18/30 (60%)
Query: 727 KKKKKKKKKKKKKKKKKKKKKKYSNEEEGM 756
+ K + K +K K+ KK+KK++ +E +
Sbjct: 22 RPVKDEAKPRKIKRVKKRKKREEKDELDDE 51
>gnl|CDD|191634 pfam06886, TPX2, Targeting protein for Xklp2 (TPX2). This family
represents a conserved region approximately 60 residues
long within the eukaryotic targeting protein for Xklp2
(TPX2). Xklp2 is a kinesin-like protein localised on
centrosomes throughout the cell cycle and on spindle
pole microtubules during metaphase. In Xenopus, it has
been shown that Xklp2 protein is required for centrosome
separation and maintenance of spindle bi-polarity. TPX2
is a microtubule-associated protein that mediates the
binding of the C-terminal domain of Xklp2 to
microtubules. It is phosphorylated during mitosis in a
microtubule-dependent way.
Length = 57
Score = 27.0 bits (60), Expect = 6.5
Identities = 11/40 (27%), Positives = 27/40 (67%)
Query: 721 EEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLR 760
E++ + KK ++K+K + +K++ + ++ EEE +++LR
Sbjct: 8 EKRAEFDKKLEEKEKALEAEKEEAEARQKEEEEEAIKQLR 47
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 29.3 bits (66), Expect = 6.7
Identities = 9/39 (23%), Positives = 23/39 (58%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
+E +EE+K+ + K+ K +K+ ++ +++ E+E
Sbjct: 62 IEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDE 100
>gnl|CDD|153271 cd07307, BAR, The Bin/Amphiphysin/Rvs (BAR) domain, a dimerization
module that binds membranes and detects membrane
curvature. BAR domains are dimerization, lipid binding
and curvature sensing modules found in many different
proteins with diverse functions including organelle
biogenesis, membrane trafficking or remodeling, and cell
division and migration. Mutations in BAR containing
proteins have been linked to diseases and their
inactivation in cells leads to altered membrane
dynamics. A BAR domain with an additional N-terminal
amphipathic helix (an N-BAR) can drive membrane
curvature. These N-BAR domains are found in amphiphysins
and endophilins, among others. BAR domains are also
frequently found alongside domains that determine lipid
specificity, such as the Pleckstrin Homology (PH) and
Phox Homology (PX) domains which are present in beta
centaurins (ACAPs and ASAPs) and sorting nexins,
respectively. A FES-CIP4 Homology (FCH) domain together
with a coiled coil region is called the F-BAR domain and
is present in Pombe/Cdc15 homology (PCH) family
proteins, which include Fes/Fes tyrosine kinases, PACSIN
or syndapin, CIP4-like proteins, and srGAPs, among
others. The Inverse (I)-BAR or IRSp53/MIM homology
Domain (IMD) is found in multi-domain proteins, such as
IRSp53 and MIM, that act as scaffolding proteins and
transducers of a variety of signaling pathways that link
membrane dynamics and the underlying actin cytoskeleton.
BAR domains form dimers that bind to membranes, induce
membrane bending and curvature, and may also be involved
in protein-protein interactions. The I-BAR domain
induces membrane protrusions in the opposite direction
compared to classical BAR and F-BAR domains, which
produce membrane invaginations. BAR domains that also
serve as protein interaction domains include those of
arfaptin and OPHN1-like proteins, among others, which
bind to Rac and Rho GAP domains, respectively.
Length = 194
Score = 29.3 bits (66), Expect = 6.7
Identities = 15/51 (29%), Positives = 30/51 (58%), Gaps = 5/51 (9%)
Query: 716 DKEEEEEKKKKKKKKKK-----KKKKKKKKKKKKKKKKYSNEEEGMRKLRD 761
D +E ++++KK K + ++K KK +KKKK K + EE +++ ++
Sbjct: 92 DLKEIKKRRKKLDKARLDYDAAREKLKKLRKKKKDSSKLAEAEEELQEAKE 142
>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3. This family
consists of several radial spoke protein 3 (RSP3)
sequences. Eukaryotic cilia and flagella present in
diverse types of cells perform motile, sensory, and
developmental functions in organisms from protists to
humans. They are centred by precisely organised,
microtubule-based structures, the axonemes. The axoneme
consists of two central singlet microtubules, called the
central pair, and nine outer doublet microtubules. These
structures are well-conserved during evolution. The
outer doublet microtubules, each composed of A and B
sub-fibres, are connected to each other by nexin links,
while the central pair is held at the centre of the
axoneme by radial spokes. The radial spokes are T-shaped
structures extending from the A-tubule of each outer
doublet microtubule to the centre of the axoneme. Radial
spoke protein 3 (RSP3), is present at the proximal end
of the spoke stalk and helps in anchoring the radial
spoke to the outer doublet. It is thought that radial
spokes regulate the activity of inner arm dynein through
protein phosphorylation and dephosphorylation.
Length = 288
Score = 29.6 bits (67), Expect = 6.8
Identities = 12/44 (27%), Positives = 32/44 (72%)
Query: 705 EVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
E R ++A + EE E++++++K+++KK+ K++K+++K+ +
Sbjct: 165 EQRRNAELAETQRLEEAERRRREEKERRKKQDKERKQREKETAE 208
Score = 29.3 bits (66), Expect = 8.6
Identities = 10/34 (29%), Positives = 27/34 (79%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSN 751
EE E +++++K+++KK+ K++K+++K+ +K +
Sbjct: 179 EEAERRRREEKERRKKQDKERKQREKETAEKIAA 212
>gnl|CDD|215525 PLN02972, PLN02972, Histidyl-tRNA synthetase.
Length = 763
Score = 29.9 bits (67), Expect = 7.0
Identities = 14/26 (53%), Positives = 14/26 (53%)
Query: 712 IAYPDKEEEEEKKKKKKKKKKKKKKK 737
I P E E E K KK KK KKKK
Sbjct: 233 IEKPQAEAENEAGKSDKKDKKSKKKK 258
>gnl|CDD|214444 MTH00165, ND5, NADH dehydrogenase subunit 5; Provisional.
Length = 573
Score = 29.7 bits (68), Expect = 7.2
Identities = 16/36 (44%), Positives = 20/36 (55%), Gaps = 8/36 (22%)
Query: 37 FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGW 72
F L L F LSM+ L+ S P++I IL LGW
Sbjct: 87 FIYLVLLFVLSMML--LIIS---PNLISIL---LGW 114
>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200). This
family is found in eukaryotes. It is a coiled-coil
domain of unknwon function.
Length = 126
Score = 28.3 bits (64), Expect = 7.3
Identities = 12/44 (27%), Positives = 29/44 (65%)
Query: 719 EEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDA 762
+E E K+++ +KK +++KK +K+K+++ K+ E E ++ +
Sbjct: 56 KENEAKRRRAEKKAEEEKKLRKEKEEEIKELKAELEELKAEIEK 99
>gnl|CDD|236413 PRK09210, PRK09210, RNA polymerase sigma factor RpoD; Validated.
Length = 367
Score = 29.6 bits (67), Expect = 7.3
Identities = 12/37 (32%), Positives = 19/37 (51%)
Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKL 759
KK KKK+ ++ K+ +K KK+ +E KL
Sbjct: 1 KKTKKKETTTLEQVVKELIEKGKKRGTLTYDEIAEKL 37
>gnl|CDD|220267 pfam09494, Slx4, Slx4 endonuclease. The Slx4 protein is a
heteromeric structure-specific endonuclease found in
fungi. Slx4 with Slx1 acts as a nuclease on branched DNA
substrates, particularly simple-Y, 5'-flap, or
replication fork structures by cleaving the strand
bearing the 5' non-homologous arm at the branch junction
and thus generating ligatable nicked products from
5'-flap or replication fork substrates.
Length = 627
Score = 29.9 bits (67), Expect = 7.3
Identities = 14/30 (46%), Positives = 18/30 (60%)
Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNE 752
K+ KKKK KK K +K+ K+K KK K
Sbjct: 69 KRVPKKKKIKKPKLRKRTKRKNKKIKSLTA 98
>gnl|CDD|113514 pfam04747, DUF612, Protein of unknown function, DUF612. This
family includes several uncharacterized proteins from
Caenorhabditis elegans.
Length = 517
Score = 30.0 bits (66), Expect = 7.5
Identities = 36/144 (25%), Positives = 63/144 (43%), Gaps = 15/144 (10%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSN-----EEEGMRKLRDAASIQCGPV 770
+K+E + KK + +KKK+K K +K +K +K KK S EE ++K+ + S P
Sbjct: 134 EKKEADLKKLQAEKKKEKAVKAEKAEKAEKTKKASTPAPVEEEIVVKKVANDRSAAPAPE 193
Query: 771 KEVPCEPQIAPC--LFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLD 828
+ P P + +I +KN SE +V Q +
Sbjct: 194 PKTPTNTPAEPAEQVQEITGKKNKKNKKKSESEATAAPASVEQV--------VEQPKVVT 245
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKK 852
+E ++ ++KK KK K+K + +
Sbjct: 246 EEPHQQAAPQEKKNKKNKRKSESE 269
>gnl|CDD|227360 COG5027, SAS2, Histone acetyltransferase (MYST family) [Chromatin
structure and dynamics].
Length = 395
Score = 29.7 bits (67), Expect = 7.6
Identities = 18/56 (32%), Positives = 30/56 (53%), Gaps = 3/56 (5%)
Query: 711 QIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKY--SNEEEGMRKLRDAAS 764
I+ P ++++ EK KK+KK K + + + + Y SNE E +R+LR S
Sbjct: 62 AISIPKRKKQTEKGKKEKKPKVSDRMDLDNENVQLEMLYSISNERE-IRQLRFGGS 116
>gnl|CDD|184389 PRK13912, PRK13912, nuclease NucT; Provisional.
Length = 177
Score = 29.0 bits (65), Expect = 7.6
Identities = 11/38 (28%), Positives = 12/38 (31%)
Query: 818 KQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKMMK 855
+ I YLDK K K K K K K
Sbjct: 86 NNDQSTIGYLDKYPNIKVCLLKGLKAKNGKYYGIMHQK 123
>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 157
Score = 28.8 bits (65), Expect = 7.6
Identities = 9/47 (19%), Positives = 32/47 (68%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAA 763
+++E+E +K+++K +K+ ++ +K K+++ +++ +++ + AA
Sbjct: 52 QKKEKELQKEEQKLQKQAATLSEEARKAKQQELQQKQQELQQKQQAA 98
>gnl|CDD|184543 PRK14157, PRK14157, heat shock protein GrpE; Provisional.
Length = 227
Score = 29.2 bits (65), Expect = 7.6
Identities = 16/30 (53%), Positives = 18/30 (60%), Gaps = 5/30 (16%)
Query: 274 EDFKRSKFAAILHKLD-----ESVGKVVEA 298
EDF +K AILHK D E+V VVEA
Sbjct: 177 EDFDPTKHDAILHKPDPDAEKETVDTVVEA 206
>gnl|CDD|180505 PRK06278, PRK06278, cobyrinic acid a,c-diamide synthase; Validated.
Length = 476
Score = 29.6 bits (67), Expect = 7.9
Identities = 14/40 (35%), Positives = 23/40 (57%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGM 756
KE+E+E+ KK K K+K K + KK+ N+ +G+
Sbjct: 202 KEDEKEEIFKKNKILKEKLKSRSGISTNNKKEERNKPKGI 241
>gnl|CDD|217443 pfam03234, CDC37_N, Cdc37 N terminal kinase binding. Cdc37 is a
molecular chaperone required for the activity of
numerous eukaryotic protein kinases. This domain
corresponds to the N terminal domain which binds
predominantly to protein kinases and is found N terminal
to the Hsp (Heat shocked protein) 90-binding domain
pfam08565. Expression of a construct consisting of only
the N-terminal domain of Saccharomyces pombe Cdc37
results in cellular viability. This indicates that
interactions with the cochaperone Hsp90 may not be
essential for Cdc37 function.
Length = 172
Score = 29.0 bits (65), Expect = 8.0
Identities = 30/140 (21%), Positives = 60/140 (42%), Gaps = 27/140 (19%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCE 776
K+E EE +K++ + ++ K + K++ + + M+ L + + + E P
Sbjct: 45 KQEIEELEKERIMNDRLLERVDKLLSELKEESLDSSQAVMKSLNENFTDKENVEPEQPTY 104
Query: 777 PQIAPCLFD-IKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKK-K 834
++ LFD +K++ EKN A E+Q K + K K
Sbjct: 105 NEMVEDLFDQVKDEVDEKNGAALIEELQ-------------------------KHRDKLK 139
Query: 835 KKKKKKKKKKKKKKKKKKMM 854
K++K+ KK + +K++K
Sbjct: 140 KEQKELLKKLDELEKEEKKK 159
>gnl|CDD|177433 PHA02608, 67, prohead core protein; Provisional.
Length = 80
Score = 27.4 bits (61), Expect = 8.1
Identities = 5/31 (16%), Positives = 8/31 (25%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
EE E+ + K K +
Sbjct: 48 EEPEDDDDDEDDDDDDDKDDKDDDDDDDDED 78
Score = 27.4 bits (61), Expect = 9.2
Identities = 6/35 (17%), Positives = 9/35 (25%)
Query: 718 EEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNE 752
E EE + + K K +E
Sbjct: 46 EGEEPEDDDDDEDDDDDDDKDDKDDDDDDDDEDDE 80
>gnl|CDD|223957 COG1026, COG1026, Predicted Zn-dependent peptidases,
insulinase-like [General function prediction only].
Length = 978
Score = 30.0 bits (68), Expect = 8.2
Identities = 22/121 (18%), Positives = 44/121 (36%), Gaps = 20/121 (16%)
Query: 700 NHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKK-KKKYSNEEEGMRK 758
HY T I P E EE+ +K++++ +K+ + + +K K +E +
Sbjct: 448 PHYVT------VIVLPSPELEEKLEKEERELLQKRSSELTDEDLEKIIKDSKKLKERQDQ 501
Query: 759 LRDAASIQCGP---VKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYL 815
+ P + +VP I+ E ++V + +T + YL
Sbjct: 502 PDSEEDLATLPTLKLGDVPD---------PIEKTSLETEVSN-EAKVLHHDLFTNGITYL 551
Query: 816 D 816
Sbjct: 552 R 552
>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 438
Score = 29.8 bits (67), Expect = 8.4
Identities = 14/59 (23%), Positives = 27/59 (45%), Gaps = 4/59 (6%)
Query: 696 DQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
D+ E N DKE E +K+ + K+ +K+ + + +KK+ +N+ E
Sbjct: 97 DEIAKLEALE--LLNL--EKDKELELLEKELDELSKELQKQLQNTAEIIEKKRENNKNE 151
>gnl|CDD|221049 pfam11262, Tho2, Transcription factor/nuclear export subunit
protein 2. THO and TREX form a eukaryotic complex which
functions in messenger ribonucleoprotein metabolism and
plays a role in preventing the transcription-associated
genetic instability. Tho2, along with four other
subunits forms THO.
Length = 296
Score = 29.2 bits (66), Expect = 8.5
Identities = 11/37 (29%), Positives = 20/37 (54%)
Query: 712 IAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
I P + ++E ++ +K+ K+ KKKK+KK
Sbjct: 22 IYVPKELYDKEIERLEKQIKELDSSSSGIDKKKKEKK 58
>gnl|CDD|234402 TIGR03928, T7_EssCb_Firm, type VII secretion protein EssC,
C-terminal domain. This model describes the C-terminal
domain, or longer subunit, of the Firmicutes type VII
secretion protein EssC. This protein (homologous to EccC
in Actinobacteria) and the WXG100 target proteins are
the only homologous parts of type VII secretion between
Firmicutes and Actinobacteria [Protein fate, Protein and
peptide secretion and trafficking].
Length = 1296
Score = 30.0 bits (68), Expect = 8.6
Identities = 11/36 (30%), Positives = 18/36 (50%)
Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRK 758
+KKK KK +K+ + + KK+K+ E R
Sbjct: 85 EKKKYKKDVEKRNRSYRLYLDKKRKELQALSEKQRH 120
Score = 29.6 bits (67), Expect = 9.6
Identities = 11/33 (33%), Positives = 19/33 (57%)
Query: 722 EKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEE 754
EKKK KK +K+ + + KK+K+ + E+
Sbjct: 85 EKKKYKKDVEKRNRSYRLYLDKKRKELQALSEK 117
>gnl|CDD|225651 COG3109, ProQ, Activator of osmoprotectant transporter ProP [Signal
transduction mechanisms].
Length = 208
Score = 29.1 bits (65), Expect = 8.7
Identities = 7/64 (10%), Positives = 23/64 (35%), Gaps = 6/64 (9%)
Query: 711 QIAYPDKEEEEEKKKKKKKKKKKKK----KKKKKKKKKKKKKYSNEEEGMRKLR--DAAS 764
Q+A + ++ +++ KK+++ K ++ + D ++
Sbjct: 102 QLAEAKARVQAQRAEQQAKKREEAPAAGEKPTAERPATAARPKPKAPREEVHTPVSDISA 161
Query: 765 IQCG 768
+ G
Sbjct: 162 LTVG 165
>gnl|CDD|115185 pfam06513, DUF1103, Repeat of unknown function (DUF1103). This
family consists of several repeats of around 30 residues
in length which are found specifically in
mature-parasite-infected erythrocyte surface antigen
proteins from Plasmodium falciparum. This family often
found in conjunction with pfam00226.
Length = 215
Score = 29.1 bits (64), Expect = 8.7
Identities = 19/45 (42%), Positives = 28/45 (62%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLR 760
DK EK +K+ K+K +K+ K+K KKK KK N+ EG K++
Sbjct: 20 DKVIGSEKTQKEIKEKVEKRVKEKCKKKVKKGIKENDTEGNDKVK 64
>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
component YidC; Validated.
Length = 429
Score = 29.4 bits (66), Expect = 8.8
Identities = 10/50 (20%), Positives = 23/50 (46%)
Query: 711 QIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLR 760
+ A K EK + K +KK+ +K++ +++ ++ M + R
Sbjct: 328 ENAEIKKTRTAEKNEAKARKKEIAQKRRAAEREINREARQERAAAMARAR 377
>gnl|CDD|217372 pfam03105, SPX, SPX domain. We have named this region the SPX
domain after (SYG1, Pho81 and XPR1). This 180 residue
length domain is found at the amino terminus of a
variety of proteins. In the yeast protein SYG1, the
N-terminus directly binds to the G- protein beta subunit
and inhibits transduction of the mating pheromone
signal. This finding suggests that all the members of
this family are involved in G-protein associated signal
transduction. The N-termini of several proteins involved
in the regulation of phosphate transport, including the
putative phosphate level sensors PHO81 from
Saccharomyces cerevisiae and NUC-2 from Neurospora
crassa, are also members of this family. The SPX domain
of S. cerevisiae low-affinity phosphate transporters
Pho87 and Pho90 auto-regulates uptake and prevents
efflux. This SPX dependent inhibition is mediated by the
physical interaction with Spl2 NUC-2 contains several
ankyrin repeats pfam00023. Several members of this
family are annotated as XPR1 proteins: the xenotropic
and polytropic retrovirus receptor confers
susceptibility to infection with murine leukaemia
viruses (MLV). The similarity between SYG1, phosphate
regulators and XPR1 sequences has been previously noted,
as has the additional similarity to several predicted
proteins, of unknown function, from Drosophila
melanogaster, Arabidopsis thaliana, Caenorhabditis
elegans, Schizosaccharomyces pombe, and Saccharomyces
cerevisiae. In addition, given the similarities between
XPR1 and SYG1 and phosphate regulatory proteins, it has
been proposed that XPR1 might be involved in G-protein
associated signal transduction and may itself function
as a phosphate sensor.
Length = 176
Score = 28.9 bits (65), Expect = 9.0
Identities = 14/60 (23%), Positives = 27/60 (45%), Gaps = 2/60 (3%)
Query: 829 KEKKKKKKKKKKKKKKKKKKKKKKMMKKGYPDVLSQME--KELANINRTAVAPINKPFDK 886
+E++ + K K K + +KK ++ +++ K +NRT I K +DK
Sbjct: 102 EERRDETSSAKSSPSDKNLLKAFEELKKALLELYRELQLLKSYIELNRTGFRKILKKYDK 161
>gnl|CDD|214818 smart00784, SPT2, SPT2 chromatin protein. This entry includes the
Saccharomyces cerevisiae protein SPT2 which is a
chromatin protein involved in transcriptional
regulation.
Length = 106
Score = 27.7 bits (62), Expect = 9.2
Identities = 8/21 (38%), Positives = 18/21 (85%)
Query: 716 DKEEEEEKKKKKKKKKKKKKK 736
D+EEE +K+++++K+ +K+K
Sbjct: 86 DREEERLEKEEEREKRARKRK 106
>gnl|CDD|225324 COG2604, COG2604, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 594
Score = 29.6 bits (67), Expect = 9.3
Identities = 21/135 (15%), Positives = 33/135 (24%), Gaps = 24/135 (17%)
Query: 723 KKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVP-------- 774
+K K K K K ++K + ++K D ++
Sbjct: 449 EKLLDKDKNKPFIKLILLSNNSQEKAVLKLLQKIKKNNDFIKEFETDALKLQEILEKVDS 508
Query: 775 ----CEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKE 830
E I L D S+ E+ IA + K
Sbjct: 509 KSEKLE--------KISAKIDNIKELFDESK----MSILQEILQPALHHIELNIARIYKL 556
Query: 831 KKKKKKKKKKKKKKK 845
K K+ K K
Sbjct: 557 NIKDKEDFLNKLYIK 571
>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3. This family
of proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). Shr3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of Shr3, AAPs are retained in the ER.
Length = 196
Score = 28.9 bits (65), Expect = 9.3
Identities = 10/28 (35%), Positives = 13/28 (46%)
Query: 716 DKEEEEEKKKKKKKKKKKKKKKKKKKKK 743
EE +E KK +K + KKKK
Sbjct: 169 AAEERKEALAAAAKKSATPQKVETKKKK 196
>gnl|CDD|239466 cd03368, Ribosomal_S12, S12-like family, 30S ribosomal protein S12
subfamily; S12 is located at the interface of the large
and small ribosomal subunits of prokaryotes,
chloroplasts and mitochondria, where it plays an
important role in both tRNA and ribosomal subunit
interactions. S12 is essential for maintenance of a
pretranslocation state and, together with S13, functions
as a control element for the rRNA- and tRNA-driven
movements of translocation. Antibiotics such as
streptomycin bind S12 and cause the ribosome to misread
the genetic code.
Length = 108
Score = 27.9 bits (63), Expect = 9.4
Identities = 11/26 (42%), Positives = 14/26 (53%)
Query: 833 KKKKKKKKKKKKKKKKKKKKMMKKGY 858
+K +KKKKKK K + KKG
Sbjct: 6 IRKGRKKKKKKSKSPALEGCPQKKGV 31
>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase. This model represents the
seryl-tRNA synthetase found in most organisms. This
protein is a class II tRNA synthetase, and is recognized
by the pfam model tRNA-synt_2b. The seryl-tRNA
synthetases of two archaeal species, Methanococcus
jannaschii and Methanobacterium thermoautotrophicum,
differ considerably and are included in a different
model [Protein synthesis, tRNA aminoacylation].
Length = 418
Score = 29.3 bits (66), Expect = 9.6
Identities = 18/87 (20%), Positives = 42/87 (48%), Gaps = 10/87 (11%)
Query: 818 KQRFNQIAYLDKEKKKKKKKKKKKKKKKKKKKKKKM--MKKGYPDVLSQMEKELANI-NR 874
+ + N+++ + K +KK K ++ KK+ K+ K+++ + + ++++ +L +I N
Sbjct: 50 QAKRNELSKQIGKAKGQKKDKIEEIKKELKELKEELTELSAALKALEAELQDKLLSIPNI 109
Query: 875 TA-VAPI------NKPFDKGGDPKNFD 894
P+ N + G P FD
Sbjct: 110 PHESVPVGKDEEDNLEVKRWGTPPVFD 136
>gnl|CDD|206363 pfam14195, DUF4316, Domain of unknown function (DUF4316). This
domain is functionally uncharacterized. This domain is
found in bacteria, and is typically between 56 and 95
amino acids in length.
Length = 65
Score = 26.7 bits (59), Expect = 9.9
Identities = 13/42 (30%), Positives = 18/42 (42%)
Query: 707 GRFNQIAYPDKEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKK 748
G N EEK K K+ KK ++ +K K+KK
Sbjct: 21 GIINNTPEAPTVPPEEKPSILDKLKEDKKARRSRKPVKQKKH 62
>gnl|CDD|178669 PLN03122, PLN03122, Poly [ADP-ribose] polymerase; Provisional.
Length = 815
Score = 29.4 bits (66), Expect = 10.0
Identities = 15/50 (30%), Positives = 23/50 (46%)
Query: 717 KEEEEEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQ 766
KE E E+ KK KK+KK+ K + EE + + + SI+
Sbjct: 29 KEHEGEQSPKKAKKEKKQDDSGNGNGKSAEDAVKEFEEFCKAIEEHLSIE 78
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.314 0.133 0.403
Gapped
Lambda K H
0.267 0.0685 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 47,158,537
Number of extensions: 4789846
Number of successful extensions: 34216
Number of sequences better than 10.0: 1
Number of HSP's gapped: 23710
Number of HSP's successfully gapped: 1950
Length of query: 905
Length of database: 10,937,602
Length adjustment: 106
Effective length of query: 799
Effective length of database: 6,236,078
Effective search space: 4982626322
Effective search space used: 4982626322
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 64 (28.5 bits)