RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy17902
(454 letters)
>gnl|CDD|238121 cd00200, WD40, WD40 domain, found in a number of eukaryotic
proteins that cover a wide variety of functions
including adaptor/regulatory modules in signal
transduction, pre-mRNA processing and cytoskeleton
assembly; typically contains a GH dipeptide 11-24
residues from its N-terminus and the WD dipeptide at its
C-terminus and is 40 residues long, hence the name WD40;
between GH and WD lies a conserved core; serves as a
stable propeller-like platform to which proteins can
bind either stably or reversibly; forms a propeller-like
structure with several blades where each blade is
composed of a four-stranded anti-parallel b-sheet;
instances with few detectable copies are hypothesized to
form larger structures by dimerization; each WD40
sequence repeat forms the first three strands of one
blade and the last strand in the next blade; the last
C-terminal WD40 repeat completes the blade structure of
the first WD40 repeat to create the closed ring
propeller-structure; residues on the top and bottom
surface of the propeller are proposed to coordinate
interactions with other proteins and/or small ligands; 7
copies of the repeat are present in this alignment.
Length = 289
Score = 152 bits (385), Expect = 1e-42
Identities = 77/299 (25%), Positives = 136/299 (45%), Gaps = 36/299 (12%)
Query: 139 KPHRASITCIAVTSDSKHVYSACKDGIIIKWLLETRSKVCS----SPFVKTDRKSYPPIL 194
K H +TC+A + D K + + DG I W LET + + + V+
Sbjct: 6 KGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRD--------- 56
Query: 195 ALAVANNNELVASGDNNSNIKLWSSSLEH-LHTFTGHKAPITGLAISFECDTLYSSSNDR 253
A A+ L + + + I+LW + T TGH + ++ +A S + L SSS D+
Sbjct: 57 VAASADGTYLASGSSDKT-IRLWDLETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDK 115
Query: 254 SIKVWNIEDRAYVETLFGHQSAITSLDILAKNRVISSGGTDKSVRVWKIQEES-QLVFNG 312
+IKVW++E + TL GH + S+ ++S D ++++W ++ G
Sbjct: 116 TIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTG 175
Query: 313 HSRGIDMTRRV----NDQQFITCGEDGTLSLWSILKKVPTHTIPVAHGYDSSSSEPNWIT 368
H+ + V + ++ ++ DGT+ LW + T+ H N +
Sbjct: 176 HTGEV---NSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTL-RGH--------ENGVN 223
Query: 369 SVACLHNSDLVASGSYDGCVKLWKITPKRTLEFLQDIPL-SGVINSLVFTSDHKHLVAG 426
SVA + L+ASGS DG +++W + RT E +Q + + + SL ++ D K L +G
Sbjct: 224 SVAFSPDGYLLASGSEDGTIRVWDL---RTGECVQTLSGHTNSVTSLAWSPDGKRLASG 279
Score = 132 bits (335), Expect = 1e-35
Identities = 62/257 (24%), Positives = 113/257 (43%), Gaps = 18/257 (7%)
Query: 139 KPHRASITCIAVTSDSKHVYSACKDGIIIKWLLETRSKVCSSPFVKTDRKSYPPILALAV 198
K H + +A ++D ++ S D I W LET V + + + ++A
Sbjct: 48 KGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGECVRT---LTGHTS---YVSSVAF 101
Query: 199 ANNNELVASGDNNSNIKLWS-SSLEHLHTFTGHKAPITGLAISFECDTLYSSSNDRSIKV 257
+ + +++S + IK+W + + L T GH + +A S + + SSS D +IK+
Sbjct: 102 SPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKL 161
Query: 258 WNIEDRAYVETLFGHQSAITSLDILAKNRVISSGGTDKSVRVWKIQEESQL-VFNGHSRG 316
W++ V TL GH + S+ + S +D ++++W + L GH G
Sbjct: 162 WDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENG 221
Query: 317 IDMTRRVNDQQFIT-CGEDGTLSLWSILKKVPTHTIPVAHGYDSSSSEPNWITSVACLHN 375
++ D + EDGT+ +W + T+ H N +TS+A +
Sbjct: 222 VNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLS-GH--------TNSVTSLAWSPD 272
Query: 376 SDLVASGSYDGCVKLWK 392
+ASGS DG +++W
Sbjct: 273 GKRLASGSADGTIRIWD 289
Score = 115 bits (289), Expect = 5e-29
Identities = 57/235 (24%), Positives = 105/235 (44%), Gaps = 24/235 (10%)
Query: 223 HLHTFTGHKAPITGLAISFECDTLYSSSNDRSIKVWNIEDRAYVETLFGHQSAITSLDIL 282
T GH +T +A S + L + S D +IKVW++E + TL GH + +
Sbjct: 1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAAS 60
Query: 283 AKNRVISSGGTDKSVRVWKIQEESQL-VFNGHS---RGIDMTRRVNDQQFITCGEDGTLS 338
A ++SG +DK++R+W ++ + GH+ + + + + + D T+
Sbjct: 61 ADGTYLASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSP--DGRILSSSSRDKTIK 118
Query: 339 LWSILKKVPTHTIPVAHGYDSSSSEPNWITSVACLHNSDLVASGSYDGCVKLWKITPKRT 398
+W + T+ H +W+ SVA + VAS S DG +KLW + +
Sbjct: 119 VWDVETGKCLTTLR-GH--------TDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKC 169
Query: 399 LEFLQDIPLSGVINSLVFTSDHKHLVAGIGT------EHRFGRW-KTIKGVKNTV 446
+ L +G +NS+ F+ D + L++ + G+ T++G +N V
Sbjct: 170 VATLTG--HTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGV 222
>gnl|CDD|225201 COG2319, COG2319, FOG: WD40 repeat [General function prediction
only].
Length = 466
Score = 110 bits (276), Expect = 2e-26
Identities = 89/360 (24%), Positives = 151/360 (41%), Gaps = 18/360 (5%)
Query: 91 EKEKLNDSKDKHVLKKLETVVLEQEGKLRVNIADRLANATVVCTKFNG---KPHRASITC 147
K KL + +++ + + + L+++ V + + H SIT
Sbjct: 11 NKSKLLKKSELGPSLNSLSLLSLGSSESGILLLALLSDSLVSLPDLSSLLLRGHEDSITS 70
Query: 148 IAVTSDSKHVYSACKDGIIIKWLLETRSKVCSSPFVKTDRKSYPPILALAVANNNELVAS 207
IA + D + + S DG I W L+ K+ S S + + N+ L+AS
Sbjct: 71 IAFSPDGELLLSGSSDGTIKLWDLDNGEKLIKS-LEGLHDSSVSKLALSSPDGNSILLAS 129
Query: 208 GDNNSNIKLW--SSSLEHLHTFTGHKAPITGLAISFECDTL-YSSSNDRSIKVWNIEDRA 264
+ +KLW S+ + + T GH +T LA S + L SS D +IK+W++
Sbjct: 130 SSLDGTVKLWDLSTPGKLIRTLEGHSESVTSLAFSPDGKLLASGSSLDGTIKLWDLRTGK 189
Query: 265 YVETLFGHQSAITSLDILAKNR-VISSGGTDKSVRVWKIQ--EESQLVFNGHSRGIDMTR 321
+ TL GH ++SL +I+SG +D ++R+W + + + +GHS + +
Sbjct: 190 PLSTLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTGKLLRSTLSGHSDSVVSSF 249
Query: 322 RVNDQQFITCGEDGTLSLWSILKKVPTHTIPVAHGYDSSSSEPNWITSVACLHNSDLVAS 381
+ + DGT+ LW + H + SVA + L+AS
Sbjct: 250 SPDGSLLASGSSDGTIRLWDLRSSSSLLRTLSGHSS--------SVLSVAFSPDGKLLAS 301
Query: 382 GSYDGCVKLWKITPKRTLEFLQDIPLSGVINSLVFTSDHKHLVAGIGTEHRFGRWKTIKG 441
GS DG V+LW + + L L G ++SL F+ D LV+G + W G
Sbjct: 302 GSSDGTVRLWDLETGKLLSSLTLKGHEGPVSSLSFSPDGSLLVSGGSDDGTIRLWDLRTG 361
Score = 84.4 bits (207), Expect = 2e-17
Identities = 60/262 (22%), Positives = 116/262 (44%), Gaps = 22/262 (8%)
Query: 139 KPHRASITCIAVTSDSK-HVYSACKDGIIIKWLLETRSKVCSSPFVKTDRKSYPPILALA 197
H ++ +A + D + S DG I W L T +++ + + +
Sbjct: 195 AGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLST------GKLLRSTLSGHSDSVVSS 248
Query: 198 VANNNELVASGDNNSNIKLWS--SSLEHLHTFTGHKAPITGLAISFECDTLYSSSNDRSI 255
+ + L+ASG ++ I+LW SS L T +GH + + +A S + L S S+D ++
Sbjct: 249 FSPDGSLLASGSSDGTIRLWDLRSSSSLLRTLSGHSSSVLSVAFSPDGKLLASGSSDGTV 308
Query: 256 KVWNIE--DRAYVETLFGHQSAITSLDILAKN-RVISSGGTDKSVRVWKIQEESQLV-FN 311
++W++E TL GH+ ++SL ++S G D ++R+W ++ L
Sbjct: 309 RLWDLETGKLLSSLTLKGHEGPVSSLSFSPDGSLLVSGGSDDGTIRLWDLRTGKPLKTLE 368
Query: 312 GHSRGIDMTRRVNDQQFITCGEDGTLSLWSILKKVPTHTIPVAHGYDSSSSEPNWITSVA 371
GHS + ++ + + + DGT+ LW + + + +TS+
Sbjct: 369 GHSNVLSVSFSPDGRVVSSGSTDGTVRLWDLSTGSLLR---------NLDGHTSRVTSLD 419
Query: 372 CLHNSDLVASGSYDGCVKLWKI 393
+ +ASGS D ++LW +
Sbjct: 420 FSPDGKSLASGSSDNTIRLWDL 441
Score = 73.2 bits (178), Expect = 6e-14
Identities = 52/209 (24%), Positives = 97/209 (46%), Gaps = 12/209 (5%)
Query: 146 TCIAVTSDSKHVYSACKDGIIIKWLLETRSKVCSSPFVKTDRKSYPPILALAVANNNELV 205
+ + D + S DG I W L + S + ++T +L++A + + +L+
Sbjct: 245 VVSSFSPDGSLLASGSSDGTIRLWDLRSSSSL-----LRTLSGHSSSVLSVAFSPDGKLL 299
Query: 206 ASGDNNSNIKLWSSS---LEHLHTFTGHKAPITGLAISFECDTLYSS-SNDRSIKVWNIE 261
ASG ++ ++LW L T GH+ P++ L+ S + L S S+D +I++W++
Sbjct: 300 ASGSSDGTVRLWDLETGKLLSSLTLKGHEGPVSSLSFSPDGSLLVSGGSDDGTIRLWDLR 359
Query: 262 DRAYVETLFGHQSAITSLDILAKNRVISSGGTDKSVRVWKIQEESQLVFNGHSRG--IDM 319
++TL GH S + S+ RV+SSG TD +VR+W + S L +
Sbjct: 360 TGKPLKTLEGH-SNVLSVSFSPDGRVVSSGSTDGTVRLWDLSTGSLLRNLDGHTSRVTSL 418
Query: 320 TRRVNDQQFITCGEDGTLSLWSILKKVPT 348
+ + + D T+ LW + + +
Sbjct: 419 DFSPDGKSLASGSSDNTIRLWDLKTSLKS 447
Score = 67.8 bits (164), Expect = 3e-12
Identities = 40/164 (24%), Positives = 82/164 (50%), Gaps = 5/164 (3%)
Query: 141 HRASITCIAVTSDSKHVYSACKDGIIIKWLLETRSKVCSSPFVKTDRKSYPPILALAVA- 199
H +S+ +A + D K + S DG + W LET + S T + P+ +L+ +
Sbjct: 283 HSSSVLSVAFSPDGKLLASGSSDGTVRLWDLETGKLLSS----LTLKGHEGPVSSLSFSP 338
Query: 200 NNNELVASGDNNSNIKLWSSSLEHLHTFTGHKAPITGLAISFECDTLYSSSNDRSIKVWN 259
+ + LV+ G ++ I+LW + + ++ S + + S S D ++++W+
Sbjct: 339 DGSLLVSGGSDDGTIRLWDLRTGKPLKTLEGHSNVLSVSFSPDGRVVSSGSTDGTVRLWD 398
Query: 260 IEDRAYVETLFGHQSAITSLDILAKNRVISSGGTDKSVRVWKIQ 303
+ + + L GH S +TSLD + ++SG +D ++R+W ++
Sbjct: 399 LSTGSLLRNLDGHTSRVTSLDFSPDGKSLASGSSDNTIRLWDLK 442
Score = 48.5 bits (114), Expect = 5e-06
Identities = 28/124 (22%), Positives = 60/124 (48%), Gaps = 9/124 (7%)
Query: 141 HRASITCIAVTSDSKHVYSAC-KDGIIIKWLLETRSKVCSSPFVKTDRKSYPPILALAVA 199
H ++ ++ + D + S DG I W L T + + + +L+++ +
Sbjct: 327 HEGPVSSLSFSPDGSLLVSGGSDDGTIRLWDLRTGKPL-------KTLEGHSNVLSVSFS 379
Query: 200 NNNELVASGDNNSNIKLWS-SSLEHLHTFTGHKAPITGLAISFECDTLYSSSNDRSIKVW 258
+ +V+SG + ++LW S+ L GH + +T L S + +L S S+D +I++W
Sbjct: 380 PDGRVVSSGSTDGTVRLWDLSTGSLLRNLDGHTSRVTSLDFSPDGKSLASGSSDNTIRLW 439
Query: 259 NIED 262
+++
Sbjct: 440 DLKT 443
>gnl|CDD|201208 pfam00400, WD40, WD domain, G-beta repeat.
Length = 39
Score = 45.0 bits (107), Expect = 8e-07
Identities = 14/38 (36%), Positives = 24/38 (63%)
Query: 222 EHLHTFTGHKAPITGLAISFECDTLYSSSNDRSIKVWN 259
+ L T GH P+T +A S + + L S S+D +++VW+
Sbjct: 2 KLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39
Score = 39.3 bits (92), Expect = 8e-05
Identities = 13/36 (36%), Positives = 20/36 (55%)
Query: 266 VETLFGHQSAITSLDILAKNRVISSGGTDKSVRVWK 301
+ TL GH +TS+ +++SG D +VRVW
Sbjct: 4 LRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39
Score = 37.7 bits (88), Expect = 3e-04
Identities = 13/28 (46%), Positives = 19/28 (67%)
Query: 365 NWITSVACLHNSDLVASGSYDGCVKLWK 392
+TSVA + +L+ASGS DG V++W
Sbjct: 12 GPVTSVAFSPDGNLLASGSDDGTVRVWD 39
Score = 33.5 bits (77), Expect = 0.012
Identities = 8/29 (27%), Positives = 13/29 (44%)
Query: 141 HRASITCIAVTSDSKHVYSACKDGIIIKW 169
H +T +A + D + S DG + W
Sbjct: 10 HTGPVTSVAFSPDGNLLASGSDDGTVRVW 38
Score = 28.1 bits (63), Expect = 0.95
Identities = 6/39 (15%), Positives = 15/39 (38%), Gaps = 1/39 (2%)
Query: 304 EESQLVFNGHSRGI-DMTRRVNDQQFITCGEDGTLSLWS 341
+ GH+ + + + + +DGT+ +W
Sbjct: 1 GKLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39
Score = 28.1 bits (63), Expect = 1.00
Identities = 7/27 (25%), Positives = 18/27 (66%)
Query: 192 PILALAVANNNELVASGDNNSNIKLWS 218
P+ ++A + + L+ASG ++ +++W
Sbjct: 13 PVTSVAFSPDGNLLASGSDDGTVRVWD 39
>gnl|CDD|197651 smart00320, WD40, WD40 repeats. Note that these repeats are
permuted with respect to the structural repeats (blades)
of the beta propeller domain.
Length = 40
Score = 43.8 bits (104), Expect = 2e-06
Identities = 17/40 (42%), Positives = 24/40 (60%)
Query: 220 SLEHLHTFTGHKAPITGLAISFECDTLYSSSNDRSIKVWN 259
S E L T GH P+T +A S + L S S+D +IK+W+
Sbjct: 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
Score = 38.1 bits (89), Expect = 2e-04
Identities = 10/36 (27%), Positives = 21/36 (58%)
Query: 266 VETLFGHQSAITSLDILAKNRVISSGGTDKSVRVWK 301
++TL GH +TS+ + ++SG D ++++W
Sbjct: 5 LKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
Score = 38.1 bits (89), Expect = 3e-04
Identities = 13/28 (46%), Positives = 17/28 (60%)
Query: 365 NWITSVACLHNSDLVASGSYDGCVKLWK 392
+TSVA + +ASGS DG +KLW
Sbjct: 13 GPVTSVAFSPDGKYLASGSDDGTIKLWD 40
Score = 34.6 bits (80), Expect = 0.004
Identities = 11/31 (35%), Positives = 16/31 (51%)
Query: 139 KPHRASITCIAVTSDSKHVYSACKDGIIIKW 169
K H +T +A + D K++ S DG I W
Sbjct: 9 KGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39
Score = 28.4 bits (64), Expect = 0.59
Identities = 9/27 (33%), Positives = 18/27 (66%)
Query: 192 PILALAVANNNELVASGDNNSNIKLWS 218
P+ ++A + + + +ASG ++ IKLW
Sbjct: 14 PVTSVAFSPDGKYLASGSDDGTIKLWD 40
Score = 28.0 bits (63), Expect = 1.1
Identities = 7/34 (20%), Positives = 15/34 (44%), Gaps = 1/34 (2%)
Query: 309 VFNGHSRGI-DMTRRVNDQQFITCGEDGTLSLWS 341
GH+ + + + + + +DGT+ LW
Sbjct: 7 TLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
>gnl|CDD|240412 PTZ00420, PTZ00420, coronin; Provisional.
Length = 568
Score = 38.0 bits (88), Expect = 0.010
Identities = 29/91 (31%), Positives = 41/91 (45%), Gaps = 14/91 (15%)
Query: 229 GHKAPITGLAISFE---CDTLYSSSNDRSIKVWNI--EDRAYVET------LFGHQSAIT 277
GH + I L + F + L S S D +I+VW I D + E L GH+ I+
Sbjct: 72 GHTSSI--LDLQFNPCFSEILASGSEDLTIRVWEIPHNDESVKEIKDPQCILKGHKKKIS 129
Query: 278 SLDILAKNRVI-SSGGTDKSVRVWKIQEESQ 307
+D N I S G D V +W I+ E +
Sbjct: 130 IIDWNPMNYYIMCSSGFDSFVNIWDIENEKR 160
Score = 33.4 bits (76), Expect = 0.25
Identities = 24/89 (26%), Positives = 39/89 (43%), Gaps = 12/89 (13%)
Query: 266 VETLFGHQSAITSLDI-LAKNRVISSGGTDKSVRVWKIQEES---------QLVFNGHSR 315
V L GH S+I L + +++SG D ++RVW+I Q + GH +
Sbjct: 67 VIKLKGHTSSILDLQFNPCFSEILASGSEDLTIRVWEIPHNDESVKEIKDPQCILKGHKK 126
Query: 316 GIDMTRRVNDQQFITC--GEDGTLSLWSI 342
I + +I C G D +++W I
Sbjct: 127 KISIIDWNPMNYYIMCSSGFDSFVNIWDI 155
>gnl|CDD|177776 PLN00181, PLN00181, protein SPA1-RELATED; Provisional.
Length = 793
Score = 35.4 bits (81), Expect = 0.067
Identities = 40/183 (21%), Positives = 74/183 (40%), Gaps = 22/183 (12%)
Query: 205 VASGDNNSNIKLWSSSLEHLHT-FTGHKAPITGLAISFECDTLYSS-SNDRSIKVWNIED 262
VAS + +++W + L T H+ + + S TL +S S+D S+K+W+I
Sbjct: 548 VASSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSADPTLLASGSDDGSVKLWSINQ 607
Query: 263 RAYVETLFGHQSAITSLDILAKNRVISSGGTDKSVRVWKIQEES--QLVFNGHSRGIDMT 320
+ T+ + R ++ G D V + ++ GHS+ +
Sbjct: 608 GVSIGTIKTKANICCVQFPSESGRSLAFGSADHKVYYYDLRNPKLPLCTMIGHSKTVSYV 667
Query: 321 RRVNDQQFITCGEDGTLSLWSI------LKKVPTHT------------IPVAHGYDSSSS 362
R V+ ++ D TL LW + + + P H+ + V+ GY ++ S
Sbjct: 668 RFVDSSTLVSSSTDNTLKLWDLSMSISGINETPLHSFMGHTNVKNFVGLSVSDGYIATGS 727
Query: 363 EPN 365
E N
Sbjct: 728 ETN 730
>gnl|CDD|173611 PTZ00421, PTZ00421, coronin; Provisional.
Length = 493
Score = 34.5 bits (79), Expect = 0.10
Identities = 21/87 (24%), Positives = 39/87 (44%), Gaps = 5/87 (5%)
Query: 216 LWSSSLEHLHTFTGHKAPITGLAISFE---CDTLYSSSNDRSIKVWNIEDRAYVETLFGH 272
L + + + GH + G+ +SF + L S+ D + VW++E VE + H
Sbjct: 110 LTQNISDPIVHLQGHTKKV-GI-VSFHPSAMNVLASAGADMVVNVWDVERGKAVEVIKCH 167
Query: 273 QSAITSLDILAKNRVISSGGTDKSVRV 299
ITSL+ ++ + DK + +
Sbjct: 168 SDQITSLEWNLDGSLLCTTSKDKKLNI 194
Score = 33.3 bits (76), Expect = 0.26
Identities = 25/85 (29%), Positives = 40/85 (47%), Gaps = 11/85 (12%)
Query: 229 GHKAPITGLAIS-FECDTLYSSSNDRSIKVWNIEDRAYVET-------LFGH--QSAITS 278
G + PI +A + F+ L+++S D +I W I + + L GH + I S
Sbjct: 73 GQEGPIIDVAFNPFDPQKLFTASEDGTIMGWGIPEEGLTQNISDPIVHLQGHTKKVGIVS 132
Query: 279 LDILAKNRVISSGGTDKSVRVWKIQ 303
A N V++S G D V VW ++
Sbjct: 133 FHPSAMN-VLASAGADMVVNVWDVE 156
Score = 31.0 bits (70), Expect = 1.2
Identities = 35/162 (21%), Positives = 72/162 (44%), Gaps = 12/162 (7%)
Query: 153 DSKHVYSACKDGIIIKWLL--ETRSKVCSSPFV--KTDRKSYPPILALAVANNNELVASG 208
D + +++A +DG I+ W + E ++ S P V + K + A N ++AS
Sbjct: 87 DPQKLFTASEDGTIMGWGIPEEGLTQNISDPIVHLQGHTKKVGIVSFHPSAMN--VLASA 144
Query: 209 DNNSNIKLWS-SSLEHLHTFTGHKAPITGLAISFECDTLYSSSNDRSIKVWNIEDRAYVE 267
+ + +W + + H IT L + + L ++S D+ + + + D V
Sbjct: 145 GADMVVNVWDVERGKAVEVIKCHSDQITSLEWNLDGSLLCTTSKDKKLNIIDPRDGTIVS 204
Query: 268 TLFGHQSAITSLDILAKNR-VISSGGTDKS----VRVWKIQE 304
++ H SA + + AK + +I + G KS + +W ++
Sbjct: 205 SVEAHASAKSQRCLWAKRKDLIITLGCSKSQQRQIMLWDTRK 246
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 33.8 bits (78), Expect = 0.20
Identities = 21/95 (22%), Positives = 42/95 (44%), Gaps = 2/95 (2%)
Query: 7 NKKPAATNNNSLKSKKRK-FSEKPTKSKKTVKPDLDEELTSDEDDEGIGFDSQDFEEHSE 65
KK ++S K++K+ + + L++ D+DD+ D D ++ +
Sbjct: 93 KKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDD 152
Query: 66 VEEDLTEQERKVILAKKYLEEIEKVEKEKLNDSKD 100
E+D + + + + E+ E E EKL+D D
Sbjct: 153 DEDDDEDDDDDDVDDEDE-EKKEAKELEKLSDDDD 186
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 33.5 bits (77), Expect = 0.21
Identities = 20/82 (24%), Positives = 34/82 (41%), Gaps = 1/82 (1%)
Query: 38 PDLDEELTSDEDDEGIGFDSQ-DFEEHSEVEEDLTEQERKVILAKKYLEEIEKVEKEKLN 96
D D E SD++ GF+S+ + + T+ +R +K LE K EK+
Sbjct: 245 SDDDGEEESDDESAWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKK 304
Query: 97 DSKDKHVLKKLETVVLEQEGKL 118
LK++ V ++E
Sbjct: 305 KLAQLARLKEIAKEVAQKEKAR 326
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 33.9 bits (77), Expect = 0.21
Identities = 21/82 (25%), Positives = 39/82 (47%), Gaps = 8/82 (9%)
Query: 6 RNKKPAATNNNSLKSKKRKFSEKPTKSKKTVKPDLDEELTSDEDDEGIGFDSQDFEEHSE 65
R ++ AA ++ ++ +K +EK + ++ VK + D +D+G + D EE
Sbjct: 116 RTRRKAAAASSDVEEEK---TEKKVRKRRKVKKMDE-----DVEDQGSESEVSDVEESEF 167
Query: 66 VEEDLTEQERKVILAKKYLEEI 87
V E E ++ L K E+I
Sbjct: 168 VTSLENESEEELDLEKDDGEDI 189
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 33.4 bits (76), Expect = 0.24
Identities = 31/109 (28%), Positives = 45/109 (41%), Gaps = 17/109 (15%)
Query: 19 KSKKRKFSEKPTKSKKTVKPDLDEELTSDEDDEGIGFDSQ--DFEEH--------SEVEE 68
+ K +K +K K+KK K D D++ DD+ +DS D E S
Sbjct: 238 EEKSKKKKKKLAKNKK--KLDDDKKGKRGGDDDADEYDSDDGDDEGREEDYISDSSASGN 295
Query: 69 DLTEQERKVILAKKYLEEIEKVEKEKLNDSKDKHVLKKLETVVLEQEGK 117
D E+E K E K E E+ DS++ K E L ++GK
Sbjct: 296 DPEERED-----KLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGK 339
>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
unknown].
Length = 984
Score = 33.7 bits (77), Expect = 0.26
Identities = 24/95 (25%), Positives = 43/95 (45%), Gaps = 17/95 (17%)
Query: 39 DLDEELTSDEDDEGIGFDSQDFEEHSEVEEDLTEQERKVILAKKYLEEIEKVEKEKLNDS 98
DL + T+D +D F E + E+ LT++E ++ + LE + E +L+ S
Sbjct: 715 DLFDCGTADTEDA--------FREAAREEQQLTQRESRLESLEAQLEGVAA-EAYELSAS 765
Query: 99 KDKHVLKKLETVVLE--------QEGKLRVNIADR 125
D+ LK+ E +LE + +L +A
Sbjct: 766 LDQRELKEEELALLEEAIDALDEEVEELHAQVAAL 800
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 32.8 bits (75), Expect = 0.29
Identities = 16/51 (31%), Positives = 23/51 (45%), Gaps = 9/51 (17%)
Query: 24 KFSEKPTKSKKTVKPDLDEELTSDEDDEGIGFDSQDFEEHSEVEEDLTEQE 74
K S K K+ D+DEE DE++E EE E +ED + +
Sbjct: 152 KLSMLEKKLKELEAEDVDEEDEKDEEEE---------EEEEEEDEDFDDDD 193
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 32.8 bits (75), Expect = 0.30
Identities = 17/107 (15%), Positives = 39/107 (36%), Gaps = 15/107 (14%)
Query: 6 RNKKPAATNNNSLKS--KKRKFSEKPTKSKKTVKPDLDEE---LTSDEDDEGIGFDSQDF 60
+ K A + + + + E + + + ++E+ +D+ D DS D
Sbjct: 74 KENKLAIEDADKSTNLDASNEGDEDDDEEDEIKRKRIEEDARNSDADDSDSSSDSDSSDD 133
Query: 61 EEHSEVEEDLTE----------QERKVILAKKYLEEIEKVEKEKLND 97
+ + ED T +ER ++ E+ + EK + +
Sbjct: 134 DSDDDDSEDETAALLRELEKIKKERAEEKEREEEEKAAEEEKAREEE 180
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 33.1 bits (76), Expect = 0.31
Identities = 21/66 (31%), Positives = 38/66 (57%), Gaps = 8/66 (12%)
Query: 70 LTEQERKVILAKKYLEEIEKVEKEKLN----DSKDKHVLKKLETVVLEQEGKLRVN--IA 123
LTE+ RK +L + Y E++++EKE + K + L++LE VL++E +L +A
Sbjct: 445 LTEEHRKELLEE-YTAELKRIEKELKEIEEKERKLRKELRELEK-VLKKESELIKLKELA 502
Query: 124 DRLANA 129
++L
Sbjct: 503 EQLKEL 508
Score = 28.5 bits (64), Expect = 10.0
Identities = 19/60 (31%), Positives = 30/60 (50%), Gaps = 4/60 (6%)
Query: 64 SEVEEDLTEQERKVILAKKYL----EEIEKVEKEKLNDSKDKHVLKKLETVVLEQEGKLR 119
E E++L E R++ L EE+EK+EKE + K +++LE + EG R
Sbjct: 196 KEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKR 255
>gnl|CDD|218771 pfam05835, Synaphin, Synaphin protein. This family consists of
several eukaryotic synaphin 1 and 2 proteins.
Synaphin/complexin is a cytosolic protein that
preferentially binds to syntaxin within the SNARE
complex. Synaphin promotes SNAREs to form precomplexes
that oligomerise into higher order structures. A peptide
from the central, syntaxin binding domain of synaphin
competitively inhibits these two proteins from
interacting and prevents SNARE complexes from
oligomerising. It is thought that oligomerisation of
SNARE complexes into a higher order structure creates a
SNARE scaffold for efficient, regulated fusion of
synaptic vesicles. Synaphin promotes neuronal exocytosis
by promoting interaction between the complementary
syntaxin and synaptobrevin transmembrane regions that
reside in opposing membranes prior to fusion.
Length = 139
Score = 31.8 bits (72), Expect = 0.34
Identities = 26/87 (29%), Positives = 38/87 (43%), Gaps = 9/87 (10%)
Query: 33 KKTVKPDLDEELTSDEDDEGIGFDSQDFEEHSEVEEDLTEQE--RKVILAKKYLEEIEKV 90
K+ V L + E G +S EE E++E L E E RK K +EE
Sbjct: 6 KQMVGNQLKDVKGMLGGKEDEGDESDAEEEDEEIQEALREAEEERKAKHRK--MEE---- 59
Query: 91 EKEKL-NDSKDKHVLKKLETVVLEQEG 116
E+E + +DK+ +KK E E +
Sbjct: 60 EREVMRQGIRDKYGIKKKEEDEEEPQA 86
>gnl|CDD|197742 smart00466, SRA, SET and RING finger associated domain. Domain of
unknown function in SET domain containing proteins and
in Deinococcus radiodurans DRA1533.
Length = 155
Score = 32.0 bits (73), Expect = 0.35
Identities = 22/102 (21%), Positives = 36/102 (35%), Gaps = 14/102 (13%)
Query: 271 GHQSAITSLDILAKNR----VISSGGTDKSVRVWKIQEESQLVFNGHSRGIDMTRRVNDQ 326
Q+ I L+ V+SSGG + + L++ G + DQ
Sbjct: 31 PTQAGIDGLESDEGEPGATSVVSSGGYED-----DTDDGDVLIYTGQGGRDMTHGQPEDQ 85
Query: 327 QFITCGEDGTLSL-WSILKKVPTHTIPVAHGYDSSSSEPNWI 367
+ E G L+L S K +P + GY + +I
Sbjct: 86 KL----ERGNLALEASCRKGIPVRVVRGMKGYSKYAPGKGYI 123
>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
(DUF1754). This is a eukaryotic protein family of
unknown function.
Length = 90
Score = 30.5 bits (69), Expect = 0.42
Identities = 24/74 (32%), Positives = 38/74 (51%), Gaps = 3/74 (4%)
Query: 16 NSLKSKKRKFSEKPTKSKKTVKPDLDEELTSDEDDEGIGFDSQDFEEHSEVEED--LTEQ 73
+K KK+K +K ++ V +EE +S E D G + +D E+ + E+ LTE
Sbjct: 17 IDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDEDNEKIEQEEDGMNLTEA 76
Query: 74 ERKVILA-KKYLEE 86
ER A +K L+E
Sbjct: 77 ERAFEEAQRKRLKE 90
>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding. This
family is the highly conserved central region of a
number of metazoan proteins referred to as growth-arrest
proteins. In mouse, Gas8 is predominantly a testicular
protein, whose expression is developmentally regulated
during puberty and spermatogenesis. In humans, it is
absent in infertile males who lack the ability to
generate gametes. The localisation of Gas8 in the
motility apparatus of post-meiotic gametocytes and
mature spermatozoa, together with the detection of Gas8
also in cilia at the apical surfaces of epithelial cells
lining the pulmonary bronchi and Fallopian tubes
suggests that the Gas8 protein may have a role in the
functioning of motile cellular appendages. Gas8 is a
microtubule-binding protein localised to regions of
dynein regulation in mammalian cells.
Length = 201
Score = 31.8 bits (73), Expect = 0.45
Identities = 22/61 (36%), Positives = 34/61 (55%), Gaps = 2/61 (3%)
Query: 61 EEHSEVEEDLTEQERKVILA--KKYLEEIEKVEKEKLNDSKDKHVLKKLETVVLEQEGKL 118
EEH+E QE K ++ KK EE+E++ K+ + KDK LK L+ + E E +L
Sbjct: 43 EEHNEKLMAEIAQENKRLVEPLKKAEEEVEELRKKLKDYEKDKQSLKNLKARLKELEKEL 102
Query: 119 R 119
+
Sbjct: 103 K 103
>gnl|CDD|222926 PHA02745, PHA02745, hypothetical protein; Provisional.
Length = 265
Score = 31.8 bits (72), Expect = 0.63
Identities = 23/102 (22%), Positives = 41/102 (40%), Gaps = 1/102 (0%)
Query: 5 IRNKKPAATNNNSLKSKKRKFSEKPTKSKKTVKPDLDEELTSDEDDEGIGFDSQDFEEHS 64
+R A + +K R+ K + DL E+++ D D D EEH
Sbjct: 18 LRLLHSPAIRPSVIKEPTRRNYLSNCSQVKKLLEDLFEDISGDLDIRTSTGDYVSEEEHF 77
Query: 65 EVEEDLTEQERKVILAKKYLEEIEKVEKEKLNDSKDKHVLKK 106
D E + L K +++ +E K ++S+ ++ L K
Sbjct: 78 -GRVDTQEVDEFRGLNAKIRDKVLPIEDNKHDNSETRNPLNK 118
>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family. This family of
proteins contain a band 4.1 domain (pfam00373), at their
amino terminus. This family represents the rest of these
proteins.
Length = 244
Score = 31.6 bits (72), Expect = 0.74
Identities = 18/95 (18%), Positives = 37/95 (38%), Gaps = 22/95 (23%)
Query: 19 KSKKRKFSEKPTKSKKTVKPDLDEELTSDEDDEGIGFDSQDFEEHSEVEEDLTEQERKVI 78
+ + + + PT P E+L D++ E S D E ++ +D +E+ER
Sbjct: 123 RQELLEAAAAPTAPPHVAAPVNGEQLEPDDNGEEA---SADLETDPDM-KDRSEEER--- 175
Query: 79 LAKKYLEEIEKVEKEKLNDSKDKHVLKKLETVVLE 113
+ E K++ + +L+ + E
Sbjct: 176 --------VTYAE-------KNERLQTQLQALKSE 195
>gnl|CDD|109884 pfam00846, Hanta_nucleocap, Hantavirus nucleocapsid protein.
Length = 428
Score = 31.8 bits (72), Expect = 0.75
Identities = 25/80 (31%), Positives = 44/80 (55%), Gaps = 11/80 (13%)
Query: 65 EVEEDLTEQERKVILAKKYLEEIEK-VEKEKLNDSKDKHVLKKLETVVLEQEGK---LRV 120
E++E++T E+++++A++ L++ EK VE + D +K L+ V E K L+
Sbjct: 6 ELQEEITAHEQQLVIARQKLKDAEKAVEVDP--DDVNKSTLQSRRAAVSALEAKLAELKR 63
Query: 121 NIADRLANATVVCTKFNGKP 140
+ADR+A K GKP
Sbjct: 64 QLADRIATQ-----KLAGKP 78
>gnl|CDD|235229 PRK04156, gltX, glutamyl-tRNA synthetase; Provisional.
Length = 567
Score = 31.7 bits (73), Expect = 0.96
Identities = 16/60 (26%), Positives = 31/60 (51%), Gaps = 9/60 (15%)
Query: 66 VEE--DLTEQERKVILAKKYLEEIEKVEKEKLNDSKDKHVLKKLETVVLEQEGKLRVNIA 123
VEE L+ +E++ L + E +E+ E++K ++K L L ++GK+ + A
Sbjct: 55 VEEVNSLSLEEQRERLEELAPELLEEEEEKK----EEKKGLPPLPNA---EKGKVVMRFA 107
>gnl|CDD|222512 pfam14048, MBD_C, C-terminal domain of methyl-CpG binding protein
2 and 3. CpG-methylation is a frequently occurring
epigenetic modification of vertebrate genomes resulting
in transcriptional repression. This domain was found at
the C-terminus of the methyl-CpG-binding domain (MBD)
containing proteins MBD2 and MBD3, the latter was shown
to not bind directly to methyl-CpG DNA but rather
interact with components of the NuRD/Mi2 complex, an
abundant deacetylase complex. The domain is subject to
structure determination by the Joint Center of
Structural Genomics.
Length = 96
Score = 29.6 bits (67), Expect = 1.0
Identities = 11/21 (52%), Positives = 14/21 (66%)
Query: 67 EEDLTEQERKVILAKKYLEEI 87
EED+ QER+V A+K L E
Sbjct: 75 EEDIRRQERRVKDARKRLAEA 95
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 31.3 bits (71), Expect = 1.2
Identities = 20/70 (28%), Positives = 33/70 (47%), Gaps = 6/70 (8%)
Query: 30 TKSKKTVKPDLDEELTSDEDDEGIGFDSQDFEEHSEVEEDLTEQERKVILAKKYLEEIEK 89
K + PD +EE E+ + + D EE + EE+ E+++K K ++E
Sbjct: 28 EKEVEKEVPD-EEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKK----TKKVKE-TT 81
Query: 90 VEKEKLNDSK 99
E E LN +K
Sbjct: 82 TEWELLNKTK 91
>gnl|CDD|200601 cd10979, CE4_PuuE_like, Putative catalytic domain of
uncharacterized prokaryotic polysaccharide deacetylases
similar to bacterial PuuE allantoinases. The family
includes a group of uncharacterized prokaryotic
polysaccharide deacetylases (DCAs) that show high
sequence similarity to the catalytic domain of bacterial
PuuE (purine utilization E) allantoinases. PuuE
allantoinase specifically catalyzes the hydrolysis of
(S)-allantoin into allantoic acid. It functions as a
homotetramer. Its monomer is composed of a 7-stranded
barrel with detectable sequence similarity to the
6-stranded barrel NodB homology domain of DCA-like
proteins in the CE4 superfamily, which removes N-linked
or O-linked acetyl groups from cell wall
polysaccharides. PuuE allantoinase appears to be
metal-independent and acts on a small substrate
molecule, which is distinct from the common feature of
DCAs which are normally metal ion dependent and
recognize multimeric substrates.
Length = 281
Score = 30.7 bits (70), Expect = 1.3
Identities = 13/62 (20%), Positives = 25/62 (40%), Gaps = 11/62 (17%)
Query: 190 YPPILALAVANNNELVASGDNNSNI----------KLWSSSLEHLHTFTGHKAPITGLAI 239
YP ++ E +A G +NS + ++ + SL+ + TG + P L+
Sbjct: 91 YPELIEAIRERGWEFIAHGISNSTLHAGLDEAQEREVIAESLDRIEKATG-QRPRGWLSP 149
Query: 240 SF 241
Sbjct: 150 GL 151
>gnl|CDD|236410 PRK09202, nusA, transcription elongation factor NusA; Validated.
Length = 470
Score = 31.0 bits (71), Expect = 1.4
Identities = 29/108 (26%), Positives = 44/108 (40%), Gaps = 18/108 (16%)
Query: 20 SKKRKFSEKPTKSKKTVKPDLDEELTSDEDDEGIGFDS---------QDFEEHSEVEEDL 70
S+KR+ D+DEE+ +EG F S ++ E +E+
Sbjct: 344 SEKRQAEFDAILDLFMEALDIDEEIAQLLVEEG--FSSLEELAYVPVEELLEIEGFDEET 401
Query: 71 TEQERKVILAKKYLEEIEKVEKEKLNDSKDKHVLKKLETVVLEQEGKL 118
E+ R+ AK+ LE ++EKL D L LE + E KL
Sbjct: 402 VEELRER--AKEALETEALAQEEKLADD-----LLSLEGLDRELAFKL 442
>gnl|CDD|227447 COG5117, NOC3, Protein involved in the nuclear export of
pre-ribosomes [Translation, ribosomal structure and
biogenesis / Intracellular trafficking and secretion].
Length = 657
Score = 31.1 bits (70), Expect = 1.4
Identities = 19/94 (20%), Positives = 34/94 (36%), Gaps = 9/94 (9%)
Query: 5 IRNKKPAATNNNSLKSKKRKFSEKPTK--SKKTVKPDLDEELTSDEDDEGIGFDSQDFEE 62
+R + ++ NN S K + + E S E D I + Q E
Sbjct: 60 LRPRVSSSWNNESYNRLPIKTKDNVVADVNNGEEFLSESESEASLEIDSDIKDEKQKSLE 119
Query: 63 HSEVEED-------LTEQERKVILAKKYLEEIEK 89
++ + +E+ER + K +EE E+
Sbjct: 120 EQKIAPEIPVKQQIDSEKERIASICTKIIEEPEE 153
>gnl|CDD|227429 COG5098, COG5098, Chromosome condensation complex Condensin,
subunit D2 [Chromatin structure and dynamics / Cell
division and chromosome partitioning].
Length = 1128
Score = 30.7 bits (69), Expect = 1.7
Identities = 18/79 (22%), Positives = 28/79 (35%)
Query: 8 KKPAATNNNSLKSKKRKFSEKPTKSKKTVKPDLDEELTSDEDDEGIGFDSQDFEEHSEVE 67
+K + N L S K E +K V+ D + T E +S+ E +
Sbjct: 433 EKNLGSAENQLNSLKSGLQETLCDGEKEVEQDEGQCRTELEGSFNKSAESRIENEVENIN 492
Query: 68 EDLTEQERKVILAKKYLEE 86
T K+ L Y E+
Sbjct: 493 ATNTSVLMKLKLMIVYYED 511
>gnl|CDD|227772 COG5485, COG5485, Predicted ester cyclase [General function
prediction only].
Length = 131
Score = 29.3 bits (66), Expect = 1.9
Identities = 11/48 (22%), Positives = 21/48 (43%), Gaps = 6/48 (12%)
Query: 226 TFTGHKA------PITGLAISFECDTLYSSSNDRSIKVWNIEDRAYVE 267
TF + P G + F + Y N + ++VW++ D+ +E
Sbjct: 80 TFDCTPSGEIMGIPPNGKRVRFSENVFYEFENGKIVEVWSVIDKMAIE 127
>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein. This family includes proteins
related to Mpp10 (M phase phosphoprotein 10). The U3
small nucleolar ribonucleoprotein (snoRNP) is required
for three cleavage events that generate the mature 18S
rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
depletion of Mpp10, a U3 snoRNP-specific protein, halts
18S rRNA production and impairs cleavage at the three U3
snoRNP-dependent sites.
Length = 613
Score = 30.7 bits (69), Expect = 1.9
Identities = 18/81 (22%), Positives = 35/81 (43%), Gaps = 4/81 (4%)
Query: 19 KSKKRKFSEKPTKSKKTVKPDLDEELTSDEDDEGIGFDSQDFEEHSEVEE-DLTEQERKV 77
K+ E +K ++ + D ++E DE++ + + +E DL +
Sbjct: 280 PDKEAVKKEADSKPEEEDEEDDEQEDDQDEEEPPEAAMDKVKLDEPVLEGVDLESPKELS 339
Query: 78 ILAK---KYLEEIEKVEKEKL 95
K K ++IE++EKE L
Sbjct: 340 SFEKRQAKLKQQIEQLEKENL 360
>gnl|CDD|220284 pfam09538, FYDLN_acid, Protein of unknown function (FYDLN_acid).
Members of this family are bacterial proteins with a
conserved motif [KR]FYDLN, sometimes flanked by a pair
of CXXC motifs, followed by a long region of low
complexity sequence in which roughly half the residues
are Asp and Glu, including multiple runs of five or
more acidic residues. The function of members of this
family is unknown.
Length = 104
Score = 28.8 bits (65), Expect = 2.1
Identities = 8/53 (15%), Positives = 21/53 (39%)
Query: 17 SLKSKKRKFSEKPTKSKKTVKPDLDEELTSDEDDEGIGFDSQDFEEHSEVEED 69
+ +E K + + + D L D+DD+ ++ ++++D
Sbjct: 41 AKSRAPAADAEDAAKKDEDEEDEDDVVLDDDDDDDDDDDLPDLDDDDVDLDDD 93
>gnl|CDD|107089 PHA01819, PHA01819, hypothetical protein.
Length = 129
Score = 29.2 bits (64), Expect = 2.2
Identities = 27/85 (31%), Positives = 40/85 (47%), Gaps = 9/85 (10%)
Query: 30 TKSKKTVKPDLDEELTSDEDDEGIGFDSQDFEEHSEVEEDLTEQERKVILAKKYLEEIEK 89
T+ + VKPD D+E TS+ +E + + E E E+ TE K +E IE+
Sbjct: 2 TEFDEIVKPD-DKEETSESTEENLESTEETSESTEESTEESTED--------KTVETIEE 52
Query: 90 VEKEKLNDSKDKHVLKKLETVVLEQ 114
+ KL + K + VVLEQ
Sbjct: 53 ENENKLEPTTTDEDSSKFDPVVLEQ 77
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 30.2 bits (69), Expect = 2.4
Identities = 23/78 (29%), Positives = 37/78 (47%), Gaps = 13/78 (16%)
Query: 53 IGFDSQDFEEHSE-VEEDLTEQERKVILAKKYLEEIEKV------EKEKLNDSKDKHVLK 105
IG D + E +EE E E+K A+ L+E EK+ +KEKL + +DK + +
Sbjct: 511 IGEDKEKLNELIASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEE 570
Query: 106 KLETVVLEQEGKLRVNIA 123
E+E + + A
Sbjct: 571 ------AEKEAQQAIKEA 582
>gnl|CDD|219668 pfam07964, Red1, Rec10 / Red1. Rec10 / Red1 is involved in meiotic
recombination and chromosome segregation during
homologous chromosome formation. This protein localises
to the synaptonemal complex in S. cerevisiae and the
analogous structures (linear elements) in S. pombe. This
family is currently only found in fungi.
Length = 706
Score = 30.2 bits (68), Expect = 2.7
Identities = 30/128 (23%), Positives = 47/128 (36%), Gaps = 7/128 (5%)
Query: 9 KPAATNNNSLKSKKRKFSEKPTKSKKTVKPDLDEELTSDEDDEGIGFDSQDFEEHSEVEE 68
K NNNS ++ K K K + K D+ E + + E D Q S ++
Sbjct: 485 KKQLANNNS-QNIKSKKVVKAKTNNKANLQDVGECSSPPNNKE--KNDKQTSTSSSVLKS 541
Query: 69 DLTEQERKVILAK-KYLEEIEKVEKEKLNDSKDKHVLKKLETV---VLEQEGKLRVNIAD 124
D + E + A K LE+ K + + L + T V + VNI D
Sbjct: 542 DRSSIEVRNPNANVKKLEDTTYNAKFPTVSKNNAYTLVDISTSEDAVNSADDTRSVNITD 601
Query: 125 RLANATVV 132
+ T+
Sbjct: 602 ISESTTIS 609
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 30.0 bits (68), Expect = 3.1
Identities = 34/134 (25%), Positives = 53/134 (39%), Gaps = 39/134 (29%)
Query: 7 NKKPAATNNNSLKSKKRKFSEKPTKSKKTVKPDLDEELTSDEDDEGIGFDS--------- 57
+K K K + + ++K D+D+E DEDDE + F
Sbjct: 528 SKAAVKVKKKKKKEKSIDLDDDLIDEEDSIKLDVDDE--EDEDDEELPFLFKQKDLIKEA 585
Query: 58 -------QDFE-EHSEV-------EEDLT-------------EQERKVILAKKYLEEIEK 89
+FE E EV E DLT ++++K +++L +IE
Sbjct: 586 FAGDDVVAEFEKEKKEVIEEEDPKEIDLTLPGWGSWAGDGIKKRKKKRKRKRRFLTKIEG 645
Query: 90 VEKEKLNDSKDKHV 103
V+KEK D K K+V
Sbjct: 646 VKKEKRKDKKLKNV 659
>gnl|CDD|221490 pfam12253, CAF1A, Chromatin assembly factor 1 subunit A. The
CAF-1 or chromatin assembly factor-1 consists of three
subunits, and this is the first, or A. The A domain is
uniquely required for the progression of S phase in
mouse cells, independent of its ability to promote
histone deposition but dependent on its ability to
interact with HP1 - heterochromatin protein 1-rich
heterochromatin domains next to centromeres that are
crucial for chromosome segregation during mitosis. This
HP1-CAF-1 interaction module functions as a built-in
replication control for heterochromatin, which, like a
control barrier, has an impact on S-phase progression
in addition to DNA-based checkpoints.
Length = 76
Score = 27.6 bits (62), Expect = 3.1
Identities = 13/40 (32%), Positives = 21/40 (52%), Gaps = 4/40 (10%)
Query: 34 KTVKPDLDEELTSD----EDDEGIGFDSQDFEEHSEVEED 69
P LD + SD E++EG +S+D E+ E ++D
Sbjct: 32 SQDLPGLDYDYDSDAEWEEEEEGEDLESEDEEDEEEDDDD 71
>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain. Axonemal
dynein light chain proteins play a dynamic role in
flagellar and cilia motility. Eukaryotic cilia and
flagella are complex organelles consisting of a core
structure, the axoneme, which is composed of nine
microtubule doublets forming a cylinder that surrounds a
pair of central singlet microtubules. This
ultra-structural arrangement seems to be one of the most
stable micro-tubular assemblies known and is responsible
for the flagellar and ciliary movement of a large number
of organisms ranging from protozoan to mammals. This
light chain interacts directly with the N-terminal half
of the heavy chains.
Length = 189
Score = 29.1 bits (66), Expect = 3.1
Identities = 14/43 (32%), Positives = 25/43 (58%)
Query: 60 FEEHSEVEEDLTEQERKVILAKKYLEEIEKVEKEKLNDSKDKH 102
+E ++EE+ E E++V + LE IEK E+E+ + +H
Sbjct: 126 EQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRH 168
>gnl|CDD|234096 TIGR03031, cas_csx12, CRISPR-associated protein Cas9/Csx12, subtype
II-B/NMENI. Members of this family of CRISPR-associated
(cas) protein are found, so far, in CRISPR/cas loci in
Wolinella succinogenes DSM 1740, Legionella pneumophila
str. Paris, and Francisella tularensis, where the last
probably is an example of a degenerate CRISPR locus,
having neither repeats nor a functional Cas1. The
characteristic repeat length is 37 base pairs and period
is about 72. One region of this large protein shows
sequence similarity to pfam01844, HNH endonuclease
[Mobile and extrachromosomal element functions, Other].
Length = 802
Score = 30.0 bits (67), Expect = 3.5
Identities = 23/97 (23%), Positives = 36/97 (37%), Gaps = 12/97 (12%)
Query: 39 DLDE----ELTSDEDDEGIGFDSQDFEEHSEVEEDLTEQERKVILAKKYLEEIEKVEKEK 94
DLDE + E + F + E + ++ L + + Y EE K+ K
Sbjct: 104 DLDEYIKDATEINLLKELLPFKLRSISEDNFIDWFLQKIQSSADYLANYSEEFRKILVSK 163
Query: 95 LNDSKDKHVLKKLETVVLEQEGKLRVNIADRLANATV 131
+ KD LK K + NI + L AT+
Sbjct: 164 TEEKKDDKELK--------NAVKDKYNIKEFLTGATI 192
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 28.9 bits (65), Expect = 4.0
Identities = 15/72 (20%), Positives = 23/72 (31%)
Query: 7 NKKPAATNNNSLKSKKRKFSEKPTKSKKTVKPDLDEELTSDEDDEGIGFDSQDFEEHSEV 66
KK ++ K + E K K D ++E E++ + E E
Sbjct: 51 AKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKET 110
Query: 67 EEDLTEQERKVI 78
EE K I
Sbjct: 111 EEKTESNVEKEI 122
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 29.7 bits (67), Expect = 4.0
Identities = 20/104 (19%), Positives = 41/104 (39%), Gaps = 14/104 (13%)
Query: 16 NSLKSKKRKFSEKPTKSKKTVKPDLDEELTSDEDDEGIGFDSQDFEEHSEVEEDLTEQER 75
L+ ++ K E+ + ++ + L++E+ + + E E+E + E E
Sbjct: 726 EQLEQEEEKLKERLEELEEDLS-SLEQEIENVKS------------ELKELEARIEELEE 772
Query: 76 KVILAKKYLEEIEKVEKEKLNDSKDKHVLKKLETVVLEQEGKLR 119
+ ++ L ++E L KLE V E +LR
Sbjct: 773 DLHKLEEALNDLEARLSHSRIPEIQAE-LSKLEEEVSRIEARLR 815
>gnl|CDD|223642 COG0568, RpoD, DNA-directed RNA polymerase, sigma subunit
(sigma70/sigma32) [Transcription].
Length = 342
Score = 29.2 bits (66), Expect = 4.1
Identities = 29/104 (27%), Positives = 43/104 (41%), Gaps = 32/104 (30%)
Query: 39 DLDEELTSDEDDEGIGF----DSQDFEEHSE-----------VEEDLTEQERKVILA--- 80
LD + DED E F S E+ E + E LTE+ER+VI
Sbjct: 234 SLDTPIGDDEDSELGDFLEDDKSVSPEDAVERESLKEDLNEVLAEALTERERRVIRLRFG 293
Query: 81 -----KKYLEEIEKVEKEKLNDSKDKHVLKKLETVVLEQEGKLR 119
K LEE+ E+ S+++ ++++E L KLR
Sbjct: 294 LDDGEPKTLEEL----GEEFGISRER--VRQIEAKALR---KLR 328
>gnl|CDD|218598 pfam05470, eIF-3c_N, Eukaryotic translation initiation factor 3
subunit 8 N-terminus. The largest of the mammalian
translation initiation factors, eIF3, consists of at
least eight subunits ranging in mass from 35 to 170 kDa.
eIF3 binds to the 40 S ribosome in an early step of
translation initiation and promotes the binding of
methionyl-tRNAi and mRNA.
Length = 593
Score = 29.4 bits (66), Expect = 4.4
Identities = 40/139 (28%), Positives = 60/139 (43%), Gaps = 30/139 (21%)
Query: 8 KKPAATNN----NSLKSKKRK----FSEKPTKSKKTVKPDL-DEELTSDEDDEGIGFDSQ 58
KK + NN N+LK K +K F + T+ ++ P+ DEE DEDD+ G D +
Sbjct: 99 KKKMSKNNAKALNTLKQKVKKNNKQFEDDITRYRED--PESEDEEEEEDEDDDDDGSDDE 156
Query: 59 DFEEHS----------------EVEEDLTEQERKVILAKKYLEEIEKVEKEKLNDSKDKH 102
D +E V+ED E E + K LEE + +K + D
Sbjct: 157 DEDEDGVGATEEVAASSESGVDRVKEDDEEDEDADLSKKDVLEEPKMFKKPEEITWDD-- 214
Query: 103 VLKKLETVVLEQEGKLRVN 121
V KKL+ ++ GK +
Sbjct: 215 VFKKLKE-IMSARGKKTTD 232
>gnl|CDD|233819 TIGR02307, RNA_lig_RNL2, RNA ligase, Rnl2 family. Members of this
family ligate (seal breaks in) RNA. Members so far
include phage proteins that can counteract a host
defense of cleavage of specific tRNA molecules,
trypanosome ligases involved in RNA editing, but no
prokaryotic host proteins [Mobile and extrachromosomal
element functions, Prophage functions, Transcription,
RNA processing].
Length = 325
Score = 29.0 bits (65), Expect = 4.7
Identities = 27/120 (22%), Positives = 47/120 (39%), Gaps = 12/120 (10%)
Query: 5 IRNKKPAATNNNS-LKSKKRKFSEKPTKSKKTVKPDLDEELTSDEDDEGIGFDSQDFEEH 63
R K N + +K K KFSEK K+ K + + + S+ ++ + E+
Sbjct: 198 CRPGKWLRNGNRTIIKCKNSKFSEKKNKAGKPILAKV---VLSEAQNKLVALLLCYVTEN 254
Query: 64 ------SEVEEDLTEQERKV--ILAKKYLEEIEKVEKEKLNDSKDKHVLKKLETVVLEQE 115
S++ + + V +LA+ L+E + N D L K V + QE
Sbjct: 255 RLNNVLSKIGTEPPKDFGMVMGLLAQDALKEFLRETPGITNTQADNRKLIKRSLVFMAQE 314
>gnl|CDD|227494 COG5165, POB3, Nucleosome-binding factor SPN, POB3 subunit
[Transcription / DNA replication, recombination, and
repair / Chromatin structure and dynamics].
Length = 508
Score = 29.2 bits (65), Expect = 5.2
Identities = 23/105 (21%), Positives = 45/105 (42%), Gaps = 10/105 (9%)
Query: 1 MSFFIRNKKPAATNNNSLKSKKRKFSEKPTKSKK-TVKPDLDEE-----LTSDEDDEGIG 54
++ F+R+ P + N++ ++ E+ SK + + +E L S D E I
Sbjct: 405 LTLFLRS--PGSYTFNNISKDEQGALEQFLHSKGIKARNEEVQERLQTDLGSISDSEDIN 462
Query: 55 FDSQDFEEHSEVEEDLTEQERKVILAKKYLEEIEKVEKEKLNDSK 99
S E+ SE E+ + V A++Y + + E +D +
Sbjct: 463 MGSAGEEDESEDEDFQMVSDSDV--AEEYDLQAALSDAEGGSDEE 505
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 28.9 bits (65), Expect = 5.4
Identities = 20/72 (27%), Positives = 31/72 (43%), Gaps = 3/72 (4%)
Query: 47 DEDDEGIGFDSQDFEEHSEVEEDLTEQE---RKVILAKKYLEEIEKVEKEKLNDSKDKHV 103
DED++ DS D EE E EE+ + + K +EE EK ++EK +
Sbjct: 26 DEDEDDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELE 85
Query: 104 LKKLETVVLEQE 115
E + E+
Sbjct: 86 EDTPEDELAEKL 97
>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO. Bacillus
spores are protected by a protein shell consisting of
over 50 different polypeptides, known as the coat. This
family of proteins has an important morphogenetic role
in coat assembly, it is involved in the assembly of at
least 5 different coat proteins including CotB, CotG,
CotS, CotSA and CotW. It is likely to act at a late
stage of coat assembly.
Length = 185
Score = 28.6 bits (64), Expect = 5.5
Identities = 20/93 (21%), Positives = 37/93 (39%), Gaps = 2/93 (2%)
Query: 2 SFFIRN--KKPAATNNNSLKSKKRKFSEKPTKSKKTVKPDLDEELTSDEDDEGIGFDSQD 59
+F I+ +K N+ K ++ +K + + + E D ++ +
Sbjct: 30 TFIIKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQ 89
Query: 60 FEEHSEVEEDLTEQERKVILAKKYLEEIEKVEK 92
EE E ED+ +QE KK +E+ EK
Sbjct: 90 EEEKEEEAEDVKQQEVFSFKRKKPFKEMNLEEK 122
>gnl|CDD|220491 pfam09958, DUF2192, Uncharacterized protein conserved in archaea
(DUF2192). This domain, found in various hypothetical
archaeal proteins, has no known function.
Length = 231
Score = 28.4 bits (64), Expect = 5.8
Identities = 28/95 (29%), Positives = 39/95 (41%), Gaps = 21/95 (22%)
Query: 28 KPTKSKKTVKPDL-DEELTS-----------DEDDEGIGFDSQDFEEHSEVEEDLTEQER 75
KP + PDL D+EL S DED G F++ +E E+
Sbjct: 45 KPFRGLS--PPDLYDKELASLYVVGKYGLGLDEDYPG------VFDKLFYIENRY-EEAL 95
Query: 76 KVILAKKYLEEIEKVEKEKLNDSKDKHVLKKLETV 110
++IL K+ E EKV E S + V + L V
Sbjct: 96 EIILEKEPEEAREKVTGELGGKSDENKVARVLRLV 130
>gnl|CDD|223683 COG0610, COG0610, Type I site-specific restriction-modification
system, R (restriction) subunit and related helicases
[Defense mechanisms].
Length = 962
Score = 29.3 bits (66), Expect = 6.0
Identities = 15/86 (17%), Positives = 34/86 (39%), Gaps = 3/86 (3%)
Query: 18 LKSKKRKFSEKPTKSKKTVKP--DLDEELTSDEDDEGIGFDSQDFEEHSEVEEDLTEQER 75
L++ K + + + + E+L + ++ + +++ +E
Sbjct: 822 LQTFNDKNGAYESLKELIERIIKEWIEDLRQKKKLIERLIEAINQYRAKKLDTAEKLEE- 880
Query: 76 KVILAKKYLEEIEKVEKEKLNDSKDK 101
ILAKK E + E+E LN+ +
Sbjct: 881 LYILAKKEEEFKQFAEEEGLNEEELA 906
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 29.2 bits (65), Expect = 6.1
Identities = 17/54 (31%), Positives = 25/54 (46%), Gaps = 8/54 (14%)
Query: 41 DEELTSDEDDEGIGFDSQDFEEHSEVEEDLTEQERKVILAKKYLEEIEKVEKEK 94
+E D G DS++ EE E EE+ E+E + EE E+ E E+
Sbjct: 847 QDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEE--------EEEEEEENEE 892
>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
[Transcription / Chromatin structure and dynamics].
Length = 279
Score = 28.4 bits (63), Expect = 6.9
Identities = 22/115 (19%), Positives = 43/115 (37%), Gaps = 9/115 (7%)
Query: 5 IRNKKPAATNNNSLKSKKRKFSEKPTKSKKTVKPDLDEE----LTSDEDDEGIGFDSQDF 60
I +KP T N + E P +PD+D E L + E +
Sbjct: 144 ILAEKPRVTRFNIVWDNDEDNDEAPP-----AQPDVDNEEEERLEESDGREEEEDEEVGS 198
Query: 61 EEHSEVEEDLTEQERKVILAKKYLEEIEKVEKEKLNDSKDKHVLKKLETVVLEQE 115
+ + E +L E+E + E++ E E+++ + + + E + L +
Sbjct: 199 DSYGEGNRELNEEEEEEAEGSDDGEDVVDYEGERIDKKQGEEEEMEEEVINLFEI 253
>gnl|CDD|227448 COG5118, BDP1, Transcription initiation factor TFIIIB, Bdp1 subunit
[Transcription].
Length = 507
Score = 28.9 bits (64), Expect = 7.3
Identities = 21/116 (18%), Positives = 41/116 (35%), Gaps = 20/116 (17%)
Query: 16 NSLKSKKRKFSEKPTKSKKTVKPDL-------------DEELTSDEDDEGIGFDSQDFEE 62
N K+ + KP+ +KK K E TS E DE + +
Sbjct: 149 NLKTFKRHRVLGKPSSAKKPAKISPPTAMTDSLDRNFSSETSTSREADENENYVISKVK- 207
Query: 63 HSEVEEDLTEQERKVILAKKYLEEIEKVEKEKLNDSKDKHVLKKLETVVLEQEGKL 118
++ + + + E AK +++E E + + + E + ++ KL
Sbjct: 208 --DIPKKVRDGES----AKYFIDEENFTMAELCKPNFPIQISENFEKSKMAKKAKL 257
>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein. This family
consists of various plant development proteins which are
homologues of floricaula (FLO) and Leafy (LFY) proteins
which are floral meristem identity proteins. Mutations
in the sequences of these proteins affect flower and
leaf development.
Length = 382
Score = 28.4 bits (64), Expect = 7.4
Identities = 23/102 (22%), Positives = 44/102 (43%), Gaps = 24/102 (23%)
Query: 11 AATNNNSLKSKKRKFSEKPTKSKKTVKPDLDEELTSDEDDEGIGFDSQDFE-EH------ 63
++S KK++ ++ K K ++ D D++ D+D EG G ++ + EH
Sbjct: 173 GVPGHSSDSEKKKQRKKQRRKRSKELREDDDDDEDEDDDGEGGGEGGEERQREHPFIVTE 232
Query: 64 -SEVEE----------DLTEQERKVILAKKYLEEIEKVEKEK 94
EV L EQ R ++L +++ + KE+
Sbjct: 233 PGEVARGKKNGLDYLFHLYEQCR------EFLIQVQNIAKER 268
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 28.5 bits (64), Expect = 7.4
Identities = 14/63 (22%), Positives = 26/63 (41%), Gaps = 3/63 (4%)
Query: 41 DEELTSDEDDEGIGFDSQDFEEHSEVEEDLTEQERKVILAKKYLEE-IEKVEKEKLNDSK 99
D + EDDE + E E++ + +++K + K Y E +K +K+
Sbjct: 55 DSDFDDSEDDEPE--SDDEEEGEKELQREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKS 112
Query: 100 DKH 102
K
Sbjct: 113 PKA 115
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 29.0 bits (65), Expect = 7.5
Identities = 16/76 (21%), Positives = 34/76 (44%), Gaps = 2/76 (2%)
Query: 40 LDEELTSDEDDEGIGFDSQDFEEHSEVEEDLTEQERKVILAKKYLEEIEKVEKEKLNDSK 99
LDE ++ ++ + +E E+L + + L + + +E +E+E K
Sbjct: 160 LDELFGLEKYEKLSELLKEVIKEAKAKIEEL--EGQLSELLEDIEDLLEALEEELKELKK 217
Query: 100 DKHVLKKLETVVLEQE 115
+ + ++ E LEQE
Sbjct: 218 LEEIQEEQEEEELEQE 233
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 28.8 bits (65), Expect = 7.8
Identities = 21/84 (25%), Positives = 37/84 (44%), Gaps = 11/84 (13%)
Query: 39 DLDEELTSDEDDEGIGFDSQDFEE-HSEVEEDLTEQERKVILAKKYLEEIEKVEK----- 92
D D++L +E+D + + +D E+ S+ E+D E+E + KK E + E
Sbjct: 337 DDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSELPFTFP 396
Query: 93 -----EKLNDSKDKHVLKKLETVV 111
E+ + L+ TVV
Sbjct: 397 CPKSHEEFLELLKGVSLEDQPTVV 420
>gnl|CDD|219461 pfam07543, PGA2, Protein trafficking PGA2. A Saccharomyces
cerevisiae member of this family (PGA2) is an ER protein
which has been implicated in protein trafficking.
Length = 139
Score = 27.5 bits (61), Expect = 9.2
Identities = 20/91 (21%), Positives = 37/91 (40%), Gaps = 18/91 (19%)
Query: 19 KSKKRKFSEKPTKSKKTVKPDL---------DEELTSDEDDEGIGFDSQDFEEHSEVEED 69
K+R E+ + K + P+ EE T DE+DE DF S V +
Sbjct: 46 HEKERAEREEAREKKAKISPNALRGGATAGHGEEDTDDEEDEE------DFATPSAVPQW 99
Query: 70 LTEQERKVILAKKYLEEIEKVEKEKLNDSKD 100
+ ++ +K + ++ + E++ D D
Sbjct: 100 GKKARKR---QRKVIRKLLEAEEQLREDQYD 127
>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
Length = 925
Score = 28.5 bits (64), Expect = 9.2
Identities = 16/73 (21%), Positives = 32/73 (43%), Gaps = 3/73 (4%)
Query: 39 DLDEELTSDEDDEGIGFDSQDFEEHSEVEED---LTEQERKVILAKKYLEEIEKVEKEKL 95
+L +++ E++ Q E EE EQ+ K L EI++ E+E
Sbjct: 806 ELIDQIIESEEERLKALRIQREEMLMRPEELELINEEQKNLKQEIKLELSEIQEAEEEIQ 865
Query: 96 NDSKDKHVLKKLE 108
N +++K+ + +
Sbjct: 866 NINENKNEFVEFK 878
>gnl|CDD|218747 pfam05786, Cnd2, Condensin complex subunit 2. This family consists
of several Barren protein homologues from several
eukaryotic organisms. In Drosophila Barren (barr) is
required for sister-chromatid segregation in mitosis.
barr encodes a novel protein that is present in
proliferating cells and has homologues in yeast and
human. Mitotic defects in barr embryos become apparent
during cycle 16, resulting in a loss of PNS and CNS
neurons. Centromeres move apart at the
metaphase-anaphase transition and Cyclin B is degraded,
but sister chromatids remain connected, resulting in
chromatin bridging. Barren protein localises to
chromatin throughout mitosis. Colocalisation and
biochemical experiments indicate that Barren associates
with Topoisomerase II throughout mitosis and alters the
activity of Topoisomerase II. It has been suggested that
this association is required for proper chromosomal
segregation by facilitating the decatenation of
chromatids at anaphase. This family forms one of the
three non-structural maintenance of chromosomes (SMC)
subunits of the mitotic condensation complex along with
Cnd1 and Cnd3.
Length = 719
Score = 28.6 bits (64), Expect = 9.3
Identities = 18/71 (25%), Positives = 31/71 (43%), Gaps = 9/71 (12%)
Query: 39 DLDEELTSDEDDEGIGFDSQDFEEHSEVEEDLTEQERKVILAKKYLEEIEKVEKEKLNDS 98
D D + + EG + D EE S+ EED +++ K +K +EK +
Sbjct: 151 DSDNKEAPETGREGDDGEDDD-EEGSDGEEDGAKKKAKKKRQRKPESTLEK--------N 201
Query: 99 KDKHVLKKLET 109
+ +KKL+
Sbjct: 202 FEALNVKKLDL 212
>gnl|CDD|225435 COG2880, COG2880, Uncharacterized protein conserved in archaea
[Function unknown].
Length = 67
Score = 25.9 bits (57), Expect = 9.7
Identities = 10/38 (26%), Positives = 22/38 (57%)
Query: 59 DFEEHSEVEEDLTEQERKVILAKKYLEEIEKVEKEKLN 96
D +E +V+ + +E+ + K L+EI+++ +E L
Sbjct: 23 DLKEGEKVKIVIRVKEKIYEILKGSLKEIKEILEEILE 60
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 28.3 bits (64), Expect = 9.8
Identities = 18/73 (24%), Positives = 31/73 (42%), Gaps = 1/73 (1%)
Query: 4 FIRNKKPAATNNNSLKSKKRKFSEKPTKSKKTVKPDLDEELTSDEDDEGIGFDSQDFEEH 63
F+ K A + K K E+ K KK ++ +E+++ + + EE
Sbjct: 401 FLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKE-KKEEEKEEEE 459
Query: 64 SEVEEDLTEQERK 76
E EE+ E+E K
Sbjct: 460 EEAEEEKEEEEEK 472
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.314 0.130 0.377
Gapped
Lambda K H
0.267 0.0701 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 22,406,942
Number of extensions: 2126527
Number of successful extensions: 2304
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2148
Number of HSP's successfully gapped: 173
Length of query: 454
Length of database: 10,937,602
Length adjustment: 100
Effective length of query: 354
Effective length of database: 6,502,202
Effective search space: 2301779508
Effective search space used: 2301779508
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 61 (27.3 bits)