RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy14440
(711 letters)
>gnl|CDD|238121 cd00200, WD40, WD40 domain, found in a number of eukaryotic
proteins that cover a wide variety of functions
including adaptor/regulatory modules in signal
transduction, pre-mRNA processing and cytoskeleton
assembly; typically contains a GH dipeptide 11-24
residues from its N-terminus and the WD dipeptide at its
C-terminus and is 40 residues long, hence the name WD40;
between GH and WD lies a conserved core; serves as a
stable propeller-like platform to which proteins can
bind either stably or reversibly; forms a propeller-like
structure with several blades where each blade is
composed of a four-stranded anti-parallel b-sheet;
instances with few detectable copies are hypothesized to
form larger structures by dimerization; each WD40
sequence repeat forms the first three strands of one
blade and the last strand in the next blade; the last
C-terminal WD40 repeat completes the blade structure of
the first WD40 repeat to create the closed ring
propeller-structure; residues on the top and bottom
surface of the propeller are proposed to coordinate
interactions with other proteins and/or small ligands; 7
copies of the repeat are present in this alignment.
Length = 289
Score = 239 bits (612), Expect = 1e-73
Identities = 98/286 (34%), Positives = 143/286 (50%), Gaps = 11/286 (3%)
Query: 382 TLLNARQQVTCAEISEDVSILAVGFSESYIKLWSLVPQKLK--------AMKSSDQLQDI 433
TL VTC S D +LA G + IK+W L +L ++ D
Sbjct: 4 TLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADG 63
Query: 434 EMEC---DDILVRIMDERNAETTRTLVGHEGPVRKLSFSPDRSLLLSCSQDNTIRLWSLL 490
D +R+ D E RTL GH V ++FSPD +L S S+D TI++W +
Sbjct: 64 TYLASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVE 123
Query: 491 LWRCLVVYKGHGHPVWDIKFSPHGYYFASAGHDRMARLWATDSYHPLRLFVGHYSDVDCV 550
+CL +GH V + FSP G + AS+ D +LW + + GH +V+ V
Sbjct: 124 TGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSV 183
Query: 551 QFHPNSNYVATGSSDRTIRLWDCVTGSHVRLLTGHKAPVYALSFSVCGRFLASAGGDGNV 610
F P+ + + SSD TI+LWD TG + L GH+ V +++FS G LAS DG +
Sbjct: 184 AFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGSEDGTI 243
Query: 611 HMWDLSNGHLLTLLTGHTDVVRALCFSRDGNILASSGQDCSVKLWD 656
+WDL G + L+GHT+ V +L +S DG LAS D ++++WD
Sbjct: 244 RVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289
Score = 225 bits (576), Expect = 2e-68
Identities = 85/207 (41%), Positives = 116/207 (56%)
Query: 453 TRTLVGHEGPVRKLSFSPDRSLLLSCSQDNTIRLWSLLLWRCLVVYKGHGHPVWDIKFSP 512
RTL GH G V ++FSPD LL + S D TI++W L L KGH PV D+ S
Sbjct: 2 RRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASA 61
Query: 513 HGYYFASAGHDRMARLWATDSYHPLRLFVGHYSDVDCVQFHPNSNYVATGSSDRTIRLWD 572
G Y AS D+ RLW ++ +R GH S V V F P+ +++ S D+TI++WD
Sbjct: 62 DGTYLASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWD 121
Query: 573 CVTGSHVRLLTGHKAPVYALSFSVCGRFLASAGGDGNVHMWDLSNGHLLTLLTGHTDVVR 632
TG + L GH V +++FS G F+AS+ DG + +WDL G + LTGHT V
Sbjct: 122 VETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVN 181
Query: 633 ALCFSRDGNILASSGQDCSVKLWDFAK 659
++ FS DG L SS D ++KLWD +
Sbjct: 182 SVAFSPDGEKLLSSSSDGTIKLWDLST 208
Score = 170 bits (433), Expect = 5e-48
Identities = 69/168 (41%), Positives = 93/168 (55%)
Query: 494 CLVVYKGHGHPVWDIKFSPHGYYFASAGHDRMARLWATDSYHPLRLFVGHYSDVDCVQFH 553
KGH V + FSP G A+ D ++W ++ LR GH V V
Sbjct: 1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAAS 60
Query: 554 PNSNYVATGSSDRTIRLWDCVTGSHVRLLTGHKAPVYALSFSVCGRFLASAGGDGNVHMW 613
+ Y+A+GSSD+TIRLWD TG VR LTGH + V +++FS GR L+S+ D + +W
Sbjct: 61 ADGTYLASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVW 120
Query: 614 DLSNGHLLTLLTGHTDVVRALCFSRDGNILASSGQDCSVKLWDFAKLS 661
D+ G LT L GHTD V ++ FS DG +ASS QD ++KLWD
Sbjct: 121 DVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGK 168
Score = 144 bits (365), Expect = 1e-38
Identities = 60/193 (31%), Positives = 83/193 (43%), Gaps = 31/193 (16%)
Query: 380 CYTLLNARQQVTCAEISEDVSILAVGFSESYIKLWSLVPQKLKAMKSSDQLQDIEMECDD 439
TL V S D + +A + IKLW L
Sbjct: 128 LTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDL----------------------- 164
Query: 440 ILVRIMDERNAETTRTLVGHEGPVRKLSFSPDRSLLLSCSQDNTIRLWSLLLWRCLVVYK 499
R + TL GH G V ++FSPD LLS S D TI+LW L +CL +
Sbjct: 165 --------RTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLR 216
Query: 500 GHGHPVWDIKFSPHGYYFASAGHDRMARLWATDSYHPLRLFVGHYSDVDCVQFHPNSNYV 559
GH + V + FSP GY AS D R+W + ++ GH + V + + P+ +
Sbjct: 217 GHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRL 276
Query: 560 ATGSSDRTIRLWD 572
A+GS+D TIR+WD
Sbjct: 277 ASGSADGTIRIWD 289
>gnl|CDD|176269 cd08044, TAF5_NTD2, TAF5_NTD2 is the second conserved N-terminal
region of TATA Binding Protein (TBP) Associated Factor 5
(TAF5), involved in forming Transcription Factor IID
(TFIID). The TATA Binding Protein (TBP) Associated
Factor 5 (TAF5) is one of several TAFs that bind TBP and
are involved in forming Transcription Factor IID (TFIID)
complex. TAF5 contains three domains, two conserved
sequence motifs at the N-terminal and one at the
C-terminal region. TFIID is one of seven General
Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
TFIIF, and TFIID) involved in accurate initiation of
transcription by RNA polymerase II in eukaryotes. TFIID
plays an important role in the recognition of promoter
DNA and assembly of the preinitiation complex. TFIID
complex is composed of the TBP and at least 13 TAFs. In
yeast and human cells, TAFs have been found as
components of other complexes besides TFIID. TAF5 may
play a major role in forming TFIID and its related
complexes. TAFs from various species were originally
named by their predicted molecular weight or their
electrophoretic mobility in polyacrylamide gels. A new,
unified nomenclature for the pol II TAFs has been
suggested to show the relationship between TAF orthologs
and paralogs. TAF5 has a paralog gene (TAF5L) which has
a redundant function. Several hypotheses are proposed
for TAFs functions such as serving as activator-binding
sites, core-promoter recognition or a role in essential
catalytic activity. C-terminus of TAF5 contains six WD40
repeats that likely form a closed beta propeller
structure and may be involved in protein-protein
interaction. The first part of the TAF5 N-terminal
(TAF5_NTD1) homodimerizes in the absence of other TAFs.
The second conserved N-terminal part of TAF5 (TAF5_NTD2)
has an alpha-helical domain. One study has shown that
TAF5_NTD2 homodimerizes only at high concentration of
calcium but not any other metals. No dimerization was
observed in other structural studies of TAF_NTD2.
Several TAFs interact via histone-fold (HFD) motifs; HFD
is the interaction motif involved in heterodimerization
of the core histones and their assembly into nucleosome
octamer. However, TAF5 does not have a HFD motif.
Length = 133
Score = 167 bits (425), Expect = 9e-49
Identities = 61/133 (45%), Positives = 91/133 (68%)
Query: 142 PNLYEDAYQDLKKFIENSLDVYKYELGLVLYPVLVHMYIELVYNGHEHQATEMLHKHLMD 201
PN YE AY L+K+IE+SLD+YKYEL +LYP+ VH Y++LV +GH +A + D
Sbjct: 1 PNDYEQAYSKLRKWIESSLDIYKYELSQLLYPIFVHSYLDLVASGHLEEAKSFFERFSGD 60
Query: 202 HEPHYQEDLRKLSSVTKRDQIRTNEIAETFRSTEFIIRMSRDTLSLLKRHLQEKKHTVLL 261
E + ED++KLSS+T + ++ NE+A+ FRS +++IRMSRD SLL R L+ ++LL
Sbjct: 61 FEDSHSEDIKKLSSITTPEHLKENELAKLFRSNKYVIRMSRDAYSLLLRFLESWGGSLLL 120
Query: 262 NIVQEHLYFDMYE 274
I+ EH+ D+ +
Sbjct: 121 KILNEHIDIDVRD 133
>gnl|CDD|203028 pfam04494, TFIID_90kDa, WD40 associated region in TFIID subunit.
This region, possibly a domain is found in subunits of
transcription factor TFIID. The function of this region
is unknown.
Length = 141
Score = 158 bits (402), Expect = 1e-45
Identities = 62/141 (43%), Positives = 88/141 (62%), Gaps = 3/141 (2%)
Query: 132 LLSAYESEGDPNLYEDAYQDLKKFIENSLDVYKYELGLVLYPVLVHMYIELVYNGHEHQA 191
+LSA E DP YE AY L+ +IE+SLD+YK EL +LYP+ VH Y++LV GH +A
Sbjct: 1 VLSASPEESDPEQYERAYSLLRNWIEDSLDIYKPELSRLLYPLFVHSYLDLVAKGHPSEA 60
Query: 192 TEMLHKHLMDHEPHYQEDLRKLSSVTKRDQIRTNEIAETFRSTEFIIRMSRDTLSLLKRH 251
K D E + ED+ KL S++ + ++ NE+A FR ++ IR+SRD+ SLL R
Sbjct: 61 RSFFDKFHGDFEQLHGEDIEKLRSISLPEHLKENELARAFRDNKYRIRLSRDSFSLLLRF 120
Query: 252 LQEKKHTV---LLNIVQEHLY 269
LQE ++T LL I+ +HL
Sbjct: 121 LQENENTGGSLLLRILNQHLQ 141
>gnl|CDD|225201 COG2319, COG2319, FOG: WD40 repeat [General function prediction
only].
Length = 466
Score = 159 bits (402), Expect = 4e-42
Identities = 111/331 (33%), Positives = 150/331 (45%), Gaps = 23/331 (6%)
Query: 374 SLPSIACYTLLNARQQVTCAEISEDVSILAVGFSESYIKLWSLVPQKLK----AMKSSDQ 429
SLP ++ L +T S D +L G S+ IKLW L +
Sbjct: 52 SLPDLSSLLLRGHEDSITSIAFSPDGELLLSGSSDGTIKLWDLDNGEKLIKSLEGLHDSS 111
Query: 430 LQDIE------------MECDDILVRIMD-ERNAETTRTLVGHEGPVRKLSFSPDRSLLL 476
+ + D V++ D + RTL GH V L+FSPD LL
Sbjct: 112 VSKLALSSPDGNSILLASSSLDGTVKLWDLSTPGKLIRTLEGHSESVTSLAFSPDGKLLA 171
Query: 477 SCSQ-DNTIRLWSLLLWRCLVVYKGHGHPVWDIKFSPHG-YYFASAGHDRMARLWATDSY 534
S S D TI+LW L + L GH PV + FSP G AS D RLW +
Sbjct: 172 SGSSLDGTIKLWDLRTGKPLSTLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTG 231
Query: 535 HPLRLFVGHYSDVDCVQFHPNSNYVATGSSDRTIRLWDCVTG-SHVRLLTGHKAPVYALS 593
LR + +SD F P+ + +A+GSSD TIRLWD + S +R L+GH + V +++
Sbjct: 232 KLLRSTLSGHSDSVVSSFSPDGSLLASGSSDGTIRLWDLRSSSSLLRTLSGHSSSVLSVA 291
Query: 594 FSVCGRFLASAGGDGNVHMWDLSNGHLLTLLT--GHTDVVRALCFSRDGNILASSG-QDC 650
FS G+ LAS DG V +WDL G LL+ LT GH V +L FS DG++L S G D
Sbjct: 292 FSPDGKLLASGSSDGTVRLWDLETGKLLSSLTLKGHEGPVSSLSFSPDGSLLVSGGSDDG 351
Query: 651 SVKLWDFAKLSDEVGTEEVNLSHSADIKNSS 681
+++LWD E + S
Sbjct: 352 TIRLWDLRTGKPLKTLEGHSNVLSVSFSPDG 382
Score = 156 bits (395), Expect = 3e-41
Identities = 113/311 (36%), Positives = 144/311 (46%), Gaps = 23/311 (7%)
Query: 390 VTCAEISEDVSILAVG-FSESYIKLWSLVPQKLKA--MKSSDQLQDIEMECD-------- 438
VT S D +LA G + IKLW L K + +D + + D
Sbjct: 158 VTSLAFSPDGKLLASGSSLDGTIKLWDLRTGKPLSTLAGHTDPVSSLAFSPDGGLLIASG 217
Query: 439 --DILVRIMDERNAETTR-TLVGHEGPVRKLSFSPDRSLLLSCSQDNTIRLWSLL-LWRC 494
D +R+ D + R TL GH V SFSPD SLL S S D TIRLW L
Sbjct: 218 SSDGTIRLWDLSTGKLLRSTLSGHSDSV-VSSFSPDGSLLASGSSDGTIRLWDLRSSSSL 276
Query: 495 LVVYKGHGHPVWDIKFSPHGYYFASAGHDRMARLWATDSY--HPLRLFVGHYSDVDCVQF 552
L GH V + FSP G AS D RLW ++ GH V + F
Sbjct: 277 LRTLSGHSSSVLSVAFSPDGKLLASGSSDGTVRLWDLETGKLLSSLTLKGHEGPVSSLSF 336
Query: 553 HPNSNYVATG-SSDRTIRLWDCVTGSHVRLLTGHKAPVYALSFSVCGRFLASAGGDGNVH 611
P+ + + +G S D TIRLWD TG ++ L GH V ++SFS GR ++S DG V
Sbjct: 337 SPDGSLLVSGGSDDGTIRLWDLRTGKPLKTLEGHS-NVLSVSFSPDGRVVSSGSTDGTVR 395
Query: 612 MWDLSNGHLLTLLTGHTDVVRALCFSRDGNILASSGQDCSVKLWDFAKLSDEV---GTEE 668
+WDLS G LL L GHT V +L FS DG LAS D +++LWD V +
Sbjct: 396 LWDLSTGSLLRNLDGHTSRVTSLDFSPDGKSLASGSSDNTIRLWDLKTSLKSVSFSPDGK 455
Query: 669 VNLSHSADIKN 679
V S S+D+
Sbjct: 456 VLASKSSDLSV 466
Score = 140 bits (352), Expect = 2e-35
Identities = 91/286 (31%), Positives = 133/286 (46%), Gaps = 13/286 (4%)
Query: 406 FSESYIKLWSLVPQKLKAMKSSDQLQDIEMECDDILVRIMDERNAETTRTLVGHEGPVRK 465
+K L P S + + +L + ++ L GHE +
Sbjct: 11 NKSKLLKKSELGPSLNSLSLLSLGSSESGILLLALLSDSLVSLPDLSSLLLRGHEDSITS 70
Query: 466 LSFSPDRSLLLSCSQDNTIRLWSLLL-WRCLVVYKG-HGHPVWDIKF-SPHGYYFASAG- 521
++FSPD LLLS S D TI+LW L + + +G H V + SP G A
Sbjct: 71 IAFSPDGELLLSGSSDGTIKLWDLDNGEKLIKSLEGLHDSSVSKLALSSPDGNSILLASS 130
Query: 522 -HDRMARLWA-TDSYHPLRLFVGHYSDVDCVQFHPNSNYVATGSS-DRTIRLWDCVTGSH 578
D +LW + +R GH V + F P+ +A+GSS D TI+LWD TG
Sbjct: 131 SLDGTVKLWDLSTPGKLIRTLEGHSESVTSLAFSPDGKLLASGSSLDGTIKLWDLRTGKP 190
Query: 579 VRLLTGHKAPVYALSFSVCGRF-LASAGGDGNVHMWDLSNGHLL-TLLTGHTDVVRALCF 636
+ L GH PV +L+FS G +AS DG + +WDLS G LL + L+GH+D V F
Sbjct: 191 LSTLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTGKLLRSTLSGHSDSV-VSSF 249
Query: 637 SRDGNILASSGQDCSVKLWDFAKLSDEVGTEEVNLSHSADIKNSSL 682
S DG++LAS D +++LWD S + HS+ + + +
Sbjct: 250 SPDGSLLASGSSDGTIRLWDLRSSSSLL---RTLSGHSSSVLSVAF 292
Score = 97.1 bits (240), Expect = 3e-21
Identities = 78/231 (33%), Positives = 98/231 (42%), Gaps = 43/231 (18%)
Query: 381 YTLLNARQQVTCAEISEDVSILAVGFSESYIKLWSLVPQKLKAMKSSDQLQDIEMECDDI 440
TL V S D +LA G S+ ++LW L
Sbjct: 278 RTLSGHSSSVLSVAFSPDGKLLASGSSDGTVRLWDL------------------------ 313
Query: 441 LVRIMDERNAETTRTLVGHEGPVRKLSFSPDRS-LLLSCSQDNTIRLWSLLLWRCLVVYK 499
+ ++ TL GHEGPV LSFSPD S L+ S D TIRLW L + L +
Sbjct: 314 -----ETGKLLSSLTLKGHEGPVSSLSFSPDGSLLVSGGSDDGTIRLWDLRTGKPLKTLE 368
Query: 500 GHGHPVWDIKFSPHGYYFASAGHDRMARLWATDSYHPLRLFVGHYSDVDCVQFHPNSNYV 559
GH V + FSP G +S D RLW + LR GH S V + F P+ +
Sbjct: 369 GH-SNVLSVSFSPDGRVVSSGSTDGTVRLWDLSTGSLLRNLDGHTSRVTSLDFSPDGKSL 427
Query: 560 ATGSSDRTIRLWDCVTGSHVRLLTGHKAPVYALSFSVCGRFLASAGGDGNV 610
A+GSSD TIRLWD T +SFS G+ LAS D +V
Sbjct: 428 ASGSSDNTIRLWDLKTSLKS------------VSFSPDGKVLASKSSDLSV 466
>gnl|CDD|201208 pfam00400, WD40, WD domain, G-beta repeat.
Length = 39
Score = 55.4 bits (134), Expect = 3e-10
Identities = 17/37 (45%), Positives = 22/37 (59%)
Query: 536 PLRLFVGHYSDVDCVQFHPNSNYVATGSSDRTIRLWD 572
LR GH V V F P+ N +A+GS D T+R+WD
Sbjct: 3 LLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39
Score = 53.9 bits (130), Expect = 1e-09
Identities = 20/38 (52%), Positives = 26/38 (68%)
Query: 451 ETTRTLVGHEGPVRKLSFSPDRSLLLSCSQDNTIRLWS 488
+ RTL GH GPV ++FSPD +LL S S D T+R+W
Sbjct: 2 KLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39
Score = 53.9 bits (130), Expect = 1e-09
Identities = 20/39 (51%), Positives = 26/39 (66%)
Query: 618 GHLLTLLTGHTDVVRALCFSRDGNILASSGQDCSVKLWD 656
G LL L GHT V ++ FS DGN+LAS D +V++WD
Sbjct: 1 GKLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39
Score = 51.6 bits (124), Expect = 6e-09
Identities = 15/38 (39%), Positives = 18/38 (47%)
Query: 492 WRCLVVYKGHGHPVWDIKFSPHGYYFASAGHDRMARLW 529
+ L KGH PV + FSP G AS D R+W
Sbjct: 1 GKLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVW 38
Score = 50.0 bits (120), Expect = 2e-08
Identities = 18/39 (46%), Positives = 23/39 (58%)
Query: 576 GSHVRLLTGHKAPVYALSFSVCGRFLASAGGDGNVHMWD 614
G +R L GH PV +++FS G LAS DG V +WD
Sbjct: 1 GKLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39
>gnl|CDD|197651 smart00320, WD40, WD40 repeats. Note that these repeats are
permuted with respect to the structural repeats (blades)
of the beta propeller domain.
Length = 40
Score = 54.6 bits (132), Expect = 6e-10
Identities = 18/40 (45%), Positives = 23/40 (57%)
Query: 533 SYHPLRLFVGHYSDVDCVQFHPNSNYVATGSSDRTIRLWD 572
S L+ GH V V F P+ Y+A+GS D TI+LWD
Sbjct: 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
Score = 53.1 bits (128), Expect = 2e-09
Identities = 20/40 (50%), Positives = 25/40 (62%)
Query: 617 NGHLLTLLTGHTDVVRALCFSRDGNILASSGQDCSVKLWD 656
+G LL L GHT V ++ FS DG LAS D ++KLWD
Sbjct: 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
Score = 53.1 bits (128), Expect = 2e-09
Identities = 19/35 (54%), Positives = 23/35 (65%)
Query: 454 RTLVGHEGPVRKLSFSPDRSLLLSCSQDNTIRLWS 488
+TL GH GPV ++FSPD L S S D TI+LW
Sbjct: 6 KTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
Score = 51.5 bits (124), Expect = 6e-09
Identities = 16/38 (42%), Positives = 18/38 (47%)
Query: 492 WRCLVVYKGHGHPVWDIKFSPHGYYFASAGHDRMARLW 529
L KGH PV + FSP G Y AS D +LW
Sbjct: 2 GELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39
Score = 50.8 bits (122), Expect = 1e-08
Identities = 16/40 (40%), Positives = 26/40 (65%)
Query: 575 TGSHVRLLTGHKAPVYALSFSVCGRFLASAGGDGNVHMWD 614
+G ++ L GH PV +++FS G++LAS DG + +WD
Sbjct: 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
>gnl|CDD|173611 PTZ00421, PTZ00421, coronin; Provisional.
Length = 493
Score = 55.7 bits (134), Expect = 5e-08
Identities = 51/229 (22%), Positives = 92/229 (40%), Gaps = 42/229 (18%)
Query: 456 LVGHEGPVRKLSFSP-DRSLLLSCSQDNTIRLWSLLLWRCLVVYKGHGHPVWDIKFSPHG 514
L+G EGP+ ++F+P D L + S+D TI W + P +
Sbjct: 71 LLGQEGPIIDVAFNPFDPQKLFTASEDGTIMGWGI--------------PEEGLT----- 111
Query: 515 YYFASAGHDRMARLWATDSYHPLRLFVGHYSDVDCVQFHPNSNYV-ATGSSDRTIRLWDC 573
+ P+ GH V V FHP++ V A+ +D + +WD
Sbjct: 112 ----------------QNISDPIVHLQGHTKKVGIVSFHPSAMNVLASAGADMVVNVWDV 155
Query: 574 VTGSHVRLLTGHKAPVYALSFSVCGRFLASAGGDGNVHMWDLSNGHLLTLLTGH--TDVV 631
G V ++ H + +L +++ G L + D +++ D +G +++ + H
Sbjct: 156 ERGKAVEVIKCHSDQITSLEWNLDGSLLCTTSKDKKLNIIDPRDGTIVSSVEAHASAKSQ 215
Query: 632 RALCFSRDGNILA---SSGQDCSVKLWDFAKLSDEVGTEEVNLSHSADI 677
R L R I+ S Q + LWD K++ T +++ S + I
Sbjct: 216 RCLWAKRKDLIITLGCSKSQQRQIMLWDTRKMASPYSTVDLDQSSALFI 264
Score = 51.4 bits (123), Expect = 1e-06
Identities = 35/128 (27%), Positives = 61/128 (47%), Gaps = 11/128 (8%)
Query: 539 LFVGHYSDVDCVQFHP-NSNYVATGSSDRTIRLWDCV-------TGSHVRLLTGHKAPVY 590
+ +G + V F+P + + T S D TI W + L GH V
Sbjct: 70 ILLGQEGPIIDVAFNPFDPQKLFTASEDGTIMGWGIPEEGLTQNISDPIVHLQGHTKKVG 129
Query: 591 ALSF--SVCGRFLASAGGDGNVHMWDLSNGHLLTLLTGHTDVVRALCFSRDGNILASSGQ 648
+SF S LASAG D V++WD+ G + ++ H+D + +L ++ DG++L ++ +
Sbjct: 130 IVSFHPSAMN-VLASAGADMVVNVWDVERGKAVEVIKCHSDQITSLEWNLDGSLLCTTSK 188
Query: 649 DCSVKLWD 656
D + + D
Sbjct: 189 DKKLNIID 196
Score = 32.9 bits (75), Expect = 0.56
Identities = 31/112 (27%), Positives = 50/112 (44%), Gaps = 18/112 (16%)
Query: 581 LLTGHKAPVYALSFS-VCGRFLASAGGDGNVHMWDLSNGHL-------LTLLTGHTDVVR 632
+L G + P+ ++F+ + L +A DG + W + L + L GHT V
Sbjct: 70 ILLGQEGPIIDVAFNPFDPQKLFTASEDGTIMGWGIPEEGLTQNISDPIVHLQGHTKKVG 129
Query: 633 ALCFSRDG-NILASSGQDCSVKLWDFAK---------LSDEVGTEEVNLSHS 674
+ F N+LAS+G D V +WD + SD++ + E NL S
Sbjct: 130 IVSFHPSAMNVLASAGADMVVNVWDVERGKAVEVIKCHSDQITSLEWNLDGS 181
>gnl|CDD|197891 smart00818, Amelogenin, Amelogenins, cell adhesion proteins, play a
role in the biomineralisation of teeth. They seem to
regulate formation of crystallites during the secretory
stage of tooth enamel development and are thought to
play a major role in the structural organisation and
mineralisation of developing enamel. The extracellular
matrix of the developing enamel comprises two major
classes of protein: the hydrophobic amelogenins and the
acidic enamelins. Circular dichroism studies of porcine
amelogenin have shown that the protein consists of 3
discrete folding units: the N-terminal region appears to
contain beta-strand structures, while the C-terminal
region displays characteristics of a random coil
conformation. Subsequent studies on the bovine protein
have indicated the amelogenin structure to contain a
repetitive beta-turn segment and a "beta-spiral" between
Gln112 and Leu138, which sequester a (Pro, Leu, Gln)
rich region. The beta-spiral offers a probable site for
interactions with Ca2+ ions. Muatations in the human
amelogenin gene (AMGX) cause X-linked hypoplastic
amelogenesis imperfecta, a disease characterised by
defective enamel. A 9bp deletion in exon 2 of AMGX
results in the loss of codons for Ile5, Leu6, Phe7 and
Ala8, and replacement by a new threonine codon,
disrupting the 16-residue (Met1-Ala16) amelogenin signal
peptide.
Length = 165
Score = 50.6 bits (121), Expect = 3e-07
Identities = 29/82 (35%), Positives = 35/82 (42%), Gaps = 11/82 (13%)
Query: 1 MDQLPSLMDDPPQESFHSVPII--QQTLSPQQP--PVSPQHQNMMSPIQQH--SLSPHTQ 54
Q P P H +P++ QQ + PQQP PV QH M+P Q H +L Q
Sbjct: 43 SQQHPPTHTLQPH---HHIPVLPAQQPVVPQQPLMPVPGQHS--MTPTQHHQPNLPQPAQ 97
Query: 55 APSPQPPAQPQQVPSPYHTPPP 76
P P QP Q P PP
Sbjct: 98 QPFQPQPLQPPQPQQPMQPQPP 119
Score = 37.5 bits (87), Expect = 0.007
Identities = 17/62 (27%), Positives = 19/62 (30%), Gaps = 4/62 (6%)
Query: 7 LMDDPPQESF----HSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPA 62
LM P Q S H P + Q P Q P+Q PQPP
Sbjct: 73 LMPVPGQHSMTPTQHHQPNLPQPAQQPFQPQPLQPPQPQQPMQPQPPVHPIPPLPPQPPL 132
Query: 63 QP 64
P
Sbjct: 133 PP 134
Score = 31.3 bits (71), Expect = 0.84
Identities = 18/64 (28%), Positives = 27/64 (42%), Gaps = 5/64 (7%)
Query: 19 VPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQ 78
+P+ QQ P + P H + P Q + P Q P P Q P+ +H P Q
Sbjct: 40 IPVSQQH--PPTHTLQPHHHIPVLP-AQQPVVP--QQPLMPVPGQHSMTPTQHHQPNLPQ 94
Query: 79 TSKQ 82
++Q
Sbjct: 95 PAQQ 98
Score = 29.4 bits (66), Expect = 3.6
Identities = 17/50 (34%), Positives = 20/50 (40%), Gaps = 1/50 (2%)
Query: 29 QQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQ 78
Q PVS QH +Q H P A P P QP H+ P+Q
Sbjct: 38 QIIPVSQQHPPTH-TLQPHHHIPVLPAQQPVVPQQPLMPVPGQHSMTPTQ 86
>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 614
Score = 50.2 bits (120), Expect = 3e-06
Identities = 43/223 (19%), Positives = 65/223 (29%), Gaps = 31/223 (13%)
Query: 1 MDQLPSLMDD--PPQE-SFHSVPIIQQTLSPQQPPVSP---QHQNMMSPIQQHSLSPHTQ 54
+ QL DD + H P+ Q + QP + + S Q S
Sbjct: 359 LAQLTQKGDDASGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSAT 418
Query: 55 APSPQPPAQPQQVPSPYHTPPPSQTSKQSGKAPDKNT------LITMLKLLRQYNLRGTE 108
P+ PP P+ PPS + A K ++ L ++
Sbjct: 419 QPAGTPPTVSVDPPAAVPVNPPSTAPQAVRPAQFKEEKKIPVSKVSSLGPSTLRPIQEKA 478
Query: 109 EQLKKEINVNDLGIDPNDSEIGDLLSAYESEGDPNLYEDAYQDLKKFIENSLDVYKYELG 168
EQ I G EI E D Y + + E +L K +
Sbjct: 479 EQATGNIKEAPTGTQK---EIFT-------EEDLQYYWQEFAGTRPQEEKAL---KETMI 525
Query: 169 LVLYPVLVHMYIELVYNGHEHQATEMLHKHLMDHEPHYQEDLR 211
+L E+ + +E Q K L + P LR
Sbjct: 526 NCRPKLLNGTTFEVAVD-NELQE-----KELTNLIPDLLGFLR 562
>gnl|CDD|219339 pfam07223, DUF1421, Protein of unknown function (DUF1421). This
family represents a conserved region approximately 350
residues long within a number of plant proteins of
unknown function.
Length = 357
Score = 49.2 bits (117), Expect = 5e-06
Identities = 23/87 (26%), Positives = 27/87 (31%), Gaps = 7/87 (8%)
Query: 1 MDQLPSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQP 60
Q P P Q V + Q +PQQ P P P P Q + QP
Sbjct: 89 QSQAPPSHQYPSQLPPQQVQSVPQQPTPQQEPYYPPPSQPQPP-------PAQQPQAQQP 141
Query: 61 PAQPQQVPSPYHTPPPSQTSKQSGKAP 87
PQ + PP Q Q P
Sbjct: 142 QPPPQVPQQQQYQSPPQQPQYQQNPPP 168
Score = 46.1 bits (109), Expect = 4e-05
Identities = 24/78 (30%), Positives = 29/78 (37%), Gaps = 4/78 (5%)
Query: 11 PPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVP-S 69
PP + Q PQ PP PQ Q SP QQ Q P PQ + PQ
Sbjct: 124 PPSQPQPPPAQQPQAQQPQPPPQVPQQQQYQSPPQQPQYQ---QNPPPQAQSAPQVSGLY 180
Query: 70 PYHTPPPSQTSKQSGKAP 87
P +P Q+ + P
Sbjct: 181 PEESPYQPQSYPPNEPLP 198
Score = 41.5 bits (97), Expect = 0.001
Identities = 21/73 (28%), Positives = 24/73 (32%), Gaps = 5/73 (6%)
Query: 17 HSVPIIQQTLSPQQPPVSPQHQNMMSPI--QQHSLSPHTQAPSPQPPAQPQQVPSPYHTP 74
V Q P Q Q +P Q S P Q S PQQ P + P
Sbjct: 67 QQVNAALPPAPAPQSPQPDQQQQSQAPPSHQYPSQLPPQQVQSVPQQPTPQQEP---YYP 123
Query: 75 PPSQTSKQSGKAP 87
PPSQ + P
Sbjct: 124 PPSQPQPPPAQQP 136
Score = 38.8 bits (90), Expect = 0.008
Identities = 20/69 (28%), Positives = 27/69 (39%), Gaps = 3/69 (4%)
Query: 11 PPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSP-QPPAQPQQVPS 69
P Q+ + S P QQ Q PP Q +S + Q+ P +P +
Sbjct: 148 PQQQQYQSPP--QQPQYQQNPPPQAQSAPQVSGLYPEESPYQPQSYPPNEPLPSSMAMQP 205
Query: 70 PYHTPPPSQ 78
PY PPSQ
Sbjct: 206 PYSGAPPSQ 214
Score = 36.8 bits (85), Expect = 0.032
Identities = 23/83 (27%), Positives = 32/83 (38%), Gaps = 6/83 (7%)
Query: 11 PPQESFHSVPIIQQ--TLSPQ--QPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQP-- 64
PP + S QQ + +P Q P Q + S QQ + P P P P
Sbjct: 74 PPAPAPQSPQPDQQQQSQAPPSHQYPSQLPPQQVQSVPQQPTPQQEPYYPPPSQPQPPPA 133
Query: 65 QQVPSPYHTPPPSQTSKQSGKAP 87
QQ + PPP +Q ++P
Sbjct: 134 QQPQAQQPQPPPQVPQQQQYQSP 156
Score = 35.7 bits (82), Expect = 0.075
Identities = 15/76 (19%), Positives = 22/76 (28%), Gaps = 1/76 (1%)
Query: 11 PPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSP 70
Q ++ PP +P Q+ QQ S +P + Q P Q Q
Sbjct: 54 EQVAKHELADAPLQQVNAALPP-APAPQSPQPDQQQQSQAPPSHQYPSQLPPQQVQSVPQ 112
Query: 71 YHTPPPSQTSKQSGKA 86
TP +
Sbjct: 113 QPTPQQEPYYPPPSQP 128
Score = 34.1 bits (78), Expect = 0.26
Identities = 8/79 (10%), Positives = 17/79 (21%), Gaps = 3/79 (3%)
Query: 10 DPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPS 69
D ++ + + L+ + P Q Q P Q
Sbjct: 45 DSTKQPPAPEQVAKHELADAPLQQVNA---ALPPAPAPQSPQPDQQQQSQAPPSHQYPSQ 101
Query: 70 PYHTPPPSQTSKQSGKAPD 88
S + + +
Sbjct: 102 LPPQQVQSVPQQPTPQQEP 120
Score = 33.0 bits (75), Expect = 0.58
Identities = 23/69 (33%), Positives = 29/69 (42%), Gaps = 9/69 (13%)
Query: 23 QQTLSPQQPPVSPQ---HQNMMSPIQQHSLS-PHTQAP-SPQPPAQPQQVPSPYHT---- 73
S +QPP Q H+ +P+QQ + + P AP SPQP Q Q P H
Sbjct: 42 FHVDSTKQPPAPEQVAKHELADAPLQQVNAALPPAPAPQSPQPDQQQQSQAPPSHQYPSQ 101
Query: 74 PPPSQTSKQ 82
PP Q
Sbjct: 102 LPPQQVQSV 110
>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1. Members
of this family are necessary for accurate chromosome
transmission during cell division.
Length = 804
Score = 48.6 bits (116), Expect = 9e-06
Identities = 25/79 (31%), Positives = 27/79 (34%)
Query: 1 MDQLPSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQP 60
Q P + Q Q P PP PQ + Q LS P PQP
Sbjct: 203 YPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQMPPPPPQP 262
Query: 61 PAQPQQVPSPYHTPPPSQT 79
P Q QQ P P PPP
Sbjct: 263 PQQQQQPPQPQAQPPPQNQ 281
Score = 47.5 bits (113), Expect = 2e-05
Identities = 25/88 (28%), Positives = 27/88 (30%), Gaps = 3/88 (3%)
Query: 1 MDQLPSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPS-PQ 59
+ P S PQQPP Q Q Q P Q P Q
Sbjct: 210 HPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQM--PPPPPQPPQQQQ 267
Query: 60 PPAQPQQVPSPYHTPPPSQTSKQSGKAP 87
P QPQ P P + P P Q AP
Sbjct: 268 QPPQPQAQPPPQNQPTPHPGLPQGQNAP 295
Score = 47.1 bits (112), Expect = 3e-05
Identities = 14/74 (18%), Positives = 15/74 (20%)
Query: 11 PPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSP 70
+ PP P Q P Q P Q Q Q P P
Sbjct: 238 QQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQPTPHPGLPQGQNAPLP 297
Query: 71 YHTPPPSQTSKQSG 84
P Q
Sbjct: 298 PPQQPQLLPLVQQP 311
Score = 42.8 bits (101), Expect = 6e-04
Identities = 16/72 (22%), Positives = 20/72 (27%), Gaps = 1/72 (1%)
Query: 11 PPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSP 70
PQ Q P+QPP PQ Q + +P P P Q+P
Sbjct: 180 LPQGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQ 239
Query: 71 YHTPPPSQTSKQ 82
P
Sbjct: 240 -PPPLQQPQFPG 250
Score = 42.4 bits (100), Expect = 8e-04
Identities = 19/69 (27%), Positives = 23/69 (33%), Gaps = 4/69 (5%)
Query: 23 QQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHT----PPPSQ 78
PQ P Q Q + Q P PQ P QQ P + PPP Q
Sbjct: 202 GYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQMPPPPPQ 261
Query: 79 TSKQSGKAP 87
+Q + P
Sbjct: 262 PPQQQQQPP 270
Score = 42.1 bits (99), Expect = 0.001
Identities = 22/71 (30%), Positives = 24/71 (33%), Gaps = 7/71 (9%)
Query: 11 PPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPH---TQAPSPQPPAQPQQV 67
P S P Q PQQ PQ Q P Q + P Q PP QPQ +
Sbjct: 248 FPGLSQQMPPPPPQP--PQQQQQPPQPQAQPPPQNQPTPHPGLPQGQNAPLPPPQQPQLL 305
Query: 68 PSPYHTPPPSQ 78
P P Q
Sbjct: 306 PL--VQQPQGQ 314
Score = 41.7 bits (98), Expect = 0.001
Identities = 20/78 (25%), Positives = 23/78 (29%), Gaps = 3/78 (3%)
Query: 3 QLPSLMDDPPQESFHSVPIIQQTLSPQ--QPPVSPQHQNMMSPIQQHSLSPHTQAPSPQP 60
Q P PPQ P Q T P Q +P + P Q PQ
Sbjct: 261 QPPQQQQQPPQPQAQPPPQNQPTPHPGLPQGQNAPLPPPQQPQLLPLVQQPQGQQRGPQF 320
Query: 61 PAQPQQVPSPYHTPPPSQ 78
Q Q+ S SQ
Sbjct: 321 REQLVQL-SQQQREALSQ 337
Score = 39.8 bits (93), Expect = 0.006
Identities = 20/88 (22%), Positives = 23/88 (26%), Gaps = 5/88 (5%)
Query: 3 QLPSLMDDPPQESFHSVP---IIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQ 59
Q PQ +P +Q PQQ P P Q P
Sbjct: 164 QQRQQAPQLPQPPQQVLPQGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFL-PA 222
Query: 60 PPAQPQQVPSPYHTPPPSQTSKQSGKAP 87
P P Q P P P Q + P
Sbjct: 223 PSQAPAQPPLP-PQLPQQPPPLQQPQFP 249
Score = 38.6 bits (90), Expect = 0.013
Identities = 15/72 (20%), Positives = 18/72 (25%), Gaps = 3/72 (4%)
Query: 11 PPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSP 70
PP Q QPP PQ+Q P + P QP P
Sbjct: 256 PPPPPQPPQQQQQPPQPQAQPP--PQNQPTPHPGLPQGQNA-PLPPPQQPQLLPLVQQPQ 312
Query: 71 YHTPPPSQTSKQ 82
P +
Sbjct: 313 GQQRGPQFREQL 324
Score = 35.5 bits (82), Expect = 0.10
Identities = 22/83 (26%), Positives = 26/83 (31%), Gaps = 7/83 (8%)
Query: 5 PSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQP 64
PSL P+ QP Q M+S + + Q PQ P P
Sbjct: 123 PSLWGTAPKPEPQPPQA-----PESQPQPQTPAQKMLSLEEVEAQLQQRQQA-PQLPQPP 176
Query: 65 QQVPSPYHTPPPSQTSKQSGKAP 87
QQV P PP Q G
Sbjct: 177 QQVL-PQGMPPRQAAFPQQGPPE 198
Score = 34.7 bits (80), Expect = 0.18
Identities = 18/84 (21%), Positives = 23/84 (27%), Gaps = 6/84 (7%)
Query: 6 SLMDDPPQESFHSVPIIQQTLSPQQPPVSPQ----HQNMMS--PIQQHSLSPHTQAPSPQ 59
+ D P S + PQ P PQ Q M+S ++ PQ
Sbjct: 115 YIPDLQPDPSLWGTAPKPEPQPPQAPESQPQPQTPAQKMLSLEEVEAQLQQRQQAPQLPQ 174
Query: 60 PPAQPQQVPSPYHTPPPSQTSKQS 83
PP Q P Q
Sbjct: 175 PPQQVLPQGMPPRQAAFPQQGPPE 198
Score = 34.7 bits (80), Expect = 0.19
Identities = 21/100 (21%), Positives = 28/100 (28%), Gaps = 16/100 (16%)
Query: 4 LPSLMDDPPQESFHSVP--------------IIQQTLSPQQPPVSPQHQNMMSPIQ--QH 47
+ P ++ S P + Q QQ P PQ + P
Sbjct: 128 TAPKPEPQPPQAPESQPQPQTPAQKMLSLEEVEAQLQQRQQAPQLPQPPQQVLPQGMPPR 187
Query: 48 SLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQSGKAP 87
+ Q P QPP PQ P Q +AP
Sbjct: 188 QAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAP 227
>gnl|CDD|177776 PLN00181, PLN00181, protein SPA1-RELATED; Provisional.
Length = 793
Score = 48.5 bits (115), Expect = 1e-05
Identities = 35/102 (34%), Positives = 53/102 (51%), Gaps = 4/102 (3%)
Query: 559 VATGSSDRTIRLWDCVTGSHVRLLTGHKAPVYALSF-SVCGRFLASAGGDGNVHMWDLSN 617
+A+GS D +++LW G + + KA + + F S GR LA D V+ +DL N
Sbjct: 591 LASGSDDGSVKLWSINQGVSIGTIKT-KANICCVQFPSESGRSLAFGSADHKVYYYDLRN 649
Query: 618 GHL-LTLLTGHTDVVRALCFSRDGNILASSGQDCSVKLWDFA 658
L L + GH+ V + F D + L SS D ++KLWD +
Sbjct: 650 PKLPLCTMIGHSKTVSYVRFV-DSSTLVSSSTDNTLKLWDLS 690
Score = 38.5 bits (89), Expect = 0.011
Identities = 44/188 (23%), Positives = 84/188 (44%), Gaps = 19/188 (10%)
Query: 439 DILVRIMDERNAETTRTLVGHEGPVRKLSFSP-DRSLLLSCSQDNTIRLWSLLLWRCLVV 497
+ +V++ D ++ + HE V + +S D +LL S S D +++LWS+ +
Sbjct: 554 EGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSADPTLLASGSDDGSVKLWSINQGVSIGT 613
Query: 498 YKGHGHPVWDIKF---SPHGYYFASAGHDRMARLWATDSYHP---LRLFVGHYSDVDCVQ 551
K + + ++F S F SA H +++ D +P L +GH V V+
Sbjct: 614 IKTKAN-ICCVQFPSESGRSLAFGSADH----KVYYYDLRNPKLPLCTMIGHSKTVSYVR 668
Query: 552 FHPNSNYVATGSSDRTIRLWDC------VTGSHVRLLTGHKAPVYALSFSVCGRFLASAG 605
F +S V++ S+D T++LWD + + + GH + SV ++A+
Sbjct: 669 FVDSSTLVSS-STDNTLKLWDLSMSISGINETPLHSFMGHTNVKNFVGLSVSDGYIATGS 727
Query: 606 GDGNVHMW 613
V ++
Sbjct: 728 ETNEVFVY 735
Score = 36.6 bits (84), Expect = 0.047
Identities = 34/106 (32%), Positives = 49/106 (46%), Gaps = 13/106 (12%)
Query: 396 SEDVSILAVGFSESYIKLWSLVPQ------KLKA-----MKSSDQLQDIEMECDDILVRI 444
S D ++LA G + +KLWS+ K KA S+ + + D V
Sbjct: 585 SADPTLLASGSDDGSVKLWSINQGVSIGTIKTKANICCVQFPSESGRSLAFGSADHKVYY 644
Query: 445 MDERNAETTR-TLVGHEGPVRKLSFSPDRSLLLSCSQDNTIRLWSL 489
D RN + T++GH V + F D S L+S S DNT++LW L
Sbjct: 645 YDLRNPKLPLCTMIGHSKTVSYVRFV-DSSTLVSSSTDNTLKLWDL 689
Score = 36.2 bits (83), Expect = 0.065
Identities = 21/56 (37%), Positives = 33/56 (58%), Gaps = 1/56 (1%)
Query: 601 LASAGGDGNVHMWDLSNGHLLTLLTGHTDVVRALCFSR-DGNILASSGQDCSVKLW 655
+AS+ +G V +WD++ L+T + H V ++ +S D +LAS D SVKLW
Sbjct: 548 VASSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSADPTLLASGSDDGSVKLW 603
Score = 34.3 bits (78), Expect = 0.23
Identities = 32/159 (20%), Positives = 65/159 (40%), Gaps = 9/159 (5%)
Query: 508 IKFSPHGYYFASAGHDRMARLWATDS--------YHPLRLFVGHYSDVDCVQFHPNSNYV 559
I F G +FA+AG ++ +++ +S ++P+ + V
Sbjct: 489 IGFDRDGEFFATAGVNKKIKIFECESIIKDGRDIHYPVVELASRSKLSGICWNSYIKSQV 548
Query: 560 ATGSSDRTIRLWDCVTGSHVRLLTGHKAPVYALSFSVCG-RFLASAGGDGNVHMWDLSNG 618
A+ + + +++WD V + H+ V+++ +S LAS DG+V +W ++ G
Sbjct: 549 ASSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSADPTLLASGSDDGSVKLWSINQG 608
Query: 619 HLLTLLTGHTDVVRALCFSRDGNILASSGQDCSVKLWDF 657
+ + ++ S G LA D V +D
Sbjct: 609 VSIGTIKTKANICCVQFPSESGRSLAFGSADHKVYYYDL 647
>gnl|CDD|218191 pfam04652, DUF605, Vta1 like. Vta1 (VPS20-associated protein 1) is
a positive regulator of Vps4. Vps4 is an ATPase that is
required in the multivesicular body (MVB) sorting
pathway to dissociate the endosomal sorting complex
required for transport (ESCRT). Vta1 promotes correct
assembly of Vps4 and stimulates its ATPase activity
through its conserved Vta1/SBP1/LIP5 region.
Length = 315
Score = 43.1 bits (102), Expect = 3e-04
Identities = 17/87 (19%), Positives = 25/87 (28%)
Query: 1 MDQLPSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQP 60
D+ + SF + SP PP S + SP + P P
Sbjct: 157 EDEDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAP 216
Query: 61 PAQPQQVPSPYHTPPPSQTSKQSGKAP 87
+ P P P + + AP
Sbjct: 217 SSFQSDTPPPSPESPTNPSPPPGPAAP 243
Score = 35.0 bits (81), Expect = 0.098
Identities = 20/99 (20%), Positives = 26/99 (26%), Gaps = 8/99 (8%)
Query: 5 PSLMDDPPQESFHSVPIIQQTL-------SPQQPPVSPQHQNMMSPIQQHSLSPHTQAPS 57
S P S VP PP Q+ P S + + P
Sbjct: 180 ASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNPSPPPG 239
Query: 58 PQPPAQPQQVPSPYH-TPPPSQTSKQSGKAPDKNTLITM 95
P P P P T P+ S + AP +
Sbjct: 240 PAAPPPPPVQQVPPLSTAKPTPPSASATPAPIGGITLDD 278
Score = 32.7 bits (75), Expect = 0.52
Identities = 22/82 (26%), Positives = 31/82 (37%), Gaps = 6/82 (7%)
Query: 1 MDQLPSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQP 60
+ PS +DP S S+P + PP SP+ SP P AP P P
Sbjct: 194 VPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNPSPP------PGPAAPPPPP 247
Query: 61 PAQPQQVPSPYHTPPPSQTSKQ 82
Q + + TPP + +
Sbjct: 248 VQQVPPLSTAKPTPPSASATPA 269
Score = 30.0 bits (68), Expect = 4.0
Identities = 10/78 (12%), Positives = 16/78 (20%), Gaps = 1/78 (1%)
Query: 5 PSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQP 64
S + P P P P+QQ + P+P +
Sbjct: 209 DSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAK-PTPPSASAT 267
Query: 65 QQVPSPYHTPPPSQTSKQ 82
+ Q
Sbjct: 268 PAPIGGITLDDDAIAKAQ 285
Score = 29.3 bits (66), Expect = 7.0
Identities = 10/79 (12%), Positives = 13/79 (16%), Gaps = 5/79 (6%)
Query: 5 PSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQP 64
PS PP S + P P P P P
Sbjct: 207 PSDSSLPPAPSSFQSDTPPPSPESPTNPSPPP-----GPAAPPPPPVQQVPPLSTAKPTP 261
Query: 65 QQVPSPYHTPPPSQTSKQS 83
+ +
Sbjct: 262 PSASATPAPIGGITLDDDA 280
>gnl|CDD|237191 PRK12757, PRK12757, cell division protein FtsN; Provisional.
Length = 256
Score = 42.0 bits (99), Expect = 6e-04
Identities = 15/72 (20%), Positives = 25/72 (34%), Gaps = 2/72 (2%)
Query: 10 DPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPS 69
+ + S IQQ QQPP + ++P +Q + Q P+P V
Sbjct: 109 EQTPQVPRSTVQIQQQAQQQQPPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPAAPVTQ 168
Query: 70 PYHTPPPSQTSK 81
P ++
Sbjct: 169 A--VEAPKVEAE 178
Score = 40.0 bits (94), Expect = 0.002
Identities = 22/93 (23%), Positives = 34/93 (36%), Gaps = 6/93 (6%)
Query: 1 MDQLPSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSL--SPHTQAPSP 58
++Q+ + M P VP +QT + V Q Q + P T
Sbjct: 88 LEQMQADMRQQPT-QLSEVPYNEQTPQVPRSTVQIQQQAQQQQPPATTAQPQPVTPPRQT 146
Query: 59 QPPAQPQQVPSPYHTPP--PSQTSKQSGKAPDK 89
P QPQ P+P T P P + ++ K +
Sbjct: 147 TAPVQPQT-PAPVRTQPAAPVTQAVEAPKVEAE 178
>gnl|CDD|240412 PTZ00420, PTZ00420, coronin; Provisional.
Length = 568
Score = 42.2 bits (99), Expect = 8e-04
Identities = 26/84 (30%), Positives = 39/84 (46%), Gaps = 10/84 (11%)
Query: 456 LVGHEGPVRKLSFSPDRS-LLLSCSQDNTIRLWSLL--------LWRCLVVYKGHGHPVW 506
L GH + L F+P S +L S S+D TIR+W + + + KGH +
Sbjct: 70 LKGHTSSILDLQFNPCFSEILASGSEDLTIRVWEIPHNDESVKEIKDPQCILKGHKKKIS 129
Query: 507 DIKFSPHGYY-FASAGHDRMARLW 529
I ++P YY S+G D +W
Sbjct: 130 IIDWNPMNYYIMCSSGFDSFVNIW 153
Score = 40.7 bits (95), Expect = 0.002
Identities = 27/86 (31%), Positives = 46/86 (53%), Gaps = 10/86 (11%)
Query: 542 GHYSDVDCVQFHP-NSNYVATGSSDRTIRLWDCV-TGSHVR-------LLTGHKAPVYAL 592
GH S + +QF+P S +A+GS D TIR+W+ V+ +L GHK + +
Sbjct: 72 GHTSSILDLQFNPCFSEILASGSEDLTIRVWEIPHNDESVKEIKDPQCILKGHKKKISII 131
Query: 593 SFSVCGRF-LASAGGDGNVHMWDLSN 617
++ + + S+G D V++WD+ N
Sbjct: 132 DWNPMNYYIMCSSGFDSFVNIWDIEN 157
Score = 37.6 bits (87), Expect = 0.021
Identities = 26/99 (26%), Positives = 44/99 (44%), Gaps = 10/99 (10%)
Query: 568 IRLWDCVTGSHVRLLTGHKAPVYALSFSVC-GRFLASAGGDGNVHMWDLSNGHL------ 620
IRL + + V L GH + + L F+ C LAS D + +W++ +
Sbjct: 56 IRLENQMRKPPVIKLKGHTSSILDLQFNPCFSEILASGSEDLTIRVWEIPHNDESVKEIK 115
Query: 621 --LTLLTGHTDVVRALCFS-RDGNILASSGQDCSVKLWD 656
+L GH + + ++ + I+ SSG D V +WD
Sbjct: 116 DPQCILKGHKKKISIIDWNPMNYYIMCSSGFDSFVNIWD 154
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 42.2 bits (99), Expect = 0.001
Identities = 17/78 (21%), Positives = 23/78 (29%), Gaps = 6/78 (7%)
Query: 11 PPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSP 70
S S + P +P P P P PP +PQ +P
Sbjct: 2890 AVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPP----PPQPQPPPPPPPRPQPPLAP 2945
Query: 71 YHTPPPSQTSKQSGKAPD 88
T P+ + SG P
Sbjct: 2946 --TTDPAGAGEPSGAVPQ 2961
Score = 41.5 bits (97), Expect = 0.002
Identities = 21/74 (28%), Positives = 24/74 (32%), Gaps = 1/74 (1%)
Query: 11 PPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSP 70
ESF P + Q P PQ Q P Q P P PQPP P P+
Sbjct: 2893 RSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPP-PRPQPPLAPTTDPAG 2951
Query: 71 YHTPPPSQTSKQSG 84
P + G
Sbjct: 2952 AGEPSGAVPQPWLG 2965
Score = 39.9 bits (93), Expect = 0.006
Identities = 22/63 (34%), Positives = 26/63 (41%), Gaps = 2/63 (3%)
Query: 30 QPPVSPQHQNMMSPIQQHSLSPHTQAPS-PQPPAQPQQVPSPYHTPPPSQTSKQSGKAPD 88
+P VS ++ P Q P QAP PQP QP P P PPP Q AP
Sbjct: 2888 RPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQP-QPPPPPPPRPQPPLAPT 2946
Query: 89 KNT 91
+
Sbjct: 2947 TDP 2949
Score = 33.8 bits (77), Expect = 0.44
Identities = 13/61 (21%), Positives = 16/61 (26%), Gaps = 4/61 (6%)
Query: 27 SPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQSGKA 86
+P +PPV + P S P Q P P P Q
Sbjct: 2877 APARPPVR----RLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPP 2932
Query: 87 P 87
P
Sbjct: 2933 P 2933
Score = 33.0 bits (75), Expect = 0.74
Identities = 16/67 (23%), Positives = 22/67 (32%), Gaps = 3/67 (4%)
Query: 11 PPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHT-QAPSPQPPAQPQQVPS 69
PP SVP + P +P V+ + + +P Q P P P
Sbjct: 2560 PPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSA--RPRAPVDDRGDPRGPAPPSPL 2617
Query: 70 PYHTPPP 76
P T P
Sbjct: 2618 PPDTHAP 2624
Score = 32.2 bits (73), Expect = 1.3
Identities = 20/75 (26%), Positives = 25/75 (33%), Gaps = 8/75 (10%)
Query: 9 DDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVP 68
D PPQ + P+ + P SP P H+ P PSP P A
Sbjct: 2590 DAPPQSARPRAPVDDRGDPRGPAPPSP------LPPDTHAPDP--PPPSPSPAANEPDPH 2641
Query: 69 SPYHTPPPSQTSKQS 83
P PPP +
Sbjct: 2642 PPPTVPPPERPRDDP 2656
Score = 31.8 bits (72), Expect = 1.8
Identities = 23/85 (27%), Positives = 31/85 (36%), Gaps = 17/85 (20%)
Query: 9 DDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLS----------PHTQAPSP 58
D PP S P + P +P P H M++ I L P P+
Sbjct: 2507 DAPPAPS-RLAP----AILPDEPVGEPVHPRMLTWI--RGLEELASDDAGDPPPPLPPAA 2559
Query: 59 QPPAQPQQVPSPYHTPPPSQTSKQS 83
P A + VP P P PS+ + S
Sbjct: 2560 PPAAPDRSVPPPRPAPRPSEPAVTS 2584
Score = 29.5 bits (66), Expect = 7.7
Identities = 11/38 (28%), Positives = 12/38 (31%)
Query: 50 SPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQSGKAP 87
AP P P P P PPP T S +
Sbjct: 404 QTRPAAPVPASVPTPAPTPVPASAPPPPATPLPSAEPG 441
>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
Length = 1355
Score = 40.8 bits (95), Expect = 0.003
Identities = 29/104 (27%), Positives = 38/104 (36%), Gaps = 3/104 (2%)
Query: 3 QLPSLMDDPPQESFHSVPIIQQTLSPQQPPV-SPQHQNMMSPIQQHSLSPHTQAPSPQPP 61
Q P Q+ V Q PQQP PQ+Q P+ Q P P
Sbjct: 756 QQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPV-APQ 814
Query: 62 AQPQQVPSPYHTPPPSQTSKQSGKAPDKNTLITMLKLLRQYNLR 105
Q QQ P P Q +Q ++TL+ L L+R + R
Sbjct: 815 PQYQQPQQPVAPQPQYQQPQQPVAPQPQDTLLHPL-LMRNGDSR 857
Score = 40.5 bits (94), Expect = 0.004
Identities = 22/84 (26%), Positives = 30/84 (35%), Gaps = 5/84 (5%)
Query: 4 LPSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQ 63
+ +L+DD P E P+ + P Q P P Q ++P Q PQ P
Sbjct: 732 MKALLDDGPHE-----PLFTPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVA 786
Query: 64 PQQVPSPYHTPPPSQTSKQSGKAP 87
PQ P Q Q + P
Sbjct: 787 PQPQYQQPQQPVAPQPQYQQPQQP 810
Score = 30.1 bits (67), Expect = 5.3
Identities = 15/61 (24%), Positives = 22/61 (36%)
Query: 27 SPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQSGKA 86
+P+ P Q+ + P PA Q PY+ P P Q ++Q A
Sbjct: 376 APEGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYA 435
Query: 87 P 87
P
Sbjct: 436 P 436
Score = 29.7 bits (66), Expect = 8.0
Identities = 17/81 (20%), Positives = 23/81 (28%), Gaps = 12/81 (14%)
Query: 11 PPQESFHSVPIIQQTLSPQ----QPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQ 66
P E + Q Q PV PQ +Q + P+ QP QP
Sbjct: 375 PAPEGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYY 434
Query: 67 VPSP--------YHTPPPSQT 79
P+P + T
Sbjct: 435 APAPEQPVAGNAWQAEEQQST 455
>gnl|CDD|149648 pfam08662, eIF2A, Eukaryotic translation initiation factor eIF2A.
This is a family of eukaryotic translation initiation
factors.
Length = 194
Score = 38.8 bits (91), Expect = 0.004
Identities = 25/140 (17%), Positives = 47/140 (33%), Gaps = 16/140 (11%)
Query: 445 MDERNAETTRTLVGHEGPVRKLSFSPDRSLLLSC--SQDNTIRLWSLLLWRCLVVYKGHG 502
+ E + EGP+ ++SP+ I + L + V++
Sbjct: 44 IGETGGPDEVVELDKEGPIHDFAWSPNGKEFAVIYGYMPAKITFFDL---KGNVIHSLGE 100
Query: 503 HPVWDIKFSPHGYYFASAGHDRMA---RLWATDSYHPLRLFVGHYSDVDCVQFHPNSNYV 559
P I +SP G AG +A W + + + DC ++ P+ Y
Sbjct: 101 QPRNTIFWSPFGRLVLLAGFGNLAGQIEFWDVKNKKKIATAE-ASNATDC-EWSPDGRYF 158
Query: 560 ATGSS------DRTIRLWDC 573
T ++ D ++W
Sbjct: 159 LTATTSPRLRVDNGFKIWHY 178
>gnl|CDD|221042 pfam11244, Med25_NR-box, Mediator complex subunit 25 C-terminal
NR box-containing. The overall function of the
full-length Med25 is efficiently to coordinate the
transcriptional activation of RAR/RXR (retinoic acid
receptor/retinoic X receptor) in higher eukaryotic
cells. Human Med25 consists of several domains with
different binding properties, the N-terminal, VWA,
domain, an SD1 - synapsin 1 - domain from residues
229-381, a PTOV(B) or ACID domain from 395-545, an SD2
domain from residues 564-645 and this C-terminal NR
box-containing domain (646-650) from C69-747. The NR
box of MED25 is critical for its recruitment to the
promoter, probably through an interaction with pre
bound RAR.
Length = 89
Score = 36.6 bits (84), Expect = 0.005
Identities = 19/62 (30%), Positives = 24/62 (38%), Gaps = 7/62 (11%)
Query: 24 QTLSPQQPPVS-PQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQ 82
QP ++ Q Q M Q H L P P+ P Q QQ+ P Q S Q
Sbjct: 1 HQQQQGQPGLAAMQQQAMGQQQQGHQL-PM-PGPAQFPLQQLQQM----RGPGGGQMSMQ 54
Query: 83 SG 84
+G
Sbjct: 55 AG 56
Score = 31.6 bits (71), Expect = 0.28
Identities = 16/57 (28%), Positives = 19/57 (33%), Gaps = 7/57 (12%)
Query: 20 PIIQQTLSPQQPPVSP------QHQNMMSPIQ-QHSLSPHTQAPSPQPPAQPQQVPS 69
+ QQ Q P P Q Q M P Q S+ +AP QP S
Sbjct: 17 AMGQQQQGHQLPMPGPAQFPLQQLQQMRGPGGGQMSMQAGGRAPQQMHALQPLLGQS 73
>gnl|CDD|166942 PRK00404, tatB, sec-independent translocase; Provisional.
Length = 141
Score = 37.5 bits (87), Expect = 0.006
Identities = 23/68 (33%), Positives = 30/68 (44%), Gaps = 5/68 (7%)
Query: 23 QQTLSPQQPPVSPQH---QNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQT 79
++ L+P PP P+ SP P T P+ PPA VP+P PPPS
Sbjct: 76 RKILAPLTPPAPPEPVTPPTAQSPAPAVPTPPPTSTPAV-PPAPAAAVPAPAAAPPPSDP 134
Query: 80 SKQSGKAP 87
Q +AP
Sbjct: 135 P-QPPRAP 141
>gnl|CDD|222579 pfam14179, YppG, YppG-like protein. The YppG-like protein family
includes the B. subtilis YppG protein, which is
functionally uncharacterized. This family of proteins
is found in bacteria. Proteins in this family are
typically between 115 and 181 amino acids in length.
There are two completely conserved residues (F and G)
that may be functionally important.
Length = 110
Score = 36.2 bits (84), Expect = 0.008
Identities = 17/65 (26%), Positives = 20/65 (30%), Gaps = 6/65 (9%)
Query: 28 PQQPPVSPQHQNMMSPIQQHSLSPHTQAPSP-QPPAQPQQVPSPYHTPPPSQTSKQSGKA 86
QQP + M P P Q P QP SP PP S +
Sbjct: 16 QQQPYQQQPYHQQMPPPPYS--PPQQQQGHFMPPQPQPYPKQSPQQQQPP---QFSSFLS 70
Query: 87 PDKNT 91
KN+
Sbjct: 71 QFKNS 75
Score = 35.1 bits (81), Expect = 0.020
Identities = 17/75 (22%), Positives = 18/75 (24%), Gaps = 12/75 (16%)
Query: 11 PPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSP 70
Q Q P PP Q M P P P PQQ P
Sbjct: 16 QQQPYQQQPYHQQMPPPPYSPPQQQQGHFMPPQ------------PQPYPKQSPQQQQPP 63
Query: 71 YHTPPPSQTSKQSGK 85
+ SQ G
Sbjct: 64 QFSSFLSQFKNSDGT 78
>gnl|CDD|236138 PRK07994, PRK07994, DNA polymerase III subunits gamma and tau;
Validated.
Length = 647
Score = 39.1 bits (92), Expect = 0.009
Identities = 15/64 (23%), Positives = 21/64 (32%), Gaps = 2/64 (3%)
Query: 16 FHSVPIIQQTLSPQQP-PVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTP 74
FH + + P Q + Q +P + P P A PQQ P+
Sbjct: 359 FHPAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASA-PQQAPAVPLPE 417
Query: 75 PPSQ 78
SQ
Sbjct: 418 TTSQ 421
Score = 32.5 bits (75), Expect = 0.95
Identities = 21/110 (19%), Positives = 33/110 (30%), Gaps = 4/110 (3%)
Query: 9 DDPPQESFHSVPIIQQTLSPQQ----PPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQP 64
+ P +S Q T +P P S QQ P + S A+
Sbjct: 368 PEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQ 427
Query: 65 QQVPSPYHTPPPSQTSKQSGKAPDKNTLITMLKLLRQYNLRGTEEQLKKE 114
Q + T + +A N+ + L +R + KKE
Sbjct: 428 QLQRAQGATKAKKSEPAAASRARPVNSALERLASVRPAPSALEKAPAKKE 477
>gnl|CDD|217298 pfam02948, Amelogenin, Amelogenin. Amelogenins play a role in
biomineralisation. They seem to regulate the formation
of crystallites during the secretory stage of tooth
enamel development. thought to play a major role in the
structural organisation and mineralisation of developing
enamel. They are found in the extracellular matrix.
Mutations in X-chromosomal amelogenin can cause
Amelogenesis imperfecta.
Length = 174
Score = 37.2 bits (86), Expect = 0.010
Identities = 26/87 (29%), Positives = 30/87 (34%), Gaps = 17/87 (19%)
Query: 3 QLP--SLMDDPPQESFHSVPIIQQTLSPQQP--PVSPQHQNM--------MSPIQQHSLS 50
Q+P P H + I L PQQP PV H + + P QH L
Sbjct: 56 QMPQQQQSAHPKLTPHHQLLI----LPPQQPMMPVPGHHPMVPMTGQQPHLQPPAQHPLQ 111
Query: 51 PHT-QAPSPQPPAQPQQVPSPYHTPPP 76
P Q P PQ P Q P P
Sbjct: 112 PTYGQNPQPQQPTHTQPPVQPQQPADP 138
Score = 36.4 bits (84), Expect = 0.018
Identities = 23/76 (30%), Positives = 29/76 (38%), Gaps = 5/76 (6%)
Query: 4 LPSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHT-QAPSPQPPA 62
+P P Q+ + T Q + PQ M P H + P T Q P QPPA
Sbjct: 51 IPLSPQMPQQQQSA---HPKLTPHHQLLILPPQQPMMPVPGH-HPMVPMTGQQPHLQPPA 106
Query: 63 QPQQVPSPYHTPPPSQ 78
Q P+ P P Q
Sbjct: 107 QHPLQPTYGQNPQPQQ 122
>gnl|CDD|150370 pfam09682, Holin_LLH, Phage holin protein (Holin_LLH). This entry
identifies a family of putative phage holin from a
number of phage and prophage regions of Gram-positive
bacteria. Like other holins, it is small (about 100
amino acids) with stretches of hydrophobic sequence and
is encoded adjacent to lytic enzymes.
Length = 108
Score = 35.8 bits (83), Expect = 0.012
Identities = 13/40 (32%), Positives = 20/40 (50%), Gaps = 2/40 (5%)
Query: 78 QTSKQSGKAPDKNTLIT--MLKLLRQYNLRGTEEQLKKEI 115
Q +K+ GK +K T + L + N+ T+ QLK I
Sbjct: 58 QIAKEGGKGEEKLTQAKNKLKAELDKANIHPTDAQLKGFI 97
>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
Transcription initiation factor IIA (TFIIA) is a
heterotrimer, the three subunits being known as alpha,
beta, and gamma, in order of molecular weight. The N and
C-terminal domains of the gamma subunit are represented
in pfam02268 and pfam02751, respectively. This family
represents the precursor that yields both the alpha and
beta subunits. The TFIIA heterotrimer is an essential
general transcription initiation factor for the
expression of genes transcribed by RNA polymerase II.
Together with TFIID, TFIIA binds to the promoter region;
this is the first step in the formation of a
pre-initiation complex (PIC). Binding of the rest of the
transcription machinery follows this step. After
initiation, the PIC does not completely dissociate from
the promoter. Some components, including TFIIA, remain
attached and re-initiate a subsequent round of
transcription.
Length = 332
Score = 37.4 bits (87), Expect = 0.018
Identities = 10/72 (13%), Positives = 14/72 (19%), Gaps = 4/72 (5%)
Query: 8 MDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMM----SPIQQHSLSPHTQAPSPQPPAQ 63
+ P S + P + Q P P Q + P
Sbjct: 45 AEFPWDPSPQAPPPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPATFALP 104
Query: 64 PQQVPSPYHTPP 75
T P
Sbjct: 105 AGPAGPTIQTEP 116
Score = 34.7 bits (80), Expect = 0.15
Identities = 15/38 (39%), Positives = 18/38 (47%), Gaps = 2/38 (5%)
Query: 56 PSPQPPAQPQQVPSPYHTPPPSQTSKQS--GKAPDKNT 91
PSPQ P Q+P P PPP+Q + G NT
Sbjct: 51 PSPQAPPPVAQLPQPLPQPPPTQALQALPAGDQQQHNT 88
Score = 30.1 bits (68), Expect = 4.4
Identities = 29/155 (18%), Positives = 44/155 (28%), Gaps = 20/155 (12%)
Query: 19 VPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQV----PSPYHTP 74
VP++ P P Q + +QQ +P S Q P+Q Q S
Sbjct: 124 VPVMVTQNPANSPLDQPAQQRALQQLQQRYGAPA----SGQLPSQQQSAQKNDESQLQQQ 179
Query: 75 P-----PSQTSKQSGKAPDKNTLITMLK---LLRQYNLRGTEEQLKKEINVNDLGIDPND 126
P P QT + + R G +
Sbjct: 180 PNGETPPQQTDGAGDDESEALVRLREADGTLEQRIKGAEGGGAMKVLKQPKKQAKSSKRR 239
Query: 127 SEIGDLLSAYESEGDPNLYEDAYQDLKKFIENSLD 161
+ + EGD + +D +D IE+ LD
Sbjct: 240 TIAQIDGIDSDDEGDGSD-DDDDED---AIESDLD 270
Score = 30.1 bits (68), Expect = 4.6
Identities = 15/88 (17%), Positives = 21/88 (23%), Gaps = 8/88 (9%)
Query: 5 PSLMDDPPQESFHSVPIIQ---QTLSPQQPPVSPQ---HQNMMSPIQQHSLSPHTQAPSP 58
I Q Q P + Q + + P QQ +L Q
Sbjct: 96 NPPATFALPAGPAGPTIQTEPGQLYPVQVPVMVTQNPANSPLDQPAQQRAL-QQLQQRYG 154
Query: 59 QPPA-QPQQVPSPYHTPPPSQTSKQSGK 85
P + Q SQ +Q
Sbjct: 155 APASGQLPSQQQSAQKNDESQLQQQPNG 182
Score = 30.1 bits (68), Expect = 4.6
Identities = 10/61 (16%), Positives = 13/61 (21%), Gaps = 3/61 (4%)
Query: 28 PQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVP-SPYHTPPPSQTSKQSGKA 86
P P Q P Q P PP+ + +G A
Sbjct: 51 PSPQAPPPVAQL--PQPLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPATFALPAGPA 108
Query: 87 P 87
Sbjct: 109 G 109
>gnl|CDD|222449 pfam13908, Shisa, Wnt and FGF inhibitory regulator. Shisa is a
transcription factor-type molecule that physically
interacts with immature forms of the Wnt receptor
Frizzled and the FGF receptor within the endoplasmic
reticulum to inhibit their post-translational maturation
and trafficking to the cell surface.
Length = 177
Score = 36.7 bits (85), Expect = 0.019
Identities = 13/71 (18%), Positives = 18/71 (25%), Gaps = 1/71 (1%)
Query: 8 MDDPPQES-FHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQ 66
P + + QT QPP + H + P P+P Q
Sbjct: 106 ACRPQRPVMTRATSTTVQTTPLPQPPSTAPSYPGPQYQGYHPMPPQPGMPAPPYSLQYPP 165
Query: 67 VPSPYHTPPPS 77
PP
Sbjct: 166 PGLLQPQGPPP 176
>gnl|CDD|221625 pfam12533, Neuro_bHLH, Neuronal helix-loop-helix transcription
factor. This domain family is found in eukaryotes, and
is approximately 80 amino acids in length. The family is
found C-terminal to pfam00010. There is a single
completely conserved residue W that may be functionally
important. Neuronal basic helix-loop-helix (bHLH)
transcription factors such as neuroD and neurogenin have
been shown to play important roles in neuronal
development.
Length = 120
Score = 35.4 bits (82), Expect = 0.020
Identities = 23/89 (25%), Positives = 28/89 (31%), Gaps = 12/89 (13%)
Query: 3 QLPS---LMDDPPQESFHSVPIIQQTLSPQQPP--VSPQHQNMMSPIQQHSLSPHTQAPS 57
QL L + S Q P Q P SP H +M S L PH +
Sbjct: 30 QLNPRTLLPEQQQDYSHSGSSPFSQHPYPYQSPGLPSPPHGSM-SSSHSLHLKPHGYCSA 88
Query: 58 PQP---PAQPQQVPSPY---HTPPPSQTS 80
+P P PY +PP S
Sbjct: 89 YEPFYESHSPDCGSPPYDGPLSPPLSING 117
>gnl|CDD|235124 PRK03427, PRK03427, cell division protein ZipA; Provisional.
Length = 333
Score = 37.3 bits (87), Expect = 0.025
Identities = 12/72 (16%), Positives = 19/72 (26%)
Query: 11 PPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSP 70
PP+ + QP P +Q Q +SP + PQ
Sbjct: 108 PPEAQVPPQHAPRPAQPAPQPVQQPAYQPQPEQPLQQPVSPQVAPAPQPVHSAPQPAQQA 167
Query: 71 YHTPPPSQTSKQ 82
+ P +
Sbjct: 168 FQPAEPVAAPQP 179
Score = 36.6 bits (85), Expect = 0.035
Identities = 24/95 (25%), Positives = 32/95 (33%), Gaps = 14/95 (14%)
Query: 3 QLPSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQP-- 60
P+ P + P +Q P Q PVSPQ P+ P+ Q
Sbjct: 119 PRPAQPAPQPVQQPAYQPQPEQ---PLQQPVSPQVAPAPQPVHS------APQPAQQAFQ 169
Query: 61 PAQPQQVPSPYHTPPPSQTSKQSGKAPDKNTLITM 95
PA+P P P P+ K K +I M
Sbjct: 170 PAEPVAAPQPEPVAEPAPVMD---KPKRKEAVIVM 201
Score = 36.6 bits (85), Expect = 0.042
Identities = 22/81 (27%), Positives = 27/81 (33%), Gaps = 5/81 (6%)
Query: 11 PPQESFHSVPIIQQTLSPQQPPVSPQH--QNMMSPIQQHSLSPHTQAPSPQPPAQP--QQ 66
PP S +QQ Q PP Q P+QQ + P + P Q P P
Sbjct: 94 PPYASAQPRQPVQQPPEAQVPPQHAPRPAQPAPQPVQQPAYQPQPEQP-LQQPVSPQVAP 152
Query: 67 VPSPYHTPPPSQTSKQSGKAP 87
P P H+ P P
Sbjct: 153 APQPVHSAPQPAQQAFQPAEP 173
Score = 35.4 bits (82), Expect = 0.087
Identities = 14/82 (17%), Positives = 18/82 (21%), Gaps = 4/82 (4%)
Query: 5 PSLMDDPPQESFHSV-PIIQQTLSPQQPPVSPQHQNMMSPIQQHSL---SPHTQAPSPQP 60
+ H P Q P+ Q + + P Q
Sbjct: 78 AQEHEAARPSPQHQYQPPYASAQPRQPVQQPPEAQVPPQHAPRPAQPAPQPVQQPAYQPQ 137
Query: 61 PAQPQQVPSPYHTPPPSQTSKQ 82
P QP Q P P Q
Sbjct: 138 PEQPLQQPVSPQVAPAPQPVHS 159
Score = 31.5 bits (72), Expect = 1.5
Identities = 13/56 (23%), Positives = 17/56 (30%), Gaps = 2/56 (3%)
Query: 32 PVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQSGKAP 87
P + Q P QH P + P+ P Q P P + Q P
Sbjct: 75 PANAQEHEAARPSPQHQYQPPYASAQPRQPVQQP--PEAQVPPQHAPRPAQPAPQP 128
>gnl|CDD|215533 PLN02983, PLN02983, biotin carboxyl carrier protein of acetyl-CoA
carboxylase.
Length = 274
Score = 36.7 bits (85), Expect = 0.026
Identities = 21/67 (31%), Positives = 29/67 (43%), Gaps = 7/67 (10%)
Query: 21 IIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTS 80
+I++ + QPP M P PH P+ P AQP +P +PPP+ S
Sbjct: 134 VIRKKEALPQPPPPAPVVMMQPP------PPHAMPPASPPAAQPAP-SAPASSPPPTPAS 186
Query: 81 KQSGKAP 87
KAP
Sbjct: 187 PPPAKAP 193
Score = 35.2 bits (81), Expect = 0.086
Identities = 19/57 (33%), Positives = 26/57 (45%), Gaps = 3/57 (5%)
Query: 28 PQQPPVSPQHQNMMSPIQQHSLSPHTQAP-SPQPPAQPQQVPSPYHTPPPSQTSKQS 83
PQ PP +P MM P H++ P + P P A P +PPP++ K S
Sbjct: 142 PQPPPPAPVV--MMQPPPPHAMPPASPPAAQPAPSAPASSPPPTPASPPPAKAPKSS 196
>gnl|CDD|222095 pfam13388, DUF4106, Protein of unknown function (DUF4106). This
family of proteins are found in large numbers in the
Trichomonas vaginalis proteome. The function of this
protein is unknown.
Length = 422
Score = 37.0 bits (85), Expect = 0.030
Identities = 17/74 (22%), Positives = 28/74 (37%)
Query: 28 PQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQSGKAP 87
+P P QN + + Q Q P QP Q P+P + ++Q K
Sbjct: 200 APKPTQQPTVQNPAQQPTVQNPAQQPQQQPQQQPVQPAQQPTPQNPAQQPPQTEQGHKRS 259
Query: 88 DKNTLITMLKLLRQ 101
+ LK+L++
Sbjct: 260 REQGNQEFLKMLKE 273
>gnl|CDD|177871 PLN02226, PLN02226, 2-oxoglutarate dehydrogenase E2 component.
Length = 463
Score = 37.0 bits (85), Expect = 0.031
Identities = 26/111 (23%), Positives = 46/111 (41%), Gaps = 21/111 (18%)
Query: 21 IIQQTLSPQQPPVSP--------QHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVP---- 68
+IQ+ L + V P + ++ S + P T P P PPA+ +Q P
Sbjct: 144 VIQEFLVKEGDTVEPGTKVAIISKSEDAASQVTPSQKIPETTDPKPSPPAEDKQKPKVES 203
Query: 69 --------SPYHTPPPSQTSKQSGKAP-DKNTLITMLKLLRQYNLRGTEEQ 110
+P PPP Q++K+ P ++ + M +L ++ R + Q
Sbjct: 204 APVAEKPKAPSSPPPPKQSAKEPQLPPKERERRVPMTRLRKRVATRLKDSQ 254
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 37.0 bits (85), Expect = 0.038
Identities = 30/96 (31%), Positives = 35/96 (36%), Gaps = 14/96 (14%)
Query: 5 PSLMDDP---PQESFHSVPIIQQTLSPQ------QPPVSPQHQNMMSPIQQHSLSPHTQA 55
PS+ P S Q + PQ QP PQ + +S I SL P +
Sbjct: 179 PSIQVPPGAALAPSAPPPTPSAQAVPPQGSPIAAQPAPQPQQPSPLSLISAPSLHP-QRL 237
Query: 56 PSPQPPAQP----QQVPSPYHTPPPSQTSKQSGKAP 87
PSP PP QP QQ P P S G P
Sbjct: 238 PSPHPPLQPQTASQQSPQPPAPSSRHPQSSHHGPGP 273
Score = 35.0 bits (80), Expect = 0.15
Identities = 22/79 (27%), Positives = 28/79 (35%), Gaps = 7/79 (8%)
Query: 11 PPQESFHSVPIIQQTLSPQQ-----PPVSPQHQNMMSPIQQHSLSPHTQAP--SPQPPAQ 63
PQ+ I +L PQ+ PP+ PQ + SP S H Q+ P PP
Sbjct: 217 QPQQPSPLSLISAPSLHPQRLPSPHPPLQPQTASQQSPQPPAPSSRHPQSSHHGPGPPMP 276
Query: 64 PQQVPSPYHTPPPSQTSKQ 82
P PS Q
Sbjct: 277 HALQQGPVFLQHPSSNPPQ 295
Score = 32.4 bits (73), Expect = 1.0
Identities = 23/78 (29%), Positives = 29/78 (37%), Gaps = 9/78 (11%)
Query: 17 HSVPIIQQTLSPQQPPVSP----QHQNMMSPIQQHSLSPHT---QAPSPQPPAQPQQVPS 69
S P L+ Q P P + +P Q +L P + P P P+ P P
Sbjct: 290 SSNPPQPFGLAQSQVPPLPLPSQAQPHSHTPPSQSALQPQQPPREQPLPPAPSMPHIKPP 349
Query: 70 PYHTPPPSQTSKQSGKAP 87
P T P Q QS K P
Sbjct: 350 P--TTPIPQLPNQSHKHP 365
Score = 32.0 bits (72), Expect = 1.4
Identities = 15/62 (24%), Positives = 22/62 (35%), Gaps = 6/62 (9%)
Query: 24 QTLSPQ---QPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTS 80
Q+L P+ P SP QH + PP+ P P+ PP + +
Sbjct: 429 QSLPPKASTHPHSGLHSGPPQSPFAQHPFTSGGLPAIGPPPSLPTSTPAA---PPRASSG 485
Query: 81 KQ 82
Q
Sbjct: 486 SQ 487
Score = 31.6 bits (71), Expect = 1.7
Identities = 31/103 (30%), Positives = 35/103 (33%), Gaps = 25/103 (24%)
Query: 11 PPQESFHSVPIIQQTLSPQ--QPPVSPQHQNMMSPIQQHSLSPHTQAPS---------PQ 59
PP+E P+ P PP +P Q P Q H PH Q PS P
Sbjct: 331 PPREQ----PLPPAPSMPHIKPPPTTPIPQL---PNQSHKHPPHLQGPSPFPQMPSNLPP 383
Query: 60 PPA------QPQQVPSPYHTPPPSQTSKQSGKAPDKNTLITML 96
PPA P P H PPP Q QS +L
Sbjct: 384 PPALKPLSSLPTHHPPSAH-PPPLQLMPQSQPLQSVPAQPPVL 425
Score = 30.4 bits (68), Expect = 3.8
Identities = 29/103 (28%), Positives = 39/103 (37%), Gaps = 14/103 (13%)
Query: 3 QLPSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQ---NMMSPIQQ---HSLSPHTQAP 56
+LPS P Q S Q + P S H M +QQ P + P
Sbjct: 236 RLPS-PHPPLQPQTASQQSPQPPAPSSRHPQSSHHGPGPPMPHALQQGPVFLQHPSSNPP 294
Query: 57 SPQPPAQ----PQQVPS---PYHTPPPSQTSKQSGKAPDKNTL 92
P AQ P +PS P+ PPSQ++ Q + P + L
Sbjct: 295 QPFGLAQSQVPPLPLPSQAQPHSHTPPSQSALQPQQPPREQPL 337
Score = 30.0 bits (67), Expect = 5.1
Identities = 19/63 (30%), Positives = 22/63 (34%), Gaps = 2/63 (3%)
Query: 27 SPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPS--PYHTPPPSQTSKQSG 84
P QPPV Q Q++ H S P P AQ P PPPS +
Sbjct: 418 VPAQPPVLTQSQSLPPKASTHPHSGLHSGPPQSPFAQHPFTSGGLPAIGPPPSLPTSTPA 477
Query: 85 KAP 87
P
Sbjct: 478 APP 480
Score = 29.3 bits (65), Expect = 9.2
Identities = 21/82 (25%), Positives = 29/82 (35%), Gaps = 7/82 (8%)
Query: 11 PPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSP 70
PP+ S H + PP SP Q+ + ++ P P+ P A P+ S
Sbjct: 432 PPKASTHP-----HSGLHSGPPQSPFAQHPFTSGGLPAIGPPPSLPTSTPAAPPRA--SS 484
Query: 71 YHTPPPSQTSKQSGKAPDKNTL 92
PP S G A L
Sbjct: 485 GSQPPGSALPSSGGCAGPGPPL 506
>gnl|CDD|214832 smart00817, Amelin, Ameloblastin precursor (Amelin). This family
consists of several mammalian Ameloblastin precursor
(Amelin) proteins. Matrix proteins of tooth enamel
consist mainly of amelogenin but also of non-amelogenin
proteins, which, although their volumetric percentage is
low, have an important role in enamel mineralisation.
One of the non-amelogenin proteins is ameloblastin, also
known as amelin and sheathlin. Ameloblastin (AMBN) is
one of the enamel sheath proteins which is though to
have a role in determining the prismatic structure of
growing enamel crystals.
Length = 411
Score = 36.8 bits (85), Expect = 0.040
Identities = 22/70 (31%), Positives = 29/70 (41%), Gaps = 6/70 (8%)
Query: 13 QESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQA----PSPQPPAQPQQVP 68
Q+ +S+P+ L P QP + PQ Q + P Q + P QA PQPP Q P
Sbjct: 87 QQYEYSLPVHPPPL-PSQPSLQPQ-QPGLKPFLQPTALPTNQATPQKNGPQPPMHLGQPP 144
Query: 69 SPYHTPPPSQ 78
P
Sbjct: 145 LQQAELPMIP 154
>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
non-fungal. The approx. 70 residue Med15 domain of the
ARC-Mediator co-activator is a three-helix bundle with
marked similarity to the KIX domain. The sterol
regulatory element binding protein (SREBP) family of
transcription activators use the ARC105 subunit to
activate target genes in the regulation of cholesterol
and fatty acid homeostasis. In addition, Med15 is a
critical transducer of gene activation signals that
control early metazoan development.
Length = 768
Score = 36.5 bits (84), Expect = 0.054
Identities = 22/108 (20%), Positives = 37/108 (34%), Gaps = 5/108 (4%)
Query: 11 PPQESFHSVP-IIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPS 69
PQ+ + Q PQQ P Q + + +QQ P P PQ+
Sbjct: 262 QPQQMQGGQMQVPMQQQPPQQQPQQSQLGMLPNQMQQMPGGGQGGPGQPMGP-PPQR--- 317
Query: 70 PYHTPPPSQTSKQSGKAPDKNTLITMLKLLRQYNLRGTEEQLKKEINV 117
P P Q +Q + + L M + + ++Q ++ N
Sbjct: 318 PGAVPQGGQAVQQGVMSAGQQQLKQMKLRNMRGQQQTQQQQQQQGGNH 365
Score = 34.2 bits (78), Expect = 0.28
Identities = 21/60 (35%), Positives = 24/60 (40%), Gaps = 5/60 (8%)
Query: 27 SPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQ--PQQVPSP-YHTPPPSQTSKQS 83
SP P +Q+M P Q SP P QPP +PSP P PS QS
Sbjct: 413 SPSPVPQVQTNQSMPQPPQPSVPSP--GGPGSQPPQSVSGGMIPSPPALMPSPSPQMSQS 470
Score = 33.4 bits (76), Expect = 0.49
Identities = 19/78 (24%), Positives = 25/78 (32%), Gaps = 2/78 (2%)
Query: 11 PPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPA-QPQQVPS 69
P S S QT P P + P Q + P PPA P P
Sbjct: 408 PGMMSSPSPVPQVQTNQSMPQPPQPSVPSPGGPGSQPP-QSVSGGMIPSPPALMPSPSPQ 466
Query: 70 PYHTPPPSQTSKQSGKAP 87
+P +T +Q +P
Sbjct: 467 MSQSPASQRTIQQDMVSP 484
Score = 33.1 bits (75), Expect = 0.63
Identities = 19/84 (22%), Positives = 20/84 (23%), Gaps = 8/84 (9%)
Query: 12 PQESFHSVPIIQQTLSPQQPPVSPQHQ--------NMMSPIQQHSLSPHTQAPSPQPPAQ 63
PQ QQ + QQ P Q Q M P Q P Q
Sbjct: 224 PQMQQQLQNQQQQQMDQQQGPADAQAQMGQQQQGQGGMQPQQMQGGQMQVPMQQQPPQQQ 283
Query: 64 PQQVPSPYHTPPPSQTSKQSGKAP 87
PQQ Q P
Sbjct: 284 PQQSQLGMLPNQMQQMPGGGQGGP 307
Score = 31.5 bits (71), Expect = 1.9
Identities = 22/80 (27%), Positives = 25/80 (31%), Gaps = 8/80 (10%)
Query: 19 VPIIQQTLSPQQ-PPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPS 77
IQ P+ MMS S SP Q + Q QP Q P P S
Sbjct: 388 YLNIQGNQGGLGANPMQQGQPGMMS-----SPSPVPQVQTNQSMPQPPQPSVPSPGGPGS 442
Query: 78 Q--TSKQSGKAPDKNTLITM 95
Q S G P L+
Sbjct: 443 QPPQSVSGGMIPSPPALMPS 462
Score = 30.7 bits (69), Expect = 2.8
Identities = 12/65 (18%), Positives = 17/65 (26%)
Query: 3 QLPSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPA 62
Q P +QQ + Q + QQ + Q P A
Sbjct: 309 QPMGPPPQRPGAVPQGGQAVQQGVMSAGQQQLKQMKLRNMRGQQQTQQQQQQQGGNHPAA 368
Query: 63 QPQQV 67
QQ+
Sbjct: 369 HQQQM 373
>gnl|CDD|221040 pfam11235, Med25_SD1, Mediator complex subunit 25 synapsin 1. The
overall function of the full-length Med25 is efficiently
to coordinate the transcriptional activation of RAR/RXR
(retinoic acid receptor/retinoic X receptor) in higher
eukaryotic cells. Human Med25 consists of several
domains with different binding properties, the
N-terminal, VWA, domain, this SD1 - synapsin 1 - domain
from residues 229-381, a PTOV(B) or ACID domain from
395-545, an SD2 domain from residues 564-645 and a
C-terminal NR box-containing domain (646-650) from
646-747. This The function of the SD domains is unclear.
Length = 168
Score = 34.8 bits (79), Expect = 0.063
Identities = 19/65 (29%), Positives = 27/65 (41%), Gaps = 9/65 (13%)
Query: 23 QQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQ 82
+Q L + P++P Q P +QAP+P P P P P P SQ S
Sbjct: 72 KQGLKNRFSPITPLQQA------PIVGPPFSQAPAPVLPPGPPGAPKP---SPASQLSLV 122
Query: 83 SGKAP 87
+ +P
Sbjct: 123 TTVSP 127
Score = 28.6 bits (63), Expect = 8.4
Identities = 20/68 (29%), Positives = 23/68 (33%), Gaps = 1/68 (1%)
Query: 11 PPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSP 70
P Q++ P Q +P PP P SP Q SL S P QQ P
Sbjct: 84 PLQQAPIVGPPFSQAPAPVLPP-GPPGAPKPSPASQLSLVTTVSPGSGLAPVLTQQQVPP 142
Query: 71 YHTPPPSQ 78
PS
Sbjct: 143 QQPQQPSM 150
>gnl|CDD|217509 pfam03353, Lin-8, Ras-mediated vulval-induction antagonist. LIN-8
is a nuclear protein, present at the sites of
transcriptional repressor complexes, which interacts
with LIN-35 Rb.Lin35 Rb is a product of the class B
synMuv gene lin-35 which silences genes required for
vulval specification through chromatin modification and
remodelling. The biological role of the interaction has
not yet been determined however predictions have been
made. The interaction shows that class A synMuv genes
control vulval induction through the transcriptional
regulation of gene expression. LIN-8 normally functions
as part of a protein complex however when the complex is
absent, other family members can partially replace LIN-8
activity.
Length = 316
Score = 35.8 bits (83), Expect = 0.067
Identities = 21/90 (23%), Positives = 33/90 (36%), Gaps = 17/90 (18%)
Query: 1 MDQLPSLMDDPPQES--------FHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPH 52
D +L D P++S SVP Q +PQQP S ++ + + S
Sbjct: 176 DDYGANLAVDEPEQSTMSRPQEVKQSVPAQQAPPNPQQPMPSASSES--ATSKSAST--- 230
Query: 53 TQAPSPQPPAQPQQVPSPYHTPPPSQTSKQ 82
S + QPQ P P ++ +
Sbjct: 231 ----SRESSPQPQSPPPRRVPAPTVDSTAK 256
>gnl|CDD|147567 pfam05454, DAG1, Dystroglycan (Dystrophin-associated glycoprotein
1). Dystroglycan is one of the dystrophin-associated
glycoproteins, which is encoded by a 5.5 kb transcript
in human. The protein product is cleaved into two
non-covalently associated subunits, [alpha] (N-terminal)
and [beta] (C-terminal). In skeletal muscle the
dystroglycan complex works as a transmembrane linkage
between the extracellular matrix and the cytoskeleton.
[alpha]-dystroglycan is extracellular and binds to
merosin ([alpha]-2 laminin) in the basement membrane,
while [beta]-dystroglycan is a transmembrane protein and
binds to dystrophin, which is a large rod-like
cytoskeletal protein, absent in Duchenne muscular
dystrophy patients. Dystrophin binds to intracellular
actin cables. In this way, the dystroglycan complex,
which links the extracellular matrix to the
intracellular actin cables, is thought to provide
structural integrity in muscle tissues. The dystroglycan
complex is also known to serve as an agrin receptor in
muscle, where it may regulate agrin-induced
acetylcholine receptor clustering at the neuromuscular
junction. There is also evidence which suggests the
function of dystroglycan as a part of the signal
transduction pathway because it is shown that Grb2, a
mediator of the Ras-related signal pathway, can interact
with the cytoplasmic domain of dystroglycan. In general,
aberrant expression of dystrophin-associated protein
complex underlies the pathogenesis of Duchenne muscular
dystrophy, Becker muscular dystrophy and severe
childhood autosomal recessive muscular dystrophy.
Interestingly, no genetic disease has been described for
either [alpha]- or [beta]-dystroglycan. Dystroglycan is
widely distributed in non-muscle tissues as well as in
muscle tissues. During epithelial morphogenesis of
kidney, the dystroglycan complex is shown to act as a
receptor for the basement membrane. Dystroglycan
expression in mouse brain and neural retina has also
been reported. However, the physiological role of
dystroglycan in non-muscle tissues has remained unclear.
Length = 290
Score = 35.4 bits (81), Expect = 0.084
Identities = 25/85 (29%), Positives = 37/85 (43%), Gaps = 9/85 (10%)
Query: 8 MDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNM--MSPIQQHSLSPHTQAPSPQPPAQPQ 65
+DD S+P+I + P PP +QN+ +P+ Q L +T P A P
Sbjct: 199 LDDSKPPPSSSMPLILKEEKPPLPPPEYPNQNVPETTPLNQDLLGEYTPLRDEDPNAPPY 258
Query: 66 QVPSPYHTP-------PPSQTSKQS 83
Q P P+ TP P + T +S
Sbjct: 259 QPPPPFTTPMEGKGSRPKNMTPYRS 283
>gnl|CDD|191194 pfam05111, Amelin, Ameloblastin precursor (Amelin). This family
consists of several mammalian Ameloblastin precursor
(Amelin) proteins. Matrix proteins of tooth enamel
consist mainly of amelogenin but also of non-amelogenin
proteins, which, although their volumetric percentage is
low, have an important role in enamel mineralisation.
One of the non-amelogenin proteins is ameloblastin, also
known as amelin and sheathlin. Ameloblastin (AMBN) is
one of the enamel sheath proteins which is though to
have a role in determining the prismatic structure of
growing enamel crystals.
Length = 410
Score = 35.3 bits (81), Expect = 0.11
Identities = 20/60 (33%), Positives = 28/60 (46%), Gaps = 6/60 (10%)
Query: 13 QESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQA----PSPQPPAQPQQVP 68
Q+ +SVP+ L P QP + P Q + P Q + + Q PQPP P Q+P
Sbjct: 102 QQYEYSVPVHPPPL-PSQPSLQP-QQPGLKPFLQPTAATAIQDTPQKAGPQPPMHPGQLP 159
>gnl|CDD|235540 PRK05641, PRK05641, putative acetyl-CoA carboxylase biotin
carboxyl carrier protein subunit; Validated.
Length = 153
Score = 33.7 bits (77), Expect = 0.12
Identities = 13/47 (27%), Positives = 21/47 (44%), Gaps = 1/47 (2%)
Query: 41 MSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQSGKAP 87
+S +Q+ +P AP+P P+ P V P P+ + AP
Sbjct: 45 LSAVQEQVPTP-APAPAPAVPSAPTPVAPAAPAPAPASAGENVVTAP 90
>gnl|CDD|225629 COG3087, FtsN, Cell division protein [Cell division and chromosome
partitioning].
Length = 264
Score = 34.8 bits (80), Expect = 0.13
Identities = 16/91 (17%), Positives = 30/91 (32%), Gaps = 10/91 (10%)
Query: 5 PSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQP---- 60
P+ + D + + +++Q Q+ P + P TQ+ Q
Sbjct: 83 PAAVKDAERLTPEQRQLLEQMEVDQKA--QPTQLGEQPEQARIEEQPRTQSQKAQSQATT 140
Query: 61 ----PAQPQQVPSPYHTPPPSQTSKQSGKAP 87
P +P+ P P+ + KAP
Sbjct: 141 VQTQPVKPKPRPEKPQPVAPAPAPEPVEKAP 171
>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 35.3 bits (82), Expect = 0.13
Identities = 11/72 (15%), Positives = 19/72 (26%)
Query: 20 PIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQT 79
I + + +P P +P T+ + P P + P P
Sbjct: 364 FISEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAEPTE 423
Query: 80 SKQSGKAPDKNT 91
+ A N
Sbjct: 424 PSPTPPANAANA 435
Score = 33.8 bits (78), Expect = 0.35
Identities = 14/91 (15%), Positives = 19/91 (20%), Gaps = 10/91 (10%)
Query: 4 LPSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQ 63
L L+ S + P +P PSP P
Sbjct: 356 LLGLLPSAFI----SEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKA 411
Query: 64 ----PQQVPSPYHTPPPSQTSKQSGKAPDKN 90
P +P P + P N
Sbjct: 412 SPPIPVPAEPTEPSPTPPANAAN--APPSLN 440
Score = 33.4 bits (77), Expect = 0.43
Identities = 13/75 (17%), Positives = 20/75 (26%)
Query: 17 HSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPP 76
S +P P SP P+ Q + + P P PP Q + P
Sbjct: 513 QSGSASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPA 572
Query: 77 SQTSKQSGKAPDKNT 91
+ +
Sbjct: 573 DSSPPPPIPEEPTPS 587
Score = 30.7 bits (70), Expect = 3.1
Identities = 22/108 (20%), Positives = 28/108 (25%), Gaps = 16/108 (14%)
Query: 22 IQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSK 81
I+ L Q + P Q+ P P PQP A P PPP+ T
Sbjct: 506 IKLNLESQSGS-ASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPP--PPPPTATQA 562
Query: 82 QSGKAPDKNTLITMLKLLRQYNLRGTEEQLKKEINVNDLGIDPNDSEI 129
S Q + E D + EI
Sbjct: 563 SSNAPA-------------QIPADSSPPPPIPEEPTPSPTKDSSPEEI 597
>gnl|CDD|218161 pfam04589, RFX1_trans_act, RFX1 transcription activation region.
The RFX family is a family of winged-helix DNA binding
proteins. RFX1 is a regulatory factor essential for
expression of MHC class II genes. This region is to
found N terminal to the RFX DNA binding region
(pfam02257) in some mammalian RFX proteins, and is
thought to activate transcription when associated with
DNA. Deletion analysis has identified the region
233-351 in human RFX1 as being required for maximal
activation.
Length = 150
Score = 33.4 bits (76), Expect = 0.14
Identities = 17/53 (32%), Positives = 20/53 (37%), Gaps = 1/53 (1%)
Query: 24 QTLSPQQPPVSPQHQN-MMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPP 75
QT P Q PV Q ++ Q P Q P + PQQV H P
Sbjct: 17 QTSVPAQAPVPASQQRSVVQATSQTKGGPVQQLPVHRVQQVPQQVQQVQHVYP 69
>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
glycoprotein G.
Length = 408
Score = 34.6 bits (79), Expect = 0.15
Identities = 16/68 (23%), Positives = 26/68 (38%), Gaps = 6/68 (8%)
Query: 25 TLSPQQPPVSPQH-QNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQS 83
+ SPQ PP + Q+ Q+H+ T P+ P +P T +T
Sbjct: 244 SGSPQHPPSTTSQDQSTTGDGQEHTQRRKT-PPATSNRRSPHSTATPPPTTKRQET---- 298
Query: 84 GKAPDKNT 91
G+ + T
Sbjct: 299 GRPTPRPT 306
Score = 31.6 bits (71), Expect = 1.5
Identities = 17/62 (27%), Positives = 26/62 (41%), Gaps = 5/62 (8%)
Query: 29 QQPPVSPQHQNMMSPIQQHSLSPHTQ-----APSPQPPAQPQQVPSPYHTPPPSQTSKQS 83
Q+ P N SP + P T+ P+P+P A Q SP H+ PP + +
Sbjct: 269 QRRKTPPATSNRRSPHSTATPPPTTKRQETGRPTPRPTATTQSGSSPPHSSPPGVQANPT 328
Query: 84 GK 85
+
Sbjct: 329 TQ 330
>gnl|CDD|221628 pfam12540, DUF3736, Protein of unknown function (DUF3736). This
domain family is found in eukaryotes, and is typically
between 135 and 160 amino acids in length.
Length = 147
Score = 33.3 bits (76), Expect = 0.20
Identities = 20/102 (19%), Positives = 36/102 (35%), Gaps = 11/102 (10%)
Query: 1 MDQLPSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQP 60
P ++D ++ + + + Q +Q M ++ H ++PSPQ
Sbjct: 51 PSTPPLVLDASTKK----LEFLWRVGLTCQQQACAYYQGKMGIRRRPL---HEESPSPQR 103
Query: 61 PAQPQQVPSPYHTPPPSQTSK----QSGKAPDKNTLITMLKL 98
VPSP + P + S Q + K + L L
Sbjct: 104 LQGAPPVPSPSNQAYPFELSPSSVMQDPRFQPKKQFLAQLNL 145
>gnl|CDD|219358 pfam07271, Cytadhesin_P30, Cytadhesin P30/P32. This family
consists of several Mycoplasma species specific
Cytadhesin P32 and P30 proteins. P30 has been found to
be membrane associated and localised on the tip
organelle. It is thought that it is important in
cytadherence and virulence.
Length = 279
Score = 34.3 bits (78), Expect = 0.20
Identities = 21/99 (21%), Positives = 28/99 (28%), Gaps = 13/99 (13%)
Query: 1 MDQLPSLMDDPPQESFHSVPIIQQTLSPQQP-----PVSPQHQNMMSPIQQHSLSPH-TQ 54
M P PP P + P P Q+ NMM + + P
Sbjct: 182 MGMRPGFNQMPPHMPGMP-PNQMRPGFNPMPGMPPRPGFNQNPNMMPNMNRPGFRPQPGG 240
Query: 55 APSPQPPAQPQQV------PSPYHTPPPSQTSKQSGKAP 87
P P P P+ PPP ++G P
Sbjct: 241 FNHPGTPMGPNMQQRPGFNPNQGMNPPPHMAGPRAGFPP 279
>gnl|CDD|236333 PRK08691, PRK08691, DNA polymerase III subunits gamma and tau;
Validated.
Length = 709
Score = 34.3 bits (78), Expect = 0.29
Identities = 22/60 (36%), Positives = 27/60 (45%), Gaps = 2/60 (3%)
Query: 32 PVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQSGKAPDKNT 91
PVS Q N + P + T A + Q A+ Q S TPP +Q SK KA D T
Sbjct: 424 PVSNQENNDVPPWEDAPDEAQTAAGTAQTSAKSIQTASEAETPPENQVSKN--KAADNET 481
>gnl|CDD|219569 pfam07777, MFMR, G-box binding protein MFMR. This region is
found to the N-terminus of the pfam00170 transcription
factor domain. It is between 150 and 200 amino acids in
length. The N-terminal half is rather rich in proline
residues and has been termed the PRD (proline rich
domain), whereas the C-terminal half is more polar and
has been called the MFMR (multifunctional mosaic
region). It has been suggested that this family is
composed of three sub-families called A, B and C,
classified according to motif composition. It has been
suggested that some of these motifs may be involved in
mediating protein-protein interactions. The MFMR region
contains a nuclear localisation signal in bZIP opaque
and GBF-2. The MFMR also contains a transregulatory
activity in TAF-1. The MFMR in CPRF-2 contains
cytoplasmic retention signals.
Length = 189
Score = 32.9 bits (75), Expect = 0.31
Identities = 19/69 (27%), Positives = 22/69 (31%), Gaps = 7/69 (10%)
Query: 15 SFHSVPIIQQTLSPQQPPVSPQHQNMMS---PIQQHSLSPHTQAPSPQPP----AQPQQV 67
S P P V P M + P + A SPQP Q +
Sbjct: 9 PSKSSPKTSVQEDTPTPTVYPDWSAMQAYYGPRPPPPYFNSSVASSPQPHPYMWGPQQPM 68
Query: 68 PSPYHTPPP 76
PY TPPP
Sbjct: 69 MPPYGTPPP 77
>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein.
Length = 582
Score = 33.7 bits (77), Expect = 0.38
Identities = 18/75 (24%), Positives = 26/75 (34%), Gaps = 2/75 (2%)
Query: 4 LPSLMDDPPQESFHSVPIIQQTLSPQ--QPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPP 61
+P + PP S +P +L+P P + + +P AP P
Sbjct: 182 IPGAIQPPPPSSLPGLPPGSSSLAPSASSTPGNRLPRVSFNPFLPGPSPAQPSAPPASIP 241
Query: 62 AQPQQVPSPYHTPPP 76
A P Y PPP
Sbjct: 242 APPIPPVIQYVAPPP 256
Score = 33.4 bits (76), Expect = 0.51
Identities = 12/51 (23%), Positives = 19/51 (37%)
Query: 18 SVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVP 68
S P P P + + P Q H +A + + PA P+ +P
Sbjct: 234 SAPPASIPAPPIPPVIQYVAPPPVPPPQPIIPIQHIRAVTGETPANPRDIP 284
Score = 29.5 bits (66), Expect = 7.9
Identities = 19/74 (25%), Positives = 22/74 (29%), Gaps = 3/74 (4%)
Query: 5 PSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQP 64
PS + P S P T + P VS N P + A P PP P
Sbjct: 191 PSSLPGLPPGSSSLAPSASSTPGNRLPRVS---FNPFLPGPSPAQPSAPPASIPAPPIPP 247
Query: 65 QQVPSPYHTPPPSQ 78
PP Q
Sbjct: 248 VIQYVAPPPVPPPQ 261
>gnl|CDD|225657 COG3115, ZipA, Cell division protein [Cell division and chromosome
partitioning].
Length = 324
Score = 33.3 bits (76), Expect = 0.39
Identities = 20/87 (22%), Positives = 31/87 (35%), Gaps = 5/87 (5%)
Query: 10 DPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPS 69
DPP + P + Q P++ Q +P HS +P + Q PA P+Q
Sbjct: 111 DPPAHPQPTQPALDQEQPPEEARQPVLPQEAPAPQPVHSAAPQPAVQTVQ-PAVPEQQVQ 169
Query: 70 PYHTPPPSQTSKQSGKAPDKNTLITML 96
P K P + + +L
Sbjct: 170 ----PEEVVEPAPEVKRPPRKDTVIIL 192
Score = 30.2 bits (68), Expect = 3.6
Identities = 17/77 (22%), Positives = 19/77 (24%), Gaps = 5/77 (6%)
Query: 2 DQLPSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPP 61
Q P P S + Q P Q P H P P P
Sbjct: 83 RQSPQHQYQPEYASAQIKIPVPQ---PPQISDPPAHPQPTQPALDQEQPPEEARQPVLP- 138
Query: 62 AQPQQVPSPYHTPPPSQ 78
Q P P H+ P
Sbjct: 139 -QEAPAPQPVHSAAPQP 154
>gnl|CDD|180523 PRK06305, PRK06305, DNA polymerase III subunits gamma and tau;
Validated.
Length = 451
Score = 33.6 bits (77), Expect = 0.41
Identities = 19/76 (25%), Positives = 27/76 (35%), Gaps = 4/76 (5%)
Query: 1 MDQLPSLMDDPPQESFHSVPIIQQTLSP--QQPPVSPQHQNMMSPIQQHSLSPHTQAPSP 58
+ QL S + P Q S QQ SPQ Q + Q L+ +
Sbjct: 355 LSQLVSQIKSPAFTQLRLESCPPQIKSSNSQQTVSSPQPQPVAKLEQGSLLTAASPQTKK 414
Query: 59 QPPAQPQQVPSPYHTP 74
+ Q+VPSP +
Sbjct: 415 ET--IHQEVPSPSLSK 428
>gnl|CDD|178748 PLN03209, PLN03209, translocon at the inner envelope of chloroplast
subunit 62; Provisional.
Length = 576
Score = 33.4 bits (76), Expect = 0.42
Identities = 20/89 (22%), Positives = 34/89 (38%), Gaps = 6/89 (6%)
Query: 11 PPQESFHS--VPIIQQTLSPQQPPVSPQHQNMMSPIQQHS--LSPHTQAPSPQPPAQPQQ 66
PP+ES + + + P P + P LSP+T +PP P
Sbjct: 329 PPKESDAADGPKPVPTKPVTPEAPSPPIEEEPPQPKAVVPRPLSPYTAYEDLKPPTSP-- 386
Query: 67 VPSPYHTPPPSQTSKQSGKAPDKNTLITM 95
+P+P + P S S + P + ++
Sbjct: 387 IPTPPSSSPASSKSVDAVAKPAEPDVVPS 415
>gnl|CDD|221509 pfam12287, Caprin-1_C, Cytoplasmic
activation/proliferation-associated protein-1 C term.
This family of proteins is found in eukaryotes. Proteins
in this family are typically between 343 and 708 amino
acids in length. This family is the C terminal region of
caprin-1. Caprin-1 is a protein involved in regulating
cellular proliferation. In mutated phenotypes, the G1
phase of the cell cycle is greatly lengthened, impairing
normal proliferation. The C terminal region of caprin-1
contains RGG motifs which are characteristic of RNA
binding domains. It is possible that caprin-1 functions
through an RNA binding mechanism.
Length = 319
Score = 33.0 bits (75), Expect = 0.44
Identities = 24/77 (31%), Positives = 31/77 (40%), Gaps = 7/77 (9%)
Query: 3 QLPSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQH-SLSP-HTQAPSPQP 60
Q+P M P E + S P + Q +P PQ + + PIQ SL+ T S P
Sbjct: 65 QVP--MVSPTSEGYTSSPPLYQPSHTAEP--RPQ-TDPIDPIQASMSLNSEQTPTSSSLP 119
Query: 61 PAQPQQVPSPYHTPPPS 77
A QV P S
Sbjct: 120 AASQPQVFQTGSKPLHS 136
>gnl|CDD|184256 PRK13700, PRK13700, conjugal transfer protein TraD; Provisional.
Length = 732
Score = 33.4 bits (76), Expect = 0.46
Identities = 29/114 (25%), Positives = 49/114 (42%), Gaps = 17/114 (14%)
Query: 54 QAPSPQPPAQPQQVPSPYHTPPPSQTSKQSGKAPDKNTLITMLKLLRQYNLRGTEEQLK- 112
Q PQ P QPQQ P P K+S + G E++LK
Sbjct: 621 QPQQPQQPQQPQQPQQPQQPVSPVINDKKSDAGVNVPA-------------GGIEQELKM 667
Query: 113 KEINVNDLGIDPNDSEIGDL--LSAYESEGDPNLYEDAYQDLKKFIENSLDVYK 164
K + + P SE G++ ++AYE+ N + D Q +++ E +++V++
Sbjct: 668 KPEEEMEQQLPPGISESGEVVDMAAYEAWQQEN-HPDIQQQMQRREEVNINVHR 720
>gnl|CDD|140276 PTZ00249, PTZ00249, variable surface protein Vir28; Provisional.
Length = 516
Score = 33.5 bits (76), Expect = 0.47
Identities = 25/87 (28%), Positives = 38/87 (43%), Gaps = 4/87 (4%)
Query: 10 DPPQESFHSVPI-IQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQ-QV 67
DPP+E +V + +S + P P+H + SP T+ P+P + PQ
Sbjct: 216 DPPREEQKAVTAHAHRRISGEARP--PKHISFSSPHAHGRPPVETRPPNPVSVSSPQAHG 273
Query: 68 PSPYHTPPPSQTSKQSGKAPDKNTLIT 94
P T P + S KA D+N + T
Sbjct: 274 RHPGETHTPPLVTVPSSKAHDRNPVQT 300
>gnl|CDD|237015 PRK11901, PRK11901, hypothetical protein; Reviewed.
Length = 327
Score = 32.7 bits (75), Expect = 0.58
Identities = 17/66 (25%), Positives = 21/66 (31%), Gaps = 1/66 (1%)
Query: 26 LSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQSGK 85
LS QQ V+ QN +P T APS P + P T K +
Sbjct: 153 LSQQQGQVNAASQNAQGNTSTLPTAPATVAPSKGAK-VPATAETHPTPPQKPATKKPAVN 211
Query: 86 APDKNT 91
T
Sbjct: 212 HHKTAT 217
>gnl|CDD|218735 pfam05760, IER, Immediate early response protein (IER). This
family consists of several eukaryotic immediate early
response (IER) 2 and 5 proteins. The role of IER5 is
unclear although it play an important role in mediating
the cellular response to mitogenic signals. Again,
little is known about the function of IER2 although it
is thought to play a role in mediating the cellular
responses to a variety of extracellular signals.
Length = 272
Score = 32.6 bits (74), Expect = 0.63
Identities = 12/49 (24%), Positives = 18/49 (36%)
Query: 35 PQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQS 83
Q M+ P +Q +P PS P P + SP P + +
Sbjct: 60 QAEQPMVPPPEQQPGAPRLHPPSGAEPPLPARAASPEEEQPAPEPAFYR 108
>gnl|CDD|236940 PRK11633, PRK11633, cell division protein DedD; Provisional.
Length = 226
Score = 32.3 bits (74), Expect = 0.63
Identities = 14/77 (18%), Positives = 23/77 (29%), Gaps = 13/77 (16%)
Query: 10 DPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPS 69
DP + + P+ + + P P P PQ + P
Sbjct: 86 DPATVAPPNTPVEPEPAPVEPPKPKPV-------------EKPKPKPKPQQKVEAPPAPK 132
Query: 70 PYHTPPPSQTSKQSGKA 86
P P + + +GKA
Sbjct: 133 PEPKPVVEEKAAPTGKA 149
Score = 31.5 bits (72), Expect = 1.2
Identities = 18/75 (24%), Positives = 30/75 (40%), Gaps = 6/75 (8%)
Query: 18 SVPIIQQTLSPQQPP----VSPQHQNMMSP-IQQHSLSPHTQAPSPQPPAQPQQVPSPYH 72
+P Q L P QPP + + + +P + +++P P+P P P
Sbjct: 55 MMPAATQAL-PTQPPEGAAEAVRAGDAAAPSLDPATVAPPNTPVEPEPAPVEPPKPKPVE 113
Query: 73 TPPPSQTSKQSGKAP 87
P P +Q +AP
Sbjct: 114 KPKPKPKPQQKVEAP 128
>gnl|CDD|176758 cd08780, Death_TRADD, Death Domain of Tumor Necrosis Factor
Receptor 1-Associated Death Domain protein. Death
domain (DD) of TRADD (TNF Receptor 1-Associated Death
Domain or TNFRSF1A-associated via death domain) protein.
TRADD is a central signaling adaptor for TNF-receptor 1
(TNFR1), mediating activation of Nuclear Factor -kappaB
(NF-kB) and c-Jun N-terminal kinase (JNK), as well as
caspase-dependent apoptosis. It also carries important
immunological roles including germinal center formation,
DR3-mediated T-cell stimulation, and TNFalpha-mediated
inflammatory responses. In general, DDs are
protein-protein interaction domains found in a variety
of domain architectures. Their common feature is that
they form homodimers by self-association or heterodimers
by associating with other members of the DD superfamily
including CARD (Caspase activation and recruitment
domain), DED (Death Effector Domain), and PYRIN. They
serve as adaptors in signaling pathways and can recruit
other proteins into signaling complexes.
Length = 90
Score = 30.6 bits (69), Expect = 0.66
Identities = 15/33 (45%), Positives = 20/33 (60%), Gaps = 3/33 (9%)
Query: 125 NDSEIGDLLSAYESEGDPNLYEDAYQDLKKFIE 157
D I +L Y+ EG LYE AYQ L++FI+
Sbjct: 31 RDPAIDNLAYEYDREG---LYEQAYQLLRRFIQ 60
>gnl|CDD|217453 pfam03251, Tymo_45kd_70kd, Tymovirus 45/70Kd protein. Tymoviruses
are single stranded RNA viruses. This family includes a
protein of unknown function that has been named based on
its molecular weight. Tymoviruses such as the ononis
yellow mosaic tymovirus encode only three proteins. Of
these two are overlapping this protein overlaps a larger
ORF that is thought to be the polymerase.
Length = 458
Score = 32.8 bits (75), Expect = 0.70
Identities = 19/87 (21%), Positives = 26/87 (29%), Gaps = 15/87 (17%)
Query: 15 SFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQP-----------PAQ 63
H + Q +QP + Q P + SL H S P+
Sbjct: 95 HVHETLKVSQA-PGEQPKLPRAPQLPSHPRRHRSLPHHLHHSSKHRNRLHARRSDVLPSL 153
Query: 64 PQQVPSPYHT-PPPSQTSKQSGKAPDK 89
P H P PS Q ++P K
Sbjct: 154 SN--SRPVHELPKPSTPVLQPRRSPRK 178
>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
Validated.
Length = 824
Score = 32.7 bits (75), Expect = 0.76
Identities = 11/69 (15%), Positives = 19/69 (27%)
Query: 11 PPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSP 70
P + P P S Q + + + +P P+ PA P+
Sbjct: 440 APPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAA 499
Query: 71 YHTPPPSQT 79
P +
Sbjct: 500 PAAPAGADD 508
Score = 32.3 bits (74), Expect = 1.2
Identities = 17/65 (26%), Positives = 23/65 (35%), Gaps = 4/65 (6%)
Query: 28 PQQPPVSPQHQNMMSPIQQHS----LSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQS 83
P +PQ +P +P APSP P A P P+P P T+ +
Sbjct: 422 APAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPA 481
Query: 84 GKAPD 88
P
Sbjct: 482 PAPPA 486
Score = 31.9 bits (73), Expect = 1.3
Identities = 16/65 (24%), Positives = 21/65 (32%)
Query: 23 QQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQ 82
Q P +P +P P APS QP P P P P P+ +
Sbjct: 429 PQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAP 488
Query: 83 SGKAP 87
+ A
Sbjct: 489 APAAA 493
Score = 31.1 bits (71), Expect = 2.2
Identities = 14/64 (21%), Positives = 22/64 (34%), Gaps = 3/64 (4%)
Query: 27 SPQQPPVSPQHQNMMSPIQQHSLSPH---TQAPSPQPPAQPQQVPSPYHTPPPSQTSKQS 83
+P P + SP + S A +P+P A P P P + + +
Sbjct: 440 APPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAA 499
Query: 84 GKAP 87
AP
Sbjct: 500 PAAP 503
Score = 30.7 bits (70), Expect = 3.6
Identities = 8/58 (13%), Positives = 13/58 (22%)
Query: 27 SPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQSG 84
+ P + +P + P P P P P + S
Sbjct: 407 AAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSA 464
Score = 29.6 bits (67), Expect = 7.9
Identities = 7/46 (15%), Positives = 12/46 (26%), Gaps = 1/46 (2%)
Query: 42 SPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQSGKAP 87
+ + +P P+P P P+ AP
Sbjct: 397 AAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAP-APAPAP 441
>gnl|CDD|180413 PRK06128, PRK06128, oxidoreductase; Provisional.
Length = 300
Score = 32.1 bits (73), Expect = 0.79
Identities = 18/49 (36%), Positives = 22/49 (44%), Gaps = 5/49 (10%)
Query: 51 PHTQAPSPQPPAQPQQVPSPYH--TPPP---SQTSKQSGKAPDKNTLIT 94
P TQ P P P Q Q+ P H P P Q+ K G+ + LIT
Sbjct: 13 PLTQYPQPPFPEQTQEAPGTIHEMQPKPDHGEQSYKGFGRLQGRKALIT 61
>gnl|CDD|235585 PRK05733, PRK05733, single-stranded DNA-binding protein;
Provisional.
Length = 172
Score = 31.4 bits (71), Expect = 0.83
Identities = 12/41 (29%), Positives = 15/41 (36%)
Query: 36 QHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPP 76
Q + +P QA PQ AQ Q P+P P
Sbjct: 118 DDQGGQGGGNYNQSAPRQQAQRPQQAAQQQSRPAPQQPAPQ 158
>gnl|CDD|188060 TIGR00531, BCCP, acetyl-CoA carboxylase, biotin carboxyl carrier
protein. This model is designed to identify biotin
carboxyl carrier protein as a peptide of acetyl-CoA
carboxylase. Scoring below the trusted cutoff is a
related protein encoded in a region associated with
polyketide synthesis in the prokaryote
Saccharopolyspora hirsuta, and a reported
chloroplast-encoded biotin carboxyl carrier protein
that may be highly derived from the last common
ancestral sequence. Scoring below the noise cutoff are
biotin carboxyl carrier domains of other enzymes such
as pyruvate carboxylase.The gene name is accB or fabE
[Fatty acid and phospholipid metabolism, Biosynthesis].
Length = 156
Score = 31.3 bits (71), Expect = 0.87
Identities = 13/58 (22%), Positives = 19/58 (32%), Gaps = 10/58 (17%)
Query: 27 SPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQ---VPSP-----YHTPPP 76
S Q +P P + +P +P P ++ V SP Y P P
Sbjct: 42 SAVQQAAAPVP--AQVPAAPSAQAPAPAVCAPAPAKADKKGHFVRSPMVGTFYRAPSP 97
>gnl|CDD|183854 PRK13042, PRK13042, superantigen-like protein; Reviewed.
Length = 291
Score = 32.3 bits (73), Expect = 0.87
Identities = 25/107 (23%), Positives = 42/107 (39%), Gaps = 11/107 (10%)
Query: 13 QESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLS--PHTQAPSPQPPAQPQQVPSP 70
Q + + P + +PQ P S + + +P + + + P T+ +PQ S
Sbjct: 26 QAANATTPSSTKVEAPQSTPPSTKVE---APQSKPNATTPPSTKVEAPQQTPNATTPSST 82
Query: 71 YHTPPPSQTSKQSGKAPDKNTLITMLKLLRQYNLRGTEEQLKKEINV 117
P S T+KQ + K LR Y + + E K EI +
Sbjct: 83 KVETPQSPTTKQVPTEINPK-----FKDLRAYYTKPSLE-FKNEIGI 123
>gnl|CDD|218181 pfam04621, ETS_PEA3_N, PEA3 subfamily ETS-domain transcription
factor N terminal domain. The N terminus of the PEA3
transcription factors is implicated in transactivation
and in inhibition of DNA binding. Transactivation is
potentiated by activation of the Ras/MAP kinase and
protein kinase A signalling cascades. The N terminal
region contains conserved MAP kinase phosphorylation
sites.
Length = 336
Score = 32.2 bits (73), Expect = 0.92
Identities = 14/79 (17%), Positives = 19/79 (24%)
Query: 6 SLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQ 65
S D P F SP P + + P+ S Q P P
Sbjct: 120 SAYDRKPASGFKPPTPPSTPCSPVNPQETVRQLQPSGPLSNSSPPSPHTPLPNQSPLPPP 179
Query: 66 QVPSPYHTPPPSQTSKQSG 84
P + +Q
Sbjct: 180 MSSPDSSYPSEHRFQRQLS 198
>gnl|CDD|225689 COG3147, DedD, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 226
Score = 31.8 bits (72), Expect = 0.94
Identities = 21/94 (22%), Positives = 33/94 (35%), Gaps = 12/94 (12%)
Query: 10 DPPQ-----ESFHSVPIIQQTLS-PQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQ 63
PP+ + +P + Q ++ P QPP ++ Q + + PQP AQ
Sbjct: 43 LPPKPQGDRDEPRVLPAVVQVVALPTQPPEG------VAQEIQDAGDAAAASVDPQPVAQ 96
Query: 64 PQQVPSPYHTPPPSQTSKQSGKAPDKNTLITMLK 97
P +P P +QT K K
Sbjct: 97 PPVESTPAGVPVAAQTPKPVKPPKQPPAGAVPAK 130
Score = 29.5 bits (66), Expect = 5.7
Identities = 17/70 (24%), Positives = 26/70 (37%), Gaps = 11/70 (15%)
Query: 5 PSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQP 64
P + PP ES +P PV+ Q + P +Q P+P+P +P
Sbjct: 91 PQPVAQPPVES-----------TPAGVPVAAQTPKPVKPPKQPPAGAVPAKPTPKPEPKP 139
Query: 65 QQVPSPYHTP 74
P+ T
Sbjct: 140 VAEPAAAPTG 149
>gnl|CDD|221480 pfam12238, MSA-2c, Merozoite surface antigen 2c. This family of
proteins is found in eukaryotes. Proteins in this family
are typically between 263 and 318 amino acids in length.
There is a conserved SFT sequence motif. MSA-2 is a
plasma membrane glycoprotein which can be found in
Babesia bovis species.
Length = 201
Score = 31.7 bits (72), Expect = 0.95
Identities = 13/57 (22%), Positives = 21/57 (36%), Gaps = 4/57 (7%)
Query: 24 QTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTS 80
+ SP+ S Q + P + + P+P PA P TP + +S
Sbjct: 140 EYYSPKHS--SSQGTSTTRPSDGSATPNTSAPPTPGNPAAQP--EKPAETPKGNGSS 192
>gnl|CDD|220708 pfam10349, WWbp, WW-domain ligand protein. The WWbp domain is
characterized by several short PY and PT-like motifs of
the PPPPY form. These appear to bind directly to the WW
domains of WWP1 and WWP2 and other such diverse proteins
as dystrophin and YAP (Yes-associated protein). This is
the WW-domain binding protein WWbp via PY and PY_like
motifs. The presence of a phosphotyrosine residue in the
pWBP-1 peptide abolishes WW domain binding which
suggests a potential regulatory role for tyrosine
phosphorylation in modulating WW domain-ligand
interactions. Given the likelihood that WWP1 and WWP2
function as E3 ubiquitin-protein ligases, it is possible
that initial substrate-specific recognition occurs via
WW domain-substrate protein interaction followed by
ubiquitin transfer and subsequent proteolysis. This
domain lies just downstream of the GRAM (pfam02893) in
many members.
Length = 111
Score = 30.4 bits (69), Expect = 1.0
Identities = 12/49 (24%), Positives = 15/49 (30%)
Query: 28 PQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPP 76
Q P + P P+ A PQ+ P PY PP
Sbjct: 63 GQYAAPPPYGPPPPYYPAPPGVYPTPPPPNSGYMADPQEPPPPYPGPPQ 111
>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
2, 3, 4 family. These eukaryotic proteins recognize the
poly-A of mRNA and consists of four tandem RNA
recognition domains at the N-terminus (rrm: pfam00076)
followed by a PABP-specific domain (pfam00658) at the
C-terminus. The protein is involved in the transport of
mRNA's from the nucleus to the cytoplasm. There are four
paralogs in Homo sapiens which are expressed in testis
(GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
broadly expressed (SP:P11940_PABP1) and of unknown
tissue range (SP:Q15097_PABP2).
Length = 562
Score = 32.1 bits (73), Expect = 1.0
Identities = 24/110 (21%), Positives = 31/110 (28%), Gaps = 16/110 (14%)
Query: 1 MDQLPSLMDDPPQESFHSVPII----QQTLSPQQPPVSPQHQNMMSPIQ-QHSLSPHTQA 55
M QLP P P Q QP P+ M +P+ L P+ A
Sbjct: 382 MRQLPMGS---PMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGPGGPLRPNGLA 438
Query: 56 PSPQ---PPAQPQQVPS-----PYHTPPPSQTSKQSGKAPDKNTLITMLK 97
P P Q P PP Q+ S P + +
Sbjct: 439 PMNAVRAPSRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGG 488
Score = 31.7 bits (72), Expect = 1.4
Identities = 16/77 (20%), Positives = 26/77 (33%)
Query: 20 PIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQT 79
P + +P P + + + S + Q + +PP QP P Y + P SQ
Sbjct: 417 PRMSMMPTPMGPGGPLRPNGLAPMNAVRAPSRNAQNAAQKPPMQPVMYPPNYQSLPLSQD 476
Query: 80 SKQSGKAPDKNTLITML 96
Q + L
Sbjct: 477 LPQPQSTASQGGQNKKL 493
>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
membranes [Cell envelope biogenesis, outer membrane].
Length = 244
Score = 31.7 bits (72), Expect = 1.0
Identities = 20/87 (22%), Positives = 33/87 (37%), Gaps = 10/87 (11%)
Query: 5 PSLMDDPPQESFHSVPIIQQTLSPQ----QPPVSPQHQNMMSPIQQHSLSPHTQAPSPQP 60
P+ P E P Q + P P+ + P ++ P P+P
Sbjct: 52 PTEEPQPEPE----PPEEQPKPPTEPETPPEPTPPKPKEKPKPEKK--PKKPKPKPKPKP 105
Query: 61 PAQPQQVPSPYHTPPPSQTSKQSGKAP 87
+P+ P P PPS+T+ ++ AP
Sbjct: 106 KPKPKVKPQPKPKKPPSKTAAKAPAAP 132
>gnl|CDD|238879 cd01841, NnaC_like, NnaC (CMP-NeuNAc synthetase) _like subfamily of
SGNH_hydrolases, a diverse family of lipases and
esterases. The tertiary fold of the enzyme is
substantially different from that of the alpha/beta
hydrolase family and unique among all known hydrolases;
its active site closely resembles two of the three
components of typical Ser-His-Asp(Glu) triad from other
serine hydrolases. E. coli NnaC appears to be involved
in polysaccharide synthesis.
Length = 174
Score = 31.1 bits (71), Expect = 1.0
Identities = 13/46 (28%), Positives = 23/46 (50%), Gaps = 5/46 (10%)
Query: 115 INVNDLGIDPNDSEIGDLLSAYESEGDPNLYEDAYQDLKKFIENSL 160
I++ND+ +D G+L Y ++G + YQ L + +E L
Sbjct: 134 IDLNDVLVDEF----GNLKKEYTTDG-LHFNPKGYQKLLEILEEYL 174
>gnl|CDD|220603 pfam10152, DUF2360, Predicted coiled-coil domain-containing protein
(DUF2360). This is the conserved 140 amino acid region
of a family of proteins conserved from nematodes to
humans. One C. elegans member is annotated as a
Daf-16-dependent longevity protein 1 but this could not
be confirmed. The function is unknown.
Length = 147
Score = 30.8 bits (70), Expect = 1.1
Identities = 19/81 (23%), Positives = 28/81 (34%), Gaps = 2/81 (2%)
Query: 18 SVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPS 77
S+P ++ PP P +A SP PP P + P+ P+
Sbjct: 46 SIPGLEDVTVQTTPPPPASAITNGGPPPP--PPARAEAASPPPPEAPAEPPAEPEPEAPA 103
Query: 78 QTSKQSGKAPDKNTLITMLKL 98
+ + K P MLKL
Sbjct: 104 ENTVTVAKDPRYAKYFKMLKL 124
>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
Length = 663
Score = 32.3 bits (73), Expect = 1.2
Identities = 21/157 (13%), Positives = 36/157 (22%), Gaps = 23/157 (14%)
Query: 12 PQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQ-------QHSLSPHTQAPSPQPPAQP 64
++ Q+ P P S ++ M+ L SP P
Sbjct: 371 APQTHTGPADRQRPQRPDGIPYSVPARSPMTAYPPVPQFCGDPGLVSPYNPQSPGTSYGP 430
Query: 65 QQVPSPYHTPP-PSQTSKQSGKAPDKNTLITMLKLLRQYNLRGTEEQLKKEINVNDLGID 123
+ V P P G E + K+ + +
Sbjct: 431 EPVGPVPPQPTNPYVMPISMANMVYPGHPQE----------HGHERKRKRGGELKE---- 476
Query: 124 PNDSEIGDLLSAYESEGDPNLYEDAYQDLKKFIENSL 160
E L+ + E + E K I+
Sbjct: 477 -ELIETLKLVKKLKEEQESLAKELEATAHKSEIKKIA 512
>gnl|CDD|220023 pfam08801, Nucleoporin_N, Nup133 N terminal like. Nup133 is a
nucleoporin that is crucial for nuclear pore complex
(NPC) biogenesis. The N terminal forms a seven-bladed
beta propeller structure. This family now contains other
sized nucleoporins, including Nup155, Nup8, Nuo132,
Nup15 and Nup170.
Length = 411
Score = 32.1 bits (73), Expect = 1.2
Identities = 22/116 (18%), Positives = 42/116 (36%), Gaps = 12/116 (10%)
Query: 598 GRFLASAGGDGNVHMWDLSNGHLLTLLTGHTDVVRALCFSRDGNILASSGQDCS--VKLW 655
GR L + G + +WDLS L D+ + +I S+ S +K+
Sbjct: 203 GRLLYTLTSKGTIRVWDLSPSGE--KLVREVDIYEIILRDLQESIPKSASTLSSKSLKIL 260
Query: 656 DFAKLSDEVGTEEVNLSHSADIKNSSLAYLVRSYKTKASPLLHLIFSRRNILLRGP 711
D + + E +L I ++ VR Y + L+ + ++ +
Sbjct: 261 DISPIPS----GESSLLLLVAITSNG----VRLYFSLLLTLVSVRLPPSSVTISST 308
>gnl|CDD|218328 pfam04921, XAP5, XAP5, circadian clock regulator. This protein is
found in a wide range of eukaryotes. It is a nuclear
protein and is suggested to be DNA binding. In plants,
this family is essential for correct circadian clock
functioning by acting as a light-quality regulator
coordinating the activities of blue and red light
signalling pathways during plant growth - inhibiting
growth in red light but promoting growth in blue light.
Length = 233
Score = 31.6 bits (72), Expect = 1.2
Identities = 18/69 (26%), Positives = 26/69 (37%), Gaps = 14/69 (20%)
Query: 304 DDDEEEAGEGGDKPKKKKAKKDSM--------FSKKTKSDPNAPPVDRIPLPPLKDSEKI 355
DD+EE E + +KK K+ S KK +P+ L D +
Sbjct: 14 GDDDEEEDEDEGEDEKKVPKESSEPDEANVNPNKKKIGKNPSVDT------SFLPDKARE 67
Query: 356 DKLKALREA 364
+K LRE
Sbjct: 68 EKEAELREE 76
>gnl|CDD|151572 pfam11128, Nucleocap_ssRNA, Plant viral coat protein nucleocapsid.
This family of nucleocapsid proteins is from ssRNA
negative-strand viruses of plant origin.
Length = 180
Score = 30.9 bits (70), Expect = 1.2
Identities = 26/87 (29%), Positives = 43/87 (49%), Gaps = 20/87 (22%)
Query: 321 KAKKDSMFSKKTKSDPNAPPVDRIP---LPPLKDSEKIDKLKALR----------EATKR 367
KA + ++F++K + D + P P ++ +EK++K KA+ EA K+
Sbjct: 63 KAIRYAIFAQKFEKDKLSEPTGATPESIKEYMEKNEKLEKAKAIVDFLCSLGSNFEAQKK 122
Query: 368 V-TLGPESLPSIACYTLLNARQQVTCA 393
+ L PE PS +TL Q+TCA
Sbjct: 123 MHPLSPER-PSRKNFTL-----QLTCA 143
>gnl|CDD|218581 pfam05416, Peptidase_C37, Southampton virus-type processing
peptidase. Corresponds to Merops family C37.
Norwalk-like viruses (NLVs), including the Southampton
virus, cause acute non-bacterial gastroenteritis in
humans. The NLV genome encodes three open reading frames
(ORFs). ORF1 encodes a polyprotein, which is processed
by the viral protease into six proteins.
Length = 535
Score = 32.1 bits (73), Expect = 1.3
Identities = 19/77 (24%), Positives = 29/77 (37%), Gaps = 18/77 (23%)
Query: 287 YYGMPREPDLQNTIAVPDDDEEEAGEGGDKPKKKKAKKDSMFSKKTKSDPNAPPVDRIPL 346
YY + + D G+ G K KK + KK + FS++ SD
Sbjct: 208 YYDGCESEEDDEGWYIEPQDATPEGKKG-KNKKGRGKKHNAFSRRGLSD----------- 255
Query: 347 PPLKDSEKIDKLKALRE 363
E+ D+ K +RE
Sbjct: 256 ------EEYDEYKKIRE 266
>gnl|CDD|220633 pfam10214, Rrn6, RNA polymerase I-specific transcription-initiation
factor. RNA polymerase I-specific
transcription-initiation factor Rrn6 and Rrn7 represent
components of a multisubunit transcription factor
essential for the initiation of rDNA transcription by
Pol I. These proteins are found in fungi.
Length = 753
Score = 32.0 bits (73), Expect = 1.3
Identities = 14/81 (17%), Positives = 25/81 (30%)
Query: 5 PSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQP 64
P + D ES +P I+ + Q +P + + S P
Sbjct: 654 PDVTDSSQLESQSQIPTIRSSQQVSQTRKGGSSVVPSAPAPRLAQSSQPPTSQSSSDLPP 713
Query: 65 QQVPSPYHTPPPSQTSKQSGK 85
+ + P Q+ +SG
Sbjct: 714 SSSQAFSLSDLPMQSQSESGL 734
>gnl|CDD|219753 pfam08226, DUF1720, Domain of unknown function (DUF1720). This
domain is found in different combinations with cortical
patch components EF hand, SH3 and ENTH and is therefore
likely to be involved in cytoskeletal processes. This
family contains many hypothetical proteins.
Length = 73
Score = 29.0 bits (65), Expect = 1.5
Identities = 15/60 (25%), Positives = 18/60 (30%), Gaps = 3/60 (5%)
Query: 23 QQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPA-QPQQ--VPSPYHTPPPSQT 79
QQT PQ Q + P + P Q QPQQ P+ T
Sbjct: 4 QQTGYQPPQQQQPQQQGPLQPQPTGFMQPQPTGFGQQQQGLQPQQTGFQPQAGQQMPTGT 63
>gnl|CDD|165431 PHA03160, PHA03160, hypothetical protein; Provisional.
Length = 499
Score = 31.6 bits (71), Expect = 1.5
Identities = 18/69 (26%), Positives = 27/69 (39%), Gaps = 3/69 (4%)
Query: 22 IQQTLSPQQPPVSPQHQNMMSP--IQQHSL-SPHTQAPSPQPPAQPQQVPSPYHTPPPSQ 78
+ PP++ Q M P +QQ + +PH P QPP Q P ++
Sbjct: 405 APKNDHHLLPPLACSQQLPMQPLHVQQAPMQAPHVAPPPMQPPHVQQPRVLPSTDGASNE 464
Query: 79 TSKQSGKAP 87
K S + P
Sbjct: 465 APKPSAQEP 473
Score = 29.7 bits (66), Expect = 6.1
Identities = 22/79 (27%), Positives = 31/79 (39%), Gaps = 8/79 (10%)
Query: 11 PPQESFHSVPIIQQTLSPQQPPVSPQH--QNMMSP--IQQHSLSPHTQAPS---PQPPAQ 63
S + Q L QQ P+ H M P +QQ + P T S P+P AQ
Sbjct: 412 LLPPLACSQQLPMQPLHVQQAPMQAPHVAPPPMQPPHVQQPRVLPSTDGASNEAPKPSAQ 471
Query: 64 -PQQVPSPYHTPPPSQTSK 81
P + + + P S+ K
Sbjct: 472 EPVHIDASFAQDPVSKIQK 490
>gnl|CDD|139494 PRK13335, PRK13335, superantigen-like protein; Reviewed.
Length = 356
Score = 31.6 bits (71), Expect = 1.6
Identities = 20/78 (25%), Positives = 31/78 (39%), Gaps = 10/78 (12%)
Query: 8 MDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQV 67
+ P QE Q++L+ P Q Q+ + S +P T+ +P PQ +
Sbjct: 95 ISQPKQEE-------QKSLNISATPAPKQEQSQTTT---ESTTPKTKVTTPPSTNTPQPM 144
Query: 68 PSPYHTPPPSQTSKQSGK 85
S P S T KQ+
Sbjct: 145 QSTKSDTPQSPTIKQAQT 162
>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 585
Score = 31.7 bits (72), Expect = 1.6
Identities = 11/60 (18%), Positives = 17/60 (28%)
Query: 19 VPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQ 78
P + + PV P P + + + P + P P P P P
Sbjct: 365 APQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHT 424
>gnl|CDD|236797 PRK10927, PRK10927, essential cell division protein FtsN;
Provisional.
Length = 319
Score = 31.2 bits (70), Expect = 2.0
Identities = 13/73 (17%), Positives = 23/73 (31%), Gaps = 6/73 (8%)
Query: 22 IQQTLSPQQPPVSPQ--HQNMMSPIQQHSLSPHTQAPSPQPPAQPQQ--VPSPYHT--PP 75
+ + Q + + Q Q + + P QP Q + +P HT
Sbjct: 162 LAEQQRLAQQSRTTEQSWQQQTRTSQAAPVQAQPRQSKPASTQQPYQDLLQTPAHTTAQS 221
Query: 76 PSQTSKQSGKAPD 88
Q + +A D
Sbjct: 222 KPQQAAPVTRAAD 234
Score = 30.0 bits (67), Expect = 4.1
Identities = 17/84 (20%), Positives = 25/84 (29%), Gaps = 3/84 (3%)
Query: 1 MDQLPSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSP-- 58
++Q+ + M P + VP +QT +Q + Q Q QQ
Sbjct: 122 LEQMQADMRQQPTQ-LVEVPWNEQTPEQRQQTLQRQRQAQQLAEQQRLAQQSRTTEQSWQ 180
Query: 59 QPPAQPQQVPSPYHTPPPSQTSKQ 82
Q Q P S Q
Sbjct: 181 QQTRTSQAAPVQAQPRQSKPASTQ 204
>gnl|CDD|220950 pfam11029, DAZAP2, DAZ associated protein 2 (DAZAP2). DAZ
associated protein 2 has a highly conserved sequence
throughout evolution including a conserved polyproline
region and several SH2/SH3 binding sites. It occurs as
a single copy gene with a four-exon organisation and is
located on chromosome 12. It encodes a ubiquitously
expressed protein and binds to DAZ and DAZL1 through
DAZ repeats.
Length = 136
Score = 29.8 bits (67), Expect = 2.2
Identities = 13/59 (22%), Positives = 17/59 (28%), Gaps = 6/59 (10%)
Query: 20 PIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQ 78
P L V PQ Q + +P+ P AQ V PP +
Sbjct: 5 PPAYSELYQPSYVVPPQAQMPQAS------APYPGPSMYLPMAQVMAVGPQSSHPPMAY 57
>gnl|CDD|177648 PHA03420, PHA03420, E4 protein; Provisional.
Length = 137
Score = 30.0 bits (67), Expect = 2.2
Identities = 14/64 (21%), Positives = 16/64 (25%), Gaps = 5/64 (7%)
Query: 23 QQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPS-----PQPPAQPQQVPSPYHTPPPS 77
+T P P H H L P P P PQ + P
Sbjct: 40 HRTWDPADYLDRPHHHPHRHQQDDHHLQDRQHLPQQHLQRPHHPLSPQCQQNVTGKPVLQ 99
Query: 78 QTSK 81
Q K
Sbjct: 100 QEPK 103
>gnl|CDD|177328 PHA01929, PHA01929, putative scaffolding protein.
Length = 306
Score = 30.8 bits (69), Expect = 2.2
Identities = 16/61 (26%), Positives = 20/61 (32%)
Query: 26 LSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQSGK 85
L PP + IQ + Q +PQ A P Q P P T + Q
Sbjct: 15 LVANVPPAAAPTPQPNPVIQPQAPVQPGQPGAPQQLAIPTQQPQPVPTSAMTPHVVQQAP 74
Query: 86 A 86
A
Sbjct: 75 A 75
Score = 30.8 bits (69), Expect = 2.5
Identities = 24/141 (17%), Positives = 38/141 (26%), Gaps = 9/141 (6%)
Query: 12 PQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPY 71
P + V Q + P QP Q Q S T Q PAQP P
Sbjct: 25 PTPQPNPVIQPQAPVQPGQPGAPQQLAIPTQQPQPVPTSAMTPHVVQQAPAQPAPAAPPA 84
Query: 72 HTPPPSQTSKQSG---KAPDKNTLITMLKLLRQYNLRGTEEQLKKEINVNDLGIDPNDSE 128
+ + P+ + T+ L + QL ++ + + +
Sbjct: 85 AGAALPEALEVPPPPAFTPNGEIVGTLAGNLEG------DPQLAPSVSYLEAFSGLDKLD 138
Query: 129 IGDLLSAYESEGDPNLYEDAY 149
DP + Y
Sbjct: 139 TVRAFGKAAENRDPRFIDTHY 159
>gnl|CDD|184918 PRK14954, PRK14954, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 31.1 bits (70), Expect = 2.2
Identities = 15/73 (20%), Positives = 24/73 (32%), Gaps = 3/73 (4%)
Query: 20 PIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQT 79
P +++ P +H P + P PSP P+Q P P P
Sbjct: 388 PDVKKKAPEPDLPQPDRHPGPAKP-EAPGARPAE-LPSPASAPTPEQQP-PVARSAPLPP 444
Query: 80 SKQSGKAPDKNTL 92
S Q+ + +
Sbjct: 445 SPQASAPRNVASG 457
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 30.8 bits (70), Expect = 2.2
Identities = 24/74 (32%), Positives = 32/74 (43%), Gaps = 9/74 (12%)
Query: 303 PDDDEEEAGEGG----DKPKKKKAKKDSM----FSKKTKSDPNAPP-VDRIPLPPLKDSE 353
P+ D+EE GE ++ KKKK K KK K DP A P K SE
Sbjct: 66 PESDDEEEGEKELQREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSE 125
Query: 354 KIDKLKALREATKR 367
+I L ++ +R
Sbjct: 126 RISWAPTLLDSPRR 139
>gnl|CDD|236362 PRK09010, PRK09010, single-stranded DNA-binding protein;
Provisional.
Length = 177
Score = 30.2 bits (68), Expect = 2.2
Identities = 11/53 (20%), Positives = 16/53 (30%), Gaps = 1/53 (1%)
Query: 23 QQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPP 75
Q +P + Q Q QQ + P Q +P + PP
Sbjct: 117 QGGGAPAGGNIGGQQQGGWGQPQQPQGGNQFSGGAQSRPQQSAP-AAPSNEPP 168
>gnl|CDD|214939 smart00960, Robl_LC7, Roadblock/LC7 domain. This family includes
proteins that are about 100 amino acids long and have
been shown to be related. Members of this family of
proteins are associated with both flagellar outer arm
dynein and Drosophila and rat brain cytoplasmic dynein.
It is proposed that roadblock/LC7 family members may
modulate specific dynein functions. This family also
includes Golgi-associated MP1 adapter protein and MglB
from Myxococcus xanthus, a protein involved in gliding
motility. However the family also includes members from
non-motile bacteria such as Streptomyces coelicolor,
suggesting that the protein may play a structural or
regulatory role.
Length = 88
Score = 28.7 bits (65), Expect = 2.2
Identities = 16/31 (51%), Positives = 17/31 (54%)
Query: 619 HLLTLLTGHTDVVRALCFSRDGNILASSGQD 649
LL LL V AL SRDG +LASSG
Sbjct: 3 ELLDLLARVPGVRGALVVSRDGLLLASSGLS 33
>gnl|CDD|233777 TIGR02205, septum_zipA, cell division protein ZipA. This model
represents the full length of bacterial cell division
protein ZipA. The N-terminal hydrophobic stretch is an
uncleaved signal-anchor sequence. This is followed by an
unconserved, variable length, low complexity region, and
then a conserved C-terminal region of about 140 amino
acids (see pfam04354) that interacts with the
tubulin-like cell division protein FtsZ [Cellular
processes, Cell division].
Length = 284
Score = 30.6 bits (69), Expect = 2.3
Identities = 14/76 (18%), Positives = 19/76 (25%), Gaps = 4/76 (5%)
Query: 9 DDPPQESF----HSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQP 64
DDP E +S + P QH +S + + P QP
Sbjct: 46 DDPTSEEMVQPDNSPNTRVERGEHPIPQPREQHLPSISELVAYQRDKSVDDEEASIPMQP 105
Query: 65 QQVPSPYHTPPPSQTS 80
Q P
Sbjct: 106 TQQQYDMPQPNNVAQQ 121
Score = 29.5 bits (66), Expect = 5.8
Identities = 17/80 (21%), Positives = 26/80 (32%), Gaps = 5/80 (6%)
Query: 3 QLPSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPA 62
+ L+ +S QQ PQ N + Q ++ P P+
Sbjct: 81 SISELVAYQRDKSVDDEEASIPMQPTQQQYDMPQPNN----VAQQTVEPRVAKSLPEASP 136
Query: 63 QPQQVPSPYH-TPPPSQTSK 81
Q ++V T PP Q K
Sbjct: 137 QEEEVGKNLEVTAPPKQKDK 156
>gnl|CDD|235826 PRK06549, PRK06549, acetyl-CoA carboxylase biotin carboxyl
carrier protein subunit; Validated.
Length = 130
Score = 29.8 bits (67), Expect = 2.4
Identities = 12/44 (27%), Positives = 16/44 (36%), Gaps = 1/44 (2%)
Query: 27 SPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSP 70
+ P P + P + S QAP P A +PSP
Sbjct: 25 AQAAAPAQPASTPVPVPTEA-SPQVEAQAPQPAAAAGADAMPSP 67
>gnl|CDD|233990 TIGR02740, TraF-like, TraF-like protein. This protein is related
to the F-type conjugation system pilus assembly proteins
TraF (TIGR02739)and TrbB (TIGR02738) both of which
exhibit a thioredoxin fold. The protein represented by
this model has the same length and architecture as TraF,
but lacks the CXXC-motif found in TrbB and believed to
be responsible for the disulfide isomerase activity of
that protein.
Length = 271
Score = 30.5 bits (69), Expect = 2.8
Identities = 20/109 (18%), Positives = 39/109 (35%), Gaps = 5/109 (4%)
Query: 56 PSPQPPAQPQQVPSPYHTPPPSQTSKQSGKAPDKNTLI--TMLKLLRQYNLRGTEEQLKK 113
P P + + + P PP+Q + AP + + +L + T E ++
Sbjct: 46 EDPPPKEEEEPL-EPAPPAPPAQQPAREELAPFSVAWLRENLPELRDKAIDNPTPENVRA 104
Query: 114 EINVNDLGIDPNDSEIGDLLSAYESEGDPNLYEDAYQDLKKFIENSLDV 162
+ + +D S D +S DP L E + + ++ D
Sbjct: 105 YLEAQRIMLD-KASRFAD-VSQRVIWTDPILDETLRRPVSTLALDAHDT 151
>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23. All proteins
in this family for which functions are known are
components of a multiprotein complex used for targeting
nucleotide excision repair to specific parts of the
genome. In humans, Rad23 complexes with the XPC protein.
This family is based on the phylogenomic analysis of JA
Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
metabolism, DNA replication, recombination, and repair].
Length = 378
Score = 30.6 bits (69), Expect = 3.0
Identities = 10/31 (32%), Positives = 12/31 (38%)
Query: 53 TQAPSPQPPAQPQQVPSPYHTPPPSQTSKQS 83
P A P P+P +PP S S S
Sbjct: 80 GTGKVAPPAATPTSAPTPTPSPPASPASGMS 110
Score = 29.1 bits (65), Expect = 8.8
Identities = 8/47 (17%), Positives = 19/47 (40%)
Query: 45 QQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQSGKAPDKNT 91
+ + + P+ P + P PSP +P ++ + +K+
Sbjct: 76 KPKTGTGKVAPPAATPTSAPTPTPSPPASPASGMSAAPASAVEEKSP 122
>gnl|CDD|219978 pfam08701, GN3L_Grn1, GNL3L/Grn1 putative GTPase. Grn1 (yeast) and
GNL3L (human) are putative GTPases which are required
for growth and play a role in processing of nucleolar
pre-rRNA. This family contains a potential nuclear
localisation signal.
Length = 80
Score = 28.4 bits (64), Expect = 3.1
Identities = 17/58 (29%), Positives = 25/58 (43%), Gaps = 13/58 (22%)
Query: 316 KPKKKKAKKDSMFSKKTKSDPNAPPVDRIP-LPPLKDS------EKIDKLKALREATK 366
+ +K+AKK+ + K K DP IP P K+ EK K + +E K
Sbjct: 15 RKLRKEAKKNPTWKSKKKKDPG------IPNSFPFKEEILEEIEEKKRKQEEEKERRK 66
>gnl|CDD|227400 COG5068, ARG80, Regulator of arginine metabolism and related MADS
box-containing transcription factors [Transcription].
Length = 412
Score = 30.4 bits (68), Expect = 3.2
Identities = 16/91 (17%), Positives = 25/91 (27%), Gaps = 14/91 (15%)
Query: 11 PPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQ----- 65
P H+ + Q P Q S + H PH+ + +PPA+
Sbjct: 185 PMGSFQHNGSPQTNFIPLQNPQT--QQYQQHSSRKDHPTVPHSNTNNGRPPAKFMIPELH 242
Query: 66 --QVPSPYHTPPPSQT-----SKQSGKAPDK 89
+ S + S S D
Sbjct: 243 SSHSTLDLPSDFISDSGFPNQSSTSIFPLDS 273
>gnl|CDD|216839 pfam02002, TFIIE_alpha, TFIIE alpha subunit. The general
transcription factor TFIIE has an essential role in
eukaryotic transcription initiation together with RNA
polymerase II and other general factors. Human TFIIE
consists of two subunits TFIIE-alpha and TFIIE-beta, and
joins the pre-initiation complex after RNA polymerase II
and TFIIF. This family consists of the conserved amino
terminal region of eukaryotic TFIIE-alpha and proteins
from archaebacteria that are presumed to be TFIIE-alpha
subunits also Archaeoglobus fulgidus tfe.
Length = 105
Score = 28.8 bits (65), Expect = 3.3
Identities = 16/59 (27%), Positives = 24/59 (40%), Gaps = 15/59 (25%)
Query: 107 TEEQLKKEINVNDLGIDPNDSEIGDLLSAYESEGDPNLYEDAYQDLKKFIENSLDVYKY 165
TEE L + LGID N E+ LL LY+ ++ ++ Y+Y
Sbjct: 29 TEEDLAEL-----LGIDLN--ELRKLLY--------RLYDARLVKYRRRRDDETGWYRY 72
>gnl|CDD|187832 cd09701, Cas10_III, CRISPR/Cas system-associated protein Cas10.
CRISPR (Clustered Regularly Interspaced Short
Palindromic Repeats) and associated Cas proteins
comprise a system for heritable host defense by
prokaryotic cells against phage and other foreign DNA;
Multidomain protein with permuted HD nuclease domain,
inactivated palm domain and Zn-ribbon; signature gene
for type III.
Length = 909
Score = 30.5 bits (69), Expect = 3.7
Identities = 17/58 (29%), Positives = 25/58 (43%), Gaps = 11/58 (18%)
Query: 115 INVNDLGIDPNDSEIGDLLSAYESEGDPNLYEDAYQDLKKFIENSLDVYKYELGLVLY 172
I N LG+ +I D+L+ E L E+ +KK +E Y LG +Y
Sbjct: 231 IRFNGLGLLLKGLKIPDILARKE------LLEEILDRVKKLLE-----ETYPLGNEVY 277
>gnl|CDD|236733 PRK10672, PRK10672, rare lipoprotein A; Provisional.
Length = 361
Score = 30.4 bits (69), Expect = 3.7
Identities = 13/79 (16%), Positives = 26/79 (32%), Gaps = 3/79 (3%)
Query: 8 MDDPPQESFHSVPIIQQTLSPQQPPVSPQHQN--MMSPIQQHSLSPHTQAPSPQPPAQPQ 65
+ P +P+ TL + P +P + + +P P+P P+
Sbjct: 209 VQPAPAPQGDVLPVSNSTLKSEDPTGAPVTSSGFLGAPTTLAPGVLEGSEPTPTAPS-SA 267
Query: 66 QVPSPYHTPPPSQTSKQSG 84
+P P + + S
Sbjct: 268 PATAPAAAAPQAAATSSSA 286
>gnl|CDD|227665 COG5373, COG5373, Predicted membrane protein [Function unknown].
Length = 931
Score = 30.6 bits (69), Expect = 4.2
Identities = 10/40 (25%), Positives = 17/40 (42%)
Query: 53 TQAPSPQPPAQPQQVPSPYHTPPPSQTSKQSGKAPDKNTL 92
+ P P PPA Q+ +P P + + + AP +
Sbjct: 75 PEVPPPVPPAPAQEGEAPAAEQPSAVPAPSAAPAPAEPVE 114
>gnl|CDD|171499 PRK12438, PRK12438, hypothetical protein; Provisional.
Length = 991
Score = 30.6 bits (69), Expect = 4.2
Identities = 8/47 (17%), Positives = 13/47 (27%), Gaps = 1/47 (2%)
Query: 51 PHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQSGKAPDKNTLITMLK 97
P P+P P + P PP + + L+
Sbjct: 916 PGAGPPAPPQAVPPPRTTQP-PAAPPRGPDVPPAAVAELRETLADLR 961
>gnl|CDD|215145 PLN02258, PLN02258, 9-cis-epoxycarotenoid dioxygenase NCED.
Length = 590
Score = 30.4 bits (69), Expect = 4.3
Identities = 15/72 (20%), Positives = 21/72 (29%), Gaps = 3/72 (4%)
Query: 11 PPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSP 70
+S S + S PP S + S HT + P P PS
Sbjct: 8 SRSQSHASSS--SSSSSQSSPPSSTSPRPRRRKPSASS-LLHTPSILPLPKLSSPSPPSV 64
Query: 71 YHTPPPSQTSKQ 82
P + + Q
Sbjct: 65 TLPPAATTQTPQ 76
>gnl|CDD|218146 pfam04554, Extensin_2, Extensin-like region.
Length = 57
Score = 27.4 bits (61), Expect = 4.3
Identities = 14/38 (36%), Positives = 18/38 (47%), Gaps = 3/38 (7%)
Query: 43 PIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTS 80
P++Q+S P SP PP + SP PPP S
Sbjct: 9 PVKQYSPPPPYYYKSPPPPVKSPVYKSP---PPPVYKS 43
>gnl|CDD|151560 pfam11116, DUF2624, Protein of unknown function (DUF2624). This
family is conserved in the Bacillaceae family. Several
members are named as YqfT. The function is not known.
Length = 84
Score = 28.1 bits (63), Expect = 4.5
Identities = 15/64 (23%), Positives = 27/64 (42%), Gaps = 2/64 (3%)
Query: 95 MLKLLRQYNLRGTEEQLKKEIN-VNDLGIDP-NDSEIGDLLSAYESEGDPNLYEDAYQDL 152
+LK +QY + T Q K+ + I+ N+SE LL P ++ +
Sbjct: 20 LLKYAKQYGIPITRAQAKQIAKLLRGKNINIFNESERSKLLKELAKITSPQTAQEVNELF 79
Query: 153 KKFI 156
+F+
Sbjct: 80 AQFV 83
>gnl|CDD|234764 PRK00442, tatA, twin arginine translocase protein A; Provisional.
Length = 92
Score = 28.3 bits (63), Expect = 4.6
Identities = 11/31 (35%), Positives = 15/31 (48%)
Query: 56 PSPQPPAQPQQVPSPYHTPPPSQTSKQSGKA 86
P+ QP Q VP +T P +Q G+A
Sbjct: 52 PAEQPAPPAQPVPPVQNTAPLNQPHTIDGQA 82
>gnl|CDD|234977 PRK01741, PRK01741, cell division protein ZipA; Provisional.
Length = 332
Score = 30.1 bits (68), Expect = 4.6
Identities = 14/80 (17%), Positives = 28/80 (35%), Gaps = 7/80 (8%)
Query: 10 DPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSP---IQQHSLSPHTQAPSPQ---PPAQ 63
PQ + P Q ++ + Q + + ++ Q P+ ++
Sbjct: 61 LNPQSYVETTPPPFQQPQTEESESENEVQIQQEVEQSVDEIKITLPNQEPAYYMQNHRSE 120
Query: 64 PQQVPSPYHTPPPSQTSKQS 83
P Q P+ P+QT+ S
Sbjct: 121 PIQ-PTQPQYQSPTQTNVAS 139
>gnl|CDD|152349 pfam11914, DUF3432, Domain of unknown function (DUF3432). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is about 100
amino acids in length. This domain is found associated
with pfam00096. This domain has two conserved sequence
motifs: YPSPV and PSP.
Length = 100
Score = 28.2 bits (62), Expect = 5.0
Identities = 13/36 (36%), Positives = 18/36 (50%)
Query: 42 SPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPS 77
I +S SP + PSP + P VP+ Y +P S
Sbjct: 12 PNISIYSSSPVSSYPSPIATSYPSPVPTSYSSPVSS 47
>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
This family consists of several Nucleopolyhedrovirus
capsid protein P87 sequences. P87 is expressed late in
infection and concentrated in infected cell nuclei.
Length = 606
Score = 29.9 bits (67), Expect = 5.4
Identities = 14/62 (22%), Positives = 24/62 (38%), Gaps = 5/62 (8%)
Query: 29 QQPPVSPQHQNMMSPIQQHSL-SPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQSGKAP 87
P++ + ++ +P+Q ++ S QAP P+P PS S P
Sbjct: 288 ISTPMTEEIKSWQTPLQTPAMYSSDYQAPKPEP---IYTWEELLRERFPSDLFAISSL-P 343
Query: 88 DK 89
D
Sbjct: 344 DS 345
>gnl|CDD|118131 pfam09595, Metaviral_G, Metaviral_G glycoprotein. This is a viral
attachment glycoprotein from region G of metaviruses. It
is high in serine and threonine suggesting it is highly
glycosylated.
Length = 183
Score = 29.2 bits (65), Expect = 5.4
Identities = 15/68 (22%), Positives = 26/68 (38%), Gaps = 9/68 (13%)
Query: 31 PPVSPQHQNMMSPIQQHSLSPHTQAPSPQ-------PPAQPQQVPSPYHTPPPSQTSKQS 83
PP + +P +P++Q P+ Q P A + T P T++Q
Sbjct: 35 PPTESSKKTPTTPTDNPDTNPNSQHPTQQSTESSTLPAATSESHLETEPTSTPDTTNRQQ 94
Query: 84 GKAPDKNT 91
D++T
Sbjct: 95 --TVDRHT 100
>gnl|CDD|218108 pfam04487, CITED, CITED. CITED, CBP/p300-interacting
transactivator with ED-rich tail, are characterized by a
conserved 32-amino acid sequence at the C-terminus.
CITED proteins do not bind DNA directly and are thought
to function as transcriptional co-activators.
Length = 206
Score = 29.1 bits (65), Expect = 5.6
Identities = 17/77 (22%), Positives = 24/77 (31%), Gaps = 2/77 (2%)
Query: 3 QLPSLMDDPPQ--ESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQP 60
+M +P + F VP Q S Q ++ Q+Q H Q P
Sbjct: 76 MPAYMMFNPSSKPQPFMLVPGPQLMASMQLQKLNTQYQGHAGAPAGHPGGGGPQQFRPGA 135
Query: 61 PAQPQQVPSPYHTPPPS 77
P P PP+
Sbjct: 136 GQPPGMQHMPAPALPPN 152
>gnl|CDD|217495 pfam03326, Herpes_TAF50, Herpesvirus transcription activation
factor (transactivator). This family includes EBV BRLF1
and similar ORF 50 proteins from other herpesviruses.
Length = 500
Score = 29.8 bits (67), Expect = 5.7
Identities = 17/84 (20%), Positives = 22/84 (26%), Gaps = 8/84 (9%)
Query: 12 PQESFH---SVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPS-----PQPPAQ 63
PQ S P T PP + + Q + A
Sbjct: 300 PQSSDSRPSCFPAPSTTQPTFLPPNTNKKAKRDRRPQMVTPKQEGGAAVSQNHDGGTVRA 359
Query: 64 PQQVPSPYHTPPPSQTSKQSGKAP 87
P+ PS PPS + S A
Sbjct: 360 PRGRPSGSGQSPPSNSPLLSSLAD 383
>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
Length = 991
Score = 30.0 bits (67), Expect = 5.7
Identities = 20/68 (29%), Positives = 29/68 (42%), Gaps = 12/68 (17%)
Query: 27 SPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQSGKA 86
+PQ PP +P +P Q+ AP+PQPP PQ P+ P + +Q
Sbjct: 775 TPQPPPQAPP-----APQQRPR-----GAPTPQPP--PQAGPTSMQLMPRAAPGQQGPTK 822
Query: 87 PDKNTLIT 94
L+T
Sbjct: 823 QILRQLLT 830
Score = 29.3 bits (65), Expect = 9.8
Identities = 17/70 (24%), Positives = 23/70 (32%), Gaps = 9/70 (12%)
Query: 11 PPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQP--PAQPQQVP 68
PP + +PP + + +P P P P P PQQ P
Sbjct: 736 PPAAAPGRARPPAAAPGRARPP-----AAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRP 790
Query: 69 --SPYHTPPP 76
+P PPP
Sbjct: 791 RGAPTPQPPP 800
>gnl|CDD|221182 pfam11715, Nup160, Nucleoporin Nup120/160. Nup120 is conserved
from fungi to plants to humans, and is homologous with
the Nup160 of vertebrates. The nuclear core complex, or
NPC, mediates macromolecular transport across the
nuclear envelope. Deletion of the NUP120 gene causes
clustering of NPCs at one side of the nuclear envelope,
moderate nucleolar fragmentation and slower cell growth.
The vertebrate NPC is estimated to contain between 30
and 60 different proteins. most of which are not known.
Two important ones in creating the nucleoporin basket
are Nup98 and Nup153, and Nup120, in conjunction with
Nup 133, interacts with these two and itself plays a
role in mRNA export. Nup160, Nup133, Nup96, and Nup107
are all targets of phosphorylation. The phosphorylation
sites are clustered mainly at the N-terminal regions of
these proteins, which are predicted to be natively
disordered. The entire Nup107-160 subcomplex is stable
throughout the cell cycle, thus it seems unlikely that
phosphorylation affects interactions within the
Nup107-160 subcomplex, but rather that it regulates the
association of the subcomplex with the NPC and other
proteins.
Length = 511
Score = 29.8 bits (67), Expect = 6.0
Identities = 9/32 (28%), Positives = 16/32 (50%), Gaps = 1/32 (3%)
Query: 546 DVDCVQFHPNSNYVATGSSDRTIRLWDCVTGS 577
V + + ++ T S D T+R+W+ TG
Sbjct: 223 IVSSSEVN-GQTFLFTLSLDGTLRVWNLDTGQ 253
>gnl|CDD|215187 PLN02328, PLN02328, lysine-specific histone demethylase 1 homolog.
Length = 808
Score = 30.0 bits (67), Expect = 6.2
Identities = 17/83 (20%), Positives = 28/83 (33%), Gaps = 9/83 (10%)
Query: 9 DDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHT---------QAPSPQ 59
P+ P + SP+ Q+ ++ +Q L P+T Q
Sbjct: 22 ASSPETDLSLSPSQSEQNIENDGQNSPETQSPLTELQPSPLPPNTTLDAPVSDSQGDESS 81
Query: 60 PPAQPQQVPSPYHTPPPSQTSKQ 82
QPQ S PPP + ++
Sbjct: 82 SEQQPQNPNSTEPAPPPKKRRRR 104
>gnl|CDD|240424 PTZ00455, PTZ00455, 3-ketoacyl-CoA thiolase; Provisional.
Length = 438
Score = 29.5 bits (66), Expect = 6.5
Identities = 16/41 (39%), Positives = 24/41 (58%), Gaps = 8/41 (19%)
Query: 107 TEEQLKKEINVNDLGIDPNDSEIGDLLSAYESEGDPNLYED 147
+EE L+K +G+ PNDS + ++ S + G NLYED
Sbjct: 269 SEEGLQK------MGLSPNDSRLVEIKSLACASG--NLYED 301
>gnl|CDD|218397 pfam05044, Prox1, Homeobox prospero-like protein (PROX1). The
homeobox gene Prox1 is expressed in a subpopulation of
endothelial cells that, after budding from veins, gives
rise to the mammalian lymphatic system. Prox1 has been
found to be an early specific marker for the developing
liver and pancreas in the mammalian foregut endoderm.
This family contains an atypical homeobox domain.
Length = 908
Score = 29.7 bits (66), Expect = 6.6
Identities = 29/158 (18%), Positives = 51/158 (32%), Gaps = 32/158 (20%)
Query: 12 PQESFHSV---PIIQQTLSP---QQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQ 65
P++ H V I +T+S + V P P+ SLS P P P
Sbjct: 581 PKKKRHKVTDTRITPRTVSRILALRDAVGPAAGTHHQPLHPSSLSASM-GFHPPPFRHPF 639
Query: 66 QVPS------------------PYHTP----PPSQTSKQSGKAPDKNTLITMLK---LLR 100
+P P+ +P P + +PD T L L
Sbjct: 640 PLPLTVAIPNPSLHQSEVFMGYPFQSPHLGAPSGSPPGKDRDSPDLPRPTTSLHPKLLSA 699
Query: 101 QYNLRGTEEQLKKEINVNDLGIDPNDSEIGDLLSAYES 138
++ + + L + ++ G + S+I + Y+
Sbjct: 700 HHHPGSSPDGLSLSLIKSECGDLQDMSDISSADAPYDG 737
>gnl|CDD|220093 pfam09030, Creb_binding, Creb binding. The Creb binding domain
assumes a structure comprising of three alpha-helices
which pack in a bundle, exposing a hydrophobic groove
between alpha-1 and alpha-3 within which complimentary
domains found in the protein 'activator for thyroid
hormone and retinoid receptors' (ACTR) can dock.
Docking of these domains is required for the
recruitment of RNA polymerase II and the basal
transcription machinery.
Length = 104
Score = 27.7 bits (61), Expect = 6.8
Identities = 9/47 (19%), Positives = 10/47 (21%), Gaps = 1/47 (2%)
Query: 23 QQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPS 69
Q P M P+ Q P P QP
Sbjct: 4 GQWPQGPLPQQQQMQPGMPRPVMQMVAQHAVAGPRPGLV-QPGISRG 49
>gnl|CDD|130324 TIGR01257, rim_protein, retinal-specific rim ABC transporter. This
model describes the photoreceptor protein (rim protein)
in eukaryotes. It is the member of ABC transporter
superfamily. Rim protein is a membrane glycoprotein which
is localized in the photoreceptor outer segment discs.
Mutation/s in its genetic loci is implicated in the
recessive Stargardt's disease [Transport and binding
proteins, Other].
Length = 2272
Score = 29.6 bits (66), Expect = 7.7
Identities = 14/53 (26%), Positives = 24/53 (45%), Gaps = 2/53 (3%)
Query: 23 QQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSP--QPPAQPQQVPSPYHT 73
++ + + P P + +P H+ SP A P QPP +P+ P +T
Sbjct: 1294 RENANLRHPCSGPTEKAGQTPQASHTCSPGQPAAHPEGQPPPEPEDPGVPLNT 1346
>gnl|CDD|220401 pfam09786, CytochromB561_N, Cytochrome B561, N terminal. Members
of this family are found in the N terminal region of
cytochrome B561, as well as in various other putative
uncharacterized proteins.
Length = 559
Score = 29.4 bits (66), Expect = 7.8
Identities = 22/112 (19%), Positives = 28/112 (25%), Gaps = 9/112 (8%)
Query: 3 QLPSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPA 62
S D P S + + QQ SPQ P S S P
Sbjct: 137 PGHSSFSDSPSRSASPSRKFSPSSTIQQ---SPQLTPSNKPASPSSSYQSPSYSSSLGPV 193
Query: 63 QPQQVPSPYHTPPPSQTSKQSGKAPDKNTLITMLKLLRQYNLRGTEEQLKKE 114
S P DK + T K L + L +E+
Sbjct: 194 NSSGNRSNL-RSSPWALRSSG----DKKDITTDEKYLETF-LAEVDEEQHMI 239
>gnl|CDD|236776 PRK10856, PRK10856, cytoskeletal protein RodZ; Provisional.
Length = 331
Score = 29.2 bits (66), Expect = 8.3
Identities = 17/91 (18%), Positives = 29/91 (31%), Gaps = 10/91 (10%)
Query: 5 PSLMDDPPQESFHSVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQP 64
+ P + + P V PQ +++P Q + + T AP+
Sbjct: 177 APAAPVDTTPTNSQTPAVA---TAPAPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDG 233
Query: 65 QQVPSPYHTPPPSQTSKQSGKAPDKNTLITM 95
P P+ + S A D N L+
Sbjct: 234 A-------APLPTDQAGVSTPAADPNALVMN 257
>gnl|CDD|151935 pfam11498, Activator_LAG-3, Transcriptional activator LAG-3. The
C.elegans Notch pathway, involved in the control of
growth, differentiation and patterning in animal
development, relies on either of the receptors GLP-1 or
LIN-12. Both these receptors promote signalling by the
recruitment of LAG-3 to target promoters, where it then
acts as a transcriptional activator. LAG-3 works as a
ternary complex together with the DNA binding protein,
LAG-1.
Length = 476
Score = 29.2 bits (64), Expect = 8.5
Identities = 23/78 (29%), Positives = 32/78 (41%), Gaps = 12/78 (15%)
Query: 23 QQTLSPQQP--PVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPY---HTPPPS 77
QQ S QQ ++ QH + M Q H Q PS VP+P H P P+
Sbjct: 163 QQYHSQQQQYIMMAAQHHHFMGMQQIHH-----QMPSTSSADAIHSVPTPAGSIHQPSPA 217
Query: 78 QTSKQSGKAPDKNTLITM 95
+ Q+G +N + M
Sbjct: 218 EM--QNGCGMSRNATMDM 233
>gnl|CDD|237030 PRK12270, kgd, alpha-ketoglutarate decarboxylase; Reviewed.
Length = 1228
Score = 29.5 bits (67), Expect = 8.6
Identities = 10/74 (13%), Positives = 20/74 (27%), Gaps = 1/74 (1%)
Query: 27 SPQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQSGKA 86
+ S + A +P P +P + P + + + A
Sbjct: 49 AAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAA-AAAPAAPPAAAAAAA 107
Query: 87 PDKNTLITMLKLLR 100
P + + LR
Sbjct: 108 PAAAAVEDEVTPLR 121
>gnl|CDD|200219 TIGR02927, SucB_Actino, 2-oxoglutarate dehydrogenase, E2 component,
dihydrolipoamide succinyltransferase. This model
represents an Actinobacterial clade of E2 enzyme, a
component of the 2-oxoglutarate dehydrogenase complex
involved in the TCA cycle. These proteins have multiple
domains including the catalytic domain (pfam00198), one
or two biotin domains (pfam00364) and an E3-component
binding domain (pfam02817).
Length = 579
Score = 29.2 bits (65), Expect = 8.7
Identities = 8/56 (14%), Positives = 13/56 (23%)
Query: 28 PQQPPVSPQHQNMMSPIQQHSLSPHTQAPSPQPPAQPQQVPSPYHTPPPSQTSKQS 83
P + + P + AP P PA + P +
Sbjct: 210 PAEEEAPAPSEAGSEPAPDPAARAPHAAPDPPAPAPAPAKTAAPAAAAPVSSGDSG 265
>gnl|CDD|165527 PHA03269, PHA03269, envelope glycoprotein C; Provisional.
Length = 566
Score = 29.3 bits (65), Expect = 9.3
Identities = 19/79 (24%), Positives = 32/79 (40%), Gaps = 16/79 (20%)
Query: 18 SVPIIQQTLSPQQPPVSPQHQNMMSPIQQHSLSP-----HTQAPSPQPPAQPQQVPSP-- 70
+P + + + Q+P +P P Q S +P T A S +P Q P+P
Sbjct: 27 PIPELHTSAATQKPDPAPA------PHQAASRAPDPAVAPTSAASRKP--DLAQAPTPAA 78
Query: 71 YHTPPPS-QTSKQSGKAPD 88
P+ + + +APD
Sbjct: 79 SEKFDPAPAPHQAASRAPD 97
>gnl|CDD|227546 COG5221, DOP1, Dopey and related predicted leucine zipper
transcription factors [Transcription].
Length = 1618
Score = 29.1 bits (65), Expect = 9.8
Identities = 14/55 (25%), Positives = 26/55 (47%)
Query: 454 RTLVGHEGPVRKLSFSPDRSLLLSCSQDNTIRLWSLLLWRCLVVYKGHGHPVWDI 508
+ ++K F R L+L S D+ LW +LL+ +V + + P +D+
Sbjct: 1437 TLMNSPSSKLKKEVFLLSRMLILRISHDHLGNLWPILLYDLGIVVESYESPDFDV 1491
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.316 0.133 0.397
Gapped
Lambda K H
0.267 0.0792 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 35,978,354
Number of extensions: 3512614
Number of successful extensions: 6012
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5150
Number of HSP's successfully gapped: 331
Length of query: 711
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 607
Effective length of database: 6,324,786
Effective search space: 3839145102
Effective search space used: 3839145102
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 63 (27.9 bits)