RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy10114
(819 letters)
>gnl|CDD|238121 cd00200, WD40, WD40 domain, found in a number of eukaryotic
proteins that cover a wide variety of functions
including adaptor/regulatory modules in signal
transduction, pre-mRNA processing and cytoskeleton
assembly; typically contains a GH dipeptide 11-24
residues from its N-terminus and the WD dipeptide at its
C-terminus and is 40 residues long, hence the name WD40;
between GH and WD lies a conserved core; serves as a
stable propeller-like platform to which proteins can
bind either stably or reversibly; forms a propeller-like
structure with several blades where each blade is
composed of a four-stranded anti-parallel b-sheet;
instances with few detectable copies are hypothesized to
form larger structures by dimerization; each WD40
sequence repeat forms the first three strands of one
blade and the last strand in the next blade; the last
C-terminal WD40 repeat completes the blade structure of
the first WD40 repeat to create the closed ring
propeller-structure; residues on the top and bottom
surface of the propeller are proposed to coordinate
interactions with other proteins and/or small ligands; 7
copies of the repeat are present in this alignment.
Length = 289
Score = 151 bits (383), Expect = 8e-41
Identities = 85/288 (29%), Positives = 138/288 (47%), Gaps = 22/288 (7%)
Query: 519 VDWSSQN--VLSVGLGSCVYLWSACTSQV-TRLCDLSADGNSVTSVAWNERGNLVAVGTH 575
V +S + + + +W T ++ L + V + A G +A G+
Sbjct: 15 VAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASAD---GTYLASGSS 71
Query: 576 HGYVQVWDVSVAKQVHKLVGHTARVGALAW--NGDMLSSGSRDRMILQRDVRTPNSQSER 633
+++WD+ + V L GHT+ V ++A+ +G +LSS SRD+ I DV T +
Sbjct: 72 DKTIRLWDLETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETG--KCLT 129
Query: 634 RLVGHRQEVCGLKWSPDNQYLASGGNDNRLYVWNLHSMSPLQTYTEHLAAVKAIAWSPHH 693
L GH V + +SPD ++AS D + +W+L + + T T H V ++A+SP
Sbjct: 130 TLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDG 189
Query: 694 HGLLASGGGTADRCIRFWNTLTGQPMQCVDT----GSQVCNLAWSKHSSELVSTHGYSQN 749
LL+S D I+ W+ TG+ C+ T + V ++A+S L S G
Sbjct: 190 EKLLSSSS---DGTIKLWDLSTGK---CLGTLRGHENGVNSVAFSPDGYLLAS--GSEDG 241
Query: 750 QILVWKYPTLTQVAKLTGHSYRVLYLAMSPDGEAIVTGAGDETLRFWN 797
I VW T V L+GH+ V LA SPDG+ + +G+ D T+R W+
Sbjct: 242 TIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289
Score = 146 bits (371), Expect = 4e-39
Identities = 80/254 (31%), Positives = 127/254 (50%), Gaps = 12/254 (4%)
Query: 556 NSVTSVAWNERGNLVAVGTHHGYVQVWDVSVAKQVHKLVGHTARVGALAW--NGDMLSSG 613
VT VA++ G L+A G+ G ++VWD+ + + L GHT V +A +G L+SG
Sbjct: 10 GGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASG 69
Query: 614 SRDRMILQRDVRTPNSQSERRLVGHRQEVCGLKWSPDNQYLASGGNDNRLYVWNLHSMSP 673
S D+ I D+ T + R L GH V + +SPD + L+S D + VW++ +
Sbjct: 70 SSDKTIRLWDLETG--ECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKC 127
Query: 674 LQTYTEHLAAVKAIAWSPHHHGLLASGGGTADRCIRFWNTLTGQPMQCVDTG--SQVCNL 731
L T H V ++A+SP +AS D I+ W+ TG+ + TG +V ++
Sbjct: 128 LTTLRGHTDWVNSVAFSPDGT-FVASSS--QDGTIKLWDLRTGKCVA-TLTGHTGEVNSV 183
Query: 732 AWSKHSSELVSTHGYSQNQILVWKYPTLTQVAKLTGHSYRVLYLAMSPDGEAIVTGAGDE 791
A+S +L+S S I +W T + L GH V +A SPDG + +G+ D
Sbjct: 184 AFSPDGEKLLS--SSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGSEDG 241
Query: 792 TLRFWNVFSKVRSQ 805
T+R W++ + Q
Sbjct: 242 TIRVWDLRTGECVQ 255
Score = 134 bits (339), Expect = 7e-35
Identities = 66/212 (31%), Positives = 110/212 (51%), Gaps = 10/212 (4%)
Query: 590 VHKLVGHTARVGALAWN--GDMLSSGSRDRMILQRDVRTPNSQSERRLVGHRQEVCGLKW 647
L GHT V +A++ G +L++GS D I D+ T + R L GH V +
Sbjct: 2 RRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETG--ELLRTLKGHTGPVRDVAA 59
Query: 648 SPDNQYLASGGNDNRLYVWNLHSMSPLQTYTEHLAAVKAIAWSPHHHGLLASGGGTADRC 707
S D YLASG +D + +W+L + ++T T H + V ++A+SP +L+S D+
Sbjct: 60 SADGTYLASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSPDGR-ILSSSS--RDKT 116
Query: 708 IRFWNTLTGQPMQCVDT-GSQVCNLAWSKHSSELVSTHGYSQNQILVWKYPTLTQVAKLT 766
I+ W+ TG+ + + V ++A+S V++ I +W T VA LT
Sbjct: 117 IKVWDVETGKCLTTLRGHTDWVNSVAFS-PDGTFVAS-SSQDGTIKLWDLRTGKCVATLT 174
Query: 767 GHSYRVLYLAMSPDGEAIVTGAGDETLRFWNV 798
GH+ V +A SPDGE +++ + D T++ W++
Sbjct: 175 GHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDL 206
Score = 111 bits (280), Expect = 4e-27
Identities = 53/172 (30%), Positives = 80/172 (46%), Gaps = 12/172 (6%)
Query: 631 SERRLVGHRQEVCGLKWSPDNQYLASGGNDNRLYVWNLHSMSPLQTYTEHLAAVKAIAWS 690
R L GH V + +SPD + LA+G D + VW+L + L+T H V+ +A S
Sbjct: 1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAAS 60
Query: 691 PHHHGLLASGGGTADRCIRFWNTLTGQPMQCVDT----GSQVCNLAWSKHSSELVSTHGY 746
L + D+ IR W+ TG +CV T S V ++A+S L S
Sbjct: 61 ADGTYLASGSS---DKTIRLWDLETG---ECVRTLTGHTSYVSSVAFSPDGRILSS--SS 112
Query: 747 SQNQILVWKYPTLTQVAKLTGHSYRVLYLAMSPDGEAIVTGAGDETLRFWNV 798
I VW T + L GH+ V +A SPDG + + + D T++ W++
Sbjct: 113 RDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDL 164
>gnl|CDD|225201 COG2319, COG2319, FOG: WD40 repeat [General function prediction
only].
Length = 466
Score = 144 bits (362), Expect = 1e-36
Identities = 87/313 (27%), Positives = 156/313 (49%), Gaps = 15/313 (4%)
Query: 515 YLNLVDWSSQN---VLSVGLGSCVYLWSACTSQVTRLCDLSADGNSVTSVAWNERGNLVA 571
+ + +S L + LW T + L+ + V+S+A++ G L+
Sbjct: 157 SVTSLAFSPDGKLLASGSSLDGTIKLWDLRTGKPLST--LAGHTDPVSSLAFSPDGGLLI 214
Query: 572 VGTHH-GYVQVWDVSVAKQVH-KLVGHTAR-VGALAWNGDMLSSGSRDRMILQRDVRTPN 628
G +++WD+S K + L GH+ V + + +G +L+SGS D I D+R+ +
Sbjct: 215 ASGSSDGTIRLWDLSTGKLLRSTLSGHSDSVVSSFSPDGSLLASGSSDGTIRLWDLRS-S 273
Query: 629 SQSERRLVGHRQEVCGLKWSPDNQYLASGGNDNRLYVWNLHSMSPLQ--TYTEHLAAVKA 686
S R L GH V + +SPD + LASG +D + +W+L + L T H V +
Sbjct: 274 SSLLRTLSGHSSSVLSVAFSPDGKLLASGSSDGTVRLWDLETGKLLSSLTLKGHEGPVSS 333
Query: 687 IAWSPHHHGLLASGGGTADRCIRFWNTLTGQPMQCVDTGSQVCNLAWSKHSSELVSTHGY 746
+++SP L++ GG+ D IR W+ TG+P++ ++ S V ++++S V + G
Sbjct: 334 LSFSPDGSLLVS--GGSDDGTIRLWDLRTGKPLKTLEGHSNVLSVSFSPD--GRVVSSGS 389
Query: 747 SQNQILVWKYPTLTQVAKLTGHSYRVLYLAMSPDGEAIVTGAGDETLRFWNVFSKVRSQR 806
+ + +W T + + L GH+ RV L SPDG+++ +G+ D T+R W++ + ++S
Sbjct: 390 TDGTVRLWDLSTGSLLRNLDGHTSRVTSLDFSPDGKSLASGSSDNTIRLWDLKTSLKSVS 449
Query: 807 ESKSVLNLFSSIR 819
S L S
Sbjct: 450 FSPDGKVLASKSS 462
Score = 95.9 bits (237), Expect = 1e-20
Identities = 85/337 (25%), Positives = 144/337 (42%), Gaps = 23/337 (6%)
Query: 477 SPVGPKSQKLLRSPRKATRKISRIPFKVLDAPELQDDFYLNLVDWSSQNVLSVGLGSCVY 536
S G LL + +S + + +D + +LS +
Sbjct: 36 SESGILLLALLSDSLVSLPDLSSLLLR-----GHEDSITSIAFSPDGELLLSGSSDGTIK 90
Query: 537 LWSACTSQVTRLCDLSADGNSVTSVAW---NERGNLVAVGTHHGYVQVWDVS-VAKQVHK 592
LW + +SV+ +A + L+A + G V++WD+S K +
Sbjct: 91 LWDLDNGEKLIKSLEGLHDSSVSKLALSSPDGNSILLASSSLDGTVKLWDLSTPGKLIRT 150
Query: 593 LVGHTARVGALAW--NGDMLSSGSRDRMILQRDVRTPNSQSERRLVGHRQEVCGLKWSPD 650
L GH+ V +LA+ +G +L+SGS + + + L GH V L +SPD
Sbjct: 151 LEGHSESVTSLAFSPDGKLLASGSSLDGTI-KLWDLRTGKPLSTLAGHTDPVSSLAFSPD 209
Query: 651 NQ-YLASGGNDNRLYVWNLHSMSPLQTYTEHLAAVKAIAWSPHHHGLLASGGGTADRCIR 709
+ASG +D + +W+L + L++ + ++SP LLASG +D IR
Sbjct: 210 GGLLIASGSSDGTIRLWDLSTGKLLRSTLSGHSDSVVSSFSPDGS-LLASGS--SDGTIR 266
Query: 710 FWNTLTGQPMQCVDTG--SQVCNLAWSKHSSELVSTHGYSQNQILVWKYPTLTQVAKLT- 766
W+ + + +G S V ++A+S L S G S + +W T ++ LT
Sbjct: 267 LWDLRSSSSLLRTLSGHSSSVLSVAFSPDGKLLAS--GSSDGTVRLWDLETGKLLSSLTL 324
Query: 767 -GHSYRVLYLAMSPDGEAIVTGAG-DETLRFWNVFSK 801
GH V L+ SPDG +V+G D T+R W++ +
Sbjct: 325 KGHEGPVSSLSFSPDGSLLVSGGSDDGTIRLWDLRTG 361
Score = 78.2 bits (191), Expect = 4e-15
Identities = 54/226 (23%), Positives = 90/226 (39%), Gaps = 11/226 (4%)
Query: 601 GALAWNGDMLSSGSRDRMILQRDVRTPNSQSERRLVGHRQEVCGLKWSPDNQYLASGGND 660
+L G S ++ V P+ S L GH + + +SPD + L SG +D
Sbjct: 28 LSLLSLGSSESGILLLALLSDSLVSLPDLSS-LLLRGHEDSITSIAFSPDGELLLSGSSD 86
Query: 661 NRLYVWNLHSMSPL--QTYTEHLAAVKAIAWSPHHHGLLASGGGTADRCIRFWN--TLTG 716
+ +W+L + L H ++V +A S + + D ++ W+ T
Sbjct: 87 GTIKLWDLDNGEKLIKSLEGLHDSSVSKLALSSPDGNSILLASSSLDGTVKLWDLSTPGK 146
Query: 717 QPMQCVDTGSQVCNLAWSKHSSELVSTHGYSQNQILVWKYPTLTQVAKLTGHSYRVLYLA 776
V +LA+S L S I +W T ++ L GH+ V LA
Sbjct: 147 LIRTLEGHSESVTSLAFSPDGKLLASGSSLD-GTIKLWDLRTGKPLSTLAGHTDPVSSLA 205
Query: 777 MSPDGEAIVTGAG-DETLRFWNVFSKVRSQR----ESKSVLNLFSS 817
SPDG ++ D T+R W++ + + S SV++ FS
Sbjct: 206 FSPDGGLLIASGSSDGTIRLWDLSTGKLLRSTLSGHSDSVVSSFSP 251
>gnl|CDD|201208 pfam00400, WD40, WD domain, G-beta repeat.
Length = 39
Score = 47.0 bits (112), Expect = 3e-07
Identities = 15/38 (39%), Positives = 21/38 (55%)
Query: 630 QSERRLVGHRQEVCGLKWSPDNQYLASGGNDNRLYVWN 667
+ R L GH V + +SPD LASG +D + VW+
Sbjct: 2 KLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39
Score = 40.0 bits (94), Expect = 1e-04
Identities = 14/38 (36%), Positives = 22/38 (57%)
Query: 760 TQVAKLTGHSYRVLYLAMSPDGEAIVTGAGDETLRFWN 797
+ L GH+ V +A SPDG + +G+ D T+R W+
Sbjct: 2 KLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39
Score = 39.3 bits (92), Expect = 1e-04
Identities = 15/42 (35%), Positives = 22/42 (52%), Gaps = 3/42 (7%)
Query: 671 MSPLQTYTEHLAAVKAIAWSPHHHGLLASGGGTADRCIRFWN 712
L+T H V ++A+SP + LLASG D +R W+
Sbjct: 1 GKLLRTLKGHTGPVTSVAFSPDGN-LLASGS--DDGTVRVWD 39
Score = 34.3 bits (79), Expect = 0.010
Identities = 16/33 (48%), Positives = 21/33 (63%)
Query: 551 LSADGNSVTSVAWNERGNLVAVGTHHGYVQVWD 583
L VTSVA++ GNL+A G+ G V+VWD
Sbjct: 7 LKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39
Score = 33.1 bits (76), Expect = 0.025
Identities = 13/34 (38%), Positives = 22/34 (64%), Gaps = 2/34 (5%)
Query: 588 KQVHKLVGHTARVGALAW--NGDMLSSGSRDRMI 619
K + L GHT V ++A+ +G++L+SGS D +
Sbjct: 2 KLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTV 35
>gnl|CDD|197651 smart00320, WD40, WD40 repeats. Note that these repeats are
permuted with respect to the structural repeats (blades)
of the beta propeller domain.
Length = 40
Score = 46.9 bits (112), Expect = 4e-07
Identities = 14/40 (35%), Positives = 24/40 (60%)
Query: 628 NSQSERRLVGHRQEVCGLKWSPDNQYLASGGNDNRLYVWN 667
+ + + L GH V + +SPD +YLASG +D + +W+
Sbjct: 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
Score = 40.4 bits (95), Expect = 6e-05
Identities = 13/40 (32%), Positives = 24/40 (60%)
Query: 758 TLTQVAKLTGHSYRVLYLAMSPDGEAIVTGAGDETLRFWN 797
+ + L GH+ V +A SPDG+ + +G+ D T++ W+
Sbjct: 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
Score = 40.0 bits (94), Expect = 1e-04
Identities = 15/43 (34%), Positives = 21/43 (48%), Gaps = 3/43 (6%)
Query: 670 SMSPLQTYTEHLAAVKAIAWSPHHHGLLASGGGTADRCIRFWN 712
S L+T H V ++A+SP LASG D I+ W+
Sbjct: 1 SGELLKTLKGHTGPVTSVAFSPDGK-YLASGS--DDGTIKLWD 40
Score = 33.4 bits (77), Expect = 0.018
Identities = 12/33 (36%), Positives = 19/33 (57%)
Query: 551 LSADGNSVTSVAWNERGNLVAVGTHHGYVQVWD 583
L VTSVA++ G +A G+ G +++WD
Sbjct: 8 LKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
Score = 32.3 bits (74), Expect = 0.054
Identities = 13/34 (38%), Positives = 20/34 (58%), Gaps = 2/34 (5%)
Query: 588 KQVHKLVGHTARVGALAW--NGDMLSSGSRDRMI 619
+ + L GHT V ++A+ +G L+SGS D I
Sbjct: 3 ELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTI 36
>gnl|CDD|177776 PLN00181, PLN00181, protein SPA1-RELATED; Provisional.
Length = 793
Score = 42.4 bits (99), Expect = 0.001
Identities = 44/196 (22%), Positives = 85/196 (43%), Gaps = 25/196 (12%)
Query: 622 RDVRTPNSQSERRLVGHRQEVCGLKWSPDNQYL----ASGGNDNRLYVWNLHSMSPLQTY 677
RD+ P + + R ++ G+ W N Y+ AS + + VW++ +
Sbjct: 520 RDIHYPVVE-----LASRSKLSGICW---NSYIKSQVASSNFEGVVQVWDVARSQLVTEM 571
Query: 678 TEHLAAVKAIAWSPHHHGLLASGGGTADRCIRFWNTLTGQPMQCVDTGSQVCNLAWSKHS 737
EH V +I +S LLAS G+ D ++ W+ G + + T + +C + + S
Sbjct: 572 KEHEKRVWSIDYSSADPTLLAS--GSDDGSVKLWSINQGVSIGTIKTKANICCVQFPSES 629
Query: 738 SELVSTHGYSQNQILVW-----KYPTLTQVAKLTGHSYRVLYLAMSPDGEAIVTGAGDET 792
++ G + +++ + K P T + GHS V Y+ D +V+ + D T
Sbjct: 630 GRSLA-FGSADHKVYYYDLRNPKLPLCTMI----GHSKTVSYVRFV-DSSTLVSSSTDNT 683
Query: 793 LRFWNVFSKVRSQRES 808
L+ W++ + E+
Sbjct: 684 LKLWDLSMSISGINET 699
Score = 38.9 bits (90), Expect = 0.011
Identities = 34/141 (24%), Positives = 68/141 (48%), Gaps = 9/141 (6%)
Query: 553 ADGNSVTSVAWNER-GNLVAVGTHHGYVQVWDVSVAKQVHKLVGHTARVGALAWNG---D 608
A + ++ + WN + VA G VQVWDV+ ++ V ++ H RV ++ ++
Sbjct: 530 ASRSKLSGICWNSYIKSQVASSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSADPT 589
Query: 609 MLSSGSRDRMILQRDVRTPNSQSERRLVGHRQEVCGLKWSPDN-QYLASGGNDNRLYVWN 667
+L+SGS D + + S + + +C +++ ++ + LA G D+++Y ++
Sbjct: 590 LLASGSDDGSVKLWSINQGVSIGT---IKTKANICCVQFPSESGRSLAFGSADHKVYYYD 646
Query: 668 LHSMS-PLQTYTEHLAAVKAI 687
L + PL T H V +
Sbjct: 647 LRNPKLPLCTMIGHSKTVSYV 667
>gnl|CDD|240412 PTZ00420, PTZ00420, coronin; Provisional.
Length = 568
Score = 38.8 bits (90), Expect = 0.012
Identities = 37/138 (26%), Positives = 68/138 (49%), Gaps = 20/138 (14%)
Query: 543 SQVTRLCDLSADGNSVTSVAWN-ERGNLVAVGTHHGYVQVWDVSVAKQVHKLVGHTARVG 601
S+V C ++ V +V W E G L+ G +++ + V KL GHT+ +
Sbjct: 26 SRVIDSCGIACSSGFV-AVPWEVEGGGLI------GAIRLENQMRKPPVIKLKGHTSSIL 78
Query: 602 ALAWN---GDMLSSGSRDRMILQRDVRTPNSQSERR-------LVGHRQEVCGLKWSPDN 651
L +N ++L+SGS D I ++ N +S + L GH++++ + W+P N
Sbjct: 79 DLQFNPCFSEILASGSEDLTIRVWEIPH-NDESVKEIKDPQCILKGHKKKISIIDWNPMN 137
Query: 652 QY-LASGGNDNRLYVWNL 668
Y + S G D+ + +W++
Sbjct: 138 YYIMCSSGFDSFVNIWDI 155
>gnl|CDD|227668 COG5377, COG5377, Phage-related protein, predicted endonuclease
[DNA replication, recombination, and repair].
Length = 319
Score = 35.3 bits (81), Expect = 0.11
Identities = 17/83 (20%), Positives = 29/83 (34%), Gaps = 6/83 (7%)
Query: 270 PPKLLHTSSASHTPKRHHAATEENSNTDLDSRKLSKVQRNLMTHDFSSHRNKHLDQNMNM 329
PP L +S+A +A TE DL + S R+ ++
Sbjct: 198 PPALDGSSAAEKYLALRYAKTEAVKTVDLPHFYAEALAVLH-----SIKRDISELEDFAK 252
Query: 330 DLENE-NNEMEMENLATEDENTA 351
++EN NE++ + A
Sbjct: 253 EIENRLKNELKEAEYGFTADYDA 275
>gnl|CDD|227829 COG5542, COG5542, Predicted integral membrane protein [Function
unknown].
Length = 420
Score = 33.5 bits (77), Expect = 0.45
Identities = 23/108 (21%), Positives = 36/108 (33%), Gaps = 13/108 (12%)
Query: 8 PLPLVPILAFSLYWTNILLPIIVHIVLHVTSR-VFIYWKLPSNRTYSAYWVYLPNWKVGL 66
LP + SL +L + R + + N W+ W VGL
Sbjct: 231 LLPSGSLTYLSLLMPAWILGRNAFEIFLGYWRQSNGFKEQGKNAPNIYSWIPNNYWNVGL 290
Query: 67 PTLCINTLAFSLYWTNILLPIIVHILLHVTSRVFIYWKLPSNRTYSAY 114
P YW + +P + ILL V + KL + Y ++
Sbjct: 291 P----------KYWISNNIPNFLFILL--IIIVVGFSKLYFTKQYLSH 326
>gnl|CDD|173611 PTZ00421, PTZ00421, coronin; Provisional.
Length = 493
Score = 33.3 bits (76), Expect = 0.57
Identities = 17/49 (34%), Positives = 27/49 (55%), Gaps = 2/49 (4%)
Query: 579 VQVWDVSVAKQVHKLVGHTARVGALAWN--GDMLSSGSRDRMILQRDVR 625
V VWDV K V + H+ ++ +L WN G +L + S+D+ + D R
Sbjct: 150 VNVWDVERGKAVEVIKCHSDQITSLEWNLDGSLLCTTSKDKKLNIIDPR 198
>gnl|CDD|227468 COG5139, COG5139, Uncharacterized conserved protein [Function
unknown].
Length = 397
Score = 32.7 bits (74), Expect = 0.80
Identities = 20/94 (21%), Positives = 30/94 (31%), Gaps = 22/94 (23%)
Query: 284 KRHHAATEENSNTDLDSRKLSKVQRNLMTHDFSSHRNKHLDQNMNMDLENENNEMEMENL 343
K AA +ENSN + RK + + D S R + DQ++ E +
Sbjct: 62 KNEEAAIDENSNVEAAERKRKHISTDF--SDMSLLRKRKNDQSLQ---PTR------EPM 110
Query: 344 ATEDENTAPPLDLYDIMSPAYRSTSANLEVKADR 377
+ D + S L DR
Sbjct: 111 DSRDSGQDFTE-----------AQSGELGDTGDR 133
>gnl|CDD|132654 TIGR03615, RutF, pyrimidine utilization flavin reductase protein F.
This protein is observed in operons extremely similar
to that characterized in E. coli K-12 responsible for
the import and catabolism of pyrimidines, primarily
uracil. This protein is a member of the flavin reductase
family defined by pfam01613. Presumably, this protein
recycles the flavin of the RutA luciferase-like
oxidoreductase.
Length = 156
Score = 30.9 bits (70), Expect = 1.5
Identities = 13/35 (37%), Positives = 16/35 (45%), Gaps = 5/35 (14%)
Query: 108 NRTYSAYWVYLPNWKVGLPTLCINTLLFHHRHHST 142
NR+ SAY + N TLC+NTL S
Sbjct: 51 NRSASAYPAFKQN-----GTLCVNTLAAGQEDLSN 80
>gnl|CDD|217344 pfam03060, NMO, Nitronate monooxygenase. Nitronate monooxygenase
(NMO), formerly referred to as 2-nitropropane
dioxygenase (NPD) (EC:1.13.11.32), is an FMN-dependent
enzyme that uses molecular oxygen to oxidize (anionic)
alkyl nitronates and, in the case of the enzyme from
Neurospora crassa, (neutral) nitroalkanes to the
corresponding carbonyl compounds and nitrite. Previously
classified as 2-nitropropane dioxygenase, but it is now
recognized that this was the result of the slow
ionization of nitroalkanes to their nitronate (anionic)
forms. The enzymes from the fungus Neurospora crassa and
the yeast Williopsis saturnus var. mrakii (formerly
classified as Hansenula mrakii) contain non-covalently
bound FMN as the cofactor. Active towards linear alkyl
nitronates of lengths between 2 and 6 carbon atoms and,
with lower activity, towards propyl-2-nitronate. The
enzyme from N. crassa can also utilize neutral
nitroalkanes, but with lower activity. One atom of
oxygen is incorporated into the carbonyl group of the
aldehyde product. The reaction appears to involve the
formation of an enzyme-bound nitronate radical and an
a-peroxynitroethane species, which then decomposes,
either in the active site of the enzyme or after
release, to acetaldehyde and nitrite.
Length = 329
Score = 31.6 bits (72), Expect = 1.8
Identities = 17/99 (17%), Positives = 26/99 (26%), Gaps = 15/99 (15%)
Query: 695 GLLASGGGTADRC---IRFWNTLTGQPMQCVDTGSQVCNLAWSKHSSELVSTHGYSQNQI 751
G++ SG T DR IR LT +P NL + + ++
Sbjct: 38 GVIGSGYLTPDRLEKEIRKVRELTDKPFG--------VNLFLPQPAEGDDFALDVKEHGN 89
Query: 752 LVWKYPTLTQVAKLTGHSYRVLYLAMSPDGEAIVTGAGD 790
K V + + G G
Sbjct: 90 EALKLAIEEGV----PDYGDDDDSLKDAKPKVVSFGFGL 124
>gnl|CDD|216161 pfam00865, Osteopontin, Osteopontin.
Length = 293
Score = 31.2 bits (70), Expect = 1.9
Identities = 14/47 (29%), Positives = 26/47 (55%), Gaps = 2/47 (4%)
Query: 275 HTSSASHTPKRHHAATEENSNTDLDSRKLSKVQRNLMTHDFSSHRNK 321
S + K + + E +S+ +DS++ SKV + +H+F SH +K
Sbjct: 220 REQSKEYKRKANDESNE-HSDV-IDSQESSKVSQEHQSHEFHSHEDK 264
>gnl|CDD|150430 pfam09755, DUF2046, Uncharacterized conserved protein H4 (DUF2046).
This is the conserved N-terminal 350 residues of a
family of proteins of unknown function possibly
containing a coiled-coil domain.
Length = 308
Score = 31.1 bits (70), Expect = 2.2
Identities = 31/125 (24%), Positives = 49/125 (39%), Gaps = 8/125 (6%)
Query: 222 LWPRT-PLEF-KLGLRENGSTPLN-PPS----TIMGDDECEMSNYYLKRLVTTVTP-PKL 273
LW R LE K L+E P++ PPS +D ++ L + V +
Sbjct: 183 LWKRMDKLEAEKRMLQEKLDQPVSEPPSPRDIFTEAEDTVGGIASHITSLRSEVRRLRQN 242
Query: 274 LHTSSASHTPKRHHAATEENSNTDLDSRKLSKVQRNLMTHDFSSHRNKHLDQNMNMDLEN 333
L S A +T K A EE + + R K+ R + + + + ++ MD E
Sbjct: 243 LAKSEADYTEKMEQYAKEERQTREENIRLQRKLLREVERREALCRQLSESESSLEMDDER 302
Query: 334 ENNEM 338
NE
Sbjct: 303 YYNES 307
>gnl|CDD|184663 PRK14411, PRK14411, membrane protein; Provisional.
Length = 204
Score = 30.5 bits (69), Expect = 2.8
Identities = 21/88 (23%), Positives = 35/88 (39%), Gaps = 20/88 (22%)
Query: 53 SAYWVYLPNWKVGLPTLCINTLAFSLYWTNILLPIIVHILLHVTSRVFIYWKLPSNRTYS 112
+A +W+ GL TL I L + N+ +P +V +LL ++ +W
Sbjct: 109 TAQVAVFYDWRAGLITLLI-ALVLTAIMQNLTIPPLVFMLLF---SIYTFWN-------- 156
Query: 113 AYWVYLPNWKVGLPTLCINTLL-FHHRH 139
N + GL L I ++ F R
Sbjct: 157 -------NQEAGLVFLIITLIMIFKFRK 177
>gnl|CDD|216201 pfam00930, DPPIV_N, Dipeptidyl peptidase IV (DPP IV) N-terminal
region. This family is an alignment of the region to
the N-terminal side of the active site. The Prosite
motif does not correspond to this Pfam entry.
Length = 348
Score = 30.7 bits (70), Expect = 2.9
Identities = 20/63 (31%), Positives = 22/63 (34%), Gaps = 18/63 (28%)
Query: 646 KWSPDNQYLASGGNDNRLYVWNLHSMSPLQ-TYT----------------EHLAAVKAIA 688
KWSPD LA DN LYV L + Q T E + A
Sbjct: 49 KWSPDGHKLAY-VRDNNLYVRELATGKETQITSDGSNGIFNGLADWVYEEEVFGSNSAYW 107
Query: 689 WSP 691
WSP
Sbjct: 108 WSP 110
>gnl|CDD|214343 CHL00035, psbC, photosystem II 44 kDa protein.
Length = 473
Score = 30.5 bits (69), Expect = 4.6
Identities = 13/25 (52%), Positives = 16/25 (64%), Gaps = 3/25 (12%)
Query: 774 YLAMSPDGEAIVTGAGDETLRFWNV 798
YL SP GE I G ET+RFW++
Sbjct: 340 YLMRSPTGEIIF---GGETMRFWDL 361
>gnl|CDD|171470 PRK12400, PRK12400, D-amino acid aminotransferase; Reviewed.
Length = 290
Score = 29.6 bits (66), Expect = 7.3
Identities = 11/33 (33%), Positives = 23/33 (69%), Gaps = 2/33 (6%)
Query: 175 YCFALDLPIYLAAWLSCQT--SLWVEYGFKSIS 205
+ F+ D+P + A+++ + +LW+EYG ++IS
Sbjct: 107 HTFSYDVPPTIYAYITKKERPALWIEYGVRAIS 139
>gnl|CDD|217394 pfam03155, Alg6_Alg8, ALG6, ALG8 glycosyltransferase family.
N-linked (asparagine-linked) glycosylation of proteins
is mediated by a highly conserved pathway in eukaryotes,
in which a lipid (dolichol phosphate)-linked
oligosaccharide is assembled at the endoplasmic
reticulum membrane prior to the transfer of the
oligosaccharide moiety to the target asparagine
residues. This oligosaccharide is composed of
Glc(3)Man(9)GlcNAc(2). The addition of the three glucose
residues is the final series of steps in the synthesis
of the oligosaccharide precursor. Alg6 transfers the
first glucose residue, and Alg8 transfers the second
one. In the human alg6 gene, a C->T transition, which
causes Ala333 to be replaced with Val, has been
identified as the cause of a congenital disorder of
glycosylation, designated as type Ic OMIM:603147.
Length = 463
Score = 29.3 bits (66), Expect = 8.6
Identities = 28/109 (25%), Positives = 43/109 (39%), Gaps = 24/109 (22%)
Query: 6 IIPLPLVPILAFSLYWTNILLP-IIVHIVLHVTSRVFIYWKLPSNR-TYSAYW-----VY 58
I+ L LV + F++ I LP + + + V SR+F P R + +Y
Sbjct: 217 ILKLALVVVGTFAI----IFLPFLYLKQLPQVLSRLF-----PFARGLFEDKVANFWCLY 267
Query: 59 LPNWK----VGLPTLCINTLAFSLYWTNI-LLPIIVHILLHVTSRVFIY 102
K V LP + S T I LP +V + L T + F+
Sbjct: 268 NFVDKIKEVVPLPQI---QPYISFILTLIGSLPALVKLFLRPTKKGFLL 313
>gnl|CDD|224423 COG1506, DAP2, Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
[Amino acid transport and metabolism].
Length = 620
Score = 29.4 bits (66), Expect = 9.2
Identities = 15/53 (28%), Positives = 20/53 (37%), Gaps = 1/53 (1%)
Query: 645 LKWSPDNQYLASGGNDNRLYVWNLHSMSPLQTYTEHLAAVKAIAWSPHHHGLL 697
L+WSPD LA D V L+ + T+ V WSP +
Sbjct: 65 LRWSPDGSVLAFVSTDGG-RVAQLYLVDVGGLITKTAFGVSDARWSPDGDRIA 116
>gnl|CDD|193528 cd05651, M20_ArgE_DapE_like5, M20 Peptidases with similarity to
acetylornithine deacetylases and
succinyl-diaminopimelate desuccinylases. Peptidase M20
family, Uncharacterized protein subfamily with
similarity to acetylornithine
deacetylase/succinyl-diaminopimelate desuccinylase
(ArgE/DapE) subfamily. ArgE/DapE enzymes catalyze
analogous reactions and share a common activator, the
metal ion (usually Co2+ or Zn2+). ArgE catalyzes a broad
range of substrates, including N-acetylornithine,
alpha-N-acetylmethionine and alpha-N-formylmethionine,
while DapE catalyzes the hydrolysis of
N-succinyl-L,L-diaminopimelate (L,L-SDAP) to
L,L-diaminopimelate and succinate. Proteins in this
subfamily are bacterial, and have been inferred by
homology as being related to both ArgE and DapE.
Length = 342
Score = 29.2 bits (66), Expect = 9.4
Identities = 10/43 (23%), Positives = 17/43 (39%), Gaps = 6/43 (13%)
Query: 100 FIYWKLPSNRTYSAYWVYLPNWKVGLPTLCINTLLFHHRHHST 142
+P R + W ++ G PT+ +N+ HH T
Sbjct: 31 LEQKGIPPKRKGNNIWAKNGHFDEGKPTILLNS------HHDT 67
>gnl|CDD|213589 TIGR01153, psbC, photosystem II 44 kDa subunit reaction center
protein (also called P6 protein, CP43), bacterial and
chloroplast. This model describes the Photosystem II,
44kDa subunit (also called P6 protein, CP43) in
bacterial and its equivalents in chloroplast of algae
and higher plants. Photosystem II is in many ways
functionally equivalent to bacterial reaction center. At
the core of Photosystem II are several light harvesting
cofactors including plastoquinones, pheophytins,
phyloquinones etc. These cofactors are intimately
associated with the polypeptides, which principally
including subunits 44 kDa protein,DI, DII, Cyt.b, Cyt.f,
iron-sulphur protein and others. Functinally 44 kDa
subunit is imlicated in chlorophyll binding. Together
they participate in the electron transfer reactions that
lead to the net production of the reducting equivalents
in the form of NADPH, which are used for reduction of
CO2 to carbohydrates(C6H1206). Phosystem II operates
during oxygenic photosynthesis and principal electron
donor is H2O. Although no high resolution X-ray
structural data is presently available, recently a 3D
structure of the supercomplex has been described by
cryo-electron microscopy. Besides a huge body of
literature exits that describes function using a variety
of biochemical and biophysical techniques [Energy
metabolism, Electron transport, Energy metabolism,
Photosynthesis].
Length = 432
Score = 29.4 bits (66), Expect = 9.5
Identities = 13/24 (54%), Positives = 15/24 (62%), Gaps = 3/24 (12%)
Query: 774 YLAMSPDGEAIVTGAGDETLRFWN 797
YL SP GE I G ET+RFW+
Sbjct: 308 YLMRSPTGEIIF---GGETMRFWD 328
>gnl|CDD|153070 cd03600, CLECT_thrombomodulin_like, C-type lectin-like domain
(CTLD) of the type found in human thrombomodulin(TM),
Endosialin, C14orf27, and C1qR.
CLECT_thrombomodulin_like: C-type lectin-like domain
(CTLD) of the type found in human thrombomodulin(TM),
Endosialin, C14orf27, and C1qR. CTLD refers to a domain
homologous to the carbohydrate-recognition domains
(CRDs) of the C-type lectins. In these
thrombomodulin-like proteins the residues involved in
coordinating Ca2+ in the classical MBP-A CTLD are not
conserved. TM exerts anti-fibrinolytic and
anti-inflammatory activity. TM also regulates blood
coagulation in the anticoagulant protein C pathway. In
this pathway, the procoagulant properties of thrombin
(T) are lost when it binds TM. TM also plays a key role
in tumor biology. It is expressed on endothelial cells
and on several type of tumor cell including squamous
cell carcinoma. Loss of TM expression correlates with
advanced stage and poor prognosis. Loss of function of
TM function may be associated with arterial or venous
thrombosis and with late fetal loss. Soluble molecules
of TM retaining the CTLD are detected in human plasma
and urine where higher levels indicate injury and/or
enhanced turnover of the endothelium. C1qR is expressed
on endothelial cells and stem cells. It is also
expressed on monocots and neutrophils, where it is
subject to ectodomain shedding. Soluble forms of C1qR
retaining the CTLD is detected in human plasma. C1qR
modulates the phagocytosis of apoptotic cells in vivo.
C1qR-deficient mice are defective in clearance of
apoptotic cells in vivo. The cytoplasmic tail of C1qR,
C-terminal to the CTLD of CD93, contains a PDZ binding
domain which interacts with the PDZ domain-containing
adaptor protein, GIPC. The juxtamembrane region of this
tail interacts with the ezrin/radixin/moesin family.
Endosialin functions in the growth and progression of
abdominal tumors and is expressed in the stroma of
several tumors.
Length = 141
Score = 28.2 bits (63), Expect = 9.5
Identities = 16/46 (34%), Positives = 23/46 (50%), Gaps = 1/46 (2%)
Query: 680 HLAAVKAIAWSPHHHGLLASGGGTADRC-IRFWNTLTGQPMQCVDT 724
+LA V++ + LLA+G G R +R W L +P QC D
Sbjct: 29 NLATVRSGEEADVVSLLLAAGPGRHGRGSLRLWIGLQREPRQCSDP 74
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.320 0.134 0.427
Gapped
Lambda K H
0.267 0.0728 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 41,026,796
Number of extensions: 3942631
Number of successful extensions: 2986
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2918
Number of HSP's successfully gapped: 60
Length of query: 819
Length of database: 10,937,602
Length adjustment: 105
Effective length of query: 714
Effective length of database: 6,280,432
Effective search space: 4484228448
Effective search space used: 4484228448
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 63 (28.0 bits)