RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy13207
(197 letters)
>gnl|CDD|225201 COG2319, COG2319, FOG: WD40 repeat [General function prediction
only].
Length = 466
Score = 54.7 bits (130), Expect = 5e-09
Identities = 42/188 (22%), Positives = 76/188 (40%), Gaps = 17/188 (9%)
Query: 12 LNQWTRRSNLVDSRTSVTDVKFGPKQLGLVLATCAADGLVRIYEAPDVMNLSQWCLQHEV 71
L S L V+ + F P GL++A+ ++DG +R+++ L H
Sbjct: 185 LRTGKPLSTLAGHTDPVSSLAFSPD-GGLLIASGSSDGTIRLWDLSTGKLLRSTLSGHSD 243
Query: 72 ACPTLASCLTWNHSF----SRFNSISI-----MYEAHSINAEHKDLIHDVAYDYYGLRMA 122
+ S + + S S +I + + H + VA+ G +A
Sbjct: 244 SV---VSSFSPDGSLLASGSSDGTIRLWDLRSSSSLLRTLSGHSSSVLSVAFSPDGKLLA 300
Query: 123 TCSSDQNIKVTRDVMGTRFDIETIGNFEDHYCTVWRVSWNITGTIVASSG-DDGCVRLWK 181
+ SSD +++ G T + H V +S++ G+++ S G DDG +RLW
Sbjct: 301 SGSSDGTVRLWDLETGKLLSSLT---LKGHEGPVSSLSFSPDGSLLVSGGSDDGTIRLWD 357
Query: 182 LKKKRKKK 189
L+ + K
Sbjct: 358 LRTGKPLK 365
Score = 48.9 bits (115), Expect = 4e-07
Identities = 36/181 (19%), Positives = 66/181 (36%), Gaps = 20/181 (11%)
Query: 27 SVTDVKFGPKQLGLVLATCAADGLVRIYEAPDVMNLSQWCLQHEVACPTLASCLTWNHSF 86
SV V F P G +LA+ ++DG VR++ D+ S L+++
Sbjct: 286 SVLSVAFSPD--GKLLASGSSDGTVRLW---DLETGKLLSSLTLKGHEGPVSSLSFSPDG 340
Query: 87 SRFNSI----------SIMYEAHSINAEHKDLIHDVAYDYYGLRMATCSSDQNIKVTRDV 136
S S + E + V++ G +++ S+D +++
Sbjct: 341 SLLVSGGSDDGTIRLWDLRTGKPLKTLEGHSNVLSVSFSPDGRVVSSGSTDGTVRLWDLS 400
Query: 137 MGTRFDIETIGNFEDHYCTVWRVSWNITGTIVASSGDDGCVRLWKLKKKRKKKKEEEEEE 196
G + N + H V + ++ G +AS D +RLW LK K + +
Sbjct: 401 TG-----SLLRNLDGHTSRVTSLDFSPDGKSLASGSSDNTIRLWDLKTSLKSVSFSPDGK 455
Query: 197 N 197
Sbjct: 456 V 456
Score = 38.5 bits (88), Expect = 0.001
Identities = 25/103 (24%), Positives = 48/103 (46%), Gaps = 6/103 (5%)
Query: 92 ISIMYEAHSINAEHKDLIHDVAYDYYGLRMATCSSDQNIKVTRDVMGTRFDIETIGNFED 151
+S+ + + H+D I +A+ G + + SSD IK+ D+ I+++
Sbjct: 51 VSLPDLSSLLLRGHEDSITSIAFSPDGELLLSGSSDGTIKL-WDLDNGEKLIKSL--EGL 107
Query: 152 HYCTVWRVSW---NITGTIVASSGDDGCVRLWKLKKKRKKKKE 191
H +V +++ + ++ASS DG V+LW L K +
Sbjct: 108 HDSSVSKLALSSPDGNSILLASSSLDGTVKLWDLSTPGKLIRT 150
Score = 33.9 bits (76), Expect = 0.041
Identities = 31/184 (16%), Positives = 71/184 (38%), Gaps = 20/184 (10%)
Query: 27 SVTDVKFGPKQLGLVLATCAADGLVRIYEAPD--------VMNLSQWCLQHEVACP---T 75
S+T + F P G +L + ++DG +++++ + + ++ P +
Sbjct: 67 SITSIAFSPD--GELLLSGSSDGTIKLWDLDNGEKLIKSLEGLHDSSVSKLALSSPDGNS 124
Query: 76 LASCLTWNHSFSRFNSISIMYEAHSINAEHKDLIHDVAYDYYGLRMATCSS-DQNIKVTR 134
+ + + +S + H + + +A+ G +A+ SS D IK+
Sbjct: 125 ILLASSSLDGTVKLWDLSTPGKLIRTLEGHSESVTSLAFSPDGKLLASGSSLDGTIKLWD 184
Query: 135 DVMGTRFDIETIGNFEDHYCTVWRVSWNITGT-IVASSGDDGCVRLWKLKKKRKKKKEEE 193
G + + H V ++++ G ++AS DG +RLW L + +
Sbjct: 185 LRTG-----KPLSTLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTGKLLRSTLS 239
Query: 194 EEEN 197
+
Sbjct: 240 GHSD 243
Score = 33.5 bits (75), Expect = 0.052
Identities = 34/158 (21%), Positives = 66/158 (41%), Gaps = 17/158 (10%)
Query: 40 LVLATCAADGLVRIYEAPDVMNLSQWCLQHEVAC---------PTLASCLTWNHSFSRFN 90
++LA+ + DG V++++ L + H + LAS + + + ++
Sbjct: 125 ILLASSSLDGTVKLWDLSTPGKLIRTLEGHSESVTSLAFSPDGKLLASGSSLDGTIKLWD 184
Query: 91 SISIMYEAHSINAEHKDLIHDVAYDYYG-LRMATCSSDQNIKVTRDVMGTRFDIETIGNF 149
+ + S A H D + +A+ G L +A+ SSD I++
Sbjct: 185 LRT--GKPLSTLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWD----LSTGKLLRSTL 238
Query: 150 EDHYCTVWRVSWNITGTIVASSGDDGCVRLWKLKKKRK 187
H +V S++ G+++AS DG +RLW L+
Sbjct: 239 SGHSDSVVS-SFSPDGSLLASGSSDGTIRLWDLRSSSS 275
Score = 27.7 bits (60), Expect = 5.0
Identities = 14/75 (18%), Positives = 32/75 (42%), Gaps = 2/75 (2%)
Query: 125 SSDQNIKVTRDVMGTRFDIETIGN--FEDHYCTVWRVSWNITGTIVASSGDDGCVRLWKL 182
SS+ I + + + + + + H ++ ++++ G ++ S DG ++LW L
Sbjct: 35 SSESGILLLALLSDSLVSLPDLSSLLLRGHEDSITSIAFSPDGELLLSGSSDGTIKLWDL 94
Query: 183 KKKRKKKKEEEEEEN 197
K K E +
Sbjct: 95 DNGEKLIKSLEGLHD 109
>gnl|CDD|238121 cd00200, WD40, WD40 domain, found in a number of eukaryotic
proteins that cover a wide variety of functions
including adaptor/regulatory modules in signal
transduction, pre-mRNA processing and cytoskeleton
assembly; typically contains a GH dipeptide 11-24
residues from its N-terminus and the WD dipeptide at its
C-terminus and is 40 residues long, hence the name WD40;
between GH and WD lies a conserved core; serves as a
stable propeller-like platform to which proteins can
bind either stably or reversibly; forms a propeller-like
structure with several blades where each blade is
composed of a four-stranded anti-parallel b-sheet;
instances with few detectable copies are hypothesized to
form larger structures by dimerization; each WD40
sequence repeat forms the first three strands of one
blade and the last strand in the next blade; the last
C-terminal WD40 repeat completes the blade structure of
the first WD40 repeat to create the closed ring
propeller-structure; residues on the top and bottom
surface of the propeller are proposed to coordinate
interactions with other proteins and/or small ligands; 7
copies of the repeat are present in this alignment.
Length = 289
Score = 52.3 bits (126), Expect = 2e-08
Identities = 23/88 (26%), Positives = 39/88 (44%), Gaps = 11/88 (12%)
Query: 105 HKDLIHDVAYDYYGLRMATCSSDQNIKVTRDVMGTRFDIET---IGNFEDHYCTVWRVSW 161
H D ++ VA+ G +A+ S D IK+ +D+ T + H V V++
Sbjct: 134 HTDWVNSVAFSPDGTFVASSSQDGTIKL--------WDLRTGKCVATLTGHTGEVNSVAF 185
Query: 162 NITGTIVASSGDDGCVRLWKLKKKRKKK 189
+ G + SS DG ++LW L +
Sbjct: 186 SPDGEKLLSSSSDGTIKLWDLSTGKCLG 213
Score = 52.0 bits (125), Expect = 3e-08
Identities = 37/168 (22%), Positives = 63/168 (37%), Gaps = 50/168 (29%)
Query: 25 RTSVTDVKFGPKQLGLVLATCAADGLVRIYEAPDVMNLSQWCLQHEVACPTLASCLTWNH 84
VT V F P G +LAT + DG +++ W L+ TL
Sbjct: 9 TGGVTCVAFSPD--GKLLATGSGDGTIKV-----------WDLETGELLRTLK------- 48
Query: 85 SFSRFNSISIMYEAHSINAEHKDLIHDVAYDYYGLRMATCSSDQNIKVTRDVMGTRFDIE 144
H + DVA G +A+ SSD+ I++ +D+E
Sbjct: 49 -------------------GHTGPVRDVAASADGTYLASGSSDKTIRL--------WDLE 81
Query: 145 T---IGNFEDHYCTVWRVSWNITGTIVASSGDDGCVRLWKLKKKRKKK 189
T + H V V+++ G I++SS D +++W ++ +
Sbjct: 82 TGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLT 129
Score = 44.6 bits (106), Expect = 8e-06
Identities = 26/86 (30%), Positives = 38/86 (44%), Gaps = 11/86 (12%)
Query: 105 HKDLIHDVAYDYYGLRMATCSSDQNIKVTRDVMGTRFDIET---IGNFEDHYCTVWRVSW 161
H + VA+ G +AT S D IKV +D+ET + + H V V+
Sbjct: 8 HTGGVTCVAFSPDGKLLATGSGDGTIKV--------WDLETGELLRTLKGHTGPVRDVAA 59
Query: 162 NITGTIVASSGDDGCVRLWKLKKKRK 187
+ GT +AS D +RLW L+
Sbjct: 60 SADGTYLASGSSDKTIRLWDLETGEC 85
Score = 43.9 bits (104), Expect = 1e-05
Identities = 23/89 (25%), Positives = 43/89 (48%), Gaps = 11/89 (12%)
Query: 105 HKDLIHDVAYDYYGLRMATCSSDQNIKVTRDVMGTRFDIET---IGNFEDHYCTVWRVSW 161
H ++ VA+ G ++ + SSD IK+ +D+ T +G H V V++
Sbjct: 176 HTGEVNSVAFSPDGEKLLSSSSDGTIKL--------WDLSTGKCLGTLRGHENGVNSVAF 227
Query: 162 NITGTIVASSGDDGCVRLWKLKKKRKKKK 190
+ G ++AS +DG +R+W L+ +
Sbjct: 228 SPDGYLLASGSEDGTIRVWDLRTGECVQT 256
Score = 41.9 bits (99), Expect = 7e-05
Identities = 31/168 (18%), Positives = 65/168 (38%), Gaps = 28/168 (16%)
Query: 27 SVTDVKFGPKQLGLVLATCAADGLVRIYEAPDVMNLSQWCLQHEVACPTLASCLTWNHSF 86
V V F P G +A+ + DG +++++ H + + ++
Sbjct: 137 WVNSVAFSPD--GTFVASSSQDGTIKLWDL-RTGKCVATLTGHTGEV----NSVAFSPDG 189
Query: 87 SRF------NSISI----MYEAHSINAEHKDLIHDVAYDYYGLRMATCSSDQNIKVTRDV 136
+ +I + + H++ ++ VA+ G +A+ S D I+V
Sbjct: 190 EKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIRV---- 245
Query: 137 MGTRFDIET---IGNFEDHYCTVWRVSWNITGTIVASSGDDGCVRLWK 181
+D+ T + H +V ++W+ G +AS DG +R+W
Sbjct: 246 ----WDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289
Score = 29.6 bits (67), Expect = 1.0
Identities = 9/41 (21%), Positives = 21/41 (51%)
Query: 149 FEDHYCTVWRVSWNITGTIVASSGDDGCVRLWKLKKKRKKK 189
+ H V V+++ G ++A+ DG +++W L+ +
Sbjct: 5 LKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLR 45
Score = 26.5 bits (59), Expect = 10.0
Identities = 12/30 (40%), Positives = 19/30 (63%), Gaps = 2/30 (6%)
Query: 26 TSVTDVKFGPKQLGLVLATCAADGLVRIYE 55
SVT + + P G LA+ +ADG +RI++
Sbjct: 262 NSVTSLAWSPD--GKRLASGSADGTIRIWD 289
>gnl|CDD|201208 pfam00400, WD40, WD domain, G-beta repeat.
Length = 39
Score = 38.1 bits (89), Expect = 1e-04
Identities = 12/38 (31%), Positives = 21/38 (55%)
Query: 144 ETIGNFEDHYCTVWRVSWNITGTIVASSGDDGCVRLWK 181
+ + + H V V+++ G ++AS DDG VR+W
Sbjct: 2 KLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39
Score = 29.2 bits (66), Expect = 0.17
Identities = 8/28 (28%), Positives = 14/28 (50%)
Query: 105 HKDLIHDVAYDYYGLRMATCSSDQNIKV 132
H + VA+ G +A+ S D ++V
Sbjct: 10 HTGPVTSVAFSPDGNLLASGSDDGTVRV 37
Score = 27.3 bits (61), Expect = 0.65
Identities = 12/28 (42%), Positives = 17/28 (60%), Gaps = 2/28 (7%)
Query: 27 SVTDVKFGPKQLGLVLATCAADGLVRIY 54
VT V F P G +LA+ + DG VR++
Sbjct: 13 PVTSVAFSPD--GNLLASGSDDGTVRVW 38
>gnl|CDD|197651 smart00320, WD40, WD40 repeats. Note that these repeats are
permuted with respect to the structural repeats (blades)
of the beta propeller domain.
Length = 40
Score = 37.7 bits (88), Expect = 1e-04
Identities = 11/36 (30%), Positives = 19/36 (52%)
Query: 146 IGNFEDHYCTVWRVSWNITGTIVASSGDDGCVRLWK 181
+ + H V V+++ G +AS DDG ++LW
Sbjct: 5 LKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
Score = 30.0 bits (68), Expect = 0.081
Identities = 9/28 (32%), Positives = 14/28 (50%)
Query: 105 HKDLIHDVAYDYYGLRMATCSSDQNIKV 132
H + VA+ G +A+ S D IK+
Sbjct: 11 HTGPVTSVAFSPDGKYLASGSDDGTIKL 38
Score = 26.9 bits (60), Expect = 1.1
Identities = 10/29 (34%), Positives = 17/29 (58%), Gaps = 2/29 (6%)
Query: 27 SVTDVKFGPKQLGLVLATCAADGLVRIYE 55
VT V F P G LA+ + DG +++++
Sbjct: 14 PVTSVAFSPD--GKYLASGSDDGTIKLWD 40
>gnl|CDD|184855 PRK14853, nhaA, pH-dependent sodium/proton antiporter;
Provisional.
Length = 423
Score = 30.0 bits (68), Expect = 0.74
Identities = 17/40 (42%), Positives = 22/40 (55%), Gaps = 4/40 (10%)
Query: 19 SNLVDSRTSVTDVKFGPKQLGL----VLATCAADGLVRIY 54
S DS ++ D K GP+ GL L T AADGL+ I+
Sbjct: 35 SPWGDSYFALRDFKLGPEPGGLHLSLSLGTWAADGLLAIF 74
>gnl|CDD|234234 TIGR03502, lipase_Pla1_cef, extracellular lipase, Pla-1/cef family.
Members of this protein family are bacterial
lipoproteins largely from the Gammaproteobacteria.
Characterized members are expressed extracellularly and
have esterase activity. Members include the lipase Pla-1
from Aeromonas hydrophila (AF092033) and CHO cell
elongation factor (cef) from Vibrio hollisae.
Length = 792
Score = 29.8 bits (67), Expect = 0.95
Identities = 15/46 (32%), Positives = 20/46 (43%), Gaps = 3/46 (6%)
Query: 34 GPKQLGLVLATCAADGLVRIYEAPDVMNLSQWCLQHEVACPTLASC 79
PK L A+ A G VR++EA LS + A P +C
Sbjct: 103 LPKGHSLDAASVAQPGAVRLFEATQGGTLSDAECK---ANPAGKAC 145
>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein. This
family includes archaebacterial L12, eukaryotic P0, P1
and P2.
Length = 88
Score = 28.0 bits (63), Expect = 1.0
Identities = 8/14 (57%), Positives = 11/14 (78%)
Query: 183 KKKRKKKKEEEEEE 196
+ +KK+EEEEEE
Sbjct: 65 AAEEEKKEEEEEEE 78
Score = 27.2 bits (61), Expect = 1.8
Identities = 8/14 (57%), Positives = 12/14 (85%)
Query: 183 KKKRKKKKEEEEEE 196
++ KK++EEEEEE
Sbjct: 66 AEEEKKEEEEEEEE 79
>gnl|CDD|185582 PTZ00373, PTZ00373, 60S Acidic ribosomal protein P2; Provisional.
Length = 112
Score = 28.3 bits (63), Expect = 1.2
Identities = 10/14 (71%), Positives = 13/14 (92%)
Query: 184 KKRKKKKEEEEEEN 197
KK +KK+EEEEEE+
Sbjct: 91 KKEEKKEEEEEEED 104
Score = 26.8 bits (59), Expect = 4.5
Identities = 9/15 (60%), Positives = 14/15 (93%)
Query: 183 KKKRKKKKEEEEEEN 197
KK+ KK++EEEEE++
Sbjct: 91 KKEEKKEEEEEEEDD 105
>gnl|CDD|221473 pfam12230, PRP21_like_P, Pre-mRNA splicing factor PRP21 like
protein. This domain family is found in eukaryotes, and
is typically between 212 and 238 amino acids in length.
The family is found in association with pfam01805. There
are two completely conserved residues (W and H) that may
be functionally important. PRP21 is required for
assembly of the prespliceosome and it interacts with U2
snRNP and/or pre-mRNA in the prespliceosome. This family
also contains proteins similar to PRP21, such as the
mammalian SF3a. SF3a also interacts with U2 snRNP from
the prespliceosome, converting it to its active form.
Length = 230
Score = 27.8 bits (62), Expect = 3.2
Identities = 11/16 (68%), Positives = 14/16 (87%)
Query: 181 KLKKKRKKKKEEEEEE 196
K +KK ++KKEEEEEE
Sbjct: 31 KHQKKEEQKKEEEEEE 46
Score = 27.8 bits (62), Expect = 3.6
Identities = 10/15 (66%), Positives = 13/15 (86%)
Query: 183 KKKRKKKKEEEEEEN 197
KK+ +KK+EEEEEE
Sbjct: 34 KKEEQKKEEEEEEER 48
>gnl|CDD|235014 PRK02227, PRK02227, hypothetical protein; Provisional.
Length = 238
Score = 27.2 bits (61), Expect = 4.9
Identities = 8/32 (25%), Positives = 12/32 (37%)
Query: 98 AHSINAEHKDLIHDVAYDYYGLRMATCSSDQN 129
A S+ E + + D G+R A C
Sbjct: 186 AGSLKFEDIPALKRLGPDILGVRGAVCGGGDR 217
>gnl|CDD|184416 PRK13955, mscL, large-conductance mechanosensitive channel;
Provisional.
Length = 130
Score = 26.7 bits (59), Expect = 5.6
Identities = 8/16 (50%), Positives = 12/16 (75%)
Query: 181 KLKKKRKKKKEEEEEE 196
KL K++++KEEE E
Sbjct: 92 KLTSKKEEEKEEEIPE 107
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 26.6 bits (59), Expect = 9.3
Identities = 11/19 (57%), Positives = 15/19 (78%)
Query: 179 LWKLKKKRKKKKEEEEEEN 197
L K+KK+R ++KE EEEE
Sbjct: 151 LEKIKKERAEEKEREEEEK 169
>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
(DUF1754). This is a eukaryotic protein family of
unknown function.
Length = 90
Score = 25.5 bits (56), Expect = 9.5
Identities = 7/17 (41%), Positives = 13/17 (76%)
Query: 181 KLKKKRKKKKEEEEEEN 197
+KKK+KKKK++ + +
Sbjct: 18 DVKKKKKKKKKKNKSKE 34
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.319 0.132 0.416
Gapped
Lambda K H
0.267 0.0728 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 9,695,399
Number of extensions: 846353
Number of successful extensions: 1316
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1291
Number of HSP's successfully gapped: 39
Length of query: 197
Length of database: 10,937,602
Length adjustment: 92
Effective length of query: 105
Effective length of database: 6,857,034
Effective search space: 719988570
Effective search space used: 719988570
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 56 (25.4 bits)