RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy14456
(93 letters)
>gnl|CDD|238121 cd00200, WD40, WD40 domain, found in a number of eukaryotic
proteins that cover a wide variety of functions
including adaptor/regulatory modules in signal
transduction, pre-mRNA processing and cytoskeleton
assembly; typically contains a GH dipeptide 11-24
residues from its N-terminus and the WD dipeptide at its
C-terminus and is 40 residues long, hence the name WD40;
between GH and WD lies a conserved core; serves as a
stable propeller-like platform to which proteins can
bind either stably or reversibly; forms a propeller-like
structure with several blades where each blade is
composed of a four-stranded anti-parallel b-sheet;
instances with few detectable copies are hypothesized to
form larger structures by dimerization; each WD40
sequence repeat forms the first three strands of one
blade and the last strand in the next blade; the last
C-terminal WD40 repeat completes the blade structure of
the first WD40 repeat to create the closed ring
propeller-structure; residues on the top and bottom
surface of the propeller are proposed to coordinate
interactions with other proteins and/or small ligands; 7
copies of the repeat are present in this alignment.
Length = 289
Score = 56.2 bits (136), Expect = 6e-11
Identities = 33/88 (37%), Positives = 43/88 (48%), Gaps = 6/88 (6%)
Query: 2 HDGTVRDVCFIEDTSNKSSLLISGGAGDCKIYVTDCATGQAFQALSGHSGHVLSL-YNWG 60
H G V V F D LL S G K++ D +TG+ L GH V S+ ++
Sbjct: 176 HTGEVNSVAFSPD---GEKLLSSSSDGTIKLW--DLSTGKCLGTLRGHENGVNSVAFSPD 230
Query: 61 GATVLSGSHDRTVRFWDLRTRGCTNVLT 88
G + SGS D T+R WDLRT C L+
Sbjct: 231 GYLLASGSEDGTIRVWDLRTGECVQTLS 258
Score = 53.1 bits (128), Expect = 8e-10
Identities = 31/89 (34%), Positives = 41/89 (46%), Gaps = 6/89 (6%)
Query: 1 MHDGTVRDVCFIEDTSNKSSLLISGGAGDCKIYVTDCATGQAFQALSGHSGHVLSL-YNW 59
H V V F D ++S + D I V D TG+ L GH+ V S+ ++
Sbjct: 91 GHTSYVSSVAFSPDGR-----ILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSP 145
Query: 60 GGATVLSGSHDRTVRFWDLRTRGCTNVLT 88
G V S S D T++ WDLRT C LT
Sbjct: 146 DGTFVASSSQDGTIKLWDLRTGKCVATLT 174
Score = 50.8 bits (122), Expect = 5e-09
Identities = 28/88 (31%), Positives = 41/88 (46%), Gaps = 6/88 (6%)
Query: 2 HDGTVRDVCFIEDTSNKSSLLISGGAGDCKIYVTDCATGQAFQALSGHSGHVLSL-YNWG 60
H V V F D ++ + D I + D TG+ L+GH+G V S+ ++
Sbjct: 134 HTDWVNSVAFSPD-----GTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPD 188
Query: 61 GATVLSGSHDRTVRFWDLRTRGCTNVLT 88
G +LS S D T++ WDL T C L
Sbjct: 189 GEKLLSSSSDGTIKLWDLSTGKCLGTLR 216
Score = 49.3 bits (118), Expect = 2e-08
Identities = 35/89 (39%), Positives = 44/89 (49%), Gaps = 8/89 (8%)
Query: 2 HDGTVRDVCFIEDTSNKSSLLISGGAGDCKIYVTDCATGQAFQALSGHSGHV--LSLYNW 59
H G V V F S LL +G GD I V D TG+ + L GH+G V ++
Sbjct: 8 HTGGVTCVAF----SPDGKLLATGS-GDGTIKVWDLETGELLRTLKGHTGPVRDVAASAD 62
Query: 60 GGATVLSGSHDRTVRFWDLRTRGCTNVLT 88
G + SGS D+T+R WDL T C LT
Sbjct: 63 GT-YLASGSSDKTIRLWDLETGECVRTLT 90
Score = 47.7 bits (114), Expect = 6e-08
Identities = 29/77 (37%), Positives = 40/77 (51%), Gaps = 6/77 (7%)
Query: 2 HDGTVRDVCFIEDTSNKSSLLISGGAGDCKIYVTDCATGQAFQALSGHSGHVLSL-YNWG 60
H+ V V F + L++ G+ D I V D TG+ Q LSGH+ V SL ++
Sbjct: 218 HENGVNSVAF-----SPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPD 272
Query: 61 GATVLSGSHDRTVRFWD 77
G + SGS D T+R WD
Sbjct: 273 GKRLASGSADGTIRIWD 289
Score = 47.3 bits (113), Expect = 1e-07
Identities = 26/91 (28%), Positives = 45/91 (49%), Gaps = 6/91 (6%)
Query: 2 HDGTVRDVCFIEDTSNKSSLLISGGAGDCKIYVTDCATGQAFQALSGHSGHVLSL-YNWG 60
H G VRDV ++ + L +++ D TG+ + L+GH+ +V S+ ++
Sbjct: 50 HTGPVRDVAAS---ADGTYLASGSSDKTIRLW--DLETGECVRTLTGHTSYVSSVAFSPD 104
Query: 61 GATVLSGSHDRTVRFWDLRTRGCTNVLTPIT 91
G + S S D+T++ WD+ T C L T
Sbjct: 105 GRILSSSSRDKTIKVWDVETGKCLTTLRGHT 135
Score = 29.6 bits (67), Expect = 0.17
Identities = 15/46 (32%), Positives = 24/46 (52%), Gaps = 1/46 (2%)
Query: 44 QALSGHSGHVLSL-YNWGGATVLSGSHDRTVRFWDLRTRGCTNVLT 88
+ L GH+G V + ++ G + +GS D T++ WDL T L
Sbjct: 3 RTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLK 48
>gnl|CDD|225201 COG2319, COG2319, FOG: WD40 repeat [General function prediction
only].
Length = 466
Score = 45.8 bits (107), Expect = 3e-07
Identities = 31/80 (38%), Positives = 45/80 (56%), Gaps = 5/80 (6%)
Query: 2 HDGTVRDVCFIEDTSNKSSLLISGGAGDCKIYVTDCATGQAFQ-ALSGHSGHVLSLYNWG 60
H V + F S LLI+ G+ D I + D +TG+ + LSGHS V+S ++
Sbjct: 197 HTDPVSSLAF----SPDGGLLIASGSSDGTIRLWDLSTGKLLRSTLSGHSDSVVSSFSPD 252
Query: 61 GATVLSGSHDRTVRFWDLRT 80
G+ + SGS D T+R WDLR+
Sbjct: 253 GSLLASGSSDGTIRLWDLRS 272
Score = 41.6 bits (96), Expect = 1e-05
Identities = 31/82 (37%), Positives = 41/82 (50%), Gaps = 3/82 (3%)
Query: 12 IEDTSNKSSLLISGGAGDCKIYVTDCATG-QAFQALSGHSGHVLSL-YNWGGATVLSGSH 69
+ S SLL SG + D I + D + + LSGHS VLS+ ++ G + SGS
Sbjct: 246 VSSFSPDGSLLASGSS-DGTIRLWDLRSSSSLLRTLSGHSSSVLSVAFSPDGKLLASGSS 304
Query: 70 DRTVRFWDLRTRGCTNVLTPIT 91
D TVR WDL T + LT
Sbjct: 305 DGTVRLWDLETGKLLSSLTLKG 326
Score = 38.1 bits (87), Expect = 2e-04
Identities = 33/79 (41%), Positives = 43/79 (54%), Gaps = 4/79 (5%)
Query: 2 HDGTVRDVCFIEDTSNKSSLLISGGAGDCKIYVTDCATGQAFQALSGHSGHVLSLYNWGG 61
H+G V + F D SLL+SGG+ D I + D TG+ + L GHS + ++ G
Sbjct: 327 HEGPVSSLSFSPD----GSLLVSGGSDDGTIRLWDLRTGKPLKTLEGHSNVLSVSFSPDG 382
Query: 62 ATVLSGSHDRTVRFWDLRT 80
V SGS D TVR WDL T
Sbjct: 383 RVVSSGSTDGTVRLWDLST 401
Score = 36.2 bits (82), Expect = 8e-04
Identities = 28/90 (31%), Positives = 48/90 (53%), Gaps = 5/90 (5%)
Query: 2 HDGTVRDVCFIEDTSNKSSLLISGGAGDCKIYVTDCAT-GQAFQALSGHSGHVLSL-YNW 59
HD +V + + + +S+L++ + D + + D +T G+ + L GHS V SL ++
Sbjct: 108 HDSSVSKLAL--SSPDGNSILLASSSLDGTVKLWDLSTPGKLIRTLEGHSESVTSLAFSP 165
Query: 60 GGATVLSGSH-DRTVRFWDLRTRGCTNVLT 88
G + SGS D T++ WDLRT + L
Sbjct: 166 DGKLLASGSSLDGTIKLWDLRTGKPLSTLA 195
Score = 34.3 bits (77), Expect = 0.004
Identities = 33/90 (36%), Positives = 40/90 (44%), Gaps = 9/90 (10%)
Query: 2 HDGTVRDVCFIEDTSNKSSLLISGGAGDCKIYVTDCATGQAFQALS--GHSGHV--LSLY 57
H +V V F S LL SG D + + D TG+ +L+ GH G V LS
Sbjct: 283 HSSSVLSVAF----SPDGKLLASGS-SDGTVRLWDLETGKLLSSLTLKGHEGPVSSLSFS 337
Query: 58 NWGGATVLSGSHDRTVRFWDLRTRGCTNVL 87
G V GS D T+R WDLRT L
Sbjct: 338 PDGSLLVSGGSDDGTIRLWDLRTGKPLKTL 367
Score = 30.1 bits (66), Expect = 0.13
Identities = 27/90 (30%), Positives = 42/90 (46%), Gaps = 6/90 (6%)
Query: 1 MHDGTVRDVCFIEDTSNKSSLLISGGAGDCKIYVTDCATGQAFQALSGHSGHVLSLYNW- 59
V V F + ++S G+ D + + D +TG + L GH+ V SL
Sbjct: 368 EGHSNVLSVSF-----SPDGRVVSSGSTDGTVRLWDLSTGSLLRNLDGHTSRVTSLDFSP 422
Query: 60 GGATVLSGSHDRTVRFWDLRTRGCTNVLTP 89
G ++ SGS D T+R WDL+T + +P
Sbjct: 423 DGKSLASGSSDNTIRLWDLKTSLKSVSFSP 452
>gnl|CDD|197651 smart00320, WD40, WD40 repeats. Note that these repeats are
permuted with respect to the structural repeats
(blades) of the beta propeller domain.
Length = 40
Score = 33.1 bits (76), Expect = 0.001
Identities = 15/40 (37%), Positives = 25/40 (62%), Gaps = 1/40 (2%)
Query: 39 TGQAFQALSGHSGHVLSL-YNWGGATVLSGSHDRTVRFWD 77
+G+ + L GH+G V S+ ++ G + SGS D T++ WD
Sbjct: 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
>gnl|CDD|201208 pfam00400, WD40, WD domain, G-beta repeat.
Length = 39
Score = 32.7 bits (75), Expect = 0.002
Identities = 16/33 (48%), Positives = 21/33 (63%), Gaps = 1/33 (3%)
Query: 46 LSGHSGHVLSL-YNWGGATVLSGSHDRTVRFWD 77
L GH+G V S+ ++ G + SGS D TVR WD
Sbjct: 7 LKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39
>gnl|CDD|177776 PLN00181, PLN00181, protein SPA1-RELATED; Provisional.
Length = 793
Score = 30.1 bits (67), Expect = 0.15
Identities = 23/79 (29%), Positives = 38/79 (48%), Gaps = 5/79 (6%)
Query: 4 GTVR---DVCFIEDTSNKSSLLISGGAGDCKIYVTDCATGQ-AFQALSGHSGHVLSLYNW 59
GT++ ++C ++ S L G A D K+Y D + + GHS V +
Sbjct: 612 GTIKTKANICCVQFPSESGRSLAFGSA-DHKVYYYDLRNPKLPLCTMIGHSKTVSYVRFV 670
Query: 60 GGATVLSGSHDRTVRFWDL 78
+T++S S D T++ WDL
Sbjct: 671 DSSTLVSSSTDNTLKLWDL 689
>gnl|CDD|240412 PTZ00420, PTZ00420, coronin; Provisional.
Length = 568
Score = 27.6 bits (61), Expect = 0.85
Identities = 14/36 (38%), Positives = 22/36 (61%), Gaps = 2/36 (5%)
Query: 46 LSGHSGHVLSL-YNWGGATVL-SGSHDRTVRFWDLR 79
L GH+ +L L +N + +L SGS D T+R W++
Sbjct: 70 LKGHTSSILDLQFNPCFSEILASGSEDLTIRVWEIP 105
>gnl|CDD|224785 COG1873, COG1873, Protein implicated in RNA metabolism, contains
PRC-barrel domain [General function prediction
only].
Length = 87
Score = 26.1 bits (58), Expect = 1.4
Identities = 7/26 (26%), Positives = 9/26 (34%)
Query: 4 GTVRDVCFIEDTSNKSSLLISGGAGD 29
GTV DV + LL+
Sbjct: 25 GTVSDVVLDIKEGKITGLLVIPTNKG 50
>gnl|CDD|236528 PRK09463, fadE, acyl-CoA dehydrogenase; Reviewed.
Length = 777
Score = 27.1 bits (61), Expect = 1.5
Identities = 24/75 (32%), Positives = 31/75 (41%), Gaps = 18/75 (24%)
Query: 20 SLLISG-GAGDCKIYV---------TDCATGQAF-QALSGHSGHVLS------LYNW-GG 61
SL+I G GA C YV D +AF +AL GH G V+S GG
Sbjct: 464 SLMIFGQGAIRCHPYVLKEMEAAQNNDKQALKAFDKALFGHIGFVVSNAVRSFWLGLTGG 523
Query: 62 ATVLSGSHDRTVRFW 76
+ D T R++
Sbjct: 524 RLSAAPVDDATKRYY 538
>gnl|CDD|200504 cd11243, Sema_7A, The Sema domain, a protein interacting module,
of semaphorin 7A (Sema7A, also called CD108). Sema7A
plays regulatory roles in both immune and nervous
systems. Unlike other semaphorins, which act as
repulsive guidance cues, Sema7A enhances central and
peripheral axon growth and is required for proper axon
tract formation during embryonic development. Sema7A
also plays a critical role in the negative regulation
of T cell activation and function. Sema7A is a
membrane-anchored member of the semaphorin family of
proteins. Semaphorins are regulatory molecules in the
development of the nervous system and in axonal
guidance. They also play important roles in other
biological processes, such as angiogenesis, immune
regulation, respiration systems and cancer. The Sema
domain is located at the N-terminus and contains four
disulfide bonds formed by eight conserved cysteine
residues. It serves as a receptor-recognition and
-binding module.
Length = 414
Score = 25.2 bits (55), Expect = 5.5
Identities = 8/32 (25%), Positives = 12/32 (37%)
Query: 4 GTVRDVCFIEDTSNKSSLLISGGAGDCKIYVT 35
G+ V I D + DC+ Y+T
Sbjct: 31 GSAVIVKKIPDEKTEKDCKKRATLDDCENYIT 62
>gnl|CDD|206711 cd04140, ARHI_like, A Ras homolog member I (ARHI). ARHI (A Ras
homolog member I) is a member of the Ras family with
several unique structural and functional properties.
ARHI is expressed in normal human ovarian and breast
tissue, but its expression is decreased or eliminated
in breast and ovarian cancer. ARHI contains an
N-terminal extension of 34 residues (human) that is
required to retain its tumor suppressive activity.
Unlike most other Ras family members, ARHI is
maintained in the constitutively active (GTP-bound)
state in resting cells and has modest GTPase activity.
ARHI inhibits STAT3 (signal transducers and activators
of transcription 3), a latent transcription factor
whose abnormal activation plays a critical role in
oncogenesis. Most Ras proteins contain a lipid
modification site at the C-terminus, with a typical
sequence motif CaaX, where a = an aliphatic amino acid
and X = any amino acid. Lipid binding is essential for
membrane attachment, a key feature of most Ras
proteins. Due to the presence of truncated sequences in
this CD, the lipid modification site is not available
for annotation.
Length = 165
Score = 25.2 bits (55), Expect = 5.7
Identities = 15/48 (31%), Positives = 20/48 (41%), Gaps = 6/48 (12%)
Query: 12 IEDTSNKSSLLISGGAGDCKIYVTDCATGQAFQALSGHS---GHVLSL 56
IEDT + +IS C + +TD F A+ S GH L
Sbjct: 34 IEDTYRQ---VISCSKSICTLQITDTTGSHQFPAMQRLSISKGHAFIL 78
>gnl|CDD|237263 PRK12999, PRK12999, pyruvate carboxylase; Reviewed.
Length = 1146
Score = 25.1 bits (56), Expect = 7.6
Identities = 7/10 (70%), Positives = 7/10 (70%)
Query: 54 LSLYNWGGAT 63
SL WGGAT
Sbjct: 572 FSLEMWGGAT 581
>gnl|CDD|234545 TIGR04326, O_ant_LIC13510, surface carbohydrate biosynthesis
protein, LIC13510 family. This uncharacterized, rare
protein occurs in the highly variable O-antigen region
of some strains of Leptospira, as well as strains of and
serves as a phylogenetic marker for the likely presence
of six additional proteins, including an activated
sugar-nucleotidyltransferase, an activated sugar
epimerase and a dehydratase, an aldolase, and a DegT
family aminotransferase. The patterns suggests a role in
preparing a novel sugar for O-antigen incorporation.
Length = 602
Score = 24.7 bits (54), Expect = 9.3
Identities = 9/20 (45%), Positives = 10/20 (50%)
Query: 60 GGATVLSGSHDRTVRFWDLR 79
GG + G TVR DLR
Sbjct: 359 GGNGPVFGVQHSTVRPTDLR 378
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.321 0.136 0.437
Gapped
Lambda K H
0.267 0.0728 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 4,420,397
Number of extensions: 333806
Number of successful extensions: 281
Number of sequences better than 10.0: 1
Number of HSP's gapped: 272
Number of HSP's successfully gapped: 25
Length of query: 93
Length of database: 10,937,602
Length adjustment: 60
Effective length of query: 33
Effective length of database: 8,276,362
Effective search space: 273119946
Effective search space used: 273119946
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 53 (24.2 bits)