RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy11287
(112 letters)
>gnl|CDD|197624 smart00280, KAZAL, Kazal type serine protease inhibitors. Kazal
type serine protease inhibitors and follistatin-like
domains.
Length = 46
Score = 54.6 bits (132), Expect = 1e-11
Identities = 23/45 (51%), Positives = 29/45 (64%), Gaps = 1/45 (2%)
Query: 45 CPS-CSAEYNPVCGSDGISYENECKLNLEACQHSRQISVLYIGLC 88
CP C EY+PVCGSDG++Y NEC L AC+ + I V + G C
Sbjct: 2 CPEACPREYDPVCGSDGVTYSNECHLCKAACESGKSIEVKHDGPC 46
>gnl|CDD|238052 cd00104, KAZAL_FS, Kazal type serine protease inhibitors and
follistatin-like domains. Kazal inhibitors inhibit
serine proteases, such as, trypsin, chyomotrypsin,
avian ovomucoids, and elastases. The inhibitory domain
has one reactive site peptide bond, which serves the
cognate enzyme as substrate. The reactive site peptide
bond is a combining loop which has an identical
conformation in all Kazal inhibitors and in all
enzyme/inhibitor complexes. These Kazal domains (small
hydrophobic core of alpha/beta structure with 3 to 4
disulfide bonds) often occur in tandem arrays. Similar
domains are also present in follistatin (FS) and
follistatin-like family members, which play an
important role in tissue specific regulation. The FS
domain consists of an N-terminal beta hairpin
(FOLN/EGF-like domain) and a Kazal-like domain and has
five disulfide bonds. Although the Kazal-like FS
substructure is similar to Kazal proteinase inhibitors,
no FS domain has yet been shown to be a proteinase
inhibitor. Follistatin-like family members include
SPARC, also known as, BM-40 or osteonectin, the Gallus
gallus Flik protein, as well as, agrin which has a long
array of FS domains. The kazal-type inhibitor domain
has also been detected in an extracellular loop region
of solute carrier 21 (SLC21) family members (organic
anion transporters) , which may regulate the
specificity of anion uptake. The distant homolog,
Ascidian trypsin inhibitor, is included in this CD.
Length = 41
Score = 51.1 bits (123), Expect = 3e-10
Identities = 22/41 (53%), Positives = 27/41 (65%)
Query: 48 CSAEYNPVCGSDGISYENECKLNLEACQHSRQISVLYIGLC 88
C EY+PVCGSDG +Y NEC L AC+ R I+V + G C
Sbjct: 1 CPKEYDPVCGSDGKTYSNECHLGCAACRSGRSITVAHNGPC 41
>gnl|CDD|200959 pfam00050, Kazal_1, Kazal-type serine protease inhibitor domain.
Usually indicative of serine protease inhibitors.
However, kazal-like domains are also seen in the
extracellular part of agrins, which are not known to be
protease inhibitors. Kazal domains often occur in
tandem arrays. Small alpha+beta fold containing three
disulphides. Alignment also includes a single domain
from transporters in the OATP/PGT family.
Length = 48
Score = 47.4 bits (113), Expect = 8e-09
Identities = 19/41 (46%), Positives = 25/41 (60%)
Query: 48 CSAEYNPVCGSDGISYENECKLNLEACQHSRQISVLYIGLC 88
C EY+PVCGSDG++Y NEC+L E + I + G C
Sbjct: 8 CPREYDPVCGSDGVTYSNECELCNENVKQGTNIHKKHDGPC 48
>gnl|CDD|238648 cd01327, KAZAL_PSTI, Kazal-type pancreatic secretory trypsin
inhibitors (PSTI) and related proteins, including the
second domain of the ovomucoid turkey inhibitor and the
C-terminal domain of the esophagus cancer-related
gene-2 protein (ECRG-2), are members of the superfamily
of kazal-type proteinase inhibitors and
follistatin-like proteins.
Length = 45
Score = 41.9 bits (99), Expect = 9e-07
Identities = 17/43 (39%), Positives = 25/43 (58%)
Query: 46 PSCSAEYNPVCGSDGISYENECKLNLEACQHSRQISVLYIGLC 88
C +Y+PVCG+DG++Y NEC L E + I + + G C
Sbjct: 3 FGCPKDYDPVCGTDGVTYSNECLLCAENLKRQTNIRIKHDGEC 45
>gnl|CDD|191798 pfam07648, Kazal_2, Kazal-type serine protease inhibitor domain.
Usually indicative of serine protease inhibitors.
However, kazal-like domains are also seen in the
extracellular part of agrins, which are not known to be
protease inhibitors. Kazal domains often occur in
tandem arrays. Small alpha+beta fold containing three
disulphides.
Length = 42
Score = 41.6 bits (98), Expect = 1e-06
Identities = 17/39 (43%), Positives = 21/39 (53%)
Query: 50 AEYNPVCGSDGISYENECKLNLEACQHSRQISVLYIGLC 88
PVCGSDG +Y N+C L +C +I V Y G C
Sbjct: 4 ENDEPVCGSDGKTYSNKCALCCASCLLGEKIKVKYEGSC 42
>gnl|CDD|233136 TIGR00805, oat, sodium-independent organic anion transporter. The
Organo Anion Transporter (OAT) Family (TC
2.A.60)Proteins of the OAT family catalyze the
Na+-independent facilitated transport of organic anions
such as bromosulfobromophthalein and prostaglandins as
well as conjugated and unconjugated bile acids
(taurocholate and cholate, respectively). These
transporters have been characterized in mammals, but
homologues are present in C. elegans and A. thaliana.
Some of the mammalian proteins exhibit a high degree of
tissue specificity. For example, the rat OAT is found at
high levels in liver and kidney and at lower levels in
other tissues. These proteins possess 10-12 putative
a-helical transmembrane spanners. They may catalyze
electrogenic anion uniport or anion exchange.
Length = 633
Score = 32.0 bits (73), Expect = 0.040
Identities = 10/25 (40%), Positives = 17/25 (68%), Gaps = 2/25 (8%)
Query: 43 CKCPSCSAEYNPVCGSDGISYENEC 67
C C S + ++PVCG +G++Y + C
Sbjct: 453 CSCDS--SFFDPVCGDNGLAYLSPC 475
>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
growth factor (EGF) presents in a large number of
proteins, mostly animal; the list of proteins currently
known to contain one or more copies of an EGF-like
pattern is large and varied; the functional
significance of EGF-like domains in what appear to be
unrelated proteins is not yet clear; a common feature
is that these repeats are found in the extracellular
domain of membrane-bound proteins or in proteins known
to be secreted (exception: prostaglandin G/H synthase);
the domain includes six cysteine residues which have
been shown to be involved in disulfide bonds; the main
structure is a two-stranded beta-sheet followed by a
loop to a C-terminal short two-stranded sheet;
Subdomains between the conserved cysteines vary in
length; the region between the 5th and 6th cysteine
contains two conserved glycines of which at least one
is present in most EGF-like domains; a subset of
these bind calcium.
Length = 36
Score = 27.4 bits (61), Expect = 0.29
Identities = 12/24 (50%), Positives = 13/24 (54%)
Query: 24 SSNPCRNDGHCVVKNGKAVCKCPS 47
+SNPC N G CV G C CP
Sbjct: 4 ASNPCSNGGTCVNTPGSYRCVCPP 27
>gnl|CDD|217384 pfam03137, OATP, Organic Anion Transporter Polypeptide (OATP)
family. This family consists of several eukaryotic
Organic-Anion-Transporting Polypeptides (OATPs). Several
have been identified mostly in human and rat. Different
OATPs vary in tissue distribution and substrate
specificity. Since the numbering of different OATPs in
particular species was based originally on the order of
discovery, similarly numbered OATPs in humans and rats
did not necessarily correspond in function, tissue
distribution and substrate specificity (in spite of the
name, some OATPs also transport organic cations and
neutral molecules). Thus, Tamai et al. initiated the
current scheme of using digits for rat OATPs and letters
for human ones. Prostaglandin transporter (PGT) proteins
are also considered to be OATP family members. In
addition, the methotrexate transporter OATK is closely
related to OATPs. This family also includes several
predicted proteins from Caenorhabditis elegans and
Drosophila melanogaster. This similarity was not
previously noted. Note: Members of this family are
described (in the Swiss-Prot database) as belonging to
the SLC21 family of transporters.
Length = 582
Score = 29.2 bits (66), Expect = 0.42
Identities = 13/81 (16%), Positives = 21/81 (25%), Gaps = 27/81 (33%)
Query: 25 SNPCRNDGHCVVKNGKAVCKCPSCSAEYNPVCGSDGISY--------------ENECKLN 70
+ C D C C + + + PVCG +G++Y +
Sbjct: 422 LSSCNED-----------CSCDT--SFFPPVCGDNGLTYLSPCHAGSSSSGTGCDTSCST 468
Query: 71 LEACQHSRQISVLYIGLCSKG 91
S G C
Sbjct: 469 WSNNCSSGNSHSASKGYCPSD 489
>gnl|CDD|238649 cd01328, FSL_SPARC, Follistatin-like SPARC (secreted protein,
acidic, and rich in cysteines) domain;
SPARC/BM-40/osteonectin is a multifunctional
glycoprotein which modulates cellular interaction with
the extracellular matrix by its binding to structural
matrix proteins such as collagen and vitronectin. The
protein it composed of an N-terminal acidic region, a
follistatin (FS) domain and an EF-hand calcium binding
domain. The FS domain consists of an N-terminal beta
hairpin (FOLN/EGF-like domain) and a small hydrophobic
core of alpha/beta structure (Kazal domain) and has
five disulfide bonds and a conserved N-glycosylation
site. The FSL_SPARC domain is a member of the
superfamily of kazal-like proteinase inhibitors and
follistatin-like proteins.
Length = 86
Score = 27.4 bits (61), Expect = 0.68
Identities = 15/58 (25%), Positives = 25/58 (43%), Gaps = 5/58 (8%)
Query: 22 PCSSNPCRNDGHCVV-KNGKAVCKC-PSCSAEYNP---VCGSDGISYENECKLNLEAC 74
PC ++ C C V C C C E + VC +D +++++C+L C
Sbjct: 1 PCENHHCGAGKVCEVDDENTPKCVCIDPCPEEVDDRRKVCTNDNETFDSDCELYRTRC 58
>gnl|CDD|238650 cd01330, KAZAL_SLC21, The kazal-type serine protease inhibitor
domain has been detected in an extracellular loop
region of solute carrier 21 (SLC21) family members
(organic anion transporters) , which may regulate the
specificity of anion uptake. The KAZAL_SLC21 domain is
a member of the superfamily of kazal-like proteinase
inhibitors and follistatin-like proteins.
Length = 54
Score = 26.9 bits (60), Expect = 0.70
Identities = 11/25 (44%), Positives = 16/25 (64%), Gaps = 2/25 (8%)
Query: 43 CKCPSCSAEYNPVCGSDGISYENEC 67
C C + Y+PVCG +GI+Y + C
Sbjct: 9 CSCSE--SAYSPVCGENGITYFSPC 31
>gnl|CDD|151186 pfam10690, Myticin-prepro, Myticin pre-proprotein from the
mussel. Myticin is a cysteine-rich peptide produced in
three isoforms, A, B and C, by Mytilus
galloprovincialis, the Mediterranean mussel. Some
isoforms show antibacterial activity against
gram-positive bacteria, while others are additionally
active against the fungus Fusarium oxysporum and a
gram-negative bacterium, Escherichia coli D31.
Myticin-prepro is the precursor peptide. The mature
molecule, named myticin, consists of 40 residues, with
four intramolecular disulfide bridges and a cysteine
array in the primary structure different from that of
previously characterized cysteine-rich antimicrobial
peptides. The first 20 amino acids are a putative
signal peptide, and the antimicrobial peptide sequence
is a 36-residue C-terminal extension. Such a structure
suggests that myticins are synthesised as
prepro-proteins that are then processed by various
proteolytic events before storage in the haemocytes as
the active peptide. Myticin precursors are expressed
mainly in the haemocytes. The family Mytilin has been
merged into this family.
Length = 98
Score = 27.2 bits (60), Expect = 0.88
Identities = 16/48 (33%), Positives = 22/48 (45%), Gaps = 1/48 (2%)
Query: 23 CSSNPCRNDGHCVVKNGKAVCKCPSCSAEYNPVCGSDGISYENECKLN 70
C S C CV+ + CKC CS E++P + ENE L+
Sbjct: 34 CGSARCTYYV-CVLLHPGKYCKCLHCSREHSPFALTMKAKPENEGPLD 80
>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
large number of membrane-bound and extracellular
(mostly animal) proteins. Many of these proteins
require calcium for their biological function and
calcium-binding sites have been found to be located at
the N-terminus of particular EGF-like domains;
calcium-binding may be crucial for numerous
protein-protein interactions. Six conserved core
cysteines form three disulfide bridges as in non
calcium-binding EGF domains, whose structures are very
similar. EGF_CA can be found in tandem repeat
arrangements.
Length = 38
Score = 26.1 bits (58), Expect = 0.90
Identities = 12/23 (52%), Positives = 13/23 (56%)
Query: 24 SSNPCRNDGHCVVKNGKAVCKCP 46
S NPC+N G CV G C CP
Sbjct: 7 SGNPCQNGGTCVNTVGSYRCSCP 29
>gnl|CDD|219240 pfam06957, COPI_C, Coatomer (COPI) alpha subunit C-terminus. This
family represents the C-terminus (approximately 500
residues) of the eukaryotic coatomer alpha subunit.
Coatomer (COPI) is a large cytosolic protein complex
which forms a coat around vesicles budding from the
Golgi apparatus. Such coatomer-coated vesicles have been
proposed to play a role in many distinct steps of
intracellular transport. Note that many family members
also contain the pfam04053 domain.
Length = 421
Score = 27.9 bits (62), Expect = 1.0
Identities = 8/16 (50%), Positives = 10/16 (62%)
Query: 39 GKAVCKCPSCSAEYNP 54
GK KCP C A ++P
Sbjct: 376 GKPDVKCPYCGASFSP 391
>gnl|CDD|215652 pfam00008, EGF, EGF-like domain. There is no clear separation
between noise and signal. pfam00053 is very similar,
but has 8 instead of 6 conserved cysteines. Includes
some cytokine receptors. The EGF domain misses the
N-terminus regions of the Ca2+ binding EGF domains
(this is the main reason of discrepancy between
swiss-prot domain start/end and Pfam). The family is
hard to model due to many similar but different
sub-types of EGF domains. Pfam certainly misses a
number of EGF domains.
Length = 32
Score = 25.5 bits (56), Expect = 1.4
Identities = 13/25 (52%), Positives = 14/25 (56%), Gaps = 1/25 (4%)
Query: 23 CSSN-PCRNDGHCVVKNGKAVCKCP 46
CS N PC N G CV G C+CP
Sbjct: 1 CSPNNPCSNGGTCVDTPGGYTCECP 25
>gnl|CDD|181992 PRK09615, ggt, gamma-glutamyltranspeptidase; Reviewed.
Length = 581
Score = 27.6 bits (61), Expect = 1.8
Identities = 11/24 (45%), Positives = 16/24 (66%), Gaps = 2/24 (8%)
Query: 89 SKGLLREID--KARQENEIPPGKL 110
+K + +ID KA+ +EI PGKL
Sbjct: 361 AKSIADQIDINKAKPSSEIRPGKL 384
>gnl|CDD|215096 PLN00192, PLN00192, aldehyde oxidase.
Length = 1344
Score = 27.4 bits (61), Expect = 2.1
Identities = 16/63 (25%), Positives = 22/63 (34%), Gaps = 12/63 (19%)
Query: 5 LASRFQFDSVGYLGETGPCSSNPCRNDGHCVVKNGKAVCKCPSC------------SAEY 52
LA F FD + L E+ SSN + G +N + EY
Sbjct: 517 LAVGFLFDFLSPLIESNAKSSNGWLDGGSNTKQNPDQHDDVKKPTLLLSSKQQVEENNEY 576
Query: 53 NPV 55
+PV
Sbjct: 577 HPV 579
>gnl|CDD|214542 smart00179, EGF_CA, Calcium-binding EGF-like domain.
Length = 39
Score = 24.9 bits (55), Expect = 2.6
Identities = 12/23 (52%), Positives = 14/23 (60%)
Query: 24 SSNPCRNDGHCVVKNGKAVCKCP 46
S NPC+N G CV G C+CP
Sbjct: 7 SGNPCQNGGTCVNTVGSYRCECP 29
>gnl|CDD|234962 PRK01584, PRK01584, alanyl-tRNA synthetase; Provisional.
Length = 594
Score = 26.3 bits (58), Expect = 4.5
Identities = 16/42 (38%), Positives = 17/42 (40%), Gaps = 10/42 (23%)
Query: 15 GYLGETGPCSSNPCRNDGHCVVKNGKAVCKCPSCSAEYNPVC 56
G +GETGPC D V GK P CS P C
Sbjct: 163 GPVGETGPCGP-----DTEIFVDTGK-----PKCSVNCRPTC 194
>gnl|CDD|165263 PHA02954, PHA02954, EEV membrane glycoprotein; Provisional.
Length = 317
Score = 25.8 bits (56), Expect = 6.2
Identities = 15/36 (41%), Positives = 19/36 (52%), Gaps = 3/36 (8%)
Query: 49 SAEYNPVCGSDGISYENECKLNLEACQHSRQISVLY 84
S + N VC +D YEN CK + C S +S LY
Sbjct: 54 SLDPNAVCETDKWKYENPCK---KMCTVSDYVSELY 86
>gnl|CDD|237665 PRK14295, PRK14295, chaperone protein DnaJ; Provisional.
Length = 389
Score = 25.6 bits (56), Expect = 6.4
Identities = 16/51 (31%), Positives = 18/51 (35%), Gaps = 5/51 (9%)
Query: 17 LGETGPCSSNPCRNDGHCVVKNGKAVCKCPSCSAEYNPVCGSDGISYENEC 67
L PC + C G KNG CP+CS S G S C
Sbjct: 163 LTSQAPCPA--CSGTG---AKNGTTPRVCPTCSGTGQVSRNSGGFSLSEPC 208
>gnl|CDD|149142 pfam07903, PaRep2a, PaRep2a protein. This is a family of proteins
expressed by the crenarchaeon Pyrobaculum aerophilum.
The members are highly variable in length and level of
conservation. The presence of numerous frameshifts and
internal stop codons in multiple alignments are thought
to indicate that most family members are no longer
functional.
Length = 122
Score = 24.8 bits (54), Expect = 9.1
Identities = 12/36 (33%), Positives = 16/36 (44%)
Query: 34 CVVKNGKAVCKCPSCSAEYNPVCGSDGISYENECKL 69
C V +AVCK AE+ DG+ + E L
Sbjct: 74 CFVTEREAVCKVGEAKAEFYVFDTPDGVYLKPEIPL 109
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.318 0.136 0.429
Gapped
Lambda K H
0.267 0.0737 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 5,231,002
Number of extensions: 393890
Number of successful extensions: 323
Number of sequences better than 10.0: 1
Number of HSP's gapped: 323
Number of HSP's successfully gapped: 36
Length of query: 112
Length of database: 10,937,602
Length adjustment: 76
Effective length of query: 36
Effective length of database: 7,566,698
Effective search space: 272401128
Effective search space used: 272401128
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 53 (24.2 bits)