RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy11287
         (112 letters)



>gnl|CDD|197624 smart00280, KAZAL, Kazal type serine protease inhibitors.  Kazal
          type serine protease inhibitors and follistatin-like
          domains.
          Length = 46

 Score = 54.6 bits (132), Expect = 1e-11
 Identities = 23/45 (51%), Positives = 29/45 (64%), Gaps = 1/45 (2%)

Query: 45 CPS-CSAEYNPVCGSDGISYENECKLNLEACQHSRQISVLYIGLC 88
          CP  C  EY+PVCGSDG++Y NEC L   AC+  + I V + G C
Sbjct: 2  CPEACPREYDPVCGSDGVTYSNECHLCKAACESGKSIEVKHDGPC 46


>gnl|CDD|238052 cd00104, KAZAL_FS, Kazal type serine protease inhibitors and
          follistatin-like domains. Kazal inhibitors inhibit
          serine proteases, such as, trypsin, chyomotrypsin,
          avian ovomucoids, and elastases. The inhibitory domain
          has one reactive site peptide bond, which serves the
          cognate enzyme as substrate. The reactive site peptide
          bond is a combining loop which has an identical
          conformation in all Kazal inhibitors and in all
          enzyme/inhibitor complexes. These Kazal domains (small
          hydrophobic core of alpha/beta structure with 3 to 4
          disulfide bonds) often occur in tandem arrays. Similar
          domains are also present in follistatin (FS) and
          follistatin-like family members, which play an
          important role in tissue specific regulation. The FS
          domain consists of an N-terminal beta hairpin
          (FOLN/EGF-like domain) and a Kazal-like domain and has
          five disulfide bonds. Although the  Kazal-like FS
          substructure is similar to Kazal proteinase inhibitors,
          no FS domain has yet been shown to be a proteinase
          inhibitor. Follistatin-like family members include
          SPARC, also known as, BM-40 or osteonectin, the Gallus
          gallus Flik protein, as well as, agrin which has a long
          array of FS domains. The kazal-type inhibitor domain
          has also been detected in an extracellular loop region
          of solute carrier 21 (SLC21) family members (organic
          anion transporters) , which may regulate the
          specificity of anion uptake. The distant homolog,
          Ascidian trypsin inhibitor, is included in this CD.
          Length = 41

 Score = 51.1 bits (123), Expect = 3e-10
 Identities = 22/41 (53%), Positives = 27/41 (65%)

Query: 48 CSAEYNPVCGSDGISYENECKLNLEACQHSRQISVLYIGLC 88
          C  EY+PVCGSDG +Y NEC L   AC+  R I+V + G C
Sbjct: 1  CPKEYDPVCGSDGKTYSNECHLGCAACRSGRSITVAHNGPC 41


>gnl|CDD|200959 pfam00050, Kazal_1, Kazal-type serine protease inhibitor domain. 
          Usually indicative of serine protease inhibitors.
          However, kazal-like domains are also seen in the
          extracellular part of agrins, which are not known to be
          protease inhibitors. Kazal domains often occur in
          tandem arrays. Small alpha+beta fold containing three
          disulphides. Alignment also includes a single domain
          from transporters in the OATP/PGT family.
          Length = 48

 Score = 47.4 bits (113), Expect = 8e-09
 Identities = 19/41 (46%), Positives = 25/41 (60%)

Query: 48 CSAEYNPVCGSDGISYENECKLNLEACQHSRQISVLYIGLC 88
          C  EY+PVCGSDG++Y NEC+L  E  +    I   + G C
Sbjct: 8  CPREYDPVCGSDGVTYSNECELCNENVKQGTNIHKKHDGPC 48


>gnl|CDD|238648 cd01327, KAZAL_PSTI, Kazal-type pancreatic secretory trypsin
          inhibitors (PSTI) and related proteins, including the
          second domain of the ovomucoid turkey inhibitor and the
          C-terminal domain of the esophagus cancer-related
          gene-2 protein (ECRG-2), are members of the superfamily
          of kazal-type proteinase inhibitors and
          follistatin-like proteins.
          Length = 45

 Score = 41.9 bits (99), Expect = 9e-07
 Identities = 17/43 (39%), Positives = 25/43 (58%)

Query: 46 PSCSAEYNPVCGSDGISYENECKLNLEACQHSRQISVLYIGLC 88
            C  +Y+PVCG+DG++Y NEC L  E  +    I + + G C
Sbjct: 3  FGCPKDYDPVCGTDGVTYSNECLLCAENLKRQTNIRIKHDGEC 45


>gnl|CDD|191798 pfam07648, Kazal_2, Kazal-type serine protease inhibitor domain. 
          Usually indicative of serine protease inhibitors.
          However, kazal-like domains are also seen in the
          extracellular part of agrins, which are not known to be
          protease inhibitors. Kazal domains often occur in
          tandem arrays. Small alpha+beta fold containing three
          disulphides.
          Length = 42

 Score = 41.6 bits (98), Expect = 1e-06
 Identities = 17/39 (43%), Positives = 21/39 (53%)

Query: 50 AEYNPVCGSDGISYENECKLNLEACQHSRQISVLYIGLC 88
              PVCGSDG +Y N+C L   +C    +I V Y G C
Sbjct: 4  ENDEPVCGSDGKTYSNKCALCCASCLLGEKIKVKYEGSC 42


>gnl|CDD|233136 TIGR00805, oat, sodium-independent organic anion transporter.  The
           Organo Anion Transporter (OAT) Family (TC
           2.A.60)Proteins of the OAT family catalyze the
           Na+-independent facilitated transport of organic anions
           such as bromosulfobromophthalein and prostaglandins as
           well as conjugated and unconjugated bile acids
           (taurocholate and cholate, respectively). These
           transporters have been characterized in mammals, but
           homologues are present in C. elegans and A. thaliana.
           Some of the mammalian proteins exhibit a high degree of
           tissue specificity. For example, the rat OAT is found at
           high levels in liver and kidney and at lower levels in
           other tissues. These proteins possess 10-12 putative
           a-helical transmembrane spanners. They may catalyze
           electrogenic anion uniport or anion exchange.
          Length = 633

 Score = 32.0 bits (73), Expect = 0.040
 Identities = 10/25 (40%), Positives = 17/25 (68%), Gaps = 2/25 (8%)

Query: 43  CKCPSCSAEYNPVCGSDGISYENEC 67
           C C S  + ++PVCG +G++Y + C
Sbjct: 453 CSCDS--SFFDPVCGDNGLAYLSPC 475


>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
          growth factor (EGF) presents in a large number of
          proteins, mostly animal; the list of proteins currently
          known to contain one or more copies of an EGF-like
          pattern is large and varied; the functional
          significance of EGF-like domains in what appear to be
          unrelated proteins is not yet clear; a common feature
          is that these repeats are found in the extracellular
          domain of membrane-bound proteins or in proteins known
          to be secreted (exception: prostaglandin G/H synthase);
          the domain includes six cysteine residues which have
          been shown to be involved in disulfide bonds; the main
          structure is a two-stranded beta-sheet followed by a
          loop to a C-terminal short two-stranded sheet;
          Subdomains between the conserved cysteines vary in
          length; the region between the 5th and 6th cysteine
          contains two conserved glycines of which at  least  one
           is  present  in  most EGF-like domains; a subset of
          these bind calcium.
          Length = 36

 Score = 27.4 bits (61), Expect = 0.29
 Identities = 12/24 (50%), Positives = 13/24 (54%)

Query: 24 SSNPCRNDGHCVVKNGKAVCKCPS 47
          +SNPC N G CV   G   C CP 
Sbjct: 4  ASNPCSNGGTCVNTPGSYRCVCPP 27


>gnl|CDD|217384 pfam03137, OATP, Organic Anion Transporter Polypeptide (OATP)
           family.  This family consists of several eukaryotic
           Organic-Anion-Transporting Polypeptides (OATPs). Several
           have been identified mostly in human and rat. Different
           OATPs vary in tissue distribution and substrate
           specificity. Since the numbering of different OATPs in
           particular species was based originally on the order of
           discovery, similarly numbered OATPs in humans and rats
           did not necessarily correspond in function, tissue
           distribution and substrate specificity (in spite of the
           name, some OATPs also transport organic cations and
           neutral molecules). Thus, Tamai et al. initiated the
           current scheme of using digits for rat OATPs and letters
           for human ones. Prostaglandin transporter (PGT) proteins
           are also considered to be OATP family members. In
           addition, the methotrexate transporter OATK is closely
           related to OATPs. This family also includes several
           predicted proteins from Caenorhabditis elegans and
           Drosophila melanogaster. This similarity was not
           previously noted. Note: Members of this family are
           described (in the Swiss-Prot database) as belonging to
           the SLC21 family of transporters.
          Length = 582

 Score = 29.2 bits (66), Expect = 0.42
 Identities = 13/81 (16%), Positives = 21/81 (25%), Gaps = 27/81 (33%)

Query: 25  SNPCRNDGHCVVKNGKAVCKCPSCSAEYNPVCGSDGISY--------------ENECKLN 70
            + C  D           C C +  + + PVCG +G++Y               +     
Sbjct: 422 LSSCNED-----------CSCDT--SFFPPVCGDNGLTYLSPCHAGSSSSGTGCDTSCST 468

Query: 71  LEACQHSRQISVLYIGLCSKG 91
                 S        G C   
Sbjct: 469 WSNNCSSGNSHSASKGYCPSD 489


>gnl|CDD|238649 cd01328, FSL_SPARC, Follistatin-like SPARC (secreted protein,
          acidic, and rich in cysteines) domain;
          SPARC/BM-40/osteonectin is a multifunctional
          glycoprotein which modulates cellular interaction with
          the extracellular matrix by its binding to structural
          matrix proteins such as collagen and vitronectin. The
          protein it composed of an N-terminal acidic region, a
          follistatin (FS) domain and an EF-hand calcium binding
          domain. The FS domain consists of an N-terminal beta
          hairpin (FOLN/EGF-like domain) and a small hydrophobic
          core of alpha/beta structure (Kazal domain) and has
          five disulfide bonds and a conserved N-glycosylation
          site. The FSL_SPARC domain is a member of the
          superfamily of kazal-like proteinase inhibitors and
          follistatin-like proteins.
          Length = 86

 Score = 27.4 bits (61), Expect = 0.68
 Identities = 15/58 (25%), Positives = 25/58 (43%), Gaps = 5/58 (8%)

Query: 22 PCSSNPCRNDGHCVV-KNGKAVCKC-PSCSAEYNP---VCGSDGISYENECKLNLEAC 74
          PC ++ C     C V       C C   C  E +    VC +D  +++++C+L    C
Sbjct: 1  PCENHHCGAGKVCEVDDENTPKCVCIDPCPEEVDDRRKVCTNDNETFDSDCELYRTRC 58


>gnl|CDD|238650 cd01330, KAZAL_SLC21, The kazal-type serine protease inhibitor
          domain has been detected in an extracellular loop
          region of solute carrier 21 (SLC21) family members
          (organic anion transporters) , which may regulate the
          specificity of anion uptake. The KAZAL_SLC21 domain is
          a member of the superfamily of kazal-like proteinase
          inhibitors and follistatin-like proteins.
          Length = 54

 Score = 26.9 bits (60), Expect = 0.70
 Identities = 11/25 (44%), Positives = 16/25 (64%), Gaps = 2/25 (8%)

Query: 43 CKCPSCSAEYNPVCGSDGISYENEC 67
          C C    + Y+PVCG +GI+Y + C
Sbjct: 9  CSCSE--SAYSPVCGENGITYFSPC 31


>gnl|CDD|151186 pfam10690, Myticin-prepro, Myticin pre-proprotein from the
          mussel.  Myticin is a cysteine-rich peptide produced in
          three isoforms, A, B and C, by Mytilus
          galloprovincialis, the Mediterranean mussel. Some
          isoforms show antibacterial activity against
          gram-positive bacteria, while others are additionally
          active against the fungus Fusarium oxysporum and a
          gram-negative bacterium, Escherichia coli D31.
          Myticin-prepro is the precursor peptide. The mature
          molecule, named myticin, consists of 40 residues, with
          four intramolecular disulfide bridges and a cysteine
          array in the primary structure different from that of
          previously characterized cysteine-rich antimicrobial
          peptides. The first 20 amino acids are a putative
          signal peptide, and the antimicrobial peptide sequence
          is a 36-residue C-terminal extension. Such a structure
          suggests that myticins are synthesised as
          prepro-proteins that are then processed by various
          proteolytic events before storage in the haemocytes as
          the active peptide. Myticin precursors are expressed
          mainly in the haemocytes. The family Mytilin has been
          merged into this family.
          Length = 98

 Score = 27.2 bits (60), Expect = 0.88
 Identities = 16/48 (33%), Positives = 22/48 (45%), Gaps = 1/48 (2%)

Query: 23 CSSNPCRNDGHCVVKNGKAVCKCPSCSAEYNPVCGSDGISYENECKLN 70
          C S  C     CV+ +    CKC  CS E++P   +     ENE  L+
Sbjct: 34 CGSARCTYYV-CVLLHPGKYCKCLHCSREHSPFALTMKAKPENEGPLD 80


>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
          large number of membrane-bound and extracellular
          (mostly animal) proteins. Many of these proteins
          require calcium for their biological function and
          calcium-binding sites have been found to be located at
          the N-terminus of particular EGF-like domains;
          calcium-binding may be crucial for numerous
          protein-protein interactions. Six conserved core
          cysteines form three disulfide bridges as in non
          calcium-binding EGF domains, whose structures are very
          similar. EGF_CA can be found in tandem repeat
          arrangements.
          Length = 38

 Score = 26.1 bits (58), Expect = 0.90
 Identities = 12/23 (52%), Positives = 13/23 (56%)

Query: 24 SSNPCRNDGHCVVKNGKAVCKCP 46
          S NPC+N G CV   G   C CP
Sbjct: 7  SGNPCQNGGTCVNTVGSYRCSCP 29


>gnl|CDD|219240 pfam06957, COPI_C, Coatomer (COPI) alpha subunit C-terminus.  This
           family represents the C-terminus (approximately 500
           residues) of the eukaryotic coatomer alpha subunit.
           Coatomer (COPI) is a large cytosolic protein complex
           which forms a coat around vesicles budding from the
           Golgi apparatus. Such coatomer-coated vesicles have been
           proposed to play a role in many distinct steps of
           intracellular transport. Note that many family members
           also contain the pfam04053 domain.
          Length = 421

 Score = 27.9 bits (62), Expect = 1.0
 Identities = 8/16 (50%), Positives = 10/16 (62%)

Query: 39  GKAVCKCPSCSAEYNP 54
           GK   KCP C A ++P
Sbjct: 376 GKPDVKCPYCGASFSP 391


>gnl|CDD|215652 pfam00008, EGF, EGF-like domain.  There is no clear separation
          between noise and signal. pfam00053 is very similar,
          but has 8 instead of 6 conserved cysteines. Includes
          some cytokine receptors. The EGF domain misses the
          N-terminus regions of the Ca2+ binding EGF domains
          (this is the main reason of discrepancy between
          swiss-prot domain start/end and Pfam). The family is
          hard to model due to many similar but different
          sub-types of EGF domains. Pfam certainly misses a
          number of EGF domains.
          Length = 32

 Score = 25.5 bits (56), Expect = 1.4
 Identities = 13/25 (52%), Positives = 14/25 (56%), Gaps = 1/25 (4%)

Query: 23 CSSN-PCRNDGHCVVKNGKAVCKCP 46
          CS N PC N G CV   G   C+CP
Sbjct: 1  CSPNNPCSNGGTCVDTPGGYTCECP 25


>gnl|CDD|181992 PRK09615, ggt, gamma-glutamyltranspeptidase; Reviewed.
          Length = 581

 Score = 27.6 bits (61), Expect = 1.8
 Identities = 11/24 (45%), Positives = 16/24 (66%), Gaps = 2/24 (8%)

Query: 89  SKGLLREID--KARQENEIPPGKL 110
           +K +  +ID  KA+  +EI PGKL
Sbjct: 361 AKSIADQIDINKAKPSSEIRPGKL 384


>gnl|CDD|215096 PLN00192, PLN00192, aldehyde oxidase.
          Length = 1344

 Score = 27.4 bits (61), Expect = 2.1
 Identities = 16/63 (25%), Positives = 22/63 (34%), Gaps = 12/63 (19%)

Query: 5   LASRFQFDSVGYLGETGPCSSNPCRNDGHCVVKNGKAVCKCPSC------------SAEY 52
           LA  F FD +  L E+   SSN   + G    +N                      + EY
Sbjct: 517 LAVGFLFDFLSPLIESNAKSSNGWLDGGSNTKQNPDQHDDVKKPTLLLSSKQQVEENNEY 576

Query: 53  NPV 55
           +PV
Sbjct: 577 HPV 579


>gnl|CDD|214542 smart00179, EGF_CA, Calcium-binding EGF-like domain. 
          Length = 39

 Score = 24.9 bits (55), Expect = 2.6
 Identities = 12/23 (52%), Positives = 14/23 (60%)

Query: 24 SSNPCRNDGHCVVKNGKAVCKCP 46
          S NPC+N G CV   G   C+CP
Sbjct: 7  SGNPCQNGGTCVNTVGSYRCECP 29


>gnl|CDD|234962 PRK01584, PRK01584, alanyl-tRNA synthetase; Provisional.
          Length = 594

 Score = 26.3 bits (58), Expect = 4.5
 Identities = 16/42 (38%), Positives = 17/42 (40%), Gaps = 10/42 (23%)

Query: 15  GYLGETGPCSSNPCRNDGHCVVKNGKAVCKCPSCSAEYNPVC 56
           G +GETGPC       D    V  GK     P CS    P C
Sbjct: 163 GPVGETGPCGP-----DTEIFVDTGK-----PKCSVNCRPTC 194


>gnl|CDD|165263 PHA02954, PHA02954, EEV membrane glycoprotein; Provisional.
          Length = 317

 Score = 25.8 bits (56), Expect = 6.2
 Identities = 15/36 (41%), Positives = 19/36 (52%), Gaps = 3/36 (8%)

Query: 49 SAEYNPVCGSDGISYENECKLNLEACQHSRQISVLY 84
          S + N VC +D   YEN CK   + C  S  +S LY
Sbjct: 54 SLDPNAVCETDKWKYENPCK---KMCTVSDYVSELY 86


>gnl|CDD|237665 PRK14295, PRK14295, chaperone protein DnaJ; Provisional.
          Length = 389

 Score = 25.6 bits (56), Expect = 6.4
 Identities = 16/51 (31%), Positives = 18/51 (35%), Gaps = 5/51 (9%)

Query: 17  LGETGPCSSNPCRNDGHCVVKNGKAVCKCPSCSAEYNPVCGSDGISYENEC 67
           L    PC +  C   G    KNG     CP+CS        S G S    C
Sbjct: 163 LTSQAPCPA--CSGTG---AKNGTTPRVCPTCSGTGQVSRNSGGFSLSEPC 208


>gnl|CDD|149142 pfam07903, PaRep2a, PaRep2a protein.  This is a family of proteins
           expressed by the crenarchaeon Pyrobaculum aerophilum.
           The members are highly variable in length and level of
           conservation. The presence of numerous frameshifts and
           internal stop codons in multiple alignments are thought
           to indicate that most family members are no longer
           functional.
          Length = 122

 Score = 24.8 bits (54), Expect = 9.1
 Identities = 12/36 (33%), Positives = 16/36 (44%)

Query: 34  CVVKNGKAVCKCPSCSAEYNPVCGSDGISYENECKL 69
           C V   +AVCK     AE+      DG+  + E  L
Sbjct: 74  CFVTEREAVCKVGEAKAEFYVFDTPDGVYLKPEIPL 109


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.318    0.136    0.429 

Gapped
Lambda     K      H
   0.267   0.0737    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 5,231,002
Number of extensions: 393890
Number of successful extensions: 323
Number of sequences better than 10.0: 1
Number of HSP's gapped: 323
Number of HSP's successfully gapped: 36
Length of query: 112
Length of database: 10,937,602
Length adjustment: 76
Effective length of query: 36
Effective length of database: 7,566,698
Effective search space: 272401128
Effective search space used: 272401128
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 53 (24.2 bits)