RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy9419
         (739 letters)



>gnl|CDD|214542 smart00179, EGF_CA, Calcium-binding EGF-like domain. 
          Length = 39

 Score = 41.1 bits (97), Expect = 4e-05
 Identities = 17/31 (54%), Positives = 21/31 (67%), Gaps = 1/31 (3%)

Query: 67 DVDECAESRHLCGPGAVCINHPGSYTCQCPP 97
          D+DECA S + C  G  C+N  GSY C+CPP
Sbjct: 1  DIDECA-SGNPCQNGGTCVNTVGSYRCECPP 30



 Score = 40.7 bits (96), Expect = 5e-05
 Identities = 17/31 (54%), Positives = 19/31 (61%)

Query: 569 DIDECWSSNTCGSNAVCINTPGSYDCRCKEG 599
           DIDEC S N C +   C+NT GSY C C  G
Sbjct: 1   DIDECASGNPCQNGGTCVNTVGSYRCECPPG 31



 Score = 37.6 bits (88), Expect = 5e-04
 Identities = 15/33 (45%), Positives = 20/33 (60%)

Query: 187 DVDECLGVSPCASSALCVNEKGGFKCVCPKGTT 219
           D+DEC   +PC +   CVN  G ++C CP G T
Sbjct: 1   DIDECASGNPCQNGGTCVNTVGSYRCECPPGYT 33



 Score = 37.6 bits (88), Expect = 5e-04
 Identities = 15/33 (45%), Positives = 20/33 (60%)

Query: 246 DVDECLGVSPCASSALCVNEKGGFKCVCPKGTT 278
           D+DEC   +PC +   CVN  G ++C CP G T
Sbjct: 1   DIDECASGNPCQNGGTCVNTVGSYRCECPPGYT 33



 Score = 26.8 bits (60), Expect = 4.6
 Identities = 14/29 (48%), Positives = 16/29 (55%), Gaps = 1/29 (3%)

Query: 510 CDSGAGLCGPGAQCLETGGSVECQCPAGY 538
           C SG   C  G  C+ T GS  C+CP GY
Sbjct: 5   CASG-NPCQNGGTCVNTVGSYRCECPPGY 32


>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
          large number of membrane-bound and extracellular
          (mostly animal) proteins. Many of these proteins
          require calcium for their biological function and
          calcium-binding sites have been found to be located at
          the N-terminus of particular EGF-like domains;
          calcium-binding may be crucial for numerous
          protein-protein interactions. Six conserved core
          cysteines form three disulfide bridges as in non
          calcium-binding EGF domains, whose structures are very
          similar. EGF_CA can be found in tandem repeat
          arrangements.
          Length = 38

 Score = 39.5 bits (93), Expect = 1e-04
 Identities = 17/31 (54%), Positives = 20/31 (64%), Gaps = 1/31 (3%)

Query: 67 DVDECAESRHLCGPGAVCINHPGSYTCQCPP 97
          D+DECA S + C  G  C+N  GSY C CPP
Sbjct: 1  DIDECA-SGNPCQNGGTCVNTVGSYRCSCPP 30



 Score = 39.2 bits (92), Expect = 2e-04
 Identities = 17/31 (54%), Positives = 19/31 (61%)

Query: 569 DIDECWSSNTCGSNAVCINTPGSYDCRCKEG 599
           DIDEC S N C +   C+NT GSY C C  G
Sbjct: 1   DIDECASGNPCQNGGTCVNTVGSYRCSCPPG 31



 Score = 38.8 bits (91), Expect = 2e-04
 Identities = 16/35 (45%), Positives = 21/35 (60%)

Query: 187 DVDECLGVSPCASSALCVNEKGGFKCVCPKGTTGD 221
           D+DEC   +PC +   CVN  G ++C CP G TG 
Sbjct: 1   DIDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGR 35



 Score = 38.8 bits (91), Expect = 2e-04
 Identities = 16/35 (45%), Positives = 21/35 (60%)

Query: 246 DVDECLGVSPCASSALCVNEKGGFKCVCPKGTTGD 280
           D+DEC   +PC +   CVN  G ++C CP G TG 
Sbjct: 1   DIDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGR 35



 Score = 29.1 bits (66), Expect = 0.55
 Identities = 15/31 (48%), Positives = 16/31 (51%), Gaps = 1/31 (3%)

Query: 510 CDSGAGLCGPGAQCLETGGSVECQCPAGYKG 540
           C SG   C  G  C+ T GS  C CP GY G
Sbjct: 5   CASG-NPCQNGGTCVNTVGSYRCSCPPGYTG 34


>gnl|CDD|219496 pfam07645, EGF_CA, Calcium-binding EGF domain. 
          Length = 42

 Score = 35.0 bits (81), Expect = 0.005
 Identities = 16/32 (50%), Positives = 19/32 (59%)

Query: 67 DVDECAESRHLCGPGAVCINHPGSYTCQCPPN 98
          DVDECA+  H C    VC+N  GS+ C CP  
Sbjct: 1  DVDECADGTHNCPANTVCVNTIGSFECVCPDG 32



 Score = 32.3 bits (74), Expect = 0.043
 Identities = 15/32 (46%), Positives = 23/32 (71%), Gaps = 1/32 (3%)

Query: 569 DIDECWS-SNTCGSNAVCINTPGSYDCRCKEG 599
           D+DEC   ++ C +N VC+NT GS++C C +G
Sbjct: 1   DVDECADGTHNCPANTVCVNTIGSFECVCPDG 32



 Score = 31.6 bits (72), Expect = 0.080
 Identities = 17/32 (53%), Positives = 21/32 (65%), Gaps = 1/32 (3%)

Query: 187 DVDEC-LGVSPCASSALCVNEKGGFKCVCPKG 217
           DVDEC  G   C ++ +CVN  G F+CVCP G
Sbjct: 1   DVDECADGTHNCPANTVCVNTIGSFECVCPDG 32



 Score = 31.6 bits (72), Expect = 0.080
 Identities = 17/32 (53%), Positives = 21/32 (65%), Gaps = 1/32 (3%)

Query: 246 DVDEC-LGVSPCASSALCVNEKGGFKCVCPKG 276
           DVDEC  G   C ++ +CVN  G F+CVCP G
Sbjct: 1   DVDECADGTHNCPANTVCVNTIGSFECVCPDG 32



 Score = 27.7 bits (62), Expect = 2.3
 Identities = 14/33 (42%), Positives = 16/33 (48%)

Query: 510 CDSGAGLCGPGAQCLETGGSVECQCPAGYKGNP 542
           C  G   C     C+ T GS EC CP GY+ N 
Sbjct: 5   CADGTHNCPANTVCVNTIGSFECVCPDGYENNE 37


>gnl|CDD|238752 cd01475, vWA_Matrilin, VWA_Matrilin: In cartilaginous plate,
           extracellular matrix molecules mediate cell-matrix and
           matrix-matrix interactions thereby providing tissue
           integrity. Some members of the matrilin family are
           expressed specifically in developing cartilage
           rudiments. The matrilin family consists of at least four
           members. All the members of the matrilin family contain
           VWA domains, EGF-like domains and a heptad repeat
           coiled-coil domain at the carboxy terminus which is
           responsible for the oligomerization of the matrilins.
           The VWA domains have been shown to be essential for
           matrilin network formation by interacting with matrix
           ligands.
          Length = 224

 Score = 37.4 bits (87), Expect = 0.014
 Identities = 15/33 (45%), Positives = 17/33 (51%), Gaps = 2/33 (6%)

Query: 63  GYCEDVDECAESRHLCGPGAVCINHPGSYTCQC 95
             C   D CA   H+C    VCI+ PGSY C C
Sbjct: 182 KICVVPDLCATLSHVCQ--QVCISTPGSYLCAC 212



 Score = 36.2 bits (84), Expect = 0.032
 Identities = 17/38 (44%), Positives = 21/38 (55%), Gaps = 1/38 (2%)

Query: 567 CVDIDECWSSNTCGSNAVCINTPGSYDCRCKEGNAGNP 604
           CV  D C ++ +     VCI+TPGSY C C EG A   
Sbjct: 184 CVVPDLC-ATLSHVCQQVCISTPGSYLCACTEGYALLE 220


>gnl|CDD|205157 pfam12947, EGF_3, EGF domain.  This family includes a variety of
           EGF-like domain homologues. This family includes the
           C-terminal domain of the malaria parasite MSP1 protein.
          Length = 36

 Score = 33.3 bits (77), Expect = 0.018
 Identities = 15/32 (46%), Positives = 17/32 (53%)

Query: 510 CDSGAGLCGPGAQCLETGGSVECQCPAGYKGN 541
           C    G C P A C  TGGS  C C +GY G+
Sbjct: 1   CAENNGGCHPNATCTNTGGSFTCTCKSGYTGD 32



 Score = 32.1 bits (74), Expect = 0.052
 Identities = 15/32 (46%), Positives = 18/32 (56%)

Query: 71  CAESRHLCGPGAVCINHPGSYTCQCPPNSSGD 102
           CAE+   C P A C N  GS+TC C    +GD
Sbjct: 1   CAENNGGCHPNATCTNTGGSFTCTCKSGYTGD 32



 Score = 31.4 bits (72), Expect = 0.096
 Identities = 13/25 (52%), Positives = 15/25 (60%)

Query: 579 CGSNAVCINTPGSYDCRCKEGNAGN 603
           C  NA C NT GS+ C CK G  G+
Sbjct: 8   CHPNATCTNTGGSFTCTCKSGYTGD 32



 Score = 26.7 bits (60), Expect = 4.3
 Identities = 12/26 (46%), Positives = 13/26 (50%)

Query: 196 PCASSALCVNEKGGFKCVCPKGTTGD 221
            C  +A C N  G F C C  G TGD
Sbjct: 7   GCHPNATCTNTGGSFTCTCKSGYTGD 32



 Score = 26.7 bits (60), Expect = 4.3
 Identities = 12/26 (46%), Positives = 13/26 (50%)

Query: 255 PCASSALCVNEKGGFKCVCPKGTTGD 280
            C  +A C N  G F C C  G TGD
Sbjct: 7   GCHPNATCTNTGGSFTCTCKSGYTGD 32


>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
           growth factor (EGF) and present in a large number of
           proteins, mostly animal; the list of proteins currently
           known to contain one or more copies of an EGF-like
           pattern is large and varied; the functional significance
           of EGF-like domains in what appear to be unrelated
           proteins is not yet clear; a common feature is that
           these repeats are found in the extracellular domain of
           membrane-bound proteins or in proteins known to be
           secreted (exception: prostaglandin G/H synthase); the
           domain includes six cysteine residues which have been
           shown to be involved in disulfide bonds; the main
           structure is a two-stranded beta-sheet followed by a
           loop to a C-terminal short two-stranded sheet;
           Subdomains between the conserved cysteines vary in
           length; the region between the 5th and 6th cysteine
           contains two conserved glycines of which at least one
           is present in most EGF-like domains; a subset of
           these bind calcium.
          Length = 36

 Score = 32.1 bits (73), Expect = 0.052
 Identities = 15/28 (53%), Positives = 18/28 (64%)

Query: 572 ECWSSNTCGSNAVCINTPGSYDCRCKEG 599
           EC +SN C +   C+NTPGSY C C  G
Sbjct: 1   ECAASNPCSNGGTCVNTPGSYRCVCPPG 28



 Score = 31.3 bits (71), Expect = 0.11
 Identities = 18/33 (54%), Positives = 21/33 (63%), Gaps = 1/33 (3%)

Query: 70  ECAESRHLCGPGAVCINHPGSYTCQCPPNSSGD 102
           ECA S + C  G  C+N PGSY C CPP  +GD
Sbjct: 1   ECAAS-NPCSNGGTCVNTPGSYRCVCPPGYTGD 32



 Score = 30.9 bits (70), Expect = 0.14
 Identities = 16/34 (47%), Positives = 21/34 (61%)

Query: 190 ECLGVSPCASSALCVNEKGGFKCVCPKGTTGDPY 223
           EC   +PC++   CVN  G ++CVCP G TGD  
Sbjct: 1   ECAASNPCSNGGTCVNTPGSYRCVCPPGYTGDRS 34



 Score = 30.9 bits (70), Expect = 0.14
 Identities = 16/34 (47%), Positives = 21/34 (61%)

Query: 249 ECLGVSPCASSALCVNEKGGFKCVCPKGTTGDPY 282
           EC   +PC++   CVN  G ++CVCP G TGD  
Sbjct: 1   ECAASNPCSNGGTCVNTPGSYRCVCPPGYTGDRS 34



 Score = 27.8 bits (62), Expect = 1.7
 Identities = 12/29 (41%), Positives = 14/29 (48%)

Query: 515 GLCGPGAQCLETGGSVECQCPAGYKGNPY 543
             C  G  C+ T GS  C CP GY G+  
Sbjct: 6   NPCSNGGTCVNTPGSYRCVCPPGYTGDRS 34


>gnl|CDD|214544 smart00181, EGF, Epidermal growth factor-like domain. 
          Length = 35

 Score = 29.4 bits (66), Expect = 0.46
 Identities = 17/28 (60%), Positives = 17/28 (60%), Gaps = 1/28 (3%)

Query: 572 ECWSSNTCGSNAVCINTPGSYDCRCKEG 599
           EC S   C SN  CINTPGSY C C  G
Sbjct: 1   ECASGGPC-SNGTCINTPGSYTCSCPPG 27



 Score = 25.9 bits (57), Expect = 7.6
 Identities = 17/31 (54%), Positives = 18/31 (58%), Gaps = 1/31 (3%)

Query: 73  ESRHLCGPGAVCINHPGSYTCQCPPNSSGDP 103
            S   C  G  CIN PGSYTC CPP  +GD 
Sbjct: 3   ASGGPCSNG-TCINTPGSYTCSCPPGYTGDK 32


>gnl|CDD|221695 pfam12662, cEGF, Complement Clr-like EGF-like.  cEGF, or complement
           Clr-like EGF, domains have six conserved cysteine
           residues disulfide-bonded into the characteristic
           pattern 'ababcc'. They are found in blood coagulation
           proteins such as fibrillin, Clr and Cls, thrombomodulin,
           and the LDL receptor. The core fold of the EGF domain
           consists of two small beta-hairpins packed against each
           other. Two major structural variants have been
           identified based on the structural context of the
           C-terminal cysteine residue of disulfide 'c' in the
           C-terminal hairpin: hEGFs and cEGFs. In cEGFs the
           C-terminal thiol resides on the C-terminal beta-sheet,
           resulting in long loop-lengths between the cysteine
           residues of disulfide 'c', typically C[10+]XC. These
           longer loop-lengths may have arisen by selective
           cysteine loss from a four-disulfide EGF template such as
           laminin or integrin. Tandem cEGF domains have five
           linking residues between terminal cysteines of adjacent
           domains. cEGF domains may or may not bind calcium in the
           linker region. cEGF domains with the consensus motif
           CXN4X[F,Y]XCXC are hydroxylated exclusively on the
           asparagine residue.
          Length = 24

 Score = 28.2 bits (64), Expect = 1.0
 Identities = 11/24 (45%), Positives = 13/24 (54%), Gaps = 1/24 (4%)

Query: 550 SVECQCPAGYKGNP-YVQCVDIDE 572
           S  C CP GY+ +     C DIDE
Sbjct: 1   SYTCSCPPGYQLSGDGRTCEDIDE 24



 Score = 27.8 bits (63), Expect = 1.3
 Identities = 11/22 (50%), Positives = 15/22 (68%), Gaps = 1/22 (4%)

Query: 50 YCACPKGFRPKEDGY-CEDVDE 70
           C+CP G++   DG  CED+DE
Sbjct: 3  TCSCPPGYQLSGDGRTCEDIDE 24


>gnl|CDD|218955 pfam06247, Plasmod_Pvs28, Plasmodium ookinete surface protein
           Pvs28.  This family consists of several ookinete surface
           protein (Pvs28) from several species of Plasmodium.
           Pvs25 and Pvs28 are expressed on the surface of
           ookinetes. These proteins are potential candidates for
           vaccine and induce antibodies that block the infectivity
           of Plasmodium vivax in immunised animals.
          Length = 196

 Score = 29.7 bits (67), Expect = 4.6
 Identities = 37/141 (26%), Positives = 49/141 (34%), Gaps = 20/141 (14%)

Query: 577 NTCGSNAVCINTPGSYDCRCKEG------NAGNPFVACTPVAVVPHSCEDPATCVCSKNA 630
           +T   N   I     ++C+C EG      N     V C  +  V   C + ATC+   N 
Sbjct: 5   DTICKNGYLIQMSNHFECKCNEGYVLKNENTCEEKVKCDKLENVNKVCGEYATCINQANK 64

Query: 631 P--------CPSGYVCKNSRCT-DLCANVRCGPRALCVQG----QCLCPSDLIGNPTDLT 677
                    C +GY      C  + C N  CG     V         C  + IG   D  
Sbjct: 65  AEEKALKCGCINGYTLSQGVCVPNKCNNKVCGSGKCIVDPANPNNTTCSCN-IGKVPDQN 123

Query: 678 RGCQVKGQCANDLECKPNEIC 698
             C   G+    L+CK NE C
Sbjct: 124 GKCTKTGETKCSLKCKENEEC 144


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.320    0.139    0.492 

Gapped
Lambda     K      H
   0.267   0.0580    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 34,515,591
Number of extensions: 3119193
Number of successful extensions: 2211
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2160
Number of HSP's successfully gapped: 158
Length of query: 739
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 635
Effective length of database: 6,324,786
Effective search space: 4016239110
Effective search space used: 4016239110
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 63 (28.4 bits)