RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy9424
         (535 letters)



>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
           large number of membrane-bound and extracellular (mostly
           animal) proteins. Many of these proteins require calcium
           for their biological function and calcium-binding sites
           have been found to be located at the N-terminus of
           particular EGF-like domains; calcium-binding may be
           crucial for numerous protein-protein interactions. Six
           conserved core cysteines form three disulfide bridges as
           in non calcium-binding EGF domains, whose structures are
           very similar. EGF_CA can be found in tandem repeat
           arrangements.
          Length = 38

 Score = 32.2 bits (74), Expect = 0.035
 Identities = 17/34 (50%), Positives = 20/34 (58%), Gaps = 1/34 (2%)

Query: 280 IDFCAAK-PCGPGARCDNSRGSYKCLCPLGLVGD 312
           ID CA+  PC  G  C N+ GSY+C CP G  G 
Sbjct: 2   IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGR 35


>gnl|CDD|214542 smart00179, EGF_CA, Calcium-binding EGF-like domain. 
          Length = 39

 Score = 30.7 bits (70), Expect = 0.12
 Identities = 16/30 (53%), Positives = 19/30 (63%), Gaps = 1/30 (3%)

Query: 280 IDFCAAK-PCGPGARCDNSRGSYKCLCPLG 308
           ID CA+  PC  G  C N+ GSY+C CP G
Sbjct: 2   IDECASGNPCQNGGTCVNTVGSYRCECPPG 31


>gnl|CDD|214709 smart00532, LIGANc, Ligase N family. 
          Length = 441

 Score = 32.2 bits (74), Expect = 0.76
 Identities = 14/52 (26%), Positives = 19/52 (36%), Gaps = 15/52 (28%)

Query: 213 PVGCHRVECNSHADCSGDKVCEDHRCKISCLANNPCGPNALCSAEKHKQICY 264
           P  C    C S       +V  D RC           PN LC A+  ++I +
Sbjct: 399 PTHC--PSCGSELVREEGEV--DIRC-----------PNPLCPAQLIERIIH 435


>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
           growth factor (EGF) and present in a large number of
           proteins, mostly animal; the list of proteins currently
           known to contain one or more copies of an EGF-like
           pattern is large and varied; the functional significance
           of EGF-like domains in what appear to be unrelated
           proteins is not yet clear; a common feature is that
           these repeats are found in the extracellular domain of
           membrane-bound proteins or in proteins known to be
           secreted (exception: prostaglandin G/H synthase); the
           domain includes six cysteine residues which have been
           shown to be involved in disulfide bonds; the main
           structure is a two-stranded beta-sheet followed by a
           loop to a C-terminal short two-stranded sheet;
           Subdomains between the conserved cysteines vary in
           length; the region between the 5th and 6th cysteine
           contains two conserved glycines of which at least one
           is present in most EGF-like domains; a subset of
           these bind calcium.
          Length = 36

 Score = 28.2 bits (63), Expect = 0.80
 Identities = 14/28 (50%), Positives = 17/28 (60%)

Query: 287 PCGPGARCDNSRGSYKCLCPLGLVGDPY 314
           PC  G  C N+ GSY+C+CP G  GD  
Sbjct: 7   PCSNGGTCVNTPGSYRCVCPPGYTGDRS 34



 Score = 26.7 bits (59), Expect = 3.6
 Identities = 14/31 (45%), Positives = 15/31 (48%)

Query: 242 CLANNPCGPNALCSAEKHKQICYCQPGYTGD 272
           C A+NPC     C        C C PGYTGD
Sbjct: 2   CAASNPCSNGGTCVNTPGSYRCVCPPGYTGD 32


>gnl|CDD|236571 PRK09565, PRK09565, hypothetical protein; Reviewed.
          Length = 533

 Score = 30.9 bits (70), Expect = 2.1
 Identities = 11/49 (22%), Positives = 13/49 (26%)

Query: 164 ATCSCKPGYVGHPGPSMGTGPSSHSGHAGGKHGPGLSPGATSHSSHSGG 212
              +   G  G            H GH GG+ G        S   H  G
Sbjct: 266 PEDAADGGTGGTHDAEEFGEHGHHGGHPGGEDGEHPHGHEDSGGHHGSG 314


>gnl|CDD|106181 PRK13213, araD, L-ribulose-5-phosphate 4-epimerase; Reviewed.
          Length = 231

 Score = 30.1 bits (67), Expect = 2.4
 Identities = 16/47 (34%), Positives = 26/47 (55%), Gaps = 4/47 (8%)

Query: 186 SHSGHAG--GKHGPGLSPGATSHSSHSGGPVGCHRVECNSHADCSGD 230
           +HS HA    + G  LS   T+H+ +  GP+ C R+   + A+ +GD
Sbjct: 96  THSRHATIWAQAGKSLSALGTTHADYFYGPIPCTRLM--TEAEITGD 140


>gnl|CDD|215822 pfam00257, Dehydrin, Dehydrin. 
          Length = 137

 Score = 29.0 bits (65), Expect = 3.1
 Identities = 14/43 (32%), Positives = 15/43 (34%)

Query: 170 PGYVGHPGPSMGTGPSSHSGHAGGKHGPGLSPGATSHSSHSGG 212
            G  G  GP    G   H  H  G+HG        S SS S  
Sbjct: 25  KGEGGGTGPGGHGGGGEHGTHGHGEHGKLGGLLRRSGSSSSSS 67


>gnl|CDD|205157 pfam12947, EGF_3, EGF domain.  This family includes a variety of
           EGF-like domain homologues. This family includes the
           C-terminal domain of the malaria parasite MSP1 protein.
          Length = 36

 Score = 25.2 bits (56), Expect = 10.0
 Identities = 12/24 (50%), Positives = 14/24 (58%)

Query: 151 CGRNALCTASDHHATCSCKPGYVG 174
           C  NA CT +    TC+CK GY G
Sbjct: 8   CHPNATCTNTGGSFTCTCKSGYTG 31
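
Note: the hit list above follows a regular layout (a ">gnl|CDD|..." definition line, then one or more "Score = ... bits (...), Expect = ..." lines per hit), so it can be tabulated with a few regular expressions. The following is a minimal sketch that assumes exactly this legacy text layout; the function name parse_hsps and the dictionary keys are hypothetical, and other BLAST versions or output formats may need different patterns.

```python
import re

# Patterns for the legacy text layout shown in this report (an assumption:
# other BLAST versions or output formats may differ).
HIT_RE = re.compile(r"^>gnl\|CDD\|(\d+)\s+([^,]+),\s*([^,]+),")
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*([\d.eE+-]+)"
)


def parse_hsps(report_text):
    """Return one dict per HSP, tagged with the hit it belongs to."""
    hsps = []
    current = None
    for line in report_text.splitlines():
        hit = HIT_RE.match(line)
        if hit:
            # A new definition line starts a new hit.
            current = {
                "pssm_id": int(hit.group(1)),
                "accession": hit.group(2).strip(),
                "name": hit.group(3).strip(),
            }
            continue
        score = SCORE_RE.search(line)
        if score and current is not None:
            # A hit may carry several HSPs; each gets its own record.
            hsps.append({
                **current,
                "bits": float(score.group(1)),
                "raw": int(score.group(2)),
                "evalue": float(score.group(3)),
            })
    return hsps


# Excerpt copied from the cd00053 hit, which has two HSPs.
sample = """\
>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
          Length = 36

 Score = 28.2 bits (63), Expect = 0.80

 Score = 26.7 bits (59), Expect = 3.6
"""

for hsp in parse_hsps(sample):
    print(hsp["accession"], hsp["name"], hsp["bits"], hsp["evalue"])
```

Keeping one record per HSP rather than per hit preserves cases such as cd00053, where two separate regions of the query match the same domain model.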


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.320    0.135    0.476 

Gapped
Lambda     K      H
   0.267   0.0580    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 24,079,522
Number of extensions: 2076356
Number of successful extensions: 1671
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1617
Number of HSP's successfully gapped: 123
Length of query: 535
Length of database: 10,937,602
Length adjustment: 102
Effective length of query: 433
Effective length of database: 6,413,494
Effective search space: 2777042902
Effective search space used: 2777042902
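
Note: the effective lengths and search space reported above are consistent with subtracting the length adjustment once from the query and once per database sequence, then multiplying the two effective lengths. The sketch below only re-derives the printed figures from the other printed figures; the variable names are illustrative, and a real implementation may apply extra guards (for example against non-positive lengths) that are not shown here.

```python
# Values copied from the statistics block of this report.
query_len = 535
db_letters = 10_937_602
db_seqs = 44_354
length_adjustment = 102

# Trim the adjustment from the query once and from every database sequence.
eff_query = query_len - length_adjustment
eff_db = db_letters - db_seqs * length_adjustment

# The search space is the product of the two effective lengths.
search_space = eff_query * eff_db

print(eff_query)     # 433
print(eff_db)        # 6413494
print(search_space)  # 2777042902
```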
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 61 (27.6 bits)
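
Note: the bit values in parentheses on the X1-X3 and S1-S2 lines can be reproduced from the Lambda and K values printed earlier. The sketch below assumes the usual conversions, bits = lambda * X / ln 2 for the X drop-off values and bits = (lambda * S - ln K) / ln 2 for the score cutoffs. Which parameter set applies to which line (ungapped for X1 and S1, gapped for X2, X3 and S2) is an inference from the fact that these choices match the printed numbers, not something the report states. The sketch covers only these footer values and is not claimed to reproduce the per-hit bit scores or E-values.

```python
import math

# Lambda and K copied from the report (ungapped block, then "Gapped" block).
UNGAPPED = {"lam": 0.320, "K": 0.135}
GAPPED = {"lam": 0.267, "K": 0.0580}


def dropoff_bits(raw, params):
    """Convert a raw X drop-off value to bits (no K term)."""
    return params["lam"] * raw / math.log(2)


def cutoff_bits(raw, params):
    """Convert a raw score cutoff to a bit score."""
    return (params["lam"] * raw - math.log(params["K"])) / math.log(2)


x1 = round(dropoff_bits(16, UNGAPPED), 1)
x2 = round(dropoff_bits(38, GAPPED), 1)
x3 = round(dropoff_bits(64, GAPPED), 1)
s1 = round(cutoff_bits(41, UNGAPPED), 1)
s2 = round(cutoff_bits(61, GAPPED), 1)

print(x1, x2, x3, s1, s2)  # 7.4 14.6 24.7 21.8 27.6
```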