RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy3619
         (418 letters)



>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
           large number of membrane-bound and extracellular (mostly
           animal) proteins. Many of these proteins require calcium
           for their biological function and calcium-binding sites
           have been found to be located at the N-terminus of
           particular EGF-like domains; calcium-binding may be
           crucial for numerous protein-protein interactions. Six
           conserved core cysteines form three disulfide bridges as
           in non calcium-binding EGF domains, whose structures are
           very similar. EGF_CA can be found in tandem repeat
           arrangements.
          Length = 38

 Score = 51.9 bits (125), Expect = 3e-09
 Identities = 21/38 (55%), Positives = 26/38 (68%), Gaps = 1/38 (2%)

Query: 173 NIDDCKS-NPCLNGGICRDGVAKFTCDCPPGWTGSRCE 209
           +ID+C S NPC NGG C + V  + C CPPG+TG  CE
Sbjct: 1   DIDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38



 Score = 49.6 bits (119), Expect = 2e-08
 Identities = 20/37 (54%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 96  IDDCASE-PCQNGGSCVDMLDGFMCQCRPGFVGLQCE 131
           ID+CAS  PCQNGG+CV+ +  + C C PG+ G  CE
Sbjct: 2   IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38



 Score = 43.4 bits (103), Expect = 3e-06
 Identities = 19/40 (47%), Positives = 24/40 (60%), Gaps = 3/40 (7%)

Query: 133 DIDECLS-DPCSPEGTDKCLDLDNKFQCECNVGYTGVMCE 171
           DIDEC S +PC   GT  C++    ++C C  GYTG  CE
Sbjct: 1   DIDECASGNPCQNGGT--CVNTVGSYRCSCPPGYTGRNCE 38



 Score = 39.9 bits (94), Expect = 5e-05
 Identities = 12/24 (50%), Positives = 18/24 (75%)

Query: 70 GTCTDLVGGFSCSCDLGFTGKHCQ 93
          GTC + VG + CSC  G+TG++C+
Sbjct: 15 GTCVNTVGSYRCSCPPGYTGRNCE 38


>gnl|CDD|222092 pfam13385, Laminin_G_3, Concanavalin A-like lectin/glucanases
           superfamily.  This domain belongs to the Concanavalin
           A-like lectin/glucanases superfamily.
          Length = 156

 Score = 53.5 bits (129), Expect = 1e-08
 Identities = 27/86 (31%), Positives = 41/86 (47%), Gaps = 8/86 (9%)

Query: 274 ATINDGQWHHVAIVWTQGTLTLITEGLIAGKVENYGSGKSLPQYGWVVLGKPRPDNIKAY 333
           A +  GQWHHVA+ +  GTL L   G++ G     G+  +    G + +G          
Sbjct: 79  APLPPGQWHHVAVTYDGGTLKLYVNGVLVGSTTLSGTI-TSGTTGPLYIGA------SNG 131

Query: 334 TEAGFEGKLTKVQIWNRALDFTNEIQ 359
            +  F G + +V+I+NRAL    EI 
Sbjct: 132 GDRYFNGAIDEVRIYNRALS-AAEIA 156


>gnl|CDD|214542 smart00179, EGF_CA, Calcium-binding EGF-like domain. 
          Length = 39

 Score = 46.5 bits (111), Expect = 2e-07
 Identities = 21/39 (53%), Positives = 27/39 (69%), Gaps = 2/39 (5%)

Query: 173 NIDDCKS-NPCLNGGICRDGVAKFTCDCPPGWT-GSRCE 209
           +ID+C S NPC NGG C + V  + C+CPPG+T G  CE
Sbjct: 1   DIDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNCE 39



 Score = 44.9 bits (107), Expect = 9e-07
 Identities = 20/38 (52%), Positives = 27/38 (71%), Gaps = 2/38 (5%)

Query: 96  IDDCASE-PCQNGGSCVDMLDGFMCQCRPGFV-GLQCE 131
           ID+CAS  PCQNGG+CV+ +  + C+C PG+  G  CE
Sbjct: 2   IDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNCE 39



 Score = 39.2 bits (92), Expect = 9e-05
 Identities = 20/41 (48%), Positives = 25/41 (60%), Gaps = 4/41 (9%)

Query: 133 DIDECLS-DPCSPEGTDKCLDLDNKFQCECNVGYT-GVMCE 171
           DIDEC S +PC   GT  C++    ++CEC  GYT G  CE
Sbjct: 1   DIDECASGNPCQNGGT--CVNTVGSYRCECPPGYTDGRNCE 39



 Score = 34.9 bits (81), Expect = 0.003
 Identities = 11/25 (44%), Positives = 17/25 (68%), Gaps = 1/25 (4%)

Query: 70 GTCTDLVGGFSCSCDLGFT-GKHCQ 93
          GTC + VG + C C  G+T G++C+
Sbjct: 15 GTCVNTVGSYRCECPPGYTDGRNCE 39


>gnl|CDD|216930 pfam02210, Laminin_G_2, Laminin G domain.  This family includes the
           Thrombospondin N-terminal-like domain, a Laminin G
           subfamily.
          Length = 124

 Score = 47.1 bits (112), Expect = 1e-06
 Identities = 28/115 (24%), Positives = 39/115 (33%), Gaps = 14/115 (12%)

Query: 247 LIQAHSSGVQVSLFPELQDAFLAFH------------EYATINDGQWHHVAIVWTQGTLT 294
           L+ A        L  EL+D  L                   +NDGQWH V++     +LT
Sbjct: 10  LLYAGGEDGLDFLALELEDGRLVLRYDLGSGGSVLLLSGKKLNDGQWHRVSVSRDGRSLT 69

Query: 295 LITEGLIAGKVENYGSGKSLPQYGWVVLGK--PRPDNIKAYTEAGFEGKLTKVQI 347
           L  +G         GS   L   G + LG               GF G +  V++
Sbjct: 70  LSVDGGTVVSEALPGSSSILNLNGPLYLGGLPEDSGLSLLPVTEGFVGCIRNVRV 124


>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
           growth factor (EGF) presents in a large number of
           proteins, mostly animal; the list of proteins currently
           known to contain one or more copies of an EGF-like
           pattern is large and varied; the functional significance
           of EGF-like domains in what appear to be unrelated
           proteins is not yet clear; a common feature is that
           these repeats are found in the extracellular domain of
           membrane-bound proteins or in proteins known to be
           secreted (exception: prostaglandin G/H synthase); the
           domain includes six cysteine residues which have been
           shown to be involved in disulfide bonds; the main
           structure is a two-stranded beta-sheet followed by a
           loop to a C-terminal short two-stranded sheet;
           Subdomains between the conserved cysteines vary in
           length; the region between the 5th and 6th cysteine
           contains two conserved glycines of which at  least  one 
           is  present  in  most EGF-like domains; a subset of
           these bind calcium.
          Length = 36

 Score = 42.1 bits (99), Expect = 9e-06
 Identities = 17/32 (53%), Positives = 20/32 (62%), Gaps = 1/32 (3%)

Query: 179 SNPCLNGGICRDGVAKFTCDCPPGWTGS-RCE 209
           SNPC NGG C +    + C CPPG+TG   CE
Sbjct: 5   SNPCSNGGTCVNTPGSYRCVCPPGYTGDRSCE 36



 Score = 37.8 bits (88), Expect = 3e-04
 Identities = 14/28 (50%), Positives = 18/28 (64%)

Query: 100 ASEPCQNGGSCVDMLDGFMCQCRPGFVG 127
           AS PC NGG+CV+    + C C PG+ G
Sbjct: 4   ASNPCSNGGTCVNTPGSYRCVCPPGYTG 31



 Score = 31.3 bits (71), Expect = 0.050
 Identities = 10/25 (40%), Positives = 15/25 (60%), Gaps = 1/25 (4%)

Query: 70 GTCTDLVGGFSCSCDLGFTG-KHCQ 93
          GTC +  G + C C  G+TG + C+
Sbjct: 12 GTCVNTPGSYRCVCPPGYTGDRSCE 36



 Score = 29.4 bits (66), Expect = 0.24
 Identities = 17/38 (44%), Positives = 22/38 (57%), Gaps = 4/38 (10%)

Query: 136 EC-LSDPCSPEGTDKCLDLDNKFQCECNVGYTGVM-CE 171
           EC  S+PCS  GT  C++    ++C C  GYTG   CE
Sbjct: 1   ECAASNPCSNGGT--CVNTPGSYRCVCPPGYTGDRSCE 36


>gnl|CDD|215652 pfam00008, EGF, EGF-like domain.  There is no clear separation
           between noise and signal. pfam00053 is very similar, but
           has 8 instead of 6 conserved cysteines. Includes some
           cytokine receptors. The EGF domain misses the N-terminus
           regions of the Ca2+ binding EGF domains (this is the
           main reason of discrepancy between swiss-prot domain
           start/end and Pfam). The family is hard to model due to
           many similar but different sub-types of EGF domains.
           Pfam certainly misses a number of EGF domains.
          Length = 32

 Score = 39.7 bits (93), Expect = 4e-05
 Identities = 16/28 (57%), Positives = 19/28 (67%)

Query: 180 NPCLNGGICRDGVAKFTCDCPPGWTGSR 207
           NPC NGG C D    +TC+CP G+TG R
Sbjct: 5   NPCSNGGTCVDTPGGYTCECPEGYTGKR 32



 Score = 34.3 bits (79), Expect = 0.004
 Identities = 13/25 (52%), Positives = 17/25 (68%)

Query: 103 PCQNGGSCVDMLDGFMCQCRPGFVG 127
           PC NGG+CVD   G+ C+C  G+ G
Sbjct: 6   PCSNGGTCVDTPGGYTCECPEGYTG 30



 Score = 29.3 bits (66), Expect = 0.28
 Identities = 12/22 (54%), Positives = 15/22 (68%)

Query: 70 GTCTDLVGGFSCSCDLGFTGKH 91
          GTC D  GG++C C  G+TGK 
Sbjct: 11 GTCVDTPGGYTCECPEGYTGKR 32



 Score = 26.2 bits (58), Expect = 3.3
 Identities = 14/27 (51%), Positives = 16/27 (59%), Gaps = 2/27 (7%)

Query: 141 PCSPEGTDKCLDLDNKFQCECNVGYTG 167
           PCS  GT  C+D    + CEC  GYTG
Sbjct: 6   PCSNGGT--CVDTPGGYTCECPEGYTG 30


>gnl|CDD|238058 cd00110, LamG, Laminin G domain; Laminin G-like domains are usually
           Ca++ mediated receptors that can have binding sites for
           steroids, beta1 integrins, heparin, sulfatides,
           fibulin-1, and alpha-dystroglycans. Proteins that
           contain LamG domains serve a variety of purposes
           including signal transduction via cell-surface steroid
           receptors, adhesion, migration and differentiation
           through mediation of cell adhesion molecules.
          Length = 151

 Score = 42.8 bits (101), Expect = 6e-05
 Identities = 20/77 (25%), Positives = 37/77 (48%), Gaps = 4/77 (5%)

Query: 275 TINDGQWHHVAIVWTQGTLTLITEGLIAGKVENYGSGKSLPQYGWVVLGKPRPDNIKAYT 334
            +NDGQWH V++     ++TL  +G    +  + G    L   G + LG   P+++K+  
Sbjct: 76  PLNDGQWHSVSVERNGRSVTLSVDGERVVESGSPGGSALLNLDGPLYLGG-LPEDLKSPG 134

Query: 335 EA---GFEGKLTKVQIW 348
                GF G +  +++ 
Sbjct: 135 LPVSPGFVGCIRDLKVN 151


>gnl|CDD|184753 PRK14581, hmsF, outer membrane N-deacetylase; Provisional.
          Length = 672

 Score = 43.0 bits (101), Expect = 3e-04
 Identities = 22/52 (42%), Positives = 30/52 (57%), Gaps = 12/52 (23%)

Query: 327 PDNIKAYTEAGFEGKLTKV-------QIWNR-----ALDFTNEIQKQVRDCR 366
           PD I+AY +AGF G +T +       Q W R      +DFTNE+ ++VRD R
Sbjct: 478 PDAIRAYEKAGFPGSITTIRQDPEMMQRWTRYKSKYLIDFTNELTREVRDIR 529


>gnl|CDD|238086 cd00152, PTX, Pentraxins are plasma proteins characterized by their
           pentameric discoid assembly and their Ca2+ dependent
           ligand binding, such as Serum amyloid P component (SAP)
           and C-reactive Protein (CRP), which are
           cytokine-inducible acute-phase proteins implicated in
           innate immunity. CRP binds to ligands containing
           phosphocholine, SAP binds to amyloid fibrils, DNA,
           chromatin, fibronectin, C4-binding proteins and
           glycosaminoglycans. "Long" pentraxins have N-terminal
           extensions to the common pentraxin domain; one group,
           the neuronal pentraxins, may be involved in synapse
           formation and remodeling, and they may also be able to
           form heteromultimers.
          Length = 201

 Score = 40.7 bits (96), Expect = 5e-04
 Identities = 34/124 (27%), Positives = 59/124 (47%), Gaps = 24/124 (19%)

Query: 277 NDGQWHHVAIVWT--QGTLTLITEG-LIAGKVENYGSGKSLPQYGWVVLGKPRPDNIKAY 333
           +DG WHH+ + W    G   L   G L   K      G ++   G ++LG+   D+   Y
Sbjct: 88  SDGAWHHICVTWESTSGIAELWVNGKLSVRKS--LKKGYTVGPGGSIILGQ-EQDS---Y 141

Query: 334 ------TEAGFEGKLTKVQIWNRALDFTNEIQKQVRDCRTEPVLHTGLVLTWTG--YDDI 385
                 T++ F G+++ V +W+  L  + E   ++++  +E    +G +L W    Y +I
Sbjct: 142 GGGFDATQS-FVGEISDVNMWDSVL--SPE---EIKNVYSEGGTLSGNILNWRALNY-EI 194

Query: 386 NGGV 389
           NGGV
Sbjct: 195 NGGV 198


>gnl|CDD|214598 smart00282, LamG, Laminin G domain. 
          Length = 132

 Score = 39.2 bits (92), Expect = 7e-04
 Identities = 22/79 (27%), Positives = 37/79 (46%), Gaps = 4/79 (5%)

Query: 275 TINDGQWHHVAIVWTQGTLTLITEGLIAGKVENYGSGKSLPQYGWVVLGKPRPDNIKAYT 334
            +NDGQWH VA+     ++TL  +G      E+ G    L   G + LG   P+++K   
Sbjct: 55  PLNDGQWHRVAVERNGRSVTLSVDGGNRVSGESPGGLTILNLDGPLYLGG-LPEDLKLPP 113

Query: 335 EA---GFEGKLTKVQIWNR 350
                GF G +  +++  +
Sbjct: 114 LPVTPGFRGCIRNLKVNGK 132


>gnl|CDD|128463 smart00159, PTX, Pentraxin / C-reactive protein / pentaxin family. 
           This family form a doscoid pentameric structure. Human
           serum amyloid P demonstrates calcium-mediated
           ligand-binding.
          Length = 206

 Score = 38.8 bits (91), Expect = 0.003
 Identities = 31/121 (25%), Positives = 56/121 (46%), Gaps = 16/121 (13%)

Query: 276 INDGQWHHVAIVWTQGTLTLITEGLIAGKV---ENYGSGKSLPQYGWVVLGKPRPDNIKA 332
            +DG+WHH+   W   + + I E  + GK    +    G ++   G ++LG+ + D+   
Sbjct: 87  ESDGKWHHICTTWE--SSSGIAELWVDGKPGVRKGLAKGYTVKPGGSIILGQEQ-DSYGG 143

Query: 333 YTEAG--FEGKLTKVQIWNRALDFTNEIQKQVRDCRTEPVLHTGLVLTW--TGYDDINGG 388
             +A   F G++  + +W+  L  + E  K V    T      G +L W    Y +++GG
Sbjct: 144 GFDATQSFVGEIGDLNMWDSVL--SPEEIKSVYKGST---FSIGNILNWRALNY-EVHGG 197

Query: 389 V 389
           V
Sbjct: 198 V 198


>gnl|CDD|214544 smart00181, EGF, Epidermal growth factor-like domain. 
          Length = 35

 Score = 32.1 bits (73), Expect = 0.024
 Identities = 16/32 (50%), Positives = 19/32 (59%), Gaps = 2/32 (6%)

Query: 179 SNPCLNGGICRDGVAKFTCDCPPGWTGS-RCE 209
             PC NG  C +    +TC CPPG+TG  RCE
Sbjct: 5   GGPCSNG-TCINTPGSYTCSCPPGYTGDKRCE 35



 Score = 30.2 bits (68), Expect = 0.14
 Identities = 13/31 (41%), Positives = 19/31 (61%), Gaps = 2/31 (6%)

Query: 98  DCASE-PCQNGGSCVDMLDGFMCQCRPGFVG 127
           +CAS  PC NG +C++    + C C PG+ G
Sbjct: 1   ECASGGPCSNG-TCINTPGSYTCSCPPGYTG 30



 Score = 29.8 bits (67), Expect = 0.17
 Identities = 12/26 (46%), Positives = 17/26 (65%), Gaps = 1/26 (3%)

Query: 69 PGTCTDLVGGFSCSCDLGFTG-KHCQ 93
           GTC +  G ++CSC  G+TG K C+
Sbjct: 10 NGTCINTPGSYTCSCPPGYTGDKRCE 35



 Score = 27.1 bits (60), Expect = 1.6
 Identities = 15/38 (39%), Positives = 18/38 (47%), Gaps = 5/38 (13%)

Query: 136 ECLS-DPCSPEGTDKCLDLDNKFQCECNVGYTGV-MCE 171
           EC S  PCS      C++    + C C  GYTG   CE
Sbjct: 1   ECASGGPCSN---GTCINTPGSYTCSCPPGYTGDKRCE 35


>gnl|CDD|204999 pfam12661, hEGF, Human growth factor-like EGF.  hEGF, or human
           growth factor-like EGF, domains have six conserved
           residues disulfide-bonded into the characteristic
           'ababcc' pattern. They are involved in growth and
           proliferation of cells, in proteins of the Notch/Delta
           pathway, neurogulin and selectins. hEGFs are also found
           in mosaic proteins with four-disulfide laminin EGFs such
           as aggrecan and perlecan. The core fold of the EGF
           domain consists of two small beta-hairpins packed
           against each other. Two major structural variants have
           been identified based on the structural context of the
           C-terminal Cys residue of disulfide 'c' in the
           C-terminal hairpin: hEGFs and cEGFs. In hEGFs the
           C-terminal thiol resides in the beta-turn, resulting in
           shorter loop-lengths between the Cys residues of
           disulfide 'c', typically C[8-9]XC. These shorter
           loop-lengths are also typical of the four-disulfide EGF
           domains, laminin ad integrin. Tandem hEGF domains have
           six linking residues between terminal cysteines of
           adjacent domains. hEGF domains may or may not bind
           calcium in the linker region. hEGF domains with the
           consensus motif CXD4X[F,Y]XCXC are hydroxylated
           exclusively in the Asp residue.
          Length = 13

 Score = 30.0 bits (69), Expect = 0.14
 Identities = 9/13 (69%), Positives = 10/13 (76%)

Query: 196 TCDCPPGWTGSRC 208
            C CPPG+TG RC
Sbjct: 1   KCQCPPGYTGPRC 13


>gnl|CDD|165214 PHA02887, PHA02887, EGF-like protein; Provisional.
          Length = 126

 Score = 31.4 bits (71), Expect = 0.28
 Identities = 18/44 (40%), Positives = 25/44 (56%), Gaps = 7/44 (15%)

Query: 129 QCEADI-DECLSDPCSPEGTDKCLDLDNKFQCECNVGYTGVMCE 171
           +C+ D  D C++  C        +DLD KF C CN GYTG+ C+
Sbjct: 85  KCKNDFNDFCINGECM-----NIIDLDEKF-CICNKGYTGIRCD 122



 Score = 31.4 bits (71), Expect = 0.29
 Identities = 13/38 (34%), Positives = 21/38 (55%), Gaps = 6/38 (15%)

Query: 177 CKS---NPCLNGGICRDGV--AKFTCDCPPGWTGSRCE 209
           CK+   + C+NG  C + +   +  C C  G+TG RC+
Sbjct: 86  CKNDFNDFCING-ECMNIIDLDEKFCICNKGYTGIRCD 122


>gnl|CDD|214722 smart00560, LamGL, LamG-like jellyroll fold domain. 
          Length = 133

 Score = 30.9 bits (70), Expect = 0.60
 Identities = 25/80 (31%), Positives = 36/80 (45%), Gaps = 16/80 (20%)

Query: 279 GQWHHVAIVWTQ--GTLTLITEGLIAGKVEN--YGSGKSLPQYGWVVL--GKPRPDNIKA 332
           G W H+A V+    G L+L   G+     E     S  +LPQ G  +L  G         
Sbjct: 61  GVWVHLAGVYDGGAGKLSLYVNGVEVATSETQPSPSSGNLPQ-GGRILLGGAGG------ 113

Query: 333 YTEAGFEGKLTKVQIWNRAL 352
                F G+L +V+++NRAL
Sbjct: 114 ---ENFSGRLDEVRVYNRAL 130


>gnl|CDD|219496 pfam07645, EGF_CA, Calcium-binding EGF domain. 
          Length = 42

 Score = 28.5 bits (64), Expect = 0.66
 Identities = 10/32 (31%), Positives = 15/32 (46%), Gaps = 2/32 (6%)

Query: 96  IDDCASEP--CQNGGSCVDMLDGFMCQCRPGF 125
           +D+CA     C     CV+ +  F C C  G+
Sbjct: 2   VDECADGTHNCPANTVCVNTIGSFECVCPDGY 33



 Score = 26.5 bits (59), Expect = 3.1
 Identities = 10/34 (29%), Positives = 15/34 (44%)

Query: 133 DIDECLSDPCSPEGTDKCLDLDNKFQCECNVGYT 166
           D+DEC     +      C++    F+C C  GY 
Sbjct: 1   DVDECADGTHNCPANTVCVNTIGSFECVCPDGYE 34


>gnl|CDD|184754 PRK14582, pgaB, outer membrane N-deacetylase; Provisional.
          Length = 671

 Score = 32.0 bits (73), Expect = 0.72
 Identities = 18/52 (34%), Positives = 26/52 (50%), Gaps = 12/52 (23%)

Query: 327 PDNIKAYTEAGFEGKLTKV-------QIWNR----AL-DFTNEIQKQVRDCR 366
              I AY +AGF G L+++       + W R    AL DFT E+  +V+  R
Sbjct: 478 APAITAYQQAGFSGSLSEIRQNPEQFKQWTRFKSRALTDFTLELSARVKAIR 529


>gnl|CDD|165381 PHA03099, PHA03099, epidermal growth factor-like protein (EGF-like
           protein); Provisional.
          Length = 139

 Score = 30.0 bits (67), Expect = 1.1
 Identities = 15/38 (39%), Positives = 20/38 (52%), Gaps = 9/38 (23%)

Query: 142 CSPEGTDKCL--------DLDNKFQCECNVGYTGVMCE 171
           C PEG   CL        D+D  + C C+ GYTG+ C+
Sbjct: 45  CGPEGDGYCLHGDCIHARDIDGMY-CRCSHGYTGIRCQ 81



 Score = 27.7 bits (61), Expect = 7.5
 Identities = 9/21 (42%), Positives = 13/21 (61%)

Query: 76 VGGFSCSCDLGFTGKHCQHLI 96
          + G  C C  G+TG  CQH++
Sbjct: 64 IDGMYCRCSHGYTGIRCQHVV 84


>gnl|CDD|219677 pfam07974, EGF_2, EGF-like domain.  This family contains EGF
           domains found in a variety of extracellular proteins.
          Length = 31

 Score = 27.4 bits (61), Expect = 1.3
 Identities = 10/30 (33%), Positives = 12/30 (40%), Gaps = 2/30 (6%)

Query: 179 SNPCLNGGICRDGVAKFTCDCPPGWTGSRC 208
           S  C   G C     K  C C  G+ G+ C
Sbjct: 4   SGICNGRGTCVRPCGK--CVCDSGYQGATC 31



 Score = 25.9 bits (57), Expect = 4.1
 Identities = 11/31 (35%), Positives = 13/31 (41%), Gaps = 2/31 (6%)

Query: 100 ASEPCQNGGSCVDMLDGFMCQCRPGFVGLQC 130
           AS  C   G+CV       C C  G+ G  C
Sbjct: 3   ASGICNGRGTCVR--PCGKCVCDSGYQGATC 31


>gnl|CDD|214560 smart00210, TSPN, Thrombospondin N-terminal -like domains.
           Heparin-binding and cell adhesion domain of
           thrombospondin.
          Length = 184

 Score = 30.0 bits (68), Expect = 1.5
 Identities = 16/61 (26%), Positives = 21/61 (34%), Gaps = 15/61 (24%)

Query: 240 SVQGRRPLI----QAHSSGVQVSLFPELQDAFLAFHEYATINDGQWHHVAIVWTQGTLTL 295
            V GR   +    Q          F  L    LA        DGQWH +A+  +  + TL
Sbjct: 85  EVDGRANTLLLRYQGVDGKQHTVSFRNLP---LA--------DGQWHKLALSVSGSSATL 133

Query: 296 I 296
            
Sbjct: 134 Y 134


>gnl|CDD|113868 pfam05113, DUF693, Protein of unknown function (DUF693).  This
           family consists of several uncharacterized proteins from
           Borrelia burgdorferi (Lyme disease spirochete).
          Length = 311

 Score = 30.6 bits (69), Expect = 1.5
 Identities = 8/19 (42%), Positives = 13/19 (68%)

Query: 341 KLTKVQIWNRALDFTNEIQ 359
           K  K+ +WN  LDF++ I+
Sbjct: 61  KKAKIVLWNLPLDFSDHIK 79


>gnl|CDD|221695 pfam12662, cEGF, Complement Clr-like EGF-like.  cEGF, or complement
           Clr-like EGF, domains have six conserved cysteine
           residues disulfide-bonded into the characteristic
           pattern 'ababcc'. They are found in blood coagulation
           proteins such as fibrillin, Clr and Cls, thrombomodulin,
           and the LDL receptor. The core fold of the EGF domain
           consists of two small beta-hairpins packed against each
           other. Two major structural variants have been
           identified based on the structural context of the
           C-terminal cysteine residue of disulfide 'c' in the
           C-terminal hairpin: hEGFs and cEGFs. In cEGFs the
           C-terminal thiol resides on the C-terminal beta-sheet,
           resulting in long loop-lengths between the cysteine
           residues of disulfide 'c', typically C[10+]XC. These
           longer loop-lengths may have arisen by selective
           cysteine loss from a four-disulfide EGF template such as
           laminin or integrin. Tandem cEGF domains have five
           linking residues between terminal cysteines of adjacent
           domains. cEGF domains may or may not bind calcium in the
           linker region. cEGF domains with the consensus motif
           CXN4X[F,Y]XCXC are hydroxylated exclusively on the
           asparagine residue.
          Length = 24

 Score = 26.6 bits (60), Expect = 1.8
 Identities = 11/25 (44%), Positives = 13/25 (52%), Gaps = 5/25 (20%)

Query: 116 GFMCQCRPGFV----GLQCEADIDE 136
            + C C PG+     G  CE DIDE
Sbjct: 1   SYTCSCPPGYQLSGDGRTCE-DIDE 24


>gnl|CDD|205157 pfam12947, EGF_3, EGF domain.  This family includes a variety of
          EGF-like domain homologues. This family includes the
          C-terminal domain of the malaria parasite MSP1 protein.
          Length = 36

 Score = 26.0 bits (58), Expect = 4.0
 Identities = 10/19 (52%), Positives = 14/19 (73%)

Query: 71 TCTDLVGGFSCSCDLGFTG 89
          TCT+  G F+C+C  G+TG
Sbjct: 13 TCTNTGGSFTCTCKSGYTG 31



 Score = 26.0 bits (58), Expect = 4.1
 Identities = 11/28 (39%), Positives = 12/28 (42%), Gaps = 2/28 (7%)

Query: 140 DPCSPEGTDKCLDLDNKFQCECNVGYTG 167
             C P  T  C +    F C C  GYTG
Sbjct: 6   GGCHPNAT--CTNTGGSFTCTCKSGYTG 31


>gnl|CDD|215681 pfam00054, Laminin_G_1, Laminin G domain. 
          Length = 131

 Score = 28.1 bits (63), Expect = 4.7
 Identities = 9/24 (37%), Positives = 13/24 (54%), Gaps = 2/24 (8%)

Query: 274 ATINDGQWHHV--AIVWTQGTLTL 295
             +NDG+WH V        GTL++
Sbjct: 48  DKLNDGKWHSVELERNGRSGTLSV 71


>gnl|CDD|143593 cd07569, DCase, N-carbamyl-D-amino acid amidohydrolase (DCase,
           class 6 nitrilases).  DCase hydrolyses
           N-carbamyl-D-amino acids to produce D-amino acids. It is
           an important biocatalyst in the pharmaceutical industry,
           producing useful D-amino acids for example in the
           preparation of beta-lactam antibiotics. This subgroup
           belongs to a larger nitrilase superfamily comprised of
           nitrile- or amide-hydrolyzing enzymes and
           amide-condensing enzymes, which depend on a Glu-Lys-Cys
           catalytic triad. This superfamily has been classified in
           the literature based on global and structure based
           sequence analysis into thirteen different enzyme classes
           (referred to as 1-13), this subgroup corresponds to
           class 6. Members of this superfamily generally form
           homomeric complexes, the basic building block of which
           is a homodimer. Agrobacterium radiobacter DCase forms a
           tetramer (dimer of dimers). Some DCases may form
           trimers.
          Length = 302

 Score = 28.4 bits (64), Expect = 6.1
 Identities = 11/24 (45%), Positives = 14/24 (58%), Gaps = 2/24 (8%)

Query: 247 LIQAHSSGVQVSLFPELQDAFLAF 270
           L +A S G Q+ +FPEL  A   F
Sbjct: 31  LEEAASRGAQLVVFPEL--ALTTF 52


>gnl|CDD|218955 pfam06247, Plasmod_Pvs28, Plasmodium ookinete surface protein
           Pvs28.  This family consists of several ookinete surface
           protein (Pvs28) from several species of Plasmodium.
           Pvs25 and Pvs28 are expressed on the surface of
           ookinetes. These proteins are potential candidates for
           vaccine and induce antibodies that block the infectivity
           of Plasmodium vivax in immunised animals.
          Length = 196

 Score = 28.2 bits (63), Expect = 7.4
 Identities = 32/120 (26%), Positives = 45/120 (37%), Gaps = 20/120 (16%)

Query: 104 CQNGGSCVDMLDGFMCQCRPGFVGLQ---CE----ADIDECLSDPCSPEGT---DKCLDL 153
           C+NG   + M + F C+C  G+V      CE     D  E ++  C    T         
Sbjct: 8   CKNG-YLIQMSNHFECKCNEGYVLKNENTCEEKVKCDKLENVNKVCGEYATCINQANKAE 66

Query: 154 DNKFQCECNVGYTGVMCETNIDDCKSNPCLNGGICRDGVAKFTCDCPPGWTGSRCEKDIG 213
           +   +C C  GYT      +   C  N C N  +C  G  K   D P     + C  +IG
Sbjct: 67  EKALKCGCINGYT-----LSQGVCVPNKC-NNKVCGSG--KCIVD-PANPNNTTCSCNIG 117


>gnl|CDD|235057 PRK02628, nadE, NAD synthetase; Reviewed.
          Length = 679

 Score = 28.3 bits (64), Expect = 9.5
 Identities = 7/15 (46%), Positives = 11/15 (73%)

Query: 249 QAHSSGVQVSLFPEL 263
           +A   GV +++FPEL
Sbjct: 40  RAADDGVALAVFPEL 54


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.321    0.139    0.467 

Gapped
Lambda     K      H
   0.267   0.0676    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 20,531,180
Number of extensions: 1896631
Number of successful extensions: 1268
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1253
Number of HSP's successfully gapped: 86
Length of query: 418
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 319
Effective length of database: 6,546,556
Effective search space: 2088351364
Effective search space used: 2088351364
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 60 (27.0 bits)