RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy9685
         (979 letters)



>gnl|CDD|214598 smart00282, LamG, Laminin G domain. 
          Length = 132

 Score = 73.9 bits (182), Expect = 1e-15
 Identities = 38/134 (28%), Positives = 52/134 (38%), Gaps = 19/134 (14%)

Query: 98  DISMFIRTRQLRGAIFYLGGSGDRSSPSSNGAGSEETSYIAAEMEAGELFVRLQFNSTPE 157
            IS   RT    G + Y G  G                Y+A E+  G L +R    S P 
Sbjct: 1   SISFSFRTTSPNGLLLYAGSKGG-------------GDYLALELRDGRLVLRYDLGSGPA 47

Query: 158 SYNVGGVKLADGNNHLIQVVRNVTLVQVKLNGTEYFRKTISSTG---ILDVQ-VLYLGGI 213
                   L DG  H + V RN   V + ++G    R +  S G   IL++   LYLGG+
Sbjct: 48  RLTSDPTPLNDGQWHRVAVERNGRSVTLSVDGGN--RVSGESPGGLTILNLDGPLYLGGL 105

Query: 214 PETVHHHRSIRSPY 227
           PE +       +P 
Sbjct: 106 PEDLKLPPLPVTPG 119



 Score = 58.1 bits (141), Expect = 5e-10
 Identities = 32/132 (24%), Positives = 43/132 (32%), Gaps = 35/132 (26%)

Query: 295 DISMFIRTRQLRGAIFYLGGSGDRSSPSSNGAGSEETSYIAAEMEAGELFVRLQFNSTPE 354
            IS   RT    G + Y G  G                Y+A E+  G L +R    S P 
Sbjct: 1   SISFSFRTTSPNGLLLYAGSKGG-------------GDYLALELRDGRLVLRYDLGSGPA 47

Query: 355 SYNVGGVKLADGNNHLI------------------QTISSTG---ILDVQ-VLYLGGIPE 392
                   L DG  H +                   +  S G   IL++   LYLGG+PE
Sbjct: 48  RLTSDPTPLNDGQWHRVAVERNGRSVTLSVDGGNRVSGESPGGLTILNLDGPLYLGGLPE 107

Query: 393 TVHHHRSIRSPY 404
            +       +P 
Sbjct: 108 DLKLPPLPVTPG 119



 Score = 38.5 bits (90), Expect = 0.003
 Identities = 16/57 (28%), Positives = 20/57 (35%), Gaps = 5/57 (8%)

Query: 467 SSNG-----AGSEETSYIAAEMEAGELFVRLQFNSTPESYNVGGVKLADGNNHLIQV 518
           S NG            Y+A E+  G L +R    S P         L DG  H + V
Sbjct: 10  SPNGLLLYAGSKGGGDYLALELRDGRLVLRYDLGSGPARLTSDPTPLNDGQWHRVAV 66


>gnl|CDD|238058 cd00110, LamG, Laminin G domain; Laminin G-like domains are usually
           Ca++ mediated receptors that can have binding sites for
           steroids, beta1 integrins, heparin, sulfatides,
           fibulin-1, and alpha-dystroglycans. Proteins that
           contain LamG domains serve a variety of purposes
           including signal transduction via cell-surface steroid
           receptors, adhesion, migration and differentiation
           through mediation of cell adhesion molecules.
          Length = 151

 Score = 67.4 bits (165), Expect = 4e-13
 Identities = 35/139 (25%), Positives = 49/139 (35%), Gaps = 16/139 (11%)

Query: 91  RAVRNIVDISMFIRTRQLRGAIFYLGGSGDRSSPSSNGAGSEETSYIAAEMEAGELFVRL 150
            A R  + IS   RT    G + Y G                   ++A E+E G L +R 
Sbjct: 16  PAPRTRLSISFSFRTTSPNGLLLYAGSQNG-------------GDFLALELEDGRLVLRY 62

Query: 151 QFNSTPESYNVGGVKLADGNNHLIQVVRNVTLVQVKLNGTEYFRKT--ISSTGILDVQVL 208
              S     +     L DG  H + V RN   V + ++G           S  +     L
Sbjct: 63  DLGSGSLVLSS-KTPLNDGQWHSVSVERNGRSVTLSVDGERVVESGSPGGSALLNLDGPL 121

Query: 209 YLGGIPETVHHHRSIRSPY 227
           YLGG+PE +       SP 
Sbjct: 122 YLGGLPEDLKSPGLPVSPG 140



 Score = 50.1 bits (120), Expect = 5e-07
 Identities = 30/139 (21%), Positives = 43/139 (30%), Gaps = 36/139 (25%)

Query: 288 RAVRNIVDISMFIRTRQLRGAIFYLGGSGDRSSPSSNGAGSEETSYIAAEMEAGELFVRL 347
            A R  + IS   RT    G + Y G                   ++A E+E G L +R 
Sbjct: 16  PAPRTRLSISFSFRTTSPNGLLLYAGSQNG-------------GDFLALELEDGRLVLRY 62

Query: 348 QFNSTPESYNVGGVKLADGNNHLIQTISSTGILDVQV----------------------L 385
              S     +     L DG  H +    +   + + V                      L
Sbjct: 63  DLGSGSLVLSS-KTPLNDGQWHSVSVERNGRSVTLSVDGERVVESGSPGGSALLNLDGPL 121

Query: 386 YLGGIPETVHHHRSIRSPY 404
           YLGG+PE +       SP 
Sbjct: 122 YLGGLPEDLKSPGLPVSPG 140



 Score = 31.2 bits (71), Expect = 1.4
 Identities = 15/57 (26%), Positives = 21/57 (36%), Gaps = 6/57 (10%)

Query: 467 SSNG-----AGSEETSYIAAEMEAGELFVRLQFNSTPESYNVGGVKLADGNNHLIQV 518
           S NG            ++A E+E G L +R    S     +     L DG  H + V
Sbjct: 32  SPNGLLLYAGSQNGGDFLALELEDGRLVLRYDLGSGSLVLSS-KTPLNDGQWHSVSV 87


>gnl|CDD|215681 pfam00054, Laminin_G_1, Laminin G domain. 
          Length = 131

 Score = 59.3 bits (144), Expect = 2e-10
 Identities = 30/116 (25%), Positives = 47/116 (40%), Gaps = 17/116 (14%)

Query: 103 IRTRQLRGAIFYLGGSGDRSSPSSNGAGSEETSYIAAEMEAGELFVRLQFNSTPESYNVG 162
            RT +  G + Y G   +R              ++A E+  G L V     S   +    
Sbjct: 1   FRTTEPSGLLLYNGTQTERD-------------FLALELRDGRLEVSYDLGSGA-AVVRS 46

Query: 163 GVKLADGNNHLIQVVRNVTLVQVKLNGTEYFRKT--ISSTGILDVQ-VLYLGGIPE 215
           G KL DG  H +++ RN     + ++G         + +T  LDV   LY+GG+P 
Sbjct: 47  GDKLNDGKWHSVELERNGRSGTLSVDGEARVTGESPLGATTDLDVDGPLYVGGLPS 102



 Score = 40.4 bits (95), Expect = 6e-04
 Identities = 24/116 (20%), Positives = 37/116 (31%), Gaps = 37/116 (31%)

Query: 300 IRTRQLRGAIFYLGGSGDRSSPSSNGAGSEETSYIAAEMEAGELFVRLQFNSTPESYNVG 359
            RT +  G + Y G   +R              ++A E+  G L V     S   +    
Sbjct: 1   FRTTEPSGLLLYNGTQTERD-------------FLALELRDGRLEVSYDLGSGA-AVVRS 46

Query: 360 GVKLADGNNHLIQTISSTGILDVQV-----------------------LYLGGIPE 392
           G KL DG  H ++   +     + V                       LY+GG+P 
Sbjct: 47  GDKLNDGKWHSVELERNGRSGTLSVDGEARVTGESPLGATTDLDVDGPLYVGGLPS 102



 Score = 31.1 bits (71), Expect = 1.1
 Identities = 13/49 (26%), Positives = 20/49 (40%), Gaps = 1/49 (2%)

Query: 470 GAGSEETSYIAAEMEAGELFVRLQFNSTPESYNVGGVKLADGNNHLIQV 518
                E  ++A E+  G L V     S   +    G KL DG  H +++
Sbjct: 13  NGTQTERDFLALELRDGRLEVSYDLGSGA-AVVRSGDKLNDGKWHSVEL 60


>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
           large number of membrane-bound and extracellular (mostly
           animal) proteins. Many of these proteins require calcium
           for their biological function and calcium-binding sites
           have been found to be located at the N-terminus of
           particular EGF-like domains; calcium-binding may be
           crucial for numerous protein-protein interactions. Six
           conserved core cysteines form three disulfide bridges as
           in non calcium-binding EGF domains, whose structures are
           very similar. EGF_CA can be found in tandem repeat
           arrangements.
          Length = 38

 Score = 55.0 bits (133), Expect = 6e-10
 Identities = 20/38 (52%), Positives = 24/38 (63%), Gaps = 1/38 (2%)

Query: 601 DIDECVTHN-CQNGARCIDGVARYSCECTPGWEGALCE 637
           DIDEC + N CQNG  C++ V  Y C C PG+ G  CE
Sbjct: 1   DIDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38



 Score = 50.7 bits (122), Expect = 2e-08
 Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 1/37 (2%)

Query: 640 IDECLS-NPCMNGGQCEDRLAGFVCNCSEEYVGERCE 675
           IDEC S NPC NGG C + +  + C+C   Y G  CE
Sbjct: 2   IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38



 Score = 43.8 bits (104), Expect = 5e-06
 Identities = 14/30 (46%), Positives = 19/30 (63%), Gaps = 1/30 (3%)

Query: 824 DINECELSSDMCGNNGECINQPGDYKCACQ 853
           DI+EC  S + C N G C+N  G Y+C+C 
Sbjct: 1   DIDECA-SGNPCQNGGTCVNTVGSYRCSCP 29



 Score = 43.0 bits (102), Expect = 9e-06
 Identities = 19/39 (48%), Positives = 22/39 (56%), Gaps = 4/39 (10%)

Query: 865 DPCKDE-PCQNGGTCHEDCRHQADYKCDCLPGWTGKNCT 902
           D C    PCQNGGTC         Y+C C PG+TG+NC 
Sbjct: 3   DECASGNPCQNGGTCVN---TVGSYRCSCPPGYTGRNCE 38



 Score = 40.3 bits (95), Expect = 7e-05
 Identities = 15/28 (53%), Positives = 22/28 (78%), Gaps = 1/28 (3%)

Query: 722 NIDECASG-PCKNSGQCIDDVNAFICNC 748
           +IDECASG PC+N G C++ V ++ C+C
Sbjct: 1   DIDECASGNPCQNGGTCVNTVGSYRCSC 28



 Score = 36.8 bits (86), Expect = 0.001
 Identities = 16/40 (40%), Positives = 17/40 (42%), Gaps = 7/40 (17%)

Query: 682 CADQ-PCYFGAVCQDTKISPYFPQGPICDCPPGYRGSRCE 720
           CA   PC  G  C +T           C CPPGY G  CE
Sbjct: 5   CASGNPCQNGGTCVNT------VGSYRCSCPPGYTGRNCE 38



 Score = 30.3 bits (69), Expect = 0.27
 Identities = 8/25 (32%), Positives = 12/25 (48%)

Query: 236 GGHCTDLWRDFSCSCVRPFLGHTCQ 260
           GG C +    + CSC   + G  C+
Sbjct: 14  GGTCVNTVGSYRCSCPPGYTGRNCE 38


>gnl|CDD|216930 pfam02210, Laminin_G_2, Laminin G domain.  This family includes the
           Thrombospondin N-terminal-like domain, a Laminin G
           subfamily.
          Length = 124

 Score = 55.9 bits (135), Expect = 3e-09
 Identities = 36/136 (26%), Positives = 52/136 (38%), Gaps = 21/136 (15%)

Query: 103 IRTRQLRGAIFYLGGSGDRSSPSSNGAGSEETSYIAAEMEAGELFVRLQFNSTPESYNVG 162
            RT Q  G + Y GG                  ++A E+E G L +R    S      + 
Sbjct: 1   FRTTQPNGLLLYAGGEDG-------------LDFLALELEDGRLVLRYDLGSGGSVLLLS 47

Query: 163 GVKLADGNNHLIQVVRNVTLVQVKLNG-TEYFRKTISSTGILDVQV-LYLGGIPETVHHH 220
           G KL DG  H + V R+   + + ++G T        S+ IL++   LYLGG+PE     
Sbjct: 48  GKKLNDGQWHRVSVSRDGRSLTLSVDGGTVVSEALPGSSSILNLNGPLYLGGLPE----- 102

Query: 221 RSIRSPYQPSRSYPSG 236
                   P      G
Sbjct: 103 -DSGLSLLPVTEGFVG 117



 Score = 39.4 bits (92), Expect = 0.001
 Identities = 29/127 (22%), Positives = 40/127 (31%), Gaps = 35/127 (27%)

Query: 300 IRTRQLRGAIFYLGGSGDRSSPSSNGAGSEETSYIAAEMEAGELFVRLQFNSTPESYNVG 359
            RT Q  G + Y GG                  ++A E+E G L +R    S      + 
Sbjct: 1   FRTTQPNGLLLYAGGEDG-------------LDFLALELEDGRLVLRYDLGSGGSVLLLS 47

Query: 360 GVKLADG---------------------NNHLIQTISSTGILDVQV-LYLGGIPETVHHH 397
           G KL DG                              S+ IL++   LYLGG+PE     
Sbjct: 48  GKKLNDGQWHRVSVSRDGRSLTLSVDGGTVVSEALPGSSSILNLNGPLYLGGLPEDSGLS 107

Query: 398 RSIRSPY 404
               +  
Sbjct: 108 LLPVTEG 114


>gnl|CDD|214542 smart00179, EGF_CA, Calcium-binding EGF-like domain. 
          Length = 39

 Score = 51.1 bits (123), Expect = 1e-08
 Identities = 21/39 (53%), Positives = 25/39 (64%), Gaps = 2/39 (5%)

Query: 601 DIDECVTHN-CQNGARCIDGVARYSCECTPGWE-GALCE 637
           DIDEC + N CQNG  C++ V  Y CEC PG+  G  CE
Sbjct: 1   DIDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNCE 39



 Score = 46.5 bits (111), Expect = 6e-07
 Identities = 18/39 (46%), Positives = 22/39 (56%), Gaps = 2/39 (5%)

Query: 639 EIDECLS-NPCMNGGQCEDRLAGFVCNCSEEYV-GERCE 675
           +IDEC S NPC NGG C + +  + C C   Y  G  CE
Sbjct: 1   DIDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNCE 39



 Score = 43.4 bits (103), Expect = 6e-06
 Identities = 14/30 (46%), Positives = 18/30 (60%), Gaps = 1/30 (3%)

Query: 824 DINECELSSDMCGNNGECINQPGDYKCACQ 853
           DI+EC  S + C N G C+N  G Y+C C 
Sbjct: 1   DIDECA-SGNPCQNGGTCVNTVGSYRCECP 29



 Score = 38.8 bits (91), Expect = 3e-04
 Identities = 14/28 (50%), Positives = 20/28 (71%), Gaps = 1/28 (3%)

Query: 722 NIDECAS-GPCKNSGQCIDDVNAFICNC 748
           +IDECAS  PC+N G C++ V ++ C C
Sbjct: 1   DIDECASGNPCQNGGTCVNTVGSYRCEC 28



 Score = 35.3 bits (82), Expect = 0.005
 Identities = 19/40 (47%), Positives = 23/40 (57%), Gaps = 5/40 (12%)

Query: 865 DPCKDE-PCQNGGTCHEDCRHQADYKCDCLPGWT-GKNCT 902
           D C    PCQNGGTC         Y+C+C PG+T G+NC 
Sbjct: 3   DECASGNPCQNGGTCV---NTVGSYRCECPPGYTDGRNCE 39


>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
           growth factor (EGF) presents in a large number of
           proteins, mostly animal; the list of proteins currently
           known to contain one or more copies of an EGF-like
           pattern is large and varied; the functional significance
           of EGF-like domains in what appear to be unrelated
           proteins is not yet clear; a common feature is that
           these repeats are found in the extracellular domain of
           membrane-bound proteins or in proteins known to be
           secreted (exception: prostaglandin G/H synthase); the
           domain includes six cysteine residues which have been
           shown to be involved in disulfide bonds; the main
           structure is a two-stranded beta-sheet followed by a
           loop to a C-terminal short two-stranded sheet;
           Subdomains between the conserved cysteines vary in
           length; the region between the 5th and 6th cysteine
           contains two conserved glycines of which at  least  one 
           is  present  in  most EGF-like domains; a subset of
           these bind calcium.
          Length = 36

 Score = 40.2 bits (94), Expect = 9e-05
 Identities = 15/36 (41%), Positives = 18/36 (50%), Gaps = 2/36 (5%)

Query: 604 ECVTHN-CQNGARCIDGVARYSCECTPGWEGAL-CE 637
           EC   N C NG  C++    Y C C PG+ G   CE
Sbjct: 1   ECAASNPCSNGGTCVNTPGSYRCVCPPGYTGDRSCE 36



 Score = 37.5 bits (87), Expect = 9e-04
 Identities = 16/36 (44%), Positives = 19/36 (52%), Gaps = 2/36 (5%)

Query: 642 EC-LSNPCMNGGQCEDRLAGFVCNCSEEYVGE-RCE 675
           EC  SNPC NGG C +    + C C   Y G+  CE
Sbjct: 1   ECAASNPCSNGGTCVNTPGSYRCVCPPGYTGDRSCE 36



 Score = 35.5 bits (82), Expect = 0.004
 Identities = 15/33 (45%), Positives = 19/33 (57%), Gaps = 4/33 (12%)

Query: 871 PCQNGGTCHEDCRHQADYKCDCLPGWTG-KNCT 902
           PC NGGTC         Y+C C PG+TG ++C 
Sbjct: 7   PCSNGGTCVNTP---GSYRCVCPPGYTGDRSCE 36



 Score = 34.8 bits (80), Expect = 0.008
 Identities = 13/27 (48%), Positives = 16/27 (59%), Gaps = 1/27 (3%)

Query: 827 ECELSSDMCGNNGECINQPGDYKCACQ 853
           EC  S+  C N G C+N PG Y+C C 
Sbjct: 1   ECAASNP-CSNGGTCVNTPGSYRCVCP 26



 Score = 32.1 bits (73), Expect = 0.080
 Identities = 17/41 (41%), Positives = 18/41 (43%), Gaps = 8/41 (19%)

Query: 682 CADQ-PCYFGAVCQDTKISPYFPQGPICDCPPGYRGS-RCE 720
           CA   PC  G  C +T      P    C CPPGY G   CE
Sbjct: 2   CAASNPCSNGGTCVNT------PGSYRCVCPPGYTGDRSCE 36



 Score = 31.7 bits (72), Expect = 0.11
 Identities = 11/25 (44%), Positives = 15/25 (60%), Gaps = 1/25 (4%)

Query: 725 ECA-SGPCKNSGQCIDDVNAFICNC 748
           ECA S PC N G C++   ++ C C
Sbjct: 1   ECAASNPCSNGGTCVNTPGSYRCVC 25


>gnl|CDD|215652 pfam00008, EGF, EGF-like domain.  There is no clear separation
           between noise and signal. pfam00053 is very similar, but
           has 8 instead of 6 conserved cysteines. Includes some
           cytokine receptors. The EGF domain misses the N-terminus
           regions of the Ca2+ binding EGF domains (this is the
           main reason of discrepancy between swiss-prot domain
           start/end and Pfam). The family is hard to model due to
           many similar but different sub-types of EGF domains.
           Pfam certainly misses a number of EGF domains.
          Length = 32

 Score = 37.8 bits (88), Expect = 5e-04
 Identities = 11/27 (40%), Positives = 15/27 (55%)

Query: 607 THNCQNGARCIDGVARYSCECTPGWEG 633
            + C NG  C+D    Y+CEC  G+ G
Sbjct: 4   NNPCSNGGTCVDTPGGYTCECPEGYTG 30



 Score = 37.0 bits (86), Expect = 0.001
 Identities = 14/33 (42%), Positives = 18/33 (54%), Gaps = 3/33 (9%)

Query: 868 KDEPCQNGGTCHEDCRHQADYKCDCLPGWTGKN 900
            + PC NGGTC +       Y C+C  G+TGK 
Sbjct: 3   PNNPCSNGGTCVD---TPGGYTCECPEGYTGKR 32



 Score = 34.7 bits (80), Expect = 0.008
 Identities = 15/28 (53%), Positives = 17/28 (60%)

Query: 646 NPCMNGGQCEDRLAGFVCNCSEEYVGER 673
           NPC NGG C D   G+ C C E Y G+R
Sbjct: 5   NPCSNGGTCVDTPGGYTCECPEGYTGKR 32



 Score = 28.6 bits (64), Expect = 1.00
 Identities = 9/19 (47%), Positives = 11/19 (57%)

Query: 835 CGNNGECINQPGDYKCACQ 853
           C N G C++ PG Y C C 
Sbjct: 7   CSNGGTCVDTPGGYTCECP 25



 Score = 28.6 bits (64), Expect = 1.2
 Identities = 10/25 (40%), Positives = 12/25 (48%)

Query: 730 PCKNSGQCIDDVNAFICNCTNTATG 754
           PC N G C+D    + C C    TG
Sbjct: 6   PCSNGGTCVDTPGGYTCECPEGYTG 30



 Score = 27.8 bits (62), Expect = 2.1
 Identities = 16/38 (42%), Positives = 18/38 (47%), Gaps = 7/38 (18%)

Query: 682 CADQ-PCYFGAVCQDTKISPYFPQGPICDCPPGYRGSR 718
           C+   PC  G  C DT      P G  C+CP GY G R
Sbjct: 1   CSPNNPCSNGGTCVDT------PGGYTCECPEGYTGKR 32


>gnl|CDD|219496 pfam07645, EGF_CA, Calcium-binding EGF domain. 
          Length = 42

 Score = 38.1 bits (89), Expect = 6e-04
 Identities = 13/34 (38%), Positives = 19/34 (55%), Gaps = 2/34 (5%)

Query: 601 DIDECV--THNCQNGARCIDGVARYSCECTPGWE 632
           D+DEC   THNC     C++ +  + C C  G+E
Sbjct: 1   DVDECADGTHNCPANTVCVNTIGSFECVCPDGYE 34



 Score = 37.3 bits (87), Expect = 0.001
 Identities = 10/29 (34%), Positives = 16/29 (55%)

Query: 824 DINECELSSDMCGNNGECINQPGDYKCAC 852
           D++EC   +  C  N  C+N  G ++C C
Sbjct: 1   DVDECADGTHNCPANTVCVNTIGSFECVC 29


>gnl|CDD|214544 smart00181, EGF, Epidermal growth factor-like domain. 
          Length = 35

 Score = 32.5 bits (74), Expect = 0.056
 Identities = 13/31 (41%), Positives = 17/31 (54%), Gaps = 2/31 (6%)

Query: 604 ECVTHN-CQNGARCIDGVARYSCECTPGWEG 633
           EC +   C NG  CI+    Y+C C PG+ G
Sbjct: 1   ECASGGPCSNG-TCINTPGSYTCSCPPGYTG 30



 Score = 29.0 bits (65), Expect = 0.90
 Identities = 15/36 (41%), Positives = 19/36 (52%), Gaps = 3/36 (8%)

Query: 642 ECLS-NPCMNGGQCEDRLAGFVCNCSEEYVG-ERCE 675
           EC S  PC NG  C +    + C+C   Y G +RCE
Sbjct: 1   ECASGGPCSNG-TCINTPGSYTCSCPPGYTGDKRCE 35



 Score = 28.6 bits (64), Expect = 1.2
 Identities = 14/34 (41%), Positives = 17/34 (50%), Gaps = 3/34 (8%)

Query: 871 PCQNGGTCHED-CRHQAD-YKCDCLPGWTG-KNC 901
            C +GG C    C +    Y C C PG+TG K C
Sbjct: 1   ECASGGPCSNGTCINTPGSYTCSCPPGYTGDKRC 34



 Score = 27.9 bits (62), Expect = 1.8
 Identities = 13/27 (48%), Positives = 14/27 (51%), Gaps = 2/27 (7%)

Query: 827 ECELSSDMCGNNGECINQPGDYKCACQ 853
           EC      C N G CIN PG Y C+C 
Sbjct: 1   ECASGGP-CSN-GTCINTPGSYTCSCP 25


>gnl|CDD|204999 pfam12661, hEGF, Human growth factor-like EGF.  hEGF, or human
           growth factor-like EGF, domains have six conserved
           residues disulfide-bonded into the characteristic
           'ababcc' pattern. They are involved in growth and
           proliferation of cells, in proteins of the Notch/Delta
           pathway, neurogulin and selectins. hEGFs are also found
           in mosaic proteins with four-disulfide laminin EGFs such
           as aggrecan and perlecan. The core fold of the EGF
           domain consists of two small beta-hairpins packed
           against each other. Two major structural variants have
           been identified based on the structural context of the
           C-terminal Cys residue of disulfide 'c' in the
           C-terminal hairpin: hEGFs and cEGFs. In hEGFs the
           C-terminal thiol resides in the beta-turn, resulting in
           shorter loop-lengths between the Cys residues of
           disulfide 'c', typically C[8-9]XC. These shorter
           loop-lengths are also typical of the four-disulfide EGF
           domains, laminin ad integrin. Tandem hEGF domains have
           six linking residues between terminal cysteines of
           adjacent domains. hEGF domains may or may not bind
           calcium in the linker region. hEGF domains with the
           consensus motif CXD4X[F,Y]XCXC are hydroxylated
           exclusively in the Asp residue.
          Length = 13

 Score = 30.4 bits (70), Expect = 0.23
 Identities = 8/13 (61%), Positives = 9/13 (69%)

Query: 889 KCDCLPGWTGKNC 901
           KC C PG+TG  C
Sbjct: 1   KCQCPPGYTGPRC 13



 Score = 27.7 bits (63), Expect = 1.9
 Identities = 9/13 (69%), Positives = 9/13 (69%)

Query: 707 ICDCPPGYRGSRC 719
            C CPPGY G RC
Sbjct: 1   KCQCPPGYTGPRC 13



 Score = 25.8 bits (58), Expect = 9.0
 Identities = 6/13 (46%), Positives = 8/13 (61%)

Query: 624 SCECTPGWEGALC 636
            C+C PG+ G  C
Sbjct: 1   KCQCPPGYTGPRC 13


>gnl|CDD|219677 pfam07974, EGF_2, EGF-like domain.  This family contains EGF
           domains found in a variety of extracellular proteins.
          Length = 31

 Score = 29.7 bits (67), Expect = 0.38
 Identities = 10/33 (30%), Positives = 11/33 (33%), Gaps = 5/33 (15%)

Query: 869 DEPCQNGGTCHEDCRHQADYKCDCLPGWTGKNC 901
              C   GTC          KC C  G+ G  C
Sbjct: 4   SGICNGRGTC--VRP---CGKCVCDSGYQGATC 31


>gnl|CDD|153064 cd03594, CLECT_REG-1_like, C-type lectin-like domain (CTLD) of the
           type found in Human REG-1 (lithostathine), REG-4, and
           avian eggshell-specific proteins: ansocalcin,
           structhiocalcin-1(SCA-1), and -2(SCA-2).
           CLECT_REG-1_like: C-type lectin-like domain (CTLD) of
           the type found in Human REG-1 (lithostathine), REG-4,
           and avian eggshell-specific proteins: ansocalcin,
           structhiocalcin-1(SCA-1), and -2(SCA-2).  CTLD refers to
           a domain homologous to the carbohydrate-recognition
           domains (CRDs) of the C-type lectins.  REG-1 is a
           proliferating factor which participates in various kinds
           of tissue regeneration including pancreatic beta-cell
           regeneration, regeneration of intestinal mucosa,
           regeneration of motor neurons, and perhaps in tissue
           regeneration of damaged heart.  REG-1 may play a role on
           the pathophysiology of Alzheimer's disease and in the
           development of gastric cancers.  Its expression is
           correlated with reduced survival from early-stage
           colorectal cancer.  REG-1 also binds and aggregates
           several bacterial strains from the intestinal flora and
           it has been suggested that it is involved in the control
           of the intestinal bacterial ecosystem.  Rat
           lithostathine has calcium carbonate crystal inhibitor
           activity in vitro.  REG-IV is unregulated in pancreatic,
           gastric, hepatocellular, and prostrate adenocarcinomas. 
           REG-IV activates the EGF receptor/Akt/AP-1 signaling
           pathway in colorectal carcinoma.  Ansocalcin, SCA-1 and
           -2 are found at high concentration in the calcified egg
           shell layer of goose and ostrich, respectively and tend
           to form aggregates.  Ansocalcin nucleates calcite
           crystal aggregates in vitro.
          Length = 129

 Score = 31.6 bits (72), Expect = 0.67
 Identities = 12/38 (31%), Positives = 17/38 (44%), Gaps = 11/38 (28%)

Query: 232 SYPSGGHCTDL--------WRDFSCSCVRPFLGHTCQY 261
            Y  GG+C +L        W D +C    PF+   C+Y
Sbjct: 95  PYARGGYCAELSRSTGFLKWNDANCEERNPFI---CKY 129


>gnl|CDD|221695 pfam12662, cEGF, Complement Clr-like EGF-like.  cEGF, or complement
           Clr-like EGF, domains have six conserved cysteine
           residues disulfide-bonded into the characteristic
           pattern 'ababcc'. They are found in blood coagulation
           proteins such as fibrillin, Clr and Cls, thrombomodulin,
           and the LDL receptor. The core fold of the EGF domain
           consists of two small beta-hairpins packed against each
           other. Two major structural variants have been
           identified based on the structural context of the
           C-terminal cysteine residue of disulfide 'c' in the
           C-terminal hairpin: hEGFs and cEGFs. In cEGFs the
           C-terminal thiol resides on the C-terminal beta-sheet,
           resulting in long loop-lengths between the cysteine
           residues of disulfide 'c', typically C[10+]XC. These
           longer loop-lengths may have arisen by selective
           cysteine loss from a four-disulfide EGF template such as
           laminin or integrin. Tandem cEGF domains have five
           linking residues between terminal cysteines of adjacent
           domains. cEGF domains may or may not bind calcium in the
           linker region. cEGF domains with the consensus motif
           CXN4X[F,Y]XCXC are hydroxylated exclusively on the
           asparagine residue.
          Length = 24

 Score = 29.0 bits (66), Expect = 0.70
 Identities = 13/25 (52%), Positives = 13/25 (52%), Gaps = 5/25 (20%)

Query: 584 SYACKCPAGYS----SETCAVDIDE 604
           SY C CP GY       TC  DIDE
Sbjct: 1   SYTCSCPPGYQLSGDGRTC-EDIDE 24


>gnl|CDD|153068 cd03598, CLECT_EMBP_like, C-type lectin-like domain (CTLD) of the
           type found in the human proteins, eosinophil major basic
           protein (EMBP) and prepro major basic protein homolog
           (MBPH).  CLECT_EMBP_like: C-type lectin-like domain
           (CTLD) of the type found in the human proteins,
           eosinophil major basic protein (EMBP) and prepro major
           basic protein homolog (MBPH).  CTLD refers to a domain
           homologous to the carbohydrate-recognition domains
           (CRDs) of the C-type lectins.  Eosinophils and basophils
           carry out various functions in allergic, parasitic, and
           inflammatory diseases.  EMBP is stored in eosinophil
           crystalloid granules and is released upon degranulation.
            EMBP is also expressed in basophils.  The proform of
           EMBP is expressed in placental X cells and breast tissue
           and increases significantly during human pregnancy.
           EMBP has cytotoxic properties and damages bacteria and
           mammalian cells, in vitro, as well as, helminth
           parasites.  EMBP deposition has been observed in the
           inflamed tissue of allergy patients in a variety of
           diseases including asthma, atopic dermatitis, and
           rhinitis. In addition to its cytotoxic functions, EMBP
           activates cells and stimulates cytokine production.
           EMBP has been shown to bind the proteoglycan heparin.
           The binding site is similar to the carbohydrate binding
           site of other classical CTLD, such as mannose-binding
           protein (MBP1), however, heparin binding to EMBP is
           calcium ion independent.  MBPH has reduced potency in
           cytotoxic and cytostimulatory assays compared with EMBP.
          Length = 117

 Score = 30.9 bits (70), Expect = 0.92
 Identities = 13/43 (30%), Positives = 18/43 (41%), Gaps = 9/43 (20%)

Query: 225 SPYQPSRSYPSGGHCTDL------WRDFSCSCVRPFLGHTCQY 261
           + + P +     GHC +L      WR   C   RPF+   C Y
Sbjct: 78  AYWAPGQPGNRRGHCVELCTRGGHWRRAHCKLRRPFI---CSY 117


>gnl|CDD|216090 pfam00733, Asn_synthase, Asparagine synthase.  This family is
           always found associated with pfam00310. Members of this
           family catalyze the conversion of aspartate to
           asparagine.
          Length = 195

 Score = 31.8 bits (73), Expect = 1.2
 Identities = 12/30 (40%), Positives = 16/30 (53%)

Query: 924 LAILLLGGLISMAVLCLMARQKRGRRGTYS 953
           + +LL GGL S  +  L ARQ      T+S
Sbjct: 20  VGVLLSGGLDSSLIAALAARQSDPPLKTFS 49


>gnl|CDD|233184 TIGR00917, 2A060601, Niemann-Pick C type protein family.  The model
           describes Niemann-Pick C type protein in eukaryotes. The
           defective protein has been associated with Niemann-Pick
           disease which is described in humans as autosomal
           recessive lipidosis. It is characterized by the
           lysosomal accumulation of unestrified cholesterol. It is
           an integral membrane protein, which indicates that this
           protein is most likely involved in cholesterol transport
           or acts as some component of cholesterol homeostasis
           [Transport and binding proteins, Other].
          Length = 1204

 Score = 32.5 bits (74), Expect = 1.5
 Identities = 26/158 (16%), Positives = 46/158 (29%), Gaps = 25/158 (15%)

Query: 820 MCEIDINECELSSDMCGNNGECINQPGDYKCACQFDTCGYLCNFPD------PCKDE--P 871
           +C    N C   ++  G      N    +        C               C D    
Sbjct: 147 LCGGAANFCN-WTNWIGQMAGVNNPGAPFGITFLPTPCPVSSGMRPMNVSTYSCGDAVLG 205

Query: 872 CQNGGTCHEDCRHQADYKCDCLPGWTGKNCTEVPE-YLPSRVVDLALLIIGPILAILLLG 930
           C  G +C +         CDC          + P    PS  + L +     I+AIL + 
Sbjct: 206 CSCGCSCSD---------CDCSCKA------KPPTPPKPSCSIILGVDAYDVIMAILYIV 250

Query: 931 GLISMAVLCLMARQKRGRRGTYSPSSQEYCNPRVEMNN 968
            ++      L+   +  ++ ++  +  E        NN
Sbjct: 251 LVLVFLGGGLLFPVRGKKKTSFMGTLSEASGEINSSNN 288


>gnl|CDD|236049 PRK07562, PRK07562, ribonucleotide-diphosphate reductase subunit
            alpha; Validated.
          Length = 1220

 Score = 32.2 bits (74), Expect = 1.9
 Identities = 18/49 (36%), Positives = 19/49 (38%), Gaps = 26/49 (53%)

Query: 815  GYTGEMCEIDINECELSSDMCGN-----NGECINQPGDYKCACQFDTCG 858
            GYTGE C             CGN     NG C+      KC    DTCG
Sbjct: 1187 GYTGEACS-----------ECGNFTLVRNGTCL------KC----DTCG 1214


>gnl|CDD|212128 cd11663, GH119_BcIgtZ-like, putative catalytic domain of glycoside
           hydrolase family 119 (GH119).  The prokaryotic subgroup
           is represented by IgtZ, an alpha-amylase from a Bacillus
           circulans strain. The GH119 family is related to GH57, a
           chiefly prokaryotic family with the majority of
           thermostable enzymes coming from extremophiles (many of
           these are archaeal hyperthermophiles), which exhibit the
           enzyme specificities of alpha-amylase (EC 3.2.1.1),
           4-alpha-glucanotransferase (EC 2.4.1.25),
           amylopullulanase (EC 3.2.1.1/41), and
           alpha-galactosidase (EC 3.2.1.22). GH57s cleave
           alpha-glycosidic bonds by employing a retaining
           mechanism, which involves a glycosyl-enzyme
           intermediate, allowing transglycosylation.
          Length = 363

 Score = 31.1 bits (70), Expect = 3.1
 Identities = 25/95 (26%), Positives = 38/95 (40%), Gaps = 13/95 (13%)

Query: 395 HHHRSIRSPYQPSRSYPRWATYDEMENYTTAR---------RRYSRQTAGEKQFSPLPNF 445
           +  + +R+ Y P  S P W  Y +      +R           +     G+     L  F
Sbjct: 212 NEQQVVRNKY-PFASTPHWVRYVDPATGAESRVVGVPVAQAESWEEGYQGQVTADALKPF 270

Query: 446 KGIIHDVQIYFISGSGDRSSPSSNGAGSEETSYIA 480
           +G++   Q + I+  GD    SS  AGSEET   A
Sbjct: 271 EGLVPQKQFFVIAHDGDN---SSGRAGSEETWRNA 302


>gnl|CDD|205157 pfam12947, EGF_3, EGF domain.  This family includes a variety of
           EGF-like domain homologues. This family includes the
           C-terminal domain of the malaria parasite MSP1 protein.
          Length = 36

 Score = 27.1 bits (61), Expect = 3.3
 Identities = 8/26 (30%), Positives = 12/26 (46%)

Query: 828 CELSSDMCGNNGECINQPGDYKCACQ 853
           C  ++  C  N  C N  G + C C+
Sbjct: 1   CAENNGGCHPNATCTNTGGSFTCTCK 26


>gnl|CDD|238949 cd01991, Asn_Synthase_B_C, The C-terminal domain of Asparagine
           Synthase B. This domain is always found associated
           n-terminal amidotransferase domain. Family members that
           contain this domain catalyse the conversion of aspartate
           to asparagine. Asparagine synthetase B  catalyzes the
           assembly of asparagine from aspartate, Mg(2+)ATP, and
           glutamine. The three-dimensional architecture of the
           N-terminal domain of asparagine synthetase B is similar
           to that observed for glutamine
           phosphoribosylpyrophosphate amidotransferase while the
           molecular motif of the C-domain is reminiscent to that
           observed for GMP synthetase .
          Length = 269

 Score = 30.5 bits (69), Expect = 3.8
 Identities = 12/30 (40%), Positives = 15/30 (50%)

Query: 924 LAILLLGGLISMAVLCLMARQKRGRRGTYS 953
           + +LL GGL S  V  L AR       T+S
Sbjct: 18  VGVLLSGGLDSSLVAALAARLLPEPVKTFS 47


>gnl|CDD|212095 cd00451, GH38N_AMII_euk, N-terminal catalytic domain of eukaryotic
           class II alpha-mannosidases; glycoside hydrolase family
           38 (GH38).  The family corresponds to a group of
           eukaryotic class II alpha-mannosidases (AlphaMII), which
           contain Golgi alpha-mannosidases II (GMII), the major
           broad specificity lysosomal alpha-mannosidases (LAM,
           MAN2B1), the noval core-specific lysosomal alpha
           1,6-mannosidases (Epman, MAN2B2), and similar proteins.
           GMII catalyzes the hydrolysis of the terminal both
           alpha-1,3-linked and alpha-1,6-linked mannoses from the
           high-mannose oligosaccharide GlcNAc(Man)5(GlcNAc)2 to
           yield GlcNAc(Man)3(GlcNAc)2 (GlcNAc,
           N-acetylglucosmine), which is the committed step of
           complex N-glycan synthesis. LAM is a broad specificity
           exoglycosidase hydrolyzing all known alpha 1,2-, alpha
           1,3-, and alpha 1,6-mannosidic linkages from numerous
           high mannose type oligosaccharides. Different from LAM,
           Epman can efficiently cleave only the alpha 1,6-linked
           mannose residue from (Man)3GlcNAc, but not
           (Man)3(GlcNAc)2 or other larger high mannose
           oligosaccharides, in the core of N-linked glycans.
           Members in this family are retaining glycosyl hydrolases
           of family GH38 that employs a two-step mechanism
           involving the formation of a covalent glycosyl enzyme
           complex.  Two carboxylic acids positioned within the
           active site act in concert: one as a catalytic
           nucleophile and the other as a general acid/base
           catalyst.
          Length = 258

 Score = 30.3 bits (69), Expect = 4.6
 Identities = 16/78 (20%), Positives = 27/78 (34%), Gaps = 16/78 (20%)

Query: 426 RRRYSRQTAGEKQFSPLPNFKGIIHDVQIYFISG---SGDRSSPSSNGAGSEETSYIAAE 482
            R +  Q    KQ      FK ++ + Q+ F+ G     D +  +           I  +
Sbjct: 55  ERWWEDQGNDTKQ-----QFKKLVKNGQLEFVGGGWVMNDEACTTYES--------IIDQ 101

Query: 483 MEAGELFVRLQFNSTPES 500
           M  G  F++  F   P  
Sbjct: 102 MTEGHQFLKDTFGVRPRV 119


>gnl|CDD|153095 cd04706, PLA2_plant, PLA2_plant: Plant-specific sub-family of
           Phospholipase A2, a super-family of secretory and
           cytosolic enzymes; the latter are either Ca dependent or
           Ca independent. Enzymatically active PLA2 cleaves the
           sn-2 position of the glycerol backbone of phospholipids;
           secreted PLA2s have also been found to specifically bind
           to a variety of soluble and membrane proteins in
           mammals, including receptors. As a toxin, PLA2 is a
           potent presynaptic neurotoxin which blocks nerve
           terminals by binding to the nerve membrane and
           hydrolyzing stable membrane lipids. The products of the
           hydrolysis cannot form bilayers leading to a change in
           membrane conformation and ultimately to a block in the
           release of neurotransmitters. PLA2 may form dimers or
           oligomers. This sub-family does not appear to have a
           conserved active site and metal-binding loop.
          Length = 117

 Score = 28.9 bits (65), Expect = 5.0
 Identities = 21/87 (24%), Positives = 34/87 (39%), Gaps = 7/87 (8%)

Query: 622 RYSCECTPGWEGALCEKEIDECLSNPCMNGGQCEDRL-AGFV-CNCSEEYVGERCESLRQ 679
           RY   C PG+ G   E+  D+ L   CM    C       ++   C+E++       +R+
Sbjct: 23  RYGKYCGPGYSGCPGERPCDD-LDACCMTHDACVQAKKNDYLSLECNEKFK----NCVRR 77

Query: 680 ISCADQPCYFGAVCQDTKISPYFPQGP 706
              A +P + G  C  T + P      
Sbjct: 78  FRKARKPTFEGNKCIVTFVIPVITVVM 104


>gnl|CDD|180947 PRK07366, PRK07366, succinyldiaminopimelate transaminase;
           Validated.
          Length = 388

 Score = 30.4 bits (69), Expect = 5.4
 Identities = 27/78 (34%), Positives = 37/78 (47%), Gaps = 12/78 (15%)

Query: 148 VRLQFNSTPESYNVGG--VKLADGNNHLIQVVRNVTLVQVKLNGTEYFRKTISSTGILDV 205
           V ++F +  +SYN+GG  +  A GN  LIQ +R V  V V  N  +Y        GIL+ 
Sbjct: 230 VSIEFFTLSKSYNMGGFRIGFAIGNAQLIQALRQVKAV-VDFN--QY-------RGILNG 279

Query: 206 QVLYLGGIPETVHHHRSI 223
            +  L G   TV     I
Sbjct: 280 AIAALTGPQATVQQTVQI 297


>gnl|CDD|133156 cd05489, xylanase_inhibitor_I_like, TAXI-I inhibits degradation of
           xylan in the cell wall.  Xylanase inhibitor-I (TAXI-I)
           is a member of potent TAXI-type inhibitors of fungal and
           bacterial family 11 xylanases. Plants developed a
           diverse battery of defense mechanisms in response to
           continual challenges by a broad spectrum of pathogenic
           microorganisms. Their defense arsenal includes
           inhibitors of cell wall-degrading enzymes, which hinder
           a possible invasion and colonization by antagonists.
           Xylanases of fungal and bacterial pathogens are the key
           enzymes in the degradation of xylan in the cell wall.
           Plants secrete proteins that inhibit these degradation
           glycosidases, including xylanase. Surprisingly, TAXI-I
           displays structural homology with the pepsin-like family
           of aspartic proteases but is proteolytically
           nonfunctional, because one or more residues of the
           essential catalytic triad are absent. The structure of
           the TAXI-inhibitor, Aspergillus niger xylanase I
           complex, illustrates the ability of tight binding and
           inhibition with subnanomolar affinity and indicates the
           importance of the C-terminal end for the differences in
           xylanase specificity among different TAXI-type
           inhibitors. This family also contains pepsin-like
           aspartic proteinases homologous to TAXI-I. Unlike
           TAXI-I, they have active site aspartates and are
           functionally active. This family of aspartate proteases
           is classified by MEROPS as the peptidase family A1
           (pepsin A, clan AA).
          Length = 362

 Score = 29.2 bits (66), Expect = 9.9
 Identities = 13/43 (30%), Positives = 19/43 (44%), Gaps = 2/43 (4%)

Query: 29  TIYPINPITQFEQPTSLSQSQTSFPTT--TNFTFVTSPDFTAA 69
           T +P NP+T       L+Q   S  TT  +N   V   +F  +
Sbjct: 69  TAHPYNPVTGECATGDLTQDVLSANTTDGSNPLLVVIFNFVFS 111


>gnl|CDD|216331 pfam01150, GDA1_CD39, GDA1/CD39 (nucleoside phosphatase) family. 
          Length = 421

 Score = 29.7 bits (67), Expect = 10.0
 Identities = 17/76 (22%), Positives = 27/76 (35%), Gaps = 26/76 (34%)

Query: 635 LCEKEIDECLSNPCMNGGQCEDRLAGF--VCNCSEEYV----------GERC-ESLRQIS 681
           L     +  LS+PC+          GF    + SE              E+C +S+R++ 
Sbjct: 225 LISNSSNLILSDPCL--------PPGFNKTVSYSEVEFDVFAIRGTGNYEQCSQSIRELL 276

Query: 682 -----CADQPCYFGAV 692
                C  + C F  V
Sbjct: 277 NKNAVCPYEQCTFNGV 292


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.136    0.427 

Gapped
Lambda     K      H
   0.267   0.0715    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 48,797,016
Number of extensions: 4701939
Number of successful extensions: 3683
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3646
Number of HSP's successfully gapped: 90
Length of query: 979
Length of database: 10,937,602
Length adjustment: 106
Effective length of query: 873
Effective length of database: 6,236,078
Effective search space: 5444096094
Effective search space used: 5444096094
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 64 (28.5 bits)