RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy9685
(979 letters)
>gnl|CDD|214598 smart00282, LamG, Laminin G domain.
Length = 132
Score = 73.9 bits (182), Expect = 1e-15
Identities = 38/134 (28%), Positives = 52/134 (38%), Gaps = 19/134 (14%)
Query: 98 DISMFIRTRQLRGAIFYLGGSGDRSSPSSNGAGSEETSYIAAEMEAGELFVRLQFNSTPE 157
IS RT G + Y G G Y+A E+ G L +R S P
Sbjct: 1 SISFSFRTTSPNGLLLYAGSKGG-------------GDYLALELRDGRLVLRYDLGSGPA 47
Query: 158 SYNVGGVKLADGNNHLIQVVRNVTLVQVKLNGTEYFRKTISSTG---ILDVQ-VLYLGGI 213
L DG H + V RN V + ++G R + S G IL++ LYLGG+
Sbjct: 48 RLTSDPTPLNDGQWHRVAVERNGRSVTLSVDGGN--RVSGESPGGLTILNLDGPLYLGGL 105
Query: 214 PETVHHHRSIRSPY 227
PE + +P
Sbjct: 106 PEDLKLPPLPVTPG 119
Score = 58.1 bits (141), Expect = 5e-10
Identities = 32/132 (24%), Positives = 43/132 (32%), Gaps = 35/132 (26%)
Query: 295 DISMFIRTRQLRGAIFYLGGSGDRSSPSSNGAGSEETSYIAAEMEAGELFVRLQFNSTPE 354
IS RT G + Y G G Y+A E+ G L +R S P
Sbjct: 1 SISFSFRTTSPNGLLLYAGSKGG-------------GDYLALELRDGRLVLRYDLGSGPA 47
Query: 355 SYNVGGVKLADGNNHLI------------------QTISSTG---ILDVQ-VLYLGGIPE 392
L DG H + + S G IL++ LYLGG+PE
Sbjct: 48 RLTSDPTPLNDGQWHRVAVERNGRSVTLSVDGGNRVSGESPGGLTILNLDGPLYLGGLPE 107
Query: 393 TVHHHRSIRSPY 404
+ +P
Sbjct: 108 DLKLPPLPVTPG 119
Score = 38.5 bits (90), Expect = 0.003
Identities = 16/57 (28%), Positives = 20/57 (35%), Gaps = 5/57 (8%)
Query: 467 SSNG-----AGSEETSYIAAEMEAGELFVRLQFNSTPESYNVGGVKLADGNNHLIQV 518
S NG Y+A E+ G L +R S P L DG H + V
Sbjct: 10 SPNGLLLYAGSKGGGDYLALELRDGRLVLRYDLGSGPARLTSDPTPLNDGQWHRVAV 66
>gnl|CDD|238058 cd00110, LamG, Laminin G domain; Laminin G-like domains are usually
Ca++ mediated receptors that can have binding sites for
steroids, beta1 integrins, heparin, sulfatides,
fibulin-1, and alpha-dystroglycans. Proteins that
contain LamG domains serve a variety of purposes
including signal transduction via cell-surface steroid
receptors, adhesion, migration and differentiation
through mediation of cell adhesion molecules.
Length = 151
Score = 67.4 bits (165), Expect = 4e-13
Identities = 35/139 (25%), Positives = 49/139 (35%), Gaps = 16/139 (11%)
Query: 91 RAVRNIVDISMFIRTRQLRGAIFYLGGSGDRSSPSSNGAGSEETSYIAAEMEAGELFVRL 150
A R + IS RT G + Y G ++A E+E G L +R
Sbjct: 16 PAPRTRLSISFSFRTTSPNGLLLYAGSQNG-------------GDFLALELEDGRLVLRY 62
Query: 151 QFNSTPESYNVGGVKLADGNNHLIQVVRNVTLVQVKLNGTEYFRKT--ISSTGILDVQVL 208
S + L DG H + V RN V + ++G S + L
Sbjct: 63 DLGSGSLVLSS-KTPLNDGQWHSVSVERNGRSVTLSVDGERVVESGSPGGSALLNLDGPL 121
Query: 209 YLGGIPETVHHHRSIRSPY 227
YLGG+PE + SP
Sbjct: 122 YLGGLPEDLKSPGLPVSPG 140
Score = 50.1 bits (120), Expect = 5e-07
Identities = 30/139 (21%), Positives = 43/139 (30%), Gaps = 36/139 (25%)
Query: 288 RAVRNIVDISMFIRTRQLRGAIFYLGGSGDRSSPSSNGAGSEETSYIAAEMEAGELFVRL 347
A R + IS RT G + Y G ++A E+E G L +R
Sbjct: 16 PAPRTRLSISFSFRTTSPNGLLLYAGSQNG-------------GDFLALELEDGRLVLRY 62
Query: 348 QFNSTPESYNVGGVKLADGNNHLIQTISSTGILDVQV----------------------L 385
S + L DG H + + + + V L
Sbjct: 63 DLGSGSLVLSS-KTPLNDGQWHSVSVERNGRSVTLSVDGERVVESGSPGGSALLNLDGPL 121
Query: 386 YLGGIPETVHHHRSIRSPY 404
YLGG+PE + SP
Sbjct: 122 YLGGLPEDLKSPGLPVSPG 140
Score = 31.2 bits (71), Expect = 1.4
Identities = 15/57 (26%), Positives = 21/57 (36%), Gaps = 6/57 (10%)
Query: 467 SSNG-----AGSEETSYIAAEMEAGELFVRLQFNSTPESYNVGGVKLADGNNHLIQV 518
S NG ++A E+E G L +R S + L DG H + V
Sbjct: 32 SPNGLLLYAGSQNGGDFLALELEDGRLVLRYDLGSGSLVLSS-KTPLNDGQWHSVSV 87
>gnl|CDD|215681 pfam00054, Laminin_G_1, Laminin G domain.
Length = 131
Score = 59.3 bits (144), Expect = 2e-10
Identities = 30/116 (25%), Positives = 47/116 (40%), Gaps = 17/116 (14%)
Query: 103 IRTRQLRGAIFYLGGSGDRSSPSSNGAGSEETSYIAAEMEAGELFVRLQFNSTPESYNVG 162
RT + G + Y G +R ++A E+ G L V S +
Sbjct: 1 FRTTEPSGLLLYNGTQTERD-------------FLALELRDGRLEVSYDLGSGA-AVVRS 46
Query: 163 GVKLADGNNHLIQVVRNVTLVQVKLNGTEYFRKT--ISSTGILDVQ-VLYLGGIPE 215
G KL DG H +++ RN + ++G + +T LDV LY+GG+P
Sbjct: 47 GDKLNDGKWHSVELERNGRSGTLSVDGEARVTGESPLGATTDLDVDGPLYVGGLPS 102
Score = 40.4 bits (95), Expect = 6e-04
Identities = 24/116 (20%), Positives = 37/116 (31%), Gaps = 37/116 (31%)
Query: 300 IRTRQLRGAIFYLGGSGDRSSPSSNGAGSEETSYIAAEMEAGELFVRLQFNSTPESYNVG 359
RT + G + Y G +R ++A E+ G L V S +
Sbjct: 1 FRTTEPSGLLLYNGTQTERD-------------FLALELRDGRLEVSYDLGSGA-AVVRS 46
Query: 360 GVKLADGNNHLIQTISSTGILDVQV-----------------------LYLGGIPE 392
G KL DG H ++ + + V LY+GG+P
Sbjct: 47 GDKLNDGKWHSVELERNGRSGTLSVDGEARVTGESPLGATTDLDVDGPLYVGGLPS 102
Score = 31.1 bits (71), Expect = 1.1
Identities = 13/49 (26%), Positives = 20/49 (40%), Gaps = 1/49 (2%)
Query: 470 GAGSEETSYIAAEMEAGELFVRLQFNSTPESYNVGGVKLADGNNHLIQV 518
E ++A E+ G L V S + G KL DG H +++
Sbjct: 13 NGTQTERDFLALELRDGRLEVSYDLGSGA-AVVRSGDKLNDGKWHSVEL 60
>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
large number of membrane-bound and extracellular (mostly
animal) proteins. Many of these proteins require calcium
for their biological function and calcium-binding sites
have been found to be located at the N-terminus of
particular EGF-like domains; calcium-binding may be
crucial for numerous protein-protein interactions. Six
conserved core cysteines form three disulfide bridges as
in non calcium-binding EGF domains, whose structures are
very similar. EGF_CA can be found in tandem repeat
arrangements.
Length = 38
Score = 55.0 bits (133), Expect = 6e-10
Identities = 20/38 (52%), Positives = 24/38 (63%), Gaps = 1/38 (2%)
Query: 601 DIDECVTHN-CQNGARCIDGVARYSCECTPGWEGALCE 637
DIDEC + N CQNG C++ V Y C C PG+ G CE
Sbjct: 1 DIDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
Score = 50.7 bits (122), Expect = 2e-08
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 1/37 (2%)
Query: 640 IDECLS-NPCMNGGQCEDRLAGFVCNCSEEYVGERCE 675
IDEC S NPC NGG C + + + C+C Y G CE
Sbjct: 2 IDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
Score = 43.8 bits (104), Expect = 5e-06
Identities = 14/30 (46%), Positives = 19/30 (63%), Gaps = 1/30 (3%)
Query: 824 DINECELSSDMCGNNGECINQPGDYKCACQ 853
DI+EC S + C N G C+N G Y+C+C
Sbjct: 1 DIDECA-SGNPCQNGGTCVNTVGSYRCSCP 29
Score = 43.0 bits (102), Expect = 9e-06
Identities = 19/39 (48%), Positives = 22/39 (56%), Gaps = 4/39 (10%)
Query: 865 DPCKDE-PCQNGGTCHEDCRHQADYKCDCLPGWTGKNCT 902
D C PCQNGGTC Y+C C PG+TG+NC
Sbjct: 3 DECASGNPCQNGGTCVN---TVGSYRCSCPPGYTGRNCE 38
Score = 40.3 bits (95), Expect = 7e-05
Identities = 15/28 (53%), Positives = 22/28 (78%), Gaps = 1/28 (3%)
Query: 722 NIDECASG-PCKNSGQCIDDVNAFICNC 748
+IDECASG PC+N G C++ V ++ C+C
Sbjct: 1 DIDECASGNPCQNGGTCVNTVGSYRCSC 28
Score = 36.8 bits (86), Expect = 0.001
Identities = 16/40 (40%), Positives = 17/40 (42%), Gaps = 7/40 (17%)
Query: 682 CADQ-PCYFGAVCQDTKISPYFPQGPICDCPPGYRGSRCE 720
CA PC G C +T C CPPGY G CE
Sbjct: 5 CASGNPCQNGGTCVNT------VGSYRCSCPPGYTGRNCE 38
Score = 30.3 bits (69), Expect = 0.27
Identities = 8/25 (32%), Positives = 12/25 (48%)
Query: 236 GGHCTDLWRDFSCSCVRPFLGHTCQ 260
GG C + + CSC + G C+
Sbjct: 14 GGTCVNTVGSYRCSCPPGYTGRNCE 38
>gnl|CDD|216930 pfam02210, Laminin_G_2, Laminin G domain. This family includes the
Thrombospondin N-terminal-like domain, a Laminin G
subfamily.
Length = 124
Score = 55.9 bits (135), Expect = 3e-09
Identities = 36/136 (26%), Positives = 52/136 (38%), Gaps = 21/136 (15%)
Query: 103 IRTRQLRGAIFYLGGSGDRSSPSSNGAGSEETSYIAAEMEAGELFVRLQFNSTPESYNVG 162
RT Q G + Y GG ++A E+E G L +R S +
Sbjct: 1 FRTTQPNGLLLYAGGEDG-------------LDFLALELEDGRLVLRYDLGSGGSVLLLS 47
Query: 163 GVKLADGNNHLIQVVRNVTLVQVKLNG-TEYFRKTISSTGILDVQV-LYLGGIPETVHHH 220
G KL DG H + V R+ + + ++G T S+ IL++ LYLGG+PE
Sbjct: 48 GKKLNDGQWHRVSVSRDGRSLTLSVDGGTVVSEALPGSSSILNLNGPLYLGGLPE----- 102
Query: 221 RSIRSPYQPSRSYPSG 236
P G
Sbjct: 103 -DSGLSLLPVTEGFVG 117
Score = 39.4 bits (92), Expect = 0.001
Identities = 29/127 (22%), Positives = 40/127 (31%), Gaps = 35/127 (27%)
Query: 300 IRTRQLRGAIFYLGGSGDRSSPSSNGAGSEETSYIAAEMEAGELFVRLQFNSTPESYNVG 359
RT Q G + Y GG ++A E+E G L +R S +
Sbjct: 1 FRTTQPNGLLLYAGGEDG-------------LDFLALELEDGRLVLRYDLGSGGSVLLLS 47
Query: 360 GVKLADG---------------------NNHLIQTISSTGILDVQV-LYLGGIPETVHHH 397
G KL DG S+ IL++ LYLGG+PE
Sbjct: 48 GKKLNDGQWHRVSVSRDGRSLTLSVDGGTVVSEALPGSSSILNLNGPLYLGGLPEDSGLS 107
Query: 398 RSIRSPY 404
+
Sbjct: 108 LLPVTEG 114
>gnl|CDD|214542 smart00179, EGF_CA, Calcium-binding EGF-like domain.
Length = 39
Score = 51.1 bits (123), Expect = 1e-08
Identities = 21/39 (53%), Positives = 25/39 (64%), Gaps = 2/39 (5%)
Query: 601 DIDECVTHN-CQNGARCIDGVARYSCECTPGWE-GALCE 637
DIDEC + N CQNG C++ V Y CEC PG+ G CE
Sbjct: 1 DIDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNCE 39
Score = 46.5 bits (111), Expect = 6e-07
Identities = 18/39 (46%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 639 EIDECLS-NPCMNGGQCEDRLAGFVCNCSEEYV-GERCE 675
+IDEC S NPC NGG C + + + C C Y G CE
Sbjct: 1 DIDECASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNCE 39
Score = 43.4 bits (103), Expect = 6e-06
Identities = 14/30 (46%), Positives = 18/30 (60%), Gaps = 1/30 (3%)
Query: 824 DINECELSSDMCGNNGECINQPGDYKCACQ 853
DI+EC S + C N G C+N G Y+C C
Sbjct: 1 DIDECA-SGNPCQNGGTCVNTVGSYRCECP 29
Score = 38.8 bits (91), Expect = 3e-04
Identities = 14/28 (50%), Positives = 20/28 (71%), Gaps = 1/28 (3%)
Query: 722 NIDECAS-GPCKNSGQCIDDVNAFICNC 748
+IDECAS PC+N G C++ V ++ C C
Sbjct: 1 DIDECASGNPCQNGGTCVNTVGSYRCEC 28
Score = 35.3 bits (82), Expect = 0.005
Identities = 19/40 (47%), Positives = 23/40 (57%), Gaps = 5/40 (12%)
Query: 865 DPCKDE-PCQNGGTCHEDCRHQADYKCDCLPGWT-GKNCT 902
D C PCQNGGTC Y+C+C PG+T G+NC
Sbjct: 3 DECASGNPCQNGGTCV---NTVGSYRCECPPGYTDGRNCE 39
>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
growth factor (EGF) presents in a large number of
proteins, mostly animal; the list of proteins currently
known to contain one or more copies of an EGF-like
pattern is large and varied; the functional significance
of EGF-like domains in what appear to be unrelated
proteins is not yet clear; a common feature is that
these repeats are found in the extracellular domain of
membrane-bound proteins or in proteins known to be
secreted (exception: prostaglandin G/H synthase); the
domain includes six cysteine residues which have been
shown to be involved in disulfide bonds; the main
structure is a two-stranded beta-sheet followed by a
loop to a C-terminal short two-stranded sheet;
Subdomains between the conserved cysteines vary in
length; the region between the 5th and 6th cysteine
contains two conserved glycines of which at least one
is present in most EGF-like domains; a subset of
these bind calcium.
Length = 36
Score = 40.2 bits (94), Expect = 9e-05
Identities = 15/36 (41%), Positives = 18/36 (50%), Gaps = 2/36 (5%)
Query: 604 ECVTHN-CQNGARCIDGVARYSCECTPGWEGAL-CE 637
EC N C NG C++ Y C C PG+ G CE
Sbjct: 1 ECAASNPCSNGGTCVNTPGSYRCVCPPGYTGDRSCE 36
Score = 37.5 bits (87), Expect = 9e-04
Identities = 16/36 (44%), Positives = 19/36 (52%), Gaps = 2/36 (5%)
Query: 642 EC-LSNPCMNGGQCEDRLAGFVCNCSEEYVGE-RCE 675
EC SNPC NGG C + + C C Y G+ CE
Sbjct: 1 ECAASNPCSNGGTCVNTPGSYRCVCPPGYTGDRSCE 36
Score = 35.5 bits (82), Expect = 0.004
Identities = 15/33 (45%), Positives = 19/33 (57%), Gaps = 4/33 (12%)
Query: 871 PCQNGGTCHEDCRHQADYKCDCLPGWTG-KNCT 902
PC NGGTC Y+C C PG+TG ++C
Sbjct: 7 PCSNGGTCVNTP---GSYRCVCPPGYTGDRSCE 36
Score = 34.8 bits (80), Expect = 0.008
Identities = 13/27 (48%), Positives = 16/27 (59%), Gaps = 1/27 (3%)
Query: 827 ECELSSDMCGNNGECINQPGDYKCACQ 853
EC S+ C N G C+N PG Y+C C
Sbjct: 1 ECAASNP-CSNGGTCVNTPGSYRCVCP 26
Score = 32.1 bits (73), Expect = 0.080
Identities = 17/41 (41%), Positives = 18/41 (43%), Gaps = 8/41 (19%)
Query: 682 CADQ-PCYFGAVCQDTKISPYFPQGPICDCPPGYRGS-RCE 720
CA PC G C +T P C CPPGY G CE
Sbjct: 2 CAASNPCSNGGTCVNT------PGSYRCVCPPGYTGDRSCE 36
Score = 31.7 bits (72), Expect = 0.11
Identities = 11/25 (44%), Positives = 15/25 (60%), Gaps = 1/25 (4%)
Query: 725 ECA-SGPCKNSGQCIDDVNAFICNC 748
ECA S PC N G C++ ++ C C
Sbjct: 1 ECAASNPCSNGGTCVNTPGSYRCVC 25
>gnl|CDD|215652 pfam00008, EGF, EGF-like domain. There is no clear separation
between noise and signal. pfam00053 is very similar, but
has 8 instead of 6 conserved cysteines. Includes some
cytokine receptors. The EGF domain misses the N-terminus
regions of the Ca2+ binding EGF domains (this is the
main reason of discrepancy between swiss-prot domain
start/end and Pfam). The family is hard to model due to
many similar but different sub-types of EGF domains.
Pfam certainly misses a number of EGF domains.
Length = 32
Score = 37.8 bits (88), Expect = 5e-04
Identities = 11/27 (40%), Positives = 15/27 (55%)
Query: 607 THNCQNGARCIDGVARYSCECTPGWEG 633
+ C NG C+D Y+CEC G+ G
Sbjct: 4 NNPCSNGGTCVDTPGGYTCECPEGYTG 30
Score = 37.0 bits (86), Expect = 0.001
Identities = 14/33 (42%), Positives = 18/33 (54%), Gaps = 3/33 (9%)
Query: 868 KDEPCQNGGTCHEDCRHQADYKCDCLPGWTGKN 900
+ PC NGGTC + Y C+C G+TGK
Sbjct: 3 PNNPCSNGGTCVD---TPGGYTCECPEGYTGKR 32
Score = 34.7 bits (80), Expect = 0.008
Identities = 15/28 (53%), Positives = 17/28 (60%)
Query: 646 NPCMNGGQCEDRLAGFVCNCSEEYVGER 673
NPC NGG C D G+ C C E Y G+R
Sbjct: 5 NPCSNGGTCVDTPGGYTCECPEGYTGKR 32
Score = 28.6 bits (64), Expect = 1.00
Identities = 9/19 (47%), Positives = 11/19 (57%)
Query: 835 CGNNGECINQPGDYKCACQ 853
C N G C++ PG Y C C
Sbjct: 7 CSNGGTCVDTPGGYTCECP 25
Score = 28.6 bits (64), Expect = 1.2
Identities = 10/25 (40%), Positives = 12/25 (48%)
Query: 730 PCKNSGQCIDDVNAFICNCTNTATG 754
PC N G C+D + C C TG
Sbjct: 6 PCSNGGTCVDTPGGYTCECPEGYTG 30
Score = 27.8 bits (62), Expect = 2.1
Identities = 16/38 (42%), Positives = 18/38 (47%), Gaps = 7/38 (18%)
Query: 682 CADQ-PCYFGAVCQDTKISPYFPQGPICDCPPGYRGSR 718
C+ PC G C DT P G C+CP GY G R
Sbjct: 1 CSPNNPCSNGGTCVDT------PGGYTCECPEGYTGKR 32
>gnl|CDD|219496 pfam07645, EGF_CA, Calcium-binding EGF domain.
Length = 42
Score = 38.1 bits (89), Expect = 6e-04
Identities = 13/34 (38%), Positives = 19/34 (55%), Gaps = 2/34 (5%)
Query: 601 DIDECV--THNCQNGARCIDGVARYSCECTPGWE 632
D+DEC THNC C++ + + C C G+E
Sbjct: 1 DVDECADGTHNCPANTVCVNTIGSFECVCPDGYE 34
Score = 37.3 bits (87), Expect = 0.001
Identities = 10/29 (34%), Positives = 16/29 (55%)
Query: 824 DINECELSSDMCGNNGECINQPGDYKCAC 852
D++EC + C N C+N G ++C C
Sbjct: 1 DVDECADGTHNCPANTVCVNTIGSFECVC 29
>gnl|CDD|214544 smart00181, EGF, Epidermal growth factor-like domain.
Length = 35
Score = 32.5 bits (74), Expect = 0.056
Identities = 13/31 (41%), Positives = 17/31 (54%), Gaps = 2/31 (6%)
Query: 604 ECVTHN-CQNGARCIDGVARYSCECTPGWEG 633
EC + C NG CI+ Y+C C PG+ G
Sbjct: 1 ECASGGPCSNG-TCINTPGSYTCSCPPGYTG 30
Score = 29.0 bits (65), Expect = 0.90
Identities = 15/36 (41%), Positives = 19/36 (52%), Gaps = 3/36 (8%)
Query: 642 ECLS-NPCMNGGQCEDRLAGFVCNCSEEYVG-ERCE 675
EC S PC NG C + + C+C Y G +RCE
Sbjct: 1 ECASGGPCSNG-TCINTPGSYTCSCPPGYTGDKRCE 35
Score = 28.6 bits (64), Expect = 1.2
Identities = 14/34 (41%), Positives = 17/34 (50%), Gaps = 3/34 (8%)
Query: 871 PCQNGGTCHED-CRHQAD-YKCDCLPGWTG-KNC 901
C +GG C C + Y C C PG+TG K C
Sbjct: 1 ECASGGPCSNGTCINTPGSYTCSCPPGYTGDKRC 34
Score = 27.9 bits (62), Expect = 1.8
Identities = 13/27 (48%), Positives = 14/27 (51%), Gaps = 2/27 (7%)
Query: 827 ECELSSDMCGNNGECINQPGDYKCACQ 853
EC C N G CIN PG Y C+C
Sbjct: 1 ECASGGP-CSN-GTCINTPGSYTCSCP 25
>gnl|CDD|204999 pfam12661, hEGF, Human growth factor-like EGF. hEGF, or human
growth factor-like EGF, domains have six conserved
residues disulfide-bonded into the characteristic
'ababcc' pattern. They are involved in growth and
proliferation of cells, in proteins of the Notch/Delta
pathway, neurogulin and selectins. hEGFs are also found
in mosaic proteins with four-disulfide laminin EGFs such
as aggrecan and perlecan. The core fold of the EGF
domain consists of two small beta-hairpins packed
against each other. Two major structural variants have
been identified based on the structural context of the
C-terminal Cys residue of disulfide 'c' in the
C-terminal hairpin: hEGFs and cEGFs. In hEGFs the
C-terminal thiol resides in the beta-turn, resulting in
shorter loop-lengths between the Cys residues of
disulfide 'c', typically C[8-9]XC. These shorter
loop-lengths are also typical of the four-disulfide EGF
domains, laminin ad integrin. Tandem hEGF domains have
six linking residues between terminal cysteines of
adjacent domains. hEGF domains may or may not bind
calcium in the linker region. hEGF domains with the
consensus motif CXD4X[F,Y]XCXC are hydroxylated
exclusively in the Asp residue.
Length = 13
Score = 30.4 bits (70), Expect = 0.23
Identities = 8/13 (61%), Positives = 9/13 (69%)
Query: 889 KCDCLPGWTGKNC 901
KC C PG+TG C
Sbjct: 1 KCQCPPGYTGPRC 13
Score = 27.7 bits (63), Expect = 1.9
Identities = 9/13 (69%), Positives = 9/13 (69%)
Query: 707 ICDCPPGYRGSRC 719
C CPPGY G RC
Sbjct: 1 KCQCPPGYTGPRC 13
Score = 25.8 bits (58), Expect = 9.0
Identities = 6/13 (46%), Positives = 8/13 (61%)
Query: 624 SCECTPGWEGALC 636
C+C PG+ G C
Sbjct: 1 KCQCPPGYTGPRC 13
>gnl|CDD|219677 pfam07974, EGF_2, EGF-like domain. This family contains EGF
domains found in a variety of extracellular proteins.
Length = 31
Score = 29.7 bits (67), Expect = 0.38
Identities = 10/33 (30%), Positives = 11/33 (33%), Gaps = 5/33 (15%)
Query: 869 DEPCQNGGTCHEDCRHQADYKCDCLPGWTGKNC 901
C GTC KC C G+ G C
Sbjct: 4 SGICNGRGTC--VRP---CGKCVCDSGYQGATC 31
>gnl|CDD|153064 cd03594, CLECT_REG-1_like, C-type lectin-like domain (CTLD) of the
type found in Human REG-1 (lithostathine), REG-4, and
avian eggshell-specific proteins: ansocalcin,
structhiocalcin-1(SCA-1), and -2(SCA-2).
CLECT_REG-1_like: C-type lectin-like domain (CTLD) of
the type found in Human REG-1 (lithostathine), REG-4,
and avian eggshell-specific proteins: ansocalcin,
structhiocalcin-1(SCA-1), and -2(SCA-2). CTLD refers to
a domain homologous to the carbohydrate-recognition
domains (CRDs) of the C-type lectins. REG-1 is a
proliferating factor which participates in various kinds
of tissue regeneration including pancreatic beta-cell
regeneration, regeneration of intestinal mucosa,
regeneration of motor neurons, and perhaps in tissue
regeneration of damaged heart. REG-1 may play a role on
the pathophysiology of Alzheimer's disease and in the
development of gastric cancers. Its expression is
correlated with reduced survival from early-stage
colorectal cancer. REG-1 also binds and aggregates
several bacterial strains from the intestinal flora and
it has been suggested that it is involved in the control
of the intestinal bacterial ecosystem. Rat
lithostathine has calcium carbonate crystal inhibitor
activity in vitro. REG-IV is unregulated in pancreatic,
gastric, hepatocellular, and prostrate adenocarcinomas.
REG-IV activates the EGF receptor/Akt/AP-1 signaling
pathway in colorectal carcinoma. Ansocalcin, SCA-1 and
-2 are found at high concentration in the calcified egg
shell layer of goose and ostrich, respectively and tend
to form aggregates. Ansocalcin nucleates calcite
crystal aggregates in vitro.
Length = 129
Score = 31.6 bits (72), Expect = 0.67
Identities = 12/38 (31%), Positives = 17/38 (44%), Gaps = 11/38 (28%)
Query: 232 SYPSGGHCTDL--------WRDFSCSCVRPFLGHTCQY 261
Y GG+C +L W D +C PF+ C+Y
Sbjct: 95 PYARGGYCAELSRSTGFLKWNDANCEERNPFI---CKY 129
>gnl|CDD|221695 pfam12662, cEGF, Complement Clr-like EGF-like. cEGF, or complement
Clr-like EGF, domains have six conserved cysteine
residues disulfide-bonded into the characteristic
pattern 'ababcc'. They are found in blood coagulation
proteins such as fibrillin, Clr and Cls, thrombomodulin,
and the LDL receptor. The core fold of the EGF domain
consists of two small beta-hairpins packed against each
other. Two major structural variants have been
identified based on the structural context of the
C-terminal cysteine residue of disulfide 'c' in the
C-terminal hairpin: hEGFs and cEGFs. In cEGFs the
C-terminal thiol resides on the C-terminal beta-sheet,
resulting in long loop-lengths between the cysteine
residues of disulfide 'c', typically C[10+]XC. These
longer loop-lengths may have arisen by selective
cysteine loss from a four-disulfide EGF template such as
laminin or integrin. Tandem cEGF domains have five
linking residues between terminal cysteines of adjacent
domains. cEGF domains may or may not bind calcium in the
linker region. cEGF domains with the consensus motif
CXN4X[F,Y]XCXC are hydroxylated exclusively on the
asparagine residue.
Length = 24
Score = 29.0 bits (66), Expect = 0.70
Identities = 13/25 (52%), Positives = 13/25 (52%), Gaps = 5/25 (20%)
Query: 584 SYACKCPAGYS----SETCAVDIDE 604
SY C CP GY TC DIDE
Sbjct: 1 SYTCSCPPGYQLSGDGRTC-EDIDE 24
>gnl|CDD|153068 cd03598, CLECT_EMBP_like, C-type lectin-like domain (CTLD) of the
type found in the human proteins, eosinophil major basic
protein (EMBP) and prepro major basic protein homolog
(MBPH). CLECT_EMBP_like: C-type lectin-like domain
(CTLD) of the type found in the human proteins,
eosinophil major basic protein (EMBP) and prepro major
basic protein homolog (MBPH). CTLD refers to a domain
homologous to the carbohydrate-recognition domains
(CRDs) of the C-type lectins. Eosinophils and basophils
carry out various functions in allergic, parasitic, and
inflammatory diseases. EMBP is stored in eosinophil
crystalloid granules and is released upon degranulation.
EMBP is also expressed in basophils. The proform of
EMBP is expressed in placental X cells and breast tissue
and increases significantly during human pregnancy.
EMBP has cytotoxic properties and damages bacteria and
mammalian cells, in vitro, as well as, helminth
parasites. EMBP deposition has been observed in the
inflamed tissue of allergy patients in a variety of
diseases including asthma, atopic dermatitis, and
rhinitis. In addition to its cytotoxic functions, EMBP
activates cells and stimulates cytokine production.
EMBP has been shown to bind the proteoglycan heparin.
The binding site is similar to the carbohydrate binding
site of other classical CTLD, such as mannose-binding
protein (MBP1), however, heparin binding to EMBP is
calcium ion independent. MBPH has reduced potency in
cytotoxic and cytostimulatory assays compared with EMBP.
Length = 117
Score = 30.9 bits (70), Expect = 0.92
Identities = 13/43 (30%), Positives = 18/43 (41%), Gaps = 9/43 (20%)
Query: 225 SPYQPSRSYPSGGHCTDL------WRDFSCSCVRPFLGHTCQY 261
+ + P + GHC +L WR C RPF+ C Y
Sbjct: 78 AYWAPGQPGNRRGHCVELCTRGGHWRRAHCKLRRPFI---CSY 117
>gnl|CDD|216090 pfam00733, Asn_synthase, Asparagine synthase. This family is
always found associated with pfam00310. Members of this
family catalyze the conversion of aspartate to
asparagine.
Length = 195
Score = 31.8 bits (73), Expect = 1.2
Identities = 12/30 (40%), Positives = 16/30 (53%)
Query: 924 LAILLLGGLISMAVLCLMARQKRGRRGTYS 953
+ +LL GGL S + L ARQ T+S
Sbjct: 20 VGVLLSGGLDSSLIAALAARQSDPPLKTFS 49
>gnl|CDD|233184 TIGR00917, 2A060601, Niemann-Pick C type protein family. The model
describes Niemann-Pick C type protein in eukaryotes. The
defective protein has been associated with Niemann-Pick
disease which is described in humans as autosomal
recessive lipidosis. It is characterized by the
lysosomal accumulation of unestrified cholesterol. It is
an integral membrane protein, which indicates that this
protein is most likely involved in cholesterol transport
or acts as some component of cholesterol homeostasis
[Transport and binding proteins, Other].
Length = 1204
Score = 32.5 bits (74), Expect = 1.5
Identities = 26/158 (16%), Positives = 46/158 (29%), Gaps = 25/158 (15%)
Query: 820 MCEIDINECELSSDMCGNNGECINQPGDYKCACQFDTCGYLCNFPD------PCKDE--P 871
+C N C ++ G N + C C D
Sbjct: 147 LCGGAANFCN-WTNWIGQMAGVNNPGAPFGITFLPTPCPVSSGMRPMNVSTYSCGDAVLG 205
Query: 872 CQNGGTCHEDCRHQADYKCDCLPGWTGKNCTEVPE-YLPSRVVDLALLIIGPILAILLLG 930
C G +C + CDC + P PS + L + I+AIL +
Sbjct: 206 CSCGCSCSD---------CDCSCKA------KPPTPPKPSCSIILGVDAYDVIMAILYIV 250
Query: 931 GLISMAVLCLMARQKRGRRGTYSPSSQEYCNPRVEMNN 968
++ L+ + ++ ++ + E NN
Sbjct: 251 LVLVFLGGGLLFPVRGKKKTSFMGTLSEASGEINSSNN 288
>gnl|CDD|236049 PRK07562, PRK07562, ribonucleotide-diphosphate reductase subunit
alpha; Validated.
Length = 1220
Score = 32.2 bits (74), Expect = 1.9
Identities = 18/49 (36%), Positives = 19/49 (38%), Gaps = 26/49 (53%)
Query: 815 GYTGEMCEIDINECELSSDMCGN-----NGECINQPGDYKCACQFDTCG 858
GYTGE C CGN NG C+ KC DTCG
Sbjct: 1187 GYTGEACS-----------ECGNFTLVRNGTCL------KC----DTCG 1214
>gnl|CDD|212128 cd11663, GH119_BcIgtZ-like, putative catalytic domain of glycoside
hydrolase family 119 (GH119). The prokaryotic subgroup
is represented by IgtZ, an alpha-amylase from a Bacillus
circulans strain. The GH119 family is related to GH57, a
chiefly prokaryotic family with the majority of
thermostable enzymes coming from extremophiles (many of
these are archaeal hyperthermophiles), which exhibit the
enzyme specificities of alpha-amylase (EC 3.2.1.1),
4-alpha-glucanotransferase (EC 2.4.1.25),
amylopullulanase (EC 3.2.1.1/41), and
alpha-galactosidase (EC 3.2.1.22). GH57s cleave
alpha-glycosidic bonds by employing a retaining
mechanism, which involves a glycosyl-enzyme
intermediate, allowing transglycosylation.
Length = 363
Score = 31.1 bits (70), Expect = 3.1
Identities = 25/95 (26%), Positives = 38/95 (40%), Gaps = 13/95 (13%)
Query: 395 HHHRSIRSPYQPSRSYPRWATYDEMENYTTAR---------RRYSRQTAGEKQFSPLPNF 445
+ + +R+ Y P S P W Y + +R + G+ L F
Sbjct: 212 NEQQVVRNKY-PFASTPHWVRYVDPATGAESRVVGVPVAQAESWEEGYQGQVTADALKPF 270
Query: 446 KGIIHDVQIYFISGSGDRSSPSSNGAGSEETSYIA 480
+G++ Q + I+ GD SS AGSEET A
Sbjct: 271 EGLVPQKQFFVIAHDGDN---SSGRAGSEETWRNA 302
>gnl|CDD|205157 pfam12947, EGF_3, EGF domain. This family includes a variety of
EGF-like domain homologues. This family includes the
C-terminal domain of the malaria parasite MSP1 protein.
Length = 36
Score = 27.1 bits (61), Expect = 3.3
Identities = 8/26 (30%), Positives = 12/26 (46%)
Query: 828 CELSSDMCGNNGECINQPGDYKCACQ 853
C ++ C N C N G + C C+
Sbjct: 1 CAENNGGCHPNATCTNTGGSFTCTCK 26
>gnl|CDD|238949 cd01991, Asn_Synthase_B_C, The C-terminal domain of Asparagine
Synthase B. This domain is always found associated
n-terminal amidotransferase domain. Family members that
contain this domain catalyse the conversion of aspartate
to asparagine. Asparagine synthetase B catalyzes the
assembly of asparagine from aspartate, Mg(2+)ATP, and
glutamine. The three-dimensional architecture of the
N-terminal domain of asparagine synthetase B is similar
to that observed for glutamine
phosphoribosylpyrophosphate amidotransferase while the
molecular motif of the C-domain is reminiscent to that
observed for GMP synthetase .
Length = 269
Score = 30.5 bits (69), Expect = 3.8
Identities = 12/30 (40%), Positives = 15/30 (50%)
Query: 924 LAILLLGGLISMAVLCLMARQKRGRRGTYS 953
+ +LL GGL S V L AR T+S
Sbjct: 18 VGVLLSGGLDSSLVAALAARLLPEPVKTFS 47
>gnl|CDD|212095 cd00451, GH38N_AMII_euk, N-terminal catalytic domain of eukaryotic
class II alpha-mannosidases; glycoside hydrolase family
38 (GH38). The family corresponds to a group of
eukaryotic class II alpha-mannosidases (AlphaMII), which
contain Golgi alpha-mannosidases II (GMII), the major
broad specificity lysosomal alpha-mannosidases (LAM,
MAN2B1), the noval core-specific lysosomal alpha
1,6-mannosidases (Epman, MAN2B2), and similar proteins.
GMII catalyzes the hydrolysis of the terminal both
alpha-1,3-linked and alpha-1,6-linked mannoses from the
high-mannose oligosaccharide GlcNAc(Man)5(GlcNAc)2 to
yield GlcNAc(Man)3(GlcNAc)2 (GlcNAc,
N-acetylglucosmine), which is the committed step of
complex N-glycan synthesis. LAM is a broad specificity
exoglycosidase hydrolyzing all known alpha 1,2-, alpha
1,3-, and alpha 1,6-mannosidic linkages from numerous
high mannose type oligosaccharides. Different from LAM,
Epman can efficiently cleave only the alpha 1,6-linked
mannose residue from (Man)3GlcNAc, but not
(Man)3(GlcNAc)2 or other larger high mannose
oligosaccharides, in the core of N-linked glycans.
Members in this family are retaining glycosyl hydrolases
of family GH38 that employs a two-step mechanism
involving the formation of a covalent glycosyl enzyme
complex. Two carboxylic acids positioned within the
active site act in concert: one as a catalytic
nucleophile and the other as a general acid/base
catalyst.
Length = 258
Score = 30.3 bits (69), Expect = 4.6
Identities = 16/78 (20%), Positives = 27/78 (34%), Gaps = 16/78 (20%)
Query: 426 RRRYSRQTAGEKQFSPLPNFKGIIHDVQIYFISG---SGDRSSPSSNGAGSEETSYIAAE 482
R + Q KQ FK ++ + Q+ F+ G D + + I +
Sbjct: 55 ERWWEDQGNDTKQ-----QFKKLVKNGQLEFVGGGWVMNDEACTTYES--------IIDQ 101
Query: 483 MEAGELFVRLQFNSTPES 500
M G F++ F P
Sbjct: 102 MTEGHQFLKDTFGVRPRV 119
>gnl|CDD|153095 cd04706, PLA2_plant, PLA2_plant: Plant-specific sub-family of
Phospholipase A2, a super-family of secretory and
cytosolic enzymes; the latter are either Ca dependent or
Ca independent. Enzymatically active PLA2 cleaves the
sn-2 position of the glycerol backbone of phospholipids;
secreted PLA2s have also been found to specifically bind
to a variety of soluble and membrane proteins in
mammals, including receptors. As a toxin, PLA2 is a
potent presynaptic neurotoxin which blocks nerve
terminals by binding to the nerve membrane and
hydrolyzing stable membrane lipids. The products of the
hydrolysis cannot form bilayers leading to a change in
membrane conformation and ultimately to a block in the
release of neurotransmitters. PLA2 may form dimers or
oligomers. This sub-family does not appear to have a
conserved active site and metal-binding loop.
Length = 117
Score = 28.9 bits (65), Expect = 5.0
Identities = 21/87 (24%), Positives = 34/87 (39%), Gaps = 7/87 (8%)
Query: 622 RYSCECTPGWEGALCEKEIDECLSNPCMNGGQCEDRL-AGFV-CNCSEEYVGERCESLRQ 679
RY C PG+ G E+ D+ L CM C ++ C+E++ +R+
Sbjct: 23 RYGKYCGPGYSGCPGERPCDD-LDACCMTHDACVQAKKNDYLSLECNEKFK----NCVRR 77
Query: 680 ISCADQPCYFGAVCQDTKISPYFPQGP 706
A +P + G C T + P
Sbjct: 78 FRKARKPTFEGNKCIVTFVIPVITVVM 104
>gnl|CDD|180947 PRK07366, PRK07366, succinyldiaminopimelate transaminase;
Validated.
Length = 388
Score = 30.4 bits (69), Expect = 5.4
Identities = 27/78 (34%), Positives = 37/78 (47%), Gaps = 12/78 (15%)
Query: 148 VRLQFNSTPESYNVGG--VKLADGNNHLIQVVRNVTLVQVKLNGTEYFRKTISSTGILDV 205
V ++F + +SYN+GG + A GN LIQ +R V V V N +Y GIL+
Sbjct: 230 VSIEFFTLSKSYNMGGFRIGFAIGNAQLIQALRQVKAV-VDFN--QY-------RGILNG 279
Query: 206 QVLYLGGIPETVHHHRSI 223
+ L G TV I
Sbjct: 280 AIAALTGPQATVQQTVQI 297
>gnl|CDD|133156 cd05489, xylanase_inhibitor_I_like, TAXI-I inhibits degradation of
xylan in the cell wall. Xylanase inhibitor-I (TAXI-I)
is a member of potent TAXI-type inhibitors of fungal and
bacterial family 11 xylanases. Plants developed a
diverse battery of defense mechanisms in response to
continual challenges by a broad spectrum of pathogenic
microorganisms. Their defense arsenal includes
inhibitors of cell wall-degrading enzymes, which hinder
a possible invasion and colonization by antagonists.
Xylanases of fungal and bacterial pathogens are the key
enzymes in the degradation of xylan in the cell wall.
Plants secrete proteins that inhibit these degradation
glycosidases, including xylanase. Surprisingly, TAXI-I
displays structural homology with the pepsin-like family
of aspartic proteases but is proteolytically
nonfunctional, because one or more residues of the
essential catalytic triad are absent. The structure of
the TAXI-inhibitor, Aspergillus niger xylanase I
complex, illustrates the ability of tight binding and
inhibition with subnanomolar affinity and indicates the
importance of the C-terminal end for the differences in
xylanase specificity among different TAXI-type
inhibitors. This family also contains pepsin-like
aspartic proteinases homologous to TAXI-I. Unlike
TAXI-I, they have active site aspartates and are
functionally active. This family of aspartate proteases
is classified by MEROPS as the peptidase family A1
(pepsin A, clan AA).
Length = 362
Score = 29.2 bits (66), Expect = 9.9
Identities = 13/43 (30%), Positives = 19/43 (44%), Gaps = 2/43 (4%)
Query: 29 TIYPINPITQFEQPTSLSQSQTSFPTT--TNFTFVTSPDFTAA 69
T +P NP+T L+Q S TT +N V +F +
Sbjct: 69 TAHPYNPVTGECATGDLTQDVLSANTTDGSNPLLVVIFNFVFS 111
>gnl|CDD|216331 pfam01150, GDA1_CD39, GDA1/CD39 (nucleoside phosphatase) family.
Length = 421
Score = 29.7 bits (67), Expect = 10.0
Identities = 17/76 (22%), Positives = 27/76 (35%), Gaps = 26/76 (34%)
Query: 635 LCEKEIDECLSNPCMNGGQCEDRLAGF--VCNCSEEYV----------GERC-ESLRQIS 681
L + LS+PC+ GF + SE E+C +S+R++
Sbjct: 225 LISNSSNLILSDPCL--------PPGFNKTVSYSEVEFDVFAIRGTGNYEQCSQSIRELL 276
Query: 682 -----CADQPCYFGAV 692
C + C F V
Sbjct: 277 NKNAVCPYEQCTFNGV 292
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.319 0.136 0.427
Gapped
Lambda K H
0.267 0.0715 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 48,797,016
Number of extensions: 4701939
Number of successful extensions: 3683
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3646
Number of HSP's successfully gapped: 90
Length of query: 979
Length of database: 10,937,602
Length adjustment: 106
Effective length of query: 873
Effective length of database: 6,236,078
Effective search space: 5444096094
Effective search space used: 5444096094
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 64 (28.5 bits)