RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy9228
(834 letters)
>gnl|CDD|238058 cd00110, LamG, Laminin G domain; Laminin G-like domains are usually
Ca++ mediated receptors that can have binding sites for
steroids, beta1 integrins, heparin, sulfatides,
fibulin-1, and alpha-dystroglycans. Proteins that
contain LamG domains serve a variety of purposes
including signal transduction via cell-surface steroid
receptors, adhesion, migration and differentiation
through mediation of cell adhesion molecules.
Length = 151
Score = 109 bits (275), Expect = 6e-28
Identities = 53/157 (33%), Positives = 76/157 (48%), Gaps = 8/157 (5%)
Query: 654 VHFLGEGYVELKKELIEERRNEETIAFDFVTDDKNALLLWNGQPSYKNGIGREFIAVAVV 713
V F G YV L R +I+F F T N LLL+ G + G +F+A+ +
Sbjct: 2 VSFSGSSYVRLPTL--PAPRTRLSISFSFRTTSPNGLLLYAG-----SQNGGDFLALELE 54
Query: 714 NGYLEYSYDLGDGVVTIKFSKKPVNDGIKHSVNVTRINKFGSLEVDSVIVGKGESPGSQD 773
+G L YDLG G + + SK P+NDG HSV+V R + +L VD V + SPG
Sbjct: 55 DGRLVLRYDLGSGSLVLS-SKTPLNDGQWHSVSVERNGRSVTLSVDGERVVESGSPGGSA 113
Query: 774 VINTRGNIYLGGTPNMDLMTGGRYVHPMSGLMMNIHI 810
++N G +YLGG P G G + ++ +
Sbjct: 114 LLNLDGPLYLGGLPEDLKSPGLPVSPGFVGCIRDLKV 150
Score = 107 bits (269), Expect = 4e-27
Identities = 59/152 (38%), Positives = 80/152 (52%), Gaps = 11/152 (7%)
Query: 393 SYLALPTLTDAHLHFSIELSFKPTDYNGLIMYTGDSNMKSYKGKGDFVSFGLEDGYPVFR 452
SY+ LPTL SI SF+ T NGL++Y G + GDF++ LEDG V R
Sbjct: 8 SYVRLPTLPAPRTRLSISFSFRTTSPNGLLLYAGS------QNGGDFLALELEDGRLVLR 61
Query: 453 FDVGL--VVLRSKVTLVPHEWVVVTIIKDFKEGKLSVGGEPLIVGSTPGEKLQVLNLRTP 510
+D+G +VL SK L +W V++ ++ + LSV GE + V S +LNL P
Sbjct: 62 YDLGSGSLVLSSKTPLNDGQWHSVSVERNGRSVTLSVDGERV-VESGSPGGSALLNLDGP 120
Query: 511 LYLGGYNIYHVTPSLSVEVTEGFHGCISTIDV 542
LYLGG +P L V+ GF GCI + V
Sbjct: 121 LYLGGLPEDLKSPGLP--VSPGFVGCIRDLKV 150
Score = 53.6 bits (129), Expect = 3e-08
Identities = 20/76 (26%), Positives = 36/76 (47%), Gaps = 4/76 (5%)
Query: 222 FKHGSYLAYPTPK-TMRKFKVSLRLNPRDVRDGIILYSGQSDDGLGDFISLAIREKHMEF 280
F SY+ PT + +S +G++LY+G GDF++L + + +
Sbjct: 4 FSGSSYVRLPTLPAPRTRLSISFSFRTTS-PNGLLLYAG--SQNGGDFLALELEDGRLVL 60
Query: 281 RFDTGSATPLYSNDAP 296
R+D GS + + S+ P
Sbjct: 61 RYDLGSGSLVLSSKTP 76
Score = 47.8 bits (114), Expect = 2e-06
Identities = 21/69 (30%), Positives = 31/69 (44%), Gaps = 6/69 (8%)
Query: 53 KLNVER-----MMFVDGIGPFSGESQGAFQGLDLSELVYIGAVPDFGEIHPSAGFSNGFK 107
++VER + VDG S G L+L +Y+G +P+ + S GF
Sbjct: 84 SVSVERNGRSVTLSVDGERVVESGSPGGSALLNLDGPLYLGGLPEDLKSPGLPV-SPGFV 142
Query: 108 GCVSRLKYN 116
GC+ LK N
Sbjct: 143 GCIRDLKVN 151
>gnl|CDD|214598 smart00282, LamG, Laminin G domain.
Length = 132
Score = 104 bits (262), Expect = 2e-26
Identities = 44/137 (32%), Positives = 65/137 (47%), Gaps = 5/137 (3%)
Query: 677 TIAFDFVTDDKNALLLWNGQPSYKNGIGREFIAVAVVNGYLEYSYDLGDGVVTIKFSKKP 736
+I+F F T N LLL+ G + G +++A+ + +G L YDLG G + P
Sbjct: 1 SISFSFRTTSPNGLLLYAG-----SKGGGDYLALELRDGRLVLRYDLGSGPARLTSDPTP 55
Query: 737 VNDGIKHSVNVTRINKFGSLEVDSVIVGKGESPGSQDVINTRGNIYLGGTPNMDLMTGGR 796
+NDG H V V R + +L VD GESPG ++N G +YLGG P +
Sbjct: 56 LNDGQWHRVAVERNGRSVTLSVDGGNRVSGESPGGLTILNLDGPLYLGGLPEDLKLPPLP 115
Query: 797 YVHPMSGLMMNIHIQNK 813
G + N+ + K
Sbjct: 116 VTPGFRGCIRNLKVNGK 132
Score = 101 bits (254), Expect = 2e-25
Identities = 54/140 (38%), Positives = 72/140 (51%), Gaps = 12/140 (8%)
Query: 408 SIELSFKPTDYNGLIMYTGDSNMKSYKGKGDFVSFGLEDGYPVFRFDVG--LVVLRSKVT 465
SI SF+ T NGL++Y G KG GD+++ L DG V R+D+G L S T
Sbjct: 1 SISFSFRTTSPNGLLLYAGS------KGGGDYLALELRDGRLVLRYDLGSGPARLTSDPT 54
Query: 466 LV-PHEWVVVTIIKDFKEGKLSVGGEPLIVGSTPGEKLQVLNLRTPLYLGGYNIYHVTPS 524
+ +W V + ++ + LSV G + G +PG L +LNL PLYLGG P
Sbjct: 55 PLNDGQWHRVAVERNGRSVTLSVDGGNRVSGESPG-GLTILNLDGPLYLGGLPEDLKLPP 113
Query: 525 LSVEVTEGFHGCISTIDVLG 544
L VT GF GCI + V G
Sbjct: 114 LP--VTPGFRGCIRNLKVNG 131
Score = 51.2 bits (123), Expect = 9e-08
Identities = 21/57 (36%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 60 MFVDGIGPFSGESQGAFQGLDLSELVYIGAVPDFGEIHPSAGFSNGFKGCVSRLKYN 116
+ VDG SGES G L+L +Y+G +P+ ++ P + GF+GC+ LK N
Sbjct: 75 LSVDGGNRVSGESPGGLTILNLDGPLYLGGLPEDLKL-PPLPVTPGFRGCIRNLKVN 130
Score = 43.9 bits (104), Expect = 4e-05
Identities = 15/61 (24%), Positives = 29/61 (47%), Gaps = 3/61 (4%)
Query: 240 KVSLRLNPRDVRDGIILYSGQSDDGLGDFISLAIREKHMEFRFDTGSATPLYSNDAPAFN 299
+S +G++LY+G G D+++L +R+ + R+D GS ++D N
Sbjct: 1 SISFSFRTTS-PNGLLLYAGSKGGG--DYLALELRDGRLVLRYDLGSGPARLTSDPTPLN 57
Query: 300 P 300
Sbjct: 58 D 58
>gnl|CDD|216930 pfam02210, Laminin_G_2, Laminin G domain. This family includes the
Thrombospondin N-terminal-like domain, a Laminin G
subfamily.
Length = 124
Score = 96.7 bits (241), Expect = 1e-23
Identities = 42/129 (32%), Positives = 62/129 (48%), Gaps = 5/129 (3%)
Query: 682 FVTDDKNALLLWNGQPSYKNGIGREFIAVAVVNGYLEYSYDLGDGVVTIKFSKKPVNDGI 741
F T N LLL+ G G +F+A+ + +G L YDLG G + S K +NDG
Sbjct: 1 FRTTQPNGLLLYAGGED-----GLDFLALELEDGRLVLRYDLGSGGSVLLLSGKKLNDGQ 55
Query: 742 KHSVNVTRINKFGSLEVDSVIVGKGESPGSQDVINTRGNIYLGGTPNMDLMTGGRYVHPM 801
H V+V+R + +L VD V PGS ++N G +YLGG P ++
Sbjct: 56 WHRVSVSRDGRSLTLSVDGGTVVSEALPGSSSILNLNGPLYLGGLPEDSGLSLLPVTEGF 115
Query: 802 SGLMMNIHI 810
G + N+ +
Sbjct: 116 VGCIRNVRV 124
Score = 89.8 bits (223), Expect = 3e-21
Identities = 49/133 (36%), Positives = 67/133 (50%), Gaps = 12/133 (9%)
Query: 413 FKPTDYNGLIMYTGDSNMKSYKGKGDFVSFGLEDGYPVFRFDVGL---VVLRSKVTLVPH 469
F+ T NGL++Y G + DF++ LEDG V R+D+G V+L S L
Sbjct: 1 FRTTQPNGLLLYAGGEDGL------DFLALELEDGRLVLRYDLGSGGSVLLLSGKKLNDG 54
Query: 470 EWVVVTIIKDFKEGKLSVGGEPLIVGSTPGEKLQVLNLRTPLYLGGYNIYHVTPSLSVEV 529
+W V++ +D + LSV G ++ + PG LNL PLYLGG L V
Sbjct: 55 QWHRVSVSRDGRSLTLSVDGGTVVSEALPGSSSI-LNLNGPLYLGGLPEDSGLSLLP--V 111
Query: 530 TEGFHGCISTIDV 542
TEGF GCI + V
Sbjct: 112 TEGFVGCIRNVRV 124
Score = 43.2 bits (102), Expect = 5e-05
Identities = 14/56 (25%), Positives = 27/56 (48%), Gaps = 1/56 (1%)
Query: 60 MFVDGIGPFSGESQGAFQGLDLSELVYIGAVPDFGEIHPSAGFSNGFKGCVSRLKY 115
+ VDG S G+ L+L+ +Y+G +P+ + + GF GC+ ++
Sbjct: 70 LSVDGGTVVSEALPGSSSILNLNGPLYLGGLPEDSGL-SLLPVTEGFVGCIRNVRV 124
Score = 37.0 bits (86), Expect = 0.007
Identities = 13/48 (27%), Positives = 24/48 (50%), Gaps = 2/48 (4%)
Query: 252 DGIILYSGQSDDGLGDFISLAIREKHMEFRFDTGSATPLYSNDAPAFN 299
+G++LY+G D DF++L + + + R+D GS + N
Sbjct: 7 NGLLLYAGGEDGL--DFLALELEDGRLVLRYDLGSGGSVLLLSGKKLN 52
>gnl|CDD|215681 pfam00054, Laminin_G_1, Laminin G domain.
Length = 131
Score = 88.9 bits (221), Expect = 7e-21
Identities = 42/134 (31%), Positives = 67/134 (50%), Gaps = 8/134 (5%)
Query: 682 FVTDDKNALLLWNGQPSYKNGIGREFIAVAVVNGYLEYSYDLGDGVVTIKFSKKPVNDGI 741
F T + + LLL+NG + R+F+A+ + +G LE SYDLG G + S +NDG
Sbjct: 1 FRTTEPSGLLLYNGTQT-----ERDFLALELRDGRLEVSYDLGSGAAVV-RSGDKLNDGK 54
Query: 742 KHSVNVTRINKFGSLEVDSVIVGKGESPGSQDV-INTRGNIYLGGTPNM-DLMTGGRYVH 799
HSV + R + G+L VD GESP ++ G +Y+GG P++ +
Sbjct: 55 WHSVELERNGRSGTLSVDGEARVTGESPLGATTDLDVDGPLYVGGLPSLAVKLRRLAISP 114
Query: 800 PMSGLMMNIHIQNK 813
G + ++ + K
Sbjct: 115 SFDGCIRDVIVNGK 128
Score = 82.0 bits (203), Expect = 2e-18
Identities = 42/138 (30%), Positives = 64/138 (46%), Gaps = 9/138 (6%)
Query: 413 FKPTDYNGLIMYTGDSNMKSYKGKGDFVSFGLEDGYPVFRFDVG--LVVLRSKVTLVPHE 470
F+ T+ +GL++Y G + DF++ L DG +D+G V+RS L +
Sbjct: 1 FRTTEPSGLLLYNGTQT------ERDFLALELRDGRLEVSYDLGSGAAVVRSGDKLNDGK 54
Query: 471 WVVVTIIKDFKEGKLSVGGEPLIVGSTPGEKLQVLNLRTPLYLGGYNIYHVTPSLSVEVT 530
W V + ++ + G LSV GE + G +P L++ PLY+GG V
Sbjct: 55 WHSVELERNGRSGTLSVDGEARVTGESPLGATTDLDVDGPLYVGGLPSLAVKLRRLAISP 114
Query: 531 EGFHGCISTIDVLGSELD 548
F GCI + V G LD
Sbjct: 115 -SFDGCIRDVIVNGKPLD 131
Score = 37.3 bits (87), Expect = 0.006
Identities = 12/36 (33%), Positives = 22/36 (61%), Gaps = 2/36 (5%)
Query: 252 DGIILYSGQSDDGLGDFISLAIREKHMEFRFDTGSA 287
G++LY+G + DF++L +R+ +E +D GS
Sbjct: 7 SGLLLYNGTQTER--DFLALELRDGRLEVSYDLGSG 40
Score = 35.0 bits (81), Expect = 0.038
Identities = 20/69 (28%), Positives = 29/69 (42%), Gaps = 4/69 (5%)
Query: 52 VKLNVER---MMFVDGIGPFSGESQ-GAFQGLDLSELVYIGAVPDFGEIHPSAGFSNGFK 107
V+L + VDG +GES GA LD+ +Y+G +P S F
Sbjct: 58 VELERNGRSGTLSVDGEARVTGESPLGATTDLDVDGPLYVGGLPSLAVKLRRLAISPSFD 117
Query: 108 GCVSRLKYN 116
GC+ + N
Sbjct: 118 GCIRDVIVN 126
>gnl|CDD|197706 smart00408, IGc2, Immunoglobulin C-2 Type.
Length = 63
Score = 47.0 bits (112), Expect = 6e-07
Identities = 19/62 (30%), Positives = 28/62 (45%), Gaps = 4/62 (6%)
Query: 309 GSKITLTCNNDLEAPVEYTWSKRSNGHVLPFGAFSREN--TLTLQEIKNSDAGMYVCKVS 366
G +TLTC + TW K +G LP + TLT++ + D+G+Y C
Sbjct: 2 GQSVTLTCPAEGNPVPNITWLK--DGKPLPESNRFVASGSTLTIKSVSLEDSGLYTCVAE 59
Query: 367 NK 368
N
Sbjct: 60 NS 61
Score = 37.0 bits (86), Expect = 0.002
Identities = 16/44 (36%), Positives = 21/44 (47%), Gaps = 5/44 (11%)
Query: 4 IKWSRADGLPLQR----YAEGNVLRITNARLQDSGKYKCEIQGH 43
I W + DG PL A G+ L I + L+DSG Y C +
Sbjct: 19 ITWLK-DGKPLPESNRFVASGSTLTIKSVSLEDSGLYTCVAENS 61
>gnl|CDD|214653 smart00410, IG_like, Immunoglobulin like. IG domains that cannot
be classified into one of IGv1, IGc1, IGc2, IG.
Length = 85
Score = 46.3 bits (110), Expect = 2e-06
Identities = 22/77 (28%), Positives = 32/77 (41%), Gaps = 5/77 (6%)
Query: 309 GSKITLTCNNDLEAPVEYTWSKRSNGHVLPFGAFSREN-----TLTLQEIKNSDAGMYVC 363
G +TL+C P E TW K+ + G FS TLT+ + D+G Y C
Sbjct: 9 GESVTLSCEASGSPPPEVTWYKQGGKLLAESGRFSVSRSGSTSTLTISNVTPEDSGTYTC 68
Query: 364 KVSNKDMTVEIPSILLV 380
+N + + L V
Sbjct: 69 AATNSSGSASSGTTLTV 85
Score = 33.6 bits (77), Expect = 0.054
Identities = 16/64 (25%), Positives = 21/64 (32%), Gaps = 9/64 (14%)
Query: 1 NAYIKWSRADGLPLQ--------RYAEGNVLRITNARLQDSGKYKCEIQGHDSFRGSDYV 52
+ W + G L R + L I+N +DSG Y C S S
Sbjct: 23 PPEVTWYKQGGKLLAESGRFSVSRSGSTSTLTISNVTPEDSGTYTCAATN-SSGSASSGT 81
Query: 53 KLNV 56
L V
Sbjct: 82 TLTV 85
>gnl|CDD|214652 smart00409, IG, Immunoglobulin.
Length = 85
Score = 46.3 bits (110), Expect = 2e-06
Identities = 22/77 (28%), Positives = 32/77 (41%), Gaps = 5/77 (6%)
Query: 309 GSKITLTCNNDLEAPVEYTWSKRSNGHVLPFGAFSREN-----TLTLQEIKNSDAGMYVC 363
G +TL+C P E TW K+ + G FS TLT+ + D+G Y C
Sbjct: 9 GESVTLSCEASGSPPPEVTWYKQGGKLLAESGRFSVSRSGSTSTLTISNVTPEDSGTYTC 68
Query: 364 KVSNKDMTVEIPSILLV 380
+N + + L V
Sbjct: 69 AATNSSGSASSGTTLTV 85
Score = 33.6 bits (77), Expect = 0.054
Identities = 16/64 (25%), Positives = 21/64 (32%), Gaps = 9/64 (14%)
Query: 1 NAYIKWSRADGLPLQ--------RYAEGNVLRITNARLQDSGKYKCEIQGHDSFRGSDYV 52
+ W + G L R + L I+N +DSG Y C S S
Sbjct: 23 PPEVTWYKQGGKLLAESGRFSVSRSGSTSTLTISNVTPEDSGTYTCAATN-SSGSASSGT 81
Query: 53 KLNV 56
L V
Sbjct: 82 TLTV 85
>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
large number of membrane-bound and extracellular (mostly
animal) proteins. Many of these proteins require calcium
for their biological function and calcium-binding sites
have been found to be located at the N-terminus of
particular EGF-like domains; calcium-binding may be
crucial for numerous protein-protein interactions. Six
conserved core cysteines form three disulfide bridges as
in non calcium-binding EGF domains, whose structures are
very similar. EGF_CA can be found in tandem repeat
arrangements.
Length = 38
Score = 43.8 bits (104), Expect = 4e-06
Identities = 16/39 (41%), Positives = 20/39 (51%), Gaps = 4/39 (10%)
Query: 136 NTCKSSKHNNCINNGLCQDAATRIGYTCICPPGFSGDRC 174
+ C S N C N G C + T Y C CPPG++G C
Sbjct: 3 DECAS--GNPCQNGGTCVN--TVGSYRCSCPPGYTGRNC 37
Score = 43.0 bits (102), Expect = 7e-06
Identities = 16/33 (48%), Positives = 18/33 (54%)
Query: 612 CMKGDVCKNGGMCKVTPDSYECLCSLGYAPPNC 644
C G+ C+NGG C T SY C C GY NC
Sbjct: 5 CASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
Score = 41.5 bits (98), Expect = 3e-05
Identities = 19/37 (51%), Positives = 21/37 (56%), Gaps = 4/37 (10%)
Query: 571 CAPK-PCQNYGICYPTDTSERGYNCSCLTGYSGDHCE 606
CA PCQN G C T S Y CSC GY+G +CE
Sbjct: 5 CASGNPCQNGGTCVNTVGS---YRCSCPPGYTGRNCE 38
Score = 33.4 bits (77), Expect = 0.020
Identities = 16/41 (39%), Positives = 22/41 (53%), Gaps = 5/41 (12%)
Query: 172 DRCSVLGEPCYPGACGDGSCQDVDGAMKCLCPIGTAGKRCE 212
D C+ G PC G G+C + G+ +C CP G G+ CE
Sbjct: 3 DECA-SGNPCQNG----GTCVNTVGSYRCSCPPGYTGRNCE 38
>gnl|CDD|215652 pfam00008, EGF, EGF-like domain. There is no clear separation
between noise and signal. pfam00053 is very similar, but
has 8 instead of 6 conserved cysteines. Includes some
cytokine receptors. The EGF domain misses the N-terminus
regions of the Ca2+ binding EGF domains (this is the
main reason of discrepancy between swiss-prot domain
start/end and Pfam). The family is hard to model due to
many similar but different sub-types of EGF domains.
Pfam certainly misses a number of EGF domains.
Length = 32
Score = 43.2 bits (102), Expect = 7e-06
Identities = 17/33 (51%), Positives = 20/33 (60%), Gaps = 2/33 (6%)
Query: 141 SKHNNCINNGLCQDAATRIGYTCICPPGFSGDR 173
S +N C N G C D T GYTC CP G++G R
Sbjct: 2 SPNNPCSNGGTCVD--TPGGYTCECPEGYTGKR 32
Score = 35.1 bits (81), Expect = 0.005
Identities = 13/32 (40%), Positives = 14/32 (43%)
Query: 612 CMKGDVCKNGGMCKVTPDSYECLCSLGYAPPN 643
C + C NGG C TP Y C C GY
Sbjct: 1 CSPNNPCSNGGTCVDTPGGYTCECPEGYTGKR 32
Score = 33.6 bits (77), Expect = 0.016
Identities = 15/35 (42%), Positives = 17/35 (48%), Gaps = 4/35 (11%)
Query: 571 CAP-KPCQNYGICYPTDTSERGYNCSCLTGYSGDH 604
C+P PC N G C T GY C C GY+G
Sbjct: 1 CSPNNPCSNGGTCVDTPG---GYTCECPEGYTGKR 32
Score = 29.3 bits (66), Expect = 0.47
Identities = 11/24 (45%), Positives = 12/24 (50%)
Query: 187 GDGSCQDVDGAMKCLCPIGTAGKR 210
G+C D G C CP G GKR
Sbjct: 9 NGGTCVDTPGGYTCECPEGYTGKR 32
>gnl|CDD|143231 cd05754, Ig3_Perlecan_like, Third immunoglobulin (Ig)-like domain
found in Perlecan and similar proteins.
Ig3_Perlecan_like: domain similar to the third
immunoglobulin (Ig)-like domain found in Perlecan.
Perlecan is a large multi-domain heparin sulfate
proteoglycan, important in tissue development and
organogenesis. Perlecan can be represented as 5 major
portions; its fourth major portion (domain IV) is a
tandem repeat of immunoglobulin-like domains (Ig2-Ig15),
which can vary in size due to alternative splicing.
Perlecan binds many cellular and extracellular ligands.
Its domain IV region has many binding sites. Some of
these have been mapped at the level of individual
Ig-like domains, including a site restricted to the Ig5
domain for heparin/sulfatide, a site restricted to the
Ig3 domain for nidogen-1 and nidogen-2, a site
restricted to Ig4-5 for fibronectin, and sites
restricted to Ig2 and to Ig13-15 for fibulin-2.
Length = 85
Score = 42.5 bits (100), Expect = 4e-05
Identities = 24/77 (31%), Positives = 37/77 (48%), Gaps = 5/77 (6%)
Query: 299 NPVSTKEAPYGSKITLTCNNDLEAPVEYT--WSKRSNGHVLPFGAFSRENTLTLQEIKNS 356
++E G+ ++ C ++P YT W++ G LP A LT++ ++ S
Sbjct: 6 EEPRSQEVRPGADVSFICRAKSKSPA-YTLVWTRVGGG--LPSRAMDFNGILTIRNVQLS 62
Query: 357 DAGMYVCKVSNKDMTVE 373
DAG YVC SN T E
Sbjct: 63 DAGTYVCTGSNMLDTDE 79
Score = 31.4 bits (71), Expect = 0.36
Identities = 13/36 (36%), Positives = 19/36 (52%), Gaps = 1/36 (2%)
Query: 4 IKWSRADG-LPLQRYAEGNVLRITNARLQDSGKYKC 38
+ W+R G LP + +L I N +L D+G Y C
Sbjct: 34 LVWTRVGGGLPSRAMDFNGILTIRNVQLSDAGTYVC 69
>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
growth factor (EGF) presents in a large number of
proteins, mostly animal; the list of proteins currently
known to contain one or more copies of an EGF-like
pattern is large and varied; the functional significance
of EGF-like domains in what appear to be unrelated
proteins is not yet clear; a common feature is that
these repeats are found in the extracellular domain of
membrane-bound proteins or in proteins known to be
secreted (exception: prostaglandin G/H synthase); the
domain includes six cysteine residues which have been
shown to be involved in disulfide bonds; the main
structure is a two-stranded beta-sheet followed by a
loop to a C-terminal short two-stranded sheet;
Subdomains between the conserved cysteines vary in
length; the region between the 5th and 6th cysteine
contains two conserved glycines of which at least one
is present in most EGF-like domains; a subset of
these bind calcium.
Length = 36
Score = 40.5 bits (95), Expect = 5e-05
Identities = 15/33 (45%), Positives = 20/33 (60%), Gaps = 2/33 (6%)
Query: 141 SKHNNCINNGLCQDAATRIGYTCICPPGFSGDR 173
+ N C N G C + T Y C+CPPG++GDR
Sbjct: 3 AASNPCSNGGTCVN--TPGSYRCVCPPGYTGDR 33
Score = 38.2 bits (89), Expect = 4e-04
Identities = 17/38 (44%), Positives = 18/38 (47%), Gaps = 5/38 (13%)
Query: 571 CA-PKPCQNYGICYPTDTSERGYNCSCLTGYSGD-HCE 606
CA PC N G C T Y C C GY+GD CE
Sbjct: 2 CAASNPCSNGGTCVNT---PGSYRCVCPPGYTGDRSCE 36
Score = 37.8 bits (88), Expect = 6e-04
Identities = 14/28 (50%), Positives = 16/28 (57%)
Query: 612 CMKGDVCKNGGMCKVTPDSYECLCSLGY 639
C + C NGG C TP SY C+C GY
Sbjct: 2 CAASNPCSNGGTCVNTPGSYRCVCPPGY 29
Score = 26.7 bits (59), Expect = 4.7
Identities = 14/40 (35%), Positives = 21/40 (52%), Gaps = 6/40 (15%)
Query: 174 CSVLGEPCYPGACGDGSCQDVDGAMKCLCPIGTAG-KRCE 212
C+ PC G G+C + G+ +C+CP G G + CE
Sbjct: 2 CAASN-PCSNG----GTCVNTPGSYRCVCPPGYTGDRSCE 36
>gnl|CDD|143165 cd00096, Ig, Immunoglobulin domain. Ig: immunoglobulin (Ig) domain
found in the Ig superfamily. The Ig superfamily is a
heterogenous group of proteins, built on a common fold
comprised of a sandwich of two beta sheets. Members of
this group are components of immunoglobulin, neuroglia,
cell surface glycoproteins, such as, T-cell receptors,
CD2, CD4, CD8, and membrane glycoproteins, such as,
butyrophilin and chondroitin sulfate proteoglycan core
protein. A predominant feature of most Ig domains is a
disulfide bridge connecting the two beta-sheets with a
tryptophan residue packed against the disulfide bond.
Length = 74
Score = 41.7 bits (97), Expect = 6e-05
Identities = 25/72 (34%), Positives = 30/72 (41%), Gaps = 12/72 (16%)
Query: 312 ITLTCNNDLEAPVEYTWSKRSNGHVLP----------FGAFSRENTLTLQEIKNSDAGMY 361
+TLTC P TW K NG LP G S +TLT+ + D+G Y
Sbjct: 1 VTLTCLASGPPPPTITWLK--NGKPLPSSVLTRVRSSRGTSSGSSTLTISNVTLEDSGTY 58
Query: 362 VCKVSNKDMTVE 373
C SN TV
Sbjct: 59 TCVASNSAGTVS 70
Score = 30.9 bits (69), Expect = 0.46
Identities = 9/20 (45%), Positives = 11/20 (55%)
Query: 22 VLRITNARLQDSGKYKCEIQ 41
L I+N L+DSG Y C
Sbjct: 44 TLTISNVTLEDSGTYTCVAS 63
>gnl|CDD|143209 cd05732, Ig5_NCAM-1_like, Fifth immunoglobulin (Ig)-like domain of
Neural Cell Adhesion Molecule NCAM-1 (NCAM) and similar
proteins. Ig5_NCAM-1 like: domain similar to the fifth
immunoglobulin (Ig)-like domain of Neural Cell Adhesion
Molecule NCAM-1 (NCAM). NCAM plays important roles in
the development and regeneration of the central nervous
system, in synaptogenesis and neural migration. NCAM
mediates cell-cell and cell-substratum recognition and
adhesion via homophilic (NCAM-NCAM), and heterophilic
(NCAM-non-NCAM), interactions. NCAM is expressed as
three major isoforms having different intracellular
extensions. The extracellular portion of NCAM has five
N-terminal Ig-like domains and two fibronectin type III
domains. The double zipper adhesion complex model for
NCAM homophilic binding involves Ig1, Ig2, and Ig3. By
this model, Ig1 and Ig2 mediate dimerization of NCAM
molecules situated on the same cell surface (cis
interactions), and Ig3 domains mediate interactions
between NCAM molecules expressed on the surface of
opposing cells (trans interactions), through binding to
the Ig1 and Ig2 domains. The adhesive ability of NCAM is
modulated by the addition of polysialic acid chains to
the fifth Ig-like domain. Also included in this group is
NCAM-2 (also known as OCAM/mamFas II and RNCAM) NCAM-2
is differentially expressed in the developing and mature
olfactory epithelium (OE).
Length = 96
Score = 42.1 bits (99), Expect = 7e-05
Identities = 26/95 (27%), Positives = 45/95 (47%), Gaps = 17/95 (17%)
Query: 296 PAFNPVSTKEAPYGSKITLTCNNDLEAPVEYTWS----------KRSNGHVLPFGAFSRE 345
P + + A +ITLTC + + E TW K +G ++ G R
Sbjct: 3 PKITYLENQTAVELEQITLTCEAEGDPIPEITWRRATRNFSEGDKSLDGRIVVRGHA-RV 61
Query: 346 NTLTLQEIKNSDAGMYVCKVSN------KDMTVEI 374
++LTL++++ +DAG Y C+ SN + M +E+
Sbjct: 62 SSLTLKDVQLTDAGRYDCEASNRIGGDQQSMYLEV 96
>gnl|CDD|143169 cd04968, Ig3_Contactin_like, Third Ig domain of contactin.
Ig3_Contactin_like: Third Ig domain of contactins.
Contactins are neural cell adhesion molecules and are
comprised of six Ig domains followed by four
fibronectin type III(FnIII) domains anchored to the
membrane by glycosylphosphatidylinositol. The first
four Ig domains form the intermolecular binding
fragment, which arranges as a compact U-shaped module
via contacts between Ig domains 1 and 4, and between Ig
domains 2 and 3. Contactin-2 (TAG-1, axonin-1) may play
a part in the neuronal processes of neurite outgrowth,
axon guidance and fasciculation, and neuronal
migration. This group also includes contactin-1 and
contactin-5. The different contactins show different
expression patterns in the central nervous system.
During development and in adulthood, contactin-2 is
transiently expressed in subsets of central and
peripheral neurons. Contactin-5 is expressed
specifically in the rat postnatal nervous system,
peaking at about 3 weeks postnatal, and a lack of
contactin-5 (NB-2) results in an impairment of neuronal
act ivity in the rat auditory system. Contactin-5 is
highly expressed in the adult human brain in the
occipital lobe and in the amygdala. Contactin-1 is
differentially expressed in tumor tissues and may,
through a RhoA mechanism, facilitate invasion and
metastasis of human lung adenocarcinoma.
Length = 88
Score = 40.5 bits (95), Expect = 2e-04
Identities = 21/55 (38%), Positives = 29/55 (52%), Gaps = 6/55 (10%)
Query: 4 IKWSRADGLPLQRY---AEGNVLRITNARLQDSGKYKCE---IQGHDSFRGSDYV 52
IKW + DG G VL+I N + +D G Y+CE I+G D+ +G YV
Sbjct: 33 IKWRKVDGSMPSSAEISMSGAVLKIPNIQFEDEGTYECEAENIKGKDTHQGRIYV 87
>gnl|CDD|214542 smart00179, EGF_CA, Calcium-binding EGF-like domain.
Length = 39
Score = 38.4 bits (90), Expect = 4e-04
Identities = 16/44 (36%), Positives = 21/44 (47%), Gaps = 8/44 (18%)
Query: 132 VDSCNTCKSSKHNNCINNGLCQDAATRIGYTCICPPGFS-GDRC 174
+D C + N C N G C + T Y C CPPG++ G C
Sbjct: 2 IDECAS-----GNPCQNGGTCVN--TVGSYRCECPPGYTDGRNC 38
Score = 38.4 bits (90), Expect = 4e-04
Identities = 16/34 (47%), Positives = 18/34 (52%), Gaps = 1/34 (2%)
Query: 612 CMKGDVCKNGGMCKVTPDSYECLCSLGYAP-PNC 644
C G+ C+NGG C T SY C C GY NC
Sbjct: 5 CASGNPCQNGGTCVNTVGSYRCECPPGYTDGRNC 38
Score = 34.5 bits (80), Expect = 0.009
Identities = 18/38 (47%), Positives = 20/38 (52%), Gaps = 5/38 (13%)
Query: 571 CAPK-PCQNYGICYPTDTSERGYNCSCLTGYS-GDHCE 606
CA PCQN G C T S Y C C GY+ G +CE
Sbjct: 5 CASGNPCQNGGTCVNTVGS---YRCECPPGYTDGRNCE 39
Score = 28.8 bits (65), Expect = 1.1
Identities = 15/36 (41%), Positives = 20/36 (55%), Gaps = 5/36 (13%)
Query: 178 GEPCYPGACGDGSCQDVDGAMKCLCPIG-TAGKRCE 212
G PC G G+C + G+ +C CP G T G+ CE
Sbjct: 8 GNPCQNG----GTCVNTVGSYRCECPPGYTDGRNCE 39
>gnl|CDD|222092 pfam13385, Laminin_G_3, Concanavalin A-like lectin/glucanases
superfamily. This domain belongs to the Concanavalin
A-like lectin/glucanases superfamily.
Length = 156
Score = 41.6 bits (98), Expect = 4e-04
Identities = 37/152 (24%), Positives = 48/152 (31%), Gaps = 28/152 (18%)
Query: 393 SYLALPTLTDAHLHFSIELSFKPTDYNG---LIMYTGDSNMKSYKGKGDFVSFGLE-DGY 448
Y+ LP L F++ KP G L++ S GL+ G
Sbjct: 10 DYVTLPNLDLPTGSFTVSAWVKPDSLPGGTRLLIGGSGSGG---------FGLGLDGSGK 60
Query: 449 PVFRF---DVGLVVLRSKVTLVPHEWVVVTIIKDFKEGKLSVGGEPLIVGSTPGEKLQVL 505
F G + S L P +W V + D KL V G VGST
Sbjct: 61 LRFTVGGGGGGAATVTSGAPLPPGQWHHVAVTYDGGTLKLYVNGVL--VGSTTLSGTITS 118
Query: 506 NLRTPLYLGGYNIYHVTPSLSVEVTEGFHGCI 537
PLY+G N F+G I
Sbjct: 119 GTTGPLYIGASNGG----------DRYFNGAI 140
>gnl|CDD|206066 pfam13895, Ig_2, Immunoglobulin domain. This domain contains
immunoglobulin-like domains.
Length = 80
Score = 38.6 bits (90), Expect = 8e-04
Identities = 20/59 (33%), Positives = 26/59 (44%), Gaps = 6/59 (10%)
Query: 309 GSKITLTCNNDLEAPVEYTWSKRSNGHVLPFGAFSRENTLTLQEIKNSDAGMYVCKVSN 367
G +TLTC+ P YTW K +G L S +N + D+G Y C SN
Sbjct: 14 GEDVTLTCSAPGNPPPNYTWYK--DGVPLS----SSQNGFFTPNVSAEDSGTYTCVASN 66
Score = 27.0 bits (60), Expect = 9.3
Identities = 16/56 (28%), Positives = 21/56 (37%), Gaps = 3/56 (5%)
Query: 1 NAYIKWSRADGLPLQRYAEGNVLRITNARLQDSGKYKCEIQGHDSFRGSDYVKLNV 56
W + DG+PL N N +DSG Y C + S+ V L V
Sbjct: 28 PPNYTWYK-DGVPLSSS--QNGFFTPNVSAEDSGTYTCVASNGGGGKTSNPVTLTV 80
>gnl|CDD|143264 cd05856, Ig2_FGFRL1-like, Second immunoglobulin (Ig)-like domain of
fibroblast growth factor (FGF) receptor_like-1(FGFRL1).
Ig2_FGFRL1-like: second immunoglobulin (Ig)-like domain
of fibroblast growth factor (FGF)
receptor_like-1(FGFRL1). FGFRL1 is comprised of a signal
peptide, three extracellular Ig-like modules, a
transmembrane segment, and a short intracellular domain.
FGFRL1 is expressed preferentially in skeletal tissues.
Similar to FGF receptors, the expressed protein
interacts specifically with heparin and with FGF2.
FGFRL1 does not have a protein tyrosine kinase domain at
its C terminus; neither does its cytoplasmic domain
appear to interact with a signaling partner. It has been
suggested that FGFRL1 may not have any direct signaling
function, but instead acts as a decoy receptor trapping
FGFs and preventing them from binding other receptors.
Length = 82
Score = 38.7 bits (90), Expect = 8e-04
Identities = 21/66 (31%), Positives = 30/66 (45%), Gaps = 2/66 (3%)
Query: 305 EAPYGSKITLTCNNDLEAPVEYTWSKRSNGHVLPFGAFSREN--TLTLQEIKNSDAGMYV 362
P GS + L C + TW K + SR+ TL+L+ +K D+G Y
Sbjct: 5 ARPVGSSVRLKCVASGNPRPDITWLKDNKPLTPTEIGESRKKKWTLSLKNLKPEDSGKYT 64
Query: 363 CKVSNK 368
C VSN+
Sbjct: 65 CHVSNR 70
>gnl|CDD|143191 cd05714, Ig_CSPGs_LP, Immunoglobulin (Ig)-like domain of
chondroitin sulfate proteoglycans (CSPGs), human
cartilage link protein (LP) and similar proteins.
Ig_CSPGs_LP: immunoglobulin (Ig)-like domain similar to
that found in chondroitin sulfate proteoglycans (CSPGs)
and human cartilage link protein (LP). Included in this
group are the CSPGs aggrecan, versican, and neurocan. In
CSPGs this Ig-like domain is followed by hyaluronan
(HA)-binding tandem repeats, and a C-terminal region
with epidermal growth factor-like, lectin-like, and
complement regulatory protein-like domains. Separating
these N- and C-terminal regions is a nonhomologous
glycosaminoglycan attachment region. In cartilage,
aggrecan forms cartilage link protein stabilized
aggregates with hyaluronan (HA). These aggregates
contribute to the tissue's load bearing properties.
Aggrecan and versican have a wide distribution in
connective tissue and extracellular matrices. Neurocan
is localized almost exclusively in nervous tissue.
Aggregates having other CSPGs substituting for aggrecan
may contribute to the structural integrity of many
different tissues. There is considerable evidence that
HA-binding CSPGs are involved in developmental processes
in the central nervous system. Members of the vertebrate
HPLN (hyaluronan/HA and proteoglycan binding link)
protein family are physically linked adjacent to CSPG
genes.
Length = 106
Score = 39.2 bits (92), Expect = 9e-04
Identities = 21/49 (42%), Positives = 28/49 (57%), Gaps = 6/49 (12%)
Query: 12 LPLQRYAEGNV-LRITNARLQDSGKYKCE-IQG-HDSFRGSDYVKLNVE 57
+PL G+ L IT+ RL+DSG+Y+CE I G D D V+L V
Sbjct: 58 VPLYPADPGDASLVITDLRLEDSGRYRCEVIDGIEDE---QDTVELEVR 103
Score = 29.1 bits (66), Expect = 2.6
Identities = 18/98 (18%), Positives = 34/98 (34%), Gaps = 31/98 (31%)
Query: 309 GSKITLTCNNDLEAPVEYT------WSK-------------RSNGHVLPFGAF------- 342
G +TL C LE + W+K G V +G+
Sbjct: 2 GGNVTLPCRFHLEPALSAPHGPRIKWTKLESDGAKEVDVLVAIGGRVKVYGSGRVRVPLY 61
Query: 343 ---SRENTLTLQEIKNSDAGMYVCKVSN--KDMTVEIP 375
+ +L + +++ D+G Y C+V + +D +
Sbjct: 62 PADPGDASLVITDLRLEDSGRYRCEVIDGIEDEQDTVE 99
>gnl|CDD|222457 pfam13927, Ig_3, Immunoglobulin domain. This family contains
immunoglobulin-like domains.
Length = 74
Score = 37.7 bits (87), Expect = 0.001
Identities = 19/60 (31%), Positives = 29/60 (48%), Gaps = 1/60 (1%)
Query: 309 GSKITLTCNNDL-EAPVEYTWSKRSNGHVLPFGAFSRENTLTLQEIKNSDAGMYVCKVSN 367
G +TLTC+ + P +W + + G S +TLTL + + D+G Y C SN
Sbjct: 15 GGGVTLTCSAEGGPPPPTISWYRNGSISGGSGGLGSSGSTLTLSSVTSEDSGTYTCVASN 74
Score = 29.7 bits (66), Expect = 1.1
Identities = 10/44 (22%), Positives = 17/44 (38%), Gaps = 3/44 (6%)
Query: 1 NAYIKWSRADGL---PLQRYAEGNVLRITNARLQDSGKYKCEIQ 41
I W R + + G+ L +++ +DSG Y C
Sbjct: 30 PPTISWYRNGSISGGSGGLGSSGSTLTLSSVTSEDSGTYTCVAS 73
>gnl|CDD|143190 cd05713, Ig_MOG_like, Immunoglobulin (Ig)-like domain of myelin
oligodendrocyte glycoprotein (MOG). Ig_MOG_like:
immunoglobulin (Ig)-like domain of myelin
oligodendrocyte glycoprotein (MOG). MOG, a minor
component of the myelin sheath, is an important
CNS-specific autoantigen, linked to the pathogenesis of
multiple sclerosis (MS) and experimental autoimmune
encephalomyelitis (EAE). It is a transmembrane protein
having an extracellular Ig domain. MOG is expressed in
the CNS on the outermost lamellae of the myelin sheath,
and on the surface of oligodendrocytes, and may
participate in the completion, compaction, and/or
maintenance of myelin. This group also includes
butyrophilin (BTN). BTN is the most abundant protein in
bovine milk-fat globule membrane (MFGM).
Length = 100
Score = 38.3 bits (90), Expect = 0.002
Identities = 14/30 (46%), Positives = 16/30 (53%), Gaps = 1/30 (3%)
Query: 18 AEGNV-LRITNARLQDSGKYKCEIQGHDSF 46
AEG+V LRI N R D G Y C Q +
Sbjct: 61 AEGSVALRIHNVRASDEGLYTCFFQSDGFY 90
>gnl|CDD|215677 pfam00047, ig, Immunoglobulin domain. Members of the
immunoglobulin superfamily are found in hundreds of
proteins of different functions. Examples include
antibodies, the giant muscle kinase titin and receptor
tyrosine kinases. Immunoglobulin-like domains may be
involved in protein-protein and protein-ligand
interactions. The Pfam alignments do not include the
first and last strand of the immunoglobulin-like domain.
Length = 62
Score = 37.1 bits (86), Expect = 0.002
Identities = 19/62 (30%), Positives = 25/62 (40%), Gaps = 5/62 (8%)
Query: 309 GSKITLTCNNDLEAPVEYTWSKRSNGHVLP-----FGAFSRENTLTLQEIKNSDAGMYVC 363
GS +TLTC+ V+ TW K G TLT+ + D+G Y C
Sbjct: 1 GSSVTLTCSVSGPPQVDVTWFKEGKGLEESTTVGTDENRVSSITLTISNVTPEDSGTYTC 60
Query: 364 KV 365
V
Sbjct: 61 VV 62
>gnl|CDD|143277 cd05869, Ig5_NCAM-1, Fifth immunoglobulin (Ig)-like domain of
Neural Cell Adhesion Molecule NCAM-1 (NCAM).
Ig5_NCAM-1: The fifth immunoglobulin (Ig)-like domain of
Neural Cell Adhesion Molecule NCAM-1 (NCAM). NCAM plays
important roles in the development and regeneration of
the central nervous system, in synaptogenesis and neural
migration. NCAM mediates cell-cell and cell-substratum
recognition and adhesion via homophilic (NCAM-NCAM) and
heterophilic (NCAM-non-NCAM) interactions. NCAM is
expressed as three major isoforms having different
intracellular extensions. The extracellular portion of
NCAM has five N-terminal Ig-like domains and two
fibronectin type III domains. The double zipper adhesion
complex model for NCAM homophilic binding involves Ig1,
Ig2, and Ig3. By this model, Ig1 and Ig2 mediate
dimerization of NCAM molecules situated on the same cell
surface (cis interactions), and Ig3 domains mediate
interactions between NCAM molecules expressed on the
surface of opposing cells (trans interactions), through
binding to the Ig1 and Ig2 domains. The adhesive ability
of NCAM is modulated by the addition of polysialic acid
chains to the fifth Ig-like domain.
Length = 97
Score = 37.3 bits (86), Expect = 0.003
Identities = 22/67 (32%), Positives = 35/67 (52%), Gaps = 11/67 (16%)
Query: 311 KITLTCNNDLEAPVEYTWS----------KRSNGHVLPFGAFSRENTLTLQEIKNSDAGM 360
+ITLTC + TW K +GH++ + +R ++LTL+ I+ +DAG
Sbjct: 19 QITLTCEASGDPIPSITWRTSTRNISSEEKTLDGHIV-VRSHARVSSLTLKYIQYTDAGE 77
Query: 361 YVCKVSN 367
Y+C SN
Sbjct: 78 YLCTASN 84
>gnl|CDD|143219 cd05742, Ig1_VEGFR_like, First immunoglobulin (Ig)-like domain of
vascular endothelial growth factor (VEGF) receptor (R)
and similar proteins. Ig1_VEGFR_like: first
immunoglobulin (Ig)-like domain of vascular endothelial
growth factor (VEGF) receptor(R) related proteins. The
VEGFRs have an extracellular component with seven
Ig-like domains, a transmembrane segment, and an
intracellular tyrosine kinase domain interrupted by a
kinase-insert domain. The VEGFR family consists of three
members, VEGFR-1 (Flt-1), VEGFR-2 (KDR/Flk-1) and
VEGFR-3 (Flt-4). VEGF-A interacts with both VEGFR-1 and
VEGFR-2. VEGFR-1 binds strongest to VEGF, VEGF-2 binds
more weakly. VEGFR-3 appears not to bind VEGF, but binds
other members of the VEGF family (VEGF-C and -D). VEGFRs
bind VEGFs with high affinity with the IG-like domains.
VEGF-A is important to the growth and maintenance of
vascular endothelial cells and to the development of new
blood- and lymphatic-vessels in physiological and
pathological states. VEGFR-2 is a major mediator of the
mitogenic, angiogenic and microvascular
permeability-enhancing effects of VEGF-A. VEGFR-1 may
play an inhibitory part in these processes by binding
VEGF and interfering with its interaction with VEGFR-2.
VEGFR-1 has a signaling role in mediating monocyte
chemotaxis. VEGFR-2 and -1 may mediate a chemotactic and
a survival signal in hematopoietic stem cells or
leukemia cells. VEGFR-3 has been shown to be involved in
tumor angiogenesis and growth. This group also contains
alpha-type platelet-derived growth factor receptor
precursor (PDGFR)-alpha (CD140a), and PDGFR-beta
(CD140b). PDGFRs alpha and beta have an extracellular
component with five Ig-like domains, a transmembrane
segment, and a cytoplasmic portion that has protein
tyrosine kinase activity.
Length = 84
Score = 36.2 bits (84), Expect = 0.005
Identities = 17/84 (20%), Positives = 30/84 (35%), Gaps = 11/84 (13%)
Query: 309 GSKITLTCN--NDLEAPVEYTW---SKRSNGHVLPFGAFSRE------NTLTLQEIKNSD 357
G + L C +L V++ W K+ S +TLT+ D
Sbjct: 1 GETLVLNCTVLTELNEGVDFQWTYPGKKRGRGKSMVTRQSLSEATELSSTLTIPNATLKD 60
Query: 358 AGMYVCKVSNKDMTVEIPSILLVT 381
+G Y C S+ M + + + +
Sbjct: 61 SGTYTCAASSGTMDQKESTKVNIH 84
>gnl|CDD|214544 smart00181, EGF, Epidermal growth factor-like domain.
Length = 35
Score = 34.8 bits (80), Expect = 0.006
Identities = 14/33 (42%), Positives = 19/33 (57%), Gaps = 3/33 (9%)
Query: 141 SKHNNCINNGLCQDAATRIGYTCICPPGFSGDR 173
+ C N G C + T YTC CPPG++GD+
Sbjct: 3 ASGGPCSN-GTCIN--TPGSYTCSCPPGYTGDK 32
Score = 32.1 bits (73), Expect = 0.052
Identities = 14/28 (50%), Positives = 14/28 (50%), Gaps = 1/28 (3%)
Query: 612 CMKGDVCKNGGMCKVTPDSYECLCSLGY 639
C G C NG C TP SY C C GY
Sbjct: 2 CASGGPCSNGT-CINTPGSYTCSCPPGY 28
Score = 32.1 bits (73), Expect = 0.065
Identities = 19/38 (50%), Positives = 20/38 (52%), Gaps = 6/38 (15%)
Query: 571 CAPK-PCQNYGICYPTDTSERGYNCSCLTGYSGD-HCE 606
CA PC N G C T S Y CSC GY+GD CE
Sbjct: 2 CASGGPCSN-GTCINTPGS---YTCSCPPGYTGDKRCE 35
Score = 30.2 bits (68), Expect = 0.29
Identities = 15/35 (42%), Positives = 19/35 (54%), Gaps = 2/35 (5%)
Query: 180 PCYPGA-CGDGSCQDVDGAMKCLCPIGTAG-KRCE 212
C G C +G+C + G+ C CP G G KRCE
Sbjct: 1 ECASGGPCSNGTCINTPGSYTCSCPPGYTGDKRCE 35
>gnl|CDD|143202 cd05725, Ig3_Robo, Third immunoglobulin (Ig)-like domain in Robo
(roundabout) receptors. Ig3_Robo: domain similar to
the third immunoglobulin (Ig)-like domain in Robo
(roundabout) receptors. Robo receptors play a role in
the development of the central nervous system (CNS),
and are receptors of Slit protein. Slit is a repellant
secreted by the neural cells in the midline. Slit acts
through Robo to prevent most neurons from crossing the
midline from either side. Three mammalian Robo homologs
(robo1, -2, and -3), and three mammalian Slit homologs
(Slit-1,-2, -3), have been identified. Commissural
axons, which cross the midline, express low levels of
Robo; longitudinal axons, which avoid the midline,
express high levels of Robo. robo1, -2, and -3 are
expressed by commissural neurons in the vertebrate
spinal cord and Slits 1, -2, -3 are expressed at the
ventral midline. Robo-3 is a divergent member of the
Robo family which instead of being a positive regulator
of slit responsiveness, antagonizes slit responsiveness
in precrossing axons. The Slit-Robo interaction is
mediated by the second leucine-rich repeat (LRR) domain
of Slit and the two N-terminal Ig domains of Robo, Ig1
and Ig2. The primary Robo binding site for Slit2 has
been shown by surface plasmon resonance experiments and
mutational analysis to be is the Ig1 domain, while the
Ig2 domain has been proposed to harbor a weak secondary
binding site.
Length = 69
Score = 35.5 bits (82), Expect = 0.009
Identities = 14/39 (35%), Positives = 18/39 (46%), Gaps = 3/39 (7%)
Query: 4 IKWSRADG-LPLQRYA--EGNVLRITNARLQDSGKYKCE 39
+ W + DG LP R + L+I N D G Y CE
Sbjct: 15 VLWRKEDGELPKGRAEILDDKSLKIRNVTAGDEGSYTCE 53
>gnl|CDD|204999 pfam12661, hEGF, Human growth factor-like EGF. hEGF, or human
growth factor-like EGF, domains have six conserved
residues disulfide-bonded into the characteristic
'ababcc' pattern. They are involved in growth and
proliferation of cells, in proteins of the Notch/Delta
pathway, neurogulin and selectins. hEGFs are also found
in mosaic proteins with four-disulfide laminin EGFs such
as aggrecan and perlecan. The core fold of the EGF
domain consists of two small beta-hairpins packed
against each other. Two major structural variants have
been identified based on the structural context of the
C-terminal Cys residue of disulfide 'c' in the
C-terminal hairpin: hEGFs and cEGFs. In hEGFs the
C-terminal thiol resides in the beta-turn, resulting in
shorter loop-lengths between the Cys residues of
disulfide 'c', typically C[8-9]XC. These shorter
loop-lengths are also typical of the four-disulfide EGF
domains, laminin ad integrin. Tandem hEGF domains have
six linking residues between terminal cysteines of
adjacent domains. hEGF domains may or may not bind
calcium in the linker region. hEGF domains with the
consensus motif CXD4X[F,Y]XCXC are hydroxylated
exclusively in the Asp residue.
Length = 13
Score = 33.9 bits (79), Expect = 0.011
Identities = 8/13 (61%), Positives = 10/13 (76%)
Query: 162 TCICPPGFSGDRC 174
C CPPG++G RC
Sbjct: 1 KCQCPPGYTGPRC 13
Score = 26.9 bits (61), Expect = 3.1
Identities = 6/12 (50%), Positives = 7/12 (58%)
Query: 594 CSCLTGYSGDHC 605
C C GY+G C
Sbjct: 2 CQCPPGYTGPRC 13
>gnl|CDD|212881 cd11948, SH3_GRAP_N, N-terminal Src homology 3 domain of
GRB2-related adaptor protein. GRAP is a GRB-2 like
adaptor protein that is highly expressed in lymphoid
tissues. It acts as a negative regulator of T cell
receptor (TCR)-induced lymphocyte proliferation by
downregulating the signaling to the Ras/ERK pathway. It
has been identified as a regulator of TGFbeta signaling
in diabetic kidney tubules and may have a role in the
pathogenesis of the disease. GRAP contains an
N-terminal SH3 domain, a central SH2 domain, and a
C-terminal SH3 domain. The N-terminal SH3 domain of the
related protein GRB2 binds to Sos and Sos-derived
proline-rich peptides. SH3 domains are protein
interaction domains that bind to proline-rich ligands
with moderate affinity and selectivity, preferentially
to PxxP motifs. They play versatile and diverse roles
in the cell including the regulation of enzymes,
changing the subcellular localization of signaling
pathway components, and mediating the formation of
multiprotein complex assemblies.
Length = 54
Score = 34.4 bits (79), Expect = 0.013
Identities = 16/48 (33%), Positives = 28/48 (58%), Gaps = 4/48 (8%)
Query: 7 SRADGLPLQRYAEGNVLRITNARLQDSGKYKCEIQGHDSFRGSDYVKL 54
+ +D LP Q +G++L+I N D YK E+QG + + +Y+K+
Sbjct: 11 TESDELPFQ---KGDILKILNME-DDQNWYKAELQGREGYIPKNYIKV 54
>gnl|CDD|143285 cd05877, Ig_LP_like, Immunoglobulin (Ig)-like domain of human
cartilage link protein (LP). Ig_LP_like: immunoglobulin
(Ig)-like domain similar to that that found in human
cartilage link protein (LP). In cartilage,
chondroitin-keratan sulfate proteoglycan (CSPG),
aggrecan, forms cartilage link protein stabilized
aggregates with hyaluronan (HA). These aggregates
contribute to the tissue's load bearing properties.
Aggregates having other CSPGs substituting for aggrecan
may contribute to the structural integrity of many
different tissues. Members of the vertebrate HPLN
(hyaluronan/HA and proteoglycan binding link) protein
family are physically linked adjacent to CSPG genes.
Length = 106
Score = 35.8 bits (83), Expect = 0.014
Identities = 18/47 (38%), Positives = 26/47 (55%), Gaps = 6/47 (12%)
Query: 14 LQRYAEGNV-LRITNARLQDSGKYKCE-IQG-HDSFRGSDYVKLNVE 57
L+R + + L IT+ RL+D G+Y+CE I G D S V L +
Sbjct: 60 LRRAHDLDASLVITDLRLEDYGRYRCEVIDGLED---ESVVVALRLR 103
>gnl|CDD|191810 pfam07679, I-set, Immunoglobulin I-set domain.
Length = 90
Score = 35.3 bits (82), Expect = 0.016
Identities = 16/74 (21%), Positives = 23/74 (31%), Gaps = 6/74 (8%)
Query: 300 PVSTKEAPYGSKITLTCNNDLEAPVEYTWSK-----RSNGHVLPFGAFSRENTLTLQEIK 354
E G TC + +W K RS+ TLT+ ++
Sbjct: 6 KPKDVEVQEGESARFTCTVTGDPDPTVSWFKDGQPLRSSDRFK-VTYEGGTYTLTISNVQ 64
Query: 355 NSDAGMYVCKVSNK 368
D G Y C +N
Sbjct: 65 PDDEGKYTCVATNS 78
Score = 32.6 bits (75), Expect = 0.14
Identities = 16/43 (37%), Positives = 21/43 (48%), Gaps = 9/43 (20%)
Query: 4 IKWSRADGLPL---QRY---AEGNV--LRITNARLQDSGKYKC 38
+ W + DG PL R+ EG L I+N + D GKY C
Sbjct: 32 VSWFK-DGQPLRSSDRFKVTYEGGTYTLTISNVQPDDEGKYTC 73
>gnl|CDD|143217 cd05740, Ig_CEACAM_D4, Fourth immunoglobulin (Ig)-like domain of
carcinoembryonic antigen (CEA) related cell adhesion
molecule (CEACAM). Ig_CEACAM_D4: immunoglobulin
(Ig)-like domain 4 in carcinoembryonic antigen (CEA)
related cell adhesion molecule (CEACAM) protein
subfamily. The CEA family is a group of anchored or
secreted glycoproteins, expressed by epithelial cells,
leukocytes, endothelial cells and placenta. The CEA
family is divided into the CEACAM and pregnancy-specific
glycoprotein (PSG) subfamilies. This group represents
the CEACAM subfamily. CEACAM1 has many important
cellular functions, it is a cell adhesion molecule, and
a signaling molecule that regulates the growth of tumor
cells, it is an angiogenic factor, and is a receptor for
bacterial and viral pathogens, including mouse hepatitis
virus (MHV). In mice, four isoforms of CEACAM1 generated
by alternative splicing have either two [D1, D4] or four
[D1-D4] Ig-like domains on the cell surface. This family
corresponds to the D4 Ig-like domain.
Length = 91
Score = 34.9 bits (80), Expect = 0.020
Identities = 24/72 (33%), Positives = 31/72 (43%), Gaps = 6/72 (8%)
Query: 299 NPVSTKEAPYGSKITLTCNNDLEAPVEYTWSKRSNGHVLPFG--AFSREN-TLTLQEIKN 355
N V + +TLTC + E Y W NG +L S +N TLT +
Sbjct: 8 NSVGNQPPEDNQPVTLTC--EAEGQATYIWWVN-NGSLLVPPRLQLSNDNRTLTFNNVTR 64
Query: 356 SDAGMYVCKVSN 367
SD G Y C+ SN
Sbjct: 65 SDTGHYQCEASN 76
Score = 34.9 bits (80), Expect = 0.021
Identities = 19/58 (32%), Positives = 22/58 (37%), Gaps = 4/58 (6%)
Query: 3 YIKWSRADGLP----LQRYAEGNVLRITNARLQDSGKYKCEIQGHDSFRGSDYVKLNV 56
YI W L LQ + L N D+G Y+CE S SD LNV
Sbjct: 33 YIWWVNNGSLLVPPRLQLSNDNRTLTFNNVTRSDTGHYQCEASNEVSNMTSDPYILNV 90
>gnl|CDD|218955 pfam06247, Plasmod_Pvs28, Plasmodium ookinete surface protein
Pvs28. This family consists of several ookinete surface
protein (Pvs28) from several species of Plasmodium.
Pvs25 and Pvs28 are expressed on the surface of
ookinetes. These proteins are potential candidates for
vaccine and induce antibodies that block the infectivity
of Plasmodium vivax in immunised animals.
Length = 196
Score = 37.0 bits (86), Expect = 0.020
Identities = 22/89 (24%), Positives = 31/89 (34%), Gaps = 19/89 (21%)
Query: 559 IMDCSDLESSPVCAPKPCQNYGICYPTDTS--ERGYNCSCLTGYSGDHCEKENNMCMKGD 616
+ C LE+ K C Y C E+ C C+ GY +C +
Sbjct: 39 KVKCDKLENVN----KVCGEYATCINQANKAEEKALKCGCINGY-----TLSQGVC-VPN 88
Query: 617 VCKN----GGMCKVTPDSYE---CLCSLG 638
C N G C V P + C C++G
Sbjct: 89 KCNNKVCGSGKCIVDPANPNNTTCSCNIG 117
Score = 29.3 bits (66), Expect = 5.4
Identities = 24/76 (31%), Positives = 29/76 (38%), Gaps = 14/76 (18%)
Query: 570 VCAPKPCQNY----GICYPTDTSERGYNCSCLTGYSGDHCEKENNMCMK-GDV-----CK 619
VC P C N G C + CSC G D +N C K G+ CK
Sbjct: 84 VCVPNKCNNKVCGSGKCIVDPANPNNTTCSCNIGKVPD----QNGKCTKTGETKCSLKCK 139
Query: 620 NGGMCKVTPDSYECLC 635
CK+ YEC+C
Sbjct: 140 ENEECKLVGGYYECVC 155
>gnl|CDD|143258 cd05850, Ig1_Contactin-2, First Ig domain of contactin-2.
Ig1_Contactin-2: First Ig domain of the neural cell
adhesion molecule contactin-2-like. Contactins are
comprised of six Ig domains followed by four fibronectin
type III (FnIII) domains anchored to the membrane by
glycosylphosphatidylinositol. Contactin-2 (TAG-1,
axonin-1) facilitates cell adhesion by homophilic
binding between molecules in apposed membranes. It may
play a part in the neuronal processes of neurite
outgrowth, axon guidance and fasciculation, and neuronal
migration. The first four Ig domains form the
intermolecular binding fragment, which arranges as a
compact U-shaped module by contacts between IG domains 1
and 4, and domains 2 and 3. The different contactins
show different expression patterns in the central
nervous system. During development and in adulthood,
contactin-2 is transiently expressed in subsets of
central and peripheral neurons. Contactin-2 is also
expressed in retinal amacrine cells in the developing
chick retina, corresponding to the period of formation
and maturation of AC processes.
Length = 94
Score = 34.5 bits (79), Expect = 0.027
Identities = 25/74 (33%), Positives = 30/74 (40%), Gaps = 10/74 (13%)
Query: 307 PYGS---KITLTCNNDLEAPVEYTWSKRSNGHVLPFGAFSRE-----NTLTLQEIKNSDA 358
P GS K+TL C P Y W + NG + F SR N + K DA
Sbjct: 14 PEGSPEEKVTLGCRARASPPATYRW--KMNGTEIKFAPESRYTLVAGNLVINNPQKARDA 71
Query: 359 GMYVCKVSNKDMTV 372
G Y C N+ TV
Sbjct: 72 GSYQCLAINRCGTV 85
>gnl|CDD|143259 cd05851, Ig3_Contactin-1, Third Ig domain of contactin-1.
Ig3_Contactin-1: Third Ig domain of the neural cell
adhesion molecule contactin-1. Contactins are comprised
of six Ig domains followed by four fibronectin type III
(FnIII) domains anchored to the membrane by
glycosylphosphatidylinositol. Contactin-1 is
differentially expressed in tumor tissues and may
through a RhoA mechanism, facilitate invasion and
metastasis of human lung adenocarcinoma.
Length = 88
Score = 34.6 bits (79), Expect = 0.030
Identities = 20/57 (35%), Positives = 30/57 (52%), Gaps = 8/57 (14%)
Query: 4 IKWSRADGLPLQRYAE----GNVLRITNARLQDSGKYKCE---IQGHDSFRGSDYVK 53
I+W + P+ AE G VL+I N + +D G Y+CE I+G D + YV+
Sbjct: 33 IRWRKILE-PMPATAEISMSGAVLKIFNIQPEDEGTYECEAENIKGKDKHQARVYVQ 88
>gnl|CDD|143278 cd05870, Ig5_NCAM-2, Fifth immunoglobulin (Ig)-like domain of
Neural Cell Adhesion Molecule NCAM-2 (also known as
OCAM/mamFas II and RNCAM). Ig5_NCAM-2: the fifth
immunoglobulin (Ig)-like domain of Neural Cell Adhesion
Molecule NCAM-2 (also known as OCAM/mamFas II and
RNCAM). NCAM-2 is organized similarly to NCAM ,
including five N-terminal Ig-like domains and two
fibronectin type III domains. NCAM-2 is differentially
expressed in the developing and mature olfactory
epithelium (OE), and may function like NCAM, as an
adhesion molecule.
Length = 98
Score = 34.6 bits (79), Expect = 0.031
Identities = 20/71 (28%), Positives = 35/71 (49%), Gaps = 11/71 (15%)
Query: 309 GSKITLTCNNDLEAPVEYTWSKRSNGHVLPFGAFSR-----------ENTLTLQEIKNSD 357
TL+C + E E TW + S+GH G S E++L ++++K SD
Sbjct: 16 NGAATLSCKAEGEPIPEITWKRASDGHTFSEGDKSPDGRIEVKGQHGESSLHIKDVKLSD 75
Query: 358 AGMYVCKVSNK 368
+G Y C+ +++
Sbjct: 76 SGRYDCEAASR 86
>gnl|CDD|219677 pfam07974, EGF_2, EGF-like domain. This family contains EGF
domains found in a variety of extracellular proteins.
Length = 31
Score = 32.0 bits (73), Expect = 0.049
Identities = 10/34 (29%), Positives = 11/34 (32%), Gaps = 5/34 (14%)
Query: 572 APKPCQNYGICYPTDTSERGYNCSCLTGYSGDHC 605
A C G C C C +GY G C
Sbjct: 3 ASGICNGRGTCV-----RPCGKCVCDSGYQGATC 31
Score = 27.0 bits (60), Expect = 3.0
Identities = 10/34 (29%), Positives = 12/34 (35%), Gaps = 4/34 (11%)
Query: 141 SKHNNCINNGLCQDAATRIGYTCICPPGFSGDRC 174
S C G C R C+C G+ G C
Sbjct: 2 SASGICNGRGTC----VRPCGKCVCDSGYQGATC 31
>gnl|CDD|143270 cd05862, Ig1_VEGFR, First immunoglobulin (Ig)-like domain of
vascular endothelial growth factor (VEGF) receptor(R).
IG1_VEGFR: first immunoglobulin (Ig)-like domain of
vascular endothelial growth factor (VEGF) receptor(R).
The VEGFRs have an extracellular component with seven
Ig-like domains, a transmembrane segment, and an
intracellular tyrosine kinase domain interrupted by a
kinase-insert domain. The VEGFR family consists of three
members, VEGFR-1 (Flt-1), VEGFR-2 (KDR/Flk-1) and
VEGFR-3 (Flt-4). VEGF_A interacts with both VEGFR-1 and
VEGFR-2. VEGFR-1 binds strongest to VEGF, VEGF-2 binds
more weakly. VEGFR-3 appears not to bind VEGF, but binds
other members of the VEGF family (VEGF-C and -D). VEGFRs
bind VEGFs with high affinity with the IG-like domains.
VEGF-A is important to the growth and maintenance of
vascular endothelial cells and to the development of new
blood- and lymphatic-vessels in physiological and
pathological states. VEGFR-2 is a major mediator of the
mitogenic, angiogenic and microvascular
permeability-enhancing effects of VEGF-A. VEGFR-1 may
play an inhibitory part in these processes by binding
VEGF and interfering with its interaction with VEGFR-2.
VEGFR-1 has a signaling role in mediating monocyte
chemotaxis. VEGFR-2 and -1 may mediate a chemotactic and
a survival signal in hematopoietic stem cells or
leukemia cells. VEGFR-3 has been shown to be involved in
tumor angiogenesis and growth.
Length = 86
Score = 33.6 bits (77), Expect = 0.053
Identities = 20/84 (23%), Positives = 36/84 (42%), Gaps = 12/84 (14%)
Query: 309 GSKITLTCN--NDLEAPVEYTW----SKRSNGHVLPFGAFSRE------NTLTLQEIKNS 356
G K+ L C +L +++ W K + S + +TLT++ + S
Sbjct: 1 GEKLVLNCTARTELNVGIDFQWDYPGKKEQRAKSVSENRRSLQEHTELSSTLTIENVTLS 60
Query: 357 DAGMYVCKVSNKDMTVEIPSILLV 380
D G Y C S+ M + +I++V
Sbjct: 61 DLGRYTCTASSGQMIAKNSTIVIV 84
>gnl|CDD|221695 pfam12662, cEGF, Complement Clr-like EGF-like. cEGF, or complement
Clr-like EGF, domains have six conserved cysteine
residues disulfide-bonded into the characteristic
pattern 'ababcc'. They are found in blood coagulation
proteins such as fibrillin, Clr and Cls, thrombomodulin,
and the LDL receptor. The core fold of the EGF domain
consists of two small beta-hairpins packed against each
other. Two major structural variants have been
identified based on the structural context of the
C-terminal cysteine residue of disulfide 'c' in the
C-terminal hairpin: hEGFs and cEGFs. In cEGFs the
C-terminal thiol resides on the C-terminal beta-sheet,
resulting in long loop-lengths between the cysteine
residues of disulfide 'c', typically C[10+]XC. These
longer loop-lengths may have arisen by selective
cysteine loss from a four-disulfide EGF template such as
laminin or integrin. Tandem cEGF domains have five
linking residues between terminal cysteines of adjacent
domains. cEGF domains may or may not bind calcium in the
linker region. cEGF domains with the consensus motif
CXN4X[F,Y]XCXC are hydroxylated exclusively on the
asparagine residue.
Length = 24
Score = 31.7 bits (73), Expect = 0.060
Identities = 13/37 (35%), Positives = 17/37 (45%), Gaps = 15/37 (40%)
Query: 160 GYTCICPPGFSGDRCSVLGEPCYPGACGDG-SCQDVD 195
YTC CPPG+ GDG +C+D+D
Sbjct: 1 SYTCSCPPGYQLS--------------GDGRTCEDID 23
>gnl|CDD|143218 cd05741, Ig_CEACAM_D1_like, First immunoglobulin (Ig)-like domain
of carcinoembryonic antigen (CEA) related cell adhesion
molecule (CEACAM) and similar proteins.
Ig_CEACAM_D1_like : immunoglobulin (IG)-like domain 1 in
carcinoembryonic antigen (CEA) related cell adhesion
molecule (CEACAM) protein subfamily-like. The CEA family
is a group of anchored or secreted glycoproteins,
expressed by epithelial cells, leukocytes, endothelial
cells and placenta. The CEA family is divided into the
CEACAM and pregnancy-specific glycoprotein (PSG)
subfamilies. This group represents the CEACAM subfamily.
CEACAM1 has many important cellular functions, it is a
cell adhesion molecule, and a signaling molecule that
regulates the growth of tumor cells, it is an angiogenic
factor, and is a receptor for bacterial and viral
pathogens, including mouse hepatitis virus (MHV). In
mice, four isoforms of CEACAM1 generated by alternative
splicing have either two [D1, D4] or four [D1-D4]
Ig-like domains on the cell surface. This family
corresponds to the D1 Ig-like domain. Also belonging to
this group is the N-terminal immunoglobulin (Ig)-like
domain of the signaling lymphocyte activation molecule
(SLAM) family, CD84-like family. The SLAM family is a
group of immune-cell specific receptors that can
regulate both adaptive and innate immune responses. SLAM
family proteins are organized as an extracellular domain
with having two or four Ig-like domains, a single
transmembrane segment, and a cytoplasmic region having
tyr-based motifs. The extracellular domain is organized
as a membrane-distal Ig variable (IgV) domain that is
responsible for ligand recognition and a
membrane-proximal truncated Ig constant-2 (IgC2) domain.
Length = 92
Score = 33.4 bits (77), Expect = 0.062
Identities = 21/84 (25%), Positives = 33/84 (39%), Gaps = 19/84 (22%)
Query: 309 GSKITLTCNNDLEAPVEYTWSKRS------------NGHVLPFG-AFS-REN-----TLT 349
G +TL +N E E TW K + FG A+S RE +L
Sbjct: 1 GESVTLPVSNLPENLQEITWYKGKDKSVEAEIASYIATNSTVFGPAYSGRETIYPNGSLL 60
Query: 350 LQEIKNSDAGMYVCKVSNKDMTVE 373
+Q + D+G Y ++ + + E
Sbjct: 61 IQNLTKEDSGTYTLQIISTNGVTE 84
>gnl|CDD|238012 cd00055, EGF_Lam, Laminin-type epidermal growth factor-like domain;
laminins are the major noncollagenous components of
basement membranes that mediate cell adhesion, growth
migration, and differentiation; the laminin-type
epidermal growth factor-like module occurs in tandem
arrays; the domain contains 4 disulfide bonds (loops
a-d) the first three resemble epidermal growth factor
(EGF); the number of copies of this domain in the
different forms of laminins is highly variable ranging
from 3 up to 22 copies.
Length = 50
Score = 32.3 bits (74), Expect = 0.067
Identities = 10/34 (29%), Positives = 14/34 (41%), Gaps = 2/34 (5%)
Query: 180 PCYPGACGDGSCQDVDGAMKCLCPIGTAGKRCEQ 213
C G C G +C C T G+RC++
Sbjct: 3 DCNGHGSLSGQCDPGTG--QCECKPNTTGRRCDR 34
>gnl|CDD|143229 cd05752, Ig1_FcgammaR_like, Frst immunoglobulin (Ig)-like domain
of Fcgamma-receptors (FcgammaRs) and similar proteins.
Ig1_FcgammaR_like: domain similar to the first
immunoglobulin (Ig)-like domain of Fcgamma-receptors
(FcgammaRs). Interactions between IgG and FcgammaR are
important to the initiation of cellular and humoral
response. IgG binding to FcgammaR leads to a cascade of
signals and ultimately to functions such as
antibody-dependent-cellular-cytotoxicity (ADCC),
endocytosis, phagocytosis, release of inflammatory
mediators, etc. FcgammaR has two Ig-like domains. This
group also contains FcepsilonRI, which binds IgE with
high affinity.
Length = 78
Score = 32.3 bits (74), Expect = 0.11
Identities = 16/38 (42%), Positives = 19/38 (50%), Gaps = 4/38 (10%)
Query: 19 EGNVLRITNARLQDSGKYKCEIQGHDSFRGSDYVKLNV 56
N RI A DSG+Y+C+ QG SD V L V
Sbjct: 45 TTNSYRIRAA-NNDSGEYRCQTQGSSL---SDPVHLEV 78
>gnl|CDD|219514 pfam07686, V-set, Immunoglobulin V-set domain. This domain is
found in antibodies as well as neural protein P0 and
CTL4 amongst others.
Length = 114
Score = 33.0 bits (75), Expect = 0.16
Identities = 12/34 (35%), Positives = 16/34 (47%)
Query: 23 LRITNARLQDSGKYKCEIQGHDSFRGSDYVKLNV 56
L I+N RL DSG Y C + + +L V
Sbjct: 81 LTISNLRLSDSGTYTCAVSNPNELVFGAGTRLTV 114
Score = 31.8 bits (72), Expect = 0.44
Identities = 20/98 (20%), Positives = 31/98 (31%), Gaps = 26/98 (26%)
Query: 309 GSKITLTC-NNDLEAPVEYTWSKR--------------SNGHVLPFGAFS---------- 343
G +TL C + W K+ S + F
Sbjct: 16 GGSVTLPCSFSSSSGSTSVYWYKQPLGKGPELIIHYVTSTPNGKVGPRFKGRVTLSGNGS 75
Query: 344 -RENTLTLQEIKNSDAGMYVCKVSNKDMTVEIPSILLV 380
+ +LT+ ++ SD+G Y C VSN + V L
Sbjct: 76 KNDFSLTISNLRLSDSGTYTCAVSNPNELVFGAGTRLT 113
>gnl|CDD|165214 PHA02887, PHA02887, EGF-like protein; Provisional.
Length = 126
Score = 33.0 bits (75), Expect = 0.17
Identities = 14/39 (35%), Positives = 20/39 (51%), Gaps = 1/39 (2%)
Query: 138 CKSSKHNNCINNGLCQDAATRIGYTCICPPGFSGDRCSV 176
CK+ ++ CIN G C + CIC G++G RC
Sbjct: 86 CKNDFNDFCIN-GECMNIIDLDEKFCICNKGYTGIRCDE 123
>gnl|CDD|143203 cd05726, Ig4_Robo, Third immunoglobulin (Ig)-like domain in Robo
(roundabout) receptors. Ig4_Robo: domain similar to the
third immunoglobulin (Ig)-like domain in Robo
(roundabout) receptors. Robo receptors play a role in
the development of the central nervous system (CNS), and
are receptors of Slit protein. Slit is a repellant
secreted by the neural cells in the midline. Slit acts
through Robo to prevent most neurons from crossing the
midline from either side. Three mammalian Robo homologs
(robo1, -2, and -3), and three mammalian Slit homologs
(Slit-1,-2, -3), have been identified. Commissural
axons, which cross the midline, express low levels of
Robo; longitudinal axons, which avoid the midline,
express high levels of Robo. robo1, -2, and -3 are
expressed by commissural neurons in the vertebrate
spinal cord and Slits 1, -2, -3 are expressed at the
ventral midline. Robo-3 is a divergent member of the
Robo family which instead of being a positive regulator
of slit responsiveness, antagonizes slit responsiveness
in precrossing axons. The Slit-Robo interaction is
mediated by the second leucine-rich repeat (LRR) domain
of Slit and the two N-terminal Ig domains of Robo, Ig1
and Ig2. The primary Robo binding site for Slit2 has
been shown by surface plasmon resonance experiments and
mutational analysis to be is the Ig1 domain, while the
Ig2 domain has been proposed to harbor a weak secondary
binding site.
Length = 90
Score = 31.5 bits (71), Expect = 0.41
Identities = 21/85 (24%), Positives = 34/85 (40%), Gaps = 9/85 (10%)
Query: 309 GSKITLTCNNDLEAPVEYTWSKRSNGHVL-------PFGAFSRENT--LTLQEIKNSDAG 359
G +T C W K + ++L FS T LT+ ++ SD G
Sbjct: 1 GRTVTFQCEATGNPQPAIFWQKEGSQNLLFSYQPPQSSSRFSVSQTGDLTITNVQRSDVG 60
Query: 360 MYVCKVSNKDMTVEIPSILLVTDSV 384
Y+C+ N ++ + L VTD +
Sbjct: 61 YYICQTLNVAGSILTKAYLEVTDVI 85
>gnl|CDD|143208 cd05731, Ig3_L1-CAM_like, Third immunoglobulin (Ig)-like domain
of the L1 cell adhesion molecule (CAM).
Ig3_L1-CAM_like: domain similar to the third
immunoglobulin (Ig)-like domain of the L1 cell adhesion
molecule (CAM). L1 belongs to the L1 subfamily of cell
adhesion molecules (CAMs) and is comprised of an
extracellular region having six Ig-like domains and
five fibronectin type III domains, a transmembrane
region and an intracellular domain. L1 is primarily
expressed in the nervous system and is involved in its
development and function. L1 is associated with an
X-linked recessive disorder, X-linked hydrocephalus,
MASA syndrome, or spastic paraplegia type 1, that
involves abnormalities of axonal growth. This group
also contains the chicken neuron-glia cell adhesion
molecule, Ng-CAM and human neurofascin.
Length = 71
Score = 30.8 bits (70), Expect = 0.44
Identities = 14/39 (35%), Positives = 19/39 (48%), Gaps = 4/39 (10%)
Query: 4 IKWSRADG-LPLQRYAEGN---VLRITNARLQDSGKYKC 38
I W + G LP R N L+I N +D G+Y+C
Sbjct: 15 ISWIKIGGELPADRTKFENFNKTLKIDNVSEEDDGEYRC 53
>gnl|CDD|214543 smart00180, EGF_Lam, Laminin-type epidermal growth factor-like
domai.
Length = 46
Score = 29.6 bits (67), Expect = 0.52
Identities = 11/34 (32%), Positives = 16/34 (47%), Gaps = 2/34 (5%)
Query: 180 PCYPGACGDGSCQDVDGAMKCLCPIGTAGKRCEQ 213
C PG G+C G +C C G+RC++
Sbjct: 2 DCDPGGSASGTCDPDTG--QCECKPNVTGRRCDR 33
>gnl|CDD|143227 cd05750, Ig_Pro_neuregulin, Immunoglobulin (Ig)-like domain in
neuregulins (NRGs). Ig_Pro_neuregulin: immunoglobulin
(Ig)-like domain in neuregulins (NRGs). NRGs are
signaling molecules, which participate in cell-cell
interactions in the nervous system, breast, heart, and
other organ systems, and are implicated in the
pathology of diseases including schizophrenia, multiple
sclerosis, and breast cancer. There are four members of
the neuregulin gene family (NRG1, -2, -3, and -4). The
NRG-1 protein, binds to and activates the tyrosine
kinases receptors ErbB3 and ErbB4, initiating signaling
cascades. The other NRGs proteins bind one or the other
or both of these ErbBs. NRG-1 has multiple functions;
for example, in the brain it regulates various
processes such as radial glia formation and neuronal
migration, dendritic development, and expression of
neurotransmitters receptors; in the peripheral nervous
system NRG-1 regulates processes such as target cell
differentiation, and Schwann cell survival. There are
many NRG-1 isoforms, which arise from the alternative
splicing of mRNA. Less is known of the functions of the
other NRGs. NRG-2 and -3 are expressed predominantly in
the nervous system. NRG-2 is expressed by motor neurons
and terminal Schwann cells, and is concentrated near
synaptic sites and may be a signal that regulates
synaptic differentiation. NRG-4 has been shown to
direct pancreatic islet cell development towards the
delta-cell lineage.
Length = 75
Score = 30.6 bits (69), Expect = 0.56
Identities = 11/30 (36%), Positives = 19/30 (63%), Gaps = 3/30 (10%)
Query: 23 LRITNARLQDSGKYKCEIQ---GHDSFRGS 49
L+I A+L DSG+Y C ++ G+D+ +
Sbjct: 45 LQINKAKLADSGEYTCVVENILGNDTVTAN 74
>gnl|CDD|215680 pfam00053, Laminin_EGF, Laminin EGF-like (Domains III and V). This
family is like pfam00008 but has 8 conserved cysteines
instead of six.
Length = 49
Score = 29.6 bits (67), Expect = 0.64
Identities = 10/33 (30%), Positives = 14/33 (42%), Gaps = 2/33 (6%)
Query: 181 CYPGACGDGSCQDVDGAMKCLCPIGTAGKRCEQ 213
C P +C G CLC G G+ C++
Sbjct: 3 CNPHGSLSDTCDPETGQ--CLCKPGVTGRHCDR 33
>gnl|CDD|143205 cd05728, Ig4_Contactin-2-like, Fourth Ig domain of the neural
cell adhesion molecule contactin-2 and similar
proteins. Ig4_Contactin-2-like: fourth Ig domain of
the neural cell adhesion molecule contactin-2.
Contactins are comprised of six Ig domains followed by
four fibronectin type III (FnIII) domains anchored to
the membrane by glycosylphosphatidylinositol.
Contactin-2 (aliases TAG-1, axonin-1) facilitates cell
adhesion by homophilic binding between molecules in
apposed membranes. The first four Ig domains form the
intermolecular binding fragment which arranges as a
compact U-shaped module by contacts between Ig domains
1 and 4, and domains 2 and 3. It has been proposed that
a linear zipper-like array forms, from contactin-2
molecules alternatively provided by the two apposed
membranes.
Length = 85
Score = 30.3 bits (68), Expect = 0.79
Identities = 15/39 (38%), Positives = 19/39 (48%), Gaps = 5/39 (12%)
Query: 4 IKWSRADGLPLQR----YAEGNVLRITNARLQDSGKYKC 38
+W + +G PL E LRIT L DSG Y+C
Sbjct: 31 YRWLK-NGQPLASENRIEVEAGDLRITKLSLSDSGMYQC 68
Score = 29.1 bits (65), Expect = 2.1
Identities = 17/49 (34%), Positives = 22/49 (44%), Gaps = 4/49 (8%)
Query: 326 YTWSKRSNGHVLPF-GAFSREN-TLTLQEIKNSDAGMYVCKVSNKDMTV 372
Y W K NG L E L + ++ SD+GMY C NK T+
Sbjct: 31 YRWLK--NGQPLASENRIEVEAGDLRITKLSLSDSGMYQCVAENKHGTI 77
>gnl|CDD|205157 pfam12947, EGF_3, EGF domain. This family includes a variety of
EGF-like domain homologues. This family includes the
C-terminal domain of the malaria parasite MSP1 protein.
Length = 36
Score = 28.7 bits (65), Expect = 0.99
Identities = 12/29 (41%), Positives = 15/29 (51%), Gaps = 4/29 (13%)
Query: 148 NNGLCQDAA----TRIGYTCICPPGFSGD 172
NNG C A T +TC C G++GD
Sbjct: 4 NNGGCHPNATCTNTGGSFTCTCKSGYTGD 32
Score = 27.9 bits (63), Expect = 2.0
Identities = 9/23 (39%), Positives = 13/23 (56%), Gaps = 3/23 (13%)
Query: 581 ICYPTDTSERGYNCSCLTGYSGD 603
C T S + C+C +GY+GD
Sbjct: 13 TCTNTGGS---FTCTCKSGYTGD 32
>gnl|CDD|143201 cd05724, Ig2_Robo, Second immunoglobulin (Ig)-like domain in Robo
(roundabout) receptors. Ig2_Robo: domain similar to
the second immunoglobulin (Ig)-like domain in Robo
(roundabout) receptors. Robo receptors play a role in
the development of the central nervous system (CNS),
and are receptors of Slit protein. Slit is a repellant
secreted by the neural cells in the midline. Slit acts
through Robo to prevent most neurons from crossing the
midline from either side. Three mammalian Robo homologs
(robo1, -2, and -3), and three mammalian Slit homologs
(Slit-1,-2, -3), have been identified. Commissural
axons, which cross the midline, express low levels of
Robo; longitudinal axons, which avoid the midline,
express high levels of Robo. robo1, -2, and -3 are
expressed by commissural neurons in the vertebrate
spinal cord and Slits 1, -2, -3 are expressed at the
ventral midline. Robo-3 is a divergent member of the
Robo family which instead of being a positive regulator
of slit responsiveness, antagonizes slit responsiveness
in precrossing axons. The Slit-Robo interaction is
mediated by the second leucine-rich repeat (LRR) domain
of Slit and the two N-terminal Ig domains of Robo, Ig1
and Ig2. The primary Robo binding site for Slit2 has
been shown by surface plasmon resonance experiments and
mutational analysis to be is the Ig1 domain, while the
Ig2 domain has been proposed to harbor a weak secondary
binding site.
Length = 86
Score = 29.7 bits (67), Expect = 1.2
Identities = 20/59 (33%), Positives = 24/59 (40%), Gaps = 7/59 (11%)
Query: 4 IKWSRADGLPLQ------RYAEGNVLRITNARLQDSGKYKCEIQGHDSFRGSDYVKLNV 56
+ W R DG PL R + L I AR D G YKC R S +L+V
Sbjct: 29 VSW-RKDGQPLNLDNERVRIVDDGNLLIAEARKSDEGTYKCVATNMVGERESAAARLSV 86
>gnl|CDD|143206 cd05729, Ig2_FGFR_like, Second immunoglobulin (Ig)-like domain of
fibroblast growth factor (FGF) receptor and similar
proteins. Ig2_FGFR_like: domain similar to the second
immunoglobulin (Ig)-like domain of fibroblast growth
factor (FGF) receptor. FGF receptors bind FGF signaling
polypeptides. FGFs participate in multiple processes
such as morphogenesis, development, and angiogenesis.
FGFs bind to four FGF receptor tyrosine kinases (FGFR1,
-2, -3, -4). Receptor diversity is controlled by
alternative splicing producing splice variants with
different ligand binding characteristics and different
expression patterns. FGFRs have an extracellular region
comprised of three Ig-like domains, a single
transmembrane helix, and an intracellular tyrosine
kinase domain. Ligand binding and specificity reside in
the Ig-like domains 2 and 3, and the linker region that
connects these two. FGFR activation and signaling depend
on FGF-induced dimerization, a process involving cell
surface heparin or heparin sulfate proteoglycans. This
group also contains fibroblast growth factor (FGF)
receptor_like-1(FGFRL1). FGFRL1 does not have a protein
tyrosine kinase domain at its C terminus; neither does
its cytoplasmic domain appear to interact with a
signaling partner. It has been suggested that FGFRL1 may
not have any direct signaling function, but instead acts
as a decoy receptor trapping FGFs and preventing them
from binding other receptors.
Length = 85
Score = 29.7 bits (67), Expect = 1.4
Identities = 23/71 (32%), Positives = 28/71 (39%), Gaps = 13/71 (18%)
Query: 307 PYGSKITLTCNNDLEAPVEYTWSKRSNGHVLPFGAFSREN---------TLTLQEIKNSD 357
P GS + L C TW K +G PF R TL L+ + SD
Sbjct: 7 PAGSTVRLKCPASGNPRPTITWLK--DGK--PFKKEHRIGGYKVRKKKWTLILESVVPSD 62
Query: 358 AGMYVCKVSNK 368
+G Y C V NK
Sbjct: 63 SGKYTCIVENK 73
>gnl|CDD|143303 cd05895, Ig_Pro_neuregulin-1, Immunoglobulin (Ig)-like domain
found in neuregulin (NRG)-1. Ig_Pro_neuregulin-1:
immunoglobulin (Ig)-like domain found in neuregulin
(NRG)-1. There are many NRG-1 isoforms which arise from
the alternative splicing of mRNA. NRG-1 belongs to the
neuregulin gene family, which is comprised of four
genes. This group represents NRG-1. NRGs are signaling
molecules, which participate in cell-cell interactions
in the nervous system, breast, and heart, and other
organ systems, and are implicated in the pathology of
diseases including schizophrenia, multiple sclerosis,
and breast cancer. The NRG-1 protein binds to and
activates the tyrosine kinases receptors ErbB3 and
ErbB4, initiating signaling cascades. NRG-1 has
multiple functions; for example, in the brain it
regulates various processes such as radial glia
formation and neuronal migration, dendritic
development, and expression of neurotransmitters
receptors; in the peripheral nervous system NRG-1
regulates processes such as target cell
differentiation, and Schwann cell survival.
Length = 76
Score = 29.2 bits (65), Expect = 1.5
Identities = 15/34 (44%), Positives = 21/34 (61%), Gaps = 3/34 (8%)
Query: 23 LRITNARLQDSGKYKCEIQGHDSFRGSDYVKLNV 56
L+I+ A L D+G+YKC + S G+D V NV
Sbjct: 46 LQISKASLADNGEYKCMVS---SKLGNDSVTANV 76
>gnl|CDD|239320 cd03022, DsbA_HCCA_Iso, DsbA family,
2-hydroxychromene-2-carboxylate (HCCA) isomerase
subfamily; HCCA isomerase is a glutathione (GSH)
dependent enzyme involved in the naphthalene catabolic
pathway. It converts HCCA, a hemiketal formed
spontaneously after ring cleavage of
1,2-dihydroxynapthalene by a dioxygenase, into
cis-o-hydroxybenzylidenepyruvate (cHBPA). This is the
fourth reaction in a six-step pathway that converts
napthalene into salicylate. HCCA isomerase is unique to
bacteria that degrade polycyclic aromatic compounds. It
is closely related to the eukaryotic protein, GSH
transferase kappa (GSTK).
Length = 192
Score = 31.1 bits (71), Expect = 1.5
Identities = 12/51 (23%), Positives = 17/51 (33%), Gaps = 1/51 (1%)
Query: 391 PLSYLALPTLTDAHLHFSIELSFKPTDYNGLIMYTG-DSNMKSYKGKGDFV 440
P SYLA L + ++P G+ TG KG +
Sbjct: 10 PYSYLAHERLPALAARHGATVRYRPILLGGVFKATGNVPPANRPPAKGRYR 60
>gnl|CDD|214554 smart00200, SEA, Domain found in sea urchin sperm protein,
enterokinase, agrin. Proposed function of regulating or
binding carbohydrate sidechains.
Length = 121
Score = 30.1 bits (68), Expect = 1.6
Identities = 14/66 (21%), Positives = 23/66 (34%), Gaps = 7/66 (10%)
Query: 714 NGYLEYSYDLGDGVVTIKFSKKPVNDGIKHSVNVTRINKF--GSLEVDSVIVGKGESPGS 771
Y E D+ + I + K + T + +F GS+ VD ++
Sbjct: 31 EEYQELVRDVEKLLEQI-YGKTDLKPDFV----GTEVIEFRNGSVVVDLGLLFNEGVTNG 85
Query: 772 QDVINT 777
QDV
Sbjct: 86 QDVEED 91
>gnl|CDD|165173 PHA02826, PHA02826, IL-1 receptor-like protein; Provisional.
Length = 227
Score = 31.0 bits (70), Expect = 1.9
Identities = 30/120 (25%), Positives = 48/120 (40%), Gaps = 25/120 (20%)
Query: 281 RFDTGSATPLYSNDA-PAFNPVS---TKEAPYGSKITLTCNNDLEAPVEYTWSK-RSNGH 335
++ G TP+Y+ P + K++ + K +T N TWSK S
Sbjct: 23 KYRGGDLTPVYAKFGDPMVLLCTGKHYKKSIFFDKTFITSYN-------VTWSKTDSLAF 75
Query: 336 VLPFGAF------------SRENTLTLQEIKNSDAGMYVCKVSNKDMTVEIPSILLVTDS 383
V GA R L + + N D G+Y+C +S+ ++ E +I L DS
Sbjct: 76 VRDSGARTKIKKITHNEIGDRSENLWIGNVINIDEGIYICTISSGNICEES-TIRLTFDS 134
>gnl|CDD|219496 pfam07645, EGF_CA, Calcium-binding EGF domain.
Length = 42
Score = 28.1 bits (63), Expect = 1.9
Identities = 13/39 (33%), Positives = 16/39 (41%), Gaps = 6/39 (15%)
Query: 603 DHCEKENNMCMKGDVCKNGGMCKVTPDSYECLCSLGYAP 641
D C + C VC N T S+EC+C GY
Sbjct: 3 DECADGTHNCPANTVCVN------TIGSFECVCPDGYEN 35
Score = 26.9 bits (60), Expect = 5.2
Identities = 12/38 (31%), Positives = 18/38 (47%), Gaps = 6/38 (15%)
Query: 132 VDSCNTCKSSKHNNCINNGLCQDAATRIGYTCICPPGF 169
VD C +NC N +C + T + C+CP G+
Sbjct: 2 VDECADGT----HNCPANTVCVN--TIGSFECVCPDGY 33
>gnl|CDD|143230 cd05753, Ig2_FcgammaR_like, Second immunoglobulin (Ig)-like
domain of Fcgamma-receptors (FcgammaRs) and similar
proteins. Ig2_FcgammaR_like: domain similar to the
second immunoglobulin (Ig)-like domain of
Fcgamma-receptors (FcgammaRs). Interactions between IgG
and FcgammaR are important to the initiation of
cellular and humoral response. IgG binding to FcgammaR
leads to a cascade of signals and ultimately to
functions such as
antibody-dependent-cellular-cytotoxicity (ADCC),
endocytosis, phagocytosis, release of inflammatory
mediators, etc. FcgammaR has two Ig-like domains. This
group also contains FcepsilonRI, which binds IgE with
high affinity.
Length = 83
Score = 28.8 bits (65), Expect = 2.1
Identities = 14/47 (29%), Positives = 18/47 (38%)
Query: 10 DGLPLQRYAEGNVLRITNARLQDSGKYKCEIQGHDSFRGSDYVKLNV 56
DG + + L I A L DSG Y C S+ V + V
Sbjct: 37 DGKAKKYSHSNSNLSIPQATLSDSGSYHCSGIIGSYDYSSEPVSITV 83
>gnl|CDD|234740 PRK00377, cbiT, cobalt-precorrin-6Y C(15)-methyltransferase;
Provisional.
Length = 198
Score = 30.5 bits (69), Expect = 2.7
Identities = 17/47 (36%), Positives = 31/47 (65%), Gaps = 5/47 (10%)
Query: 742 KHSVNVTRIN--KFGSLEVDSVIVGKGESPGSQDVINTRGN-IYLGG 785
+ ++N+TR N KFG L +++++ KGE+P IN + + I++GG
Sbjct: 75 EKAINLTRRNAEKFGVL--NNIVLIKGEAPEILFTINEKFDRIFIGG 119
>gnl|CDD|224003 COG1077, MreB, Actin-like ATPase involved in cell morphogenesis
[Cell division and chromosome partitioning].
Length = 342
Score = 31.0 bits (71), Expect = 2.7
Identities = 14/36 (38%), Positives = 20/36 (55%), Gaps = 2/36 (5%)
Query: 466 LVPHEWVVVTIIKDFKEGK-LSVGGEPL-IVGSTPG 499
+V +E VV I + K L+VG E ++G TPG
Sbjct: 27 IVLNEPSVVAIESEGKTKVVLAVGEEAKQMLGRTPG 62
>gnl|CDD|219015 pfam06415, iPGM_N, BPG-independent PGAM N-terminus (iPGM_N). This
family represents the N-terminal region of the
2,3-bisphosphoglycerate-independent phosphoglycerate
mutase (or phosphoglyceromutase or BPG-independent PGAM)
protein (EC:5.4.2.1). The family is found in conjunction
with pfam01676 (located in the C-terminal region of the
protein).
Length = 223
Score = 30.7 bits (70), Expect = 2.9
Identities = 12/37 (32%), Positives = 16/37 (43%)
Query: 696 QPSYKNGIGREFIAVAVVNGYLEYSYDLGDGVVTIKF 732
Q SY GI EF+ V+ + GD V+ F
Sbjct: 141 QASYAEGITDEFVKPTVITDKPVGTIKDGDAVIFFNF 177
>gnl|CDD|143170 cd04969, Ig5_Contactin_like, Fifth Ig domain of contactin.
Ig5_Contactin_like: Fifth Ig domain of contactins.
Contactins are neural cell adhesion molecules and are
comprised of six Ig domains followed by four fibronectin
type III(FnIII) domains anchored to the membrane by
glycosylphosphatidylinositol. The first four Ig domains
form the intermolecular binding fragment, which arranges
as a compact U-shaped module via contacts between Ig
domains 1 and 4, and between Ig domains 2 and 3.
Contactin-2 (TAG-1, axonin-1) may play a part in the
neuronal processes of neurite outgrowth, axon guidance
and fasciculation, and neuronal migration. This group
also includes contactin-1 and contactin-5. The different
contactins show different expression patterns in the
central nervous system. During development and in
adulthood, contactin-2 is transiently expressed in
subsets of central and peripheral neurons. Contactin-5
is expressed specifically in the rat postnatal nervous
system, peaking at about 3 weeks postnatal, and a lack
of contactin-5 (NB-2) results in an impairment of
neuronal act ivity in the rat auditory system.
Contactin-5 is highly expressed in the adult human brain
in the occipital lobe and in the amygdala. Contactin-1
is differentially expressed in tumor tissues and may,
through a RhoA mechanism, facilitate invasion and
metastasis of human lung adenocarcinoma.
Length = 73
Score = 27.8 bits (62), Expect = 4.8
Identities = 12/59 (20%), Positives = 20/59 (33%), Gaps = 9/59 (15%)
Query: 327 TWSK-----RSNGHVLPFGAFSRENTLTLQEIKNSDAGMYVCKVSNKDMTVEIPSILLV 380
+WSK ++ + + +L + + SD G Y C N L V
Sbjct: 19 SWSKGTELLTNSSRIC----IWPDGSLEILNVTKSDEGKYTCFAENFFGKANSTGSLSV 73
>gnl|CDD|143286 cd05878, Ig_Aggrecan_like, Immunoglobulin (Ig)-like domain of the
aggrecan-like chondroitin sulfate proteoglycan core
protein (CSPG). Ig_Aggrecan_like: immunoglobulin
(Ig)-like domain of the aggrecan-like chondroitin
sulfate proteoglycan core protein (CSPG)s. Included in
this group are the Ig domains of other CSPGs: versican,
and neurocan. In CSPGs this Ig-like domain is followed
by hyaluronan (HA)-binding tandem repeats, and a
C-terminal region with epidermal growth factor-like,
lectin-like, and complement regulatory protein-like
domains. Separating these N- and C-terminal regions is a
nonhomologous glycosaminoglycan attachment region. In
cartilage, aggrecan forms cartilage link protein
stabilized aggregates with hyaluronan (HA). These
aggregates contribute to the tissue's load bearing
properties. Aggrecan and versican have a wide
distribution in connective tissue and extracellular
matrices. Neurocan is localized almost exclusively in
nervous tissue. Aggregates having other CSPGs
substituting for aggrecan may contribute to the
structural integrity of many different tissues. Members
of the vertebrate HPLN (hyaluronan/HA and proteoglycan
binding link) protein family are physically linked
adjacent to CSPG genes.
Length = 110
Score = 28.5 bits (64), Expect = 5.3
Identities = 7/19 (36%), Positives = 15/19 (78%)
Query: 347 TLTLQEIKNSDAGMYVCKV 365
+L + +++SD+G+Y C+V
Sbjct: 73 SLEISRLRSSDSGVYRCEV 91
>gnl|CDD|143220 cd05743, Ig_Perlecan_D2_like, Immunoglobulin (Ig)-like domain II
(D2) of the human basement membrane heparan sulfate
proteoglycan perlecan, also known as HSPG2.
Ig_Perlecan_D2_like: the immunoglobulin (Ig)-like domain
II (D2) of the human basement membrane heparan sulfate
proteoglycan perlecan, also known as HSPG2. Perlecan
consists of five domains. Domain I has three putative
heparan sulfate attachment sites; domain II has four LDL
receptor-like repeats, and one Ig-like repeat; domain
III resembles the short arm of laminin chains; domain IV
has multiple Ig-like repeats (21 repeats in human
perlecan); and domain V resembles the globular G domain
of the laminin A chain and internal repeats of EGF.
Perlecan may participate in a variety of biological
functions including cell binding, LDL-metabolism,
basement membrane assembly and selective permeability,
calcium binding, and growth- and neurite-promoting
activities.
Length = 78
Score = 27.5 bits (61), Expect = 6.7
Identities = 16/36 (44%), Positives = 22/36 (61%), Gaps = 2/36 (5%)
Query: 347 TLTLQEIKNSDAGMYVCKVSN-KDMTVEIP-SILLV 380
TLT++++K SD G Y C+ N + M IP IL V
Sbjct: 43 TLTIRDVKESDQGAYTCEAINTRGMVFGIPDGILTV 78
>gnl|CDD|165539 PHA03282, PHA03282, envelope glycoprotein E; Provisional.
Length = 540
Score = 29.9 bits (67), Expect = 7.2
Identities = 15/43 (34%), Positives = 21/43 (48%), Gaps = 1/43 (2%)
Query: 344 RENTLTLQEIKNSDAGMYVCKVSN-KDMTVEIPSILLVTDSVP 385
TL L+E + +D+GMYV VS + T + LV P
Sbjct: 122 VNGTLVLREARETDSGMYVLSVSRAPNSTAARAVVFLVVGPRP 164
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.318 0.138 0.422
Gapped
Lambda K H
0.267 0.0637 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 41,839,909
Number of extensions: 4083298
Number of successful extensions: 3019
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2970
Number of HSP's successfully gapped: 123
Length of query: 834
Length of database: 10,937,602
Length adjustment: 105
Effective length of query: 729
Effective length of database: 6,280,432
Effective search space: 4578434928
Effective search space used: 4578434928
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 63 (28.2 bits)