RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy1807
(618 letters)
>gnl|CDD|222943 PHA02927, PHA02927, secreted complement-binding protein;
Provisional.
Length = 263
Score = 98.2 bits (244), Expect = 8e-23
Identities = 61/187 (32%), Positives = 89/187 (47%), Gaps = 14/187 (7%)
Query: 332 QCLFDWCAEPPQISGGIVTTSGRRTGSVATYSCEPGFILFGSNELKCGLGGD----WSGK 387
QC+ C P I G + G GS TYSC G+ L G ++ C LG W+ +
Sbjct: 80 QCIKRRCPSPRDIDNGQLDIGGVDFGSSITYSCNSGYQLIGESKSYCELGSTGSMVWNPE 139
Query: 388 APTCKYVDCGSPANIDNGHYSLLNGTTTHGSVAEYTCDDDYWLEAGAMRRQVCMRDGKWS 447
AP C+ V C SP +I NG ++ T GSV Y+C+ Y L + V G+WS
Sbjct: 140 APICESVKCQSPPSISNGRHNGYEDFYTDGSVVTYSCNSGYSLIGNS---GVLCSGGEWS 196
Query: 448 GDTPSCELITCPEPEVPKG----GYVVGYDLNVHSTLEFHCEPGHLLVGQASLKCTRQGD 503
D P+C+++ CP P + G G+ Y N ++F C+ G+ L G +S C+
Sbjct: 197 -DPPTCQIVKCPHPTISNGYLSSGFKRSYSYN--DNVDFKCKYGYKLSGSSSSTCSPGNT 253
Query: 504 WDNMSPQ 510
W P+
Sbjct: 254 WQPELPK 260
Score = 78.9 bits (194), Expect = 3e-16
Identities = 54/180 (30%), Positives = 83/180 (46%), Gaps = 12/180 (6%)
Query: 220 CGSPDRHLNTTFVGTVNTKLGSTISYACPEGNMLVGSATRTCKEGF-----WTGVAPTCQ 274
C SP R ++ + GS+I+Y+C G L+G + C+ G W AP C+
Sbjct: 86 CPSP-RDIDNGQLDIGGVDFGSSITYSCNSGYQLIGESKSYCELGSTGSMVWNPEAPICE 144
Query: 275 YVDCGKLEHIEHGTVTLETTRTTHGAVAIYACHENYTLIGETRRVCGDGGKWNGTEPQCL 334
V C I +G T G+V Y+C+ Y+LIG + +C GG+W+ P C
Sbjct: 145 SVKCQSPPSISNGRHNGYEDFYTDGSVVTYSCNSGYSLIGNSGVLC-SGGEWS-DPPTCQ 202
Query: 335 FDWCAEPPQISGGIVTTSGRRTGSV---ATYSCEPGFILFGSNELKCGLGGDWSGKAPTC 391
C P IS G +++ +R+ S + C+ G+ L GS+ C G W + P C
Sbjct: 203 IVKCPH-PTISNGYLSSGFKRSYSYNDNVDFKCKYGYKLSGSSSSTCSPGNTWQPELPKC 261
Score = 47.3 bits (112), Expect = 9e-06
Identities = 28/96 (29%), Positives = 45/96 (46%), Gaps = 3/96 (3%)
Query: 240 GSTISYACPEGNMLVGSATRTCKEGFWTGVAPTCQYVDCGKLEHIEHGTVTLETTRT-TH 298
GS ++Y+C G L+G++ C G W+ PTCQ V C I +G ++ R+ ++
Sbjct: 169 GSVVTYSCNSGYSLIGNSGVLCSGGEWSD-PPTCQIVKCPH-PTISNGYLSSGFKRSYSY 226
Query: 299 GAVAIYACHENYTLIGETRRVCGDGGKWNGTEPQCL 334
+ C Y L G + C G W P+C+
Sbjct: 227 NDNVDFKCKYGYKLSGSSSSTCSPGNTWQPELPKCV 262
>gnl|CDD|153060 cd03590, CLECT_DC-SIGN_like, C-type lectin-like domain (CTLD) of
the type found in human dendritic cell (DC)-specific
intercellular adhesion molecule 3-grabbing non-integrin
(DC-SIGN) and the related receptor, DC-SIGN receptor
(DC-SIGNR). CLECT_DC-SIGN_like: C-type lectin-like
domain (CTLD) of the type found in human dendritic cell
(DC)-specific intercellular adhesion molecule 3-grabbing
non-integrin (DC-SIGN) and the related receptor, DC-SIGN
receptor (DC-SIGNR). This group also contains proteins
similar to hepatic asialoglycoprotein receptor (ASGP-R)
and langerin in human. These proteins are type II
membrane proteins with a CTLD ectodomain. CTLD refers
to a domain homologous to the carbohydrate-recognition
domains (CRDs) of the C-type lectins. DC-SIGN is
thought to mediate the initial contact between dendritic
cells and resting T cells, and may also mediate the
rolling of DCs on epithelium. DC-SIGN and DC-SIGNR bind
to oligosaccharides present on human tissues, as well
as, on pathogens including parasites, bacteria, and
viruses. DC-SIGN and DC-SIGNR bind to HIV enhancing
viral infection of T cells. DC-SIGN and DC-SIGNR are
homotetrameric, and contain four CTLDs stabilized by a
coiled coil of alpha helices. The hepatic ASGP-R is an
endocytic recycling receptor which binds and
internalizes desialylated glycoproteins having a
terminal galactose or N-acetylgalactosamine residues on
their N-linked carbohydrate chains, via the
clathrin-coated pit mediated endocytic pathway, and
delivers them to lysosomes for degradation. It has been
proposed that glycoproteins bearing terminal Sia (sialic
acid) alpha2, 6GalNAc and Sia alpha2, 6Gal are
endogenous ligands for ASGP-R and that ASGP-R
participates in regulating the relative concentration of
serum glycoproteins bearing alpha 2,6-linked Sia. The
human ASGP-R is a hetero-oligomer composed of two
subunits, both of which are found within this group.
Langerin is expressed in a subset of dendritic
leukocytes, the Langerhans cells (LC). Langerin induces
the formation of Birbeck Granules (BGs) and associates
with these BGs following internalization. Langerin
binds, in a calcium-dependent manner, to
glyco-conjugates containing mannose and related sugars
mediating their uptake and degradation. Langerin
molecules oligomerize as trimers with three CTLDs held
together by a coiled-coil of alpha helices.
Length = 126
Score = 80.4 bits (199), Expect = 4e-18
Identities = 45/136 (33%), Positives = 62/136 (45%), Gaps = 24/136 (17%)
Query: 87 TSFDRNCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNT-----FLLGELERRKDRL 141
SF +CY F+ +K S+ ++R C+ G LV I N+ F+ L +
Sbjct: 6 KSFQSSCYFFSTEK-KSWEESRQFCEDMGAHLV-----IINSQEEQEFISKILSGNRS-- 57
Query: 142 KTLLVWIGAQKDPGITARTWKWVDG--EVVTKPSWGKDQPNNYNG-EQNCVVLDGGRGWL 198
WIG D T WKWVDG +K W +PNN+ G ++C L
Sbjct: 58 ----YWIGL-SDEE-TEGEWKWVDGTPLNSSKTFWHPGEPNNWGGGGEDCAELV-YDSGG 110
Query: 199 WNDVGCKLDYLHWICQ 214
WNDV C L+Y WIC+
Sbjct: 111 WNDVPCNLEY-RWICE 125
>gnl|CDD|153057 cd00037, CLECT, C-type lectin (CTL)/C-type lectin-like (CTLD)
domain. CLECT: C-type lectin (CTL)/C-type lectin-like
(CTLD) domain; protein domains homologous to the
carbohydrate-recognition domains (CRDs) of the C-type
lectins. This group is chiefly comprised of eukaryotic
CTLDs, but contains some, as yet functionally
uncharacterized, bacterial CTLDs. Many CTLDs are
calcium-dependent carbohydrate binding modules; other
CTLDs bind protein ligands, lipids, and inorganic
surfaces, including CaCO3 and ice. Animal C-type
lectins are involved in such functions as extracellular
matrix organization, endocytosis, complement activation,
pathogen recognition, and cell-cell interactions. For
example: mannose-binding lectin and lung surfactant
proteins A and D bind carbohydrates on surfaces (e.g.
pathogens, allergens, necrotic, and apoptotic cells) and
mediate functions associated with killing and
phagocytosis; P (platlet)-, E (endothelial)-, and L
(leukocyte)- selectins (sels) mediate the initial
attachment, tethering, and rolling of lymphocytes on
inflamed vascular walls enabling subsequent lymphocyte
adhesion and transmigration. CTLDs may bind a variety
of carbohydrate ligands including mannose,
N-acetylglucosamine, galactose, N-acetylgalactosamine,
and fucose. Several CTLDs bind to protein ligands, and
only some of these binding interactions are
Ca2+-dependent; including the CTLDs of Coagulation
Factors IX/X (IX/X) and Von Willebrand Factor (VWF)
binding proteins, and natural killer cell receptors.
C-type lectins, such as lithostathine, and some type II
antifreeze glycoproteins function in a Ca2+-independent
manner to bind inorganic surfaces. Many proteins in
this group contain a single CTLD; these CTLDs associate
with each other through several different surfaces to
form dimers, trimers, or tetramers, from which
ligand-binding sites project in different orientations.
Various vertebrate type 1 transmembrane proteins
including macrophage mannose receptor, endo180,
phospholipase A2 receptor, and dendritic and epithelial
cell receptor (DEC205) have extracellular domains
containing 8 or more CTLDs; these CTLDs remain in the
parent model. In some members (IX/X and VWF binding
proteins), a loop extends to the adjoining domain to
form a loop-swapped dimer. A similar conformation is
seen in the macrophage mannose receptor CRD4's putative
non-sugar bound form of the domain in the acid
environment of the endosome. Lineage specific
expansions of CTLDs have occurred in several animal
lineages including Drosophila melanogaster and
Caenorhabditis elegans; these CTLDs also remain in the
parent model.
Length = 116
Score = 79.2 bits (195), Expect = 9e-18
Identities = 38/125 (30%), Positives = 58/125 (46%), Gaps = 10/125 (8%)
Query: 92 NCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQ 151
+CY+F+ +K ++ +A+ C++ GG L I + L + + VWIG
Sbjct: 1 SCYKFSTEK-LTWEEAQEYCRSLGGHLA----SIHSEEENDFLASLLKKSSSSDVWIGLN 55
Query: 152 KDPGITARTWKWVDG-EVVTKPSWGKDQPNNYNGEQNCVVLDGGRGWLWNDVGCKLDYLH 210
+ TWKW DG +V +W +PN E +CVVL WNDV C L
Sbjct: 56 DLS--SEGTWKWSDGSPLVDYTNWAPGEPNPGGSE-DCVVLSSSSDGKWNDVSCS-SKLP 111
Query: 211 WICQH 215
+IC+
Sbjct: 112 FICEK 116
>gnl|CDD|214480 smart00034, CLECT, C-type lectin (CTL) or carbohydrate-recognition
domain (CRD). Many of these domains function as
calcium-dependent carbohydrate binding modules.
Length = 124
Score = 73.0 bits (179), Expect = 1e-15
Identities = 41/129 (31%), Positives = 62/129 (48%), Gaps = 11/129 (8%)
Query: 87 TSFDRNCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLV 146
S+ CY+F+ +K ++ DA++ CQ+ GG L N F+ L K+ +
Sbjct: 6 ISYGGKCYKFSTEK-KTWEDAQAFCQSLGGHLASIHSEAENDFVASLL---KNSGSSDYY 61
Query: 147 WIGAQKDPGITARTWKWVDGEVVTKPS-WGKDQPNNYNGEQNCVVLDGGRGWLWNDVGCK 205
WIG + +W+W DG S W +PNN +G +CVVL G WNDV C
Sbjct: 62 WIGLSDPD--SNGSWQWSDGSGPVSYSNWAPGEPNNSSG--DCVVLS-TSGGKWNDVSC- 115
Query: 206 LDYLHWICQ 214
L ++C+
Sbjct: 116 TSKLPFVCE 124
>gnl|CDD|214478 smart00032, CCP, Domain abundant in complement control proteins;
SUSHI repeat; short complement-like repeat (SCR). The
complement control protein (CCP) modules (also known as
short consensus repeats SCRs or SUSHI repeats) contain
approximately 60 amino acid residues and have been
identified in several proteins of the complement system.
A missense mutation in seventh CCP domain causes
deficiency of the b subunit of factor XIII.
Length = 56
Score = 65.6 bits (160), Expect = 9e-14
Identities = 20/56 (35%), Positives = 28/56 (50%)
Query: 278 CGKLEHIEHGTVTLETTRTTHGAVAIYACHENYTLIGETRRVCGDGGKWNGTEPQC 333
C IE+GTVT + ++G Y+C YTLIG + C + G W+ P C
Sbjct: 1 CPPPPDIENGTVTSSSGTYSYGDTVTYSCDPGYTLIGSSTITCLENGTWSPPPPTC 56
Score = 60.6 bits (147), Expect = 5e-12
Identities = 25/56 (44%), Positives = 30/56 (53%), Gaps = 2/56 (3%)
Query: 338 CAEPPQISGGIVTTSGR--RTGSVATYSCEPGFILFGSNELKCGLGGDWSGKAPTC 391
C PP I G VT+S G TYSC+PG+ L GS+ + C G WS PTC
Sbjct: 1 CPPPPDIENGTVTSSSGTYSYGDTVTYSCDPGYTLIGSSTITCLENGTWSPPPPTC 56
Score = 57.2 bits (138), Expect = 8e-11
Identities = 19/58 (32%), Positives = 30/58 (51%), Gaps = 2/58 (3%)
Query: 396 CGSPANIDNGHYSLLNGTTTHGSVAEYTCDDDYWLEAGAMRRQVCMRDGKWSGDTPSC 453
C P +I+NG + +GT ++G Y+CD Y L + C+ +G WS P+C
Sbjct: 1 CPPPPDIENGTVTSSSGTYSYGDTVTYSCDPGYTLIGSSTIT--CLENGTWSPPPPTC 56
Score = 47.9 bits (114), Expect = 2e-07
Identities = 22/56 (39%), Positives = 29/56 (51%), Gaps = 2/56 (3%)
Query: 220 CGSPDRHLNTTFVGTVNT-KLGSTISYACPEGNMLVGSATRTCKE-GFWTGVAPTC 273
C P N T + T G T++Y+C G L+GS+T TC E G W+ PTC
Sbjct: 1 CPPPPDIENGTVTSSSGTYSYGDTVTYSCDPGYTLIGSSTITCLENGTWSPPPPTC 56
Score = 47.9 bits (114), Expect = 2e-07
Identities = 15/55 (27%), Positives = 24/55 (43%), Gaps = 2/55 (3%)
Query: 458 CPEPEVPKGGYVVGYDLNVH--STLEFHCEPGHLLVGQASLKCTRQGDWDNMSPQ 510
CP P + G V T+ + C+PG+ L+G +++ C G W P
Sbjct: 1 CPPPPDIENGTVTSSSGTYSYGDTVTYSCDPGYTLIGSSTITCLENGTWSPPPPT 55
>gnl|CDD|153056 cd00033, CCP, Complement control protein (CCP) modules (aka short
consensus repeats SCRs or SUSHI repeats) have been
identified in several proteins of the complement system.
SUSHI repeats (short complement-like repeat, SCR) are
abundant in complement control proteins. The complement
control protein (CCP) modules (also known as short
consensus repeats SCRs or SUSHI repeats) contain
approximately 60 amino acid residues and have been
identified in several proteins of the complement system.
Typically, 2 to 4 modules contribute to a binding site,
implying that the orientation of the modules to each
other is critical for function.
Length = 57
Score = 65.2 bits (159), Expect = 1e-13
Identities = 19/56 (33%), Positives = 29/56 (51%)
Query: 278 CGKLEHIEHGTVTLETTRTTHGAVAIYACHENYTLIGETRRVCGDGGKWNGTEPQC 333
C E+GTVT ++G+ Y+C+E YTL+G + C + G W+ P C
Sbjct: 1 CPPPPVPENGTVTGSKGSYSYGSTVTYSCNEGYTLVGSSTITCTENGGWSPPPPTC 56
Score = 61.7 bits (150), Expect = 3e-12
Identities = 24/57 (42%), Positives = 28/57 (49%), Gaps = 2/57 (3%)
Query: 338 CAEPPQISGGIVTTSGRRT--GSVATYSCEPGFILFGSNELKCGLGGDWSGKAPTCK 392
C PP G VT S GS TYSC G+ L GS+ + C G WS PTC+
Sbjct: 1 CPPPPVPENGTVTGSKGSYSYGSTVTYSCNEGYTLVGSSTITCTENGGWSPPPPTCE 57
Score = 54.8 bits (132), Expect = 6e-10
Identities = 19/59 (32%), Positives = 30/59 (50%), Gaps = 2/59 (3%)
Query: 396 CGSPANIDNGHYSLLNGTTTHGSVAEYTCDDDYWLEAGAMRRQVCMRDGKWSGDTPSCE 454
C P +NG + G+ ++GS Y+C++ Y L G+ C +G WS P+CE
Sbjct: 1 CPPPPVPENGTVTGSKGSYSYGSTVTYSCNEGYTLV-GSSTIT-CTENGGWSPPPPTCE 57
Score = 51.7 bits (124), Expect = 7e-09
Identities = 20/55 (36%), Positives = 28/55 (50%), Gaps = 2/55 (3%)
Query: 458 CPEPEVPKGGYVVGY--DLNVHSTLEFHCEPGHLLVGQASLKCTRQGDWDNMSPQ 510
CP P VP+ G V G + ST+ + C G+ LVG +++ CT G W P
Sbjct: 1 CPPPPVPENGTVTGSKGSYSYGSTVTYSCNEGYTLVGSSTITCTENGGWSPPPPT 55
Score = 49.8 bits (119), Expect = 4e-08
Identities = 25/57 (43%), Positives = 33/57 (57%), Gaps = 2/57 (3%)
Query: 220 CGSPDRHLNTTFVGTVNT-KLGSTISYACPEGNMLVGSATRTCKE-GFWTGVAPTCQ 274
C P N T G+ + GST++Y+C EG LVGS+T TC E G W+ PTC+
Sbjct: 1 CPPPPVPENGTVTGSKGSYSYGSTVTYSCNEGYTLVGSSTITCTENGGWSPPPPTCE 57
>gnl|CDD|215684 pfam00059, Lectin_C, Lectin C-type domain. This family includes
both long and short form C-type.
Length = 108
Score = 66.2 bits (162), Expect = 2e-13
Identities = 33/115 (28%), Positives = 48/115 (41%), Gaps = 12/115 (10%)
Query: 103 SFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLL--VWIGAQKDPGITART 160
++A+A + CQ GG LV I + E + +K WIG T T
Sbjct: 4 TWAEAEAACQKLGGHLV----SIQS---AEEQDFLTSLVKASNTYAWIGLTDIN--TEGT 54
Query: 161 WKWVDGEVVTKPSWGKDQPNNYNGEQNCVVLDGGRGWLWNDVGCKLDYLHWICQH 215
W W DG V +W +PNN G ++CV + WND C ++C+
Sbjct: 55 WVWTDGSPVNYTNWAPGEPNNKGGNEDCVEIYTDTNGKWNDEPC-GSKRLYVCEF 108
>gnl|CDD|165022 PHA02639, PHA02639, EEV host range protein; Provisional.
Length = 295
Score = 68.5 bits (167), Expect = 1e-12
Identities = 47/179 (26%), Positives = 74/179 (41%), Gaps = 12/179 (6%)
Query: 337 WCAEPPQISGGIVTTSGRR--TGSVATYSCEPGFILFGSNELKCGLGGD---WSGKAPTC 391
+C +P IS G +T + G + Y+C + L G C + WS KAP C
Sbjct: 21 YCDKPDDISNGFITELMEKYEIGKLIEYTCNTDYALIGDRFRTCIKDKNNAIWSNKAPFC 80
Query: 392 KYVDCGSPANIDNGHYSLLNGTTTHGSVAEYTCDDDYWLEAGAM--RRQVCMRDGKWSGD 449
+C P +I NG G Y C++ ++ + + C++D W D
Sbjct: 81 MLKECNDPPSIINGKIYNKREMYKVGDEIYYVCNEHKGVQYSLVGNEKITCIQDKSWKPD 140
Query: 450 TPSCELITCPEPEVPKGGYVVGYDLN----VHSTLEFHCEPGHLLVGQASLKCTRQGDW 504
P C++I C P + + GY+ G N + + F C+ G LVG+ C W
Sbjct: 141 PPICKMINCRFPAL-QNGYINGIPSNKKFYYKTRVGFSCKSGFDLVGEKYSTCNINATW 198
Score = 65.8 bits (160), Expect = 8e-12
Identities = 48/185 (25%), Positives = 70/185 (37%), Gaps = 14/185 (7%)
Query: 220 CGSPDRHLNTTFVGTVNTK--LGSTISYACPEGNMLVGSATRTC----KEGFWTGVAPTC 273
C PD ++ F+ + K +G I Y C L+G RTC W+ AP C
Sbjct: 22 CDKPD-DISNGFITELMEKYEIGKLIEYTCNTDYALIGDRFRTCIKDKNNAIWSNKAPFC 80
Query: 274 QYVDCGKLEHIEHGTVTLETTRTTHGAVAIYACHEN----YTLIGETRRVCGDGGKWNGT 329
+C I +G + + G Y C+E+ Y+L+G + C W
Sbjct: 81 MLKECNDPPSIINGKIYNKREMYKVGDEIYYVCNEHKGVQYSLVGNEKITCIQDKSWKPD 140
Query: 330 EPQCLFDWCAEPPQISG---GIVTTSGRRTGSVATYSCEPGFILFGSNELKCGLGGDWSG 386
P C C P +G GI + + +SC+ GF L G C + W
Sbjct: 141 PPICKMINCRFPALQNGYINGIPSNKKFYYKTRVGFSCKSGFDLVGEKYSTCNINATWFP 200
Query: 387 KAPTC 391
PTC
Sbjct: 201 SIPTC 205
Score = 63.9 bits (155), Expect = 4e-11
Identities = 44/187 (23%), Positives = 76/187 (40%), Gaps = 14/187 (7%)
Query: 278 CGKLEHIEHGTVTLETTRTTHGAVAIYACHENYTLIGETRRVC---GDGGKWNGTEPQCL 334
C K + I +G +T + G + Y C+ +Y LIG+ R C + W+ P C+
Sbjct: 22 CDKPDDISNGFITELMEKYEIGKLIEYTCNTDYALIGDRFRTCIKDKNNAIWSNKAPFCM 81
Query: 335 FDWCAEPPQISGGIVTTSGR--RTGSVATYSCEP----GFILFGSNELKCGLGGDWSGKA 388
C +PP I G + + G Y C + L G+ ++ C W
Sbjct: 82 LKECNDPPSIINGKIYNKREMYKVGDEIYYVCNEHKGVQYSLVGNEKITCIQDKSWKPDP 141
Query: 389 PTCKYVDCGSPANIDNGHYSLL--NGTTTHGSVAEYTCDDDYWLEAGAMRRQVCMRDGKW 446
P CK ++C PA + NG+ + + N + + ++C + L + C + W
Sbjct: 142 PICKMINCRFPA-LQNGYINGIPSNKKFYYKTRVGFSCKSGFDLV--GEKYSTCNINATW 198
Query: 447 SGDTPSC 453
P+C
Sbjct: 199 FPSIPTC 205
Score = 52.4 bits (125), Expect = 2e-07
Identities = 44/166 (26%), Positives = 65/166 (39%), Gaps = 27/166 (16%)
Query: 392 KYVDCGSPANIDNGHYSLLNGTTTHGSVAEYTCDDDYWLEAGAMRRQVCMRDGK---WSG 448
K + C P +I NG + L G + EYTC+ DY L R C++D WS
Sbjct: 18 KSIYCDKPDDISNGFITELMEKYEIGKLIEYTCNTDYALIGDRFR--TCIKDKNNAIWSN 75
Query: 449 DTPSCELITCPEP------------EVPKGGYVVGYDLNVHSTLEFHCEPGHLLVGQASL 496
P C L C +P E+ K G + Y N H +++ LVG +
Sbjct: 76 KAPFCMLKECNDPPSIINGKIYNKREMYKVGDEIYYVCNEHKGVQYS------LVGNEKI 129
Query: 497 KCTRQGDWDNMSP--QFISFYFPPSET--ISAVSEAQTLGYMISVG 538
C + W P + I+ FP + I+ + + Y VG
Sbjct: 130 TCIQDKSWKPDPPICKMINCRFPALQNGYINGIPSNKKFYYKTRVG 175
Score = 37.7 bits (87), Expect = 0.012
Identities = 24/104 (23%), Positives = 47/104 (45%), Gaps = 8/104 (7%)
Query: 238 KLGSTISYACPEGN----MLVGSATRTC-KEGFWTGVAPTCQYVDCGKLEHIEHGTVT-L 291
K+G I Y C E LVG+ TC ++ W P C+ ++C + +++G + +
Sbjct: 104 KVGDEIYYVCNEHKGVQYSLVGNEKITCIQDKSWKPDPPICKMINC-RFPALQNGYINGI 162
Query: 292 ETTRTTHGAVAI-YACHENYTLIGETRRVCGDGGKWNGTEPQCL 334
+ + + + ++C + L+GE C W + P C+
Sbjct: 163 PSNKKFYYKTRVGFSCKSGFDLVGEKYSTCNINATWFPSIPTCV 206
>gnl|CDD|165167 PHA02817, PHA02817, EEV Host range protein; Provisional.
Length = 225
Score = 60.7 bits (147), Expect = 2e-10
Identities = 39/127 (30%), Positives = 55/127 (43%), Gaps = 8/127 (6%)
Query: 273 CQYVDCGKLEHIEHGTVTLETTRTTHGAVAIYACHEN-----YTLIGETRRVCGDGGKWN 327
C C I++G + + T G+ + C N YTL+GE +C GKWN
Sbjct: 19 CDLNKCCYPPSIKNGYIYNKKTEYNIGSNVTFFCGNNTRGVRYTLVGEKNIICEKDGKWN 78
Query: 328 GTEPQCLFDWCAEPPQISG---GIVTTSGRRTGSVATYSCEPGFILFGSNELKCGLGGDW 384
P C C P +G GI + S ++SC+PGF+L G+ CG+ W
Sbjct: 79 KEFPVCKIIRCRFPALQNGFVNGIPDSKKFYYESEVSFSCKPGFVLIGTKYSVCGINSSW 138
Query: 385 SGKAPTC 391
K P C
Sbjct: 139 IPKVPIC 145
Score = 51.5 bits (123), Expect = 2e-07
Identities = 33/123 (26%), Positives = 53/123 (43%), Gaps = 12/123 (9%)
Query: 391 CKYVDCGSPANIDNGHYSLLNGTTTHGSVAEYTCDDD-----YWLEAGAMRRQVCMRDGK 445
C C P +I NG+ GS + C ++ Y L + +C +DGK
Sbjct: 19 CDLNKCCYPPSIKNGYIYNKKTEYNIGSNVTFFCGNNTRGVRYTLVGE--KNIICEKDGK 76
Query: 446 WSGDTPSCELITCPEPEVPKGGYVVGYDLN----VHSTLEFHCEPGHLLVGQASLKCTRQ 501
W+ + P C++I C P + + G+V G + S + F C+PG +L+G C
Sbjct: 77 WNKEFPVCKIIRCRFPAL-QNGFVNGIPDSKKFYYESEVSFSCKPGFVLIGTKYSVCGIN 135
Query: 502 GDW 504
W
Sbjct: 136 SSW 138
Score = 40.7 bits (95), Expect = 0.001
Identities = 34/133 (25%), Positives = 50/133 (37%), Gaps = 18/133 (13%)
Query: 333 CLFDWCAEPPQISGGIVTTSGR--RTGSVATYSCEPG-----FILFGSNELKCGLGGDWS 385
C + C PP I G + GS T+ C + L G + C G W+
Sbjct: 19 CDLNKCCYPPSIKNGYIYNKKTEYNIGSNVTFFCGNNTRGVRYTLVGEKNIICEKDGKWN 78
Query: 386 GKAPTCKYVDCGSPANIDNGHYSLLNGTTT-----HGSVAEYTCDDDYWLEAGAMRRQVC 440
+ P CK + C PA + NG +NG + S ++C + L G + VC
Sbjct: 79 KEFPVCKIIRCRFPA-LQNGF---VNGIPDSKKFYYESEVSFSCKPGFVL-IGT-KYSVC 132
Query: 441 MRDGKWSGDTPSC 453
+ W P C
Sbjct: 133 GINSSWIPKVPIC 145
>gnl|CDD|153059 cd03589, CLECT_CEL-1_like, C-type lectin-like domain (CTLD) of the
type found in CEL-1 from Cucumaria echinata and
Echinoidin from Anthocidaris crassispina.
CLECT_CEL-1_like: C-type lectin-like domain (CTLD) of
the type found in CEL-1 from Cucumaria echinata and
Echinoidin from Anthocidaris crassispina. CTLD refers
to a domain homologous to the carbohydrate-recognition
domains (CRDs) of the C-type lectins. The CEL-1 CTLD
binds three calcium ions and has a high specificity for
N-acteylgalactosamine (GalNAc). CEL-1 exhibits strong
cytotoxicity which is inhibited by GalNAc. This protein
may play a role as a toxin defending against predation.
Echinoidin is found in the coelomic fluid of the sea
urchin and is specific for GalBeta1-3GalNAc. Echinoidin
has a cell adhesive activity towards human cancer cells
which is not mediated through the CTLD. Both CEL-1 and
Echinoidin are multimeric proteins comprised of multiple
dimers linked by disulfide bonds.
Length = 137
Score = 55.8 bits (135), Expect = 2e-09
Identities = 37/138 (26%), Positives = 57/138 (41%), Gaps = 15/138 (10%)
Query: 87 TSFDRNCYEFNIDKGASFADARSKCQ---ASG--GDLVHGFRGITNTFLLGELERRKDRL 141
T+F CY F ++ +A +C+ G LV N F+ E +
Sbjct: 6 TAFGGYCYRF-FGDRLTWEEAELRCRSFSIPGLIAHLVSIHSQEENDFVYDLFESSRGPD 64
Query: 142 KTLLVWIGA--QKDPGITARTWKWVDGEVVTKPSWGKDQPNNYNGEQNCVVL--DGGRGW 197
+WIG + G ++W DG V W QP+NY G ++CV + G G
Sbjct: 65 TPYGLWIGLHDRTSEG----PFEWTDGSPVDFTKWAGGQPDNYGGNEDCVQMWRRGDAGQ 120
Query: 198 LWNDVGCKLDYLHWICQH 215
WND+ C + +IC+
Sbjct: 121 SWNDMPCDAVF-PYICKM 137
>gnl|CDD|215703 pfam00084, Sushi, Sushi domain (SCR repeat).
Length = 56
Score = 53.0 bits (127), Expect = 3e-09
Identities = 17/56 (30%), Positives = 24/56 (42%)
Query: 278 CGKLEHIEHGTVTLETTRTTHGAVAIYACHENYTLIGETRRVCGDGGKWNGTEPQC 333
C I HG ++ G+V Y C Y L+G + C D G W+ P+C
Sbjct: 1 CPAPPPIPHGVISTTKNSYQVGSVVRYECDPGYVLVGSSVITCQDDGTWSPPLPEC 56
Score = 52.2 bits (125), Expect = 6e-09
Identities = 22/56 (39%), Positives = 31/56 (55%), Gaps = 2/56 (3%)
Query: 338 CAEPPQISGGIVTTSGRRT--GSVATYSCEPGFILFGSNELKCGLGGDWSGKAPTC 391
C PP I G+++T+ GSV Y C+PG++L GS+ + C G WS P C
Sbjct: 1 CPAPPPIPHGVISTTKNSYQVGSVVRYECDPGYVLVGSSVITCQDDGTWSPPLPEC 56
Score = 48.7 bits (116), Expect = 9e-08
Identities = 20/58 (34%), Positives = 23/58 (39%), Gaps = 2/58 (3%)
Query: 396 CGSPANIDNGHYSLLNGTTTHGSVAEYTCDDDYWLEAGAMRRQVCMRDGKWSGDTPSC 453
C +P I +G S + GSV Y CD Y L C DG WS P C
Sbjct: 1 CPAPPPIPHGVISTTKNSYQVGSVVRYECDPGYVLV--GSSVITCQDDGTWSPPLPEC 56
Score = 42.9 bits (101), Expect = 9e-06
Identities = 19/56 (33%), Positives = 29/56 (51%), Gaps = 2/56 (3%)
Query: 220 CGSPDRHLNTTFVGTVN-TKLGSTISYACPEGNMLVGSATRTCKE-GFWTGVAPTC 273
C +P + T N ++GS + Y C G +LVGS+ TC++ G W+ P C
Sbjct: 1 CPAPPPIPHGVISTTKNSYQVGSVVRYECDPGYVLVGSSVITCQDDGTWSPPLPEC 56
Score = 42.2 bits (99), Expect = 2e-05
Identities = 15/51 (29%), Positives = 23/51 (45%), Gaps = 2/51 (3%)
Query: 458 CPEPEVPKGGYVVGY--DLNVHSTLEFHCEPGHLLVGQASLKCTRQGDWDN 506
CP P G + V S + + C+PG++LVG + + C G W
Sbjct: 1 CPAPPPIPHGVISTTKNSYQVGSVVRYECDPGYVLVGSSVITCQDDGTWSP 51
>gnl|CDD|153061 cd03591, CLECT_collectin_like, C-type lectin-like domain (CTLD) of
the type found in human collectins including lung
surfactant proteins A and D, mannose- or mannan binding
lectin (MBL), and CL-L1 (collectin liver 1).
CLECT_collectin_like: C-type lectin-like domain (CTLD)
of the type found in human collectins including lung
surfactant proteins A and D, mannose- or mannan binding
lectin (MBL), and CL-L1 (collectin liver 1). CTLD
refers to a domain homologous to the
carbohydrate-recognition domains (CRDs) of the C-type
lectins. The CTLDs of these collectins bind
carbohydrates on surfaces (e.g. pathogens, allergens,
necrotic, or apoptotic cells) and mediate functions
associated with killing and phagocytosis. MBPs
recognize high mannose oligosaccharides in a calcium
dependent manner, bind to a broad range of pathogens,
and trigger cell killing by activating the complement
pathway. MBP also acts directly as an opsonin. SP-A
and SP-D in addition to functioning as host defense
components, are components of pulmonary surfactant which
play a role in surfactant homeostasis. Pulmonary
surfactant is a phospholipid-protein complex which
reduces the surface tension within the lungs. SP-A
binds the major surfactant lipid:
dipalmitoylphosphatidylcholine (DPPC). SP-D binds two
minor components of surfactant that contain sugar
moieties: glucosylceramide and phosphatidylinositol
(PI). MBP and SP-A, -D monomers are homotrimers with an
N-terminal collagen region and three CTLDs. Multiple
homotrimeric units associate to form supramolecular
complexes. MBL deficiency results in an increased
susceptibility to a large number of different infections
and to inflammatory disease, such as rheumatoid
arthritis.
Length = 114
Score = 51.5 bits (124), Expect = 4e-08
Identities = 28/112 (25%), Positives = 47/112 (41%), Gaps = 12/112 (10%)
Query: 104 FADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDPGITARTWKW 163
F DA+ C +GG L N + +++ +IG T + +
Sbjct: 13 FDDAQKLCSEAGGTLAMPRNAAENAAIASYVKKGNTY-----AFIGITDLE--TEGQFVY 65
Query: 164 VDGEVVTKPSWGKDQPNNYNGEQNCVV-LDGGRGWLWNDVGCKLDYLHWICQ 214
+DG +T +W +PNN G ++CV G+ WNDV C ++C+
Sbjct: 66 LDGGPLTYTNWKPGEPNNAGGGEDCVEMYTSGK---WNDVAC-NLTRLFVCE 113
>gnl|CDD|153062 cd03592, CLECT_selectins_like, C-type lectin-like domain (CTLD) of
the type found in the type 1 transmembrane proteins:
P(platlet)-, E(endothelial)-, and L(leukocyte)-
selectins (sels). CLECT_selectins_like: C-type
lectin-like domain (CTLD) of the type found in the type
1 transmembrane proteins: P(platlet)-, E(endothelial)-,
and L(leukocyte)- selectins (sels). CTLD refers to a
domain homologous to the carbohydrate-recognition
domains (CRDs) of the C-type lectins. P- E- and L-sels
are cell adhesion receptors that mediate the initial
attachment, tethering, and rolling of lymphocytes on
inflamed vascular walls enabling subsequent lymphocyte
adhesion and transmigration. L- sel is expressed
constitutively on most leukocytes. P-sel is stored in
the Weibel-Palade bodies of endothelial cells and in the
alpha granules of platlets. E- sels are present on
endothelial cells. Following platelet and/or
endothelial cell activation P- sel is rapidly
translocated to the cell surface and E-sel expression is
induced. The initial step in leukocyte migration
involves interactions of selectins with fucosylated,
sialylated, and sulfated carbohydrate moieties on target
ligands displayed on glycoprotein scaffolds on
endothelial cells and leucocytes. A major ligand of P-
E- and L-sels is PSGL-1 (P-sel glycoprotein ligand).
Interactions of E- and P- sels with tumor cells may
promote extravasation of cancer cells. Regulation of
L-sel and P-sel function includes proteolytic shedding
of the most extracellular portion (containing the CTLD)
from the cell surface. Increased levels of the soluble
form of P-sel in the plasma have been found in a number
of diseases including coronary disease and diabetes. E-
and P- sel also play roles in the development of
synovial inflammation in inflammatory arthritis.
Platelet P-sel, but not endothelial P-sel, plays a role
in the inflammatory response and neointimal formation
after arterial injury. Selectins may also function as
signal-transducing receptors.
Length = 115
Score = 51.2 bits (123), Expect = 5e-08
Identities = 28/119 (23%), Positives = 40/119 (33%), Gaps = 22/119 (18%)
Query: 103 SFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDPGITARTWK 162
+F +A C++ G DLV N L G + WI
Sbjct: 11 TFNEAVKYCKSRGTDLVAIQNAEENALLNGFALKYNLG----YYWIDGND----INNEGT 62
Query: 163 WVD--GEVVTKPSWGKDQPNNYNGEQNCVV----LDGGRGWLWNDVGCKLDYLHW-ICQ 214
WVD + + +W +PNN +NC+ +G WND C IC
Sbjct: 63 WVDTDKKELEYKNWAPGEPNN-GRNENCLEIYIKDNG----KWNDEPC--SKKKSAICY 114
>gnl|CDD|153068 cd03598, CLECT_EMBP_like, C-type lectin-like domain (CTLD) of the
type found in the human proteins, eosinophil major basic
protein (EMBP) and prepro major basic protein homolog
(MBPH). CLECT_EMBP_like: C-type lectin-like domain
(CTLD) of the type found in the human proteins,
eosinophil major basic protein (EMBP) and prepro major
basic protein homolog (MBPH). CTLD refers to a domain
homologous to the carbohydrate-recognition domains
(CRDs) of the C-type lectins. Eosinophils and basophils
carry out various functions in allergic, parasitic, and
inflammatory diseases. EMBP is stored in eosinophil
crystalloid granules and is released upon degranulation.
EMBP is also expressed in basophils. The proform of
EMBP is expressed in placental X cells and breast tissue
and increases significantly during human pregnancy.
EMBP has cytotoxic properties and damages bacteria and
mammalian cells, in vitro, as well as, helminth
parasites. EMBP deposition has been observed in the
inflamed tissue of allergy patients in a variety of
diseases including asthma, atopic dermatitis, and
rhinitis. In addition to its cytotoxic functions, EMBP
activates cells and stimulates cytokine production.
EMBP has been shown to bind the proteoglycan heparin.
The binding site is similar to the carbohydrate binding
site of other classical CTLD, such as mannose-binding
protein (MBP1), however, heparin binding to EMBP is
calcium ion independent. MBPH has reduced potency in
cytotoxic and cytostimulatory assays compared with EMBP.
Length = 117
Score = 51.3 bits (123), Expect = 5e-08
Identities = 36/124 (29%), Positives = 46/124 (37%), Gaps = 10/124 (8%)
Query: 93 CYEFNIDKGASFADARSKCQ-ASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQ 151
CY F F DA+ C+ G+L I + ++R L VWIG
Sbjct: 3 CYRFVKSPRT-FRDAQVICRRCYRGNLA----SIHSFAFNYRVQRLVSTLNQAQVWIGGI 57
Query: 152 KDPGITARTWKWVDGEVVTKPSWGKDQPNNYNGEQNCVVLDGGRGWLWNDVGCKLDYLHW 211
R + WVDG V W QP N +CV L G W CKL +
Sbjct: 58 ITGKGRCRRFSWVDGSVWNYAYWAPGQP--GNRRGHCVELCTRGGH-WRRAHCKL-RRPF 113
Query: 212 ICQH 215
IC +
Sbjct: 114 ICSY 117
>gnl|CDD|153073 cd03603, CLECT_VCBS, A bacterial subgroup of the C-type lectin-like
(CTLD) domain; a subgroup of bacterial protein domains
homologous to the carbohydrate-recognition domains
(CRDs) of the C-type lectins. CLECT_VCBS: A bacterial
subgroup of the C-type lectin-like (CTLD) domain; a
subgroup of bacterial protein domains homologous to the
carbohydrate-recognition domains (CRDs) of the C-type
lectins. Many CTLDs are calcium-dependent carbohydrate
binding modules; other CTLDs bind protein ligands,
lipids, and inorganic surfaces including CaCO3 and ice.
Bacterial CTLDs within this group are functionally
uncharacterized. Animal C-type lectins are involved in
such functions as extracellular matrix organization,
endocytosis, complement activation, pathogen
recognition, and cell-cell interactions. CTLDs may bind
a variety of carbohydrate ligands including mannose,
N-acetylglucosamine, galactose, N-acetylgalactosamine,
and fucose. CTLDs associate with each other through
several different surfaces to form dimers, trimers, or
tetramers from which ligand-binding sites project in
different orientations. In some CTLDs a loop extends to
the adjoining domain to form a loop-swapped dimer.
Length = 118
Score = 48.6 bits (116), Expect = 4e-07
Identities = 34/121 (28%), Positives = 49/121 (40%), Gaps = 28/121 (23%)
Query: 94 YEFNIDKGASFADARSKCQASGGDLVHGFRGIT----NTFLLGELERRKDRLKTLLVWIG 149
Y+F +D G ++ A++ ++ GG LV I N +LL WIG
Sbjct: 3 YKF-VDGGMTWEAAQTLAESLGGHLVT----INSAEENDWLLSNFGGYGA------SWIG 51
Query: 150 AQKDPGITARTWKWVDGEVVTKPSWGKDQP-------NNYNGEQNCVVLDGGRGWLWNDV 202
A T TWKW DGE T +WG +P +Y + + G WND+
Sbjct: 52 ASDAA--TEGTWKWSDGEESTYTNWGSGEPHNNGGGNEDYAAINHFPGISGK----WNDL 105
Query: 203 G 203
Sbjct: 106 A 106
>gnl|CDD|165176 PHA02831, PHA02831, EEV host range protein; Provisional.
Length = 268
Score = 50.4 bits (120), Expect = 1e-06
Identities = 31/127 (24%), Positives = 54/127 (42%), Gaps = 14/127 (11%)
Query: 219 NCGSPDRHLNTTFVGTVNTK-----LGSTISYACP----EGNMLVGSATRTCKEGFWTGV 269
NC P LN G + K G +++YAC E +VG+ T C W
Sbjct: 77 NCKDPVTILN----GYIKNKKDQYSFGDSVTYACKVNKLEKYSIVGNETVKCINKQWVPK 132
Query: 270 APTCQYVDCGKLEHIEHGTVTLETTRTTHGAVAIYACHENYTLIGETRRVCGDGGKWNGT 329
P C+ + C K +++G + + + +G + + C + + L+G + C W
Sbjct: 133 YPVCKLIRC-KYPALQNGFLNVFEKKFYYGDIVNFKCKKGFILLGSSVSTCDINSIWYPG 191
Query: 330 EPQCLFD 336
P+C+ D
Sbjct: 192 IPKCVKD 198
Score = 49.2 bits (117), Expect = 2e-06
Identities = 33/138 (23%), Positives = 55/138 (39%), Gaps = 8/138 (5%)
Query: 260 TCKEGFWTGVAPTCQYVDCGKLEHIEHGTVTLETTRTTHGAVAIYACH----ENYTLIGE 315
TC G W+ +C I +G + + + + G YAC E Y+++G
Sbjct: 60 TCNNGSWSTKNMCIGKRNCKDPVTILNGYIKNKKDQYSFGDSVTYACKVNKLEKYSIVGN 119
Query: 316 TRRVCGDGGKWNGTEPQCLFDWCAEPPQISGGIVTTSGRR--TGSVATYSCEPGFILFGS 373
C + +W P C C + P + G + ++ G + + C+ GFIL GS
Sbjct: 120 ETVKCINK-QWVPKYPVCKLIRC-KYPALQNGFLNVFEKKFYYGDIVNFKCKKGFILLGS 177
Query: 374 NELKCGLGGDWSGKAPTC 391
+ C + W P C
Sbjct: 178 SVSTCDINSIWYPGIPKC 195
Score = 38.4 bits (89), Expect = 0.006
Identities = 34/147 (23%), Positives = 56/147 (38%), Gaps = 10/147 (6%)
Query: 362 YSCEPGFILFGSNELKCGLGGDWSGKAPTCKYVDCGSPANIDNGHYSLLNGTTTHGSVAE 421
Y C F + C G WS K +C P I NG+ + G
Sbjct: 48 YKCNNNF---DKVFVTCN-NGSWSTKNMCIGKRNCKDPVTILNGYIKNKKDQYSFGDSVT 103
Query: 422 YTCDDDYWLEAGAM--RRQVCMRDGKWSGDTPSCELITCPEPEVPKGGYVVGYDLNVH-- 477
Y C + LE ++ V + +W P C+LI C P + + G++ ++ +
Sbjct: 104 YACKVNK-LEKYSIVGNETVKCINKQWVPKYPVCKLIRCKYPAL-QNGFLNVFEKKFYYG 161
Query: 478 STLEFHCEPGHLLVGQASLKCTRQGDW 504
+ F C+ G +L+G + C W
Sbjct: 162 DIVNFKCKKGFILLGSSVSTCDINSIW 188
>gnl|CDD|165263 PHA02954, PHA02954, EEV membrane glycoprotein; Provisional.
Length = 317
Score = 47.8 bits (113), Expect = 8e-06
Identities = 35/120 (29%), Positives = 51/120 (42%), Gaps = 6/120 (5%)
Query: 272 TCQYVDCGKLEHIEHGTVTLETTRTTHGAVAIYACHENYTLIGETRRVCGDGGKWNGTEP 331
TC +C L+ +EHG+ + + G C Y +IG + C WN P
Sbjct: 124 TCPNAECQPLQ-LEHGSCQPVKEKYSFGEHITINCDVGYEVIGASYISC-TANSWN-VIP 180
Query: 332 QCLFDWCAEPPQISGGIVTTSGRRTGSVATYSCEPGFILFGSNELKCGLGGDWSGKAPTC 391
C + P +S G+++ S G V SC+ GF L GS C + G W+ P C
Sbjct: 181 SC--QQKCDIPSLSNGLISGSTFSIGGVIHLSCKSGFTLTGSPSSTC-IDGKWNPVLPIC 237
Score = 38.1 bits (88), Expect = 0.010
Identities = 28/96 (29%), Positives = 43/96 (44%), Gaps = 6/96 (6%)
Query: 239 LGSTISYACPEGNMLVGSATRTCKEGFWTGVAPTCQYVDCGKLEHIEHGTVTLETTRTTH 298
G I+ C G ++G++ +C W V P+CQ C + + +G ++ T
Sbjct: 149 FGEHITINCDVGYEVIGASYISCTANSWN-VIPSCQQ-KC-DIPSLSNGLISGSTFSI-- 203
Query: 299 GAVAIYACHENYTLIGETRRVCGDGGKWNGTEPQCL 334
G V +C +TL G C D GKWN P C+
Sbjct: 204 GGVIHLSCKSGFTLTGSPSSTCID-GKWNPVLPICV 238
Score = 33.5 bits (76), Expect = 0.25
Identities = 19/61 (31%), Positives = 25/61 (40%), Gaps = 5/61 (8%)
Query: 213 CQHNPANCGSPDRHLNTTFVGTVNTKLGSTISYACPEGNMLVGSATRTCKEGFWTGVAPT 272
CQ C P L+ + +G I +C G L GS + TC +G W V P
Sbjct: 182 CQQK---CDIPS--LSNGLISGSTFSIGGVIHLSCKSGFTLTGSPSSTCIDGKWNPVLPI 236
Query: 273 C 273
C
Sbjct: 237 C 237
>gnl|CDD|153064 cd03594, CLECT_REG-1_like, C-type lectin-like domain (CTLD) of the
type found in Human REG-1 (lithostathine), REG-4, and
avian eggshell-specific proteins: ansocalcin,
structhiocalcin-1(SCA-1), and -2(SCA-2).
CLECT_REG-1_like: C-type lectin-like domain (CTLD) of
the type found in Human REG-1 (lithostathine), REG-4,
and avian eggshell-specific proteins: ansocalcin,
structhiocalcin-1(SCA-1), and -2(SCA-2). CTLD refers to
a domain homologous to the carbohydrate-recognition
domains (CRDs) of the C-type lectins. REG-1 is a
proliferating factor which participates in various kinds
of tissue regeneration including pancreatic beta-cell
regeneration, regeneration of intestinal mucosa,
regeneration of motor neurons, and perhaps in tissue
regeneration of damaged heart. REG-1 may play a role on
the pathophysiology of Alzheimer's disease and in the
development of gastric cancers. Its expression is
correlated with reduced survival from early-stage
colorectal cancer. REG-1 also binds and aggregates
several bacterial strains from the intestinal flora and
it has been suggested that it is involved in the control
of the intestinal bacterial ecosystem. Rat
lithostathine has calcium carbonate crystal inhibitor
activity in vitro. REG-IV is unregulated in pancreatic,
gastric, hepatocellular, and prostrate adenocarcinomas.
REG-IV activates the EGF receptor/Akt/AP-1 signaling
pathway in colorectal carcinoma. Ansocalcin, SCA-1 and
-2 are found at high concentration in the calcified egg
shell layer of goose and ostrich, respectively and tend
to form aggregates. Ansocalcin nucleates calcite
crystal aggregates in vitro.
Length = 129
Score = 45.1 bits (107), Expect = 9e-06
Identities = 25/70 (35%), Positives = 35/70 (50%), Gaps = 6/70 (8%)
Query: 146 VWIGAQKDPGITARTWKWVDGEVVTKPSWGKDQPNNYNGEQNCVVLDGGRGWL-WNDVGC 204
VWIG DP +R W+W DG + SW ++ P Y C L G+L WND C
Sbjct: 64 VWIGLH-DPQ-QSRGWEWSDGSKLDYRSWDRNPP--YARGGYCAELSRSTGFLKWNDANC 119
Query: 205 KLDYLHWICQ 214
+ + +IC+
Sbjct: 120 E-ERNPFICK 128
>gnl|CDD|153063 cd03593, CLECT_NK_receptors_like, C-type lectin-like domain (CTLD)
of the type found in natural killer cell receptors
(NKRs). CLECT_NK_receptors_like: C-type lectin-like
domain (CTLD) of the type found in natural killer cell
receptors (NKRs), including proteins similar to oxidized
low density lipoprotein (OxLDL) receptor (LOX-1), CD94,
CD69, NKG2-A and -D, osteoclast inhibitory lectin
(OCIL), dendritic cell-associated C-type lectin-1
(dectin-1), human myeloid inhibitory C-type lectin-like
receptor (MICL), mast cell-associated functional antigen
(MAFA), killer cell lectin-like receptors: subfamily F,
member 1 (KLRF1) and subfamily B, member 1 (KLRB1), and
lys49 receptors. CTLD refers to a domain homologous to
the carbohydrate-recognition domains (CRDs) of the
C-type lectins. NKRs are variously associated with
activation or inhibition of natural killer (NK) cells.
Activating NKRs stimulate cytolysis by NK cells of
virally infected or transformed cells; inhibitory NKRs
block cytolysis upon recognition of markers of healthy
self cells. Most Lys49 receptors are inhibitory; some
are stimulatory. OCIL inhibits NK cell function via
binding to the receptor NKRP1D. Murine OCIL in addition
to inhibiting NK cell function inhibits osteoclast
differentiation. MAFA clusters with the type I Fc
epsilon receptor (FcepsilonRI) and inhibits the mast
cells secretory response to FcepsilonRI stimulus. CD72
is a negative regulator of B cell receptor signaling.
NKG2D is an activating receptor for stress-induced
antigens; human NKG2D ligands include the stress induced
MHC-I homologs, MICA, MICB, and ULBP family of
glycoproteins Several NKRs have a carbohydrate-binding
capacity which is not mediated through calcium ions
(e.g. OCIL binds a range of high molecular weight
sulfated glycosaminoglycans including dextran sulfate,
fucoidan, and gamma-carrageenan sugars). Dectin-1 binds
fungal beta-glucans and in involved in the innate immune
responses to fungal pathogens. MAFA binds saccharides
having terminal alpha-D mannose residues in a
calcium-dependent manner. LOX-1 is the major receptor
for OxLDL in endothelial cells and thought to play a
role in the pathology of atherosclerosis. Some NKRs
exist as homodimers (e.g.Lys49, NKG2D, CD69, LOX-1) and
some as heterodimers (e.g. CD94/NKG2A). Dectin-1 can
function as a monomer in vitro.
Length = 116
Score = 41.2 bits (97), Expect = 2e-04
Identities = 25/127 (19%), Positives = 48/127 (37%), Gaps = 18/127 (14%)
Query: 89 FDRNCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWI 148
+ CY F+++K ++ +++ C + L+ I + ELE + ++ + WI
Sbjct: 8 YGNKCYYFSMEKK-TWNESKEACSSKNSSLLK----IDDE---EELEFLQSQIGSSSYWI 59
Query: 149 GAQKDPGITARTWKWVDGEVVTKPSWGKDQPNNYNGEQNCVVLDGGRGWLWNDVGCKLDY 208
G ++ + + WKW+DG + NC L C
Sbjct: 60 GLSREK--SEKPWKWIDGSPLNN----LFNIRGSTKSGNCAYLSSTGI---YSEDCSTKK 110
Query: 209 LHWICQH 215
WIC+
Sbjct: 111 -RWICEK 116
>gnl|CDD|153071 cd03601, CLECT_TC14_like, C-type lectin-like domain (CTLD) of the
type found in lectins TC14, TC14-2, TC14-3, and TC14-4
from the budding tunicate Polyandrocarpa misakiensis and
PfG6 from the Acorn worm. CLECT_TC14_like: C-type
lectin-like domain (CTLD) of the type found in lectins
TC14, TC14-2, TC14-3, and TC14-4 from the budding
tunicate Polyandrocarpa misakiensis and PfG6 from the
Acorn worm. CTLD refers to a domain homologous to the
carbohydrate-recognition domains (CRDs) of the C-type
lectins. TC14 is homodimeric. The CTLD of TC14 binds
D-galactose and D-fucose. TC14 is expressed
constitutively by multipotent epithelial and mesenchymal
cells and plays in role during budding, in inducing the
aggregation of undifferentiated mesenchymal cells to
give rise to epithelial forming tissue. TC14-2 and
TC14-3 shows calcium-dependent galactose binding
activity. TC14-3 is a cytostatic factor which blocks
cell growth and dedifferentiation of the atrial
epithelium during asexual reproduction. It may also act
as a differentiation inducing factor. Galactose
inhibits the cytostatic activity of TC14-3. The gene
for Acorn worm PfG6 is gill-specific; PfG6 may be a
secreted protein.
Length = 119
Score = 38.7 bits (90), Expect = 0.001
Identities = 28/113 (24%), Positives = 45/113 (39%), Gaps = 17/113 (15%)
Query: 99 DKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLV-----WIGAQKD 153
D+ ++A A + C++ G L + + + E R L LV W+GA +
Sbjct: 7 DETMNYAKAGAFCRSRGMRLA--------SLAMRDSEMRDAILAFTLVKGHGYWVGAD-N 57
Query: 154 PGITARTWKWVDGEVVTKPS--WGKDQPNNYNGEQNCVVLDGGRGWLWNDVGC 204
+ W DG + S W ++P+N Q CV L L +D C
Sbjct: 58 LQDGEYDFLWNDGVSLPTDSDLWAPNEPSNPQSRQLCVQLWSKYN-LLDDEYC 109
>gnl|CDD|153065 cd03595, CLECT_chondrolectin_like, C-type lectin-like domain (CTLD)
of the type found in the human type-1A transmembrane
proteins chondrolectin (CHODL) and layilin.
CLECT_chondrolectin_like: C-type lectin-like domain
(CTLD) of the type found in the human type-1A
transmembrane proteins chondrolectin (CHODL) and
layilin. CTLD refers to a domain homologous to the
carbohydrate-recognition domains (CRDs) of the C-type
lectins. CHODL is predominantly expressed in muscle
cells and is associated with T-cell maturation. Various
alternatively spliced isoforms have been of CHODL have
been identified. The transmembrane form of CHODL is
localized in the ER-Golgi apparatus. Layilin is widely
expressed in different cell types. The extracellular
CTLD of layilin binds hyaluronan (HA), a major
constituent of the extracellular matrix (ECM). The
cytoplasmic tail of layilin binds various members of the
band 4.1/ERM superfamily (talin, radixin, and merlin).
The ERM proteins are cytoskeleton-membrane linker
molecules which link actin to receptors in the plasma
membrane. Layilin co-localizes in with talin in
membrane ruffles and may mediate signals from the ECM to
the cell cytoskeleton.
Length = 149
Score = 38.7 bits (90), Expect = 0.002
Identities = 35/154 (22%), Positives = 69/154 (44%), Gaps = 39/154 (25%)
Query: 89 FDRNCYEF----NIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERR--KDRLK 142
++ CY+ + + +F +AR C+ GG+L+ + E E++ + ++
Sbjct: 8 TEKPCYKIAYFQDSRRRLNFEEARQACREDGGELLS---------IESENEQKLIERFIQ 58
Query: 143 TLLV-----WIGAQKDPGITART------WKWVDGEVVTKPSWGKDQPNNYNGEQNCVVL 191
TL WIG ++ + + W+DG + T +W D+P+ G + CVV+
Sbjct: 59 TLRASDGDFWIGLRRSSQYNVTSSACSSLYYWLDGSISTFRNWYVDEPS--CGSEVCVVM 116
Query: 192 ------DGGRG----WLWNDVGCKLDYLHWICQH 215
G+G + WND C + ++IC++
Sbjct: 117 YHQPSAPAGQGGPYLFQWNDDNCNMKN-NFICKY 149
>gnl|CDD|188093 TIGR00864, PCC, polycystin cation channel protein. The Polycystin
Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a
huge protein of 4303aas. Its repeated leucine-rich (LRR)
segment is found in many proteins. It contains 16
polycystic kidney disease (PKD) domains, one
LDL-receptor class A domain, one C-type lectin family
domain, and 16-18 putative TMSs in positions between
residues 2200 and 4100. Polycystin-L has been shown to
be a cation (Na+, K+ and Ca2+) channel that is activated
by Ca2+. Two members of the PCC family (polycystin 1 and
2) are mutated in autosomal dominant polycystic kidney
disease, and polycystin-L is deleted in mice with renal
and retinal defects. Note: this model is restricted to
the amino half for technical reasons.
Length = 2740
Score = 40.5 bits (94), Expect = 0.003
Identities = 55/243 (22%), Positives = 82/243 (33%), Gaps = 30/243 (12%)
Query: 93 CYEFNIDKGASFADARSKCQA-SGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQ 151
C++ ++ A++ DA+ +C A +G L FL ++ DR VWIG
Sbjct: 331 CFQIVPEE-AAWLDAQEQCLARAGAALAIVDNDALQNFLARKVTHSLDR----GVWIGFS 385
Query: 152 KDPGITARTWKWVDG-EVVTKPSWGKDQPNNYNGEQNCVVLDGGRGWLWNDVGCKLDYLH 210
G + E +P+ E CV LD RG +D+ C +
Sbjct: 386 DVNGAEKGPAHQGEAFEAEECEEGLAGEPHPARAEH-CVRLD-PRGQCNSDL-CNAPH-A 441
Query: 211 WICQHNPANCGSPDRHLNTTFVGTVNTKLGSTISYACPEGNMLVGSATRTCKEGFWTGVA 270
++C+ NP G P +G + L + M A EG +
Sbjct: 442 YVCELNP---GGPVPDAENFAMGAASFDLHGLLQ---ALAAMDGLPAPP--HEGVEVLLF 493
Query: 271 PTCQYVDCGKLEHIEHGTVTLETTRTTHGAVAIYACHENYTLIGETRRVCGDGGKWNGTE 330
P ++ L E G T E R H IY G GG G E
Sbjct: 494 PALRFSRAAFLSSAEFG--TQELRRPAHILFQIYRLRCRLP---------GAGGPACGPE 542
Query: 331 PQC 333
+C
Sbjct: 543 AEC 545
>gnl|CDD|153058 cd03588, CLECT_CSPGs, C-type lectin-like domain (CTLD) of the type
found in chondroitin sulfate proteoglycan core proteins.
CLECT_CSPGs: C-type lectin-like domain (CTLD) of the
type found in chondroitin sulfate proteoglycan core
proteins (CSPGs) in human and chicken aggrecan, frog
brevican, and zebra fish dermacan. CTLD refers to a
domain homologous to the carbohydrate-recognition
domains (CRDs) of the C-type lectins. In cartilage,
aggrecan forms cartilage link protein stabilized
aggregates with hyaluronan (HA). These aggregates
contribute to the tissue's load bearing properties.
Aggregates having other CSPGs substituting for aggrecan
may contribute to the structural integrity of many
different tissues. Xenopus brevican is expressed in the
notochord and the brain during early embryogenesis.
Zebra fish dermacan is expressed in dermal bones and may
play a role in dermal bone development. CSPGs do
contain LINK domain(s) which bind HA. These LINK
domains are considered by one classification system to
be a variety of CTLD, but are omitted from this
hierarchical classification based on insignificant
sequence similarity.
Length = 124
Score = 37.9 bits (88), Expect = 0.003
Identities = 32/124 (25%), Positives = 53/124 (42%), Gaps = 22/124 (17%)
Query: 87 TSFDRNCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLV 146
F +CY D+ ++ DA +C+ G L ++ + E + +
Sbjct: 6 DKFQGHCYRHFPDR-ETWEDAERRCREQQGHL--------SSIVTPEEQEFVNNNAQDYQ 56
Query: 147 WIGAQKDPGITART----WKWVDGEVVTKPSWGKDQPNNY--NGEQNCVVLDGGRGWLWN 200
WIG + RT ++W DG + +W +QP+N+ GE +CVV+ WN
Sbjct: 57 WIG------LNDRTIEGDFRWSDGHPLQFENWRPNQPDNFFATGE-DCVVMIWHEEGEWN 109
Query: 201 DVGC 204
DV C
Sbjct: 110 DVPC 113
>gnl|CDD|223646 COG0573, PstC, ABC-type phosphate transport system, permease
component [Inorganic ion transport and metabolism].
Length = 310
Score = 38.8 bits (91), Expect = 0.007
Identities = 14/55 (25%), Positives = 24/55 (43%), Gaps = 6/55 (10%)
Query: 508 SPQFISFYFPPSETISAVSEAQT----LGYMISV--GAGVVLFLLILLAIIYARL 556
+ + F P T++++ + G S G+VLF++ LL I AR
Sbjct: 247 TARIPLSLFSPGTTLTSIIANEFGEASGGLHTSALFALGLVLFVITLLLNILARY 301
>gnl|CDD|224194 COG1275, TehA, Tellurite resistance protein and related permeases
[Inorganic ion transport and metabolism].
Length = 329
Score = 36.9 bits (86), Expect = 0.027
Identities = 19/76 (25%), Positives = 34/76 (44%), Gaps = 10/76 (13%)
Query: 494 ASLKCTRQGDWDNMSPQFISFYFPP------SETISAVSEAQTLGYMISVGAGVVLFLLI 547
SL ++ + + ++P S++ PP S T S + A + + GAG + L+
Sbjct: 126 YSLFRGKKFELETITP---SWFLPPVGIIVASLTASVLLGAYFVIGYVFFGAGFFSY-LV 181
Query: 548 LLAIIYARLRKATPVK 563
L I+ RL A +
Sbjct: 182 LFPIVLYRLFTAGLLP 197
>gnl|CDD|183812 PRK12884, ubiA, prenyltransferase; Reviewed.
Length = 279
Score = 33.8 bits (78), Expect = 0.25
Identities = 10/33 (30%), Positives = 17/33 (51%)
Query: 530 TLGYMISVGAGVVLFLLILLAIIYARLRKATPV 562
Y+IS A +V+ L+ +L I+Y K +
Sbjct: 97 IAAYLISPLAFLVVILVSVLGILYNWKLKEYGL 129
>gnl|CDD|153072 cd03602, CLECT_1, C-type lectin (CTL)/C-type lectin-like (CTLD)
domain subgroup 1; a subgroup of protein domains
homologous to the carbohydrate-recognition domains
(CRDs) of the C-type lectins. CLECT_1: C-type lectin
(CTL)/C-type lectin-like (CTLD) domain subgroup 1; a
subgroup of protein domains homologous to the
carbohydrate-recognition domains (CRDs) of the C-type
lectins. Many CTLDs are calcium-dependent carbohydrate
binding modules; other CTLDs bind protein ligands,
lipids, and inorganic surfaces including CaCO3 and ice.
Animal C-type lectins are involved in such functions as
extracellular matrix organization, endocytosis,
complement activation, pathogen recognition, and
cell-cell interactions. CTLDs may bind a variety of
carbohydrate ligands including mannose,
N-acetylglucosamine, galactose, N-acetylgalactosamine,
and fucose. CTLDs associate with each other through
several different surfaces to form dimers, trimers, or
tetramers from which ligand-binding sites project in
different orientations. In some CTLDs a loop extends to
the adjoining domain to form a loop-swapped dimer.
Length = 108
Score = 31.6 bits (72), Expect = 0.31
Identities = 19/69 (27%), Positives = 28/69 (40%), Gaps = 12/69 (17%)
Query: 146 VWIGAQKDPGITARTWKWVDGEVVTKPSWGKDQPNNYNGEQNCVVLD-GGRGWLWNDVGC 204
WIG +D +W+W DG + +W QP G+ +C + GR W C
Sbjct: 49 AWIGLYRDVD----SWRWSDGSESSFRNWNTFQP---FGQGDCATMYSSGR---WYAALC 98
Query: 205 KLDYLHWIC 213
+IC
Sbjct: 99 -SALKPFIC 106
>gnl|CDD|153070 cd03600, CLECT_thrombomodulin_like, C-type lectin-like domain
(CTLD) of the type found in human thrombomodulin(TM),
Endosialin, C14orf27, and C1qR.
CLECT_thrombomodulin_like: C-type lectin-like domain
(CTLD) of the type found in human thrombomodulin(TM),
Endosialin, C14orf27, and C1qR. CTLD refers to a domain
homologous to the carbohydrate-recognition domains
(CRDs) of the C-type lectins. In these
thrombomodulin-like proteins the residues involved in
coordinating Ca2+ in the classical MBP-A CTLD are not
conserved. TM exerts anti-fibrinolytic and
anti-inflammatory activity. TM also regulates blood
coagulation in the anticoagulant protein C pathway. In
this pathway, the procoagulant properties of thrombin
(T) are lost when it binds TM. TM also plays a key role
in tumor biology. It is expressed on endothelial cells
and on several type of tumor cell including squamous
cell carcinoma. Loss of TM expression correlates with
advanced stage and poor prognosis. Loss of function of
TM function may be associated with arterial or venous
thrombosis and with late fetal loss. Soluble molecules
of TM retaining the CTLD are detected in human plasma
and urine where higher levels indicate injury and/or
enhanced turnover of the endothelium. C1qR is expressed
on endothelial cells and stem cells. It is also
expressed on monocots and neutrophils, where it is
subject to ectodomain shedding. Soluble forms of C1qR
retaining the CTLD is detected in human plasma. C1qR
modulates the phagocytosis of apoptotic cells in vivo.
C1qR-deficient mice are defective in clearance of
apoptotic cells in vivo. The cytoplasmic tail of C1qR,
C-terminal to the CTLD of CD93, contains a PDZ binding
domain which interacts with the PDZ domain-containing
adaptor protein, GIPC. The juxtamembrane region of this
tail interacts with the ezrin/radixin/moesin family.
Endosialin functions in the growth and progression of
abdominal tumors and is expressed in the stroma of
several tumors.
Length = 141
Score = 32.0 bits (73), Expect = 0.35
Identities = 32/128 (25%), Positives = 47/128 (36%), Gaps = 14/128 (10%)
Query: 102 ASFADARSKCQASGGDLV---HGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDPG--- 155
+F +A+ C GG+L G + LL R R L WIG Q++P
Sbjct: 14 LTFLEAQRSCIELGGNLATVRSGEEADVVSLLLAAGPGRHGRGSLRL-WIGLQREPRQCS 72
Query: 156 ---ITARTWKWVDGEVVTKPSWGKDQPNNYNGEQNCVVL--DGGRGWL--WNDVGCKLDY 208
+ R + WV G+ T S +P CV L G W D C
Sbjct: 73 DPSLPLRGFSWVTGDQDTDFSNWLQEPAGTCTSPRCVALSAAGSTPDNLKWKDGPCSARA 132
Query: 209 LHWICQHN 216
++C+ +
Sbjct: 133 DGYLCKFS 140
>gnl|CDD|233744 TIGR02138, phosphate_pstC, phosphate ABC transporter, permease
protein PstC. The typical operon for the high affinity
inorganic phosphate ABC transporter encodes an
ATP-binding protein, a phosphate-binding protein, and
two permease proteins. This family consists of one of
the two permease proteins, PstC, which is homologous to
PstA (TIGR00974). In the model bacterium Escherichia
coli, this transport system is induced when the
concentration of extrallular inorganic phosphate is low.
A constitutive, lower affinity transporter operates
otherwise [Transport and binding proteins, Anions].
Length = 295
Score = 32.6 bits (75), Expect = 0.49
Identities = 18/58 (31%), Positives = 25/58 (43%), Gaps = 7/58 (12%)
Query: 506 NMSPQFISFYFPPSETISAV-----SEAQTLGYMISV--GAGVVLFLLILLAIIYARL 556
N+ P F P TI+++ EA S G+VLF++ LL I AR
Sbjct: 232 NVFPISPLSLFDPGTTITSLIANQFGEASGGSVHTSALFALGLVLFVITLLVNILARY 289
>gnl|CDD|201276 pfam00514, Arm, Armadillo/beta-catenin-like repeat. Approx. 40
amino acid repeat. Tandem repeats form super-helix of
helices that is proposed to mediate interaction of
beta-catenin with its ligands. CAUTION: This family
does not contain all known armadillo repeats.
Length = 41
Score = 28.6 bits (65), Expect = 0.71
Identities = 10/41 (24%), Positives = 19/41 (46%), Gaps = 1/41 (2%)
Query: 1 KDNKEYILSVGGIKPVTSLLLNSNHEQTVLNSITTLMYLVT 41
+NK+ ++ G + P+ LL+S E+ + L L
Sbjct: 2 PENKQAVIEAGAVPPLVQ-LLSSPDEEVQEEAAWALSNLAA 41
>gnl|CDD|213052 cd12087, TM_EGFR-like, Transmembrane domain of the Epidermal Growth
Factor Receptor family of Protein Tyrosine Kinases.
PTKs catalyze the transfer of the gamma-phosphoryl group
from ATP to tyrosine (tyr) residues in protein
substrates. EGFR (HER, ErbB) subfamily members include
EGFR (HER1, ErbB1), HER2 (ErbB2), HER3 (ErbB3), HER4
(ErbB4), and similar proteins. They are receptor PTKs
(RTKs) containing an extracellular EGF-related
ligand-binding region, a transmembrane (TM) helix, and a
cytoplasmic region with a tyr kinase domain and a
regulatory C-terminal tail. They are activated by
ligand-induced dimerization, resulting in the
phosphorylation of tyr residues in the C-terminal tail,
which serve as binding sites for downstream signaling
molecules. Collectively, they can recognize a variety of
ligands including EGF, TGFalpha, and neuregulins, among
others. All four subfamily members can form homo- or
heterodimers. HER3 contains an impaired kinase domain
and depends on its heterodimerization partner for
activation. EGFR subfamily members are involved in
signaling pathways leading to a broad range of cellular
responses including cell proliferation, differentiation,
migration, growth inhibition, and apoptosis. The TM
domain not only serves as a membrane anchor, but also
plays an important role in receptor dimerization and
optimal activation. Mutations in the TM domain of EGFR
family RTKs have been associated with increased breast
cancer risk.
Length = 38
Score = 28.7 bits (65), Expect = 0.81
Identities = 9/28 (32%), Positives = 14/28 (50%), Gaps = 1/28 (3%)
Query: 531 LGYMISVGA-GVVLFLLILLAIIYARLR 557
I+ G G +L L+IL I++ R
Sbjct: 4 KTTSIAAGVVGGLLVLVILGLIVFLFRR 31
>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
Length = 1113
Score = 31.0 bits (71), Expect = 2.1
Identities = 16/53 (30%), Positives = 25/53 (47%), Gaps = 2/53 (3%)
Query: 523 SAVSEAQTLGYMISV-GAGVVLFLLILLAIIYARL-RKATPVKNTENIEGAVR 573
S V+E LG V G V++ L L+A + L R++ K + + VR
Sbjct: 637 SVVAELSPLGLADDVIGQAVIIIALALIAFLVWPLCRESWRDKESHTLRLVVR 689
>gnl|CDD|235993 PRK07331, PRK07331, cobalt transport protein CbiM; Provisional.
Length = 322
Score = 30.8 bits (70), Expect = 2.3
Identities = 11/30 (36%), Positives = 18/30 (60%)
Query: 529 QTLGYMISVGAGVVLFLLILLAIIYARLRK 558
+ LGY++S GV++FL+I I + K
Sbjct: 292 EVLGYILSAVVGVLIFLIIFKIISSLKKNK 321
>gnl|CDD|235668 PRK06007, fliF, flagellar MS-ring protein; Reviewed.
Length = 542
Score = 30.6 bits (70), Expect = 2.9
Identities = 15/86 (17%), Positives = 32/86 (37%), Gaps = 2/86 (2%)
Query: 512 ISFYFPPSETISAVSEAQTLGYM--ISVGAGVVLFLLILLAIIYARLRKATPVKNTENIE 569
+ F E + + Q +M I + AG +L L+++ ++ RLR P + E
Sbjct: 416 VPFDEEAEEEAAELPFWQQPWFMDLIKLAAGALLILILIFFVLRPRLRPLLPEEELAEEE 475
Query: 570 GAVRKEEQTAAVMSYASLSDGGGDNI 595
A + + + + +
Sbjct: 476 AAAEEAALEEDEEAALEVELSDDEEL 501
>gnl|CDD|234485 TIGR04154, archaeo_STT3, oligosaccharyl transferase, archaeosortase
A system-associated. Members of this protein family
occur, one to three members per genome, in the same
species of Euryarchaeota as contain the predicted
protein-sorting enzyme archaeosortase (TIGR04125) and
its cognate protein-sorting signal PGF-CTERM
(TIGR04126).
Length = 815
Score = 30.4 bits (69), Expect = 3.3
Identities = 17/68 (25%), Positives = 28/68 (41%), Gaps = 11/68 (16%)
Query: 504 WDNMSPQFISFYFPPSETISAVSEAQTLGYM----------ISVGAGVVLFLLILLAIIY 553
+ ++ + F+F T ++EAQ L + G G +L L L ++Y
Sbjct: 325 FSLIN-NGLRFFFGGGGTALTIAEAQPLFSTGGEFSLAPAWSNFGLGFLLALAGLALLLY 383
Query: 554 ARLRKATP 561
LRK P
Sbjct: 384 RPLRKYRP 391
>gnl|CDD|223744 COG0672, FTR1, High-affinity Fe2+/Pb2+ permease [Inorganic ion
transport and metabolism].
Length = 383
Score = 30.1 bits (68), Expect = 3.6
Identities = 15/86 (17%), Positives = 25/86 (29%), Gaps = 14/86 (16%)
Query: 529 QTLGYMISVGAGVVLFLLILLAIIYARLRKATPVKNTENIEGAVR------------KEE 576
+ VG +L+L+ ++ P+K I A E
Sbjct: 277 AATPSALEVGVYFGYLILVLILLLIRYRSVKLPLKPFFQITSAFLFYMCFVFAGKGVLEL 336
Query: 577 QTAAVMSYASLSDGGGD--NIYENVA 600
Q A V+S + IY +
Sbjct: 337 QEAGVISPSLEPGPSISWLGIYPSYE 362
>gnl|CDD|237095 PRK12417, secY, preprotein translocase subunit SecY; Reviewed.
Length = 404
Score = 30.0 bits (68), Expect = 3.7
Identities = 17/77 (22%), Positives = 33/77 (42%), Gaps = 13/77 (16%)
Query: 510 QFISFYFPPSETISAVSEAQTLGYMISVGAGVVLFLLIL--LAIIYARL-----RKATPV 562
I FP + ISA + + G + GV ++L++ L+ ++ + + A +
Sbjct: 258 ILILSIFPNNSGISADMDMLSFGSPL----GVTVYLILQFLLSYFFSFVNINTKQIAKDM 313
Query: 563 -KNTENIEGAVRKEEQT 578
K+ I G V + T
Sbjct: 314 LKSGNYIPG-VYPGKDT 329
>gnl|CDD|163525 TIGR03813, put_Glu_GABA_T, putative glutamate/gamma-aminobutyrate
antiporter. Members of this protein family are putative
putative glutamate/gamma-aminobutyrate antiporters. Each
member of the seed alignment is found adjacent to a
glutamate decarboxylase, which converts glutamate (Glu)
to gamma-aminobutyrate (GABA). However, the majority
belong to genome contexts with a glutaminase (converts
Gln to Glu) as well as the decarboxylase that converts
Glu to GABA. The specificity of the transporter remains
uncertain.
Length = 474
Score = 30.2 bits (68), Expect = 4.1
Identities = 17/59 (28%), Positives = 26/59 (44%), Gaps = 6/59 (10%)
Query: 511 FISFYFPPSETISAVSEAQTLGYMISVGAGVVLFLLILLAIIYARLRKATPVKNTENIE 569
F+ + PPS+ I S + +G +I G+ + +IYAR RK N E
Sbjct: 417 FVLSFIPPSQ-IGTGSPTRYVGLLI----GLAALFVTAPFLIYAR-RKPHWRDPNSNFE 469
>gnl|CDD|216184 pfam00907, T-box, T-box. The T-box encodes a 180 amino acid domain
that binds to DNA. Genes encoding T-box proteins are
found in a wide range of animals, but not in other
kingdoms such as plants. Family members are all thought
to bind to the DNA consensus sequence TCACACCT. they are
found exclusively in the nucleus, and perform
DNA-binding and transcriptional activation/repression
roles. They are generally required for development of
the specific tissues they are expressed in, and
mutations in T-box genes are implicated in human
conditions such as DiGeorge syndrome and X-linked cleft
palate, which feature malformations.
Length = 182
Score = 29.1 bits (66), Expect = 4.2
Identities = 22/70 (31%), Positives = 29/70 (41%), Gaps = 18/70 (25%)
Query: 348 IVTTSGRR---------TG--SVATYSCEPGFILFGSNELKCGLGGDW--SGKA----PT 390
I+T SGRR +G A YS F+ + K G+W +GKA P
Sbjct: 21 IITKSGRRMFPTLKVSVSGLDPNALYSVLLDFVPVDDHRYKFH-NGEWVVAGKAEPHPPP 79
Query: 391 CKYVDCGSPA 400
Y+ SPA
Sbjct: 80 RVYIHPDSPA 89
>gnl|CDD|221775 pfam12794, MscS_TM, Mechanosensitive ion channel inner membrane
domain 1. The small mechanosensitive channel, MscS, is
a part of the turgor-driven solute efflux system that
protects bacteria from lysis in the event of osmotic
shock. The MscS protein alone is sufficient to form a
functional mechanosensitive channel gated directly by
tension in the lipid bilayer. The MscS proteins are
heptamers of three transmembrane subunits with seven
converging M3 domains, and this domain is one of the
inner membrane domains.
Length = 339
Score = 29.9 bits (68), Expect = 4.4
Identities = 14/62 (22%), Positives = 21/62 (33%), Gaps = 11/62 (17%)
Query: 506 NMSPQFISFY----------FPPSETISAVSEAQTLGYMISV-GAGVVLFLLILLAIIYA 554
+ ++ P IS +E G V G + LLILL++
Sbjct: 116 GWPKERVARLRRRLRRLIWVLVPLLFISIFAENLPDGLARDVLGRLAFILLLILLSLFLW 175
Query: 555 RL 556
RL
Sbjct: 176 RL 177
>gnl|CDD|153066 cd03596, CLECT_tetranectin_like, C-type lectin-like domain (CTLD)
of the type found in the tetranectin (TN), cartilage
derived C-type lectin (CLECSF1), and stem cell growth
factor (SCGF). CLECT_tetranectin_like: C-type
lectin-like domain (CTLD) of the type found in the
tetranectin (TN), cartilage derived C-type lectin
(CLECSF1), and stem cell growth factor (SCGF). CTLD
refers to a domain homologous to the
carbohydrate-recognition domains (CRDs) of the C-type
lectins. TN binds to plasminogen and stimulates
activation of plasminogen, playing a key role in the
regulation of proteolytic processes. The TN CTLD binds
two calcium ions. Its calcium free form binds to
various kringle-like protein ligands. Two residues
involved in the coordination of calcium are critical for
the binding of TN to the fourth kringle (K4) domain of
plasminogen (Plg K4). TN binds the kringle 1-4 form of
angiostatin (AST K1-4). AST K1-4 is a fragment of Plg,
commonly found in cancer tissues. TN inhibits the
binding of Plg and AST K1-4 to the extracellular matrix
(EMC) of endothelial cells and counteracts the
antiproliferative effects of AST K1-4 on these cells.
TN also binds the tenth kringle domain of apolipoprotein
(a). In addition, TN binds fibrin and complex
polysaccharides in a Ca2+ dependent manner. The binding
site for complex sulfated polysaccharides is N-terminal
to the CTLD. TN is homotrimeric; N-terminal to the CTLD
is an alpha helical domain responsible for trimerization
of monomeric units. TN may modulate angiogenesis
through interactions with angiostatin and coagulation
through interaction with fibrin. TN may play a role in
myogenesis and in bone development. Mice having a
deletion in the TN gene exhibit a kyphotic spine
abnormality. TN is a useful prognostic marker of
certain cancer types. CLECSF1 is expressed in cartilage
tissue, which is primarily intracellular matrix (ECM),
and is a candidate for organizing ECM. SCGF is strongly
expressed in bone marrow and is a cytokine for primitive
hematopoietic progenitor cells.
Length = 129
Score = 28.5 bits (64), Expect = 4.5
Identities = 25/119 (21%), Positives = 45/119 (37%), Gaps = 17/119 (14%)
Query: 103 SFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLL-----VWIGAQKDPGIT 157
+ +A C A GG L E + +D +K + VW+G +
Sbjct: 20 HYHEASEDCIARGGTLATPRDS-------DENDALRDYVKASVPGNWEVWLGIND--MVA 70
Query: 158 ARTWKWVDGEVVTKPSWGKDQPNNYNG--EQNCVVLDGGRGWLWNDVGCKLDYLHWICQ 214
W V+G ++ +W ++ +G +NCV L W D C+ ++C+
Sbjct: 71 EGKWVDVNGSPISYFNWEREITAQPDGGKRENCVALSSSAQGKWFDEDCR-REKPYVCE 128
>gnl|CDD|223279 COG0201, SecY, Preprotein translocase subunit SecY [Intracellular
trafficking and secretion].
Length = 436
Score = 29.5 bits (67), Expect = 5.8
Identities = 15/85 (17%), Positives = 34/85 (40%), Gaps = 9/85 (10%)
Query: 509 PQFISFYFPPSETISAVSEAQTLGYMISVGAGVVLFLLILLAIIYARLRKAT---PVKNT 565
P I+ + +S ++ +S G+ V + L ++L I ++ P +
Sbjct: 282 PSTIAQFLGNGGLAYYLSPLTSISDALSPGSPVYIALYVVLIIFFSYFYTEIQFNPEEIA 341
Query: 566 ENIE--GA----VRKEEQTAAVMSY 584
EN++ G +R + T ++
Sbjct: 342 ENLKKSGGFIPGIRPGKDTEKYLNR 366
>gnl|CDD|233140 TIGR00816, tdt, C4-dicarboxylate transporter/malic acid transport
protein. The Tellurite-Resistance/Dicarboxylate
Transporter (TDT) Family (TC 2.A.16)Two members of the
TDT family have been functionally characterized. One is
the TehA protein of E. coli which has been implicated in
resistance to tellurite; the other is the Mae1 protein
of S. pombe which functions in the uptake of malate and
other dicarboxylates by a proton symportmechanism. These
proteins exhibit 10 putative transmembrane
a-helicalspanners (TMSs) [Transport and binding
proteins, Carbohydrates, organic alcohols, and acids].
Length = 320
Score = 29.3 bits (66), Expect = 6.9
Identities = 18/61 (29%), Positives = 25/61 (40%), Gaps = 11/61 (18%)
Query: 513 SFYFPPSETI------SAVSEAQTLGYMISV----GAGVVLFLLILLAIIYARLRKATPV 562
+Y PP I A+ TLG + GAG L+ L L A++ RLR +
Sbjct: 133 GWYLPPVALIFISAAALAILPYLTLGLTVINYLFWGAGFFLY-LSLEAVLLLRLRLHGLL 191
Query: 563 K 563
Sbjct: 192 P 192
>gnl|CDD|215646 pfam00001, 7tm_1, 7 transmembrane receptor (rhodopsin family).
This family contains, amongst other G-protein-coupled
receptors (GCPRs), members of the opsin family, which
have been considered to be typical members of the
rhodopsin superfamily. They share several motifs, mainly
the seven transmembrane helices, GCPRs of the rhodopsin
superfamily. All opsins bind a chromophore, such as
11-cis-retinal. The function of most opsins other than
the photoisomerases is split into two steps: light
absorption and G-protein activation. Photoisomerases, on
the other hand, are not coupled to G-proteins - they are
thought to generate and supply the chromophore that is
used by visual opsins.
Length = 251
Score = 28.4 bits (64), Expect = 8.9
Identities = 16/81 (19%), Positives = 28/81 (34%), Gaps = 9/81 (11%)
Query: 511 FISFYFPPSE-----TISAVSEAQTLGY--MISVGAGVVLFLLILLA--IIYARLRKATP 561
F I E+ Y + ++ V+ L+IL+ +I LRK
Sbjct: 116 FSWLRTVEEGNVTTCLIDFPEESTKRSYTLLSTLLGFVLPLLVILVCYTLILRTLRKRAR 175
Query: 562 VKNTENIEGAVRKEEQTAAVM 582
++ +E+ AA M
Sbjct: 176 SGASQARAKRSSSKERKAAKM 196
>gnl|CDD|185731 cd08990, GH43_AXH_like, Glycosyl hydrolase family 43, includes
arabinoxylan arabinofuranohydrolase, beta-xylosidase,
endo-1,4-beta-xylanase, alpha-L-arabinofuranosidase.
This glycosyl hydrolase family 43 (GH43) includes
enzymes that have been characterized with
beta-xylosidase (EC 3.2.1.37),
alpha-L-arabinofuranosidase (EC 3.2.1.55),
endo-alpha-L-arabinanase as well as arabinoxylan
arabinofuranohydrolase (AXH) activities. These are all
inverting enzymes (i.e. they invert the stereochemistry
of the anomeric carbon atom of the substrate) that have
an aspartate as the catalytic general base, a glutamate
as the catalytic general acid and another aspartate that
is responsible for pKa modulation and orienting the
catalytic acid. Many of the enzymes in this family
display both alpha-L-arabinofuranosidase and
beta-D-xylosidase activity using aryl-glycosides as
substrates. AXHs specifically hydrolyze the glycosidic
bond between arabinofuranosyl substituents and
xylopyranosyl backbone residues of arabinoxylan. Several
of these enzymes also contain carbohydrate binding
modules (CBMs) that bind cellulose or xylan. A common
structural feature of GH43 enzymes is a 5-bladed
beta-propeller domain that contains the catalytic acid
and catalytic base. A long V-shaped groove, partially
enclosed at one end, forms a single extended
substrate-binding surface across the face of the
propeller.
Length = 274
Score = 28.7 bits (65), Expect = 9.8
Identities = 13/39 (33%), Positives = 18/39 (46%), Gaps = 10/39 (25%)
Query: 411 NGTTTHGSVAEYTCDDDYWL--------EAGAMRRQVCM 441
NG T HGS+ E+ ++L G RR VC+
Sbjct: 225 NGGTNHGSIVEF--KGQWYLFYHRATLSGGGDFRRSVCI 261
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.317 0.135 0.431
Gapped
Lambda K H
0.267 0.0746 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 30,659,270
Number of extensions: 2899601
Number of successful extensions: 2533
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2470
Number of HSP's successfully gapped: 78
Length of query: 618
Length of database: 10,937,602
Length adjustment: 103
Effective length of query: 515
Effective length of database: 6,369,140
Effective search space: 3280107100
Effective search space used: 3280107100
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 62 (27.6 bits)