RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy1672
(600 letters)
>gnl|CDD|214653 smart00410, IG_like, Immunoglobulin like. IG domains that cannot
be classified into one of IGv1, IGc1, IGc2, IG.
Length = 85
Score = 69.5 bits (170), Expect = 9e-15
Identities = 28/93 (30%), Positives = 39/93 (41%), Gaps = 8/93 (8%)
Query: 261 SRYVEAVSSENATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSS 320
V E+ T+ C PP ++WY G LL + R V G S+
Sbjct: 1 PPSVTVKEGESVTLSCEASGSPPPEVTWYKQGGKLLAES-----GRFSVSRSG---STST 52
Query: 321 LVLTNAQESDSGRFYCVAENRAGIADANFTLQV 353
L ++N DSG + C A N +G A + TL V
Sbjct: 53 LTISNVTPEDSGTYTCAATNSSGSASSGTTLTV 85
>gnl|CDD|214652 smart00409, IG, Immunoglobulin.
Length = 85
Score = 69.5 bits (170), Expect = 9e-15
Identities = 28/93 (30%), Positives = 39/93 (41%), Gaps = 8/93 (8%)
Query: 261 SRYVEAVSSENATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSS 320
V E+ T+ C PP ++WY G LL + R V G S+
Sbjct: 1 PPSVTVKEGESVTLSCEASGSPPPEVTWYKQGGKLLAES-----GRFSVSRSG---STST 52
Query: 321 LVLTNAQESDSGRFYCVAENRAGIADANFTLQV 353
L ++N DSG + C A N +G A + TL V
Sbjct: 53 LTISNVTPEDSGTYTCAATNSSGSASSGTTLTV 85
>gnl|CDD|143165 cd00096, Ig, Immunoglobulin domain. Ig: immunoglobulin (Ig) domain
found in the Ig superfamily. The Ig superfamily is a
heterogenous group of proteins, built on a common fold
comprised of a sandwich of two beta sheets. Members of
this group are components of immunoglobulin, neuroglia,
cell surface glycoproteins, such as, T-cell receptors,
CD2, CD4, CD8, and membrane glycoproteins, such as,
butyrophilin and chondroitin sulfate proteoglycan core
protein. A predominant feature of most Ig domains is a
disulfide bridge connecting the two beta-sheets with a
tryptophan residue packed against the disulfide bond.
Length = 74
Score = 64.4 bits (156), Expect = 4e-13
Identities = 24/79 (30%), Positives = 34/79 (43%), Gaps = 5/79 (6%)
Query: 272 ATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQESDS 331
T+ C PP I+W NG+ L ++ G S+L ++N DS
Sbjct: 1 VTLTCLASGPPPPTITWLKNGKPLPSSVLTRVRSSR-----GTSSGSSTLTISNVTLEDS 55
Query: 332 GRFYCVAENRAGIADANFT 350
G + CVA N AG A+ T
Sbjct: 56 GTYTCVASNSAGTVSASVT 74
>gnl|CDD|206026 pfam13855, LRR_8, Leucine rich repeat.
Length = 60
Score = 59.9 bits (146), Expect = 1e-11
Identities = 25/60 (41%), Positives = 33/60 (55%)
Query: 76 NLQKLFLARCHIGQIDSGALDGLTNLIEIDLSDNLLTSIPSLTFQSVRFLRDLNLARNPI 135
NL+ L L+ + I GA GL NL +DLS N LTSI F + LR L+L+ N +
Sbjct: 1 NLKSLDLSNNRLTVIPDGAFKGLPNLKVLDLSGNNLTSISPEAFSGLPSLRSLDLSGNNL 60
Score = 58.7 bits (143), Expect = 3e-11
Identities = 28/59 (47%), Positives = 40/59 (67%)
Query: 125 LRDLNLARNPISKIEKGAFQFVPGLVKLDMSESRLEHISPEAFTGAKSLESIKLNGNRL 183
L+ L+L+ N ++ I GAF+ +P L LD+S + L ISPEAF+G SL S+ L+GN L
Sbjct: 2 LKSLDLSNNRLTVIPDGAFKGLPNLKVLDLSGNNLTSISPEAFSGLPSLRSLDLSGNNL 60
Score = 54.1 bits (131), Expect = 1e-09
Identities = 24/60 (40%), Positives = 35/60 (58%), Gaps = 2/60 (3%)
Query: 52 QVLDMSGNNLQILPKEAFRRAGLLNLQKLFLARCHIGQIDSGALDGLTNLIEIDLSDNLL 111
+ LD+S N L ++P AF+ GL NL+ L L+ ++ I A GL +L +DLS N L
Sbjct: 3 KSLDLSNNRLTVIPDGAFK--GLPNLKVLDLSGNNLTSISPEAFSGLPSLRSLDLSGNNL 60
Score = 50.6 bits (122), Expect = 2e-08
Identities = 24/60 (40%), Positives = 36/60 (60%)
Query: 100 NLIEIDLSDNLLTSIPSLTFQSVRFLRDLNLARNPISKIEKGAFQFVPGLVKLDMSESRL 159
NL +DLS+N LT IP F+ + L+ L+L+ N ++ I AF +P L LD+S + L
Sbjct: 1 NLKSLDLSNNRLTVIPDGAFKGLPNLKVLDLSGNNLTSISPEAFSGLPSLRSLDLSGNNL 60
Score = 45.2 bits (108), Expect = 1e-06
Identities = 19/59 (32%), Positives = 29/59 (49%)
Query: 148 GLVKLDMSESRLEHISPEAFTGAKSLESIKLNGNRLSHFPVRSVEPLLKLMMIELHDNP 206
L LD+S +RL I AF G +L+ + L+GN L+ + L L ++L N
Sbjct: 1 NLKSLDLSNNRLTVIPDGAFKGLPNLKVLDLSGNNLTSISPEAFSGLPSLRSLDLSGNN 59
>gnl|CDD|214507 smart00082, LRRCT, Leucine rich repeat C-terminal domain.
Length = 51
Score = 52.8 bits (127), Expect = 3e-09
Identities = 16/52 (30%), Positives = 26/52 (50%), Gaps = 3/52 (5%)
Query: 205 NPWVCDCNMRSIKMWLADKKNVPVQPA--CTGPERLSGKVFSDLHADDFACK 254
NP++CDC +R + WL +++ C P L G + LH+ +F C
Sbjct: 1 NPFICDCELRWLLRWLQANEHLQDPVDLRCASPSSLRGPLLELLHS-EFKCP 51
>gnl|CDD|191810 pfam07679, I-set, Immunoglobulin I-set domain.
Length = 90
Score = 53.8 bits (130), Expect = 4e-09
Identities = 27/87 (31%), Positives = 41/87 (47%), Gaps = 11/87 (12%)
Query: 269 SENATVV--CRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNA 326
E + C V P +SW+ +G+ L ++ F E G Y +L ++N
Sbjct: 13 QEGESARFTCTVTGDPDPTVSWFKDGQPLRSSDRFK-----VTYEGGTY----TLTISNV 63
Query: 327 QESDSGRFYCVAENRAGIADANFTLQV 353
Q D G++ CVA N AG A+A+ L V
Sbjct: 64 QPDDEGKYTCVATNSAGEAEASAELTV 90
>gnl|CDD|227223 COG4886, COG4886, Leucine-rich repeat (LRR) protein [Function
unknown].
Length = 394
Score = 54.6 bits (131), Expect = 7e-08
Identities = 39/147 (26%), Positives = 66/147 (44%), Gaps = 4/147 (2%)
Query: 74 LLNLQKLFLARCHIGQIDSGALDGLTNLIEIDLSDNLLTSIPSLTFQSVRFLRDLNLARN 133
LL L L L + + L LTNL +DL +N +T IP L L++L+L+ N
Sbjct: 92 LLPLPSLDLNLNRLR-SNISELLELTNLTSLDLDNNNITDIPPLIGLLKSNLKELDLSDN 150
Query: 134 PISKIEKGAFQFVPGLVKLDMSESRLEHISPEAFTGAKSLESIKLNGNRLSHFPVRSVEP 193
I + + +P L LD+S + L + + +L ++ L+GN++S P
Sbjct: 151 KIESLP-SPLRNLPNLKNLDLSFNDLSDLPKLL-SNLSNLNNLDLSGNKISDLPPEIELL 208
Query: 194 LLKLMMIELHDNPWVCDCNMRSIKMWL 220
L ++L +N + + S L
Sbjct: 209 -SALEELDLSNNSIIELLSSLSNLKNL 234
Score = 44.2 bits (104), Expect = 1e-04
Identities = 38/136 (27%), Positives = 68/136 (50%), Gaps = 8/136 (5%)
Query: 52 QVLDMSGNNLQILPKEAFRRAGLLNLQKLFLARCHIGQIDSGALDGLTNLIEIDLSDNLL 111
+ LD+S N+L LPK + L NL L L+ I + ++ L+ L E+DLS+N +
Sbjct: 166 KNLDLSFNDLSDLPKL---LSNLSNLNNLDLSGNKISDL-PPEIELLSALEELDLSNNSI 221
Query: 112 TSIPSLTFQSVRFLRDLNLARNPISKIEKGAFQFVPGLVKLDMSESRLEHISPEAFTGAK 171
+ S + +++ L L L+ N + + + + L LD+S +++ IS
Sbjct: 222 IELLS-SLSNLKNLSGLELSNNKLEDLPES-IGNLSNLETLDLSNNQISSISSLG--SLT 277
Query: 172 SLESIKLNGNRLSHFP 187
+L + L+GN LS+
Sbjct: 278 NLRELDLSGNSLSNAL 293
Score = 38.8 bits (90), Expect = 0.006
Identities = 38/137 (27%), Positives = 60/137 (43%), Gaps = 7/137 (5%)
Query: 43 PEAPESELTQVLDMSGNNLQILPKEAFRRAGLLNLQKLFLARCHIGQIDSGALDGLTNLI 102
LD+SGN + LP E L L++L L+ I ++ S +L L NL
Sbjct: 180 KLLSNLSNLNNLDLSGNKISDLPPEI---ELLSALEELDLSNNSIIELLS-SLSNLKNLS 235
Query: 103 EIDLSDNLLTSIPSLTFQSVRFLRDLNLARNPISKIEKGAFQFVPGLVKLDMSESRLEHI 162
++LS+N L +P ++ L L+L+ N IS I + L +LD+S + L +
Sbjct: 236 GLELSNNKLEDLPES-IGNLSNLETLDLSNNQISSISSLG--SLTNLRELDLSGNSLSNA 292
Query: 163 SPEAFTGAKSLESIKLN 179
P LE +
Sbjct: 293 LPLIALLLLLLELLLNL 309
>gnl|CDD|197706 smart00408, IGc2, Immunoglobulin C-2 Type.
Length = 63
Score = 46.2 bits (110), Expect = 7e-07
Identities = 18/74 (24%), Positives = 28/74 (37%), Gaps = 13/74 (17%)
Query: 270 ENATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQES 329
++ T+ C + P I+W + R S+L + +
Sbjct: 3 QSVTLTCPAEGNPVPNITWL------KDGKPLPESNRF-------VASGSTLTIKSVSLE 49
Query: 330 DSGRFYCVAENRAG 343
DSG + CVAEN AG
Sbjct: 50 DSGLYTCVAENSAG 63
>gnl|CDD|206066 pfam13895, Ig_2, Immunoglobulin domain. This domain contains
immunoglobulin-like domains.
Length = 80
Score = 45.9 bits (109), Expect = 1e-06
Identities = 27/101 (26%), Positives = 34/101 (33%), Gaps = 22/101 (21%)
Query: 254 KPEIRMDSRYVEAVSSENATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQG 313
KP + V E+ T+ C PP +WY + SS Q F
Sbjct: 1 KPVLTPSPTVVFE--GEDVTLTCSAPGNPPPNYTWY------KDGVPLSSSQNGFFT--- 49
Query: 314 EYERKSSLVLTNAQESDSGRFYCVAENRAGIADAN-FTLQV 353
N DSG + CVA N G +N TL V
Sbjct: 50 ----------PNVSAEDSGTYTCVASNGGGGKTSNPVTLTV 80
>gnl|CDD|143235 cd05758, Ig5_KIRREL3-like, Fifth immunoglobulin (Ig)-like domain of
Kirrel (kin of irregular chiasm-like) 3 (also known as
Neph2) and similar proteins. Ig5_KIRREL3-like: domain
similar to the fifth immunoglobulin (Ig)-like domain of
Kirrel (kin of irregular chiasm-like) 3 (also known as
Neph2). This protein has five Ig-like domains, one
transmembrane domain, and a cytoplasmic tail. Included
in this group is mammalian Kirrel (Neph1), Kirrel2
(Neph3), and Drosophila RST (irregular chiasm
C-roughest) protein. These proteins contain multiple Ig
domains, have properties of cell adhesion molecules, and
are important in organ development.
Length = 98
Score = 46.2 bits (110), Expect = 2e-06
Identities = 28/102 (27%), Positives = 39/102 (38%), Gaps = 9/102 (8%)
Query: 255 PEIRMDSRYVEAVSSENATVVCRVDSIPPA-AISWYWNGRLLLNNTAFSSYQRIFVIEQG 313
P I A+ + V C + S PP I W W L S R + +E
Sbjct: 2 PPIITSEATQYAILGDKGRVECFIFSTPPPDRIVWTWKENEL----ESGSSGR-YTVETD 56
Query: 314 EYER--KSSLVLTNAQESD-SGRFYCVAENRAGIADANFTLQ 352
S+L ++N QESD + C A N G A +L+
Sbjct: 57 PSPGGVLSTLTISNTQESDFQTSYNCTAWNSFGSGTAIISLE 98
>gnl|CDD|143170 cd04969, Ig5_Contactin_like, Fifth Ig domain of contactin.
Ig5_Contactin_like: Fifth Ig domain of contactins.
Contactins are neural cell adhesion molecules and are
comprised of six Ig domains followed by four fibronectin
type III(FnIII) domains anchored to the membrane by
glycosylphosphatidylinositol. The first four Ig domains
form the intermolecular binding fragment, which arranges
as a compact U-shaped module via contacts between Ig
domains 1 and 4, and between Ig domains 2 and 3.
Contactin-2 (TAG-1, axonin-1) may play a part in the
neuronal processes of neurite outgrowth, axon guidance
and fasciculation, and neuronal migration. This group
also includes contactin-1 and contactin-5. The different
contactins show different expression patterns in the
central nervous system. During development and in
adulthood, contactin-2 is transiently expressed in
subsets of central and peripheral neurons. Contactin-5
is expressed specifically in the rat postnatal nervous
system, peaking at about 3 weeks postnatal, and a lack
of contactin-5 (NB-2) results in an impairment of
neuronal act ivity in the rat auditory system.
Contactin-5 is highly expressed in the adult human brain
in the occipital lobe and in the amygdala. Contactin-1
is differentially expressed in tumor tissues and may,
through a RhoA mechanism, facilitate invasion and
metastasis of human lung adenocarcinoma.
Length = 73
Score = 45.1 bits (107), Expect = 2e-06
Identities = 25/78 (32%), Positives = 37/78 (47%), Gaps = 12/78 (15%)
Query: 276 CRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQESDSGRFY 335
C+ + P ISW LL N++ RI + G SL + N +SD G++
Sbjct: 8 CKPKAAPKPTISWSKGTELLTNSS------RICIWPDG------SLEILNVTKSDEGKYT 55
Query: 336 CVAENRAGIADANFTLQV 353
C AEN G A++ +L V
Sbjct: 56 CFAENFFGKANSTGSLSV 73
>gnl|CDD|143207 cd05730, Ig3_NCAM-1_like, Third immunoglobulin (Ig)-like domain of
Neural Cell Adhesion Molecule NCAM-1 (NCAM).
Ig3_NCAM-1_like: domain similar to the third
immunoglobulin (Ig)-like domain of Neural Cell Adhesion
Molecule NCAM-1 (NCAM). NCAM plays important roles in
the development and regeneration of the central nervous
system, in synaptogenesis and neural migration. NCAM
mediates cell-cell and cell-substratum recognition and
adhesion via homophilic (NCAM-NCAM), and heterophilic
(NCAM-non-NCAM), interactions. NCAM is expressed as
three major isoforms having different intracellular
extensions. The extracellular portion of NCAM has five
N-terminal Ig-like domains and two fibronectin type III
domains. The double zipper adhesion complex model for
NCAM homophilic binding involves Ig1, Ig2, and Ig3. By
this model, Ig1,and Ig2 mediate dimerization of NCAM
molecules situated on the same cell surface (cis
interactions), and Ig3 domains mediate interactions
between NCAM molecules expressed on the surface of
opposing cells (trans interactions), through binding to
the Ig1 and Ig2 domains. The adhesive ability of NCAM is
modulated by the addition of polysialic acid chains to
the fifth Ig-like domain.
Length = 95
Score = 45.7 bits (108), Expect = 3e-06
Identities = 27/106 (25%), Positives = 43/106 (40%), Gaps = 22/106 (20%)
Query: 255 PEIRMDSRYVEAVS--SENATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQ 312
P IR V A + ++ T+ C D P ++W +G IE
Sbjct: 2 PTIRARQSEVNATANLGQSVTLACDADGFPEPTMTWTKDGE---------------PIES 46
Query: 313 GE-----YERKSSLVLTNAQESDSGRFYCVAENRAGIADANFTLQV 353
GE E S + + + + D + C+AEN+AG +A L+V
Sbjct: 47 GEEKYSFNEDGSEMTILDVDKLDEAEYTCIAENKAGEQEAEIHLKV 92
>gnl|CDD|143179 cd04978, Ig4_L1-NrCAM_like, Fourth immunoglobulin (Ig)-like domain
of L1, Ng-CAM (Neuron-glia CAM cell adhesion molecule),
and NrCAM (Ng-CAM-related). Ig4_L1-NrCAM_like: fourth
immunoglobulin (Ig)-like domain of L1, Ng-CAM
(Neuron-glia CAM cell adhesion molecule), and NrCAM
(Ng-CAM-related). These proteins belong to the L1
subfamily of cell adhesion molecules (CAMs) and are
comprised of an extracellular region having six Ig-like
domains and five fibronectin type III domains, a
transmembrane region and an intracellular domain. These
molecules are primarily expressed in the nervous system.
L1 is associated with an X-linked recessive disorder,
X-linked hydrocephalus, MASA syndrome, or spastic
paraplegia type 1, that involves abnormalities of axonal
growth.
Length = 76
Score = 45.1 bits (107), Expect = 3e-06
Identities = 21/84 (25%), Positives = 34/84 (40%), Gaps = 10/84 (11%)
Query: 270 ENATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQES 329
E + C + IP I+W NG + +R+ +L+L+N Q +
Sbjct: 2 ETGRLDCEAEGIPQPTITWRLNGVPI-EELPPDPRRRVD---------GGTLILSNVQPN 51
Query: 330 DSGRFYCVAENRAGIADANFTLQV 353
D+ + C A N G AN + V
Sbjct: 52 DTAVYQCNASNVHGYLLANAFVHV 75
>gnl|CDD|143206 cd05729, Ig2_FGFR_like, Second immunoglobulin (Ig)-like domain of
fibroblast growth factor (FGF) receptor and similar
proteins. Ig2_FGFR_like: domain similar to the second
immunoglobulin (Ig)-like domain of fibroblast growth
factor (FGF) receptor. FGF receptors bind FGF signaling
polypeptides. FGFs participate in multiple processes
such as morphogenesis, development, and angiogenesis.
FGFs bind to four FGF receptor tyrosine kinases (FGFR1,
-2, -3, -4). Receptor diversity is controlled by
alternative splicing producing splice variants with
different ligand binding characteristics and different
expression patterns. FGFRs have an extracellular region
comprised of three Ig-like domains, a single
transmembrane helix, and an intracellular tyrosine
kinase domain. Ligand binding and specificity reside in
the Ig-like domains 2 and 3, and the linker region that
connects these two. FGFR activation and signaling depend
on FGF-induced dimerization, a process involving cell
surface heparin or heparin sulfate proteoglycans. This
group also contains fibroblast growth factor (FGF)
receptor_like-1(FGFRL1). FGFRL1 does not have a protein
tyrosine kinase domain at its C terminus; neither does
its cytoplasmic domain appear to interact with a
signaling partner. It has been suggested that FGFRL1 may
not have any direct signaling function, but instead acts
as a decoy receptor trapping FGFs and preventing them
from binding other receptors.
Length = 85
Score = 44.3 bits (105), Expect = 5e-06
Identities = 18/79 (22%), Positives = 35/79 (44%), Gaps = 10/79 (12%)
Query: 276 CRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKS-SLVLTNAQESDSGRF 334
C P I+W +G+ ++ I + +K +L+L + SDSG++
Sbjct: 16 CPASGNPRPTITWLKDGKPF---------KKEHRIGGYKVRKKKWTLILESVVPSDSGKY 66
Query: 335 YCVAENRAGIADANFTLQV 353
C+ EN+ G + + + V
Sbjct: 67 TCIVENKYGSINHTYKVDV 85
>gnl|CDD|222457 pfam13927, Ig_3, Immunoglobulin domain. This family contains
immunoglobulin-like domains.
Length = 74
Score = 44.3 bits (104), Expect = 5e-06
Identities = 25/88 (28%), Positives = 34/88 (38%), Gaps = 15/88 (17%)
Query: 254 KPEIRMDSRYVEAVSSENATVVCRVDSIPPAA-ISWYWNGRLLLNNTAFSSYQRIFVIEQ 312
KP I + S T+ C + PP ISWY NG + + S
Sbjct: 1 KPVITVSPSPSV-TSGGGVTLTCSAEGGPPPPTISWYRNGSISGGSGGLGSSG------- 52
Query: 313 GEYERKSSLVLTNAQESDSGRFYCVAEN 340
S+L L++ DSG + CVA N
Sbjct: 53 ------STLTLSSVTSEDSGTYTCVASN 74
>gnl|CDD|215061 PLN00113, PLN00113, leucine-rich repeat receptor-like protein
kinase; Provisional.
Length = 968
Score = 48.7 bits (116), Expect = 7e-06
Identities = 44/137 (32%), Positives = 61/137 (44%), Gaps = 7/137 (5%)
Query: 50 LTQVLDMSGNNLQILPKEAFRRAGLLNLQKLFLARCHI--GQIDSGALDGLTNLIEIDLS 107
L LD+S NNLQ + R+ + +LQ L LAR G DS L NL DLS
Sbjct: 429 LVYFLDISNNNLQ--GRINSRKWDMPSLQMLSLARNKFFGGLPDSFGSKRLENL---DLS 483
Query: 108 DNLLTSIPSLTFQSVRFLRDLNLARNPISKIEKGAFQFVPGLVKLDMSESRLEHISPEAF 167
N + S+ L L L+ N +S LV LD+S ++L P +F
Sbjct: 484 RNQFSGAVPRKLGSLSELMQLKLSENKLSGEIPDELSSCKKLVSLDLSHNQLSGQIPASF 543
Query: 168 TGAKSLESIKLNGNRLS 184
+ L + L+ N+LS
Sbjct: 544 SEMPVLSQLDLSQNQLS 560
Score = 40.6 bits (95), Expect = 0.002
Identities = 33/132 (25%), Positives = 62/132 (46%), Gaps = 3/132 (2%)
Query: 52 QVLDMSGNNLQILPKEAFRRAGLLNLQKLFLARCHIGQIDSGALDGLTNLIEIDLSDNLL 111
Q+L ++ N ++F L+ L L+R L L+ L+++ LS+N L
Sbjct: 455 QMLSLARNKFFGGLPDSFGSK---RLENLDLSRNQFSGAVPRKLGSLSELMQLKLSENKL 511
Query: 112 TSIPSLTFQSVRFLRDLNLARNPISKIEKGAFQFVPGLVKLDMSESRLEHISPEAFTGAK 171
+ S + L L+L+ N +S +F +P L +LD+S+++L P+ +
Sbjct: 512 SGEIPDELSSCKKLVSLDLSHNQLSGQIPASFSEMPVLSQLDLSQNQLSGEIPKNLGNVE 571
Query: 172 SLESIKLNGNRL 183
SL + ++ N L
Sbjct: 572 SLVQVNISHNHL 583
Score = 38.7 bits (90), Expect = 0.009
Identities = 44/138 (31%), Positives = 63/138 (45%), Gaps = 13/138 (9%)
Query: 88 GQIDSGALDGLTNLIEIDLSDNLLT-SIPSLTFQSVRFLRDLNLARNPIS-KIEKGAFQF 145
G+I S A+ L + I+LS+N L+ IP F + LR LNL+ N + I +G
Sbjct: 83 GKI-SSAIFRLPYIQTINLSNNQLSGPIPDDIFTTSSSLRYLNLSNNNFTGSIPRG---S 138
Query: 146 VPGLVKLDMSESRLEHISPEAFTGAKSLESIKLNGNRLS-HFPVRSVEPLLKLMMIELHD 204
+P L LD+S + L P SL+ + L GN L P S+ L L + L
Sbjct: 139 IPNLETLDLSNNMLSGEIPNDIGSFSSLKVLDLGGNVLVGKIPN-SLTNLTSLEFLTLAS 197
Query: 205 NPWVCDC-----NMRSIK 217
N V M+S+K
Sbjct: 198 NQLVGQIPRELGQMKSLK 215
Score = 37.5 bits (87), Expect = 0.024
Identities = 41/135 (30%), Positives = 61/135 (45%), Gaps = 10/135 (7%)
Query: 54 LDMSGNNL--QILPKEAFRRAGLLNLQKLFLARCHI-GQIDSGALDGLTNLIEIDLSDNL 110
LD+ NNL I L NLQ LFL + + G I ++ L LI +DLSDN
Sbjct: 241 LDLVYNNLTGPIPSS----LGNLKNLQYLFLYQNKLSGPIPP-SIFSLQKLISLDLSDNS 295
Query: 111 LT-SIPSLTFQSVRFLRDLNLARNPISKIEKGAFQFVPGLVKLDMSESRLEHISPEAFTG 169
L+ IP L Q ++ L L+L N + A +P L L + ++ P+
Sbjct: 296 LSGEIPELVIQ-LQNLEILHLFSNNFTGKIPVALTSLPRLQVLQLWSNKFSGEIPKNLGK 354
Query: 170 AKSLESIKLNGNRLS 184
+L + L+ N L+
Sbjct: 355 HNNLTVLDLSTNNLT 369
Score = 37.1 bits (86), Expect = 0.026
Identities = 43/155 (27%), Positives = 76/155 (49%), Gaps = 7/155 (4%)
Query: 53 VLDMSGNNLQILPKEAFRRAGLLNLQKLFLARCHI-GQIDSGALDGLTNLIEIDLSDNLL 111
VLD+S NNL E +G NL KL L + G+I +L +L + L DN
Sbjct: 360 VLDLSTNNLTGEIPEGLCSSG--NLFKLILFSNSLEGEIPK-SLGACRSLRRVRLQDNSF 416
Query: 112 TSIPSLTFQSVRFLRDLNLARNPIS-KIEKGAFQFVPGLVKLDMSESRLEHISPEAFTGA 170
+ F + + L+++ N + +I + +P L L ++ ++ P++F G+
Sbjct: 417 SGELPSEFTKLPLVYFLDISNNNLQGRINSRKWD-MPSLQMLSLARNKFFGGLPDSF-GS 474
Query: 171 KSLESIKLNGNRLSHFPVRSVEPLLKLMMIELHDN 205
K LE++ L+ N+ S R + L +LM ++L +N
Sbjct: 475 KRLENLDLSRNQFSGAVPRKLGSLSELMQLKLSEN 509
Score = 36.7 bits (85), Expect = 0.037
Identities = 35/136 (25%), Positives = 59/136 (43%), Gaps = 4/136 (2%)
Query: 35 RDKFLITIPEAPESELTQVLDMSGNNLQILPKEAFRRAGLLNLQKLFLARCHIGQIDSGA 94
R+KF +P++ S+ + LD+S N L L +L L+ +
Sbjct: 461 RNKFFGGLPDSFGSKRLENLDLSRNQFSGAVPRKLGS--LSELMQLKLSENKLSGEIPDE 518
Query: 95 LDGLTNLIEIDLSDNLLTSIPSLTFQSVRFLRDLNLARNPISKIEKGAFQFVPGLVKLDM 154
L L+ +DLS N L+ +F + L L+L++N +S V LV++++
Sbjct: 519 LSSCKKLVSLDLSHNQLSGQIPASFSEMPVLSQLDLSQNQLSGEIPKNLGNVESLVQVNI 578
Query: 155 SESRLEHISPEAFTGA 170
S + L P TGA
Sbjct: 579 SHNHLHGSLP--STGA 592
Score = 32.5 bits (74), Expect = 0.77
Identities = 33/113 (29%), Positives = 59/113 (52%), Gaps = 4/113 (3%)
Query: 95 LDGLTNLIEIDLS-DNLLTSIPSLTFQSVRFLRDLNLARNPIS-KIEKGAFQFVPGLVKL 152
+ GLT+L +DL +NL IPS + +++ L+ L L +N +S I F + L+ L
Sbjct: 232 IGGLTSLNHLDLVYNNLTGPIPS-SLGNLKNLQYLFLYQNKLSGPIPPSIFS-LQKLISL 289
Query: 153 DMSESRLEHISPEAFTGAKSLESIKLNGNRLSHFPVRSVEPLLKLMMIELHDN 205
D+S++ L PE ++LE + L N + ++ L +L +++L N
Sbjct: 290 DLSDNSLSGEIPELVIQLQNLEILHLFSNNFTGKIPVALTSLPRLQVLQLWSN 342
>gnl|CDD|205486 pfam13306, LRR_5, Leucine rich repeats (6 copies). This family
includes a number of leucine rich repeats. This family
contains a large number of BSPA-like surface antigens
from Trichomonas vaginalis.
Length = 128
Score = 44.4 bits (106), Expect = 1e-05
Identities = 22/83 (26%), Positives = 35/83 (42%), Gaps = 3/83 (3%)
Query: 99 TNLIEIDLSDNLLTSIPSLTFQSVRFLRDLNLARNPISKIEKGAFQFVPGLVKLDMSESR 158
+L I + ++ TSI F L+ + L + ++ I AF L + + S
Sbjct: 11 CSLTSITIPSSV-TSIGEYAFSGCTSLKSITLPSS-LTSIGSYAFYNCSSLTSITIPSS- 67
Query: 159 LEHISPEAFTGAKSLESIKLNGN 181
L I AF+ SL SI + N
Sbjct: 68 LTSIGEYAFSNCSSLTSITIPSN 90
Score = 33.7 bits (78), Expect = 0.073
Identities = 18/86 (20%), Positives = 36/86 (41%), Gaps = 6/86 (6%)
Query: 59 NNLQILPKEAFRRAGLLNLQKLFLARCHIGQIDSGALDGLTNLIEIDLSDNLLTSIPSLT 118
++L + AF +L + + + I A ++L I + N LT+I S
Sbjct: 43 SSLTSIGSYAF--YNCSSLTSITIP-SSLTSIGEYAFSNCSSLTSITIPSN-LTTIGSYA 98
Query: 119 FQSVRFLRDLNLARNPISKIEKGAFQ 144
F + L+ + + + ++ I AF
Sbjct: 99 FSNCS-LKSITIPSS-VTTIGDYAFS 122
>gnl|CDD|143209 cd05732, Ig5_NCAM-1_like, Fifth immunoglobulin (Ig)-like domain of
Neural Cell Adhesion Molecule NCAM-1 (NCAM) and similar
proteins. Ig5_NCAM-1 like: domain similar to the fifth
immunoglobulin (Ig)-like domain of Neural Cell Adhesion
Molecule NCAM-1 (NCAM). NCAM plays important roles in
the development and regeneration of the central nervous
system, in synaptogenesis and neural migration. NCAM
mediates cell-cell and cell-substratum recognition and
adhesion via homophilic (NCAM-NCAM), and heterophilic
(NCAM-non-NCAM), interactions. NCAM is expressed as
three major isoforms having different intracellular
extensions. The extracellular portion of NCAM has five
N-terminal Ig-like domains and two fibronectin type III
domains. The double zipper adhesion complex model for
NCAM homophilic binding involves Ig1, Ig2, and Ig3. By
this model, Ig1 and Ig2 mediate dimerization of NCAM
molecules situated on the same cell surface (cis
interactions), and Ig3 domains mediate interactions
between NCAM molecules expressed on the surface of
opposing cells (trans interactions), through binding to
the Ig1 and Ig2 domains. The adhesive ability of NCAM is
modulated by the addition of polysialic acid chains to
the fifth Ig-like domain. Also included in this group is
NCAM-2 (also known as OCAM/mamFas II and RNCAM) NCAM-2
is differentially expressed in the developing and mature
olfactory epithelium (OE).
Length = 96
Score = 42.5 bits (100), Expect = 4e-05
Identities = 30/93 (32%), Positives = 41/93 (44%), Gaps = 11/93 (11%)
Query: 254 KPEIRMDSRYVE---AVSSENATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVI 310
+P+I Y+E AV E T+ C + P I+W + S RI V
Sbjct: 2 QPKIT----YLENQTAVELEQITLTCEAEGDPIPEITWR-RATRNFSEGDKSLDGRIVVR 56
Query: 311 EQGEYERKSSLVLTNAQESDSGRFYCVAENRAG 343
R SSL L + Q +D+GR+ C A NR G
Sbjct: 57 GH---ARVSSLTLKDVQLTDAGRYDCEASNRIG 86
>gnl|CDD|143224 cd05747, Ig5_Titin_like, M5, fifth immunoglobulin (Ig)-like domain
of human titin C terminus and similar proteins.
Ig5_Titin_like: domain similar to the M5, fifth
immunoglobulin (Ig)-like domain from the human titin C
terminus. Titin (also called connectin) is a fibrous
sarcomeric protein specifically found in vertebrate
striated muscle. Titin is gigantic; depending on isoform
composition it ranges from 2970 to 3700 kDa, and is of a
length that spans half a sarcomere. Titin largely
consists of multiple repeats of Ig-like and fibronectin
type 3 (FN-III)-like domains. Titin connects the ends of
myosin thick filaments to Z disks and extends along the
thick filament to the H zone, and appears to function
similar to an elastic band, keeping the myosin filaments
centered in the sarcomere during muscle contraction or
stretching.
Length = 92
Score = 42.3 bits (99), Expect = 4e-05
Identities = 28/82 (34%), Positives = 39/82 (47%), Gaps = 9/82 (10%)
Query: 270 ENATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQES 329
E+A C VD P ++W G+++ S QR I EY KS+ ++ Q S
Sbjct: 19 ESARFSCDVDGEPAPTVTWMREGQII------VSSQR-HQITSTEY--KSTFEISKVQMS 69
Query: 330 DSGRFYCVAENRAGIADANFTL 351
D G + V EN G +A FTL
Sbjct: 70 DEGNYTVVVENSEGKQEAQFTL 91
>gnl|CDD|215677 pfam00047, ig, Immunoglobulin domain. Members of the
immunoglobulin superfamily are found in hundreds of
proteins of different functions. Examples include
antibodies, the giant muscle kinase titin and receptor
tyrosine kinases. Immunoglobulin-like domains may be
involved in protein-protein and protein-ligand
interactions. The Pfam alignments do not include the
first and last strand of the immunoglobulin-like domain.
Length = 62
Score = 41.0 bits (96), Expect = 5e-05
Identities = 15/71 (21%), Positives = 27/71 (38%), Gaps = 10/71 (14%)
Query: 269 SENATVVCRVDSIPPAAISWY-WNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQ 327
+ T+ C V P ++W+ L + T + R+ +L ++N
Sbjct: 1 GSSVTLTCSVSGPPQVDVTWFKEGKGLEESTTVGTDENRVS---------SITLTISNVT 51
Query: 328 ESDSGRFYCVA 338
DSG + CV
Sbjct: 52 PEDSGTYTCVV 62
>gnl|CDD|205079 pfam12799, LRR_4, Leucine Rich repeats (2 copies). Leucine rich
repeats are short sequence motifs present in a number of
proteins with diverse functions and cellular locations.
These repeats are usually involved in protein-protein
interactions. Each Leucine Rich Repeat is composed of a
beta-alpha unit. These units form elongated non-globular
structures. Leucine Rich Repeats are often flanked by
cysteine rich domains.
Length = 43
Score = 39.8 bits (94), Expect = 8e-05
Identities = 15/43 (34%), Positives = 22/43 (51%), Gaps = 2/43 (4%)
Query: 76 NLQKLFLARCHIGQIDSGALDGLTNLIEIDLSDNLLTSIPSLT 118
NL+ L L+ I + L L NL +DLS N +T + L+
Sbjct: 2 NLETLDLSNNQITDLP--PLSNLPNLETLDLSGNKITDLSPLS 42
Score = 37.9 bits (89), Expect = 4e-04
Identities = 15/41 (36%), Positives = 25/41 (60%), Gaps = 2/41 (4%)
Query: 99 TNLIEIDLSDNLLTSIPSLTFQSVRFLRDLNLARNPISKIE 139
TNL +DLS+N +T +P L ++ L L+L+ N I+ +
Sbjct: 1 TNLETLDLSNNQITDLPPL--SNLPNLETLDLSGNKITDLS 39
Score = 28.2 bits (64), Expect = 0.99
Identities = 12/41 (29%), Positives = 19/41 (46%), Gaps = 4/41 (9%)
Query: 52 QVLDMSGNNLQILPKEAFRRAGLLNLQKLFLARCHIGQIDS 92
+ LD+S N + LP + L NL+ L L+ I +
Sbjct: 4 ETLDLSNNQITDLPP----LSNLPNLETLDLSGNKITDLSP 40
Score = 27.1 bits (61), Expect = 3.1
Identities = 10/42 (23%), Positives = 23/42 (54%), Gaps = 2/42 (4%)
Query: 147 PGLVKLDMSESRLEHISPEAFTGAKSLESIKLNGNRLSHFPV 188
L LD+S +++ + P + +LE++ L+GN+++
Sbjct: 1 TNLETLDLSNNQITDLPP--LSNLPNLETLDLSGNKITDLSP 40
>gnl|CDD|143242 cd05765, Ig_3, Subgroup of the immunoglobulin (Ig) superfamily.
Ig_3: subgroup of the immunoglobulin (Ig) domain found
in the Ig superfamily. The Ig superfamily is a
heterogenous group of proteins, built on a common fold
comprised of a sandwich of two beta sheets. Members of
the Ig superfamily are components of immunoglobulin,
neuroglia, cell surface glycoproteins, such as T-cell
receptors, CD2, CD4, CD8, and membrane glycoproteins,
such as butyrophilin and chondroitin sulfate
proteoglycan core protein. A predominant feature of most
Ig domains is a disulfide bridge connecting the two
beta-sheets with a tryptophan residue packed against the
disulfide bond.
Length = 81
Score = 40.6 bits (95), Expect = 1e-04
Identities = 28/86 (32%), Positives = 38/86 (44%), Gaps = 8/86 (9%)
Query: 270 ENATVVCRVDSIPPAAISW--YWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQ 327
E A+ C V PP I+W +G+ L + V G+ LV+ NAQ
Sbjct: 2 ETASFHCDVTGRPPPEITWEKQVHGKENLIMRPNHVRGNVVVTNIGQ------LVIYNAQ 55
Query: 328 ESDSGRFYCVAENRAGIADANFTLQV 353
D+G + C A N G+ ANF L V
Sbjct: 56 PQDAGLYTCTARNSGGLLRANFPLSV 81
>gnl|CDD|143260 cd05852, Ig5_Contactin-1, Fifth Ig domain of contactin-1.
Ig5_Contactin-1: fifth Ig domain of the neural cell
adhesion molecule contactin-1. Contactins are comprised
of six Ig domains followed by four fibronectin type III
(FnIII) domains anchored to the membrane by
glycosylphosphatidylinositol. Contactin-1 is
differentially expressed in tumor tissues and may
through a RhoA mechanism, facilitate invasion and
metastasis of human lung adenocarcinoma.
Length = 73
Score = 39.6 bits (92), Expect = 2e-04
Identities = 24/78 (30%), Positives = 35/78 (44%), Gaps = 12/78 (15%)
Query: 276 CRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQESDSGRFY 335
C+ + P SW LL+NN+ RI + + G SL + N + D G +
Sbjct: 8 CKPKAAPKPKFSWSKGTELLVNNS------RISIWDDG------SLEILNITKLDEGSYT 55
Query: 336 CVAENRAGIADANFTLQV 353
C AEN G A++ L V
Sbjct: 56 CFAENNRGKANSTGVLSV 73
>gnl|CDD|143168 cd04967, Ig1_Contactin, First Ig domain of contactin.
Ig1_Contactin: First Ig domain of contactins. Contactins
are neural cell adhesion molecules and are comprised of
six Ig domains followed by four fibronectin type
III(FnIII) domains anchored to the membrane by
glycosylphosphatidylinositol. The first four Ig domains
form the intermolecular binding fragment, which arranges
as a compact U-shaped module via contacts between Ig
domains 1 and 4, and between Ig domains 2 and 3.
Contactin-2 (TAG-1, axonin-1) may play a part in the
neuronal processes of neurite outgrowth, axon guidance
and fasciculation, and neuronal migration. This group
also includes contactin-1 and contactin-5. The different
contactins show different expression patterns in the
central nervous system. During development and in
adulthood, contactin-2 is transiently expressed in
subsets of central and peripheral neurons. Contactin-5
is expressed specifically in the rat postnatal nervous
system, peaking at about 3 weeks postnatal, and a lack
of contactin-5 (NB-2) results in an impairment of
neuronal activity in the rat auditory system.
Contactin-5 is highly expressed in the adult human brain
in the occipital lobe and in the amygdala. Contactin-1
is differentially expressed in tumor tissues and may,
through a RhoA mechanism, facilitate invasion and
metastasis of human lung adenocarcinoma.
Length = 91
Score = 39.7 bits (93), Expect = 3e-04
Identities = 23/90 (25%), Positives = 37/90 (41%), Gaps = 20/90 (22%)
Query: 260 DSRYVEAVSSENATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKS 319
D+ + E ++ CR PP W L+N T I+ R S
Sbjct: 10 DTIFPEESDEGKVSLNCRARGSPPPTYRW------LMNGTE---------IDDEPDSRYS 54
Query: 320 ----SLVLTNAQES-DSGRFYCVAENRAGI 344
+LV++N ++ D+GR+ C+A N G
Sbjct: 55 LVGGNLVISNPSKAKDAGRYQCLASNIVGT 84
>gnl|CDD|143208 cd05731, Ig3_L1-CAM_like, Third immunoglobulin (Ig)-like domain of
the L1 cell adhesion molecule (CAM). Ig3_L1-CAM_like:
domain similar to the third immunoglobulin (Ig)-like
domain of the L1 cell adhesion molecule (CAM). L1
belongs to the L1 subfamily of cell adhesion molecules
(CAMs) and is comprised of an extracellular region
having six Ig-like domains and five fibronectin type III
domains, a transmembrane region and an intracellular
domain. L1 is primarily expressed in the nervous system
and is involved in its development and function. L1 is
associated with an X-linked recessive disorder, X-linked
hydrocephalus, MASA syndrome, or spastic paraplegia type
1, that involves abnormalities of axonal growth. This
group also contains the chicken neuron-glia cell
adhesion molecule, Ng-CAM and human neurofascin.
Length = 71
Score = 38.9 bits (91), Expect = 4e-04
Identities = 20/79 (25%), Positives = 31/79 (39%), Gaps = 14/79 (17%)
Query: 276 CRVDSIPPAAISWY-WNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQESDSGRF 334
C + +P ISW G L + T F ++ + +L + N E D G +
Sbjct: 5 CIAEGLPTPEISWIKIGGELPADRTKFENFNK-------------TLKIDNVSEEDDGEY 51
Query: 335 YCVAENRAGIADANFTLQV 353
C A N G A ++ V
Sbjct: 52 RCTASNSLGSARHTISVTV 70
>gnl|CDD|143176 cd04975, Ig4_SCFR_like, Fourth immunoglobulin (Ig)-like domain of
stem cell factor receptor (SCFR) and similar proteins.
Ig4_SCFR_like; fourth immunoglobulin (Ig)-like domain of
stem cell factor receptor (SCFR). In addition to SCFR
this group also includes the fourth Ig domain of
platelet-derived growth factor receptors (PDGFR), alpha
and beta, the fourth Ig domain of macrophage colony
stimulating factor (M-CSF), and the Ig domain of the
receptor tyrosine kinase KIT. SCFR and the PDGFR alpha
and beta have similar organization: an extracellular
component having five Ig-like domains, a transmembrane
segment, and a cytoplasmic portion having protein
tyrosine kinase activity. SCFR and its ligand SCF are
critical for normal hematopoiesis, mast cell
development, melanocytes and gametogenesis. SCF binds to
the second and third Ig-like domains of SCFR, this
fourth Ig-like domain participates in SCFR dimerization,
which follows ligand binding. Deletion of this fourth
SCFR_Ig-like domain abolishes the ligand-induced
dimerization of SCFR and completely inhibits signal
transduction. PDGF is a potent mitogen for connective
tissue cells. PDGF-stimulated processes are mediated by
three different PDGFs (PDGF-A,-B, and C). PDGFR alpha
binds to all three PDGFs, whereas the PDGFR beta, binds
only to PDGF-B. In mice, PDGFR alpha, and PDGFR beta,
are essential for normal development.
Length = 101
Score = 39.6 bits (93), Expect = 4e-04
Identities = 21/76 (27%), Positives = 30/76 (39%), Gaps = 8/76 (10%)
Query: 282 PPAAISWYWNGRLLLNNTAFSSYQRIFV--IEQGEYERKSSLVLTNAQESDSGRFYCVAE 339
PP I+W ++ R L N V + EY S L L +ES++G + +A
Sbjct: 32 PPPHINWTYDNRTLTNK------LTEIVTSENESEYRYVSELKLVRLKESEAGTYTFLAS 85
Query: 340 NRAGIADANFTLQVTY 355
N F L V
Sbjct: 86 NSDASKSLTFELYVNV 101
>gnl|CDD|143201 cd05724, Ig2_Robo, Second immunoglobulin (Ig)-like domain in Robo
(roundabout) receptors. Ig2_Robo: domain similar to the
second immunoglobulin (Ig)-like domain in Robo
(roundabout) receptors. Robo receptors play a role in
the development of the central nervous system (CNS), and
are receptors of Slit protein. Slit is a repellant
secreted by the neural cells in the midline. Slit acts
through Robo to prevent most neurons from crossing the
midline from either side. Three mammalian Robo homologs
(robo1, -2, and -3), and three mammalian Slit homologs
(Slit-1,-2, -3), have been identified. Commissural
axons, which cross the midline, express low levels of
Robo; longitudinal axons, which avoid the midline,
express high levels of Robo. robo1, -2, and -3 are
expressed by commissural neurons in the vertebrate
spinal cord and Slits 1, -2, -3 are expressed at the
ventral midline. Robo-3 is a divergent member of the
Robo family which instead of being a positive regulator
of slit responsiveness, antagonizes slit responsiveness
in precrossing axons. The Slit-Robo interaction is
mediated by the second leucine-rich repeat (LRR) domain
of Slit and the two N-terminal Ig domains of Robo, Ig1
and Ig2. The primary Robo binding site for Slit2 has
been shown by surface plasmon resonance experiments and
mutational analysis to be is the Ig1 domain, while the
Ig2 domain has been proposed to harbor a weak secondary
binding site.
Length = 86
Score = 38.9 bits (91), Expect = 5e-04
Identities = 20/81 (24%), Positives = 36/81 (44%), Gaps = 12/81 (14%)
Query: 264 VEAVSSENATVVCRVD-SIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLV 322
+ E A + C P +SW +G+ L +R+ +++ G +L+
Sbjct: 6 TQVAVGEMAVLECSPPRGHPEPTVSWRKDGQPLN-----LDNERVRIVDDG------NLL 54
Query: 323 LTNAQESDSGRFYCVAENRAG 343
+ A++SD G + CVA N G
Sbjct: 55 IAEARKSDEGTYKCVATNMVG 75
>gnl|CDD|143180 cd04979, Ig_Semaphorin_C, Immunoglobulin (Ig)-like domain of
semaphorin. Ig_Semaphorin_C; Immunoglobulin (Ig)-like
domain in semaphorins. Semaphorins are transmembrane
protein that have important roles in a variety of
tissues. Functionally, semaphorins were initially
characterized for their importance in the development of
the nervous system and in axonal guidance. Later they
have been found to be important for the formation and
functioning of the cardiovascular, endocrine,
gastrointestinal, hepatic, immune, musculoskeletal,
renal, reproductive, and respiratory systems.
Semaphorins function through binding to their receptors
and transmembrane semaphorins also serves as receptors
themselves. Although molecular mechanism of semaphorins
is poorly understood, the Ig-like domains may involve in
ligand binding or dimerization.
Length = 89
Score = 38.5 bits (90), Expect = 7e-04
Identities = 16/72 (22%), Positives = 28/72 (38%), Gaps = 12/72 (16%)
Query: 270 ENATVV--CRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQ 327
E +V C S A++ W + G L R+ V E G L++ +
Sbjct: 10 EGNSVFLECSPKS-NLASVVWLFQGGPLQRKEEPEE--RLLVTEDG-------LLIRSVS 59
Query: 328 ESDSGRFYCVAE 339
+D+G + C +
Sbjct: 60 PADAGVYTCQSV 71
>gnl|CDD|143221 cd05744, Ig_Myotilin_C_like, Immunoglobulin (Ig)-like domain of
myotilin, palladin, and myopalladin.
Ig_Myotilin_like_C: immunoglobulin (Ig)-like domain in
myotilin, palladin, and myopalladin. Myotilin,
palladin, and myopalladin function as scaffolds that
regulate actin organization. Myotilin and myopalladin
are most abundant in skeletal and cardiac muscle;
palladin is ubiquitously expressed in the organs of
developing vertebrates and plays a key role in cellular
morphogenesis. The three family members each interact
with specific molecular partners: all three bind to
alpha-actinin; in addition, palladin also binds to
vasodilator-stimulated phosphoprotein (VASP) and ezrin,
myotilin binds to filamin and actin, and myopalladin
also binds to nebulin and cardiac ankyrin repeat protein
(CARP).
Length = 75
Score = 37.9 bits (88), Expect = 0.001
Identities = 28/78 (35%), Positives = 37/78 (47%), Gaps = 7/78 (8%)
Query: 276 CRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQESDSGRFY 335
CRV +IPP I W N +L NT RI + Q R L++ NA + D+G +
Sbjct: 5 CRVSAIPPPQIFWKKNNEMLTYNT-----DRI-SLYQDNCGR-ICLLIQNANKEDAGWYT 57
Query: 336 CVAENRAGIADANFTLQV 353
A N AG+ N L V
Sbjct: 58 VSAVNEAGVVSCNARLDV 75
>gnl|CDD|143265 cd05857, Ig2_FGFR, Second immunoglobulin (Ig)-like domain of
fibroblast growth factor (FGF) receptor. Ig2_FGFR:
second immunoglobulin (Ig)-like domain of fibroblast
growth factor (FGF) receptor. FGF receptors bind FGF
signaling polypeptides. FGFs participate in multiple
processes such as morphogenesis, development, and
angiogenesis. FGFs bind to four FGF receptor tyrosine
kinases (FGFR1, -2, -3, -4). Receptor diversity is
controlled by alternative splicing producing splice
variants with different ligand binding characteristics
and different expression patterns. FGFRs have an
extracellular region comprised of three IG-like domains,
a single transmembrane helix, and an intracellular
tyrosine kinase domain. Ligand binding and specificity
reside in the Ig-like domains 2 and 3, and the linker
region that connects these two. FGFR activation and
signaling depend on FGF-induced dimerization, a process
involving cell surface heparin or heparin sulfate
proteoglycans.
Length = 85
Score = 38.3 bits (89), Expect = 0.001
Identities = 22/81 (27%), Positives = 32/81 (39%), Gaps = 14/81 (17%)
Query: 276 CRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYE---RKSSLVLTNAQESDSG 332
C P + W NG+ F RI G Y+ + SL++ + SD G
Sbjct: 16 CPAAGNPTPTMRWLKNGK------EFKQEHRI-----GGYKVRNQHWSLIMESVVPSDKG 64
Query: 333 RFYCVAENRAGIADANFTLQV 353
+ CV EN G + + L V
Sbjct: 65 NYTCVVENEYGSINHTYHLDV 85
>gnl|CDD|143225 cd05748, Ig_Titin_like, Immunoglobulin (Ig)-like domain of titin
and similar proteins. Ig_Titin_like: immunoglobulin
(Ig)-like domain found in titin-like proteins. Titin
(also called connectin) is a fibrous sarcomeric protein
specifically found in vertebrate striated muscle. Titin
is gigantic, depending on isoform composition it ranges
from 2970 to 3700 kDa, and is of a length that spans
half a sarcomere. Titin largely consists of multiple
repeats of Ig-like and fibronectin type 3 (FN-III)-like
domains. Titin connects the ends of myosin thick
filaments to Z disks and extends along the thick
filament to the H zone. It appears to function
similarly to an elastic band, keeping the myosin
filaments centered in the sarcomere during muscle
contraction or stretching. Within the sarcomere, titin
is also attached to or is associated with myosin binding
protein C (MyBP-C). MyBP-C appears to contribute to the
generation of passive tension by titin, and similar to
titin has repeated Ig-like and FN-III domains. Also
included in this group are worm twitchin and insect
projectin, thick filament proteins of invertebrate
muscle, which also have repeated Ig-like and FN-III
domains.
Length = 74
Score = 37.6 bits (88), Expect = 0.001
Identities = 20/73 (27%), Positives = 33/73 (45%), Gaps = 9/73 (12%)
Query: 281 IPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQESDSGRFYCVAEN 340
P ++W +G+ L + IE +SLV+ NA+ SDSG++ +N
Sbjct: 11 RPTPTVTWSKDGKPLKLSGRVQ-------IETTAS--STSLVIKNAERSDSGKYTLTLKN 61
Query: 341 RAGIADANFTLQV 353
AG A ++V
Sbjct: 62 PAGEKSATINVKV 74
>gnl|CDD|143275 cd05867, Ig4_L1-CAM_like, Fourth immunoglobulin (Ig)-like domain of
the L1 cell adhesion molecule (CAM). Ig4_L1-CAM_like:
fourth immunoglobulin (Ig)-like domain of the L1 cell
adhesion molecule (CAM). L1 is comprised of an
extracellular region having six Ig-like domains and five
fibronectin type III domains, a transmembrane region and
an intracellular domain. L1 is primarily expressed in
the nervous system and is involved in its development
and function. L1 is associated with an X-linked
recessive disorder, X-linked hydrocephalus, MASA
syndrome, or spastic paraplegia type 1, that involves
abnormalities of axonal growth. This group also contains
the chicken neuron-glia cell adhesion molecule, Ng-CAM.
Length = 76
Score = 37.2 bits (86), Expect = 0.002
Identities = 24/84 (28%), Positives = 37/84 (44%), Gaps = 10/84 (11%)
Query: 270 ENATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQES 329
E A + C+V+ IP I+W NG + + + +L+LT+ Q S
Sbjct: 2 ETARLDCQVEGIPTPNITWSINGAPIEGTDP----------DPRRHVSSGALILTDVQPS 51
Query: 330 DSGRFYCVAENRAGIADANFTLQV 353
D+ + C A NR G AN + V
Sbjct: 52 DTAVYQCEARNRHGNLLANAHVHV 75
>gnl|CDD|238064 cd00116, LRR_RI, Leucine-rich repeats (LRRs), ribonuclease
inhibitor (RI)-like subfamily. LRRs are 20-29 residue
sequence motifs present in many proteins that
participate in protein-protein interactions and have
different functions and cellular locations. LRRs
correspond to structural units consisting of a beta
strand (LxxLxLxxN/CxL conserved pattern) and an alpha
helix. This alignment contains 12 strands corresponding
to 11 full repeats, consistent with the extent observed
in the subfamily acting as Ran GTPase Activating
Proteins (RanGAP1).
Length = 319
Score = 40.4 bits (95), Expect = 0.002
Identities = 48/199 (24%), Positives = 71/199 (35%), Gaps = 36/199 (18%)
Query: 35 RDKFLITIPEAPESELT--------QVLDMSGNNLQIL---PKEAFRRAGLLNLQKLFLA 83
IP +S L Q LD+S N L E+ R+ +LQ+L L
Sbjct: 59 SLNETGRIPRGLQSLLQGLTKGCGLQELDLSDNALGPDGCGVLESLLRSS--SLQELKLN 116
Query: 84 RCHIGQIDSGAL--DGLT----NLIEIDLSDNLLTSIP----SLTFQSVRFLRDLNLARN 133
+G L GL L ++ L N L + ++ R L++LNLA N
Sbjct: 117 NNGLG-DRGLRLLAKGLKDLPPALEKLVLGRNRLEGASCEALAKALRANRDLKELNLANN 175
Query: 134 PISKIEKG------AFQFVPGLVKLDMSESRLEHISPEAFTGA----KSLESIKLNGNRL 183
I + G + L LD++ + L A KSLE + L N L
Sbjct: 176 GIG--DAGIRALAEGLKANCNLEVLDLNNNGLTDEGASALAETLASLKSLEVLNLGDNNL 233
Query: 184 SHFPVRSVEPLLKLMMIEL 202
+ ++ L I L
Sbjct: 234 TDAGAAALASALLSPNISL 252
Score = 36.9 bits (86), Expect = 0.023
Identities = 25/130 (19%), Positives = 47/130 (36%), Gaps = 29/130 (22%)
Query: 52 QVLDMSGNNL---------QILPKEAFRRAGLLNLQKLFLARCHIGQID-SGALDGLTNL 101
+ L+++ N + + L A +L+L L G + L L +L
Sbjct: 168 KELNLANNGIGDAGIRALAEGLK--ANCNLEVLDLNNNGL--TDEGASALAETLASLKSL 223
Query: 102 IEIDLSDNLLTSIPSLTFQS-----VRFLRDLNLARNPISKIEKGAFQFV-------PGL 149
++L DN LT + S L L+L+ N I + + L
Sbjct: 224 EVLNLGDNNLTDAGAAALASALLSPNISLLTLSLSCN---DITDDGAKDLAEVLAEKESL 280
Query: 150 VKLDMSESRL 159
++LD+ ++
Sbjct: 281 LELDLRGNKF 290
>gnl|CDD|143300 cd05892, Ig_Myotilin_C, C-terminal immunoglobulin (Ig)-like domain
of myotilin. Ig_Myotilin_C: C-terminal immunoglobulin
(Ig)-like domain of myotilin. Mytolin belongs to the
palladin-myotilin-myopalladin family. Proteins belonging
to the latter family contain multiple Ig-like domains
and function as scaffolds, modulating actin
cytoskeleton. Myotilin is most abundant in skeletal and
cardiac muscle, and is involved in maintaining sarcomere
integrity. It binds to alpha-actinin, filamin and actin.
Mutations in myotilin lead to muscle disorders.
Length = 75
Score = 36.9 bits (85), Expect = 0.002
Identities = 22/78 (28%), Positives = 39/78 (50%), Gaps = 7/78 (8%)
Query: 276 CRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQESDSGRFY 335
C++ +IPP I W N ++ NT RI + + + + +L++ N + D+G +
Sbjct: 5 CQISAIPPPKIFWKRNNEMVQYNT-----DRISLYQ--DNSGRVTLLIKNVNKKDAGWYT 57
Query: 336 CVAENRAGIADANFTLQV 353
A N AG+A + L V
Sbjct: 58 VSAVNEAGVATCHARLDV 75
>gnl|CDD|143203 cd05726, Ig4_Robo, Third immunoglobulin (Ig)-like domain in Robo
(roundabout) receptors. Ig4_Robo: domain similar to the
third immunoglobulin (Ig)-like domain in Robo
(roundabout) receptors. Robo receptors play a role in
the development of the central nervous system (CNS), and
are receptors of Slit protein. Slit is a repellant
secreted by the neural cells in the midline. Slit acts
through Robo to prevent most neurons from crossing the
midline from either side. Three mammalian Robo homologs
(robo1, -2, and -3), and three mammalian Slit homologs
(Slit-1,-2, -3), have been identified. Commissural
axons, which cross the midline, express low levels of
Robo; longitudinal axons, which avoid the midline,
express high levels of Robo. robo1, -2, and -3 are
expressed by commissural neurons in the vertebrate
spinal cord and Slits 1, -2, -3 are expressed at the
ventral midline. Robo-3 is a divergent member of the
Robo family which instead of being a positive regulator
of slit responsiveness, antagonizes slit responsiveness
in precrossing axons. The Slit-Robo interaction is
mediated by the second leucine-rich repeat (LRR) domain
of Slit and the two N-terminal Ig domains of Robo, Ig1
and Ig2. The primary Robo binding site for Slit2 has
been shown by surface plasmon resonance experiments and
mutational analysis to be is the Ig1 domain, while the
Ig2 domain has been proposed to harbor a weak secondary
binding site.
Length = 90
Score = 36.9 bits (85), Expect = 0.003
Identities = 27/86 (31%), Positives = 33/86 (38%), Gaps = 8/86 (9%)
Query: 271 NATVVCRVDSIPPAAISWYWNGR--LLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQE 328
T C P AI W G LL + S R V + G+ L +TN Q
Sbjct: 3 TVTFQCEATGNPQPAIFWQKEGSQNLLFSYQPPQSSSRFSVSQTGD------LTITNVQR 56
Query: 329 SDSGRFYCVAENRAGIADANFTLQVT 354
SD G + C N AG L+VT
Sbjct: 57 SDVGYYICQTLNVAGSILTKAYLEVT 82
>gnl|CDD|143205 cd05728, Ig4_Contactin-2-like, Fourth Ig domain of the neural cell
adhesion molecule contactin-2 and similar proteins.
Ig4_Contactin-2-like: fourth Ig domain of the neural
cell adhesion molecule contactin-2. Contactins are
comprised of six Ig domains followed by four fibronectin
type III (FnIII) domains anchored to the membrane by
glycosylphosphatidylinositol. Contactin-2 (aliases
TAG-1, axonin-1) facilitates cell adhesion by homophilic
binding between molecules in apposed membranes. The
first four Ig domains form the intermolecular binding
fragment which arranges as a compact U-shaped module by
contacts between Ig domains 1 and 4, and domains 2 and
3. It has been proposed that a linear zipper-like array
forms, from contactin-2 molecules alternatively provided
by the two apposed membranes.
Length = 85
Score = 36.4 bits (84), Expect = 0.004
Identities = 28/78 (35%), Positives = 36/78 (46%), Gaps = 13/78 (16%)
Query: 276 CRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQESDSGRFY 335
C+ P A W NG+ L +S RI V E G+ L +T SDSG +
Sbjct: 21 CKASGNPRPAYRWLKNGQPL------ASENRIEV-EAGD------LRITKLSLSDSGMYQ 67
Query: 336 CVAENRAGIADANFTLQV 353
CVAEN+ G A+ L V
Sbjct: 68 CVAENKHGTIYASAELAV 85
>gnl|CDD|143202 cd05725, Ig3_Robo, Third immunoglobulin (Ig)-like domain in Robo
(roundabout) receptors. Ig3_Robo: domain similar to the
third immunoglobulin (Ig)-like domain in Robo
(roundabout) receptors. Robo receptors play a role in
the development of the central nervous system (CNS), and
are receptors of Slit protein. Slit is a repellant
secreted by the neural cells in the midline. Slit acts
through Robo to prevent most neurons from crossing the
midline from either side. Three mammalian Robo homologs
(robo1, -2, and -3), and three mammalian Slit homologs
(Slit-1,-2, -3), have been identified. Commissural
axons, which cross the midline, express low levels of
Robo; longitudinal axons, which avoid the midline,
express high levels of Robo. robo1, -2, and -3 are
expressed by commissural neurons in the vertebrate
spinal cord and Slits 1, -2, -3 are expressed at the
ventral midline. Robo-3 is a divergent member of the
Robo family which instead of being a positive regulator
of slit responsiveness, antagonizes slit responsiveness
in precrossing axons. The Slit-Robo interaction is
mediated by the second leucine-rich repeat (LRR) domain
of Slit and the two N-terminal Ig domains of Robo, Ig1
and Ig2. The primary Robo binding site for Slit2 has
been shown by surface plasmon resonance experiments and
mutational analysis to be is the Ig1 domain, while the
Ig2 domain has been proposed to harbor a weak secondary
binding site.
Length = 69
Score = 35.8 bits (83), Expect = 0.004
Identities = 21/85 (24%), Positives = 27/85 (31%), Gaps = 19/85 (22%)
Query: 272 ATVVCRVDSIPPAAISWYWN-GRLLLNNTAFSSYQRIFVIEQGEYE--RKSSLVLTNAQE 328
C V P + W G L G E SL + N
Sbjct: 1 VEFQCEVGGDPVPTVLWRKEDGELPK----------------GRAEILDDKSLKIRNVTA 44
Query: 329 SDSGRFYCVAENRAGIADANFTLQV 353
D G + C AEN G +A+ +L V
Sbjct: 45 GDEGSYTCEAENMVGKIEASASLTV 69
>gnl|CDD|178695 PLN03150, PLN03150, hypothetical protein; Provisional.
Length = 623
Score = 39.8 bits (93), Expect = 0.004
Identities = 22/71 (30%), Positives = 33/71 (46%), Gaps = 1/71 (1%)
Query: 114 IPSLTFQSVRFLRDLNLARNPISKIEKGAFQFVPGLVKLDMSESRLEHISPEAFTGAKSL 173
IP+ +R L+ +NL+ N I + + L LD+S + PE+ SL
Sbjct: 434 IPN-DISKLRHLQSINLSGNSIRGNIPPSLGSITSLEVLDLSYNSFNGSIPESLGQLTSL 492
Query: 174 ESIKLNGNRLS 184
+ LNGN LS
Sbjct: 493 RILNLNGNSLS 503
>gnl|CDD|143258 cd05850, Ig1_Contactin-2, First Ig domain of contactin-2.
Ig1_Contactin-2: First Ig domain of the neural cell
adhesion molecule contactin-2-like. Contactins are
comprised of six Ig domains followed by four fibronectin
type III (FnIII) domains anchored to the membrane by
glycosylphosphatidylinositol. Contactin-2 (TAG-1,
axonin-1) facilitates cell adhesion by homophilic
binding between molecules in apposed membranes. It may
play a part in the neuronal processes of neurite
outgrowth, axon guidance and fasciculation, and neuronal
migration. The first four Ig domains form the
intermolecular binding fragment, which arranges as a
compact U-shaped module by contacts between IG domains 1
and 4, and domains 2 and 3. The different contactins
show different expression patterns in the central
nervous system. During development and in adulthood,
contactin-2 is transiently expressed in subsets of
central and peripheral neurons. Contactin-2 is also
expressed in retinal amacrine cells in the developing
chick retina, corresponding to the period of formation
and maturation of AC processes.
Length = 94
Score = 35.7 bits (82), Expect = 0.009
Identities = 24/82 (29%), Positives = 36/82 (43%), Gaps = 12/82 (14%)
Query: 263 YVEAVSSENATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLV 322
+ E E T+ CR + PPA W NG + S Y + +LV
Sbjct: 13 FPEGSPEEKVTLGCRARASPPATYRWKMNGT-EIKFAPESRYTLV----------AGNLV 61
Query: 323 LTNAQES-DSGRFYCVAENRAG 343
+ N Q++ D+G + C+A NR G
Sbjct: 62 INNPQKARDAGSYQCLAINRCG 83
>gnl|CDD|143184 cd04983, IgV_TCR_alpha_like, Immunoglobulin (Ig) variable (V)
domain of T-cell receptor (TCR) alpha chain and similar
proteins. IgV_TCR_alpha: immunoglobulin (Ig) variable
domain of the alpha chain of alpha/beta T-cell antigen
receptors (TCRs). TCRs mediate antigen recognition by T
lymphocytes, and are composed of alpha and beta, or
gamma and delta, polypeptide chains with variable (V)
and constant (C) regions. This group represents the
variable domain of the alpha chain of TCRs and also
includes the variable domain of delta chains of TCRs.
Alpha/beta TCRs recognize antigen as peptide fragments
presented by major histocompatibility complex (MHC)
molecules. The variable domain of TCRs is responsible
for antigen recognition, and is located at the
N-terminus of the receptor. Gamma/delta TCRs recognize
intact protein antigens; they recognize proteins
antigens directly and without antigen processing, and
MHC independently of the bound peptide.
Length = 109
Score = 35.7 bits (83), Expect = 0.010
Identities = 18/91 (19%), Positives = 33/91 (36%), Gaps = 7/91 (7%)
Query: 260 DSRYVEAVSSENATVVCRVDSIPPAAISWY-WNGR----LLLNNTAFSSYQRI--FVIEQ 312
+ + EN T+ C + + WY L+ ++ + F
Sbjct: 4 SPQSLSVQEGENVTLNCNYSTSTFYYLFWYRQYPGQGPQFLIYISSNGEEKEKGRFSATL 63
Query: 313 GEYERKSSLVLTNAQESDSGRFYCVAENRAG 343
+ + SSL ++ AQ SDS ++C G
Sbjct: 64 DKSRKSSSLHISAAQLSDSAVYFCALSESGG 94
>gnl|CDD|143171 cd04970, Ig6_Contactin_like, Sixth Ig domain of contactin.
Ig6_Contactin_like: Sixth Ig domain of contactins.
Contactins are neural cell adhesion molecules and are
comprised of six Ig domains followed by four fibronectin
type III(FnIII) domains anchored to the membrane by
glycosylphosphatidylinositol. The first four Ig domains
form the intermolecular binding fragment, which arranges
as a compact U-shaped module via contacts between Ig
domains 1 and 4, and between Ig domains 2 and 3.
Contactin-2 (TAG-1, axonin-1) may play a part in the
neuronal processes of neurite outgrowth, axon guidance
and fasciculation, and neuronal migration. This group
also includes contactin-1 and contactin-5. The different
contactins show different expression patterns in the
central nervous system. During development and in
adulthood, contactin-2 is transiently expressed in
subsets of central and peripheral neurons. Contactin-5
is expressed specifically in the rat postnatal nervous
system, peaking at about 3 weeks postnatal, and a lack
of contactin-5 (NB-2) results in an impairment of neur
onal act ivity in the rat auditory system. Contactin-5
is highly expressed in the adult human brain in the
occipital lobe and in the amygdala. Contactin-1 is
differentially expressed in tumor tissues and may,
through a RhoA mechanism, facilitate invasion and
metastasis of human lung adenocarcinoma.
Length = 85
Score = 35.2 bits (81), Expect = 0.010
Identities = 22/91 (24%), Positives = 40/91 (43%), Gaps = 11/91 (12%)
Query: 270 ENATVVCRVDSIPPAAISWYW--NGRLLLNNTAFSSYQRIFVIEQ-GEYERKSSLVLTNA 326
E+ T+ C P +++ W NG + + Y+R+ + G+ L++ NA
Sbjct: 1 ESITLQCHASHDPTLDLTFTWSFNGVPIDFDKDGGHYRRVGGKDSNGD------LMIRNA 54
Query: 327 QESDSGRFYCVAENRAGIADANFTLQVTYRG 357
Q +G++ C A+ A+ L V RG
Sbjct: 55 QLKHAGKYTCTAQTVVDSLSASADLIV--RG 83
>gnl|CDD|143277 cd05869, Ig5_NCAM-1, Fifth immunoglobulin (Ig)-like domain of
Neural Cell Adhesion Molecule NCAM-1 (NCAM).
Ig5_NCAM-1: The fifth immunoglobulin (Ig)-like domain of
Neural Cell Adhesion Molecule NCAM-1 (NCAM). NCAM plays
important roles in the development and regeneration of
the central nervous system, in synaptogenesis and neural
migration. NCAM mediates cell-cell and cell-substratum
recognition and adhesion via homophilic (NCAM-NCAM) and
heterophilic (NCAM-non-NCAM) interactions. NCAM is
expressed as three major isoforms having different
intracellular extensions. The extracellular portion of
NCAM has five N-terminal Ig-like domains and two
fibronectin type III domains. The double zipper adhesion
complex model for NCAM homophilic binding involves Ig1,
Ig2, and Ig3. By this model, Ig1 and Ig2 mediate
dimerization of NCAM molecules situated on the same cell
surface (cis interactions), and Ig3 domains mediate
interactions between NCAM molecules expressed on the
surface of opposing cells (trans interactions), through
binding to the Ig1 and Ig2 domains. The adhesive ability
of NCAM is modulated by the addition of polysialic acid
chains to the fifth Ig-like domain.
Length = 97
Score = 35.3 bits (81), Expect = 0.012
Identities = 29/104 (27%), Positives = 45/104 (43%), Gaps = 12/104 (11%)
Query: 254 KPEIRMDSRYVEAVSS----ENATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFV 309
KP+I YVE ++ E T+ C P +I+W + R + + + I V
Sbjct: 2 KPKIT----YVENQTAMELEEQITLTCEASGDPIPSITWRTSTRNISSE-EKTLDGHIVV 56
Query: 310 IEQGEYERKSSLVLTNAQESDSGRFYCVAENRAGIADANFTLQV 353
+ R SSL L Q +D+G + C A N G + L+V
Sbjct: 57 ---RSHARVSSLTLKYIQYTDAGEYLCTASNTIGQDSQSMYLEV 97
>gnl|CDD|143237 cd05760, Ig2_PTK7, Second immunoglobulin (Ig)-like domain of
protein tyrosine kinase (PTK) 7, also known as CCK4.
Ig2_PTK7: domain similar to the second immunoglobulin
(Ig)-like domain in protein tyrosine kinase (PTK) 7,
also known as CCK4. PTK7 is a subfamily of the receptor
protein tyrosine kinase family, and is referred to as an
RPTK-like molecule. RPTKs transduce extracellular
signals across the cell membrane, and play important
roles in regulating cell proliferation, migration, and
differentiation. PTK7 is organized as an extracellular
portion having seven Ig-like domains, a single
transmembrane region, and a cytoplasmic tyrosine
kinase-like domain. PTK7 is considered a pseudokinase as
it has several unusual residues in some of the highly
conserved tyrosine kinase (TK) motifs; it is predicted
to lack TK activity. PTK7 may function as a
cell-adhesion molecule. PTK7 mRNA is expressed at high
levels in placenta, melanocytes, liver, lung, pancreas,
and kidney. PTK7 is overexpressed in several cancers,
including melanoma and colon cancer lines.
Length = 77
Score = 34.9 bits (80), Expect = 0.013
Identities = 26/86 (30%), Positives = 38/86 (44%), Gaps = 18/86 (20%)
Query: 273 TVVCRVDSIPPAAISWYWNGRLLLN---NTAFSSYQRIFVIEQGEYERKSSLVLTNAQES 329
T+ C +D P W+ +G L + N + SS +R +L L +A
Sbjct: 2 TLRCHIDGHPRPTYQWFRDGTPLSDGQGNYSVSSKER-------------TLTLRSAGPD 48
Query: 330 DSGRFYCVAENRAG--IADANFTLQV 353
DSG +YC A N G + NFTL +
Sbjct: 49 DSGLYYCCAHNAFGSVCSSQNFTLSI 74
>gnl|CDD|143241 cd05764, Ig_2, Subgroup of the immunoglobulin (Ig) superfamily.
Ig_2: subgroup of the immunoglobulin (Ig) domain found
in the Ig superfamily. The Ig superfamily is a
heterogenous group of proteins, built on a common fold
comprised of a sandwich of two beta sheets. Members of
the Ig superfamily are components of immunoglobulin,
neuroglia, cell surface glycoproteins, such as T-cell
receptors, CD2, CD4, CD8, and membrane glycoproteins,
such as butyrophilin and chondroitin sulfate
proteoglycan core protein. A predominant feature of most
Ig domains is a disulfide bridge connecting the two
beta-sheets with a tryptophan residue packed against the
disulfide bond.
Length = 74
Score = 34.8 bits (80), Expect = 0.013
Identities = 25/83 (30%), Positives = 38/83 (45%), Gaps = 13/83 (15%)
Query: 272 ATVVCRVDSIPPAAISWYW-NGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQESD 330
AT+ C+ P AI W +G+L+ N++ R V + G +L + D
Sbjct: 4 ATLRCKARGDPEPAIHWISPDGKLISNSS------RTLVYDNG------TLDILITTVKD 51
Query: 331 SGRFYCVAENRAGIADANFTLQV 353
+G F C+A N AG A A L +
Sbjct: 52 TGSFTCIASNAAGEATATVELHI 74
>gnl|CDD|143278 cd05870, Ig5_NCAM-2, Fifth immunoglobulin (Ig)-like domain of
Neural Cell Adhesion Molecule NCAM-2 (also known as
OCAM/mamFas II and RNCAM). Ig5_NCAM-2: the fifth
immunoglobulin (Ig)-like domain of Neural Cell Adhesion
Molecule NCAM-2 (also known as OCAM/mamFas II and
RNCAM). NCAM-2 is organized similarly to NCAM ,
including five N-terminal Ig-like domains and two
fibronectin type III domains. NCAM-2 is differentially
expressed in the developing and mature olfactory
epithelium (OE), and may function like NCAM, as an
adhesion molecule.
Length = 98
Score = 35.3 bits (81), Expect = 0.013
Identities = 25/87 (28%), Positives = 42/87 (48%), Gaps = 12/87 (13%)
Query: 262 RYVEAVSSENATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQ-----RIFVIEQGEYE 316
+ V + AT+ C+ + P I+W + + FS RI V +G++
Sbjct: 9 KNETTVENGAATLSCKAEGEPIPEITW----KRASDGHTFSEGDKSPDGRIEV--KGQHG 62
Query: 317 RKSSLVLTNAQESDSGRFYCVAENRAG 343
+SSL + + + SDSGR+ C A +R G
Sbjct: 63 -ESSLHIKDVKLSDSGRYDCEAASRIG 88
>gnl|CDD|143167 cd00099, IgV, Immunoglobulin variable domain (IgV). IgV:
Immunoglobulin variable domain (IgV). Members of the IgV
family are components of immunoglobulin (Ig) and T cell
receptors. The basic structure of Ig molecules is a
tetramer of two light chains and two heavy chains linked
by disulfide bonds. In Ig, each chain is composed of one
variable domain (IgV) and one or more constant domains
(IgC); these names reflect the fact that the variability
in sequences is higher in the variable domain than in
the constant domain. Within the variable domain, there
are regions of even more variability called the
hypervariable or complementarity-determining regions
(CDRs) which are responsible for antigen binding. A
predominant feature of most Ig domains is the disulfide
bridge connecting 2 beta-sheets with a tryptophan
residue packed against the disulfide bond.
Length = 105
Score = 35.4 bits (82), Expect = 0.015
Identities = 17/97 (17%), Positives = 32/97 (32%), Gaps = 11/97 (11%)
Query: 264 VEAVSSENATVVCRV-DSIPPAAISWY---------WNGRLLLNNTAFSS-YQRIFVIEQ 312
+ E+ T+ C S I WY + N + ++ + F +
Sbjct: 1 LSVSEGESVTLSCTYSGSFSSYYIFWYRQKPGKGPELLIYISSNGSQYAGGVKGRFSGTR 60
Query: 313 GEYERKSSLVLTNAQESDSGRFYCVAENRAGIADANF 349
+ +L +++ Q DS +YC G F
Sbjct: 61 DSSKSSFTLTISSLQPEDSAVYYCAVSLSGGTYKLYF 97
>gnl|CDD|143267 cd05859, Ig4_PDGFR-alpha, Fourth immunoglobulin (Ig)-like domain of
platelet-derived growth factor receptor (PDGFR) alpha.
IG4_PDGFR-alpha: The fourth immunoglobulin (Ig)-like
domain of platelet-derived growth factor receptor
(PDGFR) alpha. PDGF is a potent mitogen for connective
tissue cells. PDGF-stimulated processes are mediated by
three different PDGFs (PDGF-A,-B, and C). PDGFR alpha
binds to all three PDGFs, whereas the PDGFR beta (not
included in this group) binds only to PDGF-B. PDGF alpha
is organized as an extracellular component having five
Ig-like domains, a transmembrane segment, and a
cytoplasmic portion having protein tyrosine kinase
activity. In mice, PDGFR alpha and PDGFR beta are
essential for normal development.
Length = 101
Score = 34.8 bits (80), Expect = 0.019
Identities = 24/84 (28%), Positives = 35/84 (41%), Gaps = 3/84 (3%)
Query: 270 ENATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQES 329
E V V++ PP I W + R L+ N + E S L L A+E
Sbjct: 19 EVKEFVVEVEAYPPPQIRWLKDNRTLIENLTEITTS---EHNVQETRYVSKLKLIRAKEE 75
Query: 330 DSGRFYCVAENRAGIADANFTLQV 353
DSG + +A+N + F LQ+
Sbjct: 76 DSGLYTALAQNEDAVKSYTFALQI 99
>gnl|CDD|143317 cd07693, Ig1_Robo, First immunoglobulin (Ig)-like domain in Robo
(roundabout) receptors and similar proteins. Ig1_Robo:
domain similar to the first immunoglobulin (Ig)-like
domain in Robo (roundabout) receptors. Robo receptors
play a role in the development of the central nervous
system (CNS), and are receptors of Slit protein. Slit is
a repellant secreted by the neural cells in the midline.
Slit acts through Robo to prevent most neurons from
crossing the midline from either side. Three mammalian
Robo homologs (robo1, -2, and -3), and three mammalian
Slit homologs (Slit-1,-2, -3), have been identified.
Commissural axons, which cross the midline, express low
levels of Robo; longitudinal axons, which avoid the
midline, express high levels of Robo. robo1, -2, and -3
are expressed by commissural neurons in the vertebrate
spinal cord and Slits 1, -2, -3 are expressed at the
ventral midline. Robo-3 is a divergent member of the
Robo family which instead of being a positive regulator
of slit responsiveness, antagonizes slit responsiveness
in precrossing axons. The Slit-Robo interaction is
mediated by the second leucine-rich repeat (LRR) domain
of Slit and the two N-terminal Ig domains of Robo, Ig1
and Ig2. The primary Robo binding site for Slit2 has
been shown by surface plasmon resonance experiments and
mutational analysis to be is the Ig1 domain, while the
Ig2 domain has been proposed to harbor a weak secondary
binding site.
Length = 100
Score = 34.4 bits (79), Expect = 0.023
Identities = 24/85 (28%), Positives = 38/85 (44%), Gaps = 3/85 (3%)
Query: 270 ENATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQES 329
+ AT+ C+ + P I W NG+ L + RI + + + +V S
Sbjct: 17 DPATLNCKAEGRPTPTIQWLKNGQPLETDKDDPRSHRIVLPSGSLFFLR--VVHGRKGRS 74
Query: 330 DSGRFYCVAENRAGIADA-NFTLQV 353
D G + CVA N G A + N +L+V
Sbjct: 75 DEGVYVCVAHNSLGEAVSRNASLEV 99
>gnl|CDD|143215 cd05738, Ig2_RPTP_IIa_LAR_like, Second immunoglobulin (Ig)-like
domain of the receptor protein tyrosine phosphatase
(RPTP)-F, also known as LAR. Ig2_RPTP_IIa_LAR_like:
domain similar to the second immunoglobulin (Ig)-like
domain found in the receptor protein tyrosine
phosphatase (RPTP)-F, also known as LAR. LAR belongs to
the RPTP type IIa subfamily. Members of this subfamily
are cell adhesion molecule-like proteins involved in
central nervous system (CNS) development. They have
large extracellular portions, comprised of multiple
Ig-like domains and two to nine fibronectin type III
(FNIII) domains, and a cytoplasmic portion having two
tandem phosphatase domains.
Length = 74
Score = 33.9 bits (77), Expect = 0.026
Identities = 21/74 (28%), Positives = 33/74 (44%), Gaps = 14/74 (18%)
Query: 272 ATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYE--RKSSLVLTNAQES 329
AT++C P I+W F + + G + R +L + N++ES
Sbjct: 1 ATMLCAASGNPDPEITW------------FKDFLPVDTTSNGRIKQLRSGALQIENSEES 48
Query: 330 DSGRFYCVAENRAG 343
D G++ CVA N AG
Sbjct: 49 DQGKYECVATNSAG 62
>gnl|CDD|143175 cd04974, Ig3_FGFR, Third immunoglobulin (Ig)-like domain of
fibroblast growth factor receptor (FGFR). Ig3_FGFR:
third immunoglobulin (Ig)-like domain of fibroblast
growth factor receptor (FGFR). Fibroblast growth factors
(FGFs) participate in morphogenesis, development,
angiogenesis, and wound healing. These FGF-stimulated
processes are mediated by four FGFR tyrosine kinases
(FGRF1-4). FGFRs are comprised of an extracellular
portion consisting of three Ig-like domains, a
transmembrane helix, and a cytoplasmic portion having
protein tyrosine kinase activity. The highly conserved
Ig-like domains 2 and 3, and the linker region between
D2 and D3 define a general binding site for FGFs.
Length = 90
Score = 33.9 bits (78), Expect = 0.030
Identities = 20/89 (22%), Positives = 34/89 (38%), Gaps = 8/89 (8%)
Query: 271 NATVVCRVDSIPPAAISWYWNGRLLLNNTAFS----SYQRI-FVIEQGEYERKSS-LVLT 324
+ C+V S I W + +N + + Y + V + +S L L
Sbjct: 3 DVEFHCKVYSDAQPHIQWL--KHVEVNGSKYGPDGLPYVTVLKVAGINTTDNESEVLYLR 60
Query: 325 NAQESDSGRFYCVAENRAGIADANFTLQV 353
N D+G + C+A N G + + L V
Sbjct: 61 NVSFDDAGEYTCLAGNSIGPSHHSAWLTV 89
>gnl|CDD|197688 smart00370, LRR, Leucine-rich repeats, outliers.
Length = 24
Score = 31.9 bits (74), Expect = 0.036
Identities = 13/24 (54%), Positives = 17/24 (70%)
Query: 98 LTNLIEIDLSDNLLTSIPSLTFQS 121
L NL E+DLS+N L+S+P FQ
Sbjct: 1 LPNLRELDLSNNQLSSLPPGAFQG 24
Score = 28.9 bits (66), Expect = 0.42
Identities = 9/19 (47%), Positives = 12/19 (63%)
Query: 52 QVLDMSGNNLQILPKEAFR 70
+ LD+S N L LP AF+
Sbjct: 5 RELDLSNNQLSSLPPGAFQ 23
Score = 26.9 bits (61), Expect = 2.2
Identities = 10/20 (50%), Positives = 15/20 (75%)
Query: 125 LRDLNLARNPISKIEKGAFQ 144
LR+L+L+ N +S + GAFQ
Sbjct: 4 LRELDLSNNQLSSLPPGAFQ 23
>gnl|CDD|197687 smart00369, LRR_TYP, Leucine-rich repeats, typical (most populated)
subfamily.
Length = 24
Score = 31.9 bits (74), Expect = 0.036
Identities = 13/24 (54%), Positives = 17/24 (70%)
Query: 98 LTNLIEIDLSDNLLTSIPSLTFQS 121
L NL E+DLS+N L+S+P FQ
Sbjct: 1 LPNLRELDLSNNQLSSLPPGAFQG 24
Score = 28.9 bits (66), Expect = 0.42
Identities = 9/19 (47%), Positives = 12/19 (63%)
Query: 52 QVLDMSGNNLQILPKEAFR 70
+ LD+S N L LP AF+
Sbjct: 5 RELDLSNNQLSSLPPGAFQ 23
Score = 26.9 bits (61), Expect = 2.2
Identities = 10/20 (50%), Positives = 15/20 (75%)
Query: 125 LRDLNLARNPISKIEKGAFQ 144
LR+L+L+ N +S + GAFQ
Sbjct: 4 LRELDLSNNQLSSLPPGAFQ 23
>gnl|CDD|143178 cd04977, Ig1_NCAM-1_like, First immunoglobulin (Ig)-like domain of
neural cell adhesion molecule NCAM-1 and similar
proteins. Ig1_NCAM-1 like: first immunoglobulin
(Ig)-like domain of neural cell adhesion molecule
NCAM-1. NCAM-1 plays important roles in the development
and regeneration of the central nervous system, in
synaptogenesis and neural migration. NCAM mediates
cell-cell and cell-substratum recognition and adhesion
via homophilic (NCAM-NCAM), and heterophilic
(NCAM-nonNCAM), interactions. NCAM is expressed as three
major isoforms having different intracellular
extensions. The extracellular portion of NCAM has five
N-terminal Ig-like domains and two fibronectin type III
domains. The double zipper adhesion complex model for
NCAM homophilic binding involves the Ig1, Ig2, and Ig3
domains. By this model, Ig1 and Ig2 mediate dimerization
of NCAM molecules situated on the same cell surface (cis
interactions), and Ig3 domains mediate interactions
between NCAM molecules expressed on the surface of
opposing cells (trans interactions), through binding to
the Ig1 and Ig2 domains. The adhesive ability of NCAM is
modulated by the addition of polysialic acid chains to
the fifth Ig-like domain. Also included in this group is
NCAM-2 (also known as OCAM/mamFas II and RNCAM). NCAM-2
is differentially expressed in the developing and mature
olfactory epithelium (OE).
Length = 92
Score = 33.6 bits (77), Expect = 0.044
Identities = 24/82 (29%), Positives = 40/82 (48%), Gaps = 10/82 (12%)
Query: 263 YVEAVSSENATVVCRVDSIPPAAISWYW-NGRLLLNNTAFSSYQRIFVIEQGEYERKSSL 321
E E+ +C+V P ISW+ NG L+ + Q+I V++ + +S+L
Sbjct: 9 QGEISVGESKFFLCQVIG-EPKDISWFSPNGEKLV------TQQQISVVQNDDV--RSTL 59
Query: 322 VLTNAQESDSGRFYCVAENRAG 343
+ NA D+G + CVA + G
Sbjct: 60 TIYNANIEDAGIYKCVATDAKG 81
>gnl|CDD|219514 pfam07686, V-set, Immunoglobulin V-set domain. This domain is
found in antibodies as well as neural protein P0 and
CTL4 amongst others.
Length = 114
Score = 34.1 bits (78), Expect = 0.048
Identities = 19/95 (20%), Positives = 28/95 (29%), Gaps = 15/95 (15%)
Query: 260 DSRYVEAVSSENATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKS 319
R V + T+ C S + S YW + L + R
Sbjct: 7 PPRPVTVAEGGSVTLPCSF-SSSSGSTSVYWYKQPLGKGPELIIHYVTSTPNGKVGPRFK 65
Query: 320 --------------SLVLTNAQESDSGRFYCVAEN 340
SL ++N + SDSG + C N
Sbjct: 66 GRVTLSGNGSKNDFSLTISNLRLSDSGTYTCAVSN 100
>gnl|CDD|188093 TIGR00864, PCC, polycystin cation channel protein. The Polycystin
Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a
huge protein of 4303aas. Its repeated leucine-rich (LRR)
segment is found in many proteins. It contains 16
polycystic kidney disease (PKD) domains, one
LDL-receptor class A domain, one C-type lectin family
domain, and 16-18 putative TMSs in positions between
residues 2200 and 4100. Polycystin-L has been shown to
be a cation (Na+, K+ and Ca2+) channel that is activated
by Ca2+. Two members of the PCC family (polycystin 1 and
2) are mutated in autosomal dominant polycystic kidney
disease, and polycystin-L is deleted in mice with renal
and retinal defects. Note: this model is restricted to
the amino half for technical reasons.
Length = 2740
Score = 36.2 bits (83), Expect = 0.062
Identities = 23/82 (28%), Positives = 35/82 (42%), Gaps = 3/82 (3%)
Query: 178 LNGNRLSHFPVRSVEPLLKLMMIELHDNPWVCDCNMRSIKMWLADKKNVPVQP---ACTG 234
++ N++S L L I+L NP+ CDC + + W +K QP C G
Sbjct: 2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKGVKVRQPEAALCAG 61
Query: 235 PERLSGKVFSDLHADDFACKPE 256
P L+G+ + D C E
Sbjct: 62 PGALAGQPLLGIPLLDSGCDEE 83
>gnl|CDD|143240 cd05763, Ig_1, Subgroup of the immunoglobulin (Ig) superfamily.
Ig_1: subgroup of the immunoglobulin (Ig) domain found
in the Ig superfamily. The Ig superfamily is a
heterogenous group of proteins, built on a common fold
comprised of a sandwich of two beta sheets. Members of
the Ig superfamily are components of immunoglobulin,
neuroglia, cell surface glycoproteins, such as T-cell
receptors, CD2, CD4, CD8, and membrane glycoproteins,
such as butyrophilin and chondroitin sulfate
proteoglycan core protein. A predominant feature of most
Ig domains is a disulfide bridge connecting the two
beta-sheets with a tryptophan residue packed against the
disulfide bond.
Length = 75
Score = 32.6 bits (74), Expect = 0.076
Identities = 20/82 (24%), Positives = 36/82 (43%), Gaps = 8/82 (9%)
Query: 272 ATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQESDS 331
A + C P I+W +G + + +R+ V+ + + + + + D+
Sbjct: 1 ARLECAATGHPTPQIAWQKDGG---TDFPAARERRMHVMPEDD-----VFFIVDVKIEDT 52
Query: 332 GRFYCVAENRAGIADANFTLQV 353
G + C A+N AG AN TL V
Sbjct: 53 GVYSCTAQNTAGSISANATLTV 74
>gnl|CDD|143273 cd05865, Ig1_NCAM-1, First immunoglobulin (Ig)-like domain of
neural cell adhesion molecule NCAM-1. Ig1_NCAM-1: first
immunoglobulin (Ig)-like domain of neural cell adhesion
molecule NCAM-1. NCAM-1 plays important roles in the
development and regeneration of the central nervous
system, in synaptogenesis and neural migration. NCAM
mediates cell-cell and cell-substratum recognition and
adhesion via homophilic (NCAM-NCAM), and heterophilic
(NCAM-nonNCAM), interactions. NCAM is expressed as three
major isoforms having different intracellular
extensions. The extracellular portion of NCAM has five
N-terminal Ig-like domains and two fibronectin type III
domains. The double zipper adhesion complex model for
NCAM homophilic binding involves the Ig1, Ig2, and Ig3
domains. By this model, Ig1 and Ig2 mediate dimerization
of NCAM molecules situated on the same cell surface (cis
interactions), and Ig3 domains mediate interactions
between NCAM molecules expressed on the surface of
opposing cells (trans interactions), through binding to
the Ig1 and Ig2 domains. The adhesive ability of NCAM is
modulated by the addition of polysialic acid chains to
the fifth Ig-like domain.
Length = 96
Score = 32.7 bits (74), Expect = 0.091
Identities = 21/59 (35%), Positives = 28/59 (47%), Gaps = 8/59 (13%)
Query: 286 ISWYW-NGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQESDSGRFYCVAENRAG 343
ISW+ NG L N QRI V+ +Y S+L + NA D+G + CV N
Sbjct: 33 ISWFSPNGEKLTPNQ-----QRISVVRNDDY--SSTLTIYNANIDDAGIYKCVVSNEDE 84
>gnl|CDD|143213 cd05736, Ig2_Follistatin_like, Second immunoglobulin (Ig)-like
domain of a follistatin-like molecule encoded by the
Mahya gene and similar proteins. Ig2_Follistatin_like:
domain similar to the second immunoglobulin (Ig)-like
domain found in a follistatin-like molecule encoded by
the CNS-related Mahya gene. Mahya genes have been
retained in certain Bilaterian branches during
evolution. They are conserved in Hymenoptera and
Deuterostomes, but are absent from other metazoan
species such as fruit fly and nematode. Mahya proteins
are secretory, with a follistatin-like domain
(Kazal-type serine/threonine protease inhibitor domain
and EF-hand calcium-binding domain), two Ig-like
domains, and a novel C-terminal domain. Mahya may be
involved in learning and memory and in processing of
sensory information in Hymenoptera and vertebrates.
Follistatin is a secreted, multidomain protein that
binds activins with high affinity and antagonizes their
signaling.
Length = 76
Score = 32.2 bits (73), Expect = 0.099
Identities = 19/73 (26%), Positives = 38/73 (52%), Gaps = 9/73 (12%)
Query: 272 ATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQESDS 331
A++ C + IP ++W NG + + +++ +I G S L ++N + D+
Sbjct: 1 ASLRCHAEGIPLPRLTWLKNGMDITPKLS----KQLTLIANG-----SELHISNVRYEDT 51
Query: 332 GRFYCVAENRAGI 344
G + C+A+N AG+
Sbjct: 52 GAYTCIAKNEAGV 64
>gnl|CDD|143250 cd05773, Ig8_hNephrin_like, Eighth immunoglobulin-like domain of
nephrin. Ig8_hNephrin_like: domain similar to the
eighth immunoglobulin-like domain in human nephrin.
Nephrin is an integral component of the slit diaphragm,
and is a central component of the glomerular
ultrafilter. Nephrin plays a structural role, and has a
role in signaling. Nephrin is a transmembrane protein
having a short intracellular portion, and an
extracellular portion comprised of eight Ig-like
domains, and one fibronectin type III-like domain. The
extracellular portions of nephrin, from neighboring foot
processes of separate podocyte cells, may interact with
each other, and in association with other components of
the slit diaphragm, form a porous molecular sieve within
the slit pore. The intracellular portion of nephrin is
associated with linker proteins, which connect nephrin
to the actin cytoskeleton. The intracellular portion is
tyrosine phosphorylated, and mediates signaling from the
slit diaphragm into the podocytes.
Length = 109
Score = 33.0 bits (75), Expect = 0.10
Identities = 25/94 (26%), Positives = 34/94 (36%), Gaps = 13/94 (13%)
Query: 268 SSENATVVCRVDSIPPAAISWYWNG-RLLLNNTAFSSYQRIFVIEQGEYE---RKSSLVL 323
S +A +VC+ +P W NG L L N + E E+ S L +
Sbjct: 22 GSSDANLVCQAQGVPRVQFRWAKNGVPLDLGNPRYE--------ETTEHTGTVHTSILTI 73
Query: 324 TNAQES-DSGRFYCVAENRAGIADANFTLQVTYR 356
N + D F C A N G + L T R
Sbjct: 74 INVSAALDYALFTCTAHNSLGEDSLDIQLVSTSR 107
>gnl|CDD|143264 cd05856, Ig2_FGFRL1-like, Second immunoglobulin (Ig)-like domain of
fibroblast growth factor (FGF) receptor_like-1(FGFRL1).
Ig2_FGFRL1-like: second immunoglobulin (Ig)-like domain
of fibroblast growth factor (FGF)
receptor_like-1(FGFRL1). FGFRL1 is comprised of a signal
peptide, three extracellular Ig-like modules, a
transmembrane segment, and a short intracellular domain.
FGFRL1 is expressed preferentially in skeletal tissues.
Similar to FGF receptors, the expressed protein
interacts specifically with heparin and with FGF2.
FGFRL1 does not have a protein tyrosine kinase domain at
its C terminus; neither does its cytoplasmic domain
appear to interact with a signaling partner. It has been
suggested that FGFRL1 may not have any direct signaling
function, but instead acts as a decoy receptor trapping
FGFs and preventing them from binding other receptors.
Length = 82
Score = 31.7 bits (72), Expect = 0.19
Identities = 22/79 (27%), Positives = 33/79 (41%), Gaps = 13/79 (16%)
Query: 276 CRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKS-SLVLTNAQESDSGRF 334
C P I+W + + L E GE +K +L L N + DSG++
Sbjct: 16 CVASGNPRPDITWLKDNKPLTPT------------EIGESRKKKWTLSLKNLKPEDSGKY 63
Query: 335 YCVAENRAGIADANFTLQV 353
C NRAG +A + + V
Sbjct: 64 TCHVSNRAGEINATYKVDV 82
>gnl|CDD|143199 cd05722, Ig1_Neogenin, First immunoglobulin (Ig)-like domain in
neogenin and similar proteins. Ig1_Neogenin: first
immunoglobulin (Ig)-like domain in neogenin and related
proteins. Neogenin is a cell surface protein which is
expressed in the developing nervous system of vertebrate
embryos in the growing nerve cells. It is also expressed
in other embryonic tissues, and may play a general role
in developmental processes such as cell migration,
cell-cell recognition, and tissue growth regulation.
Included in this group is the tumor suppressor protein
DCC, which is deleted in colorectal carcinoma . DCC and
neogenin each have four Ig-like domains followed by six
fibronectin type III domains, a transmembrane domain,
and an intracellular domain.
Length = 95
Score = 32.1 bits (73), Expect = 0.19
Identities = 23/80 (28%), Positives = 33/80 (41%), Gaps = 15/80 (18%)
Query: 266 AVSSENATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTN 325
AV + C + PP I W +G LL S +R + G SL++T+
Sbjct: 11 AVRGGPVVLNCSAEGEPPPKIEWKKDGVLL----NLVSDERRQQLPNG------SLLITS 60
Query: 326 AQES-----DSGRFYCVAEN 340
S D G + CVA+N
Sbjct: 61 VVHSKHNKPDEGFYQCVAQN 80
>gnl|CDD|191413 pfam05970, PIF1, PIF1-like helicase. This family includes
homologues of the PIF1 helicase, which inhibits
telomerase activity and is cell cycle regulated. This
family includes a large number of largely
uncharacterized plant proteins. This family includes a
P-loop motif that is involved in nucleotide binding.
Length = 364
Score = 33.9 bits (78), Expect = 0.21
Identities = 27/116 (23%), Positives = 48/116 (41%), Gaps = 20/116 (17%)
Query: 462 LHSVINISNPDLINDTRKPEGLSPE----PHNDDVLFQNNYWNQNIRQPTNSELGFDSND 517
+ ++++ PD++ ++ P L P N+DV NNY + Q E + S+D
Sbjct: 243 IEAIVSEVYPDIVQNSTDPNYLCERAILCPTNEDVDEINNY---ILSQLPGEEKIYLSSD 299
Query: 518 KTPIIDGVSIGGELDDNYPPDYGLPIVGQGQNELLPNNIHPNAKTLRVWQRGVPVL 573
I + + D YP ++ N L N + + L+V G PV+
Sbjct: 300 S--ISKSDTDIPDDDALYPTEF--------LNSLKANGLPNHVLKLKV---GAPVM 342
>gnl|CDD|143285 cd05877, Ig_LP_like, Immunoglobulin (Ig)-like domain of human
cartilage link protein (LP). Ig_LP_like: immunoglobulin
(Ig)-like domain similar to that that found in human
cartilage link protein (LP). In cartilage,
chondroitin-keratan sulfate proteoglycan (CSPG),
aggrecan, forms cartilage link protein stabilized
aggregates with hyaluronan (HA). These aggregates
contribute to the tissue's load bearing properties.
Aggregates having other CSPGs substituting for aggrecan
may contribute to the structural integrity of many
different tissues. Members of the vertebrate HPLN
(hyaluronan/HA and proteoglycan binding link) protein
family are physically linked adjacent to CSPG genes.
Length = 106
Score = 31.9 bits (73), Expect = 0.22
Identities = 26/108 (24%), Positives = 45/108 (41%), Gaps = 22/108 (20%)
Query: 270 ENATVVCRVDSIPPAA------ISWYW--NGRLLLNNT---------AFSSYQ-RIFVIE 311
N T+ CR P + + W + L + ++ SYQ R+F+
Sbjct: 3 GNVTLPCRYHYEPELSAPRKIRVKWTKLESDYLKEEDVLVAIGTRHKSYGSYQGRVFLRR 62
Query: 312 QGEYERKSSLVLTNAQESDSGRFYCVAENRAGIADANFTLQVTYRGVG 359
+ +SLV+T+ + D GR+ C E G+ D + + + RGV
Sbjct: 63 AHD--LDASLVITDLRLEDYGRYRC--EVIDGLEDESVVVALRLRGVV 106
>gnl|CDD|143214 cd05737, Ig_Myomesin_like_C, C-temrinal immunoglobulin (Ig)-like
domain of myomesin and M-protein. Ig_Myomesin_like_C:
domain similar to the C-temrinal immunoglobulin
(Ig)-like domain of myomesin and M-protein. Myomesin and
M-protein are both structural proteins localized to the
M-band, a transverse structure in the center of the
sarcomere, and are candidates for M-band bridges. Both
proteins are modular, consisting mainly of repetitive
Ig-like and fibronectin type III (FnIII) domains.
Myomesin is expressed in all types of vertebrate
striated muscle; M-protein has a muscle-type specific
expression pattern. Myomesin is present in both slow and
fast fibers; M-protein is present only in fast fibers.
It has been suggested that myomesin acts as a molecular
spring with alternative splicing as a means of modifying
its elasticity.
Length = 92
Score = 31.3 bits (71), Expect = 0.27
Identities = 24/78 (30%), Positives = 37/78 (47%), Gaps = 8/78 (10%)
Query: 276 CRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQESDSGRFY 335
C V P +SW N + L A S + + V EQG+Y +SL + DSG++
Sbjct: 23 CTVFGDPDPEVSWLKNDQAL----ALSDHYNVKV-EQGKY---ASLTIKGVSSEDSGKYG 74
Query: 336 CVAENRAGIADANFTLQV 353
V +N+ G + T+ V
Sbjct: 75 IVVKNKYGGETVDVTVSV 92
>gnl|CDD|219745 pfam08205, C2-set_2, CD80-like C2-set immunoglobulin domain. These
domains belong to the immunoglobulin superfamily.
Length = 89
Score = 31.2 bits (71), Expect = 0.29
Identities = 26/83 (31%), Positives = 30/83 (36%), Gaps = 9/83 (10%)
Query: 264 VEAVSSENATVV--CRV-DSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSS 320
V + EN VV C P I+WY +GR L T S E G Y S+
Sbjct: 7 VSLLEGENLEVVATCSSAGGKPAPRITWYLDGRELEAITTSSE----QDPESGLYTVTST 62
Query: 321 LVLTNAQESDSGR-FYCVAENRA 342
L L D GR C A
Sbjct: 63 LKLV-PSREDHGRSLTCQVSYGA 84
>gnl|CDD|143220 cd05743, Ig_Perlecan_D2_like, Immunoglobulin (Ig)-like domain II
(D2) of the human basement membrane heparan sulfate
proteoglycan perlecan, also known as HSPG2.
Ig_Perlecan_D2_like: the immunoglobulin (Ig)-like domain
II (D2) of the human basement membrane heparan sulfate
proteoglycan perlecan, also known as HSPG2. Perlecan
consists of five domains. Domain I has three putative
heparan sulfate attachment sites; domain II has four LDL
receptor-like repeats, and one Ig-like repeat; domain
III resembles the short arm of laminin chains; domain IV
has multiple Ig-like repeats (21 repeats in human
perlecan); and domain V resembles the globular G domain
of the laminin A chain and internal repeats of EGF.
Perlecan may participate in a variety of biological
functions including cell binding, LDL-metabolism,
basement membrane assembly and selective permeability,
calcium binding, and growth- and neurite-promoting
activities.
Length = 78
Score = 30.9 bits (70), Expect = 0.31
Identities = 19/74 (25%), Positives = 29/74 (39%), Gaps = 9/74 (12%)
Query: 270 ENATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQES 329
E C +P I+W LN R+ + +G Y +L + + +ES
Sbjct: 2 ETVEFTCVATGVPTPIINWR------LNWGHVPDSARVSITSEGGY---GTLTIRDVKES 52
Query: 330 DSGRFYCVAENRAG 343
D G + C A N G
Sbjct: 53 DQGAYTCEAINTRG 66
>gnl|CDD|143302 cd05894, Ig_C5_MyBP-C, C5 immunoglobulin (Ig) domain of cardiac
myosin binding protein C (MyBP-C). Ig_C5_MyBP_C : the
C5 immunoglobulin (Ig) domain of cardiac myosin binding
protein C (MyBP-C). MyBP_C consists of repeated domains,
Ig and fibronectin type 3, and various linkers. Three
isoforms of MYBP_C exist and are included in this group:
cardiac(c), and fast and slow skeletal muscle (s)
MyBP_C. cMYBP_C has insertions between and inside
domains and an additional cardiac-specific Ig domain at
the N-terminus. For cMYBP_C an interaction has been
demonstrated between this C5 domain and the Ig C8
domain.
Length = 86
Score = 31.0 bits (70), Expect = 0.36
Identities = 10/35 (28%), Positives = 16/35 (45%)
Query: 319 SSLVLTNAQESDSGRFYCVAENRAGIADANFTLQV 353
SS V+ A+ D G + N G A+ ++V
Sbjct: 52 SSFVIEGAEREDEGVYTITVTNPVGEDHASLFVKV 86
>gnl|CDD|143284 cd05876, Ig3_L1-CAM, Third immunoglobulin (Ig)-like domain of the
L1 cell adhesion molecule (CAM). Ig3_L1-CAM: third
immunoglobulin (Ig)-like domain of the L1 cell adhesion
molecule (CAM). L1 belongs to the L1 subfamily of cell
adhesion molecules (CAMs) and is comprised of an
extracellular region having six Ig-like domains, five
fibronectin type III domains, a transmembrane region and
an intracellular domain. L1 is primarily expressed in
the nervous system and is involved in its development
and function. L1 is associated with an X-linked
recessive disorder, X-linked hydrocephalus, MASA
syndrome, or spastic paraplegia type 1, that involves
abnormalities of axonal growth. This group also contains
the chicken neuron-glia cell adhesion molecule, Ng-CAM.
Length = 71
Score = 30.3 bits (68), Expect = 0.48
Identities = 22/82 (26%), Positives = 34/82 (41%), Gaps = 14/82 (17%)
Query: 273 TVVCRVDSIPPAAISW-YWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQESDS 331
+ C + +P + W +G L N T + + +L L N ESD
Sbjct: 2 VLECIAEGLPTPEVHWDRIDGPLSPNRTKKLNNNK-------------TLQLDNVLESDD 48
Query: 332 GRFYCVAENRAGIADANFTLQV 353
G + C AEN G A ++T+ V
Sbjct: 49 GEYVCTAENSEGSARHHYTVTV 70
>gnl|CDD|143272 cd05864, Ig2_VEGFR-2, Second immunoglobulin (Ig)-like domain of
vascular endothelial growth factor receptor 2 (VEGFR-2).
Ig2_VEGF-2: Second immunoglobulin (Ig)-like domain of
vascular endothelial growth factor receptor 2 (VEGFR-2).
The VEGFRs have an extracellular component with seven
Ig-like domains, a transmembrane segment, and an
intracellular tyrosine kinase domain interrupted by a
kinase-insert domain. VEGFRs bind VEGFs with high
affinity at the Ig-like domains. VEGFR-2 (KDR/Flk-1) is
a major mediator of the mitogenic, angiogenic and
microvascular permeability-enhancing effects of VEGF-A;
VEGF-A is important to the growth and maintenance of
vascular endothelial cells and to the development of new
blood- and lymphatic-vessels in physiological and
pathological states. VEGF-A also interacts with VEGFR-1,
which it binds more strongly than VEGFR-2. VEGFR-2 and
-1 may mediate a chemotactic and a survival signal in
hematopoietic stem cells or leukemia cells.
Length = 70
Score = 30.3 bits (68), Expect = 0.48
Identities = 16/59 (27%), Positives = 24/59 (40%), Gaps = 14/59 (23%)
Query: 282 PPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQESDSGRFYCVAEN 340
PP + WY NG+L++ N F +R L + E D+G + V N
Sbjct: 11 PPPEVKWYKNGQLIVLNHTF--------------KRGVHLTIYEVTEKDAGNYTVVLTN 55
>gnl|CDD|143262 cd05854, Ig6_Contactin-2, Sixth Ig domain of contactin-2.
Ig6_Contactin-2: Sixth Ig domain of the neural cell
adhesion molecule contactin-2-like. Contactins are
comprised of six Ig domains followed by four fibronectin
type III (FnIII) domains anchored to the membrane by
glycosylphosphatidylinositol. Contactin-2 (TAG-1,
axonin-1) facilitates cell adhesion by homophilic
binding between molecules in apposed membranes. It may
play a part in the neuronal processes of neurite
outgrowth, axon guidance and fasciculation, and neuronal
migration. The first four Ig domains form the
intermolecular binding fragment, which arranges as a
compact U-shaped module by contacts between IG domains 1
and 4, and domains 2 and 3. The different contactins
show different expression patterns in the central
nervous system. During development and in adulthood,
contactin-2 is transiently expressed in subsets of
central and peripheral neurons. Contactin-2 is also
expressed in retinal amacrine cells in the developing
chick retina, corresponding to the period of formation
and maturation of AC proce sses.
Length = 85
Score = 30.4 bits (68), Expect = 0.60
Identities = 24/90 (26%), Positives = 36/90 (40%), Gaps = 15/90 (16%)
Query: 270 ENATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKS------SLVL 323
EN T+ C P +++ W+ L++ G Y R LV+
Sbjct: 1 ENLTLQCHASHDPTMDLTFTWS----LDDFPID-----LDKPNGHYRRMEVKETIGDLVI 51
Query: 324 TNAQESDSGRFYCVAENRAGIADANFTLQV 353
NAQ S +G + C A+ A A+ TL V
Sbjct: 52 VNAQLSHAGTYTCTAQTVVDSASASATLVV 81
>gnl|CDD|143276 cd05868, Ig4_NrCAM, Fourth immunoglobulin (Ig)-like domain of NrCAM
(NgCAM-related cell adhesion molecule). Ig4_ NrCAM:
fourth immunoglobulin (Ig)-like domain of NrCAM
(NgCAM-related cell adhesion molecule). NrCAM belongs to
the L1 subfamily of cell adhesion molecules (CAMs) and
is comprised of an extracellular region having six
IG-like domains and five fibronectin type III domains, a
transmembrane region and an intracellular domain. NrCAM
is primarily expressed in the nervous system.
Length = 76
Score = 30.0 bits (67), Expect = 0.62
Identities = 23/87 (26%), Positives = 36/87 (41%), Gaps = 16/87 (18%)
Query: 270 ENATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERK---SSLVLTNA 326
E+ T++CR + P +ISW NG + I + RK +++ +
Sbjct: 2 EDGTLICRANGNPKPSISWLTNGVPI-------------EIAPTDPSRKVDGDTIIFSKV 48
Query: 327 QESDSGRFYCVAENRAGIADANFTLQV 353
QE S + C A N G AN + V
Sbjct: 49 QERSSAVYQCNASNEYGYLLANAFVNV 75
>gnl|CDD|218711 pfam05709, Sipho_tail, Phage tail protein. This family consists of
several Siphovirus and other phage tail component
proteins as well as some bacterial proteins of unknown
function.
Length = 242
Score = 31.9 bits (73), Expect = 0.73
Identities = 11/45 (24%), Positives = 21/45 (46%)
Query: 507 TNSELGFDSNDKTPIIDGVSIGGELDDNYPPDYGLPIVGQGQNEL 551
T L DS T +++G++ L + P++ G+NE+
Sbjct: 179 TGDVLVIDSATDTVVLNGINTLNGLAIGANTNSDFPVLPPGENEI 223
>gnl|CDD|212460 cd05723, Ig4_Neogenin, Fourth immunoglobulin (Ig)-like domain in
neogenin and similar proteins. Ig4_Neogenin: fourth
immunoglobulin (Ig)-like domain in neogenin and related
proteins. Neogenin is a cell surface protein which is
expressed in the developing nervous system of vertebrate
embryos in the growing nerve cells. It is also expressed
in other embryonic tissues, and may play a general role
in developmental processes such as cell migration,
cell-cell recognition, and tissue growth regulation.
Included in this group is the tumor suppressor protein
DCC, which is deleted in colorectal carcinoma . DCC and
neogenin each have four Ig-like domains followed by six
fibronectin type III domains, a transmembrane domain,
and an intracellular domain.
Length = 71
Score = 29.5 bits (66), Expect = 0.74
Identities = 20/76 (26%), Positives = 32/76 (42%), Gaps = 12/76 (15%)
Query: 276 CRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQESDSGRFY 335
C V P + W NG +++ S Y +I ++ +L + +SD G +
Sbjct: 6 CEVTGKPTPTVKWVKNGDMVIP----SDYFKIV--------KEHNLQVLGLVKSDEGFYQ 53
Query: 336 CVAENRAGIADANFTL 351
C+AEN G A L
Sbjct: 54 CIAENDVGNVQAGAQL 69
>gnl|CDD|143257 cd05849, Ig1_Contactin-1, First Ig domain of contactin-1.
Ig1_Contactin-1: First Ig domain of the neural cell
adhesion molecule contactin-1. Contactins are comprised
of six Ig domains followed by four fibronectin type III
(FnIII) domains anchored to the membrane by
glycosylphosphatidylinositol. Contactin-1 is
differentially expressed in tumor tissues and may,
through a RhoA mechanism, facilitate invasion and
metastasis of human lung adenocarcinoma.
Length = 93
Score = 29.9 bits (67), Expect = 0.81
Identities = 20/86 (23%), Positives = 32/86 (37%), Gaps = 13/86 (15%)
Query: 259 MDSRYVEAVSSENATVVCRVDSIPPAAISWYWN-GRLLLNNTAFSSYQRIFVIEQGEYER 317
+D+ Y E + +V CR + P W N + L N +S VI + +
Sbjct: 9 IDTIYPEESTEGKVSVNCRARANPFPIYKWRKNNLDIDLTNDRYSMVGGNLVINNPDKYK 68
Query: 318 KSSLVLTNAQESDSGRFYCVAENRAG 343
D+GR+ C+ N G
Sbjct: 69 ------------DAGRYVCIVSNIYG 82
>gnl|CDD|143177 cd04976, Ig2_VEGFR, Second immunoglobulin (Ig)-like domain of
vascular endothelial growth factor receptor (VEGFR).
Ig2_VEGFR: Second immunoglobulin (Ig)-like domain of
vascular endothelial growth factor receptor (VEGFR). The
VEGFRs have an extracellular component with seven
Ig-like domains, a transmembrane segment, and an
intracellular tyrosine kinase domain interrupted by a
kinase-insert domain. The VEGFR family consists of three
members, VEGFR-1 (Flt-1), VEGFR-2 (KDR/Flk-1) and
VEGFR-3 (Flt-4). VEGFRs bind VEGFs with high affinity at
the Ig-like domains. VEGF-A is important to the growth
and maintenance of vascular endothelial cells and to the
development of new blood- and lymphatic-vessels in
physiological and pathological states. VEGFR-2 is a
major mediator of the mitogenic, angiogenic and
microvascular permeability-enhancing effects of VEGF-A.
VEGFR-1 may play an inhibitory part in these processes
by binding VEGF and interfering with its interaction
with VEGFR-2. VEGFR-1 has a signaling role in mediating
monocyte chemotaxis. VEGFR-2 and -1 may mediate a
chemotactic and a survival signal in hematopoietic stem
cells or leukemia cells. VEGFR-3 has been shown to be
involved in tumor angiogenesis and growth.
Length = 71
Score = 29.3 bits (66), Expect = 0.87
Identities = 18/71 (25%), Positives = 26/71 (36%), Gaps = 13/71 (18%)
Query: 282 PPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQESDSGRFYCVAENR 341
PP I WY NG+L+ S R SL + + E D+G + V N+
Sbjct: 11 PPPEIQWYKNGKLI------SEKNRTKK-------SGHSLTIKDVTEEDAGNYTVVLTNK 57
Query: 342 AGIADANFTLQ 352
+ T
Sbjct: 58 QAKLEKRLTFT 68
>gnl|CDD|143301 cd05893, Ig_Palladin_C, C-terminal immunoglobulin (Ig)-like domain
of palladin. Ig_Palladin_C: C-terminal immunoglobulin
(Ig)-like domain of palladin. Palladin belongs to the
palladin-myotilin-myopalladin family. Proteins belonging
to this family contain multiple Ig-like domains and
function as scaffolds, modulating actin cytoskeleton.
Palladin binds to alpha-actinin ezrin,
vasodilator-stimulated phosphoprotein VASP, SPIN90 (DIP,
mDia interacting protein), and Src. Palladin also binds
F-actin directly, via its Ig3 domain. Palladin is
expressed as several alternatively spliced isoforms,
having various combinations of Ig-like domains, in a
cell-type-specific manner. It has been suggested that
palladin's different Ig-like domains may be specialized
for distinct functions.
Length = 75
Score = 29.6 bits (66), Expect = 0.91
Identities = 22/78 (28%), Positives = 31/78 (39%), Gaps = 7/78 (8%)
Query: 276 CRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQESDSGRFY 335
CRV +P I W L +NT S + Q L++ A + D+G +
Sbjct: 5 CRVSGVPHPQIFWKKENESLTHNTDRVS------MHQDNCGY-ICLLIQGATKEDAGWYT 57
Query: 336 CVAENRAGIADANFTLQV 353
A+N AGI L V
Sbjct: 58 VSAKNEAGIVSCTARLDV 75
>gnl|CDD|143281 cd05873, Ig_Sema4D_like, Immunoglobulin (Ig)-like domain of the
class IV semaphorin Sema4D. Ig_Sema4D_like;
Immunoglobulin (Ig)-like domain of Sema4D. Sema4D is a
Class IV semaphorin. Semaphorins are classified based on
structural features additional to the Sema domain.
Sema4D has extracellular Sema and Ig domains, a
transmembrane domain, and a short cytoplasmic domain.
Sema4D plays a part in the development of GABAergic
synapses. Sema4D in addition is an immune semaphorin. It
is abundant on resting T cells; its expression is weak
on resting B cells and antigen presenting cells (APCs),
but is upregulated by various stimuli. The receptor used
by Sema4D in the immune system is CD72. Sem4D enhances
the activation of B cells and DCs through binding CD72,
perhaps by reducing CD72s inhibitory signals. The
receptor used by Sema4D in the non-lymphatic tissues is
plexin-B1. Sem4D is anchored to the cell surface but its
extracellular domain can be released from the cell
surface by a metalloprotease-dependent process. Sem4D
may mediate its effects in its membrane bound form,
and/or its cleaved form.
Length = 87
Score = 29.8 bits (67), Expect = 1.0
Identities = 18/72 (25%), Positives = 37/72 (51%), Gaps = 13/72 (18%)
Query: 271 NATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQESD 330
NA + C S A + W ++G++L T S+ ++ + L++ NA E+D
Sbjct: 13 NAELKCSPKS-NLARVVWKFDGKVL---TPESAKYLLY---------RDGLLIFNASEAD 59
Query: 331 SGRFYCVAENRA 342
+GR+ C++ ++
Sbjct: 60 AGRYQCLSVEKS 71
>gnl|CDD|143256 cd05848, Ig1_Contactin-5, First Ig domain of contactin-5.
Ig1_Contactin-5: First Ig domain of the neural cell
adhesion molecule contactin-5. Contactins are comprised
of six Ig domains followed by four fibronectin type III
(FnIII) domains, anchored to the membrane by
glycosylphosphatidylinositol. The different contactins
show different expression patterns in the central
nervous system. In rats, a lack of contactin-5 (NB-2)
results in an impairment of the neuronal activity in the
auditory system. Contactin-5 is expressed specifically
in the postnatal nervous system, peaking at about 3
weeks postnatal. Contactin-5 is highly expressed in the
adult human brain in the occipital lobe and in the
amygdala; lower levels of expression have been detected
in the corpus callosum, caudate nucleus, and spinal
cord.
Length = 94
Score = 29.5 bits (66), Expect = 1.3
Identities = 19/70 (27%), Positives = 27/70 (38%), Gaps = 12/70 (17%)
Query: 276 CRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQES-DSGRF 334
C P W NG S R +I+ +L+++N E DSGR+
Sbjct: 26 CEARGNPVPTYRWLRNG----TEIDTESDYRYSLID-------GNLIISNPSEVKDSGRY 74
Query: 335 YCVAENRAGI 344
C+A N G
Sbjct: 75 QCLATNSIGS 84
>gnl|CDD|143191 cd05714, Ig_CSPGs_LP, Immunoglobulin (Ig)-like domain of
chondroitin sulfate proteoglycans (CSPGs), human
cartilage link protein (LP) and similar proteins.
Ig_CSPGs_LP: immunoglobulin (Ig)-like domain similar to
that found in chondroitin sulfate proteoglycans (CSPGs)
and human cartilage link protein (LP). Included in this
group are the CSPGs aggrecan, versican, and neurocan. In
CSPGs this Ig-like domain is followed by hyaluronan
(HA)-binding tandem repeats, and a C-terminal region
with epidermal growth factor-like, lectin-like, and
complement regulatory protein-like domains. Separating
these N- and C-terminal regions is a nonhomologous
glycosaminoglycan attachment region. In cartilage,
aggrecan forms cartilage link protein stabilized
aggregates with hyaluronan (HA). These aggregates
contribute to the tissue's load bearing properties.
Aggrecan and versican have a wide distribution in
connective tissue and extracellular matrices. Neurocan
is localized almost exclusively in nervous tissue.
Aggregates having other CSPGs substituting for aggrecan
may contribute to the structural integrity of many
different tissues. There is considerable evidence that
HA-binding CSPGs are involved in developmental processes
in the central nervous system. Members of the vertebrate
HPLN (hyaluronan/HA and proteoglycan binding link)
protein family are physically linked adjacent to CSPG
genes.
Length = 106
Score = 29.5 bits (67), Expect = 1.4
Identities = 17/40 (42%), Positives = 25/40 (62%), Gaps = 4/40 (10%)
Query: 320 SLVLTNAQESDSGRFYC-VAENRAGIADANFTLQVTYRGV 358
SLV+T+ + DSGR+ C V + GI D T+++ RGV
Sbjct: 69 SLVITDLRLEDSGRYRCEVID---GIEDEQDTVELEVRGV 105
>gnl|CDD|144887 pfam01463, LRRCT, Leucine rich repeat C-terminal domain. Leucine
Rich Repeats pfam00560 are short sequence motifs present
in a number of proteins with diverse functions and
cellular locations. Leucine Rich Repeats are often
flanked by cysteine rich domains. This domain is often
found at the C-terminus of tandem leucine rich repeats.
Length = 25
Score = 27.2 bits (61), Expect = 1.6
Identities = 10/23 (43%), Positives = 13/23 (56%), Gaps = 1/23 (4%)
Query: 232 CTGPERLSGKVFSDLHADDFACK 254
C GPE L G + S + DF+C
Sbjct: 4 CAGPESLRGPLLSLPPS-DFSCP 25
>gnl|CDD|143231 cd05754, Ig3_Perlecan_like, Third immunoglobulin (Ig)-like domain
found in Perlecan and similar proteins.
Ig3_Perlecan_like: domain similar to the third
immunoglobulin (Ig)-like domain found in Perlecan.
Perlecan is a large multi-domain heparin sulfate
proteoglycan, important in tissue development and
organogenesis. Perlecan can be represented as 5 major
portions; its fourth major portion (domain IV) is a
tandem repeat of immunoglobulin-like domains (Ig2-Ig15),
which can vary in size due to alternative splicing.
Perlecan binds many cellular and extracellular ligands.
Its domain IV region has many binding sites. Some of
these have been mapped at the level of individual
Ig-like domains, including a site restricted to the Ig5
domain for heparin/sulfatide, a site restricted to the
Ig3 domain for nidogen-1 and nidogen-2, a site
restricted to Ig4-5 for fibronectin, and sites
restricted to Ig2 and to Ig13-15 for fibulin-2.
Length = 85
Score = 29.1 bits (65), Expect = 1.7
Identities = 22/93 (23%), Positives = 32/93 (34%), Gaps = 16/93 (17%)
Query: 260 DSRYVEAVSSENATVVCRVDSIPPA-AISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERK 318
+ R E + + +CR S PA + W G L + A
Sbjct: 7 EPRSQEVRPGADVSFICRAKSKSPAYTLVWTRVGGGLPSR-AMDFNGI------------ 53
Query: 319 SSLVLTNAQESDSGRFYCVAENRAGIADANFTL 351
L + N Q SD+G + C N +A TL
Sbjct: 54 --LTIRNVQLSDAGTYVCTGSNMLDTDEATATL 84
>gnl|CDD|185285 PRK15387, PRK15387, E3 ubiquitin-protein ligase SspH2; Provisional.
Length = 788
Score = 31.3 bits (70), Expect = 1.9
Identities = 52/195 (26%), Positives = 84/195 (43%), Gaps = 46/195 (23%)
Query: 36 DKFLITIPEAPESELTQVLDMSGNNLQILPKEAFRRAGLLNLQKLFLARCHIGQIDSGAL 95
D L ++P P + L++SGN L LP GLL L H+ + SG
Sbjct: 231 DNNLTSLPALPPE--LRTLEVSGNQLTSLP---VLPPGLLELSIFSNPLTHLPALPSGLC 285
Query: 96 ------DGLTNLI-------EIDLSDNLLTSIPSLTFQSVRF----------------LR 126
+ LT+L E+ +SDN L S+P+L + + L+
Sbjct: 286 KLWIFGNQLTSLPVLPPGLQELSVSDNQLASLPALPSELCKLWAYNNQLTSLPTLPSGLQ 345
Query: 127 DLNLARNPISKIEKGAFQFVPG-LVKLDMSESRLEHISPEAFTGAKSLESIKLNGNRLSH 185
+L+++ N ++ + +P L KL +RL + P +G K L ++GNRL+
Sbjct: 346 ELSVSDNQLASLPT-----LPSELYKLWAYNNRLTSL-PALPSGLKEL---IVSGNRLTS 396
Query: 186 FPVRSVEPLLKLMMI 200
PV E LK +M+
Sbjct: 397 LPVLPSE--LKELMV 409
>gnl|CDD|219476 pfam07584, BatA, Aerotolerance regulator N-terminal. These
proteins share a highly-conserved sequence at their
N-terminus. They include several proteins from
Rhodopirellula baltica and also several from
proteobacteria. The proteins are produced by the Batl
operon which appears to be important in pathogenicity
and aerotolerance. This family is the conserved
N-terminus, but the full length proteins carry multiple
membrane-spanning domains. BatA ensures bacterial
survival in the early stages of the infection process,
when the infected sites are aerobic, and is produced
under conditions of oxidative stress.
Length = 77
Score = 28.6 bits (65), Expect = 1.9
Identities = 11/27 (40%), Positives = 17/27 (62%)
Query: 373 LALFFLIILILIIIIYLLIRMRTITYP 399
L L+ L++L L II++LL+R R
Sbjct: 8 LLLWGLLLLPLPIILHLLLRRRPRRVK 34
>gnl|CDD|143173 cd04972, Ig_TrkABC_d4, Fourth domain (immunoglobulin-like) of Trk
receptors TrkA, TrkB and TrkC. TrkABC_d4: the fourth
domain of Trk receptors TrkA, TrkB and TrkC, this is an
immunoglobulin (Ig)-like domain which binds to
neurotrophin. The Trk family of receptors are tyrosine
kinase receptors. They are activated by dimerization,
leading to autophosphorylation of intracellular tyrosine
residues, and triggering the signal transduction
pathway. TrkA, TrkB, and TrkC share significant sequence
homology and domain organization. The first three
domains are leucine-rich domains. The fourth and fifth
domains are Ig-like domains playing a part in ligand
binding. TrkA, Band C mediate the trophic effects of the
neurotrophin Nerve growth factor (NGF) family. TrkA is
recognized by NGF. TrKB is recognized by brain-derived
neurotrophic factor (BDNF) and neurotrophin (NT)-4. TrkC
is recognized by NT-3. NT-3 is promiscuous as in some
cell systems it activates TrkA and TrkB receptors. TrkA
is a receptor found in all major NGF targets, including
the sympathetic, trigeminal, and dorsal root ganglia,
cholinergic neurons of the basal forebrain and the
striatum. TrKB transcripts are found throughout multiple
structures of the central and peripheral nervous
systems. The TrkC gene is expressed throughout the
mammalian nervous system.
Length = 90
Score = 29.0 bits (65), Expect = 2.0
Identities = 22/101 (21%), Positives = 32/101 (31%), Gaps = 25/101 (24%)
Query: 260 DSRYVEAVSSENATVVCRVDSIPPAAISWYWNG------RLLLNNTAFSSYQRIFVIEQG 313
++ V AT+ C + P + W G R T Y
Sbjct: 8 NATVVY--EGGTATIRCTAEGSPLPKVEWIIAGLIVIQTRTDTLETTVDIY--------- 56
Query: 314 EYERKSSLVLTNAQESDSGRFYCVAENRAGIADANFTLQVT 354
+L L+N C AEN G A+ ++QVT
Sbjct: 57 ------NLQLSNITSETQTTVTCTAENPVGQANV--SVQVT 89
>gnl|CDD|197684 smart00365, LRR_SD22, Leucine-rich repeat, SDS22-like subfamily.
Length = 22
Score = 26.9 bits (61), Expect = 2.3
Identities = 10/17 (58%), Positives = 12/17 (70%)
Query: 98 LTNLIEIDLSDNLLTSI 114
LTNL E+DL DN + I
Sbjct: 1 LTNLEELDLGDNKIKKI 17
>gnl|CDD|225994 COG3463, COG3463, Predicted membrane protein [Function unknown].
Length = 458
Score = 30.9 bits (70), Expect = 2.3
Identities = 28/121 (23%), Positives = 42/121 (34%), Gaps = 8/121 (6%)
Query: 360 LPFLGGGHINGISLALFFLIILILIIIIYLLIRMRTITY-PNSKNPAQIEVMANGNAH-- 416
LPFL G + GIS ILI II+I +L + I Y P + + +E A N
Sbjct: 305 LPFLFLGALYGISKIKSVKKILIKIILIGILASLALIPYTPIAPHSPFVEQGAMINLAVS 364
Query: 417 AVVNKTPSLTPVIETSSFTERKQFPPPSYHSTEMISPNGQLPNKTLHSVINISNPDLIND 476
V+ + +I K + + + N S L+N
Sbjct: 365 KVIPGKEASFELIAII-----KDSKGYLLTINNLYPVFANDFDAYVLPKNNNSRVYLVNL 419
Query: 477 T 477
Sbjct: 420 E 420
>gnl|CDD|224190 COG1271, CydA, Cytochrome bd-type quinol oxidase, subunit 1 [Energy
production and conversion].
Length = 457
Score = 30.7 bits (70), Expect = 2.4
Identities = 12/40 (30%), Positives = 21/40 (52%), Gaps = 2/40 (5%)
Query: 355 YRGVGLPFLGGGHINGISLALFFLIILILIII-IYLLIRM 393
V + G + SL LF ++ +L+I +YLL+R+
Sbjct: 397 KTDVVSSAVTAGSV-LFSLILFMVLYTVLLIAEVYLLLRL 435
>gnl|CDD|143271 cd05863, Ig2_VEGFR-3, Second immunoglobulin (Ig)-like domain of
vascular endothelial growth factor receptor 3 (VEGFR-3).
Ig2_VEGFR-3: Second immunoglobulin (Ig)-like domain of
vascular endothelial growth factor receptor 3 (VEGFR-3).
The VEGFRs have an extracellular component with seven
Ig-like domains, a transmembrane segment, and an
intracellular tyrosine kinase domain interrupted by a
kinase-insert domain. VEGFRs bind VEGFs with high
affinity at the Ig-like domains. VEGFR-3 (Flt-4) binds
two members of the VEGF family (VEGF-C and -D) and is
involved in tumor angiogenesis and growth.
Length = 67
Score = 28.0 bits (62), Expect = 2.5
Identities = 15/77 (19%), Positives = 30/77 (38%), Gaps = 17/77 (22%)
Query: 277 RVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQESDSGRFYC 336
+V + PP WY +G+L+ + SL + + E+ +G +
Sbjct: 6 KVAAYPPPEFQWYKDGKLISGK-----------------HSQHSLQIKDVTEASAGTYTL 48
Query: 337 VAENRAGIADANFTLQV 353
V N A + +L++
Sbjct: 49 VLWNSAAGLEKRISLEL 65
>gnl|CDD|218858 pfam06024, DUF912, Nucleopolyhedrovirus protein of unknown function
(DUF912). This family consists of several
Nucleopolyhedrovirus proteins of unknown function.
Length = 101
Score = 28.4 bits (64), Expect = 3.0
Identities = 13/45 (28%), Positives = 24/45 (53%), Gaps = 1/45 (2%)
Query: 363 LGGGHINGISLALFFLIILILIIIIYLLIRMRTITYPNSKNPAQI 407
G+I I L FF ++++L I Y +I +R ++ NP+ +
Sbjct: 58 ANAGNIILIGLLAFFCVLVLLYAIYYFVI-LRERRKYSTNNPSYV 101
>gnl|CDD|143197 cd05720, Ig_CD8_alpha, Immunoglobulin (Ig) like domain of CD8 alpha
chain. Ig_CD8_alpha: immunoglobulin (Ig)-like domain in
CD8 alpha. The CD8 glycoprotein plays an essential role
in the control of T-cell selection, maturation and the
T-cell receptor (TCR)-mediated response to peptide
antigen. CD8 is comprised of alpha and beta subunits and
is expressed as either an alphaalpha or alphabeta dimer.
Both dimeric isoforms can serve as a coreceptor for T
cell activation and differentiation, however they have
distinct physiological roles, different cellular
distributions, unique binding partners etc. Each CD8
subunit is comprised of an extracellular domain
containing a v-type Ig-like domain, a single pass
transmembrane portion and a short intracellular domain.
The Ig domain of CD8 alpha binds to antibodies.
Length = 104
Score = 28.6 bits (64), Expect = 3.5
Identities = 17/79 (21%), Positives = 29/79 (36%), Gaps = 13/79 (16%)
Query: 273 TVVCRVDSIPPAAISWYWNGRLLLNNTAF-----SSYQRIFVIEQGEYER----KSS--- 320
+ C V + P SW + F S + + E+ +R +SS
Sbjct: 10 ELKCEVLNSSPTGCSWLFQPPGSAPQPTFLVYLSGSSKITWDEEELSSKRFSGSRSSNSF 69
Query: 321 -LVLTNAQESDSGRFYCVA 338
L L N Q+ + G ++C
Sbjct: 70 VLTLKNFQKENEGYYFCSV 88
>gnl|CDD|143172 cd04971, Ig_TrKABC_d5, Fifth domain (immunoglobulin-like) of Trk
receptors TrkA, TrkB and TrkC. TrkABC_d5: the fifth
domain of Trk receptors TrkA, TrkB and TrkC, this is an
immunoglobulin (Ig)-like domain which binds to
neurotrophin. The Trk family of receptors are tyrosine
kinase receptors. They are activated by dimerization,
leading to autophosphorylation of intracellular tyrosine
residues, and triggering the signal transduction
pathway. TrkA, TrkB, and TrkC share significant sequence
homology and domain organization. The first three
domains are leucine-rich domains. The fourth and fifth
domains are Ig-like domains playing a part in ligand
binding. TrkA, Band C mediate the trophic effects of the
neurotrophin Nerve growth factor (NGF) family. TrkA is
recognized by NGF. TrkB is recognized by brain-derived
neurotrophic factor (BDNF) and neurotrophin (NT)-4. TrkC
is recognized by NT-3. NT-3 is promiscuous as in some
cell systems it activates TrkA and TrkB receptors. TrkA
is a receptor found in all major NGF targets, including
the sympathetic, trigeminal, and dorsal root ganglia,
cholinergic neurons of the basal forebrain and the
striatum. TrKB transcripts are found throughout multiple
structures of the central and peripheral nervous
systems. The TrkC gene is expressed throughout the
mammalian nervous system.
Length = 81
Score = 27.7 bits (62), Expect = 3.5
Identities = 16/71 (22%), Positives = 24/71 (33%), Gaps = 2/71 (2%)
Query: 278 VDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQESDSGRFYCV 337
V P ++WY NG +L + I E L N ++G + V
Sbjct: 7 VRGNPKPTLTWYHNGAVLNESD--YIRTEIHYEVTTPTEYHGCLQFDNPTHVNNGNYTLV 64
Query: 338 AENRAGIADAN 348
A N G +
Sbjct: 65 ASNEYGQDSKS 75
>gnl|CDD|143229 cd05752, Ig1_FcgammaR_like, Frst immunoglobulin (Ig)-like domain of
Fcgamma-receptors (FcgammaRs) and similar proteins.
Ig1_FcgammaR_like: domain similar to the first
immunoglobulin (Ig)-like domain of Fcgamma-receptors
(FcgammaRs). Interactions between IgG and FcgammaR are
important to the initiation of cellular and humoral
response. IgG binding to FcgammaR leads to a cascade of
signals and ultimately to functions such as
antibody-dependent-cellular-cytotoxicity (ADCC),
endocytosis, phagocytosis, release of inflammatory
mediators, etc. FcgammaR has two Ig-like domains. This
group also contains FcepsilonRI, which binds IgE with
high affinity.
Length = 78
Score = 27.7 bits (62), Expect = 3.6
Identities = 16/49 (32%), Positives = 22/49 (44%), Gaps = 4/49 (8%)
Query: 270 ENATVVCRVDSIP-PAAISWYWNGRLLLNNTAFSSYQRIFVIEQ-GEYE 316
E T+ C + P + WY NG+LL T +SY+ GEY
Sbjct: 16 EKVTLTCNGFNSPEQNSTQWYHNGKLLETTT--NSYRIRAANNDSGEYR 62
>gnl|CDD|144411 pfam00802, Glycoprotein_G, Pneumovirus attachment glycoprotein G.
This family includes attachment proteins from
respiratory synctial virus. Glycoprotein G has not been
shown to have any neuraminidase or hemagglutinin
activity. The amino terminus is thought to be
cytoplasmic, and the carboxyl terminus extracellular.
The extracellular region contains four completely
conserved cysteine residues.
Length = 263
Score = 29.7 bits (66), Expect = 3.7
Identities = 23/98 (23%), Positives = 36/98 (36%), Gaps = 17/98 (17%)
Query: 372 SLALFFLIILILIIIIYLLIRMRTITYPNSKNPAQIEVMANGNAHAVVNKTPSLTPVIET 431
SLA L IL +II L+I A I +++ N TP+ TP +
Sbjct: 28 SLAQIALSILAMIISTSLII-------------AAIIFISSANHKV----TPTTTPTQQI 70
Query: 432 SSFTERKQFPPPSYHSTEMISPNGQLPNKTLHSVINIS 469
++ + + H+ SP+ Q L I
Sbjct: 71 TNQIQNHTSTYLTQHNQLSTSPSNQSTTTPLIHTILDD 108
>gnl|CDD|164750 MTH00204, ND4, NADH dehydrogenase subunit 4; Provisional.
Length = 485
Score = 30.0 bits (68), Expect = 3.7
Identities = 11/31 (35%), Positives = 22/31 (70%), Gaps = 2/31 (6%)
Query: 368 INGISLALFFLIILILIIIIYLLIRMRTITY 398
++G+SL FF+++ L+I I +LI ++I +
Sbjct: 81 VDGVSL--FFILLTTLLIPICILISWKSIKF 109
>gnl|CDD|130784 TIGR01723, hmd_TIGR, 5,10-methenyltetrahydromethanopterin
hydrogenase. This model represents a clade of
authenticated coenzyme
N(5),N(10)-methenyltetrahydromethanopterin reductases.
This enzyme does not use F420. This enzyme acts in
methanogenesis and as such is restricted to methanogenic
archaeal species. This clade is one of two clades in
Pfam model pfam03201 [Energy metabolism,
Methanogenesis].
Length = 340
Score = 29.9 bits (67), Expect = 4.3
Identities = 17/52 (32%), Positives = 22/52 (42%), Gaps = 7/52 (13%)
Query: 228 VQPACTGPERLSGKVFSDLHADDF-------ACKPEIRMDSRYVEAVSSENA 272
V ACT P K+F DL +D C PE++ E +SE A
Sbjct: 170 VTHACTIPTTKFAKIFEDLGREDLNVTSYHPGCVPEMKGQVYIAEGYASEEA 221
>gnl|CDD|226560 COG4074, Mth, H2-forming N5,N10-methylenetetrahydromethanopterin
dehydrogenase [Energy production and conversion].
Length = 343
Score = 29.9 bits (67), Expect = 4.3
Identities = 15/52 (28%), Positives = 21/52 (40%), Gaps = 7/52 (13%)
Query: 228 VQPACTGPERLSGKVFSDLHADDF-------ACKPEIRMDSRYVEAVSSENA 272
V ACT P K+F D+ +D PE++ E +SE A
Sbjct: 170 VTHACTIPTTKFKKIFEDMGREDLNVTSYHPGTVPEMKGQVYIAEGYASEEA 221
>gnl|CDD|143234 cd05757, Ig2_IL1R_like, Second immunoglobulin (Ig)-like domain of
interleukin-1 receptor (IL1R) and similar proteins.
Ig2_IL1R_like: domain similar to the second
immunoglobulin (Ig)-like domain of interleukin-1
receptor (IL1R). IL-1 alpha and IL-1 beta are cytokines
which participate in the regulation of inflammation,
immune responses, and hematopoiesis. These cytokines
bind to the IL-1 receptor type 1 (IL1R1), which is
activated on additional association with an accessory
protein, IL1RAP. IL-1 also binds a second receptor
designated type II (IL1R2). Mature IL1R1 consists of
three IG-like domains, a transmembrane domain, and a
large cytoplasmic domain. Mature IL1R2 is organized
similarly except that it has a short cytoplasmic domain.
The latter does not initiate signal transduction. A
naturally occurring cytokine IL-1RA (IL-1 receptor
antagonist) is widely expressed and binds to IL-1
receptors, inhibiting the binding of IL-1 alpha and IL-1
beta. This group also contains ILIR-like 1 (IL1R1L)
which maps to the same chromosomal location as IL1R1 and
IL1R2.
Length = 92
Score = 28.1 bits (63), Expect = 4.4
Identities = 18/86 (20%), Positives = 33/86 (38%), Gaps = 22/86 (25%)
Query: 260 DSRYVEAVSSENATVVC-------RVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQ 312
S S++ +VC +++PP + WY + +LL R +
Sbjct: 1 ISYKQILFSTKGGKIVCPDLDDFKNENTLPP--VQWYKDCKLLEG-------DRKRFV-- 49
Query: 313 GEYERKSSLVLTNAQESDSGRFYCVA 338
+ S L++ N E D+G + C
Sbjct: 50 ----KGSKLLIQNVTEEDAGNYTCKL 71
>gnl|CDD|216560 pfam01544, CorA, CorA-like Mg2+ transporter protein. The CorA
transport system is the primary Mg2+ influx system of
Salmonella typhimurium and Escherichia coli. CorA is
virtually ubiquitous in the Bacteria and Archaea. There
are also eukaryotic relatives of this protein. The
family includes the MRS2 protein from yeast that is
thought to be an RNA splicing protein. However its
membership of this family suggests that its effect on
splicing is due to altered magnesium levels in the cell.
Length = 291
Score = 29.2 bits (66), Expect = 5.6
Identities = 9/34 (26%), Positives = 17/34 (50%), Gaps = 2/34 (5%)
Query: 359 GLPFLGGGHINGISLALFFLIILILIIIIYLLIR 392
G+P L + G + ++L+L I++ L R
Sbjct: 259 GMPELDWPY--GYPFWIVLGLMLLLAILLILYFR 290
>gnl|CDD|100598 PRK00561, ppnK, inorganic polyphosphate/ATP-NAD kinase;
Provisional.
Length = 259
Score = 29.1 bits (65), Expect = 5.9
Identities = 13/51 (25%), Positives = 19/51 (37%)
Query: 76 NLQKLFLARCHIGQIDSGALDGLTNLIEIDLSDNLLTSIPSLTFQSVRFLR 126
A C + I++G L T+ E DL N + L F + L
Sbjct: 49 TAANYNCAGCKVVGINTGHLGFYTSFNETDLDQNFANKLDQLKFTQIDLLE 99
>gnl|CDD|216842 pfam02009, Rifin_STEVOR, Rifin/stevor family. Several multicopy
gene families have been described in Plasmodium
falciparum, including the stevor family of subtelomeric
open reading frames and the rif interspersed repetitive
elements. Both families contain three predicted
transmembrane segments. It has been proposed that stevor
and rif are members of a larger superfamily that code
for variant surface antigens.
Length = 290
Score = 28.9 bits (65), Expect = 6.9
Identities = 11/27 (40%), Positives = 21/27 (77%), Gaps = 2/27 (7%)
Query: 368 INGISLALFFLIILILIIIIYLLIRMR 394
I ++A+ LII+++++IIYL++R R
Sbjct: 249 IYASAIAI--LIIVLVMLIIYLILRYR 273
>gnl|CDD|214338 CHL00025, ndhF, NADH dehydrogenase subunit 5.
Length = 741
Score = 29.1 bits (66), Expect = 7.8
Identities = 12/37 (32%), Positives = 22/37 (59%), Gaps = 1/37 (2%)
Query: 357 GVGLPFLGGGHING-ISLALFFLIILILIIIIYLLIR 392
G G+ ++GGG I+ + L LF++ I +LI+ +
Sbjct: 705 GEGIKYVGGGRISSYLFLYLFYVSIFLLILYFFFSFI 741
>gnl|CDD|177215 MTH00158, ATP8, ATP synthase F0 subunit 8; Provisional.
Length = 32
Score = 25.5 bits (57), Expect = 7.9
Identities = 8/20 (40%), Positives = 12/20 (60%)
Query: 368 INGISLALFFLIILILIIII 387
+N + L + FLI IL I+
Sbjct: 7 MNWLILFILFLITFILFNIL 26
>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
Provisional.
Length = 2849
Score = 29.2 bits (65), Expect = 8.0
Identities = 35/163 (21%), Positives = 60/163 (36%), Gaps = 32/163 (19%)
Query: 133 NPISKIEKGAFQFVPGLVK---LDMSESRLEHISPEAFTGAKSL------ESI--KLNGN 181
NP E+ A V M + EHI + + +SL E+I +
Sbjct: 1305 NPEQIFEELAGNESNDDVTGAPCPMGDIDAEHIIGDDYDTFESLSDELLEETITNDIESL 1364
Query: 182 RLSHFPVRSVEPLLK----LMMIELHDNPWVCDCNMRSIKMWLADKKNVPVQPACTGPER 237
F +++ LK L ++HDN +CD + KKN+ V PE
Sbjct: 1365 EAKDFEQYTLKVNLKAPKLLKPAKIHDNEHLCDFS----------KKNLIV------PEP 1408
Query: 238 LSGKVFSDLHADDFACKPEIR-MDSRYVEAVSSENATVVCRVD 279
L + + D C ++ +D+ YV+ + + A +
Sbjct: 1409 LKEEEELGGNPPDIHCYAALKPLDTLYVKCPTEKAAYEAAKGK 1451
>gnl|CDD|220496 pfam09972, DUF2207, Predicted membrane protein (DUF2207). This
domain, found in various hypothetical bacterial
proteins, has no known function.
Length = 503
Score = 28.9 bits (65), Expect = 8.6
Identities = 5/47 (10%), Positives = 18/47 (38%), Gaps = 1/47 (2%)
Query: 357 GVGLPFLGGGHI-NGISLALFFLIILILIIIIYLLIRMRTITYPNSK 402
+ + L I ++ + I+L++ +I ++ + +
Sbjct: 404 TLIILILSFILISLVLAALVLLAIVLVIGSVIAAILPRKLFGRWTPE 450
>gnl|CDD|224039 COG1114, BrnQ, Branched-chain amino acid permeases [Amino acid
transport and metabolism].
Length = 431
Score = 28.7 bits (65), Expect = 8.7
Identities = 13/42 (30%), Positives = 18/42 (42%), Gaps = 13/42 (30%)
Query: 357 GVGLPFLG-------GGHINGIS------LALFFLIILILII 385
GVGLP LG GG + ++ + F I + L I
Sbjct: 49 GVGLPLLGIIAVALYGGGVESLATRIGPWFGVLFAIAIYLSI 90
>gnl|CDD|133063 cd06913, beta3GnTL1_like, Beta 1, 3-N-acetylglucosaminyltransferase
is essential for the formation of
poly-N-acetyllactosamine . This family includes human
Beta3GnTL1 and related eukaryotic proteins. Human
Beta3GnTL1 is a putative
beta-1,3-N-acetylglucosaminyltransferase. Beta3GnTL1 is
expressed at various levels in most of tissues examined.
Beta 1, 3-N-acetylglucosaminyltransferase has been found
to be essential for the formation of
poly-N-acetyllactosamine. Poly-N-acetyllactosamine is a
unique carbohydrate composed of N-acetyllactosamine
repeats. It is often an important part of
cell-type-specific oligosaccharide structures and some
functional oligosaccharides. It has been shown that the
structure and biosynthesis of poly-N-acetyllactosamine
display a dramatic change during development and
oncogenesis. Several members of beta-1,
3-N-acetylglucosaminyltransferase have been identified.
Length = 219
Score = 28.6 bits (64), Expect = 9.1
Identities = 19/71 (26%), Positives = 31/71 (43%), Gaps = 7/71 (9%)
Query: 239 SGKVFSDLHADDFACKPEIRMDSRYVEAVSSENATVVCRVDSIPPAAISWY--WNGRL-- 294
SG+ L +DD IR+ +Y A+ N+ + C+V IP + Y W L
Sbjct: 84 SGRYLCFLDSDDVMMPQRIRL--QYEAALQHPNSIIGCQVRRIPEDSTERYTRWINTLTR 141
Query: 295 -LLNNTAFSSY 304
L ++S+
Sbjct: 142 EQLLTQVYTSH 152
>gnl|CDD|220767 pfam10459, Peptidase_S46, Peptidase S46. Dipeptidyl-peptidase 7
(DPP-7) is the best characterized member of this family.
It is a serine peptidase that is located on the cell
surface and is predicted to have two N-terminal
transmembrane domains.
Length = 696
Score = 29.1 bits (66), Expect = 9.2
Identities = 8/18 (44%), Positives = 10/18 (55%)
Query: 338 AENRAGIADANFTLQVTY 355
R DAN TL++TY
Sbjct: 543 KSGRPVYPDANSTLRLTY 560
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.319 0.137 0.412
Gapped
Lambda K H
0.267 0.0637 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 30,992,313
Number of extensions: 3062624
Number of successful extensions: 5013
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4897
Number of HSP's successfully gapped: 173
Length of query: 600
Length of database: 10,937,602
Length adjustment: 102
Effective length of query: 498
Effective length of database: 6,413,494
Effective search space: 3193920012
Effective search space used: 3193920012
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 62 (27.9 bits)