RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy8139
(1101 letters)
>gnl|CDD|191810 pfam07679, I-set, Immunoglobulin I-set domain.
Length = 90
Score = 96.2 bits (240), Expect = 7e-24
Identities = 34/88 (38%), Positives = 42/88 (47%)
Query: 453 PSFIRALHDTTALEDEKVEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLII 512
P F + D E E FT V G P P VSW+KDG + SS R ++ + TL I
Sbjct: 1 PKFTQKPKDVEVQEGESARFTCTVTGDPDPTVSWFKDGQPLRSSDRFKVTYEGGTYTLTI 60
Query: 513 HQAALMDEGEIKCTATNRAGHSITKARL 540
DEG+ C ATN AG + A L
Sbjct: 61 SNVQPDDEGKYTCVATNSAGEAEASAEL 88
Score = 68.1 bits (167), Expect = 6e-14
Identities = 27/76 (35%), Positives = 40/76 (52%)
Query: 563 GEIIKLKVSMAGMPPPTARWLHNGEPLTSGGRYEITHTDRYLNLRISDARRADRGEYQAH 622
GE + ++ G P PT W +G+PL S R+++T+ L IS+ + D G+Y
Sbjct: 15 GESARFTCTVTGDPDPTVSWFKDGQPLRSSDRFKVTYEGGTYTLTISNVQPDDEGKYTCV 74
Query: 623 GVNSLGEDVASFLVTV 638
NS GE AS +TV
Sbjct: 75 ATNSAGEAEASAELTV 90
>gnl|CDD|238020 cd00063, FN3, Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein
fibronectin. Its tenth fibronectin type III repeat
contains an RGD cell recognition sequence in a flexible
loop between 2 strands. Approximately 2% of all animal
proteins contain the FN3 repeat; including extracellular
and intracellular proteins, membrane spanning cytokine
receptors, growth hormone receptors, tyrosine
phosphatase receptors, and adhesion molecules. FN3-like
domains are also found in bacterial glycosyl hydrolases.
Length = 93
Score = 85.6 bits (212), Expect = 5e-20
Identities = 38/100 (38%), Positives = 52/100 (52%), Gaps = 8/100 (8%)
Query: 345 PGPPGKPQLLPDSPSLDRDVFTIRWEPPEYDGGSPVLGYLVEHRRTGSPYWVRASPHMVE 404
P PP ++ + T+ W PPE DGG P+ GY+VE+R GS W
Sbjct: 1 PSPPTNLRVTDVTS----TSVTLSWTPPEDDGG-PITGYVVEYREKGSGDWKEVEVTPGS 55
Query: 405 DTELMVSGLEPGWRYQFRITAENVVGFSEPGPLSEPLTVT 444
+T ++GL+PG Y+FR+ A N G S P SE +TVT
Sbjct: 56 ETSYTLTGLKPGTEYEFRVRAVNGGGESPP---SESVTVT 92
Score = 85.6 bits (212), Expect = 5e-20
Identities = 38/100 (38%), Positives = 52/100 (52%), Gaps = 8/100 (8%)
Query: 684 PGPPGKPQLLPDSPSLDRDVFTIRWEPPEYDGGSPVLGYLVEHRRTGSPYWVRASPHMVE 743
P PP ++ + T+ W PPE DGG P+ GY+VE+R GS W
Sbjct: 1 PSPPTNLRVTDVTS----TSVTLSWTPPEDDGG-PITGYVVEYREKGSGDWKEVEVTPGS 55
Query: 744 DTELMVSGLEPGWRYQFRITAENVVGFSEPGPLSEPLTVT 783
+T ++GL+PG Y+FR+ A N G S P SE +TVT
Sbjct: 56 ETSYTLTGLKPGTEYEFRVRAVNGGGESPP---SESVTVT 92
>gnl|CDD|214495 smart00060, FN3, Fibronectin type 3 domain. One of three types of
internal repeat within the plasma protein, fibronectin.
The tenth fibronectin type III repeat contains a RGD
cell recognition sequence in a flexible loop between 2
strands. Type III modules are present in both
extracellular and intracellular proteins.
Length = 83
Score = 64.9 bits (158), Expect = 6e-13
Identities = 29/89 (32%), Positives = 42/89 (47%), Gaps = 7/89 (7%)
Query: 345 PGPPGKPQLLPDSPSLDRDVFTIRWEPPEYDGG-SPVLGYLVEHRRTGSPYWVRASPHMV 403
P PP ++ + T+ WEPP DG ++GY VE+R GS W +
Sbjct: 1 PSPPSNLRVTDVTS----TSVTLSWEPPPDDGITGYIVGYRVEYREEGSE-WKEVNVT-P 54
Query: 404 EDTELMVSGLEPGWRYQFRITAENVVGFS 432
T ++GL+PG Y+FR+ A N G
Sbjct: 55 SSTSYTLTGLKPGTEYEFRVRAVNGAGEG 83
Score = 64.9 bits (158), Expect = 6e-13
Identities = 29/89 (32%), Positives = 42/89 (47%), Gaps = 7/89 (7%)
Query: 684 PGPPGKPQLLPDSPSLDRDVFTIRWEPPEYDGG-SPVLGYLVEHRRTGSPYWVRASPHMV 742
P PP ++ + T+ WEPP DG ++GY VE+R GS W +
Sbjct: 1 PSPPSNLRVTDVTS----TSVTLSWEPPPDDGITGYIVGYRVEYREEGSE-WKEVNVT-P 54
Query: 743 EDTELMVSGLEPGWRYQFRITAENVVGFS 771
T ++GL+PG Y+FR+ A N G
Sbjct: 55 SSTSYTLTGLKPGTEYEFRVRAVNGAGEG 83
>gnl|CDD|214653 smart00410, IG_like, Immunoglobulin like. IG domains that cannot
be classified into one of IGv1, IGc1, IGc2, IG.
Length = 85
Score = 64.4 bits (157), Expect = 1e-12
Identities = 26/82 (31%), Positives = 36/82 (43%), Gaps = 1/82 (1%)
Query: 461 DTTALEDEKVEFTVQVEGIPTPKVSWYKDGFE-IFSSRRQRIVTDNDISTLIIHQAALMD 519
T E E V + + G P P+V+WYK G + + S R + STL I D
Sbjct: 3 SVTVKEGESVTLSCEASGSPPPEVTWYKQGGKLLAESGRFSVSRSGSTSTLTISNVTPED 62
Query: 520 EGEIKCTATNRAGHSITKARLR 541
G C ATN +G + + L
Sbjct: 63 SGTYTCAATNSSGSASSGTTLT 84
Score = 57.9 bits (140), Expect = 2e-10
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 1/77 (1%)
Query: 563 GEIIKLKVSMAGMPPPTARWLHNG-EPLTSGGRYEITHTDRYLNLRISDARRADRGEYQA 621
GE + L +G PPP W G + L GR+ ++ + L IS+ D G Y
Sbjct: 9 GESVTLSCEASGSPPPEVTWYKQGGKLLAESGRFSVSRSGSTSTLTISNVTPEDSGTYTC 68
Query: 622 HGVNSLGEDVASFLVTV 638
NS G + +TV
Sbjct: 69 AATNSSGSASSGTTLTV 85
>gnl|CDD|214652 smart00409, IG, Immunoglobulin.
Length = 85
Score = 64.4 bits (157), Expect = 1e-12
Identities = 26/82 (31%), Positives = 36/82 (43%), Gaps = 1/82 (1%)
Query: 461 DTTALEDEKVEFTVQVEGIPTPKVSWYKDGFE-IFSSRRQRIVTDNDISTLIIHQAALMD 519
T E E V + + G P P+V+WYK G + + S R + STL I D
Sbjct: 3 SVTVKEGESVTLSCEASGSPPPEVTWYKQGGKLLAESGRFSVSRSGSTSTLTISNVTPED 62
Query: 520 EGEIKCTATNRAGHSITKARLR 541
G C ATN +G + + L
Sbjct: 63 SGTYTCAATNSSGSASSGTTLT 84
Score = 57.9 bits (140), Expect = 2e-10
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 1/77 (1%)
Query: 563 GEIIKLKVSMAGMPPPTARWLHNG-EPLTSGGRYEITHTDRYLNLRISDARRADRGEYQA 621
GE + L +G PPP W G + L GR+ ++ + L IS+ D G Y
Sbjct: 9 GESVTLSCEASGSPPPEVTWYKQGGKLLAESGRFSVSRSGSTSTLTISNVTPEDSGTYTC 68
Query: 622 HGVNSLGEDVASFLVTV 638
NS G + +TV
Sbjct: 69 AATNSSGSASSGTTLTV 85
>gnl|CDD|143225 cd05748, Ig_Titin_like, Immunoglobulin (Ig)-like domain of titin
and similar proteins. Ig_Titin_like: immunoglobulin
(Ig)-like domain found in titin-like proteins. Titin
(also called connectin) is a fibrous sarcomeric protein
specifically found in vertebrate striated muscle. Titin
is gigantic; depending on isoform composition it ranges
from 2970 to 3700 kDa, and is of a length that spans
half a sarcomere. Titin largely consists of multiple
repeats of Ig-like and fibronectin type 3 (FN-III)-like
domains. Titin connects the ends of myosin thick
filaments to Z disks and extends along the thick
filament to the H zone. It appears to function
similarly to an elastic band, keeping the myosin
filaments centered in the sarcomere during muscle
contraction or stretching. Within the sarcomere, titin
is also attached to or is associated with myosin binding
protein C (MyBP-C). MyBP-C appears to contribute to the
generation of passive tension by titin, and similar to
titin has repeated Ig-like and FN-III domains. Also
included in this group are worm twitchin and insect
projectin, thick filament proteins of invertebrate
muscle, which also have repeated Ig-like and FN-III
domains.
Length = 74
Score = 60.3 bits (147), Expect = 2e-11
Identities = 27/73 (36%), Positives = 40/73 (54%)
Query: 566 IKLKVSMAGMPPPTARWLHNGEPLTSGGRYEITHTDRYLNLRISDARRADRGEYQAHGVN 625
++L+V ++G P PT W +G+PL GR +I T +L I +A R+D G+Y N
Sbjct: 2 VRLEVPISGRPTPTVTWSKDGKPLKLSGRVQIETTASSTSLVIKNAERSDSGKYTLTLKN 61
Query: 626 SLGEDVASFLVTV 638
GE A+ V V
Sbjct: 62 PAGEKSATINVKV 74
Score = 50.7 bits (122), Expect = 5e-08
Identities = 24/64 (37%), Positives = 32/64 (50%)
Query: 469 KVEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIHQAALMDEGEIKCTAT 528
V V + G PTP V+W KDG + S R +I T ++L+I A D G+ T
Sbjct: 1 SVRLEVPISGRPTPTVTWSKDGKPLKLSGRVQIETTASSTSLVIKNAERSDSGKYTLTLK 60
Query: 529 NRAG 532
N AG
Sbjct: 61 NPAG 64
>gnl|CDD|143165 cd00096, Ig, Immunoglobulin domain. Ig: immunoglobulin (Ig) domain
found in the Ig superfamily. The Ig superfamily is a
heterogeneous group of proteins, built on a common fold
comprised of a sandwich of two beta sheets. Members of
this group are components of immunoglobulin, neuroglia,
cell surface glycoproteins, such as, T-cell receptors,
CD2, CD4, CD8, and membrane glycoproteins, such as,
butyrophilin and chondroitin sulfate proteoglycan core
protein. A predominant feature of most Ig domains is a
disulfide bridge connecting the two beta-sheets with a
tryptophan residue packed against the disulfide bond.
Length = 74
Score = 59.8 bits (144), Expect = 3e-11
Identities = 22/74 (29%), Positives = 29/74 (39%), Gaps = 4/74 (5%)
Query: 470 VEFTVQVEGIPTPKVSWYKDGFEI----FSSRRQRIVTDNDISTLIIHQAALMDEGEIKC 525
V T G P P ++W K+G + + R T + STL I L D G C
Sbjct: 1 VTLTCLASGPPPPTITWLKNGKPLPSSVLTRVRSSRGTSSGSSTLTISNVTLEDSGTYTC 60
Query: 526 TATNRAGHSITKAR 539
A+N AG
Sbjct: 61 VASNSAGTVSASVT 74
Score = 45.2 bits (106), Expect = 4e-06
Identities = 24/73 (32%), Positives = 31/73 (42%), Gaps = 4/73 (5%)
Query: 566 IKLKVSMAGMPPPTARWLHNGEPLTSGGRYEITHTDRYLN----LRISDARRADRGEYQA 621
+ L +G PPPT WL NG+PL S + + + L IS+ D G Y
Sbjct: 1 VTLTCLASGPPPPTITWLKNGKPLPSSVLTRVRSSRGTSSGSSTLTISNVTLEDSGTYTC 60
Query: 622 HGVNSLGEDVASF 634
NS G AS
Sbjct: 61 VASNSAGTVSASV 73
>gnl|CDD|200951 pfam00041, fn3, Fibronectin type III domain.
Length = 84
Score = 54.7 bits (132), Expect = 2e-09
Identities = 23/92 (25%), Positives = 34/92 (36%), Gaps = 12/92 (13%)
Query: 346 GPPGKPQLLPDSPSLDRDVFTIRWEPPEYDGGSPVLGYLVEHRRTGSP---YWVRASPHM 402
P + + T+ W PP G P+ GY VE+R +
Sbjct: 1 SAPTNLTVTDVTS----TSLTLSWSPPP--GNGPITGYEVEYRPVNGGEEWKEITVPGT- 53
Query: 403 VEDTELMVSGLEPGWRYQFRITAENVVGFSEP 434
T ++GL+PG Y+ R+ A N G P
Sbjct: 54 --TTSYTLTGLKPGTEYEVRVQAVNGAGEGPP 83
Score = 54.7 bits (132), Expect = 2e-09
Identities = 23/92 (25%), Positives = 34/92 (36%), Gaps = 12/92 (13%)
Query: 685 GPPGKPQLLPDSPSLDRDVFTIRWEPPEYDGGSPVLGYLVEHRRTGSP---YWVRASPHM 741
P + + T+ W PP G P+ GY VE+R +
Sbjct: 1 SAPTNLTVTDVTS----TSLTLSWSPPP--GNGPITGYEVEYRPVNGGEEWKEITVPGT- 53
Query: 742 VEDTELMVSGLEPGWRYQFRITAENVVGFSEP 773
T ++GL+PG Y+ R+ A N G P
Sbjct: 54 --TTSYTLTGLKPGTEYEVRVQAVNGAGEGPP 83
>gnl|CDD|197706 smart00408, IGc2, Immunoglobulin C-2 Type.
Length = 63
Score = 53.2 bits (128), Expect = 5e-09
Identities = 25/67 (37%), Positives = 30/67 (44%), Gaps = 4/67 (5%)
Query: 466 EDEKVEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIHQAALMDEGEIKC 525
E + V T EG P P ++W KDG + R V STL I +L D G C
Sbjct: 1 EGQSVTLTCPAEGNPVPNITWLKDGKPL--PESNRFVASG--STLTIKSVSLEDSGLYTC 56
Query: 526 TATNRAG 532
A N AG
Sbjct: 57 VAENSAG 63
Score = 49.3 bits (118), Expect = 1e-07
Identities = 19/67 (28%), Positives = 24/67 (35%), Gaps = 4/67 (5%)
Query: 562 MGEIIKLKVSMAGMPPPTARWLHNGEPLTSGGRYEITHTDRYLNLRISDARRADRGEYQA 621
G+ + L G P P WL +G+PL R + L I D G Y
Sbjct: 1 EGQSVTLTCPAEGNPVPNITWLKDGKPLPESNR--FVASGST--LTIKSVSLEDSGLYTC 56
Query: 622 HGVNSLG 628
NS G
Sbjct: 57 VAENSAG 63
>gnl|CDD|143201 cd05724, Ig2_Robo, Second immunoglobulin (Ig)-like domain in Robo
(roundabout) receptors. Ig2_Robo: domain similar to the
second immunoglobulin (Ig)-like domain in Robo
(roundabout) receptors. Robo receptors play a role in
the development of the central nervous system (CNS), and
are receptors of Slit protein. Slit is a repellant
secreted by the neural cells in the midline. Slit acts
through Robo to prevent most neurons from crossing the
midline from either side. Three mammalian Robo homologs
(robo1, -2, and -3), and three mammalian Slit homologs
(Slit-1,-2, -3), have been identified. Commissural
axons, which cross the midline, express low levels of
Robo; longitudinal axons, which avoid the midline,
express high levels of Robo. robo1, -2, and -3 are
expressed by commissural neurons in the vertebrate
spinal cord and Slits 1, -2, -3 are expressed at the
ventral midline. Robo-3 is a divergent member of the
Robo family which instead of being a positive regulator
of slit responsiveness, antagonizes slit responsiveness
in precrossing axons. The Slit-Robo interaction is
mediated by the second leucine-rich repeat (LRR) domain
of Slit and the two N-terminal Ig domains of Robo, Ig1
and Ig2. The primary Robo binding site for Slit2 has
been shown by surface plasmon resonance experiments and
mutational analysis to be the Ig1 domain, while the
Ig2 domain has been proposed to harbor a weak secondary
binding site.
Length = 86
Score = 52.8 bits (127), Expect = 1e-08
Identities = 29/65 (44%), Positives = 35/65 (53%), Gaps = 5/65 (7%)
Query: 478 GIPTPKVSWYKDGFEI-FSSRRQRIVTDNDISTLIIHQAALMDEGEIKCTATNRAGHSIT 536
G P P VSW KDG + + R RIV D + L+I +A DEG KC ATN G +
Sbjct: 23 GHPEPTVSWRKDGQPLNLDNERVRIVDDGN---LLIAEARKSDEGTYKCVATNMVGERES 79
Query: 537 K-ARL 540
ARL
Sbjct: 80 AAARL 84
Score = 39.3 bits (92), Expect = 7e-04
Identities = 21/57 (36%), Positives = 29/57 (50%), Gaps = 4/57 (7%)
Query: 574 GMPPPTARWLHNGEPLTSGG-RYEITHTDRYLNLRISDARRADRGEYQAHGVNSLGE 629
G P PT W +G+PL R I NL I++AR++D G Y+ N +GE
Sbjct: 23 GHPEPTVSWRKDGQPLNLDNERVRIVDDG---NLLIAEARKSDEGTYKCVATNMVGE 76
>gnl|CDD|143202 cd05725, Ig3_Robo, Third immunoglobulin (Ig)-like domain in Robo
(roundabout) receptors. Ig3_Robo: domain similar to the
third immunoglobulin (Ig)-like domain in Robo
(roundabout) receptors. Robo receptors play a role in
the development of the central nervous system (CNS), and
are receptors of Slit protein. Slit is a repellant
secreted by the neural cells in the midline. Slit acts
through Robo to prevent most neurons from crossing the
midline from either side. Three mammalian Robo homologs
(robo1, -2, and -3), and three mammalian Slit homologs
(Slit-1,-2, -3), have been identified. Commissural
axons, which cross the midline, express low levels of
Robo; longitudinal axons, which avoid the midline,
express high levels of Robo. robo1, -2, and -3 are
expressed by commissural neurons in the vertebrate
spinal cord and Slits 1, -2, -3 are expressed at the
ventral midline. Robo-3 is a divergent member of the
Robo family which instead of being a positive regulator
of slit responsiveness, antagonizes slit responsiveness
in precrossing axons. The Slit-Robo interaction is
mediated by the second leucine-rich repeat (LRR) domain
of Slit and the two N-terminal Ig domains of Robo, Ig1
and Ig2. The primary Robo binding site for Slit2 has
been shown by surface plasmon resonance experiments and
mutational analysis to be the Ig1 domain, while the
Ig2 domain has been proposed to harbor a weak secondary
binding site.
Length = 69
Score = 48.2 bits (115), Expect = 4e-07
Identities = 25/72 (34%), Positives = 30/72 (41%), Gaps = 4/72 (5%)
Query: 470 VEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIHQAALMDEGEIKCTATN 529
VEF +V G P P V W K+ E+ R I+ D +L I DEG C A N
Sbjct: 1 VEFQCEVGGDPVPTVLWRKEDGEL-PKGRAEILDDK---SLKIRNVTAGDEGSYTCEAEN 56
Query: 530 RAGHSITKARLR 541
G A L
Sbjct: 57 MVGKIEASASLT 68
Score = 33.1 bits (76), Expect = 0.083
Identities = 22/71 (30%), Positives = 31/71 (43%), Gaps = 4/71 (5%)
Query: 568 LKVSMAGMPPPTARWLHNGEPLTSGGRYEITHTDRYLNLRISDARRADRGEYQAHGVNSL 627
+ + G P PT W L G R EI D+ +L+I + D G Y N +
Sbjct: 3 FQCEVGGDPVPTVLWRKEDGELPKG-RAEIL-DDK--SLKIRNVTAGDEGSYTCEAENMV 58
Query: 628 GEDVASFLVTV 638
G+ AS +TV
Sbjct: 59 GKIEASASLTV 69
>gnl|CDD|143302 cd05894, Ig_C5_MyBP-C, C5 immunoglobulin (Ig) domain of cardiac
myosin binding protein C (MyBP-C). Ig_C5_MyBP_C: the
C5 immunoglobulin (Ig) domain of cardiac myosin binding
protein C (MyBP-C). MyBP_C consists of repeated domains,
Ig and fibronectin type 3, and various linkers. Three
isoforms of MYBP_C exist and are included in this group:
cardiac(c), and fast and slow skeletal muscle (s)
MyBP_C. cMYBP_C has insertions between and inside
domains and an additional cardiac-specific Ig domain at
the N-terminus. For cMYBP_C an interaction has been
demonstrated between this C5 domain and the Ig C8
domain.
Length = 86
Score = 48.3 bits (115), Expect = 5e-07
Identities = 25/77 (32%), Positives = 33/77 (42%), Gaps = 1/77 (1%)
Query: 563 GEIIKLKVSMAGMPPPTARWLHNGEPLTSG-GRYEITHTDRYLNLRISDARRADRGEYQA 621
G ++L V ++G P PT W + T GR + + I A R D G Y
Sbjct: 10 GNKLRLDVPISGEPAPTVTWSRGDKAFTETEGRVRVESYKDLSSFVIEGAEREDEGVYTI 69
Query: 622 HGVNSLGEDVASFLVTV 638
N +GED AS V V
Sbjct: 70 TVTNPVGEDHASLFVKV 86
Score = 39.8 bits (93), Expect = 5e-04
Identities = 22/66 (33%), Positives = 33/66 (50%), Gaps = 3/66 (4%)
Query: 469 KVEFTVQVEGIPTPKVSWYKDGFEIF--SSRRQRIVTDNDISTLIIHQAALMDEGEIKCT 526
K+ V + G P P V+W + G + F + R R+ + D+S+ +I A DEG T
Sbjct: 12 KLRLDVPISGEPAPTVTWSR-GDKAFTETEGRVRVESYKDLSSFVIEGAEREDEGVYTIT 70
Query: 527 ATNRAG 532
TN G
Sbjct: 71 VTNPVG 76
>gnl|CDD|143173 cd04972, Ig_TrkABC_d4, Fourth domain (immunoglobulin-like) of Trk
receptors TrkA, TrkB and TrkC. TrkABC_d4: the fourth
domain of Trk receptors TrkA, TrkB and TrkC; this is an
immunoglobulin (Ig)-like domain which binds to
neurotrophin. The Trk family of receptors are tyrosine
kinase receptors. They are activated by dimerization,
leading to autophosphorylation of intracellular tyrosine
residues, and triggering the signal transduction
pathway. TrkA, TrkB, and TrkC share significant sequence
homology and domain organization. The first three
domains are leucine-rich domains. The fourth and fifth
domains are Ig-like domains playing a part in ligand
binding. TrkA, B, and C mediate the trophic effects of the
neurotrophin Nerve growth factor (NGF) family. TrkA is
recognized by NGF. TrkB is recognized by brain-derived
neurotrophic factor (BDNF) and neurotrophin (NT)-4. TrkC
is recognized by NT-3. NT-3 is promiscuous as in some
cell systems it activates TrkA and TrkB receptors. TrkA
is a receptor found in all major NGF targets, including
the sympathetic, trigeminal, and dorsal root ganglia,
cholinergic neurons of the basal forebrain and the
striatum. TrkB transcripts are found throughout multiple
structures of the central and peripheral nervous
systems. The TrkC gene is expressed throughout the
mammalian nervous system.
Length = 90
Score = 47.5 bits (113), Expect = 1e-06
Identities = 20/80 (25%), Positives = 29/80 (36%)
Query: 461 DTTALEDEKVEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIHQAALMDE 520
T E EG P PKV W G + +R + T DI L + +
Sbjct: 9 ATVVYEGGTATIRCTAEGSPLPKVEWIIAGLIVIQTRTDTLETTVDIYNLQLSNITSETQ 68
Query: 521 GEIKCTATNRAGHSITKARL 540
+ CTA N G + ++
Sbjct: 69 TTVTCTAENPVGQANVSVQV 88
>gnl|CDD|143220 cd05743, Ig_Perlecan_D2_like, Immunoglobulin (Ig)-like domain II
(D2) of the human basement membrane heparan sulfate
proteoglycan perlecan, also known as HSPG2.
Ig_Perlecan_D2_like: the immunoglobulin (Ig)-like domain
II (D2) of the human basement membrane heparan sulfate
proteoglycan perlecan, also known as HSPG2. Perlecan
consists of five domains. Domain I has three putative
heparan sulfate attachment sites; domain II has four LDL
receptor-like repeats, and one Ig-like repeat; domain
III resembles the short arm of laminin chains; domain IV
has multiple Ig-like repeats (21 repeats in human
perlecan); and domain V resembles the globular G domain
of the laminin A chain and internal repeats of EGF.
Perlecan may participate in a variety of biological
functions including cell binding, LDL-metabolism,
basement membrane assembly and selective permeability,
calcium binding, and growth- and neurite-promoting
activities.
Length = 78
Score = 47.1 bits (112), Expect = 1e-06
Identities = 22/65 (33%), Positives = 30/65 (46%)
Query: 468 EKVEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIHQAALMDEGEIKCTA 527
E VEFT G+PTP ++W + + S R I ++ TL I D+G C A
Sbjct: 2 ETVEFTCVATGVPTPIINWRLNWGHVPDSARVSITSEGGYGTLTIRDVKESDQGAYTCEA 61
Query: 528 TNRAG 532
N G
Sbjct: 62 INTRG 66
Score = 39.0 bits (91), Expect = 7e-04
Identities = 19/66 (28%), Positives = 28/66 (42%)
Query: 563 GEIIKLKVSMAGMPPPTARWLHNGEPLTSGGRYEITHTDRYLNLRISDARRADRGEYQAH 622
GE ++ G+P P W N + R IT Y L I D + +D+G Y
Sbjct: 1 GETVEFTCVATGVPTPIINWRLNWGHVPDSARVSITSEGGYGTLTIRDVKESDQGAYTCE 60
Query: 623 GVNSLG 628
+N+ G
Sbjct: 61 AINTRG 66
>gnl|CDD|143224 cd05747, Ig5_Titin_like, M5, fifth immunoglobulin (Ig)-like domain
of human titin C terminus and similar proteins.
Ig5_Titin_like: domain similar to the M5, fifth
immunoglobulin (Ig)-like domain from the human titin C
terminus. Titin (also called connectin) is a fibrous
sarcomeric protein specifically found in vertebrate
striated muscle. Titin is gigantic; depending on isoform
composition it ranges from 2970 to 3700 kDa, and is of a
length that spans half a sarcomere. Titin largely
consists of multiple repeats of Ig-like and fibronectin
type 3 (FN-III)-like domains. Titin connects the ends of
myosin thick filaments to Z disks and extends along the
thick filament to the H zone, and appears to function
similar to an elastic band, keeping the myosin filaments
centered in the sarcomere during muscle contraction or
stretching.
Length = 92
Score = 47.4 bits (112), Expect = 1e-06
Identities = 24/75 (32%), Positives = 36/75 (48%)
Query: 563 GEIIKLKVSMAGMPPPTARWLHNGEPLTSGGRYEITHTDRYLNLRISDARRADRGEYQAH 622
GE + + G P PT W+ G+ + S R++IT T+ IS + +D G Y
Sbjct: 18 GESARFSCDVDGEPAPTVTWMREGQIIVSSQRHQITSTEYKSTFEISKVQMSDEGNYTVV 77
Query: 623 GVNSLGEDVASFLVT 637
NS G+ A F +T
Sbjct: 78 VENSEGKQEAQFTLT 92
Score = 45.4 bits (107), Expect = 7e-06
Identities = 24/70 (34%), Positives = 34/70 (48%)
Query: 463 TALEDEKVEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIHQAALMDEGE 522
T E E F+ V+G P P V+W ++G I SS+R +I + ST I + + DEG
Sbjct: 14 TVSEGESARFSCDVDGEPAPTVTWMREGQIIVSSQRHQITSTEYKSTFEISKVQMSDEGN 73
Query: 523 IKCTATNRAG 532
N G
Sbjct: 74 YTVVVENSEG 83
>gnl|CDD|143199 cd05722, Ig1_Neogenin, First immunoglobulin (Ig)-like domain in
neogenin and similar proteins. Ig1_Neogenin: first
immunoglobulin (Ig)-like domain in neogenin and related
proteins. Neogenin is a cell surface protein which is
expressed in the developing nervous system of vertebrate
embryos in the growing nerve cells. It is also expressed
in other embryonic tissues, and may play a general role
in developmental processes such as cell migration,
cell-cell recognition, and tissue growth regulation.
Included in this group is the tumor suppressor protein
DCC, which is deleted in colorectal carcinoma. DCC and
neogenin each have four Ig-like domains followed by six
fibronectin type III domains, a transmembrane domain,
and an intracellular domain.
Length = 95
Score = 47.1 bits (112), Expect = 1e-06
Identities = 27/95 (28%), Positives = 36/95 (37%), Gaps = 4/95 (4%)
Query: 454 SFIRALHDTTALEDEKVEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIH 513
F+ D A+ V EG P PK+ W KDG + +R + S LI
Sbjct: 1 WFLSEPSDIVAVRGGPVVLNCSAEGEPPPKIEWKKDGVLLNLVSDERRQQLPNGSLLITS 60
Query: 514 ----QAALMDEGEIKCTATNRAGHSITKARLRLEA 544
+ DEG +C A N + SI RL
Sbjct: 61 VVHSKHNKPDEGFYQCVAQNDSLGSIVSRTARLTV 95
>gnl|CDD|143208 cd05731, Ig3_L1-CAM_like, Third immunoglobulin (Ig)-like domain of
the L1 cell adhesion molecule (CAM). Ig3_L1-CAM_like:
domain similar to the third immunoglobulin (Ig)-like
domain of the L1 cell adhesion molecule (CAM). L1
belongs to the L1 subfamily of cell adhesion molecules
(CAMs) and is comprised of an extracellular region
having six Ig-like domains and five fibronectin type III
domains, a transmembrane region and an intracellular
domain. L1 is primarily expressed in the nervous system
and is involved in its development and function. L1 is
associated with an X-linked recessive disorder, X-linked
hydrocephalus, MASA syndrome, or spastic paraplegia type
1, that involves abnormalities of axonal growth. This
group also contains the chicken neuron-glia cell
adhesion molecule, Ng-CAM and human neurofascin.
Length = 71
Score = 46.2 bits (110), Expect = 2e-06
Identities = 25/63 (39%), Positives = 37/63 (58%), Gaps = 6/63 (9%)
Query: 477 EGIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIHQAALMDEGEIKCTATNRAG---H 533
EG+PTP++SW K G E+ + R +N TL I + D+GE +CTA+N G H
Sbjct: 8 EGLPTPEISWIKIGGELPAD---RTKFENFNKTLKIDNVSEEDDGEYRCTASNSLGSARH 64
Query: 534 SIT 536
+I+
Sbjct: 65 TIS 67
Score = 33.9 bits (78), Expect = 0.035
Identities = 19/66 (28%), Positives = 27/66 (40%), Gaps = 3/66 (4%)
Query: 573 AGMPPPTARWLHNGEPLTSGGRYEITHTDRYLNLRISDARRADRGEYQAHGVNSLGEDVA 632
G+P P W+ G L + + L+I + D GEY+ NSLG
Sbjct: 8 EGLPTPEISWIKIGGELPAD---RTKFENFNKTLKIDNVSEEDDGEYRCTASNSLGSARH 64
Query: 633 SFLVTV 638
+ VTV
Sbjct: 65 TISVTV 70
>gnl|CDD|143222 cd05745, Ig3_Peroxidasin, Third immunoglobulin (Ig)-like domain of
peroxidasin. Ig3_Peroxidasin: the third immunoglobulin
(Ig)-like domain in peroxidasin. Peroxidasin has a
peroxidase domain and interacting extracellular motifs
containing four Ig-like domains. It has been suggested
that peroxidasin is secreted and has functions related
to the stabilization of the extracellular matrix. It may
play a part in various other important processes such as
removal and destruction of cells which have undergone
programmed cell death, and protection of the organism
against non-self.
Length = 74
Score = 46.5 bits (110), Expect = 2e-06
Identities = 25/75 (33%), Positives = 41/75 (54%), Gaps = 3/75 (4%)
Query: 466 EDEKVEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIHQAALMDEGEIKC 525
E + V+F + +G P P ++W K G ++ RR +++ TL I + AL D+G+ +C
Sbjct: 1 EGQTVDFLCEAQGYPQPVIAWTKGGSQLSVDRRHLVLSS---GTLRISRVALHDQGQYEC 57
Query: 526 TATNRAGHSITKARL 540
A N G T A+L
Sbjct: 58 QAVNIVGSQRTVAQL 72
Score = 36.8 bits (85), Expect = 0.004
Identities = 20/76 (26%), Positives = 31/76 (40%), Gaps = 3/76 (3%)
Query: 563 GEIIKLKVSMAGMPPPTARWLHNGEPLTSGGRYEITHTDRYLNLRISDARRADRGEYQAH 622
G+ + G P P W G L+ R+ + + LRIS D+G+Y+
Sbjct: 2 GQTVDFLCEAQGYPQPVIAWTKGGSQLSVDRRHLVLSSG---TLRISRVALHDQGQYECQ 58
Query: 623 GVNSLGEDVASFLVTV 638
VN +G +TV
Sbjct: 59 AVNIVGSQRTVAQLTV 74
>gnl|CDD|143207 cd05730, Ig3_NCAM-1_like, Third immunoglobulin (Ig)-like domain of
Neural Cell Adhesion Molecule NCAM-1 (NCAM).
Ig3_NCAM-1_like: domain similar to the third
immunoglobulin (Ig)-like domain of Neural Cell Adhesion
Molecule NCAM-1 (NCAM). NCAM plays important roles in
the development and regeneration of the central nervous
system, in synaptogenesis and neural migration. NCAM
mediates cell-cell and cell-substratum recognition and
adhesion via homophilic (NCAM-NCAM), and heterophilic
(NCAM-non-NCAM), interactions. NCAM is expressed as
three major isoforms having different intracellular
extensions. The extracellular portion of NCAM has five
N-terminal Ig-like domains and two fibronectin type III
domains. The double zipper adhesion complex model for
NCAM homophilic binding involves Ig1, Ig2, and Ig3. By
this model, Ig1 and Ig2 mediate dimerization of NCAM
molecules situated on the same cell surface (cis
interactions), and Ig3 domains mediate interactions
between NCAM molecules expressed on the surface of
opposing cells (trans interactions), through binding to
the Ig1 and Ig2 domains. The adhesive ability of NCAM is
modulated by the addition of polysialic acid chains to
the fifth Ig-like domain.
Length = 95
Score = 45.7 bits (108), Expect = 5e-06
Identities = 27/85 (31%), Positives = 37/85 (43%), Gaps = 4/85 (4%)
Query: 453 PSFIRALHDT---TALEDEKVEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRIVTDNDIST 509
P IRA TA + V +G P P ++W KDG E S ++ + D S
Sbjct: 1 PPTIRARQSEVNATANLGQSVTLACDADGFPEPTMTWTKDG-EPIESGEEKYSFNEDGSE 59
Query: 510 LIIHQAALMDEGEIKCTATNRAGHS 534
+ I +DE E C A N+AG
Sbjct: 60 MTILDVDKLDEAEYTCIAENKAGEQ 84
Score = 43.4 bits (102), Expect = 4e-05
Identities = 30/94 (31%), Positives = 40/94 (42%), Gaps = 2/94 (2%)
Query: 545 PPTIRLPKQYEDGLLFEMGEIIKLKVSMAGMPPPTARWLHNGEPLTSGGRYEITHTDRYL 604
PPTIR +Q E +G+ + L G P PT W +GEP+ SG + D
Sbjct: 1 PPTIRA-RQSEVNATANLGQSVTLACDADGFPEPTMTWTKDGEPIESGEEKYSFNEDGS- 58
Query: 605 NLRISDARRADRGEYQAHGVNSLGEDVASFLVTV 638
+ I D + D EY N GE A + V
Sbjct: 59 EMTILDVDKLDEAEYTCIAENKAGEQEAEIHLKV 92
>gnl|CDD|143240 cd05763, Ig_1, Subgroup of the immunoglobulin (Ig) superfamily.
Ig_1: subgroup of the immunoglobulin (Ig) domain found
in the Ig superfamily. The Ig superfamily is a
heterogeneous group of proteins, built on a common fold
comprised of a sandwich of two beta sheets. Members of
the Ig superfamily are components of immunoglobulin,
neuroglia, cell surface glycoproteins, such as T-cell
receptors, CD2, CD4, CD8, and membrane glycoproteins,
such as butyrophilin and chondroitin sulfate
proteoglycan core protein. A predominant feature of most
Ig domains is a disulfide bridge connecting the two
beta-sheets with a tryptophan residue packed against the
disulfide bond.
Length = 75
Score = 44.9 bits (106), Expect = 6e-06
Identities = 23/67 (34%), Positives = 34/67 (50%), Gaps = 5/67 (7%)
Query: 477 EGIPTPKVSWYKDGFEIFSSRRQR---IVTDNDISTLIIHQAALMDEGEIKCTATNRAGH 533
G PTP+++W KDG F + R+R ++ ++D+ I + D G CTA N AG
Sbjct: 8 TGHPTPQIAWQKDGGTDFPAARERRMHVMPEDDV--FFIVDVKIEDTGVYSCTAQNTAGS 65
Query: 534 SITKARL 540
A L
Sbjct: 66 ISANATL 72
>gnl|CDD|212460 cd05723, Ig4_Neogenin, Fourth immunoglobulin (Ig)-like domain in
neogenin and similar proteins. Ig4_Neogenin: fourth
immunoglobulin (Ig)-like domain in neogenin and related
proteins. Neogenin is a cell surface protein which is
expressed in the developing nervous system of vertebrate
embryos in the growing nerve cells. It is also expressed
in other embryonic tissues, and may play a general role
in developmental processes such as cell migration,
cell-cell recognition, and tissue growth regulation.
Included in this group is the tumor suppressor protein
DCC, which is deleted in colorectal carcinoma. DCC and
neogenin each have four Ig-like domains followed by six
fibronectin type III domains, a transmembrane domain,
and an intracellular domain.
Length = 71
Score = 44.6 bits (105), Expect = 6e-06
Identities = 23/71 (32%), Positives = 38/71 (53%), Gaps = 3/71 (4%)
Query: 470 VEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIHQAALMDEGEIKCTATN 529
+ F +V G PTP V W K+G + S +IV ++++ L + ++ DEG +C A N
Sbjct: 2 IVFECEVTGKPTPTVKWVKNGDMVIPSDYFKIVKEHNLQVLGLVKS---DEGFYQCIAEN 58
Query: 530 RAGHSITKARL 540
G+ A+L
Sbjct: 59 DVGNVQAGAQL 69
Score = 34.2 bits (78), Expect = 0.038
Identities = 18/68 (26%), Positives = 31/68 (45%), Gaps = 3/68 (4%)
Query: 566 IKLKVSMAGMPPPTARWLHNGEPLTSGGRYEITHTDRYLNLRISDARRADRGEYQAHGVN 625
I + + G P PT +W+ NG+ + ++I NL++ ++D G YQ N
Sbjct: 2 IVFECEVTGKPTPTVKWVKNGDMVIPSDYFKIVKEH---NLQVLGLVKSDEGFYQCIAEN 58
Query: 626 SLGEDVAS 633
+G A
Sbjct: 59 DVGNVQAG 66
>gnl|CDD|143209 cd05732, Ig5_NCAM-1_like, Fifth immunoglobulin (Ig)-like domain of
Neural Cell Adhesion Molecule NCAM-1 (NCAM) and similar
proteins. Ig5_NCAM-1 like: domain similar to the fifth
immunoglobulin (Ig)-like domain of Neural Cell Adhesion
Molecule NCAM-1 (NCAM). NCAM plays important roles in
the development and regeneration of the central nervous
system, in synaptogenesis and neural migration. NCAM
mediates cell-cell and cell-substratum recognition and
adhesion via homophilic (NCAM-NCAM), and heterophilic
(NCAM-non-NCAM), interactions. NCAM is expressed as
three major isoforms having different intracellular
extensions. The extracellular portion of NCAM has five
N-terminal Ig-like domains and two fibronectin type III
domains. The double zipper adhesion complex model for
NCAM homophilic binding involves Ig1, Ig2, and Ig3. By
this model, Ig1 and Ig2 mediate dimerization of NCAM
molecules situated on the same cell surface (cis
interactions), and Ig3 domains mediate interactions
between NCAM molecules expressed on the surface of
opposing cells (trans interactions), through binding to
the Ig1 and Ig2 domains. The adhesive ability of NCAM is
modulated by the addition of polysialic acid chains to
the fifth Ig-like domain. Also included in this group is
NCAM-2 (also known as OCAM/mamFas II and RNCAM). NCAM-2
is differentially expressed in the developing and mature
olfactory epithelium (OE).
Length = 96
Score = 44.1 bits (104), Expect = 2e-05
Identities = 27/83 (32%), Positives = 40/83 (48%), Gaps = 7/83 (8%)
Query: 456 IRALHDTTALEDEKVEFTVQVEGIPTPKVSWYKDGFEIFSSRRQ----RIVTDNDI--ST 509
I L + TA+E E++ T + EG P P+++W + FS + RIV S+
Sbjct: 5 ITYLENQTAVELEQITLTCEAEGDPIPEITW-RRATRNFSEGDKSLDGRIVVRGHARVSS 63
Query: 510 LIIHQAALMDEGEIKCTATNRAG 532
L + L D G C A+NR G
Sbjct: 64 LTLKDVQLTDAGRYDCEASNRIG 86
>gnl|CDD|143205 cd05728, Ig4_Contactin-2-like, Fourth Ig domain of the neural cell
adhesion molecule contactin-2 and similar proteins.
Ig4_Contactin-2-like: fourth Ig domain of the neural
cell adhesion molecule contactin-2. Contactins are
comprised of six Ig domains followed by four fibronectin
type III (FnIII) domains anchored to the membrane by
glycosylphosphatidylinositol. Contactin-2 (aliases
TAG-1, axonin-1) facilitates cell adhesion by homophilic
binding between molecules in apposed membranes. The
first four Ig domains form the intermolecular binding
fragment which arranges as a compact U-shaped module by
contacts between Ig domains 1 and 4, and domains 2 and
3. It has been proposed that a linear zipper-like array
forms, from contactin-2 molecules alternately provided
by the two apposed membranes.
Length = 85
Score = 43.4 bits (102), Expect = 3e-05
Identities = 26/66 (39%), Positives = 32/66 (48%), Gaps = 4/66 (6%)
Query: 573 AGMPPPTARWLHNGEPLTSGGRYEITHTDRYLNLRISDARRADRGEYQAHGVNSLGEDVA 632
+G P P RWL NG+PL S R E+ D LRI+ +D G YQ N G A
Sbjct: 24 SGNPRPAYRWLKNGQPLASENRIEVEAGD----LRITKLSLSDSGMYQCVAENKHGTIYA 79
Query: 633 SFLVTV 638
S + V
Sbjct: 80 SAELAV 85
Score = 34.5 bits (79), Expect = 0.033
Identities = 20/69 (28%), Positives = 29/69 (42%), Gaps = 12/69 (17%)
Query: 478 GIPTPKVSWYKDGFEIFSSRRQRIVTDNDI----STLIIHQAALMDEGEIKCTATNRAGH 533
G P P W K+G Q + ++N I L I + +L D G +C A N+ G
Sbjct: 25 GNPRPAYRWLKNG--------QPLASENRIEVEAGDLRITKLSLSDSGMYQCVAENKHGT 76
Query: 534 SITKARLRL 542
A L +
Sbjct: 77 IYASAELAV 85
>gnl|CDD|143178 cd04977, Ig1_NCAM-1_like, First immunoglobulin (Ig)-like domain of
neural cell adhesion molecule NCAM-1 and similar
proteins. Ig1_NCAM-1 like: first immunoglobulin
(Ig)-like domain of neural cell adhesion molecule
NCAM-1. NCAM-1 plays important roles in the development
and regeneration of the central nervous system, in
synaptogenesis and neural migration. NCAM mediates
cell-cell and cell-substratum recognition and adhesion
via homophilic (NCAM-NCAM) and heterophilic
(NCAM-non-NCAM) interactions. NCAM is expressed as three
major isoforms having different intracellular
extensions. The extracellular portion of NCAM has five
N-terminal Ig-like domains and two fibronectin type III
domains. The double zipper adhesion complex model for
NCAM homophilic binding involves the Ig1, Ig2, and Ig3
domains. By this model, Ig1 and Ig2 mediate dimerization
of NCAM molecules situated on the same cell surface (cis
interactions), and Ig3 domains mediate interactions
between NCAM molecules expressed on the surface of
opposing cells (trans interactions), through binding to
the Ig1 and Ig2 domains. The adhesive ability of NCAM is
modulated by the addition of polysialic acid chains to
the fifth Ig-like domain. Also included in this group is
NCAM-2 (also known as OCAM/mamFas II and RNCAM). NCAM-2
is differentially expressed in the developing and mature
olfactory epithelium (OE).
Length = 92
Score = 42.9 bits (101), Expect = 4e-05
Identities = 24/63 (38%), Positives = 32/63 (50%), Gaps = 3/63 (4%)
Query: 472 FTVQVEGIPTPKVSWYKDGFEIFSSRRQRIVTDND--ISTLIIHQAALMDEGEIKCTATN 529
F QV G P +SW+ E +++Q V ND STL I+ A + D G KC AT+
Sbjct: 20 FLCQVIGEPK-DISWFSPNGEKLVTQQQISVVQNDDVRSTLTIYNANIEDAGIYKCVATD 78
Query: 530 RAG 532
G
Sbjct: 79 AKG 81
>gnl|CDD|222457 pfam13927, Ig_3, Immunoglobulin domain. This family contains
immunoglobulin-like domains.
Length = 74
Score = 41.6 bits (97), Expect = 9e-05
Identities = 20/75 (26%), Positives = 27/75 (36%), Gaps = 5/75 (6%)
Query: 456 IRALHDTTALEDEKVEFTVQVEGIP-TPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIHQ 514
I + V T EG P P +SWY++G S + STL +
Sbjct: 4 ITVSPSPSVTSGGGVTLTCSAEGGPPPPTISWYRNG----SISGGSGGLGSSGSTLTLSS 59
Query: 515 AALMDEGEIKCTATN 529
D G C A+N
Sbjct: 60 VTSEDSGTYTCVASN 74
>gnl|CDD|143223 cd05746, Ig4_Peroxidasin, Fourth immunoglobulin (Ig)-like domain of
peroxidasin. Ig4_Peroxidasin: the fourth immunoglobulin
(Ig)-like domain in peroxidasin. Peroxidasin has a
peroxidase domain and interacting extracellular motifs
containing four Ig-like domains. It has been suggested
that peroxidasin is secreted, and has functions related
to the stabilization of the extracellular matrix. It may
play a part in various other important processes such as
removal and destruction of cells that have undergone
programmed cell death, and protection of the organism
against non-self.
Length = 69
Score = 41.4 bits (97), Expect = 9e-05
Identities = 19/71 (26%), Positives = 32/71 (45%), Gaps = 3/71 (4%)
Query: 470 VEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIHQAALMDEGEIKCTATN 529
V+ +G P P ++W KDG ++ S + I + L I + D+G +C A N
Sbjct: 1 VQIPCSAQGDPEPTITWNKDGVQVTESGKFHISPE---GYLAIRDVGVADQGRYECVARN 57
Query: 530 RAGHSITKARL 540
G++ L
Sbjct: 58 TIGYASVSMVL 68
Score = 33.7 bits (77), Expect = 0.045
Identities = 21/72 (29%), Positives = 38/72 (52%), Gaps = 3/72 (4%)
Query: 566 IKLKVSMAGMPPPTARWLHNGEPLTSGGRYEITHTDRYLNLRISDARRADRGEYQAHGVN 625
+++ S G P PT W +G +T G++ I+ + YL +R D AD+G Y+ N
Sbjct: 1 VQIPCSAQGDPEPTITWNKDGVQVTESGKFHISP-EGYLAIR--DVGVADQGRYECVARN 57
Query: 626 SLGEDVASFLVT 637
++G S +++
Sbjct: 58 TIGYASVSMVLS 69
>gnl|CDD|143227 cd05750, Ig_Pro_neuregulin, Immunoglobulin (Ig)-like domain in
neuregulins (NRGs). Ig_Pro_neuregulin: immunoglobulin
(Ig)-like domain in neuregulins (NRGs). NRGs are
signaling molecules, which participate in cell-cell
interactions in the nervous system, breast, heart, and
other organ systems, and are implicated in the pathology
of diseases including schizophrenia, multiple sclerosis,
and breast cancer. There are four members of the
neuregulin gene family (NRG1, -2, -3, and -4). The NRG-1
protein binds to and activates the tyrosine kinase
receptors ErbB3 and ErbB4, initiating signaling
cascades. The other NRG proteins bind one or the other
or both of these ErbBs. NRG-1 has multiple functions;
for example, in the brain it regulates various processes
such as radial glia formation and neuronal migration,
dendritic development, and expression of
neurotransmitter receptors; in the peripheral nervous
system NRG-1 regulates processes such as target cell
differentiation, and Schwann cell survival. There are
many NRG-1 isoforms, which arise from the alternative
splicing of mRNA. Less is known of the functions of the
other NRGs. NRG-2 and -3 are expressed predominantly in
the nervous system. NRG-2 is expressed by motor neurons
and terminal Schwann cells, and is concentrated near
synaptic sites and may be a signal that regulates
synaptic differentiation. NRG-4 has been shown to direct
pancreatic islet cell development towards the delta-cell
lineage.
Length = 75
Score = 41.4 bits (97), Expect = 9e-05
Identities = 19/64 (29%), Positives = 29/64 (45%), Gaps = 3/64 (4%)
Query: 480 PTPKVSWYKDGFEIFSSRRQRIVT---DNDISTLIIHQAALMDEGEIKCTATNRAGHSIT 536
P+ + W+KDG E+ + R + S L I++A L D GE C N G+
Sbjct: 12 PSLRFKWFKDGKELNRKNKPRNIKIRNKKKNSELQINKAKLADSGEYTCVVENILGNDTV 71
Query: 537 KARL 540
A +
Sbjct: 72 TANV 75
Score = 37.5 bits (87), Expect = 0.003
Identities = 17/64 (26%), Positives = 28/64 (43%), Gaps = 3/64 (4%)
Query: 573 AGMPPPTARWLHNGEPLTSGGRYE---ITHTDRYLNLRISDARRADRGEYQAHGVNSLGE 629
+ P +W +G+ L + I + + L+I+ A+ AD GEY N LG
Sbjct: 9 SEYPSLRFKWFKDGKELNRKNKPRNIKIRNKKKNSELQINKAKLADSGEYTCVVENILGN 68
Query: 630 DVAS 633
D +
Sbjct: 69 DTVT 72
>gnl|CDD|143273 cd05865, Ig1_NCAM-1, First immunoglobulin (Ig)-like domain of
neural cell adhesion molecule NCAM-1. Ig1_NCAM-1: first
immunoglobulin (Ig)-like domain of neural cell adhesion
molecule NCAM-1. NCAM-1 plays important roles in the
development and regeneration of the central nervous
system, in synaptogenesis and neural migration. NCAM
mediates cell-cell and cell-substratum recognition and
adhesion via homophilic (NCAM-NCAM) and heterophilic
(NCAM-non-NCAM) interactions. NCAM is expressed as three
major isoforms having different intracellular
extensions. The extracellular portion of NCAM has five
N-terminal Ig-like domains and two fibronectin type III
domains. The double zipper adhesion complex model for
NCAM homophilic binding involves the Ig1, Ig2, and Ig3
domains. By this model, Ig1 and Ig2 mediate dimerization
of NCAM molecules situated on the same cell surface (cis
interactions), and Ig3 domains mediate interactions
between NCAM molecules expressed on the surface of
opposing cells (trans interactions), through binding to
the Ig1 and Ig2 domains. The adhesive ability of NCAM is
modulated by the addition of polysialic acid chains to
the fifth Ig-like domain.
Length = 96
Score = 41.2 bits (96), Expect = 2e-04
Identities = 22/65 (33%), Positives = 30/65 (46%), Gaps = 4/65 (6%)
Query: 472 FTVQVEGIPTPK-VSWYKDGFEIFSSRRQRIV---TDNDISTLIIHQAALMDEGEIKCTA 527
F QV G K +SW+ E + +QRI D+ STL I+ A + D G KC
Sbjct: 20 FLCQVAGEAKDKDISWFSPNGEKLTPNQQRISVVRNDDYSSTLTIYNANIDDAGIYKCVV 79
Query: 528 TNRAG 532
+N
Sbjct: 80 SNEDE 84
>gnl|CDD|143239 cd05762, Ig8_MLCK, Eighth immunoglobulin (Ig)-like domain of human
myosin light-chain kinase (MLCK). Ig8_MLCK: the eighth
immunoglobulin (Ig)-like domain of human myosin
light-chain kinase (MLCK). MLCK is a key regulator of
different forms of cell motility involving actin and
myosin II. Agonist stimulation of smooth muscle cells
increases cytosolic Ca2+, which binds calmodulin. This
Ca2+-calmodulin complex in turn binds to and activates
MLCK. Activated MLCK leads to the phosphorylation of the
20 kDa myosin regulatory light chain (RLC) of myosin II
and the stimulation of actin-activated myosin MgATPase
activity. MLCK is widely present in vertebrate tissues;
it phosphorylates the 20 kDa RLC of both smooth and
nonmuscle myosin II. Phosphorylation leads to the
activation of the myosin motor domain and altered
structural properties of myosin II. In smooth muscle,
MLCK is involved in initiating contraction. In
nonmuscle cells, MLCK may participate in cell division
and cell motility; it has been suggested MLCK plays a
role in cardiomyocyte differentiation and contraction
through regulation of nonmuscle myosin II.
Length = 98
Score = 41.1 bits (96), Expect = 2e-04
Identities = 25/93 (26%), Positives = 43/93 (46%)
Query: 553 QYEDGLLFEMGEIIKLKVSMAGMPPPTARWLHNGEPLTSGGRYEITHTDRYLNLRISDAR 612
Q+ + + GE ++L + G P T W+ + + G +I +T+ L I++ +
Sbjct: 5 QFPEDMKVRAGESVELFCKVTGTQPITCTWMKFRKQIQEGEGIKIENTENSSKLTITEGQ 64
Query: 613 RADRGEYQAHGVNSLGEDVASFLVTVTDRPLPP 645
+ G Y N LG A +TV D+P PP
Sbjct: 65 QEHCGCYTLEVENKLGSRQAQVNLTVVDKPDPP 97
>gnl|CDD|143213 cd05736, Ig2_Follistatin_like, Second immunoglobulin (Ig)-like
domain of a follistatin-like molecule encoded by the
Mahya gene and similar proteins. Ig2_Follistatin_like:
domain similar to the second immunoglobulin (Ig)-like
domain found in a follistatin-like molecule encoded by
the CNS-related Mahya gene. Mahya genes have been
retained in certain Bilaterian branches during
evolution. They are conserved in Hymenoptera and
Deuterostomes, but are absent from other metazoan
species such as fruit fly and nematode. Mahya proteins
are secretory, with a follistatin-like domain
(Kazal-type serine/threonine protease inhibitor domain
and EF-hand calcium-binding domain), two Ig-like
domains, and a novel C-terminal domain. Mahya may be
involved in learning and memory and in processing of
sensory information in Hymenoptera and vertebrates.
Follistatin is a secreted, multidomain protein that
binds activins with high affinity and antagonizes their
signaling.
Length = 76
Score = 40.3 bits (94), Expect = 3e-04
Identities = 19/57 (33%), Positives = 28/57 (49%)
Query: 476 VEGIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIHQAALMDEGEIKCTATNRAG 532
EGIP P+++W K+G +I +++ + S L I D G C A N AG
Sbjct: 7 AEGIPLPRLTWLKNGMDITPKLSKQLTLIANGSELHISNVRYEDTGAYTCIAKNEAG 63
Score = 37.2 bits (86), Expect = 0.003
Identities = 23/71 (32%), Positives = 31/71 (43%), Gaps = 2/71 (2%)
Query: 568 LKVSMAGMPPPTARWLHNGEPLTSGGRYEITHTDRYLNLRISDARRADRGEYQAHGVNSL 627
L+ G+P P WL NG +T ++T L IS+ R D G Y N
Sbjct: 3 LRCHAEGIPLPRLTWLKNGMDITPKLSKQLTLIANGSELHISNVRYEDTGAYTCIAKNEA 62
Query: 628 G--EDVASFLV 636
G ED++S V
Sbjct: 63 GVDEDISSLFV 73
>gnl|CDD|143235 cd05758, Ig5_KIRREL3-like, Fifth immunoglobulin (Ig)-like domain of
Kirrel (kin of irregular chiasm-like) 3 (also known as
Neph2) and similar proteins. Ig5_KIRREL3-like: domain
similar to the fifth immunoglobulin (Ig)-like domain of
Kirrel (kin of irregular chiasm-like) 3 (also known as
Neph2). This protein has five Ig-like domains, one
transmembrane domain, and a cytoplasmic tail. Included
in this group is mammalian Kirrel (Neph1), Kirrel2
(Neph3), and Drosophila RST (irregular chiasm
C-roughest) protein. These proteins contain multiple Ig
domains, have properties of cell adhesion molecules, and
are important in organ development.
Length = 98
Score = 40.9 bits (96), Expect = 3e-04
Identities = 25/98 (25%), Positives = 36/98 (36%), Gaps = 8/98 (8%)
Query: 452 APSFIRALHDTTALEDEKVEFTVQVEGIPTP-KVSW-YKDGF-EIFSSRRQRIVTDND-- 506
P I + A+ +K + P P ++ W +K+ E SS R + TD
Sbjct: 1 GPPIITSEATQYAILGDKGRVECFIFSTPPPDRIVWTWKENELESGSSGRYTVETDPSPG 60
Query: 507 --ISTLIIHQAALMD-EGEIKCTATNRAGHSITKARLR 541
+STL I D + CTA N G L
Sbjct: 61 GVLSTLTISNTQESDFQTSYNCTAWNSFGSGTAIISLE 98
Score = 30.5 bits (69), Expect = 1.2
Identities = 27/98 (27%), Positives = 39/98 (39%), Gaps = 14/98 (14%)
Query: 544 APPTI-RLPKQYEDGLLFEMGEIIKLKVSMAGMPPPTA-RWLHNGEPLTSG--GRY--EI 597
PP I QY +L G+ +++ + PPP W L SG GRY E
Sbjct: 1 GPPIITSEATQY--AIL---GDKGRVECFIFSTPPPDRIVWTWKENELESGSSGRYTVET 55
Query: 598 THTDRYL--NLRISDARRAD-RGEYQAHGVNSLGEDVA 632
+ + L IS+ + +D + Y NS G A
Sbjct: 56 DPSPGGVLSTLTISNTQESDFQTSYNCTAWNSFGSGTA 93
>gnl|CDD|143256 cd05848, Ig1_Contactin-5, First Ig domain of contactin-5.
Ig1_Contactin-5: First Ig domain of the neural cell
adhesion molecule contactin-5. Contactins are comprised
of six Ig domains followed by four fibronectin type III
(FnIII) domains, anchored to the membrane by
glycosylphosphatidylinositol. The different contactins
show different expression patterns in the central
nervous system. In rats, a lack of contactin-5 (NB-2)
results in an impairment of the neuronal activity in the
auditory system. Contactin-5 is expressed specifically
in the postnatal nervous system, peaking at about 3
weeks postnatal. Contactin-5 is highly expressed in the
adult human brain in the occipital lobe and in the
amygdala; lower levels of expression have been detected
in the corpus callosum, caudate nucleus, and spinal
cord.
Length = 94
Score = 40.3 bits (94), Expect = 4e-04
Identities = 25/86 (29%), Positives = 38/86 (44%), Gaps = 10/86 (11%)
Query: 453 PSFIRALHD---TTALEDEKVEFTVQVEGIPTPKVSWYKDGFEI--FSSRRQRIVTDNDI 507
P F++ D T +++KV + G P P W ++G EI S R ++
Sbjct: 2 PVFVQEPDDAIFPTDSDEKKVILNCEARGNPVPTYRWLRNGTEIDTESDYRYSLID---- 57
Query: 508 STLIIHQAALM-DEGEIKCTATNRAG 532
LII + + D G +C ATN G
Sbjct: 58 GNLIISNPSEVKDSGRYQCLATNSIG 83
Score = 34.1 bits (78), Expect = 0.064
Identities = 23/64 (35%), Positives = 29/64 (45%), Gaps = 3/64 (4%)
Query: 566 IKLKVSMAGMPPPTARWLHNGEPLTSGGRYEITHTDRYLNLRISDARRA-DRGEYQAHGV 624
+ L G P PT RWL NG + + Y + D NL IS+ D G YQ
Sbjct: 22 VILNCEARGNPVPTYRWLRNGTEIDTESDYRYSLIDG--NLIISNPSEVKDSGRYQCLAT 79
Query: 625 NSLG 628
NS+G
Sbjct: 80 NSIG 83
>gnl|CDD|143170 cd04969, Ig5_Contactin_like, Fifth Ig domain of contactin.
Ig5_Contactin_like: Fifth Ig domain of contactins.
Contactins are neural cell adhesion molecules and are
comprised of six Ig domains followed by four fibronectin
type III (FnIII) domains anchored to the membrane by
glycosylphosphatidylinositol. The first four Ig domains
form the intermolecular binding fragment, which arranges
as a compact U-shaped module via contacts between Ig
domains 1 and 4, and between Ig domains 2 and 3.
Contactin-2 (TAG-1, axonin-1) may play a part in the
neuronal processes of neurite outgrowth, axon guidance
and fasciculation, and neuronal migration. This group
also includes contactin-1 and contactin-5. The different
contactins show different expression patterns in the
central nervous system. During development and in
adulthood, contactin-2 is transiently expressed in
subsets of central and peripheral neurons. Contactin-5
is expressed specifically in the rat postnatal nervous
system, peaking at about 3 weeks postnatal, and a lack
of contactin-5 (NB-2) results in an impairment of
neuronal activity in the rat auditory system.
Contactin-5 is highly expressed in the adult human brain
in the occipital lobe and in the amygdala. Contactin-1
is differentially expressed in tumor tissues and may,
through a RhoA mechanism, facilitate invasion and
metastasis of human lung adenocarcinoma.
Length = 73
Score = 39.3 bits (92), Expect = 6e-04
Identities = 17/55 (30%), Positives = 23/55 (41%), Gaps = 3/55 (5%)
Query: 574 GMPPPTARWLHNGEPLTSGGRYEITHTDRYLNLRISDARRADRGEYQAHGVNSLG 628
P PT W E LT+ R I D +L I + ++D G+Y N G
Sbjct: 12 AAPKPTISWSKGTELLTNSSRICIW-PDG--SLEILNVTKSDEGKYTCFAENFFG 63
Score = 37.8 bits (88), Expect = 0.002
Identities = 18/56 (32%), Positives = 24/56 (42%), Gaps = 3/56 (5%)
Query: 477 EGIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIHQAALMDEGEIKCTATNRAG 532
+ P P +SW K + +S R I D +L I DEG+ C A N G
Sbjct: 11 KAAPKPTISWSKGTELLTNSSRICIWPD---GSLEILNVTKSDEGKYTCFAENFFG 63
>gnl|CDD|143206 cd05729, Ig2_FGFR_like, Second immunoglobulin (Ig)-like domain of
fibroblast growth factor (FGF) receptor and similar
proteins. Ig2_FGFR_like: domain similar to the second
immunoglobulin (Ig)-like domain of fibroblast growth
factor (FGF) receptor. FGF receptors bind FGF signaling
polypeptides. FGFs participate in multiple processes
such as morphogenesis, development, and angiogenesis.
FGFs bind to four FGF receptor tyrosine kinases (FGFR1,
-2, -3, -4). Receptor diversity is controlled by
alternative splicing producing splice variants with
different ligand binding characteristics and different
expression patterns. FGFRs have an extracellular region
comprised of three Ig-like domains, a single
transmembrane helix, and an intracellular tyrosine
kinase domain. Ligand binding and specificity reside in
the Ig-like domains 2 and 3, and the linker region that
connects these two. FGFR activation and signaling depend
on FGF-induced dimerization, a process involving cell
surface heparin or heparan sulfate proteoglycans. This
group also contains fibroblast growth factor (FGF)
receptor-like 1 (FGFRL1). FGFRL1 does not have a protein
tyrosine kinase domain at its C terminus; neither does
its cytoplasmic domain appear to interact with a
signaling partner. It has been suggested that FGFRL1 may
not have any direct signaling function, but instead acts
as a decoy receptor trapping FGFs and preventing them
from binding other receptors.
Length = 85
Score = 39.7 bits (93), Expect = 6e-04
Identities = 20/77 (25%), Positives = 31/77 (40%), Gaps = 1/77 (1%)
Query: 563 GEIIKLKVSMAGMPPPTARWLHNGEPLTSGGRYEITHT-DRYLNLRISDARRADRGEYQA 621
G ++LK +G P PT WL +G+P R + L + +D G+Y
Sbjct: 9 GSTVRLKCPASGNPRPTITWLKDGKPFKKEHRIGGYKVRKKKWTLILESVVPSDSGKYTC 68
Query: 622 HGVNSLGEDVASFLVTV 638
N G ++ V V
Sbjct: 69 IVENKYGSINHTYKVDV 85
Score = 37.7 bits (88), Expect = 0.003
Identities = 17/66 (25%), Positives = 22/66 (33%), Gaps = 1/66 (1%)
Query: 469 KVEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRIV-TDNDISTLIIHQAALMDEGEIKCTA 527
V G P P ++W KDG R TLI+ D G+ C
Sbjct: 11 TVRLKCPASGNPRPTITWLKDGKPFKKEHRIGGYKVRKKKWTLILESVVPSDSGKYTCIV 70
Query: 528 TNRAGH 533
N+ G
Sbjct: 71 ENKYGS 76
>gnl|CDD|143300 cd05892, Ig_Myotilin_C, C-terminal immunoglobulin (Ig)-like domain
of myotilin. Ig_Myotilin_C: C-terminal immunoglobulin
(Ig)-like domain of myotilin. Myotilin belongs to the
palladin-myotilin-myopalladin family. Proteins belonging
to the latter family contain multiple Ig-like domains
and function as scaffolds, modulating actin
cytoskeleton. Myotilin is most abundant in skeletal and
cardiac muscle, and is involved in maintaining sarcomere
integrity. It binds to alpha-actinin, filamin and actin.
Mutations in myotilin lead to muscle disorders.
Length = 75
Score = 38.4 bits (89), Expect = 0.001
Identities = 22/73 (30%), Positives = 35/73 (47%), Gaps = 2/73 (2%)
Query: 470 VEFTVQVEGIPTPKVSWYKDGFEI-FSSRRQRIVTDNDIS-TLIIHQAALMDEGEIKCTA 527
V+ Q+ IP PK+ W ++ + +++ R + DN TL+I D G +A
Sbjct: 1 VKLECQISAIPPPKIFWKRNNEMVQYNTDRISLYQDNSGRVTLLIKNVNKKDAGWYTVSA 60
Query: 528 TNRAGHSITKARL 540
N AG + ARL
Sbjct: 61 VNEAGVATCHARL 73
Score = 31.1 bits (70), Expect = 0.45
Identities = 17/65 (26%), Positives = 29/65 (44%), Gaps = 2/65 (3%)
Query: 566 IKLKVSMAGMPPPTARWLHNGEPLT-SGGRYEITHTDR-YLNLRISDARRADRGEYQAHG 623
+KL+ ++ +PPP W N E + + R + + + L I + + D G Y
Sbjct: 1 VKLECQISAIPPPKIFWKRNNEMVQYNTDRISLYQDNSGRVTLLIKNVNKKDAGWYTVSA 60
Query: 624 VNSLG 628
VN G
Sbjct: 61 VNEAG 65
>gnl|CDD|143221 cd05744, Ig_Myotilin_C_like, Immunoglobulin (Ig)-like domain of
myotilin, palladin, and myopalladin.
Ig_Myotilin_like_C: immunoglobulin (Ig)-like domain in
myotilin, palladin, and myopalladin. Myotilin,
palladin, and myopalladin function as scaffolds that
regulate actin organization. Myotilin and myopalladin
are most abundant in skeletal and cardiac muscle;
palladin is ubiquitously expressed in the organs of
developing vertebrates and plays a key role in cellular
morphogenesis. The three family members each interact
with specific molecular partners: all three bind to
alpha-actinin; in addition, palladin also binds to
vasodilator-stimulated phosphoprotein (VASP) and ezrin,
myotilin binds to filamin and actin, and myopalladin
also binds to nebulin and cardiac ankyrin repeat protein
(CARP).
Length = 75
Score = 38.2 bits (89), Expect = 0.001
Identities = 22/73 (30%), Positives = 33/73 (45%), Gaps = 2/73 (2%)
Query: 470 VEFTVQVEGIPTPKVSWYKDGFEI-FSSRRQRIVTDN-DISTLIIHQAALMDEGEIKCTA 527
V +V IP P++ W K+ + +++ R + DN L+I A D G +A
Sbjct: 1 VRLECRVSAIPPPQIFWKKNNEMLTYNTDRISLYQDNCGRICLLIQNANKEDAGWYTVSA 60
Query: 528 TNRAGHSITKARL 540
N AG ARL
Sbjct: 61 VNEAGVVSCNARL 73
Score = 31.3 bits (71), Expect = 0.39
Identities = 19/65 (29%), Positives = 31/65 (47%), Gaps = 2/65 (3%)
Query: 566 IKLKVSMAGMPPPTARWLHNGEPLT-SGGRYEITHTDR-YLNLRISDARRADRGEYQAHG 623
++L+ ++ +PPP W N E LT + R + + + L I +A + D G Y
Sbjct: 1 VRLECRVSAIPPPQIFWKKNNEMLTYNTDRISLYQDNCGRICLLIQNANKEDAGWYTVSA 60
Query: 624 VNSLG 628
VN G
Sbjct: 61 VNEAG 65
>gnl|CDD|143242 cd05765, Ig_3, Subgroup of the immunoglobulin (Ig) superfamily.
Ig_3: subgroup of the immunoglobulin (Ig) domain found
in the Ig superfamily. The Ig superfamily is a
heterogeneous group of proteins, built on a common fold
comprised of a sandwich of two beta sheets. Members of
the Ig superfamily are components of immunoglobulin,
neuroglia, cell surface glycoproteins, such as T-cell
receptors, CD2, CD4, CD8, and membrane glycoproteins,
such as butyrophilin and chondroitin sulfate
proteoglycan core protein. A predominant feature of most
Ig domains is a disulfide bridge connecting the two
beta-sheets with a tryptophan residue packed against the
disulfide bond.
Length = 81
Score = 38.3 bits (89), Expect = 0.001
Identities = 25/71 (35%), Positives = 31/71 (43%), Gaps = 7/71 (9%)
Query: 468 EKVEFTVQVEGIPTPKVSWYK--DGFEIFSSR----RQRIVTDNDISTLIIHQAALMDEG 521
E F V G P P+++W K G E R R +V N I L+I+ A D G
Sbjct: 2 ETASFHCDVTGRPPPEITWEKQVHGKENLIMRPNHVRGNVVVTN-IGQLVIYNAQPQDAG 60
Query: 522 EIKCTATNRAG 532
CTA N G
Sbjct: 61 LYTCTARNSGG 71
>gnl|CDD|143177 cd04976, Ig2_VEGFR, Second immunoglobulin (Ig)-like domain of
vascular endothelial growth factor receptor (VEGFR).
Ig2_VEGFR: Second immunoglobulin (Ig)-like domain of
vascular endothelial growth factor receptor (VEGFR). The
VEGFRs have an extracellular component with seven
Ig-like domains, a transmembrane segment, and an
intracellular tyrosine kinase domain interrupted by a
kinase-insert domain. The VEGFR family consists of three
members, VEGFR-1 (Flt-1), VEGFR-2 (KDR/Flk-1) and
VEGFR-3 (Flt-4). VEGFRs bind VEGFs with high affinity at
the Ig-like domains. VEGF-A is important to the growth
and maintenance of vascular endothelial cells and to the
development of new blood and lymphatic vessels in
physiological and pathological states. VEGFR-2 is a
major mediator of the mitogenic, angiogenic and
microvascular permeability-enhancing effects of VEGF-A.
VEGFR-1 may play an inhibitory part in these processes
by binding VEGF and interfering with its interaction
with VEGFR-2. VEGFR-1 has a signaling role in mediating
monocyte chemotaxis. VEGFR-2 and -1 may mediate a
chemotactic and a survival signal in hematopoietic stem
cells or leukemia cells. VEGFR-3 has been shown to be
involved in tumor angiogenesis and growth.
Length = 71
Score = 38.2 bits (89), Expect = 0.001
Identities = 15/54 (27%), Positives = 24/54 (44%), Gaps = 4/54 (7%)
Query: 566 IKLKVSMAGMPPPTARWLHNGEPLTSGGRYEITHTDRYLNLRISDARRADRGEY 619
++L V + PPP +W NG+ ++ R + +L I D D G Y
Sbjct: 1 VRLPVKVKAYPPPEIQWYKNGKLISEKNRT---KKSGH-SLTIKDVTEEDAGNY 50
Score = 32.0 bits (73), Expect = 0.20
Identities = 17/62 (27%), Positives = 26/62 (41%), Gaps = 4/62 (6%)
Query: 470 VEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIHQAALMDEGEIKCTATN 529
V V+V+ P P++ WYK+G I S + + +L I D G TN
Sbjct: 1 VRLPVKVKAYPPPEIQWYKNGKLI-SEKNRTKK---SGHSLTIKDVTEEDAGNYTVVLTN 56
Query: 530 RA 531
+
Sbjct: 57 KQ 58
>gnl|CDD|143277 cd05869, Ig5_NCAM-1, Fifth immunoglobulin (Ig)-like domain of
Neural Cell Adhesion Molecule NCAM-1 (NCAM).
Ig5_NCAM-1: The fifth immunoglobulin (Ig)-like domain of
Neural Cell Adhesion Molecule NCAM-1 (NCAM). NCAM plays
important roles in the development and regeneration of
the central nervous system, in synaptogenesis and neural
migration. NCAM mediates cell-cell and cell-substratum
recognition and adhesion via homophilic (NCAM-NCAM) and
heterophilic (NCAM-non-NCAM) interactions. NCAM is
expressed as three major isoforms having different
intracellular extensions. The extracellular portion of
NCAM has five N-terminal Ig-like domains and two
fibronectin type III domains. The double zipper adhesion
complex model for NCAM homophilic binding involves Ig1,
Ig2, and Ig3. By this model, Ig1 and Ig2 mediate
dimerization of NCAM molecules situated on the same cell
surface (cis interactions), and Ig3 domains mediate
interactions between NCAM molecules expressed on the
surface of opposing cells (trans interactions), through
binding to the Ig1 and Ig2 domains. The adhesive ability
of NCAM is modulated by the addition of polysialic acid
chains to the fifth Ig-like domain.
Length = 97
Score = 38.4 bits (89), Expect = 0.002
Identities = 24/87 (27%), Positives = 39/87 (44%), Gaps = 8/87 (9%)
Query: 456 IRALHDTTALE-DEKVEFTVQVEGIPTPKVSWYKDGFEIFSSRRQ----RIVTDNDI--S 508
I + + TA+E +E++ T + G P P ++W + SS + IV + S
Sbjct: 5 ITYVENQTAMELEEQITLTCEASGDPIPSITW-RTSTRNISSEEKTLDGHIVVRSHARVS 63
Query: 509 TLIIHQAALMDEGEIKCTATNRAGHSI 535
+L + D GE CTA+N G
Sbjct: 64 SLTLKYIQYTDAGEYLCTASNTIGQDS 90
Score = 28.0 bits (62), Expect = 7.9
Identities = 22/83 (26%), Positives = 34/83 (40%), Gaps = 5/83 (6%)
Query: 561 EMGEIIKLKVSMAGMPPPTARWLH-----NGEPLTSGGRYEITHTDRYLNLRISDARRAD 615
E+ E I L +G P P+ W + E T G + R +L + + D
Sbjct: 15 ELEEQITLTCEASGDPIPSITWRTSTRNISSEEKTLDGHIVVRSHARVSSLTLKYIQYTD 74
Query: 616 RGEYQAHGVNSLGEDVASFLVTV 638
GEY N++G+D S + V
Sbjct: 75 AGEYLCTASNTIGQDSQSMYLEV 97
>gnl|CDD|143237 cd05760, Ig2_PTK7, Second immunoglobulin (Ig)-like domain of
protein tyrosine kinase (PTK) 7, also known as CCK4.
Ig2_PTK7: domain similar to the second immunoglobulin
(Ig)-like domain in protein tyrosine kinase (PTK) 7,
also known as CCK4. PTK7 is a subfamily of the receptor
protein tyrosine kinase family, and is referred to as an
RPTK-like molecule. RPTKs transduce extracellular
signals across the cell membrane, and play important
roles in regulating cell proliferation, migration, and
differentiation. PTK7 is organized as an extracellular
portion having seven Ig-like domains, a single
transmembrane region, and a cytoplasmic tyrosine
kinase-like domain. PTK7 is considered a pseudokinase as
it has several unusual residues in some of the highly
conserved tyrosine kinase (TK) motifs; it is predicted
to lack TK activity. PTK7 may function as a
cell-adhesion molecule. PTK7 mRNA is expressed at high
levels in placenta, melanocytes, liver, lung, pancreas,
and kidney. PTK7 is overexpressed in several cancers,
including melanoma and colon cancer lines.
Length = 77
Score = 37.2 bits (86), Expect = 0.003
Identities = 24/78 (30%), Positives = 38/78 (48%), Gaps = 5/78 (6%)
Query: 566 IKLKVSMAGMPPPTARWLHNGEPLT-SGGRYEITHTDRYLNLRISDARRADRGEYQAHGV 624
+ L+ + G P PT +W +G PL+ G Y ++ +R L LR A D G Y
Sbjct: 1 VTLRCHIDGHPRPTYQWFRDGTPLSDGQGNYSVSSKERTLTLR--SAGPDDSGLYYCCAH 58
Query: 625 NSLGEDVAS--FLVTVTD 640
N+ G +S F +++ D
Sbjct: 59 NAFGSVCSSQNFTLSIID 76
Score = 35.3 bits (81), Expect = 0.016
Identities = 17/64 (26%), Positives = 26/64 (40%), Gaps = 1/64 (1%)
Query: 470 VEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIHQAALMDEGEIKCTATN 529
V ++G P P W++DG + + V+ + TL + A D G C A N
Sbjct: 1 VTLRCHIDGHPRPTYQWFRDGTPLSDGQGNYSVSSKE-RTLTLRSAGPDDSGLYYCCAHN 59
Query: 530 RAGH 533
G
Sbjct: 60 AFGS 63
>gnl|CDD|206066 pfam13895, Ig_2, Immunoglobulin domain. This domain contains
immunoglobulin-like domains.
Length = 80
Score = 37.0 bits (86), Expect = 0.004
Identities = 23/76 (30%), Positives = 29/76 (38%), Gaps = 10/76 (13%)
Query: 462 TTALEDEKVEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIHQAALMDEG 521
T E E V T G P P +WYKDG + SS+ N T + D G
Sbjct: 9 TVVFEGEDVTLTCSAPGNPPPNYTWYKDGVPLSSSQ-------NGFFTPNVSAE---DSG 58
Query: 522 EIKCTATNRAGHSITK 537
C A+N G +
Sbjct: 59 TYTCVASNGGGGKTSN 74
Score = 30.9 bits (70), Expect = 0.52
Identities = 15/56 (26%), Positives = 19/56 (33%), Gaps = 5/56 (8%)
Query: 545 PPTIRLPKQYEDGLLFEMGEIIKLKVSMAGMPPPTARWLHNGEPLTSGGRYEITHT 600
P + GE + L S G PPP W +G PL+S T
Sbjct: 1 KPVLTPSPTV-----VFEGEDVTLTCSAPGNPPPNYTWYKDGVPLSSSQNGFFTPN 51
>gnl|CDD|143317 cd07693, Ig1_Robo, First immunoglobulin (Ig)-like domain in Robo
(roundabout) receptors and similar proteins. Ig1_Robo:
domain similar to the first immunoglobulin (Ig)-like
domain in Robo (roundabout) receptors. Robo receptors
play a role in the development of the central nervous
system (CNS), and are receptors of Slit protein. Slit is
a repellant secreted by the neural cells in the midline.
Slit acts through Robo to prevent most neurons from
crossing the midline from either side. Three mammalian
Robo homologs (robo1, -2, and -3), and three mammalian
Slit homologs (Slit-1, -2, -3), have been identified.
Commissural axons, which cross the midline, express low
levels of Robo; longitudinal axons, which avoid the
midline, express high levels of Robo. robo1, -2, and -3
are expressed by commissural neurons in the vertebrate
spinal cord and Slits 1, -2, -3 are expressed at the
ventral midline. Robo-3 is a divergent member of the
Robo family which instead of being a positive regulator
of slit responsiveness, antagonizes slit responsiveness
in precrossing axons. The Slit-Robo interaction is
mediated by the second leucine-rich repeat (LRR) domain
of Slit and the two N-terminal Ig domains of Robo, Ig1
and Ig2. The primary Robo binding site for Slit2 has
been shown by surface plasmon resonance experiments and
mutational analysis to be the Ig1 domain, while the
Ig2 domain has been proposed to harbor a weak secondary
binding site.
Length = 100
Score = 37.5 bits (87), Expect = 0.004
Identities = 24/102 (23%), Positives = 40/102 (39%), Gaps = 11/102 (10%)
Query: 452 APSFIRALHDTTALEDEKVEFTVQVEGIPTPKVSWYKDGFEI----FSSRRQRIVTDNDI 507
P + D + + + EG PTP + W K+G + R RIV +
Sbjct: 1 PPRIVEHPSDLIVSKGDPATLNCKAEGRPTPTIQWLKNGQPLETDKDDPRSHRIVLPS-- 58
Query: 508 STL----IIH-QAALMDEGEIKCTATNRAGHSITKARLRLEA 544
+L ++H + DEG C A N G ++++ A
Sbjct: 59 GSLFFLRVVHGRKGRSDEGVYVCVAHNSLGEAVSRNASLEVA 100
Score = 32.9 bits (75), Expect = 0.15
Identities = 29/96 (30%), Positives = 41/96 (42%), Gaps = 10/96 (10%)
Query: 545 PPTIRLPKQYEDGLLFEMGEIIKLKVSMAGMPPPTARWLHNGEPLTSGGRYEITH----- 599
PP I ++ L+ G+ L G P PT +WL NG+PL + +H
Sbjct: 1 PPRI---VEHPSDLIVSKGDPATLNCKAEGRPTPTIQWLKNGQPLETDKDDPRSHRIVLP 57
Query: 600 TDRYLNLRISDARRA--DRGEYQAHGVNSLGEDVAS 633
+ LR+ R+ D G Y NSLGE V+
Sbjct: 58 SGSLFFLRVVHGRKGRSDEGVYVCVAHNSLGEAVSR 93
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 40.7 bits (95), Expect = 0.005
Identities = 41/219 (18%), Positives = 85/219 (38%), Gaps = 33/219 (15%)
Query: 854 DNEDDYDI-VETNEHTGTGAPSDNENESDYFPEKTIDE------------------SVYG 894
+ E + +I + +H G + E+E + E T DE G
Sbjct: 692 EQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEG 751
Query: 895 YDTIVYGYDSDDLDRHYPTLDEEEEEEDRESLVKDRESSVKGKEEEAKVIKDDEYYENLG 954
+ D + + T E +E+ED + + +KG E ++ + E
Sbjct: 752 KHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGE 811
Query: 955 DVLTKKYSLPVNSDIQIKIDKPDDEPDYVIKGKYEVDNEMLLKRSKLKPQYSSEMSEASN 1014
+ S D ++K + + E + +G+ + D + + S+
Sbjct: 812 KDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEK------------GVDGGGGSD 859
Query: 1015 ITDDEDEEDEEDSFDFDELFEDNPEEEYDEDDRDQPINF 1053
D E+EE+EE+ + +E E+ EEE +E++ ++P++
Sbjct: 860 GGDSEEEEEEEEEEEEEE--EEEEEEEEEEEENEEPLSL 896
>gnl|CDD|143265 cd05857, Ig2_FGFR, Second immunoglobulin (Ig)-like domain of
fibroblast growth factor (FGF) receptor. Ig2_FGFR:
second immunoglobulin (Ig)-like domain of fibroblast
growth factor (FGF) receptor. FGF receptors bind FGF
signaling polypeptides. FGFs participate in multiple
processes such as morphogenesis, development, and
angiogenesis. FGFs bind to four FGF receptor tyrosine
kinases (FGFR1, -2, -3, -4). Receptor diversity is
controlled by alternative splicing producing splice
variants with different ligand binding characteristics
and different expression patterns. FGFRs have an
extracellular region comprised of three IG-like domains,
a single transmembrane helix, and an intracellular
tyrosine kinase domain. Ligand binding and specificity
reside in the Ig-like domains 2 and 3, and the linker
region that connects these two. FGFR activation and
signaling depend on FGF-induced dimerization, a process
involving cell surface heparin or heparan sulfate
proteoglycans.
Length = 85
Score = 36.8 bits (85), Expect = 0.006
Identities = 17/60 (28%), Positives = 29/60 (48%), Gaps = 5/60 (8%)
Query: 563 GEIIKLKVSMAGMPPPTARWLHNGEPLTSGGR---YEITHTDRYLNLRISDARRADRGEY 619
+K + AG P PT RWL NG+ R Y++ + ++ +L + +D+G Y
Sbjct: 9 ANTVKFRCPAAGNPTPTMRWLKNGKEFKQEHRIGGYKVRN--QHWSLIMESVVPSDKGNY 66
Score = 33.3 bits (76), Expect = 0.091
Identities = 19/64 (29%), Positives = 25/64 (39%), Gaps = 1/64 (1%)
Query: 470 VEFTVQVEGIPTPKVSWYKDGFEIFSSRR-QRIVTDNDISTLIIHQAALMDEGEIKCTAT 528
V+F G PTP + W K+G E R N +LI+ D+G C
Sbjct: 12 VKFRCPAAGNPTPTMRWLKNGKEFKQEHRIGGYKVRNQHWSLIMESVVPSDKGNYTCVVE 71
Query: 529 NRAG 532
N G
Sbjct: 72 NEYG 75
>gnl|CDD|143215 cd05738, Ig2_RPTP_IIa_LAR_like, Second immunoglobulin (Ig)-like
domain of the receptor protein tyrosine phosphatase
(RPTP)-F, also known as LAR. Ig2_RPTP_IIa_LAR_like:
domain similar to the second immunoglobulin (Ig)-like
domain found in the receptor protein tyrosine
phosphatase (RPTP)-F, also known as LAR. LAR belongs to
the RPTP type IIa subfamily. Members of this subfamily
are cell adhesion molecule-like proteins involved in
central nervous system (CNS) development. They have
large extracellular portions, comprised of multiple
Ig-like domains and two to nine fibronectin type III
(FNIII) domains, and a cytoplasmic portion having two
tandem phosphatase domains.
Length = 74
Score = 36.6 bits (84), Expect = 0.006
Identities = 18/55 (32%), Positives = 29/55 (52%), Gaps = 1/55 (1%)
Query: 478 GIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIHQAALMDEGEIKCTATNRAG 532
G P P+++W+KD + ++ RI L I + D+G+ +C ATN AG
Sbjct: 9 GNPDPEITWFKDFLPVDTTSNGRI-KQLRSGALQIENSEESDQGKYECVATNSAG 62
Score = 28.9 bits (64), Expect = 2.6
Identities = 14/56 (25%), Positives = 25/56 (44%), Gaps = 1/56 (1%)
Query: 573 AGMPPPTARWLHNGEPLTSGGRYEITHTDRYLNLRISDARRADRGEYQAHGVNSLG 628
+G P P W + P+ + I L+I ++ +D+G+Y+ NS G
Sbjct: 8 SGNPDPEITWFKDFLPVDTTSNGRIKQLRSGA-LQIENSEESDQGKYECVATNSAG 62
>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
Length = 1463
Score = 40.1 bits (94), Expect = 0.007
Identities = 37/165 (22%), Positives = 64/165 (38%), Gaps = 31/165 (18%)
Query: 15 TILIQTLWRSKLAMRRDEREFCMIRSKTIVIQKYFRGYLLMRKE---------------- 58
IQ R + RR + I+ K VIQ FR L+ E
Sbjct: 748 ATRIQRAIRGRYLRRRYLQALKRIK-KIQVIQHGFRLRRLVDYELKWRLFIKLQPLLSLL 806
Query: 59 --RQEYLAMKSSAVKIQEWYRNLQCMRQARQQYLALKHATLKQR--------EEFLKLKH 108
R+EY + + +K+Q+ + + +R+ + +LK L Q+ + F LK
Sbjct: 807 GSRKEYRSYLACIIKLQKTIKREKKLRETEEVEFSLKAEVLIQKFGRSLKAKKRFSLLKK 866
Query: 109 ATIAIQTLYKAKLLMKRDRAAYTELKQACVSVQQRWRANLTMRKQ 153
TI +Q+ + +L ++ ELK S+ NL + +
Sbjct: 867 ETIYLQSAQRVELAERQ----LQELKIDVKSISSLKLVNLELESE 907
Score = 37.8 bits (88), Expect = 0.047
Identities = 26/138 (18%), Positives = 47/138 (34%), Gaps = 16/138 (11%)
Query: 171 YRNTKLMRLEASYLHELKAATITIQRRYRANVAMRTQRERYVALRTATITIQTRFRAYLI 230
++ L LE +L IQR R R R RY+ IQ + +
Sbjct: 728 FKAGVLAALEDMRDAKLDNIATRIQRAIRG----RYLRRRYLQALKRIKKIQVIQHGFRL 783
Query: 231 AKNQRDE---YAELKQARRFRFKLN---LRKYERVIELLKLKREQERQ------EKYRHQ 278
+ E +K + R Y I L+ ++E++ ++ +
Sbjct: 784 RRLVDYELKWRLFIKLQPLLSLLGSRKEYRSYLACIIKLQKTIKREKKLRETEEVEFSLK 843
Query: 279 CAVKIQSLWKMYRVRKKF 296
V IQ + + +K+F
Sbjct: 844 AEVLIQKFGRSLKAKKRF 861
Score = 33.5 bits (77), Expect = 0.85
Identities = 19/105 (18%), Positives = 40/105 (38%), Gaps = 14/105 (13%)
Query: 210 RYVALRTATITIQTRFRAYLIAKNQRDEYAELKQARRFRFKLNLRKYERVIELLKLKREQ 269
R L IQ R + + QA + ++K + + +L+R
Sbjct: 740 RDAKLDNIATRIQRAIRGRYLRR-------RYLQALKR-----IKKIQVIQHGFRLRRLV 787
Query: 270 ERQEKYRHQCAVKIQSLWKMYRVRKKFADIIEQKKQAKKTADNQF 314
+ + K+R +K+Q L + RK++ + + +KT +
Sbjct: 788 DYELKWR--LFIKLQPLLSLLGSRKEYRSYLACIIKLQKTIKREK 830
>gnl|CDD|143210 cd05733, Ig6_L1-CAM_like, Sixth immunoglobulin (Ig)-like domain of
the L1 cell adhesion molecule (CAM) and similar
proteins. Ig6_L1-CAM_like: domain similar to the sixth
immunoglobulin (Ig)-like domain of the L1 cell adhesion
molecule (CAM). L1 belongs to the L1 subfamily of cell
adhesion molecules (CAMs) and is comprised of an
extracellular region having six Ig-like domains and five
fibronectin type III domains, a transmembrane region and
an intracellular domain. L1 is primarily expressed in
the nervous system and is involved in its development
and function. L1 is associated with an X-linked
recessive disorder, X-linked hydrocephalus, MASA
syndrome, or spastic paraplegia type 1, that involves
abnormalities of axonal growth. This group also contains
NrCAM [Ng(neuronglia)CAM-related cell adhesion
molecule], which is primarily expressed in the nervous
system, and human neurofascin.
Length = 77
Score = 35.8 bits (83), Expect = 0.009
Identities = 22/77 (28%), Positives = 35/77 (45%), Gaps = 7/77 (9%)
Query: 472 FTVQVE--GIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLII--HQAALMD--EGEIKC 525
++ E G P P SW ++G + R+ D TL+I + EGE +C
Sbjct: 1 IVIKCEAKGNPPPTFSWTRNGTHFDPEKDPRVTMKPDSGTLVIDNMNGGRAEDYEGEYQC 60
Query: 526 TATNRAGHSIT-KARLR 541
A+N G +I+ + LR
Sbjct: 61 YASNELGTAISNEIHLR 77
Score = 29.7 bits (67), Expect = 1.6
Identities = 21/67 (31%), Positives = 27/67 (40%), Gaps = 4/67 (5%)
Query: 566 IKLKVSMAGMPPPTARWLHNGEPLTSGGRYEITHTDR--YLNLRISDARRADR--GEYQA 621
I +K G PPPT W NG +T L + + RA+ GEYQ
Sbjct: 1 IVIKCEAKGNPPPTFSWTRNGTHFDPEKDPRVTMKPDSGTLVIDNMNGGRAEDYEGEYQC 60
Query: 622 HGVNSLG 628
+ N LG
Sbjct: 61 YASNELG 67
>gnl|CDD|143303 cd05895, Ig_Pro_neuregulin-1, Immunoglobulin (Ig)-like domain found
in neuregulin (NRG)-1. Ig_Pro_neuregulin-1:
immunoglobulin (Ig)-like domain found in neuregulin
(NRG)-1. There are many NRG-1 isoforms which arise from
the alternative splicing of mRNA. NRG-1 belongs to the
neuregulin gene family, which is comprised of four
genes. This group represents NRG-1. NRGs are signaling
molecules, which participate in cell-cell interactions
in the nervous system, breast, and heart, and other
organ systems, and are implicated in the pathology of
diseases including schizophrenia, multiple sclerosis,
and breast cancer. The NRG-1 protein binds to and
activates the tyrosine kinase receptors ErbB3 and
ErbB4, initiating signaling cascades. NRG-1 has multiple
functions; for example, in the brain it regulates
various processes such as radial glia formation and
neuronal migration, dendritic development, and
expression of neurotransmitter receptors; in the
peripheral nervous system NRG-1 regulates processes such
as target cell differentiation, and Schwann cell
survival.
Length = 76
Score = 35.7 bits (82), Expect = 0.010
Identities = 16/57 (28%), Positives = 27/57 (47%), Gaps = 4/57 (7%)
Query: 581 RWLHNGEPLTSGGRYE----ITHTDRYLNLRISDARRADRGEYQAHGVNSLGEDVAS 633
+W NG+ + + + + I + L+IS A AD GEY+ + LG D +
Sbjct: 17 KWFKNGKEIGAKNKPDNKIKIRKKKKSSELQISKASLADNGEYKCMVSSKLGNDSVT 73
Score = 33.8 bits (77), Expect = 0.057
Identities = 19/63 (30%), Positives = 32/63 (50%), Gaps = 4/63 (6%)
Query: 480 PTPKVSWYKDGFEIFSSRR----QRIVTDNDISTLIIHQAALMDEGEIKCTATNRAGHSI 535
P+ + W+K+G EI + + +I S L I +A+L D GE KC +++ G+
Sbjct: 12 PSLRFKWFKNGKEIGAKNKPDNKIKIRKKKKSSELQISKASLADNGEYKCMVSSKLGNDS 71
Query: 536 TKA 538
A
Sbjct: 72 VTA 74
>gnl|CDD|143168 cd04967, Ig1_Contactin, First Ig domain of contactin.
Ig1_Contactin: First Ig domain of contactins. Contactins
are neural cell adhesion molecules and are comprised of
six Ig domains followed by four fibronectin type
III(FnIII) domains anchored to the membrane by
glycosylphosphatidylinositol. The first four Ig domains
form the intermolecular binding fragment, which arranges
as a compact U-shaped module via contacts between Ig
domains 1 and 4, and between Ig domains 2 and 3.
Contactin-2 (TAG-1, axonin-1) may play a part in the
neuronal processes of neurite outgrowth, axon guidance
and fasciculation, and neuronal migration. This group
also includes contactin-1 and contactin-5. The different
contactins show different expression patterns in the
central nervous system. During development and in
adulthood, contactin-2 is transiently expressed in
subsets of central and peripheral neurons. Contactin-5
is expressed specifically in the rat postnatal nervous
system, peaking at about 3 weeks postnatal, and a lack
of contactin-5 (NB-2) results in an impairment of
neuronal activity in the rat auditory system.
Contactin-5 is highly expressed in the adult human brain
in the occipital lobe and in the amygdala. Contactin-1
is differentially expressed in tumor tissues and may,
through a RhoA mechanism, facilitate invasion and
metastasis of human lung adenocarcinoma.
Length = 91
Score = 36.3 bits (84), Expect = 0.011
Identities = 20/70 (28%), Positives = 29/70 (41%), Gaps = 7/70 (10%)
Query: 466 EDEKVEFTVQVEGIPTPKVSWYKDGFEIF--SSRRQRIVTDNDISTLIIHQAALM-DEGE 522
++ KV + G P P W +G EI R +V N L+I + D G
Sbjct: 18 DEGKVSLNCRARGSPPPTYRWLMNGTEIDDEPDSRYSLVGGN----LVISNPSKAKDAGR 73
Query: 523 IKCTATNRAG 532
+C A+N G
Sbjct: 74 YQCLASNIVG 83
Score = 33.2 bits (76), Expect = 0.12
Identities = 26/67 (38%), Positives = 31/67 (46%), Gaps = 11/67 (16%)
Query: 569 KVSMA----GMPPPTARWLHNGE--PLTSGGRYEITHTDRYLNLRISDARRA-DRGEYQA 621
KVS+ G PPPT RWL NG RY + NL IS+ +A D G YQ
Sbjct: 21 KVSLNCRARGSPPPTYRWLMNGTEIDDEPDSRYSLVGG----NLVISNPSKAKDAGRYQC 76
Query: 622 HGVNSLG 628
N +G
Sbjct: 77 LASNIVG 83
>gnl|CDD|201341 pfam00612, IQ, IQ calmodulin-binding motif. Calmodulin-binding
motif.
Length = 21
Score = 33.8 bits (79), Expect = 0.015
Identities = 8/20 (40%), Positives = 14/20 (70%)
Query: 277 HQCAVKIQSLWKMYRVRKKF 296
+ A+KIQ+ W+ Y RK++
Sbjct: 1 RKAAIKIQAAWRGYLARKRY 20
Score = 28.5 bits (65), Expect = 1.2
Identities = 8/21 (38%), Positives = 12/21 (57%)
Query: 39 RSKTIVIQKYFRGYLLMRKER 59
R I IQ +RGYL ++ +
Sbjct: 1 RKAAIKIQAAWRGYLARKRYK 21
>gnl|CDD|143275 cd05867, Ig4_L1-CAM_like, Fourth immunoglobulin (Ig)-like domain of
the L1 cell adhesion molecule (CAM). Ig4_L1-CAM_like:
fourth immunoglobulin (Ig)-like domain of the L1 cell
adhesion molecule (CAM). L1 is comprised of an
extracellular region having six Ig-like domains and five
fibronectin type III domains, a transmembrane region and
an intracellular domain. L1 is primarily expressed in
the nervous system and is involved in its development
and function. L1 is associated with an X-linked
recessive disorder, X-linked hydrocephalus, MASA
syndrome, or spastic paraplegia type 1, that involves
abnormalities of axonal growth. This group also contains
the chicken neuron-glia cell adhesion molecule, Ng-CAM.
Length = 76
Score = 35.2 bits (81), Expect = 0.017
Identities = 22/76 (28%), Positives = 34/76 (44%), Gaps = 7/76 (9%)
Query: 468 EKVEFTVQVEGIPTPKVSWYKDGFEIFSS---RRQRIVTDNDISTLIIHQAALMDEGEIK 524
E QVEGIPTP ++W +G I + R+ + + LI+ D +
Sbjct: 2 ETARLDCQVEGIPTPNITWSINGAPIEGTDPDPRRHVSS----GALILTDVQPSDTAVYQ 57
Query: 525 CTATNRAGHSITKARL 540
C A NR G+ + A +
Sbjct: 58 CEARNRHGNLLANAHV 73
>gnl|CDD|143176 cd04975, Ig4_SCFR_like, Fourth immunoglobulin (Ig)-like domain of
stem cell factor receptor (SCFR) and similar proteins.
Ig4_SCFR_like: fourth immunoglobulin (Ig)-like domain of
stem cell factor receptor (SCFR). In addition to SCFR
this group also includes the fourth Ig domain of
platelet-derived growth factor receptors (PDGFR), alpha
and beta, the fourth Ig domain of macrophage colony
stimulating factor (M-CSF), and the Ig domain of the
receptor tyrosine kinase KIT. SCFR and the PDGFR alpha
and beta have similar organization: an extracellular
component having five Ig-like domains, a transmembrane
segment, and a cytoplasmic portion having protein
tyrosine kinase activity. SCFR and its ligand SCF are
critical for normal hematopoiesis, mast cell
development, melanocytes and gametogenesis. SCF binds to
the second and third Ig-like domains of SCFR, this
fourth Ig-like domain participates in SCFR dimerization,
which follows ligand binding. Deletion of this fourth
SCFR_Ig-like domain abolishes the ligand-induced
dimerization of SCFR and completely inhibits signal
transduction. PDGF is a potent mitogen for connective
tissue cells. PDGF-stimulated processes are mediated by
three different PDGFs (PDGF-A, -B, and -C). PDGFR alpha
binds to all three PDGFs, whereas PDGFR beta binds
only to PDGF-B. In mice, PDGFR alpha and PDGFR beta
are essential for normal development.
Length = 101
Score = 35.8 bits (83), Expect = 0.017
Identities = 21/84 (25%), Positives = 37/84 (44%), Gaps = 6/84 (7%)
Query: 561 EMGEIIKLKVSM-AGMPPPTARWLHNGEPLTSGGRYEIT----HTDRYLN-LRISDARRA 614
+GE + L V + A PPP W ++ LT+ +T RY++ L++ + +
Sbjct: 16 NLGENLNLVVEVEAYPPPPHINWTYDNRTLTNKLTEIVTSENESEYRYVSELKLVRLKES 75
Query: 615 DRGEYQAHGVNSLGEDVASFLVTV 638
+ G Y NS +F + V
Sbjct: 76 EAGTYTFLASNSDASKSLTFELYV 99
>gnl|CDD|143179 cd04978, Ig4_L1-NrCAM_like, Fourth immunoglobulin (Ig)-like domain
of L1, Ng-CAM (Neuron-glia CAM cell adhesion molecule),
and NrCAM (Ng-CAM-related). Ig4_L1-NrCAM_like: fourth
immunoglobulin (Ig)-like domain of L1, Ng-CAM
(Neuron-glia CAM cell adhesion molecule), and NrCAM
(Ng-CAM-related). These proteins belong to the L1
subfamily of cell adhesion molecules (CAMs) and are
comprised of an extracellular region having six Ig-like
domains and five fibronectin type III domains, a
transmembrane region and an intracellular domain. These
molecules are primarily expressed in the nervous system.
L1 is associated with an X-linked recessive disorder,
X-linked hydrocephalus, MASA syndrome, or spastic
paraplegia type 1, that involves abnormalities of axonal
growth.
Length = 76
Score = 35.1 bits (81), Expect = 0.020
Identities = 20/67 (29%), Positives = 28/67 (41%), Gaps = 5/67 (7%)
Query: 468 EKVEFTVQVEGIPTPKVSWYKDG--FEIFSSRRQRIVTDNDISTLIIHQAALMDEGEIKC 525
E + EGIP P ++W +G E +R V D TLI+ D +C
Sbjct: 2 ETGRLDCEAEGIPQPTITWRLNGVPIEELPPDPRRRV---DGGTLILSNVQPNDTAVYQC 58
Query: 526 TATNRAG 532
A+N G
Sbjct: 59 NASNVHG 65
Score = 33.1 bits (76), Expect = 0.076
Identities = 23/81 (28%), Positives = 32/81 (39%), Gaps = 11/81 (13%)
Query: 563 GEIIKLKVSMAGMPPPTARWLHNGEPLTSGGRYEITHTDRYLN-----LRISDARRADRG 617
GE +L G+P PT W NG P+ E D L +S+ + D
Sbjct: 1 GETGRLDCEAEGIPQPTITWRLNGVPI------EELPPDPRRRVDGGTLILSNVQPNDTA 54
Query: 618 EYQAHGVNSLGEDVASFLVTV 638
YQ + N G +A+ V V
Sbjct: 55 VYQCNASNVHGYLLANAFVHV 75
>gnl|CDD|143203 cd05726, Ig4_Robo, Third immunoglobulin (Ig)-like domain in Robo
(roundabout) receptors. Ig4_Robo: domain similar to the
third immunoglobulin (Ig)-like domain in Robo
(roundabout) receptors. Robo receptors play a role in
the development of the central nervous system (CNS), and
are receptors of Slit protein. Slit is a repellant
secreted by the neural cells in the midline. Slit acts
through Robo to prevent most neurons from crossing the
midline from either side. Three mammalian Robo homologs
(robo1, -2, and -3), and three mammalian Slit homologs
(Slit-1, -2, -3), have been identified. Commissural
axons, which cross the midline, express low levels of
Robo; longitudinal axons, which avoid the midline,
express high levels of Robo. robo1, -2, and -3 are
expressed by commissural neurons in the vertebrate
spinal cord and Slits 1, -2, -3 are expressed at the
ventral midline. Robo-3 is a divergent member of the
Robo family which instead of being a positive regulator
of slit responsiveness, antagonizes slit responsiveness
in precrossing axons. The Slit-Robo interaction is
mediated by the second leucine-rich repeat (LRR) domain
of Slit and the two N-terminal Ig domains of Robo, Ig1
and Ig2. The primary Robo binding site for Slit2 has
been shown by surface plasmon resonance experiments and
mutational analysis to be the Ig1 domain, while the
Ig2 domain has been proposed to harbor a weak secondary
binding site.
Length = 90
Score = 35.3 bits (81), Expect = 0.021
Identities = 25/87 (28%), Positives = 36/87 (41%), Gaps = 5/87 (5%)
Query: 470 VEFTVQVEGIPTPKVSWYKDGFE--IFSSRRQRIVTDNDIST---LIIHQAALMDEGEIK 524
V F + G P P + W K+G + +FS + + + +S L I D G
Sbjct: 4 VTFQCEATGNPQPAIFWQKEGSQNLLFSYQPPQSSSRFSVSQTGDLTITNVQRSDVGYYI 63
Query: 525 CTATNRAGHSITKARLRLEAPPTIRLP 551
C N AG +TKA L + R P
Sbjct: 64 CQTLNVAGSILTKAYLEVTDVIADRPP 90
Score = 33.8 bits (77), Expect = 0.067
Identities = 21/86 (24%), Positives = 35/86 (40%), Gaps = 11/86 (12%)
Query: 563 GEIIKLKVSMAGMPPPTARWLHNG--------EPLTSGGRYEITHTDRYLNLRISDARRA 614
G + + G P P W G +P S R+ ++ T +L I++ +R+
Sbjct: 1 GRTVTFQCEATGNPQPAIFWQKEGSQNLLFSYQPPQSSSRFSVSQTG---DLTITNVQRS 57
Query: 615 DRGEYQAHGVNSLGEDVASFLVTVTD 640
D G Y +N G + + VTD
Sbjct: 58 DVGYYICQTLNVAGSILTKAYLEVTD 83
>gnl|CDD|235033 PRK02363, PRK02363, DNA-directed RNA polymerase subunit delta;
Reviewed.
Length = 129
Score = 35.8 bits (83), Expect = 0.030
Identities = 17/57 (29%), Positives = 31/57 (54%), Gaps = 2/57 (3%)
Query: 989 EVDNEMLLKRSKLKPQYSSEMSEASNITDDEDEEDEEDSFDFDELFEDNPEEEYDED 1045
E+D E++ K + M +I DD+ D++ FD ++L E++ E+E DE+
Sbjct: 75 EIDEEIIPLEEKFDKKKKKFMDGDDDIIDDDILPDDD--FDEEDLDEEDDEDEEDEE 129
>gnl|CDD|143212 cd05735, Ig8_DSCAM, Eighth immunoglobulin (Ig) domain of Down
Syndrome Cell Adhesion molecule (DSCAM). Ig8_DSCAM:
the eighth immunoglobulin (Ig) domain of Down Syndrome
Cell Adhesion molecule (DSCAM). DSCAM is a cell adhesion
molecule expressed largely in the developing nervous
system. The gene encoding DSCAM is located at human
chromosome 21q22, the locus associated with the mental
retardation phenotype of Down Syndrome. DSCAM is
predicted to be the largest member of the IG
superfamily. It has been demonstrated that DSCAM can
mediate cation-independent homophilic intercellular
adhesion.
Length = 88
Score = 34.6 bits (79), Expect = 0.031
Identities = 16/40 (40%), Positives = 21/40 (52%)
Query: 606 LRISDARRADRGEYQAHGVNSLGEDVASFLVTVTDRPLPP 645
L+I R D G + H +NS GED +TV + P PP
Sbjct: 49 LQILPTVREDSGFFSCHAINSYGEDRGIIQLTVQEPPDPP 88
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 38.1 bits (88), Expect = 0.032
Identities = 38/184 (20%), Positives = 82/184 (44%), Gaps = 16/184 (8%)
Query: 905 DDLDRHYPTLDEEEEEEDRESLVKDRESSVKGKEEEAKVIKDDEYYENLGDVLTKKYSLP 964
DDL+ D + + ES ++ ES + G +++ V +++ S
Sbjct: 3835 DDLEELANEEDTANQSDLDESEARELESDMNGVTKDSVVSENEN-----------SDSEE 3883
Query: 965 VNSDIQIKIDK-PDDEPDYVIKGKYEVDNEMLLKRSKLKPQYSSEMSEASNITDDEDEED 1023
N D+ +++ P+D + + + ++ NE L ++ Q S+E S A+N +D +ED
Sbjct: 3884 ENQDLDEEVNDIPEDLSNSLNEKLWDEPNEEDLLETE---QKSNEQSAANNESDLVSKED 3940
Query: 1024 EEDSF-DFDELFEDNPEEEYDEDDRDQPINFARNRHNKYIEDDQEEIYHPKLMTMRSSQE 1082
+ + D D +++ EE D+ D I +N + E++ P+ + + +
Sbjct: 3941 DNKALEDKDRQEKEDEEEMSDDVGIDDEIQPDIQENNSQPPPENEDLDLPEDLKLDEKEG 4000
Query: 1083 DLDE 1086
D+ +
Sbjct: 4001 DVSK 4004
Score = 37.7 bits (87), Expect = 0.042
Identities = 50/251 (19%), Positives = 96/251 (38%), Gaps = 52/251 (20%)
Query: 846 SSFRDKYVDNEDDYDIVETNEHTGTGAPSDNENESDYFPEKTIDESVYGYDTIVYGYDSD 905
+S +K D ++ D++ET + + + ++NE++ S
Sbjct: 3901 NSLNEKLWDEPNEEDLLETEQKSNEQSAANNESD----------------------LVSK 3938
Query: 906 DLDRHYPTLDEEEEEEDRESLVKDRESSVKGKEEEAKVIKDDEYYENLGDVLTKKYSLPV 965
+ D + +E+ED E + D G ++E I+ D EN + L +
Sbjct: 3939 EDDNKALEDKDRQEKEDEEEMSDD-----VGIDDE---IQPDI-QENNSQPPPENEDLDL 3989
Query: 966 NSDIQI-----KIDKPDDEPDYVI----KGKYEVDNEMLLKRSKLKPQYSSEMSEASNIT 1016
D+++ + K D D + + K E D E K +P + E +N T
Sbjct: 3990 PEDLKLDEKEGDVSKDSDLEDMDMEAADENKEEADAE------KDEPMQDEDPLEENN-T 4042
Query: 1017 DDEDEEDEEDS---FDFDELFEDNPEEEY--DEDDRDQPINFARNRHNKYIEDDQEEIYH 1071
DED + ++ S D +++ ED EE +E+ + + + +DQ H
Sbjct: 4043 LDEDIQQDDFSDLAEDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPEDQAIDNH 4102
Query: 1072 PKLMTMRSSQE 1082
PK+ +
Sbjct: 4103 PKMDAKSTFAS 4113
Score = 35.0 bits (80), Expect = 0.28
Identities = 43/209 (20%), Positives = 74/209 (35%), Gaps = 40/209 (19%)
Query: 854 DNEDDYDIVE----TNEHTGTGAPSDNENESDYFPEKTIDESVYGYDTIVYGYDSDDLDR 909
+ D+ + E N T S+NEN + +DE V +D+
Sbjct: 3850 SDLDESEARELESDMNGVTKDSVVSENENSDSEEENQDLDEEV------------NDIPE 3897
Query: 910 HYPT------LDEEEEEEDRESLVKDRESSVKGKEEEAKVIKDDEYYENLGDVLTKKYSL 963
DE EE+ E+ K E S E + +DD
Sbjct: 3898 DLSNSLNEKLWDEPNEEDLLETEQKSNEQSAANNESDLVSKEDDN-------------KA 3944
Query: 964 PVNSDIQIKIDKPDDEPDYVIKGKYEVDNEMLLKRSKLKPQYSS-EMSEASNITDDEDEE 1022
+ D Q K D+ + D I E+ ++ S+ P+ ++ E + + E +
Sbjct: 3945 LEDKDRQEKEDEEEMSDDVGIDD--EIQPDIQENNSQPPPENEDLDLPEDLKLDEKEGDV 4002
Query: 1023 DEE-DSFDFDELFEDNPEEEYDEDDRDQP 1050
++ D D D D +EE D ++D+P
Sbjct: 4003 SKDSDLEDMDMEAADENKEEAD-AEKDEP 4030
Score = 34.2 bits (78), Expect = 0.58
Identities = 60/274 (21%), Positives = 99/274 (36%), Gaps = 33/274 (12%)
Query: 850 DKYVDNED-DYDIVETNEHTGTGAPSDNEN------ESDYFPEKTIDESVYGYDTIVYGY 902
K D ED D + + N+ +D E E T+DE + D
Sbjct: 4003 SKDSDLEDMDMEAADENKEE-----ADAEKDEPMQDEDPLEENNTLDEDIQQDDFSDLAE 4057
Query: 903 DSDDLDRHYPTLDEEEEEEDRESLVKDRESSVKGKEEEAKVIKDDEYYENLGDVLTKKYS 962
D + ++ + +E EE E VK E +G+ E + I D+ + +
Sbjct: 4058 DDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPEDQAI-DNHPKMDAKSTFASAEA 4116
Query: 963 LPVNSDIQIKIDKPD-DEPDYV-----IKGKYEVDNEMLLK----RSKLKPQYSS----- 1007
N+D I + + E D V G++E E S+ QY S
Sbjct: 4117 DEENTDKGIVGENEELGEEDGVRGNGTADGEFEQVQEDTSTPKEAMSEADRQYQSLGDHL 4176
Query: 1008 -EMSEASNITDDEDEEDEEDSFDFDELFEDNPEEEYDEDDRDQPINFARNRHNKYIEDDQ 1066
E +A+ I + ED + + D F + DE++ Q + A K I+ D+
Sbjct: 4177 REWQQANRIHEWEDLTESQSQAFDDSEFM---HVKEDEEEDLQALGNAEKDQIKSIDRDE 4233
Query: 1067 EEIYHPKLMTMRSSQEDLDEAPPVPEHLDDGPEI 1100
+P M + ED + + L DG +I
Sbjct: 4234 SANQNPDSMNSTNIAEDEADE-VGDKQLQDGQDI 4266
Score = 30.4 bits (68), Expect = 7.1
Identities = 34/175 (19%), Positives = 67/175 (38%), Gaps = 16/175 (9%)
Query: 916 EEEEEEDRESLVKDRESSVKGKEEEAKVIKDDEYYENLGDVLTKKYSLPVNSDIQIKIDK 975
+ E E+ +E +E+ EA D Y++LGD L +++ +I +
Sbjct: 4145 DGEFEQVQEDTSTPKEA-----MSEA-----DRQYQSLGDHL-REW----QQANRIHEWE 4189
Query: 976 PDDEPDYVIKGKYEVDNEMLLKRSKLKPQYSSEMSEASNITDDEDEEDEEDSFDFDELFE 1035
E E + + L+ ++E + +I DE DS + + E
Sbjct: 4190 DLTESQSQAFDDSEFMHVKEDEEEDLQALGNAEKDQIKSIDRDESANQNPDSMNSTNIAE 4249
Query: 1036 DNPEEEYDEDDRD-QPINFARNRHNKYIEDDQEEIYHPKLMTMRSSQEDLDEAPP 1089
D +E D+ +D Q I+ + + + I + + S ED+++ P
Sbjct: 4250 DEADEVGDKQLQDGQDISDIKQTGEDTLPTEFGSINQSEKVFELSEDEDIEDELP 4304
Score = 30.4 bits (68), Expect = 8.9
Identities = 39/214 (18%), Positives = 71/214 (33%), Gaps = 27/214 (12%)
Query: 860 DIVETNEHTGTGA--PSDNENESDYFPEKTIDESVYGYDTIVYGYDSDDLDRHYPTLDEE 917
+ E E G +D E E T E++ + DR Y +L +
Sbjct: 4128 ENEELGEEDGVRGNGTADGEFEQVQEDTSTPKEAM------------SEADRQYQSLGDH 4175
Query: 918 EEEEDRESLVKDRESSVKGKEEEAKVIKDDEYYENLGDVLTKKYSLPVNSDIQIK-IDKP 976
E + + + + E E +++ D E+ D +L QIK ID+
Sbjct: 4176 LREWQQANRIHEWEDL---TESQSQAFDDSEFMHVKEDEEEDLQALGNAEKDQIKSIDRD 4232
Query: 977 ---DDEPDYVIKGKY------EVDNEMLLKRSKLKPQYSSEMSEASNITDDEDEEDEEDS 1027
+ PD + EV ++ L + + ++ ++
Sbjct: 4233 ESANQNPDSMNSTNIAEDEADEVGDKQLQDGQDISDIKQTGEDTLPTEFGSINQSEKVFE 4292
Query: 1028 FDFDELFEDNPEEEYDEDDRDQPINFARNRHNKY 1061
DE ED + + PI+ AR+ NK+
Sbjct: 4293 LSEDEDIEDELPDYNVKITPAMPIDEARDLWNKH 4326
>gnl|CDD|143214 cd05737, Ig_Myomesin_like_C, C-terminal immunoglobulin (Ig)-like
domain of myomesin and M-protein. Ig_Myomesin_like_C:
domain similar to the C-terminal immunoglobulin
(Ig)-like domain of myomesin and M-protein. Myomesin and
M-protein are both structural proteins localized to the
M-band, a transverse structure in the center of the
sarcomere, and are candidates for M-band bridges. Both
proteins are modular, consisting mainly of repetitive
Ig-like and fibronectin type III (FnIII) domains.
Myomesin is expressed in all types of vertebrate
striated muscle; M-protein has a muscle-type specific
expression pattern. Myomesin is present in both slow and
fast fibers; M-protein is present only in fast fibers.
It has been suggested that myomesin acts as a molecular
spring with alternative splicing as a means of modifying
its elasticity.
Length = 92
Score = 34.4 bits (79), Expect = 0.039
Identities = 21/79 (26%), Positives = 34/79 (43%), Gaps = 2/79 (2%)
Query: 456 IRALHDT-TALEDEKVEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRI-VTDNDISTLIIH 513
+ L D T +E + + T V G P P+VSW K+ + S + V ++L I
Sbjct: 4 VGGLPDVVTIMEGKTLNLTCTVFGDPDPEVSWLKNDQALALSDHYNVKVEQGKYASLTIK 63
Query: 514 QAALMDEGEIKCTATNRAG 532
+ D G+ N+ G
Sbjct: 64 GVSSEDSGKYGIVVKNKYG 82
Score = 34.0 bits (78), Expect = 0.066
Identities = 20/77 (25%), Positives = 31/77 (40%), Gaps = 1/77 (1%)
Query: 563 GEIIKLKVSMAGMPPPTARWLHNGEPLTSGGRYEIT-HTDRYLNLRISDARRADRGEYQA 621
G+ + L ++ G P P WL N + L Y + +Y +L I D G+Y
Sbjct: 16 GKTLNLTCTVFGDPDPEVSWLKNDQALALSDHYNVKVEQGKYASLTIKGVSSEDSGKYGI 75
Query: 622 HGVNSLGEDVASFLVTV 638
N G + V+V
Sbjct: 76 VVKNKYGGETVDVTVSV 92
>gnl|CDD|197470 smart00015, IQ, Calmodulin-binding motif. Short calmodulin-binding
motif containing conserved Ile and Gln residues.
Length = 23
Score = 32.3 bits (75), Expect = 0.048
Identities = 7/21 (33%), Positives = 13/21 (61%)
Query: 276 RHQCAVKIQSLWKMYRVRKKF 296
+ A+ IQ+ W+ Y RK++
Sbjct: 2 LTRAAIIIQAAWRGYLARKRY 22
Score = 28.8 bits (66), Expect = 0.85
Identities = 9/16 (56%), Positives = 11/16 (68%), Gaps = 1/16 (6%)
Query: 43 IVIQKYFRGYLLMRKE 58
I+IQ +RGY L RK
Sbjct: 7 IIIQAAWRGY-LARKR 21
>gnl|CDD|143278 cd05870, Ig5_NCAM-2, Fifth immunoglobulin (Ig)-like domain of
Neural Cell Adhesion Molecule NCAM-2 (also known as
OCAM/mamFas II and RNCAM). Ig5_NCAM-2: the fifth
immunoglobulin (Ig)-like domain of Neural Cell Adhesion
Molecule NCAM-2 (also known as OCAM/mamFas II and
RNCAM). NCAM-2 is organized similarly to NCAM,
including five N-terminal Ig-like domains and two
fibronectin type III domains. NCAM-2 is differentially
expressed in the developing and mature olfactory
epithelium (OE), and may function like NCAM, as an
adhesion molecule.
Length = 98
Score = 34.6 bits (79), Expect = 0.052
Identities = 24/85 (28%), Positives = 37/85 (43%), Gaps = 9/85 (10%)
Query: 456 IRALHDTTALEDEKVEFTVQVEGIPTPKVSWYK--DGFEIFSS------RRQRIVTDNDI 507
I L + T +E+ + + EG P P+++W + DG FS R + +
Sbjct: 5 IIQLKNETTVENGAATLSCKAEGEPIPEITWKRASDGHT-FSEGDKSPDGRIEVKGQHGE 63
Query: 508 STLIIHQAALMDEGEIKCTATNRAG 532
S+L I L D G C A +R G
Sbjct: 64 SSLHIKDVKLSDSGRYDCEAASRIG 88
>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
[Transcription / DNA replication, recombination, and
repair / Chromatin structure and dynamics].
Length = 1001
Score = 36.5 bits (84), Expect = 0.10
Identities = 19/94 (20%), Positives = 32/94 (34%), Gaps = 6/94 (6%)
Query: 855 NEDDYDIVETNEHTGTGAPSDNENESDYFPEKTIDESVYGYDTIVYGYDSDDLDRHYPTL 914
+D E + SD+E+ + E + E+ D D D+
Sbjct: 908 MKDPISFFEDGGWSFLMVGSDDES-DESEEEVSEYEA--SSDDESDETDEDEESDES--- 961
Query: 915 DEEEEEEDRESLVKDRESSVKGKEEEAKVIKDDE 948
E+ E++ E+ D E E E+K D
Sbjct: 962 SEDLSEDESENDSSDEEDGEDWDELESKAAYDSR 995
>gnl|CDD|143172 cd04971, Ig_TrKABC_d5, Fifth domain (immunoglobulin-like) of Trk
receptors TrkA, TrkB and TrkC. TrkABC_d5: the fifth
domain of Trk receptors TrkA, TrkB and TrkC, this is an
immunoglobulin (Ig)-like domain which binds to
neurotrophin. The Trk family of receptors are tyrosine
kinase receptors. They are activated by dimerization,
leading to autophosphorylation of intracellular tyrosine
residues, and triggering the signal transduction
pathway. TrkA, TrkB, and TrkC share significant sequence
homology and domain organization. The first three
domains are leucine-rich domains. The fourth and fifth
domains are Ig-like domains playing a part in ligand
binding. TrkA, B, and C mediate the trophic effects of the
neurotrophin Nerve growth factor (NGF) family. TrkA is
recognized by NGF. TrkB is recognized by brain-derived
neurotrophic factor (BDNF) and neurotrophin (NT)-4. TrkC
is recognized by NT-3. NT-3 is promiscuous as in some
cell systems it activates TrkA and TrkB receptors. TrkA
is a receptor found in all major NGF targets, including
the sympathetic, trigeminal, and dorsal root ganglia,
cholinergic neurons of the basal forebrain and the
striatum. TrkB transcripts are found throughout multiple
structures of the central and peripheral nervous
systems. The TrkC gene is expressed throughout the
mammalian nervous system.
Length = 81
Score = 32.7 bits (75), Expect = 0.12
Identities = 19/67 (28%), Positives = 25/67 (37%), Gaps = 7/67 (10%)
Query: 574 GMPPPTARWLHNGEPL-------TSGGRYEITHTDRYLNLRISDARRADRGEYQAHGVNS 626
G P PT W HNG L T T T+ + L+ + + G Y N
Sbjct: 9 GNPKPTLTWYHNGAVLNESDYIRTEIHYEVTTPTEYHGCLQFDNPTHVNNGNYTLVASNE 68
Query: 627 LGEDVAS 633
G+D S
Sbjct: 69 YGQDSKS 75
>gnl|CDD|143263 cd05855, Ig_TrkB_d5, Fifth domain (immunoglobulin-like) of Trk
receptor TrkB. TrkB_d5: the fifth domain of Trk
receptor TrkB, this is an immunoglobulin (Ig)-like
domain which binds to neurotrophin. The Trk family of
receptors are tyrosine kinase receptors, which mediate
the trophic effects of the neurotrophin Nerve growth
factor (NGF) family. The Trks are activated by
dimerization, leading to autophosphorylation of
intracellular tyrosine residues, and triggering the
signal transduction pathway. TrkB shares significant
sequence homology and domain organization with TrkA, and
TrkC. The first three domains are leucine-rich domains.
The fourth and fifth domains are Ig-like domains playing
a part in ligand binding. TrkB is recognized by
brain-derived neurotrophic factor (BDNF) and
neurotrophin (NT)-4. In some cell systems NT-3 can
activate TrkA and TrkB receptors. TrkB transcripts are
found throughout multiple structures of the central and
peripheral nervous systems.
Length = 79
Score = 32.5 bits (74), Expect = 0.14
Identities = 17/65 (26%), Positives = 28/65 (43%), Gaps = 5/65 (7%)
Query: 571 SMAGMPPPTARWLHNGEPLTSGGR-----YEITHTDRYLNLRISDARRADRGEYQAHGVN 625
++ G P PT +W H G L + I +T+ + L++ + + G Y N
Sbjct: 6 TVKGNPKPTLQWFHEGAILNESEYICTKIHVINNTEYHGCLQLDNPTHLNNGIYTLVAKN 65
Query: 626 SLGED 630
GED
Sbjct: 66 EYGED 70
>gnl|CDD|240433 PTZ00482, PTZ00482, membrane-attack complex/perforin (MACPF)
Superfamily; Provisional.
Length = 844
Score = 35.6 bits (82), Expect = 0.16
Identities = 23/133 (17%), Positives = 47/133 (35%), Gaps = 5/133 (3%)
Query: 902 YDSDDLDRHYPTLDEEEEEEDRESLVKDR--ESSVKGKEEEAKVIKDDEYYENLGDVLTK 959
Y+ D+ D T E ++D + DR ++ + ++ D+ N
Sbjct: 98 YEDDEDDAGNATSGESSTDDDSLLELPDRDEDADTQANNDQTNDFDQDDS-SNSQTDQGL 156
Query: 960 KYSLPVNSDIQIKIDKPDDEPDYVIKGKYEVDNEMLLKRSKLKPQYSSEMSEASNITDDE 1019
K S VN K+ + Y N+ +K + S N +D +
Sbjct: 157 KQS--VNLSSAEKLIEEKKGQTENTFKFYNFGNDGEEAAAKDGGKSKSSDPGPLNDSDGQ 214
Query: 1020 DEEDEEDSFDFDE 1032
++ + +S + D+
Sbjct: 215 GDDGDPESAEEDK 227
>gnl|CDD|143301 cd05893, Ig_Palladin_C, C-terminal immunoglobulin (Ig)-like domain
of palladin. Ig_Palladin_C: C-terminal immunoglobulin
(Ig)-like domain of palladin. Palladin belongs to the
palladin-myotilin-myopalladin family. Proteins belonging
to this family contain multiple Ig-like domains and
function as scaffolds, modulating actin cytoskeleton.
Palladin binds to alpha-actinin, ezrin,
vasodilator-stimulated phosphoprotein VASP, SPIN90 (DIP,
mDia interacting protein), and Src. Palladin also binds
F-actin directly, via its Ig3 domain. Palladin is
expressed as several alternatively spliced isoforms,
having various combinations of Ig-like domains, in a
cell-type-specific manner. It has been suggested that
palladin's different Ig-like domains may be specialized
for distinct functions.
Length = 75
Score = 32.3 bits (73), Expect = 0.17
Identities = 19/65 (29%), Positives = 27/65 (41%), Gaps = 2/65 (3%)
Query: 566 IKLKVSMAGMPPPTARWLHNGEPLTSGGRYEITHTDR--YLNLRISDARRADRGEYQAHG 623
++L+ ++G+P P W E LT H D Y+ L I A + D G Y
Sbjct: 1 VRLECRVSGVPHPQIFWKKENESLTHNTDRVSMHQDNCGYICLLIQGATKEDAGWYTVSA 60
Query: 624 VNSLG 628
N G
Sbjct: 61 KNEAG 65
Score = 30.8 bits (69), Expect = 0.62
Identities = 22/74 (29%), Positives = 31/74 (41%), Gaps = 4/74 (5%)
Query: 470 VEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRIVTDND---ISTLIIHQAALMDEGEIKCT 526
V +V G+P P++ W K+ E + R+ D L+I A D G +
Sbjct: 1 VRLECRVSGVPHPQIFWKKEN-ESLTHNTDRVSMHQDNCGYICLLIQGATKEDAGWYTVS 59
Query: 527 ATNRAGHSITKARL 540
A N AG ARL
Sbjct: 60 AKNEAGIVSCTARL 73
>gnl|CDD|215677 pfam00047, ig, Immunoglobulin domain. Members of the
immunoglobulin superfamily are found in hundreds of
proteins of different functions. Examples include
antibodies, the giant muscle kinase titin and receptor
tyrosine kinases. Immunoglobulin-like domains may be
involved in protein-protein and protein-ligand
interactions. The Pfam alignments do not include the
first and last strand of the immunoglobulin-like domain.
Length = 62
Score = 31.7 bits (72), Expect = 0.22
Identities = 16/61 (26%), Positives = 21/61 (34%), Gaps = 1/61 (1%)
Query: 468 EKVEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRIVTDND-ISTLIIHQAALMDEGEIKCT 526
V T V G P V+W+K+G + S + TL I D G C
Sbjct: 2 SSVTLTCSVSGPPQVDVTWFKEGKGLEESTTVGTDENRVSSITLTISNVTPEDSGTYTCV 61
Query: 527 A 527
Sbjct: 62 V 62
>gnl|CDD|143272 cd05864, Ig2_VEGFR-2, Second immunoglobulin (Ig)-like domain of
vascular endothelial growth factor receptor 2 (VEGFR-2).
Ig2_VEGFR-2: Second immunoglobulin (Ig)-like domain of
vascular endothelial growth factor receptor 2 (VEGFR-2).
The VEGFRs have an extracellular component with seven
Ig-like domains, a transmembrane segment, and an
intracellular tyrosine kinase domain interrupted by a
kinase-insert domain. VEGFRs bind VEGFs with high
affinity at the Ig-like domains. VEGFR-2 (KDR/Flk-1) is
a major mediator of the mitogenic, angiogenic and
microvascular permeability-enhancing effects of VEGF-A;
VEGF-A is important to the growth and maintenance of
vascular endothelial cells and to the development of new
blood- and lymphatic-vessels in physiological and
pathological states. VEGF-A also interacts with VEGFR-1,
which it binds more strongly than VEGFR-2. VEGFR-2 and
-1 may mediate a chemotactic and a survival signal in
hematopoietic stem cells or leukemia cells.
Length = 70
Score = 31.8 bits (72), Expect = 0.22
Identities = 14/54 (25%), Positives = 22/54 (40%), Gaps = 5/54 (9%)
Query: 566 IKLKVSMAGMPPPTARWLHNGEPLTSGGRYEITHTDRYLNLRISDARRADRGEY 619
+K+ V G PPP +W NG+ + ++ L I + D G Y
Sbjct: 1 VKIPVKYYGYPPPEVKWYKNGQLIVLNHTFKRGVH-----LTIYEVTEKDAGNY 49
>gnl|CDD|143284 cd05876, Ig3_L1-CAM, Third immunoglobulin (Ig)-like domain of the
L1 cell adhesion molecule (CAM). Ig3_L1-CAM: third
immunoglobulin (Ig)-like domain of the L1 cell adhesion
molecule (CAM). L1 belongs to the L1 subfamily of cell
adhesion molecules (CAMs) and is comprised of an
extracellular region having six Ig-like domains, five
fibronectin type III domains, a transmembrane region and
an intracellular domain. L1 is primarily expressed in
the nervous system and is involved in its development
and function. L1 is associated with an X-linked
recessive disorder, X-linked hydrocephalus, MASA
syndrome, or spastic paraplegia type 1, that involves
abnormalities of axonal growth. This group also contains
the chicken neuron-glia cell adhesion molecule, Ng-CAM.
Length = 71
Score = 31.8 bits (72), Expect = 0.23
Identities = 19/65 (29%), Positives = 29/65 (44%), Gaps = 3/65 (4%)
Query: 574 GMPPPTARWLHNGEPLTSGGRYEITHTDRYLNLRISDARRADRGEYQAHGVNSLGEDVAS 633
G+P P W PL S R + + ++ L ++ + +D GEY NS G
Sbjct: 9 GLPTPEVHWDRIDGPL-SPNRTKKLNNNKTL--QLDNVLESDDGEYVCTAENSEGSARHH 65
Query: 634 FLVTV 638
+ VTV
Sbjct: 66 YTVTV 70
Score = 29.9 bits (67), Expect = 1.2
Identities = 20/65 (30%), Positives = 34/65 (52%), Gaps = 8/65 (12%)
Query: 477 EGIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIHQAALMDEGEIKCTATN-----RA 531
EG+PTP+V W + + +R +++ N+ TL + D+GE CTA N R
Sbjct: 8 EGLPTPEVHWDRIDGPLSPNRTKKL---NNNKTLQLDNVLESDDGEYVCTAENSEGSARH 64
Query: 532 GHSIT 536
+++T
Sbjct: 65 HYTVT 69
>gnl|CDD|225880 COG3343, RpoE, DNA-directed RNA polymerase, delta subunit
[Transcription].
Length = 175
Score = 34.0 bits (78), Expect = 0.23
Identities = 22/87 (25%), Positives = 39/87 (44%), Gaps = 6/87 (6%)
Query: 965 VNSDIQIKIDKPDDEPDYVIKGKYEVDNEMLLKRSKLKPQYSSEMSEASNITDDEDEEDE 1024
+ + + K K D+ + E D + L + + E+ + DDEDE+D+
Sbjct: 91 IQAMTEKKDIKAKDKEVDAFE---EGDEDELDYDEDKEEEEDDEVDSLDDENDDEDEDDD 147
Query: 1025 EDSFDF---DELFEDNPEEEYDEDDRD 1048
E DE+ ED ++E +ED+ D
Sbjct: 148 EIVEILIEDDEVDEDEDDDEDEEDEED 174
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 35.0 bits (81), Expect = 0.24
Identities = 22/124 (17%), Positives = 41/124 (33%), Gaps = 12/124 (9%)
Query: 928 KDRESSVKGKEEEAKVIKDDEYYENLGDVLTKKYSLPVNSDIQIKIDKPDDEPDYVIKGK 987
+ + K ++E K E L Y ++ Q D DD+ D +
Sbjct: 87 AAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDD 146
Query: 988 YEVDNEMLLKRSKLKPQYSSEMSEASNITDDEDEEDEEDSFD--FDELFEDNPEEEYDED 1045
+ D++ + + + DDEDEE +E + + +E+ E
Sbjct: 147 IDDDDDDED----------DDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEA 196
Query: 1046 DRDQ 1049
R
Sbjct: 197 LRQA 200
Score = 31.9 bits (73), Expect = 1.9
Identities = 23/141 (16%), Positives = 48/141 (34%), Gaps = 6/141 (4%)
Query: 908 DRHYPTLDEEEEEEDRESLVKDRESSVKGKEEEAKVIKDDEYYENLGDVLTKKYSLPVNS 967
+ ++ +VKD + + + + K K KK ++S
Sbjct: 44 SKKKTPEQIDQVLIFLSGMVKDTDDATE-SDIPKKKTKTAAKAAAAKAPAKKKLKDELDS 102
Query: 968 DIQIKIDKPDDEPDYVIKGKYEVDNEMLLKRSKLKPQYSSEMSEASNITDDEDEEDEEDS 1027
+ + D+ D Y D + + + + + D+D++DE+D
Sbjct: 103 SKKAEKKNALDKDD---DLNYVKDID--VLNQADDDDDDDDDDDLDDDDIDDDDDDEDDD 157
Query: 1028 FDFDELFEDNPEEEYDEDDRD 1048
D D+ D+ +EE E
Sbjct: 158 EDDDDDDVDDEDEEKKEAKEL 178
>gnl|CDD|143264 cd05856, Ig2_FGFRL1-like, Second immunoglobulin (Ig)-like domain of
fibroblast growth factor (FGF) receptor-like 1 (FGFRL1).
Ig2_FGFRL1-like: second immunoglobulin (Ig)-like domain
of fibroblast growth factor (FGF)
receptor-like 1 (FGFRL1). FGFRL1 is comprised of a signal
peptide, three extracellular Ig-like modules, a
transmembrane segment, and a short intracellular domain.
FGFRL1 is expressed preferentially in skeletal tissues.
Similar to FGF receptors, the expressed protein
interacts specifically with heparin and with FGF2.
FGFRL1 does not have a protein tyrosine kinase domain at
its C terminus; neither does its cytoplasmic domain
appear to interact with a signaling partner. It has been
suggested that FGFRL1 may not have any direct signaling
function, but instead acts as a decoy receptor trapping
FGFs and preventing them from binding other receptors.
Length = 82
Score = 32.1 bits (73), Expect = 0.26
Identities = 24/78 (30%), Positives = 38/78 (48%), Gaps = 4/78 (5%)
Query: 562 MGEIIKLKVSMAGMPPPTARWLHNGEPLTSGGRYEITHTDR-YLNLRISDARRADRGEYQ 620
+G ++LK +G P P WL + +PLT EI + + L + + + D G+Y
Sbjct: 8 VGSSVRLKCVASGNPRPDITWLKDNKPLT---PTEIGESRKKKWTLSLKNLKPEDSGKYT 64
Query: 621 AHGVNSLGEDVASFLVTV 638
H N GE A++ V V
Sbjct: 65 CHVSNRAGEINATYKVDV 82
>gnl|CDD|143274 cd05866, Ig1_NCAM-2, First immunoglobulin (Ig)-like domain of
neural cell adhesion molecule NCAM-2. Ig1_NCAM-2:
first immunoglobulin (Ig)-like domain of neural cell
adhesion molecule NCAM-2 (OCAM/mamFas II, RNCAM). NCAM-2
is organized similarly to NCAM, including five
N-terminal Ig-like domains and two fibronectin type III
domains. NCAM-2 is differentially expressed in the
developing and mature olfactory epithelium (OE), and may
function like NCAM, as an adhesion molecule.
Length = 92
Score = 31.9 bits (72), Expect = 0.30
Identities = 21/64 (32%), Positives = 31/64 (48%), Gaps = 2/64 (3%)
Query: 472 FTVQVEGIPTPKVSWYK-DGFEIFSSRRQRIVTDNDISTLIIHQAALMDEGEIKCTATNR 530
FT G P + WY G +I SS+R + + S L I+ A + D G +C AT+
Sbjct: 20 FTCTAIGEPE-SIDWYNPQGEKIVSSQRVVVQKEGVRSRLTIYNANIEDAGIYRCQATDA 78
Query: 531 AGHS 534
G +
Sbjct: 79 KGQT 82
>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein. This is a family of
fungal and plant proteins and contains many hypothetical
proteins. VID27 is a cytoplasmic protein that plays a
potential role in vacuolar protein degradation.
Length = 794
Score = 34.7 bits (80), Expect = 0.31
Identities = 15/54 (27%), Positives = 30/54 (55%), Gaps = 1/54 (1%)
Query: 1007 SEMSEASNITDDEDEEDEEDSFDFDELFEDNPEEEYDEDDRDQPINFARNRHNK 1060
E+ +A+ DDE+EEDEE+ + DE E +E D+++ ++ ++ +
Sbjct: 379 LEIEDANTERDDEEEEDEEEEEEEDED-EGPSKEHSDDEEFEEDDVESKYEDSD 431
>gnl|CDD|143299 cd05891, Ig_M-protein_C, C-terminal immunoglobulin (Ig)-like domain
of M-protein (also known as myomesin-2).
Ig_M-protein_C: the C-terminal immunoglobulin (Ig)-like
domain of M-protein (also known as myomesin-2).
M-protein is a structural protein localized to the
M-band, a transverse structure in the center of the
sarcomere, and is a candidate for M-band bridges.
M-protein is modular, consisting mainly of repetitive
Ig-like and fibronectin type III (FnIII) domains, and
has a muscle-type specific expression pattern. M-protein
is present in fast fibers.
Length = 92
Score = 31.8 bits (72), Expect = 0.34
Identities = 18/71 (25%), Positives = 31/71 (43%), Gaps = 1/71 (1%)
Query: 463 TALEDEKVEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRIVTDN-DISTLIIHQAALMDEG 521
T +E + + T V G P P+V W+K+ +I S + + ++L I D G
Sbjct: 12 TIMEGKTLNLTCTVFGNPDPEVIWFKNDQDIELSEHYSVKLEQGKYASLTIKGVTSEDSG 71
Query: 522 EIKCTATNRAG 532
+ N+ G
Sbjct: 72 KYSINVKNKYG 82
>gnl|CDD|143267 cd05859, Ig4_PDGFR-alpha, Fourth immunoglobulin (Ig)-like domain of
platelet-derived growth factor receptor (PDGFR) alpha.
IG4_PDGFR-alpha: The fourth immunoglobulin (Ig)-like
domain of platelet-derived growth factor receptor
(PDGFR) alpha. PDGF is a potent mitogen for connective
tissue cells. PDGF-stimulated processes are mediated by
three different PDGFs (PDGF-A, -B, and -C). PDGFR alpha
binds to all three PDGFs, whereas the PDGFR beta (not
included in this group) binds only to PDGF-B. PDGFR alpha
is organized as an extracellular component having five
Ig-like domains, a transmembrane segment, and a
cytoplasmic portion having protein tyrosine kinase
activity. In mice, PDGFR alpha and PDGFR beta are
essential for normal development.
Length = 101
Score = 31.8 bits (72), Expect = 0.42
Identities = 11/23 (47%), Positives = 14/23 (60%)
Query: 468 EKVEFTVQVEGIPTPKVSWYKDG 490
E EF V+VE P P++ W KD
Sbjct: 19 EVKEFVVEVEAYPPPQIRWLKDN 41
Score = 29.8 bits (67), Expect = 2.2
Identities = 20/68 (29%), Positives = 32/68 (47%), Gaps = 8/68 (11%)
Query: 561 EMGEIIKLKVSMAGMPPPTARWLHNGEPLTSGGRYEITHTD------RYLN-LRISDARR 613
+ E+ + V + PPP RWL + L EIT ++ RY++ L++ A+
Sbjct: 16 NLHEVKEFVVEVEAYPPPQIRWLKDNRTL-IENLTEITTSEHNVQETRYVSKLKLIRAKE 74
Query: 614 ADRGEYQA 621
D G Y A
Sbjct: 75 EDSGLYTA 82
>gnl|CDD|143241 cd05764, Ig_2, Subgroup of the immunoglobulin (Ig) superfamily.
Ig_2: subgroup of the immunoglobulin (Ig) domain found
in the Ig superfamily. The Ig superfamily is a
heterogenous group of proteins, built on a common fold
comprised of a sandwich of two beta sheets. Members of
the Ig superfamily are components of immunoglobulin,
neuroglia, cell surface glycoproteins, such as T-cell
receptors, CD2, CD4, CD8, and membrane glycoproteins,
such as butyrophilin and chondroitin sulfate
proteoglycan core protein. A predominant feature of most
Ig domains is a disulfide bridge connecting the two
beta-sheets with a tryptophan residue packed against the
disulfide bond.
Length = 74
Score = 31.3 bits (71), Expect = 0.43
Identities = 19/65 (29%), Positives = 29/65 (44%), Gaps = 2/65 (3%)
Query: 478 GIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIHQAALMDEGEIKCTATNRAGHSITK 537
G P P + W ++ S+ + +V DN TL I + D G C A+N AG +
Sbjct: 12 GDPEPAIHWISPDGKLISNSSRTLVYDN--GTLDILITTVKDTGSFTCIASNAAGEATAT 69
Query: 538 ARLRL 542
L +
Sbjct: 70 VELHI 74
>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein. This family includes proteins
related to Mpp10 (M phase phosphoprotein 10). The U3
small nucleolar ribonucleoprotein (snoRNP) is required
for three cleavage events that generate the mature 18S
rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
depletion of Mpp10, a U3 snoRNP-specific protein, halts
18S rRNA production and impairs cleavage at the three U3
snoRNP-dependent sites.
Length = 613
Score = 34.2 bits (78), Expect = 0.48
Identities = 31/170 (18%), Positives = 53/170 (31%), Gaps = 11/170 (6%)
Query: 913 TLDEEEEEEDRESLVKDRESSVK-----GKEEEAKVIKDDEYYENLGDVLTKKYSLPVNS 967
E EEE D E + GK++E +DE + G++ + + P
Sbjct: 204 LEATEAEEEAALGDEDDFEDYFQDDSEDGKDDEDFGSGEDEEDDEEGNIEYEDFFDPKEK 263
Query: 968 DIQIKIDKPDDEPDYVIKGKYEVDNEMLLKRSKLKPQYSSEMSEASNITDDEDEEDEEDS 1027
D + D E + K V E K + E + DE+E E
Sbjct: 264 DKKKDAGD-DAELEDDEPDKEAVKKEADSKPEE-----EDEEDDEQEDDQDEEEPPEAAM 317
Query: 1028 FDFDELFEDNPEEEYDEDDRDQPINFARNRHNKYIEDDQEEIYHPKLMTM 1077
+ + + + + IE ++E PK T+
Sbjct: 318 DKVKLDEPVLEGVDLESPKELSSFEKRQAKLKQQIEQLEKENLAPKSWTL 367
Score = 32.7 bits (74), Expect = 1.2
Identities = 18/95 (18%), Positives = 32/95 (33%), Gaps = 9/95 (9%)
Query: 1006 SSEMSEASNITDDEDEEDEEDSFDFDELFEDNPEEEYDEDDRDQPINFARNRHNKYIEDD 1065
S S + D++EE+EED DE+ +D E + + + + DD
Sbjct: 104 SDGSDMDSEDSADDEEEEEEDESLEDEMIDDEDEADLFNESESS---------LEDLSDD 154
Query: 1066 QEEIYHPKLMTMRSSQEDLDEAPPVPEHLDDGPEI 1100
+ E K M + E+ +
Sbjct: 155 ETEDDEEKKMEEEEAGEEKESVEQATREKKFDKSG 189
>gnl|CDD|221333 pfam11942, Spt5_N, Spt5 transcription elongation factor, acidic
N-terminal. This is the very acidic N-terminal region of
the early transcription elongation factor Spt5. The
Spt5-Spt4 complex regulates early transcription
elongation by RNA polymerase II and has an imputed role
in pre-mRNA processing via its physical association with
mRNA capping enzymes. The actual function of this
N-terminal domain is not known although it is dispensable
for binding to Spt4.
Length = 92
Score = 31.3 bits (71), Expect = 0.50
Identities = 17/53 (32%), Positives = 30/53 (56%)
Query: 1017 DDEDEEDEEDSFDFDELFEDNPEEEYDEDDRDQPINFARNRHNKYIEDDQEEI 1069
DDE+EE+EE+ D ++L +++ + E + D+ R K E+D EE+
Sbjct: 9 DDEEEEEEEEEDDLEDLSDEDEFIDEAEAEDDRRHRRLDRRREKEEEEDAEEL 61
Score = 27.8 bits (62), Expect = 9.7
Identities = 13/37 (35%), Positives = 21/37 (56%)
Query: 1011 EASNITDDEDEEDEEDSFDFDELFEDNPEEEYDEDDR 1047
EA ++E+EE+EED + ++ +E EDDR
Sbjct: 5 EAEVDDEEEEEEEEEDDLEDLSDEDEFIDEAEAEDDR 41
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits in
yeasts and in human cells. Twelve of these are akin to
RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31, Rpc34
and Rpc82 form a cluster of enzyme-specific subunits that
contribute to transcription initiation in S. cerevisiae
and H. sapiens. There is evidence that these subunits are
anchored at or near the N-terminal Zn-fold of Rpc1,
itself prolonged by a highly conserved but RNA polymerase
III-specific domain.
Length = 221
Score = 33.2 bits (76), Expect = 0.59
Identities = 17/54 (31%), Positives = 33/54 (61%), Gaps = 2/54 (3%)
Query: 993 EMLLKRSKLKPQYSSEMSEASNITDDEDEEDEEDSFDFDELFEDNPEEEYDEDD 1046
++ K S L+ + +E + D++DEE+EE+ + DE F+D +++ D+DD
Sbjct: 148 DIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDD--DDDDDDDD 199
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the maturation
of the 18S rRNA and for 40S ribosome production.
Length = 809
Score = 33.8 bits (78), Expect = 0.69
Identities = 51/218 (23%), Positives = 84/218 (38%), Gaps = 54/218 (24%)
Query: 884 PEKTIDESVYGYDTIVY--GYDSDDLDRHYPT----LDEE---EEEEDRESLVKDRESSV 934
P T +E YD V +D R PT +EE EE E + L +R +
Sbjct: 227 PPMTPEEKDDEYDQRVRELTFDR----RAQPTDRTKTEEELAKEEAERLKKLEAERLRRM 282
Query: 935 KGKEEEAKVIKDDEYYENLGDVLTKKYSLPVNSDIQIKIDKPDDEPDYVIKGKYEVDNEM 994
+G+EE+ D+E D + D DDE + + D+
Sbjct: 283 RGEEED-----DEE-----------------EEDSKESADDLDDEFEP------DDDDNF 314
Query: 995 LLKRSKLKPQYSSEMSEASNITDDEDEEDEEDSFDFDELFEDNPEEEYDEDDRDQPINFA 1054
L + + + + + + DD D++ EE+ D D E+ EE+ D DD D
Sbjct: 315 GLGQGEEDEEEEEDGVDDEDEEDD-DDDLEEEEEDVDLSDEEEDEEDEDSDDEDDE---- 369
Query: 1055 RNRHNKYIEDDQEEIYHPKLMTMRSSQEDLDEAPPVPE 1092
E+++EE K + S++ +L P P+
Sbjct: 370 --------EEEEEEKEKKKKKSAESTRSELPFTFPCPK 399
>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein. Genome
duplication is precisely regulated by cyclin-dependent
kinases (CDKs), which bring about the onset of S phase by
activating replication origins and then prevent
relicensing of origins until mitosis is completed. The
optimum sequence motif for CDK phosphorylation is
S/T-P-K/R-K/R, and Drc1-Sld2 is found to have at least 11
potential phosphorylation sites. Drc1 is required for DNA
synthesis and S-M replication checkpoint control. Drc1
associates with Cdc2 and is phosphorylated at the onset
of S phase when Cdc2 is activated. Thus Cdc2 promotes DNA
replication by phosphorylating Drc1 and regulating its
association with Cut5. Sld2 and Sld3 represent the
minimal set of S-CDK substrates required for DNA
replication.
Length = 397
Score = 33.2 bits (76), Expect = 0.72
Identities = 28/128 (21%), Positives = 50/128 (39%), Gaps = 17/128 (13%)
Query: 925 SLVKDRESSVKGKEEEAKVIKDDEYYENLGDVLTKKYSLPVNSDIQIKIDKPDDEPDYVI 984
LV++ ES +E V+++ E E + ++ V+S +DEP V
Sbjct: 238 ELVQEEESID----DELDVLREIEAEEAGIGPIEEEV---VDSQA------ANDEPRRVF 284
Query: 985 KGKYEVDNEMLLKRSKLKPQYSSEMSEASNITDD-EDEEDEEDSFDFDELFEDNPEEEYD 1043
K K + + +R K++P + E S D +E + D E + D
Sbjct: 285 KKKGQ---KRTTRRVKMRPVRAKPSDEPSLPESDIHEEIPKLDEKSLSEFLGYMGGIDED 341
Query: 1044 EDDRDQPI 1051
++D D
Sbjct: 342 DEDEDDEE 349
>gnl|CDD|217373 pfam03115, Astro_capsid, Astrovirus capsid protein precursor. This
product is encoded by astrovirus ORF2, one of the three
astrovirus ORFs (1a, 1b, 2). The 87kD precursor protein
undergoes an intracellular cleavage to form a 79kD
protein. Subsequently, extracellular trypsin cleavage
yields the three proteins forming the infectious virion.
Length = 787
Score = 33.6 bits (77), Expect = 0.78
Identities = 16/54 (29%), Positives = 27/54 (50%), Gaps = 2/54 (3%)
Query: 1003 PQYSSEMSEASNITDDEDEEDEEDSFDFDELFEDNPEEEYDEDDRDQPINFARN 1056
S E ++ + EDE+DE D FD + PE++ DE++R ++ N
Sbjct: 668 DLISLEETD-TEDESTEDEDDELDRFDLHDSSGSEPEDD-DENNRVTLLSTLIN 719
>gnl|CDD|143280 cd05872, Ig_Sema4B_like, Immunoglobulin (Ig)-like domain of the
class IV semaphorin Sema4B. Ig_Sema4B_like;
Immunoglobulin (Ig)-like domain of Sema4B_like. Sema4B
is a Class IV semaphorin. Semaphorins are classified
based on structural features additional to the Sema
domain. Sema4B has extracellular Sema and Ig domains, a
transmembrane domain and a short cytoplasmic domain.
Sema4B has been shown to preferentially regulate the
development of the postsynaptic specialization at the
glutamatergic synapses. This cytoplasmic domain includes
a PDZ-binding motif upon which the synaptic localization
of Sema4B is dependent. Sema4B is a ligand of CLCP1;
CLCP1 was identified in an expression profiling
analysis, which compared a highly metastatic lung cancer
subline with its low metastatic parental line. Sema4B was
shown to promote CLCP1 endocytosis, and their
interaction is a potential target for therapeutic
intervention of metastasis.
Length = 85
Score = 30.1 bits (68), Expect = 1.3
Identities = 17/61 (27%), Positives = 25/61 (40%), Gaps = 5/61 (8%)
Query: 579 TARWLHNGEPLTSGGRYEITHTDRYLNLRISDARRADRGEYQAHGV-NSLGEDVASFLVT 637
+ WL NG PL + Y + TD L I G Y+ + + VAS+ +
Sbjct: 26 SPVWLFNGTPLNAQFSYRVG-TD---GLLILVTSPEHSGTYRCYSEEEGFQQLVASYSLN 81
Query: 638 V 638
V
Sbjct: 82 V 82
>gnl|CDD|222477 pfam13965, SID-1_RNA_chan, dsRNA-gated channel SID-1. This is a
family of proteins that are transmembrane dsRNA-gated
channels. They passively transport dsRNA into cells and
do not act as ATP-dependent pumps. They are required for
systemic RNA interference.
Length = 567
Score = 32.4 bits (74), Expect = 1.4
Identities = 14/62 (22%), Positives = 22/62 (35%), Gaps = 2/62 (3%)
Query: 1013 SNITDDEDEEDEEDSFDFDELFEDNP--EEEYDEDDRDQPINFARNRHNKYIEDDQEEIY 1070
+I E E+ + D + E E D DQ I R + + Y+ D +
Sbjct: 150 RDIISFEPSPSEQRAMDLQPDQSEEDSSERENDILMADQQIMVIREKASLYVSDLSRKDQ 209
Query: 1071 HP 1072
P
Sbjct: 210 RP 211
>gnl|CDD|218003 pfam04281, Tom22, Mitochondrial import receptor subunit Tom22. The
mitochondrial protein translocase family, which is
responsible for movement of nuclear encoded pre-proteins
into mitochondria, is very complex with at least 19
components. These proteins include several chaperone
proteins, four proteins of the outer membrane translocase
(Tom) import receptor, five proteins of the Tom channel
complex, five proteins of the inner membrane translocase
(Tim) and three "motor" proteins. This family represents
the Tom22 proteins. The N terminal region of Tom22 has
been shown to have chaperone-like activity, and the C
terminal region faces the intermembrane face.
Length = 136
Score = 31.1 bits (71), Expect = 1.4
Identities = 10/44 (22%), Positives = 20/44 (45%)
Query: 989 EVDNEMLLKRSKLKPQYSSEMSEASNITDDEDEEDEEDSFDFDE 1032
EV++E ++ + E S+ + D + + D D DF+
Sbjct: 6 EVEDETFQEKPAAPKNLAQEESDDDDEDDTDTDSDISDDSDFEN 49
>gnl|CDD|143266 cd05858, Ig3_FGFR-2, Third immunoglobulin (Ig)-like domain of
fibroblast growth factor receptor 2 (FGFR2).
Ig3_FGFR-2-like; domain similar to the third
immunoglobulin (Ig)-like domain of human fibroblast
growth factor receptor 2 (FGFR2). Fibroblast growth
factors (FGFs) participate in morphogenesis,
development, angiogenesis, and wound healing. These
FGF-stimulated processes are mediated by four FGFR
tyrosine kinases (FGFR1-4). FGFRs are comprised of an
extracellular portion consisting of three Ig-like
domains, a transmembrane helix, and a cytoplasmic
portion having protein tyrosine kinase activity. The
highly conserved Ig-like domains 2 and 3, and the linker
region between D2 and D3 define a general binding site
for FGFs. FGFR2 is required for male sex determination.
Length = 90
Score = 29.9 bits (67), Expect = 1.4
Identities = 23/84 (27%), Positives = 30/84 (35%), Gaps = 13/84 (15%)
Query: 470 VEFTVQVEGIPTPKVSWYK-----------DGFEIFSSRRQRIV--TDNDISTLIIHQAA 516
VEF +V P + W K DG + + V TD ++ L +
Sbjct: 4 VEFVCKVYSDAQPHIQWLKHVEKNGSKYGPDGLPYVTVLKTAGVNTTDKEMEVLYLRNVT 63
Query: 517 LMDEGEIKCTATNRAGHSITKARL 540
D GE C A N G S A L
Sbjct: 64 FEDAGEYTCLAGNSIGISHHSAWL 87
>gnl|CDD|143250 cd05773, Ig8_hNephrin_like, Eighth immunoglobulin-like domain of
nephrin. Ig8_hNephrin_like: domain similar to the
eighth immunoglobulin-like domain in human nephrin.
Nephrin is an integral component of the slit diaphragm,
and is a central component of the glomerular
ultrafilter. Nephrin plays a structural role, and has a
role in signaling. Nephrin is a transmembrane protein
having a short intracellular portion, and an
extracellular portion comprised of eight Ig-like
domains, and one fibronectin type III-like domain. The
extracellular portions of nephrin, from neighboring foot
processes of separate podocyte cells, may interact with
each other, and in association with other components of
the slit diaphragm, form a porous molecular sieve within
the slit pore. The intracellular portion of nephrin is
associated with linker proteins, which connect nephrin
to the actin cytoskeleton. The intracellular portion is
tyrosine phosphorylated, and mediates signaling from the
slit diaphragm into the podocytes.
Length = 109
Score = 30.7 bits (69), Expect = 1.4
Identities = 27/77 (35%), Positives = 32/77 (41%), Gaps = 10/77 (12%)
Query: 574 GMPPPTARWLHNGEPLTSGG-RYEIT-------HTDRYLNLRISDARRADRGEYQAHGVN 625
G+P RW NG PL G RYE T HT + +S A D + N
Sbjct: 34 GVPRVQFRWAKNGVPLDLGNPRYEETTEHTGTVHTSILTIINVSAAL--DYALFTCTAHN 91
Query: 626 SLGEDVASFLVTVTDRP 642
SLGED + T RP
Sbjct: 92 SLGEDSLDIQLVSTSRP 108
>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
Length = 413
Score = 32.2 bits (73), Expect = 1.6
Identities = 21/101 (20%), Positives = 29/101 (28%), Gaps = 12/101 (11%)
Query: 999 SKLKPQYSSEMSEASNITDDEDEEDEEDSFDFDELFEDNPEEEYDEDDRDQPINFARNRH 1058
S L P+ +S S S + E P E ++ QP +F + H
Sbjct: 116 SGLSPENTSGSSPESPASHSPPPSPPSHPGPH----EPAPPESHNPSPNQQPSSFLQPSH 171
Query: 1059 NKYIEDDQEEIYHPKLMTMRSSQEDLDEAPPVPEHLDDGPE 1099
ED EE P + D P P
Sbjct: 172 ----EDSPEEPEPPT----SEPEPDSPGPPQSETPTSSPPP 204
>gnl|CDD|240226 PTZ00007, PTZ00007, (NAP-L) nucleosome assembly protein -L;
Provisional.
Length = 337
Score = 32.1 bits (73), Expect = 1.6
Identities = 11/46 (23%), Positives = 17/46 (36%)
Query: 1015 ITDDEDEEDEEDSFDFDELFEDNPEEEYDEDDRDQPINFARNRHNK 1060
DED + D D D D+ + + + D N +R K
Sbjct: 278 EAIDEDSDYSSDEDDDDYDSYDSSDSASSDSNSDVDTNEEDDRGEK 323
>gnl|CDD|227504 COG5177, COG5177, Uncharacterized conserved protein [Function
unknown].
Length = 769
Score = 32.4 bits (73), Expect = 1.7
Identities = 33/205 (16%), Positives = 64/205 (31%), Gaps = 35/205 (17%)
Query: 872 APSDNENESDYFPEKTIDESVYGYDTIVYGYDSDDLDRHYPTLDEEEEEEDRESLVKDRE 931
+ + F ++ + + D LD + P ++ + D +
Sbjct: 300 NGQYEQTIREIFADRATKLELDLQTVFESNMNRDTLDEYAPEGEDLRSDYDEDF------ 353
Query: 932 SSVKGKEEEAKVIKDDEYYENLGDVLTKKYSLPV-NSDIQIKIDKPDDEPDYVIKGKYEV 990
+ V DD + + +KK ++P S Q K + ++E D +
Sbjct: 354 ----EYDGLTTVRIDDHGFLPGREQTSKKAAVPKGTSFYQAKWAEDEEEEDGQCNDEEST 409
Query: 991 DNEMLLKRSKLKPQYSSEMSEASNITDDEDEE-DEEDSFDFDELFEDNPEEEYDEDDR-- 1047
+ + + E N DEE +D+ F+EL + E + E
Sbjct: 410 MSAI----------DDDDPKENDNEEVAGDEESAIDDNEGFEELSPEEEERQLREFRDME 459
Query: 1048 -----------DQPINFARNRHNKY 1061
QP A R+ +Y
Sbjct: 460 KEDREFPDEAELQPSESAIERYKEY 484
>gnl|CDD|143282 cd05874, Ig6_NrCAM, Sixth immunoglobulin (Ig)-like domain of NrCAM
(Ng (neuron-glia) CAM-related cell adhesion molecule).
Ig6_NrCAM: sixth immunoglobulin (Ig)-like domain of
NrCAM (Ng (neuron-glia) CAM-related cell adhesion
molecule). NrCAM belongs to the L1 subfamily of cell
adhesion molecules (CAMs) and is comprised of an
extracellular region having six Ig-like domains and five
fibronectin type III domains, a transmembrane region,
and an intracellular domain. NrCAM is primarily
expressed in the nervous system.
Length = 77
Score = 29.5 bits (66), Expect = 1.7
Identities = 17/66 (25%), Positives = 32/66 (48%), Gaps = 4/66 (6%)
Query: 475 QVEGIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIH----QAALMDEGEIKCTATNR 530
+ +G P P SW ++G + ++ + TL+I+ + A EG +CTA N
Sbjct: 6 EAKGKPPPSFSWTRNGTHFDIDKDPKVTMKPNTGTLVINIMNGEKAEAYEGVYQCTARNE 65
Query: 531 AGHSIT 536
G +++
Sbjct: 66 RGAAVS 71
>gnl|CDD|227382 COG5049, XRN1, 5'-3' exonuclease [DNA replication, recombination, and
repair / Cell division and chromosome partitioning /
Translation].
Length = 953
Score = 32.2 bits (73), Expect = 1.8
Identities = 35/186 (18%), Positives = 63/186 (33%), Gaps = 20/186 (10%)
Query: 917 EEEEEDRESLVKDRESSVKGKEEEAKVIKDDEYYENLGDVLTKKYSLPVNSDIQIKIDKP 976
+ +E+R++ +R S K ++E K + Y + K ++ + K
Sbjct: 380 DHIQEERKNESLERFSLRKERKEGLKGMPRVVYEQKKLIGSIKPTL--MDQLQEKKSPDL 437
Query: 977 DDEPDY----VIKGKYEVDNEMLLKRSKLKPQYSSEMSEASNITDDEDEEDEEDSFDFDE 1032
DE + K ++E+ LKR S + S + + DS D DE
Sbjct: 438 PDEEFIDTLALPKDLDMKNHELFLKRFANDLGLSISKAIKSKGNYSLEMDIASDSPDEDE 497
Query: 1033 LFEDNPEEEYDEDDR-----------DQPINFARNRHNKYIEDDQEEIYHPKLMTMRSSQ 1081
+ E E D + ++ N N +E Y KL S+
Sbjct: 498 ---EEFESEVDSIRKIPDKYVNIIVEEEEENETEKTVNLRFPGWKERYYTSKLHFTTDSE 554
Query: 1082 EDLDEA 1087
E + +
Sbjct: 555 EKIRDM 560
>gnl|CDD|143271 cd05863, Ig2_VEGFR-3, Second immunoglobulin (Ig)-like domain of
vascular endothelial growth factor receptor 3 (VEGFR-3).
Ig2_VEGFR-3: Second immunoglobulin (Ig)-like domain of
vascular endothelial growth factor receptor 3 (VEGFR-3).
The VEGFRs have an extracellular component with seven
Ig-like domains, a transmembrane segment, and an
intracellular tyrosine kinase domain interrupted by a
kinase-insert domain. VEGFRs bind VEGFs with high
affinity at the Ig-like domains. VEGFR-3 (Flt-4) binds
two members of the VEGF family (VEGF-C and -D) and is
involved in tumor angiogenesis and growth.
Length = 67
Score = 28.8 bits (64), Expect = 2.4
Identities = 16/62 (25%), Positives = 24/62 (38%), Gaps = 8/62 (12%)
Query: 470 VEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIHQAALMDEGEIKCTATN 529
V+ V+V P P+ WYKDG ++ S + + +L I G N
Sbjct: 1 VKLPVKVAAYPPPEFQWYKDG-KLISGKHSQ-------HSLQIKDVTEASAGTYTLVLWN 52
Query: 530 RA 531
A
Sbjct: 53 SA 54
>gnl|CDD|218555 pfam05320, Pox_RNA_Pol_19, Poxvirus DNA-directed RNA polymerase 19
kDa subunit. This family contains several DNA-directed
RNA polymerase 19 kDa polypeptides. The Poxvirus
DNA-directed RNA polymerase (EC: 2.7.7.6) catalyzes
DNA-template-directed extension of the 3'-end of an RNA
strand by one nucleotide at a time.
Length = 167
Score = 30.8 bits (70), Expect = 2.5
Identities = 9/37 (24%), Positives = 21/37 (56%)
Query: 1009 MSEASNITDDEDEEDEEDSFDFDELFEDNPEEEYDED 1045
M ++ +I D E ++D+ + ++ +E E++ E D
Sbjct: 1 MEDSDDIIDYESDDDDSEEYEEEEEDEEDAESLESSD 37
>gnl|CDD|219293 pfam07093, SGT1, SGT1 protein. This family consists of several
eukaryotic SGT1 proteins. Human SGT1 or hSGT1 is known to
suppress GCR2 and is highly expressed in the muscle and
heart. The function of this family is unknown although it
has been speculated that SGT1 may be functionally
analogous to the Gcr2p protein of Saccharomyces
cerevisiae which is known to be a regulatory factor of
glycolytic gene expression.
Length = 557
Score = 31.6 bits (72), Expect = 2.5
Identities = 19/85 (22%), Positives = 36/85 (42%), Gaps = 7/85 (8%)
Query: 1008 EMSEASNITDDEDEEDEEDSFDFDELFE------DNPEEEYDEDDRDQPINFARNRHNKY 1061
E + + +D ED++ SFD DE FE ++E D D D + A ++
Sbjct: 437 ADDEDEDDDEPDDSEDKDVSFDEDEFFEFLKNMLGLKDDEIDNDLPDDS-DDADEDDDED 495
Query: 1062 IEDDQEEIYHPKLMTMRSSQEDLDE 1086
++D++ L + + +D
Sbjct: 496 DDEDEDSSSDSTLEELEEYMDQMDA 520
>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457). This is a
family of uncharacterized proteins.
Length = 449
Score = 31.5 bits (71), Expect = 2.9
Identities = 18/55 (32%), Positives = 31/55 (56%)
Query: 994 MLLKRSKLKPQYSSEMSEASNITDDEDEEDEEDSFDFDELFEDNPEEEYDEDDRD 1048
+K+ + E E + +D+DEED++D D DE +D+ ++E DED+ D
Sbjct: 30 DTMKKENAIRKLGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDD 84
>gnl|CDD|197329 cd09095, INPP5c_INPP5E-like, Catalytic inositol polyphosphate
5-phosphatase (INPP5c) domain of Inositol
polyphosphate-5-phosphatase E and related proteins.
INPP5c domain of Inositol polyphosphate-5-phosphatase E
(also called type IV or 72 kDa 5-phosphatase), rat
pharbin, and related proteins. This subfamily belongs to
a family of Mg2+-dependent inositol polyphosphate
5-phosphatases, which hydrolyze the 5-phosphate from the
inositol ring of various 5-position phosphorylated
phosphoinositides (PIs) and inositol phosphates (IPs),
and to the large EEP
(exonuclease/endonuclease/phosphatase) superfamily that
contains functionally diverse enzymes that share a common
catalytic mechanism of cleaving phosphodiester bonds.
INPP5E hydrolyzes the 5-phosphate from PI(3,5)P2,
PI(4,5)P2 and PI(3,4,5)P3, forming PI3P, PI4P, and
PI(3,4)P2, respectively. It is a very potent PI(3,4,5)P3
5-phosphatase. Its intracellular localization is chiefly
cytosolic, with pronounced perinuclear/Golgi
localization. INPP5E also has an N-terminal proline rich
domain (PRD) and a C-terminal CAAX motif. This protein is
expressed in a variety of tissues, including the breast,
brain, testis, and haemopoietic cells. It is
differentially expressed in several cancers, for example,
it is up-regulated in cervical cancer and down-regulated
in stomach cancer. It is a candidate target for
therapeutics of obesity and related disorders, as it is
expressed in the hypothalamus, and following insulin
stimulation, it undergoes tyrosine phosphorylation,
associates with insulin receptor substrate-1, -2, and
PI3-kinase, and becomes active as a 5-phosphatase. INPP5E
may play a role, along with other 5-phosphatases SHIP2
and SKIP, in regulating glucose homoeostasis and energy
metabolism. Mice deficient in INPP5E develop a
multi-organ disorder associated with structural defects
of the primary cilium.
Length = 298
Score = 31.2 bits (71), Expect = 3.0
Identities = 19/72 (26%), Positives = 30/72 (41%), Gaps = 4/72 (5%)
Query: 942 KVIKDDEYYENLGDVLTKKYSLPVNSDIQIKIDKPDDEPDYVIKGKYEVDNEMLLKRSKL 1001
+ + + Y GDV T+ + D ++ P D +I EVD LL+
Sbjct: 156 RNVPTNPYKSESGDVTTRFDEVFWFGDFNFRLSGPRHLVDALINQGQEVDVSALLQ---- 211
Query: 1002 KPQYSSEMSEAS 1013
Q + EMS+ S
Sbjct: 212 HDQLTREMSKGS 223
>gnl|CDD|220284 pfam09538, FYDLN_acid, Protein of unknown function (FYDLN_acid).
Members of this family are bacterial proteins with a
conserved motif [KR]FYDLN, sometimes flanked by a pair of
CXXC motifs, followed by a long region of low complexity
sequence in which roughly half the residues are Asp and
Glu, including multiple runs of five or more acidic
residues. The function of members of this family is
unknown.
Length = 104
Score = 29.2 bits (66), Expect = 3.2
Identities = 14/58 (24%), Positives = 29/58 (50%), Gaps = 14/58 (24%)
Query: 1011 EASNITDDEDEEDEEDSFDFDELFEDNPEEEYDEDDRDQPINFARNRHNKYIEDDQEE 1068
E + +D++D++D D +L +D+ + + D+DD ++EDD +E
Sbjct: 61 EDEDDVVLDDDDDDDDDDDLPDLDDDDVDLDDDDDD--------------FLEDDDDE 104
>gnl|CDD|220149 pfam09234, DUF1963, Domain of unknown function (DUF1963). This
domain is found in a set of hypothetical bacterial
proteins. Its exact function has not, as yet, been
described.
Length = 221
Score = 30.8 bits (70), Expect = 3.5
Identities = 20/91 (21%), Positives = 34/91 (37%), Gaps = 9/91 (9%)
Query: 965 VNSDIQIKIDKPDDEPDYVIKGKYEVDNEMLLKRSKLKPQYSSEM----SEASNITDDED 1020
++ D D P+D+ + + Y D L L +S E + D
Sbjct: 52 IDLDDDDWGDSPEDQTGFRVI--YFEDIIEDLLPKDLIEDFSFLKAPFEGELKLPFEKSD 109
Query: 1021 EEDEEDSFDFDELFEDNP---EEEYDEDDRD 1048
E ED + F++ +E EEE +E +
Sbjct: 110 EPISEDDYSFEQEYESEILELEEEDEELIEE 140
>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin. Nucleoplasmins are also
known as chromatin decondensation proteins. They bind to
core histones and transfer DNA to them in a reaction that
requires ATP. This is thought to play a role in the
assembly of regular nucleosomal arrays.
Length = 146
Score = 30.0 bits (68), Expect = 3.5
Identities = 13/25 (52%), Positives = 17/25 (68%)
Query: 1017 DDEDEEDEEDSFDFDELFEDNPEEE 1041
D+DEEDEE+ D ++ ED EEE
Sbjct: 114 SDDDEEDEEEEDDEEDDDEDESEEE 138
>gnl|CDD|218391 pfam05029, TIMELESS_C, Timeless protein C terminal region. The
timeless (tim) gene is essential for circadian function
in Drosophila. Putative homologues of Drosophila tim have
been identified in both mice and humans (mTim and hTIM,
respectively). Mammalian TIM is not the true orthologue
of Drosophila TIM, but is the likely orthologue of a fly
gene, timeout (also called tim-2). mTim has been shown to
be essential for embryonic development, but does not have
substantiated circadian function. Some family members
contain a SANT domain in this region.
Length = 507
Score = 31.2 bits (70), Expect = 3.5
Identities = 19/83 (22%), Positives = 30/83 (36%), Gaps = 4/83 (4%)
Query: 989 EVDNEMLLKRSKLKPQYSSEMSEASNITDDEDEEDEEDSFDFDELFEDNPEEEYDEDDR- 1047
E E K K Q + + + + + DEE ++ S D D D+ D+
Sbjct: 425 EALGEEEQKAPPKKKQLNQKNKQQTGSGTNSDEERDDTSLDEDRDLADDGGLPRIHKDKR 484
Query: 1048 ---DQPINFARNRHNKYIEDDQE 1067
+ R K +EDD E
Sbjct: 485 AGASLTQSPLSRRRLKVVEDDDE 507
>gnl|CDD|214441 MTH00157, ATP6, ATP synthase F0 subunit 6; Provisional.
Length = 223
Score = 30.5 bits (70), Expect = 3.7
Identities = 12/25 (48%), Positives = 15/25 (60%), Gaps = 5/25 (20%)
Query: 824 QSTSPMLAAFMLLMLFTFIETISSF 848
Q T P+L FM+L IETIS+
Sbjct: 131 QGTPPILMPFMVL-----IETISNL 150
>gnl|CDD|165173 PHA02826, PHA02826, IL-1 receptor-like protein; Provisional.
Length = 227
Score = 30.3 bits (68), Expect = 4.2
Identities = 19/57 (33%), Positives = 28/57 (49%), Gaps = 5/57 (8%)
Query: 484 VSWYKDGFEIFSSRRQRIVTDNDISTLIIHQAALMDEGEIKCT---ATNRAGHSITK 537
++WYK+G + + RI N+ STL+I A D G C N ++ITK
Sbjct: 166 LTWYKNGNIVLYT--DRIQLRNNNSTLVIKSATHDDSGIYTCNLRFNKNSNNYNITK 220
>gnl|CDD|233148 TIGR00844, c_cpa1, na(+)/h(+) antiporter. The Monovalent
Cation:Proton Antiporter-1 (CPA1) Family (TC 2.A.36) The
CPA1 family is a large family of proteins derived from
Gram-positive and Gram-negative bacteria, blue green
bacteria, yeast, plants and animals. Transporters from
eukaryotes have been functionally characterized, and all
of these catalyze Na+:H+ exchange. Their primary
physiological functions may be in (1) cytoplasmic pH
regulation, extruding the H+ generated during metabolism,
and (2) salt tolerance (in plants), due to Na+ uptake
into vacuoles. This model is specific for the fungal
members of this family [Transport and binding proteins,
Cations and iron carrying compounds].
Length = 810
Score = 31.0 bits (70), Expect = 4.3
Identities = 48/252 (19%), Positives = 93/252 (36%), Gaps = 63/252 (25%)
Query: 842 IETISSFRDKYVDNEDDYDI--VETNEHTGTGAPSDNENESDYFPEKTIDESVYGYDTIV 899
+ TI DK + ++ D+ V T+ + G + D+
Sbjct: 582 VNTIYGL-DKLARDTENRDVTYVPTSRYDGIESEIDDVYT-------------------- 620
Query: 900 YGYDSDDLD----RHYPTLDEEEEE------EDRESLVKDRESSV-KGKEEEAKVIKDDE 948
Y DS+ + R L EEE++ ED + ++++R+ + + + + +D E
Sbjct: 621 YENDSESIASSERRRIKKLREEEQQAYIAYTEDNQVIIENRQGEILEYVDIHDRGARDAE 680
Query: 949 YYENLGDVLTKKYSLPVNSDIQIKIDKPDDEPDYVIKGKYEVDNEMLLK----------- 997
+ G L + S P+ QI ++ Y Y+V N+++++
Sbjct: 681 VGVHNGGRLKRALSPPLEKLHQI-TNEAKKSKYYA----YKVGNDLIIEDESGEVFRRYR 735
Query: 998 ------RSKLKPQYSSEMS--EASNITDDED--EEDEEDSFDF-DELFEDNPEEEYDED- 1045
+ K+K + S +S E I E D DE+ +D E E +D
Sbjct: 736 ISPHGGKRKIKKRNDSVVSVDEEKAIEGPSRVPERGNHDLLHSEDEMADDEAESENMDDY 795
Query: 1046 -DRDQPINFARN 1056
D D +++
Sbjct: 796 EDSDDNAYESKD 807
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 30.9 bits (70), Expect = 4.9
Identities = 15/72 (20%), Positives = 35/72 (48%), Gaps = 2/72 (2%)
Query: 1015 ITDDEDEEDEEDSFDFDELFEDNPEEEYDEDDRDQPINFARNRHNKYIEDDQEEIYHPKL 1074
+ D+E+EE++E+ + +E D EE +E+++++ + E E + K
Sbjct: 35 VPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEW--ELLNKTKP 92
Query: 1075 MTMRSSQEDLDE 1086
+ R+ ++ E
Sbjct: 93 IWTRNPKDVTKE 104
>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT. This family
consists of several bacterial cobalamin biosynthesis
(CobT) proteins. CobT is involved in the transformation
of precorrin-3 into cobyrinic acid.
Length = 282
Score = 30.6 bits (69), Expect = 4.9
Identities = 18/84 (21%), Positives = 37/84 (44%), Gaps = 13/84 (15%)
Query: 1007 SEMSEASNITDDEDEEDEEDSFDFDELFEDNPEEEYDED---DRDQPINFARNRHNKYIE 1063
S M A + D+ + D ED+ D ED+P+E+ D+D + + + + + +
Sbjct: 204 SSMDMAEELGDEPESADSEDNED-----EDDPKEDEDDDQGEEEESGSSDSLSEDSDASS 258
Query: 1064 DDQEEIYHPKLMTM-RSSQEDLDE 1086
++ E M +S +D +
Sbjct: 259 EEMESGE----MEAAEASADDTPD 278
>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
with the myosin-like domain [Function unknown].
Length = 499
Score = 30.8 bits (69), Expect = 5.0
Identities = 39/216 (18%), Positives = 65/216 (30%), Gaps = 17/216 (7%)
Query: 112 AIQTLYKAKLLMKRDRAAYTELKQACVSVQQRWRANLTMRKQRAHFLLMKQKASVIQQWY 171
+++ + + RA TEL A ++ A R+ +Q+ ++Q
Sbjct: 69 LRSGVFQLDDIRPQLRALRTELGTA---QGEKRAAETEREAARSELQKARQEREAVRQ-- 123
Query: 172 RNTKLMRLEASYLHELKAATITIQRRYRANVAMRTQRERYVALRTATITIQTRFRAYLIA 231
+ A EL T Q + QR + A + Q + +A
Sbjct: 124 ELAAARQNLAKAQQELARLTKQAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQLQASATQ 183
Query: 232 KNQRDEYAELKQARRFRFKLNLRKYERVIELLKLKREQERQEKYRHQCAVKIQSLWKMYR 291
+ +L+ A+ + NL + E R+ Q A IQ
Sbjct: 184 LKSQVLDLKLRSAQIEQEAQNLATRANAAQ--ARTEELARRAAAAQQTAQAIQQR----- 236
Query: 292 VRKKFADIIEQKKQAKKTADNQFENQAPLYVRLEEA 327
I QK Q Q + RLE A
Sbjct: 237 -----DAQISQKAQQIAARAEQIRERERQLQRLETA 267
>gnl|CDD|143259 cd05851, Ig3_Contactin-1, Third Ig domain of contactin-1.
Ig3_Contactin-1: Third Ig domain of the neural cell
adhesion molecule contactin-1. Contactins are comprised
of six Ig domains followed by four fibronectin type III
(FnIII) domains anchored to the membrane by
glycosylphosphatidylinositol. Contactin-1 is
differentially expressed in tumor tissues and may,
through a RhoA mechanism, facilitate invasion and
metastasis of human lung adenocarcinoma.
Length = 88
Score = 28.4 bits (63), Expect = 5.3
Identities = 23/87 (26%), Positives = 34/87 (39%), Gaps = 12/87 (13%)
Query: 461 DTTALEDEKVEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRIVTDNDIST----LIIHQAA 516
DT AL+ + V G P P + W K + + +IS L I
Sbjct: 10 DTYALKGQNVTLECFALGNPVPVIRWRK--------ILEPMPATAEISMSGAVLKIFNIQ 61
Query: 517 LMDEGEIKCTATNRAGHSITKARLRLE 543
DEG +C A N G +AR+ ++
Sbjct: 62 PEDEGTYECEAENIKGKDKHQARVYVQ 88
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 30.8 bits (70), Expect = 5.3
Identities = 32/149 (21%), Positives = 54/149 (36%), Gaps = 16/149 (10%)
Query: 905 DDLDRHYPTLDEEEEEEDRESLVKDRESS-----------VKGKEEEAKVIKDDEYYENL 953
+DLD+ L+E+EE E++E + R S K K++E K K
Sbjct: 1132 EDLDKFEEALEEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSAD--KS 1189
Query: 954 GDVLTKKYSLPVNSDIQIKIDKPDDEPDYVIKGKYEVDNEMLLKRSKLKPQYSSEMSEAS 1013
S V+SD + K+D D G + D+E + K + + +
Sbjct: 1190 KKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNN 1249
Query: 1014 NITDDEDEEDEEDSFDFDELFEDNPEEEY 1042
+ ED ++ D E P+
Sbjct: 1250 ---SSKSSEDNDEFSSDDLSKEGKPKNAP 1275
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein. CDC45 is an essential gene
required for initiation of DNA replication in S.
cerevisiae, forming a complex with MCM5/CDC46. Homologues
of CDC45 have been identified in human, mouse and smut
fungus among others.
Length = 583
Score = 30.7 bits (70), Expect = 5.6
Identities = 13/53 (24%), Positives = 29/53 (54%)
Query: 1017 DDEDEEDEEDSFDFDELFEDNPEEEYDEDDRDQPINFARNRHNKYIEDDQEEI 1069
DDE+ ++E++ E ED+ +++ D+D + + R R + E+ + E+
Sbjct: 129 DDEESDEEDEESSKSEDDEDDDDDDDDDDIATRERSLERRRRRREWEEKRAEL 181
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part of
a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 30.4 bits (69), Expect = 5.8
Identities = 32/198 (16%), Positives = 65/198 (32%), Gaps = 28/198 (14%)
Query: 915 DEEEEEEDRESLVKDRESSVKGKEEEAKVIKDDEYYENLGDVLTKKYSLPVNSDIQIKID 974
D E EE RE ++ + +E K + ++ G+ + L + + K
Sbjct: 391 DAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEK 450
Query: 975 KPDDEPDY---VIKGKYEVDNEMLLKRSKLK-------------PQYSSEMSEASNITDD 1018
K DE + + K E LLKRS+ P + S +
Sbjct: 451 KESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQ 510
Query: 1019 EDEEDEEDSFDFDELFEDN---------PEEEYDEDDRDQPINFARNRHNKYIEDDQEEI 1069
+ ++ D +E+ + D D + + E+D+++
Sbjct: 511 DSKKKSSSKLDKAANKISKAAVKVKKKKKKEKSIDLDDDLIDEEDSIKLDVDDEEDEDDE 570
Query: 1070 YHPKLMTMRSSQEDLDEA 1087
+L + ++ + EA
Sbjct: 571 ---ELPFLFKQKDLIKEA 585
>gnl|CDD|217829 pfam03985, Paf1, Paf1. Members of this family are components of the
RNA polymerase II associated Paf1 complex. The Paf1
complex functions during the elongation phase of
transcription in conjunction with Spt4-Spt5 and
Spt16-Pob3.
Length = 431
Score = 30.5 bits (69), Expect = 5.9
Identities = 36/167 (21%), Positives = 64/167 (38%), Gaps = 36/167 (21%)
Query: 904 SDDLDRHYPTLDEEEEEED------RESLVKDRESSVKGKEEEAKVIKDDE---YYENL- 953
D L++ L + +E+E+ RE +K + + K E + D+ YY+ L
Sbjct: 255 EDTLEKRSDDLHDYDEDEEYKFKRVREYDMKVKSKATKLNELALFFVSDENGVVYYKPLR 314
Query: 954 ----------GDVLTKKYSLPVNSDIQIKIDKPDDEPDYVIKGKYEVDNEMLLKRSKLKP 1003
DV+ N + +K+ P + +R++L P
Sbjct: 315 SRVELRRRRVNDVIRPLVREHNNDQLNVKLRNPSTK----------ESKMRDKRRARLDP 364
Query: 1004 QYSSEMSEASNITDDEDEEDEEDSFDFDELFEDNPEEEYDEDDRDQP 1050
E+ E DEDEE+E+ S + +E ++ EEE + D
Sbjct: 365 IDFEEVDE------DEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGS 405
>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM). This
family consists of several Plasmodium falciparum SPAM
(secreted polymorphic antigen associated with
merozoites) proteins. Variation among SPAM alleles is
the result of deletions and amino acid substitutions in
non-repetitive sequences within and flanking the alanine
heptad-repeat domain. Heptad repeats in which the a and
d position contain hydrophobic residues generate
amphipathic alpha-helices which give rise to helical
bundles or coiled-coil structures in proteins. SPAM is
an example of a P. falciparum antigen in which a
repetitive sequence has features characteristic of a
well-defined structural element.
Length = 164
Score = 29.4 bits (66), Expect = 6.4
Identities = 24/99 (24%), Positives = 36/99 (36%), Gaps = 13/99 (13%)
Query: 852 YVDNEDDYDIVETNEHTGTGAPSDNENESDYFPEKTIDESVYGYDTIVYGYDSDDLDRHY 911
+ + D DI++ NE D+E E + E+ + D D
Sbjct: 27 KITSWDKEDIIKENEDVKDEKQEDDEEEEEEDEEEIEEPE-------------DIEDEEE 73
Query: 912 PTLDEEEEEEDRESLVKDRESSVKGKEEEAKVIKDDEYY 950
DEEEEEED E V ++ K + +DD
Sbjct: 74 IVEDEEEEEEDEEDNVDLKDIEKKNINDIFNSTQDDNAQ 112
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit of
transcription factor IIF (TFIIF), which is essential for
accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 30.3 bits (68), Expect = 6.6
Identities = 41/185 (22%), Positives = 62/185 (33%), Gaps = 28/185 (15%)
Query: 849 RDKYVDNEDDYDIVETNEHTGTGAPSDNENESDYFPEKTID---ESVYGYDTIVYGYDSD 905
+D D+EDD D + G S + + +K +D + G D YDSD
Sbjct: 216 KDLEGDDEDDGDESDKGGEDGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDSD 275
Query: 906 DLDRHYPTLDEEEEEEDRESLVKDRESSVKGKEEEAKVIKDDEYYENLGDVLTKKYSLPV 965
D D+E EED S D +S EE + + P
Sbjct: 276 D-------GDDEGREEDYIS---DSSASGNDPEEREDKLSPEI---------------PA 310
Query: 966 NSDIQIKIDKPDDEPDYVIKGKYEVDNEMLLKRSKLKPQYSSEMSEASNITDDEDEEDEE 1025
+I+ D + E + + LK+ K K + S D+ + D E
Sbjct: 311 KPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGDDSDDSDIDGE 370
Query: 1026 DSFDF 1030
DS
Sbjct: 371 DSVSL 375
>gnl|CDD|187811 cd09680, Cas10_III, CRISPR/Cas system-associated protein Cas10.
CRISPR (Clustered Regularly Interspaced Short Palindromic
Repeats) and associated Cas proteins comprise a system
for heritable host defense by prokaryotic cells against
phage and other foreign DNA; Multidomain protein with
permuted HD nuclease domain, palm domain and Zn-ribbon;
signature gene for type III; also known as Csm1 family.
Length = 650
Score = 30.4 bits (69), Expect = 6.7
Identities = 24/105 (22%), Positives = 35/105 (33%), Gaps = 1/105 (0%)
Query: 902 YDSDDLDRHYPTLDEEEEEEDRESLVKDRESSVKGKEEEAKVIKDDEYYENLGDVLTKKY 961
Y +L P +E + RE V RE ++E+ + E LG L K
Sbjct: 372 YSYLELAALNPRDSKEGSKGTRECKVCGREEP-IAEDEDEGLCPTCERLYELGKELLKDD 430
Query: 962 SLPVNSDIQIKIDKPDDEPDYVIKGKYEVDNEMLLKRSKLKPQYS 1006
S V P Y++ + E L +L YS
Sbjct: 431 SFLVTEKEDGGKKLPKFNGYYLLFAYEADEYEELALEDELVRIYS 475
>gnl|CDD|184468 PRK14035, PRK14035, citrate synthase; Provisional.
Length = 371
Score = 30.1 bits (68), Expect = 6.7
Identities = 10/31 (32%), Positives = 17/31 (54%)
Query: 167 IQQWYRNTKLMRLEASYLHELKAATITIQRR 197
I + Y++ ++MR A Y+ E I I+ R
Sbjct: 341 ILEQYKDNRIMRPRAKYIGETNRKYIPIEER 371
>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon.
Length = 431
Score = 30.0 bits (67), Expect = 6.9
Identities = 37/184 (20%), Positives = 64/184 (34%), Gaps = 15/184 (8%)
Query: 912 PTLDEEEEEEDRESLVKDRESSVKGKEEEAKVIKDDE-------YYENLGDVLTKKYSLP 964
T++EEE+EE RE + E+ K E+ +D E E + K+ SL
Sbjct: 109 ETVEEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEKPKRGSLE 168
Query: 965 VNSDIQIKIDKPDDEPDYVIKGKYEVDNEMLLKRSKLKPQYSSEMSEASNITDDEDEEDE 1024
N+ + E + G E + KLK + E ++ ++ E
Sbjct: 169 ENNGEFMTHKLKHTENTFSRGGAEGAQVEAGKEFEKLKQKQQEAALEL----EELKKKRE 224
Query: 1025 EDSFDFDELFEDNPEEEYDEDDRDQPINFARNRHNKYIEDDQEEIYHPKLMTMRSSQEDL 1084
E +E + +EE D R++ R K + + K +
Sbjct: 225 ERRKVLEEEEQRRKQEEADRKSREE----EEKRRLKEEIERRRAEAAEKRQKVPEDGLSE 280
Query: 1085 DEAP 1088
D+ P
Sbjct: 281 DKKP 284
>gnl|CDD|129705 TIGR00618, sbcc, exonuclease SbcC. All proteins in this family for
which functions are known are part of an exonuclease
complex with sbcD homologs. This complex is involved in
the initiation of recombination to regulate the levels
of palindromic sequences in DNA. This family is based on
the phylogenomic analysis of JA Eisen (1999, Ph.D.
Thesis, Stanford University) [DNA metabolism, DNA
replication, recombination, and repair].
Length = 1042
Score = 30.3 bits (68), Expect = 7.4
Identities = 43/295 (14%), Positives = 88/295 (29%), Gaps = 24/295 (8%)
Query: 38 IRSKTIVIQKYFRGYLLMRKERQEYLAMKSSAVKIQEWYRNLQCMRQARQ--QYLALKHA 95
+++ T+ +Q L E + K+Q +Q Q LALK
Sbjct: 589 LQNITVRLQDL--TEKLSEAEDMLACEQHALLRKLQPEQDLQDVRLHLQQCSQELALKLT 646
Query: 96 TLKQREEFLKLKHATIAIQTLYKAKLLMKRDRAAYTELKQACVSVQQRWRANLTMRKQ-- 153
L + T+ + + + L ++ +Q + Q + LT K+
Sbjct: 647 ALHALQL-------TLTQERVREHALSIRVLPKELLASRQLALQKMQSEKEQLTYWKEML 699
Query: 154 -RAHFLLMKQKASVIQQWYRNTKLMRLEASYLHELKAATITIQRRYRANVAMRTQRERYV 212
+ LL + + + + ++ +S +L A + + +
Sbjct: 700 AQCQTLLRELETHIEEYDREFNEIENASSSLGSDLAAREDALNQ-----SLKELMHQART 754
Query: 213 ALRTATITIQTRFRAYLIAKNQRDEYAELKQARRFRFKLNLRKYERVIELLKLKREQERQ 272
L+ T A E + L +F R E LLK + Q
Sbjct: 755 VLKARTEAHFNNNEEVTAALQTGAELSHLAAEIQFF----NRLREEDTHLLKTLEAEIGQ 810
Query: 273 EKYRHQCAVKIQSLWKMYRVRKKFADIIEQKKQAKKTADNQFENQAPLYVRLEEA 327
E + + + ++F +E+K +Q +L +
Sbjct: 811 EI-PSDEDILNLQCETLVQEEEQFLSRLEEKSATLGEITHQLLKYEECSKQLAQL 864
>gnl|CDD|143234 cd05757, Ig2_IL1R_like, Second immunoglobulin (Ig)-like domain of
interleukin-1 receptor (IL1R) and similar proteins.
Ig2_IL1R_like: domain similar to the second
immunoglobulin (Ig)-like domain of interleukin-1
receptor (IL1R). IL-1 alpha and IL-1 beta are cytokines
which participate in the regulation of inflammation,
immune responses, and hematopoiesis. These cytokines
bind to the IL-1 receptor type 1 (IL1R1), which is
activated on additional association with an accessory
protein, IL1RAP. IL-1 also binds a second receptor
designated type II (IL1R2). Mature IL1R1 consists of
three IG-like domains, a transmembrane domain, and a
large cytoplasmic domain. Mature IL1R2 is organized
similarly except that it has a short cytoplasmic domain.
The latter does not initiate signal transduction. A
naturally occurring cytokine IL-1RA (IL-1 receptor
antagonist) is widely expressed and binds to IL-1
receptors, inhibiting the binding of IL-1 alpha and IL-1
beta. This group also contains IL1R-like 1 (IL1RL1)
which maps to the same chromosomal location as IL1R1 and
IL1R2.
Length = 92
Score = 28.1 bits (63), Expect = 7.4
Identities = 13/48 (27%), Positives = 18/48 (37%), Gaps = 4/48 (8%)
Query: 481 TPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIHQAALMDEGEIKCTAT 528
P V WYKD + R++ + L+I D G C T
Sbjct: 29 LPPVQWYKDCKLLEGDRKRFVKGS----KLLIQNVTEEDAGNYTCKLT 72
>gnl|CDD|220785 pfam10498, IFT57, Intra-flagellar transport protein 57. Eukaryotic
cilia and flagella are specialised organelles found at
the periphery of cells of diverse organisms.
Intra-flagellar transport (IFT) is required for the
assembly and maintenance of eukaryotic cilia and
flagella, and consists of the bidirectional movement of
large protein particles between the base and the distal
tip of the organelle. IFT particles contain multiple
copies of two distinct protein complexes, A and B, which
contain at least 6 and 11 protein subunits. IFT57 is part
of complex B but is not, however, required for the core
subunits to stay associated. This protein is known as
Huntington-interacting protein-1 in humans.
Length = 355
Score = 30.1 bits (68), Expect = 7.6
Identities = 15/55 (27%), Positives = 25/55 (45%)
Query: 994 MLLKRSKLKPQYSSEMSEASNITDDEDEEDEEDSFDFDELFEDNPEEEYDEDDRD 1048
L + K +S + + N D+E+ DE+D+ E E+ E E +DD
Sbjct: 108 DLADAALKKKGFSFKRPKYPNEEDEEENVDEDDAEIILEEVEEEVEIEEVDDDEG 162
>gnl|CDD|143169 cd04968, Ig3_Contactin_like, Third Ig domain of contactin.
Ig3_Contactin_like: Third Ig domain of contactins.
Contactins are neural cell adhesion molecules and are
comprised of six Ig domains followed by four fibronectin
type III (FnIII) domains anchored to the membrane by
glycosylphosphatidylinositol. The first four Ig domains
form the intermolecular binding fragment, which arranges
as a compact U-shaped module via contacts between Ig
domains 1 and 4, and between Ig domains 2 and 3.
Contactin-2 (TAG-1, axonin-1) may play a part in the
neuronal processes of neurite outgrowth, axon guidance
and fasciculation, and neuronal migration. This group
also includes contactin-1 and contactin-5. The different
contactins show different expression patterns in the
central nervous system. During development and in
adulthood, contactin-2 is transiently expressed in
subsets of central and peripheral neurons. Contactin-5
is expressed specifically in the rat postnatal nervous
system, peaking at about 3 weeks postnatal, and a lack
of contactin-5 (NB-2) results in an impairment of
neuronal activity in the rat auditory system.
Contactin-5 is highly expressed in the adult human brain
in the occipital lobe and in the amygdala. Contactin-1
is differentially expressed in tumor tissues and may,
through a RhoA mechanism, facilitate invasion and
metastasis of human lung adenocarcinoma.
Length = 88
Score = 27.8 bits (62), Expect = 7.7
Identities = 22/80 (27%), Positives = 31/80 (38%), Gaps = 4/80 (5%)
Query: 461 DTTALEDEKVEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIHQAALMDE 520
DT AL+ + V G P P++ W K + SS + L I DE
Sbjct: 10 DTYALKGQNVTLECFALGNPVPQIKWRKVDGSMPSSA----EISMSGAVLKIPNIQFEDE 65
Query: 521 GEIKCTATNRAGHSITKARL 540
G +C A N G + R+
Sbjct: 66 GTYECEAENIKGKDTHQGRI 85
>gnl|CDD|204467 pfam10376, Mei5, Double-strand recombination repair protein. Mei5 is
one of a pair of meiosis-specific proteins which
facilitate the loading of Dmc1 on to Rad51 on DNA at
double-strand breaks during recombination. Recombination
is carried out by a large protein complex based around
the two RecA homologues, Rad51 and Dmc1. This complex may
play both a catalytic and a structural role in the
interaction between homologous chromosomes during
meiosis. Mei5 is seen to contain a coiled-coil region.
Length = 212
Score = 29.4 bits (66), Expect = 8.2
Identities = 23/120 (19%), Positives = 47/120 (39%), Gaps = 12/120 (10%)
Query: 918 EEEEDRESLVKDRESSVKGKEEEAKVIKDDEYYENLGDV--LTKKYSLPVNSDIQIKIDK 975
E E +++ + ESS+K + E +++ E + + L + KI++
Sbjct: 59 ENFELDQAVSEPPESSLKNIDSEENETSNEKLIEKWRTICQSESRSIL---NSSSPKINR 115
Query: 976 PDDEPDYVIKGKYEVDNEMLLKRSKLKPQYSSEMSEASNITDDEDEEDEEDSFDFDELFE 1035
D+ K E+ ++ KL+ Q E + + E E + D + EL +
Sbjct: 116 MGGYKDFKRK-------ELEAEKRKLEYQVDEESDDLRRLKLVEKYEIKNDLSELQELIK 168
>gnl|CDD|218584 pfam05422, SIN1, Stress-activated map kinase interacting protein 1
(SIN1). This family consists of several stress-activated
map kinase interacting protein 1 (MAPKAP1 or SIN1)
sequences. The fission yeast Sty1/Spc1 mitogen-activated
protein (MAP) kinase is a member of the eukaryotic
stress-activated MAP kinase (SAPK) family. Sin1 interacts
with Sty1/Spc1. Cells lacking Sin1 display many, but not
all, of the phenotypes of cells lacking the Sty1/Spc1 MAP
kinase including sterility, multiple stress sensitivity
and a cell-cycle delay. Sin1 is phosphorylated after
stress but this is not Sty1/Spc1-dependent.
Length = 482
Score = 30.0 bits (67), Expect = 8.4
Identities = 27/136 (19%), Positives = 48/136 (35%), Gaps = 7/136 (5%)
Query: 871 GAPSD----NENESDYFPEKTIDESVYGYDTI--VYGYDSDDLDRHYPTLDEEEEEEDRE 924
GA +SDY ++S G D ++ Y + R T E E +
Sbjct: 51 GAGGQVRHSRAEDSDYATSDLSEDSDVGDDDSSDIFSYSEVPIHRRSNTAQELERLDQAV 110
Query: 925 SLVKDRESSVKGKEEEAKVIKDDEYYENLGDVLTKKYSLPVNSDIQIKIDKPDDEPDYVI 984
+L ++S++K K + D L + +KK LP+ + + + I
Sbjct: 111 NLTSAKQSAIKIKSSVSTDYDDLRSISELDFLFSKK-ELPLTTHNTVNKARSVSNAKAPI 169
Query: 985 KGKYEVDNEMLLKRSK 1000
G + L + S
Sbjct: 170 SGLQSLLEHKLEENSS 185
>gnl|CDD|216760 pfam01881, Cas_Cas6, CRISPR associated protein Cas6. This group of
families is one of several protein families that are
always found associated with prokaryotic CRISPRs,
themselves a family of clustered regularly interspaced
short palindromic repeats, DNA repeats found in nearly
half of all bacterial and archaeal genomes. These DNA
repeat regions have a remarkably regular structure:
unique sequences of constant size, called spacers, sit
between each pair of repeats. It has been shown that the
CRISPRs are virus-derived sequences acquired by the host
to enable them to resist viral infection. The Cas
proteins from the host use the CRISPRs to mediate an
antiviral response. After transcription of the CRISPR, a
complex of Cas proteins termed Cascade cleaves a CRISPR
RNA precursor in each repeat and retains the cleavage
products containing the virus-derived sequence. Assisted
by the helicase Cas3, these mature CRISPR RNAs then
serve as small guide RNAs that enable Cascade to
interfere with virus proliferation. Cas5 contains an
endonuclease motif, whose inactivation leads to loss of
resistance, even in the presence of phage-derived
spacers.
Length = 152
Score = 28.8 bits (65), Expect = 8.8
Identities = 14/71 (19%), Positives = 25/71 (35%), Gaps = 19/71 (26%)
Query: 944 IKDDEYYENLGDVLTKKYSL----PVNSDIQIKIDKPD----------DEPDYVIKG--- 986
D+E+ E L + L KKY + + + + + I+G
Sbjct: 60 PDDEEFEELLKENLIKKYEAFYGEKPEKEFKFEPLVFKKKVVKHKRIKIKKNTYIRGYLG 119
Query: 987 --KYEVDNEML 995
+ E D E+L
Sbjct: 120 KFRLEGDPELL 130
>gnl|CDD|227931 COG5644, COG5644, Uncharacterized conserved protein [Function
unknown].
Length = 869
Score = 30.1 bits (67), Expect = 9.0
Identities = 28/162 (17%), Positives = 53/162 (32%), Gaps = 31/162 (19%)
Query: 900 YGYDSDDLDRHYPTLDEEEEEEDRESLVKDRESSVKGKEEEAKVIKDDEYYENLGDVLTK 959
Y + + D DE +EED + ++ K + + ++ L
Sbjct: 44 YSFGVNSEDDEEIDSDEAFDEEDEKRFADWSFNASKSGKS------NKDH-----KNLNN 92
Query: 960 KYSLPVN-SDIQIKIDKPDDEPDYVIKGKYEV-------------------DNEMLLKRS 999
+ +N SD + DK ++E + E+ ++ K +
Sbjct: 93 TKEISLNDSDDSVNSDKLENEGSVSSIDENELVDLDTLLDNDQPEKNESGNNDHATDKEN 152
Query: 1000 KLKPQYSSEMSEASNITDDEDEEDEEDSFDFDELFEDNPEEE 1041
L+ SS S +D E E + DS DE + +
Sbjct: 153 LLESDASSSNDSESEESDSESEIESSDSDHDDENSDSKLDNL 194
>gnl|CDD|173534 PTZ00341, PTZ00341, Ring-infected erythrocyte surface antigen;
Provisional.
Length = 1136
Score = 30.1 bits (67), Expect = 9.1
Identities = 41/220 (18%), Positives = 88/220 (40%), Gaps = 29/220 (13%)
Query: 850 DKYVDNEDDYDIVETNEHTGTGAPSDNENESDYFPEKTIDESVYGYDTIVYGYDSDDLDR 909
+K + N+++ EH D E + E+ ++E+V
Sbjct: 925 NKELKNQNENVPEHLKEHAEANIEEDAEENVEEDAEENVEENV----------------- 967
Query: 910 HYPTLDEEEEEEDRESLVKDRESSVKGKEEEAKVIKDDEYYENLGDVLTKKYSLPVNSDI 969
+E EE E++ ++ E +V+ EE EN+ + + + V +I
Sbjct: 968 -----EENVEENVEENVEENVEENVEENVEE-------NVEENVEENIEENVEENVEENI 1015
Query: 970 QIKIDKPDDEPDYVIKGKYEVDNEMLLKRSKLKPQYSSEMSEASNITDDEDEEDEEDSFD 1029
+ +++ D+E ++ E +E ++ + + + E + NI + ++E EE +
Sbjct: 1016 EENVEEYDEENVEEVEENVEEYDEENVEEIEENAEENVEENIEENIEEYDEENVEEIEEN 1075
Query: 1030 FDELFEDNPEEEYDEDDRDQPINFARNRHNKYIEDDQEEI 1069
+E E+N EE +E+ + N N E+ +E
Sbjct: 1076 IEENIEENVEENVEENVEEIEENVEENVEENAEENAEENA 1115
>gnl|CDD|217861 pfam04050, Upf2, Up-frameshift suppressor 2. Transcripts harbouring
premature signals for translation termination are
recognised and rapidly degraded by eukaryotic cells
through a pathway known as nonsense-mediated mRNA decay.
In Saccharomyces cerevisiae, three trans-acting factors
(Upf1 to Upf3) are required for nonsense-mediated mRNA
decay.
Length = 171
Score = 28.9 bits (65), Expect = 9.1
Identities = 11/39 (28%), Positives = 22/39 (56%)
Query: 1010 SEASNITDDEDEEDEEDSFDFDELFEDNPEEEYDEDDRD 1048
S + + +DD +E++E D D+ D E + +D++D
Sbjct: 1 SGSESESDDGEEDEELPEEDEDDESSDEEEVDLPDDEQD 39
>gnl|CDD|219912 pfam08574, DUF1762, Protein of unknown function (DUF1762). This is a
family of proteins of unknown function. Yeast IWR1 is
known to interact with RNA polymerase II and deletion of
this protein results in hypersensitivity to the K1 killer
toxin.
Length = 77
Score = 27.4 bits (61), Expect = 9.2
Identities = 9/30 (30%), Positives = 17/30 (56%)
Query: 1017 DDEDEEDEEDSFDFDELFEDNPEEEYDEDD 1046
D++D+ D+ S D D E+ +Y +D+
Sbjct: 48 DEDDDADQVLSDDEDSNAENYYRNDYPDDE 77
>gnl|CDD|227472 COG5143, SNC1, Synaptobrevin/VAMP-like protein [Intracellular
trafficking and secretion].
Length = 190
Score = 29.3 bits (66), Expect = 9.2
Identities = 29/148 (19%), Positives = 48/148 (32%), Gaps = 23/148 (15%)
Query: 873 PSDNENESDYFPEKTIDESVYGYDTIVYGYDSD---DLDRHYPTLDEEEEEEDRESLVKD 929
S ES + + S IVY SD Y L+ E + S ++
Sbjct: 46 ASRASIESGDYFFHYLKMS----SGIVYVPISDKEYPNKLAYGYLNSIATEFLKSSALEQ 101
Query: 930 RESSVKGKEEEAKVIKDDEYYENLGDVLTKKYSLPVNSDIQIKIDKPDDEPDYVIKGKYE 989
G N+ V+ K Y P D K+D+ E + + +
Sbjct: 102 LIDDTVG-----------IMRVNIDKVIEKGYRDPSIQD---KLDQLQQELEETKRVLNK 147
Query: 990 VDNEMLLKRSKLKP--QYSSEMSEASNI 1015
++L + KL SS + +S +
Sbjct: 148 NIEKVLYRDEKLDLLVDLSSILLLSSKM 175
>gnl|CDD|214818 smart00784, SPT2, SPT2 chromatin protein. This entry includes the
Saccharomyces cerevisiae protein SPT2 which is a
chromatin protein involved in transcriptional regulation.
Length = 106
Score = 28.1 bits (63), Expect = 9.4
Identities = 20/56 (35%), Positives = 30/56 (53%), Gaps = 2/56 (3%)
Query: 1010 SEASNITDDEDEEDEEDSFDFDELFEDNPEEEYDEDDRDQPINFARNRHNKYIEDD 1065
+ DD DEE++ED DF E +D+ E++YD D+ N R R+ +DD
Sbjct: 6 ERSRRSRDDYDEEEDEDMDDFIE--DDDEEDDYDRDEIWAMFNKGRKRYAYRDDDD 59
>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
[Intracellular trafficking and secretion].
Length = 776
Score = 30.0 bits (67), Expect = 9.5
Identities = 20/100 (20%), Positives = 32/100 (32%), Gaps = 20/100 (20%)
Query: 970 QIKIDKPDDEPDYVIKGKYE------VDNEMLLKRSKLKPQYSSEMSEASNITDDEDEED 1023
+ ++ + DY++ D K + E SE +E+ ED
Sbjct: 339 EKWGNEEAERKDYILDSSSVPLEKQFDDILYFEKMEIE--NRNPEESE-----HEEEVED 391
Query: 1024 EEDSFDF------DELFEDNPEEEYDEDDRDQPINFARNR 1057
ED D D+ E N DE + + F R
Sbjct: 392 YEDENDHSKRICDDDELE-NHFRAADEKNSHLVVGFRNER 430
>gnl|CDD|221288 pfam11882, DUF3402, Domain of unknown function (DUF3402). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 350 to 473 amino acids in length. This domain is
found associated with pfam07923.
Length = 402
Score = 29.6 bits (67), Expect = 9.6
Identities = 13/51 (25%), Positives = 19/51 (37%), Gaps = 4/51 (7%)
Query: 1019 EDEEDEEDSFDFDELFEDNPEEEYDEDDRDQPINFARNRHNKYIEDDQEEI 1069
+ E D D E ++ E Y DQP + + + D EEI
Sbjct: 39 DTESLVGDPLDISESVKELKLEMYTSLAEDQP----KKEEIERLSTDSEEI 85
>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
Provisional.
Length = 2849
Score = 30.0 bits (67), Expect = 9.8
Identities = 27/77 (35%), Positives = 42/77 (54%), Gaps = 8/77 (10%)
Query: 982 YVIKGKYEV-DNEMLL-KRSKLKPQYSSEMSEASNIT---DDEDEEDEEDSFDFDELFED 1036
Y I GK E+ D +M++ KR + + +MS N DDEDE++++D + DE E+
Sbjct: 116 YPIHGKAEIGDLDMIIIKRRRARHLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEE 175
Query: 1037 NPEEEY---DEDDRDQP 1050
EE DED+ D+
Sbjct: 176 EEEEIKGFDDEDEEDEG 192
>gnl|CDD|224486 COG1570, XseA, Exonuclease VII, large subunit [DNA replication,
recombination, and repair].
Length = 440
Score = 29.5 bits (67), Expect = 9.9
Identities = 23/136 (16%), Positives = 46/136 (33%), Gaps = 6/136 (4%)
Query: 138 VSVQQRWRANLTMRKQRAHFLL---MKQKASVIQQWYRNTKLMRLE---ASYLHELKAAT 191
V L ++R H L + QK ++ R + E + L
Sbjct: 260 VPDSAELLQQLDQLQRRLHRALRRLLDQKKQRLEHLARRLQFRSPERLLSEQQQRLDELA 319
Query: 192 ITIQRRYRANVAMRTQRERYVALRTATITIQTRFRAYLIAKNQRDEYAELKQARRFRFKL 251
I ++R +A++ QR + R + + R + + + +R R +
Sbjct: 320 IRLRRALENQLALKKQRLERLTQRLNPQIQRQQQRLQQLERRLDKALRRQLKRKRERLEA 379
Query: 252 NLRKYERVIELLKLKR 267
+ + E + L L R
Sbjct: 380 LVEQLESLSPLATLAR 395
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.316 0.134 0.393
Gapped
Lambda K H
0.267 0.0696 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 58,143,471
Number of extensions: 5976772
Number of successful extensions: 8254
Number of sequences better than 10.0: 1
Number of HSP's gapped: 7446
Number of HSP's successfully gapped: 361
Length of query: 1101
Length of database: 10,937,602
Length adjustment: 107
Effective length of query: 994
Effective length of database: 6,191,724
Effective search space: 6154573656
Effective search space used: 6154573656
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 64 (28.5 bits)
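
The effective-length figures in the statistics block above follow from simple arithmetic on the reported values. A minimal sketch in Python, assuming the usual BLAST convention that the length adjustment is subtracted once from the query and once for every database sequence (the function and argument names are illustrative and not part of RPS-BLAST):

```python
def effective_search_space(query_len, db_len, n_seqs, length_adjustment):
    """Return (effective query length, effective database length, search space).

    Assumes the length adjustment is removed once from the query and once
    from every database sequence; this is an illustrative reconstruction,
    not RPS-BLAST's own code.
    """
    eff_query = query_len - length_adjustment
    eff_db = db_len - n_seqs * length_adjustment
    return eff_query, eff_db, eff_query * eff_db


# Values copied from the statistics block of this report.
eff_query, eff_db, space = effective_search_space(
    query_len=1101, db_len=10_937_602, n_seqs=44_354, length_adjustment=107
)
print(eff_query, eff_db, space)
```

With the values from this report, the three results agree with the reported effective length of query (994), effective length of database (6,191,724) and effective search space (6154573656).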