RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy8139
         (1101 letters)



>gnl|CDD|191810 pfam07679, I-set, Immunoglobulin I-set domain. 
          Length = 90

 Score = 96.2 bits (240), Expect = 7e-24
 Identities = 34/88 (38%), Positives = 42/88 (47%)

Query: 453 PSFIRALHDTTALEDEKVEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLII 512
           P F +   D    E E   FT  V G P P VSW+KDG  + SS R ++  +    TL I
Sbjct: 1   PKFTQKPKDVEVQEGESARFTCTVTGDPDPTVSWFKDGQPLRSSDRFKVTYEGGTYTLTI 60

Query: 513 HQAALMDEGEIKCTATNRAGHSITKARL 540
                 DEG+  C ATN AG +   A L
Sbjct: 61  SNVQPDDEGKYTCVATNSAGEAEASAEL 88



 Score = 68.1 bits (167), Expect = 6e-14
 Identities = 27/76 (35%), Positives = 40/76 (52%)

Query: 563 GEIIKLKVSMAGMPPPTARWLHNGEPLTSGGRYEITHTDRYLNLRISDARRADRGEYQAH 622
           GE  +   ++ G P PT  W  +G+PL S  R+++T+      L IS+ +  D G+Y   
Sbjct: 15  GESARFTCTVTGDPDPTVSWFKDGQPLRSSDRFKVTYEGGTYTLTISNVQPDDEGKYTCV 74

Query: 623 GVNSLGEDVASFLVTV 638
             NS GE  AS  +TV
Sbjct: 75  ATNSAGEAEASAELTV 90


>gnl|CDD|238020 cd00063, FN3, Fibronectin type 3 domain; One of three types of
           internal repeats found in the plasma protein
           fibronectin. Its tenth fibronectin type III repeat
           contains an RGD cell recognition sequence in a flexible
           loop between 2 strands. Approximately 2% of all animal
           proteins contain the FN3 repeat; including extracellular
           and intracellular proteins, membrane spanning cytokine
           receptors, growth hormone receptors, tyrosine
           phosphatase receptors, and adhesion molecules. FN3-like
           domains are also found in bacterial glycosyl hydrolases.
          Length = 93

 Score = 85.6 bits (212), Expect = 5e-20
 Identities = 38/100 (38%), Positives = 52/100 (52%), Gaps = 8/100 (8%)

Query: 345 PGPPGKPQLLPDSPSLDRDVFTIRWEPPEYDGGSPVLGYLVEHRRTGSPYWVRASPHMVE 404
           P PP   ++   +        T+ W PPE DGG P+ GY+VE+R  GS  W         
Sbjct: 1   PSPPTNLRVTDVTS----TSVTLSWTPPEDDGG-PITGYVVEYREKGSGDWKEVEVTPGS 55

Query: 405 DTELMVSGLEPGWRYQFRITAENVVGFSEPGPLSEPLTVT 444
           +T   ++GL+PG  Y+FR+ A N  G S P   SE +TVT
Sbjct: 56  ETSYTLTGLKPGTEYEFRVRAVNGGGESPP---SESVTVT 92



 Score = 85.6 bits (212), Expect = 5e-20
 Identities = 38/100 (38%), Positives = 52/100 (52%), Gaps = 8/100 (8%)

Query: 684 PGPPGKPQLLPDSPSLDRDVFTIRWEPPEYDGGSPVLGYLVEHRRTGSPYWVRASPHMVE 743
           P PP   ++   +        T+ W PPE DGG P+ GY+VE+R  GS  W         
Sbjct: 1   PSPPTNLRVTDVTS----TSVTLSWTPPEDDGG-PITGYVVEYREKGSGDWKEVEVTPGS 55

Query: 744 DTELMVSGLEPGWRYQFRITAENVVGFSEPGPLSEPLTVT 783
           +T   ++GL+PG  Y+FR+ A N  G S P   SE +TVT
Sbjct: 56  ETSYTLTGLKPGTEYEFRVRAVNGGGESPP---SESVTVT 92


>gnl|CDD|214495 smart00060, FN3, Fibronectin type 3 domain.  One of three types of
           internal repeat within the plasma protein, fibronectin.
           The tenth fibronectin type III repeat contains a RGD
           cell recognition sequence in a flexible loop between 2
           strands. Type III modules are present in both
           extracellular and intracellular proteins.
          Length = 83

 Score = 64.9 bits (158), Expect = 6e-13
 Identities = 29/89 (32%), Positives = 42/89 (47%), Gaps = 7/89 (7%)

Query: 345 PGPPGKPQLLPDSPSLDRDVFTIRWEPPEYDGG-SPVLGYLVEHRRTGSPYWVRASPHMV 403
           P PP   ++   +        T+ WEPP  DG    ++GY VE+R  GS  W   +    
Sbjct: 1   PSPPSNLRVTDVTS----TSVTLSWEPPPDDGITGYIVGYRVEYREEGSE-WKEVNVT-P 54

Query: 404 EDTELMVSGLEPGWRYQFRITAENVVGFS 432
             T   ++GL+PG  Y+FR+ A N  G  
Sbjct: 55  SSTSYTLTGLKPGTEYEFRVRAVNGAGEG 83



 Score = 64.9 bits (158), Expect = 6e-13
 Identities = 29/89 (32%), Positives = 42/89 (47%), Gaps = 7/89 (7%)

Query: 684 PGPPGKPQLLPDSPSLDRDVFTIRWEPPEYDGG-SPVLGYLVEHRRTGSPYWVRASPHMV 742
           P PP   ++   +        T+ WEPP  DG    ++GY VE+R  GS  W   +    
Sbjct: 1   PSPPSNLRVTDVTS----TSVTLSWEPPPDDGITGYIVGYRVEYREEGSE-WKEVNVT-P 54

Query: 743 EDTELMVSGLEPGWRYQFRITAENVVGFS 771
             T   ++GL+PG  Y+FR+ A N  G  
Sbjct: 55  SSTSYTLTGLKPGTEYEFRVRAVNGAGEG 83


>gnl|CDD|214653 smart00410, IG_like, Immunoglobulin like.  IG domains that cannot
           be classified into one of IGv1, IGc1, IGc2, IG.
          Length = 85

 Score = 64.4 bits (157), Expect = 1e-12
 Identities = 26/82 (31%), Positives = 36/82 (43%), Gaps = 1/82 (1%)

Query: 461 DTTALEDEKVEFTVQVEGIPTPKVSWYKDGFE-IFSSRRQRIVTDNDISTLIIHQAALMD 519
             T  E E V  + +  G P P+V+WYK G + +  S R  +      STL I      D
Sbjct: 3   SVTVKEGESVTLSCEASGSPPPEVTWYKQGGKLLAESGRFSVSRSGSTSTLTISNVTPED 62

Query: 520 EGEIKCTATNRAGHSITKARLR 541
            G   C ATN +G + +   L 
Sbjct: 63  SGTYTCAATNSSGSASSGTTLT 84



 Score = 57.9 bits (140), Expect = 2e-10
 Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 1/77 (1%)

Query: 563 GEIIKLKVSMAGMPPPTARWLHNG-EPLTSGGRYEITHTDRYLNLRISDARRADRGEYQA 621
           GE + L    +G PPP   W   G + L   GR+ ++ +     L IS+    D G Y  
Sbjct: 9   GESVTLSCEASGSPPPEVTWYKQGGKLLAESGRFSVSRSGSTSTLTISNVTPEDSGTYTC 68

Query: 622 HGVNSLGEDVASFLVTV 638
              NS G   +   +TV
Sbjct: 69  AATNSSGSASSGTTLTV 85


>gnl|CDD|214652 smart00409, IG, Immunoglobulin. 
          Length = 85

 Score = 64.4 bits (157), Expect = 1e-12
 Identities = 26/82 (31%), Positives = 36/82 (43%), Gaps = 1/82 (1%)

Query: 461 DTTALEDEKVEFTVQVEGIPTPKVSWYKDGFE-IFSSRRQRIVTDNDISTLIIHQAALMD 519
             T  E E V  + +  G P P+V+WYK G + +  S R  +      STL I      D
Sbjct: 3   SVTVKEGESVTLSCEASGSPPPEVTWYKQGGKLLAESGRFSVSRSGSTSTLTISNVTPED 62

Query: 520 EGEIKCTATNRAGHSITKARLR 541
            G   C ATN +G + +   L 
Sbjct: 63  SGTYTCAATNSSGSASSGTTLT 84



 Score = 57.9 bits (140), Expect = 2e-10
 Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 1/77 (1%)

Query: 563 GEIIKLKVSMAGMPPPTARWLHNG-EPLTSGGRYEITHTDRYLNLRISDARRADRGEYQA 621
           GE + L    +G PPP   W   G + L   GR+ ++ +     L IS+    D G Y  
Sbjct: 9   GESVTLSCEASGSPPPEVTWYKQGGKLLAESGRFSVSRSGSTSTLTISNVTPEDSGTYTC 68

Query: 622 HGVNSLGEDVASFLVTV 638
              NS G   +   +TV
Sbjct: 69  AATNSSGSASSGTTLTV 85


>gnl|CDD|143225 cd05748, Ig_Titin_like, Immunoglobulin (Ig)-like domain of titin
           and similar proteins.  Ig_Titin_like: immunoglobulin
           (Ig)-like domain found in titin-like proteins. Titin
           (also called connectin) is a fibrous sarcomeric protein
           specifically found in vertebrate striated muscle. Titin
           is gigantic, depending on isoform composition it ranges
           from 2970 to 3700 kDa, and is of a length that spans
           half a sarcomere. Titin largely consists of multiple
           repeats of Ig-like and fibronectin type 3 (FN-III)-like
           domains. Titin connects the ends of myosin thick
           filaments to Z disks and extends along the thick
           filament to the H zone.  It appears to function
           similarly to an elastic band, keeping the myosin
           filaments centered in the sarcomere during muscle
           contraction or stretching. Within the sarcomere, titin
           is also attached to or is associated with myosin binding
           protein C (MyBP-C). MyBP-C appears to contribute to the
           generation of passive tension by titin, and similar to
           titin has repeated Ig-like and FN-III domains. Also
           included in this group are worm twitchin and insect
           projectin, thick filament proteins of invertebrate
           muscle, which also have repeated Ig-like and FN-III
           domains.
          Length = 74

 Score = 60.3 bits (147), Expect = 2e-11
 Identities = 27/73 (36%), Positives = 40/73 (54%)

Query: 566 IKLKVSMAGMPPPTARWLHNGEPLTSGGRYEITHTDRYLNLRISDARRADRGEYQAHGVN 625
           ++L+V ++G P PT  W  +G+PL   GR +I  T    +L I +A R+D G+Y     N
Sbjct: 2   VRLEVPISGRPTPTVTWSKDGKPLKLSGRVQIETTASSTSLVIKNAERSDSGKYTLTLKN 61

Query: 626 SLGEDVASFLVTV 638
             GE  A+  V V
Sbjct: 62  PAGEKSATINVKV 74



 Score = 50.7 bits (122), Expect = 5e-08
 Identities = 24/64 (37%), Positives = 32/64 (50%)

Query: 469 KVEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIHQAALMDEGEIKCTAT 528
            V   V + G PTP V+W KDG  +  S R +I T    ++L+I  A   D G+   T  
Sbjct: 1   SVRLEVPISGRPTPTVTWSKDGKPLKLSGRVQIETTASSTSLVIKNAERSDSGKYTLTLK 60

Query: 529 NRAG 532
           N AG
Sbjct: 61  NPAG 64


>gnl|CDD|143165 cd00096, Ig, Immunoglobulin domain.  Ig: immunoglobulin (Ig) domain
           found in the Ig superfamily. The Ig superfamily is a
           heterogenous group of proteins, built on a common fold
           comprised of a sandwich of two beta sheets. Members of
           this group are components of immunoglobulin, neuroglia,
           cell surface glycoproteins, such as, T-cell receptors,
           CD2, CD4, CD8, and membrane glycoproteins, such as,
           butyrophilin and chondroitin sulfate proteoglycan core
           protein. A predominant feature of most Ig domains is a
           disulfide bridge connecting the two beta-sheets with a
           tryptophan residue packed against the disulfide bond.
          Length = 74

 Score = 59.8 bits (144), Expect = 3e-11
 Identities = 22/74 (29%), Positives = 29/74 (39%), Gaps = 4/74 (5%)

Query: 470 VEFTVQVEGIPTPKVSWYKDGFEI----FSSRRQRIVTDNDISTLIIHQAALMDEGEIKC 525
           V  T    G P P ++W K+G  +     +  R    T +  STL I    L D G   C
Sbjct: 1   VTLTCLASGPPPPTITWLKNGKPLPSSVLTRVRSSRGTSSGSSTLTISNVTLEDSGTYTC 60

Query: 526 TATNRAGHSITKAR 539
            A+N AG       
Sbjct: 61  VASNSAGTVSASVT 74



 Score = 45.2 bits (106), Expect = 4e-06
 Identities = 24/73 (32%), Positives = 31/73 (42%), Gaps = 4/73 (5%)

Query: 566 IKLKVSMAGMPPPTARWLHNGEPLTSGGRYEITHTDRYLN----LRISDARRADRGEYQA 621
           + L    +G PPPT  WL NG+PL S     +  +    +    L IS+    D G Y  
Sbjct: 1   VTLTCLASGPPPPTITWLKNGKPLPSSVLTRVRSSRGTSSGSSTLTISNVTLEDSGTYTC 60

Query: 622 HGVNSLGEDVASF 634
              NS G   AS 
Sbjct: 61  VASNSAGTVSASV 73


>gnl|CDD|200951 pfam00041, fn3, Fibronectin type III domain. 
          Length = 84

 Score = 54.7 bits (132), Expect = 2e-09
 Identities = 23/92 (25%), Positives = 34/92 (36%), Gaps = 12/92 (13%)

Query: 346 GPPGKPQLLPDSPSLDRDVFTIRWEPPEYDGGSPVLGYLVEHRRTGSP---YWVRASPHM 402
             P    +   +        T+ W PP   G  P+ GY VE+R          +      
Sbjct: 1   SAPTNLTVTDVTS----TSLTLSWSPPP--GNGPITGYEVEYRPVNGGEEWKEITVPGT- 53

Query: 403 VEDTELMVSGLEPGWRYQFRITAENVVGFSEP 434
              T   ++GL+PG  Y+ R+ A N  G   P
Sbjct: 54  --TTSYTLTGLKPGTEYEVRVQAVNGAGEGPP 83



 Score = 54.7 bits (132), Expect = 2e-09
 Identities = 23/92 (25%), Positives = 34/92 (36%), Gaps = 12/92 (13%)

Query: 685 GPPGKPQLLPDSPSLDRDVFTIRWEPPEYDGGSPVLGYLVEHRRTGSP---YWVRASPHM 741
             P    +   +        T+ W PP   G  P+ GY VE+R          +      
Sbjct: 1   SAPTNLTVTDVTS----TSLTLSWSPPP--GNGPITGYEVEYRPVNGGEEWKEITVPGT- 53

Query: 742 VEDTELMVSGLEPGWRYQFRITAENVVGFSEP 773
              T   ++GL+PG  Y+ R+ A N  G   P
Sbjct: 54  --TTSYTLTGLKPGTEYEVRVQAVNGAGEGPP 83


>gnl|CDD|197706 smart00408, IGc2, Immunoglobulin C-2 Type. 
          Length = 63

 Score = 53.2 bits (128), Expect = 5e-09
 Identities = 25/67 (37%), Positives = 30/67 (44%), Gaps = 4/67 (5%)

Query: 466 EDEKVEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIHQAALMDEGEIKC 525
           E + V  T   EG P P ++W KDG  +      R V     STL I   +L D G   C
Sbjct: 1   EGQSVTLTCPAEGNPVPNITWLKDGKPL--PESNRFVASG--STLTIKSVSLEDSGLYTC 56

Query: 526 TATNRAG 532
            A N AG
Sbjct: 57  VAENSAG 63



 Score = 49.3 bits (118), Expect = 1e-07
 Identities = 19/67 (28%), Positives = 24/67 (35%), Gaps = 4/67 (5%)

Query: 562 MGEIIKLKVSMAGMPPPTARWLHNGEPLTSGGRYEITHTDRYLNLRISDARRADRGEYQA 621
            G+ + L     G P P   WL +G+PL    R     +     L I      D G Y  
Sbjct: 1   EGQSVTLTCPAEGNPVPNITWLKDGKPLPESNR--FVASGST--LTIKSVSLEDSGLYTC 56

Query: 622 HGVNSLG 628
              NS G
Sbjct: 57  VAENSAG 63


>gnl|CDD|143201 cd05724, Ig2_Robo, Second immunoglobulin (Ig)-like domain in Robo
           (roundabout) receptors.  Ig2_Robo: domain similar to the
           second immunoglobulin (Ig)-like domain in Robo
           (roundabout) receptors. Robo receptors play a role in
           the development of the central nervous system (CNS), and
           are receptors of Slit protein. Slit is a repellant
           secreted by the neural cells in the midline. Slit acts
           through Robo to prevent most neurons from crossing the
           midline from either side. Three mammalian Robo homologs
           (robo1, -2, and -3), and three mammalian Slit homologs
           (Slit-1,-2, -3), have been identified. Commissural
           axons, which cross the midline, express low levels of
           Robo; longitudinal axons, which avoid the midline,
           express high levels of Robo. robo1, -2, and -3 are
           expressed by commissural neurons in the vertebrate
           spinal cord and Slits 1, -2, -3 are expressed at the
           ventral midline. Robo-3 is a divergent member of the
           Robo family which instead of being a positive regulator
           of slit responsiveness, antagonizes slit responsiveness
           in precrossing axons.  The Slit-Robo interaction is
           mediated by the second leucine-rich repeat (LRR) domain
           of Slit and the two N-terminal Ig domains of Robo, Ig1
           and Ig2. The primary Robo binding site for Slit2 has
           been shown by surface plasmon resonance experiments and
           mutational analysis to be is the Ig1 domain, while the
           Ig2 domain has been proposed to harbor a weak secondary
           binding site.
          Length = 86

 Score = 52.8 bits (127), Expect = 1e-08
 Identities = 29/65 (44%), Positives = 35/65 (53%), Gaps = 5/65 (7%)

Query: 478 GIPTPKVSWYKDGFEI-FSSRRQRIVTDNDISTLIIHQAALMDEGEIKCTATNRAGHSIT 536
           G P P VSW KDG  +   + R RIV D +   L+I +A   DEG  KC ATN  G   +
Sbjct: 23  GHPEPTVSWRKDGQPLNLDNERVRIVDDGN---LLIAEARKSDEGTYKCVATNMVGERES 79

Query: 537 K-ARL 540
             ARL
Sbjct: 80  AAARL 84



 Score = 39.3 bits (92), Expect = 7e-04
 Identities = 21/57 (36%), Positives = 29/57 (50%), Gaps = 4/57 (7%)

Query: 574 GMPPPTARWLHNGEPLTSGG-RYEITHTDRYLNLRISDARRADRGEYQAHGVNSLGE 629
           G P PT  W  +G+PL     R  I       NL I++AR++D G Y+    N +GE
Sbjct: 23  GHPEPTVSWRKDGQPLNLDNERVRIVDDG---NLLIAEARKSDEGTYKCVATNMVGE 76


>gnl|CDD|143202 cd05725, Ig3_Robo, Third immunoglobulin (Ig)-like domain in Robo
           (roundabout) receptors.  Ig3_Robo: domain similar to the
           third immunoglobulin (Ig)-like domain in Robo
           (roundabout) receptors. Robo receptors play a role in
           the development of the central nervous system (CNS), and
           are receptors of Slit protein. Slit is a repellant
           secreted by the neural cells in the midline. Slit acts
           through Robo to prevent most neurons from crossing the
           midline from either side. Three mammalian Robo homologs
           (robo1, -2, and -3), and three mammalian Slit homologs
           (Slit-1,-2, -3), have been identified. Commissural
           axons, which cross the midline, express low levels of
           Robo; longitudinal axons, which avoid the midline,
           express high levels of Robo. robo1, -2, and -3 are
           expressed by commissural neurons in the vertebrate
           spinal cord and Slits 1, -2, -3 are expressed at the
           ventral midline. Robo-3 is a divergent member of the
           Robo family which instead of being a positive regulator
           of slit responsiveness, antagonizes slit responsiveness
           in precrossing axons.  The Slit-Robo interaction is
           mediated by the second leucine-rich repeat (LRR) domain
           of Slit and the two N-terminal Ig domains of Robo, Ig1
           and Ig2. The primary Robo binding site for Slit2 has
           been shown by surface plasmon resonance experiments and
           mutational analysis to be is the Ig1 domain, while the
           Ig2 domain has been proposed to harbor a weak secondary
           binding site.
          Length = 69

 Score = 48.2 bits (115), Expect = 4e-07
 Identities = 25/72 (34%), Positives = 30/72 (41%), Gaps = 4/72 (5%)

Query: 470 VEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIHQAALMDEGEIKCTATN 529
           VEF  +V G P P V W K+  E+    R  I+ D    +L I      DEG   C A N
Sbjct: 1   VEFQCEVGGDPVPTVLWRKEDGEL-PKGRAEILDDK---SLKIRNVTAGDEGSYTCEAEN 56

Query: 530 RAGHSITKARLR 541
             G     A L 
Sbjct: 57  MVGKIEASASLT 68



 Score = 33.1 bits (76), Expect = 0.083
 Identities = 22/71 (30%), Positives = 31/71 (43%), Gaps = 4/71 (5%)

Query: 568 LKVSMAGMPPPTARWLHNGEPLTSGGRYEITHTDRYLNLRISDARRADRGEYQAHGVNSL 627
            +  + G P PT  W      L  G R EI   D+  +L+I +    D G Y     N +
Sbjct: 3   FQCEVGGDPVPTVLWRKEDGELPKG-RAEIL-DDK--SLKIRNVTAGDEGSYTCEAENMV 58

Query: 628 GEDVASFLVTV 638
           G+  AS  +TV
Sbjct: 59  GKIEASASLTV 69


>gnl|CDD|143302 cd05894, Ig_C5_MyBP-C, C5 immunoglobulin (Ig) domain of cardiac
           myosin binding protein C (MyBP-C).  Ig_C5_MyBP_C : the
           C5 immunoglobulin (Ig) domain of cardiac myosin binding
           protein C (MyBP-C). MyBP_C consists of repeated domains,
           Ig and fibronectin type 3, and various linkers. Three
           isoforms of MYBP_C exist and are included in this group:
           cardiac(c), and fast and slow skeletal muscle (s)
           MyBP_C. cMYBP_C has insertions between and inside
           domains and an additional cardiac-specific Ig domain at
           the N-terminus. For cMYBP_C  an interaction has been
           demonstrated between this C5 domain and the Ig C8
           domain.
          Length = 86

 Score = 48.3 bits (115), Expect = 5e-07
 Identities = 25/77 (32%), Positives = 33/77 (42%), Gaps = 1/77 (1%)

Query: 563 GEIIKLKVSMAGMPPPTARWLHNGEPLTSG-GRYEITHTDRYLNLRISDARRADRGEYQA 621
           G  ++L V ++G P PT  W    +  T   GR  +       +  I  A R D G Y  
Sbjct: 10  GNKLRLDVPISGEPAPTVTWSRGDKAFTETEGRVRVESYKDLSSFVIEGAEREDEGVYTI 69

Query: 622 HGVNSLGEDVASFLVTV 638
              N +GED AS  V V
Sbjct: 70  TVTNPVGEDHASLFVKV 86



 Score = 39.8 bits (93), Expect = 5e-04
 Identities = 22/66 (33%), Positives = 33/66 (50%), Gaps = 3/66 (4%)

Query: 469 KVEFTVQVEGIPTPKVSWYKDGFEIF--SSRRQRIVTDNDISTLIIHQAALMDEGEIKCT 526
           K+   V + G P P V+W + G + F  +  R R+ +  D+S+ +I  A   DEG    T
Sbjct: 12  KLRLDVPISGEPAPTVTWSR-GDKAFTETEGRVRVESYKDLSSFVIEGAEREDEGVYTIT 70

Query: 527 ATNRAG 532
            TN  G
Sbjct: 71  VTNPVG 76


>gnl|CDD|143173 cd04972, Ig_TrkABC_d4, Fourth domain (immunoglobulin-like) of Trk
           receptors TrkA, TrkB and TrkC.  TrkABC_d4: the fourth
           domain of Trk receptors TrkA, TrkB and TrkC, this is an
           immunoglobulin (Ig)-like domain which binds to
           neurotrophin. The Trk family of receptors are tyrosine
           kinase receptors. They are activated by dimerization,
           leading to autophosphorylation of intracellular tyrosine
           residues, and triggering the signal transduction
           pathway. TrkA, TrkB, and TrkC share significant sequence
           homology and domain organization. The first three
           domains are leucine-rich domains. The fourth and fifth
           domains are Ig-like domains playing a part in ligand
           binding. TrkA, Band C mediate the trophic effects of the
           neurotrophin Nerve growth factor (NGF) family. TrkA is
           recognized by NGF. TrKB is recognized by brain-derived
           neurotrophic factor (BDNF) and neurotrophin (NT)-4. TrkC
           is recognized by NT-3. NT-3 is promiscuous as in some
           cell systems it activates TrkA and TrkB receptors. TrkA
           is a receptor found in all major NGF targets, including
           the sympathetic, trigeminal, and dorsal root ganglia,
           cholinergic neurons of the basal forebrain and the
           striatum. TrKB transcripts are found throughout multiple
           structures of the central and peripheral nervous
           systems. The TrkC gene is expressed throughout the
           mammalian nervous system.
          Length = 90

 Score = 47.5 bits (113), Expect = 1e-06
 Identities = 20/80 (25%), Positives = 29/80 (36%)

Query: 461 DTTALEDEKVEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIHQAALMDE 520
            T   E          EG P PKV W   G  +  +R   + T  DI  L +       +
Sbjct: 9   ATVVYEGGTATIRCTAEGSPLPKVEWIIAGLIVIQTRTDTLETTVDIYNLQLSNITSETQ 68

Query: 521 GEIKCTATNRAGHSITKARL 540
             + CTA N  G +    ++
Sbjct: 69  TTVTCTAENPVGQANVSVQV 88


>gnl|CDD|143220 cd05743, Ig_Perlecan_D2_like, Immunoglobulin (Ig)-like domain II
           (D2) of the human basement membrane heparan sulfate
           proteoglycan perlecan, also known as HSPG2.
           Ig_Perlecan_D2_like: the immunoglobulin (Ig)-like domain
           II (D2) of the human basement membrane heparan sulfate
           proteoglycan perlecan, also known as HSPG2. Perlecan
           consists of five domains. Domain I has three putative
           heparan sulfate attachment sites; domain II has four LDL
           receptor-like repeats, and one Ig-like repeat; domain
           III resembles the short arm of laminin chains; domain IV
           has multiple Ig-like repeats (21 repeats in human
           perlecan); and domain V resembles the globular G domain
           of the laminin A chain and internal repeats of EGF.
           Perlecan may participate in a variety of biological
           functions including cell binding, LDL-metabolism,
           basement membrane assembly and selective permeability,
           calcium binding, and growth- and neurite-promoting
           activities.
          Length = 78

 Score = 47.1 bits (112), Expect = 1e-06
 Identities = 22/65 (33%), Positives = 30/65 (46%)

Query: 468 EKVEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIHQAALMDEGEIKCTA 527
           E VEFT    G+PTP ++W  +   +  S R  I ++    TL I      D+G   C A
Sbjct: 2   ETVEFTCVATGVPTPIINWRLNWGHVPDSARVSITSEGGYGTLTIRDVKESDQGAYTCEA 61

Query: 528 TNRAG 532
            N  G
Sbjct: 62  INTRG 66



 Score = 39.0 bits (91), Expect = 7e-04
 Identities = 19/66 (28%), Positives = 28/66 (42%)

Query: 563 GEIIKLKVSMAGMPPPTARWLHNGEPLTSGGRYEITHTDRYLNLRISDARRADRGEYQAH 622
           GE ++      G+P P   W  N   +    R  IT    Y  L I D + +D+G Y   
Sbjct: 1   GETVEFTCVATGVPTPIINWRLNWGHVPDSARVSITSEGGYGTLTIRDVKESDQGAYTCE 60

Query: 623 GVNSLG 628
            +N+ G
Sbjct: 61  AINTRG 66


>gnl|CDD|143224 cd05747, Ig5_Titin_like, M5, fifth immunoglobulin (Ig)-like domain
           of human titin C terminus and similar proteins.
           Ig5_Titin_like: domain similar to the M5, fifth
           immunoglobulin (Ig)-like domain from the human titin C
           terminus. Titin (also called connectin) is a fibrous
           sarcomeric protein specifically found in vertebrate
           striated muscle. Titin is gigantic; depending on isoform
           composition it ranges from 2970 to 3700 kDa, and is of a
           length that spans half a sarcomere. Titin largely
           consists of multiple repeats of Ig-like and fibronectin
           type 3 (FN-III)-like domains. Titin connects the ends of
           myosin thick filaments to Z disks and extends along the
           thick filament to the H zone, and appears to function
           similar to an elastic band, keeping the myosin filaments
           centered in the sarcomere during muscle contraction or
           stretching.
          Length = 92

 Score = 47.4 bits (112), Expect = 1e-06
 Identities = 24/75 (32%), Positives = 36/75 (48%)

Query: 563 GEIIKLKVSMAGMPPPTARWLHNGEPLTSGGRYEITHTDRYLNLRISDARRADRGEYQAH 622
           GE  +    + G P PT  W+  G+ + S  R++IT T+      IS  + +D G Y   
Sbjct: 18  GESARFSCDVDGEPAPTVTWMREGQIIVSSQRHQITSTEYKSTFEISKVQMSDEGNYTVV 77

Query: 623 GVNSLGEDVASFLVT 637
             NS G+  A F +T
Sbjct: 78  VENSEGKQEAQFTLT 92



 Score = 45.4 bits (107), Expect = 7e-06
 Identities = 24/70 (34%), Positives = 34/70 (48%)

Query: 463 TALEDEKVEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIHQAALMDEGE 522
           T  E E   F+  V+G P P V+W ++G  I SS+R +I +    ST  I +  + DEG 
Sbjct: 14  TVSEGESARFSCDVDGEPAPTVTWMREGQIIVSSQRHQITSTEYKSTFEISKVQMSDEGN 73

Query: 523 IKCTATNRAG 532
                 N  G
Sbjct: 74  YTVVVENSEG 83


>gnl|CDD|143199 cd05722, Ig1_Neogenin, First immunoglobulin (Ig)-like domain in
           neogenin and similar proteins.  Ig1_Neogenin: first
           immunoglobulin (Ig)-like domain in neogenin and related
           proteins. Neogenin  is a cell surface protein which is
           expressed in the developing nervous system of vertebrate
           embryos in the growing nerve cells. It is also expressed
           in other embryonic tissues, and may play a general role
           in developmental processes such as cell migration,
           cell-cell recognition, and tissue growth regulation.
           Included in this group is the tumor suppressor protein
           DCC, which is deleted in colorectal carcinoma . DCC and
           neogenin each have four Ig-like domains followed by six
           fibronectin type III domains, a transmembrane domain,
           and an intracellular domain.
          Length = 95

 Score = 47.1 bits (112), Expect = 1e-06
 Identities = 27/95 (28%), Positives = 36/95 (37%), Gaps = 4/95 (4%)

Query: 454 SFIRALHDTTALEDEKVEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIH 513
            F+    D  A+    V      EG P PK+ W KDG  +     +R     + S LI  
Sbjct: 1   WFLSEPSDIVAVRGGPVVLNCSAEGEPPPKIEWKKDGVLLNLVSDERRQQLPNGSLLITS 60

Query: 514 ----QAALMDEGEIKCTATNRAGHSITKARLRLEA 544
               +    DEG  +C A N +  SI     RL  
Sbjct: 61  VVHSKHNKPDEGFYQCVAQNDSLGSIVSRTARLTV 95


>gnl|CDD|143208 cd05731, Ig3_L1-CAM_like, Third immunoglobulin (Ig)-like domain of
           the L1 cell adhesion molecule (CAM).  Ig3_L1-CAM_like:
           domain similar to the third immunoglobulin (Ig)-like
           domain of the L1 cell adhesion molecule (CAM). L1
           belongs to the L1 subfamily of cell adhesion molecules
           (CAMs) and is comprised of an extracellular region
           having six Ig-like domains and five fibronectin type III
           domains, a transmembrane region and an intracellular
           domain. L1 is primarily expressed in the nervous system
           and is involved in its development and function. L1 is
           associated with an X-linked recessive disorder, X-linked
           hydrocephalus, MASA syndrome, or spastic paraplegia type
           1, that involves abnormalities of axonal growth. This
           group also contains the chicken neuron-glia cell
           adhesion molecule, Ng-CAM and human neurofascin.
          Length = 71

 Score = 46.2 bits (110), Expect = 2e-06
 Identities = 25/63 (39%), Positives = 37/63 (58%), Gaps = 6/63 (9%)

Query: 477 EGIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIHQAALMDEGEIKCTATNRAG---H 533
           EG+PTP++SW K G E+ +    R   +N   TL I   +  D+GE +CTA+N  G   H
Sbjct: 8   EGLPTPEISWIKIGGELPAD---RTKFENFNKTLKIDNVSEEDDGEYRCTASNSLGSARH 64

Query: 534 SIT 536
           +I+
Sbjct: 65  TIS 67



 Score = 33.9 bits (78), Expect = 0.035
 Identities = 19/66 (28%), Positives = 27/66 (40%), Gaps = 3/66 (4%)

Query: 573 AGMPPPTARWLHNGEPLTSGGRYEITHTDRYLNLRISDARRADRGEYQAHGVNSLGEDVA 632
            G+P P   W+  G  L +         +    L+I +    D GEY+    NSLG    
Sbjct: 8   EGLPTPEISWIKIGGELPAD---RTKFENFNKTLKIDNVSEEDDGEYRCTASNSLGSARH 64

Query: 633 SFLVTV 638
           +  VTV
Sbjct: 65  TISVTV 70


>gnl|CDD|143222 cd05745, Ig3_Peroxidasin, Third immunoglobulin (Ig)-like domain of
           peroxidasin.  Ig3_Peroxidasin: the third immunoglobulin
           (Ig)-like domain in peroxidasin. Peroxidasin has a
           peroxidase domain and interacting extracellular motifs
           containing four Ig-like domains. It has been suggested
           that peroxidasin is secreted and has functions related
           to the stabilization of the extracellular matrix. It may
           play a part in various other important processes such as
           removal and destruction of cells which have undergone
           programmed cell death, and protection of the organism
           against non-self.
          Length = 74

 Score = 46.5 bits (110), Expect = 2e-06
 Identities = 25/75 (33%), Positives = 41/75 (54%), Gaps = 3/75 (4%)

Query: 466 EDEKVEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIHQAALMDEGEIKC 525
           E + V+F  + +G P P ++W K G ++   RR  +++     TL I + AL D+G+ +C
Sbjct: 1   EGQTVDFLCEAQGYPQPVIAWTKGGSQLSVDRRHLVLSS---GTLRISRVALHDQGQYEC 57

Query: 526 TATNRAGHSITKARL 540
            A N  G   T A+L
Sbjct: 58  QAVNIVGSQRTVAQL 72



 Score = 36.8 bits (85), Expect = 0.004
 Identities = 20/76 (26%), Positives = 31/76 (40%), Gaps = 3/76 (3%)

Query: 563 GEIIKLKVSMAGMPPPTARWLHNGEPLTSGGRYEITHTDRYLNLRISDARRADRGEYQAH 622
           G+ +       G P P   W   G  L+   R+ +  +     LRIS     D+G+Y+  
Sbjct: 2   GQTVDFLCEAQGYPQPVIAWTKGGSQLSVDRRHLVLSSG---TLRISRVALHDQGQYECQ 58

Query: 623 GVNSLGEDVASFLVTV 638
            VN +G       +TV
Sbjct: 59  AVNIVGSQRTVAQLTV 74


>gnl|CDD|143207 cd05730, Ig3_NCAM-1_like, Third immunoglobulin (Ig)-like domain of
           Neural Cell Adhesion Molecule NCAM-1 (NCAM).
           Ig3_NCAM-1_like: domain similar to the third
           immunoglobulin (Ig)-like domain of Neural Cell Adhesion
           Molecule NCAM-1 (NCAM). NCAM plays important roles in
           the development and regeneration of the central nervous
           system, in synaptogenesis and neural migration. NCAM
           mediates cell-cell and cell-substratum recognition and
           adhesion via homophilic (NCAM-NCAM), and heterophilic
           (NCAM-non-NCAM), interactions. NCAM is expressed as
           three major isoforms having different intracellular
           extensions. The extracellular portion of NCAM has five
           N-terminal Ig-like domains and two fibronectin type III
           domains. The double zipper adhesion complex model for
           NCAM homophilic binding involves Ig1, Ig2, and Ig3. By
           this model, Ig1,and Ig2 mediate dimerization of NCAM
           molecules situated on the same cell surface (cis
           interactions), and Ig3 domains mediate interactions
           between NCAM molecules expressed on the surface of
           opposing cells (trans interactions), through binding to
           the Ig1 and Ig2 domains. The adhesive ability of NCAM is
           modulated by the addition of polysialic acid chains to
           the fifth Ig-like domain.
          Length = 95

 Score = 45.7 bits (108), Expect = 5e-06
 Identities = 27/85 (31%), Positives = 37/85 (43%), Gaps = 4/85 (4%)

Query: 453 PSFIRALHDT---TALEDEKVEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRIVTDNDIST 509
           P  IRA       TA   + V      +G P P ++W KDG E   S  ++   + D S 
Sbjct: 1   PPTIRARQSEVNATANLGQSVTLACDADGFPEPTMTWTKDG-EPIESGEEKYSFNEDGSE 59

Query: 510 LIIHQAALMDEGEIKCTATNRAGHS 534
           + I     +DE E  C A N+AG  
Sbjct: 60  MTILDVDKLDEAEYTCIAENKAGEQ 84



 Score = 43.4 bits (102), Expect = 4e-05
 Identities = 30/94 (31%), Positives = 40/94 (42%), Gaps = 2/94 (2%)

Query: 545 PPTIRLPKQYEDGLLFEMGEIIKLKVSMAGMPPPTARWLHNGEPLTSGGRYEITHTDRYL 604
           PPTIR  +Q E      +G+ + L     G P PT  W  +GEP+ SG      + D   
Sbjct: 1   PPTIRA-RQSEVNATANLGQSVTLACDADGFPEPTMTWTKDGEPIESGEEKYSFNEDGS- 58

Query: 605 NLRISDARRADRGEYQAHGVNSLGEDVASFLVTV 638
            + I D  + D  EY     N  GE  A   + V
Sbjct: 59  EMTILDVDKLDEAEYTCIAENKAGEQEAEIHLKV 92


>gnl|CDD|143240 cd05763, Ig_1, Subgroup of the immunoglobulin (Ig) superfamily.
           Ig_1: subgroup of the immunoglobulin (Ig) domain found
           in the Ig superfamily. The Ig superfamily is a
           heterogenous group of proteins, built on a common fold
           comprised of a sandwich of two beta sheets. Members of
           the Ig superfamily are components of immunoglobulin,
           neuroglia, cell surface glycoproteins, such as T-cell
           receptors, CD2, CD4, CD8, and membrane glycoproteins,
           such as butyrophilin and chondroitin sulfate
           proteoglycan core protein. A predominant feature of most
           Ig domains is a disulfide bridge connecting the two
           beta-sheets with a tryptophan residue packed against the
           disulfide bond.
          Length = 75

 Score = 44.9 bits (106), Expect = 6e-06
 Identities = 23/67 (34%), Positives = 34/67 (50%), Gaps = 5/67 (7%)

Query: 477 EGIPTPKVSWYKDGFEIFSSRRQR---IVTDNDISTLIIHQAALMDEGEIKCTATNRAGH 533
            G PTP+++W KDG   F + R+R   ++ ++D+    I    + D G   CTA N AG 
Sbjct: 8   TGHPTPQIAWQKDGGTDFPAARERRMHVMPEDDV--FFIVDVKIEDTGVYSCTAQNTAGS 65

Query: 534 SITKARL 540
               A L
Sbjct: 66  ISANATL 72


>gnl|CDD|212460 cd05723, Ig4_Neogenin, Fourth immunoglobulin (Ig)-like domain in
           neogenin and similar proteins.  Ig4_Neogenin: fourth
           immunoglobulin (Ig)-like domain in neogenin and related
           proteins. Neogenin  is a cell surface protein which is
           expressed in the developing nervous system of vertebrate
           embryos in the growing nerve cells. It is also expressed
           in other embryonic tissues, and may play a general role
           in developmental processes such as cell migration,
           cell-cell recognition, and tissue growth regulation.
           Included in this group is the tumor suppressor protein
           DCC, which is deleted in colorectal carcinoma . DCC and
           neogenin each have four Ig-like domains followed by six
           fibronectin type III domains, a transmembrane domain,
           and an intracellular domain.
          Length = 71

 Score = 44.6 bits (105), Expect = 6e-06
 Identities = 23/71 (32%), Positives = 38/71 (53%), Gaps = 3/71 (4%)

Query: 470 VEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIHQAALMDEGEIKCTATN 529
           + F  +V G PTP V W K+G  +  S   +IV ++++  L + ++   DEG  +C A N
Sbjct: 2   IVFECEVTGKPTPTVKWVKNGDMVIPSDYFKIVKEHNLQVLGLVKS---DEGFYQCIAEN 58

Query: 530 RAGHSITKARL 540
             G+    A+L
Sbjct: 59  DVGNVQAGAQL 69



 Score = 34.2 bits (78), Expect = 0.038
 Identities = 18/68 (26%), Positives = 31/68 (45%), Gaps = 3/68 (4%)

Query: 566 IKLKVSMAGMPPPTARWLHNGEPLTSGGRYEITHTDRYLNLRISDARRADRGEYQAHGVN 625
           I  +  + G P PT +W+ NG+ +     ++I       NL++    ++D G YQ    N
Sbjct: 2   IVFECEVTGKPTPTVKWVKNGDMVIPSDYFKIVKEH---NLQVLGLVKSDEGFYQCIAEN 58

Query: 626 SLGEDVAS 633
            +G   A 
Sbjct: 59  DVGNVQAG 66


>gnl|CDD|143209 cd05732, Ig5_NCAM-1_like, Fifth immunoglobulin (Ig)-like domain of
           Neural Cell Adhesion Molecule NCAM-1 (NCAM) and similar
           proteins.  Ig5_NCAM-1 like: domain similar to the fifth
           immunoglobulin (Ig)-like domain of Neural Cell Adhesion
           Molecule NCAM-1 (NCAM). NCAM plays important roles in
           the development and regeneration of the central nervous
           system, in synaptogenesis and neural migration. NCAM
           mediates cell-cell and cell-substratum recognition and
           adhesion via homophilic  (NCAM-NCAM), and heterophilic
           (NCAM-non-NCAM), interactions. NCAM is expressed as
           three major isoforms having different intracellular
           extensions. The extracellular portion of NCAM has five
           N-terminal Ig-like domains and two fibronectin type III
           domains. The double zipper adhesion complex model for
           NCAM homophilic binding involves Ig1, Ig2, and Ig3. By
           this model, Ig1 and Ig2 mediate dimerization of NCAM
           molecules situated on the same cell surface (cis
           interactions), and Ig3 domains mediate interactions
           between NCAM molecules expressed on the surface of
           opposing cells (trans interactions), through binding to
           the Ig1 and Ig2 domains. The adhesive ability of NCAM is
           modulated by the addition of polysialic acid chains to
           the fifth Ig-like domain. Also included in this group is
           NCAM-2 (also known as OCAM/mamFas II and RNCAM)  NCAM-2
           is differentially expressed in the developing and mature
           olfactory epithelium (OE).
          Length = 96

 Score = 44.1 bits (104), Expect = 2e-05
 Identities = 27/83 (32%), Positives = 40/83 (48%), Gaps = 7/83 (8%)

Query: 456 IRALHDTTALEDEKVEFTVQVEGIPTPKVSWYKDGFEIFSSRRQ----RIVTDNDI--ST 509
           I  L + TA+E E++  T + EG P P+++W +     FS   +    RIV       S+
Sbjct: 5   ITYLENQTAVELEQITLTCEAEGDPIPEITW-RRATRNFSEGDKSLDGRIVVRGHARVSS 63

Query: 510 LIIHQAALMDEGEIKCTATNRAG 532
           L +    L D G   C A+NR G
Sbjct: 64  LTLKDVQLTDAGRYDCEASNRIG 86


>gnl|CDD|143205 cd05728, Ig4_Contactin-2-like, Fourth Ig domain of the neural cell
           adhesion molecule contactin-2 and similar proteins.
           Ig4_Contactin-2-like: fourth Ig domain of the neural
           cell adhesion molecule contactin-2. Contactins are
           comprised of six Ig domains followed by four fibronectin
           type III (FnIII) domains anchored to the membrane by
           glycosylphosphatidylinositol. Contactin-2 (aliases
           TAG-1, axonin-1) facilitates cell adhesion by homophilic
           binding between molecules in apposed membranes. The
           first four Ig domains form the intermolecular binding
           fragment which arranges as a compact U-shaped module by
           contacts between Ig domains 1 and 4, and domains 2 and
           3. It has been proposed that a linear zipper-like array
           forms, from contactin-2 molecules alternatively provided
           by the two apposed membranes.
          Length = 85

 Score = 43.4 bits (102), Expect = 3e-05
 Identities = 26/66 (39%), Positives = 32/66 (48%), Gaps = 4/66 (6%)

Query: 573 AGMPPPTARWLHNGEPLTSGGRYEITHTDRYLNLRISDARRADRGEYQAHGVNSLGEDVA 632
           +G P P  RWL NG+PL S  R E+   D    LRI+    +D G YQ    N  G   A
Sbjct: 24  SGNPRPAYRWLKNGQPLASENRIEVEAGD----LRITKLSLSDSGMYQCVAENKHGTIYA 79

Query: 633 SFLVTV 638
           S  + V
Sbjct: 80  SAELAV 85



 Score = 34.5 bits (79), Expect = 0.033
 Identities = 20/69 (28%), Positives = 29/69 (42%), Gaps = 12/69 (17%)

Query: 478 GIPTPKVSWYKDGFEIFSSRRQRIVTDNDI----STLIIHQAALMDEGEIKCTATNRAGH 533
           G P P   W K+G        Q + ++N I      L I + +L D G  +C A N+ G 
Sbjct: 25  GNPRPAYRWLKNG--------QPLASENRIEVEAGDLRITKLSLSDSGMYQCVAENKHGT 76

Query: 534 SITKARLRL 542
               A L +
Sbjct: 77  IYASAELAV 85


>gnl|CDD|143178 cd04977, Ig1_NCAM-1_like, First immunoglobulin (Ig)-like domain of
           neural cell adhesion molecule NCAM-1 and similar
           proteins.  Ig1_NCAM-1 like: first immunoglobulin
           (Ig)-like domain of neural cell adhesion molecule
           NCAM-1. NCAM-1 plays important roles in the development
           and regeneration of the central nervous system, in
           synaptogenesis and neural migration. NCAM mediates
           cell-cell and cell-substratum recognition and adhesion
           via homophilic (NCAM-NCAM), and heterophilic
           (NCAM-nonNCAM), interactions. NCAM is expressed as three
           major isoforms having different intracellular
           extensions. The extracellular portion of NCAM has five
           N-terminal Ig-like domains and two fibronectin type III
           domains. The double zipper adhesion complex model for
           NCAM homophilic binding involves the Ig1, Ig2, and Ig3
           domains. By this model, Ig1 and Ig2 mediate dimerization
           of NCAM molecules situated on the same cell surface (cis
           interactions), and Ig3 domains mediate interactions
           between NCAM molecules expressed on the surface of
           opposing cells (trans interactions), through binding to
           the Ig1 and Ig2 domains. The adhesive ability of NCAM is
           modulated by the addition of polysialic acid chains to
           the fifth Ig-like domain. Also included in this group is
           NCAM-2 (also known as OCAM/mamFas II and RNCAM).  NCAM-2
           is differentially expressed in the developing and mature
           olfactory epithelium (OE).
          Length = 92

 Score = 42.9 bits (101), Expect = 4e-05
 Identities = 24/63 (38%), Positives = 32/63 (50%), Gaps = 3/63 (4%)

Query: 472 FTVQVEGIPTPKVSWYKDGFEIFSSRRQRIVTDND--ISTLIIHQAALMDEGEIKCTATN 529
           F  QV G P   +SW+    E   +++Q  V  ND   STL I+ A + D G  KC AT+
Sbjct: 20  FLCQVIGEPK-DISWFSPNGEKLVTQQQISVVQNDDVRSTLTIYNANIEDAGIYKCVATD 78

Query: 530 RAG 532
             G
Sbjct: 79  AKG 81


>gnl|CDD|222457 pfam13927, Ig_3, Immunoglobulin domain.  This family contains
           immunoglobulin-like domains.
          Length = 74

 Score = 41.6 bits (97), Expect = 9e-05
 Identities = 20/75 (26%), Positives = 27/75 (36%), Gaps = 5/75 (6%)

Query: 456 IRALHDTTALEDEKVEFTVQVEGIP-TPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIHQ 514
           I      +      V  T   EG P  P +SWY++G    S         +  STL +  
Sbjct: 4   ITVSPSPSVTSGGGVTLTCSAEGGPPPPTISWYRNG----SISGGSGGLGSSGSTLTLSS 59

Query: 515 AALMDEGEIKCTATN 529
               D G   C A+N
Sbjct: 60  VTSEDSGTYTCVASN 74


>gnl|CDD|143223 cd05746, Ig4_Peroxidasin, Fourth immunoglobulin (Ig)-like domain of
           peroxidasin.  Ig4_Peroxidasin: the fourth immunoglobulin
           (Ig)-like domain in peroxidasin. Peroxidasin has a
           peroxidase domain and interacting extracellular motifs
           containing four Ig-like domains. It has been suggested
           that peroxidasin is secreted, and has functions related
           to the stabilization of the extracellular matrix. It may
           play a part in various other important processes such as
           removal and destruction of cells, which have undergone
           programmed cell death, and protection of the organism
           against non-self.
          Length = 69

 Score = 41.4 bits (97), Expect = 9e-05
 Identities = 19/71 (26%), Positives = 32/71 (45%), Gaps = 3/71 (4%)

Query: 470 VEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIHQAALMDEGEIKCTATN 529
           V+     +G P P ++W KDG ++  S +  I  +     L I    + D+G  +C A N
Sbjct: 1   VQIPCSAQGDPEPTITWNKDGVQVTESGKFHISPE---GYLAIRDVGVADQGRYECVARN 57

Query: 530 RAGHSITKARL 540
             G++     L
Sbjct: 58  TIGYASVSMVL 68



 Score = 33.7 bits (77), Expect = 0.045
 Identities = 21/72 (29%), Positives = 38/72 (52%), Gaps = 3/72 (4%)

Query: 566 IKLKVSMAGMPPPTARWLHNGEPLTSGGRYEITHTDRYLNLRISDARRADRGEYQAHGVN 625
           +++  S  G P PT  W  +G  +T  G++ I+  + YL +R  D   AD+G Y+    N
Sbjct: 1   VQIPCSAQGDPEPTITWNKDGVQVTESGKFHISP-EGYLAIR--DVGVADQGRYECVARN 57

Query: 626 SLGEDVASFLVT 637
           ++G    S +++
Sbjct: 58  TIGYASVSMVLS 69


>gnl|CDD|143227 cd05750, Ig_Pro_neuregulin, Immunoglobulin (Ig)-like domain in
           neuregulins (NRGs).  Ig_Pro_neuregulin: immunoglobulin
           (Ig)-like domain in neuregulins (NRGs). NRGs are
           signaling molecules, which participate in cell-cell
           interactions in the nervous system, breast, heart, and
           other organ systems, and are implicated in the pathology
           of diseases including schizophrenia, multiple sclerosis,
           and breast cancer. There are four members of the
           neuregulin gene family (NRG1, -2, -3, and -4). The NRG-1
           protein, binds to and activates the tyrosine kinases
           receptors ErbB3 and ErbB4, initiating signaling
           cascades. The other NRGs proteins bind one or the other
           or both of these ErbBs. NRG-1 has multiple functions;
           for example, in the brain it regulates various processes
           such as radial glia formation and neuronal migration,
           dendritic development, and expression of
           neurotransmitters receptors; in the peripheral nervous
           system NRG-1 regulates processes such as target cell
           differentiation, and Schwann cell survival. There are
           many NRG-1 isoforms, which arise from the alternative
           splicing of mRNA. Less is known of the functions of the
           other NRGs. NRG-2 and -3 are expressed predominantly in
           the nervous system. NRG-2 is expressed by motor neurons
           and terminal Schwann cells, and is concentrated near
           synaptic sites and may be a signal that regulates
           synaptic differentiation. NRG-4 has been shown to direct
           pancreatic islet cell development towards the delta-cell
           lineage.
          Length = 75

 Score = 41.4 bits (97), Expect = 9e-05
 Identities = 19/64 (29%), Positives = 29/64 (45%), Gaps = 3/64 (4%)

Query: 480 PTPKVSWYKDGFEIFSSRRQRIVT---DNDISTLIIHQAALMDEGEIKCTATNRAGHSIT 536
           P+ +  W+KDG E+    + R +        S L I++A L D GE  C   N  G+   
Sbjct: 12  PSLRFKWFKDGKELNRKNKPRNIKIRNKKKNSELQINKAKLADSGEYTCVVENILGNDTV 71

Query: 537 KARL 540
            A +
Sbjct: 72  TANV 75



 Score = 37.5 bits (87), Expect = 0.003
 Identities = 17/64 (26%), Positives = 28/64 (43%), Gaps = 3/64 (4%)

Query: 573 AGMPPPTARWLHNGEPLTSGGRYE---ITHTDRYLNLRISDARRADRGEYQAHGVNSLGE 629
           +  P    +W  +G+ L    +     I +  +   L+I+ A+ AD GEY     N LG 
Sbjct: 9   SEYPSLRFKWFKDGKELNRKNKPRNIKIRNKKKNSELQINKAKLADSGEYTCVVENILGN 68

Query: 630 DVAS 633
           D  +
Sbjct: 69  DTVT 72


>gnl|CDD|143273 cd05865, Ig1_NCAM-1, First immunoglobulin (Ig)-like domain of
           neural cell adhesion molecule NCAM-1.  Ig1_NCAM-1: first
           immunoglobulin (Ig)-like domain of neural cell adhesion
           molecule NCAM-1. NCAM-1 plays important roles in the
           development and regeneration of the central nervous
           system, in synaptogenesis and neural migration. NCAM
           mediates cell-cell and cell-substratum recognition and
           adhesion via homophilic (NCAM-NCAM), and heterophilic
           (NCAM-nonNCAM), interactions. NCAM is expressed as three
           major isoforms having different intracellular
           extensions. The extracellular portion of NCAM has five
           N-terminal Ig-like domains and two fibronectin type III
           domains. The double zipper adhesion complex model for
           NCAM homophilic binding involves the Ig1, Ig2, and Ig3
           domains. By this model, Ig1 and Ig2 mediate dimerization
           of NCAM molecules situated on the same cell surface (cis
           interactions), and Ig3 domains mediate interactions
           between NCAM molecules expressed on the surface of
           opposing cells (trans interactions), through binding to
           the Ig1 and Ig2 domains. The adhesive ability of NCAM is
           modulated by the addition of polysialic acid chains to
           the fifth Ig-like domain.
          Length = 96

 Score = 41.2 bits (96), Expect = 2e-04
 Identities = 22/65 (33%), Positives = 30/65 (46%), Gaps = 4/65 (6%)

Query: 472 FTVQVEGIPTPK-VSWYKDGFEIFSSRRQRIV---TDNDISTLIIHQAALMDEGEIKCTA 527
           F  QV G    K +SW+    E  +  +QRI     D+  STL I+ A + D G  KC  
Sbjct: 20  FLCQVAGEAKDKDISWFSPNGEKLTPNQQRISVVRNDDYSSTLTIYNANIDDAGIYKCVV 79

Query: 528 TNRAG 532
           +N   
Sbjct: 80  SNEDE 84


>gnl|CDD|143239 cd05762, Ig8_MLCK, Eighth immunoglobulin (Ig)-like domain of human
           myosin light-chain kinase (MLCK).  Ig8_MLCK: the eighth
           immunoglobulin (Ig)-like domain of human myosin
           light-chain kinase (MLCK). MLCK is a key regulator of
           different forms of cell motility involving actin and
           myosin II.  Agonist stimulation of smooth muscle cells
           increases cytosolic Ca2+, which binds calmodulin.  This
           Ca2+-calmodulin complex in turn binds to and activates
           MLCK. Activated MLCK leads to the phosphorylation of the
           20 kDa myosin regulatory light chain (RLC) of myosin II
           and the stimulation of actin-activated myosin MgATPase
           activity. MLCK is widely present in vertebrate tissues;
           it phosphorylates the 20 kDa RLC of both smooth and
           nonmuscle myosin II. Phosphorylation leads to the
           activation of the myosin motor domain and altered
           structural properties of myosin II. In smooth muscle
           MLCK it is involved in initiating contraction. In
           nonmuscle cells, MLCK may participate in cell division
           and cell motility; it has been suggested MLCK plays a
           role in cardiomyocyte differentiation and contraction
           through regulation of nonmuscle myosin II.
          Length = 98

 Score = 41.1 bits (96), Expect = 2e-04
 Identities = 25/93 (26%), Positives = 43/93 (46%)

Query: 553 QYEDGLLFEMGEIIKLKVSMAGMPPPTARWLHNGEPLTSGGRYEITHTDRYLNLRISDAR 612
           Q+ + +    GE ++L   + G  P T  W+   + +  G   +I +T+    L I++ +
Sbjct: 5   QFPEDMKVRAGESVELFCKVTGTQPITCTWMKFRKQIQEGEGIKIENTENSSKLTITEGQ 64

Query: 613 RADRGEYQAHGVNSLGEDVASFLVTVTDRPLPP 645
           +   G Y     N LG   A   +TV D+P PP
Sbjct: 65  QEHCGCYTLEVENKLGSRQAQVNLTVVDKPDPP 97


>gnl|CDD|143213 cd05736, Ig2_Follistatin_like, Second immunoglobulin (Ig)-like
           domain of a follistatin-like molecule encoded by the
           Mahya gene and similar proteins.  Ig2_Follistatin_like:
           domain similar to the second immunoglobulin (Ig)-like
           domain found in a follistatin-like molecule encoded by
           the CNS-related Mahya gene. Mahya genes have been
           retained in certain Bilaterian branches during
           evolution.  They are conserved in Hymenoptera and
           Deuterostomes, but are absent from other metazoan
           species such as fruit fly and nematode. Mahya proteins
           are secretory, with a follistatin-like domain
           (Kazal-type serine/threonine protease inhibitor domain
           and EF-hand calcium-binding domain), two Ig-like
           domains, and a novel C-terminal domain. Mahya may be
           involved in learning and memory and in processing of
           sensory information in Hymenoptera and vertebrates.
           Follistatin is a secreted, multidomain protein that
           binds activins with high affinity and antagonizes their
           signaling.
          Length = 76

 Score = 40.3 bits (94), Expect = 3e-04
 Identities = 19/57 (33%), Positives = 28/57 (49%)

Query: 476 VEGIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIHQAALMDEGEIKCTATNRAG 532
            EGIP P+++W K+G +I     +++    + S L I      D G   C A N AG
Sbjct: 7   AEGIPLPRLTWLKNGMDITPKLSKQLTLIANGSELHISNVRYEDTGAYTCIAKNEAG 63



 Score = 37.2 bits (86), Expect = 0.003
 Identities = 23/71 (32%), Positives = 31/71 (43%), Gaps = 2/71 (2%)

Query: 568 LKVSMAGMPPPTARWLHNGEPLTSGGRYEITHTDRYLNLRISDARRADRGEYQAHGVNSL 627
           L+    G+P P   WL NG  +T     ++T       L IS+ R  D G Y     N  
Sbjct: 3   LRCHAEGIPLPRLTWLKNGMDITPKLSKQLTLIANGSELHISNVRYEDTGAYTCIAKNEA 62

Query: 628 G--EDVASFLV 636
           G  ED++S  V
Sbjct: 63  GVDEDISSLFV 73


>gnl|CDD|143235 cd05758, Ig5_KIRREL3-like, Fifth immunoglobulin (Ig)-like domain of
           Kirrel (kin of irregular chiasm-like) 3 (also known as
           Neph2) and similar proteins.  Ig5_KIRREL3-like: domain
           similar to the fifth immunoglobulin (Ig)-like domain of
           Kirrel (kin of irregular chiasm-like) 3 (also known as
           Neph2). This protein has five Ig-like domains, one
           transmembrane domain, and a cytoplasmic tail. Included
           in this group is mammalian Kirrel (Neph1), Kirrel2
           (Neph3), and Drosophila RST (irregular chiasm
           C-roughest) protein. These proteins contain multiple Ig
           domains, have properties of cell adhesion molecules, and
           are important in organ development.
          Length = 98

 Score = 40.9 bits (96), Expect = 3e-04
 Identities = 25/98 (25%), Positives = 36/98 (36%), Gaps = 8/98 (8%)

Query: 452 APSFIRALHDTTALEDEKVEFTVQVEGIPTP-KVSW-YKDGF-EIFSSRRQRIVTDND-- 506
            P  I +     A+  +K      +   P P ++ W +K+   E  SS R  + TD    
Sbjct: 1   GPPIITSEATQYAILGDKGRVECFIFSTPPPDRIVWTWKENELESGSSGRYTVETDPSPG 60

Query: 507 --ISTLIIHQAALMD-EGEIKCTATNRAGHSITKARLR 541
             +STL I      D +    CTA N  G       L 
Sbjct: 61  GVLSTLTISNTQESDFQTSYNCTAWNSFGSGTAIISLE 98



 Score = 30.5 bits (69), Expect = 1.2
 Identities = 27/98 (27%), Positives = 39/98 (39%), Gaps = 14/98 (14%)

Query: 544 APPTI-RLPKQYEDGLLFEMGEIIKLKVSMAGMPPPTA-RWLHNGEPLTSG--GRY--EI 597
            PP I     QY   +L   G+  +++  +   PPP    W      L SG  GRY  E 
Sbjct: 1   GPPIITSEATQY--AIL---GDKGRVECFIFSTPPPDRIVWTWKENELESGSSGRYTVET 55

Query: 598 THTDRYL--NLRISDARRAD-RGEYQAHGVNSLGEDVA 632
             +   +   L IS+ + +D +  Y     NS G   A
Sbjct: 56  DPSPGGVLSTLTISNTQESDFQTSYNCTAWNSFGSGTA 93


>gnl|CDD|143256 cd05848, Ig1_Contactin-5, First Ig domain of contactin-5.
           Ig1_Contactin-5: First Ig domain of the neural cell
           adhesion molecule contactin-5. Contactins are comprised
           of six Ig domains followed by four fibronectin type III
           (FnIII) domains, anchored to the membrane by
           glycosylphosphatidylinositol. The different contactins
           show different expression patterns in the central
           nervous system. In rats, a lack of contactin-5 (NB-2)
           results in an impairment of the neuronal activity in the
           auditory system. Contactin-5 is expressed specifically
           in the postnatal nervous system, peaking at about 3
           weeks postnatal. Contactin-5 is highly expressed in the
           adult human brain in the occipital lobe and in the
           amygdala; lower levels of expression have been detected
           in the corpus callosum, caudate nucleus, and spinal
           cord.
          Length = 94

 Score = 40.3 bits (94), Expect = 4e-04
 Identities = 25/86 (29%), Positives = 38/86 (44%), Gaps = 10/86 (11%)

Query: 453 PSFIRALHD---TTALEDEKVEFTVQVEGIPTPKVSWYKDGFEI--FSSRRQRIVTDNDI 507
           P F++   D    T  +++KV    +  G P P   W ++G EI   S  R  ++     
Sbjct: 2   PVFVQEPDDAIFPTDSDEKKVILNCEARGNPVPTYRWLRNGTEIDTESDYRYSLID---- 57

Query: 508 STLIIHQAALM-DEGEIKCTATNRAG 532
             LII   + + D G  +C ATN  G
Sbjct: 58  GNLIISNPSEVKDSGRYQCLATNSIG 83



 Score = 34.1 bits (78), Expect = 0.064
 Identities = 23/64 (35%), Positives = 29/64 (45%), Gaps = 3/64 (4%)

Query: 566 IKLKVSMAGMPPPTARWLHNGEPLTSGGRYEITHTDRYLNLRISDARRA-DRGEYQAHGV 624
           + L     G P PT RWL NG  + +   Y  +  D   NL IS+     D G YQ    
Sbjct: 22  VILNCEARGNPVPTYRWLRNGTEIDTESDYRYSLIDG--NLIISNPSEVKDSGRYQCLAT 79

Query: 625 NSLG 628
           NS+G
Sbjct: 80  NSIG 83


>gnl|CDD|143170 cd04969, Ig5_Contactin_like, Fifth Ig domain of contactin.
           Ig5_Contactin_like: Fifth Ig domain of contactins.
           Contactins are neural cell adhesion molecules and are
           comprised of six Ig domains followed by four fibronectin
           type III(FnIII) domains anchored to the membrane by
           glycosylphosphatidylinositol. The first four Ig domains
           form the intermolecular binding fragment, which arranges
           as a compact U-shaped module via contacts between Ig
           domains 1 and 4, and between Ig domains 2 and 3.
           Contactin-2 (TAG-1, axonin-1) may play a part in the
           neuronal processes of neurite outgrowth, axon guidance
           and fasciculation, and neuronal migration. This group
           also includes contactin-1 and contactin-5. The different
           contactins show different expression patterns in the
           central nervous system. During development and in
           adulthood, contactin-2 is transiently expressed in
           subsets of central and peripheral neurons. Contactin-5
           is expressed specifically in the rat postnatal nervous
           system, peaking at about 3 weeks postnatal, and a lack
           of contactin-5 (NB-2) results in an impairment of
           neuronal act ivity in the rat auditory system.
           Contactin-5 is highly expressed in the adult human brain
           in the occipital lobe and in the amygdala. Contactin-1
           is differentially expressed in tumor tissues and may,
           through a RhoA mechanism, facilitate invasion and
           metastasis of human lung adenocarcinoma.
          Length = 73

 Score = 39.3 bits (92), Expect = 6e-04
 Identities = 17/55 (30%), Positives = 23/55 (41%), Gaps = 3/55 (5%)

Query: 574 GMPPPTARWLHNGEPLTSGGRYEITHTDRYLNLRISDARRADRGEYQAHGVNSLG 628
             P PT  W    E LT+  R  I   D   +L I +  ++D G+Y     N  G
Sbjct: 12  AAPKPTISWSKGTELLTNSSRICIW-PDG--SLEILNVTKSDEGKYTCFAENFFG 63



 Score = 37.8 bits (88), Expect = 0.002
 Identities = 18/56 (32%), Positives = 24/56 (42%), Gaps = 3/56 (5%)

Query: 477 EGIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIHQAALMDEGEIKCTATNRAG 532
           +  P P +SW K    + +S R  I  D    +L I      DEG+  C A N  G
Sbjct: 11  KAAPKPTISWSKGTELLTNSSRICIWPD---GSLEILNVTKSDEGKYTCFAENFFG 63


>gnl|CDD|143206 cd05729, Ig2_FGFR_like, Second immunoglobulin (Ig)-like domain of
           fibroblast growth factor (FGF) receptor and similar
           proteins.  Ig2_FGFR_like: domain similar to the second
           immunoglobulin (Ig)-like domain of fibroblast growth
           factor (FGF) receptor. FGF receptors bind FGF signaling
           polypeptides. FGFs participate in multiple processes
           such as morphogenesis, development, and angiogenesis.
           FGFs bind to four FGF receptor tyrosine kinases (FGFR1,
           -2, -3, -4). Receptor diversity is controlled by
           alternative splicing producing splice variants with
           different ligand binding characteristics and different
           expression patterns. FGFRs have an extracellular region
           comprised of three Ig-like domains, a single
           transmembrane helix, and an intracellular tyrosine
           kinase domain. Ligand binding and specificity reside in
           the Ig-like domains 2 and 3, and the linker region that
           connects these two. FGFR activation and signaling depend
           on FGF-induced dimerization, a process involving cell
           surface heparin or heparin sulfate proteoglycans. This
           group also contains fibroblast growth factor (FGF)
           receptor_like-1(FGFRL1). FGFRL1 does not have a protein
           tyrosine kinase domain at its C terminus; neither does
           its cytoplasmic domain appear to interact with a
           signaling partner. It has been suggested that FGFRL1 may
           not have any direct signaling function, but instead acts
           as a decoy receptor trapping FGFs and preventing them
           from binding other receptors.
          Length = 85

 Score = 39.7 bits (93), Expect = 6e-04
 Identities = 20/77 (25%), Positives = 31/77 (40%), Gaps = 1/77 (1%)

Query: 563 GEIIKLKVSMAGMPPPTARWLHNGEPLTSGGRYEITHT-DRYLNLRISDARRADRGEYQA 621
           G  ++LK   +G P PT  WL +G+P     R        +   L +     +D G+Y  
Sbjct: 9   GSTVRLKCPASGNPRPTITWLKDGKPFKKEHRIGGYKVRKKKWTLILESVVPSDSGKYTC 68

Query: 622 HGVNSLGEDVASFLVTV 638
              N  G    ++ V V
Sbjct: 69  IVENKYGSINHTYKVDV 85



 Score = 37.7 bits (88), Expect = 0.003
 Identities = 17/66 (25%), Positives = 22/66 (33%), Gaps = 1/66 (1%)

Query: 469 KVEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRIV-TDNDISTLIIHQAALMDEGEIKCTA 527
            V       G P P ++W KDG       R           TLI+      D G+  C  
Sbjct: 11  TVRLKCPASGNPRPTITWLKDGKPFKKEHRIGGYKVRKKKWTLILESVVPSDSGKYTCIV 70

Query: 528 TNRAGH 533
            N+ G 
Sbjct: 71  ENKYGS 76


>gnl|CDD|143300 cd05892, Ig_Myotilin_C, C-terminal immunoglobulin (Ig)-like domain
           of myotilin.  Ig_Myotilin_C: C-terminal immunoglobulin
           (Ig)-like domain of myotilin. Mytolin belongs to the
           palladin-myotilin-myopalladin family. Proteins belonging
           to the latter family contain multiple Ig-like domains
           and function as scaffolds, modulating actin
           cytoskeleton. Myotilin is most abundant in skeletal and
           cardiac muscle, and is involved in maintaining sarcomere
           integrity. It binds to alpha-actinin, filamin and actin.
           Mutations in myotilin lead to muscle disorders.
          Length = 75

 Score = 38.4 bits (89), Expect = 0.001
 Identities = 22/73 (30%), Positives = 35/73 (47%), Gaps = 2/73 (2%)

Query: 470 VEFTVQVEGIPTPKVSWYKDGFEI-FSSRRQRIVTDNDIS-TLIIHQAALMDEGEIKCTA 527
           V+   Q+  IP PK+ W ++   + +++ R  +  DN    TL+I      D G    +A
Sbjct: 1   VKLECQISAIPPPKIFWKRNNEMVQYNTDRISLYQDNSGRVTLLIKNVNKKDAGWYTVSA 60

Query: 528 TNRAGHSITKARL 540
            N AG +   ARL
Sbjct: 61  VNEAGVATCHARL 73



 Score = 31.1 bits (70), Expect = 0.45
 Identities = 17/65 (26%), Positives = 29/65 (44%), Gaps = 2/65 (3%)

Query: 566 IKLKVSMAGMPPPTARWLHNGEPLT-SGGRYEITHTDR-YLNLRISDARRADRGEYQAHG 623
           +KL+  ++ +PPP   W  N E +  +  R  +   +   + L I +  + D G Y    
Sbjct: 1   VKLECQISAIPPPKIFWKRNNEMVQYNTDRISLYQDNSGRVTLLIKNVNKKDAGWYTVSA 60

Query: 624 VNSLG 628
           VN  G
Sbjct: 61  VNEAG 65


>gnl|CDD|143221 cd05744, Ig_Myotilin_C_like, Immunoglobulin (Ig)-like domain of
           myotilin, palladin, and myopalladin.
           Ig_Myotilin_like_C: immunoglobulin (Ig)-like domain in
           myotilin, palladin, and myopalladin.  Myotilin,
           palladin, and myopalladin function as scaffolds that
           regulate actin organization. Myotilin and myopalladin
           are most abundant in skeletal and cardiac muscle;
           palladin is ubiquitously expressed in the organs of
           developing vertebrates and  plays a key role in cellular
           morphogenesis. The three family members each interact
           with specific molecular partners: all three bind to
           alpha-actinin; in addition, palladin also binds to
           vasodilator-stimulated phosphoprotein (VASP) and ezrin,
           myotilin binds to filamin and actin, and myopalladin
           also binds to nebulin and cardiac ankyrin repeat protein
           (CARP).
          Length = 75

 Score = 38.2 bits (89), Expect = 0.001
 Identities = 22/73 (30%), Positives = 33/73 (45%), Gaps = 2/73 (2%)

Query: 470 VEFTVQVEGIPTPKVSWYKDGFEI-FSSRRQRIVTDN-DISTLIIHQAALMDEGEIKCTA 527
           V    +V  IP P++ W K+   + +++ R  +  DN     L+I  A   D G    +A
Sbjct: 1   VRLECRVSAIPPPQIFWKKNNEMLTYNTDRISLYQDNCGRICLLIQNANKEDAGWYTVSA 60

Query: 528 TNRAGHSITKARL 540
            N AG     ARL
Sbjct: 61  VNEAGVVSCNARL 73



 Score = 31.3 bits (71), Expect = 0.39
 Identities = 19/65 (29%), Positives = 31/65 (47%), Gaps = 2/65 (3%)

Query: 566 IKLKVSMAGMPPPTARWLHNGEPLT-SGGRYEITHTDR-YLNLRISDARRADRGEYQAHG 623
           ++L+  ++ +PPP   W  N E LT +  R  +   +   + L I +A + D G Y    
Sbjct: 1   VRLECRVSAIPPPQIFWKKNNEMLTYNTDRISLYQDNCGRICLLIQNANKEDAGWYTVSA 60

Query: 624 VNSLG 628
           VN  G
Sbjct: 61  VNEAG 65


>gnl|CDD|143242 cd05765, Ig_3, Subgroup of the immunoglobulin (Ig) superfamily.
           Ig_3: subgroup of the immunoglobulin (Ig) domain found
           in the Ig superfamily. The Ig superfamily is a
           heterogenous group of proteins, built on a common fold
           comprised of a sandwich of two beta sheets. Members of
           the Ig superfamily are components of immunoglobulin,
           neuroglia, cell surface glycoproteins, such as T-cell
           receptors, CD2, CD4, CD8, and membrane glycoproteins,
           such as butyrophilin and chondroitin sulfate
           proteoglycan core protein. A predominant feature of most
           Ig domains is a disulfide bridge connecting the two
           beta-sheets with a tryptophan residue packed against the
           disulfide bond.
          Length = 81

 Score = 38.3 bits (89), Expect = 0.001
 Identities = 25/71 (35%), Positives = 31/71 (43%), Gaps = 7/71 (9%)

Query: 468 EKVEFTVQVEGIPTPKVSWYK--DGFEIFSSR----RQRIVTDNDISTLIIHQAALMDEG 521
           E   F   V G P P+++W K   G E    R    R  +V  N I  L+I+ A   D G
Sbjct: 2   ETASFHCDVTGRPPPEITWEKQVHGKENLIMRPNHVRGNVVVTN-IGQLVIYNAQPQDAG 60

Query: 522 EIKCTATNRAG 532
              CTA N  G
Sbjct: 61  LYTCTARNSGG 71


>gnl|CDD|143177 cd04976, Ig2_VEGFR, Second immunoglobulin (Ig)-like domain of
           vascular endothelial growth factor receptor (VEGFR).
           Ig2_VEGFR: Second immunoglobulin (Ig)-like domain of
           vascular endothelial growth factor receptor (VEGFR). The
           VEGFRs have an extracellular component with seven
           Ig-like domains, a transmembrane segment, and an
           intracellular tyrosine kinase domain interrupted by a
           kinase-insert domain. The VEGFR family consists of three
           members, VEGFR-1 (Flt-1), VEGFR-2 (KDR/Flk-1) and
           VEGFR-3 (Flt-4). VEGFRs bind VEGFs with high affinity at
           the Ig-like domains. VEGF-A is important to the growth
           and maintenance of vascular endothelial cells and to the
           development of new blood- and lymphatic-vessels in
           physiological and pathological states. VEGFR-2 is a
           major mediator of the mitogenic, angiogenic and
           microvascular permeability-enhancing effects of VEGF-A.
           VEGFR-1 may play an inhibitory part in these processes
           by binding VEGF and interfering with its interaction
           with VEGFR-2. VEGFR-1 has a signaling role in mediating
           monocyte chemotaxis. VEGFR-2 and -1 may mediate a
           chemotactic and a survival signal in hematopoietic stem
           cells or leukemia cells. VEGFR-3 has been shown to be
           involved in tumor angiogenesis and growth.
          Length = 71

 Score = 38.2 bits (89), Expect = 0.001
 Identities = 15/54 (27%), Positives = 24/54 (44%), Gaps = 4/54 (7%)

Query: 566 IKLKVSMAGMPPPTARWLHNGEPLTSGGRYEITHTDRYLNLRISDARRADRGEY 619
           ++L V +   PPP  +W  NG+ ++   R        + +L I D    D G Y
Sbjct: 1   VRLPVKVKAYPPPEIQWYKNGKLISEKNRT---KKSGH-SLTIKDVTEEDAGNY 50



 Score = 32.0 bits (73), Expect = 0.20
 Identities = 17/62 (27%), Positives = 26/62 (41%), Gaps = 4/62 (6%)

Query: 470 VEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIHQAALMDEGEIKCTATN 529
           V   V+V+  P P++ WYK+G  I S + +         +L I      D G      TN
Sbjct: 1   VRLPVKVKAYPPPEIQWYKNGKLI-SEKNRTKK---SGHSLTIKDVTEEDAGNYTVVLTN 56

Query: 530 RA 531
           + 
Sbjct: 57  KQ 58


>gnl|CDD|143277 cd05869, Ig5_NCAM-1, Fifth immunoglobulin (Ig)-like domain of
           Neural Cell Adhesion Molecule NCAM-1 (NCAM).
           Ig5_NCAM-1: The fifth immunoglobulin (Ig)-like domain of
           Neural Cell Adhesion Molecule NCAM-1 (NCAM). NCAM plays
           important roles in the development and regeneration of
           the central nervous system, in synaptogenesis and neural
           migration. NCAM mediates cell-cell and cell-substratum
           recognition and adhesion via homophilic (NCAM-NCAM) and
           heterophilic (NCAM-non-NCAM) interactions. NCAM is
           expressed as three major isoforms having different
           intracellular extensions. The extracellular portion of
           NCAM has five N-terminal Ig-like domains and two
           fibronectin type III domains. The double zipper adhesion
           complex model for NCAM homophilic binding involves Ig1,
           Ig2, and Ig3. By this model, Ig1 and Ig2 mediate
           dimerization of NCAM molecules situated on the same cell
           surface (cis interactions), and Ig3 domains mediate
           interactions between NCAM molecules expressed on the
           surface of opposing cells (trans interactions), through
           binding to the Ig1 and Ig2 domains. The adhesive ability
           of NCAM is modulated by the addition of polysialic acid
           chains to the fifth Ig-like domain.
          Length = 97

 Score = 38.4 bits (89), Expect = 0.002
 Identities = 24/87 (27%), Positives = 39/87 (44%), Gaps = 8/87 (9%)

Query: 456 IRALHDTTALE-DEKVEFTVQVEGIPTPKVSWYKDGFEIFSSRRQ----RIVTDNDI--S 508
           I  + + TA+E +E++  T +  G P P ++W +      SS  +     IV  +    S
Sbjct: 5   ITYVENQTAMELEEQITLTCEASGDPIPSITW-RTSTRNISSEEKTLDGHIVVRSHARVS 63

Query: 509 TLIIHQAALMDEGEIKCTATNRAGHSI 535
           +L +      D GE  CTA+N  G   
Sbjct: 64  SLTLKYIQYTDAGEYLCTASNTIGQDS 90



 Score = 28.0 bits (62), Expect = 7.9
 Identities = 22/83 (26%), Positives = 34/83 (40%), Gaps = 5/83 (6%)

Query: 561 EMGEIIKLKVSMAGMPPPTARWLH-----NGEPLTSGGRYEITHTDRYLNLRISDARRAD 615
           E+ E I L    +G P P+  W       + E  T  G   +    R  +L +   +  D
Sbjct: 15  ELEEQITLTCEASGDPIPSITWRTSTRNISSEEKTLDGHIVVRSHARVSSLTLKYIQYTD 74

Query: 616 RGEYQAHGVNSLGEDVASFLVTV 638
            GEY     N++G+D  S  + V
Sbjct: 75  AGEYLCTASNTIGQDSQSMYLEV 97


>gnl|CDD|143237 cd05760, Ig2_PTK7, Second immunoglobulin (Ig)-like domain of
           protein tyrosine kinase (PTK) 7, also known as CCK4.
           Ig2_PTK7: domain similar to the second immunoglobulin
           (Ig)-like domain in protein tyrosine kinase (PTK) 7,
           also known as CCK4. PTK7 is a subfamily of the receptor
           protein tyrosine kinase family, and is referred to as an
           RPTK-like molecule. RPTKs transduce extracellular
           signals across the cell membrane, and play important
           roles in regulating cell proliferation, migration, and
           differentiation. PTK7 is organized as an extracellular
           portion having seven Ig-like domains, a single
           transmembrane region, and a cytoplasmic tyrosine
           kinase-like domain. PTK7 is considered a pseudokinase as
           it has several unusual residues in some of the highly
           conserved tyrosine kinase (TK) motifs; it is predicted
           to lack TK activity. PTK7 may function as a
           cell-adhesion molecule. PTK7 mRNA is expressed at high
           levels in placenta, melanocytes, liver, lung, pancreas,
           and kidney. PTK7 is overexpressed in several cancers,
           including melanoma and colon cancer lines.
          Length = 77

 Score = 37.2 bits (86), Expect = 0.003
 Identities = 24/78 (30%), Positives = 38/78 (48%), Gaps = 5/78 (6%)

Query: 566 IKLKVSMAGMPPPTARWLHNGEPLT-SGGRYEITHTDRYLNLRISDARRADRGEYQAHGV 624
           + L+  + G P PT +W  +G PL+   G Y ++  +R L LR   A   D G Y     
Sbjct: 1   VTLRCHIDGHPRPTYQWFRDGTPLSDGQGNYSVSSKERTLTLR--SAGPDDSGLYYCCAH 58

Query: 625 NSLGEDVAS--FLVTVTD 640
           N+ G   +S  F +++ D
Sbjct: 59  NAFGSVCSSQNFTLSIID 76



 Score = 35.3 bits (81), Expect = 0.016
 Identities = 17/64 (26%), Positives = 26/64 (40%), Gaps = 1/64 (1%)

Query: 470 VEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIHQAALMDEGEIKCTATN 529
           V     ++G P P   W++DG  +   +    V+  +  TL +  A   D G   C A N
Sbjct: 1   VTLRCHIDGHPRPTYQWFRDGTPLSDGQGNYSVSSKE-RTLTLRSAGPDDSGLYYCCAHN 59

Query: 530 RAGH 533
             G 
Sbjct: 60  AFGS 63


>gnl|CDD|206066 pfam13895, Ig_2, Immunoglobulin domain.  This domain contains
           immunoglobulin-like domains.
          Length = 80

 Score = 37.0 bits (86), Expect = 0.004
 Identities = 23/76 (30%), Positives = 29/76 (38%), Gaps = 10/76 (13%)

Query: 462 TTALEDEKVEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIHQAALMDEG 521
           T   E E V  T    G P P  +WYKDG  + SS+       N   T  +      D G
Sbjct: 9   TVVFEGEDVTLTCSAPGNPPPNYTWYKDGVPLSSSQ-------NGFFTPNVSAE---DSG 58

Query: 522 EIKCTATNRAGHSITK 537
              C A+N  G   + 
Sbjct: 59  TYTCVASNGGGGKTSN 74



 Score = 30.9 bits (70), Expect = 0.52
 Identities = 15/56 (26%), Positives = 19/56 (33%), Gaps = 5/56 (8%)

Query: 545 PPTIRLPKQYEDGLLFEMGEIIKLKVSMAGMPPPTARWLHNGEPLTSGGRYEITHT 600
            P +              GE + L  S  G PPP   W  +G PL+S      T  
Sbjct: 1   KPVLTPSPTV-----VFEGEDVTLTCSAPGNPPPNYTWYKDGVPLSSSQNGFFTPN 51


>gnl|CDD|143317 cd07693, Ig1_Robo, First immunoglobulin (Ig)-like domain in Robo
           (roundabout) receptors and similar proteins.  Ig1_Robo:
           domain similar to the first immunoglobulin (Ig)-like
           domain in Robo (roundabout) receptors. Robo receptors
           play a role in the development of the central nervous
           system (CNS), and are receptors of Slit protein. Slit is
           a repellant secreted by the neural cells in the midline.
           Slit acts through Robo to prevent most neurons from
           crossing the midline from either side. Three mammalian
           Robo homologs (robo1, -2, and -3), and three mammalian
           Slit homologs (Slit-1,-2, -3), have been identified.
           Commissural axons, which cross the midline, express low
           levels of Robo; longitudinal axons, which avoid the
           midline, express high levels of Robo. robo1, -2, and -3
           are expressed by commissural neurons in the vertebrate
           spinal cord and Slits 1, -2, -3 are expressed at the
           ventral midline. Robo-3 is a divergent member of the
           Robo family which instead of being a positive regulator
           of slit responsiveness, antagonizes slit responsiveness
           in precrossing axons.  The Slit-Robo interaction is
           mediated by the second leucine-rich repeat (LRR) domain
           of Slit and the two N-terminal Ig domains of Robo, Ig1
           and Ig2. The primary Robo binding site for Slit2 has
           been shown by surface plasmon resonance experiments and
           mutational analysis to be is the Ig1 domain, while the
           Ig2 domain has been proposed to harbor a weak secondary
           binding site.
          Length = 100

 Score = 37.5 bits (87), Expect = 0.004
 Identities = 24/102 (23%), Positives = 40/102 (39%), Gaps = 11/102 (10%)

Query: 452 APSFIRALHDTTALEDEKVEFTVQVEGIPTPKVSWYKDGFEI----FSSRRQRIVTDNDI 507
            P  +    D    + +      + EG PTP + W K+G  +       R  RIV  +  
Sbjct: 1   PPRIVEHPSDLIVSKGDPATLNCKAEGRPTPTIQWLKNGQPLETDKDDPRSHRIVLPS-- 58

Query: 508 STL----IIH-QAALMDEGEIKCTATNRAGHSITKARLRLEA 544
            +L    ++H +    DEG   C A N  G ++++      A
Sbjct: 59  GSLFFLRVVHGRKGRSDEGVYVCVAHNSLGEAVSRNASLEVA 100



 Score = 32.9 bits (75), Expect = 0.15
 Identities = 29/96 (30%), Positives = 41/96 (42%), Gaps = 10/96 (10%)

Query: 545 PPTIRLPKQYEDGLLFEMGEIIKLKVSMAGMPPPTARWLHNGEPLTSGGRYEITH----- 599
           PP I    ++   L+   G+   L     G P PT +WL NG+PL +      +H     
Sbjct: 1   PPRI---VEHPSDLIVSKGDPATLNCKAEGRPTPTIQWLKNGQPLETDKDDPRSHRIVLP 57

Query: 600 TDRYLNLRISDARRA--DRGEYQAHGVNSLGEDVAS 633
           +     LR+   R+   D G Y     NSLGE V+ 
Sbjct: 58  SGSLFFLRVVHGRKGRSDEGVYVCVAHNSLGEAVSR 93


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
            binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 40.7 bits (95), Expect = 0.005
 Identities = 41/219 (18%), Positives = 85/219 (38%), Gaps = 33/219 (15%)

Query: 854  DNEDDYDI-VETNEHTGTGAPSDNENESDYFPEKTIDE------------------SVYG 894
            + E + +I  +  +H G     + E+E +   E T DE                     G
Sbjct: 692  EQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEG 751

Query: 895  YDTIVYGYDSDDLDRHYPTLDEEEEEEDRESLVKDRESSVKGKEEEAKVIKDDEYYENLG 954
               +    D  + +    T  E +E+ED   +    +  +KG E     ++ +   E   
Sbjct: 752  KHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGE 811

Query: 955  DVLTKKYSLPVNSDIQIKIDKPDDEPDYVIKGKYEVDNEMLLKRSKLKPQYSSEMSEASN 1014
                +  S     D ++K +  + E +   +G+ + D +              +    S+
Sbjct: 812  KDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEK------------GVDGGGGSD 859

Query: 1015 ITDDEDEEDEEDSFDFDELFEDNPEEEYDEDDRDQPINF 1053
              D E+EE+EE+  + +E  E+  EEE +E++ ++P++ 
Sbjct: 860  GGDSEEEEEEEEEEEEEE--EEEEEEEEEEEENEEPLSL 896


>gnl|CDD|143265 cd05857, Ig2_FGFR, Second immunoglobulin (Ig)-like domain of
           fibroblast growth factor (FGF) receptor.  Ig2_FGFR:
           second immunoglobulin (Ig)-like domain of fibroblast
           growth factor (FGF) receptor. FGF receptors bind FGF
           signaling polypeptides. FGFs participate in multiple
           processes such as morphogenesis, development, and
           angiogenesis. FGFs bind to four FGF receptor tyrosine
           kinases (FGFR1, -2, -3, -4). Receptor diversity is
           controlled by alternative splicing producing splice
           variants with different ligand binding characteristics
           and different expression patterns. FGFRs have an
           extracellular region comprised of three IG-like domains,
           a single transmembrane helix, and an intracellular
           tyrosine kinase domain. Ligand binding and specificity
           reside in the Ig-like domains 2 and 3, and the linker
           region that connects these two. FGFR activation and
           signaling depend on FGF-induced dimerization, a process
           involving cell surface heparin or heparin sulfate
           proteoglycans.
          Length = 85

 Score = 36.8 bits (85), Expect = 0.006
 Identities = 17/60 (28%), Positives = 29/60 (48%), Gaps = 5/60 (8%)

Query: 563 GEIIKLKVSMAGMPPPTARWLHNGEPLTSGGR---YEITHTDRYLNLRISDARRADRGEY 619
              +K +   AG P PT RWL NG+      R   Y++ +  ++ +L +     +D+G Y
Sbjct: 9   ANTVKFRCPAAGNPTPTMRWLKNGKEFKQEHRIGGYKVRN--QHWSLIMESVVPSDKGNY 66



 Score = 33.3 bits (76), Expect = 0.091
 Identities = 19/64 (29%), Positives = 25/64 (39%), Gaps = 1/64 (1%)

Query: 470 VEFTVQVEGIPTPKVSWYKDGFEIFSSRR-QRIVTDNDISTLIIHQAALMDEGEIKCTAT 528
           V+F     G PTP + W K+G E     R       N   +LI+      D+G   C   
Sbjct: 12  VKFRCPAAGNPTPTMRWLKNGKEFKQEHRIGGYKVRNQHWSLIMESVVPSDKGNYTCVVE 71

Query: 529 NRAG 532
           N  G
Sbjct: 72  NEYG 75


>gnl|CDD|143215 cd05738, Ig2_RPTP_IIa_LAR_like, Second immunoglobulin (Ig)-like
           domain of  the receptor protein tyrosine phosphatase
           (RPTP)-F, also known as LAR.  Ig2_RPTP_IIa_LAR_like:
           domain similar to the second immunoglobulin (Ig)-like
           domain found in the receptor protein tyrosine
           phosphatase (RPTP)-F, also known as LAR. LAR belongs to
           the RPTP type IIa subfamily. Members of this subfamily
           are cell adhesion molecule-like proteins involved in
           central nervous system (CNS) development. They have
           large extracellular portions, comprised of multiple
           Ig-like domains and two to nine fibronectin type III
           (FNIII) domains, and a cytoplasmic portion having two
           tandem phosphatase domains.
          Length = 74

 Score = 36.6 bits (84), Expect = 0.006
 Identities = 18/55 (32%), Positives = 29/55 (52%), Gaps = 1/55 (1%)

Query: 478 GIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIHQAALMDEGEIKCTATNRAG 532
           G P P+++W+KD   + ++   RI        L I  +   D+G+ +C ATN AG
Sbjct: 9   GNPDPEITWFKDFLPVDTTSNGRI-KQLRSGALQIENSEESDQGKYECVATNSAG 62



 Score = 28.9 bits (64), Expect = 2.6
 Identities = 14/56 (25%), Positives = 25/56 (44%), Gaps = 1/56 (1%)

Query: 573 AGMPPPTARWLHNGEPLTSGGRYEITHTDRYLNLRISDARRADRGEYQAHGVNSLG 628
           +G P P   W  +  P+ +     I        L+I ++  +D+G+Y+    NS G
Sbjct: 8   SGNPDPEITWFKDFLPVDTTSNGRIKQLRSGA-LQIENSEESDQGKYECVATNSAG 62


>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
          Length = 1463

 Score = 40.1 bits (94), Expect = 0.007
 Identities = 37/165 (22%), Positives = 64/165 (38%), Gaps = 31/165 (18%)

Query: 15  TILIQTLWRSKLAMRRDEREFCMIRSKTIVIQKYFRGYLLMRKE---------------- 58
              IQ   R +   RR  +    I+ K  VIQ  FR   L+  E                
Sbjct: 748 ATRIQRAIRGRYLRRRYLQALKRIK-KIQVIQHGFRLRRLVDYELKWRLFIKLQPLLSLL 806

Query: 59  --RQEYLAMKSSAVKIQEWYRNLQCMRQARQQYLALKHATLKQR--------EEFLKLKH 108
             R+EY +  +  +K+Q+  +  + +R+  +   +LK   L Q+        + F  LK 
Sbjct: 807 GSRKEYRSYLACIIKLQKTIKREKKLRETEEVEFSLKAEVLIQKFGRSLKAKKRFSLLKK 866

Query: 109 ATIAIQTLYKAKLLMKRDRAAYTELKQACVSVQQRWRANLTMRKQ 153
            TI +Q+  + +L  ++      ELK    S+      NL +  +
Sbjct: 867 ETIYLQSAQRVELAERQ----LQELKIDVKSISSLKLVNLELESE 907



 Score = 37.8 bits (88), Expect = 0.047
 Identities = 26/138 (18%), Positives = 47/138 (34%), Gaps = 16/138 (11%)

Query: 171 YRNTKLMRLEASYLHELKAATITIQRRYRANVAMRTQRERYVALRTATITIQTRFRAYLI 230
           ++   L  LE     +L      IQR  R     R  R RY+        IQ     + +
Sbjct: 728 FKAGVLAALEDMRDAKLDNIATRIQRAIRG----RYLRRRYLQALKRIKKIQVIQHGFRL 783

Query: 231 AKNQRDE---YAELKQARRFRFKLN---LRKYERVIELLKLKREQERQ------EKYRHQ 278
            +    E      +K         +    R Y   I  L+   ++E++       ++  +
Sbjct: 784 RRLVDYELKWRLFIKLQPLLSLLGSRKEYRSYLACIIKLQKTIKREKKLRETEEVEFSLK 843

Query: 279 CAVKIQSLWKMYRVRKKF 296
             V IQ   +  + +K+F
Sbjct: 844 AEVLIQKFGRSLKAKKRF 861



 Score = 33.5 bits (77), Expect = 0.85
 Identities = 19/105 (18%), Positives = 40/105 (38%), Gaps = 14/105 (13%)

Query: 210 RYVALRTATITIQTRFRAYLIAKNQRDEYAELKQARRFRFKLNLRKYERVIELLKLKREQ 269
           R   L      IQ   R   + +          QA +      ++K + +    +L+R  
Sbjct: 740 RDAKLDNIATRIQRAIRGRYLRR-------RYLQALKR-----IKKIQVIQHGFRLRRLV 787

Query: 270 ERQEKYRHQCAVKIQSLWKMYRVRKKFADIIEQKKQAKKTADNQF 314
           + + K+R    +K+Q L  +   RK++   +    + +KT   + 
Sbjct: 788 DYELKWR--LFIKLQPLLSLLGSRKEYRSYLACIIKLQKTIKREK 830


>gnl|CDD|143210 cd05733, Ig6_L1-CAM_like, Sixth immunoglobulin (Ig)-like domain of
           the L1 cell adhesion molecule (CAM) and similar
           proteins.  Ig6_L1-CAM_like: domain similar to the sixth
           immunoglobulin (Ig)-like domain of the L1 cell adhesion
           molecule (CAM).  L1 belongs to the L1 subfamily of cell
           adhesion molecules (CAMs) and is comprised of an
           extracellular region having six Ig-like domains and five
           fibronectin type III domains, a transmembrane region and
           an intracellular domain. L1 is primarily expressed in
           the nervous system and is involved in its development
           and function. L1 is associated with an X-linked
           recessive disorder, X-linked hydrocephalus, MASA
           syndrome, or spastic paraplegia type 1, that involves
           abnormalities of axonal growth. This group also contains
           NrCAM [Ng(neuronglia)CAM-related cell adhesion
           molecule], which is primarily expressed in the nervous
           system, and human neurofascin.
          Length = 77

 Score = 35.8 bits (83), Expect = 0.009
 Identities = 22/77 (28%), Positives = 35/77 (45%), Gaps = 7/77 (9%)

Query: 472 FTVQVE--GIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLII--HQAALMD--EGEIKC 525
             ++ E  G P P  SW ++G      +  R+    D  TL+I        +  EGE +C
Sbjct: 1   IVIKCEAKGNPPPTFSWTRNGTHFDPEKDPRVTMKPDSGTLVIDNMNGGRAEDYEGEYQC 60

Query: 526 TATNRAGHSIT-KARLR 541
            A+N  G +I+ +  LR
Sbjct: 61  YASNELGTAISNEIHLR 77



 Score = 29.7 bits (67), Expect = 1.6
 Identities = 21/67 (31%), Positives = 27/67 (40%), Gaps = 4/67 (5%)

Query: 566 IKLKVSMAGMPPPTARWLHNGEPLTSGGRYEITHTDR--YLNLRISDARRADR--GEYQA 621
           I +K    G PPPT  W  NG          +T       L +   +  RA+   GEYQ 
Sbjct: 1   IVIKCEAKGNPPPTFSWTRNGTHFDPEKDPRVTMKPDSGTLVIDNMNGGRAEDYEGEYQC 60

Query: 622 HGVNSLG 628
           +  N LG
Sbjct: 61  YASNELG 67


>gnl|CDD|143303 cd05895, Ig_Pro_neuregulin-1, Immunoglobulin (Ig)-like domain found
           in neuregulin (NRG)-1.  Ig_Pro_neuregulin-1:
           immunoglobulin (Ig)-like domain found in neuregulin
           (NRG)-1. There are many NRG-1 isoforms which arise from
           the alternative splicing of mRNA. NRG-1 belongs to the
           neuregulin gene family, which is comprised of four
           genes. This group represents NRG-1. NRGs are signaling
           molecules, which participate in cell-cell interactions
           in the nervous system, breast, and heart, and other
           organ systems, and are implicated in the pathology of
           diseases including schizophrenia, multiple sclerosis,
           and breast cancer. The NRG-1 protein binds to and
           activates the tyrosine kinases receptors ErbB3 and
           ErbB4, initiating signaling cascades. NRG-1 has multiple
           functions; for example, in the brain it regulates
           various processes such as radial glia formation and
           neuronal migration, dendritic development, and
           expression of neurotransmitters receptors; in the
           peripheral nervous system NRG-1 regulates processes such
           as target cell differentiation, and Schwann cell
           survival.
          Length = 76

 Score = 35.7 bits (82), Expect = 0.010
 Identities = 16/57 (28%), Positives = 27/57 (47%), Gaps = 4/57 (7%)

Query: 581 RWLHNGEPLTSGGRYE----ITHTDRYLNLRISDARRADRGEYQAHGVNSLGEDVAS 633
           +W  NG+ + +  + +    I    +   L+IS A  AD GEY+    + LG D  +
Sbjct: 17  KWFKNGKEIGAKNKPDNKIKIRKKKKSSELQISKASLADNGEYKCMVSSKLGNDSVT 73



 Score = 33.8 bits (77), Expect = 0.057
 Identities = 19/63 (30%), Positives = 32/63 (50%), Gaps = 4/63 (6%)

Query: 480 PTPKVSWYKDGFEIFSSRR----QRIVTDNDISTLIIHQAALMDEGEIKCTATNRAGHSI 535
           P+ +  W+K+G EI +  +     +I      S L I +A+L D GE KC  +++ G+  
Sbjct: 12  PSLRFKWFKNGKEIGAKNKPDNKIKIRKKKKSSELQISKASLADNGEYKCMVSSKLGNDS 71

Query: 536 TKA 538
             A
Sbjct: 72  VTA 74


>gnl|CDD|143168 cd04967, Ig1_Contactin, First Ig domain of contactin.
           Ig1_Contactin: First Ig domain of contactins. Contactins
           are neural cell adhesion molecules and are comprised of
           six Ig domains followed by four fibronectin type
           III(FnIII) domains anchored to the membrane by
           glycosylphosphatidylinositol. The first four Ig domains
           form the intermolecular binding fragment, which arranges
           as a compact U-shaped module via contacts between Ig
           domains 1 and 4, and between Ig domains 2 and 3.
           Contactin-2 (TAG-1, axonin-1) may play a part in the
           neuronal processes of neurite outgrowth, axon guidance
           and fasciculation, and neuronal migration. This group
           also includes contactin-1 and contactin-5. The different
           contactins show different expression patterns in the
           central nervous system. During development and in
           adulthood, contactin-2 is transiently expressed in
           subsets of central and peripheral neurons. Contactin-5
           is expressed specifically in the rat postnatal nervous
           system, peaking at about 3 weeks postnatal, and a lack
           of contactin-5 (NB-2) results in an impairment of
           neuronal activity in the rat auditory system.
           Contactin-5 is highly expressed in the adult human brain
           in the occipital lobe and in the amygdala. Contactin-1
           is differentially expressed in tumor tissues and may,
           through a RhoA mechanism, facilitate invasion and
           metastasis of human lung adenocarcinoma.
          Length = 91

 Score = 36.3 bits (84), Expect = 0.011
 Identities = 20/70 (28%), Positives = 29/70 (41%), Gaps = 7/70 (10%)

Query: 466 EDEKVEFTVQVEGIPTPKVSWYKDGFEIF--SSRRQRIVTDNDISTLIIHQAALM-DEGE 522
           ++ KV    +  G P P   W  +G EI      R  +V  N    L+I   +   D G 
Sbjct: 18  DEGKVSLNCRARGSPPPTYRWLMNGTEIDDEPDSRYSLVGGN----LVISNPSKAKDAGR 73

Query: 523 IKCTATNRAG 532
            +C A+N  G
Sbjct: 74  YQCLASNIVG 83



 Score = 33.2 bits (76), Expect = 0.12
 Identities = 26/67 (38%), Positives = 31/67 (46%), Gaps = 11/67 (16%)

Query: 569 KVSMA----GMPPPTARWLHNGE--PLTSGGRYEITHTDRYLNLRISDARRA-DRGEYQA 621
           KVS+     G PPPT RWL NG         RY +       NL IS+  +A D G YQ 
Sbjct: 21  KVSLNCRARGSPPPTYRWLMNGTEIDDEPDSRYSLVGG----NLVISNPSKAKDAGRYQC 76

Query: 622 HGVNSLG 628
              N +G
Sbjct: 77  LASNIVG 83


>gnl|CDD|201341 pfam00612, IQ, IQ calmodulin-binding motif.  Calmodulin-binding
           motif.
          Length = 21

 Score = 33.8 bits (79), Expect = 0.015
 Identities = 8/20 (40%), Positives = 14/20 (70%)

Query: 277 HQCAVKIQSLWKMYRVRKKF 296
            + A+KIQ+ W+ Y  RK++
Sbjct: 1   RKAAIKIQAAWRGYLARKRY 20



 Score = 28.5 bits (65), Expect = 1.2
 Identities = 8/21 (38%), Positives = 12/21 (57%)

Query: 39 RSKTIVIQKYFRGYLLMRKER 59
          R   I IQ  +RGYL  ++ +
Sbjct: 1  RKAAIKIQAAWRGYLARKRYK 21


>gnl|CDD|143275 cd05867, Ig4_L1-CAM_like, Fourth immunoglobulin (Ig)-like domain of
           the L1 cell adhesion molecule (CAM).  Ig4_L1-CAM_like:
           fourth immunoglobulin (Ig)-like domain of the L1 cell
           adhesion molecule (CAM). L1 is comprised of an
           extracellular region having six Ig-like domains and five
           fibronectin type III domains, a transmembrane region and
           an intracellular domain. L1 is primarily expressed in
           the nervous system and is involved in its development
           and function. L1 is associated with an X-linked
           recessive disorder, X-linked hydrocephalus, MASA
           syndrome, or spastic paraplegia type 1, that involves
           abnormalities of axonal growth. This group also contains
           the chicken neuron-glia cell adhesion molecule, Ng-CAM.
          Length = 76

 Score = 35.2 bits (81), Expect = 0.017
 Identities = 22/76 (28%), Positives = 34/76 (44%), Gaps = 7/76 (9%)

Query: 468 EKVEFTVQVEGIPTPKVSWYKDGFEIFSS---RRQRIVTDNDISTLIIHQAALMDEGEIK 524
           E      QVEGIPTP ++W  +G  I  +    R+ + +      LI+      D    +
Sbjct: 2   ETARLDCQVEGIPTPNITWSINGAPIEGTDPDPRRHVSS----GALILTDVQPSDTAVYQ 57

Query: 525 CTATNRAGHSITKARL 540
           C A NR G+ +  A +
Sbjct: 58  CEARNRHGNLLANAHV 73


>gnl|CDD|143176 cd04975, Ig4_SCFR_like, Fourth immunoglobulin (Ig)-like domain of
           stem cell factor receptor (SCFR) and similar proteins.
           Ig4_SCFR_like; fourth immunoglobulin (Ig)-like domain of
           stem cell factor receptor (SCFR). In addition to SCFR
           this group also includes the fourth Ig domain of
           platelet-derived growth factor receptors (PDGFR), alpha
           and beta, the fourth Ig domain of macrophage colony
           stimulating factor (M-CSF), and the Ig domain of the
           receptor tyrosine kinase KIT. SCFR and the PDGFR alpha
           and beta have similar organization: an extracellular
           component having five Ig-like domains, a transmembrane
           segment, and a cytoplasmic portion having protein
           tyrosine kinase activity. SCFR and its ligand SCF are
           critical for normal hematopoiesis, mast cell
           development, melanocytes and gametogenesis. SCF binds to
           the second and third Ig-like domains of SCFR, this
           fourth Ig-like domain participates in SCFR dimerization,
           which follows ligand binding. Deletion of this fourth
           SCFR_Ig-like domain abolishes the ligand-induced
           dimerization of SCFR and completely inhibits signal
           transduction. PDGF is a potent mitogen for connective
           tissue cells. PDGF-stimulated processes are mediated by
           three different PDGFs (PDGF-A,-B, and C). PDGFR alpha
           binds to all three PDGFs, whereas the PDGFR beta, binds
           only to PDGF-B. In mice, PDGFR alpha, and PDGFR beta,
           are essential for normal development.
          Length = 101

 Score = 35.8 bits (83), Expect = 0.017
 Identities = 21/84 (25%), Positives = 37/84 (44%), Gaps = 6/84 (7%)

Query: 561 EMGEIIKLKVSM-AGMPPPTARWLHNGEPLTSGGRYEIT----HTDRYLN-LRISDARRA 614
            +GE + L V + A  PPP   W ++   LT+     +T       RY++ L++   + +
Sbjct: 16  NLGENLNLVVEVEAYPPPPHINWTYDNRTLTNKLTEIVTSENESEYRYVSELKLVRLKES 75

Query: 615 DRGEYQAHGVNSLGEDVASFLVTV 638
           + G Y     NS      +F + V
Sbjct: 76  EAGTYTFLASNSDASKSLTFELYV 99


>gnl|CDD|143179 cd04978, Ig4_L1-NrCAM_like, Fourth immunoglobulin (Ig)-like domain
           of L1, Ng-CAM (Neuron-glia CAM cell adhesion molecule),
           and NrCAM (Ng-CAM-related).  Ig4_L1-NrCAM_like: fourth
           immunoglobulin (Ig)-like domain of L1, Ng-CAM
           (Neuron-glia CAM cell adhesion molecule), and NrCAM
           (Ng-CAM-related). These proteins belong to the L1
           subfamily of cell adhesion molecules (CAMs) and are
           comprised of an extracellular region having six Ig-like
           domains and five fibronectin type III domains, a
           transmembrane region and an intracellular domain. These
           molecules are primarily expressed in the nervous system.
           L1 is associated with an X-linked recessive disorder,
           X-linked hydrocephalus, MASA syndrome, or spastic
           paraplegia type 1, that involves abnormalities of axonal
           growth.
          Length = 76

 Score = 35.1 bits (81), Expect = 0.020
 Identities = 20/67 (29%), Positives = 28/67 (41%), Gaps = 5/67 (7%)

Query: 468 EKVEFTVQVEGIPTPKVSWYKDG--FEIFSSRRQRIVTDNDISTLIIHQAALMDEGEIKC 525
           E      + EGIP P ++W  +G   E      +R V   D  TLI+      D    +C
Sbjct: 2   ETGRLDCEAEGIPQPTITWRLNGVPIEELPPDPRRRV---DGGTLILSNVQPNDTAVYQC 58

Query: 526 TATNRAG 532
            A+N  G
Sbjct: 59  NASNVHG 65



 Score = 33.1 bits (76), Expect = 0.076
 Identities = 23/81 (28%), Positives = 32/81 (39%), Gaps = 11/81 (13%)

Query: 563 GEIIKLKVSMAGMPPPTARWLHNGEPLTSGGRYEITHTDRYLN-----LRISDARRADRG 617
           GE  +L     G+P PT  W  NG P+      E    D         L +S+ +  D  
Sbjct: 1   GETGRLDCEAEGIPQPTITWRLNGVPI------EELPPDPRRRVDGGTLILSNVQPNDTA 54

Query: 618 EYQAHGVNSLGEDVASFLVTV 638
            YQ +  N  G  +A+  V V
Sbjct: 55  VYQCNASNVHGYLLANAFVHV 75


>gnl|CDD|143203 cd05726, Ig4_Robo, Third immunoglobulin (Ig)-like domain in Robo
           (roundabout) receptors.  Ig4_Robo: domain similar to the
           third immunoglobulin (Ig)-like domain in Robo
           (roundabout) receptors. Robo receptors play a role in
           the development of the central nervous system (CNS), and
           are receptors of Slit protein. Slit is a repellant
           secreted by the neural cells in the midline. Slit acts
           through Robo to prevent most neurons from crossing the
           midline from either side. Three mammalian Robo homologs
           (robo1, -2, and -3), and three mammalian Slit homologs
           (Slit-1,-2, -3), have been identified. Commissural
           axons, which cross the midline, express low levels of
           Robo; longitudinal axons, which avoid the midline,
           express high levels of Robo. robo1, -2, and -3 are
           expressed by commissural neurons in the vertebrate
           spinal cord and Slits 1, -2, -3 are expressed at the
           ventral midline. Robo-3 is a divergent member of the
           Robo family which instead of being a positive regulator
           of slit responsiveness, antagonizes slit responsiveness
           in precrossing axons.  The Slit-Robo interaction is
           mediated by the second leucine-rich repeat (LRR) domain
           of Slit and the two N-terminal Ig domains of Robo, Ig1
           and Ig2. The primary Robo binding site for Slit2 has
           been shown by surface plasmon resonance experiments and
           mutational analysis to be is the Ig1 domain, while the
           Ig2 domain has been proposed to harbor a weak secondary
           binding site.
          Length = 90

 Score = 35.3 bits (81), Expect = 0.021
 Identities = 25/87 (28%), Positives = 36/87 (41%), Gaps = 5/87 (5%)

Query: 470 VEFTVQVEGIPTPKVSWYKDGFE--IFSSRRQRIVTDNDIST---LIIHQAALMDEGEIK 524
           V F  +  G P P + W K+G +  +FS +  +  +   +S    L I      D G   
Sbjct: 4   VTFQCEATGNPQPAIFWQKEGSQNLLFSYQPPQSSSRFSVSQTGDLTITNVQRSDVGYYI 63

Query: 525 CTATNRAGHSITKARLRLEAPPTIRLP 551
           C   N AG  +TKA L +      R P
Sbjct: 64  CQTLNVAGSILTKAYLEVTDVIADRPP 90



 Score = 33.8 bits (77), Expect = 0.067
 Identities = 21/86 (24%), Positives = 35/86 (40%), Gaps = 11/86 (12%)

Query: 563 GEIIKLKVSMAGMPPPTARWLHNG--------EPLTSGGRYEITHTDRYLNLRISDARRA 614
           G  +  +    G P P   W   G        +P  S  R+ ++ T    +L I++ +R+
Sbjct: 1   GRTVTFQCEATGNPQPAIFWQKEGSQNLLFSYQPPQSSSRFSVSQTG---DLTITNVQRS 57

Query: 615 DRGEYQAHGVNSLGEDVASFLVTVTD 640
           D G Y    +N  G  +    + VTD
Sbjct: 58  DVGYYICQTLNVAGSILTKAYLEVTD 83


>gnl|CDD|235033 PRK02363, PRK02363, DNA-directed RNA polymerase subunit delta;
            Reviewed.
          Length = 129

 Score = 35.8 bits (83), Expect = 0.030
 Identities = 17/57 (29%), Positives = 31/57 (54%), Gaps = 2/57 (3%)

Query: 989  EVDNEMLLKRSKLKPQYSSEMSEASNITDDEDEEDEEDSFDFDELFEDNPEEEYDED 1045
            E+D E++    K   +    M    +I DD+   D++  FD ++L E++ E+E DE+
Sbjct: 75   EIDEEIIPLEEKFDKKKKKFMDGDDDIIDDDILPDDD--FDEEDLDEEDDEDEEDEE 129


>gnl|CDD|143212 cd05735, Ig8_DSCAM, Eight immunoglobulin (Ig) domain of Down
           Syndrome Cell Adhesion molecule (DSCAM).  Ig8_DSCAM:
           the eight immunoglobulin (Ig) domain of Down Syndrome
           Cell Adhesion molecule (DSCAM). DSCAM is a cell adhesion
           molecule expressed largely in the developing nervous
           system. The gene encoding DSCAM is located at human
           chromosome 21q22, the locus associated with the mental
           retardation phenotype of Down Syndrome. DSCAM is
           predicted to be the largest member of the IG
           superfamily. It has been demonstrated that DSCAM can
           mediate cation-independent homophilic intercellular
           adhesion.
          Length = 88

 Score = 34.6 bits (79), Expect = 0.031
 Identities = 16/40 (40%), Positives = 21/40 (52%)

Query: 606 LRISDARRADRGEYQAHGVNSLGEDVASFLVTVTDRPLPP 645
           L+I    R D G +  H +NS GED     +TV + P PP
Sbjct: 49  LQILPTVREDSGFFSCHAINSYGEDRGIIQLTVQEPPDPP 88


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 38.1 bits (88), Expect = 0.032
 Identities = 38/184 (20%), Positives = 82/184 (44%), Gaps = 16/184 (8%)

Query: 905  DDLDRHYPTLDEEEEEEDRESLVKDRESSVKGKEEEAKVIKDDEYYENLGDVLTKKYSLP 964
            DDL+      D   + +  ES  ++ ES + G  +++ V +++              S  
Sbjct: 3835 DDLEELANEEDTANQSDLDESEARELESDMNGVTKDSVVSENEN-----------SDSEE 3883

Query: 965  VNSDIQIKIDK-PDDEPDYVIKGKYEVDNEMLLKRSKLKPQYSSEMSEASNITDDEDEED 1023
             N D+  +++  P+D  + + +  ++  NE  L  ++   Q S+E S A+N +D   +ED
Sbjct: 3884 ENQDLDEEVNDIPEDLSNSLNEKLWDEPNEEDLLETE---QKSNEQSAANNESDLVSKED 3940

Query: 1024 EEDSF-DFDELFEDNPEEEYDEDDRDQPINFARNRHNKYIEDDQEEIYHPKLMTMRSSQE 1082
            +  +  D D   +++ EE  D+   D  I      +N     + E++  P+ + +   + 
Sbjct: 3941 DNKALEDKDRQEKEDEEEMSDDVGIDDEIQPDIQENNSQPPPENEDLDLPEDLKLDEKEG 4000

Query: 1083 DLDE 1086
            D+ +
Sbjct: 4001 DVSK 4004



 Score = 37.7 bits (87), Expect = 0.042
 Identities = 50/251 (19%), Positives = 96/251 (38%), Gaps = 52/251 (20%)

Query: 846  SSFRDKYVDNEDDYDIVETNEHTGTGAPSDNENESDYFPEKTIDESVYGYDTIVYGYDSD 905
            +S  +K  D  ++ D++ET + +   + ++NE++                        S 
Sbjct: 3901 NSLNEKLWDEPNEEDLLETEQKSNEQSAANNESD----------------------LVSK 3938

Query: 906  DLDRHYPTLDEEEEEEDRESLVKDRESSVKGKEEEAKVIKDDEYYENLGDVLTKKYSLPV 965
            + D       + +E+ED E +  D      G ++E   I+ D   EN      +   L +
Sbjct: 3939 EDDNKALEDKDRQEKEDEEEMSDD-----VGIDDE---IQPDI-QENNSQPPPENEDLDL 3989

Query: 966  NSDIQI-----KIDKPDDEPDYVI----KGKYEVDNEMLLKRSKLKPQYSSEMSEASNIT 1016
              D+++      + K  D  D  +    + K E D E      K +P    +  E +N T
Sbjct: 3990 PEDLKLDEKEGDVSKDSDLEDMDMEAADENKEEADAE------KDEPMQDEDPLEENN-T 4042

Query: 1017 DDEDEEDEEDS---FDFDELFEDNPEEEY--DEDDRDQPINFARNRHNKYIEDDQEEIYH 1071
             DED + ++ S    D +++ ED  EE    +E+  +  +          + +DQ    H
Sbjct: 4043 LDEDIQQDDFSDLAEDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPEDQAIDNH 4102

Query: 1072 PKLMTMRSSQE 1082
            PK+    +   
Sbjct: 4103 PKMDAKSTFAS 4113



 Score = 35.0 bits (80), Expect = 0.28
 Identities = 43/209 (20%), Positives = 74/209 (35%), Gaps = 40/209 (19%)

Query: 854  DNEDDYDIVE----TNEHTGTGAPSDNENESDYFPEKTIDESVYGYDTIVYGYDSDDLDR 909
             + D+ +  E     N  T     S+NEN       + +DE V            +D+  
Sbjct: 3850 SDLDESEARELESDMNGVTKDSVVSENENSDSEEENQDLDEEV------------NDIPE 3897

Query: 910  HYPT------LDEEEEEEDRESLVKDRESSVKGKEEEAKVIKDDEYYENLGDVLTKKYSL 963
                       DE  EE+  E+  K  E S    E +    +DD                
Sbjct: 3898 DLSNSLNEKLWDEPNEEDLLETEQKSNEQSAANNESDLVSKEDDN-------------KA 3944

Query: 964  PVNSDIQIKIDKPDDEPDYVIKGKYEVDNEMLLKRSKLKPQYSS-EMSEASNITDDEDEE 1022
              + D Q K D+ +   D  I    E+  ++    S+  P+    ++ E   + + E + 
Sbjct: 3945 LEDKDRQEKEDEEEMSDDVGIDD--EIQPDIQENNSQPPPENEDLDLPEDLKLDEKEGDV 4002

Query: 1023 DEE-DSFDFDELFEDNPEEEYDEDDRDQP 1050
             ++ D  D D    D  +EE D  ++D+P
Sbjct: 4003 SKDSDLEDMDMEAADENKEEAD-AEKDEP 4030



 Score = 34.2 bits (78), Expect = 0.58
 Identities = 60/274 (21%), Positives = 99/274 (36%), Gaps = 33/274 (12%)

Query: 850  DKYVDNED-DYDIVETNEHTGTGAPSDNEN------ESDYFPEKTIDESVYGYDTIVYGY 902
             K  D ED D +  + N+       +D E       E       T+DE +   D      
Sbjct: 4003 SKDSDLEDMDMEAADENKEE-----ADAEKDEPMQDEDPLEENNTLDEDIQQDDFSDLAE 4057

Query: 903  DSDDLDRHYPTLDEEEEEEDRESLVKDRESSVKGKEEEAKVIKDDEYYENLGDVLTKKYS 962
            D + ++      + +E EE  E  VK  E   +G+  E + I D+    +         +
Sbjct: 4058 DDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPEDQAI-DNHPKMDAKSTFASAEA 4116

Query: 963  LPVNSDIQIKIDKPD-DEPDYV-----IKGKYEVDNEMLLK----RSKLKPQYSS----- 1007
               N+D  I  +  +  E D V       G++E   E         S+   QY S     
Sbjct: 4117 DEENTDKGIVGENEELGEEDGVRGNGTADGEFEQVQEDTSTPKEAMSEADRQYQSLGDHL 4176

Query: 1008 -EMSEASNITDDEDEEDEEDSFDFDELFEDNPEEEYDEDDRDQPINFARNRHNKYIEDDQ 1066
             E  +A+ I + ED  + +     D  F      + DE++  Q +  A     K I+ D+
Sbjct: 4177 REWQQANRIHEWEDLTESQSQAFDDSEFM---HVKEDEEEDLQALGNAEKDQIKSIDRDE 4233

Query: 1067 EEIYHPKLMTMRSSQEDLDEAPPVPEHLDDGPEI 1100
                +P  M   +  ED  +     + L DG +I
Sbjct: 4234 SANQNPDSMNSTNIAEDEADE-VGDKQLQDGQDI 4266



 Score = 30.4 bits (68), Expect = 7.1
 Identities = 34/175 (19%), Positives = 67/175 (38%), Gaps = 16/175 (9%)

Query: 916  EEEEEEDRESLVKDRESSVKGKEEEAKVIKDDEYYENLGDVLTKKYSLPVNSDIQIKIDK 975
            + E E+ +E     +E+       EA     D  Y++LGD L +++        +I   +
Sbjct: 4145 DGEFEQVQEDTSTPKEA-----MSEA-----DRQYQSLGDHL-REW----QQANRIHEWE 4189

Query: 976  PDDEPDYVIKGKYEVDNEMLLKRSKLKPQYSSEMSEASNITDDEDEEDEEDSFDFDELFE 1035
               E         E  +    +   L+   ++E  +  +I  DE      DS +   + E
Sbjct: 4190 DLTESQSQAFDDSEFMHVKEDEEEDLQALGNAEKDQIKSIDRDESANQNPDSMNSTNIAE 4249

Query: 1036 DNPEEEYDEDDRD-QPINFARNRHNKYIEDDQEEIYHPKLMTMRSSQEDLDEAPP 1089
            D  +E  D+  +D Q I+  +      +  +   I   + +   S  ED+++  P
Sbjct: 4250 DEADEVGDKQLQDGQDISDIKQTGEDTLPTEFGSINQSEKVFELSEDEDIEDELP 4304



 Score = 30.4 bits (68), Expect = 8.9
 Identities = 39/214 (18%), Positives = 71/214 (33%), Gaps = 27/214 (12%)

Query: 860  DIVETNEHTGTGA--PSDNENESDYFPEKTIDESVYGYDTIVYGYDSDDLDRHYPTLDEE 917
            +  E  E  G      +D E E       T  E++             + DR Y +L + 
Sbjct: 4128 ENEELGEEDGVRGNGTADGEFEQVQEDTSTPKEAM------------SEADRQYQSLGDH 4175

Query: 918  EEEEDRESLVKDRESSVKGKEEEAKVIKDDEYYENLGDVLTKKYSLPVNSDIQIK-IDKP 976
              E  + + + + E      E +++   D E+     D      +L      QIK ID+ 
Sbjct: 4176 LREWQQANRIHEWEDL---TESQSQAFDDSEFMHVKEDEEEDLQALGNAEKDQIKSIDRD 4232

Query: 977  ---DDEPDYVIKGKY------EVDNEMLLKRSKLKPQYSSEMSEASNITDDEDEEDEEDS 1027
               +  PD +           EV ++ L     +     +            ++ ++   
Sbjct: 4233 ESANQNPDSMNSTNIAEDEADEVGDKQLQDGQDISDIKQTGEDTLPTEFGSINQSEKVFE 4292

Query: 1028 FDFDELFEDNPEEEYDEDDRDQPINFARNRHNKY 1061
               DE  ED   +   +     PI+ AR+  NK+
Sbjct: 4293 LSEDEDIEDELPDYNVKITPAMPIDEARDLWNKH 4326


>gnl|CDD|143214 cd05737, Ig_Myomesin_like_C, C-temrinal immunoglobulin (Ig)-like
           domain of myomesin and M-protein.  Ig_Myomesin_like_C:
           domain similar to the C-temrinal immunoglobulin
           (Ig)-like domain of myomesin and M-protein. Myomesin and
           M-protein are both structural proteins localized to the
           M-band, a transverse structure in the center of the
           sarcomere, and are candidates for M-band bridges. Both
           proteins are modular, consisting mainly of repetitive
           Ig-like and fibronectin type III (FnIII) domains.
           Myomesin is expressed in all types of vertebrate
           striated muscle; M-protein has a muscle-type specific
           expression pattern. Myomesin is present in both slow and
           fast fibers; M-protein is present only in fast fibers.
           It has been suggested that myomesin acts as a molecular
           spring with alternative splicing as a means of modifying
           its elasticity.
          Length = 92

 Score = 34.4 bits (79), Expect = 0.039
 Identities = 21/79 (26%), Positives = 34/79 (43%), Gaps = 2/79 (2%)

Query: 456 IRALHDT-TALEDEKVEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRI-VTDNDISTLIIH 513
           +  L D  T +E + +  T  V G P P+VSW K+   +  S    + V     ++L I 
Sbjct: 4   VGGLPDVVTIMEGKTLNLTCTVFGDPDPEVSWLKNDQALALSDHYNVKVEQGKYASLTIK 63

Query: 514 QAALMDEGEIKCTATNRAG 532
             +  D G+      N+ G
Sbjct: 64  GVSSEDSGKYGIVVKNKYG 82



 Score = 34.0 bits (78), Expect = 0.066
 Identities = 20/77 (25%), Positives = 31/77 (40%), Gaps = 1/77 (1%)

Query: 563 GEIIKLKVSMAGMPPPTARWLHNGEPLTSGGRYEIT-HTDRYLNLRISDARRADRGEYQA 621
           G+ + L  ++ G P P   WL N + L     Y +     +Y +L I      D G+Y  
Sbjct: 16  GKTLNLTCTVFGDPDPEVSWLKNDQALALSDHYNVKVEQGKYASLTIKGVSSEDSGKYGI 75

Query: 622 HGVNSLGEDVASFLVTV 638
              N  G +     V+V
Sbjct: 76  VVKNKYGGETVDVTVSV 92


>gnl|CDD|197470 smart00015, IQ, Calmodulin-binding motif.  Short calmodulin-binding
           motif containing conserved Ile and Gln residues.
          Length = 23

 Score = 32.3 bits (75), Expect = 0.048
 Identities = 7/21 (33%), Positives = 13/21 (61%)

Query: 276 RHQCAVKIQSLWKMYRVRKKF 296
             + A+ IQ+ W+ Y  RK++
Sbjct: 2   LTRAAIIIQAAWRGYLARKRY 22



 Score = 28.8 bits (66), Expect = 0.85
 Identities = 9/16 (56%), Positives = 11/16 (68%), Gaps = 1/16 (6%)

Query: 43 IVIQKYFRGYLLMRKE 58
          I+IQ  +RGY L RK 
Sbjct: 7  IIIQAAWRGY-LARKR 21


>gnl|CDD|143278 cd05870, Ig5_NCAM-2, Fifth immunoglobulin (Ig)-like domain of
           Neural Cell Adhesion Molecule NCAM-2 (also known as
           OCAM/mamFas II and RNCAM).  Ig5_NCAM-2: the fifth
           immunoglobulin (Ig)-like domain of Neural Cell Adhesion
           Molecule NCAM-2 (also known as OCAM/mamFas II and
           RNCAM). NCAM-2  is organized similarly to NCAM ,
           including five N-terminal Ig-like domains and two
           fibronectin type III domains. NCAM-2 is differentially
           expressed in the developing and mature olfactory
           epithelium (OE), and may function like NCAM, as an
           adhesion molecule.
          Length = 98

 Score = 34.6 bits (79), Expect = 0.052
 Identities = 24/85 (28%), Positives = 37/85 (43%), Gaps = 9/85 (10%)

Query: 456 IRALHDTTALEDEKVEFTVQVEGIPTPKVSWYK--DGFEIFSS------RRQRIVTDNDI 507
           I  L + T +E+     + + EG P P+++W +  DG   FS        R  +   +  
Sbjct: 5   IIQLKNETTVENGAATLSCKAEGEPIPEITWKRASDGHT-FSEGDKSPDGRIEVKGQHGE 63

Query: 508 STLIIHQAALMDEGEIKCTATNRAG 532
           S+L I    L D G   C A +R G
Sbjct: 64  SSLHIKDVKLSDSGRYDCEAASRIG 88


>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
           [Transcription / DNA replication, recombination, and
           repair / Chromatin structure and dynamics].
          Length = 1001

 Score = 36.5 bits (84), Expect = 0.10
 Identities = 19/94 (20%), Positives = 32/94 (34%), Gaps = 6/94 (6%)

Query: 855 NEDDYDIVETNEHTGTGAPSDNENESDYFPEKTIDESVYGYDTIVYGYDSDDLDRHYPTL 914
            +D     E    +     SD+E+  +   E +  E+    D      D D+        
Sbjct: 908 MKDPISFFEDGGWSFLMVGSDDES-DESEEEVSEYEA--SSDDESDETDEDEESDES--- 961

Query: 915 DEEEEEEDRESLVKDRESSVKGKEEEAKVIKDDE 948
            E+  E++ E+   D E      E E+K   D  
Sbjct: 962 SEDLSEDESENDSSDEEDGEDWDELESKAAYDSR 995


>gnl|CDD|143172 cd04971, Ig_TrKABC_d5, Fifth domain (immunoglobulin-like) of Trk
           receptors TrkA, TrkB and TrkC.  TrkABC_d5: the fifth
           domain of Trk receptors TrkA, TrkB and TrkC, this is an
           immunoglobulin (Ig)-like domain which binds to
           neurotrophin. The Trk family of receptors are tyrosine
           kinase receptors. They are activated by dimerization,
           leading to autophosphorylation of intracellular tyrosine
           residues, and triggering the signal transduction
           pathway. TrkA, TrkB, and TrkC share significant sequence
           homology and domain organization. The first three
           domains are leucine-rich domains. The fourth and fifth
           domains are Ig-like domains playing a part in ligand
           binding. TrkA, Band C mediate the trophic effects of the
           neurotrophin Nerve growth factor (NGF) family. TrkA is
           recognized by NGF. TrkB is recognized by brain-derived
           neurotrophic factor (BDNF) and neurotrophin (NT)-4. TrkC
           is recognized by NT-3. NT-3 is promiscuous as in some
           cell systems it activates TrkA and TrkB receptors. TrkA
           is a receptor found in all major NGF targets, including
           the sympathetic, trigeminal, and dorsal root ganglia,
           cholinergic neurons of the basal forebrain and the
           striatum. TrKB transcripts are found throughout multiple
           structures of the central and peripheral nervous
           systems. The TrkC gene is expressed throughout the
           mammalian nervous system.
          Length = 81

 Score = 32.7 bits (75), Expect = 0.12
 Identities = 19/67 (28%), Positives = 25/67 (37%), Gaps = 7/67 (10%)

Query: 574 GMPPPTARWLHNGEPL-------TSGGRYEITHTDRYLNLRISDARRADRGEYQAHGVNS 626
           G P PT  W HNG  L       T       T T+ +  L+  +    + G Y     N 
Sbjct: 9   GNPKPTLTWYHNGAVLNESDYIRTEIHYEVTTPTEYHGCLQFDNPTHVNNGNYTLVASNE 68

Query: 627 LGEDVAS 633
            G+D  S
Sbjct: 69  YGQDSKS 75


>gnl|CDD|143263 cd05855, Ig_TrkB_d5, Fifth domain (immunoglobulin-like) of Trk
           receptor TrkB.  TrkB_d5: the fifth domain of Trk
           receptor TrkB, this is an immunoglobulin (Ig)-like
           domain which binds to neurotrophin. The Trk family of
           receptors are tyrosine kinase receptors, which mediate
           the trophic effects of the neurotrophin Nerve growth
           factor (NGF) family. The Trks are activated by
           dimerization, leading to autophosphorylation of
           intracellular tyrosine residues, and triggering the
           signal transduction pathway. TrkB shares significant
           sequence homology and domain organization with TrkA, and
           TrkC. The first three domains are leucine-rich domains.
           The fourth and fifth domains are Ig-like domains playing
           a part in ligand binding. TrKB is recognized by
           brain-derived neurotrophic factor (BDNF) and
           neurotrophin (NT)-4. In some cell systems NT-3 can
           activate TrkA and TrkB receptors. TrKB transcripts are
           found throughout multiple structures of the central and
           peripheral nervous systems.
          Length = 79

 Score = 32.5 bits (74), Expect = 0.14
 Identities = 17/65 (26%), Positives = 28/65 (43%), Gaps = 5/65 (7%)

Query: 571 SMAGMPPPTARWLHNGEPLTSGGR-----YEITHTDRYLNLRISDARRADRGEYQAHGVN 625
           ++ G P PT +W H G  L          + I +T+ +  L++ +    + G Y     N
Sbjct: 6   TVKGNPKPTLQWFHEGAILNESEYICTKIHVINNTEYHGCLQLDNPTHLNNGIYTLVAKN 65

Query: 626 SLGED 630
             GED
Sbjct: 66  EYGED 70


>gnl|CDD|240433 PTZ00482, PTZ00482, membrane-attack complex/perforin (MACPF)
            Superfamily; Provisional.
          Length = 844

 Score = 35.6 bits (82), Expect = 0.16
 Identities = 23/133 (17%), Positives = 47/133 (35%), Gaps = 5/133 (3%)

Query: 902  YDSDDLDRHYPTLDEEEEEEDRESLVKDR--ESSVKGKEEEAKVIKDDEYYENLGDVLTK 959
            Y+ D+ D    T  E   ++D    + DR  ++  +   ++      D+   N       
Sbjct: 98   YEDDEDDAGNATSGESSTDDDSLLELPDRDEDADTQANNDQTNDFDQDDS-SNSQTDQGL 156

Query: 960  KYSLPVNSDIQIKIDKPDDEPDYVIKGKYEVDNEMLLKRSKLKPQYSSEMSEASNITDDE 1019
            K S  VN     K+ +            Y   N+     +K   +  S      N +D +
Sbjct: 157  KQS--VNLSSAEKLIEEKKGQTENTFKFYNFGNDGEEAAAKDGGKSKSSDPGPLNDSDGQ 214

Query: 1020 DEEDEEDSFDFDE 1032
             ++ + +S + D+
Sbjct: 215  GDDGDPESAEEDK 227


>gnl|CDD|143301 cd05893, Ig_Palladin_C, C-terminal immunoglobulin (Ig)-like domain
           of palladin.  Ig_Palladin_C: C-terminal immunoglobulin
           (Ig)-like domain of palladin. Palladin belongs to the
           palladin-myotilin-myopalladin family. Proteins belonging
           to this family contain multiple Ig-like domains and
           function as scaffolds, modulating actin cytoskeleton.
           Palladin binds to alpha-actinin ezrin,
           vasodilator-stimulated phosphoprotein VASP, SPIN90 (DIP,
           mDia interacting protein), and Src. Palladin also binds
           F-actin directly, via its Ig3 domain. Palladin is
           expressed as several alternatively spliced isoforms,
           having various combinations of Ig-like domains, in a
           cell-type-specific manner. It has been suggested that
           palladin's different Ig-like domains may be specialized
           for distinct functions.
          Length = 75

 Score = 32.3 bits (73), Expect = 0.17
 Identities = 19/65 (29%), Positives = 27/65 (41%), Gaps = 2/65 (3%)

Query: 566 IKLKVSMAGMPPPTARWLHNGEPLTSGGRYEITHTDR--YLNLRISDARRADRGEYQAHG 623
           ++L+  ++G+P P   W    E LT        H D   Y+ L I  A + D G Y    
Sbjct: 1   VRLECRVSGVPHPQIFWKKENESLTHNTDRVSMHQDNCGYICLLIQGATKEDAGWYTVSA 60

Query: 624 VNSLG 628
            N  G
Sbjct: 61  KNEAG 65



 Score = 30.8 bits (69), Expect = 0.62
 Identities = 22/74 (29%), Positives = 31/74 (41%), Gaps = 4/74 (5%)

Query: 470 VEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRIVTDND---ISTLIIHQAALMDEGEIKCT 526
           V    +V G+P P++ W K+  E  +    R+    D      L+I  A   D G    +
Sbjct: 1   VRLECRVSGVPHPQIFWKKEN-ESLTHNTDRVSMHQDNCGYICLLIQGATKEDAGWYTVS 59

Query: 527 ATNRAGHSITKARL 540
           A N AG     ARL
Sbjct: 60  AKNEAGIVSCTARL 73


>gnl|CDD|215677 pfam00047, ig, Immunoglobulin domain.  Members of the
           immunoglobulin superfamily are found in hundreds of
           proteins of different functions. Examples include
           antibodies, the giant muscle kinase titin and receptor
           tyrosine kinases. Immunoglobulin-like domains may be
           involved in protein-protein and protein-ligand
           interactions. The Pfam alignments do not include the
           first and last strand of the immunoglobulin-like domain.
          Length = 62

 Score = 31.7 bits (72), Expect = 0.22
 Identities = 16/61 (26%), Positives = 21/61 (34%), Gaps = 1/61 (1%)

Query: 468 EKVEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRIVTDND-ISTLIIHQAALMDEGEIKCT 526
             V  T  V G P   V+W+K+G  +  S       +     TL I      D G   C 
Sbjct: 2   SSVTLTCSVSGPPQVDVTWFKEGKGLEESTTVGTDENRVSSITLTISNVTPEDSGTYTCV 61

Query: 527 A 527
            
Sbjct: 62  V 62


>gnl|CDD|143272 cd05864, Ig2_VEGFR-2, Second immunoglobulin (Ig)-like domain of
           vascular endothelial growth factor receptor 2 (VEGFR-2).
            Ig2_VEGF-2: Second immunoglobulin (Ig)-like domain of
           vascular endothelial growth factor receptor 2 (VEGFR-2).
           The VEGFRs have an extracellular component with seven
           Ig-like domains, a transmembrane segment, and an
           intracellular tyrosine kinase domain interrupted by a
           kinase-insert domain. VEGFRs bind VEGFs with high
           affinity at the Ig-like domains. VEGFR-2 (KDR/Flk-1) is
           a major mediator of the mitogenic, angiogenic and
           microvascular permeability-enhancing effects of VEGF-A;
           VEGF-A is important to the growth and maintenance of
           vascular endothelial cells and to the development of new
           blood- and lymphatic-vessels in physiological and
           pathological states. VEGF-A also interacts with VEGFR-1,
           which it binds more strongly than VEGFR-2.  VEGFR-2 and
           -1 may mediate a chemotactic and a survival signal in
           hematopoietic stem cells or leukemia cells.
          Length = 70

 Score = 31.8 bits (72), Expect = 0.22
 Identities = 14/54 (25%), Positives = 22/54 (40%), Gaps = 5/54 (9%)

Query: 566 IKLKVSMAGMPPPTARWLHNGEPLTSGGRYEITHTDRYLNLRISDARRADRGEY 619
           +K+ V   G PPP  +W  NG+ +     ++         L I +    D G Y
Sbjct: 1   VKIPVKYYGYPPPEVKWYKNGQLIVLNHTFKRGVH-----LTIYEVTEKDAGNY 49


>gnl|CDD|143284 cd05876, Ig3_L1-CAM, Third immunoglobulin (Ig)-like domain of the
           L1 cell adhesion molecule (CAM).  Ig3_L1-CAM:  third
           immunoglobulin (Ig)-like domain of the L1 cell adhesion
           molecule (CAM). L1 belongs to the L1 subfamily of cell
           adhesion molecules (CAMs) and is comprised of an
           extracellular region having six Ig-like domains, five
           fibronectin type III domains, a transmembrane region and
           an intracellular domain. L1 is primarily expressed in
           the nervous system and is involved in its development
           and function. L1 is associated with an X-linked
           recessive disorder, X-linked hydrocephalus, MASA
           syndrome, or spastic paraplegia type 1, that involves
           abnormalities of axonal growth. This group also contains
           the chicken neuron-glia cell adhesion molecule, Ng-CAM.
          Length = 71

 Score = 31.8 bits (72), Expect = 0.23
 Identities = 19/65 (29%), Positives = 29/65 (44%), Gaps = 3/65 (4%)

Query: 574 GMPPPTARWLHNGEPLTSGGRYEITHTDRYLNLRISDARRADRGEYQAHGVNSLGEDVAS 633
           G+P P   W     PL S  R +  + ++ L  ++ +   +D GEY     NS G     
Sbjct: 9   GLPTPEVHWDRIDGPL-SPNRTKKLNNNKTL--QLDNVLESDDGEYVCTAENSEGSARHH 65

Query: 634 FLVTV 638
           + VTV
Sbjct: 66  YTVTV 70



 Score = 29.9 bits (67), Expect = 1.2
 Identities = 20/65 (30%), Positives = 34/65 (52%), Gaps = 8/65 (12%)

Query: 477 EGIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIHQAALMDEGEIKCTATN-----RA 531
           EG+PTP+V W +    +  +R +++   N+  TL +      D+GE  CTA N     R 
Sbjct: 8   EGLPTPEVHWDRIDGPLSPNRTKKL---NNNKTLQLDNVLESDDGEYVCTAENSEGSARH 64

Query: 532 GHSIT 536
            +++T
Sbjct: 65  HYTVT 69


>gnl|CDD|225880 COG3343, RpoE, DNA-directed RNA polymerase, delta subunit
            [Transcription].
          Length = 175

 Score = 34.0 bits (78), Expect = 0.23
 Identities = 22/87 (25%), Positives = 39/87 (44%), Gaps = 6/87 (6%)

Query: 965  VNSDIQIKIDKPDDEPDYVIKGKYEVDNEMLLKRSKLKPQYSSEMSEASNITDDEDEEDE 1024
            + +  + K  K  D+     +   E D + L      + +   E+    +  DDEDE+D+
Sbjct: 91   IQAMTEKKDIKAKDKEVDAFE---EGDEDELDYDEDKEEEEDDEVDSLDDENDDEDEDDD 147

Query: 1025 EDSFDF---DELFEDNPEEEYDEDDRD 1048
            E        DE+ ED  ++E +ED+ D
Sbjct: 148  EIVEILIEDDEVDEDEDDDEDEEDEED 174


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 35.0 bits (81), Expect = 0.24
 Identities = 22/124 (17%), Positives = 41/124 (33%), Gaps = 12/124 (9%)

Query: 928  KDRESSVKGKEEEAKVIKDDEYYENLGDVLTKKYSLPVNSDIQIKIDKPDDEPDYVIKGK 987
              +  + K  ++E    K  E    L       Y   ++   Q   D  DD+ D +    
Sbjct: 87   AAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDD 146

Query: 988  YEVDNEMLLKRSKLKPQYSSEMSEASNITDDEDEEDEEDSFD--FDELFEDNPEEEYDED 1045
             + D++              +  +  +  DDEDEE +E        +  +   +E+  E 
Sbjct: 147  IDDDDDDED----------DDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEA 196

Query: 1046 DRDQ 1049
             R  
Sbjct: 197  LRQA 200



 Score = 31.9 bits (73), Expect = 1.9
 Identities = 23/141 (16%), Positives = 48/141 (34%), Gaps = 6/141 (4%)

Query: 908  DRHYPTLDEEEEEEDRESLVKDRESSVKGKEEEAKVIKDDEYYENLGDVLTKKYSLPVNS 967
             +       ++       +VKD + + +  +   K  K             KK    ++S
Sbjct: 44   SKKKTPEQIDQVLIFLSGMVKDTDDATE-SDIPKKKTKTAAKAAAAKAPAKKKLKDELDS 102

Query: 968  DIQIKIDKPDDEPDYVIKGKYEVDNEMLLKRSKLKPQYSSEMSEASNITDDEDEEDEEDS 1027
              + +     D+ D      Y  D +  +           +  +  +   D+D++DE+D 
Sbjct: 103  SKKAEKKNALDKDD---DLNYVKDID--VLNQADDDDDDDDDDDLDDDDIDDDDDDEDDD 157

Query: 1028 FDFDELFEDNPEEEYDEDDRD 1048
             D D+   D+ +EE  E    
Sbjct: 158  EDDDDDDVDDEDEEKKEAKEL 178


>gnl|CDD|143264 cd05856, Ig2_FGFRL1-like, Second immunoglobulin (Ig)-like domain of
           fibroblast growth factor (FGF) receptor_like-1(FGFRL1). 
           Ig2_FGFRL1-like: second immunoglobulin (Ig)-like domain
           of fibroblast growth factor (FGF)
           receptor_like-1(FGFRL1). FGFRL1 is comprised of a signal
           peptide, three extracellular Ig-like modules, a
           transmembrane segment, and a short intracellular domain.
           FGFRL1 is expressed preferentially in skeletal tissues.
           Similar to FGF receptors, the expressed protein
           interacts specifically with heparin and with FGF2.
           FGFRL1 does not have a protein tyrosine kinase domain at
           its C terminus; neither does its cytoplasmic domain
           appear to interact with a signaling partner. It has been
           suggested that FGFRL1 may not have any direct signaling
           function, but instead acts as a decoy receptor trapping
           FGFs and preventing them from binding other receptors.
          Length = 82

 Score = 32.1 bits (73), Expect = 0.26
 Identities = 24/78 (30%), Positives = 38/78 (48%), Gaps = 4/78 (5%)

Query: 562 MGEIIKLKVSMAGMPPPTARWLHNGEPLTSGGRYEITHTDR-YLNLRISDARRADRGEYQ 620
           +G  ++LK   +G P P   WL + +PLT     EI  + +    L + + +  D G+Y 
Sbjct: 8   VGSSVRLKCVASGNPRPDITWLKDNKPLT---PTEIGESRKKKWTLSLKNLKPEDSGKYT 64

Query: 621 AHGVNSLGEDVASFLVTV 638
            H  N  GE  A++ V V
Sbjct: 65  CHVSNRAGEINATYKVDV 82


>gnl|CDD|143274 cd05866, Ig1_NCAM-2, First immunoglobulin (Ig)-like domain of
           neural cell adhesion molecule NCAM-2.  Ig1_NCAM-2:
           first immunoglobulin (Ig)-like domain of neural cell
           adhesion molecule NCAM-2 (OCAM/mamFas II, RNCAM). NCAM-2
            is organized similarly to NCAM , including five
           N-terminal Ig-like domains and two fibronectin type III
           domains. NCAM-2 is differentially expressed in the
           developing and mature olfactory epithelium (OE), and may
           function like NCAM, as an adhesion molecule.
          Length = 92

 Score = 31.9 bits (72), Expect = 0.30
 Identities = 21/64 (32%), Positives = 31/64 (48%), Gaps = 2/64 (3%)

Query: 472 FTVQVEGIPTPKVSWYK-DGFEIFSSRRQRIVTDNDISTLIIHQAALMDEGEIKCTATNR 530
           FT    G P   + WY   G +I SS+R  +  +   S L I+ A + D G  +C AT+ 
Sbjct: 20  FTCTAIGEPE-SIDWYNPQGEKIVSSQRVVVQKEGVRSRLTIYNANIEDAGIYRCQATDA 78

Query: 531 AGHS 534
            G +
Sbjct: 79  KGQT 82


>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein.  This is a family of
            fungal and plant proteins and contains many hypothetical
            proteins. VID27 is a cytoplasmic protein that plays a
            potential role in vacuolar protein degradation.
          Length = 794

 Score = 34.7 bits (80), Expect = 0.31
 Identities = 15/54 (27%), Positives = 30/54 (55%), Gaps = 1/54 (1%)

Query: 1007 SEMSEASNITDDEDEEDEEDSFDFDELFEDNPEEEYDEDDRDQPINFARNRHNK 1060
             E+ +A+   DDE+EEDEE+  + DE  E   +E  D+++ ++    ++   + 
Sbjct: 379  LEIEDANTERDDEEEEDEEEEEEEDED-EGPSKEHSDDEEFEEDDVESKYEDSD 431


>gnl|CDD|143299 cd05891, Ig_M-protein_C, C-terminal immunoglobulin (Ig)-like domain
           of M-protein (also known as myomesin-2).
           Ig_M-protein_C: the C-terminal immunoglobulin (Ig)-like
           domain of M-protein (also known as myomesin-2).
           M-protein is a structural protein localized to the
           M-band, a transverse structure in the center of the
           sarcomere, and is a candidate for M-band bridges.
           M-protein is modular consisting mainly of repetitive
           IG-like and fibronectin type III (FnIII) domains, and
           has a muscle-type specific expression pattern. M-protein
           is present in fast fibers.
          Length = 92

 Score = 31.8 bits (72), Expect = 0.34
 Identities = 18/71 (25%), Positives = 31/71 (43%), Gaps = 1/71 (1%)

Query: 463 TALEDEKVEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRIVTDN-DISTLIIHQAALMDEG 521
           T +E + +  T  V G P P+V W+K+  +I  S    +  +    ++L I      D G
Sbjct: 12  TIMEGKTLNLTCTVFGNPDPEVIWFKNDQDIELSEHYSVKLEQGKYASLTIKGVTSEDSG 71

Query: 522 EIKCTATNRAG 532
           +      N+ G
Sbjct: 72  KYSINVKNKYG 82


>gnl|CDD|143267 cd05859, Ig4_PDGFR-alpha, Fourth immunoglobulin (Ig)-like domain of
           platelet-derived growth factor receptor (PDGFR) alpha.
           IG4_PDGFR-alpha: The fourth immunoglobulin (Ig)-like
           domain of platelet-derived growth factor receptor
           (PDGFR) alpha. PDGF is a potent mitogen for connective
           tissue cells. PDGF-stimulated processes are mediated by
           three different PDGFs (PDGF-A,-B, and C). PDGFR alpha
           binds to all three PDGFs, whereas the PDGFR beta (not
           included in this group) binds only to PDGF-B. PDGF alpha
           is organized as an extracellular component having five
           Ig-like domains, a transmembrane segment, and a
           cytoplasmic portion having protein tyrosine kinase
           activity. In mice, PDGFR alpha and PDGFR beta are
           essential for normal development.
          Length = 101

 Score = 31.8 bits (72), Expect = 0.42
 Identities = 11/23 (47%), Positives = 14/23 (60%)

Query: 468 EKVEFTVQVEGIPTPKVSWYKDG 490
           E  EF V+VE  P P++ W KD 
Sbjct: 19  EVKEFVVEVEAYPPPQIRWLKDN 41



 Score = 29.8 bits (67), Expect = 2.2
 Identities = 20/68 (29%), Positives = 32/68 (47%), Gaps = 8/68 (11%)

Query: 561 EMGEIIKLKVSMAGMPPPTARWLHNGEPLTSGGRYEITHTD------RYLN-LRISDARR 613
            + E+ +  V +   PPP  RWL +   L      EIT ++      RY++ L++  A+ 
Sbjct: 16  NLHEVKEFVVEVEAYPPPQIRWLKDNRTL-IENLTEITTSEHNVQETRYVSKLKLIRAKE 74

Query: 614 ADRGEYQA 621
            D G Y A
Sbjct: 75  EDSGLYTA 82


>gnl|CDD|143241 cd05764, Ig_2, Subgroup of the immunoglobulin (Ig) superfamily.
           Ig_2: subgroup of the immunoglobulin (Ig) domain found
           in the Ig superfamily. The Ig superfamily is a
           heterogenous group of proteins, built on a common fold
           comprised of a sandwich of two beta sheets. Members of
           the Ig superfamily are components of immunoglobulin,
           neuroglia, cell surface glycoproteins, such as T-cell
           receptors, CD2, CD4, CD8, and membrane glycoproteins,
           such as butyrophilin and chondroitin sulfate
           proteoglycan core protein. A predominant feature of most
           Ig domains is a disulfide bridge connecting the two
           beta-sheets with a tryptophan residue packed against the
           disulfide bond.
          Length = 74

 Score = 31.3 bits (71), Expect = 0.43
 Identities = 19/65 (29%), Positives = 29/65 (44%), Gaps = 2/65 (3%)

Query: 478 GIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIHQAALMDEGEIKCTATNRAGHSITK 537
           G P P + W     ++ S+  + +V DN   TL I    + D G   C A+N AG +   
Sbjct: 12  GDPEPAIHWISPDGKLISNSSRTLVYDN--GTLDILITTVKDTGSFTCIASNAAGEATAT 69

Query: 538 ARLRL 542
             L +
Sbjct: 70  VELHI 74


>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein.  This family includes proteins
            related to Mpp10 (M phase phosphoprotein 10). The U3
            small nucleolar ribonucleoprotein (snoRNP) is required
            for three cleavage events that generate the mature 18S
            rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
            depletion of Mpp10, a U3 snoRNP-specific protein, halts
            18S rRNA production and impairs cleavage at the three U3
            snoRNP-dependent sites.
          Length = 613

 Score = 34.2 bits (78), Expect = 0.48
 Identities = 31/170 (18%), Positives = 53/170 (31%), Gaps = 11/170 (6%)

Query: 913  TLDEEEEEEDRESLVKDRESSVK-----GKEEEAKVIKDDEYYENLGDVLTKKYSLPVNS 967
                E EEE       D E   +     GK++E     +DE  +  G++  + +  P   
Sbjct: 204  LEATEAEEEAALGDEDDFEDYFQDDSEDGKDDEDFGSGEDEEDDEEGNIEYEDFFDPKEK 263

Query: 968  DIQIKIDKPDDEPDYVIKGKYEVDNEMLLKRSKLKPQYSSEMSEASNITDDEDEEDEEDS 1027
            D +      D E +     K  V  E   K  +       E  +      DE+E  E   
Sbjct: 264  DKKKDAGD-DAELEDDEPDKEAVKKEADSKPEE-----EDEEDDEQEDDQDEEEPPEAAM 317

Query: 1028 FDFDELFEDNPEEEYDEDDRDQPINFARNRHNKYIEDDQEEIYHPKLMTM 1077
                         + +           + +  + IE  ++E   PK  T+
Sbjct: 318  DKVKLDEPVLEGVDLESPKELSSFEKRQAKLKQQIEQLEKENLAPKSWTL 367



 Score = 32.7 bits (74), Expect = 1.2
 Identities = 18/95 (18%), Positives = 32/95 (33%), Gaps = 9/95 (9%)

Query: 1006 SSEMSEASNITDDEDEEDEEDSFDFDELFEDNPEEEYDEDDRDQPINFARNRHNKYIEDD 1065
            S      S  + D++EE+EED    DE+ +D  E +   +              + + DD
Sbjct: 104  SDGSDMDSEDSADDEEEEEEDESLEDEMIDDEDEADLFNESESS---------LEDLSDD 154

Query: 1066 QEEIYHPKLMTMRSSQEDLDEAPPVPEHLDDGPEI 1100
            + E    K M    + E+ +               
Sbjct: 155  ETEDDEEKKMEEEEAGEEKESVEQATREKKFDKSG 189


>gnl|CDD|221333 pfam11942, Spt5_N, Spt5 transcription elongation factor, acidic
            N-terminal.  This is the very acidic N-terminal region of
            the early transcription elongation factor Spt5. The
            Spt5-Spt4 complex regulates early transcription
            elongation by RNA polymerase II and has an imputed role
            in pre-mRNA processing via its physical association with
            mRNA capping enzymes. The actual function of this
            N-terminal domain is not known although it is dispensable
            for binding to Spt4.
          Length = 92

 Score = 31.3 bits (71), Expect = 0.50
 Identities = 17/53 (32%), Positives = 30/53 (56%)

Query: 1017 DDEDEEDEEDSFDFDELFEDNPEEEYDEDDRDQPINFARNRHNKYIEDDQEEI 1069
            DDE+EE+EE+  D ++L +++   +  E + D+       R  K  E+D EE+
Sbjct: 9    DDEEEEEEEEEDDLEDLSDEDEFIDEAEAEDDRRHRRLDRRREKEEEEDAEEL 61



 Score = 27.8 bits (62), Expect = 9.7
 Identities = 13/37 (35%), Positives = 21/37 (56%)

Query: 1011 EASNITDDEDEEDEEDSFDFDELFEDNPEEEYDEDDR 1047
            EA    ++E+EE+EED  +     ++  +E   EDDR
Sbjct: 5    EAEVDDEEEEEEEEEDDLEDLSDEDEFIDEAEAEDDR 41


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
            Rpc31.  RNA polymerase III contains seventeen subunits in
            yeasts and in human cells. Twelve of these are akin to
            RNA polymerase I or II and the other five are RNA pol
            III-specific, and form the functionally distinct groups
            (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31, Rpc34
            and Rpc82 form a cluster of enzyme-specific subunits that
            contribute to transcription initiation in S.cerevisiae
            and H.sapiens. There is evidence that these subunits are
            anchored at or near the N-terminal Zn-fold of Rpc1,
            itself prolonged by a highly conserved but RNA polymerase
            III-specific domain.
          Length = 221

 Score = 33.2 bits (76), Expect = 0.59
 Identities = 17/54 (31%), Positives = 33/54 (61%), Gaps = 2/54 (3%)

Query: 993  EMLLKRSKLKPQYSSEMSEASNITDDEDEEDEEDSFDFDELFEDNPEEEYDEDD 1046
            ++  K S L+ +     +E  +  D++DEE+EE+  + DE F+D  +++ D+DD
Sbjct: 148  DIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDD--DDDDDDDD 199


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
            proteins whose interaction is required for the maturation
            of the 18S rRNA and for 40S ribosome production.
          Length = 809

 Score = 33.8 bits (78), Expect = 0.69
 Identities = 51/218 (23%), Positives = 84/218 (38%), Gaps = 54/218 (24%)

Query: 884  PEKTIDESVYGYDTIVY--GYDSDDLDRHYPT----LDEE---EEEEDRESLVKDRESSV 934
            P  T +E    YD  V    +D     R  PT     +EE   EE E  + L  +R   +
Sbjct: 227  PPMTPEEKDDEYDQRVRELTFDR----RAQPTDRTKTEEELAKEEAERLKKLEAERLRRM 282

Query: 935  KGKEEEAKVIKDDEYYENLGDVLTKKYSLPVNSDIQIKIDKPDDEPDYVIKGKYEVDNEM 994
            +G+EE+     D+E                   D +   D  DDE +       + D+  
Sbjct: 283  RGEEED-----DEE-----------------EEDSKESADDLDDEFEP------DDDDNF 314

Query: 995  LLKRSKLKPQYSSEMSEASNITDDEDEEDEEDSFDFDELFEDNPEEEYDEDDRDQPINFA 1054
             L + +   +   +  +  +  DD D++ EE+  D D   E+  EE+ D DD D      
Sbjct: 315  GLGQGEEDEEEEEDGVDDEDEEDD-DDDLEEEEEDVDLSDEEEDEEDEDSDDEDDE---- 369

Query: 1055 RNRHNKYIEDDQEEIYHPKLMTMRSSQEDLDEAPPVPE 1092
                    E+++EE    K  +  S++ +L    P P+
Sbjct: 370  --------EEEEEEKEKKKKKSAESTRSELPFTFPCPK 399


>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.  Genome
            duplication is precisely regulated by cyclin-dependent
            kinases CDKs, which bring about the onset of S phase by
            activating replication origins and then prevent
            relicensing of origins until mitosis is completed. The
            optimum sequence motif for CDK phosphorylation is
            S/T-P-K/R-K/R, and Drc1-Sld2 is found to have at least 11
            potential phosphorylation sites. Drc1 is required for DNA
            synthesis and S-M replication checkpoint control. Drc1
            associates with Cdc2 and is phosphorylated at the onset
            of S phase when Cdc2 is activated. Thus Cdc2 promotes DNA
            replication by phosphorylating Drc1 and regulating its
            association with Cut5. Sld2 and Sld3 represent the
            minimal set of S-CDK substrates required for DNA
            replication.
          Length = 397

 Score = 33.2 bits (76), Expect = 0.72
 Identities = 28/128 (21%), Positives = 50/128 (39%), Gaps = 17/128 (13%)

Query: 925  SLVKDRESSVKGKEEEAKVIKDDEYYENLGDVLTKKYSLPVNSDIQIKIDKPDDEPDYVI 984
             LV++ ES      +E  V+++ E  E     + ++    V+S         +DEP  V 
Sbjct: 238  ELVQEEESID----DELDVLREIEAEEAGIGPIEEEV---VDSQA------ANDEPRRVF 284

Query: 985  KGKYEVDNEMLLKRSKLKPQYSSEMSEASNITDD-EDEEDEEDSFDFDELFEDNPEEEYD 1043
            K K +   +   +R K++P  +    E S    D  +E  + D     E        + D
Sbjct: 285  KKKGQ---KRTTRRVKMRPVRAKPSDEPSLPESDIHEEIPKLDEKSLSEFLGYMGGIDED 341

Query: 1044 EDDRDQPI 1051
            ++D D   
Sbjct: 342  DEDEDDEE 349


>gnl|CDD|217373 pfam03115, Astro_capsid, Astrovirus capsid protein precursor.  This
            product is encoded by astrovirus ORF2, one of the three
            astrovirus ORFs (1a, 1b, 2). The 87kD precursor protein
            undergoes an intracellular cleavage to form a 79kD
            protein. Subsequently, extracellular trypsin cleavage
            yields the three proteins forming the infectious virion.
          Length = 787

 Score = 33.6 bits (77), Expect = 0.78
 Identities = 16/54 (29%), Positives = 27/54 (50%), Gaps = 2/54 (3%)

Query: 1003 PQYSSEMSEASNITDDEDEEDEEDSFDFDELFEDNPEEEYDEDDRDQPINFARN 1056
               S E ++ +     EDE+DE D FD  +     PE++ DE++R   ++   N
Sbjct: 668  DLISLEETD-TEDESTEDEDDELDRFDLHDSSGSEPEDD-DENNRVTLLSTLIN 719


>gnl|CDD|143280 cd05872, Ig_Sema4B_like, Immunoglobulin (Ig)-like domain of the
           class IV semaphorin Sema4B.  Ig_Sema4B_like;
           Immunoglobulin (Ig)-like domain of Sema4B_like. Sema4B
           is a Class IV semaphorin. Semaphorins are classified
           based on structural features additional to the Sema
           domain. Sema4B has extracellular Sema and Ig domains, a
           transmembrane domain and a short cytoplasmic domain.
           Sema4B has been shown to preferentially regulate the
           development of the postsynaptic specialization at the
           glutamatergic synapses. This cytoplasmic domain includes
           a PDZ-binding motif upon which the synaptic localization
           of Sem4B is dependent. Sema4B is a ligand of CLCP1,
           CLCP1 was identified in an expression profiling
           analysis, which compared a highly metastic lung cancer
           subline with its low metastic parental line. Sema4B was
           shown to promote CLCP1 endocytosis, and their
           interaction is a potential target for therapeutic
           intervention of metastasis.
          Length = 85

 Score = 30.1 bits (68), Expect = 1.3
 Identities = 17/61 (27%), Positives = 25/61 (40%), Gaps = 5/61 (8%)

Query: 579 TARWLHNGEPLTSGGRYEITHTDRYLNLRISDARRADRGEYQAHGV-NSLGEDVASFLVT 637
           +  WL NG PL +   Y +  TD    L I        G Y+ +       + VAS+ + 
Sbjct: 26  SPVWLFNGTPLNAQFSYRVG-TD---GLLILVTSPEHSGTYRCYSEEEGFQQLVASYSLN 81

Query: 638 V 638
           V
Sbjct: 82  V 82


>gnl|CDD|222477 pfam13965, SID-1_RNA_chan, dsRNA-gated channel SID-1.  This is a
            family of proteins that are transmembrane dsRNA-gated
            channels. They passively transport dsRNA into cells and
            do not act as ATP-dependent pumps. They are required for
            systemic RNA interference.
          Length = 567

 Score = 32.4 bits (74), Expect = 1.4
 Identities = 14/62 (22%), Positives = 22/62 (35%), Gaps = 2/62 (3%)

Query: 1013 SNITDDEDEEDEEDSFDFDELFEDNP--EEEYDEDDRDQPINFARNRHNKYIEDDQEEIY 1070
             +I   E    E+ + D      +    E E D    DQ I   R + + Y+ D   +  
Sbjct: 150  RDIISFEPSPSEQRAMDLQPDQSEEDSSERENDILMADQQIMVIREKASLYVSDLSRKDQ 209

Query: 1071 HP 1072
             P
Sbjct: 210  RP 211


>gnl|CDD|218003 pfam04281, Tom22, Mitochondrial import receptor subunit Tom22.  The
            mitochondrial protein translocase family, which is
            responsible for movement of nuclear encoded pre-proteins
            into mitochondria, is very complex with at least 19
            components. These proteins include several chaperone
            proteins, four proteins of the outer membrane translocase
            (Tom) import receptor, five proteins of the Tom channel
            complex, five proteins of the inner membrane translocase
            (Tim) and three "motor" proteins. This family represents
            the Tom22 proteins. The N terminal region of Tom22 has
            been shown to have chaperone-like activity, and the C
            terminal region faces the intermembrane face.
          Length = 136

 Score = 31.1 bits (71), Expect = 1.4
 Identities = 10/44 (22%), Positives = 20/44 (45%)

Query: 989  EVDNEMLLKRSKLKPQYSSEMSEASNITDDEDEEDEEDSFDFDE 1032
            EV++E   ++       + E S+  +  D + + D  D  DF+ 
Sbjct: 6    EVEDETFQEKPAAPKNLAQEESDDDDEDDTDTDSDISDDSDFEN 49


>gnl|CDD|143266 cd05858, Ig3_FGFR-2, Third immunoglobulin (Ig)-like domain of
           fibroblast growth factor receptor 2 (FGFR2).
           Ig3_FGFR-2-like; domain similar to the third
           immunoglobulin (Ig)-like domain of human fibroblast
           growth factor receptor 2 (FGFR2). Fibroblast growth
           factors (FGFs) participate in morphogenesis,
           development, angiogenesis, and wound healing. These
           FGF-stimulated processes are mediated by four FGFR
           tyrosine kinases (FGRF1-4). FGFRs are comprised of an
           extracellular portion consisting of three Ig-like
           domains, a transmembrane helix, and a cytoplasmic
           portion having protein tyrosine kinase activity. The
           highly conserved Ig-like domains 2 and 3, and the linker
           region between D2 and D3 define a general binding site
           for FGFs. FGFR2 is required for male sex determination.
          Length = 90

 Score = 29.9 bits (67), Expect = 1.4
 Identities = 23/84 (27%), Positives = 30/84 (35%), Gaps = 13/84 (15%)

Query: 470 VEFTVQVEGIPTPKVSWYK-----------DGFEIFSSRRQRIV--TDNDISTLIIHQAA 516
           VEF  +V     P + W K           DG    +  +   V  TD ++  L +    
Sbjct: 4   VEFVCKVYSDAQPHIQWLKHVEKNGSKYGPDGLPYVTVLKTAGVNTTDKEMEVLYLRNVT 63

Query: 517 LMDEGEIKCTATNRAGHSITKARL 540
             D GE  C A N  G S   A L
Sbjct: 64  FEDAGEYTCLAGNSIGISHHSAWL 87


>gnl|CDD|143250 cd05773, Ig8_hNephrin_like, Eighth immunoglobulin-like domain of
           nephrin.  Ig8_hNephrin_like: domain similar to the
           eighth immunoglobulin-like domain in human nephrin.
           Nephrin is an integral component of the slit diaphragm,
           and is a central component of the glomerular
           ultrafilter. Nephrin plays a structural role, and has a
           role in signaling. Nephrin is a transmembrane protein
           having a short intracellular portion, and an
           extracellular portion comprised of eight Ig-like
           domains, and one fibronectin type III-like domain. The
           extracellular portions of nephrin, from neighboring foot
           processes of separate podocyte cells, may interact with
           each other, and in association with other components of
           the slit diaphragm, form a porous molecular sieve within
           the slit pore.  The intracellular portion of nephrin is
           associated with linker proteins, which connect nephrin
           to the actin cytoskeleton. The intracellular portion is
           tyrosine phosphorylated, and mediates signaling from the
           slit diaphragm into the podocytes.
          Length = 109

 Score = 30.7 bits (69), Expect = 1.4
 Identities = 27/77 (35%), Positives = 32/77 (41%), Gaps = 10/77 (12%)

Query: 574 GMPPPTARWLHNGEPLTSGG-RYEIT-------HTDRYLNLRISDARRADRGEYQAHGVN 625
           G+P    RW  NG PL  G  RYE T       HT     + +S A   D   +     N
Sbjct: 34  GVPRVQFRWAKNGVPLDLGNPRYEETTEHTGTVHTSILTIINVSAAL--DYALFTCTAHN 91

Query: 626 SLGEDVASFLVTVTDRP 642
           SLGED     +  T RP
Sbjct: 92  SLGEDSLDIQLVSTSRP 108


>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
          Length = 413

 Score = 32.2 bits (73), Expect = 1.6
 Identities = 21/101 (20%), Positives = 29/101 (28%), Gaps = 12/101 (11%)

Query: 999  SKLKPQYSSEMSEASNITDDEDEEDEEDSFDFDELFEDNPEEEYDEDDRDQPINFARNRH 1058
            S L P+ +S  S  S  +                  E  P E ++     QP +F +  H
Sbjct: 116  SGLSPENTSGSSPESPASHSPPPSPPSHPGPH----EPAPPESHNPSPNQQPSSFLQPSH 171

Query: 1059 NKYIEDDQEEIYHPKLMTMRSSQEDLDEAPPVPEHLDDGPE 1099
                ED  EE   P        + D    P         P 
Sbjct: 172  ----EDSPEEPEPPT----SEPEPDSPGPPQSETPTSSPPP 204


>gnl|CDD|240226 PTZ00007, PTZ00007, (NAP-L) nucleosome assembly protein -L;
            Provisional.
          Length = 337

 Score = 32.1 bits (73), Expect = 1.6
 Identities = 11/46 (23%), Positives = 17/46 (36%)

Query: 1015 ITDDEDEEDEEDSFDFDELFEDNPEEEYDEDDRDQPINFARNRHNK 1060
               DED +   D  D D    D+ +    + + D   N   +R  K
Sbjct: 278  EAIDEDSDYSSDEDDDDYDSYDSSDSASSDSNSDVDTNEEDDRGEK 323


>gnl|CDD|227504 COG5177, COG5177, Uncharacterized conserved protein [Function
            unknown].
          Length = 769

 Score = 32.4 bits (73), Expect = 1.7
 Identities = 33/205 (16%), Positives = 64/205 (31%), Gaps = 35/205 (17%)

Query: 872  APSDNENESDYFPEKTIDESVYGYDTIVYGYDSDDLDRHYPTLDEEEEEEDRESLVKDRE 931
                 +   + F ++     +          + D LD + P  ++   + D +       
Sbjct: 300  NGQYEQTIREIFADRATKLELDLQTVFESNMNRDTLDEYAPEGEDLRSDYDEDF------ 353

Query: 932  SSVKGKEEEAKVIKDDEYYENLGDVLTKKYSLPV-NSDIQIKIDKPDDEPDYVIKGKYEV 990
                  +    V  DD  +    +  +KK ++P   S  Q K  + ++E D     +   
Sbjct: 354  ----EYDGLTTVRIDDHGFLPGREQTSKKAAVPKGTSFYQAKWAEDEEEEDGQCNDEEST 409

Query: 991  DNEMLLKRSKLKPQYSSEMSEASNITDDEDEE-DEEDSFDFDELFEDNPEEEYDEDDR-- 1047
             + +             +  E  N     DEE   +D+  F+EL  +  E +  E     
Sbjct: 410  MSAI----------DDDDPKENDNEEVAGDEESAIDDNEGFEELSPEEEERQLREFRDME 459

Query: 1048 -----------DQPINFARNRHNKY 1061
                        QP   A  R+ +Y
Sbjct: 460  KEDREFPDEAELQPSESAIERYKEY 484


>gnl|CDD|143282 cd05874, Ig6_NrCAM, Sixth immunoglobulin (Ig)-like domain of NrCAM
           (Ng (neuronglia) CAM-related cell adhesion molecule).
           Ig6_NrCAM: sixth immunoglobulin (Ig)-like domain of
           NrCAM (Ng (neuronglia) CAM-related cell adhesion
           molecule). NrCAM belongs to the L1 subfamily of cell
           adhesion molecules (CAMs) and is comprised of an
           extracellular region having six Ig-like domains and five
           fibronectin type III domains, a transmembrane region,
           and an intracellular domain. NrCAM is primarily
           expressed in the nervous system.
          Length = 77

 Score = 29.5 bits (66), Expect = 1.7
 Identities = 17/66 (25%), Positives = 32/66 (48%), Gaps = 4/66 (6%)

Query: 475 QVEGIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIH----QAALMDEGEIKCTATNR 530
           + +G P P  SW ++G      +  ++    +  TL+I+    + A   EG  +CTA N 
Sbjct: 6   EAKGKPPPSFSWTRNGTHFDIDKDPKVTMKPNTGTLVINIMNGEKAEAYEGVYQCTARNE 65

Query: 531 AGHSIT 536
            G +++
Sbjct: 66  RGAAVS 71


>gnl|CDD|227382 COG5049, XRN1, 5'-3' exonuclease [DNA replication, recombination, and
            repair / Cell division and chromosome partitioning /
            Translation].
          Length = 953

 Score = 32.2 bits (73), Expect = 1.8
 Identities = 35/186 (18%), Positives = 63/186 (33%), Gaps = 20/186 (10%)

Query: 917  EEEEEDRESLVKDRESSVKGKEEEAKVIKDDEYYENLGDVLTKKYSLPVNSDIQIKIDKP 976
            +  +E+R++   +R S  K ++E  K +    Y +       K     ++   + K    
Sbjct: 380  DHIQEERKNESLERFSLRKERKEGLKGMPRVVYEQKKLIGSIKPTL--MDQLQEKKSPDL 437

Query: 977  DDEPDY----VIKGKYEVDNEMLLKRSKLKPQYSSEMSEASNITDDEDEEDEEDSFDFDE 1032
             DE       + K     ++E+ LKR       S   +  S      + +   DS D DE
Sbjct: 438  PDEEFIDTLALPKDLDMKNHELFLKRFANDLGLSISKAIKSKGNYSLEMDIASDSPDEDE 497

Query: 1033 LFEDNPEEEYDEDDR-----------DQPINFARNRHNKYIEDDQEEIYHPKLMTMRSSQ 1081
               +  E E D   +           ++  N      N      +E  Y  KL     S+
Sbjct: 498  ---EEFESEVDSIRKIPDKYVNIIVEEEEENETEKTVNLRFPGWKERYYTSKLHFTTDSE 554

Query: 1082 EDLDEA 1087
            E + + 
Sbjct: 555  EKIRDM 560


>gnl|CDD|143271 cd05863, Ig2_VEGFR-3, Second immunoglobulin (Ig)-like domain of
           vascular endothelial growth factor receptor 3 (VEGFR-3).
            Ig2_VEGFR-3: Second immunoglobulin (Ig)-like domain of
           vascular endothelial growth factor receptor 3 (VEGFR-3).
           The VEGFRs have an extracellular component with seven
           Ig-like domains, a transmembrane segment, and an
           intracellular tyrosine kinase domain interrupted by a
           kinase-insert domain. VEGFRs bind VEGFs with high
           affinity at the Ig-like domains. VEGFR-3 (Flt-4) binds
           two members of the VEGF family (VEGF-C and -D) and is
           involved in tumor angiogenesis and growth.
          Length = 67

 Score = 28.8 bits (64), Expect = 2.4
 Identities = 16/62 (25%), Positives = 24/62 (38%), Gaps = 8/62 (12%)

Query: 470 VEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIHQAALMDEGEIKCTATN 529
           V+  V+V   P P+  WYKDG ++ S +  +        +L I        G       N
Sbjct: 1   VKLPVKVAAYPPPEFQWYKDG-KLISGKHSQ-------HSLQIKDVTEASAGTYTLVLWN 52

Query: 530 RA 531
            A
Sbjct: 53  SA 54


>gnl|CDD|218555 pfam05320, Pox_RNA_Pol_19, Poxvirus DNA-directed RNA polymerase 19
            kDa subunit.  This family contains several DNA-directed
            RNA polymerase 19 kDa polypeptides. The Poxvirus
            DNA-directed RNA polymerase (EC: 2.7.7.6) catalyzes
            DNA-template-directed extension of the 3'-end of an RNA
            strand by one nucleotide at a time.
          Length = 167

 Score = 30.8 bits (70), Expect = 2.5
 Identities = 9/37 (24%), Positives = 21/37 (56%)

Query: 1009 MSEASNITDDEDEEDEEDSFDFDELFEDNPEEEYDED 1045
            M ++ +I D E ++D+ + ++ +E  E++ E     D
Sbjct: 1    MEDSDDIIDYESDDDDSEEYEEEEEDEEDAESLESSD 37


>gnl|CDD|219293 pfam07093, SGT1, SGT1 protein.  This family consists of several
            eukaryotic SGT1 proteins. Human SGT1 or hSGT1 is known to
            suppress GCR2 and is highly expressed in the muscle and
            heart. The function of this family is unknown although it
            has been speculated that SGT1 may be functionally
            analogous to the Gcr2p protein of Saccharomyces
            cerevisiae which is known to be a regulatory factor of
            glycolytic gene expression.
          Length = 557

 Score = 31.6 bits (72), Expect = 2.5
 Identities = 19/85 (22%), Positives = 36/85 (42%), Gaps = 7/85 (8%)

Query: 1008 EMSEASNITDDEDEEDEEDSFDFDELFE------DNPEEEYDEDDRDQPINFARNRHNKY 1061
               E  +  + +D ED++ SFD DE FE         ++E D D  D   + A    ++ 
Sbjct: 437  ADDEDEDDDEPDDSEDKDVSFDEDEFFEFLKNMLGLKDDEIDNDLPDDS-DDADEDDDED 495

Query: 1062 IEDDQEEIYHPKLMTMRSSQEDLDE 1086
             ++D++      L  +    + +D 
Sbjct: 496  DDEDEDSSSDSTLEELEEYMDQMDA 520


>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457).  This is a
            family of uncharacterized proteins.
          Length = 449

 Score = 31.5 bits (71), Expect = 2.9
 Identities = 18/55 (32%), Positives = 31/55 (56%)

Query: 994  MLLKRSKLKPQYSSEMSEASNITDDEDEEDEEDSFDFDELFEDNPEEEYDEDDRD 1048
              +K+     +   E  E +   +D+DEED++D  D DE  +D+ ++E DED+ D
Sbjct: 30   DTMKKENAIRKLGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDD 84


>gnl|CDD|197329 cd09095, INPP5c_INPP5E-like, Catalytic inositol polyphosphate
            5-phosphatase (INPP5c) domain of Inositol
            polyphosphate-5-phosphatase E and related proteins.
            INPP5c domain of Inositol polyphosphate-5-phosphatase E
            (also called type IV or 72 kDa 5-phosphatase), rat
            pharbin, and related proteins. This subfamily belongs to
            a family of Mg2+-dependent inositol polyphosphate
            5-phosphatases, which hydrolyze the 5-phosphate from the
            inositol ring of various 5-position phosphorylated
            phosphoinositides (PIs) and inositol phosphates (IPs),
            and to the large EEP
            (exonuclease/endonuclease/phosphatase) superfamily that
            contains functionally diverse enzymes that share a common
            catalytic mechanism of cleaving phosphodiester bonds.
            INPP5E hydrolyzes the 5-phosphate from PI(3,5)P2,
            PI(4,5)P2 and PI(3,4,5)P3, forming PI3P, PI4P, and
            PI(3,4)P2, respectively. It is a very potent PI(3,4,5)P3
            5-phosphatase. Its intracellular localization is chiefly
            cytosolic, with pronounced perinuclear/Golgi
            localization. INPP5E also has an N-terminal proline rich
            domain (PRD) and a C-terminal CAAX motif. This protein is
            expressed in a variety of tissues, including the breast,
            brain, testis, and haemopoietic cells. It is
            differentially expressed in several cancers, for example,
            it is up-regulated in cervical cancer and down-regulated
            in stomach cancer. It is a candidate target for
            therapeutics of obesity and related disorders, as it is
            expressed in the hypothalamus, and following insulin
            stimulation, it undergoes tyrosine phosphorylation,
            associates with insulin receptor substrate-1, -2, and
            PI3-kinase, and become active as a 5-phosphatase. INPP5E
            may play a role, along with other 5-phosphatases SHIP2
            and SKIP, in regulating glucose homoeostasis and energy
            metabolism. Mice deficient in INPPE5 develop a
            multi-organ disorder associated with structural defects
            of the primary cilium.
          Length = 298

 Score = 31.2 bits (71), Expect = 3.0
 Identities = 19/72 (26%), Positives = 30/72 (41%), Gaps = 4/72 (5%)

Query: 942  KVIKDDEYYENLGDVLTKKYSLPVNSDIQIKIDKPDDEPDYVIKGKYEVDNEMLLKRSKL 1001
            + +  + Y    GDV T+   +    D   ++  P    D +I    EVD   LL+    
Sbjct: 156  RNVPTNPYKSESGDVTTRFDEVFWFGDFNFRLSGPRHLVDALINQGQEVDVSALLQ---- 211

Query: 1002 KPQYSSEMSEAS 1013
              Q + EMS+ S
Sbjct: 212  HDQLTREMSKGS 223


>gnl|CDD|220284 pfam09538, FYDLN_acid, Protein of unknown function (FYDLN_acid).
            Members of this family are bacterial proteins with a
            conserved motif [KR]FYDLN, sometimes flanked by a pair of
            CXXC motifs, followed by a long region of low complexity
            sequence in which roughly half the residues are Asp and
            Glu, including multiple runs of five or more acidic
            residues. The function of members of this family is
            unknown.
          Length = 104

 Score = 29.2 bits (66), Expect = 3.2
 Identities = 14/58 (24%), Positives = 29/58 (50%), Gaps = 14/58 (24%)

Query: 1011 EASNITDDEDEEDEEDSFDFDELFEDNPEEEYDEDDRDQPINFARNRHNKYIEDDQEE 1068
            E  +    +D++D++D  D  +L +D+ + + D+DD              ++EDD +E
Sbjct: 61   EDEDDVVLDDDDDDDDDDDLPDLDDDDVDLDDDDDD--------------FLEDDDDE 104


>gnl|CDD|220149 pfam09234, DUF1963, Domain of unknown function (DUF1963).  This
            domain is found in a set of hypothetical bacterial
            proteins. Its exact function has not, as yet, been
            described.
          Length = 221

 Score = 30.8 bits (70), Expect = 3.5
 Identities = 20/91 (21%), Positives = 34/91 (37%), Gaps = 9/91 (9%)

Query: 965  VNSDIQIKIDKPDDEPDYVIKGKYEVDNEMLLKRSKLKPQYSSEM----SEASNITDDED 1020
            ++ D     D P+D+  + +   Y  D    L    L   +S        E     +  D
Sbjct: 52   IDLDDDDWGDSPEDQTGFRVI--YFEDIIEDLLPKDLIEDFSFLKAPFEGELKLPFEKSD 109

Query: 1021 EEDEEDSFDFDELFEDNP---EEEYDEDDRD 1048
            E   ED + F++ +E      EEE +E   +
Sbjct: 110  EPISEDDYSFEQEYESEILELEEEDEELIEE 140


>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin.  Nucleoplasmins are also
            known as chromatin decondensation proteins. They bind to
            core histones and transfer DNA to them in a reaction that
            requires ATP. This is thought to play a role in the
            assembly of regular nucleosomal arrays.
          Length = 146

 Score = 30.0 bits (68), Expect = 3.5
 Identities = 13/25 (52%), Positives = 17/25 (68%)

Query: 1017 DDEDEEDEEDSFDFDELFEDNPEEE 1041
             D+DEEDEE+  D ++  ED  EEE
Sbjct: 114  SDDDEEDEEEEDDEEDDDEDESEEE 138


>gnl|CDD|218391 pfam05029, TIMELESS_C, Timeless protein C terminal region.  The
            timeless (tim) gene is essential for circadian function
            in Drosophila. Putative homologues of Drosophila tim have
            been identified in both mice and humans (mTim and hTIM,
            respectively). Mammalian TIM is not the true orthologue
            of Drosophila TIM, but is the likely orthologue of a fly
            gene, timeout (also called tim-2). mTim has been shown to
            be essential for embryonic development, but does not have
            substantiated circadian function. Some family members
            contain a SANT domain in this region.
          Length = 507

 Score = 31.2 bits (70), Expect = 3.5
 Identities = 19/83 (22%), Positives = 30/83 (36%), Gaps = 4/83 (4%)

Query: 989  EVDNEMLLKRSKLKPQYSSEMSEASNITDDEDEEDEEDSFDFDELFEDNPEEEYDEDDR- 1047
            E   E   K    K Q + +  + +    + DEE ++ S D D    D+        D+ 
Sbjct: 425  EALGEEEQKAPPKKKQLNQKNKQQTGSGTNSDEERDDTSLDEDRDLADDGGLPRIHKDKR 484

Query: 1048 ---DQPINFARNRHNKYIEDDQE 1067
                   +    R  K +EDD E
Sbjct: 485  AGASLTQSPLSRRRLKVVEDDDE 507


>gnl|CDD|214441 MTH00157, ATP6, ATP synthase F0 subunit 6; Provisional.
          Length = 223

 Score = 30.5 bits (70), Expect = 3.7
 Identities = 12/25 (48%), Positives = 15/25 (60%), Gaps = 5/25 (20%)

Query: 824 QSTSPMLAAFMLLMLFTFIETISSF 848
           Q T P+L  FM+L     IETIS+ 
Sbjct: 131 QGTPPILMPFMVL-----IETISNL 150


>gnl|CDD|165173 PHA02826, PHA02826, IL-1 receptor-like protein; Provisional.
          Length = 227

 Score = 30.3 bits (68), Expect = 4.2
 Identities = 19/57 (33%), Positives = 28/57 (49%), Gaps = 5/57 (8%)

Query: 484 VSWYKDGFEIFSSRRQRIVTDNDISTLIIHQAALMDEGEIKCT---ATNRAGHSITK 537
           ++WYK+G  +  +   RI   N+ STL+I  A   D G   C      N   ++ITK
Sbjct: 166 LTWYKNGNIVLYT--DRIQLRNNNSTLVIKSATHDDSGIYTCNLRFNKNSNNYNITK 220


>gnl|CDD|233148 TIGR00844, c_cpa1, na(+)/h(+) antiporter.  The Monovalent
            Cation:Proton Antiporter-1 (CPA1) Family (TC 2.A.36) The
            CPA1 family is a large family of proteins derived from
            Gram-positive and Gram-negative bacteria, blue green
            bacteria, yeast, plants and animals. Transporters from
            eukaryotes have been functionally characterized, and all
            of these catalyze Na+:H+ exchange. Their primary
            physiological functions may be in (1) cytoplasmic pH
            regulation, extruding the H+ generated during metabolism,
            and (2) salt tolerance (in plants), due to Na+ uptake
            into vacuoles. This model is specific for the fungal
            members of this family [Transport and binding proteins,
            Cations and iron carrying compounds].
          Length = 810

 Score = 31.0 bits (70), Expect = 4.3
 Identities = 48/252 (19%), Positives = 93/252 (36%), Gaps = 63/252 (25%)

Query: 842  IETISSFRDKYVDNEDDYDI--VETNEHTGTGAPSDNENESDYFPEKTIDESVYGYDTIV 899
            + TI    DK   + ++ D+  V T+ + G  +  D+                       
Sbjct: 582  VNTIYGL-DKLARDTENRDVTYVPTSRYDGIESEIDDVYT-------------------- 620

Query: 900  YGYDSDDLD----RHYPTLDEEEEE------EDRESLVKDRESSV-KGKEEEAKVIKDDE 948
            Y  DS+ +     R    L EEE++      ED + ++++R+  + +  +   +  +D E
Sbjct: 621  YENDSESIASSERRRIKKLREEEQQAYIAYTEDNQVIIENRQGEILEYVDIHDRGARDAE 680

Query: 949  YYENLGDVLTKKYSLPVNSDIQIKIDKPDDEPDYVIKGKYEVDNEMLLK----------- 997
               + G  L +  S P+    QI  ++      Y     Y+V N+++++           
Sbjct: 681  VGVHNGGRLKRALSPPLEKLHQI-TNEAKKSKYYA----YKVGNDLIIEDESGEVFRRYR 735

Query: 998  ------RSKLKPQYSSEMS--EASNITDDED--EEDEEDSFDF-DELFEDNPEEEYDED- 1045
                  + K+K +  S +S  E   I       E    D     DE+ +D  E E  +D 
Sbjct: 736  ISPHGGKRKIKKRNDSVVSVDEEKAIEGPSRVPERGNHDLLHSEDEMADDEAESENMDDY 795

Query: 1046 -DRDQPINFARN 1056
             D D     +++
Sbjct: 796  EDSDDNAYESKD 807


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 30.9 bits (70), Expect = 4.9
 Identities = 15/72 (20%), Positives = 35/72 (48%), Gaps = 2/72 (2%)

Query: 1015 ITDDEDEEDEEDSFDFDELFEDNPEEEYDEDDRDQPINFARNRHNKYIEDDQEEIYHPKL 1074
            + D+E+EE++E+  + +E   D  EE  +E+++++     +       E   E +   K 
Sbjct: 35   VPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEW--ELLNKTKP 92

Query: 1075 MTMRSSQEDLDE 1086
            +  R+ ++   E
Sbjct: 93   IWTRNPKDVTKE 104


>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT.  This family
            consists of several bacterial cobalamin biosynthesis
            (CobT) proteins. CobT is involved in the transformation
            of precorrin-3 into cobyrinic acid.
          Length = 282

 Score = 30.6 bits (69), Expect = 4.9
 Identities = 18/84 (21%), Positives = 37/84 (44%), Gaps = 13/84 (15%)

Query: 1007 SEMSEASNITDDEDEEDEEDSFDFDELFEDNPEEEYDED---DRDQPINFARNRHNKYIE 1063
            S M  A  + D+ +  D ED+ D     ED+P+E+ D+D   + +   + + +  +    
Sbjct: 204  SSMDMAEELGDEPESADSEDNED-----EDDPKEDEDDDQGEEEESGSSDSLSEDSDASS 258

Query: 1064 DDQEEIYHPKLMTM-RSSQEDLDE 1086
            ++ E       M    +S +D  +
Sbjct: 259  EEMESGE----MEAAEASADDTPD 278


>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
           with the myosin-like domain [Function unknown].
          Length = 499

 Score = 30.8 bits (69), Expect = 5.0
 Identities = 39/216 (18%), Positives = 65/216 (30%), Gaps = 17/216 (7%)

Query: 112 AIQTLYKAKLLMKRDRAAYTELKQACVSVQQRWRANLTMRKQRAHFLLMKQKASVIQQWY 171
               +++   +  + RA  TEL  A     ++  A       R+     +Q+   ++Q  
Sbjct: 69  LRSGVFQLDDIRPQLRALRTELGTA---QGEKRAAETEREAARSELQKARQEREAVRQ-- 123

Query: 172 RNTKLMRLEASYLHELKAATITIQRRYRANVAMRTQRERYVALRTATITIQTRFRAYLIA 231
                 +  A    EL   T   Q        +  QR +  A   +    Q + +A    
Sbjct: 124 ELAAARQNLAKAQQELARLTKQAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQLQASATQ 183

Query: 232 KNQRDEYAELKQARRFRFKLNLRKYERVIELLKLKREQERQEKYRHQCAVKIQSLWKMYR 291
              +    +L+ A+  +   NL       +      E  R+     Q A  IQ       
Sbjct: 184 LKSQVLDLKLRSAQIEQEAQNLATRANAAQ--ARTEELARRAAAAQQTAQAIQQR----- 236

Query: 292 VRKKFADIIEQKKQAKKTADNQFENQAPLYVRLEEA 327
                   I QK Q       Q   +     RLE A
Sbjct: 237 -----DAQISQKAQQIAARAEQIRERERQLQRLETA 267


>gnl|CDD|143259 cd05851, Ig3_Contactin-1, Third Ig domain of contactin-1.
           Ig3_Contactin-1: Third Ig domain of the neural cell
           adhesion molecule contactin-1. Contactins are comprised
           of six Ig domains followed by four fibronectin type III
           (FnIII) domains anchored to the membrane by
           glycosylphosphatidylinositol. Contactin-1 is
           differentially expressed in tumor tissues and may
           through a RhoA mechanism, facilitate invasion and
           metastasis of human lung adenocarcinoma.
          Length = 88

 Score = 28.4 bits (63), Expect = 5.3
 Identities = 23/87 (26%), Positives = 34/87 (39%), Gaps = 12/87 (13%)

Query: 461 DTTALEDEKVEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRIVTDNDIST----LIIHQAA 516
           DT AL+ + V       G P P + W K          + +    +IS     L I    
Sbjct: 10  DTYALKGQNVTLECFALGNPVPVIRWRK--------ILEPMPATAEISMSGAVLKIFNIQ 61

Query: 517 LMDEGEIKCTATNRAGHSITKARLRLE 543
             DEG  +C A N  G    +AR+ ++
Sbjct: 62  PEDEGTYECEAENIKGKDKHQARVYVQ 88


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 30.8 bits (70), Expect = 5.3
 Identities = 32/149 (21%), Positives = 54/149 (36%), Gaps = 16/149 (10%)

Query: 905  DDLDRHYPTLDEEEEEEDRESLVKDRESS-----------VKGKEEEAKVIKDDEYYENL 953
            +DLD+    L+E+EE E++E   + R  S            K K++E K  K        
Sbjct: 1132 EDLDKFEEALEEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSAD--KS 1189

Query: 954  GDVLTKKYSLPVNSDIQIKIDKPDDEPDYVIKGKYEVDNEMLLKRSKLKPQYSSEMSEAS 1013
                    S  V+SD + K+D   D       G  + D+E    + K       +  + +
Sbjct: 1190 KKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNN 1249

Query: 1014 NITDDEDEEDEEDSFDFDELFEDNPEEEY 1042
                 +  ED ++    D   E  P+   
Sbjct: 1250 ---SSKSSEDNDEFSSDDLSKEGKPKNAP 1275


>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein.  CDC45 is an essential gene
            required for initiation of DNA replication in S.
            cerevisiae, forming a complex with MCM5/CDC46. Homologues
            of CDC45 have been identified in human, mouse and smut
            fungus among others.
          Length = 583

 Score = 30.7 bits (70), Expect = 5.6
 Identities = 13/53 (24%), Positives = 29/53 (54%)

Query: 1017 DDEDEEDEEDSFDFDELFEDNPEEEYDEDDRDQPINFARNRHNKYIEDDQEEI 1069
            DDE+ ++E++     E  ED+ +++ D+D   +  +  R R  +  E+ + E+
Sbjct: 129  DDEESDEEDEESSKSEDDEDDDDDDDDDDIATRERSLERRRRRREWEEKRAEL 181


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part of
            a large ribonucleoprotein complex containing the U3
            snoRNA. Depletion of the Utp proteins impedes production
            of the 18S rRNA, indicating that they are part of the
            active pre-rRNA processing complex. This large RNP
            complex has been termed the small subunit (SSU)
            processome.
          Length = 728

 Score = 30.4 bits (69), Expect = 5.8
 Identities = 32/198 (16%), Positives = 65/198 (32%), Gaps = 28/198 (14%)

Query: 915  DEEEEEEDRESLVKDRESSVKGKEEEAKVIKDDEYYENLGDVLTKKYSLPVNSDIQIKID 974
            D E EE  RE   ++     + +E   K +   ++    G+   +   L   +  + K  
Sbjct: 391  DAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEK 450

Query: 975  KPDDEPDY---VIKGKYEVDNEMLLKRSKLK-------------PQYSSEMSEASNITDD 1018
            K  DE +      + K E     LLKRS+               P   +  S   +    
Sbjct: 451  KESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQ 510

Query: 1019 EDEEDEEDSFDFDELFEDN---------PEEEYDEDDRDQPINFARNRHNKYIEDDQEEI 1069
            + ++      D                  +E+  + D D        + +   E+D+++ 
Sbjct: 511  DSKKKSSSKLDKAANKISKAAVKVKKKKKKEKSIDLDDDLIDEEDSIKLDVDDEEDEDDE 570

Query: 1070 YHPKLMTMRSSQEDLDEA 1087
               +L  +   ++ + EA
Sbjct: 571  ---ELPFLFKQKDLIKEA 585


>gnl|CDD|217829 pfam03985, Paf1, Paf1.  Members of this family are components of the
            RNA polymerase II associated Paf1 complex. The Paf1
            complex functions during the elongation phase of
            transcription in conjunction with Spt4-Spt5 and
            Spt16-Pob3i.
          Length = 431

 Score = 30.5 bits (69), Expect = 5.9
 Identities = 36/167 (21%), Positives = 64/167 (38%), Gaps = 36/167 (21%)

Query: 904  SDDLDRHYPTLDEEEEEED------RESLVKDRESSVKGKEEEAKVIKDDE---YYENL- 953
             D L++    L + +E+E+      RE  +K +  + K  E     + D+    YY+ L 
Sbjct: 255  EDTLEKRSDDLHDYDEDEEYKFKRVREYDMKVKSKATKLNELALFFVSDENGVVYYKPLR 314

Query: 954  ----------GDVLTKKYSLPVNSDIQIKIDKPDDEPDYVIKGKYEVDNEMLLKRSKLKP 1003
                       DV+        N  + +K+  P  +                 +R++L P
Sbjct: 315  SRVELRRRRVNDVIRPLVREHNNDQLNVKLRNPSTK----------ESKMRDKRRARLDP 364

Query: 1004 QYSSEMSEASNITDDEDEEDEEDSFDFDELFEDNPEEEYDEDDRDQP 1050
                E+ E      DEDEE+E+ S + +E   ++ EEE  +   D  
Sbjct: 365  IDFEEVDE------DEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGS 405


>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM).  This
           family consists of several Plasmodium falciparum SPAM
           (secreted polymorphic antigen associated with
           merozoites) proteins. Variation among SPAM alleles is
           the result of deletions and amino acid substitutions in
           non-repetitive sequences within and flanking the alanine
           heptad-repeat domain. Heptad repeats in which the a and
           d position contain hydrophobic residues generate
           amphipathic alpha-helices which give rise to helical
           bundles or coiled-coil structures in proteins. SPAM is
           an example of a P. falciparum antigen in which a
           repetitive sequence has features characteristic of a
           well-defined structural element.
          Length = 164

 Score = 29.4 bits (66), Expect = 6.4
 Identities = 24/99 (24%), Positives = 36/99 (36%), Gaps = 13/99 (13%)

Query: 852 YVDNEDDYDIVETNEHTGTGAPSDNENESDYFPEKTIDESVYGYDTIVYGYDSDDLDRHY 911
            + + D  DI++ NE        D+E E +   E+  +               D  D   
Sbjct: 27  KITSWDKEDIIKENEDVKDEKQEDDEEEEEEDEEEIEEPE-------------DIEDEEE 73

Query: 912 PTLDEEEEEEDRESLVKDRESSVKGKEEEAKVIKDDEYY 950
              DEEEEEED E  V  ++   K   +     +DD   
Sbjct: 74  IVEDEEEEEEDEEDNVDLKDIEKKNINDIFNSTQDDNAQ 112


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
            subunit (TFIIF-alpha).  Transcription initiation factor
            IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
            II-associating protein 74 (RAP74) is the large subunit of
            transcription factor IIF (TFIIF), which is essential for
            accurate initiation and stimulates elongation by RNA
            polymerase II.
          Length = 528

 Score = 30.3 bits (68), Expect = 6.6
 Identities = 41/185 (22%), Positives = 62/185 (33%), Gaps = 28/185 (15%)

Query: 849  RDKYVDNEDDYDIVETNEHTGTGAPSDNENESDYFPEKTID---ESVYGYDTIVYGYDSD 905
            +D   D+EDD D  +     G    S  + +     +K +D   +   G D     YDSD
Sbjct: 216  KDLEGDDEDDGDESDKGGEDGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDSD 275

Query: 906  DLDRHYPTLDEEEEEEDRESLVKDRESSVKGKEEEAKVIKDDEYYENLGDVLTKKYSLPV 965
            D        D+E  EED  S   D  +S    EE    +  +                P 
Sbjct: 276  D-------GDDEGREEDYIS---DSSASGNDPEEREDKLSPEI---------------PA 310

Query: 966  NSDIQIKIDKPDDEPDYVIKGKYEVDNEMLLKRSKLKPQYSSEMSEASNITDDEDEEDEE 1025
              +I+   D  + E +   +          LK+ K K     +    S    D+ + D E
Sbjct: 311  KPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGDDSDDSDIDGE 370

Query: 1026 DSFDF 1030
            DS   
Sbjct: 371  DSVSL 375


>gnl|CDD|187811 cd09680, Cas10_III, CRISPR/Cas system-associated protein Cas10.
            CRISPR (Clustered Regularly Interspaced Short Palindromic
            Repeats) and associated Cas proteins comprise a system
            for heritable host defense by prokaryotic cells against
            phage and other foreign DNA; Multidomain protein with
            permuted HD nuclease domain, palm domain and Zn-ribbon;
            signature gene for type III; also known as Csm1 family.
          Length = 650

 Score = 30.4 bits (69), Expect = 6.7
 Identities = 24/105 (22%), Positives = 35/105 (33%), Gaps = 1/105 (0%)

Query: 902  YDSDDLDRHYPTLDEEEEEEDRESLVKDRESSVKGKEEEAKVIKDDEYYENLGDVLTKKY 961
            Y   +L    P   +E  +  RE  V  RE     ++E+  +    E    LG  L K  
Sbjct: 372  YSYLELAALNPRDSKEGSKGTRECKVCGREEP-IAEDEDEGLCPTCERLYELGKELLKDD 430

Query: 962  SLPVNSDIQIKIDKPDDEPDYVIKGKYEVDNEMLLKRSKLKPQYS 1006
            S  V          P     Y++      + E L    +L   YS
Sbjct: 431  SFLVTEKEDGGKKLPKFNGYYLLFAYEADEYEELALEDELVRIYS 475


>gnl|CDD|184468 PRK14035, PRK14035, citrate synthase; Provisional.
          Length = 371

 Score = 30.1 bits (68), Expect = 6.7
 Identities = 10/31 (32%), Positives = 17/31 (54%)

Query: 167 IQQWYRNTKLMRLEASYLHELKAATITIQRR 197
           I + Y++ ++MR  A Y+ E     I I+ R
Sbjct: 341 ILEQYKDNRIMRPRAKYIGETNRKYIPIEER 371


>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon. 
          Length = 431

 Score = 30.0 bits (67), Expect = 6.9
 Identities = 37/184 (20%), Positives = 64/184 (34%), Gaps = 15/184 (8%)

Query: 912  PTLDEEEEEEDRESLVKDRESSVKGKEEEAKVIKDDE-------YYENLGDVLTKKYSLP 964
             T++EEE+EE RE   +  E+    K E+    +D E         E   +   K+ SL 
Sbjct: 109  ETVEEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEKPKRGSLE 168

Query: 965  VNSDIQIKIDKPDDEPDYVIKGKYEVDNEMLLKRSKLKPQYSSEMSEASNITDDEDEEDE 1024
             N+   +       E  +   G      E   +  KLK +      E     ++  ++ E
Sbjct: 169  ENNGEFMTHKLKHTENTFSRGGAEGAQVEAGKEFEKLKQKQQEAALEL----EELKKKRE 224

Query: 1025 EDSFDFDELFEDNPEEEYDEDDRDQPINFARNRHNKYIEDDQEEIYHPKLMTMRSSQEDL 1084
            E     +E  +   +EE D   R++       R  K   + +      K   +       
Sbjct: 225  ERRKVLEEEEQRRKQEEADRKSREE----EEKRRLKEEIERRRAEAAEKRQKVPEDGLSE 280

Query: 1085 DEAP 1088
            D+ P
Sbjct: 281  DKKP 284


>gnl|CDD|129705 TIGR00618, sbcc, exonuclease SbcC.  All proteins in this family for
           which functions are known are part of an exonuclease
           complex with sbcD homologs. This complex is involved in
           the initiation of recombination to regulate the levels
           of palindromic sequences in DNA. This family is based on
           the phylogenomic analysis of JA Eisen (1999, Ph.D.
           Thesis, Stanford University) [DNA metabolism, DNA
           replication, recombination, and repair].
          Length = 1042

 Score = 30.3 bits (68), Expect = 7.4
 Identities = 43/295 (14%), Positives = 88/295 (29%), Gaps = 24/295 (8%)

Query: 38  IRSKTIVIQKYFRGYLLMRKERQEYLAMKSSAVKIQEWYRNLQCMRQARQ--QYLALKHA 95
           +++ T+ +Q       L   E        +   K+Q            +Q  Q LALK  
Sbjct: 589 LQNITVRLQDL--TEKLSEAEDMLACEQHALLRKLQPEQDLQDVRLHLQQCSQELALKLT 646

Query: 96  TLKQREEFLKLKHATIAIQTLYKAKLLMKRDRAAYTELKQACVSVQQRWRANLTMRKQ-- 153
            L   +        T+  + + +  L ++         +Q  +   Q  +  LT  K+  
Sbjct: 647 ALHALQL-------TLTQERVREHALSIRVLPKELLASRQLALQKMQSEKEQLTYWKEML 699

Query: 154 -RAHFLLMKQKASVIQQWYRNTKLMRLEASYLHELKAATITIQRRYRANVAMRTQRERYV 212
            +   LL + +  + +      ++    +S   +L A    + +            +   
Sbjct: 700 AQCQTLLRELETHIEEYDREFNEIENASSSLGSDLAAREDALNQ-----SLKELMHQART 754

Query: 213 ALRTATITIQTRFRAYLIAKNQRDEYAELKQARRFRFKLNLRKYERVIELLKLKREQERQ 272
            L+  T            A     E + L    +F      R  E    LLK    +  Q
Sbjct: 755 VLKARTEAHFNNNEEVTAALQTGAELSHLAAEIQFF----NRLREEDTHLLKTLEAEIGQ 810

Query: 273 EKYRHQCAVKIQSLWKMYRVRKKFADIIEQKKQAKKTADNQFENQAPLYVRLEEA 327
           E       +       + +  ++F   +E+K        +Q         +L + 
Sbjct: 811 EI-PSDEDILNLQCETLVQEEEQFLSRLEEKSATLGEITHQLLKYEECSKQLAQL 864


>gnl|CDD|143234 cd05757, Ig2_IL1R_like, Second immunoglobulin (Ig)-like domain of
           interleukin-1 receptor (IL1R) and similar proteins.
           Ig2_IL1R_like: domain similar to the second
           immunoglobulin (Ig)-like domain of interleukin-1
           receptor (IL1R).  IL-1 alpha and IL-1 beta are cytokines
           which participate in the regulation of inflammation,
           immune responses, and hematopoiesis. These cytokines
           bind to the IL-1 receptor type 1 (IL1R1), which is
           activated on additional association with an accessory
           protein, IL1RAP. IL-1 also binds a second receptor
           designated type II (IL1R2). Mature IL1R1 consists of
           three IG-like domains, a transmembrane domain, and a
           large cytoplasmic domain. Mature IL1R2 is organized
           similarly except that it has a short cytoplasmic domain.
           The latter does not initiate signal transduction. A
           naturally occurring cytokine IL-1RA (IL-1 receptor
           antagonist) is widely expressed and binds to IL-1
           receptors, inhibiting the binding of IL-1 alpha and IL-1
           beta. This group also contains ILIR-like 1 (IL1R1L)
           which maps to the same chromosomal location as IL1R1 and
           IL1R2.
          Length = 92

 Score = 28.1 bits (63), Expect = 7.4
 Identities = 13/48 (27%), Positives = 18/48 (37%), Gaps = 4/48 (8%)

Query: 481 TPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIHQAALMDEGEIKCTAT 528
            P V WYKD   +   R++ +        L+I      D G   C  T
Sbjct: 29  LPPVQWYKDCKLLEGDRKRFVKGS----KLLIQNVTEEDAGNYTCKLT 72


>gnl|CDD|220785 pfam10498, IFT57, Intra-flagellar transport protein 57.  Eukaryotic
            cilia and flagella are specialised organelles found at
            the periphery of cells of diverse organisms.
            Intra-flagellar transport (IFT) is required for the
            assembly and maintenance of eukaryotic cilia and
            flagella, and consists of the bidirectional movement of
            large protein particles between the base and the distal
            tip of the organelle. IFT particles contain multiple
            copies of two distinct protein complexes, A and B, which
            contain at least 6 and 11 protein subunits. IFT57 is part
            of complex B but is not, however, required for the core
            subunits to stay associated. This protein is known as
            Huntington-interacting protein-1 in humans.
          Length = 355

 Score = 30.1 bits (68), Expect = 7.6
 Identities = 15/55 (27%), Positives = 25/55 (45%)

Query: 994  MLLKRSKLKPQYSSEMSEASNITDDEDEEDEEDSFDFDELFEDNPEEEYDEDDRD 1048
             L   +  K  +S +  +  N  D+E+  DE+D+    E  E+  E E  +DD  
Sbjct: 108  DLADAALKKKGFSFKRPKYPNEEDEEENVDEDDAEIILEEVEEEVEIEEVDDDEG 162


>gnl|CDD|143169 cd04968, Ig3_Contactin_like, Third Ig domain of contactin.
           Ig3_Contactin_like: Third Ig domain of contactins.
           Contactins are neural cell adhesion molecules and are
           comprised of six Ig domains followed by four fibronectin
           type III(FnIII) domains anchored to the membrane by
           glycosylphosphatidylinositol. The first four Ig domains
           form the intermolecular binding fragment, which arranges
           as a compact U-shaped module via contacts between Ig
           domains 1 and 4, and between Ig domains 2 and 3.
           Contactin-2 (TAG-1, axonin-1) may play a part in the
           neuronal processes of neurite outgrowth, axon guidance
           and fasciculation, and neuronal migration. This group
           also includes contactin-1 and contactin-5. The different
           contactins show different expression patterns in the
           central nervous system. During development and in
           adulthood, contactin-2 is transiently expressed in
           subsets of central and peripheral neurons. Contactin-5
           is expressed specifically in the rat postnatal nervous
           system, peaking at about 3 weeks postnatal, and a lack
           of contactin-5 (NB-2) results in an impairment of
           neuronal act ivity in the rat auditory system.
           Contactin-5 is highly expressed in the adult human brain
           in the occipital lobe and in the amygdala. Contactin-1
           is differentially expressed in tumor tissues and may,
           through a RhoA mechanism, facilitate invasion and
           metastasis of human lung adenocarcinoma.
          Length = 88

 Score = 27.8 bits (62), Expect = 7.7
 Identities = 22/80 (27%), Positives = 31/80 (38%), Gaps = 4/80 (5%)

Query: 461 DTTALEDEKVEFTVQVEGIPTPKVSWYKDGFEIFSSRRQRIVTDNDISTLIIHQAALMDE 520
           DT AL+ + V       G P P++ W K    + SS           + L I      DE
Sbjct: 10  DTYALKGQNVTLECFALGNPVPQIKWRKVDGSMPSSA----EISMSGAVLKIPNIQFEDE 65

Query: 521 GEIKCTATNRAGHSITKARL 540
           G  +C A N  G    + R+
Sbjct: 66  GTYECEAENIKGKDTHQGRI 85


>gnl|CDD|204467 pfam10376, Mei5, Double-strand recombination repair protein.  Mei5 is
            one of a pair of meiosis-specific proteins which
            facilitate the loading of Dmc1 on to Rad51 on DNA at
            double-strand breaks during recombination. Recombination
            is carried out by a large protein complex based around
            the two RecA homologues, Rad51 and Dmc1. This complex may
            play both a catalytic and a structural role in the
            interaction between homologous chromosomes during
            meiosis. Mei5 is seen to contain a coiled-coli region.
          Length = 212

 Score = 29.4 bits (66), Expect = 8.2
 Identities = 23/120 (19%), Positives = 47/120 (39%), Gaps = 12/120 (10%)

Query: 918  EEEEDRESLVKDRESSVKGKEEEAKVIKDDEYYENLGDV--LTKKYSLPVNSDIQIKIDK 975
            E  E  +++ +  ESS+K  + E     +++  E    +     +  L   +    KI++
Sbjct: 59   ENFELDQAVSEPPESSLKNIDSEENETSNEKLIEKWRTICQSESRSIL---NSSSPKINR 115

Query: 976  PDDEPDYVIKGKYEVDNEMLLKRSKLKPQYSSEMSEASNITDDEDEEDEEDSFDFDELFE 1035
                 D+  K       E+  ++ KL+ Q   E  +   +   E  E + D  +  EL +
Sbjct: 116  MGGYKDFKRK-------ELEAEKRKLEYQVDEESDDLRRLKLVEKYEIKNDLSELQELIK 168


>gnl|CDD|218584 pfam05422, SIN1, Stress-activated map kinase interacting protein 1
            (SIN1).  This family consists of several stress-activated
            map kinase interacting protein 1 (MAPKAP1 OR SIN1)
            sequences. The fission yeast Sty1/Spc1 mitogen-activated
            protein (MAP) kinase is a member of the eukaryotic
            stress-activated MAP kinase (SAPK) family. Sin1 interacts
            with Sty1/Spc1. Cells lacking Sin1 display many, but not
            all, of the phenotypes of cells lacking the Sty1/Spc1 MAP
            kinase including sterility, multiple stress sensitivity
            and a cell-cycle delay. Sin1 is phosphorylated after
            stress but this is not Sty1/Spc1-dependent.
          Length = 482

 Score = 30.0 bits (67), Expect = 8.4
 Identities = 27/136 (19%), Positives = 48/136 (35%), Gaps = 7/136 (5%)

Query: 871  GAPSD----NENESDYFPEKTIDESVYGYDTI--VYGYDSDDLDRHYPTLDEEEEEEDRE 924
            GA          +SDY      ++S  G D    ++ Y    + R   T  E E  +   
Sbjct: 51   GAGGQVRHSRAEDSDYATSDLSEDSDVGDDDSSDIFSYSEVPIHRRSNTAQELERLDQAV 110

Query: 925  SLVKDRESSVKGKEEEAKVIKDDEYYENLGDVLTKKYSLPVNSDIQIKIDKPDDEPDYVI 984
            +L   ++S++K K   +    D      L  + +KK  LP+ +   +   +        I
Sbjct: 111  NLTSAKQSAIKIKSSVSTDYDDLRSISELDFLFSKK-ELPLTTHNTVNKARSVSNAKAPI 169

Query: 985  KGKYEVDNEMLLKRSK 1000
             G   +    L + S 
Sbjct: 170  SGLQSLLEHKLEENSS 185


>gnl|CDD|216760 pfam01881, Cas_Cas6, CRISPR associated protein Cas6.  This group of
           families is one of several protein families that are
           always found associated with prokaryotic CRISPRs,
           themselves a family of clustered regularly interspaced
           short palindromic repeats, DNA repeats found in nearly
           half of all bacterial and archaeal genomes. These DNA
           repeat regions have a remarkably regular structure:
           unique sequences of constant size, called spacers, sit
           between each pair of repeats. It has been shown that the
           CRISPRs are virus-derived sequences acquired by the host
           to enable them to resist viral infection. The Cas
           proteins from the host use the CRISPRs to mediate an
           antiviral response. After transcription of the CRISPR, a
           complex of Cas proteins termed Cascade cleaves a CRISPR
           RNA precursor in each repeat and retains the cleavage
           products containing the virus-derived sequence. Assisted
           by the helicase Cas3, these mature CRISPR RNAs then
           serve as small guide RNAs that enable Cascade to
           interfere with virus proliferation. Cas5 contains an
           endonuclease motif, whose inactivation leads to loss of
           resistance, even in the presence of phage-derived
           spacers.
          Length = 152

 Score = 28.8 bits (65), Expect = 8.8
 Identities = 14/71 (19%), Positives = 25/71 (35%), Gaps = 19/71 (26%)

Query: 944 IKDDEYYENLGDVLTKKYSL----PVNSDIQIKIDKPD----------DEPDYVIKG--- 986
             D+E+ E L + L KKY          + + +                + +  I+G   
Sbjct: 60  PDDEEFEELLKENLIKKYEAFYGEKPEKEFKFEPLVFKKKVVKHKRIKIKKNTYIRGYLG 119

Query: 987 --KYEVDNEML 995
             + E D E+L
Sbjct: 120 KFRLEGDPELL 130


>gnl|CDD|227931 COG5644, COG5644, Uncharacterized conserved protein [Function
            unknown].
          Length = 869

 Score = 30.1 bits (67), Expect = 9.0
 Identities = 28/162 (17%), Positives = 53/162 (32%), Gaps = 31/162 (19%)

Query: 900  YGYDSDDLDRHYPTLDEEEEEEDRESLVKDRESSVKGKEEEAKVIKDDEYYENLGDVLTK 959
            Y +  +  D      DE  +EED +       ++ K  +       + ++       L  
Sbjct: 44   YSFGVNSEDDEEIDSDEAFDEEDEKRFADWSFNASKSGKS------NKDH-----KNLNN 92

Query: 960  KYSLPVN-SDIQIKIDKPDDEPDYVIKGKYEV-------------------DNEMLLKRS 999
               + +N SD  +  DK ++E       + E+                   ++    K +
Sbjct: 93   TKEISLNDSDDSVNSDKLENEGSVSSIDENELVDLDTLLDNDQPEKNESGNNDHATDKEN 152

Query: 1000 KLKPQYSSEMSEASNITDDEDEEDEEDSFDFDELFEDNPEEE 1041
             L+   SS     S  +D E E +  DS   DE  +   +  
Sbjct: 153  LLESDASSSNDSESEESDSESEIESSDSDHDDENSDSKLDNL 194


>gnl|CDD|173534 PTZ00341, PTZ00341, Ring-infected erythrocyte surface antigen;
            Provisional.
          Length = 1136

 Score = 30.1 bits (67), Expect = 9.1
 Identities = 41/220 (18%), Positives = 88/220 (40%), Gaps = 29/220 (13%)

Query: 850  DKYVDNEDDYDIVETNEHTGTGAPSDNENESDYFPEKTIDESVYGYDTIVYGYDSDDLDR 909
            +K + N+++       EH       D E   +   E+ ++E+V                 
Sbjct: 925  NKELKNQNENVPEHLKEHAEANIEEDAEENVEEDAEENVEENV----------------- 967

Query: 910  HYPTLDEEEEEEDRESLVKDRESSVKGKEEEAKVIKDDEYYENLGDVLTKKYSLPVNSDI 969
                 +E  EE   E++ ++ E +V+   EE          EN+ + + +     V  +I
Sbjct: 968  -----EENVEENVEENVEENVEENVEENVEE-------NVEENVEENIEENVEENVEENI 1015

Query: 970  QIKIDKPDDEPDYVIKGKYEVDNEMLLKRSKLKPQYSSEMSEASNITDDEDEEDEEDSFD 1029
            +  +++ D+E    ++   E  +E  ++  +   + + E +   NI + ++E  EE   +
Sbjct: 1016 EENVEEYDEENVEEVEENVEEYDEENVEEIEENAEENVEENIEENIEEYDEENVEEIEEN 1075

Query: 1030 FDELFEDNPEEEYDEDDRDQPINFARNRHNKYIEDDQEEI 1069
             +E  E+N EE  +E+  +   N   N      E+ +E  
Sbjct: 1076 IEENIEENVEENVEENVEEIEENVEENVEENAEENAEENA 1115


>gnl|CDD|217861 pfam04050, Upf2, Up-frameshift suppressor 2.  Transcripts harbouring
            premature signals for translation termination are
            recognised and rapidly degraded by eukaryotic cells
            through a pathway known as nonsense-mediated mRNA decay.
            In Saccharomyces cerevisiae, three trans-acting factors
            (Upf1 to Upf3) are required for nonsense-mediated mRNA
            decay.
          Length = 171

 Score = 28.9 bits (65), Expect = 9.1
 Identities = 11/39 (28%), Positives = 22/39 (56%)

Query: 1010 SEASNITDDEDEEDEEDSFDFDELFEDNPEEEYDEDDRD 1048
            S + + +DD +E++E    D D+   D  E +  +D++D
Sbjct: 1    SGSESESDDGEEDEELPEEDEDDESSDEEEVDLPDDEQD 39


>gnl|CDD|219912 pfam08574, DUF1762, Protein of unknown function (DUF1762).  This is a
            family of proteins of unknown function. Yeast IWR1 is
            known to interact with RNA polymerase II and deletion of
            this protein results in hypersensitivity to the K1 killer
            toxin.
          Length = 77

 Score = 27.4 bits (61), Expect = 9.2
 Identities = 9/30 (30%), Positives = 17/30 (56%)

Query: 1017 DDEDEEDEEDSFDFDELFEDNPEEEYDEDD 1046
            D++D+ D+  S D D   E+    +Y +D+
Sbjct: 48   DEDDDADQVLSDDEDSNAENYYRNDYPDDE 77


>gnl|CDD|227472 COG5143, SNC1, Synaptobrevin/VAMP-like protein [Intracellular
            trafficking and secretion].
          Length = 190

 Score = 29.3 bits (66), Expect = 9.2
 Identities = 29/148 (19%), Positives = 48/148 (32%), Gaps = 23/148 (15%)

Query: 873  PSDNENESDYFPEKTIDESVYGYDTIVYGYDSD---DLDRHYPTLDEEEEEEDRESLVKD 929
             S    ES  +    +  S      IVY   SD        Y  L+    E  + S ++ 
Sbjct: 46   ASRASIESGDYFFHYLKMS----SGIVYVPISDKEYPNKLAYGYLNSIATEFLKSSALEQ 101

Query: 930  RESSVKGKEEEAKVIKDDEYYENLGDVLTKKYSLPVNSDIQIKIDKPDDEPDYVIKGKYE 989
                  G               N+  V+ K Y  P   D   K+D+   E +   +   +
Sbjct: 102  LIDDTVG-----------IMRVNIDKVIEKGYRDPSIQD---KLDQLQQELEETKRVLNK 147

Query: 990  VDNEMLLKRSKLKP--QYSSEMSEASNI 1015
               ++L +  KL      SS +  +S +
Sbjct: 148  NIEKVLYRDEKLDLLVDLSSILLLSSKM 175


>gnl|CDD|214818 smart00784, SPT2, SPT2 chromatin protein.  This entry includes the
            Saccharomyces cerevisiae protein SPT2 which is a
            chromatin protein involved in transcriptional regulation.
          Length = 106

 Score = 28.1 bits (63), Expect = 9.4
 Identities = 20/56 (35%), Positives = 30/56 (53%), Gaps = 2/56 (3%)

Query: 1010 SEASNITDDEDEEDEEDSFDFDELFEDNPEEEYDEDDRDQPINFARNRHNKYIEDD 1065
              +    DD DEE++ED  DF E  +D+ E++YD D+     N  R R+    +DD
Sbjct: 6    ERSRRSRDDYDEEEDEDMDDFIE--DDDEEDDYDRDEIWAMFNKGRKRYAYRDDDD 59


>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
            [Intracellular trafficking and secretion].
          Length = 776

 Score = 30.0 bits (67), Expect = 9.5
 Identities = 20/100 (20%), Positives = 32/100 (32%), Gaps = 20/100 (20%)

Query: 970  QIKIDKPDDEPDYVIKGKYE------VDNEMLLKRSKLKPQYSSEMSEASNITDDEDEED 1023
            +   ++  +  DY++            D     K        + E SE      +E+ ED
Sbjct: 339  EKWGNEEAERKDYILDSSSVPLEKQFDDILYFEKMEIE--NRNPEESE-----HEEEVED 391

Query: 1024 EEDSFDF------DELFEDNPEEEYDEDDRDQPINFARNR 1057
             ED  D       D+  E N     DE +    + F   R
Sbjct: 392  YEDENDHSKRICDDDELE-NHFRAADEKNSHLVVGFRNER 430


>gnl|CDD|221288 pfam11882, DUF3402, Domain of unknown function (DUF3402).  This
            domain is functionally uncharacterized. This domain is
            found in eukaryotes. This presumed domain is typically
            between 350 to 473 amino acids in length. This domain is
            found associated with pfam07923.
          Length = 402

 Score = 29.6 bits (67), Expect = 9.6
 Identities = 13/51 (25%), Positives = 19/51 (37%), Gaps = 4/51 (7%)

Query: 1019 EDEEDEEDSFDFDELFEDNPEEEYDEDDRDQPINFARNRHNKYIEDDQEEI 1069
            + E    D  D  E  ++   E Y     DQP    +    + +  D EEI
Sbjct: 39   DTESLVGDPLDISESVKELKLEMYTSLAEDQP----KKEEIERLSTDSEEI 85


>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
            Provisional.
          Length = 2849

 Score = 30.0 bits (67), Expect = 9.8
 Identities = 27/77 (35%), Positives = 42/77 (54%), Gaps = 8/77 (10%)

Query: 982  YVIKGKYEV-DNEMLL-KRSKLKPQYSSEMSEASNIT---DDEDEEDEEDSFDFDELFED 1036
            Y I GK E+ D +M++ KR + +     +MS   N     DDEDE++++D  + DE  E+
Sbjct: 116  YPIHGKAEIGDLDMIIIKRRRARHLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEE 175

Query: 1037 NPEEEY---DEDDRDQP 1050
              EE     DED+ D+ 
Sbjct: 176  EEEEIKGFDDEDEEDEG 192


>gnl|CDD|224486 COG1570, XseA, Exonuclease VII, large subunit [DNA replication,
           recombination, and repair].
          Length = 440

 Score = 29.5 bits (67), Expect = 9.9
 Identities = 23/136 (16%), Positives = 46/136 (33%), Gaps = 6/136 (4%)

Query: 138 VSVQQRWRANLTMRKQRAHFLL---MKQKASVIQQWYRNTKLMRLE---ASYLHELKAAT 191
           V         L   ++R H  L   + QK   ++   R  +    E   +     L    
Sbjct: 260 VPDSAELLQQLDQLQRRLHRALRRLLDQKKQRLEHLARRLQFRSPERLLSEQQQRLDELA 319

Query: 192 ITIQRRYRANVAMRTQRERYVALRTATITIQTRFRAYLIAKNQRDEYAELKQARRFRFKL 251
           I ++R     +A++ QR   +  R      + + R   + +          + +R R + 
Sbjct: 320 IRLRRALENQLALKKQRLERLTQRLNPQIQRQQQRLQQLERRLDKALRRQLKRKRERLEA 379

Query: 252 NLRKYERVIELLKLKR 267
            + + E +  L  L R
Sbjct: 380 LVEQLESLSPLATLAR 395


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.316    0.134    0.393 

Gapped
Lambda     K      H
   0.267   0.0696    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 58,143,471
Number of extensions: 5976772
Number of successful extensions: 8254
Number of sequences better than 10.0: 1
Number of HSP's gapped: 7446
Number of HSP's successfully gapped: 361
Length of query: 1101
Length of database: 10,937,602
Length adjustment: 107
Effective length of query: 994
Effective length of database: 6,191,724
Effective search space: 6154573656
Effective search space used: 6154573656
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 64 (28.5 bits)