RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy7040
         (716 letters)



>gnl|CDD|238020 cd00063, FN3, Fibronectin type 3 domain; One of three types of
           internal repeats found in the plasma protein
           fibronectin. Its tenth fibronectin type III repeat
           contains an RGD cell recognition sequence in a flexible
           loop between 2 strands. Approximately 2% of all animal
           proteins contain the FN3 repeat; including extracellular
           and intracellular proteins, membrane spanning cytokine
           receptors, growth hormone receptors, tyrosine
           phosphatase receptors, and adhesion molecules. FN3-like
           domains are also found in bacterial glycosyl hydrolases.
          Length = 93

 Score = 82.9 bits (205), Expect = 2e-19
 Identities = 37/90 (41%), Positives = 49/90 (54%), Gaps = 1/90 (1%)

Query: 325 PGKPGTPEIKDFDTDFVELAWTPPEQNGGSPIVGYIIEKKEKYSPIWEKCAQTEGDTPKG 384
           P  P    + D  +  V L+WTPPE +GG PI GY++E +EK S  W++   T G     
Sbjct: 1   PSPPTNLRVTDVTSTSVTLSWTPPEDDGG-PITGYVVEYREKGSGDWKEVEVTPGSETSY 59

Query: 385 KVLDLIEGNQYEFRVLAVNKGGPGEPSDPT 414
            +  L  G +YEFRV AVN GG   PS+  
Sbjct: 60  TLTGLKPGTEYEFRVRAVNGGGESPPSESV 89



 Score = 82.2 bits (203), Expect = 5e-19
 Identities = 40/91 (43%), Positives = 54/91 (59%), Gaps = 2/91 (2%)

Query: 223 PSPPGGPLKVSNVHAEGVTLDWKVPDDDGGQPIEKYVVDKMDEATGRWTPAGETEGPVTG 282
           PSPP   L+V++V +  VTL W  P+DDGG PI  YVV+  ++ +G W     T G  T 
Sbjct: 1   PSPPTN-LRVTDVTSTSVTLSWTPPEDDGG-PITGYVVEYREKGSGDWKEVEVTPGSETS 58

Query: 283 LEVEGLIPNHKYKFRVRAVNKQGKSEPLTTT 313
             + GL P  +Y+FRVRAVN  G+S P  + 
Sbjct: 59  YTLTGLKPGTEYEFRVRAVNGGGESPPSESV 89



 Score = 76.0 bits (187), Expect = 6e-17
 Identities = 39/91 (42%), Positives = 53/91 (58%), Gaps = 5/91 (5%)

Query: 57  PEGP--LEVSNIHKDGCTLKWNKPKDDGGEPLEGYLVEKYDPETGVWIPVGKTR--EPEM 112
           P  P  L V+++     TL W  P+DDGG P+ GY+VE  +  +G W  V  T   E   
Sbjct: 1   PSPPTNLRVTDVTSTSVTLSWTPPEDDGG-PITGYVVEYREKGSGDWKEVEVTPGSETSY 59

Query: 113 DVTGLTPGHEYKFRVKALNKEGESEPLETFS 143
            +TGL PG EY+FRV+A+N  GES P E+ +
Sbjct: 60  TLTGLKPGTEYEFRVRAVNGGGESPPSESVT 90



 Score = 37.5 bits (87), Expect = 0.003
 Identities = 23/78 (29%), Positives = 34/78 (43%), Gaps = 13/78 (16%)

Query: 519 CSLKWNPPEDDGGAPIEYYMVEKMETDTGKVL-IKWIRTNDKRVTIDNVDYFTKITIRPL 577
            +L W PPEDDGG PI  Y+VE  E  +G    ++    ++   T+  +   T+   R  
Sbjct: 17  VTLSWTPPEDDGG-PITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFR-- 73

Query: 578 QRSDTAQYTVTATNSQGK 595
                    V A N  G+
Sbjct: 74  ---------VRAVNGGGE 82


>gnl|CDD|191810 pfam07679, I-set, Immunoglobulin I-set domain. 
          Length = 90

 Score = 77.7 bits (192), Expect = 2e-17
 Identities = 26/85 (30%), Positives = 44/85 (51%)

Query: 432 QLSDIKVRAGSNFEFDINVIGEPIPTKEWLCNDITIISKDRFKIVNDDKSTKLKVFDSKR 491
           +  D++V+ G +  F   V G+P PT  W  +   + S DRFK+  +  +  L + + + 
Sbjct: 6   KPKDVEVQEGESARFTCTVTGDPDPTVSWFKDGQPLRSSDRFKVTYEGGTYTLTISNVQP 65

Query: 492 GDSGIYTLAVKNSWGTDKGTAKVTV 516
            D G YT    NS G  + +A++TV
Sbjct: 66  DDEGKYTCVATNSAGEAEASAELTV 90



 Score = 47.6 bits (114), Expect = 5e-07
 Identities = 15/59 (25%), Positives = 25/59 (42%), Gaps = 5/59 (8%)

Query: 551 IKWIR-----TNDKRVTIDNVDYFTKITIRPLQRSDTAQYTVTATNSQGKDQVFIEVVV 604
           + W +      +  R  +        +TI  +Q  D  +YT  ATNS G+ +   E+ V
Sbjct: 32  VSWFKDGQPLRSSDRFKVTYEGGTYTLTISNVQPDDEGKYTCVATNSAGEAEASAELTV 90



 Score = 46.9 bits (112), Expect = 1e-06
 Identities = 14/48 (29%), Positives = 22/48 (45%)

Query: 651 NDKRVTIDNVDYFTKITIRPLQRSDTAQYTVTATNSQGKDQVFIEVVV 698
           +  R  +        +TI  +Q  D  +YT  ATNS G+ +   E+ V
Sbjct: 43  SSDRFKVTYEGGTYTLTISNVQPDDEGKYTCVATNSAGEAEASAELTV 90



 Score = 40.3 bits (95), Expect = 2e-04
 Identities = 13/43 (30%), Positives = 17/43 (39%)

Query: 177 KVQHLDYNTKLGVRMAQRADAGFYTVTAENINGKDSVEVEVIV 219
           KV +      L +   Q  D G YT  A N  G+     E+ V
Sbjct: 48  KVTYEGGTYTLTISNVQPDDEGKYTCVATNSAGEAEASAELTV 90


>gnl|CDD|214495 smart00060, FN3, Fibronectin type 3 domain.  One of three types of
           internal repeat within the plasma protein, fibronectin.
           The tenth fibronectin type III repeat contains a RGD
           cell recognition sequence in a flexible loop between 2
           strands. Type III modules are present in both
           extracellular and intracellular proteins.
          Length = 83

 Score = 64.6 bits (157), Expect = 5e-13
 Identities = 30/85 (35%), Positives = 40/85 (47%), Gaps = 3/85 (3%)

Query: 325 PGKPGTPEIKDFDTDFVELAWTPPEQ-NGGSPIVGYIIEKKEKYSPIWEKCAQTEGDTPK 383
           P  P    + D  +  V L+W PP        IVGY +E +E+ S  W++   T   T  
Sbjct: 1   PSPPSNLRVTDVTSTSVTLSWEPPPDDGITGYIVGYRVEYREEGSE-WKEVNVTPSST-S 58

Query: 384 GKVLDLIEGNQYEFRVLAVNKGGPG 408
             +  L  G +YEFRV AVN  G G
Sbjct: 59  YTLTGLKPGTEYEFRVRAVNGAGEG 83



 Score = 64.2 bits (156), Expect = 8e-13
 Identities = 31/85 (36%), Positives = 40/85 (47%), Gaps = 7/85 (8%)

Query: 56  DPEGPLEVSNIHKDGCTLKWNKPKDDGGEPLEGYLVE---KYDPETGVWIPV-GKTREPE 111
            P   L V+++     TL W  P DDG     GY+V    +Y  E   W  V        
Sbjct: 2   SPPSNLRVTDVTSTSVTLSWEPPPDDGIT---GYIVGYRVEYREEGSEWKEVNVTPSSTS 58

Query: 112 MDVTGLTPGHEYKFRVKALNKEGES 136
             +TGL PG EY+FRV+A+N  GE 
Sbjct: 59  YTLTGLKPGTEYEFRVRAVNGAGEG 83



 Score = 63.4 bits (154), Expect = 1e-12
 Identities = 33/86 (38%), Positives = 43/86 (50%), Gaps = 4/86 (4%)

Query: 223 PSPPGGPLKVSNVHAEGVTLDWKVPDDDGGQ-PIEKYVVDKMDEATGRWTPAGETEGPVT 281
           PSPP   L+V++V +  VTL W+ P DDG    I  Y V +  E    W          T
Sbjct: 1   PSPPSN-LRVTDVTSTSVTLSWEPPPDDGITGYIVGYRV-EYREEGSEWKEV-NVTPSST 57

Query: 282 GLEVEGLIPNHKYKFRVRAVNKQGKS 307
              + GL P  +Y+FRVRAVN  G+ 
Sbjct: 58  SYTLTGLKPGTEYEFRVRAVNGAGEG 83



 Score = 31.8 bits (72), Expect = 0.17
 Identities = 21/77 (27%), Positives = 29/77 (37%), Gaps = 13/77 (16%)

Query: 519 CSLKWNPPEDDGG-APIEYYMVEKMETDTGKVLIKWIRTNDKRVTIDNVDYFTKITIRPL 577
            +L W PP DDG    I  Y VE  E  +     +W   N    +       T  T+  L
Sbjct: 17  VTLSWEPPPDDGITGYIVGYRVEYREEGS-----EWKEVNVTPSS-------TSYTLTGL 64

Query: 578 QRSDTAQYTVTATNSQG 594
           +     ++ V A N  G
Sbjct: 65  KPGTEYEFRVRAVNGAG 81


>gnl|CDD|143225 cd05748, Ig_Titin_like, Immunoglobulin (Ig)-like domain of titin
           and similar proteins.  Ig_Titin_like: immunoglobulin
           (Ig)-like domain found in titin-like proteins. Titin
           (also called connectin) is a fibrous sarcomeric protein
           specifically found in vertebrate striated muscle. Titin
           is gigantic, depending on isoform composition it ranges
           from 2970 to 3700 kDa, and is of a length that spans
           half a sarcomere. Titin largely consists of multiple
           repeats of Ig-like and fibronectin type 3 (FN-III)-like
           domains. Titin connects the ends of myosin thick
           filaments to Z disks and extends along the thick
           filament to the H zone.  It appears to function
           similarly to an elastic band, keeping the myosin
           filaments centered in the sarcomere during muscle
           contraction or stretching. Within the sarcomere, titin
           is also attached to or is associated with myosin binding
           protein C (MyBP-C). MyBP-C appears to contribute to the
           generation of passive tension by titin, and similar to
           titin has repeated Ig-like and FN-III domains. Also
           included in this group are worm twitchin and insect
           projectin, thick filament proteins of invertebrate
           muscle, which also have repeated Ig-like and FN-III
           domains.
          Length = 74

 Score = 63.4 bits (155), Expect = 1e-12
 Identities = 23/74 (31%), Positives = 35/74 (47%)

Query: 443 NFEFDINVIGEPIPTKEWLCNDITIISKDRFKIVNDDKSTKLKVFDSKRGDSGIYTLAVK 502
           +   ++ + G P PT  W  +   +    R +I     ST L + +++R DSG YTL +K
Sbjct: 1   SVRLEVPISGRPTPTVTWSKDGKPLKLSGRVQIETTASSTSLVIKNAERSDSGKYTLTLK 60

Query: 503 NSWGTDKGTAKVTV 516
           N  G    T  V V
Sbjct: 61  NPAGEKSATINVKV 74



 Score = 51.8 bits (125), Expect = 1e-08
 Identities = 16/49 (32%), Positives = 24/49 (48%)

Query: 556 TNDKRVTIDNVDYFTKITIRPLQRSDTAQYTVTATNSQGKDQVFIEVVV 604
               RV I+     T + I+  +RSD+ +YT+T  N  G+    I V V
Sbjct: 26  KLSGRVQIETTASSTSLVIKNAERSDSGKYTLTLKNPAGEKSATINVKV 74



 Score = 51.4 bits (124), Expect = 2e-08
 Identities = 17/50 (34%), Positives = 25/50 (50%)

Query: 649 LMNDKRVTIDNVDYFTKITIRPLQRSDTAQYTVTATNSQGKDQVFIEVVV 698
           L    RV I+     T + I+  +RSD+ +YT+T  N  G+    I V V
Sbjct: 25  LKLSGRVQIETTASSTSLVIKNAERSDSGKYTLTLKNPAGEKSATINVKV 74



 Score = 45.6 bits (109), Expect = 2e-06
 Identities = 14/38 (36%), Positives = 24/38 (63%)

Query: 182 DYNTKLGVRMAQRADAGFYTVTAENINGKDSVEVEVIV 219
             +T L ++ A+R+D+G YT+T +N  G+ S  + V V
Sbjct: 37  ASSTSLVIKNAERSDSGKYTLTLKNPAGEKSATINVKV 74


>gnl|CDD|214653 smart00410, IG_like, Immunoglobulin like.  IG domains that cannot
           be classified into one of IGv1, IGc1, IGc2, IG.
          Length = 85

 Score = 62.5 bits (152), Expect = 3e-12
 Identities = 20/84 (23%), Positives = 31/84 (36%), Gaps = 1/84 (1%)

Query: 434 SDIKVRAGSNFEFDINVIGEPIPTKEWLCNDIT-IISKDRFKIVNDDKSTKLKVFDSKRG 492
             + V+ G +        G P P   W       +    RF +     ++ L + +    
Sbjct: 2   PSVTVKEGESVTLSCEASGSPPPEVTWYKQGGKLLAESGRFSVSRSGSTSTLTISNVTPE 61

Query: 493 DSGIYTLAVKNSWGTDKGTAKVTV 516
           DSG YT A  NS G+      +TV
Sbjct: 62  DSGTYTCAATNSSGSASSGTTLTV 85



 Score = 42.5 bits (100), Expect = 3e-05
 Identities = 14/60 (23%), Positives = 23/60 (38%), Gaps = 6/60 (10%)

Query: 551 IKWIRTNDK------RVTIDNVDYFTKITIRPLQRSDTAQYTVTATNSQGKDQVFIEVVV 604
           + W +   K      R ++      + +TI  +   D+  YT  ATNS G       + V
Sbjct: 26  VTWYKQGGKLLAESGRFSVSRSGSTSTLTISNVTPEDSGTYTCAATNSSGSASSGTTLTV 85



 Score = 41.7 bits (98), Expect = 6e-05
 Identities = 13/52 (25%), Positives = 21/52 (40%)

Query: 647 QYLMNDKRVTIDNVDYFTKITIRPLQRSDTAQYTVTATNSQGKDQVFIEVVV 698
           + L    R ++      + +TI  +   D+  YT  ATNS G       + V
Sbjct: 34  KLLAESGRFSVSRSGSTSTLTISNVTPEDSGTYTCAATNSSGSASSGTTLTV 85



 Score = 30.9 bits (70), Expect = 0.47
 Identities = 9/28 (32%), Positives = 12/28 (42%)

Query: 192 AQRADAGFYTVTAENINGKDSVEVEVIV 219
               D+G YT  A N +G  S    + V
Sbjct: 58  VTPEDSGTYTCAATNSSGSASSGTTLTV 85


>gnl|CDD|214652 smart00409, IG, Immunoglobulin. 
          Length = 85

 Score = 62.5 bits (152), Expect = 3e-12
 Identities = 20/84 (23%), Positives = 31/84 (36%), Gaps = 1/84 (1%)

Query: 434 SDIKVRAGSNFEFDINVIGEPIPTKEWLCNDIT-IISKDRFKIVNDDKSTKLKVFDSKRG 492
             + V+ G +        G P P   W       +    RF +     ++ L + +    
Sbjct: 2   PSVTVKEGESVTLSCEASGSPPPEVTWYKQGGKLLAESGRFSVSRSGSTSTLTISNVTPE 61

Query: 493 DSGIYTLAVKNSWGTDKGTAKVTV 516
           DSG YT A  NS G+      +TV
Sbjct: 62  DSGTYTCAATNSSGSASSGTTLTV 85



 Score = 42.5 bits (100), Expect = 3e-05
 Identities = 14/60 (23%), Positives = 23/60 (38%), Gaps = 6/60 (10%)

Query: 551 IKWIRTNDK------RVTIDNVDYFTKITIRPLQRSDTAQYTVTATNSQGKDQVFIEVVV 604
           + W +   K      R ++      + +TI  +   D+  YT  ATNS G       + V
Sbjct: 26  VTWYKQGGKLLAESGRFSVSRSGSTSTLTISNVTPEDSGTYTCAATNSSGSASSGTTLTV 85



 Score = 41.7 bits (98), Expect = 6e-05
 Identities = 13/52 (25%), Positives = 21/52 (40%)

Query: 647 QYLMNDKRVTIDNVDYFTKITIRPLQRSDTAQYTVTATNSQGKDQVFIEVVV 698
           + L    R ++      + +TI  +   D+  YT  ATNS G       + V
Sbjct: 34  KLLAESGRFSVSRSGSTSTLTISNVTPEDSGTYTCAATNSSGSASSGTTLTV 85



 Score = 30.9 bits (70), Expect = 0.47
 Identities = 9/28 (32%), Positives = 12/28 (42%)

Query: 192 AQRADAGFYTVTAENINGKDSVEVEVIV 219
               D+G YT  A N +G  S    + V
Sbjct: 58  VTPEDSGTYTCAATNSSGSASSGTTLTV 85


>gnl|CDD|200951 pfam00041, fn3, Fibronectin type III domain. 
          Length = 84

 Score = 60.1 bits (146), Expect = 2e-11
 Identities = 29/86 (33%), Positives = 38/86 (44%), Gaps = 2/86 (2%)

Query: 326 GKPGTPEIKDFDTDFVELAWTPPEQNGGSPIVGYIIEKKEKYSPIWEKCAQTEGDTPKGK 385
             P    + D  +  + L+W+PP  NG  PI GY +E +        K     G T    
Sbjct: 1   SAPTNLTVTDVTSTSLTLSWSPPPGNG--PITGYEVEYRPVNGGEEWKEITVPGTTTSYT 58

Query: 386 VLDLIEGNQYEFRVLAVNKGGPGEPS 411
           +  L  G +YE RV AVN  G G PS
Sbjct: 59  LTGLKPGTEYEVRVQAVNGAGEGPPS 84



 Score = 57.8 bits (140), Expect = 1e-10
 Identities = 28/86 (32%), Positives = 39/86 (45%), Gaps = 3/86 (3%)

Query: 224 SPPGGPLKVSNVHAEGVTLDWKVPDDDGGQPIEKYVVDKMDEATGRWTPAGETEGPVTGL 283
           S P   L V++V +  +TL W  P  +G  PI  Y V+      G         G  T  
Sbjct: 1   SAPTN-LTVTDVTSTSLTLSWSPPPGNG--PITGYEVEYRPVNGGEEWKEITVPGTTTSY 57

Query: 284 EVEGLIPNHKYKFRVRAVNKQGKSEP 309
            + GL P  +Y+ RV+AVN  G+  P
Sbjct: 58  TLTGLKPGTEYEVRVQAVNGAGEGPP 83



 Score = 54.0 bits (130), Expect = 3e-09
 Identities = 30/86 (34%), Positives = 40/86 (46%), Gaps = 6/86 (6%)

Query: 56  DPEGPLEVSNIHKDGCTLKWNKPKDDGGEPLEGYLVEKYDPETG---VWIPVGKTREPEM 112
                L V+++     TL W+ P  +G  P+ GY VE      G     I V  T     
Sbjct: 1   SAPTNLTVTDVTSTSLTLSWSPPPGNG--PITGYEVEYRPVNGGEEWKEITVPGT-TTSY 57

Query: 113 DVTGLTPGHEYKFRVKALNKEGESEP 138
            +TGL PG EY+ RV+A+N  GE  P
Sbjct: 58  TLTGLKPGTEYEVRVQAVNGAGEGPP 83


>gnl|CDD|143239 cd05762, Ig8_MLCK, Eighth immunoglobulin (Ig)-like domain of human
           myosin light-chain kinase (MLCK).  Ig8_MLCK: the eighth
           immunoglobulin (Ig)-like domain of human myosin
           light-chain kinase (MLCK). MLCK is a key regulator of
           different forms of cell motility involving actin and
           myosin II.  Agonist stimulation of smooth muscle cells
           increases cytosolic Ca2+, which binds calmodulin.  This
           Ca2+-calmodulin complex in turn binds to and activates
           MLCK. Activated MLCK leads to the phosphorylation of the
           20 kDa myosin regulatory light chain (RLC) of myosin II
           and the stimulation of actin-activated myosin MgATPase
           activity. MLCK is widely present in vertebrate tissues;
           it phosphorylates the 20 kDa RLC of both smooth and
           nonmuscle myosin II. Phosphorylation leads to the
           activation of the myosin motor domain and altered
           structural properties of myosin II. In smooth muscle
           MLCK it is involved in initiating contraction. In
           nonmuscle cells, MLCK may participate in cell division
           and cell motility; it has been suggested MLCK plays a
           role in cardiomyocyte differentiation and contraction
           through regulation of nonmuscle myosin II.
          Length = 98

 Score = 45.7 bits (108), Expect = 4e-06
 Identities = 30/92 (32%), Positives = 46/92 (50%), Gaps = 3/92 (3%)

Query: 435 DIKVRAGSNFEFDINVIGEPIPTKEWLCNDITIISKDRFKIVNDDKSTKLKVFDSKRGDS 494
           D+KVRAG + E    V G    T  W+     I   +  KI N + S+KL + + ++   
Sbjct: 9   DMKVRAGESVELFCKVTGTQPITCTWMKFRKQIQEGEGIKIENTENSSKLTITEGQQEHC 68

Query: 495 GIYTLAVKNSWGTDKGTAKVTVLGCSLKWNPP 526
           G YTL V+N  G+ +    +TV+    K +PP
Sbjct: 69  GCYTLEVENKLGSRQAQVNLTVVD---KPDPP 97



 Score = 34.5 bits (79), Expect = 0.025
 Identities = 17/50 (34%), Positives = 29/50 (58%)

Query: 177 KVQHLDYNTKLGVRMAQRADAGFYTVTAENINGKDSVEVEVIVLDKPSPP 226
           K+++ + ++KL +   Q+   G YT+  EN  G    +V + V+DKP PP
Sbjct: 48  KIENTENSSKLTITEGQQEHCGCYTLEVENKLGSRQAQVNLTVVDKPDPP 97



 Score = 31.9 bits (72), Expect = 0.22
 Identities = 16/51 (31%), Positives = 25/51 (49%)

Query: 561 VTIDNVDYFTKITIRPLQRSDTAQYTVTATNSQGKDQVFIEVVVTDKPSAP 611
           + I+N +  +K+TI   Q+     YT+   N  G  Q  + + V DKP  P
Sbjct: 47  IKIENTENSSKLTITEGQQEHCGCYTLEVENKLGSRQAQVNLTVVDKPDPP 97



 Score = 31.9 bits (72), Expect = 0.22
 Identities = 16/51 (31%), Positives = 25/51 (49%)

Query: 655 VTIDNVDYFTKITIRPLQRSDTAQYTVTATNSQGKDQVFIEVVVTDKPSAP 705
           + I+N +  +K+TI   Q+     YT+   N  G  Q  + + V DKP  P
Sbjct: 47  IKIENTENSSKLTITEGQQEHCGCYTLEVENKLGSRQAQVNLTVVDKPDPP 97


>gnl|CDD|143302 cd05894, Ig_C5_MyBP-C, C5 immunoglobulin (Ig) domain of cardiac
           myosin binding protein C (MyBP-C).  Ig_C5_MyBP_C : the
           C5 immunoglobulin (Ig) domain of cardiac myosin binding
           protein C (MyBP-C). MyBP_C consists of repeated domains,
           Ig and fibronectin type 3, and various linkers. Three
           isoforms of MYBP_C exist and are included in this group:
           cardiac(c), and fast and slow skeletal muscle (s)
           MyBP_C. cMYBP_C has insertions between and inside
           domains and an additional cardiac-specific Ig domain at
           the N-terminus. For cMYBP_C  an interaction has been
           demonstrated between this C5 domain and the Ig C8
           domain.
          Length = 86

 Score = 44.8 bits (106), Expect = 5e-06
 Identities = 24/82 (29%), Positives = 39/82 (47%), Gaps = 1/82 (1%)

Query: 436 IKVRAGSNFEFDINVIGEPIPTKEWLCNDITII-SKDRFKIVNDDKSTKLKVFDSKRGDS 494
           I V AG+    D+ + GEP PT  W   D     ++ R ++ +    +   +  ++R D 
Sbjct: 5   IVVVAGNKLRLDVPISGEPAPTVTWSRGDKAFTETEGRVRVESYKDLSSFVIEGAEREDE 64

Query: 495 GIYTLAVKNSWGTDKGTAKVTV 516
           G+YT+ V N  G D  +  V V
Sbjct: 65  GVYTITVTNPVGEDHASLFVKV 86



 Score = 32.5 bits (74), Expect = 0.13
 Identities = 15/54 (27%), Positives = 24/54 (44%)

Query: 551 IKWIRTNDKRVTIDNVDYFTKITIRPLQRSDTAQYTVTATNSQGKDQVFIEVVV 604
            K     + RV +++    +   I   +R D   YT+T TN  G+D   + V V
Sbjct: 33  DKAFTETEGRVRVESYKDLSSFVIEGAEREDEGVYTITVTNPVGEDHASLFVKV 86



 Score = 31.4 bits (71), Expect = 0.28
 Identities = 14/45 (31%), Positives = 22/45 (48%)

Query: 654 RVTIDNVDYFTKITIRPLQRSDTAQYTVTATNSQGKDQVFIEVVV 698
           RV +++    +   I   +R D   YT+T TN  G+D   + V V
Sbjct: 42  RVRVESYKDLSSFVIEGAEREDEGVYTITVTNPVGEDHASLFVKV 86



 Score = 28.3 bits (63), Expect = 4.0
 Identities = 16/49 (32%), Positives = 24/49 (48%), Gaps = 1/49 (2%)

Query: 171 GSGMVRKVQHLDYNTKLGVRMAQRADAGFYTVTAENINGKDSVEVEVIV 219
             G VR   + D ++   +  A+R D G YT+T  N  G+D   + V V
Sbjct: 39  TEGRVRVESYKDLSS-FVIEGAEREDEGVYTITVTNPVGEDHASLFVKV 86


>gnl|CDD|227076 COG4733, COG4733, Phage-related protein, tail component [Function
           unknown].
          Length = 952

 Score = 46.3 bits (110), Expect = 5e-05
 Identities = 45/233 (19%), Positives = 62/233 (26%), Gaps = 37/233 (15%)

Query: 225 PPG-GPLKVSNVHAEGVTLD---WKVPDDDGGQPIEKYVVDKMDEATGRWTPAGETEGPV 280
           PPG      +      + L           GG+              G W  A  T    
Sbjct: 603 PPGVQIPTTNVSIDSFLNLVQGLATTLLKVGGEAFL-AAWAYEAGWDGNWITAPRT--SA 659

Query: 281 TGLEVEGLIPNHKYKFRVRAVNKQGKSEPLTTTASIEAKNPFNQPGKPGTPEIKDFDTDF 340
            G +VEG IP  +Y  RVRA+N    + P  T            P K           D 
Sbjct: 660 AGFDVEG-IPAGQYAIRVRAINVFEPNSPDATAYEFALNGKKVPPPKAMIY-------DA 711

Query: 341 VELAWTPPEQNGGSPIVGYIIEKKEKYS---PIWEKCAQTEGDTPKGKV--LDLIEGNQY 395
           V +         G P     I   E  S         A++ G+     +  + +  G  +
Sbjct: 712 VIITLV-IRLVVGDPTGAVDITSTEIRSAVIADGNFQARSLGNLNYPGLFSVGIQAGLTF 770

Query: 396 EFRVLAVNKGGPGEPSDPTAPHIARAKKVSPYINRDQLSDIKVRAGSNFEFDI 448
            FR   V+  G                    +    Q S    R       DI
Sbjct: 771 WFRNRNVDLVG----------------NNDKWEVYGQSSRDASRILELIGDDI 807



 Score = 32.8 bits (75), Expect = 0.88
 Identities = 17/65 (26%), Positives = 25/65 (38%), Gaps = 2/65 (3%)

Query: 74  KWNKPKDDGGEPLEGYLVEKYDPETGVWIPVGKTREPEMDVTGLTPGHEYKFRVKALNKE 133
                   GGE        +   + G WI   +T     DV G+  G +Y  RV+A+N  
Sbjct: 625 LATTLLKVGGEAFLAAWAYEAGWD-GNWITAPRTSAAGFDVEGIPAG-QYAIRVRAINVF 682

Query: 134 GESEP 138
             + P
Sbjct: 683 EPNSP 687


>gnl|CDD|212460 cd05723, Ig4_Neogenin, Fourth immunoglobulin (Ig)-like domain in
           neogenin and similar proteins.  Ig4_Neogenin: fourth
           immunoglobulin (Ig)-like domain in neogenin and related
           proteins. Neogenin  is a cell surface protein which is
           expressed in the developing nervous system of vertebrate
           embryos in the growing nerve cells. It is also expressed
           in other embryonic tissues, and may play a general role
           in developmental processes such as cell migration,
           cell-cell recognition, and tissue growth regulation.
           Included in this group is the tumor suppressor protein
           DCC, which is deleted in colorectal carcinoma . DCC and
           neogenin each have four Ig-like domains followed by six
           fibronectin type III domains, a transmembrane domain,
           and an intracellular domain.
          Length = 71

 Score = 41.1 bits (96), Expect = 7e-05
 Identities = 22/71 (30%), Positives = 35/71 (49%), Gaps = 3/71 (4%)

Query: 446 FDINVIGEPIPTKEWLCNDITIISKDRFKIVNDDKSTKLKVFDSKRGDSGIYTLAVKNSW 505
           F+  V G+P PT +W+ N   +I  D FKIV +     L+V    + D G Y    +N  
Sbjct: 4   FECEVTGKPTPTVKWVKNGDMVIPSDYFKIVKEH---NLQVLGLVKSDEGFYQCIAENDV 60

Query: 506 GTDKGTAKVTV 516
           G  +  A++ +
Sbjct: 61  GNVQAGAQLII 71


>gnl|CDD|143202 cd05725, Ig3_Robo, Third immunoglobulin (Ig)-like domain in Robo
           (roundabout) receptors.  Ig3_Robo: domain similar to the
           third immunoglobulin (Ig)-like domain in Robo
           (roundabout) receptors. Robo receptors play a role in
           the development of the central nervous system (CNS), and
           are receptors of Slit protein. Slit is a repellant
           secreted by the neural cells in the midline. Slit acts
           through Robo to prevent most neurons from crossing the
           midline from either side. Three mammalian Robo homologs
           (robo1, -2, and -3), and three mammalian Slit homologs
           (Slit-1,-2, -3), have been identified. Commissural
           axons, which cross the midline, express low levels of
           Robo; longitudinal axons, which avoid the midline,
           express high levels of Robo. robo1, -2, and -3 are
           expressed by commissural neurons in the vertebrate
           spinal cord and Slits 1, -2, -3 are expressed at the
           ventral midline. Robo-3 is a divergent member of the
           Robo family which instead of being a positive regulator
           of slit responsiveness, antagonizes slit responsiveness
           in precrossing axons.  The Slit-Robo interaction is
           mediated by the second leucine-rich repeat (LRR) domain
           of Slit and the two N-terminal Ig domains of Robo, Ig1
           and Ig2. The primary Robo binding site for Slit2 has
           been shown by surface plasmon resonance experiments and
           mutational analysis to be is the Ig1 domain, while the
           Ig2 domain has been proposed to harbor a weak secondary
           binding site.
          Length = 69

 Score = 40.9 bits (96), Expect = 8e-05
 Identities = 28/72 (38%), Positives = 39/72 (54%), Gaps = 4/72 (5%)

Query: 445 EFDINVIGEPIPTKEWLCNDITIISKDRFKIVNDDKSTKLKVFDSKRGDSGIYTLAVKNS 504
           EF   V G+P+PT  W   D  +  K R +I+ DDKS  LK+ +   GD G YT   +N 
Sbjct: 2   EFQCEVGGDPVPTVLWRKEDGEL-PKGRAEIL-DDKS--LKIRNVTAGDEGSYTCEAENM 57

Query: 505 WGTDKGTAKVTV 516
            G  + +A +TV
Sbjct: 58  VGKIEASASLTV 69


>gnl|CDD|143224 cd05747, Ig5_Titin_like, M5, fifth immunoglobulin (Ig)-like domain
           of human titin C terminus and similar proteins.
           Ig5_Titin_like: domain similar to the M5, fifth
           immunoglobulin (Ig)-like domain from the human titin C
           terminus. Titin (also called connectin) is a fibrous
           sarcomeric protein specifically found in vertebrate
           striated muscle. Titin is gigantic; depending on isoform
           composition it ranges from 2970 to 3700 kDa, and is of a
           length that spans half a sarcomere. Titin largely
           consists of multiple repeats of Ig-like and fibronectin
           type 3 (FN-III)-like domains. Titin connects the ends of
           myosin thick filaments to Z disks and extends along the
           thick filament to the H zone, and appears to function
           similar to an elastic band, keeping the myosin filaments
           centered in the sarcomere during muscle contraction or
           stretching.
          Length = 92

 Score = 40.8 bits (95), Expect = 2e-04
 Identities = 22/71 (30%), Positives = 36/71 (50%)

Query: 436 IKVRAGSNFEFDINVIGEPIPTKEWLCNDITIISKDRFKIVNDDKSTKLKVFDSKRGDSG 495
           + V  G +  F  +V GEP PT  W+     I+S  R +I + +  +  ++   +  D G
Sbjct: 13  LTVSEGESARFSCDVDGEPAPTVTWMREGQIIVSSQRHQITSTEYKSTFEISKVQMSDEG 72

Query: 496 IYTLAVKNSWG 506
            YT+ V+NS G
Sbjct: 73  NYTVVVENSEG 83



 Score = 31.2 bits (70), Expect = 0.45
 Identities = 16/47 (34%), Positives = 26/47 (55%)

Query: 645 ESQYLMNDKRVTIDNVDYFTKITIRPLQRSDTAQYTVTATNSQGKDQ 691
           E Q +++ +R  I + +Y +   I  +Q SD   YTV   NS+GK +
Sbjct: 40  EGQIIVSSQRHQITSTEYKSTFEISKVQMSDEGNYTVVVENSEGKQE 86



 Score = 29.2 bits (65), Expect = 1.8
 Identities = 16/52 (30%), Positives = 26/52 (50%), Gaps = 5/52 (9%)

Query: 551 IKWIR-----TNDKRVTIDNVDYFTKITIRPLQRSDTAQYTVTATNSQGKDQ 597
           + W+R      + +R  I + +Y +   I  +Q SD   YTV   NS+GK +
Sbjct: 35  VTWMREGQIIVSSQRHQITSTEYKSTFEISKVQMSDEGNYTVVVENSEGKQE 86



 Score = 29.2 bits (65), Expect = 1.8
 Identities = 14/50 (28%), Positives = 21/50 (42%)

Query: 165 RSNQPCGSGMVRKVQHLDYNTKLGVRMAQRADAGFYTVTAENINGKDSVE 214
           R  Q   S    ++   +Y +   +   Q +D G YTV  EN  GK   +
Sbjct: 39  REGQIIVSSQRHQITSTEYKSTFEISKVQMSDEGNYTVVVENSEGKQEAQ 88


>gnl|CDD|143165 cd00096, Ig, Immunoglobulin domain.  Ig: immunoglobulin (Ig) domain
           found in the Ig superfamily. The Ig superfamily is a
           heterogenous group of proteins, built on a common fold
           comprised of a sandwich of two beta sheets. Members of
           this group are components of immunoglobulin, neuroglia,
           cell surface glycoproteins, such as, T-cell receptors,
           CD2, CD4, CD8, and membrane glycoproteins, such as,
           butyrophilin and chondroitin sulfate proteoglycan core
           protein. A predominant feature of most Ig domains is a
           disulfide bridge connecting the two beta-sheets with a
           tryptophan residue packed against the disulfide bond.
          Length = 74

 Score = 38.2 bits (88), Expect = 0.001
 Identities = 18/68 (26%), Positives = 27/68 (39%), Gaps = 4/68 (5%)

Query: 450 VIGEPIPTKEWLCNDITIISKDRFKIVNDDKSTK----LKVFDSKRGDSGIYTLAVKNSW 505
             G P PT  WL N   + S    ++ +   ++     L + +    DSG YT    NS 
Sbjct: 7   ASGPPPPTITWLKNGKPLPSSVLTRVRSSRGTSSGSSTLTISNVTLEDSGTYTCVASNSA 66

Query: 506 GTDKGTAK 513
           GT   +  
Sbjct: 67  GTVSASVT 74



 Score = 32.5 bits (73), Expect = 0.090
 Identities = 12/60 (20%), Positives = 21/60 (35%), Gaps = 9/60 (15%)

Query: 551 IKWIR---------TNDKRVTIDNVDYFTKITIRPLQRSDTAQYTVTATNSQGKDQVFIE 601
           I W++             R +       + +TI  +   D+  YT  A+NS G     + 
Sbjct: 15  ITWLKNGKPLPSSVLTRVRSSRGTSSGSSTLTISNVTLEDSGTYTCVASNSAGTVSASVT 74



 Score = 31.7 bits (71), Expect = 0.17
 Identities = 10/43 (23%), Positives = 17/43 (39%)

Query: 653 KRVTIDNVDYFTKITIRPLQRSDTAQYTVTATNSQGKDQVFIE 695
            R +       + +TI  +   D+  YT  A+NS G     + 
Sbjct: 32  VRSSRGTSSGSSTLTISNVTLEDSGTYTCVASNSAGTVSASVT 74


>gnl|CDD|143205 cd05728, Ig4_Contactin-2-like, Fourth Ig domain of the neural cell
           adhesion molecule contactin-2 and similar proteins.
           Ig4_Contactin-2-like: fourth Ig domain of the neural
           cell adhesion molecule contactin-2. Contactins are
           comprised of six Ig domains followed by four fibronectin
           type III (FnIII) domains anchored to the membrane by
           glycosylphosphatidylinositol. Contactin-2 (aliases
           TAG-1, axonin-1) facilitates cell adhesion by homophilic
           binding between molecules in apposed membranes. The
           first four Ig domains form the intermolecular binding
           fragment which arranges as a compact U-shaped module by
           contacts between Ig domains 1 and 4, and domains 2 and
           3. It has been proposed that a linear zipper-like array
           forms, from contactin-2 molecules alternatively provided
           by the two apposed membranes.
          Length = 85

 Score = 37.6 bits (87), Expect = 0.002
 Identities = 25/87 (28%), Positives = 41/87 (47%), Gaps = 6/87 (6%)

Query: 431 DQLSDIKVRAGSNFEFDINVIGEPIPTKEWLCNDITIISKDRFKIVNDDKS-TKLKVFDS 489
             +SD +   GS+  ++    G P P   WL N   + S++R ++   D   TKL +   
Sbjct: 4   KVISDTEADIGSSLRWECKASGNPRPAYRWLKNGQPLASENRIEVEAGDLRITKLSL--- 60

Query: 490 KRGDSGIYTLAVKNSWGTDKGTAKVTV 516
              DSG+Y    +N  GT   +A++ V
Sbjct: 61  --SDSGMYQCVAENKHGTIYASAELAV 85


>gnl|CDD|143214 cd05737, Ig_Myomesin_like_C, C-temrinal immunoglobulin (Ig)-like
           domain of myomesin and M-protein.  Ig_Myomesin_like_C:
           domain similar to the C-temrinal immunoglobulin
           (Ig)-like domain of myomesin and M-protein. Myomesin and
           M-protein are both structural proteins localized to the
           M-band, a transverse structure in the center of the
           sarcomere, and are candidates for M-band bridges. Both
           proteins are modular, consisting mainly of repetitive
           Ig-like and fibronectin type III (FnIII) domains.
           Myomesin is expressed in all types of vertebrate
           striated muscle; M-protein has a muscle-type specific
           expression pattern. Myomesin is present in both slow and
           fast fibers; M-protein is present only in fast fibers.
           It has been suggested that myomesin acts as a molecular
           spring with alternative splicing as a means of modifying
           its elasticity.
          Length = 92

 Score = 37.9 bits (88), Expect = 0.002
 Identities = 22/69 (31%), Positives = 31/69 (44%), Gaps = 1/69 (1%)

Query: 449 NVIGEPIPTKEWLCNDITIISKDRFKI-VNDDKSTKLKVFDSKRGDSGIYTLAVKNSWGT 507
            V G+P P   WL ND  +   D + + V   K   L +      DSG Y + VKN +G 
Sbjct: 24  TVFGDPDPEVSWLKNDQALALSDHYNVKVEQGKYASLTIKGVSSEDSGKYGIVVKNKYGG 83

Query: 508 DKGTAKVTV 516
           +     V+V
Sbjct: 84  ETVDVTVSV 92


>gnl|CDD|143317 cd07693, Ig1_Robo, First immunoglobulin (Ig)-like domain in Robo
           (roundabout) receptors and similar proteins.  Ig1_Robo:
           domain similar to the first immunoglobulin (Ig)-like
           domain in Robo (roundabout) receptors. Robo receptors
           play a role in the development of the central nervous
           system (CNS), and are receptors of Slit protein. Slit is
           a repellant secreted by the neural cells in the midline.
           Slit acts through Robo to prevent most neurons from
           crossing the midline from either side. Three mammalian
           Robo homologs (robo1, -2, and -3), and three mammalian
           Slit homologs (Slit-1,-2, -3), have been identified.
           Commissural axons, which cross the midline, express low
           levels of Robo; longitudinal axons, which avoid the
           midline, express high levels of Robo. robo1, -2, and -3
           are expressed by commissural neurons in the vertebrate
           spinal cord and Slits 1, -2, -3 are expressed at the
           ventral midline. Robo-3 is a divergent member of the
           Robo family which instead of being a positive regulator
           of slit responsiveness, antagonizes slit responsiveness
           in precrossing axons.  The Slit-Robo interaction is
           mediated by the second leucine-rich repeat (LRR) domain
           of Slit and the two N-terminal Ig domains of Robo, Ig1
           and Ig2. The primary Robo binding site for Slit2 has
           been shown by surface plasmon resonance experiments and
           mutational analysis to be is the Ig1 domain, while the
           Ig2 domain has been proposed to harbor a weak secondary
           binding site.
          Length = 100

 Score = 35.2 bits (81), Expect = 0.018
 Identities = 23/84 (27%), Positives = 33/84 (39%), Gaps = 7/84 (8%)

Query: 431 DQLSDIKVRAGSNFEFDINVIGEPIPTKEWLCN----DITIISKDRFKIVNDDKST-KLK 485
           +  SD+ V  G     +    G P PT +WL N    +         +IV    S   L+
Sbjct: 6   EHPSDLIVSKGDPATLNCKAEGRPTPTIQWLKNGQPLETDKDDPRSHRIVLPSGSLFFLR 65

Query: 486 VFDSKRG--DSGIYTLAVKNSWGT 507
           V   ++G  D G+Y     NS G 
Sbjct: 66  VVHGRKGRSDEGVYVCVAHNSLGE 89


>gnl|CDD|197706 smart00408, IGc2, Immunoglobulin C-2 Type. 
          Length = 63

 Score = 33.5 bits (77), Expect = 0.026
 Identities = 17/66 (25%), Positives = 25/66 (37%), Gaps = 4/66 (6%)

Query: 441 GSNFEFDINVIGEPIPTKEWLCNDITIISKDRFKIVNDDKSTKLKVFDSKRGDSGIYTLA 500
           G +        G P+P   WL +   +   +RF          L +      DSG+YT  
Sbjct: 2   GQSVTLTCPAEGNPVPNITWLKDGKPLPESNRFVASGS----TLTIKSVSLEDSGLYTCV 57

Query: 501 VKNSWG 506
            +NS G
Sbjct: 58  AENSAG 63



 Score = 29.3 bits (66), Expect = 0.94
 Identities = 13/46 (28%), Positives = 22/46 (47%), Gaps = 3/46 (6%)

Query: 551 IKWIRTNDKRVTIDNVDYFTK--ITIRPLQRSDTAQYTVTATNSQG 594
           I W++ + K +   N    +   +TI+ +   D+  YT  A NS G
Sbjct: 19  ITWLK-DGKPLPESNRFVASGSTLTIKSVSLEDSGLYTCVAENSAG 63


>gnl|CDD|143227 cd05750, Ig_Pro_neuregulin, Immunoglobulin (Ig)-like domain in
           neuregulins (NRGs).  Ig_Pro_neuregulin: immunoglobulin
           (Ig)-like domain in neuregulins (NRGs). NRGs are
           signaling molecules, which participate in cell-cell
           interactions in the nervous system, breast, heart, and
           other organ systems, and are implicated in the pathology
           of diseases including schizophrenia, multiple sclerosis,
           and breast cancer. There are four members of the
           neuregulin gene family (NRG1, -2, -3, and -4). The NRG-1
           protein, binds to and activates the tyrosine kinases
           receptors ErbB3 and ErbB4, initiating signaling
           cascades. The other NRGs proteins bind one or the other
           or both of these ErbBs. NRG-1 has multiple functions;
           for example, in the brain it regulates various processes
           such as radial glia formation and neuronal migration,
           dendritic development, and expression of
           neurotransmitters receptors; in the peripheral nervous
           system NRG-1 regulates processes such as target cell
           differentiation, and Schwann cell survival. There are
           many NRG-1 isoforms, which arise from the alternative
           splicing of mRNA. Less is known of the functions of the
           other NRGs. NRG-2 and -3 are expressed predominantly in
           the nervous system. NRG-2 is expressed by motor neurons
           and terminal Schwann cells, and is concentrated near
           synaptic sites and may be a signal that regulates
           synaptic differentiation. NRG-4 has been shown to direct
           pancreatic islet cell development towards the delta-cell
           lineage.
          Length = 75

 Score = 33.7 bits (77), Expect = 0.032
 Identities = 21/66 (31%), Positives = 33/66 (50%), Gaps = 3/66 (4%)

Query: 452 GEPIPTKEWLCNDITIISKDR---FKIVNDDKSTKLKVFDSKRGDSGIYTLAVKNSWGTD 508
             P    +W  +   +  K++    KI N  K+++L++  +K  DSG YT  V+N  G D
Sbjct: 10  EYPSLRFKWFKDGKELNRKNKPRNIKIRNKKKNSELQINKAKLADSGEYTCVVENILGND 69

Query: 509 KGTAKV 514
             TA V
Sbjct: 70  TVTANV 75



 Score = 32.5 bits (74), Expect = 0.087
 Identities = 19/54 (35%), Positives = 29/54 (53%), Gaps = 4/54 (7%)

Query: 164 KRSNQPCGSGMVRKVQHLDYNTKLGVRMAQRADAGFYTVTAENINGKDSVEVEV 217
            R N+P       K+++   N++L +  A+ AD+G YT   ENI G D+V   V
Sbjct: 26  NRKNKP----RNIKIRNKKKNSELQINKAKLADSGEYTCVVENILGNDTVTANV 75


>gnl|CDD|143173 cd04972, Ig_TrkABC_d4, Fourth domain (immunoglobulin-like) of Trk
           receptors TrkA, TrkB and TrkC.  TrkABC_d4: the fourth
           domain of Trk receptors TrkA, TrkB and TrkC, this is an
           immunoglobulin (Ig)-like domain which binds to
           neurotrophin. The Trk family of receptors are tyrosine
           kinase receptors. They are activated by dimerization,
           leading to autophosphorylation of intracellular tyrosine
           residues, and triggering the signal transduction
           pathway. TrkA, TrkB, and TrkC share significant sequence
           homology and domain organization. The first three
           domains are leucine-rich domains. The fourth and fifth
           domains are Ig-like domains playing a part in ligand
           binding. TrkA, Band C mediate the trophic effects of the
           neurotrophin Nerve growth factor (NGF) family. TrkA is
           recognized by NGF. TrKB is recognized by brain-derived
           neurotrophic factor (BDNF) and neurotrophin (NT)-4. TrkC
           is recognized by NT-3. NT-3 is promiscuous as in some
           cell systems it activates TrkA and TrkB receptors. TrkA
           is a receptor found in all major NGF targets, including
           the sympathetic, trigeminal, and dorsal root ganglia,
           cholinergic neurons of the basal forebrain and the
           striatum. TrKB transcripts are found throughout multiple
           structures of the central and peripheral nervous
           systems. The TrkC gene is expressed throughout the
           mammalian nervous system.
          Length = 90

 Score = 33.3 bits (76), Expect = 0.068
 Identities = 19/91 (20%), Positives = 30/91 (32%), Gaps = 2/91 (2%)

Query: 427 YINRDQLSDIKVRAGSNFEFDINVIGEPIPTKEW-LCNDITIISKDRFKIVNDDKSTKLK 485
            I  D  +   V  G          G P+P  EW +   I I ++        D    L+
Sbjct: 1   TIPVDGPNATVVYEGGTATIRCTAEGSPLPKVEWIIAGLIVIQTRTDTLETTVDIYN-LQ 59

Query: 486 VFDSKRGDSGIYTLAVKNSWGTDKGTAKVTV 516
           + +         T   +N  G    + +VTV
Sbjct: 60  LSNITSETQTTVTCTAENPVGQANVSVQVTV 90


>gnl|CDD|143201 cd05724, Ig2_Robo, Second immunoglobulin (Ig)-like domain in Robo
           (roundabout) receptors.  Ig2_Robo: domain similar to the
           second immunoglobulin (Ig)-like domain in Robo
           (roundabout) receptors. Robo receptors play a role in
           the development of the central nervous system (CNS), and
           are receptors of Slit protein. Slit is a repellant
           secreted by the neural cells in the midline. Slit acts
           through Robo to prevent most neurons from crossing the
           midline from either side. Three mammalian Robo homologs
           (robo1, -2, and -3), and three mammalian Slit homologs
           (Slit-1,-2, -3), have been identified. Commissural
           axons, which cross the midline, express low levels of
           Robo; longitudinal axons, which avoid the midline,
           express high levels of Robo. robo1, -2, and -3 are
           expressed by commissural neurons in the vertebrate
           spinal cord and Slits 1, -2, -3 are expressed at the
           ventral midline. Robo-3 is a divergent member of the
           Robo family which instead of being a positive regulator
           of slit responsiveness, antagonizes slit responsiveness
           in precrossing axons.  The Slit-Robo interaction is
           mediated by the second leucine-rich repeat (LRR) domain
           of Slit and the two N-terminal Ig domains of Robo, Ig1
           and Ig2. The primary Robo binding site for Slit2 has
           been shown by surface plasmon resonance experiments and
           mutational analysis to be is the Ig1 domain, while the
           Ig2 domain has been proposed to harbor a weak secondary
           binding site.
          Length = 86

 Score = 33.1 bits (76), Expect = 0.074
 Identities = 17/67 (25%), Positives = 31/67 (46%), Gaps = 5/67 (7%)

Query: 452 GEPIPTKEWLCNDITIISKD-RFKIVNDDKSTKLKVFDSKRGDSGIYTLAVKNSWGT-DK 509
           G P PT  W  +   +   + R +IV+D     L + ++++ D G Y     N  G  + 
Sbjct: 23  GHPEPTVSWRKDGQPLNLDNERVRIVDDG---NLLIAEARKSDEGTYKCVATNMVGERES 79

Query: 510 GTAKVTV 516
             A+++V
Sbjct: 80  AAARLSV 86


>gnl|CDD|143213 cd05736, Ig2_Follistatin_like, Second immunoglobulin (Ig)-like
           domain of a follistatin-like molecule encoded by the
           Mahya gene and similar proteins.  Ig2_Follistatin_like:
           domain similar to the second immunoglobulin (Ig)-like
           domain found in a follistatin-like molecule encoded by
           the CNS-related Mahya gene. Mahya genes have been
           retained in certain Bilaterian branches during
           evolution.  They are conserved in Hymenoptera and
           Deuterostomes, but are absent from other metazoan
           species such as fruit fly and nematode. Mahya proteins
           are secretory, with a follistatin-like domain
           (Kazal-type serine/threonine protease inhibitor domain
           and EF-hand calcium-binding domain), two Ig-like
           domains, and a novel C-terminal domain. Mahya may be
           involved in learning and memory and in processing of
           sensory information in Hymenoptera and vertebrates.
           Follistatin is a secreted, multidomain protein that
           binds activins with high affinity and antagonizes their
           signaling.
          Length = 76

 Score = 32.6 bits (74), Expect = 0.100
 Identities = 18/65 (27%), Positives = 31/65 (47%)

Query: 452 GEPIPTKEWLCNDITIISKDRFKIVNDDKSTKLKVFDSKRGDSGIYTLAVKNSWGTDKGT 511
           G P+P   WL N + I  K   ++      ++L + + +  D+G YT   KN  G D+  
Sbjct: 9   GIPLPRLTWLKNGMDITPKLSKQLTLIANGSELHISNVRYEDTGAYTCIAKNEAGVDEDI 68

Query: 512 AKVTV 516
           + + V
Sbjct: 69  SSLFV 73


>gnl|CDD|143220 cd05743, Ig_Perlecan_D2_like, Immunoglobulin (Ig)-like domain II
           (D2) of the human basement membrane heparan sulfate
           proteoglycan perlecan, also known as HSPG2.
           Ig_Perlecan_D2_like: the immunoglobulin (Ig)-like domain
           II (D2) of the human basement membrane heparan sulfate
           proteoglycan perlecan, also known as HSPG2. Perlecan
           consists of five domains. Domain I has three putative
           heparan sulfate attachment sites; domain II has four LDL
           receptor-like repeats, and one Ig-like repeat; domain
           III resembles the short arm of laminin chains; domain IV
           has multiple Ig-like repeats (21 repeats in human
           perlecan); and domain V resembles the globular G domain
           of the laminin A chain and internal repeats of EGF.
           Perlecan may participate in a variety of biological
           functions including cell binding, LDL-metabolism,
           basement membrane assembly and selective permeability,
           calcium binding, and growth- and neurite-promoting
           activities.
          Length = 78

 Score = 32.5 bits (74), Expect = 0.10
 Identities = 19/67 (28%), Positives = 24/67 (35%)

Query: 441 GSNFEFDINVIGEPIPTKEWLCNDITIISKDRFKIVNDDKSTKLKVFDSKRGDSGIYTLA 500
           G   EF     G P P   W  N   +    R  I ++     L + D K  D G YT  
Sbjct: 1   GETVEFTCVATGVPTPIINWRLNWGHVPDSARVSITSEGGYGTLTIRDVKESDQGAYTCE 60

Query: 501 VKNSWGT 507
             N+ G 
Sbjct: 61  AINTRGM 67


>gnl|CDD|143299 cd05891, Ig_M-protein_C, C-terminal immunoglobulin (Ig)-like domain
           of M-protein (also known as myomesin-2).
           Ig_M-protein_C: the C-terminal immunoglobulin (Ig)-like
           domain of M-protein (also known as myomesin-2).
           M-protein is a structural protein localized to the
           M-band, a transverse structure in the center of the
           sarcomere, and is a candidate for M-band bridges.
           M-protein is modular consisting mainly of repetitive
           IG-like and fibronectin type III (FnIII) domains, and
           has a muscle-type specific expression pattern. M-protein
           is present in fast fibers.
          Length = 92

 Score = 32.6 bits (74), Expect = 0.13
 Identities = 21/82 (25%), Positives = 33/82 (40%), Gaps = 1/82 (1%)

Query: 436 IKVRAGSNFEFDINVIGEPIPTKEWLCNDITIISKDRFKI-VNDDKSTKLKVFDSKRGDS 494
           + +  G        V G P P   W  ND  I   + + + +   K   L +      DS
Sbjct: 11  VTIMEGKTLNLTCTVFGNPDPEVIWFKNDQDIELSEHYSVKLEQGKYASLTIKGVTSEDS 70

Query: 495 GIYTLAVKNSWGTDKGTAKVTV 516
           G Y++ VKN +G +     V+V
Sbjct: 71  GKYSINVKNKYGGETVDVTVSV 92


>gnl|CDD|143212 cd05735, Ig8_DSCAM, Eight immunoglobulin (Ig) domain of Down
           Syndrome Cell Adhesion molecule (DSCAM).  Ig8_DSCAM:
           the eight immunoglobulin (Ig) domain of Down Syndrome
           Cell Adhesion molecule (DSCAM). DSCAM is a cell adhesion
           molecule expressed largely in the developing nervous
           system. The gene encoding DSCAM is located at human
           chromosome 21q22, the locus associated with the mental
           retardation phenotype of Down Syndrome. DSCAM is
           predicted to be the largest member of the IG
           superfamily. It has been demonstrated that DSCAM can
           mediate cation-independent homophilic intercellular
           adhesion.
          Length = 88

 Score = 32.3 bits (73), Expect = 0.16
 Identities = 18/60 (30%), Positives = 32/60 (53%), Gaps = 3/60 (5%)

Query: 646 SQYLMNDKRVTIDNVDYFTKITIRPLQRSDTAQYTVTATNSQGKDQVFIEVVVTDKPSAP 705
           S+YL++ K V     +  + + I P  R D+  ++  A NS G+D+  I++ V + P  P
Sbjct: 32  SRYLVSTKEV---GDEVISTLQILPTVREDSGFFSCHAINSYGEDRGIIQLTVQEPPDPP 88



 Score = 31.2 bits (70), Expect = 0.34
 Identities = 12/33 (36%), Positives = 20/33 (60%)

Query: 194 RADAGFYTVTAENINGKDSVEVEVIVLDKPSPP 226
           R D+GF++  A N  G+D   +++ V + P PP
Sbjct: 56  REDSGFFSCHAINSYGEDRGIIQLTVQEPPDPP 88



 Score = 30.8 bits (69), Expect = 0.47
 Identities = 13/42 (30%), Positives = 23/42 (54%)

Query: 570 TKITIRPLQRSDTAQYTVTATNSQGKDQVFIEVVVTDKPSAP 611
           + + I P  R D+  ++  A NS G+D+  I++ V + P  P
Sbjct: 47  STLQILPTVREDSGFFSCHAINSYGEDRGIIQLTVQEPPDPP 88



 Score = 30.8 bits (69), Expect = 0.55
 Identities = 15/43 (34%), Positives = 26/43 (60%)

Query: 474 KIVNDDKSTKLKVFDSKRGDSGIYTLAVKNSWGTDKGTAKVTV 516
           K V D+  + L++  + R DSG ++    NS+G D+G  ++TV
Sbjct: 39  KEVGDEVISTLQILPTVREDSGFFSCHAINSYGEDRGIIQLTV 81


>gnl|CDD|143179 cd04978, Ig4_L1-NrCAM_like, Fourth immunoglobulin (Ig)-like domain
           of L1, Ng-CAM (Neuron-glia CAM cell adhesion molecule),
           and NrCAM (Ng-CAM-related).  Ig4_L1-NrCAM_like: fourth
           immunoglobulin (Ig)-like domain of L1, Ng-CAM
           (Neuron-glia CAM cell adhesion molecule), and NrCAM
           (Ng-CAM-related). These proteins belong to the L1
           subfamily of cell adhesion molecules (CAMs) and are
           comprised of an extracellular region having six Ig-like
           domains and five fibronectin type III domains, a
           transmembrane region and an intracellular domain. These
           molecules are primarily expressed in the nervous system.
           L1 is associated with an X-linked recessive disorder,
           X-linked hydrocephalus, MASA syndrome, or spastic
           paraplegia type 1, that involves abnormalities of axonal
           growth.
          Length = 76

 Score = 31.2 bits (71), Expect = 0.30
 Identities = 18/79 (22%), Positives = 26/79 (32%), Gaps = 5/79 (6%)

Query: 441 GSNFEFDINVIGEPIPTKEWLCNDITI--ISKDRFKIVNDDKSTKLKVFDSKRGDSGIYT 498
           G     D    G P PT  W  N + I  +  D            L + + +  D+ +Y 
Sbjct: 1   GETGRLDCEAEGIPQPTITWRLNGVPIEELPPDP---RRRVDGGTLILSNVQPNDTAVYQ 57

Query: 499 LAVKNSWGTDKGTAKVTVL 517
               N  G     A V V+
Sbjct: 58  CNASNVHGYLLANAFVHVV 76


>gnl|CDD|143171 cd04970, Ig6_Contactin_like, Sixth Ig domain of contactin.
           Ig6_Contactin_like: Sixth Ig domain of contactins.
           Contactins are neural cell adhesion molecules and are
           comprised of six Ig domains followed by four fibronectin
           type III(FnIII) domains anchored to the membrane by
           glycosylphosphatidylinositol. The first four Ig domains
           form the intermolecular binding fragment, which arranges
           as a compact U-shaped module via contacts between Ig
           domains 1 and 4, and between Ig domains 2 and 3.
           Contactin-2 (TAG-1, axonin-1) may play a part in the
           neuronal processes of neurite outgrowth, axon guidance
           and fasciculation, and neuronal migration. This group
           also includes contactin-1 and contactin-5. The different
           contactins show different expression patterns in the
           central nervous system. During development and in
           adulthood, contactin-2 is transiently expressed in
           subsets of central and peripheral neurons. Contactin-5
           is expressed specifically in the rat postnatal nervous
           system, peaking at about 3 weeks postnatal, and a lack
           of contactin-5 (NB-2) results in an impairment of neur
           onal act ivity in the rat auditory system. Contactin-5
           is highly expressed in the adult human brain in the
           occipital lobe and in the amygdala. Contactin-1 is
           differentially expressed in tumor tissues and may,
           through a RhoA mechanism, facilitate invasion and
           metastasis of human lung adenocarcinoma.
          Length = 85

 Score = 30.9 bits (70), Expect = 0.36
 Identities = 24/77 (31%), Positives = 32/77 (41%), Gaps = 1/77 (1%)

Query: 147 ARDPYSKYTSVSSLDSHKRSNQPCGSGMVRKVQHLDYNTKLGVRMAQRADAGFYTVTAEN 206
           + DP    T   S +         G G  R+V   D N  L +R AQ   AG YT TA+ 
Sbjct: 10  SHDPTLDLTFTWSFNGVPIDFDKDG-GHYRRVGGKDSNGDLMIRNAQLKHAGKYTCTAQT 68

Query: 207 INGKDSVEVEVIVLDKP 223
           +    S   ++IV   P
Sbjct: 69  VVDSLSASADLIVRGPP 85


>gnl|CDD|143274 cd05866, Ig1_NCAM-2, First immunoglobulin (Ig)-like domain of
           neural cell adhesion molecule NCAM-2.  Ig1_NCAM-2:
           first immunoglobulin (Ig)-like domain of neural cell
           adhesion molecule NCAM-2 (OCAM/mamFas II, RNCAM). NCAM-2
            is organized similarly to NCAM , including five
           N-terminal Ig-like domains and two fibronectin type III
           domains. NCAM-2 is differentially expressed in the
           developing and mature olfactory epithelium (OE), and may
           function like NCAM, as an adhesion molecule.
          Length = 92

 Score = 31.2 bits (70), Expect = 0.47
 Identities = 17/74 (22%), Positives = 34/74 (45%)

Query: 433 LSDIKVRAGSNFEFDINVIGEPIPTKEWLCNDITIISKDRFKIVNDDKSTKLKVFDSKRG 492
           LS +++  G +  F    IGEP     +      I+S  R  +  +   ++L ++++   
Sbjct: 7   LSKVELSVGESKFFTCTAIGEPESIDWYNPQGEKIVSSQRVVVQKEGVRSRLTIYNANIE 66

Query: 493 DSGIYTLAVKNSWG 506
           D+GIY     ++ G
Sbjct: 67  DAGIYRCQATDAKG 80


>gnl|CDD|143240 cd05763, Ig_1, Subgroup of the immunoglobulin (Ig) superfamily.
           Ig_1: subgroup of the immunoglobulin (Ig) domain found
           in the Ig superfamily. The Ig superfamily is a
           heterogenous group of proteins, built on a common fold
           comprised of a sandwich of two beta sheets. Members of
           the Ig superfamily are components of immunoglobulin,
           neuroglia, cell surface glycoproteins, such as T-cell
           receptors, CD2, CD4, CD8, and membrane glycoproteins,
           such as butyrophilin and chondroitin sulfate
           proteoglycan core protein. A predominant feature of most
           Ig domains is a disulfide bridge connecting the two
           beta-sheets with a tryptophan residue packed against the
           disulfide bond.
          Length = 75

 Score = 30.7 bits (69), Expect = 0.47
 Identities = 18/69 (26%), Positives = 30/69 (43%), Gaps = 5/69 (7%)

Query: 452 GEPIPTKEWL---CNDITIISKDRFKIVNDDKSTKLKVFDSKRGDSGIYTLAVKNSWGTD 508
           G P P   W      D     + R  ++ +D      + D K  D+G+Y+   +N+ G+ 
Sbjct: 9   GHPTPQIAWQKDGGTDFPAARERRMHVMPEDDV--FFIVDVKIEDTGVYSCTAQNTAGSI 66

Query: 509 KGTAKVTVL 517
              A +TVL
Sbjct: 67  SANATLTVL 75


>gnl|CDD|143250 cd05773, Ig8_hNephrin_like, Eighth immunoglobulin-like domain of
           nephrin.  Ig8_hNephrin_like: domain similar to the
           eighth immunoglobulin-like domain in human nephrin.
           Nephrin is an integral component of the slit diaphragm,
           and is a central component of the glomerular
           ultrafilter. Nephrin plays a structural role, and has a
           role in signaling. Nephrin is a transmembrane protein
           having a short intracellular portion, and an
           extracellular portion comprised of eight Ig-like
           domains, and one fibronectin type III-like domain. The
           extracellular portions of nephrin, from neighboring foot
           processes of separate podocyte cells, may interact with
           each other, and in association with other components of
           the slit diaphragm, form a porous molecular sieve within
           the slit pore.  The intracellular portion of nephrin is
           associated with linker proteins, which connect nephrin
           to the actin cytoskeleton. The intracellular portion is
           tyrosine phosphorylated, and mediates signaling from the
           slit diaphragm into the podocytes.
          Length = 109

 Score = 31.1 bits (70), Expect = 0.51
 Identities = 13/28 (46%), Positives = 19/28 (67%)

Query: 581 DTAQYTVTATNSQGKDQVFIEVVVTDKP 608
           D A +T TA NS G+D + I++V T +P
Sbjct: 81  DYALFTCTAHNSLGEDSLDIQLVSTSRP 108



 Score = 31.1 bits (70), Expect = 0.51
 Identities = 13/28 (46%), Positives = 19/28 (67%)

Query: 675 DTAQYTVTATNSQGKDQVFIEVVVTDKP 702
           D A +T TA NS G+D + I++V T +P
Sbjct: 81  DYALFTCTAHNSLGEDSLDIQLVSTSRP 108


>gnl|CDD|143167 cd00099, IgV, Immunoglobulin variable domain (IgV).  IgV:
           Immunoglobulin variable domain (IgV). Members of the IgV
           family are components of immunoglobulin (Ig) and T cell
           receptors. The basic structure of Ig molecules is a
           tetramer of two light chains and two heavy chains linked
           by disulfide bonds. In Ig, each chain is composed of one
           variable domain (IgV) and one or more constant domains
           (IgC); these names reflect the fact that the variability
           in sequences is higher in the variable domain than in
           the constant domain. Within the variable domain, there
           are regions of even more variability called the
           hypervariable or complementarity-determining regions
           (CDRs) which are responsible for antigen binding. A
           predominant feature of most Ig domains is the disulfide
           bridge connecting 2 beta-sheets with a tryptophan
           residue packed against the disulfide bond.
          Length = 105

 Score = 30.7 bits (70), Expect = 0.62
 Identities = 18/54 (33%), Positives = 24/54 (44%), Gaps = 8/54 (14%)

Query: 470 KDRFKIVNDDKSTK--LKVFDSKRGDSGIYTLAVKNSWGTDK-----GTAKVTV 516
           K RF    D   +   L +   +  DS +Y  AV  S GT K     GT ++TV
Sbjct: 53  KGRFSGTRDSSKSSFTLTISSLQPEDSAVYYCAVSLSGGTYKLYFGQGT-RLTV 105


>gnl|CDD|128874 smart00612, Kelch, Kelch domain. 
          Length = 47

 Score = 29.1 bits (66), Expect = 0.65
 Identities = 14/34 (41%), Positives = 17/34 (50%), Gaps = 2/34 (5%)

Query: 81  DGGEPLEGYLVEKYDPETGVWIPVGKTREPEMDV 114
           DGG+ L+   VE YDPET  W P+     P    
Sbjct: 9   DGGQRLKS--VEVYDPETNKWTPLPSMPTPRSGH 40


>gnl|CDD|143267 cd05859, Ig4_PDGFR-alpha, Fourth immunoglobulin (Ig)-like domain of
           platelet-derived growth factor receptor (PDGFR) alpha.
           IG4_PDGFR-alpha: The fourth immunoglobulin (Ig)-like
           domain of platelet-derived growth factor receptor
           (PDGFR) alpha. PDGF is a potent mitogen for connective
           tissue cells. PDGF-stimulated processes are mediated by
           three different PDGFs (PDGF-A,-B, and C). PDGFR alpha
           binds to all three PDGFs, whereas the PDGFR beta (not
           included in this group) binds only to PDGF-B. PDGF alpha
           is organized as an extracellular component having five
           Ig-like domains, a transmembrane segment, and a
           cytoplasmic portion having protein tyrosine kinase
           activity. In mice, PDGFR alpha and PDGFR beta are
           essential for normal development.
          Length = 101

 Score = 30.6 bits (69), Expect = 0.72
 Identities = 19/67 (28%), Positives = 30/67 (44%), Gaps = 6/67 (8%)

Query: 443 NFEFDINVIGEPIPTKEWLCN------DITIISKDRFKIVNDDKSTKLKVFDSKRGDSGI 496
             EF + V   P P   WL +      ++T I+     +      +KLK+  +K  DSG+
Sbjct: 20  VKEFVVEVEAYPPPQIRWLKDNRTLIENLTEITTSEHNVQETRYVSKLKLIRAKEEDSGL 79

Query: 497 YTLAVKN 503
           YT   +N
Sbjct: 80  YTALAQN 86


>gnl|CDD|143284 cd05876, Ig3_L1-CAM, Third immunoglobulin (Ig)-like domain of the
           L1 cell adhesion molecule (CAM).  Ig3_L1-CAM:  third
           immunoglobulin (Ig)-like domain of the L1 cell adhesion
           molecule (CAM). L1 belongs to the L1 subfamily of cell
           adhesion molecules (CAMs) and is comprised of an
           extracellular region having six Ig-like domains, five
           fibronectin type III domains, a transmembrane region and
           an intracellular domain. L1 is primarily expressed in
           the nervous system and is involved in its development
           and function. L1 is associated with an X-linked
           recessive disorder, X-linked hydrocephalus, MASA
           syndrome, or spastic paraplegia type 1, that involves
           abnormalities of axonal growth. This group also contains
           the chicken neuron-glia cell adhesion molecule, Ng-CAM.
          Length = 71

 Score = 29.9 bits (67), Expect = 0.76
 Identities = 22/66 (33%), Positives = 33/66 (50%), Gaps = 5/66 (7%)

Query: 452 GEPIPTKEWLCNDITIISKDRFKIVNDDKSTKL-KVFDSKRGDSGIYTLAVKNSWGTDKG 510
           G P P   W   D   +S +R K +N++K+ +L  V +S   D G Y    +NS G+ + 
Sbjct: 9   GLPTPEVHWDRID-GPLSPNRTKKLNNNKTLQLDNVLES---DDGEYVCTAENSEGSARH 64

Query: 511 TAKVTV 516
              VTV
Sbjct: 65  HYTVTV 70


>gnl|CDD|199885 cd02855, E_set_GBE_prok_N, N-terminal Early set domain associated
           with the catalytic domain of prokaryotic glycogen
           branching enzyme.  This subfamily is composed of
           predominantly prokaryotic 1,4 alpha glucan branching
           enzymes, also called glycogen branching enzymes. E or
           "early" set domains are associated with the catalytic
           domain of glycogen branching enzymes at the N-terminal
           end. Glycogen branching enzyme catalyzes the formation
           of alpha-1,6 branch points in either glycogen or starch
           by cleavage of the alpha-1,4 glucosidic linkage,
           yielding a non-reducing end oligosaccharide chain, as
           well as the subsequent attachment of short glucosyl
           chains to the alpha-1,6 position. By increasing the
           number of non-reducing ends, glycogen is more reactive
           to synthesis and digestion as well as being more
           soluble. The N-terminal domain of the 1,4 alpha glucan
           branching enzyme may be related to the immunoglobulin
           and/or fibronectin type III superfamilies. These domains
           are associated with different types of catalytic domains
           at  either the N-terminal or C-terminal end and may be
           involved in homodimeric/tetrameric/dodecameric
           interactions.  Members of this family include members of
           the alpha amylase family, sialidase, galactose oxidase,
           cellulase, cellulose, hyaluronate lyase, chitobiase, and
           chitinase, among others.
          Length = 105

 Score = 30.2 bits (69), Expect = 0.92
 Identities = 14/60 (23%), Positives = 25/60 (41%), Gaps = 19/60 (31%)

Query: 95  DPETGVWIPVGKTREPEMDVTGLTPGHEYKFRVKALNKEGESEPLETFSSIIARDPYSKY 154
             ++GVW         E+ + G   G  YK+ ++    +GE         ++  DPY+ Y
Sbjct: 52  IGDSGVW---------ELFIPGAKEGDLYKYEIE--TADGE--------VLLKADPYAFY 92


>gnl|CDD|143184 cd04983, IgV_TCR_alpha_like, Immunoglobulin (Ig) variable (V)
           domain of T-cell receptor (TCR) alpha chain and similar
           proteins.  IgV_TCR_alpha: immunoglobulin (Ig) variable
           domain of the alpha chain of alpha/beta T-cell antigen
           receptors (TCRs). TCRs mediate antigen recognition by T
           lymphocytes, and are composed of alpha and beta, or
           gamma and delta, polypeptide chains with variable (V)
           and constant (C) regions. This group represents the
           variable domain of the alpha chain of TCRs and also
           includes the variable domain of delta chains of TCRs.
           Alpha/beta TCRs recognize antigen as peptide fragments
           presented by major histocompatibility complex (MHC)
           molecules. The variable domain of TCRs is responsible
           for antigen recognition, and is located at the
           N-terminus of the receptor.  Gamma/delta TCRs recognize
           intact protein antigens; they recognize proteins
           antigens directly and without antigen processing, and
           MHC independently of the bound peptide.
          Length = 109

 Score = 30.3 bits (69), Expect = 0.93
 Identities = 19/55 (34%), Positives = 27/55 (49%), Gaps = 8/55 (14%)

Query: 469 SKDRFKIVNDD--KSTKLKVFDSKRGDSGIYTLAVKNSWGTDK-----GTAKVTV 516
            K RF    D   KS+ L +  ++  DS +Y  A+  S GT K     GT ++TV
Sbjct: 55  EKGRFSATLDKSRKSSSLHISAAQLSDSAVYFCALSESGGTGKLTFGKGT-RLTV 108


>gnl|CDD|143170 cd04969, Ig5_Contactin_like, Fifth Ig domain of contactin.
           Ig5_Contactin_like: Fifth Ig domain of contactins.
           Contactins are neural cell adhesion molecules and are
           comprised of six Ig domains followed by four fibronectin
           type III(FnIII) domains anchored to the membrane by
           glycosylphosphatidylinositol. The first four Ig domains
           form the intermolecular binding fragment, which arranges
           as a compact U-shaped module via contacts between Ig
           domains 1 and 4, and between Ig domains 2 and 3.
           Contactin-2 (TAG-1, axonin-1) may play a part in the
           neuronal processes of neurite outgrowth, axon guidance
           and fasciculation, and neuronal migration. This group
           also includes contactin-1 and contactin-5. The different
           contactins show different expression patterns in the
           central nervous system. During development and in
           adulthood, contactin-2 is transiently expressed in
           subsets of central and peripheral neurons. Contactin-5
           is expressed specifically in the rat postnatal nervous
           system, peaking at about 3 weeks postnatal, and a lack
           of contactin-5 (NB-2) results in an impairment of
           neuronal act ivity in the rat auditory system.
           Contactin-5 is highly expressed in the adult human brain
           in the occipital lobe and in the amygdala. Contactin-1
           is differentially expressed in tumor tissues and may,
           through a RhoA mechanism, facilitate invasion and
           metastasis of human lung adenocarcinoma.
          Length = 73

 Score = 29.3 bits (66), Expect = 1.2
 Identities = 17/65 (26%), Positives = 27/65 (41%), Gaps = 3/65 (4%)

Query: 452 GEPIPTKEWLCNDITIISKDRFKIVNDDKSTKLKVFDSKRGDSGIYTLAVKNSWGTDKGT 511
             P PT  W      + +  R  I   D S  L++ +  + D G YT   +N +G    T
Sbjct: 12  AAPKPTISWSKGTELLTNSSRICIW-PDGS--LEILNVTKSDEGKYTCFAENFFGKANST 68

Query: 512 AKVTV 516
             ++V
Sbjct: 69  GSLSV 73



 Score = 27.8 bits (62), Expect = 4.3
 Identities = 10/24 (41%), Positives = 13/24 (54%)

Query: 187 LGVRMAQRADAGFYTVTAENINGK 210
           L +    ++D G YT  AEN  GK
Sbjct: 41  LEILNVTKSDEGKYTCFAENFFGK 64


>gnl|CDD|143169 cd04968, Ig3_Contactin_like, Third Ig domain of contactin.
           Ig3_Contactin_like: Third Ig domain of contactins.
           Contactins are neural cell adhesion molecules and are
           comprised of six Ig domains followed by four fibronectin
           type III(FnIII) domains anchored to the membrane by
           glycosylphosphatidylinositol. The first four Ig domains
           form the intermolecular binding fragment, which arranges
           as a compact U-shaped module via contacts between Ig
           domains 1 and 4, and between Ig domains 2 and 3.
           Contactin-2 (TAG-1, axonin-1) may play a part in the
           neuronal processes of neurite outgrowth, axon guidance
           and fasciculation, and neuronal migration. This group
           also includes contactin-1 and contactin-5. The different
           contactins show different expression patterns in the
           central nervous system. During development and in
           adulthood, contactin-2 is transiently expressed in
           subsets of central and peripheral neurons. Contactin-5
           is expressed specifically in the rat postnatal nervous
           system, peaking at about 3 weeks postnatal, and a lack
           of contactin-5 (NB-2) results in an impairment of
           neuronal act ivity in the rat auditory system.
           Contactin-5 is highly expressed in the adult human brain
           in the occipital lobe and in the amygdala. Contactin-1
           is differentially expressed in tumor tissues and may,
           through a RhoA mechanism, facilitate invasion and
           metastasis of human lung adenocarcinoma.
          Length = 88

 Score = 29.4 bits (66), Expect = 1.4
 Identities = 12/27 (44%), Positives = 15/27 (55%)

Query: 193 QRADAGFYTVTAENINGKDSVEVEVIV 219
           Q  D G Y   AENI GKD+ +  + V
Sbjct: 61  QFEDEGTYECEAENIKGKDTHQGRIYV 87



 Score = 27.8 bits (62), Expect = 4.8
 Identities = 20/91 (21%), Positives = 35/91 (38%), Gaps = 4/91 (4%)

Query: 426 PYINRDQLSDIKVRAGSNFEFDINVIGEPIPTKEWLCNDITIISKDRFKIVNDDKSTKLK 485
           P I      D     G N   +   +G P+P  +W   D ++ S      ++   +  LK
Sbjct: 1   PSIIVVFPKDTYALKGQNVTLECFALGNPVPQIKWRKVDGSMPSS---AEISMSGAV-LK 56

Query: 486 VFDSKRGDSGIYTLAVKNSWGTDKGTAKVTV 516
           + + +  D G Y    +N  G D    ++ V
Sbjct: 57  IPNIQFEDEGTYECEAENIKGKDTHQGRIYV 87


>gnl|CDD|143178 cd04977, Ig1_NCAM-1_like, First immunoglobulin (Ig)-like domain of
           neural cell adhesion molecule NCAM-1 and similar
           proteins.  Ig1_NCAM-1 like: first immunoglobulin
           (Ig)-like domain of neural cell adhesion molecule
           NCAM-1. NCAM-1 plays important roles in the development
           and regeneration of the central nervous system, in
           synaptogenesis and neural migration. NCAM mediates
           cell-cell and cell-substratum recognition and adhesion
           via homophilic (NCAM-NCAM), and heterophilic
           (NCAM-nonNCAM), interactions. NCAM is expressed as three
           major isoforms having different intracellular
           extensions. The extracellular portion of NCAM has five
           N-terminal Ig-like domains and two fibronectin type III
           domains. The double zipper adhesion complex model for
           NCAM homophilic binding involves the Ig1, Ig2, and Ig3
           domains. By this model, Ig1 and Ig2 mediate dimerization
           of NCAM molecules situated on the same cell surface (cis
           interactions), and Ig3 domains mediate interactions
           between NCAM molecules expressed on the surface of
           opposing cells (trans interactions), through binding to
           the Ig1 and Ig2 domains. The adhesive ability of NCAM is
           modulated by the addition of polysialic acid chains to
           the fifth Ig-like domain. Also included in this group is
           NCAM-2 (also known as OCAM/mamFas II and RNCAM).  NCAM-2
           is differentially expressed in the developing and mature
           olfactory epithelium (OE).
          Length = 92

 Score = 29.4 bits (66), Expect = 1.5
 Identities = 21/86 (24%), Positives = 43/86 (50%), Gaps = 2/86 (2%)

Query: 433 LSDIKVRAGSNFEFDINVIGEPIPTKEWLCNDITIISKDRFKIV-NDDKSTKLKVFDSKR 491
            S  ++  G +  F   VIGEP     +  N   ++++ +  +V NDD  + L ++++  
Sbjct: 7   PSQGEISVGESKFFLCQVIGEPKDISWFSPNGEKLVTQQQISVVQNDDVRSTLTIYNANI 66

Query: 492 GDSGIYTLAVKNSWGTD-KGTAKVTV 516
            D+GIY     ++ GT+ + T  + +
Sbjct: 67  EDAGIYKCVATDAKGTESEATVNLKI 92


>gnl|CDD|143206 cd05729, Ig2_FGFR_like, Second immunoglobulin (Ig)-like domain of
           fibroblast growth factor (FGF) receptor and similar
           proteins.  Ig2_FGFR_like: domain similar to the second
           immunoglobulin (Ig)-like domain of fibroblast growth
           factor (FGF) receptor. FGF receptors bind FGF signaling
           polypeptides. FGFs participate in multiple processes
           such as morphogenesis, development, and angiogenesis.
           FGFs bind to four FGF receptor tyrosine kinases (FGFR1,
           -2, -3, -4). Receptor diversity is controlled by
           alternative splicing producing splice variants with
           different ligand binding characteristics and different
           expression patterns. FGFRs have an extracellular region
           comprised of three Ig-like domains, a single
           transmembrane helix, and an intracellular tyrosine
           kinase domain. Ligand binding and specificity reside in
           the Ig-like domains 2 and 3, and the linker region that
           connects these two. FGFR activation and signaling depend
           on FGF-induced dimerization, a process involving cell
           surface heparin or heparin sulfate proteoglycans. This
           group also contains fibroblast growth factor (FGF)
           receptor_like-1(FGFRL1). FGFRL1 does not have a protein
           tyrosine kinase domain at its C terminus; neither does
           its cytoplasmic domain appear to interact with a
           signaling partner. It has been suggested that FGFRL1 may
           not have any direct signaling function, but instead acts
           as a decoy receptor trapping FGFs and preventing them
           from binding other receptors.
          Length = 85

 Score = 29.3 bits (66), Expect = 1.6
 Identities = 25/82 (30%), Positives = 31/82 (37%), Gaps = 1/82 (1%)

Query: 436 IKVRAGSNFEFDINVIGEPIPTKEWLCNDITIISKDRFKIV-NDDKSTKLKVFDSKRGDS 494
             V AGS         G P PT  WL +      + R        K   L +      DS
Sbjct: 4   HAVPAGSTVRLKCPASGNPRPTITWLKDGKPFKKEHRIGGYKVRKKKWTLILESVVPSDS 63

Query: 495 GIYTLAVKNSWGTDKGTAKVTV 516
           G YT  V+N +G+   T KV V
Sbjct: 64  GKYTCIVENKYGSINHTYKVDV 85


>gnl|CDD|143270 cd05862, Ig1_VEGFR, First immunoglobulin (Ig)-like domain of
           vascular endothelial growth factor (VEGF) receptor(R).
           IG1_VEGFR: first immunoglobulin (Ig)-like domain of
           vascular endothelial growth factor (VEGF) receptor(R).
           The VEGFRs have an extracellular component with seven
           Ig-like domains, a transmembrane segment, and an
           intracellular tyrosine kinase domain interrupted by a
           kinase-insert domain. The VEGFR family consists of three
           members, VEGFR-1 (Flt-1), VEGFR-2 (KDR/Flk-1) and
           VEGFR-3 (Flt-4). VEGF_A interacts with both VEGFR-1 and
           VEGFR-2. VEGFR-1 binds strongest to VEGF, VEGF-2 binds
           more weakly. VEGFR-3 appears not to bind VEGF, but binds
           other members of the VEGF family (VEGF-C and -D). VEGFRs
           bind VEGFs with high affinity with the IG-like domains.
           VEGF-A is important to the growth and maintenance of
           vascular endothelial cells and to the development of new
           blood- and lymphatic-vessels in physiological and
           pathological states. VEGFR-2 is a major mediator of the
           mitogenic, angiogenic and microvascular
           permeability-enhancing effects of VEGF-A. VEGFR-1 may
           play an inhibitory part in these processes by binding
           VEGF and interfering with its interaction with VEGFR-2.
           VEGFR-1 has a signaling role in mediating monocyte
           chemotaxis. VEGFR-2 and -1 may mediate a chemotactic and
           a survival signal in hematopoietic stem cells or
           leukemia cells. VEGFR-3 has been shown to be involved in
           tumor angiogenesis and growth.
          Length = 86

 Score = 29.4 bits (66), Expect = 1.6
 Identities = 12/51 (23%), Positives = 24/51 (47%), Gaps = 1/51 (1%)

Query: 649 LMNDKRVTIDNVDYFTKI-TIRPLQRSDTAQYTVTATNSQGKDQVFIEVVV 698
            +++ R ++      +   TI  +  SD  +YT TA++ Q   +    V+V
Sbjct: 34  SVSENRRSLQEHTELSSTLTIENVTLSDLGRYTCTASSGQMIAKNSTIVIV 84


>gnl|CDD|143259 cd05851, Ig3_Contactin-1, Third Ig domain of contactin-1.
           Ig3_Contactin-1: Third Ig domain of the neural cell
           adhesion molecule contactin-1. Contactins are comprised
           of six Ig domains followed by four fibronectin type III
           (FnIII) domains anchored to the membrane by
           glycosylphosphatidylinositol. Contactin-1 is
           differentially expressed in tumor tissues and may
           through a RhoA mechanism, facilitate invasion and
           metastasis of human lung adenocarcinoma.
          Length = 88

 Score = 29.2 bits (65), Expect = 1.9
 Identities = 20/85 (23%), Positives = 32/85 (37%), Gaps = 4/85 (4%)

Query: 432 QLSDIKVRAGSNFEFDINVIGEPIPTKEWLCNDITIISKDRFKIVNDDKSTKLKVFDSKR 491
           +  D     G N   +   +G P+P   W      + +     +        LK+F+ + 
Sbjct: 7   KFKDTYALKGQNVTLECFALGNPVPVIRWRKILEPMPATAEISM----SGAVLKIFNIQP 62

Query: 492 GDSGIYTLAVKNSWGTDKGTAKVTV 516
            D G Y    +N  G DK  A+V V
Sbjct: 63  EDEGTYECEAENIKGKDKHQARVYV 87


>gnl|CDD|219430 pfam07495, Y_Y_Y, Y_Y_Y domain.  This domain is mostly found at the
           end of the beta propellers (pfam07494) in a family of
           two component regulators. However they are also found
           tandemly repeated in Clostridium tetani CTC_02402
           without other signal conduction domains being present.
           It's named after the conserved tyrosines found in the
           alignment. The exact function is not known.
          Length = 64

 Score = 28.5 bits (64), Expect = 2.0
 Identities = 17/62 (27%), Positives = 26/62 (41%), Gaps = 6/62 (9%)

Query: 80  DDGGEPLEGYLVEKYDPETGVWIPVGKTREPEMDVTGLTPGHEYKFRVKALNKEGESEPL 139
                 L  Y +E +D E   W+ +G   E     T L PG +Y  +VKA + +G     
Sbjct: 2   SGPENLLYRYRLEGFDGE---WVELGDYSEASY--TNLPPG-KYTLKVKAKDNDGNWSYD 55

Query: 140 ET 141
           + 
Sbjct: 56  DA 57


>gnl|CDD|165173 PHA02826, PHA02826, IL-1 receptor-like protein; Provisional.
          Length = 227

 Score = 30.7 bits (69), Expect = 2.1
 Identities = 19/65 (29%), Positives = 32/65 (49%), Gaps = 6/65 (9%)

Query: 457 TKEWLCNDITIISKDRFKIVNDDKSTKLKVFDSKRGDSGIYTLAV---KNSWGTD-KGTA 512
           T  W  N   ++  DR ++ N++  + L +  +   DSGIYT  +   KNS   +     
Sbjct: 165 TLTWYKNGNIVLYTDRIQLRNNN--STLVIKSATHDDSGIYTCNLRFNKNSNNYNITKEY 222

Query: 513 KVTVL 517
           KVT++
Sbjct: 223 KVTII 227


>gnl|CDD|143172 cd04971, Ig_TrKABC_d5, Fifth domain (immunoglobulin-like) of Trk
           receptors TrkA, TrkB and TrkC.  TrkABC_d5: the fifth
           domain of Trk receptors TrkA, TrkB and TrkC, this is an
           immunoglobulin (Ig)-like domain which binds to
           neurotrophin. The Trk family of receptors are tyrosine
           kinase receptors. They are activated by dimerization,
           leading to autophosphorylation of intracellular tyrosine
           residues, and triggering the signal transduction
           pathway. TrkA, TrkB, and TrkC share significant sequence
           homology and domain organization. The first three
           domains are leucine-rich domains. The fourth and fifth
           domains are Ig-like domains playing a part in ligand
           binding. TrkA, Band C mediate the trophic effects of the
           neurotrophin Nerve growth factor (NGF) family. TrkA is
           recognized by NGF. TrkB is recognized by brain-derived
           neurotrophic factor (BDNF) and neurotrophin (NT)-4. TrkC
           is recognized by NT-3. NT-3 is promiscuous as in some
           cell systems it activates TrkA and TrkB receptors. TrkA
           is a receptor found in all major NGF targets, including
           the sympathetic, trigeminal, and dorsal root ganglia,
           cholinergic neurons of the basal forebrain and the
           striatum. TrKB transcripts are found throughout multiple
           structures of the central and peripheral nervous
           systems. The TrkC gene is expressed throughout the
           mammalian nervous system.
          Length = 81

 Score = 28.5 bits (64), Expect = 2.6
 Identities = 16/72 (22%), Positives = 23/72 (31%), Gaps = 7/72 (9%)

Query: 447 DINVIGEPIPTKEWLCN-------DITIISKDRFKIVNDDKSTKLKVFDSKRGDSGIYTL 499
              V G P PT  W  N       D              +    L+  +    ++G YTL
Sbjct: 4   PFTVRGNPKPTLTWYHNGAVLNESDYIRTEIHYEVTTPTEYHGCLQFDNPTHVNNGNYTL 63

Query: 500 AVKNSWGTDKGT 511
              N +G D  +
Sbjct: 64  VASNEYGQDSKS 75


>gnl|CDD|219514 pfam07686, V-set, Immunoglobulin V-set domain.  This domain is
           found in antibodies as well as neural protein P0 and
           CTL4 amongst others.
          Length = 114

 Score = 29.1 bits (65), Expect = 2.8
 Identities = 13/50 (26%), Positives = 20/50 (40%), Gaps = 3/50 (6%)

Query: 470 KDRFKIVNDDKSTK--LKVFDSKRGDSGIYTLAVKNS-WGTDKGTAKVTV 516
           K R  +  +       L + + +  DSG YT AV N          ++TV
Sbjct: 65  KGRVTLSGNGSKNDFSLTISNLRLSDSGTYTCAVSNPNELVFGAGTRLTV 114


>gnl|CDD|219740 pfam08192, Peptidase_S64, Peptidase family S64.  This family of
           fungal proteins is involved in the processing of
           membrane bound transcription factor Stp1. The processing
           causes the signalling domain of Stp1 to be passed to the
           nucleus where several permease genes are induced. The
           permeases are important for uptake of amino acids, and
           processing of tp1 only occurs in an amino acid-rich
           environment. This family is predicted to be distantly
           related to the trypsin family (MEROPS:S1) and to have a
           typical trypsin-like catalytic triad.
          Length = 644

 Score = 30.9 bits (70), Expect = 2.9
 Identities = 19/67 (28%), Positives = 30/67 (44%), Gaps = 9/67 (13%)

Query: 629 SNYLQELHSSLMPISLESQYLMND--------KRVTIDNVDYFTK-ITIRPLQRSDTAQY 679
           SN L+ +   L  +  +  Y+M+             I+ ++ F K I+  P   +DT QY
Sbjct: 110 SNELEYVVDELKQLYEDLAYIMDQIHNSVTNLSTAVINAIECFKKFISFLPTITADTIQY 169

Query: 680 TVTATNS 686
            VT  NS
Sbjct: 170 DVTTENS 176


>gnl|CDD|225828 COG3291, COG3291, FOG: PKD repeat [General function prediction
           only].
          Length = 297

 Score = 30.2 bits (68), Expect = 3.1
 Identities = 29/140 (20%), Positives = 41/140 (29%), Gaps = 19/140 (13%)

Query: 493 DSGIYT--LAVKNSWGTDKG--TAKVTVLGCSLKWNPPEDDGGAP------IEYYMVEKM 542
           D+G YT  L V NS G+D    T  VTV    ++   PE                     
Sbjct: 134 DAGTYTVTLTVSNSTGSDSKTKTDYVTVSEEGIEEAVPEAASTVVTKPLTVSGTESSSGN 193

Query: 543 ETDTGKVLIKWIRTNDKR--VTIDNVDYFTKITIRPLQ-----RSDTAQYTVTATNSQGK 595
            +    V      TN       +  V   T  +  P         +   Y +T T +   
Sbjct: 194 LSSWVYVFEDDKGTNSTVKTPLLGGVIKVTLGSPLPDTVVYPTDKEGKGYYITLTGNGEF 253

Query: 596 DQVFIEVVVTDKPSAPEGPI 615
              F++VV   K        
Sbjct: 254 --SFVDVVAYVKNGDWSENN 271


>gnl|CDD|217059 pfam02480, Herpes_gE, Alphaherpesvirus glycoprotein E.
           Glycoprotein E (gE) of Alphaherpesvirus forms a complex
           with glycoprotein I (gI) (pfam01688), functioning as an
           immunoglobulin G (IgG) Fc binding protein. gE is
           involved in virus spread but is not essential for
           propagation.
          Length = 437

 Score = 30.5 bits (69), Expect = 3.8
 Identities = 16/61 (26%), Positives = 26/61 (42%), Gaps = 7/61 (11%)

Query: 473 FKIVNDDKSTKLKVFDSKRGDSGIYTLAVKNSWGTDKGTAKVTVLGCSLKWNPPEDDGGA 532
           F  +  +K   L + ++   DSG+YTL V+     D G A  + +   +    P  D   
Sbjct: 81  FPRLTANKGG-LSILNATEQDSGVYTLYVRG----DPGEAHQSAV--VVTVVGPAPDPRT 133

Query: 533 P 533
           P
Sbjct: 134 P 134


>gnl|CDD|143176 cd04975, Ig4_SCFR_like, Fourth immunoglobulin (Ig)-like domain of
           stem cell factor receptor (SCFR) and similar proteins.
           Ig4_SCFR_like; fourth immunoglobulin (Ig)-like domain of
           stem cell factor receptor (SCFR). In addition to SCFR
           this group also includes the fourth Ig domain of
           platelet-derived growth factor receptors (PDGFR), alpha
           and beta, the fourth Ig domain of macrophage colony
           stimulating factor (M-CSF), and the Ig domain of the
           receptor tyrosine kinase KIT. SCFR and the PDGFR alpha
           and beta have similar organization: an extracellular
           component having five Ig-like domains, a transmembrane
           segment, and a cytoplasmic portion having protein
           tyrosine kinase activity. SCFR and its ligand SCF are
           critical for normal hematopoiesis, mast cell
           development, melanocytes and gametogenesis. SCF binds to
           the second and third Ig-like domains of SCFR, this
           fourth Ig-like domain participates in SCFR dimerization,
           which follows ligand binding. Deletion of this fourth
           SCFR_Ig-like domain abolishes the ligand-induced
           dimerization of SCFR and completely inhibits signal
           transduction. PDGF is a potent mitogen for connective
           tissue cells. PDGF-stimulated processes are mediated by
           three different PDGFs (PDGF-A,-B, and C). PDGFR alpha
           binds to all three PDGFs, whereas the PDGFR beta, binds
           only to PDGF-B. In mice, PDGFR alpha, and PDGFR beta,
           are essential for normal development.
          Length = 101

 Score = 28.4 bits (64), Expect = 4.2
 Identities = 19/75 (25%), Positives = 34/75 (45%), Gaps = 6/75 (8%)

Query: 436 IKVRAGSNFEFDINVIGEP-IPTKEWLCNDITIISKDRFKIVNDDKS-----TKLKVFDS 489
           I V  G N    + V   P  P   W  ++ T+ +K    + ++++S     ++LK+   
Sbjct: 13  IFVNLGENLNLVVEVEAYPPPPHINWTYDNRTLTNKLTEIVTSENESEYRYVSELKLVRL 72

Query: 490 KRGDSGIYTLAVKNS 504
           K  ++G YT    NS
Sbjct: 73  KESEAGTYTFLASNS 87


>gnl|CDD|143265 cd05857, Ig2_FGFR, Second immunoglobulin (Ig)-like domain of
           fibroblast growth factor (FGF) receptor.  Ig2_FGFR:
           second immunoglobulin (Ig)-like domain of fibroblast
           growth factor (FGF) receptor. FGF receptors bind FGF
           signaling polypeptides. FGFs participate in multiple
           processes such as morphogenesis, development, and
           angiogenesis. FGFs bind to four FGF receptor tyrosine
           kinases (FGFR1, -2, -3, -4). Receptor diversity is
           controlled by alternative splicing producing splice
           variants with different ligand binding characteristics
           and different expression patterns. FGFRs have an
           extracellular region comprised of three IG-like domains,
           a single transmembrane helix, and an intracellular
           tyrosine kinase domain. Ligand binding and specificity
           reside in the Ig-like domains 2 and 3, and the linker
           region that connects these two. FGFR activation and
           signaling depend on FGF-induced dimerization, a process
           involving cell surface heparin or heparin sulfate
           proteoglycans.
          Length = 85

 Score = 27.9 bits (62), Expect = 4.2
 Identities = 25/83 (30%), Positives = 36/83 (43%), Gaps = 7/83 (8%)

Query: 438 VRAGSNFEFDINVIGEPIPTKEWLCNDITIISKDR---FKIVNDDKSTKLK-VFDSKRGD 493
           V A +  +F     G P PT  WL N      + R   +K+ N   S  ++ V  S   D
Sbjct: 6   VPAANTVKFRCPAAGNPTPTMRWLKNGKEFKQEHRIGGYKVRNQHWSLIMESVVPS---D 62

Query: 494 SGIYTLAVKNSWGTDKGTAKVTV 516
            G YT  V+N +G+   T  + V
Sbjct: 63  KGNYTCVVENEYGSINHTYHLDV 85


>gnl|CDD|143208 cd05731, Ig3_L1-CAM_like, Third immunoglobulin (Ig)-like domain of
           the L1 cell adhesion molecule (CAM).  Ig3_L1-CAM_like:
           domain similar to the third immunoglobulin (Ig)-like
           domain of the L1 cell adhesion molecule (CAM). L1
           belongs to the L1 subfamily of cell adhesion molecules
           (CAMs) and is comprised of an extracellular region
           having six Ig-like domains and five fibronectin type III
           domains, a transmembrane region and an intracellular
           domain. L1 is primarily expressed in the nervous system
           and is involved in its development and function. L1 is
           associated with an X-linked recessive disorder, X-linked
           hydrocephalus, MASA syndrome, or spastic paraplegia type
           1, that involves abnormalities of axonal growth. This
           group also contains the chicken neuron-glia cell
           adhesion molecule, Ng-CAM and human neurofascin.
          Length = 71

 Score = 27.7 bits (62), Expect = 4.5
 Identities = 21/66 (31%), Positives = 29/66 (43%), Gaps = 5/66 (7%)

Query: 452 GEPIPTKEWLCNDITIISKDRFKIVNDDKSTKL-KVFDSKRGDSGIYTLAVKNSWGTDKG 510
           G P P   W+      +  DR K  N +K+ K+  V +    D G Y     NS G+ + 
Sbjct: 9   GLPTPEISWIKIG-GELPADRTKFENFNKTLKIDNVSEE---DDGEYRCTASNSLGSARH 64

Query: 511 TAKVTV 516
           T  VTV
Sbjct: 65  TISVTV 70


>gnl|CDD|143277 cd05869, Ig5_NCAM-1, Fifth immunoglobulin (Ig)-like domain of
           Neural Cell Adhesion Molecule NCAM-1 (NCAM).
           Ig5_NCAM-1: The fifth immunoglobulin (Ig)-like domain of
           Neural Cell Adhesion Molecule NCAM-1 (NCAM). NCAM plays
           important roles in the development and regeneration of
           the central nervous system, in synaptogenesis and neural
           migration. NCAM mediates cell-cell and cell-substratum
           recognition and adhesion via homophilic (NCAM-NCAM) and
           heterophilic (NCAM-non-NCAM) interactions. NCAM is
           expressed as three major isoforms having different
           intracellular extensions. The extracellular portion of
           NCAM has five N-terminal Ig-like domains and two
           fibronectin type III domains. The double zipper adhesion
           complex model for NCAM homophilic binding involves Ig1,
           Ig2, and Ig3. By this model, Ig1 and Ig2 mediate
           dimerization of NCAM molecules situated on the same cell
           surface (cis interactions), and Ig3 domains mediate
           interactions between NCAM molecules expressed on the
           surface of opposing cells (trans interactions), through
           binding to the Ig1 and Ig2 domains. The adhesive ability
           of NCAM is modulated by the addition of polysialic acid
           chains to the fifth Ig-like domain.
          Length = 97

 Score = 28.4 bits (63), Expect = 4.6
 Identities = 16/57 (28%), Positives = 33/57 (57%), Gaps = 4/57 (7%)

Query: 642 ISLESQYLMNDKRVTIDNVDYFTKITIRPLQRSDTAQYTVTATNSQGKD--QVFIEV 696
           IS E + L  D  + + +    + +T++ +Q +D  +Y  TA+N+ G+D   +++EV
Sbjct: 43  ISSEEKTL--DGHIVVRSHARVSSLTLKYIQYTDAGEYLCTASNTIGQDSQSMYLEV 97



 Score = 27.6 bits (61), Expect = 7.0
 Identities = 13/50 (26%), Positives = 30/50 (60%), Gaps = 2/50 (4%)

Query: 555 RTNDKRVTIDNVDYFTKITIRPLQRSDTAQYTVTATNSQGKD--QVFIEV 602
           +T D  + + +    + +T++ +Q +D  +Y  TA+N+ G+D   +++EV
Sbjct: 48  KTLDGHIVVRSHARVSSLTLKYIQYTDAGEYLCTASNTIGQDSQSMYLEV 97


>gnl|CDD|143303 cd05895, Ig_Pro_neuregulin-1, Immunoglobulin (Ig)-like domain found
           in neuregulin (NRG)-1.  Ig_Pro_neuregulin-1:
           immunoglobulin (Ig)-like domain found in neuregulin
           (NRG)-1. There are many NRG-1 isoforms which arise from
           the alternative splicing of mRNA. NRG-1 belongs to the
           neuregulin gene family, which is comprised of four
           genes. This group represents NRG-1. NRGs are signaling
           molecules, which participate in cell-cell interactions
           in the nervous system, breast, and heart, and other
           organ systems, and are implicated in the pathology of
           diseases including schizophrenia, multiple sclerosis,
           and breast cancer. The NRG-1 protein binds to and
           activates the tyrosine kinases receptors ErbB3 and
           ErbB4, initiating signaling cascades. NRG-1 has multiple
           functions; for example, in the brain it regulates
           various processes such as radial glia formation and
           neuronal migration, dendritic development, and
           expression of neurotransmitters receptors; in the
           peripheral nervous system NRG-1 regulates processes such
           as target cell differentiation, and Schwann cell
           survival.
          Length = 76

 Score = 27.7 bits (61), Expect = 4.7
 Identities = 18/60 (30%), Positives = 29/60 (48%), Gaps = 4/60 (6%)

Query: 459 EWLCNDITIISKDR----FKIVNDDKSTKLKVFDSKRGDSGIYTLAVKNSWGTDKGTAKV 514
           +W  N   I +K++     KI    KS++L++  +   D+G Y   V +  G D  TA V
Sbjct: 17  KWFKNGKEIGAKNKPDNKIKIRKKKKSSELQISKASLADNGEYKCMVSSKLGNDSVTANV 76


>gnl|CDD|220480 pfam09937, DUF2169, Uncharacterized protein conserved in bacteria
           (DUF2169).  This domain, found in various hypothetical
           prokaryotic proteins, has no known function.
          Length = 298

 Score = 29.6 bits (67), Expect = 5.7
 Identities = 10/30 (33%), Positives = 17/30 (56%)

Query: 219 VLDKPSPPGGPLKVSNVHAEGVTLDWKVPD 248
            L  P P G P+++  +H EG  L +++P 
Sbjct: 230 QLPGPLPGGEPVELKGLHPEGRELSFRLPR 259


>gnl|CDD|143273 cd05865, Ig1_NCAM-1, First immunoglobulin (Ig)-like domain of
           neural cell adhesion molecule NCAM-1.  Ig1_NCAM-1: first
           immunoglobulin (Ig)-like domain of neural cell adhesion
           molecule NCAM-1. NCAM-1 plays important roles in the
           development and regeneration of the central nervous
           system, in synaptogenesis and neural migration. NCAM
           mediates cell-cell and cell-substratum recognition and
           adhesion via homophilic (NCAM-NCAM), and heterophilic
           (NCAM-nonNCAM), interactions. NCAM is expressed as three
           major isoforms having different intracellular
           extensions. The extracellular portion of NCAM has five
           N-terminal Ig-like domains and two fibronectin type III
           domains. The double zipper adhesion complex model for
           NCAM homophilic binding involves the Ig1, Ig2, and Ig3
           domains. By this model, Ig1 and Ig2 mediate dimerization
           of NCAM molecules situated on the same cell surface (cis
           interactions), and Ig3 domains mediate interactions
           between NCAM molecules expressed on the surface of
           opposing cells (trans interactions), through binding to
           the Ig1 and Ig2 domains. The adhesive ability of NCAM is
           modulated by the addition of polysialic acid chains to
           the fifth Ig-like domain.
          Length = 96

 Score = 27.7 bits (61), Expect = 6.8
 Identities = 13/41 (31%), Positives = 22/41 (53%), Gaps = 1/41 (2%)

Query: 477 NDDKSTKLKVFDSKRGDSGIYTLAVKNSWGTD-KGTAKVTV 516
           NDD S+ L ++++   D+GIY   V N    + + T  V +
Sbjct: 55  NDDYSSTLTIYNANIDDAGIYKCVVSNEDEGESEATVNVKI 95


>gnl|CDD|143207 cd05730, Ig3_NCAM-1_like, Third immunoglobulin (Ig)-like domain of
           Neural Cell Adhesion Molecule NCAM-1 (NCAM).
           Ig3_NCAM-1_like: domain similar to the third
           immunoglobulin (Ig)-like domain of Neural Cell Adhesion
           Molecule NCAM-1 (NCAM). NCAM plays important roles in
           the development and regeneration of the central nervous
           system, in synaptogenesis and neural migration. NCAM
           mediates cell-cell and cell-substratum recognition and
           adhesion via homophilic (NCAM-NCAM), and heterophilic
           (NCAM-non-NCAM), interactions. NCAM is expressed as
           three major isoforms having different intracellular
           extensions. The extracellular portion of NCAM has five
           N-terminal Ig-like domains and two fibronectin type III
           domains. The double zipper adhesion complex model for
           NCAM homophilic binding involves Ig1, Ig2, and Ig3. By
           this model, Ig1,and Ig2 mediate dimerization of NCAM
           molecules situated on the same cell surface (cis
           interactions), and Ig3 domains mediate interactions
           between NCAM molecules expressed on the surface of
           opposing cells (trans interactions), through binding to
           the Ig1 and Ig2 domains. The adhesive ability of NCAM is
           modulated by the addition of polysialic acid chains to
           the fifth Ig-like domain.
          Length = 95

 Score = 27.6 bits (61), Expect = 7.2
 Identities = 19/79 (24%), Positives = 32/79 (40%), Gaps = 5/79 (6%)

Query: 432 QLSDIKVRAGSNFEFDINVI----GEPIPTKEWLCNDITIISKDRFKIVNDDKSTKLKVF 487
           +    +V A +N    + +     G P PT  W  +   I S +     N+D S ++ + 
Sbjct: 5   RARQSEVNATANLGQSVTLACDADGFPEPTMTWTKDGEPIESGEEKYSFNEDGS-EMTIL 63

Query: 488 DSKRGDSGIYTLAVKNSWG 506
           D  + D   YT   +N  G
Sbjct: 64  DVDKLDEAEYTCIAENKAG 82


>gnl|CDD|165267 PHA02961, PHA02961, hypothetical protein; Provisional.
          Length = 658

 Score = 29.5 bits (66), Expect = 7.3
 Identities = 21/93 (22%), Positives = 33/93 (35%), Gaps = 13/93 (13%)

Query: 479 DKSTKLKVFDSKRGDSGIYTLAVKNSWGTDKGTAKVTVLGCSLKWNPPEDDGGAPIEYYM 538
           D    +  FD+   D G+  LA        KG   +  L  +  W+P           Y+
Sbjct: 434 DYMCDMSEFDNNISDIGLGKLASFLCNAAKKGIIDINFLKTNCLWSPL---------MYL 484

Query: 539 VEKMETDTGKVLIKWIRTNDKRVTIDNVDYFTK 571
           ++    D+ KV         K +  DN+ Y  K
Sbjct: 485 ID----DSCKVDFSRFMMATKNIKADNIKYLKK 513


>gnl|CDD|180905 PRK07246, PRK07246, bifunctional ATP-dependent DNA helicase/DNA
           polymerase III subunit epsilon; Validated.
          Length = 820

 Score = 29.7 bits (67), Expect = 7.8
 Identities = 10/30 (33%), Positives = 17/30 (56%)

Query: 204 AENINGKDSVEVEVIVLDKPSPPGGPLKVS 233
           A+  +  D ++V+ IVL K +    P K+S
Sbjct: 203 AKPYSSPDYIKVQGIVLKKTAASLKPRKLS 232


>gnl|CDD|223373 COG0296, GlgB, 1,4-alpha-glucan branching enzyme [Carbohydrate
           transport and metabolism].
          Length = 628

 Score = 29.2 bits (66), Expect = 8.4
 Identities = 16/64 (25%), Positives = 24/64 (37%), Gaps = 19/64 (29%)

Query: 95  DPETGVWIPVGKTREPEMDVTGLTPGHEYKFRVKALNKEGESEPLETFSSIIARDPYSKY 154
             E+G+W         E+ V G  PG  YK+ +        S  L   +     DPY++ 
Sbjct: 69  RKESGIW---------ELFVPGAPPGTRYKYELI-----DPSGQLRLKA-----DPYARR 109

Query: 155 TSVS 158
             V 
Sbjct: 110 QEVG 113


>gnl|CDD|182883 PRK10985, PRK10985, putative hydrolase; Provisional.
          Length = 324

 Score = 29.2 bits (66), Expect = 8.5
 Identities = 10/22 (45%), Positives = 13/22 (59%), Gaps = 1/22 (4%)

Query: 337 DTDFVELAWT-PPEQNGGSPIV 357
           D DFV+LAW+  P Q    P +
Sbjct: 40  DGDFVDLAWSEDPAQARHKPRL 61


>gnl|CDD|187570 cd05260, GDP_MD_SDR_e, GDP-mannose 4,6 dehydratase, extended (e)
           SDRs.  GDP-mannose 4,6 dehydratase, a homodimeric SDR,
           catalyzes the NADP(H)-dependent conversion of
           GDP-(D)-mannose to GDP-4-keto, 6-deoxy-(D)-mannose in
           the fucose biosynthesis pathway. These proteins have the
           canonical active site triad and NAD-binding pattern,
           however the active site Asn is often missing and may be
           substituted with Asp. A Glu residue has been identified
           as an important active site base. Extended SDRs are
           distinct from classical SDRs. In addition to the
           Rossmann fold (alpha/beta folding pattern with a central
           beta-sheet) core region typical of all SDRs, extended
           SDRs have a less conserved C-terminal extension of
           approximately 100 amino acids. Extended SDRs are a
           diverse collection of proteins, and include isomerases,
           epimerases, oxidoreductases, and lyases; they typically
           have a TGXXGXXG cofactor binding motif. SDRs are a
           functionally diverse family of oxidoreductases that have
           a single domain with a structurally conserved Rossmann
           fold, an NAD(P)(H)-binding region, and a structurally
           diverse C-terminal region. Sequence identity between
           different SDR enzymes is typically in the 15-30% range;
           they catalyze a wide range of activities including the
           metabolism of steroids, cofactors, carbohydrates,
           lipids, aromatic compounds, and amino acids, and act in
           redox sensing. Classical SDRs have an TGXXX[AG]XG
           cofactor binding motif and a YXXXK active site motif,
           with the Tyr residue of the active site motif serving as
           a critical catalytic residue (Tyr-151, human
           15-hydroxyprostaglandin dehydrogenase numbering). In
           addition to the Tyr and Lys, there is often an upstream
           Ser and/or an Asn, contributing to the active site;
           while substrate binding is in the C-terminal region,
           which determines specificity. The standard reaction
           mechanism is a 4-pro-S hydride transfer and proton relay
           involving the conserved Tyr and Lys, a water molecule
           stabilized by Asn, and nicotinamide. Atypical SDRs
           generally lack the catalytic residues characteristic of
           the SDRs, and their glycine-rich NAD(P)-binding motif is
           often different from the forms normally seen in
           classical or extended SDRs. Complex (multidomain) SDRs
           such as ketoreductase domains of fatty acid synthase
           have a GGXGXXG NAD(P)-binding motif and an altered
           active site motif (YXXXN). Fungal type ketoacyl
           reductases have a TGXXXGX(1-2)G NAD(P)-binding motif.
          Length = 316

 Score = 29.1 bits (66), Expect = 8.8
 Identities = 20/65 (30%), Positives = 29/65 (44%), Gaps = 8/65 (12%)

Query: 463 NDITIISKDRFKIVN---DDKSTKLKVFDSKRGDSGIYTLA----VKNSWGTDKGTAKVT 515
            D   I+KDR  +      D S+  +  +  R D  IY LA    VK S+   + TA+V 
Sbjct: 41  IDHLYINKDRITLHYGDLTDSSSLRRAIEKVRPDE-IYHLAAQSHVKVSFDDPEYTAEVN 99

Query: 516 VLGCS 520
            +G  
Sbjct: 100 AVGTL 104


>gnl|CDD|143177 cd04976, Ig2_VEGFR, Second immunoglobulin (Ig)-like domain of
           vascular endothelial growth factor receptor (VEGFR).
           Ig2_VEGFR: Second immunoglobulin (Ig)-like domain of
           vascular endothelial growth factor receptor (VEGFR). The
           VEGFRs have an extracellular component with seven
           Ig-like domains, a transmembrane segment, and an
           intracellular tyrosine kinase domain interrupted by a
           kinase-insert domain. The VEGFR family consists of three
           members, VEGFR-1 (Flt-1), VEGFR-2 (KDR/Flk-1) and
           VEGFR-3 (Flt-4). VEGFRs bind VEGFs with high affinity at
           the Ig-like domains. VEGF-A is important to the growth
           and maintenance of vascular endothelial cells and to the
           development of new blood- and lymphatic-vessels in
           physiological and pathological states. VEGFR-2 is a
           major mediator of the mitogenic, angiogenic and
           microvascular permeability-enhancing effects of VEGF-A.
           VEGFR-1 may play an inhibitory part in these processes
           by binding VEGF and interfering with its interaction
           with VEGFR-2. VEGFR-1 has a signaling role in mediating
           monocyte chemotaxis. VEGFR-2 and -1 may mediate a
           chemotactic and a survival signal in hematopoietic stem
           cells or leukemia cells. VEGFR-3 has been shown to be
           involved in tumor angiogenesis and growth.
          Length = 71

 Score = 26.6 bits (59), Expect = 8.9
 Identities = 16/55 (29%), Positives = 22/55 (40%), Gaps = 4/55 (7%)

Query: 450 VIGEPIPTKEWLCNDITIISKDRFKIVNDDKSTKLKVFDSKRGDSGIYTLAVKNS 504
           V   P P  +W  N   I  K+R K         L + D    D+G YT+ + N 
Sbjct: 7   VKAYPPPEIQWYKNGKLISEKNRTKK----SGHSLTIKDVTEEDAGNYTVVLTNK 57


>gnl|CDD|143257 cd05849, Ig1_Contactin-1, First Ig domain of contactin-1.
           Ig1_Contactin-1: First Ig domain of the neural cell
           adhesion molecule contactin-1. Contactins are comprised
           of six Ig domains followed by four fibronectin type III
           (FnIII) domains anchored to the membrane by
           glycosylphosphatidylinositol. Contactin-1 is
           differentially expressed in tumor tissues and may,
           through a RhoA mechanism, facilitate invasion and
           metastasis of human lung adenocarcinoma.
          Length = 93

 Score = 27.2 bits (60), Expect = 8.9
 Identities = 17/64 (26%), Positives = 26/64 (40%), Gaps = 18/64 (28%)

Query: 452 GEPIPTKEWLCNDITI-ISKDRFKIV-------NDDKSTKLKVFDSKRGDSGIYTLAVKN 503
             P P  +W  N++ I ++ DR+ +V       N DK            D+G Y   V N
Sbjct: 30  ANPFPIYKWRKNNLDIDLTNDRYSMVGGNLVINNPDKYK----------DAGRYVCIVSN 79

Query: 504 SWGT 507
            +G 
Sbjct: 80  IYGK 83


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.312    0.134    0.400 

Gapped
Lambda     K      H
   0.267   0.0802    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 36,578,079
Number of extensions: 3587709
Number of successful extensions: 2292
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2267
Number of HSP's successfully gapped: 112
Length of query: 716
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 612
Effective length of database: 6,324,786
Effective search space: 3870769032
Effective search space used: 3870769032
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 63 (27.9 bits)