RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy1672
         (600 letters)



>gnl|CDD|214653 smart00410, IG_like, Immunoglobulin like.  IG domains that cannot
           be classified into one of IGv1, IGc1, IGc2, IG.
          Length = 85

 Score = 69.5 bits (170), Expect = 9e-15
 Identities = 28/93 (30%), Positives = 39/93 (41%), Gaps = 8/93 (8%)

Query: 261 SRYVEAVSSENATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSS 320
              V     E+ T+ C     PP  ++WY  G  LL  +      R  V   G     S+
Sbjct: 1   PPSVTVKEGESVTLSCEASGSPPPEVTWYKQGGKLLAES-----GRFSVSRSG---STST 52

Query: 321 LVLTNAQESDSGRFYCVAENRAGIADANFTLQV 353
           L ++N    DSG + C A N +G A +  TL V
Sbjct: 53  LTISNVTPEDSGTYTCAATNSSGSASSGTTLTV 85


>gnl|CDD|214652 smart00409, IG, Immunoglobulin. 
          Length = 85

 Score = 69.5 bits (170), Expect = 9e-15
 Identities = 28/93 (30%), Positives = 39/93 (41%), Gaps = 8/93 (8%)

Query: 261 SRYVEAVSSENATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSS 320
              V     E+ T+ C     PP  ++WY  G  LL  +      R  V   G     S+
Sbjct: 1   PPSVTVKEGESVTLSCEASGSPPPEVTWYKQGGKLLAES-----GRFSVSRSG---STST 52

Query: 321 LVLTNAQESDSGRFYCVAENRAGIADANFTLQV 353
           L ++N    DSG + C A N +G A +  TL V
Sbjct: 53  LTISNVTPEDSGTYTCAATNSSGSASSGTTLTV 85


>gnl|CDD|143165 cd00096, Ig, Immunoglobulin domain.  Ig: immunoglobulin (Ig) domain
           found in the Ig superfamily. The Ig superfamily is a
           heterogenous group of proteins, built on a common fold
           comprised of a sandwich of two beta sheets. Members of
           this group are components of immunoglobulin, neuroglia,
           cell surface glycoproteins, such as, T-cell receptors,
           CD2, CD4, CD8, and membrane glycoproteins, such as,
           butyrophilin and chondroitin sulfate proteoglycan core
           protein. A predominant feature of most Ig domains is a
           disulfide bridge connecting the two beta-sheets with a
           tryptophan residue packed against the disulfide bond.
          Length = 74

 Score = 64.4 bits (156), Expect = 4e-13
 Identities = 24/79 (30%), Positives = 34/79 (43%), Gaps = 5/79 (6%)

Query: 272 ATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQESDS 331
            T+ C     PP  I+W  NG+ L ++              G     S+L ++N    DS
Sbjct: 1   VTLTCLASGPPPPTITWLKNGKPLPSSVLTRVRSSR-----GTSSGSSTLTISNVTLEDS 55

Query: 332 GRFYCVAENRAGIADANFT 350
           G + CVA N AG   A+ T
Sbjct: 56  GTYTCVASNSAGTVSASVT 74


>gnl|CDD|206026 pfam13855, LRR_8, Leucine rich repeat. 
          Length = 60

 Score = 59.9 bits (146), Expect = 1e-11
 Identities = 25/60 (41%), Positives = 33/60 (55%)

Query: 76  NLQKLFLARCHIGQIDSGALDGLTNLIEIDLSDNLLTSIPSLTFQSVRFLRDLNLARNPI 135
           NL+ L L+   +  I  GA  GL NL  +DLS N LTSI    F  +  LR L+L+ N +
Sbjct: 1   NLKSLDLSNNRLTVIPDGAFKGLPNLKVLDLSGNNLTSISPEAFSGLPSLRSLDLSGNNL 60



 Score = 58.7 bits (143), Expect = 3e-11
 Identities = 28/59 (47%), Positives = 40/59 (67%)

Query: 125 LRDLNLARNPISKIEKGAFQFVPGLVKLDMSESRLEHISPEAFTGAKSLESIKLNGNRL 183
           L+ L+L+ N ++ I  GAF+ +P L  LD+S + L  ISPEAF+G  SL S+ L+GN L
Sbjct: 2   LKSLDLSNNRLTVIPDGAFKGLPNLKVLDLSGNNLTSISPEAFSGLPSLRSLDLSGNNL 60



 Score = 54.1 bits (131), Expect = 1e-09
 Identities = 24/60 (40%), Positives = 35/60 (58%), Gaps = 2/60 (3%)

Query: 52  QVLDMSGNNLQILPKEAFRRAGLLNLQKLFLARCHIGQIDSGALDGLTNLIEIDLSDNLL 111
           + LD+S N L ++P  AF+  GL NL+ L L+  ++  I   A  GL +L  +DLS N L
Sbjct: 3   KSLDLSNNRLTVIPDGAFK--GLPNLKVLDLSGNNLTSISPEAFSGLPSLRSLDLSGNNL 60



 Score = 50.6 bits (122), Expect = 2e-08
 Identities = 24/60 (40%), Positives = 36/60 (60%)

Query: 100 NLIEIDLSDNLLTSIPSLTFQSVRFLRDLNLARNPISKIEKGAFQFVPGLVKLDMSESRL 159
           NL  +DLS+N LT IP   F+ +  L+ L+L+ N ++ I   AF  +P L  LD+S + L
Sbjct: 1   NLKSLDLSNNRLTVIPDGAFKGLPNLKVLDLSGNNLTSISPEAFSGLPSLRSLDLSGNNL 60



 Score = 45.2 bits (108), Expect = 1e-06
 Identities = 19/59 (32%), Positives = 29/59 (49%)

Query: 148 GLVKLDMSESRLEHISPEAFTGAKSLESIKLNGNRLSHFPVRSVEPLLKLMMIELHDNP 206
            L  LD+S +RL  I   AF G  +L+ + L+GN L+     +   L  L  ++L  N 
Sbjct: 1   NLKSLDLSNNRLTVIPDGAFKGLPNLKVLDLSGNNLTSISPEAFSGLPSLRSLDLSGNN 59


>gnl|CDD|214507 smart00082, LRRCT, Leucine rich repeat C-terminal domain. 
          Length = 51

 Score = 52.8 bits (127), Expect = 3e-09
 Identities = 16/52 (30%), Positives = 26/52 (50%), Gaps = 3/52 (5%)

Query: 205 NPWVCDCNMRSIKMWLADKKNVPVQPA--CTGPERLSGKVFSDLHADDFACK 254
           NP++CDC +R +  WL   +++       C  P  L G +   LH+ +F C 
Sbjct: 1   NPFICDCELRWLLRWLQANEHLQDPVDLRCASPSSLRGPLLELLHS-EFKCP 51


>gnl|CDD|191810 pfam07679, I-set, Immunoglobulin I-set domain. 
          Length = 90

 Score = 53.8 bits (130), Expect = 4e-09
 Identities = 27/87 (31%), Positives = 41/87 (47%), Gaps = 11/87 (12%)

Query: 269 SENATVV--CRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNA 326
            E  +    C V   P   +SW+ +G+ L ++  F         E G Y    +L ++N 
Sbjct: 13  QEGESARFTCTVTGDPDPTVSWFKDGQPLRSSDRFK-----VTYEGGTY----TLTISNV 63

Query: 327 QESDSGRFYCVAENRAGIADANFTLQV 353
           Q  D G++ CVA N AG A+A+  L V
Sbjct: 64  QPDDEGKYTCVATNSAGEAEASAELTV 90


>gnl|CDD|227223 COG4886, COG4886, Leucine-rich repeat (LRR) protein [Function
           unknown].
          Length = 394

 Score = 54.6 bits (131), Expect = 7e-08
 Identities = 39/147 (26%), Positives = 66/147 (44%), Gaps = 4/147 (2%)

Query: 74  LLNLQKLFLARCHIGQIDSGALDGLTNLIEIDLSDNLLTSIPSLTFQSVRFLRDLNLARN 133
           LL L  L L    +   +   L  LTNL  +DL +N +T IP L       L++L+L+ N
Sbjct: 92  LLPLPSLDLNLNRLR-SNISELLELTNLTSLDLDNNNITDIPPLIGLLKSNLKELDLSDN 150

Query: 134 PISKIEKGAFQFVPGLVKLDMSESRLEHISPEAFTGAKSLESIKLNGNRLSHFPVRSVEP 193
            I  +     + +P L  LD+S + L  +     +   +L ++ L+GN++S  P      
Sbjct: 151 KIESLP-SPLRNLPNLKNLDLSFNDLSDLPKLL-SNLSNLNNLDLSGNKISDLPPEIELL 208

Query: 194 LLKLMMIELHDNPWVCDCNMRSIKMWL 220
              L  ++L +N  +   +  S    L
Sbjct: 209 -SALEELDLSNNSIIELLSSLSNLKNL 234



 Score = 44.2 bits (104), Expect = 1e-04
 Identities = 38/136 (27%), Positives = 68/136 (50%), Gaps = 8/136 (5%)

Query: 52  QVLDMSGNNLQILPKEAFRRAGLLNLQKLFLARCHIGQIDSGALDGLTNLIEIDLSDNLL 111
           + LD+S N+L  LPK     + L NL  L L+   I  +    ++ L+ L E+DLS+N +
Sbjct: 166 KNLDLSFNDLSDLPKL---LSNLSNLNNLDLSGNKISDL-PPEIELLSALEELDLSNNSI 221

Query: 112 TSIPSLTFQSVRFLRDLNLARNPISKIEKGAFQFVPGLVKLDMSESRLEHISPEAFTGAK 171
             + S +  +++ L  L L+ N +  + +     +  L  LD+S +++  IS        
Sbjct: 222 IELLS-SLSNLKNLSGLELSNNKLEDLPES-IGNLSNLETLDLSNNQISSISSLG--SLT 277

Query: 172 SLESIKLNGNRLSHFP 187
           +L  + L+GN LS+  
Sbjct: 278 NLRELDLSGNSLSNAL 293



 Score = 38.8 bits (90), Expect = 0.006
 Identities = 38/137 (27%), Positives = 60/137 (43%), Gaps = 7/137 (5%)

Query: 43  PEAPESELTQVLDMSGNNLQILPKEAFRRAGLLNLQKLFLARCHIGQIDSGALDGLTNLI 102
                      LD+SGN +  LP E      L  L++L L+   I ++ S +L  L NL 
Sbjct: 180 KLLSNLSNLNNLDLSGNKISDLPPEI---ELLSALEELDLSNNSIIELLS-SLSNLKNLS 235

Query: 103 EIDLSDNLLTSIPSLTFQSVRFLRDLNLARNPISKIEKGAFQFVPGLVKLDMSESRLEHI 162
            ++LS+N L  +P     ++  L  L+L+ N IS I       +  L +LD+S + L + 
Sbjct: 236 GLELSNNKLEDLPES-IGNLSNLETLDLSNNQISSISSLG--SLTNLRELDLSGNSLSNA 292

Query: 163 SPEAFTGAKSLESIKLN 179
            P        LE +   
Sbjct: 293 LPLIALLLLLLELLLNL 309


>gnl|CDD|197706 smart00408, IGc2, Immunoglobulin C-2 Type. 
          Length = 63

 Score = 46.2 bits (110), Expect = 7e-07
 Identities = 18/74 (24%), Positives = 28/74 (37%), Gaps = 13/74 (17%)

Query: 270 ENATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQES 329
           ++ T+ C  +  P   I+W        +        R            S+L + +    
Sbjct: 3   QSVTLTCPAEGNPVPNITWL------KDGKPLPESNRF-------VASGSTLTIKSVSLE 49

Query: 330 DSGRFYCVAENRAG 343
           DSG + CVAEN AG
Sbjct: 50  DSGLYTCVAENSAG 63


>gnl|CDD|206066 pfam13895, Ig_2, Immunoglobulin domain.  This domain contains
           immunoglobulin-like domains.
          Length = 80

 Score = 45.9 bits (109), Expect = 1e-06
 Identities = 27/101 (26%), Positives = 34/101 (33%), Gaps = 22/101 (21%)

Query: 254 KPEIRMDSRYVEAVSSENATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQG 313
           KP +      V     E+ T+ C     PP   +WY       +    SS Q  F     
Sbjct: 1   KPVLTPSPTVVFE--GEDVTLTCSAPGNPPPNYTWY------KDGVPLSSSQNGFFT--- 49

Query: 314 EYERKSSLVLTNAQESDSGRFYCVAENRAGIADAN-FTLQV 353
                      N    DSG + CVA N  G   +N  TL V
Sbjct: 50  ----------PNVSAEDSGTYTCVASNGGGGKTSNPVTLTV 80


>gnl|CDD|143235 cd05758, Ig5_KIRREL3-like, Fifth immunoglobulin (Ig)-like domain of
           Kirrel (kin of irregular chiasm-like) 3 (also known as
           Neph2) and similar proteins.  Ig5_KIRREL3-like: domain
           similar to the fifth immunoglobulin (Ig)-like domain of
           Kirrel (kin of irregular chiasm-like) 3 (also known as
           Neph2). This protein has five Ig-like domains, one
           transmembrane domain, and a cytoplasmic tail. Included
           in this group is mammalian Kirrel (Neph1), Kirrel2
           (Neph3), and Drosophila RST (irregular chiasm
           C-roughest) protein. These proteins contain multiple Ig
           domains, have properties of cell adhesion molecules, and
           are important in organ development.
          Length = 98

 Score = 46.2 bits (110), Expect = 2e-06
 Identities = 28/102 (27%), Positives = 39/102 (38%), Gaps = 9/102 (8%)

Query: 255 PEIRMDSRYVEAVSSENATVVCRVDSIPPA-AISWYWNGRLLLNNTAFSSYQRIFVIEQG 313
           P I        A+  +   V C + S PP   I W W    L       S  R + +E  
Sbjct: 2   PPIITSEATQYAILGDKGRVECFIFSTPPPDRIVWTWKENEL----ESGSSGR-YTVETD 56

Query: 314 EYER--KSSLVLTNAQESD-SGRFYCVAENRAGIADANFTLQ 352
                  S+L ++N QESD    + C A N  G   A  +L+
Sbjct: 57  PSPGGVLSTLTISNTQESDFQTSYNCTAWNSFGSGTAIISLE 98


>gnl|CDD|143170 cd04969, Ig5_Contactin_like, Fifth Ig domain of contactin.
           Ig5_Contactin_like: Fifth Ig domain of contactins.
           Contactins are neural cell adhesion molecules and are
           comprised of six Ig domains followed by four fibronectin
           type III(FnIII) domains anchored to the membrane by
           glycosylphosphatidylinositol. The first four Ig domains
           form the intermolecular binding fragment, which arranges
           as a compact U-shaped module via contacts between Ig
           domains 1 and 4, and between Ig domains 2 and 3.
           Contactin-2 (TAG-1, axonin-1) may play a part in the
           neuronal processes of neurite outgrowth, axon guidance
           and fasciculation, and neuronal migration. This group
           also includes contactin-1 and contactin-5. The different
           contactins show different expression patterns in the
           central nervous system. During development and in
           adulthood, contactin-2 is transiently expressed in
           subsets of central and peripheral neurons. Contactin-5
           is expressed specifically in the rat postnatal nervous
           system, peaking at about 3 weeks postnatal, and a lack
           of contactin-5 (NB-2) results in an impairment of
           neuronal act ivity in the rat auditory system.
           Contactin-5 is highly expressed in the adult human brain
           in the occipital lobe and in the amygdala. Contactin-1
           is differentially expressed in tumor tissues and may,
           through a RhoA mechanism, facilitate invasion and
           metastasis of human lung adenocarcinoma.
          Length = 73

 Score = 45.1 bits (107), Expect = 2e-06
 Identities = 25/78 (32%), Positives = 37/78 (47%), Gaps = 12/78 (15%)

Query: 276 CRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQESDSGRFY 335
           C+  + P   ISW     LL N++      RI +   G      SL + N  +SD G++ 
Sbjct: 8   CKPKAAPKPTISWSKGTELLTNSS------RICIWPDG------SLEILNVTKSDEGKYT 55

Query: 336 CVAENRAGIADANFTLQV 353
           C AEN  G A++  +L V
Sbjct: 56  CFAENFFGKANSTGSLSV 73


>gnl|CDD|143207 cd05730, Ig3_NCAM-1_like, Third immunoglobulin (Ig)-like domain of
           Neural Cell Adhesion Molecule NCAM-1 (NCAM).
           Ig3_NCAM-1_like: domain similar to the third
           immunoglobulin (Ig)-like domain of Neural Cell Adhesion
           Molecule NCAM-1 (NCAM). NCAM plays important roles in
           the development and regeneration of the central nervous
           system, in synaptogenesis and neural migration. NCAM
           mediates cell-cell and cell-substratum recognition and
           adhesion via homophilic (NCAM-NCAM), and heterophilic
           (NCAM-non-NCAM), interactions. NCAM is expressed as
           three major isoforms having different intracellular
           extensions. The extracellular portion of NCAM has five
           N-terminal Ig-like domains and two fibronectin type III
           domains. The double zipper adhesion complex model for
           NCAM homophilic binding involves Ig1, Ig2, and Ig3. By
           this model, Ig1,and Ig2 mediate dimerization of NCAM
           molecules situated on the same cell surface (cis
           interactions), and Ig3 domains mediate interactions
           between NCAM molecules expressed on the surface of
           opposing cells (trans interactions), through binding to
           the Ig1 and Ig2 domains. The adhesive ability of NCAM is
           modulated by the addition of polysialic acid chains to
           the fifth Ig-like domain.
          Length = 95

 Score = 45.7 bits (108), Expect = 3e-06
 Identities = 27/106 (25%), Positives = 43/106 (40%), Gaps = 22/106 (20%)

Query: 255 PEIRMDSRYVEAVS--SENATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQ 312
           P IR     V A +   ++ T+ C  D  P   ++W  +G                 IE 
Sbjct: 2   PTIRARQSEVNATANLGQSVTLACDADGFPEPTMTWTKDGE---------------PIES 46

Query: 313 GE-----YERKSSLVLTNAQESDSGRFYCVAENRAGIADANFTLQV 353
           GE      E  S + + +  + D   + C+AEN+AG  +A   L+V
Sbjct: 47  GEEKYSFNEDGSEMTILDVDKLDEAEYTCIAENKAGEQEAEIHLKV 92


>gnl|CDD|143179 cd04978, Ig4_L1-NrCAM_like, Fourth immunoglobulin (Ig)-like domain
           of L1, Ng-CAM (Neuron-glia CAM cell adhesion molecule),
           and NrCAM (Ng-CAM-related).  Ig4_L1-NrCAM_like: fourth
           immunoglobulin (Ig)-like domain of L1, Ng-CAM
           (Neuron-glia CAM cell adhesion molecule), and NrCAM
           (Ng-CAM-related). These proteins belong to the L1
           subfamily of cell adhesion molecules (CAMs) and are
           comprised of an extracellular region having six Ig-like
           domains and five fibronectin type III domains, a
           transmembrane region and an intracellular domain. These
           molecules are primarily expressed in the nervous system.
           L1 is associated with an X-linked recessive disorder,
           X-linked hydrocephalus, MASA syndrome, or spastic
           paraplegia type 1, that involves abnormalities of axonal
           growth.
          Length = 76

 Score = 45.1 bits (107), Expect = 3e-06
 Identities = 21/84 (25%), Positives = 34/84 (40%), Gaps = 10/84 (11%)

Query: 270 ENATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQES 329
           E   + C  + IP   I+W  NG  +         +R+            +L+L+N Q +
Sbjct: 2   ETGRLDCEAEGIPQPTITWRLNGVPI-EELPPDPRRRVD---------GGTLILSNVQPN 51

Query: 330 DSGRFYCVAENRAGIADANFTLQV 353
           D+  + C A N  G   AN  + V
Sbjct: 52  DTAVYQCNASNVHGYLLANAFVHV 75


>gnl|CDD|143206 cd05729, Ig2_FGFR_like, Second immunoglobulin (Ig)-like domain of
           fibroblast growth factor (FGF) receptor and similar
           proteins.  Ig2_FGFR_like: domain similar to the second
           immunoglobulin (Ig)-like domain of fibroblast growth
           factor (FGF) receptor. FGF receptors bind FGF signaling
           polypeptides. FGFs participate in multiple processes
           such as morphogenesis, development, and angiogenesis.
           FGFs bind to four FGF receptor tyrosine kinases (FGFR1,
           -2, -3, -4). Receptor diversity is controlled by
           alternative splicing producing splice variants with
           different ligand binding characteristics and different
           expression patterns. FGFRs have an extracellular region
           comprised of three Ig-like domains, a single
           transmembrane helix, and an intracellular tyrosine
           kinase domain. Ligand binding and specificity reside in
           the Ig-like domains 2 and 3, and the linker region that
           connects these two. FGFR activation and signaling depend
           on FGF-induced dimerization, a process involving cell
           surface heparin or heparin sulfate proteoglycans. This
           group also contains fibroblast growth factor (FGF)
           receptor_like-1(FGFRL1). FGFRL1 does not have a protein
           tyrosine kinase domain at its C terminus; neither does
           its cytoplasmic domain appear to interact with a
           signaling partner. It has been suggested that FGFRL1 may
           not have any direct signaling function, but instead acts
           as a decoy receptor trapping FGFs and preventing them
           from binding other receptors.
          Length = 85

 Score = 44.3 bits (105), Expect = 5e-06
 Identities = 18/79 (22%), Positives = 35/79 (44%), Gaps = 10/79 (12%)

Query: 276 CRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKS-SLVLTNAQESDSGRF 334
           C     P   I+W  +G+           ++   I   +  +K  +L+L +   SDSG++
Sbjct: 16  CPASGNPRPTITWLKDGKPF---------KKEHRIGGYKVRKKKWTLILESVVPSDSGKY 66

Query: 335 YCVAENRAGIADANFTLQV 353
            C+ EN+ G  +  + + V
Sbjct: 67  TCIVENKYGSINHTYKVDV 85


>gnl|CDD|222457 pfam13927, Ig_3, Immunoglobulin domain.  This family contains
           immunoglobulin-like domains.
          Length = 74

 Score = 44.3 bits (104), Expect = 5e-06
 Identities = 25/88 (28%), Positives = 34/88 (38%), Gaps = 15/88 (17%)

Query: 254 KPEIRMDSRYVEAVSSENATVVCRVDSIPPAA-ISWYWNGRLLLNNTAFSSYQRIFVIEQ 312
           KP I +        S    T+ C  +  PP   ISWY NG +   +    S         
Sbjct: 1   KPVITVSPSPSV-TSGGGVTLTCSAEGGPPPPTISWYRNGSISGGSGGLGSSG------- 52

Query: 313 GEYERKSSLVLTNAQESDSGRFYCVAEN 340
                 S+L L++    DSG + CVA N
Sbjct: 53  ------STLTLSSVTSEDSGTYTCVASN 74


>gnl|CDD|215061 PLN00113, PLN00113, leucine-rich repeat receptor-like protein
           kinase; Provisional.
          Length = 968

 Score = 48.7 bits (116), Expect = 7e-06
 Identities = 44/137 (32%), Positives = 61/137 (44%), Gaps = 7/137 (5%)

Query: 50  LTQVLDMSGNNLQILPKEAFRRAGLLNLQKLFLARCHI--GQIDSGALDGLTNLIEIDLS 107
           L   LD+S NNLQ   +   R+  + +LQ L LAR     G  DS     L NL   DLS
Sbjct: 429 LVYFLDISNNNLQ--GRINSRKWDMPSLQMLSLARNKFFGGLPDSFGSKRLENL---DLS 483

Query: 108 DNLLTSIPSLTFQSVRFLRDLNLARNPISKIEKGAFQFVPGLVKLDMSESRLEHISPEAF 167
            N  +        S+  L  L L+ N +S            LV LD+S ++L    P +F
Sbjct: 484 RNQFSGAVPRKLGSLSELMQLKLSENKLSGEIPDELSSCKKLVSLDLSHNQLSGQIPASF 543

Query: 168 TGAKSLESIKLNGNRLS 184
           +    L  + L+ N+LS
Sbjct: 544 SEMPVLSQLDLSQNQLS 560



 Score = 40.6 bits (95), Expect = 0.002
 Identities = 33/132 (25%), Positives = 62/132 (46%), Gaps = 3/132 (2%)

Query: 52  QVLDMSGNNLQILPKEAFRRAGLLNLQKLFLARCHIGQIDSGALDGLTNLIEIDLSDNLL 111
           Q+L ++ N       ++F       L+ L L+R          L  L+ L+++ LS+N L
Sbjct: 455 QMLSLARNKFFGGLPDSFGSK---RLENLDLSRNQFSGAVPRKLGSLSELMQLKLSENKL 511

Query: 112 TSIPSLTFQSVRFLRDLNLARNPISKIEKGAFQFVPGLVKLDMSESRLEHISPEAFTGAK 171
           +        S + L  L+L+ N +S     +F  +P L +LD+S+++L    P+     +
Sbjct: 512 SGEIPDELSSCKKLVSLDLSHNQLSGQIPASFSEMPVLSQLDLSQNQLSGEIPKNLGNVE 571

Query: 172 SLESIKLNGNRL 183
           SL  + ++ N L
Sbjct: 572 SLVQVNISHNHL 583



 Score = 38.7 bits (90), Expect = 0.009
 Identities = 44/138 (31%), Positives = 63/138 (45%), Gaps = 13/138 (9%)

Query: 88  GQIDSGALDGLTNLIEIDLSDNLLT-SIPSLTFQSVRFLRDLNLARNPIS-KIEKGAFQF 145
           G+I S A+  L  +  I+LS+N L+  IP   F +   LR LNL+ N  +  I +G    
Sbjct: 83  GKI-SSAIFRLPYIQTINLSNNQLSGPIPDDIFTTSSSLRYLNLSNNNFTGSIPRG---S 138

Query: 146 VPGLVKLDMSESRLEHISPEAFTGAKSLESIKLNGNRLS-HFPVRSVEPLLKLMMIELHD 204
           +P L  LD+S + L    P       SL+ + L GN L    P  S+  L  L  + L  
Sbjct: 139 IPNLETLDLSNNMLSGEIPNDIGSFSSLKVLDLGGNVLVGKIPN-SLTNLTSLEFLTLAS 197

Query: 205 NPWVCDC-----NMRSIK 217
           N  V         M+S+K
Sbjct: 198 NQLVGQIPRELGQMKSLK 215



 Score = 37.5 bits (87), Expect = 0.024
 Identities = 41/135 (30%), Positives = 61/135 (45%), Gaps = 10/135 (7%)

Query: 54  LDMSGNNL--QILPKEAFRRAGLLNLQKLFLARCHI-GQIDSGALDGLTNLIEIDLSDNL 110
           LD+  NNL   I          L NLQ LFL +  + G I   ++  L  LI +DLSDN 
Sbjct: 241 LDLVYNNLTGPIPSS----LGNLKNLQYLFLYQNKLSGPIPP-SIFSLQKLISLDLSDNS 295

Query: 111 LT-SIPSLTFQSVRFLRDLNLARNPISKIEKGAFQFVPGLVKLDMSESRLEHISPEAFTG 169
           L+  IP L  Q ++ L  L+L  N  +     A   +P L  L +  ++     P+    
Sbjct: 296 LSGEIPELVIQ-LQNLEILHLFSNNFTGKIPVALTSLPRLQVLQLWSNKFSGEIPKNLGK 354

Query: 170 AKSLESIKLNGNRLS 184
             +L  + L+ N L+
Sbjct: 355 HNNLTVLDLSTNNLT 369



 Score = 37.1 bits (86), Expect = 0.026
 Identities = 43/155 (27%), Positives = 76/155 (49%), Gaps = 7/155 (4%)

Query: 53  VLDMSGNNLQILPKEAFRRAGLLNLQKLFLARCHI-GQIDSGALDGLTNLIEIDLSDNLL 111
           VLD+S NNL     E    +G  NL KL L    + G+I   +L    +L  + L DN  
Sbjct: 360 VLDLSTNNLTGEIPEGLCSSG--NLFKLILFSNSLEGEIPK-SLGACRSLRRVRLQDNSF 416

Query: 112 TSIPSLTFQSVRFLRDLNLARNPIS-KIEKGAFQFVPGLVKLDMSESRLEHISPEAFTGA 170
           +      F  +  +  L+++ N +  +I    +  +P L  L ++ ++     P++F G+
Sbjct: 417 SGELPSEFTKLPLVYFLDISNNNLQGRINSRKWD-MPSLQMLSLARNKFFGGLPDSF-GS 474

Query: 171 KSLESIKLNGNRLSHFPVRSVEPLLKLMMIELHDN 205
           K LE++ L+ N+ S    R +  L +LM ++L +N
Sbjct: 475 KRLENLDLSRNQFSGAVPRKLGSLSELMQLKLSEN 509



 Score = 36.7 bits (85), Expect = 0.037
 Identities = 35/136 (25%), Positives = 59/136 (43%), Gaps = 4/136 (2%)

Query: 35  RDKFLITIPEAPESELTQVLDMSGNNLQILPKEAFRRAGLLNLQKLFLARCHIGQIDSGA 94
           R+KF   +P++  S+  + LD+S N              L  L +L L+   +       
Sbjct: 461 RNKFFGGLPDSFGSKRLENLDLSRNQFSGAVPRKLGS--LSELMQLKLSENKLSGEIPDE 518

Query: 95  LDGLTNLIEIDLSDNLLTSIPSLTFQSVRFLRDLNLARNPISKIEKGAFQFVPGLVKLDM 154
           L     L+ +DLS N L+     +F  +  L  L+L++N +S         V  LV++++
Sbjct: 519 LSSCKKLVSLDLSHNQLSGQIPASFSEMPVLSQLDLSQNQLSGEIPKNLGNVESLVQVNI 578

Query: 155 SESRLEHISPEAFTGA 170
           S + L    P   TGA
Sbjct: 579 SHNHLHGSLP--STGA 592



 Score = 32.5 bits (74), Expect = 0.77
 Identities = 33/113 (29%), Positives = 59/113 (52%), Gaps = 4/113 (3%)

Query: 95  LDGLTNLIEIDLS-DNLLTSIPSLTFQSVRFLRDLNLARNPIS-KIEKGAFQFVPGLVKL 152
           + GLT+L  +DL  +NL   IPS +  +++ L+ L L +N +S  I    F  +  L+ L
Sbjct: 232 IGGLTSLNHLDLVYNNLTGPIPS-SLGNLKNLQYLFLYQNKLSGPIPPSIFS-LQKLISL 289

Query: 153 DMSESRLEHISPEAFTGAKSLESIKLNGNRLSHFPVRSVEPLLKLMMIELHDN 205
           D+S++ L    PE     ++LE + L  N  +     ++  L +L +++L  N
Sbjct: 290 DLSDNSLSGEIPELVIQLQNLEILHLFSNNFTGKIPVALTSLPRLQVLQLWSN 342


>gnl|CDD|205486 pfam13306, LRR_5, Leucine rich repeats (6 copies).  This family
           includes a number of leucine rich repeats. This family
           contains a large number of BSPA-like surface antigens
           from Trichomonas vaginalis.
          Length = 128

 Score = 44.4 bits (106), Expect = 1e-05
 Identities = 22/83 (26%), Positives = 35/83 (42%), Gaps = 3/83 (3%)

Query: 99  TNLIEIDLSDNLLTSIPSLTFQSVRFLRDLNLARNPISKIEKGAFQFVPGLVKLDMSESR 158
            +L  I +  ++ TSI    F     L+ + L  + ++ I   AF     L  + +  S 
Sbjct: 11  CSLTSITIPSSV-TSIGEYAFSGCTSLKSITLPSS-LTSIGSYAFYNCSSLTSITIPSS- 67

Query: 159 LEHISPEAFTGAKSLESIKLNGN 181
           L  I   AF+   SL SI +  N
Sbjct: 68  LTSIGEYAFSNCSSLTSITIPSN 90



 Score = 33.7 bits (78), Expect = 0.073
 Identities = 18/86 (20%), Positives = 36/86 (41%), Gaps = 6/86 (6%)

Query: 59  NNLQILPKEAFRRAGLLNLQKLFLARCHIGQIDSGALDGLTNLIEIDLSDNLLTSIPSLT 118
           ++L  +   AF      +L  + +    +  I   A    ++L  I +  N LT+I S  
Sbjct: 43  SSLTSIGSYAF--YNCSSLTSITIP-SSLTSIGEYAFSNCSSLTSITIPSN-LTTIGSYA 98

Query: 119 FQSVRFLRDLNLARNPISKIEKGAFQ 144
           F +   L+ + +  + ++ I   AF 
Sbjct: 99  FSNCS-LKSITIPSS-VTTIGDYAFS 122


>gnl|CDD|143209 cd05732, Ig5_NCAM-1_like, Fifth immunoglobulin (Ig)-like domain of
           Neural Cell Adhesion Molecule NCAM-1 (NCAM) and similar
           proteins.  Ig5_NCAM-1 like: domain similar to the fifth
           immunoglobulin (Ig)-like domain of Neural Cell Adhesion
           Molecule NCAM-1 (NCAM). NCAM plays important roles in
           the development and regeneration of the central nervous
           system, in synaptogenesis and neural migration. NCAM
           mediates cell-cell and cell-substratum recognition and
           adhesion via homophilic  (NCAM-NCAM), and heterophilic
           (NCAM-non-NCAM), interactions. NCAM is expressed as
           three major isoforms having different intracellular
           extensions. The extracellular portion of NCAM has five
           N-terminal Ig-like domains and two fibronectin type III
           domains. The double zipper adhesion complex model for
           NCAM homophilic binding involves Ig1, Ig2, and Ig3. By
           this model, Ig1 and Ig2 mediate dimerization of NCAM
           molecules situated on the same cell surface (cis
           interactions), and Ig3 domains mediate interactions
           between NCAM molecules expressed on the surface of
           opposing cells (trans interactions), through binding to
           the Ig1 and Ig2 domains. The adhesive ability of NCAM is
           modulated by the addition of polysialic acid chains to
           the fifth Ig-like domain. Also included in this group is
           NCAM-2 (also known as OCAM/mamFas II and RNCAM)  NCAM-2
           is differentially expressed in the developing and mature
           olfactory epithelium (OE).
          Length = 96

 Score = 42.5 bits (100), Expect = 4e-05
 Identities = 30/93 (32%), Positives = 41/93 (44%), Gaps = 11/93 (11%)

Query: 254 KPEIRMDSRYVE---AVSSENATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVI 310
           +P+I     Y+E   AV  E  T+ C  +  P   I+W        +    S   RI V 
Sbjct: 2   QPKIT----YLENQTAVELEQITLTCEAEGDPIPEITWR-RATRNFSEGDKSLDGRIVVR 56

Query: 311 EQGEYERKSSLVLTNAQESDSGRFYCVAENRAG 343
                 R SSL L + Q +D+GR+ C A NR G
Sbjct: 57  GH---ARVSSLTLKDVQLTDAGRYDCEASNRIG 86


>gnl|CDD|143224 cd05747, Ig5_Titin_like, M5, fifth immunoglobulin (Ig)-like domain
           of human titin C terminus and similar proteins.
           Ig5_Titin_like: domain similar to the M5, fifth
           immunoglobulin (Ig)-like domain from the human titin C
           terminus. Titin (also called connectin) is a fibrous
           sarcomeric protein specifically found in vertebrate
           striated muscle. Titin is gigantic; depending on isoform
           composition it ranges from 2970 to 3700 kDa, and is of a
           length that spans half a sarcomere. Titin largely
           consists of multiple repeats of Ig-like and fibronectin
           type 3 (FN-III)-like domains. Titin connects the ends of
           myosin thick filaments to Z disks and extends along the
           thick filament to the H zone, and appears to function
           similar to an elastic band, keeping the myosin filaments
           centered in the sarcomere during muscle contraction or
           stretching.
          Length = 92

 Score = 42.3 bits (99), Expect = 4e-05
 Identities = 28/82 (34%), Positives = 39/82 (47%), Gaps = 9/82 (10%)

Query: 270 ENATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQES 329
           E+A   C VD  P   ++W   G+++       S QR   I   EY  KS+  ++  Q S
Sbjct: 19  ESARFSCDVDGEPAPTVTWMREGQII------VSSQR-HQITSTEY--KSTFEISKVQMS 69

Query: 330 DSGRFYCVAENRAGIADANFTL 351
           D G +  V EN  G  +A FTL
Sbjct: 70  DEGNYTVVVENSEGKQEAQFTL 91


>gnl|CDD|215677 pfam00047, ig, Immunoglobulin domain.  Members of the
           immunoglobulin superfamily are found in hundreds of
           proteins of different functions. Examples include
           antibodies, the giant muscle kinase titin and receptor
           tyrosine kinases. Immunoglobulin-like domains may be
           involved in protein-protein and protein-ligand
           interactions. The Pfam alignments do not include the
           first and last strand of the immunoglobulin-like domain.
          Length = 62

 Score = 41.0 bits (96), Expect = 5e-05
 Identities = 15/71 (21%), Positives = 27/71 (38%), Gaps = 10/71 (14%)

Query: 269 SENATVVCRVDSIPPAAISWY-WNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQ 327
             + T+ C V   P   ++W+     L  + T  +   R+            +L ++N  
Sbjct: 1   GSSVTLTCSVSGPPQVDVTWFKEGKGLEESTTVGTDENRVS---------SITLTISNVT 51

Query: 328 ESDSGRFYCVA 338
             DSG + CV 
Sbjct: 52  PEDSGTYTCVV 62


>gnl|CDD|205079 pfam12799, LRR_4, Leucine Rich repeats (2 copies).  Leucine rich
           repeats are short sequence motifs present in a number of
           proteins with diverse functions and cellular locations.
           These repeats are usually involved in protein-protein
           interactions. Each Leucine Rich Repeat is composed of a
           beta-alpha unit. These units form elongated non-globular
           structures. Leucine Rich Repeats are often flanked by
           cysteine rich domains.
          Length = 43

 Score = 39.8 bits (94), Expect = 8e-05
 Identities = 15/43 (34%), Positives = 22/43 (51%), Gaps = 2/43 (4%)

Query: 76  NLQKLFLARCHIGQIDSGALDGLTNLIEIDLSDNLLTSIPSLT 118
           NL+ L L+   I  +    L  L NL  +DLS N +T +  L+
Sbjct: 2   NLETLDLSNNQITDLP--PLSNLPNLETLDLSGNKITDLSPLS 42



 Score = 37.9 bits (89), Expect = 4e-04
 Identities = 15/41 (36%), Positives = 25/41 (60%), Gaps = 2/41 (4%)

Query: 99  TNLIEIDLSDNLLTSIPSLTFQSVRFLRDLNLARNPISKIE 139
           TNL  +DLS+N +T +P L   ++  L  L+L+ N I+ + 
Sbjct: 1   TNLETLDLSNNQITDLPPL--SNLPNLETLDLSGNKITDLS 39



 Score = 28.2 bits (64), Expect = 0.99
 Identities = 12/41 (29%), Positives = 19/41 (46%), Gaps = 4/41 (9%)

Query: 52 QVLDMSGNNLQILPKEAFRRAGLLNLQKLFLARCHIGQIDS 92
          + LD+S N +  LP      + L NL+ L L+   I  +  
Sbjct: 4  ETLDLSNNQITDLPP----LSNLPNLETLDLSGNKITDLSP 40



 Score = 27.1 bits (61), Expect = 3.1
 Identities = 10/42 (23%), Positives = 23/42 (54%), Gaps = 2/42 (4%)

Query: 147 PGLVKLDMSESRLEHISPEAFTGAKSLESIKLNGNRLSHFPV 188
             L  LD+S +++  + P   +   +LE++ L+GN+++    
Sbjct: 1   TNLETLDLSNNQITDLPP--LSNLPNLETLDLSGNKITDLSP 40


>gnl|CDD|143242 cd05765, Ig_3, Subgroup of the immunoglobulin (Ig) superfamily.
           Ig_3: subgroup of the immunoglobulin (Ig) domain found
           in the Ig superfamily. The Ig superfamily is a
           heterogenous group of proteins, built on a common fold
           comprised of a sandwich of two beta sheets. Members of
           the Ig superfamily are components of immunoglobulin,
           neuroglia, cell surface glycoproteins, such as T-cell
           receptors, CD2, CD4, CD8, and membrane glycoproteins,
           such as butyrophilin and chondroitin sulfate
           proteoglycan core protein. A predominant feature of most
           Ig domains is a disulfide bridge connecting the two
           beta-sheets with a tryptophan residue packed against the
           disulfide bond.
          Length = 81

 Score = 40.6 bits (95), Expect = 1e-04
 Identities = 28/86 (32%), Positives = 38/86 (44%), Gaps = 8/86 (9%)

Query: 270 ENATVVCRVDSIPPAAISW--YWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQ 327
           E A+  C V   PP  I+W    +G+  L          + V   G+      LV+ NAQ
Sbjct: 2   ETASFHCDVTGRPPPEITWEKQVHGKENLIMRPNHVRGNVVVTNIGQ------LVIYNAQ 55

Query: 328 ESDSGRFYCVAENRAGIADANFTLQV 353
             D+G + C A N  G+  ANF L V
Sbjct: 56  PQDAGLYTCTARNSGGLLRANFPLSV 81


>gnl|CDD|143260 cd05852, Ig5_Contactin-1, Fifth Ig domain of contactin-1.
           Ig5_Contactin-1: fifth Ig domain of the neural cell
           adhesion molecule contactin-1. Contactins are comprised
           of six Ig domains followed by four fibronectin type III
           (FnIII) domains anchored to the membrane by
           glycosylphosphatidylinositol. Contactin-1 is
           differentially expressed in tumor tissues and may
           through a RhoA mechanism, facilitate invasion and
           metastasis of human lung adenocarcinoma.
          Length = 73

 Score = 39.6 bits (92), Expect = 2e-04
 Identities = 24/78 (30%), Positives = 35/78 (44%), Gaps = 12/78 (15%)

Query: 276 CRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQESDSGRFY 335
           C+  + P    SW     LL+NN+      RI + + G      SL + N  + D G + 
Sbjct: 8   CKPKAAPKPKFSWSKGTELLVNNS------RISIWDDG------SLEILNITKLDEGSYT 55

Query: 336 CVAENRAGIADANFTLQV 353
           C AEN  G A++   L V
Sbjct: 56  CFAENNRGKANSTGVLSV 73


>gnl|CDD|143168 cd04967, Ig1_Contactin, First Ig domain of contactin.
           Ig1_Contactin: First Ig domain of contactins. Contactins
           are neural cell adhesion molecules and are comprised of
           six Ig domains followed by four fibronectin type
           III(FnIII) domains anchored to the membrane by
           glycosylphosphatidylinositol. The first four Ig domains
           form the intermolecular binding fragment, which arranges
           as a compact U-shaped module via contacts between Ig
           domains 1 and 4, and between Ig domains 2 and 3.
           Contactin-2 (TAG-1, axonin-1) may play a part in the
           neuronal processes of neurite outgrowth, axon guidance
           and fasciculation, and neuronal migration. This group
           also includes contactin-1 and contactin-5. The different
           contactins show different expression patterns in the
           central nervous system. During development and in
           adulthood, contactin-2 is transiently expressed in
           subsets of central and peripheral neurons. Contactin-5
           is expressed specifically in the rat postnatal nervous
           system, peaking at about 3 weeks postnatal, and a lack
           of contactin-5 (NB-2) results in an impairment of
           neuronal activity in the rat auditory system.
           Contactin-5 is highly expressed in the adult human brain
           in the occipital lobe and in the amygdala. Contactin-1
           is differentially expressed in tumor tissues and may,
           through a RhoA mechanism, facilitate invasion and
           metastasis of human lung adenocarcinoma.
          Length = 91

 Score = 39.7 bits (93), Expect = 3e-04
 Identities = 23/90 (25%), Positives = 37/90 (41%), Gaps = 20/90 (22%)

Query: 260 DSRYVEAVSSENATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKS 319
           D+ + E       ++ CR    PP    W      L+N T          I+     R S
Sbjct: 10  DTIFPEESDEGKVSLNCRARGSPPPTYRW------LMNGTE---------IDDEPDSRYS 54

Query: 320 ----SLVLTNAQES-DSGRFYCVAENRAGI 344
               +LV++N  ++ D+GR+ C+A N  G 
Sbjct: 55  LVGGNLVISNPSKAKDAGRYQCLASNIVGT 84


>gnl|CDD|143208 cd05731, Ig3_L1-CAM_like, Third immunoglobulin (Ig)-like domain of
           the L1 cell adhesion molecule (CAM).  Ig3_L1-CAM_like:
           domain similar to the third immunoglobulin (Ig)-like
           domain of the L1 cell adhesion molecule (CAM). L1
           belongs to the L1 subfamily of cell adhesion molecules
           (CAMs) and is comprised of an extracellular region
           having six Ig-like domains and five fibronectin type III
           domains, a transmembrane region and an intracellular
           domain. L1 is primarily expressed in the nervous system
           and is involved in its development and function. L1 is
           associated with an X-linked recessive disorder, X-linked
           hydrocephalus, MASA syndrome, or spastic paraplegia type
           1, that involves abnormalities of axonal growth. This
           group also contains the chicken neuron-glia cell
           adhesion molecule, Ng-CAM and human neurofascin.
          Length = 71

 Score = 38.9 bits (91), Expect = 4e-04
 Identities = 20/79 (25%), Positives = 31/79 (39%), Gaps = 14/79 (17%)

Query: 276 CRVDSIPPAAISWY-WNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQESDSGRF 334
           C  + +P   ISW    G L  + T F ++ +             +L + N  E D G +
Sbjct: 5   CIAEGLPTPEISWIKIGGELPADRTKFENFNK-------------TLKIDNVSEEDDGEY 51

Query: 335 YCVAENRAGIADANFTLQV 353
            C A N  G A    ++ V
Sbjct: 52  RCTASNSLGSARHTISVTV 70


>gnl|CDD|143176 cd04975, Ig4_SCFR_like, Fourth immunoglobulin (Ig)-like domain of
           stem cell factor receptor (SCFR) and similar proteins.
           Ig4_SCFR_like; fourth immunoglobulin (Ig)-like domain of
           stem cell factor receptor (SCFR). In addition to SCFR
           this group also includes the fourth Ig domain of
           platelet-derived growth factor receptors (PDGFR), alpha
           and beta, the fourth Ig domain of macrophage colony
           stimulating factor (M-CSF), and the Ig domain of the
           receptor tyrosine kinase KIT. SCFR and the PDGFR alpha
           and beta have similar organization: an extracellular
           component having five Ig-like domains, a transmembrane
           segment, and a cytoplasmic portion having protein
           tyrosine kinase activity. SCFR and its ligand SCF are
           critical for normal hematopoiesis, mast cell
           development, melanocytes and gametogenesis. SCF binds to
           the second and third Ig-like domains of SCFR, this
           fourth Ig-like domain participates in SCFR dimerization,
           which follows ligand binding. Deletion of this fourth
           SCFR_Ig-like domain abolishes the ligand-induced
           dimerization of SCFR and completely inhibits signal
           transduction. PDGF is a potent mitogen for connective
           tissue cells. PDGF-stimulated processes are mediated by
           three different PDGFs (PDGF-A,-B, and C). PDGFR alpha
           binds to all three PDGFs, whereas the PDGFR beta, binds
           only to PDGF-B. In mice, PDGFR alpha, and PDGFR beta,
           are essential for normal development.
          Length = 101

 Score = 39.6 bits (93), Expect = 4e-04
 Identities = 21/76 (27%), Positives = 30/76 (39%), Gaps = 8/76 (10%)

Query: 282 PPAAISWYWNGRLLLNNTAFSSYQRIFV--IEQGEYERKSSLVLTNAQESDSGRFYCVAE 339
           PP  I+W ++ R L N           V    + EY   S L L   +ES++G +  +A 
Sbjct: 32  PPPHINWTYDNRTLTNK------LTEIVTSENESEYRYVSELKLVRLKESEAGTYTFLAS 85

Query: 340 NRAGIADANFTLQVTY 355
           N        F L V  
Sbjct: 86  NSDASKSLTFELYVNV 101


>gnl|CDD|143201 cd05724, Ig2_Robo, Second immunoglobulin (Ig)-like domain in Robo
           (roundabout) receptors.  Ig2_Robo: domain similar to the
           second immunoglobulin (Ig)-like domain in Robo
           (roundabout) receptors. Robo receptors play a role in
           the development of the central nervous system (CNS), and
           are receptors of Slit protein. Slit is a repellant
           secreted by the neural cells in the midline. Slit acts
           through Robo to prevent most neurons from crossing the
           midline from either side. Three mammalian Robo homologs
           (robo1, -2, and -3), and three mammalian Slit homologs
           (Slit-1,-2, -3), have been identified. Commissural
           axons, which cross the midline, express low levels of
           Robo; longitudinal axons, which avoid the midline,
           express high levels of Robo. robo1, -2, and -3 are
           expressed by commissural neurons in the vertebrate
           spinal cord and Slits 1, -2, -3 are expressed at the
           ventral midline. Robo-3 is a divergent member of the
           Robo family which instead of being a positive regulator
           of slit responsiveness, antagonizes slit responsiveness
           in precrossing axons.  The Slit-Robo interaction is
           mediated by the second leucine-rich repeat (LRR) domain
           of Slit and the two N-terminal Ig domains of Robo, Ig1
           and Ig2. The primary Robo binding site for Slit2 has
           been shown by surface plasmon resonance experiments and
           mutational analysis to be is the Ig1 domain, while the
           Ig2 domain has been proposed to harbor a weak secondary
           binding site.
          Length = 86

 Score = 38.9 bits (91), Expect = 5e-04
 Identities = 20/81 (24%), Positives = 36/81 (44%), Gaps = 12/81 (14%)

Query: 264 VEAVSSENATVVCRVD-SIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLV 322
            +    E A + C      P   +SW  +G+ L         +R+ +++ G      +L+
Sbjct: 6   TQVAVGEMAVLECSPPRGHPEPTVSWRKDGQPLN-----LDNERVRIVDDG------NLL 54

Query: 323 LTNAQESDSGRFYCVAENRAG 343
           +  A++SD G + CVA N  G
Sbjct: 55  IAEARKSDEGTYKCVATNMVG 75


>gnl|CDD|143180 cd04979, Ig_Semaphorin_C, Immunoglobulin (Ig)-like domain of
           semaphorin.  Ig_Semaphorin_C; Immunoglobulin (Ig)-like
           domain in semaphorins. Semaphorins are transmembrane
           protein that have important roles in a variety of
           tissues. Functionally, semaphorins were initially
           characterized for their importance in the development of
           the nervous system and in axonal guidance. Later they
           have been found to be important for the formation and
           functioning of the cardiovascular, endocrine,
           gastrointestinal, hepatic, immune, musculoskeletal,
           renal, reproductive, and respiratory systems.
           Semaphorins function through binding to their receptors
           and transmembrane semaphorins also serves as receptors
           themselves. Although molecular mechanism of semaphorins
           is poorly understood, the Ig-like domains may involve in
           ligand binding or dimerization.
          Length = 89

 Score = 38.5 bits (90), Expect = 7e-04
 Identities = 16/72 (22%), Positives = 28/72 (38%), Gaps = 12/72 (16%)

Query: 270 ENATVV--CRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQ 327
           E  +V   C   S   A++ W + G  L          R+ V E G       L++ +  
Sbjct: 10  EGNSVFLECSPKS-NLASVVWLFQGGPLQRKEEPEE--RLLVTEDG-------LLIRSVS 59

Query: 328 ESDSGRFYCVAE 339
            +D+G + C + 
Sbjct: 60  PADAGVYTCQSV 71


>gnl|CDD|143221 cd05744, Ig_Myotilin_C_like, Immunoglobulin (Ig)-like domain of
           myotilin, palladin, and myopalladin.
           Ig_Myotilin_like_C: immunoglobulin (Ig)-like domain in
           myotilin, palladin, and myopalladin.  Myotilin,
           palladin, and myopalladin function as scaffolds that
           regulate actin organization. Myotilin and myopalladin
           are most abundant in skeletal and cardiac muscle;
           palladin is ubiquitously expressed in the organs of
           developing vertebrates and  plays a key role in cellular
           morphogenesis. The three family members each interact
           with specific molecular partners: all three bind to
           alpha-actinin; in addition, palladin also binds to
           vasodilator-stimulated phosphoprotein (VASP) and ezrin,
           myotilin binds to filamin and actin, and myopalladin
           also binds to nebulin and cardiac ankyrin repeat protein
           (CARP).
          Length = 75

 Score = 37.9 bits (88), Expect = 0.001
 Identities = 28/78 (35%), Positives = 37/78 (47%), Gaps = 7/78 (8%)

Query: 276 CRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQESDSGRFY 335
           CRV +IPP  I W  N  +L  NT      RI  + Q    R   L++ NA + D+G + 
Sbjct: 5   CRVSAIPPPQIFWKKNNEMLTYNT-----DRI-SLYQDNCGR-ICLLIQNANKEDAGWYT 57

Query: 336 CVAENRAGIADANFTLQV 353
             A N AG+   N  L V
Sbjct: 58  VSAVNEAGVVSCNARLDV 75


>gnl|CDD|143265 cd05857, Ig2_FGFR, Second immunoglobulin (Ig)-like domain of
           fibroblast growth factor (FGF) receptor.  Ig2_FGFR:
           second immunoglobulin (Ig)-like domain of fibroblast
           growth factor (FGF) receptor. FGF receptors bind FGF
           signaling polypeptides. FGFs participate in multiple
           processes such as morphogenesis, development, and
           angiogenesis. FGFs bind to four FGF receptor tyrosine
           kinases (FGFR1, -2, -3, -4). Receptor diversity is
           controlled by alternative splicing producing splice
           variants with different ligand binding characteristics
           and different expression patterns. FGFRs have an
           extracellular region comprised of three IG-like domains,
           a single transmembrane helix, and an intracellular
           tyrosine kinase domain. Ligand binding and specificity
           reside in the Ig-like domains 2 and 3, and the linker
           region that connects these two. FGFR activation and
           signaling depend on FGF-induced dimerization, a process
           involving cell surface heparin or heparin sulfate
           proteoglycans.
          Length = 85

 Score = 38.3 bits (89), Expect = 0.001
 Identities = 22/81 (27%), Positives = 32/81 (39%), Gaps = 14/81 (17%)

Query: 276 CRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYE---RKSSLVLTNAQESDSG 332
           C     P   + W  NG+       F    RI     G Y+   +  SL++ +   SD G
Sbjct: 16  CPAAGNPTPTMRWLKNGK------EFKQEHRI-----GGYKVRNQHWSLIMESVVPSDKG 64

Query: 333 RFYCVAENRAGIADANFTLQV 353
            + CV EN  G  +  + L V
Sbjct: 65  NYTCVVENEYGSINHTYHLDV 85


>gnl|CDD|143225 cd05748, Ig_Titin_like, Immunoglobulin (Ig)-like domain of titin
           and similar proteins.  Ig_Titin_like: immunoglobulin
           (Ig)-like domain found in titin-like proteins. Titin
           (also called connectin) is a fibrous sarcomeric protein
           specifically found in vertebrate striated muscle. Titin
           is gigantic, depending on isoform composition it ranges
           from 2970 to 3700 kDa, and is of a length that spans
           half a sarcomere. Titin largely consists of multiple
           repeats of Ig-like and fibronectin type 3 (FN-III)-like
           domains. Titin connects the ends of myosin thick
           filaments to Z disks and extends along the thick
           filament to the H zone.  It appears to function
           similarly to an elastic band, keeping the myosin
           filaments centered in the sarcomere during muscle
           contraction or stretching. Within the sarcomere, titin
           is also attached to or is associated with myosin binding
           protein C (MyBP-C). MyBP-C appears to contribute to the
           generation of passive tension by titin, and similar to
           titin has repeated Ig-like and FN-III domains. Also
           included in this group are worm twitchin and insect
           projectin, thick filament proteins of invertebrate
           muscle, which also have repeated Ig-like and FN-III
           domains.
          Length = 74

 Score = 37.6 bits (88), Expect = 0.001
 Identities = 20/73 (27%), Positives = 33/73 (45%), Gaps = 9/73 (12%)

Query: 281 IPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQESDSGRFYCVAEN 340
            P   ++W  +G+ L  +           IE       +SLV+ NA+ SDSG++    +N
Sbjct: 11  RPTPTVTWSKDGKPLKLSGRVQ-------IETTAS--STSLVIKNAERSDSGKYTLTLKN 61

Query: 341 RAGIADANFTLQV 353
            AG   A   ++V
Sbjct: 62  PAGEKSATINVKV 74


>gnl|CDD|143275 cd05867, Ig4_L1-CAM_like, Fourth immunoglobulin (Ig)-like domain of
           the L1 cell adhesion molecule (CAM).  Ig4_L1-CAM_like:
           fourth immunoglobulin (Ig)-like domain of the L1 cell
           adhesion molecule (CAM). L1 is comprised of an
           extracellular region having six Ig-like domains and five
           fibronectin type III domains, a transmembrane region and
           an intracellular domain. L1 is primarily expressed in
           the nervous system and is involved in its development
           and function. L1 is associated with an X-linked
           recessive disorder, X-linked hydrocephalus, MASA
           syndrome, or spastic paraplegia type 1, that involves
           abnormalities of axonal growth. This group also contains
           the chicken neuron-glia cell adhesion molecule, Ng-CAM.
          Length = 76

 Score = 37.2 bits (86), Expect = 0.002
 Identities = 24/84 (28%), Positives = 37/84 (44%), Gaps = 10/84 (11%)

Query: 270 ENATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQES 329
           E A + C+V+ IP   I+W  NG  +               +   +    +L+LT+ Q S
Sbjct: 2   ETARLDCQVEGIPTPNITWSINGAPIEGTDP----------DPRRHVSSGALILTDVQPS 51

Query: 330 DSGRFYCVAENRAGIADANFTLQV 353
           D+  + C A NR G   AN  + V
Sbjct: 52  DTAVYQCEARNRHGNLLANAHVHV 75


>gnl|CDD|238064 cd00116, LRR_RI, Leucine-rich repeats (LRRs), ribonuclease
           inhibitor (RI)-like subfamily. LRRs are 20-29 residue
           sequence motifs present in many proteins that
           participate in protein-protein interactions and have
           different functions and cellular locations. LRRs
           correspond to structural units consisting of a beta
           strand (LxxLxLxxN/CxL conserved pattern) and an alpha
           helix. This alignment contains 12 strands corresponding
           to 11 full repeats, consistent with the extent observed
           in the subfamily acting as Ran GTPase Activating
           Proteins (RanGAP1).
          Length = 319

 Score = 40.4 bits (95), Expect = 0.002
 Identities = 48/199 (24%), Positives = 71/199 (35%), Gaps = 36/199 (18%)

Query: 35  RDKFLITIPEAPESELT--------QVLDMSGNNLQIL---PKEAFRRAGLLNLQKLFLA 83
                  IP   +S L         Q LD+S N L        E+  R+   +LQ+L L 
Sbjct: 59  SLNETGRIPRGLQSLLQGLTKGCGLQELDLSDNALGPDGCGVLESLLRSS--SLQELKLN 116

Query: 84  RCHIGQIDSGAL--DGLT----NLIEIDLSDNLLTSIP----SLTFQSVRFLRDLNLARN 133
              +G      L   GL      L ++ L  N L        +   ++ R L++LNLA N
Sbjct: 117 NNGLG-DRGLRLLAKGLKDLPPALEKLVLGRNRLEGASCEALAKALRANRDLKELNLANN 175

Query: 134 PISKIEKG------AFQFVPGLVKLDMSESRLEHISPEAFTGA----KSLESIKLNGNRL 183
            I   + G        +    L  LD++ + L      A        KSLE + L  N L
Sbjct: 176 GIG--DAGIRALAEGLKANCNLEVLDLNNNGLTDEGASALAETLASLKSLEVLNLGDNNL 233

Query: 184 SHFPVRSVEPLLKLMMIEL 202
           +     ++   L    I L
Sbjct: 234 TDAGAAALASALLSPNISL 252



 Score = 36.9 bits (86), Expect = 0.023
 Identities = 25/130 (19%), Positives = 47/130 (36%), Gaps = 29/130 (22%)

Query: 52  QVLDMSGNNL---------QILPKEAFRRAGLLNLQKLFLARCHIGQID-SGALDGLTNL 101
           + L+++ N +         + L   A     +L+L    L     G    +  L  L +L
Sbjct: 168 KELNLANNGIGDAGIRALAEGLK--ANCNLEVLDLNNNGL--TDEGASALAETLASLKSL 223

Query: 102 IEIDLSDNLLTSIPSLTFQS-----VRFLRDLNLARNPISKIEKGAFQFV-------PGL 149
             ++L DN LT   +    S        L  L+L+ N    I     + +         L
Sbjct: 224 EVLNLGDNNLTDAGAAALASALLSPNISLLTLSLSCN---DITDDGAKDLAEVLAEKESL 280

Query: 150 VKLDMSESRL 159
           ++LD+  ++ 
Sbjct: 281 LELDLRGNKF 290


>gnl|CDD|143300 cd05892, Ig_Myotilin_C, C-terminal immunoglobulin (Ig)-like domain
           of myotilin.  Ig_Myotilin_C: C-terminal immunoglobulin
           (Ig)-like domain of myotilin. Mytolin belongs to the
           palladin-myotilin-myopalladin family. Proteins belonging
           to the latter family contain multiple Ig-like domains
           and function as scaffolds, modulating actin
           cytoskeleton. Myotilin is most abundant in skeletal and
           cardiac muscle, and is involved in maintaining sarcomere
           integrity. It binds to alpha-actinin, filamin and actin.
           Mutations in myotilin lead to muscle disorders.
          Length = 75

 Score = 36.9 bits (85), Expect = 0.002
 Identities = 22/78 (28%), Positives = 39/78 (50%), Gaps = 7/78 (8%)

Query: 276 CRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQESDSGRFY 335
           C++ +IPP  I W  N  ++  NT      RI + +  +   + +L++ N  + D+G + 
Sbjct: 5   CQISAIPPPKIFWKRNNEMVQYNT-----DRISLYQ--DNSGRVTLLIKNVNKKDAGWYT 57

Query: 336 CVAENRAGIADANFTLQV 353
             A N AG+A  +  L V
Sbjct: 58  VSAVNEAGVATCHARLDV 75


>gnl|CDD|143203 cd05726, Ig4_Robo, Third immunoglobulin (Ig)-like domain in Robo
           (roundabout) receptors.  Ig4_Robo: domain similar to the
           third immunoglobulin (Ig)-like domain in Robo
           (roundabout) receptors. Robo receptors play a role in
           the development of the central nervous system (CNS), and
           are receptors of Slit protein. Slit is a repellant
           secreted by the neural cells in the midline. Slit acts
           through Robo to prevent most neurons from crossing the
           midline from either side. Three mammalian Robo homologs
           (robo1, -2, and -3), and three mammalian Slit homologs
           (Slit-1,-2, -3), have been identified. Commissural
           axons, which cross the midline, express low levels of
           Robo; longitudinal axons, which avoid the midline,
           express high levels of Robo. robo1, -2, and -3 are
           expressed by commissural neurons in the vertebrate
           spinal cord and Slits 1, -2, -3 are expressed at the
           ventral midline. Robo-3 is a divergent member of the
           Robo family which instead of being a positive regulator
           of slit responsiveness, antagonizes slit responsiveness
           in precrossing axons.  The Slit-Robo interaction is
           mediated by the second leucine-rich repeat (LRR) domain
           of Slit and the two N-terminal Ig domains of Robo, Ig1
           and Ig2. The primary Robo binding site for Slit2 has
           been shown by surface plasmon resonance experiments and
           mutational analysis to be is the Ig1 domain, while the
           Ig2 domain has been proposed to harbor a weak secondary
           binding site.
          Length = 90

 Score = 36.9 bits (85), Expect = 0.003
 Identities = 27/86 (31%), Positives = 33/86 (38%), Gaps = 8/86 (9%)

Query: 271 NATVVCRVDSIPPAAISWYWNGR--LLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQE 328
             T  C     P  AI W   G   LL +     S  R  V + G+      L +TN Q 
Sbjct: 3   TVTFQCEATGNPQPAIFWQKEGSQNLLFSYQPPQSSSRFSVSQTGD------LTITNVQR 56

Query: 329 SDSGRFYCVAENRAGIADANFTLQVT 354
           SD G + C   N AG       L+VT
Sbjct: 57  SDVGYYICQTLNVAGSILTKAYLEVT 82


>gnl|CDD|143205 cd05728, Ig4_Contactin-2-like, Fourth Ig domain of the neural cell
           adhesion molecule contactin-2 and similar proteins.
           Ig4_Contactin-2-like: fourth Ig domain of the neural
           cell adhesion molecule contactin-2. Contactins are
           comprised of six Ig domains followed by four fibronectin
           type III (FnIII) domains anchored to the membrane by
           glycosylphosphatidylinositol. Contactin-2 (aliases
           TAG-1, axonin-1) facilitates cell adhesion by homophilic
           binding between molecules in apposed membranes. The
           first four Ig domains form the intermolecular binding
           fragment which arranges as a compact U-shaped module by
           contacts between Ig domains 1 and 4, and domains 2 and
           3. It has been proposed that a linear zipper-like array
           forms, from contactin-2 molecules alternatively provided
           by the two apposed membranes.
          Length = 85

 Score = 36.4 bits (84), Expect = 0.004
 Identities = 28/78 (35%), Positives = 36/78 (46%), Gaps = 13/78 (16%)

Query: 276 CRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQESDSGRFY 335
           C+    P  A  W  NG+ L      +S  RI V E G+      L +T    SDSG + 
Sbjct: 21  CKASGNPRPAYRWLKNGQPL------ASENRIEV-EAGD------LRITKLSLSDSGMYQ 67

Query: 336 CVAENRAGIADANFTLQV 353
           CVAEN+ G   A+  L V
Sbjct: 68  CVAENKHGTIYASAELAV 85


>gnl|CDD|143202 cd05725, Ig3_Robo, Third immunoglobulin (Ig)-like domain in Robo
           (roundabout) receptors.  Ig3_Robo: domain similar to the
           third immunoglobulin (Ig)-like domain in Robo
           (roundabout) receptors. Robo receptors play a role in
           the development of the central nervous system (CNS), and
           are receptors of Slit protein. Slit is a repellant
           secreted by the neural cells in the midline. Slit acts
           through Robo to prevent most neurons from crossing the
           midline from either side. Three mammalian Robo homologs
           (robo1, -2, and -3), and three mammalian Slit homologs
           (Slit-1,-2, -3), have been identified. Commissural
           axons, which cross the midline, express low levels of
           Robo; longitudinal axons, which avoid the midline,
           express high levels of Robo. robo1, -2, and -3 are
           expressed by commissural neurons in the vertebrate
           spinal cord and Slits 1, -2, -3 are expressed at the
           ventral midline. Robo-3 is a divergent member of the
           Robo family which instead of being a positive regulator
           of slit responsiveness, antagonizes slit responsiveness
           in precrossing axons.  The Slit-Robo interaction is
           mediated by the second leucine-rich repeat (LRR) domain
           of Slit and the two N-terminal Ig domains of Robo, Ig1
           and Ig2. The primary Robo binding site for Slit2 has
           been shown by surface plasmon resonance experiments and
           mutational analysis to be is the Ig1 domain, while the
           Ig2 domain has been proposed to harbor a weak secondary
           binding site.
          Length = 69

 Score = 35.8 bits (83), Expect = 0.004
 Identities = 21/85 (24%), Positives = 27/85 (31%), Gaps = 19/85 (22%)

Query: 272 ATVVCRVDSIPPAAISWYWN-GRLLLNNTAFSSYQRIFVIEQGEYE--RKSSLVLTNAQE 328
               C V   P   + W    G L                  G  E     SL + N   
Sbjct: 1   VEFQCEVGGDPVPTVLWRKEDGELPK----------------GRAEILDDKSLKIRNVTA 44

Query: 329 SDSGRFYCVAENRAGIADANFTLQV 353
            D G + C AEN  G  +A+ +L V
Sbjct: 45  GDEGSYTCEAENMVGKIEASASLTV 69


>gnl|CDD|178695 PLN03150, PLN03150, hypothetical protein; Provisional.
          Length = 623

 Score = 39.8 bits (93), Expect = 0.004
 Identities = 22/71 (30%), Positives = 33/71 (46%), Gaps = 1/71 (1%)

Query: 114 IPSLTFQSVRFLRDLNLARNPISKIEKGAFQFVPGLVKLDMSESRLEHISPEAFTGAKSL 173
           IP+     +R L+ +NL+ N I      +   +  L  LD+S +      PE+     SL
Sbjct: 434 IPN-DISKLRHLQSINLSGNSIRGNIPPSLGSITSLEVLDLSYNSFNGSIPESLGQLTSL 492

Query: 174 ESIKLNGNRLS 184
             + LNGN LS
Sbjct: 493 RILNLNGNSLS 503


>gnl|CDD|143258 cd05850, Ig1_Contactin-2, First Ig domain of contactin-2.
           Ig1_Contactin-2: First Ig domain of the neural cell
           adhesion molecule contactin-2-like. Contactins are
           comprised of six Ig domains followed by four fibronectin
           type III (FnIII) domains anchored to the membrane by
           glycosylphosphatidylinositol. Contactin-2 (TAG-1,
           axonin-1) facilitates cell adhesion by homophilic
           binding between molecules in apposed membranes. It may
           play a part in the neuronal processes of neurite
           outgrowth, axon guidance and fasciculation, and neuronal
           migration. The first four Ig domains form the
           intermolecular binding fragment, which arranges as a
           compact U-shaped module by contacts between IG domains 1
           and 4, and domains 2 and 3. The different contactins
           show different expression patterns in the central
           nervous system. During development and in adulthood,
           contactin-2 is transiently expressed in subsets of
           central and peripheral neurons. Contactin-2 is also
           expressed in retinal amacrine cells in the developing
           chick retina, corresponding to the period of formation
           and maturation of AC processes.
          Length = 94

 Score = 35.7 bits (82), Expect = 0.009
 Identities = 24/82 (29%), Positives = 36/82 (43%), Gaps = 12/82 (14%)

Query: 263 YVEAVSSENATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLV 322
           + E    E  T+ CR  + PPA   W  NG   +     S Y  +            +LV
Sbjct: 13  FPEGSPEEKVTLGCRARASPPATYRWKMNGT-EIKFAPESRYTLV----------AGNLV 61

Query: 323 LTNAQES-DSGRFYCVAENRAG 343
           + N Q++ D+G + C+A NR G
Sbjct: 62  INNPQKARDAGSYQCLAINRCG 83


>gnl|CDD|143184 cd04983, IgV_TCR_alpha_like, Immunoglobulin (Ig) variable (V)
           domain of T-cell receptor (TCR) alpha chain and similar
           proteins.  IgV_TCR_alpha: immunoglobulin (Ig) variable
           domain of the alpha chain of alpha/beta T-cell antigen
           receptors (TCRs). TCRs mediate antigen recognition by T
           lymphocytes, and are composed of alpha and beta, or
           gamma and delta, polypeptide chains with variable (V)
           and constant (C) regions. This group represents the
           variable domain of the alpha chain of TCRs and also
           includes the variable domain of delta chains of TCRs.
           Alpha/beta TCRs recognize antigen as peptide fragments
           presented by major histocompatibility complex (MHC)
           molecules. The variable domain of TCRs is responsible
           for antigen recognition, and is located at the
           N-terminus of the receptor.  Gamma/delta TCRs recognize
           intact protein antigens; they recognize proteins
           antigens directly and without antigen processing, and
           MHC independently of the bound peptide.
          Length = 109

 Score = 35.7 bits (83), Expect = 0.010
 Identities = 18/91 (19%), Positives = 33/91 (36%), Gaps = 7/91 (7%)

Query: 260 DSRYVEAVSSENATVVCRVDSIPPAAISWY-WNGR----LLLNNTAFSSYQRI--FVIEQ 312
             + +     EN T+ C   +     + WY          L+  ++    +    F    
Sbjct: 4   SPQSLSVQEGENVTLNCNYSTSTFYYLFWYRQYPGQGPQFLIYISSNGEEKEKGRFSATL 63

Query: 313 GEYERKSSLVLTNAQESDSGRFYCVAENRAG 343
            +  + SSL ++ AQ SDS  ++C      G
Sbjct: 64  DKSRKSSSLHISAAQLSDSAVYFCALSESGG 94


>gnl|CDD|143171 cd04970, Ig6_Contactin_like, Sixth Ig domain of contactin.
           Ig6_Contactin_like: Sixth Ig domain of contactins.
           Contactins are neural cell adhesion molecules and are
           comprised of six Ig domains followed by four fibronectin
           type III(FnIII) domains anchored to the membrane by
           glycosylphosphatidylinositol. The first four Ig domains
           form the intermolecular binding fragment, which arranges
           as a compact U-shaped module via contacts between Ig
           domains 1 and 4, and between Ig domains 2 and 3.
           Contactin-2 (TAG-1, axonin-1) may play a part in the
           neuronal processes of neurite outgrowth, axon guidance
           and fasciculation, and neuronal migration. This group
           also includes contactin-1 and contactin-5. The different
           contactins show different expression patterns in the
           central nervous system. During development and in
           adulthood, contactin-2 is transiently expressed in
           subsets of central and peripheral neurons. Contactin-5
           is expressed specifically in the rat postnatal nervous
           system, peaking at about 3 weeks postnatal, and a lack
           of contactin-5 (NB-2) results in an impairment of neur
           onal act ivity in the rat auditory system. Contactin-5
           is highly expressed in the adult human brain in the
           occipital lobe and in the amygdala. Contactin-1 is
           differentially expressed in tumor tissues and may,
           through a RhoA mechanism, facilitate invasion and
           metastasis of human lung adenocarcinoma.
          Length = 85

 Score = 35.2 bits (81), Expect = 0.010
 Identities = 22/91 (24%), Positives = 40/91 (43%), Gaps = 11/91 (12%)

Query: 270 ENATVVCRVDSIPPAAISWYW--NGRLLLNNTAFSSYQRIFVIEQ-GEYERKSSLVLTNA 326
           E+ T+ C     P   +++ W  NG  +  +     Y+R+   +  G+      L++ NA
Sbjct: 1   ESITLQCHASHDPTLDLTFTWSFNGVPIDFDKDGGHYRRVGGKDSNGD------LMIRNA 54

Query: 327 QESDSGRFYCVAENRAGIADANFTLQVTYRG 357
           Q   +G++ C A+       A+  L V  RG
Sbjct: 55  QLKHAGKYTCTAQTVVDSLSASADLIV--RG 83


>gnl|CDD|143277 cd05869, Ig5_NCAM-1, Fifth immunoglobulin (Ig)-like domain of
           Neural Cell Adhesion Molecule NCAM-1 (NCAM).
           Ig5_NCAM-1: The fifth immunoglobulin (Ig)-like domain of
           Neural Cell Adhesion Molecule NCAM-1 (NCAM). NCAM plays
           important roles in the development and regeneration of
           the central nervous system, in synaptogenesis and neural
           migration. NCAM mediates cell-cell and cell-substratum
           recognition and adhesion via homophilic (NCAM-NCAM) and
           heterophilic (NCAM-non-NCAM) interactions. NCAM is
           expressed as three major isoforms having different
           intracellular extensions. The extracellular portion of
           NCAM has five N-terminal Ig-like domains and two
           fibronectin type III domains. The double zipper adhesion
           complex model for NCAM homophilic binding involves Ig1,
           Ig2, and Ig3. By this model, Ig1 and Ig2 mediate
           dimerization of NCAM molecules situated on the same cell
           surface (cis interactions), and Ig3 domains mediate
           interactions between NCAM molecules expressed on the
           surface of opposing cells (trans interactions), through
           binding to the Ig1 and Ig2 domains. The adhesive ability
           of NCAM is modulated by the addition of polysialic acid
           chains to the fifth Ig-like domain.
          Length = 97

 Score = 35.3 bits (81), Expect = 0.012
 Identities = 29/104 (27%), Positives = 45/104 (43%), Gaps = 12/104 (11%)

Query: 254 KPEIRMDSRYVEAVSS----ENATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFV 309
           KP+I     YVE  ++    E  T+ C     P  +I+W  + R + +    +    I V
Sbjct: 2   KPKIT----YVENQTAMELEEQITLTCEASGDPIPSITWRTSTRNISSE-EKTLDGHIVV 56

Query: 310 IEQGEYERKSSLVLTNAQESDSGRFYCVAENRAGIADANFTLQV 353
                + R SSL L   Q +D+G + C A N  G    +  L+V
Sbjct: 57  ---RSHARVSSLTLKYIQYTDAGEYLCTASNTIGQDSQSMYLEV 97


>gnl|CDD|143237 cd05760, Ig2_PTK7, Second immunoglobulin (Ig)-like domain of
           protein tyrosine kinase (PTK) 7, also known as CCK4.
           Ig2_PTK7: domain similar to the second immunoglobulin
           (Ig)-like domain in protein tyrosine kinase (PTK) 7,
           also known as CCK4. PTK7 is a subfamily of the receptor
           protein tyrosine kinase family, and is referred to as an
           RPTK-like molecule. RPTKs transduce extracellular
           signals across the cell membrane, and play important
           roles in regulating cell proliferation, migration, and
           differentiation. PTK7 is organized as an extracellular
           portion having seven Ig-like domains, a single
           transmembrane region, and a cytoplasmic tyrosine
           kinase-like domain. PTK7 is considered a pseudokinase as
           it has several unusual residues in some of the highly
           conserved tyrosine kinase (TK) motifs; it is predicted
           to lack TK activity. PTK7 may function as a
           cell-adhesion molecule. PTK7 mRNA is expressed at high
           levels in placenta, melanocytes, liver, lung, pancreas,
           and kidney. PTK7 is overexpressed in several cancers,
           including melanoma and colon cancer lines.
          Length = 77

 Score = 34.9 bits (80), Expect = 0.013
 Identities = 26/86 (30%), Positives = 38/86 (44%), Gaps = 18/86 (20%)

Query: 273 TVVCRVDSIPPAAISWYWNGRLLLN---NTAFSSYQRIFVIEQGEYERKSSLVLTNAQES 329
           T+ C +D  P     W+ +G  L +   N + SS +R             +L L +A   
Sbjct: 2   TLRCHIDGHPRPTYQWFRDGTPLSDGQGNYSVSSKER-------------TLTLRSAGPD 48

Query: 330 DSGRFYCVAENRAG--IADANFTLQV 353
           DSG +YC A N  G   +  NFTL +
Sbjct: 49  DSGLYYCCAHNAFGSVCSSQNFTLSI 74


>gnl|CDD|143241 cd05764, Ig_2, Subgroup of the immunoglobulin (Ig) superfamily.
           Ig_2: subgroup of the immunoglobulin (Ig) domain found
           in the Ig superfamily. The Ig superfamily is a
           heterogenous group of proteins, built on a common fold
           comprised of a sandwich of two beta sheets. Members of
           the Ig superfamily are components of immunoglobulin,
           neuroglia, cell surface glycoproteins, such as T-cell
           receptors, CD2, CD4, CD8, and membrane glycoproteins,
           such as butyrophilin and chondroitin sulfate
           proteoglycan core protein. A predominant feature of most
           Ig domains is a disulfide bridge connecting the two
           beta-sheets with a tryptophan residue packed against the
           disulfide bond.
          Length = 74

 Score = 34.8 bits (80), Expect = 0.013
 Identities = 25/83 (30%), Positives = 38/83 (45%), Gaps = 13/83 (15%)

Query: 272 ATVVCRVDSIPPAAISWYW-NGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQESD 330
           AT+ C+    P  AI W   +G+L+ N++      R  V + G      +L +      D
Sbjct: 4   ATLRCKARGDPEPAIHWISPDGKLISNSS------RTLVYDNG------TLDILITTVKD 51

Query: 331 SGRFYCVAENRAGIADANFTLQV 353
           +G F C+A N AG A A   L +
Sbjct: 52  TGSFTCIASNAAGEATATVELHI 74


>gnl|CDD|143278 cd05870, Ig5_NCAM-2, Fifth immunoglobulin (Ig)-like domain of
           Neural Cell Adhesion Molecule NCAM-2 (also known as
           OCAM/mamFas II and RNCAM).  Ig5_NCAM-2: the fifth
           immunoglobulin (Ig)-like domain of Neural Cell Adhesion
           Molecule NCAM-2 (also known as OCAM/mamFas II and
           RNCAM). NCAM-2  is organized similarly to NCAM ,
           including five N-terminal Ig-like domains and two
           fibronectin type III domains. NCAM-2 is differentially
           expressed in the developing and mature olfactory
           epithelium (OE), and may function like NCAM, as an
           adhesion molecule.
          Length = 98

 Score = 35.3 bits (81), Expect = 0.013
 Identities = 25/87 (28%), Positives = 42/87 (48%), Gaps = 12/87 (13%)

Query: 262 RYVEAVSSENATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQ-----RIFVIEQGEYE 316
           +    V +  AT+ C+ +  P   I+W    +   +   FS        RI V  +G++ 
Sbjct: 9   KNETTVENGAATLSCKAEGEPIPEITW----KRASDGHTFSEGDKSPDGRIEV--KGQHG 62

Query: 317 RKSSLVLTNAQESDSGRFYCVAENRAG 343
            +SSL + + + SDSGR+ C A +R G
Sbjct: 63  -ESSLHIKDVKLSDSGRYDCEAASRIG 88


>gnl|CDD|143167 cd00099, IgV, Immunoglobulin variable domain (IgV).  IgV:
           Immunoglobulin variable domain (IgV). Members of the IgV
           family are components of immunoglobulin (Ig) and T cell
           receptors. The basic structure of Ig molecules is a
           tetramer of two light chains and two heavy chains linked
           by disulfide bonds. In Ig, each chain is composed of one
           variable domain (IgV) and one or more constant domains
           (IgC); these names reflect the fact that the variability
           in sequences is higher in the variable domain than in
           the constant domain. Within the variable domain, there
           are regions of even more variability called the
           hypervariable or complementarity-determining regions
           (CDRs) which are responsible for antigen binding. A
           predominant feature of most Ig domains is the disulfide
           bridge connecting 2 beta-sheets with a tryptophan
           residue packed against the disulfide bond.
          Length = 105

 Score = 35.4 bits (82), Expect = 0.015
 Identities = 17/97 (17%), Positives = 32/97 (32%), Gaps = 11/97 (11%)

Query: 264 VEAVSSENATVVCRV-DSIPPAAISWY---------WNGRLLLNNTAFSS-YQRIFVIEQ 312
           +     E+ T+ C    S     I WY             +  N + ++   +  F   +
Sbjct: 1   LSVSEGESVTLSCTYSGSFSSYYIFWYRQKPGKGPELLIYISSNGSQYAGGVKGRFSGTR 60

Query: 313 GEYERKSSLVLTNAQESDSGRFYCVAENRAGIADANF 349
              +   +L +++ Q  DS  +YC      G     F
Sbjct: 61  DSSKSSFTLTISSLQPEDSAVYYCAVSLSGGTYKLYF 97


>gnl|CDD|143267 cd05859, Ig4_PDGFR-alpha, Fourth immunoglobulin (Ig)-like domain of
           platelet-derived growth factor receptor (PDGFR) alpha.
           IG4_PDGFR-alpha: The fourth immunoglobulin (Ig)-like
           domain of platelet-derived growth factor receptor
           (PDGFR) alpha. PDGF is a potent mitogen for connective
           tissue cells. PDGF-stimulated processes are mediated by
           three different PDGFs (PDGF-A,-B, and C). PDGFR alpha
           binds to all three PDGFs, whereas the PDGFR beta (not
           included in this group) binds only to PDGF-B. PDGF alpha
           is organized as an extracellular component having five
           Ig-like domains, a transmembrane segment, and a
           cytoplasmic portion having protein tyrosine kinase
           activity. In mice, PDGFR alpha and PDGFR beta are
           essential for normal development.
          Length = 101

 Score = 34.8 bits (80), Expect = 0.019
 Identities = 24/84 (28%), Positives = 35/84 (41%), Gaps = 3/84 (3%)

Query: 270 ENATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQES 329
           E    V  V++ PP  I W  + R L+ N    +          E    S L L  A+E 
Sbjct: 19  EVKEFVVEVEAYPPPQIRWLKDNRTLIENLTEITTS---EHNVQETRYVSKLKLIRAKEE 75

Query: 330 DSGRFYCVAENRAGIADANFTLQV 353
           DSG +  +A+N   +    F LQ+
Sbjct: 76  DSGLYTALAQNEDAVKSYTFALQI 99


>gnl|CDD|143317 cd07693, Ig1_Robo, First immunoglobulin (Ig)-like domain in Robo
           (roundabout) receptors and similar proteins.  Ig1_Robo:
           domain similar to the first immunoglobulin (Ig)-like
           domain in Robo (roundabout) receptors. Robo receptors
           play a role in the development of the central nervous
           system (CNS), and are receptors of Slit protein. Slit is
           a repellant secreted by the neural cells in the midline.
           Slit acts through Robo to prevent most neurons from
           crossing the midline from either side. Three mammalian
           Robo homologs (robo1, -2, and -3), and three mammalian
           Slit homologs (Slit-1,-2, -3), have been identified.
           Commissural axons, which cross the midline, express low
           levels of Robo; longitudinal axons, which avoid the
           midline, express high levels of Robo. robo1, -2, and -3
           are expressed by commissural neurons in the vertebrate
           spinal cord and Slits 1, -2, -3 are expressed at the
           ventral midline. Robo-3 is a divergent member of the
           Robo family which instead of being a positive regulator
           of slit responsiveness, antagonizes slit responsiveness
           in precrossing axons.  The Slit-Robo interaction is
           mediated by the second leucine-rich repeat (LRR) domain
           of Slit and the two N-terminal Ig domains of Robo, Ig1
           and Ig2. The primary Robo binding site for Slit2 has
           been shown by surface plasmon resonance experiments and
           mutational analysis to be is the Ig1 domain, while the
           Ig2 domain has been proposed to harbor a weak secondary
           binding site.
          Length = 100

 Score = 34.4 bits (79), Expect = 0.023
 Identities = 24/85 (28%), Positives = 38/85 (44%), Gaps = 3/85 (3%)

Query: 270 ENATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQES 329
           + AT+ C+ +  P   I W  NG+ L  +       RI +     +  +  +V      S
Sbjct: 17  DPATLNCKAEGRPTPTIQWLKNGQPLETDKDDPRSHRIVLPSGSLFFLR--VVHGRKGRS 74

Query: 330 DSGRFYCVAENRAGIADA-NFTLQV 353
           D G + CVA N  G A + N +L+V
Sbjct: 75  DEGVYVCVAHNSLGEAVSRNASLEV 99


>gnl|CDD|143215 cd05738, Ig2_RPTP_IIa_LAR_like, Second immunoglobulin (Ig)-like
           domain of  the receptor protein tyrosine phosphatase
           (RPTP)-F, also known as LAR.  Ig2_RPTP_IIa_LAR_like:
           domain similar to the second immunoglobulin (Ig)-like
           domain found in the receptor protein tyrosine
           phosphatase (RPTP)-F, also known as LAR. LAR belongs to
           the RPTP type IIa subfamily. Members of this subfamily
           are cell adhesion molecule-like proteins involved in
           central nervous system (CNS) development. They have
           large extracellular portions, comprised of multiple
           Ig-like domains and two to nine fibronectin type III
           (FNIII) domains, and a cytoplasmic portion having two
           tandem phosphatase domains.
          Length = 74

 Score = 33.9 bits (77), Expect = 0.026
 Identities = 21/74 (28%), Positives = 33/74 (44%), Gaps = 14/74 (18%)

Query: 272 ATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYE--RKSSLVLTNAQES 329
           AT++C     P   I+W            F  +  +     G  +  R  +L + N++ES
Sbjct: 1   ATMLCAASGNPDPEITW------------FKDFLPVDTTSNGRIKQLRSGALQIENSEES 48

Query: 330 DSGRFYCVAENRAG 343
           D G++ CVA N AG
Sbjct: 49  DQGKYECVATNSAG 62


>gnl|CDD|143175 cd04974, Ig3_FGFR, Third immunoglobulin (Ig)-like domain of
           fibroblast growth factor receptor (FGFR).  Ig3_FGFR:
           third immunoglobulin (Ig)-like domain of fibroblast
           growth factor receptor (FGFR). Fibroblast growth factors
           (FGFs) participate in morphogenesis, development,
           angiogenesis, and wound healing. These FGF-stimulated
           processes are mediated by four FGFR tyrosine kinases
           (FGRF1-4). FGFRs are comprised of an extracellular
           portion consisting of three Ig-like domains, a
           transmembrane helix, and a cytoplasmic portion having
           protein tyrosine kinase activity. The highly conserved
           Ig-like domains 2 and 3, and the linker region between
           D2 and D3 define a general binding site for FGFs.
          Length = 90

 Score = 33.9 bits (78), Expect = 0.030
 Identities = 20/89 (22%), Positives = 34/89 (38%), Gaps = 8/89 (8%)

Query: 271 NATVVCRVDSIPPAAISWYWNGRLLLNNTAFS----SYQRI-FVIEQGEYERKSS-LVLT 324
           +    C+V S     I W     + +N + +      Y  +  V      + +S  L L 
Sbjct: 3   DVEFHCKVYSDAQPHIQWL--KHVEVNGSKYGPDGLPYVTVLKVAGINTTDNESEVLYLR 60

Query: 325 NAQESDSGRFYCVAENRAGIADANFTLQV 353
           N    D+G + C+A N  G +  +  L V
Sbjct: 61  NVSFDDAGEYTCLAGNSIGPSHHSAWLTV 89


>gnl|CDD|197688 smart00370, LRR, Leucine-rich repeats, outliers. 
          Length = 24

 Score = 31.9 bits (74), Expect = 0.036
 Identities = 13/24 (54%), Positives = 17/24 (70%)

Query: 98  LTNLIEIDLSDNLLTSIPSLTFQS 121
           L NL E+DLS+N L+S+P   FQ 
Sbjct: 1   LPNLRELDLSNNQLSSLPPGAFQG 24



 Score = 28.9 bits (66), Expect = 0.42
 Identities = 9/19 (47%), Positives = 12/19 (63%)

Query: 52 QVLDMSGNNLQILPKEAFR 70
          + LD+S N L  LP  AF+
Sbjct: 5  RELDLSNNQLSSLPPGAFQ 23



 Score = 26.9 bits (61), Expect = 2.2
 Identities = 10/20 (50%), Positives = 15/20 (75%)

Query: 125 LRDLNLARNPISKIEKGAFQ 144
           LR+L+L+ N +S +  GAFQ
Sbjct: 4   LRELDLSNNQLSSLPPGAFQ 23


>gnl|CDD|197687 smart00369, LRR_TYP, Leucine-rich repeats, typical (most populated)
           subfamily. 
          Length = 24

 Score = 31.9 bits (74), Expect = 0.036
 Identities = 13/24 (54%), Positives = 17/24 (70%)

Query: 98  LTNLIEIDLSDNLLTSIPSLTFQS 121
           L NL E+DLS+N L+S+P   FQ 
Sbjct: 1   LPNLRELDLSNNQLSSLPPGAFQG 24



 Score = 28.9 bits (66), Expect = 0.42
 Identities = 9/19 (47%), Positives = 12/19 (63%)

Query: 52 QVLDMSGNNLQILPKEAFR 70
          + LD+S N L  LP  AF+
Sbjct: 5  RELDLSNNQLSSLPPGAFQ 23



 Score = 26.9 bits (61), Expect = 2.2
 Identities = 10/20 (50%), Positives = 15/20 (75%)

Query: 125 LRDLNLARNPISKIEKGAFQ 144
           LR+L+L+ N +S +  GAFQ
Sbjct: 4   LRELDLSNNQLSSLPPGAFQ 23


>gnl|CDD|143178 cd04977, Ig1_NCAM-1_like, First immunoglobulin (Ig)-like domain of
           neural cell adhesion molecule NCAM-1 and similar
           proteins.  Ig1_NCAM-1 like: first immunoglobulin
           (Ig)-like domain of neural cell adhesion molecule
           NCAM-1. NCAM-1 plays important roles in the development
           and regeneration of the central nervous system, in
           synaptogenesis and neural migration. NCAM mediates
           cell-cell and cell-substratum recognition and adhesion
           via homophilic (NCAM-NCAM), and heterophilic
           (NCAM-nonNCAM), interactions. NCAM is expressed as three
           major isoforms having different intracellular
           extensions. The extracellular portion of NCAM has five
           N-terminal Ig-like domains and two fibronectin type III
           domains. The double zipper adhesion complex model for
           NCAM homophilic binding involves the Ig1, Ig2, and Ig3
           domains. By this model, Ig1 and Ig2 mediate dimerization
           of NCAM molecules situated on the same cell surface (cis
           interactions), and Ig3 domains mediate interactions
           between NCAM molecules expressed on the surface of
           opposing cells (trans interactions), through binding to
           the Ig1 and Ig2 domains. The adhesive ability of NCAM is
           modulated by the addition of polysialic acid chains to
           the fifth Ig-like domain. Also included in this group is
           NCAM-2 (also known as OCAM/mamFas II and RNCAM).  NCAM-2
           is differentially expressed in the developing and mature
           olfactory epithelium (OE).
          Length = 92

 Score = 33.6 bits (77), Expect = 0.044
 Identities = 24/82 (29%), Positives = 40/82 (48%), Gaps = 10/82 (12%)

Query: 263 YVEAVSSENATVVCRVDSIPPAAISWYW-NGRLLLNNTAFSSYQRIFVIEQGEYERKSSL 321
             E    E+   +C+V    P  ISW+  NG  L+      + Q+I V++  +   +S+L
Sbjct: 9   QGEISVGESKFFLCQVIG-EPKDISWFSPNGEKLV------TQQQISVVQNDDV--RSTL 59

Query: 322 VLTNAQESDSGRFYCVAENRAG 343
            + NA   D+G + CVA +  G
Sbjct: 60  TIYNANIEDAGIYKCVATDAKG 81


>gnl|CDD|219514 pfam07686, V-set, Immunoglobulin V-set domain.  This domain is
           found in antibodies as well as neural protein P0 and
           CTL4 amongst others.
          Length = 114

 Score = 34.1 bits (78), Expect = 0.048
 Identities = 19/95 (20%), Positives = 28/95 (29%), Gaps = 15/95 (15%)

Query: 260 DSRYVEAVSSENATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKS 319
             R V      + T+ C   S    + S YW  + L        +            R  
Sbjct: 7   PPRPVTVAEGGSVTLPCSF-SSSSGSTSVYWYKQPLGKGPELIIHYVTSTPNGKVGPRFK 65

Query: 320 --------------SLVLTNAQESDSGRFYCVAEN 340
                         SL ++N + SDSG + C   N
Sbjct: 66  GRVTLSGNGSKNDFSLTISNLRLSDSGTYTCAVSN 100


>gnl|CDD|188093 TIGR00864, PCC, polycystin cation channel protein.  The Polycystin
           Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a
           huge protein of 4303aas. Its repeated leucine-rich (LRR)
           segment is found in many proteins. It contains 16
           polycystic kidney disease (PKD) domains, one
           LDL-receptor class A domain, one C-type lectin family
           domain, and 16-18 putative TMSs in positions between
           residues 2200 and 4100. Polycystin-L has been shown to
           be a cation (Na+, K+ and Ca2+) channel that is activated
           by Ca2+. Two members of the PCC family (polycystin 1 and
           2) are mutated in autosomal dominant polycystic kidney
           disease, and polycystin-L is deleted in mice with renal
           and retinal defects. Note: this model is restricted to
           the amino half for technical reasons.
          Length = 2740

 Score = 36.2 bits (83), Expect = 0.062
 Identities = 23/82 (28%), Positives = 35/82 (42%), Gaps = 3/82 (3%)

Query: 178 LNGNRLSHFPVRSVEPLLKLMMIELHDNPWVCDCNMRSIKMWLADKKNVPVQP---ACTG 234
           ++ N++S         L  L  I+L  NP+ CDC +  +  W  +K     QP    C G
Sbjct: 2   ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKGVKVRQPEAALCAG 61

Query: 235 PERLSGKVFSDLHADDFACKPE 256
           P  L+G+    +   D  C  E
Sbjct: 62  PGALAGQPLLGIPLLDSGCDEE 83


>gnl|CDD|143240 cd05763, Ig_1, Subgroup of the immunoglobulin (Ig) superfamily.
           Ig_1: subgroup of the immunoglobulin (Ig) domain found
           in the Ig superfamily. The Ig superfamily is a
           heterogenous group of proteins, built on a common fold
           comprised of a sandwich of two beta sheets. Members of
           the Ig superfamily are components of immunoglobulin,
           neuroglia, cell surface glycoproteins, such as T-cell
           receptors, CD2, CD4, CD8, and membrane glycoproteins,
           such as butyrophilin and chondroitin sulfate
           proteoglycan core protein. A predominant feature of most
           Ig domains is a disulfide bridge connecting the two
           beta-sheets with a tryptophan residue packed against the
           disulfide bond.
          Length = 75

 Score = 32.6 bits (74), Expect = 0.076
 Identities = 20/82 (24%), Positives = 36/82 (43%), Gaps = 8/82 (9%)

Query: 272 ATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQESDS 331
           A + C     P   I+W  +G     +   +  +R+ V+ + +        + + +  D+
Sbjct: 1   ARLECAATGHPTPQIAWQKDGG---TDFPAARERRMHVMPEDD-----VFFIVDVKIEDT 52

Query: 332 GRFYCVAENRAGIADANFTLQV 353
           G + C A+N AG   AN TL V
Sbjct: 53  GVYSCTAQNTAGSISANATLTV 74


>gnl|CDD|143273 cd05865, Ig1_NCAM-1, First immunoglobulin (Ig)-like domain of
           neural cell adhesion molecule NCAM-1.  Ig1_NCAM-1: first
           immunoglobulin (Ig)-like domain of neural cell adhesion
           molecule NCAM-1. NCAM-1 plays important roles in the
           development and regeneration of the central nervous
           system, in synaptogenesis and neural migration. NCAM
           mediates cell-cell and cell-substratum recognition and
           adhesion via homophilic (NCAM-NCAM), and heterophilic
           (NCAM-nonNCAM), interactions. NCAM is expressed as three
           major isoforms having different intracellular
           extensions. The extracellular portion of NCAM has five
           N-terminal Ig-like domains and two fibronectin type III
           domains. The double zipper adhesion complex model for
           NCAM homophilic binding involves the Ig1, Ig2, and Ig3
           domains. By this model, Ig1 and Ig2 mediate dimerization
           of NCAM molecules situated on the same cell surface (cis
           interactions), and Ig3 domains mediate interactions
           between NCAM molecules expressed on the surface of
           opposing cells (trans interactions), through binding to
           the Ig1 and Ig2 domains. The adhesive ability of NCAM is
           modulated by the addition of polysialic acid chains to
           the fifth Ig-like domain.
          Length = 96

 Score = 32.7 bits (74), Expect = 0.091
 Identities = 21/59 (35%), Positives = 28/59 (47%), Gaps = 8/59 (13%)

Query: 286 ISWYW-NGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQESDSGRFYCVAENRAG 343
           ISW+  NG  L  N      QRI V+   +Y   S+L + NA   D+G + CV  N   
Sbjct: 33  ISWFSPNGEKLTPNQ-----QRISVVRNDDY--SSTLTIYNANIDDAGIYKCVVSNEDE 84


>gnl|CDD|143213 cd05736, Ig2_Follistatin_like, Second immunoglobulin (Ig)-like
           domain of a follistatin-like molecule encoded by the
           Mahya gene and similar proteins.  Ig2_Follistatin_like:
           domain similar to the second immunoglobulin (Ig)-like
           domain found in a follistatin-like molecule encoded by
           the CNS-related Mahya gene. Mahya genes have been
           retained in certain Bilaterian branches during
           evolution.  They are conserved in Hymenoptera and
           Deuterostomes, but are absent from other metazoan
           species such as fruit fly and nematode. Mahya proteins
           are secretory, with a follistatin-like domain
           (Kazal-type serine/threonine protease inhibitor domain
           and EF-hand calcium-binding domain), two Ig-like
           domains, and a novel C-terminal domain. Mahya may be
           involved in learning and memory and in processing of
           sensory information in Hymenoptera and vertebrates.
           Follistatin is a secreted, multidomain protein that
           binds activins with high affinity and antagonizes their
           signaling.
          Length = 76

 Score = 32.2 bits (73), Expect = 0.099
 Identities = 19/73 (26%), Positives = 38/73 (52%), Gaps = 9/73 (12%)

Query: 272 ATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQESDS 331
           A++ C  + IP   ++W  NG  +    +    +++ +I  G     S L ++N +  D+
Sbjct: 1   ASLRCHAEGIPLPRLTWLKNGMDITPKLS----KQLTLIANG-----SELHISNVRYEDT 51

Query: 332 GRFYCVAENRAGI 344
           G + C+A+N AG+
Sbjct: 52  GAYTCIAKNEAGV 64


>gnl|CDD|143250 cd05773, Ig8_hNephrin_like, Eighth immunoglobulin-like domain of
           nephrin.  Ig8_hNephrin_like: domain similar to the
           eighth immunoglobulin-like domain in human nephrin.
           Nephrin is an integral component of the slit diaphragm,
           and is a central component of the glomerular
           ultrafilter. Nephrin plays a structural role, and has a
           role in signaling. Nephrin is a transmembrane protein
           having a short intracellular portion, and an
           extracellular portion comprised of eight Ig-like
           domains, and one fibronectin type III-like domain. The
           extracellular portions of nephrin, from neighboring foot
           processes of separate podocyte cells, may interact with
           each other, and in association with other components of
           the slit diaphragm, form a porous molecular sieve within
           the slit pore.  The intracellular portion of nephrin is
           associated with linker proteins, which connect nephrin
           to the actin cytoskeleton. The intracellular portion is
           tyrosine phosphorylated, and mediates signaling from the
           slit diaphragm into the podocytes.
          Length = 109

 Score = 33.0 bits (75), Expect = 0.10
 Identities = 25/94 (26%), Positives = 34/94 (36%), Gaps = 13/94 (13%)

Query: 268 SSENATVVCRVDSIPPAAISWYWNG-RLLLNNTAFSSYQRIFVIEQGEYE---RKSSLVL 323
            S +A +VC+   +P     W  NG  L L N  +         E  E+      S L +
Sbjct: 22  GSSDANLVCQAQGVPRVQFRWAKNGVPLDLGNPRYE--------ETTEHTGTVHTSILTI 73

Query: 324 TNAQES-DSGRFYCVAENRAGIADANFTLQVTYR 356
            N   + D   F C A N  G    +  L  T R
Sbjct: 74  INVSAALDYALFTCTAHNSLGEDSLDIQLVSTSR 107


>gnl|CDD|143264 cd05856, Ig2_FGFRL1-like, Second immunoglobulin (Ig)-like domain of
           fibroblast growth factor (FGF) receptor_like-1(FGFRL1). 
           Ig2_FGFRL1-like: second immunoglobulin (Ig)-like domain
           of fibroblast growth factor (FGF)
           receptor_like-1(FGFRL1). FGFRL1 is comprised of a signal
           peptide, three extracellular Ig-like modules, a
           transmembrane segment, and a short intracellular domain.
           FGFRL1 is expressed preferentially in skeletal tissues.
           Similar to FGF receptors, the expressed protein
           interacts specifically with heparin and with FGF2.
           FGFRL1 does not have a protein tyrosine kinase domain at
           its C terminus; neither does its cytoplasmic domain
           appear to interact with a signaling partner. It has been
           suggested that FGFRL1 may not have any direct signaling
           function, but instead acts as a decoy receptor trapping
           FGFs and preventing them from binding other receptors.
          Length = 82

 Score = 31.7 bits (72), Expect = 0.19
 Identities = 22/79 (27%), Positives = 33/79 (41%), Gaps = 13/79 (16%)

Query: 276 CRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKS-SLVLTNAQESDSGRF 334
           C     P   I+W  + + L               E GE  +K  +L L N +  DSG++
Sbjct: 16  CVASGNPRPDITWLKDNKPLTPT------------EIGESRKKKWTLSLKNLKPEDSGKY 63

Query: 335 YCVAENRAGIADANFTLQV 353
            C   NRAG  +A + + V
Sbjct: 64  TCHVSNRAGEINATYKVDV 82


>gnl|CDD|143199 cd05722, Ig1_Neogenin, First immunoglobulin (Ig)-like domain in
           neogenin and similar proteins.  Ig1_Neogenin: first
           immunoglobulin (Ig)-like domain in neogenin and related
           proteins. Neogenin  is a cell surface protein which is
           expressed in the developing nervous system of vertebrate
           embryos in the growing nerve cells. It is also expressed
           in other embryonic tissues, and may play a general role
           in developmental processes such as cell migration,
           cell-cell recognition, and tissue growth regulation.
           Included in this group is the tumor suppressor protein
           DCC, which is deleted in colorectal carcinoma . DCC and
           neogenin each have four Ig-like domains followed by six
           fibronectin type III domains, a transmembrane domain,
           and an intracellular domain.
          Length = 95

 Score = 32.1 bits (73), Expect = 0.19
 Identities = 23/80 (28%), Positives = 33/80 (41%), Gaps = 15/80 (18%)

Query: 266 AVSSENATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTN 325
           AV      + C  +  PP  I W  +G LL       S +R   +  G      SL++T+
Sbjct: 11  AVRGGPVVLNCSAEGEPPPKIEWKKDGVLL----NLVSDERRQQLPNG------SLLITS 60

Query: 326 AQES-----DSGRFYCVAEN 340
              S     D G + CVA+N
Sbjct: 61  VVHSKHNKPDEGFYQCVAQN 80


>gnl|CDD|191413 pfam05970, PIF1, PIF1-like helicase.  This family includes
           homologues of the PIF1 helicase, which inhibits
           telomerase activity and is cell cycle regulated. This
           family includes a large number of largely
           uncharacterized plant proteins. This family includes a
           P-loop motif that is involved in nucleotide binding.
          Length = 364

 Score = 33.9 bits (78), Expect = 0.21
 Identities = 27/116 (23%), Positives = 48/116 (41%), Gaps = 20/116 (17%)

Query: 462 LHSVINISNPDLINDTRKPEGLSPE----PHNDDVLFQNNYWNQNIRQPTNSELGFDSND 517
           + ++++   PD++ ++  P  L       P N+DV   NNY    + Q    E  + S+D
Sbjct: 243 IEAIVSEVYPDIVQNSTDPNYLCERAILCPTNEDVDEINNY---ILSQLPGEEKIYLSSD 299

Query: 518 KTPIIDGVSIGGELDDNYPPDYGLPIVGQGQNELLPNNIHPNAKTLRVWQRGVPVL 573
              I    +   + D  YP ++         N L  N +  +   L+V   G PV+
Sbjct: 300 S--ISKSDTDIPDDDALYPTEF--------LNSLKANGLPNHVLKLKV---GAPVM 342


>gnl|CDD|143285 cd05877, Ig_LP_like, Immunoglobulin (Ig)-like domain of human
           cartilage link protein (LP).  Ig_LP_like: immunoglobulin
           (Ig)-like domain similar to that that found in human
           cartilage link protein (LP). In cartilage,
           chondroitin-keratan sulfate proteoglycan (CSPG),
           aggrecan, forms cartilage link protein stabilized
           aggregates with hyaluronan (HA). These aggregates
           contribute to the tissue's load bearing properties.
           Aggregates having other CSPGs substituting for aggrecan
           may contribute to the structural integrity of many
           different tissues. Members of the vertebrate HPLN
           (hyaluronan/HA and proteoglycan binding link) protein
           family are physically linked adjacent to CSPG genes.
          Length = 106

 Score = 31.9 bits (73), Expect = 0.22
 Identities = 26/108 (24%), Positives = 45/108 (41%), Gaps = 22/108 (20%)

Query: 270 ENATVVCRVDSIPPAA------ISWYW--NGRLLLNNT---------AFSSYQ-RIFVIE 311
            N T+ CR    P  +      + W    +  L   +          ++ SYQ R+F+  
Sbjct: 3   GNVTLPCRYHYEPELSAPRKIRVKWTKLESDYLKEEDVLVAIGTRHKSYGSYQGRVFLRR 62

Query: 312 QGEYERKSSLVLTNAQESDSGRFYCVAENRAGIADANFTLQVTYRGVG 359
             +    +SLV+T+ +  D GR+ C  E   G+ D +  + +  RGV 
Sbjct: 63  AHD--LDASLVITDLRLEDYGRYRC--EVIDGLEDESVVVALRLRGVV 106


>gnl|CDD|143214 cd05737, Ig_Myomesin_like_C, C-temrinal immunoglobulin (Ig)-like
           domain of myomesin and M-protein.  Ig_Myomesin_like_C:
           domain similar to the C-temrinal immunoglobulin
           (Ig)-like domain of myomesin and M-protein. Myomesin and
           M-protein are both structural proteins localized to the
           M-band, a transverse structure in the center of the
           sarcomere, and are candidates for M-band bridges. Both
           proteins are modular, consisting mainly of repetitive
           Ig-like and fibronectin type III (FnIII) domains.
           Myomesin is expressed in all types of vertebrate
           striated muscle; M-protein has a muscle-type specific
           expression pattern. Myomesin is present in both slow and
           fast fibers; M-protein is present only in fast fibers.
           It has been suggested that myomesin acts as a molecular
           spring with alternative splicing as a means of modifying
           its elasticity.
          Length = 92

 Score = 31.3 bits (71), Expect = 0.27
 Identities = 24/78 (30%), Positives = 37/78 (47%), Gaps = 8/78 (10%)

Query: 276 CRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQESDSGRFY 335
           C V   P   +SW  N + L    A S +  + V EQG+Y   +SL +      DSG++ 
Sbjct: 23  CTVFGDPDPEVSWLKNDQAL----ALSDHYNVKV-EQGKY---ASLTIKGVSSEDSGKYG 74

Query: 336 CVAENRAGIADANFTLQV 353
            V +N+ G    + T+ V
Sbjct: 75  IVVKNKYGGETVDVTVSV 92


>gnl|CDD|219745 pfam08205, C2-set_2, CD80-like C2-set immunoglobulin domain.  These
           domains belong to the immunoglobulin superfamily.
          Length = 89

 Score = 31.2 bits (71), Expect = 0.29
 Identities = 26/83 (31%), Positives = 30/83 (36%), Gaps = 9/83 (10%)

Query: 264 VEAVSSENATVV--CRV-DSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSS 320
           V  +  EN  VV  C      P   I+WY +GR L   T  S        E G Y   S+
Sbjct: 7   VSLLEGENLEVVATCSSAGGKPAPRITWYLDGRELEAITTSSE----QDPESGLYTVTST 62

Query: 321 LVLTNAQESDSGR-FYCVAENRA 342
           L L      D GR   C     A
Sbjct: 63  LKLV-PSREDHGRSLTCQVSYGA 84


>gnl|CDD|143220 cd05743, Ig_Perlecan_D2_like, Immunoglobulin (Ig)-like domain II
           (D2) of the human basement membrane heparan sulfate
           proteoglycan perlecan, also known as HSPG2.
           Ig_Perlecan_D2_like: the immunoglobulin (Ig)-like domain
           II (D2) of the human basement membrane heparan sulfate
           proteoglycan perlecan, also known as HSPG2. Perlecan
           consists of five domains. Domain I has three putative
           heparan sulfate attachment sites; domain II has four LDL
           receptor-like repeats, and one Ig-like repeat; domain
           III resembles the short arm of laminin chains; domain IV
           has multiple Ig-like repeats (21 repeats in human
           perlecan); and domain V resembles the globular G domain
           of the laminin A chain and internal repeats of EGF.
           Perlecan may participate in a variety of biological
           functions including cell binding, LDL-metabolism,
           basement membrane assembly and selective permeability,
           calcium binding, and growth- and neurite-promoting
           activities.
          Length = 78

 Score = 30.9 bits (70), Expect = 0.31
 Identities = 19/74 (25%), Positives = 29/74 (39%), Gaps = 9/74 (12%)

Query: 270 ENATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQES 329
           E     C    +P   I+W       LN        R+ +  +G Y    +L + + +ES
Sbjct: 2   ETVEFTCVATGVPTPIINWR------LNWGHVPDSARVSITSEGGY---GTLTIRDVKES 52

Query: 330 DSGRFYCVAENRAG 343
           D G + C A N  G
Sbjct: 53  DQGAYTCEAINTRG 66


>gnl|CDD|143302 cd05894, Ig_C5_MyBP-C, C5 immunoglobulin (Ig) domain of cardiac
           myosin binding protein C (MyBP-C).  Ig_C5_MyBP_C : the
           C5 immunoglobulin (Ig) domain of cardiac myosin binding
           protein C (MyBP-C). MyBP_C consists of repeated domains,
           Ig and fibronectin type 3, and various linkers. Three
           isoforms of MYBP_C exist and are included in this group:
           cardiac(c), and fast and slow skeletal muscle (s)
           MyBP_C. cMYBP_C has insertions between and inside
           domains and an additional cardiac-specific Ig domain at
           the N-terminus. For cMYBP_C  an interaction has been
           demonstrated between this C5 domain and the Ig C8
           domain.
          Length = 86

 Score = 31.0 bits (70), Expect = 0.36
 Identities = 10/35 (28%), Positives = 16/35 (45%)

Query: 319 SSLVLTNAQESDSGRFYCVAENRAGIADANFTLQV 353
           SS V+  A+  D G +     N  G   A+  ++V
Sbjct: 52  SSFVIEGAEREDEGVYTITVTNPVGEDHASLFVKV 86


>gnl|CDD|143284 cd05876, Ig3_L1-CAM, Third immunoglobulin (Ig)-like domain of the
           L1 cell adhesion molecule (CAM).  Ig3_L1-CAM:  third
           immunoglobulin (Ig)-like domain of the L1 cell adhesion
           molecule (CAM). L1 belongs to the L1 subfamily of cell
           adhesion molecules (CAMs) and is comprised of an
           extracellular region having six Ig-like domains, five
           fibronectin type III domains, a transmembrane region and
           an intracellular domain. L1 is primarily expressed in
           the nervous system and is involved in its development
           and function. L1 is associated with an X-linked
           recessive disorder, X-linked hydrocephalus, MASA
           syndrome, or spastic paraplegia type 1, that involves
           abnormalities of axonal growth. This group also contains
           the chicken neuron-glia cell adhesion molecule, Ng-CAM.
          Length = 71

 Score = 30.3 bits (68), Expect = 0.48
 Identities = 22/82 (26%), Positives = 34/82 (41%), Gaps = 14/82 (17%)

Query: 273 TVVCRVDSIPPAAISW-YWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQESDS 331
            + C  + +P   + W   +G L  N T   +  +             +L L N  ESD 
Sbjct: 2   VLECIAEGLPTPEVHWDRIDGPLSPNRTKKLNNNK-------------TLQLDNVLESDD 48

Query: 332 GRFYCVAENRAGIADANFTLQV 353
           G + C AEN  G A  ++T+ V
Sbjct: 49  GEYVCTAENSEGSARHHYTVTV 70


>gnl|CDD|143272 cd05864, Ig2_VEGFR-2, Second immunoglobulin (Ig)-like domain of
           vascular endothelial growth factor receptor 2 (VEGFR-2).
            Ig2_VEGF-2: Second immunoglobulin (Ig)-like domain of
           vascular endothelial growth factor receptor 2 (VEGFR-2).
           The VEGFRs have an extracellular component with seven
           Ig-like domains, a transmembrane segment, and an
           intracellular tyrosine kinase domain interrupted by a
           kinase-insert domain. VEGFRs bind VEGFs with high
           affinity at the Ig-like domains. VEGFR-2 (KDR/Flk-1) is
           a major mediator of the mitogenic, angiogenic and
           microvascular permeability-enhancing effects of VEGF-A;
           VEGF-A is important to the growth and maintenance of
           vascular endothelial cells and to the development of new
           blood- and lymphatic-vessels in physiological and
           pathological states. VEGF-A also interacts with VEGFR-1,
           which it binds more strongly than VEGFR-2.  VEGFR-2 and
           -1 may mediate a chemotactic and a survival signal in
           hematopoietic stem cells or leukemia cells.
          Length = 70

 Score = 30.3 bits (68), Expect = 0.48
 Identities = 16/59 (27%), Positives = 24/59 (40%), Gaps = 14/59 (23%)

Query: 282 PPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQESDSGRFYCVAEN 340
           PP  + WY NG+L++ N  F              +R   L +    E D+G +  V  N
Sbjct: 11  PPPEVKWYKNGQLIVLNHTF--------------KRGVHLTIYEVTEKDAGNYTVVLTN 55


>gnl|CDD|143262 cd05854, Ig6_Contactin-2, Sixth Ig domain of contactin-2.
           Ig6_Contactin-2: Sixth Ig domain of the neural cell
           adhesion molecule contactin-2-like. Contactins are
           comprised of six Ig domains followed by four fibronectin
           type III (FnIII) domains anchored to the membrane by
           glycosylphosphatidylinositol. Contactin-2 (TAG-1,
           axonin-1) facilitates cell adhesion by homophilic
           binding between molecules in apposed membranes. It may
           play a part in the neuronal processes of neurite
           outgrowth, axon guidance and fasciculation, and neuronal
           migration. The first four Ig domains form the
           intermolecular binding fragment, which arranges as a
           compact U-shaped module by contacts between IG domains 1
           and 4, and domains 2 and 3. The different contactins
           show different expression patterns in the central
           nervous system. During development and in adulthood,
           contactin-2 is transiently expressed in subsets of
           central and peripheral neurons. Contactin-2 is also
           expressed in retinal amacrine cells in the developing
           chick retina, corresponding to the period of formation
           and maturation of AC proce sses.
          Length = 85

 Score = 30.4 bits (68), Expect = 0.60
 Identities = 24/90 (26%), Positives = 36/90 (40%), Gaps = 15/90 (16%)

Query: 270 ENATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKS------SLVL 323
           EN T+ C     P   +++ W+    L++              G Y R         LV+
Sbjct: 1   ENLTLQCHASHDPTMDLTFTWS----LDDFPID-----LDKPNGHYRRMEVKETIGDLVI 51

Query: 324 TNAQESDSGRFYCVAENRAGIADANFTLQV 353
            NAQ S +G + C A+     A A+ TL V
Sbjct: 52  VNAQLSHAGTYTCTAQTVVDSASASATLVV 81


>gnl|CDD|143276 cd05868, Ig4_NrCAM, Fourth immunoglobulin (Ig)-like domain of NrCAM
           (NgCAM-related cell adhesion molecule).  Ig4_ NrCAM:
           fourth immunoglobulin (Ig)-like domain of NrCAM
           (NgCAM-related cell adhesion molecule). NrCAM belongs to
           the L1 subfamily of cell adhesion molecules (CAMs) and
           is comprised of an extracellular region having six
           IG-like domains and five fibronectin type III domains, a
           transmembrane region and an intracellular domain. NrCAM
           is primarily expressed in the nervous system.
          Length = 76

 Score = 30.0 bits (67), Expect = 0.62
 Identities = 23/87 (26%), Positives = 36/87 (41%), Gaps = 16/87 (18%)

Query: 270 ENATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERK---SSLVLTNA 326
           E+ T++CR +  P  +ISW  NG  +              I   +  RK    +++ +  
Sbjct: 2   EDGTLICRANGNPKPSISWLTNGVPI-------------EIAPTDPSRKVDGDTIIFSKV 48

Query: 327 QESDSGRFYCVAENRAGIADANFTLQV 353
           QE  S  + C A N  G   AN  + V
Sbjct: 49  QERSSAVYQCNASNEYGYLLANAFVNV 75


>gnl|CDD|218711 pfam05709, Sipho_tail, Phage tail protein.  This family consists of
           several Siphovirus and other phage tail component
           proteins as well as some bacterial proteins of unknown
           function.
          Length = 242

 Score = 31.9 bits (73), Expect = 0.73
 Identities = 11/45 (24%), Positives = 21/45 (46%)

Query: 507 TNSELGFDSNDKTPIIDGVSIGGELDDNYPPDYGLPIVGQGQNEL 551
           T   L  DS   T +++G++    L      +   P++  G+NE+
Sbjct: 179 TGDVLVIDSATDTVVLNGINTLNGLAIGANTNSDFPVLPPGENEI 223


>gnl|CDD|212460 cd05723, Ig4_Neogenin, Fourth immunoglobulin (Ig)-like domain in
           neogenin and similar proteins.  Ig4_Neogenin: fourth
           immunoglobulin (Ig)-like domain in neogenin and related
           proteins. Neogenin  is a cell surface protein which is
           expressed in the developing nervous system of vertebrate
           embryos in the growing nerve cells. It is also expressed
           in other embryonic tissues, and may play a general role
           in developmental processes such as cell migration,
           cell-cell recognition, and tissue growth regulation.
           Included in this group is the tumor suppressor protein
           DCC, which is deleted in colorectal carcinoma . DCC and
           neogenin each have four Ig-like domains followed by six
           fibronectin type III domains, a transmembrane domain,
           and an intracellular domain.
          Length = 71

 Score = 29.5 bits (66), Expect = 0.74
 Identities = 20/76 (26%), Positives = 32/76 (42%), Gaps = 12/76 (15%)

Query: 276 CRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQESDSGRFY 335
           C V   P   + W  NG +++     S Y +I         ++ +L +    +SD G + 
Sbjct: 6   CEVTGKPTPTVKWVKNGDMVIP----SDYFKIV--------KEHNLQVLGLVKSDEGFYQ 53

Query: 336 CVAENRAGIADANFTL 351
           C+AEN  G   A   L
Sbjct: 54  CIAENDVGNVQAGAQL 69


>gnl|CDD|143257 cd05849, Ig1_Contactin-1, First Ig domain of contactin-1.
           Ig1_Contactin-1: First Ig domain of the neural cell
           adhesion molecule contactin-1. Contactins are comprised
           of six Ig domains followed by four fibronectin type III
           (FnIII) domains anchored to the membrane by
           glycosylphosphatidylinositol. Contactin-1 is
           differentially expressed in tumor tissues and may,
           through a RhoA mechanism, facilitate invasion and
           metastasis of human lung adenocarcinoma.
          Length = 93

 Score = 29.9 bits (67), Expect = 0.81
 Identities = 20/86 (23%), Positives = 32/86 (37%), Gaps = 13/86 (15%)

Query: 259 MDSRYVEAVSSENATVVCRVDSIPPAAISWYWN-GRLLLNNTAFSSYQRIFVIEQGEYER 317
           +D+ Y E  +    +V CR  + P     W  N   + L N  +S      VI   +  +
Sbjct: 9   IDTIYPEESTEGKVSVNCRARANPFPIYKWRKNNLDIDLTNDRYSMVGGNLVINNPDKYK 68

Query: 318 KSSLVLTNAQESDSGRFYCVAENRAG 343
                       D+GR+ C+  N  G
Sbjct: 69  ------------DAGRYVCIVSNIYG 82


>gnl|CDD|143177 cd04976, Ig2_VEGFR, Second immunoglobulin (Ig)-like domain of
           vascular endothelial growth factor receptor (VEGFR).
           Ig2_VEGFR: Second immunoglobulin (Ig)-like domain of
           vascular endothelial growth factor receptor (VEGFR). The
           VEGFRs have an extracellular component with seven
           Ig-like domains, a transmembrane segment, and an
           intracellular tyrosine kinase domain interrupted by a
           kinase-insert domain. The VEGFR family consists of three
           members, VEGFR-1 (Flt-1), VEGFR-2 (KDR/Flk-1) and
           VEGFR-3 (Flt-4). VEGFRs bind VEGFs with high affinity at
           the Ig-like domains. VEGF-A is important to the growth
           and maintenance of vascular endothelial cells and to the
           development of new blood- and lymphatic-vessels in
           physiological and pathological states. VEGFR-2 is a
           major mediator of the mitogenic, angiogenic and
           microvascular permeability-enhancing effects of VEGF-A.
           VEGFR-1 may play an inhibitory part in these processes
           by binding VEGF and interfering with its interaction
           with VEGFR-2. VEGFR-1 has a signaling role in mediating
           monocyte chemotaxis. VEGFR-2 and -1 may mediate a
           chemotactic and a survival signal in hematopoietic stem
           cells or leukemia cells. VEGFR-3 has been shown to be
           involved in tumor angiogenesis and growth.
          Length = 71

 Score = 29.3 bits (66), Expect = 0.87
 Identities = 18/71 (25%), Positives = 26/71 (36%), Gaps = 13/71 (18%)

Query: 282 PPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQESDSGRFYCVAENR 341
           PP  I WY NG+L+      S   R             SL + +  E D+G +  V  N+
Sbjct: 11  PPPEIQWYKNGKLI------SEKNRTKK-------SGHSLTIKDVTEEDAGNYTVVLTNK 57

Query: 342 AGIADANFTLQ 352
               +   T  
Sbjct: 58  QAKLEKRLTFT 68


>gnl|CDD|143301 cd05893, Ig_Palladin_C, C-terminal immunoglobulin (Ig)-like domain
           of palladin.  Ig_Palladin_C: C-terminal immunoglobulin
           (Ig)-like domain of palladin. Palladin belongs to the
           palladin-myotilin-myopalladin family. Proteins belonging
           to this family contain multiple Ig-like domains and
           function as scaffolds, modulating actin cytoskeleton.
           Palladin binds to alpha-actinin ezrin,
           vasodilator-stimulated phosphoprotein VASP, SPIN90 (DIP,
           mDia interacting protein), and Src. Palladin also binds
           F-actin directly, via its Ig3 domain. Palladin is
           expressed as several alternatively spliced isoforms,
           having various combinations of Ig-like domains, in a
           cell-type-specific manner. It has been suggested that
           palladin's different Ig-like domains may be specialized
           for distinct functions.
          Length = 75

 Score = 29.6 bits (66), Expect = 0.91
 Identities = 22/78 (28%), Positives = 31/78 (39%), Gaps = 7/78 (8%)

Query: 276 CRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQESDSGRFY 335
           CRV  +P   I W      L +NT   S      + Q        L++  A + D+G + 
Sbjct: 5   CRVSGVPHPQIFWKKENESLTHNTDRVS------MHQDNCGY-ICLLIQGATKEDAGWYT 57

Query: 336 CVAENRAGIADANFTLQV 353
             A+N AGI      L V
Sbjct: 58  VSAKNEAGIVSCTARLDV 75


>gnl|CDD|143281 cd05873, Ig_Sema4D_like, Immunoglobulin (Ig)-like domain of the
           class IV semaphorin Sema4D.  Ig_Sema4D_like;
           Immunoglobulin (Ig)-like domain of Sema4D. Sema4D is a
           Class IV semaphorin. Semaphorins are classified based on
           structural features additional to the Sema domain.
           Sema4D has extracellular Sema and Ig domains, a
           transmembrane domain, and a short cytoplasmic domain.
           Sema4D plays a part in the development of GABAergic
           synapses. Sema4D in addition is an immune semaphorin. It
           is abundant on resting T cells; its expression is weak
           on resting B cells and antigen presenting cells (APCs),
           but is upregulated by various stimuli. The receptor used
           by Sema4D in the immune system is CD72. Sem4D enhances
           the activation of B cells and DCs through binding CD72,
           perhaps by reducing CD72s inhibitory signals. The
           receptor used by Sema4D in the non-lymphatic tissues is
           plexin-B1. Sem4D is anchored to the cell surface but its
           extracellular domain can be released from the cell
           surface by a metalloprotease-dependent process. Sem4D
           may mediate its effects in its membrane bound form,
           and/or its cleaved form.
          Length = 87

 Score = 29.8 bits (67), Expect = 1.0
 Identities = 18/72 (25%), Positives = 37/72 (51%), Gaps = 13/72 (18%)

Query: 271 NATVVCRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQESD 330
           NA + C   S   A + W ++G++L   T  S+   ++         +  L++ NA E+D
Sbjct: 13  NAELKCSPKS-NLARVVWKFDGKVL---TPESAKYLLY---------RDGLLIFNASEAD 59

Query: 331 SGRFYCVAENRA 342
           +GR+ C++  ++
Sbjct: 60  AGRYQCLSVEKS 71


>gnl|CDD|143256 cd05848, Ig1_Contactin-5, First Ig domain of contactin-5.
           Ig1_Contactin-5: First Ig domain of the neural cell
           adhesion molecule contactin-5. Contactins are comprised
           of six Ig domains followed by four fibronectin type III
           (FnIII) domains, anchored to the membrane by
           glycosylphosphatidylinositol. The different contactins
           show different expression patterns in the central
           nervous system. In rats, a lack of contactin-5 (NB-2)
           results in an impairment of the neuronal activity in the
           auditory system. Contactin-5 is expressed specifically
           in the postnatal nervous system, peaking at about 3
           weeks postnatal. Contactin-5 is highly expressed in the
           adult human brain in the occipital lobe and in the
           amygdala; lower levels of expression have been detected
           in the corpus callosum, caudate nucleus, and spinal
           cord.
          Length = 94

 Score = 29.5 bits (66), Expect = 1.3
 Identities = 19/70 (27%), Positives = 27/70 (38%), Gaps = 12/70 (17%)

Query: 276 CRVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQES-DSGRF 334
           C     P     W  NG          S  R  +I+        +L+++N  E  DSGR+
Sbjct: 26  CEARGNPVPTYRWLRNG----TEIDTESDYRYSLID-------GNLIISNPSEVKDSGRY 74

Query: 335 YCVAENRAGI 344
            C+A N  G 
Sbjct: 75  QCLATNSIGS 84


>gnl|CDD|143191 cd05714, Ig_CSPGs_LP, Immunoglobulin (Ig)-like domain of
           chondroitin sulfate proteoglycans (CSPGs), human
           cartilage link protein (LP) and similar proteins.
           Ig_CSPGs_LP: immunoglobulin (Ig)-like domain similar to
           that found in chondroitin sulfate proteoglycans (CSPGs)
           and human cartilage link protein (LP).  Included in this
           group are the CSPGs aggrecan, versican, and neurocan. In
           CSPGs this Ig-like domain is followed by hyaluronan
           (HA)-binding tandem repeats, and a C-terminal region
           with epidermal growth factor-like, lectin-like, and
           complement regulatory protein-like domains. Separating
           these N- and C-terminal regions is a nonhomologous
           glycosaminoglycan attachment region. In cartilage,
           aggrecan forms cartilage link protein stabilized
           aggregates with hyaluronan (HA). These aggregates
           contribute to the tissue's load bearing properties.
           Aggrecan and versican have a wide distribution in
           connective tissue and extracellular matrices. Neurocan
           is localized almost exclusively in nervous tissue.
           Aggregates having other CSPGs substituting for aggrecan
           may contribute to the structural integrity of many
           different tissues. There is considerable evidence that
           HA-binding CSPGs are involved in developmental processes
           in the central nervous system. Members of the vertebrate
           HPLN (hyaluronan/HA and proteoglycan binding link)
           protein family are physically linked adjacent to CSPG
           genes.
          Length = 106

 Score = 29.5 bits (67), Expect = 1.4
 Identities = 17/40 (42%), Positives = 25/40 (62%), Gaps = 4/40 (10%)

Query: 320 SLVLTNAQESDSGRFYC-VAENRAGIADANFTLQVTYRGV 358
           SLV+T+ +  DSGR+ C V +   GI D   T+++  RGV
Sbjct: 69  SLVITDLRLEDSGRYRCEVID---GIEDEQDTVELEVRGV 105


>gnl|CDD|144887 pfam01463, LRRCT, Leucine rich repeat C-terminal domain.  Leucine
           Rich Repeats pfam00560 are short sequence motifs present
           in a number of proteins with diverse functions and
           cellular locations. Leucine Rich Repeats are often
           flanked by cysteine rich domains. This domain is often
           found at the C-terminus of tandem leucine rich repeats.
          Length = 25

 Score = 27.2 bits (61), Expect = 1.6
 Identities = 10/23 (43%), Positives = 13/23 (56%), Gaps = 1/23 (4%)

Query: 232 CTGPERLSGKVFSDLHADDFACK 254
           C GPE L G + S   + DF+C 
Sbjct: 4   CAGPESLRGPLLSLPPS-DFSCP 25


>gnl|CDD|143231 cd05754, Ig3_Perlecan_like, Third immunoglobulin (Ig)-like domain
           found in Perlecan and similar proteins.
           Ig3_Perlecan_like: domain similar to the third
           immunoglobulin (Ig)-like domain found in Perlecan.
           Perlecan is a large multi-domain heparin sulfate
           proteoglycan, important in tissue development and
           organogenesis.  Perlecan can be represented as 5 major
           portions; its fourth major portion (domain IV) is a
           tandem repeat of immunoglobulin-like domains (Ig2-Ig15),
           which can vary in size due to alternative splicing.
           Perlecan binds many cellular and extracellular ligands.
           Its domain IV region has many binding sites.  Some of
           these have been mapped at the level of individual
           Ig-like domains, including a site restricted to the Ig5
           domain for heparin/sulfatide, a site restricted to the
           Ig3 domain for nidogen-1 and nidogen-2, a site
           restricted to Ig4-5 for fibronectin, and sites
           restricted to Ig2 and to Ig13-15 for fibulin-2.
          Length = 85

 Score = 29.1 bits (65), Expect = 1.7
 Identities = 22/93 (23%), Positives = 32/93 (34%), Gaps = 16/93 (17%)

Query: 260 DSRYVEAVSSENATVVCRVDSIPPA-AISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERK 318
           + R  E     + + +CR  S  PA  + W   G  L +  A                  
Sbjct: 7   EPRSQEVRPGADVSFICRAKSKSPAYTLVWTRVGGGLPSR-AMDFNGI------------ 53

Query: 319 SSLVLTNAQESDSGRFYCVAENRAGIADANFTL 351
             L + N Q SD+G + C   N     +A  TL
Sbjct: 54  --LTIRNVQLSDAGTYVCTGSNMLDTDEATATL 84


>gnl|CDD|185285 PRK15387, PRK15387, E3 ubiquitin-protein ligase SspH2; Provisional.
          Length = 788

 Score = 31.3 bits (70), Expect = 1.9
 Identities = 52/195 (26%), Positives = 84/195 (43%), Gaps = 46/195 (23%)

Query: 36  DKFLITIPEAPESELTQVLDMSGNNLQILPKEAFRRAGLLNLQKLFLARCHIGQIDSGAL 95
           D  L ++P  P     + L++SGN L  LP       GLL L        H+  + SG  
Sbjct: 231 DNNLTSLPALPPE--LRTLEVSGNQLTSLP---VLPPGLLELSIFSNPLTHLPALPSGLC 285

Query: 96  ------DGLTNLI-------EIDLSDNLLTSIPSLTFQSVRF----------------LR 126
                 + LT+L        E+ +SDN L S+P+L  +  +                 L+
Sbjct: 286 KLWIFGNQLTSLPVLPPGLQELSVSDNQLASLPALPSELCKLWAYNNQLTSLPTLPSGLQ 345

Query: 127 DLNLARNPISKIEKGAFQFVPG-LVKLDMSESRLEHISPEAFTGAKSLESIKLNGNRLSH 185
           +L+++ N ++ +       +P  L KL    +RL  + P   +G K L    ++GNRL+ 
Sbjct: 346 ELSVSDNQLASLPT-----LPSELYKLWAYNNRLTSL-PALPSGLKEL---IVSGNRLTS 396

Query: 186 FPVRSVEPLLKLMMI 200
            PV   E  LK +M+
Sbjct: 397 LPVLPSE--LKELMV 409


>gnl|CDD|219476 pfam07584, BatA, Aerotolerance regulator N-terminal.  These
           proteins share a highly-conserved sequence at their
           N-terminus. They include several proteins from
           Rhodopirellula baltica and also several from
           proteobacteria. The proteins are produced by the Batl
           operon which appears to be important in pathogenicity
           and aerotolerance. This family is the conserved
           N-terminus, but the full length proteins carry multiple
           membrane-spanning domains. BatA ensures bacterial
           survival in the early stages of the infection process,
           when the infected sites are aerobic, and is produced
           under conditions of oxidative stress.
          Length = 77

 Score = 28.6 bits (65), Expect = 1.9
 Identities = 11/27 (40%), Positives = 17/27 (62%)

Query: 373 LALFFLIILILIIIIYLLIRMRTITYP 399
           L L+ L++L L II++LL+R R     
Sbjct: 8   LLLWGLLLLPLPIILHLLLRRRPRRVK 34


>gnl|CDD|143173 cd04972, Ig_TrkABC_d4, Fourth domain (immunoglobulin-like) of Trk
           receptors TrkA, TrkB and TrkC.  TrkABC_d4: the fourth
           domain of Trk receptors TrkA, TrkB and TrkC, this is an
           immunoglobulin (Ig)-like domain which binds to
           neurotrophin. The Trk family of receptors are tyrosine
           kinase receptors. They are activated by dimerization,
           leading to autophosphorylation of intracellular tyrosine
           residues, and triggering the signal transduction
           pathway. TrkA, TrkB, and TrkC share significant sequence
           homology and domain organization. The first three
           domains are leucine-rich domains. The fourth and fifth
           domains are Ig-like domains playing a part in ligand
           binding. TrkA, Band C mediate the trophic effects of the
           neurotrophin Nerve growth factor (NGF) family. TrkA is
           recognized by NGF. TrKB is recognized by brain-derived
           neurotrophic factor (BDNF) and neurotrophin (NT)-4. TrkC
           is recognized by NT-3. NT-3 is promiscuous as in some
           cell systems it activates TrkA and TrkB receptors. TrkA
           is a receptor found in all major NGF targets, including
           the sympathetic, trigeminal, and dorsal root ganglia,
           cholinergic neurons of the basal forebrain and the
           striatum. TrKB transcripts are found throughout multiple
           structures of the central and peripheral nervous
           systems. The TrkC gene is expressed throughout the
           mammalian nervous system.
          Length = 90

 Score = 29.0 bits (65), Expect = 2.0
 Identities = 22/101 (21%), Positives = 32/101 (31%), Gaps = 25/101 (24%)

Query: 260 DSRYVEAVSSENATVVCRVDSIPPAAISWYWNG------RLLLNNTAFSSYQRIFVIEQG 313
           ++  V       AT+ C  +  P   + W   G      R     T    Y         
Sbjct: 8   NATVVY--EGGTATIRCTAEGSPLPKVEWIIAGLIVIQTRTDTLETTVDIY--------- 56

Query: 314 EYERKSSLVLTNAQESDSGRFYCVAENRAGIADANFTLQVT 354
                 +L L+N          C AEN  G A+   ++QVT
Sbjct: 57  ------NLQLSNITSETQTTVTCTAENPVGQANV--SVQVT 89


>gnl|CDD|197684 smart00365, LRR_SD22, Leucine-rich repeat, SDS22-like subfamily. 
          Length = 22

 Score = 26.9 bits (61), Expect = 2.3
 Identities = 10/17 (58%), Positives = 12/17 (70%)

Query: 98  LTNLIEIDLSDNLLTSI 114
           LTNL E+DL DN +  I
Sbjct: 1   LTNLEELDLGDNKIKKI 17


>gnl|CDD|225994 COG3463, COG3463, Predicted membrane protein [Function unknown].
          Length = 458

 Score = 30.9 bits (70), Expect = 2.3
 Identities = 28/121 (23%), Positives = 42/121 (34%), Gaps = 8/121 (6%)

Query: 360 LPFLGGGHINGISLALFFLIILILIIIIYLLIRMRTITY-PNSKNPAQIEVMANGNAH-- 416
           LPFL  G + GIS       ILI II+I +L  +  I Y P + +   +E  A  N    
Sbjct: 305 LPFLFLGALYGISKIKSVKKILIKIILIGILASLALIPYTPIAPHSPFVEQGAMINLAVS 364

Query: 417 AVVNKTPSLTPVIETSSFTERKQFPPPSYHSTEMISPNGQLPNKTLHSVINISNPDLIND 476
            V+    +   +I        K           +        +  +    N S   L+N 
Sbjct: 365 KVIPGKEASFELIAII-----KDSKGYLLTINNLYPVFANDFDAYVLPKNNNSRVYLVNL 419

Query: 477 T 477
            
Sbjct: 420 E 420


>gnl|CDD|224190 COG1271, CydA, Cytochrome bd-type quinol oxidase, subunit 1 [Energy
           production and conversion].
          Length = 457

 Score = 30.7 bits (70), Expect = 2.4
 Identities = 12/40 (30%), Positives = 21/40 (52%), Gaps = 2/40 (5%)

Query: 355 YRGVGLPFLGGGHINGISLALFFLIILILIII-IYLLIRM 393
              V    +  G +   SL LF ++  +L+I  +YLL+R+
Sbjct: 397 KTDVVSSAVTAGSV-LFSLILFMVLYTVLLIAEVYLLLRL 435


>gnl|CDD|143271 cd05863, Ig2_VEGFR-3, Second immunoglobulin (Ig)-like domain of
           vascular endothelial growth factor receptor 3 (VEGFR-3).
            Ig2_VEGFR-3: Second immunoglobulin (Ig)-like domain of
           vascular endothelial growth factor receptor 3 (VEGFR-3).
           The VEGFRs have an extracellular component with seven
           Ig-like domains, a transmembrane segment, and an
           intracellular tyrosine kinase domain interrupted by a
           kinase-insert domain. VEGFRs bind VEGFs with high
           affinity at the Ig-like domains. VEGFR-3 (Flt-4) binds
           two members of the VEGF family (VEGF-C and -D) and is
           involved in tumor angiogenesis and growth.
          Length = 67

 Score = 28.0 bits (62), Expect = 2.5
 Identities = 15/77 (19%), Positives = 30/77 (38%), Gaps = 17/77 (22%)

Query: 277 RVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQESDSGRFYC 336
           +V + PP    WY +G+L+                      + SL + +  E+ +G +  
Sbjct: 6   KVAAYPPPEFQWYKDGKLISGK-----------------HSQHSLQIKDVTEASAGTYTL 48

Query: 337 VAENRAGIADANFTLQV 353
           V  N A   +   +L++
Sbjct: 49  VLWNSAAGLEKRISLEL 65


>gnl|CDD|218858 pfam06024, DUF912, Nucleopolyhedrovirus protein of unknown function
           (DUF912).  This family consists of several
           Nucleopolyhedrovirus proteins of unknown function.
          Length = 101

 Score = 28.4 bits (64), Expect = 3.0
 Identities = 13/45 (28%), Positives = 24/45 (53%), Gaps = 1/45 (2%)

Query: 363 LGGGHINGISLALFFLIILILIIIIYLLIRMRTITYPNSKNPAQI 407
              G+I  I L  FF ++++L  I Y +I +R     ++ NP+ +
Sbjct: 58  ANAGNIILIGLLAFFCVLVLLYAIYYFVI-LRERRKYSTNNPSYV 101


>gnl|CDD|143197 cd05720, Ig_CD8_alpha, Immunoglobulin (Ig) like domain of CD8 alpha
           chain.  Ig_CD8_alpha: immunoglobulin (Ig)-like domain in
           CD8 alpha. The CD8 glycoprotein plays an essential role
           in the control of T-cell selection, maturation and the
           T-cell receptor (TCR)-mediated response to peptide
           antigen. CD8 is comprised of alpha and beta subunits and
           is expressed as either an alphaalpha or alphabeta dimer.
           Both dimeric isoforms can serve as a coreceptor for T
           cell activation and differentiation, however they have
           distinct physiological roles, different cellular
           distributions, unique binding partners etc. Each CD8
           subunit is comprised of an extracellular domain
           containing a v-type Ig-like domain, a single pass
           transmembrane portion and a short intracellular domain.
           The Ig domain of CD8 alpha binds to antibodies.
          Length = 104

 Score = 28.6 bits (64), Expect = 3.5
 Identities = 17/79 (21%), Positives = 29/79 (36%), Gaps = 13/79 (16%)

Query: 273 TVVCRVDSIPPAAISWYWNGRLLLNNTAF-----SSYQRIFVIEQGEYER----KSS--- 320
            + C V +  P   SW +          F      S +  +  E+   +R    +SS   
Sbjct: 10  ELKCEVLNSSPTGCSWLFQPPGSAPQPTFLVYLSGSSKITWDEEELSSKRFSGSRSSNSF 69

Query: 321 -LVLTNAQESDSGRFYCVA 338
            L L N Q+ + G ++C  
Sbjct: 70  VLTLKNFQKENEGYYFCSV 88


>gnl|CDD|143172 cd04971, Ig_TrKABC_d5, Fifth domain (immunoglobulin-like) of Trk
           receptors TrkA, TrkB and TrkC.  TrkABC_d5: the fifth
           domain of Trk receptors TrkA, TrkB and TrkC, this is an
           immunoglobulin (Ig)-like domain which binds to
           neurotrophin. The Trk family of receptors are tyrosine
           kinase receptors. They are activated by dimerization,
           leading to autophosphorylation of intracellular tyrosine
           residues, and triggering the signal transduction
           pathway. TrkA, TrkB, and TrkC share significant sequence
           homology and domain organization. The first three
           domains are leucine-rich domains. The fourth and fifth
           domains are Ig-like domains playing a part in ligand
           binding. TrkA, Band C mediate the trophic effects of the
           neurotrophin Nerve growth factor (NGF) family. TrkA is
           recognized by NGF. TrkB is recognized by brain-derived
           neurotrophic factor (BDNF) and neurotrophin (NT)-4. TrkC
           is recognized by NT-3. NT-3 is promiscuous as in some
           cell systems it activates TrkA and TrkB receptors. TrkA
           is a receptor found in all major NGF targets, including
           the sympathetic, trigeminal, and dorsal root ganglia,
           cholinergic neurons of the basal forebrain and the
           striatum. TrKB transcripts are found throughout multiple
           structures of the central and peripheral nervous
           systems. The TrkC gene is expressed throughout the
           mammalian nervous system.
          Length = 81

 Score = 27.7 bits (62), Expect = 3.5
 Identities = 16/71 (22%), Positives = 24/71 (33%), Gaps = 2/71 (2%)

Query: 278 VDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQGEYERKSSLVLTNAQESDSGRFYCV 337
           V   P   ++WY NG +L  +        I        E    L   N    ++G +  V
Sbjct: 7   VRGNPKPTLTWYHNGAVLNESD--YIRTEIHYEVTTPTEYHGCLQFDNPTHVNNGNYTLV 64

Query: 338 AENRAGIADAN 348
           A N  G    +
Sbjct: 65  ASNEYGQDSKS 75


>gnl|CDD|143229 cd05752, Ig1_FcgammaR_like, Frst immunoglobulin (Ig)-like domain of
            Fcgamma-receptors (FcgammaRs) and similar proteins.
           Ig1_FcgammaR_like: domain similar to the first
           immunoglobulin (Ig)-like domain of  Fcgamma-receptors
           (FcgammaRs). Interactions between IgG and FcgammaR are
           important to the initiation of cellular and humoral
           response. IgG binding to FcgammaR leads to a cascade of
           signals and ultimately to functions such as
           antibody-dependent-cellular-cytotoxicity (ADCC),
           endocytosis, phagocytosis, release of inflammatory
           mediators, etc. FcgammaR has two Ig-like domains. This
           group also contains FcepsilonRI, which binds IgE with
           high affinity.
          Length = 78

 Score = 27.7 bits (62), Expect = 3.6
 Identities = 16/49 (32%), Positives = 22/49 (44%), Gaps = 4/49 (8%)

Query: 270 ENATVVCRVDSIP-PAAISWYWNGRLLLNNTAFSSYQRIFVIEQ-GEYE 316
           E  T+ C   + P   +  WY NG+LL   T  +SY+        GEY 
Sbjct: 16  EKVTLTCNGFNSPEQNSTQWYHNGKLLETTT--NSYRIRAANNDSGEYR 62


>gnl|CDD|144411 pfam00802, Glycoprotein_G, Pneumovirus attachment glycoprotein G.
           This family includes attachment proteins from
           respiratory synctial virus. Glycoprotein G has not been
           shown to have any neuraminidase or hemagglutinin
           activity. The amino terminus is thought to be
           cytoplasmic, and the carboxyl terminus extracellular.
           The extracellular region contains four completely
           conserved cysteine residues.
          Length = 263

 Score = 29.7 bits (66), Expect = 3.7
 Identities = 23/98 (23%), Positives = 36/98 (36%), Gaps = 17/98 (17%)

Query: 372 SLALFFLIILILIIIIYLLIRMRTITYPNSKNPAQIEVMANGNAHAVVNKTPSLTPVIET 431
           SLA   L IL +II   L+I             A I  +++ N       TP+ TP  + 
Sbjct: 28  SLAQIALSILAMIISTSLII-------------AAIIFISSANHKV----TPTTTPTQQI 70

Query: 432 SSFTERKQFPPPSYHSTEMISPNGQLPNKTLHSVINIS 469
           ++  +       + H+    SP+ Q     L   I   
Sbjct: 71  TNQIQNHTSTYLTQHNQLSTSPSNQSTTTPLIHTILDD 108


>gnl|CDD|164750 MTH00204, ND4, NADH dehydrogenase subunit 4; Provisional.
          Length = 485

 Score = 30.0 bits (68), Expect = 3.7
 Identities = 11/31 (35%), Positives = 22/31 (70%), Gaps = 2/31 (6%)

Query: 368 INGISLALFFLIILILIIIIYLLIRMRTITY 398
           ++G+SL  FF+++  L+I I +LI  ++I +
Sbjct: 81  VDGVSL--FFILLTTLLIPICILISWKSIKF 109


>gnl|CDD|130784 TIGR01723, hmd_TIGR, 5,10-methenyltetrahydromethanopterin
           hydrogenase.  This model represents a clade of
           authenticated coenzyme
           N(5),N(10)-methenyltetrahydromethanopterin reductases.
           This enzyme does not use F420. This enzyme acts in
           methanogenesis and as such is restricted to methanogenic
           archaeal species. This clade is one of two clades in
           Pfam model pfam03201 [Energy metabolism,
           Methanogenesis].
          Length = 340

 Score = 29.9 bits (67), Expect = 4.3
 Identities = 17/52 (32%), Positives = 22/52 (42%), Gaps = 7/52 (13%)

Query: 228 VQPACTGPERLSGKVFSDLHADDF-------ACKPEIRMDSRYVEAVSSENA 272
           V  ACT P     K+F DL  +D         C PE++      E  +SE A
Sbjct: 170 VTHACTIPTTKFAKIFEDLGREDLNVTSYHPGCVPEMKGQVYIAEGYASEEA 221


>gnl|CDD|226560 COG4074, Mth, H2-forming N5,N10-methylenetetrahydromethanopterin
           dehydrogenase [Energy production and conversion].
          Length = 343

 Score = 29.9 bits (67), Expect = 4.3
 Identities = 15/52 (28%), Positives = 21/52 (40%), Gaps = 7/52 (13%)

Query: 228 VQPACTGPERLSGKVFSDLHADDF-------ACKPEIRMDSRYVEAVSSENA 272
           V  ACT P     K+F D+  +D           PE++      E  +SE A
Sbjct: 170 VTHACTIPTTKFKKIFEDMGREDLNVTSYHPGTVPEMKGQVYIAEGYASEEA 221


>gnl|CDD|143234 cd05757, Ig2_IL1R_like, Second immunoglobulin (Ig)-like domain of
           interleukin-1 receptor (IL1R) and similar proteins.
           Ig2_IL1R_like: domain similar to the second
           immunoglobulin (Ig)-like domain of interleukin-1
           receptor (IL1R).  IL-1 alpha and IL-1 beta are cytokines
           which participate in the regulation of inflammation,
           immune responses, and hematopoiesis. These cytokines
           bind to the IL-1 receptor type 1 (IL1R1), which is
           activated on additional association with an accessory
           protein, IL1RAP. IL-1 also binds a second receptor
           designated type II (IL1R2). Mature IL1R1 consists of
           three IG-like domains, a transmembrane domain, and a
           large cytoplasmic domain. Mature IL1R2 is organized
           similarly except that it has a short cytoplasmic domain.
           The latter does not initiate signal transduction. A
           naturally occurring cytokine IL-1RA (IL-1 receptor
           antagonist) is widely expressed and binds to IL-1
           receptors, inhibiting the binding of IL-1 alpha and IL-1
           beta. This group also contains ILIR-like 1 (IL1R1L)
           which maps to the same chromosomal location as IL1R1 and
           IL1R2.
          Length = 92

 Score = 28.1 bits (63), Expect = 4.4
 Identities = 18/86 (20%), Positives = 33/86 (38%), Gaps = 22/86 (25%)

Query: 260 DSRYVEAVSSENATVVC-------RVDSIPPAAISWYWNGRLLLNNTAFSSYQRIFVIEQ 312
            S      S++   +VC         +++PP  + WY + +LL          R   +  
Sbjct: 1   ISYKQILFSTKGGKIVCPDLDDFKNENTLPP--VQWYKDCKLLEG-------DRKRFV-- 49

Query: 313 GEYERKSSLVLTNAQESDSGRFYCVA 338
               + S L++ N  E D+G + C  
Sbjct: 50  ----KGSKLLIQNVTEEDAGNYTCKL 71


>gnl|CDD|216560 pfam01544, CorA, CorA-like Mg2+ transporter protein.  The CorA
           transport system is the primary Mg2+ influx system of
           Salmonella typhimurium and Escherichia coli. CorA is
           virtually ubiquitous in the Bacteria and Archaea. There
           are also eukaryotic relatives of this protein. The
           family includes the MRS2 protein from yeast that is
           thought to be an RNA splicing protein. However its
           membership of this family suggests that its effect on
           splicing is due to altered magnesium levels in the cell.
          Length = 291

 Score = 29.2 bits (66), Expect = 5.6
 Identities = 9/34 (26%), Positives = 17/34 (50%), Gaps = 2/34 (5%)

Query: 359 GLPFLGGGHINGISLALFFLIILILIIIIYLLIR 392
           G+P L   +  G    +   ++L+L I++ L  R
Sbjct: 259 GMPELDWPY--GYPFWIVLGLMLLLAILLILYFR 290


>gnl|CDD|100598 PRK00561, ppnK, inorganic polyphosphate/ATP-NAD kinase;
           Provisional.
          Length = 259

 Score = 29.1 bits (65), Expect = 5.9
 Identities = 13/51 (25%), Positives = 19/51 (37%)

Query: 76  NLQKLFLARCHIGQIDSGALDGLTNLIEIDLSDNLLTSIPSLTFQSVRFLR 126
                  A C +  I++G L   T+  E DL  N    +  L F  +  L 
Sbjct: 49  TAANYNCAGCKVVGINTGHLGFYTSFNETDLDQNFANKLDQLKFTQIDLLE 99


>gnl|CDD|216842 pfam02009, Rifin_STEVOR, Rifin/stevor family.  Several multicopy
           gene families have been described in Plasmodium
           falciparum, including the stevor family of subtelomeric
           open reading frames and the rif interspersed repetitive
           elements. Both families contain three predicted
           transmembrane segments. It has been proposed that stevor
           and rif are members of a larger superfamily that code
           for variant surface antigens.
          Length = 290

 Score = 28.9 bits (65), Expect = 6.9
 Identities = 11/27 (40%), Positives = 21/27 (77%), Gaps = 2/27 (7%)

Query: 368 INGISLALFFLIILILIIIIYLLIRMR 394
           I   ++A+  LII+++++IIYL++R R
Sbjct: 249 IYASAIAI--LIIVLVMLIIYLILRYR 273


>gnl|CDD|214338 CHL00025, ndhF, NADH dehydrogenase subunit 5.
          Length = 741

 Score = 29.1 bits (66), Expect = 7.8
 Identities = 12/37 (32%), Positives = 22/37 (59%), Gaps = 1/37 (2%)

Query: 357 GVGLPFLGGGHING-ISLALFFLIILILIIIIYLLIR 392
           G G+ ++GGG I+  + L LF++ I +LI+  +    
Sbjct: 705 GEGIKYVGGGRISSYLFLYLFYVSIFLLILYFFFSFI 741


>gnl|CDD|177215 MTH00158, ATP8, ATP synthase F0 subunit 8; Provisional.
          Length = 32

 Score = 25.5 bits (57), Expect = 7.9
 Identities = 8/20 (40%), Positives = 12/20 (60%)

Query: 368 INGISLALFFLIILILIIII 387
           +N + L + FLI  IL  I+
Sbjct: 7   MNWLILFILFLITFILFNIL 26


>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
            Provisional.
          Length = 2849

 Score = 29.2 bits (65), Expect = 8.0
 Identities = 35/163 (21%), Positives = 60/163 (36%), Gaps = 32/163 (19%)

Query: 133  NPISKIEKGAFQFVPGLVK---LDMSESRLEHISPEAFTGAKSL------ESI--KLNGN 181
            NP    E+ A       V      M +   EHI  + +   +SL      E+I   +   
Sbjct: 1305 NPEQIFEELAGNESNDDVTGAPCPMGDIDAEHIIGDDYDTFESLSDELLEETITNDIESL 1364

Query: 182  RLSHFPVRSVEPLLK----LMMIELHDNPWVCDCNMRSIKMWLADKKNVPVQPACTGPER 237
                F   +++  LK    L   ++HDN  +CD +          KKN+ V      PE 
Sbjct: 1365 EAKDFEQYTLKVNLKAPKLLKPAKIHDNEHLCDFS----------KKNLIV------PEP 1408

Query: 238  LSGKVFSDLHADDFACKPEIR-MDSRYVEAVSSENATVVCRVD 279
            L  +     +  D  C   ++ +D+ YV+  + + A    +  
Sbjct: 1409 LKEEEELGGNPPDIHCYAALKPLDTLYVKCPTEKAAYEAAKGK 1451


>gnl|CDD|220496 pfam09972, DUF2207, Predicted membrane protein (DUF2207).  This
           domain, found in various hypothetical bacterial
           proteins, has no known function.
          Length = 503

 Score = 28.9 bits (65), Expect = 8.6
 Identities = 5/47 (10%), Positives = 18/47 (38%), Gaps = 1/47 (2%)

Query: 357 GVGLPFLGGGHI-NGISLALFFLIILILIIIIYLLIRMRTITYPNSK 402
            + +  L    I   ++  +   I+L++  +I  ++  +       +
Sbjct: 404 TLIILILSFILISLVLAALVLLAIVLVIGSVIAAILPRKLFGRWTPE 450


>gnl|CDD|224039 COG1114, BrnQ, Branched-chain amino acid permeases [Amino acid
           transport and metabolism].
          Length = 431

 Score = 28.7 bits (65), Expect = 8.7
 Identities = 13/42 (30%), Positives = 18/42 (42%), Gaps = 13/42 (30%)

Query: 357 GVGLPFLG-------GGHINGIS------LALFFLIILILII 385
           GVGLP LG       GG +  ++        + F I + L I
Sbjct: 49  GVGLPLLGIIAVALYGGGVESLATRIGPWFGVLFAIAIYLSI 90


>gnl|CDD|133063 cd06913, beta3GnTL1_like, Beta 1, 3-N-acetylglucosaminyltransferase
           is essential for the formation of
           poly-N-acetyllactosamine .  This family includes human
           Beta3GnTL1 and related eukaryotic proteins. Human
           Beta3GnTL1 is a putative
           beta-1,3-N-acetylglucosaminyltransferase. Beta3GnTL1 is
           expressed at various levels in most of tissues examined.
           Beta 1, 3-N-acetylglucosaminyltransferase has been found
           to be essential for the formation of
           poly-N-acetyllactosamine. Poly-N-acetyllactosamine is a
           unique carbohydrate composed of N-acetyllactosamine
           repeats. It is often an important part of
           cell-type-specific oligosaccharide structures and some
           functional oligosaccharides. It has been shown that the
           structure and biosynthesis of poly-N-acetyllactosamine
           display a dramatic change during development and
           oncogenesis. Several members of beta-1,
           3-N-acetylglucosaminyltransferase have been identified.
          Length = 219

 Score = 28.6 bits (64), Expect = 9.1
 Identities = 19/71 (26%), Positives = 31/71 (43%), Gaps = 7/71 (9%)

Query: 239 SGKVFSDLHADDFACKPEIRMDSRYVEAVSSENATVVCRVDSIPPAAISWY--WNGRL-- 294
           SG+    L +DD      IR+  +Y  A+   N+ + C+V  IP  +   Y  W   L  
Sbjct: 84  SGRYLCFLDSDDVMMPQRIRL--QYEAALQHPNSIIGCQVRRIPEDSTERYTRWINTLTR 141

Query: 295 -LLNNTAFSSY 304
             L    ++S+
Sbjct: 142 EQLLTQVYTSH 152


>gnl|CDD|220767 pfam10459, Peptidase_S46, Peptidase S46.  Dipeptidyl-peptidase 7
           (DPP-7) is the best characterized member of this family.
           It is a serine peptidase that is located on the cell
           surface and is predicted to have two N-terminal
           transmembrane domains.
          Length = 696

 Score = 29.1 bits (66), Expect = 9.2
 Identities = 8/18 (44%), Positives = 10/18 (55%)

Query: 338 AENRAGIADANFTLQVTY 355
              R    DAN TL++TY
Sbjct: 543 KSGRPVYPDANSTLRLTY 560


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.137    0.412 

Gapped
Lambda     K      H
   0.267   0.0637    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 30,992,313
Number of extensions: 3062624
Number of successful extensions: 5013
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4897
Number of HSP's successfully gapped: 173
Length of query: 600
Length of database: 10,937,602
Length adjustment: 102
Effective length of query: 498
Effective length of database: 6,413,494
Effective search space: 3193920012
Effective search space used: 3193920012
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 62 (27.9 bits)