RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy12803
         (902 letters)



>gnl|CDD|216924 pfam02191, OLF, Olfactomedin-like domain. 
          Length = 250

 Score =  239 bits (613), Expect = 7e-73
 Identities = 90/262 (34%), Positives = 135/262 (51%), Gaps = 18/262 (6%)

Query: 553 VYAVGKPVFHKSTSMKKESSGKPVDVIFGAWMKDPHPTSGNSEKFWMTTDSDPTHLNEYK 612
           + +V KPV  K    K           +GAWMKDP P S + + +     S  T L E++
Sbjct: 2   LASVSKPVTVKKAGYK-----------YGAWMKDPLPPSDDDKVYVTDGYSGDT-LFEFR 49

Query: 613 DKDMFKSDAKSWTYNLKEPFQGNAHVVFNNSFYYHKKFSNSIVQFDLGSSKIIN---IVN 669
           + + FK      TY L  P+QG  HVV+N S YY++  S +I+++DL +  ++    + N
Sbjct: 50  NLEDFKQGRPFKTYRLPYPWQGTGHVVYNGSLYYNRYNSRTIIKYDLTTRTVVATRVLPN 109

Query: 670 LSTVDPNIKLYKTGYNVMDFSLDENGLWVIYGL--SNNNTAVTKLDTATLQIQYTWNITL 727
               +        G   +D ++DENGLWVIY    +N N  ++KLD  TL ++ TW+   
Sbjct: 110 AGYNNTFP-YSWGGKTDIDLAVDENGLWVIYATEENNGNIVLSKLDPETLDVEQTWDTGY 168

Query: 728 LHKKVGDMFVVCGVLYVVDSVTDRNTNIRFALDLYKNDLLEVSLNFTNPFTNTTMITYNS 787
             +  G+ F+VCGVLYV  S + R+T I +A D         S+ F N +   +M+ YN 
Sbjct: 169 PKRSAGNAFMVCGVLYVTRSYSLRDTRIFYAYDTSTGTEEAPSIPFPNRYGYISMLDYNP 228

Query: 788 RYKDLYTWDRGNSLTYPIRYHE 809
           R + LY WD G  LTY +R+  
Sbjct: 229 RDRKLYAWDNGYQLTYDLRFKP 250


>gnl|CDD|128580 smart00284, OLF, Olfactomedin-like domains. 
          Length = 255

 Score =  148 bits (376), Expect = 3e-40
 Identities = 74/260 (28%), Positives = 119/260 (45%), Gaps = 24/260 (9%)

Query: 556 VGKPVFHKSTSMKKESSGKPVDVIFGAWMKDPHPTSGNSEKFWM--TTDSDPTHLNEYKD 613
           + KPV     +++    GK      GAWMKDP   +     +W           + EY  
Sbjct: 6   ISKPV-----TLQTSWKGK-----SGAWMKDPLWNTTKKSLYWYMPLNTRVLRSVREYSS 55

Query: 614 KDMFKSDAKSWTYNLKEPFQGNAHVVFNNSFYYHKKFSNSIVQFDLGS-----SKIINIV 668
              F+       + L    QG   VV+N S Y++K  S+ I +FDL +       ++N  
Sbjct: 56  MSDFQMGKNPTDHPLPHAGQGTGVVVYNGSLYFNKFNSHDICRFDLTTETYQKEPLLNGA 115

Query: 669 NLSTVDPNIKLYK-TGYNVMDFSLDENGLWVIYGLSNNNTA--VTKLDTATLQIQYTWNI 725
             +   P    Y   G++ +D ++DENGLWVIY    N     ++KL+ ATL I+ TW  
Sbjct: 116 GYNNRFP----YAWGGFSDIDLAVDENGLWVIYATEQNAGKIVISKLNPATLTIENTWIT 171

Query: 726 TLLHKKVGDMFVVCGVLYVVDSVTDRNTNIRFALDLYKNDLLEVSLNFTNPFTNTTMITY 785
           T   +   + F++CG+LYV  S+  +   + +A D        + + F N +   +M+ Y
Sbjct: 172 TYNKRSASNAFMICGILYVTRSLGSKGEKVFYAYDTNTGKEGHLDIPFENMYEYISMLDY 231

Query: 786 NSRYKDLYTWDRGNSLTYPI 805
           N   + LY W+ G+ + Y I
Sbjct: 232 NPNDRKLYAWNNGHLVHYDI 251


>gnl|CDD|189968 pfam01391, Collagen, Collagen triple helix repeat (20 copies).
           Members of this family belong to the collagen
           superfamily. Collagens are generally extracellular
           structural proteins involved in formation of connective
           tissue structure. The alignment contains 20 copies of
           the G-X-Y repeat that forms a triple helix. The first
           position of the repeat is glycine, the second and third
           positions can be any residue but are frequently proline
           and hydroxyproline. Collagens are post-translationally
           modified by proline hydroxylase to form the
           hydroxyproline residues. Defective hydroxylation is the
           cause of scurvy. Some members of the collagen
           superfamily are not involved in connective tissue
           structure but share the same triple helical structure.
          Length = 60

 Score = 65.6 bits (161), Expect = 2e-13
 Identities = 33/66 (50%), Positives = 33/66 (50%), Gaps = 6/66 (9%)

Query: 140 GAPGVVGPPGPQGIPGDEGKRGPKGDMGYPGPQGIPGDEGKRGPKGDMGYPGRIGLDGRD 199
           G PG  GPPGP G P   G  GP G  G PGP G PG  G  GP G  G P   G  G  
Sbjct: 1   GPPGPPGPPGPPGPP---GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPP---GPPGAP 54

Query: 200 GLPGEP 205
           G PG P
Sbjct: 55  GAPGPP 60



 Score = 64.4 bits (158), Expect = 4e-13
 Identities = 29/60 (48%), Positives = 29/60 (48%)

Query: 122 GEIGAKGEMGVKGERGEKGAPGVVGPPGPQGIPGDEGKRGPKGDMGYPGPQGIPGDEGKR 181
           G  G  G  G  G  G  G PG  GPPGP G PG  G  GP G  G PGP G PG  G  
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGPP 60



 Score = 64.0 bits (157), Expect = 6e-13
 Identities = 31/58 (53%), Positives = 31/58 (53%)

Query: 134 GERGEKGAPGVVGPPGPQGIPGDEGKRGPKGDMGYPGPQGIPGDEGKRGPKGDMGYPG 191
           G  G  G PG  GPPGP G PG  G  GP G  G PGP G PG  G  GP G  G PG
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPG 58



 Score = 63.7 bits (156), Expect = 7e-13
 Identities = 29/61 (47%), Positives = 29/61 (47%), Gaps = 3/61 (4%)

Query: 125 GAKGEMGVKGERGEKGAPGVVGPPGPQGIPGDEGKRGPKGDMGYPGPQGIPGDEGKRGPK 184
           G  G  G  G  G  G PG   PPGP G PG  G  GP G  G PGP G PG  G  G  
Sbjct: 1   GPPGPPGPPGPPGPPGPPG---PPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAP 57

Query: 185 G 185
           G
Sbjct: 58  G 58



 Score = 63.3 bits (155), Expect = 9e-13
 Identities = 28/59 (47%), Positives = 28/59 (47%)

Query: 111 PPGLQGPIGQKGEIGAKGEMGVKGERGEKGAPGVVGPPGPQGIPGDEGKRGPKGDMGYP 169
           PPG  GP G  G  G  G  G  G  G  G PG  GPPGP G PG  G  G  G  G P
Sbjct: 2   PPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGPP 60



 Score = 62.1 bits (152), Expect = 3e-12
 Identities = 30/60 (50%), Positives = 30/60 (50%)

Query: 131 GVKGERGEKGAPGVVGPPGPQGIPGDEGKRGPKGDMGYPGPQGIPGDEGKRGPKGDMGYP 190
           G  G  G  G PG  GPPGP G PG  G  GP G  G PGP G PG  G  G  G  G P
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGPP 60



 Score = 61.7 bits (151), Expect = 3e-12
 Identities = 29/59 (49%), Positives = 29/59 (49%)

Query: 170 GPQGIPGDEGKRGPKGDMGYPGRIGLDGRDGLPGEPGLDGIPGRNGMDGIPGKDGLPGK 228
           GP G PG  G  GP G  G PG  G  G  G PG PG  G PG  G  G PG  G PG 
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGP 59



 Score = 52.9 bits (128), Expect = 4e-09
 Identities = 26/60 (43%), Positives = 26/60 (43%)

Query: 191 GRIGLDGRDGLPGEPGLDGIPGRNGMDGIPGKDGLPGKDGIPGTNGTNGEKGMTGPRGNR 250
           G  G  G  G PG PG  G PG  G  G PG  G PG  G PG  G  G  G  G  G  
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGPP 60



 Score = 52.1 bits (126), Expect = 7e-09
 Identities = 28/59 (47%), Positives = 28/59 (47%)

Query: 182 GPKGDMGYPGRIGLDGRDGLPGEPGLDGIPGRNGMDGIPGKDGLPGKDGIPGTNGTNGE 240
           GP G  G PG  G  G  G PG PG  G PG  G  G PG  G PG  G PG  G  G 
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGP 59



 Score = 51.7 bits (125), Expect = 1e-08
 Identities = 28/63 (44%), Positives = 28/63 (44%), Gaps = 3/63 (4%)

Query: 197 GRDGLPGEPGLDGIPGRNGMDGIPGKDGLPGKDGIPGTNGTNGEKGMTGPRGNRGKSGIN 256
           G  G PG PG  G PG  G    PG  G PG  G PG  G  G  G  GP G  G  G  
Sbjct: 1   GPPGPPGPPGPPGPPGPPGP---PGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAP 57

Query: 257 GVP 259
           G P
Sbjct: 58  GPP 60



 Score = 50.6 bits (122), Expect = 3e-08
 Identities = 22/48 (45%), Positives = 22/48 (45%)

Query: 215 GMDGIPGKDGLPGKDGIPGTNGTNGEKGMTGPRGNRGKSGINGVPGTP 262
           G  G PG  G PG  G PG  G  G  G  GP G  G  G  G PG P
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPP 48



 Score = 49.8 bits (120), Expect = 5e-08
 Identities = 26/59 (44%), Positives = 26/59 (44%)

Query: 194 GLDGRDGLPGEPGLDGIPGRNGMDGIPGKDGLPGKDGIPGTNGTNGEKGMTGPRGNRGK 252
           G  G  G PG PG  G PG  G  G PG  G PG  G PG  G  G  G  G  G  G 
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAPGP 59



 Score = 49.8 bits (120), Expect = 6e-08
 Identities = 26/57 (45%), Positives = 26/57 (45%)

Query: 206 GLDGIPGRNGMDGIPGKDGLPGKDGIPGTNGTNGEKGMTGPRGNRGKSGINGVPGTP 262
           G  G PG  G  G PG  G PG  G PG  G  G  G  GP G  G  G  G PG P
Sbjct: 1   GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGAPGAP 57


>gnl|CDD|214653 smart00410, IG_like, Immunoglobulin like.  IG domains that cannot
           be classified into one of IGv1, IGc1, IGc2, IG.
          Length = 85

 Score = 50.6 bits (121), Expect = 5e-08
 Identities = 26/84 (30%), Positives = 32/84 (38%), Gaps = 6/84 (7%)

Query: 381 VPIGRTATLECEVEAFPLSVRYWEFIDGTLIEHDGVKYSISDVDKGSYQYIMQLNISNVN 440
           V  G + TL CE    P     W    G L+   G ++S+S     S      L ISNV 
Sbjct: 6   VKEGESVTLSCEASGSPPPEVTWYKQGGKLLAESG-RFSVSRSGSTST-----LTISNVT 59

Query: 441 ISDFDTYRCISKNEVDIAKGSLIL 464
             D  TY C + N    A     L
Sbjct: 60  PEDSGTYTCAATNSSGSASSGTTL 83



 Score = 47.5 bits (113), Expect = 7e-07
 Identities = 22/73 (30%), Positives = 34/73 (46%), Gaps = 4/73 (5%)

Query: 298 SSPIVVREGDNVRLRCVATGHPKPSIIWKTSENKPITLG---SWKDSAVASSTLNFTRIN 354
              + V+EG++V L C A+G P P + W     K +      S   S  ++STL  + + 
Sbjct: 1   PPSVTVKEGESVTLSCEASGSPPPEVTWYKQGGKLLAESGRFSVSRSG-STSTLTISNVT 59

Query: 355 RINMGAYMCLADN 367
             + G Y C A N
Sbjct: 60  PEDSGTYTCAATN 72


>gnl|CDD|214652 smart00409, IG, Immunoglobulin. 
          Length = 85

 Score = 50.6 bits (121), Expect = 5e-08
 Identities = 26/84 (30%), Positives = 32/84 (38%), Gaps = 6/84 (7%)

Query: 381 VPIGRTATLECEVEAFPLSVRYWEFIDGTLIEHDGVKYSISDVDKGSYQYIMQLNISNVN 440
           V  G + TL CE    P     W    G L+   G ++S+S     S      L ISNV 
Sbjct: 6   VKEGESVTLSCEASGSPPPEVTWYKQGGKLLAESG-RFSVSRSGSTST-----LTISNVT 59

Query: 441 ISDFDTYRCISKNEVDIAKGSLIL 464
             D  TY C + N    A     L
Sbjct: 60  PEDSGTYTCAATNSSGSASSGTTL 83



 Score = 47.5 bits (113), Expect = 7e-07
 Identities = 22/73 (30%), Positives = 34/73 (46%), Gaps = 4/73 (5%)

Query: 298 SSPIVVREGDNVRLRCVATGHPKPSIIWKTSENKPITLG---SWKDSAVASSTLNFTRIN 354
              + V+EG++V L C A+G P P + W     K +      S   S  ++STL  + + 
Sbjct: 1   PPSVTVKEGESVTLSCEASGSPPPEVTWYKQGGKLLAESGRFSVSRSG-STSTLTISNVT 59

Query: 355 RINMGAYMCLADN 367
             + G Y C A N
Sbjct: 60  PEDSGTYTCAATN 72


>gnl|CDD|191810 pfam07679, I-set, Immunoglobulin I-set domain. 
          Length = 90

 Score = 49.2 bits (118), Expect = 2e-07
 Identities = 18/74 (24%), Positives = 34/74 (45%), Gaps = 9/74 (12%)

Query: 299 SPIVVREGDNVRLRCVATGHPKPSIIWKTSENKPITLGS-----WKDSAVASSTLNFTRI 353
             + V+EG++ R  C  TG P P++ W   + +P+         ++     + TL  + +
Sbjct: 8   KDVEVQEGESARFTCTVTGDPDPTVSW-FKDGQPLRSSDRFKVTYEGG---TYTLTISNV 63

Query: 354 NRINMGAYMCLADN 367
              + G Y C+A N
Sbjct: 64  QPDDEGKYTCVATN 77



 Score = 48.8 bits (117), Expect = 3e-07
 Identities = 21/86 (24%), Positives = 32/86 (37%), Gaps = 7/86 (8%)

Query: 370 PQILIRNRRVGVPIGRTATLECEVEAFPLSVRYWEFIDGTLIEHDGVKYSISDVDKGSYQ 429
           P+   + + V V  G +A   C V   P     W F DG  +     ++ ++        
Sbjct: 1   PKFTQKPKDVEVQEGESARFTCTVTGDPDPTVSW-FKDGQPL-RSSDRFKVTYEGGTY-- 56

Query: 430 YIMQLNISNVNISDFDTYRCISKNEV 455
               L ISNV   D   Y C++ N  
Sbjct: 57  ---TLTISNVQPDDEGKYTCVATNSA 79


>gnl|CDD|143264 cd05856, Ig2_FGFRL1-like, Second immunoglobulin (Ig)-like domain of
           fibroblast growth factor (FGF) receptor_like-1(FGFRL1). 
           Ig2_FGFRL1-like: second immunoglobulin (Ig)-like domain
           of fibroblast growth factor (FGF)
           receptor_like-1(FGFRL1). FGFRL1 is comprised of a signal
           peptide, three extracellular Ig-like modules, a
           transmembrane segment, and a short intracellular domain.
           FGFRL1 is expressed preferentially in skeletal tissues.
           Similar to FGF receptors, the expressed protein
           interacts specifically with heparin and with FGF2.
           FGFRL1 does not have a protein tyrosine kinase domain at
           its C terminus; neither does its cytoplasmic domain
           appear to interact with a signaling partner. It has been
           suggested that FGFRL1 may not have any direct signaling
           function, but instead acts as a decoy receptor trapping
           FGFs and preventing them from binding other receptors.
          Length = 82

 Score = 47.1 bits (112), Expect = 1e-06
 Identities = 24/67 (35%), Positives = 36/67 (53%), Gaps = 1/67 (1%)

Query: 306 GDNVRLRCVATGHPKPSIIWKTSENKPITLGSWKDSAVASSTLNFTRINRINMGAYMCLA 365
           G +VRL+CVA+G+P+P I W   +NKP+T     +S     TL+   +   + G Y C  
Sbjct: 9   GSSVRLKCVASGNPRPDITW-LKDNKPLTPTEIGESRKKKWTLSLKNLKPEDSGKYTCHV 67

Query: 366 DNVPPQI 372
            N   +I
Sbjct: 68  SNRAGEI 74


>gnl|CDD|197706 smart00408, IGc2, Immunoglobulin C-2 Type. 
          Length = 63

 Score = 44.7 bits (106), Expect = 4e-06
 Identities = 22/63 (34%), Positives = 32/63 (50%), Gaps = 3/63 (4%)

Query: 305 EGDNVRLRCVATGHPKPSIIWKTSENKPITLGSWKDSAVASSTLNFTRINRINMGAYMCL 364
           EG +V L C A G+P P+I W   + KP  L        + STL    ++  + G Y C+
Sbjct: 1   EGQSVTLTCPAEGNPVPNITW-LKDGKP--LPESNRFVASGSTLTIKSVSLEDSGLYTCV 57

Query: 365 ADN 367
           A+N
Sbjct: 58  AEN 60



 Score = 34.7 bits (80), Expect = 0.013
 Identities = 20/73 (27%), Positives = 30/73 (41%), Gaps = 11/73 (15%)

Query: 383 IGRTATLECEVEAFPLSVRYWEFIDGTLIEHDGVKYSISDVDKGSYQYIMQLNISNVNIS 442
            G++ TL C  E  P+    W   DG  +       S   V  GS      L I +V++ 
Sbjct: 1   EGQSVTLTCPAEGNPVPNITWLK-DGKPLPE-----SNRFVASGST-----LTIKSVSLE 49

Query: 443 DFDTYRCISKNEV 455
           D   Y C+++N  
Sbjct: 50  DSGLYTCVAENSA 62


>gnl|CDD|206066 pfam13895, Ig_2, Immunoglobulin domain.  This domain contains
           immunoglobulin-like domains.
          Length = 80

 Score = 44.7 bits (106), Expect = 6e-06
 Identities = 24/76 (31%), Positives = 35/76 (46%), Gaps = 15/76 (19%)

Query: 295 IFSSSPIVVREGDNVRLRCVATGHPKPSIIWKTSENKPITLGSWKDSAVASSTLN---FT 351
           + + SP VV EG++V L C A G+P P+  W             KD    SS+ N     
Sbjct: 3   VLTPSPTVVFEGEDVTLTCSAPGNPPPNYTWY------------KDGVPLSSSQNGFFTP 50

Query: 352 RINRINMGAYMCLADN 367
            ++  + G Y C+A N
Sbjct: 51  NVSAEDSGTYTCVASN 66


>gnl|CDD|143165 cd00096, Ig, Immunoglobulin domain.  Ig: immunoglobulin (Ig) domain
           found in the Ig superfamily. The Ig superfamily is a
           heterogeneous group of proteins, built on a common fold
           comprised of a sandwich of two beta sheets. Members of
           this group are components of immunoglobulin, neuroglia,
           cell surface glycoproteins, such as, T-cell receptors,
           CD2, CD4, CD8, and membrane glycoproteins, such as,
           butyrophilin and chondroitin sulfate proteoglycan core
           protein. A predominant feature of most Ig domains is a
           disulfide bridge connecting the two beta-sheets with a
           tryptophan residue packed against the disulfide bond.
          Length = 74

 Score = 42.1 bits (98), Expect = 4e-05
 Identities = 19/64 (29%), Positives = 31/64 (48%), Gaps = 5/64 (7%)

Query: 309 VRLRCVATGHPKPSIIWK-----TSENKPITLGSWKDSAVASSTLNFTRINRINMGAYMC 363
           V L C+A+G P P+I W         +    + S + ++  SSTL  + +   + G Y C
Sbjct: 1   VTLTCLASGPPPPTITWLKNGKPLPSSVLTRVRSSRGTSSGSSTLTISNVTLEDSGTYTC 60

Query: 364 LADN 367
           +A N
Sbjct: 61  VASN 64



 Score = 42.1 bits (98), Expect = 5e-05
 Identities = 19/76 (25%), Positives = 25/76 (32%), Gaps = 3/76 (3%)

Query: 387 ATLECEVEAFPLSVRYWEFIDGTLIEHDGVKYSISDVDKGSYQYIMQLNISNVNISDFDT 446
            TL C     P     W      L      +   S    G+      L ISNV + D  T
Sbjct: 1   VTLTCLASGPPPPTITWLKNGKPLPSSVLTRVRSSR---GTSSGSSTLTISNVTLEDSGT 57

Query: 447 YRCISKNEVDIAKGSL 462
           Y C++ N       S+
Sbjct: 58  YTCVASNSAGTVSASV 73


>gnl|CDD|143205 cd05728, Ig4_Contactin-2-like, Fourth Ig domain of the neural cell
           adhesion molecule contactin-2 and similar proteins.
           Ig4_Contactin-2-like: fourth Ig domain of the neural
           cell adhesion molecule contactin-2. Contactins are
           comprised of six Ig domains followed by four fibronectin
           type III (FnIII) domains anchored to the membrane by
           glycosylphosphatidylinositol. Contactin-2 (aliases
           TAG-1, axonin-1) facilitates cell adhesion by homophilic
           binding between molecules in apposed membranes. The
           first four Ig domains form the intermolecular binding
           fragment which arranges as a compact U-shaped module by
           contacts between Ig domains 1 and 4, and domains 2 and
           3. It has been proposed that a linear zipper-like array
           forms, from contactin-2 molecules alternatively provided
           by the two apposed membranes.
          Length = 85

 Score = 42.2 bits (99), Expect = 6e-05
 Identities = 19/63 (30%), Positives = 33/63 (52%), Gaps = 5/63 (7%)

Query: 306 GDNVRLRCVATGHPKPSIIW-KTSENKPITLGSWKDSAVASSTLNFTRINRINMGAYMCL 364
           G ++R  C A+G+P+P+  W K  +     L S     V +  L  T+++  + G Y C+
Sbjct: 14  GSSLRWECKASGNPRPAYRWLKNGQ----PLASENRIEVEAGDLRITKLSLSDSGMYQCV 69

Query: 365 ADN 367
           A+N
Sbjct: 70  AEN 72


>gnl|CDD|222457 pfam13927, Ig_3, Immunoglobulin domain.  This family contains
           immunoglobulin-like domains.
          Length = 74

 Score = 40.8 bits (95), Expect = 1e-04
 Identities = 22/85 (25%), Positives = 30/85 (35%), Gaps = 11/85 (12%)

Query: 369 PPQILIRNRRVGVPIGRTATLECEVEAFPLSVRYWEFIDGTLIEHDGVKYSISDVDKGSY 428
            P I + +    V  G   TL C  E  P       + +G          SIS    G  
Sbjct: 1   KPVITV-SPSPSVTSGGGVTLTCSAEGGPPPPTISWYRNG----------SISGGSGGLG 49

Query: 429 QYIMQLNISNVNISDFDTYRCISKN 453
                L +S+V   D  TY C++ N
Sbjct: 50  SSGSTLTLSSVTSEDSGTYTCVASN 74



 Score = 33.9 bits (77), Expect = 0.042
 Identities = 23/75 (30%), Positives = 30/75 (40%), Gaps = 4/75 (5%)

Query: 294 HIFSSSPIVVREGDNVRLRCVATGHP-KPSIIWKTSENKPITLGSWKDSAVASSTLNFTR 352
            I  S    V  G  V L C A G P  P+I W  + +     G    S    STL  + 
Sbjct: 3   VITVSPSPSVTSGGGVTLTCSAEGGPPPPTISWYRNGSISGGSGGLGSS---GSTLTLSS 59

Query: 353 INRINMGAYMCLADN 367
           +   + G Y C+A N
Sbjct: 60  VTSEDSGTYTCVASN 74


>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
          Length = 413

 Score = 45.0 bits (106), Expect = 1e-04
 Identities = 23/156 (14%), Positives = 34/156 (21%), Gaps = 8/156 (5%)

Query: 116 GPIGQKGEIGAKGEMGVKGERGEKGAPGVVGPPGPQGIPGDEGKRGPKGDMGYPGPQGIP 175
            P G   E+ +            +       PP P   PG      P+     P  Q   
Sbjct: 106 SPSGSAEELASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSS 165

Query: 176 GDEGKRGPKGDMGYPGRIGL---DGRDGLPGEPGLDGIPGRNGMDGIPGKDGLPGKDGIP 232
             +       +   P        D       E      P ++  D  PG+   P     P
Sbjct: 166 FLQPSHEDSPEEPEPPT-SEPEPDSPGPPQSETPTSSPPPQSPPDE-PGEPQSPTPQQAP 223

Query: 233 GTNGTNGEKGMTGPRGNRGKSGINGVPGTPAICAYK 268
             N     +    P                   +Y 
Sbjct: 224 SPNTQQAVEHEDEPTEP---EREGPPFPGHRSHSYT 256



 Score = 40.3 bits (94), Expect = 0.004
 Identities = 19/136 (13%), Positives = 30/136 (22%), Gaps = 1/136 (0%)

Query: 125 GAKGEMGVKGERGEKGAPGVVGPPGPQGIPGDEGKRGPKGDMGYPGPQGIPGDEGKRGPK 184
            +  E   +   GEK   G  GP G            P G          P +     P+
Sbjct: 70  ESDTETAEESRHGEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSPE 129

Query: 185 GDMGYPGRIGLDGRDGLPGEPGLDGIPGRNGMDGIPGKDGLPGKDGIPGTNGTNGEKGMT 244
               +            P EP        +             +D          E    
Sbjct: 130 SPASHSPP-PSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPD 188

Query: 245 GPRGNRGKSGINGVPG 260
            P   + ++  +  P 
Sbjct: 189 SPGPPQSETPTSSPPP 204



 Score = 35.7 bits (82), Expect = 0.096
 Identities = 14/130 (10%), Positives = 21/130 (16%), Gaps = 4/130 (3%)

Query: 120 QKGEIGAKGEMGVKGERGEKGAPGVVGPPGPQGIPGDEGKRGPKGDMGYPGPQGIPGDEG 179
           + GE   +G+ G  G   E        P G                     P        
Sbjct: 80  RHGEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESPASHSPPPS 139

Query: 180 KRGPKGDMGYPGRIGLDGRDGLPGEPGLDGIPGRNGMDGIPGKDGLP---GKDGIPGTNG 236
                G    P                       +  +        P             
Sbjct: 140 PPSHPGP-HEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETP 198

Query: 237 TNGEKGMTGP 246
           T+     + P
Sbjct: 199 TSSPPPQSPP 208


>gnl|CDD|215677 pfam00047, ig, Immunoglobulin domain.  Members of the
           immunoglobulin superfamily are found in hundreds of
           proteins of different functions. Examples include
           antibodies, the giant muscle kinase titin and receptor
           tyrosine kinases. Immunoglobulin-like domains may be
           involved in protein-protein and protein-ligand
           interactions. The Pfam alignments do not include the
           first and last strand of the immunoglobulin-like domain.
          Length = 62

 Score = 38.7 bits (90), Expect = 5e-04
 Identities = 16/61 (26%), Positives = 28/61 (45%), Gaps = 4/61 (6%)

Query: 306 GDNVRLRCVATGHPKPSIIWKTSENKPITLGSW---KDSAVASSTLNFTRINRINMGAYM 362
           G +V L C  +G P+  + W   E K +   +     ++ V+S TL  + +   + G Y 
Sbjct: 1   GSSVTLTCSVSGPPQVDVTWFK-EGKGLEESTTVGTDENRVSSITLTISNVTPEDSGTYT 59

Query: 363 C 363
           C
Sbjct: 60  C 60



 Score = 31.4 bits (71), Expect = 0.23
 Identities = 18/67 (26%), Positives = 19/67 (28%), Gaps = 8/67 (11%)

Query: 384 GRTATLECEVEAFPLSVRYWEFIDGTLIE-HDGVKYSISDVDKGSYQYIMQLNISNVNIS 442
           G + TL C V   P     W      L E                      L ISNV   
Sbjct: 1   GSSVTLTCSVSGPPQVDVTWFKEGKGLEESTTVGTDENRVSSI-------TLTISNVTPE 53

Query: 443 DFDTYRC 449
           D  TY C
Sbjct: 54  DSGTYTC 60


>gnl|CDD|143207 cd05730, Ig3_NCAM-1_like, Third immunoglobulin (Ig)-like domain of
           Neural Cell Adhesion Molecule NCAM-1 (NCAM).
           Ig3_NCAM-1_like: domain similar to the third
           immunoglobulin (Ig)-like domain of Neural Cell Adhesion
           Molecule NCAM-1 (NCAM). NCAM plays important roles in
           the development and regeneration of the central nervous
           system, in synaptogenesis and neural migration. NCAM
           mediates cell-cell and cell-substratum recognition and
           adhesion via homophilic (NCAM-NCAM), and heterophilic
           (NCAM-non-NCAM), interactions. NCAM is expressed as
           three major isoforms having different intracellular
           extensions. The extracellular portion of NCAM has five
           N-terminal Ig-like domains and two fibronectin type III
           domains. The double zipper adhesion complex model for
           NCAM homophilic binding involves Ig1, Ig2, and Ig3. By
           this model, Ig1 and Ig2 mediate dimerization of NCAM
           molecules situated on the same cell surface (cis
           interactions), and Ig3 domains mediate interactions
           between NCAM molecules expressed on the surface of
           opposing cells (trans interactions), through binding to
           the Ig1 and Ig2 domains. The adhesive ability of NCAM is
           modulated by the addition of polysialic acid chains to
           the fifth Ig-like domain.
          Length = 95

 Score = 39.5 bits (92), Expect = 6e-04
 Identities = 29/98 (29%), Positives = 45/98 (45%), Gaps = 10/98 (10%)

Query: 369 PPQILIRNRRVGVP--IGRTATLECEVEAFPLSVRYWEFIDGTLIEHDGVKYSISDVDKG 426
           PP I  R   V     +G++ TL C+ + FP     W   DG  IE    KYS ++   G
Sbjct: 1   PPTIRARQSEVNATANLGQSVTLACDADGFPEPTMTWT-KDGEPIESGEEKYSFNE--DG 57

Query: 427 SYQYIMQLNISNVNISDFDTYRCISKNEVDIAKGSLIL 464
           S     ++ I +V+  D   Y CI++N+    +  + L
Sbjct: 58  S-----EMTILDVDKLDEAEYTCIAENKAGEQEAEIHL 90



 Score = 36.4 bits (84), Expect = 0.008
 Identities = 24/83 (28%), Positives = 42/83 (50%), Gaps = 5/83 (6%)

Query: 286 PPTIVGSEHIFSSSPIVVREGDNVRLRCVATGHPKPSIIWKTSENKPITLGSWKDS-AVA 344
           PPTI   +   +++      G +V L C A G P+P++ W T + +PI  G  K S    
Sbjct: 1   PPTIRARQSEVNAT---ANLGQSVTLACDADGFPEPTMTW-TKDGEPIESGEEKYSFNED 56

Query: 345 SSTLNFTRINRINMGAYMCLADN 367
            S +    +++++   Y C+A+N
Sbjct: 57  GSEMTILDVDKLDEAEYTCIAEN 79


>gnl|CDD|143277 cd05869, Ig5_NCAM-1, Fifth immunoglobulin (Ig)-like domain of
           Neural Cell Adhesion Molecule NCAM-1 (NCAM).
           Ig5_NCAM-1: The fifth immunoglobulin (Ig)-like domain of
           Neural Cell Adhesion Molecule NCAM-1 (NCAM). NCAM plays
           important roles in the development and regeneration of
           the central nervous system, in synaptogenesis and neural
           migration. NCAM mediates cell-cell and cell-substratum
           recognition and adhesion via homophilic (NCAM-NCAM) and
           heterophilic (NCAM-non-NCAM) interactions. NCAM is
           expressed as three major isoforms having different
           intracellular extensions. The extracellular portion of
           NCAM has five N-terminal Ig-like domains and two
           fibronectin type III domains. The double zipper adhesion
           complex model for NCAM homophilic binding involves Ig1,
           Ig2, and Ig3. By this model, Ig1 and Ig2 mediate
           dimerization of NCAM molecules situated on the same cell
           surface (cis interactions), and Ig3 domains mediate
           interactions between NCAM molecules expressed on the
           surface of opposing cells (trans interactions), through
           binding to the Ig1 and Ig2 domains. The adhesive ability
           of NCAM is modulated by the addition of polysialic acid
           chains to the fifth Ig-like domain.
          Length = 97

 Score = 38.4 bits (89), Expect = 0.001
 Identities = 22/67 (32%), Positives = 30/67 (44%), Gaps = 6/67 (8%)

Query: 307 DNVRLRCVATGHPKPSIIWKT------SENKPITLGSWKDSAVASSTLNFTRINRINMGA 360
           + + L C A+G P PSI W+T      SE K +       S    S+L    I   + G 
Sbjct: 18  EQITLTCEASGDPIPSITWRTSTRNISSEEKTLDGHIVVRSHARVSSLTLKYIQYTDAGE 77

Query: 361 YMCLADN 367
           Y+C A N
Sbjct: 78  YLCTASN 84


>gnl|CDD|143222 cd05745, Ig3_Peroxidasin, Third immunoglobulin (Ig)-like domain of
           peroxidasin.  Ig3_Peroxidasin: the third immunoglobulin
           (Ig)-like domain in peroxidasin. Peroxidasin has a
           peroxidase domain and interacting extracellular motifs
           containing four Ig-like domains. It has been suggested
           that peroxidasin is secreted and has functions related
           to the stabilization of the extracellular matrix. It may
           play a part in various other important processes such as
           removal and destruction of cells which have undergone
           programmed cell death, and protection of the organism
           against non-self.
          Length = 74

 Score = 37.6 bits (87), Expect = 0.002
 Identities = 20/64 (31%), Positives = 32/64 (50%), Gaps = 2/64 (3%)

Query: 305 EGDNVRLRCVATGHPKPSIIWKTSENKPITLGSWKDSAVASSTLNFTRINRINMGAYMCL 364
           EG  V   C A G+P+P I W T     +++   +   ++S TL  +R+   + G Y C 
Sbjct: 1   EGQTVDFLCEAQGYPQPVIAW-TKGGSQLSVDR-RHLVLSSGTLRISRVALHDQGQYECQ 58

Query: 365 ADNV 368
           A N+
Sbjct: 59  AVNI 62


>gnl|CDD|143206 cd05729, Ig2_FGFR_like, Second immunoglobulin (Ig)-like domain of
           fibroblast growth factor (FGF) receptor and similar
           proteins.  Ig2_FGFR_like: domain similar to the second
           immunoglobulin (Ig)-like domain of fibroblast growth
           factor (FGF) receptor. FGF receptors bind FGF signaling
           polypeptides. FGFs participate in multiple processes
           such as morphogenesis, development, and angiogenesis.
           FGFs bind to four FGF receptor tyrosine kinases (FGFR1,
           -2, -3, -4). Receptor diversity is controlled by
           alternative splicing producing splice variants with
           different ligand binding characteristics and different
           expression patterns. FGFRs have an extracellular region
           comprised of three Ig-like domains, a single
           transmembrane helix, and an intracellular tyrosine
           kinase domain. Ligand binding and specificity reside in
           the Ig-like domains 2 and 3, and the linker region that
           connects these two. FGFR activation and signaling depend
           on FGF-induced dimerization, a process involving cell
           surface heparin or heparan sulfate proteoglycans. This
           group also contains fibroblast growth factor (FGF)
           receptor_like-1(FGFRL1). FGFRL1 does not have a protein
           tyrosine kinase domain at its C terminus; neither does
           its cytoplasmic domain appear to interact with a
           signaling partner. It has been suggested that FGFRL1 may
           not have any direct signaling function, but instead acts
           as a decoy receptor trapping FGFs and preventing them
           from binding other receptors.
          Length = 85

 Score = 37.4 bits (87), Expect = 0.003
 Identities = 21/66 (31%), Positives = 33/66 (50%), Gaps = 6/66 (9%)

Query: 306 GDNVRLRCVATGHPKPSIIWKTSENKPIT----LGSWKDSAVASSTLNFTRINRINMGAY 361
           G  VRL+C A+G+P+P+I W   + KP      +G +K       TL    +   + G Y
Sbjct: 9   GSTVRLKCPASGNPRPTITW-LKDGKPFKKEHRIGGYK-VRKKKWTLILESVVPSDSGKY 66

Query: 362 MCLADN 367
            C+ +N
Sbjct: 67  TCIVEN 72


>gnl|CDD|143169 cd04968, Ig3_Contactin_like, Third Ig domain of contactin.
           Ig3_Contactin_like: Third Ig domain of contactins.
           Contactins are neural cell adhesion molecules and are
           comprised of six Ig domains followed by four fibronectin
           type III (FnIII) domains anchored to the membrane by
           glycosylphosphatidylinositol. The first four Ig domains
           form the intermolecular binding fragment, which arranges
           as a compact U-shaped module via contacts between Ig
           domains 1 and 4, and between Ig domains 2 and 3.
           Contactin-2 (TAG-1, axonin-1) may play a part in the
           neuronal processes of neurite outgrowth, axon guidance
           and fasciculation, and neuronal migration. This group
           also includes contactin-1 and contactin-5. The different
           contactins show different expression patterns in the
           central nervous system. During development and in
           adulthood, contactin-2 is transiently expressed in
           subsets of central and peripheral neurons. Contactin-5
           is expressed specifically in the rat postnatal nervous
           system, peaking at about 3 weeks postnatal, and a lack
           of contactin-5 (NB-2) results in an impairment of
           neuronal activity in the rat auditory system.
           Contactin-5 is highly expressed in the adult human brain
           in the occipital lobe and in the amygdala. Contactin-1
           is differentially expressed in tumor tissues and may,
           through a RhoA mechanism, facilitate invasion and
           metastasis of human lung adenocarcinoma.
          Length = 88

 Score = 37.1 bits (86), Expect = 0.004
 Identities = 19/63 (30%), Positives = 30/63 (47%), Gaps = 3/63 (4%)

Query: 306 GDNVRLRCVATGHPKPSIIWKTSENKPITLGSWKDSAVASSTLNFTRINRINMGAYMCLA 365
           G NV L C A G+P P I W+  +       S  + +++ + L    I   + G Y C A
Sbjct: 16  GQNVTLECFALGNPVPQIKWRKVDGSMP---SSAEISMSGAVLKIPNIQFEDEGTYECEA 72

Query: 366 DNV 368
           +N+
Sbjct: 73  ENI 75


>gnl|CDD|143203 cd05726, Ig4_Robo, Third immunoglobulin (Ig)-like domain in Robo
           (roundabout) receptors.  Ig4_Robo: domain similar to the
           third immunoglobulin (Ig)-like domain in Robo
           (roundabout) receptors. Robo receptors play a role in
           the development of the central nervous system (CNS), and
           are receptors of Slit protein. Slit is a repellant
           secreted by the neural cells in the midline. Slit acts
           through Robo to prevent most neurons from crossing the
           midline from either side. Three mammalian Robo homologs
           (robo1, -2, and -3), and three mammalian Slit homologs
           (Slit-1, -2, -3), have been identified. Commissural
           axons, which cross the midline, express low levels of
           Robo; longitudinal axons, which avoid the midline,
           express high levels of Robo. robo1, -2, and -3 are
           expressed by commissural neurons in the vertebrate
           spinal cord and Slits 1, -2, -3 are expressed at the
           ventral midline. Robo-3 is a divergent member of the
           Robo family which instead of being a positive regulator
           of slit responsiveness, antagonizes slit responsiveness
           in precrossing axons.  The Slit-Robo interaction is
           mediated by the second leucine-rich repeat (LRR) domain
           of Slit and the two N-terminal Ig domains of Robo, Ig1
           and Ig2. The primary Robo binding site for Slit2 has
           been shown by surface plasmon resonance experiments and
           mutational analysis to be the Ig1 domain, while the
           Ig2 domain has been proposed to harbor a weak secondary
           binding site.
          Length = 90

 Score = 36.9 bits (85), Expect = 0.005
 Identities = 24/77 (31%), Positives = 37/77 (48%), Gaps = 8/77 (10%)

Query: 306 GDNVRLRCVATGHPKPSIIWKTSENKPITLGSWKD-------SAVASSTLNFTRINRINM 358
           G  V  +C ATG+P+P+I W+  E     L S++        S   +  L  T + R ++
Sbjct: 1   GRTVTFQCEATGNPQPAIFWQ-KEGSQNLLFSYQPPQSSSRFSVSQTGDLTITNVQRSDV 59

Query: 359 GAYMCLADNVPPQILIR 375
           G Y+C   NV   IL +
Sbjct: 60  GYYICQTLNVAGSILTK 76


>gnl|CDD|143317 cd07693, Ig1_Robo, First immunoglobulin (Ig)-like domain in Robo
           (roundabout) receptors and similar proteins.  Ig1_Robo:
           domain similar to the first immunoglobulin (Ig)-like
           domain in Robo (roundabout) receptors. Robo receptors
           play a role in the development of the central nervous
           system (CNS), and are receptors of Slit protein. Slit is
           a repellant secreted by the neural cells in the midline.
           Slit acts through Robo to prevent most neurons from
           crossing the midline from either side. Three mammalian
           Robo homologs (robo1, -2, and -3), and three mammalian
           Slit homologs (Slit-1, -2, -3), have been identified.
           Commissural axons, which cross the midline, express low
           levels of Robo; longitudinal axons, which avoid the
           midline, express high levels of Robo. robo1, -2, and -3
           are expressed by commissural neurons in the vertebrate
           spinal cord and Slits 1, -2, -3 are expressed at the
           ventral midline. Robo-3 is a divergent member of the
           Robo family which instead of being a positive regulator
           of slit responsiveness, antagonizes slit responsiveness
           in precrossing axons.  The Slit-Robo interaction is
           mediated by the second leucine-rich repeat (LRR) domain
           of Slit and the two N-terminal Ig domains of Robo, Ig1
           and Ig2. The primary Robo binding site for Slit2 has
           been shown by surface plasmon resonance experiments and
           mutational analysis to be the Ig1 domain, while the
           Ig2 domain has been proposed to harbor a weak secondary
           binding site.
          Length = 100

 Score = 36.8 bits (85), Expect = 0.005
 Identities = 35/113 (30%), Positives = 49/113 (43%), Gaps = 23/113 (20%)

Query: 286 PPTIVGSEHIFSSSPIVVREGDNVRLRCVATGHPKPSIIWKTSENKPITLGSWKDS---A 342
           PP IV  EH    S ++V +GD   L C A G P P+I W  +     T      S    
Sbjct: 1   PPRIV--EH---PSDLIVSKGDPATLNCKAEGRPTPTIQWLKNGQPLETDKDDPRSHRIV 55

Query: 343 VASSTLNFTRI-----NRINMGAYMCLADNVPPQILIRNRRVGVPIGRTATLE 390
           + S +L F R+      R + G Y+C+A N           +G  + R A+LE
Sbjct: 56  LPSGSLFFLRVVHGRKGRSDEGVYVCVAHN----------SLGEAVSRNASLE 98


>gnl|CDD|143179 cd04978, Ig4_L1-NrCAM_like, Fourth immunoglobulin (Ig)-like domain
           of L1, Ng-CAM (Neuron-glia CAM cell adhesion molecule),
           and NrCAM (Ng-CAM-related).  Ig4_L1-NrCAM_like: fourth
           immunoglobulin (Ig)-like domain of L1, Ng-CAM
           (Neuron-glia CAM cell adhesion molecule), and NrCAM
           (Ng-CAM-related). These proteins belong to the L1
           subfamily of cell adhesion molecules (CAMs) and are
           comprised of an extracellular region having six Ig-like
           domains and five fibronectin type III domains, a
           transmembrane region and an intracellular domain. These
           molecules are primarily expressed in the nervous system.
           L1 is associated with an X-linked recessive disorder,
           X-linked hydrocephalus, MASA syndrome, or spastic
           paraplegia type 1, that involves abnormalities of axonal
           growth.
          Length = 76

 Score = 35.8 bits (83), Expect = 0.009
 Identities = 21/73 (28%), Positives = 30/73 (41%), Gaps = 8/73 (10%)

Query: 306 GDNVRLRCVATGHPKPSIIWKTS----ENKPITLGSWKDSAVASSTLNFTRINRINMGAY 361
           G+  RL C A G P+P+I W+ +    E  P          V   TL  + +   +   Y
Sbjct: 1   GETGRLDCEAEGIPQPTITWRLNGVPIEELPPDPRRR----VDGGTLILSNVQPNDTAVY 56

Query: 362 MCLADNVPPQILI 374
            C A NV   +L 
Sbjct: 57  QCNASNVHGYLLA 69


>gnl|CDD|143202 cd05725, Ig3_Robo, Third immunoglobulin (Ig)-like domain in Robo
           (roundabout) receptors.  Ig3_Robo: domain similar to the
           third immunoglobulin (Ig)-like domain in Robo
           (roundabout) receptors. Robo receptors play a role in
           the development of the central nervous system (CNS), and
           are receptors of Slit protein. Slit is a repellant
           secreted by the neural cells in the midline. Slit acts
           through Robo to prevent most neurons from crossing the
           midline from either side. Three mammalian Robo homologs
           (robo1, -2, and -3), and three mammalian Slit homologs
           (Slit-1, -2, -3), have been identified. Commissural
           axons, which cross the midline, express low levels of
           Robo; longitudinal axons, which avoid the midline,
           express high levels of Robo. robo1, -2, and -3 are
           expressed by commissural neurons in the vertebrate
           spinal cord and Slits 1, -2, -3 are expressed at the
           ventral midline. Robo-3 is a divergent member of the
           Robo family which instead of being a positive regulator
           of slit responsiveness, antagonizes slit responsiveness
           in precrossing axons.  The Slit-Robo interaction is
           mediated by the second leucine-rich repeat (LRR) domain
           of Slit and the two N-terminal Ig domains of Robo, Ig1
           and Ig2. The primary Robo binding site for Slit2 has
           been shown by surface plasmon resonance experiments and
           mutational analysis to be the Ig1 domain, while the
           Ig2 domain has been proposed to harbor a weak secondary
           binding site.
          Length = 69

 Score = 34.3 bits (79), Expect = 0.023
 Identities = 13/61 (21%), Positives = 28/61 (45%), Gaps = 5/61 (8%)

Query: 309 VRLRCVATGHPKPSIIWKTSENK-PITLGSWKDSAVASSTLNFTRINRINMGAYMCLADN 367
           V  +C   G P P+++W+  + + P      +   +   +L    +   + G+Y C A+N
Sbjct: 1   VEFQCEVGGDPVPTVLWRKEDGELPKG----RAEILDDKSLKIRNVTAGDEGSYTCEAEN 56

Query: 368 V 368
           +
Sbjct: 57  M 57


>gnl|CDD|143209 cd05732, Ig5_NCAM-1_like, Fifth immunoglobulin (Ig)-like domain of
           Neural Cell Adhesion Molecule NCAM-1 (NCAM) and similar
           proteins.  Ig5_NCAM-1 like: domain similar to the fifth
           immunoglobulin (Ig)-like domain of Neural Cell Adhesion
           Molecule NCAM-1 (NCAM). NCAM plays important roles in
           the development and regeneration of the central nervous
           system, in synaptogenesis and neural migration. NCAM
           mediates cell-cell and cell-substratum recognition and
           adhesion via homophilic (NCAM-NCAM), and heterophilic
           (NCAM-non-NCAM), interactions. NCAM is expressed as
           three major isoforms having different intracellular
           extensions. The extracellular portion of NCAM has five
           N-terminal Ig-like domains and two fibronectin type III
           domains. The double zipper adhesion complex model for
           NCAM homophilic binding involves Ig1, Ig2, and Ig3. By
           this model, Ig1 and Ig2 mediate dimerization of NCAM
           molecules situated on the same cell surface (cis
           interactions), and Ig3 domains mediate interactions
           between NCAM molecules expressed on the surface of
           opposing cells (trans interactions), through binding to
           the Ig1 and Ig2 domains. The adhesive ability of NCAM is
           modulated by the addition of polysialic acid chains to
           the fifth Ig-like domain. Also included in this group is
           NCAM-2 (also known as OCAM/mamFas II and RNCAM). NCAM-2
           is differentially expressed in the developing and mature
           olfactory epithelium (OE).
          Length = 96

 Score = 34.8 bits (80), Expect = 0.028
 Identities = 17/69 (24%), Positives = 26/69 (37%), Gaps = 6/69 (8%)

Query: 305 EGDNVRLRCVATGHPKPSIIWKTS------ENKPITLGSWKDSAVASSTLNFTRINRINM 358
           E + + L C A G P P I W+ +       +K +            S+L    +   + 
Sbjct: 15  ELEQITLTCEAEGDPIPEITWRRATRNFSEGDKSLDGRIVVRGHARVSSLTLKDVQLTDA 74

Query: 359 GAYMCLADN 367
           G Y C A N
Sbjct: 75  GRYDCEASN 83


>gnl|CDD|143284 cd05876, Ig3_L1-CAM, Third immunoglobulin (Ig)-like domain of the
           L1 cell adhesion molecule (CAM).  Ig3_L1-CAM:  third
           immunoglobulin (Ig)-like domain of the L1 cell adhesion
           molecule (CAM). L1 belongs to the L1 subfamily of cell
           adhesion molecules (CAMs) and is comprised of an
           extracellular region having six Ig-like domains, five
           fibronectin type III domains, a transmembrane region and
           an intracellular domain. L1 is primarily expressed in
           the nervous system and is involved in its development
           and function. L1 is associated with an X-linked
           recessive disorder, X-linked hydrocephalus, MASA
           syndrome, or spastic paraplegia type 1, that involves
           abnormalities of axonal growth. This group also contains
           the chicken neuron-glia cell adhesion molecule, Ng-CAM.
          Length = 71

 Score = 34.1 bits (78), Expect = 0.030
 Identities = 19/75 (25%), Positives = 28/75 (37%), Gaps = 12/75 (16%)

Query: 388 TLECEVEAFPLSVRYWEFIDGTLIEHDGVKYSISDVDKGSYQYIMQ-LNISNVNISDFDT 446
            LEC  E  P    +W+ IDG L             ++       + L + NV  SD   
Sbjct: 2   VLECIAEGLPTPEVHWDRIDGPLSP-----------NRTKKLNNNKTLQLDNVLESDDGE 50

Query: 447 YRCISKNEVDIAKGS 461
           Y C ++N    A+  
Sbjct: 51  YVCTAENSEGSARHH 65


>gnl|CDD|143201 cd05724, Ig2_Robo, Second immunoglobulin (Ig)-like domain in Robo
           (roundabout) receptors.  Ig2_Robo: domain similar to the
           second immunoglobulin (Ig)-like domain in Robo
           (roundabout) receptors. Robo receptors play a role in
           the development of the central nervous system (CNS), and
           are receptors of Slit protein. Slit is a repellant
           secreted by the neural cells in the midline. Slit acts
           through Robo to prevent most neurons from crossing the
           midline from either side. Three mammalian Robo homologs
           (robo1, -2, and -3), and three mammalian Slit homologs
           (Slit-1, -2, -3), have been identified. Commissural
           axons, which cross the midline, express low levels of
           Robo; longitudinal axons, which avoid the midline,
           express high levels of Robo. robo1, -2, and -3 are
           expressed by commissural neurons in the vertebrate
           spinal cord and Slits 1, -2, -3 are expressed at the
           ventral midline. Robo-3 is a divergent member of the
           Robo family which instead of being a positive regulator
           of slit responsiveness, antagonizes slit responsiveness
           in precrossing axons.  The Slit-Robo interaction is
           mediated by the second leucine-rich repeat (LRR) domain
           of Slit and the two N-terminal Ig domains of Robo, Ig1
           and Ig2. The primary Robo binding site for Slit2 has
           been shown by surface plasmon resonance experiments and
           mutational analysis to be the Ig1 domain, while the
           Ig2 domain has been proposed to harbor a weak secondary
           binding site.
          Length = 86

 Score = 34.3 bits (79), Expect = 0.034
 Identities = 23/92 (25%), Positives = 37/92 (40%), Gaps = 12/92 (13%)

Query: 299 SPIVVREGDNVRLRCVA-TGHPKPSIIWKTSENKPITLGSWKDSAVASSTLNFTRINRIN 357
           S   V  G+   L C    GHP+P++ W+  + +P+ L + +   V    L      + +
Sbjct: 4   SDTQVAVGEMAVLECSPPRGHPEPTVSWR-KDGQPLNLDNERVRIVDDGNLLIAEARKSD 62

Query: 358 MGAYMCLADNVPPQILIRNRRVGVPIGRTATL 389
            G Y C+A N+          VG      A L
Sbjct: 63  EGTYKCVATNM----------VGERESAAARL 84



 Score = 31.2 bits (71), Expect = 0.36
 Identities = 24/83 (28%), Positives = 33/83 (39%), Gaps = 10/83 (12%)

Query: 374 IRNRRVGVPIGRTATLECEV-EAFPLSVRYWEFIDGTLIEHDGVKYSISDVDKGSYQYIM 432
           +      V +G  A LEC      P     W   DG  +  D  +  I  VD G+     
Sbjct: 1   VEPSDTQVAVGEMAVLECSPPRGHPEPTVSWR-KDGQPLNLDNERVRI--VDDGN----- 52

Query: 433 QLNISNVNISDFDTYRCISKNEV 455
            L I+    SD  TY+C++ N V
Sbjct: 53  -LLIAEARKSDEGTYKCVATNMV 74


>gnl|CDD|143259 cd05851, Ig3_Contactin-1, Third Ig domain of contactin-1.
           Ig3_Contactin-1: Third Ig domain of the neural cell
           adhesion molecule contactin-1. Contactins are comprised
           of six Ig domains followed by four fibronectin type III
           (FnIII) domains anchored to the membrane by
           glycosylphosphatidylinositol. Contactin-1 is
           differentially expressed in tumor tissues and may,
           through a RhoA mechanism, facilitate invasion and
           metastasis of human lung adenocarcinoma.
          Length = 88

 Score = 34.2 bits (78), Expect = 0.038
 Identities = 22/64 (34%), Positives = 31/64 (48%), Gaps = 5/64 (7%)

Query: 306 GDNVRLRCVATGHPKPSIIW-KTSENKPITLGSWKDSAVASSTLNFTRINRINMGAYMCL 364
           G NV L C A G+P P I W K  E  P T     + +++ + L    I   + G Y C 
Sbjct: 16  GQNVTLECFALGNPVPVIRWRKILEPMPAT----AEISMSGAVLKIFNIQPEDEGTYECE 71

Query: 365 ADNV 368
           A+N+
Sbjct: 72  AENI 75


>gnl|CDD|143241 cd05764, Ig_2, Subgroup of the immunoglobulin (Ig) superfamily.
           Ig_2: subgroup of the immunoglobulin (Ig) domain found
           in the Ig superfamily. The Ig superfamily is a
           heterogenous group of proteins, built on a common fold
           comprised of a sandwich of two beta sheets. Members of
           the Ig superfamily are components of immunoglobulin,
           neuroglia, cell surface glycoproteins, such as T-cell
           receptors, CD2, CD4, CD8, and membrane glycoproteins,
           such as butyrophilin and chondroitin sulfate
           proteoglycan core protein. A predominant feature of most
           Ig domains is a disulfide bridge connecting the two
           beta-sheets with a tryptophan residue packed against the
           disulfide bond.
          Length = 74

 Score = 33.2 bits (76), Expect = 0.067
 Identities = 19/62 (30%), Positives = 31/62 (50%), Gaps = 1/62 (1%)

Query: 306 GDNVRLRCVATGHPKPSIIWKTSENKPITLGSWKDSAVASSTLNFTRINRINMGAYMCLA 365
           G    LRC A G P+P+I W + + K I+  S +     + TL+       + G++ C+A
Sbjct: 1   GQRATLRCKARGDPEPAIHWISPDGKLISNSS-RTLVYDNGTLDILITTVKDTGSFTCIA 59

Query: 366 DN 367
            N
Sbjct: 60  SN 61


>gnl|CDD|143213 cd05736, Ig2_Follistatin_like, Second immunoglobulin (Ig)-like
           domain of a follistatin-like molecule encoded by the
           Mahya gene and similar proteins.  Ig2_Follistatin_like:
           domain similar to the second immunoglobulin (Ig)-like
           domain found in a follistatin-like molecule encoded by
           the CNS-related Mahya gene. Mahya genes have been
           retained in certain Bilaterian branches during
           evolution.  They are conserved in Hymenoptera and
           Deuterostomes, but are absent from other metazoan
           species such as fruit fly and nematode. Mahya proteins
           are secretory, with a follistatin-like domain
           (Kazal-type serine/threonine protease inhibitor domain
           and EF-hand calcium-binding domain), two Ig-like
           domains, and a novel C-terminal domain. Mahya may be
           involved in learning and memory and in processing of
           sensory information in Hymenoptera and vertebrates.
           Follistatin is a secreted, multidomain protein that
           binds activins with high affinity and antagonizes their
           signaling.
          Length = 76

 Score = 33.0 bits (75), Expect = 0.074
 Identities = 27/83 (32%), Positives = 39/83 (46%), Gaps = 9/83 (10%)

Query: 387 ATLECEVEAFPLSVRYWEFIDGTLIEHDGVKYSISDVDKGSYQYIMQLNISNVNISDFDT 446
           A+L C  E  PL    W   +G  I     K  ++ +  GS     +L+ISNV   D   
Sbjct: 1   ASLRCHAEGIPLPRLTW-LKNGMDITPKLSK-QLTLIANGS-----ELHISNVRYEDTGA 53

Query: 447 YRCISKNEVDIAK--GSLILFES 467
           Y CI+KNE  + +   SL + +S
Sbjct: 54  YTCIAKNEAGVDEDISSLFVEDS 76



 Score = 32.2 bits (73), Expect = 0.15
 Identities = 18/58 (31%), Positives = 25/58 (43%), Gaps = 1/58 (1%)

Query: 311 LRCVATGHPKPSIIW-KTSENKPITLGSWKDSAVASSTLNFTRINRINMGAYMCLADN 367
           LRC A G P P + W K   +    L          S L+ + +   + GAY C+A N
Sbjct: 3   LRCHAEGIPLPRLTWLKNGMDITPKLSKQLTLIANGSELHISNVRYEDTGAYTCIAKN 60


>gnl|CDD|143208 cd05731, Ig3_L1-CAM_like, Third immunoglobulin (Ig)-like domain of
           the L1 cell adhesion molecule (CAM).  Ig3_L1-CAM_like:
           domain similar to the third immunoglobulin (Ig)-like
           domain of the L1 cell adhesion molecule (CAM). L1
           belongs to the L1 subfamily of cell adhesion molecules
           (CAMs) and is comprised of an extracellular region
           having six Ig-like domains and five fibronectin type III
           domains, a transmembrane region and an intracellular
           domain. L1 is primarily expressed in the nervous system
           and is involved in its development and function. L1 is
           associated with an X-linked recessive disorder, X-linked
           hydrocephalus, MASA syndrome, or spastic paraplegia type
           1, that involves abnormalities of axonal growth. This
           group also contains the chicken neuron-glia cell
           adhesion molecule, Ng-CAM and human neurofascin.
          Length = 71

 Score = 32.3 bits (74), Expect = 0.12
 Identities = 20/74 (27%), Positives = 28/74 (37%), Gaps = 12/74 (16%)

Query: 388 TLECEVEAFPLSVRYWEFIDGTLIEHDGVKYSISDVDKGSYQYIMQ-LNISNVNISDFDT 446
            LEC  E  P     W  I G L             D+  ++   + L I NV+  D   
Sbjct: 2   LLECIAEGLPTPEISWIKIGGELPA-----------DRTKFENFNKTLKIDNVSEEDDGE 50

Query: 447 YRCISKNEVDIAKG 460
           YRC + N +  A+ 
Sbjct: 51  YRCTASNSLGSARH 64


>gnl|CDD|143276 cd05868, Ig4_NrCAM, Fourth immunoglobulin (Ig)-like domain of NrCAM
           (NgCAM-related cell adhesion molecule).  Ig4_NrCAM:
           fourth immunoglobulin (Ig)-like domain of NrCAM
           (NgCAM-related cell adhesion molecule). NrCAM belongs to
           the L1 subfamily of cell adhesion molecules (CAMs) and
           is comprised of an extracellular region having six
           Ig-like domains and five fibronectin type III domains, a
           transmembrane region and an intracellular domain. NrCAM
           is primarily expressed in the nervous system.
          Length = 76

 Score = 32.3 bits (73), Expect = 0.15
 Identities = 22/63 (34%), Positives = 32/63 (50%), Gaps = 2/63 (3%)

Query: 306 GDNVRLRCVATGHPKPSIIWKTSENKPITLGSWKDS-AVASSTLNFTRINRINMGAYMCL 364
           G++  L C A G+PKPSI W T+   PI +     S  V   T+ F+++   +   Y C 
Sbjct: 1   GEDGTLICRANGNPKPSISWLTN-GVPIEIAPTDPSRKVDGDTIIFSKVQERSSAVYQCN 59

Query: 365 ADN 367
           A N
Sbjct: 60  ASN 62


>gnl|CDD|143240 cd05763, Ig_1, Subgroup of the immunoglobulin (Ig) superfamily.
           Ig_1: subgroup of the immunoglobulin (Ig) domain found
           in the Ig superfamily. The Ig superfamily is a
           heterogenous group of proteins, built on a common fold
           comprised of a sandwich of two beta sheets. Members of
           the Ig superfamily are components of immunoglobulin,
           neuroglia, cell surface glycoproteins, such as T-cell
           receptors, CD2, CD4, CD8, and membrane glycoproteins,
           such as butyrophilin and chondroitin sulfate
           proteoglycan core protein. A predominant feature of most
           Ig domains is a disulfide bridge connecting the two
           beta-sheets with a tryptophan residue packed against the
           disulfide bond.
          Length = 75

 Score = 32.2 bits (73), Expect = 0.15
 Identities = 11/17 (64%), Positives = 12/17 (70%)

Query: 310 RLRCVATGHPKPSIIWK 326
           RL C ATGHP P I W+
Sbjct: 2   RLECAATGHPTPQIAWQ 18


>gnl|CDD|143170 cd04969, Ig5_Contactin_like, Fifth Ig domain of contactin.
           Ig5_Contactin_like: Fifth Ig domain of contactins.
           Contactins are neural cell adhesion molecules and are
           comprised of six Ig domains followed by four fibronectin
           type III (FnIII) domains anchored to the membrane by
           glycosylphosphatidylinositol. The first four Ig domains
           form the intermolecular binding fragment, which arranges
           as a compact U-shaped module via contacts between Ig
           domains 1 and 4, and between Ig domains 2 and 3.
           Contactin-2 (TAG-1, axonin-1) may play a part in the
           neuronal processes of neurite outgrowth, axon guidance
           and fasciculation, and neuronal migration. This group
           also includes contactin-1 and contactin-5. The different
           contactins show different expression patterns in the
           central nervous system. During development and in
           adulthood, contactin-2 is transiently expressed in
           subsets of central and peripheral neurons. Contactin-5
           is expressed specifically in the rat postnatal nervous
           system, peaking at about 3 weeks postnatal, and a lack
           of contactin-5 (NB-2) results in an impairment of
           neuronal activity in the rat auditory system.
           Contactin-5 is highly expressed in the adult human brain
           in the occipital lobe and in the amygdala. Contactin-1
           is differentially expressed in tumor tissues and may,
           through a RhoA mechanism, facilitate invasion and
           metastasis of human lung adenocarcinoma.
          Length = 73

 Score = 32.0 bits (73), Expect = 0.18
 Identities = 21/81 (25%), Positives = 27/81 (33%), Gaps = 10/81 (12%)

Query: 384 GRTATLECEVEAFPLSVRYWEFIDGTLIEHDGVKYSISDVDKGSYQYIMQLNISNVNISD 443
           G    +EC+ +A P     W      L         I     GS      L I NV  SD
Sbjct: 1   GGDVIIECKPKAAPKPTISWSKGTELLTNSS----RICIWPDGS------LEILNVTKSD 50

Query: 444 FDTYRCISKNEVDIAKGSLIL 464
              Y C ++N    A  +  L
Sbjct: 51  EGKYTCFAENFFGKANSTGSL 71



 Score = 30.1 bits (68), Expect = 0.74
 Identities = 16/63 (25%), Positives = 27/63 (42%), Gaps = 2/63 (3%)

Query: 306 GDNVRLRCVATGHPKPSIIWKTSENKPITLGSWKDSAVASSTLNFTRINRINMGAYMCLA 365
           G +V + C     PKP+I W  S+   +   S +       +L    + + + G Y C A
Sbjct: 1   GGDVIIECKPKAAPKPTISW--SKGTELLTNSSRICIWPDGSLEILNVTKSDEGKYTCFA 58

Query: 366 DNV 368
           +N 
Sbjct: 59  ENF 61


>gnl|CDD|222306 pfam13670, PepSY_2, Peptidase propeptide and YPEB domain.  This
           region is likely to have a protease inhibitory function
           (personal obs:C Yeats). The name is derived from
           Peptidase & Bacillus subtilis YPEB.
          Length = 83

 Score = 32.2 bits (74), Expect = 0.19
 Identities = 13/43 (30%), Positives = 20/43 (46%), Gaps = 1/43 (2%)

Query: 678 KLYKTGYNVMDFSLDENGLWVIYGLS-NNNTAVTKLDTATLQI 719
           KL   GY V    +D+ G + +YG   +       LD AT ++
Sbjct: 37  KLEAQGYKVKKIEVDDGGCYEVYGRDKDGKKVEIYLDPATGEV 79


>gnl|CDD|143220 cd05743, Ig_Perlecan_D2_like, Immunoglobulin (Ig)-like domain II
           (D2) of the human basement membrane heparan sulfate
           proteoglycan perlecan, also known as HSPG2.
           Ig_Perlecan_D2_like: the immunoglobulin (Ig)-like domain
           II (D2) of the human basement membrane heparan sulfate
           proteoglycan perlecan, also known as HSPG2. Perlecan
           consists of five domains. Domain I has three putative
           heparan sulfate attachment sites; domain II has four LDL
           receptor-like repeats, and one Ig-like repeat; domain
           III resembles the short arm of laminin chains; domain IV
           has multiple Ig-like repeats (21 repeats in human
           perlecan); and domain V resembles the globular G domain
           of the laminin A chain and internal repeats of EGF.
           Perlecan may participate in a variety of biological
           functions including cell binding, LDL-metabolism,
           basement membrane assembly and selective permeability,
           calcium binding, and growth- and neurite-promoting
           activities.
          Length = 78

 Score = 31.7 bits (72), Expect = 0.23
 Identities = 24/70 (34%), Positives = 29/70 (41%), Gaps = 13/70 (18%)

Query: 306 GDNVRLRCVATGHPKPSIIWKTSENKPITLGSWKDSAVASS-------TLNFTRINRINM 358
           G+ V   CVATG P P I W+      +  G   DSA  S        TL    +   + 
Sbjct: 1   GETVEFTCVATGVPTPIINWR------LNWGHVPDSARVSITSEGGYGTLTIRDVKESDQ 54

Query: 359 GAYMCLADNV 368
           GAY C A N 
Sbjct: 55  GAYTCEAINT 64


>gnl|CDD|143199 cd05722, Ig1_Neogenin, First immunoglobulin (Ig)-like domain in
           neogenin and similar proteins.  Ig1_Neogenin: first
           immunoglobulin (Ig)-like domain in neogenin and related
           proteins. Neogenin is a cell surface protein which is
           expressed in the developing nervous system of vertebrate
           embryos in the growing nerve cells. It is also expressed
           in other embryonic tissues, and may play a general role
           in developmental processes such as cell migration,
           cell-cell recognition, and tissue growth regulation.
           Included in this group is the tumor suppressor protein
           DCC, which is deleted in colorectal carcinoma. DCC and
           neogenin each have four Ig-like domains followed by six
           fibronectin type III domains, a transmembrane domain,
           and an intracellular domain.
          Length = 95

 Score = 32.1 bits (73), Expect = 0.24
 Identities = 31/106 (29%), Positives = 42/106 (39%), Gaps = 26/106 (24%)

Query: 296 FSSSP--IVVREGDNVRLRCVATGHPKPSIIWKTS----------ENKPITLGSWKDSAV 343
           F S P  IV   G  V L C A G P P I WK              + +  GS   ++V
Sbjct: 2   FLSEPSDIVAVRGGPVVLNCSAEGEPPPKIEWKKDGVLLNLVSDERRQQLPNGSLLITSV 61

Query: 344 ASSTLNFTRINRINMGAYMCLADNVPPQILIRNRRVGVPIGRTATL 389
             S     + N+ + G Y C+A         +N  +G  + RTA L
Sbjct: 62  VHS-----KHNKPDEGFYQCVA---------QNDSLGSIVSRTARL 93


>gnl|CDD|143224 cd05747, Ig5_Titin_like, M5, fifth immunoglobulin (Ig)-like domain
           of human titin C terminus and similar proteins.
           Ig5_Titin_like: domain similar to the M5, fifth
           immunoglobulin (Ig)-like domain from the human titin C
           terminus. Titin (also called connectin) is a fibrous
           sarcomeric protein specifically found in vertebrate
           striated muscle. Titin is gigantic; depending on isoform
           composition it ranges from 2970 to 3700 kDa, and is of a
           length that spans half a sarcomere. Titin largely
           consists of multiple repeats of Ig-like and fibronectin
           type 3 (FN-III)-like domains. Titin connects the ends of
           myosin thick filaments to Z disks and extends along the
           thick filament to the H zone, and appears to function
           similar to an elastic band, keeping the myosin filaments
           centered in the sarcomere during muscle contraction or
           stretching.
          Length = 92

 Score = 31.6 bits (71), Expect = 0.34
 Identities = 18/74 (24%), Positives = 32/74 (43%), Gaps = 13/74 (17%)

Query: 301 IVVREGDNVRLRCVATGHPKPSIIW-------KTSENKPITLGSWKDSAVASSTLNFTRI 353
           + V EG++ R  C   G P P++ W        +S+   IT   +K      ST   +++
Sbjct: 13  LTVSEGESARFSCDVDGEPAPTVTWMREGQIIVSSQRHQITSTEYK------STFEISKV 66

Query: 354 NRINMGAYMCLADN 367
              + G Y  + +N
Sbjct: 67  QMSDEGNYTVVVEN 80


>gnl|CDD|143173 cd04972, Ig_TrkABC_d4, Fourth domain (immunoglobulin-like) of Trk
           receptors TrkA, TrkB and TrkC.  TrkABC_d4: the fourth
           domain of Trk receptors TrkA, TrkB and TrkC, this is an
           immunoglobulin (Ig)-like domain which binds to
           neurotrophin. The Trk family of receptors are tyrosine
           kinase receptors. They are activated by dimerization,
           leading to autophosphorylation of intracellular tyrosine
           residues, and triggering the signal transduction
           pathway. TrkA, TrkB, and TrkC share significant sequence
           homology and domain organization. The first three
           domains are leucine-rich domains. The fourth and fifth
           domains are Ig-like domains playing a part in ligand
           binding. TrkA, B, and C mediate the trophic effects of the
           neurotrophin Nerve growth factor (NGF) family. TrkA is
           recognized by NGF. TrkB is recognized by brain-derived
           neurotrophic factor (BDNF) and neurotrophin (NT)-4. TrkC
           is recognized by NT-3. NT-3 is promiscuous as in some
           cell systems it activates TrkA and TrkB receptors. TrkA
           is a receptor found in all major NGF targets, including
           the sympathetic, trigeminal, and dorsal root ganglia,
           cholinergic neurons of the basal forebrain and the
           striatum. TrkB transcripts are found throughout multiple
           structures of the central and peripheral nervous
           systems. The TrkC gene is expressed throughout the
           mammalian nervous system.
          Length = 90

 Score = 31.3 bits (71), Expect = 0.42
 Identities = 19/69 (27%), Positives = 26/69 (37%), Gaps = 1/69 (1%)

Query: 301 IVVREGDNVRLRCVATGHPKPSIIWKTSENKPI-TLGSWKDSAVASSTLNFTRINRINMG 359
            VV EG    +RC A G P P + W  +    I T     ++ V    L  + I      
Sbjct: 10  TVVYEGGTATIRCTAEGSPLPKVEWIIAGLIVIQTRTDTLETTVDIYNLQLSNITSETQT 69

Query: 360 AYMCLADNV 368
              C A+N 
Sbjct: 70  TVTCTAENP 78



 Score = 30.6 bits (69), Expect = 0.77
 Identities = 23/81 (28%), Positives = 32/81 (39%), Gaps = 7/81 (8%)

Query: 384 GRTATLECEVEAFPLSVRYWEFIDGTLIEHDGVKYSISDVDKGSYQYIMQLNISNVNISD 443
           G TAT+ C  E  PL    W  I G LI       ++          I  L +SN+    
Sbjct: 15  GGTATIRCTAEGSPLPKVEW-IIAG-LIVIQTRTDTLETTVD-----IYNLQLSNITSET 67

Query: 444 FDTYRCISKNEVDIAKGSLIL 464
             T  C ++N V  A  S+ +
Sbjct: 68  QTTVTCTAENPVGQANVSVQV 88


>gnl|CDD|143301 cd05893, Ig_Palladin_C, C-terminal immunoglobulin (Ig)-like domain
           of palladin.  Ig_Palladin_C: C-terminal immunoglobulin
           (Ig)-like domain of palladin. Palladin belongs to the
           palladin-myotilin-myopalladin family. Proteins belonging
           to this family contain multiple Ig-like domains and
           function as scaffolds, modulating actin cytoskeleton.
           Palladin binds to alpha-actinin, ezrin,
           vasodilator-stimulated phosphoprotein VASP, SPIN90 (DIP,
           mDia interacting protein), and Src. Palladin also binds
           F-actin directly, via its Ig3 domain. Palladin is
           expressed as several alternatively spliced isoforms,
           having various combinations of Ig-like domains, in a
           cell-type-specific manner. It has been suggested that
           palladin's different Ig-like domains may be specialized
           for distinct functions.
          Length = 75

 Score = 30.8 bits (69), Expect = 0.46
 Identities = 13/26 (50%), Positives = 16/26 (61%), Gaps = 1/26 (3%)

Query: 309 VRLRCVATGHPKPSIIWKTSENKPIT 334
           VRL C  +G P P I WK  EN+ +T
Sbjct: 1   VRLECRVSGVPHPQIFWK-KENESLT 25


>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
           non-fungal.  The approx. 70 residue Med15 domain of the
           ARC-Mediator co-activator is a three-helix bundle with
           marked similarity to the KIX domain. The sterol
           regulatory element binding protein (SREBP) family of
           transcription activators use the ARC105 subunit to
           activate target genes in the regulation of cholesterol
           and fatty acid homeostasis. In addition, Med15 is a
           critical transducer of gene activation signals that
           control early metazoan development.
          Length = 768

 Score = 33.4 bits (76), Expect = 0.59
 Identities = 35/164 (21%), Positives = 45/164 (27%), Gaps = 15/164 (9%)

Query: 103 CNSTKEYCPPGLQGPIGQKGEIGAKGEMGVKGERGEKGAPGVVGPPGPQGIPGDEGKRGP 162
             S K      LQG  G    I A   +  +G RG +  P   GP  P G      + G 
Sbjct: 52  DMSKKAAQQQVLQGGQGMPDPINALQNLTGQGTRGPQMGPMGPGPGRPMG-----QQMGG 106

Query: 163 KGDMGYPG------PQGIPGDEGKRGPKGDMGYPGRIGLDGRDGLPGEPGLDGIPGRNGM 216
            G             Q   G  G  GP       G +   G+ G        G P  +  
Sbjct: 107 PGTASNLLQSLNVRGQMPMGAAGM-GPHQMSRV-GTMQPGGQAG-GMMQQSSGQPQ-SQQ 162

Query: 217 DGIPGKDGLPGKDGIPGTNGTNGEKGMTGPRGNRGKSGINGVPG 260
               G      +    G N               G+ G+ G  G
Sbjct: 163 PNQMGPQQGQAQGQAGGMNQGQQGPVGQQQPPQMGQPGMPGGGG 206


>gnl|CDD|143950 pfam00187, Chitin_bind_1, Chitin recognition protein. 
          Length = 38

 Score = 29.3 bits (66), Expect = 0.74
 Identities = 9/17 (52%), Positives = 12/17 (70%)

Query: 101 GFCNSTKEYCPPGLQGP 117
           G+C ST++YC PG Q  
Sbjct: 21  GWCGSTEDYCGPGCQSQ 37


>gnl|CDD|143265 cd05857, Ig2_FGFR, Second immunoglobulin (Ig)-like domain of
           fibroblast growth factor (FGF) receptor.  Ig2_FGFR:
           second immunoglobulin (Ig)-like domain of fibroblast
           growth factor (FGF) receptor. FGF receptors bind FGF
           signaling polypeptides. FGFs participate in multiple
           processes such as morphogenesis, development, and
           angiogenesis. FGFs bind to four FGF receptor tyrosine
           kinases (FGFR1, -2, -3, -4). Receptor diversity is
           controlled by alternative splicing producing splice
           variants with different ligand binding characteristics
           and different expression patterns. FGFRs have an
           extracellular region comprised of three IG-like domains,
           a single transmembrane helix, and an intracellular
           tyrosine kinase domain. Ligand binding and specificity
           reside in the Ig-like domains 2 and 3, and the linker
           region that connects these two. FGFR activation and
           signaling depend on FGF-induced dimerization, a process
           involving cell surface heparin or heparan sulfate
           proteoglycans.
          Length = 85

 Score = 30.2 bits (68), Expect = 0.98
 Identities = 19/79 (24%), Positives = 30/79 (37%), Gaps = 26/79 (32%)

Query: 303 VREGDNVRLRCVATGHPKPSIIWKTSENKPIT----LGSWK----------DSAVASSTL 348
           V   + V+ RC A G+P P++ W     K       +G +K          +S V S   
Sbjct: 6   VPAANTVKFRCPAAGNPTPTMRW-LKNGKEFKQEHRIGGYKVRNQHWSLIMESVVPS--- 61

Query: 349 NFTRINRINMGAYMCLADN 367
                   + G Y C+ +N
Sbjct: 62  --------DKGNYTCVVEN 72


>gnl|CDD|131020 TIGR01965, VCBS_repeat, VCBS repeat.  This domain of about 100
           residues is found in multiple (up to 35) copies in long
           proteins from several species of Vibrio, Colwellia,
           Bradyrhizobium, and Shewanella (hence the name VCBS) and
           in smaller copy numbers in proteins from several other
           bacteria. The large protein size and repeat copy
           numbers, species distribution, and suggested activities
           of several member proteins suggest a role for this
           domain in adhesion.
          Length = 99

 Score = 30.4 bits (69), Expect = 1.1
 Identities = 14/38 (36%), Positives = 18/38 (47%), Gaps = 8/38 (21%)

Query: 688 DFSLDENGLWVIYGLSNNNTAV-------TKLDTATLQ 718
            FS+D +G W  Y   N+ TAV       T  DT T+ 
Sbjct: 28  TFSIDADGQWT-YQADNSQTAVQALKAGETLTDTFTVT 64


>gnl|CDD|143167 cd00099, IgV, Immunoglobulin variable domain (IgV).  IgV:
           Immunoglobulin variable domain (IgV). Members of the IgV
           family are components of immunoglobulin (Ig) and T cell
           receptors. The basic structure of Ig molecules is a
           tetramer of two light chains and two heavy chains linked
           by disulfide bonds. In Ig, each chain is composed of one
           variable domain (IgV) and one or more constant domains
           (IgC); these names reflect the fact that the variability
           in sequences is higher in the variable domain than in
           the constant domain. Within the variable domain, there
           are regions of even more variability called the
           hypervariable or complementarity-determining regions
           (CDRs) which are responsible for antigen binding. A
           predominant feature of most Ig domains is the disulfide
           bridge connecting 2 beta-sheets with a tryptophan
           residue packed against the disulfide bond.
          Length = 105

 Score = 30.4 bits (69), Expect = 1.1
 Identities = 19/85 (22%), Positives = 27/85 (31%), Gaps = 19/85 (22%)

Query: 381 VPIGRTATLECEVEAFPLSVR-YW---------EFI----DGTLIEHDGVK--YSISDVD 424
           V  G + TL C       S   +W         E +            GVK  +S +  D
Sbjct: 3   VSEGESVTLSCTYSGSFSSYYIFWYRQKPGKGPELLIYISSNGSQYAGGVKGRFSGT-RD 61

Query: 425 KGSYQYIMQLNISNVNISDFDTYRC 449
                +   L IS++   D   Y C
Sbjct: 62  SSKSSFT--LTISSLQPEDSAVYYC 84


>gnl|CDD|184923 PRK14959, PRK14959, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 624

 Score = 32.7 bits (74), Expect = 1.1
 Identities = 21/93 (22%), Positives = 24/93 (25%), Gaps = 10/93 (10%)

Query: 143 GVVGPPGPQGIPGDEGKRGPKGDMGYPGPQGIPGDEGKRGPKGDMGYPGRIGLDGR---- 198
           G    P      G     G    +  PG QG  G     G       P            
Sbjct: 376 GGASAPSGSAAEG--PASGGAATIPTPGTQGPQGTAPAAGMTPSSAAPATPAPSAAPSPR 433

Query: 199 ---DGLPGEPGLDGIPGRNGMDGIPGKDGLPGK 228
              D  P  P   GIP R     +P    +PG 
Sbjct: 434 VPWDDAPPAPPRSGIPPR-PAPRMPEASPVPGA 465


>gnl|CDD|237886 PRK15042, pduD, propanediol dehydratase medium subunit;
           Provisional.
          Length = 219

 Score = 31.9 bits (73), Expect = 1.2
 Identities = 16/37 (43%), Positives = 18/37 (48%), Gaps = 7/37 (18%)

Query: 443 DFDTYRCISKNEVDIAKGSLILFESNAKTPVPTEKEQ 479
             +TYR I KN    AKG     ES    PVPT  +Q
Sbjct: 156 TLETYRAIGKNAARYAKG-----ES--PQPVPTLNDQ 185


>gnl|CDD|240291 PTZ00146, PTZ00146, fibrillarin; Provisional.
          Length = 293

 Score = 32.0 bits (73), Expect = 1.2
 Identities = 17/58 (29%), Positives = 18/58 (31%)

Query: 136 RGEKGAPGVVGPPGPQGIPGDEGKRGPKGDMGYPGPQGIPGDEGKRGPKGDMGYPGRI 193
               G  G  G     G  G  G  G  G  G    +G  G  G RG  G  G    I
Sbjct: 1   GMGGGFGGGRGGGRGGGGGGGRGGGGRGGGRGGGRGRGRGGGGGGRGGGGGGGPGKVI 58



 Score = 31.2 bits (71), Expect = 2.4
 Identities = 19/60 (31%), Positives = 22/60 (36%)

Query: 133 KGERGEKGAPGVVGPPGPQGIPGDEGKRGPKGDMGYPGPQGIPGDEGKRGPKGDMGYPGR 192
            G  G +G     G  G +G  G  G RG     G  G  G  G  G  GP   +  P R
Sbjct: 4   GGFGGGRGGGRGGGGGGGRGGGGRGGGRGGGRGRGRGGGGGGRGGGGGGGPGKVIVVPHR 63


>gnl|CDD|143242 cd05765, Ig_3, Subgroup of the immunoglobulin (Ig) superfamily.
           Ig_3: subgroup of the immunoglobulin (Ig) domain found
           in the Ig superfamily. The Ig superfamily is a
           heterogeneous group of proteins, built on a common fold
           comprised of a sandwich of two beta sheets. Members of
           the Ig superfamily are components of immunoglobulin,
           neuroglia, cell surface glycoproteins, such as T-cell
           receptors, CD2, CD4, CD8, and membrane glycoproteins,
           such as butyrophilin and chondroitin sulfate
           proteoglycan core protein. A predominant feature of most
           Ig domains is a disulfide bridge connecting the two
           beta-sheets with a tryptophan residue packed against the
           disulfide bond.
          Length = 81

 Score = 29.4 bits (66), Expect = 1.5
 Identities = 8/21 (38%), Positives = 10/21 (47%)

Query: 306 GDNVRLRCVATGHPKPSIIWK 326
           G+     C  TG P P I W+
Sbjct: 1   GETASFHCDVTGRPPPEITWE 21


>gnl|CDD|143211 cd05734, Ig7_DSCAM, Seventh immunoglobulin (Ig)-like domain of Down
           Syndrome Cell Adhesion molecule (DSCAM).  Ig7_DSCAM: the
           seventh immunoglobulin (Ig)-like domain of Down Syndrome
           Cell Adhesion molecule (DSCAM). DSCAM is a cell adhesion
           molecule expressed largely in the developing nervous
           system. The gene encoding DSCAM is located at human
           chromosome 21q22, the locus associated with the mental
           retardation phenotype of Down Syndrome. DSCAM is
           predicted to be the largest member of the IG
           superfamily. It has been demonstrated that DSCAM can
           mediate cation-independent homophilic intercellular
           adhesion.
          Length = 79

 Score = 29.1 bits (65), Expect = 1.8
 Identities = 16/65 (24%), Positives = 30/65 (46%), Gaps = 6/65 (9%)

Query: 309 VRLRCVATGHPKPSIIWKTSENK------PITLGSWKDSAVASSTLNFTRINRINMGAYM 362
           V L C A G+P P+I+WK S+ +           + +   +++ +L    +   + G Y+
Sbjct: 1   VTLNCSAEGYPPPTIVWKHSKGRGHPQHTHTCCLAGRIQLLSNGSLLIKHVLEEDSGYYL 60

Query: 363 CLADN 367
           C   N
Sbjct: 61  CKVSN 65


>gnl|CDD|111993 pfam03157, Glutenin_hmw, High molecular weight glutenin subunit.
           Members of this family include high molecular weight
           subunits of glutenin. This group of gluten proteins is
           thought to be largely responsible for the elastic
           properties of gluten, and hence, doughs. Indeed,
           glutenin high molecular weight subunits are classified
           as elastomeric proteins, because the glutenin network
           can withstand significant deformations without breaking,
           and return to the original conformation when the stress
           is removed. Elastomeric proteins differ considerably in
           amino acid sequence, but they are all polymers whose
           subunits consist of elastomeric domains, composed of
           repeated motifs, and non-elastic domains that mediate
           cross-linking between the subunits. The elastomeric
           domain motifs are all rich in glycine residues in
           addition to other hydrophobic residues. High molecular
           weight glutenin subunits have an extensive central
           elastomeric domain, flanked by two terminal non-elastic
           domains that form disulphide cross-links. The central
           elastomeric domain is characterized by the following
           three repeated motifs: PGQGQQ, GYYPTS[P/L]QQ, GQQ. It
           possesses overlapping beta-turns within and between the
           repeated motifs, and assumes a regular helical secondary
           structure with a diameter of approx. 1.9 nm and a pitch
           of approx. 1.5 nm.
          Length = 779

 Score = 32.0 bits (71), Expect = 1.8
 Identities = 36/151 (23%), Positives = 56/151 (37%), Gaps = 6/151 (3%)

Query: 109 YCPPGLQGP-IGQKGEIGAKGEMGVKGERGEKGAPGVVGPPGPQGIPGDEGKRGPKGDMG 167
           + P  LQ P  GQ+     +   G  G++  +G  G       QG    +G++G +   G
Sbjct: 478 HDPASLQQPGQGQQPGQLQQPAQGQPGQQLAQGQQGQQPAQVQQGQQPAQGQQGQQLGQG 537

Query: 168 YPGPQGIPGDEGKRGPKGDMGYPGRIGLDGRDGLPGEPGLDGI----PGRNGMDGIPGKD 223
             G Q   G    +G +G     G+ G     G     G        P  +G    PG+ 
Sbjct: 538 QQGQQPGQGQHPAQGEQGQQPGQGQQGQQPGQGQQPGQGQPWYYPTSPQESGQGQQPGQW 597

Query: 224 GLPGKDGIPGTNGTNGEKGMTGPRGNRGKSG 254
             PG+   PG  G  G    +  +  +G+ G
Sbjct: 598 QQPGQWQQPG-QGQPGYYLTSPLQLGQGQQG 627


>gnl|CDD|143275 cd05867, Ig4_L1-CAM_like, Fourth immunoglobulin (Ig)-like domain of
           the L1 cell adhesion molecule (CAM).  Ig4_L1-CAM_like:
           fourth immunoglobulin (Ig)-like domain of the L1 cell
           adhesion molecule (CAM). L1 is comprised of an
           extracellular region having six Ig-like domains and five
           fibronectin type III domains, a transmembrane region and
           an intracellular domain. L1 is primarily expressed in
           the nervous system and is involved in its development
           and function. L1 is associated with an X-linked
           recessive disorder, X-linked hydrocephalus, MASA
           syndrome, or spastic paraplegia type 1, that involves
           abnormalities of axonal growth. This group also contains
           the chicken neuron-glia cell adhesion molecule, Ng-CAM.
          Length = 76

 Score = 29.1 bits (65), Expect = 2.0
 Identities = 18/69 (26%), Positives = 25/69 (36%)

Query: 306 GDNVRLRCVATGHPKPSIIWKTSENKPITLGSWKDSAVASSTLNFTRINRINMGAYMCLA 365
           G+  RL C   G P P+I W  +              V+S  L  T +   +   Y C A
Sbjct: 1   GETARLDCQVEGIPTPNITWSINGAPIEGTDPDPRRHVSSGALILTDVQPSDTAVYQCEA 60

Query: 366 DNVPPQILI 374
            N    +L 
Sbjct: 61  RNRHGNLLA 69


>gnl|CDD|143302 cd05894, Ig_C5_MyBP-C, C5 immunoglobulin (Ig) domain of cardiac
           myosin binding protein C (MyBP-C).  Ig_C5_MyBP_C: the
           C5 immunoglobulin (Ig) domain of cardiac myosin binding
           protein C (MyBP-C). MyBP_C consists of repeated domains,
           Ig and fibronectin type 3, and various linkers. Three
           isoforms of MYBP_C exist and are included in this group:
           cardiac (c), and fast and slow skeletal muscle (s)
           MyBP_C. cMYBP_C has insertions between and inside
           domains and an additional cardiac-specific Ig domain at
           the N-terminus. For cMYBP_C, an interaction has been
           demonstrated between this C5 domain and the Ig C8
           domain.
          Length = 86

 Score = 29.0 bits (65), Expect = 2.2
 Identities = 17/68 (25%), Positives = 29/68 (42%), Gaps = 4/68 (5%)

Query: 297 SSSPIVVREGDNVRLRCVATGHPKPSIIWKTSENKPITLGSWK---DSAVASSTLNFTRI 353
           + + IVV  G+ +RL    +G P P++ W   + K  T    +   +S    S+      
Sbjct: 1   AENTIVVVAGNKLRLDVPISGEPAPTVTWSRGD-KAFTETEGRVRVESYKDLSSFVIEGA 59

Query: 354 NRINMGAY 361
            R + G Y
Sbjct: 60  EREDEGVY 67


>gnl|CDD|143231 cd05754, Ig3_Perlecan_like, Third immunoglobulin (Ig)-like domain
           found in Perlecan and similar proteins.
           Ig3_Perlecan_like: domain similar to the third
           immunoglobulin (Ig)-like domain found in Perlecan.
           Perlecan is a large multi-domain heparan sulfate
           proteoglycan, important in tissue development and
           organogenesis.  Perlecan can be represented as 5 major
           portions; its fourth major portion (domain IV) is a
           tandem repeat of immunoglobulin-like domains (Ig2-Ig15),
           which can vary in size due to alternative splicing.
           Perlecan binds many cellular and extracellular ligands.
           Its domain IV region has many binding sites.  Some of
           these have been mapped at the level of individual
           Ig-like domains, including a site restricted to the Ig5
           domain for heparin/sulfatide, a site restricted to the
           Ig3 domain for nidogen-1 and nidogen-2, a site
           restricted to Ig4-5 for fibronectin, and sites
           restricted to Ig2 and to Ig13-15 for fibulin-2.
          Length = 85

 Score = 29.1 bits (65), Expect = 2.7
 Identities = 22/92 (23%), Positives = 31/92 (33%), Gaps = 18/92 (19%)

Query: 377 RRVGVPIGRTATLECEVEAFPLSVRY---WEFIDGTLIEHDGVKYSISDVDKGSYQYIMQ 433
           R   V  G   +  C   A   S  Y   W  + G L                +  +   
Sbjct: 9   RSQEVRPGADVSFICR--AKSKSPAYTLVWTRVGGGL-------------PSRAMDFNGI 53

Query: 434 LNISNVNISDFDTYRCISKNEVDIAKGSLILF 465
           L I NV +SD  TY C   N +D  + +  L+
Sbjct: 54  LTIRNVQLSDAGTYVCTGSNMLDTDEATATLY 85


>gnl|CDD|143223 cd05746, Ig4_Peroxidasin, Fourth immunoglobulin (Ig)-like domain of
           peroxidasin.  Ig4_Peroxidasin: the fourth immunoglobulin
           (Ig)-like domain in peroxidasin. Peroxidasin has a
           peroxidase domain and interacting extracellular motifs
           containing four Ig-like domains. It has been suggested
           that peroxidasin is secreted, and has functions related
           to the stabilization of the extracellular matrix. It may
           play a part in various other important processes such as
           removal and destruction of cells that have undergone
           programmed cell death, and protection of the organism
           against non-self.
          Length = 69

 Score = 28.3 bits (63), Expect = 3.2
 Identities = 17/59 (28%), Positives = 26/59 (44%), Gaps = 2/59 (3%)

Query: 309 VRLRCVATGHPKPSIIWKTSENKPITLGSWKDSAVASSTLNFTRINRINMGAYMCLADN 367
           V++ C A G P+P+I W   +   +T  S K        L    +   + G Y C+A N
Sbjct: 1   VQIPCSAQGDPEPTITW-NKDGVQVT-ESGKFHISPEGYLAIRDVGVADQGRYECVARN 57


>gnl|CDD|143166 cd00098, IgC, Immunoglobulin Constant domain.  IgC: Immunoglobulin
           constant domain (IgC). Members of the IgC family are
           components of immunoglobulin, T-cell receptors, CD1 cell
           surface glycoproteins, secretory glycoproteins A/C, and
           Major Histocompatibility Complex (MHC) class I/II
           molecules. In immunoglobulins, each chain is composed of
           one variable domain (IgV) and one or more IgC domains.
           These names reflect the fact that the variability in
           sequences is higher in the variable domain than in the
           constant domain. The IgV domain is responsible for
           antigen binding, and the IgC domain is involved in
           oligomerization and molecular interactions.
          Length = 95

 Score = 28.6 bits (64), Expect = 4.0
 Identities = 18/68 (26%), Positives = 26/68 (38%), Gaps = 12/68 (17%)

Query: 294 HIFSSSPIVVREGDNVRLRCVATG-HPKP-SIIWKTSENKPITLGSWKDSAV-------- 343
            +   SP     G +V L C+ATG +P   ++ W     K +T G      V        
Sbjct: 2   FLLPPSP-EELLGGSVTLTCLATGFYPPDITVTWL-KNGKELTSGVTTTPPVPNSDGTYS 59

Query: 344 ASSTLNFT 351
            SS L  +
Sbjct: 60  VSSQLTVS 67


>gnl|CDD|118874 pfam10353, DUF2430, Protein of unknown function (DUF2430).  This is
           a family of short, 111 residue, proteins found in S.
           pombe. The function is not known.
          Length = 107

 Score = 28.8 bits (64), Expect = 4.4
 Identities = 15/57 (26%), Positives = 30/57 (52%), Gaps = 2/57 (3%)

Query: 393 VEAFPLSVRYWEFIDGTLIEHDGVKYSISDVDKGSYQYIMQLNISNVNISDFDTYRC 449
           V+   L  +Y E +   + +H  + +  +D+   +Y Y++  NI+ +N +D D Y C
Sbjct: 24  VDYAKLPPKYAELLANAIDQHGLMLFDTADIRIEAYNYLLN-NITEIN-NDTDAYLC 78


>gnl|CDD|219358 pfam07271, Cytadhesin_P30, Cytadhesin P30/P32.  This family
           consists of several Mycoplasma species-specific
           Cytadhesin P32 and P30 proteins. P30 has been found to
           be membrane associated and localised on the tip
           organelle. It is thought that it is important in
           cytadherence and virulence.
          Length = 279

 Score = 30.0 bits (67), Expect = 4.8
 Identities = 29/114 (25%), Positives = 36/114 (31%), Gaps = 11/114 (9%)

Query: 146 GPPGPQGIPGDEGKRGPKGDMGY-PGPQGIPGDEGKRGPKGDM-GYPGRIGLDGRDGLPG 203
           GP   Q I         + +MG  PG   +P       P     G+    G+  R G   
Sbjct: 162 GPNPQQRINPQRFGFPMQPNMGMRPGFNQMPPHMPGMPPNQMRPGFNPMPGMPPRPGFNQ 221

Query: 204 EPGLD------GIPGRNGMDGIPGKDGLPGKDGIPGTN---GTNGEKGMTGPRG 248
            P +       G   + G    PG    P     PG N   G N    M GPR 
Sbjct: 222 NPNMMPNMNRPGFRPQPGGFNHPGTPMGPNMQQRPGFNPNQGMNPPPHMAGPRA 275


>gnl|CDD|219745 pfam08205, C2-set_2, CD80-like C2-set immunoglobulin domain.  These
           domains belong to the immunoglobulin superfamily.
          Length = 89

 Score = 28.2 bits (63), Expect = 4.8
 Identities = 9/33 (27%), Positives = 12/33 (36%), Gaps = 1/33 (3%)

Query: 294 HIFSSSPIVVREGDNVRLRCVAT-GHPKPSIIW 325
            I     ++  E   V   C +  G P P I W
Sbjct: 2   TIEPPVSLLEGENLEVVATCSSAGGKPAPRITW 34


>gnl|CDD|219514 pfam07686, V-set, Immunoglobulin V-set domain.  This domain is
           found in antibodies as well as neural protein P0 and
           CTL4 amongst others.
          Length = 114

 Score = 28.7 bits (64), Expect = 5.3
 Identities = 9/30 (30%), Positives = 14/30 (46%), Gaps = 1/30 (3%)

Query: 297 SSSPIVVREGDNVRLRC-VATGHPKPSIIW 325
              P+ V EG +V L C  ++     S+ W
Sbjct: 7   PPRPVTVAEGGSVTLPCSFSSSSGSTSVYW 36


>gnl|CDD|216952 pfam02264, LamB, LamB porin.  Maltoporin (LamB protein) forms a
           trimeric structure which facilitates the diffusion of
           maltodextrins across the outer membrane of Gram-negative
           bacteria. The membrane channel is formed by an
           antiparallel beta-barrel.
          Length = 399

 Score = 30.2 bits (68), Expect = 5.8
 Identities = 13/48 (27%), Positives = 17/48 (35%), Gaps = 3/48 (6%)

Query: 784 TYNSRYKDLYTWDRGN---SLTYPIRYHEIDSYNLNKQEKTDADMTTG 828
            ++ R   +  W  G+   S  Y     E D YNLN     D  M   
Sbjct: 163 VFDLRLAGIPLWPDGSLEVSGDYARANDEKDDYNLNYGAAKDGWMFGA 210


>gnl|CDD|131928 TIGR02882, QoxB, cytochrome aa3 quinol oxidase, subunit I.  This
           family (QoxB) encodes subunit I of the aa3-type quinone
           oxidase, one of several bacterial terminal oxidases.
           This complex couples oxidation of reduced quinones with
           the reduction of molecular oxygen to water and the
           pumping of protons to form a proton gradient utilized
           for ATP production. aa3-type oxidases contain two heme a
           cofactors as well as copper atoms in the active site
           [Energy metabolism, Electron transport].
          Length = 643

 Score = 29.8 bits (67), Expect = 7.7
 Identities = 19/65 (29%), Positives = 32/65 (49%), Gaps = 4/65 (6%)

Query: 723 WNITLLHKKVGDMFVVCGVLYVVDSVTDRNTNIRFALDLYKNDLLEVSLNFTNPFTN--T 780
           W  T+ HKK+G M+++C VL +     D    +R  L +  N  L+ + ++   FT    
Sbjct: 44  WLTTVDHKKIGVMYIICAVLMLFRGGID-ALLMRAQLTVPDNKFLD-AQHYNEIFTTHGV 101

Query: 781 TMITY 785
            MI +
Sbjct: 102 IMIIF 106


>gnl|CDD|200219 TIGR02927, SucB_Actino, 2-oxoglutarate dehydrogenase, E2 component,
           dihydrolipoamide succinyltransferase.  This model
           represents an Actinobacterial clade of E2 enzyme, a
           component of the 2-oxoglutarate dehydrogenase complex
           involved in the TCA cycle. These proteins have multiple
           domains including the catalytic domain (pfam00198), one
           or two biotin domains (pfam00364) and an E3-component
           binding domain (pfam02817).
          Length = 579

 Score = 29.6 bits (66), Expect = 8.1
 Identities = 16/66 (24%), Positives = 22/66 (33%), Gaps = 3/66 (4%)

Query: 123 EIGAKGEMGVKGERGEKGAPGVVGPPGPQGIPGDEG-KRGPKGDMGYPGPQGIPGDEGKR 181
           E+G  G + + GE GE G+      P P+  P  E     P      P P          
Sbjct: 68  EVG--GVLAIIGEPGEAGSEPAPAAPEPEAAPEPEAPAPAPTPAAEAPAPAAPQAGGSGE 125

Query: 182 GPKGDM 187
             +  M
Sbjct: 126 ATEVKM 131


>gnl|CDD|143256 cd05848, Ig1_Contactin-5, First Ig domain of contactin-5.
           Ig1_Contactin-5: First Ig domain of the neural cell
           adhesion molecule contactin-5. Contactins are comprised
           of six Ig domains followed by four fibronectin type III
           (FnIII) domains, anchored to the membrane by
           glycosylphosphatidylinositol. The different contactins
           show different expression patterns in the central
           nervous system. In rats, a lack of contactin-5 (NB-2)
           results in an impairment of the neuronal activity in the
           auditory system. Contactin-5 is expressed specifically
           in the postnatal nervous system, peaking at about 3
           weeks postnatal. Contactin-5 is highly expressed in the
           adult human brain in the occipital lobe and in the
           amygdala; lower levels of expression have been detected
           in the corpus callosum, caudate nucleus, and spinal
           cord.
          Length = 94

 Score = 27.6 bits (61), Expect = 9.3
 Identities = 23/88 (26%), Positives = 36/88 (40%), Gaps = 11/88 (12%)

Query: 300 PIVVREGDN-----------VRLRCVATGHPKPSIIWKTSENKPITLGSWKDSAVASSTL 348
           P+ V+E D+           V L C A G+P P+  W  +  +  T   ++ S +  + +
Sbjct: 2   PVFVQEPDDAIFPTDSDEKKVILNCEARGNPVPTYRWLRNGTEIDTESDYRYSLIDGNLI 61

Query: 349 NFTRINRINMGAYMCLADNVPPQILIRN 376
                   + G Y CLA N    IL R 
Sbjct: 62  ISNPSEVKDSGRYQCLATNSIGSILSRE 89


>gnl|CDD|193258 pfam12782, Innate_immun, Invertebrate innate immunity transcript
           family.  The immune response of the purple sea urchin
           appears to be more complex than previously believed in
           that it uses immune-related gene families homologous to
           vertebrate Toll-like and NOD/NALP-like receptor families
           as well as C-type lectins and a rudimentary complement
           system. In addition, the species also produces this
           unusual family of mRNAs, also known as 185/333, which is
           strongly upregulated in response to pathogen challenge.
          Length = 312

 Score = 29.2 bits (65), Expect = 9.4
 Identities = 31/86 (36%), Positives = 37/86 (43%), Gaps = 7/86 (8%)

Query: 138 EKGAPGVVGPPGPQGIPGDEGKRGPKGDMGYPGPQ-GIPGDEGKR---GPKGDMGYPG-R 192
           E+G     G PG   + G     GP G   + GP  G P  +G+R   GP G   + G R
Sbjct: 36  ERGQGRFGGRPGGMQMGGPRQDGGPMGGRRFDGPGSGAPQMDGRRQNGGPMGGRRFDGPR 95

Query: 193 IGLDGRDGLPGEPGLDGIPGRNGMDG 218
            G    DG  G P   G  GR G DG
Sbjct: 96  FGGSRPDGAGGRP-FFGQGGRRG-DG 119


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.315    0.136    0.421 

Gapped
Lambda     K      H
   0.267   0.0637    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 46,035,462
Number of extensions: 4544587
Number of successful extensions: 2692
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2604
Number of HSP's successfully gapped: 115
Length of query: 902
Length of database: 10,937,602
Length adjustment: 106
Effective length of query: 796
Effective length of database: 6,236,078
Effective search space: 4963918088
Effective search space used: 4963918088
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 64 (28.6 bits)
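
The footer values above are related by simple arithmetic. The sketch below is not part of the RPS-BLAST output. It recomputes the effective lengths and the effective search space from the reported length adjustment, and converts the raw X1/X2/X3 and S1/S2 values to bits using the standard Karlin-Altschul conversion with the Lambda and K values printed above. Function names are hypothetical. Pairing X1 and S1 with the ungapped parameters, and X2, X3 and S2 with the gapped ones, is an assumption that happens to reproduce the figures in this report. The sketch does not try to reproduce the per-hit Expect values.

```python
import math

# Values copied from the statistics footer of this report.
QUERY_LEN = 902
DB_LETTERS = 10_937_602
DB_SEQS = 44_354
LENGTH_ADJ = 106
LAMBDA_UNGAPPED, K_UNGAPPED = 0.315, 0.136
LAMBDA_GAPPED, K_GAPPED = 0.267, 0.0637


def effective_search_space(query_len, db_letters, db_seqs, adj):
    """Subtract the length adjustment from the query and from every
    database sequence, then multiply the two effective lengths."""
    eff_query = query_len - adj
    eff_db = db_letters - db_seqs * adj
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw, lam, k):
    """Karlin-Altschul conversion of a raw score to bits."""
    return (lam * raw - math.log(k)) / math.log(2)


def dropoff_bits(raw, lam):
    """X-dropoff values are score differences, so K does not enter."""
    return lam * raw / math.log(2)


eff_q, eff_db, space = effective_search_space(
    QUERY_LEN, DB_LETTERS, DB_SEQS, LENGTH_ADJ
)
print("Effective length of query:   ", eff_q)
print("Effective length of database:", eff_db)
print("Effective search space:      ", space)
print("X1 bits:", round(dropoff_bits(16, LAMBDA_UNGAPPED), 1))
print("X2 bits:", round(dropoff_bits(38, LAMBDA_GAPPED), 1))
print("X3 bits:", round(dropoff_bits(64, LAMBDA_GAPPED), 1))
print("S1 bits:", round(bit_score(42, LAMBDA_UNGAPPED, K_UNGAPPED), 1))
print("S2 bits:", round(bit_score(64, LAMBDA_GAPPED, K_GAPPED), 1))
```

The three integer results can be compared directly with the "Effective length" and "Effective search space" lines of the footer, and the bit values with the figures in parentheses after X1, X2, X3, S1 and S2.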