RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy1807
         (618 letters)



>gnl|CDD|222943 PHA02927, PHA02927, secreted complement-binding protein;
           Provisional.
          Length = 263

 Score = 98.2 bits (244), Expect = 8e-23
 Identities = 61/187 (32%), Positives = 89/187 (47%), Gaps = 14/187 (7%)

Query: 332 QCLFDWCAEPPQISGGIVTTSGRRTGSVATYSCEPGFILFGSNELKCGLGGD----WSGK 387
           QC+   C  P  I  G +   G   GS  TYSC  G+ L G ++  C LG      W+ +
Sbjct: 80  QCIKRRCPSPRDIDNGQLDIGGVDFGSSITYSCNSGYQLIGESKSYCELGSTGSMVWNPE 139

Query: 388 APTCKYVDCGSPANIDNGHYSLLNGTTTHGSVAEYTCDDDYWLEAGAMRRQVCMRDGKWS 447
           AP C+ V C SP +I NG ++      T GSV  Y+C+  Y L   +    V    G+WS
Sbjct: 140 APICESVKCQSPPSISNGRHNGYEDFYTDGSVVTYSCNSGYSLIGNS---GVLCSGGEWS 196

Query: 448 GDTPSCELITCPEPEVPKG----GYVVGYDLNVHSTLEFHCEPGHLLVGQASLKCTRQGD 503
            D P+C+++ CP P +  G    G+   Y  N    ++F C+ G+ L G +S  C+    
Sbjct: 197 -DPPTCQIVKCPHPTISNGYLSSGFKRSYSYN--DNVDFKCKYGYKLSGSSSSTCSPGNT 253

Query: 504 WDNMSPQ 510
           W    P+
Sbjct: 254 WQPELPK 260



 Score = 78.9 bits (194), Expect = 3e-16
 Identities = 54/180 (30%), Positives = 83/180 (46%), Gaps = 12/180 (6%)

Query: 220 CGSPDRHLNTTFVGTVNTKLGSTISYACPEGNMLVGSATRTCKEGF-----WTGVAPTCQ 274
           C SP R ++   +       GS+I+Y+C  G  L+G +   C+ G      W   AP C+
Sbjct: 86  CPSP-RDIDNGQLDIGGVDFGSSITYSCNSGYQLIGESKSYCELGSTGSMVWNPEAPICE 144

Query: 275 YVDCGKLEHIEHGTVTLETTRTTHGAVAIYACHENYTLIGETRRVCGDGGKWNGTEPQCL 334
            V C     I +G         T G+V  Y+C+  Y+LIG +  +C  GG+W+   P C 
Sbjct: 145 SVKCQSPPSISNGRHNGYEDFYTDGSVVTYSCNSGYSLIGNSGVLC-SGGEWS-DPPTCQ 202

Query: 335 FDWCAEPPQISGGIVTTSGRRTGSV---ATYSCEPGFILFGSNELKCGLGGDWSGKAPTC 391
              C   P IS G +++  +R+ S      + C+ G+ L GS+   C  G  W  + P C
Sbjct: 203 IVKCPH-PTISNGYLSSGFKRSYSYNDNVDFKCKYGYKLSGSSSSTCSPGNTWQPELPKC 261



 Score = 47.3 bits (112), Expect = 9e-06
 Identities = 28/96 (29%), Positives = 45/96 (46%), Gaps = 3/96 (3%)

Query: 240 GSTISYACPEGNMLVGSATRTCKEGFWTGVAPTCQYVDCGKLEHIEHGTVTLETTRT-TH 298
           GS ++Y+C  G  L+G++   C  G W+   PTCQ V C     I +G ++    R+ ++
Sbjct: 169 GSVVTYSCNSGYSLIGNSGVLCSGGEWSD-PPTCQIVKCPH-PTISNGYLSSGFKRSYSY 226

Query: 299 GAVAIYACHENYTLIGETRRVCGDGGKWNGTEPQCL 334
                + C   Y L G +   C  G  W    P+C+
Sbjct: 227 NDNVDFKCKYGYKLSGSSSSTCSPGNTWQPELPKCV 262


>gnl|CDD|153060 cd03590, CLECT_DC-SIGN_like, C-type lectin-like domain (CTLD) of
           the type found in human dendritic cell (DC)-specific
           intercellular adhesion molecule 3-grabbing non-integrin
           (DC-SIGN) and the related receptor, DC-SIGN receptor
           (DC-SIGNR).  CLECT_DC-SIGN_like: C-type lectin-like
           domain (CTLD) of the type found in human dendritic cell
           (DC)-specific intercellular adhesion molecule 3-grabbing
           non-integrin (DC-SIGN) and the related receptor, DC-SIGN
           receptor (DC-SIGNR).  This group also contains proteins
           similar to hepatic asialoglycoprotein receptor (ASGP-R)
           and langerin in human.  These proteins are type II
           membrane proteins with a CTLD ectodomain.  CTLD refers
           to a domain homologous to the carbohydrate-recognition
           domains (CRDs) of the C-type lectins.  DC-SIGN is
           thought to mediate the initial contact between dendritic
           cells and resting T cells, and may also mediate the
           rolling of DCs on epithelium.  DC-SIGN and DC-SIGNR bind
           to oligosaccharides present on human tissues, as well
           as, on pathogens including parasites, bacteria, and
           viruses.  DC-SIGN and DC-SIGNR bind to HIV enhancing
           viral infection of T cells.  DC-SIGN and DC-SIGNR are
           homotetrameric, and contain four CTLDs stabilized by a
           coiled coil of alpha helices.  The hepatic ASGP-R is an
           endocytic recycling receptor which binds and
           internalizes desialylated glycoproteins having a
           terminal galactose or N-acetylgalactosamine residues on
           their N-linked carbohydrate chains, via the
           clathrin-coated pit mediated endocytic pathway, and
           delivers them to lysosomes for degradation.  It has been
           proposed that glycoproteins bearing terminal Sia (sialic
           acid) alpha2, 6GalNAc and Sia alpha2, 6Gal are
           endogenous ligands for ASGP-R and that ASGP-R
           participates in regulating the relative concentration of
           serum glycoproteins bearing alpha 2,6-linked Sia.  The
           human ASGP-R is a hetero-oligomer composed of two
           subunits, both of which are found within this group.
           Langerin is expressed in a subset of dendritic
           leukocytes, the Langerhans cells (LC). Langerin induces
           the formation of Birbeck Granules (BGs) and associates
           with these BGs following internalization.  Langerin
           binds, in a calcium-dependent manner, to
           glyco-conjugates containing mannose and related sugars
           mediating their uptake and degradation.  Langerin
           molecules oligomerize as trimers with three CTLDs held
           together by a coiled-coil of alpha helices.
          Length = 126

 Score = 80.4 bits (199), Expect = 4e-18
 Identities = 45/136 (33%), Positives = 62/136 (45%), Gaps = 24/136 (17%)

Query: 87  TSFDRNCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNT-----FLLGELERRKDRL 141
            SF  +CY F+ +K  S+ ++R  C+  G  LV     I N+     F+   L   +   
Sbjct: 6   KSFQSSCYFFSTEK-KSWEESRQFCEDMGAHLV-----IINSQEEQEFISKILSGNRS-- 57

Query: 142 KTLLVWIGAQKDPGITARTWKWVDG--EVVTKPSWGKDQPNNYNG-EQNCVVLDGGRGWL 198
                WIG   D   T   WKWVDG     +K  W   +PNN+ G  ++C  L       
Sbjct: 58  ----YWIGL-SDEE-TEGEWKWVDGTPLNSSKTFWHPGEPNNWGGGGEDCAELV-YDSGG 110

Query: 199 WNDVGCKLDYLHWICQ 214
           WNDV C L+Y  WIC+
Sbjct: 111 WNDVPCNLEY-RWICE 125


>gnl|CDD|153057 cd00037, CLECT, C-type lectin (CTL)/C-type lectin-like (CTLD)
           domain.  CLECT: C-type lectin (CTL)/C-type lectin-like
           (CTLD) domain; protein domains homologous to the
           carbohydrate-recognition domains (CRDs) of the C-type
           lectins.  This group is chiefly comprised of eukaryotic
           CTLDs, but contains some, as yet functionally
           uncharacterized, bacterial CTLDs.  Many CTLDs are
           calcium-dependent carbohydrate binding modules; other
           CTLDs bind protein ligands, lipids, and inorganic
           surfaces, including CaCO3 and ice.  Animal C-type
           lectins are involved in such functions as extracellular
           matrix organization, endocytosis, complement activation,
           pathogen recognition, and cell-cell interactions. For
           example: mannose-binding lectin and lung surfactant
           proteins A and D bind carbohydrates on surfaces (e.g.
           pathogens, allergens, necrotic, and apoptotic cells) and
           mediate functions associated with killing and
           phagocytosis;  P (platlet)-, E (endothelial)-, and L
           (leukocyte)- selectins (sels) mediate the initial
           attachment, tethering, and rolling of lymphocytes on
           inflamed vascular walls enabling subsequent lymphocyte
           adhesion and transmigration.  CTLDs may bind a variety
           of carbohydrate ligands including mannose,
           N-acetylglucosamine, galactose, N-acetylgalactosamine,
           and fucose.  Several CTLDs bind to protein ligands, and
           only some of these binding interactions are
           Ca2+-dependent; including the CTLDs of Coagulation
           Factors IX/X (IX/X) and Von Willebrand Factor (VWF)
           binding proteins, and natural killer cell receptors.
           C-type lectins, such as lithostathine, and some type II
           antifreeze glycoproteins function in a Ca2+-independent
           manner to bind inorganic surfaces.  Many proteins in
           this group contain a single CTLD; these CTLDs associate
           with each other through several different surfaces to
           form dimers, trimers, or tetramers, from which
           ligand-binding sites project in different orientations.
           Various vertebrate type 1 transmembrane proteins
           including macrophage mannose receptor, endo180,
           phospholipase A2 receptor, and dendritic and epithelial
           cell receptor (DEC205) have extracellular domains
           containing 8 or more CTLDs; these CTLDs remain in the
           parent model.  In some members (IX/X and VWF binding
           proteins), a loop extends to the adjoining domain to
           form a loop-swapped dimer.  A similar conformation is
           seen in the macrophage mannose receptor CRD4's putative
           non-sugar bound form of the domain in the acid
           environment of the endosome.  Lineage specific
           expansions of CTLDs have occurred in several animal
           lineages including Drosophila melanogaster and
           Caenorhabditis elegans; these CTLDs also remain in the
           parent model.
          Length = 116

 Score = 79.2 bits (195), Expect = 9e-18
 Identities = 38/125 (30%), Positives = 58/125 (46%), Gaps = 10/125 (8%)

Query: 92  NCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQ 151
           +CY+F+ +K  ++ +A+  C++ GG L      I +      L     +  +  VWIG  
Sbjct: 1   SCYKFSTEK-LTWEEAQEYCRSLGGHLA----SIHSEEENDFLASLLKKSSSSDVWIGLN 55

Query: 152 KDPGITARTWKWVDG-EVVTKPSWGKDQPNNYNGEQNCVVLDGGRGWLWNDVGCKLDYLH 210
                +  TWKW DG  +V   +W   +PN    E +CVVL       WNDV C    L 
Sbjct: 56  DLS--SEGTWKWSDGSPLVDYTNWAPGEPNPGGSE-DCVVLSSSSDGKWNDVSCS-SKLP 111

Query: 211 WICQH 215
           +IC+ 
Sbjct: 112 FICEK 116


>gnl|CDD|214480 smart00034, CLECT, C-type lectin (CTL) or carbohydrate-recognition
           domain (CRD).  Many of these domains function as
           calcium-dependent carbohydrate binding modules.
          Length = 124

 Score = 73.0 bits (179), Expect = 1e-15
 Identities = 41/129 (31%), Positives = 62/129 (48%), Gaps = 11/129 (8%)

Query: 87  TSFDRNCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLV 146
            S+   CY+F+ +K  ++ DA++ CQ+ GG L        N F+   L   K+   +   
Sbjct: 6   ISYGGKCYKFSTEK-KTWEDAQAFCQSLGGHLASIHSEAENDFVASLL---KNSGSSDYY 61

Query: 147 WIGAQKDPGITARTWKWVDGEVVTKPS-WGKDQPNNYNGEQNCVVLDGGRGWLWNDVGCK 205
           WIG       +  +W+W DG      S W   +PNN +G  +CVVL    G  WNDV C 
Sbjct: 62  WIGLSDPD--SNGSWQWSDGSGPVSYSNWAPGEPNNSSG--DCVVLS-TSGGKWNDVSC- 115

Query: 206 LDYLHWICQ 214
              L ++C+
Sbjct: 116 TSKLPFVCE 124


>gnl|CDD|214478 smart00032, CCP, Domain abundant in complement control proteins;
           SUSHI repeat; short complement-like repeat (SCR).  The
           complement control protein (CCP) modules (also known as
           short consensus repeats SCRs or SUSHI repeats) contain
           approximately 60 amino acid residues and have been
           identified in several proteins of the complement system.
           A missense mutation in seventh CCP domain causes
           deficiency of the b subunit of factor XIII.
          Length = 56

 Score = 65.6 bits (160), Expect = 9e-14
 Identities = 20/56 (35%), Positives = 28/56 (50%)

Query: 278 CGKLEHIEHGTVTLETTRTTHGAVAIYACHENYTLIGETRRVCGDGGKWNGTEPQC 333
           C     IE+GTVT  +   ++G    Y+C   YTLIG +   C + G W+   P C
Sbjct: 1   CPPPPDIENGTVTSSSGTYSYGDTVTYSCDPGYTLIGSSTITCLENGTWSPPPPTC 56



 Score = 60.6 bits (147), Expect = 5e-12
 Identities = 25/56 (44%), Positives = 30/56 (53%), Gaps = 2/56 (3%)

Query: 338 CAEPPQISGGIVTTSGR--RTGSVATYSCEPGFILFGSNELKCGLGGDWSGKAPTC 391
           C  PP I  G VT+S      G   TYSC+PG+ L GS+ + C   G WS   PTC
Sbjct: 1   CPPPPDIENGTVTSSSGTYSYGDTVTYSCDPGYTLIGSSTITCLENGTWSPPPPTC 56



 Score = 57.2 bits (138), Expect = 8e-11
 Identities = 19/58 (32%), Positives = 30/58 (51%), Gaps = 2/58 (3%)

Query: 396 CGSPANIDNGHYSLLNGTTTHGSVAEYTCDDDYWLEAGAMRRQVCMRDGKWSGDTPSC 453
           C  P +I+NG  +  +GT ++G    Y+CD  Y L   +     C+ +G WS   P+C
Sbjct: 1   CPPPPDIENGTVTSSSGTYSYGDTVTYSCDPGYTLIGSSTIT--CLENGTWSPPPPTC 56



 Score = 47.9 bits (114), Expect = 2e-07
 Identities = 22/56 (39%), Positives = 29/56 (51%), Gaps = 2/56 (3%)

Query: 220 CGSPDRHLNTTFVGTVNT-KLGSTISYACPEGNMLVGSATRTCKE-GFWTGVAPTC 273
           C  P    N T   +  T   G T++Y+C  G  L+GS+T TC E G W+   PTC
Sbjct: 1   CPPPPDIENGTVTSSSGTYSYGDTVTYSCDPGYTLIGSSTITCLENGTWSPPPPTC 56



 Score = 47.9 bits (114), Expect = 2e-07
 Identities = 15/55 (27%), Positives = 24/55 (43%), Gaps = 2/55 (3%)

Query: 458 CPEPEVPKGGYVVGYDLNVH--STLEFHCEPGHLLVGQASLKCTRQGDWDNMSPQ 510
           CP P   + G V           T+ + C+PG+ L+G +++ C   G W    P 
Sbjct: 1   CPPPPDIENGTVTSSSGTYSYGDTVTYSCDPGYTLIGSSTITCLENGTWSPPPPT 55


>gnl|CDD|153056 cd00033, CCP, Complement control protein (CCP) modules (aka short
           consensus repeats SCRs or SUSHI repeats) have been
           identified in several proteins of the complement system.
            SUSHI repeats (short complement-like repeat, SCR) are
           abundant in complement control proteins. The complement
           control protein (CCP) modules (also known as short
           consensus repeats SCRs or SUSHI repeats) contain
           approximately 60 amino acid residues and have been
           identified in several proteins of the complement system.
           Typically, 2 to 4 modules contribute to a binding site,
           implying that the orientation of the modules to each
           other is critical for function.
          Length = 57

 Score = 65.2 bits (159), Expect = 1e-13
 Identities = 19/56 (33%), Positives = 29/56 (51%)

Query: 278 CGKLEHIEHGTVTLETTRTTHGAVAIYACHENYTLIGETRRVCGDGGKWNGTEPQC 333
           C      E+GTVT      ++G+   Y+C+E YTL+G +   C + G W+   P C
Sbjct: 1   CPPPPVPENGTVTGSKGSYSYGSTVTYSCNEGYTLVGSSTITCTENGGWSPPPPTC 56



 Score = 61.7 bits (150), Expect = 3e-12
 Identities = 24/57 (42%), Positives = 28/57 (49%), Gaps = 2/57 (3%)

Query: 338 CAEPPQISGGIVTTSGRRT--GSVATYSCEPGFILFGSNELKCGLGGDWSGKAPTCK 392
           C  PP    G VT S      GS  TYSC  G+ L GS+ + C   G WS   PTC+
Sbjct: 1   CPPPPVPENGTVTGSKGSYSYGSTVTYSCNEGYTLVGSSTITCTENGGWSPPPPTCE 57



 Score = 54.8 bits (132), Expect = 6e-10
 Identities = 19/59 (32%), Positives = 30/59 (50%), Gaps = 2/59 (3%)

Query: 396 CGSPANIDNGHYSLLNGTTTHGSVAEYTCDDDYWLEAGAMRRQVCMRDGKWSGDTPSCE 454
           C  P   +NG  +   G+ ++GS   Y+C++ Y L  G+     C  +G WS   P+CE
Sbjct: 1   CPPPPVPENGTVTGSKGSYSYGSTVTYSCNEGYTLV-GSSTIT-CTENGGWSPPPPTCE 57



 Score = 51.7 bits (124), Expect = 7e-09
 Identities = 20/55 (36%), Positives = 28/55 (50%), Gaps = 2/55 (3%)

Query: 458 CPEPEVPKGGYVVGY--DLNVHSTLEFHCEPGHLLVGQASLKCTRQGDWDNMSPQ 510
           CP P VP+ G V G     +  ST+ + C  G+ LVG +++ CT  G W    P 
Sbjct: 1   CPPPPVPENGTVTGSKGSYSYGSTVTYSCNEGYTLVGSSTITCTENGGWSPPPPT 55



 Score = 49.8 bits (119), Expect = 4e-08
 Identities = 25/57 (43%), Positives = 33/57 (57%), Gaps = 2/57 (3%)

Query: 220 CGSPDRHLNTTFVGTVNT-KLGSTISYACPEGNMLVGSATRTCKE-GFWTGVAPTCQ 274
           C  P    N T  G+  +   GST++Y+C EG  LVGS+T TC E G W+   PTC+
Sbjct: 1   CPPPPVPENGTVTGSKGSYSYGSTVTYSCNEGYTLVGSSTITCTENGGWSPPPPTCE 57


>gnl|CDD|215684 pfam00059, Lectin_C, Lectin C-type domain.  This family includes
           both long and short form C-type.
          Length = 108

 Score = 66.2 bits (162), Expect = 2e-13
 Identities = 33/115 (28%), Positives = 48/115 (41%), Gaps = 12/115 (10%)

Query: 103 SFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLL--VWIGAQKDPGITART 160
           ++A+A + CQ  GG LV     I +     E +     +K      WIG       T  T
Sbjct: 4   TWAEAEAACQKLGGHLV----SIQS---AEEQDFLTSLVKASNTYAWIGLTDIN--TEGT 54

Query: 161 WKWVDGEVVTKPSWGKDQPNNYNGEQNCVVLDGGRGWLWNDVGCKLDYLHWICQH 215
           W W DG  V   +W   +PNN  G ++CV +       WND  C      ++C+ 
Sbjct: 55  WVWTDGSPVNYTNWAPGEPNNKGGNEDCVEIYTDTNGKWNDEPC-GSKRLYVCEF 108


>gnl|CDD|165022 PHA02639, PHA02639, EEV host range protein; Provisional.
          Length = 295

 Score = 68.5 bits (167), Expect = 1e-12
 Identities = 47/179 (26%), Positives = 74/179 (41%), Gaps = 12/179 (6%)

Query: 337 WCAEPPQISGGIVTTSGRR--TGSVATYSCEPGFILFGSNELKCGLGGD---WSGKAPTC 391
           +C +P  IS G +T    +   G +  Y+C   + L G     C    +   WS KAP C
Sbjct: 21  YCDKPDDISNGFITELMEKYEIGKLIEYTCNTDYALIGDRFRTCIKDKNNAIWSNKAPFC 80

Query: 392 KYVDCGSPANIDNGHYSLLNGTTTHGSVAEYTCDDDYWLEAGAM--RRQVCMRDGKWSGD 449
              +C  P +I NG           G    Y C++   ++   +   +  C++D  W  D
Sbjct: 81  MLKECNDPPSIINGKIYNKREMYKVGDEIYYVCNEHKGVQYSLVGNEKITCIQDKSWKPD 140

Query: 450 TPSCELITCPEPEVPKGGYVVGYDLN----VHSTLEFHCEPGHLLVGQASLKCTRQGDW 504
            P C++I C  P + + GY+ G   N      + + F C+ G  LVG+    C     W
Sbjct: 141 PPICKMINCRFPAL-QNGYINGIPSNKKFYYKTRVGFSCKSGFDLVGEKYSTCNINATW 198



 Score = 65.8 bits (160), Expect = 8e-12
 Identities = 48/185 (25%), Positives = 70/185 (37%), Gaps = 14/185 (7%)

Query: 220 CGSPDRHLNTTFVGTVNTK--LGSTISYACPEGNMLVGSATRTC----KEGFWTGVAPTC 273
           C  PD  ++  F+  +  K  +G  I Y C     L+G   RTC        W+  AP C
Sbjct: 22  CDKPD-DISNGFITELMEKYEIGKLIEYTCNTDYALIGDRFRTCIKDKNNAIWSNKAPFC 80

Query: 274 QYVDCGKLEHIEHGTVTLETTRTTHGAVAIYACHEN----YTLIGETRRVCGDGGKWNGT 329
              +C     I +G +  +      G    Y C+E+    Y+L+G  +  C     W   
Sbjct: 81  MLKECNDPPSIINGKIYNKREMYKVGDEIYYVCNEHKGVQYSLVGNEKITCIQDKSWKPD 140

Query: 330 EPQCLFDWCAEPPQISG---GIVTTSGRRTGSVATYSCEPGFILFGSNELKCGLGGDWSG 386
            P C    C  P   +G   GI +       +   +SC+ GF L G     C +   W  
Sbjct: 141 PPICKMINCRFPALQNGYINGIPSNKKFYYKTRVGFSCKSGFDLVGEKYSTCNINATWFP 200

Query: 387 KAPTC 391
             PTC
Sbjct: 201 SIPTC 205



 Score = 63.9 bits (155), Expect = 4e-11
 Identities = 44/187 (23%), Positives = 76/187 (40%), Gaps = 14/187 (7%)

Query: 278 CGKLEHIEHGTVTLETTRTTHGAVAIYACHENYTLIGETRRVC---GDGGKWNGTEPQCL 334
           C K + I +G +T    +   G +  Y C+ +Y LIG+  R C    +   W+   P C+
Sbjct: 22  CDKPDDISNGFITELMEKYEIGKLIEYTCNTDYALIGDRFRTCIKDKNNAIWSNKAPFCM 81

Query: 335 FDWCAEPPQISGGIVTTSGR--RTGSVATYSCEP----GFILFGSNELKCGLGGDWSGKA 388
              C +PP I  G +       + G    Y C       + L G+ ++ C     W    
Sbjct: 82  LKECNDPPSIINGKIYNKREMYKVGDEIYYVCNEHKGVQYSLVGNEKITCIQDKSWKPDP 141

Query: 389 PTCKYVDCGSPANIDNGHYSLL--NGTTTHGSVAEYTCDDDYWLEAGAMRRQVCMRDGKW 446
           P CK ++C  PA + NG+ + +  N    + +   ++C   + L     +   C  +  W
Sbjct: 142 PICKMINCRFPA-LQNGYINGIPSNKKFYYKTRVGFSCKSGFDLV--GEKYSTCNINATW 198

Query: 447 SGDTPSC 453
               P+C
Sbjct: 199 FPSIPTC 205



 Score = 52.4 bits (125), Expect = 2e-07
 Identities = 44/166 (26%), Positives = 65/166 (39%), Gaps = 27/166 (16%)

Query: 392 KYVDCGSPANIDNGHYSLLNGTTTHGSVAEYTCDDDYWLEAGAMRRQVCMRDGK---WSG 448
           K + C  P +I NG  + L      G + EYTC+ DY L     R   C++D     WS 
Sbjct: 18  KSIYCDKPDDISNGFITELMEKYEIGKLIEYTCNTDYALIGDRFR--TCIKDKNNAIWSN 75

Query: 449 DTPSCELITCPEP------------EVPKGGYVVGYDLNVHSTLEFHCEPGHLLVGQASL 496
             P C L  C +P            E+ K G  + Y  N H  +++       LVG   +
Sbjct: 76  KAPFCMLKECNDPPSIINGKIYNKREMYKVGDEIYYVCNEHKGVQYS------LVGNEKI 129

Query: 497 KCTRQGDWDNMSP--QFISFYFPPSET--ISAVSEAQTLGYMISVG 538
            C +   W    P  + I+  FP  +   I+ +   +   Y   VG
Sbjct: 130 TCIQDKSWKPDPPICKMINCRFPALQNGYINGIPSNKKFYYKTRVG 175



 Score = 37.7 bits (87), Expect = 0.012
 Identities = 24/104 (23%), Positives = 47/104 (45%), Gaps = 8/104 (7%)

Query: 238 KLGSTISYACPEGN----MLVGSATRTC-KEGFWTGVAPTCQYVDCGKLEHIEHGTVT-L 291
           K+G  I Y C E       LVG+   TC ++  W    P C+ ++C +   +++G +  +
Sbjct: 104 KVGDEIYYVCNEHKGVQYSLVGNEKITCIQDKSWKPDPPICKMINC-RFPALQNGYINGI 162

Query: 292 ETTRTTHGAVAI-YACHENYTLIGETRRVCGDGGKWNGTEPQCL 334
            + +  +    + ++C   + L+GE    C     W  + P C+
Sbjct: 163 PSNKKFYYKTRVGFSCKSGFDLVGEKYSTCNINATWFPSIPTCV 206


>gnl|CDD|165167 PHA02817, PHA02817, EEV Host range protein; Provisional.
          Length = 225

 Score = 60.7 bits (147), Expect = 2e-10
 Identities = 39/127 (30%), Positives = 55/127 (43%), Gaps = 8/127 (6%)

Query: 273 CQYVDCGKLEHIEHGTVTLETTRTTHGAVAIYACHEN-----YTLIGETRRVCGDGGKWN 327
           C    C     I++G +  + T    G+   + C  N     YTL+GE   +C   GKWN
Sbjct: 19  CDLNKCCYPPSIKNGYIYNKKTEYNIGSNVTFFCGNNTRGVRYTLVGEKNIICEKDGKWN 78

Query: 328 GTEPQCLFDWCAEPPQISG---GIVTTSGRRTGSVATYSCEPGFILFGSNELKCGLGGDW 384
              P C    C  P   +G   GI  +      S  ++SC+PGF+L G+    CG+   W
Sbjct: 79  KEFPVCKIIRCRFPALQNGFVNGIPDSKKFYYESEVSFSCKPGFVLIGTKYSVCGINSSW 138

Query: 385 SGKAPTC 391
             K P C
Sbjct: 139 IPKVPIC 145



 Score = 51.5 bits (123), Expect = 2e-07
 Identities = 33/123 (26%), Positives = 53/123 (43%), Gaps = 12/123 (9%)

Query: 391 CKYVDCGSPANIDNGHYSLLNGTTTHGSVAEYTCDDD-----YWLEAGAMRRQVCMRDGK 445
           C    C  P +I NG+          GS   + C ++     Y L     +  +C +DGK
Sbjct: 19  CDLNKCCYPPSIKNGYIYNKKTEYNIGSNVTFFCGNNTRGVRYTLVGE--KNIICEKDGK 76

Query: 446 WSGDTPSCELITCPEPEVPKGGYVVGYDLN----VHSTLEFHCEPGHLLVGQASLKCTRQ 501
           W+ + P C++I C  P + + G+V G   +      S + F C+PG +L+G     C   
Sbjct: 77  WNKEFPVCKIIRCRFPAL-QNGFVNGIPDSKKFYYESEVSFSCKPGFVLIGTKYSVCGIN 135

Query: 502 GDW 504
             W
Sbjct: 136 SSW 138



 Score = 40.7 bits (95), Expect = 0.001
 Identities = 34/133 (25%), Positives = 50/133 (37%), Gaps = 18/133 (13%)

Query: 333 CLFDWCAEPPQISGGIVTTSGR--RTGSVATYSCEPG-----FILFGSNELKCGLGGDWS 385
           C  + C  PP I  G +         GS  T+ C        + L G   + C   G W+
Sbjct: 19  CDLNKCCYPPSIKNGYIYNKKTEYNIGSNVTFFCGNNTRGVRYTLVGEKNIICEKDGKWN 78

Query: 386 GKAPTCKYVDCGSPANIDNGHYSLLNGTTT-----HGSVAEYTCDDDYWLEAGAMRRQVC 440
            + P CK + C  PA + NG    +NG        + S   ++C   + L  G  +  VC
Sbjct: 79  KEFPVCKIIRCRFPA-LQNGF---VNGIPDSKKFYYESEVSFSCKPGFVL-IGT-KYSVC 132

Query: 441 MRDGKWSGDTPSC 453
             +  W    P C
Sbjct: 133 GINSSWIPKVPIC 145


>gnl|CDD|153059 cd03589, CLECT_CEL-1_like, C-type lectin-like domain (CTLD) of the
           type found in CEL-1 from Cucumaria echinata and
           Echinoidin from Anthocidaris crassispina.
           CLECT_CEL-1_like: C-type lectin-like domain (CTLD) of
           the type found in CEL-1 from Cucumaria echinata and
           Echinoidin from Anthocidaris crassispina.  CTLD refers
           to a domain homologous to the carbohydrate-recognition
           domains (CRDs) of the C-type lectins.  The CEL-1 CTLD
           binds three calcium ions and has a high specificity for
           N-acteylgalactosamine (GalNAc).  CEL-1 exhibits strong
           cytotoxicity which is inhibited by GalNAc.  This protein
           may play a role as a toxin defending against predation. 
           Echinoidin is found in the coelomic fluid of the sea
           urchin and is specific for GalBeta1-3GalNAc.  Echinoidin
           has a cell adhesive activity towards human cancer cells
           which is not mediated through the CTLD.  Both CEL-1 and
           Echinoidin are multimeric proteins comprised of multiple
           dimers linked by disulfide bonds.
          Length = 137

 Score = 55.8 bits (135), Expect = 2e-09
 Identities = 37/138 (26%), Positives = 57/138 (41%), Gaps = 15/138 (10%)

Query: 87  TSFDRNCYEFNIDKGASFADARSKCQ---ASG--GDLVHGFRGITNTFLLGELERRKDRL 141
           T+F   CY F      ++ +A  +C+     G    LV       N F+    E  +   
Sbjct: 6   TAFGGYCYRF-FGDRLTWEEAELRCRSFSIPGLIAHLVSIHSQEENDFVYDLFESSRGPD 64

Query: 142 KTLLVWIGA--QKDPGITARTWKWVDGEVVTKPSWGKDQPNNYNGEQNCVVL--DGGRGW 197
               +WIG   +   G     ++W DG  V    W   QP+NY G ++CV +   G  G 
Sbjct: 65  TPYGLWIGLHDRTSEG----PFEWTDGSPVDFTKWAGGQPDNYGGNEDCVQMWRRGDAGQ 120

Query: 198 LWNDVGCKLDYLHWICQH 215
            WND+ C   +  +IC+ 
Sbjct: 121 SWNDMPCDAVF-PYICKM 137


>gnl|CDD|215703 pfam00084, Sushi, Sushi domain (SCR repeat). 
          Length = 56

 Score = 53.0 bits (127), Expect = 3e-09
 Identities = 17/56 (30%), Positives = 24/56 (42%)

Query: 278 CGKLEHIEHGTVTLETTRTTHGAVAIYACHENYTLIGETRRVCGDGGKWNGTEPQC 333
           C     I HG ++        G+V  Y C   Y L+G +   C D G W+   P+C
Sbjct: 1   CPAPPPIPHGVISTTKNSYQVGSVVRYECDPGYVLVGSSVITCQDDGTWSPPLPEC 56



 Score = 52.2 bits (125), Expect = 6e-09
 Identities = 22/56 (39%), Positives = 31/56 (55%), Gaps = 2/56 (3%)

Query: 338 CAEPPQISGGIVTTSGRRT--GSVATYSCEPGFILFGSNELKCGLGGDWSGKAPTC 391
           C  PP I  G+++T+      GSV  Y C+PG++L GS+ + C   G WS   P C
Sbjct: 1   CPAPPPIPHGVISTTKNSYQVGSVVRYECDPGYVLVGSSVITCQDDGTWSPPLPEC 56



 Score = 48.7 bits (116), Expect = 9e-08
 Identities = 20/58 (34%), Positives = 23/58 (39%), Gaps = 2/58 (3%)

Query: 396 CGSPANIDNGHYSLLNGTTTHGSVAEYTCDDDYWLEAGAMRRQVCMRDGKWSGDTPSC 453
           C +P  I +G  S    +   GSV  Y CD  Y L         C  DG WS   P C
Sbjct: 1   CPAPPPIPHGVISTTKNSYQVGSVVRYECDPGYVLV--GSSVITCQDDGTWSPPLPEC 56



 Score = 42.9 bits (101), Expect = 9e-06
 Identities = 19/56 (33%), Positives = 29/56 (51%), Gaps = 2/56 (3%)

Query: 220 CGSPDRHLNTTFVGTVN-TKLGSTISYACPEGNMLVGSATRTCKE-GFWTGVAPTC 273
           C +P    +     T N  ++GS + Y C  G +LVGS+  TC++ G W+   P C
Sbjct: 1   CPAPPPIPHGVISTTKNSYQVGSVVRYECDPGYVLVGSSVITCQDDGTWSPPLPEC 56



 Score = 42.2 bits (99), Expect = 2e-05
 Identities = 15/51 (29%), Positives = 23/51 (45%), Gaps = 2/51 (3%)

Query: 458 CPEPEVPKGGYVVGY--DLNVHSTLEFHCEPGHLLVGQASLKCTRQGDWDN 506
           CP P     G +        V S + + C+PG++LVG + + C   G W  
Sbjct: 1   CPAPPPIPHGVISTTKNSYQVGSVVRYECDPGYVLVGSSVITCQDDGTWSP 51


>gnl|CDD|153061 cd03591, CLECT_collectin_like, C-type lectin-like domain (CTLD) of
           the type found in human collectins including lung
           surfactant proteins A and D, mannose- or mannan binding
           lectin (MBL), and CL-L1 (collectin liver 1).
           CLECT_collectin_like: C-type lectin-like domain (CTLD)
           of the type found in human collectins including lung
           surfactant proteins A and D, mannose- or mannan binding
           lectin (MBL), and CL-L1 (collectin liver 1).  CTLD
           refers to a domain homologous to the
           carbohydrate-recognition domains (CRDs) of the C-type
           lectins. The CTLDs of these collectins bind
           carbohydrates on surfaces (e.g. pathogens, allergens,
           necrotic, or apoptotic cells) and mediate functions
           associated with killing and phagocytosis.  MBPs
           recognize high mannose oligosaccharides in a calcium
           dependent manner, bind to a broad range of pathogens,
           and trigger cell killing by activating the complement
           pathway.  MBP also acts directly as an opsonin.  SP-A
           and SP-D in addition to functioning as host defense
           components, are components of pulmonary surfactant which
           play a role in surfactant homeostasis.  Pulmonary
           surfactant is a phospholipid-protein complex which
           reduces the surface tension within the lungs.  SP-A
           binds the major surfactant lipid:
           dipalmitoylphosphatidylcholine (DPPC).  SP-D binds two
           minor components of surfactant that contain sugar
           moieties: glucosylceramide and phosphatidylinositol
           (PI).  MBP and SP-A, -D monomers are homotrimers with an
           N-terminal collagen region and three CTLDs.  Multiple
           homotrimeric units associate to form supramolecular
           complexes.  MBL deficiency results in an increased
           susceptibility to a large number of different infections
           and to inflammatory disease, such as rheumatoid
           arthritis.
          Length = 114

 Score = 51.5 bits (124), Expect = 4e-08
 Identities = 28/112 (25%), Positives = 47/112 (41%), Gaps = 12/112 (10%)

Query: 104 FADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDPGITARTWKW 163
           F DA+  C  +GG L        N  +   +++          +IG       T   + +
Sbjct: 13  FDDAQKLCSEAGGTLAMPRNAAENAAIASYVKKGNTY-----AFIGITDLE--TEGQFVY 65

Query: 164 VDGEVVTKPSWGKDQPNNYNGEQNCVV-LDGGRGWLWNDVGCKLDYLHWICQ 214
           +DG  +T  +W   +PNN  G ++CV     G+   WNDV C      ++C+
Sbjct: 66  LDGGPLTYTNWKPGEPNNAGGGEDCVEMYTSGK---WNDVAC-NLTRLFVCE 113


>gnl|CDD|153062 cd03592, CLECT_selectins_like, C-type lectin-like domain (CTLD) of
           the type found in the type 1 transmembrane proteins:
           P(platlet)-, E(endothelial)-, and L(leukocyte)-
           selectins (sels).  CLECT_selectins_like: C-type
           lectin-like domain (CTLD) of the type found in the type
           1 transmembrane proteins:  P(platlet)-, E(endothelial)-,
           and L(leukocyte)- selectins (sels).  CTLD refers to a
           domain homologous to the carbohydrate-recognition
           domains (CRDs) of the C-type lectins.  P- E- and L-sels
           are cell adhesion receptors that mediate the initial
           attachment, tethering, and rolling of lymphocytes on
           inflamed vascular walls enabling subsequent lymphocyte
           adhesion and transmigration.  L- sel is expressed
           constitutively on most leukocytes.  P-sel is stored in
           the Weibel-Palade bodies of endothelial cells and in the
           alpha granules of platlets.  E- sels are present on
           endothelial cells.  Following platelet and/or
           endothelial cell activation P- sel is rapidly
           translocated to the cell surface and E-sel expression is
           induced.  The initial step in leukocyte migration
           involves interactions of selectins with fucosylated,
           sialylated, and sulfated carbohydrate moieties on target
           ligands displayed on glycoprotein scaffolds on
           endothelial cells and leucocytes.  A major ligand of P-
           E- and L-sels is PSGL-1 (P-sel glycoprotein ligand).
           Interactions of E- and P- sels with tumor cells may
           promote extravasation of cancer cells.   Regulation of
           L-sel and P-sel function includes proteolytic shedding
           of the most extracellular portion (containing the CTLD)
           from the cell surface.  Increased levels of the soluble
           form of P-sel in the plasma have been found in a number
           of diseases including coronary disease and diabetes.  E-
           and P- sel also play roles in the development of
           synovial inflammation in inflammatory arthritis.
           Platelet P-sel, but not endothelial P-sel, plays a role
           in the inflammatory response and neointimal formation
           after arterial injury.  Selectins may also function as
           signal-transducing receptors.
          Length = 115

 Score = 51.2 bits (123), Expect = 5e-08
 Identities = 28/119 (23%), Positives = 40/119 (33%), Gaps = 22/119 (18%)

Query: 103 SFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDPGITARTWK 162
           +F +A   C++ G DLV       N  L G   +          WI              
Sbjct: 11  TFNEAVKYCKSRGTDLVAIQNAEENALLNGFALKYNLG----YYWIDGND----INNEGT 62

Query: 163 WVD--GEVVTKPSWGKDQPNNYNGEQNCVV----LDGGRGWLWNDVGCKLDYLHW-ICQ 214
           WVD   + +   +W   +PNN    +NC+      +G     WND  C        IC 
Sbjct: 63  WVDTDKKELEYKNWAPGEPNN-GRNENCLEIYIKDNG----KWNDEPC--SKKKSAICY 114


>gnl|CDD|153068 cd03598, CLECT_EMBP_like, C-type lectin-like domain (CTLD) of the
           type found in the human proteins, eosinophil major basic
           protein (EMBP) and prepro major basic protein homolog
           (MBPH).  CLECT_EMBP_like: C-type lectin-like domain
           (CTLD) of the type found in the human proteins,
           eosinophil major basic protein (EMBP) and prepro major
           basic protein homolog (MBPH).  CTLD refers to a domain
           homologous to the carbohydrate-recognition domains
           (CRDs) of the C-type lectins.  Eosinophils and basophils
           carry out various functions in allergic, parasitic, and
           inflammatory diseases.  EMBP is stored in eosinophil
           crystalloid granules and is released upon degranulation.
            EMBP is also expressed in basophils.  The proform of
           EMBP is expressed in placental X cells and breast tissue
           and increases significantly during human pregnancy.
           EMBP has cytotoxic properties and damages bacteria and
           mammalian cells, in vitro, as well as, helminth
           parasites.  EMBP deposition has been observed in the
           inflamed tissue of allergy patients in a variety of
           diseases including asthma, atopic dermatitis, and
           rhinitis. In addition to its cytotoxic functions, EMBP
           activates cells and stimulates cytokine production.
           EMBP has been shown to bind the proteoglycan heparin.
           The binding site is similar to the carbohydrate binding
           site of other classical CTLD, such as mannose-binding
           protein (MBP1), however, heparin binding to EMBP is
           calcium ion independent.  MBPH has reduced potency in
           cytotoxic and cytostimulatory assays compared with EMBP.
          Length = 117

 Score = 51.3 bits (123), Expect = 5e-08
 Identities = 36/124 (29%), Positives = 46/124 (37%), Gaps = 10/124 (8%)

Query: 93  CYEFNIDKGASFADARSKCQ-ASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQ 151
           CY F       F DA+  C+    G+L      I +      ++R    L    VWIG  
Sbjct: 3   CYRFVKSPRT-FRDAQVICRRCYRGNLA----SIHSFAFNYRVQRLVSTLNQAQVWIGGI 57

Query: 152 KDPGITARTWKWVDGEVVTKPSWGKDQPNNYNGEQNCVVLDGGRGWLWNDVGCKLDYLHW 211
                  R + WVDG V     W   QP   N   +CV L    G  W    CKL    +
Sbjct: 58  ITGKGRCRRFSWVDGSVWNYAYWAPGQP--GNRRGHCVELCTRGGH-WRRAHCKL-RRPF 113

Query: 212 ICQH 215
           IC +
Sbjct: 114 ICSY 117


>gnl|CDD|153073 cd03603, CLECT_VCBS, A bacterial subgroup of the C-type lectin-like
           (CTLD) domain; a subgroup of bacterial protein domains
           homologous to the carbohydrate-recognition domains
           (CRDs) of the C-type lectins.  CLECT_VCBS: A bacterial
           subgroup of the C-type lectin-like (CTLD) domain; a
           subgroup of bacterial protein domains homologous to the
           carbohydrate-recognition domains (CRDs) of the C-type
           lectins.  Many CTLDs are calcium-dependent carbohydrate
           binding modules; other CTLDs bind protein ligands,
           lipids, and inorganic surfaces including CaCO3 and ice. 
           Bacterial CTLDs within this group are functionally
           uncharacterized.  Animal C-type lectins are involved in
           such functions as extracellular matrix organization,
           endocytosis, complement activation, pathogen
           recognition, and cell-cell interactions.  CTLDs may bind
           a variety of carbohydrate ligands including mannose,
           N-acetylglucosamine, galactose, N-acetylgalactosamine,
           and fucose.  CTLDs associate with each other through
           several different surfaces to form dimers, trimers, or
           tetramers from which ligand-binding sites project in
           different orientations.  In some CTLDs a loop extends to
           the adjoining domain to form a loop-swapped dimer.
          Length = 118

 Score = 48.6 bits (116), Expect = 4e-07
 Identities = 34/121 (28%), Positives = 49/121 (40%), Gaps = 28/121 (23%)

Query: 94  YEFNIDKGASFADARSKCQASGGDLVHGFRGIT----NTFLLGELERRKDRLKTLLVWIG 149
           Y+F +D G ++  A++  ++ GG LV     I     N +LL               WIG
Sbjct: 3   YKF-VDGGMTWEAAQTLAESLGGHLVT----INSAEENDWLLSNFGGYGA------SWIG 51

Query: 150 AQKDPGITARTWKWVDGEVVTKPSWGKDQP-------NNYNGEQNCVVLDGGRGWLWNDV 202
           A      T  TWKW DGE  T  +WG  +P        +Y    +   + G     WND+
Sbjct: 52  ASDAA--TEGTWKWSDGEESTYTNWGSGEPHNNGGGNEDYAAINHFPGISGK----WNDL 105

Query: 203 G 203
            
Sbjct: 106 A 106


>gnl|CDD|165176 PHA02831, PHA02831, EEV host range protein; Provisional.
          Length = 268

 Score = 50.4 bits (120), Expect = 1e-06
 Identities = 31/127 (24%), Positives = 54/127 (42%), Gaps = 14/127 (11%)

Query: 219 NCGSPDRHLNTTFVGTVNTK-----LGSTISYACP----EGNMLVGSATRTCKEGFWTGV 269
           NC  P   LN    G +  K      G +++YAC     E   +VG+ T  C    W   
Sbjct: 77  NCKDPVTILN----GYIKNKKDQYSFGDSVTYACKVNKLEKYSIVGNETVKCINKQWVPK 132

Query: 270 APTCQYVDCGKLEHIEHGTVTLETTRTTHGAVAIYACHENYTLIGETRRVCGDGGKWNGT 329
            P C+ + C K   +++G + +   +  +G +  + C + + L+G +   C     W   
Sbjct: 133 YPVCKLIRC-KYPALQNGFLNVFEKKFYYGDIVNFKCKKGFILLGSSVSTCDINSIWYPG 191

Query: 330 EPQCLFD 336
            P+C+ D
Sbjct: 192 IPKCVKD 198



 Score = 49.2 bits (117), Expect = 2e-06
 Identities = 33/138 (23%), Positives = 55/138 (39%), Gaps = 8/138 (5%)

Query: 260 TCKEGFWTGVAPTCQYVDCGKLEHIEHGTVTLETTRTTHGAVAIYACH----ENYTLIGE 315
           TC  G W+         +C     I +G +  +  + + G    YAC     E Y+++G 
Sbjct: 60  TCNNGSWSTKNMCIGKRNCKDPVTILNGYIKNKKDQYSFGDSVTYACKVNKLEKYSIVGN 119

Query: 316 TRRVCGDGGKWNGTEPQCLFDWCAEPPQISGGIVTTSGRR--TGSVATYSCEPGFILFGS 373
               C +  +W    P C    C + P +  G +    ++   G +  + C+ GFIL GS
Sbjct: 120 ETVKCINK-QWVPKYPVCKLIRC-KYPALQNGFLNVFEKKFYYGDIVNFKCKKGFILLGS 177

Query: 374 NELKCGLGGDWSGKAPTC 391
           +   C +   W    P C
Sbjct: 178 SVSTCDINSIWYPGIPKC 195



 Score = 38.4 bits (89), Expect = 0.006
 Identities = 34/147 (23%), Positives = 56/147 (38%), Gaps = 10/147 (6%)

Query: 362 YSCEPGFILFGSNELKCGLGGDWSGKAPTCKYVDCGSPANIDNGHYSLLNGTTTHGSVAE 421
           Y C   F       + C   G WS K       +C  P  I NG+        + G    
Sbjct: 48  YKCNNNF---DKVFVTCN-NGSWSTKNMCIGKRNCKDPVTILNGYIKNKKDQYSFGDSVT 103

Query: 422 YTCDDDYWLEAGAM--RRQVCMRDGKWSGDTPSCELITCPEPEVPKGGYVVGYDLNVH-- 477
           Y C  +  LE  ++     V   + +W    P C+LI C  P + + G++  ++   +  
Sbjct: 104 YACKVNK-LEKYSIVGNETVKCINKQWVPKYPVCKLIRCKYPAL-QNGFLNVFEKKFYYG 161

Query: 478 STLEFHCEPGHLLVGQASLKCTRQGDW 504
             + F C+ G +L+G +   C     W
Sbjct: 162 DIVNFKCKKGFILLGSSVSTCDINSIW 188


>gnl|CDD|165263 PHA02954, PHA02954, EEV membrane glycoprotein; Provisional.
          Length = 317

 Score = 47.8 bits (113), Expect = 8e-06
 Identities = 35/120 (29%), Positives = 51/120 (42%), Gaps = 6/120 (5%)

Query: 272 TCQYVDCGKLEHIEHGTVTLETTRTTHGAVAIYACHENYTLIGETRRVCGDGGKWNGTEP 331
           TC   +C  L+ +EHG+      + + G      C   Y +IG +   C     WN   P
Sbjct: 124 TCPNAECQPLQ-LEHGSCQPVKEKYSFGEHITINCDVGYEVIGASYISC-TANSWN-VIP 180

Query: 332 QCLFDWCAEPPQISGGIVTTSGRRTGSVATYSCEPGFILFGSNELKCGLGGDWSGKAPTC 391
            C      + P +S G+++ S    G V   SC+ GF L GS    C + G W+   P C
Sbjct: 181 SC--QQKCDIPSLSNGLISGSTFSIGGVIHLSCKSGFTLTGSPSSTC-IDGKWNPVLPIC 237



 Score = 38.1 bits (88), Expect = 0.010
 Identities = 28/96 (29%), Positives = 43/96 (44%), Gaps = 6/96 (6%)

Query: 239 LGSTISYACPEGNMLVGSATRTCKEGFWTGVAPTCQYVDCGKLEHIEHGTVTLETTRTTH 298
            G  I+  C  G  ++G++  +C    W  V P+CQ   C  +  + +G ++  T     
Sbjct: 149 FGEHITINCDVGYEVIGASYISCTANSWN-VIPSCQQ-KC-DIPSLSNGLISGSTFSI-- 203

Query: 299 GAVAIYACHENYTLIGETRRVCGDGGKWNGTEPQCL 334
           G V   +C   +TL G     C D GKWN   P C+
Sbjct: 204 GGVIHLSCKSGFTLTGSPSSTCID-GKWNPVLPICV 238



 Score = 33.5 bits (76), Expect = 0.25
 Identities = 19/61 (31%), Positives = 25/61 (40%), Gaps = 5/61 (8%)

Query: 213 CQHNPANCGSPDRHLNTTFVGTVNTKLGSTISYACPEGNMLVGSATRTCKEGFWTGVAPT 272
           CQ     C  P   L+   +      +G  I  +C  G  L GS + TC +G W  V P 
Sbjct: 182 CQQK---CDIPS--LSNGLISGSTFSIGGVIHLSCKSGFTLTGSPSSTCIDGKWNPVLPI 236

Query: 273 C 273
           C
Sbjct: 237 C 237


>gnl|CDD|153064 cd03594, CLECT_REG-1_like, C-type lectin-like domain (CTLD) of the
           type found in Human REG-1 (lithostathine), REG-4, and
           avian eggshell-specific proteins: ansocalcin,
           structhiocalcin-1(SCA-1), and -2(SCA-2).
           CLECT_REG-1_like: C-type lectin-like domain (CTLD) of
           the type found in Human REG-1 (lithostathine), REG-4,
           and avian eggshell-specific proteins: ansocalcin,
           structhiocalcin-1(SCA-1), and -2(SCA-2).  CTLD refers to
           a domain homologous to the carbohydrate-recognition
           domains (CRDs) of the C-type lectins.  REG-1 is a
           proliferating factor which participates in various kinds
           of tissue regeneration including pancreatic beta-cell
           regeneration, regeneration of intestinal mucosa,
           regeneration of motor neurons, and perhaps in tissue
           regeneration of damaged heart.  REG-1 may play a role on
           the pathophysiology of Alzheimer's disease and in the
           development of gastric cancers.  Its expression is
           correlated with reduced survival from early-stage
           colorectal cancer.  REG-1 also binds and aggregates
           several bacterial strains from the intestinal flora and
           it has been suggested that it is involved in the control
           of the intestinal bacterial ecosystem.  Rat
           lithostathine has calcium carbonate crystal inhibitor
           activity in vitro.  REG-IV is unregulated in pancreatic,
           gastric, hepatocellular, and prostrate adenocarcinomas. 
           REG-IV activates the EGF receptor/Akt/AP-1 signaling
           pathway in colorectal carcinoma.  Ansocalcin, SCA-1 and
           -2 are found at high concentration in the calcified egg
           shell layer of goose and ostrich, respectively and tend
           to form aggregates.  Ansocalcin nucleates calcite
           crystal aggregates in vitro.
          Length = 129

 Score = 45.1 bits (107), Expect = 9e-06
 Identities = 25/70 (35%), Positives = 35/70 (50%), Gaps = 6/70 (8%)

Query: 146 VWIGAQKDPGITARTWKWVDGEVVTKPSWGKDQPNNYNGEQNCVVLDGGRGWL-WNDVGC 204
           VWIG   DP   +R W+W DG  +   SW ++ P  Y     C  L    G+L WND  C
Sbjct: 64  VWIGLH-DPQ-QSRGWEWSDGSKLDYRSWDRNPP--YARGGYCAELSRSTGFLKWNDANC 119

Query: 205 KLDYLHWICQ 214
           + +   +IC+
Sbjct: 120 E-ERNPFICK 128


>gnl|CDD|153063 cd03593, CLECT_NK_receptors_like, C-type lectin-like domain (CTLD)
           of the type found in natural killer cell receptors
           (NKRs).  CLECT_NK_receptors_like: C-type lectin-like
           domain (CTLD) of the type found in natural killer cell
           receptors (NKRs), including proteins similar to oxidized
           low density lipoprotein (OxLDL) receptor (LOX-1), CD94,
           CD69, NKG2-A and -D, osteoclast inhibitory lectin
           (OCIL), dendritic cell-associated C-type lectin-1
           (dectin-1),  human myeloid inhibitory C-type lectin-like
           receptor (MICL), mast cell-associated functional antigen
           (MAFA), killer cell lectin-like receptors: subfamily F,
           member 1 (KLRF1) and subfamily B, member 1 (KLRB1), and
           lys49 receptors.  CTLD refers to a domain homologous to
           the carbohydrate-recognition domains (CRDs) of the
           C-type lectins.  NKRs are variously associated with
           activation or inhibition of natural killer (NK) cells.
           Activating NKRs stimulate cytolysis by NK cells of
           virally infected or transformed cells; inhibitory NKRs
           block cytolysis upon recognition of markers of healthy
           self cells. Most Lys49 receptors are inhibitory; some
           are stimulatory.  OCIL inhibits NK cell function via
           binding to the receptor NKRP1D.  Murine OCIL in addition
           to inhibiting NK cell function inhibits osteoclast
           differentiation. MAFA clusters with the type I Fc
           epsilon receptor (FcepsilonRI) and inhibits the mast
           cells secretory response to FcepsilonRI stimulus.  CD72
           is a negative regulator of B cell receptor signaling.
           NKG2D is an activating receptor for stress-induced
           antigens; human NKG2D ligands include the stress induced
           MHC-I homologs, MICA, MICB, and ULBP family of
           glycoproteins  Several NKRs have a carbohydrate-binding
           capacity which is not mediated through calcium ions
           (e.g. OCIL binds a range of high molecular weight
           sulfated glycosaminoglycans including dextran sulfate,
           fucoidan, and gamma-carrageenan sugars).  Dectin-1 binds
           fungal beta-glucans and in involved in the innate immune
           responses to fungal pathogens.  MAFA binds saccharides
           having terminal alpha-D mannose residues in a
           calcium-dependent manner.  LOX-1 is the major receptor
           for OxLDL in endothelial cells and thought to play a
           role in the pathology of atherosclerosis.  Some NKRs
           exist as homodimers (e.g.Lys49, NKG2D, CD69, LOX-1) and
           some as heterodimers (e.g. CD94/NKG2A).  Dectin-1 can
           function as a monomer in vitro.
          Length = 116

 Score = 41.2 bits (97), Expect = 2e-04
 Identities = 25/127 (19%), Positives = 48/127 (37%), Gaps = 18/127 (14%)

Query: 89  FDRNCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWI 148
           +   CY F+++K  ++ +++  C +    L+     I +     ELE  + ++ +   WI
Sbjct: 8   YGNKCYYFSMEKK-TWNESKEACSSKNSSLLK----IDDE---EELEFLQSQIGSSSYWI 59

Query: 149 GAQKDPGITARTWKWVDGEVVTKPSWGKDQPNNYNGEQNCVVLDGGRGWLWNDVGCKLDY 208
           G  ++   + + WKW+DG  +                 NC  L            C    
Sbjct: 60  GLSREK--SEKPWKWIDGSPLNN----LFNIRGSTKSGNCAYLSSTGI---YSEDCSTKK 110

Query: 209 LHWICQH 215
             WIC+ 
Sbjct: 111 -RWICEK 116


>gnl|CDD|153071 cd03601, CLECT_TC14_like, C-type lectin-like domain (CTLD) of the
           type found in lectins TC14, TC14-2, TC14-3, and TC14-4
           from the budding tunicate Polyandrocarpa misakiensis and
           PfG6 from the Acorn worm.  CLECT_TC14_like: C-type
           lectin-like domain (CTLD) of the type found in lectins
           TC14, TC14-2, TC14-3, and TC14-4 from the budding
           tunicate Polyandrocarpa misakiensis and PfG6 from the
           Acorn worm.  CTLD refers to a domain homologous to the
           carbohydrate-recognition domains (CRDs) of the C-type
           lectins.  TC14 is homodimeric.  The CTLD of TC14 binds
           D-galactose and D-fucose.  TC14 is expressed
           constitutively by multipotent epithelial and mesenchymal
           cells and plays in role during budding, in inducing the
           aggregation of undifferentiated mesenchymal cells to
           give rise to epithelial forming tissue.   TC14-2 and
           TC14-3 shows calcium-dependent galactose binding
           activity.  TC14-3 is a cytostatic factor which blocks
           cell growth and dedifferentiation of the atrial
           epithelium during asexual reproduction.  It may also act
           as a differentiation inducing factor.  Galactose
           inhibits the cytostatic activity of TC14-3.  The gene
           for Acorn worm PfG6 is gill-specific; PfG6 may be a
           secreted protein.
          Length = 119

 Score = 38.7 bits (90), Expect = 0.001
 Identities = 28/113 (24%), Positives = 45/113 (39%), Gaps = 17/113 (15%)

Query: 99  DKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLV-----WIGAQKD 153
           D+  ++A A + C++ G  L         +  + + E R   L   LV     W+GA  +
Sbjct: 7   DETMNYAKAGAFCRSRGMRLA--------SLAMRDSEMRDAILAFTLVKGHGYWVGAD-N 57

Query: 154 PGITARTWKWVDGEVVTKPS--WGKDQPNNYNGEQNCVVLDGGRGWLWNDVGC 204
                  + W DG  +   S  W  ++P+N    Q CV L      L +D  C
Sbjct: 58  LQDGEYDFLWNDGVSLPTDSDLWAPNEPSNPQSRQLCVQLWSKYN-LLDDEYC 109


>gnl|CDD|153065 cd03595, CLECT_chondrolectin_like, C-type lectin-like domain (CTLD)
           of the type found in the human type-1A transmembrane
           proteins chondrolectin (CHODL) and layilin.
           CLECT_chondrolectin_like: C-type lectin-like domain
           (CTLD) of the type found in the human type-1A
           transmembrane proteins chondrolectin (CHODL) and
           layilin.  CTLD refers to a domain homologous to the
           carbohydrate-recognition domains (CRDs) of the C-type
           lectins.  CHODL is predominantly expressed in muscle
           cells and is associated with T-cell maturation.  Various
           alternatively spliced isoforms have been of CHODL have
           been identified.  The transmembrane form of CHODL is
           localized in the ER-Golgi apparatus.  Layilin is widely
           expressed in different cell types.  The extracellular
           CTLD of layilin binds hyaluronan (HA), a major
           constituent of the extracellular matrix (ECM).  The
           cytoplasmic tail of layilin binds various members of the
           band 4.1/ERM superfamily (talin, radixin, and merlin).
           The ERM proteins are cytoskeleton-membrane linker
           molecules which link actin to receptors in the plasma
           membrane.  Layilin co-localizes in with talin in
           membrane ruffles and may mediate signals from the ECM to
           the cell cytoskeleton.
          Length = 149

 Score = 38.7 bits (90), Expect = 0.002
 Identities = 35/154 (22%), Positives = 69/154 (44%), Gaps = 39/154 (25%)

Query: 89  FDRNCYEF----NIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERR--KDRLK 142
            ++ CY+     +  +  +F +AR  C+  GG+L+          +  E E++  +  ++
Sbjct: 8   TEKPCYKIAYFQDSRRRLNFEEARQACREDGGELLS---------IESENEQKLIERFIQ 58

Query: 143 TLLV-----WIGAQKDPGITART------WKWVDGEVVTKPSWGKDQPNNYNGEQNCVVL 191
           TL       WIG ++       +      + W+DG + T  +W  D+P+   G + CVV+
Sbjct: 59  TLRASDGDFWIGLRRSSQYNVTSSACSSLYYWLDGSISTFRNWYVDEPS--CGSEVCVVM 116

Query: 192 ------DGGRG----WLWNDVGCKLDYLHWICQH 215
                   G+G    + WND  C +   ++IC++
Sbjct: 117 YHQPSAPAGQGGPYLFQWNDDNCNMKN-NFICKY 149


>gnl|CDD|188093 TIGR00864, PCC, polycystin cation channel protein.  The Polycystin
           Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a
           huge protein of 4303aas. Its repeated leucine-rich (LRR)
           segment is found in many proteins. It contains 16
           polycystic kidney disease (PKD) domains, one
           LDL-receptor class A domain, one C-type lectin family
           domain, and 16-18 putative TMSs in positions between
           residues 2200 and 4100. Polycystin-L has been shown to
           be a cation (Na+, K+ and Ca2+) channel that is activated
           by Ca2+. Two members of the PCC family (polycystin 1 and
           2) are mutated in autosomal dominant polycystic kidney
           disease, and polycystin-L is deleted in mice with renal
           and retinal defects. Note: this model is restricted to
           the amino half for technical reasons.
          Length = 2740

 Score = 40.5 bits (94), Expect = 0.003
 Identities = 55/243 (22%), Positives = 82/243 (33%), Gaps = 30/243 (12%)

Query: 93  CYEFNIDKGASFADARSKCQA-SGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQ 151
           C++   ++ A++ DA+ +C A +G  L          FL  ++    DR     VWIG  
Sbjct: 331 CFQIVPEE-AAWLDAQEQCLARAGAALAIVDNDALQNFLARKVTHSLDR----GVWIGFS 385

Query: 152 KDPGITARTWKWVDG-EVVTKPSWGKDQPNNYNGEQNCVVLDGGRGWLWNDVGCKLDYLH 210
              G         +  E          +P+    E  CV LD  RG   +D+ C   +  
Sbjct: 386 DVNGAEKGPAHQGEAFEAEECEEGLAGEPHPARAEH-CVRLD-PRGQCNSDL-CNAPH-A 441

Query: 211 WICQHNPANCGSPDRHLNTTFVGTVNTKLGSTISYACPEGNMLVGSATRTCKEGFWTGVA 270
           ++C+ NP   G P        +G  +  L   +        M    A     EG    + 
Sbjct: 442 YVCELNP---GGPVPDAENFAMGAASFDLHGLLQ---ALAAMDGLPAPP--HEGVEVLLF 493

Query: 271 PTCQYVDCGKLEHIEHGTVTLETTRTTHGAVAIYACHENYTLIGETRRVCGDGGKWNGTE 330
           P  ++     L   E G  T E  R  H    IY                G GG   G E
Sbjct: 494 PALRFSRAAFLSSAEFG--TQELRRPAHILFQIYRLRCRLP---------GAGGPACGPE 542

Query: 331 PQC 333
            +C
Sbjct: 543 AEC 545


>gnl|CDD|153058 cd03588, CLECT_CSPGs, C-type lectin-like domain (CTLD) of the type
           found in chondroitin sulfate proteoglycan core proteins.
            CLECT_CSPGs: C-type lectin-like domain (CTLD) of the
           type found in chondroitin sulfate proteoglycan core
           proteins (CSPGs) in human and chicken aggrecan, frog
           brevican, and zebra fish dermacan.  CTLD refers to a
           domain homologous to the carbohydrate-recognition
           domains (CRDs) of the C-type lectins.  In cartilage,
           aggrecan forms cartilage link protein stabilized
           aggregates with hyaluronan (HA).  These aggregates
           contribute to the tissue's load bearing properties.
           Aggregates having other CSPGs substituting for aggrecan
           may contribute to the structural integrity of many
           different tissues.  Xenopus brevican is expressed in the
           notochord and the brain during early embryogenesis.
           Zebra fish dermacan is expressed in dermal bones and may
           play a role in dermal bone development.  CSPGs do
           contain LINK domain(s) which bind HA.  These LINK
           domains are considered by one classification system to
           be a variety of CTLD, but are omitted from this
           hierarchical classification based on insignificant
           sequence similarity.
          Length = 124

 Score = 37.9 bits (88), Expect = 0.003
 Identities = 32/124 (25%), Positives = 53/124 (42%), Gaps = 22/124 (17%)

Query: 87  TSFDRNCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLV 146
             F  +CY    D+  ++ DA  +C+   G L        ++ +  E +   +       
Sbjct: 6   DKFQGHCYRHFPDR-ETWEDAERRCREQQGHL--------SSIVTPEEQEFVNNNAQDYQ 56

Query: 147 WIGAQKDPGITART----WKWVDGEVVTKPSWGKDQPNNY--NGEQNCVVLDGGRGWLWN 200
           WIG      +  RT    ++W DG  +   +W  +QP+N+   GE +CVV+       WN
Sbjct: 57  WIG------LNDRTIEGDFRWSDGHPLQFENWRPNQPDNFFATGE-DCVVMIWHEEGEWN 109

Query: 201 DVGC 204
           DV C
Sbjct: 110 DVPC 113


>gnl|CDD|223646 COG0573, PstC, ABC-type phosphate transport system, permease
           component [Inorganic ion transport and metabolism].
          Length = 310

 Score = 38.8 bits (91), Expect = 0.007
 Identities = 14/55 (25%), Positives = 24/55 (43%), Gaps = 6/55 (10%)

Query: 508 SPQFISFYFPPSETISAVSEAQT----LGYMISV--GAGVVLFLLILLAIIYARL 556
           + +     F P  T++++   +      G   S     G+VLF++ LL  I AR 
Sbjct: 247 TARIPLSLFSPGTTLTSIIANEFGEASGGLHTSALFALGLVLFVITLLLNILARY 301


>gnl|CDD|224194 COG1275, TehA, Tellurite resistance protein and related permeases
           [Inorganic ion transport and metabolism].
          Length = 329

 Score = 36.9 bits (86), Expect = 0.027
 Identities = 19/76 (25%), Positives = 34/76 (44%), Gaps = 10/76 (13%)

Query: 494 ASLKCTRQGDWDNMSPQFISFYFPP------SETISAVSEAQTLGYMISVGAGVVLFLLI 547
            SL   ++ + + ++P   S++ PP      S T S +  A  +   +  GAG   + L+
Sbjct: 126 YSLFRGKKFELETITP---SWFLPPVGIIVASLTASVLLGAYFVIGYVFFGAGFFSY-LV 181

Query: 548 LLAIIYARLRKATPVK 563
           L  I+  RL  A  + 
Sbjct: 182 LFPIVLYRLFTAGLLP 197


>gnl|CDD|183812 PRK12884, ubiA, prenyltransferase; Reviewed.
          Length = 279

 Score = 33.8 bits (78), Expect = 0.25
 Identities = 10/33 (30%), Positives = 17/33 (51%)

Query: 530 TLGYMISVGAGVVLFLLILLAIIYARLRKATPV 562
              Y+IS  A +V+ L+ +L I+Y    K   +
Sbjct: 97  IAAYLISPLAFLVVILVSVLGILYNWKLKEYGL 129


>gnl|CDD|153072 cd03602, CLECT_1, C-type lectin (CTL)/C-type lectin-like (CTLD)
           domain subgroup 1; a subgroup of protein domains
           homologous to the carbohydrate-recognition domains
           (CRDs) of the C-type lectins.  CLECT_1: C-type lectin
           (CTL)/C-type lectin-like (CTLD) domain subgroup 1; a
           subgroup of protein domains homologous to the
           carbohydrate-recognition domains (CRDs) of the C-type
           lectins.  Many CTLDs are calcium-dependent carbohydrate
           binding modules; other CTLDs bind protein ligands,
           lipids, and inorganic surfaces including CaCO3 and ice. 
           Animal C-type lectins are involved in such functions as
           extracellular matrix organization, endocytosis,
           complement activation, pathogen recognition, and
           cell-cell interactions.  CTLDs may bind a variety of
           carbohydrate ligands including mannose,
           N-acetylglucosamine, galactose, N-acetylgalactosamine,
           and fucose. CTLDs associate with each other through
           several different surfaces to form dimers, trimers, or
           tetramers from which ligand-binding sites project in
           different orientations.  In some CTLDs a loop extends to
           the adjoining domain to form a loop-swapped dimer.
          Length = 108

 Score = 31.6 bits (72), Expect = 0.31
 Identities = 19/69 (27%), Positives = 28/69 (40%), Gaps = 12/69 (17%)

Query: 146 VWIGAQKDPGITARTWKWVDGEVVTKPSWGKDQPNNYNGEQNCVVLD-GGRGWLWNDVGC 204
            WIG  +D      +W+W DG   +  +W   QP    G+ +C  +   GR   W    C
Sbjct: 49  AWIGLYRDVD----SWRWSDGSESSFRNWNTFQP---FGQGDCATMYSSGR---WYAALC 98

Query: 205 KLDYLHWIC 213
                 +IC
Sbjct: 99  -SALKPFIC 106


>gnl|CDD|153070 cd03600, CLECT_thrombomodulin_like, C-type lectin-like domain
           (CTLD) of the type found in human thrombomodulin(TM),
           Endosialin, C14orf27, and C1qR.
           CLECT_thrombomodulin_like: C-type lectin-like domain
           (CTLD) of the type found in human thrombomodulin(TM),
           Endosialin, C14orf27, and C1qR.  CTLD refers to a domain
           homologous to the carbohydrate-recognition domains
           (CRDs) of the C-type lectins.  In these
           thrombomodulin-like proteins the residues involved in
           coordinating Ca2+ in the classical MBP-A CTLD are not
           conserved.  TM exerts anti-fibrinolytic and
           anti-inflammatory activity.  TM also regulates blood
           coagulation in the anticoagulant protein C pathway.  In
           this pathway, the procoagulant properties of thrombin
           (T) are lost when it binds TM.  TM also plays a key role
           in tumor biology.  It is expressed on endothelial cells
           and on several type of tumor cell including squamous
           cell carcinoma.  Loss of TM expression correlates with
           advanced stage and poor prognosis.  Loss of function of
           TM function may be associated with arterial or venous
           thrombosis and with late fetal loss.  Soluble molecules
           of TM retaining the CTLD are detected in human plasma
           and urine where higher levels indicate injury and/or
           enhanced turnover of the endothelium.  C1qR is expressed
           on endothelial cells and stem cells.  It is also
           expressed on monocots and neutrophils, where it is
           subject to ectodomain shedding.  Soluble forms of C1qR
           retaining the CTLD is detected in human plasma.  C1qR
           modulates the phagocytosis of apoptotic cells in vivo.
           C1qR-deficient mice are defective in clearance of
           apoptotic cells in vivo.  The cytoplasmic tail of C1qR,
           C-terminal to the CTLD of CD93, contains a PDZ binding
           domain which interacts with the PDZ domain-containing
           adaptor protein, GIPC.  The juxtamembrane region of this
           tail interacts with the ezrin/radixin/moesin family.
           Endosialin functions in the growth and progression of
           abdominal tumors and is expressed in the stroma of
           several tumors.
          Length = 141

 Score = 32.0 bits (73), Expect = 0.35
 Identities = 32/128 (25%), Positives = 47/128 (36%), Gaps = 14/128 (10%)

Query: 102 ASFADARSKCQASGGDLV---HGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDPG--- 155
            +F +A+  C   GG+L     G      + LL     R  R    L WIG Q++P    
Sbjct: 14  LTFLEAQRSCIELGGNLATVRSGEEADVVSLLLAAGPGRHGRGSLRL-WIGLQREPRQCS 72

Query: 156 ---ITARTWKWVDGEVVTKPSWGKDQPNNYNGEQNCVVL--DGGRGWL--WNDVGCKLDY 208
              +  R + WV G+  T  S    +P        CV L   G       W D  C    
Sbjct: 73  DPSLPLRGFSWVTGDQDTDFSNWLQEPAGTCTSPRCVALSAAGSTPDNLKWKDGPCSARA 132

Query: 209 LHWICQHN 216
             ++C+ +
Sbjct: 133 DGYLCKFS 140


>gnl|CDD|233744 TIGR02138, phosphate_pstC, phosphate ABC transporter, permease
           protein PstC.  The typical operon for the high affinity
           inorganic phosphate ABC transporter encodes an
           ATP-binding protein, a phosphate-binding protein, and
           two permease proteins. This family consists of one of
           the two permease proteins, PstC, which is homologous to
           PstA (TIGR00974). In the model bacterium Escherichia
           coli, this transport system is induced when the
           concentration of extrallular inorganic phosphate is low.
           A constitutive, lower affinity transporter operates
           otherwise [Transport and binding proteins, Anions].
          Length = 295

 Score = 32.6 bits (75), Expect = 0.49
 Identities = 18/58 (31%), Positives = 25/58 (43%), Gaps = 7/58 (12%)

Query: 506 NMSPQFISFYFPPSETISAV-----SEAQTLGYMISV--GAGVVLFLLILLAIIYARL 556
           N+ P      F P  TI+++      EA       S     G+VLF++ LL  I AR 
Sbjct: 232 NVFPISPLSLFDPGTTITSLIANQFGEASGGSVHTSALFALGLVLFVITLLVNILARY 289


>gnl|CDD|201276 pfam00514, Arm, Armadillo/beta-catenin-like repeat.  Approx. 40
          amino acid repeat. Tandem repeats form super-helix of
          helices that is proposed to mediate interaction of
          beta-catenin with its ligands. CAUTION: This family
          does not contain all known armadillo repeats.
          Length = 41

 Score = 28.6 bits (65), Expect = 0.71
 Identities = 10/41 (24%), Positives = 19/41 (46%), Gaps = 1/41 (2%)

Query: 1  KDNKEYILSVGGIKPVTSLLLNSNHEQTVLNSITTLMYLVT 41
           +NK+ ++  G + P+   LL+S  E+    +   L  L  
Sbjct: 2  PENKQAVIEAGAVPPLVQ-LLSSPDEEVQEEAAWALSNLAA 41


>gnl|CDD|213052 cd12087, TM_EGFR-like, Transmembrane domain of the Epidermal Growth
           Factor Receptor family of Protein Tyrosine Kinases.
           PTKs catalyze the transfer of the gamma-phosphoryl group
           from ATP to tyrosine (tyr) residues in protein
           substrates. EGFR (HER, ErbB) subfamily members include
           EGFR (HER1, ErbB1), HER2 (ErbB2), HER3 (ErbB3), HER4
           (ErbB4), and similar proteins. They are receptor PTKs
           (RTKs) containing an extracellular EGF-related
           ligand-binding region, a transmembrane (TM) helix, and a
           cytoplasmic region with a tyr kinase domain and a
           regulatory C-terminal tail. They are activated by
           ligand-induced dimerization, resulting in the
           phosphorylation of tyr residues in the C-terminal tail,
           which serve as binding sites for downstream signaling
           molecules. Collectively, they can recognize a variety of
           ligands including EGF, TGFalpha, and neuregulins, among
           others. All four subfamily members can form homo- or
           heterodimers. HER3 contains an impaired kinase domain
           and depends on its heterodimerization partner for
           activation. EGFR subfamily members are involved in
           signaling pathways leading to a broad range of cellular
           responses including cell proliferation, differentiation,
           migration, growth inhibition, and apoptosis. The TM
           domain not only serves as a membrane anchor, but also
           plays an important role in receptor dimerization and
           optimal activation. Mutations in the TM domain of EGFR
           family RTKs have been associated with increased breast
           cancer risk.
          Length = 38

 Score = 28.7 bits (65), Expect = 0.81
 Identities = 9/28 (32%), Positives = 14/28 (50%), Gaps = 1/28 (3%)

Query: 531 LGYMISVGA-GVVLFLLILLAIIYARLR 557
               I+ G  G +L L+IL  I++   R
Sbjct: 4   KTTSIAAGVVGGLLVLVILGLIVFLFRR 31


>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
          Length = 1113

 Score = 31.0 bits (71), Expect = 2.1
 Identities = 16/53 (30%), Positives = 25/53 (47%), Gaps = 2/53 (3%)

Query: 523 SAVSEAQTLGYMISV-GAGVVLFLLILLAIIYARL-RKATPVKNTENIEGAVR 573
           S V+E   LG    V G  V++  L L+A +   L R++   K +  +   VR
Sbjct: 637 SVVAELSPLGLADDVIGQAVIIIALALIAFLVWPLCRESWRDKESHTLRLVVR 689


>gnl|CDD|235993 PRK07331, PRK07331, cobalt transport protein CbiM; Provisional.
          Length = 322

 Score = 30.8 bits (70), Expect = 2.3
 Identities = 11/30 (36%), Positives = 18/30 (60%)

Query: 529 QTLGYMISVGAGVVLFLLILLAIIYARLRK 558
           + LGY++S   GV++FL+I   I   +  K
Sbjct: 292 EVLGYILSAVVGVLIFLIIFKIISSLKKNK 321


>gnl|CDD|235668 PRK06007, fliF, flagellar MS-ring protein; Reviewed.
          Length = 542

 Score = 30.6 bits (70), Expect = 2.9
 Identities = 15/86 (17%), Positives = 32/86 (37%), Gaps = 2/86 (2%)

Query: 512 ISFYFPPSETISAVSEAQTLGYM--ISVGAGVVLFLLILLAIIYARLRKATPVKNTENIE 569
           + F     E  + +   Q   +M  I + AG +L L+++  ++  RLR   P +     E
Sbjct: 416 VPFDEEAEEEAAELPFWQQPWFMDLIKLAAGALLILILIFFVLRPRLRPLLPEEELAEEE 475

Query: 570 GAVRKEEQTAAVMSYASLSDGGGDNI 595
            A  +        +   +     + +
Sbjct: 476 AAAEEAALEEDEEAALEVELSDDEEL 501


>gnl|CDD|234485 TIGR04154, archaeo_STT3, oligosaccharyl transferase, archaeosortase
           A system-associated.  Members of this protein family
           occur, one to three members per genome, in the same
           species of Euryarchaeota as contain the predicted
           protein-sorting enzyme archaeosortase (TIGR04125) and
           its cognate protein-sorting signal PGF-CTERM
           (TIGR04126).
          Length = 815

 Score = 30.4 bits (69), Expect = 3.3
 Identities = 17/68 (25%), Positives = 28/68 (41%), Gaps = 11/68 (16%)

Query: 504 WDNMSPQFISFYFPPSETISAVSEAQTLGYM----------ISVGAGVVLFLLILLAIIY 553
           +  ++   + F+F    T   ++EAQ L              + G G +L L  L  ++Y
Sbjct: 325 FSLIN-NGLRFFFGGGGTALTIAEAQPLFSTGGEFSLAPAWSNFGLGFLLALAGLALLLY 383

Query: 554 ARLRKATP 561
             LRK  P
Sbjct: 384 RPLRKYRP 391


>gnl|CDD|223744 COG0672, FTR1, High-affinity Fe2+/Pb2+ permease [Inorganic ion
           transport and metabolism].
          Length = 383

 Score = 30.1 bits (68), Expect = 3.6
 Identities = 15/86 (17%), Positives = 25/86 (29%), Gaps = 14/86 (16%)

Query: 529 QTLGYMISVGAGVVLFLLILLAIIYARLRKATPVKNTENIEGAVR------------KEE 576
                 + VG      +L+L+ ++        P+K    I  A               E 
Sbjct: 277 AATPSALEVGVYFGYLILVLILLLIRYRSVKLPLKPFFQITSAFLFYMCFVFAGKGVLEL 336

Query: 577 QTAAVMSYASLSDGGGD--NIYENVA 600
           Q A V+S +           IY +  
Sbjct: 337 QEAGVISPSLEPGPSISWLGIYPSYE 362


>gnl|CDD|237095 PRK12417, secY, preprotein translocase subunit SecY; Reviewed.
          Length = 404

 Score = 30.0 bits (68), Expect = 3.7
 Identities = 17/77 (22%), Positives = 33/77 (42%), Gaps = 13/77 (16%)

Query: 510 QFISFYFPPSETISAVSEAQTLGYMISVGAGVVLFLLIL--LAIIYARL-----RKATPV 562
             I   FP +  ISA  +  + G  +    GV ++L++   L+  ++ +     + A  +
Sbjct: 258 ILILSIFPNNSGISADMDMLSFGSPL----GVTVYLILQFLLSYFFSFVNINTKQIAKDM 313

Query: 563 -KNTENIEGAVRKEEQT 578
            K+   I G V   + T
Sbjct: 314 LKSGNYIPG-VYPGKDT 329


>gnl|CDD|163525 TIGR03813, put_Glu_GABA_T, putative glutamate/gamma-aminobutyrate
           antiporter.  Members of this protein family are putative
           putative glutamate/gamma-aminobutyrate antiporters. Each
           member of the seed alignment is found adjacent to a
           glutamate decarboxylase, which converts glutamate (Glu)
           to gamma-aminobutyrate (GABA). However, the majority
           belong to genome contexts with a glutaminase (converts
           Gln to Glu) as well as the decarboxylase that converts
           Glu to GABA. The specificity of the transporter remains
           uncertain.
          Length = 474

 Score = 30.2 bits (68), Expect = 4.1
 Identities = 17/59 (28%), Positives = 26/59 (44%), Gaps = 6/59 (10%)

Query: 511 FISFYFPPSETISAVSEAQTLGYMISVGAGVVLFLLILLAIIYARLRKATPVKNTENIE 569
           F+  + PPS+ I   S  + +G +I    G+    +    +IYAR RK        N E
Sbjct: 417 FVLSFIPPSQ-IGTGSPTRYVGLLI----GLAALFVTAPFLIYAR-RKPHWRDPNSNFE 469


>gnl|CDD|216184 pfam00907, T-box, T-box.  The T-box encodes a 180 amino acid domain
           that binds to DNA. Genes encoding T-box proteins are
           found in a wide range of animals, but not in other
           kingdoms such as plants. Family members are all thought
           to bind to the DNA consensus sequence TCACACCT. they are
           found exclusively in the nucleus, and perform
           DNA-binding and transcriptional activation/repression
           roles. They are generally required for development of
           the specific tissues they are expressed in, and
           mutations in T-box genes are implicated in human
           conditions such as DiGeorge syndrome and X-linked cleft
           palate, which feature malformations.
          Length = 182

 Score = 29.1 bits (66), Expect = 4.2
 Identities = 22/70 (31%), Positives = 29/70 (41%), Gaps = 18/70 (25%)

Query: 348 IVTTSGRR---------TG--SVATYSCEPGFILFGSNELKCGLGGDW--SGKA----PT 390
           I+T SGRR         +G    A YS    F+    +  K    G+W  +GKA    P 
Sbjct: 21  IITKSGRRMFPTLKVSVSGLDPNALYSVLLDFVPVDDHRYKFH-NGEWVVAGKAEPHPPP 79

Query: 391 CKYVDCGSPA 400
             Y+   SPA
Sbjct: 80  RVYIHPDSPA 89


>gnl|CDD|221775 pfam12794, MscS_TM, Mechanosensitive ion channel inner membrane
           domain 1.  The small mechanosensitive channel, MscS, is
           a part of the turgor-driven solute efflux system that
           protects bacteria from lysis in the event of osmotic
           shock. The MscS protein alone is sufficient to form a
           functional mechanosensitive channel gated directly by
           tension in the lipid bilayer. The MscS proteins are
           heptamers of three transmembrane subunits with seven
           converging M3 domains, and this domain is one of the
           inner membrane domains.
          Length = 339

 Score = 29.9 bits (68), Expect = 4.4
 Identities = 14/62 (22%), Positives = 21/62 (33%), Gaps = 11/62 (17%)

Query: 506 NMSPQFISFY----------FPPSETISAVSEAQTLGYMISV-GAGVVLFLLILLAIIYA 554
               + ++              P   IS  +E    G    V G    + LLILL++   
Sbjct: 116 GWPKERVARLRRRLRRLIWVLVPLLFISIFAENLPDGLARDVLGRLAFILLLILLSLFLW 175

Query: 555 RL 556
           RL
Sbjct: 176 RL 177


>gnl|CDD|153066 cd03596, CLECT_tetranectin_like, C-type lectin-like domain (CTLD)
           of the type found in the tetranectin (TN), cartilage
           derived C-type lectin (CLECSF1), and stem cell growth
           factor (SCGF).  CLECT_tetranectin_like: C-type
           lectin-like domain (CTLD) of the type found in the
           tetranectin (TN), cartilage derived C-type lectin
           (CLECSF1), and stem cell growth factor (SCGF).  CTLD
           refers to a domain homologous to the
           carbohydrate-recognition domains (CRDs) of the C-type
           lectins.  TN binds to plasminogen and stimulates
           activation of plasminogen, playing a key role in the
           regulation of proteolytic processes.  The TN CTLD binds
           two calcium ions.  Its calcium free form binds to
           various kringle-like protein ligands.  Two residues
           involved in the coordination of calcium are critical for
           the binding of TN to the fourth kringle (K4) domain of
           plasminogen (Plg K4).  TN binds the kringle 1-4 form of
           angiostatin (AST K1-4).  AST K1-4 is a fragment of Plg,
           commonly found in cancer tissues.  TN inhibits the
           binding of Plg and AST K1-4 to the extracellular matrix
           (EMC) of endothelial cells and counteracts the
           antiproliferative effects of AST K1-4 on these cells.
           TN also binds the tenth kringle domain of apolipoprotein
           (a).  In addition, TN binds fibrin and complex
           polysaccharides in a Ca2+ dependent manner.  The binding
           site for complex sulfated polysaccharides is N-terminal
           to the CTLD.  TN is homotrimeric; N-terminal to the CTLD
           is an alpha helical domain responsible for trimerization
           of monomeric units.  TN may modulate angiogenesis
           through interactions with angiostatin and coagulation
           through interaction with fibrin.  TN may play a role in
           myogenesis and in bone development.  Mice having a
           deletion in the TN gene exhibit a kyphotic spine
           abnormality.  TN is a useful prognostic marker of
           certain cancer types.  CLECSF1 is expressed in cartilage
           tissue, which is primarily intracellular matrix (ECM),
           and is a candidate for organizing ECM.  SCGF is strongly
           expressed in bone marrow and is a cytokine for primitive
           hematopoietic progenitor cells.
          Length = 129

 Score = 28.5 bits (64), Expect = 4.5
 Identities = 25/119 (21%), Positives = 45/119 (37%), Gaps = 17/119 (14%)

Query: 103 SFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLL-----VWIGAQKDPGIT 157
            + +A   C A GG L              E +  +D +K  +     VW+G      + 
Sbjct: 20  HYHEASEDCIARGGTLATPRDS-------DENDALRDYVKASVPGNWEVWLGIND--MVA 70

Query: 158 ARTWKWVDGEVVTKPSWGKDQPNNYNG--EQNCVVLDGGRGWLWNDVGCKLDYLHWICQ 214
              W  V+G  ++  +W ++     +G   +NCV L       W D  C+     ++C+
Sbjct: 71  EGKWVDVNGSPISYFNWEREITAQPDGGKRENCVALSSSAQGKWFDEDCR-REKPYVCE 128


>gnl|CDD|223279 COG0201, SecY, Preprotein translocase subunit SecY [Intracellular
           trafficking and secretion].
          Length = 436

 Score = 29.5 bits (67), Expect = 5.8
 Identities = 15/85 (17%), Positives = 34/85 (40%), Gaps = 9/85 (10%)

Query: 509 PQFISFYFPPSETISAVSEAQTLGYMISVGAGVVLFLLILLAIIYARLRKAT---PVKNT 565
           P  I+ +         +S   ++   +S G+ V + L ++L I ++         P +  
Sbjct: 282 PSTIAQFLGNGGLAYYLSPLTSISDALSPGSPVYIALYVVLIIFFSYFYTEIQFNPEEIA 341

Query: 566 ENIE--GA----VRKEEQTAAVMSY 584
           EN++  G     +R  + T   ++ 
Sbjct: 342 ENLKKSGGFIPGIRPGKDTEKYLNR 366


>gnl|CDD|233140 TIGR00816, tdt, C4-dicarboxylate transporter/malic acid transport
           protein.  The Tellurite-Resistance/Dicarboxylate
           Transporter (TDT) Family (TC 2.A.16)Two members of the
           TDT family have been functionally characterized. One is
           the TehA protein of E. coli which has been implicated in
           resistance to tellurite; the other is the Mae1 protein
           of S. pombe which functions in the uptake of malate and
           other dicarboxylates by a proton symportmechanism. These
           proteins exhibit 10 putative transmembrane
           a-helicalspanners (TMSs) [Transport and binding
           proteins, Carbohydrates, organic alcohols, and acids].
          Length = 320

 Score = 29.3 bits (66), Expect = 6.9
 Identities = 18/61 (29%), Positives = 25/61 (40%), Gaps = 11/61 (18%)

Query: 513 SFYFPPSETI------SAVSEAQTLGYMISV----GAGVVLFLLILLAIIYARLRKATPV 562
            +Y PP   I       A+    TLG  +      GAG  L+ L L A++  RLR    +
Sbjct: 133 GWYLPPVALIFISAAALAILPYLTLGLTVINYLFWGAGFFLY-LSLEAVLLLRLRLHGLL 191

Query: 563 K 563
            
Sbjct: 192 P 192


>gnl|CDD|215646 pfam00001, 7tm_1, 7 transmembrane receptor (rhodopsin family).
           This family contains, amongst other G-protein-coupled
           receptors (GCPRs), members of the opsin family, which
           have been considered to be typical members of the
           rhodopsin superfamily. They share several motifs, mainly
           the seven transmembrane helices, GCPRs of the rhodopsin
           superfamily. All opsins bind a chromophore, such as
           11-cis-retinal. The function of most opsins other than
           the photoisomerases is split into two steps: light
           absorption and G-protein activation. Photoisomerases, on
           the other hand, are not coupled to G-proteins - they are
           thought to generate and supply the chromophore that is
           used by visual opsins.
          Length = 251

 Score = 28.4 bits (64), Expect = 8.9
 Identities = 16/81 (19%), Positives = 28/81 (34%), Gaps = 9/81 (11%)

Query: 511 FISFYFPPSE-----TISAVSEAQTLGY--MISVGAGVVLFLLILLA--IIYARLRKATP 561
           F               I    E+    Y  + ++   V+  L+IL+   +I   LRK   
Sbjct: 116 FSWLRTVEEGNVTTCLIDFPEESTKRSYTLLSTLLGFVLPLLVILVCYTLILRTLRKRAR 175

Query: 562 VKNTENIEGAVRKEEQTAAVM 582
              ++        +E+ AA M
Sbjct: 176 SGASQARAKRSSSKERKAAKM 196


>gnl|CDD|185731 cd08990, GH43_AXH_like, Glycosyl hydrolase family 43, includes
           arabinoxylan arabinofuranohydrolase, beta-xylosidase,
           endo-1,4-beta-xylanase, alpha-L-arabinofuranosidase.
           This glycosyl hydrolase family 43 (GH43) includes
           enzymes that have been characterized with
           beta-xylosidase (EC 3.2.1.37),
           alpha-L-arabinofuranosidase (EC 3.2.1.55),
           endo-alpha-L-arabinanase as well as arabinoxylan
           arabinofuranohydrolase (AXH) activities. These are all
           inverting enzymes (i.e. they invert the stereochemistry
           of the anomeric carbon atom of the substrate) that have
           an aspartate as the catalytic general base, a glutamate
           as the catalytic general acid and another aspartate that
           is responsible for pKa modulation and orienting the
           catalytic acid. Many of the enzymes in this family
           display both alpha-L-arabinofuranosidase and
           beta-D-xylosidase activity using aryl-glycosides as
           substrates. AXHs specifically hydrolyze the glycosidic
           bond between arabinofuranosyl substituents and
           xylopyranosyl backbone residues of arabinoxylan. Several
           of these enzymes also contain carbohydrate binding
           modules (CBMs) that bind cellulose or xylan. A common
           structural feature of GH43 enzymes is a 5-bladed
           beta-propeller domain that contains the catalytic acid
           and catalytic base. A long V-shaped groove, partially
           enclosed at one end, forms a single extended
           substrate-binding surface across the face of the
           propeller.
          Length = 274

 Score = 28.7 bits (65), Expect = 9.8
 Identities = 13/39 (33%), Positives = 18/39 (46%), Gaps = 10/39 (25%)

Query: 411 NGTTTHGSVAEYTCDDDYWL--------EAGAMRRQVCM 441
           NG T HGS+ E+     ++L          G  RR VC+
Sbjct: 225 NGGTNHGSIVEF--KGQWYLFYHRATLSGGGDFRRSVCI 261


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.317    0.135    0.431 

Gapped
Lambda     K      H
   0.267   0.0746    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 30,659,270
Number of extensions: 2899601
Number of successful extensions: 2533
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2470
Number of HSP's successfully gapped: 78
Length of query: 618
Length of database: 10,937,602
Length adjustment: 103
Effective length of query: 515
Effective length of database: 6,369,140
Effective search space: 3280107100
Effective search space used: 3280107100
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 62 (27.6 bits)