Query psy4323
Match_columns 191
No_of_seqs 182 out of 1205
Neff 7.0
Searched_HMMs 46136
Date Fri Aug 16 22:41:32 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy4323.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/4323hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 cd03591 CLECT_collectin_like C 99.9 1.4E-23 3.1E-28 155.5 11.3 96 54-175 8-114 (114)
2 cd03598 CLECT_EMBP_like C-type 99.9 1.9E-23 4.2E-28 155.6 11.8 104 40-175 1-117 (117)
3 cd03603 CLECT_VCBS A bacterial 99.9 5.8E-23 1.3E-27 153.8 11.7 100 41-174 1-117 (118)
4 cd03592 CLECT_selectins_like C 99.9 4.8E-23 1E-27 152.7 11.1 101 43-175 3-115 (115)
5 cd03596 CLECT_tetranectin_like 99.9 6.5E-23 1.4E-27 155.1 11.5 107 40-175 9-129 (129)
6 cd03601 CLECT_TC14_like C-type 99.9 9.8E-23 2.1E-27 152.6 11.9 102 42-175 2-119 (119)
7 cd03588 CLECT_CSPGs C-type lec 99.9 1.2E-22 2.6E-27 152.8 11.3 101 40-176 10-124 (124)
8 cd03597 CLECT_attractin_like C 99.9 3E-22 6.4E-27 152.3 10.9 105 40-175 10-129 (129)
9 cd03589 CLECT_CEL-1_like C-typ 99.9 5.3E-22 1.2E-26 150.4 11.7 104 40-175 10-137 (137)
10 cd03590 CLECT_DC-SIGN_like C-t 99.9 1.4E-21 2.9E-26 145.9 12.0 104 40-175 10-126 (126)
11 cd03602 CLECT_1 C-type lectin 99.9 7.8E-22 1.7E-26 144.6 10.5 99 42-175 2-108 (108)
12 cd03594 CLECT_REG-1_like C-typ 99.9 2.1E-21 4.6E-26 145.7 11.0 104 40-175 10-129 (129)
13 cd03599 CLECT_DGCR2_like C-typ 99.9 2.6E-21 5.7E-26 151.8 10.9 107 41-175 13-153 (153)
14 cd03593 CLECT_NK_receptors_lik 99.8 9.3E-21 2E-25 139.8 11.3 99 40-175 10-116 (116)
15 cd03595 CLECT_chondrolectin_li 99.8 1.8E-20 4E-25 145.7 11.5 117 40-175 10-149 (149)
16 cd03600 CLECT_thrombomodulin_l 99.8 1.3E-19 2.9E-24 139.2 12.0 113 40-176 4-140 (141)
17 smart00034 CLECT C-type lectin 99.8 1.1E-18 2.3E-23 128.2 11.2 103 39-174 9-126 (126)
18 TIGR00864 PCC polycystin catio 99.8 9.7E-19 2.1E-23 178.9 12.2 119 31-178 316-449 (2740)
19 PHA02642 C-type lectin-like pr 99.8 2.5E-18 5.5E-23 141.1 11.9 112 30-177 85-202 (216)
20 PHA02953 IEV and EEV membrane 99.7 2E-17 4.2E-22 131.9 9.8 104 39-176 55-167 (170)
21 PHA03097 C-type lectin-like pr 99.7 1.4E-16 3E-21 125.6 10.8 97 39-177 54-156 (157)
22 cd00037 CLECT C-type lectin (C 99.7 6.8E-16 1.5E-20 110.6 11.1 103 42-175 2-116 (116)
23 PF00059 Lectin_C: Lectin C-ty 99.7 1.5E-16 3.2E-21 113.5 6.9 96 55-175 1-105 (105)
24 PHA02867 C-type lectin protein 99.5 3.3E-14 7.2E-19 112.8 8.3 111 21-178 43-157 (167)
25 PHA02911 C-type lectin-like pr 99.2 1.4E-10 3.1E-15 93.6 10.0 101 40-178 112-212 (213)
26 PF05473 Herpes_UL45: UL45 pro 97.7 0.00011 2.4E-09 60.1 6.7 44 40-90 94-138 (200)
27 cd03519 Link_domain_HAPLN_modu 94.1 0.044 9.5E-07 39.4 2.5 24 54-77 8-31 (91)
28 cd01102 Link_Domain The link d 91.6 0.14 3.1E-06 36.8 2.2 25 53-77 10-34 (92)
29 cd03520 Link_domain_CSPGs_modu 91.5 0.14 3.1E-06 37.1 2.2 24 54-77 8-31 (96)
30 smart00445 LINK Link (Hyaluron 91.1 0.16 3.4E-06 36.8 2.0 24 54-77 12-35 (94)
31 cd03518 Link_domain_HAPLN_modu 90.2 0.19 4E-06 36.5 1.7 23 55-77 12-34 (95)
32 cd03521 Link_domain_KIAA0527_l 90.1 0.28 6E-06 35.3 2.5 24 54-77 11-34 (95)
33 PF00193 Xlink: Extracellular 90.0 0.21 4.6E-06 35.9 1.9 23 55-77 12-34 (92)
34 cd03515 Link_domain_TSG_6_like 89.7 0.2 4.3E-06 36.2 1.5 23 55-77 12-34 (93)
35 PHA03093 EEV glycoprotein; Pro 88.8 0.78 1.7E-05 37.0 4.5 41 40-89 108-148 (185)
36 cd03517 Link_domain_CSPGs_modu 88.7 0.26 5.7E-06 35.7 1.6 22 56-77 13-34 (95)
37 PHA02673 ORF109 EEV glycoprote 87.2 0.64 1.4E-05 36.7 3.0 30 40-77 84-113 (161)
38 KOG4297|consensus 85.4 3.9 8.4E-05 27.4 6.1 32 57-88 109-140 (207)
39 cd03516 Link_domain_CD44_like 84.3 0.65 1.4E-05 36.1 1.8 23 55-77 17-39 (144)
40 PF05966 Chordopox_A33R: Chord 69.1 4.1 9E-05 33.1 2.4 28 41-75 113-140 (190)
41 PF07979 Intimin_C: Intimin C- 53.9 6.2 0.00014 28.9 0.9 33 54-88 10-42 (101)
42 PF03891 DUF333: Domain of unk 36.2 30 0.00066 21.9 1.9 18 61-78 6-23 (50)
43 PHA02672 ORF110 EEV glycoprote 28.5 38 0.00082 26.7 1.6 57 40-106 59-115 (166)
44 PF03781 FGE-sulfatase: Sulfat 20.7 56 0.0012 26.9 1.4 34 55-88 94-130 (260)
No 1
>cd03591 CLECT_collectin_like C-type lectin-like domain (CTLD) of the type found in human collectins including lung surfactant proteins A and D, mannose- or mannan binding lectin (MBL), and CL-L1 (collectin liver 1). CLECT_collectin_like: C-type lectin-like domain (CTLD) of the type found in human collectins including lung surfactant proteins A and D, mannose- or mannan binding lectin (MBL), and CL-L1 (collectin liver 1). CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. The CTLDs of these collectins bind carbohydrates on surfaces (e.g. pathogens, allergens, necrotic, or apoptotic cells) and mediate functions associated with killing and phagocytosis. MBPs recognize high mannose oligosaccharides in a calcium dependent manner, bind to a broad range of pathogens, and trigger cell killing by activating the complement pathway. MBP also acts directly as an opsonin. SP-A and SP-D in addition to functioning as host defense components, a
Probab=99.90 E-value=1.4e-23 Score=155.54 Aligned_cols=96 Identities=21% Similarity=0.434 Sum_probs=84.0
Q ss_pred CcccCHHHHHHHHHHCCCeEeEecCHHHHHHHHHHHHcCCCCcEEEceeeCCCCCCCCCCCCCCCCCceEEccCCCcccc
Q psy4323 54 SLEVDWLDARNICRRHCMDAVSLETPQENEFVKQRITRGNVRYIWTSGRKCNFNGCDRPDLQPANVNGWFWSGSGAKIGP 133 (191)
Q Consensus 54 ~~~~sW~~A~~~C~~~gg~LasI~s~~E~~~i~~~l~~~~~~~~WIGl~~~~~~gc~~~~~~~~~~~~w~W~~dGs~~~~ 133 (191)
.++++|.+|+.+|+.+||+||+|+|++|+++|..++.+ ....+||||++...++ .|.|+ ||+++
T Consensus 8 ~~~~~w~~A~~~C~~~g~~La~i~s~~e~~~l~~~~~~-~~~~~WiGl~~~~~~~------------~~~w~-dg~~~-- 71 (114)
T cd03591 8 GEEKNFDDAQKLCSEAGGTLAMPRNAAENAAIASYVKK-GNTYAFIGITDLETEG------------QFVYL-DGGPL-- 71 (114)
T ss_pred CceeCHHHHHHHHhhcCCEEecCCCHHHHHHHHHHHhc-CCccEEEecccCCcCC------------cEEeC-CCCCc--
Confidence 57899999999999999999999999999999999986 3457999999987766 99997 99998
Q ss_pred CCCCCCCCCCCCCCCCCCCCCCccCCC-----------cCcCCCCCcceEEee
Q psy4323 134 TTQRNTGDWSATGGFGQAQPDNREAAQ-----------HDVACHHLKPFVCED 175 (191)
Q Consensus 134 ~~~~~y~nW~~~~~~~~~qP~~~~~~~-----------~d~~C~~~~~FICe~ 175 (191)
.|.+|.+ ++|++....+ .|.+|+.+++||||+
T Consensus 72 ----~y~~W~~------~ep~~~~~~~~Cv~~~~~~~W~~~~C~~~~~fICe~ 114 (114)
T cd03591 72 ----TYTNWKP------GEPNNAGGGEDCVEMYTSGKWNDVACNLTRLFVCEF 114 (114)
T ss_pred ----ccCCcCC------CCCCCCCCCCCeEEECCCCcCcCccCCCCeeEEeeC
Confidence 5779999 8898654211 789999999999985
No 2
>cd03598 CLECT_EMBP_like C-type lectin-like domain (CTLD) of the type found in the human proteins, eosinophil major basic protein (EMBP) and prepro major basic protein homolog (MBPH). CLECT_EMBP_like: C-type lectin-like domain (CTLD) of the type found in the human proteins, eosinophil major basic protein (EMBP) and prepro major basic protein homolog (MBPH). CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. Eosinophils and basophils carry out various functions in allergic, parasitic, and inflammatory diseases. EMBP is stored in eosinophil crystalloid granules and is released upon degranulation. EMBP is also expressed in basophils. The proform of EMBP is expressed in placental X cells and breast tissue and increases significantly during human pregnancy. EMBP has cytotoxic properties and damages bacteria and mammalian cells, in vitro, as well as, helminth parasites. EMBP deposition has been observed in the inflamed tissue of all
Probab=99.90 E-value=1.9e-23 Score=155.64 Aligned_cols=104 Identities=25% Similarity=0.429 Sum_probs=86.9
Q ss_pred ceeEEEeccCCCCCCcccCHHHHHHHHHHC-CCeEeEecCHHHHHHHHHHHHcCCCCcEEEceeeC--CCCCCCCCCCCC
Q psy4323 40 THSYFFSWEHAPTRSLEVDWLDARNICRRH-CMDAVSLETPQENEFVKQRITRGNVRYIWTSGRKC--NFNGCDRPDLQP 116 (191)
Q Consensus 40 ~~~Y~fs~~~~~~~~~~~sW~~A~~~C~~~-gg~LasI~s~~E~~~i~~~l~~~~~~~~WIGl~~~--~~~gc~~~~~~~ 116 (191)
|+||.|+ ..+++|.+|+..|+.+ ||+||+|+|++|+++|.+++.......+||||++. ..++
T Consensus 1 ~~Cy~~~-------~~~~t~~~A~~~C~~~~g~~La~i~s~~e~~~l~~~~~~~~~~~~WiGl~~~~~~~~~-------- 65 (117)
T cd03598 1 GRCYRFV-------KSPRTFRDAQVICRRCYRGNLASIHSFAFNYRVQRLVSTLNQAQVWIGGIITGKGRCR-------- 65 (117)
T ss_pred CceEEEe-------cCCCCHHHHHHHhhcCCCceEeeecChhHhHHHHHHHhCCCCCCEEEeeEcCCCCcCC--------
Confidence 4679986 4689999999999995 99999999999999999999765556899999987 4444
Q ss_pred CCCCceEEccCCCccccCCCCCCCCCCCCCCCCCCCCCCccCCC----------cCcCCCCCcceEEee
Q psy4323 117 ANVNGWFWSGSGAKIGPTTQRNTGDWSATGGFGQAQPDNREAAQ----------HDVACHHLKPFVCED 175 (191)
Q Consensus 117 ~~~~~w~W~~dGs~~~~~~~~~y~nW~~~~~~~~~qP~~~~~~~----------~d~~C~~~~~FICe~ 175 (191)
.|.|+ ||+++ .|.+|.+ +||++..... +|.+|..+++||||.
T Consensus 66 ----~~~W~-dg~~~------~y~~W~~------g~p~~~~~~Cv~~~~~~g~W~~~~C~~~~~fiC~~ 117 (117)
T cd03598 66 ----RFSWV-DGSVW------NYAYWAP------GQPGNRRGHCVELCTRGGHWRRAHCKLRRPFICSY 117 (117)
T ss_pred ----eeEeC-CCCcc------CcCCCCC------CCCCCCCCCcEEEeCCCCeECCCcCCCCceeeecC
Confidence 89998 99988 5679999 8998632111 899999999999984
No 3
>cd03603 CLECT_VCBS A bacterial subgroup of the C-type lectin-like (CTLD) domain; a subgroup of bacterial protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. CLECT_VCBS: A bacterial subgroup of the C-type lectin-like (CTLD) domain; a subgroup of bacterial protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. Many CTLDs are calcium-dependent carbohydrate binding modules; other CTLDs bind protein ligands, lipids, and inorganic surfaces including CaCO3 and ice. Bacterial CTLDs within this group are functionally uncharacterized. Animal C-type lectins are involved in such functions as extracellular matrix organization, endocytosis, complement activation, pathogen recognition, and cell-cell interactions. CTLDs may bind a variety of carbohydrate ligands including mannose, N-acetylglucosamine, galactose, N-acetylgalactosamine, and fucose. CTLDs associate with each other through several different surface
Probab=99.90 E-value=5.8e-23 Score=153.78 Aligned_cols=100 Identities=21% Similarity=0.467 Sum_probs=84.7
Q ss_pred eeEEEeccCCCCCCcccCHHHHHHHHHHCCCeEeEecCHHHHHHHHHHHHcCCCCcEEEceeeCCCCCCCCCCCCCCCCC
Q psy4323 41 HSYFFSWEHAPTRSLEVDWLDARNICRRHCMDAVSLETPQENEFVKQRITRGNVRYIWTSGRKCNFNGCDRPDLQPANVN 120 (191)
Q Consensus 41 ~~Y~fs~~~~~~~~~~~sW~~A~~~C~~~gg~LasI~s~~E~~~i~~~l~~~~~~~~WIGl~~~~~~gc~~~~~~~~~~~ 120 (191)
|+|.|+ .++++|.+|+..|+++||+||+|+|++|++||.+++. ....+||||++...++
T Consensus 1 ~~Y~~~-------~~~~sw~~A~~~C~~~g~~La~I~s~~E~~fv~~~~~--~~~~~WiG~~~~~~~~------------ 59 (118)
T cd03603 1 HFYKFV-------DGGMTWEAAQTLAESLGGHLVTINSAEENDWLLSNFG--GYGASWIGASDAATEG------------ 59 (118)
T ss_pred CeEEEe-------CCCcCHHHHHHHHHHcCCEEcccCCHHHHHHHHHHhc--cCCCEEEeeecCCCCC------------
Confidence 578887 3689999999999999999999999999999999987 2457999999987666
Q ss_pred ceEEccCCCccccCCCCCCCCCCCCCCCCCCCC-CCccCC------------C---cCcCCC-CCcceEEe
Q psy4323 121 GWFWSGSGAKIGPTTQRNTGDWSATGGFGQAQP-DNREAA------------Q---HDVACH-HLKPFVCE 174 (191)
Q Consensus 121 ~w~W~~dGs~~~~~~~~~y~nW~~~~~~~~~qP-~~~~~~------------~---~d~~C~-~~~~FICe 174 (191)
.|+|+ ||+++ .|.+|.+ +|| ++.... . +|.+|+ .+++||||
T Consensus 60 ~w~W~-dg~~~------~~~~W~~------~eP~~~~~~~~~Cv~~~~~~~~~~~W~d~~C~~~~~~~iCe 117 (118)
T cd03603 60 TWKWS-DGEES------TYTNWGS------GEPHNNGGGNEDYAAINHFPGISGKWNDLANSYNTLGYVIE 117 (118)
T ss_pred ceEeC-CCCcC------CCCCcCC------CCCCCCCCCCcCeEEeecCCCCCCcCccCCCCccccceEEe
Confidence 99998 99997 5779999 888 432211 1 889999 99999998
No 4
>cd03592 CLECT_selectins_like C-type lectin-like domain (CTLD) of the type found in the type 1 transmembrane proteins: P(platelet)-, E(endothelial)-, and L(leukocyte)- selectins (sels). CLECT_selectins_like: C-type lectin-like domain (CTLD) of the type found in the type 1 transmembrane proteins: P(platelet)-, E(endothelial)-, and L(leukocyte)- selectins (sels). CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. P- E- and L-sels are cell adhesion receptors that mediate the initial attachment, tethering, and rolling of lymphocytes on inflamed vascular walls enabling subsequent lymphocyte adhesion and transmigration. L- sel is expressed constitutively on most leukocytes. P-sel is stored in the Weibel-Palade bodies of endothelial cells and in the alpha granules of platelets. E- sels are present on endothelial cells. Following platelet and/or endothelial cell activation P- sel is rapidly translocated to the cell surface and E-sel exp
Probab=99.90 E-value=4.8e-23 Score=152.67 Aligned_cols=101 Identities=22% Similarity=0.539 Sum_probs=85.7
Q ss_pred EEEeccCCCCCCcccCHHHHHHHHHHCCCeEeEecCHHHHHHHHHHHHcCCCCcEEEceeeCCCCCCCCCCCCCCCCCce
Q psy4323 43 YFFSWEHAPTRSLEVDWLDARNICRRHCMDAVSLETPQENEFVKQRITRGNVRYIWTSGRKCNFNGCDRPDLQPANVNGW 122 (191)
Q Consensus 43 Y~fs~~~~~~~~~~~sW~~A~~~C~~~gg~LasI~s~~E~~~i~~~l~~~~~~~~WIGl~~~~~~gc~~~~~~~~~~~~w 122 (191)
|+|+ .++++|.+|+.+|+.+||+||+|+|++|++||..++.......+||||++...++ .|
T Consensus 3 Y~~~-------~~~~~w~~A~~~C~~~g~~La~i~s~~e~~~i~~~~~~~~~~~~WiG~~~~~~~~------------~W 63 (115)
T cd03592 3 YHYS-------TEKMTFNEAVKYCKSRGTDLVAIQNAEENALLNGFALKYNLGYYWIDGNDINNEG------------TW 63 (115)
T ss_pred EEEc-------CCccCHHHHHHHHHHcCCeEeecCCHHHHHHHHHHHHhcCCCCEEEeCccCCccC------------eE
Confidence 7776 4689999999999999999999999999999999876654457999999987765 89
Q ss_pred EEccCCCccccCCCCCCCCCCCCCCCCCCCCCCccCC---------C---cCcCCCCCcceEEee
Q psy4323 123 FWSGSGAKIGPTTQRNTGDWSATGGFGQAQPDNREAA---------Q---HDVACHHLKPFVCED 175 (191)
Q Consensus 123 ~W~~dGs~~~~~~~~~y~nW~~~~~~~~~qP~~~~~~---------~---~d~~C~~~~~FICe~ 175 (191)
+|+ ||+++ .|.+|.+ +||++.... . +|.+|+.+++||||+
T Consensus 64 ~~~-dg~~~------~y~~W~~------geP~~~~~~~Cv~~~~~~~g~W~d~~C~~~~~fICe~ 115 (115)
T cd03592 64 VDT-DKKEL------EYKNWAP------GEPNNGRNENCLEIYIKDNGKWNDEPCSKKKSAICYT 115 (115)
T ss_pred EeC-CCCcc------cccccCC------CCCCCCCCCCceEEccCCCCCCcCcCCCCCccceeCC
Confidence 997 99987 5779999 899864311 0 789999999999995
No 5
>cd03596 CLECT_tetranectin_like C-type lectin-like domain (CTLD) of the type found in the tetranectin (TN), cartilage derived C-type lectin (CLECSF1), and stem cell growth factor (SCGF). CLECT_tetranectin_like: C-type lectin-like domain (CTLD) of the type found in the tetranectin (TN), cartilage derived C-type lectin (CLECSF1), and stem cell growth factor (SCGF). CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. TN binds to plasminogen and stimulates activation of plasminogen, playing a key role in the regulation of proteolytic processes. The TN CTLD binds two calcium ions. Its calcium free form binds to various kringle-like protein ligands. Two residues involved in the coordination of calcium are critical for the binding of TN to the fourth kringle (K4) domain of plasminogen (Plg K4). TN binds the kringle 1-4 form of angiostatin (AST K1-4). AST K1-4 is a fragment of Plg, commonly found in cancer tissues. TN inhibits the bin
Probab=99.89 E-value=6.5e-23 Score=155.11 Aligned_cols=107 Identities=21% Similarity=0.363 Sum_probs=87.2
Q ss_pred ceeEEEeccCCCCCCcccCHHHHHHHHHHCCCeEeEecCHHHHHHHHHHHHcC--CCCcEEEceeeCCCCCCCCCCCCCC
Q psy4323 40 THSYFFSWEHAPTRSLEVDWLDARNICRRHCMDAVSLETPQENEFVKQRITRG--NVRYIWTSGRKCNFNGCDRPDLQPA 117 (191)
Q Consensus 40 ~~~Y~fs~~~~~~~~~~~sW~~A~~~C~~~gg~LasI~s~~E~~~i~~~l~~~--~~~~~WIGl~~~~~~gc~~~~~~~~ 117 (191)
++||+|+. ++++|.+|+.+|+++||+||+|+|++|+++|.+++++. ....+||||++...++
T Consensus 9 ~~CY~~~~-------~~~~w~~A~~~C~~~g~~La~i~s~~e~~~l~~~~~~~~~~~~~~WiGl~~~~~~~--------- 72 (129)
T cd03596 9 KKCYLVSE-------ETKHYHEASEDCIARGGTLATPRDSDENDALRDYVKASVPGNWEVWLGINDMVAEG--------- 72 (129)
T ss_pred CEEEEEec-------ccCCHHHHHHHHHhcCCeEecCCCHHHHHHHHHHHHhccCCCCcEEEeccccCccC---------
Confidence 46799973 57899999999999999999999999999999988754 2357999999988877
Q ss_pred CCCceEEccCCCccccCCCCCCCCCCCCCCCCCCCCCCccCC------------CcCcCCCCCcceEEee
Q psy4323 118 NVNGWFWSGSGAKIGPTTQRNTGDWSATGGFGQAQPDNREAA------------QHDVACHHLKPFVCED 175 (191)
Q Consensus 118 ~~~~w~W~~dGs~~~~~~~~~y~nW~~~~~~~~~qP~~~~~~------------~~d~~C~~~~~FICe~ 175 (191)
.|+|+ ||+++ .|.+|.+. ..+||++.... -+|.+|+.+++||||+
T Consensus 73 ---~w~w~-dG~~~------~~~~W~~~---~~~~p~~~~~~~Cv~l~~~~~~~W~d~~C~~~~~fICe~ 129 (129)
T cd03596 73 ---KWVDV-NGSPI------SYFNWERE---ITAQPDGGKRENCVALSSSAQGKWFDEDCRREKPYVCEF 129 (129)
T ss_pred ---eEEeC-CCCCc------cccccCCC---CCCCCCCCCCCCCEEEccCCCCcCcCccCCCCCceeccC
Confidence 99998 99998 46799851 11677642211 0789999999999985
No 6
>cd03601 CLECT_TC14_like C-type lectin-like domain (CTLD) of the type found in lectins TC14, TC14-2, TC14-3, and TC14-4 from the budding tunicate Polyandrocarpa misakiensis and PfG6 from the Acorn worm. CLECT_TC14_like: C-type lectin-like domain (CTLD) of the type found in lectins TC14, TC14-2, TC14-3, and TC14-4 from the budding tunicate Polyandrocarpa misakiensis and PfG6 from the Acorn worm. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. TC14 is homodimeric. The CTLD of TC14 binds D-galactose and D-fucose. TC14 is expressed constitutively by multipotent epithelial and mesenchymal cells and plays a role during budding, in inducing the aggregation of undifferentiated mesenchymal cells to give rise to epithelial forming tissue. TC14-2 and TC14-3 show calcium-dependent galactose binding activity. TC14-3 is a cytostatic factor which blocks cell growth and dedifferentiation of the atrial epithelium during asexual reproducti
Probab=99.89 E-value=9.8e-23 Score=152.64 Aligned_cols=102 Identities=21% Similarity=0.421 Sum_probs=81.5
Q ss_pred eEEEeccCCCCCCcccCHHHHHHHHHHCCCeEeEecCHHHHHHHHHHHHcC---CCCcEEEceeeC-CCCCCCCCCCCCC
Q psy4323 42 SYFFSWEHAPTRSLEVDWLDARNICRRHCMDAVSLETPQENEFVKQRITRG---NVRYIWTSGRKC-NFNGCDRPDLQPA 117 (191)
Q Consensus 42 ~Y~fs~~~~~~~~~~~sW~~A~~~C~~~gg~LasI~s~~E~~~i~~~l~~~---~~~~~WIGl~~~-~~~gc~~~~~~~~ 117 (191)
.|||+ .++++|.+|+.+|+.+||+||+|+|++| .++ ..+... ....+||||+|. ..++
T Consensus 2 ~~~~~-------~~~~~w~~A~~~C~~~G~~La~i~s~~e-~~~-~~i~~~~~~~~~~~WIGl~d~~~~~g--------- 63 (119)
T cd03601 2 EILCS-------DETMNYAKAGAFCRSRGMRLASLAMRDS-EMR-DAILAFTLVKGHGYWVGADNLQDGEY--------- 63 (119)
T ss_pred EEEEc-------CCcCCHHHHHHHHHhcCCEEeeecCHHH-HHH-HHHHhccccCCccEEEEeccCCCCCC---------
Confidence 48887 4689999999999999999999999988 333 333322 234699999998 7766
Q ss_pred CCCceEEccCCCccccCCCCCCCCCCCCCCCCCCCCCCccCCC------------cCcCCCCCcceEEee
Q psy4323 118 NVNGWFWSGSGAKIGPTTQRNTGDWSATGGFGQAQPDNREAAQ------------HDVACHHLKPFVCED 175 (191)
Q Consensus 118 ~~~~w~W~~dGs~~~~~~~~~y~nW~~~~~~~~~qP~~~~~~~------------~d~~C~~~~~FICe~ 175 (191)
.|+|+ ||+++.. .|.+|.+ +||++....+ +|.+|..+++||||+
T Consensus 64 ---~~~W~-dG~~~~~----~y~~W~~------geP~~~~~~e~Cv~~~~~~~~W~d~~C~~~~~fICek 119 (119)
T cd03601 64 ---DFLWN-DGVSLPT----DSDLWAP------NEPSNPQSRQLCVQLWSKYNLLDDEYCGRAKRVICEK 119 (119)
T ss_pred ---CeEeC-CCCCcCC----CCCccCC------CcCcCcCCCcCCeEEeCCCCCEeCccCCCCceeeecC
Confidence 99998 9998841 4789999 9998753221 899999999999986
No 7
>cd03588 CLECT_CSPGs C-type lectin-like domain (CTLD) of the type found in chondroitin sulfate proteoglycan core proteins. CLECT_CSPGs: C-type lectin-like domain (CTLD) of the type found in chondroitin sulfate proteoglycan core proteins (CSPGs) in human and chicken aggrecan, frog brevican, and zebra fish dermacan. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with hyaluronan (HA). These aggregates contribute to the tissue's load bearing properties. Aggregates having other CSPGs substituting for aggrecan may contribute to the structural integrity of many different tissues. Xenopus brevican is expressed in the notochord and the brain during early embryogenesis. Zebra fish dermacan is expressed in dermal bones and may play a role in dermal bone development. CSPGs do contain LINK domain(s) which bind HA. These LINK domains are considered by one classif
Probab=99.89 E-value=1.2e-22 Score=152.80 Aligned_cols=101 Identities=31% Similarity=0.606 Sum_probs=84.8
Q ss_pred ceeEEEeccCCCCCCcccCHHHHHHHHHHCCCeEeEecCHHHHHHHHHHHHcCCCCcEEEceeeCCCCCCCCCCCCCCCC
Q psy4323 40 THSYFFSWEHAPTRSLEVDWLDARNICRRHCMDAVSLETPQENEFVKQRITRGNVRYIWTSGRKCNFNGCDRPDLQPANV 119 (191)
Q Consensus 40 ~~~Y~fs~~~~~~~~~~~sW~~A~~~C~~~gg~LasI~s~~E~~~i~~~l~~~~~~~~WIGl~~~~~~gc~~~~~~~~~~ 119 (191)
++||.|+ .++++|.+|+..|+.+||+||+|+|++|++||..++. ..+||||++...++
T Consensus 10 ~~Cy~~~-------~~~~sw~~A~~~C~~~gg~La~i~s~~e~~fl~~~~~----~~~WIGl~~~~~~~----------- 67 (124)
T cd03588 10 GHCYRHF-------PDRETWEDAERRCREQQGHLSSIVTPEEQEFVNNNAQ----DYQWIGLNDRTIEG----------- 67 (124)
T ss_pred CEEEEEE-------CCccCHHHHHHHHHhcCCEEeccCCHHHHHHHHHhcc----CcEEecceecCCCC-----------
Confidence 4669887 3679999999999999999999999999999988753 36999999887765
Q ss_pred CceEEccCCCccccCCCCCCCCCCCCCCCCCCCCCCcc-CCC-------------cCcCCCCCcceEEeec
Q psy4323 120 NGWFWSGSGAKIGPTTQRNTGDWSATGGFGQAQPDNRE-AAQ-------------HDVACHHLKPFVCEDS 176 (191)
Q Consensus 120 ~~w~W~~dGs~~~~~~~~~y~nW~~~~~~~~~qP~~~~-~~~-------------~d~~C~~~~~FICe~~ 176 (191)
.|+|+ ||+++ .|.+|.+ +||++.. ..+ +|.+|+.+++||||+.
T Consensus 68 -~~~W~-dg~~~------~~~~W~~------~~p~~~~~~~~~Cv~~~~~~~~~W~d~~C~~~~~fICe~~ 124 (124)
T cd03588 68 -DFRWS-DGHPL------QFENWRP------NQPDNFFATGEDCVVMIWHEEGEWNDVPCNYHLPFTCKKG 124 (124)
T ss_pred -ceEeC-CCCcc------cccCcCC------CCCCCCCCCCCCeEEEecCCCCeEcCCCCCCCCeeeeeCC
Confidence 89998 99998 4679999 8997631 111 8999999999999973
No 8
>cd03597 CLECT_attractin_like C-type lectin-like domain (CTLD) of the type found in human and mouse attractin (AtrN) and attractin-like protein (ALP). CLECT_attractin_like: C-type lectin-like domain (CTLD) of the type found in human and mouse attractin (AtrN) and attractin-like protein (ALP). CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. Mouse AtrN (the product of the mahogany gene) has been shown to bind Agouti protein and to function in agouti-induced pigmentation and obesity. Mutations in AtrN have also been shown to cause spongiform encephalopathy and hypomyelination in rats and hamsters. The cytoplasmic region of mouse ALP has been shown to bind to melanocortin receptor (MCR4). Signaling through MCR4 plays a role in appetite suppression. Attractin may have therapeutic potential in the treatment of obesity. Human attractin (hAtrN) has been shown to be expressed on activated T cells and released extracellularly. The
Probab=99.88 E-value=3e-22 Score=152.33 Aligned_cols=105 Identities=21% Similarity=0.356 Sum_probs=86.7
Q ss_pred ceeEEEeccCCCCCCcccCHHHHHHHHHHCCCeEeEecCHHHHHHHHHHHHcCC----CCcEEEceeeCCCCCCCCCCCC
Q psy4323 40 THSYFFSWEHAPTRSLEVDWLDARNICRRHCMDAVSLETPQENEFVKQRITRGN----VRYIWTSGRKCNFNGCDRPDLQ 115 (191)
Q Consensus 40 ~~~Y~fs~~~~~~~~~~~sW~~A~~~C~~~gg~LasI~s~~E~~~i~~~l~~~~----~~~~WIGl~~~~~~gc~~~~~~ 115 (191)
+.||.|+ ...++|.+|+.+|+++||+||+|++.+|++||.+++.+.. ...+||||++.. ++
T Consensus 10 ~~Cy~~~-------~~~~tw~~A~~~C~~~g~~La~i~~~~E~~fi~~~~~~~~~~~~~~~~WIGl~d~~-~g------- 74 (129)
T cd03597 10 NSCLKIN-------TARESYDNAKLYCRNLNAVLASLTTQKKVEFVLKELQKHQMTKQKLTPWVGLRKIN-VS------- 74 (129)
T ss_pred CEEEEEE-------cCCCCHHHHHHHHHHcCCEEcCCCCHHHHHHHHHHHHhhcccCCCCceEEeeecCC-CC-------
Confidence 4569887 3679999999999999999999999999999999887531 247899999875 45
Q ss_pred CCCCCceEEccCCCccccCCCCCCCCCCCCCCCCCCCCCCccCCC----------cCcCCCCC-cceEEee
Q psy4323 116 PANVNGWFWSGSGAKIGPTTQRNTGDWSATGGFGQAQPDNREAAQ----------HDVACHHL-KPFVCED 175 (191)
Q Consensus 116 ~~~~~~w~W~~dGs~~~~~~~~~y~nW~~~~~~~~~qP~~~~~~~----------~d~~C~~~-~~FICe~ 175 (191)
.|+|+ ||+++.+ .|++|.+ +||++.+... +|.+|... ++||||+
T Consensus 75 -----~w~W~-Dgs~~~~----~~~~W~~------geP~~~~~C~~~~~~~~~~w~d~~C~~~~~~~iCe~ 129 (129)
T cd03597 75 -----YWCWE-DMSPFTN----TTLQWLP------GEPSDAGFCGYLEEPAVSGLKANPCTNPVNGSVCER 129 (129)
T ss_pred -----ceEEC-CCCCCCC----ccccCCC------CCCCCcccEEEEcccccCccccCCcCCCCcceeecC
Confidence 89998 9998743 3779999 9999753211 99999999 6999995
No 9
>cd03589 CLECT_CEL-1_like C-type lectin-like domain (CTLD) of the type found in CEL-1 from Cucumaria echinata and Echinoidin from Anthocidaris crassispina. CLECT_CEL-1_like: C-type lectin-like domain (CTLD) of the type found in CEL-1 from Cucumaria echinata and Echinoidin from Anthocidaris crassispina. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. The CEL-1 CTLD binds three calcium ions and has a high specificity for N-acetylgalactosamine (GalNAc). CEL-1 exhibits strong cytotoxicity which is inhibited by GalNAc. This protein may play a role as a toxin defending against predation. Echinoidin is found in the coelomic fluid of the sea urchin and is specific for GalBeta1-3GalNAc. Echinoidin has a cell adhesive activity towards human cancer cells which is not mediated through the CTLD. Both CEL-1 and Echinoidin are multimeric proteins comprised of multiple dimers linked by disulfide bonds.
Probab=99.88 E-value=5.3e-22 Score=150.44 Aligned_cols=104 Identities=24% Similarity=0.544 Sum_probs=86.8
Q ss_pred ceeEEEeccCCCCCCcccCHHHHHHHHHHC-----CCeEeEecCHHHHHHHHHHHHcC----CCCcEEEceeeCCCCCCC
Q psy4323 40 THSYFFSWEHAPTRSLEVDWLDARNICRRH-----CMDAVSLETPQENEFVKQRITRG----NVRYIWTSGRKCNFNGCD 110 (191)
Q Consensus 40 ~~~Y~fs~~~~~~~~~~~sW~~A~~~C~~~-----gg~LasI~s~~E~~~i~~~l~~~----~~~~~WIGl~~~~~~gc~ 110 (191)
++||.|+ .+.++|.+|+.+|+.+ ||+||+|+|++|++||..++... ....+||||++...++
T Consensus 10 ~~Cy~~~-------~~~~~w~~A~~~C~~~~~~g~~~~La~i~s~~e~~~l~~~~~~~~~~~~~~~~WiGl~~~~~~~-- 80 (137)
T cd03589 10 GYCYRFF-------GDRLTWEEAELRCRSFSIPGLIAHLVSIHSQEENDFVYDLFESSRGPDTPYGLWIGLHDRTSEG-- 80 (137)
T ss_pred CEEEEEe-------CCCcCHHHHHHHHHhhcCCCCCceEcccCCHHHHHHHHHHHhhccccCCCCcEEEeeecCCccC--
Confidence 5679987 3579999999999987 69999999999999999998754 2357999999887766
Q ss_pred CCCCCCCCCCceEEccCCCccccCCCCCCCCCCCCCCCCCCCCCCccCCC---------------cCcCCCCCcceEEee
Q psy4323 111 RPDLQPANVNGWFWSGSGAKIGPTTQRNTGDWSATGGFGQAQPDNREAAQ---------------HDVACHHLKPFVCED 175 (191)
Q Consensus 111 ~~~~~~~~~~~w~W~~dGs~~~~~~~~~y~nW~~~~~~~~~qP~~~~~~~---------------~d~~C~~~~~FICe~ 175 (191)
.|+|+ ||+++ .|.+|.+ +||++....+ +|.+|+.+++||||+
T Consensus 81 ----------~~~W~-dG~~~------~~~~W~~------~~P~~~~~~~~C~~~~~~~~~~~~W~d~~C~~~~~fIC~~ 137 (137)
T cd03589 81 ----------PFEWT-DGSPV------DFTKWAG------GQPDNYGGNEDCVQMWRRGDAGQSWNDMPCDAVFPYICKM 137 (137)
T ss_pred ----------ceEeC-CcCcC------CcCCcCC------CCCCCCCCCCCceeeecCCCCCCeecCCCCCCCcceeeeC
Confidence 99998 99997 5779999 8998643211 678999999999984
No 10
>cd03590 CLECT_DC-SIGN_like C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific intercellular adhesion molecule 3-grabbing non-integrin (DC-SIGN) and the related receptor, DC-SIGN receptor (DC-SIGNR). CLECT_DC-SIGN_like: C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific intercellular adhesion molecule 3-grabbing non-integrin (DC-SIGN) and the related receptor, DC-SIGN receptor (DC-SIGNR). This group also contains proteins similar to hepatic asialoglycoprotein receptor (ASGP-R) and langerin in human. These proteins are type II membrane proteins with a CTLD ectodomain. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. DC-SIGN is thought to mediate the initial contact between dendritic cells and resting T cells, and may also mediate the rolling of DCs on epithelium. DC-SIGN and DC-SIGNR bind to oligosaccharides present on human tissues, as well as, on path
Probab=99.87 E-value=1.4e-21 Score=145.86 Aligned_cols=104 Identities=24% Similarity=0.601 Sum_probs=87.3
Q ss_pred ceeEEEeccCCCCCCcccCHHHHHHHHHHCCCeEeEecCHHHHHHHHHHHHcCCCCcEEEceeeCCCCCCCCCCCCCCCC
Q psy4323 40 THSYFFSWEHAPTRSLEVDWLDARNICRRHCMDAVSLETPQENEFVKQRITRGNVRYIWTSGRKCNFNGCDRPDLQPANV 119 (191)
Q Consensus 40 ~~~Y~fs~~~~~~~~~~~sW~~A~~~C~~~gg~LasI~s~~E~~~i~~~l~~~~~~~~WIGl~~~~~~gc~~~~~~~~~~ 119 (191)
++||+|+. .+++|.+|+.+|+.+||+||+|+|++|++||.+++. ....+||||++...++
T Consensus 10 ~~Cy~~~~-------~~~tw~~A~~~C~~~g~~La~i~s~~e~~~l~~~~~--~~~~~WiGl~~~~~~~----------- 69 (126)
T cd03590 10 SSCYFFST-------EKKSWEESRQFCEDMGAHLVIINSQEEQEFISKILS--GNRSYWIGLSDEETEG----------- 69 (126)
T ss_pred CEEEEEeC-------CCcCHHHHHHHHHhCCCEEEeeCCHHHHHHHHHHhC--CCCCEEEeeecCCCcC-----------
Confidence 56699973 579999999999999999999999999999999986 3357999999986655
Q ss_pred CceEEccCCCccccCCCCCCCCCCCCCCCCCCCCCCcc--CCC-----------cCcCCCCCcceEEee
Q psy4323 120 NGWFWSGSGAKIGPTTQRNTGDWSATGGFGQAQPDNRE--AAQ-----------HDVACHHLKPFVCED 175 (191)
Q Consensus 120 ~~w~W~~dGs~~~~~~~~~y~nW~~~~~~~~~qP~~~~--~~~-----------~d~~C~~~~~FICe~ 175 (191)
.|.|+ ||+++. . .|.+|.+ ++|++.. ... .+.+|..+++||||+
T Consensus 70 -~~~W~-dg~~~~-~---~~~~W~~------~~p~~~~~~~~~C~~~~~~~~~w~~~~C~~~~~fiCek 126 (126)
T cd03590 70 -EWKWV-DGTPLN-S---SKTFWHP------GEPNNWGGGGEDCAELVYDSGGWNDVPCNLEYRWICEK 126 (126)
T ss_pred -CeEec-CCCCCC-C---ccCCcCC------CcCCCCCCCCCCCEEEECCCCcEeCcCCCCCEeeeeeC
Confidence 99998 999883 1 5779999 8998653 111 678999999999995
No 11
>cd03602 CLECT_1 C-type lectin (CTL)/C-type lectin-like (CTLD) domain subgroup 1; a subgroup of protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. CLECT_1: C-type lectin (CTL)/C-type lectin-like (CTLD) domain subgroup 1; a subgroup of protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. Many CTLDs are calcium-dependent carbohydrate binding modules; other CTLDs bind protein ligands, lipids, and inorganic surfaces including CaCO3 and ice. Animal C-type lectins are involved in such functions as extracellular matrix organization, endocytosis, complement activation, pathogen recognition, and cell-cell interactions. CTLDs may bind a variety of carbohydrate ligands including mannose, N-acetylglucosamine, galactose, N-acetylgalactosamine, and fucose. CTLDs associate with each other through several different surfaces to form dimers, trimers, or tetramers from which ligand-binding sites project in differe
Probab=99.87 E-value=7.8e-22 Score=144.61 Aligned_cols=99 Identities=21% Similarity=0.545 Sum_probs=82.3
Q ss_pred eEEEeccCCCCCCcccCHHHHHHHHHHCCCeEeEecCHHHHHHHHHHHHcCCCCcEEEceeeCCCCCCCCCCCCCCCCCc
Q psy4323 42 SYFFSWEHAPTRSLEVDWLDARNICRRHCMDAVSLETPQENEFVKQRITRGNVRYIWTSGRKCNFNGCDRPDLQPANVNG 121 (191)
Q Consensus 42 ~Y~fs~~~~~~~~~~~sW~~A~~~C~~~gg~LasI~s~~E~~~i~~~l~~~~~~~~WIGl~~~~~~gc~~~~~~~~~~~~ 121 (191)
+|+|+ .++++|.+|+.+|+.+||+||+|+|++|+++|.+++.. ....+||||++. ++ .
T Consensus 2 ~y~~~-------~~~~~w~~A~~~C~~~g~~La~i~s~~e~~~l~~~~~~-~~~~~WiGl~~~--~~------------~ 59 (108)
T cd03602 2 TFYLV-------NESKTWSEAQQYCRENYTDLATVQNQEDNALLSNLSRV-SNSAAWIGLYRD--VD------------S 59 (108)
T ss_pred ceEEe-------ccccCHHHHHHHHHHHCCccCeecCHHHHHHHHHHHhc-cCCcEEEEEECC--CC------------c
Confidence 37776 46899999999999999999999999999999999862 345799999986 44 8
Q ss_pred eEEccCCCccccCCCCCCCCCCCCCCCCCCCCCCccCCC--------cCcCCCCCcceEEee
Q psy4323 122 WFWSGSGAKIGPTTQRNTGDWSATGGFGQAQPDNREAAQ--------HDVACHHLKPFVCED 175 (191)
Q Consensus 122 w~W~~dGs~~~~~~~~~y~nW~~~~~~~~~qP~~~~~~~--------~d~~C~~~~~FICe~ 175 (191)
|.|+ ||+++ .|.+|.+ ++|.+..... .+.+|..+++||||+
T Consensus 60 ~~W~-dg~~~------~~~~w~~------~~~~~~~~C~~~~~~~~w~~~~C~~~~~fIC~~ 108 (108)
T cd03602 60 WRWS-DGSES------SFRNWNT------FQPFGQGDCATMYSSGRWYAALCSALKPFICYD 108 (108)
T ss_pred eEEc-CCCCC------ccCccCC------CCCCCCCCeeEECcCCeECcccCCCCcCEeccC
Confidence 9998 99986 5779998 6665432211 899999999999985
No 12
>cd03594 CLECT_REG-1_like C-type lectin-like domain (CTLD) of the type found in Human REG-1 (lithostathine), REG-4, and avian eggshell-specific proteins: ansocalcin, struthiocalcin-1(SCA-1), and -2(SCA-2). CLECT_REG-1_like: C-type lectin-like domain (CTLD) of the type found in Human REG-1 (lithostathine), REG-4, and avian eggshell-specific proteins: ansocalcin, struthiocalcin-1(SCA-1), and -2(SCA-2). CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. REG-1 is a proliferating factor which participates in various kinds of tissue regeneration including pancreatic beta-cell regeneration, regeneration of intestinal mucosa, regeneration of motor neurons, and perhaps in tissue regeneration of damaged heart. REG-1 may play a role in the pathophysiology of Alzheimer's disease and in the development of gastric cancers. Its expression is correlated with reduced survival from early-stage colorectal cancer. REG-1 also binds and aggregates
Probab=99.86 E-value=2.1e-21 Score=145.70 Aligned_cols=104 Identities=23% Similarity=0.530 Sum_probs=84.9
Q ss_pred ceeEEEeccCCCCCCcccCHHHHHHHHHHC--CCeEeEecCHHHHHHHHHHHHcC--CCCcEEEceeeCCCCCCCCCCCC
Q psy4323 40 THSYFFSWEHAPTRSLEVDWLDARNICRRH--CMDAVSLETPQENEFVKQRITRG--NVRYIWTSGRKCNFNGCDRPDLQ 115 (191)
Q Consensus 40 ~~~Y~fs~~~~~~~~~~~sW~~A~~~C~~~--gg~LasI~s~~E~~~i~~~l~~~--~~~~~WIGl~~~~~~gc~~~~~~ 115 (191)
++||.|+ .++++|.+|+.+|+.+ ||+||+|+|++|+++|..+++.. ....+||||++...++
T Consensus 10 ~~Cy~~~-------~~~~tw~~A~~~C~~~~~g~~La~i~s~~e~~~l~~~~~~~~~~~~~~WiGl~~~~~~~------- 75 (129)
T cd03594 10 GNCYGYF-------RQPLSWSDAELFCQKYGPGAHLASIHSPAEAAAIASLISSYQKAYQPVWIGLHDPQQSR------- 75 (129)
T ss_pred CEeeeEe-------ccCcCHHHHHHHHHhcCCCceEcccCCHHHHHHHHHHHHhhccCCccEEEeeccCCCCC-------
Confidence 5669887 3578999999999998 59999999999999999998754 3457999999877655
Q ss_pred CCCCCceEEccCCCccccCCCCCCCCCCCCCCCCCCCCCCccCCC------------cCcCCCCCcceEEee
Q psy4323 116 PANVNGWFWSGSGAKIGPTTQRNTGDWSATGGFGQAQPDNREAAQ------------HDVACHHLKPFVCED 175 (191)
Q Consensus 116 ~~~~~~w~W~~dGs~~~~~~~~~y~nW~~~~~~~~~qP~~~~~~~------------~d~~C~~~~~FICe~ 175 (191)
.|+|+ ||+++ .|.+|.+ ++|....... .|.+|+.+++||||+
T Consensus 76 -----~~~W~-dg~~~------~~~~W~~------~~p~~~~~~Cv~~~~~~~~~~W~~~~C~~~~~fICe~ 129 (129)
T cd03594 76 -----GWEWS-DGSKL------DYRSWDR------NPPYARGGYCAELSRSTGFLKWNDANCEERNPFICKY 129 (129)
T ss_pred -----ceEeC-CCCcc------eecccCC------CCCCCCCCCceEEEecCCCCeEECCCCCCCceeeeeC
Confidence 89998 99988 4679999 7873221110 678999999999985
No 13
>cd03599 CLECT_DGCR2_like C-type lectin-like domain (CTLD) of the type found in DGCR2, an integral membrane protein deleted in DiGeorge Syndrome (DGS). CLECT_DGCR2_like: C-type lectin-like domain (CTLD) of the type found in DGCR2, an integral membrane protein deleted in DiGeorge Syndrome (DGS). CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. DGS is also known as velo-cardio-facial syndrome (VCFS). DGS is a genetic abnormality that results in malformations of the heart, face, and limbs and is associated with schizophrenia and depressive disorders. DGCR2 is a candidate for involvement in the pathogenesis of DGS since the DGCR2 gene lies within the minimal DGS critical region (MDGRC) of 22q11, which when deleted gives rise to DGS, and the DGCR2 gene is in close proximity to the balanced translocation breakpoint in a DGS patient having a balanced translocation.
Probab=99.86 E-value=2.6e-21 Score=151.77 Aligned_cols=107 Identities=16% Similarity=0.235 Sum_probs=83.0
Q ss_pred eeEEEeccCCCCCCcccCHHHHHHHHHHCCCeEeEecCHHHHHHHHHHHHcCC--------CCcEEEceeeC------CC
Q psy4323 41 HSYFFSWEHAPTRSLEVDWLDARNICRRHCMDAVSLETPQENEFVKQRITRGN--------VRYIWTSGRKC------NF 106 (191)
Q Consensus 41 ~~Y~fs~~~~~~~~~~~sW~~A~~~C~~~gg~LasI~s~~E~~~i~~~l~~~~--------~~~~WIGl~~~------~~ 106 (191)
+||.++ .++++|.+|+.+|+++||+||+|+|.+|++||..++.+.. ...+||||++. ..
T Consensus 13 ~CYk~~-------~~~~tw~dA~~~C~~~Gg~Lasi~s~~e~~fl~~l~~~~~~~~~~~~~~~~~WIGL~~~~~~~~~~~ 85 (153)
T cd03599 13 SCYKVY-------LSGENYWDAVQTCQKVNGSLATFTTDQELQFILAQEWDFDERVFGRKDQCKFWVGYQYVITNRNHSL 85 (153)
T ss_pred eEEEEe-------CCcCCHHHHHHHHHHcCCEEcCCCCHHHHHHHHHHHHhhcccccccccCCCEEEeecccccccCccc
Confidence 679986 4689999999999999999999999999999999996431 14699999643 34
Q ss_pred CCCCCCCCCCCCCCceEEccC-CCccccCCCCCCCCCCCCCCCCCCCCCCccC-----C----------C----cCcCCC
Q psy4323 107 NGCDRPDLQPANVNGWFWSGS-GAKIGPTTQRNTGDWSATGGFGQAQPDNREA-----A----------Q----HDVACH 166 (191)
Q Consensus 107 ~gc~~~~~~~~~~~~w~W~~d-Gs~~~~~~~~~y~nW~~~~~~~~~qP~~~~~-----~----------~----~d~~C~ 166 (191)
+| .|+|+ | |+++.|+ ..+.+|.+ ++|++... . . +|..|.
T Consensus 86 eg------------~w~Ws-ddGs~~~y~--~w~~~w~~------gePn~~~e~C~~~~~~~~~~~~~~~~~~W~d~~C~ 144 (153)
T cd03599 86 EG------------RWEVA-YKGSMEVFL--PPEPIFAT------GMSTNDNVFCAQLQCFQIPSLRERGLHSWHAENCY 144 (153)
T ss_pred CC------------eEEEe-cCCccceec--CccccCCC------CCCCCCCCCCeEEEeeccccccccccCeeeCccCC
Confidence 55 99998 6 9998543 12445566 78876310 0 0 899999
Q ss_pred CCcceEEee
Q psy4323 167 HLKPFVCED 175 (191)
Q Consensus 167 ~~~~FICe~ 175 (191)
.+++||||+
T Consensus 145 ~~~~fiCq~ 153 (153)
T cd03599 145 EKSSFLCKR 153 (153)
T ss_pred CCCcceeCC
Confidence 999999985
No 14
>cd03593 CLECT_NK_receptors_like C-type lectin-like domain (CTLD) of the type found in natural killer cell receptors (NKRs). CLECT_NK_receptors_like: C-type lectin-like domain (CTLD) of the type found in natural killer cell receptors (NKRs), including proteins similar to oxidized low density lipoprotein (OxLDL) receptor (LOX-1), CD94, CD69, NKG2-A and -D, osteoclast inhibitory lectin (OCIL), dendritic cell-associated C-type lectin-1 (dectin-1), human myeloid inhibitory C-type lectin-like receptor (MICL), mast cell-associated functional antigen (MAFA), killer cell lectin-like receptors: subfamily F, member 1 (KLRF1) and subfamily B, member 1 (KLRB1), and lys49 receptors. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. NKRs are variously associated with activation or inhibition of natural killer (NK) cells. Activating NKRs stimulate cytolysis by NK cells of virally infected or transformed cells; inhibitory NKRs block cytolysis up
Probab=99.85 E-value=9.3e-21 Score=139.76 Aligned_cols=99 Identities=17% Similarity=0.468 Sum_probs=80.3
Q ss_pred ceeEEEeccCCCCCCcccCHHHHHHHHHHCCCeEeEecCHHHHHHHHHHHHcCCCCcEEEceeeCCCCCCCCCCCCCCCC
Q psy4323 40 THSYFFSWEHAPTRSLEVDWLDARNICRRHCMDAVSLETPQENEFVKQRITRGNVRYIWTSGRKCNFNGCDRPDLQPANV 119 (191)
Q Consensus 40 ~~~Y~fs~~~~~~~~~~~sW~~A~~~C~~~gg~LasI~s~~E~~~i~~~l~~~~~~~~WIGl~~~~~~gc~~~~~~~~~~ 119 (191)
++||+|+. ++++|.+|+.+|+.+||+||+|+|++|++||.+++ ....+|||+++...++
T Consensus 10 ~~Cy~~~~-------~~~~w~~A~~~C~~~g~~La~i~s~~e~~~l~~~~---~~~~~WiGl~~~~~~~----------- 68 (116)
T cd03593 10 NKCYYFSM-------EKKTWNESKEACSSKNSSLLKIDDEEELEFLQSQI---GSSSYWIGLSREKSEK----------- 68 (116)
T ss_pred CEEEEEEc-------CCCCHHHHHHHHHhCCCcEEEECCHHHHHHHHHhc---CCCceEEEEeecCCCC-----------
Confidence 46699974 46899999999999999999999999999999988 2357999999987666
Q ss_pred CceEEccCCCccccCCCCCCCCCCCCCCCCCCCCCCccCC--------CcCcCCCCCcceEEee
Q psy4323 120 NGWFWSGSGAKIGPTTQRNTGDWSATGGFGQAQPDNREAA--------QHDVACHHLKPFVCED 175 (191)
Q Consensus 120 ~~w~W~~dGs~~~~~~~~~y~nW~~~~~~~~~qP~~~~~~--------~~d~~C~~~~~FICe~ 175 (191)
.|.|+ ||+++. +|.. .+|++.... -.+..|..+++||||+
T Consensus 69 -~~~W~-dg~~~~--------~~~~------~~~~~~~~~C~~~~~~~w~~~~C~~~~~~IC~k 116 (116)
T cd03593 69 -PWKWI-DGSPLN--------NLFN------IRGSTKSGNCAYLSSTGIYSEDCSTKKRWICEK 116 (116)
T ss_pred -CeEcc-CCCccc--------cccc------ccCCCCCCCceEEcCCcEEcccCCcCceeeeeC
Confidence 99998 999872 6766 455331111 1889999999999996
No 15
>cd03595 CLECT_chondrolectin_like C-type lectin-like domain (CTLD) of the type found in the human type-1A transmembrane proteins chondrolectin (CHODL) and layilin. CLECT_chondrolectin_like: C-type lectin-like domain (CTLD) of the type found in the human type-1A transmembrane proteins chondrolectin (CHODL) and layilin. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. CHODL is predominantly expressed in muscle cells and is associated with T-cell maturation. Various alternatively spliced isoforms of CHODL have been identified. The transmembrane form of CHODL is localized in the ER-Golgi apparatus. Layilin is widely expressed in different cell types. The extracellular CTLD of layilin binds hyaluronan (HA), a major constituent of the extracellular matrix (ECM). The cytoplasmic tail of layilin binds various members of the band 4.1/ERM superfamily (talin, radixin, and merlin). The ERM proteins are cytoskeleton-membrane l
Probab=99.84 E-value=1.8e-20 Score=145.72 Aligned_cols=117 Identities=16% Similarity=0.359 Sum_probs=85.7
Q ss_pred ceeEEEeccCCCCCCcccCHHHHHHHHHHCCCeEeEecCHHHHHHHHHHHHcC--CCCcEEEceeeCCCCCCCCCCCCCC
Q psy4323 40 THSYFFSWEHAPTRSLEVDWLDARNICRRHCMDAVSLETPQENEFVKQRITRG--NVRYIWTSGRKCNFNGCDRPDLQPA 117 (191)
Q Consensus 40 ~~~Y~fs~~~~~~~~~~~sW~~A~~~C~~~gg~LasI~s~~E~~~i~~~l~~~--~~~~~WIGl~~~~~~gc~~~~~~~~ 117 (191)
++||.+.... ...++++|.+|+.+|+.+||+||+|+|++|+++|..+|... ....+||||++...++ .+...+
T Consensus 10 ~~Cy~~~~~~--~~~~~~tw~~A~~~C~~~g~~LasI~s~~E~~~i~~~i~~~~~~~~~~WIGl~~~~~~~--~~~~~~- 84 (149)
T cd03595 10 KPCYKIAYFQ--DSRRRLNFEEARQACREDGGELLSIESENEQKLIERFIQTLRASDGDFWIGLRRSSQYN--VTSSAC- 84 (149)
T ss_pred CccEEEEEEe--ccccccCHHHHHHHHHHcCCEECccCCHHHHHHHHHHHHhhcCCCCcEEEEeECCCCcc--cccccc-
Confidence 4558643110 12468999999999999999999999999999999988643 3457999999876532 000000
Q ss_pred CCCceEEccCCCccccCCCCCCCCCCCCCCCCCCCCCCccC---------------------CCcCcCCCCCcceEEee
Q psy4323 118 NVNGWFWSGSGAKIGPTTQRNTGDWSATGGFGQAQPDNREA---------------------AQHDVACHHLKPFVCED 175 (191)
Q Consensus 118 ~~~~w~W~~dGs~~~~~~~~~y~nW~~~~~~~~~qP~~~~~---------------------~~~d~~C~~~~~FICe~ 175 (191)
.+.|.|+ ||+++ .|.+|.+ ++|++... .-+|..|+.+++||||+
T Consensus 85 -~~~~~W~-dG~~~------~y~~W~~------~eP~~~~~~Cv~l~~~~~~~~~~~~~~~~~W~d~~C~~~~~fICe~ 149 (149)
T cd03595 85 -SSLYYWL-DGSIS------TFRNWYV------DEPSCGSEVCVVMYHQPSAPAGQGGPYLFQWNDDNCNMKNNFICKY 149 (149)
T ss_pred -CCccEEc-CCCcc------CccCCCC------CCCCCcccCCEEEEecCCCCcCcCcccCCCccCCCCCCCcccccCC
Confidence 1369998 99998 5779999 88874311 01688999999999985
No 16
>cd03600 CLECT_thrombomodulin_like C-type lectin-like domain (CTLD) of the type found in human thrombomodulin(TM), Endosialin, C14orf27, and C1qR. CLECT_thrombomodulin_like: C-type lectin-like domain (CTLD) of the type found in human thrombomodulin(TM), Endosialin, C14orf27, and C1qR. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. In these thrombomodulin-like proteins the residues involved in coordinating Ca2+ in the classical MBP-A CTLD are not conserved. TM exerts anti-fibrinolytic and anti-inflammatory activity. TM also regulates blood coagulation in the anticoagulant protein C pathway. In this pathway, the procoagulant properties of thrombin (T) are lost when it binds TM. TM also plays a key role in tumor biology. It is expressed on endothelial cells and on several types of tumor cell including squamous cell carcinoma. Loss of TM expression correlates with advanced stage and poor prognosis. Loss of function of TM func
Probab=99.82 E-value=1.3e-19 Score=139.15 Aligned_cols=113 Identities=17% Similarity=0.412 Sum_probs=81.9
Q ss_pred ceeEEEeccCCCCCCcccCHHHHHHHHHHCCCeEeEecCHHHHHHHHHHHHcC------CCCcEEEceeeCCCCCCCCCC
Q psy4323 40 THSYFFSWEHAPTRSLEVDWLDARNICRRHCMDAVSLETPQENEFVKQRITRG------NVRYIWTSGRKCNFNGCDRPD 113 (191)
Q Consensus 40 ~~~Y~fs~~~~~~~~~~~sW~~A~~~C~~~gg~LasI~s~~E~~~i~~~l~~~------~~~~~WIGl~~~~~~gc~~~~ 113 (191)
++||.++ .++++|.+|+.+|+.+||+||+|+|++|+++|..++... ....+||||++...+ |..+.
T Consensus 4 ~~Cy~~~-------~~~~sw~~A~~~C~~~gg~La~i~s~~E~~~v~~~l~~~~~~~~~~~~~~WIGl~~~~~~-~~~~~ 75 (141)
T cd03600 4 DACYTLH-------PQKLTFLEAQRSCIELGGNLATVRSGEEADVVSLLLAAGPGRHGRGSLRLWIGLQREPRQ-CSDPS 75 (141)
T ss_pred CceEEEe-------CCccCHHHHHHHHHhhCCEeeecCCHHHHHHHHHHHhhccccccCCCccEEEeEecCccc-Ccccc
Confidence 4668876 468999999999999999999999999999999999765 245799999984321 10000
Q ss_pred CCCCCCCceEEccCCCccccCCCCCCCCCCCCCCCCCCCCCCccCC-----------------CcCcCCCCCc-ceEEee
Q psy4323 114 LQPANVNGWFWSGSGAKIGPTTQRNTGDWSATGGFGQAQPDNREAA-----------------QHDVACHHLK-PFVCED 175 (191)
Q Consensus 114 ~~~~~~~~w~W~~dGs~~~~~~~~~y~nW~~~~~~~~~qP~~~~~~-----------------~~d~~C~~~~-~FICe~ 175 (191)
...+.|.|+ ||.... .|.+|.+ + |++.... -+|.+|+.++ +||||+
T Consensus 76 ---~~~~~f~W~-d~~~~~-----~y~~W~~------~-p~n~~~~~~Cv~l~~~~~~~~~~~W~d~~C~~~~~~fIC~~ 139 (141)
T cd03600 76 ---LPLRGFSWV-TGDQDT-----DFSNWLQ------E-PAGTCTSPRCVALSAAGSTPDNLKWKDGPCSARADGYLCKF 139 (141)
T ss_pred ---ccCCccEEC-CCCCCC-----Ccccccc------C-CCCCCCCCccEEEEccCCCCCCCccccCCcCCCCCCeEEee
Confidence 012379998 775421 5789998 5 4432110 1788999985 799996
Q ss_pred c
Q psy4323 176 S 176 (191)
Q Consensus 176 ~ 176 (191)
.
T Consensus 140 ~ 140 (141)
T cd03600 140 S 140 (141)
T ss_pred e
Confidence 3
No 17
>smart00034 CLECT C-type lectin (CTL) or carbohydrate-recognition domain (CRD). Many of these domains function as calcium-dependent carbohydrate binding modules.
Probab=99.79 E-value=1.1e-18 Score=128.17 Aligned_cols=103 Identities=31% Similarity=0.633 Sum_probs=82.5
Q ss_pred cceeEEEeccCCCCCCcccCHHHHHHHHHHCCCeEeEecCHHHHHHHHHHHHcC--CCCcEEEceeeCCCCCCCCCCCCC
Q psy4323 39 VTHSYFFSWEHAPTRSLEVDWLDARNICRRHCMDAVSLETPQENEFVKQRITRG--NVRYIWTSGRKCNFNGCDRPDLQP 116 (191)
Q Consensus 39 ~~~~Y~fs~~~~~~~~~~~sW~~A~~~C~~~gg~LasI~s~~E~~~i~~~l~~~--~~~~~WIGl~~~~~~gc~~~~~~~ 116 (191)
.+.||+++. ..++|.+|+.+|+.+||+||+|++++|+++|..++... ....+||||++....+
T Consensus 9 ~~~Cy~~~~-------~~~~~~~A~~~C~~~~~~La~i~~~~e~~~i~~~~~~~~~~~~~~WiG~~~~~~~~-------- 73 (126)
T smart00034 9 GGKCYKFST-------EKKTWADAQAFCQSLGAHLASIHSEAENDFVASLLKNSGSNSDYYWIGLSDPDSNG-------- 73 (126)
T ss_pred CCEEEEEEC-------CccCHHHHHHHHHhcCCEEcccCCHHHHHHHHHHHHhhcCCCCCEEEecCccCcCC--------
Confidence 356699873 56999999999999999999999999999999999864 3468999999855544
Q ss_pred CCCCceEEccCCCc-cccCCCCCCCCCCCCCCCCCCCCCCccC------------CCcCcCCCCCcceEEe
Q psy4323 117 ANVNGWFWSGSGAK-IGPTTQRNTGDWSATGGFGQAQPDNREA------------AQHDVACHHLKPFVCE 174 (191)
Q Consensus 117 ~~~~~w~W~~dGs~-~~~~~~~~y~nW~~~~~~~~~qP~~~~~------------~~~d~~C~~~~~FICe 174 (191)
.|+|+ ||.+ + .|.+|.+ + |+.... .-.+.+|...++||||
T Consensus 74 ----~~~W~-dg~~~~------~~~~w~~------~-~~~~~~~~C~~~~~~~~~~w~~~~C~~~~~~ICe 126 (126)
T smart00034 74 ----SWQWS-DGSGPV------NYSNWAP------G-EPNGGSGDCVVLSTSGGGKWNDVSCTSKLPFVCE 126 (126)
T ss_pred ----CeEEC-CCCCCC------CccccCC------C-CCCCCCCCCEEEecCCCCcccCCCCCCCcccccC
Confidence 89998 9998 4 5779998 4 211111 1178899999999997
No 18
>TIGR00864 PCC polycystin cation channel protein. Note: this model has been restricted to the amino half for technical reasons.
Probab=99.78 E-value=9.7e-19 Score=178.86 Aligned_cols=119 Identities=17% Similarity=0.249 Sum_probs=95.9
Q ss_pred ceecCCC----CcceeEEEeccCCCCCCcccCHHHHHHHHHHC-CCeEeEecCHHHHHHHHHHHHcCCCCcEEEceeeCC
Q psy4323 31 STYRDAR----GVTHSYFFSWEHAPTRSLEVDWLDARNICRRH-CMDAVSLETPQENEFVKQRITRGNVRYIWTSGRKCN 105 (191)
Q Consensus 31 ~~~~~g~----~~~~~Y~fs~~~~~~~~~~~sW~~A~~~C~~~-gg~LasI~s~~E~~~i~~~l~~~~~~~~WIGl~~~~ 105 (191)
-.|+.|+ ..++||.|+ .++++|.+|+.+|+++ ||+||+|+|++|++||.+++++.....+||||+|..
T Consensus 316 ~~CP~GW~~f~~~g~CYk~~-------~e~~TW~dAe~~C~s~GGAhLAsI~S~eEn~FL~~lv~~s~~~~vWIGLsD~~ 388 (2740)
T TIGR00864 316 PHCPKDGEIFEENGHCFQIV-------PEEAAWLDAQEQCLARAGAALAIVDNDALQNFLARKVTHSLDRGVWIGFSDVN 388 (2740)
T ss_pred CCCCCCCeecCCCCEEEEEe-------CCccCHHHHHHHhhccCCeEEecCCCHHHHHHHHHHhhccCCccEEEeeeCCC
Confidence 3555655 146889987 4689999999999999 599999999999999999987654446999999988
Q ss_pred CCCCCCCCCCCCCCCceEEccCCCccccCCCCCCCCCCCCCCCCCCCCCCccCC---------C-cCcCCCCCcceEEee
Q psy4323 106 FNGCDRPDLQPANVNGWFWSGSGAKIGPTTQRNTGDWSATGGFGQAQPDNREAA---------Q-HDVACHHLKPFVCED 175 (191)
Q Consensus 106 ~~gc~~~~~~~~~~~~w~W~~dGs~~~~~~~~~y~nW~~~~~~~~~qP~~~~~~---------~-~d~~C~~~~~FICe~ 175 (191)
.+| .+.|+|+ ||+++. .|.+|.+ +||++.... . +|..|..+++||||+
T Consensus 389 ~EG----------~g~WvWs-DGS~l~-----~YtnW~p------GEPNn~~~EdCV~l~~~g~WND~~Cs~~~~FICE~ 446 (2740)
T TIGR00864 389 GAE----------KGPAHQG-EAFEAE-----ECEEGLA------GEPHPARAEHCVRLDPRGQCNSDLCNAPHAYVCEL 446 (2740)
T ss_pred CCC----------ceeeEEC-CCCccc-----CccCcCC------CCCCCCCCCCCEEEcCCCCEEccCCCCCeeEEeEE
Confidence 766 2249998 999873 4779999 999873321 1 899999999999998
Q ss_pred cch
Q psy4323 176 SDE 178 (191)
Q Consensus 176 ~~~ 178 (191)
.+.
T Consensus 447 ~~~ 449 (2740)
T TIGR00864 447 NPG 449 (2740)
T ss_pred CCC
Confidence 754
No 19
>PHA02642 C-type lectin-like protein; Provisional
Probab=99.78 E-value=2.5e-18 Score=141.07 Aligned_cols=112 Identities=14% Similarity=0.212 Sum_probs=85.8
Q ss_pred cceecCCC--CcceeEEEeccCCCCCCcccCHHHHHHHHHHCCCeEeEecCHHHHHHHHHHHHcCCCCcEEEceeeCCCC
Q psy4323 30 HSTYRDAR--GVTHSYFFSWEHAPTRSLEVDWLDARNICRRHCMDAVSLETPQENEFVKQRITRGNVRYIWTSGRKCNFN 107 (191)
Q Consensus 30 ~~~~~~g~--~~~~~Y~fs~~~~~~~~~~~sW~~A~~~C~~~gg~LasI~s~~E~~~i~~~l~~~~~~~~WIGl~~~~~~ 107 (191)
...|+.|+ -.++||+|+. +.++|.+|+.+|+++||+||+|++++|++||+++... ..+||||++...+
T Consensus 85 ~~~CP~gW~~~~~kCYyfs~-------~~ksW~eA~~~C~s~ga~La~I~seeE~~FL~~~~~~---~~yWIGLsd~~~e 154 (216)
T PHA02642 85 YVTCPKGWIGFGYKCFYFSE-------DSKNWTFGNTFCTSLGATLVKVETEEELNFLKRYKDS---SDHWIGLNRESSN 154 (216)
T ss_pred cCCCCCcCEEECCEEEEEeC-------cccCHHHHHHHHhhCCCeEeeECCHHHHHHHHHhhcC---CeEEEEeEeCCCC
Confidence 33454443 1357899984 5789999999999999999999999999999987542 4699999998877
Q ss_pred CCCCCCCCCCCCCceEEccCCCccccCCCCCCCCCCCCCCCCCCCCCCccC----CCcCcCCCCCcceEEeecc
Q psy4323 108 GCDRPDLQPANVNGWFWSGSGAKIGPTTQRNTGDWSATGGFGQAQPDNREA----AQHDVACHHLKPFVCEDSD 177 (191)
Q Consensus 108 gc~~~~~~~~~~~~w~W~~dGs~~~~~~~~~y~nW~~~~~~~~~qP~~~~~----~~~d~~C~~~~~FICe~~~ 177 (191)
+ .|+|+ ||+++. ..+|.. +. .+... ...+..|...+++||++..
T Consensus 155 ~------------~W~Wv-DGS~~n------~~~~i~------G~-g~CAyLs~~~i~s~~C~~~~~wIC~K~l 202 (216)
T PHA02642 155 H------------PWKWA-DNSNYN------ASFVIT------GT-GECAYLNDIRISSSRVYANRKWICSKTY 202 (216)
T ss_pred C------------ceEEC-CCCccC------cceecc------CC-CceEEEeCCceEccCcCCCceEEeeeec
Confidence 7 99998 999874 446655 21 11110 1189999999999999864
No 20
>PHA02953 IEV and EEV membrane glycoprotein; Provisional
Probab=99.73 E-value=2e-17 Score=131.91 Aligned_cols=104 Identities=13% Similarity=0.207 Sum_probs=75.4
Q ss_pred cceeEEEeccCCCCCCcccCHHHHHHHHHHCCCeEeEecCHHHHHHHHHHHHcCCCCcEEEceeeCCCCCCCCCCCCCCC
Q psy4323 39 VTHSYFFSWEHAPTRSLEVDWLDARNICRRHCMDAVSLETPQENEFVKQRITRGNVRYIWTSGRKCNFNGCDRPDLQPAN 118 (191)
Q Consensus 39 ~~~~Y~fs~~~~~~~~~~~sW~~A~~~C~~~gg~LasI~s~~E~~~i~~~l~~~~~~~~WIGl~~~~~~gc~~~~~~~~~ 118 (191)
.++||.++ .++++|.+|+.+|+.+||+|++|+ +|+.||..+.. ...+||||+| .+|
T Consensus 55 ~~~CYk~f-------~~~~tW~~A~~~C~~~Gg~L~~~~--~e~~fv~~~~~---~~~~WIGL~d--~eg---------- 110 (170)
T PHA02953 55 DNYCYLDT-------NIQLSTYGAVYLCNKYRARLPKPN--FRHLKVLSLTY---GKDFWVSLKK--KNN---------- 110 (170)
T ss_pred CCEEEEEE-------CCcCCHHHHHHHHHhcCCCCCCCc--HHHHHHHHhcc---CCCEEEeEEC--CCC----------
Confidence 36779987 367999999999999999998877 67788876643 2369999998 555
Q ss_pred CCceEEccCC-CccccCCCCCCCCCCCCCCCCCCCCCCccCC--------CcCcCCCCCcceEEeec
Q psy4323 119 VNGWFWSGSG-AKIGPTTQRNTGDWSATGGFGQAQPDNREAA--------QHDVACHHLKPFVCEDS 176 (191)
Q Consensus 119 ~~~w~W~~dG-s~~~~~~~~~y~nW~~~~~~~~~qP~~~~~~--------~~d~~C~~~~~FICe~~ 176 (191)
.|+|. || +.+++++-..+.+|.+ +|++.... -+|..|...++||||+.
T Consensus 111 --~W~w~-Dggs~~~y~~~~~~~~w~~-------~~~~~~e~C~~~~~~~W~d~~C~~~~~fICqk~ 167 (170)
T PHA02953 111 --RWLDI-NTNKTVDMNKNTELKKIKS-------KTKNDNEACYIYKSGELKETVCNSVNYIICVKR 167 (170)
T ss_pred --ceEeC-CCCeeeccccccccccccC-------CCCCCCCCceEEeCCeEEeccCCCCcEEEEEEe
Confidence 99998 75 6665441112455654 33331111 18999999999999985
No 21
>PHA03097 C-type lectin-like protein; Provisional
Probab=99.70 E-value=1.4e-16 Score=125.58 Aligned_cols=97 Identities=18% Similarity=0.157 Sum_probs=73.6
Q ss_pred cceeEEEeccCCCCCCcccCHHHHHHHHHHCCCeEeEecCHHHHHHHHHHHHcCCCCcEEEceeeCCCCCCCCCCCCCCC
Q psy4323 39 VTHSYFFSWEHAPTRSLEVDWLDARNICRRHCMDAVSLETPQENEFVKQRITRGNVRYIWTSGRKCNFNGCDRPDLQPAN 118 (191)
Q Consensus 39 ~~~~Y~fs~~~~~~~~~~~sW~~A~~~C~~~gg~LasI~s~~E~~~i~~~l~~~~~~~~WIGl~~~~~~gc~~~~~~~~~ 118 (191)
.++||+|+. .+++|.+|+.+|+++||+||+|++++|++||.+++. ...+||||++..
T Consensus 54 ~~~CY~~s~-------~~~sW~~A~~~C~~~g~~La~I~~~~E~~fi~~~~~---~~~~WIGL~d~~------------- 110 (157)
T PHA03097 54 NNKCYTFSE-------NITNKHLAIERCADMDGILTLIDDQKEVLFVSRYKG---GQDLWIGIEKKK------------- 110 (157)
T ss_pred CCEEEEEec-------CCCcHHHHHHHHHhCCCEEeeeCCHHHHHHHHHhcC---CCCEEEeeecCC-------------
Confidence 357799984 578999999999999999999999999999998764 246999998853
Q ss_pred CCceEEccCCCccccCCCCCCCCCCCCCCCCCCCCCCccCC------CcCcCCCCCcceEEeecc
Q psy4323 119 VNGWFWSGSGAKIGPTTQRNTGDWSATGGFGQAQPDNREAA------QHDVACHHLKPFVCEDSD 177 (191)
Q Consensus 119 ~~~w~W~~dGs~~~~~~~~~y~nW~~~~~~~~~qP~~~~~~------~~d~~C~~~~~FICe~~~ 177 (191)
. |+ ||+++. .+|.+ + +.+.... -+|.+|...++||||+..
T Consensus 111 --~--W~-dgs~~~-------~~~~~------~-~~~e~Cv~i~~~~w~d~~C~~~~~~ICek~~ 156 (157)
T PHA03097 111 --G--DD-DDREVL-------DKVVK------P-PKSGKCAYLKDKTIISSNCNATKGWICFDRL 156 (157)
T ss_pred --C--cc-CCCccc-------ccccC------C-CCCCCEEEEECCcEEeCCCCCCeeEEEeecC
Confidence 3 87 988652 24433 1 1111110 189999999999999863
No 22
>cd00037 CLECT C-type lectin (CTL)/C-type lectin-like (CTLD) domain. CLECT: C-type lectin (CTL)/C-type lectin-like (CTLD) domain; protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. This group is chiefly comprised of eukaryotic CTLDs, but contains some, as yet functionally uncharacterized, bacterial CTLDs. Many CTLDs are calcium-dependent carbohydrate binding modules; other CTLDs bind protein ligands, lipids, and inorganic surfaces, including CaCO3 and ice. Animal C-type lectins are involved in such functions as extracellular matrix organization, endocytosis, complement activation, pathogen recognition, and cell-cell interactions. For example: mannose-binding lectin and lung surfactant proteins A and D bind carbohydrates on surfaces (e.g. pathogens, allergens, necrotic, and apoptotic cells) and mediate functions associated with killing and phagocytosis; P (platelet)-, E (endothelial)-, and L (leukocyte)- selectins (sels) mediate the initia
Probab=99.67 E-value=6.8e-16 Score=110.56 Aligned_cols=103 Identities=25% Similarity=0.635 Sum_probs=81.4
Q ss_pred eEEEeccCCCCCCcccCHHHHHHHHHHCCCeEeEecCHHHHHHHHHHHHcCCCCcEEEceeeCCCCCCCCCCCCCCCCCc
Q psy4323 42 SYFFSWEHAPTRSLEVDWLDARNICRRHCMDAVSLETPQENEFVKQRITRGNVRYIWTSGRKCNFNGCDRPDLQPANVNG 121 (191)
Q Consensus 42 ~Y~fs~~~~~~~~~~~sW~~A~~~C~~~gg~LasI~s~~E~~~i~~~l~~~~~~~~WIGl~~~~~~gc~~~~~~~~~~~~ 121 (191)
||.++ ..+++|.+|+..|+++||.|++|++.+|+++|..++.......+|||+.+....+ .
T Consensus 2 Cy~~~-------~~~~~~~~A~~~C~~~~~~L~~~~~~~e~~~i~~~~~~~~~~~~wvg~~~~~~~~------------~ 62 (116)
T cd00037 2 CYKFS-------TEKLTWEEAQEYCRSLGGHLASIHSEEENDFLASLLKKSSSSDVWIGLNDLSSEG------------T 62 (116)
T ss_pred CEEEc-------CCccCHHHHHHHHHHcCCEEcccCCHHHHHHHHHHHhCCCCCCEEEcccccCcCC------------C
Confidence 57775 3489999999999999999999999999999999987544468999999876444 8
Q ss_pred eEEccCCCccccCCCCCCCCCCCCCCCCCCCCC-CccCC-----------CcCcCCCCCcceEEee
Q psy4323 122 WFWSGSGAKIGPTTQRNTGDWSATGGFGQAQPD-NREAA-----------QHDVACHHLKPFVCED 175 (191)
Q Consensus 122 w~W~~dGs~~~~~~~~~y~nW~~~~~~~~~qP~-~~~~~-----------~~d~~C~~~~~FICe~ 175 (191)
|.|+ ||+... .+.+|.+ ++|. ..... ..+..|...++||||+
T Consensus 63 ~~~~-~~~~~~-----~~~~w~~------~~~~~~~~~~C~~~~~~~~~~~~~~~C~~~~~~iC~~ 116 (116)
T cd00037 63 WKWS-DGSPLV-----DYTNWAP------GEPNPGGSEDCVVLSSSSDGKWNDVSCSSKLPFICEK 116 (116)
T ss_pred eEec-CCCccc-----cccCCCC------CCcCCCCCCCeeEEccCCCCCccCCCCCCCceeeecC
Confidence 9998 888731 5778988 6651 11110 1788999999999985
No 23
>PF00059 Lectin_C: Lectin C-type domain; InterPro: IPR001304 Lectins occur in plants, animals, bacteria and viruses. Initially described for their carbohydrate-binding activity [], they are now recognised as a more diverse group of proteins, some of which are involved in protein-protein, protein-lipid or protein-nucleic acid interactions []. There are at least twelve structural families of lectins: C-type lectins, which are Ca2+-dependent. S-type (galectins), a widespread family of glycan-binding proteins []. I-type, which have an immunoglobulin-like fold and can recognise sialic acids, other sugars and glycosaminoglycans []. P-type, which bind phosphomannosyl receptors []. Pentraxins []. (Trout) egg lectins. Calreticulin and calnexin, which act as molecular chaperones of the endoplasmic reticulum []. ERGIC-53 and VIP-36 []. Discoidins []. Eel agglutinins (fucolectins) []. Annexin lectins []. Fibrinogen-type lectins, which includes ficolins, tachylectins 5A and 5B, and Limax flavus (Spotted garden slug) agglutinin (these proteins have clear distinctions from one another, but they share a homologous fibrinogen-like domain used for carbohydrate binding). Also unclassified orphan lectins, including amphoterin, Cel-II, complement factor H, thrombospondin, sialic acid-binding lectins, adherence lectin, and cytokines (such as tumour necrosis factor and several interleukins). C-type lectins can be further divided into seven subgroups based on additional non-lectin domains and gene structure: (I) hyalectans, (II) asialoglycoprotein receptors, (III) collectins, (IV) selectins, (V) NK group transmembrane receptors, (VI) macrophage mannose receptors, and (VII) simple (single domain) lectins []. Therefore, lectins are a diverse group of proteins, both in terms of structure and activity. 
Carbohydrate binding ability may have evolved independently and sporadically in numerous unrelated families, where each evolved a structure that was conserved to fulfil some other activity and function. In general, animal lectins act as recognition molecules within the immune system, their functions involving defence against pathogens, cell trafficking, immune regulation and the prevention of autoimmunity [].; GO: 0005488 binding; PDB: 1T8D_A 2H2T_B 1T8C_A 2H2R_A 1TN3_A 1RJH_A 1HTN_A 3G8K_B 2E3X_B 1UMR_D ....
Probab=99.67 E-value=1.5e-16 Score=113.54 Aligned_cols=96 Identities=24% Similarity=0.610 Sum_probs=72.4
Q ss_pred cccCHHHHHHHHHHCCCeEeEecCHHHHHHHHHHHHcCCCCcEEEceeeCCCCCCCCCCCCCCCCCceEEccCCCccccC
Q psy4323 55 LEVDWLDARNICRRHCMDAVSLETPQENEFVKQRITRGNVRYIWTSGRKCNFNGCDRPDLQPANVNGWFWSGSGAKIGPT 134 (191)
Q Consensus 55 ~~~sW~~A~~~C~~~gg~LasI~s~~E~~~i~~~l~~~~~~~~WIGl~~~~~~gc~~~~~~~~~~~~w~W~~dGs~~~~~ 134 (191)
++++|.+|+.+|+.+||+||.|.+.+|+++|.+++. ....+|||+ +....+ .|.|+ +|......
T Consensus 1 e~~~~~~A~~~C~~~~~~L~~i~~~~e~~~i~~~~~--~~~~~Wig~-~~~~~~------------~~~w~-~~~~~~~~ 64 (105)
T PF00059_consen 1 EPMTWEEAQQYCQSMGAHLASINSEEENDFIQSQLK--SNESYWIGL-DSDNNG------------TWKWI-DGSPNSPE 64 (105)
T ss_dssp EEEEHHHHHHHHHHTTSEEB-GSSHHHHHHHHHHHH--SSSEEEEEE-ESSSTS------------EEEET-TSSBSSST
T ss_pred CCCCHHHHHHHHhcCCCEEeEeCCHHHhhhhhhccc--ccceeeeee-eccccc------------eeccc-cCCCcccc
Confidence 468999999999999999999999999999999998 456899999 544433 89998 88876432
Q ss_pred CCCCCCCCCCCCCCCCCCCCCccC--C-------CcCcCCCCCcceEEee
Q psy4323 135 TQRNTGDWSATGGFGQAQPDNREA--A-------QHDVACHHLKPFVCED 175 (191)
Q Consensus 135 ~~~~y~nW~~~~~~~~~qP~~~~~--~-------~~d~~C~~~~~FICe~ 175 (191)
..|.+|.. .+..... . -.+.+|..+++||||+
T Consensus 65 --~~~~~w~~-------~~~~~~C~~~~~~~~~~w~~~~C~~~~~fiCek 105 (105)
T PF00059_consen 65 --NFYTNWNP-------PNDSENCAYIYYSSSGKWNDVPCSEKYPFICEK 105 (105)
T ss_dssp --TBSGCBSS-------GGSSEEEEEEGCSTTTEEEEEETTSEEEEEEEE
T ss_pred --cccccccc-------CCCCCCeEEEEEcCCCeEEeeCCCCCeeEEeeC
Confidence 11566722 2221111 0 0899999999999996
No 24
>PHA02867 C-type lectin protein; Provisional
Probab=99.53 E-value=3.3e-14 Score=112.77 Aligned_cols=111 Identities=12% Similarity=0.158 Sum_probs=75.7
Q ss_pred cCCCCCccccceecCCCCcceeEEEeccCCCCCCcccCHHHHHHHHHHCCCeEeEecCHHHHHHHHHHHHcCCCCcEEEc
Q psy4323 21 EHAPTRRVRHSTYRDARGVTHSYFFSWEHAPTRSLEVDWLDARNICRRHCMDAVSLETPQENEFVKQRITRGNVRYIWTS 100 (191)
Q Consensus 21 ~~~~~~~~~~~~~~~g~~~~~~Y~fs~~~~~~~~~~~sW~~A~~~C~~~gg~LasI~s~~E~~~i~~~l~~~~~~~~WIG 100 (191)
++.+...+..|... .++||+||. ++++|.+|+..|+.+||+||+|++++|++||.++.+ ..+|||
T Consensus 43 ~~~~~~CP~gWi~~----~~~CY~fs~-------~~~tW~~A~~~C~~~ga~La~I~s~eE~~Fl~~~~~----~~~WIG 107 (167)
T PHA02867 43 PYFSKVCPDEWIGY----NSKCYYFTI-------NETNWNDSKKLCDVMDSSLIRFDNIETLNFVSRYGK----GSYWID 107 (167)
T ss_pred CCcCCCCCCCCEEE----CCEEEEEec-------cccCHHHHHHHHhhCCCEECCcCCHHHHHHHHHcCC----CCEEEE
Confidence 44444444444332 246699984 688999999999999999999999999999987632 469999
Q ss_pred eeeCCCCCCCCCCCCCCCCCceEEccCCCccccCCCCCCCCCCCCCCCCCCCCCCcc----CCCcCcCCCCCcceEEeec
Q psy4323 101 GRKCNFNGCDRPDLQPANVNGWFWSGSGAKIGPTTQRNTGDWSATGGFGQAQPDNRE----AAQHDVACHHLKPFVCEDS 176 (191)
Q Consensus 101 l~~~~~~gc~~~~~~~~~~~~w~W~~dGs~~~~~~~~~y~nW~~~~~~~~~qP~~~~----~~~~d~~C~~~~~FICe~~ 176 (191)
|++.. .+.|. +++.+ +.. ....+.. ....+.+|.....+||+|+
T Consensus 108 Ls~~~---------------~~~~~-~~s~~----------~~~------~~~~~Ca~i~~~~i~s~~C~~~~~wIC~K~ 155 (167)
T PHA02867 108 INQNR---------------KIPGI-NFSLY----------YEQ------GVNDICLLFDTSNIIEMSCIFHERTICVKE 155 (167)
T ss_pred EEeCC---------------CCCCc-cCcee----------eec------CCCCcEEEEeCCeEEeecccCCcEEEEEcc
Confidence 99864 23343 33321 111 1111111 0118899999999999998
Q ss_pred ch
Q psy4323 177 DE 178 (191)
Q Consensus 177 ~~ 178 (191)
..
T Consensus 156 ~~ 157 (167)
T PHA02867 156 DR 157 (167)
T ss_pred Cc
Confidence 65
No 25
>PHA02911 C-type lectin-like protein; Provisional
Probab=99.20 E-value=1.4e-10 Score=93.63 Aligned_cols=101 Identities=15% Similarity=0.222 Sum_probs=68.8
Q ss_pred ceeEEEeccCCCCCCcccCHHHHHHHHHHCCCeEeEecCHHHHHHHHHHHHcCCCCcEEEceeeCCCCCCCCCCCCCCCC
Q psy4323 40 THSYFFSWEHAPTRSLEVDWLDARNICRRHCMDAVSLETPQENEFVKQRITRGNVRYIWTSGRKCNFNGCDRPDLQPANV 119 (191)
Q Consensus 40 ~~~Y~fs~~~~~~~~~~~sW~~A~~~C~~~gg~LasI~s~~E~~~i~~~l~~~~~~~~WIGl~~~~~~gc~~~~~~~~~~ 119 (191)
++||+||.. ....+|..|+.+|..+|++|+. .+.+|++||+++.+......|||||+..+. +....
T Consensus 112 nkCYlFS~~-----s~sWsW~~Srr~C~~kGATLLk-~sdEELeFIqn~wkGks~~~fWID~r~ngS--------t~n~s 177 (213)
T PHA02911 112 GICLLSLGE-----EVGFRMEIAKRFCEKKDADLIG-KIDEEKKALENIWTGNDHSRFWIDNRAAAS--------TFDPV 177 (213)
T ss_pred CEEEEEecc-----ccccchhHHHHHHHhcCCEecc-CcHHHHHHHHHHHhccccceEEecCccccc--------eecCC
Confidence 566999842 2355679999999999999999 888999999987765545678988864321 11112
Q ss_pred CceEEccCCCccccCCCCCCCCCCCCCCCCCCCCCCccCCCcCcCCCCCcceEEeecch
Q psy4323 120 NGWFWSGSGAKIGPTTQRNTGDWSATGGFGQAQPDNREAAQHDVACHHLKPFVCEDSDE 178 (191)
Q Consensus 120 ~~w~W~~dGs~~~~~~~~~y~nW~~~~~~~~~qP~~~~~~~~d~~C~~~~~FICe~~~~ 178 (191)
+...|+ .+..+. .. +...+.+|+....+||||.++
T Consensus 178 ~sCA~I-s~~~~~---------------------~~--~~V~sesCs~~~~wICqK~~~ 212 (213)
T PHA02911 178 NECAIG-TQNHIP---------------------EV--PEVLKSPCDERHSFICIKKDN 212 (213)
T ss_pred CCeEEE-Eccccc---------------------CC--CceEccccCCCceEEEEeccC
Confidence 344554 332110 00 011668999999999999763
No 26
>PF05473 Herpes_UL45: UL45 protein; InterPro: IPR008646 This family consists of several UL45 proteins and homologues found in the herpes simplex virus family. The herpes simplex virus UL45 gene encodes an 18 kDa virion envelope protein whose function remains unknown. It has been suggested that the 18 kDa UL45 gene product is required for efficient growth in the central nervous system at low doses and may play an important role under the conditions of a naturally acquired infection []. The Equine herpesvirus 1 UL45 protein represents a type II membrane glycoprotein which has been found to be non-essential for EHV-1 growth in vitro but deletion reduces the virus's replication efficiency [].
Probab=97.70 E-value=0.00011 Score=60.11 Aligned_cols=44 Identities=20% Similarity=0.285 Sum_probs=35.5
Q ss_pred ceeEEEeccCCCCCCccc-CHHHHHHHHHHCCCeEeEecCHHHHHHHHHHHH
Q psy4323 40 THSYFFSWEHAPTRSLEV-DWLDARNICRRHCMDAVSLETPQENEFVKQRIT 90 (191)
Q Consensus 40 ~~~Y~fs~~~~~~~~~~~-sW~~A~~~C~~~gg~LasI~s~~E~~~i~~~l~ 90 (191)
.+||+|+. ..+ +|.+|+..|+++++.|+.+.+......|...+.
T Consensus 94 ~~Cy~~~~-------~~~~t~~eA~~~C~~~~s~L~~~~~~~~L~~ll~~~~ 138 (200)
T PF05473_consen 94 NSCYRFSN-------SPKKTWEEARNICAAYNSTLANVNNAKSLLELLDVLN 138 (200)
T ss_pred CEEEEEeC-------CCCcCHHHHHHHHHhcCCcCCCchhHHHHHHHHHHhc
Confidence 56799984 345 999999999999999999988777666666543
No 27
>cd03519 Link_domain_HAPLN_module_2 Link_domain_HAPLN_module_2; this link domain is found in the second link module of proteins similar to the vertebrate HAPLN (hyaluronan/HA and proteoglycan binding link) protein family which includes cartilage link protein. The link domain is a HA-binding domain. HAPLNs contain two contiguous link modules. Both link modules of cartilage link protein are involved in interaction with HA. In cartilage, a chondroitin sulfate proteoglycan core protein (CSPG) aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggregates with other CSPGs substituting for aggrecan may contribute to the structural integrity of many different tissues. Members of the vertebrate HAPLN gene family are physically linked adjacent to CSPG genes.
Probab=94.09 E-value=0.044 Score=39.37 Aligned_cols=24 Identities=13% Similarity=0.268 Sum_probs=21.9
Q ss_pred CcccCHHHHHHHHHHCCCeEeEec
Q psy4323 54 SLEVDWLDARNICRRHCMDAVSLE 77 (191)
Q Consensus 54 ~~~~sW~~A~~~C~~~gg~LasI~ 77 (191)
..++|+.+|+..|+..|+.||++.
T Consensus 8 ~~~l~f~eA~~aC~~~ga~lAs~~ 31 (91)
T cd03519 8 PGKLTFSEAVAACQRDGAQIAKVG 31 (91)
T ss_pred ccccCHHHHHHHHHHcCCEeCCHH
Confidence 578999999999999999999874
No 28
>cd01102 Link_Domain The link domain is a hyaluronan (HA)-binding domain. It functions to mediate adhesive interactions during inflammatory leukocyte homing and tumor metastasis. It is found in the CD44 receptor and in human TSG-6. TSG-6 is the protein product of the tumor necrosis factor-stimulated gene-6. TSG-6 has a strong anti-inflammatory effect in models of acute inflammation and autoimmune arthritis and plays an essential role in female fertility. This group also contains the link domains of the chondroitin sulfate proteoglycan core proteins (CSPG) including aggrecan, versican, neurocan, and brevican and the link domains of the vertebrate HAPLN (HA and proteoglycan binding link) protein family. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggregates in which other CSPGs substitute for aggrecan might contribute to the structural integrity of many different tissues. Members of
Probab=91.59 E-value=0.14 Score=36.84 Aligned_cols=25 Identities=8% Similarity=0.190 Sum_probs=21.9
Q ss_pred CCcccCHHHHHHHHHHCCCeEeEec
Q psy4323 53 RSLEVDWLDARNICRRHCMDAVSLE 77 (191)
Q Consensus 53 ~~~~~sW~~A~~~C~~~gg~LasI~ 77 (191)
...++|+.+|+..|+..|+.||+..
T Consensus 10 g~y~l~f~eA~~aC~~~ga~lAs~~ 34 (92)
T cd01102 10 GRYKLTFAEAALACKARGAHLATPG 34 (92)
T ss_pred CCcccCHHHHHHHHHHcCCEeCCHH
Confidence 3567899999999999999999874
No 29
>cd03520 Link_domain_CSPGs_modules_2_4 Link_domain_CSPGs_modules_2_4; this link domain is found in the second and fourth link modules of the chondroitin sulfate proteoglycan core protein (CSPG) aggrecan and, in the second link module of three other CSPGs: versican, neurocan, and brevican. The link domain is a hyaluronan (HA)-binding domain. CSPGs are characterized by an N-terminal globular domain (G1 domain) containing two contiguous link modules (modules 1 and 2). Both link modules of the G1 domain of aggrecan are involved in interaction with HA. Aggrecan in addition contains a second globular domain (G2) having link modules 3 and 4 which lack HA-binding activity. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggregates having other CSPGs substituting for aggrecan may contribute to the structural integrity of many different tissues. Members of the vertebrate HAPLN (hyaluronan/HA and
Probab=91.54 E-value=0.14 Score=37.14 Aligned_cols=24 Identities=17% Similarity=0.203 Sum_probs=21.6
Q ss_pred CcccCHHHHHHHHHHCCCeEeEec
Q psy4323 54 SLEVDWLDARNICRRHCMDAVSLE 77 (191)
Q Consensus 54 ~~~~sW~~A~~~C~~~gg~LasI~ 77 (191)
..++++.+|+..|+..|+.||++.
T Consensus 8 ~~~l~f~eA~~aC~~~ga~lAs~~ 31 (96)
T cd03520 8 PEKFTFQEARAECRSLGAVLATTG 31 (96)
T ss_pred CCCcCHHHHHHHHHHcCCEeCCHH
Confidence 468999999999999999999873
No 30
>smart00445 LINK Link (Hyaluronan-binding).
Probab=91.12 E-value=0.16 Score=36.81 Aligned_cols=24 Identities=17% Similarity=0.426 Sum_probs=21.5
Q ss_pred CcccCHHHHHHHHHHCCCeEeEec
Q psy4323 54 SLEVDWLDARNICRRHCMDAVSLE 77 (191)
Q Consensus 54 ~~~~sW~~A~~~C~~~gg~LasI~ 77 (191)
..++|+.||+..|+..|+.||++.
T Consensus 12 ~y~l~f~eA~~aC~~~ga~lAs~~ 35 (94)
T smart00445 12 RYKLTFAEAREACRAQGATLATVG 35 (94)
T ss_pred CCccCHHHHHHHHHHcCCEeCCHH
Confidence 457899999999999999999874
No 31
>cd03518 Link_domain_HAPLN_module_1 Link_domain_HAPLN_module_1; this link domain is found in the first link module of proteins similar to the vertebrate HAPLN (hyaluronan/HA and proteoglycan binding link) protein family which includes cartilage link protein. The link domain is a HA-binding domain. HAPLNs contain two contiguous link modules. Both link modules of cartilage link protein are involved in interaction with HA. In cartilage, a chondroitin sulfate proteoglycan core protein (CSPG) aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggregates with other CSPGs substituting for aggrecan may contribute to the structural integrity of many different tissues. Members of the vertebrate HAPLN gene family are physically linked adjacent to CSPG genes.
Probab=90.21 E-value=0.19 Score=36.47 Aligned_cols=23 Identities=17% Similarity=0.427 Sum_probs=20.6
Q ss_pred cccCHHHHHHHHHHCCCeEeEec
Q psy4323 55 LEVDWLDARNICRRHCMDAVSLE 77 (191)
Q Consensus 55 ~~~sW~~A~~~C~~~gg~LasI~ 77 (191)
-++|+.+|+..|+..|+.||+..
T Consensus 12 Y~l~f~eA~~aC~~~ga~lAs~~ 34 (95)
T cd03518 12 YNLNFHEAQQACEEQDATLASFE 34 (95)
T ss_pred cccCHHHHHHHHHHcCCeeCCHH
Confidence 46799999999999999999874
No 32
>cd03521 Link_domain_KIAA0527_like Link_domain_KIAA0527_like; this domain is found in the human protein KIAA0527. Sequence-wise, it is highly similar to the link domain. The link domain is a hyaluronan-binding (HA) domain. KIAA0527 contains a single link module. The KIAA0527 gene was originally cloned from human brain tissue.
Probab=90.13 E-value=0.28 Score=35.27 Aligned_cols=24 Identities=21% Similarity=0.144 Sum_probs=21.5
Q ss_pred CcccCHHHHHHHHHHCCCeEeEec
Q psy4323 54 SLEVDWLDARNICRRHCMDAVSLE 77 (191)
Q Consensus 54 ~~~~sW~~A~~~C~~~gg~LasI~ 77 (191)
..++++.+|+..|++.|+.||++.
T Consensus 11 ~y~l~f~eA~~AC~~~gA~lAs~~ 34 (95)
T cd03521 11 SQGLGLRAARQSCASLGARLASAA 34 (95)
T ss_pred ccccCHHHHHHHHHHcCCEeccHH
Confidence 457899999999999999999874
No 33
>PF00193 Xlink: Extracellular link domain; InterPro: IPR000538 The link domain [] is a hyaluronan (HA)-binding region found in proteins of vertebrates that are involved in the assembly of extracellular matrix, cell adhesion, and migration. The structure has been shown [] to consist of two alpha helices and two antiparallel beta sheets arranged around a large hydrophobic core similar to that of C-type lectin. This domain contains four conserved cysteines involved in two disulphide bonds. The link domain has also been termed HABM [] (HA binding module) and PTR [] (proteoglycan tandem repeat). Proteins with such a domain include the proteoglycans aggrecan, brevican, neurocan and versican, which are expressed in the CNS; the cartilage link protein (LP), a proteoglycan that together with HA and aggrecan forms multimolecular aggregates; Tumour necrosis factor-inducible protein TSG-6, which may be involved in cell-cell and cell-matrix interactions during inflammation and tumourigenesis; and CD44 antigen, the main cell surface receptor for HA.; GO: 0005540 hyaluronic acid binding, 0007155 cell adhesion; PDB: 1O7B_T 2PF5_C 1O7C_T 2JCQ_A 2JCR_A 2JCP_A 1UUH_B 1POZ_A 2I83_A.
Probab=89.96 E-value=0.21 Score=35.88 Aligned_cols=23 Identities=22% Similarity=0.362 Sum_probs=18.0
Q ss_pred cccCHHHHHHHHHHCCCeEeEec
Q psy4323 55 LEVDWLDARNICRRHCMDAVSLE 77 (191)
Q Consensus 55 ~~~sW~~A~~~C~~~gg~LasI~ 77 (191)
-++++.+|+..|+.+|+.||+..
T Consensus 12 y~l~f~eA~~~C~~~ga~LAs~~ 34 (92)
T PF00193_consen 12 YKLTFTEAQQACRALGARLASPE 34 (92)
T ss_dssp SSB-HHHHHHHHHHTTCBE--HH
T ss_pred CcCcHHHHHHHHHHcCCeeCCHH
Confidence 47899999999999999999763
No 34
>cd03515 Link_domain_TSG_6_like This is the extracellular link domain of the type found in human TSG-6. The link domain is a hyaluronan (HA)-binding domain. TSG-6 is the protein product of tumor necrosis factor-stimulated gene-6. TSG-6 is up-regulated in inflammatory lesions and in the ovary during ovulation. It has a strong anti-inflammatory and chondroprotective effect in models of acute inflammation and autoimmune arthritis and plays an essential role in female fertility. Also included in this group are the stabilins: stabilin-1 (FEEL-1, CLEVER-1) and stabilin-2 (FEEL-2). Stabilin-2 functions as the major liver and lymph node-scavenging receptor for HA and related glycosaminoglycans. Stabilin-2 is a scavenger receptor with a broad range of ligands including advanced glycation end (AGE) products, acetylated low density lipoprotein and procollagen peptides. In contrast, stabilin-1 does not bind HA, but binds acetylated low density lipoprotein and AGEs with lower affinity. As AGEs accum
Probab=89.66 E-value=0.2 Score=36.21 Aligned_cols=23 Identities=9% Similarity=0.328 Sum_probs=20.6
Q ss_pred cccCHHHHHHHHHHCCCeEeEec
Q psy4323 55 LEVDWLDARNICRRHCMDAVSLE 77 (191)
Q Consensus 55 ~~~sW~~A~~~C~~~gg~LasI~ 77 (191)
-++|+.||+..|+..|+.||++.
T Consensus 12 Y~l~f~eA~~aC~~~ga~lAs~~ 34 (93)
T cd03515 12 YKLTYTEAKAACEAEGAHLATYS 34 (93)
T ss_pred cccCHHHHHHHHHHcCCccCCHH
Confidence 36899999999999999999874
No 35
>PHA03093 EEV glycoprotein; Provisional
Probab=88.81 E-value=0.78 Score=36.99 Aligned_cols=41 Identities=15% Similarity=0.276 Sum_probs=29.8
Q ss_pred ceeEEEeccCCCCCCcccCHHHHHHHHHHCCCeEeEecCHHHHHHHHHHH
Q psy4323 40 THSYFFSWEHAPTRSLEVDWLDARNICRRHCMDAVSLETPQENEFVKQRI 89 (191)
Q Consensus 40 ~~~Y~fs~~~~~~~~~~~sW~~A~~~C~~~gg~LasI~s~~E~~~i~~~l 89 (191)
|.||.|- .+.++|.||...|...|+.|-+ .+. ....|.++|
T Consensus 108 ~~C~~~~-------~epkTf~dA~~~C~~~g~~LPs-~~l-~~~WL~dYL 148 (185)
T PHA03093 108 GSCYIFH-------SEPKTFSDAKADCAKKSSTLPN-SNL-MTTWLSDYL 148 (185)
T ss_pred CEeEEec-------CCCcCHHHHHHHHHhcCCcCCC-cch-HHHHHHHHh
Confidence 5668884 4679999999999999999987 222 223555554
No 36
>cd03517 Link_domain_CSPGs_modules_1_3 Link_domain_CSPGs_modules_1_3; this extracellular link domain is found in the first and third link modules of the chondroitin sulfate proteoglycan core protein (CSPG) aggrecan. In addition, it is found in the first link module of three other CSPGs: versican, neurocan, and brevican. The link domain is a hyaluronan (HA)-binding domain. CSPGs are characterized by an N-terminal globular domain (G1 domain) containing two contiguous link modules (modules 1 and 2). Both link modules of the G1 domain of aggrecan are involved in interaction with HA. In addition, aggrecan contains a second globular domain (G2) which contains link modules 3 and 4. G2 appears to lack HA-binding activity. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggregates having other CSPGs substituting for aggrecan may contribute to the structural integrity of many different tissues.
Probab=88.72 E-value=0.26 Score=35.71 Aligned_cols=22 Identities=14% Similarity=0.235 Sum_probs=20.0
Q ss_pred ccCHHHHHHHHHHCCCeEeEec
Q psy4323 56 EVDWLDARNICRRHCMDAVSLE 77 (191)
Q Consensus 56 ~~sW~~A~~~C~~~gg~LasI~ 77 (191)
++|+.+|+..|++.|+.||+..
T Consensus 13 ~l~f~eA~~aC~~~ga~lAs~~ 34 (95)
T cd03517 13 ALTFPRAQRACLDISAQIATPE 34 (95)
T ss_pred eECHHHHHHHHHHcCCEeCCHH
Confidence 5699999999999999999874
No 37
>PHA02673 ORF109 EEV glycoprotein; Provisional
Probab=87.17 E-value=0.64 Score=36.67 Aligned_cols=30 Identities=13% Similarity=0.230 Sum_probs=24.6
Q ss_pred ceeEEEeccCCCCCCcccCHHHHHHHHHHCCCeEeEec
Q psy4323 40 THSYFFSWEHAPTRSLEVDWLDARNICRRHCMDAVSLE 77 (191)
Q Consensus 40 ~~~Y~fs~~~~~~~~~~~sW~~A~~~C~~~gg~LasI~ 77 (191)
+.||.+- +.++|.||...|...|+.|-+..
T Consensus 84 ~~Cltl~--------~p~tf~eAn~~C~~~g~~LPs~~ 113 (161)
T PHA02673 84 NKCLTLK--------YPDTWTNANERCKELGQRLPSPS 113 (161)
T ss_pred CeeEEeC--------CCCcHHHHHHHHHhcCCcCCCCc
Confidence 4568873 46799999999999999998743
No 38
>KOG4297|consensus
Probab=85.37 E-value=3.9 Score=27.40 Aligned_cols=32 Identities=22% Similarity=0.398 Sum_probs=29.4
Q ss_pred cCHHHHHHHHHHCCCeEeEecCHHHHHHHHHH
Q psy4323 57 VDWLDARNICRRHCMDAVSLETPQENEFVKQR 88 (191)
Q Consensus 57 ~sW~~A~~~C~~~gg~LasI~s~~E~~~i~~~ 88 (191)
.+|..+...|...+++|+.+.+..++.++...
T Consensus 109 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 140 (207)
T KOG4297|consen 109 DGWSTTGTGCGGQGANLVGVLSVQENNFITSN 140 (207)
T ss_pred cCHHHHHHHHHHhCCCcCeeCCHHHHHHHHHh
Confidence 38999999999998999999999999999865
No 39
>cd03516 Link_domain_CD44_like This domain is a hyaluronan (HA)-binding domain. It is found in CD44 receptor and mediates adhesive interactions during inflammatory leukocyte homing and tumor metastasis. It also plays an important role in arteriogenesis. The functional HA-binding domain of CD44 is an extended domain comprised of a single link module flanked with N- and C-terminal extensions. These extensions are essential for folding and for functional activity. This group also contains the cell surface retention sequence (CRS) binding protein-1 (CRSBP-1) and lymph vessel endothelial receptor-1 (LYVE-1). CRSBP-1 is a cell surface binding protein for the CRS motif of PDGF-BB (platelet-derived growth factor-BB) and is responsible for the cell surface retention of PDGF-BB in SSV-transformed cells. CRSBP-1 may play a role in autocrine regulation of cell growth mediated by CRS containing growth regulators. LYVE-1 is preferentially expressed on the lymphatic endothelium and is used as a molecular marke
Probab=84.29 E-value=0.65 Score=36.13 Aligned_cols=23 Identities=17% Similarity=0.406 Sum_probs=20.5
Q ss_pred cccCHHHHHHHHHHCCCeEeEec
Q psy4323 55 LEVDWLDARNICRRHCMDAVSLE 77 (191)
Q Consensus 55 ~~~sW~~A~~~C~~~gg~LasI~ 77 (191)
-++++.+|+..|+..|+.||++.
T Consensus 17 Y~lnf~eA~~aC~~~ga~lAs~~ 39 (144)
T cd03516 17 YSLNFTEAKEACRALGLTLASKA 39 (144)
T ss_pred ccCCHHHHHHHHHHcCCeeCCHH
Confidence 46899999999999999999774
No 40
>PF05966 Chordopox_A33R: Chordopoxvirus A33R protein; InterPro: IPR009238 This family consists of several Chordopoxvirus A33R proteins. A33R plays a role in promoting Ab-resistant cell-to-cell spread of virus [] and interacts with A36R to incorporate the protein into the outer membrane of intracellular enveloped virions (IEV) [].; PDB: 3K7B_A.
Probab=69.07 E-value=4.1 Score=33.08 Aligned_cols=28 Identities=11% Similarity=0.068 Sum_probs=20.0
Q ss_pred eeEEEeccCCCCCCcccCHHHHHHHHHHCCCeEeE
Q psy4323 41 HSYFFSWEHAPTRSLEVDWLDARNICRRHCMDAVS 75 (191)
Q Consensus 41 ~~Y~fs~~~~~~~~~~~sW~~A~~~C~~~gg~Las 75 (191)
.||.+- .+.++|.+|...|...|+.|-+
T Consensus 113 ~Cl~~~-------~~p~T~~~A~~~C~~~g~~LPs 140 (190)
T PF05966_consen 113 KCLTLN-------YEPKTFDEANSDCNNKGQTLPS 140 (190)
T ss_dssp EEEEEE-------EEEEEHHHHHHHHHHTT-B---
T ss_pred EEEEec-------CCCCCHHHHHHHHHhcCCcCCC
Confidence 458874 3578999999999999999987
No 41
>PF07979 Intimin_C: Intimin C-type lectin domain; InterPro: IPR013117 This domain is found at the C terminus of intimin. Its structure has been solved and shown to have a C-lectin type of structure []. Intimin is a bacterial adhesion molecule involved in intimate attachment of enteropathogenic and enterohemorrhagic Escherichia coli to mammalian host cells. Intimin targets the translocated intimin receptor (Tir), which is exported by the bacteria and integrated into the host cell plasma membrane.; GO: 0005488 binding, 0009405 pathogenesis, 0009986 cell surface; PDB: 1CWV_A 2ZQK_B 2ZWK_C 1F02_I 1E5U_I 1F00_I 3NCX_B 3NCW_D.
Probab=53.90 E-value=6.2 Score=28.87 Aligned_cols=33 Identities=24% Similarity=0.279 Sum_probs=25.9
Q ss_pred CcccCHHHHHHHHHHCCCeEeEecCHHHHHHHHHH
Q psy4323 54 SLEVDWLDARNICRRHCMDAVSLETPQENEFVKQR 88 (191)
Q Consensus 54 ~~~~sW~~A~~~C~~~gg~LasI~s~~E~~~i~~~ 88 (191)
..++++.+|...|+..+|.|.+ +++|+.-|.+-
T Consensus 10 ~~~~~Y~~A~~~C~~~s~~Lps--S~~~L~~vy~~ 42 (101)
T PF07979_consen 10 SSRVTYSEAESICQNNSGRLPS--SQSELKDVYNE 42 (101)
T ss_dssp CCEETHHHHHHHTTTTCCESBS--SHHHHHHHHHH
T ss_pred cceEeHHHHHHHHHhccccCcc--cHHHHHHHHHh
Confidence 4689999999999999999975 45566656543
No 42
>PF03891 DUF333: Domain of unknown function (DUF333); InterPro: IPR005590 This family consists of bacterial proteins whose function has not been characterised.
Probab=36.20 E-value=30 Score=21.89 Aligned_cols=18 Identities=11% Similarity=-0.020 Sum_probs=15.0
Q ss_pred HHHHHHHHCCCeEeEecC
Q psy4323 61 DARNICRRHCMDAVSLET 78 (191)
Q Consensus 61 ~A~~~C~~~gg~LasI~s 78 (191)
-|..+|.+.||.|...++
T Consensus 6 PAs~yC~~~GG~~~~~~~ 23 (50)
T PF03891_consen 6 PASVYCVEQGGKLEIRKQ 23 (50)
T ss_pred hHHHHHHHhCCEEEEEEc
Confidence 478999999999986654
No 43
>PHA02672 ORF110 EEV glycoprotein; Provisional
Probab=28.53 E-value=38 Score=26.65 Aligned_cols=57 Identities=12% Similarity=0.087 Sum_probs=39.9
Q ss_pred ceeEEEeccCCCCCCcccCHHHHHHHHHHCCCeEeEecCHHHHHHHHHHHHcCCCCcEEEceeeCCC
Q psy4323 40 THSYFFSWEHAPTRSLEVDWLDARNICRRHCMDAVSLETPQENEFVKQRITRGNVRYIWTSGRKCNF 106 (191)
Q Consensus 40 ~~~Y~fs~~~~~~~~~~~sW~~A~~~C~~~gg~LasI~s~~E~~~i~~~l~~~~~~~~WIGl~~~~~ 106 (191)
..||+-. ..+++=..|...|+++.|+|-.|-+..- |+..+.-.....||.|-.+...
T Consensus 59 d~CyLnT-------~~q~s~~~A~~iC~~~~a~lP~~pn~~h---LKgVm~lt~~r~FW~thh~~y~ 115 (166)
T PHA02672 59 DLCVLNT-------HVITNVTLAHDICASMDGDPPATPNTML---LKGIMFLTGERSFWMTHHDAYT 115 (166)
T ss_pred CEEEEec-------ceeecccHHHHHHHhccCCCCCCCCHhH---hhheeeEecccEEEEEccccce
Confidence 3557764 4678999999999999999988766433 3323333345689999887553
No 44
>PF03781 FGE-sulfatase: Sulfatase-modifying factor enzyme 1; InterPro: IPR005532 This domain is found in eukaryotic proteins [] required for post-translational sulphatase modification (SUMF1). These proteins are associated with the rare disorder multiple sulphatase deficiency (MSD) [, , , ]. The protein product of the SUMF1 gene is FGE, formylglycine-generating enzyme, which is a sulphatase. Sulphatases are enzymes essential for degradation and remodelling of sulphate esters, and formylglycine (FGly), the key catalytic residue in the active site, is unique to sulphatases []. FGE is localised to the endoplasmic reticulum (ER) and interacts with and modifies the unfolded form of newly synthesised sulphatases. FGE is a single-domain monomer with a surprising paucity of secondary structure that adopts a unique fold which is stabilised by two Ca2+ ions. The effect of all mutations found in MSD patients is explained by the FGE structure, providing a molecular basis for MSD. A redox-active disulphide bond is present in the active site of FGE. An oxidised cysteine residue, possibly cysteine sulphenic acid, has been detected that may allow formulation of a structure-based mechanism for FGly formation from cysteine residues in all sulphatases []. This domain is also found in a few methyltransferases and protein kinases.; PDB: 2Y3C_A 2Q17_B 1Y4J_B 1Y1E_X 2AFT_X 2HIB_X 2AII_X 1Z70_X 1Y1F_X 2HI8_X ....
Probab=20.70 E-value=56 Score=26.89 Aligned_cols=34 Identities=26% Similarity=0.469 Sum_probs=19.3
Q ss_pred cccCHHHHHHHHHHCCC---eEeEecCHHHHHHHHHH
Q psy4323 55 LEVDWLDARNICRRHCM---DAVSLETPQENEFVKQR 88 (191)
Q Consensus 55 ~~~sW~~A~~~C~~~gg---~LasI~s~~E~~~i~~~ 88 (191)
..++|.+|+.+|+-++. .-.-+-+++|=++..+-
T Consensus 94 ~~Vsw~~A~ayc~wl~~~~g~~yRLPteaEWe~Aar~ 130 (260)
T PF03781_consen 94 VGVSWYDAQAYCNWLGKRTGEGYRLPTEAEWEYAARG 130 (260)
T ss_dssp -S--HHHHHHHHHHCTHHTTSS-B---HHHHHHHHHT
T ss_pred ceeeHHHHHHHHHHhcccccccccCCCHHHHHHHhcc
Confidence 45799999999999886 22233456666666543
Done!