Query psy14885
Match_columns 483
No_of_seqs 316 out of 2767
Neff 8.6
Searched_HMMs 46136
Date Fri Aug 16 23:18:29 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy14885.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/14885hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 cd03588 CLECT_CSPGs C-type lec 99.9 1.5E-24 3.3E-29 185.8 10.9 121 29-183 1-124 (124)
2 cd03589 CLECT_CEL-1_like C-typ 99.9 3E-24 6.6E-29 187.3 11.7 127 29-182 1-137 (137)
3 cd03597 CLECT_attractin_like C 99.9 4.1E-24 8.8E-29 184.3 11.8 125 29-182 1-129 (129)
4 cd03594 CLECT_REG-1_like C-typ 99.9 4E-24 8.7E-29 184.5 11.7 126 29-182 1-129 (129)
5 cd03599 CLECT_DGCR2_like C-typ 99.9 5E-24 1.1E-28 187.0 11.1 134 29-182 1-153 (153)
6 cd03590 CLECT_DC-SIGN_like C-t 99.9 1E-23 2.2E-28 181.1 11.8 123 29-182 1-126 (126)
7 cd03596 CLECT_tetranectin_like 99.9 5.3E-23 1.1E-27 177.6 11.6 125 29-182 1-129 (129)
8 COG0328 RnhA Ribonuclease HI [ 99.9 2E-22 4.4E-27 174.1 13.2 125 221-345 4-146 (154)
9 cd03593 CLECT_NK_receptors_lik 99.9 1.6E-22 3.4E-27 171.2 10.9 115 29-182 1-116 (116)
10 PRK06548 ribonuclease H; Provi 99.9 4.7E-22 1E-26 175.3 13.7 124 219-344 4-142 (161)
11 PHA02642 C-type lectin-like pr 99.9 1.2E-21 2.5E-26 178.6 13.2 133 24-199 83-215 (216)
12 PRK08719 ribonuclease H; Revie 99.9 7.5E-21 1.6E-25 166.4 13.3 118 221-343 5-146 (147)
13 cd03598 CLECT_EMBP_like C-type 99.8 4.2E-21 9.2E-26 162.7 10.7 115 38-182 1-117 (117)
14 PHA02953 IEV and EEV membrane 99.8 7.9E-21 1.7E-25 169.4 12.9 119 26-183 44-167 (170)
15 PHA03097 C-type lectin-like pr 99.8 1.6E-20 3.6E-25 166.0 13.0 114 26-184 43-156 (157)
16 PRK00203 rnhA ribonuclease H; 99.8 1.3E-20 2.9E-25 166.5 12.3 124 221-347 4-145 (150)
17 cd03591 CLECT_collectin_like C 99.8 1.3E-20 2.9E-25 159.0 10.8 111 39-182 2-114 (114)
18 smart00034 CLECT C-type lectin 99.8 1.7E-20 3.6E-25 160.2 11.6 125 29-181 1-126 (126)
19 PF00075 RNase_H: RNase H; In 99.8 8.2E-21 1.8E-25 164.4 8.3 114 221-342 4-131 (132)
20 cd03592 CLECT_selectins_like C 99.8 2.8E-20 6.1E-25 157.2 10.2 112 41-182 3-115 (115)
21 cd03601 CLECT_TC14_like C-type 99.8 6.5E-20 1.4E-24 155.9 10.7 113 41-182 3-119 (119)
22 TIGR00864 PCC polycystin catio 99.8 1.2E-19 2.5E-24 210.2 12.7 150 26-209 315-470 (2740)
23 cd03603 CLECT_VCBS A bacterial 99.8 2.3E-19 5E-24 152.2 11.1 113 39-181 1-117 (118)
24 cd03595 CLECT_chondrolectin_li 99.8 2.9E-19 6.2E-24 158.0 10.8 131 29-182 4-149 (149)
25 cd03600 CLECT_thrombomodulin_l 99.8 5.4E-19 1.2E-23 154.9 12.0 129 36-183 2-140 (141)
26 cd03602 CLECT_1 C-type lectin 99.8 5.5E-19 1.2E-23 147.5 9.6 106 40-182 2-108 (108)
27 PHA02867 C-type lectin protein 99.8 3.7E-18 8E-23 150.2 9.6 112 26-185 46-157 (167)
28 PRK13907 rnhA ribonuclease H; 99.7 3.7E-17 8.1E-22 140.8 12.6 118 223-344 4-127 (128)
29 cd06222 RnaseH RNase H (RNase 99.7 9.4E-17 2E-21 136.7 14.5 120 223-342 2-130 (130)
30 KOG3752|consensus 99.7 2.4E-16 5.3E-21 152.2 12.3 123 221-343 213-364 (371)
31 cd00037 CLECT C-type lectin (C 99.6 3.1E-15 6.6E-20 124.9 10.4 114 39-182 1-116 (116)
32 PRK07708 hypothetical protein; 99.6 1.8E-14 3.8E-19 134.2 15.0 122 221-346 74-210 (219)
33 PF00059 Lectin_C: Lectin C-ty 99.5 3.7E-14 8.1E-19 116.8 8.3 101 49-182 3-105 (105)
34 PRK07238 bifunctional RNase H/ 99.5 2.5E-13 5.4E-18 138.3 14.6 122 220-345 2-133 (372)
35 PHA02911 C-type lectin-like pr 99.4 1.9E-12 4.2E-17 114.3 9.9 108 26-184 100-211 (213)
36 PF13456 RVT_3: Reverse transc 99.3 1.3E-11 2.8E-16 98.3 8.9 86 254-343 1-86 (87)
37 PF05473 Herpes_UL45: UL45 pro 98.9 6.8E-09 1.5E-13 95.3 9.9 55 26-81 82-137 (200)
38 PHA02672 ORF110 EEV glycoprote 97.6 0.00025 5.4E-09 59.9 6.9 67 26-98 47-113 (166)
39 PHA03093 EEV glycoprotein; Pro 96.9 0.0013 2.8E-08 58.0 4.7 44 35-81 105-148 (185)
40 PHA02673 ORF109 EEV glycoprote 96.9 0.0038 8.2E-08 53.8 7.0 46 34-81 80-125 (161)
41 PF05966 Chordopox_A33R: Chord 95.8 0.003 6.5E-08 56.4 0.7 33 34-67 108-140 (190)
42 PF13966 zf-RVT: zinc-binding 94.0 0.043 9.4E-07 43.3 2.7 43 403-445 35-83 (86)
43 cd03519 Link_domain_HAPLN_modu 86.1 0.55 1.2E-05 37.1 2.2 25 44-69 7-31 (91)
44 cd03520 Link_domain_CSPGs_modu 79.2 1.7 3.8E-05 34.7 2.6 22 48-69 10-31 (96)
45 smart00445 LINK Link (Hyaluron 78.1 1.4 3.1E-05 35.1 1.8 22 48-69 14-35 (94)
46 cd01102 Link_Domain The link d 77.5 1.4 3E-05 35.1 1.5 22 48-69 13-34 (92)
47 KOG1812|consensus 75.0 9.3 0.0002 39.1 7.2 111 231-344 17-138 (384)
48 cd03518 Link_domain_HAPLN_modu 74.6 2 4.4E-05 34.3 1.8 22 48-69 13-34 (95)
49 cd03515 Link_domain_TSG_6_like 74.4 1.9 4.1E-05 34.3 1.5 22 48-69 13-34 (93)
50 PF00193 Xlink: Extracellular 73.4 1.7 3.7E-05 34.7 1.1 22 48-69 13-34 (92)
51 cd03521 Link_domain_KIAA0527_l 71.0 2.8 6E-05 33.0 1.7 22 48-69 13-34 (95)
52 cd03517 Link_domain_CSPGs_modu 70.3 2.2 4.8E-05 34.1 1.1 23 48-70 13-35 (95)
53 PF05380 Peptidase_A17: Pao re 65.4 27 0.0006 30.8 7.3 50 219-270 80-142 (159)
54 PF00336 DNA_pol_viral_C: DNA 63.4 8.1 0.00018 35.2 3.4 54 231-289 102-155 (245)
55 cd03516 Link_domain_CD44_like 59.2 6.1 0.00013 34.1 1.8 22 48-69 18-39 (144)
56 KOG4297|consensus 46.1 1.2E+02 0.0025 24.0 7.8 41 39-80 100-140 (207)
57 PF08391 Ly49: Ly49-like prote 45.8 12 0.00025 31.4 1.3 19 25-43 101-119 (119)
58 PF07979 Intimin_C: Intimin C- 45.3 12 0.00025 30.3 1.2 30 48-79 12-41 (101)
59 COG3341 Predicted double-stran 43.9 22 0.00047 32.9 2.9 93 250-344 104-201 (225)
60 COG0296 GlgB 1,4-alpha-glucan 39.6 34 0.00074 37.2 4.1 72 261-334 166-247 (628)
61 COG0590 CumB Cytosine/adenosin 38.1 82 0.0018 27.6 5.6 29 254-282 56-84 (152)
62 KOG0466|consensus 34.8 28 0.00062 34.1 2.3 36 416-451 101-142 (466)
63 PHA02910 hypothetical protein; 30.7 40 0.00087 27.7 2.2 37 31-67 65-102 (171)
64 COG0295 Cdd Cytidine deaminase 29.4 1.3E+02 0.0027 25.9 5.1 52 231-283 26-81 (134)
65 cd01285 nucleoside_deaminase N 26.3 2.1E+02 0.0045 23.2 5.9 32 252-283 43-74 (109)
66 PRK10860 tRNA-specific adenosi 25.8 1.3E+02 0.0028 27.0 4.9 29 254-282 60-88 (172)
67 cd01284 Riboflavin_deaminase-r 22.7 2.4E+02 0.0052 23.4 5.6 38 231-268 19-57 (115)
No 1
>cd03588 CLECT_CSPGs C-type lectin-like domain (CTLD) of the type found in chondroitin sulfate proteoglycan core proteins. CLECT_CSPGs: C-type lectin-like domain (CTLD) of the type found in chondroitin sulfate proteoglycan core proteins (CSPGs) in human and chicken aggrecan, frog brevican, and zebra fish dermacan. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with hyaluronan (HA). These aggregates contribute to the tissue's load bearing properties. Aggregates having other CSPGs substituting for aggrecan may contribute to the structural integrity of many different tissues. Xenopus brevican is expressed in the notochord and the brain during early embryogenesis. Zebra fish dermacan is expressed in dermal bones and may play a role in dermal bone development. CSPGs do contain LINK domain(s) which bind HA. These LINK domains are considered by one classif
Probab=99.91 E-value=1.5e-24 Score=185.84 Aligned_cols=121 Identities=19% Similarity=0.422 Sum_probs=101.4
Q ss_pred CcccccccCCeEEEEecCCCCCHHHHHHHhhhcCCcccccCCHhHHHHHHHHhhhcCCCCCceeeecccCCCCcccccCc
Q psy14885 29 CPQHWVQYRDSCYRFVKSPLKTRNDAKLNCKSLDSDLANVNDADEHGFIMYQLFWQDPQRRKWYFGGTQQSPNLWVNEDD 108 (483)
Q Consensus 29 C~~gw~~~~~~Cy~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~~l~~~~~~~~~~~~~~wig~~~~~~~~Wig~~~ 108 (483)
||+||+.|++.||+++..+ ++|.+|+..|+.+||+||+|+|++|++||..++. ..+|||+++.
T Consensus 1 Cp~gw~~~~~~Cy~~~~~~-~sw~~A~~~C~~~gg~La~i~s~~e~~fl~~~~~------~~~WIGl~~~---------- 63 (124)
T cd03588 1 CEEGWDKFQGHCYRHFPDR-ETWEDAERRCREQQGHLSSIVTPEEQEFVNNNAQ------DYQWIGLNDR---------- 63 (124)
T ss_pred CCccceecCCEEEEEECCc-cCHHHHHHHHHhcCCEEeccCCHHHHHHHHHhcc------CcEEecceec----------
Confidence 9999999999999999995 9999999999999999999999999999988642 2345555432
Q ss_pred ccccccccCCcCCCCCccccCCCCccccccccCCC-CCCCc--ccCCeEEEecccccccccccccCCCCceeeeeecc
Q psy14885 109 PQRRKWYFGGTQQSPNLWVNEDGTNLNELDAAFLP-EPADN--VQRDYLAYSFSQSLKRWGFERVTGMEPLLFICEAS 183 (483)
Q Consensus 109 ~~~~~w~~~g~~~~~~~~~W~dg~~~~~~~~~w~~-~P~~~--~~~~Cv~~~~~~~~~~W~~~~C~~~~~~~fvCe~~ 183 (483)
...+.|.|+||+++.+ .+|.+ +|++. ..++||++..... +.|.+.+|. ..++|||||.
T Consensus 64 ------------~~~~~~~W~dg~~~~~--~~W~~~~p~~~~~~~~~Cv~~~~~~~-~~W~d~~C~--~~~~fICe~~ 124 (124)
T cd03588 64 ------------TIEGDFRWSDGHPLQF--ENWRPNQPDNFFATGEDCVVMIWHEE-GEWNDVPCN--YHLPFTCKKG 124 (124)
T ss_pred ------------CCCCceEeCCCCcccc--cCcCCCCCCCCCCCCCCeEEEecCCC-CeEcCCCCC--CCCeeeeeCC
Confidence 2467899999999875 78999 99874 3579999864322 469999999 9999999974
No 2
>cd03589 CLECT_CEL-1_like C-type lectin-like domain (CTLD) of the type found in CEL-1 from Cucumaria echinata and Echinoidin from Anthocidaris crassispina. CLECT_CEL-1_like: C-type lectin-like domain (CTLD) of the type found in CEL-1 from Cucumaria echinata and Echinoidin from Anthocidaris crassispina. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. The CEL-1 CTLD binds three calcium ions and has a high specificity for N-acteylgalactosamine (GalNAc). CEL-1 exhibits strong cytotoxicity which is inhibited by GalNAc. This protein may play a role as a toxin defending against predation. Echinoidin is found in the coelomic fluid of the sea urchin and is specific for GalBeta1-3GalNAc. Echinoidin has a cell adhesive activity towards human cancer cells which is not mediated through the CTLD. Both CEL-1 and Echinoidin are multimeric proteins comprised of multiple dimers linked by disulfide bonds.
Probab=99.91 E-value=3e-24 Score=187.26 Aligned_cols=127 Identities=19% Similarity=0.407 Sum_probs=106.0
Q ss_pred CcccccccCCeEEEEecCCCCCHHHHHHHhhhc-----CCcccccCCHhHHHHHHHHhhhc--CCCCCceeeecccCCCC
Q psy14885 29 CPQHWVQYRDSCYRFVKSPLKTRNDAKLNCKSL-----DSDLANVNDADEHGFIMYQLFWQ--DPQRRKWYFGGTQQSPN 101 (483)
Q Consensus 29 C~~gw~~~~~~Cy~~~~~~~~tw~~A~~~C~~~-----g~~L~~i~s~~e~~~l~~~~~~~--~~~~~~~wig~~~~~~~ 101 (483)
||+||+.|++.||+++... ++|.+|+..|+.+ ||+|++|+|++|++||..++... ......||||+++..
T Consensus 1 Cp~gw~~~~~~Cy~~~~~~-~~w~~A~~~C~~~~~~g~~~~La~i~s~~e~~~l~~~~~~~~~~~~~~~~WiGl~~~~-- 77 (137)
T cd03589 1 CPTFWTAFGGYCYRFFGDR-LTWEEAELRCRSFSIPGLIAHLVSIHSQEENDFVYDLFESSRGPDTPYGLWIGLHDRT-- 77 (137)
T ss_pred CCcchhhhCCEEEEEeCCC-cCHHHHHHHHHhhcCCCCCceEcccCCHHHHHHHHHHHhhccccCCCCcEEEeeecCC--
Confidence 9999999999999999995 9999999999997 69999999999999999998754 223345555555322
Q ss_pred cccccCcccccccccCCcCCCCCccccCCCCccccccccCCC-CCCCcc-cCCeEEEeccc-ccccccccccCCCCceee
Q psy14885 102 LWVNEDDPQRRKWYFGGTQQSPNLWVNEDGTNLNELDAAFLP-EPADNV-QRDYLAYSFSQ-SLKRWGFERVTGMEPLLF 178 (483)
Q Consensus 102 ~Wig~~~~~~~~w~~~g~~~~~~~~~W~dg~~~~~~~~~w~~-~P~~~~-~~~Cv~~~~~~-~~~~W~~~~C~~~~~~~f 178 (483)
.++.|.|+||+++.+ .+|.+ +|++.. .++|+++.... ....|++.+|+ ..++|
T Consensus 78 --------------------~~~~~~W~dG~~~~~--~~W~~~~P~~~~~~~~C~~~~~~~~~~~~W~d~~C~--~~~~f 133 (137)
T cd03589 78 --------------------SEGPFEWTDGSPVDF--TKWAGGQPDNYGGNEDCVQMWRRGDAGQSWNDMPCD--AVFPY 133 (137)
T ss_pred --------------------ccCceEeCCcCcCCc--CCcCCCCCCCCCCCCCceeeecCCCCCCeecCCCCC--CCcce
Confidence 578899999999876 88999 998754 68999997654 23469999999 99999
Q ss_pred eeec
Q psy14885 179 ICEA 182 (483)
Q Consensus 179 vCe~ 182 (483)
|||+
T Consensus 134 IC~~ 137 (137)
T cd03589 134 ICKM 137 (137)
T ss_pred eeeC
Confidence 9995
No 3
>cd03597 CLECT_attractin_like C-type lectin-like domain (CTLD) of the type found in human and mouse attractin (AtrN) and attractin-like protein (ALP). CLECT_attractin_like: C-type lectin-like domain (CTLD) of the type found in human and mouse attractin (AtrN) and attractin-like protein (ALP). CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. Mouse AtrN (the product of the mahogany gene) has been shown to bind Agouti protein and to function in agouti-induced pigmentation and obesity. Mutations in AtrN have also been shown to cause spongiform encephalopathy and hypomyelination in rats and hamsters. The cytoplasmic region of mouse ALP has been shown to binds to melanocortin receptor (MCR4). Signaling through MCR4 plays a role in appetite suppression. Attractin may have therapeutic potential in the treatment of obesity. Human attractin (hAtrN) has been shown to be expressed on activated T cells and released extracellularly. The
Probab=99.91 E-value=4.1e-24 Score=184.28 Aligned_cols=125 Identities=22% Similarity=0.368 Sum_probs=101.6
Q ss_pred CcccccccCCeEEEEecCCCCCHHHHHHHhhhcCCcccccCCHhHHHHHHHHhhhcC--CCCCceeeecccCCCCccccc
Q psy14885 29 CPQHWVQYRDSCYRFVKSPLKTRNDAKLNCKSLDSDLANVNDADEHGFIMYQLFWQD--PQRRKWYFGGTQQSPNLWVNE 106 (483)
Q Consensus 29 C~~gw~~~~~~Cy~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~~l~~~~~~~~--~~~~~~wig~~~~~~~~Wig~ 106 (483)
||+||..|++.||+++.. .++|.+|+.+|+++||+||+|++.+|++||..++.... .....+|||+++..
T Consensus 1 Cp~gw~~~~~~Cy~~~~~-~~tw~~A~~~C~~~g~~La~i~~~~E~~fi~~~~~~~~~~~~~~~~WIGl~d~~------- 72 (129)
T cd03597 1 CGEGWHLVGNSCLKINTA-RESYDNAKLYCRNLNAVLASLTTQKKVEFVLKELQKHQMTKQKLTPWVGLRKIN------- 72 (129)
T ss_pred CCCcCEEcCCEEEEEEcC-CCCHHHHHHHHHHcCCEEcCCCCHHHHHHHHHHHHhhcccCCCCceEEeeecCC-------
Confidence 999999999999999999 59999999999999999999999999999999886542 11234455554321
Q ss_pred CcccccccccCCcCCCCCccccCCCCccccccccCCC-CCCCcccCCeEEEecccccccccccccCCCCc-eeeeeec
Q psy14885 107 DDPQRRKWYFGGTQQSPNLWVNEDGTNLNELDAAFLP-EPADNVQRDYLAYSFSQSLKRWGFERVTGMEP-LLFICEA 182 (483)
Q Consensus 107 ~~~~~~~w~~~g~~~~~~~~~W~dg~~~~~~~~~w~~-~P~~~~~~~Cv~~~~~~~~~~W~~~~C~~~~~-~~fvCe~ 182 (483)
.+.|+|+||+++.+..++|.+ +|++ .++||.+..... ..|+|.+|. .. .+||||+
T Consensus 73 ----------------~g~w~W~Dgs~~~~~~~~W~~geP~~--~~~C~~~~~~~~-~~w~d~~C~--~~~~~~iCe~ 129 (129)
T cd03597 73 ----------------VSYWCWEDMSPFTNTTLQWLPGEPSD--AGFCGYLEEPAV-SGLKANPCT--NPVNGSVCER 129 (129)
T ss_pred ----------------CCceEECCCCCCCCccccCCCCCCCC--cccEEEEccccc-CccccCCcC--CCCcceeecC
Confidence 467999999988542378999 9987 379999855433 369999999 88 6999996
No 4
>cd03594 CLECT_REG-1_like C-type lectin-like domain (CTLD) of the type found in Human REG-1 (lithostathine), REG-4, and avian eggshell-specific proteins: ansocalcin, structhiocalcin-1(SCA-1), and -2(SCA-2). CLECT_REG-1_like: C-type lectin-like domain (CTLD) of the type found in Human REG-1 (lithostathine), REG-4, and avian eggshell-specific proteins: ansocalcin, structhiocalcin-1(SCA-1), and -2(SCA-2). CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. REG-1 is a proliferating factor which participates in various kinds of tissue regeneration including pancreatic beta-cell regeneration, regeneration of intestinal mucosa, regeneration of motor neurons, and perhaps in tissue regeneration of damaged heart. REG-1 may play a role on the pathophysiology of Alzheimer's disease and in the development of gastric cancers. Its expression is correlated with reduced survival from early-stage colorectal cancer. REG-1 also binds and aggregates
Probab=99.91 E-value=4e-24 Score=184.49 Aligned_cols=126 Identities=22% Similarity=0.442 Sum_probs=106.2
Q ss_pred CcccccccCCeEEEEecCCCCCHHHHHHHhhhc--CCcccccCCHhHHHHHHHHhhhcCCCCCceeeecccCCCCccccc
Q psy14885 29 CPQHWVQYRDSCYRFVKSPLKTRNDAKLNCKSL--DSDLANVNDADEHGFIMYQLFWQDPQRRKWYFGGTQQSPNLWVNE 106 (483)
Q Consensus 29 C~~gw~~~~~~Cy~~~~~~~~tw~~A~~~C~~~--g~~L~~i~s~~e~~~l~~~~~~~~~~~~~~wig~~~~~~~~Wig~ 106 (483)
||+||+.|++.||+++..+ ++|.+|+..|+.+ ||+||+|+|.+|++||..+++........||||+++..
T Consensus 1 Cp~gw~~~~~~Cy~~~~~~-~tw~~A~~~C~~~~~g~~La~i~s~~e~~~l~~~~~~~~~~~~~~WiGl~~~~------- 72 (129)
T cd03594 1 CPKGWLPYKGNCYGYFRQP-LSWSDAELFCQKYGPGAHLASIHSPAEAAAIASLISSYQKAYQPVWIGLHDPQ------- 72 (129)
T ss_pred CCCCCEecCCEeeeEeccC-cCHHHHHHHHHhcCCCceEcccCCHHHHHHHHHHHHhhccCCccEEEeeccCC-------
Confidence 9999999999999999995 9999999999999 59999999999999999998765433455677666433
Q ss_pred CcccccccccCCcCCCCCccccCCCCccccccccCCC-CCCCcccCCeEEEecccccccccccccCCCCceeeeeec
Q psy14885 107 DDPQRRKWYFGGTQQSPNLWVNEDGTNLNELDAAFLP-EPADNVQRDYLAYSFSQSLKRWGFERVTGMEPLLFICEA 182 (483)
Q Consensus 107 ~~~~~~~w~~~g~~~~~~~~~W~dg~~~~~~~~~w~~-~P~~~~~~~Cv~~~~~~~~~~W~~~~C~~~~~~~fvCe~ 182 (483)
..+.|.|+||+++.+ .+|.+ +|. ...++||.+....+...|.+.+|+ ..++||||+
T Consensus 73 ---------------~~~~~~W~dg~~~~~--~~W~~~~p~-~~~~~Cv~~~~~~~~~~W~~~~C~--~~~~fICe~ 129 (129)
T cd03594 73 ---------------QSRGWEWSDGSKLDY--RSWDRNPPY-ARGGYCAELSRSTGFLKWNDANCE--ERNPFICKY 129 (129)
T ss_pred ---------------CCCceEeCCCCccee--cccCCCCCC-CCCCCceEEEecCCCCeEECCCCC--CCceeeeeC
Confidence 457899999999865 78999 994 346899999765544469999999 999999985
No 5
>cd03599 CLECT_DGCR2_like C-type lectin-like domain (CTLD) of the type found in DGCR2, an integral membrane protein deleted in DiGeorge Syndrome (DGS). CLECT_DGCR2_like: C-type lectin-like domain (CTLD) of the type found in DGCR2, an integral membrane protein deleted in DiGeorge Syndrome (DGS). CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. DGS is also known velo-cardio-facial syndrome (VCFS). DGS is a genetic abnormality that results in malformations of the heart, face, and limbs and is associated with schizophrenia and depressive disorders. DGCR2 is a candidate for involvement in the pathogenesis of DGS since the DGCR2 gene lies within the minimal DGS critical region (MDGRC) of 22q11, which when deleted gives rise to DGS, and the DGCR2 gene is in close proximity to the balanced translocation breakpoint in a DGS patient having a balanced translocation.
Probab=99.91 E-value=5e-24 Score=187.01 Aligned_cols=134 Identities=22% Similarity=0.396 Sum_probs=103.5
Q ss_pred CcccccccCC--eEEEEecCCCCCHHHHHHHhhhcCCcccccCCHhHHHHHHHHhhhcCC------CCCceeeecccCCC
Q psy14885 29 CPQHWVQYRD--SCYRFVKSPLKTRNDAKLNCKSLDSDLANVNDADEHGFIMYQLFWQDP------QRRKWYFGGTQQSP 100 (483)
Q Consensus 29 C~~gw~~~~~--~Cy~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~~l~~~~~~~~~------~~~~~wig~~~~~~ 100 (483)
||+||..|++ +||+++... +||.+|+..|+..||+||+|+|.+|++||..++..... ....|||||++.-
T Consensus 1 CP~gw~~~~~~~~CYk~~~~~-~tw~dA~~~C~~~Gg~Lasi~s~~e~~fl~~l~~~~~~~~~~~~~~~~~WIGL~~~~- 78 (153)
T cd03599 1 CPSGWHHYEGTASCYKVYLSG-ENYWDAVQTCQKVNGSLATFTTDQELQFILAQEWDFDERVFGRKDQCKFWVGYQYVI- 78 (153)
T ss_pred CCCchhccCCCCeEEEEeCCc-CCHHHHHHHHHHcCCEEcCCCCHHHHHHHHHHHHhhcccccccccCCCEEEeecccc-
Confidence 9999999998 899999995 99999999999999999999999999999999865421 1134566664220
Q ss_pred CcccccCcccccccccCCcCCCCCccccC-CCCccccc--cccCCC-CCCCcccCCeEEEec-------ccccccccccc
Q psy14885 101 NLWVNEDDPQRRKWYFGGTQQSPNLWVNE-DGTNLNEL--DAAFLP-EPADNVQRDYLAYSF-------SQSLKRWGFER 169 (483)
Q Consensus 101 ~~Wig~~~~~~~~w~~~g~~~~~~~~~W~-dg~~~~~~--~~~w~~-~P~~~~~~~Cv~~~~-------~~~~~~W~~~~ 169 (483)
-.++ ...++.|+|+ ||+++.|. ..+|.+ +|++ ++++|+.+.. ......|+|.+
T Consensus 79 ----~~~~-----------~~~eg~w~WsddGs~~~y~~w~~~w~~gePn~-~~e~C~~~~~~~~~~~~~~~~~~W~d~~ 142 (153)
T cd03599 79 ----TNRN-----------HSLEGRWEVAYKGSMEVFLPPEPIFATGMSTN-DNVFCAQLQCFQIPSLRERGLHSWHAEN 142 (153)
T ss_pred ----cccC-----------cccCCeEEEecCCccceecCccccCCCCCCCC-CCCCCeEEEeeccccccccccCeeeCcc
Confidence 0011 1257899999 59988742 135688 8987 5689999962 22345799999
Q ss_pred cCCCCceeeeeec
Q psy14885 170 VTGMEPLLFICEA 182 (483)
Q Consensus 170 C~~~~~~~fvCe~ 182 (483)
|. ...+||||+
T Consensus 143 C~--~~~~fiCq~ 153 (153)
T cd03599 143 CY--EKSSFLCKR 153 (153)
T ss_pred CC--CCCcceeCC
Confidence 99 999999996
No 6
>cd03590 CLECT_DC-SIGN_like C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific intercellular adhesion molecule 3-grabbing non-integrin (DC-SIGN) and the related receptor, DC-SIGN receptor (DC-SIGNR). CLECT_DC-SIGN_like: C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific intercellular adhesion molecule 3-grabbing non-integrin (DC-SIGN) and the related receptor, DC-SIGN receptor (DC-SIGNR). This group also contains proteins similar to hepatic asialoglycoprotein receptor (ASGP-R) and langerin in human. These proteins are type II membrane proteins with a CTLD ectodomain. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. DC-SIGN is thought to mediate the initial contact between dendritic cells and resting T cells, and may also mediate the rolling of DCs on epithelium. DC-SIGN and DC-SIGNR bind to oligosaccharides present on human tissues, as well as, on path
Probab=99.90 E-value=1e-23 Score=181.10 Aligned_cols=123 Identities=25% Similarity=0.537 Sum_probs=105.0
Q ss_pred CcccccccCCeEEEEecCCCCCHHHHHHHhhhcCCcccccCCHhHHHHHHHHhhhcCCCCCceeeecccCCCCcccccCc
Q psy14885 29 CPQHWVQYRDSCYRFVKSPLKTRNDAKLNCKSLDSDLANVNDADEHGFIMYQLFWQDPQRRKWYFGGTQQSPNLWVNEDD 108 (483)
Q Consensus 29 C~~gw~~~~~~Cy~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~~l~~~~~~~~~~~~~~wig~~~~~~~~Wig~~~ 108 (483)
||.||+.|++.||+++..+ ++|.+|+..|+.+||+||+|+|.+|++||..++. ....||||+++..
T Consensus 1 Cp~~w~~~~~~Cy~~~~~~-~tw~~A~~~C~~~g~~La~i~s~~e~~~l~~~~~----~~~~~WiGl~~~~--------- 66 (126)
T cd03590 1 CPTNWKSFQSSCYFFSTEK-KSWEESRQFCEDMGAHLVIINSQEEQEFISKILS----GNRSYWIGLSDEE--------- 66 (126)
T ss_pred CCCCceEeCCEEEEEeCCC-cCHHHHHHHHHhCCCEEEeeCCHHHHHHHHHHhC----CCCCEEEeeecCC---------
Confidence 9999999999999999995 9999999999999999999999999999999885 3356666666432
Q ss_pred ccccccccCCcCCCCCccccCCCCccccccccCCC-CCCCcc--cCCeEEEecccccccccccccCCCCceeeeeec
Q psy14885 109 PQRRKWYFGGTQQSPNLWVNEDGTNLNELDAAFLP-EPADNV--QRDYLAYSFSQSLKRWGFERVTGMEPLLFICEA 182 (483)
Q Consensus 109 ~~~~~w~~~g~~~~~~~~~W~dg~~~~~~~~~w~~-~P~~~~--~~~Cv~~~~~~~~~~W~~~~C~~~~~~~fvCe~ 182 (483)
.++.|.|.||+++.....+|.+ +|++.. .++|+.+.... +.|.+.+|+ ..++|||||
T Consensus 67 -------------~~~~~~W~dg~~~~~~~~~W~~~~p~~~~~~~~~C~~~~~~~--~~w~~~~C~--~~~~fiCek 126 (126)
T cd03590 67 -------------TEGEWKWVDGTPLNSSKTFWHPGEPNNWGGGGEDCAELVYDS--GGWNDVPCN--LEYRWICEK 126 (126)
T ss_pred -------------CcCCeEecCCCCCCCccCCcCCCcCCCCCCCCCCCEEEECCC--CcEeCcCCC--CCEeeeeeC
Confidence 4688999999988312288999 998763 68999997654 469999999 999999996
No 7
>cd03596 CLECT_tetranectin_like C-type lectin-like domain (CTLD) of the type found in the tetranectin (TN), cartilage derived C-type lectin (CLECSF1), and stem cell growth factor (SCGF). CLECT_tetranectin_like: C-type lectin-like domain (CTLD) of the type found in the tetranectin (TN), cartilage derived C-type lectin (CLECSF1), and stem cell growth factor (SCGF). CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. TN binds to plasminogen and stimulates activation of plasminogen, playing a key role in the regulation of proteolytic processes. The TN CTLD binds two calcium ions. Its calcium free form binds to various kringle-like protein ligands. Two residues involved in the coordination of calcium are critical for the binding of TN to the fourth kringle (K4) domain of plasminogen (Plg K4). TN binds the kringle 1-4 form of angiostatin (AST K1-4). AST K1-4 is a fragment of Plg, commonly found in cancer tissues. TN inhibits the bin
Probab=99.89 E-value=5.3e-23 Score=177.60 Aligned_cols=125 Identities=19% Similarity=0.344 Sum_probs=103.2
Q ss_pred CcccccccCCeEEEEecCCCCCHHHHHHHhhhcCCcccccCCHhHHHHHHHHhhhcCCCCCceeeecccCCCCcccccCc
Q psy14885 29 CPQHWVQYRDSCYRFVKSPLKTRNDAKLNCKSLDSDLANVNDADEHGFIMYQLFWQDPQRRKWYFGGTQQSPNLWVNEDD 108 (483)
Q Consensus 29 C~~gw~~~~~~Cy~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~~l~~~~~~~~~~~~~~wig~~~~~~~~Wig~~~ 108 (483)
||+|| .++++||+++..+ ++|.+|+..|+++||+||+|+|.+|+++|..+++........||||+++..
T Consensus 1 c~~g~-~~~~~CY~~~~~~-~~w~~A~~~C~~~g~~La~i~s~~e~~~l~~~~~~~~~~~~~~WiGl~~~~--------- 69 (129)
T cd03596 1 CLKGT-KIHKKCYLVSEET-KHYHEASEDCIARGGTLATPRDSDENDALRDYVKASVPGNWEVWLGINDMV--------- 69 (129)
T ss_pred CCCCc-EECCEEEEEeccc-CCHHHHHHHHHhcCCeEecCCCHHHHHHHHHHHHhccCCCCcEEEeccccC---------
Confidence 89997 4478999999995 999999999999999999999999999999988765434455666665433
Q ss_pred ccccccccCCcCCCCCccccCCCCccccccccCC---C-CCCCcccCCeEEEecccccccccccccCCCCceeeeeec
Q psy14885 109 PQRRKWYFGGTQQSPNLWVNEDGTNLNELDAAFL---P-EPADNVQRDYLAYSFSQSLKRWGFERVTGMEPLLFICEA 182 (483)
Q Consensus 109 ~~~~~w~~~g~~~~~~~~~W~dg~~~~~~~~~w~---~-~P~~~~~~~Cv~~~~~~~~~~W~~~~C~~~~~~~fvCe~ 182 (483)
.++.|.|+||+++.+ .+|. + +|++...++|+++..... +.|.+.+|. ..++||||.
T Consensus 70 -------------~~~~w~w~dG~~~~~--~~W~~~~~~~p~~~~~~~Cv~l~~~~~-~~W~d~~C~--~~~~fICe~ 129 (129)
T cd03596 70 -------------AEGKWVDVNGSPISY--FNWEREITAQPDGGKRENCVALSSSAQ-GKWFDEDCR--REKPYVCEF 129 (129)
T ss_pred -------------ccCeEEeCCCCCccc--cccCCCCCCCCCCCCCCCCEEEccCCC-CcCcCccCC--CCCceeccC
Confidence 577899999999876 8899 3 787655689999976422 369999999 999999984
No 8
>COG0328 RnhA Ribonuclease HI [DNA replication, recombination, and repair]
Probab=99.89 E-value=2e-22 Score=174.13 Aligned_cols=125 Identities=17% Similarity=0.229 Sum_probs=101.5
Q ss_pred ceEe-eCCCCC--CCceEEEEEeCCCEEE--eC-CCCCcchHHHHHHHHHHHHHhHhcCCceEEEEeccHHHHHhhccc-
Q psy14885 221 DVVF-DLSKRN--SNTGFAFIINNENFQY--RL-PHEASIFTAEAIAIFYCIKKISDMKIKKAIIFSDNKSVLNGIDNI- 293 (483)
Q Consensus 221 ~~~~-DgS~~~--~~~G~g~~~~~~~~~~--~l-~~~~s~~~AEl~ai~~Al~~~~~~~~~~~~I~tDs~~~~~~l~~~- 293 (483)
..+| |||+.+ |+.|+|+|+..+...+ .. ...+||++||+.|++.||+.+.+.+...|.|+||||+|+++|..-
T Consensus 4 v~if~DGa~~gNpG~gG~g~vl~~~~~~~~~s~~~~~tTNNraEl~A~i~AL~~l~~~~~~~v~l~tDS~yv~~~i~~w~ 83 (154)
T COG0328 4 VEIFTDGACLGNPGPGGWGAVLRYGDGEKELSGGEGRTTNNRAELRALIEALEALKELGACEVTLYTDSKYVVEGITRWI 83 (154)
T ss_pred eEEEecCccCCCCCCceEEEEEEcCCceEEEeeeeecccChHHHHHHHHHHHHHHHhcCCceEEEEecHHHHHHHHHHHH
Confidence 4456 999977 9999999998544332 22 248999999999999999999998899999999999999999831
Q ss_pred --CCcc---------hHHHHHHHHHHHHhhCCCceEEEEeccCCCCcchHHHhHHHHhhhcCC
Q psy14885 294 --QQRN---------NIIQLIKQEYYFASTNGSQISFMWIPSHSNIALNDKADQLAKNSINSK 345 (483)
Q Consensus 294 --~~~~---------~~~~~i~~~~~~~~~~~~~v~~~WvpgH~g~~gNe~AD~lAk~a~~~~ 345 (483)
++.. --...+++.+.++..+...|.++|||||+|.++||.||+||+.|+...
T Consensus 84 ~~w~~~~w~~~~~~pvkn~dl~~~~~~~~~~~~~v~~~WVkgH~g~~~NeraD~LA~~~~~~~ 146 (154)
T COG0328 84 VKWKKNGWKTADKKPVKNKDLWEELDELLKRHELVFWEWVKGHAGHPENERADQLAREAARAA 146 (154)
T ss_pred hhccccCccccccCccccHHHHHHHHHHHhhCCeEEEEEeeCCCCChHHHHHHHHHHHHHHhh
Confidence 1111 112456777788888888999999999999999999999999998765
No 9
>cd03593 CLECT_NK_receptors_like C-type lectin-like domain (CTLD) of the type found in natural killer cell receptors (NKRs). CLECT_NK_receptors_like: C-type lectin-like domain (CTLD) of the type found in natural killer cell receptors (NKRs), including proteins similar to oxidized low density lipoprotein (OxLDL) receptor (LOX-1), CD94, CD69, NKG2-A and -D, osteoclast inhibitory lectin (OCIL), dendritic cell-associated C-type lectin-1 (dectin-1), human myeloid inhibitory C-type lectin-like receptor (MICL), mast cell-associated functional antigen (MAFA), killer cell lectin-like receptors: subfamily F, member 1 (KLRF1) and subfamily B, member 1 (KLRB1), and lys49 receptors. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. NKRs are variously associated with activation or inhibition of natural killer (NK) cells. Activating NKRs stimulate cytolysis by NK cells of virally infected or transformed cells; inhibitory NKRs block cytolysis up
Probab=99.88 E-value=1.6e-22 Score=171.20 Aligned_cols=115 Identities=27% Similarity=0.516 Sum_probs=98.2
Q ss_pred CcccccccCCeEEEEecCCCCCHHHHHHHhhhcCCcccccCCHhHHHHHHHHhhhcCCCCCceeeecccCCCCcccccCc
Q psy14885 29 CPQHWVQYRDSCYRFVKSPLKTRNDAKLNCKSLDSDLANVNDADEHGFIMYQLFWQDPQRRKWYFGGTQQSPNLWVNEDD 108 (483)
Q Consensus 29 C~~gw~~~~~~Cy~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~~l~~~~~~~~~~~~~~wig~~~~~~~~Wig~~~ 108 (483)
||+||+.|++.||+++... ++|.+|+..|+.+||+|++|+|.+|++||..++ ....+|||+++..
T Consensus 1 Cp~gw~~~~~~Cy~~~~~~-~~w~~A~~~C~~~g~~La~i~s~~e~~~l~~~~-----~~~~~WiGl~~~~--------- 65 (116)
T cd03593 1 CPKDWICYGNKCYYFSMEK-KTWNESKEACSSKNSSLLKIDDEEELEFLQSQI-----GSSSYWIGLSREK--------- 65 (116)
T ss_pred CCCcCEEeCCEEEEEEcCC-CCHHHHHHHHHhCCCcEEEECCHHHHHHHHHhc-----CCCceEEEEeecC---------
Confidence 9999999999999999995 999999999999999999999999999999987 2345666665432
Q ss_pred ccccccccCCcCCCCCccccCCCCccccccccCCC-CCCCcccCCeEEEecccccccccccccCCCCceeeeeec
Q psy14885 109 PQRRKWYFGGTQQSPNLWVNEDGTNLNELDAAFLP-EPADNVQRDYLAYSFSQSLKRWGFERVTGMEPLLFICEA 182 (483)
Q Consensus 109 ~~~~~w~~~g~~~~~~~~~W~dg~~~~~~~~~w~~-~P~~~~~~~Cv~~~~~~~~~~W~~~~C~~~~~~~fvCe~ 182 (483)
.++.|.|+||+++. +|.. +|++ ..++|+.+... .|.+.+|. ..++|||||
T Consensus 66 -------------~~~~~~W~dg~~~~----~~~~~~~~~-~~~~C~~~~~~----~w~~~~C~--~~~~~IC~k 116 (116)
T cd03593 66 -------------SEKPWKWIDGSPLN----NLFNIRGST-KSGNCAYLSST----GIYSEDCS--TKKRWICEK 116 (116)
T ss_pred -------------CCCCeEccCCCccc----ccccccCCC-CCCCceEEcCC----cEEcccCC--cCceeeeeC
Confidence 46899999999875 4666 7765 56899998653 49999999 999999997
No 10
>PRK06548 ribonuclease H; Provisional
Probab=99.88 E-value=4.7e-22 Score=175.28 Aligned_cols=124 Identities=23% Similarity=0.281 Sum_probs=95.6
Q ss_pred CcceEe-eCCCCC--CCceEEEEEeCCCEEEeCCCCCcchHHHHHHHHHHHHHhHhcCCceEEEEeccHHHHHhhcc---
Q psy14885 219 PTDVVF-DLSKRN--SNTGFAFIINNENFQYRLPHEASIFTAEAIAIFYCIKKISDMKIKKAIIFSDNKSVLNGIDN--- 292 (483)
Q Consensus 219 p~~~~~-DgS~~~--~~~G~g~~~~~~~~~~~l~~~~s~~~AEl~ai~~Al~~~~~~~~~~~~I~tDs~~~~~~l~~--- 292 (483)
...++| ||++.+ |+.|+|+++.+..........+||++|||+||+.||+.+.. ...++.|+|||++|+++|+.
T Consensus 4 ~~~~IytDGa~~gnpg~~G~g~~~~~~~~~~g~~~~~TNnraEl~Aii~aL~~~~~-~~~~v~I~TDS~yvi~~i~~W~~ 82 (161)
T PRK06548 4 NEIIAATDGSSLANPGPSGWAWYVDENTWDSGGWDIATNNIAELTAVRELLIATRH-TDRPILILSDSKYVINSLTKWVY 82 (161)
T ss_pred CEEEEEEeeccCCCCCceEEEEEEeCCcEEccCCCCCCHHHHHHHHHHHHHHhhhc-CCceEEEEeChHHHHHHHHHHHH
Confidence 345677 999976 89999999876543333345789999999999999986642 44579999999999999984
Q ss_pred -----cCC---cchHH-HHHHHHHHHHhhCCCceEEEEeccCCCCcchHHHhHHHHhhhcC
Q psy14885 293 -----IQQ---RNNII-QLIKQEYYFASTNGSQISFMWIPSHSNIALNDKADQLAKNSINS 344 (483)
Q Consensus 293 -----~~~---~~~~~-~~i~~~~~~~~~~~~~v~~~WvpgH~g~~gNe~AD~lAk~a~~~ 344 (483)
.++ +.++. +++++.+..+... ..|+|.|||||+|++|||.||+||++|+..
T Consensus 83 ~Wk~~gWk~s~G~pV~N~dL~~~l~~l~~~-~~v~~~wVkgHsg~~gNe~aD~LA~~aa~~ 142 (161)
T PRK06548 83 SWKMRKWRKADGKPVLNQEIIQEIDSLMEN-RNIRMSWVNAHTGHPLNEAADSLARQAANN 142 (161)
T ss_pred HHHHCCCcccCCCccccHHHHHHHHHHHhc-CceEEEEEecCCCCHHHHHHHHHHHHHHHH
Confidence 222 23333 4555566666554 479999999999999999999999998855
No 11
>PHA02642 C-type lectin-like protein; Provisional
Probab=99.87 E-value=1.2e-21 Score=178.56 Aligned_cols=133 Identities=20% Similarity=0.375 Sum_probs=106.0
Q ss_pred cccCCCcccccccCCeEEEEecCCCCCHHHHHHHhhhcCCcccccCCHhHHHHHHHHhhhcCCCCCceeeecccCCCCcc
Q psy14885 24 ELELRCPQHWVQYRDSCYRFVKSPLKTRNDAKLNCKSLDSDLANVNDADEHGFIMYQLFWQDPQRRKWYFGGTQQSPNLW 103 (483)
Q Consensus 24 ~~~~~C~~gw~~~~~~Cy~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~~l~~~~~~~~~~~~~~wig~~~~~~~~W 103 (483)
..+..||.||..|+++||+|+.. ..+|.+|+..|+++||+|+.|+|++|++||+.+.. ...||||+++..
T Consensus 83 ~k~~~CP~gW~~~~~kCYyfs~~-~ksW~eA~~~C~s~ga~La~I~seeE~~FL~~~~~-----~~~yWIGLsd~~---- 152 (216)
T PHA02642 83 IKYVTCPKGWIGFGYKCFYFSED-SKNWTFGNTFCTSLGATLVKVETEEELNFLKRYKD-----SSDHWIGLNRES---- 152 (216)
T ss_pred cccCCCCCcCEEECCEEEEEeCc-ccCHHHHHHHHhhCCCeEeeECCHHHHHHHHHhhc-----CCeEEEEeEeCC----
Confidence 34678999999999999999999 49999999999999999999999999999998642 234666666533
Q ss_pred cccCcccccccccCCcCCCCCccccCCCCccccccccCCCCCCCcccCCeEEEecccccccccccccCCCCceeeeeecc
Q psy14885 104 VNEDDPQRRKWYFGGTQQSPNLWVNEDGTNLNELDAAFLPEPADNVQRDYLAYSFSQSLKRWGFERVTGMEPLLFICEAS 183 (483)
Q Consensus 104 ig~~~~~~~~w~~~g~~~~~~~~~W~dg~~~~~~~~~w~~~P~~~~~~~Cv~~~~~~~~~~W~~~~C~~~~~~~fvCe~~ 183 (483)
.++.|.|+||+++.. .+|. .+.++|+.+... ++.+..|. ..+.+||+|.
T Consensus 153 ------------------~e~~W~WvDGS~~n~--~~~i-----~G~g~CAyLs~~----~i~s~~C~--~~~~wIC~K~ 201 (216)
T PHA02642 153 ------------------SNHPWKWADNSNYNA--SFVI-----TGTGECAYLNDI----RISSSRVY--ANRKWICSKT 201 (216)
T ss_pred ------------------CCCceEECCCCccCc--ceec-----cCCCceEEEeCC----ceEccCcC--CCceEEeeee
Confidence 578899999998865 4453 235799998543 58999999 9999999999
Q ss_pred hhhhhhhccccccccc
Q psy14885 184 IQKLHYLLNDDRTYQY 199 (483)
Q Consensus 184 ~~~~~~~~~~~~~~~y 199 (483)
..+ ..++.+.+|.|
T Consensus 202 l~~--~~~~~~~~~~~ 215 (216)
T PHA02642 202 YTN--IYIPSNNSYNY 215 (216)
T ss_pred ccc--eEccCCCCCCC
Confidence 543 23444555555
No 12
>PRK08719 ribonuclease H; Reviewed
Probab=99.85 E-value=7.5e-21 Score=166.43 Aligned_cols=118 Identities=17% Similarity=0.221 Sum_probs=89.6
Q ss_pred ceEe-eCCCCC--C---CceEEEEEeCCC------EEEeCCCCCcchHHHHHHHHHHHHHhHhcCCceEEEEeccHHHHH
Q psy14885 221 DVVF-DLSKRN--S---NTGFAFIINNEN------FQYRLPHEASIFTAEAIAIFYCIKKISDMKIKKAIIFSDNKSVLN 288 (483)
Q Consensus 221 ~~~~-DgS~~~--~---~~G~g~~~~~~~------~~~~l~~~~s~~~AEl~ai~~Al~~~~~~~~~~~~I~tDs~~~~~ 288 (483)
.++| |||+.+ + ++|+|+++.... ....+...+||++|||.||+.||+.+.+. ..|+|||++|++
T Consensus 5 ~~iYtDGs~~~n~~~~~~~G~G~vv~~~~~~~~~~~~~~~~~~~Tnn~aEl~A~~~aL~~~~~~----~~i~tDS~yvi~ 80 (147)
T PRK08719 5 YSIYIDGAAPNNQHGCVRGGIGLVVYDEAGEIVDEQSITVNRYTDNAELELLALIEALEYARDG----DVIYSDSDYCVR 80 (147)
T ss_pred EEEEEecccCCCCCCCCCcEEEEEEEeCCCCeeEEEEecCCCCccHHHHHHHHHHHHHHHcCCC----CEEEechHHHHH
Confidence 3455 999965 3 469999876532 22334566899999999999999998643 379999999999
Q ss_pred hh--------cccCCcc---hH-HHHHHHHHHHHhhCCCceEEEEeccCCCCcchHHHhHHHHhhhc
Q psy14885 289 GI--------DNIQQRN---NI-IQLIKQEYYFASTNGSQISFMWIPSHSNIALNDKADQLAKNSIN 343 (483)
Q Consensus 289 ~l--------~~~~~~~---~~-~~~i~~~~~~~~~~~~~v~~~WvpgH~g~~gNe~AD~lAk~a~~ 343 (483)
+| ++++..+ ++ ...+++.+..+.+ ...|+|.|||||+|++|||.||+||++|+.
T Consensus 81 ~i~~~~~~W~~~~w~~s~g~~v~n~dl~~~i~~l~~-~~~i~~~~VkgH~g~~~Ne~aD~lA~~a~~ 146 (147)
T PRK08719 81 GFNEWLDTWKQKGWRKSDKKPVANRDLWQQVDELRA-RKYVEVEKVTAHSGIEGNEAADMLAQAAAE 146 (147)
T ss_pred HHHHHHHHHHhCCcccCCCcccccHHHHHHHHHHhC-CCcEEEEEecCCCCChhHHHHHHHHHHHhh
Confidence 99 4455421 11 2455566666655 567999999999999999999999999875
No 13
>cd03598 CLECT_EMBP_like C-type lectin-like domain (CTLD) of the type found in the human proteins, eosinophil major basic protein (EMBP) and prepro major basic protein homolog (MBPH). CLECT_EMBP_like: C-type lectin-like domain (CTLD) of the type found in the human proteins, eosinophil major basic protein (EMBP) and prepro major basic protein homolog (MBPH). CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. Eosinophils and basophils carry out various functions in allergic, parasitic, and inflammatory diseases. EMBP is stored in eosinophil crystalloid granules and is released upon degranulation. EMBP is also expressed in basophils. The proform of EMBP is expressed in placental X cells and breast tissue and increases significantly during human pregnancy. EMBP has cytotoxic properties and damages bacteria and mammalian cells, in vitro, as well as, helminth parasites. EMBP deposition has been observed in the inflamed tissue of all
Probab=99.85 E-value=4.2e-21 Score=162.73 Aligned_cols=115 Identities=23% Similarity=0.371 Sum_probs=95.3
Q ss_pred CeEEEEecCCCCCHHHHHHHhhh-cCCcccccCCHhHHHHHHHHhhhcCCCCCceeeecccCCCCcccccCccccccccc
Q psy14885 38 DSCYRFVKSPLKTRNDAKLNCKS-LDSDLANVNDADEHGFIMYQLFWQDPQRRKWYFGGTQQSPNLWVNEDDPQRRKWYF 116 (483)
Q Consensus 38 ~~Cy~~~~~~~~tw~~A~~~C~~-~g~~L~~i~s~~e~~~l~~~~~~~~~~~~~~wig~~~~~~~~Wig~~~~~~~~w~~ 116 (483)
++||+++..+ ++|.+|+..|+. .||+||+|+|++|++||..+++.. ....+|||+....
T Consensus 1 ~~Cy~~~~~~-~t~~~A~~~C~~~~g~~La~i~s~~e~~~l~~~~~~~--~~~~~WiGl~~~~----------------- 60 (117)
T cd03598 1 GRCYRFVKSP-RTFRDAQVICRRCYRGNLASIHSFAFNYRVQRLVSTL--NQAQVWIGGIITG----------------- 60 (117)
T ss_pred CceEEEecCC-CCHHHHHHHhhcCCCceEeeecChhHhHHHHHHHhCC--CCCCEEEeeEcCC-----------------
Confidence 4799999996 999999999999 599999999999999999998543 2345677766431
Q ss_pred CCcCCCCCccccCCCCccccccccCCC-CCCCcccCCeEEEecccccccccccccCCCCceeeeeec
Q psy14885 117 GGTQQSPNLWVNEDGTNLNELDAAFLP-EPADNVQRDYLAYSFSQSLKRWGFERVTGMEPLLFICEA 182 (483)
Q Consensus 117 ~g~~~~~~~~~W~dg~~~~~~~~~w~~-~P~~~~~~~Cv~~~~~~~~~~W~~~~C~~~~~~~fvCe~ 182 (483)
....+.|.|+||+++.+ .+|.+ +|++ +.++|+.+.... +.|++.+|+ .+++||||.
T Consensus 61 ---~~~~~~~~W~dg~~~~y--~~W~~g~p~~-~~~~Cv~~~~~~--g~W~~~~C~--~~~~fiC~~ 117 (117)
T cd03598 61 ---KGRCRRFSWVDGSVWNY--AYWAPGQPGN-RRGHCVELCTRG--GHWRRAHCK--LRRPFICSY 117 (117)
T ss_pred ---CCcCCeeEeCCCCccCc--CCCCCCCCCC-CCCCcEEEeCCC--CeECCCcCC--CCceeeecC
Confidence 12468899999999876 78999 9987 568999997543 369999999 999999984
No 14
>PHA02953 IEV and EEV membrane glycoprotein; Provisional
Probab=99.85 E-value=7.9e-21 Score=169.43 Aligned_cols=119 Identities=13% Similarity=0.245 Sum_probs=90.2
Q ss_pred cCCCcccccccCCeEEEEecCCCCCHHHHHHHhhhcCCcccccCCHhHHHHHHHHhhhcCCCCCceeeecccCCCCcccc
Q psy14885 26 ELRCPQHWVQYRDSCYRFVKSPLKTRNDAKLNCKSLDSDLANVNDADEHGFIMYQLFWQDPQRRKWYFGGTQQSPNLWVN 105 (483)
Q Consensus 26 ~~~C~~gw~~~~~~Cy~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~~l~~~~~~~~~~~~~~wig~~~~~~~~Wig 105 (483)
...||+||..|+++||+++... ++|.+|+..|+.+||+|+.++ +|+.|+..+.. ...||||+++
T Consensus 44 ~~~CP~GW~~~~~~CYk~f~~~-~tW~~A~~~C~~~Gg~L~~~~--~e~~fv~~~~~-----~~~~WIGL~d-------- 107 (170)
T PHA02953 44 PSACAKGWIPYDNYCYLDTNIQ-LSTYGAVYLCNKYRARLPKPN--FRHLKVLSLTY-----GKDFWVSLKK-------- 107 (170)
T ss_pred cCcCcccChhhCCEEEEEECCc-CCHHHHHHHHHhcCCCCCCCc--HHHHHHHHhcc-----CCCEEEeEEC--------
Confidence 5779999999999999999994 999999999999999997766 66788876542 2235555553
Q ss_pred cCcccccccccCCcCCCCCccccCCCC-ccccc----cccCCCCCCCcccCCeEEEecccccccccccccCCCCceeeee
Q psy14885 106 EDDPQRRKWYFGGTQQSPNLWVNEDGT-NLNEL----DAAFLPEPADNVQRDYLAYSFSQSLKRWGFERVTGMEPLLFIC 180 (483)
Q Consensus 106 ~~~~~~~~w~~~g~~~~~~~~~W~dg~-~~~~~----~~~w~~~P~~~~~~~Cv~~~~~~~~~~W~~~~C~~~~~~~fvC 180 (483)
.++.|.|.||. .+.+. ..+|.++|++ +.++|+.+.. ..|+|.+|. ...+|||
T Consensus 108 ----------------~eg~W~w~Dggs~~~y~~~~~~~~w~~~~~~-~~e~C~~~~~----~~W~d~~C~--~~~~fIC 164 (170)
T PHA02953 108 ----------------KNNRWLDINTNKTVDMNKNTELKKIKSKTKN-DNEACYIYKS----GELKETVCN--SVNYIIC 164 (170)
T ss_pred ----------------CCCceEeCCCCeeeccccccccccccCCCCC-CCCCceEEeC----CeEEeccCC--CCcEEEE
Confidence 25678888864 44321 1336655544 5689998743 259999999 9999999
Q ss_pred ecc
Q psy14885 181 EAS 183 (483)
Q Consensus 181 e~~ 183 (483)
||+
T Consensus 165 qk~ 167 (170)
T PHA02953 165 VKR 167 (170)
T ss_pred EEe
Confidence 986
No 15
>PHA03097 C-type lectin-like protein; Provisional
Probab=99.84 E-value=1.6e-20 Score=166.00 Aligned_cols=114 Identities=18% Similarity=0.334 Sum_probs=91.0
Q ss_pred cCCCcccccccCCeEEEEecCCCCCHHHHHHHhhhcCCcccccCCHhHHHHHHHHhhhcCCCCCceeeecccCCCCcccc
Q psy14885 26 ELRCPQHWVQYRDSCYRFVKSPLKTRNDAKLNCKSLDSDLANVNDADEHGFIMYQLFWQDPQRRKWYFGGTQQSPNLWVN 105 (483)
Q Consensus 26 ~~~C~~gw~~~~~~Cy~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~~l~~~~~~~~~~~~~~wig~~~~~~~~Wig 105 (483)
...||+||+.|+++||+++.. .++|.+|+..|+++||+|+.|++++|++||..++. ...||||+++..
T Consensus 43 ~~~CP~gW~~~~~~CY~~s~~-~~sW~~A~~~C~~~g~~La~I~~~~E~~fi~~~~~-----~~~~WIGL~d~~------ 110 (157)
T PHA03097 43 GLNCRSGWVGYNNKCYTFSEN-ITNKHLAIERCADMDGILTLIDDQKEVLFVSRYKG-----GQDLWIGIEKKK------ 110 (157)
T ss_pred CCCCCCCceeeCCEEEEEecC-CCcHHHHHHHHHhCCCEEeeeCCHHHHHHHHHhcC-----CCCEEEeeecCC------
Confidence 567999999999999999999 49999999999999999999999999999998762 234666665432
Q ss_pred cCcccccccccCCcCCCCCccccCCCCccccccccCCCCCCCcccCCeEEEecccccccccccccCCCCceeeeeecch
Q psy14885 106 EDDPQRRKWYFGGTQQSPNLWVNEDGTNLNELDAAFLPEPADNVQRDYLAYSFSQSLKRWGFERVTGMEPLLFICEASI 184 (483)
Q Consensus 106 ~~~~~~~~w~~~g~~~~~~~~~W~dg~~~~~~~~~w~~~P~~~~~~~Cv~~~~~~~~~~W~~~~C~~~~~~~fvCe~~~ 184 (483)
. |+||++.. .+|. +|. ++++|+.+... .|+|.+|. ..++||||+++
T Consensus 111 ------------------~---W~dgs~~~---~~~~-~~~--~~e~Cv~i~~~----~w~d~~C~--~~~~~ICek~~ 156 (157)
T PHA03097 111 ------------------G---DDDDREVL---DKVV-KPP--KSGKCAYLKDK----TIISSNCN--ATKGWICFDRL 156 (157)
T ss_pred ------------------C---ccCCCccc---cccc-CCC--CCCCEEEEECC----cEEeCCCC--CCeeEEEeecC
Confidence 1 88887653 2332 222 35899988542 49999999 99999999974
No 16
>PRK00203 rnhA ribonuclease H; Reviewed
Probab=99.84 E-value=1.3e-20 Score=166.54 Aligned_cols=124 Identities=17% Similarity=0.244 Sum_probs=92.7
Q ss_pred ceEe-eCCCCC--CCceEEEEEeCCCEEEeC---CCCCcchHHHHHHHHHHHHHhHhcCCceEEEEeccHHHHHhhcc--
Q psy14885 221 DVVF-DLSKRN--SNTGFAFIINNENFQYRL---PHEASIFTAEAIAIFYCIKKISDMKIKKAIIFSDNKSVLNGIDN-- 292 (483)
Q Consensus 221 ~~~~-DgS~~~--~~~G~g~~~~~~~~~~~l---~~~~s~~~AEl~ai~~Al~~~~~~~~~~~~I~tDs~~~~~~l~~-- 292 (483)
.++| |||+.+ +..|+|+++...+....+ ....||++|||.||+.||+.+.+ ...+.|+|||+++++.|..
T Consensus 4 v~iytDGs~~~n~~~~g~g~v~~~~~~~~~~~~~~~~~TN~~aEL~Ai~~AL~~~~~--~~~v~I~tDS~yvi~~i~~w~ 81 (150)
T PRK00203 4 VEIYTDGACLGNPGPGGWGAILRYKGHEKELSGGEALTTNNRMELMAAIEALEALKE--PCEVTLYTDSQYVRQGITEWI 81 (150)
T ss_pred EEEEEEecccCCCCceEEEEEEEECCeeEEEecCCCCCcHHHHHHHHHHHHHHHcCC--CCeEEEEECHHHHHHHHHHHH
Confidence 4566 999975 788999988654432211 35789999999999999998863 4689999999999999985
Q ss_pred ------cCCc---chHH-HHHHHHHHHHhhCCCceEEEEeccCCCCcchHHHhHHHHhhhcCCCC
Q psy14885 293 ------IQQR---NNII-QLIKQEYYFASTNGSQISFMWIPSHSNIALNDKADQLAKNSINSKLL 347 (483)
Q Consensus 293 ------~~~~---~~~~-~~i~~~~~~~~~~~~~v~~~WvpgH~g~~gNe~AD~lAk~a~~~~~~ 347 (483)
.+.. .++. ..+++.+..+.. ...|.|.|||||+|++||+.||+|||+|+....+
T Consensus 82 ~~Wk~~~~~~~~g~~v~n~dl~~~i~~l~~-~~~v~~~wV~~H~~~~~N~~AD~lA~~a~~~~~~ 145 (150)
T PRK00203 82 HGWKKNGWKTADKKPVKNVDLWQRLDAALK-RHQIKWHWVKGHAGHPENERCDELARAGAEEATL 145 (150)
T ss_pred HHHHHcCCcccCCCccccHHHHHHHHHHhc-cCceEEEEecCCCCCHHHHHHHHHHHHHHHHhhh
Confidence 2221 1221 233444444443 3789999999999999999999999999876544
No 17
>cd03591 CLECT_collectin_like C-type lectin-like domain (CTLD) of the type found in human collectins including lung surfactant proteins A and D, mannose- or mannan binding lectin (MBL), and CL-L1 (collectin liver 1). CLECT_collectin_like: C-type lectin-like domain (CTLD) of the type found in human collectins including lung surfactant proteins A and D, mannose- or mannan binding lectin (MBL), and CL-L1 (collectin liver 1). CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. The CTLDs of these collectins bind carbohydrates on surfaces (e.g. pathogens, allergens, necrotic, or apoptotic cells) and mediate functions associated with killing and phagocytosis. MBPs recognize high mannose oligosaccharides in a calcium dependent manner, bind to a broad range of pathogens, and trigger cell killing by activating the complement pathway. MBP also acts directly as an opsonin. SP-A and SP-D in addition to functioning as host defense components, a
Probab=99.84 E-value=1.3e-20 Score=158.96 Aligned_cols=111 Identities=22% Similarity=0.269 Sum_probs=93.3
Q ss_pred eEEEEecCCCCCHHHHHHHhhhcCCcccccCCHhHHHHHHHHhhhcCCCCCceeeecccCCCCcccccCcccccccccCC
Q psy14885 39 SCYRFVKSPLKTRNDAKLNCKSLDSDLANVNDADEHGFIMYQLFWQDPQRRKWYFGGTQQSPNLWVNEDDPQRRKWYFGG 118 (483)
Q Consensus 39 ~Cy~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~~l~~~~~~~~~~~~~~wig~~~~~~~~Wig~~~~~~~~w~~~g 118 (483)
+||+++..+ ++|.+|+.+|+.+||+||+|+|.+|++||..++.. ....+|||+++.
T Consensus 2 k~y~~~~~~-~~w~~A~~~C~~~g~~La~i~s~~e~~~l~~~~~~---~~~~~WiGl~~~-------------------- 57 (114)
T cd03591 2 KIFVTNGEE-KNFDDAQKLCSEAGGTLAMPRNAAENAAIASYVKK---GNTYAFIGITDL-------------------- 57 (114)
T ss_pred cEEEeCCce-eCHHHHHHHHhhcCCEEecCCCHHHHHHHHHHHhc---CCccEEEecccC--------------------
Confidence 699999995 99999999999999999999999999999999864 234455555532
Q ss_pred cCCCCCccccCCCCccccccccCCC-CCCCcc-cCCeEEEecccccccccccccCCCCceeeeeec
Q psy14885 119 TQQSPNLWVNEDGTNLNELDAAFLP-EPADNV-QRDYLAYSFSQSLKRWGFERVTGMEPLLFICEA 182 (483)
Q Consensus 119 ~~~~~~~~~W~dg~~~~~~~~~w~~-~P~~~~-~~~Cv~~~~~~~~~~W~~~~C~~~~~~~fvCe~ 182 (483)
..++.|.|+||+++.+ .+|.+ +|++.+ .++|+.+.. . +.|.+.+|+ ..++||||.
T Consensus 58 --~~~~~~~w~dg~~~~y--~~W~~~ep~~~~~~~~Cv~~~~-~--~~W~~~~C~--~~~~fICe~ 114 (114)
T cd03591 58 --ETEGQFVYLDGGPLTY--TNWKPGEPNNAGGGEDCVEMYT-S--GKWNDVACN--LTRLFVCEF 114 (114)
T ss_pred --CcCCcEEeCCCCCccc--CCcCCCCCCCCCCCCCeEEECC-C--CcCcCccCC--CCeeEEeeC
Confidence 2578899999999965 89999 998765 579999973 2 369999999 999999984
No 18
>smart00034 CLECT C-type lectin (CTL) or carbohydrate-recognition domain (CRD). Many of these domains function as calcium-dependent carbohydrate binding modules.
Probab=99.84 E-value=1.7e-20 Score=160.20 Aligned_cols=125 Identities=28% Similarity=0.565 Sum_probs=102.8
Q ss_pred CcccccccCCeEEEEecCCCCCHHHHHHHhhhcCCcccccCCHhHHHHHHHHhhhcCCCCCceeeecccCCCCcccccCc
Q psy14885 29 CPQHWVQYRDSCYRFVKSPLKTRNDAKLNCKSLDSDLANVNDADEHGFIMYQLFWQDPQRRKWYFGGTQQSPNLWVNEDD 108 (483)
Q Consensus 29 C~~gw~~~~~~Cy~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~~l~~~~~~~~~~~~~~wig~~~~~~~~Wig~~~ 108 (483)
||+||+.+++.||+++... ++|.+|+..|+.+||+|++|++.+|++||..++.........||||++..
T Consensus 1 Cp~~w~~~~~~Cy~~~~~~-~~~~~A~~~C~~~~~~La~i~~~~e~~~i~~~~~~~~~~~~~~WiG~~~~---------- 69 (126)
T smart00034 1 CPSGWISYGGKCYKFSTEK-KTWADAQAFCQSLGAHLASIHSEAENDFVASLLKNSGSNSDYYWIGLSDP---------- 69 (126)
T ss_pred CCcchhhhCCEEEEEECCc-cCHHHHHHHHHhcCCEEcccCCHHHHHHHHHHHHhhcCCCCCEEEecCcc----------
Confidence 8999999999999999995 99999999999999999999999999999999876533345566666542
Q ss_pred ccccccccCCcCCCCCccccCCCCc-cccccccCCCCCCCcccCCeEEEecccccccccccccCCCCceeeeee
Q psy14885 109 PQRRKWYFGGTQQSPNLWVNEDGTN-LNELDAAFLPEPADNVQRDYLAYSFSQSLKRWGFERVTGMEPLLFICE 181 (483)
Q Consensus 109 ~~~~~w~~~g~~~~~~~~~W~dg~~-~~~~~~~w~~~P~~~~~~~Cv~~~~~~~~~~W~~~~C~~~~~~~fvCe 181 (483)
...+.|.|+||.+ ..+ .+|.+.|+....++|+.+..... ..|.+.+|. ...+||||
T Consensus 70 ------------~~~~~~~W~dg~~~~~~--~~w~~~~~~~~~~~C~~~~~~~~-~~w~~~~C~--~~~~~ICe 126 (126)
T smart00034 70 ------------DSNGSWQWSDGSGPVNY--SNWAPGEPNGGSGDCVVLSTSGG-GKWNDVSCT--SKLPFVCE 126 (126)
T ss_pred ------------CcCCCeEECCCCCCCCc--cccCCCCCCCCCCCCEEEecCCC-CcccCCCCC--CCcccccC
Confidence 1467899999998 444 78988433334689999977653 469999999 99999997
No 19
>PF00075 RNase_H: RNase H; InterPro: IPR002156 The RNase H domain is responsible for hydrolysis of the RNA portion of RNA x DNA hybrids, and this activity requires the presence of divalent cations (Mg2+ or Mn2+) that bind its active site. This domain is a part of a large family of homologous RNase H enzymes of which the RNase HI protein from Escherichia coli is the best characterised []. Secondary structure predictions for the enzymes from E. coli, yeast, human liver and diverse retroviruses (such as Rous sarcoma virus and the Foamy viruses) supported, in every case, the five beta-strands (1 to 5) and four or five alpha-helices (A, B/C, D, E) that have been identified by crystallography in the RNase H domain of Human immunodeficiency virus 1 (HIV-1) reverse transcriptase and in E. coli RNase H []. Reverse transcriptase (RT) is a modular enzyme carrying polymerase and ribonuclease H (RNase H) activities in separable domains. Reverse transcriptase (RT) converts the single-stranded RNA genome of a retrovirus into a double-stranded DNA copy for integration into the host genome. This process requires ribonuclease H as well as RNA- and DNA-directed DNA polymerase activities. Retroviral RNase H is synthesised as part of the POL polyprotein that contains; an aspartyl protease, a reverse transcriptase, RNase H and integrase. POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. Bacterial RNase H 3.1.26.4 from EC catalyses endonucleolytic cleavage to 5'-phosphomonoester acting on RNA-DNA hybrids. The 3D structure of the RNase H domain from diverse bacteria and retroviruses has been solved [, , ]. All have four beta strands and four to five alpha helices. The E. coli RNase H1 protein binds a single Mg2+ ion cofactor in the active site of the enzyme. The divalent cation is bound by the carboxyl groups of four acidic residues, Asp-10, Glu-48, Asp-70, and Asp-134 []. The first three acidic residues are highly conserved in all bacterial and retroviral RNase H sequences. ; GO: 0003676 nucleic acid binding, 0004523 ribonuclease H activity; PDB: 3LP3_B 2KW4_A 3P1G_A 1RIL_A 2RPI_A 4EQJ_G 4EP2_B 3OTY_P 3U3G_D 2ZQB_D ....
Probab=99.83 E-value=8.2e-21 Score=164.45 Aligned_cols=114 Identities=29% Similarity=0.469 Sum_probs=87.0
Q ss_pred ceEe-eCCCCC--CCceEEEEEeCCC-EEEeCCCCCcchHHHHHHHHHHHHHhHhcCCceEEEEeccHHHHHhhcc----
Q psy14885 221 DVVF-DLSKRN--SNTGFAFIINNEN-FQYRLPHEASIFTAEAIAIFYCIKKISDMKIKKAIIFSDNKSVLNGIDN---- 292 (483)
Q Consensus 221 ~~~~-DgS~~~--~~~G~g~~~~~~~-~~~~l~~~~s~~~AEl~ai~~Al~~~~~~~~~~~~I~tDs~~~~~~l~~---- 292 (483)
.++| |||... +..|+|+++..+. ....++ ..|+++|||.||..||+ +. ....++|+|||+++++.|..
T Consensus 4 ~~iytDgS~~~~~~~~~~g~v~~~~~~~~~~~~-~~s~~~aEl~Ai~~AL~-~~--~~~~v~I~tDS~~v~~~l~~~~~~ 79 (132)
T PF00075_consen 4 IIIYTDGSCRPNPGKGGAGYVVWGGRNFSFRLG-GQSNNRAELQAIIEALK-AL--EHRKVTIYTDSQYVLNALNKWLHG 79 (132)
T ss_dssp EEEEEEEEECTTTTEEEEEEEEETTEEEEEEEE-SECHHHHHHHHHHHHHH-TH--STSEEEEEES-HHHHHHHHTHHHH
T ss_pred EEEEEeCCccCCCCceEEEEEEECCeEEEeccc-ccchhhhheehHHHHHH-Hh--hcccccccccHHHHHHHHHHhccc
Confidence 4566 999764 7778888775442 123334 88999999999999999 54 34999999999999998877
Q ss_pred -cCCcc----hHHHHHHHHHHHHhhCCCceEEEEeccCCCC-cchHHHhHHHHhhh
Q psy14885 293 -IQQRN----NIIQLIKQEYYFASTNGSQISFMWIPSHSNI-ALNDKADQLAKNSI 342 (483)
Q Consensus 293 -~~~~~----~~~~~i~~~~~~~~~~~~~v~~~WvpgH~g~-~gNe~AD~lAk~a~ 342 (483)
.+... ++...|.+.+ ..+..|.|+|||||+|+ .||+.||+|||+|+
T Consensus 80 ~~~~~~~~~~~i~~~i~~~~----~~~~~v~~~~V~~H~~~~~~N~~aD~lAk~a~ 131 (132)
T PF00075_consen 80 NGWKKTSNGRPIKNEIWELL----SRGIKVRFRWVPGHSGVPQGNERADRLAKEAA 131 (132)
T ss_dssp TTSBSCTSSSBHTHHHHHHH----HHSSEEEEEESSSSSSSHHHHHHHHHHHHHHH
T ss_pred cccccccccccchhheeecc----ccceEEeeeeccCcCCCchhHHHHHHHHHHhc
Confidence 33322 3445444433 55889999999999999 79999999999986
No 20
>cd03592 CLECT_selectins_like C-type lectin-like domain (CTLD) of the type found in the type 1 transmembrane proteins: P(platlet)-, E(endothelial)-, and L(leukocyte)- selectins (sels). CLECT_selectins_like: C-type lectin-like domain (CTLD) of the type found in the type 1 transmembrane proteins: P(platlet)-, E(endothelial)-, and L(leukocyte)- selectins (sels). CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. P- E- and L-sels are cell adhesion receptors that mediate the initial attachment, tethering, and rolling of lymphocytes on inflamed vascular walls enabling subsequent lymphocyte adhesion and transmigration. L- sel is expressed constitutively on most leukocytes. P-sel is stored in the Weibel-Palade bodies of endothelial cells and in the alpha granules of platlets. E- sels are present on endothelial cells. Following platelet and/or endothelial cell activation P- sel is rapidly translocated to the cell surface and E-sel exp
Probab=99.83 E-value=2.8e-20 Score=157.17 Aligned_cols=112 Identities=21% Similarity=0.409 Sum_probs=92.5
Q ss_pred EEEecCCCCCHHHHHHHhhhcCCcccccCCHhHHHHHHHHhhhcCCCCCceeeecccCCCCcccccCcccccccccCCcC
Q psy14885 41 YRFVKSPLKTRNDAKLNCKSLDSDLANVNDADEHGFIMYQLFWQDPQRRKWYFGGTQQSPNLWVNEDDPQRRKWYFGGTQ 120 (483)
Q Consensus 41 y~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~~l~~~~~~~~~~~~~~wig~~~~~~~~Wig~~~~~~~~w~~~g~~ 120 (483)
|+++... ++|.+|+.+|+.+||+||+|+|++|++||..++.... ...||||+++..
T Consensus 3 Y~~~~~~-~~w~~A~~~C~~~g~~La~i~s~~e~~~i~~~~~~~~--~~~~WiG~~~~~--------------------- 58 (115)
T cd03592 3 YHYSTEK-MTFNEAVKYCKSRGTDLVAIQNAEENALLNGFALKYN--LGYYWIDGNDIN--------------------- 58 (115)
T ss_pred EEEcCCc-cCHHHHHHHHHHcCCeEeecCCHHHHHHHHHHHHhcC--CCCEEEeCccCC---------------------
Confidence 8899995 9999999999999999999999999999999875432 235666665432
Q ss_pred CCCCccccCCCCccccccccCCC-CCCCcccCCeEEEecccccccccccccCCCCceeeeeec
Q psy14885 121 QSPNLWVNEDGTNLNELDAAFLP-EPADNVQRDYLAYSFSQSLKRWGFERVTGMEPLLFICEA 182 (483)
Q Consensus 121 ~~~~~~~W~dg~~~~~~~~~w~~-~P~~~~~~~Cv~~~~~~~~~~W~~~~C~~~~~~~fvCe~ 182 (483)
..+.|.|.||+++.+ .+|++ +|++..+++|+.+..... +.|+|.+|+ ..++||||+
T Consensus 59 -~~~~W~~~dg~~~~y--~~W~~geP~~~~~~~Cv~~~~~~~-g~W~d~~C~--~~~~fICe~ 115 (115)
T cd03592 59 -NEGTWVDTDKKELEY--KNWAPGEPNNGRNENCLEIYIKDN-GKWNDEPCS--KKKSAICYT 115 (115)
T ss_pred -ccCeEEeCCCCcccc--cccCCCCCCCCCCCCceEEccCCC-CCCcCcCCC--CCccceeCC
Confidence 467899999998765 88999 999866789999976422 469999999 999999996
No 21
>cd03601 CLECT_TC14_like C-type lectin-like domain (CTLD) of the type found in lectins TC14, TC14-2, TC14-3, and TC14-4 from the budding tunicate Polyandrocarpa misakiensis and PfG6 from the Acorn worm. CLECT_TC14_like: C-type lectin-like domain (CTLD) of the type found in lectins TC14, TC14-2, TC14-3, and TC14-4 from the budding tunicate Polyandrocarpa misakiensis and PfG6 from the Acorn worm. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. TC14 is homodimeric. The CTLD of TC14 binds D-galactose and D-fucose. TC14 is expressed constitutively by multipotent epithelial and mesenchymal cells and plays in role during budding, in inducing the aggregation of undifferentiated mesenchymal cells to give rise to epithelial forming tissue. TC14-2 and TC14-3 shows calcium-dependent galactose binding activity. TC14-3 is a cytostatic factor which blocks cell growth and dedifferentiation of the atrial epithelium during asexual reproducti
Probab=99.82 E-value=6.5e-20 Score=155.89 Aligned_cols=113 Identities=16% Similarity=0.169 Sum_probs=84.7
Q ss_pred EEEecCCCCCHHHHHHHhhhcCCcccccCCHhHHHHHHHHhhhcCCCCCceeeecccCCCCcccccCcccccccccCCcC
Q psy14885 41 YRFVKSPLKTRNDAKLNCKSLDSDLANVNDADEHGFIMYQLFWQDPQRRKWYFGGTQQSPNLWVNEDDPQRRKWYFGGTQ 120 (483)
Q Consensus 41 y~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~~l~~~~~~~~~~~~~~wig~~~~~~~~Wig~~~~~~~~w~~~g~~ 120 (483)
|+|+... ++|.+|+..|+.+||+||+|+|.+| .++..+..........||||++ | .
T Consensus 3 ~~~~~~~-~~w~~A~~~C~~~G~~La~i~s~~e-~~~~~i~~~~~~~~~~~WIGl~-----------d-----------~ 58 (119)
T cd03601 3 ILCSDET-MNYAKAGAFCRSRGMRLASLAMRDS-EMRDAILAFTLVKGHGYWVGAD-----------N-----------L 58 (119)
T ss_pred EEEcCCc-CCHHHHHHHHHhcCCEEeeecCHHH-HHHHHHHhccccCCccEEEEec-----------c-----------C
Confidence 7788884 9999999999999999999999988 3333332211112233455544 4 2
Q ss_pred -CCCCccccCCCCccc-cccccCCC-CCCCcc-cCCeEEEecccccccccccccCCCCceeeeeec
Q psy14885 121 -QSPNLWVNEDGTNLN-ELDAAFLP-EPADNV-QRDYLAYSFSQSLKRWGFERVTGMEPLLFICEA 182 (483)
Q Consensus 121 -~~~~~~~W~dg~~~~-~~~~~w~~-~P~~~~-~~~Cv~~~~~~~~~~W~~~~C~~~~~~~fvCe~ 182 (483)
..++.|+|+||+++. .| .+|++ +|++.+ .++||.+.... +.|+|.+|. ..++|||||
T Consensus 59 ~~~~g~~~W~dG~~~~~~y-~~W~~geP~~~~~~e~Cv~~~~~~--~~W~d~~C~--~~~~fICek 119 (119)
T cd03601 59 QDGEYDFLWNDGVSLPTDS-DLWAPNEPSNPQSRQLCVQLWSKY--NLLDDEYCG--RAKRVICEK 119 (119)
T ss_pred CCCCCCeEeCCCCCcCCCC-CccCCCcCcCcCCCcCCeEEeCCC--CCEeCccCC--CCceeeecC
Confidence 357889999999884 23 89999 998754 58999986543 369999999 999999996
No 22
>TIGR00864 PCC polycystin cation channel protein. Note: this model has been restricted to the amino half because for technical reasons.
Probab=99.80 E-value=1.2e-19 Score=210.20 Aligned_cols=150 Identities=15% Similarity=0.229 Sum_probs=119.3
Q ss_pred cCCCccccccc--CCeEEEEecCCCCCHHHHHHHhhhc-CCcccccCCHhHHHHHHHHhhhcCCCCCceeeecccCCCCc
Q psy14885 26 ELRCPQHWVQY--RDSCYRFVKSPLKTRNDAKLNCKSL-DSDLANVNDADEHGFIMYQLFWQDPQRRKWYFGGTQQSPNL 102 (483)
Q Consensus 26 ~~~C~~gw~~~--~~~Cy~~~~~~~~tw~~A~~~C~~~-g~~L~~i~s~~e~~~l~~~~~~~~~~~~~~wig~~~~~~~~ 102 (483)
...||+||..| +++||+++... ++|.+|+..|+++ ||+|++|+|.+|++||..++.+.. ...+|||+++..
T Consensus 315 ~~~CP~GW~~f~~~g~CYk~~~e~-~TW~dAe~~C~s~GGAhLAsI~S~eEn~FL~~lv~~s~--~~~vWIGLsD~~--- 388 (2740)
T TIGR00864 315 HPHCPKDGEIFEENGHCFQIVPEE-AAWLDAQEQCLARAGAALAIVDNDALQNFLARKVTHSL--DRGVWIGFSDVN--- 388 (2740)
T ss_pred CCCCCCCCeecCCCCEEEEEeCCc-cCHHHHHHHhhccCCeEEecCCCHHHHHHHHHHhhccC--CccEEEeeeCCC---
Confidence 36799999755 79999999995 9999999999999 599999999999999999886432 223555554322
Q ss_pred ccccCcccccccccCCcCCCCC--ccccCCCCccccccccCCC-CCCCcccCCeEEEecccccccccccccCCCCceeee
Q psy14885 103 WVNEDDPQRRKWYFGGTQQSPN--LWVNEDGTNLNELDAAFLP-EPADNVQRDYLAYSFSQSLKRWGFERVTGMEPLLFI 179 (483)
Q Consensus 103 Wig~~~~~~~~w~~~g~~~~~~--~~~W~dg~~~~~~~~~w~~-~P~~~~~~~Cv~~~~~~~~~~W~~~~C~~~~~~~fv 179 (483)
.++ .|+|+||+++..| .+|.+ ||++..+++|+.+... ..|+|.+|+ ..++||
T Consensus 389 -------------------~EG~g~WvWsDGS~l~~Y-tnW~pGEPNn~~~EdCV~l~~~---g~WND~~Cs--~~~~FI 443 (2740)
T TIGR00864 389 -------------------GAEKGPAHQGEAFEAEEC-EEGLAGEPHPARAEHCVRLDPR---GQCNSDLCN--APHAYV 443 (2740)
T ss_pred -------------------CCCceeeEECCCCcccCc-cCcCCCCCCCCCCCCCEEEcCC---CCEEccCCC--CCeeEE
Confidence 334 4999999998544 89999 9998777999998543 359999999 999999
Q ss_pred eecchhhhhhhccccccccccccccCCCCC
Q psy14885 180 CEASIQKLHYLLNDDRTYQYGMDIENPDKI 209 (483)
Q Consensus 180 Ce~~~~~~~~~~~~~~~~~yg~~~~~~~~~ 209 (483)
||+++ ..++.+..-|.+|+.+-++...
T Consensus 444 CE~~~---~~~v~d~~~~~~g~~~~~~~~~ 470 (2740)
T TIGR00864 444 CELNP---GGPVPDAENFAMGAASFDLHGL 470 (2740)
T ss_pred eEECC---CCcccccceEEeecccCCcccc
Confidence 99974 3667777888888877655444
No 23
>cd03603 CLECT_VCBS A bacterial subgroup of the C-type lectin-like (CTLD) domain; a subgroup of bacterial protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. CLECT_VCBS: A bacterial subgroup of the C-type lectin-like (CTLD) domain; a subgroup of bacterial protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. Many CTLDs are calcium-dependent carbohydrate binding modules; other CTLDs bind protein ligands, lipids, and inorganic surfaces including CaCO3 and ice. Bacterial CTLDs within this group are functionally uncharacterized. Animal C-type lectins are involved in such functions as extracellular matrix organization, endocytosis, complement activation, pathogen recognition, and cell-cell interactions. CTLDs may bind a variety of carbohydrate ligands including mannose, N-acetylglucosamine, galactose, N-acetylgalactosamine, and fucose. CTLDs associate with each other through several different surface
Probab=99.80 E-value=2.3e-19 Score=152.22 Aligned_cols=113 Identities=20% Similarity=0.342 Sum_probs=91.1
Q ss_pred eEEEEecCCCCCHHHHHHHhhhcCCcccccCCHhHHHHHHHHhhhcCCCCCceeeecccCCCCcccccCcccccccccCC
Q psy14885 39 SCYRFVKSPLKTRNDAKLNCKSLDSDLANVNDADEHGFIMYQLFWQDPQRRKWYFGGTQQSPNLWVNEDDPQRRKWYFGG 118 (483)
Q Consensus 39 ~Cy~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~~l~~~~~~~~~~~~~~wig~~~~~~~~Wig~~~~~~~~w~~~g 118 (483)
+||+++..+ ++|.+|+..|+++||+||+|+|++|++||..++. ....+|||+++..
T Consensus 1 ~~Y~~~~~~-~sw~~A~~~C~~~g~~La~I~s~~E~~fv~~~~~----~~~~~WiG~~~~~------------------- 56 (118)
T cd03603 1 HFYKFVDGG-MTWEAAQTLAESLGGHLVTINSAEENDWLLSNFG----GYGASWIGASDAA------------------- 56 (118)
T ss_pred CeEEEeCCC-cCHHHHHHHHHHcCCEEcccCCHHHHHHHHHHhc----cCCCEEEeeecCC-------------------
Confidence 599999995 9999999999999999999999999999999885 2335566665432
Q ss_pred cCCCCCccccCCCCccccccccCCC-CC-CCc-ccCCeEEEecc-cccccccccccCCCCceeeeee
Q psy14885 119 TQQSPNLWVNEDGTNLNELDAAFLP-EP-ADN-VQRDYLAYSFS-QSLKRWGFERVTGMEPLLFICE 181 (483)
Q Consensus 119 ~~~~~~~~~W~dg~~~~~~~~~w~~-~P-~~~-~~~~Cv~~~~~-~~~~~W~~~~C~~~~~~~fvCe 181 (483)
.++.|.|+||+++.+ .+|.+ +| ++. +.++|+.+... ...+.|+|.+|+ ..+.+||||
T Consensus 57 ---~~~~w~W~dg~~~~~--~~W~~~eP~~~~~~~~~Cv~~~~~~~~~~~W~d~~C~-~~~~~~iCe 117 (118)
T cd03603 57 ---TEGTWKWSDGEESTY--TNWGSGEPHNNGGGNEDYAAINHFPGISGKWNDLANS-YNTLGYVIE 117 (118)
T ss_pred ---CCCceEeCCCCcCCC--CCcCCCCCCCCCCCCcCeEEeecCCCCCCcCccCCCC-ccccceEEe
Confidence 578899999999876 78999 99 553 35899998652 012359999997 458899998
No 24
>cd03595 CLECT_chondrolectin_like C-type lectin-like domain (CTLD) of the type found in the human type-1A transmembrane proteins chondrolectin (CHODL) and layilin. CLECT_chondrolectin_like: C-type lectin-like domain (CTLD) of the type found in the human type-1A transmembrane proteins chondrolectin (CHODL) and layilin. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. CHODL is predominantly expressed in muscle cells and is associated with T-cell maturation. Various alternatively spliced isoforms have been of CHODL have been identified. The transmembrane form of CHODL is localized in the ER-Golgi apparatus. Layilin is widely expressed in different cell types. The extracellular CTLD of layilin binds hyaluronan (HA), a major constituent of the extracellular matrix (ECM). The cytoplasmic tail of layilin binds various members of the band 4.1/ERM superfamily (talin, radixin, and merlin). The ERM proteins are cytoskeleton-membrane l
Probab=99.80 E-value=2.9e-19 Score=158.02 Aligned_cols=131 Identities=15% Similarity=0.259 Sum_probs=99.1
Q ss_pred CcccccccCCeEEEEe-----cCCCCCHHHHHHHhhhcCCcccccCCHhHHHHHHHHhhhcCCCCCceeeecccCCCCcc
Q psy14885 29 CPQHWVQYRDSCYRFV-----KSPLKTRNDAKLNCKSLDSDLANVNDADEHGFIMYQLFWQDPQRRKWYFGGTQQSPNLW 103 (483)
Q Consensus 29 C~~gw~~~~~~Cy~~~-----~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~~l~~~~~~~~~~~~~~wig~~~~~~~~W 103 (483)
|..|+ ++.||+++ .. .++|.+|+..|+.+||+|++|+|++|++||..+++........+|||+++.....
T Consensus 4 ~~~~~---~~~Cy~~~~~~~~~~-~~tw~~A~~~C~~~g~~LasI~s~~E~~~i~~~i~~~~~~~~~~WIGl~~~~~~~- 78 (149)
T cd03595 4 CRRGT---EKPCYKIAYFQDSRR-RLNFEEARQACREDGGELLSIESENEQKLIERFIQTLRASDGDFWIGLRRSSQYN- 78 (149)
T ss_pred eccCC---CCccEEEEEEecccc-ccCHHHHHHHHHHcCCEECccCCHHHHHHHHHHHHhhcCCCCcEEEEeECCCCcc-
Confidence 55554 47899975 56 4999999999999999999999999999999988755445567888887543100
Q ss_pred cccCcccccccccCCcCCCCCccccCCCCccccccccCCC-CCCCcccCCeEEEeccc---------ccccccccccCCC
Q psy14885 104 VNEDDPQRRKWYFGGTQQSPNLWVNEDGTNLNELDAAFLP-EPADNVQRDYLAYSFSQ---------SLKRWGFERVTGM 173 (483)
Q Consensus 104 ig~~~~~~~~w~~~g~~~~~~~~~W~dg~~~~~~~~~w~~-~P~~~~~~~Cv~~~~~~---------~~~~W~~~~C~~~ 173 (483)
... ....+.|.|+||+++.+ .+|.+ +|++ +.++|+++.... ...+|+|..|+
T Consensus 79 --~~~-----------~~~~~~~~W~dG~~~~y--~~W~~~eP~~-~~~~Cv~l~~~~~~~~~~~~~~~~~W~d~~C~-- 140 (149)
T cd03595 79 --VTS-----------SACSSLYYWLDGSISTF--RNWYVDEPSC-GSEVCVVMYHQPSAPAGQGGPYLFQWNDDNCN-- 140 (149)
T ss_pred --ccc-----------cccCCccEEcCCCccCc--cCCCCCCCCC-cccCCEEEEecCCCCcCcCcccCCCccCCCCC--
Confidence 000 00135799999999876 78999 9986 458999996421 12369999999
Q ss_pred Cceeeeeec
Q psy14885 174 EPLLFICEA 182 (483)
Q Consensus 174 ~~~~fvCe~ 182 (483)
..++||||.
T Consensus 141 ~~~~fICe~ 149 (149)
T cd03595 141 MKNNFICKY 149 (149)
T ss_pred CCcccccCC
Confidence 999999984
No 25
>cd03600 CLECT_thrombomodulin_like C-type lectin-like domain (CTLD) of the type found in human thrombomodulin(TM), Endosialin, C14orf27, and C1qR. CLECT_thrombomodulin_like: C-type lectin-like domain (CTLD) of the type found in human thrombomodulin(TM), Endosialin, C14orf27, and C1qR. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. In these thrombomodulin-like proteins the residues involved in coordinating Ca2+ in the classical MBP-A CTLD are not conserved. TM exerts anti-fibrinolytic and anti-inflammatory activity. TM also regulates blood coagulation in the anticoagulant protein C pathway. In this pathway, the procoagulant properties of thrombin (T) are lost when it binds TM. TM also plays a key role in tumor biology. It is expressed on endothelial cells and on several type of tumor cell including squamous cell carcinoma. Loss of TM expression correlates with advanced stage and poor prognosis. Loss of function of TM func
Probab=99.79 E-value=5.4e-19 Score=154.92 Aligned_cols=129 Identities=19% Similarity=0.239 Sum_probs=96.5
Q ss_pred cCCeEEEEecCCCCCHHHHHHHhhhcCCcccccCCHhHHHHHHHHhhhcCC----CCCceeeecccCC-CCcccccCccc
Q psy14885 36 YRDSCYRFVKSPLKTRNDAKLNCKSLDSDLANVNDADEHGFIMYQLFWQDP----QRRKWYFGGTQQS-PNLWVNEDDPQ 110 (483)
Q Consensus 36 ~~~~Cy~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~~l~~~~~~~~~----~~~~~wig~~~~~-~~~Wig~~~~~ 110 (483)
.+++||+++... ++|.+|+.+|+.+||+||+|+|.+|+++|..++..... ....||||+++.. ...+.
T Consensus 2 ~~~~Cy~~~~~~-~sw~~A~~~C~~~gg~La~i~s~~E~~~v~~~l~~~~~~~~~~~~~~WIGl~~~~~~~~~~------ 74 (141)
T cd03600 2 VSDACYTLHPQK-LTFLEAQRSCIELGGNLATVRSGEEADVVSLLLAAGPGRHGRGSLRLWIGLQREPRQCSDP------ 74 (141)
T ss_pred CCCceEEEeCCc-cCHHHHHHHHHhhCCEeeecCCHHHHHHHHHHHhhccccccCCCccEEEeEecCcccCccc------
Confidence 357899999995 99999999999999999999999999999999876521 2456777776522 11111
Q ss_pred ccccccCCcCCCCCccccCCCCccccccccCCCCCCCc-ccCCeEEEecccc---cccccccccCCCCc-eeeeeecc
Q psy14885 111 RRKWYFGGTQQSPNLWVNEDGTNLNELDAAFLPEPADN-VQRDYLAYSFSQS---LKRWGFERVTGMEP-LLFICEAS 183 (483)
Q Consensus 111 ~~~w~~~g~~~~~~~~~W~dg~~~~~~~~~w~~~P~~~-~~~~Cv~~~~~~~---~~~W~~~~C~~~~~-~~fvCe~~ 183 (483)
....+.|.|+||.....| ++|.+.|++. ..++||++....+ ...|+|.+|+ .. .+||||..
T Consensus 75 ---------~~~~~~f~W~d~~~~~~y-~~W~~~p~n~~~~~~Cv~l~~~~~~~~~~~W~d~~C~--~~~~~fIC~~~ 140 (141)
T cd03600 75 ---------SLPLRGFSWVTGDQDTDF-SNWLQEPAGTCTSPRCVALSAAGSTPDNLKWKDGPCS--ARADGYLCKFS 140 (141)
T ss_pred ---------cccCCccEECCCCCCCCc-cccccCCCCCCCCCccEEEEccCCCCCCCccccCCcC--CCCCCeEEeee
Confidence 113467999998654333 8899955443 4689999977542 3479999999 77 48999964
No 26
>cd03602 CLECT_1 C-type lectin (CTL)/C-type lectin-like (CTLD) domain subgroup 1; a subgroup of protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. CLECT_1: C-type lectin (CTL)/C-type lectin-like (CTLD) domain subgroup 1; a subgroup of protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. Many CTLDs are calcium-dependent carbohydrate binding modules; other CTLDs bind protein ligands, lipids, and inorganic surfaces including CaCO3 and ice. Animal C-type lectins are involved in such functions as extracellular matrix organization, endocytosis, complement activation, pathogen recognition, and cell-cell interactions. CTLDs may bind a variety of carbohydrate ligands including mannose, N-acetylglucosamine, galactose, N-acetylgalactosamine, and fucose. CTLDs associate with each other through several different surfaces to form dimers, trimers, or tetramers from which ligand-binding sites project in differe
Probab=99.78 E-value=5.5e-19 Score=147.53 Aligned_cols=106 Identities=21% Similarity=0.307 Sum_probs=88.4
Q ss_pred EEEEecCCCCCHHHHHHHhhhcCCcccccCCHhHHHHHHHHhhhcCCCCCceeeecccCCCCcccccCcccccccccCCc
Q psy14885 40 CYRFVKSPLKTRNDAKLNCKSLDSDLANVNDADEHGFIMYQLFWQDPQRRKWYFGGTQQSPNLWVNEDDPQRRKWYFGGT 119 (483)
Q Consensus 40 Cy~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~~l~~~~~~~~~~~~~~wig~~~~~~~~Wig~~~~~~~~w~~~g~ 119 (483)
||+++..+ ++|.+|+..|+.+||+|++|+|.+|++||..++. .....||||++ +
T Consensus 2 ~y~~~~~~-~~w~~A~~~C~~~g~~La~i~s~~e~~~l~~~~~---~~~~~~WiGl~-----------~----------- 55 (108)
T cd03602 2 TFYLVNES-KTWSEAQQYCRENYTDLATVQNQEDNALLSNLSR---VSNSAAWIGLY-----------R----------- 55 (108)
T ss_pred ceEEeccc-cCHHHHHHHHHHHCCccCeecCHHHHHHHHHHHh---ccCCcEEEEEE-----------C-----------
Confidence 79999995 9999999999999999999999999999999885 12344555555 3
Q ss_pred CCCCCccccCCCCccccccccCCC-CCCCcccCCeEEEecccccccccccccCCCCceeeeeec
Q psy14885 120 QQSPNLWVNEDGTNLNELDAAFLP-EPADNVQRDYLAYSFSQSLKRWGFERVTGMEPLLFICEA 182 (483)
Q Consensus 120 ~~~~~~~~W~dg~~~~~~~~~w~~-~P~~~~~~~Cv~~~~~~~~~~W~~~~C~~~~~~~fvCe~ 182 (483)
..+.|.|+||++..+ .+|.+ +|.+ +++|+.+... +.|.+.+|. ..++||||+
T Consensus 56 --~~~~~~W~dg~~~~~--~~w~~~~~~~--~~~C~~~~~~---~~w~~~~C~--~~~~fIC~~ 108 (108)
T cd03602 56 --DVDSWRWSDGSESSF--RNWNTFQPFG--QGDCATMYSS---GRWYAALCS--ALKPFICYD 108 (108)
T ss_pred --CCCceEEcCCCCCcc--CccCCCCCCC--CCCeeEECcC---CeECcccCC--CCcCEeccC
Confidence 156799999999765 88999 8754 5899999754 369999999 999999996
No 27
>PHA02867 C-type lectin protein; Provisional
Probab=99.75 E-value=3.7e-18 Score=150.17 Aligned_cols=112 Identities=19% Similarity=0.257 Sum_probs=87.3
Q ss_pred cCCCcccccccCCeEEEEecCCCCCHHHHHHHhhhcCCcccccCCHhHHHHHHHHhhhcCCCCCceeeecccCCCCcccc
Q psy14885 26 ELRCPQHWVQYRDSCYRFVKSPLKTRNDAKLNCKSLDSDLANVNDADEHGFIMYQLFWQDPQRRKWYFGGTQQSPNLWVN 105 (483)
Q Consensus 26 ~~~C~~gw~~~~~~Cy~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~~l~~~~~~~~~~~~~~wig~~~~~~~~Wig 105 (483)
+.+||+||..|+++||+++.. .++|.+|+..|+++||+|+.|+|++|++|+..+.. ..||||+++..
T Consensus 46 ~~~CP~gWi~~~~~CY~fs~~-~~tW~~A~~~C~~~ga~La~I~s~eE~~Fl~~~~~------~~~WIGLs~~~------ 112 (167)
T PHA02867 46 SKVCPDEWIGYNSKCYYFTIN-ETNWNDSKKLCDVMDSSLIRFDNIETLNFVSRYGK------GSYWIDINQNR------ 112 (167)
T ss_pred CCCCCCCCEEECCEEEEEecc-ccCHHHHHHHHhhCCCEECCcCCHHHHHHHHHcCC------CCEEEEEEeCC------
Confidence 678999999999999999999 59999999999999999999999999999987531 35777776543
Q ss_pred cCcccccccccCCcCCCCCccccCCCCccccccccCCCCCCCcccCCeEEEecccccccccccccCCCCceeeeeecchh
Q psy14885 106 EDDPQRRKWYFGGTQQSPNLWVNEDGTNLNELDAAFLPEPADNVQRDYLAYSFSQSLKRWGFERVTGMEPLLFICEASIQ 185 (483)
Q Consensus 106 ~~~~~~~~w~~~g~~~~~~~~~W~dg~~~~~~~~~w~~~P~~~~~~~Cv~~~~~~~~~~W~~~~C~~~~~~~fvCe~~~~ 185 (483)
.+.|.+++.+ .. ..++++|+.+... .+.+..|. ....+||+|+..
T Consensus 113 -------------------~~~~~~~s~~-------~~---~~~~~~Ca~i~~~----~i~s~~C~--~~~~wIC~K~~~ 157 (167)
T PHA02867 113 -------------------KIPGINFSLY-------YE---QGVNDICLLFDTS----NIIEMSCI--FHERTICVKEDR 157 (167)
T ss_pred -------------------CCCCccCcee-------ee---cCCCCcEEEEeCC----eEEeeccc--CCcEEEEEccCc
Confidence 1122333222 11 1235799998653 48999999 999999999843
No 28
>PRK13907 rnhA ribonuclease H; Provisional
Probab=99.73 E-value=3.7e-17 Score=140.78 Aligned_cols=118 Identities=20% Similarity=0.242 Sum_probs=97.8
Q ss_pred Ee-eCCCCC--CCceEEEEEeCCCEE--EeC-CCCCcchHHHHHHHHHHHHHhHhcCCceEEEEeccHHHHHhhcccCCc
Q psy14885 223 VF-DLSKRN--SNTGFAFIINNENFQ--YRL-PHEASIFTAEAIAIFYCIKKISDMKIKKAIIFSDNKSVLNGIDNIQQR 296 (483)
Q Consensus 223 ~~-DgS~~~--~~~G~g~~~~~~~~~--~~l-~~~~s~~~AEl~ai~~Al~~~~~~~~~~~~I~tDs~~~~~~l~~~~~~ 296 (483)
+| |||... ++.|+|+++.+.... ... ....||++||+.|++.||+.+.+.+..++.|+|||+.+++.+++.+..
T Consensus 4 iy~DGa~~~~~g~~G~G~vi~~~~~~~~~~~~~~~~tn~~AE~~All~aL~~a~~~g~~~v~i~sDS~~vi~~~~~~~~~ 83 (128)
T PRK13907 4 VYIDGASKGNPGPSGAGVFIKGVQPAVQLSLPLGTMSNHEAEYHALLAALKYCTEHNYNIVSFRTDSQLVERAVEKEYAK 83 (128)
T ss_pred EEEeeCCCCCCCccEEEEEEEECCeeEEEEecccccCCcHHHHHHHHHHHHHHHhCCCCEEEEEechHHHHHHHhHHHhc
Confidence 45 999976 899999999765443 222 356899999999999999999988888999999999999999987665
Q ss_pred chHHHHHHHHHHHHhhCCCceEEEEeccCCCCcchHHHhHHHHhhhcC
Q psy14885 297 NNIIQLIKQEYYFASTNGSQISFMWIPSHSNIALNDKADQLAKNSINS 344 (483)
Q Consensus 297 ~~~~~~i~~~~~~~~~~~~~v~~~WvpgH~g~~gNe~AD~lAk~a~~~ 344 (483)
+.-...+.+.+..+..+...+.|.|||+ +.|+.||.||+.|+..
T Consensus 84 ~~~~~~l~~~~~~l~~~f~~~~~~~v~r----~~N~~Ad~LA~~a~~~ 127 (128)
T PRK13907 84 NKMFAPLLEEALQYIKSFDLFFIKWIPS----SQNKVADELARKAILQ 127 (128)
T ss_pred ChhHHHHHHHHHHHHhcCCceEEEEcCc----hhchhHHHHHHHHHhc
Confidence 5555666666777778888999999996 4999999999998753
No 29
>cd06222 RnaseH RNase H (RNase HI) is an endonuclease that cleaves the RNA strand of an RNA/DNA hybrid in a not sequence-specific manner. One of the important functions of RNase H is to remove Okazaki fragments during DNA replication. RNase H knockout mice lack mitochondrial DNA replication and die as embryos. The retroviral reverse transcriptase contains an RNase H domain that plays an important role in converting a single stranded retroviral genomic RNA into a dsDNA for integration into host chromosomes. RNase H inhibitors have been explored as an anti-HIV drug target because RNase H inactivation inhibits reverse transcription.
Probab=99.73 E-value=9.4e-17 Score=136.73 Aligned_cols=120 Identities=22% Similarity=0.284 Sum_probs=97.2
Q ss_pred Ee-eCCCCC--CCceEEEEEeCCCE--EEeC---CCCCcchHHHHHHHHHHHHHhHhcCCceEEEEeccHHHHHhhcccC
Q psy14885 223 VF-DLSKRN--SNTGFAFIINNENF--QYRL---PHEASIFTAEAIAIFYCIKKISDMKIKKAIIFSDNKSVLNGIDNIQ 294 (483)
Q Consensus 223 ~~-DgS~~~--~~~G~g~~~~~~~~--~~~l---~~~~s~~~AEl~ai~~Al~~~~~~~~~~~~I~tDs~~~~~~l~~~~ 294 (483)
+| |||... +.+|+|+++.+... .+.. ....++++||+.|+..||+.+......++.|++||+.++..+.+..
T Consensus 2 ~~~Dgs~~~~~~~~g~g~v~~~~~~~~~~~~~~~~~~~s~~~aEl~al~~al~~~~~~~~~~i~i~~Ds~~~~~~~~~~~ 81 (130)
T cd06222 2 IYTDGSCRGNPGPAGAGVVLRDPGGEVLLSGGLLGGNTTNNRAELLALIEALELALELGGKKVNIYTDSQYVINALTGWY 81 (130)
T ss_pred EEecccCCCCCCceEEEEEEEeCCCeEEEeccccCCCCcHHHHHHHHHHHHHHHHHhCCCceEEEEECHHHHHHHhhccc
Confidence 45 999977 58899999976443 2211 2578999999999999999998888999999999999999999865
Q ss_pred C-cchHHHHHHHHHHHHhhCCCceEEEEeccCCCCcchHHHhHHHHhhh
Q psy14885 295 Q-RNNIIQLIKQEYYFASTNGSQISFMWIPSHSNIALNDKADQLAKNSI 342 (483)
Q Consensus 295 ~-~~~~~~~i~~~~~~~~~~~~~v~~~WvpgH~g~~gNe~AD~lAk~a~ 342 (483)
. .......+...+..+...+..++|.|||+|+++.+|+.||.|||+|.
T Consensus 82 ~~~~~~~~~~~~~i~~~~~~~~~~~i~~v~~h~~~~~n~~ad~la~~~~ 130 (130)
T cd06222 82 EGKPVKNVDLWQRLLALLKRFHKVRFEWVPGHSGIEGNERADALAKEAA 130 (130)
T ss_pred cCCChhhHHHHHHHHHHHhCCCeEEEEEcCCCCCCcchHHHHHHHHhhC
Confidence 4 23334444555555556789999999999999999999999999874
No 30
>KOG3752|consensus
Probab=99.68 E-value=2.4e-16 Score=152.23 Aligned_cols=123 Identities=24% Similarity=0.239 Sum_probs=95.9
Q ss_pred ceEe-eCCCCC-----CCceEEEEEeCC---CEEEeCC-CCCcchHHHHHHHHHHHHHhHhcCCceEEEEeccHHHHHhh
Q psy14885 221 DVVF-DLSKRN-----SNTGFAFIINNE---NFQYRLP-HEASIFTAEAIAIFYCIKKISDMKIKKAIIFSDNKSVLNGI 290 (483)
Q Consensus 221 ~~~~-DgS~~~-----~~~G~g~~~~~~---~~~~~l~-~~~s~~~AEl~ai~~Al~~~~~~~~~~~~I~tDs~~~~~~l 290 (483)
.++| |||+.. ..+|+||+..++ +....+. +..|||+|||.||..||+.+.+....+++|.|||++++++|
T Consensus 213 ~vvytDGS~~~ng~~~~~AGyGvywg~~~e~N~s~pv~~g~qtNnrAEl~Av~~ALkka~~~~~~kv~I~TDS~~~i~~l 292 (371)
T KOG3752|consen 213 QVVYTDGSSSGNGRKSSRAGYGVYWGPGHELNVSGPLAGGRQTNNRAELIAAIEALKKARSKNINKVVIRTDSEYFINSL 292 (371)
T ss_pred eEEEecCccccCCCCCCcceeEEeeCCCCcccccccCCCCcccccHHHHHHHHHHHHHHHhcCCCcEEEEechHHHHHHH
Confidence 6677 999876 578999999764 4466675 88999999999999999999998888999999999999999
Q ss_pred cc--------cCC---cch------HHHHHHHHHHHHhh--CCCceEEEEeccCCCCcchHHHhHHHHhhhc
Q psy14885 291 DN--------IQQ---RNN------IIQLIKQEYYFAST--NGSQISFMWIPSHSNIALNDKADQLAKNSIN 343 (483)
Q Consensus 291 ~~--------~~~---~~~------~~~~i~~~~~~~~~--~~~~v~~~WvpgH~g~~gNe~AD~lAk~a~~ 343 (483)
+. ++. .++ ........+..|.+ ++.+|++.|||||.|+.|||.||.||++...
T Consensus 293 ~~wv~~~k~~~~k~~~~~~~i~~~v~n~~~~~e~~~l~q~~~~~~vq~~~V~Gh~gi~gne~Ad~lARkgs~ 364 (371)
T KOG3752|consen 293 TLWVQGWKKNGWKTSNGSDRICAYVKNQDFFNELDELEQEISNKKVQQEYVGGHSGILGNEMADALARKGST 364 (371)
T ss_pred HHHHhhhccCccccccCCCccceeeecchHHHHHHHHHhhhccCceEEEEecCcCCcchHHHHHHHHhhhhh
Confidence 74 111 111 01112223333333 4689999999999999999999999998743
No 31
>cd00037 CLECT C-type lectin (CTL)/C-type lectin-like (CTLD) domain. CLECT: C-type lectin (CTL)/C-type lectin-like (CTLD) domain; protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. This group is chiefly comprised of eukaryotic CTLDs, but contains some, as yet functionally uncharacterized, bacterial CTLDs. Many CTLDs are calcium-dependent carbohydrate binding modules; other CTLDs bind protein ligands, lipids, and inorganic surfaces, including CaCO3 and ice. Animal C-type lectins are involved in such functions as extracellular matrix organization, endocytosis, complement activation, pathogen recognition, and cell-cell interactions. For example: mannose-binding lectin and lung surfactant proteins A and D bind carbohydrates on surfaces (e.g. pathogens, allergens, necrotic, and apoptotic cells) and mediate functions associated with killing and phagocytosis; P (platlet)-, E (endothelial)-, and L (leukocyte)- selectins (sels) mediate the initia
Probab=99.61 E-value=3.1e-15 Score=124.92 Aligned_cols=114 Identities=28% Similarity=0.488 Sum_probs=92.8
Q ss_pred eEEEEecCCCCCHHHHHHHhhhcCCcccccCCHhHHHHHHHHhhhcCCCCCceeeecccCCCCcccccCcccccccccCC
Q psy14885 39 SCYRFVKSPLKTRNDAKLNCKSLDSDLANVNDADEHGFIMYQLFWQDPQRRKWYFGGTQQSPNLWVNEDDPQRRKWYFGG 118 (483)
Q Consensus 39 ~Cy~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~~l~~~~~~~~~~~~~~wig~~~~~~~~Wig~~~~~~~~w~~~g 118 (483)
.||+++... ++|.+|+..|+.+||+|++|++.+|++||..++.. .....||+|+.+..
T Consensus 1 ~Cy~~~~~~-~~~~~A~~~C~~~~~~L~~~~~~~e~~~i~~~~~~--~~~~~~wvg~~~~~------------------- 58 (116)
T cd00037 1 SCYKFSTEK-LTWEEAQEYCRSLGGHLASIHSEEENDFLASLLKK--SSSSDVWIGLNDLS------------------- 58 (116)
T ss_pred CCEEEcCCc-cCHHHHHHHHHHcCCEEcccCCHHHHHHHHHHHhC--CCCCCEEEcccccC-------------------
Confidence 499999994 99999999999999999999999999999999865 24455666665432
Q ss_pred cCCCCCccccCCCCc-cccccccCCC-CCCCcccCCeEEEecccccccccccccCCCCceeeeeec
Q psy14885 119 TQQSPNLWVNEDGTN-LNELDAAFLP-EPADNVQRDYLAYSFSQSLKRWGFERVTGMEPLLFICEA 182 (483)
Q Consensus 119 ~~~~~~~~~W~dg~~-~~~~~~~w~~-~P~~~~~~~Cv~~~~~~~~~~W~~~~C~~~~~~~fvCe~ 182 (483)
....|.|.||+. ..+ .+|.+ +|.....++|+.+.... ...|.+.+|. ...+||||+
T Consensus 59 ---~~~~~~~~~~~~~~~~--~~w~~~~~~~~~~~~C~~~~~~~-~~~~~~~~C~--~~~~~iC~~ 116 (116)
T cd00037 59 ---SEGTWKWSDGSPLVDY--TNWAPGEPNPGGSEDCVVLSSSS-DGKWNDVSCS--SKLPFICEK 116 (116)
T ss_pred ---cCCCeEecCCCccccc--cCCCCCCcCCCCCCCeeEEccCC-CCCccCCCCC--CCceeeecC
Confidence 367899999998 433 77988 77444578999997654 2469999999 999999985
No 32
>PRK07708 hypothetical protein; Validated
Probab=99.60 E-value=1.8e-14 Score=134.17 Aligned_cols=122 Identities=18% Similarity=0.214 Sum_probs=94.0
Q ss_pred ceEe-eCCCCC--CCceEEEEEeC--CCEEE------eCCCCCcchHHHHHHHHHHHHHhHhcCCce--EEEEeccHHHH
Q psy14885 221 DVVF-DLSKRN--SNTGFAFIINN--ENFQY------RLPHEASIFTAEAIAIFYCIKKISDMKIKK--AIIFSDNKSVL 287 (483)
Q Consensus 221 ~~~~-DgS~~~--~~~G~g~~~~~--~~~~~------~l~~~~s~~~AEl~ai~~Al~~~~~~~~~~--~~I~tDs~~~~ 287 (483)
.++| |||..+ +.+|+|+++.. +...+ .+...+||++||+.|++.||+.+.+.+.+. |.|++||+.++
T Consensus 74 ~~vY~DGs~~~n~g~aG~GvVI~~~~g~~~~~~~~~~~l~~~~TNN~AEy~Ali~aL~~A~e~g~~~~~V~I~~DSqlVi 153 (219)
T PRK07708 74 ILVYFDGGFDKETKLAGLGIVIYYKQGNKRYRIRRNAYIEGIYDNNEAEYAALYYAMQELEELGVKHEPVTFRGDSQVVL 153 (219)
T ss_pred EEEEEeeccCCCCCCcEEEEEEEECCCCEEEEEEeeccccccccCcHHHHHHHHHHHHHHHHcCCCcceEEEEeccHHHH
Confidence 4555 999864 88999999964 22332 345568999999999999999999877654 89999999999
Q ss_pred HhhcccCCc-chHHHHHHHHHHHHhhC-CCceEEEEeccCCCCcchHHHhHHHHhhhcCCC
Q psy14885 288 NGIDNIQQR-NNIIQLIKQEYYFASTN-GSQISFMWIPSHSNIALNDKADQLAKNSINSKL 346 (483)
Q Consensus 288 ~~l~~~~~~-~~~~~~i~~~~~~~~~~-~~~v~~~WvpgH~g~~gNe~AD~lAk~a~~~~~ 346 (483)
+.+.+.++. ++.+..+.+.+..+.++ ...+.|.||| .+.|+.||+||+.|++...
T Consensus 154 ~qi~g~wk~~~~~l~~y~~~i~~l~~~~~l~~~~~~Vp----R~~N~~AD~LAk~Al~~~~ 210 (219)
T PRK07708 154 NQLAGEWPCYDEHLNHWLDRIEQKLKQLKLTPVYEPIS----RKQNKEADQLATQALEGTV 210 (219)
T ss_pred HHhCCCceeCChhHHHHHHHHHHHHhhCCceEEEEECC----chhhhHHHHHHHHHHhcCC
Confidence 999998763 45555555555554443 3458889998 5689999999999998543
No 33
>PF00059 Lectin_C: Lectin C-type domain; InterPro: IPR001304 Lectins occur in plants, animals, bacteria and viruses. Initially described for their carbohydrate-binding activity [], they are now recognised as a more diverse group of proteins, some of which are involved in protein-protein, protein-lipid or protein-nucleic acid interactions []. There are at least twelve structural families of lectins: C-type lectins, which are Ca+-dependent. S-type (galectins), a widespread family of glycan-binding proteins []. I-type, which have an immunoglobulin-like fold and can recognise sialic acids, other sugars and glycosaminoglycans []. P-type, which bind phosphomannosyl receptors []. Pentraxins []. (Trout) egg lectins. Calreticulin and calnexin, which act as molecular chaperones of the endoplasmic reticulum []. ERGIC-53 and VIP-36 []. Discoidins []. Eel aggutinins (fucolectins) []. Annexin lectins []. Fibrinogen-type lectins, which includes ficolins, tachylectins 5A and 5B, and Limax flavus (Spotted garden slug) agglutinin (these proteins have clear distinctions from one another, but they share a homologous fibrinogen-like domain used for carbohydrate binding). Also unclassified orphan lectins, including amphoterin, Cel-II, complement factor H, thrombospondin, sailic acid-binding lectins, adherence lectin, and cytokins (such as tumour necrosis factor and several interleukins). C-type lectins can be further divided into seven subgroups based on additional non-lectin domains and gene structure: (I) hyalectans, (II) asialoglycoprotein receptors, (III) collectins, (IV) selectins, (V) NK group transmembrane receptors, (VI) macrophage mannose receptors, and (VII) simple (single domain) lectins []. Therefore, lectins are a diverse group of proteins, both in terms of structure and activity. Carbohydrate binding ability may have evolved independently and sporadically in numerous unrelated families, where each evolved a structure that was conserved to fulfil some other activity and function. In general, animal lectins act as recognition molecules within the immune system, their functions involving defence against pathogens, cell trafficking, immune regulation and the prevention of autoimmunity [].; GO: 0005488 binding; PDB: 1T8D_A 2H2T_B 1T8C_A 2H2R_A 1TN3_A 1RJH_A 1HTN_A 3G8K_B 2E3X_B 1UMR_D ....
Probab=99.52 E-value=3.7e-14 Score=116.80 Aligned_cols=101 Identities=24% Similarity=0.383 Sum_probs=78.2
Q ss_pred CCHHHHHHHhhhcCCcccccCCHhHHHHHHHHhhhcCCCCCceeeecccCCCCcccccCcccccccccCCcCCCCCcccc
Q psy14885 49 KTRNDAKLNCKSLDSDLANVNDADEHGFIMYQLFWQDPQRRKWYFGGTQQSPNLWVNEDDPQRRKWYFGGTQQSPNLWVN 128 (483)
Q Consensus 49 ~tw~~A~~~C~~~g~~L~~i~s~~e~~~l~~~~~~~~~~~~~~wig~~~~~~~~Wig~~~~~~~~w~~~g~~~~~~~~~W 128 (483)
++|.+|+.+|+++||+|+.|.+.+|++||..++. ....||||+ . ......|.|
T Consensus 3 ~~~~~A~~~C~~~~~~L~~i~~~~e~~~i~~~~~----~~~~~Wig~-~----------------------~~~~~~~~w 55 (105)
T PF00059_consen 3 MTWEEAQQYCQSMGAHLASINSEEENDFIQSQLK----SNESYWIGL-D----------------------SDNNGTWKW 55 (105)
T ss_dssp EEHHHHHHHHHHTTSEEB-GSSHHHHHHHHHHHH----SSSEEEEEE-E----------------------SSSTSEEEE
T ss_pred CCHHHHHHHHhcCCCEEeEeCCHHHhhhhhhccc----ccceeeeee-e----------------------ccccceecc
Confidence 7899999999999999999999999999999987 556777777 1 114678899
Q ss_pred CCCCccccc--cccCCCCCCCcccCCeEEEecccccccccccccCCCCceeeeeec
Q psy14885 129 EDGTNLNEL--DAAFLPEPADNVQRDYLAYSFSQSLKRWGFERVTGMEPLLFICEA 182 (483)
Q Consensus 129 ~dg~~~~~~--~~~w~~~P~~~~~~~Cv~~~~~~~~~~W~~~~C~~~~~~~fvCe~ 182 (483)
.||...... ..+| + .....++|+.+. ....+.|.+.+|. ..++|||||
T Consensus 56 ~~~~~~~~~~~~~~w-~--~~~~~~~C~~~~-~~~~~~w~~~~C~--~~~~fiCek 105 (105)
T PF00059_consen 56 IDGSPNSPENFYTNW-N--PPNDSENCAYIY-YSSSGKWNDVPCS--EKYPFICEK 105 (105)
T ss_dssp TTSSBSSSTTBSGCB-S--SGGSSEEEEEEG-CSTTTEEEEEETT--SEEEEEEEE
T ss_pred ccCCCcccccccccc-c--cCCCCCCeEEEE-EcCCCeEEeeCCC--CCeeEEeeC
Confidence 999876542 2456 2 222358999993 2333579999999 999999996
No 34
>PRK07238 bifunctional RNase H/acid phosphatase; Provisional
Probab=99.50 E-value=2.5e-13 Score=138.35 Aligned_cols=122 Identities=16% Similarity=0.249 Sum_probs=99.1
Q ss_pred cceEe-eCCCCC--CCceEEEEEeCC--C-E--EE-eCCCCCcchHHHHHHHHHHHHHhHhcCCceEEEEeccHHHHHhh
Q psy14885 220 TDVVF-DLSKRN--SNTGFAFIINNE--N-F--QY-RLPHEASIFTAEAIAIFYCIKKISDMKIKKAIIFSDNKSVLNGI 290 (483)
Q Consensus 220 ~~~~~-DgS~~~--~~~G~g~~~~~~--~-~--~~-~l~~~~s~~~AEl~ai~~Al~~~~~~~~~~~~I~tDs~~~~~~l 290 (483)
..++| |||..+ |.+|+|+++.+. + . .. .....+||+.||+.|++.||+.+.+.+.+.+.|++||+.+++.+
T Consensus 2 ~~~i~~DGa~~~n~g~aG~G~vi~~~~~~~~~~~~~~~~~~~tnn~AE~~All~gL~~a~~~g~~~v~i~~DS~lvi~~i 81 (372)
T PRK07238 2 KVVVEADGGSRGNPGPAGYGAVVWDADRGEVLAERAEAIGRATNNVAEYRGLIAGLEAAAELGATEVEVRMDSKLVVEQM 81 (372)
T ss_pred eEEEEecCCCCCCCCceEEEEEEEeCCCCcEEEEeecccCCCCchHHHHHHHHHHHHHHHhCCCCeEEEEeCcHHHHHHh
Confidence 34566 999987 789999999763 2 1 21 12347899999999999999999988889999999999999999
Q ss_pred cccCC-cchHHHHHHHHHHHHhhCCCceEEEEeccCCCCcchHHHhHHHHhhhcCC
Q psy14885 291 DNIQQ-RNNIIQLIKQEYYFASTNGSQISFMWIPSHSNIALNDKADQLAKNSINSK 345 (483)
Q Consensus 291 ~~~~~-~~~~~~~i~~~~~~~~~~~~~v~~~WvpgH~g~~gNe~AD~lAk~a~~~~ 345 (483)
.+.+. .+.-+..+.+.+..+.++...+++.|||+ ++|+.||.||+.|+...
T Consensus 82 ~~~~~~~~~~l~~~~~~i~~l~~~f~~~~i~~v~r----~~N~~AD~LA~~a~~~~ 133 (372)
T PRK07238 82 SGRWKVKHPDMKPLAAQARELASQFGRVTYTWIPR----ARNAHADRLANEAMDAA 133 (372)
T ss_pred CCCCccCChHHHHHHHHHHHHHhcCCceEEEECCc----hhhhHHHHHHHHHHHhh
Confidence 98765 44555566666777778889999999995 69999999999997553
No 35
>PHA02911 C-type lectin-like protein; Provisional
Probab=99.38 E-value=1.9e-12 Score=114.25 Aligned_cols=108 Identities=18% Similarity=0.190 Sum_probs=77.4
Q ss_pred cCCCcccccccCCeEEEEecCCCCCH--HHHHHHhhhcCCcccccCCHhHHHHHHHHhhhcCCCCCceeeecccCCCCcc
Q psy14885 26 ELRCPQHWVQYRDSCYRFVKSPLKTR--NDAKLNCKSLDSDLANVNDADEHGFIMYQLFWQDPQRRKWYFGGTQQSPNLW 103 (483)
Q Consensus 26 ~~~C~~gw~~~~~~Cy~~~~~~~~tw--~~A~~~C~~~g~~L~~i~s~~e~~~l~~~~~~~~~~~~~~wig~~~~~~~~W 103 (483)
.++||.+|..|+++||+|+.. +.+| ..|+++|..+||+|+. ++.+|++||+.+.. |..
T Consensus 100 ~~pC~~nW~~H~nkCYlFS~~-s~sWsW~~Srr~C~~kGATLLk-~sdEELeFIqn~wk-----------Gks------- 159 (213)
T PHA02911 100 EPICSRDWINHMGICLLSLGE-EVGFRMEIAKRFCEKKDADLIG-KIDEEKKALENIWT-----------GND------- 159 (213)
T ss_pred CCCCchhhHhhCCEEEEEecc-ccccchhHHHHHHHhcCCEecc-CcHHHHHHHHHHHh-----------ccc-------
Confidence 456889999999999999988 3555 9999999999999999 77799999997442 221
Q ss_pred cccCcccccccccCCcCCCCCccccCCCCccccccccCCCCCCCcccCCeEEEecc--cccccccccccCCCCceeeeee
Q psy14885 104 VNEDDPQRRKWYFGGTQQSPNLWVNEDGTNLNELDAAFLPEPADNVQRDYLAYSFS--QSLKRWGFERVTGMEPLLFICE 181 (483)
Q Consensus 104 ig~~~~~~~~w~~~g~~~~~~~~~W~dg~~~~~~~~~w~~~P~~~~~~~Cv~~~~~--~~~~~W~~~~C~~~~~~~fvCe 181 (483)
..|.|+||.... +.. .| ..+|+.+... ...+...+.+|. ....+|||
T Consensus 160 --------------------~~~fWID~r~ng---St~--n~----s~sCA~Is~~~~~~~~~V~sesCs--~~~~wICq 208 (213)
T PHA02911 160 --------------------HSRFWIDNRAAA---STF--DP----VNECAIGTQNHIPEVPEVLKSPCD--ERHSFICI 208 (213)
T ss_pred --------------------cceEEecCcccc---cee--cC----CCCeEEEEcccccCCCceEccccC--CCceEEEE
Confidence 123333332100 000 11 4689999763 122357889999 99999999
Q ss_pred cch
Q psy14885 182 ASI 184 (483)
Q Consensus 182 ~~~ 184 (483)
|..
T Consensus 209 K~~ 211 (213)
T PHA02911 209 KKD 211 (213)
T ss_pred ecc
Confidence 973
No 36
>PF13456 RVT_3: Reverse transcriptase-like; PDB: 3ALY_A 2EHG_A 3HST_B.
Probab=99.29 E-value=1.3e-11 Score=98.31 Aligned_cols=86 Identities=24% Similarity=0.280 Sum_probs=70.3
Q ss_pred chHHHHHHHHHHHHHhHhcCCceEEEEeccHHHHHhhcccCCcchHHHHHHHHHHHHhhCCCceEEEEeccCCCCcchHH
Q psy14885 254 IFTAEAIAIFYCIKKISDMKIKKAIIFSDNKSVLNGIDNIQQRNNIIQLIKQEYYFASTNGSQISFMWIPSHSNIALNDK 333 (483)
Q Consensus 254 ~~~AEl~ai~~Al~~~~~~~~~~~~I~tDs~~~~~~l~~~~~~~~~~~~i~~~~~~~~~~~~~v~~~WvpgH~g~~gNe~ 333 (483)
+..||+.|+..||+.+.+.+.+++.|.|||+.+++.|++.......+..+...+..+.+....+.|.||| .++|..
T Consensus 1 ~~~aE~~al~~al~~a~~~g~~~i~v~sDs~~vv~~i~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~i~----r~~N~~ 76 (87)
T PF13456_consen 1 PLEAEALALLEALQLAWELGIRKIIVESDSQLVVDAINGRSSSRSELRPLIQDIRSLLDRFWNVSVSHIP----REQNKV 76 (87)
T ss_dssp HHHHHHHHHHHHHHHHHCCT-SCEEEEES-HHHHHHHTTSS---SCCHHHHHHHHHHHCCCSCEEEEE------GGGSHH
T ss_pred CcHHHHHHHHHHHHHHHHCCCCEEEEEecCccccccccccccccccccccchhhhhhhccccceEEEEEC----hHHhHH
Confidence 4689999999999999999999999999999999999987554335666677777788889999999999 889999
Q ss_pred HhHHHHhhhc
Q psy14885 334 ADQLAKNSIN 343 (483)
Q Consensus 334 AD~lAk~a~~ 343 (483)
||.|||.|.+
T Consensus 77 A~~LA~~a~~ 86 (87)
T PF13456_consen 77 ADALAKFALS 86 (87)
T ss_dssp HHHHHHHHHH
T ss_pred HHHHHHHHhh
Confidence 9999999864
No 37
>PF05473 Herpes_UL45: UL45 protein; InterPro: IPR008646 This family consists several UL45 proteins and homologues found in the herpes simplex virus family. The herpes simplex virus UL45 gene encodes an 18 kDa virion envelope protein whose function remains unknown. It has been suggested that the 18 kDa UL45 gene product is required for efficient growth in the central nervous system at low doses and may play an important role under the conditions of a naturally acquired infection []. The Equine herpesvirus 1 UL45 protein represents a type II membrane glycoprotein which has found to be non-essential for EHV-1 growth in vitro but deletion reduces the viruses' replication efficiency [].
Probab=98.91 E-value=6.8e-09 Score=95.32 Aligned_cols=55 Identities=42% Similarity=0.738 Sum_probs=48.4
Q ss_pred cCCCcccccccCCeEEEEecCCCC-CHHHHHHHhhhcCCcccccCCHhHHHHHHHHh
Q psy14885 26 ELRCPQHWVQYRDSCYRFVKSPLK-TRNDAKLNCKSLDSDLANVNDADEHGFIMYQL 81 (483)
Q Consensus 26 ~~~C~~gw~~~~~~Cy~~~~~~~~-tw~~A~~~C~~~g~~L~~i~s~~e~~~l~~~~ 81 (483)
...||+||+.|++.||+++..+ + +|.+|+..|+.+++.|+.+++......+....
T Consensus 82 ~~~CP~~Wi~~~~~Cy~~~~~~-~~t~~eA~~~C~~~~s~L~~~~~~~~L~~ll~~~ 137 (200)
T PF05473_consen 82 CGPCPKGWIGYNNSCYRFSNSP-KKTWEEARNICAAYNSTLANVNNAKSLLELLDVL 137 (200)
T ss_pred CCCCCccceeeCCEEEEEeCCC-CcCHHHHHHHHHhcCCcCCCchhHHHHHHHHHHh
Confidence 7889999999999999999997 7 99999999999999999998877765555443
No 38
>PHA02672 ORF110 EEV glycoprotein; Provisional
Probab=97.56 E-value=0.00025 Score=59.90 Aligned_cols=67 Identities=19% Similarity=0.311 Sum_probs=53.3
Q ss_pred cCCCcccccccCCeEEEEecCCCCCHHHHHHHhhhcCCcccccCCHhHHHHHHHHhhhcCCCCCceeeecccC
Q psy14885 26 ELRCPQHWVQYRDSCYRFVKSPLKTRNDAKLNCKSLDSDLANVNDADEHGFIMYQLFWQDPQRRKWYFGGTQQ 98 (483)
Q Consensus 26 ~~~C~~gw~~~~~~Cy~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~~l~~~~~~~~~~~~~~wig~~~~ 98 (483)
..-|++||.+|.+.||.-..- +.+=+.|...|.++.|+|..+-+..-.+-|..+. ....||+|-++.
T Consensus 47 ~~AC~~GWvp~dd~CyLnT~~-q~s~~~A~~iC~~~~a~lP~~pn~~hLKgVm~lt-----~~r~FW~thh~~ 113 (166)
T PHA02672 47 KEACENGWVPIKDLCVLNTHV-ITNVTLAHDICASMDGDPPATPNTMLLKGIMFLT-----GERSFWMTHHDA 113 (166)
T ss_pred hhhhhcCCeeccCEEEEecce-eecccHHHHHHHhccCCCCCCCCHhHhhheeeEe-----cccEEEEEcccc
Confidence 456999999999999998888 5999999999999999999977665555444443 556777776643
No 39
>PHA03093 EEV glycoprotein; Provisional
Probab=96.91 E-value=0.0013 Score=58.04 Aligned_cols=44 Identities=39% Similarity=0.551 Sum_probs=36.8
Q ss_pred ccCCeEEEEecCCCCCHHHHHHHhhhcCCcccccCCHhHHHHHHHHh
Q psy14885 35 QYRDSCYRFVKSPLKTRNDAKLNCKSLDSDLANVNDADEHGFIMYQL 81 (483)
Q Consensus 35 ~~~~~Cy~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~~l~~~~ 81 (483)
.|++.||.++.+| .+|++|...|...|+.|..-+- ...+|..++
T Consensus 105 ~~~~~C~~~~~ep-kTf~dA~~~C~~~g~~LPs~~l--~~~WL~dYL 148 (185)
T PHA03093 105 VYDGSCYIFHSEP-KTFSDAKADCAKKSSTLPNSNL--MTTWLSDYL 148 (185)
T ss_pred ecCCEeEEecCCC-cCHHHHHHHHHhcCCcCCCcch--HHHHHHHHh
Confidence 5688999999998 9999999999999999988332 223777777
No 40
>PHA02673 ORF109 EEV glycoprotein; Provisional
Probab=96.86 E-value=0.0038 Score=53.80 Aligned_cols=46 Identities=20% Similarity=0.344 Sum_probs=38.0
Q ss_pred cccCCeEEEEecCCCCCHHHHHHHhhhcCCcccccCCHhHHHHHHHHh
Q psy14885 34 VQYRDSCYRFVKSPLKTRNDAKLNCKSLDSDLANVNDADEHGFIMYQL 81 (483)
Q Consensus 34 ~~~~~~Cy~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~~l~~~~ 81 (483)
+.++++||.++ .| .+|++|...|.+.|+.|.+..-..+..+|..++
T Consensus 80 I~~~~~Cltl~-~p-~tf~eAn~~C~~~g~~LPs~~l~~~~pWL~dYL 125 (161)
T PHA02673 80 INAGNKCLTLK-YP-DTWTNANERCKELGQRLPSPSINKKYPWLTDYL 125 (161)
T ss_pred cccCCeeEEeC-CC-CcHHHHHHHHHhcCCcCCCCcccccCcHHHHHh
Confidence 46688999999 87 899999999999999999876555555666665
No 41
>PF05966 Chordopox_A33R: Chordopoxvirus A33R protein; InterPro: IPR009238 This family consists of several Chordopoxvirus A33R proteins. A33R plays a role in promoting Ab-resistant cell-to-cell spread of virus [] and interacts with A36R to incorporate the protein into the outer membrane of intracellular enveloped virions (IEV) [].; PDB: 3K7B_A.
Probab=95.82 E-value=0.003 Score=56.37 Aligned_cols=33 Identities=24% Similarity=0.561 Sum_probs=26.5
Q ss_pred cccCCeEEEEecCCCCCHHHHHHHhhhcCCcccc
Q psy14885 34 VQYRDSCYRFVKSPLKTRNDAKLNCKSLDSDLAN 67 (483)
Q Consensus 34 ~~~~~~Cy~~~~~~~~tw~~A~~~C~~~g~~L~~ 67 (483)
+.+++.||.+..+| .+|++|...|.+.|++|.+
T Consensus 108 I~~~n~Cl~~~~~p-~T~~~A~~~C~~~g~~LPs 140 (190)
T PF05966_consen 108 IVYDNKCLTLNYEP-KTFDEANSDCNNKGQTLPS 140 (190)
T ss_dssp EEETTEEEEEEEEE-EEHHHHHHHHHHTT-B---
T ss_pred cccCCEEEEecCCC-CCHHHHHHHHHhcCCcCCC
Confidence 45678999999997 8999999999999999987
No 42
>PF13966 zf-RVT: zinc-binding in reverse transcriptase
Probab=94.01 E-value=0.043 Score=43.28 Aligned_cols=43 Identities=30% Similarity=0.501 Sum_probs=30.8
Q ss_pred eeeeecCcccc-c--cccccCCCCCCce-ecCC-CCCccccc-cccCcH
Q psy14885 403 LTRLRIGHTRI-T--HSYLFTKTPHPIC-TCGF-PLTVKHIF-ECNKYK 445 (483)
Q Consensus 403 l~qlrtGH~~l-~--h~~~~~~~~~p~C-~Cg~-~et~~Hil-~Cp~~~ 445 (483)
+.-.|.-|.++ + -+.+.+...++.| .|+. +||++|+| .||...
T Consensus 35 ~f~W~~~~~rL~t~~~l~~r~~~~~~~C~~C~~~~Et~~Hlf~~C~~a~ 83 (86)
T PF13966_consen 35 FFLWRALHNRLPTKDNLQRRGIQVDPICPLCGNEEETIEHLFFHCPFAR 83 (86)
T ss_pred EEeeeeccccchhhhhhhccCCccCCccccCCCccccccceeccCcCcc
Confidence 44455555565 2 3555667788999 9997 89999996 999753
No 43
>cd03519 Link_domain_HAPLN_module_2 Link_domain_HAPLN_module_2; this link domain is found in the second link module of proteins similar to the vertebrate HAPLN (hyaluronan/HA and proteoglycan binding link) protein family which includes cartilage link protein. The link domain is a HA-binding domain. HAPLNs contain two contiguous link modules. Both link modules of cartilage link protein are involved in interaction with HA. In cartilage, a chondroitin sulfate proteoglycan core protein (CSPG) aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggregates with other CSPGs substituting for aggregan may contribute to the structural integrity of many different tissues. Members of the vertebrate HAPLN gene family are physically linked adjacent to CSPG genes.
Probab=86.10 E-value=0.55 Score=37.05 Aligned_cols=25 Identities=20% Similarity=0.220 Sum_probs=22.2
Q ss_pred ecCCCCCHHHHHHHhhhcCCcccccC
Q psy14885 44 VKSPLKTRNDAKLNCKSLDSDLANVN 69 (483)
Q Consensus 44 ~~~~~~tw~~A~~~C~~~g~~L~~i~ 69 (483)
.+. +.||.+|++.|+..|++|+++.
T Consensus 7 ~~~-~l~f~eA~~aC~~~ga~lAs~~ 31 (91)
T cd03519 7 HPG-KLTFSEAVAACQRDGAQIAKVG 31 (91)
T ss_pred Ccc-ccCHHHHHHHHHHcCCEeCCHH
Confidence 456 6999999999999999999884
No 44
>cd03520 Link_domain_CSPGs_modules_2_4 Link_domain_CSPGs_modules_2_4; this link domain is found in the second and fourth link modules of the chondroitin sulfate proteoglycan core protein (CSPG) aggrecan and, in the second link module of three other CSPGs: versican, neurocan, and brevican. The link domain is a hyaluronan (HA)-binding domain. CSPGs are characterized by an N-terminal globular domain (G1 domain) containing two contiguous link modules (modules 1 and 2). Both link modules of the G1 domain of aggrecan are involved in interaction with HA. Aggrecan in addition contains a second globular domain (G2) having link modules 3 and 4 which lack HA-binding activity. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggregates having other CSPGs substituting for aggregan may contribute to the structural integrity of many different tissues. Members of the vertebrate HPLN (hyaluronan/HA and
Probab=79.19 E-value=1.7 Score=34.71 Aligned_cols=22 Identities=32% Similarity=0.427 Sum_probs=20.5
Q ss_pred CCCHHHHHHHhhhcCCcccccC
Q psy14885 48 LKTRNDAKLNCKSLDSDLANVN 69 (483)
Q Consensus 48 ~~tw~~A~~~C~~~g~~L~~i~ 69 (483)
+.|+++|+..|+..|++||+..
T Consensus 10 ~l~f~eA~~aC~~~ga~lAs~~ 31 (96)
T cd03520 10 KFTFQEARAECRSLGAVLATTG 31 (96)
T ss_pred CcCHHHHHHHHHHcCCEeCCHH
Confidence 6999999999999999999874
No 45
>smart00445 LINK Link (Hyaluronan-binding).
Probab=78.06 E-value=1.4 Score=35.13 Aligned_cols=22 Identities=27% Similarity=0.382 Sum_probs=20.5
Q ss_pred CCCHHHHHHHhhhcCCcccccC
Q psy14885 48 LKTRNDAKLNCKSLDSDLANVN 69 (483)
Q Consensus 48 ~~tw~~A~~~C~~~g~~L~~i~ 69 (483)
+.|+++|++.|+..|++||++.
T Consensus 14 ~l~f~eA~~aC~~~ga~lAs~~ 35 (94)
T smart00445 14 KLTFAEAREACRAQGATLATVG 35 (94)
T ss_pred ccCHHHHHHHHHHcCCEeCCHH
Confidence 5899999999999999999884
No 46
>cd01102 Link_Domain The link domain is a hyaluronan (HA)-binding domain. It functions to mediate adhesive interactions during inflammatory leukocyte homing and tumor metastasis. It is found in the CD44 receptor and in human TSG-6. TSG-6 is the protein product of the tumor necrosis factor-stimulated gene-6. TSG-6 has a strong anti-inflammatory effect in models of acute inflammation and autoimmune arthritis and plays an essential role in female fertility. This group also contains the link domains of the chondroitin sulfate proteoglycan core proteins (CSPG) including aggrecan, versican, neurocan, and brevican and the link domains of the vertebrate HAPLN (HA and proteoglycan binding link) protein family. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggregates in which other CSPGs substitute for aggregan might contribute to the structural integrity of many different tissues. Members of
Probab=77.47 E-value=1.4 Score=35.14 Aligned_cols=22 Identities=32% Similarity=0.363 Sum_probs=20.5
Q ss_pred CCCHHHHHHHhhhcCCcccccC
Q psy14885 48 LKTRNDAKLNCKSLDSDLANVN 69 (483)
Q Consensus 48 ~~tw~~A~~~C~~~g~~L~~i~ 69 (483)
+.|+.+|++.|+..|++|++..
T Consensus 13 ~l~f~eA~~aC~~~ga~lAs~~ 34 (92)
T cd01102 13 KLTFAEAALACKARGAHLATPG 34 (92)
T ss_pred ccCHHHHHHHHHHcCCEeCCHH
Confidence 6899999999999999999884
No 47
>KOG1812|consensus
Probab=74.96 E-value=9.3 Score=39.10 Aligned_cols=111 Identities=17% Similarity=0.182 Sum_probs=69.3
Q ss_pred CCceEEEEEeCCC-E----E-E-eCC--CCCcchHHHHHHHHHHHHHhHhcCCceEEEEeccHHHHHhhcccCCcch--H
Q psy14885 231 SNTGFAFIINNEN-F----Q-Y-RLP--HEASIFTAEAIAIFYCIKKISDMKIKKAIIFSDNKSVLNGIDNIQQRNN--I 299 (483)
Q Consensus 231 ~~~G~g~~~~~~~-~----~-~-~l~--~~~s~~~AEl~ai~~Al~~~~~~~~~~~~I~tDs~~~~~~l~~~~~~~~--~ 299 (483)
..+|.|+.+.+.+ . . . ... ...+...||++|+..+|..+.+.+...+.+++|+.-....+.+...... .
T Consensus 17 ~~~g~~vai~d~~d~~~~f~~k~~~~~~~~~~~~~ae~~al~~~l~ea~~~~~~~~~~~~d~~~~~~~v~~~~~~~~~~~ 96 (384)
T KOG1812|consen 17 LLAGFGVAICDEHDDDLLFQMKASDHDSDSITPLEAELMALKRGLTEALELGLNHIVIYCDDELIYESVAGREKPEQHRK 96 (384)
T ss_pred hcccCceeeeccccHHHHHHhhcCcccccccchhhHHHHHHhhccHHHHhhccccceEecccHHHHHHHhhhhhHHHHHH
Confidence 4678888886632 1 1 1 111 2367899999999999999999999999999997666665555444332 4
Q ss_pred HHHHHHHHHHHhhCCCceEEEEeccCCCCcchHHHhHHHHhhhcC
Q psy14885 300 IQLIKQEYYFASTNGSQISFMWIPSHSNIALNDKADQLAKNSINS 344 (483)
Q Consensus 300 ~~~i~~~~~~~~~~~~~v~~~WvpgH~g~~gNe~AD~lAk~a~~~ 344 (483)
+..+.+....++.+.....-+-||- ......|=+||+++..+
T Consensus 97 ~~~l~~~v~~~r~~l~~~~~i~~~~---~~d~~~~~~lA~e~i~s 138 (384)
T KOG1812|consen 97 IVLLVELVQRIREQLTSSEPILVPK---NADIKFAYKLAREAIVS 138 (384)
T ss_pred HHHHHHHHHHHHHHhhcccceeccc---chhhHHHHHHHHHhhcc
Confidence 5555555555554443333333331 01123477888888764
No 48
>cd03518 Link_domain_HAPLN_module_1 Link_domain_HAPLN_module_1; this link domain is found in the first link module of proteins similar to the vertebrate HAPLN (hyaluronan/HA and proteoglycan binding link) protein family which includes cartilage link protein. The link domain is a HA-binding domain. HAPLNs contain two contiguous link modules. Both link modules of cartilage link protein are involved in interaction with HA. In cartilage, a chondroitin sulfate proteoglycan core protein (CSPG) aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggregates with other CSPGs substituting for aggregan may contribute to the structural integrity of many different tissues. Members of the vertebrate HAPLN gene family are physically linked adjacent to CSPG genes.
Probab=74.65 E-value=2 Score=34.31 Aligned_cols=22 Identities=23% Similarity=0.357 Sum_probs=20.3
Q ss_pred CCCHHHHHHHhhhcCCcccccC
Q psy14885 48 LKTRNDAKLNCKSLDSDLANVN 69 (483)
Q Consensus 48 ~~tw~~A~~~C~~~g~~L~~i~ 69 (483)
+.|+.+|++.|+..|++|+++.
T Consensus 13 ~l~f~eA~~aC~~~ga~lAs~~ 34 (95)
T cd03518 13 NLNFHEAQQACEEQDATLASFE 34 (95)
T ss_pred ccCHHHHHHHHHHcCCeeCCHH
Confidence 5899999999999999999884
No 49
>cd03515 Link_domain_TSG_6_like This is the extracellular link domain of the type found in human TSG-6. The link domain is a hyaluronan (HA)-binding domain. TSG-6 is the protein product of tumor necrosis factor-stimulated gene-6. TSG-6 is up-regulated in inflammatory lesions and in the ovary during ovulation. It has a strong anti-inflammatory and chondroprotective effect in models of acute inflammation and autoimmune arthritis and plays an essential role in female fertility. Also included in this group are the stabilins: stabilin-1 (FEEL-1, CLEVER-1) and stabilin-2 (FEEL-2). Stabilin-2 functions as the major liver and lymph node-scavenging receptor for HA and related glycosaminoglycans. Stabilin-2 is a scavenger receptor with a broad range of ligands including advanced glycation end (AGE) products, acetylated low density lipoprotein and procollagen peptides. In contrast, stabilin-1 does not bind HA, but binds acetylated low density lipoprotein and AGEs with lower affinity. As AGEs accum
Probab=74.44 E-value=1.9 Score=34.34 Aligned_cols=22 Identities=27% Similarity=0.351 Sum_probs=20.3
Q ss_pred CCCHHHHHHHhhhcCCcccccC
Q psy14885 48 LKTRNDAKLNCKSLDSDLANVN 69 (483)
Q Consensus 48 ~~tw~~A~~~C~~~g~~L~~i~ 69 (483)
+.|+.+|++.|+..|++|+++.
T Consensus 13 ~l~f~eA~~aC~~~ga~lAs~~ 34 (93)
T cd03515 13 KLTYTEAKAACEAEGAHLATYS 34 (93)
T ss_pred ccCHHHHHHHHHHcCCccCCHH
Confidence 5899999999999999999884
No 50
>PF00193 Xlink: Extracellular link domain; InterPro: IPR000538 The link domain [] is a hyaluronan(HA)-binding region found in proteins of vertebrates that are involved in the assembly of extracellular matrix, cell adhesion, and migration. The structure has been shown [] to consist of two alpha helices and two antiparallel beta sheets arranged around a large hydrophobic core similar to that of C-type lectin. This domain contains four conserved cysteines involved in two disulphide bonds. The link domain has also been termed HABM [] (HA binding module) and PTR [] (proteoglycan tandem repeat). Proteins with such a domain include the proteoglycans aggrecan, brevican, neurocan and versican, which are expressed in the CNS; the cartilage link protein (LP), a proteoglycan that together with HA and aggrecan forms multimolecular aggregates; Tumour necrosis factor-inducible protein TSG-6, which may be involved in cell-cell and cell-matrix interactions during inflammation and tumourgenesis; and CD44 antigen, the main cell surface receptor for HA.; GO: 0005540 hyaluronic acid binding, 0007155 cell adhesion; PDB: 1O7B_T 2PF5_C 1O7C_T 2JCQ_A 2JCR_A 2JCP_A 1UUH_B 1POZ_A 2I83_A.
Probab=73.42 E-value=1.7 Score=34.67 Aligned_cols=22 Identities=27% Similarity=0.409 Sum_probs=17.4
Q ss_pred CCCHHHHHHHhhhcCCcccccC
Q psy14885 48 LKTRNDAKLNCKSLDSDLANVN 69 (483)
Q Consensus 48 ~~tw~~A~~~C~~~g~~L~~i~ 69 (483)
..+|.+|+..|+..|++||+..
T Consensus 13 ~l~f~eA~~~C~~~ga~LAs~~ 34 (92)
T PF00193_consen 13 KLTFTEAQQACRALGARLASPE 34 (92)
T ss_dssp SB-HHHHHHHHHHTTCBE--HH
T ss_pred cCcHHHHHHHHHHcCCeeCCHH
Confidence 4899999999999999998874
No 51
>cd03521 Link_domain_KIAA0527_like Link_domain_KIAA0527_like; this domain is found in the human protein KIAA0527. Sequence-wise, it is highly similar to the link domain. The link domain is a hyaluronan-binding (HA) domain. KIAA0527 contains a single link module. The KIAA0527 gene was originally cloned from human brain tissue.
Probab=70.97 E-value=2.8 Score=32.96 Aligned_cols=22 Identities=27% Similarity=0.303 Sum_probs=20.1
Q ss_pred CCCHHHHHHHhhhcCCcccccC
Q psy14885 48 LKTRNDAKLNCKSLDSDLANVN 69 (483)
Q Consensus 48 ~~tw~~A~~~C~~~g~~L~~i~ 69 (483)
+.++.+|++.|++.|++|+++.
T Consensus 13 ~l~f~eA~~AC~~~gA~lAs~~ 34 (95)
T cd03521 13 GLGLRAARQSCASLGARLASAA 34 (95)
T ss_pred ccCHHHHHHHHHHcCCEeccHH
Confidence 5889999999999999999873
No 52
>cd03517 Link_domain_CSPGs_modules_1_3 Link_domain_CSPGs_modules_1_3; this extracellular link domain is found in the first and third link modules of the chondroitin sulfate proteoglycan core protein (CSPG) aggrecan. In addition, it is found in the first link module of three other CSPGs: versican, neurocan, and brevican. The link domain is a hyaluronan (HA)-binding domain. CSPGs are characterized by an N-terminal globular domain (G1 domain) containing two contiguous link modules (modules 1 and 2). Both link modules of the G1 domain of aggrecan are involved in interaction with HA. In addition, aggrecan contains a second globular domain (G2) which contains link modules 3 and 4. G2 appears to lack HA-binding activity. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggregates having other CSPGs substituting for aggrecan may contribute to the structural integrity of many different tissues.
Probab=70.33 E-value=2.2 Score=34.14 Aligned_cols=23 Identities=17% Similarity=0.250 Sum_probs=20.6
Q ss_pred CCCHHHHHHHhhhcCCcccccCC
Q psy14885 48 LKTRNDAKLNCKSLDSDLANVND 70 (483)
Q Consensus 48 ~~tw~~A~~~C~~~g~~L~~i~s 70 (483)
+.|+.+|+..|+..|++|+++..
T Consensus 13 ~l~f~eA~~aC~~~ga~lAs~~Q 35 (95)
T cd03517 13 ALTFPRAQRACLDISAQIATPEQ 35 (95)
T ss_pred eECHHHHHHHHHHcCCEeCCHHH
Confidence 57899999999999999999853
No 53
>PF05380 Peptidase_A17: Pao retrotransposon peptidase ; InterPro: IPR008042 This signature identifies members of the Pao retrotransposon family.
Probab=65.43 E-value=27 Score=30.77 Aligned_cols=50 Identities=24% Similarity=0.297 Sum_probs=31.2
Q ss_pred CcceEe-eCCCCCCCceEEEEEeC---CCEEEeC--------C-CCCcchHHHHHHHHHHHHHhH
Q psy14885 219 PTDVVF-DLSKRNSNTGFAFIINN---ENFQYRL--------P-HEASIFTAEAIAIFYCIKKIS 270 (483)
Q Consensus 219 p~~~~~-DgS~~~~~~G~g~~~~~---~~~~~~l--------~-~~~s~~~AEl~ai~~Al~~~~ 270 (483)
....+| |+|.. ..|+.+++.. +.....+ + ...|+-+-||.|+..|.+++.
T Consensus 80 ~~L~~F~DAS~~--aygavvYlr~~~~~~~~~~ll~aKsrv~P~k~~tIPRlEL~a~~l~~~l~~ 142 (159)
T PF05380_consen 80 VELHVFCDASES--AYGAVVYLRSYSDGSVQVRLLFAKSRVAPLKTVTIPRLELLAALLGVRLAN 142 (159)
T ss_pred eeeeEeeccccc--ceeeEeEeeeccCCceeeeeeeecccccCCCCCcHHHHHHHHHHHHHHHHH
Confidence 356678 99953 3344333322 2222222 1 345999999999999999874
No 54
>PF00336 DNA_pol_viral_C: DNA polymerase (viral) C-terminal domain; InterPro: IPR001462 This domain is at the C terminus of hepatitis B-type viruses P proteins and represents a functional domain that controls the RNase H activities of the protein. The domain is always associated with IPR000201 from INTERPRO and .; GO: 0004523 ribonuclease H activity
Probab=63.41 E-value=8.1 Score=35.21 Aligned_cols=54 Identities=22% Similarity=0.281 Sum_probs=38.4
Q ss_pred CCceEEEEEeCCCEEEeCCCCCcchHHHHHHHHHHHHHhHhcCCceEEEEeccHHHHHh
Q psy14885 231 SNTGFAFIINNENFQYRLPHEASIFTAEAIAIFYCIKKISDMKIKKAIIFSDNKSVLNG 289 (483)
Q Consensus 231 ~~~G~g~~~~~~~~~~~l~~~~s~~~AEl~ai~~Al~~~~~~~~~~~~I~tDs~~~~~~ 289 (483)
-.+|+|+.+........+...-.+++|||+|.-.|.-+.. .++ |.|||..|+..
T Consensus 102 TpTgwgi~i~~~~~~~Tfs~~l~IhtaELlaaClAr~~~~----~r~-l~tDnt~Vlsr 155 (245)
T PF00336_consen 102 TPTGWGISITGQRMRGTFSKPLPIHTAELLAACLARLMSG----ARC-LGTDNTVVLSR 155 (245)
T ss_pred CCCcceeeecCceeeeeecccccchHHHHHHHHHHHhccC----CcE-EeecCcEEEec
Confidence 3567777766655555566667799999999988776652 333 89999887644
No 55
>cd03516 Link_domain_CD44_like This domain is a hyaluronan (HA)-binding domain. It is found in CD44 receptor and mediates adhesive interactions during inflammatory leukocyte homing and tumor metastasis. It also plays an important role in arteriogenesis. The functional HA-binding domain of CD44 is an extended domain comprised of a single link module flanked with N-and C- extensions. These extensions are essential for folding and for functional activity. This group also contains the cell surface retention sequence (CRS) binding protein-1 (CRSBP-1) and lymph vessel endothelial receptor-1 (LYVE-1). CRSBP-1 is a cell surface binding protein for the CRS motif of PDGF-BB (platelet-derived growth factor-BB) and is responsible for the cell surface retention of PDGF-BB in SSV-transformed cells. CRSBP-1 may play a role in autocrine regulation of cell growth mediated by CRS containing growth regulators. LYVE-1 is preferentially expressed on the lymphatic endothelium and is used as a molecular marke
Probab=59.21 E-value=6.1 Score=34.07 Aligned_cols=22 Identities=27% Similarity=0.326 Sum_probs=20.2
Q ss_pred CCCHHHHHHHhhhcCCcccccC
Q psy14885 48 LKTRNDAKLNCKSLDSDLANVN 69 (483)
Q Consensus 48 ~~tw~~A~~~C~~~g~~L~~i~ 69 (483)
+.+|.+|++.|+..|++||++.
T Consensus 18 ~lnf~eA~~aC~~~ga~lAs~~ 39 (144)
T cd03516 18 SLNFTEAKEACRALGLTLASKA 39 (144)
T ss_pred cCCHHHHHHHHHHcCCeeCCHH
Confidence 5899999999999999999874
No 56
>KOG4297|consensus
Probab=46.07 E-value=1.2e+02 Score=23.99 Aligned_cols=41 Identities=20% Similarity=0.380 Sum_probs=34.0
Q ss_pred eEEEEecCCCCCHHHHHHHhhhcCCcccccCCHhHHHHHHHH
Q psy14885 39 SCYRFVKSPLKTRNDAKLNCKSLDSDLANVNDADEHGFIMYQ 80 (483)
Q Consensus 39 ~Cy~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~~l~~~ 80 (483)
.|+..+... .+|..+...|...+++|+.+.+..+..++...
T Consensus 100 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 140 (207)
T KOG4297|consen 100 TCYTSFEWT-DGWSTTGTGCGGQGANLVGVLSVQENNFITSN 140 (207)
T ss_pred cccCchhhc-cCHHHHHHHHHHhCCCcCeeCCHHHHHHHHHh
Confidence 455555442 58999999999999999999999999999865
No 57
>PF08391 Ly49: Ly49-like protein, N-terminal region; InterPro: IPR013600 The sequences making up this entry are annotated as, or are similar to, Ly49 receptors (e.g. P20937 from SWISSPROT). These are type II transmembrane receptors expressed by mouse natural killer (NK) cells. They are classified as being activating (e.g.Ly49D and H) or inhibitory (e.g. Ly49A and G), depending on their effect on NK cell function []. They are members of the C-type lectin receptor superfamily [], and in fact in many family members this region is found immediately N-terminal to a lectin C-type domain (IPR001304 from INTERPRO). ; PDB: 1QO3_D 3C8J_D 1P4L_D 3C8K_D 3G8K_B 1JA3_B 3CAD_A 3G8L_A.
Probab=45.85 E-value=12 Score=31.44 Aligned_cols=19 Identities=37% Similarity=0.943 Sum_probs=16.0
Q ss_pred ccCCCcccccccCCeEEEE
Q psy14885 25 LELRCPQHWVQYRDSCYRF 43 (483)
Q Consensus 25 ~~~~C~~gw~~~~~~Cy~~ 43 (483)
....||.-|..|+.+||+|
T Consensus 101 ~g~~~e~~W~cyg~kCYYF 119 (119)
T PF08391_consen 101 TGKPCEKHWFCYGVKCYYF 119 (119)
T ss_dssp TT-EEECEEEEETTEEEEE
T ss_pred CCCCCCCceeeeCceeeeC
Confidence 3677899999999999997
No 58
>PF07979 Intimin_C: Intimin C-type lectin domain; InterPro: IPR013117 This domain is found at the C terminus of intimin. Its structure has been solved and shown to have a C-lectin type of structure []. Intimin is a bacterial adhesion molecule involved in intimate attachment of enteropathogenic and enterohemorrhagic Escherichia coli to mammalian host cells. Intimin targets the translocated intimin receptor (Tir), which is exported by the bacteria and integrated into the host cell plasma membrane.; GO: 0005488 binding, 0009405 pathogenesis, 0009986 cell surface; PDB: 1CWV_A 2ZQK_B 2ZWK_C 1F02_I 1E5U_I 1F00_I 3NCX_B 3NCW_D.
Probab=45.25 E-value=12 Score=30.26 Aligned_cols=30 Identities=17% Similarity=0.160 Sum_probs=23.6
Q ss_pred CCCHHHHHHHhhhcCCcccccCCHhHHHHHHH
Q psy14885 48 LKTRNDAKLNCKSLDSDLANVNDADEHGFIMY 79 (483)
Q Consensus 48 ~~tw~~A~~~C~~~g~~L~~i~s~~e~~~l~~ 79 (483)
+++|.+|+..|+..+|+|. .+.+|+.-|-.
T Consensus 12 ~~~Y~~A~~~C~~~s~~Lp--sS~~~L~~vy~ 41 (101)
T PF07979_consen 12 RVTYSEAESICQNNSGRLP--SSQSELKDVYN 41 (101)
T ss_dssp EETHHHHHHHTTTTCCESB--SSHHHHHHHHH
T ss_pred eEeHHHHHHHHHhccccCc--ccHHHHHHHHH
Confidence 5899999999999999995 45666555443
No 59
>COG3341 Predicted double-stranded RNA/RNA-DNA hybrid binding protein [General function prediction only]
Probab=43.91 E-value=22 Score=32.91 Aligned_cols=93 Identities=16% Similarity=0.178 Sum_probs=61.9
Q ss_pred CCCcchHHHHHHHHHHHHHhHhcCCceEEE-E----eccHHHHHhhcccCCcchHHHHHHHHHHHHhhCCCceEEEEecc
Q psy14885 250 HEASIFTAEAIAIFYCIKKISDMKIKKAII-F----SDNKSVLNGIDNIQQRNNIIQLIKQEYYFASTNGSQISFMWIPS 324 (483)
Q Consensus 250 ~~~s~~~AEl~ai~~Al~~~~~~~~~~~~I-~----tDs~~~~~~l~~~~~~~~~~~~i~~~~~~~~~~~~~v~~~Wvpg 324 (483)
+..++..+|.+|++.+|+.+...+.++.+| + .||++.++-+.+.....- .....+=....-.+....+.|+.-
T Consensus 104 ~~~~n~s~d~la~ly~~~~~~~~~nrk~~i~y~~~~~ds~a~~k~~k~~~~~k~--~~~~~~e~~~~~~~v~h~~k~i~~ 181 (225)
T COG3341 104 SIKSNDSGDVLAKLYGLRYEVPLDNRKSVINYLTPGNDSWAYFKYVKDKCETKV--NLLKDREFFSIGKGVFHDEKDINI 181 (225)
T ss_pred ccccCchHHHHHHhccccccccccCccceeeccCCcchhHHHHHHHhhhhhhHH--HHHHhHHhhhccchhhhhhhhccc
Confidence 455788899999999999997766666555 6 699999988765321111 111110011112235567789999
Q ss_pred CCCCcchHHHhHHHHhhhcC
Q psy14885 325 HSNIALNDKADQLAKNSINS 344 (483)
Q Consensus 325 H~g~~gNe~AD~lAk~a~~~ 344 (483)
|..+..+..++.++......
T Consensus 182 ~~~~~~~~~~~s~~~~~~k~ 201 (225)
T COG3341 182 HIWIFESKKGNSHVYNTSKK 201 (225)
T ss_pred ccccchhhhhhhhhhchhhh
Confidence 99999998888888766544
No 60
>COG0296 GlgB 1,4-alpha-glucan branching enzyme [Carbohydrate transport and metabolism]
Probab=39.60 E-value=34 Score=37.20 Aligned_cols=72 Identities=13% Similarity=0.068 Sum_probs=44.0
Q ss_pred HHHHHHHHhHhcCCceEEEEeccHHHHHh--------hcccCC--cchHHHHHHHHHHHHhhCCCceEEEEeccCCCCcc
Q psy14885 261 AIFYCIKKISDMKIKKAIIFSDNKSVLNG--------IDNIQQ--RNNIIQLIKQEYYFASTNGSQISFMWIPSHSNIAL 330 (483)
Q Consensus 261 ai~~Al~~~~~~~~~~~~I~tDs~~~~~~--------l~~~~~--~~~~~~~i~~~~~~~~~~~~~v~~~WvpgH~g~~g 330 (483)
++++=|..+.+.+.+.|.+.-=++..-.. .-.+.. +.+ ...+..+....+.|+.|-+-|||+|.+..|
T Consensus 166 ~a~~llpYl~elG~T~IELMPv~e~p~~~sWGYq~~g~yAp~sryGtP--edfk~fVD~aH~~GIgViLD~V~~HF~~d~ 243 (628)
T COG0296 166 LAIELLPYLKELGITHIELMPVAEHPGDRSWGYQGTGYYAPTSRYGTP--EDFKALVDAAHQAGIGVILDWVPNHFPPDG 243 (628)
T ss_pred HHHHHhHHHHHhCCCEEEEcccccCCCCCCCCCCcceeccccccCCCH--HHHHHHHHHHHHcCCEEEEEecCCcCCCCc
Confidence 34445666777787777664322111000 000111 122 456666777888999999999999999988
Q ss_pred hHHH
Q psy14885 331 NDKA 334 (483)
Q Consensus 331 Ne~A 334 (483)
|=++
T Consensus 244 ~~L~ 247 (628)
T COG0296 244 NYLA 247 (628)
T ss_pred chhh
Confidence 8655
No 61
>COG0590 CumB Cytosine/adenosine deaminases [Nucleotide transport and metabolism / Translation, ribosomal structure and biogenesis]
Probab=38.07 E-value=82 Score=27.63 Aligned_cols=29 Identities=17% Similarity=0.323 Sum_probs=20.6
Q ss_pred chHHHHHHHHHHHHHhHhcCCceEEEEec
Q psy14885 254 IFTAEAIAIFYCIKKISDMKIKKAIIFSD 282 (483)
Q Consensus 254 ~~~AEl~ai~~Al~~~~~~~~~~~~I~tD 282 (483)
.-.||..||..|-+.+........++|+=
T Consensus 56 taHAEi~air~a~~~~~~~~l~~~tlyvT 84 (152)
T COG0590 56 TAHAEILAIRAAAETLGNYRLKDCTLYVT 84 (152)
T ss_pred cccHHHHHHHHHHHhhCCCCcCCcEEEEe
Confidence 34899999999999986544444555543
No 62
>KOG0466|consensus
Probab=34.75 E-value=28 Score=34.06 Aligned_cols=36 Identities=25% Similarity=0.496 Sum_probs=27.6
Q ss_pred ccccCCCCCCce-ecCCC---CCcccc-c-cccCcHHHHHhc
Q psy14885 416 SYLFTKTPHPIC-TCGFP---LTVKHI-F-ECNKYKKFREKL 451 (483)
Q Consensus 416 ~~~~~~~~~p~C-~Cg~~---et~~Hi-l-~Cp~~~~~R~~l 451 (483)
++--.+.|+|.| .||.+ .-++|+ | .||-++..=..|
T Consensus 101 s~gS~k~d~~~c~~~g~~~~~klvRHVSfVDCPGHDiLMaTM 142 (466)
T KOG0466|consen 101 SFGSSKEDRPPCDRPGCEGKMKLVRHVSFVDCPGHDILMATM 142 (466)
T ss_pred ccCCCCCCCCCcccCCCCCceEEEEEEEeccCCchHHHHHHH
Confidence 344567889999 88874 459999 8 999998766655
No 63
>PHA02910 hypothetical protein; Provisional
Probab=30.75 E-value=40 Score=27.70 Aligned_cols=37 Identities=27% Similarity=0.560 Sum_probs=27.6
Q ss_pred ccccccCCeEEEEecCCCCCHHHHHHHhhhc-CCcccc
Q psy14885 31 QHWVQYRDSCYRFVKSPLKTRNDAKLNCKSL-DSDLAN 67 (483)
Q Consensus 31 ~gw~~~~~~Cy~~~~~~~~tw~~A~~~C~~~-g~~L~~ 67 (483)
..|+...+.||+-+..|-.++..++-.|+.. ++.|.+
T Consensus 65 hswfchygacykevlmppsnyshskyvceklmnsklpt 102 (171)
T PHA02910 65 HSWFCHYGACYKEVLMPPSNYSHSKYVCEKLMNSKLPT 102 (171)
T ss_pred eeeeeccchhhhhhccCCcccchhHHHHHHHhcccCCc
Confidence 4578888999997655548899999999863 555543
No 64
>COG0295 Cdd Cytidine deaminase [Nucleotide transport and metabolism]
Probab=29.38 E-value=1.3e+02 Score=25.90 Aligned_cols=52 Identities=19% Similarity=0.214 Sum_probs=35.0
Q ss_pred CCceEEEEEeCCCEEEeC----CCCCcchHHHHHHHHHHHHHhHhcCCceEEEEecc
Q psy14885 231 SNTGFAFIINNENFQYRL----PHEASIFTAEAIAIFYCIKKISDMKIKKAIIFSDN 283 (483)
Q Consensus 231 ~~~G~g~~~~~~~~~~~l----~~~~s~~~AEl~ai~~Al~~~~~~~~~~~~I~tDs 283 (483)
=++|+++...++.+-+.. -...--.-||-.||..|+.. -......+.|++|+
T Consensus 26 F~VGAa~~t~~G~i~tG~NiEnasy~~t~CAErsAI~~ais~-G~~~~~~v~v~~~~ 81 (134)
T COG0295 26 FKVGAALRTKDGRIYTGANVENASYGLTVCAERSAIFKAISE-GKRKFDAVVVVADT 81 (134)
T ss_pred CcEEEEEEeCCCCEEEEEeeecccccchhhHHHHHHHHHHHc-CCCcEEEEEEEcCC
Confidence 467777776665443322 13333467999999999887 44456678888997
No 65
>cd01285 nucleoside_deaminase Nucleoside deaminases include adenosine, guanine and cytosine deaminases. These enzymes are Zn dependent and catalyze the deamination of nucleosides. The zinc ion in the active site plays a central role in the proposed catalytic mechanism, activating a water molecule to form a hydroxide ion that performs a nucleophilic attack on the substrate. The functional enzyme is a homodimer. Cytosine deaminase catalyzes the deamination of cytosine to uracil and ammonia and is a member of the pyrimidine salvage pathway. Cytosine deaminase is found in bacteria and fungi but is not present in mammals; for this reason, the enzyme is currently of interest for antimicrobial drug design and gene therapy applications against tumors. Some members of this family are tRNA-specific adenosine deaminases that generate inosine at the first position of their anticodon (position 34) of specific tRNAs; this modification is thought to enlarge the codon recognition capacity during protei
Probab=26.25 E-value=2.1e+02 Score=23.23 Aligned_cols=32 Identities=13% Similarity=0.227 Sum_probs=22.9
Q ss_pred CcchHHHHHHHHHHHHHhHhcCCceEEEEecc
Q psy14885 252 ASIFTAEAIAIFYCIKKISDMKIKKAIIFSDN 283 (483)
Q Consensus 252 ~s~~~AEl~ai~~Al~~~~~~~~~~~~I~tDs 283 (483)
.....||..||..+.+..........++|+--
T Consensus 43 ~~~~HAE~~ai~~~~~~~~~~~~~~~~ly~t~ 74 (109)
T cd01285 43 DPTAHAEIVAIRNAARRLGSYLLSGCTLYTTL 74 (109)
T ss_pred CCcccHHHHHHHHHHHHhCCCccCCeEEEEeC
Confidence 35788999999999887543334566777653
No 66
>PRK10860 tRNA-specific adenosine deaminase; Provisional
Probab=25.81 E-value=1.3e+02 Score=26.99 Aligned_cols=29 Identities=14% Similarity=0.141 Sum_probs=19.5
Q ss_pred chHHHHHHHHHHHHHhHhcCCceEEEEec
Q psy14885 254 IFTAEAIAIFYCIKKISDMKIKKAIIFSD 282 (483)
Q Consensus 254 ~~~AEl~ai~~Al~~~~~~~~~~~~I~tD 282 (483)
...||..||..|.+.+........++|+-
T Consensus 60 ~~HAEi~Ai~~a~~~~~~~~l~g~tlY~T 88 (172)
T PRK10860 60 TAHAEIMALRQGGLVLQNYRLLDATLYVT 88 (172)
T ss_pred ccCHHHHHHHHHHHhcCCCCcCCcEEEee
Confidence 46799999999987664333344556654
No 67
>cd01284 Riboflavin_deaminase-reductase Riboflavin-specific deaminase. Riboflavin biosynthesis protein RibD (Diaminohydroxyphosphoribosylaminopyrimidine deaminase) catalyzes the deamination of 2,5-diamino-6-ribosylamino-4(3H)-pyrimidinone 5'-phosphate, which is an intermediate step in the biosynthesis of riboflavin.The ribG gene of Bacillus subtilis and the ribD gene of E. coli are bifunctional and contain this deaminase domain and a reductase domain which catalyzes the subsequent reduction of the ribosyl side chain.
Probab=22.70 E-value=2.4e+02 Score=23.38 Aligned_cols=38 Identities=16% Similarity=0.056 Sum_probs=22.5
Q ss_pred CCceEEEEEeCCCE-EEeCCCCCcchHHHHHHHHHHHHH
Q psy14885 231 SNTGFAFIINNENF-QYRLPHEASIFTAEAIAIFYCIKK 268 (483)
Q Consensus 231 ~~~G~g~~~~~~~~-~~~l~~~~s~~~AEl~ai~~Al~~ 268 (483)
.++|+.++-.++.+ ............||..||..|.+.
T Consensus 19 ~pvGaviv~~~g~iv~~g~n~~~~~~HAE~~ai~~a~~~ 57 (115)
T cd01284 19 PPVGCVIVDDDGEIVGEGYHRKAGGPHAEVNALASAGEK 57 (115)
T ss_pred CCEEEEEEeCCCeEEEEecCCCCCcccHHHHHHHHHhhc
Confidence 45666544332322 222333336788999999998874
Done!