Query psy1807
Match_columns 618
No_of_seqs 648 out of 3229
Neff 8.7
Searched_HMMs 46136
Date Fri Aug 16 21:50:42 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy1807.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/1807hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PHA02927 secreted complement-b 100.0 1.7E-41 3.7E-46 338.8 22.4 234 272-512 15-262 (263)
2 PHA02927 secreted complement-b 100.0 1.3E-38 2.8E-43 318.0 23.2 236 209-454 12-262 (263)
3 PHA02954 EEV membrane glycopro 100.0 4.8E-35 1E-39 293.4 21.5 225 270-514 13-240 (317)
4 PHA02639 EEV host range protei 100.0 5.2E-33 1.1E-37 277.8 21.8 181 332-515 16-209 (295)
5 PHA02954 EEV membrane glycopro 100.0 1.4E-32 3E-37 275.6 21.8 216 218-456 19-240 (317)
6 PHA02639 EEV host range protei 100.0 1.1E-30 2.3E-35 261.1 18.3 180 273-455 17-207 (295)
7 PHA02831 EEV host range protei 100.0 2.4E-30 5.3E-35 252.6 18.4 161 352-521 38-205 (268)
8 PHA02831 EEV host range protei 100.0 6.5E-28 1.4E-32 235.6 16.4 160 235-400 38-204 (268)
9 cd03597 CLECT_attractin_like C 99.9 9E-26 2E-30 201.7 13.3 125 84-215 3-129 (129)
10 cd03588 CLECT_CSPGs C-type lec 99.9 1.5E-25 3.3E-30 199.2 12.2 120 84-215 3-123 (124)
11 cd03589 CLECT_CEL-1_like C-typ 99.9 2.2E-25 4.7E-30 202.0 12.6 128 84-215 3-137 (137)
12 cd03599 CLECT_DGCR2_like C-typ 99.9 1.7E-24 3.8E-29 196.4 12.9 128 84-215 3-153 (153)
13 cd03590 CLECT_DC-SIGN_like C-t 99.9 6.3E-24 1.4E-28 189.5 12.8 121 84-215 3-126 (126)
14 cd03594 CLECT_REG-1_like C-typ 99.9 1.3E-23 2.8E-28 188.3 12.3 124 84-215 3-129 (129)
15 cd03596 CLECT_tetranectin_like 99.9 2E-23 4.3E-28 187.0 12.9 124 84-215 3-129 (129)
16 PHA02817 EEV Host range protei 99.9 1.2E-22 2.6E-27 193.3 14.8 123 389-514 17-148 (225)
17 cd03593 CLECT_NK_receptors_lik 99.9 1.1E-22 2.4E-27 178.6 11.7 114 84-215 3-116 (116)
18 cd03598 CLECT_EMBP_like C-type 99.9 1.5E-22 3.3E-27 178.0 11.6 116 91-215 1-117 (117)
19 cd03591 CLECT_collectin_like C 99.9 2.4E-22 5.1E-27 175.9 11.4 114 91-215 1-114 (114)
20 cd03603 CLECT_VCBS A bacterial 99.9 4.4E-22 9.6E-27 175.1 12.0 113 92-214 1-117 (118)
21 cd03601 CLECT_TC14_like C-type 99.9 1.7E-21 3.6E-26 171.7 11.5 114 94-215 3-119 (119)
22 cd03592 CLECT_selectins_like C 99.9 1.5E-21 3.4E-26 171.1 10.8 113 94-215 3-115 (115)
23 PHA02817 EEV Host range protei 99.9 2.9E-21 6.2E-26 183.9 12.9 122 272-394 18-148 (225)
24 PHA02642 C-type lectin-like pr 99.9 3.6E-21 7.8E-26 181.2 11.5 116 75-217 87-202 (216)
25 cd03600 CLECT_thrombomodulin_l 99.8 6.9E-21 1.5E-25 173.2 11.8 124 90-215 3-139 (141)
26 smart00034 CLECT C-type lectin 99.8 9.6E-21 2.1E-25 168.2 11.8 123 84-214 3-126 (126)
27 cd03595 CLECT_chondrolectin_li 99.8 1.1E-20 2.4E-25 173.2 12.0 122 89-215 8-149 (149)
28 PHA02953 IEV and EEV membrane 99.8 1.7E-20 3.7E-25 173.0 10.1 116 74-216 45-167 (170)
29 cd03602 CLECT_1 C-type lectin 99.8 4.8E-20 1E-24 159.7 10.9 108 92-215 1-108 (108)
30 PHA03097 C-type lectin-like pr 99.8 1.8E-19 3.9E-24 165.0 11.0 111 75-216 45-155 (157)
31 TIGR00864 PCC polycystin catio 99.8 1.2E-19 2.7E-24 217.6 12.5 129 74-219 316-450 (2740)
32 PHA02867 C-type lectin protein 99.8 4.5E-18 9.7E-23 154.6 12.0 109 74-216 47-155 (167)
33 cd00037 CLECT C-type lectin (C 99.6 3.2E-15 7E-20 129.9 11.3 115 92-215 1-116 (116)
34 PF00059 Lectin_C: Lectin C-ty 99.6 5.2E-15 1.1E-19 126.8 8.7 100 101-215 2-105 (105)
35 PHA02911 C-type lectin-like pr 99.3 5.9E-12 1.3E-16 114.6 8.1 106 81-217 102-211 (213)
36 cd00033 CCP Complement control 99.0 9.8E-10 2.1E-14 82.8 6.0 55 458-512 1-57 (57)
37 PF00084 Sushi: Sushi domain ( 98.9 8.8E-10 1.9E-14 82.8 4.5 56 396-453 1-56 (56)
38 smart00032 CCP Domain abundant 98.9 2.1E-09 4.6E-14 80.9 6.1 54 458-511 1-57 (57)
39 cd00033 CCP Complement control 98.9 2.5E-09 5.5E-14 80.5 5.7 57 396-454 1-57 (57)
40 smart00032 CCP Domain abundant 98.9 2.7E-09 6E-14 80.2 5.9 56 396-453 1-57 (57)
41 PF00084 Sushi: Sushi domain ( 98.9 1.6E-09 3.5E-14 81.3 4.1 56 278-333 1-56 (56)
42 PF05473 Herpes_UL45: UL45 pro 98.2 3.7E-06 7.9E-11 80.1 8.4 55 74-134 83-137 (200)
43 PHA02672 ORF110 EEV glycoprote 96.4 0.0037 7.9E-08 54.6 4.0 83 71-170 45-127 (166)
44 KOG4646|consensus 95.5 0.0035 7.5E-08 54.6 0.0 41 1-42 89-129 (173)
45 PHA03093 EEV glycoprotein; Pro 95.5 0.027 5.8E-07 51.5 5.6 47 87-136 104-150 (185)
46 PHA02673 ORF109 EEV glycoprote 95.4 0.02 4.3E-07 51.0 4.2 54 73-136 74-127 (161)
47 PF05966 Chordopox_A33R: Chord 94.1 0.03 6.5E-07 51.8 2.1 33 87-120 108-140 (190)
48 PF12191 stn_TNFRSF12A: Tumour 92.0 0.074 1.6E-06 45.5 1.3 19 495-513 33-51 (129)
49 cd03519 Link_domain_HAPLN_modu 90.4 0.28 6.1E-06 39.9 3.1 23 99-121 8-30 (91)
50 cd03520 Link_domain_CSPGs_modu 88.2 0.48 1E-05 39.1 3.0 23 99-121 8-30 (96)
51 KOG4564|consensus 86.9 0.77 1.7E-05 49.4 4.5 35 476-510 79-118 (473)
52 PTZ00382 Variant-specific surf 85.6 0.49 1.1E-05 39.5 1.8 11 483-493 41-51 (96)
53 cd01102 Link_Domain The link d 84.9 0.68 1.5E-05 38.1 2.2 21 101-121 13-33 (92)
54 smart00445 LINK Link (Hyaluron 84.6 0.72 1.6E-05 38.1 2.3 21 101-121 14-34 (94)
55 cd03518 Link_domain_HAPLN_modu 81.9 1 2.2E-05 37.1 2.2 21 101-121 13-33 (95)
56 cd03515 Link_domain_TSG_6_like 81.7 0.99 2.1E-05 37.1 2.0 21 101-121 13-33 (93)
57 PF14704 DERM: Dermatopontin 81.6 4.8 0.0001 36.8 6.7 138 239-392 3-149 (152)
58 PF00193 Xlink: Extracellular 80.9 1.6 3.4E-05 36.1 2.9 21 101-121 13-33 (92)
59 PF07204 Orthoreo_P10: Orthore 80.2 0.88 1.9E-05 36.8 1.2 26 537-562 47-72 (98)
60 cd03521 Link_domain_KIAA0527_l 79.9 1.4 3E-05 35.7 2.2 21 101-121 13-33 (95)
61 KOG4289|consensus 78.6 7.4 0.00016 46.7 8.2 154 239-449 1737-1902(2531)
62 cd03517 Link_domain_CSPGs_modu 77.6 1.5 3.2E-05 36.3 1.8 21 101-121 13-33 (95)
63 PF01034 Syndecan: Syndecan do 74.8 0.97 2.1E-05 34.1 0.0 23 542-564 22-44 (64)
64 smart00185 ARM Armadillo/beta- 74.3 1.3 2.9E-05 29.8 0.6 38 2-40 3-40 (41)
65 PF15050 SCIMP: SCIMP protein 73.7 1.2 2.6E-05 37.8 0.3 14 541-554 16-29 (133)
66 PF02009 Rifin_STEVOR: Rifin/s 72.3 1.4 2.9E-05 44.9 0.4 18 547-564 273-290 (299)
67 PF14575 EphA2_TM: Ephrin type 70.8 1.5 3.3E-05 34.7 0.3 15 601-615 56-70 (75)
68 TIGR01478 STEVOR variant surfa 70.2 2.8 6.1E-05 41.5 2.0 32 533-564 259-292 (295)
69 cd03516 Link_domain_CD44_like 69.8 3.3 7.2E-05 36.9 2.2 21 101-121 18-38 (144)
70 PTZ00370 STEVOR; Provisional 69.5 2.6 5.6E-05 41.9 1.6 31 534-564 256-288 (296)
71 PF08693 SKG6: Transmembrane a 69.0 3.4 7.3E-05 28.2 1.6 23 531-553 11-33 (40)
72 PF00514 Arm: Armadillo/beta-c 68.7 1.1 2.4E-05 30.6 -0.8 38 2-40 3-40 (41)
73 PF08374 Protocadherin: Protoc 68.6 6.2 0.00013 37.5 3.8 29 529-557 35-64 (221)
74 PF01102 Glycophorin_A: Glycop 67.1 5 0.00011 34.9 2.7 15 542-556 75-89 (122)
75 KOG1219|consensus 65.4 44 0.00096 42.9 10.6 14 594-607 4038-4051(4289)
76 PF14704 DERM: Dermatopontin 64.9 17 0.00038 33.1 6.0 127 297-447 2-144 (152)
77 PF02793 HRM: Hormone receptor 64.6 7.4 0.00016 29.8 3.0 36 414-449 17-55 (66)
78 smart00008 HormR Domain presen 60.8 10 0.00022 29.5 3.2 34 474-507 18-54 (70)
79 smart00008 HormR Domain presen 59.0 9.8 0.00021 29.6 2.8 32 237-268 18-53 (70)
80 KOG4297|consensus 57.7 30 0.00066 29.0 6.3 94 91-194 99-192 (207)
81 PF12273 RCR: Chitin synthesis 56.0 8.7 0.00019 34.0 2.3 26 538-563 4-29 (130)
82 PF04478 Mid2: Mid2 like cell 55.7 3.7 8.1E-05 36.9 -0.1 15 532-546 51-65 (154)
83 PF15102 TMEM154: TMEM154 prot 55.0 4 8.7E-05 36.4 0.0 21 530-550 58-78 (146)
84 PTZ00046 rifin; Provisional 53.4 7.6 0.00016 40.2 1.7 18 547-564 332-349 (358)
85 PF05454 DAG1: Dystroglycan (D 53.2 4.5 9.7E-05 40.9 0.0 29 534-562 150-178 (290)
86 PF02793 HRM: Hormone receptor 52.2 12 0.00026 28.6 2.3 34 474-507 17-55 (66)
87 TIGR01477 RIFIN variant surfac 52.0 8.3 0.00018 39.8 1.7 18 547-564 327-344 (353)
88 PF15330 SIT: SHP2-interacting 51.5 22 0.00048 30.2 3.9 7 594-600 47-53 (107)
89 PF02439 Adeno_E3_CR2: Adenovi 49.9 16 0.00034 24.6 2.1 16 546-561 19-34 (38)
90 KOG0994|consensus 49.1 44 0.00096 39.7 6.9 123 255-398 840-971 (1758)
91 PF15069 FAM163: FAM163 family 48.7 36 0.00077 30.3 4.8 26 532-557 4-31 (143)
92 PF12768 Rax2: Cortical protei 47.6 30 0.00064 35.1 4.9 36 528-563 223-261 (281)
93 PF11980 DUF3481: Domain of un 47.2 22 0.00048 28.4 3.0 31 534-564 19-49 (87)
94 PF12662 cEGF: Complement Clr- 45.3 14 0.00031 22.2 1.3 19 481-500 3-21 (24)
95 smart00607 FTP eel-Fucolectin 44.9 20 0.00044 32.6 2.9 26 42-67 121-146 (151)
96 KOG1025|consensus 44.8 37 0.0008 39.3 5.4 30 235-264 274-304 (1177)
97 TIGR03370 PEPCTERM_Roseo varia 44.1 37 0.00081 20.8 2.9 20 540-559 6-25 (26)
98 KOG3653|consensus 43.5 98 0.0021 33.5 7.9 29 434-464 78-109 (534)
99 PF05393 Hum_adeno_E3A: Human 43.4 42 0.00092 27.1 4.0 20 542-561 42-61 (94)
100 PF03302 VSP: Giardia variant- 41.9 20 0.00044 38.3 2.9 31 482-513 304-337 (397)
101 PF02038 ATP1G1_PLM_MAT8: ATP1 41.3 13 0.00029 26.6 0.9 26 538-563 20-45 (50)
102 KOG1219|consensus 40.9 1.5E+02 0.0033 38.6 9.8 26 302-328 3742-3767(4289)
103 KOG4289|consensus 39.7 1E+02 0.0022 37.9 7.9 139 361-513 1740-1909(2531)
104 PF02009 Rifin_STEVOR: Rifin/s 39.3 16 0.00034 37.3 1.4 27 538-564 261-287 (299)
105 TIGR03778 VPDSG_CTERM VPDSG-CT 37.9 46 0.001 20.4 2.7 19 539-557 6-24 (26)
106 PF06084 Cytomega_TRL10: Cytom 35.5 16 0.00034 30.8 0.6 17 593-609 115-135 (150)
107 TIGR02595 PEP_exosort PEP-CTER 34.4 48 0.001 20.3 2.5 12 543-554 9-20 (26)
108 PF07010 Endomucin: Endomucin; 33.1 1.2E+02 0.0026 29.3 6.0 49 545-594 203-251 (259)
109 KOG4564|consensus 32.6 45 0.00097 36.3 3.6 38 415-452 78-118 (473)
110 PF08114 PMP1_2: ATPase proteo 32.1 46 0.001 22.7 2.2 15 547-561 24-38 (43)
111 PHA03265 envelope glycoprotein 30.9 42 0.00092 34.4 2.9 8 528-535 347-354 (402)
112 PHA03242 envelope glycoprotein 28.2 52 0.0011 35.2 3.1 27 589-617 402-428 (428)
113 PF12606 RELT: Tumour necrosis 27.9 1.3E+02 0.0028 21.8 4.1 24 541-564 8-31 (50)
114 PF12877 DUF3827: Domain of un 27.7 91 0.002 34.9 4.9 40 528-567 264-305 (684)
115 PF07979 Intimin_C: Intimin C- 27.7 42 0.00091 28.1 1.8 31 100-132 11-41 (101)
116 PF15345 TMEM51: Transmembrane 27.5 1.1E+02 0.0024 29.7 4.9 25 537-561 65-89 (233)
117 PF06523 DUF1106: Protein of u 26.3 18 0.00038 27.8 -0.5 22 594-615 69-90 (91)
118 PF04689 S1FA: DNA binding pro 26.2 1.7E+02 0.0037 22.2 4.5 24 536-559 18-41 (69)
119 PF01708 Gemini_mov: Geminivir 26.1 1.2E+02 0.0026 24.7 4.1 12 558-569 67-78 (91)
120 PF13268 DUF4059: Protein of u 25.8 55 0.0012 25.4 2.0 25 539-563 15-39 (72)
121 PF08374 Protocadherin: Protoc 25.1 90 0.0019 29.9 3.7 25 530-554 40-64 (221)
122 PF05545 FixQ: Cbb3-type cytoc 24.9 84 0.0018 22.4 2.8 15 546-560 20-34 (49)
123 PF06247 Plasmod_Pvs28: Plasmo 24.8 54 0.0012 30.7 2.2 52 300-369 68-119 (197)
124 PTZ00046 rifin; Provisional 24.8 48 0.001 34.5 2.1 23 542-564 324-346 (358)
125 PF01299 Lamp: Lysosome-associ 24.8 24 0.00051 36.3 -0.1 13 547-559 288-300 (306)
126 TIGR01477 RIFIN variant surfac 24.4 49 0.0011 34.3 2.1 23 542-564 319-341 (353)
127 KOG0196|consensus 24.3 1.2E+02 0.0025 35.2 5.0 29 305-333 231-261 (996)
128 PTZ00382 Variant-specific surf 24.3 50 0.0011 27.5 1.8 7 424-430 9-15 (96)
129 PF14851 FAM176: FAM176 family 24.2 20 0.00042 32.6 -0.7 27 528-554 21-47 (153)
130 PF08391 Ly49: Ly49-like prote 24.0 44 0.00095 29.0 1.4 13 84-96 107-119 (119)
131 PF07589 VPEP: PEP-CTERM motif 23.3 1E+02 0.0022 18.7 2.5 12 542-553 9-20 (25)
132 PF10577 UPF0560: Uncharacteri 21.5 77 0.0017 36.5 3.0 18 236-253 46-64 (807)
133 PRK11056 hypothetical protein; 20.9 1.2E+02 0.0026 26.1 3.4 26 538-563 93-118 (120)
134 KOG0994|consensus 20.2 2.4E+02 0.0052 34.0 6.5 68 241-311 932-1008(1758)
No 1
>PHA02927 secreted complement-binding protein; Provisional
Probab=100.00 E-value=1.7e-41 Score=338.80 Aligned_cols=234 Identities=29% Similarity=0.637 Sum_probs=210.8
Q ss_pred ceeeecCCCCCccCCeeEEec-----CCCcccCCEEEEEeCCCceec--CCCeEEeCCCCCcCCCCCcccccccCCCCCc
Q psy1807 272 TCQYVDCGKLEHIEHGTVTLE-----TTRTTHGAVAIYACHENYTLI--GETRRVCGDGGKWNGTEPQCLFDWCAEPPQI 344 (618)
Q Consensus 272 ~C~~~~C~~~~~~~ng~~~~~-----~~~~~~g~~~~~~C~~Gy~l~--G~~~~~C~~~G~Ws~~~p~C~~~~C~~p~~~ 344 (618)
.|.++.|+.|..+.||.+..+ ...|.+|++++|+|++||.+. |...++|+.+| |+. .|.|+.+.|+.|+.+
T Consensus 15 ~c~~~~c~~~~~~~~~~~~~~~~~~~~~~y~~g~~v~y~C~~Gy~~~~~g~~~~~C~~~g-Ws~-~p~C~~~~C~~p~~i 92 (263)
T PHA02927 15 GCVLSCCTIPSRPINMKFKNSVETDANANYNIGDTIEYLCLPGYRKQKMGPIYAKCTGTG-WTL-FNQCIKRRCPSPRDI 92 (263)
T ss_pred HHHhccCCCCCcccceeeccccccccCceeCCCCEEEEEeCCCceecCCCccEEEecCCC-CCC-CCcEEeCCCcCCCCC
Confidence 477889999999999998732 346889999999999999975 77889999888 995 799999999999999
Q ss_pred cccEEEecccccCceeeecCCCCcEEcCCceEEEccCC----eecCCCCceeeecCCCCCccCCCceEeecCcccccceE
Q psy1807 345 SGGIVTTSGRRTGSVATYSCEPGFILFGSNELKCGLGG----DWSGKAPTCKYVDCGSPANIDNGHYSLLNGTTTHGSVA 420 (618)
Q Consensus 345 ~~g~~~~~~~~~g~~~~~~C~~Gy~l~G~~~~~C~~~G----~Ws~~~p~C~~~~C~~p~~~~~g~~~~~~~~~~~g~~~ 420 (618)
.||.+....+.+|++|+|+|++||.|.|...++|+.+| +|++..|.|+.+.|+.|+.+.||.+......|.+|++|
T Consensus 93 ~NG~~~~~~~~~G~~v~y~C~~Gy~l~G~~~~~C~~~~~g~~~Ws~~~P~C~~~~C~~P~~~~nG~~~~~~~~y~~g~~v 172 (263)
T PHA02927 93 DNGQLDIGGVDFGSSITYSCNSGYQLIGESKSYCELGSTGSMVWNPEAPICESVKCQSPPSISNGRHNGYEDFYTDGSVV 172 (263)
T ss_pred CCCEEeCCCccCCCEEEEECCCCCEEcCCCeeEEEeCCCCcceECCCCCccccccCCCCCCCCCcEEcCCcccccCCCEE
Confidence 99999888889999999999999999999999999753 89999999999999999999999987655568899999
Q ss_pred EEEcCCCceeccCCcceEEeccCCeeeCCCCceeeecCCCCCcCCCcEEEc---ccCCCCcEEEEEcCCCCeeeCCceeE
Q psy1807 421 EYTCDDDYWLEAGAMRRQVCMRDGKWSGDTPSCELITCPEPEVPKGGYVVG---YDLNVHSTLEFHCEPGHLLVGQASLK 497 (618)
Q Consensus 421 ~~~C~~Gy~l~~~G~~~~~C~~~G~Ws~~~p~C~~~~C~~p~~~~~g~~~~---~~~~~g~~v~~~C~~Gy~l~G~~~~~ 497 (618)
+|+|++||.|. |...++|. +|+|+. +|.|+.+.|+.|.++ ||.+.. ..|.+|++|+|+|++||+|.|++.++
T Consensus 173 ~y~C~~Gy~l~--G~~~~~C~-~G~Ws~-~P~C~~v~C~~P~i~-ng~~~~~~k~~y~~g~~v~y~C~~Gy~l~G~~~~~ 247 (263)
T PHA02927 173 TYSCNSGYSLI--GNSGVLCS-GGEWSD-PPTCQIVKCPHPTIS-NGYLSSGFKRSYSYNDNVDFKCKYGYKLSGSSSST 247 (263)
T ss_pred EEECCCCCEEC--CCCeeEEC-CCccCC-CCeEeEeECcCCCCC-CCEEecCCCCccccCCEEEEECCCCCeEcCCCCeE
Confidence 99999999999 99999997 899997 799999999998765 565543 36899999999999999999999999
Q ss_pred eCcCCcccCCCCceE
Q psy1807 498 CTRQGDWDNMSPQFI 512 (618)
Q Consensus 498 C~~~G~Ws~~~P~C~ 512 (618)
|+++|+|++++|+|+
T Consensus 248 C~~~g~Ws~~~P~C~ 262 (263)
T PHA02927 248 CSPGNTWQPELPKCV 262 (263)
T ss_pred ECCCCEECCCCCeec
Confidence 999999999999996
No 2
>PHA02927 secreted complement-binding protein; Provisional
Probab=100.00 E-value=1.3e-38 Score=318.03 Aligned_cols=236 Identities=31% Similarity=0.643 Sum_probs=207.5
Q ss_pred ccccccCCCCCCCCCCcccceeeec------ccccccCCEEEEEcCCCceec--CCceeEecCCcccCCCCceeeecCCC
Q psy1807 209 LHWICQHNPANCGSPDRHLNTTFVG------TVNTKLGSTISYACPEGNMLV--GSATRTCKEGFWTGVAPTCQYVDCGK 280 (618)
Q Consensus 209 ~~~iC~~~~~~C~~p~~~~n~~~~~------~~~~~~G~~~~y~C~~Gy~l~--g~~~~~C~~g~Ws~~~p~C~~~~C~~ 280 (618)
+.+-|. +..|+.|..+.|+.+.. ...|.+|++|+|+|++||.+. |...++|+.+.|+. .|.|+.+.|+.
T Consensus 12 ~~~~c~--~~~c~~~~~~~~~~~~~~~~~~~~~~y~~g~~v~y~C~~Gy~~~~~g~~~~~C~~~gWs~-~p~C~~~~C~~ 88 (263)
T PHA02927 12 LGIGCV--LSCCTIPSRPINMKFKNSVETDANANYNIGDTIEYLCLPGYRKQKMGPIYAKCTGTGWTL-FNQCIKRRCPS 88 (263)
T ss_pred HHHHHH--hccCCCCCcccceeeccccccccCceeCCCCEEEEEeCCCceecCCCccEEEecCCCCCC-CCcEEeCCCcC
Confidence 345565 47799999999988752 247899999999999999874 78889999555994 79999999999
Q ss_pred CCccCCeeEEecCCCcccCCEEEEEeCCCceecCCCeEEeCCCC----CcCCCCCcccccccCCCCCccccEEEec--cc
Q psy1807 281 LEHIEHGTVTLETTRTTHGAVAIYACHENYTLIGETRRVCGDGG----KWNGTEPQCLFDWCAEPPQISGGIVTTS--GR 354 (618)
Q Consensus 281 ~~~~~ng~~~~~~~~~~~g~~~~~~C~~Gy~l~G~~~~~C~~~G----~Ws~~~p~C~~~~C~~p~~~~~g~~~~~--~~ 354 (618)
|+.+.||.+... .+.+|++++|+|++||+|.|...++|+.+| +|++..|.|+.+.|+.|+.+.||.+... .+
T Consensus 89 p~~i~NG~~~~~--~~~~G~~v~y~C~~Gy~l~G~~~~~C~~~~~g~~~Ws~~~P~C~~~~C~~P~~~~nG~~~~~~~~y 166 (263)
T PHA02927 89 PRDIDNGQLDIG--GVDFGSSITYSCNSGYQLIGESKSYCELGSTGSMVWNPEAPICESVKCQSPPSISNGRHNGYEDFY 166 (263)
T ss_pred CCCCCCCEEeCC--CccCCCEEEEECCCCCEEcCCCeeEEEeCCCCcceECCCCCccccccCCCCCCCCCcEEcCCcccc
Confidence 999999998753 478999999999999999999999998753 7999999999999999999999988653 36
Q ss_pred ccCceeeecCCCCcEEcCCceEEEccCCeecCCCCceeeecCCCCCccCCCceEee-cCcccccceEEEEcCCCceeccC
Q psy1807 355 RTGSVATYSCEPGFILFGSNELKCGLGGDWSGKAPTCKYVDCGSPANIDNGHYSLL-NGTTTHGSVAEYTCDDDYWLEAG 433 (618)
Q Consensus 355 ~~g~~~~~~C~~Gy~l~G~~~~~C~~~G~Ws~~~p~C~~~~C~~p~~~~~g~~~~~-~~~~~~g~~~~~~C~~Gy~l~~~ 433 (618)
.+|++|+|+|++||.|.|...++|+ +|+|+. +|.|+.+.|+.|. +.||.+... ...|.+|++|+|+|++||.|.
T Consensus 167 ~~g~~v~y~C~~Gy~l~G~~~~~C~-~G~Ws~-~P~C~~v~C~~P~-i~ng~~~~~~k~~y~~g~~v~y~C~~Gy~l~-- 241 (263)
T PHA02927 167 TDGSVVTYSCNSGYSLIGNSGVLCS-GGEWSD-PPTCQIVKCPHPT-ISNGYLSSGFKRSYSYNDNVDFKCKYGYKLS-- 241 (263)
T ss_pred cCCCEEEEECCCCCEECCCCeeEEC-CCccCC-CCeEeEeECcCCC-CCCCEEecCCCCccccCCEEEEECCCCCeEc--
Confidence 7899999999999999999999997 899997 7999999999875 778887643 356899999999999999999
Q ss_pred CcceEEeccCCeeeCCCCcee
Q psy1807 434 AMRRQVCMRDGKWSGDTPSCE 454 (618)
Q Consensus 434 G~~~~~C~~~G~Ws~~~p~C~ 454 (618)
|++.++|+.+|+|+++.|+|.
T Consensus 242 G~~~~~C~~~g~Ws~~~P~C~ 262 (263)
T PHA02927 242 GSSSSTCSPGNTWQPELPKCV 262 (263)
T ss_pred CCCCeEECCCCEECCCCCeec
Confidence 999999999999999999995
No 3
>PHA02954 EEV membrane glycoprotein; Provisional
Probab=100.00 E-value=4.8e-35 Score=293.40 Aligned_cols=225 Identities=20% Similarity=0.487 Sum_probs=190.0
Q ss_pred CCceeeecCCCCCccCCeeEEecCCCcccCCEEEEEeCCCceecCCCeEEeCCCCCcCCCCCcccccccCCCCCccccEE
Q psy1807 270 APTCQYVDCGKLEHIEHGTVTLETTRTTHGAVAIYACHENYTLIGETRRVCGDGGKWNGTEPQCLFDWCAEPPQISGGIV 349 (618)
Q Consensus 270 ~p~C~~~~C~~~~~~~ng~~~~~~~~~~~g~~~~~~C~~Gy~l~G~~~~~C~~~G~Ws~~~p~C~~~~C~~p~~~~~g~~ 349 (618)
.+.|....|+.|. +.||.+......|.+|++++|.|++||.+.|. .++|..+ .|++..| |. ..|+.|..+.++..
T Consensus 13 ~~~~v~~~C~~P~-~~ng~~~~~~~~Y~~gd~V~y~C~~GY~l~~~-~itC~~~-~WS~~~p-C~-k~C~~p~~~~~~~~ 87 (317)
T PHA02954 13 LPAVVYSTCTVPT-MNNAKLTSTETSFNDKQKVTFTCDSGYYSLDP-NAVCETD-KWKYENP-CK-KMCTVSDYVSELYD 87 (317)
T ss_pred HHHHHhCcCCCCC-CCcCEEcCCcccccCCCEEEEEcCCCcccCCC-CeEECCC-cccCCCC-CC-CcCCCCcccccccc
Confidence 4567788999864 57888776545788899999999999998875 6999765 6998777 97 57988766665543
Q ss_pred EecccccCceeeecCCCCcEEcCCceEEEcc---CCeecCCCCceeeecCCCCCccCCCceEeecCcccccceEEEEcCC
Q psy1807 350 TTSGRRTGSVATYSCEPGFILFGSNELKCGL---GGDWSGKAPTCKYVDCGSPANIDNGHYSLLNGTTTHGSVAEYTCDD 426 (618)
Q Consensus 350 ~~~~~~~g~~~~~~C~~Gy~l~G~~~~~C~~---~G~Ws~~~p~C~~~~C~~p~~~~~g~~~~~~~~~~~g~~~~~~C~~ 426 (618)
...|.+|++|+|+|++||. .++|+. ++.|+. .|.|+.+.|++| .+.||.+......|.+|++|+|+|++
T Consensus 88 -~~~y~~G~~V~ysC~~Gy~-----~~~C~~~g~~~~WS~-~ptC~~i~C~pP-~i~NG~~~~~~~~Y~yGd~VtysC~~ 159 (317)
T PHA02954 88 -KPLYEVNSTITLICKDETK-----YFRCEEKNGNTSWND-TVTCPNAECQPL-QLEHGSCQPVKEKYSFGEHITINCDV 159 (317)
T ss_pred -CCCccCCCEEEEEECCCCc-----EEEeCCCCCCCccCC-CCeecceECCCC-CCCCCeeccCCCceeCCCEEEEEcCC
Confidence 3468899999999999994 578975 468996 679999999755 58899987656679999999999999
Q ss_pred CceeccCCcceEEeccCCeeeCCCCceeeecCCCCCcCCCcEEEcccCCCCcEEEEEcCCCCeeeCCceeEeCcCCcccC
Q psy1807 427 DYWLEAGAMRRQVCMRDGKWSGDTPSCELITCPEPEVPKGGYVVGYDLNVHSTLEFHCEPGHLLVGQASLKCTRQGDWDN 506 (618)
Q Consensus 427 Gy~l~~~G~~~~~C~~~G~Ws~~~p~C~~~~C~~p~~~~~g~~~~~~~~~g~~v~~~C~~Gy~l~G~~~~~C~~~G~Ws~ 506 (618)
||.|. |++.++|+.+ .|+ +.|.|+. .|+.|. +.||.+.+..|.+|++|+|+|++||.|.|+..++|+ +|+|++
T Consensus 160 GY~L~--Gs~~i~C~~~-~Ws-~~P~C~~-~C~~P~-i~NG~~sg~~y~~Gd~Vtf~C~~Gy~L~Gs~~itC~-~g~Ws~ 232 (317)
T PHA02954 160 GYEVI--GASYISCTAN-SWN-VIPSCQQ-KCDIPS-LSNGLISGSTFSIGGVIHLSCKSGFTLTGSPSSTCI-DGKWNP 232 (317)
T ss_pred CCEEC--cCCeeEECCC-cCC-CCCcccc-ccCCCC-CCCCeEcCCCcccCCEEEEECCCCCeECCCcceEEC-CCeEeC
Confidence 99999 9999999977 698 6799986 698764 567877777799999999999999999999999998 999999
Q ss_pred CCCceEEe
Q psy1807 507 MSPQFISF 514 (618)
Q Consensus 507 ~~P~C~~~ 514 (618)
++|+|...
T Consensus 233 ~~P~C~~~ 240 (317)
T PHA02954 233 VLPICVRS 240 (317)
T ss_pred CCCceeCC
Confidence 99999986
No 4
>PHA02639 EEV host range protein; Provisional
Probab=100.00 E-value=5.2e-33 Score=277.76 Aligned_cols=181 Identities=27% Similarity=0.592 Sum_probs=162.4
Q ss_pred cccccccCCCCCccccEEEe--cccccCceeeecCCCCcEEcCCceEEEccC---CeecCCCCceeeecCCCCCccCCCc
Q psy1807 332 QCLFDWCAEPPQISGGIVTT--SGRRTGSVATYSCEPGFILFGSNELKCGLG---GDWSGKAPTCKYVDCGSPANIDNGH 406 (618)
Q Consensus 332 ~C~~~~C~~p~~~~~g~~~~--~~~~~g~~~~~~C~~Gy~l~G~~~~~C~~~---G~Ws~~~p~C~~~~C~~p~~~~~g~ 406 (618)
.++.+.|+.|+.+.||.+.. ..|.+|++++|+|++||.|.|...++|+++ |+|++..|.|+.+.|+.|+.+.||.
T Consensus 16 ~~k~i~C~~P~~i~NG~v~~~~~~y~~G~~V~Y~Cn~GY~L~G~~~~~C~~d~~nG~WS~~~P~C~~~~C~~Pp~~~nG~ 95 (295)
T PHA02639 16 GVKSIYCDKPDDISNGFITELMEKYEIGKLIEYTCNTDYALIGDRFRTCIKDKNNAIWSNKAPFCMLKECNDPPSIINGK 95 (295)
T ss_pred CccccCCcCCCCCCCCEEeccCCcccCCCEEEEEeCCCCEECCCCeEEEeCCCCCCEECCCCCEEeeccCcCCCCCCCcE
Confidence 35567899999999998875 358899999999999999999999999864 6999999999999999999999999
Q ss_pred eEeecCcccccceEEEEcCC----CceeccCCcceEEeccCCeeeCCCCceeeecCCCCCcCCCcEEEc----ccCCCCc
Q psy1807 407 YSLLNGTTTHGSVAEYTCDD----DYWLEAGAMRRQVCMRDGKWSGDTPSCELITCPEPEVPKGGYVVG----YDLNVHS 478 (618)
Q Consensus 407 ~~~~~~~~~~g~~~~~~C~~----Gy~l~~~G~~~~~C~~~G~Ws~~~p~C~~~~C~~p~~~~~g~~~~----~~~~~g~ 478 (618)
+......|.+|++++|+|++ ||.|. |...++|+.+|+|+++.|+|+.+.|+.|.++ +|.+.. ..|.+|+
T Consensus 96 i~~~~~~y~~G~~V~y~C~~g~~~gY~L~--G~~~~~C~~dG~WS~~~P~C~~i~C~~P~i~-nG~v~~~~~~~~~~yg~ 172 (295)
T PHA02639 96 IYNKREMYKVGDEIYYVCNEHKGVQYSLV--GNEKITCIQDKSWKPDPPICKMINCRFPALQ-NGYINGIPSNKKFYYKT 172 (295)
T ss_pred EecCCCceECCCEEEEEeCCCCCCceEEC--CCCeEEECCCCeECCCCCeeeeEEeCCCCCC-CCceEcccCCCceecCC
Confidence 88766779999999999999 59999 9999999999999999999999999988654 555543 3578999
Q ss_pred EEEEEcCCCCeeeCCceeEeCcCCcccCCCCceEEec
Q psy1807 479 TLEFHCEPGHLLVGQASLKCTRQGDWDNMSPQFISFY 515 (618)
Q Consensus 479 ~v~~~C~~Gy~l~G~~~~~C~~~G~Ws~~~P~C~~~~ 515 (618)
+|+|+|++||.|.|+..++|+.+|+|++++|+|+...
T Consensus 173 ~V~fsC~~GY~L~Gs~~~tC~~nG~Ws~~~P~C~~~~ 209 (295)
T PHA02639 173 RVGFSCKSGFDLVGEKYSTCNINATWFPSIPTCVRNK 209 (295)
T ss_pred EEEEEcCCCCeEcCCCcEEECCCCeECCCCCeEeccC
Confidence 9999999999999999999999999999999999753
No 5
>PHA02954 EEV membrane glycoprotein; Provisional
Probab=100.00 E-value=1.4e-32 Score=275.64 Aligned_cols=216 Identities=21% Similarity=0.490 Sum_probs=181.7
Q ss_pred CCCCCCCcccceeeecc-cccccCCEEEEEcCCCceecCCceeEecCCcccCCCCceeeecCCCCCccCCeeEEecCCCc
Q psy1807 218 ANCGSPDRHLNTTFVGT-VNTKLGSTISYACPEGNMLVGSATRTCKEGFWTGVAPTCQYVDCGKLEHIEHGTVTLETTRT 296 (618)
Q Consensus 218 ~~C~~p~~~~n~~~~~~-~~~~~G~~~~y~C~~Gy~l~g~~~~~C~~g~Ws~~~p~C~~~~C~~~~~~~ng~~~~~~~~~ 296 (618)
..|+.|.. .|+.+.+. ..|.+|++++|+|++||.+.+. .++|+++.|+...| |. +.|..|..+.++.. ...|
T Consensus 19 ~~C~~P~~-~ng~~~~~~~~Y~~gd~V~y~C~~GY~l~~~-~itC~~~~WS~~~p-C~-k~C~~p~~~~~~~~---~~~y 91 (317)
T PHA02954 19 STCTVPTM-NNAKLTSTETSFNDKQKVTFTCDSGYYSLDP-NAVCETDKWKYENP-CK-KMCTVSDYVSELYD---KPLY 91 (317)
T ss_pred CcCCCCCC-CcCEEcCCcccccCCCEEEEEcCCCcccCCC-CeEECCCcccCCCC-CC-CcCCCCcccccccc---CCCc
Confidence 67988765 68876654 5799999999999999998875 68999888998777 97 57998766665533 2358
Q ss_pred ccCCEEEEEeCCCceecCCCeEEeCC---CCCcCCCCCcccccccCCCCCccccEEEec--ccccCceeeecCCCCcEEc
Q psy1807 297 THGAVAIYACHENYTLIGETRRVCGD---GGKWNGTEPQCLFDWCAEPPQISGGIVTTS--GRRTGSVATYSCEPGFILF 371 (618)
Q Consensus 297 ~~g~~~~~~C~~Gy~l~G~~~~~C~~---~G~Ws~~~p~C~~~~C~~p~~~~~g~~~~~--~~~~g~~~~~~C~~Gy~l~ 371 (618)
.+|++++|+|++||. .++|.. ++.|+. .|+|+.+.|+ ++.+.||.+... .|.+|+.|+|+|++||.|.
T Consensus 92 ~~G~~V~ysC~~Gy~-----~~~C~~~g~~~~WS~-~ptC~~i~C~-pP~i~NG~~~~~~~~Y~yGd~VtysC~~GY~L~ 164 (317)
T PHA02954 92 EVNSTITLICKDETK-----YFRCEEKNGNTSWND-TVTCPNAECQ-PLQLEHGSCQPVKEKYSFGEHITINCDVGYEVI 164 (317)
T ss_pred cCCCEEEEEECCCCc-----EEEeCCCCCCCccCC-CCeecceECC-CCCCCCCeeccCCCceeCCCEEEEEcCCCCEEC
Confidence 899999999999995 478975 468997 5799999997 456789987653 4899999999999999999
Q ss_pred CCceEEEccCCeecCCCCceeeecCCCCCccCCCceEeecCcccccceEEEEcCCCceeccCCcceEEeccCCeeeCCCC
Q psy1807 372 GSNELKCGLGGDWSGKAPTCKYVDCGSPANIDNGHYSLLNGTTTHGSVAEYTCDDDYWLEAGAMRRQVCMRDGKWSGDTP 451 (618)
Q Consensus 372 G~~~~~C~~~G~Ws~~~p~C~~~~C~~p~~~~~g~~~~~~~~~~~g~~~~~~C~~Gy~l~~~G~~~~~C~~~G~Ws~~~p 451 (618)
|+..++|+++ .|+ ..|.|+. .|+.| .+.||.+.. ..|.+|++|+|+|++||.|. |++.++|. +|+|+++.|
T Consensus 165 Gs~~i~C~~~-~Ws-~~P~C~~-~C~~P-~i~NG~~sg--~~y~~Gd~Vtf~C~~Gy~L~--Gs~~itC~-~g~Ws~~~P 235 (317)
T PHA02954 165 GASYISCTAN-SWN-VIPSCQQ-KCDIP-SLSNGLISG--STFSIGGVIHLSCKSGFTLT--GSPSSTCI-DGKWNPVLP 235 (317)
T ss_pred cCCeeEECCC-cCC-CCCcccc-ccCCC-CCCCCeEcC--CCcccCCEEEEECCCCCeEC--CCcceEEC-CCeEeCCCC
Confidence 9999999987 698 6899985 69865 588998753 45889999999999999999 99999997 999999999
Q ss_pred ceeee
Q psy1807 452 SCELI 456 (618)
Q Consensus 452 ~C~~~ 456 (618)
.|...
T Consensus 236 ~C~~~ 240 (317)
T PHA02954 236 ICVRS 240 (317)
T ss_pred ceeCC
Confidence 99875
No 6
>PHA02639 EEV host range protein; Provisional
Probab=99.97 E-value=1.1e-30 Score=261.14 Aligned_cols=180 Identities=24% Similarity=0.578 Sum_probs=162.6
Q ss_pred eeeecCCCCCccCCeeEEecCCCcccCCEEEEEeCCCceecCCCeEEeCCC---CCcCCCCCcccccccCCCCCccccEE
Q psy1807 273 CQYVDCGKLEHIEHGTVTLETTRTTHGAVAIYACHENYTLIGETRRVCGDG---GKWNGTEPQCLFDWCAEPPQISGGIV 349 (618)
Q Consensus 273 C~~~~C~~~~~~~ng~~~~~~~~~~~g~~~~~~C~~Gy~l~G~~~~~C~~~---G~Ws~~~p~C~~~~C~~p~~~~~g~~ 349 (618)
++.+.|+.|+.+.||.+......|.+|+++.|+|++||+|.|...++|+.+ |+|++..|.|+.+.|+.|+.+.||.+
T Consensus 17 ~k~i~C~~P~~i~NG~v~~~~~~y~~G~~V~Y~Cn~GY~L~G~~~~~C~~d~~nG~WS~~~P~C~~~~C~~Pp~~~nG~i 96 (295)
T PHA02639 17 VKSIYCDKPDDISNGFITELMEKYEIGKLIEYTCNTDYALIGDRFRTCIKDKNNAIWSNKAPFCMLKECNDPPSIINGKI 96 (295)
T ss_pred ccccCCcCCCCCCCCEEeccCCcccCCCEEEEEeCCCCEECCCCeEEEeCCCCCCEECCCCCEEeeccCcCCCCCCCcEE
Confidence 566789999999999988666679999999999999999999999999864 69999999999999999999999988
Q ss_pred Eec--ccccCceeeecCCC----CcEEcCCceEEEccCCeecCCCCceeeecCCCCCccCCCceEe--ecCcccccceEE
Q psy1807 350 TTS--GRRTGSVATYSCEP----GFILFGSNELKCGLGGDWSGKAPTCKYVDCGSPANIDNGHYSL--LNGTTTHGSVAE 421 (618)
Q Consensus 350 ~~~--~~~~g~~~~~~C~~----Gy~l~G~~~~~C~~~G~Ws~~~p~C~~~~C~~p~~~~~g~~~~--~~~~~~~g~~~~ 421 (618)
... .|.+|++++|+|++ ||.|.|...++|+.+|+|++..|.|+.+.|+.|. +.||.+.. ....+.+|++|+
T Consensus 97 ~~~~~~y~~G~~V~y~C~~g~~~gY~L~G~~~~~C~~dG~WS~~~P~C~~i~C~~P~-i~nG~v~~~~~~~~~~yg~~V~ 175 (295)
T PHA02639 97 YNKREMYKVGDEIYYVCNEHKGVQYSLVGNEKITCIQDKSWKPDPPICKMINCRFPA-LQNGYINGIPSNKKFYYKTRVG 175 (295)
T ss_pred ecCCCceECCCEEEEEeCCCCCCceEECCCCeEEECCCCeECCCCCeeeeEEeCCCC-CCCCceEcccCCCceecCCEEE
Confidence 764 47889999999999 5999999999999999999999999999999774 77887653 234678999999
Q ss_pred EEcCCCceeccCCcceEEeccCCeeeCCCCceee
Q psy1807 422 YTCDDDYWLEAGAMRRQVCMRDGKWSGDTPSCEL 455 (618)
Q Consensus 422 ~~C~~Gy~l~~~G~~~~~C~~~G~Ws~~~p~C~~ 455 (618)
|+|++||.|. |+..++|+.+|+|++..|+|..
T Consensus 176 fsC~~GY~L~--Gs~~~tC~~nG~Ws~~~P~C~~ 207 (295)
T PHA02639 176 FSCKSGFDLV--GEKYSTCNINATWFPSIPTCVR 207 (295)
T ss_pred EEcCCCCeEc--CCCcEEECCCCeECCCCCeEec
Confidence 9999999999 9999999999999999999986
No 7
>PHA02831 EEV host range protein; Provisional
Probab=99.97 E-value=2.4e-30 Score=252.64 Aligned_cols=161 Identities=21% Similarity=0.477 Sum_probs=142.6
Q ss_pred cccccCceeeecCCCCcEEcCCceEEEccCCeecCCCCcee-eecCCCCCccCCCceEeecCcccccceEEEEcC----C
Q psy1807 352 SGRRTGSVATYSCEPGFILFGSNELKCGLGGDWSGKAPTCK-YVDCGSPANIDNGHYSLLNGTTTHGSVAEYTCD----D 426 (618)
Q Consensus 352 ~~~~~g~~~~~~C~~Gy~l~G~~~~~C~~~G~Ws~~~p~C~-~~~C~~p~~~~~g~~~~~~~~~~~g~~~~~~C~----~ 426 (618)
..|.+|++|+|+|++||.+ ..++|. +|+|++ .|.|. .+.|+.|+.+.||.+......|.+|++|+|+|+ +
T Consensus 38 ~~Y~~Gs~V~Y~C~~GY~~---~~itC~-dG~WS~-~P~C~~~~~C~~Pp~i~NG~i~~~~~~Y~~Gd~VtYsC~~g~~~ 112 (268)
T PHA02831 38 KVYEENENLEYKCNNNFDK---VFVTCN-NGSWST-KNMCIGKRNCKDPVTILNGYIKNKKDQYSFGDSVTYACKVNKLE 112 (268)
T ss_pred cccCCCCEEEEECCCCCEe---eeEEeC-CCccCC-CCcccccccCcCCCCCCCCEEeccCCceeCCCEEEEECCCCCCC
Confidence 3588999999999999985 378895 999996 57785 468999999999999876677999999999999 6
Q ss_pred CceeccCCcceEEeccCCeeeCCCCceeeecCCCCCcCCCcEEEc--ccCCCCcEEEEEcCCCCeeeCCceeEeCcCCcc
Q psy1807 427 DYWLEAGAMRRQVCMRDGKWSGDTPSCELITCPEPEVPKGGYVVG--YDLNVHSTLEFHCEPGHLLVGQASLKCTRQGDW 504 (618)
Q Consensus 427 Gy~l~~~G~~~~~C~~~G~Ws~~~p~C~~~~C~~p~~~~~g~~~~--~~~~~g~~v~~~C~~Gy~l~G~~~~~C~~~G~W 504 (618)
||.|. |.+.++|. +|+|+++.|.|+.+.|+.|. ..||.+.. ..|.+|++|+|+|++||.|.|+..++|+.+|+|
T Consensus 113 GY~Lv--G~s~~tC~-dG~WS~~~P~C~~i~C~~P~-i~NG~i~~~~~~y~~G~~Vty~C~~GY~L~Gss~~tC~~nG~W 188 (268)
T PHA02831 113 KYSIV--GNETVKCI-NKQWVPKYPVCKLIRCKYPA-LQNGFLNVFEKKFYYGDIVNFKCKKGFILLGSSVSTCDINSIW 188 (268)
T ss_pred CeeEc--CCceeEeC-CCcCCCCCCeeeEeeCCCCC-CCCCccccCCCceecCCEEEEEcCCCCEECCCccEEECCCCeE
Confidence 99999 99999995 99999999999999999986 45776653 468899999999999999999999999999999
Q ss_pred cCCCCceEEeccCCCcc
Q psy1807 505 DNMSPQFISFYFPPSET 521 (618)
Q Consensus 505 s~~~P~C~~~~c~p~~~ 521 (618)
++++|+|+...|+++..
T Consensus 189 sp~~P~C~~~~~~~~~~ 205 (268)
T PHA02831 189 YPGIPKCVKDKVHNEIQ 205 (268)
T ss_pred CCCCCCccCcccCCCCC
Confidence 99999999988886543
No 8
>PHA02831 EEV host range protein; Provisional
Probab=99.95 E-value=6.5e-28 Score=235.62 Aligned_cols=160 Identities=23% Similarity=0.517 Sum_probs=142.9
Q ss_pred cccccCCEEEEEcCCCceecCCceeEecCCcccCCCCcee-eecCCCCCccCCeeEEecCCCcccCCEEEEEeC----CC
Q psy1807 235 VNTKLGSTISYACPEGNMLVGSATRTCKEGFWTGVAPTCQ-YVDCGKLEHIEHGTVTLETTRTTHGAVAIYACH----EN 309 (618)
Q Consensus 235 ~~~~~G~~~~y~C~~Gy~l~g~~~~~C~~g~Ws~~~p~C~-~~~C~~~~~~~ng~~~~~~~~~~~g~~~~~~C~----~G 309 (618)
..|.+|++|+|+|++||.+ ..++|.+|.|++ .|.|. .+.|+.|+.+.||.+......|.+|+.++|+|+ +|
T Consensus 38 ~~Y~~Gs~V~Y~C~~GY~~---~~itC~dG~WS~-~P~C~~~~~C~~Pp~i~NG~i~~~~~~Y~~Gd~VtYsC~~g~~~G 113 (268)
T PHA02831 38 KVYEENENLEYKCNNNFDK---VFVTCNNGSWST-KNMCIGKRNCKDPVTILNGYIKNKKDQYSFGDSVTYACKVNKLEK 113 (268)
T ss_pred cccCCCCEEEEECCCCCEe---eeEEeCCCccCC-CCcccccccCcCCCCCCCCEEeccCCceeCCCEEEEECCCCCCCC
Confidence 4799999999999999985 478999999996 47775 468999999999999876667999999999999 69
Q ss_pred ceecCCCeEEeCCCCCcCCCCCcccccccCCCCCccccEEEe--cccccCceeeecCCCCcEEcCCceEEEccCCeecCC
Q psy1807 310 YTLIGETRRVCGDGGKWNGTEPQCLFDWCAEPPQISGGIVTT--SGRRTGSVATYSCEPGFILFGSNELKCGLGGDWSGK 387 (618)
Q Consensus 310 y~l~G~~~~~C~~~G~Ws~~~p~C~~~~C~~p~~~~~g~~~~--~~~~~g~~~~~~C~~Gy~l~G~~~~~C~~~G~Ws~~ 387 (618)
|.|.|+..++|. +|+|+++.|.|+.+.|+.| .+.||.+.. ..+.+|++|+|+|++||.|.|+.+++|+.+|+|++.
T Consensus 114 Y~LvG~s~~tC~-dG~WS~~~P~C~~i~C~~P-~i~NG~i~~~~~~y~~G~~Vty~C~~GY~L~Gss~~tC~~nG~Wsp~ 191 (268)
T PHA02831 114 YSIVGNETVKCI-NKQWVPKYPVCKLIRCKYP-ALQNGFLNVFEKKFYYGDIVNFKCKKGFILLGSSVSTCDINSIWYPG 191 (268)
T ss_pred eeEcCCceeEeC-CCcCCCCCCeeeEeeCCCC-CCCCCccccCCCceecCCEEEEEcCCCCEECCCccEEECCCCeECCC
Confidence 999999999996 9999999999999999876 578888764 357899999999999999999999999999999999
Q ss_pred CCceeeecCCCCC
Q psy1807 388 APTCKYVDCGSPA 400 (618)
Q Consensus 388 ~p~C~~~~C~~p~ 400 (618)
.|.|+...|+.|.
T Consensus 192 ~P~C~~~~~~~~~ 204 (268)
T PHA02831 192 IPKCVKDKVHNEI 204 (268)
T ss_pred CCCccCcccCCCC
Confidence 9999988887654
No 9
>cd03597 CLECT_attractin_like C-type lectin-like domain (CTLD) of the type found in human and mouse attractin (AtrN) and attractin-like protein (ALP). CLECT_attractin_like: C-type lectin-like domain (CTLD) of the type found in human and mouse attractin (AtrN) and attractin-like protein (ALP). CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. Mouse AtrN (the product of the mahogany gene) has been shown to bind Agouti protein and to function in agouti-induced pigmentation and obesity. Mutations in AtrN have also been shown to cause spongiform encephalopathy and hypomyelination in rats and hamsters. The cytoplasmic region of mouse ALP has been shown to binds to melanocortin receptor (MCR4). Signaling through MCR4 plays a role in appetite suppression. Attractin may have therapeutic potential in the treatment of obesity. Human attractin (hAtrN) has been shown to be expressed on activated T cells and released extracellularly. The
Probab=99.93 E-value=9e-26 Score=201.66 Aligned_cols=125 Identities=19% Similarity=0.314 Sum_probs=105.7
Q ss_pred ccceeecceEEEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeCCCCCcceeEE
Q psy1807 84 AEITSFDRNCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDPGITARTWKW 163 (618)
Q Consensus 84 ~~w~~~~~~CY~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~~~~~~~~w~W 163 (618)
.||..|+++||+|+.. .++|.+|+.+|+++||+||+|++++|++||.+++...........+||||++.. .+.|+|
T Consensus 3 ~gw~~~~~~Cy~~~~~-~~tw~~A~~~C~~~g~~La~i~~~~E~~fi~~~~~~~~~~~~~~~~WIGl~d~~---~g~w~W 78 (129)
T cd03597 3 EGWHLVGNSCLKINTA-RESYDNAKLYCRNLNAVLASLTTQKKVEFVLKELQKHQMTKQKLTPWVGLRKIN---VSYWCW 78 (129)
T ss_pred CcCEEcCCEEEEEEcC-CCCHHHHHHHHHHcCCEEcCCCCHHHHHHHHHHHHhhcccCCCCceEEeeecCC---CCceEE
Confidence 9999999999999765 699999999999999999999999999999998876432112246899999864 578999
Q ss_pred eCCccc--ccCCCCCCCCCCCCCccceEEecCCCCceeeccccccccccccccC
Q psy1807 164 VDGEVV--TKPSWGKDQPNNYNGEQNCVVLDGGRGWLWNDVGCKLDYLHWICQH 215 (618)
Q Consensus 164 ~DGs~~--~~~~W~~~~P~~~~~~~~Cv~l~~~~~~~W~d~~C~~~~~~~iC~~ 215 (618)
+||+++ .|++|.++||++. ++||.|..+...+|+|.+|.....+||||+
T Consensus 79 ~Dgs~~~~~~~~W~~geP~~~---~~C~~~~~~~~~~w~d~~C~~~~~~~iCe~ 129 (129)
T cd03597 79 EDMSPFTNTTLQWLPGEPSDA---GFCGYLEEPAVSGLKANPCTNPVNGSVCER 129 (129)
T ss_pred CCCCCCCCccccCCCCCCCCc---ccEEEEcccccCccccCCcCCCCcceeecC
Confidence 999988 4789999999973 689999766666899999998435899985
No 10
>cd03588 CLECT_CSPGs C-type lectin-like domain (CTLD) of the type found in chondroitin sulfate proteoglycan core proteins. CLECT_CSPGs: C-type lectin-like domain (CTLD) of the type found in chondroitin sulfate proteoglycan core proteins (CSPGs) in human and chicken aggrecan, frog brevican, and zebra fish dermacan. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with hyaluronan (HA). These aggregates contribute to the tissue's load bearing properties. Aggregates having other CSPGs substituting for aggrecan may contribute to the structural integrity of many different tissues. Xenopus brevican is expressed in the notochord and the brain during early embryogenesis. Zebra fish dermacan is expressed in dermal bones and may play a role in dermal bone development. CSPGs do contain LINK domain(s) which bind HA. These LINK domains are considered by one classif
Probab=99.93 E-value=1.5e-25 Score=199.20 Aligned_cols=120 Identities=25% Similarity=0.562 Sum_probs=104.9
Q ss_pred ccceeecceEEEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeCCCCCcceeEE
Q psy1807 84 AEITSFDRNCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDPGITARTWKW 163 (618)
Q Consensus 84 ~~w~~~~~~CY~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~~~~~~~~w~W 163 (618)
+||+.|+++||+|+.. .++|.+|++.|+.+||+||+|+|++|++||..++. ..+||||++.. .++.|+|
T Consensus 3 ~gw~~~~~~Cy~~~~~-~~sw~~A~~~C~~~gg~La~i~s~~e~~fl~~~~~--------~~~WIGl~~~~--~~~~~~W 71 (124)
T cd03588 3 EGWDKFQGHCYRHFPD-RETWEDAERRCREQQGHLSSIVTPEEQEFVNNNAQ--------DYQWIGLNDRT--IEGDFRW 71 (124)
T ss_pred ccceecCCEEEEEECC-ccCHHHHHHHHHhcCCEEeccCCHHHHHHHHHhcc--------CcEEecceecC--CCCceEe
Confidence 9999999999999765 69999999999999999999999999999988653 24899999765 3789999
Q ss_pred eCCcccccCCCCCCCCCCC-CCccceEEecCCCCceeeccccccccccccccC
Q psy1807 164 VDGEVVTKPSWGKDQPNNY-NGEQNCVVLDGGRGWLWNDVGCKLDYLHWICQH 215 (618)
Q Consensus 164 ~DGs~~~~~~W~~~~P~~~-~~~~~Cv~l~~~~~~~W~d~~C~~~~~~~iC~~ 215 (618)
+||+++.|.+|.+++|++. .+.++||.|......+|+|..|.. ..+||||+
T Consensus 72 ~dg~~~~~~~W~~~~p~~~~~~~~~Cv~~~~~~~~~W~d~~C~~-~~~fICe~ 123 (124)
T cd03588 72 SDGHPLQFENWRPNQPDNFFATGEDCVVMIWHEEGEWNDVPCNY-HLPFTCKK 123 (124)
T ss_pred CCCCcccccCcCCCCCCCCCCCCCCeEEEecCCCCeEcCCCCCC-CCeeeeeC
Confidence 9999999999999999875 345799999644445999999997 67999986
No 11
>cd03589 CLECT_CEL-1_like C-type lectin-like domain (CTLD) of the type found in CEL-1 from Cucumaria echinata and Echinoidin from Anthocidaris crassispina. CLECT_CEL-1_like: C-type lectin-like domain (CTLD) of the type found in CEL-1 from Cucumaria echinata and Echinoidin from Anthocidaris crassispina. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. The CEL-1 CTLD binds three calcium ions and has a high specificity for N-acteylgalactosamine (GalNAc). CEL-1 exhibits strong cytotoxicity which is inhibited by GalNAc. This protein may play a role as a toxin defending against predation. Echinoidin is found in the coelomic fluid of the sea urchin and is specific for GalBeta1-3GalNAc. Echinoidin has a cell adhesive activity towards human cancer cells which is not mediated through the CTLD. Both CEL-1 and Echinoidin are multimeric proteins comprised of multiple dimers linked by disulfide bonds.
Probab=99.93 E-value=2.2e-25 Score=202.03 Aligned_cols=128 Identities=27% Similarity=0.574 Sum_probs=110.4
Q ss_pred ccceeecceEEEEecCCCCCHHHHHHHHHHc-----CCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeCCCCCc
Q psy1807 84 AEITSFDRNCYEFNIDKGASFADARSKCQAS-----GGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDPGITA 158 (618)
Q Consensus 84 ~~w~~~~~~CY~~~~~~~~tw~~A~~~C~~~-----g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~~~~~~ 158 (618)
.||+.|+++||+|+.. .++|.+|+++|+++ +|+||+|+|++|++||..++...........|||||++.. .+
T Consensus 3 ~gw~~~~~~Cy~~~~~-~~~w~~A~~~C~~~~~~g~~~~La~i~s~~e~~~l~~~~~~~~~~~~~~~~WiGl~~~~--~~ 79 (137)
T cd03589 3 TFWTAFGGYCYRFFGD-RLTWEEAELRCRSFSIPGLIAHLVSIHSQEENDFVYDLFESSRGPDTPYGLWIGLHDRT--SE 79 (137)
T ss_pred cchhhhCCEEEEEeCC-CcCHHHHHHHHHhhcCCCCCceEcccCCHHHHHHHHHHHhhccccCCCCcEEEeeecCC--cc
Confidence 9999999999999765 69999999999997 6999999999999999999876421222357999999765 38
Q ss_pred ceeEEeCCcccccCCCCCCCCCCCCCccceEEecCCC--CceeeccccccccccccccC
Q psy1807 159 RTWKWVDGEVVTKPSWGKDQPNNYNGEQNCVVLDGGR--GWLWNDVGCKLDYLHWICQH 215 (618)
Q Consensus 159 ~~w~W~DGs~~~~~~W~~~~P~~~~~~~~Cv~l~~~~--~~~W~d~~C~~~~~~~iC~~ 215 (618)
+.|+|+||+++.|++|.+++|++..+.++|+.|.... ...|++..|+. ..+||||.
T Consensus 80 ~~~~W~dG~~~~~~~W~~~~P~~~~~~~~C~~~~~~~~~~~~W~d~~C~~-~~~fIC~~ 137 (137)
T cd03589 80 GPFEWTDGSPVDFTKWAGGQPDNYGGNEDCVQMWRRGDAGQSWNDMPCDA-VFPYICKM 137 (137)
T ss_pred CceEeCCcCcCCcCCcCCCCCCCCCCCCCceeeecCCCCCCeecCCCCCC-CcceeeeC
Confidence 8999999999999999999999877678999997543 55899999997 68999984
No 12
>cd03599 CLECT_DGCR2_like C-type lectin-like domain (CTLD) of the type found in DGCR2, an integral membrane protein deleted in DiGeorge Syndrome (DGS). CLECT_DGCR2_like: C-type lectin-like domain (CTLD) of the type found in DGCR2, an integral membrane protein deleted in DiGeorge Syndrome (DGS). CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. DGS is also known velo-cardio-facial syndrome (VCFS). DGS is a genetic abnormality that results in malformations of the heart, face, and limbs and is associated with schizophrenia and depressive disorders. DGCR2 is a candidate for involvement in the pathogenesis of DGS since the DGCR2 gene lies within the minimal DGS critical region (MDGRC) of 22q11, which when deleted gives rise to DGS, and the DGCR2 gene is in close proximity to the balanced translocation breakpoint in a DGS patient having a balanced translocation.
Probab=99.92 E-value=1.7e-24 Score=196.43 Aligned_cols=128 Identities=18% Similarity=0.350 Sum_probs=103.0
Q ss_pred ccceeecc--eEEEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccC---C-CCccEEEeEEeC----
Q psy1807 84 AEITSFDR--NCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDR---L-KTLLVWIGAQKD---- 153 (618)
Q Consensus 84 ~~w~~~~~--~CY~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~---~-~~~~~WIGl~~~---- 153 (618)
.||..|++ +||+++.. .++|.+|+++|+++||+||+|++.+|++||..++...... . ....+||||++.
T Consensus 3 ~gw~~~~~~~~CYk~~~~-~~tw~dA~~~C~~~Gg~Lasi~s~~e~~fl~~l~~~~~~~~~~~~~~~~~WIGL~~~~~~~ 81 (153)
T cd03599 3 SGWHHYEGTASCYKVYLS-GENYWDAVQTCQKVNGSLATFTTDQELQFILAQEWDFDERVFGRKDQCKFWVGYQYVITNR 81 (153)
T ss_pred CchhccCCCCeEEEEeCC-cCCHHHHHHHHHHcCCEEcCCCCHHHHHHHHHHHHhhcccccccccCCCEEEeeccccccc
Confidence 99999998 89999765 6999999999999999999999999999999998654321 0 114599999522
Q ss_pred CCCCcceeEEe-CCcccccCC----CCCCCCCCCCCccceEEecC--------CCCceeeccccccccccccccC
Q psy1807 154 PGITARTWKWV-DGEVVTKPS----WGKDQPNNYNGEQNCVVLDG--------GRGWLWNDVGCKLDYLHWICQH 215 (618)
Q Consensus 154 ~~~~~~~w~W~-DGs~~~~~~----W~~~~P~~~~~~~~Cv~l~~--------~~~~~W~d~~C~~~~~~~iC~~ 215 (618)
+...++.|+|+ ||+++.|.+ |.+++|++ ++++|+.|.. ....+|+|..|+. ...||||+
T Consensus 82 ~~~~eg~w~WsddGs~~~y~~w~~~w~~gePn~--~~e~C~~~~~~~~~~~~~~~~~~W~d~~C~~-~~~fiCq~ 153 (153)
T cd03599 82 NHSLEGRWEVAYKGSMEVFLPPEPIFATGMSTN--DNVFCAQLQCFQIPSLRERGLHSWHAENCYE-KSSFLCKR 153 (153)
T ss_pred CcccCCeEEEecCCccceecCccccCCCCCCCC--CCCCCeEEEeeccccccccccCeeeCccCCC-CCcceeCC
Confidence 22358899999 599998664 57889986 4579999962 2334899999998 67999985
No 13
>cd03590 CLECT_DC-SIGN_like C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific intercellular adhesion molecule 3-grabbing non-integrin (DC-SIGN) and the related receptor, DC-SIGN receptor (DC-SIGNR). CLECT_DC-SIGN_like: C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific intercellular adhesion molecule 3-grabbing non-integrin (DC-SIGN) and the related receptor, DC-SIGN receptor (DC-SIGNR). This group also contains proteins similar to hepatic asialoglycoprotein receptor (ASGP-R) and langerin in human. These proteins are type II membrane proteins with a CTLD ectodomain. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. DC-SIGN is thought to mediate the initial contact between dendritic cells and resting T cells, and may also mediate the rolling of DCs on epithelium. DC-SIGN and DC-SIGNR bind to oligosaccharides present on human tissues, as well as, on path
Probab=99.91 E-value=6.3e-24 Score=189.49 Aligned_cols=121 Identities=33% Similarity=0.688 Sum_probs=106.8
Q ss_pred ccceeecceEEEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeCCCCCcceeEE
Q psy1807 84 AEITSFDRNCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDPGITARTWKW 163 (618)
Q Consensus 84 ~~w~~~~~~CY~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~~~~~~~~w~W 163 (618)
.||+.|+++||+|+.. .++|.+|+..|+.+||+||+|+|++|++||.+++.. ...|||||++.. .++.|+|
T Consensus 3 ~~w~~~~~~Cy~~~~~-~~tw~~A~~~C~~~g~~La~i~s~~e~~~l~~~~~~------~~~~WiGl~~~~--~~~~~~W 73 (126)
T cd03590 3 TNWKSFQSSCYFFSTE-KKSWEESRQFCEDMGAHLVIINSQEEQEFISKILSG------NRSYWIGLSDEE--TEGEWKW 73 (126)
T ss_pred CCceEeCCEEEEEeCC-CcCHHHHHHHHHhCCCEEEeeCCHHHHHHHHHHhCC------CCCEEEeeecCC--CcCCeEe
Confidence 9999999999999765 699999999999999999999999999999998862 245999999865 3789999
Q ss_pred eCCccc--ccCCCCCCCCCCCC-CccceEEecCCCCceeeccccccccccccccC
Q psy1807 164 VDGEVV--TKPSWGKDQPNNYN-GEQNCVVLDGGRGWLWNDVGCKLDYLHWICQH 215 (618)
Q Consensus 164 ~DGs~~--~~~~W~~~~P~~~~-~~~~Cv~l~~~~~~~W~d~~C~~~~~~~iC~~ 215 (618)
+||+++ .|.+|.+++|++.. +.++||.|.... +.|.+..|.. ...||||+
T Consensus 74 ~dg~~~~~~~~~W~~~~p~~~~~~~~~C~~~~~~~-~~w~~~~C~~-~~~fiCek 126 (126)
T cd03590 74 VDGTPLNSSKTFWHPGEPNNWGGGGEDCAELVYDS-GGWNDVPCNL-EYRWICEK 126 (126)
T ss_pred cCCCCCCCccCCcCCCcCCCCCCCCCCCEEEECCC-CcEeCcCCCC-CEeeeeeC
Confidence 999998 89999999999864 467999997654 5899999998 67999984
No 14
>cd03594 CLECT_REG-1_like C-type lectin-like domain (CTLD) of the type found in Human REG-1 (lithostathine), REG-4, and avian eggshell-specific proteins: ansocalcin, structhiocalcin-1(SCA-1), and -2(SCA-2). CLECT_REG-1_like: C-type lectin-like domain (CTLD) of the type found in Human REG-1 (lithostathine), REG-4, and avian eggshell-specific proteins: ansocalcin, structhiocalcin-1(SCA-1), and -2(SCA-2). CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. REG-1 is a proliferating factor which participates in various kinds of tissue regeneration including pancreatic beta-cell regeneration, regeneration of intestinal mucosa, regeneration of motor neurons, and perhaps in tissue regeneration of damaged heart. REG-1 may play a role on the pathophysiology of Alzheimer's disease and in the development of gastric cancers. Its expression is correlated with reduced survival from early-stage colorectal cancer. REG-1 also binds and aggregates
Probab=99.90 E-value=1.3e-23 Score=188.27 Aligned_cols=124 Identities=25% Similarity=0.523 Sum_probs=105.4
Q ss_pred ccceeecceEEEEecCCCCCHHHHHHHHHHc--CCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeCCCCCccee
Q psy1807 84 AEITSFDRNCYEFNIDKGASFADARSKCQAS--GGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDPGITARTW 161 (618)
Q Consensus 84 ~~w~~~~~~CY~~~~~~~~tw~~A~~~C~~~--g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~~~~~~~~w 161 (618)
.||+.|+++||+|+.. .++|.+|++.|+.+ ||+||+|++++|++||..++..... ....|||||++... ++.|
T Consensus 3 ~gw~~~~~~Cy~~~~~-~~tw~~A~~~C~~~~~g~~La~i~s~~e~~~l~~~~~~~~~--~~~~~WiGl~~~~~--~~~~ 77 (129)
T cd03594 3 KGWLPYKGNCYGYFRQ-PLSWSDAELFCQKYGPGAHLASIHSPAEAAAIASLISSYQK--AYQPVWIGLHDPQQ--SRGW 77 (129)
T ss_pred CCCEecCCEeeeEecc-CcCHHHHHHHHHhcCCCceEcccCCHHHHHHHHHHHHhhcc--CCccEEEeeccCCC--CCce
Confidence 9999999999999755 69999999999999 5999999999999999998876431 12469999998653 6799
Q ss_pred EEeCCcccccCCCCCCCCCCCCCccceEEecCCCC-ceeeccccccccccccccC
Q psy1807 162 KWVDGEVVTKPSWGKDQPNNYNGEQNCVVLDGGRG-WLWNDVGCKLDYLHWICQH 215 (618)
Q Consensus 162 ~W~DGs~~~~~~W~~~~P~~~~~~~~Cv~l~~~~~-~~W~d~~C~~~~~~~iC~~ 215 (618)
+|+||+++.|.+|++++|. ...++||.|....+ ..|++..|.. ..+||||+
T Consensus 78 ~W~dg~~~~~~~W~~~~p~--~~~~~Cv~~~~~~~~~~W~~~~C~~-~~~fICe~ 129 (129)
T cd03594 78 EWSDGSKLDYRSWDRNPPY--ARGGYCAELSRSTGFLKWNDANCEE-RNPFICKY 129 (129)
T ss_pred EeCCCCcceecccCCCCCC--CCCCCceEEEecCCCCeEECCCCCC-CceeeeeC
Confidence 9999999999999999993 33579999975443 3799999998 67999985
No 15
>cd03596 CLECT_tetranectin_like C-type lectin-like domain (CTLD) of the type found in the tetranectin (TN), cartilage derived C-type lectin (CLECSF1), and stem cell growth factor (SCGF). CLECT_tetranectin_like: C-type lectin-like domain (CTLD) of the type found in the tetranectin (TN), cartilage derived C-type lectin (CLECSF1), and stem cell growth factor (SCGF). CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. TN binds to plasminogen and stimulates activation of plasminogen, playing a key role in the regulation of proteolytic processes. The TN CTLD binds two calcium ions. Its calcium free form binds to various kringle-like protein ligands. Two residues involved in the coordination of calcium are critical for the binding of TN to the fourth kringle (K4) domain of plasminogen (Plg K4). TN binds the kringle 1-4 form of angiostatin (AST K1-4). AST K1-4 is a fragment of Plg, commonly found in cancer tissues. TN inhibits the bin
Probab=99.90 E-value=2e-23 Score=187.01 Aligned_cols=124 Identities=22% Similarity=0.445 Sum_probs=105.2
Q ss_pred ccceeecceEEEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeCCCCCcceeEE
Q psy1807 84 AEITSFDRNCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDPGITARTWKW 163 (618)
Q Consensus 84 ~~w~~~~~~CY~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~~~~~~~~w~W 163 (618)
.||.. +++||+|+.. .++|.+|++.|+++||+||+|+|.+|++||..++..... ....|||||++.. .++.|+|
T Consensus 3 ~g~~~-~~~CY~~~~~-~~~w~~A~~~C~~~g~~La~i~s~~e~~~l~~~~~~~~~--~~~~~WiGl~~~~--~~~~w~w 76 (129)
T cd03596 3 KGTKI-HKKCYLVSEE-TKHYHEASEDCIARGGTLATPRDSDENDALRDYVKASVP--GNWEVWLGINDMV--AEGKWVD 76 (129)
T ss_pred CCcEE-CCEEEEEecc-cCCHHHHHHHHHhcCCeEecCCCHHHHHHHHHHHHhccC--CCCcEEEeccccC--ccCeEEe
Confidence 88755 7899999755 699999999999999999999999999999998865432 2346999999865 3889999
Q ss_pred eCCcccccCCCCC---CCCCCCCCccceEEecCCCCceeeccccccccccccccC
Q psy1807 164 VDGEVVTKPSWGK---DQPNNYNGEQNCVVLDGGRGWLWNDVGCKLDYLHWICQH 215 (618)
Q Consensus 164 ~DGs~~~~~~W~~---~~P~~~~~~~~Cv~l~~~~~~~W~d~~C~~~~~~~iC~~ 215 (618)
+||+++.|.+|++ ++|++. ..++||.|.....++|+|..|.. .++||||+
T Consensus 77 ~dG~~~~~~~W~~~~~~~p~~~-~~~~Cv~l~~~~~~~W~d~~C~~-~~~fICe~ 129 (129)
T cd03596 77 VNGSPISYFNWEREITAQPDGG-KRENCVALSSSAQGKWFDEDCRR-EKPYVCEF 129 (129)
T ss_pred CCCCCccccccCCCCCCCCCCC-CCCCCEEEccCCCCcCcCccCCC-CCceeccC
Confidence 9999999999997 899863 45799999765455899999997 67999985
No 16
>PHA02817 EEV Host range protein; Provisional
Probab=99.89 E-value=1.2e-22 Score=193.29 Aligned_cols=123 Identities=28% Similarity=0.614 Sum_probs=110.8
Q ss_pred CceeeecCCCCCccCCCceEeecCcccccceEEEEcCCC-----ceeccCCcceEEeccCCeeeCCCCceeeecCCCCCc
Q psy1807 389 PTCKYVDCGSPANIDNGHYSLLNGTTTHGSVAEYTCDDD-----YWLEAGAMRRQVCMRDGKWSGDTPSCELITCPEPEV 463 (618)
Q Consensus 389 p~C~~~~C~~p~~~~~g~~~~~~~~~~~g~~~~~~C~~G-----y~l~~~G~~~~~C~~~G~Ws~~~p~C~~~~C~~p~~ 463 (618)
-.|+.+.|+.|+.+.||.+......|.+|++|+|+|++| |.|. |...++|+.+|+|+++.|.|+.+.|+.|.+
T Consensus 17 ~~C~~~~C~~PP~i~NG~i~~~~~~Y~~Gd~Vty~C~~G~~~~gY~L~--G~~~i~C~~dG~Ws~~~P~C~~v~C~~P~i 94 (225)
T PHA02817 17 SLCDLNKCCYPPSIKNGYIYNKKTEYNIGSNVTFFCGNNTRGVRYTLV--GEKNIICEKDGKWNKEFPVCKIIRCRFPAL 94 (225)
T ss_pred CEecCcCCCCCCCCCCCEEecCCCceeCCCEEEEEeCCCCCCCCEEEC--CCCeEEECCCCcCCCCCCeeeeeECCCCCC
Confidence 458889999999999999876666799999999999997 8999 999999999999999999999999999874
Q ss_pred CCCcEEEc----ccCCCCcEEEEEcCCCCeeeCCceeEeCcCCcccCCCCceEEe
Q psy1807 464 PKGGYVVG----YDLNVHSTLEFHCEPGHLLVGQASLKCTRQGDWDNMSPQFISF 514 (618)
Q Consensus 464 ~~~g~~~~----~~~~~g~~v~~~C~~Gy~l~G~~~~~C~~~G~Ws~~~P~C~~~ 514 (618)
.||.+.. ..|.+|++|+|+|++||.|.|+..++|+++|+|+++.|+|...
T Consensus 95 -~NG~v~~~~~~~~y~yg~~Vty~C~~Gy~L~G~~~~tC~~~G~WSp~~P~C~~~ 148 (225)
T PHA02817 95 -QNGFVNGIPDSKKFYYESEVSFSCKPGFVLIGTKYSVCGINSSWIPKVPICSRD 148 (225)
T ss_pred -cCceeEccccCCceEcCCEEEEEcCCCCEEcCCCceEECCCCeECCCCCEeecC
Confidence 4676653 3688999999999999999999999999999999999999874
No 17
>cd03593 CLECT_NK_receptors_like C-type lectin-like domain (CTLD) of the type found in natural killer cell receptors (NKRs). CLECT_NK_receptors_like: C-type lectin-like domain (CTLD) of the type found in natural killer cell receptors (NKRs), including proteins similar to oxidized low density lipoprotein (OxLDL) receptor (LOX-1), CD94, CD69, NKG2-A and -D, osteoclast inhibitory lectin (OCIL), dendritic cell-associated C-type lectin-1 (dectin-1), human myeloid inhibitory C-type lectin-like receptor (MICL), mast cell-associated functional antigen (MAFA), killer cell lectin-like receptors: subfamily F, member 1 (KLRF1) and subfamily B, member 1 (KLRB1), and lys49 receptors. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. NKRs are variously associated with activation or inhibition of natural killer (NK) cells. Activating NKRs stimulate cytolysis by NK cells of virally infected or transformed cells; inhibitory NKRs block cytolysis up
Probab=99.88 E-value=1.1e-22 Score=178.61 Aligned_cols=114 Identities=19% Similarity=0.519 Sum_probs=100.0
Q ss_pred ccceeecceEEEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeCCCCCcceeEE
Q psy1807 84 AEITSFDRNCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDPGITARTWKW 163 (618)
Q Consensus 84 ~~w~~~~~~CY~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~~~~~~~~w~W 163 (618)
.||+.|+++||+|+.. .++|.+|+..|+.+||+||+|+|++|++||..++.. ..+||||++.. .++.|+|
T Consensus 3 ~gw~~~~~~Cy~~~~~-~~~w~~A~~~C~~~g~~La~i~s~~e~~~l~~~~~~-------~~~WiGl~~~~--~~~~~~W 72 (116)
T cd03593 3 KDWICYGNKCYYFSME-KKTWNESKEACSSKNSSLLKIDDEEELEFLQSQIGS-------SSYWIGLSREK--SEKPWKW 72 (116)
T ss_pred CcCEEeCCEEEEEEcC-CCCHHHHHHHHHhCCCcEEEECCHHHHHHHHHhcCC-------CceEEEEeecC--CCCCeEc
Confidence 9999999999999765 599999999999999999999999999999998721 35899999875 3789999
Q ss_pred eCCcccccCCCCCCCCCCCCCccceEEecCCCCceeeccccccccccccccC
Q psy1807 164 VDGEVVTKPSWGKDQPNNYNGEQNCVVLDGGRGWLWNDVGCKLDYLHWICQH 215 (618)
Q Consensus 164 ~DGs~~~~~~W~~~~P~~~~~~~~Cv~l~~~~~~~W~d~~C~~~~~~~iC~~ 215 (618)
+||+++. +|...+|++ ..++|++|... .|.+..|.. ..+||||+
T Consensus 73 ~dg~~~~--~~~~~~~~~--~~~~C~~~~~~---~w~~~~C~~-~~~~IC~k 116 (116)
T cd03593 73 IDGSPLN--NLFNIRGST--KSGNCAYLSST---GIYSEDCST-KKRWICEK 116 (116)
T ss_pred cCCCccc--ccccccCCC--CCCCceEEcCC---cEEcccCCc-CceeeeeC
Confidence 9999987 788888765 46799999765 699999997 67999985
No 18
>cd03598 CLECT_EMBP_like C-type lectin-like domain (CTLD) of the type found in the human proteins, eosinophil major basic protein (EMBP) and prepro major basic protein homolog (MBPH). CLECT_EMBP_like: C-type lectin-like domain (CTLD) of the type found in the human proteins, eosinophil major basic protein (EMBP) and prepro major basic protein homolog (MBPH). CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. Eosinophils and basophils carry out various functions in allergic, parasitic, and inflammatory diseases. EMBP is stored in eosinophil crystalloid granules and is released upon degranulation. EMBP is also expressed in basophils. The proform of EMBP is expressed in placental X cells and breast tissue and increases significantly during human pregnancy. EMBP has cytotoxic properties and damages bacteria and mammalian cells, in vitro, as well as, helminth parasites. EMBP deposition has been observed in the inflamed tissue of all
Probab=99.88 E-value=1.5e-22 Score=177.97 Aligned_cols=116 Identities=29% Similarity=0.510 Sum_probs=98.4
Q ss_pred ceEEEEecCCCCCHHHHHHHHHHc-CCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeCCCCCcceeEEeCCccc
Q psy1807 91 RNCYEFNIDKGASFADARSKCQAS-GGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDPGITARTWKWVDGEVV 169 (618)
Q Consensus 91 ~~CY~~~~~~~~tw~~A~~~C~~~-g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~~~~~~~~w~W~DGs~~ 169 (618)
++||+|+.. .++|.+|+..|+++ ||+||+|+|++|++||.+++.... ...+||||++.....++.|.|+||+++
T Consensus 1 ~~Cy~~~~~-~~t~~~A~~~C~~~~g~~La~i~s~~e~~~l~~~~~~~~----~~~~WiGl~~~~~~~~~~~~W~dg~~~ 75 (117)
T cd03598 1 GRCYRFVKS-PRTFRDAQVICRRCYRGNLASIHSFAFNYRVQRLVSTLN----QAQVWIGGIITGKGRCRRFSWVDGSVW 75 (117)
T ss_pred CceEEEecC-CCCHHHHHHHhhcCCCceEeeecChhHhHHHHHHHhCCC----CCCEEEeeEcCCCCcCCeeEeCCCCcc
Confidence 589999765 69999999999995 999999999999999999885422 246999999862123789999999999
Q ss_pred ccCCCCCCCCCCCCCccceEEecCCCCceeeccccccccccccccC
Q psy1807 170 TKPSWGKDQPNNYNGEQNCVVLDGGRGWLWNDVGCKLDYLHWICQH 215 (618)
Q Consensus 170 ~~~~W~~~~P~~~~~~~~Cv~l~~~~~~~W~d~~C~~~~~~~iC~~ 215 (618)
.|.+|.+++|++ +.++|+.|....+ +|++..|.. .++|||++
T Consensus 76 ~y~~W~~g~p~~--~~~~Cv~~~~~~g-~W~~~~C~~-~~~fiC~~ 117 (117)
T cd03598 76 NYAYWAPGQPGN--RRGHCVELCTRGG-HWRRAHCKL-RRPFICSY 117 (117)
T ss_pred CcCCCCCCCCCC--CCCCcEEEeCCCC-eECCCcCCC-CceeeecC
Confidence 999999999998 4578999975443 899999997 78999984
No 19
>cd03591 CLECT_collectin_like C-type lectin-like domain (CTLD) of the type found in human collectins including lung surfactant proteins A and D, mannose- or mannan binding lectin (MBL), and CL-L1 (collectin liver 1). CLECT_collectin_like: C-type lectin-like domain (CTLD) of the type found in human collectins including lung surfactant proteins A and D, mannose- or mannan binding lectin (MBL), and CL-L1 (collectin liver 1). CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. The CTLDs of these collectins bind carbohydrates on surfaces (e.g. pathogens, allergens, necrotic, or apoptotic cells) and mediate functions associated with killing and phagocytosis. MBPs recognize high mannose oligosaccharides in a calcium dependent manner, bind to a broad range of pathogens, and trigger cell killing by activating the complement pathway. MBP also acts directly as an opsonin. SP-A and SP-D in addition to functioning as host defense components, a
Probab=99.88 E-value=2.4e-22 Score=175.93 Aligned_cols=114 Identities=25% Similarity=0.525 Sum_probs=99.5
Q ss_pred ceEEEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeCCCCCcceeEEeCCcccc
Q psy1807 91 RNCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDPGITARTWKWVDGEVVT 170 (618)
Q Consensus 91 ~~CY~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~~~~~~~~w~W~DGs~~~ 170 (618)
.+||.++ .+.++|.+|+.+|+.+||+||+|+|.+|++||..++.. . ...+||||++.. .++.|+|+||+++.
T Consensus 1 ~k~y~~~-~~~~~w~~A~~~C~~~g~~La~i~s~~e~~~l~~~~~~-~----~~~~WiGl~~~~--~~~~~~w~dg~~~~ 72 (114)
T cd03591 1 EKIFVTN-GEEKNFDDAQKLCSEAGGTLAMPRNAAENAAIASYVKK-G----NTYAFIGITDLE--TEGQFVYLDGGPLT 72 (114)
T ss_pred CcEEEeC-CceeCHHHHHHHHhhcCCEEecCCCHHHHHHHHHHHhc-C----CccEEEecccCC--cCCcEEeCCCCCcc
Confidence 3799996 45799999999999999999999999999999999876 1 246999999765 38899999999999
Q ss_pred cCCCCCCCCCCCCCccceEEecCCCCceeeccccccccccccccC
Q psy1807 171 KPSWGKDQPNNYNGEQNCVVLDGGRGWLWNDVGCKLDYLHWICQH 215 (618)
Q Consensus 171 ~~~W~~~~P~~~~~~~~Cv~l~~~~~~~W~d~~C~~~~~~~iC~~ 215 (618)
|++|.+++|++..+.++||.|... ..|++.+|+. ..+||||+
T Consensus 73 y~~W~~~ep~~~~~~~~Cv~~~~~--~~W~~~~C~~-~~~fICe~ 114 (114)
T cd03591 73 YTNWKPGEPNNAGGGEDCVEMYTS--GKWNDVACNL-TRLFVCEF 114 (114)
T ss_pred cCCcCCCCCCCCCCCCCeEEECCC--CcCcCccCCC-CeeEEeeC
Confidence 999999999987667899999743 3899999998 68999984
No 20
>cd03603 CLECT_VCBS A bacterial subgroup of the C-type lectin-like (CTLD) domain; a subgroup of bacterial protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. CLECT_VCBS: A bacterial subgroup of the C-type lectin-like (CTLD) domain; a subgroup of bacterial protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. Many CTLDs are calcium-dependent carbohydrate binding modules; other CTLDs bind protein ligands, lipids, and inorganic surfaces including CaCO3 and ice. Bacterial CTLDs within this group are functionally uncharacterized. Animal C-type lectins are involved in such functions as extracellular matrix organization, endocytosis, complement activation, pathogen recognition, and cell-cell interactions. CTLDs may bind a variety of carbohydrate ligands including mannose, N-acetylglucosamine, galactose, N-acetylgalactosamine, and fucose. CTLDs associate with each other through several different surface
Probab=99.87 E-value=4.4e-22 Score=175.07 Aligned_cols=113 Identities=30% Similarity=0.543 Sum_probs=97.2
Q ss_pred eEEEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeCCCCCcceeEEeCCccccc
Q psy1807 92 NCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDPGITARTWKWVDGEVVTK 171 (618)
Q Consensus 92 ~CY~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~~~~~~~~w~W~DGs~~~~ 171 (618)
+||+|+. +.++|.+|+..|+++||+||+|+|++|++||..++.. ...+||||++.. .++.|+|+||+++.|
T Consensus 1 ~~Y~~~~-~~~sw~~A~~~C~~~g~~La~I~s~~E~~fv~~~~~~------~~~~WiG~~~~~--~~~~w~W~dg~~~~~ 71 (118)
T cd03603 1 HFYKFVD-GGMTWEAAQTLAESLGGHLVTINSAEENDWLLSNFGG------YGASWIGASDAA--TEGTWKWSDGEESTY 71 (118)
T ss_pred CeEEEeC-CCcCHHHHHHHHHHcCCEEcccCCHHHHHHHHHHhcc------CCCEEEeeecCC--CCCceEeCCCCcCCC
Confidence 5899965 4699999999999999999999999999999998861 235899999865 388999999999999
Q ss_pred CCCCCCCC-CCCCCccceEEecCC--CCceeeccccc-ccccccccc
Q psy1807 172 PSWGKDQP-NNYNGEQNCVVLDGG--RGWLWNDVGCK-LDYLHWICQ 214 (618)
Q Consensus 172 ~~W~~~~P-~~~~~~~~Cv~l~~~--~~~~W~d~~C~-~~~~~~iC~ 214 (618)
.+|.+++| ++..+.++||.|... ..++|+|..|+ . +..||||
T Consensus 72 ~~W~~~eP~~~~~~~~~Cv~~~~~~~~~~~W~d~~C~~~-~~~~iCe 117 (118)
T cd03603 72 TNWGSGEPHNNGGGNEDYAAINHFPGISGKWNDLANSYN-TLGYVIE 117 (118)
T ss_pred CCcCCCCCCCCCCCCcCeEEeecCCCCCCcCccCCCCcc-ccceEEe
Confidence 99999999 766667899998642 23489999999 7 6799997
No 21
>cd03601 CLECT_TC14_like C-type lectin-like domain (CTLD) of the type found in lectins TC14, TC14-2, TC14-3, and TC14-4 from the budding tunicate Polyandrocarpa misakiensis and PfG6 from the Acorn worm. CLECT_TC14_like: C-type lectin-like domain (CTLD) of the type found in lectins TC14, TC14-2, TC14-3, and TC14-4 from the budding tunicate Polyandrocarpa misakiensis and PfG6 from the Acorn worm. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. TC14 is homodimeric. The CTLD of TC14 binds D-galactose and D-fucose. TC14 is expressed constitutively by multipotent epithelial and mesenchymal cells and plays in role during budding, in inducing the aggregation of undifferentiated mesenchymal cells to give rise to epithelial forming tissue. TC14-2 and TC14-3 shows calcium-dependent galactose binding activity. TC14-3 is a cytostatic factor which blocks cell growth and dedifferentiation of the atrial epithelium during asexual reproducti
Probab=99.86 E-value=1.7e-21 Score=171.73 Aligned_cols=114 Identities=22% Similarity=0.311 Sum_probs=90.5
Q ss_pred EEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeC-CCCCcceeEEeCCccc--c
Q psy1807 94 YEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKD-PGITARTWKWVDGEVV--T 170 (618)
Q Consensus 94 Y~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~-~~~~~~~w~W~DGs~~--~ 170 (618)
|+|+ .+.++|.+|+.+|+.+||+||+|+|++| .++. .+..... .....|||||++. . .++.|+|+||+++ .
T Consensus 3 ~~~~-~~~~~w~~A~~~C~~~G~~La~i~s~~e-~~~~-~i~~~~~-~~~~~~WIGl~d~~~--~~g~~~W~dG~~~~~~ 76 (119)
T cd03601 3 ILCS-DETMNYAKAGAFCRSRGMRLASLAMRDS-EMRD-AILAFTL-VKGHGYWVGADNLQD--GEYDFLWNDGVSLPTD 76 (119)
T ss_pred EEEc-CCcCCHHHHHHHHHhcCCEEeeecCHHH-HHHH-HHHhccc-cCCccEEEEeccCCC--CCCCeEeCCCCCcCCC
Confidence 6675 4469999999999999999999999988 3333 3332211 1124699999986 5 3889999999998 6
Q ss_pred cCCCCCCCCCCCCCccceEEecCCCCceeeccccccccccccccC
Q psy1807 171 KPSWGKDQPNNYNGEQNCVVLDGGRGWLWNDVGCKLDYLHWICQH 215 (618)
Q Consensus 171 ~~~W~~~~P~~~~~~~~Cv~l~~~~~~~W~d~~C~~~~~~~iC~~ 215 (618)
|++|+++||++..+.++||.|....+ +|+|..|.. ..+||||+
T Consensus 77 y~~W~~geP~~~~~~e~Cv~~~~~~~-~W~d~~C~~-~~~fICek 119 (119)
T cd03601 77 SDLWAPNEPSNPQSRQLCVQLWSKYN-LLDDEYCGR-AKRVICEK 119 (119)
T ss_pred CCccCCCcCcCcCCCcCCeEEeCCCC-CEeCccCCC-CceeeecC
Confidence 99999999998766789999965433 899999997 67999985
No 22
>cd03592 CLECT_selectins_like C-type lectin-like domain (CTLD) of the type found in the type 1 transmembrane proteins: P(platlet)-, E(endothelial)-, and L(leukocyte)- selectins (sels). CLECT_selectins_like: C-type lectin-like domain (CTLD) of the type found in the type 1 transmembrane proteins: P(platlet)-, E(endothelial)-, and L(leukocyte)- selectins (sels). CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. P- E- and L-sels are cell adhesion receptors that mediate the initial attachment, tethering, and rolling of lymphocytes on inflamed vascular walls enabling subsequent lymphocyte adhesion and transmigration. L- sel is expressed constitutively on most leukocytes. P-sel is stored in the Weibel-Palade bodies of endothelial cells and in the alpha granules of platlets. E- sels are present on endothelial cells. Following platelet and/or endothelial cell activation P- sel is rapidly translocated to the cell surface and E-sel exp
Probab=99.86 E-value=1.5e-21 Score=171.08 Aligned_cols=113 Identities=25% Similarity=0.414 Sum_probs=96.4
Q ss_pred EEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeCCCCCcceeEEeCCcccccCC
Q psy1807 94 YEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDPGITARTWKWVDGEVVTKPS 173 (618)
Q Consensus 94 Y~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~~~~~~~~w~W~DGs~~~~~~ 173 (618)
|+|+. +.++|++|+.+|+.+||+||+|+|++|++||..++.... ...|||||++.. ..+.|.|.||+++.|.+
T Consensus 3 Y~~~~-~~~~w~~A~~~C~~~g~~La~i~s~~e~~~i~~~~~~~~----~~~~WiG~~~~~--~~~~W~~~dg~~~~y~~ 75 (115)
T cd03592 3 YHYST-EKMTFNEAVKYCKSRGTDLVAIQNAEENALLNGFALKYN----LGYYWIDGNDIN--NEGTWVDTDKKELEYKN 75 (115)
T ss_pred EEEcC-CccCHHHHHHHHHHcCCeEeecCCHHHHHHHHHHHHhcC----CCCEEEeCccCC--ccCeEEeCCCCcccccc
Confidence 77754 469999999999999999999999999999999875432 236999999865 37889999999999999
Q ss_pred CCCCCCCCCCCccceEEecCCCCceeeccccccccccccccC
Q psy1807 174 WGKDQPNNYNGEQNCVVLDGGRGWLWNDVGCKLDYLHWICQH 215 (618)
Q Consensus 174 W~~~~P~~~~~~~~Cv~l~~~~~~~W~d~~C~~~~~~~iC~~ 215 (618)
|++++|++.. .++|+.|......+|+|.+|.. ..+||||+
T Consensus 76 W~~geP~~~~-~~~Cv~~~~~~~g~W~d~~C~~-~~~fICe~ 115 (115)
T cd03592 76 WAPGEPNNGR-NENCLEIYIKDNGKWNDEPCSK-KKSAICYT 115 (115)
T ss_pred cCCCCCCCCC-CCCceEEccCCCCCCcCcCCCC-CccceeCC
Confidence 9999999865 6799999764344899999997 67999985
No 23
>PHA02817 EEV Host range protein; Provisional
Probab=99.86 E-value=2.9e-21 Score=183.90 Aligned_cols=122 Identities=31% Similarity=0.667 Sum_probs=110.3
Q ss_pred ceeeecCCCCCccCCeeEEecCCCcccCCEEEEEeCCC-----ceecCCCeEEeCCCCCcCCCCCcccccccCCCCCccc
Q psy1807 272 TCQYVDCGKLEHIEHGTVTLETTRTTHGAVAIYACHEN-----YTLIGETRRVCGDGGKWNGTEPQCLFDWCAEPPQISG 346 (618)
Q Consensus 272 ~C~~~~C~~~~~~~ng~~~~~~~~~~~g~~~~~~C~~G-----y~l~G~~~~~C~~~G~Ws~~~p~C~~~~C~~p~~~~~ 346 (618)
.|+.+.|+.|+.+.||.+......|.+|+.++|+|++| |+|.|+..++|+.+|+|+++.|.|+.+.|+.|. +.|
T Consensus 18 ~C~~~~C~~PP~i~NG~i~~~~~~Y~~Gd~Vty~C~~G~~~~gY~L~G~~~i~C~~dG~Ws~~~P~C~~v~C~~P~-i~N 96 (225)
T PHA02817 18 LCDLNKCCYPPSIKNGYIYNKKTEYNIGSNVTFFCGNNTRGVRYTLVGEKNIICEKDGKWNKEFPVCKIIRCRFPA-LQN 96 (225)
T ss_pred EecCcCCCCCCCCCCCEEecCCCceeCCCEEEEEeCCCCCCCCEEECCCCeEEECCCCcCCCCCCeeeeeECCCCC-CcC
Confidence 58889999999999999876666799999999999997 899999999999999999989999999999764 778
Q ss_pred cEEEe----cccccCceeeecCCCCcEEcCCceEEEccCCeecCCCCceeee
Q psy1807 347 GIVTT----SGRRTGSVATYSCEPGFILFGSNELKCGLGGDWSGKAPTCKYV 394 (618)
Q Consensus 347 g~~~~----~~~~~g~~~~~~C~~Gy~l~G~~~~~C~~~G~Ws~~~p~C~~~ 394 (618)
|.+.. ..+.+|++|+|+|++||.|.|+..++|+++|+|+++.|.|+..
T Consensus 97 G~v~~~~~~~~y~yg~~Vty~C~~Gy~L~G~~~~tC~~~G~WSp~~P~C~~~ 148 (225)
T PHA02817 97 GFVNGIPDSKKFYYESEVSFSCKPGFVLIGTKYSVCGINSSWIPKVPICSRD 148 (225)
T ss_pred ceeEccccCCceEcCCEEEEEcCCCCEEcCCCceEECCCCeECCCCCEeecC
Confidence 87753 3578999999999999999999999999999999999999853
No 24
>PHA02642 C-type lectin-like protein; Provisional
Probab=99.85 E-value=3.6e-21 Score=181.16 Aligned_cols=116 Identities=20% Similarity=0.381 Sum_probs=100.9
Q ss_pred ccccccccCccceeecceEEEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeCC
Q psy1807 75 RCASEKAASAEITSFDRNCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDP 154 (618)
Q Consensus 75 ~Cp~~~~~~~~w~~~~~~CY~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~~ 154 (618)
.|| .||+.|+++||+|+.. .++|.+|++.|+++||+||+|++++|++||+++... ..|||||++..
T Consensus 87 ~CP------~gW~~~~~kCYyfs~~-~ksW~eA~~~C~s~ga~La~I~seeE~~FL~~~~~~-------~~yWIGLsd~~ 152 (216)
T PHA02642 87 TCP------KGWIGFGYKCFYFSED-SKNWTFGNTFCTSLGATLVKVETEEELNFLKRYKDS-------SDHWIGLNRES 152 (216)
T ss_pred CCC------CcCEEECCEEEEEeCc-ccCHHHHHHHHhhCCCeEeeECCHHHHHHHHHhhcC-------CeEEEEeEeCC
Confidence 688 9999999999999754 699999999999999999999999999999986432 24999999976
Q ss_pred CCCcceeEEeCCcccccCCCCCCCCCCCCCccceEEecCCCCceeeccccccccccccccCCC
Q psy1807 155 GITARTWKWVDGEVVTKPSWGKDQPNNYNGEQNCVVLDGGRGWLWNDVGCKLDYLHWICQHNP 217 (618)
Q Consensus 155 ~~~~~~w~W~DGs~~~~~~W~~~~P~~~~~~~~Cv~l~~~~~~~W~d~~C~~~~~~~iC~~~~ 217 (618)
. ++.|+|+||+++.+.+|..| .++||+++.. .+.+..|.. ...+||++..
T Consensus 153 ~--e~~W~WvDGS~~n~~~~i~G-------~g~CAyLs~~---~i~s~~C~~-~~~wIC~K~l 202 (216)
T PHA02642 153 S--NHPWKWADNSNYNASFVITG-------TGECAYLNDI---RISSSRVYA-NRKWICSKTY 202 (216)
T ss_pred C--CCceEECCCCccCcceeccC-------CCceEEEeCC---ceEccCcCC-CceEEeeeec
Confidence 3 88999999999998888653 4689999777 789999998 5799999853
No 25
>cd03600 CLECT_thrombomodulin_like C-type lectin-like domain (CTLD) of the type found in human thrombomodulin(TM), Endosialin, C14orf27, and C1qR. CLECT_thrombomodulin_like: C-type lectin-like domain (CTLD) of the type found in human thrombomodulin(TM), Endosialin, C14orf27, and C1qR. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. In these thrombomodulin-like proteins the residues involved in coordinating Ca2+ in the classical MBP-A CTLD are not conserved. TM exerts anti-fibrinolytic and anti-inflammatory activity. TM also regulates blood coagulation in the anticoagulant protein C pathway. In this pathway, the procoagulant properties of thrombin (T) are lost when it binds TM. TM also plays a key role in tumor biology. It is expressed on endothelial cells and on several type of tumor cell including squamous cell carcinoma. Loss of TM expression correlates with advanced stage and poor prognosis. Loss of function of TM func
Probab=99.85 E-value=6.9e-21 Score=173.22 Aligned_cols=124 Identities=24% Similarity=0.531 Sum_probs=99.3
Q ss_pred cceEEEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccC--CCCccEEEeEEeCCC------CCccee
Q psy1807 90 DRNCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDR--LKTLLVWIGAQKDPG------ITARTW 161 (618)
Q Consensus 90 ~~~CY~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~--~~~~~~WIGl~~~~~------~~~~~w 161 (618)
+++||+++.. .++|.+|+.+|+++||+||+|+|.+|++||..++...... .....+||||++... ..++.|
T Consensus 3 ~~~Cy~~~~~-~~sw~~A~~~C~~~gg~La~i~s~~E~~~v~~~l~~~~~~~~~~~~~~WIGl~~~~~~~~~~~~~~~~f 81 (141)
T cd03600 3 SDACYTLHPQ-KLTFLEAQRSCIELGGNLATVRSGEEADVVSLLLAAGPGRHGRGSLRLWIGLQREPRQCSDPSLPLRGF 81 (141)
T ss_pred CCceEEEeCC-ccCHHHHHHHHHhhCCEeeecCCHHHHHHHHHHHhhccccccCCCccEEEeEecCcccCccccccCCcc
Confidence 6799999765 6999999999999999999999999999999998764210 013469999997421 125789
Q ss_pred EEeCCc-ccccCCCCCCCCCCCCCccceEEecCCC----CceeeccccccccccccccC
Q psy1807 162 KWVDGE-VVTKPSWGKDQPNNYNGEQNCVVLDGGR----GWLWNDVGCKLDYLHWICQH 215 (618)
Q Consensus 162 ~W~DGs-~~~~~~W~~~~P~~~~~~~~Cv~l~~~~----~~~W~d~~C~~~~~~~iC~~ 215 (618)
+|+||. .+.|++|+++ |++..+.++||.|.... .++|+|.+|+.....||||+
T Consensus 82 ~W~d~~~~~~y~~W~~~-p~n~~~~~~Cv~l~~~~~~~~~~~W~d~~C~~~~~~fIC~~ 139 (141)
T cd03600 82 SWVTGDQDTDFSNWLQE-PAGTCTSPRCVALSAAGSTPDNLKWKDGPCSARADGYLCKF 139 (141)
T ss_pred EECCCCCCCCccccccC-CCCCCCCCccEEEEccCCCCCCCccccCCcCCCCCCeEEee
Confidence 999976 5789999998 77765678999996543 56999999998435799996
No 26
>smart00034 CLECT C-type lectin (CTL) or carbohydrate-recognition domain (CRD). Many of these domains function as calcium-dependent carbohydrate binding modules.
Probab=99.84 E-value=9.6e-21 Score=168.17 Aligned_cols=123 Identities=30% Similarity=0.658 Sum_probs=104.7
Q ss_pred ccceeecceEEEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeCCCCCcceeEE
Q psy1807 84 AEITSFDRNCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDPGITARTWKW 163 (618)
Q Consensus 84 ~~w~~~~~~CY~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~~~~~~~~w~W 163 (618)
.||+.++++||+++.. .++|.+|+.+|+.++|+||+|++++|++||..++.... .....|||||++.. ..+.|+|
T Consensus 3 ~~w~~~~~~Cy~~~~~-~~~~~~A~~~C~~~~~~La~i~~~~e~~~i~~~~~~~~--~~~~~~WiG~~~~~--~~~~~~W 77 (126)
T smart00034 3 SGWISYGGKCYKFSTE-KKTWADAQAFCQSLGAHLASIHSEAENDFVASLLKNSG--SNSDYYWIGLSDPD--SNGSWQW 77 (126)
T ss_pred cchhhhCCEEEEEECC-ccCHHHHHHHHHhcCCEEcccCCHHHHHHHHHHHHhhc--CCCCCEEEecCccC--cCCCeEE
Confidence 9999999999999765 59999999999999999999999999999999987742 12346999999743 3789999
Q ss_pred eCCcc-cccCCCCCCCCCCCCCccceEEecCCCCceeecccccccccccccc
Q psy1807 164 VDGEV-VTKPSWGKDQPNNYNGEQNCVVLDGGRGWLWNDVGCKLDYLHWICQ 214 (618)
Q Consensus 164 ~DGs~-~~~~~W~~~~P~~~~~~~~Cv~l~~~~~~~W~d~~C~~~~~~~iC~ 214 (618)
+||++ +.|.+|.++ ++. ...++|+.|.....+.|.+.+|.. ..+||||
T Consensus 78 ~dg~~~~~~~~w~~~-~~~-~~~~~C~~~~~~~~~~w~~~~C~~-~~~~ICe 126 (126)
T smart00034 78 SDGSGPVNYSNWAPG-EPN-GGSGDCVVLSTSGGGKWNDVSCTS-KLPFVCE 126 (126)
T ss_pred CCCCCCCCccccCCC-CCC-CCCCCCEEEecCCCCcccCCCCCC-CcccccC
Confidence 99998 789999998 433 346799999866546899999997 7799997
No 27
>cd03595 CLECT_chondrolectin_like C-type lectin-like domain (CTLD) of the type found in the human type-1A transmembrane proteins chondrolectin (CHODL) and layilin. CLECT_chondrolectin_like: C-type lectin-like domain (CTLD) of the type found in the human type-1A transmembrane proteins chondrolectin (CHODL) and layilin. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. CHODL is predominantly expressed in muscle cells and is associated with T-cell maturation. Various alternatively spliced isoforms have been of CHODL have been identified. The transmembrane form of CHODL is localized in the ER-Golgi apparatus. Layilin is widely expressed in different cell types. The extracellular CTLD of layilin binds hyaluronan (HA), a major constituent of the extracellular matrix (ECM). The cytoplasmic tail of layilin binds various members of the band 4.1/ERM superfamily (talin, radixin, and merlin). The ERM proteins are cytoskeleton-membrane l
Probab=99.84 E-value=1.1e-20 Score=173.24 Aligned_cols=122 Identities=24% Similarity=0.554 Sum_probs=98.2
Q ss_pred ecceEEEEe----cCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeCCCC------Cc
Q psy1807 89 FDRNCYEFN----IDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDPGI------TA 158 (618)
Q Consensus 89 ~~~~CY~~~----~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~~~~------~~ 158 (618)
|+++||+++ ..+.++|.+|+..|+.+||+||+|+|++|++||..++..... ....+||||++.... ..
T Consensus 8 ~~~~Cy~~~~~~~~~~~~tw~~A~~~C~~~g~~LasI~s~~E~~~i~~~i~~~~~--~~~~~WIGl~~~~~~~~~~~~~~ 85 (149)
T cd03595 8 TEKPCYKIAYFQDSRRRLNFEEARQACREDGGELLSIESENEQKLIERFIQTLRA--SDGDFWIGLRRSSQYNVTSSACS 85 (149)
T ss_pred CCCccEEEEEEeccccccCHHHHHHHHHHcCCEECccCCHHHHHHHHHHHHhhcC--CCCcEEEEeECCCCccccccccC
Confidence 558999864 245799999999999999999999999999999998865322 234699999975421 13
Q ss_pred ceeEEeCCcccccCCCCCCCCCCCCCccceEEecCC----------CCceeeccccccccccccccC
Q psy1807 159 RTWKWVDGEVVTKPSWGKDQPNNYNGEQNCVVLDGG----------RGWLWNDVGCKLDYLHWICQH 215 (618)
Q Consensus 159 ~~w~W~DGs~~~~~~W~~~~P~~~~~~~~Cv~l~~~----------~~~~W~d~~C~~~~~~~iC~~ 215 (618)
+.|+|+||+++.|.+|.+++|++ +.++|+.|... ..++|+|..|+. ..+||||+
T Consensus 86 ~~~~W~dG~~~~y~~W~~~eP~~--~~~~Cv~l~~~~~~~~~~~~~~~~~W~d~~C~~-~~~fICe~ 149 (149)
T cd03595 86 SLYYWLDGSISTFRNWYVDEPSC--GSEVCVVMYHQPSAPAGQGGPYLFQWNDDNCNM-KNNFICKY 149 (149)
T ss_pred CccEEcCCCccCccCCCCCCCCC--cccCCEEEEecCCCCcCcCcccCCCccCCCCCC-CcccccCC
Confidence 46999999999999999999986 35799998531 134899999998 67999984
No 28
>PHA02953 IEV and EEV membrane glycoprotein; Provisional
Probab=99.83 E-value=1.7e-20 Score=172.98 Aligned_cols=116 Identities=17% Similarity=0.258 Sum_probs=93.0
Q ss_pred CccccccccCccceeecceEEEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeC
Q psy1807 74 DRCASEKAASAEITSFDRNCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKD 153 (618)
Q Consensus 74 ~~Cp~~~~~~~~w~~~~~~CY~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~ 153 (618)
..|| .||..|+++||+++.. .++|++|+.+|+.+||+|++|+ +|+.||..+... ..+||||++
T Consensus 45 ~~CP------~GW~~~~~~CYk~f~~-~~tW~~A~~~C~~~Gg~L~~~~--~e~~fv~~~~~~-------~~~WIGL~d- 107 (170)
T PHA02953 45 SACA------KGWIPYDNYCYLDTNI-QLSTYGAVYLCNKYRARLPKPN--FRHLKVLSLTYG-------KDFWVSLKK- 107 (170)
T ss_pred CcCc------ccChhhCCEEEEEECC-cCCHHHHHHHHHhcCCCCCCCc--HHHHHHHHhccC-------CCEEEeEEC-
Confidence 4799 9999999999999755 6999999999999999998877 667888765431 248999997
Q ss_pred CCCCcceeEEeCC-cccccCC------CCCCCCCCCCCccceEEecCCCCceeeccccccccccccccCC
Q psy1807 154 PGITARTWKWVDG-EVVTKPS------WGKDQPNNYNGEQNCVVLDGGRGWLWNDVGCKLDYLHWICQHN 216 (618)
Q Consensus 154 ~~~~~~~w~W~DG-s~~~~~~------W~~~~P~~~~~~~~Cv~l~~~~~~~W~d~~C~~~~~~~iC~~~ 216 (618)
. ++.|+|.|| +.++|.+ |. ++|++ +.++|+.+..+ +|+|..|.. ..+||||+.
T Consensus 108 ~---eg~W~w~Dggs~~~y~~~~~~~~w~-~~~~~--~~e~C~~~~~~---~W~d~~C~~-~~~fICqk~ 167 (170)
T PHA02953 108 K---NNRWLDINTNKTVDMNKNTELKKIK-SKTKN--DNEACYIYKSG---ELKETVCNS-VNYIICVKR 167 (170)
T ss_pred C---CCceEeCCCCeeecccccccccccc-CCCCC--CCCCceEEeCC---eEEeccCCC-CcEEEEEEe
Confidence 2 779999996 5666554 66 45544 46799998653 799999998 569999873
No 29
>cd03602 CLECT_1 C-type lectin (CTL)/C-type lectin-like (CTLD) domain subgroup 1; a subgroup of protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. CLECT_1: C-type lectin (CTL)/C-type lectin-like (CTLD) domain subgroup 1; a subgroup of protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. Many CTLDs are calcium-dependent carbohydrate binding modules; other CTLDs bind protein ligands, lipids, and inorganic surfaces including CaCO3 and ice. Animal C-type lectins are involved in such functions as extracellular matrix organization, endocytosis, complement activation, pathogen recognition, and cell-cell interactions. CTLDs may bind a variety of carbohydrate ligands including mannose, N-acetylglucosamine, galactose, N-acetylgalactosamine, and fucose. CTLDs associate with each other through several different surfaces to form dimers, trimers, or tetramers from which ligand-binding sites project in differe
Probab=99.82 E-value=4.8e-20 Score=159.68 Aligned_cols=108 Identities=20% Similarity=0.453 Sum_probs=92.9
Q ss_pred eEEEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeCCCCCcceeEEeCCccccc
Q psy1807 92 NCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDPGITARTWKWVDGEVVTK 171 (618)
Q Consensus 92 ~CY~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~~~~~~~~w~W~DGs~~~~ 171 (618)
++|+|+. +.++|.+|+++|+.+||+||+|+|++|++||..++.. ....|||||++. ++.|.|+||+++.|
T Consensus 1 ~~y~~~~-~~~~w~~A~~~C~~~g~~La~i~s~~e~~~l~~~~~~-----~~~~~WiGl~~~----~~~~~W~dg~~~~~ 70 (108)
T cd03602 1 RTFYLVN-ESKTWSEAQQYCRENYTDLATVQNQEDNALLSNLSRV-----SNSAAWIGLYRD----VDSWRWSDGSESSF 70 (108)
T ss_pred CceEEec-cccCHHHHHHHHHHHCCccCeecCHHHHHHHHHHHhc-----cCCcEEEEEECC----CCceEEcCCCCCcc
Confidence 4788864 4699999999999999999999999999999998862 124699999975 67899999999999
Q ss_pred CCCCCCCCCCCCCccceEEecCCCCceeeccccccccccccccC
Q psy1807 172 PSWGKDQPNNYNGEQNCVVLDGGRGWLWNDVGCKLDYLHWICQH 215 (618)
Q Consensus 172 ~~W~~~~P~~~~~~~~Cv~l~~~~~~~W~d~~C~~~~~~~iC~~ 215 (618)
.+|.+++|.. .++||.|... .+|++..|.. ..+|||+.
T Consensus 71 ~~w~~~~~~~---~~~C~~~~~~--~~w~~~~C~~-~~~fIC~~ 108 (108)
T cd03602 71 RNWNTFQPFG---QGDCATMYSS--GRWYAALCSA-LKPFICYD 108 (108)
T ss_pred CccCCCCCCC---CCCeeEECcC--CeECcccCCC-CcCEeccC
Confidence 9999998753 5799999764 3899999997 67999984
No 30
>PHA03097 C-type lectin-like protein; Provisional
Probab=99.80 E-value=1.8e-19 Score=164.95 Aligned_cols=111 Identities=21% Similarity=0.309 Sum_probs=91.8
Q ss_pred ccccccccCccceeecceEEEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeCC
Q psy1807 75 RCASEKAASAEITSFDRNCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDP 154 (618)
Q Consensus 75 ~Cp~~~~~~~~w~~~~~~CY~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~~ 154 (618)
.|| .||+.|+++||+|+.. .++|.+|+..|+++||+||+|++++|++||.+++.. ..|||||++..
T Consensus 45 ~CP------~gW~~~~~~CY~~s~~-~~sW~~A~~~C~~~g~~La~I~~~~E~~fi~~~~~~-------~~~WIGL~d~~ 110 (157)
T PHA03097 45 NCR------SGWVGYNNKCYTFSEN-ITNKHLAIERCADMDGILTLIDDQKEVLFVSRYKGG-------QDLWIGIEKKK 110 (157)
T ss_pred CCC------CCceeeCCEEEEEecC-CCcHHHHHHHHHhCCCEEeeeCCHHHHHHHHHhcCC-------CCEEEeeecCC
Confidence 688 9999999999999755 599999999999999999999999999999987632 35999999863
Q ss_pred CCCcceeEEeCCcccccCCCCCCCCCCCCCccceEEecCCCCceeeccccccccccccccCC
Q psy1807 155 GITARTWKWVDGEVVTKPSWGKDQPNNYNGEQNCVVLDGGRGWLWNDVGCKLDYLHWICQHN 216 (618)
Q Consensus 155 ~~~~~~w~W~DGs~~~~~~W~~~~P~~~~~~~~Cv~l~~~~~~~W~d~~C~~~~~~~iC~~~ 216 (618)
. |.||+++.. +|.+ + .+.++||++... .|+|..|.. ...||||+.
T Consensus 111 ---~----W~dgs~~~~-~~~~--~---~~~e~Cv~i~~~---~w~d~~C~~-~~~~ICek~ 155 (157)
T PHA03097 111 ---G----DDDDREVLD-KVVK--P---PKSGKCAYLKDK---TIISSNCNA-TKGWICFDR 155 (157)
T ss_pred ---C----ccCCCcccc-cccC--C---CCCCCEEEEECC---cEEeCCCCC-CeeEEEeec
Confidence 2 999987643 4432 2 235789999765 699999998 679999974
No 31
>TIGR00864 PCC polycystin cation channel protein. Note: this model has been restricted to the amino half because for technical reasons.
Probab=99.80 E-value=1.2e-19 Score=217.60 Aligned_cols=129 Identities=22% Similarity=0.366 Sum_probs=108.3
Q ss_pred CccccccccCccceee--cceEEEEecCCCCCHHHHHHHHHHc-CCeecccCChHHHHHHHHHhhhcccCCCCccEEEeE
Q psy1807 74 DRCASEKAASAEITSF--DRNCYEFNIDKGASFADARSKCQAS-GGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGA 150 (618)
Q Consensus 74 ~~Cp~~~~~~~~w~~~--~~~CY~~~~~~~~tw~~A~~~C~~~-g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl 150 (618)
..|| .||..| +++||+|+.. .++|.+|+..|+++ ||+||+|+|++|++||.+++.... ...+||||
T Consensus 316 ~~CP------~GW~~f~~~g~CYk~~~e-~~TW~dAe~~C~s~GGAhLAsI~S~eEn~FL~~lv~~s~----~~~vWIGL 384 (2740)
T TIGR00864 316 PHCP------KDGEIFEENGHCFQIVPE-EAAWLDAQEQCLARAGAALAIVDNDALQNFLARKVTHSL----DRGVWIGF 384 (2740)
T ss_pred CCCC------CCCeecCCCCEEEEEeCC-ccCHHHHHHHhhccCCeEEecCCCHHHHHHHHHHhhccC----CccEEEee
Confidence 3799 999765 8999999765 69999999999999 599999999999999999886532 23599999
Q ss_pred EeCCCCCcc--eeEEeCCccc-ccCCCCCCCCCCCCCccceEEecCCCCceeeccccccccccccccCCCCC
Q psy1807 151 QKDPGITAR--TWKWVDGEVV-TKPSWGKDQPNNYNGEQNCVVLDGGRGWLWNDVGCKLDYLHWICQHNPAN 219 (618)
Q Consensus 151 ~~~~~~~~~--~w~W~DGs~~-~~~~W~~~~P~~~~~~~~Cv~l~~~~~~~W~d~~C~~~~~~~iC~~~~~~ 219 (618)
++... ++ .|+|.||+++ .|++|.++||++.. .++|+.|... + .|+|..|.. ...||||+.+..
T Consensus 385 sD~~~--EG~g~WvWsDGS~l~~YtnW~pGEPNn~~-~EdCV~l~~~-g-~WND~~Cs~-~~~FICE~~~~~ 450 (2740)
T TIGR00864 385 SDVNG--AEKGPAHQGEAFEAEECEEGLAGEPHPAR-AEHCVRLDPR-G-QCNSDLCNA-PHAYVCELNPGG 450 (2740)
T ss_pred eCCCC--CCceeeEECCCCcccCccCcCCCCCCCCC-CCCCEEEcCC-C-CEEccCCCC-CeeEEeEECCCC
Confidence 98653 44 4999999998 79999999999864 5899998654 2 699999998 679999986543
No 32
>PHA02867 C-type lectin protein; Provisional
Probab=99.76 E-value=4.5e-18 Score=154.56 Aligned_cols=109 Identities=17% Similarity=0.308 Sum_probs=88.3
Q ss_pred CccccccccCccceeecceEEEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeC
Q psy1807 74 DRCASEKAASAEITSFDRNCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKD 153 (618)
Q Consensus 74 ~~Cp~~~~~~~~w~~~~~~CY~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~ 153 (618)
..|| .||+.|+++||+|+.+ .++|.+|+..|.++||+||+|++++|++||..+.. ..|||||++.
T Consensus 47 ~~CP------~gWi~~~~~CY~fs~~-~~tW~~A~~~C~~~ga~La~I~s~eE~~Fl~~~~~--------~~~WIGLs~~ 111 (167)
T PHA02867 47 KVCP------DEWIGYNSKCYYFTIN-ETNWNDSKKLCDVMDSSLIRFDNIETLNFVSRYGK--------GSYWIDINQN 111 (167)
T ss_pred CCCC------CCCEEECCEEEEEecc-ccCHHHHHHHHhhCCCEECCcCCHHHHHHHHHcCC--------CCEEEEEEeC
Confidence 3688 9999999999999654 69999999999999999999999999999987532 2499999986
Q ss_pred CCCCcceeEEeCCcccccCCCCCCCCCCCCCccceEEecCCCCceeeccccccccccccccCC
Q psy1807 154 PGITARTWKWVDGEVVTKPSWGKDQPNNYNGEQNCVVLDGGRGWLWNDVGCKLDYLHWICQHN 216 (618)
Q Consensus 154 ~~~~~~~w~W~DGs~~~~~~W~~~~P~~~~~~~~Cv~l~~~~~~~W~d~~C~~~~~~~iC~~~ 216 (618)
. .|.|.+++.+ .. ..++++|++++.+ ...+..|.. .+.+||++.
T Consensus 112 ~-----~~~~~~~s~~-----~~-----~~~~~~Ca~i~~~---~i~s~~C~~-~~~wIC~K~ 155 (167)
T PHA02867 112 R-----KIPGINFSLY-----YE-----QGVNDICLLFDTS---NIIEMSCIF-HERTICVKE 155 (167)
T ss_pred C-----CCCCccCcee-----ee-----cCCCCcEEEEeCC---eEEeecccC-CcEEEEEcc
Confidence 4 3456555433 11 2346789999887 678999998 679999984
No 33
>cd00037 CLECT C-type lectin (CTL)/C-type lectin-like (CTLD) domain. CLECT: C-type lectin (CTL)/C-type lectin-like (CTLD) domain; protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. This group is chiefly comprised of eukaryotic CTLDs, but contains some, as yet functionally uncharacterized, bacterial CTLDs. Many CTLDs are calcium-dependent carbohydrate binding modules; other CTLDs bind protein ligands, lipids, and inorganic surfaces, including CaCO3 and ice. Animal C-type lectins are involved in such functions as extracellular matrix organization, endocytosis, complement activation, pathogen recognition, and cell-cell interactions. For example: mannose-binding lectin and lung surfactant proteins A and D bind carbohydrates on surfaces (e.g. pathogens, allergens, necrotic, and apoptotic cells) and mediate functions associated with killing and phagocytosis; P (platlet)-, E (endothelial)-, and L (leukocyte)- selectins (sels) mediate the initia
Probab=99.62 E-value=3.2e-15 Score=129.86 Aligned_cols=115 Identities=34% Similarity=0.742 Sum_probs=96.5
Q ss_pred eEEEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeCCCCCcceeEEeCCcc-cc
Q psy1807 92 NCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDPGITARTWKWVDGEV-VT 170 (618)
Q Consensus 92 ~CY~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~~~~~~~~w~W~DGs~-~~ 170 (618)
.||+++.. .++|++|++.|+.+||+|++|++.+|.+||..++.. .....+|||+++.. ..+.|.|.||+. ..
T Consensus 1 ~Cy~~~~~-~~~~~~A~~~C~~~~~~L~~~~~~~e~~~i~~~~~~----~~~~~~wvg~~~~~--~~~~~~~~~~~~~~~ 73 (116)
T cd00037 1 SCYKFSTE-KLTWEEAQEYCRSLGGHLASIHSEEENDFLASLLKK----SSSSDVWIGLNDLS--SEGTWKWSDGSPLVD 73 (116)
T ss_pred CCEEEcCC-ccCHHHHHHHHHHcCCEEcccCCHHHHHHHHHHHhC----CCCCCEEEcccccC--cCCCeEecCCCcccc
Confidence 49999655 799999999999999999999999999999999874 12346999999875 378999999998 78
Q ss_pred cCCCCCCCCCCCCCccceEEecCCCCceeeccccccccccccccC
Q psy1807 171 KPSWGKDQPNNYNGEQNCVVLDGGRGWLWNDVGCKLDYLHWICQH 215 (618)
Q Consensus 171 ~~~W~~~~P~~~~~~~~Cv~l~~~~~~~W~d~~C~~~~~~~iC~~ 215 (618)
+.+|.+++|.. ....+|+.+.......|.+.+|.. ..+|||++
T Consensus 74 ~~~w~~~~~~~-~~~~~C~~~~~~~~~~~~~~~C~~-~~~~iC~~ 116 (116)
T cd00037 74 YTNWAPGEPNP-GGSEDCVVLSSSSDGKWNDVSCSS-KLPFICEK 116 (116)
T ss_pred ccCCCCCCcCC-CCCCCeeEEccCCCCCccCCCCCC-CceeeecC
Confidence 89999999832 336799999765445899999996 67999974
No 34
>PF00059 Lectin_C: Lectin C-type domain; InterPro: IPR001304 Lectins occur in plants, animals, bacteria and viruses. Initially described for their carbohydrate-binding activity [], they are now recognised as a more diverse group of proteins, some of which are involved in protein-protein, protein-lipid or protein-nucleic acid interactions []. There are at least twelve structural families of lectins: C-type lectins, which are Ca+-dependent. S-type (galectins), a widespread family of glycan-binding proteins []. I-type, which have an immunoglobulin-like fold and can recognise sialic acids, other sugars and glycosaminoglycans []. P-type, which bind phosphomannosyl receptors []. Pentraxins []. (Trout) egg lectins. Calreticulin and calnexin, which act as molecular chaperones of the endoplasmic reticulum []. ERGIC-53 and VIP-36 []. Discoidins []. Eel aggutinins (fucolectins) []. Annexin lectins []. Fibrinogen-type lectins, which includes ficolins, tachylectins 5A and 5B, and Limax flavus (Spotted garden slug) agglutinin (these proteins have clear distinctions from one another, but they share a homologous fibrinogen-like domain used for carbohydrate binding). Also unclassified orphan lectins, including amphoterin, Cel-II, complement factor H, thrombospondin, sailic acid-binding lectins, adherence lectin, and cytokins (such as tumour necrosis factor and several interleukins). C-type lectins can be further divided into seven subgroups based on additional non-lectin domains and gene structure: (I) hyalectans, (II) asialoglycoprotein receptors, (III) collectins, (IV) selectins, (V) NK group transmembrane receptors, (VI) macrophage mannose receptors, and (VII) simple (single domain) lectins []. Therefore, lectins are a diverse group of proteins, both in terms of structure and activity. Carbohydrate binding ability may have evolved independently and sporadically in numerous unrelated families, where each evolved a structure that was conserved to fulfil some other activity and function. In general, animal lectins act as recognition molecules within the immune system, their functions involving defence against pathogens, cell trafficking, immune regulation and the prevention of autoimmunity [].; GO: 0005488 binding; PDB: 1T8D_A 2H2T_B 1T8C_A 2H2R_A 1TN3_A 1RJH_A 1HTN_A 3G8K_B 2E3X_B 1UMR_D ....
Probab=99.58 E-value=5.2e-15 Score=126.76 Aligned_cols=100 Identities=28% Similarity=0.588 Sum_probs=81.0
Q ss_pred CCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeCCCCCcceeEEeCCcccc----cCCCCC
Q psy1807 101 GASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDPGITARTWKWVDGEVVT----KPSWGK 176 (618)
Q Consensus 101 ~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~~~~~~~~w~W~DGs~~~----~~~W~~ 176 (618)
+++|.+|+++|+.+||+|+.|.+.+|++||..++.. ...+|||| +.. ..+.|+|.||++.. |.+|
T Consensus 2 ~~~~~~A~~~C~~~~~~L~~i~~~~e~~~i~~~~~~------~~~~Wig~-~~~--~~~~~~w~~~~~~~~~~~~~~w-- 70 (105)
T PF00059_consen 2 PMTWEEAQQYCQSMGAHLASINSEEENDFIQSQLKS------NESYWIGL-DSD--NNGTWKWIDGSPNSPENFYTNW-- 70 (105)
T ss_dssp EEEHHHHHHHHHHTTSEEB-GSSHHHHHHHHHHHHS------SSEEEEEE-ESS--STSEEEETTSSBSSSTTBSGCB--
T ss_pred CCCHHHHHHHHhcCCCEEeEeCCHHHhhhhhhcccc------cceeeeee-ecc--ccceeccccCCCcccccccccc--
Confidence 479999999999999999999999999999999871 24699999 333 38899999998765 5678
Q ss_pred CCCCCCCCccceEEecCCCCceeeccccccccccccccC
Q psy1807 177 DQPNNYNGEQNCVVLDGGRGWLWNDVGCKLDYLHWICQH 215 (618)
Q Consensus 177 ~~P~~~~~~~~Cv~l~~~~~~~W~d~~C~~~~~~~iC~~ 215 (618)
. ......+|++|.....+.|.+.+|.. ..+|||++
T Consensus 71 -~--~~~~~~~C~~~~~~~~~~w~~~~C~~-~~~fiCek 105 (105)
T PF00059_consen 71 -N--PPNDSENCAYIYYSSSGKWNDVPCSE-KYPFICEK 105 (105)
T ss_dssp -S--SGGSSEEEEEEGCSTTTEEEEEETTS-EEEEEEEE
T ss_pred -c--cCCCCCCeEEEEEcCCCeEEeeCCCC-CeeEEeeC
Confidence 1 22335799999555556999999998 78999985
No 35
>PHA02911 C-type lectin-like protein; Provisional
Probab=99.30 E-value=5.9e-12 Score=114.60 Aligned_cols=106 Identities=15% Similarity=0.264 Sum_probs=75.0
Q ss_pred ccCccceeecceEEEEecCC-CCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeCCCCCcc
Q psy1807 81 AASAEITSFDRNCYEFNIDK-GASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDPGITAR 159 (618)
Q Consensus 81 ~~~~~w~~~~~~CY~~~~~~-~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~~~~~~~ 159 (618)
||+.+|..|+++||+|+..+ ..+|++|+.+|..+||+|+. .+.+|++||+. +|.|+. +
T Consensus 102 pC~~nW~~H~nkCYlFS~~s~sWsW~~Srr~C~~kGATLLk-~sdEELeFIqn-------------~wkGks-------~ 160 (213)
T PHA02911 102 ICSRDWINHMGICLLSLGEEVGFRMEIAKRFCEKKDADLIG-KIDEEKKALEN-------------IWTGND-------H 160 (213)
T ss_pred CCchhhHhhCCEEEEEeccccccchhHHHHHHHhcCCEecc-CcHHHHHHHHH-------------HHhccc-------c
Confidence 47799999999999996542 34559999999999999999 77799999996 355542 3
Q ss_pred eeEEeCCcccccCCCCCCCCCCCCCccceEEecCCC---CceeeccccccccccccccCCC
Q psy1807 160 TWKWVDGEVVTKPSWGKDQPNNYNGEQNCVVLDGGR---GWLWNDVGCKLDYLHWICQHNP 217 (618)
Q Consensus 160 ~w~W~DGs~~~~~~W~~~~P~~~~~~~~Cv~l~~~~---~~~W~d~~C~~~~~~~iC~~~~ 217 (618)
.|.|+||.- ..+- .+...+||.+...+ -.+-....|.. .+.+|||+..
T Consensus 161 ~~fWID~r~-ngSt--------~n~s~sCA~Is~~~~~~~~~V~sesCs~-~~~wICqK~~ 211 (213)
T PHA02911 161 SRFWIDNRA-AAST--------FDPVNECAIGTQNHIPEVPEVLKSPCDE-RHSFICIKKD 211 (213)
T ss_pred ceEEecCcc-ccce--------ecCCCCeEEEEcccccCCCceEccccCC-CceEEEEecc
Confidence 566666631 0000 11145799997630 01456678998 5799999853
No 36
>cd00033 CCP Complement control protein (CCP) modules (aka short consensus repeats SCRs or SUSHI repeats) have been identified in several proteins of the complement system. SUSHI repeats (short complement-like repeat, SCR) are abundant in complement control proteins. The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.
Probab=98.98 E-value=9.8e-10 Score=82.79 Aligned_cols=55 Identities=36% Similarity=0.797 Sum_probs=50.7
Q ss_pred CCCCCcCCCcEEEc--ccCCCCcEEEEEcCCCCeeeCCceeEeCcCCcccCCCCceE
Q psy1807 458 CPEPEVPKGGYVVG--YDLNVHSTLEFHCEPGHLLVGQASLKCTRQGDWDNMSPQFI 512 (618)
Q Consensus 458 C~~p~~~~~g~~~~--~~~~~g~~v~~~C~~Gy~l~G~~~~~C~~~G~Ws~~~P~C~ 512 (618)
|+.|..+.||.+.. ..|.+|+.|+|.|++||.|.|+..++|+.+|+|++..|.|+
T Consensus 1 C~~p~~~~~g~~~~~~~~~~~g~~~~~~C~~Gy~~~g~~~~~C~~~g~W~~~~p~C~ 57 (57)
T cd00033 1 CPPPPVPENGTVTGSKGSYSYGSTVTYSCNEGYTLVGSSTITCTENGGWSPPPPTCE 57 (57)
T ss_pred CcCCCCCCCcEEecCCCcccCCCEEEEECCCCCeEeCCCeeEECCCCeECCCCcccC
Confidence 78888888998887 57999999999999999999999999999999999999984
No 37
>PF00084 Sushi: Sushi domain (SCR repeat); InterPro: IPR000436 Sushi domains are also known as Complement control protein (CCP) modules, or short consensus repeats (SCR), exist in a wide variety of complement and adhesion proteins. The structure is known for this domain, it is based on a beta-sandwich arrangement; one face made up of three beta-strands hydrogen-bonded to form a triple-stranded region at its centre and the other face formed from two separate beta-strands []. CD21 (also called C3d receptor, CR2, Epstein Barr virus receptor or EBV-R) is the receptor for EBV and for C3d, C3dg and iC3b. Complement components may activate B cells through CD21. CD21 is part of a large signal-transduction complex that also involves CD19, CD81, and Leu13. Some of the proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Complement decay-accelerating factor (Antigen CD55) belongs to the Cromer blood group system and is associated with Cr(a), Dr(a), Es(a), Tc(a/b/c), Wd(a), WES(a/b), IFC and UMC antigens. Complement receptor type 1 (C3b/C4b receptor) (Antigen CD35) belongs to the Knops blood group system and is associated with Kn(a/b), McC(a), Sl(a) and Yk(a) antigens. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; PDB: 1HFH_A 1HFI_A 1E5G_A 2UWN_A 2V8E_A 2XWB_H 3HS0_I 2OK5_A 2XWJ_J 3HRZ_D ....
Probab=98.94 E-value=8.8e-10 Score=82.81 Aligned_cols=56 Identities=32% Similarity=0.712 Sum_probs=50.5
Q ss_pred CCCCCccCCCceEeecCcccccceEEEEcCCCceeccCCcceEEeccCCeeeCCCCce
Q psy1807 396 CGSPANIDNGHYSLLNGTTTHGSVAEYTCDDDYWLEAGAMRRQVCMRDGKWSGDTPSC 453 (618)
Q Consensus 396 C~~p~~~~~g~~~~~~~~~~~g~~~~~~C~~Gy~l~~~G~~~~~C~~~G~Ws~~~p~C 453 (618)
|+.|+.+.||.+...+..|.+|++|+|+|++||.+. |...++|+.+|+|++..|.|
T Consensus 1 C~~P~~~~~~~~~~~~~~~~~g~~~~~~C~~Gy~~~--g~~~~~C~~~g~W~~~~p~C 56 (56)
T PF00084_consen 1 CPPPPPIPNGSISCTNNPYSYGSTVTFSCNPGYELS--GSSTITCQSNGQWSPPPPTC 56 (56)
T ss_dssp EESESCBTTEEEESSSSSEETTEEEEEEESTTEEEE--SSSEEEEETTSEESSTTSEE
T ss_pred CcCCCCCCCcEEecCCCCccCCCEEEEcCCCCCEec--CCCEEEECCCCEECCCCCCC
Confidence 667888999977666678999999999999999999 99999999999999988887
No 38
>smart00032 CCP Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. A missense mutation in seventh CCP domain causes deficiency of the b subunit of factor XIII.
Probab=98.93 E-value=2.1e-09 Score=80.85 Aligned_cols=54 Identities=28% Similarity=0.744 Sum_probs=48.8
Q ss_pred CCCCCcCCCcEEEcc---cCCCCcEEEEEcCCCCeeeCCceeEeCcCCcccCCCCce
Q psy1807 458 CPEPEVPKGGYVVGY---DLNVHSTLEFHCEPGHLLVGQASLKCTRQGDWDNMSPQF 511 (618)
Q Consensus 458 C~~p~~~~~g~~~~~---~~~~g~~v~~~C~~Gy~l~G~~~~~C~~~G~Ws~~~P~C 511 (618)
|+.|..+.||.+... .|.+|++++|.|++||.|.|...++|+.+|+|++..|.|
T Consensus 1 C~~p~~~~~g~~~~~~~~~~~~g~~~~~~C~~Gy~l~g~~~~~C~~~g~Ws~~~p~C 57 (57)
T smart00032 1 CPPPPDIENGTITSSNKGTYSVGDTVTYSCNPGYTLIGSSTITCLEDGTWSPPAPTC 57 (57)
T ss_pred CcCCCCCCCCEEecCCCCeecCCCEEEEEcCCCCEEcCCCeeEECCCCEECCCCCcC
Confidence 778888888888764 589999999999999999999999999999999998887
No 39
>cd00033 CCP Complement control protein (CCP) modules (aka short consensus repeats SCRs or SUSHI repeats) have been identified in several proteins of the complement system. SUSHI repeats (short complement-like repeat, SCR) are abundant in complement control proteins. The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.
Probab=98.90 E-value=2.5e-09 Score=80.51 Aligned_cols=57 Identities=32% Similarity=0.711 Sum_probs=52.0
Q ss_pred CCCCCccCCCceEeecCcccccceEEEEcCCCceeccCCcceEEeccCCeeeCCCCcee
Q psy1807 396 CGSPANIDNGHYSLLNGTTTHGSVAEYTCDDDYWLEAGAMRRQVCMRDGKWSGDTPSCE 454 (618)
Q Consensus 396 C~~p~~~~~g~~~~~~~~~~~g~~~~~~C~~Gy~l~~~G~~~~~C~~~G~Ws~~~p~C~ 454 (618)
|+.|+.+.||.+......|.+|+.++|.|++||.|. |...++|+.+|.|++..|.|+
T Consensus 1 C~~p~~~~~g~~~~~~~~~~~g~~~~~~C~~Gy~~~--g~~~~~C~~~g~W~~~~p~C~ 57 (57)
T cd00033 1 CPPPPVPENGTVTGSKGSYSYGSTVTYSCNEGYTLV--GSSTITCTENGGWSPPPPTCE 57 (57)
T ss_pred CcCCCCCCCcEEecCCCcccCCCEEEEECCCCCeEe--CCCeeEECCCCeECCCCcccC
Confidence 788888889999876667999999999999999999 999999999999999888884
No 40
>smart00032 CCP Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. A missense mutation in seventh CCP domain causes deficiency of the b subunit of factor XIII.
Probab=98.90 E-value=2.7e-09 Score=80.21 Aligned_cols=56 Identities=36% Similarity=0.819 Sum_probs=50.7
Q ss_pred CCCCCccCCCceEeec-CcccccceEEEEcCCCceeccCCcceEEeccCCeeeCCCCce
Q psy1807 396 CGSPANIDNGHYSLLN-GTTTHGSVAEYTCDDDYWLEAGAMRRQVCMRDGKWSGDTPSC 453 (618)
Q Consensus 396 C~~p~~~~~g~~~~~~-~~~~~g~~~~~~C~~Gy~l~~~G~~~~~C~~~G~Ws~~~p~C 453 (618)
|+.|+.+.||.+.... ..|.+|++++|.|++||.|. |...++|+.+|+|+...|.|
T Consensus 1 C~~p~~~~~g~~~~~~~~~~~~g~~~~~~C~~Gy~l~--g~~~~~C~~~g~Ws~~~p~C 57 (57)
T smart00032 1 CPPPPDIENGTITSSNKGTYSVGDTVTYSCNPGYTLI--GSSTITCLEDGTWSPPAPTC 57 (57)
T ss_pred CcCCCCCCCCEEecCCCCeecCCCEEEEEcCCCCEEc--CCCeeEECCCCEECCCCCcC
Confidence 7788889999988765 67899999999999999999 99999999999999988877
No 41
>PF00084 Sushi: Sushi domain (SCR repeat); InterPro: IPR000436 Sushi domains are also known as Complement control protein (CCP) modules, or short consensus repeats (SCR), exist in a wide variety of complement and adhesion proteins. The structure is known for this domain, it is based on a beta-sandwich arrangement; one face made up of three beta-strands hydrogen-bonded to form a triple-stranded region at its centre and the other face formed from two separate beta-strands []. CD21 (also called C3d receptor, CR2, Epstein Barr virus receptor or EBV-R) is the receptor for EBV and for C3d, C3dg and iC3b. Complement components may activate B cells through CD21. CD21 is part of a large signal-transduction complex that also involves CD19, CD81, and Leu13. Some of the proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Complement decay-accelerating factor (Antigen CD55) belongs to the Cromer blood group system and is associated with Cr(a), Dr(a), Es(a), Tc(a/b/c), Wd(a), WES(a/b), IFC and UMC antigens. Complement receptor type 1 (C3b/C4b receptor) (Antigen CD35) belongs to the Knops blood group system and is associated with Kn(a/b), McC(a), Sl(a) and Yk(a) antigens. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; PDB: 1HFH_A 1HFI_A 1E5G_A 2UWN_A 2V8E_A 2XWB_H 3HS0_I 2OK5_A 2XWJ_J 3HRZ_D ....
Probab=98.88 E-value=1.6e-09 Score=81.32 Aligned_cols=56 Identities=23% Similarity=0.667 Sum_probs=50.5
Q ss_pred CCCCCccCCeeEEecCCCcccCCEEEEEeCCCceecCCCeEEeCCCCCcCCCCCcc
Q psy1807 278 CGKLEHIEHGTVTLETTRTTHGAVAIYACHENYTLIGETRRVCGDGGKWNGTEPQC 333 (618)
Q Consensus 278 C~~~~~~~ng~~~~~~~~~~~g~~~~~~C~~Gy~l~G~~~~~C~~~G~Ws~~~p~C 333 (618)
|+.|+.+.||.+......+.+|++++|+|++||++.|...++|+.+|+|++..|.|
T Consensus 1 C~~P~~~~~~~~~~~~~~~~~g~~~~~~C~~Gy~~~g~~~~~C~~~g~W~~~~p~C 56 (56)
T PF00084_consen 1 CPPPPPIPNGSISCTNNPYSYGSTVTFSCNPGYELSGSSTITCQSNGQWSPPPPTC 56 (56)
T ss_dssp EESESCBTTEEEESSSSSEETTEEEEEEESTTEEEESSSEEEEETTSEESSTTSEE
T ss_pred CcCCCCCCCcEEecCCCCccCCCEEEEcCCCCCEecCCCEEEECCCCEECCCCCCC
Confidence 56778899998777667899999999999999999999999999999999988877
No 42
>PF05473 Herpes_UL45: UL45 protein; InterPro: IPR008646 This family consists several UL45 proteins and homologues found in the herpes simplex virus family. The herpes simplex virus UL45 gene encodes an 18 kDa virion envelope protein whose function remains unknown. It has been suggested that the 18 kDa UL45 gene product is required for efficient growth in the central nervous system at low doses and may play an important role under the conditions of a naturally acquired infection []. The Equine herpesvirus 1 UL45 protein represents a type II membrane glycoprotein which has found to be non-essential for EHV-1 growth in vitro but deletion reduces the viruses' replication efficiency [].
Probab=98.24 E-value=3.7e-06 Score=80.10 Aligned_cols=55 Identities=22% Similarity=0.407 Sum_probs=45.5
Q ss_pred CccccccccCccceeecceEEEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHh
Q psy1807 74 DRCASEKAASAEITSFDRNCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGEL 134 (618)
Q Consensus 74 ~~Cp~~~~~~~~w~~~~~~CY~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l 134 (618)
..|| ++|+.|++.||+|+....++|+||+..|++++++|+.+.+......+...+
T Consensus 83 ~~CP------~~Wi~~~~~Cy~~~~~~~~t~~eA~~~C~~~~s~L~~~~~~~~L~~ll~~~ 137 (200)
T PF05473_consen 83 GPCP------KGWIGYNNSCYRFSNSPKKTWEEARNICAAYNSTLANVNNAKSLLELLDVL 137 (200)
T ss_pred CCCC------ccceeeCCEEEEEeCCCCcCHHHHHHHHHhcCCcCCCchhHHHHHHHHHHh
Confidence 4788 999999999999976543499999999999999999987776665555544
No 43
>PHA02672 ORF110 EEV glycoprotein; Provisional
Probab=96.42 E-value=0.0037 Score=54.59 Aligned_cols=83 Identities=11% Similarity=0.111 Sum_probs=58.5
Q ss_pred CCcCccccccccCccceeecceEEEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccCCCCccEEEeE
Q psy1807 71 FSTDRCASEKAASAEITSFDRNCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGA 150 (618)
Q Consensus 71 ~~~~~Cp~~~~~~~~w~~~~~~CY~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl 150 (618)
+-.+.|. .||+.|.+.||.- +.-..+-..|...|+.+.|+|..|-|..-++=|..+ .....||+|-
T Consensus 45 f~~~AC~------~GWvp~dd~CyLn-T~~q~s~~~A~~iC~~~~a~lP~~pn~~hLKgVm~l-------t~~r~FW~th 110 (166)
T PHA02672 45 FLKEACE------NGWVPIKDLCVLN-THVITNVTLAHDICASMDGDPPATPNTMLLKGIMFL-------TGERSFWMTH 110 (166)
T ss_pred hhhhhhh------cCCeeccCEEEEe-cceeecccHHHHHHHhccCCCCCCCCHhHhhheeeE-------ecccEEEEEc
Confidence 3344788 9999999999976 444588999999999999999998775533322121 1234699998
Q ss_pred EeCCCCCcceeEEeCCcccc
Q psy1807 151 QKDPGITARTWKWVDGEVVT 170 (618)
Q Consensus 151 ~~~~~~~~~~w~W~DGs~~~ 170 (618)
++.- ...+.=.+|..+.
T Consensus 111 h~~y---~~vy~~~~~~~~~ 127 (166)
T PHA02672 111 HDAY---TSVYEPKRGSYLP 127 (166)
T ss_pred cccc---eeeeecCCCCccc
Confidence 8653 4456555666554
No 44
>KOG4646|consensus
Probab=95.54 E-value=0.0035 Score=54.60 Aligned_cols=41 Identities=27% Similarity=0.208 Sum_probs=37.2
Q ss_pred CcccceeeccCCchhhhhhcccCCCchhHHHHHHhhhhcccC
Q psy1807 1 KDNKEYILSVGGIKPVTSLLLNSNHEQTVLNSITTLMYLVTP 42 (618)
Q Consensus 1 ~~~~~~i~~~~~~~~~~~~~~~~~~~~~~l~~i~~l~~l~~~ 42 (618)
|+|+++|+..+|+++|..+++ ++.|++|.++++.+|+|..+
T Consensus 89 ~~n~~~I~ea~g~plii~~ls-sp~e~tv~sa~~~l~~l~~~ 129 (173)
T KOG4646|consen 89 KTNAKFIREALGLPLIIFVLS-SPPEITVHSAALFLQLLEFG 129 (173)
T ss_pred hHHHHHHHHhcCCceEEeecC-CChHHHHHHHHHHHHHhcCc
Confidence 579999999999999999988 99999999999999888743
No 45
>PHA03093 EEV glycoprotein; Provisional
Probab=95.52 E-value=0.027 Score=51.49 Aligned_cols=47 Identities=30% Similarity=0.582 Sum_probs=37.3
Q ss_pred eeecceEEEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhh
Q psy1807 87 TSFDRNCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELER 136 (618)
Q Consensus 87 ~~~~~~CY~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~ 136 (618)
+.|+++||.|+.+ +++|+||...|...|+.|.+. +..+ .+|..+|..
T Consensus 104 I~~~~~C~~~~~e-pkTf~dA~~~C~~~g~~LPs~-~l~~-~WL~dYLeg 150 (185)
T PHA03093 104 IVYDGSCYIFHSE-PKTFSDAKADCAKKSSTLPNS-NLMT-TWLSDYLED 150 (185)
T ss_pred eecCCEeEEecCC-CcCHHHHHHHHHhcCCcCCCc-chHH-HHHHHHhhc
Confidence 5889999999865 699999999999999999983 3222 377766644
No 46
>PHA02673 ORF109 EEV glycoprotein; Provisional
Probab=95.39 E-value=0.02 Score=51.04 Aligned_cols=54 Identities=15% Similarity=0.293 Sum_probs=40.3
Q ss_pred cCccccccccCccceeecceEEEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhh
Q psy1807 73 TDRCASEKAASAEITSFDRNCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELER 136 (618)
Q Consensus 73 ~~~Cp~~~~~~~~w~~~~~~CY~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~ 136 (618)
...|. . +.|+++||.++ . +.+|+||...|...|+.|.+..-.+...+|..+|..
T Consensus 74 ~~tC~------G--I~~~~~Cltl~-~-p~tf~eAn~~C~~~g~~LPs~~l~~~~pWL~dYL~g 127 (161)
T PHA02673 74 NRTCD------G--INAGNKCLTLK-Y-PDTWTNANERCKELGQRLPSPSINKKYPWLTDYLVG 127 (161)
T ss_pred cCcCC------C--cccCCeeEEeC-C-CCcHHHHHHHHHhcCCcCCCCcccccCcHHHHHhhc
Confidence 45787 3 57899999997 4 589999999999999999985433333455555543
No 47
>PF05966 Chordopox_A33R: Chordopoxvirus A33R protein; InterPro: IPR009238 This family consists of several Chordopoxvirus A33R proteins. A33R plays a role in promoting Ab-resistant cell-to-cell spread of virus [] and interacts with A36R to incorporate the protein into the outer membrane of intracellular enveloped virions (IEV) [].; PDB: 3K7B_A.
Probab=94.08 E-value=0.03 Score=51.75 Aligned_cols=33 Identities=27% Similarity=0.631 Sum_probs=26.3
Q ss_pred eeecceEEEEecCCCCCHHHHHHHHHHcCCeecc
Q psy1807 87 TSFDRNCYEFNIDKGASFADARSKCQASGGDLVH 120 (618)
Q Consensus 87 ~~~~~~CY~~~~~~~~tw~~A~~~C~~~g~~L~~ 120 (618)
+.|+++||.++.+ +++|+||...|...|+.|.+
T Consensus 108 I~~~n~Cl~~~~~-p~T~~~A~~~C~~~g~~LPs 140 (190)
T PF05966_consen 108 IVYDNKCLTLNYE-PKTFDEANSDCNNKGQTLPS 140 (190)
T ss_dssp EEETTEEEEEEEE-EEEHHHHHHHHHHTT-B---
T ss_pred cccCCEEEEecCC-CCCHHHHHHHHHhcCCcCCC
Confidence 7789999999765 58999999999999999997
No 48
>PF12191 stn_TNFRSF12A: Tumour necrosis factor receptor stn_TNFRSF12A_TNFR domain; InterPro: IPR022316 The tumour necrosis factor (TNF) receptor (TNFR) superfamily comprises more than 20 type-I transmembrane proteins. Family members are defined based on similarity in their extracellular domain - a region that contains many cysteine residues arranged in a specific repetitive pattern []. The cysteines allow formation of an extended rod-like structure, responsible for ligand binding []. Upon receptor activation, different intracellular signalling complexes are assembled for different members of the TNFR superfamily, depending on their intracellular domains and sequences []. Activation of TNFRs can therefore induce a range of disparate effects, including cell proliferation, differentiation, survival, or apoptotic cell death, depending upon the receptor involved []. TNFRs are widely distributed and play important roles in many crucial biological processes, such as lymphoid and neuronal development, innate and adaptive immunity, and maintenance of cellular homeostasis []. Drugs that manipulate their signalling have potential roles in the prevention and treatment of many diseases, such as viral infections, coronary heart disease, transplant rejection, and immune disease []. TNF receptor 12 (also known as TWEAK receptor, and fibroblast growth factor-inducible-14 (Fn14)) has been implicated in endothelial cell growth and migration []. The receptor may also play a role in cell-matrix interactions [].; PDB: 2KN0_A 2RPJ_A 2KMZ_A 2EQP_A.
Probab=91.97 E-value=0.074 Score=45.54 Aligned_cols=19 Identities=21% Similarity=0.524 Sum_probs=13.8
Q ss_pred eeEeCcCCcccCCCCceEE
Q psy1807 495 SLKCTRQGDWDNMSPQFIS 513 (618)
Q Consensus 495 ~~~C~~~G~Ws~~~P~C~~ 513 (618)
+..|...-.|+...-.|.+
T Consensus 33 ~~~Cp~G~sWs~dLdkCm~ 51 (129)
T PF12191_consen 33 TTPCPRGSSWSADLDKCMS 51 (129)
T ss_dssp CSSS-SSEEEETTTTEEEE
T ss_pred CCCCCCCCcCccccccccc
Confidence 4567766679988888887
No 49
>cd03519 Link_domain_HAPLN_module_2 Link_domain_HAPLN_module_2; this link domain is found in the second link module of proteins similar to the vertebrate HAPLN (hyaluronan/HA and proteoglycan binding link) protein family which includes cartilage link protein. The link domain is a HA-binding domain. HAPLNs contain two contiguous link modules. Both link modules of cartilage link protein are involved in interaction with HA. In cartilage, a chondroitin sulfate proteoglycan core protein (CSPG) aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggregates with other CSPGs substituting for aggregan may contribute to the structural integrity of many different tissues. Members of the vertebrate HAPLN gene family are physically linked adjacent to CSPG genes.
Probab=90.37 E-value=0.28 Score=39.94 Aligned_cols=23 Identities=22% Similarity=0.459 Sum_probs=20.7
Q ss_pred CCCCCHHHHHHHHHHcCCeeccc
Q psy1807 99 DKGASFADARSKCQASGGDLVHG 121 (618)
Q Consensus 99 ~~~~tw~~A~~~C~~~g~~L~~i 121 (618)
..+++|+||++.|+..|++||+.
T Consensus 8 ~~~l~f~eA~~aC~~~ga~lAs~ 30 (91)
T cd03519 8 PGKLTFSEAVAACQRDGAQIAKV 30 (91)
T ss_pred ccccCHHHHHHHHHHcCCEeCCH
Confidence 34699999999999999999985
No 50
>cd03520 Link_domain_CSPGs_modules_2_4 Link_domain_CSPGs_modules_2_4; this link domain is found in the second and fourth link modules of the chondroitin sulfate proteoglycan core protein (CSPG) aggrecan and, in the second link module of three other CSPGs: versican, neurocan, and brevican. The link domain is a hyaluronan (HA)-binding domain. CSPGs are characterized by an N-terminal globular domain (G1 domain) containing two contiguous link modules (modules 1 and 2). Both link modules of the G1 domain of aggrecan are involved in interaction with HA. Aggrecan in addition contains a second globular domain (G2) having link modules 3 and 4 which lack HA-binding activity. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggregates having other CSPGs substituting for aggregan may contribute to the structural integrity of many different tissues. Members of the vertebrate HPLN (hyaluronan/HA and
Probab=88.19 E-value=0.48 Score=39.09 Aligned_cols=23 Identities=26% Similarity=0.528 Sum_probs=20.7
Q ss_pred CCCCCHHHHHHHHHHcCCeeccc
Q psy1807 99 DKGASFADARSKCQASGGDLVHG 121 (618)
Q Consensus 99 ~~~~tw~~A~~~C~~~g~~L~~i 121 (618)
..+++++||++.|++.|++||+.
T Consensus 8 ~~~l~f~eA~~aC~~~ga~lAs~ 30 (96)
T cd03520 8 PEKFTFQEARAECRSLGAVLATT 30 (96)
T ss_pred CCCcCHHHHHHHHHHcCCEeCCH
Confidence 34689999999999999999984
No 51
>KOG4564|consensus
Probab=86.87 E-value=0.77 Score=49.42 Aligned_cols=35 Identities=20% Similarity=0.497 Sum_probs=25.4
Q ss_pred CCcEEEEEcCCC---C--eeeCCceeEeCcCCcccCCCCc
Q psy1807 476 VHSTLEFHCEPG---H--LLVGQASLKCTRQGDWDNMSPQ 510 (618)
Q Consensus 476 ~g~~v~~~C~~G---y--~l~G~~~~~C~~~G~Ws~~~P~ 510 (618)
.|+.+...|.+. + .-.|.....|.+||+|+..+|.
T Consensus 79 ~G~~v~~~CP~yf~~~~~~~~~~v~R~C~~~G~W~~~~~~ 118 (473)
T KOG4564|consen 79 AGTLVTVPCPDYFPGFSYSTSGNVTRNCTSNGTWSERPPS 118 (473)
T ss_pred CCceEEecCccccCCCcccCCCceEeecCCCCccCCCCCc
Confidence 356677777753 2 2457778999999999987664
No 52
>PTZ00382 Variant-specific surface protein (VSP); Provisional
Probab=85.63 E-value=0.49 Score=39.48 Aligned_cols=11 Identities=27% Similarity=0.549 Sum_probs=8.4
Q ss_pred EcCCCCeeeCC
Q psy1807 483 HCEPGHLLVGQ 493 (618)
Q Consensus 483 ~C~~Gy~l~G~ 493 (618)
.|..||.+.+.
T Consensus 41 ~C~~GY~~~~~ 51 (96)
T PTZ00382 41 ECNSGFSLDNG 51 (96)
T ss_pred cCcCCcccCCC
Confidence 58999998643
No 53
>cd01102 Link_Domain The link domain is a hyaluronan (HA)-binding domain. It functions to mediate adhesive interactions during inflammatory leukocyte homing and tumor metastasis. It is found in the CD44 receptor and in human TSG-6. TSG-6 is the protein product of the tumor necrosis factor-stimulated gene-6. TSG-6 has a strong anti-inflammatory effect in models of acute inflammation and autoimmune arthritis and plays an essential role in female fertility. This group also contains the link domains of the chondroitin sulfate proteoglycan core proteins (CSPG) including aggrecan, versican, neurocan, and brevican and the link domains of the vertebrate HAPLN (HA and proteoglycan binding link) protein family. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggregates in which other CSPGs substitute for aggregan might contribute to the structural integrity of many different tissues. Members of
Probab=84.91 E-value=0.68 Score=38.08 Aligned_cols=21 Identities=33% Similarity=0.564 Sum_probs=19.7
Q ss_pred CCCHHHHHHHHHHcCCeeccc
Q psy1807 101 GASFADARSKCQASGGDLVHG 121 (618)
Q Consensus 101 ~~tw~~A~~~C~~~g~~L~~i 121 (618)
.+++.||++.|+++|++||+.
T Consensus 13 ~l~f~eA~~aC~~~ga~lAs~ 33 (92)
T cd01102 13 KLTFAEAALACKARGAHLATP 33 (92)
T ss_pred ccCHHHHHHHHHHcCCEeCCH
Confidence 689999999999999999985
No 54
>smart00445 LINK Link (Hyaluronan-binding).
Probab=84.62 E-value=0.72 Score=38.07 Aligned_cols=21 Identities=38% Similarity=0.657 Sum_probs=19.7
Q ss_pred CCCHHHHHHHHHHcCCeeccc
Q psy1807 101 GASFADARSKCQASGGDLVHG 121 (618)
Q Consensus 101 ~~tw~~A~~~C~~~g~~L~~i 121 (618)
.++++||++.|+++|++||+.
T Consensus 14 ~l~f~eA~~aC~~~ga~lAs~ 34 (94)
T smart00445 14 KLTFAEAREACRAQGATLATV 34 (94)
T ss_pred ccCHHHHHHHHHHcCCEeCCH
Confidence 589999999999999999985
No 55
>cd03518 Link_domain_HAPLN_module_1 Link_domain_HAPLN_module_1; this link domain is found in the first link module of proteins similar to the vertebrate HAPLN (hyaluronan/HA and proteoglycan binding link) protein family which includes cartilage link protein. The link domain is a HA-binding domain. HAPLNs contain two contiguous link modules. Both link modules of cartilage link protein are involved in interaction with HA. In cartilage, a chondroitin sulfate proteoglycan core protein (CSPG) aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggregates with other CSPGs substituting for aggregan may contribute to the structural integrity of many different tissues. Members of the vertebrate HAPLN gene family are physically linked adjacent to CSPG genes.
Probab=81.93 E-value=1 Score=37.14 Aligned_cols=21 Identities=19% Similarity=0.404 Sum_probs=19.5
Q ss_pred CCCHHHHHHHHHHcCCeeccc
Q psy1807 101 GASFADARSKCQASGGDLVHG 121 (618)
Q Consensus 101 ~~tw~~A~~~C~~~g~~L~~i 121 (618)
.+++.||++.|+.+|++||+.
T Consensus 13 ~l~f~eA~~aC~~~ga~lAs~ 33 (95)
T cd03518 13 NLNFHEAQQACEEQDATLASF 33 (95)
T ss_pred ccCHHHHHHHHHHcCCeeCCH
Confidence 579999999999999999985
No 56
>cd03515 Link_domain_TSG_6_like This is the extracellular link domain of the type found in human TSG-6. The link domain is a hyaluronan (HA)-binding domain. TSG-6 is the protein product of tumor necrosis factor-stimulated gene-6. TSG-6 is up-regulated in inflammatory lesions and in the ovary during ovulation. It has a strong anti-inflammatory and chondroprotective effect in models of acute inflammation and autoimmune arthritis and plays an essential role in female fertility. Also included in this group are the stabilins: stabilin-1 (FEEL-1, CLEVER-1) and stabilin-2 (FEEL-2). Stabilin-2 functions as the major liver and lymph node-scavenging receptor for HA and related glycosaminoglycans. Stabilin-2 is a scavenger receptor with a broad range of ligands including advanced glycation end (AGE) products, acetylated low density lipoprotein and procollagen peptides. In contrast, stabilin-1 does not bind HA, but binds acetylated low density lipoprotein and AGEs with lower affinity. As AGEs accum
Probab=81.66 E-value=0.99 Score=37.09 Aligned_cols=21 Identities=24% Similarity=0.580 Sum_probs=19.6
Q ss_pred CCCHHHHHHHHHHcCCeeccc
Q psy1807 101 GASFADARSKCQASGGDLVHG 121 (618)
Q Consensus 101 ~~tw~~A~~~C~~~g~~L~~i 121 (618)
.+++.||++.|+++|++||+.
T Consensus 13 ~l~f~eA~~aC~~~ga~lAs~ 33 (93)
T cd03515 13 KLTYTEAKAACEAEGAHLATY 33 (93)
T ss_pred ccCHHHHHHHHHHcCCccCCH
Confidence 589999999999999999985
No 57
>PF14704 DERM: Dermatopontin
Probab=81.60 E-value=4.8 Score=36.75 Aligned_cols=138 Identities=16% Similarity=0.329 Sum_probs=79.6
Q ss_pred cCCEEEEEcCCCceecCCceeEec---CCcccCCCCceeeecCCCCCc-cCCeeEEecCCCcccCCEEEEEeCCCceecC
Q psy1807 239 LGSTISYACPEGNMLVGSATRTCK---EGFWTGVAPTCQYVDCGKLEH-IEHGTVTLETTRTTHGAVAIYACHENYTLIG 314 (618)
Q Consensus 239 ~G~~~~y~C~~Gy~l~g~~~~~C~---~g~Ws~~~p~C~~~~C~~~~~-~~ng~~~~~~~~~~~g~~~~~~C~~Gy~l~G 314 (618)
++..+.|+|+.|..|.+-.+.... +..|. -.|..+ +. +..+.-........++....|+|..|+.|.|
T Consensus 3 ~d~~f~f~Cp~gq~i~~v~S~~~n~~eDR~W~---F~C~~~-----~~~~~~~~~~~~~~vN~~d~~~~~tCp~n~~i~G 74 (152)
T PF14704_consen 3 YDQPFSFQCPSGQGISGVRSYHSNNREDRRWS---FGCSPV-----GSGPTQGSSCWWSYVNNYDQPFNFTCPSNYYITG 74 (152)
T ss_pred CCCCEEEECCCCCEEEEEEEEeCCCCCcccEE---EEEEeC-----CCCCCccceEcchhhhccCCcEEEECCCCCEEEE
Confidence 567789999999999888777776 66886 235443 22 1111111111345689999999999999988
Q ss_pred CCeEEeC--CCCCcCCCCCcccccccCCCCCccccE-EEecccccCceeeecCCCCcEEcCCceE--EEccCCeecCCCC
Q psy1807 315 ETRRVCG--DGGKWNGTEPQCLFDWCAEPPQISGGI-VTTSGRRTGSVATYSCEPGFILFGSNEL--KCGLGGDWSGKAP 389 (618)
Q Consensus 315 ~~~~~C~--~~G~Ws~~~p~C~~~~C~~p~~~~~g~-~~~~~~~~g~~~~~~C~~Gy~l~G~~~~--~C~~~G~Ws~~~p 389 (618)
-.+..=. +|-.|. ..|.... ...+.+=. .+.-...+...+.|.-..|+.|.|-.+. .-..|-.|. .-
T Consensus 75 ~~S~h~n~~eDR~w~---f~C~~~~---~~~~~~C~~~t~~~N~~d~~~~~~~~~~~~i~G~~S~~~~~~~DR~wk--~~ 146 (152)
T PF14704_consen 75 MQSYHDNSKEDRRWQ---FQCCRIN---GATLYNCYWTTGYVNNYDGPLDFTVPSGYVITGVSSYHDNSNEDRRWK--FR 146 (152)
T ss_pred EEEEecCCcccCceE---EEEEecC---CceeecceecccccccCCCcEEEEcCCCcEEEeEeeecCCCCcCceEE--EE
Confidence 6544322 345563 2332210 00011100 0001124677788888899999885432 234566775 34
Q ss_pred cee
Q psy1807 390 TCK 392 (618)
Q Consensus 390 ~C~ 392 (618)
.|+
T Consensus 147 ~C~ 149 (152)
T PF14704_consen 147 TCR 149 (152)
T ss_pred EEe
Confidence 565
No 58
>PF00193 Xlink: Extracellular link domain; InterPro: IPR000538 The link domain [] is a hyaluronan(HA)-binding region found in proteins of vertebrates that are involved in the assembly of extracellular matrix, cell adhesion, and migration. The structure has been shown [] to consist of two alpha helices and two antiparallel beta sheets arranged around a large hydrophobic core similar to that of C-type lectin. This domain contains four conserved cysteines involved in two disulphide bonds. The link domain has also been termed HABM [] (HA binding module) and PTR [] (proteoglycan tandem repeat). Proteins with such a domain include the proteoglycans aggrecan, brevican, neurocan and versican, which are expressed in the CNS; the cartilage link protein (LP), a proteoglycan that together with HA and aggrecan forms multimolecular aggregates; Tumour necrosis factor-inducible protein TSG-6, which may be involved in cell-cell and cell-matrix interactions during inflammation and tumourgenesis; and CD44 antigen, the main cell surface receptor for HA.; GO: 0005540 hyaluronic acid binding, 0007155 cell adhesion; PDB: 1O7B_T 2PF5_C 1O7C_T 2JCQ_A 2JCR_A 2JCP_A 1UUH_B 1POZ_A 2I83_A.
Probab=80.91 E-value=1.6 Score=36.15 Aligned_cols=21 Identities=29% Similarity=0.566 Sum_probs=16.8
Q ss_pred CCCHHHHHHHHHHcCCeeccc
Q psy1807 101 GASFADARSKCQASGGDLVHG 121 (618)
Q Consensus 101 ~~tw~~A~~~C~~~g~~L~~i 121 (618)
.+++.+|++.|++.|+.||+.
T Consensus 13 ~l~f~eA~~~C~~~ga~LAs~ 33 (92)
T PF00193_consen 13 KLTFTEAQQACRALGARLASP 33 (92)
T ss_dssp SB-HHHHHHHHHHTTCBE--H
T ss_pred cCcHHHHHHHHHHcCCeeCCH
Confidence 579999999999999999984
No 59
>PF07204 Orthoreo_P10: Orthoreovirus membrane fusion protein p10; InterPro: IPR009854 This family consists of several Orthoreovirus membrane fusion protein p10 sequences. p10 is thought to be a multifunctional protein that plays a key role in virus-host interaction [].
Probab=80.23 E-value=0.88 Score=36.84 Aligned_cols=26 Identities=27% Similarity=0.594 Sum_probs=12.9
Q ss_pred ehHHHHHHHHHHHHHHHHHHHhcCCC
Q psy1807 537 VGAGVVLFLLILLAIIYARLRKATPV 562 (618)
Q Consensus 537 ~~~~~~l~l~vl~~~~~~~~r~k~~~ 562 (618)
.+++++|+++++++++..|.|+|...
T Consensus 47 ~GGG~iLilIii~Lv~CC~~K~K~~~ 72 (98)
T PF07204_consen 47 AGGGLILILIIIALVCCCRAKHKTSA 72 (98)
T ss_pred ccchhhhHHHHHHHHHHhhhhhhhHh
Confidence 34444444445444555565555443
No 60
>cd03521 Link_domain_KIAA0527_like Link_domain_KIAA0527_like; this domain is found in the human protein KIAA0527. Sequence-wise, it is highly similar to the link domain. The link domain is a hyaluronan-binding (HA) domain. KIAA0527 contains a single link module. The KIAA0527 gene was originally cloned from human brain tissue.
Probab=79.94 E-value=1.4 Score=35.66 Aligned_cols=21 Identities=29% Similarity=0.569 Sum_probs=19.6
Q ss_pred CCCHHHHHHHHHHcCCeeccc
Q psy1807 101 GASFADARSKCQASGGDLVHG 121 (618)
Q Consensus 101 ~~tw~~A~~~C~~~g~~L~~i 121 (618)
.++++||++.|+++|++||+.
T Consensus 13 ~l~f~eA~~AC~~~gA~lAs~ 33 (95)
T cd03521 13 GLGLRAARQSCASLGARLASA 33 (95)
T ss_pred ccCHHHHHHHHHHcCCEeccH
Confidence 579999999999999999984
No 61
>KOG4289|consensus
Probab=78.62 E-value=7.4 Score=46.72 Aligned_cols=154 Identities=26% Similarity=0.545 Sum_probs=88.0
Q ss_pred cCCEEEEEcCCCceecCC-----ceeEecCCcccCCCCceeeecCCCCC------ccCCeeEEecCCCccc-CCEEEEEe
Q psy1807 239 LGSTISYACPEGNMLVGS-----ATRTCKEGFWTGVAPTCQYVDCGKLE------HIEHGTVTLETTRTTH-GAVAIYAC 306 (618)
Q Consensus 239 ~G~~~~y~C~~Gy~l~g~-----~~~~C~~g~Ws~~~p~C~~~~C~~~~------~~~ng~~~~~~~~~~~-g~~~~~~C 306 (618)
.|-+| .|++||. |. ....|-.|=|. .|+|.+-.|..-. .-.+|...|....|.. |+ -..|
T Consensus 1737 ~GY~C--~C~~g~~--G~~Ce~~~dq~CPrGWWG--~P~CgpC~CavsKgfdp~CnKt~G~CqCKe~hy~~~~~--Cl~C 1808 (2531)
T KOG4289|consen 1737 HGYTC--ECPPGYT--GPYCELRADQPCPRGWWG--FPTCGPCNCAVSKGFDPDCNKTNGQCQCKENHYRPIGS--CLPC 1808 (2531)
T ss_pred CceeE--ECCCccc--CcchhhhccCCCCCcccC--CCCccCccccccCCCCCCccccCcceeeccccccCCCc--ceee
Confidence 44444 4889984 33 23456677775 7899876665321 2356777776655543 44 3335
Q ss_pred CCCceecCCCeEEeCCCCCcCCCCCcccccccCCCCCccccEEEecccccCceeeecCCCCcEEcCCceEEEccCCeecC
Q psy1807 307 HENYTLIGETRRVCGDGGKWNGTEPQCLFDWCAEPPQISGGIVTTSGRRTGSVATYSCEPGFILFGSNELKCGLGGDWSG 386 (618)
Q Consensus 307 ~~Gy~l~G~~~~~C~~~G~Ws~~~p~C~~~~C~~p~~~~~g~~~~~~~~~g~~~~~~C~~Gy~l~G~~~~~C~~~G~Ws~ 386 (618)
+-+ .|+.++.|.++|+- .|. | |.+ |-+| =+|+.-|. ..
T Consensus 1809 dC~---~Gs~Sr~C~adGqC----------~C~-p-----gai-------GRqC-drCd~pfa---------------ev 1846 (2531)
T KOG4289|consen 1809 DCY---FGSDSRECDADGQC----------PCK-P-----GAI-------GRQC-DRCDNPFA---------------EV 1846 (2531)
T ss_pred ccc---cCCCcccccCCCcC----------CCC-C-----ccc-------cccc-cccCChhh---------------hc
Confidence 444 47888888877731 121 1 000 1100 02332221 11
Q ss_pred CCCceeeecCCCCCccCCCceEeecCcccccceEEEEcCCCceeccCCcceEEeccCCeeeCC
Q psy1807 387 KAPTCKYVDCGSPANIDNGHYSLLNGTTTHGSVAEYTCDDDYWLEAGAMRRQVCMRDGKWSGD 449 (618)
Q Consensus 387 ~~p~C~~~~C~~p~~~~~g~~~~~~~~~~~g~~~~~~C~~Gy~l~~~G~~~~~C~~~G~Ws~~ 449 (618)
....|+.+.=.-|..++.|.... ..++|+.+...|..|-. |...+.|...|.|..+
T Consensus 1847 ttlgCrv~y~~CPk~ie~giwWP---~tkfG~~a~~pCP~G~~----G~AvR~C~e~~gWl~p 1902 (2531)
T KOG4289|consen 1847 TTLGCRVVYDKCPKAIEAGIWWP---QTKFGSPAAVPCPKGSS----GLAVRHCDETGGWLPP 1902 (2531)
T ss_pred cccCcEEccccCcHHHhhccccc---ccccCCcccccCCCCcc----hhhhhhhhccCCCCCc
Confidence 23345544322345566665543 24789999999999943 8899999999999864
No 62
>cd03517 Link_domain_CSPGs_modules_1_3 Link_domain_CSPGs_modules_1_3; this extracellular link domain is found in the first and third link modules of the chondroitin sulfate proteoglycan core protein (CSPG) aggrecan. In addition, it is found in the first link module of three other CSPGs: versican, neurocan, and brevican. The link domain is a hyaluronan (HA)-binding domain. CSPGs are characterized by an N-terminal globular domain (G1 domain) containing two contiguous link modules (modules 1 and 2). Both link modules of the G1 domain of aggrecan are involved in interaction with HA. In addition, aggrecan contains a second globular domain (G2) which contains link modules 3 and 4. G2 appears to lack HA-binding activity. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggregates having other CSPGs substituting for aggrecan may contribute to the structural integrity of many different tissues.
Probab=77.64 E-value=1.5 Score=36.30 Aligned_cols=21 Identities=14% Similarity=0.346 Sum_probs=19.4
Q ss_pred CCCHHHHHHHHHHcCCeeccc
Q psy1807 101 GASFADARSKCQASGGDLVHG 121 (618)
Q Consensus 101 ~~tw~~A~~~C~~~g~~L~~i 121 (618)
.+++.||++.|+++|+.||+.
T Consensus 13 ~l~f~eA~~aC~~~ga~lAs~ 33 (95)
T cd03517 13 ALTFPRAQRACLDISAQIATP 33 (95)
T ss_pred eECHHHHHHHHHHcCCEeCCH
Confidence 478999999999999999985
No 63
>PF01034 Syndecan: Syndecan domain; InterPro: IPR001050 The syndecans are transmembrane proteoglycans which are involved in the organisation of cytoskeleton and/or actin microfilaments, and have important roles as cell surface receptors during cell-cell and/or cell-matrix interactions [, ]. Structurally, these proteins consist of four separate domains: A signal sequence; An extracellular domain (ectodomain) of variable length whose sequence is not evolutionary conserved in the various forms of syndecans. The ectodomain contains the sites of attachment of the heparan sulphate glycosaminoglycan side chains; A transmembrane region; A highly conserved cytoplasmic domain of about 30 to 35 residues, which could interact with cytoskeletal proteins. The proteins known to belong to this family are: Syndecan 1. Syndecan 2 or fibroglycan. Syndecan 3 or neuroglycan or N-syndecan. Syndecan 4 or amphiglycan or ryudocan. Drosophila syndecan. Caenorhabditis elegans probable syndecan (F57C7.3). Syndecan-4, a transmembrane heparan sulphate proteoglycan, is a coreceptor with integrins in cell adhesion. It has been suggested to form a ternary signalling complex with protein kinase Calpha and phosphatidylinositol 4,5-bisphosphate (PIP2). Structural studies have demonstrated that the cytoplasmic domain undergoes a conformational transition and forms a symmetric dimer in the presence of phospholipid activator PIP2, and whose overall structure in solution exhibits a twisted clamp shape having a cavity in the centre of dimeric interface. In addition, it has been observed that the syndecan-4 variable domain interacts, strongly, not only with fatty acyl groups but also the anionic head group of PIP2. These findings indicate that PIP2 promotes oligomerisation of the syndecan-4 cytoplasmic domain for transmembrane signalling and cell-matrix adhesion [, ].; GO: 0008092 cytoskeletal protein binding, 0016020 membrane; PDB: 1EJQ_B 1EJP_B 1YBO_C 1OBY_Q.
Probab=74.82 E-value=0.97 Score=34.06 Aligned_cols=23 Identities=26% Similarity=0.488 Sum_probs=1.3
Q ss_pred HHHHHHHHHHHHHHHHhcCCCCC
Q psy1807 542 VLFLLILLAIIYARLRKATPVKN 564 (618)
Q Consensus 542 ~l~l~vl~~~~~~~~r~k~~~~~ 564 (618)
++++++|++++..|+|||.+...
T Consensus 22 ll~ailLIlf~iyR~rkkdEGSY 44 (64)
T PF01034_consen 22 LLFAILLILFLIYRMRKKDEGSY 44 (64)
T ss_dssp ---------------S------S
T ss_pred HHHHHHHHHHHHHHHHhcCCCCc
Confidence 33444555555567777777654
No 64
>smart00185 ARM Armadillo/beta-catenin-like repeats. Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
Probab=74.29 E-value=1.3 Score=29.79 Aligned_cols=38 Identities=26% Similarity=0.421 Sum_probs=32.1
Q ss_pred cccceeeccCCchhhhhhcccCCCchhHHHHHHhhhhcc
Q psy1807 2 DNKEYILSVGGIKPVTSLLLNSNHEQTVLNSITTLMYLV 40 (618)
Q Consensus 2 ~~~~~i~~~~~~~~~~~~~~~~~~~~~~l~~i~~l~~l~ 40 (618)
.++++|+..++++.+..+|. +++++++..++.+|.+|.
T Consensus 3 ~~~~~i~~~g~i~~L~~ll~-~~~~~i~~~a~~aL~nl~ 40 (41)
T smart00185 3 EQKQAVVDAGGLPALVELLK-SEDEEVVKEAAWALSNLS 40 (41)
T ss_pred HHHHHHHHCCCHHHHHHHHc-CCCHHHHHHHHHHHHHHc
Confidence 46788888999999999988 778889999988887764
No 65
>PF15050 SCIMP: SCIMP protein
Probab=73.74 E-value=1.2 Score=37.76 Aligned_cols=14 Identities=29% Similarity=0.634 Sum_probs=6.6
Q ss_pred HHHHHHHHHHHHHH
Q psy1807 541 VVLFLLILLAIIYA 554 (618)
Q Consensus 541 ~~l~l~vl~~~~~~ 554 (618)
++++-++|.+++|+
T Consensus 16 II~vS~~lglIlyC 29 (133)
T PF15050_consen 16 IILVSVVLGLILYC 29 (133)
T ss_pred HHHHHHHHHHHHHH
Confidence 33333455555553
No 66
>PF02009 Rifin_STEVOR: Rifin/stevor family; InterPro: IPR002858 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see []. Several multicopy gene families have been described in Plasmodium falciparum, including the stevor family of subtelomeric open reading frames and the rif interspersed repetitive elements. Both families contain three predicted transmembrane segments. It has been proposed that stevor and rif are members of a larger superfamily that code for variant surface antigens [].
Probab=72.30 E-value=1.4 Score=44.87 Aligned_cols=18 Identities=22% Similarity=0.301 Sum_probs=9.2
Q ss_pred HHHHHHHHHHHhcCCCCC
Q psy1807 547 ILLAIIYARLRKATPVKN 564 (618)
Q Consensus 547 vl~~~~~~~~r~k~~~~~ 564 (618)
+++.+|+|.||||+-+|+
T Consensus 273 vIIYLILRYRRKKKmkKK 290 (299)
T PF02009_consen 273 VIIYLILRYRRKKKMKKK 290 (299)
T ss_pred HHHHHHHHHHHHhhhhHH
Confidence 344455556665555443
No 67
>PF14575 EphA2_TM: Ephrin type-A receptor 2 transmembrane domain; PDB: 3KUL_A 2XVD_A 2VX1_A 2VWV_A 2VX0_A 2VWY_A 2VWZ_A 2VWW_A 2VWU_A 2VWX_A ....
Probab=70.85 E-value=1.5 Score=34.69 Aligned_cols=15 Identities=33% Similarity=0.705 Sum_probs=13.6
Q ss_pred hhccCChhhhhhhhc
Q psy1807 601 ENTYDNPYETALAFA 615 (618)
Q Consensus 601 ~~~~~~~~~~~~~~~ 615 (618)
-.||..|...-.|||
T Consensus 56 P~TYEDP~qAV~eFA 70 (75)
T PF14575_consen 56 PHTYEDPNQAVREFA 70 (75)
T ss_dssp GGGSSSHHHHHHHCS
T ss_pred cccccCHHHHHHHHH
Confidence 689999999999997
No 68
>TIGR01478 STEVOR variant surface antigen, stevor family. This model represents the stevor branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of stevor sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 8 bits.
Probab=70.22 E-value=2.8 Score=41.54 Aligned_cols=32 Identities=22% Similarity=0.352 Sum_probs=14.9
Q ss_pred eEEeehHHHHHHHHHHHHHH--HHHHHhcCCCCC
Q psy1807 533 YMISVGAGVVLFLLILLAII--YARLRKATPVKN 564 (618)
Q Consensus 533 ~~~~~~~~~~l~l~vl~~~~--~~~~r~k~~~~~ 564 (618)
+-.|+++.|+|+|.|+|+++ ..+||||..||.
T Consensus 259 ~Pcgiaalvllil~vvliiLYiWlyrrRK~swkh 292 (295)
T TIGR01478 259 LPYGIAALVLIILTVVLIILYIWLYRRRKKSWKH 292 (295)
T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHhhcccccc
Confidence 33343444444443333333 236666766654
No 69
>cd03516 Link_domain_CD44_like This domain is a hyaluronan (HA)-binding domain. It is found in CD44 receptor and mediates adhesive interactions during inflammatory leukocyte homing and tumor metastasis. It also plays an important role in arteriogenesis. The functional HA-binding domain of CD44 is an extended domain comprised of a single link module flanked with N-and C- extensions. These extensions are essential for folding and for functional activity. This group also contains the cell surface retention sequence (CRS) binding protein-1 (CRSBP-1) and lymph vessel endothelial receptor-1 (LYVE-1). CRSBP-1 is a cell surface binding protein for the CRS motif of PDGF-BB (platelet-derived growth factor-BB) and is responsible for the cell surface retention of PDGF-BB in SSV-transformed cells. CRSBP-1 may play a role in autocrine regulation of cell growth mediated by CRS containing growth regulators. LYVE-1 is preferentially expressed on the lymphatic endothelium and is used as a molecular marke
Probab=69.84 E-value=3.3 Score=36.90 Aligned_cols=21 Identities=29% Similarity=0.534 Sum_probs=19.5
Q ss_pred CCCHHHHHHHHHHcCCeeccc
Q psy1807 101 GASFADARSKCQASGGDLVHG 121 (618)
Q Consensus 101 ~~tw~~A~~~C~~~g~~L~~i 121 (618)
.+++.||++.|+++|+.||+.
T Consensus 18 ~lnf~eA~~aC~~~ga~lAs~ 38 (144)
T cd03516 18 SLNFTEAKEACRALGLTLASK 38 (144)
T ss_pred cCCHHHHHHHHHHcCCeeCCH
Confidence 479999999999999999984
No 70
>PTZ00370 STEVOR; Provisional
Probab=69.50 E-value=2.6 Score=41.89 Aligned_cols=31 Identities=29% Similarity=0.421 Sum_probs=14.0
Q ss_pred EEeehHHHHHHHHHHHHHHH--HHHHhcCCCCC
Q psy1807 534 MISVGAGVVLFLLILLAIIY--ARLRKATPVKN 564 (618)
Q Consensus 534 ~~~~~~~~~l~l~vl~~~~~--~~~r~k~~~~~ 564 (618)
-.|+++.|+|+|.|+|+++| .+||||..||.
T Consensus 256 Pygiaalvllil~vvliilYiwlyrrRK~swkh 288 (296)
T PTZ00370 256 PYGIAALVLLILAVVLIILYIWLYRRRKNSWKH 288 (296)
T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHhhcchhHH
Confidence 33334444444433333332 36666666653
No 71
>PF08693 SKG6: Transmembrane alpha-helix domain; InterPro: IPR014805 SKG6 and AXL2 are membrane proteins that show polarised intracellular localisation [, ]. This entry represents the highly conserved transmembrane alpha-helical domain found in these proteins [, ]. The full-length AXL2 protein has a negative regulatory function in cytokinesis [].
Probab=69.03 E-value=3.4 Score=28.17 Aligned_cols=23 Identities=17% Similarity=0.371 Sum_probs=10.5
Q ss_pred cceEEeehHHHHHHHHHHHHHHH
Q psy1807 531 LGYMISVGAGVVLFLLILLAIIY 553 (618)
Q Consensus 531 ~~~~~~~~~~~~l~l~vl~~~~~ 553 (618)
.+..++++.-+..++++|+++++
T Consensus 11 vaIa~~VvVPV~vI~~vl~~~l~ 33 (40)
T PF08693_consen 11 VAIAVGVVVPVGVIIIVLGAFLF 33 (40)
T ss_pred EEEEEEEEechHHHHHHHHHHhh
Confidence 44555555544444444433333
No 72
>PF00514 Arm: Armadillo/beta-catenin-like repeat; InterPro: IPR000225 The armadillo (Arm) repeat is an approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila melanogaster segment polarity gene armadillo involved in signal transduction through wingless. Animal Arm-repeat proteins function in various processes, including intracellular signalling and cytoskeletal regulation, and include such proteins as beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumour suppressor protein, and the nuclear transport factor importin-alpha, amongst others []. A subset of these proteins is conserved across eukaryotic kingdoms. In higher plants, some Arm-repeat proteins function in intracellular signalling like their mammalian counterparts, while others have novel functions []. The 3-dimensional fold of an armadillo repeat is known from the crystal structure of beta-catenin, where the 12 repeats form a superhelix of alpha helices with three helices per unit []. The cylindrical structure features a positively charged grove, which presumably interacts with the acidic surfaces of the known interaction partners of beta-catenin.; GO: 0005515 protein binding; PDB: 2Z6G_A 1IQ1_C 3RZX_A 2C1M_A 3BTR_C 3OQS_A 3TPO_A 1IAL_A 1Q1S_C 1PJM_B ....
Probab=68.70 E-value=1.1 Score=30.59 Aligned_cols=38 Identities=32% Similarity=0.418 Sum_probs=32.3
Q ss_pred cccceeeccCCchhhhhhcccCCCchhHHHHHHhhhhcc
Q psy1807 2 DNKEYILSVGGIKPVTSLLLNSNHEQTVLNSITTLMYLV 40 (618)
Q Consensus 2 ~~~~~i~~~~~~~~~~~~~~~~~~~~~~l~~i~~l~~l~ 40 (618)
++++.|+..++++.+..+|+ +.+.++...++.+|-+|.
T Consensus 3 ~~~~~i~~~g~i~~Lv~ll~-~~~~~v~~~a~~al~nl~ 40 (41)
T PF00514_consen 3 ENKQAIVEAGGIPPLVQLLK-SPDPEVQEEAAWALGNLA 40 (41)
T ss_dssp HHHHHHHHTTHHHHHHHHTT-SSSHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHcccHHHHHHHHc-CCCHHHHHHHHHHHHHHh
Confidence 56778899999999999998 888889988888876653
No 73
>PF08374 Protocadherin: Protocadherin; InterPro: IPR013585 The structure of protocadherins is similar to that of classic cadherins (IPR002126 from INTERPRO), but they also have some unique features associated with the cytoplasmic domains. They are expressed in a variety of organisms and are found in high concentrations in the brain where they seem to be localised mainly at cell-cell contact sites. Their expression seems to be developmentally regulated [].
Probab=68.63 E-value=6.2 Score=37.54 Aligned_cols=29 Identities=24% Similarity=0.484 Sum_probs=14.6
Q ss_pred cccceEEeehHHHHHHHH-HHHHHHHHHHH
Q psy1807 529 QTLGYMISVGAGVVLFLL-ILLAIIYARLR 557 (618)
Q Consensus 529 ~~~~~~~~~~~~~~l~l~-vl~~~~~~~~r 557 (618)
..+.++++++++++.++| ++++++.|++|
T Consensus 35 d~~~I~iaiVAG~~tVILVI~i~v~vR~CR 64 (221)
T PF08374_consen 35 DYVKIMIAIVAGIMTVILVIFIVVLVRYCR 64 (221)
T ss_pred cceeeeeeeecchhhhHHHHHHHHHHHHHh
Confidence 345566665555544444 44444445344
No 74
>PF01102 Glycophorin_A: Glycophorin A; InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=67.14 E-value=5 Score=34.95 Aligned_cols=15 Identities=27% Similarity=0.217 Sum_probs=6.9
Q ss_pred HHHHHHHHHHHHHHH
Q psy1807 542 VLFLLILLAIIYARL 556 (618)
Q Consensus 542 ~l~l~vl~~~~~~~~ 556 (618)
+.+++++|+++|..|
T Consensus 75 aGvIg~Illi~y~ir 89 (122)
T PF01102_consen 75 AGVIGIILLISYCIR 89 (122)
T ss_dssp HHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHH
Confidence 345555554444433
No 75
>KOG1219|consensus
Probab=65.44 E-value=44 Score=42.89 Aligned_cols=14 Identities=29% Similarity=0.392 Sum_probs=8.4
Q ss_pred chhhchhhhccCCh
Q psy1807 594 NIYENVAENTYDNP 607 (618)
Q Consensus 594 ~~~~~~~~~~~~~~ 607 (618)
|+|.++.-|.+-.|
T Consensus 4038 nVy~~v~PqvplrP 4051 (4289)
T KOG1219|consen 4038 NVYVPVVPQVPLRP 4051 (4289)
T ss_pred CccccCCCCCCcCC
Confidence 77776665555444
No 76
>PF14704 DERM: Dermatopontin
Probab=64.88 E-value=17 Score=33.06 Aligned_cols=127 Identities=11% Similarity=0.247 Sum_probs=73.9
Q ss_pred ccCCEEEEEeCCCceecCCCeEEeC--CCCCcCCCCCcccccccCCCCC-ccccE--EEecccccCceeeecCCCCcEEc
Q psy1807 297 THGAVAIYACHENYTLIGETRRVCG--DGGKWNGTEPQCLFDWCAEPPQ-ISGGI--VTTSGRRTGSVATYSCEPGFILF 371 (618)
Q Consensus 297 ~~g~~~~~~C~~Gy~l~G~~~~~C~--~~G~Ws~~~p~C~~~~C~~p~~-~~~g~--~~~~~~~~g~~~~~~C~~Gy~l~ 371 (618)
.++....|+|..|..|.+-.+.... +|-.|.- .|. ..+. +..+. ....-..++..+.|+|+.|+.|.
T Consensus 2 ~~d~~f~f~Cp~gq~i~~v~S~~~n~~eDR~W~F---~C~-----~~~~~~~~~~~~~~~~vN~~d~~~~~tCp~n~~i~ 73 (152)
T PF14704_consen 2 NYDQPFSFQCPSGQGISGVRSYHSNNREDRRWSF---GCS-----PVGSGPTQGSSCWWSYVNNYDQPFNFTCPSNYYIT 73 (152)
T ss_pred CCCCCEEEECCCCCEEEEEEEEeCCCCCcccEEE---EEE-----eCCCCCCccceEcchhhhccCCcEEEECCCCCEEE
Confidence 3567789999999988876666654 6668854 344 3332 11111 11112367899999999999998
Q ss_pred CCceEEEc--cCCeecCCCCceee------ecCC-CCCccCCCceEeecCcccccceEEEEcCCCceeccCCcceE--Ee
Q psy1807 372 GSNELKCG--LGGDWSGKAPTCKY------VDCG-SPANIDNGHYSLLNGTTTHGSVAEYTCDDDYWLEAGAMRRQ--VC 440 (618)
Q Consensus 372 G~~~~~C~--~~G~Ws~~~p~C~~------~~C~-~p~~~~~g~~~~~~~~~~~g~~~~~~C~~Gy~l~~~G~~~~--~C 440 (618)
|-.+..=. +|-.|. ..|.. ..|. .-. -.-.+...+.|+-..||.|. |..+. .-
T Consensus 74 G~~S~h~n~~eDR~w~---f~C~~~~~~~~~~C~~~t~-----------~~N~~d~~~~~~~~~~~~i~--G~~S~~~~~ 137 (152)
T PF14704_consen 74 GMQSYHDNSKEDRRWQ---FQCCRINGATLYNCYWTTG-----------YVNNYDGPLDFTVPSGYVIT--GVSSYHDNS 137 (152)
T ss_pred EEEEEecCCcccCceE---EEEEecCCceeecceeccc-----------ccccCCCcEEEEcCCCcEEE--eEeeecCCC
Confidence 75544322 345564 23322 1222 110 01245677888888889888 76432 22
Q ss_pred ccCCeee
Q psy1807 441 MRDGKWS 447 (618)
Q Consensus 441 ~~~G~Ws 447 (618)
..|-.|.
T Consensus 138 ~~DR~wk 144 (152)
T PF14704_consen 138 NEDRRWK 144 (152)
T ss_pred CcCceEE
Confidence 4455675
No 77
>PF02793 HRM: Hormone receptor domain; InterPro: IPR001879 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The secretin-like GPCRs include secretin [], calcitonin [], parathyroid hormone/parathyroid hormone-related peptides [] and vasoactive intestinal peptide [], all of which activate adenylyl cyclase and the phosphatidyl-inositol-calcium pathway. These receptors contain seven transmembrane regions, in a manner reminiscent of the rhodopsins and other receptors believed to interact with G-proteins (however there is no significant sequence identity between these families, the secretin-like receptors thus bear their own unique '7TM' signature). Their N terminus is probably located on the extracellular side of the membrane and potentially glycosylated. This N-terminal region contains a long conserved region which allow the binding of large peptidic ligand such as glucagon, secretin, VIP and PACAP; this region contains five conserved cysteines residues which could be involved in disulphide bond. The C-terminal region of these receptor is probably cytoplasmic. Every receptor gene in this family is encoded on multiple exons, and several of these genes are alternatively spliced to yield functionally distinct products. This domain is found in the extracellular part of some of the secretin-like (family 2) GPCRs including the calcitonin receptor; corticotropin releasing factor receptor 1; diuretic hormone receptor; glucagon-like peptide 1 receptor; and parathyroid hormone peptide receptor.; GO: 0004930 G-protein coupled receptor activity, 0016020 membrane; PDB: 3EHT_A 3EHU_A 2L27_A 3EHS_A 4DLO_B 2QKH_A 3C5T_A 3C59_A 3IOL_A 3N7R_B ....
Probab=64.61 E-value=7.4 Score=29.79 Aligned_cols=36 Identities=19% Similarity=0.582 Sum_probs=27.9
Q ss_pred ccccceEEEEcCCCce---eccCCcceEEeccCCeeeCC
Q psy1807 414 TTHGSVAEYTCDDDYW---LEAGAMRRQVCMRDGKWSGD 449 (618)
Q Consensus 414 ~~~g~~~~~~C~~Gy~---l~~~G~~~~~C~~~G~Ws~~ 449 (618)
...|.++...|..++. ....|..++.|..+|.|...
T Consensus 17 T~~G~~a~~~CP~~~~~~~~~~~g~a~R~C~~nG~W~~~ 55 (66)
T PF02793_consen 17 TPAGETASQPCPSGFYGFGSDTSGNATRNCTENGTWEEH 55 (66)
T ss_dssp BETTEEEEEECSTTSTCCTSSTTSEEEEEEETTSSBECS
T ss_pred CCCCCEEEeeCcCCccccccccceeEEEEcCCCCcCCCc
Confidence 4678999999998873 22226789999999999863
No 78
>smart00008 HormR Domain present in hormone receptors.
Probab=60.78 E-value=10 Score=29.51 Aligned_cols=34 Identities=24% Similarity=0.342 Sum_probs=26.1
Q ss_pred CCCCcEEEEEcCCCCeeeCC---ceeEeCcCCcccCC
Q psy1807 474 LNVHSTLEFHCEPGHLLVGQ---ASLKCTRQGDWDNM 507 (618)
Q Consensus 474 ~~~g~~v~~~C~~Gy~l~G~---~~~~C~~~G~Ws~~ 507 (618)
...|.++...|..++..... ....|..+|.|+..
T Consensus 18 t~~G~~~~~~CP~~~~g~~~~~~a~R~C~~~G~W~~~ 54 (70)
T smart00008 18 TPAGQLVEVPCPDYFSGFSYKTGASRNCTENGGWSPP 54 (70)
T ss_pred CCCCCEEEecCCCcccCcCCCCCEEEEcCCCCCCccC
Confidence 45688999999987654322 67899999999864
No 79
>smart00008 HormR Domain present in hormone receptors.
Probab=59.02 E-value=9.8 Score=29.61 Aligned_cols=32 Identities=28% Similarity=0.556 Sum_probs=25.2
Q ss_pred cccCCEEEEEcCCCceecCC---ceeEec-CCcccC
Q psy1807 237 TKLGSTISYACPEGNMLVGS---ATRTCK-EGFWTG 268 (618)
Q Consensus 237 ~~~G~~~~y~C~~Gy~l~g~---~~~~C~-~g~Ws~ 268 (618)
-..|.++...|+.++..... ..+.|. +|.|+.
T Consensus 18 t~~G~~~~~~CP~~~~g~~~~~~a~R~C~~~G~W~~ 53 (70)
T smart00008 18 TPAGQLVEVPCPDYFSGFSYKTGASRNCTENGGWSP 53 (70)
T ss_pred CCCCCEEEecCCCcccCcCCCCCEEEEcCCCCCCcc
Confidence 45788999999998654422 789999 889985
No 80
>KOG4297|consensus
Probab=57.69 E-value=30 Score=29.04 Aligned_cols=94 Identities=20% Similarity=0.393 Sum_probs=55.0
Q ss_pred ceEEEEecCCCCCHHHHHHHHHHcCCeecccCChHHHHHHHHHhhhcccCCCCccEEEeEEeCCCCCcceeEEeCCcccc
Q psy1807 91 RNCYEFNIDKGASFADARSKCQASGGDLVHGFRGITNTFLLGELERRKDRLKTLLVWIGAQKDPGITARTWKWVDGEVVT 170 (618)
Q Consensus 91 ~~CY~~~~~~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~~l~~~~~~~~~~~~WIGl~~~~~~~~~~w~W~DGs~~~ 170 (618)
..|+..+.. ..+|..+...|...+++|+.+.+..++.++....... ....|++..... ....+.|.++..
T Consensus 99 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~--~~~~~~~~~~~~-- 168 (207)
T KOG4297|consen 99 CTCYTSFEW-TDGWSTTGTGCGGQGANLVGVLSVQENNFITSNFRLF-----LGCLWLGGDSSF--YFTYPTWGDGTT-- 168 (207)
T ss_pred ccccCchhh-ccCHHHHHHHHHHhCCCcCeeCCHHHHHHHHHhHHhe-----eeeEEecCCccc--ccccccccCCCC--
Confidence 345544322 2489999999999999999999999999998751110 012344332221 133455555554
Q ss_pred cCCCCCCCCCCCCCccceEEecCC
Q psy1807 171 KPSWGKDQPNNYNGEQNCVVLDGG 194 (618)
Q Consensus 171 ~~~W~~~~P~~~~~~~~Cv~l~~~ 194 (618)
+..|....+........|+.+...
T Consensus 169 ~~~~~~~~~~~~~~~~~C~~~~~~ 192 (207)
T KOG4297|consen 169 YSNWALGDPNPPGDDENCLSLSTS 192 (207)
T ss_pred cccccccccCCCCCCCceEEEECC
Confidence 334444333322335678887544
No 81
>PF12273 RCR: Chitin synthesis regulation, resistance to Congo red; InterPro: IPR020999 RCR proteins are ER membrane proteins that regulate chitin deposition in fungal cell walls. Although chitin, a linear polymer of beta-1,4-linked N-acetylglucosamine, constitutes only 2% of the cell wall it plays a vital role in the overall protection of the cell wall against stress, noxious chemicals and osmotic pressure changes. Congo red is a cell wall-disrupting benzidine-type dye extensively used in many cell wall mutant studies that specifically targets chitin in yeast cells and inhibits growth. RCR proteins render the yeasts resistant to Congo red by diminishing the content of chitin in the cell wall []. RCR proteins are probably regulating chitin synthase III interact directly with ubiquitin ligase Rsp5, and the VPEY motif is necessary for this, via interaction with the WW domains of Rsp5 [].
Probab=55.95 E-value=8.7 Score=33.99 Aligned_cols=26 Identities=4% Similarity=0.216 Sum_probs=16.6
Q ss_pred hHHHHHHHHHHHHHHHHHHHhcCCCC
Q psy1807 538 GAGVVLFLLILLAIIYARLRKATPVK 563 (618)
Q Consensus 538 ~~~~~l~l~vl~~~~~~~~r~k~~~~ 563 (618)
++.+++++++|+++++.++++|++++
T Consensus 4 l~~iii~~i~l~~~~~~~~~rRR~r~ 29 (130)
T PF12273_consen 4 LFAIIIVAILLFLFLFYCHNRRRRRR 29 (130)
T ss_pred eHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 34455555566666677887777765
No 82
>PF04478 Mid2: Mid2 like cell wall stress sensor; InterPro: IPR007567 This family represents a region near the C terminus of Mid2, which contains a transmembrane region. The remainder of the protein sequence is serine-rich and of low complexity, and is therefore impossible to align accurately. Mid2 is thought to act as a mechanosensor of cell wall stress. The C-terminal cytoplasmic region of Mid2 is known to interact with Rom2, a guanine nucleotide exchange factor (GEF) for Rho1, which is part of the cell wall integrity signalling pathway [].
Probab=55.72 E-value=3.7 Score=36.87 Aligned_cols=15 Identities=27% Similarity=0.237 Sum_probs=6.5
Q ss_pred ceEEeehHHHHHHHH
Q psy1807 532 GYMISVGAGVVLFLL 546 (618)
Q Consensus 532 ~~~~~~~~~~~l~l~ 546 (618)
.+++.|+.++.++|+
T Consensus 51 VIGvVVGVGg~ill~ 65 (154)
T PF04478_consen 51 VIGVVVGVGGPILLG 65 (154)
T ss_pred EEEEEecccHHHHHH
Confidence 344444444444443
No 83
>PF15102 TMEM154: TMEM154 protein family
Probab=54.97 E-value=4 Score=36.41 Aligned_cols=21 Identities=24% Similarity=0.233 Sum_probs=8.9
Q ss_pred ccceEEeehHHHHHHHHHHHH
Q psy1807 530 TLGYMISVGAGVVLFLLILLA 550 (618)
Q Consensus 530 ~~~~~~~~~~~~~l~l~vl~~ 550 (618)
.+-+.|-.+..++|+|+|+++
T Consensus 58 iLmIlIP~VLLvlLLl~vV~l 78 (146)
T PF15102_consen 58 ILMILIPLVLLVLLLLSVVCL 78 (146)
T ss_pred EEEEeHHHHHHHHHHHHHHHh
Confidence 455555433333333333333
No 84
>PTZ00046 rifin; Provisional
Probab=53.45 E-value=7.6 Score=40.22 Aligned_cols=18 Identities=22% Similarity=0.301 Sum_probs=10.4
Q ss_pred HHHHHHHHHHHhcCCCCC
Q psy1807 547 ILLAIIYARLRKATPVKN 564 (618)
Q Consensus 547 vl~~~~~~~~r~k~~~~~ 564 (618)
|++-+++|.||||+-+|+
T Consensus 332 vIIYLILRYRRKKKMkKK 349 (358)
T PTZ00046 332 VIIYLILRYRRKKKMKKK 349 (358)
T ss_pred HHHHHHHHhhhcchhHHH
Confidence 444466666666665554
No 85
>PF05454 DAG1: Dystroglycan (Dystrophin-associated glycoprotein 1); InterPro: IPR008465 Dystroglycan is one of the dystrophin-associated glycoproteins, which is encoded by a 5.5 kb transcript in Homo sapiens. The protein product is cleaved into two non-covalently associated subunits, [alpha] (N-terminal) and [beta] (C-terminal). In skeletal muscle the dystroglycan complex works as a transmembrane linkage between the extracellular matrix and the cytoskeleton [alpha]-dystroglycan is extracellular and binds to merosin ([alpha]-2 laminin) in the basement membrane, while [beta]-dystroglycan is a transmembrane protein and binds to dystrophin, which is a large rod-like cytoskeletal protein, absent in Duchenne muscular dystrophy patients. Dystrophin binds to intracellular actin cables. In this way, the dystroglycan complex, which links the extracellular matrix to the intracellular actin cables, is thought to provide structural integrity in muscle tissues. The dystroglycan complex is also known to serve as an agrin receptor in muscle, where it may regulate agrin-induced acetylcholine receptor clustering at the neuromuscular junction. There is also evidence which suggests the function of dystroglycan as a part of the signal transduction pathway because it is shown that Grb2, a mediator of the Ras-related signal pathway, can interact with the cytoplasmic domain of dystroglycan. In general, aberrant expression of dystrophin-associated protein complex underlies the pathogenesis of Duchenne muscular dystrophy, Becker muscular dystrophy and severe childhood autosomal recessive muscular dystrophy. Interestingly, no genetic disease has been described for either [alpha]- or [beta]-dystroglycan. Dystroglycan is widely distributed in non-muscle tissues as well as in muscle tissues. During epithelial morphogenesis of kidney, the dystroglycan complex is shown to act as a receptor for the basement membrane. Dystroglycan expression in Mus musculus brain and neural retina has also been reported. However, the physiological role of dystroglycan in non-muscle tissues has remained unclear [].; PDB: 1EG4_P.
Probab=53.17 E-value=4.5 Score=40.89 Aligned_cols=29 Identities=17% Similarity=0.263 Sum_probs=0.0
Q ss_pred EEeehHHHHHHHHHHHHHHHHHHHhcCCC
Q psy1807 534 MISVGAGVVLFLLILLAIIYARLRKATPV 562 (618)
Q Consensus 534 ~~~~~~~~~l~l~vl~~~~~~~~r~k~~~ 562 (618)
..+|++.++||++++++.+.+|+||+++.
T Consensus 150 IpaVVI~~iLLIA~iIa~icyrrkR~GK~ 178 (290)
T PF05454_consen 150 IPAVVIAAILLIAGIIACICYRRKRKGKM 178 (290)
T ss_dssp -----------------------------
T ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhcccc
Confidence 33444444444444444444454444443
No 86
>PF02793 HRM: Hormone receptor domain; InterPro: IPR001879 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The secretin-like GPCRs include secretin [], calcitonin [], parathyroid hormone/parathyroid hormone-related peptides [] and vasoactive intestinal peptide [], all of which activate adenylyl cyclase and the phosphatidyl-inositol-calcium pathway. These receptors contain seven transmembrane regions, in a manner reminiscent of the rhodopsins and other receptors believed to interact with G-proteins (however there is no significant sequence identity between these families, the secretin-like receptors thus bear their own unique '7TM' signature). Their N terminus is probably located on the extracellular side of the membrane and potentially glycosylated. This N-terminal region contains a long conserved region which allow the binding of large peptidic ligand such as glucagon, secretin, VIP and PACAP; this region contains five conserved cysteines residues which could be involved in disulphide bond. The C-terminal region of these receptor is probably cytoplasmic. Every receptor gene in this family is encoded on multiple exons, and several of these genes are alternatively spliced to yield functionally distinct products. This domain is found in the extracellular part of some of the secretin-like (family 2) GPCRs including the calcitonin receptor; corticotropin releasing factor receptor 1; diuretic hormone receptor; glucagon-like peptide 1 receptor; and parathyroid hormone peptide receptor.; GO: 0004930 G-protein coupled receptor activity, 0016020 membrane; PDB: 3EHT_A 3EHU_A 2L27_A 3EHS_A 4DLO_B 2QKH_A 3C5T_A 3C59_A 3IOL_A 3N7R_B ....
Probab=52.17 E-value=12 Score=28.57 Aligned_cols=34 Identities=26% Similarity=0.570 Sum_probs=26.7
Q ss_pred CCCCcEEEEEcCCCCe-----eeCCceeEeCcCCcccCC
Q psy1807 474 LNVHSTLEFHCEPGHL-----LVGQASLKCTRQGDWDNM 507 (618)
Q Consensus 474 ~~~g~~v~~~C~~Gy~-----l~G~~~~~C~~~G~Ws~~ 507 (618)
...|.++...|..++. ..|.....|..+|+|...
T Consensus 17 T~~G~~a~~~CP~~~~~~~~~~~g~a~R~C~~nG~W~~~ 55 (66)
T PF02793_consen 17 TPAGETASQPCPSGFYGFGSDTSGNATRNCTENGTWEEH 55 (66)
T ss_dssp BETTEEEEEECSTTSTCCTSSTTSEEEEEEETTSSBECS
T ss_pred CCCCCEEEeeCcCCccccccccceeEEEEcCCCCcCCCc
Confidence 4568889999998863 136678999999999864
No 87
>TIGR01477 RIFIN variant surface antigen, rifin family. This model represents the rifin branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of rifin sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 20 bits.
Probab=51.99 E-value=8.3 Score=39.79 Aligned_cols=18 Identities=22% Similarity=0.301 Sum_probs=10.2
Q ss_pred HHHHHHHHHHHhcCCCCC
Q psy1807 547 ILLAIIYARLRKATPVKN 564 (618)
Q Consensus 547 vl~~~~~~~~r~k~~~~~ 564 (618)
|++-+++|.||||+-+|+
T Consensus 327 vIIYLILRYRRKKKMkKK 344 (353)
T TIGR01477 327 VIIYLILRYRRKKKMKKK 344 (353)
T ss_pred HHHHHHHHhhhcchhHHH
Confidence 444466666666665554
No 88
>PF15330 SIT: SHP2-interacting transmembrane adaptor protein, SIT
Probab=51.46 E-value=22 Score=30.24 Aligned_cols=7 Identities=29% Similarity=0.506 Sum_probs=2.8
Q ss_pred chhhchh
Q psy1807 594 NIYENVA 600 (618)
Q Consensus 594 ~~~~~~~ 600 (618)
-||-++.
T Consensus 47 p~YgNL~ 53 (107)
T PF15330_consen 47 PCYGNLE 53 (107)
T ss_pred ccccccc
Confidence 3444443
No 89
>PF02439 Adeno_E3_CR2: Adenovirus E3 region protein CR2; InterPro: IPR003470 Early region 3 (E3) of human adenoviruses (Ads) codes for proteins that appear to control viral interactions with the host []. This region called CR1 (conserved region 1) [] is found three times in Human adenovirus 19 (a subgroup D adenovirus) 49 kDa protein in the E3 region. CR1 is also found in the 20.1 Kd protein of subgroup B adenoviruses. The function of this 80 amino acid region is unknown. This region is probably a divergent immunoglobulin domain.
Probab=49.92 E-value=16 Score=24.59 Aligned_cols=16 Identities=25% Similarity=0.293 Sum_probs=8.2
Q ss_pred HHHHHHHHHHHHhcCC
Q psy1807 546 LILLAIIYARLRKATP 561 (618)
Q Consensus 546 ~vl~~~~~~~~r~k~~ 561 (618)
+++.+.+|..++||.+
T Consensus 19 iii~~~~YaCcykk~~ 34 (38)
T PF02439_consen 19 IIICMFYYACCYKKHR 34 (38)
T ss_pred HHHHHHHHHHHHcccc
Confidence 3444555555555554
No 90
>KOG0994|consensus
Probab=49.10 E-value=44 Score=39.66 Aligned_cols=123 Identities=22% Similarity=0.503 Sum_probs=61.5
Q ss_pred CCceeEecCCcccCCCCceeeecCCCCC---ccCCee-EEecCCCcccCCEEEEEeCCCceecCCCeEEeCCCCCcCCCC
Q psy1807 255 GSATRTCKEGFWTGVAPTCQYVDCGKLE---HIEHGT-VTLETTRTTHGAVAIYACHENYTLIGETRRVCGDGGKWNGTE 330 (618)
Q Consensus 255 g~~~~~C~~g~Ws~~~p~C~~~~C~~~~---~~~ng~-~~~~~~~~~~g~~~~~~C~~Gy~l~G~~~~~C~~~G~Ws~~~ 330 (618)
|-..-.|+-|.|. .|.|..-.|..-. .+..|. +.+.+ +.-|..+. +|..||. |+..+ +.-
T Consensus 840 grqCnqCqpG~Wg--FPeCr~CqCNgHA~~Cd~~tGaCi~CqD--~T~G~~Cd-rCl~Gyy--GdP~l---------g~g 903 (1758)
T KOG0994|consen 840 GRQCNQCQPGYWG--FPECRPCQCNGHADTCDPITGACIDCQD--STTGHSCD-RCLDGYY--GDPRL---------GSG 903 (1758)
T ss_pred hhhccccCCCccC--CCcCccccccCcccccCccccccccccc--cccccchh-hhhcccc--CCccc---------CCC
Confidence 4455566789996 6888876664211 111121 22211 22222222 2555663 32211 012
Q ss_pred CcccccccCCCCCccccEEEe---cccccCceeeecCCCCcEEcCCceEEEccCCeecCC--CCceeeecCCC
Q psy1807 331 PQCLFDWCAEPPQISGGIVTT---SGRRTGSVATYSCEPGFILFGSNELKCGLGGDWSGK--APTCKYVDCGS 398 (618)
Q Consensus 331 p~C~~~~C~~p~~~~~g~~~~---~~~~~g~~~~~~C~~Gy~l~G~~~~~C~~~G~Ws~~--~p~C~~~~C~~ 398 (618)
-.|.+-.| |..+..|.-.. .-..+...+...|++|| .|+.--.|. +|.|-.+ --+|+.-.|..
T Consensus 904 ~~CrPCpC--P~gp~Sg~~~A~sC~~d~~t~~ivC~C~~GY--~G~RCe~CA-~~~fGnP~~GGtCq~CeC~~ 971 (1758)
T KOG0994|consen 904 IGCRPCPC--PDGPASGRQHADSCYLDTRTQQIVCHCQEGY--SGSRCEICA-DNHFGNPSEGGTCQKCECSN 971 (1758)
T ss_pred CCCCCCCC--CCCCccchhccccccccccccceeeecccCc--cccchhhhc-ccccCCcccCCccccccccC
Confidence 34655344 44444443222 12245567778999999 577666774 7778532 24576555554
No 91
>PF15069 FAM163: FAM163 family
Probab=48.74 E-value=36 Score=30.34 Aligned_cols=26 Identities=27% Similarity=0.529 Sum_probs=13.8
Q ss_pred ceEEeehHHHHH--HHHHHHHHHHHHHH
Q psy1807 532 GYMISVGAGVVL--FLLILLAIIYARLR 557 (618)
Q Consensus 532 ~~~~~~~~~~~l--~l~vl~~~~~~~~r 557 (618)
+..++.++++.. ||.|++++.+.||+
T Consensus 4 GTvVItGgILAtVILLcIIaVLCYCRLQ 31 (143)
T PF15069_consen 4 GTVVITGGILATVILLCIIAVLCYCRLQ 31 (143)
T ss_pred eeEEEechHHHHHHHHHHHHHHHHHhhH
Confidence 444555554433 33366666666765
No 92
>PF12768 Rax2: Cortical protein marker for cell polarity
Probab=47.65 E-value=30 Score=35.09 Aligned_cols=36 Identities=25% Similarity=0.425 Sum_probs=19.5
Q ss_pred ccccceEEeehH---HHHHHHHHHHHHHHHHHHhcCCCC
Q psy1807 528 AQTLGYMISVGA---GVVLFLLILLAIIYARLRKATPVK 563 (618)
Q Consensus 528 ~~~~~~~~~~~~---~~~l~l~vl~~~~~~~~r~k~~~~ 563 (618)
...+++.+.+++ ..+++|++++.+|+.+.|||+++.
T Consensus 223 ~l~~G~VVlIslAiALG~v~ll~l~Gii~~~~~r~~~~~ 261 (281)
T PF12768_consen 223 KLSRGFVVLISLAIALGTVFLLVLIGIILAYIRRRRQGY 261 (281)
T ss_pred cccceEEEEEehHHHHHHHHHHHHHHHHHHHHHhhhccC
Confidence 344555443322 224466677777777776665543
No 93
>PF11980 DUF3481: Domain of unknown function (DUF3481); InterPro: IPR022579 This domain of unknown function is located in the C terminus of the eukaryotic neuropilin receptor family of proteins. It is found in association with PF00754 from PFAM, PF00431 from PFAM and PF00629 from PFAM. There are two completely conserved residues (Y and E) that may be functionally important.
Probab=47.19 E-value=22 Score=28.40 Aligned_cols=31 Identities=19% Similarity=0.372 Sum_probs=12.8
Q ss_pred EEeehHHHHHHHHHHHHHHHHHHHhcCCCCC
Q psy1807 534 MISVGAGVVLFLLILLAIIYARLRKATPVKN 564 (618)
Q Consensus 534 ~~~~~~~~~l~l~vl~~~~~~~~r~k~~~~~ 564 (618)
+++.++.++++..|.+++++..-|.+-..|+
T Consensus 19 iiA~gga~llL~~v~l~vvL~C~r~~~a~kk 49 (87)
T PF11980_consen 19 IIAMGGALLLLVAVCLGVVLYCHRFHWAAKK 49 (87)
T ss_pred HHhhccHHHHHHHHHHHHHHhhhhhcccccc
Confidence 3333333333333444444444444444444
No 94
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=45.33 E-value=14 Score=22.16 Aligned_cols=19 Identities=26% Similarity=0.562 Sum_probs=13.4
Q ss_pred EEEcCCCCeeeCCceeEeCc
Q psy1807 481 EFHCEPGHLLVGQASLKCTR 500 (618)
Q Consensus 481 ~~~C~~Gy~l~G~~~~~C~~ 500 (618)
+.+|.+||+|... ..+|.+
T Consensus 3 ~C~C~~Gy~l~~d-~~~C~D 21 (24)
T PF12662_consen 3 TCSCPPGYQLSPD-GRSCED 21 (24)
T ss_pred EeeCCCCCcCCCC-CCcccc
Confidence 5679999998643 366763
No 95
>smart00607 FTP eel-Fucolectin Tachylectin-4 Pentaxrin-1 Domain.
Probab=44.94 E-value=20 Score=32.61 Aligned_cols=26 Identities=35% Similarity=0.386 Sum_probs=23.8
Q ss_pred CCCceEEEEEeecCCceeeeEEEeee
Q psy1807 42 PQSKSVFIQLVGVEGSLSLCEVEIFT 67 (618)
Q Consensus 42 ~~~~~~~~~~~~~~~~l~~C~~~~~~ 67 (618)
..+|+|.++|.+.+..|+|||+.+|.
T Consensus 121 ~~GRYV~i~Lpg~~~~L~LCEVeV~~ 146 (151)
T smart00607 121 MIGRYVTVYLPKPNESLILCEVEVNA 146 (151)
T ss_pred CeeEEEEEEecCCCccEEEeEEEEcc
Confidence 38999999999988899999999986
No 96
>KOG1025|consensus
Probab=44.78 E-value=37 Score=39.26 Aligned_cols=30 Identities=27% Similarity=0.564 Sum_probs=23.6
Q ss_pred cccccCCEEEEEcCCCcee-cCCceeEecCC
Q psy1807 235 VNTKLGSTISYACPEGNML-VGSATRTCKEG 264 (618)
Q Consensus 235 ~~~~~G~~~~y~C~~Gy~l-~g~~~~~C~~g 264 (618)
..|.||.++.=.|+..|.. .|+-.+.|..+
T Consensus 274 gkY~~Ga~CVk~CP~n~lvD~~sCVRaCp~~ 304 (1177)
T KOG1025|consen 274 GKYQYGATCVKECPHNLLVDNGSCVRACPDD 304 (1177)
T ss_pred ccccccCeehhhCCchhEecCCceeeecCCc
Confidence 5699999999999999876 45566778743
No 97
>TIGR03370 PEPCTERM_Roseo variant PEP-CTERM putative exosortase signal, Roseobacter type. A probable protein export sorting signal, PEP-CTERM, was described by Haft, et al. (PubMed:16930487). It is predicted to interact with a putative transpeptidase we designate exosortase. Most examples of this signal are recognized by model TIGR02595, but some unusual clades require different models. This model describes a variant with conserved motif VPLPA, rather than VPEP. This variant is found prominently in two members of the Rhodobacterales, namely Jannaschia sp. CCS1 and Roseobacter denitrificans OCh 114. One interesting member protein has a full-length duplication and therefore two copies of this putative sorting domain.
Probab=44.13 E-value=37 Score=20.83 Aligned_cols=20 Identities=30% Similarity=0.295 Sum_probs=10.8
Q ss_pred HHHHHHHHHHHHHHHHHHhc
Q psy1807 540 GVVLFLLILLAIIYARLRKA 559 (618)
Q Consensus 540 ~~~l~l~vl~~~~~~~~r~k 559 (618)
++.+++..|..+-..|||+|
T Consensus 6 ~~~LLl~gLggl~~~rRRrk 25 (26)
T TIGR03370 6 GALLLLAGLGGLGAMRRRRR 25 (26)
T ss_pred hHHHHHHHHHHHHHHHHhhc
Confidence 44555555555555555554
No 98
>KOG3653|consensus
Probab=43.45 E-value=98 Score=33.50 Aligned_cols=29 Identities=31% Similarity=0.518 Sum_probs=16.0
Q ss_pred CcceEEeccCCeee--CCCCcee-eecCCCCCcC
Q psy1807 434 AMRRQVCMRDGKWS--GDTPSCE-LITCPEPEVP 464 (618)
Q Consensus 434 G~~~~~C~~~G~Ws--~~~p~C~-~~~C~~p~~~ 464 (618)
|...+.+ .|-|+ .+.+.|. ...|..-..+
T Consensus 78 ~~~~~v~--qGC~~~~~D~~~c~~~~eCv~s~~~ 109 (534)
T KOG3653|consen 78 GTIEVVK--QGCWSHITDDINCEDSSECVVSAEP 109 (534)
T ss_pred CceeEEe--ecCccccCCccccccccccccCCCC
Confidence 4445555 47775 4566676 4466554333
No 99
>PF05393 Hum_adeno_E3A: Human adenovirus early E3A glycoprotein; InterPro: IPR008652 This family consists of several early glycoproteins (E3A), from human adenovirus type 2.; GO: 0016021 integral to membrane
Probab=43.41 E-value=42 Score=27.05 Aligned_cols=20 Identities=10% Similarity=0.318 Sum_probs=9.8
Q ss_pred HHHHHHHHHHHHHHHHhcCC
Q psy1807 542 VLFLLILLAIIYARLRKATP 561 (618)
Q Consensus 542 ~l~l~vl~~~~~~~~r~k~~ 561 (618)
++|++++++++.+.++||+.
T Consensus 42 ~iFil~VilwfvCC~kRkrs 61 (94)
T PF05393_consen 42 GIFILLVILWFVCCKKRKRS 61 (94)
T ss_pred HHHHHHHHHHHHHHHHhhhc
Confidence 34455555555554444333
No 100
>PF03302 VSP: Giardia variant-specific surface protein; InterPro: IPR005127 During infection, the intestinal protozoan parasite Giardia lamblia virus undergoes continuous antigenic variation which is determined by diversification of the parasite's major surface antigen, named VSP (variant surface protein).
Probab=41.90 E-value=20 Score=38.28 Aligned_cols=31 Identities=23% Similarity=0.390 Sum_probs=19.4
Q ss_pred EEcCCCCeeeCCceeEeCc---CCcccCCCCceEE
Q psy1807 482 FHCEPGHLLVGQASLKCTR---QGDWDNMSPQFIS 513 (618)
Q Consensus 482 ~~C~~Gy~l~G~~~~~C~~---~G~Ws~~~P~C~~ 513 (618)
..|.+||.+.+..-+.|.. +..|.+ .+.|..
T Consensus 304 t~C~~Gyy~~~~~c~~C~~c~~~~~v~G-v~~C~~ 337 (397)
T PF03302_consen 304 TACADGYYKSGSGCVKCTSCSSNNGVTG-VKGCVS 337 (397)
T ss_pred cchhccccccccCCccccccCCCCcCcC-CCcCcc
Confidence 5788888877776667764 223554 455654
No 101
>PF02038 ATP1G1_PLM_MAT8: ATP1G1/PLM/MAT8 family; InterPro: IPR000272 The FXYD protein family contains at least seven members in mammals []. Two other family members that are not obvious orthologs of any identified mammalian FXYD protein exist in zebrafish. All these proteins share a signature sequence of six conserved amino acids comprising the FXYD motif in the NH2-terminus, and two glycines and one serine residue in the transmembrane domain. FXYD proteins are widely distributed in mammalian tissues with prominent expression in tissues that perform fluid and solute transport or that are electrically excitable. Initial functional characterisation suggested that FXYD proteins act as channels or as modulators of ion channels however studies have revealed that most FXYD proteins have another specific function and act as tissue-specific regulatory subunits of the Na,K-ATPase. Each of these auxiliary subunits produces a distinct functional effect on the transport characteristics of the Na,K-ATPase that is adjusted to the specific functional demands of the tissue in which the FXYD protein is expressed. FXYD proteins appear to preferentially associate with Na,K-ATPase alpha1-beta isozymes, and affect their function in a way that render them operationally complementary or supplementary to coexisting isozymes.; GO: 0005216 ion channel activity, 0006811 ion transport, 0016020 membrane; PDB: 2JO1_A 2JP3_A 2ZXE_G 3A3Y_G 3N23_E 3B8E_H 3KDP_G 3N2F_E.
Probab=41.31 E-value=13 Score=26.64 Aligned_cols=26 Identities=27% Similarity=0.230 Sum_probs=13.6
Q ss_pred hHHHHHHHHHHHHHHHHHHHhcCCCC
Q psy1807 538 GAGVVLFLLILLAIIYARLRKATPVK 563 (618)
Q Consensus 538 ~~~~~l~l~vl~~~~~~~~r~k~~~~ 563 (618)
+..++|+++.+++++-.|.|.|.+.+
T Consensus 20 i~A~vlfi~Gi~iils~kckCk~~qk 45 (50)
T PF02038_consen 20 IFAGVLFILGILIILSGKCKCKFNQK 45 (50)
T ss_dssp HHHHHHHHHHHHHHCTTHHHHHHSTT
T ss_pred HHHHHHHHHHHHHHHcCccccCCCCC
Confidence 33445555555555555666544443
No 102
>KOG1219|consensus
Probab=40.90 E-value=1.5e+02 Score=38.57 Aligned_cols=26 Identities=19% Similarity=0.643 Sum_probs=17.7
Q ss_pred EEEEeCCCceecCCCeEEeCCCCCcCC
Q psy1807 302 AIYACHENYTLIGETRRVCGDGGKWNG 328 (618)
Q Consensus 302 ~~~~C~~Gy~l~G~~~~~C~~~G~Ws~ 328 (618)
..+.|..|....|...++-. +|+|..
T Consensus 3742 l~~~cgsG~Givg~q~~~Vn-DgqWHs 3767 (4289)
T KOG1219|consen 3742 LLADCGSGPGIVGSQKRTVN-DGQWHS 3767 (4289)
T ss_pred EEEecCCCCCcccccceEee-cCceeE
Confidence 35578888866676655544 778865
No 103
>KOG4289|consensus
Probab=39.68 E-value=1e+02 Score=37.87 Aligned_cols=139 Identities=26% Similarity=0.443 Sum_probs=83.0
Q ss_pred eecCCCCcEEcC---CceEEEccCCeecCCCCceeeecCCCC------CccCCCceEeecCccc-ccceEEEEcCCCcee
Q psy1807 361 TYSCEPGFILFG---SNELKCGLGGDWSGKAPTCKYVDCGSP------ANIDNGHYSLLNGTTT-HGSVAEYTCDDDYWL 430 (618)
Q Consensus 361 ~~~C~~Gy~l~G---~~~~~C~~~G~Ws~~~p~C~~~~C~~p------~~~~~g~~~~~~~~~~-~g~~~~~~C~~Gy~l 430 (618)
+..|.+||.=.- .....| ..| |-+ .|+|.+-.|... ..-++|.-.|....|. .|+ =+.|+=+
T Consensus 1740 ~C~C~~g~~G~~Ce~~~dq~C-PrG-WWG-~P~CgpC~CavsKgfdp~CnKt~G~CqCKe~hy~~~~~--Cl~CdC~--- 1811 (2531)
T KOG4289|consen 1740 TCECPPGYTGPYCELRADQPC-PRG-WWG-FPTCGPCNCAVSKGFDPDCNKTNGQCQCKENHYRPIGS--CLPCDCY--- 1811 (2531)
T ss_pred eEECCCcccCcchhhhccCCC-CCc-ccC-CCCccCccccccCCCCCCccccCcceeeccccccCCCc--ceeeccc---
Confidence 457888874210 123456 356 555 899987666543 2345777777655553 455 3445555
Q ss_pred ccCCcceEEeccCCeeeCCC----CceeeecCCCCCc--CCCcEEE----c----------ccCCCCcEEEEEcCCCCee
Q psy1807 431 EAGAMRRQVCMRDGKWSGDT----PSCELITCPEPEV--PKGGYVV----G----------YDLNVHSTLEFHCEPGHLL 490 (618)
Q Consensus 431 ~~~G~~~~~C~~~G~Ws~~~----p~C~~~~C~~p~~--~~~g~~~----~----------~~~~~g~~v~~~C~~Gy~l 490 (618)
. |+.++.|.++|+--=.+ -.|. .|..|-. -..|-.. + ...++|..+...|..|-
T Consensus 1812 ~--Gs~Sr~C~adGqC~C~pgaiGRqCd--rCd~pfaevttlgCrv~y~~CPk~ie~giwWP~tkfG~~a~~pCP~G~-- 1885 (2531)
T KOG4289|consen 1812 F--GSDSRECDADGQCPCKPGAIGRQCD--RCDNPFAEVTTLGCRVVYDKCPKAIEAGIWWPQTKFGSPAAVPCPKGS-- 1885 (2531)
T ss_pred c--CCCcccccCCCcCCCCCcccccccc--ccCChhhhccccCcEEccccCcHHHhhcccccccccCCcccccCCCCc--
Confidence 3 88899998888642100 0121 3444421 1122111 1 13578999999999995
Q ss_pred eCCceeEeCcCCcccCC-CCceEE
Q psy1807 491 VGQASLKCTRQGDWDNM-SPQFIS 513 (618)
Q Consensus 491 ~G~~~~~C~~~G~Ws~~-~P~C~~ 513 (618)
.|.+...|.+.|.|.++ +-.|+.
T Consensus 1886 ~G~AvR~C~e~~gWl~pnlfNCTs 1909 (2531)
T KOG4289|consen 1886 SGLAVRHCDETGGWLPPNLFNCTS 1909 (2531)
T ss_pred chhhhhhhhccCCCCCchhcccCC
Confidence 48889999999999974 445665
No 104
>PF02009 Rifin_STEVOR: Rifin/stevor family; InterPro: IPR002858 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see []. Several multicopy gene families have been described in Plasmodium falciparum, including the stevor family of subtelomeric open reading frames and the rif interspersed repetitive elements. Both families contain three predicted transmembrane segments. It has been proposed that stevor and rif are members of a larger superfamily that code for variant surface antigens [].
Probab=39.26 E-value=16 Score=37.33 Aligned_cols=27 Identities=26% Similarity=0.410 Sum_probs=19.0
Q ss_pred hHHHHHHHHHHHHHHHHHHHhcCCCCC
Q psy1807 538 GAGVVLFLLILLAIIYARLRKATPVKN 564 (618)
Q Consensus 538 ~~~~~l~l~vl~~~~~~~~r~k~~~~~ 564 (618)
.+.++|++++++++||..||-||++|-
T Consensus 261 SiiaIliIVLIMvIIYLILRYRRKKKm 287 (299)
T PF02009_consen 261 SIIAILIIVLIMVIIYLILRYRRKKKM 287 (299)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhh
Confidence 334455555667788889998888765
No 105
>TIGR03778 VPDSG_CTERM VPDSG-CTERM exosortase interaction domain. Through in silico analysis, we previously described the PEP-CTERM/exosortase system (PubMed:16930487). This model describes a PEP-CTERM-like variant C-terminal protein sorting signal, as found at the C-terminus of twenty otherwise unrelated proteins in Verrucomicrobiae bacterium DG1235. The variant motif, VPDSG, seems an intermediate between the VPEP motif (TIGR02595) of typical exosortase systems and the classical LPXTG of sortase in Gram-positive bacteria.
Probab=37.86 E-value=46 Score=20.42 Aligned_cols=19 Identities=32% Similarity=0.336 Sum_probs=10.3
Q ss_pred HHHHHHHHHHHHHHHHHHH
Q psy1807 539 AGVVLFLLILLAIIYARLR 557 (618)
Q Consensus 539 ~~~~l~l~vl~~~~~~~~r 557 (618)
..++|+.+.+++++..|||
T Consensus 6 ST~~Ll~~~l~~l~~~rRr 24 (26)
T TIGR03778 6 STLALLGLGLLGLLGLRRR 24 (26)
T ss_pred hHHHHHHHHHHHHHHHhhc
Confidence 3445555555555555554
No 106
>PF06084 Cytomega_TRL10: Cytomegalovirus TRL10 protein; InterPro: IPR009284 This family consists of several Cytomegalovirus TRL10 proteins. TRL10 represents a structural component of the virus particle and like the other HCMV envelope glycoproteins, is present in a disulphide-linked complex [].
Probab=35.50 E-value=16 Score=30.82 Aligned_cols=17 Identities=29% Similarity=0.540 Sum_probs=12.1
Q ss_pred cchhhchh----hhccCChhh
Q psy1807 593 DNIYENVA----ENTYDNPYE 609 (618)
Q Consensus 593 ~~~~~~~~----~~~~~~~~~ 609 (618)
.||||.|| .|--|+|-.
T Consensus 115 qniyeripyrp~rq~d~~p~~ 135 (150)
T PF06084_consen 115 QNIYERIPYRPCRQNDNSPPI 135 (150)
T ss_pred cchhhhcCCCcccccCCCCcc
Confidence 38999998 566666643
No 107
>TIGR02595 PEP_exosort PEP-CTERM putative exosortase interaction domain. This model describes a 25-residue domain that includes a near-invariant Pro-Glu-Pro (PEP) motif, a thirteen residue strongly hydrophobic sequence likely to span the membrane, and a five-residue strongly basic motif that often contains four Arg residues. In nearly every case, this motif is found within nine residues, and usually within five residues, of the extreme C-terminus of the protein. Proteins with this motif typically have signal sequences at the N-terminus. This region appears many times per genome or not at all, and co-occurs in genomes with a proposed protein-sorting integral membrane protein we designate exosortase (see TIGR02602). PEP-CTERM proteins frequently are poorly conserved, Ser/Thr-rich proteins and may become extensively modified proteinaceous constituents of extracellular material in bacterial biofilms.
Probab=34.42 E-value=48 Score=20.28 Aligned_cols=12 Identities=33% Similarity=0.329 Sum_probs=4.3
Q ss_pred HHHHHHHHHHHH
Q psy1807 543 LFLLILLAIIYA 554 (618)
Q Consensus 543 l~l~vl~~~~~~ 554 (618)
+++++++++..+
T Consensus 9 ll~~g~~~~~~r 20 (26)
T TIGR02595 9 LLLLGLGFLLLR 20 (26)
T ss_pred HHHHHHHHHHHh
Confidence 333333333333
No 108
>PF07010 Endomucin: Endomucin; InterPro: IPR010740 This family consists of several mammalian endomucin proteins. Endomucin is an early endothelial-specific antigen that is also expressed on putative hematopoietic progenitor cells.
Probab=33.13 E-value=1.2e+02 Score=29.27 Aligned_cols=49 Identities=16% Similarity=0.297 Sum_probs=31.5
Q ss_pred HHHHHHHHHHHHHhcCCCCCCCccccccccceeceeEeeeeeccCCCCcc
Q psy1807 545 LLILLAIIYARLRKATPVKNTENIEGAVRKEEQTAAVMSYASLSDGGGDN 594 (618)
Q Consensus 545 l~vl~~~~~~~~r~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 594 (618)
++++|+-+|+-.||+... .+.+....+..+.++..|++--.++-+-||.
T Consensus 203 ~vf~LvgLyr~C~k~dPg-~p~~g~~qpqsdke~vklltvkt~s~e~ge~ 251 (259)
T PF07010_consen 203 SVFTLVGLYRMCWKTDPG-TPENGPDQPQSDKESVKLLTVKTISHESGEH 251 (259)
T ss_pred HHHHHHHHHHHhhcCCCC-CcccCCCCCCccccceeEEEEEecccCCccc
Confidence 335555666666676665 3455555555567777888887777777753
No 109
>KOG4564|consensus
Probab=32.64 E-value=45 Score=36.25 Aligned_cols=38 Identities=24% Similarity=0.510 Sum_probs=27.0
Q ss_pred cccceEEEEcCCCcee---ccCCcceEEeccCCeeeCCCCc
Q psy1807 415 THGSVAEYTCDDDYWL---EAGAMRRQVCMRDGKWSGDTPS 452 (618)
Q Consensus 415 ~~g~~~~~~C~~Gy~l---~~~G~~~~~C~~~G~Ws~~~p~ 452 (618)
..|+.++..|.+.+.. .+.|...+.|..||+|+..++.
T Consensus 78 ~~G~~v~~~CP~yf~~~~~~~~~~v~R~C~~~G~W~~~~~~ 118 (473)
T KOG4564|consen 78 PAGTLVTVPCPDYFPGFSYSTSGNVTRNCTSNGTWSERPPS 118 (473)
T ss_pred CCCceEEecCccccCCCcccCCCceEeecCCCCccCCCCCc
Confidence 3467777788755432 2348889999999999976553
No 110
>PF08114 PMP1_2: ATPase proteolipid family; InterPro: IPR012589 This family consists of small proteolipids associated with the plasma membrane H+ ATPase. Two proteolipids (PMP1 and PMP2) are associated with the ATPase and both genes are similarly expressed in the wild-type strain of yeast. No modification of the level of transcription of one PMP gene is detected in a strain deleted of the other. Though both proteolipids show similarity with other small proteolipids associated with other cation -transporting ATPases, their functions remain unclear [].
Probab=32.05 E-value=46 Score=22.69 Aligned_cols=15 Identities=20% Similarity=0.386 Sum_probs=7.6
Q ss_pred HHHHHHHHHHHhcCC
Q psy1807 547 ILLAIIYARLRKATP 561 (618)
Q Consensus 547 vl~~~~~~~~r~k~~ 561 (618)
++.+++|+++..|++
T Consensus 24 iva~~iYRKw~aRkr 38 (43)
T PF08114_consen 24 IVALFIYRKWQARKR 38 (43)
T ss_pred HHHHHHHHHHHHHHH
Confidence 444455665554444
No 111
>PHA03265 envelope glycoprotein D; Provisional
Probab=30.91 E-value=42 Score=34.42 Aligned_cols=8 Identities=13% Similarity=0.347 Sum_probs=3.1
Q ss_pred ccccceEE
Q psy1807 528 AQTLGYMI 535 (618)
Q Consensus 528 ~~~~~~~~ 535 (618)
++..+..+
T Consensus 347 ~~~~g~~i 354 (402)
T PHA03265 347 STFVGISV 354 (402)
T ss_pred CcccceEE
Confidence 33334333
No 112
>PHA03242 envelope glycoprotein M; Provisional
Probab=28.16 E-value=52 Score=35.15 Aligned_cols=27 Identities=33% Similarity=0.590 Sum_probs=16.7
Q ss_pred CCCCcchhhchhhhccCChhhhhhhhccc
Q psy1807 589 DGGGDNIYENVAENTYDNPYETALAFAEW 617 (618)
Q Consensus 589 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 617 (618)
++.+|-|||.+++..-++.| +-+.++|
T Consensus 402 ~~~~e~iYd~v~~~~~~~~~--~~~~~~~ 428 (428)
T PHA03242 402 STDGEPIYDEVAPDEDEVLY--ARIQHEW 428 (428)
T ss_pred ccCCCccccccCCCcchhHH--HhhccCC
Confidence 36677999999854444444 4444455
No 113
>PF12606 RELT: Tumour necrosis factor receptor superfamily member 19; InterPro: IPR022248 The members of tumor necrosis factor receptor (TNFR) superfamily have been designated as the "guardians of the immune system" due to their roles in immune cell proliferation, differentiation, activation, and death (apoptosis). RELT (receptor expressed in lymphoid tissues) is a member of the TNFR superfamily. The messenger RNA of RELT is especially abundant in hematologic tissues such as spleen, lymph node, and peripheral blood leukocytes as well as in leukemias and lymphomas. RELT is able to activate the NF-kappaB pathway and selectively binds tumor necrosis factor receptor-associated factor 1 []. RELT like proteins 1 and 2 (RELL1 and RELL2) are two RELT homologues that bind to RELT. The expression of RELL1 at the mRNA level is ubiquitous, whereas expression of RELL2 mRNA is more restricted to particular tissues [].
Probab=27.86 E-value=1.3e+02 Score=21.81 Aligned_cols=24 Identities=21% Similarity=0.227 Sum_probs=12.9
Q ss_pred HHHHHHHHHHHHHHHHHhcCCCCC
Q psy1807 541 VVLFLLILLAIIYARLRKATPVKN 564 (618)
Q Consensus 541 ~~l~l~vl~~~~~~~~r~k~~~~~ 564 (618)
.+++++++|++..+...||..-|.
T Consensus 8 ~i~iv~~lLg~~I~~~~K~ygYkh 31 (50)
T PF12606_consen 8 SIFIVMGLLGLSICTTLKAYGYKH 31 (50)
T ss_pred HHHHHHHHHHHHHHHHhhcccccc
Confidence 344455555555565555555544
No 114
>PF12877 DUF3827: Domain of unknown function (DUF3827); InterPro: IPR024606 The function of the proteins in this entry is not currently known, but one of the human proteins (Q9HCM3 from SWISSPROT) has been implicated in pilocytic astrocytomas [, , ]. In the majority of cases of pilocytic astrocytomas a tandem duplication produces an in-frame fusion of the gene encoding this protein and the BRAF oncogene. The resulting fusion protein has constitutive BRAF kinase activity and is capable of transforming cells.
Probab=27.70 E-value=91 Score=34.93 Aligned_cols=40 Identities=25% Similarity=0.310 Sum_probs=17.3
Q ss_pred ccccceEEeehHHH-HHHHHHHHHHHHH-HHHhcCCCCCCCc
Q psy1807 528 AQTLGYMISVGAGV-VLFLLILLAIIYA-RLRKATPVKNTEN 567 (618)
Q Consensus 528 ~~~~~~~~~~~~~~-~l~l~vl~~~~~~-~~r~k~~~~~~~~ 567 (618)
......||+++..+ +++++++++++++ -.|+++..+++..
T Consensus 264 s~~~NlWII~gVlvPv~vV~~Iiiil~~~LCRk~K~eFqpDa 305 (684)
T PF12877_consen 264 SPPNNLWIIAGVLVPVLVVLLIIIILYWKLCRKNKLEFQPDA 305 (684)
T ss_pred CCCCCeEEEehHhHHHHHHHHHHHHHHHHHhcccccCCCchh
Confidence 33344555544433 3333333334443 4455555555443
No 115
>PF07979 Intimin_C: Intimin C-type lectin domain; InterPro: IPR013117 This domain is found at the C terminus of intimin. Its structure has been solved and shown to have a C-lectin type of structure []. Intimin is a bacterial adhesion molecule involved in intimate attachment of enteropathogenic and enterohemorrhagic Escherichia coli to mammalian host cells. Intimin targets the translocated intimin receptor (Tir), which is exported by the bacteria and integrated into the host cell plasma membrane.; GO: 0005488 binding, 0009405 pathogenesis, 0009986 cell surface; PDB: 1CWV_A 2ZQK_B 2ZWK_C 1F02_I 1E5U_I 1F00_I 3NCX_B 3NCW_D.
Probab=27.67 E-value=42 Score=28.07 Aligned_cols=31 Identities=19% Similarity=0.306 Sum_probs=24.1
Q ss_pred CCCCHHHHHHHHHHcCCeecccCChHHHHHHHH
Q psy1807 100 KGASFADARSKCQASGGDLVHGFRGITNTFLLG 132 (618)
Q Consensus 100 ~~~tw~~A~~~C~~~g~~L~~i~s~~e~~fl~~ 132 (618)
.+++|.+|...|+..+|.|.+- ++|+.-|..
T Consensus 11 ~~~~Y~~A~~~C~~~s~~LpsS--~~~L~~vy~ 41 (101)
T PF07979_consen 11 SRVTYSEAESICQNNSGRLPSS--QSELKDVYN 41 (101)
T ss_dssp CEETHHHHHHHTTTTCCESBSS--HHHHHHHHH
T ss_pred ceEeHHHHHHHHHhccccCccc--HHHHHHHHH
Confidence 3689999999999999999974 455544443
No 116
>PF15345 TMEM51: Transmembrane protein 51
Probab=27.49 E-value=1.1e+02 Score=29.73 Aligned_cols=25 Identities=36% Similarity=0.525 Sum_probs=14.4
Q ss_pred ehHHHHHHHHHHHHHHHHHHHhcCC
Q psy1807 537 VGAGVVLFLLILLAIIYARLRKATP 561 (618)
Q Consensus 537 ~~~~~~l~l~vl~~~~~~~~r~k~~ 561 (618)
|+++++|+||.+++-|.-|+|++..
T Consensus 65 VG~Gv~LLLLSICL~IR~KRr~rq~ 89 (233)
T PF15345_consen 65 VGSGVALLLLSICLSIRDKRRRRQG 89 (233)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHhhc
Confidence 4556666666666665554444443
No 117
>PF06523 DUF1106: Protein of unknown function (DUF1106); InterPro: IPR009490 This family consists of several hypothetical bacterial proteins found in Escherichia coli and Citrobacter rodentium. The function of this family is unknown.
Probab=26.29 E-value=18 Score=27.83 Aligned_cols=22 Identities=18% Similarity=0.360 Sum_probs=19.4
Q ss_pred chhhchhhhccCChhhhhhhhc
Q psy1807 594 NIYENVAENTYDNPYETALAFA 615 (618)
Q Consensus 594 ~~~~~~~~~~~~~~~~~~~~~~ 615 (618)
-|+|.++|.++|+|-|+++|..
T Consensus 69 yi~dklkeifpde~iefsieye 90 (91)
T PF06523_consen 69 YIFDKLKEIFPDEAIEFSIEYE 90 (91)
T ss_pred hHHHHHHHhCCCCceEEEEEec
Confidence 6899999999999999998864
No 118
>PF04689 S1FA: DNA binding protein S1FA; InterPro: IPR006779 S1FA is an unusual small plant peptide of only 70 amino acids with a basic domain which contains a nuclear localization signal and a putative DNA binding helix. S1FA is highly conserved between dicotyledonous and monocotyledonous plants and may be a DNA-binding protein that specifically recognises the negative promoter element S1F [].; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=26.19 E-value=1.7e+02 Score=22.17 Aligned_cols=24 Identities=29% Similarity=0.422 Sum_probs=13.1
Q ss_pred eehHHHHHHHHHHHHHHHHHHHhc
Q psy1807 536 SVGAGVVLFLLILLAIIYARLRKA 559 (618)
Q Consensus 536 ~~~~~~~l~l~vl~~~~~~~~r~k 559 (618)
.++++++++.++..+++|...++.
T Consensus 18 lvV~g~ll~flvGnyvlY~Yaqk~ 41 (69)
T PF04689_consen 18 LVVAGLLLVFLVGNYVLYVYAQKT 41 (69)
T ss_pred ehHHHHHHHHHHHHHHHHHHHhhc
Confidence 355555555555555555554443
No 119
>PF01708 Gemini_mov: Geminivirus putative movement protein ; InterPro: IPR002621 This family consists of putative movement proteins from Maize streak virus and Wheat dwarf virus [].; GO: 0046740 spread of virus in host, cell to cell, 0016021 integral to membrane
Probab=26.13 E-value=1.2e+02 Score=24.72 Aligned_cols=12 Identities=42% Similarity=0.476 Sum_probs=5.6
Q ss_pred hcCCCCCCCccc
Q psy1807 558 KATPVKNTENIE 569 (618)
Q Consensus 558 ~k~~~~~~~~~~ 569 (618)
|.++.++|.++.
T Consensus 67 KAkrqrsTeEig 78 (91)
T PF01708_consen 67 KAKRQRSTEEIG 78 (91)
T ss_pred eeccCCceeeee
Confidence 444444455443
No 120
>PF13268 DUF4059: Protein of unknown function (DUF4059)
Probab=25.81 E-value=55 Score=25.37 Aligned_cols=25 Identities=20% Similarity=0.125 Sum_probs=16.1
Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCC
Q psy1807 539 AGVVLFLLILLAIIYARLRKATPVK 563 (618)
Q Consensus 539 ~~~~l~l~vl~~~~~~~~r~k~~~~ 563 (618)
.+.++++++.++|+.+|.++|+.+.
T Consensus 15 ls~i~V~~~~~~wi~~Ra~~~~DKT 39 (72)
T PF13268_consen 15 LSSILVLLVSGIWILWRALRKKDKT 39 (72)
T ss_pred HHHHHHHHHHHHHHHHHHHHcCCCc
Confidence 3445555677777777777766653
No 121
>PF08374 Protocadherin: Protocadherin; InterPro: IPR013585 The structure of protocadherins is similar to that of classic cadherins (IPR002126 from INTERPRO), but they also have some unique features associated with the cytoplasmic domains. They are expressed in a variety of organisms and are found in high concentrations in the brain where they seem to be localised mainly at cell-cell contact sites. Their expression seems to be developmentally regulated [].
Probab=25.06 E-value=90 Score=29.94 Aligned_cols=25 Identities=4% Similarity=0.234 Sum_probs=14.5
Q ss_pred ccceEEeehHHHHHHHHHHHHHHHH
Q psy1807 530 TLGYMISVGAGVVLFLLILLAIIYA 554 (618)
Q Consensus 530 ~~~~~~~~~~~~~l~l~vl~~~~~~ 554 (618)
..+.+.++++++|+++++.++...+
T Consensus 40 ~iaiVAG~~tVILVI~i~v~vR~CR 64 (221)
T PF08374_consen 40 MIAIVAGIMTVILVIFIVVLVRYCR 64 (221)
T ss_pred eeeeecchhhhHHHHHHHHHHHHHh
Confidence 3445555666666666666666444
No 122
>PF05545 FixQ: Cbb3-type cytochrome oxidase component FixQ; InterPro: IPR008621 This family consists of several Cbb3-type cytochrome oxidase components (FixQ/CcoQ). FixQ is found in nitrogen fixing bacteria. Since nitrogen fixation is an energy-consuming process, effective symbioses depend on operation of a respiratory chain with a high affinity for O2, closely coupled to ATP production. This requirement is fulfilled by a special three-subunit terminal oxidase (cytochrome terminal oxidase cbb3), which was first identified in Bradyrhizobium japonicum as the product of the fixNOQP operon [].
Probab=24.90 E-value=84 Score=22.36 Aligned_cols=15 Identities=13% Similarity=0.319 Sum_probs=6.6
Q ss_pred HHHHHHHHHHHHhcC
Q psy1807 546 LILLAIIYARLRKAT 560 (618)
Q Consensus 546 ~vl~~~~~~~~r~k~ 560 (618)
+++++++++-.|+|+
T Consensus 20 ~~F~gi~~w~~~~~~ 34 (49)
T PF05545_consen 20 VFFIGIVIWAYRPRN 34 (49)
T ss_pred HHHHHHHHHHHcccc
Confidence 344444444444443
No 123
>PF06247 Plasmod_Pvs28: Plasmodium ookinete surface protein Pvs28; InterPro: IPR010423 This family consists of several ookinete surface protein (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential candidates for vaccine and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals [].; GO: 0009986 cell surface, 0016020 membrane; PDB: 1Z3G_B 1Z1Y_B 1Z27_A.
Probab=24.84 E-value=54 Score=30.72 Aligned_cols=52 Identities=23% Similarity=0.377 Sum_probs=28.4
Q ss_pred CEEEEEeCCCceecCCCeEEeCCCCCcCCCCCcccccccCCCCCccccEEEecccccCceeeecCCCCcE
Q psy1807 300 AVAIYACHENYTLIGETRRVCGDGGKWNGTEPQCLFDWCAEPPQISGGIVTTSGRRTGSVATYSCEPGFI 369 (618)
Q Consensus 300 ~~~~~~C~~Gy~l~G~~~~~C~~~G~Ws~~~p~C~~~~C~~p~~~~~g~~~~~~~~~g~~~~~~C~~Gy~ 369 (618)
......|..||.+... .|. +..|....|+ +|.-........+. ..+|+-|+.
T Consensus 68 ~~~~C~C~~gY~~~~~---vCv--------p~~C~~~~Cg------~GKCI~d~~~~~~~-~CSC~IGkV 119 (197)
T PF06247_consen 68 RAYKCDCINGYILKQG---VCV--------PNKCNNKDCG------SGKCILDPDNPNNP-TCSCNIGKV 119 (197)
T ss_dssp TSEEEEE-TTEEESSS---SEE--------EGGGSS---T------TEEEEEEEGGGSEE-EEEE-TEEE
T ss_pred eeEEEecccCceeeCC---eEc--------hhhcCceecC------CCeEEecCCCCCCc-eeEeeeceE
Confidence 4566778999988422 333 3455555554 35555444444444 789999987
No 124
>PTZ00046 rifin; Provisional
Probab=24.76 E-value=48 Score=34.48 Aligned_cols=23 Identities=26% Similarity=0.505 Sum_probs=16.0
Q ss_pred HHHHHHHHHHHHHHHHhcCCCCC
Q psy1807 542 VLFLLILLAIIYARLRKATPVKN 564 (618)
Q Consensus 542 ~l~l~vl~~~~~~~~r~k~~~~~ 564 (618)
+++++.+.+++|..||-||++|-
T Consensus 324 IvVIVLIMvIIYLILRYRRKKKM 346 (358)
T PTZ00046 324 IVVIVLIMVIIYLILRYRRKKKM 346 (358)
T ss_pred HHHHHHHHHHHHHHHHhhhcchh
Confidence 34444555678889999988764
No 125
>PF01299 Lamp: Lysosome-associated membrane glycoprotein (Lamp); InterPro: IPR002000 Lysosome-associated membrane glycoproteins (lamp) [] are integral membrane proteins, specific to lysosomes, and whose exact biological function is not yet clear. Structurally, the lamp proteins consist of two internally homologous lysosome-luminal domains separated by a proline-rich hinge region; at the C-terminal extremity there is a transmembrane region (TM) followed by a very short cytoplasmic tail (C). In each of the duplicated domains, there are two conserved disulphide bonds. This structure is schematically represented in the figure below. +-----+ +-----+ +-----+ +-----+ | | | | | | | | xCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxxxCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxx +--------------------------++Hinge++--------------------------++TM++C+ In mammals, there are two closely related types of lamp: lamp-1 and lamp-2, which form major components of the lysosome membrane. In chicken lamp-1 is known as LEP100. Also included in this entry is the macrophage protein CD68 (or macrosialin) [] is a heavily glycosylated integral membrane protein whose structure consists of a mucin-like domain followed by a proline-rich hinge; a single lamp-like domain; a transmembrane region and a short cytoplasmic tail. Similar to CD68, mammalian lamp-3, which is expressed in lymphoid organs, dendritic cells and in lung, contains all the C-terminal regions but lacks the N-terminal lamp-like region []. In a lamp-family protein from nematodes [] only the part C-terminal to the hinge is conserved. ; GO: 0016020 membrane
Probab=24.75 E-value=24 Score=36.33 Aligned_cols=13 Identities=31% Similarity=0.536 Sum_probs=5.1
Q ss_pred HHHHHHHHHHHhc
Q psy1807 547 ILLAIIYARLRKA 559 (618)
Q Consensus 547 vl~~~~~~~~r~k 559 (618)
||+..++.|||.+
T Consensus 288 vLiaYli~Rrr~~ 300 (306)
T PF01299_consen 288 VLIAYLIGRRRSR 300 (306)
T ss_pred HHHhheeEecccc
Confidence 3333333344433
No 126
>TIGR01477 RIFIN variant surface antigen, rifin family. This model represents the rifin branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of rifin sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 20 bits.
Probab=24.44 E-value=49 Score=34.33 Aligned_cols=23 Identities=30% Similarity=0.550 Sum_probs=16.1
Q ss_pred HHHHHHHHHHHHHHHHhcCCCCC
Q psy1807 542 VLFLLILLAIIYARLRKATPVKN 564 (618)
Q Consensus 542 ~l~l~vl~~~~~~~~r~k~~~~~ 564 (618)
+++++.+.+++|..||-||++|-
T Consensus 319 IvvIVLIMvIIYLILRYRRKKKM 341 (353)
T TIGR01477 319 ILIIVLIMVIIYLILRYRRKKKM 341 (353)
T ss_pred HHHHHHHHHHHHHHHHhhhcchh
Confidence 34444555688889999998764
No 127
>KOG0196|consensus
Probab=24.31 E-value=1.2e+02 Score=35.19 Aligned_cols=29 Identities=24% Similarity=0.446 Sum_probs=18.4
Q ss_pred EeCCCceec--CCCeEEeCCCCCcCCCCCcc
Q psy1807 305 ACHENYTLI--GETRRVCGDGGKWNGTEPQC 333 (618)
Q Consensus 305 ~C~~Gy~l~--G~~~~~C~~~G~Ws~~~p~C 333 (618)
+|-..-.-. +.....|..+|.|--+.-.|
T Consensus 231 tCV~nA~~~~~~~p~m~C~~dGeWlvpiG~C 261 (996)
T KOG0196|consen 231 SCVPNAEEEPVPVPRMYCSGDGEWLVPIGGC 261 (996)
T ss_pred cccCCCccCCCCCceeEEcCCCcEEEEcCce
Confidence 465543333 44678899999997655544
No 128
>PTZ00382 Variant-specific surface protein (VSP); Provisional
Probab=24.28 E-value=50 Score=27.51 Aligned_cols=7 Identities=43% Similarity=0.644 Sum_probs=3.4
Q ss_pred cCCCcee
Q psy1807 424 CDDDYWL 430 (618)
Q Consensus 424 C~~Gy~l 430 (618)
|.+||..
T Consensus 9 C~~g~~~ 15 (96)
T PTZ00382 9 CDSDKKP 15 (96)
T ss_pred CCCCCcc
Confidence 4455543
No 129
>PF14851 FAM176: FAM176 family
Probab=24.17 E-value=20 Score=32.62 Aligned_cols=27 Identities=30% Similarity=0.359 Sum_probs=15.7
Q ss_pred ccccceEEeehHHHHHHHHHHHHHHHH
Q psy1807 528 AQTLGYMISVGAGVVLFLLILLAIIYA 554 (618)
Q Consensus 528 ~~~~~~~~~~~~~~~l~l~vl~~~~~~ 554 (618)
...|.+.++|-++++|.|.+|.+-+.+
T Consensus 21 ~~aLYFv~gVC~GLlLtLcllV~risc 47 (153)
T PF14851_consen 21 RFALYFVSGVCAGLLLTLCLLVIRISC 47 (153)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhee
Confidence 444555566666666666566555555
No 130
>PF08391 Ly49: Ly49-like protein, N-terminal region; InterPro: IPR013600 The sequences making up this entry are annotated as, or are similar to, Ly49 receptors (e.g. P20937 from SWISSPROT). These are type II transmembrane receptors expressed by mouse natural killer (NK) cells. They are classified as being activating (e.g.Ly49D and H) or inhibitory (e.g. Ly49A and G), depending on their effect on NK cell function []. They are members of the C-type lectin receptor superfamily [], and in fact in many family members this region is found immediately N-terminal to a lectin C-type domain (IPR001304 from INTERPRO). ; PDB: 1QO3_D 3C8J_D 1P4L_D 3C8K_D 3G8K_B 1JA3_B 3CAD_A 3G8L_A.
Probab=24.01 E-value=44 Score=29.01 Aligned_cols=13 Identities=23% Similarity=0.608 Sum_probs=12.4
Q ss_pred ccceeecceEEEE
Q psy1807 84 AEITSFDRNCYEF 96 (618)
Q Consensus 84 ~~w~~~~~~CY~~ 96 (618)
.-|..||-+||+|
T Consensus 107 ~~W~cyg~kCYYF 119 (119)
T PF08391_consen 107 KHWFCYGVKCYYF 119 (119)
T ss_dssp CEEEEETTEEEEE
T ss_pred CceeeeCceeeeC
Confidence 8999999999998
No 131
>PF07589 VPEP: PEP-CTERM motif; InterPro: IPR013424 This entry describes a 25-residue region including an invariant Pro-Glu-Pro (PEP) motif, a thirteen residue strongly hydrophobic sequence likely to span the membrane, and a five-residue strongly basic motif that often contains four Arg residues. In most cases, this motif is found within nine residues of the C-terminal end of the protein. Proteins containing this motif typically have signal sequences at the N terminus [].
Probab=23.31 E-value=1e+02 Score=18.68 Aligned_cols=12 Identities=33% Similarity=0.467 Sum_probs=4.8
Q ss_pred HHHHHHHHHHHH
Q psy1807 542 VLFLLILLAIIY 553 (618)
Q Consensus 542 ~l~l~vl~~~~~ 553 (618)
.++++.++++..
T Consensus 9 ~l~~~gl~~l~~ 20 (25)
T PF07589_consen 9 ALLGLGLLGLAF 20 (25)
T ss_pred HHHHHHHHHHHH
Confidence 334444444333
No 132
>PF10577 UPF0560: Uncharacterised protein family UPF0560; InterPro: IPR018890 This family of proteins has no known function.
Probab=21.49 E-value=77 Score=36.50 Aligned_cols=18 Identities=17% Similarity=0.300 Sum_probs=9.6
Q ss_pred ccccCCEEEEEc-CCCcee
Q psy1807 236 NTKLGSTISYAC-PEGNML 253 (618)
Q Consensus 236 ~~~~G~~~~y~C-~~Gy~l 253 (618)
.|..|..+.++= ..||..
T Consensus 46 ~yrlg~~l~v~a~k~gyv~ 64 (807)
T PF10577_consen 46 PYRLGTWLIVTASKDGYVP 64 (807)
T ss_pred eccCCCeEEEEEecCCccC
Confidence 566666655543 345543
No 133
>PRK11056 hypothetical protein; Provisional
Probab=20.92 E-value=1.2e+02 Score=26.10 Aligned_cols=26 Identities=19% Similarity=0.098 Sum_probs=17.9
Q ss_pred hHHHHHHHHHHHHHHHHHHHhcCCCC
Q psy1807 538 GAGVVLFLLILLAIIYARLRKATPVK 563 (618)
Q Consensus 538 ~~~~~l~l~vl~~~~~~~~r~k~~~~ 563 (618)
-...+++.++|++|+.+|++.++++.
T Consensus 93 NF~p~il~~~L~~Wi~~kl~~~~~~~ 118 (120)
T PRK11056 93 NFFPAVLSVILVFWIGRKLRNRKKKV 118 (120)
T ss_pred chHHHHHHHHHHHHHHHHHhcccccc
Confidence 33445666788889998887766543
No 134
>KOG0994|consensus
Probab=20.23 E-value=2.4e+02 Score=34.04 Aligned_cols=68 Identities=24% Similarity=0.478 Sum_probs=40.3
Q ss_pred CEEEEEcCCCceecCCceeEecCCcccCC--CCceeeecCCCCCccCC-eeEEecCC------CcccCCEEEEEeCCCce
Q psy1807 241 STISYACPEGNMLVGSATRTCKEGFWTGV--APTCQYVDCGKLEHIEH-GTVTLETT------RTTHGAVAIYACHENYT 311 (618)
Q Consensus 241 ~~~~y~C~~Gy~l~g~~~~~C~~g~Ws~~--~p~C~~~~C~~~~~~~n-g~~~~~~~------~~~~g~~~~~~C~~Gy~ 311 (618)
..+.-.|.+|| .|+..-.|-++.|..+ ..+|++-.|...-.+.. |......+ ....|+.+.+ |.+||.
T Consensus 932 ~~ivC~C~~GY--~G~RCe~CA~~~fGnP~~GGtCq~CeC~~NiD~~d~~aCD~~TG~CLkCL~hTeG~hCe~-Ck~Gf~ 1008 (1758)
T KOG0994|consen 932 QQIVCHCQEGY--SGSRCEICADNHFGNPSEGGTCQKCECSNNIDLYDPGACDVATGACLKCLYHTEGDHCEH-CKDGFY 1008 (1758)
T ss_pred cceeeecccCc--cccchhhhcccccCCcccCCccccccccCCcCccCCCccchhhchhhhhhhcccccchhh-ccccch
Confidence 44667899999 5777778888888643 35788777764321111 11111001 2345677766 888885
Done!