Query psy9274
Match_columns 318
No_of_seqs 393 out of 2921
Neff 8.8
Searched_HMMs 46136
Date Fri Aug 16 23:17:24 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy9274.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/9274hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4219|consensus 99.9 1.2E-26 2.5E-31 209.3 8.9 166 54-221 27-195 (423)
2 KOG4220|consensus 99.9 1.4E-25 3.1E-30 201.9 6.8 158 62-220 30-189 (503)
3 PHA03234 DNA packaging protein 99.9 2.1E-23 4.5E-28 192.7 17.4 138 59-200 29-170 (338)
4 PHA02834 chemokine receptor-li 99.9 7.2E-22 1.6E-26 181.7 15.8 135 62-200 28-164 (323)
5 PHA02638 CC chemokine receptor 99.9 9.4E-22 2E-26 186.4 14.2 141 59-202 95-237 (417)
6 PHA03087 G protein-coupled che 99.9 3.6E-21 7.7E-26 177.9 16.1 152 62-219 40-193 (335)
7 PHA03235 DNA packaging protein 99.9 7.7E-21 1.7E-25 179.5 15.4 138 59-199 29-170 (409)
8 PF00001 7tm_1: 7 transmembran 99.8 1.3E-20 2.9E-25 165.1 13.3 141 79-221 1-143 (257)
9 KOG2087|consensus 99.6 1.5E-15 3.2E-20 136.3 8.4 166 48-219 10-184 (363)
10 PF10320 7TM_GPCR_Srsx: Serpen 99.6 5.7E-15 1.2E-19 131.5 8.0 116 74-191 2-119 (257)
11 PF10328 7TM_GPCR_Srx: Serpent 99.4 6.7E-13 1.4E-17 119.4 11.1 128 71-198 2-132 (274)
12 PF10324 7TM_GPCR_Srw: Serpent 99.2 1E-10 2.3E-15 107.3 9.0 144 71-215 5-162 (318)
13 PF10323 7TM_GPCR_Srv: Serpent 98.6 3.9E-08 8.4E-13 88.8 5.1 125 73-197 5-140 (283)
14 PF05462 Dicty_CAR: Slime mold 98.6 8.6E-07 1.9E-11 80.4 11.8 127 64-198 8-136 (303)
15 PF11710 Git3: G protein-coupl 98.5 3.7E-07 8.1E-12 78.1 7.1 75 91-165 30-106 (201)
16 PF05296 TAS2R: Mammalian tast 98.4 1.3E-05 2.8E-10 73.2 15.8 137 60-196 4-149 (303)
17 PF10316 7TM_GPCR_Srbc: Serpen 98.1 1.5E-05 3.2E-10 71.3 9.8 132 62-193 5-141 (273)
18 PF10321 7TM_GPCR_Srt: Serpent 98.1 1.7E-05 3.7E-10 72.4 8.6 120 62-186 32-159 (313)
19 PF10292 7TM_GPCR_Srab: Serpen 97.9 0.00046 1E-08 63.6 15.6 128 63-191 17-153 (324)
20 PF10317 7TM_GPCR_Srd: Serpent 97.9 0.00019 4E-09 65.2 12.2 124 67-196 3-140 (292)
21 PF02118 Srg: Srg family chemo 97.5 0.00031 6.7E-09 63.0 8.0 127 70-197 5-144 (275)
22 PF00002 7tm_2: 7 transmembran 97.3 8.8E-05 1.9E-09 65.2 1.9 128 67-199 5-134 (242)
23 PF04789 DUF621: Protein of un 97.1 0.021 4.5E-07 50.4 14.1 127 79-205 31-162 (305)
24 PF03383 Serpentine_r_xa: Caen 97.1 0.002 4.3E-08 52.2 7.3 67 138-204 49-115 (153)
25 PF03402 V1R: Vomeronasal orga 96.9 0.0021 4.6E-08 57.1 6.3 101 93-195 7-113 (265)
26 PF02117 7TM_GPCR_Sra: Serpent 96.2 0.055 1.2E-06 50.0 10.8 111 58-169 16-137 (328)
27 PF02101 Ocular_alb: Ocular al 96.1 0.21 4.5E-06 46.3 13.9 124 64-193 28-170 (405)
28 PF06681 DUF1182: Protein of u 95.6 0.075 1.6E-06 44.9 8.0 93 71-168 49-152 (226)
29 PF10326 7TM_GPCR_Str: Serpent 95.5 0.0095 2.1E-07 54.3 2.7 102 70-171 6-120 (307)
30 KOG1832|consensus 94.9 0.0087 1.9E-07 60.2 0.5 12 229-240 1391-1402(1516)
31 KOG4193|consensus 94.6 0.28 6E-06 48.9 10.2 116 70-196 327-446 (610)
32 PF10318 7TM_GPCR_Srh: Serpent 94.5 0.32 6.8E-06 44.1 9.7 132 64-197 5-147 (302)
33 PF02175 7TM_GPCR_Srb: Serpent 93.4 2.4 5.3E-05 36.8 12.4 42 130-171 80-123 (236)
34 PF10319 7TM_GPCR_Srj: Serpent 93.2 1 2.2E-05 40.9 10.2 127 67-193 9-148 (310)
35 PF10327 7TM_GPCR_Sri: Serpent 92.6 1.1 2.3E-05 41.0 9.8 128 64-193 10-149 (303)
36 KOG1832|consensus 92.6 0.027 5.9E-07 56.8 -0.7 10 153-162 1299-1308(1516)
37 KOG2568|consensus 92.5 2 4.2E-05 41.7 11.6 18 96-113 339-356 (518)
38 PF03344 Daxx: Daxx Family; I 92.2 0.042 9E-07 55.5 0.0 6 213-218 349-354 (713)
39 PF01534 Frizzled: Frizzled/Sm 91.8 3.8 8.2E-05 37.8 12.4 152 64-221 10-176 (328)
40 TIGR00927 2A1904 K+-dependent 91.8 0.051 1.1E-06 55.7 0.1 10 74-83 470-480 (1096)
41 KOG4564|consensus 91.2 4.5 9.7E-05 39.1 12.4 129 67-199 149-291 (473)
42 PF03125 Sre: C. elegans Sre G 90.9 5.5 0.00012 37.3 12.8 110 62-171 26-161 (365)
43 PRK12585 putative monovalent c 90.2 0.14 3.1E-06 42.6 1.3 11 127-137 27-37 (197)
44 KOG4156|consensus 89.9 0.1 2.2E-06 51.0 0.2 10 2-11 303-312 (1329)
45 PRK12585 putative monovalent c 89.1 2.6 5.7E-05 35.3 7.8 24 68-91 8-31 (197)
46 KOG0262|consensus 89.0 0.21 4.6E-06 52.3 1.8 7 162-168 1209-1215(1640)
47 PF06524 NOA36: NOA36 protein; 88.8 0.21 4.6E-06 43.5 1.4 16 73-88 94-109 (314)
48 PF10322 7TM_GPCR_Sru: Serpent 87.6 14 0.00029 33.9 12.3 62 138-200 108-169 (307)
49 PF06524 NOA36: NOA36 protein; 86.0 0.3 6.5E-06 42.5 0.7 7 105-111 98-104 (314)
50 PF07133 Merozoite_SPAM: Meroz 85.8 0.73 1.6E-05 38.1 2.8 8 236-243 26-33 (173)
51 PF02076 STE3: Pheromone A rec 84.1 5.2 0.00011 36.1 7.8 102 94-200 23-127 (283)
52 PF03821 Mtp: Golgi 4-transmem 82.5 23 0.00049 30.9 10.5 15 155-169 132-148 (233)
53 PF02724 CDC45: CDC45-like pro 82.1 0.93 2E-05 45.6 2.4 6 216-221 81-86 (622)
54 KOG0127|consensus 81.5 0.38 8.3E-06 46.3 -0.5 6 160-165 145-150 (678)
55 PF03348 Serinc: Serine incorp 81.2 31 0.00068 33.1 12.2 117 62-190 181-299 (429)
56 PF10329 DUF2417: Region of un 81.1 37 0.0008 29.7 11.8 16 146-161 140-155 (232)
57 PRK14891 50S ribosomal protein 81.1 0.89 1.9E-05 35.3 1.4 6 228-233 51-56 (131)
58 PRK14891 50S ribosomal protein 79.8 0.5 1.1E-05 36.7 -0.3 11 230-240 27-37 (131)
59 PTZ00415 transmission-blocking 78.6 1.1 2.4E-05 48.8 1.7 8 212-219 112-119 (2849)
60 KOG3817|consensus 77.8 3.9 8.4E-05 37.6 4.6 7 126-132 199-205 (452)
61 PF07423 DUF1510: Protein of u 77.3 2.1 4.5E-05 36.9 2.7 23 171-193 10-32 (217)
62 KOG3064|consensus 76.8 1 2.2E-05 39.3 0.6 14 35-48 34-47 (303)
63 COG3125 CyoD Heme/copper-type 76.5 32 0.00068 26.3 8.5 75 77-152 28-102 (111)
64 KOG3064|consensus 74.2 0.84 1.8E-05 39.8 -0.5 6 158-163 123-128 (303)
65 KOG2927|consensus 73.9 3.1 6.8E-05 38.1 3.0 8 162-169 205-212 (372)
66 PF05758 Ycf1: Ycf1; InterPro 73.6 41 0.00088 35.5 11.2 16 187-202 202-217 (929)
67 PF11705 RNA_pol_3_Rpc31: DNA- 71.4 1.6 3.4E-05 38.3 0.5 6 157-162 98-103 (233)
68 PF14362 DUF4407: Domain of un 70.7 44 0.00095 30.3 9.9 53 145-197 51-103 (301)
69 CHL00204 ycf1 Ycf1; Provisiona 69.5 91 0.002 35.2 12.9 18 186-203 217-234 (1832)
70 PF05297 Herpes_LMP1: Herpesvi 67.9 1.7 3.7E-05 38.6 0.0 19 146-164 134-152 (381)
71 PF01102 Glycophorin_A: Glycop 66.3 5.7 0.00012 31.0 2.6 23 75-97 75-97 (122)
72 COG4547 CobT Cobalamin biosynt 63.9 2.1 4.5E-05 40.7 -0.3 11 77-87 33-43 (620)
73 COG4547 CobT Cobalamin biosynt 62.3 2.5 5.5E-05 40.1 -0.0 9 151-159 128-136 (620)
74 PF09882 DUF2109: Predicted me 61.7 23 0.00049 25.1 4.6 49 73-121 4-52 (78)
75 PF13042 DUF3902: Protein of u 61.4 85 0.0018 25.3 12.7 59 95-153 60-118 (161)
76 COG5414 TATA-binding protein-a 59.1 3 6.6E-05 37.1 -0.1 10 160-169 179-188 (392)
77 TIGR03869 F420-0_ABCperm propo 58.9 1.4E+02 0.003 27.6 10.7 34 83-116 124-157 (325)
78 KOG0943|consensus 58.9 3 6.5E-05 44.3 -0.1 7 75-81 1459-1465(3015)
79 PF15383 TMEM237: Transmembran 54.5 64 0.0014 28.6 7.4 51 153-204 154-205 (253)
80 PTZ00359 hypothetical protein; 52.5 1E+02 0.0022 29.2 8.6 85 72-156 259-343 (443)
81 PF06814 Lung_7-TM_R: Lung sev 51.0 1.9E+02 0.0041 26.0 14.0 45 142-191 124-168 (295)
82 COG1230 CzcD Co/Zn/Cd efflux s 49.7 2E+02 0.0043 26.2 9.9 57 68-126 126-183 (296)
83 PF12304 BCLP: Beta-casein lik 47.0 1.7E+02 0.0037 24.5 11.2 113 64-199 42-158 (188)
84 KOG1307|consensus 45.9 9 0.0002 36.7 0.8 10 64-73 61-70 (588)
85 PRK10582 cytochrome o ubiquino 45.7 1.3E+02 0.0029 22.9 8.2 18 96-113 45-62 (109)
86 PRK09535 btuC corrinoid ABC tr 45.0 2.3E+02 0.0049 26.7 9.8 34 83-116 168-201 (366)
87 PF01032 FecCD: FecCD transpor 44.8 91 0.002 28.5 7.2 39 82-120 115-153 (311)
88 PRK10440 iron-enterobactin tra 44.0 2.5E+02 0.0054 26.0 9.9 24 94-117 142-165 (330)
89 PF09184 PPP4R2: PPP4R2; Inte 43.5 6.9 0.00015 35.4 -0.3 15 152-166 120-134 (288)
90 PF06213 CobT: Cobalamin biosy 43.0 7.7 0.00017 35.0 -0.1 8 155-162 150-157 (282)
91 PF10110 GPDPase_memb: Membran 40.4 77 0.0017 25.4 5.4 44 147-191 103-146 (149)
92 KOG2051|consensus 40.1 6.7 0.00014 41.1 -1.1 9 105-113 685-693 (1128)
93 KOG3817|consensus 39.7 2.2E+02 0.0048 26.6 8.5 38 73-110 168-216 (452)
94 KOG1307|consensus 39.6 11 0.00023 36.2 0.2 10 99-108 85-94 (588)
95 KOG2140|consensus 39.4 21 0.00046 34.9 2.2 8 156-163 318-325 (739)
96 KOG1619|consensus 39.2 2.7E+02 0.0058 24.5 9.3 28 159-186 148-175 (245)
97 KOG2302|consensus 38.7 1.6E+02 0.0035 31.5 8.3 118 61-178 73-198 (1956)
98 TIGR01651 CobT cobaltochelatas 38.4 11 0.00024 37.3 0.2 7 153-159 111-117 (600)
99 TIGR01477 RIFIN variant surfac 38.3 45 0.00097 31.0 4.0 30 65-94 310-339 (353)
100 COG2245 Predicted membrane pro 37.9 2.4E+02 0.0051 23.4 12.3 32 74-105 29-63 (182)
101 KOG1292|consensus 37.8 2.1E+02 0.0047 27.9 8.5 26 169-194 201-226 (510)
102 PF05478 Prominin: Prominin; 37.3 1.2E+02 0.0025 31.9 7.4 22 149-170 481-502 (806)
103 COG0109 CyoE Polyprenyltransfe 37.1 1.1E+02 0.0023 28.0 6.1 23 146-168 190-212 (304)
104 KOG3609|consensus 36.9 1.9E+02 0.0042 30.0 8.5 76 116-195 323-403 (822)
105 KOG2632|consensus 36.8 1.7E+02 0.0036 26.0 7.1 37 86-122 46-82 (258)
106 KOG2051|consensus 36.4 9.5 0.00021 40.1 -0.7 9 154-162 806-814 (1128)
107 PF14283 DUF4366: Domain of un 36.1 12 0.00026 32.4 0.0 15 98-112 98-113 (218)
108 PF15330 SIT: SHP2-interacting 35.7 51 0.0011 25.1 3.4 31 64-94 2-32 (107)
109 PTZ00046 rifin; Provisional 35.4 54 0.0012 30.5 4.1 30 65-94 315-344 (358)
110 TIGR02847 CyoD cytochrome o ub 34.4 1.9E+02 0.0042 21.4 8.2 57 95-152 33-89 (96)
111 PF13853 7tm_4: Olfactory rece 34.1 25 0.00055 28.0 1.6 54 178-236 4-57 (144)
112 PF09184 PPP4R2: PPP4R2; Inte 34.1 10 0.00022 34.3 -0.7 14 155-168 99-113 (288)
113 KOG1289|consensus 33.8 4.9E+02 0.011 25.9 13.8 20 94-113 389-408 (550)
114 KOG2571|consensus 33.4 6.1E+02 0.013 26.9 11.5 11 211-221 761-771 (862)
115 PF05297 Herpes_LMP1: Herpesvi 33.2 14 0.00031 33.0 0.0 23 75-97 58-80 (381)
116 PF02118 Srg: Srg family chemo 33.0 3.3E+02 0.0072 23.7 10.5 30 62-91 170-199 (275)
117 PF02116 STE2: Fungal pheromon 32.6 2.8E+02 0.0061 25.1 8.2 15 82-96 51-65 (284)
118 PHA02638 CC chemokine receptor 32.5 4.5E+02 0.0097 25.0 10.5 23 174-196 308-330 (417)
119 PF08114 PMP1_2: ATPase proteo 32.4 12 0.00025 23.0 -0.4 23 70-92 12-34 (43)
120 COG4606 CeuB ABC-type enteroch 32.4 3.8E+02 0.0082 24.2 8.7 91 71-162 112-206 (321)
121 PHA03164 hypothetical protein; 32.3 45 0.00097 23.5 2.3 20 94-113 56-75 (88)
122 KOG2290|consensus 31.9 1.3E+02 0.0028 29.0 6.0 16 73-88 495-510 (652)
123 TIGR01297 CDF cation diffusion 31.9 3.5E+02 0.0075 23.6 10.2 10 58-67 58-67 (268)
124 PF11446 DUF2897: Protein of u 31.7 51 0.0011 21.8 2.4 20 69-88 8-27 (55)
125 PF02009 Rifin_STEVOR: Rifin/s 31.5 35 0.00076 31.0 2.2 26 67-92 258-283 (299)
126 COG2194 Predicted membrane-ass 31.5 4.7E+02 0.01 26.1 10.2 10 184-193 159-168 (555)
127 PF05086 Dicty_REP: Dictyostel 30.9 20 0.00043 36.5 0.6 10 154-163 681-690 (911)
128 PLN02159 Fe(2+) transport prot 30.5 4.4E+02 0.0096 24.4 11.0 73 30-112 20-93 (337)
129 TIGR00439 ftsX putative protei 30.4 4.2E+02 0.0092 24.1 9.4 21 72-92 191-211 (309)
130 PF09726 Macoilin: Transmembra 30.3 2E+02 0.0042 29.7 7.5 94 60-153 28-131 (697)
131 PF09586 YfhO: Bacterial membr 29.9 6.6E+02 0.014 26.2 15.1 57 141-200 183-239 (843)
132 KOG3540|consensus 29.1 17 0.00038 34.8 -0.1 12 231-242 179-190 (615)
133 PF11295 DUF3096: Protein of u 29.1 1E+02 0.0023 18.7 3.2 23 57-79 14-36 (39)
134 COG1480 Predicted membrane-ass 28.6 6.6E+02 0.014 25.8 10.6 19 93-111 304-322 (700)
135 PF03547 Mem_trans: Membrane t 28.1 3.1E+02 0.0067 25.4 8.1 12 115-126 41-52 (385)
136 KOG3214|consensus 27.6 25 0.00055 26.1 0.5 6 212-217 48-53 (109)
137 PF14494 DUF4436: Domain of un 27.5 2.6E+02 0.0055 24.9 6.9 21 138-158 235-255 (256)
138 PF04893 Yip1: Yip1 domain; I 27.3 3.1E+02 0.0068 21.6 9.5 16 172-187 154-169 (172)
139 PRK13591 ubiA prenyltransferas 27.0 4.9E+02 0.011 23.8 11.0 69 101-179 30-103 (307)
140 PRK11228 fecC iron-dicitrate t 26.9 5E+02 0.011 23.8 10.6 23 94-116 138-160 (323)
141 KOG4434|consensus 26.7 35 0.00075 31.8 1.3 25 64-88 97-121 (520)
142 KOG1485|consensus 26.7 3.8E+02 0.0082 25.6 8.1 11 105-115 150-160 (412)
143 PRK10441 iron-enterobactin tra 26.7 5.2E+02 0.011 23.9 9.6 24 94-117 146-169 (335)
144 PRK12911 bifunctional preprote 25.8 9.8E+02 0.021 26.8 12.3 12 171-182 1083-1094(1403)
145 COG2181 NarI Nitrate reductase 25.2 1.7E+02 0.0037 25.5 5.2 58 62-119 89-146 (228)
146 KOG0526|consensus 25.2 23 0.00049 34.6 -0.1 7 161-167 327-333 (615)
147 KOG2325|consensus 25.0 72 0.0016 31.2 3.2 17 174-190 163-179 (488)
148 PF02532 PsbI: Photosystem II 24.6 1E+02 0.0022 18.3 2.5 16 65-80 11-26 (36)
149 COG4645 Uncharacterized protei 24.6 5.9E+02 0.013 23.8 8.8 31 95-125 238-270 (410)
150 PF00003 7tm_3: 7 transmembran 24.3 25 0.00054 30.4 -0.1 7 131-137 52-58 (238)
151 KOG2428|consensus 24.2 58 0.0013 31.0 2.3 8 151-158 277-284 (443)
152 COG5129 MAK16 Nuclear protein 24.1 7.8 0.00017 33.2 -3.1 12 35-46 33-44 (303)
153 PF10856 DUF2678: Protein of u 24.1 1.5E+02 0.0032 22.9 4.1 17 60-76 29-45 (118)
154 KOG3114|consensus 23.9 2.9E+02 0.0063 25.0 6.5 25 63-87 117-141 (290)
155 PF10873 DUF2668: Protein of u 23.8 47 0.001 26.6 1.4 33 62-94 61-93 (155)
156 PF12606 RELT: Tumour necrosis 23.6 80 0.0017 20.4 2.2 24 67-92 5-28 (50)
157 PRK11026 ftsX cell division AB 23.4 5.7E+02 0.012 23.3 9.4 18 75-92 194-211 (309)
158 TIGR02901 QoxD cytochrome aa3 23.2 3.2E+02 0.0068 20.2 7.6 18 96-113 36-53 (94)
159 KOG2592|consensus 22.5 3.4E+02 0.0073 25.8 6.8 125 60-193 179-315 (426)
160 KOG2475|consensus 22.4 42 0.0009 32.8 1.0 6 312-317 205-210 (587)
161 PF10749 DUF2534: Protein of u 21.9 3.1E+02 0.0068 19.7 7.3 16 98-113 13-28 (85)
162 KOG3118|consensus 21.5 50 0.0011 31.8 1.3 59 236-294 54-112 (517)
163 COG2456 Uncharacterized conser 21.3 3.8E+02 0.0082 20.4 7.5 38 81-118 21-58 (121)
164 PF06609 TRI12: Fungal trichot 21.0 8.7E+02 0.019 24.6 11.1 31 149-179 283-313 (599)
165 KOG4434|consensus 20.9 26 0.00056 32.6 -0.6 6 5-10 9-14 (520)
166 PF11196 DUF2834: Protein of u 20.7 3.6E+02 0.0078 20.0 6.5 28 173-200 67-94 (97)
167 PF10192 GpcrRhopsn4: Rhodopsi 20.6 5.8E+02 0.013 22.3 14.4 29 80-108 46-74 (257)
168 PF04238 DUF420: Protein of un 20.4 4.4E+02 0.0095 20.8 12.3 27 171-197 104-130 (133)
169 KOG4193|consensus 20.1 4.2E+02 0.0091 26.9 7.5 49 146-194 486-539 (610)
No 1
>KOG4219|consensus
Probab=99.93 E-value=1.2e-26 Score=209.26 Aligned_cols=166 Identities=24% Similarity=0.507 Sum_probs=150.0
Q ss_pred hHHHhhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhccCCCCchHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcCCcchhH
Q psy9274 54 AEAMERMVSIIVPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATDYVTTCWPFGDV 133 (318)
Q Consensus 54 ~~~~~~~~~~~~~~~~~ii~~~gi~gN~lvi~~i~~~~~l~~~~~~ll~~LaiaDll~~l~~~p~~~~~~~~~~~~~~~~ 133 (318)
....+.+...+.+++++++++++++||++|++++..+|++|+.+|+||+|||+||++++++..++.....+...|.+|..
T Consensus 27 ~f~lp~~~~~~wai~yg~l~~vAv~GN~iVlwIil~hrrMRtvtnyfL~NLAfADl~~s~Fn~~f~f~yal~~~W~~G~f 106 (423)
T KOG4219|consen 27 LFVLPAWQQALWAIAYGLLVFVAVVGNLIVLWIILAHRRMRTVTNYFLVNLAFADLSMSIFNTVFNFQYALHQEWYFGSF 106 (423)
T ss_pred cccCCHHHHHHHHHHHHHHHHHHHhcCceEEEEEeehhehhhhHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhccccccc
Confidence 34566777899999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHH--HHHHHHHHHHHHHHHHHHHHhheeeeccCccccccchhhHHHHHHHHHHHHHHHHhhhHhhcCceeeee-CCCc
Q psy9274 134 WCKM--YLIVVTAYASVYTLVLMSLDRFLAVVHPIASMSVRTERNALKAILITWIVIVLTAVPVFNMHGEVSYTY-ASEL 210 (318)
Q Consensus 134 ~C~~--~~~~~~~~~s~~~l~~isidRy~aI~~P~~y~~~~t~~~~~~~i~~~W~~s~~~~~p~~~~~~~~~~~~-~~~~ 210 (318)
+|++ |+..+...+|+++|++||+|||.||.||++.+ .++|.++++++++|+.|+++++|..+......... .+..
T Consensus 107 ~C~f~nf~~itav~vSVfTlvAiA~DRy~AIi~Pl~~r--~s~r~sk~iIllIW~lA~l~a~P~~l~s~v~~~~~~d~~~ 184 (423)
T KOG4219|consen 107 YCRFVNFFPITAVFVSVFTLVAIAIDRYMAIIHPLQPR--PSRRSSKIIILLIWALALLLALPQLLYSSVEELYLYDGES 184 (423)
T ss_pred eeeeccccchhhhhHhHHHHHHHHHHHHHHHhhhcccC--CCCcceeehhHHHHHHHHHHhccceeeeeeEEeeccCCcc
Confidence 9999 99999999999999999999999999999876 69999999999999999999999998877655433 2344
Q ss_pred ceeeEeecCCC
Q psy9274 211 HTACVFLDLDP 221 (318)
Q Consensus 211 ~~~C~~~~~~~ 221 (318)
...|...+...
T Consensus 185 ~~~~~~~~pe~ 195 (423)
T KOG4219|consen 185 RVVCVTAWPEH 195 (423)
T ss_pred eEEEEEecccc
Confidence 67788777765
No 2
>KOG4220|consensus
Probab=99.92 E-value=1.4e-25 Score=201.90 Aligned_cols=158 Identities=23% Similarity=0.410 Sum_probs=143.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhhccCCCCchHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcCCcchhHHHHH--HH
Q psy9274 62 SIIVPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATDYVTTCWPFGDVWCKM--YL 139 (318)
Q Consensus 62 ~~~~~~~~~ii~~~gi~gN~lvi~~i~~~~~l~~~~~~ll~~LaiaDll~~l~~~p~~~~~~~~~~~~~~~~~C~~--~~ 139 (318)
-++++++..+++++.++||++|++.+.-.|+|++..|+||++||+||++++.+.||+..++.+.+.|++|...|.+ .+
T Consensus 30 ~v~i~~v~~~lsLVTv~GNlLVmiSfKvnrqLqTVnNYfLfSLAcADliIG~~SMnl~t~Y~lmg~W~LG~~~CdlWLal 109 (503)
T KOG4220|consen 30 VVFIVVVTGSLSLVTVVGNLLVMISFKVNRQLQTVNNYFLFSLACADLIIGAFSMNLYTTYTLMGYWPLGPLVCDLWLAL 109 (503)
T ss_pred EEeeehhhhHHHHHhhhccEEEEEEEEecceeeeecceeehHHHHhhhhhheeechHHHHHHHHcccccchHHHHHHHHH
Confidence 5677888899999999999999999999999999999999999999999999999999999999999999999999 67
Q ss_pred HHHHHHHHHHHHHHHHHHhheeeeccCccccccchhhHHHHHHHHHHHHHHHHhhhHhhcCceeeeeCCCcceeeEeecC
Q psy9274 140 IVVTAYASVYTLVLMSLDRFLAVVHPIASMSVRTERNALKAILITWIVIVLTAVPVFNMHGEVSYTYASELHTACVFLDL 219 (318)
Q Consensus 140 ~~~~~~~s~~~l~~isidRy~aI~~P~~y~~~~t~~~~~~~i~~~W~~s~~~~~p~~~~~~~~~~~~~~~~~~~C~~~~~ 219 (318)
-++...+|++.|+.|++|||..|.+|+.|+..+|.+|+.++|++.|++|+++-.|.+++|..+.= ........|..+..
T Consensus 110 DYvaSNASVmNLLiISFDRYFsVTrPLtYrakRTtkrA~~MI~~AW~iSfiLWaPaIl~WqyivG-krTv~~~eC~iQFl 188 (503)
T KOG4220|consen 110 DYVASNASVMNLLIISFDRYFSVTRPLTYRAKRTTKRAGLMIGAAWVLSFVLWAPAILFWQYIVG-KRTVPDGECYIQFL 188 (503)
T ss_pred HHHhhhhhhhhhheeeeecceeecccccccccccchHHHHHHHHHHHHHHHHHHHHHHhhHhhee-eeecCCCceEEEee
Confidence 78889999999999999999999999999999999999999999999999999999888765432 22445677877666
Q ss_pred C
Q psy9274 220 D 220 (318)
Q Consensus 220 ~ 220 (318)
+
T Consensus 189 s 189 (503)
T KOG4220|consen 189 S 189 (503)
T ss_pred c
Confidence 5
No 3
>PHA03234 DNA packaging protein UL33; Provisional
Probab=99.91 E-value=2.1e-23 Score=192.68 Aligned_cols=138 Identities=16% Similarity=0.294 Sum_probs=115.3
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhhhhh--hccCCCCchHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcCCcchhHHHH
Q psy9274 59 RMVSIIVPIFFGIIIILGFFGNALVVVVV--AFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATDYVTTCWPFGDVWCK 136 (318)
Q Consensus 59 ~~~~~~~~~~~~ii~~~gi~gN~lvi~~i--~~~~~l~~~~~~ll~~LaiaDll~~l~~~p~~~~~~~~~~~~~~~~~C~ 136 (318)
...++++++++.+++++|++||++|++++ .+++++|+++|+|++|||+||+++++ .+|+.+... ...|++|...|+
T Consensus 29 ~~~~~~~~~~y~~vf~~gl~gN~lvl~v~~~~~~~~~rt~tn~fi~NLAvaDLL~~l-~lp~~~~~~-~~~w~fG~~lCk 106 (338)
T PHA03234 29 KKAQILESAINGIMLTLIIPMIIIVICTLIIYHKVAKHNATSFYLITLFASDFLHML-CVFFLTLNR-EALFNFNQAFCQ 106 (338)
T ss_pred HHHHHHhhHHHHHHHHHHhhhHHHHHHHHHHHhccccccHHHHHHHHHHHHHHHHHH-HHHHHHHHH-hCCccCchhHHH
Confidence 45688899999999999999999999955 45566789999999999999999987 556555443 457999999999
Q ss_pred H--HHHHHHHHHHHHHHHHHHHHhheeeeccCccccccchhhHHHHHHHHHHHHHHHHhhhHhhcC
Q psy9274 137 M--YLIVVTAYASVYTLVLMSLDRFLAVVHPIASMSVRTERNALKAILITWIVIVLTAVPVFNMHG 200 (318)
Q Consensus 137 ~--~~~~~~~~~s~~~l~~isidRy~aI~~P~~y~~~~t~~~~~~~i~~~W~~s~~~~~p~~~~~~ 200 (318)
+ ++..++.++|++++++||+|||+||++|.+++. +++++...++++|+++++.++|.++...
T Consensus 107 ~~~~~~~~~~~~Si~~L~~ISiDRY~aIv~p~~~~~--~~~~~~~~i~~~Wi~s~l~~~P~l~~~~ 170 (338)
T PHA03234 107 CVLFIYHASCSYSICMLAIIATIRYKTLHRRKKNDK--KNNHIGRNIGILFLASAMCAIPAALFVK 170 (338)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhHeeeechhhhhh--hhhhHHHHHHHHHHHHHHHHhhHhHeee
Confidence 9 888899999999999999999999999998653 3444555667779999999999986543
No 4
>PHA02834 chemokine receptor-like protein; Provisional
Probab=99.88 E-value=7.2e-22 Score=181.72 Aligned_cols=135 Identities=23% Similarity=0.534 Sum_probs=114.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhhccCCCCchHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcCCcchhHHHHH--HH
Q psy9274 62 SIIVPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATDYVTTCWPFGDVWCKM--YL 139 (318)
Q Consensus 62 ~~~~~~~~~ii~~~gi~gN~lvi~~i~~~~~l~~~~~~ll~~LaiaDll~~l~~~p~~~~~~~~~~~~~~~~~C~~--~~ 139 (318)
+.++.+++.+++++|++||+++++++.++++ +++.|+|++|||+||++. ++.+|+.+.... ..|.+|...|++ ++
T Consensus 28 ~~~~~~~~~li~v~~~~gN~lVi~vi~~~~~-~~~~n~~i~nLAiaDll~-~~~lP~~i~~~~-~~w~~g~~~C~~~~~~ 104 (323)
T PHA02834 28 NYFVIVFYILLFIFGLIGNVLVIAVLIVKRF-MFVVDVYLFNIAMSDLML-VFSFPFIIHNDL-NEWIFGEFMCKLVLGV 104 (323)
T ss_pred hhhHHHHHHHHHHHHHhhHHHHHHHHHhccc-cchhhhhhHHHHHHHHHH-HHHHHHHHHHHc-CCcCCcchHHHhHHHH
Confidence 5678899999999999999999999887665 467899999999999986 457888766543 479999999999 66
Q ss_pred HHHHHHHHHHHHHHHHHHhheeeeccCccccccchhhHHHHHHHHHHHHHHHHhhhHhhcC
Q psy9274 140 IVVTAYASVYTLVLMSLDRFLAVVHPIASMSVRTERNALKAILITWIVIVLTAVPVFNMHG 200 (318)
Q Consensus 140 ~~~~~~~s~~~l~~isidRy~aI~~P~~y~~~~t~~~~~~~i~~~W~~s~~~~~p~~~~~~ 200 (318)
..++..+|++++++||+|||++|++|++++. .+.+++.++++++|+++++.++|+++++.
T Consensus 105 ~~~~~~~Si~tL~~IsidRY~aI~~P~~~~~-~~~~~~~~~i~~iWi~s~l~~~P~~~~~~ 164 (323)
T PHA02834 105 YFVGFFSNMFFVTLISIDRYILVVNATKIKN-KSISLSVLLSVAAWVCSVILSMPAMVLYY 164 (323)
T ss_pred HHHHHHHHHHHHHHHHHHHhhheeCchhccC-CccchHHHHHHHHHHHHHHHHhhHHHHHH
Confidence 6777889999999999999999999998754 35677778889999999999999877543
No 5
>PHA02638 CC chemokine receptor-like protein; Provisional
Probab=99.87 E-value=9.4e-22 Score=186.43 Aligned_cols=141 Identities=30% Similarity=0.637 Sum_probs=124.0
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhhhhhhccCCCCchHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcCCcchhHHHHH-
Q psy9274 59 RMVSIIVPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATDYVTTCWPFGDVWCKM- 137 (318)
Q Consensus 59 ~~~~~~~~~~~~ii~~~gi~gN~lvi~~i~~~~~l~~~~~~ll~~LaiaDll~~l~~~p~~~~~~~~~~~~~~~~~C~~- 137 (318)
.....++.+++.+++++|++||++|++++.+ |++|++++++++|||++|+++++ .+|+.+... .+.|.++...|++
T Consensus 95 ~~~~~~l~~~y~lvfvlgliGN~LVl~il~~-k~lrt~t~i~llnLAisDLl~~l-~lPf~i~~~-~~~W~fg~~~Ck~~ 171 (417)
T PHA02638 95 PSISEYIKIFYIIIFILGLFGNAAIIMILFC-KKIKTITDIYIFNLAISDLIFVI-DFPFIIYNE-FDQWIFGDFMCKVI 171 (417)
T ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHHHHHh-ccCCCHhHHHHHHHHHHHHHHHH-HHHHHHHHH-hccccccccchhhH
Confidence 3456788899999999999999999987765 77899999999999999998865 788877655 4689999999999
Q ss_pred -HHHHHHHHHHHHHHHHHHHHhheeeeccCccccccchhhHHHHHHHHHHHHHHHHhhhHhhcCce
Q psy9274 138 -YLIVVTAYASVYTLVLMSLDRFLAVVHPIASMSVRTERNALKAILITWIVIVLTAVPVFNMHGEV 202 (318)
Q Consensus 138 -~~~~~~~~~s~~~l~~isidRy~aI~~P~~y~~~~t~~~~~~~i~~~W~~s~~~~~p~~~~~~~~ 202 (318)
++...+.+++++++++|++|||+||+||+++++.++++++.++++++|++++++++|++++.+..
T Consensus 172 ~~l~~~~~~~Si~~L~~isiDRYlaIv~p~~~~~~~~~~~~~i~~~~iW~~s~l~slP~~~~~~~~ 237 (417)
T PHA02638 172 SASYYIGFFSNMFLITLMSIDRYFAILYPISFQKYRTFNIGIILCIISWILSLIITSPAYFIFEAS 237 (417)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhccccccceecHhhhHhhHhHHHHHHHHHHHHHHHHhhcc
Confidence 67778888999999999999999999999998888888888999999999999999999876643
No 6
>PHA03087 G protein-coupled chemokine receptor-like protein; Provisional
Probab=99.87 E-value=3.6e-21 Score=177.90 Aligned_cols=152 Identities=27% Similarity=0.595 Sum_probs=129.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhhccCCCCchHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcCCcchhHHHHH--HH
Q psy9274 62 SIIVPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATDYVTTCWPFGDVWCKM--YL 139 (318)
Q Consensus 62 ~~~~~~~~~ii~~~gi~gN~lvi~~i~~~~~l~~~~~~ll~~LaiaDll~~l~~~p~~~~~~~~~~~~~~~~~C~~--~~ 139 (318)
..+..+++.+++++|++||+++++++.++ ++|++.++++.|||++|++.++ .+|..+.......|.++...|++ ++
T Consensus 40 ~~~~~~~~~~i~~~gl~gN~lvl~~~~~~-~~~~~~~~ll~~laisDll~~~-~~~~~~~~~~~~~~~~~~~~C~~~~~~ 117 (335)
T PHA03087 40 STILIVVYSTIFFFGLVGNIIVIYVLTKT-KIKTPMDIYLLNLAVSDLLFVM-TLPFQIYYYILFQWSFGEFACKIVSGL 117 (335)
T ss_pred hhHHHHHHHHHHHHHHHhhHhEEeeehhc-cccCchHHHHHHHHHHHHHHHH-hHHHHHHHHhCCCCCCCcHHHHHHHHH
Confidence 56788889999999999999999999888 8899999999999999998877 56766655666788899999999 77
Q ss_pred HHHHHHHHHHHHHHHHHHhheeeeccCccccccchhhHHHHHHHHHHHHHHHHhhhHhhcCceeeeeCCCcceeeEeecC
Q psy9274 140 IVVTAYASVYTLVLMSLDRFLAVVHPIASMSVRTERNALKAILITWIVIVLTAVPVFNMHGEVSYTYASELHTACVFLDL 219 (318)
Q Consensus 140 ~~~~~~~s~~~l~~isidRy~aI~~P~~y~~~~t~~~~~~~i~~~W~~s~~~~~p~~~~~~~~~~~~~~~~~~~C~~~~~ 219 (318)
...+..+|++++++||+|||++|++|++|+...+++++.++++++|+++++.++|++........ .....|...+.
T Consensus 118 ~~~~~~~S~~~l~~iaidRy~aI~~p~~~~~~~~~~~~~~~~~~iWl~~~~~~~p~~~~~~~~~~----~~~~~C~~~~~ 193 (335)
T PHA03087 118 YYIGFYNSMNFITVMSVDRYIAIVHPVKSNKINTVKYGYIVSLVIWIISIIETTPILFVYTTKKD----HETLICCMFYN 193 (335)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccCccccccchhhhHHHHHHHHHHHHHHhccHhheeeeecc----CCCceEEecCC
Confidence 78888999999999999999999999999999999999999999999999999988776543221 22445655443
No 7
>PHA03235 DNA packaging protein UL33; Provisional
Probab=99.86 E-value=7.7e-21 Score=179.54 Aligned_cols=138 Identities=22% Similarity=0.262 Sum_probs=111.0
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhhhhhhccCCC--CchHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcCCcchhHHHH
Q psy9274 59 RMVSIIVPIFFGIIIILGFFGNALVVVVVAFNQQM--RSTTNLLIINLAIADLLFIVCCVPFTATDYVTTCWPFGDVWCK 136 (318)
Q Consensus 59 ~~~~~~~~~~~~ii~~~gi~gN~lvi~~i~~~~~l--~~~~~~ll~~LaiaDll~~l~~~p~~~~~~~~~~~~~~~~~C~ 136 (318)
...+.+..+++.+++++|++||++|++++.+++++ +++.++|++|||++|+++ ++.+|+.+... ...|..+...|+
T Consensus 29 ~~~~~~~~~~~~li~vvGiigN~lVL~~~~~~~r~~~~~~~~~~I~NLAvsDLl~-l~~lP~~i~~~-~~~~~~g~~~Ck 106 (409)
T PHA03235 29 SAARTTETFINLLIISVGGPLNLIVLVTQLLANRVHGFSTPTLYMTNLYLANLLT-VFVLPFIMLSN-QGLLSGSVAGCK 106 (409)
T ss_pred hhhHhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCCccHHHHHHHHHHHHHH-HHHHHHHHHhc-CccccCCCCeeh
Confidence 34577899999999999999999999986533332 356789999999999987 45788776532 122334467899
Q ss_pred H--HHHHHHHHHHHHHHHHHHHHhheeeeccCccccccchhhHHHHHHHHHHHHHHHHhhhHhhc
Q psy9274 137 M--YLIVVTAYASVYTLVLMSLDRFLAVVHPIASMSVRTERNALKAILITWIVIVLTAVPVFNMH 199 (318)
Q Consensus 137 ~--~~~~~~~~~s~~~l~~isidRy~aI~~P~~y~~~~t~~~~~~~i~~~W~~s~~~~~p~~~~~ 199 (318)
+ ++..++..+|++++++||+|||++|++|+++++ .+++++.++++++|++++++++|++++.
T Consensus 107 ~~~~l~~~~~~~Si~tL~~ISiDRY~aI~~p~~~~~-~~~~~a~~ii~~iWi~sll~s~P~~~~~ 170 (409)
T PHA03235 107 FASLLYYASCTVGFATVALIAADRYRVIHQRTRARS-SAYRSTYKILGLTWFASLICSGPAPVYT 170 (409)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHeechhhccC-cccchhhhhHHHHHHHHHHHHHHHHHHH
Confidence 9 777888899999999999999999999986644 4688899999999999999999976543
No 8
>PF00001 7tm_1: 7 transmembrane receptor (rhodopsin family) Rhodopsin-like GPCR superfamily signature 5-hydroxytryptamine 7 receptor signature bradykinin receptor signature gastrin receptor signature melatonin receptor signature olfactory receptor signature; InterPro: IPR000276 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The rhodopsin-like GPCRs themselves represent a widespread protein family that includes hormone, neurotransmitter and light receptors, all of which transduce extracellular signals through interaction with guanine nucleotide-binding (G) proteins. Although their activating ligands vary widely in structure and character, the amino acid sequences of the receptors are very similar and are believed to adopt a common structural framework comprising 7 transmembrane (TM) helices [, , ].; GO: 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane; PDB: 2KI9_A 3QAK_A 2YDV_A 3VGA_A 3PWH_A 3RFM_A 3EML_A 3VG9_A 3REY_A 3UZA_A ....
Probab=99.85 E-value=1.3e-20 Score=165.10 Aligned_cols=141 Identities=30% Similarity=0.611 Sum_probs=123.4
Q ss_pred HHHHhhhhhhccCCCCchHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcCCcchhHHHHH--HHHHHHHHHHHHHHHHHHH
Q psy9274 79 GNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATDYVTTCWPFGDVWCKM--YLIVVTAYASVYTLVLMSL 156 (318)
Q Consensus 79 gN~lvi~~i~~~~~l~~~~~~ll~~LaiaDll~~l~~~p~~~~~~~~~~~~~~~~~C~~--~~~~~~~~~s~~~l~~isi 156 (318)
||+++++++.++|++|++.++|++|||++|++.++...|..+.......|.++...|++ ++..++..+|.+++++||+
T Consensus 1 GN~lvi~~~~~~~~~~~~~~~~l~~Lav~Dll~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~s~~~~~~is~ 80 (257)
T PF00001_consen 1 GNILVILVILRSKRLRTPSNILLLNLAVADLLVGLFCIPFYIYSLLFDDWIFSSFLCRIFGFLFYFSSFSSIFSLVAISI 80 (257)
T ss_dssp HHHHHHHHHHHSGGG-SHHHHHHHHHHHHHHHHHHTHHHHHHHHHHHSSCTSHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred CchhehhhhhhhccCCChhHHHHHHHHHHHHhhccccccccccccccccccccccccccccccccccccccccccccccc
Confidence 89999999999999999999999999999999999999988888777889999999999 7888899999999999999
Q ss_pred HhheeeeccCccccccchhhHHHHHHHHHHHHHHHHhhhHhhcCceeeeeCCCcceeeEeecCCC
Q psy9274 157 DRFLAVVHPIASMSVRTERNALKAILITWIVIVLTAVPVFNMHGEVSYTYASELHTACVFLDLDP 221 (318)
Q Consensus 157 dRy~aI~~P~~y~~~~t~~~~~~~i~~~W~~s~~~~~p~~~~~~~~~~~~~~~~~~~C~~~~~~~ 221 (318)
|||++|++|++|++..+++++..+++++|+++++.++|++++........ .....|.+.....
T Consensus 81 dRy~~i~~p~~~~~~~~~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~~~~--~~~~~C~~~~~~~ 143 (257)
T PF00001_consen 81 DRYLAICHPLRYRRIRTRRRARIIIILIWIISFLISLPPLFFSWVYFVSD--GSQSFCFIDFSSS 143 (257)
T ss_dssp HHHHHHHSHHTHHHHSCHHHHHHHHHHHHHHHHHHHHHHHHTCEEEEEST--CCCEEEEESCSSS
T ss_pred cccccccccccccccccccccccccccccccccccccccccccccccccc--ccccccccccccc
Confidence 99999999999999999999999999999999999999988755443322 1256787765543
No 9
>KOG2087|consensus
Probab=99.61 E-value=1.5e-15 Score=136.26 Aligned_cols=166 Identities=23% Similarity=0.282 Sum_probs=132.5
Q ss_pred CCCCCchHHHhhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhccCCCCchHHHHHHHHHHHHHHHHhhhHHHHHHHHhhc-
Q psy9274 48 NDNDVDAEAMERMVSIIVPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATDYVTT- 126 (318)
Q Consensus 48 ~~~~~~~~~~~~~~~~~~~~~~~ii~~~gi~gN~lvi~~i~~~~~l~~~~~~ll~~LaiaDll~~l~~~p~~~~~~~~~- 126 (318)
+..+...+.-+...+.++.+..+++++++++||++|++.....+...++..+++.|||++|+++++....+..+.....
T Consensus 10 P~~da~~pcEdllg~~~lRi~vW~i~~lAi~gN~~Vl~~~~~~~~~~~~~~~li~~la~ad~~mGiYl~~ia~vD~~~~g 89 (363)
T KOG2087|consen 10 PSPDAFNPCEDLLGYWILRISVWVIALLAIVGNLLVLLTRFTSRYELNSHRFLICNLAFADLLMGIYLGLIASVDAKTRG 89 (363)
T ss_pred CCCCCCCcHHHhhccceeeehhhhhhhHHhccCeeeeeeeeehhhhccchHHHHHHHHHHHHHcchHHHHHHHhhHHHHH
Confidence 3334444455566688899999999999999999999999988887889999999999999999986666655544322
Q ss_pred ------CCcchhHHHHH--HHHHHHHHHHHHHHHHHHHHhheeeeccCccccccchhhHHHHHHHHHHHHHHHHhhhHhh
Q psy9274 127 ------CWPFGDVWCKM--YLIVVTAYASVYTLVLMSLDRFLAVVHPIASMSVRTERNALKAILITWIVIVLTAVPVFNM 198 (318)
Q Consensus 127 ------~~~~~~~~C~~--~~~~~~~~~s~~~l~~isidRy~aI~~P~~y~~~~t~~~~~~~i~~~W~~s~~~~~p~~~~ 198 (318)
.|-.....|++ |+..++.-.|++++++++++|+++|++|++..+....|.+..+++..|+.+++.++.|+++
T Consensus 90 ey~~~ai~W~tg~gC~~aGflavFASElSv~~LT~itlEr~l~i~~p~~~~~~~~lr~~~~ill~~wl~~~l~A~~Pl~g 169 (363)
T KOG2087|consen 90 EYYKHAIDWQTGLGCPVAGFLAVFASELSVFLLTLITLERWLSITYPFRLDRKAKLRPLVLILLLGWLFAFLMALLPLFG 169 (363)
T ss_pred HHHHHHHhhhhcCCCchHHHHHHHHHHHHHHHHHHHHHHHHhheeccccCCCcccccHHHHHHHHHHHHHHHHHhccccC
Confidence 22245677998 8999999999999999999999999999998877667779999999999999999999887
Q ss_pred cCceeeeeCCCcceeeEeecC
Q psy9274 199 HGEVSYTYASELHTACVFLDL 219 (318)
Q Consensus 199 ~~~~~~~~~~~~~~~C~~~~~ 219 (318)
...+ .....|.-...
T Consensus 170 ~s~Y------~~~~vClPL~~ 184 (363)
T KOG2087|consen 170 ISSY------GASSVCLPLHI 184 (363)
T ss_pred CCCC------cccceeeeccc
Confidence 4322 22466754433
No 10
>PF10320 7TM_GPCR_Srsx: Serpentine type 7TM GPCR chemoreceptor Srsx; InterPro: IPR019424 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class sx (Srsx), which is a solo family amongst the superfamilies of chemoreceptors. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' [].
Probab=99.57 E-value=5.7e-15 Score=131.46 Aligned_cols=116 Identities=22% Similarity=0.437 Sum_probs=94.6
Q ss_pred HHHHHHHHHhhhhhhccCCCCchHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcCCcchhHHHHH--HHHHHHHHHHHHHH
Q psy9274 74 ILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATDYVTTCWPFGDVWCKM--YLIVVTAYASVYTL 151 (318)
Q Consensus 74 ~~gi~gN~lvi~~i~~~~~l~~~~~~ll~~LaiaDll~~l~~~p~~~~~~~~~~~~~~~~~C~~--~~~~~~~~~s~~~l 151 (318)
++|++||+++++++.|+|++|+|.+++++.+|++|++.++..+|..+..+.... .....|-. +...++..+....+
T Consensus 2 ~ig~~gN~~~i~~~~~~~~Lrs~~~~li~~~~~~d~~~~~~~~~~~~~~~~~~~--i~~~~Cf~~~~~~~f~~~~qs~~~ 79 (257)
T PF10320_consen 2 IIGLFGNLLLIILIFRNKSLRSPCYILICILCFADLICLLGTLPFMLFLFRDHQ--ITRSECFWQIFFYIFFQCAQSVIM 79 (257)
T ss_pred EEEEEccHHHHHHHHhccccccchHHHHHHHHHHHHHHHhhHHHHHHHHHhhee--ccHHHHHHHHHHHHHHHHHHHHHH
Confidence 578999999999999999999999999999999999999988887775443333 34455655 77778888889999
Q ss_pred HHHHHHhheeeeccCccccccchhhHHHHHHHHHHHHHHH
Q psy9274 152 VLMSLDRFLAVVHPIASMSVRTERNALKAILITWIVIVLT 191 (318)
Q Consensus 152 ~~isidRy~aI~~P~~y~~~~t~~~~~~~i~~~W~~s~~~ 191 (318)
++|++||++||+.|++|+++.+++.....++...+++...
T Consensus 80 l~i~iDr~iaV~~P~~Y~~~~~~~y~~~~~~~~~~~s~~~ 119 (257)
T PF10320_consen 80 LAIAIDRLIAVCFPLRYRTISTRKYLIILLIFPVIYSIFF 119 (257)
T ss_pred HHHHhhheeeEeehhhhhhcccccchhhHhHHHHHHHHHH
Confidence 9999999999999999999888875555555555555544
No 11
>PF10328 7TM_GPCR_Srx: Serpentine type 7TM GPCR chemoreceptor Srx; InterPro: IPR019430 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class x (Srx) from the Srg superfamily [, ]. Srg receptors contain seven hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures.
Probab=99.44 E-value=6.7e-13 Score=119.38 Aligned_cols=128 Identities=15% Similarity=0.297 Sum_probs=106.4
Q ss_pred HHHHHHHHHHHHhhhhhhccCCCCchHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcCCcchhHHHHH---HHHHHHHHHH
Q psy9274 71 IIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATDYVTTCWPFGDVWCKM---YLIVVTAYAS 147 (318)
Q Consensus 71 ii~~~gi~gN~lvi~~i~~~~~l~~~~~~ll~~LaiaDll~~l~~~p~~~~~~~~~~~~~~~~~C~~---~~~~~~~~~s 147 (318)
+++++|++.|.++++.+.|.+++|++.+++..+.|++|++.++..+.......+.+........... ++...+...+
T Consensus 2 ~~s~~G~~~N~~v~~~~~~~~~~~~sF~~l~~~~a~~n~i~~~~~l~~~~P~~~~~~~~~~~~~~s~~~g~~~~~~y~~~ 81 (274)
T PF10328_consen 2 LISIIGIILNWLVFIIIFKLKSLRNSFGILCASQAIANIIICLIFLFWVVPMTLLDISFLPSSLNSIIFGFIGMFCYFIG 81 (274)
T ss_pred eeeHHHHHHHHHHHHHHHhcccccCCHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccCchHHHHHHHHHHHHHHHHHH
Confidence 3578999999999999999999999999999999999999998666666555555544334433332 5566677789
Q ss_pred HHHHHHHHHHhheeeeccCccccccchhhHHHHHHHHHHHHHHHHhhhHhh
Q psy9274 148 VYTLVLMSLDRFLAVVHPIASMSVRTERNALKAILITWIVIVLTAVPVFNM 198 (318)
Q Consensus 148 ~~~l~~isidRy~aI~~P~~y~~~~t~~~~~~~i~~~W~~s~~~~~p~~~~ 198 (318)
.++.++||++|++||+.|.+|+++.+.+.+..+++++|++++....+....
T Consensus 82 ~~~~~liaiNRf~ai~fP~~y~~~fs~~~T~~~i~~~~~~~~~~~~~~~~~ 132 (274)
T PF10328_consen 82 PLSHLLIAINRFCAIFFPFKYKKIFSFKNTIILIAFIWLLSIIISTILYFP 132 (274)
T ss_pred HHHHHHHHHHHHheeeeHHHHHhHcCccceehhhhHHHHHHHHHHHHhhhc
Confidence 999999999999999999999999999999999999999999777666554
No 12
>PF10324 7TM_GPCR_Srw: Serpentine type 7TM GPCR chemoreceptor Srw; InterPro: IPR019427 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class w (Srw), which is a solo family amongst the superfamilies of chemoreceptors. The genes encoding Srw do not appear to be under as strong an adaptive evolutionary pressure as those of Srz [].
Probab=99.16 E-value=1e-10 Score=107.31 Aligned_cols=144 Identities=26% Similarity=0.393 Sum_probs=107.4
Q ss_pred HHHHHHHHHHHHhhhhhhccCCCCc-hHHHHHHHHHHHHHHHHhhhHHHHHHHHhhc----C-CcchhHHH----HH--H
Q psy9274 71 IIIILGFFGNALVVVVVAFNQQMRS-TTNLLIINLAIADLLFIVCCVPFTATDYVTT----C-WPFGDVWC----KM--Y 138 (318)
Q Consensus 71 ii~~~gi~gN~lvi~~i~~~~~l~~-~~~~ll~~LaiaDll~~l~~~p~~~~~~~~~----~-~~~~~~~C----~~--~ 138 (318)
+++++|+++|++.++++.|+. +|+ +.|+++..+|++|++..+..++..+...... . ++...... .+ .
T Consensus 5 ~~~~~g~~~N~~h~~VLtrk~-mR~~~in~~l~~Iai~Dl~~~~~~~~~~~~~~~~~~~~~~C~~~~sy~~~~~~~~~~~ 83 (318)
T PF10324_consen 5 ILSIFGLFINIFHLIVLTRKS-MRSSSINILLIGIAICDLLYMLSILIWELFFFIIFYEDDECIPPYSYFWAYFKLIMES 83 (318)
T ss_pred EEeHHHHHHHHHHhhhcCChh-hhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccHHHHHHHHHHHH
Confidence 457899999999999998655 675 8999999999999999998888777655432 1 11111111 11 5
Q ss_pred HHHHHHHHHHHHHHHHHHHhheeeeccCcc--ccccchhhHHHHHHHHHHHHHHHHhhhHhhcCceeeeeCCCcceeeE
Q psy9274 139 LIVVTAYASVYTLVLMSLDRFLAVVHPIAS--MSVRTERNALKAILITWIVIVLTAVPVFNMHGEVSYTYASELHTACV 215 (318)
Q Consensus 139 ~~~~~~~~s~~~l~~isidRy~aI~~P~~y--~~~~t~~~~~~~i~~~W~~s~~~~~p~~~~~~~~~~~~~~~~~~~C~ 215 (318)
+...+..+|.|+.++||+.|+++|.+|++. +++.+++.+..++++++++++++.++.++................|.
T Consensus 84 l~~~~~~~S~WL~V~mA~iR~l~i~~p~~~~~~~l~~~k~~~~~i~~v~~~s~~~~~~~~~~~~i~~~~~~~~p~~~C~ 162 (318)
T PF10324_consen 84 LSDIFRRISIWLGVLMALIRYLSIKFPMSSRFQKLSKPKFAIIVILIVFIISFLFSIPYFFRYKIVEVSDPWVPPPNCS 162 (318)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCCCCeeeeehHHHHHHHHHHHHHhhceEEEEeccccccCCCcee
Confidence 666777899999999999999999999865 35677888889999999999999998886655443322223344575
No 13
>PF10323 7TM_GPCR_Srv: Serpentine type 7TM GPCR chemoreceptor Srv; InterPro: IPR019426 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class v (Srv) from the Srg superfamily [, ]. Srg receptors contain seven hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures.
Probab=98.62 E-value=3.9e-08 Score=88.84 Aligned_cols=125 Identities=21% Similarity=0.325 Sum_probs=92.6
Q ss_pred HHHHHHHHHHhhhhhhccCC----CCchHHHHHHHHHHHHHHHHhhhHHHHHHHHhh---c-CCcchhHHHHH--HHHHH
Q psy9274 73 IILGFFGNALVVVVVAFNQQ----MRSTTNLLIINLAIADLLFIVCCVPFTATDYVT---T-CWPFGDVWCKM--YLIVV 142 (318)
Q Consensus 73 ~~~gi~gN~lvi~~i~~~~~----l~~~~~~ll~~LaiaDll~~l~~~p~~~~~~~~---~-~~~~~~~~C~~--~~~~~ 142 (318)
+++.++..+++++++.+.|+ ++++.+.++.+.+++|++..+............ . ....+....++ .....
T Consensus 5 ~~~tlply~~il~~l~~~r~~~~~~~~~Fy~l~~~~~iaDi~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~y~ 84 (283)
T PF10323_consen 5 SIVTLPLYIFILYCLLKLRKRSKTFKSTFYTLLIQHCIADILSMLFYFLLQRFRAYGFFNDFYFSYQTYYAAIYYNIAYY 84 (283)
T ss_pred EeehHHHHHHHHHHHHHcccCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHhcchhHHHHHHHHHH
Confidence 45666777788777776554 458999999999999999887665544433221 1 01112223344 44455
Q ss_pred HHHHHHHHHHHHHHHhheeeeccC-ccccccchhhHHHHHHHHHHHHHHHHhhhHh
Q psy9274 143 TAYASVYTLVLMSLDRFLAVVHPI-ASMSVRTERNALKAILITWIVIVLTAVPVFN 197 (318)
Q Consensus 143 ~~~~s~~~l~~isidRy~aI~~P~-~y~~~~t~~~~~~~i~~~W~~s~~~~~p~~~ 197 (318)
+........+++|++||.||++|. +++++.++.+...++++-|+.+++.++|.+.
T Consensus 85 ~~~~~~~gi~lls~nR~~ai~~P~~~~~~~~~~~~~~~i~~i~wi~p~li~~~~~~ 140 (283)
T PF10323_consen 85 FLYIQCIGIVLLSLNRYLAICFPTSRHTKFWQPAKIWIIILIQWIPPLLISLPFFF 140 (283)
T ss_pred HHHHHHHhHHHHHHhhhheEEeecHHHhhhccccchhheeeeeehhhhhheeeeec
Confidence 666778889999999999999999 8888889999999999999999999998654
No 14
>PF05462 Dicty_CAR: Slime mold cyclic AMP receptor
Probab=98.55 E-value=8.6e-07 Score=80.40 Aligned_cols=127 Identities=20% Similarity=0.276 Sum_probs=98.9
Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhccCCCCchHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcCCcchhHHHHH--HHHH
Q psy9274 64 IVPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATDYVTTCWPFGDVWCKM--YLIV 141 (318)
Q Consensus 64 ~~~~~~~ii~~~gi~gN~lvi~~i~~~~~l~~~~~~ll~~LaiaDll~~l~~~p~~~~~~~~~~~~~~~~~C~~--~~~~ 141 (318)
++.++..+.++++++|-+++++.+++.|++|++.+-++.-++++|++..+..+..... ....-++..|++ ++..
T Consensus 8 ~~~~i~~~~s~lSllGclfiI~tf~~~k~~r~~~~rli~yl~~~~ll~~v~~~~~~~~----~~~~~~s~lC~~Qafliq 83 (303)
T PF05462_consen 8 TLYAIELVASVLSLLGCLFIIITFCLFKRLRKPINRLIFYLSIANLLTNVASMIMTLS----PSAGENSFLCQFQAFLIQ 83 (303)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCccHHHHHHHHHHHHHHHHHHHHHHHhc----ccCCCCCcchhhHhHHHH
Confidence 4555666778899999999999999999999999999999999999987644332211 222335677988 7888
Q ss_pred HHHHHHHHHHHHHHHHhheeeeccCccccccchhhHHHHHHHHHHHHHHHHhhhHhh
Q psy9274 142 VTAYASVYTLVLMSLDRFLAVVHPIASMSVRTERNALKAILITWIVIVLTAVPVFNM 198 (318)
Q Consensus 142 ~~~~~s~~~l~~isidRy~aI~~P~~y~~~~t~~~~~~~i~~~W~~s~~~~~p~~~~ 198 (318)
++..++.+-.++||++=|+.+++... . .+|.-....+++|.++++.++.++..
T Consensus 84 ~f~~as~lWt~~iA~nly~~~~~~~~--~--~~k~e~~Y~l~~wGiPli~a~i~l~~ 136 (303)
T PF05462_consen 84 FFMLASFLWTLCIAFNLYLTFVKRYD--E--LEKLEKYYHLVCWGIPLIPAVIPLIG 136 (303)
T ss_pred HhhHHHHHHHHHHHHHhhheeEEcCc--c--HHHHHHHHHHHHHHHHHHHHHHHHHc
Confidence 88899998889999999999987532 2 35555678889999999988877643
No 15
>PF11710 Git3: G protein-coupled glucose receptor regulating Gpa2; InterPro: IPR023041 This entry contains a functionally uncharacterised region belonging to the Git3 G-protein coupled receptor. Git3 is one of six proteins required for glucose-triggered adenylate cyclase activation, and is a G protein-coupled receptor responsible for the activation of adenylate cyclase through Gpa2 - heterotrimeric G protein alpha subunit, part of the glucose-detection pathway. Git3 contains seven predicted transmembrane domains, a third cytoplasmic loop and a cytoplasmic tail []. This is the conserved N-terminal domain of the member proteins.
Probab=98.48 E-value=3.7e-07 Score=78.06 Aligned_cols=75 Identities=21% Similarity=0.284 Sum_probs=60.7
Q ss_pred CCCCchHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcCCcchhHHHHH--HHHHHHHHHHHHHHHHHHHHhheeeecc
Q psy9274 91 QQMRSTTNLLIINLAIADLLFIVCCVPFTATDYVTTCWPFGDVWCKM--YLIVVTAYASVYTLVLMSLDRFLAVVHP 165 (318)
Q Consensus 91 ~~l~~~~~~ll~~LaiaDll~~l~~~p~~~~~~~~~~~~~~~~~C~~--~~~~~~~~~s~~~l~~isidRy~aI~~P 165 (318)
+++|.-.+-++.||.++|++.++..+...+.....+...-++..|.. ++..++..++-+..++||+.-|+.|.+|
T Consensus 30 ~r~~~fR~~LIl~L~~aD~~qal~~~i~~~~~l~~~~i~~~s~~C~aqGf~~q~g~~~sd~~ilaIAihT~l~v~~~ 106 (201)
T PF11710_consen 30 YRRRSFRHQLILNLLLADFIQALAFLISPIRWLARGGIIAPSPFCQAQGFFLQVGDEASDLWILAIAIHTFLIVFRP 106 (201)
T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeCCCCchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhcC
Confidence 34456677799999999999998766655555554445456788988 8888888999999999999999999999
No 16
>PF05296 TAS2R: Mammalian taste receptor protein (TAS2R); InterPro: IPR007960 This family consists of several forms of mammalian taste receptor proteins (TAS2Rs). TAS2Rs are G protein-coupled receptors expressed in subsets of taste receptor cells of the tongue and palate epithelia and are organised in the genome in clusters. The proteins are genetically linked to loci that influence bitter perception in mice and humans [].; GO: 0004930 G-protein coupled receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0050909 sensory perception of taste, 0016021 integral to membrane
Probab=98.41 E-value=1.3e-05 Score=73.17 Aligned_cols=137 Identities=16% Similarity=0.235 Sum_probs=95.2
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHhhhhhhc---cCCCCchHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcCCcchhHHHH
Q psy9274 60 MVSIIVPIFFGIIIILGFFGNALVVVVVAF---NQQMRSTTNLLIINLAIADLLFIVCCVPFTATDYVTTCWPFGDVWCK 136 (318)
Q Consensus 60 ~~~~~~~~~~~ii~~~gi~gN~lvi~~i~~---~~~l~~~~~~ll~~LaiaDll~~l~~~p~~~~~~~~~~~~~~~~~C~ 136 (318)
....+..++..+.+++|++||+.++.+..+ +++.-+|.+..+.+||++.++...+.+...+..............++
T Consensus 4 ~~~~i~~~i~~~~~~~Gi~~N~FI~~vn~~~w~k~~~l~~~d~IL~~La~sr~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (303)
T PF05296_consen 4 SLEIIFLIILVVEFIIGILGNGFIVLVNCSDWVKSRKLSPSDQILTSLAISRILLQWVILLNSFLSFFFPNIYFSENVYK 83 (303)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHcCCCCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHcchhhhhhhHHH
Confidence 346677888899999999999998887653 33333699999999999999988766655555555554444555666
Q ss_pred H--HHHHHHHHHHHHHHHHHHHHhheeeeccCccc----cccchhhHHHHHHHHHHHHHHHHhhhH
Q psy9274 137 M--YLIVVTAYASVYTLVLMSLDRFLAVVHPIASM----SVRTERNALKAILITWIVIVLTAVPVF 196 (318)
Q Consensus 137 ~--~~~~~~~~~s~~~l~~isidRy~aI~~P~~y~----~~~t~~~~~~~i~~~W~~s~~~~~p~~ 196 (318)
+ ++..++...|.|..+++++-=++.|+..-+.- +++=.+.+..++++.++++++..++..
T Consensus 84 ~~~~~~~f~~~~s~W~tt~LsvfYcvKI~~fs~~~Fl~LK~rI~~~v~~lLl~s~l~s~~~~~~~~ 149 (303)
T PF05296_consen 84 IIDFLWMFSNSSSLWFTTWLSVFYCVKIANFSHPFFLWLKRRISKVVPWLLLGSLLISFLNLLSIP 149 (303)
T ss_pred HHHHHHHHHhHHHHHHHHHHHHHHheeeecCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 6 77777888999999999999889998643211 111122233456667777774443333
No 17
>PF10316 7TM_GPCR_Srbc: Serpentine type 7TM GPCR chemoreceptor Srbc ; InterPro: IPR019420 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class b (Srb) from the Sra superfamily []. Srb receptors contain 6-8 hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures. Srbc is a solo family amongst the superfamilies of chemoreceptors.
Probab=98.15 E-value=1.5e-05 Score=71.28 Aligned_cols=132 Identities=15% Similarity=0.143 Sum_probs=86.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhhccCCCCchHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcCCc--chhHHHHH--
Q psy9274 62 SIIVPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATDYVTTCWP--FGDVWCKM-- 137 (318)
Q Consensus 62 ~~~~~~~~~ii~~~gi~gN~lvi~~i~~~~~l~~~~~~ll~~LaiaDll~~l~~~p~~~~~~~~~~~~--~~~~~C~~-- 137 (318)
.++..++..+.+.+..+.|..+++.+...|+.|++--.++--..+.|.+.++...+..+........+ .......+
T Consensus 5 ~~iv~~i~i~~s~~~~~iN~~lL~~if~~Kk~kk~~l~LfY~Rf~~D~~~~~~~~~~~~~~~~~~~~~~~~~~~~~if~~ 84 (273)
T PF10316_consen 5 AIIVSIIGIIFSIITCLINFYLLYSIFYSKKKKKPDLSLFYFRFAIDVFYGFSVFIYLIYYILFSLFPEFLDYKNLIFYI 84 (273)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCEEeeHHHHHHHHHHHHHHHHHHHHHHHHHhChhhhccccEEeeh
Confidence 45667777788889999999999998877764455555666678999999887766555544322111 10000111
Q ss_pred -HHHHHHHHHHHHHHHHHHHHhheeeeccCccccccchhhHHHHHHHHHHHHHHHHh
Q psy9274 138 -YLIVVTAYASVYTLVLMSLDRFLAVVHPIASMSVRTERNALKAILITWIVIVLTAV 193 (318)
Q Consensus 138 -~~~~~~~~~s~~~l~~isidRy~aI~~P~~y~~~~t~~~~~~~i~~~W~~s~~~~~ 193 (318)
........+.......||+||.+|++.|..|++.+++-....+++++-.++++=..
T Consensus 85 ~~p~~~~~~iR~~l~~~Ia~dR~~A~~fPI~y~~~r~k~~~~~I~~~~~~~~~~d~~ 141 (273)
T PF10316_consen 85 GWPSSNLGSIRSILALIIALDRVFAVYFPIFYHNYRKKIPNFIIIIIALSYGLFDQY 141 (273)
T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHcCCHHHHccCccccHHHHHHHHHHHHHHHHh
Confidence 33334455677888999999999999999988765554444444444444444333
No 18
>PF10321 7TM_GPCR_Srt: Serpentine type 7TM GPCR chemoreceptor Srt; InterPro: IPR019425 Chemoreception is mediated in Caenorhabditis elegans by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs) of proteins which are of the serpentine type []. Srt is a member of the Srg superfamily of chemoreceptors. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' [].
Probab=98.06 E-value=1.7e-05 Score=72.37 Aligned_cols=120 Identities=16% Similarity=0.241 Sum_probs=86.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhhccCCCCchHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcCCcchhHHHHH----
Q psy9274 62 SIIVPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATDYVTTCWPFGDVWCKM---- 137 (318)
Q Consensus 62 ~~~~~~~~~ii~~~gi~gN~lvi~~i~~~~~l~~~~~~ll~~LaiaDll~~l~~~p~~~~~~~~~~~~~~~~~C~~---- 137 (318)
+.++++.+.+.+++-.+..+.+++++.+++.+|.+.+-++.-||+.|++...+.-...-...+ .|...|+.
T Consensus 32 ~p~~G~~~~~~g~~~~~lY~p~~~~i~~~~~~k~~~ykiM~~L~i~Di~~l~~~si~tG~l~i-----~G~vfC~~P~~~ 106 (313)
T PF10321_consen 32 RPILGIYFLIFGIIIIILYIPCLIAIFKKKLFKMSCYKIMFFLAIFDIIQLFINSIITGILAI-----FGAVFCSYPRFI 106 (313)
T ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHhccccCcHHHHHHHHHHHHHHHHHhhhhhhhHHHh-----cCccccCCchHh
Confidence 678899999999999999999999999888778999999999999999987643221111111 23344543
Q ss_pred ----HHHHHHHHHHHHHHHHHHHHhheeeeccCccccccchhhHHHHHHHHHH
Q psy9274 138 ----YLIVVTAYASVYTLVLMSLDRFLAVVHPIASMSVRTERNALKAILITWI 186 (318)
Q Consensus 138 ----~~~~~~~~~s~~~l~~isidRy~aI~~P~~y~~~~t~~~~~~~i~~~W~ 186 (318)
.+...+..++......++++|..-+..|........++++.+.+++..+
T Consensus 107 ~i~G~~~~~~W~~~c~~~i~LainR~~~l~~~~~~~~~F~g~~~~~~l~~~~i 159 (313)
T PF10321_consen 107 YIMGCIALGSWCGSCIICILLAINRCCELNFPFKLKFLFEGKRTYIVLFLPII 159 (313)
T ss_pred hhhhccchhhHHHHHHHHHHHHHHHHHHHhcchhhHHhhcCceEEeeehhHHH
Confidence 3334556788888899999999999998776666566665544444333
No 19
>PF10292 7TM_GPCR_Srab: Serpentine type 7TM GPCR receptor class ab chemoreceptor; InterPro: IPR019408 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. Srab is part of the Sra superfamily of chemoreceptors. The expression pattern of the srab genes is biologically intriguing. Of the six promoters successfully expressed in transgenic organisms, one was exclusively expressed in the tail phasmid neurons, two were exclusively expressed in a head amphid neuron, and two were expressed both in the head and tail neurons as well as a limited number of other cells [].
Probab=97.92 E-value=0.00046 Score=63.58 Aligned_cols=128 Identities=13% Similarity=0.109 Sum_probs=89.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhhhccCCCCchHHHHHHHHHHHHHHHHhhhHHHHHHH---Hhh--c--CCcchhHHH
Q psy9274 63 IIVPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATD---YVT--T--CWPFGDVWC 135 (318)
Q Consensus 63 ~~~~~~~~ii~~~gi~gN~lvi~~i~~~~~l~~~~~~ll~~LaiaDll~~l~~~p~~~~~---~~~--~--~~~~~~~~C 135 (318)
.+..++.++++++|++.++.+++.+.+++..|.-.-+++...+++-++.++..+...... +.. . ....+...|
T Consensus 17 ~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~H~N~ril~~~~~~~~l~~~~~r~~~h~~~l~~~~~~~~~Cd~~~~~~~C 96 (324)
T PF10292_consen 17 RLSLIFNLLLSIIAFPVIIYALWKIRNSKLFHFNTRILFIVHCFSFLIHCTGRIILHTYDLYNYFFPDDPCDMIPSTYRC 96 (324)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhcchhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccCCCcccccchhHH
Confidence 344455778888999999998888888888887788888888888888876544433322 222 1 123466778
Q ss_pred HH--HHHHHHHHHHHHHHHHHHHHhheeeeccCccccccchhhHHHHHHHHHHHHHHH
Q psy9274 136 KM--YLIVVTAYASVYTLVLMSLDRFLAVVHPIASMSVRTERNALKAILITWIVIVLT 191 (318)
Q Consensus 136 ~~--~~~~~~~~~s~~~l~~isidRy~aI~~P~~y~~~~t~~~~~~~i~~~W~~s~~~ 191 (318)
-+ .+..++...+.++.++|++||++|-++.-+|.+. ++.....+..+.|+++++.
T Consensus 97 ~~lR~~~~~~~~~~~~t~v~l~IER~iAT~~~~~YE~~-~~~~Gi~l~~~qi~is~~~ 153 (324)
T PF10292_consen 97 FILRIPYNFGLFLVSFTTVSLVIERTIATFFSKSYEKS-GKWLGILLAFFQILISLLI 153 (324)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCC-CccHHHHHHHHHHHHHHHH
Confidence 66 7778888889999999999999999999888764 3333234444445544443
No 20
>PF10317 7TM_GPCR_Srd: Serpentine type 7TM GPCR chemoreceptor Srd; InterPro: IPR019421 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents the chemoreceptor Srd [].
Probab=97.89 E-value=0.00019 Score=65.19 Aligned_cols=124 Identities=20% Similarity=0.359 Sum_probs=87.3
Q ss_pred HHHHHHHHHHHHHHHHhhhhhhccCC-CCchHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcC-----------CcchhHH
Q psy9274 67 IFFGIIIILGFFGNALVVVVVAFNQQ-MRSTTNLLIINLAIADLLFIVCCVPFTATDYVTTC-----------WPFGDVW 134 (318)
Q Consensus 67 ~~~~ii~~~gi~gN~lvi~~i~~~~~-l~~~~~~ll~~LaiaDll~~l~~~p~~~~~~~~~~-----------~~~~~~~ 134 (318)
++..+.+++|++.|+++++.+.++.. .-++..+++.|-|++|++.++..+.... ...... -.++...
T Consensus 3 i~~~~~~~~~~~~n~~Ll~~i~~~tp~~l~~~~~~l~~~~~~~~~~~~~~~~~q~-R~i~~~~~~~~~~~GpC~~~~~~~ 81 (292)
T PF10317_consen 3 IYHPIFFILGIILNILLLYLIIFKTPKSLRTYSILLLNTAIFDLISIISAFLTQP-RIIPNGTSIAVISYGPCRYFGPWF 81 (292)
T ss_pred EeHHHHHHHHHHHHHHHHHHHHHhChHHHHHHHHHHHHHHHHHHHHHHHHHHhhh-hccCCCCcEEEEeeecceecCcHH
Confidence 35678899999999999988876554 3367889999999999998875443322 221111 1246677
Q ss_pred HHH--HHHHHHHHHHHHHHHHHHHHhheeeeccCccccccchhhHHHHHHHHHHHHHHHHhhhH
Q psy9274 135 CKM--YLIVVTAYASVYTLVLMSLDRFLAVVHPIASMSVRTERNALKAILITWIVIVLTAVPVF 196 (318)
Q Consensus 135 C~~--~~~~~~~~~s~~~l~~isidRy~aI~~P~~y~~~~t~~~~~~~i~~~W~~s~~~~~p~~ 196 (318)
|.. .+...+...+.+++.+--+.||.++.++-. ++++...++++.+..++...+..+
T Consensus 82 C~~~y~~~~~~~~~s~~~i~~sf~yRy~iL~~~~~-----~~~~~i~~~~l~~~p~~~~~i~~~ 140 (292)
T PF10317_consen 82 CFFCYHIMLHFSIHSGWSIILSFYYRYYILRNPSP-----SRKRIIFICILIYIPSFSMLIFFF 140 (292)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC-----chhHHHHHHHHHHHHHHHHHHHHh
Confidence 877 455566777888888888899999987633 566677777777777776655444
No 21
>PF02118 Srg: Srg family chemoreceptor; InterPro: IPR000609 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class g (Srg) from the Srg superfamily [, ]. Srg receptors contain seven hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures. ; GO: 0004888 transmembrane signaling receptor activity, 0007606 sensory perception of chemical stimulus, 0016020 membrane
Probab=97.55 E-value=0.00031 Score=62.95 Aligned_cols=127 Identities=18% Similarity=0.186 Sum_probs=82.4
Q ss_pred HHHHHHHHHHHHHhhhhhhccCC--C-CchHHHHHHHHHHHHHHHHhhhHHHH-HHHHhhcCC-------cchhHHHHH-
Q psy9274 70 GIIIILGFFGNALVVVVVAFNQQ--M-RSTTNLLIINLAIADLLFIVCCVPFT-ATDYVTTCW-------PFGDVWCKM- 137 (318)
Q Consensus 70 ~ii~~~gi~gN~lvi~~i~~~~~--l-~~~~~~ll~~LaiaDll~~l~~~p~~-~~~~~~~~~-------~~~~~~C~~- 137 (318)
.+-++.+++.++++++++.++|+ . +++.+.+...-++.|+++-+...+.. +........ ...+...++
T Consensus 5 ~~Ygi~s~~l~~~~~~~i~~~~~~~~~~~sFy~l~~~d~~~ni~~~ln~~~~~Rl~~~~~~~~~~~~~~~~~~~~~l~~~ 84 (275)
T PF02118_consen 5 LIYGIPSLILYIFILYVIWKSKKKSYFKSSFYRLYIMDLIMNILTYLNTWITLRLPSYTPPFCPFFLPFFENPPFFLNIF 84 (275)
T ss_pred hhHHHHHHHHHHHHHHHHHHhccccccCCccHHHHHHHhHHHHHHHHHHHHHHHhccccCccchhHHHHHcCCcHHHHHH
Confidence 34556777888888888887776 4 67888888888888887765444331 111100100 112233344
Q ss_pred -HHHHHHHHHHHHHHHHHHHHhheeeeccCccccccchhhHHHHHHHHHHHHHHHHhhhHh
Q psy9274 138 -YLIVVTAYASVYTLVLMSLDRFLAVVHPIASMSVRTERNALKAILITWIVIVLTAVPVFN 197 (318)
Q Consensus 138 -~~~~~~~~~s~~~l~~isidRy~aI~~P~~y~~~~t~~~~~~~i~~~W~~s~~~~~p~~~ 197 (318)
++...+..+.....+.||++|+.+|..|.+|.+..++.. ..+++++.++.+....+.+.
T Consensus 85 ~~l~~~~~~~Q~~~~~~is~nR~t~v~~p~~~~~~W~~~~-~~~i~~i~~~p~~~~~~~~~ 144 (275)
T PF02118_consen 85 YFLQYYFAYVQYLSTILISLNRFTSVLFPIRYEKFWKRYY-WIIIIIIFLLPFSFTWNIFI 144 (275)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhHHHHhhh-hhheeeeeehhHHHHHHHHc
Confidence 566677778888999999999999999999988766544 34555555555555444444
No 22
>PF00002 7tm_2: 7 transmembrane receptor (Secretin family); InterPro: IPR000832 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The secretin-like GPCRs include secretin [], calcitonin [], parathyroid hormone/parathyroid hormone-related peptides [] and vasoactive intestinal peptide [], all of which activate adenylyl cyclase and the phosphatidyl-inositol-calcium pathway. These receptors contain seven transmembrane regions, in a manner reminiscent of the rhodopsins and other receptors believed to interact with G-proteins (however there is no significant sequence identity between these families, the secretin-like receptors thus bear their own unique '7TM' signature). Their N terminus is probably located on the extracellular side of the membrane and potentially glycosylated. This N-terminal region contains a long conserved region which allow the binding of large peptidic ligand such as glucagon, secretin, VIP and PACAP; this region contains five conserved cysteines residues which could be involved in disulphide bond. The C-terminal region of these receptor is probably cytoplasmic. Every receptor gene in this family is encoded on multiple exons, and several of these genes are alternatively spliced to yield functionally distinct products. ; GO: 0004930 G-protein coupled receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane; PDB: 3L2J_A 1BL1_A.
Probab=97.35 E-value=8.8e-05 Score=65.21 Aligned_cols=128 Identities=17% Similarity=0.178 Sum_probs=3.9
Q ss_pred HHHHHHHHHHHHHHHHhhhhhhccCCCCchHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcCCcchhHHHHH--HHHHHHH
Q psy9274 67 IFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATDYVTTCWPFGDVWCKM--YLIVVTA 144 (318)
Q Consensus 67 ~~~~ii~~~gi~gN~lvi~~i~~~~~l~~~~~~ll~~LaiaDll~~l~~~p~~~~~~~~~~~~~~~~~C~~--~~~~~~~ 144 (318)
++..+.+.+++++-+++++++...|++|+..+....||+++.++..+..+.. ...............|+. .+..++.
T Consensus 5 ~i~~vg~~~Si~~ll~~i~~~~~~r~lr~~~~~i~~~l~~sll~~~~~~l~~-~~~~~~~~~~~~~~~C~~~a~~~hy~~ 83 (242)
T PF00002_consen 5 IISYVGCSLSIICLLLTIITYLLFRKLRSFRNKIHLNLCLSLLLANLSFLIG-ISQTFSPISTTNHCLCRAIAILLHYFF 83 (242)
T ss_dssp HHHHHHHH------------------------------------------------------------------------
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHhhcccchhhhhhhHHHHHHHHHHHhee-hhhccccccccccccchhhhhHhHHHH
Confidence 4444555555555556556555567778877889999999999887744322 111111111112335988 6667777
Q ss_pred HHHHHHHHHHHHHhheeeeccCccccccchhhHHHHHHHHHHHHHHHHhhhHhhc
Q psy9274 145 YASVYTLVLMSLDRFLAVVHPIASMSVRTERNALKAILITWIVIVLTAVPVFNMH 199 (318)
Q Consensus 145 ~~s~~~l~~isidRy~aI~~P~~y~~~~t~~~~~~~i~~~W~~s~~~~~p~~~~~ 199 (318)
.++.+-+++++++-|..++++... .+++.....++.|.++++.........
T Consensus 84 la~f~Wm~~~~~~l~~~~~~~~~~----~~~~~~~~~~~gwg~P~~iv~i~~~~~ 134 (242)
T PF00002_consen 84 LASFFWMLVEAFYLYRLLVKVFNS----SRRRFWWYYLIGWGIPALIVVISVAVN 134 (242)
T ss_dssp -------------------------------------------------------
T ss_pred HHHHHHHHHHHHHhheeEEEeecc----cchhhheeeeeeecCcceeeeeeeeec
Confidence 888888889999999999877321 244555667788988877776555443
No 23
>PF04789 DUF621: Protein of unknown function (DUF621); InterPro: IPR006874 This is a conserved region found in uncharacterised proteins from Caenorhabditis elegans, and is noted to have possible G-protein-coupled receptor-like activity.
Probab=97.14 E-value=0.021 Score=50.36 Aligned_cols=127 Identities=16% Similarity=0.283 Sum_probs=82.1
Q ss_pred HHHHhhhhhhccCCCCchHHHHHHHHHHHHHHHHhhhH----HHHHHHHhhcCCcchhHHHHH-HHHHHHHHHHHHHHHH
Q psy9274 79 GNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCV----PFTATDYVTTCWPFGDVWCKM-YLIVVTAYASVYTLVL 153 (318)
Q Consensus 79 gN~lvi~~i~~~~~l~~~~~~ll~~LaiaDll~~l~~~----p~~~~~~~~~~~~~~~~~C~~-~~~~~~~~~s~~~l~~ 153 (318)
|-.+|+.++..++-++-+...|+..|.++-++.....+ |..+...+.....-+...-.+ ++.-++.++-++.=.+
T Consensus 31 ~~Flv~~i~lW~~Fk~m~ffwFl~qlt~s~fi~S~lNl~inVPatlfsl~t~~~~~s~~f~~~Sy~idf~h~siLfsNlv 110 (305)
T PF04789_consen 31 GAFLVLSIILWSHFKPMKFFWFLTQLTISVFIISSLNLLINVPATLFSLITKEFVQSELFIFMSYLIDFCHYSILFSNLV 110 (305)
T ss_pred HHHHHHHHHHHHhcccchHHHHHHHHHHHHHHHHhhhheEeCcHHHHHhhhHHHhccchhhHHHHHHHHHHHHHHHHHHH
Confidence 33455555555554456899999999999888765444 444433333333223333333 7777888888999999
Q ss_pred HHHHhheeeeccCccccccchhhHHHHHHHHHHHHHHHHhhhHhhcCceeee
Q psy9274 154 MSLDRFLAVVHPIASMSVRTERNALKAILITWIVIVLTAVPVFNMHGEVSYT 205 (318)
Q Consensus 154 isidRy~aI~~P~~y~~~~t~~~~~~~i~~~W~~s~~~~~p~~~~~~~~~~~ 205 (318)
||+.|.....+.-+..+...+.-..+-+..+|++++..........-.+.|.
T Consensus 111 iaIqR~fVFFfr~~t~~~F~s~~iyiWL~~vWils~~v~~~l~~~nC~Y~y~ 162 (305)
T PF04789_consen 111 IAIQRFFVFFFRNLTDKVFESPVIYIWLLLVWILSIGVVYSLMSNNCRYRYN 162 (305)
T ss_pred HHHhhheeeeehhhhhhhccchHHHHHHHHHHHHHHHHHHHHHhCCCeeecc
Confidence 9999988776433333444566777788889999888777665544444443
No 24
>PF03383 Serpentine_r_xa: Caenorhabditis serpentine receptor-like protein, class xa; InterPro: IPR005047 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class xa (Srxa), from the Str superfamily [].
Probab=97.14 E-value=0.002 Score=52.15 Aligned_cols=67 Identities=15% Similarity=0.185 Sum_probs=53.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHhheeeeccCccccccchhhHHHHHHHHHHHHHHHHhhhHhhcCceee
Q psy9274 138 YLIVVTAYASVYTLVLMSLDRFLAVVHPIASMSVRTERNALKAILITWIVIVLTAVPVFNMHGEVSY 204 (318)
Q Consensus 138 ~~~~~~~~~s~~~l~~isidRy~aI~~P~~y~~~~t~~~~~~~i~~~W~~s~~~~~p~~~~~~~~~~ 204 (318)
.+..++....+++.++|+++|..+|.+|++...+.|.+|.++-+.++.++.+...+.|++..-...+
T Consensus 49 L~~tf~Yl~plfltvLMti~Ri~iV~~P~~~~~~Fs~~kl~~YC~~i~i~~~i~LlIPy~S~C~vnf 115 (153)
T PF03383_consen 49 LFGTFSYLHPLFLTVLMTINRIYIVLFPFGSEIWFSDKKLWIYCGIIAILSFISLLIPYFSDCYVNF 115 (153)
T ss_pred eeehHHHHHHHHHHHHHHHhheEEEEecCCCccccccchhHHHHHHHHHHHHHHHHhhcCCCCcEEE
Confidence 4445666677888899999999999999998778899999999988888887777766655544444
No 25
>PF03402 V1R: Vomeronasal organ pheromone receptor family, V1R; InterPro: IPR004072 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The rhodopsin-like GPCRs themselves represent a widespread protein family that includes hormone, neurotransmitter and light receptors, all of which transduce extracellular signals through interaction with guanine nucleotide-binding (G) proteins. Although their activating ligands vary widely in structure and character, the amino acid sequences of the receptors are very similar and are believed to adopt a common structural framework comprising 7 transmembrane (TM) helices [, , ]. Pheromones have evolved in all animal phyla, to signal sex and dominance status, and are responsible for stereotypical social and sexual behaviour among members of the same species. In mammals, these chemical signals are believed to be detected primarily by the vomeronasal organ (VNO), a chemosensory organ located at the base of the nasal septum []. The VNO is present in most amphibia, reptiles and non-primate mammals but is absent in birds, adult catarrhine monkeys and apes []. An active role for the human VNO in the detection of pheromones is disputed; the VNO is clearly present in the foetus but appears to be atrophied or absent in adults. Three distinct families of putative pheromone receptors have been identified in the vomeronasal organ (V1Rs, V2Rs and V3Rs). All are G protein-coupled receptors but are only distantly related to the receptors of the main olfactory system, highlighting their different role []. The V1 receptors share between 50 and 90% sequence identity but have little similarity to other families of G protein-coupled receptors. They appear to be distantly related to the mammalian T2R bitter taste receptors and the rhodopsin-like GPCRs []. In rat, the family comprises 30-40 genes. These are expressed in the apical regions of the VNO, in neurons expressing Gi2. Coupling of the receptors to this protein mediates inositol trisphosphate signalling []. A number of human V1 receptor homologues have also been found. The majority of these human sequences are pseudogenes [] but an apparently functional receptor has been identified that is expressed in the human olfactory system [].; GO: 0016503 pheromone receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane
Probab=96.93 E-value=0.0021 Score=57.12 Aligned_cols=101 Identities=24% Similarity=0.416 Sum_probs=70.5
Q ss_pred CCchHHHHHHHHHHHHHHHHhh-hHHHHHHHHhhcCCcchhHHHHH--HHHHHHHHHHHHHHHHHHHHhheeeeccCc--
Q psy9274 93 MRSTTNLLIINLAIADLLFIVC-CVPFTATDYVTTCWPFGDVWCKM--YLIVVTAYASVYTLVLMSLDRFLAVVHPIA-- 167 (318)
Q Consensus 93 l~~~~~~ll~~LaiaDll~~l~-~~p~~~~~~~~~~~~~~~~~C~~--~~~~~~~~~s~~~l~~isidRy~aI~~P~~-- 167 (318)
..+|.+..+.+||+||+++.+. ++|.. ....... .+++..||+ |+..++-..|+.+.+++|+-+-+.|.-.-.
T Consensus 7 r~kp~dlIl~hLa~aN~lvLl~rGip~~-~~~~~~~-~~~d~gCK~v~Y~~RV~RglSictTCLLSvfQAItISP~~s~w 84 (265)
T PF03402_consen 7 RLKPIDLILIHLALANILVLLSRGIPQT-MAFFGWK-FFDDIGCKIVFYIYRVARGLSICTTCLLSVFQAITISPSNSRW 84 (265)
T ss_pred CCCcHHHHHHHHHHHHHHHHHHhhHHHH-HHHhhcc-cCCCceeeeeeeehHHhchhhHHhhhHHHHHhhEEEecCCchh
Confidence 3579999999999999988764 44433 3332222 368899998 899999999999999999955555542211
Q ss_pred -cccccchhhHHHHHHHHHHHHHHHHhhh
Q psy9274 168 -SMSVRTERNALKAILITWIVIVLTAVPV 195 (318)
Q Consensus 168 -y~~~~t~~~~~~~i~~~W~~s~~~~~p~ 195 (318)
.-+...++.....+.+.|++.+++....
T Consensus 85 ~~lK~k~pk~i~~~~~~~Wilnlli~~~~ 113 (265)
T PF03402_consen 85 AELKVKAPKYIGPSCLFCWILNLLININV 113 (265)
T ss_pred hhhhhhcCCccccHHHHHHHHHHhhhcce
Confidence 1122345566677888899888776543
No 26
>PF02117 7TM_GPCR_Sra: Serpentine type 7TM GPCR chemoreceptor Sra; InterPro: IPR000344 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class a (Sra) from the Sra superfamily []. Sra receptors contain 6-7 hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures.; GO: 0004888 transmembrane signaling receptor activity, 0007606 sensory perception of chemical stimulus, 0016021 integral to membrane
Probab=96.19 E-value=0.055 Score=49.96 Aligned_cols=111 Identities=23% Similarity=0.229 Sum_probs=71.4
Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhccCCCCc-hHHHHHHHHHHHHHHHHhhhHHHHHHHH---hh---c--CC
Q psy9274 58 ERMVSIIVPIFFGIIIILGFFGNALVVVVVAFNQQMRS-TTNLLIINLAIADLLFIVCCVPFTATDY---VT---T--CW 128 (318)
Q Consensus 58 ~~~~~~~~~~~~~ii~~~gi~gN~lvi~~i~~~~~l~~-~~~~ll~~LaiaDll~~l~~~p~~~~~~---~~---~--~~ 128 (318)
......+..++..++.++++++....+-.++++.-..+ ...+++.+|..+++=-.. .+...+... +. + ..
T Consensus 16 tS~~~~~~~~~~~~~~~~t~~~s~~Ai~~l~~ksif~~sTr~LL~~~l~~a~~Hq~~-y~~~~~~~lyrsf~~~~dpC~i 94 (328)
T PF02117_consen 16 TSWNFRINQIFILIIIIITFIFSVFAIKKLWKKSIFSNSTRILLISNLLFANIHQIS-YAIIQIWSLYRSFFYSDDPCSI 94 (328)
T ss_pred hCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCccHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhcCCCCeee
Confidence 34445566667777788888888888888887776665 455666666666653322 221222111 11 1 22
Q ss_pred cchhHHHHH--HHHHHHHHHHHHHHHHHHHHhheeeeccCccc
Q psy9274 129 PFGDVWCKM--YLIVVTAYASVYTLVLMSLDRFLAVVHPIASM 169 (318)
Q Consensus 129 ~~~~~~C~~--~~~~~~~~~s~~~l~~isidRy~aI~~P~~y~ 169 (318)
.+...-|.. .+..++...+++..+++.+||.+|-..|-.|+
T Consensus 95 ~f~~~eC~~~~~~~~~~~~~~~~~q~aL~idRl~at~~~~~~~ 137 (328)
T PF02117_consen 95 LFTSSECFPYYYFYYFTNSGMIFIQFALTIDRLLATFFPKYYS 137 (328)
T ss_pred EeecCceeeeehHHHHHHHHHHHHHHHHHHHHHHHHhchhhhh
Confidence 345566776 56666667788899999999999998886553
No 27
>PF02101 Ocular_alb: Ocular albinism type 1 protein; InterPro: IPR001414 Ocular albinism type 1 (OA1) is an X-linked disorder characterised by severe impairment of visual acuity, retinal hypopigmentation and the presence of macromelanosomes. A novel transcript from the OA1 critical region is expressed in high levels in RNA samples from retina and from melanoma and encodes a potential integral membrane protein []. This protein is of unknown function but is known to bind heterotrimeric G proteins.; GO: 0016020 membrane
Probab=96.14 E-value=0.21 Score=46.26 Aligned_cols=124 Identities=15% Similarity=0.157 Sum_probs=70.8
Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhccCC----CCc--hHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcCC---------
Q psy9274 64 IVPIFFGIIIILGFFGNALVVVVVAFNQQ----MRS--TTNLLIINLAIADLLFIVCCVPFTATDYVTTCW--------- 128 (318)
Q Consensus 64 ~~~~~~~ii~~~gi~gN~lvi~~i~~~~~----l~~--~~~~ll~~LaiaDll~~l~~~p~~~~~~~~~~~--------- 128 (318)
....+.+.-+++|++|-++-++--.|... .++ ...-.+..||+||++.++-.+....+.......
T Consensus 28 ~f~avCLgSs~l~l~gallQLlp~rr~~~~~~~~~sp~~~~rIl~~la~aDlLaclGVivRS~vWl~~p~~~~s~s~~~~ 107 (405)
T PF02101_consen 28 AFNAVCLGSSVLSLLGALLQLLPRRRSAGPRAPARSPSSSRRILFWLAVADLLACLGVIVRSSVWLGFPNFIDSISDVNG 107 (405)
T ss_pred hhhhhHHHHHHHHHHHHHHhhccccccccccccccCCcCCchhHHHHHHHHHHhhhhHHHHhhhhhcCCcccccccCCCC
Confidence 34444455556666666555542221111 111 234688999999999998666555554332211
Q ss_pred --cchhHHHHH--HHHHHHHHHHHHHHHHHHHHhheeeeccCccccccchhhHHHHHHHHHHHHHHHHh
Q psy9274 129 --PFGDVWCKM--YLIVVTAYASVYTLVLMSLDRFLAVVHPIASMSVRTERNALKAILITWIVIVLTAV 193 (318)
Q Consensus 129 --~~~~~~C~~--~~~~~~~~~s~~~l~~isidRy~aI~~P~~y~~~~t~~~~~~~i~~~W~~s~~~~~ 193 (318)
......|.. ....++..++.|-+.+-|+|-|+.+.+--. +.+...-=.++|..+.+++.
T Consensus 108 ~d~wp~afCv~ss~WIq~fYsAtfwWtfcYAVDv~Lv~~~~ag------~~~~~lYH~~aWgl~~lL~~ 170 (405)
T PF02101_consen 108 TDIWPAAFCVGSSMWIQLFYSATFWWTFCYAVDVYLVIRRSAG------RSTIWLYHMMAWGLPALLCA 170 (405)
T ss_pred CccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhheeeccCC------CcchhHHHHHHHHHHHHHHH
Confidence 113467877 344467778888888999999998864321 11122334556766665544
No 28
>PF06681 DUF1182: Protein of unknown function (DUF1182); InterPro: IPR010601 This family consists of several hypothetical proteins of around 360 residues in length and seems to be specific to Caenorhabditis elegans. The function of this family is unknown.
Probab=95.59 E-value=0.075 Score=44.89 Aligned_cols=93 Identities=12% Similarity=0.121 Sum_probs=46.2
Q ss_pred HHHHHHHHHHHHhhhhhhccCCCC--chHHHHHHHHHHHHHHHHhhhHHHHHHHHhh-------cCCcchhHHHHH--HH
Q psy9274 71 IIIILGFFGNALVVVVVAFNQQMR--STTNLLIINLAIADLLFIVCCVPFTATDYVT-------TCWPFGDVWCKM--YL 139 (318)
Q Consensus 71 ii~~~gi~gN~lvi~~i~~~~~l~--~~~~~ll~~LaiaDll~~l~~~p~~~~~~~~-------~~~~~~~~~C~~--~~ 139 (318)
++-..+.+.+.+.++.+.++|++- -...+. .+-+..-+..+|+.+.+.+. +....+-..|.+ -+
T Consensus 49 ~v~~~s~~~s~~L~ifi~~rk~l~~~~ksTIv-----FaT~g~fvLtlP~~lf~~wl~~~l~~~~ep~ytI~vCSflk~f 123 (226)
T PF06681_consen 49 IVNTVSWLFSYFLCIFIWRRKNLIRSYKSTIV-----FATFGQFVLTLPLYLFQTWLVFFLWFGHEPQYTIFVCSFLKNF 123 (226)
T ss_pred HHHhHHHHHHHHHHHHHhhhHHHHHhhcCeEE-----EeeHHHHHHHHHHHHHHHHHHHHHhcCCCceEEEEeehHHHhH
Confidence 344455556666655565555421 111111 12222223345555444322 122334456877 22
Q ss_pred HHHHHHHHHHHHHHHHHHhheeeeccCcc
Q psy9274 140 IVVTAYASVYTLVLMSLDRFLAVVHPIAS 168 (318)
Q Consensus 140 ~~~~~~~s~~~l~~isidRy~aI~~P~~y 168 (318)
...+..++....+++|++||+.|++-.|-
T Consensus 124 ~S~t~s~~~vip~aVAIyRy~~VV~k~rl 152 (226)
T PF06681_consen 124 TSSTTSCAQVIPVAVAIYRYLIVVLKKRL 152 (226)
T ss_pred HHHHhhhhccchhhhhhhhhheeeecccC
Confidence 22222445556668999999999975543
No 29
>PF10326 7TM_GPCR_Str: Serpentine type 7TM GPCR chemoreceptor Str; InterPro: IPR019428 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class r (Str) from the Str superfamily [, ]. Almost a quarter (22.5%) of str and srj family genes and pseudogenes in C. elegans appear to have been newly formed by gene duplications since the species split [].
Probab=95.49 E-value=0.0095 Score=54.29 Aligned_cols=102 Identities=15% Similarity=0.136 Sum_probs=61.0
Q ss_pred HHHHHHHHHHHHHhhhhhhccCCCC-chHHHHHHHHHHHHHHHHhhhHHHHHHHHhhc----------CCcchhHHHHH-
Q psy9274 70 GIIIILGFFGNALVVVVVAFNQQMR-STTNLLIINLAIADLLFIVCCVPFTATDYVTT----------CWPFGDVWCKM- 137 (318)
Q Consensus 70 ~ii~~~gi~gN~lvi~~i~~~~~l~-~~~~~ll~~LaiaDll~~l~~~p~~~~~~~~~----------~~~~~~~~C~~- 137 (318)
.+.++++++.|++.++.+.++.+.+ ..--+++...|+.|++++++-+...-.....+ .+..+....++
T Consensus 6 ~~~~~~s~~~N~~Li~Li~~~s~k~~G~Yk~Lm~~fs~~~i~fs~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~l 85 (307)
T PF10326_consen 6 YIGFVLSLFLNSLLIYLILTKSPKSLGSYKYLMIYFSIFEIIFSILDFLVQPIIHSYGSSFVVFSDSGPFFLSKSIGQIL 85 (307)
T ss_pred HHHHHHHHHHHHHHHHHHHhccCCCCCCEEEEEehhHHHHHHHHHHHHHHHhceeecCCeeEEEeecccccccchheeeh
Confidence 4567789999999998887554333 34455667788888888775543222211111 11122222333
Q ss_pred -HHHHHHHHHHHHHHHHHHHHhheeeeccCccccc
Q psy9274 138 -YLIVVTAYASVYTLVLMSLDRFLAVVHPIASMSV 171 (318)
Q Consensus 138 -~~~~~~~~~s~~~l~~isidRy~aI~~P~~y~~~ 171 (318)
.+...+...++..++.-=+.||.+|++|-+.+..
T Consensus 86 ~~~y~~~f~~~~~llaihFiYRY~av~~~~~l~~f 120 (307)
T PF10326_consen 86 LAIYCGCFGMSFSLLAIHFIYRYWAVCRPKKLRYF 120 (307)
T ss_pred hhhhhhhHHHHHHHHHhHhhhheeeeccchhhccc
Confidence 3333444455666666677999999999865444
No 30
>KOG1832|consensus
Probab=94.88 E-value=0.0087 Score=60.24 Aligned_cols=12 Identities=0% Similarity=0.061 Sum_probs=6.7
Q ss_pred cccccccccccc
Q psy9274 229 NKTIFQSENDYC 240 (318)
Q Consensus 229 ~~~if~i~~~~~ 240 (318)
...+|.+.+-.-
T Consensus 1391 ~~r~yEIGR~r~ 1402 (1516)
T KOG1832|consen 1391 SARMYEIGRRRP 1402 (1516)
T ss_pred hhhhhhhcccCC
Confidence 346666655544
No 31
>KOG4193|consensus
Probab=94.59 E-value=0.28 Score=48.94 Aligned_cols=116 Identities=16% Similarity=0.186 Sum_probs=65.4
Q ss_pred HHHHHHHHHHHHHhhhhhhccCCCCchHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcCCcc-hhH-HHHH--HHHHHHHH
Q psy9274 70 GIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATDYVTTCWPF-GDV-WCKM--YLIVVTAY 145 (318)
Q Consensus 70 ~ii~~~gi~gN~lvi~~i~~~~~l~~~~~~ll~~LaiaDll~~l~~~p~~~~~~~~~~~~~-~~~-~C~~--~~~~~~~~ 145 (318)
.+-+.+++++=++.+++.+--|++++..+....||+++=++.-+ .++.+.|.. ... .|.. ++..++..
T Consensus 327 ~VG~~iSlvcL~lti~ty~~~~~l~~~~~~i~~~l~~~L~l~~l--------~fL~~~~~~~~~~~~C~~~a~llhff~L 398 (610)
T KOG4193|consen 327 TVGCIISLVCLLLTIATYLLFRKLQNDRTKIHINLCLCLFLAEL--------LFLLGIDRTSTSVVLCIAAAILLHFFFL 398 (610)
T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHHHH--------HHhcccccccCcccccHHHHHHHHHHHH
Confidence 33344444444444444444444454447777888887222211 111222221 122 7988 77788888
Q ss_pred HHHHHHHHHHHHhheeeeccCccccccchhhHHHHHHHHHHHHHHHHhhhH
Q psy9274 146 ASVYTLVLMSLDRFLAVVHPIASMSVRTERNALKAILITWIVIVLTAVPVF 196 (318)
Q Consensus 146 ~s~~~l~~isidRy~aI~~P~~y~~~~t~~~~~~~i~~~W~~s~~~~~p~~ 196 (318)
++.+-+.++++.=|+.++.-+ +...+++.....++.|...+++.....
T Consensus 399 aaF~Wm~leg~hl~~~~v~vf---~~~~~~~~l~~~~~gwg~Pavvv~Isa 446 (610)
T KOG4193|consen 399 AAFFWMLLEGFHLYLLLVEVF---RSRPRRRKLLYSLYGWGVPAVVVGVSA 446 (610)
T ss_pred HHHHHHHHHHHHHHHHHHHHh---ccccchHHHHHHHHHhhhhHHHHhhee
Confidence 888889999999888555433 233444555554688888766655443
No 32
>PF10318 7TM_GPCR_Srh: Serpentine type 7TM GPCR chemoreceptor Srh; InterPro: IPR019422 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. Srh is part of the Str superfamily of chemoreceptors [].
Probab=94.47 E-value=0.32 Score=44.14 Aligned_cols=132 Identities=14% Similarity=0.195 Sum_probs=75.4
Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhccCCCC-chHHHHHHH----HHHHHHHHHhhhHHHHHHHHhhc----CCc-c-hh
Q psy9274 64 IVPIFFGIIIILGFFGNALVVVVVAFNQQMR-STTNLLIIN----LAIADLLFIVCCVPFTATDYVTT----CWP-F-GD 132 (318)
Q Consensus 64 ~~~~~~~ii~~~gi~gN~lvi~~i~~~~~l~-~~~~~ll~~----LaiaDll~~l~~~p~~~~~~~~~----~~~-~-~~ 132 (318)
.+..+.-++.++++.-|++.++.+.++...+ +..-..+++ ..+.|+..+++..|....-...+ ... + -+
T Consensus 5 ~~~~~~h~~~~i~~Pi~~~~~y~Il~ktp~~m~~~k~~ll~~~~~~~~~d~~~~~l~~P~~~~P~~~~~~~G~l~~~gv~ 84 (302)
T PF10318_consen 5 FYSIVLHIITIISIPIYIFGFYCILFKTPKSMKSVKWYLLNHHFWNFLLDLYLSFLIIPYFFFPSPAGYPLGILSYFGVP 84 (302)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHheeEeChhhhHHHHHHHHHHHHHHHHHHHHhhhEEeeeeccchHHhhCCcchhhcccC
Confidence 4556667788899999999988877654322 333333333 34677777777777544433222 111 1 01
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhheeeeccCccccccchhhHHHHHHHHHHHHHHHHhhhHh
Q psy9274 133 VWCKMYLIVVTAYASVYTLVLMSLDRFLAVVHPIASMSVRTERNALKAILITWIVIVLTAVPVFN 197 (318)
Q Consensus 133 ~~C~~~~~~~~~~~s~~~l~~isidRy~aI~~P~~y~~~~t~~~~~~~i~~~W~~s~~~~~p~~~ 197 (318)
..-++++..........+++.+--+||.+++..-+ +.. .++.....+++.++++.+..+|.+.
T Consensus 85 ~~~q~~i~~~~~~~~~~S~~~lFe~R~~~i~~~~~-~~~-~~~~r~~~~~~~y~~~~~~~~~~~~ 147 (302)
T PF10318_consen 85 TPIQFYIGVTLLAFVGVSILSLFENRYQIIIRNRS-RSL-WKKIRIIFIIFNYIIAFLFFIPIYL 147 (302)
T ss_pred hHHHHHHHHHHhhhhhHHHHHHHHHHHHHHhcccc-ccc-chhhHHHHHHHHHHHHHHHHHHHhc
Confidence 12223555555566667777778899999986644 121 2222333444555566666666655
No 33
>PF02175 7TM_GPCR_Srb: Serpentine type 7TM GPCR chemoreceptor Srb; InterPro: IPR002184 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class b (Srb) from the Sra superfamily []. Srb receptors contain 6-8 hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures.; GO: 0004888 transmembrane signaling receptor activity, 0007606 sensory perception of chemical stimulus, 0016021 integral to membrane
Probab=93.43 E-value=2.4 Score=36.82 Aligned_cols=42 Identities=10% Similarity=0.066 Sum_probs=34.9
Q ss_pred chhHHHHH--HHHHHHHHHHHHHHHHHHHHhheeeeccCccccc
Q psy9274 130 FGDVWCKM--YLIVVTAYASVYTLVLMSLDRFLAVVHPIASMSV 171 (318)
Q Consensus 130 ~~~~~C~~--~~~~~~~~~s~~~l~~isidRy~aI~~P~~y~~~ 171 (318)
.+...++. ....+.++.+++....+++|||+|+...-+|.+.
T Consensus 80 I~~~~~K~~h~~~~flmT~~ml~PigftIERfiAl~~A~~YE~~ 123 (236)
T PF02175_consen 80 IDPTLFKYGHLTGLFLMTIPMLFPIGFTIERFIALKMAEKYENT 123 (236)
T ss_pred EchHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhccC
Confidence 45566777 6777888899999999999999999998888764
No 34
>PF10319 7TM_GPCR_Srj: Serpentine type 7TM GPCR chemoreceptor Srj; InterPro: IPR019423 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class j (Srj) from the Str superfamily [, ]. The Srj family is designated as the out-group based on its location in preliminary phylogenetic analyses of the entire superfamily [].
Probab=93.18 E-value=1 Score=40.89 Aligned_cols=127 Identities=16% Similarity=0.156 Sum_probs=74.2
Q ss_pred HHHHHHHHHHHHHHHHhhhhhhccCCCC-chHHHHHHHHHHHHHHHHhhhH--HHHHHHH-------hhcCCcchh-HHH
Q psy9274 67 IFFGIIIILGFFGNALVVVVVAFNQQMR-STTNLLIINLAIADLLFIVCCV--PFTATDY-------VTTCWPFGD-VWC 135 (318)
Q Consensus 67 ~~~~ii~~~gi~gN~lvi~~i~~~~~l~-~~~~~ll~~LaiaDll~~l~~~--p~~~~~~-------~~~~~~~~~-~~C 135 (318)
.+--+.++++.+.|-+-++.+...|+.+ ..-.++++.-|+-|++..++-+ |..+..+ ..++|.... ..-
T Consensus 9 ~~Pk~~~~lsf~~Np~fiyli~~~~~~~~G~Yr~LL~~Fa~fn~~~S~~~~~vp~~vh~yry~F~~fi~dG~F~~~s~l~ 88 (310)
T PF10319_consen 9 YIPKIFGILSFIVNPIFIYLIFTEKKSQFGNYRYLLLFFAIFNLIYSVVDLLVPICVHGYRYAFVVFISDGPFFEKSELG 88 (310)
T ss_pred HHHHHHHHHHHHHhhhhheeEEcccccccccHHHHHHHHHHHHHHHHHHHHHhhheeeccceEEEEEEcCCcCcCcchHH
Confidence 3344677788899999999888777655 4677888899999999887554 3322221 112222111 111
Q ss_pred HH--HHHHHHHHHHHHHHHHHHHHhheeeeccCccccccchhhHHHHHHHHHHHHHHHHh
Q psy9274 136 KM--YLIVVTAYASVYTLVLMSLDRFLAVVHPIASMSVRTERNALKAILITWIVIVLTAV 193 (318)
Q Consensus 136 ~~--~~~~~~~~~s~~~l~~isidRy~aI~~P~~y~~~~t~~~~~~~i~~~W~~s~~~~~ 193 (318)
+. .....+..+|-..|.+-=+.||+++++|-..++...+......+..+-+....+..
T Consensus 89 ~~~ls~RCsfIs~sYaIL~~HFvYRYl~l~~~~~~~~~F~p~gl~~s~~~~~~h~~~W~~ 148 (310)
T PF10319_consen 89 QHLLSIRCSFISGSYAILHIHFVYRYLVLFNSKFINKYFMPYGLIGSILYCLFHFASWHV 148 (310)
T ss_pred HHHHHHHHHHHhhhHHHHHHHHHHHHHHHhCcHHHHHhhhhHHHHHHHHHHHHHHHHHHH
Confidence 11 22222333455556666789999999996655555454444444444444444433
No 35
>PF10327 7TM_GPCR_Sri: Serpentine type 7TM GPCR chemoreceptor Sri; InterPro: IPR019429 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents Sri, which is part of the Str superfamily of chemoreceptors.
Probab=92.65 E-value=1.1 Score=40.97 Aligned_cols=128 Identities=11% Similarity=0.186 Sum_probs=70.7
Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhcc-CCCCchHHHHH---HHHHHHHHHHHhhhHHHHHHHHhhcC-------C-cch
Q psy9274 64 IVPIFFGIIIILGFFGNALVVVVVAFN-QQMRSTTNLLI---INLAIADLLFIVCCVPFTATDYVTTC-------W-PFG 131 (318)
Q Consensus 64 ~~~~~~~ii~~~gi~gN~lvi~~i~~~-~~l~~~~~~ll---~~LaiaDll~~l~~~p~~~~~~~~~~-------~-~~~ 131 (318)
.+...+-++++++++.|.+.++.+..+ +++.+-.++++ +...++|+-.++..-|..+.-...+. | ..+
T Consensus 10 ~li~~~~~ig~iS~~~n~~~iyLi~fks~k~~~fry~ll~~Qi~~~l~di~~t~L~qpipLfP~~agy~~G~l~~~f~i~ 89 (303)
T PF10327_consen 10 WLINYYHIIGVISFILNSLGIYLIIFKSPKLDNFRYYLLYFQISCTLTDIHLTFLMQPIPLFPIPAGYCVGFLARYFGIS 89 (303)
T ss_pred HHHHHHHHHHHHHHHHHHHHheeEEEecCCccchhhHHHHHHHHHHHhhhhhhhhccchhhcceeEEEEEeehhhcCCCc
Confidence 455667788999999999998877654 44444333332 23567888888777776666544331 1 122
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHhheeeeccCccccccchhhHHHHHHHHHHHHHHHHh
Q psy9274 132 DVWCKMYLIVVTAYASVYTLVLMSLDRFLAVVHPIASMSVRTERNALKAILITWIVIVLTAV 193 (318)
Q Consensus 132 ~~~C~~~~~~~~~~~s~~~l~~isidRy~aI~~P~~y~~~~t~~~~~~~i~~~W~~s~~~~~ 193 (318)
...|.. +..+....-+-.++.--+-|+.+|.+..+.++. ++.......++..++.++.++
T Consensus 90 ~h~~~~-i~~~l~~~q~~~L~~CF~~KHQaIa~i~~~~~l-~~~~~~~~~~~~~~~p~~~~~ 149 (303)
T PF10327_consen 90 SHYLMT-IWIFLFIFQIESLVICFLRKHQAIARILSKHVL-PKWFYIFFYFFCILFPFFIFI 149 (303)
T ss_pred hHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhhhccCcC-cHHHHHHHhhHHHHHHHHHHH
Confidence 223332 222222233333444456799999988765542 333333344444444444333
No 36
>KOG1832|consensus
Probab=92.58 E-value=0.027 Score=56.84 Aligned_cols=10 Identities=30% Similarity=0.331 Sum_probs=4.4
Q ss_pred HHHHHhheee
Q psy9274 153 LMSLDRFLAV 162 (318)
Q Consensus 153 ~isidRy~aI 162 (318)
+-++||+-.+
T Consensus 1299 VP~Ldqc~Vt 1308 (1516)
T KOG1832|consen 1299 VPSLDQCAVT 1308 (1516)
T ss_pred CccccceEEE
Confidence 3445554333
No 37
>KOG2568|consensus
Probab=92.54 E-value=2 Score=41.71 Aligned_cols=18 Identities=33% Similarity=0.342 Sum_probs=12.8
Q ss_pred hHHHHHHHHHHHHHHHHh
Q psy9274 96 TTNLLIINLAIADLLFIV 113 (318)
Q Consensus 96 ~~~~ll~~LaiaDll~~l 113 (318)
...++.+-+|++|+.+..
T Consensus 339 ~~lf~~ip~ai~d~~f~~ 356 (518)
T KOG2568|consen 339 LILFAALPLAILDAAFIY 356 (518)
T ss_pred hhHHHHHHHHHHHHHHHH
Confidence 356777778888877655
No 38
>PF03344 Daxx: Daxx Family; InterPro: IPR005012 Daxx is a ubiquitously expressed protein that functions, in part, as a transcriptional co-repressor through its interaction with a growing number of nuclear, DNA-associated proteins. Human Daxx contains four structural domains commonly found in transcriptional regulatory proteins: two predicted paired amphipathic helices, an acid-rich domain and a Ser/Pro/Thr (SPT)-rich domain. The post-translational modification status of the SPT-domain of hDaxx regulates its association with transcription factors such as Pax3 and ETS-1, effectively bringing hDaxx to sites of active transcription. Through its presence at the site of active transcription, hDaxx could then be able to associate with acetylated histones present in the nucleosomes and Dek that is associated with chromatin. Through its association with the SPT-domain of hDaxx, histone deacetylases may also be brought to the site of active transcription. As a consequence, nucleosomes in the vicinity of the site of active transcription will have the histone tails deacetylated, allowing the deactylated tail to bind to DNA, thereby leading to an inactive chromatin structure and transcriptional repression []. The Daxx protein (also known as the Fas-binding protein) is thought to play a role in apoptosis as a component of nuclear promyelocytic leukemia protein (PML) oncogenic domains (PODS). Daxx associates with PODs through a direct interaction with PML, a critical component of PODs. The interaction is a dynamic, cell cycle regulated event and is dependent on the post-translational modification of PML by the small ubiquitin-related modifier SUMO-1. ; PDB: 2KZS_A 2KZU_A.
Probab=92.17 E-value=0.042 Score=55.52 Aligned_cols=6 Identities=33% Similarity=0.855 Sum_probs=0.0
Q ss_pred eeEeec
Q psy9274 213 ACVFLD 218 (318)
Q Consensus 213 ~C~~~~ 218 (318)
.|++.+
T Consensus 349 ~~~lt~ 354 (713)
T PF03344_consen 349 GCHLTD 354 (713)
T ss_dssp ------
T ss_pred cccccc
Confidence 344433
No 39
>PF01534 Frizzled: Frizzled/Smoothened family membrane region; InterPro: IPR000539 The frizzled (fz) locus of Drosophila coordinates the cytoskeletons of epidermal cells, producing a parallel array of cuticular hairs and bristles [, ]. In fz mutants, the orientation of individual hairs with respect both to their neighbours and to the organism as a whole is altered. In the wild-type wing, all hairs point towards the distal tip []. In the developing wing, fz has 2 functions: it is required for the proximal-distal transmission of an intracellular polarity signal; and it is required for cells to respond to the polarity signal. Fz produces an mRNA that encodes an integral membrane protein with 7 putative transmembrane (TM) domains. This protein should contain both extracellular and cytoplasmic domains, which could function in the transmission and interpretation of polarity information []. This signature is usually found downstream of the Fz domain (IPR000024 from INTERPRO); GO: 0007166 cell surface receptor linked signaling pathway, 0016020 membrane
Probab=91.82 E-value=3.8 Score=37.80 Aligned_cols=152 Identities=15% Similarity=0.223 Sum_probs=86.9
Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhccCCCCchHHHHHHHHHHHHHHHHhhhHHHHHHHHhh----c---------CCcc
Q psy9274 64 IVPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATDYVT----T---------CWPF 130 (318)
Q Consensus 64 ~~~~~~~ii~~~gi~gN~lvi~~i~~~~~l~~~~~~ll~~LaiaDll~~l~~~p~~~~~~~~----~---------~~~~ 130 (318)
..-....+.+.+.++.+++.++.+...+++-+-+..-++-|+++-++.++..+...+...-. . ....
T Consensus 10 ~~~~wi~~~s~ic~~~t~ftv~Tf~id~~rf~yPerpi~fl~~Cy~~~s~~~l~~~~~g~~~i~C~~~~~~~~~~~~~~~ 89 (328)
T PF01534_consen 10 FARIWIGVWSVICFISTLFTVLTFLIDRSRFRYPERPIIFLSFCYFIVSLGYLIRFILGRESIACRSQDGTERILVQDGL 89 (328)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcccccCCccchhhHHHHHHHHHHHHHHHHHhccCCCcCCCcCCCCceeEeecCC
Confidence 34444555555566666666655554444233345555666777777666444444431100 0 0112
Q ss_pred hhHHHHH--HHHHHHHHHHHHHHHHHHHHhheeeeccCccccccchhhHHHHHHHHHHHHHHHHhhhHhhcCceeeeeCC
Q psy9274 131 GDVWCKM--YLIVVTAYASVYTLVLMSLDRFLAVVHPIASMSVRTERNALKAILITWIVIVLTAVPVFNMHGEVSYTYAS 208 (318)
Q Consensus 131 ~~~~C~~--~~~~~~~~~s~~~l~~isidRy~aI~~P~~y~~~~t~~~~~~~i~~~W~~s~~~~~p~~~~~~~~~~~~~~ 208 (318)
++..|.+ .+.+++..++..--+.+++.=|++......+.. -.+++...-+++|.+.++..+..+.....- .+
T Consensus 90 ~~~~C~~~F~l~Yyf~mAa~~WWviLt~~W~lsa~~kw~~e~--i~~~s~yfH~~aW~iP~~~ti~vL~~~~Vd----gD 163 (328)
T PF01534_consen 90 ENSSCTVVFLLLYYFGMAASLWWVILTLTWFLSAGLKWGSEA--IEKKSSYFHLVAWGIPAVLTIAVLALRKVD----GD 163 (328)
T ss_pred CCCchhhHHHHHHHHHhHHHHHHHHHHHHHHHHhhcccCcch--hhhhcchhhhHHhhhhHHHHHHHHHhcccc----cc
Confidence 4567877 444555555555555567777877765444333 245566678899999999888777654432 24
Q ss_pred CcceeeEeecCCC
Q psy9274 209 ELHTACVFLDLDP 221 (318)
Q Consensus 209 ~~~~~C~~~~~~~ 221 (318)
.....|..-..++
T Consensus 164 ~ltGiC~Vg~~~~ 176 (328)
T PF01534_consen 164 ELTGICFVGNQNP 176 (328)
T ss_pred cccceeEEeCCCH
Confidence 4566787766553
No 40
>TIGR00927 2A1904 K+-dependent Na+/Ca+ exchanger.
Probab=91.79 E-value=0.051 Score=55.69 Aligned_cols=10 Identities=20% Similarity=0.232 Sum_probs=4.2
Q ss_pred HHHHHH-HHHh
Q psy9274 74 ILGFFG-NALV 83 (318)
Q Consensus 74 ~~gi~g-N~lv 83 (318)
++++++ +.++
T Consensus 470 ~LaIv~dDyFV 480 (1096)
T TIGR00927 470 ALAIVCDEYFV 480 (1096)
T ss_pred HHHHHHHHHhH
Confidence 344444 4443
No 41
>KOG4564|consensus
Probab=91.17 E-value=4.5 Score=39.08 Aligned_cols=129 Identities=16% Similarity=0.158 Sum_probs=81.1
Q ss_pred HHHHHHHHHHHHHHHHhhhhhhccCCCCchHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcC------------CcchhHH
Q psy9274 67 IFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATDYVTTC------------WPFGDVW 134 (318)
Q Consensus 67 ~~~~ii~~~gi~gN~lvi~~i~~~~~l~~~~~~ll~~LaiaDll~~l~~~p~~~~~~~~~~------------~~~~~~~ 134 (318)
+++.+-..++++.=++.++++..-|++|-..|++=.||.++=++.++..+........... ...+...
T Consensus 149 ~lytvGyslSl~sL~vAl~If~~FR~L~CtRn~IH~nLF~SfiLra~~~~i~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 228 (473)
T KOG4564|consen 149 ILYTVGYSLSLVSLLVALIIFLYFRSLHCTRNYIHMNLFASFILRAASVLIKDLVLVVNGEQDASSDTSLHCLISSNPVG 228 (473)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccchh
Confidence 4555555555555555666666778899889999999999999888866665554332211 1245678
Q ss_pred HHH--HHHHHHHHHHHHHHHHHHHHhheeeeccCccccccchhhHHHHHHHHHHHHHHHHhhhHhhc
Q psy9274 135 CKM--YLIVVTAYASVYTLVLMSLDRFLAVVHPIASMSVRTERNALKAILITWIVIVLTAVPVFNMH 199 (318)
Q Consensus 135 C~~--~~~~~~~~~s~~~l~~isidRy~aI~~P~~y~~~~t~~~~~~~i~~~W~~s~~~~~p~~~~~ 199 (318)
|+. ++...+..+..+-|.+=.+.=|..|+--... -+++....+++.|.+.+++..+-.+..
T Consensus 229 Ck~~~~~~~Yf~~aNf~WmlvEGlYLhtLlv~~~~s----er~~~~~y~~iGWG~P~v~v~~Wai~r 291 (473)
T KOG4564|consen 229 CKLLFVFFQYFVLANFFWMLVEGLYLHTLLVVAFFS----ERRYFWLYLLIGWGVPAVFVVVWAIVR 291 (473)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhheeecc----CccHHHHHHHHhhCCcHHHhhhHHhHh
Confidence 998 4555555666666665565555444332211 144555677788888777777665543
No 42
>PF03125 Sre: C. elegans Sre G protein-coupled chemoreceptor; InterPro: IPR004151 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class e (Sre) from the Sra superfamily []. ; GO: 0004888 transmembrane signaling receptor activity, 0007606 sensory perception of chemical stimulus, 0016021 integral to membrane
Probab=90.93 E-value=5.5 Score=37.32 Aligned_cols=110 Identities=17% Similarity=0.270 Sum_probs=61.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhhccCCCCchHHHHHHHHHHHHH---HHHhhhHHHHHHHHh---------hcCCc
Q psy9274 62 SIIVPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADL---LFIVCCVPFTATDYV---------TTCWP 129 (318)
Q Consensus 62 ~~~~~~~~~ii~~~gi~gN~lvi~~i~~~~~l~~~~~~ll~~LaiaDl---l~~l~~~p~~~~~~~---------~~~~~ 129 (318)
..++.++-.++.++++.-....+.++.+.|..|.-.+++.+.+...-+ +.-+..+|....... ...|.
T Consensus 26 ~~~~~i~e~~l~~~~~~~~~~~~~i~~~~r~FH~N~~~l~~~~~~~w~e~~i~k~~i~~y~~~~~~~~~~~~~~~~~~~t 105 (365)
T PF03125_consen 26 YIILLIFELILYIICFYFTFKSCYIILKIRQFHRNLNILFIFILLQWFELIIGKILIIPYQIGIITLGDISIGTYIQWWT 105 (365)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhceeeecCccchhhhhhhc
Confidence 455666666666666666666677777777777655555555443332 222323333332211 01111
Q ss_pred ch-----------hHHHHH---HHHHHHHHHHHHHHHHHHHHhheeeeccCccccc
Q psy9274 130 FG-----------DVWCKM---YLIVVTAYASVYTLVLMSLDRFLAVVHPIASMSV 171 (318)
Q Consensus 130 ~~-----------~~~C~~---~~~~~~~~~s~~~l~~isidRy~aI~~P~~y~~~ 171 (318)
.. ....-+ ++....++..++.+.++++||.+|-...--|-+.
T Consensus 106 ~~~~~m~~v~~~~~~~~l~~~~~l~~~y~~~~~~~~~~~~iER~~AT~~i~dYEk~ 161 (365)
T PF03125_consen 106 DDPSEMIPVNSFNDSWPLFIGGFLRWHYMFSAIFCLLAIVIERCFATYFIKDYEKK 161 (365)
T ss_pred cCcccceeeeccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 11 111111 4444445666789999999999999988878663
No 43
>PRK12585 putative monovalent cation/H+ antiporter subunit G; Reviewed
Probab=90.21 E-value=0.14 Score=42.60 Aligned_cols=11 Identities=18% Similarity=0.350 Sum_probs=7.3
Q ss_pred CCcchhHHHHH
Q psy9274 127 CWPFGDVWCKM 137 (318)
Q Consensus 127 ~~~~~~~~C~~ 137 (318)
.+.+++.++++
T Consensus 27 LlRfPD~YtRL 37 (197)
T PRK12585 27 VIRLPDVYTRT 37 (197)
T ss_pred HHhcCcHHHHh
Confidence 34466777877
No 44
>KOG4156|consensus
Probab=89.93 E-value=0.1 Score=51.02 Aligned_cols=10 Identities=30% Similarity=0.265 Sum_probs=4.2
Q ss_pred CCCCCCcCCc
Q psy9274 2 NVPERDVMGK 11 (318)
Q Consensus 2 ~~~~~~~~~~ 11 (318)
|+|-|++-..
T Consensus 303 nlpyhmpenk 312 (1329)
T KOG4156|consen 303 NLPYHMPENK 312 (1329)
T ss_pred cCCccCCCcc
Confidence 4444444333
No 45
>PRK12585 putative monovalent cation/H+ antiporter subunit G; Reviewed
Probab=89.10 E-value=2.6 Score=35.28 Aligned_cols=24 Identities=21% Similarity=0.249 Sum_probs=11.1
Q ss_pred HHHHHHHHHHHHHHHhhhhhhccC
Q psy9274 68 FFGIIIILGFFGNALVVVVVAFNQ 91 (318)
Q Consensus 68 ~~~ii~~~gi~gN~lvi~~i~~~~ 91 (318)
+..++.++|.+..++-.+-++|-+
T Consensus 8 I~~vLLliG~~f~ligaIGLlRfP 31 (197)
T PRK12585 8 IISIMILIGGLLSILAAIGVIRLP 31 (197)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhcC
Confidence 333444455554444444445544
No 46
>KOG0262|consensus
Probab=89.03 E-value=0.21 Score=52.33 Aligned_cols=7 Identities=29% Similarity=0.558 Sum_probs=3.0
Q ss_pred eeccCcc
Q psy9274 162 VVHPIAS 168 (318)
Q Consensus 162 I~~P~~y 168 (318)
+.-|++.
T Consensus 1209 mt~pi~~ 1215 (1640)
T KOG0262|consen 1209 MTVPIKN 1215 (1640)
T ss_pred eeeeccC
Confidence 3444443
No 47
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=88.83 E-value=0.21 Score=43.46 Aligned_cols=16 Identities=13% Similarity=-0.004 Sum_probs=6.8
Q ss_pred HHHHHHHHHHhhhhhh
Q psy9274 73 IILGFFGNALVVVVVA 88 (318)
Q Consensus 73 ~~~gi~gN~lvi~~i~ 88 (318)
.++|.|....=.|+-.
T Consensus 94 ~mvGaiCDfCEawvCH 109 (314)
T PF06524_consen 94 GMVGAICDFCEAWVCH 109 (314)
T ss_pred chhhhhhccchhheec
Confidence 3344444444444443
No 48
>PF10322 7TM_GPCR_Sru: Serpentine type 7TM GPCR chemoreceptor Sru; InterPro: IPR003839 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class u (Sru) from the Srg superfamily [].
Probab=87.56 E-value=14 Score=33.86 Aligned_cols=62 Identities=19% Similarity=0.253 Sum_probs=47.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHhheeeeccCccccccchhhHHHHHHHHHHHHHHHHhhhHhhcC
Q psy9274 138 YLIVVTAYASVYTLVLMSLDRFLAVVHPIASMSVRTERNALKAILITWIVIVLTAVPVFNMHG 200 (318)
Q Consensus 138 ~~~~~~~~~s~~~l~~isidRy~aI~~P~~y~~~~t~~~~~~~i~~~W~~s~~~~~p~~~~~~ 200 (318)
++...+.+++.+.-++.++-|.+.+..|-++.+. .++-....+-++.+++++.++|.+...|
T Consensus 108 ~~~~~~~Y~s~lf~~Lfc~~Rl~il~~p~~~~~i-~~~i~~~~~P~i~i~p~~~~f~~~pa~G 169 (307)
T PF10322_consen 108 FFYYYFNYSSMLFPVLFCLLRLIILYSPRNHKKI-CRKIFRIWIPFIFIYPFCFTFPMFPALG 169 (307)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHheeCccchhHH-HHhHHHHHHHHHHHHHHHHHHHccCCcE
Confidence 6667778899999999999999999999877554 4444456677777778888777765544
No 49
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=85.95 E-value=0.3 Score=42.55 Aligned_cols=7 Identities=43% Similarity=0.391 Sum_probs=3.1
Q ss_pred HHHHHHH
Q psy9274 105 AIADLLF 111 (318)
Q Consensus 105 aiaDll~ 111 (318)
||.|+.=
T Consensus 98 aiCDfCE 104 (314)
T PF06524_consen 98 AICDFCE 104 (314)
T ss_pred hhhccch
Confidence 4445443
No 50
>PF07133 Merozoite_SPAM: Merozoite surface protein (SPAM); InterPro: IPR010784 This entry consists of several Plasmodium falciparum SPAM (secreted polymorphic antigen associated with merozoites) proteins, also know as merozoite surface proteins. Variation among SPAM alleles is the result of deletions and amino acid substitutions in non-repetitive sequences within and flanking the alanine heptad-repeat domain. Heptad repeats in which the a and d position contain hydrophobic residues generate amphipathic alpha-helices which give rise to helical bundles or coiled-coil structures in proteins. SPAM is an example of a P. falciparum antigen in which a repetitive sequence has features characteristic of a well-defined structural element [,].
Probab=85.82 E-value=0.73 Score=38.06 Aligned_cols=8 Identities=13% Similarity=-0.184 Sum_probs=2.9
Q ss_pred ccccccCC
Q psy9274 236 ENDYCKGG 243 (318)
Q Consensus 236 ~~~~~~~~ 243 (318)
++.+.++.
T Consensus 26 v~its~d~ 33 (173)
T PF07133_consen 26 VKITSKDK 33 (173)
T ss_pred eecccccc
Confidence 33333333
No 51
>PF02076 STE3: Pheromone A receptor; InterPro: IPR001499 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). Little is known about the structure and function of the mating factor receptors, STE2 and STE3. It is believed, however, that they are integral membrane proteins that may be involved in the response to mating factors on the cell membrane [, , ]. The amino acid sequences of both receptors contain high proportions of hydrophobic residues grouped into 7 domains, in a manner reminiscent of the rhodopsins and other receptors believed to interact with G-proteins. However, while a similar 3D framework has been proposed to account for this, there is no significant sequence similarity either between STE2 and STE3, or between these and the rhodopsin-type family: the receptors thus bear their own unique '7TM' signatures. The STE3 gene of Saccharomyces cerevisiae (Baker's yeast) is the cell-surface receptor that binds the 13-residue lipopeptide a-factor. Several related fungal pheromone receptor sequences are known: these include pheromone B alpha 1 and B alpha 3, and pheromone B beta 1 receptors from Schizophyllum commune; pheromone receptor 1 from Ustilago hordei; and pheromone receptors 1 and 2 from Ustilago maydis. Members of the family share about 20% sequence identity.; GO: 0004932 mating-type factor pheromone receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane
Probab=84.10 E-value=5.2 Score=36.10 Aligned_cols=102 Identities=17% Similarity=0.355 Sum_probs=52.4
Q ss_pred CchHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhhee-eecc--Ccccc
Q psy9274 94 RSTTNLLIINLAIADLLFIVCCVPFTATDYVTTCWPFGDVWCKMYLIVVTAYASVYTLVLMSLDRFLA-VVHP--IASMS 170 (318)
Q Consensus 94 ~~~~~~ll~~LaiaDll~~l~~~p~~~~~~~~~~~~~~~~~C~~~~~~~~~~~s~~~l~~isidRy~a-I~~P--~~y~~ 170 (318)
..++-.+++.+.++++...+-.+.. --.. ...|+ .+|.+......+.......+..++-|.++ |..+ ....+
T Consensus 23 N~~~~~li~Wl~l~nl~~~INaiIW-~~n~-~~~~~---~wCDI~~kl~~~~~~g~~~a~lcI~r~L~~I~s~r~~~~~~ 97 (283)
T PF02076_consen 23 NIGTCSLIFWLFLSNLIYFINAIIW-RDND-INWWP---VWCDISTKLIIGSSVGIPAASLCIMRRLYRIASARAVSLSR 97 (283)
T ss_pred cHHHHHHHHHHHHHHHHHHHhhhhc-CCCC-CccCc---eeeeehHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCh
Confidence 3577888888988888876632211 1111 12233 78987333333333333333444444333 2222 12222
Q ss_pred ccchhhHHHHHHHHHHHHHHHHhhhHhhcC
Q psy9274 171 VRTERNALKAILITWIVIVLTAVPVFNMHG 200 (318)
Q Consensus 171 ~~t~~~~~~~i~~~W~~s~~~~~p~~~~~~ 200 (318)
..++|+..+=+++++.+.++.....++..+
T Consensus 98 ~~~rr~~~~d~~i~~g~Pil~m~l~yivQ~ 127 (283)
T PF02076_consen 98 SEKRRRIIIDLLICFGIPILQMALHYIVQG 127 (283)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcc
Confidence 223667777777778777776665555444
No 52
>PF03821 Mtp: Golgi 4-transmembrane spanning transporter; InterPro: IPR004687 LAPTM4 and 5 are lysosome-associated transmembrane proteins, found in mammals, insects and nematodes. ; GO: 0016021 integral to membrane
Probab=82.48 E-value=23 Score=30.86 Aligned_cols=15 Identities=7% Similarity=-0.150 Sum_probs=9.5
Q ss_pred HHHhheeeecc--Cccc
Q psy9274 155 SLDRFLAVVHP--IASM 169 (318)
Q Consensus 155 sidRy~aI~~P--~~y~ 169 (318)
++.+|.+.-+. +-++
T Consensus 132 si~~~~~~~~~n~fP~k 148 (233)
T PF03821_consen 132 SILNLKSQNPMNYFPYK 148 (233)
T ss_pred HHHHHHHhcccccCCcc
Confidence 66677777665 4444
No 53
>PF02724 CDC45: CDC45-like protein; InterPro: IPR003874 CDC45 is an essential gene required for initiation of DNA replication in Saccharomyces cerevisiae (cell division control protein 45), forming a complex with MCM5/CDC46. Homologs of CDC45 have been identified in human [], mouse and the smut fungus, Melampsora spp., (tsd2 protein) among others.; GO: 0006270 DNA-dependent DNA replication initiation
Probab=82.11 E-value=0.93 Score=45.58 Aligned_cols=6 Identities=17% Similarity=0.163 Sum_probs=2.3
Q ss_pred eecCCC
Q psy9274 216 FLDLDP 221 (318)
Q Consensus 216 ~~~~~~ 221 (318)
++..-|
T Consensus 81 iDshRP 86 (622)
T PF02724_consen 81 IDSHRP 86 (622)
T ss_pred EeCCCC
Confidence 333334
No 54
>KOG0127|consensus
Probab=81.50 E-value=0.38 Score=46.27 Aligned_cols=6 Identities=33% Similarity=0.761 Sum_probs=2.9
Q ss_pred eeeecc
Q psy9274 160 LAVVHP 165 (318)
Q Consensus 160 ~aI~~P 165 (318)
+-|.-|
T Consensus 145 ~Ei~IP 150 (678)
T KOG0127|consen 145 VEIVIP 150 (678)
T ss_pred EEEEcc
Confidence 444445
No 55
>PF03348 Serinc: Serine incorporator (Serinc); InterPro: IPR005016 This is a family of proteins which display differential expression in various tumour and cell lines. The function of these proteins is unknown. ; GO: 0016020 membrane
Probab=81.17 E-value=31 Score=33.10 Aligned_cols=117 Identities=19% Similarity=0.121 Sum_probs=48.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhhccCCCCchHHHHH--HHHHHHHHHHHhhhHHHHHHHHhhcCCcchhHHHHHHH
Q psy9274 62 SIIVPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLI--INLAIADLLFIVCCVPFTATDYVTTCWPFGDVWCKMYL 139 (318)
Q Consensus 62 ~~~~~~~~~ii~~~gi~gN~lvi~~i~~~~~l~~~~~~ll--~~LaiaDll~~l~~~p~~~~~~~~~~~~~~~~~C~~~~ 139 (318)
...+..+..++.+.++.+.++....+. .. .-..|.++ .||.++=++..+...|..- ...+-+..+ .-
T Consensus 181 ~~~Li~~T~~~y~~si~~~v~~y~~f~-~~--~C~lN~~fIt~nliL~vi~s~lSv~p~Vq-----e~~p~sgLL---qs 249 (429)
T PF03348_consen 181 YIALIGVTLLFYAASIAGIVLMYVFFT-PS--GCSLNKFFITFNLILCVIISVLSVLPKVQ-----EANPRSGLL---QS 249 (429)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHhC-CC--CCchhHHHHHHHHHHHHHHHHHHhhhhhh-----hcCCCcccc---cH
Confidence 334444455555555555544443333 22 22334443 5555555544443333211 111111111 12
Q ss_pred HHHHHHHHHHHHHHHHHHhheeeeccCccccccchhhHHHHHHHHHHHHHH
Q psy9274 140 IVVTAYASVYTLVLMSLDRFLAVVHPIASMSVRTERNALKAILITWIVIVL 190 (318)
Q Consensus 140 ~~~~~~~s~~~l~~isidRy~aI~~P~~y~~~~t~~~~~~~i~~~W~~s~~ 190 (318)
..++.+...++..+++-+- -.-|.|...+..-+.....++-+++++++++
T Consensus 250 svv~~Y~~yL~~SAlss~P-~~~CNp~~~~~~~~~~~~~iig~i~~~~~v~ 299 (429)
T PF03348_consen 250 SVVSLYTTYLTWSALSSEP-DKECNPSGSRSGSWNTWQSIIGLIFTFVSVL 299 (429)
T ss_pred HHHHHHHHHHHHHHHHcCC-CcccCCcccccCCcchHHHHHHHHHHHHHHH
Confidence 2333344444555665555 6677877333322333333333333444433
No 56
>PF10329 DUF2417: Region of unknown function (DUF2417); InterPro: IPR019431 This entry represents a family of fungal proteins with no known function. In some cases these proteins also contain an alpha/beta hydrolase fold (IPR000073 from INTERPRO).
Probab=81.13 E-value=37 Score=29.66 Aligned_cols=16 Identities=25% Similarity=0.283 Sum_probs=11.0
Q ss_pred HHHHHHHHHHHHhhee
Q psy9274 146 ASVYTLVLMSLDRFLA 161 (318)
Q Consensus 146 ~s~~~l~~isidRy~a 161 (318)
+....+..+..||++.
T Consensus 140 a~l~~l~~~~~D~~v~ 155 (232)
T PF10329_consen 140 AFLSSLWGILADRYVE 155 (232)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 4445566778899986
No 57
>PRK14891 50S ribosomal protein L24e/unknown domain fusion protein; Provisional
Probab=81.06 E-value=0.89 Score=35.32 Aligned_cols=6 Identities=0% Similarity=-0.678 Sum_probs=2.3
Q ss_pred cccccc
Q psy9274 228 YNKTIF 233 (318)
Q Consensus 228 ~~~~if 233 (318)
-|+.+|
T Consensus 51 kWT~~y 56 (131)
T PRK14891 51 EWTEAG 56 (131)
T ss_pred hhHHHH
Confidence 333344
No 58
>PRK14891 50S ribosomal protein L24e/unknown domain fusion protein; Provisional
Probab=79.78 E-value=0.5 Score=36.69 Aligned_cols=11 Identities=18% Similarity=0.458 Sum_probs=6.1
Q ss_pred ccccccccccc
Q psy9274 230 KTIFQSENDYC 240 (318)
Q Consensus 230 ~~if~i~~~~~ 240 (318)
-.+|.++++.|
T Consensus 27 Gkvf~FcssKC 37 (131)
T PRK14891 27 GTVLHFVDSKC 37 (131)
T ss_pred CCEEEEecHHH
Confidence 35555666555
No 59
>PTZ00415 transmission-blocking target antigen s230; Provisional
Probab=78.58 E-value=1.1 Score=48.80 Aligned_cols=8 Identities=13% Similarity=0.364 Sum_probs=3.8
Q ss_pred eeeEeecC
Q psy9274 212 TACVFLDL 219 (318)
Q Consensus 212 ~~C~~~~~ 219 (318)
..|++...
T Consensus 112 ~~~~y~~~ 119 (2849)
T PTZ00415 112 ITCFYPIH 119 (2849)
T ss_pred cceeeecc
Confidence 35555443
No 60
>KOG3817|consensus
Probab=77.81 E-value=3.9 Score=37.63 Aligned_cols=7 Identities=14% Similarity=0.425 Sum_probs=3.5
Q ss_pred cCCcchh
Q psy9274 126 TCWPFGD 132 (318)
Q Consensus 126 ~~~~~~~ 132 (318)
+.|.++-
T Consensus 199 GgWs~sl 205 (452)
T KOG3817|consen 199 GGWSISL 205 (452)
T ss_pred ccchhHH
Confidence 4565543
No 61
>PF07423 DUF1510: Protein of unknown function (DUF1510); InterPro: IPR009988 This family consists of several hypothetical bacterial proteins of around 200 residues in length. The function of this family is unknown.
Probab=77.31 E-value=2.1 Score=36.91 Aligned_cols=23 Identities=26% Similarity=0.236 Sum_probs=10.5
Q ss_pred ccchhhHHHHHHHHHHHHHHHHh
Q psy9274 171 VRTERNALKAILITWIVIVLTAV 193 (318)
Q Consensus 171 ~~t~~~~~~~i~~~W~~s~~~~~ 193 (318)
+.+.+..-++|+++-++.++++.
T Consensus 10 RK~N~iLNiaI~IV~lLIiiva~ 32 (217)
T PF07423_consen 10 RKTNKILNIAIGIVSLLIIIVAY 32 (217)
T ss_pred hhhhhhHHHHHHHHHHHHHHHhh
Confidence 34555444555554444333333
No 62
>KOG3064|consensus
Probab=76.78 E-value=1 Score=39.32 Aligned_cols=14 Identities=21% Similarity=0.458 Sum_probs=6.6
Q ss_pred CCccccCCCccccC
Q psy9274 35 IPAICRNFSHEYIN 48 (318)
Q Consensus 35 ~~~~~~n~s~~~~~ 48 (318)
..+.|...|.+..|
T Consensus 34 vTGLCnR~SCPLAN 47 (303)
T KOG3064|consen 34 VTGLCNRSSCPLAN 47 (303)
T ss_pred cceeeccccCcCcc
Confidence 44455555544433
No 63
>COG3125 CyoD Heme/copper-type cytochrome/quinol oxidase, subunit 4 [Energy production and conversion]
Probab=76.52 E-value=32 Score=26.30 Aligned_cols=75 Identities=16% Similarity=0.302 Sum_probs=41.2
Q ss_pred HHHHHHhhhhhhccCCCCchHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcCCcchhHHHHHHHHHHHHHHHHHHHH
Q psy9274 77 FFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATDYVTTCWPFGDVWCKMYLIVVTAYASVYTLV 152 (318)
Q Consensus 77 i~gN~lvi~~i~~~~~l~~~~~~ll~~LaiaDll~~l~~~p~~~~~~~~~~~~~~~~~C~~~~~~~~~~~s~~~l~ 152 (318)
++.-++-+++....---++..-.++..||+.-+++-++++. ..-.--.++|......+.+++.......|+|.+-
T Consensus 28 IiLT~ipF~~vm~~~~~~~~~~~~i~~lA~iQi~vqLvyFl-HM~~~~eg~w~~~~~iFt~~i~vivvvGS~Wvm~ 102 (111)
T COG3125 28 IILTLIPFWVVMTGALSSTVTLIIILGLAVIQILVHLVYFL-HMNTKSEGRWNMGALIFTIFIIVIVVVGSIWVMH 102 (111)
T ss_pred HHHHHHHHHHHHhcccchhhHHHHHHHHHHHHHHHHHHHHh-cccCCcccceehHHHHHHHHHHHHHHHHHHHHHH
Confidence 33333444444333322456777888888877776553221 1111123467777777777666666677777653
No 64
>KOG3064|consensus
Probab=74.24 E-value=0.84 Score=39.78 Aligned_cols=6 Identities=50% Similarity=0.423 Sum_probs=3.2
Q ss_pred hheeee
Q psy9274 158 RFLAVV 163 (318)
Q Consensus 158 Ry~aI~ 163 (318)
|++|+.
T Consensus 123 rklalr 128 (303)
T KOG3064|consen 123 RKLALR 128 (303)
T ss_pred HHHHhc
Confidence 555554
No 65
>KOG2927|consensus
Probab=73.94 E-value=3.1 Score=38.11 Aligned_cols=8 Identities=25% Similarity=0.443 Sum_probs=4.3
Q ss_pred eeccCccc
Q psy9274 162 VVHPIASM 169 (318)
Q Consensus 162 I~~P~~y~ 169 (318)
+++|++.+
T Consensus 205 vLFPLWP~ 212 (372)
T KOG2927|consen 205 VLFPLWPR 212 (372)
T ss_pred HhcccCcH
Confidence 45666543
No 66
>PF05758 Ycf1: Ycf1; InterPro: IPR008896 The chloroplast genomes of most higher plants contain two giant open reading frames designated ycf1 and ycf2. Although the function of Ycf1 is unknown, it is known to be an essential gene [].
Probab=73.60 E-value=41 Score=35.49 Aligned_cols=16 Identities=6% Similarity=0.260 Sum_probs=7.1
Q ss_pred HHHHHHhhhHhhcCce
Q psy9274 187 VIVLTAVPVFNMHGEV 202 (318)
Q Consensus 187 ~s~~~~~p~~~~~~~~ 202 (318)
+++++.+..+++.|+.
T Consensus 202 f~i~~f~~~~~ylgR~ 217 (929)
T PF05758_consen 202 FSIILFITCLYYLGRI 217 (929)
T ss_pred HHHHHHHHHHHHccCC
Confidence 4444444444444543
No 67
>PF11705 RNA_pol_3_Rpc31: DNA-directed RNA polymerase III subunit Rpc31; InterPro: IPR024661 DNA-directed RNA polymerases 2.7.7.6 from EC (also known as DNA-dependent RNA polymerases) are responsible for the polymerisation of ribonucleotides into a sequence complementary to the template DNA. In eukaryotes, there are three different forms of DNA-directed RNA polymerases transcribing different sets of genes. Most RNA polymerases are multimeric enzymes and are composed of a variable number of subunits. The core RNA polymerase complex consists of five subunits (two alpha, one beta, one beta-prime and one omega) and is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme []. The core RNA polymerase complex forms a "crab claw"-like structure with an internal channel running along the full length []. The key functional sites of the enzyme, as defined by mutational and cross-linking analysis, are located on the inner wall of this channel. RNA synthesis follows after the attachment of RNA polymerase to a specific site, the promoter, on the template DNA strand. The RNA synthesis process continues until a termination sequence is reached. The RNA product, which is synthesised in the 5' to 3'direction, is known as the primary transcript. Eukaryotic nuclei contain three distinct types of RNA polymerases that differ in the RNA they synthesise: RNA polymerase I: located in the nucleoli, synthesises precursors of most ribosomal RNAs. RNA polymerase II: occurs in the nucleoplasm, synthesises mRNA precursors. RNA polymerase III: also occurs in the nucleoplasm, synthesises the precursors of 5S ribosomal RNA, the tRNAs, and a variety of other small nuclear and cytosolic RNAs. Eukaryotic cells are also known to contain separate mitochondrial and chloroplast RNA polymerases. Eukaryotic RNA polymerases, whose molecular masses vary in size from 500 to 700 kDa, contain two non-identical large (>100 kDa) subunits and an array of up to 12 different small (less than 50 kDa) subunits. RNA polymerase III contains seventeen subunits in yeasts and in human cells. Twelve of these are akin to RNA polymerase I or II and the other five are RNA polymerase III-specific, and form the functionally distinct groups: (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31, Rpc34 and Rpc82 form a cluster of enzyme-specific subunits that contribute to transcription initiation in Saccharomyces cerevisiae and Homo sapiens. There is evidence that these subunits are anchored at or near the N-terminal Zn-fold of Rpc1, itself prolonged by a highly conserved but RNA polymerase III-specific domain []. This entry represents the Rpc31 subunit.
Probab=71.40 E-value=1.6 Score=38.27 Aligned_cols=6 Identities=33% Similarity=0.573 Sum_probs=2.8
Q ss_pred Hhheee
Q psy9274 157 DRFLAV 162 (318)
Q Consensus 157 dRy~aI 162 (318)
+||...
T Consensus 98 dry~~~ 103 (233)
T PF11705_consen 98 DRYKKK 103 (233)
T ss_pred HHHhhh
Confidence 455444
No 68
>PF14362 DUF4407: Domain of unknown function (DUF4407)
Probab=70.65 E-value=44 Score=30.26 Aligned_cols=53 Identities=23% Similarity=0.373 Sum_probs=28.0
Q ss_pred HHHHHHHHHHHHHhheeeeccCccccccchhhHHHHHHHHHHHHHHHHhhhHh
Q psy9274 145 YASVYTLVLMSLDRFLAVVHPIASMSVRTERNALKAILITWIVIVLTAVPVFN 197 (318)
Q Consensus 145 ~~s~~~l~~isidRy~aI~~P~~y~~~~t~~~~~~~i~~~W~~s~~~~~p~~~ 197 (318)
.+-+|.++.+.+||++...-............+..=+++.-+++++++-|..+
T Consensus 51 ~glvwgl~I~~lDR~ivss~~~~~~~~~~~~~~~~R~~lAvliaivIs~pl~l 103 (301)
T PF14362_consen 51 FGLVWGLVIFNLDRFIVSSIRKSDGSRKRLLQALPRLLLAVLIAIVISEPLEL 103 (301)
T ss_pred HHHHHHHHHHHHHHHHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44677888999999977654332221112222333344444455555555443
No 69
>CHL00204 ycf1 Ycf1; Provisional
Probab=69.48 E-value=91 Score=35.19 Aligned_cols=18 Identities=17% Similarity=0.300 Sum_probs=9.6
Q ss_pred HHHHHHHhhhHhhcCcee
Q psy9274 186 IVIVLTAVPVFNMHGEVS 203 (318)
Q Consensus 186 ~~s~~~~~p~~~~~~~~~ 203 (318)
++++++.+-.+.+.|+.+
T Consensus 217 iFsIllFitcvyYLGR~P 234 (1832)
T CHL00204 217 IFSILLFITCVYYLGRIP 234 (1832)
T ss_pred HHHHHHHHHHHHHcCCCC
Confidence 455555555555555543
No 70
>PF05297 Herpes_LMP1: Herpesvirus latent membrane protein 1 (LMP1); InterPro: IPR007961 This family consists of several latent membrane protein 1 or LMP1s mostly from Epstein-Barr virus (strain GD1) (HHV-4) (Human herpesvirus 4). LMP1 of HHV-4 is a 62-65 kDa plasma membrane protein possessing six membrane spanning regions, a short cytoplasmic N terminus and a long cytoplasmic carboxy tail of 200 amino acids. HHV-4 virus latent membrane protein 1 (LMP1) is essential for HHV-4 mediated transformation and has been associated with several cases of malignancies. HHV-4-like viruses in Macaca fascicularis (Cynomolgus monkeys) have been associated with high lymphoma rates in immunosuppressed monkeys [].; GO: 0019087 transformation of host cell by virus, 0016021 integral to membrane; PDB: 1CZY_E 1ZMS_B.
Probab=67.90 E-value=1.7 Score=38.64 Aligned_cols=19 Identities=37% Similarity=0.525 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHhheeeec
Q psy9274 146 ASVYTLVLMSLDRFLAVVH 164 (318)
Q Consensus 146 ~s~~~l~~isidRy~aI~~ 164 (318)
+|.|++.+..+--++||+.
T Consensus 134 As~WtiLaFcLAF~Laivl 152 (381)
T PF05297_consen 134 ASFWTILAFCLAFLLAIVL 152 (381)
T ss_dssp -------------------
T ss_pred hHHHHHHHHHHHHHHHHHH
Confidence 6777777666666666654
No 71
>PF01102 Glycophorin_A: Glycophorin A; InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=66.34 E-value=5.7 Score=30.98 Aligned_cols=23 Identities=17% Similarity=0.277 Sum_probs=9.1
Q ss_pred HHHHHHHHhhhhhhccCCCCchH
Q psy9274 75 LGFFGNALVVVVVAFNQQMRSTT 97 (318)
Q Consensus 75 ~gi~gN~lvi~~i~~~~~l~~~~ 97 (318)
.|++|=+++|+.++||++.|.+.
T Consensus 75 aGvIg~Illi~y~irR~~Kk~~~ 97 (122)
T PF01102_consen 75 AGVIGIILLISYCIRRLRKKSSS 97 (122)
T ss_dssp HHHHHHHHHHHHHHHHHS-----
T ss_pred HHHHHHHHHHHHHHHHHhccCCC
Confidence 34455455555555544445433
No 72
>COG4547 CobT Cobalamin biosynthesis protein CobT (nicotinate-mononucleotide:5, 6-dimethylbenzimidazole phosphoribosyltransferase) [Coenzyme metabolism]
Probab=63.93 E-value=2.1 Score=40.69 Aligned_cols=11 Identities=27% Similarity=0.344 Sum_probs=4.8
Q ss_pred HHHHHHhhhhh
Q psy9274 77 FFGNALVVVVV 87 (318)
Q Consensus 77 i~gN~lvi~~i 87 (318)
+-||-=+.+++
T Consensus 33 ~agd~~~ev~f 43 (620)
T COG4547 33 IAGDQELEVVF 43 (620)
T ss_pred hcCCccceeee
Confidence 34555443333
No 73
>COG4547 CobT Cobalamin biosynthesis protein CobT (nicotinate-mononucleotide:5, 6-dimethylbenzimidazole phosphoribosyltransferase) [Coenzyme metabolism]
Probab=62.31 E-value=2.5 Score=40.14 Aligned_cols=9 Identities=22% Similarity=0.434 Sum_probs=4.0
Q ss_pred HHHHHHHhh
Q psy9274 151 LVLMSLDRF 159 (318)
Q Consensus 151 l~~isidRy 159 (318)
+..+.-++|
T Consensus 128 ~s~~l~ek~ 136 (620)
T COG4547 128 LSAMLEEKY 136 (620)
T ss_pred hhhhhhhhc
Confidence 334444454
No 74
>PF09882 DUF2109: Predicted membrane protein (DUF2109); InterPro: IPR019214 This entry is found in various hypothetical archaeal proteins and has no known function.
Probab=61.69 E-value=23 Score=25.05 Aligned_cols=49 Identities=27% Similarity=0.381 Sum_probs=35.5
Q ss_pred HHHHHHHHHHhhhhhhccCCCCchHHHHHHHHHHHHHHHHhhhHHHHHH
Q psy9274 73 IILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTAT 121 (318)
Q Consensus 73 ~~~gi~gN~lvi~~i~~~~~l~~~~~~ll~~LaiaDll~~l~~~p~~~~ 121 (318)
.++|+++=..++-++..+.+.++-.++-++|-|++.++....--|+...
T Consensus 4 ~i~g~Iai~~~iR~~~~~~r~~KL~yLnv~~F~iaalIaL~i~~P~g~i 52 (78)
T PF09882_consen 4 IIIGIIAILMAIRIFLTKSRARKLLYLNVINFAIAALIALYIKSPMGAI 52 (78)
T ss_pred HHHHHHHHHHHHHHHHhHhHHHhhhHHHHHHHHHHHHHHHHhCCcHHHH
Confidence 3456666666777777777788888888899999988887755555443
No 75
>PF13042 DUF3902: Protein of unknown function (DUF3902)
Probab=61.43 E-value=85 Score=25.31 Aligned_cols=59 Identities=15% Similarity=0.258 Sum_probs=29.8
Q ss_pred chHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcCCcchhHHHHHHHHHHHHHHHHHHHHH
Q psy9274 95 STTNLLIINLAIADLLFIVCCVPFTATDYVTTCWPFGDVWCKMYLIVVTAYASVYTLVL 153 (318)
Q Consensus 95 ~~~~~ll~~LaiaDll~~l~~~p~~~~~~~~~~~~~~~~~C~~~~~~~~~~~s~~~l~~ 153 (318)
+..+-.+.--.+..+-+++.++.+.+.....+.|..+.-.-...+..+....++.+++.
T Consensus 60 k~~~k~l~kt~~iSF~~avLGiifgI~~qll~~WslsiM~wYWll~LlLyl~tiisLVi 118 (161)
T PF13042_consen 60 KKFSKVLIKTNVISFNFAVLGIIFGIIHQLLGKWSLSIMMWYWLLILLLYLITIISLVI 118 (161)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33333333334444555555666666767777776554444444444444444444443
No 76
>COG5414 TATA-binding protein-associated factor [Transcription]
Probab=59.07 E-value=3 Score=37.12 Aligned_cols=10 Identities=10% Similarity=0.268 Sum_probs=5.6
Q ss_pred eeeeccCccc
Q psy9274 160 LAVVHPIASM 169 (318)
Q Consensus 160 ~aI~~P~~y~ 169 (318)
..+.-|++|.
T Consensus 179 hGlspPl~~V 188 (392)
T COG5414 179 HGLSPPLKYV 188 (392)
T ss_pred ccCCchhHHH
Confidence 3455666653
No 77
>TIGR03869 F420-0_ABCperm proposed F420-0 ABC transporter, permease protein. his small clade of ABC-type transporter permease protein components is found as a three gene cassette along with a periplasmic substrate-binding protein (TIGR03868) and an ATPase (TIGR03873). The organisms containing this cassette are all Actinobacteria and all contain numerous genes requiring the coenzyme F420. This model was defined based on five such organisms, four of which are lacking all F420 biosynthetic capability save the final side-chain polyglutamate attachment step (via the gene cofE: TIGR01916). In Jonesia denitrificans DSM 20603 and marine actinobacterium PHSC20C1 this cassette is in an apparent operon with the cofE gene and, in PHSC20C1, also with an F420-dependent glucose-6-phosphate dehydrogenase (TIGR03554). Based on these observations we propose that this permease protein is a component of a F420-0 (that is, F420 lacking only the polyglutamate tail) transporter.
Probab=58.93 E-value=1.4e+02 Score=27.56 Aligned_cols=34 Identities=18% Similarity=0.169 Sum_probs=21.8
Q ss_pred hhhhhhccCCCCchHHHHHHHHHHHHHHHHhhhH
Q psy9274 83 VVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCV 116 (318)
Q Consensus 83 vi~~i~~~~~l~~~~~~ll~~LaiaDll~~l~~~ 116 (318)
+++.+.++++..++..+.+..++++=++.++..+
T Consensus 124 lv~~l~~~~~~~~~~~liL~Gi~is~~~~a~~~~ 157 (325)
T TIGR03869 124 ATLGLARTLGTLTPTRTILAGLAVSQLCAAVTSF 157 (325)
T ss_pred HHHHHHHhcCCCChhHHHHHHHHHHHHHHHHHHH
Confidence 3444443333346788999999998777766433
No 78
>KOG0943|consensus
Probab=58.92 E-value=3 Score=44.27 Aligned_cols=7 Identities=29% Similarity=0.420 Sum_probs=3.0
Q ss_pred HHHHHHH
Q psy9274 75 LGFFGNA 81 (318)
Q Consensus 75 ~gi~gN~ 81 (318)
+|.+-|+
T Consensus 1459 vGTiRnl 1465 (3015)
T KOG0943|consen 1459 VGTIRNL 1465 (3015)
T ss_pred hhhhhhh
Confidence 3444443
No 79
>PF15383 TMEM237: Transmembrane protein 237
Probab=54.51 E-value=64 Score=28.61 Aligned_cols=51 Identities=8% Similarity=0.157 Sum_probs=29.6
Q ss_pred HHHHHhheeeeccC-ccccccchhhHHHHHHHHHHHHHHHHhhhHhhcCceee
Q psy9274 153 LMSLDRFLAVVHPI-ASMSVRTERNALKAILITWIVIVLTAVPVFNMHGEVSY 204 (318)
Q Consensus 153 ~isidRy~aI~~P~-~y~~~~t~~~~~~~i~~~W~~s~~~~~p~~~~~~~~~~ 204 (318)
+-++|||= +.+.- .+.+...+.....+.+++|+.++++++.....-.+..+
T Consensus 154 VSafDr~d-l~~~~~~~~r~~~~~~~~~lai~ly~~~lvlsls~~~~ddrl~l 205 (253)
T PF15383_consen 154 VSAFDRYD-LAHFSMAHLRGFLKLDPGALAILLYFIALVLSLSCQMTDDRLAL 205 (253)
T ss_pred HhhhhhHH-hhcccHHHHHhhhccCchHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 34568863 33332 22233344455567778888888888776665555444
No 80
>PTZ00359 hypothetical protein; Provisional
Probab=52.54 E-value=1e+02 Score=29.16 Aligned_cols=85 Identities=11% Similarity=0.103 Sum_probs=36.5
Q ss_pred HHHHHHHHHHHhhhhhhccCCCCchHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcCCcchhHHHHHHHHHHHHHHHHHHH
Q psy9274 72 IIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATDYVTTCWPFGDVWCKMYLIVVTAYASVYTL 151 (318)
Q Consensus 72 i~~~gi~gN~lvi~~i~~~~~l~~~~~~ll~~LaiaDll~~l~~~p~~~~~~~~~~~~~~~~~C~~~~~~~~~~~s~~~l 151 (318)
+++.|...-+++.+++...+..++-.+.+..-+|+.-+++.+..+......-+.+.|..+...|-++...+...+|.+.-
T Consensus 259 L~L~G~~vGiLItV~ls~~~~~~~~~~~ly~L~~ll~I~Tv~LL~yiH~k~Pl~Qy~~~d~~~~Gif~Ll~lvl~~t~i~ 338 (443)
T PTZ00359 259 IFLTGTVVGILITVCLSMFPEIKQYQWTLVGLIAMLYILTVALFLYIHAKRPLCQYYNPDVITFGIFALLFLLLVSTLIA 338 (443)
T ss_pred HHHhhhHHHHHHHHHhhhCcccccccchHHHHHHHHHHHHHHHHHHHHhcCchhhcCccchhhHHHHHHHHHHHHHHHHH
Confidence 34444444444444444444333222233333333333333322222222223334665666666655555555666555
Q ss_pred HHHHH
Q psy9274 152 VLMSL 156 (318)
Q Consensus 152 ~~isi 156 (318)
++.-.
T Consensus 339 l~a~~ 343 (443)
T PTZ00359 339 LGAKH 343 (443)
T ss_pred HHHHh
Confidence 44444
No 81
>PF06814 Lung_7-TM_R: Lung seven transmembrane receptor; InterPro: IPR009637 This family represents a conserved region with eukaryotic lung seven transmembrane receptors and related proteins.; GO: 0016021 integral to membrane
Probab=50.97 E-value=1.9e+02 Score=26.04 Aligned_cols=45 Identities=20% Similarity=0.402 Sum_probs=20.6
Q ss_pred HHHHHHHHHHHHHHHHhheeeeccCccccccchhhHHHHHHHHHHHHHHH
Q psy9274 142 VTAYASVYTLVLMSLDRFLAVVHPIASMSVRTERNALKAILITWIVIVLT 191 (318)
Q Consensus 142 ~~~~~s~~~l~~isidRy~aI~~P~~y~~~~t~~~~~~~i~~~W~~s~~~ 191 (318)
+-...+..++++++. - ..|++|.-.++ .++...+.+...+++.+.
T Consensus 124 ~k~~~~~~llllis~-G-ygivkp~L~~~---~~~v~~l~i~~~v~~~i~ 168 (295)
T PF06814_consen 124 LKRTLSFFLLLLISL-G-YGIVKPSLGRR---EKKVLMLVILYFVFSNIA 168 (295)
T ss_pred HHHHHHHHHHHHHhc-c-hheeccccCcc---eeehhHHHHHHHHHHHHH
Confidence 333444444444444 2 45567754332 234444444445544443
No 82
>COG1230 CzcD Co/Zn/Cd efflux system component [Inorganic ion transport and metabolism]
Probab=49.75 E-value=2e+02 Score=26.22 Aligned_cols=57 Identities=16% Similarity=0.100 Sum_probs=36.2
Q ss_pred HHHHHHHHHHHHHHHhhhhhhccCCCCchHHHHHHHH-HHHHHHHHhhhHHHHHHHHhhc
Q psy9274 68 FFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINL-AIADLLFIVCCVPFTATDYVTT 126 (318)
Q Consensus 68 ~~~ii~~~gi~gN~lvi~~i~~~~~l~~~~~~ll~~L-aiaDll~~l~~~p~~~~~~~~~ 126 (318)
.+.+++++|++.|++..+.+.+.+ .+..|.=-..| +++|++..+..+...+...+.+
T Consensus 126 ~ml~va~~GL~vN~~~a~ll~~~~--~~~lN~r~a~LHvl~D~Lgsv~vIia~i~i~~~~ 183 (296)
T COG1230 126 GMLVVAIIGLVVNLVSALLLHKGH--EENLNMRGAYLHVLGDALGSVGVIIAAIVIRFTG 183 (296)
T ss_pred chHHHHHHHHHHHHHHHHHhhCCC--cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhC
Confidence 355678899999999999888762 12233333334 3488888775555555554444
No 83
>PF12304 BCLP: Beta-casein like protein; InterPro: IPR020977 This entry represents eukaryotic proteins that are typically between 216 to 240 amino acids in length which have two conserved sequence motifs: VLR and TRIY. Beta-casein-like protein is associated with cell morphology and a regulation of growth pattern of tumours. It is found in adenocarcinomas of uterine cervical tissues[].
Probab=47.04 E-value=1.7e+02 Score=24.49 Aligned_cols=113 Identities=12% Similarity=0.128 Sum_probs=55.7
Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhccCCCCchHHHHHHHHHHHHHHHHhhhHHHHHHHH----hhcCCcchhHHHHHHH
Q psy9274 64 IVPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATDY----VTTCWPFGDVWCKMYL 139 (318)
Q Consensus 64 ~~~~~~~ii~~~gi~gN~lvi~~i~~~~~l~~~~~~ll~~LaiaDll~~l~~~p~~~~~~----~~~~~~~~~~~C~~~~ 139 (318)
+.-++...-.++++.+- ++.+++.|+++ +++-.+-++.+++.+.+++..+..-.++.. ..+... --..|.+
T Consensus 42 vsNiisv~Sgll~I~~G-I~AIvlSrnl~-~~~L~W~Ll~~S~ln~LlSaAc~vGL~~ai~~Tv~~~Gr~-Ll~~C~f-- 116 (188)
T PF12304_consen 42 VSNIISVTSGLLSIICG-IVAIVLSRNLR-NRPLHWTLLVVSLLNALLSAACAVGLLLAISLTVANQGRN-LLAGCNF-- 116 (188)
T ss_pred HHHHHHHHHHHHHHHHh-HHHHhhhccCC-CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCchh-HhhcCCC--
Confidence 33344444445555544 33344455554 567778888888877766543322211111 111110 0001111
Q ss_pred HHHHHHHHHHHHHHHHHHhheeeeccCccccccchhhHHHHHHHHHHHHHHHHhhhHhhc
Q psy9274 140 IVVTAYASVYTLVLMSLDRFLAVVHPIASMSVRTERNALKAILITWIVIVLTAVPVFNMH 199 (318)
Q Consensus 140 ~~~~~~~s~~~l~~isidRy~aI~~P~~y~~~~t~~~~~~~i~~~W~~s~~~~~p~~~~~ 199 (318)
.. ..++. |+.+.--+-+.|.+-.-++.|+.+++++..-..+.
T Consensus 117 ------------~~--~~~~~----~it~dCPFDpTRIY~TtL~LW~ps~ll~~~EavfS 158 (188)
T PF12304_consen 117 ------------TN--LNALS----SITNDCPFDPTRIYDTTLALWIPSILLSAVEAVFS 158 (188)
T ss_pred ------------CC--hhhcc----CCCCCCCCCcchHHHhHHHHHHHHHHHHHHHHHHH
Confidence 00 01222 33344445667778888889999888877655443
No 84
>KOG1307|consensus
Probab=45.94 E-value=9 Score=36.69 Aligned_cols=10 Identities=0% Similarity=0.444 Sum_probs=4.1
Q ss_pred HHHHHHHHHH
Q psy9274 64 IVPIFFGIII 73 (318)
Q Consensus 64 ~~~~~~~ii~ 73 (318)
+.+.+|.+++
T Consensus 61 v~g~iYmFvA 70 (588)
T KOG1307|consen 61 VIGLIYMFVA 70 (588)
T ss_pred hhhHHHHHHH
Confidence 3344444433
No 85
>PRK10582 cytochrome o ubiquinol oxidase subunit IV; Provisional
Probab=45.69 E-value=1.3e+02 Score=22.87 Aligned_cols=18 Identities=17% Similarity=0.242 Sum_probs=10.2
Q ss_pred hHHHHHHHHHHHHHHHHh
Q psy9274 96 TTNLLIINLAIADLLFIV 113 (318)
Q Consensus 96 ~~~~ll~~LaiaDll~~l 113 (318)
.....++.+|+.-+++=+
T Consensus 45 ~~~~~i~~lA~vQi~VqL 62 (109)
T PRK10582 45 VILGTILAMAVVQILVHL 62 (109)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 345566667765555443
No 86
>PRK09535 btuC corrinoid ABC transporter permease; Reviewed
Probab=44.96 E-value=2.3e+02 Score=26.70 Aligned_cols=34 Identities=21% Similarity=0.212 Sum_probs=20.8
Q ss_pred hhhhhhccCCCCchHHHHHHHHHHHHHHHHhhhH
Q psy9274 83 VVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCV 116 (318)
Q Consensus 83 vi~~i~~~~~l~~~~~~ll~~LaiaDll~~l~~~ 116 (318)
+++.+.++++..++..+.+...+++=++.++..+
T Consensus 168 lv~~la~~~~~~~~~~LiL~Giavs~~~~Al~s~ 201 (366)
T PRK09535 168 GVYAIATENGRTPVATLLLAGVAVQTFLGAVISY 201 (366)
T ss_pred HHHHHHhcCCCCCcchhhHHHHHHHHHHHHHHHH
Confidence 3344443333334457899999988887776544
No 87
>PF01032 FecCD: FecCD transport family; InterPro: IPR000522 This is a subfamily of bacterial binding-protein-dependent transport systems family, and includes transport system permease proteins involved in the transport across the membrane of several compounds. This entry contains the inner components of this multicomponent transport system.; GO: 0005215 transporter activity, 0016020 membrane; PDB: 4DBL_A 1L7V_B 2QI9_B 2NQ2_A.
Probab=44.82 E-value=91 Score=28.51 Aligned_cols=39 Identities=18% Similarity=0.217 Sum_probs=24.6
Q ss_pred HhhhhhhccCCCCchHHHHHHHHHHHHHHHHhhhHHHHH
Q psy9274 82 LVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTA 120 (318)
Q Consensus 82 lvi~~i~~~~~l~~~~~~ll~~LaiaDll~~l~~~p~~~ 120 (318)
++++.+.++++..++..+.+...+++=++.++..+....
T Consensus 115 ~lv~~l~~~~~~~~~~~liL~Gi~i~~~~~al~~~l~~~ 153 (311)
T PF01032_consen 115 LLVYLLSKRRGGFSPLRLILAGIAISALFSALTSFLLLL 153 (311)
T ss_dssp HHHHHCHT-TTT--HHHHHHHHHHHHHHHHHHHHHHHHT
T ss_pred HHHhhhhcccCCCChhHHHHHHHHHHHHHHHHHHHHHhh
Confidence 334444445555688999999999998888775554443
No 88
>PRK10440 iron-enterobactin transporter permease; Provisional
Probab=43.99 E-value=2.5e+02 Score=25.99 Aligned_cols=24 Identities=21% Similarity=0.257 Sum_probs=18.3
Q ss_pred CchHHHHHHHHHHHHHHHHhhhHH
Q psy9274 94 RSTTNLLIINLAIADLLFIVCCVP 117 (318)
Q Consensus 94 ~~~~~~ll~~LaiaDll~~l~~~p 117 (318)
.++..+.+...+++-++.++..+.
T Consensus 142 ~~~~~liL~Gi~is~~~~al~~~l 165 (330)
T PRK10440 142 IDTFRLIIIGIGVRAMLVAFNTWL 165 (330)
T ss_pred CChhHHHHHHHHHHHHHHHHHHHH
Confidence 567889999999988877765443
No 89
>PF09184 PPP4R2: PPP4R2; InterPro: IPR015267 PPP4R2 (protein phosphatase 4 core regulatory subunit R2) is the regulatory subunit of the histone H2A phosphatase complex. It has been shown to confer resistance to the anticancer drug cisplatin in yeast [], and may confer resistance in higher eukaryotes.
Probab=43.54 E-value=6.9 Score=35.40 Aligned_cols=15 Identities=20% Similarity=0.445 Sum_probs=10.4
Q ss_pred HHHHHHhheeeeccC
Q psy9274 152 VLMSLDRFLAVVHPI 166 (318)
Q Consensus 152 ~~isidRy~aI~~P~ 166 (318)
.+-|++|.+.|+..+
T Consensus 120 ~~~alek~~~V~s~~ 134 (288)
T PF09184_consen 120 FLRALEKVVLVTSTV 134 (288)
T ss_pred HHHHHheeEEEeeec
Confidence 356788888887544
No 90
>PF06213 CobT: Cobalamin biosynthesis protein CobT; InterPro: IPR006538 These proteins are CobT subunits of the aerobic cobalt chelatase (aerobic cobalamin biosynthesis pathway). Pseudomonas denitrificans CobT has been experimentally characterised [, ]. Aerobic cobalt chelatase consists of three subunits, CobT, CobN (IPR003672 from INTERPRO) and CobS (IPR006537 from INTERPRO). Cobalamin (vitamin B12) can be complexed with metal via the ATP-dependent reactions (aerobic pathway) (e.g., in P. denitrificans) or via ATP-independent reactions (anaerobic pathway) (e.g., in Salmonella typhimurium) [, ]. The corresponding cobalt chelatases are not homologous. However, aerobic cobalt chelatase subunits CobN and CobS are homologous to Mg-chelatase subunits BchH and BchI, respectively []. CobT, too, has been found to be remotely related to the third subunit of Mg-chelatase, BchD (involved in bacteriochlorophyll synthesis, e.g., in Rhodobacter capsulatus) []. Nomenclature note: CobT of the aerobic pathway P. denitrificans is not a homologue of CobT of the anaerobic pathway (Salmonella typhimurium, Escherichia coli). Therefore, annotation of any members of this family as nicotinate-mononucleotide--5,6-dimethylbenzimidazole phosphoribosyltransferases is erroneous.
Probab=42.99 E-value=7.7 Score=34.97 Aligned_cols=8 Identities=13% Similarity=0.119 Sum_probs=3.4
Q ss_pred HHHhheee
Q psy9274 155 SLDRFLAV 162 (318)
Q Consensus 155 sidRy~aI 162 (318)
...|+...
T Consensus 150 lr~rl~g~ 157 (282)
T PF06213_consen 150 LRERLTGQ 157 (282)
T ss_pred HHHHHhCC
Confidence 34444433
No 91
>PF10110 GPDPase_memb: Membrane domain of glycerophosphoryl diester phosphodiesterase; InterPro: IPR018476 Members of this family comprise the membrane domain of the prokaryotic enzyme glycerophosphoryl diester phosphodiesterase [].
Probab=40.44 E-value=77 Score=25.36 Aligned_cols=44 Identities=9% Similarity=0.144 Sum_probs=27.0
Q ss_pred HHHHHHHHHHHhheeeeccCccccccchhhHHHHHHHHHHHHHHH
Q psy9274 147 SVYTLVLMSLDRFLAVVHPIASMSVRTERNALKAILITWIVIVLT 191 (318)
Q Consensus 147 s~~~l~~isidRy~aI~~P~~y~~~~t~~~~~~~i~~~W~~s~~~ 191 (318)
-++.+-.+.+++. .+...++.+...|+++....+....+..++.
T Consensus 103 ~if~lp~~vle~~-~~~~A~k~Sw~ltk~~~~~~~~~~l~~~~~~ 146 (149)
T PF10110_consen 103 LIFVLPLIVLENK-SFKEALKESWQLTKGRFWRILGRLLLLFIII 146 (149)
T ss_pred HHHHHHHHHHcCC-CHHHHHHHHHHHHcCcHHHHHHHHHHHHHHH
Confidence 3455667777777 6666666666667776665555554444443
No 92
>KOG2051|consensus
Probab=40.09 E-value=6.7 Score=41.12 Aligned_cols=9 Identities=44% Similarity=0.412 Sum_probs=3.8
Q ss_pred HHHHHHHHh
Q psy9274 105 AIADLLFIV 113 (318)
Q Consensus 105 aiaDll~~l 113 (318)
|+|+++.++
T Consensus 685 ~lA~llaGL 693 (1128)
T KOG2051|consen 685 ALASLLAGL 693 (1128)
T ss_pred HHHHHHHHH
Confidence 344444443
No 93
>KOG3817|consensus
Probab=39.69 E-value=2.2e+02 Score=26.58 Aligned_cols=38 Identities=21% Similarity=0.385 Sum_probs=21.6
Q ss_pred HHHHHHHHHHhhhhhh-ccCCCCc----------hHHHHHHHHHHHHHH
Q psy9274 73 IILGFFGNALVVVVVA-FNQQMRS----------TTNLLIINLAIADLL 110 (318)
Q Consensus 73 ~~~gi~gN~lvi~~i~-~~~~l~~----------~~~~ll~~LaiaDll 110 (318)
+.+|++.-+++++.+. |.-..++ ...+|++..+..|+=
T Consensus 168 ~v~GilaSLl~Viflv~rf~PKkt~~~~iliGgWs~slY~i~ql~~nLq 216 (452)
T KOG3817|consen 168 IVIGILASLLVVIFLVARFFPKKTMMYGILIGGWSISLYVIKQLADNLQ 216 (452)
T ss_pred cHHHHHHHHHHHHHHHHHhcccccceEEEEEccchhHHHHHHHHHHHHH
Confidence 4567777776554443 3222223 456777777777763
No 94
>KOG1307|consensus
Probab=39.60 E-value=11 Score=36.25 Aligned_cols=10 Identities=30% Similarity=0.547 Sum_probs=4.1
Q ss_pred HHHHHHHHHH
Q psy9274 99 LLIINLAIAD 108 (318)
Q Consensus 99 ~ll~~LaiaD 108 (318)
++.=-|+++|
T Consensus 85 vItEkL~iSd 94 (588)
T KOG1307|consen 85 VITEKLGISD 94 (588)
T ss_pred HHHHHhcCcc
Confidence 3333344444
No 95
>KOG2140|consensus
Probab=39.44 E-value=21 Score=34.88 Aligned_cols=8 Identities=13% Similarity=0.497 Sum_probs=3.3
Q ss_pred HHhheeee
Q psy9274 156 LDRFLAVV 163 (318)
Q Consensus 156 idRy~aI~ 163 (318)
++|+-+|.
T Consensus 318 fErlR~IL 325 (739)
T KOG2140|consen 318 FERLRYIL 325 (739)
T ss_pred HHHHHHHH
Confidence 34444443
No 96
>KOG1619|consensus
Probab=39.21 E-value=2.7e+02 Score=24.48 Aligned_cols=28 Identities=18% Similarity=0.088 Sum_probs=17.0
Q ss_pred heeeeccCccccccchhhHHHHHHHHHH
Q psy9274 159 FLAVVHPIASMSVRTERNALKAILITWI 186 (318)
Q Consensus 159 y~aI~~P~~y~~~~t~~~~~~~i~~~W~ 186 (318)
|+.=+-|..||..+-+.++..-+.+-|+
T Consensus 148 fl~pg~~~~~Rs~lmP~H~~~Gl~~f~l 175 (245)
T KOG1619|consen 148 FLFPGSPESYRSRLMPWHVFLGLAIFIL 175 (245)
T ss_pred HhcCCCCccHHhhhhhHHHHHHHHHHHH
Confidence 5666777777776666665544444443
No 97
>KOG2302|consensus
Probab=38.70 E-value=1.6e+02 Score=31.48 Aligned_cols=118 Identities=18% Similarity=0.274 Sum_probs=61.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhhccCCCCchHHHHHHHHHHHHHHHHhhhHHHHHHH----HhhcCCcchhHHHH
Q psy9274 61 VSIIVPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATD----YVTTCWPFGDVWCK 136 (318)
Q Consensus 61 ~~~~~~~~~~ii~~~gi~gN~lvi~~i~~~~~l~~~~~~ll~~LaiaDll~~l~~~p~~~~~----~~~~~~~~~~~~C~ 136 (318)
++.+..-.+-.+..+-|+.|.+.+-.+.--..+.--..---+--|+-|++++.+.+-..+.. ++.....+++.+-+
T Consensus 73 lr~vcnpwfe~vsmlvillncvtlgmfrpced~~c~s~rc~ilqafddfifaffavemv~kmvalgifgkkcylgdtwnr 152 (1956)
T KOG2302|consen 73 LRMVCNPWFECVSMLVILLNCVTLGMFRPCEDMDCLSDRCKILQAFDDFIFAFFAVEMVLKMVALGIFGKKCYLGDTWNR 152 (1956)
T ss_pred HHhhccHHHHHHHHHHHHHhhhhhcccccchhcccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCchhh
Confidence 34455555566777778888777766543222221111122224677777766544333322 22334556777777
Q ss_pred H-HHHHHHHHH--H-HHHHHHHHHHhheeeeccCccccccchhhHH
Q psy9274 137 M-YLIVVTAYA--S-VYTLVLMSLDRFLAVVHPIASMSVRTERNAL 178 (318)
Q Consensus 137 ~-~~~~~~~~~--s-~~~l~~isidRy~aI~~P~~y~~~~t~~~~~ 178 (318)
+ |+.++...+ | -+.-+-++.-|-+.|.+|++.-.++..-|+.
T Consensus 153 ldffiv~agm~eysldlqnvslsairtvrvlrplrainrvpsmril 198 (1956)
T KOG2302|consen 153 LDFFIVMAGMVEYSLDLQNVSLSAIRTVRVLRPLRAINRVPSMRIL 198 (1956)
T ss_pred hhhhheehhhhhhcccccccchhhhhhhhhhhhhhHhccCchHHHH
Confidence 7 444333221 1 1222345556888899999875444443433
No 98
>TIGR01651 CobT cobaltochelatase, CobT subunit. This model describes the aerobic cobalamin pathway Pseudomonas denitrificans CobT gene product, which is a cobalt chelatase subunit, with a MW ~70 kDa. The aerobic pathway cobalt chelatase is a heterotrimeric, ATP-dependent enzyme that catalyzes cobalt insertion during cobalamin biosynthesis. The other two subunits are the P. denitrificans CobS (TIGR01650) and CobN (pfam02514 CobN/Magnesium Chelatase) proteins. To avoid potential confusion with the nonhomologous Salmonella typhimurium/E.coli cobT gene product, the P. denitrificans gene symbol is not used in the name of this model.
Probab=38.41 E-value=11 Score=37.34 Aligned_cols=7 Identities=14% Similarity=0.273 Sum_probs=2.9
Q ss_pred HHHHHhh
Q psy9274 153 LMSLDRF 159 (318)
Q Consensus 153 ~isidRy 159 (318)
++--+||
T Consensus 111 a~le~~~ 117 (600)
T TIGR01651 111 AMLEAKY 117 (600)
T ss_pred HHHHHHH
Confidence 3334444
No 99
>TIGR01477 RIFIN variant surface antigen, rifin family. This model represents the rifin branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of rifin sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 20 bits.
Probab=38.32 E-value=45 Score=30.96 Aligned_cols=30 Identities=20% Similarity=0.278 Sum_probs=21.6
Q ss_pred HHHHHHHHHHHHHHHHHHhhhhhhccCCCC
Q psy9274 65 VPIFFGIIIILGFFGNALVVVVVAFNQQMR 94 (318)
Q Consensus 65 ~~~~~~ii~~~gi~gN~lvi~~i~~~~~l~ 94 (318)
-+++..+++++.++.-.++|+.++|+||.+
T Consensus 310 t~IiaSiIAIvvIVLIMvIIYLILRYRRKK 339 (353)
T TIGR01477 310 TPIIASIIAILIIVLIMVIIYLILRYRRKK 339 (353)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhcc
Confidence 456667777777777777888888777644
No 100
>COG2245 Predicted membrane protein [Function unknown]
Probab=37.85 E-value=2.4e+02 Score=23.45 Aligned_cols=32 Identities=25% Similarity=0.313 Sum_probs=16.0
Q ss_pred HHHHHHHHHhhhh---hhccCCCCchHHHHHHHHH
Q psy9274 74 ILGFFGNALVVVV---VAFNQQMRSTTNLLIINLA 105 (318)
Q Consensus 74 ~~gi~gN~lvi~~---i~~~~~l~~~~~~ll~~La 105 (318)
+++++|=.++++. +.+.-.-++.+|.++....
T Consensus 29 ilSlVG~VLvlval~~iS~~~ge~~IF~~yl~~~v 63 (182)
T COG2245 29 ILSLVGLVLVLVALYMISKQVGEDRIFNNYLIAVV 63 (182)
T ss_pred HHHHHHHHHHHHHHHHHHHHhChHHHHHHHHHHHH
Confidence 3444444444443 4333334456777777633
No 101
>KOG1292|consensus
Probab=37.82 E-value=2.1e+02 Score=27.86 Aligned_cols=26 Identities=19% Similarity=0.409 Sum_probs=19.8
Q ss_pred ccccchhhHHHHHHHHHHHHHHHHhh
Q psy9274 169 MSVRTERNALKAILITWIVIVLTAVP 194 (318)
Q Consensus 169 ~~~~t~~~~~~~i~~~W~~s~~~~~p 194 (318)
++++.+....+.++++|+++.++.+-
T Consensus 201 ~~if~~f~vll~i~ivW~~~~iLT~t 226 (510)
T KOG1292|consen 201 IQIFSRFPVLLAIAIVWLYCFILTIT 226 (510)
T ss_pred cchHhhccHHHHHHHHHHHHHHHHhc
Confidence 44566667778899999999887663
No 102
>PF05478 Prominin: Prominin; InterPro: IPR008795 The prominins are an emerging family of proteins that, among the multispan membrane proteins, display a novel topology. Mouse and Homo sapiens prominin and (Mus musculus) prominin-like 1 (PROML1) are predicted to contain five membrane spanning domains, with an N-terminal domain exposed to the extracellular space followed by four, alternating small cytoplasmic and large extracellular, loops and a cytoplasmic C-terminal domain []. The exact function of prominin is unknown although in humans defects in PROM1, the gene coding for prominin, cause retinal degeneration [].; GO: 0016021 integral to membrane
Probab=37.25 E-value=1.2e+02 Score=31.91 Aligned_cols=22 Identities=18% Similarity=0.294 Sum_probs=15.1
Q ss_pred HHHHHHHHHhheeeeccCcccc
Q psy9274 149 YTLVLMSLDRFLAVVHPIASMS 170 (318)
Q Consensus 149 ~~l~~isidRy~aI~~P~~y~~ 170 (318)
+...++...=|..+|.|+..+.
T Consensus 481 ~~~Fl~G~~~~~lvC~p~~~~~ 502 (806)
T PF05478_consen 481 LFYFLVGGNTYTLVCQPLESNE 502 (806)
T ss_pred HHHHHHHhhhheeeecCCCccc
Confidence 3344556677899999995444
No 103
>COG0109 CyoE Polyprenyltransferase (cytochrome oxidase assembly factor) [Posttranslational modification, protein turnover, chaperones]
Probab=37.09 E-value=1.1e+02 Score=27.99 Aligned_cols=23 Identities=22% Similarity=0.310 Sum_probs=14.8
Q ss_pred HHHHHHHHHHHHhheeeeccCcc
Q psy9274 146 ASVYTLVLMSLDRFLAVVHPIAS 168 (318)
Q Consensus 146 ~s~~~l~~isidRy~aI~~P~~y 168 (318)
--.|.+...--|-|-+---|...
T Consensus 190 pHfwALAl~~~~DY~~AgiPMlP 212 (304)
T COG0109 190 PHFWALALKYKDDYKAAGIPMLP 212 (304)
T ss_pred HHHHHHHHHHHHHHHHcCCCccc
Confidence 44556666666778877777643
No 104
>KOG3609|consensus
Probab=36.88 E-value=1.9e+02 Score=30.01 Aligned_cols=76 Identities=18% Similarity=0.249 Sum_probs=38.0
Q ss_pred HHH-HHHHHhhcCCcchhHHHHH---HHHHHHHHHHHHHH-HHHHHHhheeeeccCccccccchhhHHHHHHHHHHHHHH
Q psy9274 116 VPF-TATDYVTTCWPFGDVWCKM---YLIVVTAYASVYTL-VLMSLDRFLAVVHPIASMSVRTERNALKAILITWIVIVL 190 (318)
Q Consensus 116 ~p~-~~~~~~~~~~~~~~~~C~~---~~~~~~~~~s~~~l-~~isidRy~aI~~P~~y~~~~t~~~~~~~i~~~W~~s~~ 190 (318)
+|+ .+.+.+......|...|+- |+.....+...+.+ ++-+.+++. .+. ++..-..-..+=.+++.|+++++
T Consensus 323 ~P~~~l~yllap~S~~G~~~r~PfmKFi~H~~Sy~~FL~LL~l~s~~~~~---~~~-~~~~g~~p~~vE~li~~wV~G~i 398 (822)
T KOG3609|consen 323 FPMPSLVYLLAPMSRKGTTMRKPFMKFIAHITSYLVFLILLILASLIGFY---FAW-TDSRGVTPQALEILIYLWVMGLI 398 (822)
T ss_pred hHHHHHHHHhCCCCcccchhhchHHHHHHHHHHHHHHHHHHHHHHHhhhh---hhh-hcccCCCCcHHHHHHHHHHHHHH
Confidence 343 3444444455667777765 44433333333333 333444443 222 32222334455577778888887
Q ss_pred HHhhh
Q psy9274 191 TAVPV 195 (318)
Q Consensus 191 ~~~p~ 195 (318)
++=..
T Consensus 399 w~E~k 403 (822)
T KOG3609|consen 399 WEEIK 403 (822)
T ss_pred HHHHH
Confidence 76443
No 105
>KOG2632|consensus
Probab=36.80 E-value=1.7e+02 Score=26.03 Aligned_cols=37 Identities=16% Similarity=0.225 Sum_probs=26.0
Q ss_pred hhhccCCCCchHHHHHHHHHHHHHHHHhhhHHHHHHH
Q psy9274 86 VVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATD 122 (318)
Q Consensus 86 ~i~~~~~l~~~~~~ll~~LaiaDll~~l~~~p~~~~~ 122 (318)
-+.+++++++-..+-+.|+++-.+++....++.....
T Consensus 46 ~~l~~~ql~RL~Ty~l~H~s~~hllfnmlaL~~~g~~ 82 (258)
T KOG2632|consen 46 ELLINWQLYRLITYALVHLSLPHLLFNMLALWPLGSQ 82 (258)
T ss_pred HHhhhHHHHHHHHHHHHhccHHHHHHHHHHHHhchhH
Confidence 3456677777777888899999998887555444333
No 106
>KOG2051|consensus
Probab=36.39 E-value=9.5 Score=40.05 Aligned_cols=9 Identities=11% Similarity=0.744 Sum_probs=3.6
Q ss_pred HHHHhheee
Q psy9274 154 MSLDRFLAV 162 (318)
Q Consensus 154 isidRy~aI 162 (318)
.++.||+-|
T Consensus 806 ~~fq~Y~~i 814 (1128)
T KOG2051|consen 806 VAFQRYILI 814 (1128)
T ss_pred HHHHHHhhc
Confidence 333444433
No 107
>PF14283 DUF4366: Domain of unknown function (DUF4366)
Probab=36.11 E-value=12 Score=32.37 Aligned_cols=15 Identities=40% Similarity=0.547 Sum_probs=8.8
Q ss_pred HHHHHHHH-HHHHHHH
Q psy9274 98 NLLIINLA-IADLLFI 112 (318)
Q Consensus 98 ~~ll~~La-iaDll~~ 112 (318)
|++++|.- =+||+.+
T Consensus 98 nV~fLn~VdE~DLl~l 113 (218)
T PF14283_consen 98 NVYFLNQVDEADLLAL 113 (218)
T ss_pred eEEEeccCCHHHHHHH
Confidence 56666654 3666654
No 108
>PF15330 SIT: SHP2-interacting transmembrane adaptor protein, SIT
Probab=35.65 E-value=51 Score=25.05 Aligned_cols=31 Identities=6% Similarity=0.161 Sum_probs=21.1
Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhccCCCC
Q psy9274 64 IVPIFFGIIIILGFFGNALVVVVVAFNQQMR 94 (318)
Q Consensus 64 ~~~~~~~ii~~~gi~gN~lvi~~i~~~~~l~ 94 (318)
++..+++++.++.++.|+++.....|+++..
T Consensus 2 ~Ll~il~llLll~l~asl~~wr~~~rq~k~~ 32 (107)
T PF15330_consen 2 LLLGILALLLLLSLAASLLAWRMKQRQKKAG 32 (107)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhhhcccc
Confidence 4456667777788888888777776655543
No 109
>PTZ00046 rifin; Provisional
Probab=35.41 E-value=54 Score=30.51 Aligned_cols=30 Identities=10% Similarity=0.176 Sum_probs=21.2
Q ss_pred HHHHHHHHHHHHHHHHHHhhhhhhccCCCC
Q psy9274 65 VPIFFGIIIILGFFGNALVVVVVAFNQQMR 94 (318)
Q Consensus 65 ~~~~~~ii~~~gi~gN~lvi~~i~~~~~l~ 94 (318)
-+++..+++++-++.-.++|+.++|+||.+
T Consensus 315 taIiaSiiAIvVIVLIMvIIYLILRYRRKK 344 (358)
T PTZ00046 315 TAIIASIVAIVVIVLIMVIIYLILRYRRKK 344 (358)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhcc
Confidence 355666777777777777788888777644
No 110
>TIGR02847 CyoD cytochrome o ubiquinol oxidase subunit IV. Cytochrome o terminal oxidase complex is the component of the aerobic respiratory chain which reacts with oxygen, reducing it to water with the concomitant transport of 4 protons across the membrane. Also known as the cytochrome bo complex, cytochrome o ubiquinol oxidase contains four subunits, two heme b cofactors and a copper atom which is believed to be the oxygen active site. This complex is structurally related to the cytochrome caa3 oxidases which utilize cytochrome c as the reductant and contain heme a cofactors, as well as the intermediate form aa3 oxidases which also react directly with quinones as the reductant.
Probab=34.36 E-value=1.9e+02 Score=21.44 Aligned_cols=57 Identities=16% Similarity=0.290 Sum_probs=23.9
Q ss_pred chHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcCCcchhHHHHHHHHHHHHHHHHHHHH
Q psy9274 95 STTNLLIINLAIADLLFIVCCVPFTATDYVTTCWPFGDVWCKMYLIVVTAYASVYTLV 152 (318)
Q Consensus 95 ~~~~~ll~~LaiaDll~~l~~~p~~~~~~~~~~~~~~~~~C~~~~~~~~~~~s~~~l~ 152 (318)
......++.+|+.-+++-+.++ +.+-..-.+.|........+++.......|+|.+.
T Consensus 33 ~~~~~~i~~~A~iQi~vqL~~F-lHl~~~~~~~~n~~~l~Ft~~i~~iiv~GSiWIm~ 89 (96)
T TIGR02847 33 GLTLVIIIVLAVVQILVHLVFF-LHLNTSSEQRWNLISLLFTILIIFILIGGSIWIMH 89 (96)
T ss_pred hHHHHHHHHHHHHHHHHHHHHH-hhccCccccchHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 3445566666665555443211 11111112234333333333444445556666543
No 111
>PF13853 7tm_4: Olfactory receptor
Probab=34.12 E-value=25 Score=28.02 Aligned_cols=54 Identities=17% Similarity=0.224 Sum_probs=34.4
Q ss_pred HHHHHHHHHHHHHHHhhhHhhcCceeeeeCCCcceeeEeecCCCCCCCCCccccccccc
Q psy9274 178 LKAILITWIVIVLTAVPVFNMHGEVSYTYASELHTACVFLDLDPVNRPDGYNKTIFQSE 236 (318)
Q Consensus 178 ~~~i~~~W~~s~~~~~p~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~if~i~ 236 (318)
..++++.|+.+++.+++...... .++..+..++ ++++ .+..++++.++.++++.
T Consensus 4 ~~l~~~~~~~~~~~~l~~~~~~~--~l~~~nii~~--f~c~-~~ll~LaC~dt~~~~~~ 57 (144)
T PF13853_consen 4 LLLAAGSWLSGLLNSLPHTLLTL--SLCFCNIIHH--FCCD-PPLLKLACSDTSINEIV 57 (144)
T ss_pred ehhhHHHHHHHHHHHHHHHHHHe--eCCCCCCCcc--eeeC-HHHhcccCCchhhhhee
Confidence 35677889988888888776544 4444455555 3345 66777777776555543
No 112
>PF09184 PPP4R2: PPP4R2; InterPro: IPR015267 PPP4R2 (protein phosphatase 4 core regulatory subunit R2) is the regulatory subunit of the histone H2A phosphatase complex. It has been shown to confer resistance to the anticancer drug cisplatin in yeast [], and may confer resistance in higher eukaryotes.
Probab=34.09 E-value=10 Score=34.27 Aligned_cols=14 Identities=14% Similarity=0.453 Sum_probs=5.9
Q ss_pred HHHhheee-eccCcc
Q psy9274 155 SLDRFLAV-VHPIAS 168 (318)
Q Consensus 155 sidRy~aI-~~P~~y 168 (318)
++.|..=+ ..|.+|
T Consensus 99 TiqRlcEl~~~P~~~ 113 (288)
T PF09184_consen 99 TIQRLCELLLDPRKH 113 (288)
T ss_pred hHHHHHHHHhChhhc
Confidence 44454332 345543
No 113
>KOG1289|consensus
Probab=33.76 E-value=4.9e+02 Score=25.90 Aligned_cols=20 Identities=20% Similarity=0.298 Sum_probs=12.8
Q ss_pred CchHHHHHHHHHHHHHHHHh
Q psy9274 94 RSTTNLLIINLAIADLLFIV 113 (318)
Q Consensus 94 ~~~~~~ll~~LaiaDll~~l 113 (318)
+.|.|-.+++..++=++.++
T Consensus 389 ~~P~nAv~l~~i~s~llgll 408 (550)
T KOG1289|consen 389 KVPLNAVLLSCIISILLGLL 408 (550)
T ss_pred CCcHHHHHHHHHHHHHHHHH
Confidence 35777777776666555544
No 114
>KOG2571|consensus
Probab=33.35 E-value=6.1e+02 Score=26.85 Aligned_cols=11 Identities=27% Similarity=0.380 Sum_probs=5.9
Q ss_pred ceeeEeecCCC
Q psy9274 211 HTACVFLDLDP 221 (318)
Q Consensus 211 ~~~C~~~~~~~ 221 (318)
...|.+++.++
T Consensus 761 yA~~n~~d~SW 771 (862)
T KOG2571|consen 761 YAFCNFHDVSW 771 (862)
T ss_pred eeeEEeccccc
Confidence 34566666554
No 115
>PF05297 Herpes_LMP1: Herpesvirus latent membrane protein 1 (LMP1); InterPro: IPR007961 This family consists of several latent membrane protein 1 or LMP1s mostly from Epstein-Barr virus (strain GD1) (HHV-4) (Human herpesvirus 4). LMP1 of HHV-4 is a 62-65 kDa plasma membrane protein possessing six membrane spanning regions, a short cytoplasmic N terminus and a long cytoplasmic carboxy tail of 200 amino acids. HHV-4 virus latent membrane protein 1 (LMP1) is essential for HHV-4 mediated transformation and has been associated with several cases of malignancies. HHV-4-like viruses in Macaca fascicularis (Cynomolgus monkeys) have been associated with high lymphoma rates in immunosuppressed monkeys [].; GO: 0019087 transformation of host cell by virus, 0016021 integral to membrane; PDB: 1CZY_E 1ZMS_B.
Probab=33.22 E-value=14 Score=33.03 Aligned_cols=23 Identities=4% Similarity=0.300 Sum_probs=0.0
Q ss_pred HHHHHHHHhhhhhhccCCCCchH
Q psy9274 75 LGFFGNALVVVVVAFNQQMRSTT 97 (318)
Q Consensus 75 ~gi~gN~lvi~~i~~~~~l~~~~ 97 (318)
++++.=+++|++++.+|++--|.
T Consensus 58 fAvvliiIIiIImlF~RrLLCPL 80 (381)
T PF05297_consen 58 FAVVLIIIIIIIMLFKRRLLCPL 80 (381)
T ss_dssp -----------------------
T ss_pred HHHHHHHHHHHHHHHHHhhcCcc
Confidence 33333344444444555555443
No 116
>PF02118 Srg: Srg family chemoreceptor; InterPro: IPR000609 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class g (Srg) from the Srg superfamily [, ]. Srg receptors contain seven hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures. ; GO: 0004888 transmembrane signaling receptor activity, 0007606 sensory perception of chemical stimulus, 0016020 membrane
Probab=33.04 E-value=3.3e+02 Score=23.72 Aligned_cols=30 Identities=30% Similarity=0.333 Sum_probs=20.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhhccC
Q psy9274 62 SIIVPIFFGIIIILGFFGNALVVVVVAFNQ 91 (318)
Q Consensus 62 ~~~~~~~~~ii~~~gi~gN~lvi~~i~~~~ 91 (318)
..+..+.+.+..+++++.|..+..-+.+.+
T Consensus 170 ~~~~~~~~i~~~ii~i~~~~~~~~~l~~~~ 199 (275)
T PF02118_consen 170 SIFSLIYFIIIIIITIITNIITYRRLRKLS 199 (275)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Confidence 345566667777788888887776665433
No 117
>PF02116 STE2: Fungal pheromone mating factor STE2 GPCR; InterPro: IPR000366 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). Little is known about the structure and function of the mating factor receptors, STE2 and STE3. It is believed, however, that they are integral membrane proteins that may be involved in the response to mating factors on the cell membrane [, , ]. The amino acid sequences of both receptors contain high proportions of hydrophobic residues grouped into 7 domains, in a manner reminiscent of the rhodopsins and other receptors believed to interact with G-proteins. However, while a similar 3D framework has been proposed to account for this, there is no significant sequence similarity either between STE2 and STE3, or between these and the rhodopsin-type family: the receptors thus bear their own unique '7TM' signatures.; GO: 0004932 mating-type factor pheromone receptor activity, 0016020 membrane; PDB: 2K9P_A.
Probab=32.59 E-value=2.8e+02 Score=25.06 Aligned_cols=15 Identities=7% Similarity=0.290 Sum_probs=4.3
Q ss_pred HhhhhhhccCCCCch
Q psy9274 82 LVVVVVAFNQQMRST 96 (318)
Q Consensus 82 lvi~~i~~~~~l~~~ 96 (318)
++++.+.+++|+|+|
T Consensus 51 ivL~llt~~~Krktp 65 (284)
T PF02116_consen 51 IVLWLLTKSEKRKTP 65 (284)
T ss_dssp HHHHHHT--S-----
T ss_pred HHHHHhcCcccccCh
Confidence 445555565555555
No 118
>PHA02638 CC chemokine receptor-like protein; Provisional
Probab=32.47 E-value=4.5e+02 Score=25.03 Aligned_cols=23 Identities=13% Similarity=0.255 Sum_probs=10.1
Q ss_pred hhhHHHHHHHHHHHHHHHHhhhH
Q psy9274 174 ERNALKAILITWIVIVLTAVPVF 196 (318)
Q Consensus 174 ~~~~~~~i~~~W~~s~~~~~p~~ 196 (318)
++|+..+++.+-+.-+++-+|..
T Consensus 308 k~k~~rli~~ivi~f~lcW~Py~ 330 (417)
T PHA02638 308 KTKSIIIVSIIIICSLICWIPLN 330 (417)
T ss_pred cchhhhhHHHHHHHHHHHHhhHH
Confidence 33444444444444444445544
No 119
>PF08114 PMP1_2: ATPase proteolipid family; InterPro: IPR012589 This family consists of small proteolipids associated with the plasma membrane H+ ATPase. Two proteolipids (PMP1 and PMP2) are associated with the ATPase and both genes are similarly expressed in the wild-type strain of yeast. No modification of the level of transcription of one PMP gene is detected in a strain deleted of the other. Though both proteolipids show similarity with other small proteolipids associated with other cation -transporting ATPases, their functions remain unclear [].
Probab=32.44 E-value=12 Score=22.97 Aligned_cols=23 Identities=17% Similarity=0.333 Sum_probs=15.3
Q ss_pred HHHHHHHHHHHHHhhhhhhccCC
Q psy9274 70 GIIIILGFFGNALVVVVVAFNQQ 92 (318)
Q Consensus 70 ~ii~~~gi~gN~lvi~~i~~~~~ 92 (318)
++++++|++|-.++..++.|+..
T Consensus 12 lVF~lVglv~i~iva~~iYRKw~ 34 (43)
T PF08114_consen 12 LVFCLVGLVGIGIVALFIYRKWQ 34 (43)
T ss_pred eehHHHHHHHHHHHHHHHHHHHH
Confidence 45566777777777777776544
No 120
>COG4606 CeuB ABC-type enterochelin transport system, permease component [Inorganic ion transport and metabolism]
Probab=32.42 E-value=3.8e+02 Score=24.22 Aligned_cols=91 Identities=12% Similarity=0.208 Sum_probs=44.7
Q ss_pred HHHHHHHHHHHHhhhhhhccCCCCchHHHHHHHHHHHHHHHHhhhHHHHHHHHhh--cCCcchhHHHHH--HHHHHHHHH
Q psy9274 71 IIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATDYVT--TCWPFGDVWCKM--YLIVVTAYA 146 (318)
Q Consensus 71 ii~~~gi~gN~lvi~~i~~~~~l~~~~~~ll~~LaiaDll~~l~~~p~~~~~~~~--~~~~~~~~~C~~--~~~~~~~~~ 146 (318)
+.+++++.|+.+-+. +.|+-++|.+--+=++.+-....+.++..+..+-...+. +.|.-++..-.. ---.+....
T Consensus 112 vA~v~al~Gt~~F~~-~l~ri~~k~~i~VPLiGIm~Ggvi~sitTFiAy~~dllQ~l~~W~~G~Fs~v~~GrYElL~i~~ 190 (321)
T COG4606 112 VAFVFALAGTLLFMM-ILRRIKLKDVLFVPLIGIMFGGVISSITTFIAYRFDLLQSLGAWLQGDFSGVLRGRYELLWISI 190 (321)
T ss_pred HHHHHHHHHHHHHHH-HHHhccccceEeehhHHHHHHhHHHHHHHHHHHHHHHHHHhhHHhcCchHhhhccceeehHHHH
Confidence 344566667654444 444555676666666777666666665444433333332 255544443322 000011112
Q ss_pred HHHHHHHHHHHhheee
Q psy9274 147 SVYTLVLMSLDRFLAV 162 (318)
Q Consensus 147 s~~~l~~isidRy~aI 162 (318)
-+..++....||+...
T Consensus 191 ~l~i~ay~~AdrfTi~ 206 (321)
T COG4606 191 PLAILAYLFADRFTIA 206 (321)
T ss_pred HHHHHHHHHhhheeee
Confidence 2234445566887655
No 121
>PHA03164 hypothetical protein; Provisional
Probab=32.31 E-value=45 Score=23.46 Aligned_cols=20 Identities=45% Similarity=0.675 Sum_probs=10.5
Q ss_pred CchHHHHHHHHHHHHHHHHh
Q psy9274 94 RSTTNLLIINLAIADLLFIV 113 (318)
Q Consensus 94 ~~~~~~ll~~LaiaDll~~l 113 (318)
|+-+.+.+..||+|=+++.+
T Consensus 56 ktftFlvLtgLaIamILfii 75 (88)
T PHA03164 56 KTFTFLVLTGLAIAMILFII 75 (88)
T ss_pred heeehHHHHHHHHHHHHHHH
Confidence 34445555566665555443
No 122
>KOG2290|consensus
Probab=31.91 E-value=1.3e+02 Score=29.03 Aligned_cols=16 Identities=19% Similarity=0.407 Sum_probs=10.3
Q ss_pred HHHHHHHHHHhhhhhh
Q psy9274 73 IILGFFGNALVVVVVA 88 (318)
Q Consensus 73 ~~~gi~gN~lvi~~i~ 88 (318)
++-|+-||+...+.+-
T Consensus 495 ~~SGitGNLASAIFlp 510 (652)
T KOG2290|consen 495 FLSGITGNLASAIFLP 510 (652)
T ss_pred ecccccccchheeeec
Confidence 4568889976554443
No 123
>TIGR01297 CDF cation diffusion facilitator family transporter. This model describes a broadly distributed family of transporters, a number of which have been shown to transport divalent cations of cobalt, cadmium and/or zinc. The family has six predicted transmembrane domains. Members of the family are variable in length because of variably sized inserts, often containing low-complexity sequence.
Probab=31.89 E-value=3.5e+02 Score=23.60 Aligned_cols=10 Identities=10% Similarity=0.328 Sum_probs=3.8
Q ss_pred hhhHHHHHHH
Q psy9274 58 ERMVSIIVPI 67 (318)
Q Consensus 58 ~~~~~~~~~~ 67 (318)
+.....+.++
T Consensus 58 E~l~~l~~~~ 67 (268)
T TIGR01297 58 EILAALLNGL 67 (268)
T ss_pred HHHHHHHHHH
Confidence 3333334443
No 124
>PF11446 DUF2897: Protein of unknown function (DUF2897); InterPro: IPR021550 This is a bacterial family of uncharacterised proteins.
Probab=31.67 E-value=51 Score=21.79 Aligned_cols=20 Identities=20% Similarity=0.213 Sum_probs=12.2
Q ss_pred HHHHHHHHHHHHHHhhhhhh
Q psy9274 69 FGIIIILGFFGNALVVVVVA 88 (318)
Q Consensus 69 ~~ii~~~gi~gN~lvi~~i~ 88 (318)
.+++.+--++||++++-.-.
T Consensus 8 IIviVlgvIigNia~LK~sA 27 (55)
T PF11446_consen 8 IIVIVLGVIIGNIAALKYSA 27 (55)
T ss_pred HHHHHHHHHHhHHHHHHHhc
Confidence 34444455679988775544
No 125
>PF02009 Rifin_STEVOR: Rifin/stevor family; InterPro: IPR002858 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see []. Several multicopy gene families have been described in Plasmodium falciparum, including the stevor family of subtelomeric open reading frames and the rif interspersed repetitive elements. Both families contain three predicted transmembrane segments. It has been proposed that stevor and rif are members of a larger superfamily that code for variant surface antigens [].
Probab=31.53 E-value=35 Score=31.04 Aligned_cols=26 Identities=19% Similarity=0.207 Sum_probs=16.0
Q ss_pred HHHHHHHHHHHHHHHHhhhhhhccCC
Q psy9274 67 IFFGIIIILGFFGNALVVVVVAFNQQ 92 (318)
Q Consensus 67 ~~~~ii~~~gi~gN~lvi~~i~~~~~ 92 (318)
++..++.++-|+.=.++|+.++|+||
T Consensus 258 I~aSiiaIliIVLIMvIIYLILRYRR 283 (299)
T PF02009_consen 258 IIASIIAILIIVLIMVIIYLILRYRR 283 (299)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45555555566655666777777665
No 126
>COG2194 Predicted membrane-associated, metal-dependent hydrolase [General function prediction only]
Probab=31.47 E-value=4.7e+02 Score=26.15 Aligned_cols=10 Identities=30% Similarity=0.398 Sum_probs=3.6
Q ss_pred HHHHHHHHHh
Q psy9274 184 TWIVIVLTAV 193 (318)
Q Consensus 184 ~W~~s~~~~~ 193 (318)
+.+++++.+.
T Consensus 159 ~~~~~~~~~~ 168 (555)
T COG2194 159 SLLVILAFAA 168 (555)
T ss_pred HHHHHHHHHH
Confidence 3333333333
No 127
>PF05086 Dicty_REP: Dictyostelium (Slime Mold) REP protein; InterPro: IPR007778 This family consists of REP proteins from a number of Dictyostelium species (Slime molds). REP protein is probably involved in transcription regulation and control of DNA replication, specifically the amplification of plasmid at low copy numbers. The formation of homomultimers may be required for their regulatory activity [].
Probab=30.94 E-value=20 Score=36.46 Aligned_cols=10 Identities=30% Similarity=0.650 Sum_probs=6.2
Q ss_pred HHHHhheeee
Q psy9274 154 MSLDRFLAVV 163 (318)
Q Consensus 154 isidRy~aI~ 163 (318)
-.+.||+.|.
T Consensus 681 ~nf~Rf~~Is 690 (911)
T PF05086_consen 681 SNFKRFLNIS 690 (911)
T ss_pred hhhheEEEEE
Confidence 3566776664
No 128
>PLN02159 Fe(2+) transport protein
Probab=30.49 E-value=4.4e+02 Score=24.41 Aligned_cols=73 Identities=19% Similarity=0.248 Sum_probs=36.7
Q ss_pred CCCCCCCccccCCC-ccccCCCCCchHHHhhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhccCCCCchHHHHHHHHHHHH
Q psy9274 30 YNRSNIPAICRNFS-HEYINDNDVDAEAMERMVSIIVPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIAD 108 (318)
Q Consensus 30 ~~~s~~~~~~~n~s-~~~~~~~~~~~~~~~~~~~~~~~~~~~ii~~~gi~gN~lvi~~i~~~~~l~~~~~~ll~~LaiaD 108 (318)
..+++.|..|.+.. ..|.+... ....++. ...++.+.+++|-.+=++ ..+.+++|.+.+.+.+..+++.
T Consensus 20 ~~~~~~~~~c~~~~~~~c~~~~~------~~~lki~---al~vil~~sllg~~lP~~-~~~~~~~~~~~~~~~~~~~Fag 89 (337)
T PLN02159 20 PATSTAPEECDSESANPCINKAK------ALPLKII---AIVAILTTSMIGVTAPLF-SRYVSFLRPDGNIFTIVKCFAS 89 (337)
T ss_pred ccccCCCccccCCCCcCccCchh------ccchHHH---HHHHHHHHHHHHHHHHHH-HHHccccCCchhHHHHHHHHhh
Confidence 34567788887742 22222111 1122333 333444555555555554 3344455666677777777766
Q ss_pred HHHH
Q psy9274 109 LLFI 112 (318)
Q Consensus 109 ll~~ 112 (318)
-++.
T Consensus 90 GV~L 93 (337)
T PLN02159 90 GIIL 93 (337)
T ss_pred hHhH
Confidence 4443
No 129
>TIGR00439 ftsX putative protein insertion permease FtsX. FtsX is an integral membrane protein encoded in the same operon as signal recognition particle docking protein FtsY and FtsE. It belongs to a family of predicted permeases and may play a role in the insertion of proteins required for potassium transport, cell division, and other activities. FtsE is a hydrophilic nucleotide-binding protein that associates with the inner membrane by means of association with FtsX.
Probab=30.39 E-value=4.2e+02 Score=24.12 Aligned_cols=21 Identities=10% Similarity=0.336 Sum_probs=13.3
Q ss_pred HHHHHHHHHHHhhhhhhccCC
Q psy9274 72 IIILGFFGNALVVVVVAFNQQ 92 (318)
Q Consensus 72 i~~~gi~gN~lvi~~i~~~~~ 92 (318)
+.++.+++|.+-+.+.-|++.
T Consensus 191 ~~~~~~i~Ntirl~i~~Rr~E 211 (309)
T TIGR00439 191 MAVFLVIGNSIRLQILSRRES 211 (309)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 334445778777777776654
No 130
>PF09726 Macoilin: Transmembrane protein; InterPro: IPR019130 This entry represents the multi-pass transmembrane protein Macoilin, which is highly conserved in eukaryotes. ; GO: 0016021 integral to membrane
Probab=30.33 E-value=2e+02 Score=29.69 Aligned_cols=94 Identities=9% Similarity=0.169 Sum_probs=0.0
Q ss_pred hHHHHHHHHHHHHHHHHHHHHH---------HhhhhhhccCCCCc-hHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcCCc
Q psy9274 60 MVSIIVPIFFGIIIILGFFGNA---------LVVVVVAFNQQMRS-TTNLLIINLAIADLLFIVCCVPFTATDYVTTCWP 129 (318)
Q Consensus 60 ~~~~~~~~~~~ii~~~gi~gN~---------lvi~~i~~~~~l~~-~~~~ll~~LaiaDll~~l~~~p~~~~~~~~~~~~ 129 (318)
..++-+.+++.++.++-++..+ |.+-.++-.-+.+. .+.+|++.+|+.-=++|++++|...+.+....+.
T Consensus 28 ~~~~~~~~~w~~~~~~d~~~~~r~e~~~p~wl~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~p~~~~~~~~~~~v 107 (697)
T PF09726_consen 28 FLYVKFLLVWALVLLADFMLEFRFEYLWPFWLLLRSVYDSFKYQGLAFSVFFVCIAFTSDLICLFFIPVHWLFFAASTYV 107 (697)
T ss_pred HHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHH
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHH
Q psy9274 130 FGDVWCKMYLIVVTAYASVYTLVL 153 (318)
Q Consensus 130 ~~~~~C~~~~~~~~~~~s~~~l~~ 153 (318)
.-...|..--.+...++++|++.+
T Consensus 108 ~~~~~~~~~~~~~~~~~~~~~~~~ 131 (697)
T PF09726_consen 108 WVQYVWHTDRGICLPTVSLWILFV 131 (697)
T ss_pred HHHHhhhccCCccHHHHHHHHHHH
No 131
>PF09586 YfhO: Bacterial membrane protein YfhO; InterPro: IPR018580 The yfhO gene is transcribed in Difco sporulation medium and the transcription is affected by the YvrGHb two-component system []. Some members of this family have been annotated as putative ABC transporter permease proteins.
Probab=29.85 E-value=6.6e+02 Score=26.22 Aligned_cols=57 Identities=12% Similarity=0.062 Sum_probs=29.1
Q ss_pred HHHHHHHHHHHHHHHHHhheeeeccCccccccchhhHHHHHHHHHHHHHHHHhhhHhhcC
Q psy9274 141 VVTAYASVYTLVLMSLDRFLAVVHPIASMSVRTERNALKAILITWIVIVLTAVPVFNMHG 200 (318)
Q Consensus 141 ~~~~~~s~~~l~~isidRy~aI~~P~~y~~~~t~~~~~~~i~~~W~~s~~~~~p~~~~~~ 200 (318)
+.+.++.+..+++..+-|++ + +..+. .+..-.+.....++...+|.+..+|.+....
T Consensus 183 fYf~ym~~if~~iY~~~r~~-~-~~~k~-~~~~~~~~~~~~ilg~~lsa~~llP~~~~~l 239 (843)
T PF09586_consen 183 FYFAYMICIFLVIYFLIRYF-F-KNWKN-FFKKILRFIGSSILGVGLSAFLLLPTILSLL 239 (843)
T ss_pred HHHHHHHHHHHHHHHHHHHH-H-HHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33445556666777788887 4 22211 1122233334444445555666667665543
No 132
>KOG3540|consensus
Probab=29.10 E-value=17 Score=34.80 Aligned_cols=12 Identities=17% Similarity=0.332 Sum_probs=7.2
Q ss_pred cccccccccccC
Q psy9274 231 TIFQSENDYCKG 242 (318)
Q Consensus 231 ~if~i~~~~~~~ 242 (318)
--|.-+-+.|..
T Consensus 179 D~F~GvEfVCCP 190 (615)
T KOG3540|consen 179 DMFRGVEFVCCP 190 (615)
T ss_pred ccccCceEEeCC
Confidence 345666667765
No 133
>PF11295 DUF3096: Protein of unknown function (DUF3096); InterPro: IPR021446 This entry is represented by the archaeal Thermoproteus tenax spherical virus 1, Orf18. The characteristics of the protein distribution suggest prophage matches and lateral genetic transfer in addition to the phage matches.
Probab=29.08 E-value=1e+02 Score=18.73 Aligned_cols=23 Identities=43% Similarity=0.857 Sum_probs=17.4
Q ss_pred HhhhHHHHHHHHHHHHHHHHHHH
Q psy9274 57 MERMVSIIVPIFFGIIIILGFFG 79 (318)
Q Consensus 57 ~~~~~~~~~~~~~~ii~~~gi~g 79 (318)
++...++++++.+.++.++|+.+
T Consensus 14 ~PrllnyiVaiyLI~~G~lgL~~ 36 (39)
T PF11295_consen 14 MPRLLNYIVAIYLIVIGLLGLFG 36 (39)
T ss_pred HHHHHHHHHHHHHHHHHHHHhhh
Confidence 45677788888888888887764
No 134
>COG1480 Predicted membrane-associated HD superfamily hydrolase [General function prediction only]
Probab=28.63 E-value=6.6e+02 Score=25.78 Aligned_cols=19 Identities=37% Similarity=0.522 Sum_probs=9.3
Q ss_pred CCchHHHHHHHHHHHHHHH
Q psy9274 93 MRSTTNLLIINLAIADLLF 111 (318)
Q Consensus 93 l~~~~~~ll~~LaiaDll~ 111 (318)
+|+...++++.+++.-++.
T Consensus 304 ~~~~~l~~~~~l~i~~l~l 322 (700)
T COG1480 304 LRNSLLLLYLSLAILTLSL 322 (700)
T ss_pred hhhhHHHHHHHHHHHHHHH
Confidence 3344555555555544433
No 135
>PF03547 Mem_trans: Membrane transport protein; InterPro: IPR004776 This entry represents a mostly uncharacterised family of membrane transport proteins found in eukaryotes, bacteria and archaea. Most characterised members of this family are the PIN components of auxin efflux systems from plants. These carriers are saturable, auxin-specific, and localized to the basal ends of auxin transport-competent cells [, ]. Plants typically posses several of these proteins, each displaying a unique tissue-specific expression pattern. They are expressed in almost all plant tissues including vascular tissues and roots, and influence many processes including the establishment of embryonic polarity, plant growth, apical hook formation in seedlings and the photo- and gravitrophic responses. These plant proteins are typically 600-700 amino acyl residues long and exhibit 8-12 transmembrane segments.; GO: 0055085 transmembrane transport, 0016021 integral to membrane
Probab=28.09 E-value=3.1e+02 Score=25.43 Aligned_cols=12 Identities=8% Similarity=0.135 Sum_probs=5.6
Q ss_pred hHHHHHHHHhhc
Q psy9274 115 CVPFTATDYVTT 126 (318)
Q Consensus 115 ~~p~~~~~~~~~ 126 (318)
.+|..+...+..
T Consensus 41 ~lP~liF~~i~~ 52 (385)
T PF03547_consen 41 FLPALIFSSIAN 52 (385)
T ss_pred HHHHHHHHHHHh
Confidence 455555444433
No 136
>KOG3214|consensus
Probab=27.61 E-value=25 Score=26.14 Aligned_cols=6 Identities=17% Similarity=0.595 Sum_probs=2.5
Q ss_pred eeeEee
Q psy9274 212 TACVFL 217 (318)
Q Consensus 212 ~~C~~~ 217 (318)
..|.++
T Consensus 48 ~sC~iC 53 (109)
T KOG3214|consen 48 ASCRIC 53 (109)
T ss_pred eeeeeh
Confidence 344444
No 137
>PF14494 DUF4436: Domain of unknown function (DUF4436)
Probab=27.49 E-value=2.6e+02 Score=24.91 Aligned_cols=21 Identities=24% Similarity=0.229 Sum_probs=10.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHh
Q psy9274 138 YLIVVTAYASVYTLVLMSLDR 158 (318)
Q Consensus 138 ~~~~~~~~~s~~~l~~isidR 158 (318)
++......+..+.+.+++.-|
T Consensus 235 vlW~~i~l~~~m~~~v~~w~R 255 (256)
T PF14494_consen 235 VLWVLIALVASMVLYVVAWWR 255 (256)
T ss_pred hHHHHHHHHHHHHHHHHHhhc
Confidence 454444444445555555544
No 138
>PF04893 Yip1: Yip1 domain; InterPro: IPR006977 This entry contains proteins belonging to the Yip1 family and represents the Yip1 domain. The Yip1 integral membrane domain contains four transmembrane alpha helices. The domain is characterised by the motifs DLYGP and GY. The Yip1 protein is a golgi protein involved in vesicular transport that interacts with GTPases [].; GO: 0016020 membrane
Probab=27.34 E-value=3.1e+02 Score=21.61 Aligned_cols=16 Identities=13% Similarity=0.401 Sum_probs=8.9
Q ss_pred cchhhHHHHHHHHHHH
Q psy9274 172 RTERNALKAILITWIV 187 (318)
Q Consensus 172 ~t~~~~~~~i~~~W~~ 187 (318)
.+++|+...+++.|++
T Consensus 154 ~~~~ka~~~~~~~~ll 169 (172)
T PF04893_consen 154 LSRGKAFLIVIIPFLL 169 (172)
T ss_pred CChhHHHHHHHHHHHH
Confidence 3555666665555443
No 139
>PRK13591 ubiA prenyltransferase; Provisional
Probab=26.97 E-value=4.9e+02 Score=23.81 Aligned_cols=69 Identities=16% Similarity=0.087 Sum_probs=34.5
Q ss_pred HHHHHHHHHHHHhhhHHHHHHHHhhcCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhhe-----eeeccCccccccchh
Q psy9274 101 IINLAIADLLFIVCCVPFTATDYVTTCWPFGDVWCKMYLIVVTAYASVYTLVLMSLDRFL-----AVVHPIASMSVRTER 175 (318)
Q Consensus 101 l~~LaiaDll~~l~~~p~~~~~~~~~~~~~~~~~C~~~~~~~~~~~s~~~l~~isidRy~-----aI~~P~~y~~~~t~~ 175 (318)
+..+.-+++.+++...............+.+-..| ..+........++|||. +|-+|.++.. +++
T Consensus 30 ~~~~~~ss~~~a~~a~~~~~~a~lL~g~~~~~~~~--------~~~~L~~~s~~~iNd~~D~eiD~IN~P~r~~~--s~~ 99 (307)
T PRK13591 30 TFELLNSSTLVAFSGALRIHIAFLLLGLQSSILTC--------IAGGLIIYSVYTLDRALDSEEDAVNRSELIGS--NKK 99 (307)
T ss_pred HHHHHhhhHHHHHHHHHHHHHHHHHhCCCcchHHH--------HHHHHHHHHHHHHhhhccchhhhccCcccccc--CHH
Confidence 34455566666554444333333333333232223 23344455666788884 4568877654 444
Q ss_pred hHHH
Q psy9274 176 NALK 179 (318)
Q Consensus 176 ~~~~ 179 (318)
.+..
T Consensus 100 ~a~~ 103 (307)
T PRK13591 100 IGLL 103 (307)
T ss_pred HHHH
Confidence 4443
No 140
>PRK11228 fecC iron-dicitrate transporter permease subunit; Provisional
Probab=26.94 E-value=5e+02 Score=23.85 Aligned_cols=23 Identities=13% Similarity=0.366 Sum_probs=16.8
Q ss_pred CchHHHHHHHHHHHHHHHHhhhH
Q psy9274 94 RSTTNLLIINLAIADLLFIVCCV 116 (318)
Q Consensus 94 ~~~~~~ll~~LaiaDll~~l~~~ 116 (318)
.++..+.+..++++=++.++..+
T Consensus 138 ~~~~~liL~Gi~is~l~~al~~~ 160 (323)
T PRK11228 138 HDRNRLILAGIALSALCMALTRI 160 (323)
T ss_pred CChhHHHHHHHHHHHHHHHHHHH
Confidence 46778888888888777666444
No 141
>KOG4434|consensus
Probab=26.72 E-value=35 Score=31.76 Aligned_cols=25 Identities=8% Similarity=-0.097 Sum_probs=13.2
Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhh
Q psy9274 64 IVPIFFGIIIILGFFGNALVVVVVA 88 (318)
Q Consensus 64 ~~~~~~~ii~~~gi~gN~lvi~~i~ 88 (318)
+--+..++--.+..+.|++|...-.
T Consensus 97 ikkcp~lvqEMVnc~~qLi~May~r 121 (520)
T KOG4434|consen 97 IKKCPRLVQEMVNCSQQLIVMAYSR 121 (520)
T ss_pred HHHhHHHHHHHHHHHHHHHHHHHhh
Confidence 3344445555566666665554433
No 142
>KOG1485|consensus
Probab=26.70 E-value=3.8e+02 Score=25.59 Aligned_cols=11 Identities=18% Similarity=0.208 Sum_probs=7.1
Q ss_pred HHHHHHHHhhh
Q psy9274 105 AIADLLFIVCC 115 (318)
Q Consensus 105 aiaDll~~l~~ 115 (318)
++.|++..++.
T Consensus 150 Sl~Dl~s~fvl 160 (412)
T KOG1485|consen 150 SLSDLVSGFVL 160 (412)
T ss_pred HHHHHHHHHHH
Confidence 46777776633
No 143
>PRK10441 iron-enterobactin transporter membrane protein; Provisional
Probab=26.68 E-value=5.2e+02 Score=23.94 Aligned_cols=24 Identities=21% Similarity=0.106 Sum_probs=17.8
Q ss_pred CchHHHHHHHHHHHHHHHHhhhHH
Q psy9274 94 RSTTNLLIINLAIADLLFIVCCVP 117 (318)
Q Consensus 94 ~~~~~~ll~~LaiaDll~~l~~~p 117 (318)
.++..+.+..++++-++.++..+.
T Consensus 146 ~~~~~LiL~Gi~is~~~~a~~~~l 169 (335)
T PRK10441 146 LSPVRLTLAGVALAAVLEGLTSGI 169 (335)
T ss_pred CCchHHHHHHHHHHHHHHHHHHHH
Confidence 467789999999888777664443
No 144
>PRK12911 bifunctional preprotein translocase subunit SecD/SecF; Reviewed
Probab=25.76 E-value=9.8e+02 Score=26.83 Aligned_cols=12 Identities=17% Similarity=0.036 Sum_probs=5.0
Q ss_pred ccchhhHHHHHH
Q psy9274 171 VRTERNALKAIL 182 (318)
Q Consensus 171 ~~t~~~~~~~i~ 182 (318)
++.+|+...++.
T Consensus 1083 Fmk~Rk~~~~iS 1094 (1403)
T PRK12911 1083 FLKECKRLWVVS 1094 (1403)
T ss_pred HHHHHHHHHHHH
Confidence 344444443333
No 145
>COG2181 NarI Nitrate reductase gamma subunit [Energy production and conversion]
Probab=25.23 E-value=1.7e+02 Score=25.47 Aligned_cols=58 Identities=17% Similarity=0.310 Sum_probs=30.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhhccCCCCchHHHHHHHHHHHHHHHHhhhHHHH
Q psy9274 62 SIIVPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFT 119 (318)
Q Consensus 62 ~~~~~~~~~ii~~~gi~gN~lvi~~i~~~~~l~~~~~~ll~~LaiaDll~~l~~~p~~ 119 (318)
.++.+.+..+++++|+++=++-=++-.|-|...++.++|..-|-++-++.++...+..
T Consensus 89 ai~~G~iaGv~~liG~~~L~~RR~~~~rvR~~St~~D~f~lilLla~~~~Gl~~~~~~ 146 (228)
T COG2181 89 AIVLGGIAGVLTLIGLTLLLLRRLFDPRVRATSTPSDIFALLLLLAQLLLGLYATPFS 146 (228)
T ss_pred eeehhhHHHHHHHHHHHHHHHHHHhccceeecCChhHHHHHHHHHHHHHhhhhhhhhc
Confidence 3444555555555554422111111112222336888888888777777777555543
No 146
>KOG0526|consensus
Probab=25.21 E-value=23 Score=34.56 Aligned_cols=7 Identities=14% Similarity=0.183 Sum_probs=3.0
Q ss_pred eeeccCc
Q psy9274 161 AVVHPIA 167 (318)
Q Consensus 161 aI~~P~~ 167 (318)
.|+.|-+
T Consensus 327 Kit~Pg~ 333 (615)
T KOG0526|consen 327 KITVPGE 333 (615)
T ss_pred eeecccc
Confidence 3444443
No 147
>KOG2325|consensus
Probab=24.99 E-value=72 Score=31.20 Aligned_cols=17 Identities=12% Similarity=0.155 Sum_probs=7.9
Q ss_pred hhhHHHHHHHHHHHHHH
Q psy9274 174 ERNALKAILITWIVIVL 190 (318)
Q Consensus 174 ~~~~~~~i~~~W~~s~~ 190 (318)
|.|+...+...-+++++
T Consensus 163 R~rA~a~~~~~~vlg~i 179 (488)
T KOG2325|consen 163 RPRAFAATSGGFVLGII 179 (488)
T ss_pred hHHHHHHhhhHHHHHHH
Confidence 44555555554444433
No 148
>PF02532 PsbI: Photosystem II reaction centre I protein (PSII 4.8 kDa protein); InterPro: IPR003686 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection []. This family represents the low molecular weight transmembrane protein PsbI, which is tightly associated with the D1/D2 heterodimer in PSII. The function of PsbI is unknown, but it may be involved in the assembly, dimerisation or stabilisation of PSII dimers [].; GO: 0015979 photosynthesis, 0009523 photosystem II, 0009539 photosystem II reaction center, 0016020 membrane; PDB: 3A0H_i 3ARC_I 3A0B_i 3BZ2_I 3PRQ_I 3KZI_I 3PRR_I 2AXT_i 4FBY_I 1S5L_i ....
Probab=24.63 E-value=1e+02 Score=18.31 Aligned_cols=16 Identities=50% Similarity=0.918 Sum_probs=9.1
Q ss_pred HHHHHHHHHHHHHHHH
Q psy9274 65 VPIFFGIIIILGFFGN 80 (318)
Q Consensus 65 ~~~~~~ii~~~gi~gN 80 (318)
..+++..++++|++.|
T Consensus 11 vV~ffv~LFifGflsn 26 (36)
T PF02532_consen 11 VVIFFVSLFIFGFLSN 26 (36)
T ss_dssp HHHHHHHHHHHHHHTT
T ss_pred hHHHHHHHHhccccCC
Confidence 3444555666676655
No 149
>COG4645 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=24.56 E-value=5.9e+02 Score=23.84 Aligned_cols=31 Identities=16% Similarity=0.116 Sum_probs=20.6
Q ss_pred chHHHHHHHHHHH--HHHHHhhhHHHHHHHHhh
Q psy9274 95 STTNLLIINLAIA--DLLFIVCCVPFTATDYVT 125 (318)
Q Consensus 95 ~~~~~ll~~Laia--Dll~~l~~~p~~~~~~~~ 125 (318)
-+.+-.+..||.+ -+..+.+.+|+.......
T Consensus 238 ~p~~P~lvgl~~Ay~Al~i~wv~~pLw~~~~~~ 270 (410)
T COG4645 238 LPFSPWLVGLAAAYLALAIFWVFMPLWIITLWF 270 (410)
T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHhHhHHhc
Confidence 4677778888888 455566677776665443
No 150
>PF00003 7tm_3: 7 transmembrane sweet-taste receptor of 3 GCPR; InterPro: IPR017978 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). GPCR family 3 receptors (also known as family C) are structurally similar to other GPCRs, but do not show any significant sequence similarity and thus represent a distinct group. Structurally they are composed of four elements; an N-terminal signal sequence; a large hydrophilic extracellular agonist-binding region containing several conserved cysteine residues which could be involved in disulphide bonds; a shorter region containing seven transmembrane domains; and a C-terminal cytoplasmic domain of variable length []. Family 3 members include the metabotropic glutamate receptors, the extracellular calcium-sensing receptors, the gamma-amino-butyric acid (GABA) type B receptors, and the vomeronasal type-2 receptors [, , , ]. As these receptors regulate many important physiological processes they are potentially promising targets for drug development. This entry represents the C-terminal region of family 3 GPCR receptor proteins, which contains the seven transmembrane region. The seven TM regions assemble in such a way as to produce a docking pocket into which such molecules as cyclamate and lactisole have been found to bind and consequently confer the taste of sweetness []. ; GO: 0004930 G-protein coupled receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane
Probab=24.26 E-value=25 Score=30.36 Aligned_cols=7 Identities=29% Similarity=1.042 Sum_probs=3.9
Q ss_pred hhHHHHH
Q psy9274 131 GDVWCKM 137 (318)
Q Consensus 131 ~~~~C~~ 137 (318)
+...|.+
T Consensus 52 s~~~C~~ 58 (238)
T PF00003_consen 52 SDILCTL 58 (238)
T ss_pred CCcEEEE
Confidence 3455766
No 151
>KOG2428|consensus
Probab=24.17 E-value=58 Score=31.02 Aligned_cols=8 Identities=50% Similarity=0.576 Sum_probs=3.4
Q ss_pred HHHHHHHh
Q psy9274 151 LVLMSLDR 158 (318)
Q Consensus 151 l~~isidR 158 (318)
|+++-.+|
T Consensus 277 lt~~~a~r 284 (443)
T KOG2428|consen 277 LTLNRADR 284 (443)
T ss_pred HHHHHhhc
Confidence 33444444
No 152
>COG5129 MAK16 Nuclear protein with HMG-like acidic region [General function prediction only]
Probab=24.13 E-value=7.8 Score=33.20 Aligned_cols=12 Identities=17% Similarity=0.440 Sum_probs=5.3
Q ss_pred CCccccCCCccc
Q psy9274 35 IPAICRNFSHEY 46 (318)
Q Consensus 35 ~~~~~~n~s~~~ 46 (318)
..+.|...|.+.
T Consensus 33 VTGLC~RqSCPL 44 (303)
T COG5129 33 VTGLCDRQSCPL 44 (303)
T ss_pred cceeeccccCcC
Confidence 344454444433
No 153
>PF10856 DUF2678: Protein of unknown function (DUF2678); InterPro: IPR022564 This family of proteins has no known function.
Probab=24.09 E-value=1.5e+02 Score=22.87 Aligned_cols=17 Identities=6% Similarity=0.446 Sum_probs=8.8
Q ss_pred hHHHHHHHHHHHHHHHH
Q psy9274 60 MVSIIVPIFFGIIIILG 76 (318)
Q Consensus 60 ~~~~~~~~~~~ii~~~g 76 (318)
...++++.+..+..+++
T Consensus 29 iinliiG~vT~l~VLvt 45 (118)
T PF10856_consen 29 IINLIIGAVTSLFVLVT 45 (118)
T ss_pred EEEeehHHHHHHHHHHH
Confidence 34555565555555443
No 154
>KOG3114|consensus
Probab=23.87 E-value=2.9e+02 Score=25.00 Aligned_cols=25 Identities=16% Similarity=0.032 Sum_probs=15.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhh
Q psy9274 63 IIVPIFFGIIIILGFFGNALVVVVV 87 (318)
Q Consensus 63 ~~~~~~~~ii~~~gi~gN~lvi~~i 87 (318)
-...+...+++++.+.||+...+.-
T Consensus 117 GPfWI~~TlVf~l~~~g~~~~~i~~ 141 (290)
T KOG3114|consen 117 GPFWITATLVFALAISGNLATFIRN 141 (290)
T ss_pred ccHHHHHHHHHHHHHcccHHHHHHh
Confidence 3455566677777777876655443
No 155
>PF10873 DUF2668: Protein of unknown function (DUF2668); InterPro: IPR022640 Members in this family of proteins are annotated as cysteine and tyrosine-rich protein 1, however currently no function is known [].
Probab=23.82 E-value=47 Score=26.59 Aligned_cols=33 Identities=24% Similarity=0.513 Sum_probs=24.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhhccCCCC
Q psy9274 62 SIIVPIFFGIIIILGFFGNALVVVVVAFNQQMR 94 (318)
Q Consensus 62 ~~~~~~~~~ii~~~gi~gN~lvi~~i~~~~~l~ 94 (318)
..+.++++.+++++|+++-+.+.+..+.++..+
T Consensus 61 tAIaGIVfgiVfimgvva~i~icvCmc~kn~rg 93 (155)
T PF10873_consen 61 TAIAGIVFGIVFIMGVVAGIAICVCMCMKNSRG 93 (155)
T ss_pred ceeeeeehhhHHHHHHHHHHHHHHhhhhhcCCC
Confidence 456777888888899888887777766555533
No 156
>PF12606 RELT: Tumour necrosis factor receptor superfamily member 19; InterPro: IPR022248 The members of tumor necrosis factor receptor (TNFR) superfamily have been designated as the "guardians of the immune system" due to their roles in immune cell proliferation, differentiation, activation, and death (apoptosis). RELT (receptor expressed in lymphoid tissues) is a member of the TNFR superfamily. The messenger RNA of RELT is especially abundant in hematologic tissues such as spleen, lymph node, and peripheral blood leukocytes as well as in leukemias and lymphomas. RELT is able to activate the NF-kappaB pathway and selectively binds tumor necrosis factor receptor-associated factor 1 []. RELT like proteins 1 and 2 (RELL1 and RELL2) are two RELT homologues that bind to RELT. The expression of RELL1 at the mRNA level is ubiquitous, whereas expression of RELL2 mRNA is more restricted to particular tissues [].
Probab=23.65 E-value=80 Score=20.43 Aligned_cols=24 Identities=21% Similarity=0.487 Sum_probs=10.7
Q ss_pred HHHHHHHHHHHHHHHHhhhhhhccCC
Q psy9274 67 IFFGIIIILGFFGNALVVVVVAFNQQ 92 (318)
Q Consensus 67 ~~~~ii~~~gi~gN~lvi~~i~~~~~ 92 (318)
++..+++++|++| +.++.+.+++.
T Consensus 5 ~iV~i~iv~~lLg--~~I~~~~K~yg 28 (50)
T PF12606_consen 5 LIVSIFIVMGLLG--LSICTTLKAYG 28 (50)
T ss_pred HHHHHHHHHHHHH--HHHHHHhhccc
Confidence 3444555555553 33334444443
No 157
>PRK11026 ftsX cell division ABC transporter subunit FtsX; Provisional
Probab=23.45 E-value=5.7e+02 Score=23.28 Aligned_cols=18 Identities=11% Similarity=0.244 Sum_probs=11.5
Q ss_pred HHHHHHHHhhhhhhccCC
Q psy9274 75 LGFFGNALVVVVVAFNQQ 92 (318)
Q Consensus 75 ~gi~gN~lvi~~i~~~~~ 92 (318)
+-+++|.+.+.+.-|++.
T Consensus 194 ~~~i~ntir~~v~~r~~e 211 (309)
T PRK11026 194 FLVIGNSVRLSIFSRRDT 211 (309)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 345677777777766554
No 158
>TIGR02901 QoxD cytochrome aa3 quinol oxidase, subunit IV. This family (QoxD) encodes subunit IV of the aa3-type quinone oxidase, one of several bacterial terminal oxidases. This complex couples oxidation of reduced quinones with the reduction of molecular oxygen to water and the pumping of protons to form a proton gradient utilized for ATP production. aa3-type oxidases contain two heme a cofactors as well as copper atoms in the active site.
Probab=23.17 E-value=3.2e+02 Score=20.23 Aligned_cols=18 Identities=17% Similarity=0.073 Sum_probs=9.9
Q ss_pred hHHHHHHHHHHHHHHHHh
Q psy9274 96 TTNLLIINLAIADLLFIV 113 (318)
Q Consensus 96 ~~~~ll~~LaiaDll~~l 113 (318)
.....++.+|++-+++-+
T Consensus 36 ~~~~~i~~lA~iQi~VqL 53 (94)
T TIGR02901 36 MGLTIIIIFAFIQAGLQL 53 (94)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 345555666666555544
No 159
>KOG2592|consensus
Probab=22.50 E-value=3.4e+02 Score=25.80 Aligned_cols=125 Identities=15% Similarity=0.103 Sum_probs=57.1
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHhhhhhhccCCCCchHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcCCcchhHHHHHHH
Q psy9274 60 MVSIIVPIFFGIIIILGFFGNALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPFTATDYVTTCWPFGDVWCKMYL 139 (318)
Q Consensus 60 ~~~~~~~~~~~ii~~~gi~gN~lvi~~i~~~~~l~~~~~~ll~~LaiaDll~~l~~~p~~~~~~~~~~~~~~~~~C~~~~ 139 (318)
..+..+......+.++++++-+++.+.+.....-.-...+..+||-++=.+..+...| ......|.+..+ .-
T Consensus 179 ~wy~~Ll~~T~~~Y~~s~~~~~l~fv~ft~~~~C~~nk~fi~~nlilcv~~si~sv~P-----~VQe~~P~SGLl---QS 250 (426)
T KOG2592|consen 179 FWYAALLGVTLLMYLLSLVATVLLFVYFTPGDGCGENKFFISVNLILCVAISILSVHP-----KVQEGQPRSGLL---QS 250 (426)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhheEecCCCCCCcceEEEeechHHHHHHHHHhcCh-----hhhcCCCCcccc---hh
Confidence 3355555666666677776655554444431211112222333333322222222222 122222222111 22
Q ss_pred HHHHHHHHHHHHHHHHHHhheeeeccCcccc-cc-----------chhhHHHHHHHHHHHHHHHHh
Q psy9274 140 IVVTAYASVYTLVLMSLDRFLAVVHPIASMS-VR-----------TERNALKAILITWIVIVLTAV 193 (318)
Q Consensus 140 ~~~~~~~s~~~l~~isidRy~aI~~P~~y~~-~~-----------t~~~~~~~i~~~W~~s~~~~~ 193 (318)
...+.+...++..+++-++ =..|.|...+. .. +.....++-++.|+.+++.+.
T Consensus 251 svIs~Y~~YLt~SAlss~P-e~~CNP~~~~~~~~t~~~~~~~~~~~~~~~~iiGli~~~lcilYsa 315 (426)
T KOG2592|consen 251 SVISLYTMYLTWSALSSEP-ENGCNPWLNSSKNVTITVGPGASVSTFDATNIIGLIFLLLCILYSA 315 (426)
T ss_pred HHHHHHHHHHHHHHHhcCC-ccccChhhhcccccccccCcccccccccccchHHHHHHHHHHHHHH
Confidence 2344555666777888888 77888874332 10 111223455566666666554
No 160
>KOG2475|consensus
Probab=22.36 E-value=42 Score=32.77 Aligned_cols=6 Identities=33% Similarity=0.933 Sum_probs=2.8
Q ss_pred CCCccC
Q psy9274 312 GGWWFE 317 (318)
Q Consensus 312 ~~~~~~ 317 (318)
-|+||.
T Consensus 205 ~gt~~g 210 (587)
T KOG2475|consen 205 SGTWHG 210 (587)
T ss_pred cceeec
Confidence 345553
No 161
>PF10749 DUF2534: Protein of unknown function (DUF2534); InterPro: IPR019685 This entry represents proteins with unknown function, and appear to be restricted to Enterobacteriaceae.
Probab=21.89 E-value=3.1e+02 Score=19.70 Aligned_cols=16 Identities=19% Similarity=0.303 Sum_probs=7.8
Q ss_pred HHHHHHHHHHHHHHHh
Q psy9274 98 NLLIINLAIADLLFIV 113 (318)
Q Consensus 98 ~~ll~~LaiaDll~~l 113 (318)
-.|+.++++.=++.+.
T Consensus 13 kkFl~~l~~vfiia~~ 28 (85)
T PF10749_consen 13 KKFLLALAIVFIIAAT 28 (85)
T ss_pred hHHHHHHHHHHHHHHH
Confidence 3455555554444433
No 162
>KOG3118|consensus
Probab=21.46 E-value=50 Score=31.83 Aligned_cols=59 Identities=29% Similarity=0.556 Sum_probs=0.0
Q ss_pred ccccccCCCcccccchhhhhhHHHHHHHHHHHHHHhhhcCCCCCCccccccccchhhhh
Q psy9274 236 ENDYCKGGEREVHEEEEEEEEEEEETKAKETKLKEKKKLGGGGGEEEEEEEEEEEEEEE 294 (318)
Q Consensus 236 ~~~~~~~~~~~~~~~eeee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~eee~~~~ 294 (318)
.+.+-..+...+.++-.++++.+++++.+.+.+.+..-.+.++++.+..++++++++++
T Consensus 54 dd~~~~~~~~~~dde~~~e~~sedEEe~en~~~~e~l~s~~~e~D~e~~ddd~e~e~dd 112 (517)
T KOG3118|consen 54 DDDFHEFDLQGEDDESEEEEDSEDEEEGENKRQKEYLASDNGEGDDEDADDDKEEEEDD 112 (517)
T ss_pred ccchhHHHhhhchhhhhhhccccchhhhhhHHHHHHHhhhcccccccccccccchhhhc
No 163
>COG2456 Uncharacterized conserved protein [Function unknown]
Probab=21.26 E-value=3.8e+02 Score=20.43 Aligned_cols=38 Identities=16% Similarity=0.092 Sum_probs=26.2
Q ss_pred HHhhhhhhccCCCCchHHHHHHHHHHHHHHHHhhhHHH
Q psy9274 81 ALVVVVVAFNQQMRSTTNLLIINLAIADLLFIVCCVPF 118 (318)
Q Consensus 81 ~lvi~~i~~~~~l~~~~~~ll~~LaiaDll~~l~~~p~ 118 (318)
.++++++.|.|+.+...+=++..+|+.-++......|.
T Consensus 21 ~~m~~~~~r~k~~k~s~~~ll~W~~~wv~vlifal~P~ 58 (121)
T COG2456 21 FLMIYVFFRLKRRKLSLSDLLFWEAFWVFVLIFALFPE 58 (121)
T ss_pred HHHHHHHHHHHHhcCCccchHHHHHHHHHHHHHHhcch
Confidence 45566777777767777777777877766666555565
No 164
>PF06609 TRI12: Fungal trichothecene efflux pump (TRI12); InterPro: IPR010573 This family consists of several fungal specific trichothecene efflux pump proteins. Many of the genes involved in trichothecene toxin biosynthesis in Fusarium sporotrichioides are present within a gene cluster. It has been suggested that TRI12 may play a role in F. sporotrichioides self-protection against trichothecenes [].
Probab=21.04 E-value=8.7e+02 Score=24.56 Aligned_cols=31 Identities=13% Similarity=0.228 Sum_probs=21.0
Q ss_pred HHHHHHHHHhheeeeccCccccccchhhHHH
Q psy9274 149 YTLVLMSLDRFLAVVHPIASMSVRTERNALK 179 (318)
Q Consensus 149 ~~l~~isidRy~aI~~P~~y~~~~t~~~~~~ 179 (318)
++.++..+|+|.+-.+|+-.+++++.+|.+.
T Consensus 283 ~Lv~F~~wE~~~~~~~Pl~P~~Lf~~~r~~~ 313 (599)
T PF06609_consen 283 LLVAFVVWEWFGAPKDPLFPHRLFKDRRGFA 313 (599)
T ss_pred HHHHHHHhhhhccCCCCcCCHHHhccchHHH
Confidence 3445566788988789987777766545443
No 165
>KOG4434|consensus
Probab=20.89 E-value=26 Score=32.57 Aligned_cols=6 Identities=17% Similarity=0.041 Sum_probs=2.5
Q ss_pred CCCcCC
Q psy9274 5 ERDVMG 10 (318)
Q Consensus 5 ~~~~~~ 10 (318)
||+.|+
T Consensus 9 p~M~~k 14 (520)
T KOG4434|consen 9 PRMEIK 14 (520)
T ss_pred chhHHH
Confidence 344444
No 166
>PF11196 DUF2834: Protein of unknown function (DUF2834); InterPro: IPR021362 This is a bacterial family of uncharacterised proteins.
Probab=20.73 E-value=3.6e+02 Score=19.98 Aligned_cols=28 Identities=21% Similarity=0.215 Sum_probs=21.7
Q ss_pred chhhHHHHHHHHHHHHHHHHhhhHhhcC
Q psy9274 173 TERNALKAILITWIVIVLTAVPVFNMHG 200 (318)
Q Consensus 173 t~~~~~~~i~~~W~~s~~~~~p~~~~~~ 200 (318)
..|+....+..+..+++.+++|.++..+
T Consensus 67 ~ir~~w~~i~~t~~vgvs~glPLyL~lR 94 (97)
T PF11196_consen 67 GIRHWWLYIVLTFFVGVSFGLPLYLYLR 94 (97)
T ss_pred ccchHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4555666888888899999999887654
No 167
>PF10192 GpcrRhopsn4: Rhodopsin-like GPCR transmembrane domain; InterPro: IPR019336 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). This region of 270 amino acids is the seven transmembrane alpha-helical domains included within five GPCRRHODOPSN4 motifs of a G-protein-coupled-receptor (GPCR) protein, conserved from nematodes to humans [].
Probab=20.62 E-value=5.8e+02 Score=22.35 Aligned_cols=29 Identities=10% Similarity=0.045 Sum_probs=14.9
Q ss_pred HHHhhhhhhccCCCCchHHHHHHHHHHHH
Q psy9274 80 NALVVVVVAFNQQMRSTTNLLIINLAIAD 108 (318)
Q Consensus 80 N~lvi~~i~~~~~l~~~~~~ll~~LaiaD 108 (318)
+...+..+.+++.++....++.+++.+-=
T Consensus 46 ~~~~~~~l~~~~~~h~~~~l~~~~l~l~~ 74 (257)
T PF10192_consen 46 SIYSIQSLKKRGLMHPVYKLFSAALLLQF 74 (257)
T ss_pred HHHHHHHHHhcCCCCHHHHHHHHHHHHHH
Confidence 33344445555555555556555555433
No 168
>PF04238 DUF420: Protein of unknown function (DUF420); InterPro: IPR007352 This is a predicted membrane protein with four transmembrane helices.
Probab=20.36 E-value=4.4e+02 Score=20.82 Aligned_cols=27 Identities=19% Similarity=0.342 Sum_probs=19.9
Q ss_pred ccchhhHHHHHHHHHHHHHHHHhhhHh
Q psy9274 171 VRTERNALKAILITWIVIVLTAVPVFN 197 (318)
Q Consensus 171 ~~t~~~~~~~i~~~W~~s~~~~~p~~~ 197 (318)
..+.||....-..+|+++.+..+..++
T Consensus 104 ~~~Hrki~r~t~piWlyvsvTGvvVYl 130 (133)
T PF04238_consen 104 FTRHRKIGRWTFPIWLYVSVTGVVVYL 130 (133)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345667777888999998888766543
No 169
>KOG4193|consensus
Probab=20.08 E-value=4.2e+02 Score=26.85 Aligned_cols=49 Identities=27% Similarity=0.347 Sum_probs=23.7
Q ss_pred HHHHHHHHHHHHhheeeeccCccccccc--h---hhHHHHHHHHHHHHHHHHhh
Q psy9274 146 ASVYTLVLMSLDRFLAVVHPIASMSVRT--E---RNALKAILITWIVIVLTAVP 194 (318)
Q Consensus 146 ~s~~~l~~isidRy~aI~~P~~y~~~~t--~---~~~~~~i~~~W~~s~~~~~p 194 (318)
..++.+++.-+-|-.....|...+.... . -....++.+.|+++++...+
T Consensus 486 i~~Fv~t~~~l~~~~~~~~~~~~~~~~~~~~~~~l~L~~lLGlTW~fgi~s~~~ 539 (610)
T KOG4193|consen 486 IVMFVVTLKKLLRRLSKLQPIASKLENISLIRSALALLFLLGLTWIFGIFSWLP 539 (610)
T ss_pred HHHHHHHHHHHhhcccccCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 3444455555545555555443322111 1 11223556678887777666
Done!