Query psy16862
Match_columns 303
No_of_seqs 201 out of 1740
Neff 10.0
Searched_HMMs 46136
Date Fri Aug 16 23:57:17 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy16862.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/16862hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4220|consensus 100.0 7.9E-33 1.7E-37 234.0 2.8 203 41-247 30-234 (503)
2 KOG4219|consensus 100.0 1.3E-30 2.8E-35 220.9 14.3 200 34-237 28-240 (423)
3 PHA03234 DNA packaging protein 100.0 2.5E-27 5.4E-32 207.2 17.5 194 37-236 28-227 (338)
4 PHA03235 DNA packaging protein 99.9 2.1E-26 4.6E-31 205.7 20.1 199 38-239 29-237 (409)
5 PHA02834 chemokine receptor-li 99.9 6.9E-27 1.5E-31 204.1 16.5 192 41-237 28-220 (323)
6 PHA02638 CC chemokine receptor 99.9 3.6E-26 7.9E-31 205.1 21.5 195 40-237 97-304 (417)
7 PHA03087 G protein-coupled che 99.9 2E-26 4.3E-31 202.7 17.9 198 39-238 38-235 (335)
8 PF00001 7tm_1: 7 transmembran 99.9 1.4E-25 3.1E-30 188.7 15.1 180 58-242 1-184 (257)
9 KOG2087|consensus 99.7 6E-18 1.3E-22 142.7 9.4 202 37-242 20-228 (363)
10 PF10320 7TM_GPCR_Srsx: Serpen 99.6 7E-16 1.5E-20 130.2 8.4 119 53-173 2-120 (257)
11 PF10328 7TM_GPCR_Srx: Serpent 99.6 8.8E-15 1.9E-19 125.0 14.6 186 50-240 2-191 (274)
12 PF10324 7TM_GPCR_Srw: Serpent 99.5 5.5E-13 1.2E-17 116.5 12.7 195 50-245 5-231 (318)
13 PF05296 TAS2R: Mammalian tast 99.3 3.1E-10 6.7E-15 98.0 19.2 204 41-245 6-219 (303)
14 PF11710 Git3: G protein-coupl 99.3 2.6E-10 5.6E-15 92.1 16.6 161 70-239 30-200 (201)
15 PF10323 7TM_GPCR_Srv: Serpent 99.2 8.9E-11 1.9E-15 100.5 10.1 181 52-238 5-195 (283)
16 PF05462 Dicty_CAR: Slime mold 99.1 1E-08 2.2E-13 87.8 18.5 129 42-178 7-135 (303)
17 PF10321 7TM_GPCR_Srt: Serpent 98.8 2E-07 4.4E-12 80.4 13.5 193 38-240 29-229 (313)
18 PF03402 V1R: Vomeronasal orga 98.4 1.3E-06 2.7E-11 73.2 8.9 172 73-246 8-187 (265)
19 PF10317 7TM_GPCR_Srd: Serpent 98.4 5.3E-06 1.1E-10 71.6 13.0 125 46-176 3-139 (292)
20 PF10316 7TM_GPCR_Srbc: Serpen 98.4 7.9E-06 1.7E-10 69.1 12.7 132 41-172 5-139 (273)
21 PF10292 7TM_GPCR_Srab: Serpen 98.3 8.1E-05 1.7E-09 65.2 17.7 131 41-172 16-153 (324)
22 PF00002 7tm_2: 7 transmembran 98.2 5.1E-07 1.1E-11 75.7 1.9 129 47-180 6-134 (242)
23 PF02118 Srg: Srg family chemo 98.0 5.3E-05 1.2E-09 64.7 10.7 121 50-171 6-137 (275)
24 PF02117 7TM_GPCR_Sra: Serpent 98.0 0.0014 3E-08 57.4 19.6 123 28-150 7-137 (328)
25 PF04789 DUF621: Protein of un 97.8 0.0012 2.6E-08 54.7 13.4 176 57-234 30-214 (305)
26 PF10327 7TM_GPCR_Sri: Serpent 97.7 0.00076 1.6E-08 58.4 11.6 196 44-243 11-230 (303)
27 PF10326 7TM_GPCR_Str: Serpent 97.7 4.6E-05 9.9E-10 66.3 4.0 102 49-150 6-118 (307)
28 KOG4193|consensus 97.6 0.0024 5.2E-08 60.1 14.3 167 50-237 328-496 (610)
29 PF02101 Ocular_alb: Ocular al 97.3 0.013 2.9E-07 51.0 14.5 128 43-176 28-172 (405)
30 PF10318 7TM_GPCR_Srh: Serpent 96.6 0.26 5.5E-06 42.7 17.1 132 44-177 6-146 (302)
31 KOG4564|consensus 96.6 0.15 3.3E-06 46.4 15.5 179 46-238 149-339 (473)
32 PF03383 Serpentine_r_xa: Caen 96.6 0.0016 3.5E-08 49.6 2.6 63 116-178 46-108 (153)
33 PF01534 Frizzled: Frizzled/Sm 96.4 0.38 8.3E-06 42.0 16.5 139 41-181 8-159 (328)
34 PF10319 7TM_GPCR_Srj: Serpent 96.4 0.089 1.9E-06 45.1 12.2 119 45-163 8-137 (310)
35 PF06681 DUF1182: Protein of u 96.3 0.11 2.3E-06 41.5 11.2 41 108-148 111-151 (226)
36 PF03125 Sre: C. elegans Sre G 95.9 0.66 1.4E-05 41.4 15.8 112 41-152 26-161 (365)
37 PF02175 7TM_GPCR_Srb: Serpent 95.9 0.81 1.7E-05 37.6 19.7 48 107-154 78-125 (236)
38 PF10322 7TM_GPCR_Sru: Serpent 95.8 0.96 2.1E-05 39.2 16.0 123 114-237 103-226 (307)
39 PF02076 STE3: Pheromone A rec 95.3 0.57 1.2E-05 40.1 12.6 117 55-177 4-123 (283)
40 PF10669 Phage_Gp23: Protein g 91.8 1.2 2.6E-05 30.7 6.6 25 212-236 20-44 (121)
41 PF15086 UPF0542: Uncharacteri 91.7 1.2 2.6E-05 28.9 6.3 20 205-224 19-38 (74)
42 KOG2302|consensus 89.3 16 0.00036 36.6 14.1 21 73-93 1194-1214(1956)
43 PF10669 Phage_Gp23: Protein g 87.9 2.3 4.9E-05 29.4 5.6 8 232-239 36-43 (121)
44 KOG4349|consensus 87.8 8.1 0.00018 28.1 8.5 85 41-126 47-132 (143)
45 PRK00247 putative inner membra 81.0 3.6 7.7E-05 37.3 5.4 29 271-299 335-363 (429)
46 PF13853 7tm_4: Olfactory rece 77.0 0.085 1.8E-06 40.3 -5.3 79 160-239 5-90 (144)
47 PF07074 TRAP-gamma: Transloco 76.6 4.1 8.9E-05 31.6 3.8 16 166-181 22-37 (170)
48 PF11044 TMEMspv1-c74-12: Plec 76.4 13 0.00027 21.7 5.0 26 212-237 8-33 (49)
49 PF01528 Herpes_glycop: Herpes 75.9 59 0.0013 29.1 11.8 36 206-241 301-336 (374)
50 COG1862 YajC Preprotein transl 71.2 5.4 0.00012 28.0 3.1 36 214-249 11-46 (97)
51 PF11671 Apis_Csd: Complementa 70.6 3.6 7.7E-05 30.1 2.1 38 260-297 12-49 (146)
52 PF08229 SHR3_chaperone: ER me 68.3 11 0.00024 30.2 4.6 13 219-231 144-156 (196)
53 PF06454 DUF1084: Protein of u 63.1 98 0.0021 26.6 10.6 22 37-58 24-45 (281)
54 PF10325 7TM_GPCR_Srz: Serpent 62.7 93 0.002 26.2 13.1 111 119-238 87-198 (267)
55 KOG4007|consensus 62.2 22 0.00047 28.3 5.0 19 281-299 205-223 (229)
56 KOG0345|consensus 61.4 7.5 0.00016 35.5 2.7 31 269-299 506-536 (567)
57 COG1230 CzcD Co/Zn/Cd efflux s 57.0 1.1E+02 0.0023 26.6 8.8 57 47-105 126-183 (296)
58 PF08229 SHR3_chaperone: ER me 54.9 5.6 0.00012 31.9 0.8 7 141-147 78-84 (196)
59 PHA03237 envelope glycoprotein 54.1 1.8E+02 0.0038 26.7 14.5 30 212-241 330-359 (424)
60 PRK06531 yajC preprotein trans 53.9 14 0.0003 26.9 2.6 30 217-246 7-36 (113)
61 PF14362 DUF4407: Domain of un 53.0 1.5E+02 0.0033 25.5 9.7 53 124-176 49-101 (301)
62 PF01102 Glycophorin_A: Glycop 51.6 20 0.00044 26.4 3.2 21 53-73 74-94 (122)
63 KOG2302|consensus 50.3 3.1E+02 0.0067 28.3 12.1 22 74-95 1159-1180(1956)
64 PHA03239 envelope glycoprotein 49.5 2.1E+02 0.0045 26.2 15.0 30 212-241 336-365 (429)
65 PF02699 YajC: Preprotein tran 49.2 32 0.00069 23.3 3.7 28 219-246 9-36 (82)
66 PF06783 UPF0239: Uncharacteri 47.7 4.4 9.5E-05 27.4 -0.7 19 204-222 22-40 (85)
67 PRK05886 yajC preprotein trans 47.2 30 0.00066 24.9 3.5 31 216-246 7-38 (109)
68 KOG4289|consensus 47.1 78 0.0017 33.6 7.2 61 110-176 2222-2282(2531)
69 TIGR00739 yajC preprotein tran 45.1 27 0.00059 23.8 2.9 28 220-247 11-38 (84)
70 PF00003 7tm_3: 7 transmembran 44.4 49 0.0011 27.2 5.0 29 109-137 51-79 (238)
71 PRK03449 putative inner membra 43.9 31 0.00066 30.0 3.6 28 210-237 237-264 (304)
72 COG1422 Predicted membrane pro 43.2 1E+02 0.0022 24.8 6.1 22 220-241 54-75 (201)
73 PF14138 COX16: Cytochrome c o 43.0 88 0.0019 21.1 5.0 24 215-238 3-26 (80)
74 PF09882 DUF2109: Predicted me 42.8 95 0.0021 20.7 4.9 48 51-98 3-50 (78)
75 PHA02616 VP2/VP3; Provisional 41.6 18 0.00039 28.7 1.7 13 80-92 60-72 (259)
76 KOG2927|consensus 40.7 74 0.0016 28.0 5.4 142 148-296 178-321 (372)
77 KOG0709|consensus 40.6 29 0.00062 31.6 3.0 26 261-286 246-271 (472)
78 KOG2422|consensus 40.6 9.5 0.0002 35.7 0.1 18 280-297 90-107 (665)
79 PRK01315 putative inner membra 40.3 59 0.0013 28.6 4.9 22 211-232 226-247 (329)
80 PF15102 TMEM154: TMEM154 prot 39.0 7.2 0.00016 29.5 -0.8 13 212-224 58-70 (146)
81 PF05297 Herpes_LMP1: Herpesvi 38.6 10 0.00022 32.1 0.0 8 133-140 118-125 (381)
82 smart00792 Agouti Agouti prote 36.8 71 0.0015 23.4 4.0 9 279-287 64-72 (124)
83 COG4736 CcoQ Cbb3-type cytochr 36.7 81 0.0018 20.0 3.8 31 215-245 12-42 (60)
84 PRK04125 murein hydrolase regu 36.5 15 0.00032 27.8 0.6 7 288-294 134-140 (141)
85 PHA03242 envelope glycoprotein 36.1 3.5E+02 0.0075 24.8 13.8 26 215-240 330-355 (428)
86 TIGR02736 cbb3_Q_epsi cytochro 35.6 36 0.00079 21.0 2.0 27 215-241 5-31 (56)
87 TIGR02230 ATPase_gene1 F0F1-AT 35.6 14 0.00031 26.1 0.3 16 284-299 25-40 (100)
88 PF11446 DUF2897: Protein of u 35.6 52 0.0011 20.4 2.8 23 46-68 6-28 (55)
89 KOG4055|consensus 34.1 34 0.00073 26.9 2.1 15 227-241 82-96 (213)
90 PF10883 DUF2681: Protein of u 32.1 1.7E+02 0.0037 20.1 7.0 23 221-243 13-35 (87)
91 PF11683 DUF3278: Protein of u 31.4 1.7E+02 0.0037 21.7 5.6 41 138-178 12-53 (129)
92 PF06679 DUF1180: Protein of u 31.0 34 0.00074 26.6 1.8 18 217-234 104-121 (163)
93 PRK05585 yajC preprotein trans 30.6 78 0.0017 22.7 3.4 26 221-246 27-52 (106)
94 KOG2357|consensus 30.6 59 0.0013 29.2 3.3 12 135-146 245-256 (440)
95 COG4606 CeuB ABC-type enteroch 29.6 3.6E+02 0.0079 23.1 7.6 94 49-143 111-206 (321)
96 PF06658 DUF1168: Protein of u 29.6 1.3E+02 0.0027 22.9 4.5 16 226-241 37-52 (142)
97 KOG2357|consensus 29.3 82 0.0018 28.4 3.9 8 108-115 176-183 (440)
98 PF10329 DUF2417: Region of un 29.2 3.4E+02 0.0073 22.6 9.4 17 126-142 139-155 (232)
99 PF07946 DUF1682: Protein of u 29.1 1.2E+02 0.0025 26.6 5.0 22 43-64 12-33 (321)
100 PF07946 DUF1682: Protein of u 28.5 33 0.00072 30.0 1.5 10 107-116 72-81 (321)
101 TIGR02976 phageshock_pspB phag 27.9 1.9E+02 0.0041 19.3 5.5 28 214-241 5-32 (75)
102 PRK09458 pspB phage shock prot 27.8 1.9E+02 0.0041 19.3 4.6 29 213-241 4-32 (75)
103 PF15330 SIT: SHP2-interacting 27.8 93 0.002 22.3 3.4 29 43-71 2-30 (107)
104 PF08213 DUF1713: Mitochondria 27.2 78 0.0017 17.5 2.2 15 275-289 11-25 (34)
105 PF05434 Tmemb_9: TMEM9; Inte 26.9 66 0.0014 24.5 2.6 14 285-298 130-143 (149)
106 PF02532 PsbI: Photosystem II 26.3 1.1E+02 0.0023 17.1 2.6 16 44-59 11-26 (36)
107 PF05151 PsbM: Photosystem II 25.4 1.2E+02 0.0026 16.3 2.7 23 207-229 7-29 (31)
108 smart00786 SHR3_chaperone ER m 25.3 1.2E+02 0.0025 24.5 3.8 20 219-238 144-163 (196)
109 PF05545 FixQ: Cbb3-type cytoc 25.3 1.3E+02 0.0028 17.9 3.3 23 218-240 15-37 (49)
110 cd07912 Tweety_N N-terminal do 24.9 4.4E+02 0.0095 24.2 7.9 17 76-92 79-95 (418)
111 PF12606 RELT: Tumour necrosis 24.9 1.4E+02 0.003 18.2 3.2 23 46-70 5-27 (50)
112 COG5472 Predicted small integr 24.8 3E+02 0.0064 20.5 6.2 19 82-100 101-119 (164)
113 KOG3584|consensus 24.5 96 0.0021 26.5 3.3 30 258-287 283-312 (348)
114 PRK02463 OxaA-like protein pre 24.0 76 0.0017 27.6 2.9 6 290-295 298-303 (307)
115 PF13038 DUF3899: Domain of un 24.0 2.4E+02 0.0053 19.2 5.1 11 222-232 19-29 (92)
116 PHA03235 DNA packaging protein 23.9 5.5E+02 0.012 23.3 10.5 20 217-236 297-316 (409)
117 PF04936 DUF658: Protein of un 23.5 1.7E+02 0.0038 22.6 4.3 15 258-272 144-158 (186)
118 PF02009 Rifin_STEVOR: Rifin/s 23.4 1.5E+02 0.0033 25.7 4.5 20 208-227 258-277 (299)
119 PRK13664 hypothetical protein; 23.3 1E+02 0.0022 19.2 2.5 16 48-63 11-26 (62)
120 PRK04989 psbM photosystem II r 22.6 83 0.0018 17.4 1.8 23 206-228 6-28 (35)
121 PRK03557 zinc transporter ZitB 22.5 5.2E+02 0.011 22.5 9.9 18 51-68 127-144 (312)
122 PF05399 EVI2A: Ectropic viral 22.1 2.4E+02 0.0053 22.9 5.0 26 254-279 182-207 (227)
123 PHA03164 hypothetical protein; 22.1 89 0.0019 20.7 2.2 21 73-93 56-76 (88)
124 PF10856 DUF2678: Protein of u 22.0 1.7E+02 0.0036 21.4 3.7 6 74-79 58-63 (118)
125 TIGR03038 PS_II_psbM photosyst 21.9 93 0.002 17.0 1.9 23 206-228 6-28 (33)
126 PF15345 TMEM51: Transmembrane 20.1 1.3E+02 0.0027 24.9 3.2 21 224-244 71-91 (233)
No 1
>KOG4220|consensus
Probab=99.97 E-value=7.9e-33 Score=233.99 Aligned_cols=203 Identities=27% Similarity=0.486 Sum_probs=177.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhHHHHHHHHHhHHHHhhhhhHHHHHHhccccccccccchhchHH
Q psy16862 41 FIIKGMLMCFIIITAILGNLLVIISVIKHRKLRIITNYFVVSLAFADLLVALCVMPFNAIVSLTDEWYFGYFMCDVWNSF 120 (303)
Q Consensus 41 ~~~~~~~~~~i~i~gl~~N~~vl~~i~~~~~~~~~~~~ll~~Lai~Dl~~~l~~~~~~~~~~~~~~~~~~~~~C~~~~~~ 120 (303)
-++++++...++++.++||++|++.+.-++++++..|+||++||+||++++++.+|.+..+.+-+.|++|..+|.++..+
T Consensus 30 ~v~i~~v~~~lsLVTv~GNlLVmiSfKvnrqLqTVnNYfLfSLAcADliIG~~SMnl~t~Y~lmg~W~LG~~~CdlWLal 109 (503)
T KOG4220|consen 30 VVFIVVVTGSLSLVTVVGNLLVMISFKVNRQLQTVNNYFLFSLACADLIIGAFSMNLYTTYTLMGYWPLGPLVCDLWLAL 109 (503)
T ss_pred EEeeehhhhHHHHHhhhccEEEEEEEEecceeeeecceeehHHHHhhhhhheeechHHHHHHHHcccccchHHHHHHHHH
Confidence 45677888899999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccchhhhHHHHHHHHhhhhhhhhccccccccccCCCcccccCCCceEE-
Q psy16862 121 DVYFSTASILHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLSNVWILPGIISFTPIMLGWYTTPDHKQYRKHHPNVCI- 199 (303)
Q Consensus 121 ~~~~~~~s~~~l~~iaidRy~aI~~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c~- 199 (303)
.++...+|++.+++|++|||..|.+|+.|+...|.+++..+|++.|++++++..|. +..|....+.. ....+.|.
T Consensus 110 DYvaSNASVmNLLiISFDRYFsVTrPLtYrakRTtkrA~~MI~~AW~iSfiLWaPa-Il~WqyivGkr---Tv~~~eC~i 185 (503)
T KOG4220|consen 110 DYVASNASVMNLLIISFDRYFSVTRPLTYRAKRTTKRAGLMIGAAWVLSFVLWAPA-ILFWQYIVGKR---TVPDGECYI 185 (503)
T ss_pred HHHhhhhhhhhhheeeeecceeecccccccccccchHHHHHHHHHHHHHHHHHHHH-HHhhHhheeee---ecCCCceEE
Confidence 99999999999999999999999999999999999999999999999999998666 45553333322 22345664
Q ss_pred -EEehhhHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHhhhhhccc
Q psy16862 200 -FVVNVYYALISSSISFWIPGFVMKKKKKRKKKKKKKKKKKRKKKKNQH 247 (303)
Q Consensus 200 -~~~~~~~~~~~~~~~~~ip~~i~~~~y~~I~~~l~~~~~~~~~~~~~~ 247 (303)
+.......+-..+..|.+|..+|+++|.+|++..+++.+.....+...
T Consensus 186 QFlsnp~iTfGTAiAAFYlPVtiM~~LY~rIyret~kR~k~~~~lq~s~ 234 (503)
T KOG4220|consen 186 QFLSNPAITFGTAIAAFYLPVTIMTILYWRIYRETRKRQKELAKLQASL 234 (503)
T ss_pred EeecCceeehhHHHHHHHhhHHHHHHHHHHHHHHHHHHHHhhhhccccc
Confidence 334455556666788999999999999999999999999888776654
No 2
>KOG4219|consensus
Probab=99.97 E-value=1.3e-30 Score=220.94 Aligned_cols=200 Identities=25% Similarity=0.459 Sum_probs=168.7
Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhHHHHHHHHHhHHHHhhhhhHHHHHHhcccccccccc
Q psy16862 34 IHIHIFPFIIKGMLMCFIIITAILGNLLVIISVIKHRKLRIITNYFVVSLAFADLLVALCVMPFNAIVSLTDEWYFGYFM 113 (303)
Q Consensus 34 ~~~~~~~~~~~~~~~~~i~i~gl~~N~~vl~~i~~~~~~~~~~~~ll~~Lai~Dl~~~l~~~~~~~~~~~~~~~~~~~~~ 113 (303)
.........+.+++++++.+++++||+++++++..+|++|+.+|++|+|||+||++++++..++...+.....|.+|..+
T Consensus 28 f~lp~~~~~~wai~yg~l~~vAv~GN~iVlwIil~hrrMRtvtnyfL~NLAfADl~~s~Fn~~f~f~yal~~~W~~G~f~ 107 (423)
T KOG4219|consen 28 FVLPAWQQALWAIAYGLLVFVAVVGNLIVLWIILAHRRMRTVTNYFLVNLAFADLSMSIFNTVFNFQYALHQEWYFGSFY 107 (423)
T ss_pred ccCCHHHHHHHHHHHHHHHHHHHhcCceEEEEEeehhehhhhHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhccccccce
Confidence 34444558899999999999999999999999999999999999999999999999999999999888888999999999
Q ss_pred chhchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccchhhhHHHHHHHHhhhhhhhhccccccccccCCCcccccC
Q psy16862 114 CDVWNSFDVYFSTASILHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLSNVWILPGIISFTPIMLGWYTTPDHKQYRKH 193 (303)
Q Consensus 114 C~~~~~~~~~~~~~s~~~l~~iaidRy~aI~~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~ 193 (303)
|++..|+..+...+|++++++||+|||.||.||++.+ .+++..+++++++|+++++++.|..+.......... ...
T Consensus 108 C~f~nf~~itav~vSVfTlvAiA~DRy~AIi~Pl~~r--~s~r~sk~iIllIW~lA~l~a~P~~l~s~v~~~~~~--d~~ 183 (423)
T KOG4219|consen 108 CRFVNFFPITAVFVSVFTLVAIAIDRYMAIIHPLQPR--PSRRSSKIIILLIWALALLLALPQLLYSSVEELYLY--DGE 183 (423)
T ss_pred eeeccccchhhhhHhHHHHHHHHHHHHHHHhhhcccC--CCCcceeehhHHHHHHHHHHhccceeeeeeEEeecc--CCc
Confidence 9999999999999999999999999999999999855 688999999999999999999998765442222211 111
Q ss_pred CCceEE---------EEeh----hhHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHH
Q psy16862 194 HPNVCI---------FVVN----VYYALISSSISFWIPGFVMKKKKKRKKKKKKKKK 237 (303)
Q Consensus 194 ~~~~c~---------~~~~----~~~~~~~~~~~~~ip~~i~~~~y~~I~~~l~~~~ 237 (303)
....|. .... ..|..+..++.+++|++++.+.|..|.+.++.++
T Consensus 184 ~~~~~~~~~pe~~~~~~~~~~~~~~y~~vl~~lqYflPliVl~~~Yt~iav~LW~~~ 240 (423)
T KOG4219|consen 184 SRVVCVTAWPEHVCPTENESLLMQGYNYVLLFLQYFLPLIVLGLAYTVIAVTLWGRR 240 (423)
T ss_pred ceEEEEEecccccCCcchhhhhhcceeeeehhHHHHHHHHHHHHHHHHHHHHHHhcc
Confidence 222231 1111 2277778888999999999999999999999876
No 3
>PHA03234 DNA packaging protein UL33; Provisional
Probab=99.95 E-value=2.5e-27 Score=207.22 Aligned_cols=194 Identities=12% Similarity=0.070 Sum_probs=142.9
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHhh--cccccccchhHHHHHHHHHhHHHHhhhhhHHHHHHhccccccccccc
Q psy16862 37 HIFPFIIKGMLMCFIIITAILGNLLVIISVI--KHRKLRIITNYFVVSLAFADLLVALCVMPFNAIVSLTDEWYFGYFMC 114 (303)
Q Consensus 37 ~~~~~~~~~~~~~~i~i~gl~~N~~vl~~i~--~~~~~~~~~~~ll~~Lai~Dl~~~l~~~~~~~~~~~~~~~~~~~~~C 114 (303)
......+..+++.+++++|++||+++++++. +++++|+++|+|++|||++|++.++ .+|+.+... ...|.+|+..|
T Consensus 28 ~~~~~~~~~~~y~~vf~~gl~gN~lvl~v~~~~~~~~~rt~tn~fi~NLAvaDLL~~l-~lp~~~~~~-~~~w~fG~~lC 105 (338)
T PHA03234 28 LKKAQILESAINGIMLTLIIPMIIIVICTLIIYHKVAKHNATSFYLITLFASDFLHML-CVFFLTLNR-EALFNFNQAFC 105 (338)
T ss_pred HHHHHHHhhHHHHHHHHHHhhhHHHHHHHHHHHhccccccHHHHHHHHHHHHHHHHHH-HHHHHHHHH-hCCccCchhHH
Confidence 4456888899999999999999999999554 5566689999999999999999976 566655443 45699999999
Q ss_pred hhchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccchhhhHHHHHHHHhhhhhhhhccccccccccCC-CcccccC
Q psy16862 115 DVWNSFDVYFSTASILHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLSNVWILPGIISFTPIMLGWYTTPD-HKQYRKH 193 (303)
Q Consensus 115 ~~~~~~~~~~~~~s~~~l~~iaidRy~aI~~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 193 (303)
++..++.....++|++++++||+|||+||++|.+++. +.++....+++.|+++++.++|+++.......+ ....|..
T Consensus 106 k~~~~~~~~~~~~Si~~L~~ISiDRY~aIv~p~~~~~--~~~~~~~~i~~~Wi~s~l~~~P~l~~~~~~~~~~~~~~C~~ 183 (338)
T PHA03234 106 QCVLFIYHASCSYSICMLAIIATIRYKTLHRRKKNDK--KNNHIGRNIGILFLASAMCAIPAALFVKTEGKKGNYGKCNI 183 (338)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhHeeeechhhhhh--hhhhHHHHHHHHHHHHHHHHhhHhHeeeeeecCCCCCcCcc
Confidence 9999999999999999999999999999999987542 334444456667999999998886443322111 1122322
Q ss_pred CCceEEEEehhhHHH---HHhhhhhhhhhhHHHHHHHHHHHHHHHH
Q psy16862 194 HPNVCIFVVNVYYAL---ISSSISFWIPGFVMKKKKKRKKKKKKKK 236 (303)
Q Consensus 194 ~~~~c~~~~~~~~~~---~~~~~~~~ip~~i~~~~y~~I~~~l~~~ 236 (303)
.... ......+.. ....+.+++|+++|+++|..|.+.++++
T Consensus 184 ~~~~--~~~~~~~~~~~~~~~~~~f~iPl~im~~cY~~I~~~L~~~ 227 (338)
T PHA03234 184 HISS--KKAYDLFIAIKIVFCFIWGIFPTMIFSFFYVIFCKALHAL 227 (338)
T ss_pred cCCc--chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 2110 011111211 1223357899999999999999999764
No 4
>PHA03235 DNA packaging protein UL33; Provisional
Probab=99.95 E-value=2.1e-26 Score=205.73 Aligned_cols=199 Identities=15% Similarity=0.114 Sum_probs=145.5
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHhhc-ccc-cccchhHHHHHHHHHhHHHHhhhhhHHHHHHhccccccccccch
Q psy16862 38 IFPFIIKGMLMCFIIITAILGNLLVIISVIK-HRK-LRIITNYFVVSLAFADLLVALCVMPFNAIVSLTDEWYFGYFMCD 115 (303)
Q Consensus 38 ~~~~~~~~~~~~~i~i~gl~~N~~vl~~i~~-~~~-~~~~~~~ll~~Lai~Dl~~~l~~~~~~~~~~~~~~~~~~~~~C~ 115 (303)
...+.+..+++.+++++|++||+++++++.. ++. .++..++|++|||++|++. ++.+|+.+... ...|..+...|+
T Consensus 29 ~~~~~~~~~~~~li~vvGiigN~lVL~~~~~~~r~~~~~~~~~~I~NLAvsDLl~-l~~lP~~i~~~-~~~~~~g~~~Ck 106 (409)
T PHA03235 29 SAARTTETFINLLIISVGGPLNLIVLVTQLLANRVHGFSTPTLYMTNLYLANLLT-VFVLPFIMLSN-QGLLSGSVAGCK 106 (409)
T ss_pred hhhHhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCCccHHHHHHHHHHHHHH-HHHHHHHHHhc-CccccCCCCeeh
Confidence 4457889999999999999999999987553 332 2356679999999999987 55788765432 223344578999
Q ss_pred hchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccchhhhHHHHHHHHhhhhhhhhccccccccccCCCcccccCCC
Q psy16862 116 VWNSFDVYFSTASILHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLSNVWILPGIISFTPIMLGWYTTPDHKQYRKHHP 195 (303)
Q Consensus 116 ~~~~~~~~~~~~s~~~l~~iaidRy~aI~~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 195 (303)
+..++...+..+|++++++||+|||++|++|.++.. .+++++..+++++|+++++.++|+.++.+....+.........
T Consensus 107 ~~~~l~~~~~~~Si~tL~~ISiDRY~aI~~p~~~~~-~~~~~a~~ii~~iWi~sll~s~P~~~~~~~~~~~~~~~~~~~~ 185 (409)
T PHA03235 107 FASLLYYASCTVGFATVALIAADRYRVIHQRTRARS-SAYRSTYKILGLTWFASLICSGPAPVYTTVVAHDDVDPEAPGY 185 (409)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHeechhhccC-cccchhhhhHHHHHHHHHHHHHHHHHHHhhhhccccCcCCCCc
Confidence 999999999999999999999999999999986544 4678888899999999999998875443211111100001112
Q ss_pred ceEEEEeh--------hhHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHH
Q psy16862 196 NVCIFVVN--------VYYALISSSISFWIPGFVMKKKKKRKKKKKKKKKKK 239 (303)
Q Consensus 196 ~~c~~~~~--------~~~~~~~~~~~~~ip~~i~~~~y~~I~~~l~~~~~~ 239 (303)
..|..... ..|.+...++.+++|+++|+++|.+|++.++++.++
T Consensus 186 ~~C~~~~~~~~~~~~~~~y~i~l~i~~f~iPl~im~~~Y~~I~~~l~~~~~~ 237 (409)
T PHA03235 186 ETCVIYFRADQVKTVLSTFKVLLTLVWGIAPVVMMTWFYTFFYRTLKRASYK 237 (409)
T ss_pred ceeeEeCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh
Confidence 34532211 124445556778999999999999999999887653
No 5
>PHA02834 chemokine receptor-like protein; Provisional
Probab=99.95 E-value=6.9e-27 Score=204.11 Aligned_cols=192 Identities=19% Similarity=0.319 Sum_probs=143.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhHHHHHHHHHhHHHHhhhhhHHHHHHhccccccccccchhchHH
Q psy16862 41 FIIKGMLMCFIIITAILGNLLVIISVIKHRKLRIITNYFVVSLAFADLLVALCVMPFNAIVSLTDEWYFGYFMCDVWNSF 120 (303)
Q Consensus 41 ~~~~~~~~~~i~i~gl~~N~~vl~~i~~~~~~~~~~~~ll~~Lai~Dl~~~l~~~~~~~~~~~~~~~~~~~~~C~~~~~~ 120 (303)
..+..+++.+++++|++||+++++++.++++ +++.+++++|||++|++. ++.+|+.+.... ..|.+|+..|++..++
T Consensus 28 ~~~~~~~~~li~v~~~~gN~lVi~vi~~~~~-~~~~n~~i~nLAiaDll~-~~~lP~~i~~~~-~~w~~g~~~C~~~~~~ 104 (323)
T PHA02834 28 NYFVIVFYILLFIFGLIGNVLVIAVLIVKRF-MFVVDVYLFNIAMSDLML-VFSFPFIIHNDL-NEWIFGEFMCKLVLGV 104 (323)
T ss_pred hhhHHHHHHHHHHHHHhhHHHHHHHHHhccc-cchhhhhhHHHHHHHHHH-HHHHHHHHHHHc-CCcCCcchHHHhHHHH
Confidence 4567899999999999999999998887655 457899999999999986 557887765544 4699999999999999
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccchhhhHHHHHHHHhhhhhhhhccccccccccCCCcccccCCCceEEE
Q psy16862 121 DVYFSTASILHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLSNVWILPGIISFTPIMLGWYTTPDHKQYRKHHPNVCIF 200 (303)
Q Consensus 121 ~~~~~~~s~~~l~~iaidRy~aI~~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c~~ 200 (303)
......+|++++++||+|||++|++|++++. .+.+++...++++|+++++.++|+.+.......++...|.... .+..
T Consensus 105 ~~~~~~~Si~tL~~IsidRY~aI~~P~~~~~-~~~~~~~~~i~~iWi~s~l~~~P~~~~~~~~~~~~~~~C~~~~-~~~~ 182 (323)
T PHA02834 105 YFVGFFSNMFFVTLISIDRYILVVNATKIKN-KSISLSVLLSVAAWVCSVILSMPAMVLYYVDNTDNLKQCIFND-YHEN 182 (323)
T ss_pred HHHHHHHHHHHHHHHHHHHhhheeCchhccC-CccchHHHHHHHHHHHHHHHHhhHHHHHHhccCCCceEEeccC-Cccc
Confidence 8889999999999999999999999998654 3456777788899999999998875432111111111222111 1100
Q ss_pred -EehhhHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHH
Q psy16862 201 -VVNVYYALISSSISFWIPGFVMKKKKKRKKKKKKKKK 237 (303)
Q Consensus 201 -~~~~~~~~~~~~~~~~ip~~i~~~~y~~I~~~l~~~~ 237 (303)
.....+.....++.+++|+++++++|.+|++.+++++
T Consensus 183 ~~~~~~~~~~~~i~~f~iPl~ii~~~Y~~I~~~l~~~~ 220 (323)
T PHA02834 183 FSWSAFFNFEINIFGIVIPLIILIYCYSKILYTLKNCK 220 (323)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc
Confidence 0111122334467899999999999999999998764
No 6
>PHA02638 CC chemokine receptor-like protein; Provisional
Probab=99.95 E-value=3.6e-26 Score=205.10 Aligned_cols=195 Identities=21% Similarity=0.331 Sum_probs=150.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhHHHHHHHHHhHHHHhhhhhHHHHHHhccccccccccchhchH
Q psy16862 40 PFIIKGMLMCFIIITAILGNLLVIISVIKHRKLRIITNYFVVSLAFADLLVALCVMPFNAIVSLTDEWYFGYFMCDVWNS 119 (303)
Q Consensus 40 ~~~~~~~~~~~i~i~gl~~N~~vl~~i~~~~~~~~~~~~ll~~Lai~Dl~~~l~~~~~~~~~~~~~~~~~~~~~C~~~~~ 119 (303)
......+++.+++++|++||+++++++.+ +++|++++++++|||++|++.+ +.+|+.+... .+.|.+|+..|++..+
T Consensus 97 ~~~~l~~~y~lvfvlgliGN~LVl~il~~-k~lrt~t~i~llnLAisDLl~~-l~lPf~i~~~-~~~W~fg~~~Ck~~~~ 173 (417)
T PHA02638 97 ISEYIKIFYIIIFILGLFGNAAIIMILFC-KKIKTITDIYIFNLAISDLIFV-IDFPFIIYNE-FDQWIFGDFMCKVISA 173 (417)
T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHh-ccCCCHhHHHHHHHHHHHHHHH-HHHHHHHHHH-hccccccccchhhHHH
Confidence 45677888999999999999999987764 7788999999999999999885 4788877655 3579999999999999
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccchhhhHHHHHHHHhhhhhhhhccccccccccCCCc--ccc-cCCCc
Q psy16862 120 FDVYFSTASILHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLSNVWILPGIISFTPIMLGWYTTPDHK--QYR-KHHPN 196 (303)
Q Consensus 120 ~~~~~~~~s~~~l~~iaidRy~aI~~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~~~~~--~~~-~~~~~ 196 (303)
+.....+++++.++++++|||+||++|.+++...+++.....++++|+++++.++|+++.......... ... .....
T Consensus 174 l~~~~~~~Si~~L~~isiDRYlaIv~p~~~~~~~~~~~~~i~~~~iW~~s~l~slP~~~~~~~~~~~~~~~~~~~~~~~~ 253 (417)
T PHA02638 174 SYYIGFFSNMFLITLMSIDRYFAILYPISFQKYRTFNIGIILCIISWILSLIITSPAYFIFEASNIIFSAQDSNETISNY 253 (417)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccccccceecHhhhHhhHhHHHHHHHHHHHHHHHHhhccccccccccCCCCccCC
Confidence 999999999999999999999999999998777777888888999999999999998654321100000 000 00112
Q ss_pred eEEEEe----------hhhHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHH
Q psy16862 197 VCIFVV----------NVYYALISSSISFWIPGFVMKKKKKRKKKKKKKKK 237 (303)
Q Consensus 197 ~c~~~~----------~~~~~~~~~~~~~~ip~~i~~~~y~~I~~~l~~~~ 237 (303)
.|.... ...+.+...++.+++|++++++||.+|++.+++.+
T Consensus 254 ~C~~~~~~~~~~~~~~~~~~~~~~~i~~f~lPl~vmi~cY~~I~~~L~~~~ 304 (417)
T PHA02638 254 QCTLIEDNEKNNISFLGRILQFEINILGMFIPIIIFAFCYIKIILKLKQLK 304 (417)
T ss_pred eeeeeccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 443211 01223334566789999999999999999997754
No 7
>PHA03087 G protein-coupled chemokine receptor-like protein; Provisional
Probab=99.95 E-value=2e-26 Score=202.68 Aligned_cols=198 Identities=20% Similarity=0.263 Sum_probs=155.3
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhHHHHHHHHHhHHHHhhhhhHHHHHHhccccccccccchhch
Q psy16862 39 FPFIIKGMLMCFIIITAILGNLLVIISVIKHRKLRIITNYFVVSLAFADLLVALCVMPFNAIVSLTDEWYFGYFMCDVWN 118 (303)
Q Consensus 39 ~~~~~~~~~~~~i~i~gl~~N~~vl~~i~~~~~~~~~~~~ll~~Lai~Dl~~~l~~~~~~~~~~~~~~~~~~~~~C~~~~ 118 (303)
....+..+++.+++++|++||+++++++.++ ++|++.++++.+||++|++.++ ..|..+.......|..++..|++..
T Consensus 38 ~~~~~~~~~~~~i~~~gl~gN~lvl~~~~~~-~~~~~~~~ll~~laisDll~~~-~~~~~~~~~~~~~~~~~~~~C~~~~ 115 (335)
T PHA03087 38 TNSTILIVVYSTIFFFGLVGNIIVIYVLTKT-KIKTPMDIYLLNLAVSDLLFVM-TLPFQIYYYILFQWSFGEFACKIVS 115 (335)
T ss_pred chhhHHHHHHHHHHHHHHHhhHhEEeeehhc-cccCchHHHHHHHHHHHHHHHH-hHHHHHHHHhCCCCCCCcHHHHHHH
Confidence 3467788899999999999999999998887 8899999999999999998866 5666665566677888999999999
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccchhhhHHHHHHHHhhhhhhhhccccccccccCCCcccccCCCceE
Q psy16862 119 SFDVYFSTASILHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLSNVWILPGIISFTPIMLGWYTTPDHKQYRKHHPNVC 198 (303)
Q Consensus 119 ~~~~~~~~~s~~~l~~iaidRy~aI~~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c 198 (303)
++......+|++++++||+|||++|++|++|+...+.+++..+++++|+++++.++|+........+.....|.......
T Consensus 116 ~~~~~~~~~S~~~l~~iaidRy~aI~~p~~~~~~~~~~~~~~~~~~iWl~~~~~~~p~~~~~~~~~~~~~~~C~~~~~~~ 195 (335)
T PHA03087 116 GLYYIGFYNSMNFITVMSVDRYIAIVHPVKSNKINTVKYGYIVSLVIWIISIIETTPILFVYTTKKDHETLICCMFYNNK 195 (335)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccCccccccchhhhHHHHHHHHHHHHHHhccHhheeeeeccCCCceEEecCCCc
Confidence 99999999999999999999999999999998888999999999999999999988875544332222222222111100
Q ss_pred EEEehhhHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHH
Q psy16862 199 IFVVNVYYALISSSISFWIPGFVMKKKKKRKKKKKKKKKK 238 (303)
Q Consensus 199 ~~~~~~~~~~~~~~~~~~ip~~i~~~~y~~I~~~l~~~~~ 238 (303)
...+...+.+...++.+++|+++++++|.+|++.++++++
T Consensus 196 ~~~~~~~~~~~~~~~~~~lP~~ii~~~y~~i~~~l~~~~~ 235 (335)
T PHA03087 196 TMNWKLFINFEINIIGMLIPLTILLYCYSKILITLKGINK 235 (335)
T ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc
Confidence 0111122223345667899999999999999998887653
No 8
>PF00001 7tm_1: 7 transmembrane receptor (rhodopsin family) Rhodopsin-like GPCR superfamily signature 5-hydroxytryptamine 7 receptor signature bradykinin receptor signature gastrin receptor signature melatonin receptor signature olfactory receptor signature; InterPro: IPR000276 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The rhodopsin-like GPCRs themselves represent a widespread protein family that includes hormone, neurotransmitter and light receptors, all of which transduce extracellular signals through interaction with guanine nucleotide-binding (G) proteins. Although their activating ligands vary widely in structure and character, the amino acid sequences of the receptors are very similar and are believed to adopt a common structural framework comprising 7 transmembrane (TM) helices [, , ].; GO: 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane; PDB: 2KI9_A 3QAK_A 2YDV_A 3VGA_A 3PWH_A 3RFM_A 3EML_A 3VG9_A 3REY_A 3UZA_A ....
Probab=99.93 E-value=1.4e-25 Score=188.71 Aligned_cols=180 Identities=29% Similarity=0.572 Sum_probs=153.4
Q ss_pred HHHHHHHHhhcccccccchhHHHHHHHHHhHHHHhhhhhHHHHHHhccccccccccchhchHHHHHHHHHHHHHHHHHHH
Q psy16862 58 GNLLVIISVIKHRKLRIITNYFVVSLAFADLLVALCVMPFNAIVSLTDEWYFGYFMCDVWNSFDVYFSTASILHLCCISV 137 (303)
Q Consensus 58 ~N~~vl~~i~~~~~~~~~~~~ll~~Lai~Dl~~~l~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~s~~~l~~iai 137 (303)
||+++++++.++++++++.++++++||++|++.++...|..+.....+.|..++..|++..++..++..++.++++++++
T Consensus 1 GN~lvi~~~~~~~~~~~~~~~~l~~Lav~Dll~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~s~~~~~~is~ 80 (257)
T PF00001_consen 1 GNILVILVILRSKRLRTPSNILLLNLAVADLLVGLFCIPFYIYSLLFDDWIFSSFLCRIFGFLFYFSSFSSIFSLVAISI 80 (257)
T ss_dssp HHHHHHHHHHHSGGG-SHHHHHHHHHHHHHHHHHHTHHHHHHHHHHHSSCTSHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred CchhehhhhhhhccCCChhHHHHHHHHHHHHhhccccccccccccccccccccccccccccccccccccccccccccccc
Confidence 89999999999999999999999999999999999888887777776788999999999999999999999999999999
Q ss_pred HHHHHHhcCCCCCcccchhhhHHHHHHHHhhhhhhhhccccccccccCCCcccccCCCceEEEEeh----hhHHHHHhhh
Q psy16862 138 DRYYAIVKPLQYPIIMNQNTVLLMLSNVWILPGIISFTPIMLGWYTTPDHKQYRKHHPNVCIFVVN----VYYALISSSI 213 (303)
Q Consensus 138 dRy~aI~~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c~~~~~----~~~~~~~~~~ 213 (303)
|||++|++|++|+...+++++...++++|+++++.++|+.+.......... ....|..... ..+.....++
T Consensus 81 dRy~~i~~p~~~~~~~~~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~~~~~-----~~~~C~~~~~~~~~~~~~~~~~~~ 155 (257)
T PF00001_consen 81 DRYLAICHPLRYRRIRTRRRARIIIILIWIISFLISLPPLFFSWVYFVSDG-----SQSFCFIDFSSSSSQIYFIYFFIV 155 (257)
T ss_dssp HHHHHHHSHHTHHHHSCHHHHHHHHHHHHHHHHHHHHHHHHTCEEEEESTC-----CCEEEEESCSSSHHHHHHHHHHHH
T ss_pred ccccccccccccccccccccccccccccccccccccccccccccccccccc-----cccccccccccccccccccccccc
Confidence 999999999999999999999999999999999999998654332222211 0445654332 3577778888
Q ss_pred hhhhhhhHHHHHHHHHHHHHHHHHHHhhh
Q psy16862 214 SFWIPGFVMKKKKKRKKKKKKKKKKKRKK 242 (303)
Q Consensus 214 ~~~ip~~i~~~~y~~I~~~l~~~~~~~~~ 242 (303)
.+++|+++++++|.+|++.+++++++...
T Consensus 156 ~~~~p~~~~~~~~~~i~~~~~~~~~~~~~ 184 (257)
T PF00001_consen 156 FFILPLIIILICYIRILRKLRRQRKRIKS 184 (257)
T ss_dssp HTHHHHHHHHHHHHHHHHHHHHHHHCTCC
T ss_pred ccccceeeeeeeccccccccccccccccc
Confidence 99999999999999999999998876554
No 9
>KOG2087|consensus
Probab=99.74 E-value=6e-18 Score=142.71 Aligned_cols=202 Identities=18% Similarity=0.242 Sum_probs=152.8
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhHHHHHHHHHhHHHHhhhhhHHHHHHhc-------ccccc
Q psy16862 37 HIFPFIIKGMLMCFIIITAILGNLLVIISVIKHRKLRIITNYFVVSLAFADLLVALCVMPFNAIVSLT-------DEWYF 109 (303)
Q Consensus 37 ~~~~~~~~~~~~~~i~i~gl~~N~~vl~~i~~~~~~~~~~~~ll~~Lai~Dl~~~l~~~~~~~~~~~~-------~~~~~ 109 (303)
+.....+.-+...++.+++++||++|+......+...++..+++.|||++|+++++.......+.... ..|..
T Consensus 20 dllg~~~lRi~vW~i~~lAi~gN~~Vl~~~~~~~~~~~~~~~li~~la~ad~~mGiYl~~ia~vD~~~~gey~~~ai~W~ 99 (363)
T KOG2087|consen 20 DLLGYWILRISVWVIALLAIVGNLLVLLTRFTSRYELNSHRFLICNLAFADLLMGIYLGLIASVDAKTRGEYYKHAIDWQ 99 (363)
T ss_pred HhhccceeeehhhhhhhHHhccCeeeeeeeeehhhhccchHHHHHHHHHHHHHcchHHHHHHHhhHHHHHHHHHHHHhhh
Confidence 33446666677788899999999999998888877778999999999999999988665554443322 23455
Q ss_pred ccccchhchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccchhhhHHHHHHHHhhhhhhhhccccccccccCCCcc
Q psy16862 110 GYFMCDVWNSFDVYFSTASILHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLSNVWILPGIISFTPIMLGWYTTPDHKQ 189 (303)
Q Consensus 110 ~~~~C~~~~~~~~~~~~~s~~~l~~iaidRy~aI~~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~~~~~~ 189 (303)
+...|.+.+|+..++.-.|++.++.+++||++.|.+|++-.+....+....+++..|+.+++.++.|.+ +...+.....
T Consensus 100 tg~gC~~aGflavFASElSv~~LT~itlEr~l~i~~p~~~~~~~~lr~~~~ill~~wl~~~l~A~~Pl~-g~s~Y~~~~v 178 (363)
T KOG2087|consen 100 TGLGCPVAGFLAVFASELSVFLLTLITLERWLSITYPFRLDRKAKLRPLVLILLLGWLFAFLMALLPLF-GISSYGASSV 178 (363)
T ss_pred hcCCCchHHHHHHHHHHHHHHHHHHHHHHHHhheeccccCCCcccccHHHHHHHHHHHHHHHHHhcccc-CCCCCcccce
Confidence 678999999999999999999999999999999999999777766777899999999999999999954 3211111111
Q ss_pred cccCCCceEEEEehhhHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHhhh
Q psy16862 190 YRKHHPNVCIFVVNVYYALISSSISFWIPGFVMKKKKKRKKKKKKKKKKKRKK 242 (303)
Q Consensus 190 ~~~~~~~~c~~~~~~~~~~~~~~~~~~ip~~i~~~~y~~I~~~l~~~~~~~~~ 242 (303)
+-..+.+.+ .+.-|.....+....+.++++.++|.+++..+++.......
T Consensus 179 ClPL~~~~~---~s~g~y~~~~l~~N~lafiiia~~Y~~iy~~l~~~~~~~~~ 228 (363)
T KOG2087|consen 179 CLPLHIEEP---LSTGYYLVALLGLNLLAFIIIAFSYGKIYCSLRKGDLSATL 228 (363)
T ss_pred eeecccCCc---cchhHHHHHHHHHHHHHHHHHHHHhhhhheeeecCCCcccc
Confidence 111111111 11215556667778899999999999999999886554443
No 10
>PF10320 7TM_GPCR_Srsx: Serpentine type 7TM GPCR chemoreceptor Srsx; InterPro: IPR019424 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class sx (Srsx), which is a solo family amongst the superfamilies of chemoreceptors. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' [].
Probab=99.63 E-value=7e-16 Score=130.18 Aligned_cols=119 Identities=26% Similarity=0.426 Sum_probs=100.2
Q ss_pred HHHHHHHHHHHHHhhcccccccchhHHHHHHHHHhHHHHhhhhhHHHHHHhccccccccccchhchHHHHHHHHHHHHHH
Q psy16862 53 ITAILGNLLVIISVIKHRKLRIITNYFVVSLAFADLLVALCVMPFNAIVSLTDEWYFGYFMCDVWNSFDVYFSTASILHL 132 (303)
Q Consensus 53 i~gl~~N~~vl~~i~~~~~~~~~~~~ll~~Lai~Dl~~~l~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~s~~~l 132 (303)
++|++||+.++..+.++|++|++.+++++.+|++|++.++..+|...... +........|....+...++.......+
T Consensus 2 ~ig~~gN~~~i~~~~~~~~Lrs~~~~li~~~~~~d~~~~~~~~~~~~~~~--~~~~i~~~~Cf~~~~~~~f~~~~qs~~~ 79 (257)
T PF10320_consen 2 IIGLFGNLLLIILIFRNKSLRSPCYILICILCFADLICLLGTLPFMLFLF--RDHQITRSECFWQIFFYIFFQCAQSVIM 79 (257)
T ss_pred EEEEEccHHHHHHHHhccccccchHHHHHHHHHHHHHHHhhHHHHHHHHH--hheeccHHHHHHHHHHHHHHHHHHHHHH
Confidence 57899999999999999999999999999999999999888887766333 2334567799999999999999999999
Q ss_pred HHHHHHHHHHHhcCCCCCcccchhhhHHHHHHHHhhhhhhh
Q psy16862 133 CCISVDRYYAIVKPLQYPIIMNQNTVLLMLSNVWILPGIIS 173 (303)
Q Consensus 133 ~~iaidRy~aI~~P~~~~~~~t~~~~~~~i~~~w~~~~~~~ 173 (303)
++|++||++||+.|.+|+...+.+..........+.+....
T Consensus 80 l~i~iDr~iaV~~P~~Y~~~~~~~y~~~~~~~~~~~s~~~~ 120 (257)
T PF10320_consen 80 LAIAIDRLIAVCFPLRYRTISTRKYLIILLIFPVIYSIFFT 120 (257)
T ss_pred HHHHhhheeeEeehhhhhhcccccchhhHhHHHHHHHHHHH
Confidence 99999999999999999999888865555555555555543
No 11
>PF10328 7TM_GPCR_Srx: Serpentine type 7TM GPCR chemoreceptor Srx; InterPro: IPR019430 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class x (Srx) from the Srg superfamily [, ]. Srg receptors contain seven hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures.
Probab=99.63 E-value=8.8e-15 Score=124.99 Aligned_cols=186 Identities=20% Similarity=0.277 Sum_probs=133.8
Q ss_pred HHHHHHHHHHHHHHHHhhcccccccchhHHHHHHHHHhHHHHhhhhhHHHHHHhccccccccccc-hhchHHHHHHHHHH
Q psy16862 50 FIIITAILGNLLVIISVIKHRKLRIITNYFVVSLAFADLLVALCVMPFNAIVSLTDEWYFGYFMC-DVWNSFDVYFSTAS 128 (303)
Q Consensus 50 ~i~i~gl~~N~~vl~~i~~~~~~~~~~~~ll~~Lai~Dl~~~l~~~~~~~~~~~~~~~~~~~~~C-~~~~~~~~~~~~~s 128 (303)
+++++|++.|.++++.+.+.+.++++.+++..+.|++|.+.++..+.........+.....+... .+.+.....+...+
T Consensus 2 ~~s~~G~~~N~~v~~~~~~~~~~~~sF~~l~~~~a~~n~i~~~~~l~~~~P~~~~~~~~~~~~~~s~~~g~~~~~~y~~~ 81 (274)
T PF10328_consen 2 LISIIGIILNWLVFIIIFKLKSLRNSFGILCASQAIANIIICLIFLFWVVPMTLLDISFLPSSLNSIIFGFIGMFCYFIG 81 (274)
T ss_pred eeeHHHHHHHHHHHHHHHhcccccCCHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccCchHHHHHHHHHHHHHHHHHH
Confidence 35789999999999999999999999999999999999999886666555555555333333333 34478888888999
Q ss_pred HHHHHHHHHHHHHHHhcCCCCCcccchhhhHHHHHHHHhhhhhhhhccccc-c--ccccCCCcccccCCCceEEEEehhh
Q psy16862 129 ILHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLSNVWILPGIISFTPIML-G--WYTTPDHKQYRKHHPNVCIFVVNVY 205 (303)
Q Consensus 129 ~~~l~~iaidRy~aI~~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~~-~--~~~~~~~~~~~~~~~~~c~~~~~~~ 205 (303)
.++.++||++|+.||..|.+|+++.+.+.+..+++++|++++....+..+. + .....+.-.+....++.|...
T Consensus 82 ~~~~~liaiNRf~ai~fP~~y~~~fs~~~T~~~i~~~~~~~~~~~~~~~~~~~C~~~y~~~~~~~~~~~~~~C~~~---- 157 (274)
T PF10328_consen 82 PLSHLLIAINRFCAIFFPFKYKKIFSFKNTIILIAFIWLLSIIISTILYFPDGCYFYYDPETWSWSYPTDPPCGNY---- 157 (274)
T ss_pred HHHHHHHHHHHHheeeeHHHHHhHcCccceehhhhHHHHHHHHHHHHhhhcCCCcceeccceeeeecCCCCccchh----
Confidence 999999999999999999999999999999999999999999666555432 1 122222222222233355432
Q ss_pred HHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHh
Q psy16862 206 YALISSSISFWIPGFVMKKKKKRKKKKKKKKKKKR 240 (303)
Q Consensus 206 ~~~~~~~~~~~ip~~i~~~~y~~I~~~l~~~~~~~ 240 (303)
..........+-.+++.+++..++.++++.+++.
T Consensus 158 -~~~~~~~~~~~~~~~~~~lni~t~ikl~~~~~~~ 191 (274)
T PF10328_consen 158 -SWYFDFYKNFILVIISNILNIITFIKLRKFRKKI 191 (274)
T ss_pred -hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc
Confidence 1211122233444667778888888888777655
No 12
>PF10324 7TM_GPCR_Srw: Serpentine type 7TM GPCR chemoreceptor Srw; InterPro: IPR019427 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class w (Srw), which is a solo family amongst the superfamilies of chemoreceptors. The genes encoding Srw do not appear to be under as strong an adaptive evolutionary pressure as those of Srz [].
Probab=99.47 E-value=5.5e-13 Score=116.48 Aligned_cols=195 Identities=16% Similarity=0.196 Sum_probs=134.1
Q ss_pred HHHHHHHHHHHHHHHHhhccccccc-chhHHHHHHHHHhHHHHhhhhhHHHHHHhccc-----ccccc----ccchhchH
Q psy16862 50 FIIITAILGNLLVIISVIKHRKLRI-ITNYFVVSLAFADLLVALCVMPFNAIVSLTDE-----WYFGY----FMCDVWNS 119 (303)
Q Consensus 50 ~i~i~gl~~N~~vl~~i~~~~~~~~-~~~~ll~~Lai~Dl~~~l~~~~~~~~~~~~~~-----~~~~~----~~C~~~~~ 119 (303)
+++++|+++|++.++++.| |.+|+ +.|.++..+|++|++..+..++..+....... ..... ....+...
T Consensus 5 ~~~~~g~~~N~~h~~VLtr-k~mR~~~in~~l~~Iai~Dl~~~~~~~~~~~~~~~~~~~~~~C~~~~sy~~~~~~~~~~~ 83 (318)
T PF10324_consen 5 ILSIFGLFINIFHLIVLTR-KSMRSSSINILLIGIAICDLLYMLSILIWELFFFIIFYEDDECIPPYSYFWAYFKLIMES 83 (318)
T ss_pred EEeHHHHHHHHHHhhhcCC-hhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccHHHHHHHHHHHH
Confidence 3578999999999998876 44565 88999999999999998887777665444321 01111 12233456
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCCC--CcccchhhhHHHHHHHHhhhhhhhhccccccccccCC----CcccccC
Q psy16862 120 FDVYFSTASILHLCCISVDRYYAIVKPLQY--PIIMNQNTVLLMLSNVWILPGIISFTPIMLGWYTTPD----HKQYRKH 193 (303)
Q Consensus 120 ~~~~~~~~s~~~l~~iaidRy~aI~~P~~~--~~~~t~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~~~----~~~~~~~ 193 (303)
+...+..+|.|..++||+.|+++|.+|+.. +.+.+++.+...+.++++++.+.+.+..+.......+ ....|..
T Consensus 84 l~~~~~~~S~WL~V~mA~iR~l~i~~p~~~~~~~l~~~k~~~~~i~~v~~~s~~~~~~~~~~~~i~~~~~~~~p~~~C~~ 163 (318)
T PF10324_consen 84 LSDIFRRISIWLGVLMALIRYLSIKFPMSSRFQKLSKPKFAIIVILIVFIISFLFSIPYFFRYKIVEVSDPWVPPPNCSG 163 (318)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCCCCeeeeehHHHHHHHHHHHHHhhceEEEEeccccccCCCceee
Confidence 777888999999999999999999999864 3466778888888888888888887775432222111 1122321
Q ss_pred CCc---eEEEE-----e----h----hhHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHhhhhhc
Q psy16862 194 HPN---VCIFV-----V----N----VYYALISSSISFWIPGFVMKKKKKRKKKKKKKKKKKRKKKKN 245 (303)
Q Consensus 194 ~~~---~c~~~-----~----~----~~~~~~~~~~~~~ip~~i~~~~y~~I~~~l~~~~~~~~~~~~ 245 (303)
.+. ...+. . + ..+.+...++.-++|.++..++.+.+++.+|+.++++++..+
T Consensus 164 ~~~~~~~~~Y~~~~~~~~~~~~~~~~~~~~~~~gi~~kiiP~il~~ilti~Li~~Lrk~~~~r~~~~~ 231 (318)
T PF10324_consen 164 FPENYTFPRYMLNISELFTENDCLFFRIYFFIDGIFFKIIPCILLPILTILLIIELRKAKKRRKKLSS 231 (318)
T ss_pred ccccccccccchhhhhhhhhhHHHHHHHHHHhhhhHhhhhhHHHHHHHHHHHHHHHHhccHhhhcccc
Confidence 110 00000 0 1 123333445557899999999999999999999988876653
No 13
>PF05296 TAS2R: Mammalian taste receptor protein (TAS2R); InterPro: IPR007960 This family consists of several forms of mammalian taste receptor proteins (TAS2Rs). TAS2Rs are G protein-coupled receptors expressed in subsets of taste receptor cells of the tongue and palate epithelia and are organised in the genome in clusters. The proteins are genetically linked to loci that influence bitter perception in mice and humans [].; GO: 0004930 G-protein coupled receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0050909 sensory perception of taste, 0016021 integral to membrane
Probab=99.31 E-value=3.1e-10 Score=97.95 Aligned_cols=204 Identities=12% Similarity=0.112 Sum_probs=136.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhcc---cccccchhHHHHHHHHHhHHHHhhhhhHHHHHHhccccccccccchhc
Q psy16862 41 FIIKGMLMCFIIITAILGNLLVIISVIKH---RKLRIITNYFVVSLAFADLLVALCVMPFNAIVSLTDEWYFGYFMCDVW 117 (303)
Q Consensus 41 ~~~~~~~~~~i~i~gl~~N~~vl~~i~~~---~~~~~~~~~ll~~Lai~Dl~~~l~~~~~~~~~~~~~~~~~~~~~C~~~ 117 (303)
..+..++..+..++|++||++++.+...+ ++.-.|.+.++.+||+++++.............+.......+..++..
T Consensus 6 ~~i~~~i~~~~~~~Gi~~N~FI~~vn~~~w~k~~~l~~~d~IL~~La~sr~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 85 (303)
T PF05296_consen 6 EIIFLIILVVEFIIGILGNGFIVLVNCSDWVKSRKLSPSDQILTSLAISRILLQWVILLNSFLSFFFPNIYFSENVYKII 85 (303)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHcCCCCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHcchhhhhhhHHHHH
Confidence 55668888899999999999999887753 333469999999999999998776555555555555555566788888
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCC----cccchhhhHHHHHHHHhhhh--hhhhccccccccccCCCcccc
Q psy16862 118 NSFDVYFSTASILHLCCISVDRYYAIVKPLQYP----IIMNQNTVLLMLSNVWILPG--IISFTPIMLGWYTTPDHKQYR 191 (303)
Q Consensus 118 ~~~~~~~~~~s~~~l~~iaidRy~aI~~P~~~~----~~~t~~~~~~~i~~~w~~~~--~~~~~~~~~~~~~~~~~~~~~ 191 (303)
.++..+....|.|..+++++--++-|+...+.- ++.=.+.+..++.+.+++++ ...+|.. ............+
T Consensus 86 ~~~~~f~~~~s~W~tt~LsvfYcvKI~~fs~~~Fl~LK~rI~~~v~~lLl~s~l~s~~~~~~~~~~-~~~~~~~~~~~~~ 164 (303)
T PF05296_consen 86 DFLWMFSNSSSLWFTTWLSVFYCVKIANFSHPFFLWLKRRISKVVPWLLLGSLLISFLNLLSIPLF-IDNHINNNNTNNS 164 (303)
T ss_pred HHHHHHHhHHHHHHHHHHHHHHheeeecCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh-eeeeeccccccCC
Confidence 899999999999999999999999888654311 11112224455666777777 3333321 1111111111111
Q ss_pred cCCCceEE-EEehhhHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHhhhhhc
Q psy16862 192 KHHPNVCI-FVVNVYYALISSSISFWIPGFVMKKKKKRKKKKKKKKKKKRKKKKN 245 (303)
Q Consensus 192 ~~~~~~c~-~~~~~~~~~~~~~~~~~ip~~i~~~~y~~I~~~l~~~~~~~~~~~~ 245 (303)
........ ......+......+..++|+++++++...++..++||.|+.++...
T Consensus 165 ~N~t~~~~~~~~~~~~~~~~~~~~~~lPf~i~l~s~~lli~SL~rH~r~M~~n~~ 219 (303)
T PF05296_consen 165 RNSTSNFQESKSSYFYFFILFNLGSFLPFLIFLVSSILLIFSLWRHMRRMQKNAT 219 (303)
T ss_pred CcceEEeecchHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHhhCCCC
Confidence 11111111 1122233333345788999999999999999999999999986654
No 14
>PF11710 Git3: G protein-coupled glucose receptor regulating Gpa2; InterPro: IPR023041 This entry contains a functionally uncharacterised region belonging to the Git3 G-protein coupled receptor. Git3 is one of six proteins required for glucose-triggered adenylate cyclase activation, and is a G protein-coupled receptor responsible for the activation of adenylate cyclase through Gpa2 - heterotrimeric G protein alpha subunit, part of the glucose-detection pathway. Git3 contains seven predicted transmembrane domains, a third cytoplasmic loop and a cytoplasmic tail []. This is the conserved N-terminal domain of the member proteins.
Probab=99.30 E-value=2.6e-10 Score=92.11 Aligned_cols=161 Identities=16% Similarity=0.080 Sum_probs=109.6
Q ss_pred cccccchhHHHHHHHHHhHHHHhhhhhHHHHHHhccccccccccchhchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC
Q psy16862 70 RKLRIITNYFVVSLAFADLLVALCVMPFNAIVSLTDEWYFGYFMCDVWNSFDVYFSTASILHLCCISVDRYYAIVKPLQY 149 (303)
Q Consensus 70 ~~~~~~~~~ll~~Lai~Dl~~~l~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~s~~~l~~iaidRy~aI~~P~~~ 149 (303)
++++...+.+++||.++|++.++..+...+.....+.-..++..|...+++.+++..++.+..++||++-|+.|.+|-..
T Consensus 30 ~r~~~fR~~LIl~L~~aD~~qal~~~i~~~~~l~~~~i~~~s~~C~aqGf~~q~g~~~sd~~ilaIAihT~l~v~~~~~~ 109 (201)
T PF11710_consen 30 YRRRSFRHQLILNLLLADFIQALAFLISPIRWLARGGIIAPSPFCQAQGFFLQVGDEASDLWILAIAIHTFLIVFRPNWK 109 (201)
T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeCCCCchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhcCccc
Confidence 34456667799999999999998777656666666555667889999999999999999999999999999999999221
Q ss_pred ----Cc-----ccchhhhHHHHHHHHhhhhhhhhccccccccccCCCcccccCCCceEEEEe-hhhHHHHHhhhhhhhhh
Q psy16862 150 ----PI-----IMNQNTVLLMLSNVWILPGIISFTPIMLGWYTTPDHKQYRKHHPNVCIFVV-NVYYALISSSISFWIPG 219 (303)
Q Consensus 150 ----~~-----~~t~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c~~~~-~~~~~~~~~~~~~~ip~ 219 (303)
+. ...+..+..+..++|++...++... .+.... ......|..+. ...+.....-+..++..
T Consensus 110 ~~~~~~~~~gl~~~~~~v~~~~~~~~~~~~~la~i~--~~~~~Y-------~~~g~WCWi~~~~~~~Rl~l~y~~~~~~~ 180 (201)
T PF11710_consen 110 RKRSKNVEGGLYPYRYWVWVIWILVPLLLASLAFIG--LGGPGY-------GPAGAWCWIPSRYEWYRLWLHYIWRFIII 180 (201)
T ss_pred cccccccccceEEeeeeeehHHHHHHHHHHHHHHhc--cccCcc-------cccCcEEEECCCcHHHHHHHHHHHHHHHH
Confidence 10 0112223333445555544443222 011111 12344564332 33456666666677778
Q ss_pred hHHHHHHHHHHHHHHHHHHH
Q psy16862 220 FVMKKKKKRKKKKKKKKKKK 239 (303)
Q Consensus 220 ~i~~~~y~~I~~~l~~~~~~ 239 (303)
+++++.|..|+..+|++.|+
T Consensus 181 ~~~i~iY~~if~~lrr~~~~ 200 (201)
T PF11710_consen 181 FAIIIIYIAIFFYLRRRIRR 200 (201)
T ss_pred HHHHHHHHHHHHHHhhhhcc
Confidence 89999999999999998764
No 15
>PF10323 7TM_GPCR_Srv: Serpentine type 7TM GPCR chemoreceptor Srv; InterPro: IPR019426 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class v (Srv) from the Srg superfamily [, ]. Srg receptors contain seven hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures.
Probab=99.21 E-value=8.9e-11 Score=100.47 Aligned_cols=181 Identities=14% Similarity=0.188 Sum_probs=121.3
Q ss_pred HHHHHHHHHHHHHHhhcccc----cccchhHHHHHHHHHhHHHHhhhhhHHHHHHhc---c-ccccccccchhchHHHHH
Q psy16862 52 IITAILGNLLVIISVIKHRK----LRIITNYFVVSLAFADLLVALCVMPFNAIVSLT---D-EWYFGYFMCDVWNSFDVY 123 (303)
Q Consensus 52 ~i~gl~~N~~vl~~i~~~~~----~~~~~~~ll~~Lai~Dl~~~l~~~~~~~~~~~~---~-~~~~~~~~C~~~~~~~~~ 123 (303)
+++.++..+.+++++.+.|+ ++++.+.++.+.+++|++..+............ + ....++...++.....+.
T Consensus 5 ~~~tlply~~il~~l~~~r~~~~~~~~~Fy~l~~~~~iaDi~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~y~ 84 (283)
T PF10323_consen 5 SIVTLPLYIFILYCLLKLRKRSKTFKSTFYTLLIQHCIADILSMLFYFLLQRFRAYGFFNDFYFSYQTYYAAIYYNIAYY 84 (283)
T ss_pred EeehHHHHHHHHHHHHHcccCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHhcchhHHHHHHHHHH
Confidence 45667777888877776544 458999999999999999877654433332221 1 112333445566667777
Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCC-CCCcccchhhhHHHHHHHHhhhhhhhhccccccccccCCCcccccCCCceEEEE-
Q psy16862 124 FSTASILHLCCISVDRYYAIVKPL-QYPIIMNQNTVLLMLSNVWILPGIISFTPIMLGWYTTPDHKQYRKHHPNVCIFV- 201 (303)
Q Consensus 124 ~~~~s~~~l~~iaidRy~aI~~P~-~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c~~~- 201 (303)
.........+++|++||.||++|. ++.+..++.+...++++-|+.+++.++|...-.....+... ......+
T Consensus 85 ~~~~~~~gi~lls~nR~~ai~~P~~~~~~~~~~~~~~~i~~i~wi~p~li~~~~~~~~~~~f~~~~------~~~~~~d~ 158 (283)
T PF10323_consen 85 FLYIQCIGIVLLSLNRYLAICFPTSRHTKFWQPAKIWIIILIQWIPPLLISLPFFFDTDFYFDNEE------NMSLFVDP 158 (283)
T ss_pred HHHHHHHhHHHHHHhhhheEEeecHHHhhhccccchhheeeeeehhhhhheeeeeccCceeeeccc------ceeeecCH
Confidence 778889999999999999999999 77888888888899999999999998875332221111111 1111111
Q ss_pred ehhhHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHH
Q psy16862 202 VNVYYALISSSISFWIPGFVMKKKKKRKKKKKKKKKK 238 (303)
Q Consensus 202 ~~~~~~~~~~~~~~~ip~~i~~~~y~~I~~~l~~~~~ 238 (303)
...........+...+.++..+++|..++..+|++.+
T Consensus 159 ~~~~~~~~~~~~~~~~~cv~~iv~Y~~i~~~iRk~~k 195 (283)
T PF10323_consen 159 EFIQRNFLIAFIFVSVTCVICIVCYGIIFIFIRKRNK 195 (283)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Confidence 1111222333444556677889999999999998774
No 16
>PF05462 Dicty_CAR: Slime mold cyclic AMP receptor
Probab=99.12 E-value=1e-08 Score=87.80 Aligned_cols=129 Identities=22% Similarity=0.322 Sum_probs=103.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhHHHHHHHHHhHHHHhhhhhHHHHHHhccccccccccchhchHHH
Q psy16862 42 IIKGMLMCFIIITAILGNLLVIISVIKHRKLRIITNYFVVSLAFADLLVALCVMPFNAIVSLTDEWYFGYFMCDVWNSFD 121 (303)
Q Consensus 42 ~~~~~~~~~i~i~gl~~N~~vl~~i~~~~~~~~~~~~ll~~Lai~Dl~~~l~~~~~~~~~~~~~~~~~~~~~C~~~~~~~ 121 (303)
-....+..+..+++++|-++++..+...+++|++.+.++..++++|++..+..+... ..+....++..|.+++++.
T Consensus 7 ~~~~~i~~~~s~lSllGclfiI~tf~~~k~~r~~~~rli~yl~~~~ll~~v~~~~~~----~~~~~~~~s~lC~~Qafli 82 (303)
T PF05462_consen 7 RTLYAIELVASVLSLLGCLFIIITFCLFKRLRKPINRLIFYLSIANLLTNVASMIMT----LSPSAGENSFLCQFQAFLI 82 (303)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCccHHHHHHHHHHHHHHHHHHHHHHH----hcccCCCCCcchhhHhHHH
Confidence 345566677789999999999999999999999999999999999999876543221 1122344577999999999
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccchhhhHHHHHHHHhhhhhhhhcccc
Q psy16862 122 VYFSTASILHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLSNVWILPGIISFTPIM 178 (303)
Q Consensus 122 ~~~~~~s~~~l~~iaidRy~aI~~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~ 178 (303)
+++..++.+..++||++=|+.++++.. -..+.-....+++|.++++.++.++.
T Consensus 83 q~f~~as~lWt~~iA~nly~~~~~~~~----~~~k~e~~Y~l~~wGiPli~a~i~l~ 135 (303)
T PF05462_consen 83 QFFMLASFLWTLCIAFNLYLTFVKRYD----ELEKLEKYYHLVCWGIPLIPAVIPLI 135 (303)
T ss_pred HHhhHHHHHHHHHHHHHhhheeEEcCc----cHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 999999999999999999999987533 13555677888999999998887754
No 17
>PF10321 7TM_GPCR_Srt: Serpentine type 7TM GPCR chemoreceptor Srt; InterPro: IPR019425 Chemoreception is mediated in Caenorhabditis elegans by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs) of proteins which are of the serpentine type []. Srt is a member of the Srg superfamily of chemoreceptors. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' [].
Probab=98.76 E-value=2e-07 Score=80.36 Aligned_cols=193 Identities=13% Similarity=0.115 Sum_probs=118.9
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhHHHHHHHHHhHHHHhhhhhHHHHHHhccccccccccchh-
Q psy16862 38 IFPFIIKGMLMCFIIITAILGNLLVIISVIKHRKLRIITNYFVVSLAFADLLVALCVMPFNAIVSLTDEWYFGYFMCDV- 116 (303)
Q Consensus 38 ~~~~~~~~~~~~~i~i~gl~~N~~vl~~i~~~~~~~~~~~~ll~~Lai~Dl~~~l~~~~~~~~~~~~~~~~~~~~~C~~- 116 (303)
+..+.+.++...+.+++-.+..+..++++.+++..+.+.+-++.-||+.|++.........-.... .|...|..
T Consensus 29 g~~~p~~G~~~~~~g~~~~~lY~p~~~~i~~~~~~k~~~ykiM~~L~i~Di~~l~~~si~tG~l~i-----~G~vfC~~P 103 (313)
T PF10321_consen 29 GVKRPILGIYFLIFGIIIIILYIPCLIAIFKKKLFKMSCYKIMFFLAIFDIIQLFINSIITGILAI-----FGAVFCSYP 103 (313)
T ss_pred CCcccchhHHHHHHHHHHHHHHHHHHHHHHHhccccCcHHHHHHHHHHHHHHHHHhhhhhhhHHHh-----cCccccCCc
Confidence 445778889999999999999999999999877777899999999999999986543222222222 24445554
Q ss_pred -----chHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccchhhhHHHHHHH--HhhhhhhhhccccccccccCCCcc
Q psy16862 117 -----WNSFDVYFSTASILHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLSNV--WILPGIISFTPIMLGWYTTPDHKQ 189 (303)
Q Consensus 117 -----~~~~~~~~~~~s~~~l~~iaidRy~aI~~P~~~~~~~t~~~~~~~i~~~--w~~~~~~~~~~~~~~~~~~~~~~~ 189 (303)
.+.+...+..++......++++|..-+..|.........++....+.+. +.+-..+-.+|..+.. +...
T Consensus 104 ~~~~i~G~~~~~~W~~~c~~~i~LainR~~~l~~~~~~~~~F~g~~~~~~l~~~~iY~~y~~~fT~pv~Fns----~~~s 179 (313)
T PF10321_consen 104 RFIYIMGCIALGSWCGSCIICILLAINRCCELNFPFKLKFLFEGKRTYIVLFLPIIYGFYFSFFTPPVLFNS----EYMS 179 (313)
T ss_pred hHhhhhhccchhhHHHHHHHHHHHHHHHHHHHhcchhhHHhhcCceEEeeehhHHHHHHHHHHHcCcccCCC----Ccce
Confidence 3445566667888899999999999999987665565655544333333 2222222234433321 0111
Q ss_pred cccCCCceEEEEehhhHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHh
Q psy16862 190 YRKHHPNVCIFVVNVYYALISSSISFWIPGFVMKKKKKRKKKKKKKKKKKR 240 (303)
Q Consensus 190 ~~~~~~~~c~~~~~~~~~~~~~~~~~~ip~~i~~~~y~~I~~~l~~~~~~~ 240 (303)
+. .++..-+......|.-....+..++-.+.+.++|..+...++++.+..
T Consensus 180 w~-fDP~i~~g~~~~~Y~N~~~~~nN~~~~~~t~~lY~~l~~~l~~k~~~~ 229 (313)
T PF10321_consen 180 WF-FDPMIFEGKSPELYSNIPHTFNNIIVVICTTFLYIYLCYYLIKKSKYS 229 (313)
T ss_pred Ee-cCccccCCCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc
Confidence 10 001000001112233333344445556778889999998888876544
No 18
>PF03402 V1R: Vomeronasal organ pheromone receptor family, V1R; InterPro: IPR004072 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The rhodopsin-like GPCRs themselves represent a widespread protein family that includes hormone, neurotransmitter and light receptors, all of which transduce extracellular signals through interaction with guanine nucleotide-binding (G) proteins. Although their activating ligands vary widely in structure and character, the amino acid sequences of the receptors are very similar and are believed to adopt a common structural framework comprising 7 transmembrane (TM) helices [, , ]. Pheromones have evolved in all animal phyla, to signal sex and dominance status, and are responsible for stereotypical social and sexual behaviour among members of the same species. In mammals, these chemical signals are believed to be detected primarily by the vomeronasal organ (VNO), a chemosensory organ located at the base of the nasal septum []. The VNO is present in most amphibia, reptiles and non-primate mammals but is absent in birds, adult catarrhine monkeys and apes []. An active role for the human VNO in the detection of pheromones is disputed; the VNO is clearly present in the foetus but appears to be atrophied or absent in adults. Three distinct families of putative pheromone receptors have been identified in the vomeronasal organ (V1Rs, V2Rs and V3Rs). All are G protein-coupled receptors but are only distantly related to the receptors of the main olfactory system, highlighting their different role []. The V1 receptors share between 50 and 90% sequence identity but have little similarity to other families of G protein-coupled receptors. They appear to be distantly related to the mammalian T2R bitter taste receptors and the rhodopsin-like GPCRs []. In rat, the family comprises 30-40 genes. These are expressed in the apical regions of the VNO, in neurons expressing Gi2. Coupling of the receptors to this protein mediates inositol trisphosphate signalling []. A number of human V1 receptor homologues have also been found. The majority of these human sequences are pseudogenes [] but an apparently functional receptor has been identified that is expressed in the human olfactory system [].; GO: 0016503 pheromone receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane
Probab=98.43 E-value=1.3e-06 Score=73.20 Aligned_cols=172 Identities=15% Similarity=0.043 Sum_probs=108.1
Q ss_pred ccchhHHHHHHHHHhHHHHhh-hhhHHHHHHhccccccccccchhchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC---
Q psy16862 73 RIITNYFVVSLAFADLLVALC-VMPFNAIVSLTDEWYFGYFMCDVWNSFDVYFSTASILHLCCISVDRYYAIVKPLQ--- 148 (303)
Q Consensus 73 ~~~~~~ll~~Lai~Dl~~~l~-~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~s~~~l~~iaidRy~aI~~P~~--- 148 (303)
.+|.+..+.+||++|++..+. ++|.... .+... .+++..|++..++..++-..|+.+.+++++..-+.|.-...
T Consensus 8 ~kp~dlIl~hLa~aN~lvLl~rGip~~~~-~~~~~-~~~d~gCK~v~Y~~RV~RglSictTCLLSvfQAItISP~~s~w~ 85 (265)
T PF03402_consen 8 LKPIDLILIHLALANILVLLSRGIPQTMA-FFGWK-FFDDIGCKIVFYIYRVARGLSICTTCLLSVFQAITISPSNSRWA 85 (265)
T ss_pred CCcHHHHHHHHHHHHHHHHHHhhHHHHHH-Hhhcc-cCCCceeeeeeeehHHhchhhHHhhhHHHHHhhEEEecCCchhh
Confidence 579999999999999988765 3443322 22222 36899999999999999999999999999966665553221
Q ss_pred CCcccchhhhHHHHHHHHhhhhhhhhccccccccccCCCcccccCCCceEEEEehhhH----HHHHhhhhhhhhhhHHHH
Q psy16862 149 YPIIMNQNTVLLMLSNVWILPGIISFTPIMLGWYTTPDHKQYRKHHPNVCIFVVNVYY----ALISSSISFWIPGFVMKK 224 (303)
Q Consensus 149 ~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c~~~~~~~~----~~~~~~~~~~ip~~i~~~ 224 (303)
.-+...++.....+.+.|++.+++............+............|........ ......+.-++.+.+|..
T Consensus 86 ~lK~k~pk~i~~~~~~~Wilnlli~~~~~~~~~~~~n~t~~~~~~~~~yCs~~~~~~~~~~~~~~~~~~~D~~fvgLM~~ 165 (265)
T PF03402_consen 86 ELKVKAPKYIGPSCLFCWILNLLININVLVYIIATPNSTNNNFTYSLGYCSSVPISYIVSSLFATLLSFRDVLFVGLMIW 165 (265)
T ss_pred hhhhhcCCccccHHHHHHHHHHhhhcceeEEEEecCccccCceeecCCeEEEEecchhhhhhhhhHHHhhHHHHHHHHHh
Confidence 1112244556777888999988876433221111111111111123356754322111 112223334566778888
Q ss_pred HHHHHHHHHHHHHHHhhhhhcc
Q psy16862 225 KKKRKKKKKKKKKKKRKKKKNQ 246 (303)
Q Consensus 225 ~y~~I~~~l~~~~~~~~~~~~~ 246 (303)
..+.++..+.||+++.+..++.
T Consensus 166 aSgyMV~~L~RH~q~VqhIhs~ 187 (265)
T PF03402_consen 166 ASGYMVFLLYRHHQQVQHIHSS 187 (265)
T ss_pred hhhhheeeeeecccccccCcCC
Confidence 8888899999999888776553
No 19
>PF10317 7TM_GPCR_Srd: Serpentine type 7TM GPCR chemoreceptor Srd; InterPro: IPR019421 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents the chemoreceptor Srd [].
Probab=98.43 E-value=5.3e-06 Score=71.57 Aligned_cols=125 Identities=17% Similarity=0.219 Sum_probs=91.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhccc-ccccchhHHHHHHHHHhHHHHhhhhhHHHHHHhcc-----------cccccccc
Q psy16862 46 MLMCFIIITAILGNLLVIISVIKHR-KLRIITNYFVVSLAFADLLVALCVMPFNAIVSLTD-----------EWYFGYFM 113 (303)
Q Consensus 46 ~~~~~i~i~gl~~N~~vl~~i~~~~-~~~~~~~~ll~~Lai~Dl~~~l~~~~~~~~~~~~~-----------~~~~~~~~ 113 (303)
++..+.+++|++.|+++++.+.++. +.-+...+++.|-|+.|++.++...... ...... --..|+..
T Consensus 3 i~~~~~~~~~~~~n~~Ll~~i~~~tp~~l~~~~~~l~~~~~~~~~~~~~~~~~q-~R~i~~~~~~~~~~~GpC~~~~~~~ 81 (292)
T PF10317_consen 3 IYHPIFFILGIILNILLLYLIIFKTPKSLRTYSILLLNTAIFDLISIISAFLTQ-PRIIPNGTSIAVISYGPCRYFGPWF 81 (292)
T ss_pred EeHHHHHHHHHHHHHHHHHHHHHhChHHHHHHHHHHHHHHHHHHHHHHHHHHhh-hhccCCCCcEEEEeeecceecCcHH
Confidence 3567889999999999998887653 3335778999999999999876543322 111111 12356778
Q ss_pred chhchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccchhhhHHHHHHHHhhhhhhhhcc
Q psy16862 114 CDVWNSFDVYFSTASILHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLSNVWILPGIISFTP 176 (303)
Q Consensus 114 C~~~~~~~~~~~~~s~~~l~~iaidRy~aI~~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~ 176 (303)
|.....+...+...+.+++.+--..||..+.++.. ++++....+.+.+..++...+..
T Consensus 82 C~~~y~~~~~~~~~s~~~i~~sf~yRy~iL~~~~~-----~~~~~i~~~~l~~~p~~~~~i~~ 139 (292)
T PF10317_consen 82 CFFCYHIMLHFSIHSGWSIILSFYYRYYILRNPSP-----SRKRIIFICILIYIPSFSMLIFF 139 (292)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC-----chhHHHHHHHHHHHHHHHHHHHH
Confidence 88888888888889999999999999999998744 55666677777777777765544
No 20
>PF10316 7TM_GPCR_Srbc: Serpentine type 7TM GPCR chemoreceptor Srbc ; InterPro: IPR019420 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class b (Srb) from the Sra superfamily []. Srb receptors contain 6-8 hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures. Srbc is a solo family amongst the superfamilies of chemoreceptors.
Probab=98.39 E-value=7.9e-06 Score=69.07 Aligned_cols=132 Identities=12% Similarity=0.055 Sum_probs=86.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhHHHHHHHHHhHHHHhhhhhHHHHHHhcc--cccc-ccccchhc
Q psy16862 41 FIIKGMLMCFIIITAILGNLLVIISVIKHRKLRIITNYFVVSLAFADLLVALCVMPFNAIVSLTD--EWYF-GYFMCDVW 117 (303)
Q Consensus 41 ~~~~~~~~~~i~i~gl~~N~~vl~~i~~~~~~~~~~~~ll~~Lai~Dl~~~l~~~~~~~~~~~~~--~~~~-~~~~C~~~ 117 (303)
..+..++..+..++....|..+++.+...|+.+++.-.++....+.|.+.++...+......... .... ....--+.
T Consensus 5 ~~iv~~i~i~~s~~~~~iN~~lL~~if~~Kk~kk~~l~LfY~Rf~~D~~~~~~~~~~~~~~~~~~~~~~~~~~~~~if~~ 84 (273)
T PF10316_consen 5 AIIVSIIGIIFSIITCLINFYLLYSIFYSKKKKKPDLSLFYFRFAIDVFYGFSVFIYLIYYILFSLFPEFLDYKNLIFYI 84 (273)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCEEeeHHHHHHHHHHHHHHHHHHHHHHHHHhChhhhccccEEeeh
Confidence 45566777788889999999999988876664456566666789999999887766544443321 1111 22222233
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccchhhhHHHHHHHHhhhhhh
Q psy16862 118 NSFDVYFSTASILHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLSNVWILPGII 172 (303)
Q Consensus 118 ~~~~~~~~~~s~~~l~~iaidRy~aI~~P~~~~~~~t~~~~~~~i~~~w~~~~~~ 172 (303)
+................||+||.+|+..|..|+....+-....+++++...++.-
T Consensus 85 ~~p~~~~~~iR~~l~~~Ia~dR~~A~~fPI~y~~~r~k~~~~~I~~~~~~~~~~d 139 (273)
T PF10316_consen 85 GWPSSNLGSIRSILALIIALDRVFAVYFPIFYHNYRKKIPNFIIIIIALSYGLFD 139 (273)
T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHcCCHHHHccCccccHHHHHHHHHHHHHHH
Confidence 4455555577889999999999999999988776544443333333333333333
No 21
>PF10292 7TM_GPCR_Srab: Serpentine type 7TM GPCR receptor class ab chemoreceptor; InterPro: IPR019408 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. Srab is part of the Sra superfamily of chemoreceptors. The expression pattern of the srab genes is biologically intriguing. Of the six promoters successfully expressed in transgenic organisms, one was exclusively expressed in the tail phasmid neurons, two were exclusively expressed in a head amphid neuron, and two were expressed both in the head and tail neurons as well as a limited number of other cells [].
Probab=98.31 E-value=8.1e-05 Score=65.20 Aligned_cols=131 Identities=9% Similarity=0.055 Sum_probs=94.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhHHHHHHHHHhHHHHhhhhhHHH---HHHhc--c--cccccccc
Q psy16862 41 FIIKGMLMCFIIITAILGNLLVIISVIKHRKLRIITNYFVVSLAFADLLVALCVMPFNA---IVSLT--D--EWYFGYFM 113 (303)
Q Consensus 41 ~~~~~~~~~~i~i~gl~~N~~vl~~i~~~~~~~~~~~~ll~~Lai~Dl~~~l~~~~~~~---~~~~~--~--~~~~~~~~ 113 (303)
..+..++..+++++|++.++..+......+.+|.-..+++....++.++.++..+.... ..+.. + +...+...
T Consensus 16 l~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~H~N~ril~~~~~~~~l~~~~~r~~~h~~~l~~~~~~~~~Cd~~~~~~~ 95 (324)
T PF10292_consen 16 LRLSLIFNLLLSIIAFPVIIYALWKIRNSKLFHFNTRILFIVHCFSFLIHCTGRIILHTYDLYNYFFPDDPCDMIPSTYR 95 (324)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccCCCcccccchhH
Confidence 34455667788889999999988888888888877788888888888888765433322 22222 1 23346678
Q ss_pred chhchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccchhhhHHHHHHHHhhhhhh
Q psy16862 114 CDVWNSFDVYFSTASILHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLSNVWILPGII 172 (303)
Q Consensus 114 C~~~~~~~~~~~~~s~~~l~~iaidRy~aI~~P~~~~~~~t~~~~~~~i~~~w~~~~~~ 172 (303)
|...-....++......+.+++++||++|.++.-+|.+. ...-...+..+.|++++..
T Consensus 96 C~~lR~~~~~~~~~~~~t~v~l~IER~iAT~~~~~YE~~-~~~~Gi~l~~~qi~is~~~ 153 (324)
T PF10292_consen 96 CFILRIPYNFGLFLVSFTTVSLVIERTIATFFSKSYEKS-GKWLGILLAFFQILISLLI 153 (324)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCC-CccHHHHHHHHHHHHHHHH
Confidence 888888999999999999999999999999999888865 2333344444455554443
No 22
>PF00002 7tm_2: 7 transmembrane receptor (Secretin family); InterPro: IPR000832 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The secretin-like GPCRs include secretin [], calcitonin [], parathyroid hormone/parathyroid hormone-related peptides [] and vasoactive intestinal peptide [], all of which activate adenylyl cyclase and the phosphatidyl-inositol-calcium pathway. These receptors contain seven transmembrane regions, in a manner reminiscent of the rhodopsins and other receptors believed to interact with G-proteins (however there is no significant sequence identity between these families, the secretin-like receptors thus bear their own unique '7TM' signature). Their N terminus is probably located on the extracellular side of the membrane and potentially glycosylated. This N-terminal region contains a long conserved region which allow the binding of large peptidic ligand such as glucagon, secretin, VIP and PACAP; this region contains five conserved cysteines residues which could be involved in disulphide bond. The C-terminal region of these receptor is probably cytoplasmic. Every receptor gene in this family is encoded on multiple exons, and several of these genes are alternatively spliced to yield functionally distinct products. ; GO: 0004930 G-protein coupled receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane; PDB: 3L2J_A 1BL1_A.
Probab=98.22 E-value=5.1e-07 Score=75.69 Aligned_cols=129 Identities=19% Similarity=0.221 Sum_probs=3.1
Q ss_pred HHHHHHHHHHHHHHHHHHHhhcccccccchhHHHHHHHHHhHHHHhhhhhHHHHHHhccccccccccchhchHHHHHHHH
Q psy16862 47 LMCFIIITAILGNLLVIISVIKHRKLRIITNYFVVSLAFADLLVALCVMPFNAIVSLTDEWYFGYFMCDVWNSFDVYFST 126 (303)
Q Consensus 47 ~~~~i~i~gl~~N~~vl~~i~~~~~~~~~~~~ll~~Lai~Dl~~~l~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~ 126 (303)
+..+.+.+++++-++.+..+...|++|+..+....||++++++..+..+.. ......+.....+..|...+.+.+++..
T Consensus 6 i~~vg~~~Si~~ll~~i~~~~~~r~lr~~~~~i~~~l~~sll~~~~~~l~~-~~~~~~~~~~~~~~~C~~~a~~~hy~~l 84 (242)
T PF00002_consen 6 ISYVGCSLSIICLLLTIITYLLFRKLRSFRNKIHLNLCLSLLLANLSFLIG-ISQTFSPISTTNHCLCRAIAILLHYFFL 84 (242)
T ss_dssp HHHHHHH-------------------------------------------------------------------------
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcccchhhhhhhHHHHHHHHHHHhee-hhhccccccccccccchhhhhHhHHHHH
Confidence 334444555555555555555557777777888899999999886654322 1111111111123359999999999999
Q ss_pred HHHHHHHHHHHHHHHHHhcCCCCCcccchhhhHHHHHHHHhhhhhhhhcccccc
Q psy16862 127 ASILHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLSNVWILPGIISFTPIMLG 180 (303)
Q Consensus 127 ~s~~~l~~iaidRy~aI~~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~~~ 180 (303)
++...+.+.+++-|..+..+... .+++.....+..|.+++++........
T Consensus 85 a~f~Wm~~~~~~l~~~~~~~~~~----~~~~~~~~~~~gwg~P~~iv~i~~~~~ 134 (242)
T PF00002_consen 85 ASFFWMLVEAFYLYRLLVKVFNS----SRRRFWWYYLIGWGIPALIVVISVAVN 134 (242)
T ss_dssp ------------------------------------------------------
T ss_pred HHHHHHHHHHHHhheeEEEeecc----cchhhheeeeeeecCcceeeeeeeeec
Confidence 99999999999999999887331 245556677788999888876664443
No 23
>PF02118 Srg: Srg family chemoreceptor; InterPro: IPR000609 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class g (Srg) from the Srg superfamily [, ]. Srg receptors contain seven hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures. ; GO: 0004888 transmembrane signaling receptor activity, 0007606 sensory perception of chemical stimulus, 0016020 membrane
Probab=98.03 E-value=5.3e-05 Score=64.72 Aligned_cols=121 Identities=13% Similarity=0.212 Sum_probs=77.9
Q ss_pred HHHHHHHHHHHHHHHHhhcccc--c-ccchhHHHHHHHHHhHHHHhhhhhH-HHHHHhc-------cccccccccchhch
Q psy16862 50 FIIITAILGNLLVIISVIKHRK--L-RIITNYFVVSLAFADLLVALCVMPF-NAIVSLT-------DEWYFGYFMCDVWN 118 (303)
Q Consensus 50 ~i~i~gl~~N~~vl~~i~~~~~--~-~~~~~~ll~~Lai~Dl~~~l~~~~~-~~~~~~~-------~~~~~~~~~C~~~~ 118 (303)
+-++.+++.+..+++++.++++ . +++.+.+...-++.|++.-+...+. -+..... ......+.......
T Consensus 6 ~Ygi~s~~l~~~~~~~i~~~~~~~~~~~sFy~l~~~d~~~ni~~~ln~~~~~Rl~~~~~~~~~~~~~~~~~~~~~l~~~~ 85 (275)
T PF02118_consen 6 IYGIPSLILYIFILYVIWKSKKKSYFKSSFYRLYIMDLIMNILTYLNTWITLRLPSYTPPFCPFFLPFFENPPFFLNIFY 85 (275)
T ss_pred hHHHHHHHHHHHHHHHHHHhccccccCCccHHHHHHHhHHHHHHHHHHHHHHHhccccCccchhHHHHHcCCcHHHHHHH
Confidence 3466778888888888887776 3 6677777777777777765443332 1111100 00122344566667
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccchhhhHHHHHHHHhhhhh
Q psy16862 119 SFDVYFSTASILHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLSNVWILPGI 171 (303)
Q Consensus 119 ~~~~~~~~~s~~~l~~iaidRy~aI~~P~~~~~~~t~~~~~~~i~~~w~~~~~ 171 (303)
++...+..+.......|+++|+.+|..|.++.+.-++.. ..+++++.++++.
T Consensus 86 ~l~~~~~~~Q~~~~~~is~nR~t~v~~p~~~~~~W~~~~-~~~i~~i~~~p~~ 137 (275)
T PF02118_consen 86 FLQYYFAYVQYLSTILISLNRFTSVLFPIRYEKFWKRYY-WIIIIIIFLLPFS 137 (275)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhHHHHhhh-hhheeeeeehhHH
Confidence 788888899999999999999999999999877555333 2333334444433
No 24
>PF02117 7TM_GPCR_Sra: Serpentine type 7TM GPCR chemoreceptor Sra; InterPro: IPR000344 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class a (Sra) from the Sra superfamily []. Sra receptors contain 6-7 hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures.; GO: 0004888 transmembrane signaling receptor activity, 0007606 sensory perception of chemical stimulus, 0016021 integral to membrane
Probab=98.03 E-value=0.0014 Score=57.43 Aligned_cols=123 Identities=18% Similarity=0.236 Sum_probs=83.6
Q ss_pred CCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccch-hHHHHHHHHHhHHHHhhhhhH--HHHHHhc
Q psy16862 28 FPTNSTIHIHIFPFIIKGMLMCFIIITAILGNLLVIISVIKHRKLRIIT-NYFVVSLAFADLLVALCVMPF--NAIVSLT 104 (303)
Q Consensus 28 ~~~~~~~~~~~~~~~~~~~~~~~i~i~gl~~N~~vl~~i~~~~~~~~~~-~~ll~~Lai~Dl~~~l~~~~~--~~~~~~~ 104 (303)
.+++...........+..++..++.++++++....+-.+.++.-..+.+ .+++.+|..+++=........ .+...+.
T Consensus 7 As~~~~~~~tS~~~~~~~~~~~~~~~~t~~~s~~Ai~~l~~ksif~~sTr~LL~~~l~~a~~Hq~~y~~~~~~~lyrsf~ 86 (328)
T PF02117_consen 7 ASEDQLEALTSWNFRINQIFILIIIIITFIFSVFAIKKLWKKSIFSNSTRILLISNLLFANIHQISYAIIQIWSLYRSFF 86 (328)
T ss_pred CCHHHHHHHhCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 3333333334444677778888888899999988888888776666544 555567777766443322111 1111111
Q ss_pred ---c--ccccccccchhchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCC
Q psy16862 105 ---D--EWYFGYFMCDVWNSFDVYFSTASILHLCCISVDRYYAIVKPLQYP 150 (303)
Q Consensus 105 ---~--~~~~~~~~C~~~~~~~~~~~~~s~~~l~~iaidRy~aI~~P~~~~ 150 (303)
+ ...+.+..|.....+..++....++..+++++||.+|-..|-.+.
T Consensus 87 ~~~dpC~i~f~~~eC~~~~~~~~~~~~~~~~~q~aL~idRl~at~~~~~~~ 137 (328)
T PF02117_consen 87 YSDDPCSILFTSSECFPYYYFYYFTNSGMIFIQFALTIDRLLATFFPKYYS 137 (328)
T ss_pred cCCCCeeeEeecCceeeeehHHHHHHHHHHHHHHHHHHHHHHHHhchhhhh
Confidence 1 344677799999999999999999999999999999999886543
No 25
>PF04789 DUF621: Protein of unknown function (DUF621); InterPro: IPR006874 This is a conserved region found in uncharacterised proteins from Caenorhabditis elegans, and is noted to have possible G-protein-coupled receptor-like activity.
Probab=97.76 E-value=0.0012 Score=54.66 Aligned_cols=176 Identities=13% Similarity=0.141 Sum_probs=102.5
Q ss_pred HHHHHHHHHhhcccccccchhHHHHHHHHHhHHHHhhh----hhHHHHHHhccccccccccchhchHHHHHHHHHHHHHH
Q psy16862 57 LGNLLVIISVIKHRKLRIITNYFVVSLAFADLLVALCV----MPFNAIVSLTDEWYFGYFMCDVWNSFDVYFSTASILHL 132 (303)
Q Consensus 57 ~~N~~vl~~i~~~~~~~~~~~~ll~~Lai~Dl~~~l~~----~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~s~~~l 132 (303)
.|-.+++.++....-.+-+...++..|.++-++...+. .|..+......+.. .+..-.+..++..++.++-.++-
T Consensus 30 t~~Flv~~i~lW~~Fk~m~ffwFl~qlt~s~fi~S~lNl~inVPatlfsl~t~~~~-~s~~f~~~Sy~idf~h~siLfsN 108 (305)
T PF04789_consen 30 TGAFLVLSIILWSHFKPMKFFWFLTQLTISVFIISSLNLLINVPATLFSLITKEFV-QSELFIFMSYLIDFCHYSILFSN 108 (305)
T ss_pred HHHHHHHHHHHHHhcccchHHHHHHHHHHHHHHHHhhhheEeCcHHHHHhhhHHHh-ccchhhHHHHHHHHHHHHHHHHH
Confidence 33344444444444334577889999988887765443 34444333333222 22334455677888889999999
Q ss_pred HHHHHHHHHHHhcCCCCCcccchhhhHHHHHHHHhhhhhhhhccccccc-cccCC--Cc--ccccCCCceEEEEehhhHH
Q psy16862 133 CCISVDRYYAIVKPLQYPIIMNQNTVLLMLSNVWILPGIISFTPIMLGW-YTTPD--HK--QYRKHHPNVCIFVVNVYYA 207 (303)
Q Consensus 133 ~~iaidRy~aI~~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~~~~-~~~~~--~~--~~~~~~~~~c~~~~~~~~~ 207 (303)
.+||++|....-.-..-.+.....-..+-+..+|++++........-+- +..+. .+ ..|+.....-.........
T Consensus 109 lviaIqR~fVFFfr~~t~~~F~s~~iyiWL~~vWils~~v~~~l~~~nC~Y~y~~~~~~y~L~C~~~~~~v~~~~P~~Iq 188 (305)
T PF04789_consen 109 LVIAIQRFFVFFFRNLTDKVFESPVIYIWLLLVWILSIGVVYSLMSNNCRYRYNKWSKHYQLNCETCNSVVDISPPRGIQ 188 (305)
T ss_pred HHHHHhhheeeeehhhhhhhccchHHHHHHHHHHHHHHHHHHHHHhCCCeeecccccccEEEEcCCCCeeEeeCCCCchh
Confidence 9999999876654333334455677778888899999887544432211 11111 11 1233332222334445556
Q ss_pred HHHhhhhhhhhhhHHHHHHHHHHHHHH
Q psy16862 208 LISSSISFWIPGFVMKKKKKRKKKKKK 234 (303)
Q Consensus 208 ~~~~~~~~~ip~~i~~~~y~~I~~~l~ 234 (303)
.+..++.+.+|+.+. +.|+.+..++.
T Consensus 189 iiE~ilQ~~IPi~Il-~iYiAIIiKI~ 214 (305)
T PF04789_consen 189 IIEIILQFGIPIFIL-VIYIAIIIKII 214 (305)
T ss_pred HHHHHHHHhHHHHHH-HHHHHHHHHHH
Confidence 677888899997554 45555554443
No 26
>PF10327 7TM_GPCR_Sri: Serpentine type 7TM GPCR chemoreceptor Sri; InterPro: IPR019429 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents Sri, which is part of the Str superfamily of chemoreceptors.
Probab=97.66 E-value=0.00076 Score=58.43 Aligned_cols=196 Identities=16% Similarity=0.171 Sum_probs=105.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcc-cccccchhHHHH---HHHHHhHHHHhhhhhHHHHHHhccccccccccchh---
Q psy16862 44 KGMLMCFIIITAILGNLLVIISVIKH-RKLRIITNYFVV---SLAFADLLVALCVMPFNAIVSLTDEWYFGYFMCDV--- 116 (303)
Q Consensus 44 ~~~~~~~i~i~gl~~N~~vl~~i~~~-~~~~~~~~~ll~---~Lai~Dl~~~l~~~~~~~~~~~~~~~~~~~~~C~~--- 116 (303)
....+=++++++++.|.+.++.+..+ +++.+-.++++. ...++|+-.+++.-|..+.....+ +..| ..|..
T Consensus 11 li~~~~~ig~iS~~~n~~~iyLi~fks~k~~~fry~ll~~Qi~~~l~di~~t~L~qpipLfP~~ag-y~~G-~l~~~f~i 88 (303)
T PF10327_consen 11 LINYYHIIGVISFILNSLGIYLIIFKSPKLDNFRYYLLYFQISCTLTDIHLTFLMQPIPLFPIPAG-YCVG-FLARYFGI 88 (303)
T ss_pred HHHHHHHHHHHHHHHHHHHheeEEEecCCccchhhHHHHHHHHHHHhhhhhhhhccchhhcceeEE-EEEe-ehhhcCCC
Confidence 34566677899999999999877654 444443333332 356789988888888766655543 1222 23333
Q ss_pred ----chHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccchhhhHHHHHHHHhhhhhhhhccccccccccCCCccccc
Q psy16862 117 ----WNSFDVYFSTASILHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLSNVWILPGIISFTPIMLGWYTTPDHKQYRK 192 (303)
Q Consensus 117 ----~~~~~~~~~~~s~~~l~~iaidRy~aI~~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~ 192 (303)
...+..+......-.++.--+-|+.+|.+..+.+. .++..-.....+..+..++..... .......++...+-.
T Consensus 89 ~~h~~~~i~~~l~~~q~~~L~~CF~~KHQaIa~i~~~~~-l~~~~~~~~~~~~~~~p~~~~~~~-~~~~~s~eeq~~yI~ 166 (303)
T PF10327_consen 89 SSHYLMTIWIFLFIFQIESLVICFLRKHQAIARILSKHV-LPKWFYIFFYFFCILFPFFIFICF-YFSGMSREEQMEYIR 166 (303)
T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccCc-CcHHHHHHHhhHHHHHHHHHHHHH-HhcCCCHHHHHHHHH
Confidence 33344444555556666667789999999876443 233333333333333333332222 111111111111111
Q ss_pred CC-----------CceEEEEeh--hhHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHhhhh
Q psy16862 193 HH-----------PNVCIFVVN--VYYALISSSISFWIPGFVMKKKKKRKKKKKKKKKKKRKKK 243 (303)
Q Consensus 193 ~~-----------~~~c~~~~~--~~~~~~~~~~~~~ip~~i~~~~y~~I~~~l~~~~~~~~~~ 243 (303)
.. +..-.++.+ .....+...++.++..++..+++..+++.+++.+++..+.
T Consensus 167 ~nyP~~~~~F~~l~nf~iY~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~L~~~k~~iS~~ 230 (303)
T PF10327_consen 167 ENYPEYLSWFSSLPNFAIYDFNPWFIFFFILAFFGGFLCFVIFIFLTIDMFRMLKKLKKKISKQ 230 (303)
T ss_pred hhCCchhHHHhcCCCEEEEeChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCHH
Confidence 11 111112222 2233444455566677788888899999888877665543
No 27
>PF10326 7TM_GPCR_Str: Serpentine type 7TM GPCR chemoreceptor Str; InterPro: IPR019428 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class r (Str) from the Str superfamily [, ]. Almost a quarter (22.5%) of str and srj family genes and pseudogenes in C. elegans appear to have been newly formed by gene duplications since the species split [].
Probab=97.65 E-value=4.6e-05 Score=66.28 Aligned_cols=102 Identities=11% Similarity=0.185 Sum_probs=67.5
Q ss_pred HHHHHHHHHHHHHHHHHhhcccccc-cchhHHHHHHHHHhHHHHhhhhhHHHHHHhc----------cccccccccchhc
Q psy16862 49 CFIIITAILGNLLVIISVIKHRKLR-IITNYFVVSLAFADLLVALCVMPFNAIVSLT----------DEWYFGYFMCDVW 117 (303)
Q Consensus 49 ~~i~i~gl~~N~~vl~~i~~~~~~~-~~~~~ll~~Lai~Dl~~~l~~~~~~~~~~~~----------~~~~~~~~~C~~~ 117 (303)
-+.++++++.|++.++.+.++.+.. ....+++...|+.|++.+++-....-..... +.+..+.......
T Consensus 6 ~~~~~~s~~~N~~Li~Li~~~s~k~~G~Yk~Lm~~fs~~~i~fs~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~l 85 (307)
T PF10326_consen 6 YIGFVLSLFLNSLLIYLILTKSPKSLGSYKYLMIYFSIFEIIFSILDFLVQPIIHSYGSSFVVFSDSGPFFLSKSIGQIL 85 (307)
T ss_pred HHHHHHHHHHHHHHHHHHHhccCCCCCCEEEEEehhHHHHHHHHHHHHHHHhceeecCCeeEEEeecccccccchheeeh
Confidence 3567889999999999887653333 3456677788999998876543322111111 1122334445555
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCC
Q psy16862 118 NSFDVYFSTASILHLCCISVDRYYAIVKPLQYP 150 (303)
Q Consensus 118 ~~~~~~~~~~s~~~l~~iaidRy~aI~~P~~~~ 150 (303)
..+...+...+...++.--+.||.++++|.+.+
T Consensus 86 ~~~y~~~f~~~~~llaihFiYRY~av~~~~~l~ 118 (307)
T PF10326_consen 86 LAIYCGCFGMSFSLLAIHFIYRYWAVCRPKKLR 118 (307)
T ss_pred hhhhhhhHHHHHHHHHhHhhhheeeeccchhhc
Confidence 666677777788888888889999999998643
No 28
>KOG4193|consensus
Probab=97.57 E-value=0.0024 Score=60.07 Aligned_cols=167 Identities=16% Similarity=0.124 Sum_probs=97.6
Q ss_pred HHHHHHHHHHHHHHHHhhcccccccchhHHHHHHHHHhHHHHhhhhhHHHHHHhccccccc-cc-cchhchHHHHHHHHH
Q psy16862 50 FIIITAILGNLLVIISVIKHRKLRIITNYFVVSLAFADLLVALCVMPFNAIVSLTDEWYFG-YF-MCDVWNSFDVYFSTA 127 (303)
Q Consensus 50 ~i~i~gl~~N~~vl~~i~~~~~~~~~~~~ll~~Lai~Dl~~~l~~~~~~~~~~~~~~~~~~-~~-~C~~~~~~~~~~~~~ 127 (303)
+-+++++++=.+.+++..--+++++..+....+|+++=++.-+ .++.+.|..+ .. .|...+++..++..+
T Consensus 328 VG~~iSlvcL~lti~ty~~~~~l~~~~~~i~~~l~~~L~l~~l--------~fL~~~~~~~~~~~~C~~~a~llhff~La 399 (610)
T KOG4193|consen 328 VGCIISLVCLLLTIATYLLFRKLQNDRTKIHINLCLCLFLAEL--------LFLLGIDRTSTSVVLCIAAAILLHFFFLA 399 (610)
T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHHHH--------HHhcccccccCcccccHHHHHHHHHHHHH
Confidence 3334444444444444444455554447777788877222211 1111222222 22 799999999999999
Q ss_pred HHHHHHHHHHHHHHHHhcCCCCCcccchhhhHHHHHHHHhhhhhhhhccccccccccCCCcccccCCCceEEEEehhhHH
Q psy16862 128 SILHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLSNVWILPGIISFTPIMLGWYTTPDHKQYRKHHPNVCIFVVNVYYA 207 (303)
Q Consensus 128 s~~~l~~iaidRy~aI~~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c~~~~~~~~~ 207 (303)
..+.+.+++++-|+.++.-+. ....++...+....|...+++............. ....+..|.......+.
T Consensus 400 aF~Wm~leg~hl~~~~v~vf~---~~~~~~~l~~~~~gwg~Pavvv~Isa~~~~~~~~-----~~~~~~~CWl~~~~~~~ 471 (610)
T KOG4193|consen 400 AFFWMLLEGFHLYLLLVEVFR---SRPRRRKLLYSLYGWGVPAVVVGVSALVDPDLEG-----QYGTPRVCWLDTQNGFI 471 (610)
T ss_pred HHHHHHHHHHHHHHHHHHHhc---cccchHHHHHHHHHhhhhHHHHhheeEEeccCcc-----ccccCCceEEecCCceE
Confidence 999999999999997765443 2234455555468999998887766544332211 11223337655444433
Q ss_pred HHHhhhhhhhhhhHHHHHHHHHHHHHHHHH
Q psy16862 208 LISSSISFWIPGFVMKKKKKRKKKKKKKKK 237 (303)
Q Consensus 208 ~~~~~~~~~ip~~i~~~~y~~I~~~l~~~~ 237 (303)
+. |+-|+.+++.+++.++...-.+-
T Consensus 472 ~~-----F~GPv~~ii~~Ni~~Fv~t~~~l 496 (610)
T KOG4193|consen 472 WS-----FLGPVTLIILVNIVMFVVTLKKL 496 (610)
T ss_pred EE-----EehHHHHHHHHHHHHHHHHHHHH
Confidence 32 67788888877765555544433
No 29
>PF02101 Ocular_alb: Ocular albinism type 1 protein; InterPro: IPR001414 Ocular albinism type 1 (OA1) is an X-linked disorder characterised by severe impairment of visual acuity, retinal hypopigmentation and the presence of macromelanosomes. A novel transcript from the OA1 critical region is expressed in high levels in RNA samples from retina and from melanoma and encodes a potential integral membrane protein []. This protein is of unknown function but is known to bind heterotrimeric G proteins.; GO: 0016020 membrane
Probab=97.32 E-value=0.013 Score=51.02 Aligned_cols=128 Identities=20% Similarity=0.255 Sum_probs=80.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcccc----ccc--chhHHHHHHHHHhHHHHhhhhhHHHHHHhccccc--------
Q psy16862 43 IKGMLMCFIIITAILGNLLVIISVIKHRK----LRI--ITNYFVVSLAFADLLVALCVMPFNAIVSLTDEWY-------- 108 (303)
Q Consensus 43 ~~~~~~~~i~i~gl~~N~~vl~~i~~~~~----~~~--~~~~ll~~Lai~Dl~~~l~~~~~~~~~~~~~~~~-------- 108 (303)
.+..+.+.-..+|+.|-++-+.--.+... ..+ ...-.+..||++|++.++..+....+++......
T Consensus 28 ~f~avCLgSs~l~l~gallQLlp~rr~~~~~~~~~sp~~~~rIl~~la~aDlLaclGVivRS~vWl~~p~~~~s~s~~~~ 107 (405)
T PF02101_consen 28 AFNAVCLGSSVLSLLGALLQLLPRRRSAGPRAPARSPSSSRRILFWLAVADLLACLGVIVRSSVWLGFPNFIDSISDVNG 107 (405)
T ss_pred hhhhhHHHHHHHHHHHHHHhhccccccccccccccCCcCCchhHHHHHHHHHHhhhhHHHHhhhhhcCCcccccccCCCC
Confidence 34444555556666665544442211100 011 2356889999999999887666555544332111
Q ss_pred ---cccccchhchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccchhhhHHHHHHHHhhhhhhhhcc
Q psy16862 109 ---FGYFMCDVWNSFDVYFSTASILHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLSNVWILPGIISFTP 176 (303)
Q Consensus 109 ---~~~~~C~~~~~~~~~~~~~s~~~l~~iaidRy~aI~~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~ 176 (303)
.....|.....-.++++.++.|.+.+-|+|-|+.+.+--. .......-...|..+.++..-.
T Consensus 108 ~d~wp~afCv~ss~WIq~fYsAtfwWtfcYAVDv~Lv~~~~ag------~~~~~lYH~~aWgl~~lL~~~G 172 (405)
T PF02101_consen 108 TDIWPAAFCVGSSMWIQLFYSATFWWTFCYAVDVYLVIRRSAG------RSTIWLYHMMAWGLPALLCAEG 172 (405)
T ss_pred CccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhheeeccCC------CcchhHHHHHHHHHHHHHHHhc
Confidence 1235888877777788899999999999999998865321 1223455667788888876544
No 30
>PF10318 7TM_GPCR_Srh: Serpentine type 7TM GPCR chemoreceptor Srh; InterPro: IPR019422 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. Srh is part of the Str superfamily of chemoreceptors [].
Probab=96.64 E-value=0.26 Score=42.68 Aligned_cols=132 Identities=9% Similarity=0.087 Sum_probs=76.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcccc--cccch---hHHHHHHHHHhHHHHhhhhhHHHHHHhc----cccccccccc
Q psy16862 44 KGMLMCFIIITAILGNLLVIISVIKHRK--LRIIT---NYFVVSLAFADLLVALCVMPFNAIVSLT----DEWYFGYFMC 114 (303)
Q Consensus 44 ~~~~~~~i~i~gl~~N~~vl~~i~~~~~--~~~~~---~~ll~~Lai~Dl~~~l~~~~~~~~~~~~----~~~~~~~~~C 114 (303)
.....=.+.++++..|++..+.+.++.. +++.- -.+-....+.|+..+++..|........ |-...-..-.
T Consensus 6 ~~~~~h~~~~i~~Pi~~~~~y~Il~ktp~~m~~~k~~ll~~~~~~~~~d~~~~~l~~P~~~~P~~~~~~~G~l~~~gv~~ 85 (302)
T PF10318_consen 6 YSIVLHIITIISIPIYIFGFYCILFKTPKSMKSVKWYLLNHHFWNFLLDLYLSFLIIPYFFFPSPAGYPLGILSYFGVPT 85 (302)
T ss_pred HHHHHHHHHHHHHHHHHHHHHheeEeChhhhHHHHHHHHHHHHHHHHHHHHhhhEEeeeeccchHHhhCCcchhhcccCh
Confidence 3455566778899999999998886532 22211 2222234568888888877754433222 2111112345
Q ss_pred hhchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccchhhhHHHHHHHHhhhhhhhhccc
Q psy16862 115 DVWNSFDVYFSTASILHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLSNVWILPGIISFTPI 177 (303)
Q Consensus 115 ~~~~~~~~~~~~~s~~~l~~iaidRy~aI~~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~ 177 (303)
.+..++..........+++.+--+||.++..+.+... .++.-....++.++++.....|..
T Consensus 86 ~~q~~i~~~~~~~~~~S~~~lFe~R~~~i~~~~~~~~--~~~~r~~~~~~~y~~~~~~~~~~~ 146 (302)
T PF10318_consen 86 PIQFYIGVTLLAFVGVSILSLFENRYQIIIRNRSRSL--WKKIRIIFIIFNYIIAFLFFIPIY 146 (302)
T ss_pred HHHHHHHHHHhhhhhHHHHHHHHHHHHHHhccccccc--chhhHHHHHHHHHHHHHHHHHHHh
Confidence 5566666777777778888888999999996643111 122233344444555555554543
No 31
>KOG4564|consensus
Probab=96.61 E-value=0.15 Score=46.39 Aligned_cols=179 Identities=14% Similarity=0.059 Sum_probs=114.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcccccccchhHHHHHHHHHhHHHHhhhhhHHHHHHhccc------------ccccccc
Q psy16862 46 MLMCFIIITAILGNLLVIISVIKHRKLRIITNYFVVSLAFADLLVALCVMPFNAIVSLTDE------------WYFGYFM 113 (303)
Q Consensus 46 ~~~~~i~i~gl~~N~~vl~~i~~~~~~~~~~~~ll~~Lai~Dl~~~l~~~~~~~~~~~~~~------------~~~~~~~ 113 (303)
+++.+-..++++.=++.+.++..-|++|=..|++=.||.++=++-++..+........... -..+...
T Consensus 149 ~lytvGyslSl~sL~vAl~If~~FR~L~CtRn~IH~nLF~SfiLra~~~~i~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 228 (473)
T KOG4564|consen 149 ILYTVGYSLSLVSLLVALIIFLYFRSLHCTRNYIHMNLFASFILRAASVLIKDLVLVVNGEQDASSDTSLHCLISSNPVG 228 (473)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccchh
Confidence 5666666666766667777777788888778899899999888887766555443332211 1256779
Q ss_pred chhchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccchhhhHHHHHHHHhhhhhhhhccccccccccCCCcccccC
Q psy16862 114 CDVWNSFDVYFSTASILHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLSNVWILPGIISFTPIMLGWYTTPDHKQYRKH 193 (303)
Q Consensus 114 C~~~~~~~~~~~~~s~~~l~~iaidRy~aI~~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~ 193 (303)
|+...++...+..+.-+.+.+=.+.-|..+.--... .++.....+.+.|.+.+++..+=.+ .....++. .|..
T Consensus 229 Ck~~~~~~~Yf~~aNf~WmlvEGlYLhtLlv~~~~s----er~~~~~y~~iGWG~P~v~v~~Wai-~r~~~~d~--~CW~ 301 (473)
T KOG4564|consen 229 CKLLFVFFQYFVLANFFWMLVEGLYLHTLLVVAFFS----ERRYFWLYLLIGWGVPAVFVVVWAI-VRLYLEDT--GCWD 301 (473)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhheeecc----CccHHHHHHHHhhCCcHHHhhhHHh-Hheeeccc--cccc
Confidence 999988888888888888888888777766654331 2466677888889999888655433 33323332 2222
Q ss_pred CCceEEEEehhhHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHH
Q psy16862 194 HPNVCIFVVNVYYALISSSISFWIPGFVMKKKKKRKKKKKKKKKK 238 (303)
Q Consensus 194 ~~~~c~~~~~~~~~~~~~~~~~~ip~~i~~~~y~~I~~~l~~~~~ 238 (303)
.. .........-...++++++-.++.+.|++.+-++-|
T Consensus 302 ~~-------~~~~~~WIi~gPi~~ai~vNf~flinIvrILv~KLr 339 (473)
T KOG4564|consen 302 SN-------DTGHIWWIIRGPILLAILVNFIFLINIVRILVTKLR 339 (473)
T ss_pred cC-------CCcceEEEEeccHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 22 111111222233445666677777778777766554
No 32
>PF03383 Serpentine_r_xa: Caenorhabditis serpentine receptor-like protein, class xa; InterPro: IPR005047 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class xa (Srxa), from the Str superfamily [].
Probab=96.59 E-value=0.0016 Score=49.60 Aligned_cols=63 Identities=14% Similarity=0.250 Sum_probs=53.1
Q ss_pred hchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccchhhhHHHHHHHHhhhhhhhhcccc
Q psy16862 116 VWNSFDVYFSTASILHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLSNVWILPGIISFTPIM 178 (303)
Q Consensus 116 ~~~~~~~~~~~~s~~~l~~iaidRy~aI~~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~ 178 (303)
....+..++.....+..++|+++|-..|.+|+....+.+.++.++.+.++.++.+..-+.|.+
T Consensus 46 ~~TL~~tf~Yl~plfltvLMti~Ri~iV~~P~~~~~~Fs~~kl~~YC~~i~i~~~i~LlIPy~ 108 (153)
T PF03383_consen 46 EVTLFGTFSYLHPLFLTVLMTINRIYIVLFPFGSEIWFSDKKLWIYCGIIAILSFISLLIPYF 108 (153)
T ss_pred EEeeeehHHHHHHHHHHHHHHHhheEEEEecCCCccccccchhHHHHHHHHHHHHHHHHhhcC
Confidence 345567778888999999999999999999999777899999999998888888777767754
No 33
>PF01534 Frizzled: Frizzled/Smoothened family membrane region; InterPro: IPR000539 The frizzled (fz) locus of Drosophila coordinates the cytoskeletons of epidermal cells, producing a parallel array of cuticular hairs and bristles [, ]. In fz mutants, the orientation of individual hairs with respect both to their neighbours and to the organism as a whole is altered. In the wild-type wing, all hairs point towards the distal tip []. In the developing wing, fz has 2 functions: it is required for the proximal-distal transmission of an intracellular polarity signal; and it is required for cells to respond to the polarity signal. Fz produces an mRNA that encodes an integral membrane protein with 7 putative transmembrane (TM) domains. This protein should contain both extracellular and cytoplasmic domains, which could function in the transmission and interpretation of polarity information []. This signature is usually found downstream of the Fz domain (IPR000024 from INTERPRO); GO: 0007166 cell surface receptor linked signaling pathway, 0016020 membrane
Probab=96.42 E-value=0.38 Score=42.04 Aligned_cols=139 Identities=12% Similarity=0.153 Sum_probs=90.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhHHHHHHHHHhHHHHhhhhhHHHHH-H-hc-c----------cc
Q psy16862 41 FIIKGMLMCFIIITAILGNLLVIISVIKHRKLRIITNYFVVSLAFADLLVALCVMPFNAIV-S-LT-D----------EW 107 (303)
Q Consensus 41 ~~~~~~~~~~i~i~gl~~N~~vl~~i~~~~~~~~~~~~ll~~Lai~Dl~~~l~~~~~~~~~-~-~~-~----------~~ 107 (303)
.-..-..+.+...+..+.+++.+.++.-+.+.-.-...-+..++++-++.++..+...+.. . .. + ..
T Consensus 8 ~~~~~~wi~~~s~ic~~~t~ftv~Tf~id~~rf~yPerpi~fl~~Cy~~~s~~~l~~~~~g~~~i~C~~~~~~~~~~~~~ 87 (328)
T PF01534_consen 8 KEFARIWIGVWSVICFISTLFTVLTFLIDRSRFRYPERPIIFLSFCYFIVSLGYLIRFILGRESIACRSQDGTERILVQD 87 (328)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccCCccchhhHHHHHHHHHHHHHHHHHhccCCCcCCCcCCCCceeEeec
Confidence 3444556666667777777777777776655334445666677777666655444433331 0 00 0 11
Q ss_pred ccccccchhchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccchhhhHHHHHHHHhhhhhhhhccccccc
Q psy16862 108 YFGYFMCDVWNSFDVYFSTASILHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLSNVWILPGIISFTPIMLGW 181 (303)
Q Consensus 108 ~~~~~~C~~~~~~~~~~~~~s~~~l~~iaidRy~aI~~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~~~~ 181 (303)
..++..|.+...+.+++..++...-+.+++.=|++....-.+..+ .++....-++.|.++++..+.....+.
T Consensus 88 ~~~~~~C~~~F~l~Yyf~mAa~~WWviLt~~W~lsa~~kw~~e~i--~~~s~yfH~~aW~iP~~~ti~vL~~~~ 159 (328)
T PF01534_consen 88 GLENSSCTVVFLLLYYFGMAASLWWVILTLTWFLSAGLKWGSEAI--EKKSSYFHLVAWGIPAVLTIAVLALRK 159 (328)
T ss_pred CCCCCchhhHHHHHHHHHhHHHHHHHHHHHHHHHHhhcccCcchh--hhhcchhhhHHhhhhHHHHHHHHHhcc
Confidence 234679999999999998888888888899999988765442322 355666788899999998877655444
No 34
>PF10319 7TM_GPCR_Srj: Serpentine type 7TM GPCR chemoreceptor Srj; InterPro: IPR019423 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class j (Srj) from the Str superfamily [, ]. The Srj family is designated as the out-group based on its location in preliminary phylogenetic analyses of the entire superfamily [].
Probab=96.40 E-value=0.089 Score=45.12 Aligned_cols=119 Identities=13% Similarity=0.073 Sum_probs=69.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhcccccc-cchhHHHHHHHHHhHHHHhhhhhH--HHHHHh-------ccccccccc-c
Q psy16862 45 GMLMCFIIITAILGNLLVIISVIKHRKLR-IITNYFVVSLAFADLLVALCVMPF--NAIVSL-------TDEWYFGYF-M 113 (303)
Q Consensus 45 ~~~~~~i~i~gl~~N~~vl~~i~~~~~~~-~~~~~ll~~Lai~Dl~~~l~~~~~--~~~~~~-------~~~~~~~~~-~ 113 (303)
-.+--+.++++.+.|-+.++.+..+++.+ ....++++.-|+-|++..+.-+.. .+...- .++|-.... .
T Consensus 8 ~~~Pk~~~~lsf~~Np~fiyli~~~~~~~~G~Yr~LL~~Fa~fn~~~S~~~~~vp~~vh~yry~F~~fi~dG~F~~~s~l 87 (310)
T PF10319_consen 8 HYIPKIFGILSFIVNPIFIYLIFTEKKSQFGNYRYLLLFFAIFNLIYSVVDLLVPICVHGYRYAFVVFISDGPFFEKSEL 87 (310)
T ss_pred HHHHHHHHHHHHHHhhhhheeEEcccccccccHHHHHHHHHHHHHHHHHHHHHhhheeeccceEEEEEEcCCcCcCcchH
Confidence 34445678889999999999888877765 366788899999999987764332 222211 112211111 1
Q ss_pred chhchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccchhhhHHHHH
Q psy16862 114 CDVWNSFDVYFSTASILHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLS 163 (303)
Q Consensus 114 C~~~~~~~~~~~~~s~~~l~~iaidRy~aI~~P~~~~~~~t~~~~~~~i~ 163 (303)
-.......-.+..+|-..+..=-+.||+++.+|....+...+......+.
T Consensus 88 ~~~~ls~RCsfIs~sYaIL~~HFvYRYl~l~~~~~~~~~F~p~gl~~s~~ 137 (310)
T PF10319_consen 88 GQHLLSIRCSFISGSYAILHIHFVYRYLVLFNSKFINKYFMPYGLIGSIL 137 (310)
T ss_pred HHHHHHHHHHHHhhhHHHHHHHHHHHHHHHhCcHHHHHhhhhHHHHHHHH
Confidence 11111222223334445566667889999999966444444444333333
No 35
>PF06681 DUF1182: Protein of unknown function (DUF1182); InterPro: IPR010601 This family consists of several hypothetical proteins of around 360 residues in length and seems to be specific to Caenorhabditis elegans. The function of this family is unknown.
Probab=96.32 E-value=0.11 Score=41.49 Aligned_cols=41 Identities=15% Similarity=0.368 Sum_probs=30.9
Q ss_pred ccccccchhchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC
Q psy16862 108 YFGYFMCDVWNSFDVYFSTASILHLCCISVDRYYAIVKPLQ 148 (303)
Q Consensus 108 ~~~~~~C~~~~~~~~~~~~~s~~~l~~iaidRy~aI~~P~~ 148 (303)
...-.+|.+.--.......+.....+++|++||+.|.+-.+
T Consensus 111 ~ytI~vCSflk~f~S~t~s~~~vip~aVAIyRy~~VV~k~r 151 (226)
T PF06681_consen 111 QYTIFVCSFLKNFTSSTTSCAQVIPVAVAIYRYLIVVLKKR 151 (226)
T ss_pred eEEEEeehHHHhHHHHHhhhhccchhhhhhhhhheeeeccc
Confidence 34667898876666666666666777899999999998655
No 36
>PF03125 Sre: C. elegans Sre G protein-coupled chemoreceptor; InterPro: IPR004151 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class e (Sre) from the Sra superfamily []. ; GO: 0004888 transmembrane signaling receptor activity, 0007606 sensory perception of chemical stimulus, 0016021 integral to membrane
Probab=95.89 E-value=0.66 Score=41.43 Aligned_cols=112 Identities=13% Similarity=0.132 Sum_probs=63.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhHHHHHHHHHh---HHHHhhhhhHHHHHHhcc---------ccc
Q psy16862 41 FIIKGMLMCFIIITAILGNLLVIISVIKHRKLRIITNYFVVSLAFAD---LLVALCVMPFNAIVSLTD---------EWY 108 (303)
Q Consensus 41 ~~~~~~~~~~i~i~gl~~N~~vl~~i~~~~~~~~~~~~ll~~Lai~D---l~~~l~~~~~~~~~~~~~---------~~~ 108 (303)
..+..++-.++.+++..-....+.++.+-+..|.-.+++.......- ++.-+...|........+ .|.
T Consensus 26 ~~~~~i~e~~l~~~~~~~~~~~~~i~~~~r~FH~N~~~l~~~~~~~w~e~~i~k~~i~~y~~~~~~~~~~~~~~~~~~~t 105 (365)
T PF03125_consen 26 YIILLIFELILYIICFYFTFKSCYIILKIRQFHRNLNILFIFILLQWFELIIGKILIIPYQIGIITLGDISIGTYIQWWT 105 (365)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhceeeecCccchhhhhhhc
Confidence 44555555566666666666666777777777765555555444333 333333444433222111 111
Q ss_pred ccc-----------ccc-hhchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcc
Q psy16862 109 FGY-----------FMC-DVWNSFDVYFSTASILHLCCISVDRYYAIVKPLQYPII 152 (303)
Q Consensus 109 ~~~-----------~~C-~~~~~~~~~~~~~s~~~l~~iaidRy~aI~~P~~~~~~ 152 (303)
.+. ... .+.+++.........+.+.++++||..|-..--.|.+.
T Consensus 106 ~~~~~m~~v~~~~~~~~l~~~~~l~~~y~~~~~~~~~~~~iER~~AT~~i~dYEk~ 161 (365)
T PF03125_consen 106 DDPSEMIPVNSFNDSWPLFIGGFLRWHYMFSAIFCLLAIVIERCFATYFIKDYEKK 161 (365)
T ss_pred cCcccceeeeccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 111 111 12234555555666899999999999999998888764
No 37
>PF02175 7TM_GPCR_Srb: Serpentine type 7TM GPCR chemoreceptor Srb; InterPro: IPR002184 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class b (Srb) from the Sra superfamily []. Srb receptors contain 6-8 hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures.; GO: 0004888 transmembrane signaling receptor activity, 0007606 sensory perception of chemical stimulus, 0016021 integral to membrane
Probab=95.87 E-value=0.81 Score=37.56 Aligned_cols=48 Identities=10% Similarity=0.145 Sum_probs=40.2
Q ss_pred cccccccchhchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccc
Q psy16862 107 WYFGYFMCDVWNSFDVYFSTASILHLCCISVDRYYAIVKPLQYPIIMN 154 (303)
Q Consensus 107 ~~~~~~~C~~~~~~~~~~~~~s~~~l~~iaidRy~aI~~P~~~~~~~t 154 (303)
...+...++.......+..+.++..-..+++||++|....-+|....+
T Consensus 78 LiI~~~~~K~~h~~~~flmT~~ml~PigftIERfiAl~~A~~YE~~r~ 125 (236)
T PF02175_consen 78 LIIDPTLFKYGHLTGLFLMTIPMLFPIGFTIERFIALKMAEKYENTRT 125 (236)
T ss_pred EEEchHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhccCce
Confidence 345666788888899999999999999999999999999888876433
No 38
>PF10322 7TM_GPCR_Sru: Serpentine type 7TM GPCR chemoreceptor Sru; InterPro: IPR003839 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class u (Sru) from the Srg superfamily [].
Probab=95.84 E-value=0.96 Score=39.20 Aligned_cols=123 Identities=13% Similarity=-0.015 Sum_probs=70.2
Q ss_pred chhchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccchhhhHHHHHHHHhhhhhhhhccccccccccCCCcccccC
Q psy16862 114 CDVWNSFDVYFSTASILHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLSNVWILPGIISFTPIMLGWYTTPDHKQYRKH 193 (303)
Q Consensus 114 C~~~~~~~~~~~~~s~~~l~~iaidRy~aI~~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~ 193 (303)
=++..++...+.+.+...-++.++-|.+.+..|..+.+. ..+-.....-++.+++++.+.|.+....+...-+..+..+
T Consensus 103 Lk~l~~~~~~~~Y~s~lf~~Lfc~~Rl~il~~p~~~~~i-~~~i~~~~~P~i~i~p~~~~f~~~pa~G~C~Ql~~Pf~fG 181 (307)
T PF10322_consen 103 LKILVFFYYYFNYSSMLFPVLFCLLRLIILYSPRNHKKI-CRKIFRIWIPFIFIYPFCFTFPMFPALGYCRQLDPPFPFG 181 (307)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHheeCccchhHH-HHhHHHHHHHHHHHHHHHHHHHccCCcEEEEeCCCCCCCC
Confidence 345567888889999999999999999999999876544 3344455666666666666655532211111111101000
Q ss_pred CCce-EEEEehhhHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHH
Q psy16862 194 HPNV-CIFVVNVYYALISSSISFWIPGFVMKKKKKRKKKKKKKKK 237 (303)
Q Consensus 194 ~~~~-c~~~~~~~~~~~~~~~~~~ip~~i~~~~y~~I~~~l~~~~ 237 (303)
.... -...+..........+...+-++.+++++...++++|+.+
T Consensus 182 AI~I~~~~~~~~~~~~~~~l~~s~~~~~~iii~N~lm~~Klr~~k 226 (307)
T PF10322_consen 182 AIIITSTGSWFNIRNSIFHLFFSIFWMISIIILNILMFFKLRKLK 226 (307)
T ss_pred EEEEEEEcccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 0000 0111111112222333344556677888888888888876
No 39
>PF02076 STE3: Pheromone A receptor; InterPro: IPR001499 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). Little is known about the structure and function of the mating factor receptors, STE2 and STE3. It is believed, however, that they are integral membrane proteins that may be involved in the response to mating factors on the cell membrane [, , ]. The amino acid sequences of both receptors contain high proportions of hydrophobic residues grouped into 7 domains, in a manner reminiscent of the rhodopsins and other receptors believed to interact with G-proteins. However, while a similar 3D framework has been proposed to account for this, there is no significant sequence similarity either between STE2 and STE3, or between these and the rhodopsin-type family: the receptors thus bear their own unique '7TM' signatures. The STE3 gene of Saccharomyces cerevisiae (Baker's yeast) is the cell-surface receptor that binds the 13-residue lipopeptide a-factor. Several related fungal pheromone receptor sequences are known: these include pheromone B alpha 1 and B alpha 3, and pheromone B beta 1 receptors from Schizophyllum commune; pheromone receptor 1 from Ustilago hordei; and pheromone receptors 1 and 2 from Ustilago maydis. Members of the family share about 20% sequence identity.; GO: 0004932 mating-type factor pheromone receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane
Probab=95.28 E-value=0.57 Score=40.12 Aligned_cols=117 Identities=16% Similarity=0.142 Sum_probs=64.5
Q ss_pred HHHHHHHHHHHhhcc-cccccchhHHHHHHHHHhHHHHhhhhhHHHHHHhccccccccccchhchHHHHHHHHHHHHHHH
Q psy16862 55 AILGNLLVIISVIKH-RKLRIITNYFVVSLAFADLLVALCVMPFNAIVSLTDEWYFGYFMCDVWNSFDVYFSTASILHLC 133 (303)
Q Consensus 55 gl~~N~~vl~~i~~~-~~~~~~~~~ll~~Lai~Dl~~~l~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~s~~~l~ 133 (303)
+++|.++.+.-+.-+ |....++-.++..+.++++...+-. ++..-.. ...-+..|.+..-+.......-....+
T Consensus 4 s~i~~~l~i~Pl~wH~ra~N~~~~~li~Wl~l~nl~~~INa----iIW~~n~-~~~~~~wCDI~~kl~~~~~~g~~~a~l 78 (283)
T PF02076_consen 4 SFIAFVLLIVPLPWHWRARNIGTCSLIFWLFLSNLIYFINA----IIWRDND-INWWPVWCDISTKLIIGSSVGIPAASL 78 (283)
T ss_pred HHHHHHHHHHHHHHHHHcCcHHHHHHHHHHHHHHHHHHHhh----hhcCCCC-CccCceeeeehHHHHHHHHHHHHHHHH
Confidence 344444444333322 2223577888899999998876532 1211111 111227999988887777777666666
Q ss_pred HHHHHHHHHHhcCCCCC--cccchhhhHHHHHHHHhhhhhhhhccc
Q psy16862 134 CISVDRYYAIVKPLQYP--IIMNQNTVLLMLSNVWILPGIISFTPI 177 (303)
Q Consensus 134 ~iaidRy~aI~~P~~~~--~~~t~~~~~~~i~~~w~~~~~~~~~~~ 177 (303)
+|+-.-| .|..+-... ..-.+|+..+=+.+++.+.++.....+
T Consensus 79 cI~r~L~-~I~s~r~~~~~~~~~rr~~~~d~~i~~g~Pil~m~l~y 123 (283)
T PF02076_consen 79 CIMRRLY-RIASARAVSLSRSEKRRRIIIDLLICFGIPILQMALHY 123 (283)
T ss_pred HHHHHHH-HHhccccCCCChHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 6665544 344433211 111256666666667777766655443
No 40
>PF10669 Phage_Gp23: Protein gp23 (Bacteriophage A118); InterPro: IPR018926 This entry is represented by the major tail subunit protein, Gp23 of Listeria phage A118 and prophage found in Bacilli. The function is currently unknown.
Probab=91.77 E-value=1.2 Score=30.66 Aligned_cols=25 Identities=8% Similarity=0.044 Sum_probs=12.6
Q ss_pred hhhhhhhhhHHHHHHHHHHHHHHHH
Q psy16862 212 SISFWIPGFVMKKKKKRKKKKKKKK 236 (303)
Q Consensus 212 ~~~~~ip~~i~~~~y~~I~~~l~~~ 236 (303)
.+.++..++|+++.-.+|+..-+.-
T Consensus 20 A~L~i~~FiILLIi~~~IW~~~r~~ 44 (121)
T PF10669_consen 20 AFLFIVVFIILLIITKSIWHDSRQV 44 (121)
T ss_pred HHHHHHHHHHHHHHHHHHhhhHHHH
Confidence 3334444555555566666544433
No 41
>PF15086 UPF0542: Uncharacterised protein family UPF0542
Probab=91.74 E-value=1.2 Score=28.93 Aligned_cols=20 Identities=15% Similarity=0.458 Sum_probs=13.2
Q ss_pred hHHHHHhhhhhhhhhhHHHH
Q psy16862 205 YYALISSSISFWIPGFVMKK 224 (303)
Q Consensus 205 ~~~~~~~~~~~~ip~~i~~~ 224 (303)
.|.++..++..+.|++++..
T Consensus 19 P~~Fl~~vll~LtPlfiisa 38 (74)
T PF15086_consen 19 PYEFLTTVLLILTPLFIISA 38 (74)
T ss_pred hHHHHHHHHHHHhHHHHHHH
Confidence 46666667777788766544
No 42
>KOG2302|consensus
Probab=89.26 E-value=16 Score=36.59 Aligned_cols=21 Identities=24% Similarity=0.414 Sum_probs=16.6
Q ss_pred ccchhHHHHHHHHHhHHHHhh
Q psy16862 73 RIITNYFVVSLAFADLLVALC 93 (303)
Q Consensus 73 ~~~~~~ll~~Lai~Dl~~~l~ 93 (303)
.+..+.+++..++.|++....
T Consensus 1194 WN~LDgflv~vsviDilvs~a 1214 (1956)
T KOG2302|consen 1194 WNVLDGFLVAVSVIDILVSQA 1214 (1956)
T ss_pred HHhhhHHHHHHHHHHHHHHHh
Confidence 466788999999999987543
No 43
>PF10669 Phage_Gp23: Protein gp23 (Bacteriophage A118); InterPro: IPR018926 This entry is represented by the major tail subunit protein, Gp23 of Listeria phage A118 and prophage found in Bacilli. The function is currently unknown.
Probab=87.94 E-value=2.3 Score=29.36 Aligned_cols=8 Identities=0% Similarity=-0.001 Sum_probs=3.2
Q ss_pred HHHHHHHH
Q psy16862 232 KKKKKKKK 239 (303)
Q Consensus 232 ~l~~~~~~ 239 (303)
.++-.++.
T Consensus 36 ~IW~~~r~ 43 (121)
T PF10669_consen 36 SIWHDSRQ 43 (121)
T ss_pred HHhhhHHH
Confidence 34444433
No 44
>KOG4349|consensus
Probab=87.75 E-value=8.1 Score=28.07 Aligned_cols=85 Identities=7% Similarity=0.098 Sum_probs=43.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-chhHHHHHHHHHhHHHHhhhhhHHHHHHhccccccccccchhchH
Q psy16862 41 FIIKGMLMCFIIITAILGNLLVIISVIKHRKLRI-ITNYFVVSLAFADLLVALCVMPFNAIVSLTDEWYFGYFMCDVWNS 119 (303)
Q Consensus 41 ~~~~~~~~~~i~i~gl~~N~~vl~~i~~~~~~~~-~~~~ll~~Lai~Dl~~~l~~~~~~~~~~~~~~~~~~~~~C~~~~~ 119 (303)
+-++........++-+.+-++.++.+.++|.-|- +.-+++. --+.-+..++..-..............+.+.|.+.+.
T Consensus 47 y~vFLWal~Ss~fih~~A~ilalFTLRkHkygrF~si~iliM-gfi~P~~~Gi~tS~viA~~~~As~~~ms~~y~~i~G~ 125 (143)
T KOG4349|consen 47 YSVFLWALLSSMFIHLGATILALFTLRKHKYGRFISIPILIM-GFIIPFVFGIPTSMVLALAMYASKNAVSTWYCAIMGI 125 (143)
T ss_pred HHHHHHHHHHHHHHHhhHHHHHHHHhhcCccceeeehHHHHH-HHHHHHHhcchHHHHHHHHHHHccCCccHHHHHHHHH
Confidence 3444444445556677778888888877666553 2222222 2223333333222122222223345667778888776
Q ss_pred HHHHHHH
Q psy16862 120 FDVYFST 126 (303)
Q Consensus 120 ~~~~~~~ 126 (303)
.+.++..
T Consensus 126 ~QT~~i~ 132 (143)
T KOG4349|consen 126 IQTLLIF 132 (143)
T ss_pred HHHHHHH
Confidence 6655443
No 45
>PRK00247 putative inner membrane protein translocase component YidC; Validated
Probab=81.02 E-value=3.6 Score=37.33 Aligned_cols=29 Identities=21% Similarity=0.380 Sum_probs=12.8
Q ss_pred hhhhhhHHHHHHHHHhHhhhhhccccccc
Q psy16862 271 KKKKKTTKKKKRKKKRKKKKNQHHRDVWQ 299 (303)
Q Consensus 271 ~k~~~~~~~~~~~~~~~~~~~~~~~~~~~ 299 (303)
.++.....+|+++|..+++||+..||+.+
T Consensus 335 ~~~~~~~~~k~~~k~~~~~~~~~~~~~~~ 363 (429)
T PRK00247 335 TRTAEKNEAKARKKEIAQKRRAAEREINR 363 (429)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33333334444444444444444444443
No 46
>PF13853 7tm_4: Olfactory receptor
Probab=77.04 E-value=0.085 Score=40.30 Aligned_cols=79 Identities=14% Similarity=0.142 Sum_probs=47.2
Q ss_pred HHHHHHHhhhhhhhhccccccc---cccCCCcccccCCC---ceEEE-EehhhHHHHHhhhhhhhhhhHHHHHHHHHHHH
Q psy16862 160 LMLSNVWILPGIISFTPIMLGW---YTTPDHKQYRKHHP---NVCIF-VVNVYYALISSSISFWIPGFVMKKKKKRKKKK 232 (303)
Q Consensus 160 ~~i~~~w~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~---~~c~~-~~~~~~~~~~~~~~~~ip~~i~~~~y~~I~~~ 232 (303)
.+.++.|+.+++.+++...... ..+..+..+|+ .+ -.|.. ..++.+......+..+.|+.+++++|..|+..
T Consensus 5 ~l~~~~~~~~~~~~l~~~~~~~~l~~~nii~~f~c~-~~ll~LaC~dt~~~~~~~~~~~~~~~~~~~~~Il~SY~~Il~a 83 (144)
T PF13853_consen 5 LLAAGSWLSGLLNSLPHTLLTLSLCFCNIIHHFCCD-PPLLKLACSDTSINEIVGFVVAIFILLGPLLLILFSYIRILRA 83 (144)
T ss_pred hhhHHHHHHHHHHHHHHHHHHeeCCCCCCCcceeeC-HHHhcccCCchhhhheeeecccceeEEEEeeccccceeEEEeh
Confidence 3455667777766555432211 11222333333 22 25643 44555566666677889999999999999998
Q ss_pred HHHHHHH
Q psy16862 233 KKKKKKK 239 (303)
Q Consensus 233 l~~~~~~ 239 (303)
+.|....
T Consensus 84 vlki~S~ 90 (144)
T PF13853_consen 84 VLKIPSA 90 (144)
T ss_pred hhccccc
Confidence 8776533
No 47
>PF07074 TRAP-gamma: Translocon-associated protein, gamma subunit (TRAP-gamma); InterPro: IPR009779 This family consists of several eukaryotic translocon-associated protein, gamma subunit (TRAP-gamma) sequences. The translocation site (translocon), at which nascent polypeptides pass through the endoplasmic reticulum membrane, contains a component previously called 'signal sequence receptor' that is now renamed as 'translocon-associated protein' (TRAP). The TRAP complex is comprised of four membrane proteins alpha, beta, gamma and delta, which are present in a stoichiometric relation, and are genuine neighbours in intact microsomes. The gamma subunit is predicted to span the membrane four times [].; GO: 0006613 cotranslational protein targeting to membrane, 0005784 Sec61 translocon complex, 0030176 integral to endoplasmic reticulum membrane
Probab=76.64 E-value=4.1 Score=31.57 Aligned_cols=16 Identities=38% Similarity=0.482 Sum_probs=9.5
Q ss_pred Hhhhhhhhhccccccc
Q psy16862 166 WILPGIISFTPIMLGW 181 (303)
Q Consensus 166 w~~~~~~~~~~~~~~~ 181 (303)
|+-++++++.|.+..|
T Consensus 22 Y~nA~ivS~vPi~LF~ 37 (170)
T PF07074_consen 22 YGNALIVSAVPIWLFW 37 (170)
T ss_pred hhHHHHHHHHHHHHHH
Confidence 4555566666766555
No 48
>PF11044 TMEMspv1-c74-12: Plectrovirus spv1-c74 ORF 12 transmembrane protein; InterPro: IPR022743 This is a group of proteins expressed by Plectroviruses. The Plectroviruses are single-stranded DNA viruses belonging to the Inoviridae. This entry represents putative transmembrane proteins of unknown function.
Probab=76.42 E-value=13 Score=21.68 Aligned_cols=26 Identities=15% Similarity=-0.129 Sum_probs=13.4
Q ss_pred hhhhhhhhhHHHHHHHHHHHHHHHHH
Q psy16862 212 SISFWIPGFVMKKKKKRKKKKKKKKK 237 (303)
Q Consensus 212 ~~~~~ip~~i~~~~y~~I~~~l~~~~ 237 (303)
++..++.+.+....-..|+..+++-+
T Consensus 8 iFsvvIil~If~~iGl~IyQkikqIr 33 (49)
T PF11044_consen 8 IFSVVIILGIFAWIGLSIYQKIKQIR 33 (49)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444444455555556666655544
No 49
>PF01528 Herpes_glycop: Herpesvirus glycoprotein M; InterPro: IPR000785 The Equid herpesvirus 1 (Equine herpesvirus 1, EHV-1) protein belongs to a family of sequences that groups together Human herpesvirus 1 (HHV-1) UL10, EHV-1 52, Human herpesvirus 3 (HHV-3) 50, Epstein-Barr virus (strain GD1) (HHV-4) (Human herpesvirus 4) BBRF3, Human herpesvirus 1 (HHV-1) 39 and Human cytomegalovirus (HHV-5) UL100. Little is yet known about the properties of the protein. However, its amino acid sequence is highly hydrophobic, containing 8 putative membrane-spanning regions, and it is therefore believed to be either membrane-associated or transmembrane.; GO: 0016020 membrane
Probab=75.91 E-value=59 Score=29.12 Aligned_cols=36 Identities=6% Similarity=-0.176 Sum_probs=24.2
Q ss_pred HHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHhh
Q psy16862 206 YALISSSISFWIPGFVMKKKKKRKKKKKKKKKKKRK 241 (303)
Q Consensus 206 ~~~~~~~~~~~ip~~i~~~~y~~I~~~l~~~~~~~~ 241 (303)
+.....+..-++|++.++..-+++.|..+.++++..
T Consensus 301 ~~~~i~~~la~i~~i~l~~~vvR~vR~~~~hr~~~~ 336 (374)
T PF01528_consen 301 LHTGIAINLAVIAIICLIMMVVRLVRAFLYHRRRST 336 (374)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccch
Confidence 334444455667778888888888888887765433
No 50
>COG1862 YajC Preprotein translocase subunit YajC [Intracellular trafficking and secretion]
Probab=71.18 E-value=5.4 Score=28.02 Aligned_cols=36 Identities=8% Similarity=0.001 Sum_probs=26.5
Q ss_pred hhhhhhhHHHHHHHHHHHHHHHHHHHhhhhhccccc
Q psy16862 214 SFWIPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHR 249 (303)
Q Consensus 214 ~~~ip~~i~~~~y~~I~~~l~~~~~~~~~~~~~~~~ 249 (303)
.++.+++++.++|..++|-.+++.+++++...+..+
T Consensus 11 ~ll~~vl~~~ifyFli~RPQrKr~K~~~~ml~sL~k 46 (97)
T COG1862 11 LLLPLVLIFAIFYFLIIRPQRKRMKEHQELLNSLKK 46 (97)
T ss_pred HHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHHhccC
Confidence 344556677888999999999988888877665433
No 51
>PF11671 Apis_Csd: Complementary sex determiner protein; InterPro: IPR021007 Sex determination proteins are found in eukaryotes. Proteins in this family are typically between 168 and 410 amino acids in length. It plays a role in the gender determination of around 20% of all animals. In the honeybee, the mechanism of sex determination depends on the complementary sex determiner (csd) gene which produces an SR-type protein. Males are homozygous while females are homozygous for the csd gene. Heterozygosity generates an active protein which initiates female development []. This entry represents the C-terminal end of the sex determination protein.
Probab=70.57 E-value=3.6 Score=30.14 Aligned_cols=38 Identities=11% Similarity=0.407 Sum_probs=23.7
Q ss_pred hhHHHHHHHhhhhhhhhHHHHHHHHHhHhhhhhccccc
Q psy16862 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDV 297 (303)
Q Consensus 260 ~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~ 297 (303)
.++.+.|.+++.++-++++++....++.+.|..+.|.|
T Consensus 12 reqkS~KnE~sYr~YRetSrERSRdRrEReRsRE~kii 49 (146)
T PF11671_consen 12 REQKSYKNENSYREYRETSRERSRDRRERERSRERKII 49 (146)
T ss_pred hhhhhhcchhHHHHHHHhhhhhhhhhhhhhhhcccccc
Confidence 34444555566666677777777777666666665554
No 52
>PF08229 SHR3_chaperone: ER membrane protein SH3 ; InterPro: IPR013248 This family of proteins are membrane localised chaperones that are required for correct plasma membrane localisation of amino acid permeases (AAPs) []. Shr3 prevents AAPs proteins from aggregating and assists in their correct folding. In the absence of Shr3, AAPs are retained in the ER.
Probab=68.32 E-value=11 Score=30.23 Aligned_cols=13 Identities=0% Similarity=-0.404 Sum_probs=5.5
Q ss_pred hhHHHHHHHHHHH
Q psy16862 219 GFVMKKKKKRKKK 231 (303)
Q Consensus 219 ~~i~~~~y~~I~~ 231 (303)
++-++++-+.-++
T Consensus 144 LvGVLvLQaG~~Y 156 (196)
T PF08229_consen 144 LVGVLVLQAGQWY 156 (196)
T ss_pred HHHHHHHHhhHHH
Confidence 3334444444444
No 53
>PF06454 DUF1084: Protein of unknown function (DUF1084); InterPro: IPR009457 This entry consists of several hypothetical plant specific proteins of unknown function.
Probab=63.13 E-value=98 Score=26.56 Aligned_cols=22 Identities=9% Similarity=0.128 Sum_probs=12.3
Q ss_pred chhHHHHHHHHHHHHHHHHHHH
Q psy16862 37 HIFPFIIKGMLMCFIIITAILG 58 (303)
Q Consensus 37 ~~~~~~~~~~~~~~i~i~gl~~ 58 (303)
........+.++.+++++|++=
T Consensus 24 ~~~~~~~La~iy~~v~~~aliQ 45 (281)
T PF06454_consen 24 QDGLFYALAAIYLLVALVALIQ 45 (281)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 3334566666666666665544
No 54
>PF10325 7TM_GPCR_Srz: Serpentine type 7TM GPCR chemoreceptor Srz; InterPro: IPR018817 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class z (Srz), a solo family amongst the superfamilies of chemoreceptors [, ]. The genes encoding Srz appear to be under strong adaptive evolutionary pressure [].
Probab=62.72 E-value=93 Score=26.17 Aligned_cols=111 Identities=12% Similarity=0.040 Sum_probs=54.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-CCcccchhhhHHHHHHHHhhhhhhhhccccccccccCCCcccccCCCce
Q psy16862 119 SFDVYFSTASILHLCCISVDRYYAIVKPLQ-YPIIMNQNTVLLMLSNVWILPGIISFTPIMLGWYTTPDHKQYRKHHPNV 197 (303)
Q Consensus 119 ~~~~~~~~~s~~~l~~iaidRy~aI~~P~~-~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 197 (303)
+.......+--..+.++|++|++---.|-. .....+.+.....+..+..+.++-.+............+. . ..
T Consensus 87 ~~l~i~~~v~~lllsLLAIqRFllyFfP~~Ek~v~~~~k~~~~~I~~lY~~~~~k~i~~~~~~~~~~~~~~--~---~~- 160 (267)
T PF10325_consen 87 FILYIITQVFHLLLSLLAIQRFLLYFFPSSEKYVNFSQKNIKKIIWFLYIFFILKDIVFFIWYFISFNNES--T---EE- 160 (267)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhCCchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc--c---hh-
Confidence 344444456667899999999999999963 1222344444444444444444442222111110000000 0 00
Q ss_pred EEEEehhhHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHH
Q psy16862 198 CIFVVNVYYALISSSISFWIPGFVMKKKKKRKKKKKKKKKK 238 (303)
Q Consensus 198 c~~~~~~~~~~~~~~~~~~ip~~i~~~~y~~I~~~l~~~~~ 238 (303)
.......+ .....+.. +-+.+...+|+-|...+||.+.
T Consensus 161 -~~~~~~~~-~~~~~~~~-~ll~~S~lLYIPI~isirK~~~ 198 (267)
T PF10325_consen 161 -IETFSYIY-VIFYIILN-ILLFLSALLYIPIFISIRKLSH 198 (267)
T ss_pred -hhHHHHHH-HHHHHHhh-HHHHHHHHHHHHHHHHHHHhhc
Confidence 00111112 11222222 3356677899999998887774
No 55
>KOG4007|consensus
Probab=62.23 E-value=22 Score=28.25 Aligned_cols=19 Identities=11% Similarity=0.386 Sum_probs=9.3
Q ss_pred HHHHHhHhhhhhccccccc
Q psy16862 281 KRKKKRKKKKNQHHRDVWQ 299 (303)
Q Consensus 281 ~~~~~~~~~~~~~~~~~~~ 299 (303)
.-|.|=|++=+|..|.||.
T Consensus 205 ~~Q~rWk~qVqEQRr~ifd 223 (229)
T KOG4007|consen 205 HAQQRWKLQVQEQRRHIFD 223 (229)
T ss_pred HHHHHHHHHHHHHHHhhhh
Confidence 3344444455555555554
No 56
>KOG0345|consensus
Probab=61.40 E-value=7.5 Score=35.48 Aligned_cols=31 Identities=35% Similarity=0.505 Sum_probs=14.8
Q ss_pred hhhhhhhhHHHHHHHHHhHhhhhhccccccc
Q psy16862 269 KKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQ 299 (303)
Q Consensus 269 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~ 299 (303)
|++|+++++.++.|+-+|+++....+|...|
T Consensus 506 K~~k~~~k~~~~~rka~k~~k~~~k~r~~~q 536 (567)
T KOG0345|consen 506 KEAKKDKKAVRESRKAKKKRKLEGKKRQLIQ 536 (567)
T ss_pred hhHHHHHhhhhhhhhhhhhhhHHHHHHHHhh
Confidence 3444445555555555554444444444433
No 57
>COG1230 CzcD Co/Zn/Cd efflux system component [Inorganic ion transport and metabolism]
Probab=56.99 E-value=1.1e+02 Score=26.59 Aligned_cols=57 Identities=19% Similarity=0.163 Sum_probs=37.2
Q ss_pred HHHHHHHHHHHHHHHHHHHhhcccccccchhHHHHHHH-HHhHHHHhhhhhHHHHHHhcc
Q psy16862 47 LMCFIIITAILGNLLVIISVIKHRKLRIITNYFVVSLA-FADLLVALCVMPFNAIVSLTD 105 (303)
Q Consensus 47 ~~~~i~i~gl~~N~~vl~~i~~~~~~~~~~~~ll~~La-i~Dl~~~l~~~~~~~~~~~~~ 105 (303)
.+..+.++|++.|.+..+.+.+.+. ...|.=-+.|- ++|++..+..+...+...+.+
T Consensus 126 ~ml~va~~GL~vN~~~a~ll~~~~~--~~lN~r~a~LHvl~D~Lgsv~vIia~i~i~~~~ 183 (296)
T COG1230 126 GMLVVAIIGLVVNLVSALLLHKGHE--ENLNMRGAYLHVLGDALGSVGVIIAAIVIRFTG 183 (296)
T ss_pred chHHHHHHHHHHHHHHHHHhhCCCc--ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhC
Confidence 4567788999999999998877621 12233333333 379888776666656655555
No 58
>PF08229 SHR3_chaperone: ER membrane protein SH3 ; InterPro: IPR013248 This family of proteins are membrane localised chaperones that are required for correct plasma membrane localisation of amino acid permeases (AAPs) []. Shr3 prevents AAPs proteins from aggregating and assists in their correct folding. In the absence of Shr3, AAPs are retained in the ER.
Probab=54.94 E-value=5.6 Score=31.92 Aligned_cols=7 Identities=29% Similarity=0.444 Sum_probs=3.2
Q ss_pred HHHhcCC
Q psy16862 141 YAIVKPL 147 (303)
Q Consensus 141 ~aI~~P~ 147 (303)
+-+.+|-
T Consensus 78 iKl~kp~ 84 (196)
T PF08229_consen 78 IKLYKPS 84 (196)
T ss_pred HHHcCCc
Confidence 3444553
No 59
>PHA03237 envelope glycoprotein M; Provisional
Probab=54.15 E-value=1.8e+02 Score=26.65 Aligned_cols=30 Identities=3% Similarity=-0.061 Sum_probs=19.5
Q ss_pred hhhhhhhhhHHHHHHHHHHHHHHHHHHHhh
Q psy16862 212 SISFWIPGFVMKKKKKRKKKKKKKKKKKRK 241 (303)
Q Consensus 212 ~~~~~ip~~i~~~~y~~I~~~l~~~~~~~~ 241 (303)
+..-++|++.++..-+++.|...-|+++..
T Consensus 330 ~~Laviail~l~m~vvRlvRa~~yHr~~~t 359 (424)
T PHA03237 330 LVLAIFAVIIVIMLVVRLVRACLYHRRRST 359 (424)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Confidence 344556777777777788777666664433
No 60
>PRK06531 yajC preprotein translocase subunit YajC; Validated
Probab=53.92 E-value=14 Score=26.86 Aligned_cols=30 Identities=23% Similarity=0.183 Sum_probs=15.7
Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHhhhhhcc
Q psy16862 217 IPGFVMKKKKKRKKKKKKKKKKKRKKKKNQ 246 (303)
Q Consensus 217 ip~~i~~~~y~~I~~~l~~~~~~~~~~~~~ 246 (303)
+|+++++..++.+++-.+|++++.+....+
T Consensus 7 l~~vv~~~i~yf~iRPQkKr~Ke~~em~~s 36 (113)
T PRK06531 7 IMFVVMLGLIFFMQRQQKKQAQERQNQLNA 36 (113)
T ss_pred HHHHHHHHHHHheechHHHHHHHHHHHHHh
Confidence 444444444444566666666555554443
No 61
>PF14362 DUF4407: Domain of unknown function (DUF4407)
Probab=53.00 E-value=1.5e+02 Score=25.55 Aligned_cols=53 Identities=13% Similarity=-0.009 Sum_probs=29.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCCCcccchhhhHHHHHHHHhhhhhhhhcc
Q psy16862 124 FSTASILHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLSNVWILPGIISFTP 176 (303)
Q Consensus 124 ~~~~s~~~l~~iaidRy~aI~~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~ 176 (303)
....-+|.++++.+|||+...-............+..-+++..+++++++-|.
T Consensus 49 i~~glvwgl~I~~lDR~ivss~~~~~~~~~~~~~~~~R~~lAvliaivIs~pl 101 (301)
T PF14362_consen 49 IPFGLVWGLVIFNLDRFIVSSIRKSDGSRKRLLQALPRLLLAVLIAIVISEPL 101 (301)
T ss_pred HHHHHHHHHHHHHHHHHHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33457889999999999988654432211111223333344444445554443
No 62
>PF01102 Glycophorin_A: Glycophorin A; InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=51.55 E-value=20 Score=26.36 Aligned_cols=21 Identities=19% Similarity=0.487 Sum_probs=8.5
Q ss_pred HHHHHHHHHHHHHhhcccccc
Q psy16862 53 ITAILGNLLVIISVIKHRKLR 73 (303)
Q Consensus 53 i~gl~~N~~vl~~i~~~~~~~ 73 (303)
+.|++|=+++++.++++++.+
T Consensus 74 ~aGvIg~Illi~y~irR~~Kk 94 (122)
T PF01102_consen 74 MAGVIGIILLISYCIRRLRKK 94 (122)
T ss_dssp HHHHHHHHHHHHHHHHHHS--
T ss_pred HHHHHHHHHHHHHHHHHHhcc
Confidence 345555444444444433333
No 63
>KOG2302|consensus
Probab=50.31 E-value=3.1e+02 Score=28.33 Aligned_cols=22 Identities=9% Similarity=0.433 Sum_probs=11.8
Q ss_pred cchhHHHHHHHHHhHHHHhhhh
Q psy16862 74 IITNYFVVSLAFADLLVALCVM 95 (303)
Q Consensus 74 ~~~~~ll~~Lai~Dl~~~l~~~ 95 (303)
+..|+++...-++.+..-++.+
T Consensus 1159 tlsnyIFtaIfV~Em~lKVVAL 1180 (1956)
T KOG2302|consen 1159 TLSNYIFTAIFVVEMTLKVVAL 1180 (1956)
T ss_pred EecchHHHHHHHHHHHHHHHhh
Confidence 4455555555566655544433
No 64
>PHA03239 envelope glycoprotein M; Provisional
Probab=49.54 E-value=2.1e+02 Score=26.20 Aligned_cols=30 Identities=13% Similarity=-0.023 Sum_probs=19.5
Q ss_pred hhhhhhhhhHHHHHHHHHHHHHHHHHHHhh
Q psy16862 212 SISFWIPGFVMKKKKKRKKKKKKKKKKKRK 241 (303)
Q Consensus 212 ~~~~~ip~~i~~~~y~~I~~~l~~~~~~~~ 241 (303)
+..-++|++.++..-+++.|...-|+++..
T Consensus 336 ~~Laviail~l~~~ivRlvRa~~yHr~~~t 365 (429)
T PHA03239 336 GILAAFAVISIALAILRATRAYKFHKAANS 365 (429)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Confidence 344556777777777777777666664443
No 65
>PF02699 YajC: Preprotein translocase subunit; InterPro: IPR003849 Secretion across the inner membrane in some Gram-negative bacteria occurs via the preprotein translocase pathway. Proteins are produced in the cytoplasm as precursors, and require a chaperone subunit to direct them to the translocase component []. From there, the mature proteins are either targeted to the outer membrane, or remain as periplasmic proteins []. The translocase protein subunits are encoded on the bacterial chromosome. The translocase itself comprises 7 proteins, including a chaperone (SecB), ATPase (SecA), an integral membrane complex (SecY, SecE and SecG), and two additional membrane proteins that promote the release of the mature peptide into the periplasm (SecD and SecF) []. Other cytoplasmic/periplasmic proteins play a part in preprotein translocase activity, namely YidC and YajC []. The latter is bound in a complex to SecD and SecF, and plays a part in stabilising and regulating secretion through the SecYEG integral membrane component via SecA []. Homologues of the YajC gene have been found in a range of pathogenic and commensal microbes. Brucella abortis YajC- and SecD-like proteins were shown to stimulate a Th1 cell-mediated immune response in mice, and conferred protection when challenged with B.abortis []. Therefore, these proteins may have an antigenic role as well as a secretory one in virulent bacteria []. A number of previously uncharacterised "hypothetical" proteins also show similarity to E.coli YajC, suggesting that this family is wider than first thought []. More recently, the precise interactions between the E.coli SecYEG complex, SecD, SecF, YajC and YidC have been studied []. Rather than acting individually, the four proteins form a heterotetrameric complex and associate with the SecYEG heterotrimeric complex []. The SecF and YajC subunits link the complex to the integral membrane translocase. ; PDB: 2RDD_B.
Probab=49.21 E-value=32 Score=23.31 Aligned_cols=28 Identities=11% Similarity=0.018 Sum_probs=17.7
Q ss_pred hhHHHHHHHHHHHHHHHHHHHhhhhhcc
Q psy16862 219 GFVMKKKKKRKKKKKKKKKKKRKKKKNQ 246 (303)
Q Consensus 219 ~~i~~~~y~~I~~~l~~~~~~~~~~~~~ 246 (303)
++++.++|...++-.+|++++++....+
T Consensus 9 v~~~~i~yf~~~rpqkk~~k~~~~m~~~ 36 (82)
T PF02699_consen 9 VIIFVIFYFLMIRPQKKQQKEHQEMLAS 36 (82)
T ss_dssp HHHHHHHHHHTHHHHHHHHHHHTTGGG-
T ss_pred HHHHHHHhhheecHHHHHHHHHHHHHHc
Confidence 3556666777777777766666655543
No 66
>PF06783 UPF0239: Uncharacterised protein family (UPF0239); InterPro: IPR009621 This is a group of transmembrane proteins of unknown function.; GO: 0016021 integral to membrane
Probab=47.69 E-value=4.4 Score=27.43 Aligned_cols=19 Identities=21% Similarity=0.224 Sum_probs=10.4
Q ss_pred hhHHHHHhhhhhhhhhhHH
Q psy16862 204 VYYALISSSISFWIPGFVM 222 (303)
Q Consensus 204 ~~~~~~~~~~~~~ip~~i~ 222 (303)
..|.++...++-+++++.+
T Consensus 22 lRYGLf~GAIFQliCilAi 40 (85)
T PF06783_consen 22 LRYGLFVGAIFQLICILAI 40 (85)
T ss_pred HHHHHHHHHHHHHHHHHhe
Confidence 4566666666555554433
No 67
>PRK05886 yajC preprotein translocase subunit YajC; Validated
Probab=47.15 E-value=30 Score=24.90 Aligned_cols=31 Identities=10% Similarity=0.170 Sum_probs=18.1
Q ss_pred hhhhh-HHHHHHHHHHHHHHHHHHHhhhhhcc
Q psy16862 216 WIPGF-VMKKKKKRKKKKKKKKKKKRKKKKNQ 246 (303)
Q Consensus 216 ~ip~~-i~~~~y~~I~~~l~~~~~~~~~~~~~ 246 (303)
++|++ ++.++|..+++-.+|+++.++....+
T Consensus 7 ll~lv~i~~i~yF~~iRPQkKr~K~~~~m~~~ 38 (109)
T PRK05886 7 FLPFLLIMGGFMYFASRRQRKAMQATIDLHES 38 (109)
T ss_pred HHHHHHHHHHHHHHHccHHHHHHHHHHHHHHh
Confidence 34544 44566777776666666655554443
No 68
>KOG4289|consensus
Probab=47.12 E-value=78 Score=33.62 Aligned_cols=61 Identities=13% Similarity=0.224 Sum_probs=43.3
Q ss_pred ccccchhchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccchhhhHHHHHHHHhhhhhhhhcc
Q psy16862 110 GYFMCDVWNSFDVYFSTASILHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLSNVWILPGIISFTP 176 (303)
Q Consensus 110 ~~~~C~~~~~~~~~~~~~s~~~l~~iaidRy~aI~~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~ 176 (303)
...+|.+...+..+.+.+....+++-+++-|.+..--. .........++.|.+..++....
T Consensus 2222 nq~~CtvvailLhf~~~stFaWlfl~gLhlYRml~e~v------~tg~mrfy~~vGwgvPa~itgla 2282 (2531)
T KOG4289|consen 2222 NQFYCTVVAILLHFTYLSTFAWLFLEGLHLYRMLTENV------DTGPMRFYYLVGWGVPALITGLA 2282 (2531)
T ss_pred CchhhHHHHHHHHHHHhhhHHHHHHHHHHHHHHHhccC------CCCceeEEEEeecchhhheeeee
Confidence 67899999999999988888888888888888877522 22222233445688777765444
No 69
>TIGR00739 yajC preprotein translocase, YajC subunit. While this protein is part of the preprotein translocase in Escherichia coli, it is not essential for viability or protein secretion. The N-terminus region contains a predicted membrane-spanning region followed by a region consisting almost entirely of residues with charged (acidic, basic, or zwitterionic) side chains. This small protein is about 100 residues in length, and is restricted to bacteria; however, this protein is absent from some lineages, including spirochetes and Mycoplasmas.
Probab=45.12 E-value=27 Score=23.79 Aligned_cols=28 Identities=14% Similarity=0.087 Sum_probs=18.5
Q ss_pred hHHHHHHHHHHHHHHHHHHHhhhhhccc
Q psy16862 220 FVMKKKKKRKKKKKKKKKKKRKKKKNQH 247 (303)
Q Consensus 220 ~i~~~~y~~I~~~l~~~~~~~~~~~~~~ 247 (303)
++++++|..+++-.+|++++++....+.
T Consensus 11 v~~~i~yf~~~rpqkK~~k~~~~m~~~L 38 (84)
T TIGR00739 11 LIFLIFYFLIIRPQRKRRKAHKKLIESL 38 (84)
T ss_pred HHHHHHHHheechHHHHHHHHHHHHHhC
Confidence 3455677777777777777776665543
No 70
>PF00003 7tm_3: 7 transmembrane sweet-taste receptor of 3 GCPR; InterPro: IPR017978 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). GPCR family 3 receptors (also known as family C) are structurally similar to other GPCRs, but do not show any significant sequence similarity and thus represent a distinct group. Structurally they are composed of four elements; an N-terminal signal sequence; a large hydrophilic extracellular agonist-binding region containing several conserved cysteine residues which could be involved in disulphide bonds; a shorter region containing seven transmembrane domains; and a C-terminal cytoplasmic domain of variable length []. Family 3 members include the metabotropic glutamate receptors, the extracellular calcium-sensing receptors, the gamma-amino-butyric acid (GABA) type B receptors, and the vomeronasal type-2 receptors [, , , ]. As these receptors regulate many important physiological processes they are potentially promising targets for drug development. This entry represents the C-terminal region of family 3 GPCR receptor proteins, which contains the seven transmembrane region. The seven TM regions assemble in such a way as to produce a docking pocket into which such molecules as cyclamate and lactisole have been found to bind and consequently confer the taste of sweetness []. ; GO: 0004930 G-protein coupled receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane
Probab=44.43 E-value=49 Score=27.19 Aligned_cols=29 Identities=10% Similarity=-0.087 Sum_probs=16.7
Q ss_pred cccccchhchHHHHHHHHHHHHHHHHHHH
Q psy16862 109 FGYFMCDVWNSFDVYFSTASILHLCCISV 137 (303)
Q Consensus 109 ~~~~~C~~~~~~~~~~~~~s~~~l~~iai 137 (303)
.++..|.+..++...+.......+.+=+.
T Consensus 51 ~s~~~C~~r~~~~~l~f~l~~~~ll~K~~ 79 (238)
T PF00003_consen 51 PSDILCTLRRWLFSLGFTLIFSALLAKTW 79 (238)
T ss_pred cCCcEEEEeeeeeeeehHhhhhHHHHhhh
Confidence 45678988776655555444444444333
No 71
>PRK03449 putative inner membrane protein translocase component YidC; Provisional
Probab=43.86 E-value=31 Score=29.98 Aligned_cols=28 Identities=4% Similarity=-0.155 Sum_probs=15.3
Q ss_pred HhhhhhhhhhhHHHHHHHHHHHHHHHHH
Q psy16862 210 SSSISFWIPGFVMKKKKKRKKKKKKKKK 237 (303)
Q Consensus 210 ~~~~~~~ip~~i~~~~y~~I~~~l~~~~ 237 (303)
..++++.+|..+.++.-+.=+..+-++-
T Consensus 237 ~~~~~~~~Pagl~LYW~~snl~~i~Qq~ 264 (304)
T PRK03449 237 VLVGGPFLPLAILLYWVSNNIWTFGQQH 264 (304)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3455556676666665555444444443
No 72
>COG1422 Predicted membrane protein [Function unknown]
Probab=43.17 E-value=1e+02 Score=24.84 Aligned_cols=22 Identities=18% Similarity=-0.025 Sum_probs=10.7
Q ss_pred hHHHHHHHHHHHHHHHHHHHhh
Q psy16862 220 FVMKKKKKRKKKKKKKKKKKRK 241 (303)
Q Consensus 220 ~i~~~~y~~I~~~l~~~~~~~~ 241 (303)
.+++..|..|..+.--.+.+.+
T Consensus 54 avi~gl~~~i~~~~liD~ekm~ 75 (201)
T COG1422 54 AVITGLYITILQKLLIDQEKMK 75 (201)
T ss_pred HHHHHHHHHHHHHHhccHHHHH
Confidence 3444555555555444444333
No 73
>PF14138 COX16: Cytochrome c oxidase assembly protein COX16
Probab=43.04 E-value=88 Score=21.10 Aligned_cols=24 Identities=8% Similarity=0.011 Sum_probs=13.5
Q ss_pred hhhhhhHHHHHHHHHHHHHHHHHH
Q psy16862 215 FWIPGFVMKKKKKRKKKKKKKKKK 238 (303)
Q Consensus 215 ~~ip~~i~~~~y~~I~~~l~~~~~ 238 (303)
+.+|++.+++.-..-+...-+.+-
T Consensus 3 ~GlPf~~liV~GS~gL~~ftq~Ry 26 (80)
T PF14138_consen 3 FGLPFLLLIVGGSFGLSEFTQIRY 26 (80)
T ss_pred ccccHHHHHHHHHHHHHHHHHHHh
Confidence 567877766655555544444333
No 74
>PF09882 DUF2109: Predicted membrane protein (DUF2109); InterPro: IPR019214 This entry is found in various hypothetical archaeal proteins and has no known function.
Probab=42.80 E-value=95 Score=20.70 Aligned_cols=48 Identities=25% Similarity=0.373 Sum_probs=33.3
Q ss_pred HHHHHHHHHHHHHHHhhcccccccchhHHHHHHHHHhHHHHhhhhhHH
Q psy16862 51 IIITAILGNLLVIISVIKHRKLRIITNYFVVSLAFADLLVALCVMPFN 98 (303)
Q Consensus 51 i~i~gl~~N~~vl~~i~~~~~~~~~~~~ll~~Lai~Dl~~~l~~~~~~ 98 (303)
..++|+++=...+-++..+.+.++-.++-.+|-|++-++....--|+-
T Consensus 3 ~~i~g~Iai~~~iR~~~~~~r~~KL~yLnv~~F~iaalIaL~i~~P~g 50 (78)
T PF09882_consen 3 IIIIGIIAILMAIRIFLTKSRARKLLYLNVINFAIAALIALYIKSPMG 50 (78)
T ss_pred hHHHHHHHHHHHHHHHHhHhHHHhhhHHHHHHHHHHHHHHHHhCCcHH
Confidence 345666666667777776777777777888888888887766555543
No 75
>PHA02616 VP2/VP3; Provisional
Probab=41.61 E-value=18 Score=28.69 Aligned_cols=13 Identities=31% Similarity=0.199 Sum_probs=6.1
Q ss_pred HHHHHHHhHHHHh
Q psy16862 80 VVSLAFADLLVAL 92 (303)
Q Consensus 80 l~~Lai~Dl~~~l 92 (303)
+++-++++-+..+
T Consensus 60 vvnR~~s~ElqrL 72 (259)
T PHA02616 60 VVNRTASRELQRL 72 (259)
T ss_pred HHHHHHHHHHHHH
Confidence 3444555544444
No 76
>KOG2927|consensus
Probab=40.72 E-value=74 Score=28.02 Aligned_cols=142 Identities=11% Similarity=0.003 Sum_probs=0.0
Q ss_pred CCCcccchhhhHHHHHHHHhhhhhhhhccccccccccCCCcccccCCCceEEEEehhhHHHHHhhhhhhhhhhHHHHHHH
Q psy16862 148 QYPIIMNQNTVLLMLSNVWILPGIISFTPIMLGWYTTPDHKQYRKHHPNVCIFVVNVYYALISSSISFWIPGFVMKKKKK 227 (303)
Q Consensus 148 ~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c~~~~~~~~~~~~~~~~~~ip~~i~~~~y~ 227 (303)
+|-+...++.....++++.++.+.+++..+-+.....-....+.+.... .....++..++.-+|-+.|+-++|.
T Consensus 178 ~YVW~yep~~~~~~vl~~~fvl~tlaivLFPLWP~~mR~gvyY~sig~~------gfl~~IlvLaIvRlILF~I~~il~~ 251 (372)
T KOG2927|consen 178 HYVWIYEPRPLMWQVLGVLFVLVTLAIVLFPLWPRRMRQGVYYLSIGAG------GFLAFILVLAIVRLILFGITWILTG 251 (372)
T ss_pred eEEEeccCCchhHHHHHHHHHHHHHHHHhcccCcHHHhcceeeeecchh------HHHHHHHHHHHHHHHHHHHHHHHhC
Q ss_pred --HHHHHHHHHHHHhhhhhcccccccccccCCCchhHHHHHHHhhhhhhhhHHHHHHHHHhHhhhhhcccc
Q psy16862 228 --RKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRD 296 (303)
Q Consensus 228 --~I~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~ 296 (303)
.=++..-.-.-.-.-..+-..-=......... ...-.+++.++++++++..+.-...+...+..+|.+
T Consensus 252 g~~g~W~FPNL~eDvGfleSF~PLy~~~~~~~~~-~~aK~~k~skk~~k~k~~~~eas~~~~~~~~~~~i~ 321 (372)
T KOG2927|consen 252 GKHGFWLFPNLTEDVGFLESFKPLYEYHYKKDPK-KTAKLDKKSKKKKKRKSSPEEASRSEAEQDMSGHID 321 (372)
T ss_pred CCCceEeccchhhhhhHHHhhcccccccCCCCcc-hhhHHHHHHHHHHhhhcChhhhhhhhhhhhhccCCC
No 77
>KOG0709|consensus
Probab=40.61 E-value=29 Score=31.63 Aligned_cols=26 Identities=31% Similarity=0.519 Sum_probs=10.9
Q ss_pred hHHHHHHHhhhhhhhhHHHHHHHHHh
Q psy16862 261 EEEEKKKKKKKKKKKTTKKKKRKKKR 286 (303)
Q Consensus 261 ~~~~~~~~~~~k~~~~~~~~~~~~~~ 286 (303)
+++.-|+-++|-|||.+.++.|+|||
T Consensus 246 EEriLKrvRRKIrNK~SAQESRrkKk 271 (472)
T KOG0709|consen 246 EERILKRVRRKIRNKRSAQESRRKKK 271 (472)
T ss_pred HHHHHHHHHHHHHhhhhhHHHHHhHh
Confidence 34433334444444444444444433
No 78
>KOG2422|consensus
Probab=40.56 E-value=9.5 Score=35.71 Aligned_cols=18 Identities=44% Similarity=0.715 Sum_probs=8.2
Q ss_pred HHHHHHhHhhhhhccccc
Q psy16862 280 KKRKKKRKKKKNQHHRDV 297 (303)
Q Consensus 280 ~~~~~~~~~~~~~~~~~~ 297 (303)
++.+||+||++|++..+|
T Consensus 90 ~k~KKK~krkkKk~~~~~ 107 (665)
T KOG2422|consen 90 NKKKKKKKRKKKKSTAEV 107 (665)
T ss_pred cccchhhhhccccccCcc
Confidence 344444444455544443
No 79
>PRK01315 putative inner membrane protein translocase component YidC; Provisional
Probab=40.35 E-value=59 Score=28.61 Aligned_cols=22 Identities=9% Similarity=-0.242 Sum_probs=11.1
Q ss_pred hhhhhhhhhhHHHHHHHHHHHH
Q psy16862 211 SSISFWIPGFVMKKKKKRKKKK 232 (303)
Q Consensus 211 ~~~~~~ip~~i~~~~y~~I~~~ 232 (303)
.++++.+|..+.++.-..=+..
T Consensus 226 ~~~~~~fPaGL~LYW~~snl~s 247 (329)
T PRK01315 226 LVSGIAFPVGVLFYWLTSNVWT 247 (329)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 4444555666665554433333
No 80
>PF15102 TMEM154: TMEM154 protein family
Probab=39.00 E-value=7.2 Score=29.51 Aligned_cols=13 Identities=15% Similarity=0.207 Sum_probs=6.4
Q ss_pred hhhhhhhhhHHHH
Q psy16862 212 SISFWIPGFVMKK 224 (303)
Q Consensus 212 ~~~~~ip~~i~~~ 224 (303)
++.+++|++++++
T Consensus 58 iLmIlIP~VLLvl 70 (146)
T PF15102_consen 58 ILMILIPLVLLVL 70 (146)
T ss_pred EEEEeHHHHHHHH
Confidence 4455666444433
No 81
>PF05297 Herpes_LMP1: Herpesvirus latent membrane protein 1 (LMP1); InterPro: IPR007961 This family consists of several latent membrane protein 1 or LMP1s mostly from Epstein-Barr virus (strain GD1) (HHV-4) (Human herpesvirus 4). LMP1 of HHV-4 is a 62-65 kDa plasma membrane protein possessing six membrane spanning regions, a short cytoplasmic N terminus and a long cytoplasmic carboxy tail of 200 amino acids. HHV-4 virus latent membrane protein 1 (LMP1) is essential for HHV-4 mediated transformation and has been associated with several cases of malignancies. HHV-4-like viruses in Macaca fascicularis (Cynomolgus monkeys) have been associated with high lymphoma rates in immunosuppressed monkeys [].; GO: 0019087 transformation of host cell by virus, 0016021 integral to membrane; PDB: 1CZY_E 1ZMS_B.
Probab=38.64 E-value=10 Score=32.07 Aligned_cols=8 Identities=25% Similarity=0.210 Sum_probs=0.0
Q ss_pred HHHHHHHH
Q psy16862 133 CCISVDRY 140 (303)
Q Consensus 133 ~~iaidRy 140 (303)
.++++.-|
T Consensus 118 LaL~vW~Y 125 (381)
T PF05297_consen 118 LALGVWFY 125 (381)
T ss_dssp --------
T ss_pred HHHHHHHH
Confidence 33344334
No 82
>smart00792 Agouti Agouti protein. The agouti protein regulates pigmentation in the mouse hair follicle producing a black hair with a subapical yellow band. A highly homologous protein agouti signal protein (ASIP) is present in humans and is expressed at highest levels in adipose tissue where it may play a role in energy homeostasis and possibly human pigmentation PUBMED:11837451, PUBMED:11833005.
Probab=36.80 E-value=71 Score=23.36 Aligned_cols=9 Identities=44% Similarity=0.527 Sum_probs=3.5
Q ss_pred HHHHHHHhH
Q psy16862 279 KKKRKKKRK 287 (303)
Q Consensus 279 ~~~~~~~~~ 287 (303)
+++.+|++|
T Consensus 64 ~~~~~k~k~ 72 (124)
T smart00792 64 KKLQQSKEK 72 (124)
T ss_pred hhhhhhhhh
Confidence 334444333
No 83
>COG4736 CcoQ Cbb3-type cytochrome oxidase, subunit 3 [Posttranslational modification, protein turnover, chaperones]
Probab=36.72 E-value=81 Score=19.97 Aligned_cols=31 Identities=10% Similarity=-0.197 Sum_probs=20.1
Q ss_pred hhhhhhHHHHHHHHHHHHHHHHHHHhhhhhc
Q psy16862 215 FWIPGFVMKKKKKRKKKKKKKKKKKRKKKKN 245 (303)
Q Consensus 215 ~~ip~~i~~~~y~~I~~~l~~~~~~~~~~~~ 245 (303)
.+..+.+.++++..|+...++++|.+....+
T Consensus 12 a~~t~~~~l~fiavi~~ayr~~~K~~~d~aa 42 (60)
T COG4736 12 AWGTIAFTLFFIAVIYFAYRPGKKGEFDEAA 42 (60)
T ss_pred HHHHHHHHHHHHHHHHHHhcccchhhHHHHh
Confidence 3344455666777777777777776665555
No 84
>PRK04125 murein hydrolase regulator LrgA; Provisional
Probab=36.49 E-value=15 Score=27.83 Aligned_cols=7 Identities=43% Similarity=0.363 Sum_probs=3.1
Q ss_pred hhhhhcc
Q psy16862 288 KKKNQHH 294 (303)
Q Consensus 288 ~~~~~~~ 294 (303)
-+-|||+
T Consensus 134 ~~~~~~~ 140 (141)
T PRK04125 134 IKGRQHG 140 (141)
T ss_pred ccccccC
Confidence 3344444
No 85
>PHA03242 envelope glycoprotein M; Provisional
Probab=36.07 E-value=3.5e+02 Score=24.85 Aligned_cols=26 Identities=12% Similarity=0.106 Sum_probs=16.8
Q ss_pred hhhhhhHHHHHHHHHHHHHHHHHHHh
Q psy16862 215 FWIPGFVMKKKKKRKKKKKKKKKKKR 240 (303)
Q Consensus 215 ~~ip~~i~~~~y~~I~~~l~~~~~~~ 240 (303)
-++|++.++..-+++.|...-|+++.
T Consensus 330 aviail~l~~~vvRlvRa~~yHr~~~ 355 (428)
T PHA03242 330 ALVALFALAMAVLRLVRAYLYHRRHR 355 (428)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 45666667777777777766665443
No 86
>TIGR02736 cbb3_Q_epsi cytochrome c oxidase, cbb3-type, CcoQ subunit, epsilon-Proteobacterial. Members of this protein family are restricted to the epsilon branch of the Proteobacteria. All members are found in operons containing the other three structural subunits of the cbb3 type of cytochrome c oxidase. These small proteins show remote sequence similarity to the CcoQ subunit in other cytochrome c oxidase systems, so this family is assumed to represent the epsilonproteobacterial variant of CcoQ.
Probab=35.63 E-value=36 Score=21.01 Aligned_cols=27 Identities=11% Similarity=-0.015 Sum_probs=19.6
Q ss_pred hhhhhhHHHHHHHHHHHHHHHHHHHhh
Q psy16862 215 FWIPGFVMKKKKKRKKKKKKKKKKKRK 241 (303)
Q Consensus 215 ~~ip~~i~~~~y~~I~~~l~~~~~~~~ 241 (303)
+++.+++++++|..|+..-++.++.++
T Consensus 5 f~~ti~lvv~LYgY~yhLYrsek~G~r 31 (56)
T TIGR02736 5 FAFTLLLVIFLYAYIYHLYRSQKKGER 31 (56)
T ss_pred HHHHHHHHHHHHHHHHHhhhhhccccc
Confidence 345556777889999988887776654
No 87
>TIGR02230 ATPase_gene1 F0F1-ATPase subunit, putative. This model represents a protein found encoded in F1F0-ATPase operons in several genomes, including Methanosarcina barkeri (archaeal) and Chlorobium tepidum (bacterial). It is a small protein (about 100 amino acids) with long hydrophic stretches and is presumed to be a subunit of the enzyme.
Probab=35.60 E-value=14 Score=26.09 Aligned_cols=16 Identities=13% Similarity=0.642 Sum_probs=9.4
Q ss_pred HHhHhhhhhccccccc
Q psy16862 284 KKRKKKKNQHHRDVWQ 299 (303)
Q Consensus 284 ~~~~~~~~~~~~~~~~ 299 (303)
.|+.+-+|++.+++|+
T Consensus 25 ~R~~~a~r~~~~~~~~ 40 (100)
T TIGR02230 25 LRKLRARKNATRSIWE 40 (100)
T ss_pred HHHHHHHhcCCCcHHH
Confidence 3444555666667765
No 88
>PF11446 DUF2897: Protein of unknown function (DUF2897); InterPro: IPR021550 This is a bacterial family of uncharacterised proteins.
Probab=35.57 E-value=52 Score=20.41 Aligned_cols=23 Identities=26% Similarity=0.472 Sum_probs=14.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhc
Q psy16862 46 MLMCFIIITAILGNLLVIISVIK 68 (303)
Q Consensus 46 ~~~~~i~i~gl~~N~~vl~~i~~ 68 (303)
.+++++.+--++||+.+|---.+
T Consensus 6 wlIIviVlgvIigNia~LK~sAk 28 (55)
T PF11446_consen 6 WLIIVIVLGVIIGNIAALKYSAK 28 (55)
T ss_pred hHHHHHHHHHHHhHHHHHHHhcc
Confidence 34445555567888888764443
No 89
>KOG4055|consensus
Probab=34.11 E-value=34 Score=26.90 Aligned_cols=15 Identities=7% Similarity=0.193 Sum_probs=10.1
Q ss_pred HHHHHHHHHHHHHhh
Q psy16862 227 KRKKKKKKKKKKKRK 241 (303)
Q Consensus 227 ~~I~~~l~~~~~~~~ 241 (303)
+.+|+.+||+...|.
T Consensus 82 FHvYR~lRRrEq~Rl 96 (213)
T KOG4055|consen 82 FHVYRHLRRREQDRL 96 (213)
T ss_pred hHHHHHHHHHHHHHH
Confidence 467788877765554
No 90
>PF10883 DUF2681: Protein of unknown function (DUF2681); InterPro: IPR020274 This entry contains membrane proteins with no known function.
Probab=32.08 E-value=1.7e+02 Score=20.12 Aligned_cols=23 Identities=30% Similarity=0.052 Sum_probs=12.3
Q ss_pred HHHHHHHHHHHHHHHHHHHhhhh
Q psy16862 221 VMKKKKKRKKKKKKKKKKKRKKK 243 (303)
Q Consensus 221 i~~~~y~~I~~~l~~~~~~~~~~ 243 (303)
++++..+..++.+++.+++..+.
T Consensus 13 v~~~i~~y~~~k~~ka~~~~~kL 35 (87)
T PF10883_consen 13 VVALILAYLWWKVKKAKKQNAKL 35 (87)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444555566666666554433
No 91
>PF11683 DUF3278: Protein of unknown function (DUF3278); InterPro: IPR021697 This bacterial family of proteins has no known function.
Probab=31.38 E-value=1.7e+02 Score=21.65 Aligned_cols=41 Identities=15% Similarity=0.267 Sum_probs=19.7
Q ss_pred HHHHHHhcCCC-CCcccchhhhHHHHHHHHhhhhhhhhcccc
Q psy16862 138 DRYYAIVKPLQ-YPIIMNQNTVLLMLSNVWILPGIISFTPIM 178 (303)
Q Consensus 138 dRy~aI~~P~~-~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~ 178 (303)
-|+.-|.-|+. |++..-.+-.-...+..|.+.++..+..++
T Consensus 12 K~fygI~GplDE~r~~ei~rign~a~i~l~~~~l~~~li~l~ 53 (129)
T PF11683_consen 12 KRFYGIQGPLDEYRRQEINRIGNNAFIILFYYSLLLNLISLL 53 (129)
T ss_pred HHHhCCCCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45666777775 332222233333444455555555544433
No 92
>PF06679 DUF1180: Protein of unknown function (DUF1180); InterPro: IPR009565 This entry consists of several hypothetical eukaryotic proteins thought to be membrane proteins. Their function is unknown.
Probab=31.02 E-value=34 Score=26.63 Aligned_cols=18 Identities=6% Similarity=0.085 Sum_probs=7.8
Q ss_pred hhhhHHHHHHHHHHHHHH
Q psy16862 217 IPGFVMKKKKKRKKKKKK 234 (303)
Q Consensus 217 ip~~i~~~~y~~I~~~l~ 234 (303)
+..++++++-+++++.-+
T Consensus 104 ~s~l~i~yfvir~~R~r~ 121 (163)
T PF06679_consen 104 LSALAILYFVIRTFRLRR 121 (163)
T ss_pred HHHHHHHHHHHHHHhhcc
Confidence 333444444444444433
No 93
>PRK05585 yajC preprotein translocase subunit YajC; Validated
Probab=30.64 E-value=78 Score=22.68 Aligned_cols=26 Identities=15% Similarity=0.074 Sum_probs=16.2
Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhcc
Q psy16862 221 VMKKKKKRKKKKKKKKKKKRKKKKNQ 246 (303)
Q Consensus 221 i~~~~y~~I~~~l~~~~~~~~~~~~~ 246 (303)
++.++|..+++-.+|++++++....+
T Consensus 27 i~~i~yf~~~RpqkK~~k~~~~~~~~ 52 (106)
T PRK05585 27 FFAIFYFLIIRPQQKRQKEHKKMLSS 52 (106)
T ss_pred HHHHHHHHhccHHHHHHHHHHHHHHh
Confidence 34456677777777777666655554
No 94
>KOG2357|consensus
Probab=30.56 E-value=59 Score=29.23 Aligned_cols=12 Identities=25% Similarity=0.473 Sum_probs=5.7
Q ss_pred HHHHHHHHHhcC
Q psy16862 135 ISVDRYYAIVKP 146 (303)
Q Consensus 135 iaidRy~aI~~P 146 (303)
--+.||..|.++
T Consensus 245 ~DLs~F~si~~~ 256 (440)
T KOG2357|consen 245 RDLSRFASIVSS 256 (440)
T ss_pred HHHHHHhccCCC
Confidence 344455555543
No 95
>COG4606 CeuB ABC-type enterochelin transport system, permease component [Inorganic ion transport and metabolism]
Probab=29.60 E-value=3.6e+02 Score=23.13 Aligned_cols=94 Identities=15% Similarity=0.303 Sum_probs=51.5
Q ss_pred HHHHHHHHHHHHHHHHHhhcccccccchhHHHHHHHHHhHHHHhhhhhHHHHHHhc--cccccccccchhchHHHHHHHH
Q psy16862 49 CFIIITAILGNLLVIISVIKHRKLRIITNYFVVSLAFADLLVALCVMPFNAIVSLT--DEWYFGYFMCDVWNSFDVYFST 126 (303)
Q Consensus 49 ~~i~i~gl~~N~~vl~~i~~~~~~~~~~~~ll~~Lai~Dl~~~l~~~~~~~~~~~~--~~~~~~~~~C~~~~~~~~~~~~ 126 (303)
.+..++++.|+.+.+..+ ++-+++..-.+-+..+-....+.++.....+-..... +.|..|+..-...+=-......
T Consensus 111 lvA~v~al~Gt~~F~~~l-~ri~~k~~i~VPLiGIm~Ggvi~sitTFiAy~~dllQ~l~~W~~G~Fs~v~~GrYElL~i~ 189 (321)
T COG4606 111 LVAFVFALAGTLLFMMIL-RRIKLKDVLFVPLIGIMFGGVISSITTFIAYRFDLLQSLGAWLQGDFSGVLRGRYELLWIS 189 (321)
T ss_pred HHHHHHHHHHHHHHHHHH-HhccccceEeehhHHHHHHhHHHHHHHHHHHHHHHHHHhhHHhcCchHhhhccceeehHHH
Confidence 345667788877666644 4445566666777777777766655443332222222 3677666544443322222233
Q ss_pred HHHHHHHHHHHHHHHHH
Q psy16862 127 ASILHLCCISVDRYYAI 143 (303)
Q Consensus 127 ~s~~~l~~iaidRy~aI 143 (303)
.-...++...-||+...
T Consensus 190 ~~l~i~ay~~AdrfTi~ 206 (321)
T COG4606 190 IPLAILAYLFADRFTIA 206 (321)
T ss_pred HHHHHHHHHHhhheeee
Confidence 34445556666776443
No 96
>PF06658 DUF1168: Protein of unknown function (DUF1168); InterPro: IPR009548 This family consists of several hypothetical eukaryotic proteins of unknown function.
Probab=29.58 E-value=1.3e+02 Score=22.92 Aligned_cols=16 Identities=6% Similarity=0.171 Sum_probs=10.1
Q ss_pred HHHHHHHHHHHHHHhh
Q psy16862 226 KKRKKKKKKKKKKKRK 241 (303)
Q Consensus 226 y~~I~~~l~~~~~~~~ 241 (303)
.+.+|+..|++--.|.
T Consensus 37 eFHvYR~~RRrE~~Rl 52 (142)
T PF06658_consen 37 EFHVYRASRRREYERL 52 (142)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 4567777777655444
No 97
>KOG2357|consensus
Probab=29.31 E-value=82 Score=28.39 Aligned_cols=8 Identities=25% Similarity=0.430 Sum_probs=3.3
Q ss_pred ccccccch
Q psy16862 108 YFGYFMCD 115 (303)
Q Consensus 108 ~~~~~~C~ 115 (303)
..|...|.
T Consensus 176 ~tGR~~c~ 183 (440)
T KOG2357|consen 176 ATGRVNCK 183 (440)
T ss_pred hcchhHHh
Confidence 33444443
No 98
>PF10329 DUF2417: Region of unknown function (DUF2417); InterPro: IPR019431 This entry represents a family of fungal proteins with no known function. In some cases these proteins also contain an alpha/beta hydrolase fold (IPR000073 from INTERPRO).
Probab=29.19 E-value=3.4e+02 Score=22.61 Aligned_cols=17 Identities=35% Similarity=0.309 Sum_probs=12.4
Q ss_pred HHHHHHHHHHHHHHHHH
Q psy16862 126 TASILHLCCISVDRYYA 142 (303)
Q Consensus 126 ~~s~~~l~~iaidRy~a 142 (303)
.+....+..+..||++.
T Consensus 139 Wa~l~~l~~~~~D~~v~ 155 (232)
T PF10329_consen 139 WAFLSSLWGILADRYVE 155 (232)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 35556677788899986
No 99
>PF07946 DUF1682: Protein of unknown function (DUF1682); InterPro: IPR012879 The members of this family are all hypothetical eukaryotic proteins of unknown function. One member (Q920S6 from SWISSPROT) is described as being an adipocyte-specific protein, but no evidence of this was found.
Probab=29.06 E-value=1.2e+02 Score=26.62 Aligned_cols=22 Identities=9% Similarity=-0.150 Sum_probs=10.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH
Q psy16862 43 IKGMLMCFIIITAILGNLLVII 64 (303)
Q Consensus 43 ~~~~~~~~i~i~gl~~N~~vl~ 64 (303)
.+.+++++..++|--.|--+.-
T Consensus 12 ~~l~~y~~~y~~G~~~N~~~A~ 33 (321)
T PF07946_consen 12 AFLLLYVVNYFIGKSKNRRIAK 33 (321)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 3344444444555555544333
No 100
>PF07946 DUF1682: Protein of unknown function (DUF1682); InterPro: IPR012879 The members of this family are all hypothetical eukaryotic proteins of unknown function. One member (Q920S6 from SWISSPROT) is described as being an adipocyte-specific protein, but no evidence of this was found.
Probab=28.46 E-value=33 Score=30.01 Aligned_cols=10 Identities=20% Similarity=0.335 Sum_probs=4.7
Q ss_pred cccccccchh
Q psy16862 107 WYFGYFMCDV 116 (303)
Q Consensus 107 ~~~~~~~C~~ 116 (303)
|..|...|..
T Consensus 72 yaTGR~~~~~ 81 (321)
T PF07946_consen 72 YATGRRNCEG 81 (321)
T ss_pred EEeCCCCeEE
Confidence 3445555543
No 101
>TIGR02976 phageshock_pspB phage shock protein B. This model describes the PspB protein of the psp (phage shock protein) operon, as found in Escherichia coli and many related species. Expression of a phage protein called secretin protein IV, and a number of other stresses including ethanol, heat shock, and defects in protein secretion trigger sigma-54-dependent expression of the phage shock regulon. PspB is both a regulator and an effector protein of the phage shock response.
Probab=27.92 E-value=1.9e+02 Score=19.28 Aligned_cols=28 Identities=11% Similarity=-0.039 Sum_probs=16.9
Q ss_pred hhhhhhhHHHHHHHHHHHHHHHHHHHhh
Q psy16862 214 SFWIPGFVMKKKKKRKKKKKKKKKKKRK 241 (303)
Q Consensus 214 ~~~ip~~i~~~~y~~I~~~l~~~~~~~~ 241 (303)
++++|+++.++.-.-++..++-.++++.
T Consensus 5 fl~~Pliif~ifVap~wl~lHY~~k~~~ 32 (75)
T TIGR02976 5 FLAIPLIIFVIFVAPLWLILHYRSKRKT 32 (75)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhcc
Confidence 4567777776666666655555544433
No 102
>PRK09458 pspB phage shock protein B; Provisional
Probab=27.84 E-value=1.9e+02 Score=19.27 Aligned_cols=29 Identities=10% Similarity=-0.086 Sum_probs=18.5
Q ss_pred hhhhhhhhHHHHHHHHHHHHHHHHHHHhh
Q psy16862 213 ISFWIPGFVMKKKKKRKKKKKKKKKKKRK 241 (303)
Q Consensus 213 ~~~~ip~~i~~~~y~~I~~~l~~~~~~~~ 241 (303)
.++.+|+++.+++-.-|+..++=++|++.
T Consensus 4 ~fl~~PliiF~ifVaPiWL~LHY~sk~~~ 32 (75)
T PRK09458 4 LFLAIPLTIFVLFVAPIWLWLHYRSKRQG 32 (75)
T ss_pred hHHHHhHHHHHHHHHHHHHHHhhcccccC
Confidence 34567777777777777766665554443
No 103
>PF15330 SIT: SHP2-interacting transmembrane adaptor protein, SIT
Probab=27.79 E-value=93 Score=22.35 Aligned_cols=29 Identities=10% Similarity=0.241 Sum_probs=20.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcccc
Q psy16862 43 IKGMLMCFIIITAILGNLLVIISVIKHRK 71 (303)
Q Consensus 43 ~~~~~~~~i~i~gl~~N~~vl~~i~~~~~ 71 (303)
++..+++++.++.+.+|++......+.++
T Consensus 2 ~Ll~il~llLll~l~asl~~wr~~~rq~k 30 (107)
T PF15330_consen 2 LLLGILALLLLLSLAASLLAWRMKQRQKK 30 (107)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhhhcc
Confidence 34566677778888888888776655444
No 104
>PF08213 DUF1713: Mitochondrial domain of unknown function (DUF1713); InterPro: IPR013177 This domain is found at the C-terminal end of mitochondrial proteins of unknown function.
Probab=27.23 E-value=78 Score=17.46 Aligned_cols=15 Identities=53% Similarity=0.581 Sum_probs=6.2
Q ss_pred hhHHHHHHHHHhHhh
Q psy16862 275 KTTKKKKRKKKRKKK 289 (303)
Q Consensus 275 ~~~~~~~~~~~~~~~ 289 (303)
+.+..++-+|.+|+.
T Consensus 11 ~KM~kHK~kKlrKr~ 25 (34)
T PF08213_consen 11 KKMKKHKYKKLRKRT 25 (34)
T ss_pred HHHHHHHHHHHHHHH
Confidence 344444444444333
No 105
>PF05434 Tmemb_9: TMEM9; InterPro: IPR008853 This family contains several eukaryotic transmembrane proteins which are homologous to Homo sapiens transmembrane protein 9 Q9P0T7 from SWISSPROT. The TMEM9 gene encodes a 183 amino-acid protein that contains an N-terminal signal peptide, a single transmembrane region, three potential N-glycosylation sites and three conserved cys-rich domains in the N terminus, but no known functional domains. The protein is highly conserved between species from Caenorhabditis elegans to H. sapiens and belongs to a novel family of transmembrane proteins. The exact function of TMEM9 is unknown although it has been found to be widely expressed and localised to the late endosomes and lysosomes []. Members of this family contain CXCXC repeats IPR004153 from INTERPRO in their N-terminal region.; GO: 0016021 integral to membrane
Probab=26.86 E-value=66 Score=24.55 Aligned_cols=14 Identities=7% Similarity=0.539 Sum_probs=5.6
Q ss_pred HhHhhhhhcccccc
Q psy16862 285 KRKKKKNQHHRDVW 298 (303)
Q Consensus 285 ~~~~~~~~~~~~~~ 298 (303)
|=||+=+|..|.||
T Consensus 130 rWK~QVqEQRknIf 143 (149)
T PF05434_consen 130 RWKRQVQEQRKNIF 143 (149)
T ss_pred HHHHHHHHHHHhhh
Confidence 33333334344444
No 106
>PF02532 PsbI: Photosystem II reaction centre I protein (PSII 4.8 kDa protein); InterPro: IPR003686 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection []. This family represents the low molecular weight transmembrane protein PsbI, which is tightly associated with the D1/D2 heterodimer in PSII. The function of PsbI is unknown, but it may be involved in the assembly, dimerisation or stabilisation of PSII dimers [].; GO: 0015979 photosynthesis, 0009523 photosystem II, 0009539 photosystem II reaction center, 0016020 membrane; PDB: 3A0H_i 3ARC_I 3A0B_i 3BZ2_I 3PRQ_I 3KZI_I 3PRR_I 2AXT_i 4FBY_I 1S5L_i ....
Probab=26.34 E-value=1.1e+02 Score=17.06 Aligned_cols=16 Identities=19% Similarity=0.285 Sum_probs=8.5
Q ss_pred HHHHHHHHHHHHHHHH
Q psy16862 44 KGMLMCFIIITAILGN 59 (303)
Q Consensus 44 ~~~~~~~i~i~gl~~N 59 (303)
..+++..++++|.+.|
T Consensus 11 vV~ffv~LFifGflsn 26 (36)
T PF02532_consen 11 VVIFFVSLFIFGFLSN 26 (36)
T ss_dssp HHHHHHHHHHHHHHTT
T ss_pred hHHHHHHHHhccccCC
Confidence 3344455556666554
No 107
>PF05151 PsbM: Photosystem II reaction centre M protein (PsbM); InterPro: IPR007826 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection []. This family represents the low molecular weight transmembrane protein PsbM found in PSII. PsbM is one of the most hydrophobic proteins in the thylakoid membrane. The function of this protein is unknown.; GO: 0015979 photosynthesis, 0019684 photosynthesis, light reaction, 0009523 photosystem II, 0016021 integral to membrane; PDB: 3A0H_m 3ARC_m 3A0B_M 3PRR_M 3PRQ_M 1S5L_M 4FBY_e 3BZ2_M 3BZ1_M 2AXT_M ....
Probab=25.41 E-value=1.2e+02 Score=16.32 Aligned_cols=23 Identities=17% Similarity=0.219 Sum_probs=16.0
Q ss_pred HHHHhhhhhhhhhhHHHHHHHHH
Q psy16862 207 ALISSSISFWIPGFVMKKKKKRK 229 (303)
Q Consensus 207 ~~~~~~~~~~ip~~i~~~~y~~I 229 (303)
.++......++|....++.|++-
T Consensus 7 ~fiAtaLfi~iPt~FLiilyvqT 29 (31)
T PF05151_consen 7 AFIATALFILIPTAFLIILYVQT 29 (31)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHH
T ss_pred hHHHHHHHHHHHHHHHhheEeee
Confidence 34455666778888888888753
No 108
>smart00786 SHR3_chaperone ER membrane protein SH3. This family of proteins are membrane localised chaperones that are required for correct plasma membrane localisation of amino acid permeases (AAPs) PUBMED:15623581. Shr3 prevents AAPs proteins from aggregating and assists in their correct folding. In the absence of Shr3, AAPs are retained in the ER.
Probab=25.34 E-value=1.2e+02 Score=24.46 Aligned_cols=20 Identities=10% Similarity=-0.155 Sum_probs=9.1
Q ss_pred hhHHHHHHHHHHHHHHHHHH
Q psy16862 219 GFVMKKKKKRKKKKKKKKKK 238 (303)
Q Consensus 219 ~~i~~~~y~~I~~~l~~~~~ 238 (303)
++-++++-..-++.-|..++
T Consensus 144 LvGVL~LQaG~wYAer~~~~ 163 (196)
T smart00786 144 LVGVLVLQAGLWYAERKDAK 163 (196)
T ss_pred HHHHHHHHhhHHHHHHhHHH
Confidence 34444444554544444443
No 109
>PF05545 FixQ: Cbb3-type cytochrome oxidase component FixQ; InterPro: IPR008621 This family consists of several Cbb3-type cytochrome oxidase components (FixQ/CcoQ). FixQ is found in nitrogen fixing bacteria. Since nitrogen fixation is an energy-consuming process, effective symbioses depend on operation of a respiratory chain with a high affinity for O2, closely coupled to ATP production. This requirement is fulfilled by a special three-subunit terminal oxidase (cytochrome terminal oxidase cbb3), which was first identified in Bradyrhizobium japonicum as the product of the fixNOQP operon [].
Probab=25.32 E-value=1.3e+02 Score=17.86 Aligned_cols=23 Identities=13% Similarity=-0.091 Sum_probs=11.6
Q ss_pred hhhHHHHHHHHHHHHHHHHHHHh
Q psy16862 218 PGFVMKKKKKRKKKKKKKKKKKR 240 (303)
Q Consensus 218 p~~i~~~~y~~I~~~l~~~~~~~ 240 (303)
.++.++++.+.+++..+++++++
T Consensus 15 ~v~~~~~F~gi~~w~~~~~~k~~ 37 (49)
T PF05545_consen 15 TVLFFVFFIGIVIWAYRPRNKKR 37 (49)
T ss_pred HHHHHHHHHHHHHHHHcccchhh
Confidence 33444555555566555554333
No 110
>cd07912 Tweety_N N-terminal domain of the protein encoded by the Drosophila tweety gene and related proteins, a family of chloride ion channels. The protein product of the Drosophila tweety (tty) gene is thought to form a trans-membrane protein with five membrane-spanning regions and a cytoplasmic C-terminus. This N-terminal domain contains the putative transmembrane spanning regions. Tweety has been suggested as a candidate for a large conductance chloride channel, both in vertebrate and insect cells. Three human homologs have been identified and designated TTYH1-3. TTYH2 has been associated with the progression of cancer, and Drosophila melanogaster tweety has been assumed to play a role in development. TTYH2, and TTYH3 bind to and are ubiquinated by Nedd4-2, a HECT type E3 ubiquitin ligase, which most likely plays a role in controlling the cellular levels of tweety family proteins.
Probab=24.91 E-value=4.4e+02 Score=24.19 Aligned_cols=17 Identities=29% Similarity=0.233 Sum_probs=10.9
Q ss_pred hhHHHHHHHHHhHHHHh
Q psy16862 76 TNYFVVSLAFADLLVAL 92 (303)
Q Consensus 76 ~~~ll~~Lai~Dl~~~l 92 (303)
...+...|.+.-++.+.
T Consensus 79 ~~c~~~sLiiltL~~~a 95 (418)
T cd07912 79 ICCLKWSLVIATLLCCA 95 (418)
T ss_pred ccHHHHHHHHHHHHHHH
Confidence 45666677777766544
No 111
>PF12606 RELT: Tumour necrosis factor receptor superfamily member 19; InterPro: IPR022248 The members of tumor necrosis factor receptor (TNFR) superfamily have been designated as the "guardians of the immune system" due to their roles in immune cell proliferation, differentiation, activation, and death (apoptosis). RELT (receptor expressed in lymphoid tissues) is a member of the TNFR superfamily. The messenger RNA of RELT is especially abundant in hematologic tissues such as spleen, lymph node, and peripheral blood leukocytes as well as in leukemias and lymphomas. RELT is able to activate the NF-kappaB pathway and selectively binds tumor necrosis factor receptor-associated factor 1 []. RELT like proteins 1 and 2 (RELL1 and RELL2) are two RELT homologues that bind to RELT. The expression of RELL1 at the mRNA level is ubiquitous, whereas expression of RELL2 mRNA is more restricted to particular tissues [].
Probab=24.89 E-value=1.4e+02 Score=18.17 Aligned_cols=23 Identities=26% Similarity=0.587 Sum_probs=11.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhccc
Q psy16862 46 MLMCFIIITAILGNLLVIISVIKHR 70 (303)
Q Consensus 46 ~~~~~i~i~gl~~N~~vl~~i~~~~ 70 (303)
++..+++++|++| +.++.+.+.+
T Consensus 5 ~iV~i~iv~~lLg--~~I~~~~K~y 27 (50)
T PF12606_consen 5 LIVSIFIVMGLLG--LSICTTLKAY 27 (50)
T ss_pred HHHHHHHHHHHHH--HHHHHHhhcc
Confidence 4455556666666 3344444443
No 112
>COG5472 Predicted small integral membrane protein [Function unknown]
Probab=24.81 E-value=3e+02 Score=20.50 Aligned_cols=19 Identities=11% Similarity=0.069 Sum_probs=8.9
Q ss_pred HHHHHhHHHHhhhhhHHHH
Q psy16862 82 SLAFADLLVALCVMPFNAI 100 (303)
Q Consensus 82 ~Lai~Dl~~~l~~~~~~~~ 100 (303)
.-++.|....+.++..+..
T Consensus 101 AK~~~~~g~tL~flLWFF~ 119 (164)
T COG5472 101 AKGIAIIGLTLGFLLWFFV 119 (164)
T ss_pred ccchhhhhhHHHHHHHHHH
Confidence 3445555555544444333
No 113
>KOG3584|consensus
Probab=24.50 E-value=96 Score=26.46 Aligned_cols=30 Identities=27% Similarity=0.535 Sum_probs=19.8
Q ss_pred CchhHHHHHHHhhhhhhhhHHHHHHHHHhH
Q psy16862 258 RGEEEEEKKKKKKKKKKKTTKKKKRKKKRK 287 (303)
Q Consensus 258 ~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~ 287 (303)
..+++..+|+.-+--|+|++.++=|+|||-
T Consensus 283 ~~aee~trKRevRLmKNREAARECRRKKKE 312 (348)
T KOG3584|consen 283 QGAEEATRKREVRLMKNREAARECRRKKKE 312 (348)
T ss_pred ccchhhhhHHHHHHHhhHHHHHHHHHhHhH
Confidence 445555666666777788888776666553
No 114
>PRK02463 OxaA-like protein precursor; Provisional
Probab=24.01 E-value=76 Score=27.62 Aligned_cols=6 Identities=0% Similarity=0.130 Sum_probs=3.1
Q ss_pred hhhccc
Q psy16862 290 KNQHHR 295 (303)
Q Consensus 290 ~~~~~~ 295 (303)
+||-||
T Consensus 298 ~rnagk 303 (307)
T PRK02463 298 NRNAGK 303 (307)
T ss_pred cccccc
Confidence 455555
No 115
>PF13038 DUF3899: Domain of unknown function (DUF3899)
Probab=23.97 E-value=2.4e+02 Score=19.22 Aligned_cols=11 Identities=0% Similarity=-0.363 Sum_probs=4.0
Q ss_pred HHHHHHHHHHH
Q psy16862 222 MKKKKKRKKKK 232 (303)
Q Consensus 222 ~~~~y~~I~~~ 232 (303)
..+.-...+-.
T Consensus 19 ~~v~~~GfFd~ 29 (92)
T PF13038_consen 19 LFVFQSGFFDG 29 (92)
T ss_pred HHHHhcCchHH
Confidence 33333333333
No 116
>PHA03235 DNA packaging protein UL33; Provisional
Probab=23.89 E-value=5.5e+02 Score=23.31 Aligned_cols=20 Identities=5% Similarity=-0.174 Sum_probs=10.9
Q ss_pred hhhhHHHHHHHHHHHHHHHH
Q psy16862 217 IPGFVMKKKKKRKKKKKKKK 236 (303)
Q Consensus 217 ip~~i~~~~y~~I~~~l~~~ 236 (303)
+-.++--+.|...-...|+.
T Consensus 297 ~ns~lNPiIY~~~~~~FRk~ 316 (409)
T PHA03235 297 LHCLLNPILYAFLGNDFLKR 316 (409)
T ss_pred HHHhHhHHHHHHhhHHHHHH
Confidence 44455556666665555544
No 117
>PF04936 DUF658: Protein of unknown function (DUF658); InterPro: IPR007020 This entry is represented by Bacteriophage r1t, Orf18. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. These are proteins of unknown function found in Lactococcus lactis and in their associated bacteriophage.
Probab=23.49 E-value=1.7e+02 Score=22.65 Aligned_cols=15 Identities=33% Similarity=0.355 Sum_probs=5.7
Q ss_pred CchhHHHHHHHhhhh
Q psy16862 258 RGEEEEEKKKKKKKK 272 (303)
Q Consensus 258 ~~~~~~~~~~~~~~k 272 (303)
+.|+.+.+.+.++|+
T Consensus 144 e~e~~E~k~k~rrK~ 158 (186)
T PF04936_consen 144 EEELIETKEKYRRKH 158 (186)
T ss_pred cHHHHHHhhHHHhhh
Confidence 334443333333333
No 118
>PF02009 Rifin_STEVOR: Rifin/stevor family; InterPro: IPR002858 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see []. Several multicopy gene families have been described in Plasmodium falciparum, including the stevor family of subtelomeric open reading frames and the rif interspersed repetitive elements. Both families contain three predicted transmembrane segments. It has been proposed that stevor and rif are members of a larger superfamily that code for variant surface antigens [].
Probab=23.41 E-value=1.5e+02 Score=25.67 Aligned_cols=20 Identities=25% Similarity=0.187 Sum_probs=9.8
Q ss_pred HHHhhhhhhhhhhHHHHHHH
Q psy16862 208 LISSSISFWIPGFVMKKKKK 227 (303)
Q Consensus 208 ~~~~~~~~~ip~~i~~~~y~ 227 (303)
++..++..++-+++|++.|.
T Consensus 258 I~aSiiaIliIVLIMvIIYL 277 (299)
T PF02009_consen 258 IIASIIAILIIVLIMVIIYL 277 (299)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 33444444454555555554
No 119
>PRK13664 hypothetical protein; Provisional
Probab=23.29 E-value=1e+02 Score=19.18 Aligned_cols=16 Identities=13% Similarity=0.372 Sum_probs=11.0
Q ss_pred HHHHHHHHHHHHHHHH
Q psy16862 48 MCFIIITAILGNLLVI 63 (303)
Q Consensus 48 ~~~i~i~gl~~N~~vl 63 (303)
+++++++|++-|++==
T Consensus 11 lill~lvG~i~N~iK~ 26 (62)
T PRK13664 11 LVLVFLVGVLLNVIKD 26 (62)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3566778888886543
No 120
>PRK04989 psbM photosystem II reaction center protein M; Provisional
Probab=22.64 E-value=83 Score=17.41 Aligned_cols=23 Identities=9% Similarity=0.144 Sum_probs=16.4
Q ss_pred HHHHHhhhhhhhhhhHHHHHHHH
Q psy16862 206 YALISSSISFWIPGFVMKKKKKR 228 (303)
Q Consensus 206 ~~~~~~~~~~~ip~~i~~~~y~~ 228 (303)
..++......++|.+..++.|++
T Consensus 6 lgfiAt~Lfi~iPt~FLlilYvq 28 (35)
T PRK04989 6 LGFVASLLFVLVPTVFLIILYIQ 28 (35)
T ss_pred HHHHHHHHHHHHHHHHHHHHhee
Confidence 34455566678898888888865
No 121
>PRK03557 zinc transporter ZitB; Provisional
Probab=22.50 E-value=5.2e+02 Score=22.45 Aligned_cols=18 Identities=22% Similarity=0.401 Sum_probs=9.6
Q ss_pred HHHHHHHHHHHHHHHhhc
Q psy16862 51 IIITAILGNLLVIISVIK 68 (303)
Q Consensus 51 i~i~gl~~N~~vl~~i~~ 68 (303)
+.++|++.|.+..+...+
T Consensus 127 v~~~~~~~~~~~~~~~~~ 144 (312)
T PRK03557 127 IAVAGLLANILSFWLLHH 144 (312)
T ss_pred HHHHHHHHHHHHHHHHhc
Confidence 344566667655544433
No 122
>PF05399 EVI2A: Ectropic viral integration site 2A protein (EVI2A); InterPro: IPR008608 This family contains several mammalian ectropic viral integration site 2A (EVI2A) proteins. The function of this protein is unknown although it is thought to be a membrane protein and may function as an oncogene in retrovirus induced myeloid tumours [, ].; GO: 0016021 integral to membrane
Probab=22.15 E-value=2.4e+02 Score=22.95 Aligned_cols=26 Identities=12% Similarity=0.036 Sum_probs=14.4
Q ss_pred ccCCCchhHHHHHHHhhhhhhhhHHH
Q psy16862 254 NRTSRGEEEEEKKKKKKKKKKKTTKK 279 (303)
Q Consensus 254 ~~~~~~~~~~~~~~~~~~k~~~~~~~ 279 (303)
+.-...+.+.=+|.+.-.-.+-.+|.
T Consensus 182 SgLWPa~sdTWkRakqltg~~l~mQs 207 (227)
T PF05399_consen 182 SGLWPAESDTWKRAKQLTGPNLMMQS 207 (227)
T ss_pred cccCccccchhhhhhhccCccceecc
Confidence 34445666666665555555544444
No 123
>PHA03164 hypothetical protein; Provisional
Probab=22.08 E-value=89 Score=20.66 Aligned_cols=21 Identities=24% Similarity=0.314 Sum_probs=15.9
Q ss_pred ccchhHHHHHHHHHhHHHHhh
Q psy16862 73 RIITNYFVVSLAFADLLVALC 93 (303)
Q Consensus 73 ~~~~~~ll~~Lai~Dl~~~l~ 93 (303)
++.+.+.+..||++-+++.++
T Consensus 56 ktftFlvLtgLaIamILfiif 76 (88)
T PHA03164 56 KTFTFLVLTGLAIAMILFIIF 76 (88)
T ss_pred heeehHHHHHHHHHHHHHHHH
Confidence 566778888999988777543
No 124
>PF10856 DUF2678: Protein of unknown function (DUF2678); InterPro: IPR022564 This family of proteins has no known function.
Probab=22.01 E-value=1.7e+02 Score=21.37 Aligned_cols=6 Identities=17% Similarity=0.085 Sum_probs=2.5
Q ss_pred cchhHH
Q psy16862 74 IITNYF 79 (303)
Q Consensus 74 ~~~~~l 79 (303)
.|.|++
T Consensus 58 ~p~~if 63 (118)
T PF10856_consen 58 KPLHIF 63 (118)
T ss_pred CceEEe
Confidence 344444
No 125
>TIGR03038 PS_II_psbM photosystem II reaction center protein PsbM. Members of this protein family are the photosystem II reaction center M protein, product of the psbM gene, in Cyanobacteria and their derived organelles in plants. This model resembles Pfam model pfam05151 but has cutoffs set to avoid false-positive matches to similar (not necessarily homologous) sequences in species that are not photosynthetic.
Probab=21.95 E-value=93 Score=16.96 Aligned_cols=23 Identities=9% Similarity=0.160 Sum_probs=16.4
Q ss_pred HHHHHhhhhhhhhhhHHHHHHHH
Q psy16862 206 YALISSSISFWIPGFVMKKKKKR 228 (303)
Q Consensus 206 ~~~~~~~~~~~ip~~i~~~~y~~ 228 (303)
..++......++|.+..++.|++
T Consensus 6 l~fiAt~Lfi~iPt~FLiilYvq 28 (33)
T TIGR03038 6 LGFIATLLFILVPTVFLLILYIQ 28 (33)
T ss_pred HHHHHHHHHHHHHHHHHHHHhee
Confidence 34455566678898888888865
No 126
>PF15345 TMEM51: Transmembrane protein 51
Probab=20.12 E-value=1.3e+02 Score=24.94 Aligned_cols=21 Identities=10% Similarity=0.072 Sum_probs=10.8
Q ss_pred HHHHHHHHHHHHHHHHhhhhh
Q psy16862 224 KKKKRKKKKKKKKKKKRKKKK 244 (303)
Q Consensus 224 ~~y~~I~~~l~~~~~~~~~~~ 244 (303)
.+-..|...+|.++|++....
T Consensus 71 LLLLSICL~IR~KRr~rq~~e 91 (233)
T PF15345_consen 71 LLLLSICLSIRDKRRRRQGEE 91 (233)
T ss_pred HHHHHHHHHHHHHHHHhhccc
Confidence 333444555666655555444
Done!