Query psy3982
Match_columns 185
No_of_seqs 118 out of 1506
Neff 10.2
Searched_HMMs 46136
Date Fri Aug 16 22:15:20 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy3982.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/3982hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PHA03234 DNA packaging protein 99.9 1.6E-22 3.5E-27 155.8 12.5 126 12-160 179-316 (338)
2 PHA03235 DNA packaging protein 99.9 4.9E-22 1.1E-26 156.4 11.6 129 10-160 184-323 (409)
3 KOG4219|consensus 99.9 2.6E-21 5.7E-26 146.6 9.2 141 11-161 185-336 (423)
4 PHA02834 chemokine receptor-li 99.8 5.9E-21 1.3E-25 146.6 8.6 110 29-160 190-309 (323)
5 PHA02638 CC chemokine receptor 99.8 1E-19 2.3E-24 143.7 12.2 130 11-162 252-399 (417)
6 PHA03087 G protein-coupled che 99.8 8.8E-19 1.9E-23 135.2 12.4 133 7-160 181-323 (335)
7 KOG4220|consensus 99.8 7.9E-19 1.7E-23 133.4 7.2 79 83-161 410-488 (503)
8 PF00001 7tm_1: 7 transmembran 99.7 4.4E-16 9.6E-21 114.6 10.7 123 11-143 133-257 (257)
9 KOG2087|consensus 99.3 5.4E-14 1.2E-18 105.9 -3.4 133 12-165 177-311 (363)
10 PF10324 7TM_GPCR_Srw: Serpent 99.3 1.1E-10 2.4E-15 89.6 11.6 117 32-160 196-317 (318)
11 PF05296 TAS2R: Mammalian tast 99.0 6.5E-09 1.4E-13 79.3 12.6 123 27-160 179-301 (303)
12 PF10320 7TM_GPCR_Srsx: Serpen 98.9 4.7E-08 1E-12 72.9 11.2 128 7-158 128-257 (257)
13 PF10323 7TM_GPCR_Srv: Serpent 98.6 1.5E-06 3.2E-11 65.8 13.5 116 32-160 167-283 (283)
14 PF10321 7TM_GPCR_Srt: Serpent 98.3 9.6E-06 2.1E-10 62.1 9.9 113 27-159 194-306 (313)
15 PF03402 V1R: Vomeronasal orga 98.1 3.3E-06 7.2E-11 62.8 3.2 111 34-154 153-263 (265)
16 PF10317 7TM_GPCR_Srd: Serpent 98.0 0.00022 4.9E-09 54.2 12.3 77 79-156 216-292 (292)
17 PF10327 7TM_GPCR_Sri: Serpent 98.0 0.00018 3.8E-09 55.1 11.2 108 30-153 195-302 (303)
18 PF10328 7TM_GPCR_Srx: Serpent 97.9 0.00064 1.4E-08 51.2 12.6 111 34-158 163-273 (274)
19 PF05462 Dicty_CAR: Slime mold 97.8 0.00082 1.8E-08 51.3 12.5 65 92-158 204-268 (303)
20 PF11970 Git3_C: G protein-cou 97.7 0.00025 5.4E-09 42.5 6.1 70 82-151 4-75 (76)
21 PF10326 7TM_GPCR_Str: Serpent 97.3 6.6E-05 1.4E-09 57.5 0.3 110 29-155 197-307 (307)
22 PF10318 7TM_GPCR_Srh: Serpent 97.1 0.009 1.9E-07 45.6 10.5 79 80-160 223-302 (302)
23 KOG4193|consensus 96.9 0.019 4.1E-07 48.1 11.3 67 87-159 514-580 (610)
24 PF10319 7TM_GPCR_Srj: Serpent 96.6 0.04 8.6E-07 42.1 10.2 74 79-153 235-309 (310)
25 PF02101 Ocular_alb: Ocular al 96.6 0.023 4.9E-07 44.3 8.7 24 91-114 245-268 (405)
26 PF03125 Sre: C. elegans Sre G 96.2 0.14 2.9E-06 40.4 11.4 44 120-163 281-324 (365)
27 PF02118 Srg: Srg family chemo 96.2 0.21 4.6E-06 37.5 12.0 98 37-153 177-274 (275)
28 KOG4564|consensus 95.6 0.086 1.9E-06 42.7 7.9 100 39-160 318-419 (473)
29 PF04789 DUF621: Protein of un 92.0 4.2 9.1E-05 30.8 11.8 112 25-158 184-296 (305)
30 PF10292 7TM_GPCR_Srab: Serpen 85.6 15 0.00032 28.5 12.5 78 80-158 223-304 (324)
31 PF02076 STE3: Pheromone A rec 82.0 21 0.00044 27.3 11.7 25 87-112 196-220 (283)
32 PF10322 7TM_GPCR_Sru: Serpent 81.5 13 0.00028 28.8 7.8 104 34-152 201-304 (307)
33 PF13853 7tm_4: Olfactory rece 71.9 0.0045 9.8E-08 42.1 -12.0 69 23-112 52-120 (144)
34 KOG0721|consensus 70.4 40 0.00086 24.7 7.2 16 144-159 110-125 (230)
35 KOG2575|consensus 66.3 11 0.00023 30.3 4.0 32 79-110 248-279 (510)
36 PF11014 DUF2852: Protein of u 64.5 13 0.00029 24.1 3.6 26 91-117 11-36 (115)
37 PF05478 Prominin: Prominin; 62.3 38 0.00082 30.1 7.1 33 86-118 457-489 (806)
38 COG1862 YajC Preprotein transl 58.9 30 0.00066 21.8 4.4 30 39-68 14-43 (97)
39 PF15086 UPF0542: Uncharacteri 58.2 37 0.00079 20.1 6.4 37 27-63 19-56 (74)
40 KOG3500|consensus 55.5 22 0.00048 21.5 3.1 26 85-112 30-55 (84)
41 PRK04989 psbM photosystem II r 54.0 27 0.00058 17.4 2.8 22 29-50 7-28 (35)
42 PF02117 7TM_GPCR_Sra: Serpent 52.6 1.1E+02 0.0024 23.9 9.1 34 82-115 228-261 (328)
43 COG4736 CcoQ Cbb3-type cytochr 52.2 43 0.00093 19.0 4.9 39 29-67 4-42 (60)
44 TIGR03038 PS_II_psbM photosyst 50.2 30 0.00066 17.0 2.6 22 29-50 7-28 (33)
45 PF06305 DUF1049: Protein of u 50.2 47 0.001 18.9 6.7 35 29-63 24-58 (68)
46 PHA02650 hypothetical protein; 48.6 54 0.0012 19.7 4.0 28 26-53 47-74 (81)
47 PRK06531 yajC preprotein trans 48.1 42 0.00091 21.8 3.9 7 39-45 7-13 (113)
48 PF09889 DUF2116: Uncharacteri 47.6 27 0.00058 19.8 2.6 20 84-103 34-53 (59)
49 KOG3827|consensus 47.6 12 0.00025 29.8 1.5 63 86-148 60-124 (400)
50 CHL00080 psbM photosystem II p 45.7 39 0.00085 16.7 2.7 22 29-50 7-28 (34)
51 PHA02844 putative transmembran 45.6 44 0.00095 19.8 3.3 26 27-52 47-72 (75)
52 COG2322 Predicted membrane pro 45.5 1E+02 0.0023 21.6 7.9 37 81-117 71-107 (177)
53 COG3924 Predicted membrane pro 43.0 71 0.0015 18.9 4.0 33 22-54 36-68 (80)
54 TIGR00739 yajC preprotein tran 42.4 71 0.0015 19.4 4.1 24 43-66 12-35 (84)
55 PHA02975 hypothetical protein; 40.3 76 0.0017 18.5 4.2 27 26-52 42-68 (69)
56 PF10322 7TM_GPCR_Sru: Serpent 40.3 1.8E+02 0.0038 22.7 6.9 17 37-53 208-224 (307)
57 PF05817 Ribophorin_II: Oligos 39.6 2.6E+02 0.0055 24.3 8.7 28 28-55 548-575 (636)
58 PHA02819 hypothetical protein; 38.5 84 0.0018 18.5 4.0 26 26-51 44-69 (71)
59 PF05151 PsbM: Photosystem II 38.3 52 0.0011 15.9 3.7 21 30-50 8-28 (31)
60 PF15102 TMEM154: TMEM154 prot 37.2 2.5 5.4E-05 28.6 -3.1 7 35-41 59-65 (146)
61 PF07330 DUF1467: Protein of u 36.5 79 0.0017 19.4 3.6 34 73-106 38-71 (85)
62 PF04238 DUF420: Protein of un 34.2 1.5E+02 0.0031 19.9 8.6 38 81-118 30-67 (133)
63 PF11710 Git3: G protein-coupl 33.2 1.3E+02 0.0027 21.7 4.9 35 27-61 166-200 (201)
64 PRK14094 psbM photosystem II r 32.1 37 0.0008 18.1 1.4 22 29-50 7-28 (50)
65 PLN00090 photosystem II reacti 31.0 1E+02 0.0023 19.2 3.5 22 29-50 77-98 (113)
66 COG3771 Predicted membrane pro 30.6 1.4E+02 0.0029 18.5 6.0 42 29-70 45-86 (97)
67 PF00002 7tm_2: 7 transmembran 30.4 17 0.00037 26.6 0.0 21 87-107 195-215 (242)
68 PF02699 YajC: Preprotein tran 30.1 1.1E+02 0.0024 18.5 3.5 18 42-59 10-27 (82)
69 PRK05886 yajC preprotein trans 30.1 1.6E+02 0.0034 19.1 4.8 24 37-60 6-30 (109)
70 COG4068 Uncharacterized protei 28.8 1.2E+02 0.0025 17.2 3.2 19 84-102 38-56 (64)
71 PF11119 DUF2633: Protein of u 28.8 1.2E+02 0.0026 17.2 4.1 30 83-112 4-33 (59)
72 PF12725 DUF3810: Protein of u 28.4 2.9E+02 0.0063 21.6 7.7 22 86-107 58-79 (318)
73 PF14163 SieB: Superinfection 28.4 1.2E+02 0.0026 20.5 4.0 22 93-114 4-25 (151)
74 PF02439 Adeno_E3_CR2: Adenovi 27.9 95 0.0021 15.8 3.3 28 29-56 7-34 (38)
75 PTZ00370 STEVOR; Provisional 27.3 1.1E+02 0.0025 23.4 3.8 17 38-54 265-281 (296)
76 PF00664 ABC_membrane: ABC tra 27.2 2.3E+02 0.005 20.1 9.6 20 122-141 256-275 (275)
77 PF00375 SDF: Sodium:dicarboxy 26.3 3.4E+02 0.0074 21.7 9.9 73 86-158 166-246 (390)
78 PF05241 EBP: Emopamil binding 26.0 1.2E+02 0.0025 21.8 3.6 32 27-58 160-191 (194)
79 PHA03054 IMV membrane protein; 26.0 1.5E+02 0.0032 17.5 4.0 26 26-51 46-71 (72)
80 TIGR02736 cbb3_Q_epsi cytochro 25.6 1.3E+02 0.0029 16.8 3.0 21 39-59 7-27 (56)
81 COG4665 FcbT2 TRAP-type mannit 25.1 2.5E+02 0.0055 19.8 6.9 33 24-56 88-120 (182)
82 TIGR01478 STEVOR variant surfa 24.5 1.1E+02 0.0024 23.4 3.3 15 39-53 270-284 (295)
83 PHA03234 DNA packaging protein 24.5 89 0.0019 24.4 3.1 11 101-111 205-215 (338)
84 PF10669 Phage_Gp23: Protein g 22.5 2.1E+02 0.0046 18.0 5.5 15 37-51 23-37 (121)
85 PHA01815 hypothetical protein 22.4 1.4E+02 0.003 15.8 4.3 23 31-53 9-31 (55)
86 PF10066 DUF2304: Uncharacteri 22.4 2.3E+02 0.0049 18.3 11.2 51 83-134 29-79 (115)
87 TIGR03052 PS_I_psaI photosyste 22.2 1E+02 0.0022 14.9 1.9 19 32-50 8-26 (31)
88 TIGR00934 2a38euk potassium up 21.7 5.1E+02 0.011 23.3 7.1 32 84-115 443-474 (800)
89 PF03155 Alg6_Alg8: ALG6, ALG8 21.3 1.4E+02 0.003 24.8 3.6 27 83-109 218-244 (469)
90 PF08525 OapA_N: Opacity-assoc 20.3 1.2E+02 0.0026 14.4 2.9 23 82-106 7-29 (30)
91 PHA02909 hypothetical protein; 20.1 1.8E+02 0.0038 16.2 4.5 20 32-51 40-59 (72)
92 PF10361 DUF2434: Protein of u 20.1 4.2E+02 0.0091 20.5 6.8 46 10-64 32-77 (296)
No 1
>PHA03234 DNA packaging protein UL33; Provisional
Probab=99.89 E-value=1.6e-22 Score=155.78 Aligned_cols=126 Identities=17% Similarity=0.261 Sum_probs=94.1
Q ss_pred ccccccccccccccchHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCchHHHHH
Q psy3982 12 TACGTDYLTKDWHHRSYL---IFYGFFVYFVPLILIIYAYYFIVRAVSVHEKQMREQAKKMNVATLRSGDQSGTSAEIKL 88 (185)
Q Consensus 12 ~~C~~~~~~~~~~~~~y~---~~~~~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 88 (185)
+.|...|+.++. ...+. ...+.+.+++|+++|++||.+|.+.++++ +.++++|+
T Consensus 179 ~~C~~~~~~~~~-~~~~~~~~~~~~~~~f~iPl~im~~cY~~I~~~L~~~----------------------~~~~~~k~ 235 (338)
T PHA03234 179 GKCNIHISSKKA-YDLFIAIKIVFCFIWGIFPTMIFSFFYVIFCKALHAL----------------------TEKKHKKT 235 (338)
T ss_pred CcCcccCCcchh-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh----------------------hhhhhhhh
Confidence 579887754321 11222 12234457999999999999999888742 01346789
Q ss_pred HHHHHHHHHHHHHHhhHHHHHHHHHhhhc----cc-----chHHHHHHHHHHHhhhhhccchhheecchHHHHHHHhhcc
Q psy3982 89 AKVALMTISLWFLAWTPYLVINFGGILEM----VE-----INPLITIWGSVFAKANAVYNPIVYAISHPKFRQALDKKFP 159 (185)
Q Consensus 89 ~~~~~~i~~~f~i~~~P~~v~~~~~~~~~----~~-----~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~~~~~~ 159 (185)
+|++++++++|++||+||++..++..... .. .......++.+++++||++||+||++.+++||+++++.++
T Consensus 236 ~k~i~~vv~vF~iCWlPy~iv~l~~~~~~~~~~~~c~~~~~~~~~~~v~~~La~~nsclNPiIY~f~~~~FR~~~~~~~~ 315 (338)
T PHA03234 236 LFFIRILILSFLCIQIPNIAILICEIAFLYIANNSCFGLAQREILQIIIRLMPEIHCFSNPLVYAFTGGDFRLRFTACFQ 315 (338)
T ss_pred hhHHHHHHHHHHHHHhHHHHHHHHHHHHHhcccCcchHHHHHHHHHHHHHHHHHhhhhhhHHHHHHhhHHHHHHHHHHHH
Confidence 99999999999999999999887654321 00 1234567888999999999999999999999999998776
Q ss_pred c
Q psy3982 160 S 160 (185)
Q Consensus 160 ~ 160 (185)
+
T Consensus 316 ~ 316 (338)
T PHA03234 316 D 316 (338)
T ss_pred H
Confidence 4
No 2
>PHA03235 DNA packaging protein UL33; Provisional
Probab=99.88 E-value=4.9e-22 Score=156.39 Aligned_cols=129 Identities=12% Similarity=0.272 Sum_probs=99.3
Q ss_pred Cccccccccccccc--ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCchHHHH
Q psy3982 10 NLTACGTDYLTKDW--HHRSYLIFYGFFVYFVPLILIIYAYYFIVRAVSVHEKQMREQAKKMNVATLRSGDQSGTSAEIK 87 (185)
Q Consensus 10 ~~~~C~~~~~~~~~--~~~~y~~~~~~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 87 (185)
+...|...+..+.. ....|.+..+++.|++|+++|++||.+|++.++++.+ ++++|
T Consensus 184 ~~~~C~~~~~~~~~~~~~~~y~i~l~i~~f~iPl~im~~~Y~~I~~~l~~~~~----------------------~~~~k 241 (409)
T PHA03235 184 GYETCVIYFRADQVKTVLSTFKVLLTLVWGIAPVVMMTWFYTFFYRTLKRASY----------------------KKRSR 241 (409)
T ss_pred CcceeeEeCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh----------------------hcchh
Confidence 34579765544321 1234666777788999999999999999999986421 12457
Q ss_pred HHHHHHHHHHHHHHHhhHHHHHHHHHhhhcc----c-----chHHHHHHHHHHHhhhhhccchhheecchHHHHHHHhhc
Q psy3982 88 LAKVALMTISLWFLAWTPYLVINFGGILEMV----E-----INPLITIWGSVFAKANAVYNPIVYAISHPKFRQALDKKF 158 (185)
Q Consensus 88 ~~~~~~~i~~~f~i~~~P~~v~~~~~~~~~~----~-----~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~~~~~ 158 (185)
..+++++++++|++||+||++..++..+... . ....+..++.+++++|+++||+||.+.+++||+++++.+
T Consensus 242 ~~~~v~iivv~F~iCWlPy~v~~l~~~~~~~~~~~~~~~~~~~~~~~~i~~~La~~ns~lNPiIY~~~~~~FRk~~~~~l 321 (409)
T PHA03235 242 TLTFVCILLLSFLCLQTPFVAIMIFDSYATLIWPSDCEHINLRDAVSTLSRLVPNLHCLLNPILYAFLGNDFLKRFRQCF 321 (409)
T ss_pred hhhhHHHHHHHHHHHHhHHHHHHHHHHHHHhcCCCCchhhhHHHHHHHHHHHHHHHHHhHhHHHHHHhhHHHHHHHHHHH
Confidence 7888999999999999999999887654321 1 234567788999999999999999999999999999888
Q ss_pred cc
Q psy3982 159 PS 160 (185)
Q Consensus 159 ~~ 160 (185)
++
T Consensus 322 ~~ 323 (409)
T PHA03235 322 RG 323 (409)
T ss_pred hh
Confidence 64
No 3
>KOG4219|consensus
Probab=99.85 E-value=2.6e-21 Score=146.56 Aligned_cols=141 Identities=20% Similarity=0.333 Sum_probs=111.9
Q ss_pred ccccccccccc--c-----cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCch
Q psy3982 11 LTACGTDYLTK--D-----WHHRSYLIFYGFFVYFVPLILIIYAYYFIVRAVSVHEKQMREQAKKMNVATLRSGDQSGTS 83 (185)
Q Consensus 11 ~~~C~~~~~~~--~-----~~~~~y~~~~~~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (185)
...|...|+.. . ...+.|..++.++.+++|++++..+|..|...+|..+.... +.. ++.+..+
T Consensus 185 ~~~~~~~~pe~~~~~~~~~~~~~~y~~vl~~lqYflPliVl~~~Yt~iav~LW~~~~~gd-~~d---------~~~~~~k 254 (423)
T KOG4219|consen 185 RVVCVTAWPEHVCPTENESLLMQGYNYVLLFLQYFLPLIVLGLAYTVIAVTLWGRRIPGD-QQD---------RKHEQLK 254 (423)
T ss_pred eEEEEEecccccCCcchhhhhhcceeeeehhHHHHHHHHHHHHHHHHHHHHHHhccCccc-hhc---------hhhHHHH
Confidence 55676666554 1 12234888899999999999999999999999996542211 110 1334567
Q ss_pred HHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhhhcc----cchHHHHHHHHHHHhhhhhccchhheecchHHHHHHHhhcc
Q psy3982 84 AEIKLAKVALMTISLWFLAWTPYLVINFGGILEMV----EINPLITIWGSVFAKANAVYNPIVYAISHPKFRQALDKKFP 159 (185)
Q Consensus 84 ~~~~~~~~~~~i~~~f~i~~~P~~v~~~~~~~~~~----~~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~~~~~~ 159 (185)
.++|+.||++++++.|.+||+||++..++.....+ ......+....||+.+|+|.||+||++.|++||.++++.++
T Consensus 255 ak~K~vkmliiVV~~FaicWlPyh~y~il~~~~~~i~~~k~i~~vyl~~~WLaMSst~yNPiIY~~lN~Rfr~gf~~~fr 334 (423)
T KOG4219|consen 255 AKKKVVKMLIIVVVIFAICWLPYHIYFILNATNPEINRKKFIQQVYLAIYWLAMSSTCYNPIIYCFLNKRFRGGFRRAFR 334 (423)
T ss_pred HHHHHHHHHHHHHHHHHHhccChhHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhhccHhhhhhHHHHHHHHhhhhh
Confidence 78899999999999999999999999988764443 25577888999999999999999999999999999999997
Q ss_pred cc
Q psy3982 160 SL 161 (185)
Q Consensus 160 ~~ 161 (185)
++
T Consensus 335 ~c 336 (423)
T KOG4219|consen 335 WC 336 (423)
T ss_pred ee
Confidence 65
No 4
>PHA02834 chemokine receptor-like protein; Provisional
Probab=99.84 E-value=5.9e-21 Score=146.61 Aligned_cols=110 Identities=20% Similarity=0.376 Sum_probs=87.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHHHHHHHHHHHhhHHHH
Q psy3982 29 LIFYGFFVYFVPLILIIYAYYFIVRAVSVHEKQMREQAKKMNVATLRSGDQSGTSAEIKLAKVALMTISLWFLAWTPYLV 108 (185)
Q Consensus 29 ~~~~~~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~f~i~~~P~~v 108 (185)
....+++.+++|+++|+++|.+|.+.++++++ +.++|.+|+++++++.|++||+|+++
T Consensus 190 ~~~~~i~~f~iPl~ii~~~Y~~I~~~l~~~~~----------------------~~~~k~~k~~~~vv~~F~icWlPy~i 247 (323)
T PHA02834 190 NFEINIFGIVIPLIILIYCYSKILYTLKNCKN----------------------KNKTRSIKIILTVVTFTVVFWVPFNI 247 (323)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC----------------------cccceEEeehhHHHHHHHHHHhhHHH
Confidence 33455778999999999999999999875311 22457889999999999999999999
Q ss_pred HHHHHhhhccc--------c--hHHHHHHHHHHHhhhhhccchhheecchHHHHHHHhhccc
Q psy3982 109 INFGGILEMVE--------I--NPLITIWGSVFAKANAVYNPIVYAISHPKFRQALDKKFPS 160 (185)
Q Consensus 109 ~~~~~~~~~~~--------~--~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~~~~~~~ 160 (185)
..++..+.... . ......+..+++++||++||+||++.+++||+++++++++
T Consensus 248 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~~ns~iNPiIY~~~~~~fR~~~~~~~~~ 309 (323)
T PHA02834 248 VLFINSLQSVGLIDIGCYHFKKIVYSIDIAELISFVHCCVNPIIYAFVGKNFKKVFKNMFCR 309 (323)
T ss_pred HHHHHHHHHhcCCCccchHHHHHHHHHHHHHHHHHhhccccHHHHHHhcHHHHHHHHHHHHh
Confidence 88876543211 0 1123457889999999999999999999999999998743
No 5
>PHA02638 CC chemokine receptor-like protein; Provisional
Probab=99.82 E-value=1e-19 Score=143.74 Aligned_cols=130 Identities=15% Similarity=0.233 Sum_probs=97.9
Q ss_pred ccccccccccccc----ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCchHHH
Q psy3982 11 LTACGTDYLTKDW----HHRSYLIFYGFFVYFVPLILIIYAYYFIVRAVSVHEKQMREQAKKMNVATLRSGDQSGTSAEI 86 (185)
Q Consensus 11 ~~~C~~~~~~~~~----~~~~y~~~~~~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 86 (185)
...|...++.+.. ....+.+...++++++|+++|++||.+|++.+++.+ .++++
T Consensus 252 ~~~C~~~~~~~~~~~~~~~~~~~~~~~i~~f~lPl~vmi~cY~~I~~~L~~~~----------------------~~~k~ 309 (417)
T PHA02638 252 NYQCTLIEDNEKNNISFLGRILQFEINILGMFIPIIIFAFCYIKIILKLKQLK----------------------KSKKT 309 (417)
T ss_pred CCeeeeeccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh----------------------ccccc
Confidence 4579765543221 112344455677899999999999999999987531 12356
Q ss_pred HHHHHHHHHHHHHHHHhhHHHHHHHHHhhhccc--------------chHHHHHHHHHHHhhhhhccchhheecchHHHH
Q psy3982 87 KLAKVALMTISLWFLAWTPYLVINFGGILEMVE--------------INPLITIWGSVFAKANAVYNPIVYAISHPKFRQ 152 (185)
Q Consensus 87 ~~~~~~~~i~~~f~i~~~P~~v~~~~~~~~~~~--------------~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~ 152 (185)
|.+|+++.++++|++||+|+.+..++....... .......+..+++++||++||+||++.+++||+
T Consensus 310 k~~rli~~ivi~f~lcW~Py~i~~ll~~~~~~~~~~~~~~~~~c~~~~l~~~~~vt~~la~~~sclNPiIY~f~~~~FR~ 389 (417)
T PHA02638 310 KSIIIVSIIIICSLICWIPLNIVILFATMYSFKGFNSIISEHICGFIKLGYAMMLAEAISLTHCCINPLIYTLIGENFRM 389 (417)
T ss_pred hhhhhHHHHHHHHHHHHhhHHHHHHHHHHHHhhccccccccccccHHHHHHHHHHHHHHHHHHHhhhHHHHHHhCHHHHH
Confidence 778999999999999999999998877653210 123455678899999999999999999999999
Q ss_pred HHHhhccccc
Q psy3982 153 ALDKKFPSLV 162 (185)
Q Consensus 153 ~~~~~~~~~~ 162 (185)
++++.+++..
T Consensus 390 ~l~~~~~~~~ 399 (417)
T PHA02638 390 HLLMIFRNIF 399 (417)
T ss_pred HHHHHHHHhc
Confidence 9999886543
No 6
>PHA03087 G protein-coupled chemokine receptor-like protein; Provisional
Probab=99.80 E-value=8.8e-19 Score=135.22 Aligned_cols=133 Identities=18% Similarity=0.305 Sum_probs=98.6
Q ss_pred ccCCcccccccccccccccchH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCchHH
Q psy3982 7 PEGNLTACGTDYLTKDWHHRSY-LIFYGFFVYFVPLILIIYAYYFIVRAVSVHEKQMREQAKKMNVATLRSGDQSGTSAE 85 (185)
Q Consensus 7 ~~~~~~~C~~~~~~~~~~~~~y-~~~~~~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 85 (185)
.+.+...|...+.++......+ .....++.+++|+++++++|.+|.+.++++++ .+++
T Consensus 181 ~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~lP~~ii~~~y~~i~~~l~~~~~---------------------~~~~ 239 (335)
T PHA03087 181 KDHETLICCMFYNNKTMNWKLFINFEINIIGMLIPLTILLYCYSKILITLKGINK---------------------SKKN 239 (335)
T ss_pred ccCCCceEEecCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc---------------------chhc
Confidence 3445567887776433222222 23345677899999999999999888875321 2346
Q ss_pred HHHHHHHHHHHHHHHHHhhHHHHHHHHHhhhc-----c-c---chHHHHHHHHHHHhhhhhccchhheecchHHHHHHHh
Q psy3982 86 IKLAKVALMTISLWFLAWTPYLVINFGGILEM-----V-E---INPLITIWGSVFAKANAVYNPIVYAISHPKFRQALDK 156 (185)
Q Consensus 86 ~~~~~~~~~i~~~f~i~~~P~~v~~~~~~~~~-----~-~---~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~~~ 156 (185)
+|+.|++..++++|++||+|+.+..++..... . . ....+..+..+++.+|+++||+||++++++||+++++
T Consensus 240 ~k~~k~l~~iv~~f~i~w~P~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~~ns~~NPiIY~~~~~~fr~~~~~ 319 (335)
T PHA03087 240 KKAIKLVLIIVILFVIFWLPFNVSVFVYSLHILHFKSGCKAVKYIQYALHVTEIISLSHCCINPLIYAFVSEFFNKHKKK 319 (335)
T ss_pred chHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcHHHHHHHHHHHHHHHHHHHHhhhhhHHHHcCHHHHHHHHH
Confidence 78999999999999999999999877654311 1 1 2244556788899999999999999999999999999
Q ss_pred hccc
Q psy3982 157 KFPS 160 (185)
Q Consensus 157 ~~~~ 160 (185)
.++.
T Consensus 320 ~~~~ 323 (335)
T PHA03087 320 SLKL 323 (335)
T ss_pred HHHH
Confidence 8853
No 7
>KOG4220|consensus
Probab=99.77 E-value=7.9e-19 Score=133.41 Aligned_cols=79 Identities=23% Similarity=0.371 Sum_probs=74.9
Q ss_pred hHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhhhcccchHHHHHHHHHHHhhhhhccchhheecchHHHHHHHhhcccc
Q psy3982 83 SAEIKLAKVALMTISLWFLAWTPYLVINFGGILEMVEINPLITIWGSVFAKANAVYNPIVYAISHPKFRQALDKKFPSL 161 (185)
Q Consensus 83 ~~~~~~~~~~~~i~~~f~i~~~P~~v~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~~~~~~~~ 161 (185)
.+|+|++|++.+|.++|+++|.||.++.++..+.....+..++.+..||-++||-+||++|.++|+.||+.++++|.|.
T Consensus 410 ~rErKAAkTLsAILlAFIiTWtPYNImVlv~tFC~~CiP~tlW~~gYwLCYINSTiNP~CYALCNatFrkTfk~lL~Cr 488 (503)
T KOG4220|consen 410 VRERKAAKTLSAILLAFILTWTPYNIMVLVNTFCKNCIPETLWTFGYWLCYINSTINPLCYALCNATFRKTFKRLLLCR 488 (503)
T ss_pred HHHHHHHHHHHHHHHHHHHHcccceeeeehHhhcccccchhHhhhhhheeeecccccHHHHHHHhHHHHHHHHHhheee
Confidence 5789999999999999999999999999999988888999999999999999999999999999999999999999654
No 8
>PF00001 7tm_1: 7 transmembrane receptor (rhodopsin family) Rhodopsin-like GPCR superfamily signature 5-hydroxytryptamine 7 receptor signature bradykinin receptor signature gastrin receptor signature melatonin receptor signature olfactory receptor signature; InterPro: IPR000276 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The rhodopsin-like GPCRs themselves represent a widespread protein family that includes hormone, neurotransmitter and light receptors, all of which transduce extracellular signals through interaction with guanine nucleotide-binding (G) proteins. Although their activating ligands vary widely in structure and character, the amino acid sequences of the receptors are very similar and are believed to adopt a common structural framework comprising 7 transmembrane (TM) helices [, , ].; GO: 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane; PDB: 2KI9_A 3QAK_A 2YDV_A 3VGA_A 3PWH_A 3RFM_A 3EML_A 3VG9_A 3REY_A 3UZA_A ....
Probab=99.68 E-value=4.4e-16 Score=114.58 Aligned_cols=123 Identities=22% Similarity=0.525 Sum_probs=101.1
Q ss_pred cccccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCchHHHHHHH
Q psy3982 11 LTACGTDYLTKDWHHRSYLIFYGFFVYFVPLILIIYAYYFIVRAVSVHEKQMREQAKKMNVATLRSGDQSGTSAEIKLAK 90 (185)
Q Consensus 11 ~~~C~~~~~~~~~~~~~y~~~~~~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 90 (185)
...|..++..+. ...+.....++.+++|+++++++|.+|++.+++++++...+.... ..++.++++|.++
T Consensus 133 ~~~C~~~~~~~~--~~~~~~~~~~~~~~~p~~~~~~~~~~i~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~ 202 (257)
T PF00001_consen 133 QSFCFIDFSSSS--SQIYFIYFFIVFFILPLIIILICYIRILRKLRRQRKRIKSQSSSS--------SRRRSRRERRAAR 202 (257)
T ss_dssp CEEEEESCSSSH--HHHHHHHHHHHHTHHHHHHHHHHHHHHHHHHHHHHHCTCCHTSHH--------HHHHHHHHHHHHH
T ss_pred cccccccccccc--cccccccccccccccceeeeeeecccccccccccccccccccccc--------ccccccccccccc
Confidence 567977665533 456788888889999999999999999999998877653221100 1234577899999
Q ss_pred HHHHHHHHHHHHhhHHHHHHHHHhhhccc--chHHHHHHHHHHHhhhhhccchhh
Q psy3982 91 VALMTISLWFLAWTPYLVINFGGILEMVE--INPLITIWGSVFAKANAVYNPIVY 143 (185)
Q Consensus 91 ~~~~i~~~f~i~~~P~~v~~~~~~~~~~~--~~~~~~~~~~~l~~~~~~~nPiiY 143 (185)
+++.++++|++||+|+.+..++....+.. .......++.++.++|+++||+||
T Consensus 203 ~~~~i~~~f~~~~~P~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~~s~~nP~iY 257 (257)
T PF00001_consen 203 TLLIIVLVFLLCWLPYFILSLLSVFSPSSSLISSILFYISYFLAFLNSCLNPIIY 257 (257)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHSSTSTCSHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred ccccccccccccCCceeHHHHHHHHcCccchhhHHHHHHHHHHHHHHHhhCcEEC
Confidence 99999999999999999999998888776 578899999999999999999998
No 9
>KOG2087|consensus
Probab=99.32 E-value=5.4e-14 Score=105.93 Aligned_cols=133 Identities=16% Similarity=0.235 Sum_probs=100.0
Q ss_pred ccccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCchHH-HHHHH
Q psy3982 12 TACGTDYLTKDWHHRSYLIFYGFFVYFVPLILIIYAYYFIVRAVSVHEKQMREQAKKMNVATLRSGDQSGTSAE-IKLAK 90 (185)
Q Consensus 12 ~~C~~~~~~~~~~~~~y~~~~~~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~ 90 (185)
..|.+....+. ....|.+..++..-.+-+++|.++|++++..+++- ....... ...+|
T Consensus 177 ~vClPL~~~~~-~s~g~y~~~~l~~N~lafiiia~~Y~~iy~~l~~~--------------------~~~~~~~~~~~ak 235 (363)
T KOG2087|consen 177 SVCLPLHIEEP-LSTGYYLVALLGLNLLAFIIIAFSYGKIYCSLRKG--------------------DLSATLISTSVAK 235 (363)
T ss_pred ceeeecccCCc-cchhHHHHHHHHHHHHHHHHHHHHhhhhheeeecC--------------------CCccccchhhhhh
Confidence 58987554432 23336666667777788999999999999999952 1111222 37788
Q ss_pred HHHHHHHHHHHHhhHHHHHHHHHhhhcccch-HHHHHHHHHHHhhhhhccchhheecchHHHHHHHhhcccccccc
Q psy3982 91 VALMTISLWFLAWTPYLVINFGGILEMVEIN-PLITIWGSVFAKANAVYNPIVYAISHPKFRQALDKKFPSLVCGT 165 (185)
Q Consensus 91 ~~~~i~~~f~i~~~P~~v~~~~~~~~~~~~~-~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~~~~~~~~~~~~ 165 (185)
-+..+++.-++||.|..+..+...+..+-.. .....+..++.-+|+++||++|.+.++.||+.++.++++..+++
T Consensus 236 r~a~LvfTd~icw~Pi~f~~~~al~~~~li~~~~sk~llv~flPlns~~NP~LYa~fT~~fk~d~~~l~~k~~~~k 311 (363)
T KOG2087|consen 236 RMAFLVFTDCICWCPIAFFKFSALIGVELISVSYSKWLLVFFLPLNSCLNPFLYAFFTPVFKEDLFLLLSKVGLCK 311 (363)
T ss_pred CeeEEEEccccccCchheeeeHHhcCCcccChhhceeEEEEEEEcccccCchhHHHcCHHHHHHHHHHHhhcchhh
Confidence 8889999999999999999998887766422 33335666777899999999999999999999999887654443
No 10
>PF10324 7TM_GPCR_Srw: Serpentine type 7TM GPCR chemoreceptor Srw; InterPro: IPR019427 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class w (Srw), which is a solo family amongst the superfamilies of chemoreceptors. The genes encoding Srw do not appear to be under as strong an adaptive evolutionary pressure as those of Srz [].
Probab=99.25 E-value=1.1e-10 Score=89.61 Aligned_cols=117 Identities=14% Similarity=0.202 Sum_probs=95.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHHHHHHHHHHHhhHHHHHHH
Q psy3982 32 YGFFVYFVPLILIIYAYYFIVRAVSVHEKQMREQAKKMNVATLRSGDQSGTSAEIKLAKVALMTISLWFLAWTPYLVINF 111 (185)
Q Consensus 32 ~~~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~f~i~~~P~~v~~~ 111 (185)
..++.-++|.++..++...+...+|+.++++++..+ + ..+++.+.+++++.+++.|+++-+|..+..+
T Consensus 196 ~gi~~kiiP~il~~ilti~Li~~Lrk~~~~r~~~~~-----------~-~~~~~~~tt~li~~~ti~f~i~e~p~gi~~~ 263 (318)
T PF10324_consen 196 DGIFFKIIPCILLPILTILLIIELRKAKKRRKKLSS-----------S-KSKKSDRTTKLILFMTISFLISELPQGIIFL 263 (318)
T ss_pred hhhHhhhhhHHHHHHHHHHHHHHHHhccHhhhcccc-----------c-ccccccchHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344457899999999999999999987777653321 1 1356779999999999999999999999999
Q ss_pred HHhhhccc-----chHHHHHHHHHHHhhhhhccchhheecchHHHHHHHhhccc
Q psy3982 112 GGILEMVE-----INPLITIWGSVFAKANAVYNPIVYAISHPKFRQALDKKFPS 160 (185)
Q Consensus 112 ~~~~~~~~-----~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~~~~~~~ 160 (185)
+.....+. ....+..+..++..+|+..|++||.+++.+||+++++++++
T Consensus 264 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~h~~ic~~mSsqYR~t~~~~f~~ 317 (318)
T PF10324_consen 264 LESFFEEDSGLIFIIIQLSIIFNILITINSSIHFFICCFMSSQYRKTVKKLFGC 317 (318)
T ss_pred HHHHHhhhcchhHHHHHHHHHHHHHHHHHhhhHHHhhhhhhHHHHHHHHHHhcc
Confidence 87753332 44567778899999999999999999999999999999853
No 11
>PF05296 TAS2R: Mammalian taste receptor protein (TAS2R); InterPro: IPR007960 This family consists of several forms of mammalian taste receptor proteins (TAS2Rs). TAS2Rs are G protein-coupled receptors expressed in subsets of taste receptor cells of the tongue and palate epithelia and are organised in the genome in clusters. The proteins are genetically linked to loci that influence bitter perception in mice and humans [].; GO: 0004930 G-protein coupled receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0050909 sensory perception of taste, 0016021 integral to membrane
Probab=99.04 E-value=6.5e-09 Score=79.28 Aligned_cols=123 Identities=16% Similarity=0.255 Sum_probs=93.2
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHHHHHHHHHHHhhHH
Q psy3982 27 SYLIFYGFFVYFVPLILIIYAYYFIVRAVSVHEKQMREQAKKMNVATLRSGDQSGTSAEIKLAKVALMTISLWFLAWTPY 106 (185)
Q Consensus 27 ~y~~~~~~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~f~i~~~P~ 106 (185)
.+......+..++|+++++.+...+...++||.|+.+....+. +..+.+.+.|+.|++....+.|+++++-.
T Consensus 179 ~~~~~~~~~~~~lPf~i~l~s~~lli~SL~rH~r~M~~n~~g~--------~~ps~~aH~~a~k~~~sfl~ly~~~~~~~ 250 (303)
T PF05296_consen 179 FYFFILFNLGSFLPFLIFLVSSILLIFSLWRHMRRMQKNATGF--------RDPSTEAHIRAIKTMISFLILYIIYFLSL 250 (303)
T ss_pred HHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHhhCCCCCC--------CCCCcHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444445588999999999999999999999999987543332 23356789999999988888888777554
Q ss_pred HHHHHHHhhhcccchHHHHHHHHHHHhhhhhccchhheecchHHHHHHHhhccc
Q psy3982 107 LVINFGGILEMVEINPLITIWGSVFAKANAVYNPIVYAISHPKFRQALDKKFPS 160 (185)
Q Consensus 107 ~v~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~~~~~~~ 160 (185)
.+........+ ......+...+..+.|..+|++-.+.|+++|+++++.+++
T Consensus 251 ~~~~~~~~~~~---~~~~~~~~~~i~~~yps~hs~iLIlgn~KLr~~~~~il~~ 301 (303)
T PF05296_consen 251 ILSFLSFFFPE---NSIWFWVCEIIIALYPSGHSIILILGNPKLRQALLKILWC 301 (303)
T ss_pred HHHHHHHHhcc---ccHHHHHHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHhh
Confidence 43222222222 2445667788999999999999999999999999998854
No 12
>PF10320 7TM_GPCR_Srsx: Serpentine type 7TM GPCR chemoreceptor Srsx; InterPro: IPR019424 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class sx (Srsx), which is a solo family amongst the superfamilies of chemoreceptors. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' [].
Probab=98.86 E-value=4.7e-08 Score=72.94 Aligned_cols=128 Identities=14% Similarity=0.259 Sum_probs=87.2
Q ss_pred ccCCcccccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCchHHH
Q psy3982 7 PEGNLTACGTDYLTKDWHHRSYLIFYGFFVYFVPLILIIYAYYFIVRAVSVHEKQMREQAKKMNVATLRSGDQSGTSAEI 86 (185)
Q Consensus 7 ~~~~~~~C~~~~~~~~~~~~~y~~~~~~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 86 (185)
+++....|.+...-++. ++....... .++ .+++++.|...+..+++++++ ..+.++
T Consensus 128 ~~~~~~~C~pp~a~~~~---~~~~~~~~~-~~i-nv~tvivY~i~~~~~~~k~~~-------------------~~~~~~ 183 (257)
T PF10320_consen 128 DDETIVICNPPLAFHGT---ASQIWSYSN-III-NVITVIVYIITIIIFKRKSRS-------------------NSSRSK 183 (257)
T ss_pred CCcccccCCCccccCcc---HHHHHHHHH-HHH-HHHHHHHHHHHHHHHHHHccc-------------------cchhHH
Confidence 44457789775443332 333222222 222 244456666666666654222 346688
Q ss_pred HHHHHHHHHHHHHHHHhhHHHHHHHHHhhhccc--chHHHHHHHHHHHhhhhhccchhheecchHHHHHHHhhc
Q psy3982 87 KLAKVALMTISLWFLAWTPYLVINFGGILEMVE--INPLITIWGSVFAKANAVYNPIVYAISHPKFRQALDKKF 158 (185)
Q Consensus 87 ~~~~~~~~i~~~f~i~~~P~~v~~~~~~~~~~~--~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~~~~~ 158 (185)
|+.|.+...++.|++.|.-..+...+....... ....+.....+++..+.+.|.++|.++|+|||+++++++
T Consensus 184 kv~ksL~v~v~i~i~~w~~s~~~~~v~~~~~~~~~~~~~i~~~~~i~v~~~~s~~ffV~~~~S~EYR~af~~~~ 257 (257)
T PF10320_consen 184 KVFKSLKVTVIIFIFSWFLSQIINTVSLALGLDGETIAIIQMYAGIFVNISYSQNFFVYYWRSSEYRKAFRELF 257 (257)
T ss_pred HHHHHhhhheeeeeHHHHHHHHHHHHHHHhCCcHHHHHHHHHHHHHHHHHHHHHHheEEEEcCHHHHHHHHHhC
Confidence 999999999999999998887776654433332 445577778899999999999999999999999999864
No 13
>PF10323 7TM_GPCR_Srv: Serpentine type 7TM GPCR chemoreceptor Srv; InterPro: IPR019426 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class v (Srv) from the Srg superfamily [, ]. Srg receptors contain seven hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures.
Probab=98.65 E-value=1.5e-06 Score=65.81 Aligned_cols=116 Identities=15% Similarity=0.220 Sum_probs=77.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHHHHHHHHHHHhhHHHHHHH
Q psy3982 32 YGFFVYFVPLILIIYAYYFIVRAVSVHEKQMREQAKKMNVATLRSGDQSGTSAEIKLAKVALMTISLWFLAWTPYLVINF 111 (185)
Q Consensus 32 ~~~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~f~i~~~P~~v~~~ 111 (185)
.......+.+++.+++|..+...+|++.+...++ ..+..++|.+.+.+..+...++++...=+.....
T Consensus 167 ~~~~~~~~~cv~~iv~Y~~i~~~iRk~~k~~s~~------------~s~~~~rE~~L~~~~~i~~~a~~~~~~~~~~~~~ 234 (283)
T PF10323_consen 167 IAFIFVSVTCVICIVCYGIIFIFIRKRNKKKSKS------------SSRSRRREIRLAIQVFILFCAFFVILVYYIFSNY 234 (283)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHh------------hhhhhhHhHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344455667888999999999999776411100 2335577888888877777666655544433333
Q ss_pred HHhhhcccc-hHHHHHHHHHHHhhhhhccchhheecchHHHHHHHhhccc
Q psy3982 112 GGILEMVEI-NPLITIWGSVFAKANAVYNPIVYAISHPKFRQALDKKFPS 160 (185)
Q Consensus 112 ~~~~~~~~~-~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~~~~~~~ 160 (185)
.....+.. ...+..+-..+..+.|.+||..-.+.|+++||.+++.++|
T Consensus 235 -~~~~~~~~~~~~~r~~y~~~~~~~s~inP~~LLi~n~~lr~~~~~~~~~ 283 (283)
T PF10323_consen 235 -FAQNFNTDPIFYLRAFYPILNGLLSFINPWMLLIFNKDLRKQVRRMLKC 283 (283)
T ss_pred -HHhcccchHHHHHHHHHHHHHHHHHhhhhHHhhhccHHHHHHHHHHcCC
Confidence 23222222 3344456777888889999999999999999999998864
No 14
>PF10321 7TM_GPCR_Srt: Serpentine type 7TM GPCR chemoreceptor Srt; InterPro: IPR019425 Chemoreception is mediated in Caenorhabditis elegans by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs) of proteins which are of the serpentine type []. Srt is a member of the Srg superfamily of chemoreceptors. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' [].
Probab=98.28 E-value=9.6e-06 Score=62.06 Aligned_cols=113 Identities=15% Similarity=0.183 Sum_probs=79.4
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHHHHHHHHHHHhhHH
Q psy3982 27 SYLIFYGFFVYFVPLILIIYAYYFIVRAVSVHEKQMREQAKKMNVATLRSGDQSGTSAEIKLAKVALMTISLWFLAWTPY 106 (185)
Q Consensus 27 ~y~~~~~~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~f~i~~~P~ 106 (185)
.|.-......-++-.+.+..+|+.+...++++.+... +.+..+.+++++....++.+.+.+.++-|
T Consensus 194 ~Y~N~~~~~nN~~~~~~t~~lY~~l~~~l~~k~~~~~--------------s~~~~k~~~qI~iQs~iIC~f~~i~a~iy 259 (313)
T PF10321_consen 194 LYSNIPHTFNNIIVVICTTFLYIYLCYYLIKKSKYST--------------SSQLSKAQRQIFIQSVIICFFHAIAAVIY 259 (313)
T ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccc--------------chhhhhhhHHHHHHHHHHHHHHHHHHHHH
Confidence 3555555555556578888899998888887654432 23344556677766666766666665544
Q ss_pred HHHHHHHhhhcccchHHHHHHHHHHHhhhhhccchhheecchHHHHHHHhhcc
Q psy3982 107 LVINFGGILEMVEINPLITIWGSVFAKANAVYNPIVYAISHPKFRQALDKKFP 159 (185)
Q Consensus 107 ~v~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~~~~~~ 159 (185)
....++ ..+.++..+..+.-.+++..+|+||...|+..|+++++++.
T Consensus 260 v~m~f~------~~p~~~i~~~~~~Wql~~g~~~iIYl~lNrtIR~~~~k~~~ 306 (313)
T PF10321_consen 260 VYMQFF------PPPPWLIIIGQISWQLSHGCPPIIYLTLNRTIRNSVLKMLG 306 (313)
T ss_pred Heeeee------cccHHHHHHHHHHHhccCCccceEEEEECHHHHHHHHHHHc
Confidence 333332 13466777777888899999999999999999999999884
No 15
>PF03402 V1R: Vomeronasal organ pheromone receptor family, V1R; InterPro: IPR004072 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The rhodopsin-like GPCRs themselves represent a widespread protein family that includes hormone, neurotransmitter and light receptors, all of which transduce extracellular signals through interaction with guanine nucleotide-binding (G) proteins. Although their activating ligands vary widely in structure and character, the amino acid sequences of the receptors are very similar and are believed to adopt a common structural framework comprising 7 transmembrane (TM) helices [, , ]. Pheromones have evolved in all animal phyla, to signal sex and dominance status, and are responsible for stereotypical social and sexual behaviour among members of the same species. In mammals, these chemical signals are believed to be detected primarily by the vomeronasal organ (VNO), a chemosensory organ located at the base of the nasal septum []. The VNO is present in most amphibia, reptiles and non-primate mammals but is absent in birds, adult catarrhine monkeys and apes []. An active role for the human VNO in the detection of pheromones is disputed; the VNO is clearly present in the foetus but appears to be atrophied or absent in adults. Three distinct families of putative pheromone receptors have been identified in the vomeronasal organ (V1Rs, V2Rs and V3Rs). All are G protein-coupled receptors but are only distantly related to the receptors of the main olfactory system, highlighting their different role []. The V1 receptors share between 50 and 90% sequence identity but have little similarity to other families of G protein-coupled receptors. They appear to be distantly related to the mammalian T2R bitter taste receptors and the rhodopsin-like GPCRs []. In rat, the family comprises 30-40 genes. These are expressed in the apical regions of the VNO, in neurons expressing Gi2. Coupling of the receptors to this protein mediates inositol trisphosphate signalling []. A number of human V1 receptor homologues have also been found. The majority of these human sequences are pseudogenes [] but an apparently functional receptor has been identified that is expressed in the human olfactory system [].; GO: 0016503 pheromone receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane
Probab=98.06 E-value=3.3e-06 Score=62.82 Aligned_cols=111 Identities=15% Similarity=0.121 Sum_probs=85.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHHHHHHHHHHHhhHHHHHHHHH
Q psy3982 34 FFVYFVPLILIIYAYYFIVRAVSVHEKQMREQAKKMNVATLRSGDQSGTSAEIKLAKVALMTISLWFLAWTPYLVINFGG 113 (185)
Q Consensus 34 ~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~f~i~~~P~~v~~~~~ 113 (185)
.+.-++.+.+|+++.+.++..+.||+++.+...... ...+...|.|+++++++++..|+.++....++.++.
T Consensus 153 ~~~D~~fvgLM~~aSgyMV~~L~RH~q~VqhIhs~~--------~s~r~SpE~RAtktILlLVs~FV~fY~l~si~~~~~ 224 (265)
T PF03402_consen 153 SFRDVLFVGLMIWASGYMVFLLYRHHQQVQHIHSSS--------LSPRSSPETRATKTILLLVSTFVSFYGLSSILFIYL 224 (265)
T ss_pred HhhHHHHHHHHHhhhhhheeeeeecccccccCcCCC--------CCCCCChhHHHhCeEeeHHHHHHHHHhHHHHHHHHH
Confidence 444577788999999999999999999987543211 223456799999999999999999999988776544
Q ss_pred hhhcccchHHHHHHHHHHHhhhhhccchhheecchHHHHHH
Q psy3982 114 ILEMVEINPLITIWGSVFAKANAVYNPIVYAISHPKFRQAL 154 (185)
Q Consensus 114 ~~~~~~~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~ 154 (185)
..... ..++.....+++..-|.+-|++....++.+.+.+
T Consensus 225 ~~~~~--~~~~~~~~~~ls~cfptisPfvLI~~d~~i~~~~ 263 (265)
T PF03402_consen 225 TSFKN--SPWLLNISVFLSSCFPTISPFVLISSDKRIIKFL 263 (265)
T ss_pred HHhcC--CcceeEHHHHHhHHhHhhChHHhhccCchHHHHh
Confidence 43332 2334457788999999999999999998887754
No 16
>PF10317 7TM_GPCR_Srd: Serpentine type 7TM GPCR chemoreceptor Srd; InterPro: IPR019421 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents the chemoreceptor Srd [].
Probab=98.00 E-value=0.00022 Score=54.25 Aligned_cols=77 Identities=16% Similarity=0.195 Sum_probs=64.1
Q ss_pred CCCchHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhhhcccchHHHHHHHHHHHhhhhhccchhheecchHHHHHHHh
Q psy3982 79 QSGTSAEIKLAKVALMTISLWFLAWTPYLVINFGGILEMVEINPLITIWGSVFAKANAVYNPIVYAISHPKFRQALDK 156 (185)
Q Consensus 79 ~~~~~~~~~~~~~~~~i~~~f~i~~~P~~v~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~~~ 156 (185)
++.++.+++..|.+..-+..=+++..|.....++...... ......++...+..+.+.+||+++...=+.||+.++|
T Consensus 216 ~~tk~~h~~lv~~Lt~Q~~lP~~~~~p~~~~~~~~~~~~~-~~~~~e~~~~~~~~~~~~~~P~itl~fv~PYR~~i~r 292 (292)
T PF10317_consen 216 ERTKSMHRQLVKGLTIQALLPLFFYIPGVIIYFLSQFTGY-EHPFLEYLIFMLASLPPLIDPLITLYFVRPYRKAILR 292 (292)
T ss_pred HHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHhcc-ccHHHHHHHHHHHHHHHHhchHhheeeeHhHHHHhcC
Confidence 3345678899999999999999999988888887776665 3466777777788899999999999999999999864
No 17
>PF10327 7TM_GPCR_Sri: Serpentine type 7TM GPCR chemoreceptor Sri; InterPro: IPR019429 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents Sri, which is part of the Str superfamily of chemoreceptors.
Probab=97.97 E-value=0.00018 Score=55.13 Aligned_cols=108 Identities=15% Similarity=0.209 Sum_probs=84.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHHHHHHHHHHHhhHHHHH
Q psy3982 30 IFYGFFVYFVPLILIIYAYYFIVRAVSVHEKQMREQAKKMNVATLRSGDQSGTSAEIKLAKVALMTISLWFLAWTPYLVI 109 (185)
Q Consensus 30 ~~~~~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~f~i~~~P~~v~ 109 (185)
.....++.++-.++..+++..+++.+++.+++.. ++..++++++.+.+++-..+-.+|.+|..+.
T Consensus 195 ~~~~~~~~~~~~~~~~~~~~~m~~~L~~~k~~iS---------------~~ty~kHk~av~SLi~Q~~~~~i~~~P~~~~ 259 (303)
T PF10327_consen 195 FILAFFGGFLCFVIFIFLTIDMFRMLKKLKKKIS---------------KQTYQKHKEAVRSLIAQFATSSICILPPFIF 259 (303)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC---------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 3334444556677788888999999988777663 3356888999999998888888999999887
Q ss_pred HHHHhhhcccchHHHHHHHHHHHhhhhhccchhheecchHHHHH
Q psy3982 110 NFGGILEMVEINPLITIWGSVFAKANAVYNPIVYAISHPKFRQA 153 (185)
Q Consensus 110 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~ 153 (185)
.+......+ ..+.+..........+|.+|-++-.+.++.||+-
T Consensus 260 ~~~~~~~~~-~~q~i~~~~~~~f~~HS~~n~ivli~t~ppYR~f 302 (303)
T PF10327_consen 260 VVVVIFEFE-DAQVISEICLAIFSSHSSVNMIVLIITTPPYRKF 302 (303)
T ss_pred hhhheecCC-CcHHHHHHHHHHHHHhhHhhheeeeEcCcchhhc
Confidence 776555443 4567778888889999999999999999999984
No 18
>PF10328 7TM_GPCR_Srx: Serpentine type 7TM GPCR chemoreceptor Srx; InterPro: IPR019430 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class x (Srx) from the Srg superfamily [, ]. Srg receptors contain seven hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures.
Probab=97.87 E-value=0.00064 Score=51.24 Aligned_cols=111 Identities=13% Similarity=0.111 Sum_probs=75.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHHHHHHHHHHHhhHHHHHHHHH
Q psy3982 34 FFVYFVPLILIIYAYYFIVRAVSVHEKQMREQAKKMNVATLRSGDQSGTSAEIKLAKVALMTISLWFLAWTPYLVINFGG 113 (185)
Q Consensus 34 ~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~f~i~~~P~~v~~~~~ 113 (185)
......-.+++..++..++.++++.+++.. ...+..++++++|.+.++..+.-...|++.++-+.+..-+
T Consensus 163 ~~~~~~~~~~~~~lni~t~ikl~~~~~~~~---------~~~~~s~~r~rke~~f~~Qs~~Q~~~~~i~~~~~~~~~~~- 232 (274)
T PF10328_consen 163 FYKNFILVIISNILNIITFIKLRKFRKKIS---------VSSSESKKRRRKEIRFFIQSFIQDLLYLIDLIFYFFIPPL- 232 (274)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhcc---------ccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-
Confidence 333444456777778888878876655541 0111144567789999999999999999999877766443
Q ss_pred hhhcccchHHHHHHHHHHHhhhhhccchhheecchHHHHHHHhhc
Q psy3982 114 ILEMVEINPLITIWGSVFAKANAVYNPIVYAISHPKFRQALDKKF 158 (185)
Q Consensus 114 ~~~~~~~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~~~~~ 158 (185)
.++ .......+.+.-.+..++|++|+...|+|+|+.+++..
T Consensus 233 --~~~--~~~~F~~~t~~w~~~h~~DG~i~l~fN~~~r~~~~~~~ 273 (274)
T PF10328_consen 233 --SSN--RWWQFFCTTFSWVLVHALDGLIMLIFNSEIRRKIRKKK 273 (274)
T ss_pred --ccc--cHHHHHHHHHHHHHHHHhcceeEeEEcHHHHHHHHhcc
Confidence 122 23333344444456678899999999999999988753
No 19
>PF05462 Dicty_CAR: Slime mold cyclic AMP receptor
Probab=97.82 E-value=0.00082 Score=51.35 Aligned_cols=65 Identities=12% Similarity=0.216 Sum_probs=52.5
Q ss_pred HHHHHHHHHHHhhHHHHHHHHHhhhcccchHHHHHHHHHHHhhhhhccchhheecchHHHHHHHhhc
Q psy3982 92 ALMTISLWFLAWTPYLVINFGGILEMVEINPLITIWGSVFAKANAVYNPIVYAISHPKFRQALDKKF 158 (185)
Q Consensus 92 ~~~i~~~f~i~~~P~~v~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~~~~~ 158 (185)
+..-.++|++||+|-.+..+....++. +..+..+...++.+.-.+|.++|++.++-.++.+...+
T Consensus 204 L~~Yp~ifiicw~fa~INRI~~~~~~~--~~~l~~Lh~~~s~lqGf~nsivy~~n~~~~~~~~~~~~ 268 (303)
T PF05462_consen 204 LVNYPLIFIICWIFATINRIYNFIGKN--PFWLSVLHVGFSPLQGFFNSIVYGYNNSLMWRYLGSKI 268 (303)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCC--chHHHHHHHHHHHHHHHHHHHHHHhCCHHHHHHHHHHH
Confidence 456889999999999999999887643 34556666677777889999999999999888877665
No 20
>PF11970 Git3_C: G protein-coupled glucose receptor regulating Gpa2 C-term; InterPro: IPR022596 This entry contains a functionally uncharacterised region belonging to the Git3 G-protein coupled receptor. Git3 is one of six proteins required for glucose-triggered adenylate cyclase activation, and is a G protein-coupled receptor responsible for the activation of adenylate cyclase through Gpa2 - heterotrimeric G protein alpha subunit, part of the glucose-detection pathway. Git3 contains seven predicted transmembrane domains, a third cytoplasmic loop and a cytoplasmic tail []. This family is the conserved C-terminal domain of the member proteins.
Probab=97.67 E-value=0.00025 Score=42.54 Aligned_cols=70 Identities=7% Similarity=0.196 Sum_probs=55.4
Q ss_pred chHHHHHHHHHHHHHHHHHHHhh-HHHHHHHHHhhhccc-chHHHHHHHHHHHhhhhhccchhheecchHHH
Q psy3982 82 TSAEIKLAKVALMTISLWFLAWT-PYLVINFGGILEMVE-INPLITIWGSVFAKANAVYNPIVYAISHPKFR 151 (185)
Q Consensus 82 ~~~~~~~~~~~~~i~~~f~i~~~-P~~v~~~~~~~~~~~-~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr 151 (185)
+++-+|.++.+++-=++|+++|+ |+..-..-......+ ...++..++.++..+|..+|-++|++..+..|
T Consensus 4 r~~i~r~lr~mfiYP~~Yi~lwlfP~~~~~~~~~~~~~~~p~~~l~~i~~~~~~~~G~VD~lvf~~~erpw~ 75 (76)
T PF11970_consen 4 RKRIRRQLRSMFIYPLVYIVLWLFPFAAHRMQYMYEIGHGPSFWLFCIAGFMQPSQGFVDCLVFTLRERPWR 75 (76)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCchHHHHHHHHHHHccCHHHhhheeeecccCC
Confidence 35567889999999999999999 887666555533332 55788889999999999999999999876543
No 21
>PF10326 7TM_GPCR_Str: Serpentine type 7TM GPCR chemoreceptor Str; InterPro: IPR019428 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class r (Str) from the Str superfamily [, ]. Almost a quarter (22.5%) of str and srj family genes and pseudogenes in C. elegans appear to have been newly formed by gene duplications since the species split [].
Probab=97.29 E-value=6.6e-05 Score=57.48 Aligned_cols=110 Identities=13% Similarity=0.203 Sum_probs=77.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHHH-HHHHHHHHhhHHH
Q psy3982 29 LIFYGFFVYFVPLILIIYAYYFIVRAVSVHEKQMREQAKKMNVATLRSGDQSGTSAEIKLAKVALM-TISLWFLAWTPYL 107 (185)
Q Consensus 29 ~~~~~~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-i~~~f~i~~~P~~ 107 (185)
..........+++.++++|-.++++.+++..+.. +.+.++.+++..+.+++ .++..++.++|..
T Consensus 197 ~~~~~~~im~~s~~iii~cg~~~~~~i~~~~~~~---------------S~~~~~lq~QLf~aLv~Qt~iP~i~~~~P~~ 261 (307)
T PF10326_consen 197 GMLIMCIIMSISFFIIIYCGIKIYKKIKKLSSIM---------------SSKTRKLQKQLFKALVIQTIIPFIFMYIPVF 261 (307)
T ss_pred eeeecchhHHhHHHHHHHHHhhhHHHHhcccccc---------------ChhhHHHHHHHHHHHHHHhhhhheeeecchh
Confidence 3444555567888999999999999986422221 23344666777777644 4566778889998
Q ss_pred HHHHHHhhhcccchHHHHHHHHHHHhhhhhccchhheecchHHHHHHH
Q psy3982 108 VINFGGILEMVEINPLITIWGSVFAKANAVYNPIVYAISHPKFRQALD 155 (185)
Q Consensus 108 v~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~~ 155 (185)
+.......+-+ ......+...+..+.|++||++-.+.-++||++++
T Consensus 262 ~~~~~p~~~i~--~~~~~~~~~~~~~~yP~iDpl~~i~~ik~yR~~i~ 307 (307)
T PF10326_consen 262 IVFILPFFGID--LGFFSNIISILISLYPAIDPLPVIFIIKDYRKAIK 307 (307)
T ss_pred heeeeeccCCC--CCccccHhhhhEEEEeehhhheeeEeeHHHHHhhC
Confidence 87776655433 13344556678889999999999999999999864
No 22
>PF10318 7TM_GPCR_Srh: Serpentine type 7TM GPCR chemoreceptor Srh; InterPro: IPR019422 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. Srh is part of the Str superfamily of chemoreceptors [].
Probab=97.12 E-value=0.009 Score=45.63 Aligned_cols=79 Identities=13% Similarity=0.214 Sum_probs=56.1
Q ss_pred CCchHHHHHHHHHHHHHH-HHHHHhhHHHHHHHHHhhhcccchHHHHHHHHHHHhhhhhccchhheecchHHHHHHHhhc
Q psy3982 80 SGTSAEIKLAKVALMTIS-LWFLAWTPYLVINFGGILEMVEINPLITIWGSVFAKANAVYNPIVYAISHPKFRQALDKKF 158 (185)
Q Consensus 80 ~~~~~~~~~~~~~~~i~~-~f~i~~~P~~v~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~~~~~ 158 (185)
+..+.++|..+.+.+-++ -+++.-.|.....+....+. ..+.+..+..++...+..+.-++..+.++.+|+.+++++
T Consensus 223 ~T~k~Qkkfl~~l~iQ~~ip~~~l~~P~~~~~~~~~~~~--~~q~~~n~~~~~~~~HG~~sti~mi~~~~pYR~~~~~~~ 300 (302)
T PF10318_consen 223 KTRKMQKKFLIALIIQVLIPFIFLFIPLIYFIISIIFGY--YNQALNNISFIIISLHGIASTIVMILVHKPYRKFLLSLF 300 (302)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhccc--cccccchHHHHHHHhccHHHHHHHhhccHHHHHHHHHHh
Confidence 345667777776655444 44455557766655443332 335566777888899999999999999999999999988
Q ss_pred cc
Q psy3982 159 PS 160 (185)
Q Consensus 159 ~~ 160 (185)
+|
T Consensus 301 ~~ 302 (302)
T PF10318_consen 301 RC 302 (302)
T ss_pred cC
Confidence 54
No 23
>KOG4193|consensus
Probab=96.93 E-value=0.019 Score=48.06 Aligned_cols=67 Identities=19% Similarity=0.203 Sum_probs=44.9
Q ss_pred HHHHHHHHHHHHHHHHhhHHHHHHHHHhhhcccchHHHHHHHHHHHhhhhhccchhheecchHHHHHHHhhcc
Q psy3982 87 KLAKVALMTISLWFLAWTPYLVINFGGILEMVEINPLITIWGSVFAKANAVYNPIVYAISHPKFRQALDKKFP 159 (185)
Q Consensus 87 ~~~~~~~~i~~~f~i~~~P~~v~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~~~~~~ 159 (185)
+..+..+++..++.+.|. +.+.....+ ....+.+++.++-.+.-++=.++|++++++.|++.++.++
T Consensus 514 ~~~~~~l~L~~lLGlTW~----fgi~s~~~~--~~~v~~YlFti~NalQG~fIFi~~cll~~kvr~~~~k~~~ 580 (610)
T KOG4193|consen 514 SLIRSALALLFLLGLTWI----FGIFSWLPG--TSVVFAYLFTIFNALQGVFIFIFHCLLRKKVRKEYRKWLC 580 (610)
T ss_pred HHHHHHHHHHHHHHHHHH----HHHHHHhcc--cchHHHHHHHHHHHhhhhHhhHhhhhhhHHHHHHHHHHhc
Confidence 667788888888888884 333332221 2344445555555555555678899999999999999886
No 24
>PF10319 7TM_GPCR_Srj: Serpentine type 7TM GPCR chemoreceptor Srj; InterPro: IPR019423 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class j (Srj) from the Str superfamily [, ]. The Srj family is designated as the out-group based on its location in preliminary phylogenetic analyses of the entire superfamily [].
Probab=96.64 E-value=0.04 Score=42.07 Aligned_cols=74 Identities=14% Similarity=0.294 Sum_probs=56.0
Q ss_pred CCCchHHHHHHHHHHHH-HHHHHHHhhHHHHHHHHHhhhcccchHHHHHHHHHHHhhhhhccchhheecchHHHHH
Q psy3982 79 QSGTSAEIKLAKVALMT-ISLWFLAWTPYLVINFGGILEMVEINPLITIWGSVFAKANAVYNPIVYAISHPKFRQA 153 (185)
Q Consensus 79 ~~~~~~~~~~~~~~~~i-~~~f~i~~~P~~v~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~ 153 (185)
++..+.+++..|.+..- +..-++++.|-.+......++-+ ...++.++......+-|.+||+.-.++-+.||+.
T Consensus 235 ~~T~~lq~qL~~AL~vQT~IPi~vsf~Pc~~~wy~pif~i~-~~~~~n~~~~iAls~FPf~DPlAii~~lP~~R~r 309 (310)
T PF10319_consen 235 KKTKRLQRQLFKALIVQTVIPICVSFSPCVLSWYGPIFGID-LGRWNNYFSVIALSAFPFLDPLAIILCLPAFRNR 309 (310)
T ss_pred HhHHHHHHHHHHHHHHHHHhHHHHhhccHHHHHhHHHHcCC-hhHHHHHHHHHHHHHccccCchHhheecHHhhcc
Confidence 33456667777776554 44568889999888765555533 5677888888888899999999999999999975
No 25
>PF02101 Ocular_alb: Ocular albinism type 1 protein; InterPro: IPR001414 Ocular albinism type 1 (OA1) is an X-linked disorder characterised by severe impairment of visual acuity, retinal hypopigmentation and the presence of macromelanosomes. A novel transcript from the OA1 critical region is expressed in high levels in RNA samples from retina and from melanoma and encodes a potential integral membrane protein []. This protein is of unknown function but is known to bind heterotrimeric G proteins.; GO: 0016020 membrane
Probab=96.59 E-value=0.023 Score=44.33 Aligned_cols=24 Identities=8% Similarity=0.310 Sum_probs=19.3
Q ss_pred HHHHHHHHHHHHhhHHHHHHHHHh
Q psy3982 91 VALMTISLWFLAWTPYLVINFGGI 114 (185)
Q Consensus 91 ~~~~i~~~f~i~~~P~~v~~~~~~ 114 (185)
-.+.|.++|++||+|..+-.++.-
T Consensus 245 kFf~I~lVF~iCWlpNIINg~LL~ 268 (405)
T PF02101_consen 245 KFFKIMLVFYICWLPNIINGSLLF 268 (405)
T ss_pred HHHHHHHHHHHHhhhhhHhHHHHH
Confidence 356789999999999998776543
No 26
>PF03125 Sre: C. elegans Sre G protein-coupled chemoreceptor; InterPro: IPR004151 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class e (Sre) from the Sra superfamily []. ; GO: 0004888 transmembrane signaling receptor activity, 0007606 sensory perception of chemical stimulus, 0016021 integral to membrane
Probab=96.20 E-value=0.14 Score=40.45 Aligned_cols=44 Identities=18% Similarity=0.344 Sum_probs=34.7
Q ss_pred chHHHHHHHHHHHhhhhhccchhheecchHHHHHHHhhcccccc
Q psy3982 120 INPLITIWGSVFAKANAVYNPIVYAISHPKFRQALDKKFPSLVC 163 (185)
Q Consensus 120 ~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~~~~~~~~~~ 163 (185)
.......+......+++.+=|.+-....+..|+++++.+....+
T Consensus 281 ~~~~~~~~~e~~i~l~~~~i~~~~i~s~~~wrk~f~~~~~~~~~ 324 (365)
T PF03125_consen 281 YRTILNHIFENCIFLNPIFICPVIIFSVPSWRKEFKRSFPKIRC 324 (365)
T ss_pred HHHHHHHHHHHHHHHhhHHHHHHHHHcCHHHHHHHHHhhhhhhc
Confidence 34566677778888888888888888999999999988765443
No 27
>PF02118 Srg: Srg family chemoreceptor; InterPro: IPR000609 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class g (Srg) from the Srg superfamily [, ]. Srg receptors contain seven hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures. ; GO: 0004888 transmembrane signaling receptor activity, 0007606 sensory perception of chemical stimulus, 0016020 membrane
Probab=96.17 E-value=0.21 Score=37.45 Aligned_cols=98 Identities=13% Similarity=0.183 Sum_probs=52.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhhh
Q psy3982 37 YFVPLILIIYAYYFIVRAVSVHEKQMREQAKKMNVATLRSGDQSGTSAEIKLAKVALMTISLWFLAWTPYLVINFGGILE 116 (185)
Q Consensus 37 ~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~f~i~~~P~~v~~~~~~~~ 116 (185)
+++-.++.+.++....+++++.+++ ..+.|++..+..+.....+.+.-+...+........
T Consensus 177 ~i~~~ii~i~~~~~~~~~l~~~~~~-------------------~~~~er~L~~is~~~~~~~~~~~~~~~~~~~~~~~~ 237 (275)
T PF02118_consen 177 FIIIIIITIITNIITYRRLRKLSKR-------------------IKSVERNLTIISFIISFVQLLIAIWQIINSFAFFFD 237 (275)
T ss_pred HHHHHHHHHHHHHHHHHHHHhhhhh-------------------hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 3344455566666666666654322 224567777776766666665554444422211111
Q ss_pred cccchHHHHHHHHHHHhhhhhccchhheecchHHHHH
Q psy3982 117 MVEINPLITIWGSVFAKANAVYNPIVYAISHPKFRQA 153 (185)
Q Consensus 117 ~~~~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~ 153 (185)
..+.......+..+..=+-....|++-.+.+++.|+.
T Consensus 238 ~~~~~~~~~~~~~f~sD~ltl~~P~iLl~fs~~vR~~ 274 (275)
T PF02118_consen 238 FLPISSFLYYLLPFASDLLTLSQPYILLIFSKNVRRQ 274 (275)
T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHheECHhhhcc
Confidence 1111222333333333333455699999999999985
No 28
>KOG4564|consensus
Probab=95.59 E-value=0.086 Score=42.65 Aligned_cols=100 Identities=15% Similarity=0.229 Sum_probs=61.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhhhcc
Q psy3982 39 VPLILIIYAYYFIVRAVSVHEKQMREQAKKMNVATLRSGDQSGTSAEIKLAKVALMTISLWFLAWTPYLVINFGGILEMV 118 (185)
Q Consensus 39 ~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~f~i~~~P~~v~~~~~~~~~~ 118 (185)
+.+++-++..+.|.+.+-++-+.. +.......+|++|..+.++=.|.+-++. ..+..+
T Consensus 318 ~ai~vNf~flinIvrILv~KLr~~---------------~~~~~~~y~K~vKaTLvLIPLfGI~~il-------f~~~P~ 375 (473)
T KOG4564|consen 318 LAILVNFIFLINIVRILVTKLRAS---------------NASETDQYRKLVKATLVLIPLFGIHYIL-------FAFRPD 375 (473)
T ss_pred HHHHHHHHHHHHHHHHHHHHhcCc---------------cccchHHHHHHHHHHHHHHHHcCCeeEE-------EEecCc
Confidence 334444555566666665432221 2333444788888888877666644322 112221
Q ss_pred c--chHHHHHHHHHHHhhhhhccchhheecchHHHHHHHhhccc
Q psy3982 119 E--INPLITIWGSVFAKANAVYNPIVYAISHPKFRQALDKKFPS 160 (185)
Q Consensus 119 ~--~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~~~~~~~ 160 (185)
+ .......+..+|....-.+=-++|++.|++.|.++++.+.+
T Consensus 376 ~~~~~~v~~~~~~~L~SfQGf~VAvlYCFlN~EVq~elrr~W~r 419 (473)
T KOG4564|consen 376 EDTLREVYLYFELFLGSFQGFFVAVLYCFLNGEVQAELRRKWSR 419 (473)
T ss_pred hHHHHHHHHHHHHHHHhccchheehheeecCHHHHHHHHHHHHh
Confidence 1 23444555666777777888999999999999999998754
No 29
>PF04789 DUF621: Protein of unknown function (DUF621); InterPro: IPR006874 This is a conserved region found in uncharacterised proteins from Caenorhabditis elegans, and is noted to have possible G-protein-coupled receptor-like activity.
Probab=92.03 E-value=4.2 Score=30.77 Aligned_cols=112 Identities=15% Similarity=0.187 Sum_probs=67.9
Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHHHHHHHHHHHhh
Q psy3982 25 HRSYLIFYGFFVYFVPLILIIYAYYFIVRAVSVHEKQMREQAKKMNVATLRSGDQSGTSAEIKLAKVALMTISLWFLAWT 104 (185)
Q Consensus 25 ~~~y~~~~~~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~f~i~~~ 104 (185)
.....+.-.++.+.+|.++. ..|+.+..++-..+ ....++.|.+++|.-..+-+.|- .
T Consensus 184 P~~IqiiE~ilQ~~IPi~Il-~iYiAIIiKI~~MK------------------kssLnK~Ei~ILKQAifIFvlFQ---~ 241 (305)
T PF04789_consen 184 PRGIQIIEIILQFGIPIFIL-VIYIAIIIKIIKMK------------------KSSLNKNEITILKQAIFIFVLFQ---A 241 (305)
T ss_pred CCchhHHHHHHHHhHHHHHH-HHHHHHHHHHHHHh------------------hccCCchhHHHHHHHHHHHHHHH---H
Confidence 34556777888899996554 45555544443211 12244667788777665544444 4
Q ss_pred HHHHHHHHHhhhccc-chHHHHHHHHHHHhhhhhccchhheecchHHHHHHHhhc
Q psy3982 105 PYLVINFGGILEMVE-INPLITIWGSVFAKANAVYNPIVYAISHPKFRQALDKKF 158 (185)
Q Consensus 105 P~~v~~~~~~~~~~~-~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~~~~~ 158 (185)
+..+..+-....-+. .-..+..+...+-..--+.-|....+.++|.||-+-...
T Consensus 242 ss~VFl~~qt~~~~~~tAFlIKR~iNT~ei~agaatP~ffffTs~eirk~~s~kv 296 (305)
T PF04789_consen 242 SSCVFLLCQTFKFNVATAFLIKRIINTMEIFAGAATPCFFFFTSKEIRKLVSSKV 296 (305)
T ss_pred HHHHHHhheeeeecchHHHHHHHHHHHHHHHhhccCCeEEEEehHHHHHHHHhhc
Confidence 443333322222111 334566666667777788899999999999999766544
No 30
>PF10292 7TM_GPCR_Srab: Serpentine type 7TM GPCR receptor class ab chemoreceptor; InterPro: IPR019408 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. Srab is part of the Sra superfamily of chemoreceptors. The expression pattern of the srab genes is biologically intriguing. Of the six promoters successfully expressed in transgenic organisms, one was exclusively expressed in the tail phasmid neurons, two were exclusively expressed in a head amphid neuron, and two were expressed both in the head and tail neurons as well as a limited number of other cells [].
Probab=85.64 E-value=15 Score=28.50 Aligned_cols=78 Identities=10% Similarity=-0.028 Sum_probs=49.3
Q ss_pred CCchHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhhhccc----chHHHHHHHHHHHhhhhhccchhheecchHHHHHHH
Q psy3982 80 SGTSAEIKLAKVALMTISLWFLAWTPYLVINFGGILEMVE----INPLITIWGSVFAKANAVYNPIVYAISHPKFRQALD 155 (185)
Q Consensus 80 ~~~~~~~~~~~~~~~i~~~f~i~~~P~~v~~~~~~~~~~~----~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~~ 155 (185)
-+..+..|+.+++..++....+.-+.+.+..+.......+ ....+.. .....-..+.+=|++.....++.|++.+
T Consensus 223 YQl~ENl~slr~L~p~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~y~al~e-~~~~~P~Y~ii~~~~~~~~~k~~r~~~~ 301 (324)
T PF10292_consen 223 YQLEENLRSLRLLKPFIILSSIFIFFYIFASIFLRLFNPSMSKPNYFALAE-LNHIFPLYSIISPLILYRKIKKIRKKRK 301 (324)
T ss_pred HhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCChhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3445567888888777777777666666665554433332 2222222 2234455778889998888998888877
Q ss_pred hhc
Q psy3982 156 KKF 158 (185)
Q Consensus 156 ~~~ 158 (185)
+.+
T Consensus 302 ~~l 304 (324)
T PF10292_consen 302 KRL 304 (324)
T ss_pred HHH
Confidence 655
No 31
>PF02076 STE3: Pheromone A receptor; InterPro: IPR001499 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). Little is known about the structure and function of the mating factor receptors, STE2 and STE3. It is believed, however, that they are integral membrane proteins that may be involved in the response to mating factors on the cell membrane [, , ]. The amino acid sequences of both receptors contain high proportions of hydrophobic residues grouped into 7 domains, in a manner reminiscent of the rhodopsins and other receptors believed to interact with G-proteins. However, while a similar 3D framework has been proposed to account for this, there is no significant sequence similarity either between STE2 and STE3, or between these and the rhodopsin-type family: the receptors thus bear their own unique '7TM' signatures. The STE3 gene of Saccharomyces cerevisiae (Baker's yeast) is the cell-surface receptor that binds the 13-residue lipopeptide a-factor. Several related fungal pheromone receptor sequences are known: these include pheromone B alpha 1 and B alpha 3, and pheromone B beta 1 receptors from Schizophyllum commune; pheromone receptor 1 from Ustilago hordei; and pheromone receptors 1 and 2 from Ustilago maydis. Members of the family share about 20% sequence identity.; GO: 0004932 mating-type factor pheromone receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane
Probab=82.02 E-value=21 Score=27.32 Aligned_cols=25 Identities=12% Similarity=0.204 Sum_probs=20.1
Q ss_pred HHHHHHHHHHHHHHHHhhHHHHHHHH
Q psy3982 87 KLAKVALMTISLWFLAWTPYLVINFG 112 (185)
Q Consensus 87 ~~~~~~~~i~~~f~i~~~P~~v~~~~ 112 (185)
|..| ++++.++.+++.+|..+..++
T Consensus 196 Rf~R-L~~la~~~i~~~~P~~i~~~v 220 (283)
T PF02076_consen 196 RFFR-LLALALVIILIYLPLSIYYLV 220 (283)
T ss_pred HHHH-HHHHHHHHHHHHHHHHHHHHH
Confidence 4444 677888889999999998888
No 32
>PF10322 7TM_GPCR_Sru: Serpentine type 7TM GPCR chemoreceptor Sru; InterPro: IPR003839 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class u (Sru) from the Srg superfamily [].
Probab=81.48 E-value=13 Score=28.78 Aligned_cols=104 Identities=13% Similarity=0.184 Sum_probs=54.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHHHHHHHHHHHhhHHHHHHHHH
Q psy3982 34 FFVYFVPLILIIYAYYFIVRAVSVHEKQMREQAKKMNVATLRSGDQSGTSAEIKLAKVALMTISLWFLAWTPYLVINFGG 113 (185)
Q Consensus 34 ~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~f~i~~~P~~v~~~~~ 113 (185)
+.....-++.++.++..++.++|+.+.+... .+.+.+..+.|+..+.+++++++.++.- .+..+..
T Consensus 201 l~~s~~~~~~iii~N~lm~~Klr~~k~~l~~----------~~~s~~~~KAE~SLT~Tmi~mliP~i~n----~i~~i~~ 266 (307)
T PF10322_consen 201 LFFSIFWMISIIILNILMFFKLRKLKSSLSS----------QRRSKQSRKAERSLTLTMISMLIPYITN----GIISISF 266 (307)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhcccc----------hhhcccchhhheeHHHHHHHHHHHHHHH----HHHHHHH
Confidence 3334444577777888888888876521110 1113445667777777777777666532 2222222
Q ss_pred hhhcccchHHHHHHHHHHHhhhhhccchhheecchHHHH
Q psy3982 114 ILEMVEINPLITIWGSVFAKANAVYNPIVYAISHPKFRQ 152 (185)
Q Consensus 114 ~~~~~~~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~ 152 (185)
.... .....+..+-.+..=+..++=|.++.+.++=|||
T Consensus 267 ~~~~-~~~~y~~~iRp~~~D~~~~iv~w~FY~THP~Fkk 304 (307)
T PF10322_consen 267 LFFP-SYSSYILLIRPFGNDCRTCIVPWYFYLTHPIFKK 304 (307)
T ss_pred hcCc-hHHHHHHHHhhhhhccceeEEEEEeeecChhhcC
Confidence 2211 1112222222222223344458888888887776
No 33
>PF13853 7tm_4: Olfactory receptor
Probab=71.89 E-value=0.0045 Score=42.07 Aligned_cols=69 Identities=17% Similarity=0.256 Sum_probs=47.8
Q ss_pred cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHHHHHHHHHHH
Q psy3982 23 WHHRSYLIFYGFFVYFVPLILIIYAYYFIVRAVSVHEKQMREQAKKMNVATLRSGDQSGTSAEIKLAKVALMTISLWFLA 102 (185)
Q Consensus 23 ~~~~~y~~~~~~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~f~i~ 102 (185)
..+..+......+..+.|+.+++++|++|...+.|. ...+.++|++.+......+..+.
T Consensus 52 ~~~~~~~~~~~~~~~~~~~~~Il~SY~~Il~avlki---------------------~S~~~r~KAfsTC~sHL~vv~lf 110 (144)
T PF13853_consen 52 SINEIVGFVVAIFILLGPLLLILFSYIRILRAVLKI---------------------PSAEGRSKAFSTCSSHLTVVSLF 110 (144)
T ss_pred hhhheeeecccceeEEEEeeccccceeEEEehhhcc---------------------cccccccccccccccCceEEEEe
Confidence 334455566666677899999999999999988742 22344568888877777777776
Q ss_pred hhHHHHHHHH
Q psy3982 103 WTPYLVINFG 112 (185)
Q Consensus 103 ~~P~~v~~~~ 112 (185)
+.|..+..+.
T Consensus 111 y~~~~~~y~~ 120 (144)
T PF13853_consen 111 YGTVIFMYLS 120 (144)
T ss_pred eeeeEEEEec
Confidence 6666554443
No 34
>KOG0721|consensus
Probab=70.39 E-value=40 Score=24.70 Aligned_cols=16 Identities=13% Similarity=0.115 Sum_probs=11.0
Q ss_pred eecchHHHHHHHhhcc
Q psy3982 144 AISHPKFRQALDKKFP 159 (185)
Q Consensus 144 ~~~~~~fr~~~~~~~~ 159 (185)
+-..+++|++.+++..
T Consensus 110 gas~~eIKkaYR~LSi 125 (230)
T KOG0721|consen 110 GASEKEIKKAYRRLSI 125 (230)
T ss_pred CCCHHHHHHHHHHhhh
Confidence 4456778888777653
No 35
>KOG2575|consensus
Probab=66.27 E-value=11 Score=30.33 Aligned_cols=32 Identities=19% Similarity=0.472 Sum_probs=25.6
Q ss_pred CCCchHHHHHHHHHHHHHHHHHHHhhHHHHHH
Q psy3982 79 QSGTSAEIKLAKVALMTISLWFLAWTPYLVIN 110 (185)
Q Consensus 79 ~~~~~~~~~~~~~~~~i~~~f~i~~~P~~v~~ 110 (185)
++..+.-.+.++.-+++++.|+++|+|+....
T Consensus 248 ~k~~~~f~ri~~ia~~Vv~TF~iiw~P~~~~~ 279 (510)
T KOG2575|consen 248 PKLFNSFARIIKIALAVVGTFVIIWLPFLLSG 279 (510)
T ss_pred ccchHHHHHHHHHHHHHHHHHHHHHHHHHhcc
Confidence 33444567899999999999999999997644
No 36
>PF11014 DUF2852: Protein of unknown function (DUF2852); InterPro: IPR021273 This bacterial family of proteins has no known function.
Probab=64.50 E-value=13 Score=24.10 Aligned_cols=26 Identities=8% Similarity=0.082 Sum_probs=17.1
Q ss_pred HHHHHHHHHHHHhhHHHHHHHHHhhhc
Q psy3982 91 VALMTISLWFLAWTPYLVINFGGILEM 117 (185)
Q Consensus 91 ~~~~i~~~f~i~~~P~~v~~~~~~~~~ 117 (185)
.+.++|+-|+++| |..+..+.+.+.+
T Consensus 11 ~Ia~mVlGFi~fW-PlGla~Lay~iw~ 36 (115)
T PF11014_consen 11 WIAAMVLGFIVFW-PLGLALLAYMIWG 36 (115)
T ss_pred HHHHHHHHHHHHH-HHHHHHHHHHHHH
Confidence 3667888888866 8776666554433
No 37
>PF05478 Prominin: Prominin; InterPro: IPR008795 The prominins are an emerging family of proteins that, among the multispan membrane proteins, display a novel topology. Mouse and Homo sapiens prominin and (Mus musculus) prominin-like 1 (PROML1) are predicted to contain five membrane spanning domains, with an N-terminal domain exposed to the extracellular space followed by four, alternating small cytoplasmic and large extracellular, loops and a cytoplasmic C-terminal domain []. The exact function of prominin is unknown although in humans defects in PROM1, the gene coding for prominin, cause retinal degeneration [].; GO: 0016021 integral to membrane
Probab=62.29 E-value=38 Score=30.06 Aligned_cols=33 Identities=9% Similarity=-0.018 Sum_probs=24.0
Q ss_pred HHHHHHHHHHHHHHHHHhhHHHHHHHHHhhhcc
Q psy3982 86 IKLAKVALMTISLWFLAWTPYLVINFGGILEMV 118 (185)
Q Consensus 86 ~~~~~~~~~i~~~f~i~~~P~~v~~~~~~~~~~ 118 (185)
.-..-+++.+.+.|+++|+-..+..+....+..
T Consensus 457 tGg~~Lm~gv~~~Flf~~~l~l~~~~~Fl~G~~ 489 (806)
T PF05478_consen 457 TGGNFLMAGVGLSFLFSWFLMLLVLFYFLVGGN 489 (806)
T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 344557788899999999888777776665543
No 38
>COG1862 YajC Preprotein translocase subunit YajC [Intracellular trafficking and secretion]
Probab=58.89 E-value=30 Score=21.78 Aligned_cols=30 Identities=17% Similarity=0.329 Sum_probs=19.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy3982 39 VPLILIIYAYYFIVRAVSVHEKQMREQAKK 68 (185)
Q Consensus 39 ~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~ 68 (185)
.+++++.+.|..+.+--+++.+.++.+.+.
T Consensus 14 ~~vl~~~ifyFli~RPQrKr~K~~~~ml~s 43 (97)
T COG1862 14 PLVLIFAIFYFLIIRPQRKRMKEHQELLNS 43 (97)
T ss_pred HHHHHHHHHHHhhcCHHHHHHHHHHHHHHh
Confidence 334566777777777777766666655443
No 39
>PF15086 UPF0542: Uncharacterised protein family UPF0542
Probab=58.23 E-value=37 Score=20.06 Aligned_cols=37 Identities=14% Similarity=0.355 Sum_probs=22.9
Q ss_pred hHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH
Q psy3982 27 SYLIFYGFFVYFVPLI-LIIYAYYFIVRAVSVHEKQMR 63 (185)
Q Consensus 27 ~y~~~~~~~~~~~p~~-i~~~~y~~i~~~l~~~~~~~~ 63 (185)
.|..+..++..+.|+. +..++..++.+.+.++.++.+
T Consensus 19 P~~Fl~~vll~LtPlfiisa~lSwkLaK~ie~~ere~K 56 (74)
T PF15086_consen 19 PYEFLTTVLLILTPLFIISAVLSWKLAKAIEKEEREKK 56 (74)
T ss_pred hHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3455555666667764 446677777777776655544
No 40
>KOG3500|consensus
Probab=55.49 E-value=22 Score=21.49 Aligned_cols=26 Identities=15% Similarity=0.399 Sum_probs=15.4
Q ss_pred HHHHHHHHHHHHHHHHHHhhHHHHHHHH
Q psy3982 85 EIKLAKVALMTISLWFLAWTPYLVINFG 112 (185)
Q Consensus 85 ~~~~~~~~~~i~~~f~i~~~P~~v~~~~ 112 (185)
++...+.++.++. ++||+-..+..+.
T Consensus 30 ~~~li~~~lvlTA--VCCwlfWli~yla 55 (84)
T KOG3500|consen 30 NRGLIRCMLVLTA--VCCWLFWLIVYLA 55 (84)
T ss_pred CcChhHHHHHHHH--HHHHHHHHHHHHH
Confidence 4445555544444 8899776666553
No 41
>PRK04989 psbM photosystem II reaction center protein M; Provisional
Probab=54.04 E-value=27 Score=17.41 Aligned_cols=22 Identities=18% Similarity=0.366 Sum_probs=16.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH
Q psy3982 29 LIFYGFFVYFVPLILIIYAYYF 50 (185)
Q Consensus 29 ~~~~~~~~~~~p~~i~~~~y~~ 50 (185)
......++.++|.+.++..|++
T Consensus 7 gfiAt~Lfi~iPt~FLlilYvq 28 (35)
T PRK04989 7 GFVASLLFVLVPTVFLIILYIQ 28 (35)
T ss_pred HHHHHHHHHHHHHHHHHHHhee
Confidence 4455567778899888888874
No 42
>PF02117 7TM_GPCR_Sra: Serpentine type 7TM GPCR chemoreceptor Sra; InterPro: IPR000344 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class a (Sra) from the Sra superfamily []. Sra receptors contain 6-7 hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures.; GO: 0004888 transmembrane signaling receptor activity, 0007606 sensory perception of chemical stimulus, 0016021 integral to membrane
Probab=52.59 E-value=1.1e+02 Score=23.88 Aligned_cols=34 Identities=15% Similarity=-0.047 Sum_probs=23.1
Q ss_pred chHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhh
Q psy3982 82 TSAEIKLAKVALMTISLWFLAWTPYLVINFGGIL 115 (185)
Q Consensus 82 ~~~~~~~~~~~~~i~~~f~i~~~P~~v~~~~~~~ 115 (185)
.++....++.+..+++.-++|..-|.+...+...
T Consensus 228 ~~Eni~ST~aI~~i~~~Q~i~~~iys~~v~~l~~ 261 (328)
T PF02117_consen 228 KRENINSTKAICIISISQFICMFIYSFGVFILRN 261 (328)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3455677888888888888887777655544333
No 43
>COG4736 CcoQ Cbb3-type cytochrome oxidase, subunit 3 [Posttranslational modification, protein turnover, chaperones]
Probab=52.17 E-value=43 Score=19.05 Aligned_cols=39 Identities=15% Similarity=0.126 Sum_probs=23.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3982 29 LIFYGFFVYFVPLILIIYAYYFIVRAVSVHEKQMREQAK 67 (185)
Q Consensus 29 ~~~~~~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~ 67 (185)
.+.....-.+..+.+.+++++.|+...|.++|...++..
T Consensus 4 e~~~~~a~a~~t~~~~l~fiavi~~ayr~~~K~~~d~aa 42 (60)
T COG4736 4 EMMRGFADAWGTIAFTLFFIAVIYFAYRPGKKGEFDEAA 42 (60)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhHHHHh
Confidence 334444444555566667777777788877666655543
No 44
>TIGR03038 PS_II_psbM photosystem II reaction center protein PsbM. Members of this protein family are the photosystem II reaction center M protein, product of the psbM gene, in Cyanobacteria and their derived organelles in plants. This model resembles Pfam model pfam05151 but has cutoffs set to avoid false-positive matches to similar (not necessarily homologous) sequences in species that are not photosynthetic.
Probab=50.17 E-value=30 Score=16.96 Aligned_cols=22 Identities=14% Similarity=0.345 Sum_probs=16.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH
Q psy3982 29 LIFYGFFVYFVPLILIIYAYYF 50 (185)
Q Consensus 29 ~~~~~~~~~~~p~~i~~~~y~~ 50 (185)
......++.++|...++..|++
T Consensus 7 ~fiAt~Lfi~iPt~FLiilYvq 28 (33)
T TIGR03038 7 GFIATLLFILVPTVFLLILYIQ 28 (33)
T ss_pred HHHHHHHHHHHHHHHHHHHhee
Confidence 3445566778899888888874
No 45
>PF06305 DUF1049: Protein of unknown function (DUF1049); InterPro: IPR010445 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=50.16 E-value=47 Score=18.88 Aligned_cols=35 Identities=9% Similarity=0.142 Sum_probs=17.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3982 29 LIFYGFFVYFVPLILIIYAYYFIVRAVSVHEKQMR 63 (185)
Q Consensus 29 ~~~~~~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~ 63 (185)
....+++++++-.++....+.+.-+.+++.+++..
T Consensus 24 il~~f~~G~llg~l~~~~~~~~~r~~~~~~~k~l~ 58 (68)
T PF06305_consen 24 ILIAFLLGALLGWLLSLPSRLRLRRRIRRLRKELK 58 (68)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444455555555555555555555555544444
No 46
>PHA02650 hypothetical protein; Provisional
Probab=48.62 E-value=54 Score=19.70 Aligned_cols=28 Identities=7% Similarity=-0.045 Sum_probs=17.1
Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3982 26 RSYLIFYGFFVYFVPLILIIYAYYFIVR 53 (185)
Q Consensus 26 ~~y~~~~~~~~~~~p~~i~~~~y~~i~~ 53 (185)
..+..+++++.+++-+++.++.|.+..+
T Consensus 47 ~~~~~~ii~i~~v~i~~l~~flYLK~~~ 74 (81)
T PHA02650 47 FNGQNFIFLIFSLIIVALFSFFVFKGYT 74 (81)
T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 3455555556566666777777776644
No 47
>PRK06531 yajC preprotein translocase subunit YajC; Validated
Probab=48.14 E-value=42 Score=21.83 Aligned_cols=7 Identities=0% Similarity=0.614 Sum_probs=2.8
Q ss_pred HHHHHHH
Q psy3982 39 VPLILII 45 (185)
Q Consensus 39 ~p~~i~~ 45 (185)
+|+++++
T Consensus 7 l~~vv~~ 13 (113)
T PRK06531 7 IMFVVML 13 (113)
T ss_pred HHHHHHH
Confidence 3444443
No 48
>PF09889 DUF2116: Uncharacterized protein containing a Zn-ribbon (DUF2116); InterPro: IPR019216 This entry contains various hypothetical prokaryotic proteins whose functions are unknown. They contain a conserved zinc ribbon motif in the N-terminal part and a predicted transmembrane segment in the C-terminal part.
Probab=47.63 E-value=27 Score=19.81 Aligned_cols=20 Identities=15% Similarity=0.255 Sum_probs=10.9
Q ss_pred HHHHHHHHHHHHHHHHHHHh
Q psy3982 84 AEIKLAKVALMTISLWFLAW 103 (185)
Q Consensus 84 ~~~~~~~~~~~i~~~f~i~~ 103 (185)
+.+|.-.++..+++++++-|
T Consensus 34 ~~~~~~~i~~~~~i~~l~v~ 53 (59)
T PF09889_consen 34 RMRKTQYIFFGIFILFLAVW 53 (59)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 34455555666665555544
No 49
>KOG3827|consensus
Probab=47.60 E-value=12 Score=29.76 Aligned_cols=63 Identities=13% Similarity=0.163 Sum_probs=40.3
Q ss_pred HHHHHHHHHHHHHHHHHhhHHHHHHHHHhhhcccc--hHHHHHHHHHHHhhhhhccchhheecch
Q psy3982 86 IKLAKVALMTISLWFLAWTPYLVINFGGILEMVEI--NPLITIWGSVFAKANAVYNPIVYAISHP 148 (185)
Q Consensus 86 ~~~~~~~~~i~~~f~i~~~P~~v~~~~~~~~~~~~--~~~~~~~~~~l~~~~~~~nPiiY~~~~~ 148 (185)
.|--.++++-+++|++.|+-+.++..+-++.+++. ...-.....+...+++...-++|.+-.+
T Consensus 60 ~kWR~~lliF~~sf~~SWl~Fg~iwwlIA~~hGDL~~~~~~~~~tpCV~nV~sf~sAFLFSiETQ 124 (400)
T KOG3827|consen 60 LKWRWMLLIFSLSFVLSWLFFGVIWWLIAYAHGDLEPDPPGENHTPCVMNVHSFTSAFLFSIETQ 124 (400)
T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcccCCCCcCCCcceeeccchhhhheeeeeee
Confidence 34456788899999999999988777666655541 1111222345555666666666666544
No 50
>CHL00080 psbM photosystem II protein M
Probab=45.67 E-value=39 Score=16.67 Aligned_cols=22 Identities=14% Similarity=0.259 Sum_probs=15.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH
Q psy3982 29 LIFYGFFVYFVPLILIIYAYYF 50 (185)
Q Consensus 29 ~~~~~~~~~~~p~~i~~~~y~~ 50 (185)
......++..+|...+++.|++
T Consensus 7 gfiAt~LFi~iPt~FLlilyvk 28 (34)
T CHL00080 7 AFIATALFILVPTAFLLIIYVK 28 (34)
T ss_pred HHHHHHHHHHHHHHHHHHhhee
Confidence 4455566778898888888864
No 51
>PHA02844 putative transmembrane protein; Provisional
Probab=45.57 E-value=44 Score=19.82 Aligned_cols=26 Identities=19% Similarity=0.268 Sum_probs=16.2
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3982 27 SYLIFYGFFVYFVPLILIIYAYYFIV 52 (185)
Q Consensus 27 ~y~~~~~~~~~~~p~~i~~~~y~~i~ 52 (185)
.+..+++++.+++..++.++.|.+..
T Consensus 47 ~~~~~ii~i~~v~~~~~~~flYLK~~ 72 (75)
T PHA02844 47 STKIWILTIIFVVFATFLTFLYLKAV 72 (75)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHhee
Confidence 44445555666666677777777654
No 52
>COG2322 Predicted membrane protein [Function unknown]
Probab=45.48 E-value=1e+02 Score=21.55 Aligned_cols=37 Identities=3% Similarity=-0.059 Sum_probs=27.4
Q ss_pred CchHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhhhc
Q psy3982 81 GTSAEIKLAKVALMTISLWFLAWTPYLVINFGGILEM 117 (185)
Q Consensus 81 ~~~~~~~~~~~~~~i~~~f~i~~~P~~v~~~~~~~~~ 117 (185)
..++++++..+-....+.|+++++-.+...--..++.
T Consensus 71 ~i~~Hk~aMltA~~l~l~FlvlYltr~~l~~~t~f~~ 107 (177)
T COG2322 71 NIEKHKRAMLTAFTLALVFLVLYLTRHGLGGETAFGG 107 (177)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccCCC
Confidence 4467888888889999999999987776655443333
No 53
>COG3924 Predicted membrane protein [Function unknown]
Probab=43.01 E-value=71 Score=18.89 Aligned_cols=33 Identities=9% Similarity=0.350 Sum_probs=23.7
Q ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3982 22 DWHHRSYLIFYGFFVYFVPLILIIYAYYFIVRA 54 (185)
Q Consensus 22 ~~~~~~y~~~~~~~~~~~p~~i~~~~y~~i~~~ 54 (185)
++....+-.....-.+.+|++.++.|++.|-..
T Consensus 36 t~G~~gfP~WFE~aCi~lPllFi~l~~~mvkfi 68 (80)
T COG3924 36 TPGFTGFPLWFEMACILLPLLFIVLCWAMVKFI 68 (80)
T ss_pred CCCCCCChHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 344455666677778899999999998866433
No 54
>TIGR00739 yajC preprotein translocase, YajC subunit. While this protein is part of the preprotein translocase in Escherichia coli, it is not essential for viability or protein secretion. The N-terminus region contains a predicted membrane-spanning region followed by a region consisting almost entirely of residues with charged (acidic, basic, or zwitterionic) side chains. This small protein is about 100 residues in length, and is restricted to bacteria; however, this protein is absent from some lineages, including spirochetes and Mycoplasmas.
Probab=42.42 E-value=71 Score=19.45 Aligned_cols=24 Identities=17% Similarity=0.437 Sum_probs=12.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Q psy3982 43 LIIYAYYFIVRAVSVHEKQMREQA 66 (185)
Q Consensus 43 i~~~~y~~i~~~l~~~~~~~~~~~ 66 (185)
++++.|..+.+--++++++.+.+.
T Consensus 12 ~~~i~yf~~~rpqkK~~k~~~~m~ 35 (84)
T TIGR00739 12 IFLIFYFLIIRPQRKRRKAHKKLI 35 (84)
T ss_pred HHHHHHHheechHHHHHHHHHHHH
Confidence 344556655555555555554443
No 55
>PHA02975 hypothetical protein; Provisional
Probab=40.35 E-value=76 Score=18.51 Aligned_cols=27 Identities=11% Similarity=0.377 Sum_probs=17.5
Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3982 26 RSYLIFYGFFVYFVPLILIIYAYYFIV 52 (185)
Q Consensus 26 ~~y~~~~~~~~~~~p~~i~~~~y~~i~ 52 (185)
..+..++.++.+++..++.++.|.+..
T Consensus 42 ~~~~~~ii~i~~v~~~~~~~flYLK~~ 68 (69)
T PHA02975 42 SLSIILIIFIIFITCIAVFTFLYLKLM 68 (69)
T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 345555555666666777788887653
No 56
>PF10322 7TM_GPCR_Sru: Serpentine type 7TM GPCR chemoreceptor Sru; InterPro: IPR003839 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class u (Sru) from the Srg superfamily [].
Probab=40.30 E-value=1.8e+02 Score=22.66 Aligned_cols=17 Identities=18% Similarity=0.372 Sum_probs=7.4
Q ss_pred HHHHHHHHHHHHHHHHH
Q psy3982 37 YFVPLILIIYAYYFIVR 53 (185)
Q Consensus 37 ~~~p~~i~~~~y~~i~~ 53 (185)
.+..+++-...+.++-+
T Consensus 208 ~~~iii~N~lm~~Klr~ 224 (307)
T PF10322_consen 208 MISIIILNILMFFKLRK 224 (307)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 33334444444444443
No 57
>PF05817 Ribophorin_II: Oligosaccharyltransferase subunit Ribophorin II; InterPro: IPR008814 This family consists of several eukaryotic Ribophorin II (RPN2) proteins. The mammalian oligosaccharyltransferase (OST) is a protein complex that effects the cotranslational N-glycosylation of newly synthesised polypeptides, and is composed of at least four rough ER-specific membrane proteins: ribophorins I and II (RI and RII), OST48, and Dadl. The mechanism(s) by which the subunits of this complex are retained in the ER are not well understood [].; GO: 0004579 dolichyl-diphosphooligosaccharide-protein glycotransferase activity, 0018279 protein N-linked glycosylation via asparagine, 0005789 endoplasmic reticulum membrane, 0008250 oligosaccharyltransferase complex
Probab=39.62 E-value=2.6e+02 Score=24.35 Aligned_cols=28 Identities=7% Similarity=0.139 Sum_probs=17.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3982 28 YLIFYGFFVYFVPLILIIYAYYFIVRAV 55 (185)
Q Consensus 28 y~~~~~~~~~~~p~~i~~~~y~~i~~~l 55 (185)
..-..|+...+.|+++++..+.++--.+
T Consensus 548 ~vS~~F~~~vlapl~~Ll~~W~~lG~Nl 575 (636)
T PF05817_consen 548 IVSLVFTGLVLAPLLVLLILWLKLGANL 575 (636)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHcCcch
Confidence 3444455667788877777777654333
No 58
>PHA02819 hypothetical protein; Provisional
Probab=38.50 E-value=84 Score=18.46 Aligned_cols=26 Identities=19% Similarity=0.080 Sum_probs=16.4
Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3982 26 RSYLIFYGFFVYFVPLILIIYAYYFI 51 (185)
Q Consensus 26 ~~y~~~~~~~~~~~p~~i~~~~y~~i 51 (185)
..+..+++++.+++-+++.++.|.+.
T Consensus 44 ~~~~~~ii~l~~~~~~~~~~flYLK~ 69 (71)
T PHA02819 44 FLRYYLIIGLVTIVFVIIFIIFYLKV 69 (71)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 34555555566666667777777765
No 59
>PF05151 PsbM: Photosystem II reaction centre M protein (PsbM); InterPro: IPR007826 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection []. This family represents the low molecular weight transmembrane protein PsbM found in PSII. PsbM is one of the most hydrophobic proteins in the thylakoid membrane. The function of this protein is unknown.; GO: 0015979 photosynthesis, 0019684 photosynthesis, light reaction, 0009523 photosystem II, 0016021 integral to membrane; PDB: 3A0H_m 3ARC_m 3A0B_M 3PRR_M 3PRQ_M 1S5L_M 4FBY_e 3BZ2_M 3BZ1_M 2AXT_M ....
Probab=38.26 E-value=52 Score=15.95 Aligned_cols=21 Identities=14% Similarity=0.309 Sum_probs=15.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q psy3982 30 IFYGFFVYFVPLILIIYAYYF 50 (185)
Q Consensus 30 ~~~~~~~~~~p~~i~~~~y~~ 50 (185)
.....++.++|....+..|++
T Consensus 8 fiAtaLfi~iPt~FLiilyvq 28 (31)
T PF05151_consen 8 FIATALFILIPTAFLIILYVQ 28 (31)
T ss_dssp HHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHhheEee
Confidence 444566677888888888875
No 60
>PF15102 TMEM154: TMEM154 protein family
Probab=37.17 E-value=2.5 Score=28.63 Aligned_cols=7 Identities=29% Similarity=0.932 Sum_probs=3.1
Q ss_pred HHHHHHH
Q psy3982 35 FVYFVPL 41 (185)
Q Consensus 35 ~~~~~p~ 41 (185)
+..++|+
T Consensus 59 LmIlIP~ 65 (146)
T PF15102_consen 59 LMILIPL 65 (146)
T ss_pred EEEeHHH
Confidence 3344553
No 61
>PF07330 DUF1467: Protein of unknown function (DUF1467); InterPro: IPR009935 This family consists of several bacterial proteins of around 90 residues in length. The function of this family is unknown.
Probab=36.50 E-value=79 Score=19.39 Aligned_cols=34 Identities=3% Similarity=0.057 Sum_probs=23.7
Q ss_pred cccCCCCCCchHHHHHHHHHHHHHHHHHHHhhHH
Q psy3982 73 TLRSGDQSGTSAEIKLAKVALMTISLWFLAWTPY 106 (185)
Q Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~i~~~f~i~~~P~ 106 (185)
+...+.+.+.+-.+|+..+.+.-+++|.+.+.-+
T Consensus 38 Gt~~sAP~~~~l~rk~~~TTiiaavi~~~~~~~~ 71 (85)
T PF07330_consen 38 GTDPSAPANPRLKRKALITTIIAAVIFAIIYLII 71 (85)
T ss_pred CCCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444456677778888888888888887766433
No 62
>PF04238 DUF420: Protein of unknown function (DUF420); InterPro: IPR007352 This is a predicted membrane protein with four transmembrane helices.
Probab=34.20 E-value=1.5e+02 Score=19.93 Aligned_cols=38 Identities=13% Similarity=0.112 Sum_probs=26.8
Q ss_pred CchHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhhhcc
Q psy3982 81 GTSAEIKLAKVALMTISLWFLAWTPYLVINFGGILEMV 118 (185)
Q Consensus 81 ~~~~~~~~~~~~~~i~~~f~i~~~P~~v~~~~~~~~~~ 118 (185)
+.+.+++...........|+++++-++...-...+.++
T Consensus 30 ~~~~Hr~~Ml~a~~ls~lFlv~Yl~~~~~~g~~~f~g~ 67 (133)
T PF04238_consen 30 RIKLHRKLMLTAFVLSALFLVSYLYYHFLGGSTPFGGP 67 (133)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccCCC
Confidence 34668888888899999999999877765333333333
No 63
>PF11710 Git3: G protein-coupled glucose receptor regulating Gpa2; InterPro: IPR023041 This entry contains a functionally uncharacterised region belonging to the Git3 G-protein coupled receptor. Git3 is one of six proteins required for glucose-triggered adenylate cyclase activation, and is a G protein-coupled receptor responsible for the activation of adenylate cyclase through Gpa2 - heterotrimeric G protein alpha subunit, part of the glucose-detection pathway. Git3 contains seven predicted transmembrane domains, a third cytoplasmic loop and a cytoplasmic tail []. This is the conserved N-terminal domain of the member proteins.
Probab=33.16 E-value=1.3e+02 Score=21.69 Aligned_cols=35 Identities=17% Similarity=0.295 Sum_probs=25.7
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3982 27 SYLIFYGFFVYFVPLILIIYAYYFIVRAVSVHEKQ 61 (185)
Q Consensus 27 ~y~~~~~~~~~~~p~~i~~~~y~~i~~~l~~~~~~ 61 (185)
.+.....-+..++-.++++..|+.|+..+|++-++
T Consensus 166 ~~Rl~l~y~~~~~~~~~~i~iY~~if~~lrr~~~~ 200 (201)
T PF11710_consen 166 WYRLWLHYIWRFIIIFAIIIIYIAIFFYLRRRIRR 200 (201)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc
Confidence 34555555556666888999999999999976543
No 64
>PRK14094 psbM photosystem II reaction center protein M; Provisional
Probab=32.05 E-value=37 Score=18.14 Aligned_cols=22 Identities=18% Similarity=0.177 Sum_probs=15.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH
Q psy3982 29 LIFYGFFVYFVPLILIIYAYYF 50 (185)
Q Consensus 29 ~~~~~~~~~~~p~~i~~~~y~~ 50 (185)
..+..+++..+|.+.+++.|++
T Consensus 7 gfiAtaLFi~iPT~FLlilYVk 28 (50)
T PRK14094 7 GFVASLLFVGVPTIFLIGLFIS 28 (50)
T ss_pred HHHHHHHHHHHHHHHhhheeEE
Confidence 4455566678888888888874
No 65
>PLN00090 photosystem II reaction center M protein; Provisional
Probab=31.03 E-value=1e+02 Score=19.25 Aligned_cols=22 Identities=14% Similarity=0.226 Sum_probs=16.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH
Q psy3982 29 LIFYGFFVYFVPLILIIYAYYF 50 (185)
Q Consensus 29 ~~~~~~~~~~~p~~i~~~~y~~ 50 (185)
..+...++.++|.+.+++.|++
T Consensus 77 afIATaLFIlIPTaFLLILYVQ 98 (113)
T PLN00090 77 AYLAVALGTFLPCLFLINLFIQ 98 (113)
T ss_pred HHHHHHHHHHHHHHHHHHHHhh
Confidence 4445566778899888888874
No 66
>COG3771 Predicted membrane protein [Function unknown]
Probab=30.62 E-value=1.4e+02 Score=18.48 Aligned_cols=42 Identities=10% Similarity=0.065 Sum_probs=30.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc
Q psy3982 29 LIFYGFFVYFVPLILIIYAYYFIVRAVSVHEKQMREQAKKMN 70 (185)
Q Consensus 29 ~~~~~~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~ 70 (185)
....|..++.+--++...+|.+.-..+.+-.++.+.+..+.+
T Consensus 45 la~lF~~G~~lgwli~g~fy~k~~l~~~~l~rqiKr~~~q~~ 86 (97)
T COG3771 45 LATLFAAGFALGWLICGLFYLKVRLSLMRLERQIKRLENQLS 86 (97)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC
Confidence 345566677777788888899888888877777776665544
No 67
>PF00002 7tm_2: 7 transmembrane receptor (Secretin family); InterPro: IPR000832 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The secretin-like GPCRs include secretin [], calcitonin [], parathyroid hormone/parathyroid hormone-related peptides [] and vasoactive intestinal peptide [], all of which activate adenylyl cyclase and the phosphatidyl-inositol-calcium pathway. These receptors contain seven transmembrane regions, in a manner reminiscent of the rhodopsins and other receptors believed to interact with G-proteins (however there is no significant sequence identity between these families, the secretin-like receptors thus bear their own unique '7TM' signature). Their N terminus is probably located on the extracellular side of the membrane and potentially glycosylated. This N-terminal region contains a long conserved region which allow the binding of large peptidic ligand such as glucagon, secretin, VIP and PACAP; this region contains five conserved cysteines residues which could be involved in disulphide bond. The C-terminal region of these receptor is probably cytoplasmic. Every receptor gene in this family is encoded on multiple exons, and several of these genes are alternatively spliced to yield functionally distinct products. ; GO: 0004930 G-protein coupled receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane; PDB: 3L2J_A 1BL1_A.
Probab=30.37 E-value=17 Score=26.56 Aligned_cols=21 Identities=19% Similarity=0.422 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHhhHHH
Q psy3982 87 KLAKVALMTISLWFLAWTPYL 107 (185)
Q Consensus 87 ~~~~~~~~i~~~f~i~~~P~~ 107 (185)
+..+..+.+...+.+.|+...
T Consensus 195 ~~~~~~~~L~~llg~~w~~~~ 215 (242)
T PF00002_consen 195 KILRASLILLPLLGITWLFGI 215 (242)
T ss_dssp ---------------------
T ss_pred hHHHHHHHHHHHHHHHheeee
Confidence 467777778888888887643
No 68
>PF02699 YajC: Preprotein translocase subunit; InterPro: IPR003849 Secretion across the inner membrane in some Gram-negative bacteria occurs via the preprotein translocase pathway. Proteins are produced in the cytoplasm as precursors, and require a chaperone subunit to direct them to the translocase component []. From there, the mature proteins are either targeted to the outer membrane, or remain as periplasmic proteins []. The translocase protein subunits are encoded on the bacterial chromosome. The translocase itself comprises 7 proteins, including a chaperone (SecB), ATPase (SecA), an integral membrane complex (SecY, SecE and SecG), and two additional membrane proteins that promote the release of the mature peptide into the periplasm (SecD and SecF) []. Other cytoplasmic/periplasmic proteins play a part in preprotein translocase activity, namely YidC and YajC []. The latter is bound in a complex to SecD and SecF, and plays a part in stabilising and regulating secretion through the SecYEG integral membrane component via SecA []. Homologues of the YajC gene have been found in a range of pathogenic and commensal microbes. Brucella abortis YajC- and SecD-like proteins were shown to stimulate a Th1 cell-mediated immune response in mice, and conferred protection when challenged with B.abortis []. Therefore, these proteins may have an antigenic role as well as a secretory one in virulent bacteria []. A number of previously uncharacterised "hypothetical" proteins also show similarity to E.coli YajC, suggesting that this family is wider than first thought []. More recently, the precise interactions between the E.coli SecYEG complex, SecD, SecF, YajC and YidC have been studied []. Rather than acting individually, the four proteins form a heterotetrameric complex and associate with the SecYEG heterotrimeric complex []. The SecF and YajC subunits link the complex to the integral membrane translocase. ; PDB: 2RDD_B.
Probab=30.12 E-value=1.1e+02 Score=18.47 Aligned_cols=18 Identities=17% Similarity=0.615 Sum_probs=7.6
Q ss_pred HHHHHHHHHHHHHHHHHH
Q psy3982 42 ILIIYAYYFIVRAVSVHE 59 (185)
Q Consensus 42 ~i~~~~y~~i~~~l~~~~ 59 (185)
+++++.|..+.+--++++
T Consensus 10 ~~~~i~yf~~~rpqkk~~ 27 (82)
T PF02699_consen 10 IIFVIFYFLMIRPQKKQQ 27 (82)
T ss_dssp HHHHHHHHHTHHHHHHHH
T ss_pred HHHHHHhhheecHHHHHH
Confidence 344444544444444333
No 69
>PRK05886 yajC preprotein translocase subunit YajC; Validated
Probab=30.06 E-value=1.6e+02 Score=19.06 Aligned_cols=24 Identities=25% Similarity=0.405 Sum_probs=10.2
Q ss_pred HHHHHHH-HHHHHHHHHHHHHHHHH
Q psy3982 37 YFVPLIL-IIYAYYFIVRAVSVHEK 60 (185)
Q Consensus 37 ~~~p~~i-~~~~y~~i~~~l~~~~~ 60 (185)
.++|+++ +++.|..+.+--+|+++
T Consensus 6 ~ll~lv~i~~i~yF~~iRPQkKr~K 30 (109)
T PRK05886 6 LFLPFLLIMGGFMYFASRRQRKAMQ 30 (109)
T ss_pred HHHHHHHHHHHHHHHHccHHHHHHH
Confidence 3345443 34445444433343333
No 70
>COG4068 Uncharacterized protein containing a Zn-ribbon [Function unknown]
Probab=28.81 E-value=1.2e+02 Score=17.18 Aligned_cols=19 Identities=11% Similarity=0.076 Sum_probs=8.3
Q ss_pred HHHHHHHHHHHHHHHHHHH
Q psy3982 84 AEIKLAKVALMTISLWFLA 102 (185)
Q Consensus 84 ~~~~~~~~~~~i~~~f~i~ 102 (185)
++.|.+.++...+++.++.
T Consensus 38 ~R~r~~~~~~~li~aLi~v 56 (64)
T COG4068 38 KRQRNFMILMFLILALILV 56 (64)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 3444444444444444433
No 71
>PF11119 DUF2633: Protein of unknown function (DUF2633); InterPro: IPR022576 This family is conserved largely in Proteobacteria. Several members are named as YfgG. The function is not known.
Probab=28.78 E-value=1.2e+02 Score=17.21 Aligned_cols=30 Identities=10% Similarity=0.183 Sum_probs=16.2
Q ss_pred hHHHHHHHHHHHHHHHHHHHhhHHHHHHHH
Q psy3982 83 SAEIKLAKVALMTISLWFLAWTPYLVINFG 112 (185)
Q Consensus 83 ~~~~~~~~~~~~i~~~f~i~~~P~~v~~~~ 112 (185)
+...+.++.++++...+++.-+-|..+..+
T Consensus 4 k~~~~mtriVLLISfiIlfgRl~Y~~I~a~ 33 (59)
T PF11119_consen 4 KKNSRMTRIVLLISFIILFGRLIYSAIGAW 33 (59)
T ss_pred cccchHHHHHHHHHHHHHHHHHHHHHHhHH
Confidence 334566676666666655554444444333
No 72
>PF12725 DUF3810: Protein of unknown function (DUF3810); InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=28.39 E-value=2.9e+02 Score=21.57 Aligned_cols=22 Identities=14% Similarity=0.310 Sum_probs=12.7
Q ss_pred HHHHHHHHHHHHHHHHHhhHHH
Q psy3982 86 IKLAKVALMTISLWFLAWTPYL 107 (185)
Q Consensus 86 ~~~~~~~~~i~~~f~i~~~P~~ 107 (185)
.+++..+..+.+.|.+.|.-..
T Consensus 58 ~~~~~~~~~~y~~F~~~WGlNY 79 (318)
T PF12725_consen 58 LNILFFLSVLYFLFYLLWGLNY 79 (318)
T ss_pred HHHHHHHHHHHHHHHHHhhhhc
Confidence 4555555666666666665443
No 73
>PF14163 SieB: Superinfection exclusion protein B
Probab=28.36 E-value=1.2e+02 Score=20.51 Aligned_cols=22 Identities=14% Similarity=0.220 Sum_probs=13.1
Q ss_pred HHHHHHHHHHhhHHHHHHHHHh
Q psy3982 93 LMTISLWFLAWTPYLVINFGGI 114 (185)
Q Consensus 93 ~~i~~~f~i~~~P~~v~~~~~~ 114 (185)
.+++.+-++.++|-.+...+..
T Consensus 4 ~l~i~~~~llf~P~~~~~~l~l 25 (151)
T PF14163_consen 4 WLIIFSGLLLFLPESLLEWLNL 25 (151)
T ss_pred HHHHHHHHHHHCCHHHHHHhCc
Confidence 3444444566888877665544
No 74
>PF02439 Adeno_E3_CR2: Adenovirus E3 region protein CR2; InterPro: IPR003470 Early region 3 (E3) of human adenoviruses (Ads) codes for proteins that appear to control viral interactions with the host []. This region called CR1 (conserved region 1) [] is found three times in Human adenovirus 19 (a subgroup D adenovirus) 49 kDa protein in the E3 region. CR1 is also found in the 20.1 Kd protein of subgroup B adenoviruses. The function of this 80 amino acid region is unknown. This region is probably a divergent immunoglobulin domain.
Probab=27.93 E-value=95 Score=15.84 Aligned_cols=28 Identities=14% Similarity=0.222 Sum_probs=13.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3982 29 LIFYGFFVYFVPLILIIYAYYFIVRAVS 56 (185)
Q Consensus 29 ~~~~~~~~~~~p~~i~~~~y~~i~~~l~ 56 (185)
.+...++..+.-.++.+++|+--+++-+
T Consensus 7 aIIv~V~vg~~iiii~~~~YaCcykk~~ 34 (38)
T PF02439_consen 7 AIIVAVVVGMAIIIICMFYYACCYKKHR 34 (38)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHcccc
Confidence 3334444444445556666665544433
No 75
>PTZ00370 STEVOR; Provisional
Probab=27.30 E-value=1.1e+02 Score=23.44 Aligned_cols=17 Identities=18% Similarity=0.547 Sum_probs=8.7
Q ss_pred HHHHHHHHHHHHHHHHH
Q psy3982 38 FVPLILIIYAYYFIVRA 54 (185)
Q Consensus 38 ~~p~~i~~~~y~~i~~~ 54 (185)
++-.++++++|+.++++
T Consensus 265 lil~vvliilYiwlyrr 281 (296)
T PTZ00370 265 LILAVVLIILYIWLYRR 281 (296)
T ss_pred HHHHHHHHHHHHHHHHh
Confidence 33345556666654443
No 76
>PF00664 ABC_membrane: ABC transporter transmembrane region; InterPro: IPR001140 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energise diverse biological systems. ABC transporters minimally consist of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain []. The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyse ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarise the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site [, , ]. The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis [, , , , , ]. The ATP-Binding Cassette (ABC) superfamily forms one of the largest of all protein families with a diversity of physiological functions []. Several studies have shown that there is a correlation between the functional characterisation and the phylogenetic classification of the ABC cassette [, ]. More than 50 subfamilies have been described based on a phylogenetic and functional classification [, , ]; (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). A variety of ATP-binding transport proteins have a six transmembrane helical region. They are all integral membrane proteins involved in a variety of transport systems. Members of this family include; the cystic fibrosis transmembrane conductance regulator (CFTR), bacterial leukotoxin secretion ATP-binding protein, multidrug resistance proteins, the yeast leptomycin B resistance protein, the mammalian sulphonylurea receptor and antigen peptide transporter 2. Many of these proteins have two such regions.; GO: 0005524 ATP binding, 0042626 ATPase activity, coupled to transmembrane movement of substances, 0006810 transport, 0055085 transmembrane transport, 0016021 integral to membrane; PDB: 3G61_B 3G5U_B 3G60_A 3B60_D 3QF4_B 2HYD_A 2ONJ_A 4A82_B 4AA3_A 2YL4_A.
Probab=27.24 E-value=2.3e+02 Score=20.09 Aligned_cols=20 Identities=15% Similarity=0.217 Sum_probs=12.8
Q ss_pred HHHHHHHHHHHhhhhhccch
Q psy3982 122 PLITIWGSVFAKANAVYNPI 141 (185)
Q Consensus 122 ~~~~~~~~~l~~~~~~~nPi 141 (185)
.....+...+.+.+...+|+
T Consensus 256 ~s~g~~~~~~~~~~~~~~pl 275 (275)
T PF00664_consen 256 ISIGTLVAFLSLSSQLINPL 275 (275)
T ss_dssp SHHHHHHHHHHHHHHHHHHH
T ss_pred cCHHHHHHHHHHHHHHHhhC
Confidence 34555666677777777774
No 77
>PF00375 SDF: Sodium:dicarboxylate symporter family; InterPro: IPR001991 It has been shown [] that integral membrane proteins that mediate the uptake of a wide variety of molecules with the concomitant uptake of sodium ions (sodium symporters) can be grouped, on the basis of sequence and functional similarities into a number of distinct families. One of these families [] is known as the sodium:dicarboxylate symporter family (SDF). Such re-uptake of neurotransmitters from the synapses, is thought to be an important mechanism for terminating their action, by removing these chemicals from the synaptic cleft, and transporting them into presynaptic nerve terminals, and surrounding neuroglia. this removal is also believed to prevent them accumulating to the point of reaching neurotoxic [, ]. The structure of these transporter proteins has been variously reported to contain from 8 to 10 transmembrane (TM) regions, although 10 now seems to be the accepted value. Members of the family include: several mammalian excitatory amino acid transporters, and a number of bacterial transporters. They vary with regars to their dependence on transport of sodium, and other ions.; GO: 0017153 sodium:dicarboxylate symporter activity, 0006835 dicarboxylic acid transport, 0016020 membrane; PDB: 3V8G_B 1XFH_A 3KBC_B 2NWX_B 3V8F_B 2NWL_B 2NWW_A.
Probab=26.27 E-value=3.4e+02 Score=21.70 Aligned_cols=73 Identities=12% Similarity=0.124 Sum_probs=41.0
Q ss_pred HHHHHHHHHHHHHHHHHhhHHHHHHHHHhhhccc----chHH---HHHHHHHHHhhhhhccchhhe-ecchHHHHHHHhh
Q psy3982 86 IKLAKVALMTISLWFLAWTPYLVINFGGILEMVE----INPL---ITIWGSVFAKANAVYNPIVYA-ISHPKFRQALDKK 157 (185)
Q Consensus 86 ~~~~~~~~~i~~~f~i~~~P~~v~~~~~~~~~~~----~~~~---~~~~~~~l~~~~~~~nPiiY~-~~~~~fr~~~~~~ 157 (185)
-+.......-++-++++..|+.+..+........ .... +..+.......-..+.|+++. +..++..+-++++
T Consensus 166 ~~~~~~~~~~ii~~i~~~~Pigv~~l~a~~~~~~~~~~l~~l~~~v~~~~~~~~i~~~v~~pl~~~~~~~~np~~~~~~~ 245 (390)
T PF00375_consen 166 FESLNEVIMKIINWIMKLAPIGVFGLIANSIATQGLSILGALGKFVLTVYVALLIHLFVVLPLILFVLTRKNPFKFLKAM 245 (390)
T ss_dssp HHHHHHHHHHHHHHHTTTHHHHHHHHHHHHHHSSCCGHHHHHHHHHHHHHHHHHHHHHHTHHHHHH-TTT--HHHHHHHT
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcChhhHHHHHHHHHHHHHHHHHHHHHHhhHHHhhhcCCCHHHHHHHH
Confidence 3445556677788899999999988765533222 1111 111111122223566899988 5555777766665
Q ss_pred c
Q psy3982 158 F 158 (185)
Q Consensus 158 ~ 158 (185)
.
T Consensus 246 ~ 246 (390)
T PF00375_consen 246 L 246 (390)
T ss_dssp H
T ss_pred H
Confidence 4
No 78
>PF05241 EBP: Emopamil binding protein ; InterPro: IPR007905 Emopamil binding protein (EBP) is a nonglycosylated type I integral membrane protein of endoplasmic reticulum and shows high level expression in epithelial tissues. The EBP protein has emopamil binding domains, including the sterol acceptor site and the catalytic centre, which show Delta8-Delta7 sterol isomerase activity. Human sterol isomerase, a homologue of mouse EBP, is suggested not only to play a role in cholesterol biosynthesis, but also to affect lipoprotein internalisation. In humans, mutations of EBP are known to cause the genetic disorder of X-linked dominant chondrodysplasia punctata (CDPX2). This syndrome of humans is lethal in most males, and affected females display asymmetric hyperkeratotic skin and skeletal abnormalities [].; GO: 0047750 cholestenol delta-isomerase activity, 0016125 sterol metabolic process, 0005783 endoplasmic reticulum, 0016021 integral to membrane
Probab=25.99 E-value=1.2e+02 Score=21.82 Aligned_cols=32 Identities=28% Similarity=0.405 Sum_probs=26.3
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3982 27 SYLIFYGFFVYFVPLILIIYAYYFIVRAVSVH 58 (185)
Q Consensus 27 ~y~~~~~~~~~~~p~~i~~~~y~~i~~~l~~~ 58 (185)
.|.+....+-.++|+.++.-++-.|....+.+
T Consensus 160 ~y~v~~N~iWivvP~~~l~~s~~~i~~a~~~~ 191 (194)
T PF05241_consen 160 FYFVFPNGIWIVVPLLLLYQSWKEIARAFRAA 191 (194)
T ss_pred eehhhcchHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 45667778889999999999999888887754
No 79
>PHA03054 IMV membrane protein; Provisional
Probab=25.95 E-value=1.5e+02 Score=17.45 Aligned_cols=26 Identities=23% Similarity=0.467 Sum_probs=15.4
Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3982 26 RSYLIFYGFFVYFVPLILIIYAYYFI 51 (185)
Q Consensus 26 ~~y~~~~~~~~~~~p~~i~~~~y~~i 51 (185)
..+..+++++.+++-+++.++.|.+.
T Consensus 46 ~~~~~~ii~l~~v~~~~l~~flYLK~ 71 (72)
T PHA03054 46 WGWYWLIIIFFIVLILLLLIYLYLKV 71 (72)
T ss_pred chHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 34555555555666666667777653
No 80
>TIGR02736 cbb3_Q_epsi cytochrome c oxidase, cbb3-type, CcoQ subunit, epsilon-Proteobacterial. Members of this protein family are restricted to the epsilon branch of the Proteobacteria. All members are found in operons containing the other three structural subunits of the cbb3 type of cytochrome c oxidase. These small proteins show remote sequence similarity to the CcoQ subunit in other cytochrome c oxidase systems, so this family is assumed to represent the epsilonproteobacterial variant of CcoQ.
Probab=25.56 E-value=1.3e+02 Score=16.75 Aligned_cols=21 Identities=24% Similarity=0.558 Sum_probs=11.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q psy3982 39 VPLILIIYAYYFIVRAVSVHE 59 (185)
Q Consensus 39 ~p~~i~~~~y~~i~~~l~~~~ 59 (185)
+.+++.+++|.-|+.--|+++
T Consensus 7 ~ti~lvv~LYgY~yhLYrsek 27 (56)
T TIGR02736 7 FTLLLVIFLYAYIYHLYRSQK 27 (56)
T ss_pred HHHHHHHHHHHHHHHhhhhhc
Confidence 334455566666676666443
No 81
>COG4665 FcbT2 TRAP-type mannitol/chloroaromatic compound transport system, small permease component [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=25.07 E-value=2.5e+02 Score=19.80 Aligned_cols=33 Identities=15% Similarity=0.398 Sum_probs=23.1
Q ss_pred ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3982 24 HHRSYLIFYGFFVYFVPLILIIYAYYFIVRAVS 56 (185)
Q Consensus 24 ~~~~y~~~~~~~~~~~p~~i~~~~y~~i~~~l~ 56 (185)
..+.+.=.+..+.|++|+++++..|.-=+..+.
T Consensus 88 R~qa~vDllGtifFLlPfc~l~iy~~~~~~~~S 120 (182)
T COG4665 88 RTQAWVDLLGTIFFLLPFCLLVIYLSWPYVALS 120 (182)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHccHHHHHH
Confidence 345667777888899999888776665444444
No 82
>TIGR01478 STEVOR variant surface antigen, stevor family. This model represents the stevor branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of stevor sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 8 bits.
Probab=24.53 E-value=1.1e+02 Score=23.45 Aligned_cols=15 Identities=20% Similarity=0.587 Sum_probs=7.2
Q ss_pred HHHHHHHHHHHHHHH
Q psy3982 39 VPLILIIYAYYFIVR 53 (185)
Q Consensus 39 ~p~~i~~~~y~~i~~ 53 (185)
+-.++++++|+.+++
T Consensus 270 il~vvliiLYiWlyr 284 (295)
T TIGR01478 270 ILTVVLIILYIWLYR 284 (295)
T ss_pred HHHHHHHHHHHHHHH
Confidence 334555555554433
No 83
>PHA03234 DNA packaging protein UL33; Provisional
Probab=24.50 E-value=89 Score=24.41 Aligned_cols=11 Identities=18% Similarity=0.495 Sum_probs=6.4
Q ss_pred HHhhHHHHHHH
Q psy3982 101 LAWTPYLVINF 111 (185)
Q Consensus 101 i~~~P~~v~~~ 111 (185)
...+|..++.+
T Consensus 205 ~f~iPl~im~~ 215 (338)
T PHA03234 205 WGIFPTMIFSF 215 (338)
T ss_pred HHHHHHHHHHH
Confidence 34567766554
No 84
>PF10669 Phage_Gp23: Protein gp23 (Bacteriophage A118); InterPro: IPR018926 This entry is represented by the major tail subunit protein, Gp23 of Listeria phage A118 and prophage found in Bacilli. The function is currently unknown.
Probab=22.55 E-value=2.1e+02 Score=17.98 Aligned_cols=15 Identities=20% Similarity=0.397 Sum_probs=6.1
Q ss_pred HHHHHHHHHHHHHHH
Q psy3982 37 YFVPLILIIYAYYFI 51 (185)
Q Consensus 37 ~~~p~~i~~~~y~~i 51 (185)
++..++++++.-.+|
T Consensus 23 ~i~~FiILLIi~~~I 37 (121)
T PF10669_consen 23 FIVVFIILLIITKSI 37 (121)
T ss_pred HHHHHHHHHHHHHHH
Confidence 333344444444444
No 85
>PHA01815 hypothetical protein
Probab=22.39 E-value=1.4e+02 Score=15.81 Aligned_cols=23 Identities=9% Similarity=0.217 Sum_probs=14.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Q psy3982 31 FYGFFVYFVPLILIIYAYYFIVR 53 (185)
Q Consensus 31 ~~~~~~~~~p~~i~~~~y~~i~~ 53 (185)
..+.+.|++.+++++..|+++..
T Consensus 9 ivfllaflitliilmt~~irvsf 31 (55)
T PHA01815 9 IVFLLAFLITLIILMTLHIRVSF 31 (55)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 34555666777777777765533
No 86
>PF10066 DUF2304: Uncharacterized conserved protein (DUF2304); InterPro: IPR019277 This entry represents hypothetical archaeal and bacterial proteins that have no known function.
Probab=22.38 E-value=2.3e+02 Score=18.25 Aligned_cols=51 Identities=16% Similarity=0.127 Sum_probs=29.5
Q ss_pred hHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhhhcccchHHHHHHHHHHHhh
Q psy3982 83 SAEIKLAKVALMTISLWFLAWTPYLVINFGGILEMVEINPLITIWGSVFAKA 134 (185)
Q Consensus 83 ~~~~~~~~~~~~i~~~f~i~~~P~~v~~~~~~~~~~~~~~~~~~~~~~l~~~ 134 (185)
+.+....+.++.+...++.. .|.....+....+-......+..+...+.+.
T Consensus 29 ~~~~~l~Wl~~~i~~l~~~i-fP~~~~~vA~~lGi~~~~n~lf~~~i~~ll~ 79 (115)
T PF10066_consen 29 RLKYSLLWLVFSIILLILSI-FPNILDWVAKLLGIGRPPNLLFYLGILFLLV 79 (115)
T ss_pred cHHHHHHHHHHHHHHHHHHh-hhhHHHHHHHHHCCCchhHHHHHHHHHHHHH
Confidence 44556666666655555544 4887777777766655444555444444433
No 87
>TIGR03052 PS_I_psaI photosystem I reaction center subunit VIII. Members of this protein family are PsaI, subunit VIII of the photosystem I reaction center. This protein is found in both the Cyanobacteria and the chloroplasts of plants, but is absent from non-oxygenic photosynthetic bacteria such as Rhodobacter sphaeroides. Species that contain photosystem I also contain photosystem II, which splits water and releases molecular oxygen.
Probab=22.25 E-value=1e+02 Score=14.94 Aligned_cols=19 Identities=5% Similarity=0.294 Sum_probs=12.5
Q ss_pred HHHHHHHHHHHHHHHHHHH
Q psy3982 32 YGFFVYFVPLILIIYAYYF 50 (185)
Q Consensus 32 ~~~~~~~~p~~i~~~~y~~ 50 (185)
.-+++.+.|.+.|...+.-
T Consensus 8 VPlVglvfPai~Ma~lf~y 26 (31)
T TIGR03052 8 VPLVGLVFPAVFMALLFRY 26 (31)
T ss_pred hhHHHHHHHHHHHHHHHHh
Confidence 3456677888888766553
No 88
>TIGR00934 2a38euk potassium uptake protein, Trk family. The proteins of the Trk family are derived from Gram-negative and Gram-positive bacteria, yeast and wheat. The proteins of E. coli K12 TrkH and TrkG as well as several yeast proteins have been functionally characterized.The E. coli TrkH and TrkG proteins are complexed to two peripheral membrane proteins, TrkA, an NAD-binding protein, and TrkE, an ATP-binding protein. This complex forms the potassium uptake system. This family is specific for the eukaryotic Trk system.
Probab=21.70 E-value=5.1e+02 Score=23.29 Aligned_cols=32 Identities=13% Similarity=0.144 Sum_probs=25.3
Q ss_pred HHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhh
Q psy3982 84 AEIKLAKVALMTISLWFLAWTPYLVINFGGIL 115 (185)
Q Consensus 84 ~~~~~~~~~~~i~~~f~i~~~P~~v~~~~~~~ 115 (185)
-|-|++|.++.++..|++.|--..++.+...+
T Consensus 443 iEYRaLk~L~~Iv~~Y~~~~~llG~i~l~~wi 474 (800)
T TIGR00934 443 IEYRALKCLCSIVLVYFLGFNILGFVLLLPWI 474 (800)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 46799999999999999999877666655443
No 89
>PF03155 Alg6_Alg8: ALG6, ALG8 glycosyltransferase family; InterPro: IPR004856 N-linked (asparagine-linked) glycosylation of proteins is mediated by a highly conserved pathway in eukaryotes, in which a lipid (dolichol phosphate)-linked oligosaccharide is assembled at the endoplasmic reticulum membrane prior to the transfer of the oligosaccharide moiety to the target asparagine residues. This oligosaccharide is composed of Glc(3)Man(9)GlcNAc(2). The addition of the three glucose residues is the final series of steps in the synthesis of the oligosaccharide precursor. Alg6 transfers the first glucose residue, and Alg8 transfers the second one []. In the human alg6 gene, a C-T transition, which causes Ala333 to be replaced with Val, has been identified as the cause of a congenital disorder of glycosylation, designated as type Ic OMIM:603147 []. ; GO: 0016758 transferase activity, transferring hexosyl groups, 0005789 endoplasmic reticulum membrane
Probab=21.28 E-value=1.4e+02 Score=24.80 Aligned_cols=27 Identities=22% Similarity=0.443 Sum_probs=23.3
Q ss_pred hHHHHHHHHHHHHHHHHHHHhhHHHHH
Q psy3982 83 SAEIKLAKVALMTISLWFLAWTPYLVI 109 (185)
Q Consensus 83 ~~~~~~~~~~~~i~~~f~i~~~P~~v~ 109 (185)
..-.|..+.-..++.+|.++|.|+...
T Consensus 218 ~~~~~~~~lg~~Vi~~f~~~~~PF~~~ 244 (469)
T PF03155_consen 218 FSIKRLIKLGIVVIATFALSFGPFLYS 244 (469)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 456889999999999999999999743
No 90
>PF08525 OapA_N: Opacity-associated protein A N-terminal motif; InterPro: IPR013731 This domain is found in the Haemophilus influenzae opacity-associated protein (OapA). It is required for efficient nasopharyngeal mucosal colonisation, and its expression is associated with a distinctive transparent colony phenotype. OapA is thought to be a secreted protein, and its expression exhibits high-frequency phase variation [, ]. This motif occurs at the N terminus of these proteins. It contains a conserved histidine followed by a run of hydrophobic residues. Many of the proteins in this entry are unassigned peptidases belonging to MEROPS peptidase family M23B.
Probab=20.29 E-value=1.2e+02 Score=14.37 Aligned_cols=23 Identities=9% Similarity=-0.021 Sum_probs=11.3
Q ss_pred chHHHHHHHHHHHHHHHHHHHhhHH
Q psy3982 82 TSAEIKLAKVALMTISLWFLAWTPY 106 (185)
Q Consensus 82 ~~~~~~~~~~~~~i~~~f~i~~~P~ 106 (185)
.+.+++.+..+..+++. +.|.|.
T Consensus 7 P~~Hr~~l~~l~~v~l~--ll~~Ps 29 (30)
T PF08525_consen 7 PKLHRRALIALSAVVLV--LLLWPS 29 (30)
T ss_pred CHHHHHHHHHHHHHHHH--HHhccC
Confidence 34455555544444444 444453
No 91
>PHA02909 hypothetical protein; Provisional
Probab=20.15 E-value=1.8e+02 Score=16.19 Aligned_cols=20 Identities=10% Similarity=0.422 Sum_probs=8.9
Q ss_pred HHHHHHHHHHHHHHHHHHHH
Q psy3982 32 YGFFVYFVPLILIIYAYYFI 51 (185)
Q Consensus 32 ~~~~~~~~p~~i~~~~y~~i 51 (185)
++++.|+.-+.++...|.-|
T Consensus 40 lfviiflsmftilacsyvyi 59 (72)
T PHA02909 40 LFVIIFLSMFTILACSYVYI 59 (72)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 33444444444444555444
No 92
>PF10361 DUF2434: Protein of unknown function (DUF2434); InterPro: IPR018830 This entry represents a family of proteins conserved in fungi. Their function is not known.
Probab=20.12 E-value=4.2e+02 Score=20.50 Aligned_cols=46 Identities=15% Similarity=0.128 Sum_probs=22.1
Q ss_pred CcccccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3982 10 NLTACGTDYLTKDWHHRSYLIFYGFFVYFVPLILIIYAYYFIVRAVSVHEKQMRE 64 (185)
Q Consensus 10 ~~~~C~~~~~~~~~~~~~y~~~~~~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~ 64 (185)
|.+.|+.-..+ ...+.+.-+.+.+.+.+-+++. +..++||.+.-.+
T Consensus 32 N~TsCysPi~~--ig~rg~vGI~fav~f~i~lvlt-------LvnL~KHG~~~lp 77 (296)
T PF10361_consen 32 NGTSCYSPINP--IGTRGSVGIAFAVLFAIALVLT-------LVNLRKHGRLYLP 77 (296)
T ss_pred cCcccCCCCcc--ccccchhHHHHHHHHHHHHHHH-------HHHHHHhhhhcCC
Confidence 56778553333 3334444444444444332222 2356777766543
Done!