Query psy6126
Match_columns 214
No_of_seqs 166 out of 1612
Neff 10.7
Searched_HMMs 46136
Date Fri Aug 16 18:17:50 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy6126.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/6126hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4220|consensus 100.0 6.3E-32 1.4E-36 204.8 11.5 180 2-182 53-233 (503)
2 KOG4219|consensus 100.0 7.3E-30 1.6E-34 193.6 7.8 172 1-174 57-241 (423)
3 PF00001 7tm_1: 7 transmembran 99.9 2.9E-28 6.3E-33 182.7 5.4 176 2-177 7-183 (257)
4 PHA03234 DNA packaging protein 99.9 1.3E-26 2.8E-31 180.3 13.6 162 5-172 60-227 (338)
5 PHA02834 chemokine receptor-li 99.9 2.7E-25 5.9E-30 172.6 13.3 164 3-172 52-219 (323)
6 PHA02638 CC chemokine receptor 99.9 4.9E-25 1.1E-29 175.8 12.7 167 5-173 123-304 (417)
7 PHA03235 DNA packaging protein 99.9 1.3E-23 2.8E-28 167.0 14.5 160 11-173 66-235 (409)
8 PHA03087 G protein-coupled che 99.9 2.3E-23 5E-28 162.8 14.6 167 3-173 64-234 (335)
9 KOG2087|consensus 99.7 4.4E-16 9.4E-21 117.7 12.1 165 5-174 50-224 (363)
10 PF10320 7TM_GPCR_Srsx: Serpen 99.7 1.1E-15 2.3E-20 115.0 12.3 165 1-174 12-176 (257)
11 PF10328 7TM_GPCR_Srx: Serpent 99.6 2.3E-15 5E-20 114.5 11.9 174 3-176 17-191 (274)
12 PF10324 7TM_GPCR_Srw: Serpent 99.4 8.7E-12 1.9E-16 97.0 11.4 178 3-180 19-230 (318)
13 PF10323 7TM_GPCR_Srv: Serpent 99.3 5.3E-12 1.2E-16 96.3 9.0 162 10-174 29-195 (283)
14 PF11710 Git3: G protein-coupl 99.3 6.4E-11 1.4E-15 85.5 13.3 162 8-175 30-200 (201)
15 PF03402 V1R: Vomeronasal orga 98.9 1.7E-08 3.6E-13 75.5 9.0 172 9-182 6-187 (265)
16 PF05296 TAS2R: Mammalian tast 98.9 6.5E-08 1.4E-12 74.6 12.0 174 7-180 37-218 (303)
17 PF05462 Dicty_CAR: Slime mold 98.7 9.2E-07 2E-11 67.9 12.5 123 2-136 29-151 (303)
18 PF10321 7TM_GPCR_Srt: Serpent 98.6 6.9E-07 1.5E-11 68.9 10.6 163 2-176 55-229 (313)
19 PF02118 Srg: Srg family chemo 98.2 6.5E-06 1.4E-10 62.8 8.2 164 10-176 31-202 (275)
20 PF00002 7tm_2: 7 transmembran 97.9 2.1E-06 4.6E-11 64.2 0.0 109 4-117 25-133 (242)
21 PF10292 7TM_GPCR_Srab: Serpen 97.8 0.00064 1.4E-08 53.2 11.7 126 6-136 43-176 (324)
22 PF10316 7TM_GPCR_Srbc: Serpen 97.7 0.00049 1.1E-08 52.2 9.1 89 7-95 33-124 (273)
23 PF04789 DUF621: Protein of un 97.5 0.0035 7.5E-08 46.9 11.8 163 6-171 41-215 (305)
24 PF03383 Serpentine_r_xa: Caen 97.4 0.00016 3.4E-09 49.4 2.7 101 16-116 5-108 (153)
25 PF10317 7TM_GPCR_Srd: Serpent 97.3 0.0023 4.9E-08 49.4 9.4 108 4-116 24-141 (292)
26 KOG4193|consensus 97.0 0.0015 3.2E-08 55.0 5.8 142 8-168 348-491 (610)
27 PF02101 Ocular_alb: Ocular al 96.7 0.055 1.2E-06 42.6 11.7 94 14-113 67-171 (405)
28 PF10322 7TM_GPCR_Sru: Serpent 96.6 0.052 1.1E-06 42.1 11.0 122 50-173 101-226 (307)
29 PF10326 7TM_GPCR_Str: Serpent 96.6 0.0031 6.7E-08 48.9 4.3 107 5-112 25-141 (307)
30 PF02175 7TM_GPCR_Srb: Serpent 96.2 0.064 1.4E-06 39.4 8.6 45 46-90 79-123 (236)
31 PF02117 7TM_GPCR_Sra: Serpent 95.7 0.081 1.8E-06 41.6 8.1 75 13-88 55-137 (328)
32 PF10319 7TM_GPCR_Srj: Serpent 95.4 0.35 7.6E-06 37.4 10.3 108 3-110 28-146 (310)
33 PF10318 7TM_GPCR_Srh: Serpent 95.3 0.14 2.9E-06 39.7 8.2 111 4-116 29-147 (302)
34 PF06681 DUF1182: Protein of u 94.2 0.66 1.4E-05 33.5 8.5 112 45-171 110-224 (226)
35 PF10327 7TM_GPCR_Sri: Serpent 94.1 1 2.2E-05 35.0 10.2 109 3-113 33-150 (303)
36 PF01534 Frizzled: Frizzled/Sm 91.9 2.8 6.1E-05 33.0 9.7 87 47-137 89-175 (328)
37 KOG4564|consensus 91.7 0.73 1.6E-05 37.9 6.5 110 3-116 168-289 (473)
38 PF15086 UPF0542: Uncharacteri 90.2 0.79 1.7E-05 26.9 3.9 36 141-176 19-55 (74)
39 PF02076 STE3: Pheromone A rec 88.8 9.3 0.0002 29.5 12.5 105 12-122 24-130 (283)
40 PF03125 Sre: C. elegans Sre G 87.5 5.8 0.00012 31.8 8.6 35 56-90 127-161 (365)
41 PF13853 7tm_4: Olfactory rece 79.4 0.099 2.2E-06 35.8 -3.8 76 98-173 5-88 (144)
42 COG1862 YajC Preprotein transl 62.6 15 0.00033 23.3 3.5 27 151-177 12-38 (97)
43 PF10325 7TM_GPCR_Srz: Serpent 62.3 66 0.0014 24.3 7.9 55 56-110 86-141 (267)
44 KOG4016|consensus 61.6 34 0.00074 25.0 5.4 55 96-151 25-79 (233)
45 PF01528 Herpes_glycop: Herpes 52.9 1.2E+02 0.0027 24.5 12.8 35 141-175 300-334 (374)
46 TIGR02736 cbb3_Q_epsi cytochro 52.3 41 0.00088 18.8 3.6 28 151-178 5-32 (56)
47 PRK06531 yajC preprotein trans 51.4 22 0.00047 23.3 2.9 14 153-166 7-20 (113)
48 PF02699 YajC: Preprotein tran 48.2 47 0.001 20.2 4.0 17 156-172 10-26 (82)
49 TIGR00739 yajC preprotein tran 45.1 37 0.00079 20.9 3.1 19 157-175 12-30 (84)
50 PRK04989 psbM photosystem II r 44.5 38 0.00082 16.9 2.5 20 145-164 9-28 (35)
51 PRK05886 yajC preprotein trans 43.5 56 0.0012 21.2 3.9 20 152-171 7-27 (109)
52 PF05545 FixQ: Cbb3-type cytoc 43.5 53 0.0012 17.6 4.7 25 151-175 12-36 (49)
53 TIGR02976 phageshock_pspB phag 42.6 55 0.0012 19.7 3.5 23 150-172 5-27 (75)
54 TIGR03038 PS_II_psbM photosyst 41.1 47 0.001 16.3 2.5 20 145-164 9-28 (33)
55 COG4736 CcoQ Cbb3-type cytochr 37.6 81 0.0018 18.0 4.8 31 150-180 11-41 (60)
56 PRK09458 pspB phage shock prot 37.1 48 0.001 19.9 2.6 27 149-175 4-30 (75)
57 PF09835 DUF2062: Uncharacteri 36.8 1.2E+02 0.0026 20.8 5.1 31 148-178 122-152 (154)
58 PRK05585 yajC preprotein trans 36.6 79 0.0017 20.4 3.8 18 158-175 28-45 (106)
59 PF11044 TMEMspv1-c74-12: Plec 36.6 69 0.0015 16.9 4.9 14 160-173 20-33 (49)
60 PF15050 SCIMP: SCIMP protein 35.2 86 0.0019 20.7 3.7 20 143-162 9-28 (133)
61 CHL00080 psbM photosystem II p 34.9 62 0.0014 16.0 2.3 20 145-164 9-28 (34)
62 PF05241 EBP: Emopamil binding 34.6 1.2E+02 0.0027 21.9 5.0 30 145-174 164-193 (194)
63 PF06143 Baculo_11_kDa: Baculo 34.1 1.2E+02 0.0025 18.8 4.7 26 145-170 36-61 (84)
64 PF06305 DUF1049: Protein of u 34.0 94 0.002 17.7 5.3 14 158-171 39-52 (68)
65 PHA03237 envelope glycoprotein 33.7 2.8E+02 0.006 23.0 10.7 32 143-174 325-356 (424)
66 PF05151 PsbM: Photosystem II 32.6 67 0.0015 15.6 3.7 21 145-165 9-29 (31)
67 PHA03164 hypothetical protein; 29.0 52 0.0011 19.6 1.9 23 8-30 53-75 (88)
68 PF10669 Phage_Gp23: Protein g 28.3 86 0.0019 19.7 2.8 15 152-166 24-38 (121)
69 PF08374 Protocadherin: Protoc 28.0 57 0.0012 24.0 2.3 9 165-173 56-64 (221)
70 PF06667 PspB: Phage shock pro 27.9 1.4E+02 0.0029 18.0 3.5 16 151-166 6-21 (75)
71 PHA02909 hypothetical protein; 26.6 1.2E+02 0.0027 16.8 3.4 29 46-74 27-55 (72)
72 COG3924 Predicted membrane pro 25.7 1.5E+02 0.0034 17.6 3.9 28 143-170 43-70 (80)
73 PLN00090 photosystem II reacti 24.8 1.4E+02 0.003 18.8 3.2 20 145-164 79-98 (113)
74 KOG4298|consensus 24.3 2.8E+02 0.0061 20.1 6.3 24 94-117 140-163 (245)
75 PF01102 Glycophorin_A: Glycop 23.1 2E+02 0.0044 19.1 4.0 12 164-175 83-94 (122)
76 COG3114 CcmD Heme exporter pro 22.2 1.7E+02 0.0038 16.9 4.7 24 140-163 15-38 (67)
77 PRK14094 psbM photosystem II r 21.1 64 0.0014 17.3 1.1 20 145-164 9-28 (50)
78 PF08693 SKG6: Transmembrane a 20.4 37 0.0008 17.6 0.1 20 152-171 18-37 (40)
No 1
>KOG4220|consensus
Probab=99.97 E-value=6.3e-32 Score=204.75 Aligned_cols=180 Identities=22% Similarity=0.355 Sum_probs=155.4
Q ss_pred cceeccCCCCCchhhHHHHHHHHHHHHHHhhhhhhhhhhhcccccccccccchhhHHHHHHHHHHHHHHHHHHHHhheee
Q psy6126 2 SPFRRPSTVRNATAVFIINLSVSDLMFCCFNLPLAASTFWQRAWTHGHLLCQLFPLLRYGLLAVSLFTVLGITINRYVMI 81 (214)
Q Consensus 2 ~~v~~~~~~~~~~~~~i~~Lav~Dll~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~s~~~l~~is~dRy~aI 81 (214)
+.+.-.|++++..|+|+.+||++|++.|.+.+|++..+.+-+.|.+|...|.++..+.++...+|+++++.|++|||++|
T Consensus 53 iSfKvnrqLqTVnNYfLfSLAcADliIG~~SMnl~t~Y~lmg~W~LG~~~CdlWLalDYvaSNASVmNLLiISFDRYFsV 132 (503)
T KOG4220|consen 53 ISFKVNRQLQTVNNYFLFSLACADLIIGAFSMNLYTTYTLMGYWPLGPLVCDLWLALDYVASNASVMNLLIISFDRYFSV 132 (503)
T ss_pred EEEEecceeeeecceeehHHHHhhhhhheeechHHHHHHHHcccccchHHHHHHHHHHHHhhhhhhhhhheeeeecceee
Confidence 34567889999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred eccCCcCccccccchhhhhHHHHHHHHHHhhhhhhhcccccc-ccCccceeeecCCCCCCchhHHHHHHHHHHHHHHHHH
Q psy6126 82 GHPTLYPKLYSSKFLAFMVACTWLFGFGALVPTWLGVWGRFG-LEPSIGSCSILPDDYGHSPKEFLFLVAFVIPCISIVV 160 (214)
Q Consensus 82 ~~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~~-~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~p~~~~~~ 160 (214)
.+|+.|+...|.+++..+|+..|++++++..|.++.+....+ .....+.|..+... +....+-..+..|.+|+.+|.+
T Consensus 133 TrPLtYrakRTtkrA~~MI~~AW~iSfiLWaPaIl~WqyivGkrTv~~~eC~iQFls-np~iTfGTAiAAFYlPVtiM~~ 211 (503)
T KOG4220|consen 133 TRPLTYRAKRTTKRAGLMIGAAWVLSFVLWAPAILFWQYIVGKRTVPDGECYIQFLS-NPAITFGTAIAAFYLPVTIMTI 211 (503)
T ss_pred cccccccccccchHHHHHHHHHHHHHHHHHHHHHHhhHhheeeeecCCCceEEEeec-CceeehhHHHHHHHhhHHHHHH
Confidence 999999999999999999999999999999997765333222 23445689876554 4455555677889999999999
Q ss_pred HHHHHHHHHHhhhhhhhhhhhc
Q psy6126 161 CYARIFYIVRKTAMKSRAMNMK 182 (214)
Q Consensus 161 ~y~~i~~~l~~~~~~~~~~~~~ 182 (214)
+|++|++..+++.+........
T Consensus 212 LY~rIyret~kR~k~~~~lq~s 233 (503)
T KOG4220|consen 212 LYWRIYRETRKRQKELAKLQAS 233 (503)
T ss_pred HHHHHHHHHHHHHHhhhhcccc
Confidence 9999999999988766554433
No 2
>KOG4219|consensus
Probab=99.96 E-value=7.3e-30 Score=193.56 Aligned_cols=172 Identities=20% Similarity=0.295 Sum_probs=144.5
Q ss_pred CcceeccCCCCCchhhHHHHHHHHHHHHHHhhhhhhhhhhhcccccccccccchhhHHHHHHHHHHHHHHHHHHHHhhee
Q psy6126 1 MSPFRRPSTVRNATAVFIINLSVSDLMFCCFNLPLAASTFWQRAWTHGHLLCQLFPLLRYGLLAVSLFTVLGITINRYVM 80 (214)
Q Consensus 1 i~~v~~~~~~~~~~~~~i~~Lav~Dll~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~s~~~l~~is~dRy~a 80 (214)
++++..+|++|+.+|+|+.|||+||++++++..++........+|.+|...|++..|+......+|++++++||+|||.|
T Consensus 57 lwIil~hrrMRtvtnyfL~NLAfADl~~s~Fn~~f~f~yal~~~W~~G~f~C~f~nf~~itav~vSVfTlvAiA~DRy~A 136 (423)
T KOG4219|consen 57 LWIILAHRRMRTVTNYFLVNLAFADLSMSIFNTVFNFQYALHQEWYFGSFYCRFVNFFPITAVFVSVFTLVAIAIDRYMA 136 (423)
T ss_pred EEEEeehhehhhhHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhccccccceeeeccccchhhhhHhHHHHHHHHHHHHHH
Confidence 35677899999999999999999999999999999998888899999999999999999999999999999999999999
Q ss_pred eeccCCcCccccccchhhhhHHHHHHHHHHhhhhhhhcccccccc---CccceeeecCCCC----------CCchhHHHH
Q psy6126 81 IGHPTLYPKLYSSKFLAFMVACTWLFGFGALVPTWLGVWGRFGLE---PSIGSCSILPDDY----------GHSPKEFLF 147 (214)
Q Consensus 81 I~~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~~~~---~~~~~C~~~~~~~----------~~~~~~~~~ 147 (214)
|.+|++.+ .+++....+++.+|++++++++|..+.......+. .....|...+.+. .+.|...+.
T Consensus 137 Ii~Pl~~r--~s~r~sk~iIllIW~lA~l~a~P~~l~s~v~~~~~~d~~~~~~~~~~~pe~~~~~~~~~~~~~~y~~vl~ 214 (423)
T KOG4219|consen 137 IIHPLQPR--PSRRSSKIIILLIWALALLLALPQLLYSSVEELYLYDGESRVVCVTAWPEHVCPTENESLLMQGYNYVLL 214 (423)
T ss_pred HhhhcccC--CCCcceeehhHHHHHHHHHHhccceeeeeeEEeeccCCcceEEEEEecccccCCcchhhhhhcceeeeeh
Confidence 99999755 59999999999999999999999776533332222 1233565432221 122556677
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhh
Q psy6126 148 LVAFVIPCISIVVCYARIFYIVRKTAM 174 (214)
Q Consensus 148 ~~~~~~p~~~~~~~y~~i~~~l~~~~~ 174 (214)
++.+++|++++...|..|.+.+|..+.
T Consensus 215 ~lqYflPliVl~~~Yt~iav~LW~~~~ 241 (423)
T KOG4219|consen 215 FLQYFLPLIVLGLAYTVIAVTLWGRRI 241 (423)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHhccC
Confidence 889999999999999999999998653
No 3
>PF00001 7tm_1: 7 transmembrane receptor (rhodopsin family) Rhodopsin-like GPCR superfamily signature 5-hydroxytryptamine 7 receptor signature bradykinin receptor signature gastrin receptor signature melatonin receptor signature olfactory receptor signature; InterPro: IPR000276 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The rhodopsin-like GPCRs themselves represent a widespread protein family that includes hormone, neurotransmitter and light receptors, all of which transduce extracellular signals through interaction with guanine nucleotide-binding (G) proteins. Although their activating ligands vary widely in structure and character, the amino acid sequences of the receptors are very similar and are believed to adopt a common structural framework comprising 7 transmembrane (TM) helices [, , ].; GO: 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane; PDB: 2KI9_A 3QAK_A 2YDV_A 3VGA_A 3PWH_A 3RFM_A 3EML_A 3VG9_A 3REY_A 3UZA_A ....
Probab=99.95 E-value=2.9e-28 Score=182.71 Aligned_cols=176 Identities=25% Similarity=0.514 Sum_probs=149.2
Q ss_pred cceeccCCCCCchhhHHHHHHHHHHHHHHhhhhhhhhhhhcccccccccccchhhHHHHHHHHHHHHHHHHHHHHhheee
Q psy6126 2 SPFRRPSTVRNATAVFIINLSVSDLMFCCFNLPLAASTFWQRAWTHGHLLCQLFPLLRYGLLAVSLFTVLGITINRYVMI 81 (214)
Q Consensus 2 ~~v~~~~~~~~~~~~~i~~Lav~Dll~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~s~~~l~~is~dRy~aI 81 (214)
+++.++|++|++.++++.|||++|++.++...|........++|..++..|++..++...+..+|.++++++++|||++|
T Consensus 7 ~~~~~~~~~~~~~~~~l~~Lav~Dll~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~s~~~~~~is~dRy~~i 86 (257)
T PF00001_consen 7 LVILRSKRLRTPSNILLLNLAVADLLVGLFCIPFYIYSLLFDDWIFSSFLCRIFGFLFYFSSFSSIFSLVAISIDRYLAI 86 (257)
T ss_dssp HHHHHSGGG-SHHHHHHHHHHHHHHHHHHTHHHHHHHHHHHSSCTSHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred hhhhhhccCCChhHHHHHHHHHHHHhhccccccccccccccccccccccccccccccccccccccccccccccccccccc
Confidence 45678889999999999999999999999888888877766789999999999999999999999999999999999999
Q ss_pred eccCCcCccccccchhhhhHHHHHHHHHHhhhhhhhccccccccCccceeeecCCCCC-CchhHHHHHHHHHHHHHHHHH
Q psy6126 82 GHPTLYPKLYSSKFLAFMVACTWLFGFGALVPTWLGVWGRFGLEPSIGSCSILPDDYG-HSPKEFLFLVAFVIPCISIVV 160 (214)
Q Consensus 82 ~~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~-~~~~~~~~~~~~~~p~~~~~~ 160 (214)
++|++|++..+++++...++.+|+.+++.++|+.+..+.....+.....|........ ..+.....++.+++|++++++
T Consensus 87 ~~p~~~~~~~~~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~ 166 (257)
T PF00001_consen 87 CHPLRYRRIRTRRRARIIIILIWIISFLISLPPLFFSWVYFVSDGSQSFCFIDFSSSSSQIYFIYFFIVFFILPLIIILI 166 (257)
T ss_dssp HSHHTHHHHSCHHHHHHHHHHHHHHHHHHHHHHHHTCEEEEESTCCCEEEEESCSSSHHHHHHHHHHHHHTHHHHHHHHH
T ss_pred cccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccceeeeee
Confidence 9999999999999999999999999999999987764443333333568987555442 344556677889999999999
Q ss_pred HHHHHHHHHHhhhhhhh
Q psy6126 161 CYARIFYIVRKTAMKSR 177 (214)
Q Consensus 161 ~y~~i~~~l~~~~~~~~ 177 (214)
+|.+|++.+|+++++.+
T Consensus 167 ~~~~i~~~~~~~~~~~~ 183 (257)
T PF00001_consen 167 CYIRILRKLRRQRKRIK 183 (257)
T ss_dssp HHHHHHHHHHHHHHCTC
T ss_pred ecccccccccccccccc
Confidence 99999999999877644
No 4
>PHA03234 DNA packaging protein UL33; Provisional
Probab=99.94 E-value=1.3e-26 Score=180.28 Aligned_cols=162 Identities=15% Similarity=0.182 Sum_probs=120.1
Q ss_pred eccCCCCCchhhHHHHHHHHHHHHHHhhhhhhhhhhhcccccccccccchhhHHHHHHHHHHHHHHHHHHHHhheeeecc
Q psy6126 5 RRPSTVRNATAVFIINLSVSDLMFCCFNLPLAASTFWQRAWTHGHLLCQLFPLLRYGLLAVSLFTVLGITINRYVMIGHP 84 (214)
Q Consensus 5 ~~~~~~~~~~~~~i~~Lav~Dll~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~s~~~l~~is~dRy~aI~~P 84 (214)
.+++++|+++|+|+.|||++|++.++. +|+..... .++|.+|+..|++.+++.....++|++++++||+|||+||++|
T Consensus 60 ~~~~~~rt~tn~fi~NLAvaDLL~~l~-lp~~~~~~-~~~w~fG~~lCk~~~~~~~~~~~~Si~~L~~ISiDRY~aIv~p 137 (338)
T PHA03234 60 YHKVAKHNATSFYLITLFASDFLHMLC-VFFLTLNR-EALFNFNQAFCQCVLFIYHASCSYSICMLAIIATIRYKTLHRR 137 (338)
T ss_pred HhccccccHHHHHHHHHHHHHHHHHHH-HHHHHHHH-hCCccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhHeeeech
Confidence 566677999999999999999999854 56555433 4579999999999999999999999999999999999999999
Q ss_pred CCcCccccccchhhhhHHHHHHHHHHhhhhhhhccccccccCccceeeecCCCCCCchhHH------HHHHHHHHHHHHH
Q psy6126 85 TLYPKLYSSKFLAFMVACTWLFGFGALVPTWLGVWGRFGLEPSIGSCSILPDDYGHSPKEF------LFLVAFVIPCISI 158 (214)
Q Consensus 85 ~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~~------~~~~~~~~p~~~~ 158 (214)
.+++. ++++....+++.|+.+++.++|.++.... ...+.....|...++.. ..+..+ ..++.+++|+++|
T Consensus 138 ~~~~~--~~~~~~~~i~~~Wi~s~l~~~P~l~~~~~-~~~~~~~~~C~~~~~~~-~~~~~~~~~~~~~~~~~f~iPl~im 213 (338)
T PHA03234 138 KKNDK--KNNHIGRNIGILFLASAMCAIPAALFVKT-EGKKGNYGKCNIHISSK-KAYDLFIAIKIVFCFIWGIFPTMIF 213 (338)
T ss_pred hhhhh--hhhhHHHHHHHHHHHHHHHHhhHhHeeee-eecCCCCCcCcccCCcc-hhhHHHHHHHHHHHHHHHHHHHHHH
Confidence 87643 33444455666799999999998754221 12222224686543321 222221 1233468999999
Q ss_pred HHHHHHHHHHHHhh
Q psy6126 159 VVCYARIFYIVRKT 172 (214)
Q Consensus 159 ~~~y~~i~~~l~~~ 172 (214)
++||.+|.+.+++.
T Consensus 214 ~~cY~~I~~~L~~~ 227 (338)
T PHA03234 214 SFFYVIFCKALHAL 227 (338)
T ss_pred HHHHHHHHHHHHhh
Confidence 99999999998763
No 5
>PHA02834 chemokine receptor-like protein; Provisional
Probab=99.93 E-value=2.7e-25 Score=172.55 Aligned_cols=164 Identities=20% Similarity=0.367 Sum_probs=123.1
Q ss_pred ceeccCCCCCchhhHHHHHHHHHHHHHHhhhhhhhhhhhcccccccccccchhhHHHHHHHHHHHHHHHHHHHHhheeee
Q psy6126 3 PFRRPSTVRNATAVFIINLSVSDLMFCCFNLPLAASTFWQRAWTHGHLLCQLFPLLRYGLLAVSLFTVLGITINRYVMIG 82 (214)
Q Consensus 3 ~v~~~~~~~~~~~~~i~~Lav~Dll~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~s~~~l~~is~dRy~aI~ 82 (214)
++.++|+ +++.|+|+.|||++|++.+ +.+|+.+.... ++|.+|+..|++.+++...+..+|++++++||+|||.+|+
T Consensus 52 vi~~~~~-~~~~n~~i~nLAiaDll~~-~~lP~~i~~~~-~~w~~g~~~C~~~~~~~~~~~~~Si~tL~~IsidRY~aI~ 128 (323)
T PHA02834 52 VLIVKRF-MFVVDVYLFNIAMSDLMLV-FSFPFIIHNDL-NEWIFGEFMCKLVLGVYFVGFFSNMFFVTLISIDRYILVV 128 (323)
T ss_pred HHHhccc-cchhhhhhHHHHHHHHHHH-HHHHHHHHHHc-CCcCCcchHHHhHHHHHHHHHHHHHHHHHHHHHHHhhhee
Confidence 3444444 4678999999999999874 57888765543 4699999999999999999999999999999999999999
Q ss_pred ccCCcCccccccchhhhhHHHHHHHHHHhhhhhhhccccccccCccceeeecCCCCCCchhH----HHHHHHHHHHHHHH
Q psy6126 83 HPTLYPKLYSSKFLAFMVACTWLFGFGALVPTWLGVWGRFGLEPSIGSCSILPDDYGHSPKE----FLFLVAFVIPCISI 158 (214)
Q Consensus 83 ~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~----~~~~~~~~~p~~~~ 158 (214)
+|++++. .+.+++...++.+|+.+++.++|+.... .. ........|.......+..... ...+..+++|++++
T Consensus 129 ~P~~~~~-~~~~~~~~~i~~iWi~s~l~~~P~~~~~-~~-~~~~~~~~C~~~~~~~~~~~~~~~~~~~~i~~f~iPl~ii 205 (323)
T PHA02834 129 NATKIKN-KSISLSVLLSVAAWVCSVILSMPAMVLY-YV-DNTDNLKQCIFNDYHENFSWSAFFNFEINIFGIVIPLIIL 205 (323)
T ss_pred CchhccC-CccchHHHHHHHHHHHHHHHHhhHHHHH-Hh-ccCCCceEEeccCCccchHHHHHHHHHHHHHHHHHHHHHH
Confidence 9998654 3556777788899999999999987642 11 1222334686432222211111 22466789999999
Q ss_pred HHHHHHHHHHHHhh
Q psy6126 159 VVCYARIFYIVRKT 172 (214)
Q Consensus 159 ~~~y~~i~~~l~~~ 172 (214)
+++|.+|++.+|++
T Consensus 206 ~~~Y~~I~~~l~~~ 219 (323)
T PHA02834 206 IYCYSKILYTLKNC 219 (323)
T ss_pred HHHHHHHHHHHHhc
Confidence 99999999998775
No 6
>PHA02638 CC chemokine receptor-like protein; Provisional
Probab=99.93 E-value=4.9e-25 Score=175.79 Aligned_cols=167 Identities=19% Similarity=0.377 Sum_probs=129.7
Q ss_pred eccCCCCCchhhHHHHHHHHHHHHHHhhhhhhhhhhhcccccccccccchhhHHHHHHHHHHHHHHHHHHHHhheeeecc
Q psy6126 5 RRPSTVRNATAVFIINLSVSDLMFCCFNLPLAASTFWQRAWTHGHLLCQLFPLLRYGLLAVSLFTVLGITINRYVMIGHP 84 (214)
Q Consensus 5 ~~~~~~~~~~~~~i~~Lav~Dll~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~s~~~l~~is~dRy~aI~~P 84 (214)
.++|++|+++|+++.|||++|++.+ +.+|+.+... .++|.+|+..|++..++.....++++++++++++|||+||++|
T Consensus 123 l~~k~lrt~t~i~llnLAisDLl~~-l~lPf~i~~~-~~~W~fg~~~Ck~~~~l~~~~~~~Si~~L~~isiDRYlaIv~p 200 (417)
T PHA02638 123 LFCKKIKTITDIYIFNLAISDLIFV-IDFPFIIYNE-FDQWIFGDFMCKVISASYYIGFFSNMFLITLMSIDRYFAILYP 200 (417)
T ss_pred HHhccCCCHhHHHHHHHHHHHHHHH-HHHHHHHHHH-hccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc
Confidence 3457889999999999999999886 5688777654 4679999999999999999999999999999999999999999
Q ss_pred CCcCccccccchhhhhHHHHHHHHHHhhhhhhhccccc------ccc--CccceeeecCCCCC--Cch-----hHHHHHH
Q psy6126 85 TLYPKLYSSKFLAFMVACTWLFGFGALVPTWLGVWGRF------GLE--PSIGSCSILPDDYG--HSP-----KEFLFLV 149 (214)
Q Consensus 85 ~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~------~~~--~~~~~C~~~~~~~~--~~~-----~~~~~~~ 149 (214)
.++++..+++.....++.+|+++++.++|+++...... +.+ .....|....+... ..+ .....++
T Consensus 201 ~~~~~~~~~~~~~i~~~~iW~~s~l~slP~~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~i~ 280 (417)
T PHA02638 201 ISFQKYRTFNIGIILCIISWILSLIITSPAYFIFEASNIIFSAQDSNETISNYQCTLIEDNEKNNISFLGRILQFEINIL 280 (417)
T ss_pred ccccceecHhhhHhhHhHHHHHHHHHHHHHHHHhhccccccccccCCCCccCCeeeeeccccchhHHHHHHHHHHHHHHH
Confidence 99888878888888899999999999999876422110 111 12347864322211 111 1122456
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhh
Q psy6126 150 AFVIPCISIVVCYARIFYIVRKTA 173 (214)
Q Consensus 150 ~~~~p~~~~~~~y~~i~~~l~~~~ 173 (214)
.+++|++++++||.+|++.+++.+
T Consensus 281 ~f~lPl~vmi~cY~~I~~~L~~~~ 304 (417)
T PHA02638 281 GMFIPIIIFAFCYIKIILKLKQLK 304 (417)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhh
Confidence 789999999999999999997743
No 7
>PHA03235 DNA packaging protein UL33; Provisional
Probab=99.91 E-value=1.3e-23 Score=167.00 Aligned_cols=160 Identities=19% Similarity=0.234 Sum_probs=117.9
Q ss_pred CCchhhHHHHHHHHHHHHHHhhhhhhhhhhhcccccccccccchhhHHHHHHHHHHHHHHHHHHHHhheeeeccCCcCcc
Q psy6126 11 RNATAVFIINLSVSDLMFCCFNLPLAASTFWQRAWTHGHLLCQLFPLLRYGLLAVSLFTVLGITINRYVMIGHPTLYPKL 90 (214)
Q Consensus 11 ~~~~~~~i~~Lav~Dll~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~s~~~l~~is~dRy~aI~~P~~~~~~ 90 (214)
++..++|+.|||++|++.. +.+|+.+... ...|..|+..|++..++...+..+|++++++||+|||.+|++|.++..
T Consensus 66 ~~~~~~~I~NLAvsDLl~l-~~lP~~i~~~-~~~~~~g~~~Ck~~~~l~~~~~~~Si~tL~~ISiDRY~aI~~p~~~~~- 142 (409)
T PHA03235 66 FSTPTLYMTNLYLANLLTV-FVLPFIMLSN-QGLLSGSVAGCKFASLLYYASCTVGFATVALIAADRYRVIHQRTRARS- 142 (409)
T ss_pred CCccHHHHHHHHHHHHHHH-HHHHHHHHhc-CccccCCCCeehhHHHHHHHHHHHHHHHHHHHHHHHHHHeechhhccC-
Confidence 4567899999999999874 5688776432 122334568999999999999999999999999999999999986544
Q ss_pred ccccchhhhhHHHHHHHHHHhhhhhhhcccc----ccc-cCccceeeecCCCCCC-----chhHHHHHHHHHHHHHHHHH
Q psy6126 91 YSSKFLAFMVACTWLFGFGALVPTWLGVWGR----FGL-EPSIGSCSILPDDYGH-----SPKEFLFLVAFVIPCISIVV 160 (214)
Q Consensus 91 ~t~~~~~~~i~~~w~~~~~~~~~~~~~~~~~----~~~-~~~~~~C~~~~~~~~~-----~~~~~~~~~~~~~p~~~~~~ 160 (214)
.+++++..+++++|+.+++.++|+.+..+.. ..+ ......|......... .+.....++.+++|+++|++
T Consensus 143 ~~~~~a~~ii~~iWi~sll~s~P~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~y~i~l~i~~f~iPl~im~~ 222 (409)
T PHA03235 143 SAYRSTYKILGLTWFASLICSGPAPVYTTVVAHDDVDPEAPGYETCVIYFRADQVKTVLSTFKVLLTLVWGIAPVVMMTW 222 (409)
T ss_pred cccchhhhhHHHHHHHHHHHHHHHHHHHhhhhccccCcCCCCcceeeEeCCchhHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4688899999999999999999965432211 111 1223468653322111 12233456678999999999
Q ss_pred HHHHHHHHHHhhh
Q psy6126 161 CYARIFYIVRKTA 173 (214)
Q Consensus 161 ~y~~i~~~l~~~~ 173 (214)
+|.+|++.++++.
T Consensus 223 ~Y~~I~~~l~~~~ 235 (409)
T PHA03235 223 FYTFFYRTLKRAS 235 (409)
T ss_pred HHHHHHHHHHhhh
Confidence 9999999998754
No 8
>PHA03087 G protein-coupled chemokine receptor-like protein; Provisional
Probab=99.91 E-value=2.3e-23 Score=162.83 Aligned_cols=167 Identities=19% Similarity=0.413 Sum_probs=132.2
Q ss_pred ceeccCCCCCchhhHHHHHHHHHHHHHHhhhhhhhhhhhcccccccccccchhhHHHHHHHHHHHHHHHHHHHHhheeee
Q psy6126 3 PFRRPSTVRNATAVFIINLSVSDLMFCCFNLPLAASTFWQRAWTHGHLLCQLFPLLRYGLLAVSLFTVLGITINRYVMIG 82 (214)
Q Consensus 3 ~v~~~~~~~~~~~~~i~~Lav~Dll~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~s~~~l~~is~dRy~aI~ 82 (214)
++.++ ++|++.++++.|||++|++.++ ..|........++|..++..|++..++......+|++++++||+|||++|+
T Consensus 64 ~~~~~-~~~~~~~~ll~~laisDll~~~-~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~S~~~l~~iaidRy~aI~ 141 (335)
T PHA03087 64 VLTKT-KIKTPMDIYLLNLAVSDLLFVM-TLPFQIYYYILFQWSFGEFACKIVSGLYYIGFYNSMNFITVMSVDRYIAIV 141 (335)
T ss_pred eehhc-cccCchHHHHHHHHHHHHHHHH-hHHHHHHHHhCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 44555 8899999999999999998875 466666555567788999999999999999999999999999999999999
Q ss_pred ccCCcCccccccchhhhhHHHHHHHHHHhhhhhhhccccccccCccceeeecCCCCCCchhHH----HHHHHHHHHHHHH
Q psy6126 83 HPTLYPKLYSSKFLAFMVACTWLFGFGALVPTWLGVWGRFGLEPSIGSCSILPDDYGHSPKEF----LFLVAFVIPCISI 158 (214)
Q Consensus 83 ~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~~----~~~~~~~~p~~~~ 158 (214)
+|++|++..+++++..+++.+|+++++.++|++..... ..+.....|....+.....+..+ ..++.+++|.+++
T Consensus 142 ~p~~~~~~~~~~~~~~~~~~iWl~~~~~~~p~~~~~~~--~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~lP~~ii 219 (335)
T PHA03087 142 HPVKSNKINTVKYGYIVSLVIWIISIIETTPILFVYTT--KKDHETLICCMFYNNKTMNWKLFINFEINIIGMLIPLTIL 219 (335)
T ss_pred ccCccccccchhhhHHHHHHHHHHHHHHhccHhheeee--eccCCCceEEecCCCcchhHHHHHHHHHHHHHHHHHHHHH
Confidence 99999999999999999999999999999987764222 22333446755443222222221 2455689999999
Q ss_pred HHHHHHHHHHHHhhh
Q psy6126 159 VVCYARIFYIVRKTA 173 (214)
Q Consensus 159 ~~~y~~i~~~l~~~~ 173 (214)
+++|.+|.+.+++++
T Consensus 220 ~~~y~~i~~~l~~~~ 234 (335)
T PHA03087 220 LYCYSKILITLKGIN 234 (335)
T ss_pred HHHHHHHHHHHHhcc
Confidence 999999998887754
No 9
>KOG2087|consensus
Probab=99.69 E-value=4.4e-16 Score=117.74 Aligned_cols=165 Identities=20% Similarity=0.259 Sum_probs=127.5
Q ss_pred eccCCCCCchhhHHHHHHHHHHHHHHhhhhhhhhhhhc-c-------cccccccccchhhHHHHHHHHHHHHHHHHHHHH
Q psy6126 5 RRPSTVRNATAVFIINLSVSDLMFCCFNLPLAASTFWQ-R-------AWTHGHLLCQLFPLLRYGLLAVSLFTVLGITIN 76 (214)
Q Consensus 5 ~~~~~~~~~~~~~i~~Lav~Dll~~~~~~~~~~~~~~~-~-------~~~~~~~~C~~~~~~~~~~~~~s~~~l~~is~d 76 (214)
...+...++..+++.|||++|+++++........+... + .|. +...|++.+|+..+....|+++++.+++|
T Consensus 50 ~~~~~~~~~~~~li~~la~ad~~mGiYl~~ia~vD~~~~gey~~~ai~W~-tg~gC~~aGflavFASElSv~~LT~itlE 128 (363)
T KOG2087|consen 50 FTSRYELNSHRFLICNLAFADLLMGIYLGLIASVDAKTRGEYYKHAIDWQ-TGLGCPVAGFLAVFASELSVFLLTLITLE 128 (363)
T ss_pred eehhhhccchHHHHHHHHHHHHHcchHHHHHHHhhHHHHHHHHHHHHhhh-hcCCCchHHHHHHHHHHHHHHHHHHHHHH
Confidence 34455567889999999999999997655555444321 1 244 44789999999999999999999999999
Q ss_pred hheeeeccCCcCccccccchhhhhHHHHHHHHHHhhhhhhhccccccccCccceeeecCCC-CCCc-hhHHHHHHHHHHH
Q psy6126 77 RYVMIGHPTLYPKLYSSKFLAFMVACTWLFGFGALVPTWLGVWGRFGLEPSIGSCSILPDD-YGHS-PKEFLFLVAFVIP 154 (214)
Q Consensus 77 Ry~aI~~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~-~~~~-~~~~~~~~~~~~p 154 (214)
|+++|++|++-.+....+....+.+..|+.+++.++.|+++ ...+ .....|...+.. .... ++....+....+.
T Consensus 129 r~l~i~~p~~~~~~~~lr~~~~ill~~wl~~~l~A~~Pl~g-~s~Y---~~~~vClPL~~~~~~s~g~y~~~~l~~N~la 204 (363)
T KOG2087|consen 129 RWLSITYPFRLDRKAKLRPLVLILLLGWLFAFLMALLPLFG-ISSY---GASSVCLPLHIEEPLSTGYYLVALLGLNLLA 204 (363)
T ss_pred HHhheeccccCCCcccccHHHHHHHHHHHHHHHHHhccccC-CCCC---cccceeeecccCCccchhHHHHHHHHHHHHH
Confidence 99999999998887666779999999999999999988876 2322 223578653322 2222 5666778889999
Q ss_pred HHHHHHHHHHHHHHHHhhhh
Q psy6126 155 CISIVVCYARIFYIVRKTAM 174 (214)
Q Consensus 155 ~~~~~~~y~~i~~~l~~~~~ 174 (214)
++++..+|++++..+++...
T Consensus 205 fiiia~~Y~~iy~~l~~~~~ 224 (363)
T KOG2087|consen 205 FIIIAFSYGKIYCSLRKGDL 224 (363)
T ss_pred HHHHHHHhhhhheeeecCCC
Confidence 99999999999999888543
No 10
>PF10320 7TM_GPCR_Srsx: Serpentine type 7TM GPCR chemoreceptor Srsx; InterPro: IPR019424 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class sx (Srsx), which is a solo family amongst the superfamilies of chemoreceptors. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' [].
Probab=99.67 E-value=1.1e-15 Score=115.00 Aligned_cols=165 Identities=16% Similarity=0.210 Sum_probs=117.4
Q ss_pred CcceeccCCCCCchhhHHHHHHHHHHHHHHhhhhhhhhhhhcccccccccccchhhHHHHHHHHHHHHHHHHHHHHhhee
Q psy6126 1 MSPFRRPSTVRNATAVFIINLSVSDLMFCCFNLPLAASTFWQRAWTHGHLLCQLFPLLRYGLLAVSLFTVLGITINRYVM 80 (214)
Q Consensus 1 i~~v~~~~~~~~~~~~~i~~Lav~Dll~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~s~~~l~~is~dRy~a 80 (214)
|.++.++|++|++.++++..+|++|++.++..+|.....+. +.......|-...+...++..+....+++|++||++|
T Consensus 12 i~~~~~~~~Lrs~~~~li~~~~~~d~~~~~~~~~~~~~~~~--~~~i~~~~Cf~~~~~~~f~~~~qs~~~l~i~iDr~ia 89 (257)
T PF10320_consen 12 IILIFRNKSLRSPCYILICILCFADLICLLGTLPFMLFLFR--DHQITRSECFWQIFFYIFFQCAQSVIMLAIAIDRLIA 89 (257)
T ss_pred HHHHHhccccccchHHHHHHHHHHHHHHHhhHHHHHHHHHh--heeccHHHHHHHHHHHHHHHHHHHHHHHHHHhhheee
Confidence 35678999999999999999999999999887777774432 2345567899999999999999999999999999999
Q ss_pred eeccCCcCccccccchhhhhHHHHHHHHHHhhhhhhhccccccccCccceeeecCCCCCCchhHHHHHHHHHHHHHHHHH
Q psy6126 81 IGHPTLYPKLYSSKFLAFMVACTWLFGFGALVPTWLGVWGRFGLEPSIGSCSILPDDYGHSPKEFLFLVAFVIPCISIVV 160 (214)
Q Consensus 81 I~~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~p~~~~~~ 160 (214)
|+.|.+|++..+.+..........+.+.......+.. ..+.....|..+....+..+..+.... . +..+++++
T Consensus 90 V~~P~~Y~~~~~~~y~~~~~~~~~~~s~~~~~~~~~~-----~~~~~~~~C~pp~a~~~~~~~~~~~~~-~-~inv~tvi 162 (257)
T PF10320_consen 90 VCFPLRYRTISTRKYLIILLIFPVIYSIFFTVIGFLY-----RDDETIVICNPPLAFHGTASQIWSYSN-I-IINVITVI 162 (257)
T ss_pred EeehhhhhhcccccchhhHhHHHHHHHHHHHhheeEe-----cCCcccccCCCccccCccHHHHHHHHH-H-HHHHHHHH
Confidence 9999999999888866665555555555544332221 122245689876555555554444332 2 22233455
Q ss_pred HHHHHHHHHHhhhh
Q psy6126 161 CYARIFYIVRKTAM 174 (214)
Q Consensus 161 ~y~~i~~~l~~~~~ 174 (214)
.|......++++.+
T Consensus 163 vY~i~~~~~~~k~~ 176 (257)
T PF10320_consen 163 VYIITIIIFKRKSR 176 (257)
T ss_pred HHHHHHHHHHHHcc
Confidence 67766666666554
No 11
>PF10328 7TM_GPCR_Srx: Serpentine type 7TM GPCR chemoreceptor Srx; InterPro: IPR019430 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class x (Srx) from the Srg superfamily [, ]. Srg receptors contain seven hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures.
Probab=99.65 E-value=2.3e-15 Score=114.51 Aligned_cols=174 Identities=17% Similarity=0.142 Sum_probs=123.5
Q ss_pred ceeccCCCCCchhhHHHHHHHHHHHHHHhhhhhhhhhhhccccccccc-ccchhhHHHHHHHHHHHHHHHHHHHHhheee
Q psy6126 3 PFRRPSTVRNATAVFIINLSVSDLMFCCFNLPLAASTFWQRAWTHGHL-LCQLFPLLRYGLLAVSLFTVLGITINRYVMI 81 (214)
Q Consensus 3 ~v~~~~~~~~~~~~~i~~Lav~Dll~~~~~~~~~~~~~~~~~~~~~~~-~C~~~~~~~~~~~~~s~~~l~~is~dRy~aI 81 (214)
.+.+.+++|++.+++..+.|++|.+.++..+-........+.....+. .-.+.+.....+...+.++-++||+||+.||
T Consensus 17 ~~~~~~~~~~sF~~l~~~~a~~n~i~~~~~l~~~~P~~~~~~~~~~~~~~s~~~g~~~~~~y~~~~~~~~liaiNRf~ai 96 (274)
T PF10328_consen 17 IIFKLKSLRNSFGILCASQAIANIIICLIFLFWVVPMTLLDISFLPSSLNSIIFGFIGMFCYFIGPLSHLLIAINRFCAI 96 (274)
T ss_pred HHHhcccccCCHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhee
Confidence 456778999999999999999999999754433333222222222332 3445578888888899999999999999999
Q ss_pred eccCCcCccccccchhhhhHHHHHHHHHHhhhhhhhccccccccCccceeeecCCCCCCchhHHHHHHHHHHHHHHHHHH
Q psy6126 82 GHPTLYPKLYSSKFLAFMVACTWLFGFGALVPTWLGVWGRFGLEPSIGSCSILPDDYGHSPKEFLFLVAFVIPCISIVVC 161 (214)
Q Consensus 82 ~~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~ 161 (214)
..|.+|++..+.+.+..+++++|++++....+..........++.+...-.+..++.-..+..+........-.+++.++
T Consensus 97 ~fP~~y~~~fs~~~T~~~i~~~~~~~~~~~~~~~~~~~C~~~y~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~l 176 (274)
T PF10328_consen 97 FFPFKYKKIFSFKNTIILIAFIWLLSIIISTILYFPDGCYFYYDPETWSWSYPTDPPCGNYSWYFDFYKNFILVIISNIL 176 (274)
T ss_pred eeHHHHHhHcCccceehhhhHHHHHHHHHHHHhhhcCCCcceeccceeeeecCCCCccchhhhHHHHHHHHHHHHHHHHH
Confidence 99999999999999999999999999977766555434555555554443332222223334444444455556667778
Q ss_pred HHHHHHHHHhhhhhh
Q psy6126 162 YARIFYIVRKTAMKS 176 (214)
Q Consensus 162 y~~i~~~l~~~~~~~ 176 (214)
+..++.++++.+++.
T Consensus 177 ni~t~ikl~~~~~~~ 191 (274)
T PF10328_consen 177 NIITFIKLRKFRKKI 191 (274)
T ss_pred HHHHHHHHHHhhhhc
Confidence 999988888876665
No 12
>PF10324 7TM_GPCR_Srw: Serpentine type 7TM GPCR chemoreceptor Srw; InterPro: IPR019427 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class w (Srw), which is a solo family amongst the superfamilies of chemoreceptors. The genes encoding Srw do not appear to be under as strong an adaptive evolutionary pressure as those of Srz [].
Probab=99.37 E-value=8.7e-12 Score=96.97 Aligned_cols=178 Identities=16% Similarity=0.175 Sum_probs=121.1
Q ss_pred ceeccCCCCC-chhhHHHHHHHHHHHHHHhhhhhhhhhhhcc---c--ccc----cccccchhhHHHHHHHHHHHHHHHH
Q psy6126 3 PFRRPSTVRN-ATAVFIINLSVSDLMFCCFNLPLAASTFWQR---A--WTH----GHLLCQLFPLLRYGLLAVSLFTVLG 72 (214)
Q Consensus 3 ~v~~~~~~~~-~~~~~i~~Lav~Dll~~~~~~~~~~~~~~~~---~--~~~----~~~~C~~~~~~~~~~~~~s~~~l~~ 72 (214)
.+..+|.+|+ +.|.+++.+|++|++..+..++......... + ... ......+...+...+...|.|..++
T Consensus 19 ~VLtrk~mR~~~in~~l~~Iai~Dl~~~~~~~~~~~~~~~~~~~~~~C~~~~sy~~~~~~~~~~~l~~~~~~~S~WL~V~ 98 (318)
T PF10324_consen 19 IVLTRKSMRSSSINILLIGIAICDLLYMLSILIWELFFFIIFYEDDECIPPYSYFWAYFKLIMESLSDIFRRISIWLGVL 98 (318)
T ss_pred hhcCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4667788886 8999999999999999988777776544322 1 111 1112334557788888999999999
Q ss_pred HHHHhheeeeccCCc--CccccccchhhhhHHHHHHHHHHhhhhhhhccccccc--cCccceeeecC-CCCCCch-----
Q psy6126 73 ITINRYVMIGHPTLY--PKLYSSKFLAFMVACTWLFGFGALVPTWLGVWGRFGL--EPSIGSCSILP-DDYGHSP----- 142 (214)
Q Consensus 73 is~dRy~aI~~P~~~--~~~~t~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~~~--~~~~~~C~~~~-~~~~~~~----- 142 (214)
||+.|++++.+|+.. ++..+++.+..++..+++.+.+...+.++........ -.....|...+ ......|
T Consensus 99 mA~iR~l~i~~p~~~~~~~l~~~k~~~~~i~~v~~~s~~~~~~~~~~~~i~~~~~~~~p~~~C~~~~~~~~~~~Y~~~~~ 178 (318)
T PF10324_consen 99 MALIRYLSIKFPMSSRFQKLSKPKFAIIVILIVFIISFLFSIPYFFRYKIVEVSDPWVPPPNCSGFPENYTFPRYMLNIS 178 (318)
T ss_pred HHHHHHHHHHhhccccccccCCCCeeeeehHHHHHHHHHHHHHhhceEEEEeccccccCCCceeeccccccccccchhhh
Confidence 999999999999865 4566788888888888999988888876542222111 01223575211 1111111
Q ss_pred -------------hHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhh
Q psy6126 143 -------------KEFL-FLVAFVIPCISIVVCYARIFYIVRKTAMKSRAMN 180 (214)
Q Consensus 143 -------------~~~~-~~~~~~~p~~~~~~~y~~i~~~l~~~~~~~~~~~ 180 (214)
..+. .++.-++|.++..++.+.+.+.+|+.++++++..
T Consensus 179 ~~~~~~~~~~~~~~~~~~gi~~kiiP~il~~ilti~Li~~Lrk~~~~r~~~~ 230 (318)
T PF10324_consen 179 ELFTENDCLFFRIYFFIDGIFFKIIPCILLPILTILLIIELRKAKKRRKKLS 230 (318)
T ss_pred hhhhhhHHHHHHHHHHhhhhHhhhhhHHHHHHHHHHHHHHHHhccHhhhccc
Confidence 0111 2333689999999999999999999877665554
No 13
>PF10323 7TM_GPCR_Srv: Serpentine type 7TM GPCR chemoreceptor Srv; InterPro: IPR019426 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class v (Srv) from the Srg superfamily [, ]. Srg receptors contain seven hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures.
Probab=99.35 E-value=5.3e-12 Score=96.27 Aligned_cols=162 Identities=15% Similarity=0.314 Sum_probs=112.1
Q ss_pred CCCchhhHHHHHHHHHHHHHHhhhhhhhhhhh---cc-cccccccccchhhHHHHHHHHHHHHHHHHHHHHhheeeeccC
Q psy6126 10 VRNATAVFIINLSVSDLMFCCFNLPLAASTFW---QR-AWTHGHLLCQLFPLLRYGLLAVSLFTVLGITINRYVMIGHPT 85 (214)
Q Consensus 10 ~~~~~~~~i~~Lav~Dll~~~~~~~~~~~~~~---~~-~~~~~~~~C~~~~~~~~~~~~~s~~~l~~is~dRy~aI~~P~ 85 (214)
.+++.+-++.+.+++|++..+........... .+ ....++..-++.....+..........+.+|++||.||++|.
T Consensus 29 ~~~~Fy~l~~~~~iaDi~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~y~~~~~~~~gi~lls~nR~~ai~~P~ 108 (283)
T PF10323_consen 29 FKSTFYTLLIQHCIADILSMLFYFLLQRFRAYGFFNDFYFSYQTYYAAIYYNIAYYFLYIQCIGIVLLSLNRYLAICFPT 108 (283)
T ss_pred cCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHhcchhHHHHHHHHHHHHHHHHHhHHHHHHhhhheEEeec
Confidence 56899999999999999987654433332221 11 122333345666777778888889999999999999999999
Q ss_pred -CcCccccccchhhhhHHHHHHHHHHhhhhhhhccccccccCccceeeecCCCCCCchhHHHHHHHHHHHHHHHHHHHHH
Q psy6126 86 -LYPKLYSSKFLAFMVACTWLFGFGALVPTWLGVWGRFGLEPSIGSCSILPDDYGHSPKEFLFLVAFVIPCISIVVCYAR 164 (214)
Q Consensus 86 -~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~y~~ 164 (214)
++.+..++.+...++++-|+.+.+.++|.+.. .+..++.....-.. .+................+..+..+++|+.
T Consensus 109 ~~~~~~~~~~~~~~i~~i~wi~p~li~~~~~~~--~~~~f~~~~~~~~~-~d~~~~~~~~~~~~~~~~~~cv~~iv~Y~~ 185 (283)
T PF10323_consen 109 SRHTKFWQPAKIWIIILIQWIPPLLISLPFFFD--TDFYFDNEENMSLF-VDPEFIQRNFLIAFIFVSVTCVICIVCYGI 185 (283)
T ss_pred HHHhhhccccchhheeeeeehhhhhheeeeecc--Cceeeecccceeee-cCHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 88888899999999999999999999996553 33333333221111 111112222333445556667778899999
Q ss_pred HHHHHHhhhh
Q psy6126 165 IFYIVRKTAM 174 (214)
Q Consensus 165 i~~~l~~~~~ 174 (214)
++..+||+.+
T Consensus 186 i~~~iRk~~k 195 (283)
T PF10323_consen 186 IFIFIRKRNK 195 (283)
T ss_pred HHHHHHHhhh
Confidence 9999998764
No 14
>PF11710 Git3: G protein-coupled glucose receptor regulating Gpa2; InterPro: IPR023041 This entry contains a functionally uncharacterised region belonging to the Git3 G-protein coupled receptor. Git3 is one of six proteins required for glucose-triggered adenylate cyclase activation, and is a G protein-coupled receptor responsible for the activation of adenylate cyclase through Gpa2 - heterotrimeric G protein alpha subunit, part of the glucose-detection pathway. Git3 contains seven predicted transmembrane domains, a third cytoplasmic loop and a cytoplasmic tail []. This is the conserved N-terminal domain of the member proteins.
Probab=99.33 E-value=6.4e-11 Score=85.47 Aligned_cols=162 Identities=17% Similarity=0.174 Sum_probs=107.9
Q ss_pred CCCCCchhhHHHHHHHHHHHHHHhhhhhhhhhhhcccccccccccchhhHHHHHHHHHHHHHHHHHHHHhheeeeccCCc
Q psy6126 8 STVRNATAVFIINLSVSDLMFCCFNLPLAASTFWQRAWTHGHLLCQLFPLLRYGLLAVSLFTVLGITINRYVMIGHPTLY 87 (214)
Q Consensus 8 ~~~~~~~~~~i~~Lav~Dll~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~s~~~l~~is~dRy~aI~~P~~~ 87 (214)
++++.-.+-++.||.++|++.++..+.........++...++..|..++++.+....++.+..++||++-|+.|.+|-..
T Consensus 30 ~r~~~fR~~LIl~L~~aD~~qal~~~i~~~~~l~~~~i~~~s~~C~aqGf~~q~g~~~sd~~ilaIAihT~l~v~~~~~~ 109 (201)
T PF11710_consen 30 YRRRSFRHQLILNLLLADFIQALAFLISPIRWLARGGIIAPSPFCQAQGFFLQVGDEASDLWILAIAIHTFLIVFRPNWK 109 (201)
T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeCCCCchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhcCccc
Confidence 44556667799999999999997655545555545555567799999999999999999999999999999999999211
Q ss_pred ----Ccc-----ccccchhhhhHHHHHHHHHHhhhhhhhccccccccCccceeeecCCCCCCchhHHHHHHHHHHHHHHH
Q psy6126 88 ----PKL-----YSSKFLAFMVACTWLFGFGALVPTWLGVWGRFGLEPSIGSCSILPDDYGHSPKEFLFLVAFVIPCISI 158 (214)
Q Consensus 88 ----~~~-----~t~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~p~~~~ 158 (214)
+.. ..+..+..+..+.|++ .+...+.. .....+.+....|..+++.. .+..++.-+-.++-.+.+
T Consensus 110 ~~~~~~~~~gl~~~~~~v~~~~~~~~~~---~~~la~i~-~~~~~Y~~~g~WCWi~~~~~--~~Rl~l~y~~~~~~~~~~ 183 (201)
T PF11710_consen 110 RKRSKNVEGGLYPYRYWVWVIWILVPLL---LASLAFIG-LGGPGYGPAGAWCWIPSRYE--WYRLWLHYIWRFIIIFAI 183 (201)
T ss_pred cccccccccceEEeeeeeehHHHHHHHH---HHHHHHhc-cccCcccccCcEEEECCCcH--HHHHHHHHHHHHHHHHHH
Confidence 110 0111122222233333 33333322 22334556677998865542 344445444555666778
Q ss_pred HHHHHHHHHHHHhhhhh
Q psy6126 159 VVCYARIFYIVRKTAMK 175 (214)
Q Consensus 159 ~~~y~~i~~~l~~~~~~ 175 (214)
++.|..|+..+|++.|+
T Consensus 184 i~iY~~if~~lrr~~~~ 200 (201)
T PF11710_consen 184 IIIYIAIFFYLRRRIRR 200 (201)
T ss_pred HHHHHHHHHHHhhhhcc
Confidence 88999999999997664
No 15
>PF03402 V1R: Vomeronasal organ pheromone receptor family, V1R; InterPro: IPR004072 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The rhodopsin-like GPCRs themselves represent a widespread protein family that includes hormone, neurotransmitter and light receptors, all of which transduce extracellular signals through interaction with guanine nucleotide-binding (G) proteins. Although their activating ligands vary widely in structure and character, the amino acid sequences of the receptors are very similar and are believed to adopt a common structural framework comprising 7 transmembrane (TM) helices [, , ]. Pheromones have evolved in all animal phyla, to signal sex and dominance status, and are responsible for stereotypical social and sexual behaviour among members of the same species. In mammals, these chemical signals are believed to be detected primarily by the vomeronasal organ (VNO), a chemosensory organ located at the base of the nasal septum []. The VNO is present in most amphibia, reptiles and non-primate mammals but is absent in birds, adult catarrhine monkeys and apes []. An active role for the human VNO in the detection of pheromones is disputed; the VNO is clearly present in the foetus but appears to be atrophied or absent in adults. Three distinct families of putative pheromone receptors have been identified in the vomeronasal organ (V1Rs, V2Rs and V3Rs). All are G protein-coupled receptors but are only distantly related to the receptors of the main olfactory system, highlighting their different role []. The V1 receptors share between 50 and 90% sequence identity but have little similarity to other families of G protein-coupled receptors. They appear to be distantly related to the mammalian T2R bitter taste receptors and the rhodopsin-like GPCRs []. In rat, the family comprises 30-40 genes. These are expressed in the apical regions of the VNO, in neurons expressing Gi2. Coupling of the receptors to this protein mediates inositol trisphosphate signalling []. A number of human V1 receptor homologues have also been found. The majority of these human sequences are pseudogenes [] but an apparently functional receptor has been identified that is expressed in the human olfactory system [].; GO: 0016503 pheromone receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane
Probab=98.87 E-value=1.7e-08 Score=75.48 Aligned_cols=172 Identities=12% Similarity=0.170 Sum_probs=112.2
Q ss_pred CCCCchhhHHHHHHHHHHHHHHh-hhhhhhhhhhcccccccccccchhhHHHHHHHHHHHHHHHHHHHHhheeeeccC--
Q psy6126 9 TVRNATAVFIINLSVSDLMFCCF-NLPLAASTFWQRAWTHGHLLCQLFPLLRYGLLAVSLFTVLGITINRYVMIGHPT-- 85 (214)
Q Consensus 9 ~~~~~~~~~i~~Lav~Dll~~~~-~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~s~~~l~~is~dRy~aI~~P~-- 85 (214)
+..+|.|..+.+||++|.++.+. .+|.... ....+ .+++..||+..|+..++.+.|+.+.+++++-.-+.|.-..
T Consensus 6 ~r~kp~dlIl~hLa~aN~lvLl~rGip~~~~-~~~~~-~~~d~gCK~v~Y~~RV~RglSictTCLLSvfQAItISP~~s~ 83 (265)
T PF03402_consen 6 HRLKPIDLILIHLALANILVLLSRGIPQTMA-FFGWK-FFDDIGCKIVFYIYRVARGLSICTTCLLSVFQAITISPSNSR 83 (265)
T ss_pred CCCCcHHHHHHHHHHHHHHHHHHhhHHHHHH-Hhhcc-cCCCceeeeeeeehHHhchhhHHhhhHHHHHhhEEEecCCch
Confidence 44589999999999999988865 4554332 22222 3588999999999999999999999999997777665432
Q ss_pred -CcCccccccchhhhhHHHHHHHHHHhhhhhhhc-cccccc----cCccceeeecCCCCCC-chhHHHHHHHHHHHHHHH
Q psy6126 86 -LYPKLYSSKFLAFMVACTWLFGFGALVPTWLGV-WGRFGL----EPSIGSCSILPDDYGH-SPKEFLFLVAFVIPCISI 158 (214)
Q Consensus 86 -~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~~~-~~~~~~----~~~~~~C~~~~~~~~~-~~~~~~~~~~~~~p~~~~ 158 (214)
..-+...++.....+...|++..++........ ...... ......|...+..... ........+--++.+.+|
T Consensus 84 w~~lK~k~pk~i~~~~~~~Wilnlli~~~~~~~~~~~~n~t~~~~~~~~~yCs~~~~~~~~~~~~~~~~~~~D~~fvgLM 163 (265)
T PF03402_consen 84 WAELKVKAPKYIGPSCLFCWILNLLININVLVYIIATPNSTNNNFTYSLGYCSSVPISYIVSSLFATLLSFRDVLFVGLM 163 (265)
T ss_pred hhhhhhhcCCccccHHHHHHHHHHhhhcceeEEEEecCccccCceeecCCeEEEEecchhhhhhhhhHHHhhHHHHHHHH
Confidence 112333556777788889999888765422211 111111 1123468654332221 112222233346667788
Q ss_pred HHHHHHHHHHHHhhhhhhhhhhhc
Q psy6126 159 VVCYARIFYIVRKTAMKSRAMNMK 182 (214)
Q Consensus 159 ~~~y~~i~~~l~~~~~~~~~~~~~ 182 (214)
+...+.+...+.||+++.+..++.
T Consensus 164 ~~aSgyMV~~L~RH~q~VqhIhs~ 187 (265)
T PF03402_consen 164 IWASGYMVFLLYRHHQQVQHIHSS 187 (265)
T ss_pred HhhhhhheeeeeecccccccCcCC
Confidence 889999999999998887665433
No 16
>PF05296 TAS2R: Mammalian taste receptor protein (TAS2R); InterPro: IPR007960 This family consists of several forms of mammalian taste receptor proteins (TAS2Rs). TAS2Rs are G protein-coupled receptors expressed in subsets of taste receptor cells of the tongue and palate epithelia and are organised in the genome in clusters. The proteins are genetically linked to loci that influence bitter perception in mice and humans [].; GO: 0004930 G-protein coupled receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0050909 sensory perception of taste, 0016021 integral to membrane
Probab=98.85 E-value=6.5e-08 Score=74.57 Aligned_cols=174 Identities=11% Similarity=0.082 Sum_probs=109.4
Q ss_pred cCCCCCchhhHHHHHHHHHHHHHHhhhhhhhhhhhcccccccccccchhhHHHHHHHHHHHHHHHHHHHHhheeeeccCC
Q psy6126 7 PSTVRNATAVFIINLSVSDLMFCCFNLPLAASTFWQRAWTHGHLLCQLFPLLRYGLLAVSLFTVLGITINRYVMIGHPTL 86 (214)
Q Consensus 7 ~~~~~~~~~~~i~~Lav~Dll~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~s~~~l~~is~dRy~aI~~P~~ 86 (214)
+++.-+|.|..+.+||+++++.............+..+....+..++...++..+....|.|..+++++-=++-|....+
T Consensus 37 k~~~l~~~d~IL~~La~sr~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~s~W~tt~LsvfYcvKI~~fs~ 116 (303)
T PF05296_consen 37 KSRKLSPSDQILTSLAISRILLQWVILLNSFLSFFFPNIYFSENVYKIIDFLWMFSNSSSLWFTTWLSVFYCVKIANFSH 116 (303)
T ss_pred cCCCCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHcchhhhhhhHHHHHHHHHHHHhHHHHHHHHHHHHHHheeeecCCc
Confidence 34444799999999999999887665555555555555555667899999999999999999999999999999988643
Q ss_pred cC----ccccccchhhhhHHHHHHHHH--HhhhhhhhccccccccCccceee--ecCCCCCCchhHHHHHHHHHHHHHHH
Q psy6126 87 YP----KLYSSKFLAFMVACTWLFGFG--ALVPTWLGVWGRFGLEPSIGSCS--ILPDDYGHSPKEFLFLVAFVIPCISI 158 (214)
Q Consensus 87 ~~----~~~t~~~~~~~i~~~w~~~~~--~~~~~~~~~~~~~~~~~~~~~C~--~~~~~~~~~~~~~~~~~~~~~p~~~~ 158 (214)
.- +..=.+-+..++...++.++. ...|.................+. ......+..+......+..++|++++
T Consensus 117 ~~Fl~LK~rI~~~v~~lLl~s~l~s~~~~~~~~~~~~~~~~~~~~~~~~N~t~~~~~~~~~~~~~~~~~~~~~~lPf~i~ 196 (303)
T PF05296_consen 117 PFFLWLKRRISKVVPWLLLGSLLISFLNLLSIPLFIDNHINNNNTNNSRNSTSNFQESKSSYFYFFILFNLGSFLPFLIF 196 (303)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhheeeeeccccccCCCcceEEeecchHHHHHHHHHHHHHHHHhHHHH
Confidence 21 111112233345566666663 33333221111100001111111 11111111122223347889999999
Q ss_pred HHHHHHHHHHHHhhhhhhhhhh
Q psy6126 159 VVCYARIFYIVRKTAMKSRAMN 180 (214)
Q Consensus 159 ~~~y~~i~~~l~~~~~~~~~~~ 180 (214)
+++...+...++||.|+.+...
T Consensus 197 l~s~~lli~SL~rH~r~M~~n~ 218 (303)
T PF05296_consen 197 LVSSILLIFSLWRHMRRMQKNA 218 (303)
T ss_pred HHHHHHHHHHHHHHHHHhhCCC
Confidence 9999999999999999877553
No 17
>PF05462 Dicty_CAR: Slime mold cyclic AMP receptor
Probab=98.65 E-value=9.2e-07 Score=67.86 Aligned_cols=123 Identities=20% Similarity=0.246 Sum_probs=90.3
Q ss_pred cceeccCCCCCchhhHHHHHHHHHHHHHHhhhhhhhhhhhcccccccccccchhhHHHHHHHHHHHHHHHHHHHHhheee
Q psy6126 2 SPFRRPSTVRNATAVFIINLSVSDLMFCCFNLPLAASTFWQRAWTHGHLLCQLFPLLRYGLLAVSLFTVLGITINRYVMI 81 (214)
Q Consensus 2 ~~v~~~~~~~~~~~~~i~~Lav~Dll~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~s~~~l~~is~dRy~aI 81 (214)
+.....|++|++.+-++.-++++|++..+..+..... .+...++..|.+++++.+.+..++.+-.++||++=|..+
T Consensus 29 ~tf~~~k~~r~~~~rli~yl~~~~ll~~v~~~~~~~~----~~~~~~s~lC~~Qafliq~f~~as~lWt~~iA~nly~~~ 104 (303)
T PF05462_consen 29 ITFCLFKRLRKPINRLIFYLSIANLLTNVASMIMTLS----PSAGENSFLCQFQAFLIQFFMLASFLWTLCIAFNLYLTF 104 (303)
T ss_pred HHHHHHHHhCccHHHHHHHHHHHHHHHHHHHHHHHhc----ccCCCCCcchhhHhHHHHHhhHHHHHHHHHHHHHhhhee
Confidence 3456778899999999999999999987543332221 223345678999999999999999999999999999999
Q ss_pred eccCCcCccccccchhhhhHHHHHHHHHHhhhhhhhccccccccCccceeeecCC
Q psy6126 82 GHPTLYPKLYSSKFLAFMVACTWLFGFGALVPTWLGVWGRFGLEPSIGSCSILPD 136 (214)
Q Consensus 82 ~~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~ 136 (214)
+++... .++.-....+++|.++++.++.++.. . .+......|+..++
T Consensus 105 ~~~~~~----~~k~e~~Y~l~~wGiPli~a~i~l~~--~--~~g~~G~WCWI~~~ 151 (303)
T PF05462_consen 105 VKRYDE----LEKLEKYYHLVCWGIPLIPAVIPLIG--N--DYGPAGNWCWIKPE 151 (303)
T ss_pred EEcCcc----HHHHHHHHHHHHHHHHHHHHHHHHHc--c--CCCCCCCceeecCC
Confidence 876322 34555567788999999887766543 2 22334458987644
No 18
>PF10321 7TM_GPCR_Srt: Serpentine type 7TM GPCR chemoreceptor Srt; InterPro: IPR019425 Chemoreception is mediated in Caenorhabditis elegans by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs) of proteins which are of the serpentine type []. Srt is a member of the Srg superfamily of chemoreceptors. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' [].
Probab=98.60 E-value=6.9e-07 Score=68.89 Aligned_cols=163 Identities=15% Similarity=0.157 Sum_probs=104.4
Q ss_pred cceeccCCCCCchhhHHHHHHHHHHHHHHhhhhhhhhhhhcccccccccccc------hhhHHHHHHHHHHHHHHHHHHH
Q psy6126 2 SPFRRPSTVRNATAVFIINLSVSDLMFCCFNLPLAASTFWQRAWTHGHLLCQ------LFPLLRYGLLAVSLFTVLGITI 75 (214)
Q Consensus 2 ~~v~~~~~~~~~~~~~i~~Lav~Dll~~~~~~~~~~~~~~~~~~~~~~~~C~------~~~~~~~~~~~~s~~~l~~is~ 75 (214)
.++.+++..|++.+-++.-||+.|+..........-...+. |...|. +.|.+...+...+......+++
T Consensus 55 ~~i~~~~~~k~~~ykiM~~L~i~Di~~l~~~si~tG~l~i~-----G~vfC~~P~~~~i~G~~~~~~W~~~c~~~i~Lai 129 (313)
T PF10321_consen 55 IAIFKKKLFKMSCYKIMFFLAIFDIIQLFINSIITGILAIF-----GAVFCSYPRFIYIMGCIALGSWCGSCIICILLAI 129 (313)
T ss_pred HHHHHhccccCcHHHHHHHHHHHHHHHHHhhhhhhhHHHhc-----CccccCCchHhhhhhccchhhHHHHHHHHHHHHH
Confidence 35667777789999999999999999875432222222222 233443 4556667778888999999999
Q ss_pred HhheeeeccCCcCccccccchhhhhHHHHHHHHHHh--hhhhhhccccccccCccceeeecC----CCCCCchhHHHHHH
Q psy6126 76 NRYVMIGHPTLYPKLYSSKFLAFMVACTWLFGFGAL--VPTWLGVWGRFGLEPSIGSCSILP----DDYGHSPKEFLFLV 149 (214)
Q Consensus 76 dRy~aI~~P~~~~~~~t~~~~~~~i~~~w~~~~~~~--~~~~~~~~~~~~~~~~~~~C~~~~----~~~~~~~~~~~~~~ 149 (214)
||...+..|........+++....+.+..+.++... .++..+ +.+.....+++ +...+.|.......
T Consensus 130 nR~~~l~~~~~~~~~F~g~~~~~~l~~~~iY~~y~~~fT~pv~F-------ns~~~sw~fDP~i~~g~~~~~Y~N~~~~~ 202 (313)
T PF10321_consen 130 NRCCELNFPFKLKFLFEGKRTYIVLFLPIIYGFYFSFFTPPVLF-------NSEYMSWFFDPMIFEGKSPELYSNIPHTF 202 (313)
T ss_pred HHHHHHhcchhhHHhhcCceEEeeehhHHHHHHHHHHHcCcccC-------CCCcceEecCccccCCCCccchhhHHHHH
Confidence 999999999887888788887776665555544433 233322 12222222222 11222244444455
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhh
Q psy6126 150 AFVIPCISIVVCYARIFYIVRKTAMKS 176 (214)
Q Consensus 150 ~~~~p~~~~~~~y~~i~~~l~~~~~~~ 176 (214)
..++-.+.+..+|+.+...++++.+..
T Consensus 203 nN~~~~~~t~~lY~~l~~~l~~k~~~~ 229 (313)
T PF10321_consen 203 NNIIVVICTTFLYIYLCYYLIKKSKYS 229 (313)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcccc
Confidence 566667778889999999988875554
No 19
>PF02118 Srg: Srg family chemoreceptor; InterPro: IPR000609 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class g (Srg) from the Srg superfamily [, ]. Srg receptors contain seven hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures. ; GO: 0004888 transmembrane signaling receptor activity, 0007606 sensory perception of chemical stimulus, 0016020 membrane
Probab=98.24 E-value=6.5e-06 Score=62.77 Aligned_cols=164 Identities=13% Similarity=0.170 Sum_probs=91.5
Q ss_pred CCCchhhHHHHHHHHHHHHHHhhhhh-hhhhhhcc------c-ccccccccchhhHHHHHHHHHHHHHHHHHHHHhheee
Q psy6126 10 VRNATAVFIINLSVSDLMFCCFNLPL-AASTFWQR------A-WTHGHLLCQLFPLLRYGLLAVSLFTVLGITINRYVMI 81 (214)
Q Consensus 10 ~~~~~~~~i~~Lav~Dll~~~~~~~~-~~~~~~~~------~-~~~~~~~C~~~~~~~~~~~~~s~~~l~~is~dRy~aI 81 (214)
.+++.+.+...-++.|++.-+...+. .+...... . ....+.......++.+.+..+.......+++||+-++
T Consensus 31 ~~~sFy~l~~~d~~~ni~~~ln~~~~~Rl~~~~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~Q~~~~~~is~nR~t~v 110 (275)
T PF02118_consen 31 FKSSFYRLYIMDLIMNILTYLNTWITLRLPSYTPPFCPFFLPFFENPPFFLNIFYFLQYYFAYVQYLSTILISLNRFTSV 110 (275)
T ss_pred cCCccHHHHHHHhHHHHHHHHHHHHHHHhccccCccchhHHHHHcCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 36788887777777777775432221 12111000 0 1223345667778888888999999999999999999
Q ss_pred eccCCcCccccccchhhhhHHHHHHHHHHhhhhhhhccccccccCccceeeecCCCCCCchhHHHHHHHHHHHHHHHHHH
Q psy6126 82 GHPTLYPKLYSSKFLAFMVACTWLFGFGALVPTWLGVWGRFGLEPSIGSCSILPDDYGHSPKEFLFLVAFVIPCISIVVC 161 (214)
Q Consensus 82 ~~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~ 161 (214)
..|.++++..++.. ..+++.+.++++....+.+.. ......+.+...-....... ...........+++..++.+++
T Consensus 111 ~~p~~~~~~W~~~~-~~~i~~i~~~p~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~i~~~ii~i~~ 187 (275)
T PF02118_consen 111 LFPIRYEKFWKRYY-WIIIIIIFLLPFSFTWNIFIS-PTYVVYDNGGFSYSYNDTVS-WASLSIFSLIYFIIIIIITIIT 187 (275)
T ss_pred hhHHhhhHHHHhhh-hhheeeeeehhHHHHHHHHcc-ccEEEEECCceEEEEEeccc-hhHHHHHHHHHHHHHHHHHHHH
Confidence 99999998877544 334444455554443333322 33333332222222211111 1111222333444555556667
Q ss_pred HHHHHHHHHhhhhhh
Q psy6126 162 YARIFYIVRKTAMKS 176 (214)
Q Consensus 162 y~~i~~~l~~~~~~~ 176 (214)
+....+++++.+++.
T Consensus 188 ~~~~~~~l~~~~~~~ 202 (275)
T PF02118_consen 188 NIITYRRLRKLSKRI 202 (275)
T ss_pred HHHHHHHHHhhhhhh
Confidence 777777777655433
No 20
>PF00002 7tm_2: 7 transmembrane receptor (Secretin family); InterPro: IPR000832 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The secretin-like GPCRs include secretin [], calcitonin [], parathyroid hormone/parathyroid hormone-related peptides [] and vasoactive intestinal peptide [], all of which activate adenylyl cyclase and the phosphatidyl-inositol-calcium pathway. These receptors contain seven transmembrane regions, in a manner reminiscent of the rhodopsins and other receptors believed to interact with G-proteins (however there is no significant sequence identity between these families, the secretin-like receptors thus bear their own unique '7TM' signature). Their N terminus is probably located on the extracellular side of the membrane and potentially glycosylated. This N-terminal region contains a long conserved region which allow the binding of large peptidic ligand such as glucagon, secretin, VIP and PACAP; this region contains five conserved cysteines residues which could be involved in disulphide bond. The C-terminal region of these receptor is probably cytoplasmic. Every receptor gene in this family is encoded on multiple exons, and several of these genes are alternatively spliced to yield functionally distinct products. ; GO: 0004930 G-protein coupled receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane; PDB: 3L2J_A 1BL1_A.
Probab=97.92 E-value=2.1e-06 Score=64.21 Aligned_cols=109 Identities=17% Similarity=0.087 Sum_probs=0.0
Q ss_pred eeccCCCCCchhhHHHHHHHHHHHHHHhhhhhhhhhhhcccccccccccchhhHHHHHHHHHHHHHHHHHHHHhheeeec
Q psy6126 4 FRRPSTVRNATAVFIINLSVSDLMFCCFNLPLAASTFWQRAWTHGHLLCQLFPLLRYGLLAVSLFTVLGITINRYVMIGH 83 (214)
Q Consensus 4 v~~~~~~~~~~~~~i~~Lav~Dll~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~s~~~l~~is~dRy~aI~~ 83 (214)
....|++|+..+....||++++++..+..+.. ......+....++..|...+.+.+.+..++..-+.+.++|=|..+..
T Consensus 25 ~~~~r~lr~~~~~i~~~l~~sll~~~~~~l~~-~~~~~~~~~~~~~~~C~~~a~~~hy~~la~f~Wm~~~~~~l~~~~~~ 103 (242)
T PF00002_consen 25 YLLFRKLRSFRNKIHLNLCLSLLLANLSFLIG-ISQTFSPISTTNHCLCRAIAILLHYFFLASFFWMLVEAFYLYRLLVK 103 (242)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred HHHHHhhcccchhhhhhhHHHHHHHHHHHhee-hhhccccccccccccchhhhhHhHHHHHHHHHHHHHHHHHhheeEEE
Confidence 34457788878888999999999887543322 11111111112334699999999999999999999999999999988
Q ss_pred cCCcCccccccchhhhhHHHHHHHHHHhhhhhhh
Q psy6126 84 PTLYPKLYSSKFLAFMVACTWLFGFGALVPTWLG 117 (214)
Q Consensus 84 P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~~ 117 (214)
+... .+++.....+..|.++.++.......
T Consensus 104 ~~~~----~~~~~~~~~~~gwg~P~~iv~i~~~~ 133 (242)
T PF00002_consen 104 VFNS----SRRRFWWYYLIGWGIPALIVVISVAV 133 (242)
T ss_dssp ----------------------------------
T ss_pred eecc----cchhhheeeeeeecCcceeeeeeeee
Confidence 7321 24455666777899888877765443
No 21
>PF10292 7TM_GPCR_Srab: Serpentine type 7TM GPCR receptor class ab chemoreceptor; InterPro: IPR019408 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. Srab is part of the Sra superfamily of chemoreceptors. The expression pattern of the srab genes is biologically intriguing. Of the six promoters successfully expressed in transgenic organisms, one was exclusively expressed in the tail phasmid neurons, two were exclusively expressed in a head amphid neuron, and two were expressed both in the head and tail neurons as well as a limited number of other cells [].
Probab=97.78 E-value=0.00064 Score=53.20 Aligned_cols=126 Identities=13% Similarity=0.116 Sum_probs=78.7
Q ss_pred ccCCCCCchhhHHHHHHHHHHHHHHhhhhhhhhhhh---c--c--cccccccccchhhHHHHHHHHHHHHHHHHHHHHhh
Q psy6126 6 RPSTVRNATAVFIINLSVSDLMFCCFNLPLAASTFW---Q--R--AWTHGHLLCQLFPLLRYGLLAVSLFTVLGITINRY 78 (214)
Q Consensus 6 ~~~~~~~~~~~~i~~Lav~Dll~~~~~~~~~~~~~~---~--~--~~~~~~~~C~~~~~~~~~~~~~s~~~l~~is~dRy 78 (214)
.++..|....+++....++.++.++.-+......++ . + +...+...|-.......++...+..+..++++||+
T Consensus 43 ~~~~~H~N~ril~~~~~~~~l~~~~~r~~~h~~~l~~~~~~~~~Cd~~~~~~~C~~lR~~~~~~~~~~~~t~v~l~IER~ 122 (324)
T PF10292_consen 43 NSKLFHFNTRILFIVHCFSFLIHCTGRIILHTYDLYNYFFPDDPCDMIPSTYRCFILRIPYNFGLFLVSFTTVSLVIERT 122 (324)
T ss_pred hcchhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccCCCcccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344556677888888888888887654443332222 1 1 12335567878888888999999999999999999
Q ss_pred eeeeccCCcCccccccchhhhhHHHHHHHHHHhhhhhhhccccccc-cCccceeeecCC
Q psy6126 79 VMIGHPTLYPKLYSSKFLAFMVACTWLFGFGALVPTWLGVWGRFGL-EPSIGSCSILPD 136 (214)
Q Consensus 79 ~aI~~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~~~-~~~~~~C~~~~~ 136 (214)
+|-.+.-+|.+. ++.-...+..+.|+++.......+. ..+. +.....|.....
T Consensus 123 iAT~~~~~YE~~-~~~~Gi~l~~~qi~is~~~~~~~~~----~~~~~~~~~~yC~~~~~ 176 (324)
T PF10292_consen 123 IATFFSKSYEKS-GKWLGILLAFFQILISLLILYFVYS----DEDFPNERMYYCQASSS 176 (324)
T ss_pred HHHhhHHHhcCC-CccHHHHHHHHHHHHHHHHHHHHHh----hhhccCccceeeEEecC
Confidence 999999889874 2322233444445554443332221 1122 334457876443
No 22
>PF10316 7TM_GPCR_Srbc: Serpentine type 7TM GPCR chemoreceptor Srbc ; InterPro: IPR019420 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class b (Srb) from the Sra superfamily []. Srb receptors contain 6-8 hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures. Srbc is a solo family amongst the superfamilies of chemoreceptors.
Probab=97.66 E-value=0.00049 Score=52.20 Aligned_cols=89 Identities=9% Similarity=0.008 Sum_probs=56.4
Q ss_pred cCCCCCchhhHHHHHHHHHHHHHHhhhhhhhhhhhc---ccccccccccchhhHHHHHHHHHHHHHHHHHHHHhheeeec
Q psy6126 7 PSTVRNATAVFIINLSVSDLMFCCFNLPLAASTFWQ---RAWTHGHLLCQLFPLLRYGLLAVSLFTVLGITINRYVMIGH 83 (214)
Q Consensus 7 ~~~~~~~~~~~i~~Lav~Dll~~~~~~~~~~~~~~~---~~~~~~~~~C~~~~~~~~~~~~~s~~~l~~is~dRy~aI~~ 83 (214)
.|+.+++.-.++-.-.+.|.+.++...+........ .+....+..=-..+................||+||.+|+..
T Consensus 33 ~Kk~kk~~l~LfY~Rf~~D~~~~~~~~~~~~~~~~~~~~~~~~~~~~~if~~~~p~~~~~~iR~~l~~~Ia~dR~~A~~f 112 (273)
T PF10316_consen 33 SKKKKKPDLSLFYFRFAIDVFYGFSVFIYLIYYILFSLFPEFLDYKNLIFYIGWPSSNLGSIRSILALIIALDRVFAVYF 112 (273)
T ss_pred ccccCCCCEEeeHHHHHHHHHHHHHHHHHHHHHHHHHhChhhhccccEEeehhhhHHHHHHHHHHHHHHHHHHHHHHHcC
Confidence 333566666777778999999998766554443321 11111111222223344444567788999999999999999
Q ss_pred cCCcCccccccc
Q psy6126 84 PTLYPKLYSSKF 95 (214)
Q Consensus 84 P~~~~~~~t~~~ 95 (214)
|..|++...+-.
T Consensus 113 PI~y~~~r~k~~ 124 (273)
T PF10316_consen 113 PIFYHNYRKKIP 124 (273)
T ss_pred CHHHHccCcccc
Confidence 998887644433
No 23
>PF04789 DUF621: Protein of unknown function (DUF621); InterPro: IPR006874 This is a conserved region found in uncharacterised proteins from Caenorhabditis elegans, and is noted to have possible G-protein-coupled receptor-like activity.
Probab=97.55 E-value=0.0035 Score=46.87 Aligned_cols=163 Identities=17% Similarity=0.224 Sum_probs=102.7
Q ss_pred ccCCCCCchhhHHHHHHHHHHHHHHhhh----hhhhhhhhcccccccccccchhhHHHHHHHHHHHHHHHHHHHHhheee
Q psy6126 6 RPSTVRNATAVFIINLSVSDLMFCCFNL----PLAASTFWQRAWTHGHLLCQLFPLLRYGLLAVSLFTVLGITINRYVMI 81 (214)
Q Consensus 6 ~~~~~~~~~~~~i~~Lav~Dll~~~~~~----~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~s~~~l~~is~dRy~aI 81 (214)
++.-++-+...|+..|.++-.+.+...+ |-.+..++..+..- +..-.+..+....+.++-.+.-..+|++|....
T Consensus 41 W~~Fk~m~ffwFl~qlt~s~fi~S~lNl~inVPatlfsl~t~~~~~-s~~f~~~Sy~idf~h~siLfsNlviaIqR~fVF 119 (305)
T PF04789_consen 41 WSHFKPMKFFWFLTQLTISVFIISSLNLLINVPATLFSLITKEFVQ-SELFIFMSYLIDFCHYSILFSNLVIAIQRFFVF 119 (305)
T ss_pred HHhcccchHHHHHHHHHHHHHHHHhhhheEeCcHHHHHhhhHHHhc-cchhhHHHHHHHHHHHHHHHHHHHHHHhhheee
Confidence 3333445678888899888887765433 33333333333222 234455667888888889999999999998877
Q ss_pred eccCCcCccccccchhhhhHHHHHHHHHHhhhhhhhccccccccCcc----ceeeecC----CCCCCchhHHHHHHHHHH
Q psy6126 82 GHPTLYPKLYSSKFLAFMVACTWLFGFGALVPTWLGVWGRFGLEPSI----GSCSILP----DDYGHSPKEFLFLVAFVI 153 (214)
Q Consensus 82 ~~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~~~~~~~----~~C~~~~----~~~~~~~~~~~~~~~~~~ 153 (214)
-.-....+...+.-..+-+..+|+++...-...... ...+.++... ..|.... ..........-..+.+.+
T Consensus 120 Ffr~~t~~~F~s~~iyiWL~~vWils~~v~~~l~~~-nC~Y~y~~~~~~y~L~C~~~~~~v~~~~P~~IqiiE~ilQ~~I 198 (305)
T PF04789_consen 120 FFRNLTDKVFESPVIYIWLLLVWILSIGVVYSLMSN-NCRYRYNKWSKHYQLNCETCNSVVDISPPRGIQIIEIILQFGI 198 (305)
T ss_pred eehhhhhhhccchHHHHHHHHHHHHHHHHHHHHHhC-CCeeecccccccEEEEcCCCCeeEeeCCCCchhHHHHHHHHhH
Confidence 654445566677888888999999998776654432 3444433322 2565431 122233444567788899
Q ss_pred HHHHHHHHHHHHHHHHHh
Q psy6126 154 PCISIVVCYARIFYIVRK 171 (214)
Q Consensus 154 p~~~~~~~y~~i~~~l~~ 171 (214)
|+++.+ .|+.+..++-.
T Consensus 199 Pi~Il~-iYiAIIiKI~~ 215 (305)
T PF04789_consen 199 PIFILV-IYIAIIIKIIK 215 (305)
T ss_pred HHHHHH-HHHHHHHHHHH
Confidence 987665 58777665543
No 24
>PF03383 Serpentine_r_xa: Caenorhabditis serpentine receptor-like protein, class xa; InterPro: IPR005047 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class xa (Srxa), from the Str superfamily [].
Probab=97.36 E-value=0.00016 Score=49.42 Aligned_cols=101 Identities=17% Similarity=0.138 Sum_probs=63.9
Q ss_pred hHHHHHHHHHHHHHHhhhhhhhhhhhccccc---ccccccchhhHHHHHHHHHHHHHHHHHHHHhheeeeccCCcCcccc
Q psy6126 16 VFIINLSVSDLMFCCFNLPLAASTFWQRAWT---HGHLLCQLFPLLRYGLLAVSLFTVLGITINRYVMIGHPTLYPKLYS 92 (214)
Q Consensus 16 ~~i~~Lav~Dll~~~~~~~~~~~~~~~~~~~---~~~~~C~~~~~~~~~~~~~s~~~l~~is~dRy~aI~~P~~~~~~~t 92 (214)
.++.++++++++..+..+.-........+.. +-+..-+-...+..++.....+...+|++||-..|.+|++...++|
T Consensus 5 vYI~~M~i~gi~~~~~~~v~~~~~~~~~~~~Y~~yr~~~G~~~TL~~tf~Yl~plfltvLMti~Ri~iV~~P~~~~~~Fs 84 (153)
T PF03383_consen 5 VYIYNMIICGIIDKFFIFVDFWIPLFLPDEGYEEYRQFFGKEVTLFGTFSYLHPLFLTVLMTINRIYIVLFPFGSEIWFS 84 (153)
T ss_pred EEEeHHHHhhHHHHHHHHhhhhhhhccChhHHHHHHHHhCCEEeeeehHHHHHHHHHHHHHHHhheEEEEecCCCccccc
Confidence 4667788888777654222222222211111 1111222233445566667888899999999999999998777889
Q ss_pred ccchhhhhHHHHHHHHHHhhhhhh
Q psy6126 93 SKFLAFMVACTWLFGFGALVPTWL 116 (214)
Q Consensus 93 ~~~~~~~i~~~w~~~~~~~~~~~~ 116 (214)
.++..+-+....++.+..-+-|++
T Consensus 85 ~~kl~~YC~~i~i~~~i~LlIPy~ 108 (153)
T PF03383_consen 85 DKKLWIYCGIIAILSFISLLIPYF 108 (153)
T ss_pred cchhHHHHHHHHHHHHHHHHhhcC
Confidence 999988887766666665554544
No 25
>PF10317 7TM_GPCR_Srd: Serpentine type 7TM GPCR chemoreceptor Srd; InterPro: IPR019421 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents the chemoreceptor Srd [].
Probab=97.35 E-value=0.0023 Score=49.37 Aligned_cols=108 Identities=13% Similarity=0.200 Sum_probs=74.9
Q ss_pred eeccCCCCCchhhHHHHHHHHHHHHHHhhhhhhhhhhhc---------c-cccccccccchhhHHHHHHHHHHHHHHHHH
Q psy6126 4 FRRPSTVRNATAVFIINLSVSDLMFCCFNLPLAASTFWQ---------R-AWTHGHLLCQLFPLLRYGLLAVSLFTVLGI 73 (214)
Q Consensus 4 v~~~~~~~~~~~~~i~~Lav~Dll~~~~~~~~~~~~~~~---------~-~~~~~~~~C~~~~~~~~~~~~~s~~~l~~i 73 (214)
..+.++.-+...+++.|-|+.|++.++.......-..-+ | --..|+..|.....+...+...+.+.+..-
T Consensus 24 ~~~tp~~l~~~~~~l~~~~~~~~~~~~~~~~~q~R~i~~~~~~~~~~~GpC~~~~~~~C~~~y~~~~~~~~~s~~~i~~s 103 (292)
T PF10317_consen 24 IFKTPKSLRTYSILLLNTAIFDLISIISAFLTQPRIIPNGTSIAVISYGPCRYFGPWFCFFCYHIMLHFSIHSGWSIILS 103 (292)
T ss_pred HHhChHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCCcEEEEeeecceecCcHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344555556788999999999999986533322211111 1 123466788888888888888999999999
Q ss_pred HHHhheeeeccCCcCccccccchhhhhHHHHHHHHHHhhhhhh
Q psy6126 74 TINRYVMIGHPTLYPKLYSSKFLAFMVACTWLFGFGALVPTWL 116 (214)
Q Consensus 74 s~dRy~aI~~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~ 116 (214)
-..||..+.++.. ++++....+...+..++......++
T Consensus 104 f~yRy~iL~~~~~-----~~~~~i~~~~l~~~p~~~~~i~~~~ 141 (292)
T PF10317_consen 104 FYYRYYILRNPSP-----SRKRIIFICILIYIPSFSMLIFFFF 141 (292)
T ss_pred HHHHHHHhcCCCC-----chhHHHHHHHHHHHHHHHHHHHHhc
Confidence 9999999988643 5566677777777777766554443
No 26
>KOG4193|consensus
Probab=97.04 E-value=0.0015 Score=55.05 Aligned_cols=142 Identities=13% Similarity=0.065 Sum_probs=83.2
Q ss_pred CCCCCchhhHHHHHHHHHHHHHHhhhhhhhhhhhccccccccc--ccchhhHHHHHHHHHHHHHHHHHHHHhheeeeccC
Q psy6126 8 STVRNATAVFIINLSVSDLMFCCFNLPLAASTFWQRAWTHGHL--LCQLFPLLRYGLLAVSLFTVLGITINRYVMIGHPT 85 (214)
Q Consensus 8 ~~~~~~~~~~i~~Lav~Dll~~~~~~~~~~~~~~~~~~~~~~~--~C~~~~~~~~~~~~~s~~~l~~is~dRy~aI~~P~ 85 (214)
|++++..+....||+++=++.- + .++.+.|..+.. .|.....+.+.+..++.+-+..++++=|..++.=
T Consensus 348 ~~l~~~~~~i~~~l~~~L~l~~-l-------~fL~~~~~~~~~~~~C~~~a~llhff~LaaF~Wm~leg~hl~~~~v~v- 418 (610)
T KOG4193|consen 348 RKLQNDRTKIHINLCLCLFLAE-L-------LFLLGIDRTSTSVVLCIAAAILLHFFFLAAFFWMLLEGFHLYLLLVEV- 418 (610)
T ss_pred HHHHhhcchhHHHHHHHHHHHH-H-------HHhcccccccCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-
Confidence 4444444666677776621111 1 111223333332 6999999999999999999999999999854443
Q ss_pred CcCccccccchhhhhHHHHHHHHHHhhhhhhhccccccccCccceeeecCCCCCCchhHHHHHHHHHHHHHHHHHHHHHH
Q psy6126 86 LYPKLYSSKFLAFMVACTWLFGFGALVPTWLGVWGRFGLEPSIGSCSILPDDYGHSPKEFLFLVAFVIPCISIVVCYARI 165 (214)
Q Consensus 86 ~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~y~~i 165 (214)
......++........|....++...........+........|+...+..... . |+.|+.+++.+|+.+
T Consensus 419 --f~~~~~~~~l~~~~~gwg~Pavvv~Isa~~~~~~~~~~~~~~~CWl~~~~~~~~--~------F~GPv~~ii~~Ni~~ 488 (610)
T KOG4193|consen 419 --FRSRPRRRKLLYSLYGWGVPAVVVGVSALVDPDLEGQYGTPRVCWLDTQNGFIW--S------FLGPVTLIILVNIVM 488 (610)
T ss_pred --hccccchHHHHHHHHHhhhhHHHHhheeEEeccCccccccCCceEEecCCceEE--E------EehHHHHHHHHHHHH
Confidence 333345555555568999988877764443222221223333588754432211 1 667777777777555
Q ss_pred HHH
Q psy6126 166 FYI 168 (214)
Q Consensus 166 ~~~ 168 (214)
+..
T Consensus 489 Fv~ 491 (610)
T KOG4193|consen 489 FVV 491 (610)
T ss_pred HHH
Confidence 444
No 27
>PF02101 Ocular_alb: Ocular albinism type 1 protein; InterPro: IPR001414 Ocular albinism type 1 (OA1) is an X-linked disorder characterised by severe impairment of visual acuity, retinal hypopigmentation and the presence of macromelanosomes. A novel transcript from the OA1 critical region is expressed in high levels in RNA samples from retina and from melanoma and encodes a potential integral membrane protein []. This protein is of unknown function but is known to bind heterotrimeric G proteins.; GO: 0016020 membrane
Probab=96.74 E-value=0.055 Score=42.62 Aligned_cols=94 Identities=10% Similarity=0.115 Sum_probs=67.0
Q ss_pred hhhHHHHHHHHHHHHHHhhhhhhhhhhhcccc-----------cccccccchhhHHHHHHHHHHHHHHHHHHHHhheeee
Q psy6126 14 TAVFIINLSVSDLMFCCFNLPLAASTFWQRAW-----------THGHLLCQLFPLLRYGLLAVSLFTVLGITINRYVMIG 82 (214)
Q Consensus 14 ~~~~i~~Lav~Dll~~~~~~~~~~~~~~~~~~-----------~~~~~~C~~~~~~~~~~~~~s~~~l~~is~dRy~aI~ 82 (214)
..-.+..||++|++.++..+.....+....+. ..+..+|.......+.+..++-+-+++-|+|=|+.+.
T Consensus 67 ~~rIl~~la~aDlLaclGVivRS~vWl~~p~~~~s~s~~~~~d~wp~afCv~ss~WIq~fYsAtfwWtfcYAVDv~Lv~~ 146 (405)
T PF02101_consen 67 SRRILFWLAVADLLACLGVIVRSSVWLGFPNFIDSISDVNGTDIWPAAFCVGSSMWIQLFYSATFWWTFCYAVDVYLVIR 146 (405)
T ss_pred CchhHHHHHHHHHHhhhhHHHHhhhhhcCCcccccccCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhheee
Confidence 35688999999999997666655544322211 1234689888888888888999999999999999885
Q ss_pred ccCCcCccccccchhhhhHHHHHHHHHHhhh
Q psy6126 83 HPTLYPKLYSSKFLAFMVACTWLFGFGALVP 113 (214)
Q Consensus 83 ~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~ 113 (214)
+.- .+.....--..+|.++.++..-
T Consensus 147 ~~a------g~~~~~lYH~~aWgl~~lL~~~ 171 (405)
T PF02101_consen 147 RSA------GRSTIWLYHMMAWGLPALLCAE 171 (405)
T ss_pred ccC------CCcchhHHHHHHHHHHHHHHHh
Confidence 531 2233445667889998887775
No 28
>PF10322 7TM_GPCR_Sru: Serpentine type 7TM GPCR chemoreceptor Sru; InterPro: IPR003839 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class u (Sru) from the Srg superfamily [].
Probab=96.63 E-value=0.052 Score=42.11 Aligned_cols=122 Identities=15% Similarity=0.230 Sum_probs=73.8
Q ss_pred cccchhhHHHHHHHHHHHHHHHHHHHHhheeeeccCCcCccccccchhhhhHHHHHHHHHHhhhhhhhccccccccCc--
Q psy6126 50 LLCQLFPLLRYGLLAVSLFTVLGITINRYVMIGHPTLYPKLYSSKFLAFMVACTWLFGFGALVPTWLGVWGRFGLEPS-- 127 (214)
Q Consensus 50 ~~C~~~~~~~~~~~~~s~~~l~~is~dRy~aI~~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~~~~~~-- 127 (214)
..=++..++.+...+.+.+..++.++-|.+-+..|..+++. .++-.....-++.+.+++.+.|.+........-+..
T Consensus 101 ~~Lk~l~~~~~~~~Y~s~lf~~Lfc~~Rl~il~~p~~~~~i-~~~i~~~~~P~i~i~p~~~~f~~~pa~G~C~Ql~~Pf~ 179 (307)
T PF10322_consen 101 HFLKILVFFYYYFNYSSMLFPVLFCLLRLIILYSPRNHKKI-CRKIFRIWIPFIFIYPFCFTFPMFPALGYCRQLDPPFP 179 (307)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHheeCccchhHH-HHhHHHHHHHHHHHHHHHHHHHccCCcEEEEeCCCCCC
Confidence 44566778888999999999999999999999999876554 444445566667777777777755431111111110
Q ss_pred cceee--ecCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy6126 128 IGSCS--ILPDDYGHSPKEFLFLVAFVIPCISIVVCYARIFYIVRKTA 173 (214)
Q Consensus 128 ~~~C~--~~~~~~~~~~~~~~~~~~~~~p~~~~~~~y~~i~~~l~~~~ 173 (214)
.+.=. ......+.. .....+.....-.+.++++++.++.++|+.+
T Consensus 180 fGAI~I~~~~~~~~~~-~~~~~l~~s~~~~~~iii~N~lm~~Klr~~k 226 (307)
T PF10322_consen 180 FGAIIITSTGSWFNIR-NSIFHLFFSIFWMISIIILNILMFFKLRKLK 226 (307)
T ss_pred CCEEEEEEEcccccch-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 01000 011111111 2222333444445567778999999998865
No 29
>PF10326 7TM_GPCR_Str: Serpentine type 7TM GPCR chemoreceptor Str; InterPro: IPR019428 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class r (Str) from the Str superfamily [, ]. Almost a quarter (22.5%) of str and srj family genes and pseudogenes in C. elegans appear to have been newly formed by gene duplications since the species split [].
Probab=96.60 E-value=0.0031 Score=48.94 Aligned_cols=107 Identities=12% Similarity=0.197 Sum_probs=68.2
Q ss_pred eccCCCCCchhhHHHHHHHHHHHHHHhhhhhhhhhh--------h--cccccccccccchhhHHHHHHHHHHHHHHHHHH
Q psy6126 5 RRPSTVRNATAVFIINLSVSDLMFCCFNLPLAASTF--------W--QRAWTHGHLLCQLFPLLRYGLLAVSLFTVLGIT 74 (214)
Q Consensus 5 ~~~~~~~~~~~~~i~~Lav~Dll~~~~~~~~~~~~~--------~--~~~~~~~~~~C~~~~~~~~~~~~~s~~~l~~is 74 (214)
.+.++.-....+++...|+.|++.+++-........ + .+.+..+....+....+...+...+...++.--
T Consensus 25 ~~s~k~~G~Yk~Lm~~fs~~~i~fs~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~y~~~f~~~~~llaihF 104 (307)
T PF10326_consen 25 TKSPKSLGSYKYLMIYFSIFEIIFSILDFLVQPIIHSYGSSFVVFSDSGPFFLSKSIGQILLAIYCGCFGMSFSLLAIHF 104 (307)
T ss_pred hccCCCCCCEEEEEehhHHHHHHHHHHHHHHHhceeecCCeeEEEeecccccccchheeehhhhhhhhHHHHHHHHHhHh
Confidence 344444467788888999999999875443322211 1 122334555667777777777888888888889
Q ss_pred HHhheeeeccCCcCccccccchhhhhHHHHHHHHHHhh
Q psy6126 75 INRYVMIGHPTLYPKLYSSKFLAFMVACTWLFGFGALV 112 (214)
Q Consensus 75 ~dRy~aI~~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~ 112 (214)
+.||.++++|.+.+. .+.++........-+.+..+..
T Consensus 105 iYRY~av~~~~~l~~-f~~~~~~~w~~~~~~~~~~W~~ 141 (307)
T PF10326_consen 105 IYRYWAVCRPKKLRY-FSGWRFIIWFLIPIIFGIIWFL 141 (307)
T ss_pred hhheeeeccchhhcc-ccccceeeeehhhhhhcccceE
Confidence 999999999976433 3555555444444444444333
No 30
>PF02175 7TM_GPCR_Srb: Serpentine type 7TM GPCR chemoreceptor Srb; InterPro: IPR002184 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class b (Srb) from the Sra superfamily []. Srb receptors contain 6-8 hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures.; GO: 0004888 transmembrane signaling receptor activity, 0007606 sensory perception of chemical stimulus, 0016021 integral to membrane
Probab=96.18 E-value=0.064 Score=39.42 Aligned_cols=45 Identities=18% Similarity=0.274 Sum_probs=38.2
Q ss_pred cccccccchhhHHHHHHHHHHHHHHHHHHHHhheeeeccCCcCcc
Q psy6126 46 THGHLLCQLFPLLRYGLLAVSLFTVLGITINRYVMIGHPTLYPKL 90 (214)
Q Consensus 46 ~~~~~~C~~~~~~~~~~~~~s~~~l~~is~dRy~aI~~P~~~~~~ 90 (214)
..+...++............+++.-..+++|||+|...--+|.+.
T Consensus 79 iI~~~~~K~~h~~~~flmT~~ml~PigftIERfiAl~~A~~YE~~ 123 (236)
T PF02175_consen 79 IIDPTLFKYGHLTGLFLMTIPMLFPIGFTIERFIALKMAEKYENT 123 (236)
T ss_pred EEchHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhccC
Confidence 345567788888888899999999999999999999998888774
No 31
>PF02117 7TM_GPCR_Sra: Serpentine type 7TM GPCR chemoreceptor Sra; InterPro: IPR000344 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class a (Sra) from the Sra superfamily []. Sra receptors contain 6-7 hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures.; GO: 0004888 transmembrane signaling receptor activity, 0007606 sensory perception of chemical stimulus, 0016021 integral to membrane
Probab=95.71 E-value=0.081 Score=41.56 Aligned_cols=75 Identities=15% Similarity=0.233 Sum_probs=50.9
Q ss_pred chhhHHHHHHHHHHHHHHhhhhhhhhhh----h-cc---cccccccccchhhHHHHHHHHHHHHHHHHHHHHhheeeecc
Q psy6126 13 ATAVFIINLSVSDLMFCCFNLPLAASTF----W-QR---AWTHGHLLCQLFPLLRYGLLAVSLFTVLGITINRYVMIGHP 84 (214)
Q Consensus 13 ~~~~~i~~Lav~Dll~~~~~~~~~~~~~----~-~~---~~~~~~~~C~~~~~~~~~~~~~s~~~l~~is~dRy~aI~~P 84 (214)
...+++.+|..+++-.... .-..+..+ . .+ +....+..|.........+....++.-+++++||.+|-..|
T Consensus 55 Tr~LL~~~l~~a~~Hq~~y-~~~~~~~lyrsf~~~~dpC~i~f~~~eC~~~~~~~~~~~~~~~~~q~aL~idRl~at~~~ 133 (328)
T PF02117_consen 55 TRILLISNLLFANIHQISY-AIIQIWSLYRSFFYSDDPCSILFTSSECFPYYYFYYFTNSGMIFIQFALTIDRLLATFFP 133 (328)
T ss_pred HHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhcCCCCeeeEeecCceeeeehHHHHHHHHHHHHHHHHHHHHHHHHhch
Confidence 4456666777777655432 11122111 1 11 24556678999999999998889999999999999999888
Q ss_pred CCcC
Q psy6126 85 TLYP 88 (214)
Q Consensus 85 ~~~~ 88 (214)
..+.
T Consensus 134 ~~~~ 137 (328)
T PF02117_consen 134 KYYS 137 (328)
T ss_pred hhhh
Confidence 6543
No 32
>PF10319 7TM_GPCR_Srj: Serpentine type 7TM GPCR chemoreceptor Srj; InterPro: IPR019423 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class j (Srj) from the Str superfamily [, ]. The Srj family is designated as the out-group based on its location in preliminary phylogenetic analyses of the entire superfamily [].
Probab=95.41 E-value=0.35 Score=37.39 Aligned_cols=108 Identities=9% Similarity=0.069 Sum_probs=60.7
Q ss_pred ceeccCCCC-CchhhHHHHHHHHHHHHHHh--hhhhhhhhh-------hcccccccc-cccchhhHHHHHHHHHHHHHHH
Q psy6126 3 PFRRPSTVR-NATAVFIINLSVSDLMFCCF--NLPLAASTF-------WQRAWTHGH-LLCQLFPLLRYGLLAVSLFTVL 71 (214)
Q Consensus 3 ~v~~~~~~~-~~~~~~i~~Lav~Dll~~~~--~~~~~~~~~-------~~~~~~~~~-~~C~~~~~~~~~~~~~s~~~l~ 71 (214)
.+..+|+.+ ....++++.-|+-|++.++. .+|..+..+ ..++|-... ..-.......-.....|-..|.
T Consensus 28 li~~~~~~~~G~Yr~LL~~Fa~fn~~~S~~~~~vp~~vh~yry~F~~fi~dG~F~~~s~l~~~~ls~RCsfIs~sYaIL~ 107 (310)
T PF10319_consen 28 LIFTEKKSQFGNYRYLLLFFAIFNLIYSVVDLLVPICVHGYRYAFVVFISDGPFFEKSELGQHLLSIRCSFISGSYAILH 107 (310)
T ss_pred eEEcccccccccHHHHHHHHHHHHHHHHHHHHHhhheeeccceEEEEEEcCCcCcCcchHHHHHHHHHHHHHhhhHHHHH
Confidence 444555554 67889999999999999874 455544332 123332221 1222223333344445555566
Q ss_pred HHHHHhheeeeccCCcCccccccchhhhhHHHHHHHHHH
Q psy6126 72 GITINRYVMIGHPTLYPKLYSSKFLAFMVACTWLFGFGA 110 (214)
Q Consensus 72 ~is~dRy~aI~~P~~~~~~~t~~~~~~~i~~~w~~~~~~ 110 (214)
.==+.||+++++|..-++...+......+..+-.....+
T Consensus 108 ~HFvYRYl~l~~~~~~~~~F~p~gl~~s~~~~~~h~~~W 146 (310)
T PF10319_consen 108 IHFVYRYLVLFNSKFINKYFMPYGLIGSILYCLFHFASW 146 (310)
T ss_pred HHHHHHHHHHhCcHHHHHhhhhHHHHHHHHHHHHHHHHH
Confidence 667899999999965555544444444444333333333
No 33
>PF10318 7TM_GPCR_Srh: Serpentine type 7TM GPCR chemoreceptor Srh; InterPro: IPR019422 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. Srh is part of the Str superfamily of chemoreceptors [].
Probab=95.33 E-value=0.14 Score=39.72 Aligned_cols=111 Identities=14% Similarity=0.165 Sum_probs=64.8
Q ss_pred eeccCCCCCchhhHHHH----HHHHHHHHHHhhhhhhhhhhhc----ccccccccccchhhHHHHHHHHHHHHHHHHHHH
Q psy6126 4 FRRPSTVRNATAVFIIN----LSVSDLMFCCFNLPLAASTFWQ----RAWTHGHLLCQLFPLLRYGLLAVSLFTVLGITI 75 (214)
Q Consensus 4 v~~~~~~~~~~~~~i~~----Lav~Dll~~~~~~~~~~~~~~~----~~~~~~~~~C~~~~~~~~~~~~~s~~~l~~is~ 75 (214)
+.+.++.-+.....+.+ ..+.|+..+.+..|........ |-...-+.-..++.++..........++..+--
T Consensus 29 l~ktp~~m~~~k~~ll~~~~~~~~~d~~~~~l~~P~~~~P~~~~~~~G~l~~~gv~~~~q~~i~~~~~~~~~~S~~~lFe 108 (302)
T PF10318_consen 29 LFKTPKSMKSVKWYLLNHHFWNFLLDLYLSFLIIPYFFFPSPAGYPLGILSYFGVPTPIQFYIGVTLLAFVGVSILSLFE 108 (302)
T ss_pred eEeChhhhHHHHHHHHHHHHHHHHHHHHhhhEEeeeeccchHHhhCCcchhhcccChHHHHHHHHHHhhhhhHHHHHHHH
Confidence 34444443444333333 4557888888888866654332 111111244666777777777788888888889
Q ss_pred HhheeeeccCCcCccccccchhhhhHHHHHHHHHHhhhhhh
Q psy6126 76 NRYVMIGHPTLYPKLYSSKFLAFMVACTWLFGFGALVPTWL 116 (214)
Q Consensus 76 dRy~aI~~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~ 116 (214)
+||.++....+ +...++--.......++.+.....|...
T Consensus 109 ~R~~~i~~~~~--~~~~~~~r~~~~~~~y~~~~~~~~~~~~ 147 (302)
T PF10318_consen 109 NRYQIIIRNRS--RSLWKKIRIIFIIFNYIIAFLFFIPIYL 147 (302)
T ss_pred HHHHHHhcccc--cccchhhHHHHHHHHHHHHHHHHHHHhc
Confidence 99999997643 1212222333445556666666666554
No 34
>PF06681 DUF1182: Protein of unknown function (DUF1182); InterPro: IPR010601 This family consists of several hypothetical proteins of around 360 residues in length and seems to be specific to Caenorhabditis elegans. The function of this family is unknown.
Probab=94.25 E-value=0.66 Score=33.51 Aligned_cols=112 Identities=13% Similarity=0.104 Sum_probs=59.4
Q ss_pred ccccccccchhhHHHHHHHHHHHHHHHHHHHHhheeeeccCCcCccccccchhhhhHHHHHHHHHHhhhhhhh--ccccc
Q psy6126 45 WTHGHLLCQLFPLLRYGLLAVSLFTVLGITINRYVMIGHPTLYPKLYSSKFLAFMVACTWLFGFGALVPTWLG--VWGRF 122 (214)
Q Consensus 45 ~~~~~~~C~~~~~~~~~~~~~s~~~l~~is~dRy~aI~~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~~--~~~~~ 122 (214)
...+-..|.+.--+...+....-...+++|++||.-|++-.+-.. .. .|.+-.+.++|.++. .....
T Consensus 110 p~ytI~vCSflk~f~S~t~s~~~vip~aVAIyRy~~VV~k~rl~~----~f-------~v~v~~iit~~~~~~a~~~~~~ 178 (226)
T PF06681_consen 110 PQYTIFVCSFLKNFTSSTTSCAQVIPVAVAIYRYLIVVLKKRLSV----WF-------VVVVHVIITFPFIFYAILNFPF 178 (226)
T ss_pred ceEEEEeehHHHhHHHHHhhhhccchhhhhhhhhheeeecccCCc----hH-------HHHHHHHHHHHHHHHHHHhccc
Confidence 344556898887777777666666777899999999988644321 11 222222333332211 01222
Q ss_pred cccCccceeeecCCCCCC-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy6126 123 GLEPSIGSCSILPDDYGH-SPKEFLFLVAFVIPCISIVVCYARIFYIVRK 171 (214)
Q Consensus 123 ~~~~~~~~C~~~~~~~~~-~~~~~~~~~~~~~p~~~~~~~y~~i~~~l~~ 171 (214)
+.......|......++. .... .....++ ++.++.+..|++.+||
T Consensus 179 G~~t~NdiCa~L~fs~~~~~Vri---~~tl~l~-lfAVl~N~~i~~fvkk 224 (226)
T PF06681_consen 179 GEPTPNDICAILRFSKSMELVRI---SYTLGLN-LFAVLLNFVILRFVKK 224 (226)
T ss_pred CCCCcCCeeeEEEEccchHHHHH---HHHhHHH-HHHHHHHHhHheeeec
Confidence 333444588754333322 1222 1123333 3455668887777665
No 35
>PF10327 7TM_GPCR_Sri: Serpentine type 7TM GPCR chemoreceptor Sri; InterPro: IPR019429 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents Sri, which is part of the Str superfamily of chemoreceptors.
Probab=94.10 E-value=1 Score=35.01 Aligned_cols=109 Identities=12% Similarity=0.122 Sum_probs=60.2
Q ss_pred ceeccCCCCCchhhHH---HHHHHHHHHHHHhhhhhhhhhhhccccccc------ccccchhhHHHHHHHHHHHHHHHHH
Q psy6126 3 PFRRPSTVRNATAVFI---INLSVSDLMFCCFNLPLAASTFWQRAWTHG------HLLCQLFPLLRYGLLAVSLFTVLGI 73 (214)
Q Consensus 3 ~v~~~~~~~~~~~~~i---~~Lav~Dll~~~~~~~~~~~~~~~~~~~~~------~~~C~~~~~~~~~~~~~s~~~l~~i 73 (214)
++.+.+++.+-.++++ ....++|+-.+++.-|..+.....+ +..| +..+.....+.........-.++.-
T Consensus 33 i~fks~k~~~fry~ll~~Qi~~~l~di~~t~L~qpipLfP~~ag-y~~G~l~~~f~i~~h~~~~i~~~l~~~q~~~L~~C 111 (303)
T PF10327_consen 33 IIFKSPKLDNFRYYLLYFQISCTLTDIHLTFLMQPIPLFPIPAG-YCVGFLARYFGISSHYLMTIWIFLFIFQIESLVIC 111 (303)
T ss_pred EEEecCCccchhhHHHHHHHHHHHhhhhhhhhccchhhcceeEE-EEEeehhhcCCCchHHHHHHHHHHHHHHHHHHHHH
Confidence 4566666665333333 3356789999888777777654332 2222 1223333344444445555566666
Q ss_pred HHHhheeeeccCCcCccccccchhhhhHHHHHHHHHHhhh
Q psy6126 74 TINRYVMIGHPTLYPKLYSSKFLAFMVACTWLFGFGALVP 113 (214)
Q Consensus 74 s~dRy~aI~~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~ 113 (214)
-+-|..+|.+..+.++. ++..-........+..+.....
T Consensus 112 F~~KHQaIa~i~~~~~l-~~~~~~~~~~~~~~~p~~~~~~ 150 (303)
T PF10327_consen 112 FLRKHQAIARILSKHVL-PKWFYIFFYFFCILFPFFIFIC 150 (303)
T ss_pred HHHHHHHHHhhhccCcC-cHHHHHHHhhHHHHHHHHHHHH
Confidence 78899999998754442 4444344444444444444443
No 36
>PF01534 Frizzled: Frizzled/Smoothened family membrane region; InterPro: IPR000539 The frizzled (fz) locus of Drosophila coordinates the cytoskeletons of epidermal cells, producing a parallel array of cuticular hairs and bristles [, ]. In fz mutants, the orientation of individual hairs with respect both to their neighbours and to the organism as a whole is altered. In the wild-type wing, all hairs point towards the distal tip []. In the developing wing, fz has 2 functions: it is required for the proximal-distal transmission of an intracellular polarity signal; and it is required for cells to respond to the polarity signal. Fz produces an mRNA that encodes an integral membrane protein with 7 putative transmembrane (TM) domains. This protein should contain both extracellular and cytoplasmic domains, which could function in the transmission and interpretation of polarity information []. This signature is usually found downstream of the Fz domain (IPR000024 from INTERPRO); GO: 0007166 cell surface receptor linked signaling pathway, 0016020 membrane
Probab=91.85 E-value=2.8 Score=33.03 Aligned_cols=87 Identities=11% Similarity=0.106 Sum_probs=66.1
Q ss_pred ccccccchhhHHHHHHHHHHHHHHHHHHHHhheeeeccCCcCccccccchhhhhHHHHHHHHHHhhhhhhhccccccccC
Q psy6126 47 HGHLLCQLFPLLRYGLLAVSLFTVLGITINRYVMIGHPTLYPKLYSSKFLAFMVACTWLFGFGALVPTWLGVWGRFGLEP 126 (214)
Q Consensus 47 ~~~~~C~~~~~~~~~~~~~s~~~l~~is~dRy~aI~~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~~~~~ 126 (214)
.++..|.+...+.+.+..++..--+.+++.-|++-........+ .++....-++.|.++.+..+..+. ....+.|.
T Consensus 89 ~~~~~C~~~F~l~Yyf~mAa~~WWviLt~~W~lsa~~kw~~e~i--~~~s~yfH~~aW~iP~~~ti~vL~--~~~VdgD~ 164 (328)
T PF01534_consen 89 LENSSCTVVFLLLYYFGMAASLWWVILTLTWFLSAGLKWGSEAI--EKKSSYFHLVAWGIPAVLTIAVLA--LRKVDGDE 164 (328)
T ss_pred CCCCchhhHHHHHHHHHhHHHHHHHHHHHHHHHHhhcccCcchh--hhhcchhhhHHhhhhHHHHHHHHH--hccccccc
Confidence 35679999999999998888888888899999998876654443 466677788999999998887554 34456666
Q ss_pred ccceeeecCCC
Q psy6126 127 SIGSCSILPDD 137 (214)
Q Consensus 127 ~~~~C~~~~~~ 137 (214)
-.+.|......
T Consensus 165 ltGiC~Vg~~~ 175 (328)
T PF01534_consen 165 LTGICFVGNQN 175 (328)
T ss_pred ccceeEEeCCC
Confidence 77789875443
No 37
>KOG4564|consensus
Probab=91.67 E-value=0.73 Score=37.87 Aligned_cols=110 Identities=10% Similarity=0.027 Sum_probs=71.1
Q ss_pred ceeccCCCCCchhhHHHHHHHHHHHHHHhhhhhhhhhhhccc------------ccccccccchhhHHHHHHHHHHHHHH
Q psy6126 3 PFRRPSTVRNATAVFIINLSVSDLMFCCFNLPLAASTFWQRA------------WTHGHLLCQLFPLLRYGLLAVSLFTV 70 (214)
Q Consensus 3 ~v~~~~~~~~~~~~~i~~Lav~Dll~~~~~~~~~~~~~~~~~------------~~~~~~~C~~~~~~~~~~~~~s~~~l 70 (214)
++..-|++|-..|++=.||-++=++-++..+.........++ ...+...|+....+...+..+.-+-+
T Consensus 168 If~~FR~L~CtRn~IH~nLF~SfiLra~~~~i~~~~l~~~~~~~~~~~~~~~~~~~~~~~~Ck~~~~~~~Yf~~aNf~Wm 247 (473)
T KOG4564|consen 168 IFLYFRSLHCTRNYIHMNLFASFILRAASVLIKDLVLVVNGEQDASSDTSLHCLISSNPVGCKLLFVFFQYFVLANFFWM 247 (473)
T ss_pred HHHHhhhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccchhHHHHHHHHHHHHHHHHHHH
Confidence 455668899889999999999999888765554443332211 12455789998887777777776666
Q ss_pred HHHHHHhheeeeccCCcCccccccchhhhhHHHHHHHHHHhhhhhh
Q psy6126 71 LGITINRYVMIGHPTLYPKLYSSKFLAFMVACTWLFGFGALVPTWL 116 (214)
Q Consensus 71 ~~is~dRy~aI~~P~~~~~~~t~~~~~~~i~~~w~~~~~~~~~~~~ 116 (214)
.+=.+.=|-.+.--... .+++......+.|.+..++..+-.+
T Consensus 248 lvEGlYLhtLlv~~~~s----er~~~~~y~~iGWG~P~v~v~~Wai 289 (473)
T KOG4564|consen 248 LVEGLYLHTLLVVAFFS----ERRYFWLYLLIGWGVPAVFVVVWAI 289 (473)
T ss_pred HHHHHHHHHhhheeecc----CccHHHHHHHHhhCCcHHHhhhHHh
Confidence 55554444333322111 2556666777888888887776443
No 38
>PF15086 UPF0542: Uncharacterised protein family UPF0542
Probab=90.16 E-value=0.79 Score=26.90 Aligned_cols=36 Identities=11% Similarity=0.085 Sum_probs=23.7
Q ss_pred chhHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhhhhh
Q psy6126 141 SPKEFLFLVAFVIPCISIV-VCYARIFYIVRKTAMKS 176 (214)
Q Consensus 141 ~~~~~~~~~~~~~p~~~~~-~~y~~i~~~l~~~~~~~ 176 (214)
.+..+..++..+.|++++. ++.+++.+.+.+..++.
T Consensus 19 P~~Fl~~vll~LtPlfiisa~lSwkLaK~ie~~ere~ 55 (74)
T PF15086_consen 19 PYEFLTTVLLILTPLFIISAVLSWKLAKAIEKEEREK 55 (74)
T ss_pred hHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3455667778888877665 66777777776654443
No 39
>PF02076 STE3: Pheromone A receptor; InterPro: IPR001499 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). Little is known about the structure and function of the mating factor receptors, STE2 and STE3. It is believed, however, that they are integral membrane proteins that may be involved in the response to mating factors on the cell membrane [, , ]. The amino acid sequences of both receptors contain high proportions of hydrophobic residues grouped into 7 domains, in a manner reminiscent of the rhodopsins and other receptors believed to interact with G-proteins. However, while a similar 3D framework has been proposed to account for this, there is no significant sequence similarity either between STE2 and STE3, or between these and the rhodopsin-type family: the receptors thus bear their own unique '7TM' signatures. The STE3 gene of Saccharomyces cerevisiae (Baker's yeast) is the cell-surface receptor that binds the 13-residue lipopeptide a-factor. Several related fungal pheromone receptor sequences are known: these include pheromone B alpha 1 and B alpha 3, and pheromone B beta 1 receptors from Schizophyllum commune; pheromone receptor 1 from Ustilago hordei; and pheromone receptors 1 and 2 from Ustilago maydis. Members of the family share about 20% sequence identity.; GO: 0004932 mating-type factor pheromone receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane
Probab=88.80 E-value=9.3 Score=29.49 Aligned_cols=105 Identities=14% Similarity=0.104 Sum_probs=57.5
Q ss_pred CchhhHHHHHHHHHHHHHHhhhhhhhhhhhcccccccccccchhhHHHHHHHHHHHHHHHHHHHHhheeeeccCCc--Cc
Q psy6126 12 NATAVFIINLSVSDLMFCCFNLPLAASTFWQRAWTHGHLLCQLFPLLRYGLLAVSLFTVLGITINRYVMIGHPTLY--PK 89 (214)
Q Consensus 12 ~~~~~~i~~Lav~Dll~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~s~~~l~~is~dRy~aI~~P~~~--~~ 89 (214)
.+.-.++..+.+.|+...+= ...+-.+. ...|+ ..|.+...+.......-....++|+-.=| .|..+-+. .+
T Consensus 24 ~~~~~li~Wl~l~nl~~~IN-aiIW~~n~-~~~~~---~wCDI~~kl~~~~~~g~~~a~lcI~r~L~-~I~s~r~~~~~~ 97 (283)
T PF02076_consen 24 IGTCSLIFWLFLSNLIYFIN-AIIWRDND-INWWP---VWCDISTKLIIGSSVGIPAASLCIMRRLY-RIASARAVSLSR 97 (283)
T ss_pred HHHHHHHHHHHHHHHHHHHh-hhhcCCCC-CccCc---eeeeehHHHHHHHHHHHHHHHHHHHHHHH-HHhccccCCCCh
Confidence 45667777788888776631 11111111 12222 78999988877776655444544443333 23333211 11
Q ss_pred cccccchhhhhHHHHHHHHHHhhhhhhhccccc
Q psy6126 90 LYSSKFLAFMVACTWLFGFGALVPTWLGVWGRF 122 (214)
Q Consensus 90 ~~t~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~ 122 (214)
.-.+|+...=.++++.+.++.....++....++
T Consensus 98 ~~~rr~~~~d~~i~~g~Pil~m~l~yivQ~~Rf 130 (283)
T PF02076_consen 98 SEKRRRIIIDLLICFGIPILQMALHYIVQGHRF 130 (283)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcccce
Confidence 112566666688888888887777666544443
No 40
>PF03125 Sre: C. elegans Sre G protein-coupled chemoreceptor; InterPro: IPR004151 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class e (Sre) from the Sra superfamily []. ; GO: 0004888 transmembrane signaling receptor activity, 0007606 sensory perception of chemical stimulus, 0016021 integral to membrane
Probab=87.48 E-value=5.8 Score=31.80 Aligned_cols=35 Identities=26% Similarity=0.360 Sum_probs=27.5
Q ss_pred hHHHHHHHHHHHHHHHHHHHHhheeeeccCCcCcc
Q psy6126 56 PLLRYGLLAVSLFTVLGITINRYVMIGHPTLYPKL 90 (214)
Q Consensus 56 ~~~~~~~~~~s~~~l~~is~dRy~aI~~P~~~~~~ 90 (214)
+++.....+.....+.++++||.+|-.+--.|.+.
T Consensus 127 ~~l~~~y~~~~~~~~~~~~iER~~AT~~i~dYEk~ 161 (365)
T PF03125_consen 127 GFLRWHYMFSAIFCLLAIVIERCFATYFIKDYEKK 161 (365)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 34444455666889999999999999998888774
No 41
>PF13853 7tm_4: Olfactory receptor
Probab=79.45 E-value=0.099 Score=35.80 Aligned_cols=76 Identities=21% Similarity=0.210 Sum_probs=42.1
Q ss_pred hhhHHHHHHHHHHhhhhhhh-ccccccccCccceeeec------CC-CCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy6126 98 FMVACTWLFGFGALVPTWLG-VWGRFGLEPSIGSCSIL------PD-DYGHSPKEFLFLVAFVIPCISIVVCYARIFYIV 169 (214)
Q Consensus 98 ~~i~~~w~~~~~~~~~~~~~-~~~~~~~~~~~~~C~~~------~~-~~~~~~~~~~~~~~~~~p~~~~~~~y~~i~~~l 169 (214)
.+.+..|..+++..++.... ......+..+...|+.. .+ ..++.+.........+.|+.+++.+|++|+..+
T Consensus 5 ~l~~~~~~~~~~~~l~~~~~~~~l~~~nii~~f~c~~~ll~LaC~dt~~~~~~~~~~~~~~~~~~~~~Il~SY~~Il~av 84 (144)
T PF13853_consen 5 LLAAGSWLSGLLNSLPHTLLTLSLCFCNIIHHFCCDPPLLKLACSDTSINEIVGFVVAIFILLGPLLLILFSYIRILRAV 84 (144)
T ss_pred hhhHHHHHHHHHHHHHHHHHHeeCCCCCCCcceeeCHHHhcccCCchhhhheeeecccceeEEEEeeccccceeEEEehh
Confidence 35567788888777764432 11112222222233311 01 112222333345567899999999999999988
Q ss_pred Hhhh
Q psy6126 170 RKTA 173 (214)
Q Consensus 170 ~~~~ 173 (214)
.|..
T Consensus 85 lki~ 88 (144)
T PF13853_consen 85 LKIP 88 (144)
T ss_pred hccc
Confidence 7754
No 42
>COG1862 YajC Preprotein translocase subunit YajC [Intracellular trafficking and secretion]
Probab=62.64 E-value=15 Score=23.25 Aligned_cols=27 Identities=15% Similarity=0.172 Sum_probs=14.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhhhh
Q psy6126 151 FVIPCISIVVCYARIFYIVRKTAMKSR 177 (214)
Q Consensus 151 ~~~p~~~~~~~y~~i~~~l~~~~~~~~ 177 (214)
++.+++++.++|..+.+--+|+.++.+
T Consensus 12 ll~~vl~~~ifyFli~RPQrKr~K~~~ 38 (97)
T COG1862 12 LLPLVLIFAIFYFLIIRPQRKRMKEHQ 38 (97)
T ss_pred HHHHHHHHHHHHHhhcCHHHHHHHHHH
Confidence 333445566677766666555544433
No 43
>PF10325 7TM_GPCR_Srz: Serpentine type 7TM GPCR chemoreceptor Srz; InterPro: IPR018817 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class z (Srz), a solo family amongst the superfamilies of chemoreceptors [, ]. The genes encoding Srz appear to be under strong adaptive evolutionary pressure [].
Probab=62.32 E-value=66 Score=24.33 Aligned_cols=55 Identities=15% Similarity=0.161 Sum_probs=32.3
Q ss_pred hHHHHHHHHHHHHHHHHHHHHhheeeeccCCcC-ccccccchhhhhHHHHHHHHHH
Q psy6126 56 PLLRYGLLAVSLFTVLGITINRYVMIGHPTLYP-KLYSSKFLAFMVACTWLFGFGA 110 (214)
Q Consensus 56 ~~~~~~~~~~s~~~l~~is~dRy~aI~~P~~~~-~~~t~~~~~~~i~~~w~~~~~~ 110 (214)
.+.......+--+.+..+|++|+.---.|..-+ ...+++.....+...+....+-
T Consensus 86 ~~~l~i~~~v~~lllsLLAIqRFllyFfP~~Ek~v~~~~k~~~~~I~~lY~~~~~k 141 (267)
T PF10325_consen 86 IFILYIITQVFHLLLSLLAIQRFLLYFFPSSEKYVNFSQKNIKKIIWFLYIFFILK 141 (267)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCCchhhhhhhhhhhHHHHHHHHHHHHHHH
Confidence 344445556666778999999999988886211 1124444444444444444443
No 44
>KOG4016|consensus
Probab=61.56 E-value=34 Score=25.01 Aligned_cols=55 Identities=13% Similarity=0.088 Sum_probs=30.4
Q ss_pred hhhhhHHHHHHHHHHhhhhhhhccccccccCccceeeecCCCCCCchhHHHHHHHH
Q psy6126 96 LAFMVACTWLFGFGALVPTWLGVWGRFGLEPSIGSCSILPDDYGHSPKEFLFLVAF 151 (214)
Q Consensus 96 ~~~~i~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~~~~ 151 (214)
..++-+++|++++++.....-..+.+. .+..+..|.++.+...=.|...+.++.+
T Consensus 25 ~ti~R~~~~lFsliVf~si~~eGy~n~-~~~~~~~Ciynrn~~ACsyg~avG~~Af 79 (233)
T KOG4016|consen 25 QTILRVVSWLFSLIVFGSIVNEGYLNS-ASSGEEFCIYNRNSNACSYGVAVGVLAF 79 (233)
T ss_pred hhHHHHHHHHHHHhheeeeccccccCc-ccCCceEEEECCCCcchhHHHHHHHHHH
Confidence 345667889999887665443222221 2334458887766554445444444333
No 45
>PF01528 Herpes_glycop: Herpesvirus glycoprotein M; InterPro: IPR000785 The Equid herpesvirus 1 (Equine herpesvirus 1, EHV-1) protein belongs to a family of sequences that groups together Human herpesvirus 1 (HHV-1) UL10, EHV-1 52, Human herpesvirus 3 (HHV-3) 50, Epstein-Barr virus (strain GD1) (HHV-4) (Human herpesvirus 4) BBRF3, Human herpesvirus 1 (HHV-1) 39 and Human cytomegalovirus (HHV-5) UL100. Little is yet known about the properties of the protein. However, its amino acid sequence is highly hydrophobic, containing 8 putative membrane-spanning regions, and it is therefore believed to be either membrane-associated or transmembrane.; GO: 0016020 membrane
Probab=52.89 E-value=1.2e+02 Score=24.54 Aligned_cols=35 Identities=11% Similarity=0.085 Sum_probs=25.0
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh
Q psy6126 141 SPKEFLFLVAFVIPCISIVVCYARIFYIVRKTAMK 175 (214)
Q Consensus 141 ~~~~~~~~~~~~~p~~~~~~~y~~i~~~l~~~~~~ 175 (214)
.+.....+..-++|++.++..-.++.|..+.|+++
T Consensus 300 ~~~~~i~~~la~i~~i~l~~~vvR~vR~~~~hr~~ 334 (374)
T PF01528_consen 300 NLHTGIAINLAVIAIICLIMMVVRLVRAFLYHRRR 334 (374)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc
Confidence 34445556667788888888888888888776553
No 46
>TIGR02736 cbb3_Q_epsi cytochrome c oxidase, cbb3-type, CcoQ subunit, epsilon-Proteobacterial. Members of this protein family are restricted to the epsilon branch of the Proteobacteria. All members are found in operons containing the other three structural subunits of the cbb3 type of cytochrome c oxidase. These small proteins show remote sequence similarity to the CcoQ subunit in other cytochrome c oxidase systems, so this family is assumed to represent the epsilonproteobacterial variant of CcoQ.
Probab=52.26 E-value=41 Score=18.79 Aligned_cols=28 Identities=21% Similarity=0.426 Sum_probs=16.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhhhhh
Q psy6126 151 FVIPCISIVVCYARIFYIVRKTAMKSRA 178 (214)
Q Consensus 151 ~~~p~~~~~~~y~~i~~~l~~~~~~~~~ 178 (214)
+++.+++.+++|+-++..-|+..+..+.
T Consensus 5 f~~ti~lvv~LYgY~yhLYrsek~G~rd 32 (56)
T TIGR02736 5 FAFTLLLVIFLYAYIYHLYRSQKKGERD 32 (56)
T ss_pred HHHHHHHHHHHHHHHHHhhhhhcccccC
Confidence 4445556677788877665554444333
No 47
>PRK06531 yajC preprotein translocase subunit YajC; Validated
Probab=51.38 E-value=22 Score=23.29 Aligned_cols=14 Identities=7% Similarity=-0.090 Sum_probs=5.5
Q ss_pred HHHHHHHHHHHHHH
Q psy6126 153 IPCISIVVCYARIF 166 (214)
Q Consensus 153 ~p~~~~~~~y~~i~ 166 (214)
+|++++...++.++
T Consensus 7 l~~vv~~~i~yf~i 20 (113)
T PRK06531 7 IMFVVMLGLIFFMQ 20 (113)
T ss_pred HHHHHHHHHHHhee
Confidence 34444443333333
No 48
>PF02699 YajC: Preprotein translocase subunit; InterPro: IPR003849 Secretion across the inner membrane in some Gram-negative bacteria occurs via the preprotein translocase pathway. Proteins are produced in the cytoplasm as precursors, and require a chaperone subunit to direct them to the translocase component []. From there, the mature proteins are either targeted to the outer membrane, or remain as periplasmic proteins []. The translocase protein subunits are encoded on the bacterial chromosome. The translocase itself comprises 7 proteins, including a chaperone (SecB), ATPase (SecA), an integral membrane complex (SecY, SecE and SecG), and two additional membrane proteins that promote the release of the mature peptide into the periplasm (SecD and SecF) []. Other cytoplasmic/periplasmic proteins play a part in preprotein translocase activity, namely YidC and YajC []. The latter is bound in a complex to SecD and SecF, and plays a part in stabilising and regulating secretion through the SecYEG integral membrane component via SecA []. Homologues of the YajC gene have been found in a range of pathogenic and commensal microbes. Brucella abortis YajC- and SecD-like proteins were shown to stimulate a Th1 cell-mediated immune response in mice, and conferred protection when challenged with B.abortis []. Therefore, these proteins may have an antigenic role as well as a secretory one in virulent bacteria []. A number of previously uncharacterised "hypothetical" proteins also show similarity to E.coli YajC, suggesting that this family is wider than first thought []. More recently, the precise interactions between the E.coli SecYEG complex, SecD, SecF, YajC and YidC have been studied []. Rather than acting individually, the four proteins form a heterotetrameric complex and associate with the SecYEG heterotrimeric complex []. The SecF and YajC subunits link the complex to the integral membrane translocase. ; PDB: 2RDD_B.
Probab=48.24 E-value=47 Score=20.22 Aligned_cols=17 Identities=24% Similarity=0.309 Sum_probs=8.0
Q ss_pred HHHHHHHHHHHHHHHhh
Q psy6126 156 ISIVVCYARIFYIVRKT 172 (214)
Q Consensus 156 ~~~~~~y~~i~~~l~~~ 172 (214)
+++.++|..+.+--+|+
T Consensus 10 ~~~~i~yf~~~rpqkk~ 26 (82)
T PF02699_consen 10 IIFVIFYFLMIRPQKKQ 26 (82)
T ss_dssp HHHHHHHHHTHHHHHHH
T ss_pred HHHHHHhhheecHHHHH
Confidence 44445555544444433
No 49
>TIGR00739 yajC preprotein translocase, YajC subunit. While this protein is part of the preprotein translocase in Escherichia coli, it is not essential for viability or protein secretion. The N-terminus region contains a predicted membrane-spanning region followed by a region consisting almost entirely of residues with charged (acidic, basic, or zwitterionic) side chains. This small protein is about 100 residues in length, and is restricted to bacteria; however, this protein is absent from some lineages, including spirochetes and Mycoplasmas.
Probab=45.10 E-value=37 Score=20.86 Aligned_cols=19 Identities=21% Similarity=0.195 Sum_probs=9.2
Q ss_pred HHHHHHHHHHHHHHhhhhh
Q psy6126 157 SIVVCYARIFYIVRKTAMK 175 (214)
Q Consensus 157 ~~~~~y~~i~~~l~~~~~~ 175 (214)
+++++|..+.+--+|+.++
T Consensus 12 ~~~i~yf~~~rpqkK~~k~ 30 (84)
T TIGR00739 12 IFLIFYFLIIRPQRKRRKA 30 (84)
T ss_pred HHHHHHHheechHHHHHHH
Confidence 3445565555544444333
No 50
>PRK04989 psbM photosystem II reaction center protein M; Provisional
Probab=44.45 E-value=38 Score=16.88 Aligned_cols=20 Identities=15% Similarity=0.467 Sum_probs=15.2
Q ss_pred HHHHHHHHHHHHHHHHHHHH
Q psy6126 145 FLFLVAFVIPCISIVVCYAR 164 (214)
Q Consensus 145 ~~~~~~~~~p~~~~~~~y~~ 164 (214)
....++..+|..++++.|++
T Consensus 9 iAt~Lfi~iPt~FLlilYvq 28 (35)
T PRK04989 9 VASLLFVLVPTVFLIILYIQ 28 (35)
T ss_pred HHHHHHHHHHHHHHHHHhee
Confidence 34566778999999988864
No 51
>PRK05886 yajC preprotein translocase subunit YajC; Validated
Probab=43.54 E-value=56 Score=21.24 Aligned_cols=20 Identities=15% Similarity=-0.018 Sum_probs=9.1
Q ss_pred HHHHH-HHHHHHHHHHHHHHh
Q psy6126 152 VIPCI-SIVVCYARIFYIVRK 171 (214)
Q Consensus 152 ~~p~~-~~~~~y~~i~~~l~~ 171 (214)
++|++ ++.++|..+.+--+|
T Consensus 7 ll~lv~i~~i~yF~~iRPQkK 27 (109)
T PRK05886 7 FLPFLLIMGGFMYFASRRQRK 27 (109)
T ss_pred HHHHHHHHHHHHHHHccHHHH
Confidence 34443 344556555444333
No 52
>PF05545 FixQ: Cbb3-type cytochrome oxidase component FixQ; InterPro: IPR008621 This family consists of several Cbb3-type cytochrome oxidase components (FixQ/CcoQ). FixQ is found in nitrogen fixing bacteria. Since nitrogen fixation is an energy-consuming process, effective symbioses depend on operation of a respiratory chain with a high affinity for O2, closely coupled to ATP production. This requirement is fulfilled by a special three-subunit terminal oxidase (cytochrome terminal oxidase cbb3), which was first identified in Bradyrhizobium japonicum as the product of the fixNOQP operon [].
Probab=43.45 E-value=53 Score=17.59 Aligned_cols=25 Identities=12% Similarity=0.118 Sum_probs=12.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhh
Q psy6126 151 FVIPCISIVVCYARIFYIVRKTAMK 175 (214)
Q Consensus 151 ~~~p~~~~~~~y~~i~~~l~~~~~~ 175 (214)
.+..+++++++.+.++...++++++
T Consensus 12 ~~~~v~~~~~F~gi~~w~~~~~~k~ 36 (49)
T PF05545_consen 12 SIGTVLFFVFFIGIVIWAYRPRNKK 36 (49)
T ss_pred HHHHHHHHHHHHHHHHHHHcccchh
Confidence 3444444555555556666655443
No 53
>TIGR02976 phageshock_pspB phage shock protein B. This model describes the PspB protein of the psp (phage shock protein) operon, as found in Escherichia coli and many related species. Expression of a phage protein called secretin protein IV, and a number of other stresses including ethanol, heat shock, and defects in protein secretion trigger sigma-54-dependent expression of the phage shock regulon. PspB is both a regulator and an effector protein of the phage shock response.
Probab=42.55 E-value=55 Score=19.67 Aligned_cols=23 Identities=26% Similarity=0.431 Sum_probs=13.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhh
Q psy6126 150 AFVIPCISIVVCYARIFYIVRKT 172 (214)
Q Consensus 150 ~~~~p~~~~~~~y~~i~~~l~~~ 172 (214)
++++|++++++.-.-+...++-.
T Consensus 5 fl~~Pliif~ifVap~wl~lHY~ 27 (75)
T TIGR02976 5 FLAIPLIIFVIFVAPLWLILHYR 27 (75)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 45677776666555555554443
No 54
>TIGR03038 PS_II_psbM photosystem II reaction center protein PsbM. Members of this protein family are the photosystem II reaction center M protein, product of the psbM gene, in Cyanobacteria and their derived organelles in plants. This model resembles Pfam model pfam05151 but has cutoffs set to avoid false-positive matches to similar (not necessarily homologous) sequences in species that are not photosynthetic.
Probab=41.05 E-value=47 Score=16.30 Aligned_cols=20 Identities=15% Similarity=0.486 Sum_probs=15.2
Q ss_pred HHHHHHHHHHHHHHHHHHHH
Q psy6126 145 FLFLVAFVIPCISIVVCYAR 164 (214)
Q Consensus 145 ~~~~~~~~~p~~~~~~~y~~ 164 (214)
....++..+|..+.++.|++
T Consensus 9 iAt~Lfi~iPt~FLiilYvq 28 (33)
T TIGR03038 9 IATLLFILVPTVFLLILYIQ 28 (33)
T ss_pred HHHHHHHHHHHHHHHHHhee
Confidence 34566778999999988864
No 55
>COG4736 CcoQ Cbb3-type cytochrome oxidase, subunit 3 [Posttranslational modification, protein turnover, chaperones]
Probab=37.61 E-value=81 Score=18.02 Aligned_cols=31 Identities=10% Similarity=-0.076 Sum_probs=17.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhhhhhh
Q psy6126 150 AFVIPCISIVVCYARIFYIVRKTAMKSRAMN 180 (214)
Q Consensus 150 ~~~~p~~~~~~~y~~i~~~l~~~~~~~~~~~ 180 (214)
-.+..+.+.++++..++...|++.|......
T Consensus 11 ~a~~t~~~~l~fiavi~~ayr~~~K~~~d~a 41 (60)
T COG4736 11 DAWGTIAFTLFFIAVIYFAYRPGKKGEFDEA 41 (60)
T ss_pred HHHHHHHHHHHHHHHHHHHhcccchhhHHHH
Confidence 3344445555666667777777666554443
No 56
>PRK09458 pspB phage shock protein B; Provisional
Probab=37.05 E-value=48 Score=19.86 Aligned_cols=27 Identities=19% Similarity=0.290 Sum_probs=16.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhh
Q psy6126 149 VAFVIPCISIVVCYARIFYIVRKTAMK 175 (214)
Q Consensus 149 ~~~~~p~~~~~~~y~~i~~~l~~~~~~ 175 (214)
.++++|+++.+++-.-|...+.-.+|+
T Consensus 4 ~fl~~PliiF~ifVaPiWL~LHY~sk~ 30 (75)
T PRK09458 4 LFLAIPLTIFVLFVAPIWLWLHYRSKR 30 (75)
T ss_pred hHHHHhHHHHHHHHHHHHHHHhhcccc
Confidence 345677777776666666666554443
No 57
>PF09835 DUF2062: Uncharacterized protein conserved in bacteria (DUF2062); InterPro: IPR018639 This domain, found in various prokaryotic proteins, has no known function. It is found at the C-terminal of family 2 glycosyltransferase proteins, in addition to proteins of unknown function.
Probab=36.82 E-value=1.2e+02 Score=20.81 Aligned_cols=31 Identities=19% Similarity=0.467 Sum_probs=20.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhhhhh
Q psy6126 148 LVAFVIPCISIVVCYARIFYIVRKTAMKSRA 178 (214)
Q Consensus 148 ~~~~~~p~~~~~~~y~~i~~~l~~~~~~~~~ 178 (214)
+...++-++..++.|..+...+++.++++++
T Consensus 122 ~G~~i~~~v~~~i~Y~l~~~~~~~~r~~r~~ 152 (154)
T PF09835_consen 122 LGSLILGIVLGIISYFLVYFLVRKYRKRRRK 152 (154)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 3445555666677888888777776665543
No 58
>PRK05585 yajC preprotein translocase subunit YajC; Validated
Probab=36.61 E-value=79 Score=20.41 Aligned_cols=18 Identities=17% Similarity=0.185 Sum_probs=8.7
Q ss_pred HHHHHHHHHHHHHhhhhh
Q psy6126 158 IVVCYARIFYIVRKTAMK 175 (214)
Q Consensus 158 ~~~~y~~i~~~l~~~~~~ 175 (214)
++++|..+.+--+|++++
T Consensus 28 ~~i~yf~~~RpqkK~~k~ 45 (106)
T PRK05585 28 FAIFYFLIIRPQQKRQKE 45 (106)
T ss_pred HHHHHHHhccHHHHHHHH
Confidence 334565555544444433
No 59
>PF11044 TMEMspv1-c74-12: Plectrovirus spv1-c74 ORF 12 transmembrane protein; InterPro: IPR022743 This is a group of proteins expressed by Plectroviruses. The Plectroviruses are single-stranded DNA viruses belonging to the Inoviridae. This entry represents putative transmembrane proteins of unknown function.
Probab=36.60 E-value=69 Score=16.90 Aligned_cols=14 Identities=7% Similarity=0.057 Sum_probs=7.1
Q ss_pred HHHHHHHHHHHhhh
Q psy6126 160 VCYARIFYIVRKTA 173 (214)
Q Consensus 160 ~~y~~i~~~l~~~~ 173 (214)
..=..|+..+++-+
T Consensus 20 ~iGl~IyQkikqIr 33 (49)
T PF11044_consen 20 WIGLSIYQKIKQIR 33 (49)
T ss_pred HHHHHHHHHHHHHH
Confidence 33345566665543
No 60
>PF15050 SCIMP: SCIMP protein
Probab=35.17 E-value=86 Score=20.70 Aligned_cols=20 Identities=10% Similarity=0.195 Sum_probs=8.6
Q ss_pred hHHHHHHHHHHHHHHHHHHH
Q psy6126 143 KEFLFLVAFVIPCISIVVCY 162 (214)
Q Consensus 143 ~~~~~~~~~~~p~~~~~~~y 162 (214)
.+.+.+.+.++.+.+=+++|
T Consensus 9 WiiLAVaII~vS~~lglIly 28 (133)
T PF15050_consen 9 WIILAVAIILVSVVLGLILY 28 (133)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 33444444444444444444
No 61
>CHL00080 psbM photosystem II protein M
Probab=34.85 E-value=62 Score=15.97 Aligned_cols=20 Identities=10% Similarity=0.376 Sum_probs=14.8
Q ss_pred HHHHHHHHHHHHHHHHHHHH
Q psy6126 145 FLFLVAFVIPCISIVVCYAR 164 (214)
Q Consensus 145 ~~~~~~~~~p~~~~~~~y~~ 164 (214)
....++..+|-.++++.|++
T Consensus 9 iAt~LFi~iPt~FLlilyvk 28 (34)
T CHL00080 9 IATALFILVPTAFLLIIYVK 28 (34)
T ss_pred HHHHHHHHHHHHHHHHhhee
Confidence 34566778898888888864
No 62
>PF05241 EBP: Emopamil binding protein ; InterPro: IPR007905 Emopamil binding protein (EBP) is a nonglycosylated type I integral membrane protein of endoplasmic reticulum and shows high level expression in epithelial tissues. The EBP protein has emopamil binding domains, including the sterol acceptor site and the catalytic centre, which show Delta8-Delta7 sterol isomerase activity. Human sterol isomerase, a homologue of mouse EBP, is suggested not only to play a role in cholesterol biosynthesis, but also to affect lipoprotein internalisation. In humans, mutations of EBP are known to cause the genetic disorder of X-linked dominant chondrodysplasia punctata (CDPX2). This syndrome of humans is lethal in most males, and affected females display asymmetric hyperkeratotic skin and skeletal abnormalities [].; GO: 0047750 cholestenol delta-isomerase activity, 0016125 sterol metabolic process, 0005783 endoplasmic reticulum, 0016021 integral to membrane
Probab=34.57 E-value=1.2e+02 Score=21.90 Aligned_cols=30 Identities=17% Similarity=0.201 Sum_probs=22.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhh
Q psy6126 145 FLFLVAFVIPCISIVVCYARIFYIVRKTAM 174 (214)
Q Consensus 145 ~~~~~~~~~p~~~~~~~y~~i~~~l~~~~~ 174 (214)
+...+-.++|+.++.-++-.|....+.+++
T Consensus 164 ~~N~iWivvP~~~l~~s~~~i~~a~~~~~~ 193 (194)
T PF05241_consen 164 FPNGIWIVVPLLLLYQSWKEIARAFRAAQK 193 (194)
T ss_pred hcchHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 345667899999999999888877766544
No 63
>PF06143 Baculo_11_kDa: Baculovirus 11 kDa family; InterPro: IPR009313 This is a family of uncharacterised Baculovirus proteins that are all about 11 kDa in size.
Probab=34.10 E-value=1.2e+02 Score=18.75 Aligned_cols=26 Identities=15% Similarity=0.415 Sum_probs=13.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy6126 145 FLFLVAFVIPCISIVVCYARIFYIVR 170 (214)
Q Consensus 145 ~~~~~~~~~p~~~~~~~y~~i~~~l~ 170 (214)
++.++..++-+++++.++..++...+
T Consensus 36 FvLVic~~lVfVii~lFi~ll~~i~~ 61 (84)
T PF06143_consen 36 FVLVICCFLVFVIIVLFILLLYNINK 61 (84)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44455555555555555555544433
No 64
>PF06305 DUF1049: Protein of unknown function (DUF1049); InterPro: IPR010445 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=34.04 E-value=94 Score=17.71 Aligned_cols=14 Identities=14% Similarity=0.142 Sum_probs=5.0
Q ss_pred HHHHHHHHHHHHHh
Q psy6126 158 IVVCYARIFYIVRK 171 (214)
Q Consensus 158 ~~~~y~~i~~~l~~ 171 (214)
....+.+.-+.+++
T Consensus 39 ~~~~~~~~r~~~~~ 52 (68)
T PF06305_consen 39 SLPSRLRLRRRIRR 52 (68)
T ss_pred HHHHHHHHHHHHHH
Confidence 33333333333333
No 65
>PHA03237 envelope glycoprotein M; Provisional
Probab=33.72 E-value=2.8e+02 Score=23.04 Aligned_cols=32 Identities=13% Similarity=0.072 Sum_probs=21.0
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh
Q psy6126 143 KEFLFLVAFVIPCISIVVCYARIFYIVRKTAM 174 (214)
Q Consensus 143 ~~~~~~~~~~~p~~~~~~~y~~i~~~l~~~~~ 174 (214)
.....+..-++|++.++..-.++.|...-|++
T Consensus 325 ~~~v~~~Laviail~l~m~vvRlvRa~~yHr~ 356 (424)
T PHA03237 325 RLACKLVLAIFAVIIVIMLVVRLVRACLYHRR 356 (424)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33444555677777777777888777655544
No 66
>PF05151 PsbM: Photosystem II reaction centre M protein (PsbM); InterPro: IPR007826 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection []. This family represents the low molecular weight transmembrane protein PsbM found in PSII. PsbM is one of the most hydrophobic proteins in the thylakoid membrane. The function of this protein is unknown.; GO: 0015979 photosynthesis, 0019684 photosynthesis, light reaction, 0009523 photosystem II, 0016021 integral to membrane; PDB: 3A0H_m 3ARC_m 3A0B_M 3PRR_M 3PRQ_M 1S5L_M 4FBY_e 3BZ2_M 3BZ1_M 2AXT_M ....
Probab=32.56 E-value=67 Score=15.56 Aligned_cols=21 Identities=14% Similarity=0.358 Sum_probs=15.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q psy6126 145 FLFLVAFVIPCISIVVCYARI 165 (214)
Q Consensus 145 ~~~~~~~~~p~~~~~~~y~~i 165 (214)
....++..+|..+.++.|++.
T Consensus 9 iAtaLfi~iPt~FLiilyvqT 29 (31)
T PF05151_consen 9 IATALFILIPTAFLIILYVQT 29 (31)
T ss_dssp HHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHhheEeee
Confidence 345667788999999988763
No 67
>PHA03164 hypothetical protein; Provisional
Probab=29.02 E-value=52 Score=19.61 Aligned_cols=23 Identities=13% Similarity=0.306 Sum_probs=16.2
Q ss_pred CCCCCchhhHHHHHHHHHHHHHH
Q psy6126 8 STVRNATAVFIINLSVSDLMFCC 30 (214)
Q Consensus 8 ~~~~~~~~~~i~~Lav~Dll~~~ 30 (214)
++.|+-+.+.+..||++-++..+
T Consensus 53 nrRktftFlvLtgLaIamILfii 75 (88)
T PHA03164 53 NRRKTFTFLVLTGLAIAMILFII 75 (88)
T ss_pred hhhheeehHHHHHHHHHHHHHHH
Confidence 34456667778889998877654
No 68
>PF10669 Phage_Gp23: Protein gp23 (Bacteriophage A118); InterPro: IPR018926 This entry is represented by the major tail subunit protein, Gp23 of Listeria phage A118 and prophage found in Bacilli. The function is currently unknown.
Probab=28.30 E-value=86 Score=19.72 Aligned_cols=15 Identities=13% Similarity=0.390 Sum_probs=5.9
Q ss_pred HHHHHHHHHHHHHHH
Q psy6126 152 VIPCISIVVCYARIF 166 (214)
Q Consensus 152 ~~p~~~~~~~y~~i~ 166 (214)
+.-++++++.-.+|+
T Consensus 24 i~~FiILLIi~~~IW 38 (121)
T PF10669_consen 24 IVVFIILLIITKSIW 38 (121)
T ss_pred HHHHHHHHHHHHHHh
Confidence 333333444344443
No 69
>PF08374 Protocadherin: Protocadherin; InterPro: IPR013585 The structure of protocadherins is similar to that of classic cadherins (IPR002126 from INTERPRO), but they also have some unique features associated with the cytoplasmic domains. They are expressed in a variety of organisms and are found in high concentrations in the brain where they seem to be localised mainly at cell-cell contact sites. Their expression seems to be developmentally regulated [].
Probab=28.00 E-value=57 Score=24.00 Aligned_cols=9 Identities=33% Similarity=0.397 Sum_probs=3.6
Q ss_pred HHHHHHhhh
Q psy6126 165 IFYIVRKTA 173 (214)
Q Consensus 165 i~~~l~~~~ 173 (214)
|...+|+.+
T Consensus 56 i~v~vR~CR 64 (221)
T PF08374_consen 56 IVVLVRYCR 64 (221)
T ss_pred HHHHHHHHh
Confidence 334443333
No 70
>PF06667 PspB: Phage shock protein B; InterPro: IPR009554 This family consists of several bacterial phage shock protein B (PspB) sequences. The phage shock protein (psp) operon is induced in response to heat, ethanol, osmotic shock and infection by filamentous bacteriophages []. Expression of the operon requires the alternative sigma factor sigma54 and the transcriptional activator PspF. In addition, PspA plays a negative regulatory role, and the integral-membrane proteins PspB and PspC play a positive one [].; GO: 0006355 regulation of transcription, DNA-dependent, 0009271 phage shock
Probab=27.93 E-value=1.4e+02 Score=18.00 Aligned_cols=16 Identities=25% Similarity=0.567 Sum_probs=6.5
Q ss_pred HHHHHHHHHHHHHHHH
Q psy6126 151 FVIPCISIVVCYARIF 166 (214)
Q Consensus 151 ~~~p~~~~~~~y~~i~ 166 (214)
.++|+++.+++-.-+.
T Consensus 6 l~~plivf~ifVap~W 21 (75)
T PF06667_consen 6 LFVPLIVFMIFVAPIW 21 (75)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3444444444333333
No 71
>PHA02909 hypothetical protein; Provisional
Probab=26.61 E-value=1.2e+02 Score=16.79 Aligned_cols=29 Identities=17% Similarity=0.504 Sum_probs=22.9
Q ss_pred cccccccchhhHHHHHHHHHHHHHHHHHH
Q psy6126 46 THGHLLCQLFPLLRYGLLAVSLFTVLGIT 74 (214)
Q Consensus 46 ~~~~~~C~~~~~~~~~~~~~s~~~l~~is 74 (214)
...+..|....|+.....+.|++++.+-+
T Consensus 27 ytentfcimvsfilfviiflsmftilacs 55 (72)
T PHA02909 27 YTENTFCIMVSFILFVIIFLSMFTILACS 55 (72)
T ss_pred EeccchhHHHHHHHHHHHHHHHHHHHHHH
Confidence 34567899999988888888888887654
No 72
>COG3924 Predicted membrane protein [Function unknown]
Probab=25.71 E-value=1.5e+02 Score=17.55 Aligned_cols=28 Identities=25% Similarity=0.626 Sum_probs=20.5
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy6126 143 KEFLFLVAFVIPCISIVVCYARIFYIVR 170 (214)
Q Consensus 143 ~~~~~~~~~~~p~~~~~~~y~~i~~~l~ 170 (214)
-.+.-...+.+|++++++|+..|-.+.+
T Consensus 43 P~WFE~aCi~lPllFi~l~~~mvkfif~ 70 (80)
T COG3924 43 PLWFEMACILLPLLFIVLCWAMVKFIFR 70 (80)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 3455566789999999999987654444
No 73
>PLN00090 photosystem II reaction center M protein; Provisional
Probab=24.82 E-value=1.4e+02 Score=18.79 Aligned_cols=20 Identities=10% Similarity=0.497 Sum_probs=15.7
Q ss_pred HHHHHHHHHHHHHHHHHHHH
Q psy6126 145 FLFLVAFVIPCISIVVCYAR 164 (214)
Q Consensus 145 ~~~~~~~~~p~~~~~~~y~~ 164 (214)
.-..++.++|..+++++|++
T Consensus 79 IATaLFIlIPTaFLLILYVQ 98 (113)
T PLN00090 79 LAVALGTFLPCLFLINLFIQ 98 (113)
T ss_pred HHHHHHHHHHHHHHHHHHhh
Confidence 34567788999999999974
No 74
>KOG4298|consensus
Probab=24.29 E-value=2.8e+02 Score=20.07 Aligned_cols=24 Identities=21% Similarity=0.365 Sum_probs=18.5
Q ss_pred cchhhhhHHHHHHHHHHhhhhhhh
Q psy6126 94 KFLAFMVACTWLFGFGALVPTWLG 117 (214)
Q Consensus 94 ~~~~~~i~~~w~~~~~~~~~~~~~ 117 (214)
-|...-|=..|.++.++++..++.
T Consensus 140 erlh~yIelaW~FST~iGllLFL~ 163 (245)
T KOG4298|consen 140 ERLHFYIELAWAFSTVIGLLLFLA 163 (245)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 355667888999999998887654
No 75
>PF01102 Glycophorin_A: Glycophorin A; InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=23.14 E-value=2e+02 Score=19.13 Aligned_cols=12 Identities=33% Similarity=0.448 Sum_probs=4.8
Q ss_pred HHHHHHHhhhhh
Q psy6126 164 RIFYIVRKTAMK 175 (214)
Q Consensus 164 ~i~~~l~~~~~~ 175 (214)
.|++.+||++|+
T Consensus 83 li~y~irR~~Kk 94 (122)
T PF01102_consen 83 LISYCIRRLRKK 94 (122)
T ss_dssp HHHHHHHHHS--
T ss_pred HHHHHHHHHhcc
Confidence 344455555444
No 76
>COG3114 CcmD Heme exporter protein D [Intracellular trafficking and secretion]
Probab=22.18 E-value=1.7e+02 Score=16.91 Aligned_cols=24 Identities=17% Similarity=0.105 Sum_probs=14.9
Q ss_pred CchhHHHHHHHHHHHHHHHHHHHH
Q psy6126 140 HSPKEFLFLVAFVIPCISIVVCYA 163 (214)
Q Consensus 140 ~~~~~~~~~~~~~~p~~~~~~~y~ 163 (214)
..++.+..+..-++|+++.++-..
T Consensus 15 yafyVWlA~~~tll~l~~l~v~sv 38 (67)
T COG3114 15 YAFYVWLAVGMTLLPLAVLVVHSV 38 (67)
T ss_pred chHHHHHHHHHHHHHHHHHHHHHH
Confidence 445566666667777776665443
No 77
>PRK14094 psbM photosystem II reaction center protein M; Provisional
Probab=21.06 E-value=64 Score=17.27 Aligned_cols=20 Identities=15% Similarity=0.218 Sum_probs=14.5
Q ss_pred HHHHHHHHHHHHHHHHHHHH
Q psy6126 145 FLFLVAFVIPCISIVVCYAR 164 (214)
Q Consensus 145 ~~~~~~~~~p~~~~~~~y~~ 164 (214)
....++..+|-+++++.|++
T Consensus 9 iAtaLFi~iPT~FLlilYVk 28 (50)
T PRK14094 9 VASLLFVGVPTIFLIGLFIS 28 (50)
T ss_pred HHHHHHHHHHHHHhhheeEE
Confidence 34566778888888888863
No 78
>PF08693 SKG6: Transmembrane alpha-helix domain; InterPro: IPR014805 SKG6 and AXL2 are membrane proteins that show polarised intracellular localisation [, ]. This entry represents the highly conserved transmembrane alpha-helical domain found in these proteins [, ]. The full-length AXL2 protein has a negative regulatory function in cytokinesis [].
Probab=20.36 E-value=37 Score=17.58 Aligned_cols=20 Identities=15% Similarity=0.436 Sum_probs=9.2
Q ss_pred HHHHHHHHHHHHHHHHHHHh
Q psy6126 152 VIPCISIVVCYARIFYIVRK 171 (214)
Q Consensus 152 ~~p~~~~~~~y~~i~~~l~~ 171 (214)
.+|+..+++.-+.++...||
T Consensus 18 vVPV~vI~~vl~~~l~~~~r 37 (40)
T PF08693_consen 18 VVPVGVIIIVLGAFLFFWYR 37 (40)
T ss_pred EechHHHHHHHHHHhheEEe
Confidence 44555555444444443343
Done!