Query psy4354
Match_columns 342
No_of_seqs 140 out of 1561
Neff 10.4
Searched_HMMs 46136
Date Fri Aug 16 23:25:02 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy4354.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/4354hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4220|consensus 100.0 2.9E-40 6.2E-45 280.4 21.0 331 9-340 108-493 (503)
2 KOG4219|consensus 100.0 1.4E-34 3E-39 246.5 8.1 211 5-336 109-337 (423)
3 PHA03235 DNA packaging protein 100.0 3.1E-32 6.7E-37 245.9 15.6 200 4-335 105-324 (409)
4 PHA03234 DNA packaging protein 100.0 8.9E-32 1.9E-36 237.8 18.1 194 4-332 105-314 (338)
5 PHA02834 chemokine receptor-li 100.0 1.3E-31 2.7E-36 236.7 13.8 199 4-337 98-312 (323)
6 PHA02638 CC chemokine receptor 100.0 2.7E-29 5.8E-34 227.8 16.0 201 4-336 168-399 (417)
7 PHA03087 G protein-coupled che 100.0 1.6E-27 3.5E-32 212.4 14.4 198 4-335 111-324 (335)
8 PF00001 7tm_1: 7 transmembran 99.9 2.4E-26 5.1E-31 196.6 14.2 195 4-317 57-257 (257)
9 KOG2087|consensus 99.8 2.3E-20 5E-25 158.2 0.9 199 3-335 103-307 (363)
10 PF10324 7TM_GPCR_Srw: Serpent 99.7 2.6E-16 5.6E-21 139.3 11.4 209 5-335 79-318 (318)
11 PF10320 7TM_GPCR_Srsx: Serpen 99.6 2E-14 4.4E-19 122.6 13.9 193 3-332 60-257 (257)
12 PF10323 7TM_GPCR_Srv: Serpent 99.5 8.6E-13 1.9E-17 114.1 12.4 198 12-334 83-283 (283)
13 PF10328 7TM_GPCR_Srx: Serpent 99.3 4.1E-11 8.8E-16 103.6 15.3 198 4-332 67-273 (274)
14 PF05296 TAS2R: Mammalian tast 99.1 2.1E-08 4.5E-13 87.6 18.1 206 9-336 87-303 (303)
15 PF05462 Dicty_CAR: Slime mold 98.9 2.1E-07 4.5E-12 80.6 19.9 70 265-337 204-273 (303)
16 PF10321 7TM_GPCR_Srt: Serpent 98.9 4E-09 8.8E-14 91.7 9.3 80 254-340 234-313 (313)
17 PF02118 Srg: Srg family chemo 98.3 4.1E-06 8.9E-11 72.6 9.3 114 8-122 85-199 (275)
18 KOG4193|consensus 98.3 1.6E-05 3.6E-10 75.0 13.1 201 4-339 386-586 (610)
19 PF10317 7TM_GPCR_Srd: Serpent 98.2 9E-05 1.9E-09 64.7 16.6 75 254-330 218-292 (292)
20 PF10292 7TM_GPCR_Srab: Serpen 98.2 9.9E-05 2.2E-09 65.4 16.1 60 2-62 94-154 (324)
21 PF10318 7TM_GPCR_Srh: Serpent 98.0 0.00029 6.3E-09 61.9 15.6 78 254-334 224-302 (302)
22 PF04789 DUF621: Protein of un 98.0 0.001 2.3E-08 55.4 16.2 115 6-121 92-216 (305)
23 PF03125 Sre: C. elegans Sre G 97.9 0.00057 1.2E-08 61.6 15.4 60 4-63 123-183 (365)
24 PF02101 Ocular_alb: Ocular al 97.9 0.00074 1.6E-08 59.0 14.4 96 7-118 120-216 (405)
25 PF10326 7TM_GPCR_Str: Serpent 97.7 5.6E-06 1.2E-10 72.9 -0.6 73 254-329 234-307 (307)
26 PF10327 7TM_GPCR_Sri: Serpent 97.7 0.0017 3.6E-08 57.0 14.2 72 254-327 231-302 (303)
27 PF11970 Git3_C: G protein-cou 97.6 0.00034 7.3E-09 47.0 6.3 71 254-324 3-74 (76)
28 PF11710 Git3: G protein-coupl 97.2 0.0011 2.4E-08 54.1 6.7 116 4-124 74-200 (201)
29 PF03402 V1R: Vomeronasal orga 97.0 0.00054 1.2E-08 58.0 3.4 70 256-328 194-263 (265)
30 PF02117 7TM_GPCR_Sra: Serpent 97.0 0.018 4E-07 51.0 13.1 81 5-90 102-182 (328)
31 PF10322 7TM_GPCR_Sru: Serpent 96.9 0.025 5.3E-07 49.4 12.6 115 8-122 107-226 (307)
32 PF10319 7TM_GPCR_Srj: Serpent 95.5 0.63 1.4E-05 40.4 13.3 72 254-327 237-309 (310)
33 KOG4564|consensus 95.4 0.032 6.9E-07 51.1 5.7 71 258-335 349-420 (473)
34 PF10316 7TM_GPCR_Srbc: Serpen 94.9 0.27 5.8E-06 42.3 9.5 105 10-115 87-195 (273)
35 PF00002 7tm_2: 7 transmembran 94.5 0.011 2.3E-07 50.1 0.0 101 4-117 72-172 (242)
36 PF02175 7TM_GPCR_Srb: Serpent 93.7 0.42 9.2E-06 39.4 7.8 106 10-120 91-197 (236)
37 PF01534 Frizzled: Frizzled/Sm 93.4 3.6 7.8E-05 36.4 13.6 77 6-87 96-173 (328)
38 PF02076 STE3: Pheromone A rec 92.4 5.4 0.00012 34.6 13.1 47 43-91 100-146 (283)
39 PF06681 DUF1182: Protein of u 89.4 2.7 5.9E-05 34.0 7.7 16 20-35 133-148 (226)
40 PF03383 Serpentine_r_xa: Caen 89.0 0.82 1.8E-05 35.3 4.5 77 10-86 50-129 (153)
41 PF13853 7tm_4: Olfactory rece 83.9 0.033 7.2E-07 43.0 -5.6 75 48-122 4-88 (144)
42 PHA03235 DNA packaging protein 67.8 27 0.00058 32.2 7.7 24 101-124 296-319 (409)
43 PF10325 7TM_GPCR_Srz: Serpent 58.6 28 0.00061 29.7 5.9 103 16-123 94-198 (267)
44 PF09889 DUF2116: Uncharacteri 58.4 23 0.0005 22.5 3.8 23 255-277 32-54 (59)
45 KOG2575|consensus 48.4 20 0.00044 32.4 3.2 26 257-282 253-278 (510)
46 COG3924 Predicted membrane pro 43.7 84 0.0018 20.7 4.6 30 90-119 41-70 (80)
47 PRK04989 psbM photosystem II r 43.2 39 0.00084 18.7 2.6 22 92-113 7-28 (35)
48 PF06072 Herpes_US9: Alphaherp 41.4 91 0.002 19.7 5.1 28 255-282 25-53 (60)
49 TIGR03038 PS_II_psbM photosyst 41.0 47 0.001 18.1 2.7 22 92-113 7-28 (33)
50 PHA01815 hypothetical protein 40.7 76 0.0016 18.6 3.7 23 93-115 8-30 (55)
51 COG4665 FcbT2 TRAP-type mannit 39.3 1.4E+02 0.003 23.6 6.1 32 90-121 91-122 (182)
52 PF15086 UPF0542: Uncharacteri 39.1 1.1E+02 0.0025 20.2 5.3 36 91-126 20-56 (74)
53 CHL00080 psbM photosystem II p 38.9 48 0.001 18.2 2.5 22 92-113 7-28 (34)
54 TIGR02230 ATPase_gene1 F0F1-AT 37.3 78 0.0017 22.6 4.2 59 257-315 34-93 (100)
55 TIGR02736 cbb3_Q_epsi cytochro 34.7 52 0.0011 20.4 2.6 26 100-125 5-30 (56)
56 COG1862 YajC Preprotein transl 34.4 72 0.0016 22.6 3.7 28 101-128 13-40 (97)
57 PF05151 PsbM: Photosystem II 33.7 83 0.0018 17.0 3.7 20 94-113 9-28 (31)
58 PF04238 DUF420: Protein of un 32.7 74 0.0016 24.1 3.8 37 254-290 30-66 (133)
59 PF14752 RBP_receptor: Retinol 31.4 1.1E+02 0.0024 29.8 5.6 57 259-338 162-218 (617)
60 PF02439 Adeno_E3_CR2: Adenovi 28.2 1.1E+02 0.0025 17.4 3.0 27 93-119 8-34 (38)
61 PF12877 DUF3827: Domain of un 27.7 3.5E+02 0.0076 26.5 7.9 20 102-121 277-296 (684)
62 PHA03234 DNA packaging protein 27.5 3.3E+02 0.0071 24.3 7.7 24 103-126 291-314 (338)
63 PF05297 Herpes_LMP1: Herpesvi 27.0 21 0.00045 30.5 0.0 17 8-24 35-51 (381)
64 PRK06531 yajC preprotein trans 26.1 91 0.002 22.8 3.1 25 101-128 6-30 (113)
65 PF05391 Lsm_interact: Lsm int 26.0 31 0.00067 16.7 0.5 10 320-329 11-20 (21)
66 PF15102 TMEM154: TMEM154 prot 25.8 12 0.00026 28.6 -1.5 24 97-120 58-81 (146)
67 PF05478 Prominin: Prominin; 25.6 1.9E+02 0.0042 29.5 6.4 77 91-290 412-488 (806)
68 PF08525 OapA_N: Opacity-assoc 24.3 1E+02 0.0023 16.4 2.4 21 256-278 8-28 (30)
69 PRK14094 psbM photosystem II r 24.1 67 0.0014 19.1 1.7 23 91-113 6-28 (50)
70 PF03904 DUF334: Domain of unk 24.1 3.8E+02 0.0082 22.4 6.6 34 88-121 194-227 (230)
71 COG4736 CcoQ Cbb3-type cytochr 23.8 2E+02 0.0044 18.3 4.2 31 97-127 9-39 (60)
72 PF14362 DUF4407: Domain of un 23.8 72 0.0016 27.9 2.8 21 15-35 50-70 (301)
73 PF02699 YajC: Preprotein tran 22.5 2.3E+02 0.0051 19.2 4.5 23 105-127 10-32 (82)
74 PLN00090 photosystem II reacti 22.2 1.6E+02 0.0035 20.6 3.5 24 91-114 76-99 (113)
75 KOG4220|consensus 21.5 2.7E+02 0.0058 25.8 5.7 38 302-339 459-496 (503)
76 PF09835 DUF2062: Uncharacteri 21.4 3.3E+02 0.0072 20.9 5.8 30 98-127 123-152 (154)
77 PF10624 TraS: Plasmid conjuga 20.6 43 0.00093 25.0 0.6 48 269-317 31-81 (164)
78 KOG4583|consensus 20.6 6.2E+02 0.013 22.6 7.6 15 264-278 372-386 (391)
79 TIGR02976 phageshock_pspB phag 20.6 2.3E+02 0.0049 19.0 3.9 26 99-124 5-30 (75)
80 PF07325 Curto_V2: Curtovirus 20.5 73 0.0016 22.7 1.6 18 323-340 52-69 (126)
81 PF05398 PufQ: PufQ cytochrome 20.4 2.5E+02 0.0055 18.6 4.0 26 90-115 21-46 (73)
82 PHA02975 hypothetical protein; 20.3 2.1E+02 0.0046 18.7 3.6 25 91-115 44-68 (69)
83 COG2322 Predicted membrane pro 20.3 2E+02 0.0042 22.7 4.0 33 254-286 71-103 (177)
84 PHA02650 hypothetical protein; 20.0 2.3E+02 0.005 19.1 3.7 25 91-115 49-73 (81)
No 1
>KOG4220|consensus
Probab=100.00 E-value=2.9e-40 Score=280.40 Aligned_cols=331 Identities=24% Similarity=0.408 Sum_probs=196.1
Q ss_pred HhhHHHHHhHHHHhhheeeceeEEEee-cccccccchhHHHHHHHHHHHHHHHHHHHhHhcccCCCcccccCccceecc-
Q psy4354 9 VLSHYNVDMLGYSTVIMFVDKYWAVTN-VDYIHTRNANRIIMMIVVVWSVAFIVSLAPQLGWKDPEYMDRINQQKCMVS- 86 (342)
Q Consensus 9 ~l~~~~~~~S~~~l~~IaidRY~aI~~-~~y~~~~t~r~~~~~i~~~W~~s~~~~~p~~~~~~~~~~~~~~~~~~C~~~- 86 (342)
-+-++...+|+++|+.|+||||+.|.+ +.|+.+.|++|+.++|++.|++++++..|.++.|.........+...|..+
T Consensus 108 alDYvaSNASVmNLLiISFDRYFsVTrPLtYrakRTtkrA~~MI~~AW~iSfiLWaPaIl~WqyivGkrTv~~~eC~iQF 187 (503)
T KOG4220|consen 108 ALDYVASNASVMNLLIISFDRYFSVTRPLTYRAKRTTKRAGLMIGAAWVLSFVLWAPAILFWQYIVGKRTVPDGECYIQF 187 (503)
T ss_pred HHHHHhhhhhhhhhheeeeecceeecccccccccccchHHHHHHHHHHHHHHHHHHHHHHhhHhheeeeecCCCceEEEe
Confidence 466788899999999999999999999 899999999999999999999999999988888887766665567889774
Q ss_pred -CCchhHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccc----CCCCCCCCCCccceeecccc---cccc
Q psy4354 87 -QDVGYQIFATCSTFYVPLLVILVLYWKIYQTARKRIRRRRQQRNVLMA----GKKPDTSDNKTSHFIFFKKR---KFFR 158 (342)
Q Consensus 87 -~~~~~~~~~~~~~~~ip~~~i~~~y~~I~~~vr~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~---~~~~ 158 (342)
.+....+-..+..|.+|..+|+++|++|++..+++.+.....+....+ ......+......+.+.+.. ....
T Consensus 188 lsnp~iTfGTAiAAFYlPVtiM~~LY~rIyret~kR~k~~~~lq~s~~~~~~~~~~~~~~~~~~~~s~r~~p~~~~~~~~ 267 (503)
T KOG4220|consen 188 LSNPAITFGTAIAAFYLPVTIMTILYWRIYRETRKRQKELAKLQASLPSFSAIKLSPESPKGDSKSSGRSSPSEEGKREP 267 (503)
T ss_pred ecCceeehhHHHHHHHhhHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccccCcccccccccCCCCCCCcccccccc
Confidence 333444455577899999999999999999999998877654433211 11111100000001000000 0000
Q ss_pred ccccCCCCCCCCCCCCcccc------cCCCCCcCCCCCCceeccCCCCCCCCCc--c------ccCCC---------CCC
Q psy4354 159 IKKCTNVVPPSPNKLSINVI------DEDNGINNATTSSSLILADGHSNSDADR--R------TSINN---------EAN 215 (342)
Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~------~~~~~---------~~~ 215 (342)
......+..+.+........ ++++.......+...+..+.+....... . ..+.. ++.
T Consensus 268 ~~~~~~~~~k~ps~~~~~~~~~s~~edsd~~~~~s~~~s~~~~~~se~~~~~~v~~~~~~~~~~~D~~~~~~~i~i~~~~ 347 (503)
T KOG4220|consen 268 LTNGCISNSKAPSLTPTESWKPSEKEDSDESSSESLTSSPLERPGSELSEIEAVVAKMPANQRKVDEEGLNTLIQIPTDQ 347 (503)
T ss_pred CCCCccccccCcccCCccCCCCccccccccccccccccCCccccccccccccceeccCCCCcCCCCcccccccccccccc
Confidence 00111111111111111000 0000000000000000000000000000 0 00000 000
Q ss_pred c------cc---eeccCCCCCCCCCCc----------cccccccCC---CCCcccchhhHHHHHHhhHHHHHHHHHHHHH
Q psy4354 216 T------AF---TITHNNGASQSNHNN----------ECVQVKHKI---PPTKKEKKESLEAKRERKAAKTLAIITGAFV 273 (342)
Q Consensus 216 ~------~~---~~~~~~~~~~~~~~~----------~~~~~~~~~---~~~~~~~~~~~~~~~~~k~~k~l~~v~~~f~ 273 (342)
. .. ........++.+++. ..++..... .....+++++....+|+|++|++.+|.+.|+
T Consensus 348 ~~p~s~sc~p~~~~t~~~~~s~~ns~~gk~r~~~~~~~~~~~~kkf~~~~r~q~~k~k~~~~~rErKAAkTLsAILlAFI 427 (503)
T KOG4220|consen 348 MLPKSDSCVPIFSATDTDKTTDTNSGAGKRRAGPVARKTGLDYKKFAKRARSQSRKKKKMSLVRERKAAKTLSAILLAFI 427 (503)
T ss_pred CCCCCCcccccccccccccCCccCccccccccCccccccchhhhhhhhhhhhhhhhhcchhhHHHHHHHHHHHHHHHHHH
Confidence 0 00 000000000000000 000000011 1111222333344889999999999999999
Q ss_pred HhhhhHHHHHHHHhhcCCCCchhHHHHHHHHHHhhcccccchhhhccChhHHHHHHHHHhcCCCCCC
Q psy4354 274 ICWLPFFMMALLLPLCQTCYISDYLASFFLWLGYFNSTLNPVIYTVFSPEFRQAFKRILCGSPNRGR 340 (342)
Q Consensus 274 ~cw~P~~i~~~~~~~~~~~~~~~~~~~~~~~l~~~ns~vNPiiY~~~n~~fR~~~~~ll~~~~~~~~ 340 (342)
+||.||.|+.++..||+++ .+..+..+..||.|+||-+||++|++.|..||+.++++|.|+..+++
T Consensus 428 iTWtPYNImVlv~tFC~~C-iP~tlW~~gYwLCYINSTiNP~CYALCNatFrkTfk~lL~Cr~~~~~ 493 (503)
T KOG4220|consen 428 LTWTPYNIMVLVNTFCKNC-IPETLWTFGYWLCYINSTINPLCYALCNATFRKTFKRLLLCRWKKRR 493 (503)
T ss_pred HHcccceeeeehHhhcccc-cchhHhhhhhheeeecccccHHHHHHHhHHHHHHHHHhheeeecccc
Confidence 9999999999999999987 77788899999999999999999999999999999999988865544
No 2
>KOG4219|consensus
Probab=100.00 E-value=1.4e-34 Score=246.50 Aligned_cols=211 Identities=22% Similarity=0.406 Sum_probs=166.1
Q ss_pred hhHHHhhHHHHHhHHHHhhheeeceeEEEeecccccccchhHHHHHHHHHHHHHHHHHHHhHhcccCCC--cccccCccc
Q psy4354 5 LFTGVLSHYNVDMLGYSTVIMFVDKYWAVTNVDYIHTRNANRIIMMIVVVWSVAFIVSLAPQLGWKDPE--YMDRINQQK 82 (342)
Q Consensus 5 ~~~~~l~~~~~~~S~~~l~~IaidRY~aI~~~~y~~~~t~r~~~~~i~~~W~~s~~~~~p~~~~~~~~~--~~~~~~~~~ 82 (342)
-|..|+....+.+|+++++|||+|||+||.||.. .+.++|...++|+++|++++++++|..+..+..+ ..++.....
T Consensus 109 ~f~nf~~itav~vSVfTlvAiA~DRy~AIi~Pl~-~r~s~r~sk~iIllIW~lA~l~a~P~~l~s~v~~~~~~d~~~~~~ 187 (423)
T KOG4219|consen 109 RFVNFFPITAVFVSVFTLVAIAIDRYMAIIHPLQ-PRPSRRSSKIIILLIWALALLLALPQLLYSSVEELYLYDGESRVV 187 (423)
T ss_pred eeccccchhhhhHhHHHHHHHHHHHHHHHhhhcc-cCCCCcceeehhHHHHHHHHHHhccceeeeeeEEeeccCCcceEE
Confidence 3567888899999999999999999999999753 4499999999999999999999996554333221 122222344
Q ss_pred eecc---------CC----chhHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccCCCCCCCCCCcccee
Q psy4354 83 CMVS---------QD----VGYQIFATCSTFYVPLLVILVLYWKIYQTARKRIRRRRQQRNVLMAGKKPDTSDNKTSHFI 149 (342)
Q Consensus 83 C~~~---------~~----~~~~~~~~~~~~~ip~~~i~~~y~~I~~~vr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 149 (342)
|... .. +.|.....++.|++|++++.+.|.+|.+.+|..+....
T Consensus 188 ~~~~~pe~~~~~~~~~~~~~~y~~vl~~lqYflPliVl~~~Yt~iav~LW~~~~~gd----------------------- 244 (423)
T KOG4219|consen 188 CVTAWPEHVCPTENESLLMQGYNYVLLFLQYFLPLIVLGLAYTVIAVTLWGRRIPGD----------------------- 244 (423)
T ss_pred EEEecccccCCcchhhhhhcceeeeehhHHHHHHHHHHHHHHHHHHHHHHhccCccc-----------------------
Confidence 4331 11 23778888899999999999999999999998431000
Q ss_pred eccccccccccccCCCCCCCCCCCCcccccCCCCCcCCCCCCceeccCCCCCCCCCccccCCCCCCccceeccCCCCCCC
Q psy4354 150 FFKKRKFFRIKKCTNVVPPSPNKLSINVIDEDNGINNATTSSSLILADGHSNSDADRRTSINNEANTAFTITHNNGASQS 229 (342)
Q Consensus 150 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 229 (342)
T Consensus 245 -------------------------------------------------------------------------------- 244 (423)
T KOG4219|consen 245 -------------------------------------------------------------------------------- 244 (423)
T ss_pred --------------------------------------------------------------------------------
Confidence
Q ss_pred CCCccccccccCCCCCcccchhhHHHHHHhhHHHHHHHHHHHHHHhhhhHHHHHHHHhhcCC---CCchhHHHHHHHHHH
Q psy4354 230 NHNNECVQVKHKIPPTKKEKKESLEAKRERKAAKTLAIITGAFVICWLPFFMMALLLPLCQT---CYISDYLASFFLWLG 306 (342)
Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~k~l~~v~~~f~~cw~P~~i~~~~~~~~~~---~~~~~~~~~~~~~l~ 306 (342)
..+ +...+.+..+|+.||+++|++.|.+||+||.+..++....++ ......++....||+
T Consensus 245 ----------------~~d-~~~~~~kak~K~vkmliiVV~~FaicWlPyh~y~il~~~~~~i~~~k~i~~vyl~~~WLa 307 (423)
T KOG4219|consen 245 ----------------QQD-RKHEQLKAKKKVVKMLIIVVVIFAICWLPYHIYFILNATNPEINRKKFIQQVYLAIYWLA 307 (423)
T ss_pred ----------------hhc-hhhHHHHHHHHHHHHHHHHHHHHHHhccChhHHHHHHHhHHHHHHHHHHHHHHHHHHHHH
Confidence 011 223366788999999999999999999999999998766532 235667888889999
Q ss_pred hhcccccchhhhccChhHHHHHHHHHhcCC
Q psy4354 307 YFNSTLNPVIYTVFSPEFRQAFKRILCGSP 336 (342)
Q Consensus 307 ~~ns~vNPiiY~~~n~~fR~~~~~ll~~~~ 336 (342)
..|+|.||+||+++|++||.++++.|+|..
T Consensus 308 MSst~yNPiIY~~lN~Rfr~gf~~~fr~cp 337 (423)
T KOG4219|consen 308 MSSTCYNPIIYCFLNKRFRGGFRRAFRWCP 337 (423)
T ss_pred HHHhhhccHhhhhhHHHHHHHHhhhhheee
Confidence 999999999999999999999999998773
No 3
>PHA03235 DNA packaging protein UL33; Provisional
Probab=100.00 E-value=3.1e-32 Score=245.87 Aligned_cols=200 Identities=14% Similarity=0.267 Sum_probs=152.4
Q ss_pred hhhHHHhhHHHHHhHHHHhhheeeceeEEEeecccccccchhHHHHHHHHHHHHHHHHHHHhHhcccC---CCcc-cccC
Q psy4354 4 LLFTGVLSHYNVDMLGYSTVIMFVDKYWAVTNVDYIHTRNANRIIMMIVVVWSVAFIVSLAPQLGWKD---PEYM-DRIN 79 (342)
Q Consensus 4 ~~~~~~l~~~~~~~S~~~l~~IaidRY~aI~~~~y~~~~t~r~~~~~i~~~W~~s~~~~~p~~~~~~~---~~~~-~~~~ 79 (342)
|-+.+++...+..+|++++++||+|||+||++|.|....+++++.++++++|++++++++|+.+.... ..+. ...+
T Consensus 105 Ck~~~~l~~~~~~~Si~tL~~ISiDRY~aI~~p~~~~~~~~~~a~~ii~~iWi~sll~s~P~~~~~~~~~~~~~~~~~~~ 184 (409)
T PHA03235 105 CKFASLLYYASCTVGFATVALIAADRYRVIHQRTRARSSAYRSTYKILGLTWFASLICSGPAPVYTTVVAHDDVDPEAPG 184 (409)
T ss_pred ehhHHHHHHHHHHHHHHHHHHHHHHHHHHeechhhccCcccchhhhhHHHHHHHHHHHHHHHHHHHhhhhccccCcCCCC
Confidence 44567888889999999999999999999999766666778899999999999999999986543211 1111 1112
Q ss_pred ccceeccCC--c------hhHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccCCCCCCCCCCccceeec
Q psy4354 80 QQKCMVSQD--V------GYQIFATCSTFYVPLLVILVLYWKIYQTARKRIRRRRQQRNVLMAGKKPDTSDNKTSHFIFF 151 (342)
Q Consensus 80 ~~~C~~~~~--~------~~~~~~~~~~~~ip~~~i~~~y~~I~~~vr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 151 (342)
...|..... . .|.++.+++.|++|+++|++||.+|++++|++.+
T Consensus 185 ~~~C~~~~~~~~~~~~~~~y~i~l~i~~f~iPl~im~~~Y~~I~~~l~~~~~---------------------------- 236 (409)
T PHA03235 185 YETCVIYFRADQVKTVLSTFKVLLTLVWGIAPVVMMTWFYTFFYRTLKRASY---------------------------- 236 (409)
T ss_pred cceeeEeCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh----------------------------
Confidence 456865322 1 2445556677999999999999999999987421
Q ss_pred cccccccccccCCCCCCCCCCCCcccccCCCCCcCCCCCCceeccCCCCCCCCCccccCCCCCCccceeccCCCCCCCCC
Q psy4354 152 KKRKFFRIKKCTNVVPPSPNKLSINVIDEDNGINNATTSSSLILADGHSNSDADRRTSINNEANTAFTITHNNGASQSNH 231 (342)
Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 231 (342)
T Consensus 237 -------------------------------------------------------------------------------- 236 (409)
T PHA03235 237 -------------------------------------------------------------------------------- 236 (409)
T ss_pred --------------------------------------------------------------------------------
Confidence
Q ss_pred CccccccccCCCCCcccchhhHHHHHHhhHHHHHHHHHHHHHHhhhhHHHHHHHHhhcC-----CC---CchhHHHHHHH
Q psy4354 232 NNECVQVKHKIPPTKKEKKESLEAKRERKAAKTLAIITGAFVICWLPFFMMALLLPLCQ-----TC---YISDYLASFFL 303 (342)
Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~k~l~~v~~~f~~cw~P~~i~~~~~~~~~-----~~---~~~~~~~~~~~ 303 (342)
++++|+.+++++++++|++||+||.++.++..+.+ +. .....+..+..
T Consensus 237 ------------------------~~~~k~~~~v~iivv~F~iCWlPy~v~~l~~~~~~~~~~~~~~~~~~~~~~~~i~~ 292 (409)
T PHA03235 237 ------------------------KKRSRTLTFVCILLLSFLCLQTPFVAIMIFDSYATLIWPSDCEHINLRDAVSTLSR 292 (409)
T ss_pred ------------------------hcchhhhhhHHHHHHHHHHHHhHHHHHHHHHHHHHhcCCCCchhhhHHHHHHHHHH
Confidence 12346788999999999999999999888754421 11 12334567888
Q ss_pred HHHhhcccccchhhhccChhHHHHHHHHHhcC
Q psy4354 304 WLGYFNSTLNPVIYTVFSPEFRQAFKRILCGS 335 (342)
Q Consensus 304 ~l~~~ns~vNPiiY~~~n~~fR~~~~~ll~~~ 335 (342)
+|+++|||+||+||++++++||+++++.++++
T Consensus 293 ~La~~ns~lNPiIY~~~~~~FRk~~~~~l~~~ 324 (409)
T PHA03235 293 LVPNLHCLLNPILYAFLGNDFLKRFRQCFRGE 324 (409)
T ss_pred HHHHHHHhHhHHHHHHhhHHHHHHHHHHHhhh
Confidence 99999999999999999999999999999654
No 4
>PHA03234 DNA packaging protein UL33; Provisional
Probab=100.00 E-value=8.9e-32 Score=237.76 Aligned_cols=194 Identities=14% Similarity=0.188 Sum_probs=141.7
Q ss_pred hhhHHHhhHHHHHhHHHHhhheeeceeEEEeecccccccchhHHHHHHHHHHHHHHHHHHHhHhcccCCCcccccCccce
Q psy4354 4 LLFTGVLSHYNVDMLGYSTVIMFVDKYWAVTNVDYIHTRNANRIIMMIVVVWSVAFIVSLAPQLGWKDPEYMDRINQQKC 83 (342)
Q Consensus 4 ~~~~~~l~~~~~~~S~~~l~~IaidRY~aI~~~~y~~~~t~r~~~~~i~~~W~~s~~~~~p~~~~~~~~~~~~~~~~~~C 83 (342)
+-+.+++...+.++|++++++||+|||+||++|...+ .++++....++++|+++++.++|+++......... +...|
T Consensus 105 Ck~~~~~~~~~~~~Si~~L~~ISiDRY~aIv~p~~~~-~~~~~~~~~i~~~Wi~s~l~~~P~l~~~~~~~~~~--~~~~C 181 (338)
T PHA03234 105 CQCVLFIYHASCSYSICMLAIIATIRYKTLHRRKKND-KKNNHIGRNIGILFLASAMCAIPAALFVKTEGKKG--NYGKC 181 (338)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhHeeeechhhhh-hhhhhHHHHHHHHHHHHHHHHhhHhHeeeeeecCC--CCCcC
Confidence 3456888999999999999999999999999954333 33445556677779999999997765443221111 12468
Q ss_pred eccCC--chhHHH------HHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccCCCCCCCCCCccceeeccccc
Q psy4354 84 MVSQD--VGYQIF------ATCSTFYVPLLVILVLYWKIYQTARKRIRRRRQQRNVLMAGKKPDTSDNKTSHFIFFKKRK 155 (342)
Q Consensus 84 ~~~~~--~~~~~~------~~~~~~~ip~~~i~~~y~~I~~~vr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (342)
...+. ..|..+ ..++.|++|+++|++||.+|.+++++..
T Consensus 182 ~~~~~~~~~~~~~~~~~~~~~~~~f~iPl~im~~cY~~I~~~L~~~~--------------------------------- 228 (338)
T PHA03234 182 NIHISSKKAYDLFIAIKIVFCFIWGIFPTMIFSFFYVIFCKALHALT--------------------------------- 228 (338)
T ss_pred cccCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh---------------------------------
Confidence 65432 222221 2233468999999999999998887521
Q ss_pred cccccccCCCCCCCCCCCCcccccCCCCCcCCCCCCceeccCCCCCCCCCccccCCCCCCccceeccCCCCCCCCCCccc
Q psy4354 156 FFRIKKCTNVVPPSPNKLSINVIDEDNGINNATTSSSLILADGHSNSDADRRTSINNEANTAFTITHNNGASQSNHNNEC 235 (342)
Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 235 (342)
T Consensus 229 -------------------------------------------------------------------------------- 228 (338)
T PHA03234 229 -------------------------------------------------------------------------------- 228 (338)
T ss_pred --------------------------------------------------------------------------------
Confidence
Q ss_pred cccccCCCCCcccchhhHHHHHHhhHHHHHHHHHHHHHHhhhhHHHHHHHHhhcC-----CCC---chhHHHHHHHHHHh
Q psy4354 236 VQVKHKIPPTKKEKKESLEAKRERKAAKTLAIITGAFVICWLPFFMMALLLPLCQ-----TCY---ISDYLASFFLWLGY 307 (342)
Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~~k~~k~l~~v~~~f~~cw~P~~i~~~~~~~~~-----~~~---~~~~~~~~~~~l~~ 307 (342)
.++++|++|++++++++|++||+||.++.++..+.. .+. .....+.++.+|++
T Consensus 229 -------------------~~~~~k~~k~i~~vv~vF~iCWlPy~iv~l~~~~~~~~~~~~c~~~~~~~~~~~v~~~La~ 289 (338)
T PHA03234 229 -------------------EKKHKKTLFFIRILILSFLCIQIPNIAILICEIAFLYIANNSCFGLAQREILQIIIRLMPE 289 (338)
T ss_pred -------------------hhhhhhhhhHHHHHHHHHHHHHhHHHHHHHHHHHHHhcccCcchHHHHHHHHHHHHHHHHH
Confidence 123578999999999999999999999887654321 111 12344667889999
Q ss_pred hcccccchhhhccChhHHHHHHHHH
Q psy4354 308 FNSTLNPVIYTVFSPEFRQAFKRIL 332 (342)
Q Consensus 308 ~ns~vNPiiY~~~n~~fR~~~~~ll 332 (342)
+|||+||+||++++++||+++++++
T Consensus 290 ~nsclNPiIY~f~~~~FR~~~~~~~ 314 (338)
T PHA03234 290 IHCFSNPLVYAFTGGDFRLRFTACF 314 (338)
T ss_pred hhhhhhHHHHHHhhHHHHHHHHHHH
Confidence 9999999999999999999988776
No 5
>PHA02834 chemokine receptor-like protein; Provisional
Probab=99.97 E-value=1.3e-31 Score=236.73 Aligned_cols=199 Identities=18% Similarity=0.369 Sum_probs=147.8
Q ss_pred hhhHHHhhHHHHHhHHHHhhheeeceeEEEeecccccccchhHHHHHHHHHHHHHHHHHHHhHhcccCCCcccccCccce
Q psy4354 4 LLFTGVLSHYNVDMLGYSTVIMFVDKYWAVTNVDYIHTRNANRIIMMIVVVWSVAFIVSLAPQLGWKDPEYMDRINQQKC 83 (342)
Q Consensus 4 ~~~~~~l~~~~~~~S~~~l~~IaidRY~aI~~~~y~~~~t~r~~~~~i~~~W~~s~~~~~p~~~~~~~~~~~~~~~~~~C 83 (342)
+-+.+++...+..+|++++++||+|||++|++|......+.+++.++++++|+++++.++|+++++..... . +...|
T Consensus 98 C~~~~~~~~~~~~~Si~tL~~IsidRY~aI~~P~~~~~~~~~~~~~~i~~iWi~s~l~~~P~~~~~~~~~~-~--~~~~C 174 (323)
T PHA02834 98 CKLVLGVYFVGFFSNMFFVTLISIDRYILVVNATKIKNKSISLSVLLSVAAWVCSVILSMPAMVLYYVDNT-D--NLKQC 174 (323)
T ss_pred HHhHHHHHHHHHHHHHHHHHHHHHHHhhheeCchhccCCccchHHHHHHHHHHHHHHHHhhHHHHHHhccC-C--CceEE
Confidence 44566777788889999999999999999999543344556778888899999999999988776543211 1 13468
Q ss_pred eccCC-c--h----hHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccCCCCCCCCCCccceeecccccc
Q psy4354 84 MVSQD-V--G----YQIFATCSTFYVPLLVILVLYWKIYQTARKRIRRRRQQRNVLMAGKKPDTSDNKTSHFIFFKKRKF 156 (342)
Q Consensus 84 ~~~~~-~--~----~~~~~~~~~~~ip~~~i~~~y~~I~~~vr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (342)
..... . . +.....++.|++|+++|+++|.+|++++|++.+
T Consensus 175 ~~~~~~~~~~~~~~~~~~~~i~~f~iPl~ii~~~Y~~I~~~l~~~~~--------------------------------- 221 (323)
T PHA02834 175 IFNDYHENFSWSAFFNFEINIFGIVIPLIILIYCYSKILYTLKNCKN--------------------------------- 221 (323)
T ss_pred eccCCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC---------------------------------
Confidence 64211 1 1 122334678899999999999999999876310
Q ss_pred ccccccCCCCCCCCCCCCcccccCCCCCcCCCCCCceeccCCCCCCCCCccccCCCCCCccceeccCCCCCCCCCCcccc
Q psy4354 157 FRIKKCTNVVPPSPNKLSINVIDEDNGINNATTSSSLILADGHSNSDADRRTSINNEANTAFTITHNNGASQSNHNNECV 236 (342)
Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 236 (342)
T Consensus 222 -------------------------------------------------------------------------------- 221 (323)
T PHA02834 222 -------------------------------------------------------------------------------- 221 (323)
T ss_pred --------------------------------------------------------------------------------
Confidence
Q ss_pred ccccCCCCCcccchhhHHHHHHhhHHHHHHHHHHHHHHhhhhHHHHHHHHhhcCCC------Cc---hhHHHHHHHHHHh
Q psy4354 237 QVKHKIPPTKKEKKESLEAKRERKAAKTLAIITGAFVICWLPFFMMALLLPLCQTC------YI---SDYLASFFLWLGY 307 (342)
Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~~~~k~~k~l~~v~~~f~~cw~P~~i~~~~~~~~~~~------~~---~~~~~~~~~~l~~ 307 (342)
++++|.+|++++++++|++||+||.+..++..+.+.. .. ......+..++++
T Consensus 222 -------------------~~~~k~~k~~~~vv~~F~icWlPy~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~ 282 (323)
T PHA02834 222 -------------------KNKTRSIKIILTVVTFTVVFWVPFNIVLFINSLQSVGLIDIGCYHFKKIVYSIDIAELISF 282 (323)
T ss_pred -------------------cccceEEeehhHHHHHHHHHHhhHHHHHHHHHHHHhcCCCccchHHHHHHHHHHHHHHHHH
Confidence 1135778999999999999999999988876553211 00 1112356778999
Q ss_pred hcccccchhhhccChhHHHHHHHHHhcCCC
Q psy4354 308 FNSTLNPVIYTVFSPEFRQAFKRILCGSPN 337 (342)
Q Consensus 308 ~ns~vNPiiY~~~n~~fR~~~~~ll~~~~~ 337 (342)
+||++||+||+++|++||+++++++|+.++
T Consensus 283 ~ns~iNPiIY~~~~~~fR~~~~~~~~~~~~ 312 (323)
T PHA02834 283 VHCCVNPIIYAFVGKNFKKVFKNMFCRTNN 312 (323)
T ss_pred hhccccHHHHHHhcHHHHHHHHHHHHhhhh
Confidence 999999999999999999999999975543
No 6
>PHA02638 CC chemokine receptor-like protein; Provisional
Probab=99.96 E-value=2.7e-29 Score=227.81 Aligned_cols=201 Identities=17% Similarity=0.337 Sum_probs=150.9
Q ss_pred hhhHHHhhHHHHHhHHHHhhheeeceeEEEee-cccccccchhHHHHHHHHHHHHHHHHHHHhHhcccCCCc-------c
Q psy4354 4 LLFTGVLSHYNVDMLGYSTVIMFVDKYWAVTN-VDYIHTRNANRIIMMIVVVWSVAFIVSLAPQLGWKDPEY-------M 75 (342)
Q Consensus 4 ~~~~~~l~~~~~~~S~~~l~~IaidRY~aI~~-~~y~~~~t~r~~~~~i~~~W~~s~~~~~p~~~~~~~~~~-------~ 75 (342)
+-+.+++...+.++|++++++|++|||+||+| .++....+++.+.++++++|+++++.++|+++.++.... .
T Consensus 168 Ck~~~~l~~~~~~~Si~~L~~isiDRYlaIv~p~~~~~~~~~~~~~i~~~~iW~~s~l~slP~~~~~~~~~~~~~~~~~~ 247 (417)
T PHA02638 168 CKVISASYYIGFFSNMFLITLMSIDRYFAILYPISFQKYRTFNIGIILCIISWILSLIITSPAYFIFEASNIIFSAQDSN 247 (417)
T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccceecHhhhHhhHhHHHHHHHHHHHHHHHHhhccccccccccCC
Confidence 34567778888999999999999999999999 555555677778889999999999999998876543211 1
Q ss_pred cccCccceeccCCc----------hhHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccCCCCCCCCCCc
Q psy4354 76 DRINQQKCMVSQDV----------GYQIFATCSTFYVPLLVILVLYWKIYQTARKRIRRRRQQRNVLMAGKKPDTSDNKT 145 (342)
Q Consensus 76 ~~~~~~~C~~~~~~----------~~~~~~~~~~~~ip~~~i~~~y~~I~~~vr~~~~~~~~~~~~~~~~~~~~~~~~~~ 145 (342)
.......|...... .+.+...++.+++|++++++||.+|++++++..+
T Consensus 248 ~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~i~~f~lPl~vmi~cY~~I~~~L~~~~~---------------------- 305 (417)
T PHA02638 248 ETISNYQCTLIEDNEKNNISFLGRILQFEINILGMFIPIIIFAFCYIKIILKLKQLKK---------------------- 305 (417)
T ss_pred CCccCCeeeeeccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc----------------------
Confidence 11113467653211 1223334667899999999999999999876310
Q ss_pred cceeeccccccccccccCCCCCCCCCCCCcccccCCCCCcCCCCCCceeccCCCCCCCCCccccCCCCCCccceeccCCC
Q psy4354 146 SHFIFFKKRKFFRIKKCTNVVPPSPNKLSINVIDEDNGINNATTSSSLILADGHSNSDADRRTSINNEANTAFTITHNNG 225 (342)
Q Consensus 146 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (342)
T Consensus 306 -------------------------------------------------------------------------------- 305 (417)
T PHA02638 306 -------------------------------------------------------------------------------- 305 (417)
T ss_pred --------------------------------------------------------------------------------
Confidence
Q ss_pred CCCCCCCccccccccCCCCCcccchhhHHHHHHhhHHHHHHHHHHHHHHhhhhHHHHHHHHhhcCC-----------CC-
Q psy4354 226 ASQSNHNNECVQVKHKIPPTKKEKKESLEAKRERKAAKTLAIITGAFVICWLPFFMMALLLPLCQT-----------CY- 293 (342)
Q Consensus 226 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~k~l~~v~~~f~~cw~P~~i~~~~~~~~~~-----------~~- 293 (342)
++++|++|++++++++|++||+||.++.++..+... +.
T Consensus 306 ------------------------------~~k~k~~rli~~ivi~f~lcW~Py~i~~ll~~~~~~~~~~~~~~~~~c~~ 355 (417)
T PHA02638 306 ------------------------------SKKTKSIIIVSIIIICSLICWIPLNIVILFATMYSFKGFNSIISEHICGF 355 (417)
T ss_pred ------------------------------cccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHHhhccccccccccccH
Confidence 124577899999999999999999999888655311 11
Q ss_pred -chhHHHHHHHHHHhhcccccchhhhccChhHHHHHHHHHhcCC
Q psy4354 294 -ISDYLASFFLWLGYFNSTLNPVIYTVFSPEFRQAFKRILCGSP 336 (342)
Q Consensus 294 -~~~~~~~~~~~l~~~ns~vNPiiY~~~n~~fR~~~~~ll~~~~ 336 (342)
..+....++.+++++|+|+||++|++.+++||+++++++++..
T Consensus 356 ~~l~~~~~vt~~la~~~sclNPiIY~f~~~~FR~~l~~~~~~~~ 399 (417)
T PHA02638 356 IKLGYAMMLAEAISLTHCCINPLIYTLIGENFRMHLLMIFRNIF 399 (417)
T ss_pred HHHHHHHHHHHHHHHHHHhhhHHHHHHhCHHHHHHHHHHHHHhc
Confidence 1123455677899999999999999999999999999996554
No 7
>PHA03087 G protein-coupled chemokine receptor-like protein; Provisional
Probab=99.95 E-value=1.6e-27 Score=212.39 Aligned_cols=198 Identities=21% Similarity=0.292 Sum_probs=150.7
Q ss_pred hhhHHHhhHHHHHhHHHHhhheeeceeEEEee-cccccccchhHHHHHHHHHHHHHHHHHHHhHhcccCCCcccccCccc
Q psy4354 4 LLFTGVLSHYNVDMLGYSTVIMFVDKYWAVTN-VDYIHTRNANRIIMMIVVVWSVAFIVSLAPQLGWKDPEYMDRINQQK 82 (342)
Q Consensus 4 ~~~~~~l~~~~~~~S~~~l~~IaidRY~aI~~-~~y~~~~t~r~~~~~i~~~W~~s~~~~~p~~~~~~~~~~~~~~~~~~ 82 (342)
+-+.+++...+..+|++++++||+|||++|++ .+|....+.+++.++++++|+++++.++|+++.+...... +...
T Consensus 111 C~~~~~~~~~~~~~S~~~l~~iaidRy~aI~~p~~~~~~~~~~~~~~~~~~iWl~~~~~~~p~~~~~~~~~~~---~~~~ 187 (335)
T PHA03087 111 CKIVSGLYYIGFYNSMNFITVMSVDRYIAIVHPVKSNKINTVKYGYIVSLVIWIISIIETTPILFVYTTKKDH---ETLI 187 (335)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccccccchhhhHHHHHHHHHHHHHHhccHhheeeeeccC---CCce
Confidence 44567788888999999999999999999999 6777788999999999999999999999776665432221 2445
Q ss_pred eeccCC---chh----HHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccCCCCCCCCCCccceeeccccc
Q psy4354 83 CMVSQD---VGY----QIFATCSTFYVPLLVILVLYWKIYQTARKRIRRRRQQRNVLMAGKKPDTSDNKTSHFIFFKKRK 155 (342)
Q Consensus 83 C~~~~~---~~~----~~~~~~~~~~ip~~~i~~~y~~I~~~vr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (342)
|...++ ..+ .+...++.+++|+++++++|.+|++.++++.
T Consensus 188 C~~~~~~~~~~~~~~~~~~~~~~~~~lP~~ii~~~y~~i~~~l~~~~--------------------------------- 234 (335)
T PHA03087 188 CCMFYNNKTMNWKLFINFEINIIGMLIPLTILLYCYSKILITLKGIN--------------------------------- 234 (335)
T ss_pred EEecCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc---------------------------------
Confidence 655432 111 1223456789999999999999998887632
Q ss_pred cccccccCCCCCCCCCCCCcccccCCCCCcCCCCCCceeccCCCCCCCCCccccCCCCCCccceeccCCCCCCCCCCccc
Q psy4354 156 FFRIKKCTNVVPPSPNKLSINVIDEDNGINNATTSSSLILADGHSNSDADRRTSINNEANTAFTITHNNGASQSNHNNEC 235 (342)
Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 235 (342)
T Consensus 235 -------------------------------------------------------------------------------- 234 (335)
T PHA03087 235 -------------------------------------------------------------------------------- 234 (335)
T ss_pred --------------------------------------------------------------------------------
Confidence
Q ss_pred cccccCCCCCcccchhhHHHHHHhhHHHHHHHHHHHHHHhhhhHHHHHHHHhhc-----CCCC---chhHHHHHHHHHHh
Q psy4354 236 VQVKHKIPPTKKEKKESLEAKRERKAAKTLAIITGAFVICWLPFFMMALLLPLC-----QTCY---ISDYLASFFLWLGY 307 (342)
Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~~k~~k~l~~v~~~f~~cw~P~~i~~~~~~~~-----~~~~---~~~~~~~~~~~l~~ 307 (342)
..++++|++|++++++++|++||+|+.+..++..+. +... .......++.++..
T Consensus 235 ------------------~~~~~~k~~k~l~~iv~~f~i~w~P~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~ 296 (335)
T PHA03087 235 ------------------KSKKNKKAIKLVLIIVILFVIFWLPFNVSVFVYSLHILHFKSGCKAVKYIQYALHVTEIISL 296 (335)
T ss_pred ------------------cchhcchHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcHHHHHHHHHHHHHHHHH
Confidence 013457889999999999999999999887765331 1111 12233456677899
Q ss_pred hcccccchhhhccChhHHHHHHHHHhcC
Q psy4354 308 FNSTLNPVIYTVFSPEFRQAFKRILCGS 335 (342)
Q Consensus 308 ~ns~vNPiiY~~~n~~fR~~~~~ll~~~ 335 (342)
+|+++||+||++++++||++++++++..
T Consensus 297 ~ns~~NPiIY~~~~~~fr~~~~~~~~~~ 324 (335)
T PHA03087 297 SHCCINPLIYAFVSEFFNKHKKKSLKLM 324 (335)
T ss_pred HHHhhhhhHHHHcCHHHHHHHHHHHHHH
Confidence 9999999999999999999999998544
No 8
>PF00001 7tm_1: 7 transmembrane receptor (rhodopsin family) Rhodopsin-like GPCR superfamily signature 5-hydroxytryptamine 7 receptor signature bradykinin receptor signature gastrin receptor signature melatonin receptor signature olfactory receptor signature; InterPro: IPR000276 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The rhodopsin-like GPCRs themselves represent a widespread protein family that includes hormone, neurotransmitter and light receptors, all of which transduce extracellular signals through interaction with guanine nucleotide-binding (G) proteins. Although their activating ligands vary widely in structure and character, the amino acid sequences of the receptors are very similar and are believed to adopt a common structural framework comprising 7 transmembrane (TM) helices [, , ].; GO: 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane; PDB: 2KI9_A 3QAK_A 2YDV_A 3VGA_A 3PWH_A 3RFM_A 3EML_A 3VG9_A 3REY_A 3UZA_A ....
Probab=99.94 E-value=2.4e-26 Score=196.61 Aligned_cols=195 Identities=30% Similarity=0.551 Sum_probs=160.5
Q ss_pred hhhHHHhhHHHHHhHHHHhhheeeceeEEEee-cccccccchhHHHHHHHHHHHHHHHHHHHhHhcccCCCcccccCccc
Q psy4354 4 LLFTGVLSHYNVDMLGYSTVIMFVDKYWAVTN-VDYIHTRNANRIIMMIVVVWSVAFIVSLAPQLGWKDPEYMDRINQQK 82 (342)
Q Consensus 4 ~~~~~~l~~~~~~~S~~~l~~IaidRY~aI~~-~~y~~~~t~r~~~~~i~~~W~~s~~~~~p~~~~~~~~~~~~~~~~~~ 82 (342)
+-+.+++..++..+|.+++++|++|||++|++ .+|+...+++++..+++++|++++++++|+.++.+....... +...
T Consensus 57 C~~~~~~~~~~~~~s~~~~~~is~dRy~~i~~p~~~~~~~~~~~~~~~i~~~w~~~~~~~~~~~~~~~~~~~~~~-~~~~ 135 (257)
T PF00001_consen 57 CRIFGFLFYFSSFSSIFSLVAISIDRYLAICHPLRYRRIRTRRRARIIIILIWIISFLISLPPLFFSWVYFVSDG-SQSF 135 (257)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHSHHTHHHHSCHHHHHHHHHHHHHHHHHHHHHHHHTCEEEEESTC-CCEE
T ss_pred ccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccc-cccc
Confidence 44667888899999999999999999999999 788888899999999999999999999988877554332221 1567
Q ss_pred eeccCCc----hhHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccCCCCCCCCCCccceeecccccccc
Q psy4354 83 CMVSQDV----GYQIFATCSTFYVPLLVILVLYWKIYQTARKRIRRRRQQRNVLMAGKKPDTSDNKTSHFIFFKKRKFFR 158 (342)
Q Consensus 83 C~~~~~~----~~~~~~~~~~~~ip~~~i~~~y~~I~~~vr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (342)
|...... .+..+..++.+++|+++++++|.+|++.+|++.++.....
T Consensus 136 C~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~i~~~~~~~~~~~~~~~----------------------------- 186 (257)
T PF00001_consen 136 CFIDFSSSSSQIYFIYFFIVFFILPLIIILICYIRILRKLRRQRKRIKSQS----------------------------- 186 (257)
T ss_dssp EEESCSSSHHHHHHHHHHHHHTHHHHHHHHHHHHHHHHHHHHHHHCTCCHT-----------------------------
T ss_pred ccccccccccccccccccccccccceeeeeeeccccccccccccccccccc-----------------------------
Confidence 8886443 4777888889999999999999999999999876544310
Q ss_pred ccccCCCCCCCCCCCCcccccCCCCCcCCCCCCceeccCCCCCCCCCccccCCCCCCccceeccCCCCCCCCCCcccccc
Q psy4354 159 IKKCTNVVPPSPNKLSINVIDEDNGINNATTSSSLILADGHSNSDADRRTSINNEANTAFTITHNNGASQSNHNNECVQV 238 (342)
Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 238 (342)
T Consensus 187 -------------------------------------------------------------------------------- 186 (257)
T PF00001_consen 187 -------------------------------------------------------------------------------- 186 (257)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred --------------------------------------------------------------------------------
Confidence
Q ss_pred ccCCCCCcccchhhHHHHHHhhHHHHHHHHHHHHHHhhhhHHHHHHHHhhcCCCC-chhHHHHHHHHHHhhcccccchhh
Q psy4354 239 KHKIPPTKKEKKESLEAKRERKAAKTLAIITGAFVICWLPFFMMALLLPLCQTCY-ISDYLASFFLWLGYFNSTLNPVIY 317 (342)
Q Consensus 239 ~~~~~~~~~~~~~~~~~~~~~k~~k~l~~v~~~f~~cw~P~~i~~~~~~~~~~~~-~~~~~~~~~~~l~~~ns~vNPiiY 317 (342)
....+.+.++|+|+++++++++++|++||+|+.+..++....+... .......++.++.++|+++||++|
T Consensus 187 ---------~~~~~~~~~~~~~~~~~~~~i~~~f~~~~~P~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~~s~~nP~iY 257 (257)
T PF00001_consen 187 ---------SSSSRRRSRRERRAARTLLIIVLVFLLCWLPYFILSLLSVFSPSSSLISSILFYISYFLAFLNSCLNPIIY 257 (257)
T ss_dssp ---------SHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHSSTSTCSHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred ---------ccccccccccccccccccccccccccccCCceeHHHHHHHHcCccchhhHHHHHHHHHHHHHHHhhCcEEC
Confidence 0223346688999999999999999999999999999887776543 467788889999999999999999
No 9
>KOG2087|consensus
Probab=99.78 E-value=2.3e-20 Score=158.18 Aligned_cols=199 Identities=19% Similarity=0.357 Sum_probs=151.6
Q ss_pred hhhhHHHhhHHHHHhHHHHhhheeeceeEEEeeccc-ccccchhHHHHHHHHHHHHHHHHHHHhHhcccCCCcccccCcc
Q psy4354 3 QLLFTGVLSHYNVDMLGYSTVIMFVDKYWAVTNVDY-IHTRNANRIIMMIVVVWSVAFIVSLAPQLGWKDPEYMDRINQQ 81 (342)
Q Consensus 3 ~~~~~~~l~~~~~~~S~~~l~~IaidRY~aI~~~~y-~~~~t~r~~~~~i~~~W~~s~~~~~p~~~~~~~~~~~~~~~~~ 81 (342)
++=+.||+..++.-.|+++++.+++||+++|++|.+ .+....|.+..++...|+.+++.++.|+++.+.+.. ..
T Consensus 103 gC~~aGflavFASElSv~~LT~itlEr~l~i~~p~~~~~~~~lr~~~~ill~~wl~~~l~A~~Pl~g~s~Y~~-----~~ 177 (363)
T KOG2087|consen 103 GCPVAGFLAVFASELSVFLLTLITLERWLSITYPFRLDRKAKLRPLVLILLLGWLFAFLMALLPLFGISSYGA-----SS 177 (363)
T ss_pred CCchHHHHHHHHHHHHHHHHHHHHHHHHhheeccccCCCcccccHHHHHHHHHHHHHHHHHhccccCCCCCcc-----cc
Confidence 345689999999999999999999999999999444 444555559999999999999999999998665432 46
Q ss_pred ceecc-----CCchhHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccCCCCCCCCCCccceeecccccc
Q psy4354 82 KCMVS-----QDVGYQIFATCSTFYVPLLVILVLYWKIYQTARKRIRRRRQQRNVLMAGKKPDTSDNKTSHFIFFKKRKF 156 (342)
Q Consensus 82 ~C~~~-----~~~~~~~~~~~~~~~ip~~~i~~~y~~I~~~vr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (342)
.|.+- .+..|.+..++.+=.+.+++|+++|++|+..+++-....
T Consensus 178 vClPL~~~~~~s~g~y~~~~l~~N~lafiiia~~Y~~iy~~l~~~~~~~------------------------------- 226 (363)
T KOG2087|consen 178 VCLPLHIEEPLSTGYYLVALLGLNLLAFIIIAFSYGKIYCSLRKGDLSA------------------------------- 226 (363)
T ss_pred eeeecccCCccchhHHHHHHHHHHHHHHHHHHHHhhhhheeeecCCCcc-------------------------------
Confidence 78662 233355566666678899999999999999998711000
Q ss_pred ccccccCCCCCCCCCCCCcccccCCCCCcCCCCCCceeccCCCCCCCCCccccCCCCCCccceeccCCCCCCCCCCcccc
Q psy4354 157 FRIKKCTNVVPPSPNKLSINVIDEDNGINNATTSSSLILADGHSNSDADRRTSINNEANTAFTITHNNGASQSNHNNECV 236 (342)
Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 236 (342)
T Consensus 227 -------------------------------------------------------------------------------- 226 (363)
T KOG2087|consen 227 -------------------------------------------------------------------------------- 226 (363)
T ss_pred --------------------------------------------------------------------------------
Confidence
Q ss_pred ccccCCCCCcccchhhHHHHHHhhHHHHHHHHHHHHHHhhhhHHHHHHHHhhcCCCCchhHHHHHHHHHHhhcccccchh
Q psy4354 237 QVKHKIPPTKKEKKESLEAKRERKAAKTLAIITGAFVICWLPFFMMALLLPLCQTCYISDYLASFFLWLGYFNSTLNPVI 316 (342)
Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~~~~k~~k~l~~v~~~f~~cw~P~~i~~~~~~~~~~~~~~~~~~~~~~~l~~~ns~vNPii 316 (342)
......++|-++.+++.-.+||.|..++.+...+..+.........+..++.-+|+|+||++
T Consensus 227 ------------------~~~~~~~akr~a~LvfTd~icw~Pi~f~~~~al~~~~li~~~~sk~llv~flPlns~~NP~L 288 (363)
T KOG2087|consen 227 ------------------TLISTSVAKRMAFLVFTDCICWCPIAFFKFSALIGVELISVSYSKWLLVFFLPLNSCLNPFL 288 (363)
T ss_pred ------------------ccchhhhhhCeeEEEEccccccCchheeeeHHhcCCcccChhhceeEEEEEEEcccccCchh
Confidence 00114677778889999999999988888877666554333333334455677999999999
Q ss_pred hhccChhHHHHHHHHHhcC
Q psy4354 317 YTVFSPEFRQAFKRILCGS 335 (342)
Q Consensus 317 Y~~~n~~fR~~~~~ll~~~ 335 (342)
|++.++.||+.++.++.+.
T Consensus 289 Ya~fT~~fk~d~~~l~~k~ 307 (363)
T KOG2087|consen 289 YAFFTPVFKEDLFLLLSKV 307 (363)
T ss_pred HHHcCHHHHHHHHHHHhhc
Confidence 9999999999999998554
No 10
>PF10324 7TM_GPCR_Srw: Serpentine type 7TM GPCR chemoreceptor Srw; InterPro: IPR019427 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf'. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class w (Srw), which is a solo family amongst the superfamilies of chemoreceptors. The genes encoding Srw do not appear to be under as strong an adaptive evolutionary pressure as those of Srz.
Probab=99.68 E-value=2.6e-16 Score=139.28 Aligned_cols=209 Identities=20% Similarity=0.251 Sum_probs=154.8
Q ss_pred hhHHHhhHHHHHhHHHHhhheeeceeEEEeecccc---cccchhHHHHHHHHHHHHHHHHHHHhHhcccCCCcc-cccCc
Q psy4354 5 LFTGVLSHYNVDMLGYSTVIMFVDKYWAVTNVDYI---HTRNANRIIMMIVVVWSVAFIVSLAPQLGWKDPEYM-DRINQ 80 (342)
Q Consensus 5 ~~~~~l~~~~~~~S~~~l~~IaidRY~aI~~~~y~---~~~t~r~~~~~i~~~W~~s~~~~~p~~~~~~~~~~~-~~~~~ 80 (342)
++...+...+...|+|..++||+.||++|.+|... ...+.+.+..++.++++++++..++.++.+...+.. ...+.
T Consensus 79 ~~~~~l~~~~~~~S~WL~V~mA~iR~l~i~~p~~~~~~~l~~~k~~~~~i~~v~~~s~~~~~~~~~~~~i~~~~~~~~p~ 158 (318)
T PF10324_consen 79 LIMESLSDIFRRISIWLGVLMALIRYLSIKFPMSSRFQKLSKPKFAIIVILIVFIISFLFSIPYFFRYKIVEVSDPWVPP 158 (318)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCCCCeeeeehHHHHHHHHHHHHHhhceEEEEeccccccCC
Confidence 45567888889999999999999999999985322 334677788888999999999998766555433222 12224
Q ss_pred cceec---c-CC--------c-----------hhHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccCCC
Q psy4354 81 QKCMV---S-QD--------V-----------GYQIFATCSTFYVPLLVILVLYWKIYQTARKRIRRRRQQRNVLMAGKK 137 (342)
Q Consensus 81 ~~C~~---~-~~--------~-----------~~~~~~~~~~~~ip~~~i~~~y~~I~~~vr~~~~~~~~~~~~~~~~~~ 137 (342)
..|.. . .. . .+.....++.-++|.++..++...+.+.+|+..+++.....
T Consensus 159 ~~C~~~~~~~~~~~Y~~~~~~~~~~~~~~~~~~~~~~~gi~~kiiP~il~~ilti~Li~~Lrk~~~~r~~~~~------- 231 (318)
T PF10324_consen 159 PNCSGFPENYTFPRYMLNISELFTENDCLFFRIYFFIDGIFFKIIPCILLPILTILLIIELRKAKKRRKKLSS------- 231 (318)
T ss_pred CceeeccccccccccchhhhhhhhhhHHHHHHHHHHhhhhHhhhhhHHHHHHHHHHHHHHHHhccHhhhcccc-------
Confidence 55751 0 00 0 11122223446899999999999999999997776665110
Q ss_pred CCCCCCCccceeeccccccccccccCCCCCCCCCCCCcccccCCCCCcCCCCCCceeccCCCCCCCCCccccCCCCCCcc
Q psy4354 138 PDTSDNKTSHFIFFKKRKFFRIKKCTNVVPPSPNKLSINVIDEDNGINNATTSSSLILADGHSNSDADRRTSINNEANTA 217 (342)
Q Consensus 138 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 217 (342)
T Consensus 232 -------------------------------------------------------------------------------- 231 (318)
T PF10324_consen 232 -------------------------------------------------------------------------------- 231 (318)
T ss_pred --------------------------------------------------------------------------------
Confidence
Q ss_pred ceeccCCCCCCCCCCccccccccCCCCCcccchhhHHHHHHhhHHHHHHHHHHHHHHhhhhHHHHHHHHhhcCCC----C
Q psy4354 218 FTITHNNGASQSNHNNECVQVKHKIPPTKKEKKESLEAKRERKAAKTLAIITGAFVICWLPFFMMALLLPLCQTC----Y 293 (342)
Q Consensus 218 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~k~l~~v~~~f~~cw~P~~i~~~~~~~~~~~----~ 293 (342)
+ ..+++.+.+++++.+++.|+++-+|+.+..++..+..+. .
T Consensus 232 ----------------------------------~-~~~~~~~tt~li~~~ti~f~i~e~p~gi~~~~~~~~~~~~~~~~ 276 (318)
T PF10324_consen 232 ----------------------------------S-KSKKSDRTTKLILFMTISFLISELPQGIIFLLESFFEEDSGLIF 276 (318)
T ss_pred ----------------------------------c-ccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcchhH
Confidence 0 024567899999999999999999999999997764321 2
Q ss_pred chhHHHHHHHHHHhhcccccchhhhccChhHHHHHHHHHhcC
Q psy4354 294 ISDYLASFFLWLGYFNSTLNPVIYTVFSPEFRQAFKRILCGS 335 (342)
Q Consensus 294 ~~~~~~~~~~~l~~~ns~vNPiiY~~~n~~fR~~~~~ll~~~ 335 (342)
.......+...|..+|+..|++||.+++.+||++++++|+||
T Consensus 277 ~~~~~~~~~~~l~~~ns~~h~~ic~~mSsqYR~t~~~~f~~k 318 (318)
T PF10324_consen 277 IIIQLSIIFNILITINSSIHFFICCFMSSQYRKTVKKLFGCK 318 (318)
T ss_pred HHHHHHHHHHHHHHHHhhhHHHhhhhhhHHHHHHHHHHhccC
Confidence 345566677889999999999999999999999999999986
No 11
>PF10320 7TM_GPCR_Srsx: Serpentine type 7TM GPCR chemoreceptor Srsx; InterPro: IPR019424 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf'. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class sx (Srsx), which is a solo family amongst the superfamilies of chemoreceptors.
Probab=99.60 E-value=2e-14 Score=122.57 Aligned_cols=193 Identities=13% Similarity=0.211 Sum_probs=134.7
Q ss_pred hhhhHHHhhHHHHHhHHHHhhheeeceeEEEee-cccccccchhHHHHHHHHHHHHHHHHHHHhHhcccCCCcccccCcc
Q psy4354 3 QLLFTGVLSHYNVDMLGYSTVIMFVDKYWAVTN-VDYIHTRNANRIIMMIVVVWSVAFIVSLAPQLGWKDPEYMDRINQQ 81 (342)
Q Consensus 3 ~~~~~~~l~~~~~~~S~~~l~~IaidRY~aI~~-~~y~~~~t~r~~~~~i~~~W~~s~~~~~p~~~~~~~~~~~~~~~~~ 81 (342)
+++...+...++..++...+++|++||++||+. .+|+...+.+...++.....+.+....+ +++-.. ++....
T Consensus 60 ~Cf~~~~~~~f~~~~qs~~~l~i~iDr~iaV~~P~~Y~~~~~~~y~~~~~~~~~~~s~~~~~---~~~~~~---~~~~~~ 133 (257)
T PF10320_consen 60 ECFWQIFFYIFFQCAQSVIMLAIAIDRLIAVCFPLRYRTISTRKYLIILLIFPVIYSIFFTV---IGFLYR---DDETIV 133 (257)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhheeeEeehhhhhhcccccchhhHhHHHHHHHHHHHh---heeEec---CCcccc
Confidence 356666777888899999999999999999999 6887777777555555555555444432 222221 111367
Q ss_pred ceeccC---CchhHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccCCCCCCCCCCccceeecccccccc
Q psy4354 82 KCMVSQ---DVGYQIFATCSTFYVPLLVILVLYWKIYQTARKRIRRRRQQRNVLMAGKKPDTSDNKTSHFIFFKKRKFFR 158 (342)
Q Consensus 82 ~C~~~~---~~~~~~~~~~~~~~ip~~~i~~~y~~I~~~vr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (342)
.|.+.. ...+.++... ..++ -+++++.|...+..++++.++
T Consensus 134 ~C~pp~a~~~~~~~~~~~~-~~~i-nv~tvivY~i~~~~~~~k~~~---------------------------------- 177 (257)
T PF10320_consen 134 ICNPPLAFHGTASQIWSYS-NIII-NVITVIVYIITIIIFKRKSRS---------------------------------- 177 (257)
T ss_pred cCCCccccCccHHHHHHHH-HHHH-HHHHHHHHHHHHHHHHHHccc----------------------------------
Confidence 898853 3344444333 2233 355677888888877774321
Q ss_pred ccccCCCCCCCCCCCCcccccCCCCCcCCCCCCceeccCCCCCCCCCccccCCCCCCccceeccCCCCCCCCCCcccccc
Q psy4354 159 IKKCTNVVPPSPNKLSINVIDEDNGINNATTSSSLILADGHSNSDADRRTSINNEANTAFTITHNNGASQSNHNNECVQV 238 (342)
Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 238 (342)
T Consensus 178 -------------------------------------------------------------------------------- 177 (257)
T PF10320_consen 178 -------------------------------------------------------------------------------- 177 (257)
T ss_pred --------------------------------------------------------------------------------
Confidence
Q ss_pred ccCCCCCcccchhhHHHHHHhhHHHHHHHHHHHHHHhhhhHHHHHHHHhhcCC-CCchhHHHHHHHHHHhhcccccchhh
Q psy4354 239 KHKIPPTKKEKKESLEAKRERKAAKTLAIITGAFVICWLPFFMMALLLPLCQT-CYISDYLASFFLWLGYFNSTLNPVIY 317 (342)
Q Consensus 239 ~~~~~~~~~~~~~~~~~~~~~k~~k~l~~v~~~f~~cw~P~~i~~~~~~~~~~-~~~~~~~~~~~~~l~~~ns~vNPiiY 317 (342)
..+.++|+.|.+.+.+++|+++|.=..+...+....+. ......+.....++..+|.+.|.+||
T Consensus 178 ---------------~~~~~~kv~ksL~v~v~i~i~~w~~s~~~~~v~~~~~~~~~~~~~i~~~~~i~v~~~~s~~ffV~ 242 (257)
T PF10320_consen 178 ---------------NSSRSKKVFKSLKVTVIIFIFSWFLSQIINTVSLALGLDGETIAIIQMYAGIFVNISYSQNFFVY 242 (257)
T ss_pred ---------------cchhHHHHHHHhhhheeeeeHHHHHHHHHHHHHHHhCCcHHHHHHHHHHHHHHHHHHHHHHheEE
Confidence 12457899999999999999999887766666544433 22334466667789999999999999
Q ss_pred hccChhHHHHHHHHH
Q psy4354 318 TVFSPEFRQAFKRIL 332 (342)
Q Consensus 318 ~~~n~~fR~~~~~ll 332 (342)
.++|+|||++++++|
T Consensus 243 ~~~S~EYR~af~~~~ 257 (257)
T PF10320_consen 243 YWRSSEYRKAFRELF 257 (257)
T ss_pred EEcCHHHHHHHHHhC
Confidence 999999999999975
No 12
>PF10323 7TM_GPCR_Srv: Serpentine type 7TM GPCR chemoreceptor Srv; InterPro: IPR019426 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf'. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class v (Srv) from the Srg superfamily. Srg receptors contain seven hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures.
Probab=99.46 E-value=8.6e-13 Score=114.08 Aligned_cols=198 Identities=20% Similarity=0.292 Sum_probs=134.6
Q ss_pred HHHHHhHHHHhhheeeceeEEEeecc--cccccchhHHHHHHHHHHHHHHHHHHHhHhcccCCCcccccCccceeccCC-
Q psy4354 12 HYNVDMLGYSTVIMFVDKYWAVTNVD--YIHTRNANRIIMMIVVVWSVAFIVSLAPQLGWKDPEYMDRINQQKCMVSQD- 88 (342)
Q Consensus 12 ~~~~~~S~~~l~~IaidRY~aI~~~~--y~~~~t~r~~~~~i~~~W~~s~~~~~p~~~~~~~~~~~~~~~~~~C~~~~~- 88 (342)
++...+.....+.+|++||.||++|. +.+..++.+...++++-|+.++++++ +++......+....+ ..-..+.+
T Consensus 83 y~~~~~~~~gi~lls~nR~~ai~~P~~~~~~~~~~~~~~~i~~i~wi~p~li~~-~~~~~~~~~f~~~~~-~~~~~d~~~ 160 (283)
T PF10323_consen 83 YYFLYIQCIGIVLLSLNRYLAICFPTSRHTKFWQPAKIWIIILIQWIPPLLISL-PFFFDTDFYFDNEEN-MSLFVDPEF 160 (283)
T ss_pred HHHHHHHHHhHHHHHHhhhheEEeecHHHhhhccccchhheeeeeehhhhhhee-eeeccCceeeecccc-eeeecCHHH
Confidence 44555667789999999999999954 66677888999999999999999999 455555555544322 11111111
Q ss_pred chhHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccCCCCCCCCCCccceeeccccccccccccCCCCCC
Q psy4354 89 VGYQIFATCSTFYVPLLVILVLYWKIYQTARKRIRRRRQQRNVLMAGKKPDTSDNKTSHFIFFKKRKFFRIKKCTNVVPP 168 (342)
Q Consensus 89 ~~~~~~~~~~~~~ip~~~i~~~y~~I~~~vr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 168 (342)
.............+-++..+++|+.++..+|++.+...
T Consensus 161 ~~~~~~~~~~~~~~~cv~~iv~Y~~i~~~iRk~~k~~s------------------------------------------ 198 (283)
T PF10323_consen 161 IQRNFLIAFIFVSVTCVICIVCYGIIFIFIRKRNKKKS------------------------------------------ 198 (283)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhh------------------------------------------
Confidence 11111222344455667889999999999998764111
Q ss_pred CCCCCCcccccCCCCCcCCCCCCceeccCCCCCCCCCccccCCCCCCccceeccCCCCCCCCCCccccccccCCCCCccc
Q psy4354 169 SPNKLSINVIDEDNGINNATTSSSLILADGHSNSDADRRTSINNEANTAFTITHNNGASQSNHNNECVQVKHKIPPTKKE 248 (342)
Q Consensus 169 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 248 (342)
T Consensus 199 -------------------------------------------------------------------------------- 198 (283)
T PF10323_consen 199 -------------------------------------------------------------------------------- 198 (283)
T ss_pred --------------------------------------------------------------------------------
Confidence
Q ss_pred chhhHHHHHHhhHHHHHHHHHHHHHHhhhhHHHHHHHHhhcCCCCchhHHHHHHHHHHhhcccccchhhhccChhHHHHH
Q psy4354 249 KKESLEAKRERKAAKTLAIITGAFVICWLPFFMMALLLPLCQTCYISDYLASFFLWLGYFNSTLNPVIYTVFSPEFRQAF 328 (342)
Q Consensus 249 ~~~~~~~~~~~k~~k~l~~v~~~f~~cw~P~~i~~~~~~~~~~~~~~~~~~~~~~~l~~~ns~vNPiiY~~~n~~fR~~~ 328 (342)
++.++..++|.+.+.+.++.++++++..+=+. ........++......+..+.-.+..+.|.+||+.-.+.|+++|+++
T Consensus 199 ~~~s~~~~rE~~L~~~~~i~~~a~~~~~~~~~-~~~~~~~~~~~~~~~~~r~~y~~~~~~~s~inP~~LLi~n~~lr~~~ 277 (283)
T PF10323_consen 199 KSSSRSRRREIRLAIQVFILFCAFFVILVYYI-FSNYFAQNFNTDPIFYLRAFYPILNGLLSFINPWMLLIFNKDLRKQV 277 (283)
T ss_pred HhhhhhhhHhHHHHHHHHHHHHHHHHHHHHHH-HHHHHHhcccchHHHHHHHHHHHHHHHHHhhhhHHhhhccHHHHHHH
Confidence 11223567899999998888877666554443 33333343333223333346667788899999999999999999999
Q ss_pred HHHHhc
Q psy4354 329 KRILCG 334 (342)
Q Consensus 329 ~~ll~~ 334 (342)
++.++|
T Consensus 278 ~~~~~~ 283 (283)
T PF10323_consen 278 RRMLKC 283 (283)
T ss_pred HHHcCC
Confidence 999987
No 13
>PF10328 7TM_GPCR_Srx: Serpentine type 7TM GPCR chemoreceptor Srx; InterPro: IPR019430 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf'. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class x (Srx) from the Srg superfamily. Srg receptors contain seven hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures.
Probab=99.34 E-value=4.1e-11 Score=103.65 Aligned_cols=198 Identities=13% Similarity=0.212 Sum_probs=137.3
Q ss_pred hhhHHHhhHHHHHhHHHHhhheeeceeEEEee-cccccccchhHHHHHHHHHHHHHHHHHHHhHhcccC-CCcccc----
Q psy4354 4 LLFTGVLSHYNVDMLGYSTVIMFVDKYWAVTN-VDYIHTRNANRIIMMIVVVWSVAFIVSLAPQLGWKD-PEYMDR---- 77 (342)
Q Consensus 4 ~~~~~~l~~~~~~~S~~~l~~IaidRY~aI~~-~~y~~~~t~r~~~~~i~~~W~~s~~~~~p~~~~~~~-~~~~~~---- 77 (342)
..+.+++...+...+.++-++||++|++||+. .+|.+..+.+.+.++++++|++++....++.+.-+. ..++++
T Consensus 67 s~~~g~~~~~~y~~~~~~~~liaiNRf~ai~fP~~y~~~fs~~~T~~~i~~~~~~~~~~~~~~~~~~~C~~~y~~~~~~~ 146 (274)
T PF10328_consen 67 SIIFGFIGMFCYFIGPLSHLLIAINRFCAIFFPFKYKKIFSFKNTIILIAFIWLLSIIISTILYFPDGCYFYYDPETWSW 146 (274)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHheeeeHHHHHhHcCccceehhhhHHHHHHHHHHHHhhhcCCCcceeccceeee
Confidence 45567888888889999999999999999999 677888899999999999999999777655542221 111111
Q ss_pred --cCccceeccCCchhHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccCCCCCCCCCCccceeeccccc
Q psy4354 78 --INQQKCMVSQDVGYQIFATCSTFYVPLLVILVLYWKIYQTARKRIRRRRQQRNVLMAGKKPDTSDNKTSHFIFFKKRK 155 (342)
Q Consensus 78 --~~~~~C~~~~~~~~~~~~~~~~~~ip~~~i~~~y~~I~~~vr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (342)
.+...|.. +..+.......+-.+++.+++..++.++++.+++...
T Consensus 147 ~~~~~~~C~~-----~~~~~~~~~~~~~~~~~~~lni~t~ikl~~~~~~~~~---------------------------- 193 (274)
T PF10328_consen 147 SYPTDPPCGN-----YSWYFDFYKNFILVIISNILNIITFIKLRKFRKKISV---------------------------- 193 (274)
T ss_pred ecCCCCccch-----hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccc----------------------------
Confidence 01224433 2222223333444567888999999999987665511
Q ss_pred cccccccCCCCCCCCCCCCcccccCCCCCcCCCCCCceeccCCCCCCCCCccccCCCCCCccceeccCCCCCCCCCCccc
Q psy4354 156 FFRIKKCTNVVPPSPNKLSINVIDEDNGINNATTSSSLILADGHSNSDADRRTSINNEANTAFTITHNNGASQSNHNNEC 235 (342)
Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 235 (342)
T Consensus 194 -------------------------------------------------------------------------------- 193 (274)
T PF10328_consen 194 -------------------------------------------------------------------------------- 193 (274)
T ss_pred --------------------------------------------------------------------------------
Confidence
Q ss_pred cccccCCCCCcccchhhHHHHHHhhHHHHHHHHHHHHHHhhhhHHHHHHHHhhcCCCCchhHHHHHH-HHHHhhcccccc
Q psy4354 236 VQVKHKIPPTKKEKKESLEAKRERKAAKTLAIITGAFVICWLPFFMMALLLPLCQTCYISDYLASFF-LWLGYFNSTLNP 314 (342)
Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~~k~~k~l~~v~~~f~~cw~P~~i~~~~~~~~~~~~~~~~~~~~~-~~l~~~ns~vNP 314 (342)
.+...++++++|.+..+-.++-.+.|++.++=+.+..-+ .+ ..+...+. ...-.+..++|+
T Consensus 194 -----------~~~~s~~r~rke~~f~~Qs~~Q~~~~~i~~~~~~~~~~~---~~----~~~~~F~~~t~~w~~~h~~DG 255 (274)
T PF10328_consen 194 -----------SSSESKKRRRKEIRFFIQSFIQDLLYLIDLIFYFFIPPL---SS----NRWWQFFCTTFSWVLVHALDG 255 (274)
T ss_pred -----------cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---cc----ccHHHHHHHHHHHHHHHHhcc
Confidence 111233366789999999999999999999887766443 22 13333333 333445677899
Q ss_pred hhhhccChhHHHHHHHHH
Q psy4354 315 VIYTVFSPEFRQAFKRIL 332 (342)
Q Consensus 315 iiY~~~n~~fR~~~~~ll 332 (342)
+|+.+.|+|+|+.+++..
T Consensus 256 ~i~l~fN~~~r~~~~~~~ 273 (274)
T PF10328_consen 256 LIMLIFNSEIRRKIRKKK 273 (274)
T ss_pred eeEeEEcHHHHHHHHhcc
Confidence 999999999999998754
No 14
>PF05296 TAS2R: Mammalian taste receptor protein (TAS2R); InterPro: IPR007960 This family consists of several forms of mammalian taste receptor proteins (TAS2Rs). TAS2Rs are G protein-coupled receptors expressed in subsets of taste receptor cells of the tongue and palate epithelia and are organised in the genome in clusters. The proteins are genetically linked to loci that influence bitter perception in mice and humans.; GO: 0004930 G-protein coupled receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0050909 sensory perception of taste, 0016021 integral to membrane
Probab=99.05 E-value=2.1e-08 Score=87.58 Aligned_cols=206 Identities=14% Similarity=0.107 Sum_probs=132.3
Q ss_pred HhhHHHHHhHHHHhhheeeceeEEEeecccccc-cchhH----HHHHHHHHHHHHHHHHHHhHhcccCC-CcccccCccc
Q psy4354 9 VLSHYNVDMLGYSTVIMFVDKYWAVTNVDYIHT-RNANR----IIMMIVVVWSVAFIVSLAPQLGWKDP-EYMDRINQQK 82 (342)
Q Consensus 9 ~l~~~~~~~S~~~l~~IaidRY~aI~~~~y~~~-~t~r~----~~~~i~~~W~~s~~~~~p~~~~~~~~-~~~~~~~~~~ 82 (342)
++..++...|.|..+.+++-=++-|+...++-. .=|+| +..++.++++++++..++....+... ..........
T Consensus 87 ~~~~f~~~~s~W~tt~LsvfYcvKI~~fs~~~Fl~LK~rI~~~v~~lLl~s~l~s~~~~~~~~~~~~~~~~~~~~~~~~N 166 (303)
T PF05296_consen 87 FLWMFSNSSSLWFTTWLSVFYCVKIANFSHPFFLWLKRRISKVVPWLLLGSLLISFLNLLSIPLFIDNHINNNNTNNSRN 166 (303)
T ss_pred HHHHHHhHHHHHHHHHHHHHHheeeecCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhheeeeeccccccCCCc
Confidence 445566777899999999999999998323221 11222 33566777777773333222223221 1000011111
Q ss_pred eec--c-CCc--hhHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccCCCCCCCCCCccceeeccccccc
Q psy4354 83 CMV--S-QDV--GYQIFATCSTFYVPLLVILVLYWKIYQTARKRIRRRRQQRNVLMAGKKPDTSDNKTSHFIFFKKRKFF 157 (342)
Q Consensus 83 C~~--~-~~~--~~~~~~~~~~~~ip~~~i~~~y~~I~~~vr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (342)
+.. . ... .+......+..++|+++++++-..+...++||.|+++.....
T Consensus 167 ~t~~~~~~~~~~~~~~~~~~~~~~lPf~i~l~s~~lli~SL~rH~r~M~~n~~g-------------------------- 220 (303)
T PF05296_consen 167 STSNFQESKSSYFYFFILFNLGSFLPFLIFLVSSILLIFSLWRHMRRMQKNATG-------------------------- 220 (303)
T ss_pred ceEEeecchHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHhhCCCCC--------------------------
Confidence 111 1 111 223333347789999999999999999999999998872110
Q ss_pred cccccCCCCCCCCCCCCcccccCCCCCcCCCCCCceeccCCCCCCCCCccccCCCCCCccceeccCCCCCCCCCCccccc
Q psy4354 158 RIKKCTNVVPPSPNKLSINVIDEDNGINNATTSSSLILADGHSNSDADRRTSINNEANTAFTITHNNGASQSNHNNECVQ 237 (342)
Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 237 (342)
T Consensus 221 -------------------------------------------------------------------------------- 220 (303)
T PF05296_consen 221 -------------------------------------------------------------------------------- 220 (303)
T ss_pred --------------------------------------------------------------------------------
Confidence
Q ss_pred cccCCCCCcccchhhHHHHHHhhHHHHHHHHHHHHHHhhhhHHHHHHHHhhcCCCCchhHHHHHHHHHHhhcccccchhh
Q psy4354 238 VKHKIPPTKKEKKESLEAKRERKAAKTLAIITGAFVICWLPFFMMALLLPLCQTCYISDYLASFFLWLGYFNSTLNPVIY 317 (342)
Q Consensus 238 ~~~~~~~~~~~~~~~~~~~~~~k~~k~l~~v~~~f~~cw~P~~i~~~~~~~~~~~~~~~~~~~~~~~l~~~ns~vNPiiY 317 (342)
.+..+.+.+.|++|+++...+.|+++++-..+..+ ....++ ......+...+.++.+...|++-
T Consensus 221 ------------~~~ps~~aH~~a~k~~~sfl~ly~~~~~~~~~~~~-~~~~~~---~~~~~~~~~~i~~~yps~hs~iL 284 (303)
T PF05296_consen 221 ------------FRDPSTEAHIRAIKTMISFLILYIIYFLSLILSFL-SFFFPE---NSIWFWVCEIIIALYPSGHSIIL 284 (303)
T ss_pred ------------CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHhcc---ccHHHHHHHHHHHHHHHHHHHHH
Confidence 00013478999999999988888887766443222 222222 24455677788999999999999
Q ss_pred hccChhHHHHHHHHHhcCC
Q psy4354 318 TVFSPEFRQAFKRILCGSP 336 (342)
Q Consensus 318 ~~~n~~fR~~~~~ll~~~~ 336 (342)
.+-|+++|++++++++|.|
T Consensus 285 Ilgn~KLr~~~~~il~~~k 303 (303)
T PF05296_consen 285 ILGNPKLRQALLKILWCLK 303 (303)
T ss_pred hcCchHHHHHHHHHHhhcC
Confidence 9999999999999999875
No 15
>PF05462 Dicty_CAR: Slime mold cyclic AMP receptor
Probab=98.95 E-value=2.1e-07 Score=80.59 Aligned_cols=70 Identities=11% Similarity=0.191 Sum_probs=54.0
Q ss_pred HHHHHHHHHHhhhhHHHHHHHHhhcCCCCchhHHHHHHHHHHhhcccccchhhhccChhHHHHHHHHHhcCCC
Q psy4354 265 LAIITGAFVICWLPFFMMALLLPLCQTCYISDYLASFFLWLGYFNSTLNPVIYTVFSPEFRQAFKRILCGSPN 337 (342)
Q Consensus 265 l~~v~~~f~~cw~P~~i~~~~~~~~~~~~~~~~~~~~~~~l~~~ns~vNPiiY~~~n~~fR~~~~~ll~~~~~ 337 (342)
+..-.++|++||+|-.|..+...+... +.+...+-..+..+...+|.++|++.|+..++.+...+++.+.
T Consensus 204 L~~Yp~ifiicw~fa~INRI~~~~~~~---~~~l~~Lh~~~s~lqGf~nsivy~~n~~~~~~~~~~~~~~~~~ 273 (303)
T PF05462_consen 204 LVNYPLIFIICWIFATINRIYNFIGKN---PFWLSVLHVGFSPLQGFFNSIVYGYNNSLMWRYLGSKILCQFT 273 (303)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCC---chHHHHHHHHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHHhhc
Confidence 556889999999999999998876432 3444444456777788999999999999988888877755543
No 16
>PF10321 7TM_GPCR_Srt: Serpentine type 7TM GPCR chemoreceptor Srt; InterPro: IPR019425 Chemoreception is mediated in Caenorhabditis elegans by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs) of proteins which are of the serpentine type. Srt is a member of the Srg superfamily of chemoreceptors. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf'.
Probab=98.95 E-value=4e-09 Score=91.66 Aligned_cols=80 Identities=16% Similarity=0.149 Sum_probs=63.9
Q ss_pred HHHHHhhHHHHHHHHHHHHHHhhhhHHHHHHHHhhcCCCCchhHHHHHHHHHHhhcccccchhhhccChhHHHHHHHHHh
Q psy4354 254 EAKRERKAAKTLAIITGAFVICWLPFFMMALLLPLCQTCYISDYLASFFLWLGYFNSTLNPVIYTVFSPEFRQAFKRILC 333 (342)
Q Consensus 254 ~~~~~~k~~k~l~~v~~~f~~cw~P~~i~~~~~~~~~~~~~~~~~~~~~~~l~~~ns~vNPiiY~~~n~~fR~~~~~ll~ 333 (342)
..+.+++...-..++++.+.+.++=|....++ ..+.++..+....--+++..+|+||..+|+..|++++++++
T Consensus 234 ~~k~~~qI~iQs~iIC~f~~i~a~iyv~m~f~-------~~p~~~i~~~~~~Wql~~g~~~iIYl~lNrtIR~~~~k~~~ 306 (313)
T PF10321_consen 234 LSKAQRQIFIQSVIICFFHAIAAVIYVYMQFF-------PPPPWLIIIGQISWQLSHGCPPIIYLTLNRTIRNSVLKMLG 306 (313)
T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHHHeeeee-------cccHHHHHHHHHHHhccCCccceEEEEECHHHHHHHHHHHc
Confidence 34667888888888888888876665544432 13577778888888899999999999999999999999999
Q ss_pred cCCCCCC
Q psy4354 334 GSPNRGR 340 (342)
Q Consensus 334 ~~~~~~~ 340 (342)
.++.++|
T Consensus 307 ~k~~r~~ 313 (313)
T PF10321_consen 307 PKKIRKK 313 (313)
T ss_pred cccccCC
Confidence 8776654
No 17
>PF02118 Srg: Srg family chemoreceptor; InterPro: IPR000609 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf'. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class g (Srg) from the Srg superfamily. Srg receptors contain seven hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures.
; GO: 0004888 transmembrane signaling receptor activity, 0007606 sensory perception of chemical stimulus, 0016020 membrane
Probab=98.30 E-value=4.1e-06 Score=72.56 Aligned_cols=114 Identities=11% Similarity=0.106 Sum_probs=63.6
Q ss_pred HHhhHHHHHhHHHHhhheeeceeEEEee-cccccccchhHHHHHHHHHHHHHHHHHHHhHhcccCCCcccccCccceecc
Q psy4354 8 GVLSHYNVDMLGYSTVIMFVDKYWAVTN-VDYIHTRNANRIIMMIVVVWSVAFIVSLAPQLGWKDPEYMDRINQQKCMVS 86 (342)
Q Consensus 8 ~~l~~~~~~~S~~~l~~IaidRY~aI~~-~~y~~~~t~r~~~~~i~~~W~~s~~~~~p~~~~~~~~~~~~~~~~~~C~~~ 86 (342)
.++.+.+..+...+...|+++|+-+|.. .+|.+..+ |....+++++.++++....+.+..-....+.++.........
T Consensus 85 ~~l~~~~~~~Q~~~~~~is~nR~t~v~~p~~~~~~W~-~~~~~~i~~i~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 163 (275)
T PF02118_consen 85 YFLQYYFAYVQYLSTILISLNRFTSVLFPIRYEKFWK-RYYWIIIIIIFLLPFSFTWNIFISPTYVVYDNGGFSYSYNDT 163 (275)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhHHHH-hhhhhheeeeeehhHHHHHHHHccccEEEEECCceEEEEEec
Confidence 4567778888899999999999999999 55555555 444556666677776655433332222222222111111111
Q ss_pred CCchhHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHH
Q psy4354 87 QDVGYQIFATCSTFYVPLLVILVLYWKIYQTARKRI 122 (342)
Q Consensus 87 ~~~~~~~~~~~~~~~ip~~~i~~~y~~I~~~vr~~~ 122 (342)
.+...........+++-+++.++++....+++++..
T Consensus 164 ~~~~~~~~~~~~~~i~~~ii~i~~~~~~~~~l~~~~ 199 (275)
T PF02118_consen 164 VSWASLSIFSLIYFIIIIIITIITNIITYRRLRKLS 199 (275)
T ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Confidence 111112222233344445566667777777776543
No 18
>KOG4193|consensus
Probab=98.27 E-value=1.6e-05 Score=75.04 Aligned_cols=201 Identities=13% Similarity=0.182 Sum_probs=128.7
Q ss_pred hhhHHHhhHHHHHhHHHHhhheeeceeEEEeecccccccchhHHHHHHHHHHHHHHHHHHHhHhcccCCCcccccCccce
Q psy4354 4 LLFTGVLSHYNVDMLGYSTVIMFVDKYWAVTNVDYIHTRNANRIIMMIVVVWSVAFIVSLAPQLGWKDPEYMDRINQQKC 83 (342)
Q Consensus 4 ~~~~~~l~~~~~~~S~~~l~~IaidRY~aI~~~~y~~~~t~r~~~~~i~~~W~~s~~~~~p~~~~~~~~~~~~~~~~~~C 83 (342)
+...+++.+++..+..+=+.++++.=|+.++. +......++.....+..|+.++++...........+. .......|
T Consensus 386 C~~~a~llhff~LaaF~Wm~leg~hl~~~~v~--vf~~~~~~~~l~~~~~gwg~Pavvv~Isa~~~~~~~~-~~~~~~~C 462 (610)
T KOG4193|consen 386 CIAAAILLHFFFLAAFFWMLLEGFHLYLLLVE--VFRSRPRRRKLLYSLYGWGVPAVVVGVSALVDPDLEG-QYGTPRVC 462 (610)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HhccccchHHHHHHHHHhhhhHHHHhheeEEeccCcc-ccccCCce
Confidence 45667777888888888888899988885554 3444555555666569999998877744433332211 11123448
Q ss_pred eccCCchhHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccCCCCCCCCCCccceeeccccccccccccC
Q psy4354 84 MVSQDVGYQIFATCSTFYVPLLVILVLYWKIYQTARKRIRRRRQQRNVLMAGKKPDTSDNKTSHFIFFKKRKFFRIKKCT 163 (342)
Q Consensus 84 ~~~~~~~~~~~~~~~~~~ip~~~i~~~y~~I~~~vr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 163 (342)
....+..+.+. |+.|+.+++.+++.++..+-++..++......
T Consensus 463 Wl~~~~~~~~~-----F~GPv~~ii~~Ni~~Fv~t~~~l~~~~~~~~~-------------------------------- 505 (610)
T KOG4193|consen 463 WLDTQNGFIWS-----FLGPVTLIILVNIVMFVVTLKKLLRRLSKLQP-------------------------------- 505 (610)
T ss_pred EEecCCceEEE-----EehHHHHHHHHHHHHHHHHHHHHhhcccccCc--------------------------------
Confidence 88655443333 78899999999977776665554433331100
Q ss_pred CCCCCCCCCCCcccccCCCCCcCCCCCCceeccCCCCCCCCCccccCCCCCCccceeccCCCCCCCCCCccccccccCCC
Q psy4354 164 NVVPPSPNKLSINVIDEDNGINNATTSSSLILADGHSNSDADRRTSINNEANTAFTITHNNGASQSNHNNECVQVKHKIP 243 (342)
Q Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 243 (342)
T Consensus 506 -------------------------------------------------------------------------------- 505 (610)
T KOG4193|consen 506 -------------------------------------------------------------------------------- 505 (610)
T ss_pred --------------------------------------------------------------------------------
Confidence
Q ss_pred CCcccchhhHHHHHHhhHHHHHHHHHHHHHHhhhhHHHHHHHHhhcCCCCchhHHHHHHHHHHhhcccccchhhhccChh
Q psy4354 244 PTKKEKKESLEAKRERKAAKTLAIITGAFVICWLPFFMMALLLPLCQTCYISDYLASFFLWLGYFNSTLNPVIYTVFSPE 323 (342)
Q Consensus 244 ~~~~~~~~~~~~~~~~k~~k~l~~v~~~f~~cw~P~~i~~~~~~~~~~~~~~~~~~~~~~~l~~~ns~vNPiiY~~~n~~ 323 (342)
..++....+..+..++..+++-+.|.=-+..+ .++ ....+.+++.++-.+....=.++|++++++
T Consensus 506 --------~~~~~~~~~~~~~~l~L~~lLGlTW~fgi~s~-----~~~--~~~v~~YlFti~NalQG~fIFi~~cll~~k 570 (610)
T KOG4193|consen 506 --------IASKLENISLIRSALALLFLLGLTWIFGIFSW-----LPG--TSVVFAYLFTIFNALQGVFIFIFHCLLRKK 570 (610)
T ss_pred --------chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----hcc--cchHHHHHHHHHHHhhhhHhhHhhhhhhHH
Confidence 00112222778888888889999987533221 121 234455566666666667778999999999
Q ss_pred HHHHHHHHHhcCCCCC
Q psy4354 324 FRQAFKRILCGSPNRG 339 (342)
Q Consensus 324 fR~~~~~ll~~~~~~~ 339 (342)
.|++.++.+||.+.+.
T Consensus 571 vr~~~~k~~~~~~~~~ 586 (610)
T KOG4193|consen 571 VRKEYRKWLCCGRGDS 586 (610)
T ss_pred HHHHHHHHhcccCCCC
Confidence 9999999999766543
No 19
>PF10317 7TM_GPCR_Srd: Serpentine type 7TM GPCR chemoreceptor Srd; InterPro: IPR019421 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf'. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents the chemoreceptor Srd.
Probab=98.25 E-value=9e-05 Score=64.72 Aligned_cols=75 Identities=19% Similarity=0.296 Sum_probs=61.7
Q ss_pred HHHHHhhHHHHHHHHHHHHHHhhhhHHHHHHHHhhcCCCCchhHHHHHHHHHHhhcccccchhhhccChhHHHHHHH
Q psy4354 254 EAKRERKAAKTLAIITGAFVICWLPFFMMALLLPLCQTCYISDYLASFFLWLGYFNSTLNPVIYTVFSPEFRQAFKR 330 (342)
Q Consensus 254 ~~~~~~k~~k~l~~v~~~f~~cw~P~~i~~~~~~~~~~~~~~~~~~~~~~~l~~~ns~vNPiiY~~~n~~fR~~~~~ 330 (342)
.++.+++.+|.+.+=.++=+++..|.....++..+.+. .......+...+..+.+.+||+++...=+.||++++|
T Consensus 218 tk~~h~~lv~~Lt~Q~~lP~~~~~p~~~~~~~~~~~~~--~~~~~e~~~~~~~~~~~~~~P~itl~fv~PYR~~i~r 292 (292)
T PF10317_consen 218 TKSMHRQLVKGLTIQALLPLFFYIPGVIIYFLSQFTGY--EHPFLEYLIFMLASLPPLIDPLITLYFVRPYRKAILR 292 (292)
T ss_pred HHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHhcc--ccHHHHHHHHHHHHHHHHhchHhheeeeHhHHHHhcC
Confidence 44678889999999999999999988877777666553 3556666777788899999999999999999999875
No 20
>PF10292 7TM_GPCR_Srab: Serpentine type 7TM GPCR receptor class ab chemoreceptor; InterPro: IPR019408 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf'. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr. Many of these proteins have homologues in Caenorhabditis briggsae. Srab is part of the Sra superfamily of chemoreceptors. The expression pattern of the srab genes is biologically intriguing.
Of the six promoters successfully expressed in transgenic organisms, one was exclusively expressed in the tail phasmid neurons, two were exclusively expressed in a head amphid neuron, and two were expressed both in the head and tail neurons as well as a limited number of other cells.
Probab=98.20 E-value=9.9e-05 Score=65.40 Aligned_cols=60 Identities=8% Similarity=0.184 Sum_probs=44.4
Q ss_pred chhhhHHHhhHHHHHhHHHHhhheeeceeEEEee-cccccccchhHHHHHHHHHHHHHHHHH
Q psy4354 2 IQLLFTGVLSHYNVDMLGYSTVIMFVDKYWAVTN-VDYIHTRNANRIIMMIVVVWSVAFIVS 62 (342)
Q Consensus 2 ~~~~~~~~l~~~~~~~S~~~l~~IaidRY~aI~~-~~y~~~~t~r~~~~~i~~~W~~s~~~~ 62 (342)
..+++.-....++...+.++.++|.+||++|-++ -.|++. ++.-..+++.+-|+++++..
T Consensus 94 ~~C~~lR~~~~~~~~~~~~t~v~l~IER~iAT~~~~~YE~~-~~~~Gi~l~~~qi~is~~~~ 154 (324)
T PF10292_consen 94 YRCFILRIPYNFGLFLVSFTTVSLVIERTIATFFSKSYEKS-GKWLGILLAFFQILISLLIL 154 (324)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCC-CccHHHHHHHHHHHHHHHHH
Confidence 3567778888899999999999999999999999 777654 33444455555566655543
No 21
>PF10318 7TM_GPCR_Srh: Serpentine type 7TM GPCR chemoreceptor Srh; InterPro: IPR019422 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf'. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr. Many of these proteins have homologues in Caenorhabditis briggsae. Srh is part of the Str superfamily of chemoreceptors.
Probab=98.03 E-value=0.00029 Score=61.90 Aligned_cols=78 Identities=10% Similarity=0.154 Sum_probs=56.5
Q ss_pred HHHHHhhHHHHHHHHHHHHHH-hhhhHHHHHHHHhhcCCCCchhHHHHHHHHHHhhcccccchhhhccChhHHHHHHHHH
Q psy4354 254 EAKRERKAAKTLAIITGAFVI-CWLPFFMMALLLPLCQTCYISDYLASFFLWLGYFNSTLNPVIYTVFSPEFRQAFKRIL 332 (342)
Q Consensus 254 ~~~~~~k~~k~l~~v~~~f~~-cw~P~~i~~~~~~~~~~~~~~~~~~~~~~~l~~~ns~vNPiiY~~~n~~fR~~~~~ll 332 (342)
..+.++|..+.+.+=+++.++ .-.|.....+...+.. .......+...+...+..+.-++..+.++.||+.+++++
T Consensus 224 T~k~Qkkfl~~l~iQ~~ip~~~l~~P~~~~~~~~~~~~---~~q~~~n~~~~~~~~HG~~sti~mi~~~~pYR~~~~~~~ 300 (302)
T PF10318_consen 224 TRKMQKKFLIALIIQVLIPFIFLFIPLIYFIISIIFGY---YNQALNNISFIIISLHGIASTIVMILVHKPYRKFLLSLF 300 (302)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhccc---cccccchHHHHHHHhccHHHHHHHhhccHHHHHHHHHHh
Confidence 446677888777665555444 4468665555443332 224455677788889999999999999999999999999
Q ss_pred hc
Q psy4354 333 CG 334 (342)
Q Consensus 333 ~~ 334 (342)
||
T Consensus 301 ~~ 302 (302)
T PF10318_consen 301 RC 302 (302)
T ss_pred cC
Confidence 87
No 22
>PF04789 DUF621: Protein of unknown function (DUF621); InterPro: IPR006874 This is a conserved region found in uncharacterised proteins from Caenorhabditis elegans, and is noted to have possible G-protein-coupled receptor-like activity.
Probab=97.96 E-value=0.001 Score=55.39 Aligned_cols=115 Identities=18% Similarity=0.278 Sum_probs=76.3
Q ss_pred hHHHhhHHHHHhHHHHhhheeeceeEEEeeccc-ccccchhHHHHHHHHHHHHHHHHHHHhHhcccCCCccc--ccCccc
Q psy4354 6 FTGVLSHYNVDMLGYSTVIMFVDKYWAVTNVDY-IHTRNANRIIMMIVVVWSVAFIVSLAPQLGWKDPEYMD--RINQQK 82 (342)
Q Consensus 6 ~~~~l~~~~~~~S~~~l~~IaidRY~aI~~~~y-~~~~t~r~~~~~i~~~W~~s~~~~~p~~~~~~~~~~~~--~~~~~~ 82 (342)
+..++-.++.++-.++-..||+.|....-..++ .+.....-..+.+..+|++++..-....-.--.+.|.. ......
T Consensus 92 ~~Sy~idf~h~siLfsNlviaIqR~fVFFfr~~t~~~F~s~~iyiWL~~vWils~~v~~~l~~~nC~Y~y~~~~~~y~L~ 171 (305)
T PF04789_consen 92 FMSYLIDFCHYSILFSNLVIAIQRFFVFFFRNLTDKVFESPVIYIWLLLVWILSIGVVYSLMSNNCRYRYNKWSKHYQLN 171 (305)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhheeeeehhhhhhhccchHHHHHHHHHHHHHHHHHHHHHhCCCeeecccccccEEEE
Confidence 456667778888889999999999987776222 23445667889999999999987663332111222222 222456
Q ss_pred eecc-------CCchhHHHHHHHHhHHHHHHHHHHHHHHHHHHHHH
Q psy4354 83 CMVS-------QDVGYQIFATCSTFYVPLLVILVLYWKIYQTARKR 121 (342)
Q Consensus 83 C~~~-------~~~~~~~~~~~~~~~ip~~~i~~~y~~I~~~vr~~ 121 (342)
|... ......+.-.++-+.+|+ +++++|+.|..++-..
T Consensus 172 C~~~~~~v~~~~P~~IqiiE~ilQ~~IPi-~Il~iYiAIIiKI~~M 216 (305)
T PF04789_consen 172 CETCNSVVDISPPRGIQIIEIILQFGIPI-FILVIYIAIIIKIIKM 216 (305)
T ss_pred cCCCCeeEeeCCCCchhHHHHHHHHhHHH-HHHHHHHHHHHHHHHH
Confidence 6553 224456777788889995 7778888888776643
No 23
>PF03125 Sre: C. elegans Sre G protein-coupled chemoreceptor; InterPro: IPR004151 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf'. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class e (Sre) from the Sra superfamily.; GO: 0004888 transmembrane signaling receptor activity, 0007606 sensory perception of chemical stimulus, 0016021 integral to membrane
Probab=97.91 E-value=0.00057 Score=61.63 Aligned_cols=60 Identities=17% Similarity=0.217 Sum_probs=41.8
Q ss_pred hhhHHHhhHHHHHhHHHHhhheeeceeEEEee-cccccccchhHHHHHHHHHHHHHHHHHH
Q psy4354 4 LLFTGVLSHYNVDMLGYSTVIMFVDKYWAVTN-VDYIHTRNANRIIMMIVVVWSVAFIVSL 63 (342)
Q Consensus 4 ~~~~~~l~~~~~~~S~~~l~~IaidRY~aI~~-~~y~~~~t~r~~~~~i~~~W~~s~~~~~ 63 (342)
.++.+++.....+...+.+.++++||.+|-.+ -.|+++....-..+++.+.-++++..+.
T Consensus 123 l~~~~~l~~~y~~~~~~~~~~~~iER~~AT~~i~dYEk~~R~~I~~~l~~~~~~~~~~~s~ 183 (365)
T PF03125_consen 123 LFIGGFLRWHYMFSAIFCLLAIVIERCFATYFIKDYEKKSRRWISILLIIFSQIFSIIFSY 183 (365)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCchHHHHHHHHHHHHHHHHHH
Confidence 46677888777788889999999999999999 7787664333334444444445555543
No 24
>PF02101 Ocular_alb: Ocular albinism type 1 protein; InterPro: IPR001414 Ocular albinism type 1 (OA1) is an X-linked disorder characterised by severe impairment of visual acuity, retinal hypopigmentation and the presence of macromelanosomes. A novel transcript from the OA1 critical region is expressed in high levels in RNA samples from retina and from melanoma and encodes a potential integral membrane protein. This protein is of unknown function but is known to bind heterotrimeric G proteins.; GO: 0016020 membrane
Probab=97.86 E-value=0.00074 Score=58.96 Aligned_cols=96 Identities=16% Similarity=0.187 Sum_probs=49.9
Q ss_pred HHHhhHHHHHhHHHHhhheeeceeEEEee-cccccccchhHHHHHHHHHHHHHHHHHHHhHhcccCCCcccccCccceec
Q psy4354 7 TGVLSHYNVDMLGYSTVIMFVDKYWAVTN-VDYIHTRNANRIIMMIVVVWSVAFIVSLAPQLGWKDPEYMDRINQQKCMV 85 (342)
Q Consensus 7 ~~~l~~~~~~~S~~~l~~IaidRY~aI~~-~~y~~~~t~r~~~~~i~~~W~~s~~~~~p~~~~~~~~~~~~~~~~~~C~~ 85 (342)
...+++ +..++.|=+.+.|+|=|+.+.+ .-.. ...+==..+|+++.+++.--+..... ++ ...|..
T Consensus 120 s~WIq~-fYsAtfwWtfcYAVDv~Lv~~~~ag~~------~~~lYH~~aWgl~~lL~~~Gl~~Ly~----Ps--~~~Ce~ 186 (405)
T PF02101_consen 120 SMWIQL-FYSATFWWTFCYAVDVYLVIRRSAGRS------TIWLYHMMAWGLPALLCAEGLAMLYY----PS--ISRCES 186 (405)
T ss_pred HHHHHH-HHHHHHHHHHHHHHHHhheeeccCCCc------chhHHHHHHHHHHHHHHHhccceeec----CC--HHhhhh
Confidence 344443 4555555566679999999987 3211 12233456798888877633333322 11 335754
Q ss_pred cCCchhHHHHHHHHhHHHHHHHHHHHHHHHHHH
Q psy4354 86 SQDVGYQIFATCSTFYVPLLVILVLYWKIYQTA 118 (342)
Q Consensus 86 ~~~~~~~~~~~~~~~~ip~~~i~~~y~~I~~~v 118 (342)
..+.. +...+. .++|++++++.+=.+|++.
T Consensus 187 ~l~~a--lphYvt-tY~PlllVlvaNPiLy~~a 216 (405)
T PF02101_consen 187 GLEHA--LPHYVT-TYIPLLLVLVANPILYIKA 216 (405)
T ss_pred hhhhc--chhHHH-HHHHHHHHHHhccHHHHHH
Confidence 32222 222222 3567666665554444443
No 25
>PF10326 7TM_GPCR_Str: Serpentine type 7TM GPCR chemoreceptor Str; InterPro: IPR019428 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf'. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class r (Str) from the Str superfamily. Almost a quarter (22.5%) of str and srj family genes and pseudogenes in C. elegans appear to have been newly formed by gene duplications since the species split.
Probab=97.73 E-value=5.6e-06 Score=72.95 Aligned_cols=73 Identities=18% Similarity=0.348 Sum_probs=52.9
Q ss_pred HHHHHhhHHHHHHH-HHHHHHHhhhhHHHHHHHHhhcCCCCchhHHHHHHHHHHhhcccccchhhhccChhHHHHHH
Q psy4354 254 EAKRERKAAKTLAI-ITGAFVICWLPFFMMALLLPLCQTCYISDYLASFFLWLGYFNSTLNPVIYTVFSPEFRQAFK 329 (342)
Q Consensus 254 ~~~~~~k~~k~l~~-v~~~f~~cw~P~~i~~~~~~~~~~~~~~~~~~~~~~~l~~~ns~vNPiiY~~~n~~fR~~~~ 329 (342)
.++-+++..+++++ ..+-.++.++|..+..+...+.-+. .....+...+..+-+++||++-.+.-++||++++
T Consensus 234 ~~~lq~QLf~aLv~Qt~iP~i~~~~P~~~~~~~p~~~i~~---~~~~~~~~~~~~~yP~iDpl~~i~~ik~yR~~i~ 307 (307)
T PF10326_consen 234 TRKLQKQLFKALVIQTIIPFIFMYIPVFIVFILPFFGIDL---GFFSNIISILISLYPAIDPLPVIFIIKDYRKAIK 307 (307)
T ss_pred hHHHHHHHHHHHHHHhhhhheeeecchhheeeeeccCCCC---CccccHhhhhEEEEeehhhheeeEeeHHHHHhhC
Confidence 34557777777665 4566788899988877765554332 2233455667888999999999999999999874
No 26
>PF10327 7TM_GPCR_Sri: Serpentine type 7TM GPCR chemoreceptor Sri; InterPro: IPR019429 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf'. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents Sri, which is part of the Str superfamily of chemoreceptors.
Probab=97.68 E-value=0.0017 Score=56.97 Aligned_cols=72 Identities=17% Similarity=0.312 Sum_probs=58.2
Q ss_pred HHHHHhhHHHHHHHHHHHHHHhhhhHHHHHHHHhhcCCCCchhHHHHHHHHHHhhcccccchhhhccChhHHHH
Q psy4354 254 EAKRERKAAKTLAIITGAFVICWLPFFMMALLLPLCQTCYISDYLASFFLWLGYFNSTLNPVIYTVFSPEFRQA 327 (342)
Q Consensus 254 ~~~~~~k~~k~l~~v~~~f~~cw~P~~i~~~~~~~~~~~~~~~~~~~~~~~l~~~ns~vNPiiY~~~n~~fR~~ 327 (342)
..++++++.+.+++=.++-.+|-+|..+...+..+..+ ....+.++...+....|.+|-++-.+.++.||+-
T Consensus 231 ty~kHk~av~SLi~Q~~~~~i~~~P~~~~~~~~~~~~~--~~q~i~~~~~~~f~~HS~~n~ivli~t~ppYR~f 302 (303)
T PF10327_consen 231 TYQKHKEAVRSLIAQFATSSICILPPFIFVVVVIFEFE--DAQVISEICLAIFSSHSSVNMIVLIITTPPYRKF 302 (303)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhheecCC--CcHHHHHHHHHHHHHhhHhhheeeeEcCcchhhc
Confidence 55789999999999999999999998776665444322 3456667777888899999999999999999974
No 27
>PF11970 Git3_C: G protein-coupled glucose receptor regulating Gpa2 C-term; InterPro: IPR022596 This entry contains a functionally uncharacterised region belonging to the Git3 G-protein coupled receptor. Git3 is one of six proteins required for glucose-triggered adenylate cyclase activation, and is a G protein-coupled receptor responsible for the activation of adenylate cyclase through Gpa2 - heterotrimeric G protein alpha subunit, part of the glucose-detection pathway. Git3 contains seven predicted transmembrane domains, a third cytoplasmic loop and a cytoplasmic tail. This family is the conserved C-terminal domain of the member proteins.
Probab=97.57 E-value=0.00034 Score=47.01 Aligned_cols=71 Identities=15% Similarity=0.227 Sum_probs=55.3
Q ss_pred HHHHHhhHHHHHHHHHHHHHHhhh-hHHHHHHHHhhcCCCCchhHHHHHHHHHHhhcccccchhhhccChhH
Q psy4354 254 EAKRERKAAKTLAIITGAFVICWL-PFFMMALLLPLCQTCYISDYLASFFLWLGYFNSTLNPVIYTVFSPEF 324 (342)
Q Consensus 254 ~~~~~~k~~k~l~~v~~~f~~cw~-P~~i~~~~~~~~~~~~~~~~~~~~~~~l~~~ns~vNPiiY~~~n~~f 324 (342)
++++=+|.+|.+++-=++|+++|+ |...-.+-...........++..++..+..+|+.+|-++|++.-+.-
T Consensus 3 ~r~~i~r~lr~mfiYP~~Yi~lwlfP~~~~~~~~~~~~~~~p~~~l~~i~~~~~~~~G~VD~lvf~~~erpw 74 (76)
T PF11970_consen 3 RRKRIRRQLRSMFIYPLVYIVLWLFPFAAHRMQYMYEIGHGPSFWLFCIAGFMQPSQGFVDCLVFTLRERPW 74 (76)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCchHHHHHHHHHHHccCHHHhhheeeecccC
Confidence 445667889999999999999999 97655554442233345678888889999999999999999876643
No 28
>PF11710 Git3: G protein-coupled glucose receptor regulating Gpa2; InterPro: IPR023041 This entry contains a functionally uncharacterised region belonging to the Git3 G-protein coupled receptor. Git3 is one of six proteins required for glucose-triggered adenylate cyclase activation, and is a G protein-coupled receptor responsible for the activation of adenylate cyclase through Gpa2 - heterotrimeric G protein alpha subunit, part of the glucose-detection pathway. Git3 contains seven predicted transmembrane domains, a third cytoplasmic loop and a cytoplasmic tail. This is the conserved N-terminal domain of the member proteins.
Probab=97.21 E-value=0.0011 Score=54.14 Aligned_cols=116 Identities=15% Similarity=0.275 Sum_probs=75.8
Q ss_pred hhhHHHhhHHHHHhHHHHhhheeeceeEEEeecccccccc----------hhHHHHHHHHHHHHHHHHHHHhHhcccCCC
Q psy4354 4 LLFTGVLSHYNVDMLGYSTVIMFVDKYWAVTNVDYIHTRN----------ANRIIMMIVVVWSVAFIVSLAPQLGWKDPE 73 (342)
Q Consensus 4 ~~~~~~l~~~~~~~S~~~l~~IaidRY~aI~~~~y~~~~t----------~r~~~~~i~~~W~~s~~~~~p~~~~~~~~~ 73 (342)
|..+|++...+..++-+.+.+||++=|+.|.++.++++.+ +..+..+..+.|++...+ .+.+.+...
T Consensus 74 C~aqGf~~q~g~~~sd~~ilaIAihT~l~v~~~~~~~~~~~~~~~gl~~~~~~v~~~~~~~~~~~~~l---a~i~~~~~~ 150 (201)
T PF11710_consen 74 CQAQGFFLQVGDEASDLWILAIAIHTFLIVFRPNWKRKRSKNVEGGLYPYRYWVWVIWILVPLLLASL---AFIGLGGPG 150 (201)
T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhcCccccccccccccceEEeeeeeehHHHHHHHHHHHH---HHhccccCc
Confidence 5678999999999999999999999999999962222211 112223333444444333 333323333
Q ss_pred cccccCccceeccCCc-hhHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHH
Q psy4354 74 YMDRINQQKCMVSQDV-GYQIFATCSTFYVPLLVILVLYWKIYQTARKRIRR 124 (342)
Q Consensus 74 ~~~~~~~~~C~~~~~~-~~~~~~~~~~~~ip~~~i~~~y~~I~~~vr~~~~~ 124 (342)
|.+. ...|-.+.+. .+.++..-+--++-...++++|..|+..+|++.|+
T Consensus 151 Y~~~--g~WCWi~~~~~~~Rl~l~y~~~~~~~~~~i~iY~~if~~lrr~~~~ 200 (201)
T PF11710_consen 151 YGPA--GAWCWIPSRYEWYRLWLHYIWRFIIIFAIIIIYIAIFFYLRRRIRR 200 (201)
T ss_pred cccc--CcEEEECCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc
Confidence 4333 7889886443 35555554545566678999999999999998764
No 29
>PF03402 V1R: Vomeronasal organ pheromone receptor family, V1R; InterPro: IPR004072 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The rhodopsin-like GPCRs themselves represent a widespread protein family that includes hormone, neurotransmitter and light receptors, all of which transduce extracellular signals through interaction with guanine nucleotide-binding (G) proteins. Although their activating ligands vary widely in structure and character, the amino acid sequences of the receptors are very similar and are believed to adopt a common structural framework comprising 7 transmembrane (TM) helices [, , ]. Pheromones have evolved in all animal phyla, to signal sex and dominance status, and are responsible for stereotypical social and sexual behaviour among members of the same species. In mammals, these chemical signals are believed to be detected primarily by the vomeronasal organ (VNO), a chemosensory organ located at the base of the nasal septum []. The VNO is present in most amphibia, reptiles and non-primate mammals but is absent in birds, adult catarrhine monkeys and apes []. 
An active role for the human VNO in the detection of pheromones is disputed; the VNO is clearly present in the foetus but appears to be atrophied or absent in adults. Three distinct families of putative pheromone receptors have been identified in the vomeronasal organ (V1Rs, V2Rs and V3Rs). All are G protein-coupled receptors but are only distantly related to the receptors of the main olfactory system, highlighting their different role []. The V1 receptors share between 50 and 90% sequence identity but have little similarity to other families of G protein-coupled receptors. They appear to be distantly related to the mammalian T2R bitter taste receptors and the rhodopsin-like GPCRs []. In rat, the family comprises 30-40 genes. These are expressed in the apical regions of the VNO, in neurons expressing Gi2. Coupling of the receptors to this protein mediates inositol trisphosphate signalling []. A number of human V1 receptor homologues have also been found. The majority of these human sequences are pseudogenes [] but an apparently functional receptor has been identified that is expressed in the human olfactory system [].; GO: 0016503 pheromone receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane
Probab=97.03 E-value=0.00054 Score=58.04 Aligned_cols=70 Identities=17% Similarity=0.265 Sum_probs=54.0
Q ss_pred HHHhhHHHHHHHHHHHHHHhhhhHHHHHHHHhhcCCCCchhHHHHHHHHHHhhcccccchhhhccChhHHHHH
Q psy4354 256 KRERKAAKTLAIITGAFVICWLPFFMMALLLPLCQTCYISDYLASFFLWLGYFNSTLNPVIYTVFSPEFRQAF 328 (342)
Q Consensus 256 ~~~~k~~k~l~~v~~~f~~cw~P~~i~~~~~~~~~~~~~~~~~~~~~~~l~~~ns~vNPiiY~~~n~~fR~~~ 328 (342)
..|.|+++++++.+..|++.+.--.++.+......+ ..++..+..++...-+.+-||+-...++++.+-+
T Consensus 194 SpE~RAtktILlLVs~FV~fY~l~si~~~~~~~~~~---~~~~~~~~~~ls~cfptisPfvLI~~d~~i~~~~ 263 (265)
T PF03402_consen 194 SPETRATKTILLLVSTFVSFYGLSSILFIYLTSFKN---SPWLLNISVFLSSCFPTISPFVLISSDKRIIKFL 263 (265)
T ss_pred ChhHHHhCeEeeHHHHHHHHHhHHHHHHHHHHHhcC---CcceeEHHHHHhHHhHhhChHHhhccCchHHHHh
Confidence 358999999999999999999998777655433332 2334456677888889999999999999887654
No 30
>PF02117 7TM_GPCR_Sra: Serpentine type 7TM GPCR chemoreceptor Sra; InterPro: IPR000344 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class a (Sra) from the Sra superfamily []. 
Sra receptors contain 6-7 hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures.; GO: 0004888 transmembrane signaling receptor activity, 0007606 sensory perception of chemical stimulus, 0016021 integral to membrane
Probab=97.03 E-value=0.018 Score=50.99 Aligned_cols=81 Identities=12% Similarity=0.134 Sum_probs=45.6
Q ss_pred hhHHHhhHHHHHhHHHHhhheeeceeEEEeecccccccchhHHHHHHHHHHHHHHHHHHHhHhcccCCCcccccCcccee
Q psy4354 5 LFTGVLSHYNVDMLGYSTVIMFVDKYWAVTNVDYIHTRNANRIIMMIVVVWSVAFIVSLAPQLGWKDPEYMDRINQQKCM 84 (342)
Q Consensus 5 ~~~~~l~~~~~~~S~~~l~~IaidRY~aI~~~~y~~~~t~r~~~~~i~~~W~~s~~~~~p~~~~~~~~~~~~~~~~~~C~ 84 (342)
.+...+..++....++.-.++.+||.+|-..|.|..+.+.....++...+-++++... .+..|++ +.+.....|.
T Consensus 102 ~~~~~~~~~~~~~~~~~q~aL~idRl~at~~~~~~~~~~~~~g~~l~i~~li~s~~~~--~~i~~~~---p~~gyv~~C~ 176 (328)
T PF02117_consen 102 FPYYYFYYFTNSGMIFIQFALTIDRLLATFFPKYYSKKSYIIGIILSILVLILSFITG--FIIYWDD---PLDGYVPSCF 176 (328)
T ss_pred eeeehHHHHHHHHHHHHHHHHHHHHHHHHhchhhhhhhhHHHHHHHHHHHHHHHHHHe--eEEEeCC---CCcccccccc
Confidence 3444555666777788999999999999877766655444443333333333333332 2223332 1111256887
Q ss_pred ccCCch
Q psy4354 85 VSQDVG 90 (342)
Q Consensus 85 ~~~~~~ 90 (342)
......
T Consensus 177 ~~p~~s 182 (328)
T PF02117_consen 177 YPPKNS 182 (328)
T ss_pred CCChhH
Confidence 764443
No 31
>PF10322 7TM_GPCR_Sru: Serpentine type 7TM GPCR chemoreceptor Sru; InterPro: IPR003839 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class u (Sru) from the Srg superfamily [].
Probab=96.92 E-value=0.025 Score=49.40 Aligned_cols=115 Identities=8% Similarity=-0.141 Sum_probs=71.8
Q ss_pred HHhhHHHHHhHHHHhhheeeceeEEEeecccccccchhHHHHHHHHHHHHHHHHHHHhHhcccC--CCcccccCcccee-
Q psy4354 8 GVLSHYNVDMLGYSTVIMFVDKYWAVTNVDYIHTRNANRIIMMIVVVWSVAFIVSLAPQLGWKD--PEYMDRINQQKCM- 84 (342)
Q Consensus 8 ~~l~~~~~~~S~~~l~~IaidRY~aI~~~~y~~~~t~r~~~~~i~~~W~~s~~~~~p~~~~~~~--~~~~~~~~~~~C~- 84 (342)
-++++++.+.+...-++.++=|-+.+..|..+....++-..+.+-.+.+.++++++|.+.+.+. +-..+-.-+..-.
T Consensus 107 ~~~~~~~~Y~s~lf~~Lfc~~Rl~il~~p~~~~~i~~~i~~~~~P~i~i~p~~~~f~~~pa~G~C~Ql~~Pf~fGAI~I~ 186 (307)
T PF10322_consen 107 VFFYYYFNYSSMLFPVLFCLLRLIILYSPRNHKKICRKIFRIWIPFIFIYPFCFTFPMFPALGYCRQLDPPFPFGAIIIT 186 (307)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHheeCccchhHHHHhHHHHHHHHHHHHHHHHHHHccCCcEEEEeCCCCCCCCEEEEE
Confidence 4556677888999999999999999999766666777777788888888888888865443321 1000000011110
Q ss_pred --ccCCchhHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHH
Q psy4354 85 --VSQDVGYQIFATCSTFYVPLLVILVLYWKIYQTARKRI 122 (342)
Q Consensus 85 --~~~~~~~~~~~~~~~~~ip~~~i~~~y~~I~~~vr~~~ 122 (342)
..+.........++...+-+++++++++..+.++|+.+
T Consensus 187 ~~~~~~~~~~~~~~l~~s~~~~~~iii~N~lm~~Klr~~k 226 (307)
T PF10322_consen 187 STGSWFNIRNSIFHLFFSIFWMISIIILNILMFFKLRKLK 226 (307)
T ss_pred EEcccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 01111112222333444555788889999999999865
No 32
>PF10319 7TM_GPCR_Srj: Serpentine type 7TM GPCR chemoreceptor Srj; InterPro: IPR019423 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class j (Srj) from the Str superfamily [, ]. The Srj family is designated as the out-group based on its location in preliminary phylogenetic analyses of the entire superfamily [].
Probab=95.47 E-value=0.63 Score=40.43 Aligned_cols=72 Identities=15% Similarity=0.161 Sum_probs=55.6
Q ss_pred HHHHHhhHHHHHHHHHH-HHHHhhhhHHHHHHHHhhcCCCCchhHHHHHHHHHHhhcccccchhhhccChhHHHH
Q psy4354 254 EAKRERKAAKTLAIITG-AFVICWLPFFMMALLLPLCQTCYISDYLASFFLWLGYFNSTLNPVIYTVFSPEFRQA 327 (342)
Q Consensus 254 ~~~~~~k~~k~l~~v~~-~f~~cw~P~~i~~~~~~~~~~~~~~~~~~~~~~~l~~~ns~vNPiiY~~~n~~fR~~ 327 (342)
..+.+++..|.+++=++ --++|..|-++.+....+.-+ ...+...+.......=+.++|+.-.+.=+.||+.
T Consensus 237 T~~lq~qL~~AL~vQT~IPi~vsf~Pc~~~wy~pif~i~--~~~~~n~~~~iAls~FPf~DPlAii~~lP~~R~r 309 (310)
T PF10319_consen 237 TKRLQRQLFKALIVQTVIPICVSFSPCVLSWYGPIFGID--LGRWNNYFSVIALSAFPFLDPLAIILCLPAFRNR 309 (310)
T ss_pred HHHHHHHHHHHHHHHHHhHHHHhhccHHHHHhHHHHcCC--hhHHHHHHHHHHHHHccccCchHhheecHHhhcc
Confidence 44667888888877544 468899998888876666543 5667777777777778899999999999999975
No 33
>KOG4564|consensus
Probab=95.42 E-value=0.032 Score=51.14 Aligned_cols=71 Identities=21% Similarity=0.324 Sum_probs=46.9
Q ss_pred HhhHHHHHHHHHHHHHHhhhhHHHHHHHHhhcCCCC-chhHHHHHHHHHHhhcccccchhhhccChhHHHHHHHHHhcC
Q psy4354 258 ERKAAKTLAIITGAFVICWLPFFMMALLLPLCQTCY-ISDYLASFFLWLGYFNSTLNPVIYTVFSPEFRQAFKRILCGS 335 (342)
Q Consensus 258 ~~k~~k~l~~v~~~f~~cw~P~~i~~~~~~~~~~~~-~~~~~~~~~~~l~~~ns~vNPiiY~~~n~~fR~~~~~ll~~~ 335 (342)
-+|++|..++.+=+|.+-++.+ .+.++.. .......+-..|..+...+=-++|+|.|++.|.++||.+.+.
T Consensus 349 y~K~vKaTLvLIPLfGI~~ilf-------~~~P~~~~~~~v~~~~~~~L~SfQGf~VAvlYCFlN~EVq~elrr~W~r~ 420 (473)
T KOG4564|consen 349 YRKLVKATLVLIPLFGIHYILF-------AFRPDEDTLREVYLYFELFLGSFQGFFVAVLYCFLNGEVQAELRRKWSRW 420 (473)
T ss_pred HHHHHHHHHHHHHHcCCeeEEE-------EecCchHHHHHHHHHHHHHHHhccchheehheeecCHHHHHHHHHHHHhc
Confidence 5677777777666665553332 1222221 122333344567888888889999999999999999998443
No 34
>PF10316 7TM_GPCR_Srbc: Serpentine type 7TM GPCR chemoreceptor Srbc ; InterPro: IPR019420 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class b (Srb) from the Sra superfamily []. Srb receptors contain 6-8 hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures. 
Srbc is a solo family amongst the superfamilies of chemoreceptors.
Probab=94.88 E-value=0.27 Score=42.28 Aligned_cols=105 Identities=17% Similarity=0.211 Sum_probs=50.4
Q ss_pred hhHHHHHhHHHHhhheeeceeEEEeec-ccccccchhHHHHHHHHHHHHHHHHHHHhHhcccCC--CcccccCccceecc
Q psy4354 10 LSHYNVDMLGYSTVIMFVDKYWAVTNV-DYIHTRNANRIIMMIVVVWSVAFIVSLAPQLGWKDP--EYMDRINQQKCMVS 86 (342)
Q Consensus 10 l~~~~~~~S~~~l~~IaidRY~aI~~~-~y~~~~t~r~~~~~i~~~W~~s~~~~~p~~~~~~~~--~~~~~~~~~~C~~~ 86 (342)
.......+.......||+||-+|+..| .|++...+--...+++.+-..++.=.. .+++.-+. +..++.....|..+
T Consensus 87 p~~~~~~iR~~l~~~Ia~dR~~A~~fPI~y~~~r~k~~~~~I~~~~~~~~~~d~~-vlf~~C~~~i~~p~~C~~~~C~vn 165 (273)
T PF10316_consen 87 PSSNLGSIRSILALIIALDRVFAVYFPIFYHNYRKKIPNFIIIIIALSYGLFDQY-VLFGFCDFVIDVPPNCVNFGCAVN 165 (273)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHcCCHHHHccCccccHHHHHHHHHHHHHHHHh-HhhhcCCCCCCCCCCCCccCCCCc
Confidence 333444556788899999999999994 444433332222222222222222222 33443332 23344446667664
Q ss_pred CC-chhHHHHHHHHhHHHHHHHHHHHHHHH
Q psy4354 87 QD-VGYQIFATCSTFYVPLLVILVLYWKIY 115 (342)
Q Consensus 87 ~~-~~~~~~~~~~~~~ip~~~i~~~y~~I~ 115 (342)
.- ..|-...-.+.+.+-.+..+.+-.+++
T Consensus 166 ~Cf~~Yw~~~~~i~~~li~~~S~~L~~KLf 195 (273)
T PF10316_consen 166 QCFRQYWLTSEMIIFSLIIILSILLCIKLF 195 (273)
T ss_pred hHHHHHHHhhhhHHHHHHHHHHHHHHHHHH
Confidence 21 223333333444444444444445555
No 35
>PF00002 7tm_2: 7 transmembrane receptor (Secretin family); InterPro: IPR000832 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The secretin-like GPCRs include secretin [], calcitonin [], parathyroid hormone/parathyroid hormone-related peptides [] and vasoactive intestinal peptide [], all of which activate adenylyl cyclase and the phosphatidyl-inositol-calcium pathway. These receptors contain seven transmembrane regions, in a manner reminiscent of the rhodopsins and other receptors believed to interact with G-proteins (however, there is no significant sequence identity between these families; the secretin-like receptors thus bear their own unique '7TM' signature). Their N terminus is probably located on the extracellular side of the membrane and potentially glycosylated. This N-terminal region contains a long conserved region which allows the binding of large peptidic ligands such as glucagon, secretin, VIP and PACAP; this region contains five conserved cysteine residues which could be involved in disulphide bonds. The C-terminal region of these receptors is probably cytoplasmic. Every receptor gene in this family is encoded on multiple exons, and several of these genes are alternatively spliced to yield functionally distinct products.
; GO: 0004930 G-protein coupled receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane; PDB: 3L2J_A 1BL1_A.
Probab=94.46 E-value=0.011 Score=50.12 Aligned_cols=101 Identities=17% Similarity=0.228 Sum_probs=0.0
Q ss_pred hhhHHHhhHHHHHhHHHHhhheeeceeEEEeecccccccchhHHHHHHHHHHHHHHHHHHHhHhcccCCCcccccCccce
Q psy4354 4 LLFTGVLSHYNVDMLGYSTVIMFVDKYWAVTNVDYIHTRNANRIIMMIVVVWSVAFIVSLAPQLGWKDPEYMDRINQQKC 83 (342)
Q Consensus 4 ~~~~~~l~~~~~~~S~~~l~~IaidRY~aI~~~~y~~~~t~r~~~~~i~~~W~~s~~~~~p~~~~~~~~~~~~~~~~~~C 83 (342)
|...+++.+++..++..=+.+.++|=|..+.... ...+++.....+..|.+++++...... .....| +...|
T Consensus 72 C~~~a~~~hy~~la~f~Wm~~~~~~l~~~~~~~~---~~~~~~~~~~~~~gwg~P~~iv~i~~~-~~~~~y----~~~~C 143 (242)
T PF00002_consen 72 CRAIAILLHYFFLASFFWMLVEAFYLYRLLVKVF---NSSRRRFWWYYLIGWGIPALIVVISVA-VNSDGY----GNDNC 143 (242)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred chhhhhHhHHHHHHHHHHHHHHHHHhheeEEEee---cccchhhheeeeeeecCcceeeeeeee-eccccc----ccccc
Confidence 4556777777777787778899999999998731 111445566677889999887774333 221122 23567
Q ss_pred eccCCchhHHHHHHHHhHHHHHHHHHHHHHHHHH
Q psy4354 84 MVSQDVGYQIFATCSTFYVPLLVILVLYWKIYQT 117 (342)
Q Consensus 84 ~~~~~~~~~~~~~~~~~~ip~~~i~~~y~~I~~~ 117 (342)
-.+. ..+.. ..+..|..+++..+..++..
T Consensus 144 Wl~~-~~~~~----~~f~~P~~~~l~in~vi~~~ 172 (242)
T PF00002_consen 144 WLSN-DWGFI----WAFVGPVLIILLINIVIFIL 172 (242)
T ss_dssp ----------------------------------
T ss_pred cccC-CCceE----EEEEecccceecccchhhee
Confidence 3332 22222 22456666555555444433
No 36
>PF02175 7TM_GPCR_Srb: Serpentine type 7TM GPCR chemoreceptor Srb; InterPro: IPR002184 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class b (Srb) from the Sra superfamily []. 
Srb receptors contain 6-8 hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures.; GO: 0004888 transmembrane signaling receptor activity, 0007606 sensory perception of chemical stimulus, 0016021 integral to membrane
Probab=93.70 E-value=0.42 Score=39.45 Aligned_cols=106 Identities=11% Similarity=0.132 Sum_probs=54.3
Q ss_pred hhHHHHHhHHHHhhheeeceeEEEee-cccccccchhHHHHHHHHHHHHHHHHHHHhHhcccCCCcccccCccceeccCC
Q psy4354 10 LSHYNVDMLGYSTVIMFVDKYWAVTN-VDYIHTRNANRIIMMIVVVWSVAFIVSLAPQLGWKDPEYMDRINQQKCMVSQD 88 (342)
Q Consensus 10 l~~~~~~~S~~~l~~IaidRY~aI~~-~~y~~~~t~r~~~~~i~~~W~~s~~~~~p~~~~~~~~~~~~~~~~~~C~~~~~ 88 (342)
...+.+..+++.-..+++|||+|... -+|++..+.-.-.+....+ ++-.+...+.+.+..++++ ....+..+..
T Consensus 91 ~~~flmT~~ml~PigftIERfiAl~~A~~YE~~r~~LGPiL~~~li----~~d~~ii~~iy~dE~F~~~-~iSf~l~P~t 165 (236)
T PF02175_consen 91 TGLFLMTIPMLFPIGFTIERFIALKMAEKYENTRTLLGPILVFILI----LIDFLIIYFIYKDEKFSDP-FISFILIPST 165 (236)
T ss_pred HHHHHHHHHHHHHhhHHHHHHHHHHHHHhhccCceeehhHHHHHHH----HHHHHHHHHhhcCCCCCCC-eEEEEEecCc
Confidence 34456777888899999999999999 8888766544433333222 2111112222333333322 2344444545
Q ss_pred chhHHHHHHHHhHHHHHHHHHHHHHHHHHHHH
Q psy4354 89 VGYQIFATCSTFYVPLLVILVLYWKIYQTARK 120 (342)
Q Consensus 89 ~~~~~~~~~~~~~ip~~~i~~~y~~I~~~vr~ 120 (342)
.+..........+.-=+.-.+++..+++.=++
T Consensus 166 sa~~~n~~~~~Ll~~~i~nl~~n~~Ll~~n~~ 197 (236)
T PF02175_consen 166 SAPKFNIFFWFLLYLNIFNLIFNCILLRQNRR 197 (236)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44444333332222224445555555544443
No 37
>PF01534 Frizzled: Frizzled/Smoothened family membrane region; InterPro: IPR000539 The frizzled (fz) locus of Drosophila coordinates the cytoskeletons of epidermal cells, producing a parallel array of cuticular hairs and bristles [, ]. In fz mutants, the orientation of individual hairs with respect both to their neighbours and to the organism as a whole is altered. In the wild-type wing, all hairs point towards the distal tip []. In the developing wing, fz has 2 functions: it is required for the proximal-distal transmission of an intracellular polarity signal; and it is required for cells to respond to the polarity signal. Fz produces an mRNA that encodes an integral membrane protein with 7 putative transmembrane (TM) domains. This protein should contain both extracellular and cytoplasmic domains, which could function in the transmission and interpretation of polarity information []. This signature is usually found downstream of the Fz domain (IPR000024 from INTERPRO); GO: 0007166 cell surface receptor linked signaling pathway, 0016020 membrane
Probab=93.36 E-value=3.6 Score=36.45 Aligned_cols=77 Identities=10% Similarity=0.149 Sum_probs=48.1
Q ss_pred hHHHhh-HHHHHhHHHHhhheeeceeEEEeecccccccchhHHHHHHHHHHHHHHHHHHHhHhcccCCCcccccCcccee
Q psy4354 6 FTGVLS-HYNVDMLGYSTVIMFVDKYWAVTNVDYIHTRNANRIIMMIVVVWSVAFIVSLAPQLGWKDPEYMDRINQQKCM 84 (342)
Q Consensus 6 ~~~~l~-~~~~~~S~~~l~~IaidRY~aI~~~~y~~~~t~r~~~~~i~~~W~~s~~~~~p~~~~~~~~~~~~~~~~~~C~ 84 (342)
+...+. ++.+.+++| -+.+++.-|++....+-... -.+.....=+++|.++++..+ ..+..+.++.+.- .+.|.
T Consensus 96 ~~F~l~Yyf~mAa~~W-WviLt~~W~lsa~~kw~~e~-i~~~s~yfH~~aW~iP~~~ti-~vL~~~~VdgD~l--tGiC~ 170 (328)
T PF01534_consen 96 VVFLLLYYFGMAASLW-WVILTLTWFLSAGLKWGSEA-IEKKSSYFHLVAWGIPAVLTI-AVLALRKVDGDEL--TGICF 170 (328)
T ss_pred hHHHHHHHHHhHHHHH-HHHHHHHHHHHhhcccCcch-hhhhcchhhhHHhhhhHHHHH-HHHHhcccccccc--cceeE
Confidence 334444 444555555 44558888887776322222 245566778899999999888 5556666555554 77898
Q ss_pred ccC
Q psy4354 85 VSQ 87 (342)
Q Consensus 85 ~~~ 87 (342)
.-.
T Consensus 171 Vg~ 173 (328)
T PF01534_consen 171 VGN 173 (328)
T ss_pred EeC
Confidence 743
No 38
>PF02076 STE3: Pheromone A receptor; InterPro: IPR001499 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). Little is known about the structure and function of the mating factor receptors, STE2 and STE3. It is believed, however, that they are integral membrane proteins that may be involved in the response to mating factors on the cell membrane [, , ]. The amino acid sequences of both receptors contain high proportions of hydrophobic residues grouped into 7 domains, in a manner reminiscent of the rhodopsins and other receptors believed to interact with G-proteins. However, while a similar 3D framework has been proposed to account for this, there is no significant sequence similarity either between STE2 and STE3, or between these and the rhodopsin-type family: the receptors thus bear their own unique '7TM' signatures. The STE3 gene of Saccharomyces cerevisiae (Baker's yeast) is the cell-surface receptor that binds the 13-residue lipopeptide a-factor. Several related fungal pheromone receptor sequences are known: these include pheromone B alpha 1 and B alpha 3, and pheromone B beta 1 receptors from Schizophyllum commune; pheromone receptor 1 from Ustilago hordei; and pheromone receptors 1 and 2 from Ustilago maydis. 
Members of the family share about 20% sequence identity.; GO: 0004932 mating-type factor pheromone receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane
Probab=92.39 E-value=5.4 Score=34.60 Aligned_cols=47 Identities=9% Similarity=0.219 Sum_probs=30.0
Q ss_pred chhHHHHHHHHHHHHHHHHHHHhHhcccCCCcccccCccceeccCCchh
Q psy4354 43 NANRIIMMIVVVWSVAFIVSLAPQLGWKDPEYMDRINQQKCMVSQDVGY 91 (342)
Q Consensus 43 t~r~~~~~i~~~W~~s~~~~~p~~~~~~~~~~~~~~~~~~C~~~~~~~~ 91 (342)
.+|+..+=++.+++++++.....++..+. .|+ -.+...|.......+
T Consensus 100 ~rr~~~~d~~i~~g~Pil~m~l~yivQ~~-Rf~-I~e~~GC~~~~~~s~ 146 (283)
T PF02076_consen 100 KRRRIIIDLLICFGIPILQMALHYIVQGH-RFD-IVEDVGCYPAIYPSW 146 (283)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhccc-cee-EecccCCccCCCCCH
Confidence 36778888999999999988765554332 221 113667877544443
No 39
>PF06681 DUF1182: Protein of unknown function (DUF1182); InterPro: IPR010601 This family consists of several hypothetical proteins of around 360 residues in length and seems to be specific to Caenorhabditis elegans. The function of this family is unknown.
Probab=89.37 E-value=2.7 Score=33.96 Aligned_cols=16 Identities=19% Similarity=0.185 Sum_probs=13.3
Q ss_pred HHhhheeeceeEEEee
Q psy4354 20 YSTVIMFVDKYWAVTN 35 (342)
Q Consensus 20 ~~l~~IaidRY~aI~~ 35 (342)
...+++|+|||+-|+.
T Consensus 133 vip~aVAIyRy~~VV~ 148 (226)
T PF06681_consen 133 VIPVAVAIYRYLIVVL 148 (226)
T ss_pred cchhhhhhhhhheeee
Confidence 3445799999999998
No 40
>PF03383 Serpentine_r_xa: Caenorhabditis serpentine receptor-like protein, class xa; InterPro: IPR005047 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class xa (Srxa), from the Str superfamily [].
Probab=89.03 E-value=0.82 Score=35.30 Aligned_cols=77 Identities=13% Similarity=0.105 Sum_probs=53.3
Q ss_pred hhHHHHHhHHHHhhheeeceeEEEeeccccc-ccchhHHHHHHHHHHHHHHHHHHHhHhcccCCCcccc--cCccceecc
Q psy4354 10 LSHYNVDMLGYSTVIMFVDKYWAVTNVDYIH-TRNANRIIMMIVVVWSVAFIVSLAPQLGWKDPEYMDR--INQQKCMVS 86 (342)
Q Consensus 10 l~~~~~~~S~~~l~~IaidRY~aI~~~~y~~-~~t~r~~~~~i~~~W~~s~~~~~p~~~~~~~~~~~~~--~~~~~C~~~ 86 (342)
+..++.....+..++|+++|-.+|.+|.-+. .+|.+|..+-..+..++.+..-+-|++.--...++.. .-...|.++
T Consensus 50 ~~tf~Yl~plfltvLMti~Ri~iV~~P~~~~~~Fs~~kl~~YC~~i~i~~~i~LlIPy~S~C~vnf~~~~~~f~s~Cap~ 129 (153)
T PF03383_consen 50 FGTFSYLHPLFLTVLMTINRIYIVLFPFGSEIWFSDKKLWIYCGIIAILSFISLLIPYFSDCYVNFDARTFSFVSACAPD 129 (153)
T ss_pred eehHHHHHHHHHHHHHHHhheEEEEecCCCccccccchhHHHHHHHHHHHHHHHHhhcCCCCcEEEEeeeeEEEEccCCC
Confidence 3455666778889999999999999975433 6899999988888888877776667764333222211 115567764
No 41
>PF13853 7tm_4: Olfactory receptor
Probab=83.87 E-value=0.033 Score=42.96 Aligned_cols=75 Identities=19% Similarity=0.213 Sum_probs=46.9
Q ss_pred HHHHHHHHHHHHHHHHHhHhcccCCCcccccCccceec---------c-CCchhHHHHHHHHhHHHHHHHHHHHHHHHHH
Q psy4354 48 IMMIVVVWSVAFIVSLAPQLGWKDPEYMDRINQQKCMV---------S-QDVGYQIFATCSTFYVPLLVILVLYWKIYQT 117 (342)
Q Consensus 48 ~~~i~~~W~~s~~~~~p~~~~~~~~~~~~~~~~~~C~~---------~-~~~~~~~~~~~~~~~ip~~~i~~~y~~I~~~ 117 (342)
..++++.|+.+++..++....-....+.+..+...|+. . .+..+......+..+.|+.++++.|.+|+..
T Consensus 4 ~~l~~~~~~~~~~~~l~~~~~~~~l~~~nii~~f~c~~~ll~LaC~dt~~~~~~~~~~~~~~~~~~~~~Il~SY~~Il~a 83 (144)
T PF13853_consen 4 LLLAAGSWLSGLLNSLPHTLLTLSLCFCNIIHHFCCDPPLLKLACSDTSINEIVGFVVAIFILLGPLLLILFSYIRILRA 83 (144)
T ss_pred ehhhHHHHHHHHHHHHHHHHHHeeCCCCCCCcceeeCHHHhcccCCchhhhheeeecccceeEEEEeeccccceeEEEeh
Confidence 35677789888887776554322222222222444442 1 2233444555566789999999999999999
Q ss_pred HHHHH
Q psy4354 118 ARKRI 122 (342)
Q Consensus 118 vr~~~ 122 (342)
+.|..
T Consensus 84 vlki~ 88 (144)
T PF13853_consen 84 VLKIP 88 (144)
T ss_pred hhccc
Confidence 88754
No 42
>PHA03235 DNA packaging protein UL33; Provisional
Probab=67.79 E-value=27 Score=32.21 Aligned_cols=24 Identities=25% Similarity=0.232 Sum_probs=15.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Q psy4354 101 YVPLLVILVLYWKIYQTARKRIRR 124 (342)
Q Consensus 101 ~ip~~~i~~~y~~I~~~vr~~~~~ 124 (342)
.+-.++=-++|...-...|+..++
T Consensus 296 ~~ns~lNPiIY~~~~~~FRk~~~~ 319 (409)
T PHA03235 296 NLHCLLNPILYAFLGNDFLKRFRQ 319 (409)
T ss_pred HHHHhHhHHHHHHhhHHHHHHHHH
Confidence 344556677787777777766544
No 43
>PF10325 7TM_GPCR_Srz: Serpentine type 7TM GPCR chemoreceptor Srz; InterPro: IPR018817 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class z (Srz), a solo family amongst the superfamilies of chemoreceptors [, ]. The genes encoding Srz appear to be under strong adaptive evolutionary pressure [].
Probab=58.55 E-value=28 Score=29.74 Aligned_cols=103 Identities=12% Similarity=0.154 Sum_probs=51.6
Q ss_pred HhHHHHhhheeeceeEEEeeccccccc--chhHHHHHHHHHHHHHHHHHHHhHhcccCCCcccccCccceeccCCchhHH
Q psy4354 16 DMLGYSTVIMFVDKYWAVTNVDYIHTR--NANRIIMMIVVVWSVAFIVSLAPQLGWKDPEYMDRINQQKCMVSQDVGYQI 93 (342)
Q Consensus 16 ~~S~~~l~~IaidRY~aI~~~~y~~~~--t~r~~~~~i~~~W~~s~~~~~p~~~~~~~~~~~~~~~~~~C~~~~~~~~~~ 93 (342)
.+--+.+..+|++|++---.|..++.. +.+.....+..++++.++-.+. .+.+......++ ...........+..
T Consensus 94 ~v~~lllsLLAIqRFllyFfP~~Ek~v~~~~k~~~~~I~~lY~~~~~k~i~-~~~~~~~~~~~~--~~~~~~~~~~~~~~ 170 (267)
T PF10325_consen 94 QVFHLLLSLLAIQRFLLYFFPSSEKYVNFSQKNIKKIIWFLYIFFILKDIV-FFIWYFISFNNE--STEEIETFSYIYVI 170 (267)
T ss_pred HHHHHHHHHHHHHHHHHHhCCchhhhhhhhhhhHHHHHHHHHHHHHHHHHH-HHHHHHhhcccc--cchhhhHHHHHHHH
Confidence 333467888999999877777655544 4555555555555555444331 111111111110 00000111111211
Q ss_pred HHHHHHhHHHHHHHHHHHHHHHHHHHHHHH
Q psy4354 94 FATCSTFYVPLLVILVLYWKIYQTARKRIR 123 (342)
Q Consensus 94 ~~~~~~~~ip~~~i~~~y~~I~~~vr~~~~ 123 (342)
.. +.. -+-+.+...+|.-|...+||.++
T Consensus 171 ~~-~~~-~~ll~~S~lLYIPI~isirK~~~ 198 (267)
T PF10325_consen 171 FY-IIL-NILLFLSALLYIPIFISIRKLSH 198 (267)
T ss_pred HH-HHh-hHHHHHHHHHHHHHHHHHHHhhc
Confidence 11 111 23345777899999999998764
No 44
>PF09889 DUF2116: Uncharacterized protein containing a Zn-ribbon (DUF2116); InterPro: IPR019216 This entry contains various hypothetical prokaryotic proteins whose functions are unknown. They contain a conserved zinc ribbon motif in the N-terminal part and a predicted transmembrane segment in the C-terminal part.
Probab=58.42 E-value=23 Score=22.45 Aligned_cols=23 Identities=30% Similarity=0.401 Sum_probs=13.7
Q ss_pred HHHHhhHHHHHHHHHHHHHHhhh
Q psy4354 255 AKRERKAAKTLAIITGAFVICWL 277 (342)
Q Consensus 255 ~~~~~k~~k~l~~v~~~f~~cw~ 277 (342)
+++.+|.-.+++++.+++++-|+
T Consensus 32 qk~~~~~~~i~~~~~i~~l~v~~ 54 (59)
T PF09889_consen 32 QKRMRKTQYIFFGIFILFLAVWI 54 (59)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 34445566666666666666554
No 45
>KOG2575|consensus
Probab=48.39 E-value=20 Score=32.40 Aligned_cols=26 Identities=35% Similarity=0.713 Sum_probs=22.1
Q ss_pred HHhhHHHHHHHHHHHHHHhhhhHHHH
Q psy4354 257 RERKAAKTLAIITGAFVICWLPFFMM 282 (342)
Q Consensus 257 ~~~k~~k~l~~v~~~f~~cw~P~~i~ 282 (342)
.-.+.++.-++|++.|.++|+|+...
T Consensus 253 ~f~ri~~ia~~Vv~TF~iiw~P~~~~ 278 (510)
T KOG2575|consen 253 SFARIIKIALAVVGTFVIIWLPFLLS 278 (510)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 35788999999999999999997643
No 46
>COG3924 Predicted membrane protein [Function unknown]
Probab=43.70 E-value=84 Score=20.69 Aligned_cols=30 Identities=20% Similarity=0.435 Sum_probs=22.2
Q ss_pred hhHHHHHHHHhHHHHHHHHHHHHHHHHHHH
Q psy4354 90 GYQIFATCSTFYVPLLVILVLYWKIYQTAR 119 (342)
Q Consensus 90 ~~~~~~~~~~~~ip~~~i~~~y~~I~~~vr 119 (342)
.+-.+.-+..+.+|++.+.+||..|-...|
T Consensus 41 gfP~WFE~aCi~lPllFi~l~~~mvkfif~ 70 (80)
T COG3924 41 GFPLWFEMACILLPLLFIVLCWAMVKFIFR 70 (80)
T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 345555666778999999999988765554
No 47
>PRK04989 psbM photosystem II reaction center protein M; Provisional
Probab=43.19 E-value=39 Score=18.72 Aligned_cols=22 Identities=23% Similarity=0.457 Sum_probs=15.9
Q ss_pred HHHHHHHHhHHHHHHHHHHHHH
Q psy4354 92 QIFATCSTFYVPLLVILVLYWK 113 (342)
Q Consensus 92 ~~~~~~~~~~ip~~~i~~~y~~ 113 (342)
......++.++|.+.++++|.+
T Consensus 7 gfiAt~Lfi~iPt~FLlilYvq 28 (35)
T PRK04989 7 GFVASLLFVLVPTVFLIILYIQ 28 (35)
T ss_pred HHHHHHHHHHHHHHHHHHHhee
Confidence 3444566778899988888864
No 48
>PF06072 Herpes_US9: Alphaherpesvirus tegument protein US9; InterPro: IPR009278 This family consists of several US9 and related proteins from the Alphaherpesviruses. The function of the US9 protein is unknown although in Bovine herpesvirus 5 Us9 is essential for the anterograde spread of the virus from the olfactory mucosa to the bulb [].; GO: 0019033 viral tegument
Probab=41.44 E-value=91 Score=19.70 Aligned_cols=28 Identities=21% Similarity=0.265 Sum_probs=15.2
Q ss_pred HHHHhhHHHHHHHHHHHH-HHhhhhHHHH
Q psy4354 255 AKRERKAAKTLAIITGAF-VICWLPFFMM 282 (342)
Q Consensus 255 ~~~~~k~~k~l~~v~~~f-~~cw~P~~i~ 282 (342)
.++++|...+.+++++++ ++|-+-..+-
T Consensus 25 ~r~RrRrc~~~v~~v~~~~~~c~~S~~lG 53 (60)
T PF06072_consen 25 SRRRRRRCRLAVAIVFAVVALCVLSGGLG 53 (60)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344556666555555555 6776554433
No 49
>TIGR03038 PS_II_psbM photosystem II reaction center protein PsbM. Members of this protein family are the photosystem II reaction center M protein, product of the psbM gene, in Cyanobacteria and their derived organelles in plants. This model resembles Pfam model pfam05151 but has cutoffs set to avoid false-positive matches to similar (not necessarily homologous) sequences in species that are not photosynthetic.
Probab=41.02 E-value=47 Score=18.11 Aligned_cols=22 Identities=32% Similarity=0.517 Sum_probs=15.7
Q ss_pred HHHHHHHHhHHHHHHHHHHHHH
Q psy4354 92 QIFATCSTFYVPLLVILVLYWK 113 (342)
Q Consensus 92 ~~~~~~~~~~ip~~~i~~~y~~ 113 (342)
......++..+|.+..+++|.+
T Consensus 7 ~fiAt~Lfi~iPt~FLiilYvq 28 (33)
T TIGR03038 7 GFIATLLFILVPTVFLLILYIQ 28 (33)
T ss_pred HHHHHHHHHHHHHHHHHHHhee
Confidence 3444556678899888888864
No 50
>PHA01815 hypothetical protein
Probab=40.69 E-value=76 Score=18.62 Aligned_cols=23 Identities=13% Similarity=0.359 Sum_probs=14.5
Q ss_pred HHHHHHHhHHHHHHHHHHHHHHH
Q psy4354 93 IFATCSTFYVPLLVILVLYWKIY 115 (342)
Q Consensus 93 ~~~~~~~~~ip~~~i~~~y~~I~ 115 (342)
++.+.+.|++.+++...+|+++-
T Consensus 8 ~ivfllaflitliilmt~~irvs 30 (55)
T PHA01815 8 AIVFLLAFLITLIILMTLHIRVS 30 (55)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 34455566677777777776654
No 51
>COG4665 FcbT2 TRAP-type mannitol/chloroaromatic compound transport system, small permease component [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=39.31 E-value=1.4e+02 Score=23.56 Aligned_cols=32 Identities=9% Similarity=0.136 Sum_probs=22.1
Q ss_pred hhHHHHHHHHhHHHHHHHHHHHHHHHHHHHHH
Q psy4354 90 GYQIFATCSTFYVPLLVILVLYWKIYQTARKR 121 (342)
Q Consensus 90 ~~~~~~~~~~~~ip~~~i~~~y~~I~~~vr~~ 121 (342)
.+.=+...++|++|++++++.|+.=+....-+
T Consensus 91 a~vDllGtifFLlPfc~l~iy~~~~~~~~S~~ 122 (182)
T COG4665 91 AWVDLLGTIFFLLPFCLLVIYLSWPYVALSWA 122 (182)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHccHHHHHHHH
Confidence 34444556678999999988887776655543
No 52
>PF15086 UPF0542: Uncharacterised protein family UPF0542
Probab=39.15 E-value=1.1e+02 Score=20.16 Aligned_cols=36 Identities=28% Similarity=0.588 Sum_probs=23.8
Q ss_pred hHHHHHHHHhHHHHH-HHHHHHHHHHHHHHHHHHHhh
Q psy4354 91 YQIFATCSTFYVPLL-VILVLYWKIYQTARKRIRRRR 126 (342)
Q Consensus 91 ~~~~~~~~~~~ip~~-~i~~~y~~I~~~vr~~~~~~~ 126 (342)
|..+..++..+.|+. +..+|-+++.+.+.++.+...
T Consensus 20 ~~Fl~~vll~LtPlfiisa~lSwkLaK~ie~~ere~K 56 (74)
T PF15086_consen 20 YEFLTTVLLILTPLFIISAVLSWKLAKAIEKEEREKK 56 (74)
T ss_pred HHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 555566667788874 556677788888876655433
No 53
>CHL00080 psbM photosystem II protein M
Probab=38.85 E-value=48 Score=18.18 Aligned_cols=22 Identities=32% Similarity=0.510 Sum_probs=15.8
Q ss_pred HHHHHHHHhHHHHHHHHHHHHH
Q psy4354 92 QIFATCSTFYVPLLVILVLYWK 113 (342)
Q Consensus 92 ~~~~~~~~~~ip~~~i~~~y~~ 113 (342)
......++..+|.+.++++|.+
T Consensus 7 gfiAt~LFi~iPt~FLlilyvk 28 (34)
T CHL00080 7 AFIATALFILVPTAFLLIIYVK 28 (34)
T ss_pred HHHHHHHHHHHHHHHHHHhhee
Confidence 3445566778898888888864
No 54
>TIGR02230 ATPase_gene1 F0F1-ATPase subunit, putative. This model represents a protein found encoded in F1F0-ATPase operons in several genomes, including Methanosarcina barkeri (archaeal) and Chlorobium tepidum (bacterial). It is a small protein (about 100 amino acids) with long hydrophobic stretches and is presumed to be a subunit of the enzyme.
Probab=37.32 E-value=78 Score=22.59 Aligned_cols=59 Identities=14% Similarity=-0.008 Sum_probs=41.4
Q ss_pred HHhhHHHHHHHHHHHHHHhhhhHHHHHHHHhhcCCCC-chhHHHHHHHHHHhhcccccch
Q psy4354 257 RERKAAKTLAIITGAFVICWLPFFMMALLLPLCQTCY-ISDYLASFFLWLGYFNSTLNPV 315 (342)
Q Consensus 257 ~~~k~~k~l~~v~~~f~~cw~P~~i~~~~~~~~~~~~-~~~~~~~~~~~l~~~ns~vNPi 315 (342)
.++.+.+.+..+..+=+..-+|.++..++-.+.+..- ....+...+.+++.+-.|.|-+
T Consensus 34 ~~~~~~~~l~~~g~IG~~~v~pil~G~~lG~WLD~~~~t~~~~tl~~lllGv~~G~~n~w 93 (100)
T TIGR02230 34 ATRSIWEGLGMFGLIGWSVAIPTLLGVAVGIWLDRHYPSPFSWTLTMLIVGVVIGCLNAW 93 (100)
T ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCcHHHHHHHHHHHHHHHHHHH
Confidence 3566888888888888888889888888877765432 2334555556677777777753
No 55
>TIGR02736 cbb3_Q_epsi cytochrome c oxidase, cbb3-type, CcoQ subunit, epsilon-Proteobacterial. Members of this protein family are restricted to the epsilon branch of the Proteobacteria. All members are found in operons containing the other three structural subunits of the cbb3 type of cytochrome c oxidase. These small proteins show remote sequence similarity to the CcoQ subunit in other cytochrome c oxidase systems, so this family is assumed to represent the epsilonproteobacterial variant of CcoQ.
Probab=34.73 E-value=52 Score=20.45 Aligned_cols=26 Identities=31% Similarity=0.548 Sum_probs=19.1
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy4354 100 FYVPLLVILVLYWKIYQTARKRIRRR 125 (342)
Q Consensus 100 ~~ip~~~i~~~y~~I~~~vr~~~~~~ 125 (342)
|++.+.+++++|+-|+..-|++.+..
T Consensus 5 f~~ti~lvv~LYgY~yhLYrsek~G~ 30 (56)
T TIGR02736 5 FAFTLLLVIFLYAYIYHLYRSQKKGE 30 (56)
T ss_pred HHHHHHHHHHHHHHHHHhhhhhcccc
Confidence 45556778889999998888766543
No 56
>COG1862 YajC Preprotein translocase subunit YajC [Intracellular trafficking and secretion]
Probab=34.37 E-value=72 Score=22.61 Aligned_cols=28 Identities=21% Similarity=0.526 Sum_probs=18.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhh
Q psy4354 101 YVPLLVILVLYWKIYQTARKRIRRRRQQ 128 (342)
Q Consensus 101 ~ip~~~i~~~y~~I~~~vr~~~~~~~~~ 128 (342)
+..++++.++|..++|--|++.+.+...
T Consensus 13 l~~vl~~~ifyFli~RPQrKr~K~~~~m 40 (97)
T COG1862 13 LPLVLIFAIFYFLIIRPQRKRMKEHQEL 40 (97)
T ss_pred HHHHHHHHHHHHhhcCHHHHHHHHHHHH
Confidence 3344567778888887777766655543
No 57
>PF05151 PsbM: Photosystem II reaction centre M protein (PsbM); InterPro: IPR007826 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection []. This family represents the low molecular weight transmembrane protein PsbM found in PSII. PsbM is one of the most hydrophobic proteins in the thylakoid membrane. The function of this protein is unknown.; GO: 0015979 photosynthesis, 0019684 photosynthesis, light reaction, 0009523 photosystem II, 0016021 integral to membrane; PDB: 3A0H_m 3ARC_m 3A0B_M 3PRR_M 3PRQ_M 1S5L_M 4FBY_e 3BZ2_M 3BZ1_M 2AXT_M ....
Probab=33.74 E-value=83 Score=16.97 Aligned_cols=20 Identities=25% Similarity=0.514 Sum_probs=14.5
Q ss_pred HHHHHHhHHHHHHHHHHHHH
Q psy4354 94 FATCSTFYVPLLVILVLYWK 113 (342)
Q Consensus 94 ~~~~~~~~ip~~~i~~~y~~ 113 (342)
....++.++|....+++|.+
T Consensus 9 iAtaLfi~iPt~FLiilyvq 28 (31)
T PF05151_consen 9 IATALFILIPTAFLIILYVQ 28 (31)
T ss_dssp HHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHhheEee
Confidence 34455667888888888875
No 58
>PF04238 DUF420: Protein of unknown function (DUF420); InterPro: IPR007352 This is a predicted membrane protein with four transmembrane helices.
Probab=32.71 E-value=74 Score=24.06 Aligned_cols=37 Identities=19% Similarity=0.328 Sum_probs=27.3
Q ss_pred HHHHHhhHHHHHHHHHHHHHHhhhhHHHHHHHHhhcC
Q psy4354 254 EAKRERKAAKTLAIITGAFVICWLPFFMMALLLPLCQ 290 (342)
Q Consensus 254 ~~~~~~k~~k~l~~v~~~f~~cw~P~~i~~~~~~~~~ 290 (342)
+.+.+++...+.+....+|++|++=+....-...+.+
T Consensus 30 ~~~~Hr~~Ml~a~~ls~lFlv~Yl~~~~~~g~~~f~g 66 (133)
T PF04238_consen 30 RIKLHRKLMLTAFVLSALFLVSYLYYHFLGGSTPFGG 66 (133)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccCC
Confidence 4567889999999999999999987665533333333
No 59
>PF14752 RBP_receptor: Retinol binding protein receptor
Probab=31.36 E-value=1.1e+02 Score=29.84 Aligned_cols=57 Identities=18% Similarity=0.268 Sum_probs=40.2
Q ss_pred hhHHHHHHHHHHHHHHhhhhHHHHHHHHhhcCCCCchhHHHHHHHHHHhhcccccchhhhccChhHHHHHHHHHhcCCCC
Q psy4354 259 RKAAKTLAIITGAFVICWLPFFMMALLLPLCQTCYISDYLASFFLWLGYFNSTLNPVIYTVFSPEFRQAFKRILCGSPNR 338 (342)
Q Consensus 259 ~k~~k~l~~v~~~f~~cw~P~~i~~~~~~~~~~~~~~~~~~~~~~~l~~~ns~vNPiiY~~~n~~fR~~~~~ll~~~~~~ 338 (342)
.-.+-.=.+++++|+++|.|+.++.-+....+.... .+.....++-+++++++++.+
T Consensus 162 ~Ll~~lP~llCL~fL~~~f~~~lvk~~~~~~~~~~~-----------------------~l~~~~~~~yvk~LL~~~~~~ 218 (617)
T PF14752_consen 162 SLLASLPQLLCLAFLSLWFPYLLVKSFRNRTGKGSE-----------------------DLQSSYYEEYVKSLLRRKPLR 218 (617)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-----------------------ccccccHHHHHHHHhcCCCcc
Confidence 345555667899999999999888766554332111 667788888889988877543
No 60
>PF02439 Adeno_E3_CR2: Adenovirus E3 region protein CR2; InterPro: IPR003470 Early region 3 (E3) of human adenoviruses (Ads) codes for proteins that appear to control viral interactions with the host []. This region called CR1 (conserved region 1) [] is found three times in Human adenovirus 19 (a subgroup D adenovirus) 49 kDa protein in the E3 region. CR1 is also found in the 20.1 Kd protein of subgroup B adenoviruses. The function of this 80 amino acid region is unknown. This region is probably a divergent immunoglobulin domain.
Probab=28.23 E-value=1.1e+02 Score=17.36 Aligned_cols=27 Identities=15% Similarity=0.260 Sum_probs=15.5
Q ss_pred HHHHHHHhHHHHHHHHHHHHHHHHHHH
Q psy4354 93 IFATCSTFYVPLLVILVLYWKIYQTAR 119 (342)
Q Consensus 93 ~~~~~~~~~ip~~~i~~~y~~I~~~vr 119 (342)
+...++..+.-+++.+++|..-+++-+
T Consensus 8 IIv~V~vg~~iiii~~~~YaCcykk~~ 34 (38)
T PF02439_consen 8 IIVAVVVGMAIIIICMFYYACCYKKHR 34 (38)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHcccc
Confidence 344444445555666677777666554
No 61
>PF12877 DUF3827: Domain of unknown function (DUF3827); InterPro: IPR024606 The function of the proteins in this entry is not currently known, but one of the human proteins (Q9HCM3 from SWISSPROT) has been implicated in pilocytic astrocytomas [, , ]. In the majority of cases of pilocytic astrocytomas a tandem duplication produces an in-frame fusion of the gene encoding this protein and the BRAF oncogene. The resulting fusion protein has constitutive BRAF kinase activity and is capable of transforming cells.
Probab=27.70 E-value=3.5e+02 Score=26.51 Aligned_cols=20 Identities=30% Similarity=0.622 Sum_probs=8.8
Q ss_pred HHHHHHHHHHHHHHHHHHHH
Q psy4354 102 VPLLVILVLYWKIYQTARKR 121 (342)
Q Consensus 102 ip~~~i~~~y~~I~~~vr~~ 121 (342)
+|+++++++-+.++.++.+.
T Consensus 277 vPv~vV~~Iiiil~~~LCRk 296 (684)
T PF12877_consen 277 VPVLVVLLIIIILYWKLCRK 296 (684)
T ss_pred HHHHHHHHHHHHHHHHHhcc
Confidence 44444444444444444443
No 62
>PHA03234 DNA packaging protein UL33; Provisional
Probab=27.50 E-value=3.3e+02 Score=24.29 Aligned_cols=24 Identities=13% Similarity=-0.108 Sum_probs=15.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhh
Q psy4354 103 PLLVILVLYWKIYQTARKRIRRRR 126 (342)
Q Consensus 103 p~~~i~~~y~~I~~~vr~~~~~~~ 126 (342)
-.++=-++|...-...|+..++.-
T Consensus 291 nsclNPiIY~f~~~~FR~~~~~~~ 314 (338)
T PHA03234 291 HCFSNPLVYAFTGGDFRLRFTACF 314 (338)
T ss_pred hhhhhHHHHHHhhHHHHHHHHHHH
Confidence 334556778887777777655533
No 63
>PF05297 Herpes_LMP1: Herpesvirus latent membrane protein 1 (LMP1); InterPro: IPR007961 This family consists of several latent membrane protein 1 or LMP1s mostly from Epstein-Barr virus (strain GD1) (HHV-4) (Human herpesvirus 4). LMP1 of HHV-4 is a 62-65 kDa plasma membrane protein possessing six membrane spanning regions, a short cytoplasmic N terminus and a long cytoplasmic carboxy tail of 200 amino acids. HHV-4 virus latent membrane protein 1 (LMP1) is essential for HHV-4 mediated transformation and has been associated with several cases of malignancies. HHV-4-like viruses in Macaca fascicularis (Cynomolgus monkeys) have been associated with high lymphoma rates in immunosuppressed monkeys [].; GO: 0019087 transformation of host cell by virus, 0016021 integral to membrane; PDB: 1CZY_E 1ZMS_B.
Probab=27.04 E-value=21 Score=30.52 Aligned_cols=17 Identities=12% Similarity=0.136 Sum_probs=0.0
Q ss_pred HHhhHHHHHhHHHHhhh
Q psy4354 8 GVLSHYNVDMLGYSTVI 24 (342)
Q Consensus 8 ~~l~~~~~~~S~~~l~~ 24 (342)
+.+..+++.+|=++-.|
T Consensus 35 ail~w~~iimsd~t~~a 51 (381)
T PF05297_consen 35 AILVWFFIIMSDLTQGA 51 (381)
T ss_dssp -----------------
T ss_pred HHHHHHHHHHhccccch
Confidence 34444444444443333
No 64
>PRK06531 yajC preprotein translocase subunit YajC; Validated
Probab=26.15 E-value=91 Score=22.83 Aligned_cols=25 Identities=16% Similarity=0.404 Sum_probs=11.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhh
Q psy4354 101 YVPLLVILVLYWKIYQTARKRIRRRRQQ 128 (342)
Q Consensus 101 ~ip~~~i~~~y~~I~~~vr~~~~~~~~~ 128 (342)
++|++++++.|..+ +|.+.++...+
T Consensus 6 il~~vv~~~i~yf~---iRPQkKr~Ke~ 30 (113)
T PRK06531 6 IIMFVVMLGLIFFM---QRQQKKQAQER 30 (113)
T ss_pred HHHHHHHHHHHHhe---echHHHHHHHH
Confidence 34555554444333 44444444443
No 65
>PF05391 Lsm_interact: Lsm interaction motif; InterPro: IPR008669 This short motif is found at the C terminus of Prp24 proteins and probably interacts with the Lsm proteins to promote U4/U6 formation [].
Probab=25.99 E-value=31 Score=16.69 Aligned_cols=10 Identities=30% Similarity=0.584 Sum_probs=6.4
Q ss_pred cChhHHHHHH
Q psy4354 320 FSPEFRQAFK 329 (342)
Q Consensus 320 ~n~~fR~~~~ 329 (342)
.|.+||+-+.
T Consensus 11 SNddFrkmfl 20 (21)
T PF05391_consen 11 SNDDFRKMFL 20 (21)
T ss_pred chHHHHHHHc
Confidence 5677776553
No 66
>PF15102 TMEM154: TMEM154 protein family
Probab=25.84 E-value=12 Score=28.57 Aligned_cols=24 Identities=13% Similarity=0.268 Sum_probs=10.6
Q ss_pred HHHhHHHHHHHHHHHHHHHHHHHH
Q psy4354 97 CSTFYVPLLVILVLYWKIYQTARK 120 (342)
Q Consensus 97 ~~~~~ip~~~i~~~y~~I~~~vr~ 120 (342)
+++.++|+++++++-+.+...+.+
T Consensus 58 iLmIlIP~VLLvlLLl~vV~lv~~ 81 (146)
T PF15102_consen 58 ILMILIPLVLLVLLLLSVVCLVIY 81 (146)
T ss_pred EEEEeHHHHHHHHHHHHHHHheeE
Confidence 344456644444444444433433
No 67
>PF05478 Prominin: Prominin; InterPro: IPR008795 The prominins are an emerging family of proteins that, among the multispan membrane proteins, display a novel topology. Mouse and Homo sapiens prominin and (Mus musculus) prominin-like 1 (PROML1) are predicted to contain five membrane spanning domains, with an N-terminal domain exposed to the extracellular space followed by four, alternating small cytoplasmic and large extracellular, loops and a cytoplasmic C-terminal domain []. The exact function of prominin is unknown although in humans defects in PROM1, the gene coding for prominin, cause retinal degeneration [].; GO: 0016021 integral to membrane
Probab=25.56 E-value=1.9e+02 Score=29.46 Aligned_cols=77 Identities=14% Similarity=0.194 Sum_probs=0.0
Q ss_pred hHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccCCCCCCCCCCccceeeccccccccccccCCCCCCCC
Q psy4354 91 YQIFATCSTFYVPLLVILVLYWKIYQTARKRIRRRRQQRNVLMAGKKPDTSDNKTSHFIFFKKRKFFRIKKCTNVVPPSP 170 (342)
Q Consensus 91 ~~~~~~~~~~~ip~~~i~~~y~~I~~~vr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 170 (342)
|.++..+++.++-++++++.+..+..-+....++.......-.+
T Consensus 412 yR~~~~lil~~~llLIv~~~~lGLl~G~~G~~~~~~p~~r~c~~------------------------------------ 455 (806)
T PF05478_consen 412 YRWIVGLILCCVLLLIVLCLLLGLLCGCCGYRRRADPTDRGCSS------------------------------------ 455 (806)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCCcccCCCC------------------------------------
Q ss_pred CCCCcccccCCCCCcCCCCCCceeccCCCCCCCCCccccCCCCCCccceeccCCCCCCCCCCccccccccCCCCCcccch
Q psy4354 171 NKLSINVIDEDNGINNATTSSSLILADGHSNSDADRRTSINNEANTAFTITHNNGASQSNHNNECVQVKHKIPPTKKEKK 250 (342)
Q Consensus 171 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 250 (342)
T Consensus 456 -------------------------------------------------------------------------------- 455 (806)
T PF05478_consen 456 -------------------------------------------------------------------------------- 455 (806)
T ss_pred --------------------------------------------------------------------------------
Q ss_pred hhHHHHHHhhHHHHHHHHHHHHHHhhhhHHHHHHHHhhcC
Q psy4354 251 ESLEAKRERKAAKTLAIITGAFVICWLPFFMMALLLPLCQ 290 (342)
Q Consensus 251 ~~~~~~~~~k~~k~l~~v~~~f~~cw~P~~i~~~~~~~~~ 290 (342)
..-+.-+++.|.++|+++|+-..+..+....++
T Consensus 456 -------~tGg~~Lm~gv~~~Flf~~~l~l~~~~~Fl~G~ 488 (806)
T PF05478_consen 456 -------NTGGNFLMAGVGLSFLFSWFLMLLVLFYFLVGG 488 (806)
T ss_pred -------CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
No 68
>PF08525 OapA_N: Opacity-associated protein A N-terminal motif; InterPro: IPR013731 This domain is found in the Haemophilus influenzae opacity-associated protein (OapA). It is required for efficient nasopharyngeal mucosal colonisation, and its expression is associated with a distinctive transparent colony phenotype. OapA is thought to be a secreted protein, and its expression exhibits high-frequency phase variation [, ]. This motif occurs at the N terminus of these proteins. It contains a conserved histidine followed by a run of hydrophobic residues. Many of the proteins in this entry are unassigned peptidases belonging to MEROPS peptidase family M23B.
Probab=24.31 E-value=1e+02 Score=16.37 Aligned_cols=21 Identities=24% Similarity=0.305 Sum_probs=11.4
Q ss_pred HHHhhHHHHHHHHHHHHHHhhhh
Q psy4354 256 KRERKAAKTLAIITGAFVICWLP 278 (342)
Q Consensus 256 ~~~~k~~k~l~~v~~~f~~cw~P 278 (342)
+.++++...+..++++ +.|.|
T Consensus 8 ~~Hr~~l~~l~~v~l~--ll~~P 28 (30)
T PF08525_consen 8 KLHRRALIALSAVVLV--LLLWP 28 (30)
T ss_pred HHHHHHHHHHHHHHHH--HHhcc
Confidence 4456665555555554 45556
No 69
>PRK14094 psbM photosystem II reaction center protein M; Provisional
Probab=24.15 E-value=67 Score=19.10 Aligned_cols=23 Identities=17% Similarity=0.316 Sum_probs=15.8
Q ss_pred hHHHHHHHHhHHHHHHHHHHHHH
Q psy4354 91 YQIFATCSTFYVPLLVILVLYWK 113 (342)
Q Consensus 91 ~~~~~~~~~~~ip~~~i~~~y~~ 113 (342)
...+...++..+|.+.++++|..
T Consensus 6 lgfiAtaLFi~iPT~FLlilYVk 28 (50)
T PRK14094 6 FGFVASLLFVGVPTIFLIGLFIS 28 (50)
T ss_pred HHHHHHHHHHHHHHHHhhheeEE
Confidence 34445566778888888888764
No 70
>PF03904 DUF334: Domain of unknown function (DUF334); InterPro: IPR005602 This is a family of proteins found in Staphylococcus aureus plasmid with no characterised function.
Probab=24.12 E-value=3.8e+02 Score=22.36 Aligned_cols=34 Identities=15% Similarity=0.368 Sum_probs=22.0
Q ss_pred CchhHHHHHHHHhHHHHHHHHHHHHHHHHHHHHH
Q psy4354 88 DVGYQIFATCSTFYVPLLVILVLYWKIYQTARKR 121 (342)
Q Consensus 88 ~~~~~~~~~~~~~~ip~~~i~~~y~~I~~~vr~~ 121 (342)
+..+..++..+.|.+|.++-+...+.+|--+|.+
T Consensus 194 se~~~~~lwyi~Y~vPY~~~ig~~i~l~~~~~~~ 227 (230)
T PF03904_consen 194 SESFWTYLWYIAYLVPYIFAIGLFIYLYEWIRAK 227 (230)
T ss_pred hHhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHH
Confidence 3455556667778888877566666666666653
No 71
>COG4736 CcoQ Cbb3-type cytochrome oxidase, subunit 3 [Posttranslational modification, protein turnover, chaperones]
Probab=23.84 E-value=2e+02 Score=18.30 Aligned_cols=31 Identities=13% Similarity=0.094 Sum_probs=19.5
Q ss_pred HHHhHHHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy4354 97 CSTFYVPLLVILVLYWKIYQTARKRIRRRRQ 127 (342)
Q Consensus 97 ~~~~~ip~~~i~~~y~~I~~~vr~~~~~~~~ 127 (342)
....+.-+.+.+++++.|+...|.+.|....
T Consensus 9 ~a~a~~t~~~~l~fiavi~~ayr~~~K~~~d 39 (60)
T COG4736 9 FADAWGTIAFTLFFIAVIYFAYRPGKKGEFD 39 (60)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhcccchhhHH
Confidence 3334444556667777778778777666554
No 72
>PF14362 DUF4407: Domain of unknown function (DUF4407)
Probab=23.78 E-value=72 Score=27.89 Aligned_cols=21 Identities=19% Similarity=0.165 Sum_probs=16.6
Q ss_pred HHhHHHHhhheeeceeEEEee
Q psy4354 15 VDMLGYSTVIMFVDKYWAVTN 35 (342)
Q Consensus 15 ~~~S~~~l~~IaidRY~aI~~ 35 (342)
.++-+|-++.+++|||+...-
T Consensus 50 ~~glvwgl~I~~lDR~ivss~ 70 (301)
T PF14362_consen 50 PFGLVWGLVIFNLDRFIVSSI 70 (301)
T ss_pred HHHHHHHHHHHHHHHHHHhcc
Confidence 344578899999999997766
No 73
>PF02699 YajC: Preprotein translocase subunit; InterPro: IPR003849 Secretion across the inner membrane in some Gram-negative bacteria occurs via the preprotein translocase pathway. Proteins are produced in the cytoplasm as precursors, and require a chaperone subunit to direct them to the translocase component []. From there, the mature proteins are either targeted to the outer membrane, or remain as periplasmic proteins []. The translocase protein subunits are encoded on the bacterial chromosome. The translocase itself comprises 7 proteins, including a chaperone (SecB), ATPase (SecA), an integral membrane complex (SecY, SecE and SecG), and two additional membrane proteins that promote the release of the mature peptide into the periplasm (SecD and SecF) []. Other cytoplasmic/periplasmic proteins play a part in preprotein translocase activity, namely YidC and YajC []. The latter is bound in a complex to SecD and SecF, and plays a part in stabilising and regulating secretion through the SecYEG integral membrane component via SecA []. Homologues of the YajC gene have been found in a range of pathogenic and commensal microbes. Brucella abortus YajC- and SecD-like proteins were shown to stimulate a Th1 cell-mediated immune response in mice, and conferred protection when challenged with B. abortus []. Therefore, these proteins may have an antigenic role as well as a secretory one in virulent bacteria []. A number of previously uncharacterised "hypothetical" proteins also show similarity to E. coli YajC, suggesting that this family is wider than first thought []. More recently, the precise interactions between the E. coli SecYEG complex, SecD, SecF, YajC and YidC have been studied []. Rather than acting individually, the four proteins form a heterotetrameric complex and associate with the SecYEG heterotrimeric complex []. The SecF and YajC subunits link the complex to the integral membrane translocase. ; PDB: 2RDD_B.
Probab=22.53 E-value=2.3e+02 Score=19.21 Aligned_cols=23 Identities=9% Similarity=0.561 Sum_probs=13.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhh
Q psy4354 105 LVILVLYWKIYQTARKRIRRRRQ 127 (342)
Q Consensus 105 ~~i~~~y~~I~~~vr~~~~~~~~ 127 (342)
+++.++|...++--+++.+....
T Consensus 10 ~~~~i~yf~~~rpqkk~~k~~~~ 32 (82)
T PF02699_consen 10 IIFVIFYFLMIRPQKKQQKEHQE 32 (82)
T ss_dssp HHHHHHHHHTHHHHHHHHHHHTT
T ss_pred HHHHHHhhheecHHHHHHHHHHH
Confidence 55666676666666655554444
No 74
>PLN00090 photosystem II reaction center M protein; Provisional
Probab=22.17 E-value=1.6e+02 Score=20.56 Aligned_cols=24 Identities=17% Similarity=0.325 Sum_probs=17.0
Q ss_pred hHHHHHHHHhHHHHHHHHHHHHHH
Q psy4354 91 YQIFATCSTFYVPLLVILVLYWKI 114 (342)
Q Consensus 91 ~~~~~~~~~~~ip~~~i~~~y~~I 114 (342)
...+.+.++.++|.+.++++|+.-
T Consensus 76 LafIATaLFIlIPTaFLLILYVQT 99 (113)
T PLN00090 76 GAYLAVALGTFLPCLFLINLFIQT 99 (113)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhhh
Confidence 344455667789999998888753
No 75
>KOG4220|consensus
Probab=21.51 E-value=2.7e+02 Score=25.82 Aligned_cols=38 Identities=11% Similarity=0.037 Sum_probs=25.6
Q ss_pred HHHHHhhcccccchhhhccChhHHHHHHHHHhcCCCCC
Q psy4354 302 FLWLGYFNSTLNPVIYTVFSPEFRQAFKRILCGSPNRG 339 (342)
Q Consensus 302 ~~~l~~~ns~vNPiiY~~~n~~fR~~~~~ll~~~~~~~ 339 (342)
.+.=..+|.+.=++.-.-+.+.|++-+..-++.++.++
T Consensus 459 CYINSTiNP~CYALCNatFrkTfk~lL~Cr~~~~~~~~ 496 (503)
T KOG4220|consen 459 CYINSTINPLCYALCNATFRKTFKRLLLCRWKKRRTRR 496 (503)
T ss_pred eeecccccHHHHHHHhHHHHHHHHHhheeeecccchhc
Confidence 33444556666666777788889888888776665443
No 76
>PF09835 DUF2062: Uncharacterized protein conserved in bacteria (DUF2062); InterPro: IPR018639 This domain, found in various prokaryotic proteins, has no known function. It is found at the C-terminal of family 2 glycosyltransferase proteins, in addition to proteins of unknown function.
Probab=21.38 E-value=3.3e+02 Score=20.88 Aligned_cols=30 Identities=23% Similarity=0.528 Sum_probs=21.2
Q ss_pred HHhHHHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy4354 98 STFYVPLLVILVLYWKIYQTARKRIRRRRQ 127 (342)
Q Consensus 98 ~~~~ip~~~i~~~y~~I~~~vr~~~~~~~~ 127 (342)
...++.++..++.|..++..+++.++++.+
T Consensus 123 G~~i~~~v~~~i~Y~l~~~~~~~~r~~r~~ 152 (154)
T PF09835_consen 123 GSLILGIVLGIISYFLVYFLVRKYRKRRRK 152 (154)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 344555567778888888888877766655
No 77
>PF10624 TraS: Plasmid conjugative transfer entry exclusion protein TraS; InterPro: IPR018898 Entry exclusion (Eex) is a process which prevents redundant transfer of DNA between donor cells. TraS is a protein involved in Eex. It blocks redundant conjugative DNA synthesis and transport between donor cells, and it is suggested that TraS interferes with a signalling pathway that is required to trigger DNA transfer. TraS on the recipient cell is known to interact with TraG on the donor cell.
Probab=20.59 E-value=43 Score=24.98 Aligned_cols=48 Identities=19% Similarity=0.267 Sum_probs=22.4
Q ss_pred HHHHHHhhhhHHHHHHHHhhcCCCCchhH---HHHHHHHHHhhcccccchhh
Q psy4354 269 TGAFVICWLPFFMMALLLPLCQTCYISDY---LASFFLWLGYFNSTLNPVIY 317 (342)
Q Consensus 269 ~~~f~~cw~P~~i~~~~~~~~~~~~~~~~---~~~~~~~l~~~ns~vNPiiY 317 (342)
++.|++.|.-.++..+...-.. ....+. -..+...++-+.|.+||++-
T Consensus 31 il~f~~lWqglFiwlF~qIrkK-r~v~defkfsk~vwyi~mpVcsllsPlls 81 (164)
T PF10624_consen 31 ILLFIVLWQGLFIWLFIQIRKK-RNVSDEFKFSKGVWYILMPVCSLLSPLLS 81 (164)
T ss_pred HHHHHHHHHHHHHHHHHHHHhc-CCCcchhccccceEEeeecHHHHHhHHHH
Confidence 4456666666555555543221 111111 11122335556677777654
No 78
>KOG4583|consensus
Probab=20.59 E-value=6.2e+02 Score=22.63 Aligned_cols=15 Identities=27% Similarity=0.306 Sum_probs=7.9
Q ss_pred HHHHHHHHHHHhhhh
Q psy4354 264 TLAIITGAFVICWLP 278 (342)
Q Consensus 264 ~l~~v~~~f~~cw~P 278 (342)
++-..+..|+.+.+|
T Consensus 372 t~~sfvtTFFaSLlP 386 (391)
T KOG4583|consen 372 TAWSFVTTFFASLLP 386 (391)
T ss_pred HHHHHHHHHHHHhcC
Confidence 344455556655555
No 79
>TIGR02976 phageshock_pspB phage shock protein B. This model describes the PspB protein of the psp (phage shock protein) operon, as found in Escherichia coli and many related species. Expression of a phage protein called secretin protein IV, and a number of other stresses including ethanol, heat shock, and defects in protein secretion trigger sigma-54-dependent expression of the phage shock regulon. PspB is both a regulator and an effector protein of the phage shock response.
Probab=20.56 E-value=2.3e+02 Score=19.05 Aligned_cols=26 Identities=15% Similarity=0.425 Sum_probs=14.2
Q ss_pred HhHHHHHHHHHHHHHHHHHHHHHHHH
Q psy4354 99 TFYVPLLVILVLYWKIYQTARKRIRR 124 (342)
Q Consensus 99 ~~~ip~~~i~~~y~~I~~~vr~~~~~ 124 (342)
++++|+++.++.-.-++..+...+++
T Consensus 5 fl~~Pliif~ifVap~wl~lHY~~k~ 30 (75)
T TIGR02976 5 FLAIPLIIFVIFVAPLWLILHYRSKR 30 (75)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhh
Confidence 45677666655555555555444433
No 80
>PF07325 Curto_V2: Curtovirus V2 protein; InterPro: IPR009931 This family consists of several Curtovirus V2 proteins. The exact function of V2 is unclear, but the protein is known to be required for successful infection of the host.
Probab=20.50 E-value=73 Score=22.73 Aligned_cols=18 Identities=22% Similarity=0.353 Sum_probs=13.7
Q ss_pred hHHHHHHHHHhcCCCCCC
Q psy4354 323 EFRQAFKRILCGSPNRGR 340 (342)
Q Consensus 323 ~fR~~~~~ll~~~~~~~~ 340 (342)
.|.++++++|+++++-+|
T Consensus 52 qFQKevKKLLk~k~sF~r 69 (126)
T PF07325_consen 52 QFQKEVKKLLKRKCSFKR 69 (126)
T ss_pred HHHHHHHHHHHHhcccch
Confidence 589999999987765433
No 81
>PF05398 PufQ: PufQ cytochrome subunit; InterPro: IPR008800 This family consists of bacterial PufQ proteins. PufQ is required for bacteriochlorophyll biosynthesis, serving a regulatory function in the formation of photosynthetic complexes.; GO: 0015979 photosynthesis, 0030494 bacteriochlorophyll biosynthetic process
Probab=20.36 E-value=2.5e+02 Score=18.65 Aligned_cols=26 Identities=23% Similarity=0.143 Sum_probs=17.4
Q ss_pred hhHHHHHHHHhHHHHHHHHHHHHHHH
Q psy4354 90 GYQIFATCSTFYVPLLVILVLYWKIY 115 (342)
Q Consensus 90 ~~~~~~~~~~~~ip~~~i~~~y~~I~ 115 (342)
.+..+..++..-+|+.++.+.|..|.
T Consensus 21 f~vYFalIflaAlP~a~l~W~~~~ir 46 (73)
T PF05398_consen 21 FYVYFALIFLAALPFATLTWAYALIR 46 (73)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34445556666789988887776653
No 82
>PHA02975 hypothetical protein; Provisional
Probab=20.33 E-value=2.1e+02 Score=18.67 Aligned_cols=25 Identities=24% Similarity=0.301 Sum_probs=15.4
Q ss_pred hHHHHHHHHhHHHHHHHHHHHHHHH
Q psy4354 91 YQIFATCSTFYVPLLVILVLYWKIY 115 (342)
Q Consensus 91 ~~~~~~~~~~~ip~~~i~~~y~~I~ 115 (342)
+.++...+++++.+++.+++|.+..
T Consensus 44 ~~~~ii~i~~v~~~~~~~flYLK~~ 68 (69)
T PHA02975 44 SIILIIFIIFITCIAVFTFLYLKLM 68 (69)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 3444444556666677788887654
No 83
>COG2322 Predicted membrane protein [Function unknown]
Probab=20.31 E-value=2e+02 Score=22.71 Aligned_cols=33 Identities=12% Similarity=0.138 Sum_probs=26.2
Q ss_pred HHHHHhhHHHHHHHHHHHHHHhhhhHHHHHHHH
Q psy4354 254 EAKRERKAAKTLAIITGAFVICWLPFFMMALLL 286 (342)
Q Consensus 254 ~~~~~~k~~k~l~~v~~~f~~cw~P~~i~~~~~ 286 (342)
..++++++..+.....++|+++++-+....--.
T Consensus 71 ~i~~Hk~aMltA~~l~l~FlvlYltr~~l~~~t 103 (177)
T COG2322 71 NIEKHKRAMLTAFTLALVFLVLYLTRHGLGGET 103 (177)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHhccccc
Confidence 457789999999999999999998876654433
No 84
>PHA02650 hypothetical protein; Provisional
Probab=20.01 E-value=2.3e+02 Score=19.12 Aligned_cols=25 Identities=16% Similarity=0.062 Sum_probs=13.5
Q ss_pred hHHHHHHHHhHHHHHHHHHHHHHHH
Q psy4354 91 YQIFATCSTFYVPLLVILVLYWKIY 115 (342)
Q Consensus 91 ~~~~~~~~~~~ip~~~i~~~y~~I~ 115 (342)
+.++..++++++-+++.+++|.+..
T Consensus 49 ~~~~ii~i~~v~i~~l~~flYLK~~ 73 (81)
T PHA02650 49 GQNFIFLIFSLIIVALFSFFVFKGY 73 (81)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 3333334444555566667776654
Done!