Query psy1136
Match_columns 167
No_of_seqs 109 out of 1125
Neff 9.2
Searched_HMMs 46136
Date Fri Aug 16 16:22:13 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy1136.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/1136hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4219|consensus 99.8 2.4E-20 5.1E-25 146.2 5.5 126 26-151 29-176 (423)
2 KOG4220|consensus 99.8 4.7E-19 1E-23 139.2 7.6 132 34-165 32-188 (503)
3 PHA03234 DNA packaging protein 99.7 1.5E-17 3.3E-22 131.5 10.8 118 28-147 28-169 (338)
4 PHA03235 DNA packaging protein 99.6 2.6E-15 5.7E-20 121.5 12.4 118 26-147 26-170 (409)
5 PHA02834 chemokine receptor-li 99.6 2E-15 4.4E-20 118.7 11.2 116 30-148 26-164 (323)
6 PHA02638 CC chemokine receptor 99.6 2.6E-14 5.7E-19 116.0 10.8 116 31-149 97-236 (417)
7 PHA03087 G protein-coupled che 99.5 2.5E-13 5.4E-18 107.1 10.4 118 29-148 37-178 (335)
8 PF00001 7tm_1: 7 transmembran 99.4 2.4E-12 5.2E-17 96.0 7.7 100 49-148 1-124 (257)
9 PF10320 7TM_GPCR_Srsx: Serpen 98.5 2.9E-08 6.2E-13 75.9 1.5 100 44-150 2-101 (257)
10 KOG2087|consensus 98.1 4E-06 8.6E-11 66.0 3.7 79 35-114 27-113 (363)
11 PF10328 7TM_GPCR_Srx: Serpent 97.2 0.00085 1.8E-08 51.6 5.6 43 42-84 3-45 (274)
12 PF10324 7TM_GPCR_Srw: Serpent 97.2 0.0014 3E-08 51.4 6.8 49 42-91 6-55 (318)
13 PF11710 Git3: G protein-coupl 97.1 0.0026 5.7E-08 47.0 6.8 54 61-114 30-83 (201)
14 PF05296 TAS2R: Mammalian tast 97.0 0.012 2.7E-07 46.1 10.3 53 32-84 6-61 (303)
15 PF05462 Dicty_CAR: Slime mold 96.9 0.025 5.3E-07 44.5 11.0 77 34-114 8-84 (303)
16 KOG4219|consensus 96.1 0.011 2.4E-07 47.6 5.1 53 115-167 63-115 (423)
17 PF10321 7TM_GPCR_Srt: Serpent 94.9 0.26 5.7E-06 38.9 8.8 55 30-84 30-84 (313)
18 PF03402 V1R: Vomeronasal orga 92.6 0.21 4.5E-06 38.5 4.2 53 62-116 6-59 (265)
19 PF10317 7TM_GPCR_Srd: Serpent 91.6 0.92 2E-05 35.2 6.9 47 38-84 4-51 (292)
20 PF00002 7tm_2: 7 transmembran 87.0 0.39 8.5E-06 35.9 1.7 71 41-114 9-81 (242)
21 KOG4564|consensus 83.7 13 0.00028 31.2 9.1 78 37-114 149-238 (473)
22 PHA03234 DNA packaging protein 79.5 1.7 3.7E-05 34.6 2.6 46 119-166 66-111 (338)
23 KOG4220|consensus 76.2 0.71 1.5E-05 37.8 -0.4 49 115-163 58-106 (503)
24 PF10316 7TM_GPCR_Srbc: Serpen 74.8 18 0.00039 28.1 7.0 105 38-147 11-118 (273)
25 PF00001 7tm_1: 7 transmembran 66.5 4.6 0.0001 29.3 2.1 49 117-165 14-62 (257)
26 PF10323 7TM_GPCR_Srv: Serpent 62.3 3.6 7.8E-05 31.9 0.8 40 45-84 7-50 (283)
27 PF01102 Glycophorin_A: Glycop 60.4 8.2 0.00018 26.2 2.2 23 46-68 76-98 (122)
28 PHA02638 CC chemokine receptor 56.6 21 0.00046 29.3 4.5 46 117-164 127-172 (417)
29 PF10327 7TM_GPCR_Sri: Serpent 55.8 22 0.00047 27.9 4.3 62 34-95 10-75 (303)
30 PHA02834 chemokine receptor-li 54.5 11 0.00023 29.6 2.4 43 121-165 61-103 (323)
31 PF09882 DUF2109: Predicted me 46.4 44 0.00096 20.7 3.6 48 44-91 5-52 (78)
32 PF02118 Srg: Srg family chemo 44.7 1.2E+02 0.0026 22.9 6.7 42 43-84 8-52 (275)
33 TIGR01477 RIFIN variant surfac 42.6 23 0.00051 28.5 2.5 30 35-64 310-339 (353)
34 PF10292 7TM_GPCR_Srab: Serpen 39.8 1.9E+02 0.004 22.8 8.6 47 37-83 21-67 (324)
35 PF02468 PsbN: Photosystem II 39.1 38 0.00083 18.5 2.3 32 36-67 7-38 (43)
36 PTZ00046 rifin; Provisional 38.4 30 0.00065 28.0 2.5 30 35-64 315-344 (358)
37 PRK13183 psbN photosystem II r 36.5 45 0.00098 18.4 2.3 32 36-67 10-41 (46)
38 PF02009 Rifin_STEVOR: Rifin/s 33.7 32 0.00069 27.1 2.0 28 36-63 257-284 (299)
39 CHL00020 psbN photosystem II p 33.6 43 0.00092 18.3 1.9 32 36-67 7-38 (43)
40 PF11446 DUF2897: Protein of u 32.4 46 0.001 19.2 2.0 15 40-54 9-23 (55)
41 PF10326 7TM_GPCR_Str: Serpent 30.9 32 0.0007 26.6 1.7 44 41-84 7-51 (307)
42 PHA03087 G protein-coupled che 29.7 48 0.001 25.9 2.5 44 121-165 73-116 (335)
43 PF12606 RELT: Tumour necrosis 29.6 71 0.0015 18.0 2.5 24 37-62 5-28 (50)
44 PF06024 DUF912: Nucleopolyhed 28.7 87 0.0019 20.3 3.2 9 55-63 84-92 (101)
45 PF06679 DUF1180: Protein of u 25.4 1.5E+02 0.0032 21.2 4.0 24 34-57 94-117 (163)
46 PF02060 ISK_Channel: Slow vol 21.2 1.1E+02 0.0024 20.9 2.6 10 63-72 74-83 (129)
47 PF01299 Lamp: Lysosome-associ 20.8 2E+02 0.0043 22.5 4.4 11 52-62 289-299 (306)
48 PHA03235 DNA packaging protein 20.7 1E+02 0.0022 25.3 2.8 39 123-165 70-110 (409)
49 PF05545 FixQ: Cbb3-type cytoc 20.7 1.6E+02 0.0036 16.0 3.7 18 41-60 16-33 (49)
No 1
>KOG4219|consensus
Probab=99.81 E-value=2.4e-20 Score=146.19 Aligned_cols=126 Identities=24% Similarity=0.402 Sum_probs=109.3
Q ss_pred ccchhHHHHHHHHHHHHHHHHhhhhHhhhHhheeccccccchhhHhHHHHHHHHHHhccccchhHHHHhhcccccccccc
Q psy1136 26 PTLLNVYVFFIILYGVTILSGVALNIFAIYHIIVERLFRDATCGYFINIALADIIKCMFVLPITLMVTLVQNWTWGSFLC 105 (167)
Q Consensus 26 ~~~~~~~~~~~~~~~ii~~~gl~gN~~vi~vi~~~~~~~~~~~~~l~nLaisDll~~l~~~p~~~~~~~~~~w~~g~~~C 105 (167)
......+.+..++|.++++++++||+++++++..+||+|+.+|+|++|||+||++++++..|+...+...+.|.+|.++|
T Consensus 29 ~lp~~~~~~wai~yg~l~~vAv~GN~iVlwIil~hrrMRtvtnyfL~NLAfADl~~s~Fn~~f~f~yal~~~W~~G~f~C 108 (423)
T KOG4219|consen 29 VLPAWQQALWAIAYGLLVFVAVVGNLIVLWIILAHRRMRTVTNYFLVNLAFADLSMSIFNTVFNFQYALHQEWYFGSFYC 108 (423)
T ss_pred cCCHHHHHHHHHHHHHHHHHHHhcCceEEEEEeehhehhhhHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhcccccccee
Confidence 34556677889999999999999999999999999999999999999999999999999999988878888999999999
Q ss_pred chhhhHHhhh--------------h--------hhcchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Q psy1136 106 YFLPMLQVME--------------R--------LFRDATCGYFINIALADIIKCMFVLPITLMVTLVQ 151 (167)
Q Consensus 106 ~~~~~~~~~~--------------~--------~~~~~~~~~~~~i~~~w~~~~~~~~P~~~~~~~~~ 151 (167)
++..|+...+ | +.++..+.+...|..+|.+|++.+.|..+.+....
T Consensus 109 ~f~nf~~itav~vSVfTlvAiA~DRy~AIi~Pl~~r~s~r~sk~iIllIW~lA~l~a~P~~l~s~v~~ 176 (423)
T KOG4219|consen 109 RFVNFFPITAVFVSVFTLVAIAIDRYMAIIHPLQPRPSRRSSKIIILLIWALALLLALPQLLYSSVEE 176 (423)
T ss_pred eeccccchhhhhHhHHHHHHHHHHHHHHHhhhcccCCCCcceeehhHHHHHHHHHHhccceeeeeeEE
Confidence 9998876662 1 34466677788888999999999999988776544
No 2
>KOG4220|consensus
Probab=99.78 E-value=4.7e-19 Score=139.23 Aligned_cols=132 Identities=21% Similarity=0.318 Sum_probs=116.3
Q ss_pred HHHHHHHHHHHHHhhhhHhhhHhheeccccccchhhHhHHHHHHHHHHhccccchhHHHHhhccccccccccchhhhHHh
Q psy1136 34 FFIILYGVTILSGVALNIFAIYHIIVERLFRDATCGYFINIALADIIKCMFVLPITLMVTLVQNWTWGSFLCYFLPMLQV 113 (167)
Q Consensus 34 ~~~~~~~ii~~~gl~gN~~vi~vi~~~~~~~~~~~~~l~nLaisDll~~l~~~p~~~~~~~~~~w~~g~~~C~~~~~~~~ 113 (167)
+.+++-.++.++.++||++|++.+...|++|+..|+|+++||++|++.+.+++|+...+.+.|.|.+|...|.+-..+.+
T Consensus 32 ~i~~v~~~lsLVTv~GNlLVmiSfKvnrqLqTVnNYfLfSLAcADliIG~~SMnl~t~Y~lmg~W~LG~~~CdlWLalDY 111 (503)
T KOG4220|consen 32 FIVVVTGSLSLVTVVGNLLVMISFKVNRQLQTVNNYFLFSLACADLIIGAFSMNLYTTYTLMGYWPLGPLVCDLWLALDY 111 (503)
T ss_pred eeehhhhHHHHHhhhccEEEEEEEEecceeeeecceeehHHHHhhhhhheeechHHHHHHHHcccccchHHHHHHHHHHH
Confidence 45567778899999999999999999999999999999999999999999999999999999999999999999888877
Q ss_pred hh--------------h----------hhcchhHHHHHHHHHHHHHHHHHHHHHHHHHH-hhcccccccccceeccc
Q psy1136 114 ME--------------R----------LFRDATCGYFINIALADIIKCMFVLPITLMVT-LVQNWTWGSFLCYFLPM 165 (167)
Q Consensus 114 ~~--------------~----------~~~~~~~~~~~~i~~~w~~~~~~~~P~~~~~~-~~~~w~~g~~~C~~~~~ 165 (167)
.. | +.+.+++++.++|+..|+++.++..|..++++ +.+..+.....|...+.
T Consensus 112 vaSNASVmNLLiISFDRYFsVTrPLtYrakRTtkrA~~MI~~AW~iSfiLWaPaIl~WqyivGkrTv~~~eC~iQFl 188 (503)
T KOG4220|consen 112 VASNASVMNLLIISFDRYFSVTRPLTYRAKRTTKRAGLMIGAAWVLSFVLWAPAILFWQYIVGKRTVPDGECYIQFL 188 (503)
T ss_pred HhhhhhhhhhheeeeecceeecccccccccccchHHHHHHHHHHHHHHHHHHHHHHhhHhheeeeecCCCceEEEee
Confidence 71 1 45578889999999999999999999887775 56678888889977654
No 3
>PHA03234 DNA packaging protein UL33; Provisional
Probab=99.74 E-value=1.5e-17 Score=131.53 Aligned_cols=118 Identities=14% Similarity=0.183 Sum_probs=89.2
Q ss_pred chhHHHHHHHHHHHHHHHHhhhhHhhhHhh--eeccccccchhhHhHHHHHHHHHHhccccchhHHHHhhcccccccccc
Q psy1136 28 LLNVYVFFIILYGVTILSGVALNIFAIYHI--IVERLFRDATCGYFINIALADIIKCMFVLPITLMVTLVQNWTWGSFLC 105 (167)
Q Consensus 28 ~~~~~~~~~~~~~ii~~~gl~gN~~vi~vi--~~~~~~~~~~~~~l~nLaisDll~~l~~~p~~~~~~~~~~w~~g~~~C 105 (167)
....+.+..++|.+++++|++||+++++++ .+++++|+++|+|+.|||+||++.++ .+|...... .++|++|+..|
T Consensus 28 ~~~~~~~~~~~y~~vf~~gl~gN~lvl~v~~~~~~~~~rt~tn~fi~NLAvaDLL~~l-~lp~~~~~~-~~~w~fG~~lC 105 (338)
T PHA03234 28 LKKAQILESAINGIMLTLIIPMIIIVICTLIIYHKVAKHNATSFYLITLFASDFLHML-CVFFLTLNR-EALFNFNQAFC 105 (338)
T ss_pred HHHHHHHhhHHHHHHHHHHhhhHHHHHHHHHHHhccccccHHHHHHHHHHHHHHHHHH-HHHHHHHHH-hCCccCchhHH
Confidence 345677889999999999999999999844 45667799999999999999999865 567665533 45799999999
Q ss_pred chhhhHHhh--------------hh------hh--cchhHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy1136 106 YFLPMLQVM--------------ER------LF--RDATCGYFINIALADIIKCMFVLPITLMV 147 (167)
Q Consensus 106 ~~~~~~~~~--------------~~------~~--~~~~~~~~~~i~~~w~~~~~~~~P~~~~~ 147 (167)
|+.+++... +| +. +..+++....+++.|+.+++.+.|.....
T Consensus 106 k~~~~~~~~~~~~Si~~L~~ISiDRY~aIv~p~~~~~~~~~~~~~i~~~Wi~s~l~~~P~l~~~ 169 (338)
T PHA03234 106 QCVLFIYHASCSYSICMLAIIATIRYKTLHRRKKNDKKNNHIGRNIGILFLASAMCAIPAALFV 169 (338)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhHeeeechhhhhhhhhhHHHHHHHHHHHHHHHHhhHhHee
Confidence 999887766 22 11 11122333456667999999999987654
No 4
>PHA03235 DNA packaging protein UL33; Provisional
Probab=99.64 E-value=2.6e-15 Score=121.51 Aligned_cols=118 Identities=17% Similarity=0.196 Sum_probs=86.3
Q ss_pred ccchhHHHHHHHHHHHHHHHHhhhhHhhhHhheecccc--ccchhhHhHHHHHHHHHHhccccchhHHHHhhcccc--cc
Q psy1136 26 PTLLNVYVFFIILYGVTILSGVALNIFAIYHIIVERLF--RDATCGYFINIALADIIKCMFVLPITLMVTLVQNWT--WG 101 (167)
Q Consensus 26 ~~~~~~~~~~~~~~~ii~~~gl~gN~~vi~vi~~~~~~--~~~~~~~l~nLaisDll~~l~~~p~~~~~~~~~~w~--~g 101 (167)
+.....+.+..+++.+++++|++||+++++++.+++|+ ++.+++|+.|||++|++. ++.+|+.+.. +.|. .|
T Consensus 26 ~~~~~~~~~~~~~~~li~vvGiigN~lVL~~~~~~~r~~~~~~~~~~I~NLAvsDLl~-l~~lP~~i~~---~~~~~~~g 101 (409)
T PHA03235 26 EGLSAARTTETFINLLIISVGGPLNLIVLVTQLLANRVHGFSTPTLYMTNLYLANLLT-VFVLPFIMLS---NQGLLSGS 101 (409)
T ss_pred hhhhhhHhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCCccHHHHHHHHHHHHHH-HHHHHHHHHh---cCccccCC
Confidence 33455677889999999999999999999875433332 456789999999999985 5678877653 3454 45
Q ss_pred ccccchhhhHHhh--------------hh---------hhcchhHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy1136 102 SFLCYFLPMLQVM--------------ER---------LFRDATCGYFINIALADIIKCMFVLPITLMV 147 (167)
Q Consensus 102 ~~~C~~~~~~~~~--------------~~---------~~~~~~~~~~~~i~~~w~~~~~~~~P~~~~~ 147 (167)
+..|++.+++... +| +.+...+++...++++|+.+++++.|.....
T Consensus 102 ~~~Ck~~~~l~~~~~~~Si~tL~~ISiDRY~aI~~p~~~~~~~~~~a~~ii~~iWi~sll~s~P~~~~~ 170 (409)
T PHA03235 102 VAGCKFASLLYYASCTVGFATVALIAADRYRVIHQRTRARSSAYRSTYKILGLTWFASLICSGPAPVYT 170 (409)
T ss_pred CCeehhHHHHHHHHHHHHHHHHHHHHHHHHHHeechhhccCcccchhhhhHHHHHHHHHHHHHHHHHHH
Confidence 7899999876655 12 1112345677778889999999999976554
No 5
>PHA02834 chemokine receptor-like protein; Provisional
Probab=99.64 E-value=2e-15 Score=118.73 Aligned_cols=116 Identities=21% Similarity=0.447 Sum_probs=87.9
Q ss_pred hHHHHHHHHHHHHHHHHhhhhHhhhHhheeccccccchhhHhHHHHHHHHHHhccccchhHHHHhhccccccccccchhh
Q psy1136 30 NVYVFFIILYGVTILSGVALNIFAIYHIIVERLFRDATCGYFINIALADIIKCMFVLPITLMVTLVQNWTWGSFLCYFLP 109 (167)
Q Consensus 30 ~~~~~~~~~~~ii~~~gl~gN~~vi~vi~~~~~~~~~~~~~l~nLaisDll~~l~~~p~~~~~~~~~~w~~g~~~C~~~~ 109 (167)
..+.+..++|.+++++|++||+++++++.++|++ ++.|+|+.|||++|++. .+.+|+.+... .++|.+|+..|++.+
T Consensus 26 ~~~~~~~~~~~li~v~~~~gN~lVi~vi~~~~~~-~~~n~~i~nLAiaDll~-~~~lP~~i~~~-~~~w~~g~~~C~~~~ 102 (323)
T PHA02834 26 DVNYFVIVFYILLFIFGLIGNVLVIAVLIVKRFM-FVVDVYLFNIAMSDLML-VFSFPFIIHND-LNEWIFGEFMCKLVL 102 (323)
T ss_pred chhhhHHHHHHHHHHHHHhhHHHHHHHHHhcccc-chhhhhhHHHHHHHHHH-HHHHHHHHHHH-cCCcCCcchHHHhHH
Confidence 3455778999999999999999999988775554 56899999999999985 56789876644 357999999999877
Q ss_pred hHHhh--------------hh--------hhc-chhHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy1136 110 MLQVM--------------ER--------LFR-DATCGYFINIALADIIKCMFVLPITLMVT 148 (167)
Q Consensus 110 ~~~~~--------------~~--------~~~-~~~~~~~~~i~~~w~~~~~~~~P~~~~~~ 148 (167)
+.... +| +.+ ...+++...++.+|+++++.+.|......
T Consensus 103 ~~~~~~~~~Si~tL~~IsidRY~aI~~P~~~~~~~~~~~~~~i~~iWi~s~l~~~P~~~~~~ 164 (323)
T PHA02834 103 GVYFVGFFSNMFFVTLISIDRYILVVNATKIKNKSISLSVLLSVAAWVCSVILSMPAMVLYY 164 (323)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhhheeCchhccCCccchHHHHHHHHHHHHHHHHhhHHHHHH
Confidence 55443 22 111 12345566778899999999999876544
No 6
>PHA02638 CC chemokine receptor-like protein; Provisional
Probab=99.56 E-value=2.6e-14 Score=116.01 Aligned_cols=116 Identities=17% Similarity=0.305 Sum_probs=90.0
Q ss_pred HHHHHHHHHHHHHHHHhhhhHhhhHhheeccccccchhhHhHHHHHHHHHHhccccchhHHHHhhccccccccccchhhh
Q psy1136 31 VYVFFIILYGVTILSGVALNIFAIYHIIVERLFRDATCGYFINIALADIIKCMFVLPITLMVTLVQNWTWGSFLCYFLPM 110 (167)
Q Consensus 31 ~~~~~~~~~~ii~~~gl~gN~~vi~vi~~~~~~~~~~~~~l~nLaisDll~~l~~~p~~~~~~~~~~w~~g~~~C~~~~~ 110 (167)
...+..++|.+++++|++||+++++++. +|++|+++|+++.|||++|++.. +.+|+++... .++|.+|+..|++..+
T Consensus 97 ~~~~l~~~y~lvfvlgliGN~LVl~il~-~k~lrt~t~i~llnLAisDLl~~-l~lPf~i~~~-~~~W~fg~~~Ck~~~~ 173 (417)
T PHA02638 97 ISEYIKIFYIIIFILGLFGNAAIIMILF-CKKIKTITDIYIFNLAISDLIFV-IDFPFIIYNE-FDQWIFGDFMCKVISA 173 (417)
T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHH-hccCCCHhHHHHHHHHHHHHHHH-HHHHHHHHHH-hccccccccchhhHHH
Confidence 4557788999999999999999987655 58889999999999999999864 5789887754 4689999999999765
Q ss_pred HHhh--------------hh--------h--hcchhHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy1136 111 LQVM--------------ER--------L--FRDATCGYFINIALADIIKCMFVLPITLMVTL 149 (167)
Q Consensus 111 ~~~~--------------~~--------~--~~~~~~~~~~~i~~~w~~~~~~~~P~~~~~~~ 149 (167)
+... .| + .....+.....++..|+++++++.|.......
T Consensus 174 l~~~~~~~Si~~L~~isiDRYlaIv~p~~~~~~~~~~~~~i~~~~iW~~s~l~slP~~~~~~~ 236 (417)
T PHA02638 174 SYYIGFFSNMFLITLMSIDRYFAILYPISFQKYRTFNIGIILCIISWILSLIITSPAYFIFEA 236 (417)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccccccceecHhhhHhhHhHHHHHHHHHHHHHHHHhhc
Confidence 4444 22 1 11233445666778999999999998776543
No 7
>PHA03087 G protein-coupled chemokine receptor-like protein; Provisional
Probab=99.49 E-value=2.5e-13 Score=107.10 Aligned_cols=118 Identities=22% Similarity=0.381 Sum_probs=92.1
Q ss_pred hhHHHHHHHHHHHHHHHHhhhhHhhhHhheeccccccchhhHhHHHHHHHHHHhccccchhHHHHhhccccccccccchh
Q psy1136 29 LNVYVFFIILYGVTILSGVALNIFAIYHIIVERLFRDATCGYFINIALADIIKCMFVLPITLMVTLVQNWTWGSFLCYFL 108 (167)
Q Consensus 29 ~~~~~~~~~~~~ii~~~gl~gN~~vi~vi~~~~~~~~~~~~~l~nLaisDll~~l~~~p~~~~~~~~~~w~~g~~~C~~~ 108 (167)
...+.+..+++.+++++|++||+++++++.++ +.|++.|+++.|||++|++.++ ..|........++|.+|+..|++.
T Consensus 37 ~~~~~~~~~~~~~i~~~gl~gN~lvl~~~~~~-~~~~~~~~ll~~laisDll~~~-~~~~~~~~~~~~~~~~~~~~C~~~ 114 (335)
T PHA03087 37 DTNSTILIVVYSTIFFFGLVGNIIVIYVLTKT-KIKTPMDIYLLNLAVSDLLFVM-TLPFQIYYYILFQWSFGEFACKIV 114 (335)
T ss_pred cchhhHHHHHHHHHHHHHHHhhHhEEeeehhc-cccCchHHHHHHHHHHHHHHHH-hHHHHHHHHhCCCCCCCcHHHHHH
Confidence 44566788899999999999999999988877 8899999999999999998655 577766555567899999999988
Q ss_pred hhHHhh--------------hh----------hhcchhHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy1136 109 PMLQVM--------------ER----------LFRDATCGYFINIALADIIKCMFVLPITLMVT 148 (167)
Q Consensus 109 ~~~~~~--------------~~----------~~~~~~~~~~~~i~~~w~~~~~~~~P~~~~~~ 148 (167)
.+.... +| +.....+++...+...|+++++.+.|......
T Consensus 115 ~~~~~~~~~~S~~~l~~iaidRy~aI~~p~~~~~~~~~~~~~~~~~~iWl~~~~~~~p~~~~~~ 178 (335)
T PHA03087 115 SGLYYIGFYNSMNFITVMSVDRYIAIVHPVKSNKINTVKYGYIVSLVIWIISIIETTPILFVYT 178 (335)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhccCccccccchhhhHHHHHHHHHHHHHHhccHhheee
Confidence 876555 22 11234456677778899999888888766543
No 8
>PF00001 7tm_1: 7 transmembrane receptor (rhodopsin family) Rhodopsin-like GPCR superfamily signature 5-hydroxytryptamine 7 receptor signature bradykinin receptor signature gastrin receptor signature melatonin receptor signature olfactory receptor signature; InterPro: IPR000276 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The rhodopsin-like GPCRs themselves represent a widespread protein family that includes hormone, neurotransmitter and light receptors, all of which transduce extracellular signals through interaction with guanine nucleotide-binding (G) proteins. Although their activating ligands vary widely in structure and character, the amino acid sequences of the receptors are very similar and are believed to adopt a common structural framework comprising 7 transmembrane (TM) helices [, , ].; GO: 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane; PDB: 2KI9_A 3QAK_A 2YDV_A 3VGA_A 3PWH_A 3RFM_A 3EML_A 3VG9_A 3REY_A 3UZA_A ....
Probab=99.36 E-value=2.4e-12 Score=95.98 Aligned_cols=100 Identities=26% Similarity=0.409 Sum_probs=81.0
Q ss_pred hhHhhhHhheeccccccchhhHhHHHHHHHHHHhccccchhHHHHhhccccccccccchhhhHHhh--------------
Q psy1136 49 LNIFAIYHIIVERLFRDATCGYFINIALADIIKCMFVLPITLMVTLVQNWTWGSFLCYFLPMLQVM-------------- 114 (167)
Q Consensus 49 gN~~vi~vi~~~~~~~~~~~~~l~nLaisDll~~l~~~p~~~~~~~~~~w~~g~~~C~~~~~~~~~-------------- 114 (167)
||+++++++.++|++|++.++++.|||++|++.++...|........++|.+|+..|++..++...
T Consensus 1 GN~lvi~~~~~~~~~~~~~~~~l~~Lav~Dll~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~s~~~~~~is~ 80 (257)
T PF00001_consen 1 GNILVILVILRSKRLRTPSNILLLNLAVADLLVGLFCIPFYIYSLLFDDWIFSSFLCRIFGFLFYFSSFSSIFSLVAISI 80 (257)
T ss_dssp HHHHHHHHHHHSGGG-SHHHHHHHHHHHHHHHHHHTHHHHHHHHHHHSSCTSHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred CchhehhhhhhhccCCChhHHHHHHHHHHHHhhccccccccccccccccccccccccccccccccccccccccccccccc
Confidence 899999999999999999999999999999999888888777766667899999999999877665
Q ss_pred hh----------hhcchhHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy1136 115 ER----------LFRDATCGYFINIALADIIKCMFVLPITLMVT 148 (167)
Q Consensus 115 ~~----------~~~~~~~~~~~~i~~~w~~~~~~~~P~~~~~~ 148 (167)
+| +...+.++....+...|+.+++++.|......
T Consensus 81 dRy~~i~~p~~~~~~~~~~~~~~~i~~~w~~~~~~~~~~~~~~~ 124 (257)
T PF00001_consen 81 DRYLAICHPLRYRRIRTRRRARIIIILIWIISFLISLPPLFFSW 124 (257)
T ss_dssp HHHHHHHSHHTHHHHSCHHHHHHHHHHHHHHHHHHHHHHHHTCE
T ss_pred cccccccccccccccccccccccccccccccccccccccccccc
Confidence 11 22234466778888899999888888876643
No 9
>PF10320 7TM_GPCR_Srsx: Serpentine type 7TM GPCR chemoreceptor Srsx; InterPro: IPR019424 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class sx (Srsx), which is a solo family amongst the superfamilies of chemoreceptors. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' [].
Probab=98.54 E-value=2.9e-08 Score=75.94 Aligned_cols=100 Identities=17% Similarity=0.273 Sum_probs=71.9
Q ss_pred HHHhhhhHhhhHhheeccccccchhhHhHHHHHHHHHHhccccchhHHHHhhccccccccccchhhhHHhhhhhhcchhH
Q psy1136 44 LSGVALNIFAIYHIIVERLFRDATCGYFINIALADIIKCMFVLPITLMVTLVQNWTWGSFLCYFLPMLQVMERLFRDATC 123 (167)
Q Consensus 44 ~~gl~gN~~vi~vi~~~~~~~~~~~~~l~nLaisDll~~l~~~p~~~~~~~~~~w~~g~~~C~~~~~~~~~~~~~~~~~~ 123 (167)
++|+.||..++.++.++|++|++.++++..+|++|++.....+|...... ++-......|-...+...+. .....
T Consensus 2 ~ig~~gN~~~i~~~~~~~~Lrs~~~~li~~~~~~d~~~~~~~~~~~~~~~--~~~~i~~~~Cf~~~~~~~f~---~~~qs 76 (257)
T PF10320_consen 2 IIGLFGNLLLIILIFRNKSLRSPCYILICILCFADLICLLGTLPFMLFLF--RDHQITRSECFWQIFFYIFF---QCAQS 76 (257)
T ss_pred EEEEEccHHHHHHHHhccccccchHHHHHHHHHHHHHHHhhHHHHHHHHH--hheeccHHHHHHHHHHHHHH---HHHHH
Confidence 46899999999999999999999999999999999997665666554322 22234455687776666554 34555
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy1136 124 GYFINIALADIIKCMFVLPITLMVTLV 150 (167)
Q Consensus 124 ~~~~~i~~~w~~~~~~~~P~~~~~~~~ 150 (167)
.....+++||.++ +..|..+....+
T Consensus 77 ~~~l~i~iDr~ia--V~~P~~Y~~~~~ 101 (257)
T PF10320_consen 77 VIMLAIAIDRLIA--VCFPLRYRTIST 101 (257)
T ss_pred HHHHHHHhhheee--Eeehhhhhhccc
Confidence 6667788888765 556665554433
No 10
>KOG2087|consensus
Probab=98.06 E-value=4e-06 Score=65.97 Aligned_cols=79 Identities=19% Similarity=0.223 Sum_probs=58.6
Q ss_pred HHHHHHHHHHHHhhhhHhhhHhheeccccccchhhHhHHHHHHHHHHhccccchhHHHHh-hc-------cccccccccc
Q psy1136 35 FIILYGVTILSGVALNIFAIYHIIVERLFRDATCGYFINIALADIIKCMFVLPITLMVTL-VQ-------NWTWGSFLCY 106 (167)
Q Consensus 35 ~~~~~~ii~~~gl~gN~~vi~vi~~~~~~~~~~~~~l~nLaisDll~~l~~~p~~~~~~~-~~-------~w~~g~~~C~ 106 (167)
.-+...++..++++||.+++......+.+.++..+++.|||.+|+++++...-+.....- .| .|.-|. .|+
T Consensus 27 lRi~vW~i~~lAi~gN~~Vl~~~~~~~~~~~~~~~li~~la~ad~~mGiYl~~ia~vD~~~~gey~~~ai~W~tg~-gC~ 105 (363)
T KOG2087|consen 27 LRISVWVIALLAIVGNLLVLLTRFTSRYELNSHRFLICNLAFADLLMGIYLGLIASVDAKTRGEYYKHAIDWQTGL-GCP 105 (363)
T ss_pred eeehhhhhhhHHhccCeeeeeeeeehhhhccchHHHHHHHHHHHHHcchHHHHHHHhhHHHHHHHHHHHHhhhhcC-CCc
Confidence 334456788899999999998888888888899999999999999988764444333221 12 254444 699
Q ss_pred hhhhHHhh
Q psy1136 107 FLPMLQVM 114 (167)
Q Consensus 107 ~~~~~~~~ 114 (167)
+.+|+..+
T Consensus 106 ~aGflavF 113 (363)
T KOG2087|consen 106 VAGFLAVF 113 (363)
T ss_pred hHHHHHHH
Confidence 99987777
No 11
>PF10328 7TM_GPCR_Srx: Serpentine type 7TM GPCR chemoreceptor Srx; InterPro: IPR019430 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class x (Srx) from the Srg superfamily [, ]. Srg receptors contain seven hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures.
Probab=97.20 E-value=0.00085 Score=51.65 Aligned_cols=43 Identities=23% Similarity=0.346 Sum_probs=39.4
Q ss_pred HHHHHhhhhHhhhHhheeccccccchhhHhHHHHHHHHHHhcc
Q psy1136 42 TILSGVALNIFAIYHIIVERLFRDATCGYFINIALADIIKCMF 84 (167)
Q Consensus 42 i~~~gl~gN~~vi~vi~~~~~~~~~~~~~l~nLaisDll~~l~ 84 (167)
+.++|++.|.++++.+.+.|++|++.+.+-.|.|++|.+.++.
T Consensus 3 ~s~~G~~~N~~v~~~~~~~~~~~~sF~~l~~~~a~~n~i~~~~ 45 (274)
T PF10328_consen 3 ISIIGIILNWLVFIIIFKLKSLRNSFGILCASQAIANIIICLI 45 (274)
T ss_pred eeHHHHHHHHHHHHHHHhcccccCCHHHHHHHHHHHHHHHHHH
Confidence 4678999999999999999999999999999999999998764
No 12
>PF10324 7TM_GPCR_Srw: Serpentine type 7TM GPCR chemoreceptor Srw; InterPro: IPR019427 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class w (Srw), which is a solo family amongst the superfamilies of chemoreceptors. The genes encoding Srw do not appear to be under as strong an adaptive evolutionary pressure as those of Srz [].
Probab=97.18 E-value=0.0014 Score=51.42 Aligned_cols=49 Identities=22% Similarity=0.384 Sum_probs=39.2
Q ss_pred HHHHHhhhhHhhhHhheecccccc-chhhHhHHHHHHHHHHhccccchhHH
Q psy1136 42 TILSGVALNIFAIYHIIVERLFRD-ATCGYFINIALADIIKCMFVLPITLM 91 (167)
Q Consensus 42 i~~~gl~gN~~vi~vi~~~~~~~~-~~~~~l~nLaisDll~~l~~~p~~~~ 91 (167)
+.++|+++|..-+.+ ..+|+||+ +.|.++..+|++|++..+..++....
T Consensus 6 ~~~~g~~~N~~h~~V-Ltrk~mR~~~in~~l~~Iai~Dl~~~~~~~~~~~~ 55 (318)
T PF10324_consen 6 LSIFGLFINIFHLIV-LTRKSMRSSSINILLIGIAICDLLYMLSILIWELF 55 (318)
T ss_pred EeHHHHHHHHHHhhh-cCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 457899999887764 44688886 89999999999999988776666554
No 13
>PF11710 Git3: G protein-coupled glucose receptor regulating Gpa2; InterPro: IPR023041 This entry contains a functionally uncharacterised region belonging to the Git3 G-protein coupled receptor. Git3 is one of six proteins required for glucose-triggered adenylate cyclase activation, and is a G protein-coupled receptor responsible for the activation of adenylate cyclase through Gpa2 - heterotrimeric G protein alpha subunit, part of the glucose-detection pathway. Git3 contains seven predicted transmembrane domains, a third cytoplasmic loop and a cytoplasmic tail []. This is the conserved N-terminal domain of the member proteins.
Probab=97.06 E-value=0.0026 Score=46.96 Aligned_cols=54 Identities=17% Similarity=0.085 Sum_probs=38.7
Q ss_pred cccccchhhHhHHHHHHHHHHhccccchhHHHHhhccccccccccchhhhHHhh
Q psy1136 61 RLFRDATCGYFINIALADIIKCMFVLPITLMVTLVQNWTWGSFLCYFLPMLQVM 114 (167)
Q Consensus 61 ~~~~~~~~~~l~nLaisDll~~l~~~p~~~~~~~~~~w~~g~~~C~~~~~~~~~ 114 (167)
+|+|+..+-++.||.++|++.++..+.........+.-.-++..|...+++...
T Consensus 30 ~r~~~fR~~LIl~L~~aD~~qal~~~i~~~~~l~~~~i~~~s~~C~aqGf~~q~ 83 (201)
T PF11710_consen 30 YRRRSFRHQLILNLLLADFIQALAFLISPIRWLARGGIIAPSPFCQAQGFFLQV 83 (201)
T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeCCCCchhhhHHHHHH
Confidence 566777888999999999999986444333333333344566789999998776
No 14
>PF05296 TAS2R: Mammalian taste receptor protein (TAS2R); InterPro: IPR007960 This family consists of several forms of mammalian taste receptor proteins (TAS2Rs). TAS2Rs are G protein-coupled receptors expressed in subsets of taste receptor cells of the tongue and palate epithelia and are organised in the genome in clusters. The proteins are genetically linked to loci that influence bitter perception in mice and humans [].; GO: 0004930 G-protein coupled receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0050909 sensory perception of taste, 0016021 integral to membrane
Probab=96.98 E-value=0.012 Score=46.11 Aligned_cols=53 Identities=17% Similarity=0.212 Sum_probs=41.7
Q ss_pred HHHHHHHHHHHHHHHhhhhHhhhHhhe---eccccccchhhHhHHHHHHHHHHhcc
Q psy1136 32 YVFFIILYGVTILSGVALNIFAIYHII---VERLFRDATCGYFINIALADIIKCMF 84 (167)
Q Consensus 32 ~~~~~~~~~ii~~~gl~gN~~vi~vi~---~~~~~~~~~~~~l~nLaisDll~~l~ 84 (167)
+.+..++..+.+++|++||+.++.+-. +++|.-+|.+..+.+||++.++.-..
T Consensus 6 ~~i~~~i~~~~~~~Gi~~N~FI~~vn~~~w~k~~~l~~~d~IL~~La~sr~~l~~~ 61 (303)
T PF05296_consen 6 EIIFLIILVVEFIIGILGNGFIVLVNCSDWVKSRKLSPSDQILTSLAISRILLQWV 61 (303)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHcCCCCChHHHHHHHHHHHHHHHHHH
Confidence 345677888889999999998776553 34555678999999999999986554
No 15
>PF05462 Dicty_CAR: Slime mold cyclic AMP receptor
Probab=96.86 E-value=0.025 Score=44.46 Aligned_cols=77 Identities=17% Similarity=0.227 Sum_probs=57.5
Q ss_pred HHHHHHHHHHHHHhhhhHhhhHhheeccccccchhhHhHHHHHHHHHHhccccchhHHHHhhccccccccccchhhhHHh
Q psy1136 34 FFIILYGVTILSGVALNIFAIYHIIVERLFRDATCGYFINIALADIIKCMFVLPITLMVTLVQNWTWGSFLCYFLPMLQV 113 (167)
Q Consensus 34 ~~~~~~~ii~~~gl~gN~~vi~vi~~~~~~~~~~~~~l~nLaisDll~~l~~~p~~~~~~~~~~w~~g~~~C~~~~~~~~ 113 (167)
...++..+...++++|-+.++..+.+.|+.|++.+-++.-++++|++..+..+... ..++-.-++..|++.+++.+
T Consensus 8 ~~~~i~~~~s~lSllGclfiI~tf~~~k~~r~~~~rli~yl~~~~ll~~v~~~~~~----~~~~~~~~s~lC~~Qafliq 83 (303)
T PF05462_consen 8 TLYAIELVASVLSLLGCLFIIITFCLFKRLRKPINRLIFYLSIANLLTNVASMIMT----LSPSAGENSFLCQFQAFLIQ 83 (303)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCccHHHHHHHHHHHHHHHHHHHHHHH----hcccCCCCCcchhhHhHHHH
Confidence 33445556688899999999999999999999999999999999998664332221 11233345678999998777
Q ss_pred h
Q psy1136 114 M 114 (167)
Q Consensus 114 ~ 114 (167)
+
T Consensus 84 ~ 84 (303)
T PF05462_consen 84 F 84 (303)
T ss_pred H
Confidence 6
No 16
>KOG4219|consensus
Probab=96.14 E-value=0.011 Score=47.65 Aligned_cols=53 Identities=30% Similarity=0.620 Sum_probs=48.2
Q ss_pred hhhhcchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccceecccCC
Q psy1136 115 ERLFRDATCGYFINIALADIIKCMFVLPITLMVTLVQNWTWGSFLCYFLPMLQ 167 (167)
Q Consensus 115 ~~~~~~~~~~~~~~i~~~w~~~~~~~~P~~~~~~~~~~w~~g~~~C~~~~~~q 167 (167)
.|++|..++..+..++++.....+++.|+-....+.+.|.+|.++|++.++++
T Consensus 63 hrrMRtvtnyfL~NLAfADl~~s~Fn~~f~f~yal~~~W~~G~f~C~f~nf~~ 115 (423)
T KOG4219|consen 63 HRRMRTVTNYFLVNLAFADLSMSIFNTVFNFQYALHQEWYFGSFYCRFVNFFP 115 (423)
T ss_pred hhehhhhHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhccccccceeeeccccc
Confidence 45778888999999999999999999999988889999999999999998864
No 17
>PF10321 7TM_GPCR_Srt: Serpentine type 7TM GPCR chemoreceptor Srt; InterPro: IPR019425 Chemoreception is mediated in Caenorhabditis elegans by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs) of proteins which are of the serpentine type []. Srt is a member of the Srg superfamily of chemoreceptors. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' [].
Probab=94.92 E-value=0.26 Score=38.91 Aligned_cols=55 Identities=18% Similarity=0.197 Sum_probs=46.8
Q ss_pred hHHHHHHHHHHHHHHHHhhhhHhhhHhheeccccccchhhHhHHHHHHHHHHhcc
Q psy1136 30 NVYVFFIILYGVTILSGVALNIFAIYHIIVERLFRDATCGYFINIALADIIKCMF 84 (167)
Q Consensus 30 ~~~~~~~~~~~ii~~~gl~gN~~vi~vi~~~~~~~~~~~~~l~nLaisDll~~l~ 84 (167)
..+.+.++.+.+.+++-.+.++..++++.++++.|.+.+-.+.=||+.|+.....
T Consensus 30 ~~~p~~G~~~~~~g~~~~~lY~p~~~~i~~~~~~k~~~ykiM~~L~i~Di~~l~~ 84 (313)
T PF10321_consen 30 VKRPILGIYFLIFGIIIIILYIPCLIAIFKKKLFKMSCYKIMFFLAIFDIIQLFI 84 (313)
T ss_pred CcccchhHHHHHHHHHHHHHHHHHHHHHHHhccccCcHHHHHHHHHHHHHHHHHh
Confidence 4456778888888888899999999999887777899999999999999996543
No 18
>PF03402 V1R: Vomeronasal organ pheromone receptor family, V1R; InterPro: IPR004072 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The rhodopsin-like GPCRs themselves represent a widespread protein family that includes hormone, neurotransmitter and light receptors, all of which transduce extracellular signals through interaction with guanine nucleotide-binding (G) proteins. Although their activating ligands vary widely in structure and character, the amino acid sequences of the receptors are very similar and are believed to adopt a common structural framework comprising 7 transmembrane (TM) helices [, , ]. Pheromones have evolved in all animal phyla, to signal sex and dominance status, and are responsible for stereotypical social and sexual behaviour among members of the same species. In mammals, these chemical signals are believed to be detected primarily by the vomeronasal organ (VNO), a chemosensory organ located at the base of the nasal septum []. The VNO is present in most amphibia, reptiles and non-primate mammals but is absent in birds, adult catarrhine monkeys and apes []. An active role for the human VNO in the detection of pheromones is disputed; the VNO is clearly present in the foetus but appears to be atrophied or absent in adults. Three distinct families of putative pheromone receptors have been identified in the vomeronasal organ (V1Rs, V2Rs and V3Rs). All are G protein-coupled receptors but are only distantly related to the receptors of the main olfactory system, highlighting their different role []. The V1 receptors share between 50 and 90% sequence identity but have little similarity to other families of G protein-coupled receptors. They appear to be distantly related to the mammalian T2R bitter taste receptors and the rhodopsin-like GPCRs []. In rat, the family comprises 30-40 genes. These are expressed in the apical regions of the VNO, in neurons expressing Gi2. Coupling of the receptors to this protein mediates inositol trisphosphate signalling []. A number of human V1 receptor homologues have also been found. The majority of these human sequences are pseudogenes [] but an apparently functional receptor has been identified that is expressed in the human olfactory system [].; GO: 0016503 pheromone receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane
Probab=92.56 E-value=0.21 Score=38.52 Aligned_cols=53 Identities=17% Similarity=0.132 Sum_probs=38.5
Q ss_pred ccccchhhHhHHHHHHHHHHhcc-ccchhHHHHhhccccccccccchhhhHHhhhh
Q psy1136 62 LFRDATCGYFINIALADIIKCMF-VLPITLMVTLVQNWTWGSFLCYFLPMLQVMER 116 (167)
Q Consensus 62 ~~~~~~~~~l~nLaisDll~~l~-~~p~~~~~~~~~~w~~g~~~C~~~~~~~~~~~ 116 (167)
+..+|++..+.+||+++.+..+. ++|-... ... .-.+++..||+..+.+-..|
T Consensus 6 ~r~kp~dlIl~hLa~aN~lvLl~rGip~~~~-~~~-~~~~~d~gCK~v~Y~~RV~R 59 (265)
T PF03402_consen 6 HRLKPIDLILIHLALANILVLLSRGIPQTMA-FFG-WKFFDDIGCKIVFYIYRVAR 59 (265)
T ss_pred CCCCcHHHHHHHHHHHHHHHHHHhhHHHHHH-Hhh-cccCCCceeeeeeeehHHhc
Confidence 34567999999999999997765 6664333 222 12379999999988887755
No 19
>PF10317 7TM_GPCR_Srd: Serpentine type 7TM GPCR chemoreceptor Srd; InterPro: IPR019421 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents the chemoreceptor Srd [].
Probab=91.57 E-value=0.92 Score=35.23 Aligned_cols=47 Identities=23% Similarity=0.323 Sum_probs=37.4
Q ss_pred HHHHHHHHHhhhhHhhhHhheec-cccccchhhHhHHHHHHHHHHhcc
Q psy1136 38 LYGVTILSGVALNIFAIYHIIVE-RLFRDATCGYFINIALADIIKCMF 84 (167)
Q Consensus 38 ~~~ii~~~gl~gN~~vi~vi~~~-~~~~~~~~~~l~nLaisDll~~l~ 84 (167)
++.+.+.+|++.|.++++.+.++ ++.-+.-.+++.|-|+.|++.+..
T Consensus 4 ~~~~~~~~~~~~n~~Ll~~i~~~tp~~l~~~~~~l~~~~~~~~~~~~~ 51 (292)
T PF10317_consen 4 YHPIFFILGIILNILLLYLIIFKTPKSLRTYSILLLNTAIFDLISIIS 51 (292)
T ss_pred eHHHHHHHHHHHHHHHHHHHHHhChHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45678899999999999877654 444456889999999999997653
No 20
>PF00002 7tm_2: 7 transmembrane receptor (Secretin family); InterPro: IPR000832 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The secretin-like GPCRs include secretin [], calcitonin [], parathyroid hormone/parathyroid hormone-related peptides [] and vasoactive intestinal peptide [], all of which activate adenylyl cyclase and the phosphatidyl-inositol-calcium pathway. These receptors contain seven transmembrane regions, in a manner reminiscent of the rhodopsins and other receptors believed to interact with G-proteins (however there is no significant sequence identity between these families, the secretin-like receptors thus bear their own unique '7TM' signature). Their N terminus is probably located on the extracellular side of the membrane and potentially glycosylated. This N-terminal region contains a long conserved region which allow the binding of large peptidic ligand such as glucagon, secretin, VIP and PACAP; this region contains five conserved cysteines residues which could be involved in disulphide bond. The C-terminal region of these receptor is probably cytoplasmic. Every receptor gene in this family is encoded on multiple exons, and several of these genes are alternatively spliced to yield functionally distinct products. ; GO: 0004930 G-protein coupled receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane; PDB: 3L2J_A 1BL1_A.
Probab=87.02 E-value=0.39 Score=35.94 Aligned_cols=71 Identities=14% Similarity=0.132 Sum_probs=1.6
Q ss_pred HHHHHHhhhhHhhhHhheeccccccchhhHhHHHHHHHHHHhccccchhHHHHhhccccc--cccccchhhhHHhh
Q psy1136 41 VTILSGVALNIFAIYHIIVERLFRDATCGYFINIALADIIKCMFVLPITLMVTLVQNWTW--GSFLCYFLPMLQVM 114 (167)
Q Consensus 41 ii~~~gl~gN~~vi~vi~~~~~~~~~~~~~l~nLaisDll~~l~~~p~~~~~~~~~~w~~--g~~~C~~~~~~~~~ 114 (167)
+-..+++++-.+.+......|++|+..+....||++++++..+..+. ... ..++... .+..|...+.+.+.
T Consensus 9 vg~~~Si~~ll~~i~~~~~~r~lr~~~~~i~~~l~~sll~~~~~~l~-~~~--~~~~~~~~~~~~~C~~~a~~~hy 81 (242)
T PF00002_consen 9 VGCSLSIICLLLTIITYLLFRKLRSFRNKIHLNLCLSLLLANLSFLI-GIS--QTFSPISTTNHCLCRAIAILLHY 81 (242)
T ss_dssp HHHH------------------------------------------------------------------------
T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcccchhhhhhhHHHHHHHHHHHhe-ehh--hccccccccccccchhhhhHhHH
Confidence 33444555544445445556788888888999999999986553221 111 1111111 22349988876665
No 21
>KOG4564|consensus
Probab=83.71 E-value=13 Score=31.21 Aligned_cols=78 Identities=17% Similarity=0.140 Sum_probs=49.6
Q ss_pred HHHHHHHHHHhhhhHhhhHhheeccccccchhhHhHHHHHHHHHHhccccchhHHHHhhcc--------c----cccccc
Q psy1136 37 ILYGVTILSGVALNIFAIYHIIVERLFRDATCGYFINIALADIIKCMFVLPITLMVTLVQN--------W----TWGSFL 104 (167)
Q Consensus 37 ~~~~ii~~~gl~gN~~vi~vi~~~~~~~~~~~~~l~nLaisDll~~l~~~p~~~~~~~~~~--------w----~~g~~~ 104 (167)
++|.+--.++++.=++.+.++...|++|=..|+.-.||-+|=++.++..+.........+. + .-+...
T Consensus 149 ~lytvGyslSl~sL~vAl~If~~FR~L~CtRn~IH~nLF~SfiLra~~~~i~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 228 (473)
T KOG4564|consen 149 ILYTVGYSLSLVSLLVALIIFLYFRSLHCTRNYIHMNLFASFILRAASVLIKDLVLVVNGEQDASSDTSLHCLISSNPVG 228 (473)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccccchh
Confidence 4555555555555445555666778999889999999999999988765544433221111 1 145568
Q ss_pred cchhhhHHhh
Q psy1136 105 CYFLPMLQVM 114 (167)
Q Consensus 105 C~~~~~~~~~ 114 (167)
||....+...
T Consensus 229 Ck~~~~~~~Y 238 (473)
T KOG4564|consen 229 CKLLFVFFQY 238 (473)
T ss_pred HHHHHHHHHH
Confidence 9887654444
No 22
>PHA03234 DNA packaging protein UL33; Provisional
Probab=79.54 E-value=1.7 Score=34.60 Aligned_cols=46 Identities=15% Similarity=0.315 Sum_probs=33.7
Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccceecccC
Q psy1136 119 RDATCGYFINIALADIIKCMFVLPITLMVTLVQNWTWGSFLCYFLPML 166 (167)
Q Consensus 119 ~~~~~~~~~~i~~~w~~~~~~~~P~~~~~~~~~~w~~g~~~C~~~~~~ 166 (167)
+..+...+..+++.+++. ++..|+.... ...+|.+|+..|+...++
T Consensus 66 rt~tn~fi~NLAvaDLL~-~l~lp~~~~~-~~~~w~fG~~lCk~~~~~ 111 (338)
T PHA03234 66 HNATSFYLITLFASDFLH-MLCVFFLTLN-REALFNFNQAFCQCVLFI 111 (338)
T ss_pred ccHHHHHHHHHHHHHHHH-HHHHHHHHHH-HhCCccCchhHHHHHHHH
Confidence 456678889999999776 4566765543 345799999999987543
No 23
>KOG4220|consensus
Probab=76.16 E-value=0.71 Score=37.84 Aligned_cols=49 Identities=22% Similarity=0.477 Sum_probs=42.4
Q ss_pred hhhhcchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccceec
Q psy1136 115 ERLFRDATCGYFINIALADIIKCMFVLPITLMVTLVQNWTWGSFLCYFL 163 (167)
Q Consensus 115 ~~~~~~~~~~~~~~i~~~w~~~~~~~~P~~~~~~~~~~w~~g~~~C~~~ 163 (167)
+|++....+..+..++.+.+.-.++++|+.....+.++|.+|...|+++
T Consensus 58 nrqLqTVnNYfLfSLAcADliIG~~SMnl~t~Y~lmg~W~LG~~~CdlW 106 (503)
T KOG4220|consen 58 NRQLQTVNNYFLFSLACADLIIGAFSMNLYTTYTLMGYWPLGPLVCDLW 106 (503)
T ss_pred cceeeeecceeehHHHHhhhhhheeechHHHHHHHHcccccchHHHHHH
Confidence 4556667777888889899888899999999999999999999999875
No 24
>PF10316 7TM_GPCR_Srbc: Serpentine type 7TM GPCR chemoreceptor Srbc ; InterPro: IPR019420 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class b (Srb) from the Sra superfamily []. Srb receptors contain 6-8 hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures. Srbc is a solo family amongst the superfamilies of chemoreceptors.
Probab=74.84 E-value=18 Score=28.08 Aligned_cols=105 Identities=13% Similarity=0.164 Sum_probs=56.3
Q ss_pred HHHHHHHHHhhhhHhhhHhheeccccccchhhHhHHHHHHHHHHhccccchhHHHHhhccccccccccchhhhHHhh--h
Q psy1136 38 LYGVTILSGVALNIFAIYHIIVERLFRDATCGYFINIALADIIKCMFVLPITLMVTLVQNWTWGSFLCYFLPMLQVM--E 115 (167)
Q Consensus 38 ~~~ii~~~gl~gN~~vi~vi~~~~~~~~~~~~~l~nLaisDll~~l~~~p~~~~~~~~~~w~~g~~~C~~~~~~~~~--~ 115 (167)
+-.+........|..+++.+..+|+.|++--.++---...|.+.+....+...+.... .| +.+.. ....+.... +
T Consensus 11 i~i~~s~~~~~iN~~lL~~if~~Kk~kk~~l~LfY~Rf~~D~~~~~~~~~~~~~~~~~-~~-~~~~~-~~~~~if~~~~p 87 (273)
T PF10316_consen 11 IGIIFSIITCLINFYLLYSIFYSKKKKKPDLSLFYFRFAIDVFYGFSVFIYLIYYILF-SL-FPEFL-DYKNLIFYIGWP 87 (273)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhccccCCCCEEeeHHHHHHHHHHHHHHHHHHHHHHHH-Hh-Chhhh-ccccEEeehhhh
Confidence 3344567788899988887776666445444555556889999888765554443322 22 22222 111111111 1
Q ss_pred h-hhcchhHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy1136 116 R-LFRDATCGYFINIALADIIKCMFVLPITLMV 147 (167)
Q Consensus 116 ~-~~~~~~~~~~~~i~~~w~~~~~~~~P~~~~~ 147 (167)
. .....+......++++|.+| .-.|..+..
T Consensus 88 ~~~~~~iR~~l~~~Ia~dR~~A--~~fPI~y~~ 118 (273)
T PF10316_consen 88 SSNLGSIRSILALIIALDRVFA--VYFPIFYHN 118 (273)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH--HcCCHHHHc
Confidence 0 00122333446778888776 445554443
No 25
>PF00001 7tm_1: 7 transmembrane receptor (rhodopsin family) Rhodopsin-like GPCR superfamily signature 5-hydroxytryptamine 7 receptor signature bradykinin receptor signature gastrin receptor signature melatonin receptor signature olfactory receptor signature; InterPro: IPR000276 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The rhodopsin-like GPCRs themselves represent a widespread protein family that includes hormone, neurotransmitter and light receptors, all of which transduce extracellular signals through interaction with guanine nucleotide-binding (G) proteins. Although their activating ligands vary widely in structure and character, the amino acid sequences of the receptors are very similar and are believed to adopt a common structural framework comprising 7 transmembrane (TM) helices [, , ].; GO: 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane; PDB: 2KI9_A 3QAK_A 2YDV_A 3VGA_A 3PWH_A 3RFM_A 3EML_A 3VG9_A 3REY_A 3UZA_A ....
Probab=66.49 E-value=4.6 Score=29.32 Aligned_cols=49 Identities=27% Similarity=0.646 Sum_probs=37.0
Q ss_pred hhcchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccceeccc
Q psy1136 117 LFRDATCGYFINIALADIIKCMFVLPITLMVTLVQNWTWGSFLCYFLPM 165 (167)
Q Consensus 117 ~~~~~~~~~~~~i~~~w~~~~~~~~P~~~~~~~~~~w~~g~~~C~~~~~ 165 (167)
+.+......+..+++..++..+...|........++|..++..|+...+
T Consensus 14 ~~~~~~~~~l~~Lav~Dll~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~ 62 (257)
T PF00001_consen 14 RLRTPSNILLLNLAVADLLVGLFCIPFYIYSLLFDDWIFSSFLCRIFGF 62 (257)
T ss_dssp GG-SHHHHHHHHHHHHHHHHHHTHHHHHHHHHHHSSCTSHHHHHHHHHH
T ss_pred cCCChhHHHHHHHHHHHHhhccccccccccccccccccccccccccccc
Confidence 3344556677888999988877778877777777899999999987643
No 26
>PF10323 7TM_GPCR_Srv: Serpentine type 7TM GPCR chemoreceptor Srv; InterPro: IPR019426 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class v (Srv) from the Srg superfamily [, ]. Srg receptors contain seven hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures.
Probab=62.25 E-value=3.6 Score=31.88 Aligned_cols=40 Identities=28% Similarity=0.530 Sum_probs=27.6
Q ss_pred HHhhhhHhhhHhheeccc----cccchhhHhHHHHHHHHHHhcc
Q psy1136 45 SGVALNIFAIYHIIVERL----FRDATCGYFINIALADIIKCMF 84 (167)
Q Consensus 45 ~gl~gN~~vi~vi~~~~~----~~~~~~~~l~nLaisDll~~l~ 84 (167)
+.+.-...+++++.+.|+ .+++.+-++.+-+++|++..+.
T Consensus 7 ~tlply~~il~~l~~~r~~~~~~~~~Fy~l~~~~~iaDi~~~~~ 50 (283)
T PF10323_consen 7 VTLPLYIFILYCLLKLRKRSKTFKSTFYTLLIQHCIADILSMLF 50 (283)
T ss_pred ehHHHHHHHHHHHHHcccCccccCCHHHHHHHHHHHHHHHHHHH
Confidence 344445555555544332 5799999999999999996654
No 27
>PF01102 Glycophorin_A: Glycophorin A; InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=60.40 E-value=8.2 Score=26.18 Aligned_cols=23 Identities=22% Similarity=0.044 Sum_probs=7.3
Q ss_pred HhhhhHhhhHhheeccccccchh
Q psy1136 46 GVALNIFAIYHIIVERLFRDATC 68 (167)
Q Consensus 46 gl~gN~~vi~vi~~~~~~~~~~~ 68 (167)
|++|=+++++..+||+++|++.+
T Consensus 76 GvIg~Illi~y~irR~~Kk~~~~ 98 (122)
T PF01102_consen 76 GVIGIILLISYCIRRLRKKSSSD 98 (122)
T ss_dssp HHHHHHHHHHHHHHHHS------
T ss_pred HHHHHHHHHHHHHHHHhccCCCC
Confidence 44554444433334444444333
No 28
>PHA02638 CC chemokine receptor-like protein; Provisional
Probab=56.62 E-value=21 Score=29.30 Aligned_cols=46 Identities=24% Similarity=0.616 Sum_probs=34.2
Q ss_pred hhcchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccceecc
Q psy1136 117 LFRDATCGYFINIALADIIKCMFVLPITLMVTLVQNWTWGSFLCYFLP 164 (167)
Q Consensus 117 ~~~~~~~~~~~~i~~~w~~~~~~~~P~~~~~~~~~~w~~g~~~C~~~~ 164 (167)
+.+......+..+++..++. ++..|+.+... .++|.+|+..|+...
T Consensus 127 ~lrt~t~i~llnLAisDLl~-~l~lPf~i~~~-~~~W~fg~~~Ck~~~ 172 (417)
T PHA02638 127 KIKTITDIYIFNLAISDLIF-VIDFPFIIYNE-FDQWIFGDFMCKVIS 172 (417)
T ss_pred cCCCHhHHHHHHHHHHHHHH-HHHHHHHHHHH-hccccccccchhhHH
Confidence 34455667788889888664 67889877654 468999999998753
No 29
>PF10327 7TM_GPCR_Sri: Serpentine type 7TM GPCR chemoreceptor Sri; InterPro: IPR019429 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents Sri, which is part of the Str superfamily of chemoreceptors.
Probab=55.81 E-value=22 Score=27.88 Aligned_cols=62 Identities=24% Similarity=0.338 Sum_probs=43.9
Q ss_pred HHHHHHHHHHHHHhhhhHhhhHhhee-ccccccchhhHh---HHHHHHHHHHhccccchhHHHHhh
Q psy1136 34 FFIILYGVTILSGVALNIFAIYHIIV-ERLFRDATCGYF---INIALADIIKCMFVLPITLMVTLV 95 (167)
Q Consensus 34 ~~~~~~~ii~~~gl~gN~~vi~vi~~-~~~~~~~~~~~l---~nLaisDll~~l~~~p~~~~~~~~ 95 (167)
+....|-+++.+++.-|.+.++.+.. .+|+.+..++++ +...++|+-.+...-|........
T Consensus 10 ~li~~~~~ig~iS~~~n~~~iyLi~fks~k~~~fry~ll~~Qi~~~l~di~~t~L~qpipLfP~~a 75 (303)
T PF10327_consen 10 WLINYYHIIGVISFILNSLGIYLIIFKSPKLDNFRYYLLYFQISCTLTDIHLTFLMQPIPLFPIPA 75 (303)
T ss_pred HHHHHHHHHHHHHHHHHHHHheeEEEecCCccchhhHHHHHHHHHHHhhhhhhhhccchhhcceeE
Confidence 66778889999999999999986654 555666555444 336788998877666766654433
No 30
>PHA02834 chemokine receptor-like protein; Provisional
Probab=54.50 E-value=11 Score=29.64 Aligned_cols=43 Identities=26% Similarity=0.681 Sum_probs=31.5
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccceeccc
Q psy1136 121 ATCGYFINIALADIIKCMFVLPITLMVTLVQNWTWGSFLCYFLPM 165 (167)
Q Consensus 121 ~~~~~~~~i~~~w~~~~~~~~P~~~~~~~~~~w~~g~~~C~~~~~ 165 (167)
.....+..+++.++.. .+..|+..... .++|.+|+..|+...+
T Consensus 61 ~~n~~i~nLAiaDll~-~~~lP~~i~~~-~~~w~~g~~~C~~~~~ 103 (323)
T PHA02834 61 VVDVYLFNIAMSDLML-VFSFPFIIHND-LNEWIFGEFMCKLVLG 103 (323)
T ss_pred hhhhhhHHHHHHHHHH-HHHHHHHHHHH-cCCcCCcchHHHhHHH
Confidence 3456678888888764 67889865544 4579999999987643
No 31
>PF09882 DUF2109: Predicted membrane protein (DUF2109); InterPro: IPR019214 This entry is found in various hypothetical archaeal proteins and has no known function.
Probab=46.35 E-value=44 Score=20.65 Aligned_cols=48 Identities=21% Similarity=0.264 Sum_probs=33.1
Q ss_pred HHHhhhhHhhhHhheeccccccchhhHhHHHHHHHHHHhccccchhHH
Q psy1136 44 LSGVALNIFAIYHIIVERLFRDATCGYFINIALADIIKCMFVLPITLM 91 (167)
Q Consensus 44 ~~gl~gN~~vi~vi~~~~~~~~~~~~~l~nLaisDll~~l~~~p~~~~ 91 (167)
++|+++=...+=++..+.+.++..+.-.+|-+++.++....--|+...
T Consensus 5 i~g~Iai~~~iR~~~~~~r~~KL~yLnv~~F~iaalIaL~i~~P~g~i 52 (78)
T PF09882_consen 5 IIGIIAILMAIRIFLTKSRARKLLYLNVINFAIAALIALYIKSPMGAI 52 (78)
T ss_pred HHHHHHHHHHHHHHHhHhHHHhhhHHHHHHHHHHHHHHHHhCCcHHHH
Confidence 345555444454555677888889999999999999865555565443
No 32
>PF02118 Srg: Srg family chemoreceptor; InterPro: IPR000609 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class g (Srg) from the Srg superfamily [, ]. Srg receptors contain seven hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures. ; GO: 0004888 transmembrane signaling receptor activity, 0007606 sensory perception of chemical stimulus, 0016020 membrane
Probab=44.70 E-value=1.2e+02 Score=22.86 Aligned_cols=42 Identities=17% Similarity=0.325 Sum_probs=28.6
Q ss_pred HHHHhhhhHhhhHhheeccc--c-ccchhhHhHHHHHHHHHHhcc
Q psy1136 43 ILSGVALNIFAIYHIIVERL--F-RDATCGYFINIALADIIKCMF 84 (167)
Q Consensus 43 ~~~gl~gN~~vi~vi~~~~~--~-~~~~~~~l~nLaisDll~~l~ 84 (167)
++.+++.+..+++++.++|| . +++.+.+..-=++.+++.-+.
T Consensus 8 gi~s~~l~~~~~~~i~~~~~~~~~~~sFy~l~~~d~~~ni~~~ln 52 (275)
T PF02118_consen 8 GIPSLILYIFILYVIWKSKKKSYFKSSFYRLYIMDLIMNILTYLN 52 (275)
T ss_pred HHHHHHHHHHHHHHHHHhccccccCCccHHHHHHHhHHHHHHHHH
Confidence 45677777778887777666 3 677777776667777764443
No 33
>TIGR01477 RIFIN variant surface antigen, rifin family. This model represents the rifin branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of rifin sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 20 bits.
Probab=42.56 E-value=23 Score=28.49 Aligned_cols=30 Identities=20% Similarity=0.204 Sum_probs=21.7
Q ss_pred HHHHHHHHHHHHhhhhHhhhHhheeccccc
Q psy1136 35 FIILYGVTILSGVALNIFAIYHIIVERLFR 64 (167)
Q Consensus 35 ~~~~~~ii~~~gl~gN~~vi~vi~~~~~~~ 64 (167)
..++.+++.++-++.-.+++|.+.|.||.+
T Consensus 310 t~IiaSiIAIvvIVLIMvIIYLILRYRRKK 339 (353)
T TIGR01477 310 TPIIASIIAILIIVLIMVIIYLILRYRRKK 339 (353)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhcc
Confidence 456777777777777778888887765543
No 34
>PF10292 7TM_GPCR_Srab: Serpentine type 7TM GPCR receptor class ab chemoreceptor; InterPro: IPR019408 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. Srab is part of the Sra superfamily of chemoreceptors. The expression pattern of the srab genes is biologically intriguing. Of the six promoters successfully expressed in transgenic organisms, one was exclusively expressed in the tail phasmid neurons, two were exclusively expressed in a head amphid neuron, and two were expressed both in the head and tail neurons as well as a limited number of other cells [].
Probab=39.78 E-value=1.9e+02 Score=22.80 Aligned_cols=47 Identities=23% Similarity=0.340 Sum_probs=35.5
Q ss_pred HHHHHHHHHHhhhhHhhhHhheeccccccchhhHhHHHHHHHHHHhc
Q psy1136 37 ILYGVTILSGVALNIFAIYHIIVERLFRDATCGYFINIALADIIKCM 83 (167)
Q Consensus 37 ~~~~ii~~~gl~gN~~vi~vi~~~~~~~~~~~~~l~nLaisDll~~l 83 (167)
.+-.++.++|++.++..++.+.+++..|.-..+++....++.++-++
T Consensus 21 ~~~~~~s~~~~~~~~~~~~~~~~~~~~H~N~ril~~~~~~~~l~~~~ 67 (324)
T PF10292_consen 21 IFNLLLSIIAFPVIIYALWKIRNSKLFHFNTRILFIVHCFSFLIHCT 67 (324)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcchhchhHHHHHHHHHHHHHHHHH
Confidence 34556677777777777776777777788889999998888887654
No 35
>PF02468 PsbN: Photosystem II reaction centre N protein (psbN); InterPro: IPR003398 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection []. This family represents the low molecular weight transmembrane protein PsbN found in PSII. PsbN may have a role in PSII stability, however its actual function unknown. PsbN does not appear to be essential for photoautotrophic growth or normal PSII function.; GO: 0015979 photosynthesis, 0009523 photosystem II, 0009539 photosystem II reaction center, 0016020 membrane
Probab=39.10 E-value=38 Score=18.46 Aligned_cols=32 Identities=9% Similarity=0.042 Sum_probs=19.6
Q ss_pred HHHHHHHHHHHhhhhHhhhHhheeccccccch
Q psy1136 36 IILYGVTILSGVALNIFAIYHIIVERLFRDAT 67 (167)
Q Consensus 36 ~~~~~ii~~~gl~gN~~vi~vi~~~~~~~~~~ 67 (167)
..++....++|+.|-.+....-.-.|++|.|.
T Consensus 7 ~~i~i~~~lv~~Tgy~iYtaFGppSk~LrDPf 38 (43)
T PF02468_consen 7 LAIFISCLLVSITGYAIYTAFGPPSKELRDPF 38 (43)
T ss_pred HHHHHHHHHHHHHhhhhhheeCCCccccCCcc
Confidence 34555667778888655443334577778764
No 36
>PTZ00046 rifin; Provisional
Probab=38.44 E-value=30 Score=27.99 Aligned_cols=30 Identities=20% Similarity=0.256 Sum_probs=21.1
Q ss_pred HHHHHHHHHHHHhhhhHhhhHhheeccccc
Q psy1136 35 FIILYGVTILSGVALNIFAIYHIIVERLFR 64 (167)
Q Consensus 35 ~~~~~~ii~~~gl~gN~~vi~vi~~~~~~~ 64 (167)
..++++++.++-++.-.+++|.+.|.||.+
T Consensus 315 taIiaSiiAIvVIVLIMvIIYLILRYRRKK 344 (358)
T PTZ00046 315 TAIIASIVAIVVIVLIMVIIYLILRYRRKK 344 (358)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhcc
Confidence 356777777777777778888887765543
No 37
>PRK13183 psbN photosystem II reaction center protein N; Provisional
Probab=36.48 E-value=45 Score=18.44 Aligned_cols=32 Identities=9% Similarity=-0.110 Sum_probs=20.8
Q ss_pred HHHHHHHHHHHhhhhHhhhHhheeccccccch
Q psy1136 36 IILYGVTILSGVALNIFAIYHIIVERLFRDAT 67 (167)
Q Consensus 36 ~~~~~ii~~~gl~gN~~vi~vi~~~~~~~~~~ 67 (167)
..+++...++|+.|-.+....=.-.|++|.|.
T Consensus 10 ~~i~i~~lL~~~TgyaiYtaFGppSk~LrDPF 41 (46)
T PRK13183 10 LAITILAILLALTGFGIYTAFGPPSKELDDPF 41 (46)
T ss_pred HHHHHHHHHHHHhhheeeeccCCcccccCCch
Confidence 34555667788888766554445567777764
No 38
>PF02009 Rifin_STEVOR: Rifin/stevor family; InterPro: IPR002858 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see []. Several multicopy gene families have been described in Plasmodium falciparum, including the stevor family of subtelomeric open reading frames and the rif interspersed repetitive elements. Both families contain three predicted transmembrane segments. It has been proposed that stevor and rif are members of a larger superfamily that code for variant surface antigens [].
Probab=33.65 E-value=32 Score=27.15 Aligned_cols=28 Identities=21% Similarity=0.234 Sum_probs=17.0
Q ss_pred HHHHHHHHHHHhhhhHhhhHhheecccc
Q psy1136 36 IILYGVTILSGVALNIFAIYHIIVERLF 63 (167)
Q Consensus 36 ~~~~~ii~~~gl~gN~~vi~vi~~~~~~ 63 (167)
.++.+++.++-++.=.+++|.|.|.||.
T Consensus 257 ~I~aSiiaIliIVLIMvIIYLILRYRRK 284 (299)
T PF02009_consen 257 AIIASIIAILIIVLIMVIIYLILRYRRK 284 (299)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3455555555666656777777665553
No 39
>CHL00020 psbN photosystem II protein N
Probab=33.56 E-value=43 Score=18.26 Aligned_cols=32 Identities=9% Similarity=0.038 Sum_probs=20.2
Q ss_pred HHHHHHHHHHHhhhhHhhhHhheeccccccch
Q psy1136 36 IILYGVTILSGVALNIFAIYHIIVERLFRDAT 67 (167)
Q Consensus 36 ~~~~~ii~~~gl~gN~~vi~vi~~~~~~~~~~ 67 (167)
..++....++|+.|-.+....=.-.|++|.|.
T Consensus 7 ~~i~i~~ll~~~Tgy~iYtaFGppSk~LrDPf 38 (43)
T CHL00020 7 VAIFISGLLVSFTGYALYTAFGQPSKQLRDPF 38 (43)
T ss_pred HHHHHHHHHHHhhheeeeeccCCchhccCCch
Confidence 34555667778888765554445567777764
No 40
>PF11446 DUF2897: Protein of unknown function (DUF2897); InterPro: IPR021550 This is a bacterial family of uncharacterised proteins.
Probab=32.41 E-value=46 Score=19.17 Aligned_cols=15 Identities=33% Similarity=0.248 Sum_probs=9.7
Q ss_pred HHHHHHHhhhhHhhh
Q psy1136 40 GVTILSGVALNIFAI 54 (167)
Q Consensus 40 ~ii~~~gl~gN~~vi 54 (167)
.++.+.-++||++++
T Consensus 9 IviVlgvIigNia~L 23 (55)
T PF11446_consen 9 IVIVLGVIIGNIAAL 23 (55)
T ss_pred HHHHHHHHHhHHHHH
Confidence 344444578998776
No 41
>PF10326 7TM_GPCR_Str: Serpentine type 7TM GPCR chemoreceptor Str; InterPro: IPR019428 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class r (Str) from the Str superfamily [, ]. Almost a quarter (22.5%) of str and srj family genes and pseudogenes in C. elegans appear to have been newly formed by gene duplications since the species split [].
Probab=30.86 E-value=32 Score=26.59 Aligned_cols=44 Identities=18% Similarity=0.249 Sum_probs=30.9
Q ss_pred HHHHHHhhhhHhhhHhheeccccc-cchhhHhHHHHHHHHHHhcc
Q psy1136 41 VTILSGVALNIFAIYHIIVERLFR-DATCGYFINIALADIIKCMF 84 (167)
Q Consensus 41 ii~~~gl~gN~~vi~vi~~~~~~~-~~~~~~l~nLaisDll~~l~ 84 (167)
+.++++++.|.+.++.+.++.+++ -.=.+++.-.|+.|+..+..
T Consensus 7 ~~~~~s~~~N~~Li~Li~~~s~k~~G~Yk~Lm~~fs~~~i~fs~~ 51 (307)
T PF10326_consen 7 IGFVLSLFLNSLLIYLILTKSPKSLGSYKYLMIYFSIFEIIFSIL 51 (307)
T ss_pred HHHHHHHHHHHHHHHHHHhccCCCCCCEEEEEehhHHHHHHHHHH
Confidence 456789999999998776543333 22455666788888887764
No 42
>PHA03087 G protein-coupled chemokine receptor-like protein; Provisional
Probab=29.71 E-value=48 Score=25.90 Aligned_cols=44 Identities=23% Similarity=0.626 Sum_probs=28.8
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccceeccc
Q psy1136 121 ATCGYFINIALADIIKCMFVLPITLMVTLVQNWTWGSFLCYFLPM 165 (167)
Q Consensus 121 ~~~~~~~~i~~~w~~~~~~~~P~~~~~~~~~~w~~g~~~C~~~~~ 165 (167)
.....+..+++...+. ....|........++|.+|+..|+...+
T Consensus 73 ~~~~ll~~laisDll~-~~~~~~~~~~~~~~~~~~~~~~C~~~~~ 116 (335)
T PHA03087 73 PMDIYLLNLAVSDLLF-VMTLPFQIYYYILFQWSFGEFACKIVSG 116 (335)
T ss_pred chHHHHHHHHHHHHHH-HHhHHHHHHHHhCCCCCCCcHHHHHHHH
Confidence 3444556667667554 3456766656667788899988876543
No 43
>PF12606 RELT: Tumour necrosis factor receptor superfamily member 19; InterPro: IPR022248 The members of tumor necrosis factor receptor (TNFR) superfamily have been designated as the "guardians of the immune system" due to their roles in immune cell proliferation, differentiation, activation, and death (apoptosis). RELT (receptor expressed in lymphoid tissues) is a member of the TNFR superfamily. The messenger RNA of RELT is especially abundant in hematologic tissues such as spleen, lymph node, and peripheral blood leukocytes as well as in leukemias and lymphomas. RELT is able to activate the NF-kappaB pathway and selectively binds tumor necrosis factor receptor-associated factor 1 []. RELT like proteins 1 and 2 (RELL1 and RELL2) are two RELT homologues that bind to RELT. The expression of RELL1 at the mRNA level is ubiquitous, whereas expression of RELL2 mRNA is more restricted to particular tissues [].
Probab=29.60 E-value=71 Score=18.01 Aligned_cols=24 Identities=13% Similarity=0.191 Sum_probs=12.2
Q ss_pred HHHHHHHHHHhhhhHhhhHhheeccc
Q psy1136 37 ILYGVTILSGVALNIFAIYHIIVERL 62 (167)
Q Consensus 37 ~~~~ii~~~gl~gN~~vi~vi~~~~~ 62 (167)
++..+++++|++| +.+..+.+.+.
T Consensus 5 ~iV~i~iv~~lLg--~~I~~~~K~yg 28 (50)
T PF12606_consen 5 LIVSIFIVMGLLG--LSICTTLKAYG 28 (50)
T ss_pred HHHHHHHHHHHHH--HHHHHHhhccc
Confidence 3445666667666 33334444433
No 44
>PF06024 DUF912: Nucleopolyhedrovirus protein of unknown function (DUF912); InterPro: IPR009261 This entry is represented by Autographa californica nuclear polyhedrosis virus (AcMNPV), Orf78; it is a family of uncharacterised viral proteins.
Probab=28.74 E-value=87 Score=20.30 Aligned_cols=9 Identities=33% Similarity=0.420 Sum_probs=4.2
Q ss_pred Hhheecccc
Q psy1136 55 YHIIVERLF 63 (167)
Q Consensus 55 ~vi~~~~~~ 63 (167)
.++.|.|+.
T Consensus 84 FVILRer~~ 92 (101)
T PF06024_consen 84 FVILRERQK 92 (101)
T ss_pred EEEEecccc
Confidence 344555443
No 45
>PF06679 DUF1180: Protein of unknown function (DUF1180); InterPro: IPR009565 This entry consists of several hypothetical eukaryotic proteins thought to be membrane proteins. Their function is unknown.
Probab=25.43 E-value=1.5e+02 Score=21.24 Aligned_cols=24 Identities=29% Similarity=0.297 Sum_probs=12.0
Q ss_pred HHHHHHHHHHHHHhhhhHhhhHhh
Q psy1136 34 FFIILYGVTILSGVALNIFAIYHI 57 (167)
Q Consensus 34 ~~~~~~~ii~~~gl~gN~~vi~vi 57 (167)
+.-.+|.++++.+++.=.+++-++
T Consensus 94 l~R~~~Vl~g~s~l~i~yfvir~~ 117 (163)
T PF06679_consen 94 LKRALYVLVGLSALAILYFVIRTF 117 (163)
T ss_pred hhhhHHHHHHHHHHHHHHHHHHHH
Confidence 344556666666655444444333
No 46
>PF02060 ISK_Channel: Slow voltage-gated potassium channel; InterPro: IPR000369 Potassium channels are the most diverse group of the ion channel family [, ]. They are important in shaping the action potential, and in neuronal excitability and plasticity []. The potassium channel family is composed of several functionally distinct isoforms, which can be broadly separated into 2 groups []: the practically non-inactivating 'delayed' group and the rapidly inactivating 'transient' group. These are all highly similar proteins, with only small amino acid changes causing the diversity of the voltage-dependent gating mechanism, channel conductance and toxin binding properties. Each type of K+ channel is activated by different signals and conditions depending on their type of regulation: some open in response to depolarisation of the plasma membrane; others in response to hyperpolarisation or an increase in intracellular calcium concentration; some can be regulated by binding of a transmitter, together with intracellular kinases; while others are regulated by GTP-binding proteins or other second messengers []. In eukaryotic cells, K+ channels are involved in neural signalling and generation of the cardiac rhythm, act as effectors in signal transduction pathways involving G protein-coupled receptors (GPCRs) and may have a role in target cell lysis by cytotoxic T-lymphocytes []. In prokaryotic cells, they play a role in the maintenance of ionic homeostasis []. All K+ channels discovered so far possess a core of alpha subunits, each comprising either one or two copies of a highly conserved pore loop domain (P-domain). The P-domain contains the sequence (T/SxxTxGxG), which has been termed the K+ selectivity sequence. In families that contain one P-domain, four subunits assemble to form a selective pathway for K+ across the membrane. However, it remains unclear how the 2 P-domain subunits assemble to form a selective pore. The functional diversity of these families can arise through homo- or hetero-associations of alpha subunits or association with auxiliary cytoplasmic beta subunits. K+ channel subunits containing one pore domain can be assigned into one of two superfamilies: those that possess six transmembrane (TM) domains and those that possess only two TM domains. The six TM domain superfamily can be further subdivided into conserved gene families: the voltage-gated (Kv) channels; the KCNQ channels (originally known as KvLQT channels); the EAG-like K+ channels; and three types of calcium (Ca)-activated K+ channels (BK, IK and SK) []. The 2TM domain family comprises inward-rectifying K+ channels. In addition, there are K+ channel alpha-subunits that possess two P-domains. These are usually highly regulated K+ selective leak channels. Two types of beta subunit (KCNE and KCNAB) are presently known to associate with voltage-gated alpha subunits (Kv, KCNQ and eag-like). However, not all combinations of alpha and beta subunits are possible. The KCNE family of K+ channel subunits are membrane glycoproteins that possess a single transmembrane (TM) domain. They share no structural relationship with the alpha subunit proteins, which possess pore forming domains. The subunits appear to have a regulatory function, modulating the kinetics and voltage dependence of the alpha subunits of voltage-dependent K+ channels. KCNE subunits are formed from short polypeptides of ~130 amino acids, and are divided into five subfamilies: KCNE1 (MinK/IsK), KCNE2 (MiRP1), KCNE3 (MiRP2), KCNE4 (MiRP3) and KCNE1L (AMMECR2). ; GO: 0005249 voltage-gated potassium channel activity, 0006811 ion transport, 0016020 membrane; PDB: 2K21_A.
Probab=21.25 E-value=1.1e+02 Score=20.89 Aligned_cols=10 Identities=20% Similarity=0.159 Sum_probs=3.5
Q ss_pred cccchhhHhH
Q psy1136 63 FRDATCGYFI 72 (167)
Q Consensus 63 ~~~~~~~~l~ 72 (167)
.+.|-+.|+.
T Consensus 74 s~DPyh~YIe 83 (129)
T PF02060_consen 74 SHDPYHQYIE 83 (129)
T ss_dssp ---TTTTTS-
T ss_pred ccChHHHHhH
Confidence 3456555553
No 47
>PF01299 Lamp: Lysosome-associated membrane glycoprotein (Lamp); InterPro: IPR002000 Lysosome-associated membrane glycoproteins (lamp) [] are integral membrane proteins, specific to lysosomes, and whose exact biological function is not yet clear. Structurally, the lamp proteins consist of two internally homologous lysosome-luminal domains separated by a proline-rich hinge region; at the C-terminal extremity there is a transmembrane region (TM) followed by a very short cytoplasmic tail (C). In each of the duplicated domains, there are two conserved disulphide bonds. This structure is schematically represented in the figure below. +-----+ +-----+ +-----+ +-----+ | | | | | | | | xCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxxxCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxx +--------------------------++Hinge++--------------------------++TM++C+ In mammals, there are two closely related types of lamp: lamp-1 and lamp-2, which form major components of the lysosome membrane. In chicken lamp-1 is known as LEP100. Also included in this entry is the macrophage protein CD68 (or macrosialin) [] is a heavily glycosylated integral membrane protein whose structure consists of a mucin-like domain followed by a proline-rich hinge; a single lamp-like domain; a transmembrane region and a short cytoplasmic tail. Similar to CD68, mammalian lamp-3, which is expressed in lymphoid organs, dendritic cells and in lung, contains all the C-terminal regions but lacks the N-terminal lamp-like region []. In a lamp-family protein from nematodes [] only the part C-terminal to the hinge is conserved. ; GO: 0016020 membrane
Probab=20.77 E-value=2e+02 Score=22.54 Aligned_cols=11 Identities=27% Similarity=0.208 Sum_probs=5.4
Q ss_pred hhhHhheeccc
Q psy1136 52 FAIYHIIVERL 62 (167)
Q Consensus 52 ~vi~vi~~~~~ 62 (167)
++.|.+.|||+
T Consensus 289 LiaYli~Rrr~ 299 (306)
T PF01299_consen 289 LIAYLIGRRRS 299 (306)
T ss_pred HHhheeEeccc
Confidence 44455555443
No 48
>PHA03235 DNA packaging protein UL33; Provisional
Probab=20.72 E-value=1e+02 Score=25.35 Aligned_cols=39 Identities=26% Similarity=0.490 Sum_probs=26.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhccc--ccccccceeccc
Q psy1136 123 CGYFINIALADIIKCMFVLPITLMVTLVQNW--TWGSFLCYFLPM 165 (167)
Q Consensus 123 ~~~~~~i~~~w~~~~~~~~P~~~~~~~~~~w--~~g~~~C~~~~~ 165 (167)
...+..+++..++. ++..|+.+.. ..| ..|...|++..+
T Consensus 70 ~~~I~NLAvsDLl~-l~~lP~~i~~---~~~~~~~g~~~Ck~~~~ 110 (409)
T PHA03235 70 TLYMTNLYLANLLT-VFVLPFIMLS---NQGLLSGSVAGCKFASL 110 (409)
T ss_pred HHHHHHHHHHHHHH-HHHHHHHHHh---cCccccCCCCeehhHHH
Confidence 45678888888765 6778876543 344 445789987754
No 49
>PF05545 FixQ: Cbb3-type cytochrome oxidase component FixQ; InterPro: IPR008621 This family consists of several Cbb3-type cytochrome oxidase components (FixQ/CcoQ). FixQ is found in nitrogen fixing bacteria. Since nitrogen fixation is an energy-consuming process, effective symbioses depend on operation of a respiratory chain with a high affinity for O2, closely coupled to ATP production. This requirement is fulfilled by a special three-subunit terminal oxidase (cytochrome terminal oxidase cbb3), which was first identified in Bradyrhizobium japonicum as the product of the fixNOQP operon [].
Probab=20.72 E-value=1.6e+02 Score=16.03 Aligned_cols=18 Identities=11% Similarity=-0.040 Sum_probs=8.2
Q ss_pred HHHHHHhhhhHhhhHhheec
Q psy1136 41 VTILSGVALNIFAIYHIIVE 60 (167)
Q Consensus 41 ii~~~gl~gN~~vi~vi~~~ 60 (167)
+++.+.++| +++++..++
T Consensus 16 v~~~~~F~g--i~~w~~~~~ 33 (49)
T PF05545_consen 16 VLFFVFFIG--IVIWAYRPR 33 (49)
T ss_pred HHHHHHHHH--HHHHHHccc
Confidence 333334444 455655444
Done!