Query psy15241
Match_columns 117
No_of_seqs 117 out of 1184
Neff 9.9
Searched_HMMs 46136
Date Sat Aug 17 00:55:13 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy15241.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/15241hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4219|consensus 99.8 5E-20 1.1E-24 127.6 5.9 84 13-96 251-337 (423)
2 PHA03234 DNA packaging protein 99.8 3.5E-19 7.6E-24 123.9 6.9 78 16-93 230-315 (338)
3 KOG4220|consensus 99.8 3.4E-19 7.3E-24 124.0 2.6 85 14-99 408-492 (503)
4 PHA02834 chemokine receptor-li 99.7 2.8E-18 6E-23 118.7 2.7 81 16-96 222-311 (323)
5 PHA03235 DNA packaging protein 99.7 1.1E-17 2.5E-22 118.8 5.6 79 17-95 238-324 (409)
6 PHA02638 CC chemokine receptor 99.7 1.5E-16 3.2E-21 113.4 6.2 81 16-96 306-399 (417)
7 PHA03087 G protein-coupled che 99.6 1.6E-15 3.4E-20 105.2 5.7 85 15-99 236-328 (335)
8 PF00001 7tm_1: 7 transmembran 99.3 1.4E-11 3.1E-16 81.0 6.6 66 12-77 191-257 (257)
9 KOG2087|consensus 98.8 5.2E-11 1.1E-15 82.3 -6.9 77 19-95 231-307 (363)
10 PF10324 7TM_GPCR_Srw: Serpent 98.7 3E-08 6.4E-13 68.5 6.2 80 15-94 234-317 (318)
11 PF10320 7TM_GPCR_Srsx: Serpen 98.3 1.1E-06 2.4E-11 59.3 5.1 79 14-92 178-257 (257)
12 PF11970 Git3_C: G protein-cou 98.0 6E-05 1.3E-09 41.7 6.7 69 14-82 3-72 (76)
13 PF10321 7TM_GPCR_Srt: Serpent 97.6 0.0002 4.4E-09 49.8 5.5 77 10-93 230-306 (313)
14 PF10323 7TM_GPCR_Srv: Serpent 97.5 0.00086 1.9E-08 46.0 7.8 81 11-94 201-283 (283)
15 PF05296 TAS2R: Mammalian tast 97.4 0.002 4.4E-08 44.7 8.6 81 11-95 222-302 (303)
16 PF10317 7TM_GPCR_Srd: Serpent 97.0 0.0055 1.2E-07 42.2 7.7 78 11-90 215-292 (292)
17 PF10328 7TM_GPCR_Srx: Serpent 97.0 0.0059 1.3E-07 41.5 7.7 79 8-92 195-273 (274)
18 PF05462 Dicty_CAR: Slime mold 97.0 0.0062 1.3E-07 42.3 7.8 67 25-94 204-270 (303)
19 PF03402 V1R: Vomeronasal orga 96.8 0.0025 5.4E-08 43.4 4.3 83 3-88 181-263 (265)
20 PF02101 Ocular_alb: Ocular al 95.9 0.031 6.8E-07 39.9 5.8 26 21-46 242-267 (405)
21 PF10327 7TM_GPCR_Sri: Serpent 95.8 0.029 6.3E-07 39.0 5.2 78 8-87 225-302 (303)
22 PF10318 7TM_GPCR_Srh: Serpent 95.6 0.078 1.7E-06 36.5 6.8 83 9-94 219-302 (302)
23 PF10326 7TM_GPCR_Str: Serpent 94.3 0.0093 2E-07 41.1 -0.5 76 11-89 231-307 (307)
24 KOG4193|consensus 92.7 1.6 3.6E-05 33.5 8.8 76 20-102 514-589 (610)
25 KOG4564|consensus 92.3 0.15 3.2E-06 37.7 2.9 83 5-95 337-420 (473)
26 PF03125 Sre: C. elegans Sre G 92.0 2.4 5.3E-05 30.3 8.7 42 55-96 282-323 (365)
27 KOG2575|consensus 88.9 0.7 1.5E-05 33.7 3.7 26 17-42 253-278 (510)
28 PF10319 7TM_GPCR_Srj: Serpent 87.8 2.3 5.1E-05 29.9 5.6 75 11-87 234-309 (310)
29 KOG3827|consensus 83.5 1.2 2.6E-05 32.1 2.6 61 20-80 61-122 (400)
30 PF02118 Srg: Srg family chemo 83.3 7.3 0.00016 26.2 6.4 75 12-87 200-274 (275)
31 PF04789 DUF621: Protein of un 78.9 11 0.00025 26.2 5.9 76 14-92 221-296 (305)
32 PF09889 DUF2116: Uncharacteri 78.3 5 0.00011 21.0 3.3 24 14-37 31-54 (59)
33 PF10292 7TM_GPCR_Srab: Serpen 76.0 24 0.00052 24.7 7.2 80 14-93 224-305 (324)
34 PF05393 Hum_adeno_E3A: Human 70.1 7.6 0.00016 22.1 2.8 17 21-37 40-56 (94)
35 PF03155 Alg6_Alg8: ALG6, ALG8 64.8 13 0.00028 27.9 3.9 27 16-42 218-244 (469)
36 PF04238 DUF420: Protein of un 58.6 27 0.00059 21.4 4.1 41 12-52 28-68 (133)
37 KOG1341|consensus 51.3 70 0.0015 25.3 5.8 36 12-47 450-485 (854)
38 PF06072 Herpes_US9: Alphaherp 50.5 37 0.0008 17.8 3.7 25 14-38 24-49 (60)
39 PF11298 DUF3099: Protein of u 45.4 52 0.0011 18.0 3.7 30 11-40 10-39 (73)
40 PF02532 PsbI: Photosystem II 43.7 37 0.0008 15.8 2.6 15 21-35 3-17 (36)
41 PF06210 DUF1003: Protein of u 43.5 61 0.0013 19.1 3.8 56 24-83 5-60 (108)
42 TIGR01912 TatC-Arch Twin argin 43.0 1E+02 0.0023 20.8 5.4 32 15-46 7-38 (237)
43 PHA02657 hypothetical protein; 42.5 64 0.0014 18.3 3.8 24 24-47 27-50 (95)
44 PF10777 YlaC: Inner membrane 40.7 64 0.0014 20.3 3.7 39 11-49 21-59 (155)
45 TIGR00934 2a38euk potassium up 39.7 2.1E+02 0.0045 23.4 7.0 30 18-47 444-473 (800)
46 PHA02849 putative transmembran 39.1 71 0.0015 17.8 3.9 25 23-47 16-40 (82)
47 COG4068 Uncharacterized protei 38.7 54 0.0012 17.2 2.6 12 17-28 38-49 (64)
48 PF10445 DUF2456: Protein of u 36.9 86 0.0019 18.1 3.9 26 18-43 26-51 (94)
49 PF15188 CCDC-167: Coiled-coil 35.4 49 0.0011 18.7 2.4 18 22-39 67-84 (85)
50 COG3821 Predicted membrane pro 35.4 37 0.00079 22.6 2.1 17 26-42 7-23 (234)
51 PF07010 Endomucin: Endomucin; 35.0 46 0.001 22.6 2.5 7 40-46 206-212 (259)
52 PF05391 Lsm_interact: Lsm int 34.8 26 0.00056 14.3 0.9 10 80-89 11-20 (21)
53 COG2322 Predicted membrane pro 34.0 1E+02 0.0022 19.9 3.9 31 14-44 71-101 (177)
54 PF14992 TMCO5: TMCO5 family 32.2 1E+02 0.0023 21.5 4.0 24 19-42 215-240 (280)
55 PF01679 Pmp3: Proteolipid mem 31.4 52 0.0011 16.6 1.9 22 23-44 25-46 (51)
56 KOG4686|consensus 31.1 1.1E+02 0.0025 22.1 4.1 24 20-43 264-287 (459)
57 COG1823 Predicted Na+/dicarbox 30.5 2.2E+02 0.0048 21.2 5.4 58 21-78 221-293 (458)
58 COG2383 Uncharacterized conser 29.8 89 0.0019 18.4 2.9 19 17-35 35-53 (109)
59 KOG1726|consensus 29.8 86 0.0019 21.2 3.2 14 28-41 50-63 (225)
60 PF11119 DUF2633: Protein of u 28.7 97 0.0021 16.2 4.3 31 16-46 4-34 (59)
61 PRK06761 hypothetical protein; 27.0 74 0.0016 22.2 2.6 34 59-93 169-202 (282)
62 PF12609 DUF3774: Wound-induce 26.3 4.4 9.6E-05 22.5 -2.8 8 31-38 72-79 (79)
63 PF04505 Dispanin: Interferon- 25.1 64 0.0014 17.9 1.7 20 28-47 22-41 (82)
64 KOG0812|consensus 24.6 1.2E+02 0.0026 21.4 3.2 19 18-36 289-307 (311)
65 PF05399 EVI2A: Ectropic viral 24.6 2.1E+02 0.0045 19.3 4.2 23 22-44 131-153 (227)
66 PF10601 zf-LITAF-like: LITAF- 24.5 1.2E+02 0.0026 16.1 2.8 8 34-41 44-51 (73)
67 PF06196 DUF997: Protein of un 24.4 1.4E+02 0.003 16.6 3.3 16 24-39 9-24 (80)
68 PF14257 DUF4349: Domain of un 24.4 1.6E+02 0.0034 20.0 3.9 18 28-45 238-255 (262)
69 PF04272 Phospholamban: Phosph 24.3 1.1E+02 0.0023 15.1 2.4 9 32-40 38-46 (52)
70 KOG4561|consensus 24.3 2.6E+02 0.0056 19.7 5.0 31 60-90 236-266 (281)
71 PF06716 DUF1201: Protein of u 24.2 1.1E+02 0.0023 15.2 3.5 15 28-42 11-25 (54)
72 PRK09731 putative general secr 23.6 1.8E+02 0.0038 18.9 3.7 28 14-41 32-62 (178)
73 COG1687 AzlD Predicted branche 23.5 1.7E+02 0.0037 17.3 3.7 23 21-43 6-28 (106)
74 PRK10633 hypothetical protein; 23.2 1.5E+02 0.0033 16.6 3.6 24 16-39 6-29 (80)
75 COG5058 LAG1 Protein transport 23.1 2.2E+02 0.0048 20.6 4.3 62 19-80 231-294 (395)
76 COG4041 Predicted membrane pro 22.3 1.8E+02 0.0039 18.1 3.3 21 26-46 10-30 (171)
77 KOG4220|consensus 22.0 3.6E+02 0.0078 20.5 5.9 76 13-90 139-226 (503)
78 PRK06099 F0F1 ATP synthase sub 21.5 1.7E+02 0.0037 17.8 3.2 19 32-50 45-63 (126)
No 1
>KOG4219|consensus
Probab=99.80 E-value=5e-20 Score=127.60 Aligned_cols=84 Identities=17% Similarity=0.342 Sum_probs=73.0
Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC---CcchhHhHHHHHHHhhhccccchhheecChHHHHHHH
Q psy15241 13 GQSSAEVRIAKTAITLCALFVASWTPYAVVALTGAFGDQS---LLTPGLTMIPACTCKAVACLDPYVYAISHPRYRLELS 89 (117)
Q Consensus 13 ~~~~~~~k~~k~l~~iv~~f~icw~P~~~~~~~~~~~~~~---~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~~~ 89 (117)
++.++++|+.||++++|+.|.+||+||++..++......- .....++....+|+.+|+|+||+||++.|++||.+++
T Consensus 251 ~~~kak~K~vkmliiVV~~FaicWlPyh~y~il~~~~~~i~~~k~i~~vyl~~~WLaMSst~yNPiIY~~lN~Rfr~gf~ 330 (423)
T KOG4219|consen 251 EQLKAKKKVVKMLIIVVVIFAICWLPYHIYFILNATNPEINRKKFIQQVYLAIYWLAMSSTCYNPIIYCFLNKRFRGGFR 330 (423)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccChhHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhhccHhhhhhHHHHHHHHh
Confidence 5778899999999999999999999999998877543321 2344677889999999999999999999999999999
Q ss_pred hhcccCc
Q psy15241 90 KRIPCLG 96 (117)
Q Consensus 90 ~~~~~~~ 96 (117)
+.++||.
T Consensus 331 ~~fr~cp 337 (423)
T KOG4219|consen 331 RAFRWCP 337 (423)
T ss_pred hhhheee
Confidence 9999885
No 2
>PHA03234 DNA packaging protein UL33; Provisional
Probab=99.78 E-value=3.5e-19 Score=123.87 Aligned_cols=78 Identities=15% Similarity=0.084 Sum_probs=62.6
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC----CC----cchhHhHHHHHHHhhhccccchhheecChHHHHH
Q psy15241 16 SAEVRIAKTAITLCALFVASWTPYAVVALTGAFGDQ----SL----LTPGLTMIPACTCKAVACLDPYVYAISHPRYRLE 87 (117)
Q Consensus 16 ~~~~k~~k~l~~iv~~f~icw~P~~~~~~~~~~~~~----~~----~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~ 87 (117)
++++|++|++++++++|++||+||++..++...... .. .....+.++.+++++|||+||+||++.+++||++
T Consensus 230 ~~~~k~~k~i~~vv~vF~iCWlPy~iv~l~~~~~~~~~~~~c~~~~~~~~~~~v~~~La~~nsclNPiIY~f~~~~FR~~ 309 (338)
T PHA03234 230 KKHKKTLFFIRILILSFLCIQIPNIAILICEIAFLYIANNSCFGLAQREILQIIIRLMPEIHCFSNPLVYAFTGGDFRLR 309 (338)
T ss_pred hhhhhhhhHHHHHHHHHHHHHhHHHHHHHHHHHHHhcccCcchHHHHHHHHHHHHHHHHHhhhhhhHHHHHHhhHHHHHH
Confidence 567899999999999999999999998776543211 10 1123457889999999999999999999999999
Q ss_pred HHhhcc
Q psy15241 88 LSKRIP 93 (117)
Q Consensus 88 ~~~~~~ 93 (117)
+++.++
T Consensus 310 ~~~~~~ 315 (338)
T PHA03234 310 FTACFQ 315 (338)
T ss_pred HHHHHH
Confidence 887663
No 3
>KOG4220|consensus
Probab=99.75 E-value=3.4e-19 Score=123.97 Aligned_cols=85 Identities=22% Similarity=0.434 Sum_probs=77.5
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCcchhHhHHHHHHHhhhccccchhheecChHHHHHHHhhcc
Q psy15241 14 QSSAEVRIAKTAITLCALFVASWTPYAVVALTGAFGDQSLLTPGLTMIPACTCKAVACLDPYVYAISHPRYRLELSKRIP 93 (117)
Q Consensus 14 ~~~~~~k~~k~l~~iv~~f~icw~P~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~~~~~~~ 93 (117)
...+|+|++|++..|+++|++||+||+++.++..+.... .+..+..+.-+|.|+||-+||+.|.+-|+.||+.+++++.
T Consensus 408 ~~~rErKAAkTLsAILlAFIiTWtPYNImVlv~tFC~~C-iP~tlW~~gYwLCYINSTiNP~CYALCNatFrkTfk~lL~ 486 (503)
T KOG4220|consen 408 SLVRERKAAKTLSAILLAFILTWTPYNIMVLVNTFCKNC-IPETLWTFGYWLCYINSTINPLCYALCNATFRKTFKRLLL 486 (503)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHcccceeeeehHhhcccc-cchhHhhhhhheeeecccccHHHHHHHhHHHHHHHHHhhe
Confidence 447899999999999999999999999999999887766 7889999999999999999999999999999999999997
Q ss_pred cCcccC
Q psy15241 94 CLGIKE 99 (117)
Q Consensus 94 ~~~~~~ 99 (117)
|....+
T Consensus 487 Cr~~~~ 492 (503)
T KOG4220|consen 487 CRWKKR 492 (503)
T ss_pred eeeccc
Confidence 665443
No 4
>PHA02834 chemokine receptor-like protein; Provisional
Probab=99.71 E-value=2.8e-18 Score=118.68 Aligned_cols=81 Identities=16% Similarity=0.315 Sum_probs=63.8
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCc-------ch--hHhHHHHHHHhhhccccchhheecChHHHH
Q psy15241 16 SAEVRIAKTAITLCALFVASWTPYAVVALTGAFGDQSLL-------TP--GLTMIPACTCKAVACLDPYVYAISHPRYRL 86 (117)
Q Consensus 16 ~~~~k~~k~l~~iv~~f~icw~P~~~~~~~~~~~~~~~~-------~~--~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~ 86 (117)
+.++|..|++++++++|++||+||++..++..+...... .. ....++.+++++||++||+||++.+++||+
T Consensus 222 ~~~~k~~k~~~~vv~~F~icWlPy~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~~ns~iNPiIY~~~~~~fR~ 301 (323)
T PHA02834 222 KNKTRSIKIILTVVTFTVVFWVPFNIVLFINSLQSVGLIDIGCYHFKKIVYSIDIAELISFVHCCVNPIIYAFVGKNFKK 301 (323)
T ss_pred cccceEEeehhHHHHHHHHHHhhHHHHHHHHHHHHhcCCCccchHHHHHHHHHHHHHHHHHhhccccHHHHHHhcHHHHH
Confidence 346788999999999999999999999877654321100 11 124578899999999999999999999999
Q ss_pred HHHhhcccCc
Q psy15241 87 ELSKRIPCLG 96 (117)
Q Consensus 87 ~~~~~~~~~~ 96 (117)
++++++++..
T Consensus 302 ~~~~~~~~~~ 311 (323)
T PHA02834 302 VFKNMFCRTN 311 (323)
T ss_pred HHHHHHHhhh
Confidence 9999885543
No 5
>PHA03235 DNA packaging protein UL33; Provisional
Probab=99.71 E-value=1.1e-17 Score=118.84 Aligned_cols=79 Identities=13% Similarity=0.183 Sum_probs=63.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC----C----CcchhHhHHHHHHHhhhccccchhheecChHHHHHH
Q psy15241 17 AEVRIAKTAITLCALFVASWTPYAVVALTGAFGDQ----S----LLTPGLTMIPACTCKAVACLDPYVYAISHPRYRLEL 88 (117)
Q Consensus 17 ~~~k~~k~l~~iv~~f~icw~P~~~~~~~~~~~~~----~----~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~~ 88 (117)
+++|..+++++++++|++||+||+++.++..+... . .....+..++.+++++||++||+||++.+++||+++
T Consensus 238 ~~~k~~~~v~iivv~F~iCWlPy~v~~l~~~~~~~~~~~~~~~~~~~~~~~~i~~~La~~ns~lNPiIY~~~~~~FRk~~ 317 (409)
T PHA03235 238 KRSRTLTFVCILLLSFLCLQTPFVAIMIFDSYATLIWPSDCEHINLRDAVSTLSRLVPNLHCLLNPILYAFLGNDFLKRF 317 (409)
T ss_pred cchhhhhhHHHHHHHHHHHHhHHHHHHHHHHHHHhcCCCCchhhhHHHHHHHHHHHHHHHHHhHhHHHHHHhhHHHHHHH
Confidence 35677888899999999999999998887644221 1 012245678899999999999999999999999999
Q ss_pred HhhcccC
Q psy15241 89 SKRIPCL 95 (117)
Q Consensus 89 ~~~~~~~ 95 (117)
++.+.+.
T Consensus 318 ~~~l~~~ 324 (409)
T PHA03235 318 RQCFRGE 324 (409)
T ss_pred HHHHhhh
Confidence 9888653
No 6
>PHA02638 CC chemokine receptor-like protein; Provisional
Probab=99.66 E-value=1.5e-16 Score=113.44 Aligned_cols=81 Identities=14% Similarity=0.216 Sum_probs=64.5
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC-----------C--cchhHhHHHHHHHhhhccccchhheecCh
Q psy15241 16 SAEVRIAKTAITLCALFVASWTPYAVVALTGAFGDQS-----------L--LTPGLTMIPACTCKAVACLDPYVYAISHP 82 (117)
Q Consensus 16 ~~~~k~~k~l~~iv~~f~icw~P~~~~~~~~~~~~~~-----------~--~~~~~~~~~~~l~~~ns~~NPiiY~~~~~ 82 (117)
++++|.+|++++++++|++||+||++..++..+.... . .......++..++++|+|+||+||++.++
T Consensus 306 ~~k~k~~rli~~ivi~f~lcW~Py~i~~ll~~~~~~~~~~~~~~~~~c~~~~l~~~~~vt~~la~~~sclNPiIY~f~~~ 385 (417)
T PHA02638 306 SKKTKSIIIVSIIIICSLICWIPLNIVILFATMYSFKGFNSIISEHICGFIKLGYAMMLAEAISLTHCCINPLIYTLIGE 385 (417)
T ss_pred cccchhhhhHHHHHHHHHHHHhhHHHHHHHHHHHHhhccccccccccccHHHHHHHHHHHHHHHHHHHhhhHHHHHHhCH
Confidence 3467888999999999999999999998877543211 0 01234567889999999999999999999
Q ss_pred HHHHHHHhhcccCc
Q psy15241 83 RYRLELSKRIPCLG 96 (117)
Q Consensus 83 ~fr~~~~~~~~~~~ 96 (117)
+||+++++.+++..
T Consensus 386 ~FR~~l~~~~~~~~ 399 (417)
T PHA02638 386 NFRMHLLMIFRNIF 399 (417)
T ss_pred HHHHHHHHHHHHhc
Confidence 99999999885544
No 7
>PHA03087 G protein-coupled chemokine receptor-like protein; Provisional
Probab=99.60 E-value=1.6e-15 Score=105.18 Aligned_cols=85 Identities=16% Similarity=0.280 Sum_probs=65.5
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC---CCC-C----cchhHhHHHHHHHhhhccccchhheecChHHHH
Q psy15241 15 SSAEVRIAKTAITLCALFVASWTPYAVVALTGAFG---DQS-L----LTPGLTMIPACTCKAVACLDPYVYAISHPRYRL 86 (117)
Q Consensus 15 ~~~~~k~~k~l~~iv~~f~icw~P~~~~~~~~~~~---~~~-~----~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~ 86 (117)
.++++|+.|++.+++++|++||+|+.+..++.... ... . ..+....++.+++.+|+++||+||++.+++||+
T Consensus 236 ~~~~~k~~k~l~~iv~~f~i~w~P~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~~ns~~NPiIY~~~~~~fr~ 315 (335)
T PHA03087 236 SKKNKKAIKLVLIIVILFVIFWLPFNVSVFVYSLHILHFKSGCKAVKYIQYALHVTEIISLSHCCINPLIYAFVSEFFNK 315 (335)
T ss_pred chhcchHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcHHHHHHHHHHHHHHHHHHHHhhhhhHHHHcCHHHHH
Confidence 35678999999999999999999999877655321 111 0 112344678889999999999999999999999
Q ss_pred HHHhhcccCcccC
Q psy15241 87 ELSKRIPCLGIKE 99 (117)
Q Consensus 87 ~~~~~~~~~~~~~ 99 (117)
++++.++....+.
T Consensus 316 ~~~~~~~~~~~~~ 328 (335)
T PHA03087 316 HKKKSLKLMFTSS 328 (335)
T ss_pred HHHHHHHHHhhcC
Confidence 9999886544333
No 8
>PF00001 7tm_1: 7 transmembrane receptor (rhodopsin family) Rhodopsin-like GPCR superfamily signature 5-hydroxytryptamine 7 receptor signature bradykinin receptor signature gastrin receptor signature melatonin receptor signature olfactory receptor signature; InterPro: IPR000276 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The rhodopsin-like GPCRs themselves represent a widespread protein family that includes hormone, neurotransmitter and light receptors, all of which transduce extracellular signals through interaction with guanine nucleotide-binding (G) proteins. Although their activating ligands vary widely in structure and character, the amino acid sequences of the receptors are very similar and are believed to adopt a common structural framework comprising 7 transmembrane (TM) helices [, , ].; GO: 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane; PDB: 2KI9_A 3QAK_A 2YDV_A 3VGA_A 3PWH_A 3RFM_A 3EML_A 3VG9_A 3REY_A 3UZA_A ....
Probab=99.27 E-value=1.4e-11 Score=80.97 Aligned_cols=66 Identities=27% Similarity=0.516 Sum_probs=57.0
Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC-cchhHhHHHHHHHhhhccccchhh
Q psy15241 12 EGQSSAEVRIAKTAITLCALFVASWTPYAVVALTGAFGDQSL-LTPGLTMIPACTCKAVACLDPYVY 77 (117)
Q Consensus 12 ~~~~~~~~k~~k~l~~iv~~f~icw~P~~~~~~~~~~~~~~~-~~~~~~~~~~~l~~~ns~~NPiiY 77 (117)
..+.++++|.++++.++++.|++||+|+.+..++.......+ .......++.++.++|+++||+||
T Consensus 191 ~~~~~~~~~~~~~~~~i~~~f~~~~~P~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~~s~~nP~iY 257 (257)
T PF00001_consen 191 RRRSRRERRAARTLLIIVLVFLLCWLPYFILSLLSVFSPSSSLISSILFYISYFLAFLNSCLNPIIY 257 (257)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHSSTSTCSHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred ccccccccccccccccccccccccCCceeHHHHHHHHcCccchhhHHHHHHHHHHHHHHHhhCcEEC
Confidence 346678999999999999999999999999998888766553 355677899999999999999998
No 9
>KOG2087|consensus
Probab=98.76 E-value=5.2e-11 Score=82.26 Aligned_cols=77 Identities=18% Similarity=0.303 Sum_probs=60.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCcchhHhHHHHHHHhhhccccchhheecChHHHHHHHhhcccC
Q psy15241 19 VRIAKTAITLCALFVASWTPYAVVALTGAFGDQSLLTPGLTMIPACTCKAVACLDPYVYAISHPRYRLELSKRIPCL 95 (117)
Q Consensus 19 ~k~~k~l~~iv~~f~icw~P~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~~~~~~~~~ 95 (117)
.+..|-++.+++.-++||.|..++.+...+...-........+..++.-+|||+||++|.+.++.||+.++.++...
T Consensus 231 ~~~akr~a~LvfTd~icw~Pi~f~~~~al~~~~li~~~~sk~llv~flPlns~~NP~LYa~fT~~fk~d~~~l~~k~ 307 (363)
T KOG2087|consen 231 TSVAKRMAFLVFTDCICWCPIAFFKFSALIGVELISVSYSKWLLVFFLPLNSCLNPFLYAFFTPVFKEDLFLLLSKV 307 (363)
T ss_pred hhhhhCeeEEEEccccccCchheeeeHHhcCCcccChhhceeEEEEEEEcccccCchhHHHcCHHHHHHHHHHHhhc
Confidence 46777778889999999999999988776655442222233566777889999999999999999999999988655
No 10
>PF10324 7TM_GPCR_Srw: Serpentine type 7TM GPCR chemoreceptor Srw; InterPro: IPR019427 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class w (Srw), which is a solo family amongst the superfamilies of chemoreceptors. The genes encoding Srw do not appear to be under as strong an adaptive evolutionary pressure as those of Srz [].
Probab=98.74 E-value=3e-08 Score=68.54 Aligned_cols=80 Identities=18% Similarity=0.276 Sum_probs=66.4
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC----CcchhHhHHHHHHHhhhccccchhheecChHHHHHHHh
Q psy15241 15 SSAEVRIAKTAITLCALFVASWTPYAVVALTGAFGDQS----LLTPGLTMIPACTCKAVACLDPYVYAISHPRYRLELSK 90 (117)
Q Consensus 15 ~~~~~k~~k~l~~iv~~f~icw~P~~~~~~~~~~~~~~----~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~~~~ 90 (117)
.+++.+.++++..+.+.|+++=+|..+..++....... ........+..++..+|+..|++||++.+.+||+.+++
T Consensus 234 ~~~~~~tt~li~~~ti~f~i~e~p~gi~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~h~~ic~~mSsqYR~t~~~ 313 (318)
T PF10324_consen 234 SKKSDRTTKLILFMTISFLISELPQGIIFLLESFFEEDSGLIFIIIQLSIIFNILITINSSIHFFICCFMSSQYRKTVKK 313 (318)
T ss_pred cccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcchhHHHHHHHHHHHHHHHHHhhhHHHhhhhhhHHHHHHHHH
Confidence 34677899999999999999999999999987753322 12234456788899999999999999999999999999
Q ss_pred hccc
Q psy15241 91 RIPC 94 (117)
Q Consensus 91 ~~~~ 94 (117)
+++|
T Consensus 314 ~f~~ 317 (318)
T PF10324_consen 314 LFGC 317 (318)
T ss_pred Hhcc
Confidence 9965
No 11
>PF10320 7TM_GPCR_Srsx: Serpentine type 7TM GPCR chemoreceptor Srsx; InterPro: IPR019424 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class sx (Srsx), which is a solo family amongst the superfamilies of chemoreceptors. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' [].
Probab=98.33 E-value=1.1e-06 Score=59.30 Aligned_cols=79 Identities=14% Similarity=0.153 Sum_probs=63.0
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC-CcchhHhHHHHHHHhhhccccchhheecChHHHHHHHhhc
Q psy15241 14 QSSAEVRIAKTAITLCALFVASWTPYAVVALTGAFGDQS-LLTPGLTMIPACTCKAVACLDPYVYAISHPRYRLELSKRI 92 (117)
Q Consensus 14 ~~~~~~k~~k~l~~iv~~f~icw~P~~~~~~~~~~~~~~-~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~~~~~~ 92 (117)
..++++|+.|.+.+.++.|+.+|.=..+...+....... ...........+++.+|.+.|-++|.+.|+|||+++++++
T Consensus 178 ~~~~~~kv~ksL~v~v~i~i~~w~~s~~~~~v~~~~~~~~~~~~~i~~~~~i~v~~~~s~~ffV~~~~S~EYR~af~~~~ 257 (257)
T PF10320_consen 178 NSSRSKKVFKSLKVTVIIFIFSWFLSQIINTVSLALGLDGETIAIIQMYAGIFVNISYSQNFFVYYWRSSEYRKAFRELF 257 (257)
T ss_pred cchhHHHHHHHhhhheeeeeHHHHHHHHHHHHHHHhCCcHHHHHHHHHHHHHHHHHHHHHHheEEEEcCHHHHHHHHHhC
Confidence 466789999999999999999999887776655433333 2222345667889999999999999999999999999864
No 12
>PF11970 Git3_C: G protein-coupled glucose receptor regulating Gpa2 C-term; InterPro: IPR022596 This entry contains a functionally uncharacterised region belonging to the Git3 G-protein coupled receptor. Git3 is one of six proteins required for glucose-triggered adenylate cyclase activation, and is a G protein-coupled receptor responsible for the activation of adenylate cyclase through Gpa2 - heterotrimeric G protein alpha subunit, part of the glucose-detection pathway. Git3 contains seven predicted transmembrane domains, a third cytoplasmic loop and a cytoplasmic tail []. This family is the conserved C-terminal domain of the member proteins.
Probab=98.00 E-value=6e-05 Score=41.71 Aligned_cols=69 Identities=12% Similarity=0.149 Sum_probs=51.1
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHCCCCCcchhHhHHHHHHHhhhccccchhheecCh
Q psy15241 14 QSSAEVRIAKTAITLCALFVASWT-PYAVVALTGAFGDQSLLTPGLTMIPACTCKAVACLDPYVYAISHP 82 (117)
Q Consensus 14 ~~~~~~k~~k~l~~iv~~f~icw~-P~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~ 82 (117)
++++-+|.+|.+++==++|+++|+ |+.+-.+-............+..++.++..+|+++|-++|++.-+
T Consensus 3 ~r~~i~r~lr~mfiYP~~Yi~lwlfP~~~~~~~~~~~~~~~p~~~l~~i~~~~~~~~G~VD~lvf~~~er 72 (76)
T PF11970_consen 3 RRKRIRRQLRSMFIYPLVYIVLWLFPFAAHRMQYMYEIGHGPSFWLFCIAGFMQPSQGFVDCLVFTLRER 72 (76)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCchHHHHHHHHHHHccCHHHhhheeeecc
Confidence 455667778888888889999999 977665543322222234566788999999999999999998754
No 13
>PF10321 7TM_GPCR_Srt: Serpentine type 7TM GPCR chemoreceptor Srt; InterPro: IPR019425 Chemoreception is mediated in Caenorhabditis elegans by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs) of proteins which are of the serpentine type []. Srt is a member of the Srg superfamily of chemoreceptors. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' [].
Probab=97.58 E-value=0.0002 Score=49.83 Aligned_cols=77 Identities=16% Similarity=0.181 Sum_probs=57.7
Q ss_pred ccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCcchhHhHHHHHHHhhhccccchhheecChHHHHHHH
Q psy15241 10 QKEGQSSAEVRIAKTAITLCALFVASWTPYAVVALTGAFGDQSLLTPGLTMIPACTCKAVACLDPYVYAISHPRYRLELS 89 (117)
Q Consensus 10 ~~~~~~~~~~k~~k~l~~iv~~f~icw~P~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~~~ 89 (117)
.+....++++++.-...++.+...+.++=|...... ..++.+..+..+.=-+++..+|+||...|+..|++++
T Consensus 230 ~s~~~~k~~~qI~iQs~iIC~f~~i~a~iyv~m~f~-------~~p~~~i~~~~~~Wql~~g~~~iIYl~lNrtIR~~~~ 302 (313)
T PF10321_consen 230 TSSQLSKAQRQIFIQSVIICFFHAIAAVIYVYMQFF-------PPPPWLIIIGQISWQLSHGCPPIIYLTLNRTIRNSVL 302 (313)
T ss_pred cchhhhhhhHHHHHHHHHHHHHHHHHHHHHHeeeee-------cccHHHHHHHHHHHhccCCccceEEEEECHHHHHHHH
Confidence 344455667777777666766667766665444432 1356677788888899999999999999999999999
Q ss_pred hhcc
Q psy15241 90 KRIP 93 (117)
Q Consensus 90 ~~~~ 93 (117)
++++
T Consensus 303 k~~~ 306 (313)
T PF10321_consen 303 KMLG 306 (313)
T ss_pred HHHc
Confidence 9885
No 14
>PF10323 7TM_GPCR_Srv: Serpentine type 7TM GPCR chemoreceptor Srv; InterPro: IPR019426 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class v (Srv) from the Srg superfamily [, ]. Srg receptors contain seven hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures.
Probab=97.51 E-value=0.00086 Score=46.01 Aligned_cols=81 Identities=15% Similarity=0.228 Sum_probs=52.5
Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCcchhHhH--HHHHHHhhhccccchhheecChHHHHHH
Q psy15241 11 KEGQSSAEVRIAKTAITLCALFVASWTPYAVVALTGAFGDQSLLTPGLTM--IPACTCKAVACLDPYVYAISHPRYRLEL 88 (117)
Q Consensus 11 ~~~~~~~~~k~~k~l~~iv~~f~icw~P~~~~~~~~~~~~~~~~~~~~~~--~~~~l~~~ns~~NPiiY~~~~~~fr~~~ 88 (117)
..+..++|.+.+.+..+.+.++++. +=|++...+....... ....+. +-..+.-..|.+||+.-.+.|+++|+.+
T Consensus 201 ~s~~~~rE~~L~~~~~i~~~a~~~~-~~~~~~~~~~~~~~~~--~~~~~~r~~y~~~~~~~s~inP~~LLi~n~~lr~~~ 277 (283)
T PF10323_consen 201 SSRSRRREIRLAIQVFILFCAFFVI-LVYYIFSNYFAQNFNT--DPIFYLRAFYPILNGLLSFINPWMLLIFNKDLRKQV 277 (283)
T ss_pred hhhhhhHhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhcccc--hHHHHHHHHHHHHHHHHHhhhhHHhhhccHHHHHHH
Confidence 4456677888777776555554433 3334443333332222 223333 5566677788899999999999999999
Q ss_pred Hhhccc
Q psy15241 89 SKRIPC 94 (117)
Q Consensus 89 ~~~~~~ 94 (117)
++.++|
T Consensus 278 ~~~~~~ 283 (283)
T PF10323_consen 278 RRMLKC 283 (283)
T ss_pred HHHcCC
Confidence 998876
No 15
>PF05296 TAS2R: Mammalian taste receptor protein (TAS2R); InterPro: IPR007960 This family consists of several forms of mammalian taste receptor proteins (TAS2Rs). TAS2Rs are G protein-coupled receptors expressed in subsets of taste receptor cells of the tongue and palate epithelia and are organised in the genome in clusters. The proteins are genetically linked to loci that influence bitter perception in mice and humans [].; GO: 0004930 G-protein coupled receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0050909 sensory perception of taste, 0016021 integral to membrane
Probab=97.40 E-value=0.002 Score=44.66 Aligned_cols=81 Identities=16% Similarity=0.204 Sum_probs=59.9
Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCcchhHhHHHHHHHhhhccccchhheecChHHHHHHHh
Q psy15241 11 KEGQSSAEVRIAKTAITLCALFVASWTPYAVVALTGAFGDQSLLTPGLTMIPACTCKAVACLDPYVYAISHPRYRLELSK 90 (117)
Q Consensus 11 ~~~~~~~~~k~~k~l~~iv~~f~icw~P~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~~~~ 90 (117)
++.+.++..|+.|++...++.|+++++-..+.. ...... .......+...+.++.+...|+|--+.|+++|+++++
T Consensus 222 ~~ps~~aH~~a~k~~~sfl~ly~~~~~~~~~~~-~~~~~~---~~~~~~~~~~~i~~~yps~hs~iLIlgn~KLr~~~~~ 297 (303)
T PF05296_consen 222 RDPSTEAHIRAIKTMISFLILYIIYFLSLILSF-LSFFFP---ENSIWFWVCEIIIALYPSGHSIILILGNPKLRQALLK 297 (303)
T ss_pred CCCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHhc---cccHHHHHHHHHHHHHHHHHHHHHhcCchHHHHHHHH
Confidence 456778888999998877777777665533322 222122 2345567888899999999999999999999999999
Q ss_pred hcccC
Q psy15241 91 RIPCL 95 (117)
Q Consensus 91 ~~~~~ 95 (117)
.+++.
T Consensus 298 il~~~ 302 (303)
T PF05296_consen 298 ILWCL 302 (303)
T ss_pred HHhhc
Confidence 88654
No 16
>PF10317 7TM_GPCR_Srd: Serpentine type 7TM GPCR chemoreceptor Srd; InterPro: IPR019421 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents the chemoreceptor Srd [].
Probab=97.04 E-value=0.0055 Score=42.17 Aligned_cols=78 Identities=15% Similarity=0.162 Sum_probs=62.5
Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCcchhHhHHHHHHHhhhccccchhheecChHHHHHHHh
Q psy15241 11 KEGQSSAEVRIAKTAITLCALFVASWTPYAVVALTGAFGDQSLLTPGLTMIPACTCKAVACLDPYVYAISHPRYRLELSK 90 (117)
Q Consensus 11 ~~~~~~~~~k~~k~l~~iv~~f~icw~P~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~~~~ 90 (117)
+++.++..++.+|.+.+-...=.++..|-....++....... .+...++...++.+-+.+||+++.+.-+.+|+.++|
T Consensus 215 S~~tk~~h~~lv~~Lt~Q~~lP~~~~~p~~~~~~~~~~~~~~--~~~~e~~~~~~~~~~~~~~P~itl~fv~PYR~~i~r 292 (292)
T PF10317_consen 215 SERTKSMHRQLVKGLTIQALLPLFFYIPGVIIYFLSQFTGYE--HPFLEYLIFMLASLPPLIDPLITLYFVRPYRKAILR 292 (292)
T ss_pred CHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHhccc--cHHHHHHHHHHHHHHHHhchHhheeeeHhHHHHhcC
Confidence 455566677888888888888889999888777776665544 566667777788889999999999999999999875
No 17
>PF10328 7TM_GPCR_Srx: Serpentine type 7TM GPCR chemoreceptor Srx; InterPro: IPR019430 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class x (Srx) from the Srg superfamily [, ]. Srg receptors contain seven hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures.
Probab=97.02 E-value=0.0059 Score=41.53 Aligned_cols=79 Identities=14% Similarity=0.116 Sum_probs=57.0
Q ss_pred ccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCcchhHhHHHHHHHhhhccccchhheecChHHHHH
Q psy15241 8 SGQKEGQSSAEVRIAKTAITLCALFVASWTPYAVVALTGAFGDQSLLTPGLTMIPACTCKAVACLDPYVYAISHPRYRLE 87 (117)
Q Consensus 8 ~~~~~~~~~~~~k~~k~l~~iv~~f~icw~P~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~ 87 (117)
..+.++++++|.+..+..++--..|++.++=|.....+. . .....+..+.+.-.+.-++|++|+.+.|+++|+.
T Consensus 195 ~~~s~~r~rke~~f~~Qs~~Q~~~~~i~~~~~~~~~~~~---~---~~~~~F~~~t~~w~~~h~~DG~i~l~fN~~~r~~ 268 (274)
T PF10328_consen 195 SSESKKRRRKEIRFFIQSFIQDLLYLIDLIFYFFIPPLS---S---NRWWQFFCTTFSWVLVHALDGLIMLIFNSEIRRK 268 (274)
T ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc---c---ccHHHHHHHHHHHHHHHHhcceeEeEEcHHHHHH
Confidence 345667788899999999999999999998887774431 1 1223333333334455568999999999999999
Q ss_pred HHhhc
Q psy15241 88 LSKRI 92 (117)
Q Consensus 88 ~~~~~ 92 (117)
+++..
T Consensus 269 ~~~~~ 273 (274)
T PF10328_consen 269 IRKKK 273 (274)
T ss_pred HHhcc
Confidence 88753
No 18
>PF05462 Dicty_CAR: Slime mold cyclic AMP receptor
Probab=97.02 E-value=0.0062 Score=42.35 Aligned_cols=67 Identities=13% Similarity=0.227 Sum_probs=52.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCCcchhHhHHHHHHHhhhccccchhheecChHHHHHHHhhccc
Q psy15241 25 AITLCALFVASWTPYAVVALTGAFGDQSLLTPGLTMIPACTCKAVACLDPYVYAISHPRYRLELSKRIPC 94 (117)
Q Consensus 25 l~~iv~~f~icw~P~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~~~~~~~~ 94 (117)
+..=.++|++||+|-.+..+.+.+++ .+..+..+-..++.+-..+|.++|.+-++-.++.+...+.+
T Consensus 204 L~~Yp~ifiicw~fa~INRI~~~~~~---~~~~l~~Lh~~~s~lqGf~nsivy~~n~~~~~~~~~~~~~~ 270 (303)
T PF05462_consen 204 LVNYPLIFIICWIFATINRIYNFIGK---NPFWLSVLHVGFSPLQGFFNSIVYGYNNSLMWRYLGSKILC 270 (303)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcC---CchHHHHHHHHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHH
Confidence 34567889999999999999888753 23455556667788888999999999999988888776643
No 19
>PF03402 V1R: Vomeronasal organ pheromone receptor family, V1R; InterPro: IPR004072 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The rhodopsin-like GPCRs themselves represent a widespread protein family that includes hormone, neurotransmitter and light receptors, all of which transduce extracellular signals through interaction with guanine nucleotide-binding (G) proteins. Although their activating ligands vary widely in structure and character, the amino acid sequences of the receptors are very similar and are believed to adopt a common structural framework comprising 7 transmembrane (TM) helices [, , ]. Pheromones have evolved in all animal phyla, to signal sex and dominance status, and are responsible for stereotypical social and sexual behaviour among members of the same species. In mammals, these chemical signals are believed to be detected primarily by the vomeronasal organ (VNO), a chemosensory organ located at the base of the nasal septum []. The VNO is present in most amphibia, reptiles and non-primate mammals but is absent in birds, adult catarrhine monkeys and apes []. An active role for the human VNO in the detection of pheromones is disputed; the VNO is clearly present in the foetus but appears to be atrophied or absent in adults. Three distinct families of putative pheromone receptors have been identified in the vomeronasal organ (V1Rs, V2Rs and V3Rs). All are G protein-coupled receptors but are only distantly related to the receptors of the main olfactory system, highlighting their different role []. The V1 receptors share between 50 and 90% sequence identity but have little similarity to other families of G protein-coupled receptors. They appear to be distantly related to the mammalian T2R bitter taste receptors and the rhodopsin-like GPCRs []. In rat, the family comprises 30-40 genes. These are expressed in the apical regions of the VNO, in neurons expressing Gi2. Coupling of the receptors to this protein mediates inositol trisphosphate signalling []. A number of human V1 receptor homologues have also been found. The majority of these human sequences are pseudogenes [] but an apparently functional receptor has been identified that is expressed in the human olfactory system [].; GO: 0016503 pheromone receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane
Probab=96.79 E-value=0.0025 Score=43.43 Aligned_cols=83 Identities=23% Similarity=0.245 Sum_probs=61.6
Q ss_pred cccccccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCcchhHhHHHHHHHhhhccccchhheecCh
Q psy15241 3 VESLRSGQKEGQSSAEVRIAKTAITLCALFVASWTPYAVVALTGAFGDQSLLTPGLTMIPACTCKAVACLDPYVYAISHP 82 (117)
Q Consensus 3 ~~~~~~~~~~~~~~~~~k~~k~l~~iv~~f~icw~P~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~ 82 (117)
+++.++.....+...|.|+++++.+.|..|+..|..-.++.++...... .+.+..+..+++.+-+.+-|++---.++
T Consensus 181 VqhIhs~~~s~r~SpE~RAtktILlLVs~FV~fY~l~si~~~~~~~~~~---~~~~~~~~~~ls~cfptisPfvLI~~d~ 257 (265)
T PF03402_consen 181 VQHIHSSSLSPRSSPETRATKTILLLVSTFVSFYGLSSILFIYLTSFKN---SPWLLNISVFLSSCFPTISPFVLISSDK 257 (265)
T ss_pred cccCcCCCCCCCCChhHHHhCeEeeHHHHHHHHHhHHHHHHHHHHHhcC---CcceeEHHHHHhHHhHhhChHHhhccCc
Confidence 4455555555666789999999999999999999998877665443322 2344567778888889999999888887
Q ss_pred HHHHHH
Q psy15241 83 RYRLEL 88 (117)
Q Consensus 83 ~fr~~~ 88 (117)
+..+-+
T Consensus 258 ~i~~~~ 263 (265)
T PF03402_consen 258 RIIKFL 263 (265)
T ss_pred hHHHHh
Confidence 766544
No 20
>PF02101 Ocular_alb: Ocular albinism type 1 protein; InterPro: IPR001414 Ocular albinism type 1 (OA1) is an X-linked disorder characterised by severe impairment of visual acuity, retinal hypopigmentation and the presence of macromelanosomes. A novel transcript from the OA1 critical region is expressed in high levels in RNA samples from retina and from melanoma and encodes a potential integral membrane protein []. This protein is of unknown function but is known to bind heterotrimeric G proteins.; GO: 0016020 membrane
Probab=95.91 E-value=0.031 Score=39.89 Aligned_cols=26 Identities=15% Similarity=0.258 Sum_probs=18.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy15241 21 IAKTAITLCALFVASWTPYAVVALTG 46 (117)
Q Consensus 21 ~~k~l~~iv~~f~icw~P~~~~~~~~ 46 (117)
+-+-.+.|+++|++||+|-.+--++.
T Consensus 242 IK~kFf~I~lVF~iCWlpNIINg~LL 267 (405)
T PF02101_consen 242 IKIKFFKIMLVFYICWLPNIINGSLL 267 (405)
T ss_pred HHHHHHHHHHHHHHHhhhhhHhHHHH
Confidence 33445678999999999987665533
No 21
>PF10327 7TM_GPCR_Sri: Serpentine type 7TM GPCR chemoreceptor Sri; InterPro: IPR019429 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents Sri, which is part of the Str superfamily of chemoreceptors.
Probab=95.76 E-value=0.029 Score=38.96 Aligned_cols=78 Identities=13% Similarity=0.068 Sum_probs=61.0
Q ss_pred ccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCcchhHhHHHHHHHhhhccccchhheecChHHHHH
Q psy15241 8 SGQKEGQSSAEVRIAKTAITLCALFVASWTPYAVVALTGAFGDQSLLTPGLTMIPACTCKAVACLDPYVYAISHPRYRLE 87 (117)
Q Consensus 8 ~~~~~~~~~~~~k~~k~l~~iv~~f~icw~P~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~ 87 (117)
.+-+.+..++.|++.+-+.+-.++-.+|-+|-.+.......... .......++.....+.|.+|-++--+.++.||+-
T Consensus 225 ~~iS~~ty~kHk~av~SLi~Q~~~~~i~~~P~~~~~~~~~~~~~--~~q~i~~~~~~~f~~HS~~n~ivli~t~ppYR~f 302 (303)
T PF10327_consen 225 KKISKQTYQKHKEAVRSLIAQFATSSICILPPFIFVVVVIFEFE--DAQVISEICLAIFSSHSSVNMIVLIITTPPYRKF 302 (303)
T ss_pred HHhCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhheecCC--CcHHHHHHHHHHHHHhhHhhheeeeEcCcchhhc
Confidence 33345566778899999998888889999998887765554432 2456667778888999999999999999999973
No 22
>PF10318 7TM_GPCR_Srh: Serpentine type 7TM GPCR chemoreceptor Srh; InterPro: IPR019422 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. Srh is part of the Str superfamily of chemoreceptors [].
Probab=95.58 E-value=0.078 Score=36.48 Aligned_cols=83 Identities=14% Similarity=0.151 Sum_probs=60.4
Q ss_pred cccccchhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHCCCCCcchhHhHHHHHHHhhhccccchhheecChHHHHH
Q psy15241 9 GQKEGQSSAEVRIAKTAITLCALF-VASWTPYAVVALTGAFGDQSLLTPGLTMIPACTCKAVACLDPYVYAISHPRYRLE 87 (117)
Q Consensus 9 ~~~~~~~~~~~k~~k~l~~iv~~f-~icw~P~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~ 87 (117)
+.+++..+..+|..+.+.+-+.+. ++.-.|.....+...... .....-+++.++....+.+.-++..+.++.+|+.
T Consensus 219 ~~S~~T~k~Qkkfl~~l~iQ~~ip~~~l~~P~~~~~~~~~~~~---~~q~~~n~~~~~~~~HG~~sti~mi~~~~pYR~~ 295 (302)
T PF10318_consen 219 TMSKKTRKMQKKFLIALIIQVLIPFIFLFIPLIYFIISIIFGY---YNQALNNISFIIISLHGIASTIVMILVHKPYRKF 295 (302)
T ss_pred hhCHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhccc---cccccchHHHHHHHhccHHHHHHHhhccHHHHHH
Confidence 334556666778888777755544 555668777765444332 2345567778888999999999999999999999
Q ss_pred HHhhccc
Q psy15241 88 LSKRIPC 94 (117)
Q Consensus 88 ~~~~~~~ 94 (117)
++++++|
T Consensus 296 ~~~~~~~ 302 (302)
T PF10318_consen 296 LLSLFRC 302 (302)
T ss_pred HHHHhcC
Confidence 9999865
No 23
>PF10326 7TM_GPCR_Str: Serpentine type 7TM GPCR chemoreceptor Str; InterPro: IPR019428 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class r (Str) from the Str superfamily [, ]. Almost a quarter (22.5%) of str and srj family genes and pseudogenes in C. elegans appear to have been newly formed by gene duplications since the species split [].
Probab=94.34 E-value=0.0093 Score=41.14 Aligned_cols=76 Identities=14% Similarity=0.164 Sum_probs=53.8
Q ss_pred cccchhHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHCCCCCcchhHhHHHHHHHhhhccccchhheecChHHHHHHH
Q psy15241 11 KEGQSSAEVRIAKTAIT-LCALFVASWTPYAVVALTGAFGDQSLLTPGLTMIPACTCKAVACLDPYVYAISHPRYRLELS 89 (117)
Q Consensus 11 ~~~~~~~~~k~~k~l~~-iv~~f~icw~P~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~~~ 89 (117)
+.+.++.+++..+.+++ .++-.++.+.|..+..+.-.++-.. .....+...+..+-+++||++--+.-+++|++++
T Consensus 231 S~~~~~lq~QLf~aLv~Qt~iP~i~~~~P~~~~~~~p~~~i~~---~~~~~~~~~~~~~yP~iDpl~~i~~ik~yR~~i~ 307 (307)
T PF10326_consen 231 SSKTRKLQKQLFKALVIQTIIPFIFMYIPVFIVFILPFFGIDL---GFFSNIISILISLYPAIDPLPVIFIIKDYRKAIK 307 (307)
T ss_pred ChhhHHHHHHHHHHHHHHhhhhheeeecchhheeeeeccCCCC---CccccHhhhhEEEEeehhhheeeEeeHHHHHhhC
Confidence 44455667777777776 4456778889988776654433222 2334556667888999999999999999999874
No 24
>KOG4193|consensus
Probab=92.68 E-value=1.6 Score=33.46 Aligned_cols=76 Identities=12% Similarity=0.055 Sum_probs=46.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCcchhHhHHHHHHHhhhccccchhheecChHHHHHHHhhcccCcccC
Q psy15241 20 RIAKTAITLCALFVASWTPYAVVALTGAFGDQSLLTPGLTMIPACTCKAVACLDPYVYAISHPRYRLELSKRIPCLGIKE 99 (117)
Q Consensus 20 k~~k~l~~iv~~f~icw~P~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~~~~~~~~~~~~~ 99 (117)
+..+.......++-+.|.=-....+ .+ ....+.+++.++-.+-.+.=-++|++..++.|++.++.++|+....
T Consensus 514 ~~~~~~l~L~~lLGlTW~fgi~s~~----~~---~~~v~~YlFti~NalQG~fIFi~~cll~~kvr~~~~k~~~~~~~~~ 586 (610)
T KOG4193|consen 514 SLIRSALALLFLLGLTWIFGIFSWL----PG---TSVVFAYLFTIFNALQGVFIFIFHCLLRKKVRKEYRKWLCCGRGDS 586 (610)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHh----cc---cchHHHHHHHHHHHhhhhHhhHhhhhhhHHHHHHHHHHhcccCCCC
Confidence 4455555555556666653222211 11 2344556666666666666678899999999999999887554444
Q ss_pred CCc
Q psy15241 100 KEP 102 (117)
Q Consensus 100 ~~~ 102 (117)
...
T Consensus 587 ~~~ 589 (610)
T KOG4193|consen 587 SAS 589 (610)
T ss_pred CCC
Confidence 333
No 25
>KOG4564|consensus
Probab=92.27 E-value=0.15 Score=37.66 Aligned_cols=83 Identities=14% Similarity=0.197 Sum_probs=50.5
Q ss_pred cccccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC-cchhHhHHHHHHHhhhccccchhheecChH
Q psy15241 5 SLRSGQKEGQSSAEVRIAKTAITLCALFVASWTPYAVVALTGAFGDQSL-LTPGLTMIPACTCKAVACLDPYVYAISHPR 83 (117)
Q Consensus 5 ~~~~~~~~~~~~~~~k~~k~l~~iv~~f~icw~P~~~~~~~~~~~~~~~-~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~ 83 (117)
++|..+ .......+|++|-..+.+-+|-|-++.+ .+..... .......+..+|...-..+=.++|+|.|++
T Consensus 337 KLr~~~-~~~~~~y~K~vKaTLvLIPLfGI~~ilf-------~~~P~~~~~~~v~~~~~~~L~SfQGf~VAvlYCFlN~E 408 (473)
T KOG4564|consen 337 KLRASN-ASETDQYRKLVKATLVLIPLFGIHYILF-------AFRPDEDTLREVYLYFELFLGSFQGFFVAVLYCFLNGE 408 (473)
T ss_pred HhcCcc-ccchHHHHHHHHHHHHHHHHcCCeeEEE-------EecCchHHHHHHHHHHHHHHHhccchheehheeecCHH
Confidence 444442 2233347777777666655554432221 1111111 112233566777888888889999999999
Q ss_pred HHHHHHhhcccC
Q psy15241 84 YRLELSKRIPCL 95 (117)
Q Consensus 84 fr~~~~~~~~~~ 95 (117)
.|.++|+.+...
T Consensus 409 Vq~elrr~W~r~ 420 (473)
T KOG4564|consen 409 VQAELRRKWSRW 420 (473)
T ss_pred HHHHHHHHHHhc
Confidence 999999988554
No 26
>PF03125 Sre: C. elegans Sre G protein-coupled chemoreceptor; InterPro: IPR004151 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class e (Sre) from the Sra superfamily []. ; GO: 0004888 transmembrane signaling receptor activity, 0007606 sensory perception of chemical stimulus, 0016021 integral to membrane
Probab=91.96 E-value=2.4 Score=30.26 Aligned_cols=42 Identities=19% Similarity=0.104 Sum_probs=32.5
Q ss_pred chhHhHHHHHHHhhhccccchhheecChHHHHHHHhhcccCc
Q psy15241 55 TPGLTMIPACTCKAVACLDPYVYAISHPRYRLELSKRIPCLG 96 (117)
Q Consensus 55 ~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~~~~~~~~~~ 96 (117)
......+......+++.+=|.+-....+..|+++++.++...
T Consensus 282 ~~~~~~~~e~~i~l~~~~i~~~~i~s~~~wrk~f~~~~~~~~ 323 (365)
T PF03125_consen 282 RTILNHIFENCIFLNPIFICPVIIFSVPSWRKEFKRSFPKIR 323 (365)
T ss_pred HHHHHHHHHHHHHHhhHHHHHHHHHcCHHHHHHHHHhhhhhh
Confidence 345556777888888888787777888999999999886543
No 27
>KOG2575|consensus
Probab=88.88 E-value=0.7 Score=33.70 Aligned_cols=26 Identities=31% Similarity=0.611 Sum_probs=22.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy15241 17 AEVRIAKTAITLCALFVASWTPYAVV 42 (117)
Q Consensus 17 ~~~k~~k~l~~iv~~f~icw~P~~~~ 42 (117)
.-.+++++-+++++.|+++|.|+...
T Consensus 253 ~f~ri~~ia~~Vv~TF~iiw~P~~~~ 278 (510)
T KOG2575|consen 253 SFARIIKIALAVVGTFVIIWLPFLLS 278 (510)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 35688899999999999999999744
No 28
>PF10319 7TM_GPCR_Srj: Serpentine type 7TM GPCR chemoreceptor Srj; InterPro: IPR019423 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class j (Srj) from the Str superfamily [, ]. The Srj family is designated as the out-group based on its location in preliminary phylogenetic analyses of the entire superfamily [].
Probab=87.75 E-value=2.3 Score=29.86 Aligned_cols=75 Identities=17% Similarity=0.228 Sum_probs=53.2
Q ss_pred cccchhHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHCCCCCcchhHhHHHHHHHhhhccccchhheecChHHHHH
Q psy15241 11 KEGQSSAEVRIAKTAITL-CALFVASWTPYAVVALTGAFGDQSLLTPGLTMIPACTCKAVACLDPYVYAISHPRYRLE 87 (117)
Q Consensus 11 ~~~~~~~~~k~~k~l~~i-v~~f~icw~P~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~ 87 (117)
+++..+-.++..|.+.+- ++--++|..|-.+.+..-.++ .+ ......+++......=+++||+.-.+.-+.||+.
T Consensus 234 S~~T~~lq~qL~~AL~vQT~IPi~vsf~Pc~~~wy~pif~-i~-~~~~~n~~~~iAls~FPf~DPlAii~~lP~~R~r 309 (310)
T PF10319_consen 234 SKKTKRLQRQLFKALIVQTVIPICVSFSPCVLSWYGPIFG-ID-LGRWNNYFSVIALSAFPFLDPLAIILCLPAFRNR 309 (310)
T ss_pred CHhHHHHHHHHHHHHHHHHHhHHHHhhccHHHHHhHHHHc-CC-hhHHHHHHHHHHHHHccccCchHhheecHHhhcc
Confidence 455566677788877764 445688999988876544433 22 4566677777777777899998888888888864
No 29
>KOG3827|consensus
Probab=83.45 E-value=1.2 Score=32.07 Aligned_cols=61 Identities=15% Similarity=0.239 Sum_probs=39.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCcchh-HhHHHHHHHhhhccccchhheec
Q psy15241 20 RIAKTAITLCALFVASWTPYAVVALTGAFGDQSLLTPG-LTMIPACTCKAVACLDPYVYAIS 80 (117)
Q Consensus 20 k~~k~l~~iv~~f~icw~P~~~~~~~~~~~~~~~~~~~-~~~~~~~l~~~ns~~NPiiY~~~ 80 (117)
|--.++.+....|++.|+-|.+++.+..+.+....... -.....+...+++....++|++-
T Consensus 61 kWR~~lliF~~sf~~SWl~Fg~iwwlIA~~hGDL~~~~~~~~~tpCV~nV~sf~sAFLFSiE 122 (400)
T KOG3827|consen 61 KWRWMLLIFSLSFVLSWLFFGVIWWLIAYAHGDLEPDPPGENHTPCVMNVHSFTSAFLFSIE 122 (400)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcccCCCCcCCCcceeeccchhhhheeeee
Confidence 44457778889999999999988877666554421111 12234456666666667776654
No 30
>PF02118 Srg: Srg family chemoreceptor; InterPro: IPR000609 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class g (Srg) from the Srg superfamily [, ]. Srg receptors contain seven hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures. ; GO: 0004888 transmembrane signaling receptor activity, 0007606 sensory perception of chemical stimulus, 0016020 membrane
Probab=83.35 E-value=7.3 Score=26.24 Aligned_cols=75 Identities=12% Similarity=0.072 Sum_probs=37.2
Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCcchhHhHHHHHHHhhhccccchhheecChHHHHH
Q psy15241 12 EGQSSAEVRIAKTAITLCALFVASWTPYAVVALTGAFGDQSLLTPGLTMIPACTCKAVACLDPYVYAISHPRYRLE 87 (117)
Q Consensus 12 ~~~~~~~~k~~k~l~~iv~~f~icw~P~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~ 87 (117)
.+.++.|++..+..++..+++.+.-+.-.+... ...............+..+..=.-+...|++-.+.+++.|+.
T Consensus 200 ~~~~~~er~L~~is~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~f~sD~ltl~~P~iLl~fs~~vR~~ 274 (275)
T PF02118_consen 200 KRIKSVERNLTIISFIISFVQLLIAIWQIINSF-AFFFDFLPISSFLYYLLPFASDLLTLSQPYILLIFSKNVRRQ 274 (275)
T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHhhccccHHHHHHHHHHHHHHHHHHHHHHHHheECHhhhcc
Confidence 344456777776666655555544443333211 111111111222222222222222445699999999999985
No 31
>PF04789 DUF621: Protein of unknown function (DUF621); InterPro: IPR006874 This is a conserved region found in uncharacterised proteins from Caenorhabditis elegans, and is noted to have possible G-protein-coupled receptor-like activity.
Probab=78.95 E-value=11 Score=26.24 Aligned_cols=76 Identities=18% Similarity=0.147 Sum_probs=45.9
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCcchhHhHHHHHHHhhhccccchhheecChHHHHHHHhhc
Q psy15241 14 QSSAEVRIAKTAITLCALFVASWTPYAVVALTGAFGDQSLLTPGLTMIPACTCKAVACLDPYVYAISHPRYRLELSKRI 92 (117)
Q Consensus 14 ~~~~~~k~~k~l~~iv~~f~icw~P~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~fr~~~~~~~ 92 (117)
..+.|.++.|.-. .+|++.-.+..++.+...........-....+...+-..-++..|-.+.+.+++.|+-+-...
T Consensus 221 LnK~Ei~ILKQAi---fIFvlFQ~ss~VFl~~qt~~~~~~tAFlIKR~iNT~ei~agaatP~ffffTs~eirk~~s~kv 296 (305)
T PF04789_consen 221 LNKNEITILKQAI---FIFVLFQASSCVFLLCQTFKFNVATAFLIKRIINTMEIFAGAATPCFFFFTSKEIRKLVSSKV 296 (305)
T ss_pred CCchhHHHHHHHH---HHHHHHHHHHHHHHhheeeeecchHHHHHHHHHHHHHHHhhccCCeEEEEehHHHHHHHHhhc
Confidence 4455777777744 455555566555544443322221111223445556667788999999999999998876543
No 32
>PF09889 DUF2116: Uncharacterized protein containing a Zn-ribbon (DUF2116); InterPro: IPR019216 This entry contains various hypothetical prokaryotic proteins whose functions are unknown. They contain a conserved zinc ribbon motif in the N-terminal part and a predicted transmembrane segment in the C-terminal part.
Probab=78.35 E-value=5 Score=21.02 Aligned_cols=24 Identities=17% Similarity=0.321 Sum_probs=14.4
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHH
Q psy15241 14 QSSAEVRIAKTAITLCALFVASWT 37 (117)
Q Consensus 14 ~~~~~~k~~k~l~~iv~~f~icw~ 37 (117)
++++.+|.-.++.+++++|++-|+
T Consensus 31 ~qk~~~~~~~i~~~~~i~~l~v~~ 54 (59)
T PF09889_consen 31 RQKRMRKTQYIFFGIFILFLAVWI 54 (59)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 444455666666666666665554
No 33
>PF10292 7TM_GPCR_Srab: Serpentine type 7TM GPCR receptor class ab chemoreceptor; InterPro: IPR019408 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. Srab is part of the Sra superfamily of chemoreceptors. The expression pattern of the srab genes is biologically intriguing. Of the six promoters successfully expressed in transgenic organisms, one was exclusively expressed in the tail phasmid neurons, two were exclusively expressed in a head amphid neuron, and two were expressed both in the head and tail neurons as well as a limited number of other cells [].
Probab=76.02 E-value=24 Score=24.74 Aligned_cols=80 Identities=13% Similarity=0.078 Sum_probs=51.1
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCcchhHhHH--HHHHHhhhccccchhheecChHHHHHHHhh
Q psy15241 14 QSSAEVRIAKTAITLCALFVASWTPYAVVALTGAFGDQSLLTPGLTMI--PACTCKAVACLDPYVYAISHPRYRLELSKR 91 (117)
Q Consensus 14 ~~~~~~k~~k~l~~iv~~f~icw~P~~~~~~~~~~~~~~~~~~~~~~~--~~~l~~~ns~~NPiiY~~~~~~fr~~~~~~ 91 (117)
|...+.|.++++...+.+..+.-+-+.+..+...........+....+ ........+.+=|++.....++.|++.++.
T Consensus 224 Ql~ENl~slr~L~p~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~y~al~e~~~~~P~Y~ii~~~~~~~~~k~~r~~~~~~ 303 (324)
T PF10292_consen 224 QLEENLRSLRLLKPFIILSSIFIFFYIFASIFLRLFNPSMSKPNYFALAELNHIFPLYSIISPLILYRKIKKIRKKRKKR 303 (324)
T ss_pred hHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 556688999998888887777777776665544332222112221122 233455677788888888888888887776
Q ss_pred cc
Q psy15241 92 IP 93 (117)
Q Consensus 92 ~~ 93 (117)
+.
T Consensus 304 l~ 305 (324)
T PF10292_consen 304 LE 305 (324)
T ss_pred HH
Confidence 63
No 34
>PF05393 Hum_adeno_E3A: Human adenovirus early E3A glycoprotein; InterPro: IPR008652 This family consists of several early glycoproteins (E3A), from human adenovirus type 2.; GO: 0016021 integral to membrane
Probab=70.05 E-value=7.6 Score=22.06 Aligned_cols=17 Identities=24% Similarity=0.264 Sum_probs=9.4
Q ss_pred HHHHHHHHHHHHHHHHH
Q psy15241 21 IAKTAITLCALFVASWT 37 (117)
Q Consensus 21 ~~k~l~~iv~~f~icw~ 37 (117)
+..+.+++++.+++||.
T Consensus 40 I~~iFil~VilwfvCC~ 56 (94)
T PF05393_consen 40 ICGIFILLVILWFVCCK 56 (94)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 33444556666666665
No 35
>PF03155 Alg6_Alg8: ALG6, ALG8 glycosyltransferase family; InterPro: IPR004856 N-linked (asparagine-linked) glycosylation of proteins is mediated by a highly conserved pathway in eukaryotes, in which a lipid (dolichol phosphate)-linked oligosaccharide is assembled at the endoplasmic reticulum membrane prior to the transfer of the oligosaccharide moiety to the target asparagine residues. This oligosaccharide is composed of Glc(3)Man(9)GlcNAc(2). The addition of the three glucose residues is the final series of steps in the synthesis of the oligosaccharide precursor. Alg6 transfers the first glucose residue, and Alg8 transfers the second one []. In the human alg6 gene, a C-T transition, which causes Ala333 to be replaced with Val, has been identified as the cause of a congenital disorder of glycosylation, designated as type Ic OMIM:603147 []. ; GO: 0016758 transferase activity, transferring hexosyl groups, 0005789 endoplasmic reticulum membrane
Probab=64.82 E-value=13 Score=27.87 Aligned_cols=27 Identities=26% Similarity=0.357 Sum_probs=23.1
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy15241 16 SAEVRIAKTAITLCALFVASWTPYAVV 42 (117)
Q Consensus 16 ~~~~k~~k~l~~iv~~f~icw~P~~~~ 42 (117)
..-+|..++-++++.+|.++|.|+...
T Consensus 218 ~~~~~~~~lg~~Vi~~f~~~~~PF~~~ 244 (469)
T PF03155_consen 218 FSIKRLIKLGIVVIATFALSFGPFLYS 244 (469)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 456889999999999999999999633
No 36
>PF04238 DUF420: Protein of unknown function (DUF420); InterPro: IPR007352 This is a predicted membrane protein with four transmembrane helices.
Probab=58.58 E-value=27 Score=21.41 Aligned_cols=41 Identities=27% Similarity=0.436 Sum_probs=28.3
Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC
Q psy15241 12 EGQSSAEVRIAKTAITLCALFVASWTPYAVVALTGAFGDQS 52 (117)
Q Consensus 12 ~~~~~~~~k~~k~l~~iv~~f~icw~P~~~~~~~~~~~~~~ 52 (117)
+++.+..++.....+....+|++||+-++...-...+.++.
T Consensus 28 ~~~~~~Hr~~Ml~a~~ls~lFlv~Yl~~~~~~g~~~f~g~~ 68 (133)
T PF04238_consen 28 RGRIKLHRKLMLTAFVLSALFLVSYLYYHFLGGSTPFGGPG 68 (133)
T ss_pred hCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccCCCc
Confidence 34556677888888888899999999777664333344433
No 37
>KOG1341|consensus
Probab=51.26 E-value=70 Score=25.28 Aligned_cols=36 Identities=22% Similarity=0.284 Sum_probs=27.4
Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy15241 12 EGQSSAEVRIAKTAITLCALFVASWTPYAVVALTGA 47 (117)
Q Consensus 12 ~~~~~~~~k~~k~l~~iv~~f~icw~P~~~~~~~~~ 47 (117)
..-..-|-|+.|.+..++++|++.|--...+.++.+
T Consensus 450 eeLggIEyRAlk~Lcsil~vY~l~~nIvafV~llv~ 485 (854)
T KOG1341|consen 450 EELGGIEYRALKCLCSILVVYFLGWNIVAFVTLLVF 485 (854)
T ss_pred hhhcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334455789999999999999999987766655443
No 38
>PF06072 Herpes_US9: Alphaherpesvirus tegument protein US9; InterPro: IPR009278 This family consists of several US9 and related proteins from the Alphaherpesviruses. The function of the US9 protein is unknown although in Bovine herpesvirus 5 Us9 is essential for the anterograde spread of the virus from the olfactory mucosa to the bulb [].; GO: 0019033 viral tegument
Probab=50.50 E-value=37 Score=17.80 Aligned_cols=25 Identities=12% Similarity=0.066 Sum_probs=14.1
Q ss_pred chhHHHHHHHHHHHHHHHH-HHHHHH
Q psy15241 14 QSSAEVRIAKTAITLCALF-VASWTP 38 (117)
Q Consensus 14 ~~~~~~k~~k~l~~iv~~f-~icw~P 38 (117)
..++++|...+.+.++..+ ++|-+-
T Consensus 24 ~~r~RrRrc~~~v~~v~~~~~~c~~S 49 (60)
T PF06072_consen 24 ASRRRRRRCRLAVAIVFAVVALCVLS 49 (60)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4455666666555555555 555544
No 39
>PF11298 DUF3099: Protein of unknown function (DUF3099); InterPro: IPR021449 Some members in this family of proteins are annotated as membrane proteins however this cannot be confirmed. Currently no function is known.
Probab=45.44 E-value=52 Score=18.02 Aligned_cols=30 Identities=17% Similarity=0.046 Sum_probs=21.1
Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHHH
Q psy15241 11 KEGQSSAEVRIAKTAITLCALFVASWTPYA 40 (117)
Q Consensus 11 ~~~~~~~~~k~~k~l~~iv~~f~icw~P~~ 40 (117)
.+.+.+++++-..++.+=+.+|++.+.=+.
T Consensus 10 ~~d~~~R~r~Y~i~M~~Ri~~fvlA~~~~~ 39 (73)
T PF11298_consen 10 SQDQRRRRRRYLIMMGIRIPCFVLAAVVYR 39 (73)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 445666778888777777777777766554
No 40
>PF02532 PsbI: Photosystem II reaction centre I protein (PSII 4.8 kDa protein); InterPro: IPR003686 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection []. This family represents the low molecular weight transmembrane protein PsbI, which is tightly associated with the D1/D2 heterodimer in PSII. The function of PsbI is unknown, but it may be involved in the assembly, dimerisation or stabilisation of PSII dimers [].; GO: 0015979 photosynthesis, 0009523 photosystem II, 0009539 photosystem II reaction center, 0016020 membrane; PDB: 3A0H_i 3ARC_I 3A0B_i 3BZ2_I 3PRQ_I 3KZI_I 3PRR_I 2AXT_i 4FBY_I 1S5L_i ....
Probab=43.66 E-value=37 Score=15.80 Aligned_cols=15 Identities=20% Similarity=0.239 Sum_probs=8.5
Q ss_pred HHHHHHHHHHHHHHH
Q psy15241 21 IAKTAITLCALFVAS 35 (117)
Q Consensus 21 ~~k~l~~iv~~f~ic 35 (117)
+.|+++.+++.|+++
T Consensus 3 ~LK~~Vy~vV~ffv~ 17 (36)
T PF02532_consen 3 TLKIFVYTVVIFFVS 17 (36)
T ss_dssp HHHHHHHHHHHHHHH
T ss_pred EEEEeehhhHHHHHH
Confidence 345666666665554
No 41
>PF06210 DUF1003: Protein of unknown function (DUF1003); InterPro: IPR010406 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=43.49 E-value=61 Score=19.14 Aligned_cols=56 Identities=14% Similarity=0.220 Sum_probs=32.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCcchhHhHHHHHHHhhhccccchhheecChH
Q psy15241 24 TAITLCALFVASWTPYAVVALTGAFGDQSLLTPGLTMIPACTCKAVACLDPYVYAISHPR 83 (117)
Q Consensus 24 ~l~~iv~~f~icw~P~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~NPiiY~~~~~~ 83 (117)
..+++.++|++.|.-.++..... .... ++.+..+...+.......-|+|-.=-|++
T Consensus 5 ~Fi~~~~~~~~~Wi~~N~~~~~~--~~fD--pyPFilLnl~lS~~Aa~~ap~IlmsQNRq 60 (108)
T PF06210_consen 5 TFIIIFTVFLAVWILLNILAPPR--PAFD--PYPFILLNLVLSLEAAYQAPLILMSQNRQ 60 (108)
T ss_pred HHHHHHHHHHHHHHHHHhhcccc--CCCC--CccHHHHHHHHHHHHHHHHHHHHHHhhHh
Confidence 34566777888888776655443 1112 23344555666666666778877644433
No 42
>TIGR01912 TatC-Arch Twin arginine targeting (Tat) protein translocase TatC, Archaeal clade. This model represents the TatC translocase component of the Sec-independent protein translocation system. This system is responsible for translocation of folded proteins, often with bound cofactors across the periplasmic membrane. A related model (TIGR00945) represents the bacterial clade of this family. TatC is often found (in bacteria) in a gene cluster with the two other components of the system, TatA/E (TIGR01411) and TatB (TIGR01410). A model also exists for the Twin-arginine signal sequence (TIGR01409).
Probab=42.96 E-value=1e+02 Score=20.80 Aligned_cols=32 Identities=9% Similarity=0.135 Sum_probs=24.3
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy15241 15 SSAEVRIAKTAITLCALFVASWTPYAVVALTG 46 (117)
Q Consensus 15 ~~~~~k~~k~l~~iv~~f~icw~P~~~~~~~~ 46 (117)
..-++|..+.+..+++.+.+||.|++.-.+++
T Consensus 7 ~ELR~Rli~~~i~~~~~~~~~~~~~~~~~i~~ 38 (237)
T TIGR01912 7 ATFRLILLLVALAIVTGSVLGWLTIFTPFIIA 38 (237)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 35578899999999999999997666544443
No 43
>PHA02657 hypothetical protein; Provisional
Probab=42.51 E-value=64 Score=18.27 Aligned_cols=24 Identities=17% Similarity=0.167 Sum_probs=16.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Q psy15241 24 TAITLCALFVASWTPYAVVALTGA 47 (117)
Q Consensus 24 ~l~~iv~~f~icw~P~~~~~~~~~ 47 (117)
++++.+.++.+|.+-|.++.+..+
T Consensus 27 imVitvfv~vI~il~flLLYLvkW 50 (95)
T PHA02657 27 ILVFTIFIFVVCILIYLLIYLVDW 50 (95)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHH
Confidence 456667777888887777766543
No 44
>PF10777 YlaC: Inner membrane protein YlaC; InterPro: IPR019713 The extracytoplasmic function (ECF) sigma factors are small regulatory proteins that are quite divergent in sequence relative to most other sigma factors. YlaC, regulated by YlaA, is important in oxidative stress resistance. It contributes to hydrogen peroxide resistance in Bacillus subtilis [].
Probab=40.75 E-value=64 Score=20.35 Aligned_cols=39 Identities=10% Similarity=0.204 Sum_probs=26.1
Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC
Q psy15241 11 KEGQSSAEVRIAKTAITLCALFVASWTPYAVVALTGAFG 49 (117)
Q Consensus 11 ~~~~~~~~~k~~k~l~~iv~~f~icw~P~~~~~~~~~~~ 49 (117)
++++.+=+.+.++.--...+.-+++|+|..++.+.....
T Consensus 21 RDnkprFs~~Fi~~HP~L~~~M~~~y~~~~~lm~~spy~ 59 (155)
T PF10777_consen 21 RDNKPRFSSSFIRNHPYLCLAMYAAYLAVAALMYYSPYF 59 (155)
T ss_pred cCCCccccHHHHHhCcHHHHHHHHHHHHHHHHHHhcchh
Confidence 344555566666666666677788999988777765543
No 45
>TIGR00934 2a38euk potassium uptake protein, Trk family. The proteins of the Trk family are derived from Gram-negative and Gram-positive bacteria, yeast and wheat. The proteins of E. coli K12 TrkH and TrkG as well as several yeast proteins have been functionally characterized.The E. coli TrkH and TrkG proteins are complexed to two peripheral membrane proteins, TrkA, an NAD-binding protein, and TrkE, an ATP-binding protein. This complex forms the potassium uptake system. This family is specific for the eukaryotic Trk system.
Probab=39.70 E-value=2.1e+02 Score=23.35 Aligned_cols=30 Identities=17% Similarity=0.187 Sum_probs=23.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy15241 18 EVRIAKTAITLCALFVASWTPYAVVALTGA 47 (117)
Q Consensus 18 ~~k~~k~l~~iv~~f~icw~P~~~~~~~~~ 47 (117)
|-|++|.++.++..|++.|--.-++.+..+
T Consensus 444 EYRaLk~L~~Iv~~Y~~~~~llG~i~l~~w 473 (800)
T TIGR00934 444 EYRALKCLCSIVLVYFLGFNILGFVLLLPW 473 (800)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 669999999999999988876665555444
No 46
>PHA02849 putative transmembrane protein; Provisional
Probab=39.06 E-value=71 Score=17.76 Aligned_cols=25 Identities=16% Similarity=0.270 Sum_probs=16.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Q psy15241 23 KTAITLCALFVASWTPYAVVALTGA 47 (117)
Q Consensus 23 k~l~~iv~~f~icw~P~~~~~~~~~ 47 (117)
-.+++.+.++.+|.+-|.++.+..+
T Consensus 16 ~v~vi~v~v~vI~i~~flLlyLvkw 40 (82)
T PHA02849 16 AVTVILVFVLVISFLAFMLLYLIKW 40 (82)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3455667777778777777766544
No 47
>COG4068 Uncharacterized protein containing a Zn-ribbon [Function unknown]
Probab=38.73 E-value=54 Score=17.20 Aligned_cols=12 Identities=17% Similarity=-0.020 Sum_probs=4.7
Q ss_pred HHHHHHHHHHHH
Q psy15241 17 AEVRIAKTAITL 28 (117)
Q Consensus 17 ~~~k~~k~l~~i 28 (117)
+++|.+.++...
T Consensus 38 ~R~r~~~~~~~l 49 (64)
T COG4068 38 KRQRNFMILMFL 49 (64)
T ss_pred HHHHHHHHHHHH
Confidence 333444444333
No 48
>PF10445 DUF2456: Protein of unknown function (DUF2456); InterPro: IPR018852 This entry represents a family of uncharacterised proteins.
Probab=36.91 E-value=86 Score=18.11 Aligned_cols=26 Identities=12% Similarity=0.170 Sum_probs=21.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy15241 18 EVRIAKTAITLCALFVASWTPYAVVA 43 (117)
Q Consensus 18 ~~k~~k~l~~iv~~f~icw~P~~~~~ 43 (117)
-.++.|-++..+..|++.|-|...+.
T Consensus 26 v~~~~rgli~av~~f~~~WP~tigIl 51 (94)
T PF10445_consen 26 VQQAIRGLILAVLIFLLLWPITIGIL 51 (94)
T ss_pred HHHHHHHHHHHHHHHHHHHhhHHHHh
Confidence 45788888899999999999976554
No 49
>PF15188 CCDC-167: Coiled-coil domain-containing protein 167
Probab=35.43 E-value=49 Score=18.72 Aligned_cols=18 Identities=28% Similarity=0.460 Sum_probs=11.9
Q ss_pred HHHHHHHHHHHHHHHHHH
Q psy15241 22 AKTAITLCALFVASWTPY 39 (117)
Q Consensus 22 ~k~l~~iv~~f~icw~P~ 39 (117)
-|.+.+.+.+|++|.+-|
T Consensus 67 rK~~~ls~~l~~v~~LvY 84 (85)
T PF15188_consen 67 RKSMLLSVALFFVCFLVY 84 (85)
T ss_pred hhhHHHHHHHHHHHHHHc
Confidence 455666777777777654
No 50
>COG3821 Predicted membrane protein [Function unknown]
Probab=35.40 E-value=37 Score=22.62 Aligned_cols=17 Identities=24% Similarity=0.550 Sum_probs=12.5
Q ss_pred HHHHHHHHHHHHHHHHH
Q psy15241 26 ITLCALFVASWTPYAVV 42 (117)
Q Consensus 26 ~~iv~~f~icw~P~~~~ 42 (117)
.+.+..|++||+-|.-.
T Consensus 7 ~lA~~~F~~~W~~yt~~ 23 (234)
T COG3821 7 LLALAVFLLCWLLYTPA 23 (234)
T ss_pred HHHHHHHHHHHHhhhHH
Confidence 35677899999987543
No 51
>PF07010 Endomucin: Endomucin; InterPro: IPR010740 This family consists of several mammalian endomucin proteins. Endomucin is an early endothelial-specific antigen that is also expressed on putative hematopoietic progenitor cells.
Probab=34.96 E-value=46 Score=22.56 Aligned_cols=7 Identities=29% Similarity=0.354 Sum_probs=2.7
Q ss_pred HHHHHHH
Q psy15241 40 AVVALTG 46 (117)
Q Consensus 40 ~~~~~~~ 46 (117)
.++-++.
T Consensus 206 ~LvgLyr 212 (259)
T PF07010_consen 206 TLVGLYR 212 (259)
T ss_pred HHHHHHH
Confidence 3333433
No 52
>PF05391 Lsm_interact: Lsm interaction motif; InterPro: IPR008669 This short motif is found at the C terminus of Prp24 proteins and probably interacts with the Lsm proteins to promote U4/U6 formation [].
Probab=34.80 E-value=26 Score=14.29 Aligned_cols=10 Identities=20% Similarity=0.268 Sum_probs=6.4
Q ss_pred cChHHHHHHH
Q psy15241 80 SHPRYRLELS 89 (117)
Q Consensus 80 ~~~~fr~~~~ 89 (117)
.|.+||+-+.
T Consensus 11 SNddFrkmfl 20 (21)
T PF05391_consen 11 SNDDFRKMFL 20 (21)
T ss_pred chHHHHHHHc
Confidence 4677777653
No 53
>COG2322 Predicted membrane protein [Function unknown]
Probab=34.03 E-value=1e+02 Score=19.90 Aligned_cols=31 Identities=19% Similarity=0.149 Sum_probs=23.6
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy15241 14 QSSAEVRIAKTAITLCALFVASWTPYAVVAL 44 (117)
Q Consensus 14 ~~~~~~k~~k~l~~iv~~f~icw~P~~~~~~ 44 (117)
..+..+++..+-.....+|+++|+-++...-
T Consensus 71 ~i~~Hk~aMltA~~l~l~FlvlYltr~~l~~ 101 (177)
T COG2322 71 NIEKHKRAMLTAFTLALVFLVLYLTRHGLGG 101 (177)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHhccc
Confidence 4556677888888888899999988776543
No 54
>PF14992 TMCO5: TMCO5 family
Probab=32.19 E-value=1e+02 Score=21.54 Aligned_cols=24 Identities=8% Similarity=0.232 Sum_probs=12.0
Q ss_pred HHHHHHHHHHHHHH--HHHHHHHHHH
Q psy15241 19 VRIAKTAITLCALF--VASWTPYAVV 42 (117)
Q Consensus 19 ~k~~k~l~~iv~~f--~icw~P~~~~ 42 (117)
.++.+.++.++++| ++||+-+++.
T Consensus 215 kr~lr~l~f~vL~f~~LL~y~~f~~~ 240 (280)
T PF14992_consen 215 KRALRLLFFMVLFFTRLLGYLLFYIQ 240 (280)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 35555554444444 5566655533
No 55
>PF01679 Pmp3: Proteolipid membrane potential modulator; InterPro: IPR000612 Proteolipid membrane potential modulator is an evolutionarily conserved proteolipid in the plasma membrane which, in S. pombe, is transcriptionally regulated by the Spc1 stress MAPK (mitogen-activated protein kinases) pathway. It functions to modulate the membrane potential, particularly to resist high cellular cation concentration. In eukaryotic organisms, stress-activated mitogen-activated protein kinases play crucial roles in transmitting environmental signals that will regulate gene expression for allowing the cell to adapt to cellular stress. Pmp3-like proteins are highly conserved in bacteria, yeast, nematode and plants. Proteins in this entry include the PMP3 as well as several other proteins that have been shown [] to be evolutionary related. These are small proteins of from 52 to 140 amino-acid resiudes that contain two transmembrane domains and belong to the UPF0057 (PMP3) protein family.; GO: 0016021 integral to membrane
Probab=31.38 E-value=52 Score=16.60 Aligned_cols=22 Identities=27% Similarity=0.310 Sum_probs=14.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH
Q psy15241 23 KTAITLCALFVASWTPYAVVAL 44 (117)
Q Consensus 23 k~l~~iv~~f~icw~P~~~~~~ 44 (117)
+-+.+-++..++.|.|-.+..+
T Consensus 25 ~~~~inl~Ltl~g~iPg~ihA~ 46 (51)
T PF01679_consen 25 KDFWINLLLTLLGWIPGVIHAL 46 (51)
T ss_pred hhhHHHHHHHHHHHHHHHHHee
Confidence 3445666677888999766543
No 56
>KOG4686|consensus
Probab=31.12 E-value=1.1e+02 Score=22.13 Aligned_cols=24 Identities=17% Similarity=0.581 Sum_probs=19.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Q psy15241 20 RIAKTAITLCALFVASWTPYAVVA 43 (117)
Q Consensus 20 k~~k~l~~iv~~f~icw~P~~~~~ 43 (117)
-...+++++.++|.+-|.|+..+-
T Consensus 264 ppfw~~~iicv~yyva~fPFi~lg 287 (459)
T KOG4686|consen 264 PPFWVLVIICVLYYVAWFPFITLG 287 (459)
T ss_pred ccHHHHHHHHHHHHHHHHHHhhhh
Confidence 346788999999999999997653
No 57
>COG1823 Predicted Na+/dicarboxylate symporter [General function prediction only]
Probab=30.46 E-value=2.2e+02 Score=21.19 Aligned_cols=58 Identities=21% Similarity=0.216 Sum_probs=34.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCcch---------------hHhHHHHHHHhhhccccchhhe
Q psy15241 21 IAKTAITLCALFVASWTPYAVVALTGAFGDQSLLTP---------------GLTMIPACTCKAVACLDPYVYA 78 (117)
Q Consensus 21 ~~k~l~~iv~~f~icw~P~~~~~~~~~~~~~~~~~~---------------~~~~~~~~l~~~ns~~NPiiY~ 78 (117)
...-++.-++.+++-.+||-++.+.......++... ....+.+.+...-.-+||+-|+
T Consensus 221 t~~~ivM~lV~~VirLTPYgV~AlMtkv~ats~~~~I~~Lg~FivAsY~Ai~~MFvvH~iLL~~~g~NPi~Y~ 293 (458)
T COG1823 221 TLQSIVMKLVRLVIRLTPYGVLALMTKVVATSNLEDIIKLGSFIVASYIAIFIMFVVHGILLALNGVNPIKYF 293 (458)
T ss_pred HHHHHHHHHHHHHHHhccHHHHHHHHHHHhhccHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHH
Confidence 334445566778899999999887554333221111 1112334455556678999886
No 58
>COG2383 Uncharacterized conserved protein [Function unknown]
Probab=29.84 E-value=89 Score=18.35 Aligned_cols=19 Identities=21% Similarity=0.321 Sum_probs=15.3
Q ss_pred HHHHHHHHHHHHHHHHHHH
Q psy15241 17 AEVRIAKTAITLCALFVAS 35 (117)
Q Consensus 17 ~~~k~~k~l~~iv~~f~ic 35 (117)
+-+|+.|++..++.+|++.
T Consensus 35 Alkk~ikvvl~liGvyils 53 (109)
T COG2383 35 ALKKAIKVVLLLIGVYILS 53 (109)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 4678999999999988653
No 59
>KOG1726|consensus
Probab=29.75 E-value=86 Score=21.20 Aligned_cols=14 Identities=29% Similarity=0.598 Sum_probs=10.6
Q ss_pred HHHHHHHHHHHHHH
Q psy15241 28 LCALFVASWTPYAV 41 (117)
Q Consensus 28 iv~~f~icw~P~~~ 41 (117)
.+.-|+++|+|++.
T Consensus 50 ~~~d~~lsw~P~Y~ 63 (225)
T KOG1726|consen 50 TLTDFLLSWFPFYS 63 (225)
T ss_pred HHHHHHHHHhhhHH
Confidence 44567899999873
No 60
>PF11119 DUF2633: Protein of unknown function (DUF2633); InterPro: IPR022576 This family is conserved largely in Proteobacteria. Several members are named as YfgG. The function is not known.
Probab=28.72 E-value=97 Score=16.23 Aligned_cols=31 Identities=6% Similarity=0.158 Sum_probs=17.4
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy15241 16 SAEVRIAKTAITLCALFVASWTPYAVVALTG 46 (117)
Q Consensus 16 ~~~~k~~k~l~~iv~~f~icw~P~~~~~~~~ 46 (117)
+...+.+|+++++..++++.-+-|..+.-+.
T Consensus 4 k~~~~mtriVLLISfiIlfgRl~Y~~I~a~~ 34 (59)
T PF11119_consen 4 KKNSRMTRIVLLISFIILFGRLIYSAIGAWV 34 (59)
T ss_pred cccchHHHHHHHHHHHHHHHHHHHHHHhHHH
Confidence 3445666666666665555555555554443
No 61
>PRK06761 hypothetical protein; Provisional
Probab=26.98 E-value=74 Score=22.20 Aligned_cols=34 Identities=9% Similarity=0.197 Sum_probs=26.3
Q ss_pred hHHHHHHHhhhccccchhheecChHHHHHHHhhcc
Q psy15241 59 TMIPACTCKAVACLDPYVYAISHPRYRLELSKRIP 93 (117)
Q Consensus 59 ~~~~~~l~~~ns~~NPiiY~~~~~~fr~~~~~~~~ 93 (117)
-++..+...+. -+||+++.+...|..+.+++.+.
T Consensus 169 ~y~~~l~~~i~-~l~p~l~yl~~~dv~~~~~~~~~ 202 (282)
T PRK06761 169 NYVMKLAKIIE-NLNPMLFYLEQDDVEFSFRKALK 202 (282)
T ss_pred HHHHHHHHHHh-ccCcEEEEecccCHHHHHHHHHH
Confidence 34555555555 68999999999999999988763
No 62
>PF12609 DUF3774: Wound-induced protein; InterPro: IPR022251 This family of proteins is found in eukaryotes. Proteins in this family are typically between 81 and 97 amino acids in length. The proteins in the family are often annotated as wound-induced proteins however there is little accompanying literature to confirm this.
Probab=26.35 E-value=4.4 Score=22.52 Aligned_cols=8 Identities=25% Similarity=1.315 Sum_probs=5.6
Q ss_pred HHHHHHHH
Q psy15241 31 LFVASWTP 38 (117)
Q Consensus 31 ~f~icw~P 38 (117)
.|+-||.|
T Consensus 72 MyLSCWGP 79 (79)
T PF12609_consen 72 MYLSCWGP 79 (79)
T ss_pred EEEeccCc
Confidence 46678877
No 63
>PF04505 Dispanin: Interferon-induced transmembrane protein; InterPro: IPR007593 This family includes the human leukocyte antigen CD225, which is an interferon inducible transmembrane protein, and is associated with interferon induced cell growth suppression [].; GO: 0009607 response to biotic stimulus, 0016021 integral to membrane
Probab=25.13 E-value=64 Score=17.89 Aligned_cols=20 Identities=15% Similarity=0.085 Sum_probs=12.1
Q ss_pred HHHHHHHHHHHHHHHHHHHH
Q psy15241 28 LCALFVASWTPYAVVALTGA 47 (117)
Q Consensus 28 iv~~f~icw~P~~~~~~~~~ 47 (117)
.++..++|+.|.-++.++..
T Consensus 22 sI~s~l~Cc~PlGi~Ai~~s 41 (82)
T PF04505_consen 22 SIFSTLCCCWPLGIVAIVYS 41 (82)
T ss_pred HHHHHHHHHhhHHHHHheec
Confidence 34444466669777766543
No 64
>KOG0812|consensus
Probab=24.61 E-value=1.2e+02 Score=21.44 Aligned_cols=19 Identities=16% Similarity=0.036 Sum_probs=9.1
Q ss_pred HHHHHHHHHHHHHHHHHHH
Q psy15241 18 EVRIAKTAITLCALFVASW 36 (117)
Q Consensus 18 ~~k~~k~l~~iv~~f~icw 36 (117)
+.-.+|+++++++.|++.|
T Consensus 289 RwLmvkiF~i~ivFflvfv 307 (311)
T KOG0812|consen 289 RWLMVKIFGILIVFFLVFV 307 (311)
T ss_pred hHHHHHHHHHHHHHHHHHH
Confidence 3344555554444444444
No 65
>PF05399 EVI2A: Ectropic viral integration site 2A protein (EVI2A); InterPro: IPR008608 This family contains several mammalian ectropic viral integration site 2A (EVI2A) proteins. The function of this protein is unknown although it is thought to be a membrane protein and may function as an oncogene in retrovirus induced myeloid tumours [, ].; GO: 0016021 integral to membrane
Probab=24.58 E-value=2.1e+02 Score=19.31 Aligned_cols=23 Identities=13% Similarity=0.170 Sum_probs=14.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Q psy15241 22 AKTAITLCALFVASWTPYAVVAL 44 (117)
Q Consensus 22 ~k~l~~iv~~f~icw~P~~~~~~ 44 (117)
...+++|.++|+||-+-+...++
T Consensus 131 LIClIIIAVLfLICT~LfLSTVV 153 (227)
T PF05399_consen 131 LICLIIIAVLFLICTLLFLSTVV 153 (227)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 33444566678888777654443
No 66
>PF10601 zf-LITAF-like: LITAF-like zinc ribbon domain; InterPro: IPR006629 Members of this family display a conserved zinc ribbon structure [] with the motif C-XX-C- separated from the more C-terminal HX-C(P)X-C-X4-G-R motif by a variable region of usually 25-30 (hydrophobic) residues. Although it belongs to one of the zinc finger's fold groups (zinc ribbon), this particular domain was first identified in LPS-induced tumour necrosis alpha factor (LITAF) which is produced in mammalian cells after being challenged with lipopolysaccharide (LPS). The hydrophobic region probably inserts into the membrane rather than traversing it. Such an insertion brings together the N- and C-terminal C-XX-C motifs to form a compact Zn2+-binding structure [].
Probab=24.48 E-value=1.2e+02 Score=16.11 Aligned_cols=8 Identities=13% Similarity=0.509 Sum_probs=4.7
Q ss_pred HHHHHHHH
Q psy15241 34 ASWTPYAV 41 (117)
Q Consensus 34 icw~P~~~ 41 (117)
+||+|+..
T Consensus 44 ~~~iP~~~ 51 (73)
T PF10601_consen 44 CCCIPFCC 51 (73)
T ss_pred HhhHhhcc
Confidence 46666654
No 67
>PF06196 DUF997: Protein of unknown function (DUF997); InterPro: IPR010398 This is a family of predicted bacterial membrane protein with unknown function.
Probab=24.44 E-value=1.4e+02 Score=16.59 Aligned_cols=16 Identities=19% Similarity=0.501 Sum_probs=9.3
Q ss_pred HHHHHHHHHHHHHHHH
Q psy15241 24 TAITLCALFVASWTPY 39 (117)
Q Consensus 24 ~l~~iv~~f~icw~P~ 39 (117)
..++..++|++.|.=+
T Consensus 9 ~tl~l~l~yf~~W~~~ 24 (80)
T PF06196_consen 9 WTLGLTLIYFAWWYGF 24 (80)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3344566777777543
No 68
>PF14257 DUF4349: Domain of unknown function (DUF4349)
Probab=24.37 E-value=1.6e+02 Score=19.99 Aligned_cols=18 Identities=11% Similarity=0.288 Sum_probs=9.5
Q ss_pred HHHHHHHHHHHHHHHHHH
Q psy15241 28 LCALFVASWTPYAVVALT 45 (117)
Q Consensus 28 iv~~f~icw~P~~~~~~~ 45 (117)
+++++++-|+|+.++..+
T Consensus 238 ~~l~~l~p~~~~~~~~~~ 255 (262)
T PF14257_consen 238 VFLVGLLPWLPLILIIGL 255 (262)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 334555666666554443
No 69
>PF04272 Phospholamban: Phospholamban; InterPro: IPR005984 Phospholamban (PLB) is a small protein (52 amino acids) that regulates the affinity of the cardiac sarcoplasmic reticulum Ca2+-ATPase (SERCA2a) for calcium. PLB is present in cardiac myocytes, in slow-twitch and smooth muscle and is expressed also in aorta endothelial cells in which it could play a role in tissue relaxation. The phosphorylation/dephosphorylation of phospholamban removes and restores, respectively, its inhibitory activity on SERCA2a. It has in fact been shown that phospholamban, in its non-phosphorylated form, binds to SERCA2a and inhibits this pump by lowering its affinity for Ca2+, whereas the phosphorylated form does not exert the inhibition. PLB is phosphorylated at two sites, namely at Ser-16 for a cAMP-dependent phosphokinase and at Thr-17 for a Ca2+/calmodulin-dependent phosphokinase, phosphorylation at Ser-16 being a prerequisite for the phosphorylation at Thr-17. The structure of a 36-amino-acid-long N-terminal fragment of human phospholamban phosphorylated at Ser-16 and Thr-17 and Cys36Ser mutated was determined from nuclear magnetic resonance data. The peptide assumes a conformation characterised by two alpha-helices connected by an irregular strand, which comprises the amino acids from Arg-13 to Pro-21. The proline is in a trans conformation. The two phosphate groups on Ser-16 and Thr-17 are shown to interact preferably with the side chains of Arg-14 and Arg-13, respectively [].; GO: 0005246 calcium channel regulator activity, 0042030 ATPase inhibitor activity, 0006816 calcium ion transport, 0016020 membrane; PDB: 1N7L_A 1FJP_A 1FJK_A 2HYN_C 1ZLL_D 1PLP_A 3O7L_I.
Probab=24.28 E-value=1.1e+02 Score=15.12 Aligned_cols=9 Identities=0% Similarity=-0.086 Sum_probs=3.7
Q ss_pred HHHHHHHHH
Q psy15241 32 FVASWTPYA 40 (117)
Q Consensus 32 f~icw~P~~ 40 (117)
-++|-+-..
T Consensus 38 ilicllli~ 46 (52)
T PF04272_consen 38 ILICLLLIC 46 (52)
T ss_dssp HHHHHHHHH
T ss_pred HHHHHHHHH
Confidence 344444333
No 70
>KOG4561|consensus
Probab=24.25 E-value=2.6e+02 Score=19.66 Aligned_cols=31 Identities=16% Similarity=-0.084 Sum_probs=17.2
Q ss_pred HHHHHHHhhhccccchhheecChHHHHHHHh
Q psy15241 60 MIPACTCKAVACLDPYVYAISHPRYRLELSK 90 (117)
Q Consensus 60 ~~~~~l~~~ns~~NPiiY~~~~~~fr~~~~~ 90 (117)
.+...+...|.++|-.--.|+.+..|.+.|.
T Consensus 236 ~~~~~~l~~~~~L~v~Nl~Wf~km~~ga~K~ 266 (281)
T KOG4561|consen 236 FLPLFLLGLNALLLVLNLYWFSKMVRGALKV 266 (281)
T ss_pred hHhHHHHHHHHHHHhhHHHHHHHHHHHHHHh
Confidence 4455555566665554455555666655554
No 71
>PF06716 DUF1201: Protein of unknown function (DUF1201); InterPro: IPR009591 This entry consists of several Beet yellows virus (BYV) putative membrane-binding proteins of around 54 residues in length. The function of this currently unknown.
Probab=24.24 E-value=1.1e+02 Score=15.21 Aligned_cols=15 Identities=20% Similarity=0.257 Sum_probs=6.7
Q ss_pred HHHHHHHHHHHHHHH
Q psy15241 28 LCALFVASWTPYAVV 42 (117)
Q Consensus 28 iv~~f~icw~P~~~~ 42 (117)
....|++|..-+...
T Consensus 11 ~~F~~lIC~Fl~~~~ 25 (54)
T PF06716_consen 11 LAFGFLICLFLFCLV 25 (54)
T ss_pred HHHHHHHHHHHHHHH
Confidence 344445554444333
No 72
>PRK09731 putative general secretion pathway protein YghD; Provisional
Probab=23.55 E-value=1.8e+02 Score=18.95 Aligned_cols=28 Identities=21% Similarity=0.387 Sum_probs=18.5
Q ss_pred chhHHHHHHHHHH---HHHHHHHHHHHHHHH
Q psy15241 14 QSSAEVRIAKTAI---TLCALFVASWTPYAV 41 (117)
Q Consensus 14 ~~~~~~k~~k~l~---~iv~~f~icw~P~~~ 41 (117)
...+|++++..+. +++++|...|.|..-
T Consensus 32 ls~REq~ll~~~g~vL~l~i~Y~~iWqPl~~ 62 (178)
T PRK09731 32 RSPREKGMLLAAVVFLFSVGYYVLIWQPLSE 62 (178)
T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhHHHH
Confidence 4456777666554 455677888888754
No 73
>COG1687 AzlD Predicted branched-chain amino acid permeases (azaleucine resistance) [Amino acid transport and metabolism]
Probab=23.52 E-value=1.7e+02 Score=17.29 Aligned_cols=23 Identities=13% Similarity=0.326 Sum_probs=18.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Q psy15241 21 IAKTAITLCALFVASWTPYAVVA 43 (117)
Q Consensus 21 ~~k~l~~iv~~f~icw~P~~~~~ 43 (117)
..-++.++++.|+.=|+||.++.
T Consensus 6 ~l~i~v~~v~t~~~RilPF~if~ 28 (106)
T COG1687 6 ILTIIVIAVGTFLTRILPFLIFK 28 (106)
T ss_pred HHHHHHHHHHHHHHHHHHHHhcC
Confidence 44566778899999999998763
No 74
>PRK10633 hypothetical protein; Provisional
Probab=23.20 E-value=1.5e+02 Score=16.55 Aligned_cols=24 Identities=25% Similarity=0.392 Sum_probs=14.9
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHH
Q psy15241 16 SAEVRIAKTAITLCALFVASWTPY 39 (117)
Q Consensus 16 ~~~~k~~k~l~~iv~~f~icw~P~ 39 (117)
++-+|=++......++|++.|.=.
T Consensus 6 ~Qa~kEA~~al~L~l~y~~~W~~~ 29 (80)
T PRK10633 6 VQAHKEARWALGLTLLYLAAWLVA 29 (80)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344444555566777888888654
No 75
>COG5058 LAG1 Protein transporter of the TRAM (translocating chain-associating membrane) superfamily, longevity assurance factor [Intracellular trafficking and secretion]
Probab=23.14 E-value=2.2e+02 Score=20.65 Aligned_cols=62 Identities=6% Similarity=0.067 Sum_probs=38.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC--CCCCcchhHhHHHHHHHhhhccccchhheec
Q psy15241 19 VRIAKTAITLCALFVASWTPYAVVALTGAFG--DQSLLTPGLTMIPACTCKAVACLDPYVYAIS 80 (117)
Q Consensus 19 ~k~~k~l~~iv~~f~icw~P~~~~~~~~~~~--~~~~~~~~~~~~~~~l~~~ns~~NPiiY~~~ 80 (117)
+..--+++-.++.-++-|+-|+.....--.. -.-+....+..++.++-++|+.+-+.+++++
T Consensus 231 kD~~elv~HHIVTllLI~lSY~fhftr~GlAI~itmDvSD~~Ls~sK~lnYl~~~l~~~iF~iF 294 (395)
T COG5058 231 KDFKELVFHHIVTLLLIWLSYVFHFTRMGLAIYITMDVSDFFLSLSKTLNYLNSVLATFIFGIF 294 (395)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHhccceEEEEEeccHHHHHHHHHHHHhhchhHHHHHHHH
Confidence 3344455667777788888887654421100 0113456667788888888888777776654
No 76
>COG4041 Predicted membrane protein [Function unknown]
Probab=22.35 E-value=1.8e+02 Score=18.08 Aligned_cols=21 Identities=5% Similarity=0.243 Sum_probs=15.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q psy15241 26 ITLCALFVASWTPYAVVALTG 46 (117)
Q Consensus 26 ~~iv~~f~icw~P~~~~~~~~ 46 (117)
+-+++.-++||+-|.++..+-
T Consensus 10 v~i~~a~~i~wlnfv~idt~m 30 (171)
T COG4041 10 VKIIIAGIICWLNFVLIDTYM 30 (171)
T ss_pred HHHHHHHHHHHHHHHHHHHhh
Confidence 345667789999998876543
No 77
>KOG4220|consensus
Probab=21.96 E-value=3.6e+02 Score=20.47 Aligned_cols=76 Identities=8% Similarity=-0.076 Sum_probs=43.0
Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC------------CcchhHhHHHHHHHhhhccccchhheec
Q psy15241 13 GQSSAEVRIAKTAITLCALFVASWTPYAVVALTGAFGDQS------------LLTPGLTMIPACTCKAVACLDPYVYAIS 80 (117)
Q Consensus 13 ~~~~~~~k~~k~l~~iv~~f~icw~P~~~~~~~~~~~~~~------------~~~~~~~~~~~~l~~~ns~~NPiiY~~~ 80 (117)
...|..+|+..|+.+.=++-++.|.|.... |++..+.. +.+...+--+..-.|+--.+=-++|.-.
T Consensus 139 rakRTtkrA~~MI~~AW~iSfiLWaPaIl~--WqyivGkrTv~~~eC~iQFlsnp~iTfGTAiAAFYlPVtiM~~LY~rI 216 (503)
T KOG4220|consen 139 RAKRTTKRAGLMIGAAWVLSFVLWAPAILF--WQYIVGKRTVPDGECYIQFLSNPAITFGTAIAAFYLPVTIMTILYWRI 216 (503)
T ss_pred cccccchHHHHHHHHHHHHHHHHHHHHHHh--hHhheeeeecCCCceEEEeecCceeehhHHHHHHHhhHHHHHHHHHHH
Confidence 345567788888888888999999997655 34433322 1111111111112233333445678877
Q ss_pred ChHHHHHHHh
Q psy15241 81 HPRYRLELSK 90 (117)
Q Consensus 81 ~~~fr~~~~~ 90 (117)
.++-|+..++
T Consensus 217 yret~kR~k~ 226 (503)
T KOG4220|consen 217 YRETRKRQKE 226 (503)
T ss_pred HHHHHHHHHh
Confidence 7777666655
No 78
>PRK06099 F0F1 ATP synthase subunit I; Validated
Probab=21.48 E-value=1.7e+02 Score=17.81 Aligned_cols=19 Identities=16% Similarity=0.434 Sum_probs=13.0
Q ss_pred HHHHHHHHHHHHHHHHHCC
Q psy15241 32 FVASWTPYAVVALTGAFGD 50 (117)
Q Consensus 32 f~icw~P~~~~~~~~~~~~ 50 (117)
.++||+|..++.......+
T Consensus 45 gli~~LPn~~Fa~~aF~~~ 63 (126)
T PRK06099 45 FLSAFLPFCLFVYWIFFRK 63 (126)
T ss_pred hHHHHHhHHHHHHHHHHhc
Confidence 3678899888877664333
Done!