Query psy5550
Match_columns 127
No_of_seqs 106 out of 1364
Neff 8.9
Searched_HMMs 46136
Date Fri Aug 16 20:25:01 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy5550.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/5550hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4219|consensus 99.7 5.8E-17 1.3E-21 121.6 5.5 92 35-126 27-119 (423)
2 PHA03234 DNA packaging protein 99.6 2.3E-15 4.9E-20 112.9 9.0 86 39-125 28-115 (338)
3 KOG4220|consensus 99.5 2.7E-15 5.9E-20 113.1 -2.3 84 42-125 29-113 (503)
4 PHA02834 chemokine receptor-li 99.4 8.2E-13 1.8E-17 98.4 9.2 81 42-124 27-107 (323)
5 PHA02638 CC chemokine receptor 99.4 2.7E-12 5.9E-17 98.8 9.3 81 41-123 96-176 (417)
6 PHA03087 G protein-coupled che 99.3 2.4E-12 5.3E-17 95.8 7.1 84 41-125 38-121 (335)
7 PHA03235 DNA packaging protein 99.3 2.3E-11 5.1E-16 93.4 9.7 85 40-125 29-115 (409)
8 PF00001 7tm_1: 7 transmembran 99.0 5E-10 1.1E-14 78.5 5.5 66 60-125 1-67 (257)
9 KOG2087|consensus 98.2 6.5E-07 1.4E-11 67.2 2.0 84 41-125 22-114 (363)
10 PF10320 7TM_GPCR_Srsx: Serpen 98.2 9.1E-07 2E-11 64.2 1.8 67 55-123 2-69 (257)
11 PF11710 Git3: G protein-coupl 97.3 0.0023 4.9E-08 45.0 7.8 53 72-124 30-83 (201)
12 PF05462 Dicty_CAR: Slime mold 96.9 0.008 1.7E-07 44.9 7.9 77 45-125 8-85 (303)
13 PF05296 TAS2R: Mammalian tast 96.6 0.026 5.7E-07 42.0 9.1 74 45-118 8-85 (303)
14 PF10328 7TM_GPCR_Srx: Serpent 95.5 0.047 1E-06 39.7 6.0 42 53-94 3-44 (274)
15 PF10324 7TM_GPCR_Srw: Serpent 95.5 0.024 5.3E-07 41.9 4.5 51 53-104 6-58 (318)
16 PF10321 7TM_GPCR_Srt: Serpent 95.4 0.12 2.6E-06 38.8 7.8 77 41-122 30-107 (313)
17 PF03402 V1R: Vomeronasal orga 95.1 0.052 1.1E-06 39.8 4.8 51 72-124 5-57 (265)
18 PF10317 7TM_GPCR_Srd: Serpent 91.7 0.72 1.6E-05 34.0 6.0 46 49-94 4-50 (292)
19 PF00002 7tm_2: 7 transmembran 85.5 0.59 1.3E-05 33.1 1.9 73 51-124 8-81 (242)
20 PF01102 Glycophorin_A: Glycop 59.3 14 0.00031 23.9 3.1 9 65-73 85-93 (122)
21 PF10327 7TM_GPCR_Sri: Serpent 53.6 19 0.00041 26.8 3.4 62 45-106 10-76 (303)
22 PF10316 7TM_GPCR_Srbc: Serpen 46.0 1.1E+02 0.0024 22.6 6.3 56 47-102 9-65 (273)
23 PF09882 DUF2109: Predicted me 45.8 64 0.0014 19.2 4.2 46 56-101 6-52 (78)
24 PF10323 7TM_GPCR_Srv: Serpent 45.0 36 0.00078 25.0 3.7 35 60-94 11-49 (283)
25 PF12304 BCLP: Beta-casein lik 44.0 1.1E+02 0.0024 21.4 7.9 51 42-94 39-89 (188)
26 PF10873 DUF2668: Protein of u 41.3 13 0.00027 25.0 0.7 35 42-76 60-94 (155)
27 PF04789 DUF621: Protein of un 38.7 1.6E+02 0.0034 22.2 6.1 48 58-105 29-81 (305)
28 PF10329 DUF2417: Region of un 38.3 77 0.0017 23.0 4.4 37 58-94 84-120 (232)
29 PF02532 PsbI: Photosystem II 35.0 63 0.0014 16.2 2.5 17 45-61 10-26 (36)
30 PF11446 DUF2897: Protein of u 34.2 61 0.0013 17.9 2.7 10 57-66 15-24 (55)
31 TIGR01477 RIFIN variant surfac 34.1 63 0.0014 24.9 3.6 30 47-76 311-340 (353)
32 COG1230 CzcD Co/Zn/Cd efflux s 32.3 2.2E+02 0.0048 21.4 7.0 66 50-118 127-194 (296)
33 PTZ00046 rifin; Provisional 30.9 78 0.0017 24.5 3.6 29 48-76 317-345 (358)
34 PF02009 Rifin_STEVOR: Rifin/s 29.8 53 0.0011 24.7 2.5 27 49-75 259-285 (299)
35 PF05393 Hum_adeno_E3A: Human 25.2 93 0.002 19.1 2.5 6 65-70 50-55 (94)
36 PF08114 PMP1_2: ATPase proteo 25.1 47 0.001 17.3 1.1 19 52-70 13-31 (43)
37 PF02101 Ocular_alb: Ocular al 24.9 2.7E+02 0.0059 22.0 5.6 59 45-103 28-93 (405)
38 PF06024 DUF912: Nucleopolyhed 24.6 84 0.0018 19.5 2.4 6 68-73 86-91 (101)
39 KOG4564|consensus 24.3 3.9E+02 0.0084 21.7 10.0 47 48-94 149-195 (473)
40 PF02468 PsbN: Photosystem II 20.8 1.2E+02 0.0025 16.0 2.1 31 47-77 7-37 (43)
41 PF10319 7TM_GPCR_Srj: Serpent 20.7 2.1E+02 0.0045 21.7 4.2 49 52-100 13-65 (310)
42 KOG4193|consensus 20.5 3.3E+02 0.0071 22.8 5.6 56 63-125 339-396 (610)
43 CHL00024 psbI photosystem II p 20.5 50 0.0011 16.6 0.6 17 45-61 10-26 (36)
44 PHA03164 hypothetical protein; 20.3 1E+02 0.0022 18.3 2.0 21 74-94 55-75 (88)
No 1
>KOG4219|consensus
Probab=99.67 E-value=5.8e-17 Score=121.59 Aligned_cols=92 Identities=21% Similarity=0.366 Sum_probs=85.0
Q ss_pred cCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCchHHHHHHHHHHHHHHHHh-HHHHHHHHHhcCccccccc
Q psy5550 35 EFGNREFEMSAEICLLSIIFILSMGANLFVLITVMWYPALQTVTNYFLLNLTAADILFCL-SIPGIMYARVSPQWPLGDV 113 (127)
Q Consensus 35 ~~~~~~~~~~~~~~~~~~i~~~~i~gN~~vl~~~~~~~~~~~~~~~~l~nla~~Dl~~~~-~~p~~~~~~~~~~w~~g~~ 113 (127)
....+..+..+++++|+++.+++++||++|+|++..+|++|+.+|+|++|||+||++.++ ..|+.......+.|.+|.+
T Consensus 27 ~f~lp~~~~~~wai~yg~l~~vAv~GN~iVlwIil~hrrMRtvtnyfL~NLAfADl~~s~Fn~~f~f~yal~~~W~~G~f 106 (423)
T KOG4219|consen 27 LFVLPAWQQALWAIAYGLLVFVAVVGNLIVLWIILAHRRMRTVTNYFLVNLAFADLSMSIFNTVFNFQYALHQEWYFGSF 106 (423)
T ss_pred cccCCHHHHHHHHHHHHHHHHHHHhcCceEEEEEeehhehhhhHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhccccccc
Confidence 456788888999999999999999999999999999999999999999999999999999 9999988888889999999
Q ss_pred ccccchhhhhhcC
Q psy5550 114 LCKIMPYSQMCSE 126 (127)
Q Consensus 114 ~C~~~~~~~~~~~ 126 (127)
.|++..|+...++
T Consensus 107 ~C~f~nf~~itav 119 (423)
T KOG4219|consen 107 YCRFVNFFPITAV 119 (423)
T ss_pred eeeeccccchhhh
Confidence 9999999876553
No 2
>PHA03234 DNA packaging protein UL33; Provisional
Probab=99.62 E-value=2.3e-15 Score=112.93 Aligned_cols=86 Identities=19% Similarity=0.284 Sum_probs=71.8
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHH--hhcCCCCchHHHHHHHHHHHHHHHHhHHHHHHHHHhcCcccccccccc
Q psy5550 39 REFEMSAEICLLSIIFILSMGANLFVLITV--MWYPALQTVTNYFLLNLTAADILFCLSIPGIMYARVSPQWPLGDVLCK 116 (127)
Q Consensus 39 ~~~~~~~~~~~~~~i~~~~i~gN~~vl~~~--~~~~~~~~~~~~~l~nla~~Dl~~~~~~p~~~~~~~~~~w~~g~~~C~ 116 (127)
.+....+++++|.+++++|++||++|++++ .+++++|+++|+|++|||++|++.++.+|+.+... .++|++|+..||
T Consensus 28 ~~~~~~~~~~~y~~vf~~gl~gN~lvl~v~~~~~~~~~rt~tn~fi~NLAvaDLL~~l~lp~~~~~~-~~~w~fG~~lCk 106 (338)
T PHA03234 28 LKKAQILESAINGIMLTLIIPMIIIVICTLIIYHKVAKHNATSFYLITLFASDFLHMLCVFFLTLNR-EALFNFNQAFCQ 106 (338)
T ss_pred HHHHHHHhhHHHHHHHHHHhhhHHHHHHHHHHHhccccccHHHHHHHHHHHHHHHHHHHHHHHHHHH-hCCccCchhHHH
Confidence 445667889999999999999999999844 46667789999999999999999988777766543 457999999999
Q ss_pred cchhhhhhc
Q psy5550 117 IMPYSQMCS 125 (127)
Q Consensus 117 ~~~~~~~~~ 125 (127)
+..++...+
T Consensus 107 ~~~~~~~~~ 115 (338)
T PHA03234 107 CVLFIYHAS 115 (338)
T ss_pred HHHHHHHHH
Confidence 998877654
No 3
>KOG4220|consensus
Probab=99.46 E-value=2.7e-15 Score=113.11 Aligned_cols=84 Identities=25% Similarity=0.433 Sum_probs=77.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCchHHHHHHHHHHHHHHHHh-HHHHHHHHHhcCcccccccccccchh
Q psy5550 42 EMSAEICLLSIIFILSMGANLFVLITVMWYPALQTVTNYFLLNLTAADILFCL-SIPGIMYARVSPQWPLGDVLCKIMPY 120 (127)
Q Consensus 42 ~~~~~~~~~~~i~~~~i~gN~~vl~~~~~~~~~~~~~~~~l~nla~~Dl~~~~-~~p~~~~~~~~~~w~~g~~~C~~~~~ 120 (127)
+.++++++...+.++.++||++||..+..+|++|+..|+||++||+||++.+. .+|+...+.+.|+|++|...|.+...
T Consensus 29 q~v~i~~v~~~lsLVTv~GNlLVmiSfKvnrqLqTVnNYfLfSLAcADliIG~~SMnl~t~Y~lmg~W~LG~~~CdlWLa 108 (503)
T KOG4220|consen 29 QVVFIVVVTGSLSLVTVVGNLLVMISFKVNRQLQTVNNYFLFSLACADLIIGAFSMNLYTTYTLMGYWPLGPLVCDLWLA 108 (503)
T ss_pred EEEeeehhhhHHHHHhhhccEEEEEEEEecceeeeecceeehHHHHhhhhhheeechHHHHHHHHcccccchHHHHHHHH
Confidence 33456777888899999999999999999999999999999999999999999 99999999999999999999999988
Q ss_pred hhhhc
Q psy5550 121 SQMCS 125 (127)
Q Consensus 121 ~~~~~ 125 (127)
+.++.
T Consensus 109 lDYva 113 (503)
T KOG4220|consen 109 LDYVA 113 (503)
T ss_pred HHHHh
Confidence 87653
No 4
>PHA02834 chemokine receptor-like protein; Provisional
Probab=99.43 E-value=8.2e-13 Score=98.36 Aligned_cols=81 Identities=20% Similarity=0.433 Sum_probs=66.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCchHHHHHHHHHHHHHHHHhHHHHHHHHHhcCcccccccccccchhh
Q psy5550 42 EMSAEICLLSIIFILSMGANLFVLITVMWYPALQTVTNYFLLNLTAADILFCLSIPGIMYARVSPQWPLGDVLCKIMPYS 121 (127)
Q Consensus 42 ~~~~~~~~~~~i~~~~i~gN~~vl~~~~~~~~~~~~~~~~l~nla~~Dl~~~~~~p~~~~~~~~~~w~~g~~~C~~~~~~ 121 (127)
...+..+++.+++++|++||+++++++.++|+ +++.|+|++|||++|++..+.+|+.+.... ++|.+|+..|++..++
T Consensus 27 ~~~~~~~~~~li~v~~~~gN~lVi~vi~~~~~-~~~~n~~i~nLAiaDll~~~~lP~~i~~~~-~~w~~g~~~C~~~~~~ 104 (323)
T PHA02834 27 VNYFVIVFYILLFIFGLIGNVLVIAVLIVKRF-MFVVDVYLFNIAMSDLMLVFSFPFIIHNDL-NEWIFGEFMCKLVLGV 104 (323)
T ss_pred hhhhHHHHHHHHHHHHHhhHHHHHHHHHhccc-cchhhhhhHHHHHHHHHHHHHHHHHHHHHc-CCcCCcchHHHhHHHH
Confidence 34477899999999999999999998887665 467899999999999986449998876554 5799999999998766
Q ss_pred hhh
Q psy5550 122 QMC 124 (127)
Q Consensus 122 ~~~ 124 (127)
...
T Consensus 105 ~~~ 107 (323)
T PHA02834 105 YFV 107 (323)
T ss_pred HHH
Confidence 543
No 5
>PHA02638 CC chemokine receptor-like protein; Provisional
Probab=99.38 E-value=2.7e-12 Score=98.77 Aligned_cols=81 Identities=27% Similarity=0.548 Sum_probs=68.8
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCchHHHHHHHHHHHHHHHHhHHHHHHHHHhcCcccccccccccchh
Q psy5550 41 FEMSAEICLLSIIFILSMGANLFVLITVMWYPALQTVTNYFLLNLTAADILFCLSIPGIMYARVSPQWPLGDVLCKIMPY 120 (127)
Q Consensus 41 ~~~~~~~~~~~~i~~~~i~gN~~vl~~~~~~~~~~~~~~~~l~nla~~Dl~~~~~~p~~~~~~~~~~w~~g~~~C~~~~~ 120 (127)
....+..+++.+++++|++||.++++++.+ |++|+++|++++|||++|++..+.+|+++... .++|.+|+..||+..+
T Consensus 96 ~~~~~l~~~y~lvfvlgliGN~LVl~il~~-k~lrt~t~i~llnLAisDLl~~l~lPf~i~~~-~~~W~fg~~~Ck~~~~ 173 (417)
T PHA02638 96 SISEYIKIFYIIIFILGLFGNAAIIMILFC-KKIKTITDIYIFNLAISDLIFVIDFPFIIYNE-FDQWIFGDFMCKVISA 173 (417)
T ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHHHh-ccCCCHhHHHHHHHHHHHHHHHHHHHHHHHHH-hccccccccchhhHHH
Confidence 345578889999999999999999977654 78899999999999999999866999988765 4689999999999876
Q ss_pred hhh
Q psy5550 121 SQM 123 (127)
Q Consensus 121 ~~~ 123 (127)
+..
T Consensus 174 l~~ 176 (417)
T PHA02638 174 SYY 176 (417)
T ss_pred HHH
Confidence 544
No 6
>PHA03087 G protein-coupled chemokine receptor-like protein; Provisional
Probab=99.35 E-value=2.4e-12 Score=95.84 Aligned_cols=84 Identities=26% Similarity=0.490 Sum_probs=71.2
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCchHHHHHHHHHHHHHHHHhHHHHHHHHHhcCcccccccccccchh
Q psy5550 41 FEMSAEICLLSIIFILSMGANLFVLITVMWYPALQTVTNYFLLNLTAADILFCLSIPGIMYARVSPQWPLGDVLCKIMPY 120 (127)
Q Consensus 41 ~~~~~~~~~~~~i~~~~i~gN~~vl~~~~~~~~~~~~~~~~l~nla~~Dl~~~~~~p~~~~~~~~~~w~~g~~~C~~~~~ 120 (127)
....+..+++.+++++|++||+++++++.++ ++|++.|+++.|||++|++.++..|........++|.+|+..|++..+
T Consensus 38 ~~~~~~~~~~~~i~~~gl~gN~lvl~~~~~~-~~~~~~~~ll~~laisDll~~~~~~~~~~~~~~~~~~~~~~~C~~~~~ 116 (335)
T PHA03087 38 TNSTILIVVYSTIFFFGLVGNIIVIYVLTKT-KIKTPMDIYLLNLAVSDLLFVMTLPFQIYYYILFQWSFGEFACKIVSG 116 (335)
T ss_pred chhhHHHHHHHHHHHHHHHhhHhEEeeehhc-cccCchHHHHHHHHHHHHHHHHhHHHHHHHHhCCCCCCCcHHHHHHHH
Confidence 3444778889999999999999999888887 889999999999999999887777877766666789999999999888
Q ss_pred hhhhc
Q psy5550 121 SQMCS 125 (127)
Q Consensus 121 ~~~~~ 125 (127)
+...+
T Consensus 117 ~~~~~ 121 (335)
T PHA03087 117 LYYIG 121 (335)
T ss_pred HHHHH
Confidence 76543
No 7
>PHA03235 DNA packaging protein UL33; Provisional
Probab=99.29 E-value=2.3e-11 Score=93.44 Aligned_cols=85 Identities=20% Similarity=0.213 Sum_probs=63.0
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-cCCC-CchHHHHHHHHHHHHHHHHhHHHHHHHHHhcCccccccccccc
Q psy5550 40 EFEMSAEICLLSIIFILSMGANLFVLITVMW-YPAL-QTVTNYFLLNLTAADILFCLSIPGIMYARVSPQWPLGDVLCKI 117 (127)
Q Consensus 40 ~~~~~~~~~~~~~i~~~~i~gN~~vl~~~~~-~~~~-~~~~~~~l~nla~~Dl~~~~~~p~~~~~~~~~~w~~g~~~C~~ 117 (127)
...+.+..+++.+++++|++||++|++++.+ +|++ ++..++|++|||++|++..+.+|+.+... ...|..|...|++
T Consensus 29 ~~~~~~~~~~~~li~vvGiigN~lVL~~~~~~~r~~~~~~~~~~I~NLAvsDLl~l~~lP~~i~~~-~~~~~~g~~~Ck~ 107 (409)
T PHA03235 29 SAARTTETFINLLIISVGGPLNLIVLVTQLLANRVHGFSTPTLYMTNLYLANLLTVFVLPFIMLSN-QGLLSGSVAGCKF 107 (409)
T ss_pred hhhHhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCCccHHHHHHHHHHHHHHHHHHHHHHHhc-CccccCCCCeehh
Confidence 3455688999999999999999999987543 3332 35668999999999998744999887532 1223445789999
Q ss_pred chhhhhhc
Q psy5550 118 MPYSQMCS 125 (127)
Q Consensus 118 ~~~~~~~~ 125 (127)
..++...+
T Consensus 108 ~~~l~~~~ 115 (409)
T PHA03235 108 ASLLYYAS 115 (409)
T ss_pred HHHHHHHH
Confidence 98876654
No 8
>PF00001 7tm_1: 7 transmembrane receptor (rhodopsin family) Rhodopsin-like GPCR superfamily signature 5-hydroxytryptamine 7 receptor signature bradykinin receptor signature gastrin receptor signature melatonin receptor signature olfactory receptor signature; InterPro: IPR000276 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The rhodopsin-like GPCRs themselves represent a widespread protein family that includes hormone, neurotransmitter and light receptors, all of which transduce extracellular signals through interaction with guanine nucleotide-binding (G) proteins. Although their activating ligands vary widely in structure and character, the amino acid sequences of the receptors are very similar and are believed to adopt a common structural framework comprising 7 transmembrane (TM) helices [, , ].; GO: 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane; PDB: 2KI9_A 3QAK_A 2YDV_A 3VGA_A 3PWH_A 3RFM_A 3EML_A 3VG9_A 3REY_A 3UZA_A ....
Probab=99.02 E-value=5e-10 Score=78.51 Aligned_cols=66 Identities=32% Similarity=0.592 Sum_probs=58.3
Q ss_pred HHHHHHHHHhhcCCCCchHHHHHHHHHHHHHHHHh-HHHHHHHHHhcCcccccccccccchhhhhhc
Q psy5550 60 ANLFVLITVMWYPALQTVTNYFLLNLTAADILFCL-SIPGIMYARVSPQWPLGDVLCKIMPYSQMCS 125 (127)
Q Consensus 60 gN~~vl~~~~~~~~~~~~~~~~l~nla~~Dl~~~~-~~p~~~~~~~~~~w~~g~~~C~~~~~~~~~~ 125 (127)
||.+++.++.++|++|++.++++.|||++|++.++ ..|........++|.++...|++..++...+
T Consensus 1 GN~lvi~~~~~~~~~~~~~~~~l~~Lav~Dll~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~ 67 (257)
T PF00001_consen 1 GNILVILVILRSKRLRTPSNILLLNLAVADLLVGLFCIPFYIYSLLFDDWIFSSFLCRIFGFLFYFS 67 (257)
T ss_dssp HHHHHHHHHHHSGGG-SHHHHHHHHHHHHHHHHHHTHHHHHHHHHHHSSCTSHHHHHHHHHHHHHHH
T ss_pred CchhehhhhhhhccCCChhHHHHHHHHHHHHhhcccccccccccccccccccccccccccccccccc
Confidence 89999999999999999999999999999999999 8888877777678999999999998876543
No 9
>KOG2087|consensus
Probab=98.21 E-value=6.5e-07 Score=67.17 Aligned_cols=84 Identities=18% Similarity=0.207 Sum_probs=64.7
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCchHHHHHHHHHHHHHHHHh-HHHHHHHHHh-cC-------ccccc
Q psy5550 41 FEMSAEICLLSIIFILSMGANLFVLITVMWYPALQTVTNYFLLNLTAADILFCL-SIPGIMYARV-SP-------QWPLG 111 (127)
Q Consensus 41 ~~~~~~~~~~~~i~~~~i~gN~~vl~~~~~~~~~~~~~~~~l~nla~~Dl~~~~-~~p~~~~~~~-~~-------~w~~g 111 (127)
....++.+..++++.++++||.+|++.+...|...++..+++.|||++|+++++ ..-+..++.. .+ .|..|
T Consensus 22 lg~~~lRi~vW~i~~lAi~gN~~Vl~~~~~~~~~~~~~~~li~~la~ad~~mGiYl~~ia~vD~~~~gey~~~ai~W~tg 101 (363)
T KOG2087|consen 22 LGYWILRISVWVIALLAIVGNLLVLLTRFTSRYELNSHRFLICNLAFADLLMGIYLGLIASVDAKTRGEYYKHAIDWQTG 101 (363)
T ss_pred hccceeeehhhhhhhHHhccCeeeeeeeeehhhhccchHHHHHHHHHHHHHcchHHHHHHHhhHHHHHHHHHHHHhhhhc
Confidence 333466677788899999999999988888888778899999999999999998 4444444332 22 27655
Q ss_pred ccccccchhhhhhc
Q psy5550 112 DVLCKIMPYSQMCS 125 (127)
Q Consensus 112 ~~~C~~~~~~~~~~ 125 (127)
..|++.+|+.+++
T Consensus 102 -~gC~~aGflavFA 114 (363)
T KOG2087|consen 102 -LGCPVAGFLAVFA 114 (363)
T ss_pred -CCCchHHHHHHHH
Confidence 7899999987765
No 10
>PF10320 7TM_GPCR_Srsx: Serpentine type 7TM GPCR chemoreceptor Srsx; InterPro: IPR019424 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class sx (Srsx), which is a solo family amongst the superfamilies of chemoreceptors. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' [].
Probab=98.16 E-value=9.1e-07 Score=64.18 Aligned_cols=67 Identities=16% Similarity=0.288 Sum_probs=53.1
Q ss_pred HHHHHHHHHHHHHHhhcCCCCchHHHHHHHHHHHHHHHHh-HHHHHHHHHhcCcccccccccccchhhhh
Q psy5550 55 ILSMGANLFVLITVMWYPALQTVTNYFLLNLTAADILFCL-SIPGIMYARVSPQWPLGDVLCKIMPYSQM 123 (127)
Q Consensus 55 ~~~i~gN~~vl~~~~~~~~~~~~~~~~l~nla~~Dl~~~~-~~p~~~~~~~~~~w~~g~~~C~~~~~~~~ 123 (127)
++|+.||..++.++.++|++|++.++++..+|++|++... .+|..+... .+ -......|-.+.+...
T Consensus 2 ~ig~~gN~~~i~~~~~~~~Lrs~~~~li~~~~~~d~~~~~~~~~~~~~~~-~~-~~i~~~~Cf~~~~~~~ 69 (257)
T PF10320_consen 2 IIGLFGNLLLIILIFRNKSLRSPCYILICILCFADLICLLGTLPFMLFLF-RD-HQITRSECFWQIFFYI 69 (257)
T ss_pred EEEEEccHHHHHHHHhccccccchHHHHHHHHHHHHHHHhhHHHHHHHHH-hh-eeccHHHHHHHHHHHH
Confidence 4689999999999999999999999999999999999999 888776443 22 2345566766555443
No 11
>PF11710 Git3: G protein-coupled glucose receptor regulating Gpa2; InterPro: IPR023041 This entry contains a functionally uncharacterised region belonging to the Git3 G-protein coupled receptor. Git3 is one of six proteins required for glucose-triggered adenylate cyclase activation, and is a G protein-coupled receptor responsible for the activation of adenylate cyclase through Gpa2 - heterotrimeric G protein alpha subunit, part of the glucose-detection pathway. Git3 contains seven predicted transmembrane domains, a third cytoplasmic loop and a cytoplasmic tail []. This is the conserved N-terminal domain of the member proteins.
Probab=97.26 E-value=0.0023 Score=45.02 Aligned_cols=53 Identities=13% Similarity=0.088 Sum_probs=39.4
Q ss_pred CCCCchHHHHHHHHHHHHHHHHh-HHHHHHHHHhcCcccccccccccchhhhhh
Q psy5550 72 PALQTVTNYFLLNLTAADILFCL-SIPGIMYARVSPQWPLGDVLCKIMPYSQMC 124 (127)
Q Consensus 72 ~~~~~~~~~~l~nla~~Dl~~~~-~~p~~~~~~~~~~w~~g~~~C~~~~~~~~~ 124 (127)
+++|...+-++.||.++|++.++ .+...+.....+.-.-+...|..++++.-.
T Consensus 30 ~r~~~fR~~LIl~L~~aD~~qal~~~i~~~~~l~~~~i~~~s~~C~aqGf~~q~ 83 (201)
T PF11710_consen 30 YRRRSFRHQLILNLLLADFIQALAFLISPIRWLARGGIIAPSPFCQAQGFFLQV 83 (201)
T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeCCCCchhhhHHHHHH
Confidence 55567778899999999999999 555455555444444567899999997644
No 12
>PF05462 Dicty_CAR: Slime mold cyclic AMP receptor
Probab=96.87 E-value=0.008 Score=44.88 Aligned_cols=77 Identities=19% Similarity=0.284 Sum_probs=59.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCchHHHHHHHHHHHHHHHHh-HHHHHHHHHhcCcccccccccccchhhhh
Q psy5550 45 AEICLLSIIFILSMGANLFVLITVMWYPALQTVTNYFLLNLTAADILFCL-SIPGIMYARVSPQWPLGDVLCKIMPYSQM 123 (127)
Q Consensus 45 ~~~~~~~~i~~~~i~gN~~vl~~~~~~~~~~~~~~~~l~nla~~Dl~~~~-~~p~~~~~~~~~~w~~g~~~C~~~~~~~~ 123 (127)
...++..+..+++++|-+.++..+++.|++|++.+-++.-++++|++..+ ....... ..-.-+...|++++++..
T Consensus 8 ~~~~i~~~~s~lSllGclfiI~tf~~~k~~r~~~~rli~yl~~~~ll~~v~~~~~~~~----~~~~~~s~lC~~Qafliq 83 (303)
T PF05462_consen 8 TLYAIELVASVLSLLGCLFIIITFCLFKRLRKPINRLIFYLSIANLLTNVASMIMTLS----PSAGENSFLCQFQAFLIQ 83 (303)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCccHHHHHHHHHHHHHHHHHHHHHHHhc----ccCCCCCcchhhHhHHHH
Confidence 56666777788999999999999999999999999999999999999776 4432221 111234678999998765
Q ss_pred hc
Q psy5550 124 CS 125 (127)
Q Consensus 124 ~~ 125 (127)
..
T Consensus 84 ~f 85 (303)
T PF05462_consen 84 FF 85 (303)
T ss_pred Hh
Confidence 43
No 13
>PF05296 TAS2R: Mammalian taste receptor protein (TAS2R); InterPro: IPR007960 This family consists of several forms of mammalian taste receptor proteins (TAS2Rs). TAS2Rs are G protein-coupled receptors expressed in subsets of taste receptor cells of the tongue and palate epithelia and are organised in the genome in clusters. The proteins are genetically linked to loci that influence bitter perception in mice and humans [].; GO: 0004930 G-protein coupled receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0050909 sensory perception of taste, 0016021 integral to membrane
Probab=96.61 E-value=0.026 Score=42.00 Aligned_cols=74 Identities=16% Similarity=0.216 Sum_probs=48.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHh---hcCCCCchHHHHHHHHHHHHHHHHh-HHHHHHHHHhcCcccccccccccc
Q psy5550 45 AEICLLSIIFILSMGANLFVLITVM---WYPALQTVTNYFLLNLTAADILFCL-SIPGIMYARVSPQWPLGDVLCKIM 118 (127)
Q Consensus 45 ~~~~~~~~i~~~~i~gN~~vl~~~~---~~~~~~~~~~~~l~nla~~Dl~~~~-~~p~~~~~~~~~~w~~g~~~C~~~ 118 (127)
+..++..+.+++|+.||+.++.+-+ ++++.-.|.+..+.+||++.++.-. ..-......+..+.......++..
T Consensus 8 i~~~i~~~~~~~Gi~~N~FI~~vn~~~w~k~~~l~~~d~IL~~La~sr~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 85 (303)
T PF05296_consen 8 IFLIILVVEFIIGILGNGFIVLVNCSDWVKSRKLSPSDQILTSLAISRILLQWVILLNSFLSFFFPNIYFSENVYKII 85 (303)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhHHHHHcCCCCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHcchhhhhhhHHHHH
Confidence 5677778889999999997765544 2333346899999999999999877 443333333333322333344443
No 14
>PF10328 7TM_GPCR_Srx: Serpentine type 7TM GPCR chemoreceptor Srx; InterPro: IPR019430 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class x (Srx) from the Srg superfamily [, ]. Srg receptors contain seven hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures.
Probab=95.55 E-value=0.047 Score=39.75 Aligned_cols=42 Identities=24% Similarity=0.395 Sum_probs=39.2
Q ss_pred HHHHHHHHHHHHHHHHhhcCCCCchHHHHHHHHHHHHHHHHh
Q psy5550 53 IFILSMGANLFVLITVMWYPALQTVTNYFLLNLTAADILFCL 94 (127)
Q Consensus 53 i~~~~i~gN~~vl~~~~~~~~~~~~~~~~l~nla~~Dl~~~~ 94 (127)
+.++|++.|.+++..+.+.+++|++-+.+..+.|++|.+...
T Consensus 3 ~s~~G~~~N~~v~~~~~~~~~~~~sF~~l~~~~a~~n~i~~~ 44 (274)
T PF10328_consen 3 ISIIGIILNWLVFIIIFKLKSLRNSFGILCASQAIANIIICL 44 (274)
T ss_pred eeHHHHHHHHHHHHHHHhcccccCCHHHHHHHHHHHHHHHHH
Confidence 467899999999999999999999999999999999999887
No 15
>PF10324 7TM_GPCR_Srw: Serpentine type 7TM GPCR chemoreceptor Srw; InterPro: IPR019427 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class w (Srw), which is a solo family amongst the superfamilies of chemoreceptors. The genes encoding Srw do not appear to be under as strong an adaptive evolutionary pressure as those of Srz [].
Probab=95.53 E-value=0.024 Score=41.94 Aligned_cols=51 Identities=20% Similarity=0.388 Sum_probs=41.1
Q ss_pred HHHHHHHHHHHHHHHHhhcCCCCc-hHHHHHHHHHHHHHHHHh-HHHHHHHHHh
Q psy5550 53 IFILSMGANLFVLITVMWYPALQT-VTNYFLLNLTAADILFCL-SIPGIMYARV 104 (127)
Q Consensus 53 i~~~~i~gN~~vl~~~~~~~~~~~-~~~~~l~nla~~Dl~~~~-~~p~~~~~~~ 104 (127)
+.++|+++|..-+.++. +|++|+ +.|.+++.+|++|++... ..+..+....
T Consensus 6 ~~~~g~~~N~~h~~VLt-rk~mR~~~in~~l~~Iai~Dl~~~~~~~~~~~~~~~ 58 (318)
T PF10324_consen 6 LSIFGLFINIFHLIVLT-RKSMRSSSINILLIGIAICDLLYMLSILIWELFFFI 58 (318)
T ss_pred EeHHHHHHHHHHhhhcC-ChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 46789999998887664 466675 899999999999999999 8887765544
No 16
>PF10321 7TM_GPCR_Srt: Serpentine type 7TM GPCR chemoreceptor Srt; InterPro: IPR019425 Chemoreception is mediated in Caenorhabditis elegans by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs) of proteins which are of the serpentine type []. Srt is a member of the Srg superfamily of chemoreceptors. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' [].
Probab=95.38 E-value=0.12 Score=38.77 Aligned_cols=77 Identities=13% Similarity=0.059 Sum_probs=54.3
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCchHHHHHHHHHHHHHHHHh-HHHHHHHHHhcCcccccccccccch
Q psy5550 41 FEMSAEICLLSIIFILSMGANLFVLITVMWYPALQTVTNYFLLNLTAADILFCL-SIPGIMYARVSPQWPLGDVLCKIMP 119 (127)
Q Consensus 41 ~~~~~~~~~~~~i~~~~i~gN~~vl~~~~~~~~~~~~~~~~l~nla~~Dl~~~~-~~p~~~~~~~~~~w~~g~~~C~~~~ 119 (127)
..+..++..+.+.+++..+-....+.++.+++.+|.+.+-.+.-||+.|++... ..-..-... ..|..+|+...
T Consensus 30 ~~~p~~G~~~~~~g~~~~~lY~p~~~~i~~~~~~k~~~ykiM~~L~i~Di~~l~~~si~tG~l~-----i~G~vfC~~P~ 104 (313)
T PF10321_consen 30 VKRPILGIYFLIFGIIIIILYIPCLIAIFKKKLFKMSCYKIMFFLAIFDIIQLFINSIITGILA-----IFGAVFCSYPR 104 (313)
T ss_pred CcccchhHHHHHHHHHHHHHHHHHHHHHHHhccccCcHHHHHHHHHHHHHHHHHhhhhhhhHHH-----hcCccccCCch
Confidence 334467777777788888888888888888888889999999999999999886 322222222 23456666555
Q ss_pred hhh
Q psy5550 120 YSQ 122 (127)
Q Consensus 120 ~~~ 122 (127)
+..
T Consensus 105 ~~~ 107 (313)
T PF10321_consen 105 FIY 107 (313)
T ss_pred Hhh
Confidence 433
No 17
>PF03402 V1R: Vomeronasal organ pheromone receptor family, V1R; InterPro: IPR004072 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The rhodopsin-like GPCRs themselves represent a widespread protein family that includes hormone, neurotransmitter and light receptors, all of which transduce extracellular signals through interaction with guanine nucleotide-binding (G) proteins. Although their activating ligands vary widely in structure and character, the amino acid sequences of the receptors are very similar and are believed to adopt a common structural framework comprising 7 transmembrane (TM) helices [, , ]. Pheromones have evolved in all animal phyla, to signal sex and dominance status, and are responsible for stereotypical social and sexual behaviour among members of the same species. In mammals, these chemical signals are believed to be detected primarily by the vomeronasal organ (VNO), a chemosensory organ located at the base of the nasal septum []. The VNO is present in most amphibia, reptiles and non-primate mammals but is absent in birds, adult catarrhine monkeys and apes []. An active role for the human VNO in the detection of pheromones is disputed; the VNO is clearly present in the foetus but appears to be atrophied or absent in adults. Three distinct families of putative pheromone receptors have been identified in the vomeronasal organ (V1Rs, V2Rs and V3Rs). All are G protein-coupled receptors but are only distantly related to the receptors of the main olfactory system, highlighting their different role []. The V1 receptors share between 50 and 90% sequence identity but have little similarity to other families of G protein-coupled receptors. They appear to be distantly related to the mammalian T2R bitter taste receptors and the rhodopsin-like GPCRs []. In rat, the family comprises 30-40 genes. These are expressed in the apical regions of the VNO, in neurons expressing Gi2. Coupling of the receptors to this protein mediates inositol trisphosphate signalling []. A number of human V1 receptor homologues have also been found. The majority of these human sequences are pseudogenes [] but an apparently functional receptor has been identified that is expressed in the human olfactory system [].; GO: 0016503 pheromone receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane
Probab=95.05 E-value=0.052 Score=39.82 Aligned_cols=51 Identities=27% Similarity=0.289 Sum_probs=38.4
Q ss_pred CCCCchHHHHHHHHHHHHHHHHh--HHHHHHHHHhcCcccccccccccchhhhhh
Q psy5550 72 PALQTVTNYFLLNLTAADILFCL--SIPGIMYARVSPQWPLGDVLCKIMPYSQMC 124 (127)
Q Consensus 72 ~~~~~~~~~~l~nla~~Dl~~~~--~~p~~~~~~~~~~w~~g~~~C~~~~~~~~~ 124 (127)
.++.+|.+..+.|||+++.+..+ .+|........ + .+++..||+..|+.=+
T Consensus 5 ~~r~kp~dlIl~hLa~aN~lvLl~rGip~~~~~~~~-~-~~~d~gCK~v~Y~~RV 57 (265)
T PF03402_consen 5 GHRLKPIDLILIHLALANILVLLSRGIPQTMAFFGW-K-FFDDIGCKIVFYIYRV 57 (265)
T ss_pred CCCCCcHHHHHHHHHHHHHHHHHHhhHHHHHHHhhc-c-cCCCceeeeeeeehHH
Confidence 34468999999999999999998 88865433222 2 4689999999887543
No 18
>PF10317 7TM_GPCR_Srd: Serpentine type 7TM GPCR chemoreceptor Srd; InterPro: IPR019421 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents the chemoreceptor Srd [].
Probab=91.65 E-value=0.72 Score=33.97 Aligned_cols=46 Identities=22% Similarity=0.353 Sum_probs=36.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcC-CCCchHHHHHHHHHHHHHHHHh
Q psy5550 49 LLSIIFILSMGANLFVLITVMWYP-ALQTVTNYFLLNLTAADILFCL 94 (127)
Q Consensus 49 ~~~~i~~~~i~gN~~vl~~~~~~~-~~~~~~~~~l~nla~~Dl~~~~ 94 (127)
++.+.+.+|+..|.+.+.++.++. +.-+...+++.|-|+.|++...
T Consensus 4 ~~~~~~~~~~~~n~~Ll~~i~~~tp~~l~~~~~~l~~~~~~~~~~~~ 50 (292)
T PF10317_consen 4 YHPIFFILGIILNILLLYLIIFKTPKSLRTYSILLLNTAIFDLISII 50 (292)
T ss_pred eHHHHHHHHHHHHHHHHHHHHHhChHHHHHHHHHHHHHHHHHHHHHH
Confidence 456778899999998886666533 3335678999999999999887
No 19
>PF00002 7tm_2: 7 transmembrane receptor (Secretin family); InterPro: IPR000832 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The secretin-like GPCRs include secretin [], calcitonin [], parathyroid hormone/parathyroid hormone-related peptides [] and vasoactive intestinal peptide [], all of which activate adenylyl cyclase and the phosphatidyl-inositol-calcium pathway. These receptors contain seven transmembrane regions, in a manner reminiscent of the rhodopsins and other receptors believed to interact with G-proteins (however there is no significant sequence identity between these families, the secretin-like receptors thus bear their own unique '7TM' signature). Their N terminus is probably located on the extracellular side of the membrane and potentially glycosylated. This N-terminal region contains a long conserved region which allow the binding of large peptidic ligand such as glucagon, secretin, VIP and PACAP; this region contains five conserved cysteines residues which could be involved in disulphide bond. The C-terminal region of these receptor is probably cytoplasmic. Every receptor gene in this family is encoded on multiple exons, and several of these genes are alternatively spliced to yield functionally distinct products. ; GO: 0004930 G-protein coupled receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane; PDB: 3L2J_A 1BL1_A.
Probab=85.48 E-value=0.59 Score=33.10 Aligned_cols=73 Identities=23% Similarity=0.275 Sum_probs=2.1
Q ss_pred HHHHHHHHHHHHHHHHHHhhcCCCCchHHHHHHHHHHHHHHHHh-HHHHHHHHHhcCcccccccccccchhhhhh
Q psy5550 51 SIIFILSMGANLFVLITVMWYPALQTVTNYFLLNLTAADILFCL-SIPGIMYARVSPQWPLGDVLCKIMPYSQMC 124 (127)
Q Consensus 51 ~~i~~~~i~gN~~vl~~~~~~~~~~~~~~~~l~nla~~Dl~~~~-~~p~~~~~~~~~~w~~g~~~C~~~~~~~~~ 124 (127)
.+-..+++++=.+++......|++|+..+....||++++++..+ .+.- ......+.-...+..|+..+.+.+.
T Consensus 8 ~vg~~~Si~~ll~~i~~~~~~r~lr~~~~~i~~~l~~sll~~~~~~l~~-~~~~~~~~~~~~~~~C~~~a~~~hy 81 (242)
T PF00002_consen 8 YVGCSLSIICLLLTIITYLLFRKLRSFRNKIHLNLCLSLLLANLSFLIG-ISQTFSPISTTNHCLCRAIAILLHY 81 (242)
T ss_dssp HHHHH----------------------------------------------------------------------
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcccchhhhhhhHHHHHHHHHHHhee-hhhccccccccccccchhhhhHhHH
Confidence 34444555555555555556677788778888999999988776 3222 1111111111112349888876554
No 20
>PF01102 Glycophorin_A: Glycophorin A; InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=59.27 E-value=14 Score=23.92 Aligned_cols=9 Identities=0% Similarity=-0.174 Sum_probs=3.2
Q ss_pred HHHHhhcCC
Q psy5550 65 LITVMWYPA 73 (127)
Q Consensus 65 l~~~~~~~~ 73 (127)
.+++.|.|+
T Consensus 85 ~y~irR~~K 93 (122)
T PF01102_consen 85 SYCIRRLRK 93 (122)
T ss_dssp HHHHHHHS-
T ss_pred HHHHHHHhc
Confidence 344444333
No 21
>PF10327 7TM_GPCR_Sri: Serpentine type 7TM GPCR chemoreceptor Sri; InterPro: IPR019429 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents Sri, which is part of the Str superfamily of chemoreceptors.
Probab=53.58 E-value=19 Score=26.79 Aligned_cols=62 Identities=21% Similarity=0.222 Sum_probs=44.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHh-hcCCCCchHHHHHH---HHHHHHHHHHh-HHHHHHHHHhcC
Q psy5550 45 AEICLLSIIFILSMGANLFVLITVM-WYPALQTVTNYFLL---NLTAADILFCL-SIPGIMYARVSP 106 (127)
Q Consensus 45 ~~~~~~~~i~~~~i~gN~~vl~~~~-~~~~~~~~~~~~l~---nla~~Dl~~~~-~~p~~~~~~~~~ 106 (127)
++...+-+++.+++.-|.+.+..+. +.+++++-.++++. ...+.|+-.+. ..|..+.....|
T Consensus 10 ~li~~~~~ig~iS~~~n~~~iyLi~fks~k~~~fry~ll~~Qi~~~l~di~~t~L~qpipLfP~~ag 76 (303)
T PF10327_consen 10 WLINYYHIIGVISFILNSLGIYLIIFKSPKLDNFRYYLLYFQISCTLTDIHLTFLMQPIPLFPIPAG 76 (303)
T ss_pred HHHHHHHHHHHHHHHHHHHHheeEEEecCCccchhhHHHHHHHHHHHhhhhhhhhccchhhcceeEE
Confidence 5667788889999999998885554 55555554444432 35669999998 888887776544
No 22
>PF10316 7TM_GPCR_Srbc: Serpentine type 7TM GPCR chemoreceptor Srbc ; InterPro: IPR019420 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class b (Srb) from the Sra superfamily []. Srb receptors contain 6-8 hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures. Srbc is a solo family amongst the superfamilies of chemoreceptors.
Probab=45.95 E-value=1.1e+02 Score=22.62 Aligned_cols=56 Identities=13% Similarity=0.190 Sum_probs=36.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcCCCCchHHHHHHHHHHHHHHHHh-HHHHHHHH
Q psy5550 47 ICLLSIIFILSMGANLFVLITVMWYPALQTVTNYFLLNLTAADILFCL-SIPGIMYA 102 (127)
Q Consensus 47 ~~~~~~i~~~~i~gN~~vl~~~~~~~~~~~~~~~~l~nla~~Dl~~~~-~~p~~~~~ 102 (127)
..+-.+...+....|...+..+.+.|+.|++--.++---...|.+.+. ..+.....
T Consensus 9 ~~i~i~~s~~~~~iN~~lL~~if~~Kk~kk~~l~LfY~Rf~~D~~~~~~~~~~~~~~ 65 (273)
T PF10316_consen 9 SIIGIIFSIITCLINFYLLYSIFYSKKKKKPDLSLFYFRFAIDVFYGFSVFIYLIYY 65 (273)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccccCCCCEEeeHHHHHHHHHHHHHHHHHHHHH
Confidence 334445567778889988888886666445444444446788999998 55544433
No 23
>PF09882 DUF2109: Predicted membrane protein (DUF2109); InterPro: IPR019214 This entry is found in various hypothetical archaeal proteins and has no known function.
Probab=45.84 E-value=64 Score=19.20 Aligned_cols=46 Identities=11% Similarity=0.078 Sum_probs=30.1
Q ss_pred HHHHHHHHHHHHHhhcCCCCchHHHHHHHHHHHHHHHHh-HHHHHHH
Q psy5550 56 LSMGANLFVLITVMWYPALQTVTNYFLLNLTAADILFCL-SIPGIMY 101 (127)
Q Consensus 56 ~~i~gN~~vl~~~~~~~~~~~~~~~~l~nla~~Dl~~~~-~~p~~~~ 101 (127)
.|+++=...+-++.-+.+.++..+.-.+|-+++-++... --|+...
T Consensus 6 ~g~Iai~~~iR~~~~~~r~~KL~yLnv~~F~iaalIaL~i~~P~g~i 52 (78)
T PF09882_consen 6 IGIIAILMAIRIFLTKSRARKLLYLNVINFAIAALIALYIKSPMGAI 52 (78)
T ss_pred HHHHHHHHHHHHHHhHhHHHhhhHHHHHHHHHHHHHHHHhCCcHHHH
Confidence 344444444545555555677788888899999888877 6665543
No 24
>PF10323 7TM_GPCR_Srv: Serpentine type 7TM GPCR chemoreceptor Srv; InterPro: IPR019426 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class v (Srv) from the Srg superfamily [, ]. Srg receptors contain seven hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures.
Probab=45.03 E-value=36 Score=24.97 Aligned_cols=35 Identities=23% Similarity=0.326 Sum_probs=25.1
Q ss_pred HHHHHHHHHhhcCC----CCchHHHHHHHHHHHHHHHHh
Q psy5550 60 ANLFVLITVMWYPA----LQTVTNYFLLNLTAADILFCL 94 (127)
Q Consensus 60 gN~~vl~~~~~~~~----~~~~~~~~l~nla~~Dl~~~~ 94 (127)
=...++..+.+.|+ .+++-+.++.+-+++|++..+
T Consensus 11 ly~~il~~l~~~r~~~~~~~~~Fy~l~~~~~iaDi~~~~ 49 (283)
T PF10323_consen 11 LYIFILYCLLKLRKRSKTFKSTFYTLLIQHCIADILSML 49 (283)
T ss_pred HHHHHHHHHHHcccCccccCCHHHHHHHHHHHHHHHHHH
Confidence 34444444554443 458899999999999999886
No 25
>PF12304 BCLP: Beta-casein like protein; InterPro: IPR020977 This entry represents eukaryotic proteins that are typically between 216 to 240 amino acids in length which have two conserved sequence motifs: VLR and TRIY. Beta-casein-like protein is associated with cell morphology and a regulation of growth pattern of tumours. It is found in adenocarcinomas of uterine cervical tissues[].
Probab=44.02 E-value=1.1e+02 Score=21.40 Aligned_cols=51 Identities=12% Similarity=0.098 Sum_probs=32.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCchHHHHHHHHHHHHHHHHh
Q psy5550 42 EMSAEICLLSIIFILSMGANLFVLITVMWYPALQTVTNYFLLNLTAADILFCL 94 (127)
Q Consensus 42 ~~~~~~~~~~~i~~~~i~gN~~vl~~~~~~~~~~~~~~~~l~nla~~Dl~~~~ 94 (127)
++.+.-++-+.-.++++.+.+..+ +..|+++ +++..+-++.+++...+.+.
T Consensus 39 eY~vsNiisv~Sgll~I~~GI~AI-vlSrnl~-~~~L~W~Ll~~S~ln~LlSa 89 (188)
T PF12304_consen 39 EYAVSNIISVTSGLLSIICGIVAI-VLSRNLR-NRPLHWTLLVVSLLNALLSA 89 (188)
T ss_pred hhhHHHHHHHHHHHHHHHHhHHHH-hhhccCC-CCcchHHHHHHHHHHHHHHH
Confidence 333445555566777777666544 4567776 46677777777777666665
No 26
>PF10873 DUF2668: Protein of unknown function (DUF2668); InterPro: IPR022640 Members in this family of proteins are annotated as cysteine and tyrosine-rich protein 1, however currently no function is known [].
Probab=41.34 E-value=13 Score=24.97 Aligned_cols=35 Identities=11% Similarity=0.199 Sum_probs=22.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCc
Q psy5550 42 EMSAEICLLSIIFILSMGANLFVLITVMWYPALQT 76 (127)
Q Consensus 42 ~~~~~~~~~~~i~~~~i~gN~~vl~~~~~~~~~~~ 76 (127)
...+..+++.+++++|+++-..+....+.++++++
T Consensus 60 gtAIaGIVfgiVfimgvva~i~icvCmc~kn~rgs 94 (155)
T PF10873_consen 60 GTAIAGIVFGIVFIMGVVAGIAICVCMCMKNSRGS 94 (155)
T ss_pred cceeeeeehhhHHHHHHHHHHHHHHhhhhhcCCCc
Confidence 33456678888888888887766555554444333
No 27
>PF04789 DUF621: Protein of unknown function (DUF621); InterPro: IPR006874 This is a conserved region found in uncharacterised proteins from Caenorhabditis elegans, and is noted to have possible G-protein-coupled receptor-like activity.
Probab=38.65 E-value=1.6e+02 Score=22.21 Aligned_cols=48 Identities=17% Similarity=0.417 Sum_probs=29.9
Q ss_pred HHHHHHHHHHHhhcCCCCchHHHHHHHHHHHHHHHHh-----HHHHHHHHHhc
Q psy5550 58 MGANLFVLITVMWYPALQTVTNYFLLNLTAADILFCL-----SIPGIMYARVS 105 (127)
Q Consensus 58 i~gN~~vl~~~~~~~~~~~~~~~~l~nla~~Dl~~~~-----~~p~~~~~~~~ 105 (127)
+.|-.+|+..+.+.+=++-+..+|+.+|..+-++.+. .+|..+.+.+.
T Consensus 29 Lt~~Flv~~i~lW~~Fk~m~ffwFl~qlt~s~fi~S~lNl~inVPatlfsl~t 81 (305)
T PF04789_consen 29 LTGAFLVLSIILWSHFKPMKFFWFLTQLTISVFIISSLNLLINVPATLFSLIT 81 (305)
T ss_pred HHHHHHHHHHHHHHhcccchHHHHHHHHHHHHHHHHhhhheEeCcHHHHHhhh
Confidence 3344455554444443345678999999999888774 45655544443
No 28
>PF10329 DUF2417: Region of unknown function (DUF2417); InterPro: IPR019431 This entry represents a family of fungal proteins with no known function. In some cases these proteins also contain an alpha/beta hydrolase fold (IPR000073 from INTERPRO).
Probab=38.25 E-value=77 Score=22.97 Aligned_cols=37 Identities=27% Similarity=0.298 Sum_probs=16.1
Q ss_pred HHHHHHHHHHHhhcCCCCchHHHHHHHHHHHHHHHHh
Q psy5550 58 MGANLFVLITVMWYPALQTVTNYFLLNLTAADILFCL 94 (127)
Q Consensus 58 i~gN~~vl~~~~~~~~~~~~~~~~l~nla~~Dl~~~~ 94 (127)
+..|...++.+.-..+.-+..++.++-|-+.|++..+
T Consensus 84 l~~~~~~L~Ff~vpS~~~r~l~~vl~~Lllvdlilil 120 (232)
T PF10329_consen 84 LITNLFNLWFFGVPSKLERILNIVLAGLLLVDLILIL 120 (232)
T ss_pred HHHHHHHHHheecCcHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344444433333333333445555555555555444
No 29
>PF02532 PsbI: Photosystem II reaction centre I protein (PSII 4.8 kDa protein); InterPro: IPR003686 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection []. This family represents the low molecular weight transmembrane protein PsbI, which is tightly associated with the D1/D2 heterodimer in PSII. The function of PsbI is unknown, but it may be involved in the assembly, dimerisation or stabilisation of PSII dimers [].; GO: 0015979 photosynthesis, 0009523 photosystem II, 0009539 photosystem II reaction center, 0016020 membrane; PDB: 3A0H_i 3ARC_I 3A0B_i 3BZ2_I 3PRQ_I 3KZI_I 3PRR_I 2AXT_i 4FBY_I 1S5L_i ....
Probab=34.97 E-value=63 Score=16.20 Aligned_cols=17 Identities=18% Similarity=0.397 Sum_probs=9.8
Q ss_pred HHHHHHHHHHHHHHHHH
Q psy5550 45 AEICLLSIIFILSMGAN 61 (127)
Q Consensus 45 ~~~~~~~~i~~~~i~gN 61 (127)
....+++.++++|.+.|
T Consensus 10 ~vV~ffv~LFifGflsn 26 (36)
T PF02532_consen 10 TVVIFFVSLFIFGFLSN 26 (36)
T ss_dssp HHHHHHHHHHHHHHHTT
T ss_pred hhHHHHHHHHhccccCC
Confidence 34455556666666655
No 30
>PF11446 DUF2897: Protein of unknown function (DUF2897); InterPro: IPR021550 This is a bacterial family of uncharacterised proteins.
Probab=34.20 E-value=61 Score=17.91 Aligned_cols=10 Identities=20% Similarity=0.129 Sum_probs=7.4
Q ss_pred HHHHHHHHHH
Q psy5550 57 SMGANLFVLI 66 (127)
Q Consensus 57 ~i~gN~~vl~ 66 (127)
-++||+.++-
T Consensus 15 vIigNia~LK 24 (55)
T PF11446_consen 15 VIIGNIAALK 24 (55)
T ss_pred HHHhHHHHHH
Confidence 4789998763
No 31
>TIGR01477 RIFIN variant surface antigen, rifin family. This model represents the rifin branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of rifin sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 20 bits.
Probab=34.14 E-value=63 Score=24.92 Aligned_cols=30 Identities=20% Similarity=0.269 Sum_probs=19.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcCCCCc
Q psy5550 47 ICLLSIIFILSMGANLFVLITVMWYPALQT 76 (127)
Q Consensus 47 ~~~~~~i~~~~i~gN~~vl~~~~~~~~~~~ 76 (127)
++...+++++.++--.++++++.|+||.+.
T Consensus 311 ~IiaSiIAIvvIVLIMvIIYLILRYRRKKK 340 (353)
T TIGR01477 311 PIIASIIAILIIVLIMVIIYLILRYRRKKK 340 (353)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhcch
Confidence 344555666666666677788888887553
No 32
>COG1230 CzcD Co/Zn/Cd efflux system component [Inorganic ion transport and metabolism]
Probab=32.27 E-value=2.2e+02 Score=21.44 Aligned_cols=66 Identities=18% Similarity=0.169 Sum_probs=39.1
Q ss_pred HHHHHHHHHHHHHHHHHHHhhcCCCCchHHHHHHHH-HHHHHHHHh-HHHHHHHHHhcCcccccccccccc
Q psy5550 50 LSIIFILSMGANLFVLITVMWYPALQTVTNYFLLNL-TAADILFCL-SIPGIMYARVSPQWPLGDVLCKIM 118 (127)
Q Consensus 50 ~~~i~~~~i~gN~~vl~~~~~~~~~~~~~~~~l~nl-a~~Dl~~~~-~~p~~~~~~~~~~w~~g~~~C~~~ 118 (127)
+.++.++|++-|+...+.+.+.++ .-.|.=-..| +++|.+..+ .+--.+.-.+.+ |..-++.+.+.
T Consensus 127 ml~va~~GL~vN~~~a~ll~~~~~--~~lN~r~a~LHvl~D~Lgsv~vIia~i~i~~~~-w~~~Dpi~si~ 194 (296)
T COG1230 127 MLVVAIIGLVVNLVSALLLHKGHE--ENLNMRGAYLHVLGDALGSVGVIIAAIVIRFTG-WSWLDPILSIV 194 (296)
T ss_pred hHHHHHHHHHHHHHHHHHhhCCCc--ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-CCccchHHHHH
Confidence 456688899999998888876522 1222222222 358999888 555555555554 44445555433
No 33
>PTZ00046 rifin; Provisional
Probab=30.86 E-value=78 Score=24.50 Aligned_cols=29 Identities=14% Similarity=0.282 Sum_probs=18.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhcCCCCc
Q psy5550 48 CLLSIIFILSMGANLFVLITVMWYPALQT 76 (127)
Q Consensus 48 ~~~~~i~~~~i~gN~~vl~~~~~~~~~~~ 76 (127)
+...+++++.++--.++++++.|+||.+.
T Consensus 317 IiaSiiAIvVIVLIMvIIYLILRYRRKKK 345 (358)
T PTZ00046 317 IIASIVAIVVIVLIMVIIYLILRYRRKKK 345 (358)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhcch
Confidence 34455566666666677788888887553
No 34
>PF02009 Rifin_STEVOR: Rifin/stevor family; InterPro: IPR002858 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see []. Several multicopy gene families have been described in Plasmodium falciparum, including the stevor family of subtelomeric open reading frames and the rif interspersed repetitive elements. Both families contain three predicted transmembrane segments. It has been proposed that stevor and rif are members of a larger superfamily that code for variant surface antigens [].
Probab=29.79 E-value=53 Score=24.71 Aligned_cols=27 Identities=22% Similarity=0.343 Sum_probs=15.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcCCCC
Q psy5550 49 LLSIIFILSMGANLFVLITVMWYPALQ 75 (127)
Q Consensus 49 ~~~~i~~~~i~gN~~vl~~~~~~~~~~ 75 (127)
...++.++.++-=.++|++++|+||.+
T Consensus 259 ~aSiiaIliIVLIMvIIYLILRYRRKK 285 (299)
T PF02009_consen 259 IASIIAILIIVLIMVIIYLILRYRRKK 285 (299)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 334444444554456677777777743
No 35
>PF05393 Hum_adeno_E3A: Human adenovirus early E3A glycoprotein; InterPro: IPR008652 This family consists of several early glycoproteins (E3A), from human adenovirus type 2.; GO: 0016021 integral to membrane
Probab=25.24 E-value=93 Score=19.06 Aligned_cols=6 Identities=33% Similarity=0.085 Sum_probs=3.0
Q ss_pred HHHHhh
Q psy5550 65 LITVMW 70 (127)
Q Consensus 65 l~~~~~ 70 (127)
+|.++.
T Consensus 50 lwfvCC 55 (94)
T PF05393_consen 50 LWFVCC 55 (94)
T ss_pred HHHHHH
Confidence 455553
No 36
>PF08114 PMP1_2: ATPase proteolipid family; InterPro: IPR012589 This family consists of small proteolipids associated with the plasma membrane H+ ATPase. Two proteolipids (PMP1 and PMP2) are associated with the ATPase and both genes are similarly expressed in the wild-type strain of yeast. No modification of the level of transcription of one PMP gene is detected in a strain deleted of the other. Though both proteolipids show similarity with other small proteolipids associated with other cation -transporting ATPases, their functions remain unclear [].
Probab=25.12 E-value=47 Score=17.25 Aligned_cols=19 Identities=5% Similarity=0.111 Sum_probs=9.6
Q ss_pred HHHHHHHHHHHHHHHHHhh
Q psy5550 52 IIFILSMGANLFVLITVMW 70 (127)
Q Consensus 52 ~i~~~~i~gN~~vl~~~~~ 70 (127)
+++++|+.|-+++...++|
T Consensus 13 VF~lVglv~i~iva~~iYR 31 (43)
T PF08114_consen 13 VFCLVGLVGIGIVALFIYR 31 (43)
T ss_pred ehHHHHHHHHHHHHHHHHH
Confidence 4445555655555444443
No 37
>PF02101 Ocular_alb: Ocular albinism type 1 protein; InterPro: IPR001414 Ocular albinism type 1 (OA1) is an X-linked disorder characterised by severe impairment of visual acuity, retinal hypopigmentation and the presence of macromelanosomes. A novel transcript from the OA1 critical region is expressed in high levels in RNA samples from retina and from melanoma and encodes a potential integral membrane protein []. This protein is of unknown function but is known to bind heterotrimeric G proteins.; GO: 0016020 membrane
Probab=24.89 E-value=2.7e+02 Score=21.97 Aligned_cols=59 Identities=22% Similarity=0.135 Sum_probs=33.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcCC----CCch--HHHHHHHHHHHHHHHHh-HHHHHHHHH
Q psy5550 45 AEICLLSIIFILSMGANLFVLITVMWYPA----LQTV--TNYFLLNLTAADILFCL-SIPGIMYAR 103 (127)
Q Consensus 45 ~~~~~~~~i~~~~i~gN~~vl~~~~~~~~----~~~~--~~~~l~nla~~Dl~~~~-~~p~~~~~~ 103 (127)
.+-++-+....+|+.|-++-+.--.|... .+++ ..-.+..||++|++.++ .+-....+.
T Consensus 28 ~f~avCLgSs~l~l~gallQLlp~rr~~~~~~~~~sp~~~~rIl~~la~aDlLaclGVivRS~vWl 93 (405)
T PF02101_consen 28 AFNAVCLGSSVLSLLGALLQLLPRRRSAGPRAPARSPSSSRRILFWLAVADLLACLGVIVRSSVWL 93 (405)
T ss_pred hhhhhHHHHHHHHHHHHHHhhccccccccccccccCCcCCchhHHHHHHHHHHhhhhHHHHhhhhh
Confidence 44445555566666665544431111100 0111 34678899999999998 666665554
No 38
>PF06024 DUF912: Nucleopolyhedrovirus protein of unknown function (DUF912); InterPro: IPR009261 This entry is represented by Autographa californica nuclear polyhedrosis virus (AcMNPV), Orf78; it is a family of uncharacterised viral proteins.
Probab=24.61 E-value=84 Score=19.45 Aligned_cols=6 Identities=0% Similarity=0.031 Sum_probs=2.5
Q ss_pred HhhcCC
Q psy5550 68 VMWYPA 73 (127)
Q Consensus 68 ~~~~~~ 73 (127)
+.|.|+
T Consensus 86 ILRer~ 91 (101)
T PF06024_consen 86 ILRERQ 91 (101)
T ss_pred EEeccc
Confidence 334444
No 39
>KOG4564|consensus
Probab=24.30 E-value=3.9e+02 Score=21.66 Aligned_cols=47 Identities=30% Similarity=0.431 Sum_probs=34.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhcCCCCchHHHHHHHHHHHHHHHHh
Q psy5550 48 CLLSIIFILSMGANLFVLITVMWYPALQTVTNYFLLNLTAADILFCL 94 (127)
Q Consensus 48 ~~~~~i~~~~i~gN~~vl~~~~~~~~~~~~~~~~l~nla~~Dl~~~~ 94 (127)
++|.+-.-++++.=.+.+.++...|++|...|+.=.||-++=++.++
T Consensus 149 ~lytvGyslSl~sL~vAl~If~~FR~L~CtRn~IH~nLF~SfiLra~ 195 (473)
T KOG4564|consen 149 ILYTVGYSLSLVSLLVALIIFLYFRSLHCTRNYIHMNLFASFILRAA 195 (473)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhcchHHHHHHHHHHHHHHHHH
Confidence 34444455555554555667778889998899999999999888777
No 40
>PF02468 PsbN: Photosystem II reaction centre N protein (psbN); InterPro: IPR003398 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection []. This family represents the low molecular weight transmembrane protein PsbN found in PSII. PsbN may have a role in PSII stability, however its actual function unknown. PsbN does not appear to be essential for photoautotrophic growth or normal PSII function.; GO: 0015979 photosynthesis, 0009523 photosystem II, 0009539 photosystem II reaction center, 0016020 membrane
Probab=20.83 E-value=1.2e+02 Score=15.97 Aligned_cols=31 Identities=6% Similarity=-0.023 Sum_probs=16.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcCCCCch
Q psy5550 47 ICLLSIIFILSMGANLFVLITVMWYPALQTV 77 (127)
Q Consensus 47 ~~~~~~i~~~~i~gN~~vl~~~~~~~~~~~~ 77 (127)
..+++...+++++|-.+-...=--.|.+|.|
T Consensus 7 ~~i~i~~~lv~~Tgy~iYtaFGppSk~LrDP 37 (43)
T PF02468_consen 7 LAIFISCLLVSITGYAIYTAFGPPSKELRDP 37 (43)
T ss_pred HHHHHHHHHHHHHhhhhhheeCCCccccCCc
Confidence 4445556667777755433222235666654
No 41
>PF10319 7TM_GPCR_Srj: Serpentine type 7TM GPCR chemoreceptor Srj; InterPro: IPR019423 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class j (Srj) from the Str superfamily [, ]. The Srj family is designated as the out-group based on its location in preliminary phylogenetic analyses of the entire superfamily [].
Probab=20.74 E-value=2.1e+02 Score=21.75 Aligned_cols=49 Identities=20% Similarity=0.249 Sum_probs=36.0
Q ss_pred HHHHHHHHHHHHHHHHHhhcCCCCc-hHHHHHHHHHHHHHHHHh---HHHHHH
Q psy5550 52 IIFILSMGANLFVLITVMWYPALQT-VTNYFLLNLTAADILFCL---SIPGIM 100 (127)
Q Consensus 52 ~i~~~~i~gN~~vl~~~~~~~~~~~-~~~~~l~nla~~Dl~~~~---~~p~~~ 100 (127)
+.++++.+-|.+.+.++..+|+.+- .-.++++--|+-|++.++ .+|.-+
T Consensus 13 ~~~~lsf~~Np~fiyli~~~~~~~~G~Yr~LL~~Fa~fn~~~S~~~~~vp~~v 65 (310)
T PF10319_consen 13 IFGILSFIVNPIFIYLIFTEKKSQFGNYRYLLLFFAIFNLIYSVVDLLVPICV 65 (310)
T ss_pred HHHHHHHHHhhhhheeEEcccccccccHHHHHHHHHHHHHHHHHHHHHhhhee
Confidence 3455667889999988887776653 457788889999999987 555443
No 42
>KOG4193|consensus
Probab=20.53 E-value=3.3e+02 Score=22.79 Aligned_cols=56 Identities=14% Similarity=0.030 Sum_probs=28.7
Q ss_pred HHHHHHhhcCCCCchHHHHHHHHHHHHHHHHhHHHHHHHHHhcCccccccc--ccccchhhhhhc
Q psy5550 63 FVLITVMWYPALQTVTNYFLLNLTAADILFCLSIPGIMYARVSPQWPLGDV--LCKIMPYSQMCS 125 (127)
Q Consensus 63 ~vl~~~~~~~~~~~~~~~~l~nla~~Dl~~~~~~p~~~~~~~~~~w~~g~~--~C~~~~~~~~~~ 125 (127)
+.+++....|++|+-.+....||+++=++.- ...+.+.|..+.. .|+..+.+.+..
T Consensus 339 lti~ty~~~~~l~~~~~~i~~~l~~~L~l~~-------l~fL~~~~~~~~~~~~C~~~a~llhff 396 (610)
T KOG4193|consen 339 LTIATYLLFRKLQNDRTKIHINLCLCLFLAE-------LLFLLGIDRTSTSVVLCIAAAILLHFF 396 (610)
T ss_pred HHHHHHHHHHHHHhhcchhHHHHHHHHHHHH-------HHHhcccccccCcccccHHHHHHHHHH
Confidence 3344444444444444777778888722211 1122234443333 599888766543
No 43
>CHL00024 psbI photosystem II protein I
Probab=20.49 E-value=50 Score=16.56 Aligned_cols=17 Identities=18% Similarity=0.397 Sum_probs=10.0
Q ss_pred HHHHHHHHHHHHHHHHH
Q psy5550 45 AEICLLSIIFILSMGAN 61 (127)
Q Consensus 45 ~~~~~~~~i~~~~i~gN 61 (127)
....+++.++++|.+.|
T Consensus 10 ~vV~ffvsLFifGFlsn 26 (36)
T CHL00024 10 TVVIFFVSLFIFGFLSN 26 (36)
T ss_pred hHHHHHHHHHHccccCC
Confidence 34455666676676655
No 44
>PHA03164 hypothetical protein; Provisional
Probab=20.34 E-value=1e+02 Score=18.34 Aligned_cols=21 Identities=38% Similarity=0.388 Sum_probs=14.0
Q ss_pred CCchHHHHHHHHHHHHHHHHh
Q psy5550 74 LQTVTNYFLLNLTAADILFCL 94 (127)
Q Consensus 74 ~~~~~~~~l~nla~~Dl~~~~ 94 (127)
+|+.+.+.+..||++-+++..
T Consensus 55 RktftFlvLtgLaIamILfii 75 (88)
T PHA03164 55 RKTFTFLVLTGLAIAMILFII 75 (88)
T ss_pred hheeehHHHHHHHHHHHHHHH
Confidence 356667777778777666554
Done!