Query         psy5550
Match_columns 127
No_of_seqs    106 out of 1364
Neff          8.9 
Searched_HMMs 46136
Date          Fri Aug 16 20:25:01 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy5550.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/5550hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG4219|consensus               99.7 5.8E-17 1.3E-21  121.6   5.5   92   35-126    27-119 (423)
  2 PHA03234 DNA packaging protein  99.6 2.3E-15 4.9E-20  112.9   9.0   86   39-125    28-115 (338)
  3 KOG4220|consensus               99.5 2.7E-15 5.9E-20  113.1  -2.3   84   42-125    29-113 (503)
  4 PHA02834 chemokine receptor-li  99.4 8.2E-13 1.8E-17   98.4   9.2   81   42-124    27-107 (323)
  5 PHA02638 CC chemokine receptor  99.4 2.7E-12 5.9E-17   98.8   9.3   81   41-123    96-176 (417)
  6 PHA03087 G protein-coupled che  99.3 2.4E-12 5.3E-17   95.8   7.1   84   41-125    38-121 (335)
  7 PHA03235 DNA packaging protein  99.3 2.3E-11 5.1E-16   93.4   9.7   85   40-125    29-115 (409)
  8 PF00001 7tm_1:  7 transmembran  99.0   5E-10 1.1E-14   78.5   5.5   66   60-125     1-67  (257)
  9 KOG2087|consensus               98.2 6.5E-07 1.4E-11   67.2   2.0   84   41-125    22-114 (363)
 10 PF10320 7TM_GPCR_Srsx:  Serpen  98.2 9.1E-07   2E-11   64.2   1.8   67   55-123     2-69  (257)
 11 PF11710 Git3:  G protein-coupl  97.3  0.0023 4.9E-08   45.0   7.8   53   72-124    30-83  (201)
 12 PF05462 Dicty_CAR:  Slime mold  96.9   0.008 1.7E-07   44.9   7.9   77   45-125     8-85  (303)
 13 PF05296 TAS2R:  Mammalian tast  96.6   0.026 5.7E-07   42.0   9.1   74   45-118     8-85  (303)
 14 PF10328 7TM_GPCR_Srx:  Serpent  95.5   0.047   1E-06   39.7   6.0   42   53-94      3-44  (274)
 15 PF10324 7TM_GPCR_Srw:  Serpent  95.5   0.024 5.3E-07   41.9   4.5   51   53-104     6-58  (318)
 16 PF10321 7TM_GPCR_Srt:  Serpent  95.4    0.12 2.6E-06   38.8   7.8   77   41-122    30-107 (313)
 17 PF03402 V1R:  Vomeronasal orga  95.1   0.052 1.1E-06   39.8   4.8   51   72-124     5-57  (265)
 18 PF10317 7TM_GPCR_Srd:  Serpent  91.7    0.72 1.6E-05   34.0   6.0   46   49-94      4-50  (292)
 19 PF00002 7tm_2:  7 transmembran  85.5    0.59 1.3E-05   33.1   1.9   73   51-124     8-81  (242)
 20 PF01102 Glycophorin_A:  Glycop  59.3      14 0.00031   23.9   3.1    9   65-73     85-93  (122)
 21 PF10327 7TM_GPCR_Sri:  Serpent  53.6      19 0.00041   26.8   3.4   62   45-106    10-76  (303)
 22 PF10316 7TM_GPCR_Srbc:  Serpen  46.0 1.1E+02  0.0024   22.6   6.3   56   47-102     9-65  (273)
 23 PF09882 DUF2109:  Predicted me  45.8      64  0.0014   19.2   4.2   46   56-101     6-52  (78)
 24 PF10323 7TM_GPCR_Srv:  Serpent  45.0      36 0.00078   25.0   3.7   35   60-94     11-49  (283)
 25 PF12304 BCLP:  Beta-casein lik  44.0 1.1E+02  0.0024   21.4   7.9   51   42-94     39-89  (188)
 26 PF10873 DUF2668:  Protein of u  41.3      13 0.00027   25.0   0.7   35   42-76     60-94  (155)
 27 PF04789 DUF621:  Protein of un  38.7 1.6E+02  0.0034   22.2   6.1   48   58-105    29-81  (305)
 28 PF10329 DUF2417:  Region of un  38.3      77  0.0017   23.0   4.4   37   58-94     84-120 (232)
 29 PF02532 PsbI:  Photosystem II   35.0      63  0.0014   16.2   2.5   17   45-61     10-26  (36)
 30 PF11446 DUF2897:  Protein of u  34.2      61  0.0013   17.9   2.7   10   57-66     15-24  (55)
 31 TIGR01477 RIFIN variant surfac  34.1      63  0.0014   24.9   3.6   30   47-76    311-340 (353)
 32 COG1230 CzcD Co/Zn/Cd efflux s  32.3 2.2E+02  0.0048   21.4   7.0   66   50-118   127-194 (296)
 33 PTZ00046 rifin; Provisional     30.9      78  0.0017   24.5   3.6   29   48-76    317-345 (358)
 34 PF02009 Rifin_STEVOR:  Rifin/s  29.8      53  0.0011   24.7   2.5   27   49-75    259-285 (299)
 35 PF05393 Hum_adeno_E3A:  Human   25.2      93   0.002   19.1   2.5    6   65-70     50-55  (94)
 36 PF08114 PMP1_2:  ATPase proteo  25.1      47   0.001   17.3   1.1   19   52-70     13-31  (43)
 37 PF02101 Ocular_alb:  Ocular al  24.9 2.7E+02  0.0059   22.0   5.6   59   45-103    28-93  (405)
 38 PF06024 DUF912:  Nucleopolyhed  24.6      84  0.0018   19.5   2.4    6   68-73     86-91  (101)
 39 KOG4564|consensus               24.3 3.9E+02  0.0084   21.7  10.0   47   48-94    149-195 (473)
 40 PF02468 PsbN:  Photosystem II   20.8 1.2E+02  0.0025   16.0   2.1   31   47-77      7-37  (43)
 41 PF10319 7TM_GPCR_Srj:  Serpent  20.7 2.1E+02  0.0045   21.7   4.2   49   52-100    13-65  (310)
 42 KOG4193|consensus               20.5 3.3E+02  0.0071   22.8   5.6   56   63-125   339-396 (610)
 43 CHL00024 psbI photosystem II p  20.5      50  0.0011   16.6   0.6   17   45-61     10-26  (36)
 44 PHA03164 hypothetical protein;  20.3   1E+02  0.0022   18.3   2.0   21   74-94     55-75  (88)

No 1  
>KOG4219|consensus
Probab=99.67  E-value=5.8e-17  Score=121.59  Aligned_cols=92  Identities=21%  Similarity=0.366  Sum_probs=85.0

Q ss_pred             cCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCchHHHHHHHHHHHHHHHHh-HHHHHHHHHhcCccccccc
Q psy5550          35 EFGNREFEMSAEICLLSIIFILSMGANLFVLITVMWYPALQTVTNYFLLNLTAADILFCL-SIPGIMYARVSPQWPLGDV  113 (127)
Q Consensus        35 ~~~~~~~~~~~~~~~~~~i~~~~i~gN~~vl~~~~~~~~~~~~~~~~l~nla~~Dl~~~~-~~p~~~~~~~~~~w~~g~~  113 (127)
                      ....+..+..+++++|+++.+++++||++|+|++..+|++|+.+|+|++|||+||++.++ ..|+.......+.|.+|.+
T Consensus        27 ~f~lp~~~~~~wai~yg~l~~vAv~GN~iVlwIil~hrrMRtvtnyfL~NLAfADl~~s~Fn~~f~f~yal~~~W~~G~f  106 (423)
T KOG4219|consen   27 LFVLPAWQQALWAIAYGLLVFVAVVGNLIVLWIILAHRRMRTVTNYFLVNLAFADLSMSIFNTVFNFQYALHQEWYFGSF  106 (423)
T ss_pred             cccCCHHHHHHHHHHHHHHHHHHHhcCceEEEEEeehhehhhhHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhccccccc
Confidence            456788888999999999999999999999999999999999999999999999999999 9999988888889999999


Q ss_pred             ccccchhhhhhcC
Q psy5550         114 LCKIMPYSQMCSE  126 (127)
Q Consensus       114 ~C~~~~~~~~~~~  126 (127)
                      .|++..|+...++
T Consensus       107 ~C~f~nf~~itav  119 (423)
T KOG4219|consen  107 YCRFVNFFPITAV  119 (423)
T ss_pred             eeeeccccchhhh
Confidence            9999999876553


No 2  
>PHA03234 DNA packaging protein UL33; Provisional
Probab=99.62  E-value=2.3e-15  Score=112.93  Aligned_cols=86  Identities=19%  Similarity=0.284  Sum_probs=71.8

Q ss_pred             chhHHHHHHHHHHHHHHHHHHHHHHHHHHH--hhcCCCCchHHHHHHHHHHHHHHHHhHHHHHHHHHhcCcccccccccc
Q psy5550          39 REFEMSAEICLLSIIFILSMGANLFVLITV--MWYPALQTVTNYFLLNLTAADILFCLSIPGIMYARVSPQWPLGDVLCK  116 (127)
Q Consensus        39 ~~~~~~~~~~~~~~i~~~~i~gN~~vl~~~--~~~~~~~~~~~~~l~nla~~Dl~~~~~~p~~~~~~~~~~w~~g~~~C~  116 (127)
                      .+....+++++|.+++++|++||++|++++  .+++++|+++|+|++|||++|++.++.+|+.+... .++|++|+..||
T Consensus        28 ~~~~~~~~~~~y~~vf~~gl~gN~lvl~v~~~~~~~~~rt~tn~fi~NLAvaDLL~~l~lp~~~~~~-~~~w~fG~~lCk  106 (338)
T PHA03234         28 LKKAQILESAINGIMLTLIIPMIIIVICTLIIYHKVAKHNATSFYLITLFASDFLHMLCVFFLTLNR-EALFNFNQAFCQ  106 (338)
T ss_pred             HHHHHHHhhHHHHHHHHHHhhhHHHHHHHHHHHhccccccHHHHHHHHHHHHHHHHHHHHHHHHHHH-hCCccCchhHHH
Confidence            445667889999999999999999999844  46667789999999999999999988777766543 457999999999


Q ss_pred             cchhhhhhc
Q psy5550         117 IMPYSQMCS  125 (127)
Q Consensus       117 ~~~~~~~~~  125 (127)
                      +..++...+
T Consensus       107 ~~~~~~~~~  115 (338)
T PHA03234        107 CVLFIYHAS  115 (338)
T ss_pred             HHHHHHHHH
Confidence            998877654


No 3  
>KOG4220|consensus
Probab=99.46  E-value=2.7e-15  Score=113.11  Aligned_cols=84  Identities=25%  Similarity=0.433  Sum_probs=77.0

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCchHHHHHHHHHHHHHHHHh-HHHHHHHHHhcCcccccccccccchh
Q psy5550          42 EMSAEICLLSIIFILSMGANLFVLITVMWYPALQTVTNYFLLNLTAADILFCL-SIPGIMYARVSPQWPLGDVLCKIMPY  120 (127)
Q Consensus        42 ~~~~~~~~~~~i~~~~i~gN~~vl~~~~~~~~~~~~~~~~l~nla~~Dl~~~~-~~p~~~~~~~~~~w~~g~~~C~~~~~  120 (127)
                      +.++++++...+.++.++||++||..+..+|++|+..|+||++||+||++.+. .+|+...+.+.|+|++|...|.+...
T Consensus        29 q~v~i~~v~~~lsLVTv~GNlLVmiSfKvnrqLqTVnNYfLfSLAcADliIG~~SMnl~t~Y~lmg~W~LG~~~CdlWLa  108 (503)
T KOG4220|consen   29 QVVFIVVVTGSLSLVTVVGNLLVMISFKVNRQLQTVNNYFLFSLACADLIIGAFSMNLYTTYTLMGYWPLGPLVCDLWLA  108 (503)
T ss_pred             EEEeeehhhhHHHHHhhhccEEEEEEEEecceeeeecceeehHHHHhhhhhheeechHHHHHHHHcccccchHHHHHHHH
Confidence            33456777888899999999999999999999999999999999999999999 99999999999999999999999988


Q ss_pred             hhhhc
Q psy5550         121 SQMCS  125 (127)
Q Consensus       121 ~~~~~  125 (127)
                      +.++.
T Consensus       109 lDYva  113 (503)
T KOG4220|consen  109 LDYVA  113 (503)
T ss_pred             HHHHh
Confidence            87653


No 4  
>PHA02834 chemokine receptor-like protein; Provisional
Probab=99.43  E-value=8.2e-13  Score=98.36  Aligned_cols=81  Identities=20%  Similarity=0.433  Sum_probs=66.7

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCchHHHHHHHHHHHHHHHHhHHHHHHHHHhcCcccccccccccchhh
Q psy5550          42 EMSAEICLLSIIFILSMGANLFVLITVMWYPALQTVTNYFLLNLTAADILFCLSIPGIMYARVSPQWPLGDVLCKIMPYS  121 (127)
Q Consensus        42 ~~~~~~~~~~~i~~~~i~gN~~vl~~~~~~~~~~~~~~~~l~nla~~Dl~~~~~~p~~~~~~~~~~w~~g~~~C~~~~~~  121 (127)
                      ...+..+++.+++++|++||+++++++.++|+ +++.|+|++|||++|++..+.+|+.+.... ++|.+|+..|++..++
T Consensus        27 ~~~~~~~~~~li~v~~~~gN~lVi~vi~~~~~-~~~~n~~i~nLAiaDll~~~~lP~~i~~~~-~~w~~g~~~C~~~~~~  104 (323)
T PHA02834         27 VNYFVIVFYILLFIFGLIGNVLVIAVLIVKRF-MFVVDVYLFNIAMSDLMLVFSFPFIIHNDL-NEWIFGEFMCKLVLGV  104 (323)
T ss_pred             hhhhHHHHHHHHHHHHHhhHHHHHHHHHhccc-cchhhhhhHHHHHHHHHHHHHHHHHHHHHc-CCcCCcchHHHhHHHH
Confidence            34477899999999999999999998887665 467899999999999986449998876554 5799999999998766


Q ss_pred             hhh
Q psy5550         122 QMC  124 (127)
Q Consensus       122 ~~~  124 (127)
                      ...
T Consensus       105 ~~~  107 (323)
T PHA02834        105 YFV  107 (323)
T ss_pred             HHH
Confidence            543


No 5  
>PHA02638 CC chemokine receptor-like protein; Provisional
Probab=99.38  E-value=2.7e-12  Score=98.77  Aligned_cols=81  Identities=27%  Similarity=0.548  Sum_probs=68.8

Q ss_pred             hHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCchHHHHHHHHHHHHHHHHhHHHHHHHHHhcCcccccccccccchh
Q psy5550          41 FEMSAEICLLSIIFILSMGANLFVLITVMWYPALQTVTNYFLLNLTAADILFCLSIPGIMYARVSPQWPLGDVLCKIMPY  120 (127)
Q Consensus        41 ~~~~~~~~~~~~i~~~~i~gN~~vl~~~~~~~~~~~~~~~~l~nla~~Dl~~~~~~p~~~~~~~~~~w~~g~~~C~~~~~  120 (127)
                      ....+..+++.+++++|++||.++++++.+ |++|+++|++++|||++|++..+.+|+++... .++|.+|+..||+..+
T Consensus        96 ~~~~~l~~~y~lvfvlgliGN~LVl~il~~-k~lrt~t~i~llnLAisDLl~~l~lPf~i~~~-~~~W~fg~~~Ck~~~~  173 (417)
T PHA02638         96 SISEYIKIFYIIIFILGLFGNAAIIMILFC-KKIKTITDIYIFNLAISDLIFVIDFPFIIYNE-FDQWIFGDFMCKVISA  173 (417)
T ss_pred             chhhHHHHHHHHHHHHHHHHHHHHHHHHHh-ccCCCHhHHHHHHHHHHHHHHHHHHHHHHHHH-hccccccccchhhHHH
Confidence            345578889999999999999999977654 78899999999999999999866999988765 4689999999999876


Q ss_pred             hhh
Q psy5550         121 SQM  123 (127)
Q Consensus       121 ~~~  123 (127)
                      +..
T Consensus       174 l~~  176 (417)
T PHA02638        174 SYY  176 (417)
T ss_pred             HHH
Confidence            544


No 6  
>PHA03087 G protein-coupled chemokine receptor-like protein; Provisional
Probab=99.35  E-value=2.4e-12  Score=95.84  Aligned_cols=84  Identities=26%  Similarity=0.490  Sum_probs=71.2

Q ss_pred             hHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCchHHHHHHHHHHHHHHHHhHHHHHHHHHhcCcccccccccccchh
Q psy5550          41 FEMSAEICLLSIIFILSMGANLFVLITVMWYPALQTVTNYFLLNLTAADILFCLSIPGIMYARVSPQWPLGDVLCKIMPY  120 (127)
Q Consensus        41 ~~~~~~~~~~~~i~~~~i~gN~~vl~~~~~~~~~~~~~~~~l~nla~~Dl~~~~~~p~~~~~~~~~~w~~g~~~C~~~~~  120 (127)
                      ....+..+++.+++++|++||+++++++.++ ++|++.|+++.|||++|++.++..|........++|.+|+..|++..+
T Consensus        38 ~~~~~~~~~~~~i~~~gl~gN~lvl~~~~~~-~~~~~~~~ll~~laisDll~~~~~~~~~~~~~~~~~~~~~~~C~~~~~  116 (335)
T PHA03087         38 TNSTILIVVYSTIFFFGLVGNIIVIYVLTKT-KIKTPMDIYLLNLAVSDLLFVMTLPFQIYYYILFQWSFGEFACKIVSG  116 (335)
T ss_pred             chhhHHHHHHHHHHHHHHHhhHhEEeeehhc-cccCchHHHHHHHHHHHHHHHHhHHHHHHHHhCCCCCCCcHHHHHHHH
Confidence            3444778889999999999999999888887 889999999999999999887777877766666789999999999888


Q ss_pred             hhhhc
Q psy5550         121 SQMCS  125 (127)
Q Consensus       121 ~~~~~  125 (127)
                      +...+
T Consensus       117 ~~~~~  121 (335)
T PHA03087        117 LYYIG  121 (335)
T ss_pred             HHHHH
Confidence            76543


No 7  
>PHA03235 DNA packaging protein UL33; Provisional
Probab=99.29  E-value=2.3e-11  Score=93.44  Aligned_cols=85  Identities=20%  Similarity=0.213  Sum_probs=63.0

Q ss_pred             hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-cCCC-CchHHHHHHHHHHHHHHHHhHHHHHHHHHhcCccccccccccc
Q psy5550          40 EFEMSAEICLLSIIFILSMGANLFVLITVMW-YPAL-QTVTNYFLLNLTAADILFCLSIPGIMYARVSPQWPLGDVLCKI  117 (127)
Q Consensus        40 ~~~~~~~~~~~~~i~~~~i~gN~~vl~~~~~-~~~~-~~~~~~~l~nla~~Dl~~~~~~p~~~~~~~~~~w~~g~~~C~~  117 (127)
                      ...+.+..+++.+++++|++||++|++++.+ +|++ ++..++|++|||++|++..+.+|+.+... ...|..|...|++
T Consensus        29 ~~~~~~~~~~~~li~vvGiigN~lVL~~~~~~~r~~~~~~~~~~I~NLAvsDLl~l~~lP~~i~~~-~~~~~~g~~~Ck~  107 (409)
T PHA03235         29 SAARTTETFINLLIISVGGPLNLIVLVTQLLANRVHGFSTPTLYMTNLYLANLLTVFVLPFIMLSN-QGLLSGSVAGCKF  107 (409)
T ss_pred             hhhHhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCCccHHHHHHHHHHHHHHHHHHHHHHHhc-CccccCCCCeehh
Confidence            3455688999999999999999999987543 3332 35668999999999998744999887532 1223445789999


Q ss_pred             chhhhhhc
Q psy5550         118 MPYSQMCS  125 (127)
Q Consensus       118 ~~~~~~~~  125 (127)
                      ..++...+
T Consensus       108 ~~~l~~~~  115 (409)
T PHA03235        108 ASLLYYAS  115 (409)
T ss_pred             HHHHHHHH
Confidence            98876654


No 8  
>PF00001 7tm_1:  7 transmembrane receptor (rhodopsin family) Rhodopsin-like GPCR superfamily signature 5-hydroxytryptamine 7 receptor signature bradykinin receptor signature gastrin receptor signature melatonin receptor signature olfactory receptor signature;  InterPro: IPR000276 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The rhodopsin-like GPCRs themselves represent a widespread protein family that includes hormone, neurotransmitter and light receptors, all of which transduce extracellular signals through interaction with guanine nucleotide-binding (G) proteins. Although their activating ligands vary widely in structure and character, the amino acid sequences of the receptors are very similar and are believed to adopt a common structural framework comprising 7 transmembrane (TM) helices [, , ].; GO: 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane; PDB: 2KI9_A 3QAK_A 2YDV_A 3VGA_A 3PWH_A 3RFM_A 3EML_A 3VG9_A 3REY_A 3UZA_A ....
Probab=99.02  E-value=5e-10  Score=78.51  Aligned_cols=66  Identities=32%  Similarity=0.592  Sum_probs=58.3

Q ss_pred             HHHHHHHHHhhcCCCCchHHHHHHHHHHHHHHHHh-HHHHHHHHHhcCcccccccccccchhhhhhc
Q psy5550          60 ANLFVLITVMWYPALQTVTNYFLLNLTAADILFCL-SIPGIMYARVSPQWPLGDVLCKIMPYSQMCS  125 (127)
Q Consensus        60 gN~~vl~~~~~~~~~~~~~~~~l~nla~~Dl~~~~-~~p~~~~~~~~~~w~~g~~~C~~~~~~~~~~  125 (127)
                      ||.+++.++.++|++|++.++++.|||++|++.++ ..|........++|.++...|++..++...+
T Consensus         1 GN~lvi~~~~~~~~~~~~~~~~l~~Lav~Dll~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~   67 (257)
T PF00001_consen    1 GNILVILVILRSKRLRTPSNILLLNLAVADLLVGLFCIPFYIYSLLFDDWIFSSFLCRIFGFLFYFS   67 (257)
T ss_dssp             HHHHHHHHHHHSGGG-SHHHHHHHHHHHHHHHHHHTHHHHHHHHHHHSSCTSHHHHHHHHHHHHHHH
T ss_pred             CchhehhhhhhhccCCChhHHHHHHHHHHHHhhcccccccccccccccccccccccccccccccccc
Confidence            89999999999999999999999999999999999 8888877777678999999999998876543


No 9  
>KOG2087|consensus
Probab=98.21  E-value=6.5e-07  Score=67.17  Aligned_cols=84  Identities=18%  Similarity=0.207  Sum_probs=64.7

Q ss_pred             hHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCchHHHHHHHHHHHHHHHHh-HHHHHHHHHh-cC-------ccccc
Q psy5550          41 FEMSAEICLLSIIFILSMGANLFVLITVMWYPALQTVTNYFLLNLTAADILFCL-SIPGIMYARV-SP-------QWPLG  111 (127)
Q Consensus        41 ~~~~~~~~~~~~i~~~~i~gN~~vl~~~~~~~~~~~~~~~~l~nla~~Dl~~~~-~~p~~~~~~~-~~-------~w~~g  111 (127)
                      ....++.+..++++.++++||.+|++.+...|...++..+++.|||++|+++++ ..-+..++.. .+       .|..|
T Consensus        22 lg~~~lRi~vW~i~~lAi~gN~~Vl~~~~~~~~~~~~~~~li~~la~ad~~mGiYl~~ia~vD~~~~gey~~~ai~W~tg  101 (363)
T KOG2087|consen   22 LGYWILRISVWVIALLAIVGNLLVLLTRFTSRYELNSHRFLICNLAFADLLMGIYLGLIASVDAKTRGEYYKHAIDWQTG  101 (363)
T ss_pred             hccceeeehhhhhhhHHhccCeeeeeeeeehhhhccchHHHHHHHHHHHHHcchHHHHHHHhhHHHHHHHHHHHHhhhhc
Confidence            333466677788899999999999988888888778899999999999999998 4444444332 22       27655


Q ss_pred             ccccccchhhhhhc
Q psy5550         112 DVLCKIMPYSQMCS  125 (127)
Q Consensus       112 ~~~C~~~~~~~~~~  125 (127)
                       ..|++.+|+.+++
T Consensus       102 -~gC~~aGflavFA  114 (363)
T KOG2087|consen  102 -LGCPVAGFLAVFA  114 (363)
T ss_pred             -CCCchHHHHHHHH
Confidence             7899999987765


No 10 
>PF10320 7TM_GPCR_Srsx:  Serpentine type 7TM GPCR chemoreceptor Srsx;  InterPro: IPR019424 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class sx (Srsx), which is a solo family amongst the superfamilies of chemoreceptors.
Probab=98.16  E-value=9.1e-07  Score=64.18  Aligned_cols=67  Identities=16%  Similarity=0.288  Sum_probs=53.1

Q ss_pred             HHHHHHHHHHHHHHhhcCCCCchHHHHHHHHHHHHHHHHh-HHHHHHHHHhcCcccccccccccchhhhh
Q psy5550          55 ILSMGANLFVLITVMWYPALQTVTNYFLLNLTAADILFCL-SIPGIMYARVSPQWPLGDVLCKIMPYSQM  123 (127)
Q Consensus        55 ~~~i~gN~~vl~~~~~~~~~~~~~~~~l~nla~~Dl~~~~-~~p~~~~~~~~~~w~~g~~~C~~~~~~~~  123 (127)
                      ++|+.||..++.++.++|++|++.++++..+|++|++... .+|..+... .+ -......|-.+.+...
T Consensus         2 ~ig~~gN~~~i~~~~~~~~Lrs~~~~li~~~~~~d~~~~~~~~~~~~~~~-~~-~~i~~~~Cf~~~~~~~   69 (257)
T PF10320_consen    2 IIGLFGNLLLIILIFRNKSLRSPCYILICILCFADLICLLGTLPFMLFLF-RD-HQITRSECFWQIFFYI   69 (257)
T ss_pred             EEEEEccHHHHHHHHhccccccchHHHHHHHHHHHHHHHhhHHHHHHHHH-hh-eeccHHHHHHHHHHHH
Confidence            4689999999999999999999999999999999999999 888776443 22 2345566766555443


No 11 
>PF11710 Git3:  G protein-coupled glucose receptor regulating Gpa2;  InterPro: IPR023041 This entry contains a functionally uncharacterised region belonging to the Git3 G-protein coupled receptor. Git3 is one of six proteins required for glucose-triggered adenylate cyclase activation, and is a G protein-coupled receptor responsible for the activation of adenylate cyclase through Gpa2 - heterotrimeric G protein alpha subunit, part of the glucose-detection pathway. Git3 contains seven predicted transmembrane domains, a third cytoplasmic loop and a cytoplasmic tail []. This is the conserved N-terminal domain of the member proteins. 
Probab=97.26  E-value=0.0023  Score=45.02  Aligned_cols=53  Identities=13%  Similarity=0.088  Sum_probs=39.4

Q ss_pred             CCCCchHHHHHHHHHHHHHHHHh-HHHHHHHHHhcCcccccccccccchhhhhh
Q psy5550          72 PALQTVTNYFLLNLTAADILFCL-SIPGIMYARVSPQWPLGDVLCKIMPYSQMC  124 (127)
Q Consensus        72 ~~~~~~~~~~l~nla~~Dl~~~~-~~p~~~~~~~~~~w~~g~~~C~~~~~~~~~  124 (127)
                      +++|...+-++.||.++|++.++ .+...+.....+.-.-+...|..++++.-.
T Consensus        30 ~r~~~fR~~LIl~L~~aD~~qal~~~i~~~~~l~~~~i~~~s~~C~aqGf~~q~   83 (201)
T PF11710_consen   30 YRRRSFRHQLILNLLLADFIQALAFLISPIRWLARGGIIAPSPFCQAQGFFLQV   83 (201)
T ss_pred             hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeCCCCchhhhHHHHHH
Confidence            55567778899999999999999 555455555444444567899999997644


No 12 
>PF05462 Dicty_CAR:  Slime mold cyclic AMP receptor
Probab=96.87  E-value=0.008  Score=44.88  Aligned_cols=77  Identities=19%  Similarity=0.284  Sum_probs=59.0

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCchHHHHHHHHHHHHHHHHh-HHHHHHHHHhcCcccccccccccchhhhh
Q psy5550          45 AEICLLSIIFILSMGANLFVLITVMWYPALQTVTNYFLLNLTAADILFCL-SIPGIMYARVSPQWPLGDVLCKIMPYSQM  123 (127)
Q Consensus        45 ~~~~~~~~i~~~~i~gN~~vl~~~~~~~~~~~~~~~~l~nla~~Dl~~~~-~~p~~~~~~~~~~w~~g~~~C~~~~~~~~  123 (127)
                      ...++..+..+++++|-+.++..+++.|++|++.+-++.-++++|++..+ .......    ..-.-+...|++++++..
T Consensus         8 ~~~~i~~~~s~lSllGclfiI~tf~~~k~~r~~~~rli~yl~~~~ll~~v~~~~~~~~----~~~~~~s~lC~~Qafliq   83 (303)
T PF05462_consen    8 TLYAIELVASVLSLLGCLFIIITFCLFKRLRKPINRLIFYLSIANLLTNVASMIMTLS----PSAGENSFLCQFQAFLIQ   83 (303)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCccHHHHHHHHHHHHHHHHHHHHHHHhc----ccCCCCCcchhhHhHHHH
Confidence            56666777788999999999999999999999999999999999999776 4432221    111234678999998765


Q ss_pred             hc
Q psy5550         124 CS  125 (127)
Q Consensus       124 ~~  125 (127)
                      ..
T Consensus        84 ~f   85 (303)
T PF05462_consen   84 FF   85 (303)
T ss_pred             Hh
Confidence            43


No 13 
>PF05296 TAS2R:  Mammalian taste receptor protein (TAS2R);  InterPro: IPR007960 This family consists of several forms of mammalian taste receptor proteins (TAS2Rs). TAS2Rs are G protein-coupled receptors expressed in subsets of taste receptor cells of the tongue and palate epithelia and are organised in the genome in clusters. The proteins are genetically linked to loci that influence bitter perception in mice and humans [].; GO: 0004930 G-protein coupled receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0050909 sensory perception of taste, 0016021 integral to membrane
Probab=96.61  E-value=0.026  Score=42.00  Aligned_cols=74  Identities=16%  Similarity=0.216  Sum_probs=48.3

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHh---hcCCCCchHHHHHHHHHHHHHHHHh-HHHHHHHHHhcCcccccccccccc
Q psy5550          45 AEICLLSIIFILSMGANLFVLITVM---WYPALQTVTNYFLLNLTAADILFCL-SIPGIMYARVSPQWPLGDVLCKIM  118 (127)
Q Consensus        45 ~~~~~~~~i~~~~i~gN~~vl~~~~---~~~~~~~~~~~~l~nla~~Dl~~~~-~~p~~~~~~~~~~w~~g~~~C~~~  118 (127)
                      +..++..+.+++|+.||+.++.+-+   ++++.-.|.+..+.+||++.++.-. ..-......+..+.......++..
T Consensus         8 i~~~i~~~~~~~Gi~~N~FI~~vn~~~w~k~~~l~~~d~IL~~La~sr~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~   85 (303)
T PF05296_consen    8 IFLIILVVEFIIGILGNGFIVLVNCSDWVKSRKLSPSDQILTSLAISRILLQWVILLNSFLSFFFPNIYFSENVYKII   85 (303)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHhHHHHHcCCCCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHcchhhhhhhHHHHH
Confidence            5677778889999999997765544   2333346899999999999999877 443333333333322333344443


No 14 
>PF10328 7TM_GPCR_Srx:  Serpentine type 7TM GPCR chemoreceptor Srx;  InterPro: IPR019430 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class x (Srx) from the Srg superfamily [, ]. Srg receptors contain seven hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures. 
Probab=95.55  E-value=0.047  Score=39.75  Aligned_cols=42  Identities=24%  Similarity=0.395  Sum_probs=39.2

Q ss_pred             HHHHHHHHHHHHHHHHhhcCCCCchHHHHHHHHHHHHHHHHh
Q psy5550          53 IFILSMGANLFVLITVMWYPALQTVTNYFLLNLTAADILFCL   94 (127)
Q Consensus        53 i~~~~i~gN~~vl~~~~~~~~~~~~~~~~l~nla~~Dl~~~~   94 (127)
                      +.++|++.|.+++..+.+.+++|++-+.+..+.|++|.+...
T Consensus         3 ~s~~G~~~N~~v~~~~~~~~~~~~sF~~l~~~~a~~n~i~~~   44 (274)
T PF10328_consen    3 ISIIGIILNWLVFIIIFKLKSLRNSFGILCASQAIANIIICL   44 (274)
T ss_pred             eeHHHHHHHHHHHHHHHhcccccCCHHHHHHHHHHHHHHHHH
Confidence            467899999999999999999999999999999999999887


No 15 
>PF10324 7TM_GPCR_Srw:  Serpentine type 7TM GPCR chemoreceptor Srw;  InterPro: IPR019427 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class w (Srw), which is a solo family amongst the superfamilies of chemoreceptors. The genes encoding Srw do not appear to be under as strong an adaptive evolutionary pressure as those of Srz []. 
Probab=95.53  E-value=0.024  Score=41.94  Aligned_cols=51  Identities=20%  Similarity=0.388  Sum_probs=41.1

Q ss_pred             HHHHHHHHHHHHHHHHhhcCCCCc-hHHHHHHHHHHHHHHHHh-HHHHHHHHHh
Q psy5550          53 IFILSMGANLFVLITVMWYPALQT-VTNYFLLNLTAADILFCL-SIPGIMYARV  104 (127)
Q Consensus        53 i~~~~i~gN~~vl~~~~~~~~~~~-~~~~~l~nla~~Dl~~~~-~~p~~~~~~~  104 (127)
                      +.++|+++|..-+.++. +|++|+ +.|.+++.+|++|++... ..+..+....
T Consensus         6 ~~~~g~~~N~~h~~VLt-rk~mR~~~in~~l~~Iai~Dl~~~~~~~~~~~~~~~   58 (318)
T PF10324_consen    6 LSIFGLFINIFHLIVLT-RKSMRSSSINILLIGIAICDLLYMLSILIWELFFFI   58 (318)
T ss_pred             EeHHHHHHHHHHhhhcC-ChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence            46789999998887664 466675 899999999999999999 8887765544


No 16 
>PF10321 7TM_GPCR_Srt:  Serpentine type 7TM GPCR chemoreceptor Srt;  InterPro: IPR019425  Chemoreception is mediated in Caenorhabditis elegans by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs) of proteins which are of the serpentine type []. Srt is a member of the Srg superfamily of chemoreceptors. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. 
Probab=95.38  E-value=0.12  Score=38.77  Aligned_cols=77  Identities=13%  Similarity=0.059  Sum_probs=54.3

Q ss_pred             hHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCchHHHHHHHHHHHHHHHHh-HHHHHHHHHhcCcccccccccccch
Q psy5550          41 FEMSAEICLLSIIFILSMGANLFVLITVMWYPALQTVTNYFLLNLTAADILFCL-SIPGIMYARVSPQWPLGDVLCKIMP  119 (127)
Q Consensus        41 ~~~~~~~~~~~~i~~~~i~gN~~vl~~~~~~~~~~~~~~~~l~nla~~Dl~~~~-~~p~~~~~~~~~~w~~g~~~C~~~~  119 (127)
                      ..+..++..+.+.+++..+-....+.++.+++.+|.+.+-.+.-||+.|++... ..-..-...     ..|..+|+...
T Consensus        30 ~~~p~~G~~~~~~g~~~~~lY~p~~~~i~~~~~~k~~~ykiM~~L~i~Di~~l~~~si~tG~l~-----i~G~vfC~~P~  104 (313)
T PF10321_consen   30 VKRPILGIYFLIFGIIIIILYIPCLIAIFKKKLFKMSCYKIMFFLAIFDIIQLFINSIITGILA-----IFGAVFCSYPR  104 (313)
T ss_pred             CcccchhHHHHHHHHHHHHHHHHHHHHHHHhccccCcHHHHHHHHHHHHHHHHHhhhhhhhHHH-----hcCccccCCch
Confidence            334467777777788888888888888888888889999999999999999886 322222222     23456666555


Q ss_pred             hhh
Q psy5550         120 YSQ  122 (127)
Q Consensus       120 ~~~  122 (127)
                      +..
T Consensus       105 ~~~  107 (313)
T PF10321_consen  105 FIY  107 (313)
T ss_pred             Hhh
Confidence            433


No 17 
>PF03402 V1R:  Vomeronasal organ pheromone receptor family, V1R;  InterPro: IPR004072 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The rhodopsin-like GPCRs themselves represent a widespread protein family that includes hormone, neurotransmitter and light receptors, all of which transduce extracellular signals through interaction with guanine nucleotide-binding (G) proteins. Although their activating ligands vary widely in structure and character, the amino acid sequences of the receptors are very similar and are believed to adopt a common structural framework comprising 7 transmembrane (TM) helices [, , ]. Pheromones have evolved in all animal phyla, to signal sex and dominance status, and are responsible for stereotypical social and sexual behaviour among members of the same species. In mammals, these chemical signals are believed to be detected primarily by the vomeronasal organ (VNO), a chemosensory organ located at the base of the nasal septum []. The VNO is present in most amphibia, reptiles and non-primate mammals but is absent in birds, adult catarrhine monkeys and apes []. 
An active role for the human VNO in the detection of pheromones is disputed; the VNO is clearly present in the foetus but appears to be atrophied or absent in adults. Three distinct families of putative pheromone receptors have been identified in the vomeronasal organ (V1Rs, V2Rs and V3Rs). All are G protein-coupled receptors but are only distantly related to the receptors of the main olfactory system, highlighting their different role []. The V1 receptors share between 50 and 90% sequence identity but have little similarity to other families of G protein-coupled receptors. They appear to be distantly related to the mammalian T2R bitter taste receptors and the rhodopsin-like GPCRs []. In rat, the family comprises 30-40 genes. These are expressed in the apical regions of the VNO, in neurons expressing Gi2. Coupling of the receptors to this protein mediates inositol trisphosphate signalling []. A number of human V1 receptor homologues have also been found. The majority of these human sequences are pseudogenes [] but an apparently functional receptor has been identified that is expressed in the human olfactory system [].; GO: 0016503 pheromone receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane
Probab=95.05  E-value=0.052  Score=39.82  Aligned_cols=51  Identities=27%  Similarity=0.289  Sum_probs=38.4

Q ss_pred             CCCCchHHHHHHHHHHHHHHHHh--HHHHHHHHHhcCcccccccccccchhhhhh
Q psy5550          72 PALQTVTNYFLLNLTAADILFCL--SIPGIMYARVSPQWPLGDVLCKIMPYSQMC  124 (127)
Q Consensus        72 ~~~~~~~~~~l~nla~~Dl~~~~--~~p~~~~~~~~~~w~~g~~~C~~~~~~~~~  124 (127)
                      .++.+|.+..+.|||+++.+..+  .+|........ + .+++..||+..|+.=+
T Consensus         5 ~~r~kp~dlIl~hLa~aN~lvLl~rGip~~~~~~~~-~-~~~d~gCK~v~Y~~RV   57 (265)
T PF03402_consen    5 GHRLKPIDLILIHLALANILVLLSRGIPQTMAFFGW-K-FFDDIGCKIVFYIYRV   57 (265)
T ss_pred             CCCCCcHHHHHHHHHHHHHHHHHHhhHHHHHHHhhc-c-cCCCceeeeeeeehHH
Confidence            34468999999999999999998  88865433222 2 4689999999887543


No 18 
>PF10317 7TM_GPCR_Srd:  Serpentine type 7TM GPCR chemoreceptor Srd;  InterPro: IPR019421 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents the chemoreceptor Srd []. 
Probab=91.65  E-value=0.72  Score=33.97  Aligned_cols=46  Identities=22%  Similarity=0.353  Sum_probs=36.0

Q ss_pred             HHHHHHHHHHHHHHHHHHHHhhcC-CCCchHHHHHHHHHHHHHHHHh
Q psy5550          49 LLSIIFILSMGANLFVLITVMWYP-ALQTVTNYFLLNLTAADILFCL   94 (127)
Q Consensus        49 ~~~~i~~~~i~gN~~vl~~~~~~~-~~~~~~~~~l~nla~~Dl~~~~   94 (127)
                      ++.+.+.+|+..|.+.+.++.++. +.-+...+++.|-|+.|++...
T Consensus         4 ~~~~~~~~~~~~n~~Ll~~i~~~tp~~l~~~~~~l~~~~~~~~~~~~   50 (292)
T PF10317_consen    4 YHPIFFILGIILNILLLYLIIFKTPKSLRTYSILLLNTAIFDLISII   50 (292)
T ss_pred             eHHHHHHHHHHHHHHHHHHHHHhChHHHHHHHHHHHHHHHHHHHHHH
Confidence            456778899999998886666533 3335678999999999999887


No 19 
>PF00002 7tm_2:  7 transmembrane receptor (Secretin family);  InterPro: IPR000832 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The secretin-like GPCRs include secretin [], calcitonin [], parathyroid hormone/parathyroid hormone-related peptides [] and vasoactive intestinal peptide [], all of which activate adenylyl cyclase and the phosphatidyl-inositol-calcium pathway. These receptors contain seven transmembrane regions, in a manner reminiscent of the rhodopsins and other receptors believed to interact with G-proteins (however, there is no significant sequence identity between these families; the secretin-like receptors thus bear their own unique '7TM' signature). Their N terminus is probably located on the extracellular side of the membrane and potentially glycosylated. This N-terminal region contains a long conserved region which allows the binding of large peptidic ligands such as glucagon, secretin, VIP and PACAP; this region contains five conserved cysteine residues which could be involved in disulphide bonds. The C-terminal region of these receptors is probably cytoplasmic. 
Every receptor gene in this family is encoded on multiple exons, and several of these genes are alternatively spliced to yield functionally distinct products. ; GO: 0004930 G-protein coupled receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane; PDB: 3L2J_A 1BL1_A.
Probab=85.48  E-value=0.59  Score=33.10  Aligned_cols=73  Identities=23%  Similarity=0.275  Sum_probs=2.1

Q ss_pred             HHHHHHHHHHHHHHHHHHhhcCCCCchHHHHHHHHHHHHHHHHh-HHHHHHHHHhcCcccccccccccchhhhhh
Q psy5550          51 SIIFILSMGANLFVLITVMWYPALQTVTNYFLLNLTAADILFCL-SIPGIMYARVSPQWPLGDVLCKIMPYSQMC  124 (127)
Q Consensus        51 ~~i~~~~i~gN~~vl~~~~~~~~~~~~~~~~l~nla~~Dl~~~~-~~p~~~~~~~~~~w~~g~~~C~~~~~~~~~  124 (127)
                      .+-..+++++=.+++......|++|+..+....||++++++..+ .+.- ......+.-...+..|+..+.+.+.
T Consensus         8 ~vg~~~Si~~ll~~i~~~~~~r~lr~~~~~i~~~l~~sll~~~~~~l~~-~~~~~~~~~~~~~~~C~~~a~~~hy   81 (242)
T PF00002_consen    8 YVGCSLSIICLLLTIITYLLFRKLRSFRNKIHLNLCLSLLLANLSFLIG-ISQTFSPISTTNHCLCRAIAILLHY   81 (242)
T ss_dssp             HHHHH----------------------------------------------------------------------
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHhhcccchhhhhhhHHHHHHHHHHHhee-hhhccccccccccccchhhhhHhHH
Confidence            34444555555555555556677788778888999999988776 3222 1111111111112349888876554


No 20 
>PF01102 Glycophorin_A:  Glycophorin A;  InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=59.27  E-value=14  Score=23.92  Aligned_cols=9  Identities=0%  Similarity=-0.174  Sum_probs=3.2

Q ss_pred             HHHHhhcCC
Q psy5550          65 LITVMWYPA   73 (127)
Q Consensus        65 l~~~~~~~~   73 (127)
                      .+++.|.|+
T Consensus        85 ~y~irR~~K   93 (122)
T PF01102_consen   85 SYCIRRLRK   93 (122)
T ss_dssp             HHHHHHHS-
T ss_pred             HHHHHHHhc
Confidence            344444333


No 21 
>PF10327 7TM_GPCR_Sri:  Serpentine type 7TM GPCR chemoreceptor Sri;  InterPro: IPR019429 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents Sri, which is part of the Str superfamily of chemoreceptors.
Probab=53.58  E-value=19  Score=26.79  Aligned_cols=62  Identities=21%  Similarity=0.222  Sum_probs=44.1

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHh-hcCCCCchHHHHHH---HHHHHHHHHHh-HHHHHHHHHhcC
Q psy5550          45 AEICLLSIIFILSMGANLFVLITVM-WYPALQTVTNYFLL---NLTAADILFCL-SIPGIMYARVSP  106 (127)
Q Consensus        45 ~~~~~~~~i~~~~i~gN~~vl~~~~-~~~~~~~~~~~~l~---nla~~Dl~~~~-~~p~~~~~~~~~  106 (127)
                      ++...+-+++.+++.-|.+.+..+. +.+++++-.++++.   ...+.|+-.+. ..|..+.....|
T Consensus        10 ~li~~~~~ig~iS~~~n~~~iyLi~fks~k~~~fry~ll~~Qi~~~l~di~~t~L~qpipLfP~~ag   76 (303)
T PF10327_consen   10 WLINYYHIIGVISFILNSLGIYLIIFKSPKLDNFRYYLLYFQISCTLTDIHLTFLMQPIPLFPIPAG   76 (303)
T ss_pred             HHHHHHHHHHHHHHHHHHHHheeEEEecCCccchhhHHHHHHHHHHHhhhhhhhhccchhhcceeEE
Confidence            5667788889999999998885554 55555554444432   35669999998 888887776544


No 22 
>PF10316 7TM_GPCR_Srbc:  Serpentine type 7TM GPCR chemoreceptor Srbc ;  InterPro: IPR019420 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class b (Srb) from the Sra superfamily []. Srb receptors contain 6-8 hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures. 
Srbc is a solo family amongst the superfamilies of chemoreceptors.
Probab=45.95  E-value=1.1e+02  Score=22.62  Aligned_cols=56  Identities=13%  Similarity=0.190  Sum_probs=36.6

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHhhcCCCCchHHHHHHHHHHHHHHHHh-HHHHHHHH
Q psy5550          47 ICLLSIIFILSMGANLFVLITVMWYPALQTVTNYFLLNLTAADILFCL-SIPGIMYA  102 (127)
Q Consensus        47 ~~~~~~i~~~~i~gN~~vl~~~~~~~~~~~~~~~~l~nla~~Dl~~~~-~~p~~~~~  102 (127)
                      ..+-.+...+....|...+..+.+.|+.|++--.++---...|.+.+. ..+.....
T Consensus         9 ~~i~i~~s~~~~~iN~~lL~~if~~Kk~kk~~l~LfY~Rf~~D~~~~~~~~~~~~~~   65 (273)
T PF10316_consen    9 SIIGIIFSIITCLINFYLLYSIFYSKKKKKPDLSLFYFRFAIDVFYGFSVFIYLIYY   65 (273)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHhccccCCCCEEeeHHHHHHHHHHHHHHHHHHHHH
Confidence            334445567778889988888886666445444444446788999998 55544433


No 23 
>PF09882 DUF2109:  Predicted membrane protein (DUF2109);  InterPro: IPR019214  This entry is found in various hypothetical archaeal proteins and has no known function. 
Probab=45.84  E-value=64  Score=19.20  Aligned_cols=46  Identities=11%  Similarity=0.078  Sum_probs=30.1

Q ss_pred             HHHHHHHHHHHHHhhcCCCCchHHHHHHHHHHHHHHHHh-HHHHHHH
Q psy5550          56 LSMGANLFVLITVMWYPALQTVTNYFLLNLTAADILFCL-SIPGIMY  101 (127)
Q Consensus        56 ~~i~gN~~vl~~~~~~~~~~~~~~~~l~nla~~Dl~~~~-~~p~~~~  101 (127)
                      .|+++=...+-++.-+.+.++..+.-.+|-+++-++... --|+...
T Consensus         6 ~g~Iai~~~iR~~~~~~r~~KL~yLnv~~F~iaalIaL~i~~P~g~i   52 (78)
T PF09882_consen    6 IGIIAILMAIRIFLTKSRARKLLYLNVINFAIAALIALYIKSPMGAI   52 (78)
T ss_pred             HHHHHHHHHHHHHHhHhHHHhhhHHHHHHHHHHHHHHHHhCCcHHHH
Confidence            344444444545555555677788888899999888877 6665543


No 24 
>PF10323 7TM_GPCR_Srv:  Serpentine type 7TM GPCR chemoreceptor Srv;  InterPro: IPR019426 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae.  This entry represents serpentine receptor class v (Srv) from the Srg superfamily [, ]. Srg receptors contain seven hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures. 
Probab=45.03  E-value=36  Score=24.97  Aligned_cols=35  Identities=23%  Similarity=0.326  Sum_probs=25.1

Q ss_pred             HHHHHHHHHhhcCC----CCchHHHHHHHHHHHHHHHHh
Q psy5550          60 ANLFVLITVMWYPA----LQTVTNYFLLNLTAADILFCL   94 (127)
Q Consensus        60 gN~~vl~~~~~~~~----~~~~~~~~l~nla~~Dl~~~~   94 (127)
                      =...++..+.+.|+    .+++-+.++.+-+++|++..+
T Consensus        11 ly~~il~~l~~~r~~~~~~~~~Fy~l~~~~~iaDi~~~~   49 (283)
T PF10323_consen   11 LYIFILYCLLKLRKRSKTFKSTFYTLLIQHCIADILSML   49 (283)
T ss_pred             HHHHHHHHHHHcccCccccCCHHHHHHHHHHHHHHHHHH
Confidence            34444444554443    458899999999999999886


No 25 
>PF12304 BCLP:  Beta-casein like protein;  InterPro: IPR020977  This entry represents eukaryotic proteins that are typically between 216 and 240 amino acids in length and have two conserved sequence motifs: VLR and TRIY. Beta-casein-like protein is associated with cell morphology and regulation of the growth pattern of tumours. It is found in adenocarcinomas of uterine cervical tissues []. 
Probab=44.02  E-value=1.1e+02  Score=21.40  Aligned_cols=51  Identities=12%  Similarity=0.098  Sum_probs=32.3

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCchHHHHHHHHHHHHHHHHh
Q psy5550          42 EMSAEICLLSIIFILSMGANLFVLITVMWYPALQTVTNYFLLNLTAADILFCL   94 (127)
Q Consensus        42 ~~~~~~~~~~~i~~~~i~gN~~vl~~~~~~~~~~~~~~~~l~nla~~Dl~~~~   94 (127)
                      ++.+.-++-+.-.++++.+.+..+ +..|+++ +++..+-++.+++...+.+.
T Consensus        39 eY~vsNiisv~Sgll~I~~GI~AI-vlSrnl~-~~~L~W~Ll~~S~ln~LlSa   89 (188)
T PF12304_consen   39 EYAVSNIISVTSGLLSIICGIVAI-VLSRNLR-NRPLHWTLLVVSLLNALLSA   89 (188)
T ss_pred             hhhHHHHHHHHHHHHHHHHhHHHH-hhhccCC-CCcchHHHHHHHHHHHHHHH
Confidence            333445555566777777666544 4567776 46677777777777666665


No 26 
>PF10873 DUF2668:  Protein of unknown function (DUF2668);  InterPro: IPR022640  Members in this family of proteins are annotated as cysteine and tyrosine-rich protein 1, however currently no function is known []. 
Probab=41.34  E-value=13  Score=24.97  Aligned_cols=35  Identities=11%  Similarity=0.199  Sum_probs=22.7

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCc
Q psy5550          42 EMSAEICLLSIIFILSMGANLFVLITVMWYPALQT   76 (127)
Q Consensus        42 ~~~~~~~~~~~i~~~~i~gN~~vl~~~~~~~~~~~   76 (127)
                      ...+..+++.+++++|+++-..+....+.++++++
T Consensus        60 gtAIaGIVfgiVfimgvva~i~icvCmc~kn~rgs   94 (155)
T PF10873_consen   60 GTAIAGIVFGIVFIMGVVAGIAICVCMCMKNSRGS   94 (155)
T ss_pred             cceeeeeehhhHHHHHHHHHHHHHHhhhhhcCCCc
Confidence            33456678888888888887766555554444333


No 27 
>PF04789 DUF621:  Protein of unknown function (DUF621);  InterPro: IPR006874 This is a conserved region found in uncharacterised proteins from Caenorhabditis elegans, and is noted to have possible G-protein-coupled receptor-like activity.
Probab=38.65  E-value=1.6e+02  Score=22.21  Aligned_cols=48  Identities=17%  Similarity=0.417  Sum_probs=29.9

Q ss_pred             HHHHHHHHHHHhhcCCCCchHHHHHHHHHHHHHHHHh-----HHHHHHHHHhc
Q psy5550          58 MGANLFVLITVMWYPALQTVTNYFLLNLTAADILFCL-----SIPGIMYARVS  105 (127)
Q Consensus        58 i~gN~~vl~~~~~~~~~~~~~~~~l~nla~~Dl~~~~-----~~p~~~~~~~~  105 (127)
                      +.|-.+|+..+.+.+=++-+..+|+.+|..+-++.+.     .+|..+.+.+.
T Consensus        29 Lt~~Flv~~i~lW~~Fk~m~ffwFl~qlt~s~fi~S~lNl~inVPatlfsl~t   81 (305)
T PF04789_consen   29 LTGAFLVLSIILWSHFKPMKFFWFLTQLTISVFIISSLNLLINVPATLFSLIT   81 (305)
T ss_pred             HHHHHHHHHHHHHHhcccchHHHHHHHHHHHHHHHHhhhheEeCcHHHHHhhh
Confidence            3344455554444443345678999999999888774     45655544443


No 28 
>PF10329 DUF2417:  Region of unknown function (DUF2417);  InterPro: IPR019431  This entry represents a family of fungal proteins with no known function. In some cases these proteins also contain an alpha/beta hydrolase fold (IPR000073 from INTERPRO). 
Probab=38.25  E-value=77  Score=22.97  Aligned_cols=37  Identities=27%  Similarity=0.298  Sum_probs=16.1

Q ss_pred             HHHHHHHHHHHhhcCCCCchHHHHHHHHHHHHHHHHh
Q psy5550          58 MGANLFVLITVMWYPALQTVTNYFLLNLTAADILFCL   94 (127)
Q Consensus        58 i~gN~~vl~~~~~~~~~~~~~~~~l~nla~~Dl~~~~   94 (127)
                      +..|...++.+.-..+.-+..++.++-|-+.|++..+
T Consensus        84 l~~~~~~L~Ff~vpS~~~r~l~~vl~~Lllvdlilil  120 (232)
T PF10329_consen   84 LITNLFNLWFFGVPSKLERILNIVLAGLLLVDLILIL  120 (232)
T ss_pred             HHHHHHHHHheecCcHHHHHHHHHHHHHHHHHHHHHH
Confidence            3344444433333333333445555555555555444


No 29 
>PF02532 PsbI:  Photosystem II reaction centre I protein (PSII 4.8 kDa protein);  InterPro: IPR003686 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection [].  This family represents the low molecular weight transmembrane protein PsbI, which is tightly associated with the D1/D2 heterodimer in PSII. 
The function of PsbI is unknown, but it may be involved in the assembly, dimerisation or stabilisation of PSII dimers [].; GO: 0015979 photosynthesis, 0009523 photosystem II, 0009539 photosystem II reaction center, 0016020 membrane; PDB: 3A0H_i 3ARC_I 3A0B_i 3BZ2_I 3PRQ_I 3KZI_I 3PRR_I 2AXT_i 4FBY_I 1S5L_i ....
Probab=34.97  E-value=63  Score=16.20  Aligned_cols=17  Identities=18%  Similarity=0.397  Sum_probs=9.8

Q ss_pred             HHHHHHHHHHHHHHHHH
Q psy5550          45 AEICLLSIIFILSMGAN   61 (127)
Q Consensus        45 ~~~~~~~~i~~~~i~gN   61 (127)
                      ....+++.++++|.+.|
T Consensus        10 ~vV~ffv~LFifGflsn   26 (36)
T PF02532_consen   10 TVVIFFVSLFIFGFLSN   26 (36)
T ss_dssp             HHHHHHHHHHHHHHHTT
T ss_pred             hhHHHHHHHHhccccCC
Confidence            34455556666666655


No 30 
>PF11446 DUF2897:  Protein of unknown function (DUF2897);  InterPro: IPR021550  This is a bacterial family of uncharacterised proteins. 
Probab=34.20  E-value=61  Score=17.91  Aligned_cols=10  Identities=20%  Similarity=0.129  Sum_probs=7.4

Q ss_pred             HHHHHHHHHH
Q psy5550          57 SMGANLFVLI   66 (127)
Q Consensus        57 ~i~gN~~vl~   66 (127)
                      -++||+.++-
T Consensus        15 vIigNia~LK   24 (55)
T PF11446_consen   15 VIIGNIAALK   24 (55)
T ss_pred             HHHhHHHHHH
Confidence            4789998763


No 31 
>TIGR01477 RIFIN variant surface antigen, rifin family. This model represents the rifin branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of rifin sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 20 bits.
Probab=34.14  E-value=63  Score=24.92  Aligned_cols=30  Identities=20%  Similarity=0.269  Sum_probs=19.7

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHhhcCCCCc
Q psy5550          47 ICLLSIIFILSMGANLFVLITVMWYPALQT   76 (127)
Q Consensus        47 ~~~~~~i~~~~i~gN~~vl~~~~~~~~~~~   76 (127)
                      ++...+++++.++--.++++++.|+||.+.
T Consensus       311 ~IiaSiIAIvvIVLIMvIIYLILRYRRKKK  340 (353)
T TIGR01477       311 PIIASIIAILIIVLIMVIIYLILRYRRKKK  340 (353)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHhhhcch
Confidence            344555666666666677788888887553


No 32 
>COG1230 CzcD Co/Zn/Cd efflux system component [Inorganic ion transport and metabolism]
Probab=32.27  E-value=2.2e+02  Score=21.44  Aligned_cols=66  Identities=18%  Similarity=0.169  Sum_probs=39.1

Q ss_pred             HHHHHHHHHHHHHHHHHHHhhcCCCCchHHHHHHHH-HHHHHHHHh-HHHHHHHHHhcCcccccccccccc
Q psy5550          50 LSIIFILSMGANLFVLITVMWYPALQTVTNYFLLNL-TAADILFCL-SIPGIMYARVSPQWPLGDVLCKIM  118 (127)
Q Consensus        50 ~~~i~~~~i~gN~~vl~~~~~~~~~~~~~~~~l~nl-a~~Dl~~~~-~~p~~~~~~~~~~w~~g~~~C~~~  118 (127)
                      +.++.++|++-|+...+.+.+.++  .-.|.=-..| +++|.+..+ .+--.+.-.+.+ |..-++.+.+.
T Consensus       127 ml~va~~GL~vN~~~a~ll~~~~~--~~lN~r~a~LHvl~D~Lgsv~vIia~i~i~~~~-w~~~Dpi~si~  194 (296)
T COG1230         127 MLVVAIIGLVVNLVSALLLHKGHE--ENLNMRGAYLHVLGDALGSVGVIIAAIVIRFTG-WSWLDPILSIV  194 (296)
T ss_pred             hHHHHHHHHHHHHHHHHHhhCCCc--ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-CCccchHHHHH
Confidence            456688899999998888876522  1222222222 358999888 555555555554 44445555433


No 33 
>PTZ00046 rifin; Provisional
Probab=30.86  E-value=78  Score=24.50  Aligned_cols=29  Identities=14%  Similarity=0.282  Sum_probs=18.8

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHhhcCCCCc
Q psy5550          48 CLLSIIFILSMGANLFVLITVMWYPALQT   76 (127)
Q Consensus        48 ~~~~~i~~~~i~gN~~vl~~~~~~~~~~~   76 (127)
                      +...+++++.++--.++++++.|+||.+.
T Consensus       317 IiaSiiAIvVIVLIMvIIYLILRYRRKKK  345 (358)
T PTZ00046        317 IIASIVAIVVIVLIMVIIYLILRYRRKKK  345 (358)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHhhhcch
Confidence            34455566666666677788888887553


No 34 
>PF02009 Rifin_STEVOR:  Rifin/stevor family;  InterPro: IPR002858 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see []. Several multicopy gene families have been described in Plasmodium falciparum, including the stevor family of subtelomeric open reading frames and the rif interspersed repetitive elements. Both families contain three predicted transmembrane segments. It has been proposed that stevor and rif are members of a larger superfamily that code for variant surface antigens [].
Probab=29.79  E-value=53  Score=24.71  Aligned_cols=27  Identities=22%  Similarity=0.343  Sum_probs=15.7

Q ss_pred             HHHHHHHHHHHHHHHHHHHHhhcCCCC
Q psy5550          49 LLSIIFILSMGANLFVLITVMWYPALQ   75 (127)
Q Consensus        49 ~~~~i~~~~i~gN~~vl~~~~~~~~~~   75 (127)
                      ...++.++.++-=.++|++++|+||.+
T Consensus       259 ~aSiiaIliIVLIMvIIYLILRYRRKK  285 (299)
T PF02009_consen  259 IASIIAILIIVLIMVIIYLILRYRRKK  285 (299)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence            334444444554456677777777743


No 35 
>PF05393 Hum_adeno_E3A:  Human adenovirus early E3A glycoprotein;  InterPro: IPR008652 This family consists of several early glycoproteins (E3A), from human adenovirus type 2.; GO: 0016021 integral to membrane
Probab=25.24  E-value=93  Score=19.06  Aligned_cols=6  Identities=33%  Similarity=0.085  Sum_probs=3.0

Q ss_pred             HHHHhh
Q psy5550          65 LITVMW   70 (127)
Q Consensus        65 l~~~~~   70 (127)
                      +|.++.
T Consensus        50 lwfvCC   55 (94)
T PF05393_consen   50 LWFVCC   55 (94)
T ss_pred             HHHHHH
Confidence            455553


No 36 
>PF08114 PMP1_2:  ATPase proteolipid family;  InterPro: IPR012589 This family consists of small proteolipids associated with the plasma membrane H+ ATPase. Two proteolipids (PMP1 and PMP2) are associated with the ATPase and both genes are similarly expressed in the wild-type strain of yeast. No modification of the level of transcription of one PMP gene is detected in a strain deleted of the other. Though both proteolipids show similarity with other small proteolipids associated with other cation-transporting ATPases, their functions remain unclear [].
Probab=25.12  E-value=47  Score=17.25  Aligned_cols=19  Identities=5%  Similarity=0.111  Sum_probs=9.6

Q ss_pred             HHHHHHHHHHHHHHHHHhh
Q psy5550          52 IIFILSMGANLFVLITVMW   70 (127)
Q Consensus        52 ~i~~~~i~gN~~vl~~~~~   70 (127)
                      +++++|+.|-+++...++|
T Consensus        13 VF~lVglv~i~iva~~iYR   31 (43)
T PF08114_consen   13 VFCLVGLVGIGIVALFIYR   31 (43)
T ss_pred             ehHHHHHHHHHHHHHHHHH
Confidence            4445555655555444443


No 37 
>PF02101 Ocular_alb:  Ocular albinism type 1 protein;  InterPro: IPR001414 Ocular albinism type 1 (OA1) is an X-linked disorder characterised by severe impairment of visual acuity, retinal hypopigmentation and the presence of macromelanosomes. A novel transcript from the OA1 critical region is expressed in high levels in RNA samples from retina and from melanoma and encodes a potential integral membrane protein []. This protein is of unknown function but is known to bind heterotrimeric G proteins.; GO: 0016020 membrane
Probab=24.89  E-value=2.7e+02  Score=21.97  Aligned_cols=59  Identities=22%  Similarity=0.135  Sum_probs=33.1

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHhhcCC----CCch--HHHHHHHHHHHHHHHHh-HHHHHHHHH
Q psy5550          45 AEICLLSIIFILSMGANLFVLITVMWYPA----LQTV--TNYFLLNLTAADILFCL-SIPGIMYAR  103 (127)
Q Consensus        45 ~~~~~~~~i~~~~i~gN~~vl~~~~~~~~----~~~~--~~~~l~nla~~Dl~~~~-~~p~~~~~~  103 (127)
                      .+-++-+....+|+.|-++-+.--.|...    .+++  ..-.+..||++|++.++ .+-....+.
T Consensus        28 ~f~avCLgSs~l~l~gallQLlp~rr~~~~~~~~~sp~~~~rIl~~la~aDlLaclGVivRS~vWl   93 (405)
T PF02101_consen   28 AFNAVCLGSSVLSLLGALLQLLPRRRSAGPRAPARSPSSSRRILFWLAVADLLACLGVIVRSSVWL   93 (405)
T ss_pred             hhhhhHHHHHHHHHHHHHHhhccccccccccccccCCcCCchhHHHHHHHHHHhhhhHHHHhhhhh
Confidence            44445555566666665544431111100    0111  34678899999999998 666665554


No 38 
>PF06024 DUF912:  Nucleopolyhedrovirus protein of unknown function (DUF912);  InterPro: IPR009261 This entry is represented by Autographa californica nuclear polyhedrosis virus (AcMNPV), Orf78; it is a family of uncharacterised viral proteins.
Probab=24.61  E-value=84  Score=19.45  Aligned_cols=6  Identities=0%  Similarity=0.031  Sum_probs=2.5

Q ss_pred             HhhcCC
Q psy5550          68 VMWYPA   73 (127)
Q Consensus        68 ~~~~~~   73 (127)
                      +.|.|+
T Consensus        86 ILRer~   91 (101)
T PF06024_consen   86 ILRERQ   91 (101)
T ss_pred             EEeccc
Confidence            334444


No 39 
>KOG4564|consensus
Probab=24.30  E-value=3.9e+02  Score=21.66  Aligned_cols=47  Identities=30%  Similarity=0.431  Sum_probs=34.5

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHhhcCCCCchHHHHHHHHHHHHHHHHh
Q psy5550          48 CLLSIIFILSMGANLFVLITVMWYPALQTVTNYFLLNLTAADILFCL   94 (127)
Q Consensus        48 ~~~~~i~~~~i~gN~~vl~~~~~~~~~~~~~~~~l~nla~~Dl~~~~   94 (127)
                      ++|.+-.-++++.=.+.+.++...|++|...|+.=.||-++=++.++
T Consensus       149 ~lytvGyslSl~sL~vAl~If~~FR~L~CtRn~IH~nLF~SfiLra~  195 (473)
T KOG4564|consen  149 ILYTVGYSLSLVSLLVALIIFLYFRSLHCTRNYIHMNLFASFILRAA  195 (473)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHhhhhcchHHHHHHHHHHHHHHHHH
Confidence            34444455555554555667778889998899999999999888777


No 40 
>PF02468 PsbN:  Photosystem II reaction centre N protein (psbN);  InterPro: IPR003398 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection [].   This family represents the low molecular weight transmembrane protein PsbN found in PSII. PsbN may have a role in PSII stability, although its actual function is unknown. PsbN does not appear to be essential for photoautotrophic growth or normal PSII function.; GO: 0015979 photosynthesis, 0009523 photosystem II, 0009539 photosystem II reaction center, 0016020 membrane
Probab=20.83  E-value=1.2e+02  Score=15.97  Aligned_cols=31  Identities=6%  Similarity=-0.023  Sum_probs=16.5

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHhhcCCCCch
Q psy5550          47 ICLLSIIFILSMGANLFVLITVMWYPALQTV   77 (127)
Q Consensus        47 ~~~~~~i~~~~i~gN~~vl~~~~~~~~~~~~   77 (127)
                      ..+++...+++++|-.+-...=--.|.+|.|
T Consensus         7 ~~i~i~~~lv~~Tgy~iYtaFGppSk~LrDP   37 (43)
T PF02468_consen    7 LAIFISCLLVSITGYAIYTAFGPPSKELRDP   37 (43)
T ss_pred             HHHHHHHHHHHHHhhhhhheeCCCccccCCc
Confidence            4445556667777755433222235666654


No 41 
>PF10319 7TM_GPCR_Srj:  Serpentine type 7TM GPCR chemoreceptor Srj;  InterPro: IPR019423 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae.  This entry represents serpentine receptor class j (Srj) from the Str superfamily [, ]. The Srj family is designated as the out-group based on its location in preliminary phylogenetic analyses of the entire superfamily []. 
Probab=20.74  E-value=2.1e+02  Score=21.75  Aligned_cols=49  Identities=20%  Similarity=0.249  Sum_probs=36.0

Q ss_pred             HHHHHHHHHHHHHHHHHhhcCCCCc-hHHHHHHHHHHHHHHHHh---HHHHHH
Q psy5550          52 IIFILSMGANLFVLITVMWYPALQT-VTNYFLLNLTAADILFCL---SIPGIM  100 (127)
Q Consensus        52 ~i~~~~i~gN~~vl~~~~~~~~~~~-~~~~~l~nla~~Dl~~~~---~~p~~~  100 (127)
                      +.++++.+-|.+.+.++..+|+.+- .-.++++--|+-|++.++   .+|.-+
T Consensus        13 ~~~~lsf~~Np~fiyli~~~~~~~~G~Yr~LL~~Fa~fn~~~S~~~~~vp~~v   65 (310)
T PF10319_consen   13 IFGILSFIVNPIFIYLIFTEKKSQFGNYRYLLLFFAIFNLIYSVVDLLVPICV   65 (310)
T ss_pred             HHHHHHHHHhhhhheeEEcccccccccHHHHHHHHHHHHHHHHHHHHHhhhee
Confidence            3455667889999988887776653 457788889999999987   555443


No 42 
>KOG4193|consensus
Probab=20.53  E-value=3.3e+02  Score=22.79  Aligned_cols=56  Identities=14%  Similarity=0.030  Sum_probs=28.7

Q ss_pred             HHHHHHhhcCCCCchHHHHHHHHHHHHHHHHhHHHHHHHHHhcCccccccc--ccccchhhhhhc
Q psy5550          63 FVLITVMWYPALQTVTNYFLLNLTAADILFCLSIPGIMYARVSPQWPLGDV--LCKIMPYSQMCS  125 (127)
Q Consensus        63 ~vl~~~~~~~~~~~~~~~~l~nla~~Dl~~~~~~p~~~~~~~~~~w~~g~~--~C~~~~~~~~~~  125 (127)
                      +.+++....|++|+-.+....||+++=++.-       ...+.+.|..+..  .|+..+.+.+..
T Consensus       339 lti~ty~~~~~l~~~~~~i~~~l~~~L~l~~-------l~fL~~~~~~~~~~~~C~~~a~llhff  396 (610)
T KOG4193|consen  339 LTIATYLLFRKLQNDRTKIHINLCLCLFLAE-------LLFLLGIDRTSTSVVLCIAAAILLHFF  396 (610)
T ss_pred             HHHHHHHHHHHHHhhcchhHHHHHHHHHHHH-------HHHhcccccccCcccccHHHHHHHHHH
Confidence            3344444444444444777778888722211       1122234443333  599888766543


No 43 
>CHL00024 psbI photosystem II protein I
Probab=20.49  E-value=50  Score=16.56  Aligned_cols=17  Identities=18%  Similarity=0.397  Sum_probs=10.0

Q ss_pred             HHHHHHHHHHHHHHHHH
Q psy5550          45 AEICLLSIIFILSMGAN   61 (127)
Q Consensus        45 ~~~~~~~~i~~~~i~gN   61 (127)
                      ....+++.++++|.+.|
T Consensus        10 ~vV~ffvsLFifGFlsn   26 (36)
T CHL00024         10 TVVIFFVSLFIFGFLSN   26 (36)
T ss_pred             hHHHHHHHHHHccccCC
Confidence            34455666676676655


No 44 
>PHA03164 hypothetical protein; Provisional
Probab=20.34  E-value=1e+02  Score=18.34  Aligned_cols=21  Identities=38%  Similarity=0.388  Sum_probs=14.0

Q ss_pred             CCchHHHHHHHHHHHHHHHHh
Q psy5550          74 LQTVTNYFLLNLTAADILFCL   94 (127)
Q Consensus        74 ~~~~~~~~l~nla~~Dl~~~~   94 (127)
                      +|+.+.+.+..||++-+++..
T Consensus        55 RktftFlvLtgLaIamILfii   75 (88)
T PHA03164         55 RKTFTFLVLTGLAIAMILFII   75 (88)
T ss_pred             hheeehHHHHHHHHHHHHHHH
Confidence            356667777778777666554


Done!