Query         psy10964
Match_columns 103
No_of_seqs    184 out of 1277
Neff          8.5 
Searched_HMMs 46136
Date          Fri Aug 16 22:58:01 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy10964.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/10964hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG4219|consensus               99.2 8.5E-12 1.8E-16   91.0   1.9   60   43-102    29-88  (423)
  2 PHA03234 DNA packaging protein  99.1 4.1E-10 8.8E-15   81.3   7.1   53   49-101    32-86  (338)
  3 PHA02834 chemokine receptor-li  98.7 8.4E-08 1.8E-12   68.6   7.4   49   50-99     29-77  (323)
  4 PHA02638 CC chemokine receptor  98.7 1.1E-07 2.4E-12   70.5   7.6   50   49-99     98-147 (417)
  5 PHA03235 DNA packaging protein  98.6 2.2E-07 4.7E-12   68.9   8.5   51   49-99     32-84  (409)
  6 PHA03087 G protein-coupled che  98.4 4.2E-07 9.2E-12   64.9   4.2   51   49-100    40-90  (335)
  7 KOG4220|consensus               98.4 1.1E-08 2.4E-13   75.4  -4.2   54   49-102    30-83  (503)
  8 PF00001 7tm_1:  7 transmembran  98.1 3.8E-06 8.3E-11   56.1   4.2   36   66-101     1-36  (257)
  9 PF10320 7TM_GPCR_Srsx:  Serpen  98.0 2.2E-06 4.7E-11   59.8   0.8   40   61-100     2-41  (257)
 10 KOG2087|consensus               95.8  0.0024 5.3E-08   46.7  -0.0   45   56-100    31-75  (363)
 11 PF10328 7TM_GPCR_Srx:  Serpent  95.3   0.038 8.3E-07   38.6   4.7   43   59-101     3-45  (274)
 12 PF05296 TAS2R:  Mammalian tast  95.3    0.17 3.7E-06   36.3   8.0   52   49-100     6-60  (303)
 13 PF10324 7TM_GPCR_Srw:  Serpent  94.8   0.043 9.4E-07   38.9   3.9   41   59-100     6-47  (318)
 14 PF05462 Dicty_CAR:  Slime mold  91.9    0.83 1.8E-05   33.0   6.4   49   51-99      8-56  (303)
 15 PF10321 7TM_GPCR_Srt:  Serpent  90.9     1.3 2.9E-05   32.0   6.6   52   50-101    33-84  (313)
 16 PF10317 7TM_GPCR_Srd:  Serpent  88.8    0.86 1.9E-05   32.2   4.2   47   55-101     4-51  (292)
 17 PF11710 Git3:  G protein-coupl  84.2     4.7  0.0001   27.2   5.7   23   78-100    30-52  (201)
 18 PF10323 7TM_GPCR_Srv:  Serpent  51.8      15 0.00033   25.9   2.6   22   80-101    29-50  (283)
 19 PF00002 7tm_2:  7 transmembran  47.1      15 0.00033   24.8   1.9   37   63-99     14-50  (242)
 20 PF11446 DUF2897:  Protein of u  31.5      73  0.0016   17.1   2.6   14   62-75     14-27  (55)
 21 PHA03164 hypothetical protein;  31.1      66  0.0014   18.6   2.5   18   82-99     57-74  (88)
 22 PF02532 PsbI:  Photosystem II   30.4      65  0.0014   15.8   2.1   16   52-67     11-26  (36)
 23 TIGR01477 RIFIN variant surfac  29.5 1.1E+02  0.0023   23.0   3.9   29   53-81    311-339 (353)
 24 PF08048 RepA1_leader:  Tap Rep  28.5      51  0.0011   14.6   1.4   14   80-93      2-15  (25)
 25 PF05393 Hum_adeno_E3A:  Human   27.6      84  0.0018   18.7   2.6   17   69-85     48-66  (94)
 26 PTZ00046 rifin; Provisional     27.5 1.2E+02  0.0026   22.7   4.0   28   54-81    317-344 (358)
 27 PF10326 7TM_GPCR_Str:  Serpent  27.0      52  0.0011   23.1   2.0   44   57-100     6-50  (307)
 28 PF01102 Glycophorin_A:  Glycop  27.0 1.2E+02  0.0025   19.1   3.4   12   63-74     76-87  (122)
 29 KOG3850|consensus               26.9 2.4E+02  0.0053   21.6   5.4   41   50-93    395-435 (455)
 30 PF10873 DUF2668:  Protein of u  26.1      90  0.0019   20.4   2.7   29   51-79     63-91  (155)
 31 PF15330 SIT:  SHP2-interacting  25.6 1.7E+02  0.0036   17.9   3.9   25   53-77      4-28  (107)
 32 TIGR03475 tap_IncFII_lead RepA  24.8      64  0.0014   14.5   1.3   13   80-92      2-14  (26)
 33 PF10316 7TM_GPCR_Srbc:  Serpen  24.1 2.2E+02  0.0048   20.2   4.7   47   54-100    10-56  (273)
 34 PF02009 Rifin_STEVOR:  Rifin/s  23.8      99  0.0021   22.5   2.9   26   54-79    258-283 (299)
 35 PF10327 7TM_GPCR_Sri:  Serpent  23.5 1.8E+02   0.004   20.7   4.3   50   52-101    11-64  (303)
 36 PF08114 PMP1_2:  ATPase proteo  23.3      76  0.0016   16.1   1.6   18   59-76     14-31  (43)

No 1  
>KOG4219|consensus
Probab=99.17  E-value=8.5e-12  Score=91.02  Aligned_cols=60  Identities=28%  Similarity=0.431  Sum_probs=55.1

Q ss_pred             CCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHHHhhc
Q psy10964         43 RLHMSVLLPITIVYFGIFVTGVIGNIAVCVVIINNTSLHTATNYYLFSLAVSDLTLLLLG  102 (103)
Q Consensus        43 ~~~~~~~~~~~~~~~~i~~~~~~gN~~vl~~~~~~~~l~~~~~~fl~nLa~~Dll~~~~~  102 (103)
                      ..|.+++.++.++|..+..++++||++|+|++..+|++|+.+|+||+|||+||++.++|+
T Consensus        29 ~lp~~~~~~wai~yg~l~~vAv~GN~iVlwIil~hrrMRtvtnyfL~NLAfADl~~s~Fn   88 (423)
T KOG4219|consen   29 VLPAWQQALWAIAYGLLVFVAVVGNLIVLWIILAHRRMRTVTNYFLVNLAFADLSMSIFN   88 (423)
T ss_pred             cCCHHHHHHHHHHHHHHHHHHHhcCceEEEEEeehhehhhhHHHHHHHHHHHHHHHHHHh
Confidence            345567788999999999999999999999999999999999999999999999999886


No 2  
>PHA03234 DNA packaging protein UL33; Provisional
Probab=99.08  E-value=4.1e-10  Score=81.34  Aligned_cols=53  Identities=21%  Similarity=0.221  Sum_probs=45.5

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHH--HhcCCCCCHHHHHHHHHHHHHHHHHhh
Q psy10964         49 LLPITIVYFGIFVTGVIGNIAVCVVI--INNTSLHTATNYYLFSLAVSDLTLLLL  101 (103)
Q Consensus        49 ~~~~~~~~~~i~~~~~~gN~~vl~~~--~~~~~l~~~~~~fl~nLa~~Dll~~~~  101 (103)
                      +....++|.+++++|++||.+|+++.  .+++++|+++|+|+.|||++|+++++.
T Consensus        32 ~~~~~~~y~~vf~~gl~gN~lvl~v~~~~~~~~~rt~tn~fi~NLAvaDLL~~l~   86 (338)
T PHA03234         32 QILESAINGIMLTLIIPMIIIVICTLIIYHKVAKHNATSFYLITLFASDFLHMLC   86 (338)
T ss_pred             HHHhhHHHHHHHHHHhhhHHHHHHHHHHHhccccccHHHHHHHHHHHHHHHHHHH
Confidence            46677899999999999999999854  455667999999999999999998653


No 3  
>PHA02834 chemokine receptor-like protein; Provisional
Probab=98.69  E-value=8.4e-08  Score=68.65  Aligned_cols=49  Identities=35%  Similarity=0.614  Sum_probs=42.1

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHHH
Q psy10964         50 LPITIVYFGIFVTGVIGNIAVCVVIINNTSLHTATNYYLFSLAVSDLTLL   99 (103)
Q Consensus        50 ~~~~~~~~~i~~~~~~gN~~vl~~~~~~~~l~~~~~~fl~nLa~~Dll~~   99 (103)
                      .+..+++.+++++|++||+++++++.++++ ++++|+|+.|||++|++..
T Consensus        29 ~~~~~~~~li~v~~~~gN~lVi~vi~~~~~-~~~~n~~i~nLAiaDll~~   77 (323)
T PHA02834         29 YFVIVFYILLFIFGLIGNVLVIAVLIVKRF-MFVVDVYLFNIAMSDLMLV   77 (323)
T ss_pred             hhHHHHHHHHHHHHHhhHHHHHHHHHhccc-cchhhhhhHHHHHHHHHHH
Confidence            556788999999999999999998877555 5678999999999998763


No 4  
>PHA02638 CC chemokine receptor-like protein; Provisional
Probab=98.66  E-value=1.1e-07  Score=70.53  Aligned_cols=50  Identities=36%  Similarity=0.659  Sum_probs=43.7

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHHH
Q psy10964         49 LLPITIVYFGIFVTGVIGNIAVCVVIINNTSLHTATNYYLFSLAVSDLTLL   99 (103)
Q Consensus        49 ~~~~~~~~~~i~~~~~~gN~~vl~~~~~~~~l~~~~~~fl~nLa~~Dll~~   99 (103)
                      .....++|.+++++|++||.++++++. +|++|+++++++.|||++|+++.
T Consensus        98 ~~~l~~~y~lvfvlgliGN~LVl~il~-~k~lrt~t~i~llnLAisDLl~~  147 (417)
T PHA02638         98 SEYIKIFYIIIFILGLFGNAAIIMILF-CKKIKTITDIYIFNLAISDLIFV  147 (417)
T ss_pred             hhHHHHHHHHHHHHHHHHHHHHHHHHH-hccCCCHhHHHHHHHHHHHHHHH
Confidence            355678899999999999999998765 47889999999999999999874


No 5  
>PHA03235 DNA packaging protein UL33; Provisional
Probab=98.63  E-value=2.2e-07  Score=68.86  Aligned_cols=51  Identities=18%  Similarity=0.197  Sum_probs=40.6

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC--CCHHHHHHHHHHHHHHHHH
Q psy10964         49 LLPITIVYFGIFVTGVIGNIAVCVVIINNTSL--HTATNYYLFSLAVSDLTLL   99 (103)
Q Consensus        49 ~~~~~~~~~~i~~~~~~gN~~vl~~~~~~~~l--~~~~~~fl~nLa~~Dll~~   99 (103)
                      .....+++.+++++|++||+++++++.++++.  ++..++|+.|||++|++..
T Consensus        32 ~~~~~~~~~li~vvGiigN~lVL~~~~~~~r~~~~~~~~~~I~NLAvsDLl~l   84 (409)
T PHA03235         32 RTTETFINLLIISVGGPLNLIVLVTQLLANRVHGFSTPTLYMTNLYLANLLTV   84 (409)
T ss_pred             HhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCCccHHHHHHHHHHHHHHH
Confidence            45677889999999999999999986543332  3566789999999999863


No 6  
>PHA03087 G protein-coupled chemokine receptor-like protein; Provisional
Probab=98.38  E-value=4.2e-07  Score=64.90  Aligned_cols=51  Identities=37%  Similarity=0.632  Sum_probs=45.2

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHHHh
Q psy10964         49 LLPITIVYFGIFVTGVIGNIAVCVVIINNTSLHTATNYYLFSLAVSDLTLLL  100 (103)
Q Consensus        49 ~~~~~~~~~~i~~~~~~gN~~vl~~~~~~~~l~~~~~~fl~nLa~~Dll~~~  100 (103)
                      +....+++.+++++|++||+++++++.++ ++|++.|+++.|||++|++.++
T Consensus        40 ~~~~~~~~~~i~~~gl~gN~lvl~~~~~~-~~~~~~~~ll~~laisDll~~~   90 (335)
T PHA03087         40 STILIVVYSTIFFFGLVGNIIVIYVLTKT-KIKTPMDIYLLNLAVSDLLFVM   90 (335)
T ss_pred             hhHHHHHHHHHHHHHHHhhHhEEeeehhc-cccCchHHHHHHHHHHHHHHHH
Confidence            45677889999999999999999988877 8899999999999999987654


No 7  
>KOG4220|consensus
Probab=98.36  E-value=1.1e-08  Score=75.38  Aligned_cols=54  Identities=33%  Similarity=0.503  Sum_probs=48.7

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHHHhhc
Q psy10964         49 LLPITIVYFGIFVTGVIGNIAVCVVIINNTSLHTATNYYLFSLAVSDLTLLLLG  102 (103)
Q Consensus        49 ~~~~~~~~~~i~~~~~~gN~~vl~~~~~~~~l~~~~~~fl~nLa~~Dll~~~~~  102 (103)
                      ...+.++...+.+++++||++|++.+..+++|++..|+||++||+||++.+.|+
T Consensus        30 ~v~i~~v~~~lsLVTv~GNlLVmiSfKvnrqLqTVnNYfLfSLAcADliIG~~S   83 (503)
T KOG4220|consen   30 VVFIVVVTGSLSLVTVVGNLLVMISFKVNRQLQTVNNYFLFSLACADLIIGAFS   83 (503)
T ss_pred             EEeeehhhhHHHHHhhhccEEEEEEEEecceeeeecceeehHHHHhhhhhheee
Confidence            355667889999999999999999999999999999999999999999988764


No 8  
>PF00001 7tm_1:  7 transmembrane receptor (rhodopsin family) Rhodopsin-like GPCR superfamily signature 5-hydroxytryptamine 7 receptor signature bradykinin receptor signature gastrin receptor signature melatonin receptor signature olfactory receptor signature;  InterPro: IPR000276 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The rhodopsin-like GPCRs themselves represent a widespread protein family that includes hormone, neurotransmitter and light receptors, all of which transduce extracellular signals through interaction with guanine nucleotide-binding (G) proteins. Although their activating ligands vary widely in structure and character, the amino acid sequences of the receptors are very similar and are believed to adopt a common structural framework comprising 7 transmembrane (TM) helices [, , ].; GO: 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane; PDB: 2KI9_A 3QAK_A 2YDV_A 3VGA_A 3PWH_A 3RFM_A 3EML_A 3VG9_A 3REY_A 3UZA_A ....
Probab=98.12  E-value=3.8e-06  Score=56.12  Aligned_cols=36  Identities=44%  Similarity=0.641  Sum_probs=32.9

Q ss_pred             HHHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHHHhh
Q psy10964         66 GNIAVCVVIINNTSLHTATNYYLFSLAVSDLTLLLL  101 (103)
Q Consensus        66 gN~~vl~~~~~~~~l~~~~~~fl~nLa~~Dll~~~~  101 (103)
                      ||.++++++.++|++|++.++|+.|||++|++.+++
T Consensus         1 GN~lvi~~~~~~~~~~~~~~~~l~~Lav~Dll~~~~   36 (257)
T PF00001_consen    1 GNILVILVILRSKRLRTPSNILLLNLAVADLLVGLF   36 (257)
T ss_dssp             HHHHHHHHHHHSGGG-SHHHHHHHHHHHHHHHHHHT
T ss_pred             CchhehhhhhhhccCCChhHHHHHHHHHHHHhhccc
Confidence            899999999999999999999999999999998765


No 9  
>PF10320 7TM_GPCR_Srsx:  Serpentine type 7TM GPCR chemoreceptor Srsx;  InterPro: IPR019424 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class sx (Srsx), which is a solo family amongst the superfamilies of chemoreceptors. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. 
Probab=97.97  E-value=2.2e-06  Score=59.82  Aligned_cols=40  Identities=30%  Similarity=0.544  Sum_probs=36.7

Q ss_pred             HHHHHHHHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHHHh
Q psy10964         61 VTGVIGNIAVCVVIINNTSLHTATNYYLFSLAVSDLTLLL  100 (103)
Q Consensus        61 ~~~~~gN~~vl~~~~~~~~l~~~~~~fl~nLa~~Dll~~~  100 (103)
                      ++|+.||..++.++.++|++|+|+++++..+|++|++...
T Consensus         2 ~ig~~gN~~~i~~~~~~~~Lrs~~~~li~~~~~~d~~~~~   41 (257)
T PF10320_consen    2 IIGLFGNLLLIILIFRNKSLRSPCYILICILCFADLICLL   41 (257)
T ss_pred             EEEEEccHHHHHHHHhccccccchHHHHHHHHHHHHHHHh
Confidence            4678999999999999999999999999999999998753


No 10 
>KOG2087|consensus
Probab=95.79  E-value=0.0024  Score=46.72  Aligned_cols=45  Identities=18%  Similarity=0.318  Sum_probs=36.7

Q ss_pred             HHHHHHHHHHHHHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHHHh
Q psy10964         56 YFGIFVTGVIGNIAVCVVIINNTSLHTATNYYLFSLAVSDLTLLL  100 (103)
Q Consensus        56 ~~~i~~~~~~gN~~vl~~~~~~~~l~~~~~~fl~nLa~~Dll~~~  100 (103)
                      --++..+++.||.+|++.....+...++..+++.|||++|++.++
T Consensus        31 vW~i~~lAi~gN~~Vl~~~~~~~~~~~~~~~li~~la~ad~~mGi   75 (363)
T KOG2087|consen   31 VWVIALLAIVGNLLVLLTRFTSRYELNSHRFLICNLAFADLLMGI   75 (363)
T ss_pred             hhhhhhHHhccCeeeeeeeeehhhhccchHHHHHHHHHHHHHcch
Confidence            345667788999999988776666667889999999999998865


No 11 
>PF10328 7TM_GPCR_Srx:  Serpentine type 7TM GPCR chemoreceptor Srx;  InterPro: IPR019430 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class x (Srx) from the Srg superfamily [, ]. Srg receptors contain seven hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures. 
Probab=95.34  E-value=0.038  Score=38.62  Aligned_cols=43  Identities=26%  Similarity=0.368  Sum_probs=38.6

Q ss_pred             HHHHHHHHHHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHHHhh
Q psy10964         59 IFVTGVIGNIAVCVVIINNTSLHTATNYYLFSLAVSDLTLLLL  101 (103)
Q Consensus        59 i~~~~~~gN~~vl~~~~~~~~l~~~~~~fl~nLa~~Dll~~~~  101 (103)
                      +.+.|++.|.++++.+.+.+++|++.+..-.+.|++|.+.+..
T Consensus         3 ~s~~G~~~N~~v~~~~~~~~~~~~sF~~l~~~~a~~n~i~~~~   45 (274)
T PF10328_consen    3 ISIIGIILNWLVFIIIFKLKSLRNSFGILCASQAIANIIICLI   45 (274)
T ss_pred             eeHHHHHHHHHHHHHHHhcccccCCHHHHHHHHHHHHHHHHHH
Confidence            4567999999999999999999999999999999999988763


No 12 
>PF05296 TAS2R:  Mammalian taste receptor protein (TAS2R);  InterPro: IPR007960 This family consists of several forms of mammalian taste receptor proteins (TAS2Rs). TAS2Rs are G protein-coupled receptors expressed in subsets of taste receptor cells of the tongue and palate epithelia and are organised in the genome in clusters. The proteins are genetically linked to loci that influence bitter perception in mice and humans [].; GO: 0004930 G-protein coupled receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0050909 sensory perception of taste, 0016021 integral to membrane
Probab=95.33  E-value=0.17  Score=36.27  Aligned_cols=52  Identities=23%  Similarity=0.296  Sum_probs=40.2

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHH---hcCCCCCHHHHHHHHHHHHHHHHHh
Q psy10964         49 LLPITIVYFGIFVTGVIGNIAVCVVII---NNTSLHTATNYYLFSLAVSDLTLLL  100 (103)
Q Consensus        49 ~~~~~~~~~~i~~~~~~gN~~vl~~~~---~~~~l~~~~~~fl~nLa~~Dll~~~  100 (103)
                      +....++....+++|+.||+.++.+..   .++|.-.|.+..+.+||++.++.-.
T Consensus         6 ~~i~~~i~~~~~~~Gi~~N~FI~~vn~~~w~k~~~l~~~d~IL~~La~sr~~l~~   60 (303)
T PF05296_consen    6 EIIFLIILVVEFIIGILGNGFIVLVNCSDWVKSRKLSPSDQILTSLAISRILLQW   60 (303)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHcCCCCChHHHHHHHHHHHHHHHHH
Confidence            355677888999999999999886655   3334457999999999999877543


No 13 
>PF10324 7TM_GPCR_Srw:  Serpentine type 7TM GPCR chemoreceptor Srw;  InterPro: IPR019427 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class w (Srw), which is a solo family amongst the superfamilies of chemoreceptors. The genes encoding Srw do not appear to be under as strong an adaptive evolutionary pressure as those of Srz []. 
Probab=94.83  E-value=0.043  Score=38.94  Aligned_cols=41  Identities=27%  Similarity=0.453  Sum_probs=33.7

Q ss_pred             HHHHHHHHHHHHHHHHHhcCCCCC-HHHHHHHHHHHHHHHHHh
Q psy10964         59 IFVTGVIGNIAVCVVIINNTSLHT-ATNYYLFSLAVSDLTLLL  100 (103)
Q Consensus        59 i~~~~~~gN~~vl~~~~~~~~l~~-~~~~fl~nLa~~Dll~~~  100 (103)
                      +.++|+++|.+-+.+.. +|.+|+ +.|.++...|++|+...+
T Consensus         6 ~~~~g~~~N~~h~~VLt-rk~mR~~~in~~l~~Iai~Dl~~~~   47 (318)
T PF10324_consen    6 LSIFGLFINIFHLIVLT-RKSMRSSSINILLIGIAICDLLYML   47 (318)
T ss_pred             EeHHHHHHHHHHhhhcC-ChhhhcCCHHHHHHHHHHHHHHHHH
Confidence            35678999999887664 567886 899999999999998765


No 14 
>PF05462 Dicty_CAR:  Slime mold cyclic AMP receptor
Probab=91.94  E-value=0.83  Score=32.96  Aligned_cols=49  Identities=14%  Similarity=0.213  Sum_probs=40.9

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHHH
Q psy10964         51 PITIVYFGIFVTGVIGNIAVCVVIINNTSLHTATNYYLFSLAVSDLTLL   99 (103)
Q Consensus        51 ~~~~~~~~i~~~~~~gN~~vl~~~~~~~~l~~~~~~fl~nLa~~Dll~~   99 (103)
                      .+.++......++++|-++++....+.|++|+|.+=.+.-++++|++..
T Consensus         8 ~~~~i~~~~s~lSllGclfiI~tf~~~k~~r~~~~rli~yl~~~~ll~~   56 (303)
T PF05462_consen    8 TLYAIELVASVLSLLGCLFIIITFCLFKRLRKPINRLIFYLSIANLLTN   56 (303)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCccHHHHHHHHHHHHHHHH
Confidence            3455666777888999999999999999999999999889999988754


No 15 
>PF10321 7TM_GPCR_Srt:  Serpentine type 7TM GPCR chemoreceptor Srt;  InterPro: IPR019425  Chemoreception is mediated in Caenorhabditis elegans by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs) of proteins which are of the serpentine type []. Srt is a member of the Srg superfamily of chemoreceptors. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. 
Probab=90.90  E-value=1.3  Score=32.04  Aligned_cols=52  Identities=17%  Similarity=0.221  Sum_probs=39.3

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHHHhh
Q psy10964         50 LPITIVYFGIFVTGVIGNIAVCVVIINNTSLHTATNYYLFSLAVSDLTLLLL  101 (103)
Q Consensus        50 ~~~~~~~~~i~~~~~~gN~~vl~~~~~~~~l~~~~~~fl~nLa~~Dll~~~~  101 (103)
                      +.+.+.+....++..+-....+.++.+++..|.|.|-.+.-||+.|++...+
T Consensus        33 p~~G~~~~~~g~~~~~lY~p~~~~i~~~~~~k~~~ykiM~~L~i~Di~~l~~   84 (313)
T PF10321_consen   33 PILGIYFLIFGIIIIILYIPCLIAIFKKKLFKMSCYKIMFFLAIFDIIQLFI   84 (313)
T ss_pred             cchhHHHHHHHHHHHHHHHHHHHHHHHhccccCcHHHHHHHHHHHHHHHHHh
Confidence            4445555555566666777778888877777899999999999999987653


No 16 
>PF10317 7TM_GPCR_Srd:  Serpentine type 7TM GPCR chemoreceptor Srd;  InterPro: IPR019421 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents the chemoreceptor Srd []. 
Probab=88.79  E-value=0.86  Score=32.24  Aligned_cols=47  Identities=26%  Similarity=0.256  Sum_probs=36.8

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHhcC-CCCCHHHHHHHHHHHHHHHHHhh
Q psy10964         55 VYFGIFVTGVIGNIAVCVVIINNT-SLHTATNYYLFSLAVSDLTLLLL  101 (103)
Q Consensus        55 ~~~~i~~~~~~gN~~vl~~~~~~~-~l~~~~~~fl~nLa~~Dll~~~~  101 (103)
                      .+...+..|+..|.+++..+.++. +.-+..-+++.|-|+.|++.+..
T Consensus         4 ~~~~~~~~~~~~n~~Ll~~i~~~tp~~l~~~~~~l~~~~~~~~~~~~~   51 (292)
T PF10317_consen    4 YHPIFFILGIILNILLLYLIIFKTPKSLRTYSILLLNTAIFDLISIIS   51 (292)
T ss_pred             eHHHHHHHHHHHHHHHHHHHHHhChHHHHHHHHHHHHHHHHHHHHHHH
Confidence            456778899999999988776543 43456788999999999988764


No 17 
>PF11710 Git3:  G protein-coupled glucose receptor regulating Gpa2;  InterPro: IPR023041 This entry contains a functionally uncharacterised region belonging to the Git3 G-protein coupled receptor. Git3 is one of six proteins required for glucose-triggered adenylate cyclase activation, and is a G protein-coupled receptor responsible for the activation of adenylate cyclase through Gpa2 - heterotrimeric G protein alpha subunit, part of the glucose-detection pathway. Git3 contains seven predicted transmembrane domains, a third cytoplasmic loop and a cytoplasmic tail []. This is the conserved N-terminal domain of the member proteins. 
Probab=84.19  E-value=4.7  Score=27.25  Aligned_cols=23  Identities=13%  Similarity=0.197  Sum_probs=18.6

Q ss_pred             CCCCCHHHHHHHHHHHHHHHHHh
Q psy10964         78 TSLHTATNYYLFSLAVSDLTLLL  100 (103)
Q Consensus        78 ~~l~~~~~~fl~nLa~~Dll~~~  100 (103)
                      ++.|...+-++.||.++|++.++
T Consensus        30 ~r~~~fR~~LIl~L~~aD~~qal   52 (201)
T PF11710_consen   30 YRRRSFRHQLILNLLLADFIQAL   52 (201)
T ss_pred             hhhhhHHHHHHHHHHHHHHHHHH
Confidence            44567777799999999998775


No 18 
>PF10323 7TM_GPCR_Srv:  Serpentine type 7TM GPCR chemoreceptor Srv;  InterPro: IPR019426 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae.  This entry represents serpentine receptor class v (Srv) from the Srg superfamily [, ]. Srg receptors contain seven hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures. 
Probab=51.76  E-value=15  Score=25.89  Aligned_cols=22  Identities=14%  Similarity=0.306  Sum_probs=18.7

Q ss_pred             CCCHHHHHHHHHHHHHHHHHhh
Q psy10964         80 LHTATNYYLFSLAVSDLTLLLL  101 (103)
Q Consensus        80 l~~~~~~fl~nLa~~Dll~~~~  101 (103)
                      .+++.|-.+.+-+++|++....
T Consensus        29 ~~~~Fy~l~~~~~iaDi~~~~~   50 (283)
T PF10323_consen   29 FKSTFYTLLIQHCIADILSMLF   50 (283)
T ss_pred             cCCHHHHHHHHHHHHHHHHHHH
Confidence            5689999999999999987653


No 19 
>PF00002 7tm_2:  7 transmembrane receptor (Secretin family);  InterPro: IPR000832 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The secretin-like GPCRs include secretin [], calcitonin [], parathyroid hormone/parathyroid hormone-related peptides [] and vasoactive intestinal peptide [], all of which activate adenylyl cyclase and the phosphatidyl-inositol-calcium pathway. These receptors contain seven transmembrane regions, in a manner reminiscent of the rhodopsins and other receptors believed to interact with G-proteins (however there is no significant sequence identity between these families, the secretin-like receptors thus bear their own unique '7TM' signature). Their N terminus is probably located on the extracellular side of the membrane and potentially glycosylated. This N-terminal region contains a long conserved region which allow the binding of large peptidic ligand such as glucagon, secretin, VIP and PACAP; this region contains five conserved cysteines residues which could be involved in disulphide bond. The C-terminal region of these receptor is probably cytoplasmic. Every receptor gene in this family is encoded on multiple exons, and several of these genes are alternatively spliced to yield functionally distinct products. ; GO: 0004930 G-protein coupled receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane; PDB: 3L2J_A 1BL1_A.
Probab=47.13  E-value=15  Score=24.77  Aligned_cols=37  Identities=16%  Similarity=0.098  Sum_probs=0.0

Q ss_pred             HHHHHHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHHH
Q psy10964         63 GVIGNIAVCVVIINNTSLHTATNYYLFSLAVSDLTLL   99 (103)
Q Consensus        63 ~~~gN~~vl~~~~~~~~l~~~~~~fl~nLa~~Dll~~   99 (103)
                      ++++-++.+......|++|+..+....||++++++..
T Consensus        14 Si~~ll~~i~~~~~~r~lr~~~~~i~~~l~~sll~~~   50 (242)
T PF00002_consen   14 SIICLLLTIITYLLFRKLRSFRNKIHLNLCLSLLLAN   50 (242)
T ss_dssp             -------------------------------------
T ss_pred             HHHHHHHHHHHHHHHHhhcccchhhhhhhHHHHHHHH
Confidence            3334344444444456777777788889998877653


No 20 
>PF11446 DUF2897:  Protein of unknown function (DUF2897);  InterPro: IPR021550  This is a bacterial family of uncharacterised proteins. 
Probab=31.49  E-value=73  Score=17.10  Aligned_cols=14  Identities=36%  Similarity=0.304  Sum_probs=8.9

Q ss_pred             HHHHHHHHHHHHHH
Q psy10964         62 TGVIGNIAVCVVII   75 (103)
Q Consensus        62 ~~~~gN~~vl~~~~   75 (103)
                      --++||+.++--..
T Consensus        14 gvIigNia~LK~sA   27 (55)
T PF11446_consen   14 GVIIGNIAALKYSA   27 (55)
T ss_pred             HHHHhHHHHHHHhc
Confidence            34579988775443


No 21 
>PHA03164 hypothetical protein; Provisional
Probab=31.10  E-value=66  Score=18.63  Aligned_cols=18  Identities=28%  Similarity=0.272  Sum_probs=8.1

Q ss_pred             CHHHHHHHHHHHHHHHHH
Q psy10964         82 TATNYYLFSLAVSDLTLL   99 (103)
Q Consensus        82 ~~~~~fl~nLa~~Dll~~   99 (103)
                      +...+-+..||++-+++.
T Consensus        57 tftFlvLtgLaIamILfi   74 (88)
T PHA03164         57 TFTFLVLTGLAIAMILFI   74 (88)
T ss_pred             eeehHHHHHHHHHHHHHH
Confidence            334444444554444443


No 22 
>PF02532 PsbI:  Photosystem II reaction centre I protein (PSII 4.8 kDa protein);  InterPro: IPR003686 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection [].  This family represents the low molecular weight transmembrane protein PsbI, which is tightly associated with the D1/D2 heterodimer in PSII. 
The function of PsbI is unknown, but it may be involved in the assembly, dimerisation or stabilisation of PSII dimers [].; GO: 0015979 photosynthesis, 0009523 photosystem II, 0009539 photosystem II reaction center, 0016020 membrane; PDB: 3A0H_i 3ARC_I 3A0B_i 3BZ2_I 3PRQ_I 3KZI_I 3PRR_I 2AXT_i 4FBY_I 1S5L_i ....
Probab=30.40  E-value=65  Score=15.78  Aligned_cols=16  Identities=25%  Similarity=0.775  Sum_probs=8.7

Q ss_pred             HHHHHHHHHHHHHHHH
Q psy10964         52 ITIVYFGIFVTGVIGN   67 (103)
Q Consensus        52 ~~~~~~~i~~~~~~gN   67 (103)
                      +...+..+|+.|.+.|
T Consensus        11 vV~ffv~LFifGflsn   26 (36)
T PF02532_consen   11 VVIFFVSLFIFGFLSN   26 (36)
T ss_dssp             HHHHHHHHHHHHHHTT
T ss_pred             hHHHHHHHHhccccCC
Confidence            3345556666665544


No 23 
>TIGR01477 RIFIN variant surface antigen, rifin family. This model represents the rifin branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of rifin sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 20 bits.
Probab=29.52  E-value=1.1e+02  Score=22.97  Aligned_cols=29  Identities=10%  Similarity=0.057  Sum_probs=19.9

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHhcCCCC
Q psy10964         53 TIVYFGIFVTGVIGNIAVCVVIINNTSLH   81 (103)
Q Consensus        53 ~~~~~~i~~~~~~gN~~vl~~~~~~~~l~   81 (103)
                      .++++++.++.++--.++++.+.|+||.+
T Consensus       311 ~IiaSiIAIvvIVLIMvIIYLILRYRRKK  339 (353)
T TIGR01477       311 PIIASIIAILIIVLIMVIIYLILRYRRKK  339 (353)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHhhhcc
Confidence            34566666666677778888888777643


No 24 
>PF08048 RepA1_leader:  Tap RepA1 leader peptide;  InterPro: IPR012605 This entry represents the RepA1 leader peptide, known as Tap, found in IncFII plasmids. The frequency of replication of IncFII plasmid NR1 during the cell division cycle is regulated by the control of the synthesis of the plasmid-specific replication initiation protein (RepA1). When RepA1 is synthesised, it binds to the plasmid replication origin (ori) and effects the assembly of a replication complex composed of host proteins that mediate the replication of the plasmid [, ]. The tap gene encodes a 24-amino acid peptide whose translation is required for the translation of repA.
Probab=28.52  E-value=51  Score=14.64  Aligned_cols=14  Identities=21%  Similarity=0.384  Sum_probs=10.2

Q ss_pred             CCCHHHHHHHHHHH
Q psy10964         80 LHTATNYYLFSLAV   93 (103)
Q Consensus        80 l~~~~~~fl~nLa~   93 (103)
                      +|...++|++++-+
T Consensus         2 lrK~Q~~FLc~lLL   15 (25)
T PF08048_consen    2 LRKVQYLFLCHLLL   15 (25)
T ss_pred             chhHHHHHHHHHHh
Confidence            36677889988754


No 25 
>PF05393 Hum_adeno_E3A:  Human adenovirus early E3A glycoprotein;  InterPro: IPR008652 This family consists of several early glycoproteins (E3A) from human adenovirus type 2.; GO: 0016021 integral to membrane
Probab=27.64  E-value=84  Score=18.72  Aligned_cols=17  Identities=0%  Similarity=-0.062  Sum_probs=8.6

Q ss_pred             HHHHHHHhc--CCCCCHHH
Q psy10964         69 AVCVVIINN--TSLHTATN   85 (103)
Q Consensus        69 ~vl~~~~~~--~~l~~~~~   85 (103)
                      +++|.+++.  +|-|.|.|
T Consensus        48 VilwfvCC~kRkrsRrPIY   66 (94)
T PF05393_consen   48 VILWFVCCKKRKRSRRPIY   66 (94)
T ss_pred             HHHHHHHHHHhhhccCCcc
Confidence            445666543  33455654


No 26 
>PTZ00046 rifin; Provisional
Probab=27.54  E-value=1.2e+02  Score=22.71  Aligned_cols=28  Identities=7%  Similarity=0.077  Sum_probs=19.2

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHhcCCCC
Q psy10964         54 IVYFGIFVTGVIGNIAVCVVIINNTSLH   81 (103)
Q Consensus        54 ~~~~~i~~~~~~gN~~vl~~~~~~~~l~   81 (103)
                      ++++++.++.++--.++++.+.|+||.+
T Consensus       317 IiaSiiAIvVIVLIMvIIYLILRYRRKK  344 (358)
T PTZ00046        317 IIASIVAIVVIVLIMVIIYLILRYRRKK  344 (358)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHhhhcc
Confidence            4556666666667777888888777643


No 27 
>PF10326 7TM_GPCR_Str:  Serpentine type 7TM GPCR chemoreceptor Str;  InterPro: IPR019428 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class r (Str) from the Str superfamily [, ]. Almost a quarter (22.5%) of str and srj family genes and pseudogenes in C. elegans appear to have been newly formed by gene duplications since the species split []. 
Probab=27.00  E-value=52  Score=23.13  Aligned_cols=44  Identities=9%  Similarity=0.184  Sum_probs=25.8

Q ss_pred             HHHHHHHHHHHHHHHHHHHhcCCCCCHHHH-HHHHHHHHHHHHHh
Q psy10964         57 FGIFVTGVIGNIAVCVVIINNTSLHTATNY-YLFSLAVSDLTLLL  100 (103)
Q Consensus        57 ~~i~~~~~~gN~~vl~~~~~~~~l~~~~~~-fl~nLa~~Dll~~~  100 (103)
                      -+.++.++..|.+.+..+.++.+.+...|= ++...|+.|+.++.
T Consensus         6 ~~~~~~s~~~N~~Li~Li~~~s~k~~G~Yk~Lm~~fs~~~i~fs~   50 (307)
T PF10326_consen    6 YIGFVLSLFLNSLLIYLILTKSPKSLGSYKYLMIYFSIFEIIFSI   50 (307)
T ss_pred             HHHHHHHHHHHHHHHHHHHhccCCCCCCEEEEEehhHHHHHHHHH
Confidence            345667888999999887755443333332 22334555655544


No 28 
>PF01102 Glycophorin_A:  Glycophorin A;  InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=26.96  E-value=1.2e+02  Score=19.09  Aligned_cols=12  Identities=42%  Similarity=0.504  Sum_probs=5.0

Q ss_pred             HHHHHHHHHHHH
Q psy10964         63 GVIGNIAVCVVI   74 (103)
Q Consensus        63 ~~~gN~~vl~~~   74 (103)
                      |++|=+++++.+
T Consensus        76 GvIg~Illi~y~   87 (122)
T PF01102_consen   76 GVIGIILLISYC   87 (122)
T ss_dssp             HHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHH
Confidence            444444444333


No 29 
>KOG3850|consensus
Probab=26.85  E-value=2.4e+02  Score=21.55  Aligned_cols=41  Identities=15%  Similarity=0.237  Sum_probs=25.9

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCHHHHHHHHHHH
Q psy10964         50 LPITIVYFGIFVTGVIGNIAVCVVIINNTSLHTATNYYLFSLAV   93 (103)
Q Consensus        50 ~~~~~~~~~i~~~~~~gN~~vl~~~~~~~~l~~~~~~fl~nLa~   93 (103)
                      +++.+.-+++++++.+.|++.-+   .+.|.|+.+-+|++-|++
T Consensus       395 iiLalm~VlLvfVSTIa~~v~PL---mkSR~rt~~t~~LV~l~~  435 (455)
T KOG3850|consen  395 IILALMTVLLVFVSTIANCVSPL---MKSRNRTASTFFLVFLLA  435 (455)
T ss_pred             HHHHHHHHHHHHHHHHHhhccHH---hhhhhHHHHHHHHHHHHH
Confidence            33444555666667777765443   345668888888876664


No 30 
>PF10873 DUF2668:  Protein of unknown function (DUF2668);  InterPro: IPR022640  Members in this family of proteins are annotated as cysteine and tyrosine-rich protein 1, however currently no function is known []. 
Probab=26.13  E-value=90  Score=20.39  Aligned_cols=29  Identities=34%  Similarity=0.598  Sum_probs=20.6

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHhcCC
Q psy10964         51 PITIVYFGIFVTGVIGNIAVCVVIINNTS   79 (103)
Q Consensus        51 ~~~~~~~~i~~~~~~gN~~vl~~~~~~~~   79 (103)
                      +..++|.++|+.|+++.+.+....+.++.
T Consensus        63 IaGIVfgiVfimgvva~i~icvCmc~kn~   91 (155)
T PF10873_consen   63 IAGIVFGIVFIMGVVAGIAICVCMCMKNS   91 (155)
T ss_pred             eeeeehhhHHHHHHHHHHHHHHhhhhhcC
Confidence            34467899999999888877666554443


No 31 
>PF15330 SIT:  SHP2-interacting transmembrane adaptor protein, SIT
Probab=25.61  E-value=1.7e+02  Score=17.89  Aligned_cols=25  Identities=0%  Similarity=-0.051  Sum_probs=14.3

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHhc
Q psy10964         53 TIVYFGIFVTGVIGNIAVCVVIINN   77 (103)
Q Consensus        53 ~~~~~~i~~~~~~gN~~vl~~~~~~   77 (103)
                      ..++.+..++.++.|++......++
T Consensus         4 l~il~llLll~l~asl~~wr~~~rq   28 (107)
T PF15330_consen    4 LGILALLLLLSLAASLLAWRMKQRQ   28 (107)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHhhh
Confidence            3455666666677777655444333


No 32 
>TIGR03475 tap_IncFII_lead RepA leader peptide Tap. This protein is a translated leader peptide that acts in the regulation of the expression of the plasmid replication protein RepA in incF2 group plasmids.
Probab=24.82  E-value=64  Score=14.48  Aligned_cols=13  Identities=23%  Similarity=0.368  Sum_probs=9.4

Q ss_pred             CCCHHHHHHHHHH
Q psy10964         80 LHTATNYYLFSLA   92 (103)
Q Consensus        80 l~~~~~~fl~nLa   92 (103)
                      +|...++|++++-
T Consensus         2 ~rKvQ~~FLc~~L   14 (26)
T TIGR03475         2 LRKVQYLFLCHLL   14 (26)
T ss_pred             chhHHHHHHHHHH
Confidence            3567789998854


No 33 
>PF10316 7TM_GPCR_Srbc:  Serpentine type 7TM GPCR chemoreceptor Srbc ;  InterPro: IPR019420 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class b (Srb) from the Sra superfamily []. Srb receptors contain 6-8 hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures. 
Srbc is a solo family amongst the superfamilies of chemoreceptors.
Probab=24.12  E-value=2.2e+02  Score=20.24  Aligned_cols=47  Identities=9%  Similarity=-0.019  Sum_probs=27.4

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHHHh
Q psy10964         54 IVYFGIFVTGVIGNIAVCVVIINNTSLHTATNYYLFSLAVSDLTLLL  100 (103)
Q Consensus        54 ~~~~~i~~~~~~gN~~vl~~~~~~~~l~~~~~~fl~nLa~~Dll~~~  100 (103)
                      .+-.+........|...++.+...|+.+++-..+..-....|.+++.
T Consensus        10 ~i~i~~s~~~~~iN~~lL~~if~~Kk~kk~~l~LfY~Rf~~D~~~~~   56 (273)
T PF10316_consen   10 IIGIIFSIITCLINFYLLYSIFYSKKKKKPDLSLFYFRFAIDVFYGF   56 (273)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHhccccCCCCEEeeHHHHHHHHHHHH
Confidence            34444555667789988888875555444432222334566776654


No 34 
>PF02009 Rifin_STEVOR:  Rifin/stevor family;  InterPro: IPR002858 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see []. Several multicopy gene families have been described in Plasmodium falciparum, including the stevor family of subtelomeric open reading frames and the rif interspersed repetitive elements. Both families contain three predicted transmembrane segments. It has been proposed that stevor and rif are members of a larger superfamily that code for variant surface antigens [].
Probab=23.78  E-value=99  Score=22.48  Aligned_cols=26  Identities=12%  Similarity=0.082  Sum_probs=15.7

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHhcCC
Q psy10964         54 IVYFGIFVTGVIGNIAVCVVIINNTS   79 (103)
Q Consensus        54 ~~~~~i~~~~~~gN~~vl~~~~~~~~   79 (103)
                      +..+++.++.++.=+++++.++|+||
T Consensus       258 I~aSiiaIliIVLIMvIIYLILRYRR  283 (299)
T PF02009_consen  258 IIASIIAILIIVLIMVIIYLILRYRR  283 (299)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            34555555555555667777776665


No 35 
>PF10327 7TM_GPCR_Sri:  Serpentine type 7TM GPCR chemoreceptor Sri;  InterPro: IPR019429 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents Sri, which is part of the Str superfamily of chemoreceptors.
Probab=23.51  E-value=1.8e+02  Score=20.75  Aligned_cols=50  Identities=28%  Similarity=0.298  Sum_probs=33.7

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHH-hcCCCCCHHHHHH---HHHHHHHHHHHhh
Q psy10964         52 ITIVYFGIFVTGVIGNIAVCVVII-NNTSLHTATNYYL---FSLAVSDLTLLLL  101 (103)
Q Consensus        52 ~~~~~~~i~~~~~~gN~~vl~~~~-~~~~l~~~~~~fl---~nLa~~Dll~~~~  101 (103)
                      ....|-++..+++.-|.+.++.+. +.+|+.+..|+++   ....+.|+-.+.+
T Consensus        11 li~~~~~ig~iS~~~n~~~iyLi~fks~k~~~fry~ll~~Qi~~~l~di~~t~L   64 (303)
T PF10327_consen   11 LINYYHIIGVISFILNSLGIYLIIFKSPKLDNFRYYLLYFQISCTLTDIHLTFL   64 (303)
T ss_pred             HHHHHHHHHHHHHHHHHHHheeEEEecCCccchhhHHHHHHHHHHHhhhhhhhh
Confidence            445677888899999999887665 4556666666555   3344567765543


No 36 
>PF08114 PMP1_2:  ATPase proteolipid family;  InterPro: IPR012589 This family consists of small proteolipids associated with the plasma membrane H+ ATPase. Two proteolipids (PMP1 and PMP2) are associated with the ATPase and both genes are similarly expressed in the wild-type strain of yeast. No modification of the level of transcription of one PMP gene is detected in a strain deleted of the other. Though both proteolipids show similarity with other small proteolipids associated with other cation-transporting ATPases, their functions remain unclear [].
Probab=23.31  E-value=76  Score=16.10  Aligned_cols=18  Identities=22%  Similarity=0.397  Sum_probs=8.9

Q ss_pred             HHHHHHHHHHHHHHHHHh
Q psy10964         59 IFVTGVIGNIAVCVVIIN   76 (103)
Q Consensus        59 i~~~~~~gN~~vl~~~~~   76 (103)
                      .+++|+.|-.++...+++
T Consensus        14 F~lVglv~i~iva~~iYR   31 (43)
T PF08114_consen   14 FCLVGLVGIGIVALFIYR   31 (43)
T ss_pred             hHHHHHHHHHHHHHHHHH
Confidence            344455555555544444


Done!