Query         psy3982
Match_columns 185
No_of_seqs    118 out of 1506
Neff          10.2
Searched_HMMs 46136
Date          Fri Aug 16 22:15:20 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy3982.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/3982hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PHA03234 DNA packaging protein  99.9 1.6E-22 3.5E-27  155.8  12.5  126   12-160   179-316 (338)
  2 PHA03235 DNA packaging protein  99.9 4.9E-22 1.1E-26  156.4  11.6  129   10-160   184-323 (409)
  3 KOG4219|consensus               99.9 2.6E-21 5.7E-26  146.6   9.2  141   11-161   185-336 (423)
  4 PHA02834 chemokine receptor-li  99.8 5.9E-21 1.3E-25  146.6   8.6  110   29-160   190-309 (323)
  5 PHA02638 CC chemokine receptor  99.8   1E-19 2.3E-24  143.7  12.2  130   11-162   252-399 (417)
  6 PHA03087 G protein-coupled che  99.8 8.8E-19 1.9E-23  135.2  12.4  133    7-160   181-323 (335)
  7 KOG4220|consensus               99.8 7.9E-19 1.7E-23  133.4   7.2   79   83-161   410-488 (503)
  8 PF00001 7tm_1:  7 transmembran  99.7 4.4E-16 9.6E-21  114.6  10.7  123   11-143   133-257 (257)
  9 KOG2087|consensus               99.3 5.4E-14 1.2E-18  105.9  -3.4  133   12-165   177-311 (363)
 10 PF10324 7TM_GPCR_Srw:  Serpent  99.3 1.1E-10 2.4E-15   89.6  11.6  117   32-160   196-317 (318)
 11 PF05296 TAS2R:  Mammalian tast  99.0 6.5E-09 1.4E-13   79.3  12.6  123   27-160   179-301 (303)
 12 PF10320 7TM_GPCR_Srsx:  Serpen  98.9 4.7E-08   1E-12   72.9  11.2  128    7-158   128-257 (257)
 13 PF10323 7TM_GPCR_Srv:  Serpent  98.6 1.5E-06 3.2E-11   65.8  13.5  116   32-160   167-283 (283)
 14 PF10321 7TM_GPCR_Srt:  Serpent  98.3 9.6E-06 2.1E-10   62.1   9.9  113   27-159   194-306 (313)
 15 PF03402 V1R:  Vomeronasal orga  98.1 3.3E-06 7.2E-11   62.8   3.2  111   34-154   153-263 (265)
 16 PF10317 7TM_GPCR_Srd:  Serpent  98.0 0.00022 4.9E-09   54.2  12.3   77   79-156   216-292 (292)
 17 PF10327 7TM_GPCR_Sri:  Serpent  98.0 0.00018 3.8E-09   55.1  11.2  108   30-153   195-302 (303)
 18 PF10328 7TM_GPCR_Srx:  Serpent  97.9 0.00064 1.4E-08   51.2  12.6  111   34-158   163-273 (274)
 19 PF05462 Dicty_CAR:  Slime mold  97.8 0.00082 1.8E-08   51.3  12.5   65   92-158   204-268 (303)
 20 PF11970 Git3_C:  G protein-cou  97.7 0.00025 5.4E-09   42.5   6.1   70   82-151     4-75  (76)
 21 PF10326 7TM_GPCR_Str:  Serpent  97.3 6.6E-05 1.4E-09   57.5   0.3  110   29-155   197-307 (307)
 22 PF10318 7TM_GPCR_Srh:  Serpent  97.1   0.009 1.9E-07   45.6  10.5   79   80-160   223-302 (302)
 23 KOG4193|consensus               96.9   0.019 4.1E-07   48.1  11.3   67   87-159   514-580 (610)
 24 PF10319 7TM_GPCR_Srj:  Serpent  96.6    0.04 8.6E-07   42.1  10.2   74   79-153   235-309 (310)
 25 PF02101 Ocular_alb:  Ocular al  96.6   0.023 4.9E-07   44.3   8.7   24   91-114   245-268 (405)
 26 PF03125 Sre:  C. elegans Sre G  96.2    0.14 2.9E-06   40.4  11.4   44  120-163   281-324 (365)
 27 PF02118 Srg:  Srg family chemo  96.2    0.21 4.6E-06   37.5  12.0   98   37-153   177-274 (275)
 28 KOG4564|consensus               95.6   0.086 1.9E-06   42.7   7.9  100   39-160   318-419 (473)
 29 PF04789 DUF621:  Protein of un  92.0     4.2 9.1E-05   30.8  11.8  112   25-158   184-296 (305)
 30 PF10292 7TM_GPCR_Srab:  Serpen  85.6      15 0.00032   28.5  12.5   78   80-158   223-304 (324)
 31 PF02076 STE3:  Pheromone A rec  82.0      21 0.00044   27.3  11.7   25   87-112   196-220 (283)
 32 PF10322 7TM_GPCR_Sru:  Serpent  81.5      13 0.00028   28.8   7.8  104   34-152   201-304 (307)
 33 PF13853 7tm_4:  Olfactory rece  71.9  0.0045 9.8E-08   42.1 -12.0   69   23-112    52-120 (144)
 34 KOG0721|consensus               70.4      40 0.00086   24.7   7.2   16  144-159   110-125 (230)
 35 KOG2575|consensus               66.3      11 0.00023   30.3   4.0   32   79-110   248-279 (510)
 36 PF11014 DUF2852:  Protein of u  64.5      13 0.00029   24.1   3.6   26   91-117    11-36  (115)
 37 PF05478 Prominin:  Prominin;    62.3      38 0.00082   30.1   7.1   33   86-118   457-489 (806)
 38 COG1862 YajC Preprotein transl  58.9      30 0.00066   21.8   4.4   30   39-68     14-43  (97)
 39 PF15086 UPF0542:  Uncharacteri  58.2      37 0.00079   20.1   6.4   37   27-63     19-56  (74)
 40 KOG3500|consensus               55.5      22 0.00048   21.5   3.1   26   85-112    30-55  (84)
 41 PRK04989 psbM photosystem II r  54.0      27 0.00058   17.4   2.8   22   29-50      7-28  (35)
 42 PF02117 7TM_GPCR_Sra:  Serpent  52.6 1.1E+02  0.0024   23.9   9.1   34   82-115   228-261 (328)
 43 COG4736 CcoQ Cbb3-type cytochr  52.2      43 0.00093   19.0   4.9   39   29-67      4-42  (60)
 44 TIGR03038 PS_II_psbM photosyst  50.2      30 0.00066   17.0   2.6   22   29-50      7-28  (33)
 45 PF06305 DUF1049:  Protein of u  50.2      47   0.001   18.9   6.7   35   29-63     24-58  (68)
 46 PHA02650 hypothetical protein;  48.6      54  0.0012   19.7   4.0   28   26-53     47-74  (81)
 47 PRK06531 yajC preprotein trans  48.1      42 0.00091   21.8   3.9    7   39-45      7-13  (113)
 48 PF09889 DUF2116:  Uncharacteri  47.6      27 0.00058   19.8   2.6   20   84-103    34-53  (59)
 49 KOG3827|consensus               47.6      12 0.00025   29.8   1.5   63   86-148    60-124 (400)
 50 CHL00080 psbM photosystem II p  45.7      39 0.00085   16.7   2.7   22   29-50      7-28  (34)
 51 PHA02844 putative transmembran  45.6      44 0.00095   19.8   3.3   26   27-52     47-72  (75)
 52 COG2322 Predicted membrane pro  45.5   1E+02  0.0023   21.6   7.9   37   81-117    71-107 (177)
 53 COG3924 Predicted membrane pro  43.0      71  0.0015   18.9   4.0   33   22-54     36-68  (80)
 54 TIGR00739 yajC preprotein tran  42.4      71  0.0015   19.4   4.1   24   43-66     12-35  (84)
 55 PHA02975 hypothetical protein;  40.3      76  0.0017   18.5   4.2   27   26-52     42-68  (69)
 56 PF10322 7TM_GPCR_Sru:  Serpent  40.3 1.8E+02  0.0038   22.7   6.9   17   37-53    208-224 (307)
 57 PF05817 Ribophorin_II:  Oligos  39.6 2.6E+02  0.0055   24.3   8.7   28   28-55    548-575 (636)
 58 PHA02819 hypothetical protein;  38.5      84  0.0018   18.5   4.0   26   26-51     44-69  (71)
 59 PF05151 PsbM:  Photosystem II   38.3      52  0.0011   15.9   3.7   21   30-50      8-28  (31)
 60 PF15102 TMEM154:  TMEM154 prot  37.2     2.5 5.4E-05   28.6  -3.1    7   35-41     59-65  (146)
 61 PF07330 DUF1467:  Protein of u  36.5      79  0.0017   19.4   3.6   34   73-106    38-71  (85)
 62 PF04238 DUF420:  Protein of un  34.2 1.5E+02  0.0031   19.9   8.6   38   81-118    30-67  (133)
 63 PF11710 Git3:  G protein-coupl  33.2 1.3E+02  0.0027   21.7   4.9   35   27-61    166-200 (201)
 64 PRK14094 psbM photosystem II r  32.1      37  0.0008   18.1   1.4   22   29-50      7-28  (50)
 65 PLN00090 photosystem II reacti  31.0   1E+02  0.0023   19.2   3.5   22   29-50     77-98  (113)
 66 COG3771 Predicted membrane pro  30.6 1.4E+02  0.0029   18.5   6.0   42   29-70     45-86  (97)
 67 PF00002 7tm_2:  7 transmembran  30.4      17 0.00037   26.6   0.0   21   87-107   195-215 (242)
 68 PF02699 YajC:  Preprotein tran  30.1 1.1E+02  0.0024   18.5   3.5   18   42-59     10-27  (82)
 69 PRK05886 yajC preprotein trans  30.1 1.6E+02  0.0034   19.1   4.8   24   37-60      6-30  (109)
 70 COG4068 Uncharacterized protei  28.8 1.2E+02  0.0025   17.2   3.2   19   84-102    38-56  (64)
 71 PF11119 DUF2633:  Protein of u  28.8 1.2E+02  0.0026   17.2   4.1   30   83-112     4-33  (59)
 72 PF12725 DUF3810:  Protein of u  28.4 2.9E+02  0.0063   21.6   7.7   22   86-107    58-79  (318)
 73 PF14163 SieB:  Superinfection   28.4 1.2E+02  0.0026   20.5   4.0   22   93-114     4-25  (151)
 74 PF02439 Adeno_E3_CR2:  Adenovi  27.9      95  0.0021   15.8   3.3   28   29-56      7-34  (38)
 75 PTZ00370 STEVOR; Provisional    27.3 1.1E+02  0.0025   23.4   3.8   17   38-54    265-281 (296)
 76 PF00664 ABC_membrane:  ABC tra  27.2 2.3E+02   0.005   20.1   9.6   20  122-141   256-275 (275)
 77 PF00375 SDF:  Sodium:dicarboxy  26.3 3.4E+02  0.0074   21.7   9.9   73   86-158   166-246 (390)
 78 PF05241 EBP:  Emopamil binding  26.0 1.2E+02  0.0025   21.8   3.6   32   27-58    160-191 (194)
 79 PHA03054 IMV membrane protein;  26.0 1.5E+02  0.0032   17.5   4.0   26   26-51     46-71  (72)
 80 TIGR02736 cbb3_Q_epsi cytochro  25.6 1.3E+02  0.0029   16.8   3.0   21   39-59      7-27  (56)
 81 COG4665 FcbT2 TRAP-type mannit  25.1 2.5E+02  0.0055   19.8   6.9   33   24-56     88-120 (182)
 82 TIGR01478 STEVOR variant surfa  24.5 1.1E+02  0.0024   23.4   3.3   15   39-53    270-284 (295)
 83 PHA03234 DNA packaging protein  24.5      89  0.0019   24.4   3.1   11  101-111   205-215 (338)
 84 PF10669 Phage_Gp23:  Protein g  22.5 2.1E+02  0.0046   18.0   5.5   15   37-51     23-37  (121)
 85 PHA01815 hypothetical protein   22.4 1.4E+02   0.003   15.8   4.3   23   31-53      9-31  (55)
 86 PF10066 DUF2304:  Uncharacteri  22.4 2.3E+02  0.0049   18.3  11.2   51   83-134    29-79  (115)
 87 TIGR03052 PS_I_psaI photosyste  22.2   1E+02  0.0022   14.9   1.9   19   32-50      8-26  (31)
 88 TIGR00934 2a38euk potassium up  21.7 5.1E+02   0.011   23.3   7.1   32   84-115   443-474 (800)
 89 PF03155 Alg6_Alg8:  ALG6, ALG8  21.3 1.4E+02   0.003   24.8   3.6   27   83-109   218-244 (469)
 90 PF08525 OapA_N:  Opacity-assoc  20.3 1.2E+02  0.0026   14.4   2.9   23   82-106     7-29  (30)
 91 PHA02909 hypothetical protein;  20.1 1.8E+02  0.0038   16.2   4.5   20   32-51     40-59  (72)
 92 PF10361 DUF2434:  Protein of u  20.1 4.2E+02  0.0091   20.5   6.8   46   10-64     32-77  (296)

No 1  
>PHA03234 DNA packaging protein UL33; Provisional
Probab=99.89  E-value=1.6e-22  Score=155.78  Aligned_cols=126  Identities=17%  Similarity=0.261  Sum_probs=94.1

Q ss_pred             ccccccccccccccchHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCchHHHHH
Q psy3982          12 TACGTDYLTKDWHHRSYL---IFYGFFVYFVPLILIIYAYYFIVRAVSVHEKQMREQAKKMNVATLRSGDQSGTSAEIKL   88 (185)
Q Consensus        12 ~~C~~~~~~~~~~~~~y~---~~~~~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~   88 (185)
                      +.|...|+.++. ...+.   ...+.+.+++|+++|++||.+|.+.++++                      +.++++|+
T Consensus       179 ~~C~~~~~~~~~-~~~~~~~~~~~~~~~f~iPl~im~~cY~~I~~~L~~~----------------------~~~~~~k~  235 (338)
T PHA03234        179 GKCNIHISSKKA-YDLFIAIKIVFCFIWGIFPTMIFSFFYVIFCKALHAL----------------------TEKKHKKT  235 (338)
T ss_pred             CcCcccCCcchh-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh----------------------hhhhhhhh
Confidence            579887754321 11222   12234457999999999999999888742                      01346789


Q ss_pred             HHHHHHHHHHHHHHhhHHHHHHHHHhhhc----cc-----chHHHHHHHHHHHhhhhhccchhheecchHHHHHHHhhcc
Q psy3982          89 AKVALMTISLWFLAWTPYLVINFGGILEM----VE-----INPLITIWGSVFAKANAVYNPIVYAISHPKFRQALDKKFP  159 (185)
Q Consensus        89 ~~~~~~i~~~f~i~~~P~~v~~~~~~~~~----~~-----~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~~~~~~  159 (185)
                      +|++++++++|++||+||++..++.....    ..     .......++.+++++||++||+||++.+++||+++++.++
T Consensus       236 ~k~i~~vv~vF~iCWlPy~iv~l~~~~~~~~~~~~c~~~~~~~~~~~v~~~La~~nsclNPiIY~f~~~~FR~~~~~~~~  315 (338)
T PHA03234        236 LFFIRILILSFLCIQIPNIAILICEIAFLYIANNSCFGLAQREILQIIIRLMPEIHCFSNPLVYAFTGGDFRLRFTACFQ  315 (338)
T ss_pred             hhHHHHHHHHHHHHHhHHHHHHHHHHHHHhcccCcchHHHHHHHHHHHHHHHHHhhhhhhHHHHHHhhHHHHHHHHHHHH
Confidence            99999999999999999999887654321    00     1234567888999999999999999999999999998776


Q ss_pred             c
Q psy3982         160 S  160 (185)
Q Consensus       160 ~  160 (185)
                      +
T Consensus       316 ~  316 (338)
T PHA03234        316 D  316 (338)
T ss_pred             H
Confidence            4


No 2  
>PHA03235 DNA packaging protein UL33; Provisional
Probab=99.88  E-value=4.9e-22  Score=156.39  Aligned_cols=129  Identities=12%  Similarity=0.272  Sum_probs=99.3

Q ss_pred             Cccccccccccccc--ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCchHHHH
Q psy3982          10 NLTACGTDYLTKDW--HHRSYLIFYGFFVYFVPLILIIYAYYFIVRAVSVHEKQMREQAKKMNVATLRSGDQSGTSAEIK   87 (185)
Q Consensus        10 ~~~~C~~~~~~~~~--~~~~y~~~~~~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~   87 (185)
                      +...|...+..+..  ....|.+..+++.|++|+++|++||.+|++.++++.+                      ++++|
T Consensus       184 ~~~~C~~~~~~~~~~~~~~~y~i~l~i~~f~iPl~im~~~Y~~I~~~l~~~~~----------------------~~~~k  241 (409)
T PHA03235        184 GYETCVIYFRADQVKTVLSTFKVLLTLVWGIAPVVMMTWFYTFFYRTLKRASY----------------------KKRSR  241 (409)
T ss_pred             CcceeeEeCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh----------------------hcchh
Confidence            34579765544321  1234666777788999999999999999999986421                      12457


Q ss_pred             HHHHHHHHHHHHHHHhhHHHHHHHHHhhhcc----c-----chHHHHHHHHHHHhhhhhccchhheecchHHHHHHHhhc
Q psy3982          88 LAKVALMTISLWFLAWTPYLVINFGGILEMV----E-----INPLITIWGSVFAKANAVYNPIVYAISHPKFRQALDKKF  158 (185)
Q Consensus        88 ~~~~~~~i~~~f~i~~~P~~v~~~~~~~~~~----~-----~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~~~~~  158 (185)
                      ..+++++++++|++||+||++..++..+...    .     ....+..++.+++++|+++||+||.+.+++||+++++.+
T Consensus       242 ~~~~v~iivv~F~iCWlPy~v~~l~~~~~~~~~~~~~~~~~~~~~~~~i~~~La~~ns~lNPiIY~~~~~~FRk~~~~~l  321 (409)
T PHA03235        242 TLTFVCILLLSFLCLQTPFVAIMIFDSYATLIWPSDCEHINLRDAVSTLSRLVPNLHCLLNPILYAFLGNDFLKRFRQCF  321 (409)
T ss_pred             hhhhHHHHHHHHHHHHhHHHHHHHHHHHHHhcCCCCchhhhHHHHHHHHHHHHHHHHHhHhHHHHHHhhHHHHHHHHHHH
Confidence            7888999999999999999999887654321    1     234567788999999999999999999999999999888


Q ss_pred             cc
Q psy3982         159 PS  160 (185)
Q Consensus       159 ~~  160 (185)
                      ++
T Consensus       322 ~~  323 (409)
T PHA03235        322 RG  323 (409)
T ss_pred             hh
Confidence            64


No 3  
>KOG4219|consensus
Probab=99.85  E-value=2.6e-21  Score=146.56  Aligned_cols=141  Identities=20%  Similarity=0.333  Sum_probs=111.9

Q ss_pred             ccccccccccc--c-----cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCch
Q psy3982          11 LTACGTDYLTK--D-----WHHRSYLIFYGFFVYFVPLILIIYAYYFIVRAVSVHEKQMREQAKKMNVATLRSGDQSGTS   83 (185)
Q Consensus        11 ~~~C~~~~~~~--~-----~~~~~y~~~~~~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~   83 (185)
                      ...|...|+..  .     ...+.|..++.++.+++|++++..+|..|...+|..+.... +..         ++.+..+
T Consensus       185 ~~~~~~~~pe~~~~~~~~~~~~~~y~~vl~~lqYflPliVl~~~Yt~iav~LW~~~~~gd-~~d---------~~~~~~k  254 (423)
T KOG4219|consen  185 RVVCVTAWPEHVCPTENESLLMQGYNYVLLFLQYFLPLIVLGLAYTVIAVTLWGRRIPGD-QQD---------RKHEQLK  254 (423)
T ss_pred             eEEEEEecccccCCcchhhhhhcceeeeehhHHHHHHHHHHHHHHHHHHHHHHhccCccc-hhc---------hhhHHHH
Confidence            55676666554  1     12234888899999999999999999999999996542211 110         1334567


Q ss_pred             HHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhhhcc----cchHHHHHHHHHHHhhhhhccchhheecchHHHHHHHhhcc
Q psy3982          84 AEIKLAKVALMTISLWFLAWTPYLVINFGGILEMV----EINPLITIWGSVFAKANAVYNPIVYAISHPKFRQALDKKFP  159 (185)
Q Consensus        84 ~~~~~~~~~~~i~~~f~i~~~P~~v~~~~~~~~~~----~~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~~~~~~  159 (185)
                      .++|+.||++++++.|.+||+||++..++.....+    ......+....||+.+|+|.||+||++.|++||.++++.++
T Consensus       255 ak~K~vkmliiVV~~FaicWlPyh~y~il~~~~~~i~~~k~i~~vyl~~~WLaMSst~yNPiIY~~lN~Rfr~gf~~~fr  334 (423)
T KOG4219|consen  255 AKKKVVKMLIIVVVIFAICWLPYHIYFILNATNPEINRKKFIQQVYLAIYWLAMSSTCYNPIIYCFLNKRFRGGFRRAFR  334 (423)
T ss_pred             HHHHHHHHHHHHHHHHHHhccChhHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhhccHhhhhhHHHHHHHHhhhhh
Confidence            78899999999999999999999999988764443    25577888999999999999999999999999999999997


Q ss_pred             cc
Q psy3982         160 SL  161 (185)
Q Consensus       160 ~~  161 (185)
                      ++
T Consensus       335 ~c  336 (423)
T KOG4219|consen  335 WC  336 (423)
T ss_pred             ee
Confidence            65


No 4  
>PHA02834 chemokine receptor-like protein; Provisional
Probab=99.84  E-value=5.9e-21  Score=146.61  Aligned_cols=110  Identities=20%  Similarity=0.376  Sum_probs=87.0

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHHHHHHHHHHHhhHHHH
Q psy3982          29 LIFYGFFVYFVPLILIIYAYYFIVRAVSVHEKQMREQAKKMNVATLRSGDQSGTSAEIKLAKVALMTISLWFLAWTPYLV  108 (185)
Q Consensus        29 ~~~~~~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~f~i~~~P~~v  108 (185)
                      ....+++.+++|+++|+++|.+|.+.++++++                      +.++|.+|+++++++.|++||+|+++
T Consensus       190 ~~~~~i~~f~iPl~ii~~~Y~~I~~~l~~~~~----------------------~~~~k~~k~~~~vv~~F~icWlPy~i  247 (323)
T PHA02834        190 NFEINIFGIVIPLIILIYCYSKILYTLKNCKN----------------------KNKTRSIKIILTVVTFTVVFWVPFNI  247 (323)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC----------------------cccceEEeehhHHHHHHHHHHhhHHH
Confidence            33455778999999999999999999875311                      22457889999999999999999999


Q ss_pred             HHHHHhhhccc--------c--hHHHHHHHHHHHhhhhhccchhheecchHHHHHHHhhccc
Q psy3982         109 INFGGILEMVE--------I--NPLITIWGSVFAKANAVYNPIVYAISHPKFRQALDKKFPS  160 (185)
Q Consensus       109 ~~~~~~~~~~~--------~--~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~~~~~~~  160 (185)
                      ..++..+....        .  ......+..+++++||++||+||++.+++||+++++++++
T Consensus       248 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~~ns~iNPiIY~~~~~~fR~~~~~~~~~  309 (323)
T PHA02834        248 VLFINSLQSVGLIDIGCYHFKKIVYSIDIAELISFVHCCVNPIIYAFVGKNFKKVFKNMFCR  309 (323)
T ss_pred             HHHHHHHHHhcCCCccchHHHHHHHHHHHHHHHHHhhccccHHHHHHhcHHHHHHHHHHHHh
Confidence            88876543211        0  1123457889999999999999999999999999998743


No 5  
>PHA02638 CC chemokine receptor-like protein; Provisional
Probab=99.82  E-value=1e-19  Score=143.74  Aligned_cols=130  Identities=15%  Similarity=0.233  Sum_probs=97.9

Q ss_pred             ccccccccccccc----ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCchHHH
Q psy3982          11 LTACGTDYLTKDW----HHRSYLIFYGFFVYFVPLILIIYAYYFIVRAVSVHEKQMREQAKKMNVATLRSGDQSGTSAEI   86 (185)
Q Consensus        11 ~~~C~~~~~~~~~----~~~~y~~~~~~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~   86 (185)
                      ...|...++.+..    ....+.+...++++++|+++|++||.+|++.+++.+                      .++++
T Consensus       252 ~~~C~~~~~~~~~~~~~~~~~~~~~~~i~~f~lPl~vmi~cY~~I~~~L~~~~----------------------~~~k~  309 (417)
T PHA02638        252 NYQCTLIEDNEKNNISFLGRILQFEINILGMFIPIIIFAFCYIKIILKLKQLK----------------------KSKKT  309 (417)
T ss_pred             CCeeeeeccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh----------------------ccccc
Confidence            4579765543221    112344455677899999999999999999987531                      12356


Q ss_pred             HHHHHHHHHHHHHHHHhhHHHHHHHHHhhhccc--------------chHHHHHHHHHHHhhhhhccchhheecchHHHH
Q psy3982          87 KLAKVALMTISLWFLAWTPYLVINFGGILEMVE--------------INPLITIWGSVFAKANAVYNPIVYAISHPKFRQ  152 (185)
Q Consensus        87 ~~~~~~~~i~~~f~i~~~P~~v~~~~~~~~~~~--------------~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~  152 (185)
                      |.+|+++.++++|++||+|+.+..++.......              .......+..+++++||++||+||++.+++||+
T Consensus       310 k~~rli~~ivi~f~lcW~Py~i~~ll~~~~~~~~~~~~~~~~~c~~~~l~~~~~vt~~la~~~sclNPiIY~f~~~~FR~  389 (417)
T PHA02638        310 KSIIIVSIIIICSLICWIPLNIVILFATMYSFKGFNSIISEHICGFIKLGYAMMLAEAISLTHCCINPLIYTLIGENFRM  389 (417)
T ss_pred             hhhhhHHHHHHHHHHHHhhHHHHHHHHHHHHhhccccccccccccHHHHHHHHHHHHHHHHHHHhhhHHHHHHhCHHHHH
Confidence            778999999999999999999998877653210              123455678899999999999999999999999


Q ss_pred             HHHhhccccc
Q psy3982         153 ALDKKFPSLV  162 (185)
Q Consensus       153 ~~~~~~~~~~  162 (185)
                      ++++.+++..
T Consensus       390 ~l~~~~~~~~  399 (417)
T PHA02638        390 HLLMIFRNIF  399 (417)
T ss_pred             HHHHHHHHhc
Confidence            9999886543


No 6  
>PHA03087 G protein-coupled chemokine receptor-like protein; Provisional
Probab=99.80  E-value=8.8e-19  Score=135.22  Aligned_cols=133  Identities=18%  Similarity=0.305  Sum_probs=98.6

Q ss_pred             ccCCcccccccccccccccchH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCchHH
Q psy3982           7 PEGNLTACGTDYLTKDWHHRSY-LIFYGFFVYFVPLILIIYAYYFIVRAVSVHEKQMREQAKKMNVATLRSGDQSGTSAE   85 (185)
Q Consensus         7 ~~~~~~~C~~~~~~~~~~~~~y-~~~~~~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~   85 (185)
                      .+.+...|...+.++......+ .....++.+++|+++++++|.+|.+.++++++                     .+++
T Consensus       181 ~~~~~~~C~~~~~~~~~~~~~~~~~~~~~~~~~lP~~ii~~~y~~i~~~l~~~~~---------------------~~~~  239 (335)
T PHA03087        181 KDHETLICCMFYNNKTMNWKLFINFEINIIGMLIPLTILLYCYSKILITLKGINK---------------------SKKN  239 (335)
T ss_pred             ccCCCceEEecCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc---------------------chhc
Confidence            3445567887776433222222 23345677899999999999999888875321                     2346


Q ss_pred             HHHHHHHHHHHHHHHHHhhHHHHHHHHHhhhc-----c-c---chHHHHHHHHHHHhhhhhccchhheecchHHHHHHHh
Q psy3982          86 IKLAKVALMTISLWFLAWTPYLVINFGGILEM-----V-E---INPLITIWGSVFAKANAVYNPIVYAISHPKFRQALDK  156 (185)
Q Consensus        86 ~~~~~~~~~i~~~f~i~~~P~~v~~~~~~~~~-----~-~---~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~~~  156 (185)
                      +|+.|++..++++|++||+|+.+..++.....     . .   ....+..+..+++.+|+++||+||++++++||+++++
T Consensus       240 ~k~~k~l~~iv~~f~i~w~P~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~~ns~~NPiIY~~~~~~fr~~~~~  319 (335)
T PHA03087        240 KKAIKLVLIIVILFVIFWLPFNVSVFVYSLHILHFKSGCKAVKYIQYALHVTEIISLSHCCINPLIYAFVSEFFNKHKKK  319 (335)
T ss_pred             chHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcHHHHHHHHHHHHHHHHHHHHhhhhhHHHHcCHHHHHHHHH
Confidence            78999999999999999999999877654311     1 1   2244556788899999999999999999999999999


Q ss_pred             hccc
Q psy3982         157 KFPS  160 (185)
Q Consensus       157 ~~~~  160 (185)
                      .++.
T Consensus       320 ~~~~  323 (335)
T PHA03087        320 SLKL  323 (335)
T ss_pred             HHHH
Confidence            8853


No 7  
>KOG4220|consensus
Probab=99.77  E-value=7.9e-19  Score=133.41  Aligned_cols=79  Identities=23%  Similarity=0.371  Sum_probs=74.9

Q ss_pred             hHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhhhcccchHHHHHHHHHHHhhhhhccchhheecchHHHHHHHhhcccc
Q psy3982          83 SAEIKLAKVALMTISLWFLAWTPYLVINFGGILEMVEINPLITIWGSVFAKANAVYNPIVYAISHPKFRQALDKKFPSL  161 (185)
Q Consensus        83 ~~~~~~~~~~~~i~~~f~i~~~P~~v~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~~~~~~~~  161 (185)
                      .+|+|++|++.+|.++|+++|.||.++.++..+.....+..++.+..||-++||-+||++|.++|+.||+.++++|.|.
T Consensus       410 ~rErKAAkTLsAILlAFIiTWtPYNImVlv~tFC~~CiP~tlW~~gYwLCYINSTiNP~CYALCNatFrkTfk~lL~Cr  488 (503)
T KOG4220|consen  410 VRERKAAKTLSAILLAFILTWTPYNIMVLVNTFCKNCIPETLWTFGYWLCYINSTINPLCYALCNATFRKTFKRLLLCR  488 (503)
T ss_pred             HHHHHHHHHHHHHHHHHHHHcccceeeeehHhhcccccchhHhhhhhheeeecccccHHHHHHHhHHHHHHHHHhheee
Confidence            5789999999999999999999999999999988888999999999999999999999999999999999999999654


No 8  
>PF00001 7tm_1:  7 transmembrane receptor (rhodopsin family) Rhodopsin-like GPCR superfamily signature 5-hydroxytryptamine 7 receptor signature bradykinin receptor signature gastrin receptor signature melatonin receptor signature olfactory receptor signature;  InterPro: IPR000276 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The rhodopsin-like GPCRs themselves represent a widespread protein family that includes hormone, neurotransmitter and light receptors, all of which transduce extracellular signals through interaction with guanine nucleotide-binding (G) proteins. Although their activating ligands vary widely in structure and character, the amino acid sequences of the receptors are very similar and are believed to adopt a common structural framework comprising 7 transmembrane (TM) helices [, , ].; GO: 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane; PDB: 2KI9_A 3QAK_A 2YDV_A 3VGA_A 3PWH_A 3RFM_A 3EML_A 3VG9_A 3REY_A 3UZA_A ....
Probab=99.68  E-value=4.4e-16  Score=114.58  Aligned_cols=123  Identities=22%  Similarity=0.525  Sum_probs=101.1

Q ss_pred             cccccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCchHHHHHHH
Q psy3982          11 LTACGTDYLTKDWHHRSYLIFYGFFVYFVPLILIIYAYYFIVRAVSVHEKQMREQAKKMNVATLRSGDQSGTSAEIKLAK   90 (185)
Q Consensus        11 ~~~C~~~~~~~~~~~~~y~~~~~~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~   90 (185)
                      ...|..++..+.  ...+.....++.+++|+++++++|.+|++.+++++++...+....        ..++.++++|.++
T Consensus       133 ~~~C~~~~~~~~--~~~~~~~~~~~~~~~p~~~~~~~~~~i~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~  202 (257)
T PF00001_consen  133 QSFCFIDFSSSS--SQIYFIYFFIVFFILPLIIILICYIRILRKLRRQRKRIKSQSSSS--------SRRRSRRERRAAR  202 (257)
T ss_dssp             CEEEEESCSSSH--HHHHHHHHHHHHTHHHHHHHHHHHHHHHHHHHHHHHCTCCHTSHH--------HHHHHHHHHHHHH
T ss_pred             cccccccccccc--cccccccccccccccceeeeeeecccccccccccccccccccccc--------ccccccccccccc
Confidence            567977665533  456788888889999999999999999999998877653221100        1234577899999


Q ss_pred             HHHHHHHHHHHHhhHHHHHHHHHhhhccc--chHHHHHHHHHHHhhhhhccchhh
Q psy3982          91 VALMTISLWFLAWTPYLVINFGGILEMVE--INPLITIWGSVFAKANAVYNPIVY  143 (185)
Q Consensus        91 ~~~~i~~~f~i~~~P~~v~~~~~~~~~~~--~~~~~~~~~~~l~~~~~~~nPiiY  143 (185)
                      +++.++++|++||+|+.+..++....+..  .......++.++.++|+++||+||
T Consensus       203 ~~~~i~~~f~~~~~P~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~~s~~nP~iY  257 (257)
T PF00001_consen  203 TLLIIVLVFLLCWLPYFILSLLSVFSPSSSLISSILFYISYFLAFLNSCLNPIIY  257 (257)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHHSSTSTCSHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             ccccccccccccCCceeHHHHHHHHcCccchhhHHHHHHHHHHHHHHHhhCcEEC
Confidence            99999999999999999999998888776  578899999999999999999998


No 9  
>KOG2087|consensus
Probab=99.32  E-value=5.4e-14  Score=105.93  Aligned_cols=133  Identities=16%  Similarity=0.235  Sum_probs=100.0

Q ss_pred             ccccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCchHH-HHHHH
Q psy3982          12 TACGTDYLTKDWHHRSYLIFYGFFVYFVPLILIIYAYYFIVRAVSVHEKQMREQAKKMNVATLRSGDQSGTSAE-IKLAK   90 (185)
Q Consensus        12 ~~C~~~~~~~~~~~~~y~~~~~~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~   90 (185)
                      ..|.+....+. ....|.+..++..-.+-+++|.++|++++..+++-                    ....... ...+|
T Consensus       177 ~vClPL~~~~~-~s~g~y~~~~l~~N~lafiiia~~Y~~iy~~l~~~--------------------~~~~~~~~~~~ak  235 (363)
T KOG2087|consen  177 SVCLPLHIEEP-LSTGYYLVALLGLNLLAFIIIAFSYGKIYCSLRKG--------------------DLSATLISTSVAK  235 (363)
T ss_pred             ceeeecccCCc-cchhHHHHHHHHHHHHHHHHHHHHhhhhheeeecC--------------------CCccccchhhhhh
Confidence            58987554432 23336666667777788999999999999999952                    1111222 37788


Q ss_pred             HHHHHHHHHHHHhhHHHHHHHHHhhhcccch-HHHHHHHHHHHhhhhhccchhheecchHHHHHHHhhcccccccc
Q psy3982          91 VALMTISLWFLAWTPYLVINFGGILEMVEIN-PLITIWGSVFAKANAVYNPIVYAISHPKFRQALDKKFPSLVCGT  165 (185)
Q Consensus        91 ~~~~i~~~f~i~~~P~~v~~~~~~~~~~~~~-~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~~~~~~~~~~~~  165 (185)
                      -+..+++.-++||.|..+..+...+..+-.. .....+..++.-+|+++||++|.+.++.||+.++.++++..+++
T Consensus       236 r~a~LvfTd~icw~Pi~f~~~~al~~~~li~~~~sk~llv~flPlns~~NP~LYa~fT~~fk~d~~~l~~k~~~~k  311 (363)
T KOG2087|consen  236 RMAFLVFTDCICWCPIAFFKFSALIGVELISVSYSKWLLVFFLPLNSCLNPFLYAFFTPVFKEDLFLLLSKVGLCK  311 (363)
T ss_pred             CeeEEEEccccccCchheeeeHHhcCCcccChhhceeEEEEEEEcccccCchhHHHcCHHHHHHHHHHHhhcchhh
Confidence            8889999999999999999998887766422 33335666777899999999999999999999999887654443


No 10 
>PF10324 7TM_GPCR_Srw:  Serpentine type 7TM GPCR chemoreceptor Srw;  InterPro: IPR019427 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class w (Srw), which is a solo family amongst the superfamilies of chemoreceptors. The genes encoding Srw do not appear to be under as strong an adaptive evolutionary pressure as those of Srz []. 
Probab=99.25  E-value=1.1e-10  Score=89.61  Aligned_cols=117  Identities=14%  Similarity=0.202  Sum_probs=95.2

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHHHHHHHHHHHhhHHHHHHH
Q psy3982          32 YGFFVYFVPLILIIYAYYFIVRAVSVHEKQMREQAKKMNVATLRSGDQSGTSAEIKLAKVALMTISLWFLAWTPYLVINF  111 (185)
Q Consensus        32 ~~~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~f~i~~~P~~v~~~  111 (185)
                      ..++.-++|.++..++...+...+|+.++++++..+           + ..+++.+.+++++.+++.|+++-+|..+..+
T Consensus       196 ~gi~~kiiP~il~~ilti~Li~~Lrk~~~~r~~~~~-----------~-~~~~~~~tt~li~~~ti~f~i~e~p~gi~~~  263 (318)
T PF10324_consen  196 DGIFFKIIPCILLPILTILLIIELRKAKKRRKKLSS-----------S-KSKKSDRTTKLILFMTISFLISELPQGIIFL  263 (318)
T ss_pred             hhhHhhhhhHHHHHHHHHHHHHHHHhccHhhhcccc-----------c-ccccccchHHHHHHHHHHHHHHHHHHHHHHH
Confidence            344457899999999999999999987777653321           1 1356779999999999999999999999999


Q ss_pred             HHhhhccc-----chHHHHHHHHHHHhhhhhccchhheecchHHHHHHHhhccc
Q psy3982         112 GGILEMVE-----INPLITIWGSVFAKANAVYNPIVYAISHPKFRQALDKKFPS  160 (185)
Q Consensus       112 ~~~~~~~~-----~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~~~~~~~  160 (185)
                      +.....+.     ....+..+..++..+|+..|++||.+++.+||+++++++++
T Consensus       264 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~ns~~h~~ic~~mSsqYR~t~~~~f~~  317 (318)
T PF10324_consen  264 LESFFEEDSGLIFIIIQLSIIFNILITINSSIHFFICCFMSSQYRKTVKKLFGC  317 (318)
T ss_pred             HHHHHhhhcchhHHHHHHHHHHHHHHHHHhhhHHHhhhhhhHHHHHHHHHHhcc
Confidence            87753332     44567778899999999999999999999999999999853


No 11 
>PF05296 TAS2R:  Mammalian taste receptor protein (TAS2R);  InterPro: IPR007960 This family consists of several forms of mammalian taste receptor proteins (TAS2Rs). TAS2Rs are G protein-coupled receptors expressed in subsets of taste receptor cells of the tongue and palate epithelia and are organised in the genome in clusters. The proteins are genetically linked to loci that influence bitter perception in mice and humans [].; GO: 0004930 G-protein coupled receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0050909 sensory perception of taste, 0016021 integral to membrane
Probab=99.04  E-value=6.5e-09  Score=79.28  Aligned_cols=123  Identities=16%  Similarity=0.255  Sum_probs=93.2

Q ss_pred             hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHHHHHHHHHHHhhHH
Q psy3982          27 SYLIFYGFFVYFVPLILIIYAYYFIVRAVSVHEKQMREQAKKMNVATLRSGDQSGTSAEIKLAKVALMTISLWFLAWTPY  106 (185)
Q Consensus        27 ~y~~~~~~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~f~i~~~P~  106 (185)
                      .+......+..++|+++++.+...+...++||.|+.+....+.        +..+.+.+.|+.|++....+.|+++++-.
T Consensus       179 ~~~~~~~~~~~~lPf~i~l~s~~lli~SL~rH~r~M~~n~~g~--------~~ps~~aH~~a~k~~~sfl~ly~~~~~~~  250 (303)
T PF05296_consen  179 FYFFILFNLGSFLPFLIFLVSSILLIFSLWRHMRRMQKNATGF--------RDPSTEAHIRAIKTMISFLILYIIYFLSL  250 (303)
T ss_pred             HHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHhhCCCCCC--------CCCCcHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3444445588999999999999999999999999987543332        23356789999999988888888777554


Q ss_pred             HHHHHHHhhhcccchHHHHHHHHHHHhhhhhccchhheecchHHHHHHHhhccc
Q psy3982         107 LVINFGGILEMVEINPLITIWGSVFAKANAVYNPIVYAISHPKFRQALDKKFPS  160 (185)
Q Consensus       107 ~v~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~~~~~~~  160 (185)
                      .+........+   ......+...+..+.|..+|++-.+.|+++|+++++.+++
T Consensus       251 ~~~~~~~~~~~---~~~~~~~~~~i~~~yps~hs~iLIlgn~KLr~~~~~il~~  301 (303)
T PF05296_consen  251 ILSFLSFFFPE---NSIWFWVCEIIIALYPSGHSIILILGNPKLRQALLKILWC  301 (303)
T ss_pred             HHHHHHHHhcc---ccHHHHHHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHhh
Confidence            43222222222   2445667788999999999999999999999999998854


No 12 
>PF10320 7TM_GPCR_Srsx:  Serpentine type 7TM GPCR chemoreceptor Srsx;  InterPro: IPR019424 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class sx (Srsx), which is a solo family amongst the superfamilies of chemoreceptors. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. 
Probab=98.86  E-value=4.7e-08  Score=72.94  Aligned_cols=128  Identities=14%  Similarity=0.259  Sum_probs=87.2

Q ss_pred             ccCCcccccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCchHHH
Q psy3982           7 PEGNLTACGTDYLTKDWHHRSYLIFYGFFVYFVPLILIIYAYYFIVRAVSVHEKQMREQAKKMNVATLRSGDQSGTSAEI   86 (185)
Q Consensus         7 ~~~~~~~C~~~~~~~~~~~~~y~~~~~~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~   86 (185)
                      +++....|.+...-++.   ++....... .++ .+++++.|...+..+++++++                   ..+.++
T Consensus       128 ~~~~~~~C~pp~a~~~~---~~~~~~~~~-~~i-nv~tvivY~i~~~~~~~k~~~-------------------~~~~~~  183 (257)
T PF10320_consen  128 DDETIVICNPPLAFHGT---ASQIWSYSN-III-NVITVIVYIITIIIFKRKSRS-------------------NSSRSK  183 (257)
T ss_pred             CCcccccCCCccccCcc---HHHHHHHHH-HHH-HHHHHHHHHHHHHHHHHHccc-------------------cchhHH
Confidence            44457789775443332   333222222 222 244456666666666654222                   346688


Q ss_pred             HHHHHHHHHHHHHHHHhhHHHHHHHHHhhhccc--chHHHHHHHHHHHhhhhhccchhheecchHHHHHHHhhc
Q psy3982          87 KLAKVALMTISLWFLAWTPYLVINFGGILEMVE--INPLITIWGSVFAKANAVYNPIVYAISHPKFRQALDKKF  158 (185)
Q Consensus        87 ~~~~~~~~i~~~f~i~~~P~~v~~~~~~~~~~~--~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~~~~~  158 (185)
                      |+.|.+...++.|++.|.-..+...+.......  ....+.....+++..+.+.|.++|.++|+|||+++++++
T Consensus       184 kv~ksL~v~v~i~i~~w~~s~~~~~v~~~~~~~~~~~~~i~~~~~i~v~~~~s~~ffV~~~~S~EYR~af~~~~  257 (257)
T PF10320_consen  184 KVFKSLKVTVIIFIFSWFLSQIINTVSLALGLDGETIAIIQMYAGIFVNISYSQNFFVYYWRSSEYRKAFRELF  257 (257)
T ss_pred             HHHHHhhhheeeeeHHHHHHHHHHHHHHHhCCcHHHHHHHHHHHHHHHHHHHHHHheEEEEcCHHHHHHHHHhC
Confidence            999999999999999998887776654433332  445577778899999999999999999999999999864


No 13 
>PF10323 7TM_GPCR_Srv:  Serpentine type 7TM GPCR chemoreceptor Srv;  InterPro: IPR019426 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae.  This entry represents serpentine receptor class v (Srv) from the Srg superfamily [, ]. Srg receptors contain seven hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures. 
Probab=98.65  E-value=1.5e-06  Score=65.81  Aligned_cols=116  Identities=15%  Similarity=0.220  Sum_probs=77.9

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHHHHHHHHHHHhhHHHHHHH
Q psy3982          32 YGFFVYFVPLILIIYAYYFIVRAVSVHEKQMREQAKKMNVATLRSGDQSGTSAEIKLAKVALMTISLWFLAWTPYLVINF  111 (185)
Q Consensus        32 ~~~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~f~i~~~P~~v~~~  111 (185)
                      .......+.+++.+++|..+...+|++.+...++            ..+..++|.+.+.+..+...++++...=+.....
T Consensus       167 ~~~~~~~~~cv~~iv~Y~~i~~~iRk~~k~~s~~------------~s~~~~rE~~L~~~~~i~~~a~~~~~~~~~~~~~  234 (283)
T PF10323_consen  167 IAFIFVSVTCVICIVCYGIIFIFIRKRNKKKSKS------------SSRSRRREIRLAIQVFILFCAFFVILVYYIFSNY  234 (283)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHh------------hhhhhhHhHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3344455667888999999999999776411100            2335577888888877777666655544433333


Q ss_pred             HHhhhcccc-hHHHHHHHHHHHhhhhhccchhheecchHHHHHHHhhccc
Q psy3982         112 GGILEMVEI-NPLITIWGSVFAKANAVYNPIVYAISHPKFRQALDKKFPS  160 (185)
Q Consensus       112 ~~~~~~~~~-~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~~~~~~~  160 (185)
                       .....+.. ...+..+-..+..+.|.+||..-.+.|+++||.+++.++|
T Consensus       235 -~~~~~~~~~~~~~r~~y~~~~~~~s~inP~~LLi~n~~lr~~~~~~~~~  283 (283)
T PF10323_consen  235 -FAQNFNTDPIFYLRAFYPILNGLLSFINPWMLLIFNKDLRKQVRRMLKC  283 (283)
T ss_pred             -HHhcccchHHHHHHHHHHHHHHHHHhhhhHHhhhccHHHHHHHHHHcCC
Confidence             23222222 3344456777888889999999999999999999998864


No 14 
>PF10321 7TM_GPCR_Srt:  Serpentine type 7TM GPCR chemoreceptor Srt;  InterPro: IPR019425  Chemoreception is mediated in Caenorhabditis elegans by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs) of proteins which are of the serpentine type []. Srt is a member of the Srg superfamily of chemoreceptors. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. 
Probab=98.28  E-value=9.6e-06  Score=62.06  Aligned_cols=113  Identities=15%  Similarity=0.183  Sum_probs=79.4

Q ss_pred             hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHHHHHHHHHHHhhHH
Q psy3982          27 SYLIFYGFFVYFVPLILIIYAYYFIVRAVSVHEKQMREQAKKMNVATLRSGDQSGTSAEIKLAKVALMTISLWFLAWTPY  106 (185)
Q Consensus        27 ~y~~~~~~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~f~i~~~P~  106 (185)
                      .|.-......-++-.+.+..+|+.+...++++.+...              +.+..+.+++++....++.+.+.+.++-|
T Consensus       194 ~Y~N~~~~~nN~~~~~~t~~lY~~l~~~l~~k~~~~~--------------s~~~~k~~~qI~iQs~iIC~f~~i~a~iy  259 (313)
T PF10321_consen  194 LYSNIPHTFNNIIVVICTTFLYIYLCYYLIKKSKYST--------------SSQLSKAQRQIFIQSVIICFFHAIAAVIY  259 (313)
T ss_pred             chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccc--------------chhhhhhhHHHHHHHHHHHHHHHHHHHHH
Confidence            3555555555556578888899998888887654432              23344556677766666766666665544


Q ss_pred             HHHHHHHhhhcccchHHHHHHHHHHHhhhhhccchhheecchHHHHHHHhhcc
Q psy3982         107 LVINFGGILEMVEINPLITIWGSVFAKANAVYNPIVYAISHPKFRQALDKKFP  159 (185)
Q Consensus       107 ~v~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~~~~~~  159 (185)
                      ....++      ..+.++..+..+.-.+++..+|+||...|+..|+++++++.
T Consensus       260 v~m~f~------~~p~~~i~~~~~~Wql~~g~~~iIYl~lNrtIR~~~~k~~~  306 (313)
T PF10321_consen  260 VYMQFF------PPPPWLIIIGQISWQLSHGCPPIIYLTLNRTIRNSVLKMLG  306 (313)
T ss_pred             Heeeee------cccHHHHHHHHHHHhccCCccceEEEEECHHHHHHHHHHHc
Confidence            333332      13466777777888899999999999999999999999884


No 15 
>PF03402 V1R:  Vomeronasal organ pheromone receptor family, V1R;  InterPro: IPR004072 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The rhodopsin-like GPCRs themselves represent a widespread protein family that includes hormone, neurotransmitter and light receptors, all of which transduce extracellular signals through interaction with guanine nucleotide-binding (G) proteins. Although their activating ligands vary widely in structure and character, the amino acid sequences of the receptors are very similar and are believed to adopt a common structural framework comprising 7 transmembrane (TM) helices [, , ]. Pheromones have evolved in all animal phyla, to signal sex and dominance status, and are responsible for stereotypical social and sexual behaviour among members of the same species. In mammals, these chemical signals are believed to be detected primarily by the vomeronasal organ (VNO), a chemosensory organ located at the base of the nasal septum []. The VNO is present in most amphibia, reptiles and non-primate mammals but is absent in birds, adult catarrhine monkeys and apes []. An active role for the human VNO in the detection of pheromones is disputed; the VNO is clearly present in the foetus but appears to be atrophied or absent in adults. Three distinct families of putative pheromone receptors have been identified in the vomeronasal organ (V1Rs, V2Rs and V3Rs). All are G protein-coupled receptors but are only distantly related to the receptors of the main olfactory system, highlighting their different role []. The V1 receptors share between 50 and 90% sequence identity but have little similarity to other families of G protein-coupled receptors. They appear to be distantly related to the mammalian T2R bitter taste receptors and the rhodopsin-like GPCRs []. In rat, the family comprises 30-40 genes. These are expressed in the apical regions of the VNO, in neurons expressing Gi2. Coupling of the receptors to this protein mediates inositol trisphosphate signalling []. A number of human V1 receptor homologues have also been found. The majority of these human sequences are pseudogenes [] but an apparently functional receptor has been identified that is expressed in the human olfactory system [].; GO: 0016503 pheromone receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane
Probab=98.06  E-value=3.3e-06  Score=62.82  Aligned_cols=111  Identities=15%  Similarity=0.121  Sum_probs=85.2

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHHHHHHHHHHHhhHHHHHHHHH
Q psy3982          34 FFVYFVPLILIIYAYYFIVRAVSVHEKQMREQAKKMNVATLRSGDQSGTSAEIKLAKVALMTISLWFLAWTPYLVINFGG  113 (185)
Q Consensus        34 ~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~f~i~~~P~~v~~~~~  113 (185)
                      .+.-++.+.+|+++.+.++..+.||+++.+......        ...+...|.|+++++++++..|+.++....++.++.
T Consensus       153 ~~~D~~fvgLM~~aSgyMV~~L~RH~q~VqhIhs~~--------~s~r~SpE~RAtktILlLVs~FV~fY~l~si~~~~~  224 (265)
T PF03402_consen  153 SFRDVLFVGLMIWASGYMVFLLYRHHQQVQHIHSSS--------LSPRSSPETRATKTILLLVSTFVSFYGLSSILFIYL  224 (265)
T ss_pred             HhhHHHHHHHHHhhhhhheeeeeecccccccCcCCC--------CCCCCChhHHHhCeEeeHHHHHHHHHhHHHHHHHHH
Confidence            444577788999999999999999999987543211        223456799999999999999999999988776544


Q ss_pred             hhhcccchHHHHHHHHHHHhhhhhccchhheecchHHHHHH
Q psy3982         114 ILEMVEINPLITIWGSVFAKANAVYNPIVYAISHPKFRQAL  154 (185)
Q Consensus       114 ~~~~~~~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~  154 (185)
                      .....  ..++.....+++..-|.+-|++....++.+.+.+
T Consensus       225 ~~~~~--~~~~~~~~~~ls~cfptisPfvLI~~d~~i~~~~  263 (265)
T PF03402_consen  225 TSFKN--SPWLLNISVFLSSCFPTISPFVLISSDKRIIKFL  263 (265)
T ss_pred             HHhcC--CcceeEHHHHHhHHhHhhChHHhhccCchHHHHh
Confidence            43332  2334457788999999999999999998887754


No 16 
>PF10317 7TM_GPCR_Srd:  Serpentine type 7TM GPCR chemoreceptor Srd;  InterPro: IPR019421 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents the chemoreceptor Srd []. 
Probab=98.00  E-value=0.00022  Score=54.25  Aligned_cols=77  Identities=16%  Similarity=0.195  Sum_probs=64.1

Q ss_pred             CCCchHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhhhcccchHHHHHHHHHHHhhhhhccchhheecchHHHHHHHh
Q psy3982          79 QSGTSAEIKLAKVALMTISLWFLAWTPYLVINFGGILEMVEINPLITIWGSVFAKANAVYNPIVYAISHPKFRQALDK  156 (185)
Q Consensus        79 ~~~~~~~~~~~~~~~~i~~~f~i~~~P~~v~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~~~  156 (185)
                      ++.++.+++..|.+..-+..=+++..|.....++...... ......++...+..+.+.+||+++...=+.||+.++|
T Consensus       216 ~~tk~~h~~lv~~Lt~Q~~lP~~~~~p~~~~~~~~~~~~~-~~~~~e~~~~~~~~~~~~~~P~itl~fv~PYR~~i~r  292 (292)
T PF10317_consen  216 ERTKSMHRQLVKGLTIQALLPLFFYIPGVIIYFLSQFTGY-EHPFLEYLIFMLASLPPLIDPLITLYFVRPYRKAILR  292 (292)
T ss_pred             HHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHhcc-ccHHHHHHHHHHHHHHHHhchHhheeeeHhHHHHhcC
Confidence            3345678899999999999999999988888887776665 3466777777788899999999999999999999864


No 17 
>PF10327 7TM_GPCR_Sri:  Serpentine type 7TM GPCR chemoreceptor Sri;  InterPro: IPR019429 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents Sri, which is part of the Str superfamily of chemoreceptors.
Probab=97.97  E-value=0.00018  Score=55.13  Aligned_cols=108  Identities=15%  Similarity=0.209  Sum_probs=84.8

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHHHHHHHHHHHhhHHHHH
Q psy3982          30 IFYGFFVYFVPLILIIYAYYFIVRAVSVHEKQMREQAKKMNVATLRSGDQSGTSAEIKLAKVALMTISLWFLAWTPYLVI  109 (185)
Q Consensus        30 ~~~~~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~f~i~~~P~~v~  109 (185)
                      .....++.++-.++..+++..+++.+++.+++..               ++..++++++.+.+++-..+-.+|.+|..+.
T Consensus       195 ~~~~~~~~~~~~~~~~~~~~~m~~~L~~~k~~iS---------------~~ty~kHk~av~SLi~Q~~~~~i~~~P~~~~  259 (303)
T PF10327_consen  195 FILAFFGGFLCFVIFIFLTIDMFRMLKKLKKKIS---------------KQTYQKHKEAVRSLIAQFATSSICILPPFIF  259 (303)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC---------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence            3334444556677788888999999988777663               3356888999999998888888999999887


Q ss_pred             HHHHhhhcccchHHHHHHHHHHHhhhhhccchhheecchHHHHH
Q psy3982         110 NFGGILEMVEINPLITIWGSVFAKANAVYNPIVYAISHPKFRQA  153 (185)
Q Consensus       110 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~  153 (185)
                      .+......+ ..+.+..........+|.+|-++-.+.++.||+-
T Consensus       260 ~~~~~~~~~-~~q~i~~~~~~~f~~HS~~n~ivli~t~ppYR~f  302 (303)
T PF10327_consen  260 VVVVIFEFE-DAQVISEICLAIFSSHSSVNMIVLIITTPPYRKF  302 (303)
T ss_pred             hhhheecCC-CcHHHHHHHHHHHHHhhHhhheeeeEcCcchhhc
Confidence            776555443 4567778888889999999999999999999984


No 18 
>PF10328 7TM_GPCR_Srx:  Serpentine type 7TM GPCR chemoreceptor Srx;  InterPro: IPR019430 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class x (Srx) from the Srg superfamily [, ]. Srg receptors contain seven hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures. 
Probab=97.87  E-value=0.00064  Score=51.24  Aligned_cols=111  Identities=13%  Similarity=0.111  Sum_probs=75.0

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHHHHHHHHHHHhhHHHHHHHHH
Q psy3982          34 FFVYFVPLILIIYAYYFIVRAVSVHEKQMREQAKKMNVATLRSGDQSGTSAEIKLAKVALMTISLWFLAWTPYLVINFGG  113 (185)
Q Consensus        34 ~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~f~i~~~P~~v~~~~~  113 (185)
                      ......-.+++..++..++.++++.+++..         ...+..++++++|.+.++..+.-...|++.++-+.+..-+ 
T Consensus       163 ~~~~~~~~~~~~~lni~t~ikl~~~~~~~~---------~~~~~s~~r~rke~~f~~Qs~~Q~~~~~i~~~~~~~~~~~-  232 (274)
T PF10328_consen  163 FYKNFILVIISNILNIITFIKLRKFRKKIS---------VSSSESKKRRRKEIRFFIQSFIQDLLYLIDLIFYFFIPPL-  232 (274)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHhhhhcc---------ccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-
Confidence            333444456777778888878876655541         0111144567789999999999999999999877766443 


Q ss_pred             hhhcccchHHHHHHHHHHHhhhhhccchhheecchHHHHHHHhhc
Q psy3982         114 ILEMVEINPLITIWGSVFAKANAVYNPIVYAISHPKFRQALDKKF  158 (185)
Q Consensus       114 ~~~~~~~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~~~~~  158 (185)
                        .++  .......+.+.-.+..++|++|+...|+|+|+.+++..
T Consensus       233 --~~~--~~~~F~~~t~~w~~~h~~DG~i~l~fN~~~r~~~~~~~  273 (274)
T PF10328_consen  233 --SSN--RWWQFFCTTFSWVLVHALDGLIMLIFNSEIRRKIRKKK  273 (274)
T ss_pred             --ccc--cHHHHHHHHHHHHHHHHhcceeEeEEcHHHHHHHHhcc
Confidence              122  23333344444456678899999999999999988753


No 19 
>PF05462 Dicty_CAR:  Slime mold cyclic AMP receptor
Probab=97.82  E-value=0.00082  Score=51.35  Aligned_cols=65  Identities=12%  Similarity=0.216  Sum_probs=52.5

Q ss_pred             HHHHHHHHHHHhhHHHHHHHHHhhhcccchHHHHHHHHHHHhhhhhccchhheecchHHHHHHHhhc
Q psy3982          92 ALMTISLWFLAWTPYLVINFGGILEMVEINPLITIWGSVFAKANAVYNPIVYAISHPKFRQALDKKF  158 (185)
Q Consensus        92 ~~~i~~~f~i~~~P~~v~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~~~~~  158 (185)
                      +..-.++|++||+|-.+..+....++.  +..+..+...++.+.-.+|.++|++.++-.++.+...+
T Consensus       204 L~~Yp~ifiicw~fa~INRI~~~~~~~--~~~l~~Lh~~~s~lqGf~nsivy~~n~~~~~~~~~~~~  268 (303)
T PF05462_consen  204 LVNYPLIFIICWIFATINRIYNFIGKN--PFWLSVLHVGFSPLQGFFNSIVYGYNNSLMWRYLGSKI  268 (303)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHhcCC--chHHHHHHHHHHHHHHHHHHHHHHhCCHHHHHHHHHHH
Confidence            456889999999999999999887643  34556666677777889999999999999888877665


No 20 
>PF11970 Git3_C:  G protein-coupled glucose receptor regulating Gpa2 C-term;  InterPro: IPR022596 This entry contains a functionally uncharacterised region belonging to the Git3 G-protein coupled receptor. Git3 is one of six proteins required for glucose-triggered adenylate cyclase activation, and is a G protein-coupled receptor responsible for the activation of adenylate cyclase through Gpa2 - heterotrimeric G protein alpha subunit, part of the glucose-detection pathway. Git3 contains seven predicted transmembrane domains, a third cytoplasmic loop and a cytoplasmic tail []. This family is the conserved C-terminal domain of the member proteins. 
Probab=97.67  E-value=0.00025  Score=42.54  Aligned_cols=70  Identities=7%  Similarity=0.196  Sum_probs=55.4

Q ss_pred             chHHHHHHHHHHHHHHHHHHHhh-HHHHHHHHHhhhccc-chHHHHHHHHHHHhhhhhccchhheecchHHH
Q psy3982          82 TSAEIKLAKVALMTISLWFLAWT-PYLVINFGGILEMVE-INPLITIWGSVFAKANAVYNPIVYAISHPKFR  151 (185)
Q Consensus        82 ~~~~~~~~~~~~~i~~~f~i~~~-P~~v~~~~~~~~~~~-~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr  151 (185)
                      +++-+|.++.+++-=++|+++|+ |+..-..-......+ ...++..++.++..+|..+|-++|++..+..|
T Consensus         4 r~~i~r~lr~mfiYP~~Yi~lwlfP~~~~~~~~~~~~~~~p~~~l~~i~~~~~~~~G~VD~lvf~~~erpw~   75 (76)
T PF11970_consen    4 RKRIRRQLRSMFIYPLVYIVLWLFPFAAHRMQYMYEIGHGPSFWLFCIAGFMQPSQGFVDCLVFTLRERPWR   75 (76)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCchHHHHHHHHHHHccCHHHhhheeeecccCC
Confidence            35567889999999999999999 887666555533332 55788889999999999999999999876543


No 21 
>PF10326 7TM_GPCR_Str:  Serpentine type 7TM GPCR chemoreceptor Str;  InterPro: IPR019428 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class r (Str) from the Str superfamily [, ]. Almost a quarter (22.5%) of str and srj family genes and pseudogenes in C. elegans appear to have been newly formed by gene duplications since the species split []. 
Probab=97.29  E-value=6.6e-05  Score=57.48  Aligned_cols=110  Identities=13%  Similarity=0.203  Sum_probs=77.7

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHHH-HHHHHHHHhhHHH
Q psy3982          29 LIFYGFFVYFVPLILIIYAYYFIVRAVSVHEKQMREQAKKMNVATLRSGDQSGTSAEIKLAKVALM-TISLWFLAWTPYL  107 (185)
Q Consensus        29 ~~~~~~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-i~~~f~i~~~P~~  107 (185)
                      ..........+++.++++|-.++++.+++..+..               +.+.++.+++..+.+++ .++..++.++|..
T Consensus       197 ~~~~~~~im~~s~~iii~cg~~~~~~i~~~~~~~---------------S~~~~~lq~QLf~aLv~Qt~iP~i~~~~P~~  261 (307)
T PF10326_consen  197 GMLIMCIIMSISFFIIIYCGIKIYKKIKKLSSIM---------------SSKTRKLQKQLFKALVIQTIIPFIFMYIPVF  261 (307)
T ss_pred             eeeecchhHHhHHHHHHHHHhhhHHHHhcccccc---------------ChhhHHHHHHHHHHHHHHhhhhheeeecchh
Confidence            3444555567888999999999999986422221               23344666777777644 4566778889998


Q ss_pred             HHHHHHhhhcccchHHHHHHHHHHHhhhhhccchhheecchHHHHHHH
Q psy3982         108 VINFGGILEMVEINPLITIWGSVFAKANAVYNPIVYAISHPKFRQALD  155 (185)
Q Consensus       108 v~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~~  155 (185)
                      +.......+-+  ......+...+..+.|++||++-.+.-++||++++
T Consensus       262 ~~~~~p~~~i~--~~~~~~~~~~~~~~yP~iDpl~~i~~ik~yR~~i~  307 (307)
T PF10326_consen  262 IVFILPFFGID--LGFFSNIISILISLYPAIDPLPVIFIIKDYRKAIK  307 (307)
T ss_pred             heeeeeccCCC--CCccccHhhhhEEEEeehhhheeeEeeHHHHHhhC
Confidence            87776655433  13344556678889999999999999999999864


No 22 
>PF10318 7TM_GPCR_Srh:  Serpentine type 7TM GPCR chemoreceptor Srh;  InterPro: IPR019422 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae.  Srh is part of the Str superfamily of chemoreceptors []. 
Probab=97.12  E-value=0.009  Score=45.63  Aligned_cols=79  Identities=13%  Similarity=0.214  Sum_probs=56.1

Q ss_pred             CCchHHHHHHHHHHHHHH-HHHHHhhHHHHHHHHHhhhcccchHHHHHHHHHHHhhhhhccchhheecchHHHHHHHhhc
Q psy3982          80 SGTSAEIKLAKVALMTIS-LWFLAWTPYLVINFGGILEMVEINPLITIWGSVFAKANAVYNPIVYAISHPKFRQALDKKF  158 (185)
Q Consensus        80 ~~~~~~~~~~~~~~~i~~-~f~i~~~P~~v~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~~~~~  158 (185)
                      +..+.++|..+.+.+-++ -+++.-.|.....+....+.  ..+.+..+..++...+..+.-++..+.++.+|+.+++++
T Consensus       223 ~T~k~Qkkfl~~l~iQ~~ip~~~l~~P~~~~~~~~~~~~--~~q~~~n~~~~~~~~HG~~sti~mi~~~~pYR~~~~~~~  300 (302)
T PF10318_consen  223 KTRKMQKKFLIALIIQVLIPFIFLFIPLIYFIISIIFGY--YNQALNNISFIIISLHGIASTIVMILVHKPYRKFLLSLF  300 (302)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhccc--cccccchHHHHHHHhccHHHHHHHhhccHHHHHHHHHHh
Confidence            345667777776655444 44455557766655443332  335566777888899999999999999999999999988


Q ss_pred             cc
Q psy3982         159 PS  160 (185)
Q Consensus       159 ~~  160 (185)
                      +|
T Consensus       301 ~~  302 (302)
T PF10318_consen  301 RC  302 (302)
T ss_pred             cC
Confidence            54


No 23 
>KOG4193|consensus
Probab=96.93  E-value=0.019  Score=48.06  Aligned_cols=67  Identities=19%  Similarity=0.203  Sum_probs=44.9

Q ss_pred             HHHHHHHHHHHHHHHHhhHHHHHHHHHhhhcccchHHHHHHHHHHHhhhhhccchhheecchHHHHHHHhhcc
Q psy3982          87 KLAKVALMTISLWFLAWTPYLVINFGGILEMVEINPLITIWGSVFAKANAVYNPIVYAISHPKFRQALDKKFP  159 (185)
Q Consensus        87 ~~~~~~~~i~~~f~i~~~P~~v~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~~~~~~  159 (185)
                      +..+..+++..++.+.|.    +.+.....+  ....+.+++.++-.+.-++=.++|++++++.|++.++.++
T Consensus       514 ~~~~~~l~L~~lLGlTW~----fgi~s~~~~--~~~v~~YlFti~NalQG~fIFi~~cll~~kvr~~~~k~~~  580 (610)
T KOG4193|consen  514 SLIRSALALLFLLGLTWI----FGIFSWLPG--TSVVFAYLFTIFNALQGVFIFIFHCLLRKKVRKEYRKWLC  580 (610)
T ss_pred             HHHHHHHHHHHHHHHHHH----HHHHHHhcc--cchHHHHHHHHHHHhhhhHhhHhhhhhhHHHHHHHHHHhc
Confidence            667788888888888884    333332221  2344445555555555555678899999999999999886


No 24 
>PF10319 7TM_GPCR_Srj:  Serpentine type 7TM GPCR chemoreceptor Srj;  InterPro: IPR019423 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae.  This entry represents serpentine receptor class j (Srj) from the Str superfamily [, ]. The Srj family is designated as the out-group based on its location in preliminary phylogenetic analyses of the entire superfamily []. 
Probab=96.64  E-value=0.04  Score=42.07  Aligned_cols=74  Identities=14%  Similarity=0.294  Sum_probs=56.0

Q ss_pred             CCCchHHHHHHHHHHHH-HHHHHHHhhHHHHHHHHHhhhcccchHHHHHHHHHHHhhhhhccchhheecchHHHHH
Q psy3982          79 QSGTSAEIKLAKVALMT-ISLWFLAWTPYLVINFGGILEMVEINPLITIWGSVFAKANAVYNPIVYAISHPKFRQA  153 (185)
Q Consensus        79 ~~~~~~~~~~~~~~~~i-~~~f~i~~~P~~v~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~  153 (185)
                      ++..+.+++..|.+..- +..-++++.|-.+......++-+ ...++.++......+-|.+||+.-.++-+.||+.
T Consensus       235 ~~T~~lq~qL~~AL~vQT~IPi~vsf~Pc~~~wy~pif~i~-~~~~~n~~~~iAls~FPf~DPlAii~~lP~~R~r  309 (310)
T PF10319_consen  235 KKTKRLQRQLFKALIVQTVIPICVSFSPCVLSWYGPIFGID-LGRWNNYFSVIALSAFPFLDPLAIILCLPAFRNR  309 (310)
T ss_pred             HhHHHHHHHHHHHHHHHHHhHHHHhhccHHHHHhHHHHcCC-hhHHHHHHHHHHHHHccccCchHhheecHHhhcc
Confidence            33456667777776554 44568889999888765555533 5677888888888899999999999999999975


No 25 
>PF02101 Ocular_alb:  Ocular albinism type 1 protein;  InterPro: IPR001414 Ocular albinism type 1 (OA1) is an X-linked disorder characterised by severe impairment of visual acuity, retinal hypopigmentation and the presence of macromelanosomes. A novel transcript from the OA1 critical region is expressed in high levels in RNA samples from retina and from melanoma and encodes a potential integral membrane protein []. This protein is of unknown function but is known to bind heterotrimeric G proteins.; GO: 0016020 membrane
Probab=96.59  E-value=0.023  Score=44.33  Aligned_cols=24  Identities=8%  Similarity=0.310  Sum_probs=19.3

Q ss_pred             HHHHHHHHHHHHhhHHHHHHHHHh
Q psy3982          91 VALMTISLWFLAWTPYLVINFGGI  114 (185)
Q Consensus        91 ~~~~i~~~f~i~~~P~~v~~~~~~  114 (185)
                      -.+.|.++|++||+|..+-.++.-
T Consensus       245 kFf~I~lVF~iCWlpNIINg~LL~  268 (405)
T PF02101_consen  245 KFFKIMLVFYICWLPNIINGSLLF  268 (405)
T ss_pred             HHHHHHHHHHHHhhhhhHhHHHHH
Confidence            356789999999999998776543


No 26 
>PF03125 Sre:  C. elegans Sre G protein-coupled chemoreceptor;  InterPro: IPR004151 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class e (Sre) from the Sra superfamily []. ; GO: 0004888 transmembrane signaling receptor activity, 0007606 sensory perception of chemical stimulus, 0016021 integral to membrane
Probab=96.20  E-value=0.14  Score=40.45  Aligned_cols=44  Identities=18%  Similarity=0.344  Sum_probs=34.7

Q ss_pred             chHHHHHHHHHHHhhhhhccchhheecchHHHHHHHhhcccccc
Q psy3982         120 INPLITIWGSVFAKANAVYNPIVYAISHPKFRQALDKKFPSLVC  163 (185)
Q Consensus       120 ~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~~~~~~~~~~  163 (185)
                      .......+......+++.+=|.+-....+..|+++++.+....+
T Consensus       281 ~~~~~~~~~e~~i~l~~~~i~~~~i~s~~~wrk~f~~~~~~~~~  324 (365)
T PF03125_consen  281 YRTILNHIFENCIFLNPIFICPVIIFSVPSWRKEFKRSFPKIRC  324 (365)
T ss_pred             HHHHHHHHHHHHHHHhhHHHHHHHHHcCHHHHHHHHHhhhhhhc
Confidence            34566677778888888888888888999999999988765443


No 27 
>PF02118 Srg:  Srg family chemoreceptor;  InterPro: IPR000609 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class g (Srg) from the Srg superfamily [, ]. Srg receptors contain seven hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures. ; GO: 0004888 transmembrane signaling receptor activity, 0007606 sensory perception of chemical stimulus, 0016020 membrane
Probab=96.17  E-value=0.21  Score=37.45  Aligned_cols=98  Identities=13%  Similarity=0.183  Sum_probs=52.8

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhhh
Q psy3982          37 YFVPLILIIYAYYFIVRAVSVHEKQMREQAKKMNVATLRSGDQSGTSAEIKLAKVALMTISLWFLAWTPYLVINFGGILE  116 (185)
Q Consensus        37 ~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~f~i~~~P~~v~~~~~~~~  116 (185)
                      +++-.++.+.++....+++++.+++                   ..+.|++..+..+.....+.+.-+...+........
T Consensus       177 ~i~~~ii~i~~~~~~~~~l~~~~~~-------------------~~~~er~L~~is~~~~~~~~~~~~~~~~~~~~~~~~  237 (275)
T PF02118_consen  177 FIIIIIITIITNIITYRRLRKLSKR-------------------IKSVERNLTIISFIISFVQLLIAIWQIINSFAFFFD  237 (275)
T ss_pred             HHHHHHHHHHHHHHHHHHHHhhhhh-------------------hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence            3344455566666666666654322                   224567777776766666665554444422211111


Q ss_pred             cccchHHHHHHHHHHHhhhhhccchhheecchHHHHH
Q psy3982         117 MVEINPLITIWGSVFAKANAVYNPIVYAISHPKFRQA  153 (185)
Q Consensus       117 ~~~~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~  153 (185)
                      ..+.......+..+..=+-....|++-.+.+++.|+.
T Consensus       238 ~~~~~~~~~~~~~f~sD~ltl~~P~iLl~fs~~vR~~  274 (275)
T PF02118_consen  238 FLPISSFLYYLLPFASDLLTLSQPYILLIFSKNVRRQ  274 (275)
T ss_pred             cccHHHHHHHHHHHHHHHHHHHHHHHHheECHhhhcc
Confidence            1111222333333333333455699999999999985


No 28 
>KOG4564|consensus
Probab=95.59  E-value=0.086  Score=42.65  Aligned_cols=100  Identities=15%  Similarity=0.229  Sum_probs=61.9

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhhhcc
Q psy3982          39 VPLILIIYAYYFIVRAVSVHEKQMREQAKKMNVATLRSGDQSGTSAEIKLAKVALMTISLWFLAWTPYLVINFGGILEMV  118 (185)
Q Consensus        39 ~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~f~i~~~P~~v~~~~~~~~~~  118 (185)
                      +.+++-++..+.|.+.+-++-+..               +.......+|++|..+.++=.|.+-++.       ..+..+
T Consensus       318 ~ai~vNf~flinIvrILv~KLr~~---------------~~~~~~~y~K~vKaTLvLIPLfGI~~il-------f~~~P~  375 (473)
T KOG4564|consen  318 LAILVNFIFLINIVRILVTKLRAS---------------NASETDQYRKLVKATLVLIPLFGIHYIL-------FAFRPD  375 (473)
T ss_pred             HHHHHHHHHHHHHHHHHHHHhcCc---------------cccchHHHHHHHHHHHHHHHHcCCeeEE-------EEecCc
Confidence            334444555566666665432221               2333444788888888877666644322       112221


Q ss_pred             c--chHHHHHHHHHHHhhhhhccchhheecchHHHHHHHhhccc
Q psy3982         119 E--INPLITIWGSVFAKANAVYNPIVYAISHPKFRQALDKKFPS  160 (185)
Q Consensus       119 ~--~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~~~~~~~  160 (185)
                      +  .......+..+|....-.+=-++|++.|++.|.++++.+.+
T Consensus       376 ~~~~~~v~~~~~~~L~SfQGf~VAvlYCFlN~EVq~elrr~W~r  419 (473)
T KOG4564|consen  376 EDTLREVYLYFELFLGSFQGFFVAVLYCFLNGEVQAELRRKWSR  419 (473)
T ss_pred             hHHHHHHHHHHHHHHHhccchheehheeecCHHHHHHHHHHHHh
Confidence            1  23444555666777777888999999999999999998754


No 29 
>PF04789 DUF621:  Protein of unknown function (DUF621);  InterPro: IPR006874 This is a conserved region found in uncharacterised proteins from Caenorhabditis elegans, and is noted to have possible G-protein-coupled receptor-like activity.
Probab=92.03  E-value=4.2  Score=30.77  Aligned_cols=112  Identities=15%  Similarity=0.187  Sum_probs=67.9

Q ss_pred             cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHHHHHHHHHHHhh
Q psy3982          25 HRSYLIFYGFFVYFVPLILIIYAYYFIVRAVSVHEKQMREQAKKMNVATLRSGDQSGTSAEIKLAKVALMTISLWFLAWT  104 (185)
Q Consensus        25 ~~~y~~~~~~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~f~i~~~  104 (185)
                      .....+.-.++.+.+|.++. ..|+.+..++-..+                  ....++.|.+++|.-..+-+.|-   .
T Consensus       184 P~~IqiiE~ilQ~~IPi~Il-~iYiAIIiKI~~MK------------------kssLnK~Ei~ILKQAifIFvlFQ---~  241 (305)
T PF04789_consen  184 PRGIQIIEIILQFGIPIFIL-VIYIAIIIKIIKMK------------------KSSLNKNEITILKQAIFIFVLFQ---A  241 (305)
T ss_pred             CCchhHHHHHHHHhHHHHHH-HHHHHHHHHHHHHh------------------hccCCchhHHHHHHHHHHHHHHH---H
Confidence            34556777888899996554 45555544443211                  12244667788777665544444   4


Q ss_pred             HHHHHHHHHhhhccc-chHHHHHHHHHHHhhhhhccchhheecchHHHHHHHhhc
Q psy3982         105 PYLVINFGGILEMVE-INPLITIWGSVFAKANAVYNPIVYAISHPKFRQALDKKF  158 (185)
Q Consensus       105 P~~v~~~~~~~~~~~-~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~~~~~  158 (185)
                      +..+..+-....-+. .-..+..+...+-..--+.-|....+.++|.||-+-...
T Consensus       242 ss~VFl~~qt~~~~~~tAFlIKR~iNT~ei~agaatP~ffffTs~eirk~~s~kv  296 (305)
T PF04789_consen  242 SSCVFLLCQTFKFNVATAFLIKRIINTMEIFAGAATPCFFFFTSKEIRKLVSSKV  296 (305)
T ss_pred             HHHHHHhheeeeecchHHHHHHHHHHHHHHHhhccCCeEEEEehHHHHHHHHhhc
Confidence            443333322222111 334566666667777788899999999999999766544


No 30 
>PF10292 7TM_GPCR_Srab:  Serpentine type 7TM GPCR receptor class ab chemoreceptor;  InterPro: IPR019408 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. Srab is part of the Sra superfamily of chemoreceptors. The expression pattern of the srab genes is biologically intriguing. Of the six promoters successfully expressed in transgenic organisms, one was exclusively expressed in the tail phasmid neurons, two were exclusively expressed in a head amphid neuron, and two were expressed both in the head and tail neurons as well as a limited number of other cells []. 
Probab=85.64  E-value=15  Score=28.50  Aligned_cols=78  Identities=10%  Similarity=-0.028  Sum_probs=49.3

Q ss_pred             CCchHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhhhccc----chHHHHHHHHHHHhhhhhccchhheecchHHHHHHH
Q psy3982          80 SGTSAEIKLAKVALMTISLWFLAWTPYLVINFGGILEMVE----INPLITIWGSVFAKANAVYNPIVYAISHPKFRQALD  155 (185)
Q Consensus        80 ~~~~~~~~~~~~~~~i~~~f~i~~~P~~v~~~~~~~~~~~----~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~~~~  155 (185)
                      -+..+..|+.+++..++....+.-+.+.+..+.......+    ....+.. .....-..+.+=|++.....++.|++.+
T Consensus       223 YQl~ENl~slr~L~p~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~y~al~e-~~~~~P~Y~ii~~~~~~~~~k~~r~~~~  301 (324)
T PF10292_consen  223 YQLEENLRSLRLLKPFIILSSIFIFFYIFASIFLRLFNPSMSKPNYFALAE-LNHIFPLYSIISPLILYRKIKKIRKKRK  301 (324)
T ss_pred             HhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCChhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3445567888888777777777666666665554433332    2222222 2234455778889998888998888877


Q ss_pred             hhc
Q psy3982         156 KKF  158 (185)
Q Consensus       156 ~~~  158 (185)
                      +.+
T Consensus       302 ~~l  304 (324)
T PF10292_consen  302 KRL  304 (324)
T ss_pred             HHH
Confidence            655


No 31 
>PF02076 STE3:  Pheromone A receptor;  InterPro: IPR001499 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  Little is known about the structure and function of the mating factor receptors, STE2 and STE3. It is believed, however, that they are integral membrane proteins that may be involved in the response to mating factors on the cell membrane [, , ]. The amino acid sequences of both receptors contain high proportions of hydrophobic residues grouped into 7 domains, in a manner reminiscent of the rhodopsins and other receptors believed to interact with G-proteins. However, while a similar 3D framework has been proposed to account for this, there is no significant sequence similarity either between STE2 and STE3, or between these and the rhodopsin-type family: the receptors thus bear their own unique '7TM' signatures. The STE3 gene of Saccharomyces cerevisiae (Baker's yeast) is the cell-surface receptor that binds the 13-residue lipopeptide a-factor. Several related fungal pheromone receptor sequences are known: these include pheromone B alpha 1 and B alpha 3, and pheromone B beta 1 receptors from Schizophyllum commune; pheromone receptor 1 from Ustilago hordei; and pheromone receptors 1 and 2 from Ustilago maydis. Members of the family share about 20% sequence identity.; GO: 0004932 mating-type factor pheromone receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane
Probab=82.02  E-value=21  Score=27.32  Aligned_cols=25  Identities=12%  Similarity=0.204  Sum_probs=20.1

Q ss_pred             HHHHHHHHHHHHHHHHhhHHHHHHHH
Q psy3982          87 KLAKVALMTISLWFLAWTPYLVINFG  112 (185)
Q Consensus        87 ~~~~~~~~i~~~f~i~~~P~~v~~~~  112 (185)
                      |..| ++++.++.+++.+|..+..++
T Consensus       196 Rf~R-L~~la~~~i~~~~P~~i~~~v  220 (283)
T PF02076_consen  196 RFFR-LLALALVIILIYLPLSIYYLV  220 (283)
T ss_pred             HHHH-HHHHHHHHHHHHHHHHHHHHH
Confidence            4444 677888889999999998888


No 32 
>PF10322 7TM_GPCR_Sru:  Serpentine type 7TM GPCR chemoreceptor Sru;  InterPro: IPR003839 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class u (Sru) from the Srg superfamily []. 
Probab=81.48  E-value=13  Score=28.78  Aligned_cols=104  Identities=13%  Similarity=0.184  Sum_probs=54.0

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHHHHHHHHHHHhhHHHHHHHHH
Q psy3982          34 FFVYFVPLILIIYAYYFIVRAVSVHEKQMREQAKKMNVATLRSGDQSGTSAEIKLAKVALMTISLWFLAWTPYLVINFGG  113 (185)
Q Consensus        34 ~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~f~i~~~P~~v~~~~~  113 (185)
                      +.....-++.++.++..++.++|+.+.+...          .+.+.+..+.|+..+.+++++++.++.-    .+..+..
T Consensus       201 l~~s~~~~~~iii~N~lm~~Klr~~k~~l~~----------~~~s~~~~KAE~SLT~Tmi~mliP~i~n----~i~~i~~  266 (307)
T PF10322_consen  201 LFFSIFWMISIIILNILMFFKLRKLKSSLSS----------QRRSKQSRKAERSLTLTMISMLIPYITN----GIISISF  266 (307)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHhhcccc----------hhhcccchhhheeHHHHHHHHHHHHHHH----HHHHHHH
Confidence            3334444577777888888888876521110          1113445667777777777777666532    2222222


Q ss_pred             hhhcccchHHHHHHHHHHHhhhhhccchhheecchHHHH
Q psy3982         114 ILEMVEINPLITIWGSVFAKANAVYNPIVYAISHPKFRQ  152 (185)
Q Consensus       114 ~~~~~~~~~~~~~~~~~l~~~~~~~nPiiY~~~~~~fr~  152 (185)
                      .... .....+..+-.+..=+..++=|.++.+.++=|||
T Consensus       267 ~~~~-~~~~y~~~iRp~~~D~~~~iv~w~FY~THP~Fkk  304 (307)
T PF10322_consen  267 LFFP-SYSSYILLIRPFGNDCRTCIVPWYFYLTHPIFKK  304 (307)
T ss_pred             hcCc-hHHHHHHHHhhhhhccceeEEEEEeeecChhhcC
Confidence            2211 1112222222222223344458888888887776


No 33 
>PF13853 7tm_4:  Olfactory receptor
Probab=71.89  E-value=0.0045  Score=42.07  Aligned_cols=69  Identities=17%  Similarity=0.256  Sum_probs=47.8

Q ss_pred             cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCchHHHHHHHHHHHHHHHHHHH
Q psy3982          23 WHHRSYLIFYGFFVYFVPLILIIYAYYFIVRAVSVHEKQMREQAKKMNVATLRSGDQSGTSAEIKLAKVALMTISLWFLA  102 (185)
Q Consensus        23 ~~~~~y~~~~~~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~f~i~  102 (185)
                      ..+..+......+..+.|+.+++++|++|...+.|.                     ...+.++|++.+......+..+.
T Consensus        52 ~~~~~~~~~~~~~~~~~~~~~Il~SY~~Il~avlki---------------------~S~~~r~KAfsTC~sHL~vv~lf  110 (144)
T PF13853_consen   52 SINEIVGFVVAIFILLGPLLLILFSYIRILRAVLKI---------------------PSAEGRSKAFSTCSSHLTVVSLF  110 (144)
T ss_pred             hhhheeeecccceeEEEEeeccccceeEEEehhhcc---------------------cccccccccccccccCceEEEEe
Confidence            334455566666677899999999999999988742                     22344568888877777777776


Q ss_pred             hhHHHHHHHH
Q psy3982         103 WTPYLVINFG  112 (185)
Q Consensus       103 ~~P~~v~~~~  112 (185)
                      +.|..+..+.
T Consensus       111 y~~~~~~y~~  120 (144)
T PF13853_consen  111 YGTVIFMYLS  120 (144)
T ss_pred             eeeeEEEEec
Confidence            6666554443


No 34 
>KOG0721|consensus
Probab=70.39  E-value=40  Score=24.70  Aligned_cols=16  Identities=13%  Similarity=0.115  Sum_probs=11.0

Q ss_pred             eecchHHHHHHHhhcc
Q psy3982         144 AISHPKFRQALDKKFP  159 (185)
Q Consensus       144 ~~~~~~fr~~~~~~~~  159 (185)
                      +-..+++|++.+++..
T Consensus       110 gas~~eIKkaYR~LSi  125 (230)
T KOG0721|consen  110 GASEKEIKKAYRRLSI  125 (230)
T ss_pred             CCCHHHHHHHHHHhhh
Confidence            4456778888777653


No 35 
>KOG2575|consensus
Probab=66.27  E-value=11  Score=30.33  Aligned_cols=32  Identities=19%  Similarity=0.472  Sum_probs=25.6

Q ss_pred             CCCchHHHHHHHHHHHHHHHHHHHhhHHHHHH
Q psy3982          79 QSGTSAEIKLAKVALMTISLWFLAWTPYLVIN  110 (185)
Q Consensus        79 ~~~~~~~~~~~~~~~~i~~~f~i~~~P~~v~~  110 (185)
                      ++..+.-.+.++.-+++++.|+++|+|+....
T Consensus       248 ~k~~~~f~ri~~ia~~Vv~TF~iiw~P~~~~~  279 (510)
T KOG2575|consen  248 PKLFNSFARIIKIALAVVGTFVIIWLPFLLSG  279 (510)
T ss_pred             ccchHHHHHHHHHHHHHHHHHHHHHHHHHhcc
Confidence            33444567899999999999999999997644


No 36 
>PF11014 DUF2852:  Protein of unknown function (DUF2852);  InterPro: IPR021273  This bacterial family of proteins has no known function. 
Probab=64.50  E-value=13  Score=24.10  Aligned_cols=26  Identities=8%  Similarity=0.082  Sum_probs=17.1

Q ss_pred             HHHHHHHHHHHHhhHHHHHHHHHhhhc
Q psy3982          91 VALMTISLWFLAWTPYLVINFGGILEM  117 (185)
Q Consensus        91 ~~~~i~~~f~i~~~P~~v~~~~~~~~~  117 (185)
                      .+.++|+-|+++| |..+..+.+.+.+
T Consensus        11 ~Ia~mVlGFi~fW-PlGla~Lay~iw~   36 (115)
T PF11014_consen   11 WIAAMVLGFIVFW-PLGLALLAYMIWG   36 (115)
T ss_pred             HHHHHHHHHHHHH-HHHHHHHHHHHHH
Confidence            3667888888866 8776666554433


No 37 
>PF05478 Prominin:  Prominin;  InterPro: IPR008795 The prominins are an emerging family of proteins that, among the multispan membrane proteins, display a novel topology. Mouse and Homo sapiens prominin and (Mus musculus) prominin-like 1 (PROML1) are predicted to contain five membrane spanning domains, with an N-terminal domain exposed to the extracellular space followed by four, alternating small cytoplasmic and large extracellular, loops and a cytoplasmic C-terminal domain []. The exact function of prominin is unknown although in humans defects in PROM1, the gene coding for prominin, cause retinal degeneration [].; GO: 0016021 integral to membrane
Probab=62.29  E-value=38  Score=30.06  Aligned_cols=33  Identities=9%  Similarity=-0.018  Sum_probs=24.0

Q ss_pred             HHHHHHHHHHHHHHHHHhhHHHHHHHHHhhhcc
Q psy3982          86 IKLAKVALMTISLWFLAWTPYLVINFGGILEMV  118 (185)
Q Consensus        86 ~~~~~~~~~i~~~f~i~~~P~~v~~~~~~~~~~  118 (185)
                      .-..-+++.+.+.|+++|+-..+..+....+..
T Consensus       457 tGg~~Lm~gv~~~Flf~~~l~l~~~~~Fl~G~~  489 (806)
T PF05478_consen  457 TGGNFLMAGVGLSFLFSWFLMLLVLFYFLVGGN  489 (806)
T ss_pred             ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence            344557788899999999888777776665543


No 38 
>COG1862 YajC Preprotein translocase subunit YajC [Intracellular trafficking and secretion]
Probab=58.89  E-value=30  Score=21.78  Aligned_cols=30  Identities=17%  Similarity=0.329  Sum_probs=19.1

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy3982          39 VPLILIIYAYYFIVRAVSVHEKQMREQAKK   68 (185)
Q Consensus        39 ~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~   68 (185)
                      .+++++.+.|..+.+--+++.+.++.+.+.
T Consensus        14 ~~vl~~~ifyFli~RPQrKr~K~~~~ml~s   43 (97)
T COG1862          14 PLVLIFAIFYFLIIRPQRKRMKEHQELLNS   43 (97)
T ss_pred             HHHHHHHHHHHhhcCHHHHHHHHHHHHHHh
Confidence            334566777777777777766666655443


No 39 
>PF15086 UPF0542:  Uncharacterised protein family UPF0542
Probab=58.23  E-value=37  Score=20.06  Aligned_cols=37  Identities=14%  Similarity=0.355  Sum_probs=22.9

Q ss_pred             hHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH
Q psy3982          27 SYLIFYGFFVYFVPLI-LIIYAYYFIVRAVSVHEKQMR   63 (185)
Q Consensus        27 ~y~~~~~~~~~~~p~~-i~~~~y~~i~~~l~~~~~~~~   63 (185)
                      .|..+..++..+.|+. +..++..++.+.+.++.++.+
T Consensus        19 P~~Fl~~vll~LtPlfiisa~lSwkLaK~ie~~ere~K   56 (74)
T PF15086_consen   19 PYEFLTTVLLILTPLFIISAVLSWKLAKAIEKEEREKK   56 (74)
T ss_pred             hHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3455555666667764 446677777777776655544


No 40 
>KOG3500|consensus
Probab=55.49  E-value=22  Score=21.49  Aligned_cols=26  Identities=15%  Similarity=0.399  Sum_probs=15.4

Q ss_pred             HHHHHHHHHHHHHHHHHHhhHHHHHHHH
Q psy3982          85 EIKLAKVALMTISLWFLAWTPYLVINFG  112 (185)
Q Consensus        85 ~~~~~~~~~~i~~~f~i~~~P~~v~~~~  112 (185)
                      ++...+.++.++.  ++||+-..+..+.
T Consensus        30 ~~~li~~~lvlTA--VCCwlfWli~yla   55 (84)
T KOG3500|consen   30 NRGLIRCMLVLTA--VCCWLFWLIVYLA   55 (84)
T ss_pred             CcChhHHHHHHHH--HHHHHHHHHHHHH
Confidence            4445555544444  8899776666553


No 41 
>PRK04989 psbM photosystem II reaction center protein M; Provisional
Probab=54.04  E-value=27  Score=17.41  Aligned_cols=22  Identities=18%  Similarity=0.366  Sum_probs=16.3

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHH
Q psy3982          29 LIFYGFFVYFVPLILIIYAYYF   50 (185)
Q Consensus        29 ~~~~~~~~~~~p~~i~~~~y~~   50 (185)
                      ......++.++|.+.++..|++
T Consensus         7 gfiAt~Lfi~iPt~FLlilYvq   28 (35)
T PRK04989          7 GFVASLLFVLVPTVFLIILYIQ   28 (35)
T ss_pred             HHHHHHHHHHHHHHHHHHHhee
Confidence            4455567778899888888874


No 42 
>PF02117 7TM_GPCR_Sra:  Serpentine type 7TM GPCR chemoreceptor Sra;  InterPro: IPR000344 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class a (Sra) from the Sra superfamily []. Sra receptors contain 6-7 hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures.; GO: 0004888 transmembrane signaling receptor activity, 0007606 sensory perception of chemical stimulus, 0016021 integral to membrane
Probab=52.59  E-value=1.1e+02  Score=23.88  Aligned_cols=34  Identities=15%  Similarity=-0.047  Sum_probs=23.1

Q ss_pred             chHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhh
Q psy3982          82 TSAEIKLAKVALMTISLWFLAWTPYLVINFGGIL  115 (185)
Q Consensus        82 ~~~~~~~~~~~~~i~~~f~i~~~P~~v~~~~~~~  115 (185)
                      .++....++.+..+++.-++|..-|.+...+...
T Consensus       228 ~~Eni~ST~aI~~i~~~Q~i~~~iys~~v~~l~~  261 (328)
T PF02117_consen  228 KRENINSTKAICIISISQFICMFIYSFGVFILRN  261 (328)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3455677888888888888887777655544333


No 43 
>COG4736 CcoQ Cbb3-type cytochrome oxidase, subunit 3 [Posttranslational modification, protein turnover, chaperones]
Probab=52.17  E-value=43  Score=19.05  Aligned_cols=39  Identities=15%  Similarity=0.126  Sum_probs=23.6

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3982          29 LIFYGFFVYFVPLILIIYAYYFIVRAVSVHEKQMREQAK   67 (185)
Q Consensus        29 ~~~~~~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~   67 (185)
                      .+.....-.+..+.+.+++++.|+...|.++|...++..
T Consensus         4 e~~~~~a~a~~t~~~~l~fiavi~~ayr~~~K~~~d~aa   42 (60)
T COG4736           4 EMMRGFADAWGTIAFTLFFIAVIYFAYRPGKKGEFDEAA   42 (60)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhHHHHh
Confidence            334444444555566667777777788877666655543


No 44 
>TIGR03038 PS_II_psbM photosystem II reaction center protein PsbM. Members of this protein family are the photosystem II reaction center M protein, product of the psbM gene, in Cyanobacteria and their derived organelles in plants. This model resembles Pfam model pfam05151 but has cutoffs set to avoid false-positive matches to similar (not necessarily homologous) sequences in species that are not photosynthetic.
Probab=50.17  E-value=30  Score=16.96  Aligned_cols=22  Identities=14%  Similarity=0.345  Sum_probs=16.0

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHH
Q psy3982          29 LIFYGFFVYFVPLILIIYAYYF   50 (185)
Q Consensus        29 ~~~~~~~~~~~p~~i~~~~y~~   50 (185)
                      ......++.++|...++..|++
T Consensus         7 ~fiAt~Lfi~iPt~FLiilYvq   28 (33)
T TIGR03038         7 GFIATLLFILVPTVFLLILYIQ   28 (33)
T ss_pred             HHHHHHHHHHHHHHHHHHHhee
Confidence            3445566778899888888874


No 45 
>PF06305 DUF1049:  Protein of unknown function (DUF1049);  InterPro: IPR010445 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=50.16  E-value=47  Score=18.88  Aligned_cols=35  Identities=9%  Similarity=0.142  Sum_probs=17.9

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3982          29 LIFYGFFVYFVPLILIIYAYYFIVRAVSVHEKQMR   63 (185)
Q Consensus        29 ~~~~~~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~   63 (185)
                      ....+++++++-.++....+.+.-+.+++.+++..
T Consensus        24 il~~f~~G~llg~l~~~~~~~~~r~~~~~~~k~l~   58 (68)
T PF06305_consen   24 ILIAFLLGALLGWLLSLPSRLRLRRRIRRLRKELK   58 (68)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            34444455555555555555555555555544444


No 46 
>PHA02650 hypothetical protein; Provisional
Probab=48.62  E-value=54  Score=19.70  Aligned_cols=28  Identities=7%  Similarity=-0.045  Sum_probs=17.1

Q ss_pred             chHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3982          26 RSYLIFYGFFVYFVPLILIIYAYYFIVR   53 (185)
Q Consensus        26 ~~y~~~~~~~~~~~p~~i~~~~y~~i~~   53 (185)
                      ..+..+++++.+++-+++.++.|.+..+
T Consensus        47 ~~~~~~ii~i~~v~i~~l~~flYLK~~~   74 (81)
T PHA02650         47 FNGQNFIFLIFSLIIVALFSFFVFKGYT   74 (81)
T ss_pred             chHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence            3455555556566666777777776644


No 47 
>PRK06531 yajC preprotein translocase subunit YajC; Validated
Probab=48.14  E-value=42  Score=21.83  Aligned_cols=7  Identities=0%  Similarity=0.614  Sum_probs=2.8

Q ss_pred             HHHHHHH
Q psy3982          39 VPLILII   45 (185)
Q Consensus        39 ~p~~i~~   45 (185)
                      +|+++++
T Consensus         7 l~~vv~~   13 (113)
T PRK06531          7 IMFVVML   13 (113)
T ss_pred             HHHHHHH
Confidence            3444443


No 48 
>PF09889 DUF2116:  Uncharacterized protein containing a Zn-ribbon (DUF2116);  InterPro: IPR019216 This entry contains various hypothetical prokaryotic proteins whose functions are unknown. They contain a conserved zinc ribbon motif in the N-terminal part and a predicted transmembrane segment in the C-terminal part.
Probab=47.63  E-value=27  Score=19.81  Aligned_cols=20  Identities=15%  Similarity=0.255  Sum_probs=10.9

Q ss_pred             HHHHHHHHHHHHHHHHHHHh
Q psy3982          84 AEIKLAKVALMTISLWFLAW  103 (185)
Q Consensus        84 ~~~~~~~~~~~i~~~f~i~~  103 (185)
                      +.+|.-.++..+++++++-|
T Consensus        34 ~~~~~~~i~~~~~i~~l~v~   53 (59)
T PF09889_consen   34 RMRKTQYIFFGIFILFLAVW   53 (59)
T ss_pred             HHHHHHHHHHHHHHHHHHHH
Confidence            34455555666665555544


No 49 
>KOG3827|consensus
Probab=47.60  E-value=12  Score=29.76  Aligned_cols=63  Identities=13%  Similarity=0.163  Sum_probs=40.3

Q ss_pred             HHHHHHHHHHHHHHHHHhhHHHHHHHHHhhhcccc--hHHHHHHHHHHHhhhhhccchhheecch
Q psy3982          86 IKLAKVALMTISLWFLAWTPYLVINFGGILEMVEI--NPLITIWGSVFAKANAVYNPIVYAISHP  148 (185)
Q Consensus        86 ~~~~~~~~~i~~~f~i~~~P~~v~~~~~~~~~~~~--~~~~~~~~~~l~~~~~~~nPiiY~~~~~  148 (185)
                      .|--.++++-+++|++.|+-+.++..+-++.+++.  ...-.....+...+++...-++|.+-.+
T Consensus        60 ~kWR~~lliF~~sf~~SWl~Fg~iwwlIA~~hGDL~~~~~~~~~tpCV~nV~sf~sAFLFSiETQ  124 (400)
T KOG3827|consen   60 LKWRWMLLIFSLSFVLSWLFFGVIWWLIAYAHGDLEPDPPGENHTPCVMNVHSFTSAFLFSIETQ  124 (400)
T ss_pred             hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcccCCCCcCCCcceeeccchhhhheeeeeee
Confidence            34456788899999999999988777666655541  1111222345555666666666666544


No 50 
>CHL00080 psbM photosystem II protein M
Probab=45.67  E-value=39  Score=16.67  Aligned_cols=22  Identities=14%  Similarity=0.259  Sum_probs=15.9

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHH
Q psy3982          29 LIFYGFFVYFVPLILIIYAYYF   50 (185)
Q Consensus        29 ~~~~~~~~~~~p~~i~~~~y~~   50 (185)
                      ......++..+|...+++.|++
T Consensus         7 gfiAt~LFi~iPt~FLlilyvk   28 (34)
T CHL00080          7 AFIATALFILVPTAFLLIIYVK   28 (34)
T ss_pred             HHHHHHHHHHHHHHHHHHhhee
Confidence            4455566778898888888864


No 51 
>PHA02844 putative transmembrane protein; Provisional
Probab=45.57  E-value=44  Score=19.82  Aligned_cols=26  Identities=19%  Similarity=0.268  Sum_probs=16.2

Q ss_pred             hHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3982          27 SYLIFYGFFVYFVPLILIIYAYYFIV   52 (185)
Q Consensus        27 ~y~~~~~~~~~~~p~~i~~~~y~~i~   52 (185)
                      .+..+++++.+++..++.++.|.+..
T Consensus        47 ~~~~~ii~i~~v~~~~~~~flYLK~~   72 (75)
T PHA02844         47 STKIWILTIIFVVFATFLTFLYLKAV   72 (75)
T ss_pred             hHHHHHHHHHHHHHHHHHHHHHHhee
Confidence            44445555666666677777777654


No 52 
>COG2322 Predicted membrane protein [Function unknown]
Probab=45.48  E-value=1e+02  Score=21.55  Aligned_cols=37  Identities=3%  Similarity=-0.059  Sum_probs=27.4

Q ss_pred             CchHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhhhc
Q psy3982          81 GTSAEIKLAKVALMTISLWFLAWTPYLVINFGGILEM  117 (185)
Q Consensus        81 ~~~~~~~~~~~~~~i~~~f~i~~~P~~v~~~~~~~~~  117 (185)
                      ..++++++..+-....+.|+++++-.+...--..++.
T Consensus        71 ~i~~Hk~aMltA~~l~l~FlvlYltr~~l~~~t~f~~  107 (177)
T COG2322          71 NIEKHKRAMLTAFTLALVFLVLYLTRHGLGGETAFGG  107 (177)
T ss_pred             cHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccCCC
Confidence            4467888888889999999999987776655443333


No 53 
>COG3924 Predicted membrane protein [Function unknown]
Probab=43.01  E-value=71  Score=18.89  Aligned_cols=33  Identities=9%  Similarity=0.350  Sum_probs=23.7

Q ss_pred             ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3982          22 DWHHRSYLIFYGFFVYFVPLILIIYAYYFIVRA   54 (185)
Q Consensus        22 ~~~~~~y~~~~~~~~~~~p~~i~~~~y~~i~~~   54 (185)
                      ++....+-.....-.+.+|++.++.|++.|-..
T Consensus        36 t~G~~gfP~WFE~aCi~lPllFi~l~~~mvkfi   68 (80)
T COG3924          36 TPGFTGFPLWFEMACILLPLLFIVLCWAMVKFI   68 (80)
T ss_pred             CCCCCCChHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence            344455666677778899999999998866433


No 54 
>TIGR00739 yajC preprotein translocase, YajC subunit. While this protein is part of the preprotein translocase in Escherichia coli, it is not essential for viability or protein secretion. The N-terminus region contains a predicted membrane-spanning region followed by a region consisting almost entirely of residues with charged (acidic, basic, or zwitterionic) side chains. This small protein is about 100 residues in length, and is restricted to bacteria; however, this protein is absent from some lineages, including spirochetes and Mycoplasmas.
Probab=42.42  E-value=71  Score=19.45  Aligned_cols=24  Identities=17%  Similarity=0.437  Sum_probs=12.6

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHH
Q psy3982          43 LIIYAYYFIVRAVSVHEKQMREQA   66 (185)
Q Consensus        43 i~~~~y~~i~~~l~~~~~~~~~~~   66 (185)
                      ++++.|..+.+--++++++.+.+.
T Consensus        12 ~~~i~yf~~~rpqkK~~k~~~~m~   35 (84)
T TIGR00739        12 IFLIFYFLIIRPQRKRRKAHKKLI   35 (84)
T ss_pred             HHHHHHHheechHHHHHHHHHHHH
Confidence            344556655555555555554443


No 55 
>PHA02975 hypothetical protein; Provisional
Probab=40.35  E-value=76  Score=18.51  Aligned_cols=27  Identities=11%  Similarity=0.377  Sum_probs=17.5

Q ss_pred             chHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3982          26 RSYLIFYGFFVYFVPLILIIYAYYFIV   52 (185)
Q Consensus        26 ~~y~~~~~~~~~~~p~~i~~~~y~~i~   52 (185)
                      ..+..++.++.+++..++.++.|.+..
T Consensus        42 ~~~~~~ii~i~~v~~~~~~~flYLK~~   68 (69)
T PHA02975         42 SLSIILIIFIIFITCIAVFTFLYLKLM   68 (69)
T ss_pred             chHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence            345555555666666777788887653


No 56 
>PF10322 7TM_GPCR_Sru:  Serpentine type 7TM GPCR chemoreceptor Sru;  InterPro: IPR003839 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class u (Sru) from the Srg superfamily []. 
Probab=40.30  E-value=1.8e+02  Score=22.66  Aligned_cols=17  Identities=18%  Similarity=0.372  Sum_probs=7.4

Q ss_pred             HHHHHHHHHHHHHHHHH
Q psy3982          37 YFVPLILIIYAYYFIVR   53 (185)
Q Consensus        37 ~~~p~~i~~~~y~~i~~   53 (185)
                      .+..+++-...+.++-+
T Consensus       208 ~~~iii~N~lm~~Klr~  224 (307)
T PF10322_consen  208 MISIIILNILMFFKLRK  224 (307)
T ss_pred             HHHHHHHHHHHHHHHHH
Confidence            33334444444444443


No 57 
>PF05817 Ribophorin_II:  Oligosaccharyltransferase subunit Ribophorin II;  InterPro: IPR008814 This family consists of several eukaryotic Ribophorin II (RPN2) proteins. The mammalian oligosaccharyltransferase (OST) is a protein complex that effects the cotranslational N-glycosylation of newly synthesised polypeptides, and is composed of at least four rough ER-specific membrane proteins: ribophorins I and II (RI and RII), OST48, and Dadl. The mechanism(s) by which the subunits of this complex are retained in the ER are not well understood [].; GO: 0004579 dolichyl-diphosphooligosaccharide-protein glycotransferase activity, 0018279 protein N-linked glycosylation via asparagine, 0005789 endoplasmic reticulum membrane, 0008250 oligosaccharyltransferase complex
Probab=39.62  E-value=2.6e+02  Score=24.35  Aligned_cols=28  Identities=7%  Similarity=0.139  Sum_probs=17.4

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3982          28 YLIFYGFFVYFVPLILIIYAYYFIVRAV   55 (185)
Q Consensus        28 y~~~~~~~~~~~p~~i~~~~y~~i~~~l   55 (185)
                      ..-..|+...+.|+++++..+.++--.+
T Consensus       548 ~vS~~F~~~vlapl~~Ll~~W~~lG~Nl  575 (636)
T PF05817_consen  548 IVSLVFTGLVLAPLLVLLILWLKLGANL  575 (636)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHcCcch
Confidence            3444455667788877777777654333


No 58 
>PHA02819 hypothetical protein; Provisional
Probab=38.50  E-value=84  Score=18.46  Aligned_cols=26  Identities=19%  Similarity=0.080  Sum_probs=16.4

Q ss_pred             chHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3982          26 RSYLIFYGFFVYFVPLILIIYAYYFI   51 (185)
Q Consensus        26 ~~y~~~~~~~~~~~p~~i~~~~y~~i   51 (185)
                      ..+..+++++.+++-+++.++.|.+.
T Consensus        44 ~~~~~~ii~l~~~~~~~~~~flYLK~   69 (71)
T PHA02819         44 FLRYYLIIGLVTIVFVIIFIIFYLKV   69 (71)
T ss_pred             hhHHHHHHHHHHHHHHHHHHHHHHHh
Confidence            34555555566666667777777765


No 59 
>PF05151 PsbM:  Photosystem II reaction centre M protein (PsbM);  InterPro: IPR007826 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection [].  This family represents the low molecular weight transmembrane protein PsbM found in PSII. PsbM is one of the most hydrophobic proteins in the thylakoid membrane. The function of this protein is unknown.; GO: 0015979 photosynthesis, 0019684 photosynthesis, light reaction, 0009523 photosystem II, 0016021 integral to membrane; PDB: 3A0H_m 3ARC_m 3A0B_M 3PRR_M 3PRQ_M 1S5L_M 4FBY_e 3BZ2_M 3BZ1_M 2AXT_M ....
Probab=38.26  E-value=52  Score=15.95  Aligned_cols=21  Identities=14%  Similarity=0.309  Sum_probs=15.2

Q ss_pred             HHHHHHHHHHHHHHHHHHHHH
Q psy3982          30 IFYGFFVYFVPLILIIYAYYF   50 (185)
Q Consensus        30 ~~~~~~~~~~p~~i~~~~y~~   50 (185)
                      .....++.++|....+..|++
T Consensus         8 fiAtaLfi~iPt~FLiilyvq   28 (31)
T PF05151_consen    8 FIATALFILIPTAFLIILYVQ   28 (31)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHhheEee
Confidence            444566677888888888875


No 60 
>PF15102 TMEM154:  TMEM154 protein family
Probab=37.17  E-value=2.5  Score=28.63  Aligned_cols=7  Identities=29%  Similarity=0.932  Sum_probs=3.1

Q ss_pred             HHHHHHH
Q psy3982          35 FVYFVPL   41 (185)
Q Consensus        35 ~~~~~p~   41 (185)
                      +..++|+
T Consensus        59 LmIlIP~   65 (146)
T PF15102_consen   59 LMILIPL   65 (146)
T ss_pred             EEEeHHH
Confidence            3344553


No 61 
>PF07330 DUF1467:  Protein of unknown function (DUF1467);  InterPro: IPR009935 This family consists of several bacterial proteins of around 90 residues in length. The function of this family is unknown.
Probab=36.50  E-value=79  Score=19.39  Aligned_cols=34  Identities=3%  Similarity=0.057  Sum_probs=23.7

Q ss_pred             cccCCCCCCchHHHHHHHHHHHHHHHHHHHhhHH
Q psy3982          73 TLRSGDQSGTSAEIKLAKVALMTISLWFLAWTPY  106 (185)
Q Consensus        73 ~~~~~~~~~~~~~~~~~~~~~~i~~~f~i~~~P~  106 (185)
                      +...+.+.+.+-.+|+..+.+.-+++|.+.+.-+
T Consensus        38 Gt~~sAP~~~~l~rk~~~TTiiaavi~~~~~~~~   71 (85)
T PF07330_consen   38 GTDPSAPANPRLKRKALITTIIAAVIFAIIYLII   71 (85)
T ss_pred             CCCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3444456677778888888888888887766433


No 62 
>PF04238 DUF420:  Protein of unknown function (DUF420);  InterPro: IPR007352 This is a predicted membrane protein with four transmembrane helices.
Probab=34.20  E-value=1.5e+02  Score=19.93  Aligned_cols=38  Identities=13%  Similarity=0.112  Sum_probs=26.8

Q ss_pred             CchHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhhhcc
Q psy3982          81 GTSAEIKLAKVALMTISLWFLAWTPYLVINFGGILEMV  118 (185)
Q Consensus        81 ~~~~~~~~~~~~~~i~~~f~i~~~P~~v~~~~~~~~~~  118 (185)
                      +.+.+++...........|+++++-++...-...+.++
T Consensus        30 ~~~~Hr~~Ml~a~~ls~lFlv~Yl~~~~~~g~~~f~g~   67 (133)
T PF04238_consen   30 RIKLHRKLMLTAFVLSALFLVSYLYYHFLGGSTPFGGP   67 (133)
T ss_pred             CHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccCCC
Confidence            34668888888899999999999877765333333333


No 63 
>PF11710 Git3:  G protein-coupled glucose receptor regulating Gpa2;  InterPro: IPR023041 This entry contains a functionally uncharacterised region belonging to the Git3 G-protein coupled receptor. Git3 is one of six proteins required for glucose-triggered adenylate cyclase activation, and is a G protein-coupled receptor responsible for the activation of adenylate cyclase through Gpa2 - heterotrimeric G protein alpha subunit, part of the glucose-detection pathway. Git3 contains seven predicted transmembrane domains, a third cytoplasmic loop and a cytoplasmic tail []. This is the conserved N-terminal domain of the member proteins. 
Probab=33.16  E-value=1.3e+02  Score=21.69  Aligned_cols=35  Identities=17%  Similarity=0.295  Sum_probs=25.7

Q ss_pred             hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3982          27 SYLIFYGFFVYFVPLILIIYAYYFIVRAVSVHEKQ   61 (185)
Q Consensus        27 ~y~~~~~~~~~~~p~~i~~~~y~~i~~~l~~~~~~   61 (185)
                      .+.....-+..++-.++++..|+.|+..+|++-++
T Consensus       166 ~~Rl~l~y~~~~~~~~~~i~iY~~if~~lrr~~~~  200 (201)
T PF11710_consen  166 WYRLWLHYIWRFIIIFAIIIIYIAIFFYLRRRIRR  200 (201)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc
Confidence            34555555556666888999999999999976543


No 64 
>PRK14094 psbM photosystem II reaction center protein M; Provisional
Probab=32.05  E-value=37  Score=18.14  Aligned_cols=22  Identities=18%  Similarity=0.177  Sum_probs=15.9

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHH
Q psy3982          29 LIFYGFFVYFVPLILIIYAYYF   50 (185)
Q Consensus        29 ~~~~~~~~~~~p~~i~~~~y~~   50 (185)
                      ..+..+++..+|.+.+++.|++
T Consensus         7 gfiAtaLFi~iPT~FLlilYVk   28 (50)
T PRK14094          7 GFVASLLFVGVPTIFLIGLFIS   28 (50)
T ss_pred             HHHHHHHHHHHHHHHhhheeEE
Confidence            4455566678888888888874


No 65 
>PLN00090 photosystem II reaction center M protein; Provisional
Probab=31.03  E-value=1e+02  Score=19.25  Aligned_cols=22  Identities=14%  Similarity=0.226  Sum_probs=16.2

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHH
Q psy3982          29 LIFYGFFVYFVPLILIIYAYYF   50 (185)
Q Consensus        29 ~~~~~~~~~~~p~~i~~~~y~~   50 (185)
                      ..+...++.++|.+.+++.|++
T Consensus        77 afIATaLFIlIPTaFLLILYVQ   98 (113)
T PLN00090         77 AYLAVALGTFLPCLFLINLFIQ   98 (113)
T ss_pred             HHHHHHHHHHHHHHHHHHHHhh
Confidence            4445566778899888888874


No 66 
>COG3771 Predicted membrane protein [Function unknown]
Probab=30.62  E-value=1.4e+02  Score=18.48  Aligned_cols=42  Identities=10%  Similarity=0.065  Sum_probs=30.6

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc
Q psy3982          29 LIFYGFFVYFVPLILIIYAYYFIVRAVSVHEKQMREQAKKMN   70 (185)
Q Consensus        29 ~~~~~~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~~~~~~~   70 (185)
                      ....|..++.+--++...+|.+.-..+.+-.++.+.+..+.+
T Consensus        45 la~lF~~G~~lgwli~g~fy~k~~l~~~~l~rqiKr~~~q~~   86 (97)
T COG3771          45 LATLFAAGFALGWLICGLFYLKVRLSLMRLERQIKRLENQLS   86 (97)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC
Confidence            345566677777788888899888888877777776665544


No 67 
>PF00002 7tm_2:  7 transmembrane receptor (Secretin family);  InterPro: IPR000832 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The secretin-like GPCRs include secretin [], calcitonin [], parathyroid hormone/parathyroid hormone-related peptides [] and vasoactive intestinal peptide [], all of which activate adenylyl cyclase and the phosphatidyl-inositol-calcium pathway. These receptors contain seven transmembrane regions, in a manner reminiscent of the rhodopsins and other receptors believed to interact with G-proteins (however there is no significant sequence identity between these families, the secretin-like receptors thus bear their own unique '7TM' signature). Their N terminus is probably located on the extracellular side of the membrane and potentially glycosylated. This N-terminal region contains a long conserved region which allow the binding of large peptidic ligand such as glucagon, secretin, VIP and PACAP; this region contains five conserved cysteines residues which could be involved in disulphide bond. The C-terminal region of these receptor is probably cytoplasmic. Every receptor gene in this family is encoded on multiple exons, and several of these genes are alternatively spliced to yield functionally distinct products. ; GO: 0004930 G-protein coupled receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane; PDB: 3L2J_A 1BL1_A.
Probab=30.37  E-value=17  Score=26.56  Aligned_cols=21  Identities=19%  Similarity=0.422  Sum_probs=0.0

Q ss_pred             HHHHHHHHHHHHHHHHhhHHH
Q psy3982          87 KLAKVALMTISLWFLAWTPYL  107 (185)
Q Consensus        87 ~~~~~~~~i~~~f~i~~~P~~  107 (185)
                      +..+..+.+...+.+.|+...
T Consensus       195 ~~~~~~~~L~~llg~~w~~~~  215 (242)
T PF00002_consen  195 KILRASLILLPLLGITWLFGI  215 (242)
T ss_dssp             ---------------------
T ss_pred             hHHHHHHHHHHHHHHHheeee
Confidence            467777778888888887643


No 68 
>PF02699 YajC:  Preprotein translocase subunit;  InterPro: IPR003849 Secretion across the inner membrane in some Gram-negative bacteria occurs via the preprotein translocase pathway. Proteins are produced in the cytoplasm as precursors, and require a chaperone subunit to direct them to the translocase component []. From there, the mature proteins are either targeted to the outer membrane, or remain as periplasmic proteins []. The translocase protein subunits are encoded on the bacterial chromosome.  The translocase itself comprises 7 proteins, including a chaperone (SecB), ATPase (SecA), an integral membrane complex (SecY, SecE and SecG), and two additional membrane proteins that promote the release of the mature peptide into the periplasm (SecD and SecF) []. Other cytoplasmic/periplasmic proteins play a part in preprotein translocase activity, namely YidC and YajC []. The latter is bound in a complex to SecD and SecF, and plays a part in stabilising and regulating secretion through the SecYEG integral membrane component via SecA [].  Homologues of the YajC gene have been found in a range of pathogenic and commensal microbes. Brucella abortis YajC- and SecD-like proteins were shown to stimulate a Th1 cell-mediated immune response in mice, and conferred protection when challenged with B.abortis []. Therefore, these proteins may have an antigenic role as well as a secretory one in virulent bacteria []. A number of previously uncharacterised "hypothetical" proteins also show similarity to E.coli YajC, suggesting that this family is wider than first thought [].  More recently, the precise interactions between the E.coli SecYEG complex, SecD, SecF, YajC and YidC have been studied []. Rather than acting individually, the four proteins form a heterotetrameric complex and associate with the SecYEG heterotrimeric complex []. The SecF and YajC subunits link the complex to the integral membrane translocase. ; PDB: 2RDD_B.
Probab=30.12  E-value=1.1e+02  Score=18.47  Aligned_cols=18  Identities=17%  Similarity=0.615  Sum_probs=7.6

Q ss_pred             HHHHHHHHHHHHHHHHHH
Q psy3982          42 ILIIYAYYFIVRAVSVHE   59 (185)
Q Consensus        42 ~i~~~~y~~i~~~l~~~~   59 (185)
                      +++++.|..+.+--++++
T Consensus        10 ~~~~i~yf~~~rpqkk~~   27 (82)
T PF02699_consen   10 IIFVIFYFLMIRPQKKQQ   27 (82)
T ss_dssp             HHHHHHHHHTHHHHHHHH
T ss_pred             HHHHHHhhheecHHHHHH
Confidence            344444544444444333


No 69 
>PRK05886 yajC preprotein translocase subunit YajC; Validated
Probab=30.06  E-value=1.6e+02  Score=19.06  Aligned_cols=24  Identities=25%  Similarity=0.405  Sum_probs=10.2

Q ss_pred             HHHHHHH-HHHHHHHHHHHHHHHHH
Q psy3982          37 YFVPLIL-IIYAYYFIVRAVSVHEK   60 (185)
Q Consensus        37 ~~~p~~i-~~~~y~~i~~~l~~~~~   60 (185)
                      .++|+++ +++.|..+.+--+|+++
T Consensus         6 ~ll~lv~i~~i~yF~~iRPQkKr~K   30 (109)
T PRK05886          6 LFLPFLLIMGGFMYFASRRQRKAMQ   30 (109)
T ss_pred             HHHHHHHHHHHHHHHHccHHHHHHH
Confidence            3345443 34445444433343333


No 70 
>COG4068 Uncharacterized protein containing a Zn-ribbon [Function unknown]
Probab=28.81  E-value=1.2e+02  Score=17.18  Aligned_cols=19  Identities=11%  Similarity=0.076  Sum_probs=8.3

Q ss_pred             HHHHHHHHHHHHHHHHHHH
Q psy3982          84 AEIKLAKVALMTISLWFLA  102 (185)
Q Consensus        84 ~~~~~~~~~~~i~~~f~i~  102 (185)
                      ++.|.+.++...+++.++.
T Consensus        38 ~R~r~~~~~~~li~aLi~v   56 (64)
T COG4068          38 KRQRNFMILMFLILALILV   56 (64)
T ss_pred             HHHHHHHHHHHHHHHHHHH
Confidence            3444444444444444433


No 71 
>PF11119 DUF2633:  Protein of unknown function (DUF2633);  InterPro: IPR022576  This family is conserved largely in Proteobacteria. Several members are named as YfgG. The function is not known. 
Probab=28.78  E-value=1.2e+02  Score=17.21  Aligned_cols=30  Identities=10%  Similarity=0.183  Sum_probs=16.2

Q ss_pred             hHHHHHHHHHHHHHHHHHHHhhHHHHHHHH
Q psy3982          83 SAEIKLAKVALMTISLWFLAWTPYLVINFG  112 (185)
Q Consensus        83 ~~~~~~~~~~~~i~~~f~i~~~P~~v~~~~  112 (185)
                      +...+.++.++++...+++.-+-|..+..+
T Consensus         4 k~~~~mtriVLLISfiIlfgRl~Y~~I~a~   33 (59)
T PF11119_consen    4 KKNSRMTRIVLLISFIILFGRLIYSAIGAW   33 (59)
T ss_pred             cccchHHHHHHHHHHHHHHHHHHHHHHhHH
Confidence            334566676666666655554444444333


No 72 
>PF12725 DUF3810:  Protein of unknown function (DUF3810);  InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=28.39  E-value=2.9e+02  Score=21.57  Aligned_cols=22  Identities=14%  Similarity=0.310  Sum_probs=12.7

Q ss_pred             HHHHHHHHHHHHHHHHHhhHHH
Q psy3982          86 IKLAKVALMTISLWFLAWTPYL  107 (185)
Q Consensus        86 ~~~~~~~~~i~~~f~i~~~P~~  107 (185)
                      .+++..+..+.+.|.+.|.-..
T Consensus        58 ~~~~~~~~~~y~~F~~~WGlNY   79 (318)
T PF12725_consen   58 LNILFFLSVLYFLFYLLWGLNY   79 (318)
T ss_pred             HHHHHHHHHHHHHHHHHhhhhc
Confidence            4555555666666666665443


No 73 
>PF14163 SieB:  Superinfection exclusion protein B
Probab=28.36  E-value=1.2e+02  Score=20.51  Aligned_cols=22  Identities=14%  Similarity=0.220  Sum_probs=13.1

Q ss_pred             HHHHHHHHHHhhHHHHHHHHHh
Q psy3982          93 LMTISLWFLAWTPYLVINFGGI  114 (185)
Q Consensus        93 ~~i~~~f~i~~~P~~v~~~~~~  114 (185)
                      .+++.+-++.++|-.+...+..
T Consensus         4 ~l~i~~~~llf~P~~~~~~l~l   25 (151)
T PF14163_consen    4 WLIIFSGLLLFLPESLLEWLNL   25 (151)
T ss_pred             HHHHHHHHHHHCCHHHHHHhCc
Confidence            3444444566888877665544


No 74 
>PF02439 Adeno_E3_CR2:  Adenovirus E3 region protein CR2;  InterPro: IPR003470 Early region 3 (E3) of human adenoviruses (Ads) codes for proteins that appear to control viral interactions with the host []. This region called CR1 (conserved region 1) [] is found three times in Human adenovirus 19 (a subgroup D adenovirus) 49 kDa protein in the E3 region. CR1 is also found in the 20.1 Kd protein of subgroup B adenoviruses. The function of this 80 amino acid region is unknown. This region is probably a divergent immunoglobulin domain.
Probab=27.93  E-value=95  Score=15.84  Aligned_cols=28  Identities=14%  Similarity=0.222  Sum_probs=13.7

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3982          29 LIFYGFFVYFVPLILIIYAYYFIVRAVS   56 (185)
Q Consensus        29 ~~~~~~~~~~~p~~i~~~~y~~i~~~l~   56 (185)
                      .+...++..+.-.++.+++|+--+++-+
T Consensus         7 aIIv~V~vg~~iiii~~~~YaCcykk~~   34 (38)
T PF02439_consen    7 AIIVAVVVGMAIIIICMFYYACCYKKHR   34 (38)
T ss_pred             hHHHHHHHHHHHHHHHHHHHHHHHcccc
Confidence            3334444444445556666665544433


No 75 
>PTZ00370 STEVOR; Provisional
Probab=27.30  E-value=1.1e+02  Score=23.44  Aligned_cols=17  Identities=18%  Similarity=0.547  Sum_probs=8.7

Q ss_pred             HHHHHHHHHHHHHHHHH
Q psy3982          38 FVPLILIIYAYYFIVRA   54 (185)
Q Consensus        38 ~~p~~i~~~~y~~i~~~   54 (185)
                      ++-.++++++|+.++++
T Consensus       265 lil~vvliilYiwlyrr  281 (296)
T PTZ00370        265 LILAVVLIILYIWLYRR  281 (296)
T ss_pred             HHHHHHHHHHHHHHHHh
Confidence            33345556666654443


No 76 
>PF00664 ABC_membrane:  ABC transporter transmembrane region;  InterPro: IPR001140 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energise diverse biological systems. ABC transporters minimally consist of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain [].  The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyse ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarise the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site [, , ]. The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis [, , , , , ]. The ATP-Binding Cassette (ABC) superfamily forms one of the largest of all protein families with a diversity of physiological functions []. Several studies have shown that there is a correlation between the functional characterisation and the phylogenetic classification of the ABC cassette [, ]. More than 50 subfamilies have been described based on a phylogenetic and functional classification [, , ]; (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). A variety of ATP-binding transport proteins have a six transmembrane helical region. They are all integral membrane proteins involved in a variety of transport systems. Members of this family include; the cystic fibrosis transmembrane conductance regulator (CFTR), bacterial leukotoxin secretion ATP-binding protein, multidrug resistance proteins, the yeast leptomycin B resistance protein, the mammalian sulphonylurea receptor and antigen peptide transporter 2. Many of these proteins have two such regions.; GO: 0005524 ATP binding, 0042626 ATPase activity, coupled to transmembrane movement of substances, 0006810 transport, 0055085 transmembrane transport, 0016021 integral to membrane; PDB: 3G61_B 3G5U_B 3G60_A 3B60_D 3QF4_B 2HYD_A 2ONJ_A 4A82_B 4AA3_A 2YL4_A.
Probab=27.24  E-value=2.3e+02  Score=20.09  Aligned_cols=20  Identities=15%  Similarity=0.217  Sum_probs=12.8

Q ss_pred             HHHHHHHHHHHhhhhhccch
Q psy3982         122 PLITIWGSVFAKANAVYNPI  141 (185)
Q Consensus       122 ~~~~~~~~~l~~~~~~~nPi  141 (185)
                      .....+...+.+.+...+|+
T Consensus       256 ~s~g~~~~~~~~~~~~~~pl  275 (275)
T PF00664_consen  256 ISIGTLVAFLSLSSQLINPL  275 (275)
T ss_dssp             SHHHHHHHHHHHHHHHHHHH
T ss_pred             cCHHHHHHHHHHHHHHHhhC
Confidence            34555666677777777774


No 77 
>PF00375 SDF:  Sodium:dicarboxylate symporter family;  InterPro: IPR001991 It has been shown [] that integral membrane proteins that mediate the uptake of a wide variety of molecules with the concomitant uptake of sodium ions (sodium symporters) can be grouped, on the basis of sequence and functional similarities into a number of distinct families. One of these families [] is known as the sodium:dicarboxylate symporter family (SDF). Such re-uptake of neurotransmitters from the synapses, is thought to be an important mechanism for terminating their action, by removing these chemicals from the synaptic cleft, and transporting them into presynaptic nerve terminals, and surrounding neuroglia. this removal is also believed to prevent them accumulating to the point of reaching neurotoxic [, ]. The structure of these transporter proteins has been variously reported to contain from 8 to 10 transmembrane (TM) regions, although 10 now seems to be the accepted value. Members of the family include: several mammalian excitatory amino acid transporters, and a number of bacterial transporters. They vary with regars to their dependence on transport of sodium, and other ions.; GO: 0017153 sodium:dicarboxylate symporter activity, 0006835 dicarboxylic acid transport, 0016020 membrane; PDB: 3V8G_B 1XFH_A 3KBC_B 2NWX_B 3V8F_B 2NWL_B 2NWW_A.
Probab=26.27  E-value=3.4e+02  Score=21.70  Aligned_cols=73  Identities=12%  Similarity=0.124  Sum_probs=41.0

Q ss_pred             HHHHHHHHHHHHHHHHHhhHHHHHHHHHhhhccc----chHH---HHHHHHHHHhhhhhccchhhe-ecchHHHHHHHhh
Q psy3982          86 IKLAKVALMTISLWFLAWTPYLVINFGGILEMVE----INPL---ITIWGSVFAKANAVYNPIVYA-ISHPKFRQALDKK  157 (185)
Q Consensus        86 ~~~~~~~~~i~~~f~i~~~P~~v~~~~~~~~~~~----~~~~---~~~~~~~l~~~~~~~nPiiY~-~~~~~fr~~~~~~  157 (185)
                      -+.......-++-++++..|+.+..+........    ....   +..+.......-..+.|+++. +..++..+-++++
T Consensus       166 ~~~~~~~~~~ii~~i~~~~Pigv~~l~a~~~~~~~~~~l~~l~~~v~~~~~~~~i~~~v~~pl~~~~~~~~np~~~~~~~  245 (390)
T PF00375_consen  166 FESLNEVIMKIINWIMKLAPIGVFGLIANSIATQGLSILGALGKFVLTVYVALLIHLFVVLPLILFVLTRKNPFKFLKAM  245 (390)
T ss_dssp             HHHHHHHHHHHHHHHTTTHHHHHHHHHHHHHHSSCCGHHHHHHHHHHHHHHHHHHHHHHTHHHHHH-TTT--HHHHHHHT
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcChhhHHHHHHHHHHHHHHHHHHHHHHhhHHHhhhcCCCHHHHHHHH
Confidence            3445556677788899999999988765533222    1111   111111122223566899988 5555777766665


Q ss_pred             c
Q psy3982         158 F  158 (185)
Q Consensus       158 ~  158 (185)
                      .
T Consensus       246 ~  246 (390)
T PF00375_consen  246 L  246 (390)
T ss_dssp             H
T ss_pred             H
Confidence            4


No 78 
>PF05241 EBP:  Emopamil binding protein ;  InterPro: IPR007905 Emopamil binding protein (EBP) is a nonglycosylated type I integral membrane protein of endoplasmic reticulum and shows high level expression in epithelial tissues. The EBP protein has emopamil binding domains, including the sterol acceptor site and the catalytic centre, which show Delta8-Delta7 sterol isomerase activity. Human sterol isomerase, a homologue of mouse EBP, is suggested not only to play a role in cholesterol biosynthesis, but also to affect lipoprotein internalisation. In humans, mutations of EBP are known to cause the genetic disorder of X-linked dominant chondrodysplasia punctata (CDPX2). This syndrome of humans is lethal in most males, and affected females display asymmetric hyperkeratotic skin and skeletal abnormalities [].; GO: 0047750 cholestenol delta-isomerase activity, 0016125 sterol metabolic process, 0005783 endoplasmic reticulum, 0016021 integral to membrane
Probab=25.99  E-value=1.2e+02  Score=21.82  Aligned_cols=32  Identities=28%  Similarity=0.405  Sum_probs=26.3

Q ss_pred             hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3982          27 SYLIFYGFFVYFVPLILIIYAYYFIVRAVSVH   58 (185)
Q Consensus        27 ~y~~~~~~~~~~~p~~i~~~~y~~i~~~l~~~   58 (185)
                      .|.+....+-.++|+.++.-++-.|....+.+
T Consensus       160 ~y~v~~N~iWivvP~~~l~~s~~~i~~a~~~~  191 (194)
T PF05241_consen  160 FYFVFPNGIWIVVPLLLLYQSWKEIARAFRAA  191 (194)
T ss_pred             eehhhcchHHHHHHHHHHHHHHHHHHHHHHHh
Confidence            45667778889999999999999888887754


No 79 
>PHA03054 IMV membrane protein; Provisional
Probab=25.95  E-value=1.5e+02  Score=17.45  Aligned_cols=26  Identities=23%  Similarity=0.467  Sum_probs=15.4

Q ss_pred             chHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3982          26 RSYLIFYGFFVYFVPLILIIYAYYFI   51 (185)
Q Consensus        26 ~~y~~~~~~~~~~~p~~i~~~~y~~i   51 (185)
                      ..+..+++++.+++-+++.++.|.+.
T Consensus        46 ~~~~~~ii~l~~v~~~~l~~flYLK~   71 (72)
T PHA03054         46 WGWYWLIIIFFIVLILLLLIYLYLKV   71 (72)
T ss_pred             chHHHHHHHHHHHHHHHHHHHHHHhc
Confidence            34555555555666666667777653


No 80 
>TIGR02736 cbb3_Q_epsi cytochrome c oxidase, cbb3-type, CcoQ subunit, epsilon-Proteobacterial. Members of this protein family are restricted to the epsilon branch of the Proteobacteria. All members are found in operons containing the other three structural subunits of the cbb3 type of cytochrome c oxidase. These small proteins show remote sequence similarity to the CcoQ subunit in other cytochrome c oxidase systems, so this family is assumed to represent the epsilonproteobacterial variant of CcoQ.
Probab=25.56  E-value=1.3e+02  Score=16.75  Aligned_cols=21  Identities=24%  Similarity=0.558  Sum_probs=11.9

Q ss_pred             HHHHHHHHHHHHHHHHHHHHH
Q psy3982          39 VPLILIIYAYYFIVRAVSVHE   59 (185)
Q Consensus        39 ~p~~i~~~~y~~i~~~l~~~~   59 (185)
                      +.+++.+++|.-|+.--|+++
T Consensus         7 ~ti~lvv~LYgY~yhLYrsek   27 (56)
T TIGR02736         7 FTLLLVIFLYAYIYHLYRSQK   27 (56)
T ss_pred             HHHHHHHHHHHHHHHhhhhhc
Confidence            334455566666676666443


No 81 
>COG4665 FcbT2 TRAP-type mannitol/chloroaromatic compound transport system, small permease component [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=25.07  E-value=2.5e+02  Score=19.80  Aligned_cols=33  Identities=15%  Similarity=0.398  Sum_probs=23.1

Q ss_pred             ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3982          24 HHRSYLIFYGFFVYFVPLILIIYAYYFIVRAVS   56 (185)
Q Consensus        24 ~~~~y~~~~~~~~~~~p~~i~~~~y~~i~~~l~   56 (185)
                      ..+.+.=.+..+.|++|+++++..|.-=+..+.
T Consensus        88 R~qa~vDllGtifFLlPfc~l~iy~~~~~~~~S  120 (182)
T COG4665          88 RTQAWVDLLGTIFFLLPFCLLVIYLSWPYVALS  120 (182)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHccHHHHHH
Confidence            345667777888899999888776665444444


No 82 
>TIGR01478 STEVOR variant surface antigen, stevor family. This model represents the stevor branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of stevor sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 8 bits.
Probab=24.53  E-value=1.1e+02  Score=23.45  Aligned_cols=15  Identities=20%  Similarity=0.587  Sum_probs=7.2

Q ss_pred             HHHHHHHHHHHHHHH
Q psy3982          39 VPLILIIYAYYFIVR   53 (185)
Q Consensus        39 ~p~~i~~~~y~~i~~   53 (185)
                      +-.++++++|+.+++
T Consensus       270 il~vvliiLYiWlyr  284 (295)
T TIGR01478       270 ILTVVLIILYIWLYR  284 (295)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            334555555554433


No 83 
>PHA03234 DNA packaging protein UL33; Provisional
Probab=24.50  E-value=89  Score=24.41  Aligned_cols=11  Identities=18%  Similarity=0.495  Sum_probs=6.4

Q ss_pred             HHhhHHHHHHH
Q psy3982         101 LAWTPYLVINF  111 (185)
Q Consensus       101 i~~~P~~v~~~  111 (185)
                      ...+|..++.+
T Consensus       205 ~f~iPl~im~~  215 (338)
T PHA03234        205 WGIFPTMIFSF  215 (338)
T ss_pred             HHHHHHHHHHH
Confidence            34567766554


No 84 
>PF10669 Phage_Gp23:  Protein gp23 (Bacteriophage A118);  InterPro: IPR018926  This entry is represented by the major tail subunit protein, Gp23 of Listeria phage A118 and prophage found in Bacilli. The function is currently unknown. 
Probab=22.55  E-value=2.1e+02  Score=17.98  Aligned_cols=15  Identities=20%  Similarity=0.397  Sum_probs=6.1

Q ss_pred             HHHHHHHHHHHHHHH
Q psy3982          37 YFVPLILIIYAYYFI   51 (185)
Q Consensus        37 ~~~p~~i~~~~y~~i   51 (185)
                      ++..++++++.-.+|
T Consensus        23 ~i~~FiILLIi~~~I   37 (121)
T PF10669_consen   23 FIVVFIILLIITKSI   37 (121)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            333344444444444


No 85 
>PHA01815 hypothetical protein
Probab=22.39  E-value=1.4e+02  Score=15.81  Aligned_cols=23  Identities=9%  Similarity=0.217  Sum_probs=14.2

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHH
Q psy3982          31 FYGFFVYFVPLILIIYAYYFIVR   53 (185)
Q Consensus        31 ~~~~~~~~~p~~i~~~~y~~i~~   53 (185)
                      ..+.+.|++.+++++..|+++..
T Consensus         9 ivfllaflitliilmt~~irvsf   31 (55)
T PHA01815          9 IVFLLAFLITLIILMTLHIRVSF   31 (55)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHH
Confidence            34555666777777777765533


No 86 
>PF10066 DUF2304:  Uncharacterized conserved protein (DUF2304);  InterPro: IPR019277  This entry represents hypothetical archaeal and bacterial proteins that have no known function. 
Probab=22.38  E-value=2.3e+02  Score=18.25  Aligned_cols=51  Identities=16%  Similarity=0.127  Sum_probs=29.5

Q ss_pred             hHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhhhcccchHHHHHHHHHHHhh
Q psy3982          83 SAEIKLAKVALMTISLWFLAWTPYLVINFGGILEMVEINPLITIWGSVFAKA  134 (185)
Q Consensus        83 ~~~~~~~~~~~~i~~~f~i~~~P~~v~~~~~~~~~~~~~~~~~~~~~~l~~~  134 (185)
                      +.+....+.++.+...++.. .|.....+....+-......+..+...+.+.
T Consensus        29 ~~~~~l~Wl~~~i~~l~~~i-fP~~~~~vA~~lGi~~~~n~lf~~~i~~ll~   79 (115)
T PF10066_consen   29 RLKYSLLWLVFSIILLILSI-FPNILDWVAKLLGIGRPPNLLFYLGILFLLV   79 (115)
T ss_pred             cHHHHHHHHHHHHHHHHHHh-hhhHHHHHHHHHCCCchhHHHHHHHHHHHHH
Confidence            44556666666655555544 4887777777766655444555444444433


No 87 
>TIGR03052 PS_I_psaI photosystem I reaction center subunit VIII. Members of this protein family are PsaI, subunit VIII of the photosystem I reaction center. This protein is found in both the Cyanobacteria and the chloroplasts of plants, but is absent from non-oxygenic photosynthetic bacteria such as Rhodobacter sphaeroides. Species that contain photosystem I also contain photosystem II, which splits water and releases molecular oxygen.
Probab=22.25  E-value=1e+02  Score=14.94  Aligned_cols=19  Identities=5%  Similarity=0.294  Sum_probs=12.5

Q ss_pred             HHHHHHHHHHHHHHHHHHH
Q psy3982          32 YGFFVYFVPLILIIYAYYF   50 (185)
Q Consensus        32 ~~~~~~~~p~~i~~~~y~~   50 (185)
                      .-+++.+.|.+.|...+.-
T Consensus         8 VPlVglvfPai~Ma~lf~y   26 (31)
T TIGR03052         8 VPLVGLVFPAVFMALLFRY   26 (31)
T ss_pred             hhHHHHHHHHHHHHHHHHh
Confidence            3456677888888766553


No 88 
>TIGR00934 2a38euk potassium uptake protein, Trk family. The proteins of the Trk family are derived from Gram-negative and Gram-positive bacteria, yeast and wheat. The proteins of E. coli K12 TrkH and TrkG as well as several yeast proteins have been functionally characterized.The E. coli TrkH and TrkG proteins are complexed to two peripheral membrane proteins, TrkA, an NAD-binding protein, and TrkE, an ATP-binding protein. This complex forms the potassium uptake system. This family is specific for the eukaryotic Trk system.
Probab=21.70  E-value=5.1e+02  Score=23.29  Aligned_cols=32  Identities=13%  Similarity=0.144  Sum_probs=25.3

Q ss_pred             HHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhh
Q psy3982          84 AEIKLAKVALMTISLWFLAWTPYLVINFGGIL  115 (185)
Q Consensus        84 ~~~~~~~~~~~i~~~f~i~~~P~~v~~~~~~~  115 (185)
                      -|-|++|.++.++..|++.|--..++.+...+
T Consensus       443 iEYRaLk~L~~Iv~~Y~~~~~llG~i~l~~wi  474 (800)
T TIGR00934       443 IEYRALKCLCSIVLVYFLGFNILGFVLLLPWI  474 (800)
T ss_pred             hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            46799999999999999999877666655443


No 89 
>PF03155 Alg6_Alg8:  ALG6, ALG8 glycosyltransferase family;  InterPro: IPR004856 N-linked (asparagine-linked) glycosylation of proteins is mediated by a highly conserved pathway in eukaryotes, in which a lipid (dolichol phosphate)-linked oligosaccharide is assembled at the endoplasmic reticulum membrane prior to the transfer of the oligosaccharide moiety to the target asparagine residues. This oligosaccharide is composed of Glc(3)Man(9)GlcNAc(2). The addition of the three glucose residues is the final series of steps in the synthesis of the oligosaccharide precursor. Alg6 transfers the first glucose residue, and Alg8 transfers the second one []. In the human alg6 gene, a C-T transition, which causes Ala333 to be replaced with Val, has been identified as the cause of a congenital disorder of glycosylation, designated as type Ic OMIM:603147 []. ; GO: 0016758 transferase activity, transferring hexosyl groups, 0005789 endoplasmic reticulum membrane
Probab=21.28  E-value=1.4e+02  Score=24.80  Aligned_cols=27  Identities=22%  Similarity=0.443  Sum_probs=23.3

Q ss_pred             hHHHHHHHHHHHHHHHHHHHhhHHHHH
Q psy3982          83 SAEIKLAKVALMTISLWFLAWTPYLVI  109 (185)
Q Consensus        83 ~~~~~~~~~~~~i~~~f~i~~~P~~v~  109 (185)
                      ..-.|..+.-..++.+|.++|.|+...
T Consensus       218 ~~~~~~~~lg~~Vi~~f~~~~~PF~~~  244 (469)
T PF03155_consen  218 FSIKRLIKLGIVVIATFALSFGPFLYS  244 (469)
T ss_pred             hhHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence            456889999999999999999999743


No 90 
>PF08525 OapA_N:  Opacity-associated protein A N-terminal motif;  InterPro: IPR013731 This domain is found in the Haemophilus influenzae opacity-associated protein (OapA). It is required for efficient nasopharyngeal mucosal colonisation, and its expression is associated with a distinctive transparent colony phenotype. OapA is thought to be a secreted protein, and its expression exhibits high-frequency phase variation [, ]. This motif occurs at the N terminus of these proteins. It contains a conserved histidine followed by a run of hydrophobic residues.  Many of the proteins in this entry are unassigned peptidases belonging to MEROPS peptidase family M23B. 
Probab=20.29  E-value=1.2e+02  Score=14.37  Aligned_cols=23  Identities=9%  Similarity=-0.021  Sum_probs=11.3

Q ss_pred             chHHHHHHHHHHHHHHHHHHHhhHH
Q psy3982          82 TSAEIKLAKVALMTISLWFLAWTPY  106 (185)
Q Consensus        82 ~~~~~~~~~~~~~i~~~f~i~~~P~  106 (185)
                      .+.+++.+..+..+++.  +.|.|.
T Consensus         7 P~~Hr~~l~~l~~v~l~--ll~~Ps   29 (30)
T PF08525_consen    7 PKLHRRALIALSAVVLV--LLLWPS   29 (30)
T ss_pred             CHHHHHHHHHHHHHHHH--HHhccC
Confidence            34455555544444444  444453


No 91 
>PHA02909 hypothetical protein; Provisional
Probab=20.15  E-value=1.8e+02  Score=16.19  Aligned_cols=20  Identities=10%  Similarity=0.422  Sum_probs=8.9

Q ss_pred             HHHHHHHHHHHHHHHHHHHH
Q psy3982          32 YGFFVYFVPLILIIYAYYFI   51 (185)
Q Consensus        32 ~~~~~~~~p~~i~~~~y~~i   51 (185)
                      ++++.|+.-+.++...|.-|
T Consensus        40 lfviiflsmftilacsyvyi   59 (72)
T PHA02909         40 LFVIIFLSMFTILACSYVYI   59 (72)
T ss_pred             HHHHHHHHHHHHHHHHHHHH
Confidence            33444444444444555444


No 92 
>PF10361 DUF2434:  Protein of unknown function (DUF2434);  InterPro: IPR018830  This entry represents a family of proteins conserved in fungi. Their function is not known. 
Probab=20.12  E-value=4.2e+02  Score=20.50  Aligned_cols=46  Identities=15%  Similarity=0.128  Sum_probs=22.1

Q ss_pred             CcccccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3982          10 NLTACGTDYLTKDWHHRSYLIFYGFFVYFVPLILIIYAYYFIVRAVSVHEKQMRE   64 (185)
Q Consensus        10 ~~~~C~~~~~~~~~~~~~y~~~~~~~~~~~p~~i~~~~y~~i~~~l~~~~~~~~~   64 (185)
                      |.+.|+.-..+  ...+.+.-+.+.+.+.+-+++.       +..++||.+.-.+
T Consensus        32 N~TsCysPi~~--ig~rg~vGI~fav~f~i~lvlt-------LvnL~KHG~~~lp   77 (296)
T PF10361_consen   32 NGTSCYSPINP--IGTRGSVGIAFAVLFAIALVLT-------LVNLRKHGRLYLP   77 (296)
T ss_pred             cCcccCCCCcc--ccccchhHHHHHHHHHHHHHHH-------HHHHHHhhhhcCC
Confidence            56778553333  3334444444444444332222       2356777766543


Done!