Query         psy6892
Match_columns 190
No_of_seqs    310 out of 1606
Neff          7.9 
Searched_HMMs 46136
Date          Fri Aug 16 21:08:16 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy6892.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/6892hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG4219|consensus               99.6 2.4E-16 5.3E-21  134.6   5.8   97    8-104    29-128 (423)
  2 PHA03234 DNA packaging protein  99.4 2.2E-13 4.9E-18  116.5   2.9   89   13-103    31-124 (338)
  3 KOG4220|consensus               99.4 6.4E-14 1.4E-18  120.2  -1.1   76   15-90     31-106 (503)
  4 PHA02834 chemokine receptor-li  99.2 6.5E-12 1.4E-16  106.6   3.2   78   13-93     27-104 (323)
  5 PHA02638 CC chemokine receptor  99.1 1.9E-11 4.1E-16  107.4   2.5   80   13-95     97-176 (417)
  6 PHA03087 G protein-coupled che  99.1 1.3E-10 2.8E-15   98.5   4.4   80   12-93     38-117 (335)
  7 PHA03235 DNA packaging protein  99.0 9.9E-11 2.2E-15  102.6   2.9   89   13-103    31-124 (409)
  8 PF00001 7tm_1:  7 transmembran  98.7 1.3E-09 2.8E-14   86.4  -1.8   64   31-94      1-64  (257)
  9 KOG2087|consensus               98.4 8.1E-08 1.7E-12   81.7   1.4   85   16-100    26-117 (363)
 10 PF10320 7TM_GPCR_Srsx:  Serpen  98.3 5.3E-08 1.2E-12   80.3  -1.9   50   26-75      2-51  (257)
 11 PF05296 TAS2R:  Mammalian tast  96.9  0.0006 1.3E-08   57.6   2.2   62   12-73      4-68  (303)
 12 PF05462 Dicty_CAR:  Slime mold  96.6  0.0014 3.1E-08   55.5   2.6   55   14-68      6-60  (303)
 13 PF10328 7TM_GPCR_Srx:  Serpent  96.1   0.001 2.3E-08   55.0  -0.8   45   24-68      3-47  (274)
 14 PF11710 Git3:  G protein-coupl  96.1  0.0019 4.1E-08   51.5   0.4   52   43-94     30-81  (201)
 15 PF10324 7TM_GPCR_Srw:  Serpent  95.8  0.0021 4.6E-08   54.1  -0.6   52   23-75      5-57  (318)
 16 PF04931 DNA_pol_phi:  DNA poly  95.7  0.0016 3.4E-08   61.8  -1.5   17   57-73    589-605 (784)
 17 PF10317 7TM_GPCR_Srd:  Serpent  95.3  0.0066 1.4E-07   50.8   1.0   49   19-67      3-52  (292)
 18 PF10321 7TM_GPCR_Srt:  Serpent  95.0   0.018 3.8E-07   49.1   2.6   56   11-66     29-84  (313)
 19 KOG3064|consensus               93.0   0.024 5.2E-07   46.5  -0.4   14   26-39    124-137 (303)
 20 PF00002 7tm_2:  7 transmembran  92.9   0.061 1.3E-06   43.3   1.9   49   19-67      5-53  (242)
 21 PF03839 Sec62:  Translocation   92.4   0.087 1.9E-06   42.7   2.1   15   50-64    141-155 (224)
 22 PF02724 CDC45:  CDC45-like pro  92.1   0.056 1.2E-06   50.2   0.8   13   51-63     55-68  (622)
 23 KOG1189|consensus               92.0    0.04 8.8E-07   51.5  -0.3   12  157-168   928-939 (960)
 24 PF10327 7TM_GPCR_Sri:  Serpent  90.5    0.61 1.3E-05   39.4   5.5   65   14-78      8-76  (303)
 25 PF03402 V1R:  Vomeronasal orga  87.6    0.55 1.2E-05   39.1   3.1   45   46-92      8-53  (265)
 26 KOG2927|consensus               84.4    0.27 5.8E-06   42.3  -0.3    6   79-84    256-261 (372)
 27 PF09882 DUF2109:  Predicted me  84.2    0.26 5.7E-06   33.0  -0.4   48   26-73      5-52  (78)
 28 PF10856 DUF2678:  Protein of u  83.6    0.38 8.3E-06   34.9   0.3   23   82-104    87-109 (118)
 29 PF10326 7TM_GPCR_Str:  Serpent  82.3    0.43 9.3E-06   39.9   0.1   47   20-66      4-51  (307)
 30 COG5406 Nucleosome binding fac  79.1    0.64 1.4E-05   43.1   0.1   13   97-109   919-931 (1001)
 31 PF09726 Macoilin:  Transmembra  76.9     1.2 2.7E-05   42.0   1.3   22   49-71     76-97  (697)
 32 KOG1832|consensus               75.8    0.76 1.6E-05   44.3  -0.4   14   50-63   1314-1327(1516)
 33 PF08229 SHR3_chaperone:  ER me  73.3     5.7 0.00012   31.6   4.0   22   15-36     59-80  (196)
 34 PF03344 Daxx:  Daxx Family;  I  72.5     1.2 2.6E-05   42.2   0.0    7   50-56    346-352 (713)
 35 PF10319 7TM_GPCR_Srj:  Serpent  70.7      10 0.00022   32.3   5.2   53   20-72     10-65  (310)
 36 PF02101 Ocular_alb:  Ocular al  70.5     6.2 0.00013   34.6   3.9   57   20-76     32-94  (405)
 37 PHA03055 Hypothetical protein;  69.9     7.9 0.00017   26.1   3.5   57   20-76     13-69  (79)
 38 PF10292 7TM_GPCR_Srab:  Serpen  66.2       5 0.00011   34.1   2.5   53   14-66     16-68  (324)
 39 KOG4193|consensus               66.0       2 4.4E-05   39.9   0.1   37   23-59    328-364 (610)
 40 KOG2082|consensus               64.9       5 0.00011   38.3   2.4   30   51-82    581-610 (1075)
 41 PHA03062 putative IMV membrane  64.7      10 0.00022   25.6   3.2   56   20-75     14-69  (78)
 42 PF11446 DUF2897:  Protein of u  63.8     2.9 6.4E-05   26.3   0.5   12   27-38     14-25  (55)
 43 PF04713 Pox_I5:  Poxvirus prot  63.3      12 0.00026   25.2   3.4   54   22-75     15-68  (77)
 44 KOG3114|consensus               63.2     5.2 0.00011   33.7   2.0   31    8-38    110-140 (290)
 45 KOG0205|consensus               62.0      18 0.00038   34.3   5.3   48   20-67    758-806 (942)
 46 PF02532 PsbI:  Photosystem II   60.5      11 0.00023   21.5   2.3   20   15-34      9-28  (36)
 47 PF10316 7TM_GPCR_Srbc:  Serpen  57.2     3.4 7.3E-05   34.6  -0.1   57   16-72      7-63  (273)
 48 PF01102 Glycophorin_A:  Glycop  56.8     2.6 5.6E-05   31.0  -0.7    8   36-43     85-92  (122)
 49 CHL00024 psbI photosystem II p  56.6     9.7 0.00021   21.6   1.7   20   15-34      9-28  (36)
 50 PRK02655 psbI photosystem II r  54.8      10 0.00023   21.7   1.7   20   15-34      9-28  (38)
 51 PF10323 7TM_GPCR_Srv:  Serpent  53.8     2.4 5.1E-05   35.4  -1.6   23   45-67     29-51  (283)
 52 KOG1832|consensus               53.5     2.3   5E-05   41.2  -1.8    9   99-107  1394-1402(1516)
 53 PF10192 GpcrRhopsn4:  Rhodopsi  53.2      18 0.00039   29.7   3.6   45   15-59     29-73  (257)
 54 PRK03557 zinc transporter ZitB  52.6     9.2  0.0002   32.4   1.8   21   24-44    127-147 (312)
 55 PF11861 DUF3381:  Domain of un  50.8      24 0.00051   27.1   3.7    6   58-63     69-74  (159)
 56 PF05297 Herpes_LMP1:  Herpesvi  50.6     5.2 0.00011   33.8   0.0    7   40-46     71-77  (381)
 57 PF14851 FAM176:  FAM176 family  50.2     5.8 0.00013   30.3   0.2   34   29-63      4-37  (153)
 58 PF10318 7TM_GPCR_Srh:  Serpent  50.0      17 0.00037   30.1   3.0   60   17-76      6-70  (302)
 59 KOG1482|consensus               49.6     6.4 0.00014   34.3   0.4   39   57-95    233-271 (379)
 60 PF10329 DUF2417:  Region of un  49.6      11 0.00024   30.8   1.8   14   50-63    105-118 (232)
 61 KOG4349|consensus               48.2     8.8 0.00019   28.2   0.9   19   28-46     61-79  (143)
 62 KOG2592|consensus               48.2      18  0.0004   31.8   3.0   30   10-39    177-206 (426)
 63 COG2181 NarI Nitrate reductase  46.9      18 0.00039   29.5   2.5   30   43-72    118-147 (228)
 64 COG5406 Nucleosome binding fac  46.7     4.2 9.2E-05   37.9  -1.2    9  157-165   978-986 (1001)
 65 PF14940 TMEM219:  Transmembran  46.5      12 0.00025   30.5   1.4   45   17-63     11-60  (223)
 66 PHA02702 ORF033 IMV membrane p  46.2      15 0.00033   24.5   1.7   26   51-76     41-66  (78)
 67 COG1230 CzcD Co/Zn/Cd efflux s  45.9      12 0.00025   31.8   1.4   57   20-78    126-183 (296)
 68 PRK13183 psbN photosystem II r  44.9      29 0.00063   20.9   2.6   32   17-48      9-40  (46)
 69 PHA03099 epidermal growth fact  44.0      25 0.00053   26.2   2.6   32   14-45    100-131 (139)
 70 TIGR00869 sec62 protein transl  43.4      16 0.00036   29.8   1.8    6   80-85    184-189 (232)
 71 CHL00020 psbN photosystem II p  43.3      30 0.00064   20.6   2.4   32   17-48      6-37  (43)
 72 PF10856 DUF2678:  Protein of u  43.2      33 0.00071   25.0   3.1    9   50-58     62-70  (118)
 73 PF02009 Rifin_STEVOR:  Rifin/s  42.4     7.2 0.00016   33.1  -0.3   21   24-44    263-283 (299)
 74 PF15330 SIT:  SHP2-interacting  42.1      12 0.00026   26.9   0.7   30   16-45      2-31  (107)
 75 KOG4564|consensus               42.0      18  0.0004   32.6   2.1   51   20-70    150-200 (473)
 76 PF13074 DUF3938:  Protein of u  41.6      66  0.0014   21.6   4.2   17   92-108    78-94  (102)
 77 PF03348 Serinc:  Serine incorp  41.1      46 0.00099   29.7   4.4   55   12-69    179-235 (429)
 78 KOG0476|consensus               40.3      18  0.0004   34.6   1.9   31   53-83    372-402 (931)
 79 PF12606 RELT:  Tumour necrosis  40.0     7.7 0.00017   23.9  -0.4    9   23-31      9-17  (50)
 80 PF06398 Pex24p:  Integral pero  38.1      39 0.00085   29.0   3.5   52   17-69     15-66  (359)
 81 PF05745 CRPA:  Chlamydia 15 kD  38.0      47   0.001   24.7   3.3   10   57-66    104-113 (150)
 82 COG4420 Predicted membrane pro  38.0      67  0.0015   25.4   4.4   38   17-59     60-97  (191)
 83 PF05915 DUF872:  Eukaryotic pr  37.4      72  0.0016   23.1   4.3   14   19-32     48-61  (115)
 84 PF09802 Sec66:  Preprotein tra  36.7     5.6 0.00012   31.5  -1.8   45   23-68     10-57  (190)
 85 PF14494 DUF4436:  Domain of un  35.8      25 0.00054   29.2   1.8   29   20-48    173-201 (256)
 86 PF02468 PsbN:  Photosystem II   35.6      37 0.00079   20.2   2.0   31   18-48      7-37  (43)
 87 PF05934 MCLC:  Mid-1-related c  35.2      15 0.00032   33.3   0.5   27   14-40    181-207 (549)
 88 PF09788 Tmemb_55A:  Transmembr  35.0      55  0.0012   27.1   3.6   12   16-27    199-210 (256)
 89 PF03006 HlyIII:  Haemolysin-II  34.9      77  0.0017   24.7   4.5   42   13-54    106-147 (222)
 90 PF07787 DUF1625:  Protein of u  34.8      72  0.0016   26.0   4.4   10   50-59    222-231 (248)
 91 PF06210 DUF1003:  Protein of u  34.5 1.2E+02  0.0026   21.7   5.0   37   18-59      7-43  (108)
 92 COG0341 SecF Preprotein transl  34.1      17 0.00036   31.0   0.6   46   21-66    191-239 (305)
 93 PF14068 YuiB:  Putative membra  33.9      54  0.0012   23.2   3.0   19   50-68     63-81  (102)
 94 TIGR01477 RIFIN variant surfac  33.4      15 0.00032   31.9   0.2   29   18-46    311-339 (353)
 95 PF04789 DUF621:  Protein of un  33.1      41  0.0009   28.4   2.7   36   33-68     33-68  (305)
 96 PF03729 DUF308:  Short repeat   33.1      82  0.0018   19.6   3.7   21   46-66     49-69  (72)
 97 PF13903 Claudin_2:  PMP-22/EMP  32.9      55  0.0012   24.1   3.3   23   22-44     76-98  (172)
 98 PTZ00046 rifin; Provisional     32.9      16 0.00034   31.8   0.2   28   19-46    317-344 (358)
 99 KOG1056|consensus               32.4      49  0.0011   32.3   3.4   43   17-59    561-610 (878)
100 PF14654 Epiglycanin_C:  Mucin,  32.1      35 0.00076   24.1   1.8   23   12-34     17-39  (106)
101 PF11683 DUF3278:  Protein of u  31.8      31 0.00067   25.3   1.6   26   19-44     35-60  (129)
102 PF09323 DUF1980:  Domain of un  31.7      73  0.0016   24.6   3.8    8   22-29     40-47  (182)
103 PF13127 DUF3955:  Protein of u  31.7 1.3E+02  0.0028   19.3   4.3   25   39-65     28-52  (63)
104 KOG4404|consensus               31.6      21 0.00045   30.7   0.7   17   47-63    156-172 (350)
105 PRK13664 hypothetical protein;  31.2      38 0.00082   21.5   1.7   16   19-34      9-24  (62)
106 KOG2443|consensus               31.2     9.1  0.0002   33.0  -1.5   27   58-84    152-178 (362)
107 PF15145 DUF4577:  Domain of un  30.8      30 0.00064   25.1   1.3   27   14-40     60-86  (128)
108 KOG3850|consensus               30.4      50  0.0011   29.1   2.8   37   18-57    398-434 (455)
109 PF07226 DUF1422:  Protein of u  30.3      10 0.00022   27.6  -1.2   27   42-68     53-79  (117)
110 KOG1281|consensus               30.2   1E+02  0.0022   28.7   4.9   29   11-39    469-497 (586)
111 PHA02974 putative IMV membrane  30.2      72  0.0016   21.6   3.0   56   20-75     13-69  (81)
112 KOG2442|consensus               29.9      11 0.00023   34.1  -1.3   29   51-79    306-334 (541)
113 PF11044 TMEMspv1-c74-12:  Plec  29.8      62  0.0013   19.4   2.3   16   16-31      7-22  (49)
114 PF12191 stn_TNFRSF12A:  Tumour  28.9      19 0.00041   26.6   0.0   14   18-31     83-96  (129)
115 PF02118 Srg:  Srg family chemo  28.8      22 0.00047   28.9   0.4   45   22-66      5-52  (275)
116 PRK06758 hypothetical protein;  28.6 1.3E+02  0.0028   21.1   4.2   17   92-108   102-118 (128)
117 cd07912 Tweety_N N-terminal do  28.2   1E+02  0.0022   27.6   4.4   15   49-63     79-93  (418)
118 PF14184 YrvL:  Regulatory prot  28.2      61  0.0013   24.0   2.7   20   16-35     41-60  (132)
119 PRK11056 hypothetical protein;  28.0      15 0.00033   26.8  -0.6   29   41-69     52-80  (120)
120 PF10873 DUF2668:  Protein of u  27.8     8.8 0.00019   29.0  -1.9   25   16-40     63-87  (155)
121 KOG0473|consensus               27.6      60  0.0013   27.2   2.7   35   21-55    265-300 (326)
122 COG2116 FocA Formate/nitrite f  27.3      38 0.00082   28.3   1.6   43   21-63    158-201 (265)
123 PTZ00415 transmission-blocking  27.3      14 0.00031   38.6  -1.1    6   29-34     30-35  (2849)
124 cd08763 Cyt_b561_CYB561 Verteb  27.3      84  0.0018   23.6   3.3   45   20-64     12-58  (143)
125 COG5130 YIP3 Prenylated rab ac  27.1      33 0.00072   26.0   1.1   23   48-70    113-135 (169)
126 KOG0065|consensus               26.4      51  0.0011   33.7   2.5   35   32-66   1250-1284(1391)
127 PRK11562 nitrite transporter N  26.0      36 0.00078   28.4   1.2   34   21-54    150-183 (268)
128 PF15109 TMEM125:  TMEM125 prot  25.9 1.8E+02  0.0038   20.6   4.4   46   16-61     21-66  (112)
129 PHA03164 hypothetical protein;  25.8      52  0.0011   22.1   1.7   22   46-67     56-77  (88)
130 KOG2632|consensus               25.6      60  0.0013   27.0   2.4   58   14-71     18-79  (258)
131 PF06024 DUF912:  Nucleopolyhed  25.5      73  0.0016   22.3   2.6   11   34-44     81-91  (101)
132 COG4970 FimT Tfp pilus assembl  24.8      76  0.0017   24.9   2.8   44   39-82     36-79  (181)
133 PF12304 BCLP:  Beta-casein lik  24.7   1E+02  0.0022   24.4   3.4   44   20-66     46-90  (188)
134 KOG3832|consensus               24.7 2.5E+02  0.0053   23.0   5.6   62   35-100   225-286 (319)
135 PF10267 Tmemb_cc2:  Predicted   24.3      50  0.0011   29.2   1.8   15   43-57    367-381 (395)
136 COG1272 Predicted membrane pro  24.2 1.6E+02  0.0034   24.0   4.5   42   19-60    119-160 (226)
137 COG5456 Predicted integral mem  24.1      50  0.0011   25.2   1.6   34   29-62     30-63  (166)
138 KOG4536|consensus               24.1 1.1E+02  0.0024   25.9   3.6   43   23-65    200-244 (347)
139 COG3125 CyoD Heme/copper-type   23.9      79  0.0017   22.8   2.5   20   46-65     45-64  (111)
140 PF01545 Cation_efflux:  Cation  23.8      20 0.00043   29.3  -0.7   10   56-65    143-152 (284)
141 PF13120 DUF3974:  Domain of un  23.6      53  0.0011   23.1   1.5   17   23-39     10-26  (126)
142 PF09801 SYS1:  Integral membra  23.5 1.3E+02  0.0028   22.7   3.7   10   50-59     93-102 (144)
143 KOG2513|consensus               23.4 1.4E+02  0.0031   28.0   4.6   52   15-66    532-586 (647)
144 KOG2302|consensus               23.4      76  0.0016   31.9   2.9   24   43-66   1191-1214(1956)
145 PF06706 CTV_P6:  Citrus triste  23.2 1.5E+02  0.0032   17.7   3.1   15   28-42     20-34  (51)
146 PF09971 DUF2206:  Predicted me  23.1      88  0.0019   27.3   3.1   58   14-72    114-171 (367)
147 PF03839 Sec62:  Translocation   22.9      48   0.001   27.0   1.3    7   79-85    177-183 (224)
148 PF10726 DUF2518:  Protein of f  22.8      67  0.0014   24.3   2.0    9   51-59     44-52  (145)
149 PF05751 FixH:  FixH;  InterPro  22.5      75  0.0016   23.2   2.3   40   20-59      7-46  (146)
150 PF10190 Tmemb_170:  Putative t  21.9      50  0.0011   23.6   1.1   11   79-89     74-84  (105)
151 cd08762 Cyt_b561_CYBASC3 Verte  21.8 1.2E+02  0.0025   23.9   3.2   47   19-65     39-89  (179)
152 KOG3058|consensus               21.8      43 0.00094   29.0   0.9   54   20-73     70-132 (351)
153 PF03073 TspO_MBR:  TspO/MBR fa  21.7 1.9E+02  0.0041   21.4   4.4   20   42-61     66-85  (148)
154 COG4758 Predicted membrane pro  21.6      50  0.0011   27.0   1.2   17   17-33     10-26  (235)
155 COG5521 Predicted integral mem  21.5 2.3E+02  0.0049   23.6   4.9   14   50-63    221-234 (275)
156 COG2456 Uncharacterized conser  21.4      56  0.0012   23.6   1.3   40   31-70     19-58  (121)
157 cd08762 Cyt_b561_CYBASC3 Verte  21.3 1.8E+02   0.004   22.8   4.3   20   46-65    108-127 (179)
158 TIGR00815 sulP high affinity s  21.2      87  0.0019   28.8   2.8   54   17-70    261-315 (563)
159 PF14798 Ca_hom_mod:  Calcium h  21.0 3.4E+02  0.0075   22.4   6.0   12   21-32     20-31  (251)
160 KOG2450|consensus               20.9 1.3E+02  0.0028   27.5   3.7   42   29-80    184-225 (501)
161 KOG3145|consensus               20.9 2.2E+02  0.0047   24.5   4.8   26   79-104   294-319 (372)
162 PHA02910 hypothetical protein;  20.8      43 0.00092   24.5   0.6   23   14-36     23-45  (171)
163 PRK05771 V-type ATP synthase s  20.7 1.4E+02  0.0031   27.8   4.2   43   24-66    359-409 (646)
164 PF12597 DUF3767:  Protein of u  20.7 1.1E+02  0.0023   22.2   2.7   16   44-59     64-79  (118)
165 TIGR01297 CDF cation diffusion  20.6      30 0.00065   28.1  -0.3   42   25-66    100-142 (268)
166 KOG1734|consensus               20.4   1E+02  0.0022   26.0   2.8   34    6-39    129-162 (328)
167 KOG4452|consensus               20.3 2.4E+02  0.0052   18.5   3.9   23   37-59     39-61  (79)
168 TIGR00834 ae anion exchange pr  20.1 1.7E+02  0.0036   28.9   4.5   42   31-72    672-715 (900)

No 1  
>KOG4219|consensus
Probab=99.64  E-value=2.4e-16  Score=134.59  Aligned_cols=97  Identities=26%  Similarity=0.349  Sum_probs=84.0

Q ss_pred             CCCchhhHHHHHHHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhhcccceeeecccccccccccc
Q psy6892           8 GLDAASWNIWAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGILTLVRVRTNLSGAVL   87 (190)
Q Consensus         8 ~~~~~~~~~~~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~lP~~l~~~l~~~~~~s~~lC   87 (190)
                      ..+...+.+++++|+++.+++++||++|+|++..+|++|+.+|+||+|||+||+++++++.|+.....+...|++|.++|
T Consensus        29 ~lp~~~~~~wai~yg~l~~vAv~GN~iVlwIil~hrrMRtvtnyfL~NLAfADl~~s~Fn~~f~f~yal~~~W~~G~f~C  108 (423)
T KOG4219|consen   29 VLPAWQQALWAIAYGLLVFVAVVGNLIVLWIILAHRRMRTVTNYFLVNLAFADLSMSIFNTVFNFQYALHQEWYFGSFYC  108 (423)
T ss_pred             cCCHHHHHHHHHHHHHHHHHHHhcCceEEEEEeehhehhhhHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhcccccccee
Confidence            33555677888999999999999999999999999999999999999999999999999999999988889999999999


Q ss_pred             eeeccchh---hhhhhhhcc
Q psy6892          88 RLKKNILC---ENQLFKAGA  104 (190)
Q Consensus        88 ~i~~~i~c---~~slfSl~s  104 (190)
                      ++..++.-   .++.|++.+
T Consensus       109 ~f~nf~~itav~vSVfTlvA  128 (423)
T KOG4219|consen  109 RFVNFFPITAVFVSVFTLVA  128 (423)
T ss_pred             eeccccchhhhhHhHHHHHH
Confidence            98765554   445555443


No 2  
>PHA03234 DNA packaging protein UL33; Provisional
Probab=99.37  E-value=2.2e-13  Score=116.47  Aligned_cols=89  Identities=15%  Similarity=0.154  Sum_probs=67.3

Q ss_pred             hhHHHHHHHHHHHHhhhccchheeeee--eccCCCCchhHHHHHHHHHHHHHHhhhhcccceeeecccccccccccceee
Q psy6892          13 SWNIWAVVTLILVFCTATGNILVCLAI--SWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGILTLVRVRTNLSGAVLRLK   90 (190)
Q Consensus        13 ~~~~~~v~~~ii~vlgi~gN~lvi~vi--~~~~~L~~~~~~fl~nLAvaDLl~~l~~lP~~l~~~l~~~~~~s~~lC~i~   90 (190)
                      .+.++.++|.+++++|++||++|++++  ++++++|+++|+|++|||+||++++++ +|+.+.. ....|++|..+|++.
T Consensus        31 ~~~~~~~~y~~vf~~gl~gN~lvl~v~~~~~~~~~rt~tn~fi~NLAvaDLL~~l~-lp~~~~~-~~~~w~fG~~lCk~~  108 (338)
T PHA03234         31 AQILESAINGIMLTLIIPMIIIVICTLIIYHKVAKHNATSFYLITLFASDFLHMLC-VFFLTLN-REALFNFNQAFCQCV  108 (338)
T ss_pred             HHHHhhHHHHHHHHHHhhhHHHHHHHHHHHhccccccHHHHHHHHHHHHHHHHHHH-HHHHHHH-HhCCccCchhHHHHH
Confidence            456777899999999999999999854  456677999999999999999999765 4544433 245699999999986


Q ss_pred             ccch---hhhhhhhhc
Q psy6892          91 KNIL---CENQLFKAG  103 (190)
Q Consensus        91 ~~i~---c~~slfSl~  103 (190)
                      +++.   +.+++++++
T Consensus       109 ~~~~~~~~~~Si~~L~  124 (338)
T PHA03234        109 LFIYHASCSYSICMLA  124 (338)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            5333   344444443


No 3  
>KOG4220|consensus
Probab=99.36  E-value=6.4e-14  Score=120.18  Aligned_cols=76  Identities=34%  Similarity=0.382  Sum_probs=71.5

Q ss_pred             HHHHHHHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhhcccceeeecccccccccccceee
Q psy6892          15 NIWAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGILTLVRVRTNLSGAVLRLK   90 (190)
Q Consensus        15 ~~~~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~lP~~l~~~l~~~~~~s~~lC~i~   90 (190)
                      ..++++.+++.+++++||++|++.+..+++||+..||||++||+|||++|.+.+|++.++++.+.|++|..+|.+.
T Consensus        31 v~i~~v~~~lsLVTv~GNlLVmiSfKvnrqLqTVnNYfLfSLAcADliIG~~SMnl~t~Y~lmg~W~LG~~~CdlW  106 (503)
T KOG4220|consen   31 VFIVVVTGSLSLVTVVGNLLVMISFKVNRQLQTVNNYFLFSLACADLIIGAFSMNLYTTYTLMGYWPLGPLVCDLW  106 (503)
T ss_pred             EeeehhhhHHHHHhhhccEEEEEEEEecceeeeecceeehHHHHhhhhhheeechHHHHHHHHcccccchHHHHHH
Confidence            4566778889999999999999999999999999999999999999999999999999999999999999999875


No 4  
>PHA02834 chemokine receptor-like protein; Provisional
Probab=99.21  E-value=6.5e-12  Score=106.55  Aligned_cols=78  Identities=21%  Similarity=0.272  Sum_probs=63.3

Q ss_pred             hhHHHHHHHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhhcccceeeecccccccccccceeecc
Q psy6892          13 SWNIWAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGILTLVRVRTNLSGAVLRLKKN   92 (190)
Q Consensus        13 ~~~~~~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~lP~~l~~~l~~~~~~s~~lC~i~~~   92 (190)
                      .+.++.+++++++++|++||++|++++.++++ ++++|+|++|||++|+++ ++.+|+.+.... ..|.+|..+|++.++
T Consensus        27 ~~~~~~~~~~li~v~~~~gN~lVi~vi~~~~~-~~~~n~~i~nLAiaDll~-~~~lP~~i~~~~-~~w~~g~~~C~~~~~  103 (323)
T PHA02834         27 VNYFVIVFYILLFIFGLIGNVLVIAVLIVKRF-MFVVDVYLFNIAMSDLML-VFSFPFIIHNDL-NEWIFGEFMCKLVLG  103 (323)
T ss_pred             hhhhHHHHHHHHHHHHHhhHHHHHHHHHhccc-cchhhhhhHHHHHHHHHH-HHHHHHHHHHHc-CCcCCcchHHHhHHH
Confidence            45677899999999999999999998877665 457899999999999986 567887665443 468999999998653


Q ss_pred             c
Q psy6892          93 I   93 (190)
Q Consensus        93 i   93 (190)
                      +
T Consensus       104 ~  104 (323)
T PHA02834        104 V  104 (323)
T ss_pred             H
Confidence            3


No 5  
>PHA02638 CC chemokine receptor-like protein; Provisional
Probab=99.13  E-value=1.9e-11  Score=107.40  Aligned_cols=80  Identities=15%  Similarity=0.241  Sum_probs=65.4

Q ss_pred             hhHHHHHHHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhhcccceeeecccccccccccceeecc
Q psy6892          13 SWNIWAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGILTLVRVRTNLSGAVLRLKKN   92 (190)
Q Consensus        13 ~~~~~~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~lP~~l~~~l~~~~~~s~~lC~i~~~   92 (190)
                      ...++.+++.+++++|++||++|++++. ++++|+++|++++|||++|+++. +.+|+.+... .+.|.++..+|++.+.
T Consensus        97 ~~~~l~~~y~lvfvlgliGN~LVl~il~-~k~lrt~t~i~llnLAisDLl~~-l~lPf~i~~~-~~~W~fg~~~Ck~~~~  173 (417)
T PHA02638         97 ISEYIKIFYIIIFILGLFGNAAIIMILF-CKKIKTITDIYIFNLAISDLIFV-IDFPFIIYNE-FDQWIFGDFMCKVISA  173 (417)
T ss_pred             hhhHHHHHHHHHHHHHHHHHHHHHHHHH-hccCCCHhHHHHHHHHHHHHHHH-HHHHHHHHHH-hccccccccchhhHHH
Confidence            3456778899999999999999997765 47889999999999999999884 5788776654 4579999999998654


Q ss_pred             chh
Q psy6892          93 ILC   95 (190)
Q Consensus        93 i~c   95 (190)
                      +.+
T Consensus       174 l~~  176 (417)
T PHA02638        174 SYY  176 (417)
T ss_pred             HHH
Confidence            444


No 6  
>PHA03087 G protein-coupled chemokine receptor-like protein; Provisional
Probab=99.06  E-value=1.3e-10  Score=98.55  Aligned_cols=80  Identities=19%  Similarity=0.387  Sum_probs=66.5

Q ss_pred             hhhHHHHHHHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhhcccceeeecccccccccccceeec
Q psy6892          12 ASWNIWAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGILTLVRVRTNLSGAVLRLKK   91 (190)
Q Consensus        12 ~~~~~~~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~lP~~l~~~l~~~~~~s~~lC~i~~   91 (190)
                      ....++.+++.+++++|++||++++++++++ ++|+++|+|++|||++|++.++ .+|..+.......|.++..+|++.+
T Consensus        38 ~~~~~~~~~~~~i~~~gl~gN~lvl~~~~~~-~~~~~~~~ll~~laisDll~~~-~~~~~~~~~~~~~~~~~~~~C~~~~  115 (335)
T PHA03087         38 TNSTILIVVYSTIFFFGLVGNIIVIYVLTKT-KIKTPMDIYLLNLAVSDLLFVM-TLPFQIYYYILFQWSFGEFACKIVS  115 (335)
T ss_pred             chhhHHHHHHHHHHHHHHHhhHhEEeeehhc-cccCchHHHHHHHHHHHHHHHH-hHHHHHHHHhCCCCCCCcHHHHHHH
Confidence            3456788889999999999999999999888 8999999999999999998765 5666655555677888889999765


Q ss_pred             cc
Q psy6892          92 NI   93 (190)
Q Consensus        92 ~i   93 (190)
                      ++
T Consensus       116 ~~  117 (335)
T PHA03087        116 GL  117 (335)
T ss_pred             HH
Confidence            44


No 7  
>PHA03235 DNA packaging protein UL33; Provisional
Probab=99.04  E-value=9.9e-11  Score=102.63  Aligned_cols=89  Identities=15%  Similarity=0.234  Sum_probs=61.8

Q ss_pred             hhHHHHHHHHHHHHhhhccchheeeeeeccCCC--CchhHHHHHHHHHHHHHHhhhhcccceeeecccccccccccceee
Q psy6892          13 SWNIWAVVTLILVFCTATGNILVCLAISWEKRL--QNVTNYFLMSLAITDLMVAVLVMPLGILTLVRVRTNLSGAVLRLK   90 (190)
Q Consensus        13 ~~~~~~v~~~ii~vlgi~gN~lvi~vi~~~~~L--~~~~~~fl~nLAvaDLl~~l~~lP~~l~~~l~~~~~~s~~lC~i~   90 (190)
                      .+.+..+++++++++|++||++|++++.+++++  ++++++|++|||++|+++ ++.+|+.+... ...|..+..+|++.
T Consensus        31 ~~~~~~~~~~li~vvGiigN~lVL~~~~~~~r~~~~~~~~~~I~NLAvsDLl~-l~~lP~~i~~~-~~~~~~g~~~Ck~~  108 (409)
T PHA03235         31 ARTTETFINLLIISVGGPLNLIVLVTQLLANRVHGFSTPTLYMTNLYLANLLT-VFVLPFIMLSN-QGLLSGSVAGCKFA  108 (409)
T ss_pred             hHhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCCccHHHHHHHHHHHHHH-HHHHHHHHHhc-CccccCCCCeehhH
Confidence            456788899999999999999999875543332  356789999999999987 56788776432 11223345789987


Q ss_pred             ccc---hhhhhhhhhc
Q psy6892          91 KNI---LCENQLFKAG  103 (190)
Q Consensus        91 ~~i---~c~~slfSl~  103 (190)
                      +++   .+.+++++++
T Consensus       109 ~~l~~~~~~~Si~tL~  124 (409)
T PHA03235        109 SLLYYASCTVGFATVA  124 (409)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            543   3344554443


No 8  
>PF00001 7tm_1:  7 transmembrane receptor (rhodopsin family) Rhodopsin-like GPCR superfamily signature 5-hydroxytryptamine 7 receptor signature bradykinin receptor signature gastrin receptor signature melatonin receptor signature olfactory receptor signature;  InterPro: IPR000276 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The rhodopsin-like GPCRs themselves represent a widespread protein family that includes hormone, neurotransmitter and light receptors, all of which transduce extracellular signals through interaction with guanine nucleotide-binding (G) proteins. Although their activating ligands vary widely in structure and character, the amino acid sequences of the receptors are very similar and are believed to adopt a common structural framework comprising 7 transmembrane (TM) helices [, , ].; GO: 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane; PDB: 2KI9_A 3QAK_A 2YDV_A 3VGA_A 3PWH_A 3RFM_A 3EML_A 3VG9_A 3REY_A 3UZA_A ....
Probab=98.70  E-value=1.3e-09  Score=86.44  Aligned_cols=64  Identities=34%  Similarity=0.496  Sum_probs=55.6

Q ss_pred             cchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhhcccceeeecccccccccccceeeccch
Q psy6892          31 GNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGILTLVRVRTNLSGAVLRLKKNIL   94 (190)
Q Consensus        31 gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~lP~~l~~~l~~~~~~s~~lC~i~~~i~   94 (190)
                      ||+++++++.+++++|+++++|++|||++|++++++.+|..+.....+.|.++..+|++.+++.
T Consensus         1 GN~lvi~~~~~~~~~~~~~~~~l~~Lav~Dll~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~   64 (257)
T PF00001_consen    1 GNILVILVILRSKRLRTPSNILLLNLAVADLLVGLFCIPFYIYSLLFDDWIFSSFLCRIFGFLF   64 (257)
T ss_dssp             HHHHHHHHHHHSGGG-SHHHHHHHHHHHHHHHHHHTHHHHHHHHHHHSSCTSHHHHHHHHHHHH
T ss_pred             CchhehhhhhhhccCCChhHHHHHHHHHHHHhhccccccccccccccccccccccccccccccc
Confidence            8999999999999999999999999999999999988888877776677888889998865443


No 9  
>KOG2087|consensus
Probab=98.41  E-value=8.1e-08  Score=81.75  Aligned_cols=85  Identities=14%  Similarity=0.172  Sum_probs=66.1

Q ss_pred             HHHHHHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhhcccceeeecc-------cccccccccce
Q psy6892          16 IWAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGILTLVR-------VRTNLSGAVLR   88 (190)
Q Consensus        16 ~~~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~lP~~l~~~l~-------~~~~~s~~lC~   88 (190)
                      ++-+..+++.+++++||++|++++...+...++..++++|||++|+++++....+..+....       ..||..+..|+
T Consensus        26 ~lRi~vW~i~~lAi~gN~~Vl~~~~~~~~~~~~~~~li~~la~ad~~mGiYl~~ia~vD~~~~gey~~~ai~W~tg~gC~  105 (363)
T KOG2087|consen   26 ILRISVWVIALLAIVGNLLVLLTRFTSRYELNSHRFLICNLAFADLLMGIYLGLIASVDAKTRGEYYKHAIDWQTGLGCP  105 (363)
T ss_pred             eeeehhhhhhhHHhccCeeeeeeeeehhhhccchHHHHHHHHHHHHHcchHHHHHHHhhHHHHHHHHHHHHhhhhcCCCc
Confidence            44456678888999999999999988888888999999999999999997765555544432       23777888899


Q ss_pred             eeccchhhhhhh
Q psy6892          89 LKKNILCENQLF  100 (190)
Q Consensus        89 i~~~i~c~~slf  100 (190)
                      +.|+....++..
T Consensus       106 ~aGflavFASEl  117 (363)
T KOG2087|consen  106 VAGFLAVFASEL  117 (363)
T ss_pred             hHHHHHHHHHHH
Confidence            988776654433


No 10 
>PF10320 7TM_GPCR_Srsx:  Serpentine type 7TM GPCR chemoreceptor Srsx;  InterPro: IPR019424 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class sx (Srsx), which is a solo family amongst the superfamilies of chemoreceptors. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. 
Probab=98.31  E-value=5.3e-08  Score=80.29  Aligned_cols=50  Identities=20%  Similarity=0.337  Sum_probs=45.4

Q ss_pred             HhhhccchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhhcccceeee
Q psy6892          26 FCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGILTL   75 (190)
Q Consensus        26 vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~lP~~l~~~   75 (190)
                      ++|++||++++++++++|+||+|++++++.+|++|++.++..+|..+..+
T Consensus         2 ~ig~~gN~~~i~~~~~~~~Lrs~~~~li~~~~~~d~~~~~~~~~~~~~~~   51 (257)
T PF10320_consen    2 IIGLFGNLLLIILIFRNKSLRSPCYILICILCFADLICLLGTLPFMLFLF   51 (257)
T ss_pred             EEEEEccHHHHHHHHhccccccchHHHHHHHHHHHHHHHhhHHHHHHHHH
Confidence            57999999999999999999999999999999999999888888776443


No 11 
>PF05296 TAS2R:  Mammalian taste receptor protein (TAS2R);  InterPro: IPR007960 This family consists of several forms of mammalian taste receptor proteins (TAS2Rs). TAS2Rs are G protein-coupled receptors expressed in subsets of taste receptor cells of the tongue and palate epithelia and are organised in the genome in clusters. The proteins are genetically linked to loci that influence bitter perception in mice and humans [].; GO: 0004930 G-protein coupled receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0050909 sensory perception of taste, 0016021 integral to membrane
Probab=96.85  E-value=0.0006  Score=57.63  Aligned_cols=62  Identities=18%  Similarity=0.245  Sum_probs=45.8

Q ss_pred             hhhHHHHHHHHHHHHhhhccchheeeeeec---cCCCCchhHHHHHHHHHHHHHHhhhhccccee
Q psy6892          12 ASWNIWAVVTLILVFCTATGNILVCLAISW---EKRLQNVTNYFLMSLAITDLMVAVLVMPLGIL   73 (190)
Q Consensus        12 ~~~~~~~v~~~ii~vlgi~gN~lvi~vi~~---~~~L~~~~~~fl~nLAvaDLl~~l~~lP~~l~   73 (190)
                      ....++.++.++.+++|++||+.++++...   +++.-+|.+..+++||++.++.-.+.+...+.
T Consensus         4 ~~~~i~~~i~~~~~~~Gi~~N~FI~~vn~~~w~k~~~l~~~d~IL~~La~sr~~l~~~~~~~~~~   68 (303)
T PF05296_consen    4 SLEIIFLIILVVEFIIGILGNGFIVLVNCSDWVKSRKLSPSDQILTSLAISRILLQWVILLNSFL   68 (303)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHcCCCCChHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            445667888889999999999998766543   33334689999999999998876554433333


No 12 
>PF05462 Dicty_CAR:  Slime mold cyclic AMP receptor
Probab=96.59  E-value=0.0014  Score=55.48  Aligned_cols=55  Identities=20%  Similarity=0.330  Sum_probs=47.1

Q ss_pred             hHHHHHHHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhhc
Q psy6892          14 WNIWAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVM   68 (190)
Q Consensus        14 ~~~~~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~l   68 (190)
                      ...+.++..+..+++++|.++++..+++.+++|++.+-++.-++++|++..+..+
T Consensus         6 ~~~~~~i~~~~s~lSllGclfiI~tf~~~k~~r~~~~rli~yl~~~~ll~~v~~~   60 (303)
T PF05462_consen    6 IRTLYAIELVASVLSLLGCLFIIITFCLFKRLRKPINRLIFYLSIANLLTNVASM   60 (303)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCccHHHHHHHHHHHHHHHHHHHH
Confidence            3456677778889999999999999999999999999999999999998775543


No 13 
>PF10328 7TM_GPCR_Srx:  Serpentine type 7TM GPCR chemoreceptor Srx;  InterPro: IPR019430 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class x (Srx) from the Srg superfamily [, ]. Srg receptors contain seven hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures. 
Probab=96.13  E-value=0.001  Score=55.01  Aligned_cols=45  Identities=22%  Similarity=0.262  Sum_probs=40.4

Q ss_pred             HHHhhhccchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhhc
Q psy6892          24 LVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVM   68 (190)
Q Consensus        24 i~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~l   68 (190)
                      +.++|++.|+++++.+.+.+.+|++.+++..+.|++|++++++.+
T Consensus         3 ~s~~G~~~N~~v~~~~~~~~~~~~sF~~l~~~~a~~n~i~~~~~l   47 (274)
T PF10328_consen    3 ISIIGIILNWLVFIIIFKLKSLRNSFGILCASQAIANIIICLIFL   47 (274)
T ss_pred             eeHHHHHHHHHHHHHHHhcccccCCHHHHHHHHHHHHHHHHHHHH
Confidence            467899999999999999999999999999999999999877543


No 14 
>PF11710 Git3:  G protein-coupled glucose receptor regulating Gpa2;  InterPro: IPR023041 This entry contains a functionally uncharacterised region belonging to the Git3 G-protein coupled receptor. Git3 is one of six proteins required for glucose-triggered adenylate cyclase activation, and is a G protein-coupled receptor responsible for the activation of adenylate cyclase through Gpa2 - heterotrimeric G protein alpha subunit, part of the glucose-detection pathway. Git3 contains seven predicted transmembrane domains, a third cytoplasmic loop and a cytoplasmic tail []. This is the conserved N-terminal domain of the member proteins. 
Probab=96.06  E-value=0.0019  Score=51.52  Aligned_cols=52  Identities=15%  Similarity=0.160  Sum_probs=34.7

Q ss_pred             CCCCchhHHHHHHHHHHHHHHhhhhcccceeeecccccccccccceeeccch
Q psy6892          43 KRLQNVTNYFLMSLAITDLMVAVLVMPLGILTLVRVRTNLSGAVLRLKKNIL   94 (190)
Q Consensus        43 ~~L~~~~~~fl~nLAvaDLl~~l~~lP~~l~~~l~~~~~~s~~lC~i~~~i~   94 (190)
                      ++++..-+.++.||.++|++.++..+...+.++..+.-..+..+|...|++.
T Consensus        30 ~r~~~fR~~LIl~L~~aD~~qal~~~i~~~~~l~~~~i~~~s~~C~aqGf~~   81 (201)
T PF11710_consen   30 YRRRSFRHQLILNLLLADFIQALAFLISPIRWLARGGIIAPSPFCQAQGFFL   81 (201)
T ss_pred             hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeCCCCchhhhHHHH
Confidence            3445666779999999999998876554444444444444567777765443


No 15 
>PF10324 7TM_GPCR_Srw:  Serpentine type 7TM GPCR chemoreceptor Srw;  InterPro: IPR019427 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class w (Srw), which is a solo family amongst the superfamilies of chemoreceptors. The genes encoding Srw do not appear to be under as strong an adaptive evolutionary pressure as those of Srz []. 
Probab=95.75  E-value=0.0021  Score=54.08  Aligned_cols=52  Identities=21%  Similarity=0.323  Sum_probs=41.3

Q ss_pred             HHHHhhhccchheeeeeeccCCCCc-hhHHHHHHHHHHHHHHhhhhcccceeee
Q psy6892          23 ILVFCTATGNILVCLAISWEKRLQN-VTNYFLMSLAITDLMVAVLVMPLGILTL   75 (190)
Q Consensus        23 ii~vlgi~gN~lvi~vi~~~~~L~~-~~~~fl~nLAvaDLl~~l~~lP~~l~~~   75 (190)
                      ++.++|+++|++.+.++ .+|.||+ ++|.+++.+|++|++..+..++..+...
T Consensus         5 ~~~~~g~~~N~~h~~VL-trk~mR~~~in~~l~~Iai~Dl~~~~~~~~~~~~~~   57 (318)
T PF10324_consen    5 ILSIFGLFINIFHLIVL-TRKSMRSSSINILLIGIAICDLLYMLSILIWELFFF   57 (318)
T ss_pred             EEeHHHHHHHHHHhhhc-CChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            35678999999988776 4566775 8999999999999999888777665443


No 16 
>PF04931 DNA_pol_phi:  DNA polymerase phi;  InterPro: IPR007015 Proteins of this family are predominantly nucleolar. The majority are described as transcription factor transactivators. The family also includes the fifth essential DNA polymerase (Pol5p) of Schizosaccharomyces pombe (Fission yeast) and Saccharomyces cerevisiae (Baker's yeast) (2.7.7.7 from EC). Pol5p is localized exclusively to the nucleolus and binds near or at the enhancer region of rRNA-encoding DNA repeating units.; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006351 transcription, DNA-dependent
Probab=95.74  E-value=0.0016  Score=61.85  Aligned_cols=17  Identities=18%  Similarity=0.602  Sum_probs=12.0

Q ss_pred             HHHHHHHhhhhccccee
Q psy6892          57 AITDLMVAVLVMPLGIL   73 (190)
Q Consensus        57 AvaDLl~~l~~lP~~l~   73 (190)
                      .+.|++++++.-|..+.
T Consensus       589 vlveiLLslls~~s~ll  605 (784)
T PF04931_consen  589 VLVEILLSLLSQPSALL  605 (784)
T ss_pred             HHHHHHHHHHhCcchHH
Confidence            56788888877776543


No 17 
>PF10317 7TM_GPCR_Srd:  Serpentine type 7TM GPCR chemoreceptor Srd;  InterPro: IPR019421 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents the chemoreceptor Srd []. 
Probab=95.33  E-value=0.0066  Score=50.81  Aligned_cols=49  Identities=20%  Similarity=0.247  Sum_probs=38.5

Q ss_pred             HHHHHHHHhhhccchheeeeeecc-CCCCchhHHHHHHHHHHHHHHhhhh
Q psy6892          19 VVTLILVFCTATGNILVCLAISWE-KRLQNVTNYFLMSLAITDLMVAVLV   67 (190)
Q Consensus        19 v~~~ii~vlgi~gN~lvi~vi~~~-~~L~~~~~~fl~nLAvaDLl~~l~~   67 (190)
                      +++.+++++|++.|++++++++++ ++.-+...+++.|-|++|++.++..
T Consensus         3 i~~~~~~~~~~~~n~~Ll~~i~~~tp~~l~~~~~~l~~~~~~~~~~~~~~   52 (292)
T PF10317_consen    3 IYHPIFFILGIILNILLLYLIIFKTPKSLRTYSILLLNTAIFDLISIISA   52 (292)
T ss_pred             EeHHHHHHHHHHHHHHHHHHHHHhChHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            356788899999999999877654 4444667899999999999987653


No 18 
>PF10321 7TM_GPCR_Srt:  Serpentine type 7TM GPCR chemoreceptor Srt;  InterPro: IPR019425  Chemoreception is mediated in Caenorhabditis elegans by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs) of proteins which are of the serpentine type []. Srt is a member of the Srg superfamily of chemoreceptors. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. 
Probab=94.99  E-value=0.018  Score=49.07  Aligned_cols=56  Identities=20%  Similarity=0.140  Sum_probs=47.1

Q ss_pred             chhhHHHHHHHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhh
Q psy6892          11 AASWNIWAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVL   66 (190)
Q Consensus        11 ~~~~~~~~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~   66 (190)
                      ...++++++.+++.+++.++..+++++++.+.+.+|.|.|-++.-||+.|++....
T Consensus        29 g~~~p~~G~~~~~~g~~~~~lY~p~~~~i~~~~~~k~~~ykiM~~L~i~Di~~l~~   84 (313)
T PF10321_consen   29 GVKRPILGIYFLIFGIIIIILYIPCLIAIFKKKLFKMSCYKIMFFLAIFDIIQLFI   84 (313)
T ss_pred             CCcccchhHHHHHHHHHHHHHHHHHHHHHHHhccccCcHHHHHHHHHHHHHHHHHh
Confidence            34567788888888888888899999998887777899999999999999998644


No 19 
>KOG3064|consensus
Probab=92.97  E-value=0.024  Score=46.51  Aligned_cols=14  Identities=14%  Similarity=0.007  Sum_probs=6.4

Q ss_pred             Hhhhccchheeeee
Q psy6892          26 FCTATGNILVCLAI   39 (190)
Q Consensus        26 vlgi~gN~lvi~vi   39 (190)
                      -+++-|+-..++.+
T Consensus       124 klalr~~q~~~vp~  137 (303)
T KOG3064|consen  124 KLALRGRQKKLVPI  137 (303)
T ss_pred             HHHhccCceEEeec
Confidence            44555554444333


No 20 
>PF00002 7tm_2:  7 transmembrane receptor (Secretin family);  InterPro: IPR000832 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The secretin-like GPCRs include secretin [], calcitonin [], parathyroid hormone/parathyroid hormone-related peptides [] and vasoactive intestinal peptide [], all of which activate adenylyl cyclase and the phosphatidyl-inositol-calcium pathway. These receptors contain seven transmembrane regions, in a manner reminiscent of the rhodopsins and other receptors believed to interact with G-proteins (however there is no significant sequence identity between these families, the secretin-like receptors thus bear their own unique '7TM' signature). Their N terminus is probably located on the extracellular side of the membrane and potentially glycosylated. This N-terminal region contains a long conserved region which allow the binding of large peptidic ligand such as glucagon, secretin, VIP and PACAP; this region contains five conserved cysteines residues which could be involved in disulphide bond. The C-terminal region of these receptor is probably cytoplasmic. Every receptor gene in this family is encoded on multiple exons, and several of these genes are alternatively spliced to yield functionally distinct products. ; GO: 0004930 G-protein coupled receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane; PDB: 3L2J_A 1BL1_A.
Probab=92.93  E-value=0.061  Score=43.30  Aligned_cols=49  Identities=10%  Similarity=0.181  Sum_probs=4.0

Q ss_pred             HHHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhh
Q psy6892          19 VVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLV   67 (190)
Q Consensus        19 v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~   67 (190)
                      ++..+...+++++-++++.+....|++|+..+....||+++.++..+..
T Consensus         5 ~i~~vg~~~Si~~ll~~i~~~~~~r~lr~~~~~i~~~l~~sll~~~~~~   53 (242)
T PF00002_consen    5 IISYVGCSLSIICLLLTIITYLLFRKLRSFRNKIHLNLCLSLLLANLSF   53 (242)
T ss_dssp             HHHHHHHH-----------------------------------------
T ss_pred             hHHHHHHHHHHHHHHHHHHHHHHHHhhcccchhhhhhhHHHHHHHHHHH
Confidence            3444555566666666666666668888888888899999988776543


No 21 
>PF03839 Sec62:  Translocation protein Sec62;  InterPro: IPR004728 Members of the NSCC2 family have been sequenced from various yeast, fungal and animals species including Saccharomyces cerevisiae, Drosophila melanogaster and Homo sapiens. These proteins are the Sec62 proteins, believed to be associated with the Sec61 and Sec63 constituents of the general protein secretary systems of yeast microsomes. They are also the non-selective cation (NS) channels of the mammalian cytoplasmic membrane. The yeast Sec62 protein has been shown to be essential for cell growth. The mammalian NS channel proteins have been implicated in platelet derived growth factor(PGDF) dependent single channel current in fibroblasts. These channels are essentially closed in serum deprived tissue-culture cells and are specifically opened by exposure to PDGF. These channels are reported to exhibit equal selectivity for Na+, K+ and Cs+ with low permeability to Ca2+, and no permeability to anions.; GO: 0008565 protein transporter activity, 0015031 protein transport, 0016021 integral to membrane
Probab=92.40  E-value=0.087  Score=42.73  Aligned_cols=15  Identities=20%  Similarity=0.211  Sum_probs=6.7

Q ss_pred             HHHHHHHHHHHHHHh
Q psy6892          50 NYFLMSLAITDLMVA   64 (190)
Q Consensus        50 ~~fl~nLAvaDLl~~   64 (190)
                      |+.++.|++.-++++
T Consensus       141 YlS~~~lgll~~~~~  155 (224)
T PF03839_consen  141 YLSVGALGLLGLFFA  155 (224)
T ss_pred             hhHHHHHHHHHHHHH
Confidence            444444444444443


No 22 
>PF02724 CDC45:  CDC45-like protein;  InterPro: IPR003874 CDC45 is an essential gene required for initiation of DNA replication in Saccharomyces cerevisiae (cell division control protein 45), forming a complex with MCM5/CDC46. Homologs of CDC45 have been identified in human [], mouse and the smut fungus, Melampsora spp., (tsd2 protein) among others.; GO: 0006270 DNA-dependent DNA replication initiation
Probab=92.12  E-value=0.056  Score=50.20  Aligned_cols=13  Identities=31%  Similarity=0.319  Sum_probs=5.4

Q ss_pred             HHHHHH-HHHHHHH
Q psy6892          51 YFLMSL-AITDLMV   63 (190)
Q Consensus        51 ~fl~nL-AvaDLl~   63 (190)
                      ++++|. |..||.-
T Consensus        55 vilIncGa~~dl~~   68 (622)
T PF02724_consen   55 VILINCGATVDLEE   68 (622)
T ss_pred             EEEEecCchhhHHH
Confidence            333443 3345543


No 23 
>KOG1189|consensus
Probab=91.95  E-value=0.04  Score=51.45  Aligned_cols=12  Identities=25%  Similarity=0.263  Sum_probs=5.6

Q ss_pred             hhhhhhhhhhhh
Q psy6892         157 EEEEDKQGEEKR  168 (190)
Q Consensus       157 ~~~~~~~~~~~~  168 (190)
                      ++.+++++.+.+
T Consensus       928 E~gkdwdele~e  939 (960)
T KOG1189|consen  928 ESGKDWDELERE  939 (960)
T ss_pred             hhccchhhhHHH
Confidence            444455544443


No 24 
>PF10327 7TM_GPCR_Sri:  Serpentine type 7TM GPCR chemoreceptor Sri;  InterPro: IPR019429 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents Sri, which is part of the Str superfamily of chemoreceptors.
Probab=90.55  E-value=0.61  Score=39.40  Aligned_cols=65  Identities=20%  Similarity=0.231  Sum_probs=45.4

Q ss_pred             hHHHHHHHHHHHHhhhccchheeeeeec-cCCCCchhHHHH---HHHHHHHHHHhhhhcccceeeeccc
Q psy6892          14 WNIWAVVTLILVFCTATGNILVCLAISW-EKRLQNVTNYFL---MSLAITDLMVAVLVMPLGILTLVRV   78 (190)
Q Consensus        14 ~~~~~v~~~ii~vlgi~gN~lvi~vi~~-~~~L~~~~~~fl---~nLAvaDLl~~l~~lP~~l~~~l~~   78 (190)
                      ..++...+-++++++++.|.++++.++. .+++.+-.++++   +...++|+..+++..|..+..+..+
T Consensus         8 P~~li~~~~~ig~iS~~~n~~~iyLi~fks~k~~~fry~ll~~Qi~~~l~di~~t~L~qpipLfP~~ag   76 (303)
T PF10327_consen    8 PQWLINYYHIIGVISFILNSLGIYLIIFKSPKLDNFRYYLLYFQISCTLTDIHLTFLMQPIPLFPIPAG   76 (303)
T ss_pred             cHHHHHHHHHHHHHHHHHHHHHheeEEEecCCccchhhHHHHHHHHHHHhhhhhhhhccchhhcceeEE
Confidence            3456677888999999999999877665 455555444333   3455679999988888776665544


No 25 
>PF03402 V1R:  Vomeronasal organ pheromone receptor family, V1R;  InterPro: IPR004072 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The rhodopsin-like GPCRs themselves represent a widespread protein family that includes hormone, neurotransmitter and light receptors, all of which transduce extracellular signals through interaction with guanine nucleotide-binding (G) proteins. Although their activating ligands vary widely in structure and character, the amino acid sequences of the receptors are very similar and are believed to adopt a common structural framework comprising 7 transmembrane (TM) helices [, , ]. Pheromones have evolved in all animal phyla, to signal sex and dominance status, and are responsible for stereotypical social and sexual behaviour among members of the same species. In mammals, these chemical signals are believed to be detected primarily by the vomeronasal organ (VNO), a chemosensory organ located at the base of the nasal septum []. The VNO is present in most amphibia, reptiles and non-primate mammals but is absent in birds, adult catarrhine monkeys and apes []. An active role for the human VNO in the detection of pheromones is disputed; the VNO is clearly present in the foetus but appears to be atrophied or absent in adults. Three distinct families of putative pheromone receptors have been identified in the vomeronasal organ (V1Rs, V2Rs and V3Rs). All are G protein-coupled receptors but are only distantly related to the receptors of the main olfactory system, highlighting their different role []. The V1 receptors share between 50 and 90% sequence identity but have little similarity to other families of G protein-coupled receptors. They appear to be distantly related to the mammalian T2R bitter taste receptors and the rhodopsin-like GPCRs []. In rat, the family comprises 30-40 genes. These are expressed in the apical regions of the VNO, in neurons expressing Gi2. Coupling of the receptors to this protein mediates inositol trisphosphate signalling []. A number of human V1 receptor homologues have also been found. The majority of these human sequences are pseudogenes [] but an apparently functional receptor has been identified that is expressed in the human olfactory system [].; GO: 0016503 pheromone receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane
Probab=87.62  E-value=0.55  Score=39.13  Aligned_cols=45  Identities=11%  Similarity=0.106  Sum_probs=30.8

Q ss_pred             CchhHHHHHHHHHHHHHHhhh-hcccceeeecccccccccccceeecc
Q psy6892          46 QNVTNYFLMSLAITDLMVAVL-VMPLGILTLVRVRTNLSGAVLRLKKN   92 (190)
Q Consensus        46 ~~~~~~fl~nLAvaDLl~~l~-~lP~~l~~~l~~~~~~s~~lC~i~~~   92 (190)
                      .+|++..+.|||++++++.++ ++|.....  .+...++...|++..+
T Consensus         8 ~kp~dlIl~hLa~aN~lvLl~rGip~~~~~--~~~~~~~d~gCK~v~Y   53 (265)
T PF03402_consen    8 LKPIDLILIHLALANILVLLSRGIPQTMAF--FGWKFFDDIGCKIVFY   53 (265)
T ss_pred             CCcHHHHHHHHHHHHHHHHHHhhHHHHHHH--hhcccCCCceeeeeee
Confidence            468999999999999988654 55643222  2222457888997643


No 26 
>KOG2927|consensus
Probab=84.42  E-value=0.27  Score=42.28  Aligned_cols=6  Identities=0%  Similarity=-0.379  Sum_probs=3.0

Q ss_pred             cccccc
Q psy6892          79 RTNLSG   84 (190)
Q Consensus        79 ~~~~s~   84 (190)
                      .|.|+.
T Consensus       256 ~W~FPN  261 (372)
T KOG2927|consen  256 FWLFPN  261 (372)
T ss_pred             eEeccc
Confidence            455544


No 27 
>PF09882 DUF2109:  Predicted membrane protein (DUF2109);  InterPro: IPR019214  This entry is found in various hypothetical archaeal proteins and has no known function. 
Probab=84.20  E-value=0.26  Score=33.04  Aligned_cols=48  Identities=13%  Similarity=0.225  Sum_probs=34.2

Q ss_pred             HhhhccchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhhccccee
Q psy6892          26 FCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGIL   73 (190)
Q Consensus        26 vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~lP~~l~   73 (190)
                      ++|+++-+.++.++..+.+.++.+|+..+|.+++-++....-.|+.++
T Consensus         5 i~g~Iai~~~iR~~~~~~r~~KL~yLnv~~F~iaalIaL~i~~P~g~i   52 (78)
T PF09882_consen    5 IIGIIAILMAIRIFLTKSRARKLLYLNVINFAIAALIALYIKSPMGAI   52 (78)
T ss_pred             HHHHHHHHHHHHHHHhHhHHHhhhHHHHHHHHHHHHHHHHhCCcHHHH
Confidence            344444445555666667778888999999999999987777776543


No 28 
>PF10856 DUF2678:  Protein of unknown function (DUF2678);  InterPro: IPR022564  This family of proteins has no known function. 
Probab=83.62  E-value=0.38  Score=34.85  Aligned_cols=23  Identities=17%  Similarity=-0.133  Sum_probs=9.8

Q ss_pred             cccccceeeccchhhhhhhhhcc
Q psy6892          82 LSGAVLRLKKNILCENQLFKAGA  104 (190)
Q Consensus        82 ~s~~lC~i~~~i~c~~slfSl~s  104 (190)
                      +..-+-++..++++.+.+..+++
T Consensus        87 l~Pkfr~li~~~~~~ivllci~a  109 (118)
T PF10856_consen   87 LDPKFRYLIYYNCFSIVLLCICA  109 (118)
T ss_pred             CChhHHHHHHHHHHHHHHHHHhh
Confidence            33333333444444444444443


No 29 
>PF10326 7TM_GPCR_Str:  Serpentine type 7TM GPCR chemoreceptor Str;  InterPro: IPR019428 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class r (Str) from the Str superfamily [, ]. Almost a quarter (22.5%) of str and srj family genes and pseudogenes in C. elegans appear to have been newly formed by gene duplications since the species split []. 
Probab=82.29  E-value=0.43  Score=39.90  Aligned_cols=47  Identities=15%  Similarity=0.192  Sum_probs=30.4

Q ss_pred             HHHHHHHhhhccchheeeeeeccCCCCc-hhHHHHHHHHHHHHHHhhh
Q psy6892          20 VTLILVFCTATGNILVCLAISWEKRLQN-VTNYFLMSLAITDLMVAVL   66 (190)
Q Consensus        20 ~~~ii~vlgi~gN~lvi~vi~~~~~L~~-~~~~fl~nLAvaDLl~~l~   66 (190)
                      +..+.++++++.|+++++.+.++.+.+. .--++++..|+.+++++++
T Consensus         4 ~~~~~~~~s~~~N~~Li~Li~~~s~k~~G~Yk~Lm~~fs~~~i~fs~~   51 (307)
T PF10326_consen    4 IQYIGFVLSLFLNSLLIYLILTKSPKSLGSYKYLMIYFSIFEIIFSIL   51 (307)
T ss_pred             hHHHHHHHHHHHHHHHHHHHHhccCCCCCCEEEEEehhHHHHHHHHHH
Confidence            3456677889999999988765433332 2235666667777776655


No 30 
>COG5406 Nucleosome binding factor SPN, SPT16 subunit [Transcription / DNA replication, recombination, and repair / Chromatin structure and dynamics]
Probab=79.11  E-value=0.64  Score=43.08  Aligned_cols=13  Identities=15%  Similarity=0.136  Sum_probs=5.6

Q ss_pred             hhhhhhccccccc
Q psy6892          97 NQLFKAGARDETA  109 (190)
Q Consensus        97 ~slfSl~se~~~~  109 (190)
                      ...|....+.++.
T Consensus       919 gW~fL~~gsddE~  931 (1001)
T COG5406         919 GWSFLMVGSDDES  931 (1001)
T ss_pred             cceeeecCCcccc
Confidence            3344444444444


No 31 
>PF09726 Macoilin:  Transmembrane protein;  InterPro: IPR019130  This entry represents the multi-pass transmembrane protein Macoilin, which is highly conserved in eukaryotes. ; GO: 0016021 integral to membrane
Probab=76.89  E-value=1.2  Score=41.98  Aligned_cols=22  Identities=18%  Similarity=0.508  Sum_probs=13.6

Q ss_pred             hHHHHHHHHHHHHHHhhhhcccc
Q psy6892          49 TNYFLMSLAITDLMVAVLVMPLG   71 (190)
Q Consensus        49 ~~~fl~nLAvaDLl~~l~~lP~~   71 (190)
                      +-+|+|-.+.+||++ ++++|+.
T Consensus        76 ~~~~~~~~~~~d~~~-~~~~p~~   97 (697)
T PF09726_consen   76 SVFFVCIAFTSDLIC-LFFIPVH   97 (697)
T ss_pred             HHHHHHHHHHHHHHH-HHHHHHH
Confidence            345555555678876 5666653


No 32 
>KOG1832|consensus
Probab=75.83  E-value=0.76  Score=44.31  Aligned_cols=14  Identities=29%  Similarity=0.430  Sum_probs=6.1

Q ss_pred             HHHHHHHHHHHHHH
Q psy6892          50 NYFLMSLAITDLMV   63 (190)
Q Consensus        50 ~~fl~nLAvaDLl~   63 (190)
                      ++....|-++|...
T Consensus      1314 ~VmYa~~~~~d~~s 1327 (1516)
T KOG1832|consen 1314 DVMYAMLNIEDVMS 1327 (1516)
T ss_pred             cchhhhhhhhhhhh
Confidence            34444444444443


No 33 
>PF08229 SHR3_chaperone:  ER membrane protein SH3 ;  InterPro: IPR013248 This family of proteins are membrane localised chaperones that are required for correct plasma membrane localisation of amino acid permeases (AAPs) []. Shr3 prevents AAPs proteins from aggregating and assists in their correct folding. In the absence of Shr3, AAPs are retained in the ER.
Probab=73.31  E-value=5.7  Score=31.60  Aligned_cols=22  Identities=14%  Similarity=0.066  Sum_probs=12.1

Q ss_pred             HHHHHHHHHHHHhhhccchhee
Q psy6892          15 NIWAVVTLILVFCTATGNILVC   36 (190)
Q Consensus        15 ~~~~v~~~ii~vlgi~gN~lvi   36 (190)
                      .++..++.++..+|++|.++=+
T Consensus        59 ~~v~~~Lh~v~~lglig~~iKl   80 (196)
T PF08229_consen   59 PIVKYILHIVIGLGLIGLLIKL   80 (196)
T ss_pred             hHHHHHHHHHHHHHHHHHHHHH
Confidence            3444455556666676665443


No 34 
>PF03344 Daxx:  Daxx Family;  InterPro: IPR005012  Daxx is a ubiquitously expressed protein that functions, in part, as a transcriptional co-repressor through its interaction with a growing number of nuclear, DNA-associated proteins. Human Daxx contains four structural domains commonly found in transcriptional regulatory proteins: two predicted paired amphipathic helices, an acid-rich domain and a Ser/Pro/Thr (SPT)-rich domain. The post-translational modification status of the SPT-domain of hDaxx regulates its association with transcription factors such as Pax3 and ETS-1, effectively bringing hDaxx to sites of active transcription. Through its presence at the site of active transcription, hDaxx could then be able to associate with acetylated histones present in the nucleosomes and Dek that is associated with chromatin. Through its association with the SPT-domain of hDaxx, histone deacetylases may also be brought to the site of active transcription. As a consequence, nucleosomes in the vicinity of the site of active transcription will have the histone tails deacetylated, allowing the deactylated tail to bind to DNA, thereby leading to an inactive chromatin structure and transcriptional repression [].  The Daxx protein (also known as the Fas-binding protein) is thought to play a role in apoptosis as a component of nuclear promyelocytic leukemia protein (PML) oncogenic domains (PODS). Daxx associates with PODs through a direct interaction with PML, a critical component of PODs. The interaction is a dynamic, cell cycle regulated event and is dependent on the post-translational modification of PML by the small ubiquitin-related modifier SUMO-1. ; PDB: 2KZS_A 2KZU_A.
Probab=72.50  E-value=1.2  Score=42.17  Aligned_cols=7  Identities=29%  Similarity=0.130  Sum_probs=0.0

Q ss_pred             HHHHHHH
Q psy6892          50 NYFLMSL   56 (190)
Q Consensus        50 ~~fl~nL   56 (190)
                      |-|.++|
T Consensus       346 ~~~~~~l  352 (713)
T PF03344_consen  346 YNFGCHL  352 (713)
T ss_dssp             -------
T ss_pred             ccccccc
Confidence            4444444


No 35 
>PF10319 7TM_GPCR_Srj:  Serpentine type 7TM GPCR chemoreceptor Srj;  InterPro: IPR019423 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae.  This entry represents serpentine receptor class j (Srj) from the Str superfamily [, ]. The Srj family is designated as the out-group based on its location in preliminary phylogenetic analyses of the entire superfamily []. 
Probab=70.74  E-value=10  Score=32.34  Aligned_cols=53  Identities=25%  Similarity=0.310  Sum_probs=39.7

Q ss_pred             HHHHHHHhhhccchheeeeeeccCCCCch-hHHHHHHHHHHHHHHhhh--hcccce
Q psy6892          20 VTLILVFCTATGNILVCLAISWEKRLQNV-TNYFLMSLAITDLMVAVL--VMPLGI   72 (190)
Q Consensus        20 ~~~ii~vlgi~gN~lvi~vi~~~~~L~~~-~~~fl~nLAvaDLl~~l~--~lP~~l   72 (190)
                      +.-+.++++.+-|-+.++++...++.+-- --++++..|+-|++.+++  .+|..+
T Consensus        10 ~Pk~~~~lsf~~Np~fiyli~~~~~~~~G~Yr~LL~~Fa~fn~~~S~~~~~vp~~v   65 (310)
T PF10319_consen   10 IPKIFGILSFIVNPIFIYLIFTEKKSQFGNYRYLLLFFAIFNLIYSVVDLLVPICV   65 (310)
T ss_pred             HHHHHHHHHHHHhhhhheeEEcccccccccHHHHHHHHHHHHHHHHHHHHHhhhee
Confidence            44566777888899999999888877643 357888999999998876  445443


No 36 
>PF02101 Ocular_alb:  Ocular albinism type 1 protein;  InterPro: IPR001414 Ocular albinism type 1 (OA1) is an X-linked disorder characterised by severe impairment of visual acuity, retinal hypopigmentation and the presence of macromelanosomes. A novel transcript from the OA1 critical region is expressed in high levels in RNA samples from retina and from melanoma and encodes a potential integral membrane protein []. This protein is of unknown function but is known to bind heterotrimeric G proteins.; GO: 0016020 membrane
Probab=70.52  E-value=6.2  Score=34.63  Aligned_cols=57  Identities=21%  Similarity=0.165  Sum_probs=31.4

Q ss_pred             HHHHHHHhhhccchheeeeeeccC----CCCch--hHHHHHHHHHHHHHHhhhhcccceeeec
Q psy6892          20 VTLILVFCTATGNILVCLAISWEK----RLQNV--TNYFLMSLAITDLMVAVLVMPLGILTLV   76 (190)
Q Consensus        20 ~~~ii~vlgi~gN~lvi~vi~~~~----~L~~~--~~~fl~nLAvaDLl~~l~~lP~~l~~~l   76 (190)
                      +.+.-.++|++|-++-|+--.|..    +.++|  ..-.+..||+||++.++..+...++++.
T Consensus        32 vCLgSs~l~l~gallQLlp~rr~~~~~~~~~sp~~~~rIl~~la~aDlLaclGVivRS~vWl~   94 (405)
T PF02101_consen   32 VCLGSSVLSLLGALLQLLPRRRSAGPRAPARSPSSSRRILFWLAVADLLACLGVIVRSSVWLG   94 (405)
T ss_pred             hHHHHHHHHHHHHHHhhccccccccccccccCCcCCchhHHHHHHHHHHhhhhHHHHhhhhhc
Confidence            334444556666555554222211    11122  3468889999999988766555444443


No 37 
>PHA03055 Hypothetical protein; Provisional
Probab=69.87  E-value=7.9  Score=26.09  Aligned_cols=57  Identities=11%  Similarity=0.219  Sum_probs=34.9

Q ss_pred             HHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhhcccceeeec
Q psy6892          20 VTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGILTLV   76 (190)
Q Consensus        20 ~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~lP~~l~~~l   76 (190)
                      +.+++.+..+.|-.+++-.+.-++.+-..+..+.--|.+.+++..++.+|..+..+.
T Consensus        13 ~TlL~llMiisG~ali~k~~~p~r~~~~RS~~~~rVl~~lE~va~lifIPgti~LY~   69 (79)
T PHA03055         13 ITVLMLLMVISGTAMIVKELNPNDIFTMQSLKFNRAVTIFKYIGLFIYIPGTIILYA   69 (79)
T ss_pred             HHHHHHHHHHhccHHHhhhcCccceeeehhhhhHHHHHHHHHHHHHHhhccHHHHHH
Confidence            333444445566666654444443333344467777888899888888887765543


No 38 
>PF10292 7TM_GPCR_Srab:  Serpentine type 7TM GPCR receptor class ab chemoreceptor;  InterPro: IPR019408 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. Srab is part of the Sra superfamily of chemoreceptors. The expression pattern of the srab genes is biologically intriguing. Of the six promoters successfully expressed in transgenic organisms, one was exclusively expressed in the tail phasmid neurons, two were exclusively expressed in a head amphid neuron, and two were expressed both in the head and tail neurons as well as a limited number of other cells []. 
Probab=66.24  E-value=5  Score=34.06  Aligned_cols=53  Identities=13%  Similarity=0.060  Sum_probs=39.1

Q ss_pred             hHHHHHHHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhh
Q psy6892          14 WNIWAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVL   66 (190)
Q Consensus        14 ~~~~~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~   66 (190)
                      ..+..++.+++.++|++.++.+++.+..++.+|.-+-+++....++-++.++.
T Consensus        16 l~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~H~N~ril~~~~~~~~l~~~~~   68 (324)
T PF10292_consen   16 LRLSLIFNLLLSIIAFPVIIYALWKIRNSKLFHFNTRILFIVHCFSFLIHCTG   68 (324)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhchhHHHHHHHHHHHHHHHHHH
Confidence            34455567777788888888888888788888877777777777777766543


No 39 
>KOG4193|consensus
Probab=66.00  E-value=2  Score=39.92  Aligned_cols=37  Identities=14%  Similarity=0.111  Sum_probs=22.2

Q ss_pred             HHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHH
Q psy6892          23 ILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAIT   59 (190)
Q Consensus        23 ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAva   59 (190)
                      +-.++++++=++.+++.+..|++++..+...+||+++
T Consensus       328 VG~~iSlvcL~lti~ty~~~~~l~~~~~~i~~~l~~~  364 (610)
T KOG4193|consen  328 VGCIISLVCLLLTIATYLLFRKLQNDRTKIHINLCLC  364 (610)
T ss_pred             hhhHHHHHHHHHHHHHHHHHHHHHhhcchhHHHHHHH
Confidence            3334444444555555556666665557777888877


No 40 
>KOG2082|consensus
Probab=64.95  E-value=5  Score=38.35  Aligned_cols=30  Identities=20%  Similarity=0.215  Sum_probs=14.1

Q ss_pred             HHHHHHHHHHHHHhhhhcccceeeeccccccc
Q psy6892          51 YFLMSLAITDLMVAVLVMPLGILTLVRVRTNL   82 (190)
Q Consensus        51 ~fl~nLAvaDLl~~l~~lP~~l~~~l~~~~~~   82 (190)
                      |..+||||+  +.+++-.|.+-..+-.++|.+
T Consensus       581 Y~fVNLaCa--vqtLLrtPnWRPRfkyyHW~L  610 (1075)
T KOG2082|consen  581 YLFVNLACA--VQTLLRTPNWRPRFKYYHWSL  610 (1075)
T ss_pred             HHHHhHHHH--HHHHhcCCCCCccchhhhhHH
Confidence            334455554  223445565544444445543


No 41 
>PHA03062 putative IMV membrane protein; Provisional
Probab=64.74  E-value=10  Score=25.56  Aligned_cols=56  Identities=13%  Similarity=0.225  Sum_probs=34.1

Q ss_pred             HHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhhcccceeee
Q psy6892          20 VTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGILTL   75 (190)
Q Consensus        20 ~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~lP~~l~~~   75 (190)
                      +.+++.+..+.|..+++-.+.-++.+-..+..+.--+.+.+++..++.+|..+..+
T Consensus        14 ~TlL~llMivsG~ali~k~~~p~r~~~~RS~~~~rVl~~lE~va~lifIPgti~LY   69 (78)
T PHA03062         14 LTLLALLMIVSGGALLFKSLAPHRVVMMRSATFNKVVSFLEYVAILIFVPGTIALY   69 (78)
T ss_pred             HHHHHHHHHHhChHHHHhhcCccceeeehhhhhHHHHHHHHHHHHHHhhccHHHHH
Confidence            33344444556666665444433333333446777788889888888888776554


No 42 
>PF11446 DUF2897:  Protein of unknown function (DUF2897);  InterPro: IPR021550  This is a bacterial family of uncharacterised proteins. 
Probab=63.81  E-value=2.9  Score=26.29  Aligned_cols=12  Identities=25%  Similarity=0.116  Sum_probs=8.2

Q ss_pred             hhhccchheeee
Q psy6892          27 CTATGNILVCLA   38 (190)
Q Consensus        27 lgi~gN~lvi~v   38 (190)
                      -.|+||+++|-.
T Consensus        14 gvIigNia~LK~   25 (55)
T PF11446_consen   14 GVIIGNIAALKY   25 (55)
T ss_pred             HHHHhHHHHHHH
Confidence            347899887743


No 43 
>PF04713 Pox_I5:  Poxvirus protein I5;  InterPro: IPR006803 This entry represents the Poxvirus protein I5.
Probab=63.30  E-value=12  Score=25.15  Aligned_cols=54  Identities=15%  Similarity=0.285  Sum_probs=33.5

Q ss_pred             HHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhhcccceeee
Q psy6892          22 LILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGILTL   75 (190)
Q Consensus        22 ~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~lP~~l~~~   75 (190)
                      +++.+..+.|-.+++-.+.-++.+-..+..+.--|.+.|.+..++.+|..+..+
T Consensus        15 lL~llMvisG~ali~k~~ap~~~v~~rS~~~~rVl~~le~vailifIPGTIiLY   68 (77)
T PF04713_consen   15 LLMLLMVISGGALIFKRIAPHRSVMMRSATFNRVLTFLEYVAILIFIPGTIILY   68 (77)
T ss_pred             HHHHHHHHhChHHHhhccCCcceehhhhhhhHHHHHHHHHHHHHhhcccHHHHH
Confidence            333444456666665444444444444556677788888888888888776544


No 44 
>KOG3114|consensus
Probab=63.21  E-value=5.2  Score=33.68  Aligned_cols=31  Identities=6%  Similarity=0.163  Sum_probs=22.7

Q ss_pred             CCCchhhHHHHHHHHHHHHhhhccchheeee
Q psy6892           8 GLDAASWNIWAVVTLILVFCTATGNILVCLA   38 (190)
Q Consensus         8 ~~~~~~~~~~~v~~~ii~vlgi~gN~lvi~v   38 (190)
                      ...+-.+..+++...+++++++.||+..++.
T Consensus       110 ~~~PDLYGPfWI~~TlVf~l~~~g~~~~~i~  140 (290)
T KOG3114|consen  110 QDNPDLYGPFWITATLVFALAISGNLATFIR  140 (290)
T ss_pred             CCCccccccHHHHHHHHHHHHHcccHHHHHH
Confidence            3355566778888888999999998775543


No 45 
>KOG0205|consensus
Probab=61.96  E-value=18  Score=34.29  Aligned_cols=48  Identities=8%  Similarity=0.168  Sum_probs=34.3

Q ss_pred             HHHHHHHhhhccchheeeeeeccCCCC-chhHHHHHHHHHHHHHHhhhh
Q psy6892          20 VTLILVFCTATGNILVCLAISWEKRLQ-NVTNYFLMSLAITDLMVAVLV   67 (190)
Q Consensus        20 ~~~ii~vlgi~gN~lvi~vi~~~~~L~-~~~~~fl~nLAvaDLl~~l~~   67 (190)
                      ...+...++++.|.+++++..|.+..+ +|...++...-+++++.+++.
T Consensus       758 ~~a~ylqvsi~sqaliFvtrsr~w~~~erpg~~L~~af~~aqliatlia  806 (942)
T KOG0205|consen  758 MSALYLQVSIISQALIFVTRSRSWSFVERPGWLLLIAFFAAQLIATLIA  806 (942)
T ss_pred             HHhhhhhheehhceeeEEEeccCCccccCcHHHHHHHHHHHHHHHHHHH
Confidence            334455667888888888877777765 677777777777888877654


No 46 
>PF02532 PsbI:  Photosystem II reaction centre I protein (PSII 4.8 kDa protein);  InterPro: IPR003686 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection [].  This family represents the low molecular weight transmembrane protein PsbI, which is tightly associated with the D1/D2 heterodimer in PSII. The function of PsbI is unknown, but it may be involved in the assembly, dimerisation or stabilisation of PSII dimers [].; GO: 0015979 photosynthesis, 0009523 photosystem II, 0009539 photosystem II reaction center, 0016020 membrane; PDB: 3A0H_i 3ARC_I 3A0B_i 3BZ2_I 3PRQ_I 3KZI_I 3PRR_I 2AXT_i 4FBY_I 1S5L_i ....
Probab=60.51  E-value=11  Score=21.46  Aligned_cols=20  Identities=10%  Similarity=-0.094  Sum_probs=14.2

Q ss_pred             HHHHHHHHHHHHhhhccchh
Q psy6892          15 NIWAVVTLILVFCTATGNIL   34 (190)
Q Consensus        15 ~~~~v~~~ii~vlgi~gN~l   34 (190)
                      +...++++.++++|.+.|-.
T Consensus         9 y~vV~ffv~LFifGflsnDp   28 (36)
T PF02532_consen    9 YTVVIFFVSLFIFGFLSNDP   28 (36)
T ss_dssp             HHHHHHHHHHHHHHHHTTCT
T ss_pred             hhhHHHHHHHHhccccCCCC
Confidence            44556777888888888743


No 47 
>PF10316 7TM_GPCR_Srbc:  Serpentine type 7TM GPCR chemoreceptor Srbc ;  InterPro: IPR019420 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class b (Srb) from the Sra superfamily []. Srb receptors contain 6-8 hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures. Srbc is a solo family amongst the superfamilies of chemoreceptors.
Probab=57.22  E-value=3.4  Score=34.57  Aligned_cols=57  Identities=16%  Similarity=0.155  Sum_probs=37.1

Q ss_pred             HHHHHHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhhcccce
Q psy6892          16 IWAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGI   72 (190)
Q Consensus        16 ~~~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~lP~~l   72 (190)
                      +..++-++..+++++.|..+++.+...++.+++...++--..+.|.+++....+..+
T Consensus         7 iv~~i~i~~s~~~~~iN~~lL~~if~~Kk~kk~~l~LfY~Rf~~D~~~~~~~~~~~~   63 (273)
T PF10316_consen    7 IVSIIGIIFSIITCLINFYLLYSIFYSKKKKKPDLSLFYFRFAIDVFYGFSVFIYLI   63 (273)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCEEeeHHHHHHHHHHHHHHHHHHH
Confidence            344455556677788899888887766664455444555567788888877655433


No 48 
>PF01102 Glycophorin_A:  Glycophorin A;  InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=56.83  E-value=2.6  Score=31.00  Aligned_cols=8  Identities=13%  Similarity=0.115  Sum_probs=3.2

Q ss_pred             eeeeeccC
Q psy6892          36 CLAISWEK   43 (190)
Q Consensus        36 i~vi~~~~   43 (190)
                      +|+++|++
T Consensus        85 ~y~irR~~   92 (122)
T PF01102_consen   85 SYCIRRLR   92 (122)
T ss_dssp             HHHHHHHS
T ss_pred             HHHHHHHh
Confidence            34444333


No 49 
>CHL00024 psbI photosystem II protein I
Probab=56.59  E-value=9.7  Score=21.65  Aligned_cols=20  Identities=10%  Similarity=-0.094  Sum_probs=14.6

Q ss_pred             HHHHHHHHHHHHhhhccchh
Q psy6892          15 NIWAVVTLILVFCTATGNIL   34 (190)
Q Consensus        15 ~~~~v~~~ii~vlgi~gN~l   34 (190)
                      +...++++.++++|.+.|-.
T Consensus         9 y~vV~ffvsLFifGFlsnDp   28 (36)
T CHL00024          9 YTVVIFFVSLFIFGFLSNDP   28 (36)
T ss_pred             hhHHHHHHHHHHccccCCCC
Confidence            45566778888888888753


No 50 
>PRK02655 psbI photosystem II reaction center I protein I; Provisional
Probab=54.83  E-value=10  Score=21.73  Aligned_cols=20  Identities=10%  Similarity=-0.175  Sum_probs=14.5

Q ss_pred             HHHHHHHHHHHHhhhccchh
Q psy6892          15 NIWAVVTLILVFCTATGNIL   34 (190)
Q Consensus        15 ~~~~v~~~ii~vlgi~gN~l   34 (190)
                      +.+.++++.++++|.+.|-.
T Consensus         9 y~vV~ffvsLFiFGflsnDP   28 (38)
T PRK02655          9 YIVVFFFVGLFVFGFLSSDP   28 (38)
T ss_pred             hhhHHHHHHHHHcccCCCCC
Confidence            44566778888888888743


No 51 
>PF10323 7TM_GPCR_Srv:  Serpentine type 7TM GPCR chemoreceptor Srv;  InterPro: IPR019426 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae.  This entry represents serpentine receptor class v (Srv) from the Srg superfamily [, ]. Srg receptors contain seven hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures. 
Probab=53.76  E-value=2.4  Score=35.36  Aligned_cols=23  Identities=13%  Similarity=0.322  Sum_probs=19.9

Q ss_pred             CCchhHHHHHHHHHHHHHHhhhh
Q psy6892          45 LQNVTNYFLMSLAITDLMVAVLV   67 (190)
Q Consensus        45 L~~~~~~fl~nLAvaDLl~~l~~   67 (190)
                      ++++.+.++.+-+++|++..+..
T Consensus        29 ~~~~Fy~l~~~~~iaDi~~~~~~   51 (283)
T PF10323_consen   29 FKSTFYTLLIQHCIADILSMLFY   51 (283)
T ss_pred             cCCHHHHHHHHHHHHHHHHHHHH
Confidence            67999999999999999986653


No 52 
>KOG1832|consensus
Probab=53.50  E-value=2.3  Score=41.19  Aligned_cols=9  Identities=22%  Similarity=0.674  Sum_probs=3.6

Q ss_pred             hhhhccccc
Q psy6892          99 LFKAGARDE  107 (190)
Q Consensus        99 lfSl~se~~  107 (190)
                      ++-+.-.+.
T Consensus      1394 ~yEIGR~r~ 1402 (1516)
T KOG1832|consen 1394 MYEIGRRRP 1402 (1516)
T ss_pred             hhhhcccCC
Confidence            333444444


No 53 
>PF10192 GpcrRhopsn4:  Rhodopsin-like GPCR transmembrane domain;  InterPro: IPR019336 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).   This region of 270 amino acids is the seven transmembrane alpha-helical domains included within five GPCRRHODOPSN4 motifs of a G-protein-coupled-receptor (GPCR) protein, conserved from nematodes to humans []. 
Probab=53.23  E-value=18  Score=29.69  Aligned_cols=45  Identities=13%  Similarity=0.093  Sum_probs=24.2

Q ss_pred             HHHHHHHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHH
Q psy6892          15 NIWAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAIT   59 (190)
Q Consensus        15 ~~~~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAva   59 (190)
                      ..+..++++++++.++-++..+..+.+++.+|.+...+.+++.+.
T Consensus        29 ~~~~~~~~~~y~vl~~~~~~~~~~l~~~~~~h~~~~l~~~~l~l~   73 (257)
T PF10192_consen   29 LEIYLLFLLLYIVLSIISIYSIQSLKKRGLMHPVYKLFSAALLLQ   73 (257)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHH
Confidence            333334444444444444444455556666677777777776654


No 54 
>PRK03557 zinc transporter ZitB; Provisional
Probab=52.60  E-value=9.2  Score=32.38  Aligned_cols=21  Identities=14%  Similarity=0.161  Sum_probs=11.0

Q ss_pred             HHHhhhccchheeeeeeccCC
Q psy6892          24 LVFCTATGNILVCLAISWEKR   44 (190)
Q Consensus        24 i~vlgi~gN~lvi~vi~~~~~   44 (190)
                      +.++|++.|++..+..++.++
T Consensus       127 v~~~~~~~~~~~~~~~~~~~~  147 (312)
T PRK03557        127 IAVAGLLANILSFWLLHHGSE  147 (312)
T ss_pred             HHHHHHHHHHHHHHHHhcccc
Confidence            344566667665554444333


No 55 
>PF11861 DUF3381:  Domain of unknown function (DUF3381);  InterPro: IPR024576 This uncharacterised domain is found in eukaryotic rRNA methyltransferases like yeast Spb1 and mammalian homologue Ftsj3. Spb1 is required for proper assembly of pre-ribosomal particles during the biogenesis of the 60S ribosomal subunit [, ].
Probab=50.79  E-value=24  Score=27.07  Aligned_cols=6  Identities=50%  Similarity=0.540  Sum_probs=2.7

Q ss_pred             HHHHHH
Q psy6892          58 ITDLMV   63 (190)
Q Consensus        58 vaDLl~   63 (190)
                      |.||-+
T Consensus        69 c~DLKV   74 (159)
T PF11861_consen   69 CKDLKV   74 (159)
T ss_pred             HHHHHh
Confidence            345444


No 56 
>PF05297 Herpes_LMP1:  Herpesvirus latent membrane protein 1 (LMP1);  InterPro: IPR007961 This family consists of several latent membrane protein 1 or LMP1s mostly from Epstein-Barr virus (strain GD1) (HHV-4) (Human herpesvirus 4). LMP1 of HHV-4 is a 62-65 kDa plasma membrane protein possessing six membrane spanning regions, a short cytoplasmic N terminus and a long cytoplasmic carboxy tail of 200 amino acids. HHV-4 virus latent membrane protein 1 (LMP1) is essential for HHV-4 mediated transformation and has been associated with several cases of malignancies. HHV-4-like viruses in Macaca fascicularis (Cynomolgus monkeys) have been associated with high lymphoma rates in immunosuppressed monkeys [].; GO: 0019087 transformation of host cell by virus, 0016021 integral to membrane; PDB: 1CZY_E 1ZMS_B.
Probab=50.57  E-value=5.2  Score=33.80  Aligned_cols=7  Identities=29%  Similarity=0.596  Sum_probs=0.0

Q ss_pred             eccCCCC
Q psy6892          40 SWEKRLQ   46 (190)
Q Consensus        40 ~~~~~L~   46 (190)
                      ..+|+|-
T Consensus        71 lF~RrLL   77 (381)
T PF05297_consen   71 LFKRRLL   77 (381)
T ss_dssp             -------
T ss_pred             HHHHhhc
Confidence            3444443


No 57 
>PF14851 FAM176:  FAM176 family
Probab=50.20  E-value=5.8  Score=30.30  Aligned_cols=34  Identities=18%  Similarity=0.221  Sum_probs=18.1

Q ss_pred             hccchheeeeeeccCCCCchhHHHHHHHHHHHHHH
Q psy6892          29 ATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMV   63 (190)
Q Consensus        29 i~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~   63 (190)
                      |+.|.+..++.++... =+..=||+++.++.=+++
T Consensus         4 llSnsLaaya~I~~~P-E~~aLYFv~gVC~GLlLt   37 (153)
T PF14851_consen    4 LLSNSLAAYAHIRDNP-ERFALYFVSGVCAGLLLT   37 (153)
T ss_pred             HHHHHHHHHHHHHhCh-HHHHHHHHHHHHHHHHHH
Confidence            4567777666554332 233345666666554443


No 58 
>PF10318 7TM_GPCR_Srh:  Serpentine type 7TM GPCR chemoreceptor Srh;  InterPro: IPR019422 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae.  Srh is part of the Str superfamily of chemoreceptors []. 
Probab=50.03  E-value=17  Score=30.13  Aligned_cols=60  Identities=15%  Similarity=0.319  Sum_probs=35.6

Q ss_pred             HHHHHHHHHHhhhccchheeeeeecc-CCCCchhHHHHHHHH----HHHHHHhhhhcccceeeec
Q psy6892          17 WAVVTLILVFCTATGNILVCLAISWE-KRLQNVTNYFLMSLA----ITDLMVAVLVMPLGILTLV   76 (190)
Q Consensus        17 ~~v~~~ii~vlgi~gN~lvi~vi~~~-~~L~~~~~~fl~nLA----vaDLl~~l~~lP~~l~~~l   76 (190)
                      +..+.-++.+++++-|++.++++.++ ++-.+...+.+.|+.    +.|+..+++..|..+....
T Consensus         6 ~~~~~h~~~~i~~Pi~~~~~y~Il~ktp~~m~~~k~~ll~~~~~~~~~d~~~~~l~~P~~~~P~~   70 (302)
T PF10318_consen    6 YSIVLHIITIISIPIYIFGFYCILFKTPKSMKSVKWYLLNHHFWNFLLDLYLSFLIIPYFFFPSP   70 (302)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHheeEeChhhhHHHHHHHHHHHHHHHHHHHHhhhEEeeeeccchH
Confidence            34455566677777787777665543 433333444444444    4688888877776555443


No 59 
>KOG1482|consensus
Probab=49.60  E-value=6.4  Score=34.30  Aligned_cols=39  Identities=13%  Similarity=0.260  Sum_probs=30.8

Q ss_pred             HHHHHHHhhhhcccceeeecccccccccccceeeccchh
Q psy6892          57 AITDLMVAVLVMPLGILTLVRVRTNLSGAVLRLKKNILC   95 (190)
Q Consensus        57 AvaDLl~~l~~lP~~l~~~l~~~~~~s~~lC~i~~~i~c   95 (190)
                      ++.|++.++..+...++.++.+.|......|.+...+..
T Consensus       233 VlGDliQSvGV~iaa~Ii~f~P~~~i~DpICT~~FSiiv  271 (379)
T KOG1482|consen  233 VLGDLIQSVGVLIAALIIYFKPEYKIADPICTFVFSIIV  271 (379)
T ss_pred             HHHHHHHHHHHHhhheeEEecccceecCchhhhhHHHHH
Confidence            356899888888888888888899999999987654444


No 60 
>PF10329 DUF2417:  Region of unknown function (DUF2417);  InterPro: IPR019431  This entry represents a family of fungal proteins with no known function. In some cases these proteins also contain an alpha/beta hydrolase fold (IPR000073 from INTERPRO). 
Probab=49.55  E-value=11  Score=30.79  Aligned_cols=14  Identities=36%  Similarity=0.608  Sum_probs=5.7

Q ss_pred             HHHHHHHHHHHHHH
Q psy6892          50 NYFLMSLAITDLMV   63 (190)
Q Consensus        50 ~~fl~nLAvaDLl~   63 (190)
                      ++.++.|-+.|+++
T Consensus       105 ~~vl~~Lllvdlil  118 (232)
T PF10329_consen  105 NIVLAGLLLVDLIL  118 (232)
T ss_pred             HHHHHHHHHHHHHH
Confidence            33444444444433


No 61 
>KOG4349|consensus
Probab=48.22  E-value=8.8  Score=28.24  Aligned_cols=19  Identities=16%  Similarity=0.100  Sum_probs=9.1

Q ss_pred             hhccchheeeeeeccCCCC
Q psy6892          28 TATGNILVCLAISWEKRLQ   46 (190)
Q Consensus        28 gi~gN~lvi~vi~~~~~L~   46 (190)
                      .+..-++.++.++++++-|
T Consensus        61 h~~A~ilalFTLRkHkygr   79 (143)
T KOG4349|consen   61 HLGATILALFTLRKHKYGR   79 (143)
T ss_pred             HhhHHHHHHHHhhcCccce
Confidence            3444445555555555444


No 62 
>KOG2592|consensus
Probab=48.20  E-value=18  Score=31.83  Aligned_cols=30  Identities=13%  Similarity=0.191  Sum_probs=20.1

Q ss_pred             CchhhHHHHHHHHHHHHhhhccchheeeee
Q psy6892          10 DAASWNIWAVVTLILVFCTATGNILVCLAI   39 (190)
Q Consensus        10 ~~~~~~~~~v~~~ii~vlgi~gN~lvi~vi   39 (190)
                      +...+..++++.++.+++++++-+++.+.+
T Consensus       177 sr~wy~~Ll~~T~~~Y~~s~~~~~l~fv~f  206 (426)
T KOG2592|consen  177 SRFWYAALLGVTLLMYLLSLVATVLLFVYF  206 (426)
T ss_pred             hhHHHHHHHHHHHHHHHHHHHHHHHhheEe
Confidence            344566777778888888888765544333


No 63 
>COG2181 NarI Nitrate reductase gamma subunit [Energy production and conversion]
Probab=46.94  E-value=18  Score=29.48  Aligned_cols=30  Identities=13%  Similarity=0.400  Sum_probs=22.7

Q ss_pred             CCCCchhHHHHHHHHHHHHHHhhhhcccce
Q psy6892          43 KRLQNVTNYFLMSLAITDLMVAVLVMPLGI   72 (190)
Q Consensus        43 ~~L~~~~~~fl~nLAvaDLl~~l~~lP~~l   72 (190)
                      |...+|.++|+..|-++-++.++...++..
T Consensus       118 R~~St~~D~f~lilLla~~~~Gl~~~~~~~  147 (228)
T COG2181         118 RATSTPSDIFALLLLLAQLLLGLYATPFSA  147 (228)
T ss_pred             eecCChhHHHHHHHHHHHHHhhhhhhhhcc
Confidence            334578889988888888888887777654


No 64 
>COG5406 Nucleosome binding factor SPN, SPT16 subunit [Transcription / DNA replication, recombination, and repair / Chromatin structure and dynamics]
Probab=46.65  E-value=4.2  Score=37.94  Aligned_cols=9  Identities=33%  Similarity=0.284  Sum_probs=3.8

Q ss_pred             hhhhhhhhh
Q psy6892         157 EEEEDKQGE  165 (190)
Q Consensus       157 ~~~~~~~~~  165 (190)
                      ++.|++++.
T Consensus       978 E~gEDwdel  986 (1001)
T COG5406         978 EDGEDWDEL  986 (1001)
T ss_pred             ccccchhhH
Confidence            444444443


No 65 
>PF14940 TMEM219:  Transmembrane 219
Probab=46.54  E-value=12  Score=30.50  Aligned_cols=45  Identities=31%  Similarity=0.262  Sum_probs=26.4

Q ss_pred             HHHHHHHHHHhhhccchheeeeeeccCCCCch-----hHHHHHHHHHHHHHH
Q psy6892          17 WAVVTLILVFCTATGNILVCLAISWEKRLQNV-----TNYFLMSLAITDLMV   63 (190)
Q Consensus        17 ~~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~-----~~~fl~nLAvaDLl~   63 (190)
                      +.++++.+.++||.+  +.+..+.+...+++|     -|.|+.+|+=.|+++
T Consensus        11 lVvF~l~Ll~~aI~~--l~Lg~yi~~~~l~nPDi~~DWN~fL~~ls~l~fC~   60 (223)
T PF14940_consen   11 LVVFTLCLLLLAISF--LCLGYYIKRNELKNPDIPQDWNTFLLSLSQLDFCV   60 (223)
T ss_pred             chHHHHHHHHHHHHH--heeeeEecccCCCcccchhhHHHHHHhhcCeeEcc
Confidence            344444444455433  333333355566664     478998888878777


No 66 
>PHA02702 ORF033 IMV membrane protein; Provisional
Probab=46.21  E-value=15  Score=24.52  Aligned_cols=26  Identities=19%  Similarity=0.272  Sum_probs=18.7

Q ss_pred             HHHHHHHHHHHHHhhhhcccceeeec
Q psy6892          51 YFLMSLAITDLMVAVLVMPLGILTLV   76 (190)
Q Consensus        51 ~fl~nLAvaDLl~~l~~lP~~l~~~l   76 (190)
                      .+.--|.+.+++..++.+|..++.+.
T Consensus        41 ~~~Rvltvle~va~l~~IPgtIiLY~   66 (78)
T PHA02702         41 GALRVLTVLDFVSLLTTIPCTIILYF   66 (78)
T ss_pred             cchhHHHHHHHHHHHHHhchHHHHHH
Confidence            45556788888888888887766543


No 67 
>COG1230 CzcD Co/Zn/Cd efflux system component [Inorganic ion transport and metabolism]
Probab=45.93  E-value=12  Score=31.81  Aligned_cols=57  Identities=16%  Similarity=0.065  Sum_probs=34.8

Q ss_pred             HHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHH-HHHHHhhhhcccceeeeccc
Q psy6892          20 VTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAI-TDLMVAVLVMPLGILTLVRV   78 (190)
Q Consensus        20 ~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAv-aDLl~~l~~lP~~l~~~l~~   78 (190)
                      .++++.++|++.|++..+.+.+.+.  ...|.=-..|.+ +|++.++..+...++.++.+
T Consensus       126 ~ml~va~~GL~vN~~~a~ll~~~~~--~~lN~r~a~LHvl~D~Lgsv~vIia~i~i~~~~  183 (296)
T COG1230         126 GMLVVAIIGLVVNLVSALLLHKGHE--ENLNMRGAYLHVLGDALGSVGVIIAAIVIRFTG  183 (296)
T ss_pred             chHHHHHHHHHHHHHHHHHhhCCCc--ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhC
Confidence            3556778899999998888776511  123443444443 68887766555555544444


No 68 
>PRK13183 psbN photosystem II reaction center protein N; Provisional
Probab=44.92  E-value=29  Score=20.93  Aligned_cols=32  Identities=16%  Similarity=0.083  Sum_probs=16.5

Q ss_pred             HHHHHHHHHHhhhccchheeeeeeccCCCCch
Q psy6892          17 WAVVTLILVFCTATGNILVCLAISWEKRLQNV   48 (190)
Q Consensus        17 ~~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~   48 (190)
                      ++++++..+++|+.|-.+-...=--.+.||.|
T Consensus         9 ~~~i~i~~lL~~~TgyaiYtaFGppSk~LrDP   40 (46)
T PRK13183          9 SLAITILAILLALTGFGIYTAFGPPSKELDDP   40 (46)
T ss_pred             HHHHHHHHHHHHHhhheeeeccCCcccccCCc
Confidence            34455566666776654433332234556655


No 69 
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=44.02  E-value=25  Score=26.15  Aligned_cols=32  Identities=19%  Similarity=0.151  Sum_probs=17.3

Q ss_pred             hHHHHHHHHHHHHhhhccchheeeeeeccCCC
Q psy6892          14 WNIWAVVTLILVFCTATGNILVCLAISWEKRL   45 (190)
Q Consensus        14 ~~~~~v~~~ii~vlgi~gN~lvi~vi~~~~~L   45 (190)
                      +.++.+++++++.+.+.|-++.++-..|+++|
T Consensus       100 Yia~~~il~il~~i~is~~~~~~yr~~r~~~~  131 (139)
T PHA03099        100 YIPSPGIVLVLVGIIITCCLLSVYRFTRRTKL  131 (139)
T ss_pred             hhhhhHHHHHHHHHHHHHHHHhhheeeecccC
Confidence            34445555556666666655555554444443


No 70 
>TIGR00869 sec62 protein translocation protein, Sec62 family. protein secretary systems of yeast microsomes. They are also the non-selective cation (NS) channels of the mammalian cytoplasmic membrane. The yeast Sec62 protein has been shown to be essential for cell growth. The mammalian NS channel proteins has been implicated in platelet derived growth factor(PGDF) dependent single channel current in fibroblasts. These channels are essentially closed in serum deprived tissue-culture cells and are specifically opened by exposure to PDGF. These channels are reported to exhibit equal selectivity for Na+, K+ and Cs+ with low permeability to Ca2+, and no permeability to anions.
Probab=43.38  E-value=16  Score=29.79  Aligned_cols=6  Identities=0%  Similarity=-0.257  Sum_probs=2.7

Q ss_pred             cccccc
Q psy6892          80 TNLSGA   85 (190)
Q Consensus        80 ~~~s~~   85 (190)
                      |.++..
T Consensus       184 WlfPNL  189 (232)
T TIGR00869       184 WIFPNL  189 (232)
T ss_pred             eeecch
Confidence            444433


No 71 
>CHL00020 psbN photosystem II protein N
Probab=43.31  E-value=30  Score=20.61  Aligned_cols=32  Identities=16%  Similarity=0.078  Sum_probs=15.9

Q ss_pred             HHHHHHHHHHhhhccchheeeeeeccCCCCch
Q psy6892          17 WAVVTLILVFCTATGNILVCLAISWEKRLQNV   48 (190)
Q Consensus        17 ~~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~   48 (190)
                      ++++++...++|+.|-.+-...=--.+.||.|
T Consensus         6 ~~~i~i~~ll~~~Tgy~iYtaFGppSk~LrDP   37 (43)
T CHL00020          6 LVAIFISGLLVSFTGYALYTAFGQPSKQLRDP   37 (43)
T ss_pred             hHHHHHHHHHHHhhheeeeeccCCchhccCCc
Confidence            34455556666666654433322234455554


No 72 
>PF10856 DUF2678:  Protein of unknown function (DUF2678);  InterPro: IPR022564  This family of proteins has no known function. 
Probab=43.20  E-value=33  Score=24.99  Aligned_cols=9  Identities=11%  Similarity=0.526  Sum_probs=3.9

Q ss_pred             HHHHHHHHH
Q psy6892          50 NYFLMSLAI   58 (190)
Q Consensus        50 ~~fl~nLAv   58 (190)
                      .+|.+++.+
T Consensus        62 iffavcI~l   70 (118)
T PF10856_consen   62 IFFAVCILL   70 (118)
T ss_pred             EehHHHHHH
Confidence            344444443


No 73 
>PF02009 Rifin_STEVOR:  Rifin/stevor family;  InterPro: IPR002858 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see []. Several multicopy gene families have been described in Plasmodium falciparum, including the stevor family of subtelomeric open reading frames and the rif interspersed repetitive elements. Both families contain three predicted transmembrane segments. It has been proposed that stevor and rif are members of a larger superfamily that code for variant surface antigens [].
Probab=42.44  E-value=7.2  Score=33.08  Aligned_cols=21  Identities=10%  Similarity=0.064  Sum_probs=10.3

Q ss_pred             HHHhhhccchheeeeeeccCC
Q psy6892          24 LVFCTATGNILVCLAISWEKR   44 (190)
Q Consensus        24 i~vlgi~gN~lvi~vi~~~~~   44 (190)
                      +.++.|+.-.++|+.|+|+||
T Consensus       263 iaIliIVLIMvIIYLILRYRR  283 (299)
T PF02009_consen  263 IAILIIVLIMVIIYLILRYRR  283 (299)
T ss_pred             HHHHHHHHHHHHHHHHHHHHH
Confidence            333333333455566666554


No 74 
>PF15330 SIT:  SHP2-interacting transmembrane adaptor protein, SIT
Probab=42.06  E-value=12  Score=26.85  Aligned_cols=30  Identities=10%  Similarity=0.174  Sum_probs=20.0

Q ss_pred             HHHHHHHHHHHhhhccchheeeeeeccCCC
Q psy6892          16 IWAVVTLILVFCTATGNILVCLAISWEKRL   45 (190)
Q Consensus        16 ~~~v~~~ii~vlgi~gN~lvi~vi~~~~~L   45 (190)
                      ++.+++++++++.++.|+++.....+..+.
T Consensus         2 ~Ll~il~llLll~l~asl~~wr~~~rq~k~   31 (107)
T PF15330_consen    2 LLLGILALLLLLSLAASLLAWRMKQRQKKA   31 (107)
T ss_pred             hHHHHHHHHHHHHHHHHHHHHHHHhhhccc
Confidence            345667777777888888876666555443


No 75 
>KOG4564|consensus
Probab=41.98  E-value=18  Score=32.64  Aligned_cols=51  Identities=14%  Similarity=0.126  Sum_probs=33.3

Q ss_pred             HHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhhccc
Q psy6892          20 VTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPL   70 (190)
Q Consensus        20 ~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~lP~   70 (190)
                      +|.+-+-++++.=++.+.++..-|+||-..|++=.||.++=++.+++.+..
T Consensus       150 lytvGyslSl~sL~vAl~If~~FR~L~CtRn~IH~nLF~SfiLra~~~~i~  200 (473)
T KOG4564|consen  150 LYTVGYSLSLVSLLVALIIFLYFRSLHCTRNYIHMNLFASFILRAASVLIK  200 (473)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHhhhhcchHHHHHHHHHHHHHHHHHHHHHH
Confidence            444444444444444555666778899888888889988877766554443


No 76 
>PF13074 DUF3938:  Protein of unknown function (DUF3938)
Probab=41.62  E-value=66  Score=21.64  Aligned_cols=17  Identities=18%  Similarity=0.333  Sum_probs=11.6

Q ss_pred             cchhhhhhhhhcccccc
Q psy6892          92 NILCENQLFKAGARDET  108 (190)
Q Consensus        92 ~i~c~~slfSl~se~~~  108 (190)
                      ..+|.+..|.+..+.++
T Consensus        78 iafcladvfnllqdnee   94 (102)
T PF13074_consen   78 IAFCLADVFNLLQDNEE   94 (102)
T ss_pred             HHHHHHHHHHHhcCchh
Confidence            34788888888776544


No 77 
>PF03348 Serinc:  Serine incorporator (Serinc);  InterPro: IPR005016  This is a family of proteins which display differential expression in various tumour and cell lines. The function of these proteins is unknown. ; GO: 0016020 membrane
Probab=41.07  E-value=46  Score=29.73  Aligned_cols=55  Identities=18%  Similarity=0.267  Sum_probs=30.6

Q ss_pred             hhhHHHHHHHHHHHHhhhccchheeeeeeccCCCCchhHHH--HHHHHHHHHHHhhhhcc
Q psy6892          12 ASWNIWAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYF--LMSLAITDLMVAVLVMP   69 (190)
Q Consensus        12 ~~~~~~~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~f--l~nLAvaDLl~~l~~lP   69 (190)
                      ..+..++++.++++++++.+.++..+.+. ...  -..|.+  .+||.++=++..+...|
T Consensus       179 ~w~~~Li~~T~~~y~~si~~~v~~y~~f~-~~~--C~lN~~fIt~nliL~vi~s~lSv~p  235 (429)
T PF03348_consen  179 RWYIALIGVTLLFYAASIAGIVLMYVFFT-PSG--CSLNKFFITFNLILCVIISVLSVLP  235 (429)
T ss_pred             eehhHHHHHHHHHHHHHHHHHHHHHHHhC-CCC--CchhHHHHHHHHHHHHHHHHHHhhh
Confidence            34566666777778888877666554433 221  223433  36666665555444445


No 78 
>KOG0476|consensus
Probab=40.35  E-value=18  Score=34.59  Aligned_cols=31  Identities=23%  Similarity=0.354  Sum_probs=19.5

Q ss_pred             HHHHHHHHHHHhhhhcccceeeecccccccc
Q psy6892          53 LMSLAITDLMVAVLVMPLGILTLVRVRTNLS   83 (190)
Q Consensus        53 l~nLAvaDLl~~l~~lP~~l~~~l~~~~~~s   83 (190)
                      ++--+++-++++.+..|..+-.++.+...+.
T Consensus       372 llyp~~~a~v~ssltfP~GlG~f~aG~l~f~  402 (931)
T KOG0476|consen  372 LLYPAFIALVFSSLTFPPGLGQFLAGRLSFR  402 (931)
T ss_pred             cHHHHHHHHHHhheecCCcccccccccccHH
Confidence            3344566677777888877766666654443


No 79 
>PF12606 RELT:  Tumour necrosis factor receptor superfamily member 19;  InterPro: IPR022248 The members of tumor necrosis factor receptor (TNFR) superfamily have been designated as the "guardians of the immune system" due to their roles in immune cell proliferation, differentiation, activation, and death (apoptosis).  RELT (receptor expressed in lymphoid tissues) is a member of the TNFR superfamily. The messenger RNA of RELT is especially abundant in hematologic tissues such as spleen, lymph node, and peripheral blood leukocytes as well as in leukemias and lymphomas. RELT is able to activate the NF-kappaB pathway and selectively binds tumor necrosis factor receptor-associated factor 1 []. RELT like proteins 1 and 2 (RELL1 and RELL2) are two RELT homologues that bind to RELT. The expression of RELL1 at the mRNA level is ubiquitous, whereas expression of RELL2 mRNA is more restricted to particular tissues [].
Probab=39.97  E-value=7.7  Score=23.91  Aligned_cols=9  Identities=22%  Similarity=0.419  Sum_probs=3.5

Q ss_pred             HHHHhhhcc
Q psy6892          23 ILVFCTATG   31 (190)
Q Consensus        23 ii~vlgi~g   31 (190)
                      +++++|++|
T Consensus         9 i~iv~~lLg   17 (50)
T PF12606_consen    9 IFIVMGLLG   17 (50)
T ss_pred             HHHHHHHHH
Confidence            333444433


No 80 
>PF06398 Pex24p:  Integral peroxisomal membrane peroxin;  InterPro: IPR010482 Peroxisomes play diverse roles in the cell, compartmentalising many activities related to lipid metabolism and functioning in the decomposition of toxic hydrogen peroxide. Sequence similarity was identified between two hypothetical proteins and the peroxin integral membrane protein Pex24p [].
Probab=38.14  E-value=39  Score=28.98  Aligned_cols=52  Identities=6%  Similarity=-0.018  Sum_probs=29.5

Q ss_pred             HHHHHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhhcc
Q psy6892          17 WAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMP   69 (190)
Q Consensus        17 ~~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~lP   69 (190)
                      +...+..+|.+.++.+.++-++.|+++. ++.+.++++.+.|......++.+|
T Consensus        15 l~~~l~~~f~~~~~~d~vl~il~W~~p~-~t~~~L~l~t~~~l~p~l~l~~lp   66 (359)
T PF06398_consen   15 LSSRLGPIFPFQLILDRVLRILTWTNPD-YTLSFLLLYTFLCLNPYLLLLSLP   66 (359)
T ss_pred             HHHHHHHhhHHHHHHHHHHHeEEeCCCC-cchHHHHHHHHHHHHHHHHHHHHH
Confidence            3344455555555556666667777776 455556666666655555444344


No 81 
>PF05745 CRPA:  Chlamydia 15 kDa cysteine-rich outer membrane protein (CRPA);  InterPro: IPR008436 Chlamydia is a genus of bacteria, which causes the most common bacterial sexually transmitted diseases. They are obligate intracellular bacterial pathogens. Members of this genus lack a peptidoglycan layer, but as a substitute, it has been proposed that they have several cysteine rich membrane proteins. This includes the major outer membrane protein (MOMP). These form disulphide bonds to provide rigidity to the cell wall. The alignment of the amino acid sequences of the MOMP from various serovars of Chlamydia show that they have between seven and ten cysteine residues; seven of which are highly conserved []. The MOMP has been the focus of efforts to produce a vaccine for Chlamydia trachomatis []. The 15 kDa cysteine-rich protein in this entry is a multi-pass outer membrane protein. They are associated with the differentiation of reticulate bodies (RBs) into elementary bodies (EBs) []. They immunolocalise to the inclusion membrane, which is the membrane that surrounds the intracellular parasite. These proteins are recognised by CD8+ T cells in both human and mouse infections, suggesting they gain access to the host cytoplasm.; GO: 0019867 outer membrane
Probab=38.03  E-value=47  Score=24.74  Aligned_cols=10  Identities=20%  Similarity=0.577  Sum_probs=4.5

Q ss_pred             HHHHHHHhhh
Q psy6892          57 AITDLMVAVL   66 (190)
Q Consensus        57 AvaDLl~~l~   66 (190)
                      .+..++++.+
T Consensus       104 Glvkll~ts~  113 (150)
T PF05745_consen  104 GLVKLLITSL  113 (150)
T ss_pred             HHHHHHHHHH
Confidence            3445555443


No 82 
>COG4420 Predicted membrane protein [Function unknown]
Probab=38.00  E-value=67  Score=25.42  Aligned_cols=38  Identities=13%  Similarity=0.426  Sum_probs=22.0

Q ss_pred             HHHHHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHH
Q psy6892          17 WAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAIT   59 (190)
Q Consensus        17 ~~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAva   59 (190)
                      +++.+++++++=|+.|+.++....+.+     .-|++.||.++
T Consensus        60 fil~~~~~ll~Wi~lNl~~~~~~~wDp-----yPFi~LnLllS   97 (191)
T COG4420          60 FILTFTLLLLLWIVLNLFLVPGLAWDP-----YPFILLNLLLS   97 (191)
T ss_pred             HHHHHHHHHHHHHHHHHhhhcCCcCCC-----ccHHHHHHHHH
Confidence            334444555555677877766543333     34666777765


No 83 
>PF05915 DUF872:  Eukaryotic protein of unknown function (DUF872);  InterPro: IPR008590 This entry represents several uncharacterised eukaryotic transmembrane proteins. The function of this currently unknown.
Probab=37.38  E-value=72  Score=23.08  Aligned_cols=14  Identities=21%  Similarity=0.000  Sum_probs=5.2

Q ss_pred             HHHHHHHHhhhccc
Q psy6892          19 VVTLILVFCTATGN   32 (190)
Q Consensus        19 v~~~ii~vlgi~gN   32 (190)
                      +++++++++.++..
T Consensus        48 ~~Lli~G~~li~~g   61 (115)
T PF05915_consen   48 VFLLIFGTVLIIIG   61 (115)
T ss_pred             HHHHHHHHHHHHHH
Confidence            33333333333333


No 84 
>PF09802 Sec66:  Preprotein translocase subunit Sec66;  InterPro: IPR018624  Members of this family of proteins are a component of the heterotetrameric Sec62/63 complex composed of SEC62, SEC63, SEC66 and SEC72. The Sec62/63 complex associates with the Sec61 complex to form the Sec complex. Sec 66 is involved in SRP-independent post-translational translocation across the endoplasmic reticulum and functions together with the Sec61 complex and KAR2 in a channel-forming translocon complex. Furthermore, Sec66 is also required for growth at elevated temperatures [, , , ]. 
Probab=36.72  E-value=5.6  Score=31.51  Aligned_cols=45  Identities=16%  Similarity=0.320  Sum_probs=25.1

Q ss_pred             HHHHhhhccchheeeeeeccCCCCc---hhHHHHHHHHHHHHHHhhhhc
Q psy6892          23 ILVFCTATGNILVCLAISWEKRLQN---VTNYFLMSLAITDLMVAVLVM   68 (190)
Q Consensus        23 ii~vlgi~gN~lvi~vi~~~~~L~~---~~~~fl~nLAvaDLl~~l~~l   68 (190)
                      ++|+..++|-+.++..++|+|+++.   ...+|.-|++ .|+.+++..+
T Consensus        10 ~~Y~~vl~~sl~~Fs~~YRkr~~~~~~~l~p~F~~~~~-rdiY~sL~~~   57 (190)
T PF09802_consen   10 LAYVAVLVGSLATFSSIYRKRKAAKSASLEPWFPEHLQ-RDIYLSLLHM   57 (190)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCchhH-HHHHHHHHhc
Confidence            3344444555556656677766542   3456666664 5666666544


No 85 
>PF14494 DUF4436:  Domain of unknown function (DUF4436)
Probab=35.79  E-value=25  Score=29.19  Aligned_cols=29  Identities=3%  Similarity=0.135  Sum_probs=13.5

Q ss_pred             HHHHHHHhhhccchheeeeeeccCCCCch
Q psy6892          20 VTLILVFCTATGNILVCLAISWEKRLQNV   48 (190)
Q Consensus        20 ~~~ii~vlgi~gN~lvi~vi~~~~~L~~~   48 (190)
                      +++++.++++++=++.+.++.++|++..+
T Consensus       173 i~~vm~~L~~~al~~ai~~~~~rr~~~~~  201 (256)
T PF14494_consen  173 ICLVMWALATLALFVAIQVLRRRRKFEPP  201 (256)
T ss_pred             HHHHHHHHHHHHHHHHHHHHhCccccccc
Confidence            33333444443333444455555666555


No 86 
>PF02468 PsbN:  Photosystem II reaction centre N protein (psbN);  InterPro: IPR003398 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection [].   This family represents the low molecular weight transmembrane protein PsbN found in PSII. PsbN may have a role in PSII stability, however its actual function unknown. PsbN does not appear to be essential for photoautotrophic growth or normal PSII function.; GO: 0015979 photosynthesis, 0009523 photosystem II, 0009539 photosystem II reaction center, 0016020 membrane
Probab=35.60  E-value=37  Score=20.24  Aligned_cols=31  Identities=13%  Similarity=0.099  Sum_probs=14.9

Q ss_pred             HHHHHHHHHhhhccchheeeeeeccCCCCch
Q psy6892          18 AVVTLILVFCTATGNILVCLAISWEKRLQNV   48 (190)
Q Consensus        18 ~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~   48 (190)
                      +++++.+.++|+.|-.+-...=--.+.||.|
T Consensus         7 ~~i~i~~~lv~~Tgy~iYtaFGppSk~LrDP   37 (43)
T PF02468_consen    7 LAIFISCLLVSITGYAIYTAFGPPSKELRDP   37 (43)
T ss_pred             HHHHHHHHHHHHHhhhhhheeCCCccccCCc
Confidence            4455555666666633322222234556654


No 87 
>PF05934 MCLC:  Mid-1-related chloride channel (MCLC);  InterPro: IPR009231 This entry consists of several Chloride channel CLIC-like proteins, which function as a chloride channel when incorporated in the planar lipid bilayer [].
Probab=35.16  E-value=15  Score=33.35  Aligned_cols=27  Identities=11%  Similarity=0.132  Sum_probs=14.2

Q ss_pred             hHHHHHHHHHHHHhhhccchheeeeee
Q psy6892          14 WNIWAVVTLILVFCTATGNILVCLAIS   40 (190)
Q Consensus        14 ~~~~~v~~~ii~vlgi~gN~lvi~vi~   40 (190)
                      +.++.+++++++++.|+.+-+..++.|
T Consensus       181 ~t~l~v~l~lL~IV~iVAteLwt~V~W  207 (549)
T PF05934_consen  181 YTVLMVLLCLLCIVAIVATELWTYVSW  207 (549)
T ss_pred             hHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            345555555566666665554444443


No 88 
>PF09788 Tmemb_55A:  Transmembrane protein 55A;  InterPro: IPR019178  Members of this family catalyse the hydrolysis of the 4-position phosphate of phosphatidylinositol 4,5-bisphosphate, in the reaction:  1-phosphatidyl-myo-inositol 4,5-bisphosphate + H(2)O = 1-phosphatidyl-1D-myo-inositol 5-phosphate + phosphate.  
Probab=34.98  E-value=55  Score=27.13  Aligned_cols=12  Identities=17%  Similarity=0.110  Sum_probs=4.5

Q ss_pred             HHHHHHHHHHHh
Q psy6892          16 IWAVVTLILVFC   27 (190)
Q Consensus        16 ~~~v~~~ii~vl   27 (190)
                      ++++++++++++
T Consensus       199 i~f~llgllfli  210 (256)
T PF09788_consen  199 IIFFLLGLLFLI  210 (256)
T ss_pred             HHHHHHHHHHHH
Confidence            333333333333


No 89 
>PF03006 HlyIII:  Haemolysin-III related;  InterPro: IPR004254 Members of this family are integral membrane proteins. This family includes proteins that are hemolysin-III homologs.; GO: 0016021 integral to membrane
Probab=34.93  E-value=77  Score=24.71  Aligned_cols=42  Identities=12%  Similarity=0.120  Sum_probs=17.8

Q ss_pred             hhHHHHHHHHHHHHhhhccchheeeeeeccCCCCchhHHHHH
Q psy6892          13 SWNIWAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLM   54 (190)
Q Consensus        13 ~~~~~~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~   54 (190)
                      .+....++..++.++|++.++.......+.+.+|+..+..+.
T Consensus       106 ~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~r~~~~~~~g  147 (222)
T PF03006_consen  106 LGWFYLAFIWILALIGIVLSLFPCFSSPRFRWLRTIFFLLLG  147 (222)
T ss_pred             HHHHHHHHHHHHHHHhHHhhcchhhcCCccceeeehHhHHHH
Confidence            334444444455555554443322222333444555444443


No 90 
>PF07787 DUF1625:  Protein of unknown function (DUF1625);  InterPro: IPR012430 Sequences making up this family are derived from hypothetical proteins expressed by both prokaryotic and eukaryotic species. The region in question is approximately 250 residues long. 
Probab=34.77  E-value=72  Score=25.97  Aligned_cols=10  Identities=30%  Similarity=0.693  Sum_probs=4.2

Q ss_pred             HHHHHHHHHH
Q psy6892          50 NYFLMSLAIT   59 (190)
Q Consensus        50 ~~fl~nLAva   59 (190)
                      -++++++.++
T Consensus       222 ~~~~~~~~~s  231 (248)
T PF07787_consen  222 GLFLVAFIIS  231 (248)
T ss_pred             HHHHHHHHHH
Confidence            3444444433


No 91 
>PF06210 DUF1003:  Protein of unknown function (DUF1003);  InterPro: IPR010406 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=34.54  E-value=1.2e+02  Score=21.70  Aligned_cols=37  Identities=14%  Similarity=0.246  Sum_probs=17.6

Q ss_pred             HHHHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHH
Q psy6892          18 AVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAIT   59 (190)
Q Consensus        18 ~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAva   59 (190)
                      ++++++++++=++.|++.+......|+    + |.+.||.++
T Consensus         7 i~~~~~~~~~Wi~~N~~~~~~~~fDpy----P-FilLnl~lS   43 (108)
T PF06210_consen    7 IIIFTVFLAVWILLNILAPPRPAFDPY----P-FILLNLVLS   43 (108)
T ss_pred             HHHHHHHHHHHHHHHhhccccCCCCCc----c-HHHHHHHHH
Confidence            334444444446667766544333333    2 445555543


No 92 
>COG0341 SecF Preprotein translocase subunit SecF [Intracellular trafficking and secretion]
Probab=34.10  E-value=17  Score=30.99  Aligned_cols=46  Identities=9%  Similarity=0.072  Sum_probs=24.0

Q ss_pred             HHHHHHhhhccchheeeeeeccC---CCCchhHHHHHHHHHHHHHHhhh
Q psy6892          21 TLILVFCTATGNILVCLAISWEK---RLQNVTNYFLMSLAITDLMVAVL   66 (190)
Q Consensus        21 ~~ii~vlgi~gN~lvi~vi~~~~---~L~~~~~~fl~nLAvaDLl~~l~   66 (190)
                      -.++.++|..-|-.+++.=+.+.   +.|..+..-++|.|+...+..++
T Consensus       191 AAlLtiIGYSvNDtIVvfDRIREn~r~~~~~~~~~iin~si~qTlsRti  239 (305)
T COG0341         191 AALLTIIGYSVNDTIVVFDRIRENLRKYRRETLREIINTSINQTLTRTI  239 (305)
T ss_pred             HHHHHHeeeccCCeEEEEhHHHHHHhhhccCCHHHHHHHHHHHHHHHHH
Confidence            33455555555544443322222   23333344789999887665544


No 93 
>PF14068 YuiB:  Putative membrane protein
Probab=33.88  E-value=54  Score=23.25  Aligned_cols=19  Identities=26%  Similarity=0.366  Sum_probs=13.4

Q ss_pred             HHHHHHHHHHHHHHhhhhc
Q psy6892          50 NYFLMSLAITDLMVAVLVM   68 (190)
Q Consensus        50 ~~fl~nLAvaDLl~~l~~l   68 (190)
                      .--+.+|+.+|++.....+
T Consensus        63 ~~~l~~l~~~Di~il~~Gl   81 (102)
T PF14068_consen   63 GENLVSLALADILILSSGL   81 (102)
T ss_pred             HHHHhhhhHHHHHHHHHHH
Confidence            3457788999988765543


No 94 
>TIGR01477 RIFIN variant surface antigen, rifin family. This model represents the rifin branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of rifin sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 20 bits.
Probab=33.45  E-value=15  Score=31.89  Aligned_cols=29  Identities=10%  Similarity=0.143  Sum_probs=17.1

Q ss_pred             HHHHHHHHHhhhccchheeeeeeccCCCC
Q psy6892          18 AVVTLILVFCTATGNILVCLAISWEKRLQ   46 (190)
Q Consensus        18 ~v~~~ii~vlgi~gN~lvi~vi~~~~~L~   46 (190)
                      .+++.++.++.|+.-.++|++|.|+||-+
T Consensus       311 ~IiaSiIAIvvIVLIMvIIYLILRYRRKK  339 (353)
T TIGR01477       311 PIIASIIAILIIVLIMVIIYLILRYRRKK  339 (353)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHhhhcc
Confidence            34444555555555566777777777643


No 95 
>PF04789 DUF621:  Protein of unknown function (DUF621);  InterPro: IPR006874 This is a conserved region found in uncharacterised proteins from Caenorhabditis elegans, and is noted to have possible G-protein-coupled receptor-like activity.
Probab=33.12  E-value=41  Score=28.40  Aligned_cols=36  Identities=25%  Similarity=0.375  Sum_probs=24.1

Q ss_pred             hheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhhc
Q psy6892          33 ILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVM   68 (190)
Q Consensus        33 ~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~l   68 (190)
                      .+|+.++.+.+-.+-+..+|+..|.++-++++.+.+
T Consensus        33 Flv~~i~lW~~Fk~m~ffwFl~qlt~s~fi~S~lNl   68 (305)
T PF04789_consen   33 FLVLSIILWSHFKPMKFFWFLTQLTISVFIISSLNL   68 (305)
T ss_pred             HHHHHHHHHHhcccchHHHHHHHHHHHHHHHHhhhh
Confidence            344444444444456778999999999888876643


No 96 
>PF03729 DUF308:  Short repeat of unknown function (DUF308);  InterPro: IPR005325 This represents a group of short repeats that occurs in a limited number of membrane proteins. It may divide further in short repeats of around 7-10 residues of the pattern G-#-X(2)-#(2)-X (#=hydrophobic).
Probab=33.12  E-value=82  Score=19.57  Aligned_cols=21  Identities=5%  Similarity=0.224  Sum_probs=12.7

Q ss_pred             CchhHHHHHHHHHHHHHHhhh
Q psy6892          46 QNVTNYFLMSLAITDLMVAVL   66 (190)
Q Consensus        46 ~~~~~~fl~nLAvaDLl~~l~   66 (190)
                      +.+...+..-+++.+++.+.+
T Consensus        49 ~~~~~~~~l~~gi~~i~~Gi~   69 (72)
T PF03729_consen   49 GSKGWWWSLLSGILSIVLGII   69 (72)
T ss_pred             cchhhHHHHHHHHHHHHHHHH
Confidence            444555666667777776654


No 97 
>PF13903 Claudin_2:  PMP-22/EMP/MP20/Claudin tight junction
Probab=32.89  E-value=55  Score=24.13  Aligned_cols=23  Identities=0%  Similarity=-0.005  Sum_probs=11.9

Q ss_pred             HHHHHhhhccchheeeeeeccCC
Q psy6892          22 LILVFCTATGNILVCLAISWEKR   44 (190)
Q Consensus        22 ~ii~vlgi~gN~lvi~vi~~~~~   44 (190)
                      ++-.++.++|.++.++..+++++
T Consensus        76 ~l~~~~~~~a~~~~~~~~~~~~~   98 (172)
T PF13903_consen   76 ILGLLLLLFAFVFALIGFCKRSY   98 (172)
T ss_pred             HHHHHHHHHHHHHHHHHhhcccc
Confidence            33334455666666555555444


No 98 
>PTZ00046 rifin; Provisional
Probab=32.87  E-value=16  Score=31.81  Aligned_cols=28  Identities=11%  Similarity=0.158  Sum_probs=16.9

Q ss_pred             HHHHHHHHhhhccchheeeeeeccCCCC
Q psy6892          19 VVTLILVFCTATGNILVCLAISWEKRLQ   46 (190)
Q Consensus        19 v~~~ii~vlgi~gN~lvi~vi~~~~~L~   46 (190)
                      +++.++.++.|+.-.++|++|.|+||-+
T Consensus       317 IiaSiiAIvVIVLIMvIIYLILRYRRKK  344 (358)
T PTZ00046        317 IIASIVAIVVIVLIMVIIYLILRYRRKK  344 (358)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHhhhcc
Confidence            4445555555555567777777777643


No 99 
>KOG1056|consensus
Probab=32.40  E-value=49  Score=32.27  Aligned_cols=43  Identities=12%  Similarity=0.352  Sum_probs=24.0

Q ss_pred             HHHHHHHHHHhhhccchheeeeeeccCCC-------CchhHHHHHHHHHH
Q psy6892          17 WAVVTLILVFCTATGNILVCLAISWEKRL-------QNVTNYFLMSLAIT   59 (190)
Q Consensus        17 ~~v~~~ii~vlgi~gN~lvi~vi~~~~~L-------~~~~~~fl~nLAva   59 (190)
                      +.++.+++.++||+.-++|++++.++...       |..+|+++..+.++
T Consensus       561 ~~i~~l~lA~LGii~tl~vlv~fvr~~~TpvVkasgreLsyiLL~gi~l~  610 (878)
T KOG1056|consen  561 WSIAPLLLAILGIIATLFVLVLFVRYNDTPVVKASGRELSYILLLGIFLC  610 (878)
T ss_pred             ccHHHHHHHHHHHHHHHHhheeEEEecCCceEEecCcEehHHHHHHHHHH
Confidence            33455555566665555555555444322       45567888777643


No 100
>PF14654 Epiglycanin_C:  Mucin, catalytic, TM and cytoplasmic tail region
Probab=32.12  E-value=35  Score=24.05  Aligned_cols=23  Identities=4%  Similarity=-0.042  Sum_probs=12.2

Q ss_pred             hhhHHHHHHHHHHHHhhhccchh
Q psy6892          12 ASWNIWAVVTLILVFCTATGNIL   34 (190)
Q Consensus        12 ~~~~~~~v~~~ii~vlgi~gN~l   34 (190)
                      ++.++++.+..++..+|++.-++
T Consensus        17 PWeIfLItLasVvvavGl~aGLf   39 (106)
T PF14654_consen   17 PWEIFLITLASVVVAVGLFAGLF   39 (106)
T ss_pred             chHHHHHHHHHHHHHHHHHHHHH
Confidence            33445555555666666654433


No 101
>PF11683 DUF3278:  Protein of unknown function (DUF3278);  InterPro: IPR021697  This bacterial family of proteins has no known function. 
Probab=31.84  E-value=31  Score=25.30  Aligned_cols=26  Identities=19%  Similarity=0.161  Sum_probs=16.9

Q ss_pred             HHHHHHHHhhhccchheeeeeeccCC
Q psy6892          19 VVTLILVFCTATGNILVCLAISWEKR   44 (190)
Q Consensus        19 v~~~ii~vlgi~gN~lvi~vi~~~~~   44 (190)
                      -.++.++...+++|++.++...+.+.
T Consensus        35 ~a~i~l~~~~l~~~li~l~l~~~~~~   60 (129)
T PF11683_consen   35 NAFIILFYYSLLLNLISLLLASRYPE   60 (129)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHhHH
Confidence            35666777778888887765444433


No 102
>PF09323 DUF1980:  Domain of unknown function (DUF1980);  InterPro: IPR015402  Members of this occur in gene pairs with members of PF03773 from PFAM. The N-terminal region contains several predicted transmembrane helix regions while the few invariant residues (G, CxxD, and W) occur in the C-terminal region.  Members of this family are found in a set of prokaryotic hypothetical proteins. Their exact function has not, as yet, been defined. 
Probab=31.72  E-value=73  Score=24.63  Aligned_cols=8  Identities=13%  Similarity=0.447  Sum_probs=3.0

Q ss_pred             HHHHHhhh
Q psy6892          22 LILVFCTA   29 (190)
Q Consensus        22 ~ii~vlgi   29 (190)
                      ++++++|+
T Consensus        40 i~l~ilai   47 (182)
T PF09323_consen   40 ILLLILAI   47 (182)
T ss_pred             HHHHHHHH
Confidence            33333433


No 103
>PF13127 DUF3955:  Protein of unknown function (DUF3955)
Probab=31.66  E-value=1.3e+02  Score=19.31  Aligned_cols=25  Identities=20%  Similarity=0.450  Sum_probs=14.2

Q ss_pred             eeccCCCCchhHHHHHHHHHHHHHHhh
Q psy6892          39 ISWEKRLQNVTNYFLMSLAITDLMVAV   65 (190)
Q Consensus        39 i~~~~~L~~~~~~fl~nLAvaDLl~~l   65 (190)
                      +-.+-.+|-|  ++++.++..=++.++
T Consensus        28 Vd~~G~L~Ep--FfLiPlg~l~~~~g~   52 (63)
T PF13127_consen   28 VDEDGVLHEP--FFLIPLGYLFLLIGI   52 (63)
T ss_pred             ECCCCeEecc--cHHHHHHHHHHHHHH
Confidence            3455667877  566666655444443


No 104
>KOG4404|consensus
Probab=31.64  E-value=21  Score=30.74  Aligned_cols=17  Identities=24%  Similarity=0.356  Sum_probs=8.9

Q ss_pred             chhHHHHHHHHHHHHHH
Q psy6892          47 NVTNYFLMSLAITDLMV   63 (190)
Q Consensus        47 ~~~~~fl~nLAvaDLl~   63 (190)
                      ++.|++++.+.++-++.
T Consensus       156 S~~~l~~i~~~~~~~~i  172 (350)
T KOG4404|consen  156 SVYNLVLILFTACILLI  172 (350)
T ss_pred             cHHHHHHHHHHHHHHHH
Confidence            45566666555544443


No 105
>PRK13664 hypothetical protein; Provisional
Probab=31.24  E-value=38  Score=21.48  Aligned_cols=16  Identities=6%  Similarity=0.225  Sum_probs=10.6

Q ss_pred             HHHHHHHHhhhccchh
Q psy6892          19 VVTLILVFCTATGNIL   34 (190)
Q Consensus        19 v~~~ii~vlgi~gN~l   34 (190)
                      .++++++++|++-|++
T Consensus         9 Wilill~lvG~i~N~i   24 (62)
T PRK13664          9 WILVLVFLVGVLLNVI   24 (62)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            3445667778877765


No 106
>KOG2443|consensus
Probab=31.15  E-value=9.1  Score=33.02  Aligned_cols=27  Identities=11%  Similarity=0.041  Sum_probs=17.9

Q ss_pred             HHHHHHhhhhcccceeeeccccccccc
Q psy6892          58 ITDLMVAVLVMPLGILTLVRVRTNLSG   84 (190)
Q Consensus        58 vaDLl~~l~~lP~~l~~~l~~~~~~s~   84 (190)
                      -+|++..+.+.+..+.+++.++|....
T Consensus       152 ~~~iv~~vls~~i~v~~ll~~HWl~nN  178 (362)
T KOG2443|consen  152 RAQIVALVLSSMIVVWYLLTKHWLANN  178 (362)
T ss_pred             HHHHHHHHHHHHHHHHHHhhhHHHHHh
Confidence            356666666666777777778876543


No 107
>PF15145 DUF4577:  Domain of unknown function (DUF4577)
Probab=30.78  E-value=30  Score=25.06  Aligned_cols=27  Identities=15%  Similarity=0.167  Sum_probs=13.4

Q ss_pred             hHHHHHHHHHHHHhhhccchheeeeee
Q psy6892          14 WNIWAVVTLILVFCTATGNILVCLAIS   40 (190)
Q Consensus        14 ~~~~~v~~~ii~vlgi~gN~lvi~vi~   40 (190)
                      +.++++.++++.++++..-.+||+.+.
T Consensus        60 ~~lffvglii~LivSLaLVsFvIFLii   86 (128)
T PF15145_consen   60 RSLFFVGLIIVLIVSLALVSFVIFLII   86 (128)
T ss_pred             eeehHHHHHHHHHHHHHHHHHHHHhee
Confidence            445555555555555544445554443


No 108
>KOG3850|consensus
Probab=30.40  E-value=50  Score=29.11  Aligned_cols=37  Identities=27%  Similarity=0.468  Sum_probs=19.5

Q ss_pred             HHHHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHH
Q psy6892          18 AVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLA   57 (190)
Q Consensus        18 ~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLA   57 (190)
                      .++.+++++++.+.|++.   .....++|+.+.+|++-|+
T Consensus       398 alm~VlLvfVSTIa~~v~---PLmkSR~rt~~t~~LV~l~  434 (455)
T KOG3850|consen  398 ALMTVLLVFVSTIANCVS---PLMKSRNRTASTFFLVFLL  434 (455)
T ss_pred             HHHHHHHHHHHHHHhhcc---HHhhhhhHHHHHHHHHHHH
Confidence            333444455555555443   3344556677766666554


No 109
>PF07226 DUF1422:  Protein of unknown function (DUF1422);  InterPro: IPR009867 This family consists of several hypothetical bacterial proteins of around 120 residues in length. The function of this family is unknown.
Probab=30.28  E-value=10  Score=27.58  Aligned_cols=27  Identities=11%  Similarity=0.082  Sum_probs=15.1

Q ss_pred             cCCCCchhHHHHHHHHHHHHHHhhhhc
Q psy6892          42 EKRLQNVTNYFLMSLAITDLMVAVLVM   68 (190)
Q Consensus        42 ~~~L~~~~~~fl~nLAvaDLl~~l~~l   68 (190)
                      .++++.|+.-=+.-+|++-++++++..
T Consensus        53 Q~Yl~~~m~eg~P~~a~acFflG~f~y   79 (117)
T PF07226_consen   53 QRYLNHPMPEGTPKLALACFFLGLFGY   79 (117)
T ss_pred             HHHhcCCCCCCChHHHHHHHHHHHHHH
Confidence            345566665555555666566665543


No 110
>KOG1281|consensus
Probab=30.23  E-value=1e+02  Score=28.74  Aligned_cols=29  Identities=17%  Similarity=0.047  Sum_probs=16.2

Q ss_pred             chhhHHHHHHHHHHHHhhhccchheeeee
Q psy6892          11 AASWNIWAVVTLILVFCTATGNILVCLAI   39 (190)
Q Consensus        11 ~~~~~~~~v~~~ii~vlgi~gN~lvi~vi   39 (190)
                      +.....+++..+++.+..++.|.+...++
T Consensus       469 p~~~i~li~~ll~~v~TeF~Sntata~If  497 (586)
T KOG1281|consen  469 PVFAILLIFSLLILVVTEFVSNTATASIF  497 (586)
T ss_pred             ChHHHHHHHHHHHHHHHHHhcchhheeeh
Confidence            33334444444555566688887765444


No 111
>PHA02974 putative IMV membrane protein; Provisional
Probab=30.22  E-value=72  Score=21.61  Aligned_cols=56  Identities=14%  Similarity=0.137  Sum_probs=28.5

Q ss_pred             HHHHHHHhhhccchheeeeeeccCCCC-chhHHHHHHHHHHHHHHhhhhcccceeee
Q psy6892          20 VTLILVFCTATGNILVCLAISWEKRLQ-NVTNYFLMSLAITDLMVAVLVMPLGILTL   75 (190)
Q Consensus        20 ~~~ii~vlgi~gN~lvi~vi~~~~~L~-~~~~~fl~nLAvaDLl~~l~~lP~~l~~~   75 (190)
                      +.++..+..+.|-.+++-.+.-++.+- +....|---+.+..++..++.+|..+..+
T Consensus        13 iTlL~llMiisG~aLi~k~~~P~~~~v~~ss~tf~rvv~~lE~vailifiPGti~LY   69 (81)
T PHA02974         13 ITILVVLLIISGFSLILRLIPGVYSSVIRSSFTAGKILRFMEIFSTIMFIPGIIILY   69 (81)
T ss_pred             HHHHHHHHHHhChHHHHhhcCchhhhhhhHHHHHHHHHHHHHHHHHhheeccHHHHH
Confidence            334444445666666654443332221 22224445566666666666777655443


No 112
>KOG2442|consensus
Probab=29.89  E-value=11  Score=34.10  Aligned_cols=29  Identities=14%  Similarity=0.157  Sum_probs=17.3

Q ss_pred             HHHHHHHHHHHHHhhhhcccceeeecccc
Q psy6892          51 YFLMSLAITDLMVAVLVMPLGILTLVRVR   79 (190)
Q Consensus        51 ~fl~nLAvaDLl~~l~~lP~~l~~~l~~~   79 (190)
                      .++-++...-++++++++.+.+++...++
T Consensus       306 ~~l~~~~~~~l~l~~~Cia~aV~W~v~R~  334 (541)
T KOG2442|consen  306 PRLGNMSYRLLFLSILCIAVAVVWAVFRN  334 (541)
T ss_pred             cccCChhHHHHHHHHhhhheeEEEEEeec
Confidence            44455555556666667667666665544


No 113
>PF11044 TMEMspv1-c74-12:  Plectrovirus spv1-c74 ORF 12 transmembrane protein;  InterPro: IPR022743  This is a group of proteins expressed by Plectroviruses. The Plectroviruses are single-stranded DNA viruses belonging to the Inoviridae. This entry represents putative transmembrane proteins of unknown function. 
Probab=29.77  E-value=62  Score=19.42  Aligned_cols=16  Identities=25%  Similarity=0.177  Sum_probs=6.4

Q ss_pred             HHHHHHHHHHHhhhcc
Q psy6892          16 IWAVVTLILVFCTATG   31 (190)
Q Consensus        16 ~~~v~~~ii~vlgi~g   31 (190)
                      .++.+.+++++++.+|
T Consensus         7 ~iFsvvIil~If~~iG   22 (49)
T PF11044_consen    7 TIFSVVIILGIFAWIG   22 (49)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            3333333444433344


No 114
>PF12191 stn_TNFRSF12A:  Tumour necrosis factor receptor stn_TNFRSF12A_TNFR domain;  InterPro: IPR022316 The tumour necrosis factor (TNF) receptor (TNFR) superfamily comprises more than 20 type-I transmembrane proteins. Family members are defined based on similarity in their extracellular domain - a region that contains many cysteine residues arranged in a specific repetitive pattern []. The cysteines allow formation of an extended rod-like structure, responsible for ligand binding []. Upon receptor activation, different intracellular signalling complexes are assembled for different members of the TNFR superfamily, depending on their intracellular domains and sequences []. Activation of TNFRs can therefore induce a range of disparate effects, including cell proliferation, differentiation, survival, or apoptotic cell death, depending upon the receptor involved []. TNFRs are widely distributed and play important roles in many crucial biological processes, such as lymphoid and neuronal development, innate and adaptive immunity, and maintenance of cellular homeostasis []. Drugs that manipulate their signalling have potential roles in the prevention and treatment of many diseases, such as viral infections, coronary heart disease, transplant rejection, and immune disease []. TNF receptor 12 (also known as TWEAK receptor, and fibroblast growth factor-inducible-14 (Fn14)) has been implicated in endothelial cell growth and migration []. The receptor may also play a role in cell-matrix interactions [].; PDB: 2KN0_A 2RPJ_A 2KMZ_A 2EQP_A.
Probab=28.89  E-value=19  Score=26.62  Aligned_cols=14  Identities=0%  Similarity=0.003  Sum_probs=0.0

Q ss_pred             HHHHHHHHHhhhcc
Q psy6892          18 AVVTLILVFCTATG   31 (190)
Q Consensus        18 ~v~~~ii~vlgi~g   31 (190)
                      +.+..++++++++.
T Consensus        83 ~sal~v~lVl~lls   96 (129)
T PF12191_consen   83 GSALSVVLVLALLS   96 (129)
T ss_dssp             --------------
T ss_pred             hhHHHHHHHHHHHH
Confidence            34444444555544


No 115
>PF02118 Srg:  Srg family chemoreceptor;  InterPro: IPR000609 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents serpentine receptor class g (Srg) from the Srg superfamily [, ]. Srg receptors contain seven hydrophobic, putative transmembrane, regions and can be distinguished from other 7TM GPCR receptors by their own characteristic TM signatures. ; GO: 0004888 transmembrane signaling receptor activity, 0007606 sensory perception of chemical stimulus, 0016020 membrane
Probab=28.81  E-value=22  Score=28.86  Aligned_cols=45  Identities=13%  Similarity=0.085  Sum_probs=27.3

Q ss_pred             HHHHHhhhccchheeeeeeccCC---CCchhHHHHHHHHHHHHHHhhh
Q psy6892          22 LILVFCTATGNILVCLAISWEKR---LQNVTNYFLMSLAITDLMVAVL   66 (190)
Q Consensus        22 ~ii~vlgi~gN~lvi~vi~~~~~---L~~~~~~fl~nLAvaDLl~~l~   66 (190)
                      ++-++.+++.+++++++++++++   .+++.+.+...=+++++++-+.
T Consensus         5 ~~Ygi~s~~l~~~~~~~i~~~~~~~~~~~sFy~l~~~d~~~ni~~~ln   52 (275)
T PF02118_consen    5 LIYGIPSLILYIFILYVIWKSKKKSYFKSSFYRLYIMDLIMNILTYLN   52 (275)
T ss_pred             hhHHHHHHHHHHHHHHHHHHhccccccCCccHHHHHHHhHHHHHHHHH
Confidence            34455566666677777766665   3666666666656666665433


No 116
>PRK06758 hypothetical protein; Provisional
Probab=28.59  E-value=1.3e+02  Score=21.13  Aligned_cols=17  Identities=18%  Similarity=0.333  Sum_probs=11.7

Q ss_pred             cchhhhhhhhhcccccc
Q psy6892          92 NILCENQLFKAGARDET  108 (190)
Q Consensus        92 ~i~c~~slfSl~se~~~  108 (190)
                      ..+|.+..|.+-.+.++
T Consensus       102 iafcladvfnllqd~ee  118 (128)
T PRK06758        102 IAFCLADVFNLLQDNEE  118 (128)
T ss_pred             HHHHHHHHHHHhcCchh
Confidence            34788888888776543


No 117
>cd07912 Tweety_N N-terminal domain of the protein encoded by the Drosophila tweety gene and related proteins, a family of chloride ion channels. The protein product of the Drosophila tweety (tty) gene is thought to form a trans-membrane protein with five membrane-spanning regions and a cytoplasmic C-terminus. This N-terminal domain contains the putative transmembrane spanning regions. Tweety has been suggested as a candidate for a large conductance chloride channel, both in vertebrate and insect cells. Three human homologs have been identified and designated TTYH1-3. TTYH2 has been associated with the progression of cancer, and Drosophila melanogaster tweety has been assumed to play a role in development. TTYH2, and TTYH3 bind to and are ubiquinated by Nedd4-2, a HECT type E3 ubiquitin ligase, which most likely plays a role in controlling the cellular levels of tweety family proteins.
Probab=28.24  E-value=1e+02  Score=27.56  Aligned_cols=15  Identities=27%  Similarity=0.306  Sum_probs=8.2

Q ss_pred             hHHHHHHHHHHHHHH
Q psy6892          49 TNYFLMSLAITDLMV   63 (190)
Q Consensus        49 ~~~fl~nLAvaDLl~   63 (190)
                      ...+...|.+.-++.
T Consensus        79 ~~c~~~sLiiltL~~   93 (418)
T cd07912          79 ICCLKWSLVIATLLC   93 (418)
T ss_pred             ccHHHHHHHHHHHHH
Confidence            345555666555554


No 118
>PF14184 YrvL:  Regulatory protein YrvL
Probab=28.19  E-value=61  Score=24.03  Aligned_cols=20  Identities=10%  Similarity=0.034  Sum_probs=9.6

Q ss_pred             HHHHHHHHHHHhhhccchhe
Q psy6892          16 IWAVVTLILVFCTATGNILV   35 (190)
Q Consensus        16 ~~~v~~~ii~vlgi~gN~lv   35 (190)
                      .++.++++.+++|++-+.+.
T Consensus        41 ~llLF~li~~~lg~~~e~~~   60 (132)
T PF14184_consen   41 SLLLFFLIIFVLGLPFELFE   60 (132)
T ss_pred             HHHHHHHHHHHHHHHHHHHH
Confidence            33444444455555555443


No 119
>PRK11056 hypothetical protein; Provisional
Probab=28.02  E-value=15  Score=26.76  Aligned_cols=29  Identities=17%  Similarity=0.162  Sum_probs=18.3

Q ss_pred             ccCCCCchhHHHHHHHHHHHHHHhhhhcc
Q psy6892          41 WEKRLQNVTNYFLMSLAITDLMVAVLVMP   69 (190)
Q Consensus        41 ~~~~L~~~~~~fl~nLAvaDLl~~l~~lP   69 (190)
                      ..++++.|+.-=+.-+|++-++++++...
T Consensus        52 yQ~Yl~~~m~eg~P~~a~acFflG~f~yS   80 (120)
T PRK11056         52 HQRYLNRPMPEGLPGLAAACFFLGVFLYS   80 (120)
T ss_pred             HHHHhcCCCCCCChHHHHHHHHHHHHHHH
Confidence            34566777766666677776777665443


No 120
>PF10873 DUF2668:  Protein of unknown function (DUF2668);  InterPro: IPR022640  Members in this family of proteins are annotated as cysteine and tyrosine-rich protein 1, however currently no function is known []. 
Probab=27.83  E-value=8.8  Score=29.03  Aligned_cols=25  Identities=20%  Similarity=0.388  Sum_probs=16.0

Q ss_pred             HHHHHHHHHHHhhhccchheeeeee
Q psy6892          16 IWAVVTLILVFCTATGNILVCLAIS   40 (190)
Q Consensus        16 ~~~v~~~ii~vlgi~gN~lvi~vi~   40 (190)
                      +..++|.++|++|+++-+.+++..+
T Consensus        63 IaGIVfgiVfimgvva~i~icvCmc   87 (155)
T PF10873_consen   63 IAGIVFGIVFIMGVVAGIAICVCMC   87 (155)
T ss_pred             eeeeehhhHHHHHHHHHHHHHHhhh
Confidence            4456777788888777666554443


No 121
>KOG0473|consensus
Probab=27.65  E-value=60  Score=27.16  Aligned_cols=35  Identities=11%  Similarity=-0.045  Sum_probs=16.5

Q ss_pred             HHHHHHhhhccchheeeeeec-cCCCCchhHHHHHH
Q psy6892          21 TLILVFCTATGNILVCLAISW-EKRLQNVTNYFLMS   55 (190)
Q Consensus        21 ~~ii~vlgi~gN~lvi~vi~~-~~~L~~~~~~fl~n   55 (190)
                      .++.|.+++.||++.++.+.. .+.+.+..+-++.|
T Consensus       265 ~L~~FgVa~~~~~l~V~mv~q~~~d~~~sv~~~~~h  300 (326)
T KOG0473|consen  265 GLKQFGVALEGFVLGVKMVFQVVADALRSVGALLVH  300 (326)
T ss_pred             hHHHHHHHhhhHHhhhHhhhhhccHHHHHHHHHHHH
Confidence            344455556666665544432 23333334444444


No 122
>COG2116 FocA Formate/nitrite family of transporters [Inorganic ion transport and metabolism]
Probab=27.33  E-value=38  Score=28.28  Aligned_cols=43  Identities=21%  Similarity=0.192  Sum_probs=27.7

Q ss_pred             HHHHHHhhhccchheeeeeeccCCCCchh-HHHHHHHHHHHHHH
Q psy6892          21 TLILVFCTATGNILVCLAISWEKRLQNVT-NYFLMSLAITDLMV   63 (190)
Q Consensus        21 ~~ii~vlgi~gN~lvi~vi~~~~~L~~~~-~~fl~nLAvaDLl~   63 (190)
                      +.-+++.||+.|.+|+.++|...+.+... -++++-+.+..+..
T Consensus       158 ~~~~F~~GI~cN~lVclAvwm~~~~k~~~~K~~~m~~pv~~Fv~  201 (265)
T COG2116         158 FLEAFAKGILCNWLVCLAVWMSYSAKDGAAKIFAIILPVMLFVA  201 (265)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHhcccchhhHHHHHHHHHHHHHH
Confidence            46677888888888888887766655432 45555555544443


No 123
>PTZ00415 transmission-blocking target antigen s230; Provisional
Probab=27.32  E-value=14  Score=38.55  Aligned_cols=6  Identities=33%  Similarity=0.407  Sum_probs=2.8

Q ss_pred             hccchh
Q psy6892          29 ATGNIL   34 (190)
Q Consensus        29 i~gN~l   34 (190)
                      ++||.+
T Consensus        30 ~~~~~~   35 (2849)
T PTZ00415         30 IIGNGI   35 (2849)
T ss_pred             eecCCc
Confidence            445543


No 124
>cd08763 Cyt_b561_CYB561 Vertebrate cytochrome b(561), CYB561 gene product. Cytochrome b(561), as found in vertebrates, which might act as a ferric-chelate reductase, catalyzing the reduction of Fe(3+) to Fe(2+), such as associated with the transport of iron from the endosome to the cytoplasm. It is assumed that this protein uses ascorbate as the electron donor. Belongs to the cytochrome b(561) family, which are secretory vesicle-specific electron transport proteins. Cytochromes b(561) are integral membrane proteins that bind two heme groups non-covalently, and may have six alpha-helical trans-membrane segments.
Probab=27.32  E-value=84  Score=23.61  Aligned_cols=45  Identities=18%  Similarity=0.252  Sum_probs=21.7

Q ss_pred             HHHHHHHhhhccchheeeeeec--cCCCCchhHHHHHHHHHHHHHHh
Q psy6892          20 VTLILVFCTATGNILVCLAISW--EKRLQNVTNYFLMSLAITDLMVA   64 (190)
Q Consensus        20 ~~~ii~vlgi~gN~lvi~vi~~--~~~L~~~~~~fl~nLAvaDLl~~   64 (190)
                      ++++++.+-+.|+.++++-..+  .++..+..+..+--+|+.=.+++
T Consensus        12 ~lm~~G~i~l~geaiL~~~~~~~~~k~~~k~~H~~L~~la~~~~~~G   58 (143)
T cd08763          12 LCMVLGLVFLCGEALLVYRVFRNETKRSTKILHGLLHIMALVISLVG   58 (143)
T ss_pred             HHHHHHHHHHHHHHHHHhccccccccchhHHHHHHHHHHHHHHHHHH
Confidence            4445555555566666554333  22223445555555555444443


No 125
>COG5130 YIP3 Prenylated rab acceptor 1 and related proteins [Intracellular trafficking and secretion / Signal transduction mechanisms]
Probab=27.13  E-value=33  Score=25.96  Aligned_cols=23  Identities=9%  Similarity=0.157  Sum_probs=12.6

Q ss_pred             hhHHHHHHHHHHHHHHhhhhccc
Q psy6892          48 VTNYFLMSLAITDLMVAVLVMPL   70 (190)
Q Consensus        48 ~~~~fl~nLAvaDLl~~l~~lP~   70 (190)
                      |.+.+...|++.-+.++++..|.
T Consensus       113 ~~~~ly~glvcvlip~gffaspI  135 (169)
T COG5130         113 PRSVLYAGLVCVLIPFGFFASPI  135 (169)
T ss_pred             ecchhhhhHHHHHHHHHHHHhHH
Confidence            34556666666555555554443


No 126
>KOG0065|consensus
Probab=26.36  E-value=51  Score=33.70  Aligned_cols=35  Identities=17%  Similarity=0.174  Sum_probs=18.8

Q ss_pred             chheeeeeeccCCCCchhHHHHHHHHHHHHHHhhh
Q psy6892          32 NILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVL   66 (190)
Q Consensus        32 N~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~   66 (190)
                      +.+=++++...+.+++..++.-.-+++..++.+++
T Consensus      1250 ~~~Gmm~~s~tPn~~~Aav~~s~~~s~~~~F~G~l 1284 (1391)
T KOG0065|consen 1250 TTLGMMLVSLTPNLQTAAVIASLFFSFWNLFSGFL 1284 (1391)
T ss_pred             HHHHHHHHHhCCChhHHHHHHHHHHHHHHHhcccc
Confidence            33334455556666666555555555555555543


No 127
>PRK11562 nitrite transporter NirC; Provisional
Probab=25.95  E-value=36  Score=28.38  Aligned_cols=34  Identities=32%  Similarity=0.512  Sum_probs=22.1

Q ss_pred             HHHHHHhhhccchheeeeeeccCCCCchhHHHHH
Q psy6892          21 TLILVFCTATGNILVCLAISWEKRLQNVTNYFLM   54 (190)
Q Consensus        21 ~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~   54 (190)
                      +..+++.||++|.+|+..++...+.+....++.+
T Consensus       150 ~~~~f~~gI~CNwLVclavw~~~~~~~~gKi~~i  183 (268)
T PRK11562        150 AMVLFFKGALCNWLVCLAIWMAIRTEGAAKFIAI  183 (268)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHhccccchhHHH
Confidence            4577888999999998877755543333333333


No 128
>PF15109 TMEM125:  TMEM125 protein family
Probab=25.87  E-value=1.8e+02  Score=20.61  Aligned_cols=46  Identities=20%  Similarity=0.182  Sum_probs=22.2

Q ss_pred             HHHHHHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHHHH
Q psy6892          16 IWAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDL   61 (190)
Q Consensus        16 ~~~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDL   61 (190)
                      .+.+...++...|+-|-.+.-.+..+....|-....-+|-||+.=+
T Consensus        21 Cy~~aV~Lil~cG~gGv~LLstttS~sgeWRL~vGt~LCLLAL~vL   66 (112)
T PF15109_consen   21 CYCVAVALILGCGLGGVGLLSTTTSRSGEWRLAVGTTLCLLALLVL   66 (112)
T ss_pred             HHHHHHHHHHhcCcCceEEEecccccccceehhhhHHHHHHHHHHH
Confidence            3333344444445544444434444444455555556666665433


No 129
>PHA03164 hypothetical protein; Provisional
Probab=25.79  E-value=52  Score=22.11  Aligned_cols=22  Identities=27%  Similarity=0.407  Sum_probs=15.8

Q ss_pred             CchhHHHHHHHHHHHHHHhhhh
Q psy6892          46 QNVTNYFLMSLAITDLMVAVLV   67 (190)
Q Consensus        46 ~~~~~~fl~nLAvaDLl~~l~~   67 (190)
                      ++.+.+.+..||++=+++.++.
T Consensus        56 ktftFlvLtgLaIamILfiifv   77 (88)
T PHA03164         56 KTFTFLVLTGLAIAMILFIIFV   77 (88)
T ss_pred             heeehHHHHHHHHHHHHHHHHH
Confidence            4567788888999877765443


No 130
>KOG2632|consensus
Probab=25.57  E-value=60  Score=27.00  Aligned_cols=58  Identities=16%  Similarity=0.211  Sum_probs=34.2

Q ss_pred             hHHHHHHHHHHHHhhhccchhee----eeeeccCCCCchhHHHHHHHHHHHHHHhhhhcccc
Q psy6892          14 WNIWAVVTLILVFCTATGNILVC----LAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLG   71 (190)
Q Consensus        14 ~~~~~v~~~ii~vlgi~gN~lvi----~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~lP~~   71 (190)
                      ...+.++..++++++++.-+...    ..+.+++++++-..+-+.|+++-.+++..+.++..
T Consensus        18 ts~~~~~~~~i~lv~~~~~i~~~~~l~~~~l~~~ql~RL~Ty~l~H~s~~hllfnmlaL~~~   79 (258)
T KOG2632|consen   18 TSIVVVLAILIYLVSFFPGIVEVLGLPSELLINWQLYRLITYALVHLSLPHLLFNMLALWPL   79 (258)
T ss_pred             HHHHHHHHHHHHHHhccchhhhHhcCCHHHhhhHHHHHHHHHHHHhccHHHHHHHHHHHHhc
Confidence            34444555555555554444321    22345666777777778888888888776655433


No 131
>PF06024 DUF912:  Nucleopolyhedrovirus protein of unknown function (DUF912);  InterPro: IPR009261 This entry is represented by Autographa californica nuclear polyhedrosis virus (AcMNPV), Orf78; it is a family of uncharacterised viral proteins.
Probab=25.54  E-value=73  Score=22.32  Aligned_cols=11  Identities=18%  Similarity=0.347  Sum_probs=5.5

Q ss_pred             heeeeeeccCC
Q psy6892          34 LVCLAISWEKR   44 (190)
Q Consensus        34 lvi~vi~~~~~   44 (190)
                      +..++|.|.|+
T Consensus        81 IyYFVILRer~   91 (101)
T PF06024_consen   81 IYYFVILRERQ   91 (101)
T ss_pred             heEEEEEeccc
Confidence            34455555554


No 132
>COG4970 FimT Tfp pilus assembly protein FimT [Cell motility and secretion / Intracellular trafficking and secretion]
Probab=24.85  E-value=76  Score=24.93  Aligned_cols=44  Identities=11%  Similarity=0.049  Sum_probs=29.5

Q ss_pred             eeccCCCCchhHHHHHHHHHHHHHHhhhhcccceeeeccccccc
Q psy6892          39 ISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGILTLVRVRTNL   82 (190)
Q Consensus        39 i~~~~~L~~~~~~fl~nLAvaDLl~~l~~lP~~l~~~l~~~~~~   82 (190)
                      .++..++++..+-+...|.++-.-......|..+.....+.|..
T Consensus        36 ~i~~~rl~s~a~eL~a~l~lARteAv~~~~pv~~~~~~~~~~~~   79 (181)
T COG4970          36 WIRSQRLRSAADELAAALQLARTEAVAQNTPVQVCPDWDGGWCL   79 (181)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeeeeCCCCCeec
Confidence            34667788888989888888765554445676666555555444


No 133
>PF12304 BCLP:  Beta-casein like protein;  InterPro: IPR020977  This entry represents eukaryotic proteins that are typically between 216 to 240 amino acids in length which have two conserved sequence motifs: VLR and TRIY. Beta-casein-like protein is associated with cell morphology and a regulation of growth pattern of tumours. It is found in adenocarcinomas of uterine cervical tissues[]. 
Probab=24.72  E-value=1e+02  Score=24.36  Aligned_cols=44  Identities=7%  Similarity=0.220  Sum_probs=23.5

Q ss_pred             HHHHHHHhhhccchheeeeeeccCCC-CchhHHHHHHHHHHHHHHhhh
Q psy6892          20 VTLILVFCTATGNILVCLAISWEKRL-QNVTNYFLMSLAITDLMVAVL   66 (190)
Q Consensus        20 ~~~ii~vlgi~gN~lvi~vi~~~~~L-~~~~~~fl~nLAvaDLl~~l~   66 (190)
                      +-+.-++++|.+.++.|++   .|++ +.+..+.+..+++...+++..
T Consensus        46 isv~Sgll~I~~GI~AIvl---Srnl~~~~L~W~Ll~~S~ln~LlSaA   90 (188)
T PF12304_consen   46 ISVTSGLLSIICGIVAIVL---SRNLRNRPLHWTLLVVSLLNALLSAA   90 (188)
T ss_pred             HHHHHHHHHHHHhHHHHhh---hccCCCCcchHHHHHHHHHHHHHHHH
Confidence            3334445555554444333   3333 355677777777776666544


No 134
>KOG3832|consensus
Probab=24.70  E-value=2.5e+02  Score=23.03  Aligned_cols=62  Identities=8%  Similarity=0.056  Sum_probs=30.1

Q ss_pred             eeeeeeccCCCCchhHHHHHHHHHHHHHHhhhhcccceeeecccccccccccceeeccchhhhhhh
Q psy6892          35 VCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGILTLVRVRTNLSGAVLRLKKNILCENQLF  100 (190)
Q Consensus        35 vi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~lP~~l~~~l~~~~~~s~~lC~i~~~i~c~~slf  100 (190)
                      ++.-+.+.+++    .++..-+-+++.+.++....+.+..-....|+...++|-+-....|.+..+
T Consensus       225 tf~diqktk~i----qiltslmrw~~~~~g~~l~tmqaeedaa~a~p~arc~cgfgkl~~~aly~~  286 (319)
T KOG3832|consen  225 TFADIQKTKHI----QILTSLMRWAAFILGISLATMQAEEDAAAAHPPARCFCGFGKLFGCALYAF  286 (319)
T ss_pred             eEeccccchhH----HHHHHHHHHHhccchhhhhhhhhcccccccCCccccccccccccchhhHHH
Confidence            34444444443    333333446666655544434444444455666666666544444544333


No 135
>PF10267 Tmemb_cc2:  Predicted transmembrane and coiled-coil 2 protein;  InterPro: IPR019394  This family of transmembrane coiled-coil containing proteins is conserved from worms to humans. Its function is unknown. 
Probab=24.33  E-value=50  Score=29.23  Aligned_cols=15  Identities=7%  Similarity=-0.135  Sum_probs=8.1

Q ss_pred             CCCCchhHHHHHHHH
Q psy6892          43 KRLQNVTNYFLMSLA   57 (190)
Q Consensus        43 ~~L~~~~~~fl~nLA   57 (190)
                      |-++++..++..-|+
T Consensus       367 Pl~~tR~r~~~t~~~  381 (395)
T PF10267_consen  367 PLTRTRLRTLTTLLL  381 (395)
T ss_pred             HHhhccHHHHHHHHH
Confidence            445666555554444


No 136
>COG1272 Predicted membrane protein, hemolysin III homolog [General function prediction only]
Probab=24.17  E-value=1.6e+02  Score=24.03  Aligned_cols=42  Identities=10%  Similarity=-0.014  Sum_probs=24.3

Q ss_pred             HHHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHHH
Q psy6892          19 VVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITD   60 (190)
Q Consensus        19 v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaD   60 (190)
                      ++++++..+|++|-++-+....+.|+++...++.+.-+++.=
T Consensus       119 ~~~~iiW~lal~Gi~~kl~~~~~~r~ls~~~yl~mGw~~v~~  160 (226)
T COG1272         119 ILLGLIWGLALAGILFKLFFKKRFRKLSLVLYLAMGWLGLIV  160 (226)
T ss_pred             HHHHHHHHHHHHHHhhhhhccCcCceeeehhhHHHHHHHHHH
Confidence            344445555555555555555666677777776666655543


No 137
>COG5456 Predicted integral membrane protein linked to a cation pump [Inorganic ion transport and metabolism]
Probab=24.14  E-value=50  Score=25.22  Aligned_cols=34  Identities=15%  Similarity=0.045  Sum_probs=23.7

Q ss_pred             hccchheeeeeeccCCCCchhHHHHHHHHHHHHH
Q psy6892          29 ATGNILVCLAISWEKRLQNVTNYFLMSLAITDLM   62 (190)
Q Consensus        29 i~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl   62 (190)
                      |.-|+++.+.-.+...=....|.|+.++.+.+-.
T Consensus        30 IaVnlvma~~A~~SwSGlVv~NtYvAsqqFN~ka   63 (166)
T COG5456          30 IAVNLVMAWNASRSWSGLVVENTYVASQQFNRKA   63 (166)
T ss_pred             HHHHHHHHHHhhccccceEecchhHHHHHHHHHH
Confidence            4457777666555555556779999999987644


No 138
>KOG4536|consensus
Probab=24.06  E-value=1.1e+02  Score=25.93  Aligned_cols=43  Identities=19%  Similarity=0.326  Sum_probs=22.9

Q ss_pred             HHHHhhhccchheeeeeeccCCC--CchhHHHHHHHHHHHHHHhh
Q psy6892          23 ILVFCTATGNILVCLAISWEKRL--QNVTNYFLMSLAITDLMVAV   65 (190)
Q Consensus        23 ii~vlgi~gN~lvi~vi~~~~~L--~~~~~~fl~nLAvaDLl~~l   65 (190)
                      .++++++.|-++-++....+-+|  |...+.++.-|++.+++.++
T Consensus       200 ~l~Lvy~~~milp~~~~k~r~kLPsr~sFy~Y~~im~~Ln~Lq~~  244 (347)
T KOG4536|consen  200 KLLLVYSYGMILPMYNSKWREKLPSRPSFYVYIGIMAALNLLQLL  244 (347)
T ss_pred             HHHHHHHHHheeeccchhhhhcCCCcchHHHHHHHHHHHHHHHHH
Confidence            33344444444444433333333  34457777778887877654


No 139
>COG3125 CyoD Heme/copper-type cytochrome/quinol oxidase, subunit 4 [Energy production and conversion]
Probab=23.88  E-value=79  Score=22.82  Aligned_cols=20  Identities=25%  Similarity=0.496  Sum_probs=14.8

Q ss_pred             CchhHHHHHHHHHHHHHHhh
Q psy6892          46 QNVTNYFLMSLAITDLMVAV   65 (190)
Q Consensus        46 ~~~~~~fl~nLAvaDLl~~l   65 (190)
                      ++....++..||+..+++=+
T Consensus        45 ~~~~~~~i~~lA~iQi~vqL   64 (111)
T COG3125          45 STVTLIIILGLAVIQILVHL   64 (111)
T ss_pred             hhhHHHHHHHHHHHHHHHHH
Confidence            55677888888888777644


No 140
>PF01545 Cation_efflux:  Cation efflux family;  InterPro: IPR002524 Members of this family are integral membrane proteins, that are found to increase tolerance to divalent metal ions such as cadmium, zinc, and cobalt. These proteins are considered to be efflux pumps that remove these ions from cells [, ], however others are implicated in ion uptake []. The family has six predicted transmembrane domains. Members of the family are variable in length because of variably sized inserts, often containing low-complexity sequence.; GO: 0008324 cation transmembrane transporter activity, 0006812 cation transport, 0055085 transmembrane transport, 0016021 integral to membrane; PDB: 3BYP_B 3BYR_A 3H90_B 2QFI_B 2ZZT_A.
Probab=23.80  E-value=20  Score=29.33  Aligned_cols=10  Identities=10%  Similarity=0.421  Sum_probs=4.0

Q ss_pred             HHHHHHHHhh
Q psy6892          56 LAITDLMVAV   65 (190)
Q Consensus        56 LAvaDLl~~l   65 (190)
                      -+.+|.+.++
T Consensus       143 ~~~~d~~~s~  152 (284)
T PF01545_consen  143 HSLIDVLSSL  152 (284)
T ss_dssp             HHHHHTS-SS
T ss_pred             hcccchhHHH
Confidence            3344555443


No 141
>PF13120 DUF3974:  Domain of unknown function (DUF3974)
Probab=23.58  E-value=53  Score=23.14  Aligned_cols=17  Identities=18%  Similarity=0.208  Sum_probs=6.9

Q ss_pred             HHHHhhhccchheeeee
Q psy6892          23 ILVFCTATGNILVCLAI   39 (190)
Q Consensus        23 ii~vlgi~gN~lvi~vi   39 (190)
                      +++++-++|-.+|++++
T Consensus        10 l~g~llligftivvl~v   26 (126)
T PF13120_consen   10 LIGTLLLIGFTIVVLLV   26 (126)
T ss_pred             HHHHHHHHHHHHHhhhh
Confidence            33333344444444333


No 142
>PF09801 SYS1:  Integral membrane protein S linking to the trans Golgi network;  InterPro: IPR019185  Members of this family are integral membrane proteins involved in protein trafficking between the late Golgi and endosome. They may also serve as a receptor for ADP-ribosylation factor-related protein 1 (ARFRP1) []. Sys1p is a small integral membrane protein with four predicted transmembrane domains that localises to the Trans Golgi network TGN in yeast and human cells []. 
Probab=23.50  E-value=1.3e+02  Score=22.65  Aligned_cols=10  Identities=20%  Similarity=0.381  Sum_probs=4.3

Q ss_pred             HHHHHHHHHH
Q psy6892          50 NYFLMSLAIT   59 (190)
Q Consensus        50 ~~fl~nLAva   59 (190)
                      .++++||.++
T Consensus        93 Tlh~iHli~~  102 (144)
T PF09801_consen   93 TLHFIHLIIC  102 (144)
T ss_pred             HHHHHHHHHH
Confidence            3444444433


No 143
>KOG2513|consensus
Probab=23.41  E-value=1.4e+02  Score=27.98  Aligned_cols=52  Identities=12%  Similarity=0.162  Sum_probs=33.3

Q ss_pred             HHHHHHHHHHHHhhhccchheeeeeeccCCCC---chhHHHHHHHHHHHHHHhhh
Q psy6892          15 NIWAVVTLILVFCTATGNILVCLAISWEKRLQ---NVTNYFLMSLAITDLMVAVL   66 (190)
Q Consensus        15 ~~~~v~~~ii~vlgi~gN~lvi~vi~~~~~L~---~~~~~fl~nLAvaDLl~~l~   66 (190)
                      ..+--++.++.++|++.|..+++......++.   ++.+..+.-+.+-.+++++.
T Consensus       532 G~Wq~~l~~lSvls~vtncaLi~~~~~~~~~~~~~s~~~~il~~V~~EH~~l~Lk  586 (647)
T KOG2513|consen  532 GAWQNALELLSVLSCVTNCALIGMYPQVNKLWPLLSPENAILIIVILEHVLLLLK  586 (647)
T ss_pred             HHHHHHHHHHHHHHHHHhHHHHhccccccccCcccchHHHHHHHHHHHHHHHHHH
Confidence            34445666777888888887776655444433   46677776666666666543


No 144
>KOG2302|consensus
Probab=23.39  E-value=76  Score=31.86  Aligned_cols=24  Identities=25%  Similarity=0.459  Sum_probs=18.3

Q ss_pred             CCCCchhHHHHHHHHHHHHHHhhh
Q psy6892          43 KRLQNVTNYFLMSLAITDLMVAVL   66 (190)
Q Consensus        43 ~~L~~~~~~fl~nLAvaDLl~~l~   66 (190)
                      +..++....|++.+|+.|++++..
T Consensus      1191 ~ssWN~LDgflv~vsviDilvs~a 1214 (1956)
T KOG2302|consen 1191 RSSWNVLDGFLVAVSVIDILVSQA 1214 (1956)
T ss_pred             HHHHHhhhHHHHHHHHHHHHHHHh
Confidence            334566788999999999998754


No 145
>PF06706 CTV_P6:  Citrus tristeza virus 6-kDa protein;  InterPro: IPR009584 This family consists of several Citrus tristeza virus (CTV) 6 kDa, 51 residue long hydrophobic (P6) proteins. The function of this family is unknown.
Probab=23.18  E-value=1.5e+02  Score=17.70  Aligned_cols=15  Identities=13%  Similarity=0.162  Sum_probs=6.8

Q ss_pred             hhccchheeeeeecc
Q psy6892          28 TATGNILVCLAISWE   42 (190)
Q Consensus        28 gi~gN~lvi~vi~~~   42 (190)
                      ++++-++++..++|+
T Consensus        20 AF~~~iiIvitiyrc   34 (51)
T PF06706_consen   20 AFAALIIIVITIYRC   34 (51)
T ss_pred             HHHHHhEEEEEEEEe
Confidence            344444444445544


No 146
>PF09971 DUF2206:  Predicted membrane protein (DUF2206);  InterPro: IPR018701  This family of predicted membrane proteins from archaea has no known function.
Probab=23.14  E-value=88  Score=27.33  Aligned_cols=58  Identities=22%  Similarity=0.321  Sum_probs=25.3

Q ss_pred             hHHHHHHHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhhcccce
Q psy6892          14 WNIWAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGI   72 (190)
Q Consensus        14 ~~~~~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~lP~~l   72 (190)
                      +.+...+.+++..+.++|=+.+++-..+++.... ..++..++-..=++++.+.+|.+.
T Consensus       114 ~~i~k~i~l~~~~~i~IG~l~~~~~~~~~k~~~~-~~Yl~fs~~~~iiLia~i~lP~fa  171 (367)
T PF09971_consen  114 HSISKYIHLFIQFFIIIGFLALILKRIYKKIKFN-IEYLAFSLVSLIILIASIVLPFFA  171 (367)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh-HHHHHHHHHHHHHHHHHHhccchh
Confidence            3333444444444445554444433323222222 333444444434445555677664


No 147
>PF03839 Sec62:  Translocation protein Sec62;  InterPro: IPR004728 Members of the NSCC2 family have been sequenced from various yeast, fungal and animals species including Saccharomyces cerevisiae, Drosophila melanogaster and Homo sapiens. These proteins are the Sec62 proteins, believed to be associated with the Sec61 and Sec63 constituents of the general protein secretary systems of yeast microsomes. They are also the non-selective cation (NS) channels of the mammalian cytoplasmic membrane. The yeast Sec62 protein has been shown to be essential for cell growth. The mammalian NS channel proteins have been implicated in platelet derived growth factor(PGDF) dependent single channel current in fibroblasts. These channels are essentially closed in serum deprived tissue-culture cells and are specifically opened by exposure to PDGF. These channels are reported to exhibit equal selectivity for Na+, K+ and Cs+ with low permeability to Ca2+, and no permeability to anions.; GO: 0008565 protein transporter activity, 0015031 protein transport, 0016021 integral to membrane
Probab=22.93  E-value=48  Score=26.97  Aligned_cols=7  Identities=0%  Similarity=-0.382  Sum_probs=3.4

Q ss_pred             ccccccc
Q psy6892          79 RTNLSGA   85 (190)
Q Consensus        79 ~~~~s~~   85 (190)
                      .|.++..
T Consensus       177 fWlfPNL  183 (224)
T PF03839_consen  177 FWLFPNL  183 (224)
T ss_pred             EEeCCcc
Confidence            3555544


No 148
>PF10726 DUF2518:  Protein of function (DUF2518);  InterPro: IPR019664  This entry contains the Ycf51 protein family, which is conserved in Cyanobacteria. The function is not known. 
Probab=22.79  E-value=67  Score=24.30  Aligned_cols=9  Identities=22%  Similarity=0.604  Sum_probs=4.0

Q ss_pred             HHHHHHHHH
Q psy6892          51 YFLMSLAIT   59 (190)
Q Consensus        51 ~fl~nLAva   59 (190)
                      -|+.-|+++
T Consensus        44 ~F~~vLt~g   52 (145)
T PF10726_consen   44 SFMLVLTVG   52 (145)
T ss_pred             HHHHHHHHH
Confidence            344444444


No 149
>PF05751 FixH:  FixH;  InterPro: IPR008620 This family consists of several Rhizobium FixH like proteins. It has been suggested that the four proteins FixG, FixH, FixI, and FixS may participate in a membrane-bound complex coupling the FixI cation pump with a redox process catalysed by FixG [].
Probab=22.50  E-value=75  Score=23.22  Aligned_cols=40  Identities=13%  Similarity=0.101  Sum_probs=23.9

Q ss_pred             HHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHH
Q psy6892          20 VTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAIT   59 (190)
Q Consensus        20 ~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAva   59 (190)
                      ++++++++.++.|+.++++-.+.+.-....+||--.++.-
T Consensus         7 ~ii~~~~~~v~~~~~~v~~A~~~~~~lV~~dYY~~g~~y~   46 (146)
T PF05751_consen    7 FIIAFFAVFVVANVTMVYIAISTPDGLVVDDYYEKGLAYN   46 (146)
T ss_pred             hhHHhhhhEeeeeeeEEeeeccCCCCceeccHHHhhhhhh
Confidence            3444455556667666665555554334457778777765


No 150
>PF10190 Tmemb_170:  Putative transmembrane protein 170;  InterPro: IPR019334 This entry represents a group of putative transmembrane proteins conserved from nematodes to humans. The protein is only approximately 130 amino acids in length. The function is unknown. 
Probab=21.88  E-value=50  Score=23.58  Aligned_cols=11  Identities=9%  Similarity=-0.167  Sum_probs=4.3

Q ss_pred             cccccccccee
Q psy6892          79 RTNLSGAVLRL   89 (190)
Q Consensus        79 ~~~~s~~lC~i   89 (190)
                      ....+...|.+
T Consensus        74 ~~~M~~~~a~v   84 (105)
T PF10190_consen   74 GFRMSTWEAMV   84 (105)
T ss_pred             CCcccHHHHHH
Confidence            33444443333


No 151
>cd08762 Cyt_b561_CYBASC3 Vertebrate cytochrome b(561), CYBASC3 gene product. Cytochrome b ascorbate-dependent 3, as found in vertebrates, which might act as a ferric-chelate reductase, catalyzing the reduction of Fe(3+) to Fe(2+), such as associated with the transport of iron from the endosome to the cytoplasm. It is assumed that this protein uses ascorbate as the electron donor. Belongs to the cytochrome b(561) family, which are secretory vesicle-specific electron transport proteins. Cytochromes b(561) are integral membrane proteins that bind two heme groups non-covalently, and may have six alpha-helical trans-membrane segments.
Probab=21.80  E-value=1.2e+02  Score=23.89  Aligned_cols=47  Identities=17%  Similarity=0.124  Sum_probs=26.0

Q ss_pred             HHHHHHHHhhhccchheeeeeec----cCCCCchhHHHHHHHHHHHHHHhh
Q psy6892          19 VVTLILVFCTATGNILVCLAISW----EKRLQNVTNYFLMSLAITDLMVAV   65 (190)
Q Consensus        19 v~~~ii~vlgi~gN~lvi~vi~~----~~~L~~~~~~fl~nLAvaDLl~~l   65 (190)
                      .+++++..+-+.|+.++++-+.+    .++.+++.+.++-.+|+.=.++++
T Consensus        39 P~lMv~Gfi~L~geAiL~Yr~~~~~~~~k~~~K~~H~~L~~~Al~~~vvGl   89 (179)
T cd08762          39 PVLMVTGMVVLYGNAALVYRIPLTWGGPKLPWKLLHAGLLLLAFILTVIGL   89 (179)
T ss_pred             HHHHHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHHHHHHHHHH
Confidence            44555555566788777665433    223345666666666655444443


No 152
>KOG3058|consensus
Probab=21.79  E-value=43  Score=29.05  Aligned_cols=54  Identities=15%  Similarity=0.226  Sum_probs=26.8

Q ss_pred             HHHHHHHhhhccchheeeeeec-cCCC-CchhHHHHHH-------HHHHHHHHhhhhccccee
Q psy6892          20 VTLILVFCTATGNILVCLAISW-EKRL-QNVTNYFLMS-------LAITDLMVAVLVMPLGIL   73 (190)
Q Consensus        20 ~~~ii~vlgi~gN~lvi~vi~~-~~~L-~~~~~~fl~n-------LAvaDLl~~l~~lP~~l~   73 (190)
                      +..+...++.+.|.+++.+++- .++. .+.+.+++.+       .++++++..+..+..+++
T Consensus        70 lafl~~~~~~~l~~v~l~~vHervP~~~pPLPDi~f~~vp~i~wa~~~~e~~~~~~~~~~f~l  132 (351)
T KOG3058|consen   70 LAFLYLFVAALLNSVTLVYVHERVPDPYPPLPDIFFDLVPEIPWAFSLCEIIGMILVVLLFTL  132 (351)
T ss_pred             HHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCcHHHHhcccchHHHHHHHHHHHHHHHHHHHH
Confidence            3344445667788888766652 2321 1223333332       456676665444333333


No 153
>PF03073 TspO_MBR:  TspO/MBR family;  InterPro: IPR004307 Members of this group are involved in transmembrane signalling. In both prokaryotes and mitochondria they are localized to the outer membrane, and have been shown to bind and transport dicarboxylic tetrapyrrole intermediates of the haem biosynthetic pathway [, ]. They are associated with the major outer membrane porins (in prokaryotes) and with the voltage-dependent anion channel (in mitochondria) [].  Rhodobacter sphaeroides TspO (previously CrtK) is involved in signal transduction, functioning as a negative regulator of the expression of some photosynthesis genes (PpsR/AppA repressor/antirepressor regulon). This down-regulation is believed to be in response to oxygen levels. TspO works through (or modulates) the PpsR/AppA system and acts upstream of the site of action of these regulatory proteins []. It has been suggested that the TspO regulatory pathway works by regulating the efflux of certain tetrapyrrole intermediates of the haem/bacteriochlorophyll biosynthetic pathways in response to the availability of molecular oxygen, thereby causing the accumulation of a biosynthetic intermediate that serves as a corepressor for the regulated genes []. A homologue of the TspO protein in Rhizobium meliloti (Sinorhizobium meliloti) is involved in regulating expression of the ndi locus in response to stress conditions []. There is evidence that the S. meliloti TspO acts through, or in addition to, the FixL regulatory system. In animals, the peripheral-type benzodiazepine receptor (PBR, MBR) is a mitochondrial protein (located in the outer mitochondrial membrane) characterised by its ability to bind with nanomolar affinity to a variety of benzodiazepine-like drugs, as well as to dicarboxylic tetrapyrrole intermediates of the haem biosynthetic pathway. Depending upon the tissue, it was shown to be involved in steroidogenesis, haem biosynthesis, apoptosis, cell growth and differentiation, mitochondrial respiratory control, and immune and stress response, but the precise function of the PBR remains unclear. The role of PBR in the regulation of cholesterol transport from the outer to the inner mitochondrial membrane, the rate-determining step in steroid biosynthesis, has been studied in detail. PBR is required for the binding, uptake and release, upon ligand activation, of the substrate cholesterol []. PBR forms a multimeric complex with the voltage-dependent anion channel (VDAC) [] and adenine nucleotide carrier []. Molecular modeling of PBR suggested that it might function as a channel for cholesterol. Indeed, cholesterol uptake and transport by bacterial cells was induced upon PBR expression. Mutagenesis studies identified a cholesterol recognition/interaction motif (CRAC) in the cytoplasmic C terminus of PBR [, ]. In complementation experiments, rat PBR (pk18) functionally substitutes for its homologue TspO in R. sphaeroides, negatively affecting transcription of specific photosynthesis genes []. This suggests that PBR may function as an oxygen sensor, transducing an oxygen-triggered signal leading to an adaptive cellular response. These observations suggest that fundamental aspects of this receptor and the downstream signal transduction pathway are conserved in bacteria and higher eukaryotic mitochondria. The alpha-3 subdivision of the purple bacteria is considered to be a likely source of the endosymbiont that ultimately gave rise to the mitochondrion. Therefore, it is possible that the mammalian PBR remains both evolutionarily and functionally related to the TspO of R. sphaeroides.; GO: 0016021 integral to membrane
Probab=21.73  E-value=1.9e+02  Score=21.41  Aligned_cols=20  Identities=15%  Similarity=0.218  Sum_probs=11.7

Q ss_pred             cCCCCchhHHHHHHHHHHHH
Q psy6892          42 EKRLQNVTNYFLMSLAITDL   61 (190)
Q Consensus        42 ~~~L~~~~~~fl~nLAvaDL   61 (190)
                      .+..+.+..++.++|++.-+
T Consensus        66 ~~~~~~~l~l~~~~l~ln~~   85 (148)
T PF03073_consen   66 SPRRRRALALYAIQLALNFA   85 (148)
T ss_pred             cchhHHHHHHHHHHHHHHHH
Confidence            34445566677777766543


No 154
>COG4758 Predicted membrane protein [Function unknown]
Probab=21.64  E-value=50  Score=26.98  Aligned_cols=17  Identities=12%  Similarity=0.044  Sum_probs=8.1

Q ss_pred             HHHHHHHHHHhhhccch
Q psy6892          17 WAVVTLILVFCTATGNI   33 (190)
Q Consensus        17 ~~v~~~ii~vlgi~gN~   33 (190)
                      +.++++++.+++++++.
T Consensus        10 Ii~Ifl~l~ii~~~~~~   26 (235)
T COG4758          10 IIAIFLALMIIQIVIET   26 (235)
T ss_pred             HHHHHHHHHHHHHHHHH
Confidence            33444444455555554


No 155
>COG5521 Predicted integral membrane protein [Function unknown]
Probab=21.47  E-value=2.3e+02  Score=23.55  Aligned_cols=14  Identities=29%  Similarity=0.309  Sum_probs=6.8

Q ss_pred             HHHHHHHHHHHHHH
Q psy6892          50 NYFLMSLAITDLMV   63 (190)
Q Consensus        50 ~~fl~nLAvaDLl~   63 (190)
                      ++.+++||+--++.
T Consensus       221 ~l~lnclalP~laa  234 (275)
T COG5521         221 SLALNCLALPALAA  234 (275)
T ss_pred             HHHHHhhhhHHHHH
Confidence            45555555544433


No 156
>COG2456 Uncharacterized conserved protein [Function unknown]
Probab=21.41  E-value=56  Score=23.56  Aligned_cols=40  Identities=15%  Similarity=0.020  Sum_probs=21.4

Q ss_pred             cchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhhccc
Q psy6892          31 GNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPL   70 (190)
Q Consensus        31 gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~lP~   70 (190)
                      +-.++++++.|.++-+...+-++.=+++.-+++.....|.
T Consensus        19 ~l~~m~~~~~r~k~~k~s~~~ll~W~~~wv~vlifal~P~   58 (121)
T COG2456          19 ILFLMIYVFFRLKRRKLSLSDLLFWEAFWVFVLIFALFPE   58 (121)
T ss_pred             HHHHHHHHHHHHHHhcCCccchHHHHHHHHHHHHHHhcch
Confidence            3344556666666656555656666665544443334443


No 157
>cd08762 Cyt_b561_CYBASC3 Vertebrate cytochrome b(561), CYBASC3 gene product. Cytochrome b ascorbate-dependent 3, as found in vertebrates, which might act as a ferric-chelate reductase, catalyzing the reduction of Fe(3+) to Fe(2+), such as associated with the transport of iron from the endosome to the cytoplasm. It is assumed that this protein uses ascorbate as the electron donor. Belongs to the cytochrome b(561) family, which are secretory vesicle-specific electron transport proteins. Cytochromes b(561) are integral membrane proteins that bind two heme groups non-covalently, and may have six alpha-helical trans-membrane segments.
Probab=21.29  E-value=1.8e+02  Score=22.77  Aligned_cols=20  Identities=5%  Similarity=0.076  Sum_probs=9.3

Q ss_pred             CchhHHHHHHHHHHHHHHhh
Q psy6892          46 QNVTNYFLMSLAITDLMVAV   65 (190)
Q Consensus        46 ~~~~~~fl~nLAvaDLl~~l   65 (190)
                      |+-.-+..+.|-....+.+.
T Consensus       108 HSWlGl~t~~Lf~lQ~~~Gf  127 (179)
T cd08762         108 HSWVGICTVALFTCQWVMGF  127 (179)
T ss_pred             HHHHHHHHHHHHHHHHHHHH
Confidence            44444444444444444443


No 158
>TIGR00815 sulP high affinity sulphate transporter 1. (2) SO42- (out) + nHCO3- (in) SO42- (in) + nHCO3- (out).
Probab=21.16  E-value=87  Score=28.76  Aligned_cols=54  Identities=11%  Similarity=0.126  Sum_probs=35.9

Q ss_pred             HHHHHHHHHHhhhccchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhh-ccc
Q psy6892          17 WAVVTLILVFCTATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLV-MPL   70 (190)
Q Consensus        17 ~~v~~~ii~vlgi~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~-lP~   70 (190)
                      ++...+.+.+++++.++.+.-.+......+.-.|--++.+.+++++.+++. +|.
T Consensus       261 l~~~a~~ia~v~~~e~l~~a~~~~~~~~~~~d~n~El~a~G~~N~~~~~fg~~p~  315 (563)
T TIGR00815       261 LAPDAIAIAIVGLIESIAIARSFARMTGYKIDANQELVAQGIANIVGSFFSCYPA  315 (563)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCcHHHHHhhHHHHHHHHhCccCC
Confidence            334445566667777766665554444445567888899999999988774 444


No 159
>PF14798 Ca_hom_mod:  Calcium homeostasis modulator
Probab=20.96  E-value=3.4e+02  Score=22.38  Aligned_cols=12  Identities=33%  Similarity=0.293  Sum_probs=4.7

Q ss_pred             HHHHHHhhhccc
Q psy6892          21 TLILVFCTATGN   32 (190)
Q Consensus        21 ~~ii~vlgi~gN   32 (190)
                      ++++.++++.|-
T Consensus        20 ngi~~llt~g~e   31 (251)
T PF14798_consen   20 NGICALLTAGGE   31 (251)
T ss_pred             HHHHHHHHHHHH
Confidence            333333344333


No 160
>KOG2450|consensus
Probab=20.89  E-value=1.3e+02  Score=27.51  Aligned_cols=42  Identities=26%  Similarity=0.318  Sum_probs=26.7

Q ss_pred             hccchheeeeeeccCCCCchhHHHHHHHHHHHHHHhhhhcccceeeeccccc
Q psy6892          29 ATGNILVCLAISWEKRLQNVTNYFLMSLAITDLMVAVLVMPLGILTLVRVRT   80 (190)
Q Consensus        29 i~gN~lvi~vi~~~~~L~~~~~~fl~nLAvaDLl~~l~~lP~~l~~~l~~~~   80 (190)
                      ..||++|+     .+..++|    +..|.++.++- -..+|-.+++++.+.-
T Consensus       184 a~GNtvV~-----Kpae~tp----lsal~~~~l~~-eaG~P~GVvNii~G~G  225 (501)
T KOG2450|consen  184 AAGNTVVL-----KPAEQTP----LTALYLASLCK-EAGFPPGVVNIVPGSG  225 (501)
T ss_pred             hhCcEEEe-----cCCCCCC----chHHHHHHHhH-HhcCCcceEEEccCCC
Confidence            46887764     3445555    34444455544 5688889999888754


No 161
>KOG3145|consensus
Probab=20.85  E-value=2.2e+02  Score=24.51  Aligned_cols=26  Identities=8%  Similarity=0.058  Sum_probs=17.0

Q ss_pred             ccccccccceeeccchhhhhhhhhcc
Q psy6892          79 RTNLSGAVLRLKKNILCENQLFKAGA  104 (190)
Q Consensus        79 ~~~~s~~lC~i~~~i~c~~slfSl~s  104 (190)
                      .|+++..+..+.|......++.....
T Consensus       294 gwsIgnIlLDfTGG~~slLQMilQ~~  319 (372)
T KOG3145|consen  294 GWSIGNILLDFTGGTASLLQMILQSS  319 (372)
T ss_pred             ccccccEEEEecccHHHHHHHHHHHh
Confidence            47777777777776666666655443


No 162
>PHA02910 hypothetical protein; Provisional
Probab=20.80  E-value=43  Score=24.52  Aligned_cols=23  Identities=13%  Similarity=0.079  Sum_probs=16.1

Q ss_pred             hHHHHHHHHHHHHhhhccchhee
Q psy6892          14 WNIWAVVTLILVFCTATGNILVC   36 (190)
Q Consensus        14 ~~~~~v~~~ii~vlgi~gN~lvi   36 (190)
                      ...+..+.+.+|++||+|..+|-
T Consensus        23 kisitsicvslfligiiggyivg   45 (171)
T PHA02910         23 KISITSICVSLFLIGIIGGYIVG   45 (171)
T ss_pred             EeehhhhHHHHHHHhhhccEEEE
Confidence            34455667778888988876653


No 163
>PRK05771 V-type ATP synthase subunit I; Validated
Probab=20.74  E-value=1.4e+02  Score=27.85  Aligned_cols=43  Identities=21%  Similarity=0.157  Sum_probs=22.3

Q ss_pred             HHHhhhccchheeeee---eccC--CCC---chhHHHHHHHHHHHHHHhhh
Q psy6892          24 LVFCTATGNILVCLAI---SWEK--RLQ---NVTNYFLMSLAITDLMVAVL   66 (190)
Q Consensus        24 i~vlgi~gN~lvi~vi---~~~~--~L~---~~~~~fl~nLAvaDLl~~l~   66 (190)
                      -+++|=+|.+++++++   .+.+  ++.   ...-.++..++++-++.+++
T Consensus       359 GmM~gD~GyGLil~l~~~~l~~~~~k~~~~~~~~~~il~~~gi~sii~G~l  409 (646)
T PRK05771        359 GMMLGDAGYGLLLLLIGLLLSFKLKKKSEGLKRLLKILIYLGISTIIWGLL  409 (646)
T ss_pred             HHHHHhHHHHHHHHHHHHHHHHhcccccHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3355666776665442   1221  111   22345556777777777655


No 164
>PF12597 DUF3767:  Protein of unknown function (DUF3767);  InterPro: IPR022533  This group of proteins includes mitochodrial cytochrome c oxidase proteins [], and some transmembrane domain-containing proteins of unknown function known as FAM36A. Proteins in this family are typically between 112 and 199 amino acids in length. 
Probab=20.72  E-value=1.1e+02  Score=22.25  Aligned_cols=16  Identities=13%  Similarity=0.179  Sum_probs=8.1

Q ss_pred             CCCchhHHHHHHHHHH
Q psy6892          44 RLQNVTNYFLMSLAIT   59 (190)
Q Consensus        44 ~L~~~~~~fl~nLAva   59 (190)
                      +.++..|+-+...++.
T Consensus        64 ~~~~A~nwavgsF~l~   79 (118)
T PF12597_consen   64 NPRKAANWAVGSFFLG   79 (118)
T ss_pred             CCccchhhhhHHHHHH
Confidence            4455555555544433


No 165
>TIGR01297 CDF cation diffusion facilitator family transporter. This model describes a broadly distributed family of transporters, a number of which have been shown to transport divalent cations of cobalt, cadmium and/or zinc. The family has six predicted transmembrane domains. Members of the family are variable in length because of variably sized inserts, often containing low-complexity sequence.
Probab=20.55  E-value=30  Score=28.06  Aligned_cols=42  Identities=14%  Similarity=0.137  Sum_probs=17.0

Q ss_pred             HHhhhccchheeeeeec-cCCCCchhHHHHHHHHHHHHHHhhh
Q psy6892          25 VFCTATGNILVCLAISW-EKRLQNVTNYFLMSLAITDLMVAVL   66 (190)
Q Consensus        25 ~vlgi~gN~lvi~vi~~-~~~L~~~~~~fl~nLAvaDLl~~l~   66 (190)
                      .++|++.|+++.+...+ .++.+++.-.-..--..+|++.++.
T Consensus       100 ~~~~~~v~~~~~~~~~~~~~~~~s~~l~a~~~~~~~D~~~s~~  142 (268)
T TIGR01297       100 AIVGLIVNLILALYLHRVGHRLGSLALRAAALHVLSDALSSVG  142 (268)
T ss_pred             HHHHHHHHHHHHHHHHHhCccCCCHHHHHHHHHHHHHHHHHHH
Confidence            33444555544333322 2233344333333333456555443


No 166
>KOG1734|consensus
Probab=20.41  E-value=1e+02  Score=25.96  Aligned_cols=34  Identities=9%  Similarity=0.064  Sum_probs=21.0

Q ss_pred             CCCCCchhhHHHHHHHHHHHHhhhccchheeeee
Q psy6892           6 SFGLDAASWNIWAVVTLILVFCTATGNILVCLAI   39 (190)
Q Consensus         6 ~~~~~~~~~~~~~v~~~ii~vlgi~gN~lvi~vi   39 (190)
                      +-..+.-.+.++..+|.+-+++|++|-++++.+.
T Consensus       129 ~g~tpRlVYkwFl~lyklSy~~g~vGyl~im~~~  162 (328)
T KOG1734|consen  129 SGDTPRLVYKWFLFLYKLSYLLGVVGYLAIMFAQ  162 (328)
T ss_pred             CCCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3333344455666777777888888866655443


No 167
>KOG4452|consensus
Probab=20.27  E-value=2.4e+02  Score=18.53  Aligned_cols=23  Identities=22%  Similarity=0.322  Sum_probs=12.1

Q ss_pred             eeeeccCCCCchhHHHHHHHHHH
Q psy6892          37 LAISWEKRLQNVTNYFLMSLAIT   59 (190)
Q Consensus        37 ~vi~~~~~L~~~~~~fl~nLAva   59 (190)
                      +.+...++-|+...-++++|..+
T Consensus        39 ~~VtStKy~r~l~KELlIsl~aS   61 (79)
T KOG4452|consen   39 IQVTSTKYNRNLLKELLISLTAS   61 (79)
T ss_pred             eeEecchhhHHHHHHHHHHHHHH
Confidence            33345555556555566665533


No 168
>TIGR00834 ae anion exchange protein. They preferentially catalyze anion exchange (antiport) reactions, typically acting as HCO3-:Cl- antiporters, but also transporting a range of other inorganic and organic anions. Additionally, renal Na+:HCO3- cotransporters have been found to be members of the AE family. They catalyze the reabsorption of HCO3- in the renal proximal tubule.
Probab=20.10  E-value=1.7e+02  Score=28.93  Aligned_cols=42  Identities=14%  Similarity=0.114  Sum_probs=29.7

Q ss_pred             cchheeeeeeccCCCCchh--HHHHHHHHHHHHHHhhhhcccce
Q psy6892          31 GNILVCLAISWEKRLQNVT--NYFLMSLAITDLMVAVLVMPLGI   72 (190)
Q Consensus        31 gN~lvi~vi~~~~~L~~~~--~~fl~nLAvaDLl~~l~~lP~~l   72 (190)
                      -|+..+++....++|++++  |+=++-|++..++++++++|+..
T Consensus       672 ~nIts~iv~~~e~kLkKgsgyH~Dllllg~~~~v~sllGLPw~~  715 (900)
T TIGR00834       672 QQITTLIVSKKERKLKKGSGFHLDLLLVVGMGGVAALFGLPWLS  715 (900)
T ss_pred             hhhHHHHhcCccccCCCCcccchHHHHHHHHHHHHHhcCCCccc
Confidence            4555566666667788765  56666677788888888888654


Done!