Query         psy17635
Match_columns 103
No_of_seqs    99 out of 124
Neff          4.5 
Searched_HMMs 46136
Date          Fri Aug 16 19:04:53 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy17635.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/17635hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF13882 Bravo_FIGEY:  Bravo-li  99.9 4.2E-26 9.1E-31  156.3   4.6   70   33-103     1-75  (90)
  2 KOG3513|consensus               98.9 3.5E-11 7.7E-16  109.2  -4.9   99    5-103   922-1027(1051)
  3 PF12273 RCR:  Chitin synthesis  96.6  0.0008 1.7E-08   47.5   1.0   24   11-34      2-25  (130)
  4 PF05454 DAG1:  Dystroglycan (D  96.2  0.0011 2.5E-08   53.6   0.0   35    8-42    142-180 (290)
  5 PF01299 Lamp:  Lysosome-associ  95.3    0.02 4.4E-07   45.4   3.5   27   13-39    275-301 (306)
  6 PF07204 Orthoreo_P10:  Orthore  94.2   0.027 5.9E-07   39.5   1.6   43    5-47     36-81  (98)
  7 PF15102 TMEM154:  TMEM154 prot  94.2   0.025 5.5E-07   42.1   1.5   15   19-33     66-80  (146)
  8 PF12877 DUF3827:  Domain of un  94.1   0.023 4.9E-07   50.8   1.3   32    6-37    265-299 (684)
  9 PF01102 Glycophorin_A:  Glycop  93.7   0.047   1E-06   39.3   2.2   28    7-36     65-92  (122)
 10 PF05393 Hum_adeno_E3A:  Human   93.0   0.082 1.8E-06   36.8   2.3   24   13-36     37-60  (94)
 11 PF05568 ASFV_J13L:  African sw  92.9    0.34 7.4E-06   37.0   5.7   56    6-63     27-82  (189)
 12 PF14979 TMEM52:  Transmembrane  92.2    0.17 3.6E-06   38.1   3.2   22    7-28     16-37  (154)
 13 PHA03164 hypothetical protein;  91.8    0.24 5.3E-06   33.9   3.4   28   12-39     61-88  (88)
 14 PF14575 EphA2_TM:  Ephrin type  91.6    0.26 5.6E-06   32.4   3.3   25   11-35      1-26  (75)
 15 PF15330 SIT:  SHP2-interacting  91.0    0.15 3.2E-06   35.9   1.7   30   11-40      1-34  (107)
 16 PF12273 RCR:  Chitin synthesis  89.7    0.12 2.5E-06   36.5   0.4   20   10-29      4-23  (130)
 17 PF01034 Syndecan:  Syndecan do  89.0    0.11 2.4E-06   33.9  -0.1   14   33-46     37-50  (64)
 18 PF15050 SCIMP:  SCIMP protein   89.0    0.38 8.2E-06   35.3   2.6   35    8-42      6-43  (133)
 19 PF06024 DUF912:  Nucleopolyhed  88.6    0.48   1E-05   32.5   2.8   28    9-37     63-91  (101)
 20 PF02439 Adeno_E3_CR2:  Adenovi  86.8     1.2 2.6E-05   26.4   3.4   25   12-36      9-34  (38)
 21 PF08173 YbgT_YccB:  Membrane b  86.7     1.1 2.3E-05   25.0   2.9   19    9-27      4-22  (28)
 22 PHA02867 C-type lectin protein  86.5    0.59 1.3E-05   35.2   2.4   25    9-33     13-37  (167)
 23 KOG3956|consensus               85.9    0.66 1.4E-05   38.6   2.6   26   16-41      8-33  (359)
 24 PF10577 UPF0560:  Uncharacteri  85.1    0.94   2E-05   41.6   3.4   30   10-39    273-303 (807)
 25 PHA02650 hypothetical protein;  84.4     2.2 4.7E-05   29.1   4.1   34    6-39     47-80  (81)
 26 TIGR02106 cyd_oper_ybgT cyd op  84.1     1.6 3.5E-05   24.6   2.9   19    9-27      4-22  (30)
 27 PF12297 EVC2_like:  Ellis van   84.0     1.4 3.1E-05   37.8   3.9   38    2-39     61-98  (429)
 28 PF04478 Mid2:  Mid2 like cell   83.4    0.44 9.5E-06   35.8   0.5   16   26-41     68-83  (154)
 29 TIGR01167 LPXTG_anchor LPXTG-m  83.1     2.4 5.1E-05   23.0   3.3   16    2-17      3-18  (34)
 30 PHA02947 S-S bond formation pa  83.0    0.78 1.7E-05   36.2   1.8   34    5-40    174-207 (215)
 31 COG4818 Predicted membrane pro  82.6     1.1 2.4E-05   31.7   2.3   32    8-41     57-88  (105)
 32 PHA02844 putative transmembran  82.5     2.6 5.6E-05   28.4   3.9   29    6-34     46-74  (75)
 33 PF10717 ODV-E18:  Occlusion-de  82.3     1.5 3.2E-05   30.2   2.7   25   15-39     27-51  (85)
 34 PF02009 Rifin_STEVOR:  Rifin/s  82.3    0.78 1.7E-05   37.4   1.6   26    9-34    255-280 (299)
 35 PF05545 FixQ:  Cbb3-type cytoc  81.7     3.9 8.5E-05   24.3   4.2   32    9-40      7-38  (49)
 36 PF00957 Synaptobrevin:  Synapt  80.7     1.9 4.1E-05   28.1   2.8    6   25-30     83-88  (89)
 37 PF07172 GRP:  Glycine rich pro  78.8     2.9 6.3E-05   28.7   3.3   14   44-57     30-43  (95)
 38 PF14991 MLANA:  Protein melan-  78.0    0.34 7.4E-06   35.0  -1.6   31   21-51     35-65  (118)
 39 PF09451 ATG27:  Autophagy-rela  77.3     3.2 6.9E-05   32.6   3.5   34    6-39    197-231 (268)
 40 PRK14740 kdbF potassium-transp  77.1       5 0.00011   22.5   3.3   26   10-35      2-27  (29)
 41 PF15069 FAM163:  FAM163 family  76.9     1.8 3.9E-05   32.2   1.9   20   13-32      9-28  (143)
 42 PF07271 Cytadhesin_P30:  Cytad  76.8    0.63 1.4E-05   38.0  -0.6   42    5-48     62-109 (279)
 43 MTH00036 ATP8 ATP synthase F0   76.6     4.7  0.0001   25.2   3.5   23    1-25      1-23  (54)
 44 PF12768 Rax2:  Cortical protei  76.0     1.2 2.7E-05   35.7   0.9   26   13-38    232-260 (281)
 45 PF13209 DUF4017:  Protein of u  75.8     1.9 4.2E-05   27.7   1.6   33    4-36     25-59  (60)
 46 PF14584 DUF4446:  Protein of u  75.4     1.8 3.8E-05   32.0   1.5   22   11-32      2-23  (151)
 47 PF00974 Rhabdo_glycop:  Rhabdo  75.4    0.91   2E-05   39.0   0.0   22   11-32    454-476 (501)
 48 PF14914 LRRC37AB_C:  LRRC37A/B  74.4     3.2 6.9E-05   31.3   2.7   23   10-32    119-141 (154)
 49 PF12606 RELT:  Tumour necrosis  74.3     4.1 8.8E-05   25.3   2.7   35   11-46      3-37  (50)
 50 KOG3637|consensus               73.4     2.6 5.6E-05   39.5   2.4   29    9-37    977-1010(1030)
 51 PHA02662 ORF131 putative membr  72.7     2.2 4.7E-05   33.9   1.5   31   10-41    186-216 (226)
 52 PF11694 DUF3290:  Protein of u  72.5     4.7  0.0001   29.7   3.2   33    9-41     15-47  (149)
 53 PHA03240 envelope glycoprotein  72.3     4.4 9.6E-05   32.6   3.2   15    6-20    210-224 (258)
 54 PRK02251 putative septation in  72.2       5 0.00011   27.5   3.0   33    7-42     29-61  (87)
 55 PTZ00046 rifin; Provisional     72.1     2.8 6.1E-05   35.2   2.2   28    9-36    314-341 (358)
 56 PF11807 DUF3328:  Domain of un  71.9     5.5 0.00012   28.3   3.3   25    4-28      3-27  (217)
 57 PF14575 EphA2_TM:  Ephrin type  71.8     3.3 7.2E-05   27.1   2.0   24   14-37      9-32  (75)
 58 PHA02955 hypothetical protein;  71.7       2 4.4E-05   33.8   1.2   28   10-40    180-207 (213)
 59 COG4890 Predicted outer membra  71.5     6.2 0.00013   23.2   2.9   21    9-29      4-24  (37)
 60 PRK13275 mtrF tetrahydromethan  71.3     6.8 0.00015   25.8   3.4   25    6-30     40-64  (67)
 61 PF02480 Herpes_gE:  Alphaherpe  71.0     1.3 2.9E-05   37.6   0.0   32   10-41    353-386 (439)
 62 PRK14749 hypothetical protein;  70.8     7.1 0.00015   22.1   2.9   19    9-27      4-22  (30)
 63 PF06103 DUF948:  Bacterial pro  69.7     3.6 7.8E-05   26.9   1.9   21   14-34      3-23  (90)
 64 PRK11901 hypothetical protein;  69.5     4.1 8.9E-05   34.0   2.5   16   15-30     38-53  (327)
 65 TIGR01477 RIFIN variant surfac  69.3     3.6 7.7E-05   34.6   2.2   27   10-36    310-336 (353)
 66 PRK11677 hypothetical protein;  69.2     6.2 0.00013   28.8   3.2   25   11-35      3-27  (134)
 67 TIGR01478 STEVOR variant surfa  69.1       4 8.7E-05   33.6   2.4   13   24-36    275-287 (295)
 68 PF00895 ATP-synt_8:  ATP synth  69.1      12 0.00026   21.8   3.9   30    1-32      1-30  (54)
 69 PF12575 DUF3753:  Protein of u  68.5       8 0.00017   25.8   3.3   18   10-27     47-64  (72)
 70 PF11395 DUF2873:  Protein of u  68.5     9.2  0.0002   23.0   3.2   19   14-32     12-31  (43)
 71 PTZ00370 STEVOR; Provisional    68.4     4.3 9.3E-05   33.5   2.4   14   24-37    271-284 (296)
 72 PF11669 WBP-1:  WW domain-bind  67.7     4.9 0.00011   27.8   2.3   11    9-19     20-30  (102)
 73 PF05478 Prominin:  Prominin;    67.5       6 0.00013   35.7   3.3   12   29-40    114-127 (806)
 74 PRK04778 septation ring format  66.9     2.9 6.2E-05   36.1   1.2   23   12-34      2-24  (569)
 75 PF02480 Herpes_gE:  Alphaherpe  66.3     1.9 4.2E-05   36.6   0.0   30    7-36    353-384 (439)
 76 PF13623 SurA_N_2:  SurA N-term  65.9     6.2 0.00013   28.7   2.6   22    5-26      3-24  (145)
 77 PRK14762 membrane protein; Pro  65.6     9.1  0.0002   21.0   2.6   17    9-25      4-20  (27)
 78 PF06143 Baculo_11_kDa:  Baculo  64.4       8 0.00017   26.4   2.7   14   12-25     36-49  (84)
 79 PF09801 SYS1:  Integral membra  64.4     7.7 0.00017   28.3   2.9   31    6-36     55-85  (144)
 80 PF02932 Neur_chan_memb:  Neuro  64.3     5.6 0.00012   26.6   2.0   33    6-38     55-87  (237)
 81 PF05297 Herpes_LMP1:  Herpesvi  63.8     2.3   5E-05   35.7   0.0   11    7-17     35-45  (381)
 82 PF08113 CoxIIa:  Cytochrome c   63.4      12 0.00026   21.7   3.0   19   13-31      6-24  (34)
 83 PF06365 CD34_antigen:  CD34/Po  62.9       3 6.5E-05   32.4   0.5   37   11-47    101-140 (202)
 84 PTZ00382 Variant-specific surf  62.2     7.1 0.00015   26.7   2.2   11   26-36     85-95  (96)
 85 PF14914 LRRC37AB_C:  LRRC37A/B  62.0      10 0.00022   28.6   3.2   27   11-37    123-150 (154)
 86 PF00517 GP41:  Retroviral enve  61.9     2.9 6.2E-05   31.9   0.2   18   25-42    173-190 (204)
 87 PRK11486 flagellar biosynthesi  61.9     7.5 0.00016   28.1   2.4   31    5-35     11-42  (124)
 88 PRK10788 periplasmic folding c  61.8     7.9 0.00017   33.4   2.9   35    7-41      8-42  (623)
 89 PHA03286 envelope glycoprotein  61.0     6.1 0.00013   34.6   2.1   18   19-36    402-419 (492)
 90 TIGR02866 CoxB cytochrome c ox  60.9      11 0.00024   28.2   3.3   18   19-36     23-40  (201)
 91 PRK10927 essential cell divisi  60.9     4.4 9.6E-05   33.7   1.2   56   15-71     35-93  (319)
 92 PRK14758 hypothetical protein;  60.7     6.2 0.00013   21.8   1.4    8   23-30     10-17  (27)
 93 PF04277 OAD_gamma:  Oxaloaceta  60.4      15 0.00032   23.3   3.4   18   10-27      4-22  (79)
 94 PF01788 PsbJ:  PsbJ;  InterPro  60.3      17 0.00037   21.8   3.3   21    8-28      8-28  (40)
 95 COG3190 FliO Flagellar biogene  59.9     9.8 0.00021   28.1   2.7   28    7-34     20-47  (137)
 96 PF02009 Rifin_STEVOR:  Rifin/s  59.8     6.2 0.00014   32.2   1.9   24   13-36    262-285 (299)
 97 PF12669 P12:  Virus attachment  59.4     5.4 0.00012   25.0   1.1   11   28-38     17-27  (58)
 98 PF09472 MtrF:  Tetrahydrometha  59.2      15 0.00032   23.9   3.2   23    7-29     41-63  (64)
 99 MTH00133 ATP8 ATP synthase F0   57.7      16 0.00035   22.3   3.1   24    1-26      1-24  (55)
100 PF15176 LRR19-TM:  Leucine-ric  57.7      11 0.00023   26.8   2.5   15    4-18     10-24  (102)
101 PHA03283 envelope glycoprotein  56.4      12 0.00026   33.2   3.1   25    8-32    396-422 (542)
102 PF05434 Tmemb_9:  TMEM9;  Inte  56.1     6.8 0.00015   29.4   1.4   30   14-43     60-93  (149)
103 PF10779 XhlA:  Haemolysin XhlA  55.7      20 0.00043   22.8   3.3   21    7-27     48-68  (71)
104 PF12575 DUF3753:  Protein of u  55.5      20 0.00043   23.9   3.4   23    5-27     45-67  (72)
105 PF02038 ATP1G1_PLM_MAT8:  ATP1  54.9      12 0.00026   23.4   2.1   16   13-28     17-32  (50)
106 PF07213 DAP10:  DAP10 membrane  54.7      22 0.00048   24.1   3.6   27    8-34     32-59  (79)
107 PF07423 DUF1510:  Protein of u  54.1      16 0.00034   28.7   3.1   23   10-32     14-36  (217)
108 PF06936 Selenoprotein_S:  Sele  53.9     4.3 9.3E-05   31.2   0.0   21    6-26     33-53  (190)
109 PRK00159 putative septation in  53.7      17 0.00037   25.0   2.9   32    7-41     28-59  (87)
110 COG3671 Predicted membrane pro  53.2      16 0.00035   26.7   2.9   37    9-45     75-115 (125)
111 KOG4331|consensus               53.1      12 0.00026   34.8   2.7   16   30-45    130-145 (865)
112 TIGR01433 CyoA cytochrome o ub  53.0      17 0.00037   28.2   3.2    9   25-33     48-56  (226)
113 COG3296 Uncharacterized protei  53.0      15 0.00032   27.4   2.7   30   12-41    103-134 (143)
114 PRK14750 kdpF potassium-transp  52.8      23 0.00051   19.8   2.9   11   13-23      3-13  (29)
115 PTZ00208 65 kDa invariant surf  52.4      16 0.00034   31.7   3.1   15   28-42    406-420 (436)
116 PF15176 LRR19-TM:  Leucine-ric  52.3      13 0.00029   26.3   2.3   29   12-40     21-50  (102)
117 PF13584 BatD:  Oxygen toleranc  51.8      17 0.00036   30.1   3.2   23   10-33    426-448 (484)
118 PF15234 LAT:  Linker for activ  51.7      57  0.0012   25.9   5.9   16   28-43     25-40  (230)
119 PHA03097 C-type lectin-like pr  51.5      18 0.00039   26.3   3.0   18   12-29     13-30  (157)
120 TIGR03063 srtB_target sortase   51.5      26 0.00056   19.5   2.9   17    2-18      2-19  (29)
121 PF06781 UPF0233:  Uncharacteri  51.5      19 0.00041   24.6   2.9   30    6-35     27-56  (87)
122 PF04906 Tweety:  Tweety;  Inte  51.2      16 0.00035   30.5   3.0    7   30-36     45-51  (406)
123 PRK10525 cytochrome o ubiquino  51.0      17 0.00037   29.9   3.1   10    9-18     40-49  (315)
124 PF06396 AGTRAP:  Angiotensin I  50.9      71  0.0015   24.1   6.1   37    7-43     85-122 (162)
125 PF13273 DUF4064:  Protein of u  50.7      25 0.00054   23.2   3.4   28    7-35     55-82  (100)
126 PF14851 FAM176:  FAM176 family  50.3      47   0.001   24.8   5.1   24   10-33     24-47  (153)
127 KOG4482|consensus               50.3      17 0.00036   31.5   3.0   27   11-39    301-327 (449)
128 PF14316 DUF4381:  Domain of un  50.2      18 0.00039   25.7   2.7   11    8-18     20-30  (146)
129 PF07466 DUF1517:  Protein of u  50.1      10 0.00022   30.6   1.6   30    8-37     60-89  (289)
130 PF00558 Vpu:  Vpu protein;  In  50.0      16 0.00035   24.8   2.3   17   13-29      6-22  (81)
131 PF06341 DUF1056:  Protein of u  49.8      24 0.00051   23.0   3.0   26   13-38     37-62  (63)
132 PF13349 DUF4097:  Domain of un  49.8      18 0.00039   25.0   2.6   18   11-28      4-21  (166)
133 PF15431 TMEM190:  Transmembran  49.7      17 0.00036   26.7   2.5   29    7-35     58-87  (134)
134 PF14610 DUF4448:  Protein of u  49.6       9  0.0002   28.4   1.2   16   19-34    168-183 (189)
135 COG1622 CyoA Heme/copper-type   49.6      21 0.00046   28.3   3.3   33    8-40     29-66  (247)
136 PF00957 Synaptobrevin:  Synapt  49.6      27 0.00058   22.6   3.3   21   12-32     67-87  (89)
137 PRK12757 cell division protein  49.4      11 0.00024   30.3   1.7   22   18-39      3-26  (256)
138 PF06900 DUF1270:  Protein of u  48.8      31 0.00067   21.8   3.3   21    9-29      6-26  (53)
139 PF03845 Spore_permease:  Spore  48.7      20 0.00042   27.9   3.0   31    9-39     33-63  (320)
140 PF06400 Alpha-2-MRAP_N:  Alpha  48.7     5.8 0.00013   28.8   0.0   24   22-45      3-26  (120)
141 MTH00186 ATP8 ATP synthase F0   48.5      38 0.00083   20.2   3.7   31    1-33      1-31  (52)
142 PF03908 Sec20:  Sec20;  InterP  48.5      17 0.00037   24.0   2.2   18   17-34     73-90  (92)
143 PF00517 GP41:  Retroviral enve  48.4     8.4 0.00018   29.4   0.9   25   11-35    155-179 (204)
144 PHA02692 hypothetical protein;  48.3      30 0.00064   23.0   3.3    8   10-17     44-51  (70)
145 PF10661 EssA:  WXG100 protein   48.1      17 0.00037   26.6   2.4   11    8-18    116-126 (145)
146 COG2268 Uncharacterized protei  47.3      28 0.00061   30.9   4.0   19   11-29     11-29  (548)
147 PF12670 DUF3792:  Protein of u  47.2      25 0.00055   24.2   3.1   28    6-33     62-89  (116)
148 PRK10299 PhoPQ regulatory prot  47.1      25 0.00055   21.7   2.6   21    9-29      4-24  (47)
149 PF03954 Lectin_N:  Hepatic lec  46.9      19 0.00041   26.7   2.5   23   11-33     32-54  (138)
150 COG4218 MtrF Tetrahydromethano  46.9      30 0.00064   23.2   3.1   22    6-27     49-70  (73)
151 PF15339 Afaf:  Acrosome format  46.7      24 0.00051   27.6   3.0   20   13-32    130-153 (200)
152 PF13908 Shisa:  Wnt and FGF in  46.5      14  0.0003   27.0   1.8    8   12-19     77-84  (179)
153 KOG3970|consensus               46.3      25 0.00054   28.7   3.2   34    7-43    250-283 (299)
154 PRK13415 flagella biosynthesis  45.9      28  0.0006   27.6   3.4   28   11-38     67-96  (219)
155 PF13179 DUF4006:  Family of un  45.7      26 0.00057   23.0   2.7   15   12-26     13-27  (66)
156 PF02397 Bac_transf:  Bacterial  45.7      22 0.00049   26.9   2.8   24   11-34      6-29  (187)
157 PLN00168 Cytochrome P450; Prov  45.6      24 0.00052   29.3   3.2   26   14-39      6-31  (519)
158 KOG2927|consensus               45.6      23 0.00051   30.1   3.1   31    9-39    225-256 (372)
159 PHA03273 envelope glycoprotein  45.3      14  0.0003   32.4   1.8   24   12-35    450-474 (486)
160 PRK10905 cell division protein  45.3      11 0.00024   31.5   1.1   12   18-29      3-14  (328)
161 PHA03240 envelope glycoprotein  45.0      22 0.00049   28.6   2.8   16   16-31    216-231 (258)
162 CHL00177 ccs1 c-type cytochrom  44.9      18 0.00038   30.8   2.3   26    6-35     70-95  (426)
163 PF03302 VSP:  Giardia variant-  44.8      14  0.0003   30.8   1.6   12   25-36    385-396 (397)
164 PF11359 gpUL132:  Glycoprotein  44.6      74  0.0016   25.5   5.6   13   34-46     80-92  (235)
165 PF00974 Rhabdo_glycop:  Rhabdo  44.2     7.5 0.00016   33.5   0.0   13   20-32    467-479 (501)
166 KOG1311|consensus               44.2      25 0.00054   27.5   3.0   28    5-32    152-179 (299)
167 PF05083 LST1:  LST-1 protein;   43.7     6.8 0.00015   26.3  -0.2   19   19-37      4-22  (74)
168 MTH00102 ATP8 ATP synthase F0   43.7      42  0.0009   21.5   3.4   20    1-22      1-20  (67)
169 PF06842 DUF1242:  Protein of u  43.7      10 0.00022   22.1   0.5   11   22-32      2-12  (36)
170 PF06809 NPDC1:  Neural prolife  43.5      25 0.00055   29.5   3.0   28   12-39    202-229 (341)
171 PF04971 Lysis_S:  Lysis protei  43.4      26 0.00056   23.1   2.5   23   12-34     35-57  (68)
172 TIGR03521 GldG gliding-associa  43.2      23 0.00049   30.6   2.8   30    7-36    520-550 (552)
173 KOG3514|consensus               42.5      24 0.00052   34.4   3.0   41    4-44   1510-1551(1591)
174 PHA02844 putative transmembran  42.5      37 0.00081   22.8   3.1   25    6-30     43-67  (75)
175 MTH00147 ATP8 ATP synthase F0   42.4      58  0.0013   19.8   3.8   32    1-34      1-32  (51)
176 COG3115 ZipA Cell division pro  42.1      36 0.00079   28.4   3.7   23   18-41     13-35  (324)
177 PHA02819 hypothetical protein;  41.7      36 0.00079   22.7   3.0   22    4-25     42-63  (71)
178 TIGR02588 conserved hypothetic  41.3      36 0.00078   24.7   3.1   33    9-41      3-37  (122)
179 PHA02692 hypothetical protein;  41.3      34 0.00073   22.7   2.8   15    4-18     41-56  (70)
180 PF14798 Ca_hom_mod:  Calcium h  41.0      34 0.00075   27.2   3.3   27    6-32    178-204 (251)
181 PF03918 CcmH:  Cytochrome C bi  40.8     9.5 0.00021   28.0   0.1   36    4-39     95-130 (148)
182 PRK03427 cell division protein  40.7      42 0.00091   28.1   3.8   30   11-40      7-36  (333)
183 TIGR01195 oadG_fam sodium pump  40.6      50  0.0011   21.9   3.6   12   19-30     17-28  (82)
184 PF02124 Marek_A:  Marek's dise  40.1      26 0.00056   27.5   2.4   18   12-29    188-205 (211)
185 PF07782 DC_STAMP:  DC-STAMP-li  39.8      39 0.00084   25.1   3.2   25    2-26    139-163 (191)
186 MTH00072 ATP8 ATP synthase F0   39.6      51  0.0011   19.5   3.2   24    1-26      1-24  (54)
187 TIGR03503 conserved hypothetic  39.5      14  0.0003   31.3   0.9   28    7-34    344-371 (374)
188 PF00429 TLV_coat:  ENV polypro  38.8      10 0.00022   33.2   0.0   10    7-16    523-532 (561)
189 PF11346 DUF3149:  Protein of u  38.6      33 0.00072   20.5   2.2   17   13-29     11-27  (42)
190 PHA03269 envelope glycoprotein  38.5      24 0.00052   31.4   2.2   18   12-29    541-558 (566)
191 PF14927 Neurensin:  Neurensin   38.5      49  0.0011   24.4   3.5   24    6-29     39-62  (140)
192 PF14884 EFF-AFF:  Type I membr  38.5      25 0.00055   31.5   2.3   24   10-33    557-582 (589)
193 PRK06569 F0F1 ATP synthase sub  38.1      49  0.0011   24.7   3.5   31    1-32      1-31  (155)
194 PF04156 IncA:  IncA protein;    38.1      40 0.00086   24.4   3.0   26   15-40     44-69  (191)
195 KOG1226|consensus               38.1      32 0.00069   31.9   3.0   37   12-48    717-755 (783)
196 PF10164 DUF2367:  Uncharacteri  37.9      40 0.00086   23.7   2.8   22   13-35     67-88  (98)
197 PHA03054 IMV membrane protein;  37.9      46   0.001   22.2   3.0   20    5-24     45-64  (72)
198 PHA02650 hypothetical protein;  37.5      40 0.00086   23.0   2.7   24    6-29     44-67  (81)
199 PRK13453 F0F1 ATP synthase sub  37.4      40 0.00087   24.6   3.0   22   11-32     18-39  (173)
200 PF03597 CcoS:  Cytochrome oxid  37.4      32  0.0007   20.7   2.1   28   13-40      3-30  (45)
201 PF03733 DUF307:  Domain of unk  37.3      56  0.0012   20.1   3.2   20    9-28     11-30  (53)
202 PF09435 DUF2015:  Fungal prote  37.1      18 0.00039   26.5   1.1   26   15-40      4-29  (128)
203 PF04226 Transgly_assoc:  Trans  37.0      65  0.0014   19.2   3.4   19    7-25     25-43  (48)
204 KOG3054|consensus               37.0      45 0.00098   27.4   3.4   35   12-46      4-38  (299)
205 COG1314 SecG Preprotein transl  37.0      57  0.0012   22.1   3.4   13   26-38     16-28  (86)
206 PF14584 DUF4446:  Protein of u  36.9      30 0.00066   25.5   2.2   15   14-28      8-22  (151)
207 PF13106 DUF3961:  Domain of un  36.7      16 0.00035   21.9   0.6   23    5-27     12-34  (40)
208 KOG3653|consensus               36.4      32 0.00069   30.6   2.6   31   10-40    154-185 (534)
209 PRK04989 psbM photosystem II r  36.1      77  0.0017   18.5   3.4   19   12-30      8-26  (35)
210 PF09930 DUF2162:  Predicted tr  36.1      32  0.0007   27.0   2.4   36    6-42    126-161 (224)
211 PF13624 SurA_N_3:  SurA N-term  35.9      12 0.00026   25.8   0.0   21    6-26      5-25  (154)
212 PF12259 DUF3609:  Protein of u  35.7      19 0.00042   30.0   1.2   22   13-34    301-322 (361)
213 MTH00171 ATP8 ATP synthase F0   35.6      74  0.0016   18.8   3.5   23    1-25      1-23  (54)
214 PRK10144 formate-dependent nit  35.6      43 0.00093   24.3   2.8   19    5-23     96-114 (126)
215 PRK14094 psbM photosystem II r  35.4      60  0.0013   20.3   3.0   27   12-39      8-34  (50)
216 PF09685 Tic20:  Tic20-like pro  35.0      46 0.00099   21.6   2.7   16   26-41     86-101 (109)
217 PHA03281 envelope glycoprotein  34.9      26 0.00057   31.6   1.9   26   11-36    558-583 (642)
218 COG3889 Predicted solute bindi  34.9      23  0.0005   33.1   1.6   29    9-37    843-872 (872)
219 PF05915 DUF872:  Eukaryotic pr  34.8      38 0.00083   23.9   2.4    8   30-37     99-106 (115)
220 smart00805 AGTRAP Angiotensin   34.7      34 0.00074   26.0   2.2   12   33-44    112-123 (159)
221 KOG0196|consensus               34.7      43 0.00094   31.7   3.3   17    8-24    546-562 (996)
222 TIGR01294 P_lamban phospholamb  34.6      52  0.0011   20.5   2.7   22    8-29     28-49  (52)
223 MTH00123 ATP8 ATP synthase F0   34.6      67  0.0014   19.3   3.2   23    1-25      1-23  (54)
224 PF13633 N_methyl_3:  Prokaryot  34.6      68  0.0015   16.9   2.8   17   12-28      3-19  (22)
225 PF01529 zf-DHHC:  DHHC palmito  34.5      60  0.0013   22.7   3.4   24    5-28     87-110 (174)
226 PF03264 Cytochrom_NNT:  NapC/N  34.5      12 0.00026   27.2  -0.2   14   10-23      4-17  (173)
227 PHA02975 hypothetical protein;  34.3      54  0.0012   21.7   2.9   21    7-27     40-60  (69)
228 PF06011 TRP:  Transient recept  34.1      55  0.0012   27.1   3.6   35    7-41    385-424 (438)
229 PF10907 DUF2749:  Protein of u  34.0      35 0.00076   22.5   1.9   38   10-49      4-42  (66)
230 PF04790 Sarcoglycan_1:  Sarcog  33.8      49  0.0011   26.5   3.1   24   10-33     14-37  (264)
231 PF13967 RSN1_TM:  Late exocyto  33.7      52  0.0011   23.4   3.0   28   18-45      9-36  (157)
232 KOG4812|consensus               33.7      36 0.00078   27.7   2.3   28    6-33    219-246 (262)
233 TIGR03147 cyt_nit_nrfF cytochr  33.7      51  0.0011   23.9   2.9   30    5-34     96-125 (126)
234 COG4768 Uncharacterized protei  33.6      54  0.0012   24.4   3.1   19   11-29      5-23  (139)
235 PF08078 PsaX:  PsaX family;  I  33.4      51  0.0011   19.4   2.4   15    8-23     15-29  (37)
236 KOG1485|consensus               33.4      26 0.00056   30.2   1.6   29    7-35    227-255 (412)
237 PRK06870 secG preprotein trans  33.4      75  0.0016   20.4   3.5   14   25-38     15-28  (76)
238 PHA02957 hypothetical protein;  33.1      45 0.00097   25.8   2.7   24   12-36     19-42  (206)
239 PF08507 COPI_assoc:  COPI asso  33.0      73  0.0016   22.3   3.6    9   10-18     88-96  (136)
240 PF08374 Protocadherin:  Protoc  32.8      20 0.00044   28.5   0.8   17   13-30     45-61  (221)
241 PF08001 CMV_US:  CMV US;  Inte  32.7      54  0.0012   26.0   3.2   27    7-33    176-202 (231)
242 PF12955 DUF3844:  Domain of un  32.7      66  0.0014   22.6   3.3   27    4-30     62-88  (103)
243 PRK06007 fliF flagellar MS-rin  32.7      44 0.00094   29.1   2.9   15   11-25    436-450 (542)
244 KOG4298|consensus               32.7      63  0.0014   25.8   3.5   26    9-34    148-173 (245)
245 PF12158 DUF3592:  Protein of u  32.1      36 0.00078   23.0   1.9   14   10-23      1-14  (148)
246 TIGR02227 sigpep_I_bact signal  32.1      40 0.00086   24.3   2.2   15   11-25      1-15  (163)
247 PF15106 TMEM156:  TMEM156 prot  32.1      53  0.0012   26.2   3.0   14    7-20    174-187 (226)
248 MTH00260 ATP8 ATP synthase F0   31.9 1.1E+02  0.0024   18.8   3.9   32    1-34      1-32  (53)
249 PRK08476 F0F1 ATP synthase sub  31.9      58  0.0013   23.1   3.0   21   11-31      7-27  (141)
250 PF10853 DUF2650:  Protein of u  31.9      40 0.00087   19.8   1.8   14    6-19     22-35  (38)
251 TIGR02507 MtrF tetrahydrometha  31.7      85  0.0018   20.5   3.4   17    6-22     40-56  (65)
252 PF04639 Baculo_E56:  Baculovir  31.6      56  0.0012   27.2   3.2    9    9-17    276-284 (305)
253 PF06422 PDR_CDR:  CDR ABC tran  31.4      82  0.0018   21.4   3.6   27   19-45     60-87  (103)
254 PF11166 DUF2951:  Protein of u  31.4      63  0.0014   22.8   3.0   21    8-28     72-92  (98)
255 COG3763 Uncharacterized protei  31.3      50  0.0011   22.0   2.3    6   51-56     36-41  (71)
256 PF10003 DUF2244:  Integral mem  31.3      84  0.0018   22.3   3.7   20    6-25      9-28  (140)
257 MTH00047 COX2 cytochrome c oxi  31.1      69  0.0015   24.3   3.5    9   33-41     35-43  (194)
258 PF10617 DUF2474:  Protein of u  31.1      89  0.0019   18.6   3.2   19    6-24      5-23  (40)
259 PF15202 Adipogenin:  Adipogeni  30.9      90  0.0019   21.0   3.5   25    7-31     12-36  (81)
260 PF09788 Tmemb_55A:  Transmembr  30.9      50  0.0011   26.8   2.7   37    6-42    193-232 (256)
261 PF08496 Peptidase_S49_N:  Pept  30.8      62  0.0013   24.0   3.0   18   20-37     20-37  (155)
262 KOG2678|consensus               30.6      63  0.0014   26.1   3.2   19    9-27    215-233 (244)
263 PHA02657 hypothetical protein;  30.5      58  0.0013   22.7   2.7   19    9-27     30-48  (95)
264 PF13172 PepSY_TM_1:  PepSY-ass  30.4      75  0.0016   17.4   2.7   14   12-25     11-24  (34)
265 KOG1024|consensus               30.4      22 0.00048   31.4   0.7   30    6-35    182-212 (563)
266 TIGR00892 2A0113 monocarboxyla  30.2      40 0.00086   27.3   2.1   26   11-36    404-429 (455)
267 TIGR02205 septum_zipA cell div  30.2      43 0.00094   27.2   2.3   10   28-37     21-30  (284)
268 COG5416 Uncharacterized integr  30.1      48   0.001   23.3   2.2    7   10-16     23-29  (98)
269 PF05140 ResB:  ResB-like famil  30.1      49  0.0011   27.7   2.7   25    6-34     50-74  (464)
270 PF04202 Mfp-3:  Foot protein 3  30.0      65  0.0014   21.4   2.7   17   24-40     12-28  (71)
271 PF07856 Orai-1:  Mediator of C  30.0      79  0.0017   23.8   3.6   26    9-34    108-133 (175)
272 PF09889 DUF2116:  Uncharacteri  29.8      78  0.0017   20.1   3.0    9    9-17     38-46  (59)
273 PF07787 DUF1625:  Protein of u  29.8      80  0.0017   24.3   3.7   21   12-32    225-245 (248)
274 PF10821 DUF2567:  Protein of u  29.6      59  0.0013   24.7   2.8   30    6-37     45-74  (167)
275 PRK09174 F0F1 ATP synthase sub  29.6      58  0.0013   24.8   2.8   26    2-28     45-70  (204)
276 PF05808 Podoplanin:  Podoplani  29.6      18 0.00038   27.5   0.0   29   10-41    133-161 (162)
277 PF15192 TMEM213:  TMEM213 fami  29.6      84  0.0018   21.4   3.3   25    5-31     41-66  (82)
278 PF06160 EzrA:  Septation ring   29.6      15 0.00033   31.8  -0.4   11   24-34     10-20  (560)
279 PF12301 CD99L2:  CD99 antigen   29.6      66  0.0014   24.3   3.1   23    6-28    111-133 (169)
280 PF08475 Baculo_VP91_N:  Viral   29.4      39 0.00085   26.0   1.9    6   17-22      4-9   (183)
281 PF01098 FTSW_RODA_SPOVE:  Cell  29.3      54  0.0012   26.3   2.7   30    7-36    263-292 (358)
282 PF04272 Phospholamban:  Phosph  29.1      73  0.0016   19.8   2.7   22    8-29     28-49  (52)
283 PF10183 ESSS:  ESSS subunit of  29.1      72  0.0016   21.9   3.0   18    9-26     61-78  (105)
284 COG4499 Predicted membrane pro  29.0      58  0.0013   28.2   3.0   31    2-32    213-244 (434)
285 PF13124 DUF3963:  Protein of u  29.0 1.2E+02  0.0025   18.1   3.4   23    8-30     16-38  (40)
286 PF09988 DUF2227:  Uncharacteri  28.9      51  0.0011   24.8   2.4   29    1-29     73-101 (169)
287 PRK00269 zipA cell division pr  28.9      32  0.0007   28.3   1.4   18   28-45     21-38  (293)
288 PRK14471 F0F1 ATP synthase sub  28.6      72  0.0016   22.8   3.0   22   11-32      8-29  (164)
289 TIGR03068 srtB_sig_NPQTN sorta  28.3      97  0.0021   17.8   2.9   18    2-19      2-19  (33)
290 PF08135 EPV_E5:  Major transfo  28.2      98  0.0021   18.8   3.1   13   13-25     10-22  (44)
291 PF07760 DUF1616:  Protein of u  28.2      47   0.001   26.2   2.2   24   13-36     87-110 (287)
292 PF07219 HemY_N:  HemY protein   28.2      87  0.0019   21.1   3.2   22    7-28     13-34  (108)
293 TIGR00847 ccoS cytochrome oxid  28.1      63  0.0014   20.0   2.3   27   14-40      5-31  (51)
294 PRK14474 F0F1 ATP synthase sub  28.1      53  0.0011   25.7   2.4   22   11-32      5-26  (250)
295 MTH00152 ND6 NADH dehydrogenas  27.9      85  0.0019   22.9   3.4   29    9-37    132-160 (163)
296 PRK13461 F0F1 ATP synthase sub  27.9      76  0.0016   22.6   3.1   22   11-32      5-26  (159)
297 PF01456 Mucin:  Mucin-like gly  27.8      43 0.00094   23.3   1.7   10   15-24      6-15  (143)
298 KOG4433|consensus               27.6      70  0.0015   28.4   3.3    6   30-35     69-74  (526)
299 PF09777 OSTMP1:  Osteopetrosis  27.5      90  0.0019   24.4   3.6   24    9-32    189-212 (237)
300 COG3105 Uncharacterized protei  27.5      97  0.0021   23.0   3.6   28    7-34      4-31  (138)
301 MTH00045 ND6 NADH dehydrogenas  27.4      80  0.0017   23.2   3.1   27    9-35    128-154 (162)
302 PRK03814 oxaloacetate decarbox  27.4 1.1E+02  0.0024   20.6   3.6   12   19-30     21-32  (85)
303 PHA03289 envelope glycoprotein  27.2      61  0.0013   27.4   2.7   16   15-30    277-292 (352)
304 PRK11901 hypothetical protein;  27.0      79  0.0017   26.5   3.3   30    3-33     30-59  (327)
305 KOG3249|consensus               27.0      94   0.002   24.0   3.5   10   33-42    144-155 (181)
306 PF11857 DUF3377:  Domain of un  27.0      61  0.0013   21.7   2.2    6   33-38     54-59  (74)
307 PF10327 7TM_GPCR_Sri:  Serpent  26.9      37  0.0008   26.9   1.4   22   15-36     96-117 (303)
308 PHA00476 hypothetical protein   26.9 2.4E+02  0.0052   20.2   5.4   43   10-52     38-86  (110)
309 PF14030 DUF4245:  Protein of u  26.8      73  0.0016   23.5   2.8   16   17-32     12-27  (169)
310 PF07672 MFS_Mycoplasma:  Mycop  26.6      82  0.0018   25.6   3.3   31    7-37    169-199 (267)
311 TIGR00810 secG protein translo  26.5      94   0.002   19.8   3.0   14   25-38     14-27  (73)
312 PLN03150 hypothetical protein;  26.5      39 0.00085   29.5   1.5   10   26-35    562-571 (623)
313 PF11696 DUF3292:  Protein of u  26.5      70  0.0015   29.1   3.1   24    6-29    133-156 (642)
314 PF11466 Doppel:  Prion-like pr  26.4      62  0.0013   18.3   1.8   12   10-21      7-18  (30)
315 PF02430 AMA-1:  Apical membran  26.4      22 0.00047   31.1   0.0   25   10-34    429-453 (471)
316 PF06678 DUF1179:  Protein of u  26.4      93   0.002   22.1   3.2   10   10-19      4-13  (103)
317 COG3736 VirB8 Type IV secretor  26.3   1E+02  0.0022   24.7   3.7   24    9-32     42-65  (239)
318 PF01561 Hanta_G2:  Hantavirus   26.3      34 0.00074   30.0   1.1   16   19-34    465-480 (485)
319 TIGR01478 STEVOR variant surfa  26.3      51  0.0011   27.3   2.1   12   17-28    261-272 (295)
320 PF01601 Corona_S2:  Coronaviru  26.3      22 0.00048   32.0   0.0   15   11-25    548-562 (610)
321 PF14316 DUF4381:  Domain of un  26.2      53  0.0011   23.3   2.0   28   13-40     21-50  (146)
322 PF05454 DAG1:  Dystroglycan (D  26.2      22 0.00048   29.0   0.0    7   30-36    165-171 (290)
323 CHL00080 psbM photosystem II p  26.2 1.4E+02  0.0031   17.3   3.4   19   12-30      8-26  (34)
324 PRK14473 F0F1 ATP synthase sub  26.2      81  0.0018   22.6   2.9   20   11-30      8-27  (164)
325 TIGR01655 yxeA_fam conserved h  26.1      73  0.0016   22.1   2.6    8   39-46     32-39  (114)
326 PRK00888 ftsB cell division pr  26.1      82  0.0018   21.7   2.8   12   11-22      3-14  (105)
327 PF10112 Halogen_Hydrol:  5-bro  26.0      86  0.0019   23.1   3.1    9   10-18     10-18  (199)
328 PF01167 Tub:  Tub family;  Int  25.9      21 0.00045   28.0  -0.2   12   27-38      7-18  (246)
329 PRK10905 cell division protein  25.8      50  0.0011   27.7   2.0   17   17-33      5-21  (328)
330 MTH00166 ND6 NADH dehydrogenas  25.7      95  0.0021   22.4   3.3   25   13-37    130-154 (160)
331 PLN00028 nitrate transmembrane  25.6      93   0.002   25.4   3.5   24   10-33    410-433 (476)
332 PTZ00370 STEVOR; Provisional    25.6      55  0.0012   27.1   2.2   18    8-25    248-265 (296)
333 TIGR03644 marine_trans_1 proba  25.5      94   0.002   26.3   3.6   38   14-52    355-392 (404)
334 TIGR01432 QOXA cytochrome aa3   25.4      55  0.0012   24.9   2.0   16   17-32     31-46  (217)
335 PF09911 DUF2140:  Uncharacteri  25.4      65  0.0014   24.3   2.4   19   10-28      2-20  (187)
336 PF05399 EVI2A:  Ectropic viral  25.3      68  0.0015   25.6   2.6    8   32-39    161-169 (227)
337 PF13038 DUF3899:  Domain of un  25.3 1.1E+02  0.0023   19.9   3.2   25    7-31     67-91  (92)
338 PF00003 7tm_3:  7 transmembran  25.3      88  0.0019   22.9   3.1   34    7-40    146-179 (238)
339 TIGR00889 2A0110 nucleoside tr  25.1      79  0.0017   25.3   3.0    9   40-48    409-417 (418)
340 PRK07718 fliL flagellar basal   25.1 1.2E+02  0.0025   21.7   3.6   26   11-36      6-31  (142)
341 PF14715 FixP_N:  N-terminal do  25.0 1.2E+02  0.0026   18.6   3.1   20    9-28     22-41  (51)
342 TIGR03778 VPDSG_CTERM VPDSG-CT  24.9      64  0.0014   17.6   1.7   16   20-35      9-24  (26)
343 PF11862 DUF3382:  Domain of un  24.8 1.1E+02  0.0024   20.6   3.3   36   10-45      4-39  (101)
344 PF15048 OSTbeta:  Organic solu  24.7      76  0.0016   23.2   2.5   28   19-46     45-72  (125)
345 PF02790 COX2_TM:  Cytochrome C  24.7   1E+02  0.0022   19.2   2.9   15   19-33     34-48  (84)
346 TIGR02794 tolA_full TolA prote  24.6      74  0.0016   26.3   2.8   13   17-29      8-20  (346)
347 PF06697 DUF1191:  Protein of u  24.6      15 0.00033   29.9  -1.2   23   14-36    218-241 (278)
348 PF09583 Phageshock_PspG:  Phag  24.5 1.3E+02  0.0028   19.7   3.3   28    7-34     31-63  (65)
349 PF01998 DUF131:  Protein of un  24.5   1E+02  0.0023   19.8   2.9   16   12-27     46-61  (64)
350 TIGR03141 cytochro_ccmD heme e  24.3      42  0.0009   19.8   0.9   21   17-37     12-32  (45)
351 PF15339 Afaf:  Acrosome format  24.1      76  0.0016   24.8   2.6   21   13-33    136-157 (200)
352 KOG3832|consensus               24.1      84  0.0018   25.7   2.9   13    4-16     35-47  (319)
353 PF13858 DUF4199:  Protein of u  24.0      98  0.0021   21.6   3.0   28    9-36    135-163 (163)
354 PF05510 Sarcoglycan_2:  Sarcog  24.0      65  0.0014   27.4   2.4   17   15-31    293-309 (386)
355 PF05297 Herpes_LMP1:  Herpesvi  23.9      26 0.00056   29.6   0.0   25    8-32    160-184 (381)
356 PF13400 Tad:  Putative Flp pil  23.9      44 0.00094   19.4   1.0   15   16-30      6-20  (48)
357 PRK01844 hypothetical protein;  23.8      82  0.0018   21.0   2.4   24   10-34      3-26  (72)
358 PRK05696 fliL flagellar basal   23.7      58  0.0013   23.8   1.8    8   11-18     21-28  (170)
359 PRK13872 conjugal transfer pro  23.6 1.2E+02  0.0025   23.1   3.6    9   30-38     71-79  (228)
360 COG4477 EzrA Negative regulato  23.6      32  0.0007   30.7   0.5   24   13-36      2-25  (570)
361 PF14257 DUF4349:  Domain of un  23.6 1.1E+02  0.0023   23.6   3.3    9   10-18    234-242 (262)
362 PF02158 Neuregulin:  Neureguli  23.6      27 0.00058   30.0   0.0   16   17-32     13-28  (404)
363 KOG3359|consensus               23.2      88  0.0019   28.8   3.2   39    9-49    576-614 (723)
364 KOG1094|consensus               23.2      37  0.0008   31.4   0.8   26    8-33    389-414 (807)
365 cd01811 OASL_repeat1 2'-5' oli  23.1      44 0.00094   22.8   1.0   13   84-96     53-65  (80)
366 PF10873 DUF2668:  Protein of u  23.0      84  0.0018   23.8   2.6   10   32-41     88-97  (155)
367 PF04906 Tweety:  Tweety;  Inte  22.9   1E+02  0.0022   25.9   3.3   12   29-40    205-216 (406)
368 PRK03001 M48 family peptidase;  22.9      63  0.0014   25.3   2.0   35   10-45     29-64  (283)
369 TIGR03142 cytochro_ccmI cytoch  22.6   1E+02  0.0023   21.1   2.8    7   10-16      2-8   (117)
370 PF05568 ASFV_J13L:  African sw  22.5      93   0.002   23.9   2.7   26    7-32     22-47  (189)
371 PRK00523 hypothetical protein;  22.5      82  0.0018   21.0   2.2   13   22-34     15-27  (72)
372 PF04507 DUF576:  Protein of un  22.2      29 0.00064   28.1   0.0   14   19-32     11-24  (257)
373 PHA02681 ORF089 virion membran  22.2      83  0.0018   21.8   2.2    6   30-35     22-27  (92)
374 PF12751 Vac7:  Vacuolar segreg  22.1      91   0.002   26.7   2.9   22   11-32    302-323 (387)
375 PF13074 DUF3938:  Protein of u  22.1      88  0.0019   21.6   2.3   14   10-23     75-88  (102)
376 PRK13836 conjugal transfer pro  22.0 1.3E+02  0.0028   22.8   3.5   15    8-22     29-43  (220)
377 PF13937 DUF4212:  Domain of un  21.8      43 0.00094   22.4   0.7    6   37-42     74-79  (81)
378 PRK10856 cytoskeletal protein   21.8      95  0.0021   25.4   2.9   26    8-33    109-134 (331)
379 TIGR00383 corA magnesium Mg(2+  21.7      70  0.0015   24.8   2.0   13   11-23    290-302 (318)
380 PF07695 7TMR-DISM_7TM:  7TM di  21.6 1.1E+02  0.0025   21.1   2.9   11   30-40    144-154 (205)
381 PTZ00046 rifin; Provisional     21.6      88  0.0019   26.5   2.7   23   14-36    322-344 (358)
382 PF03988 DUF347:  Repeat of Unk  21.5 1.5E+02  0.0031   18.1   3.1   17   19-35     37-53  (55)
383 PF10731 Anophelin:  Thrombin i  21.5 1.8E+02  0.0039   19.0   3.6   10   48-57     26-35  (65)
384 PF15159 PIG-Y:  Phosphatidylin  21.4 1.7E+02  0.0037   19.2   3.5   21    9-29      2-22  (72)
385 PF15145 DUF4577:  Domain of un  21.3 1.5E+02  0.0032   21.7   3.4   35   10-45     62-98  (128)
386 PF04531 Phage_holin_1:  Bacter  21.2 1.5E+02  0.0032   19.6   3.3   23    6-28      8-30  (84)
387 COG4698 Uncharacterized protei  20.9 1.5E+02  0.0032   23.3   3.6   38   10-47     10-48  (197)
388 PF13131 DUF3951:  Protein of u  20.9 1.5E+02  0.0033   18.7   3.0   11   17-27     11-21  (53)
389 COG3462 Predicted membrane pro  20.9 1.2E+02  0.0026   21.9   2.9   18   11-28     50-67  (117)
390 TIGR03746 conj_TIGR03746 integ  20.9 1.3E+02  0.0028   23.6   3.3   23    6-28     12-34  (202)
391 TIGR01477 RIFIN variant surfac  20.8      92   0.002   26.3   2.6   24   12-36    316-339 (353)
392 PF07009 DUF1312:  Protein of u  20.8      31 0.00068   23.6  -0.1    9   33-41     21-29  (113)
393 COG5522 Predicted integral mem  20.8      60  0.0013   26.0   1.5   33   14-46    155-187 (236)
394 PF14029 DUF4244:  Protein of u  20.8   2E+02  0.0044   18.1   3.6   26    3-28     12-37  (56)
395 PF10601 zf-LITAF-like:  LITAF-  20.7 1.6E+02  0.0034   18.4   3.2    8    9-16     28-35  (73)
396 PHA02911 C-type lectin-like pr  20.6 1.1E+02  0.0023   24.4   2.8   19   10-28     30-48  (213)
397 PF06667 PspB:  Phage shock pro  20.6 1.3E+02  0.0028   19.9   2.9    9   33-41     29-37  (75)
398 TIGR02595 PEP_exosort PEP-CTER  20.6      80  0.0017   16.6   1.5    9   28-36     15-23  (26)
399 PF07790 DUF1628:  Protein of u  20.5 1.4E+02   0.003   18.8   2.9   13   13-25      7-19  (80)
400 TIGR02975 phageshock_pspG phag  20.5 1.8E+02   0.004   19.0   3.4   28    7-34     30-62  (64)
401 COG4059 MtrE Tetrahydromethano  20.4      95  0.0021   25.4   2.5   33    8-40    260-294 (304)
402 PF14985 TM140:  TM140 protein   20.4 1.3E+02  0.0028   23.2   3.2   26   12-37    147-173 (181)
403 PF04246 RseC_MucC:  Positive r  20.4      97  0.0021   21.5   2.4   24   15-38    100-123 (135)
404 PRK10884 SH3 domain-containing  20.4   1E+02  0.0022   23.7   2.7   26    7-32    170-195 (206)
405 PHA02851 EEV glycoprotein; Pro  20.4      81  0.0018   24.5   2.1   15   15-29      8-22  (223)
406 PRK11466 hybrid sensory histid  20.3      87  0.0019   27.5   2.5   24    9-32    329-352 (914)
407 PRK06287 cobalt transport prot  20.2 1.3E+02  0.0028   20.9   2.9    9   28-36     97-105 (107)
408 PRK14759 potassium-transportin  20.2 1.8E+02  0.0039   16.3   3.2   19   14-32      6-24  (29)
409 PF11084 DUF2621:  Protein of u  20.2 1.4E+02   0.003   22.3   3.2   20    9-28      3-22  (141)
410 COG1704 LemA Uncharacterized c  20.2 1.2E+02  0.0026   23.5   2.9   14   10-23      3-16  (185)
411 PRK13865 type IV secretion sys  20.1 1.5E+02  0.0032   23.2   3.5   12    7-18     37-48  (229)
412 KOG2052|consensus               20.1      73  0.0016   28.2   2.0   20    9-28    131-150 (513)
413 KOG4221|consensus               20.1      37 0.00081   33.2   0.2   13   22-34   1073-1085(1381)
414 TIGR03321 alt_F1F0_F0_B altern  20.1   1E+02  0.0023   23.7   2.7   22   11-32      5-26  (246)
415 PRK09848 glucuronide transport  20.0 1.4E+02   0.003   23.7   3.5    9   39-47    426-434 (448)

No 1  
>PF13882 Bravo_FIGEY:  Bravo-like intracellular region
Probab=99.92  E-value=4.2e-26  Score=156.29  Aligned_cols=70  Identities=46%  Similarity=0.717  Sum_probs=60.6

Q ss_pred             eecCCCcccchhhhhhcCCC---CCCCCCCCcccCCCCCCC-CC-CCCCCCCCCCCCCCCcchhhhhCCCCCCCCC
Q psy17635         33 KRNRGGKYAVHEREAAHGRH---DYPEEPGFNEYSQPLDGK-PR-TRSSMSSEKHPVESDTDSMAEYGDGDTGTKN  103 (103)
Q Consensus        33 ~R~rGgKY~VkeKE~~~g~~---~~~dd~~F~EYsd~dek~-~~-~s~~~~~~~~~~~~s~DSl~dYgd~~~~~Fn  103 (103)
                      ||||||||+|++||++|+++   ||+| ++|+||+++++++ ++ +|++++++..+.++|+|||++||+++++|||
T Consensus         1 kR~kGGKY~V~eKE~~~~~~~~~~~kd-~~f~ey~~~e~~~~~~~~s~~s~~~~~~~~~s~DSL~~YGd~~~~~FN   75 (90)
T PF13882_consen    1 KRNKGGKYSVKEKEDAHGDPEDQPMKD-ETFGEYSDPEDPKPLKRGSSQSLDGSRKKSSSDDSLDDYGDGDTGQFN   75 (90)
T ss_pred             CCCCCCCCCCccccccCCCCccccccC-CCcccccCcccccccccccccccccCcCcCcCccCHhHhccCCCcccc
Confidence            69999999999999999998   4566 7999999875544 43 4667777888899999999999999999998


No 2  
>KOG3513|consensus
Probab=98.91  E-value=3.5e-11  Score=109.23  Aligned_cols=99  Identities=25%  Similarity=0.254  Sum_probs=84.8

Q ss_pred             chhhhhHHHHHHHHHHHHHHHHHHHhheeecCCCcccchhhhhhcCCC----CCCCCCCCcccCCC-CCCCCC--CCCCC
Q psy17635          5 TVATAGWFIGMLLAIAFLILVLILVCLIKRNRGGKYAVHEREAAHGRH----DYPEEPGFNEYSQP-LDGKPR--TRSSM   77 (103)
Q Consensus         5 ~~at~gWfIg~~~ai~lLllilli~C~i~R~rGgKY~VkeKE~~~g~~----~~~dd~~F~EYsd~-dek~~~--~s~~~   77 (103)
                      .++..+|||+.++++.++|+..++.|+.+|+++++|.|+.+|+.|.+.    .|+.+.+|.+|++. ++...+  +.+..
T Consensus       922 ~~~p~g~~~~~~~~~~~~l~w~~v~~~~nes~v~gYkV~~~~~~~~~~~~~~t~~~~~~~~~p~~~~~~y~i~v~~~~~g 1001 (1051)
T KOG3513|consen  922 DIAPPGNFIWKFSASILLLLWLLVSAFENESEVGGYKVLYREDLQNDIEIILTMKLDAEFPEPSDLDGKYDIKVRGYSPG 1001 (1051)
T ss_pred             ccCCCcceEEeeeeeEEEEEEeeEEEEeecccCcceEEEEeecccCCceeEecccccccccCccccCCcceeEeccccCC
Confidence            457899999999999999999999999999999999999999999887    45677899999875 333333  44556


Q ss_pred             CCCCCCCCCCcchhhhhCCCCCCCCC
Q psy17635         78 SSEKHPVESDTDSMAEYGDGDTGTKN  103 (103)
Q Consensus        78 ~~~~~~~~~s~DSl~dYgd~~~~~Fn  103 (103)
                      +.++++++.+.|||++||..++++|+
T Consensus      1002 gdg~~s~~~~~~s~~~~~~~~~~~~~ 1027 (1051)
T KOG3513|consen 1002 GDGTISEEISIDSLGSYGTMDAGTFS 1027 (1051)
T ss_pred             CCCCcCCccccccccCcCCCCCcccc
Confidence            66888889999999999999999986


No 3  
>PF12273 RCR:  Chitin synthesis regulation, resistance to Congo red;  InterPro: IPR020999  RCR proteins are ER membrane proteins that regulate chitin deposition in fungal cell walls. Although chitin, a linear polymer of beta-1,4-linked N-acetylglucosamine, constitutes only 2% of the cell wall it plays a vital role in the overall protection of the cell wall against stress, noxious chemicals and osmotic pressure changes. Congo red is a cell wall-disrupting benzidine-type dye extensively used in many cell wall mutant studies that specifically targets chitin in yeast cells and inhibits growth. RCR proteins render the yeasts resistant to Congo red by diminishing the content of chitin in the cell wall []. RCR proteins are probably regulating chitin synthase III interact directly with ubiquitin ligase Rsp5, and the VPEY motif is necessary for this, via interaction with the WW domains of Rsp5 []. 
Probab=96.61  E-value=0.0008  Score=47.53  Aligned_cols=24  Identities=21%  Similarity=0.736  Sum_probs=16.9

Q ss_pred             HHHHHHHHHHHHHHHHHHHhheee
Q psy17635         11 WFIGMLLAIAFLILVLILVCLIKR   34 (103)
Q Consensus        11 WfIg~~~ai~lLllilli~C~i~R   34 (103)
                      |.+.+++++++||+|++++|+-+|
T Consensus         2 W~l~~iii~~i~l~~~~~~~~~rR   25 (130)
T PF12273_consen    2 WVLFAIIIVAILLFLFLFYCHNRR   25 (130)
T ss_pred             eeeHHHHHHHHHHHHHHHHHHHHH
Confidence            877777777777777777676543


No 4  
>PF05454 DAG1:  Dystroglycan (Dystrophin-associated glycoprotein 1);  InterPro: IPR008465 Dystroglycan is one of the dystrophin-associated glycoproteins, which is encoded by a 5.5 kb transcript in Homo sapiens. The protein product is cleaved into two non-covalently associated subunits, [alpha] (N-terminal) and [beta] (C-terminal). In skeletal muscle the dystroglycan complex works as a transmembrane linkage between the extracellular matrix and the cytoskeleton [alpha]-dystroglycan is extracellular and binds to merosin ([alpha]-2 laminin) in the basement membrane, while [beta]-dystroglycan is a transmembrane protein and binds to dystrophin, which is a large rod-like cytoskeletal protein, absent in Duchenne muscular dystrophy patients. Dystrophin binds to intracellular actin cables. In this way, the dystroglycan complex, which links the extracellular matrix to the intracellular actin cables, is thought to provide structural integrity in muscle tissues. The dystroglycan complex is also known to serve as an agrin receptor in muscle, where it may regulate agrin-induced acetylcholine receptor clustering at the neuromuscular junction. There is also evidence which suggests the function of dystroglycan as a part of the signal transduction pathway because it is shown that Grb2, a mediator of the Ras-related signal pathway, can interact with the cytoplasmic domain of dystroglycan. In general, aberrant expression of dystrophin-associated protein complex underlies the pathogenesis of Duchenne muscular dystrophy, Becker muscular dystrophy and severe childhood autosomal recessive muscular dystrophy. Interestingly, no genetic disease has been described for either [alpha]- or [beta]-dystroglycan. Dystroglycan is widely distributed in non-muscle tissues as well as in muscle tissues. During epithelial morphogenesis of kidney, the dystroglycan complex is shown to act as a receptor for the basement membrane. Dystroglycan expression in Mus musculus brain and neural retina has also been reported. However, the physiological role of dystroglycan in non-muscle tissues has remained unclear [].; PDB: 1EG4_P.
Probab=96.24  E-value=0.0011  Score=53.61  Aligned_cols=35  Identities=23%  Similarity=0.413  Sum_probs=0.0

Q ss_pred             hhhHHHHH----HHHHHHHHHHHHHHhheeecCCCcccc
Q psy17635          8 TAGWFIGM----LLAIAFLILVLILVCLIKRNRGGKYAV   42 (103)
Q Consensus         8 t~gWfIg~----~~ai~lLllilli~C~i~R~rGgKY~V   42 (103)
                      ...|+.-+    ++|++|||+.++++||.+|+|.||-.+
T Consensus       142 ~d~yL~T~IpaVVI~~iLLIA~iIa~icyrrkR~GK~~~  180 (290)
T PF05454_consen  142 SDDYLHTFIPAVVIAAILLIAGIIACICYRRKRKGKMSL  180 (290)
T ss_dssp             ---------------------------------------
T ss_pred             ccchHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcccccc
Confidence            33455443    333344444455567779999999887


No 5  
>PF01299 Lamp:  Lysosome-associated membrane glycoprotein (Lamp);  InterPro: IPR002000 Lysosome-associated membrane glycoproteins (lamp) [] are integral membrane proteins, specific to lysosomes, and whose exact biological function is not yet clear. Structurally, the lamp proteins consist of two internally homologous lysosome-luminal domains separated by a proline-rich hinge region; at the C-terminal extremity there is a transmembrane region (TM) followed by a very short cytoplasmic tail (C). In each of the duplicated domains, there are two conserved disulphide bonds. This structure is schematically represented in the figure below.   +-----+ +-----+ +-----+ +-----+ | | | | | | | | xCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxxxCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxx +--------------------------++Hinge++--------------------------++TM++C+  In mammals, there are two closely related types of lamp: lamp-1 and lamp-2, which form major components of the lysosome membrane. In chicken lamp-1 is known as LEP100.  Also included in this entry is the macrophage protein CD68 (or macrosialin) [] is a heavily glycosylated integral membrane protein whose structure consists of a mucin-like domain followed by a proline-rich hinge; a single lamp-like domain; a transmembrane region and a short cytoplasmic tail.   Similar to CD68, mammalian lamp-3, which is expressed in lymphoid organs, dendritic cells and in lung, contains all the C-terminal regions but lacks the N-terminal lamp-like region []. In a lamp-family protein from nematodes [] only the part C-terminal to the hinge is conserved. ; GO: 0016020 membrane
Probab=95.25  E-value=0.02  Score=45.40  Aligned_cols=27  Identities=33%  Similarity=0.491  Sum_probs=20.6

Q ss_pred             HHHHHHHHHHHHHHHHHhheeecCCCc
Q psy17635         13 IGMLLAIAFLILVLILVCLIKRNRGGK   39 (103)
Q Consensus        13 Ig~~~ai~lLllilli~C~i~R~rGgK   39 (103)
                      |+|-+||+.|+||+||+++|.|+|--.
T Consensus       275 IaVG~~La~lvlivLiaYli~Rrr~~~  301 (306)
T PF01299_consen  275 IAVGAALAGLVLIVLIAYLIGRRRSRA  301 (306)
T ss_pred             HHHHHHHHHHHHHHHHhheeEeccccc
Confidence            566677888889999999997666443


No 6  
>PF07204 Orthoreo_P10:  Orthoreovirus membrane fusion protein p10;  InterPro: IPR009854 This family consists of several Orthoreovirus membrane fusion protein p10 sequences. p10 is thought to be a multifunctional protein that plays a key role in virus-host interaction [].
Probab=94.20  E-value=0.027  Score=39.49  Aligned_cols=43  Identities=12%  Similarity=0.270  Sum_probs=24.6

Q ss_pred             chhhhhHHHHHHHH-HHHHHHH-HHHHhheeecC-CCcccchhhhh
Q psy17635          5 TVATAGWFIGMLLA-IAFLILV-LILVCLIKRNR-GGKYAVHEREA   47 (103)
Q Consensus         5 ~~at~gWfIg~~~a-i~lLlli-lli~C~i~R~r-GgKY~VkeKE~   47 (103)
                      ++-++.|++....+ ++|+||| +|+.||..|.| -.+=.|-+||.
T Consensus        36 S~~~ayWpyLA~GGG~iLilIii~Lv~CC~~K~K~~~~r~~~~reL   81 (98)
T PF07204_consen   36 SSFVAYWPYLAAGGGLILILIIIALVCCCRAKHKTSAARNTFHREL   81 (98)
T ss_pred             ehHHhhhHHhhccchhhhHHHHHHHHHHhhhhhhhHhhhhHHHHHH
Confidence            35567898876655 4444433 44445444444 55666777774


No 7  
>PF15102 TMEM154:  TMEM154 protein family
Probab=94.15  E-value=0.025  Score=42.07  Aligned_cols=15  Identities=33%  Similarity=0.848  Sum_probs=9.1

Q ss_pred             HHHHHHHHHHHhhee
Q psy17635         19 IAFLILVLILVCLIK   33 (103)
Q Consensus        19 i~lLllilli~C~i~   33 (103)
                      |+|+||+|+++|+|.
T Consensus        66 VLLvlLLl~vV~lv~   80 (146)
T PF15102_consen   66 VLLVLLLLSVVCLVI   80 (146)
T ss_pred             HHHHHHHHHHHHhee
Confidence            455556666677773


No 8  
>PF12877 DUF3827:  Domain of unknown function (DUF3827);  InterPro: IPR024606 The function of the proteins in this entry is not currently known, but one of the human proteins (Q9HCM3 from SWISSPROT) has been implicated in pilocytic astrocytomas [, , ]. In the majority of cases of pilocytic astrocytomas a tandem duplication produces an in-frame fusion of the gene encoding this protein and the BRAF oncogene. The resulting fusion protein has constitutive BRAF kinase activity and is capable of transforming cells. 
Probab=94.09  E-value=0.023  Score=50.80  Aligned_cols=32  Identities=22%  Similarity=0.581  Sum_probs=19.6

Q ss_pred             hhhhhHH-HHHHHHHHHHHHHHHHH--hheeecCC
Q psy17635          6 VATAGWF-IGMLLAIAFLILVLILV--CLIKRNRG   37 (103)
Q Consensus         6 ~at~gWf-Ig~~~ai~lLllilli~--C~i~R~rG   37 (103)
                      -.+..|+ ||++..|++++||++|+  |||+++|.
T Consensus       265 ~~~NlWII~gVlvPv~vV~~Iiiil~~~LCRk~K~  299 (684)
T PF12877_consen  265 PPNNLWIIAGVLVPVLVVLLIIIILYWKLCRKNKL  299 (684)
T ss_pred             CCCCeEEEehHhHHHHHHHHHHHHHHHHHhccccc
Confidence            3456788 67766666666665554  55555554


No 9  
>PF01102 Glycophorin_A:  Glycophorin A;  InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=93.73  E-value=0.047  Score=39.33  Aligned_cols=28  Identities=25%  Similarity=0.487  Sum_probs=15.5

Q ss_pred             hhhhHHHHHHHHHHHHHHHHHHHhheeecC
Q psy17635          7 ATAGWFIGMLLAIAFLILVLILVCLIKRNR   36 (103)
Q Consensus         7 at~gWfIg~~~ai~lLllilli~C~i~R~r   36 (103)
                      +..+=++|+|++|+++  ||||.+||+|.|
T Consensus        65 ~i~~Ii~gv~aGvIg~--Illi~y~irR~~   92 (122)
T PF01102_consen   65 AIIGIIFGVMAGVIGI--ILLISYCIRRLR   92 (122)
T ss_dssp             CHHHHHHHHHHHHHHH--HHHHHHHHHHHS
T ss_pred             ceeehhHHHHHHHHHH--HHHHHHHHHHHh
Confidence            3445556777776553  345555555543


No 10 
>PF05393 Hum_adeno_E3A:  Human adenovirus early E3A glycoprotein;  InterPro: IPR008652 This family consists of several early glycoproteins (E3A), from human adenovirus type 2.; GO: 0016021 integral to membrane
Probab=92.98  E-value=0.082  Score=36.81  Aligned_cols=24  Identities=33%  Similarity=0.602  Sum_probs=16.9

Q ss_pred             HHHHHHHHHHHHHHHHHhheeecC
Q psy17635         13 IGMLLAIAFLILVLILVCLIKRNR   36 (103)
Q Consensus        13 Ig~~~ai~lLllilli~C~i~R~r   36 (103)
                      +.++|.|-++++|+..+||.+|.|
T Consensus        37 ~lvI~~iFil~VilwfvCC~kRkr   60 (94)
T PF05393_consen   37 FLVICGIFILLVILWFVCCKKRKR   60 (94)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHhhh
Confidence            456677767777778888887644


No 11 
>PF05568 ASFV_J13L:  African swine fever virus J13L protein;  InterPro: IPR008385 This family consists of several African swine fever virus (ASFV) j13L proteins [, , ].
Probab=92.90  E-value=0.34  Score=36.96  Aligned_cols=56  Identities=14%  Similarity=0.256  Sum_probs=32.3

Q ss_pred             hhhhhHHHHHHHHHHHHHHHHHHHhheeecCCCcccchhhhhhcCCCCCCCCCCCccc
Q psy17635          6 VATAGWFIGMLLAIAFLILVLILVCLIKRNRGGKYAVHEREAAHGRHDYPEEPGFNEY   63 (103)
Q Consensus         6 ~at~gWfIg~~~ai~lLllilli~C~i~R~rGgKY~VkeKE~~~g~~~~~dd~~F~EY   63 (103)
                      |++.-.+|.+-++|..+++++||.+|..|.|-.- +--++|+.|-..|..|. ..+|-
T Consensus        27 fsthm~tILiaIvVliiiiivli~lcssRKkKaa-AAi~eediQfinpyqDq-qw~~v   82 (189)
T PF05568_consen   27 FSTHMYTILIAIVVLIIIIIVLIYLCSSRKKKAA-AAIEEEDIQFINPYQDQ-QWAGV   82 (189)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHH-hhhhhhcccccCcccch-hhccC
Confidence            4444445544444445556677777776655433 45567788877776653 33343


No 12 
>PF14979 TMEM52:  Transmembrane 52
Probab=92.20  E-value=0.17  Score=38.11  Aligned_cols=22  Identities=36%  Similarity=0.673  Sum_probs=16.1

Q ss_pred             hhhhHHHHHHHHHHHHHHHHHH
Q psy17635          7 ATAGWFIGMLLAIAFLILVLIL   28 (103)
Q Consensus         7 at~gWfIg~~~ai~lLllilli   28 (103)
                      -+..|.|++++.+++|+|+.=+
T Consensus        16 W~~LWyIwLill~~~llLLCG~   37 (154)
T PF14979_consen   16 WSSLWYIWLILLIGFLLLLCGL   37 (154)
T ss_pred             eehhhHHHHHHHHHHHHHHHHH
Confidence            3467999999888777776433


No 13 
>PHA03164 hypothetical protein; Provisional
Probab=91.83  E-value=0.24  Score=33.90  Aligned_cols=28  Identities=29%  Similarity=0.609  Sum_probs=21.7

Q ss_pred             HHHHHHHHHHHHHHHHHHhheeecCCCc
Q psy17635         12 FIGMLLAIAFLILVLILVCLIKRNRGGK   39 (103)
Q Consensus        12 fIg~~~ai~lLllilli~C~i~R~rGgK   39 (103)
                      +|..-.||+.+|+|++++.+...||||+
T Consensus        61 lvLtgLaIamILfiifvlyvFnVnr~~~   88 (88)
T PHA03164         61 LVLTGLAIAMILFIIFVLYVFNVNRGGT   88 (88)
T ss_pred             HHHHHHHHHHHHHHHHHHHheeeccCCC
Confidence            4444556888888888888888888885


No 14 
>PF14575 EphA2_TM:  Ephrin type-A receptor 2 transmembrane domain; PDB: 3KUL_A 2XVD_A 2VX1_A 2VWV_A 2VX0_A 2VWY_A 2VWZ_A 2VWW_A 2VWU_A 2VWX_A ....
Probab=91.63  E-value=0.26  Score=32.41  Aligned_cols=25  Identities=20%  Similarity=0.490  Sum_probs=10.2

Q ss_pred             HHHHHHHH-HHHHHHHHHHHhheeec
Q psy17635         11 WFIGMLLA-IAFLILVLILVCLIKRN   35 (103)
Q Consensus        11 WfIg~~~a-i~lLllilli~C~i~R~   35 (103)
                      |+|+.+++ ++++|++++++.+++|+
T Consensus         1 ~ii~~~~~g~~~ll~~v~~~~~~~rr   26 (75)
T PF14575_consen    1 LIIASIIVGVLLLLVLVIIVIVCFRR   26 (75)
T ss_dssp             -HHHHHHHHHHHHHHHHHHHHCCCTT
T ss_pred             CEEehHHHHHHHHHHhheeEEEEEee
Confidence            45554444 44444344333334443


No 15 
>PF15330 SIT:  SHP2-interacting transmembrane adaptor protein, SIT
Probab=90.98  E-value=0.15  Score=35.89  Aligned_cols=30  Identities=27%  Similarity=0.611  Sum_probs=16.6

Q ss_pred             HHHHHHHHHHHHH-HHHHHHhhee---ecCCCcc
Q psy17635         11 WFIGMLLAIAFLI-LVLILVCLIK---RNRGGKY   40 (103)
Q Consensus        11 WfIg~~~ai~lLl-lilli~C~i~---R~rGgKY   40 (103)
                      |.|-.+++++||| +++-|+|+..   |.|++.|
T Consensus         1 w~Ll~il~llLll~l~asl~~wr~~~rq~k~~~~   34 (107)
T PF15330_consen    1 WLLLGILALLLLLSLAASLLAWRMKQRQKKAGQY   34 (107)
T ss_pred             ChHHHHHHHHHHHHHHHHHHHHHHHhhhccccCC
Confidence            6666666665544 4455556553   3445555


No 16 
>PF12273 RCR:  Chitin synthesis regulation, resistance to Congo red;  InterPro: IPR020999  RCR proteins are ER membrane proteins that regulate chitin deposition in fungal cell walls. Although chitin, a linear polymer of beta-1,4-linked N-acetylglucosamine, constitutes only 2% of the cell wall it plays a vital role in the overall protection of the cell wall against stress, noxious chemicals and osmotic pressure changes. Congo red is a cell wall-disrupting benzidine-type dye extensively used in many cell wall mutant studies that specifically targets chitin in yeast cells and inhibits growth. RCR proteins render the yeasts resistant to Congo red by diminishing the content of chitin in the cell wall []. RCR proteins are probably regulating chitin synthase III interact directly with ubiquitin ligase Rsp5, and the VPEY motif is necessary for this, via interaction with the WW domains of Rsp5 []. 
Probab=89.71  E-value=0.12  Score=36.47  Aligned_cols=20  Identities=20%  Similarity=0.188  Sum_probs=14.0

Q ss_pred             hHHHHHHHHHHHHHHHHHHH
Q psy17635         10 GWFIGMLLAIAFLILVLILV   29 (103)
Q Consensus        10 gWfIg~~~ai~lLllilli~   29 (103)
                      .|+|.|+++|++|++++++-
T Consensus         4 l~~iii~~i~l~~~~~~~~~   23 (130)
T PF12273_consen    4 LFAIIIVAILLFLFLFYCHN   23 (130)
T ss_pred             eHHHHHHHHHHHHHHHHHHH
Confidence            46777777777777777664


No 17 
>PF01034 Syndecan:  Syndecan domain;  InterPro: IPR001050 The syndecans are transmembrane proteoglycans which are involved in the organisation of cytoskeleton and/or actin microfilaments, and have important roles as cell surface receptors during cell-cell and/or cell-matrix interactions [, ]. Structurally, these proteins consist of four separate domains:   A signal sequence; An extracellular domain (ectodomain) of variable length whose sequence is not evolutionary conserved in the various forms of syndecans. The ectodomain contains the sites of attachment of the heparan sulphate glycosaminoglycan side chains;  A transmembrane region;  A highly conserved cytoplasmic domain of about 30 to 35 residues, which could interact with cytoskeletal proteins.    The proteins known to belong to this family are:    Syndecan 1.  Syndecan 2 or fibroglycan.  Syndecan 3 or neuroglycan or N-syndecan.  Syndecan 4 or amphiglycan or ryudocan.  Drosophila syndecan.   Caenorhabditis elegans probable syndecan (F57C7.3).    Syndecan-4, a transmembrane heparan sulphate proteoglycan, is a coreceptor with integrins in cell adhesion. It has been suggested to form a ternary signalling complex with protein kinase Calpha and phosphatidylinositol 4,5-bisphosphate (PIP2). Structural studies have demonstrated that the cytoplasmic domain undergoes a conformational transition and forms a symmetric dimer in the presence of phospholipid activator PIP2, and whose overall structure in solution exhibits a twisted clamp shape having a cavity in the centre of dimeric interface. In addition, it has been observed that the syndecan-4 variable domain interacts, strongly, not only with fatty acyl groups but also the anionic head group of PIP2. These findings indicate that PIP2 promotes oligomerisation of the syndecan-4 cytoplasmic domain for transmembrane signalling and cell-matrix adhesion [, ].; GO: 0008092 cytoskeletal protein binding, 0016020 membrane; PDB: 1EJQ_B 1EJP_B 1YBO_C 1OBY_Q.
Probab=88.99  E-value=0.11  Score=33.94  Aligned_cols=14  Identities=21%  Similarity=0.605  Sum_probs=3.0

Q ss_pred             eecCCCcccchhhh
Q psy17635         33 KRNRGGKYAVHERE   46 (103)
Q Consensus        33 ~R~rGgKY~VkeKE   46 (103)
                      |+.--|.|.|.|..
T Consensus        37 rkkdEGSY~l~e~K   50 (64)
T PF01034_consen   37 RKKDEGSYDLDEPK   50 (64)
T ss_dssp             S------SS--S--
T ss_pred             HhcCCCCccCCCCC
Confidence            44446789888765


No 18 
>PF15050 SCIMP:  SCIMP protein
Probab=88.96  E-value=0.38  Score=35.33  Aligned_cols=35  Identities=17%  Similarity=0.470  Sum_probs=20.6

Q ss_pred             hhhHHHHHHHHHHHHHHHHHHH-hhee--ecCCCcccc
Q psy17635          8 TAGWFIGMLLAIAFLILVLILV-CLIK--RNRGGKYAV   42 (103)
Q Consensus         8 t~gWfIg~~~ai~lLllilli~-C~i~--R~rGgKY~V   42 (103)
                      ...|+|..+.+|++-+.+-||+ |+|+  +++|.|..+
T Consensus         6 ~nFWiiLAVaII~vS~~lglIlyCvcR~~lRqGkkwei   43 (133)
T PF15050_consen    6 DNFWIILAVAIILVSVVLGLILYCVCRWQLRQGKKWEI   43 (133)
T ss_pred             hchHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccee
Confidence            4568876665554444444444 5444  377888864


No 19 
>PF06024 DUF912:  Nucleopolyhedrovirus protein of unknown function (DUF912);  InterPro: IPR009261 This entry is represented by Autographa californica nuclear polyhedrosis virus (AcMNPV), Orf78; it is a family of uncharacterised viral proteins.
Probab=88.58  E-value=0.48  Score=32.54  Aligned_cols=28  Identities=14%  Similarity=0.199  Sum_probs=12.0

Q ss_pred             hhHHHHHHHHHHHHHHHHHHH-hheeecCC
Q psy17635          9 AGWFIGMLLAIAFLILVLILV-CLIKRNRG   37 (103)
Q Consensus         9 ~gWfIg~~~ai~lLllilli~-C~i~R~rG   37 (103)
                      .-|++.+ +.+++++|+.+|. .+|.|.|.
T Consensus        63 iili~ll-s~v~IlVily~IyYFVILRer~   91 (101)
T PF06024_consen   63 IILISLL-SFVCILVILYAIYYFVILRERQ   91 (101)
T ss_pred             chHHHHH-HHHHHHHHHhhheEEEEEeccc
Confidence            3444333 3333444444444 44456544


No 20 
>PF02439 Adeno_E3_CR2:  Adenovirus E3 region protein CR2;  InterPro: IPR003470 Early region 3 (E3) of human adenoviruses (Ads) codes for proteins that appear to control viral interactions with the host []. This region called CR1 (conserved region 1) [] is found three times in Human adenovirus 19 (a subgroup D adenovirus) 49 kDa protein in the E3 region. CR1 is also found in the 20.1 Kd protein of subgroup B adenoviruses. The function of this 80 amino acid region is unknown. This region is probably a divergent immunoglobulin domain.
Probab=86.84  E-value=1.2  Score=26.42  Aligned_cols=25  Identities=20%  Similarity=0.677  Sum_probs=10.6

Q ss_pred             HHHHHHHHHHHHHHHHH-HhheeecC
Q psy17635         12 FIGMLLAIAFLILVLIL-VCLIKRNR   36 (103)
Q Consensus        12 fIg~~~ai~lLllilli-~C~i~R~r   36 (103)
                      +.|++..+++++|..++ +|+.||.+
T Consensus         9 Iv~V~vg~~iiii~~~~YaCcykk~~   34 (38)
T PF02439_consen    9 IVAVVVGMAIIIICMFYYACCYKKHR   34 (38)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHcccc
Confidence            33444444443333222 35556554


No 21 
>PF08173 YbgT_YccB:  Membrane bound YbgT-like protein;  InterPro: IPR012994 This family contains a set of membrane proteins, typically 33 amino acids long. The family has no known function, but the protein is found in the operon CydAB in Escherichia coli. Members have a consensus motif (MWYFXW), which is rich in aromatic residues. The protein forms a single membrane-spanning helix. This family seems to be restricted to proteobacteria [].
Probab=86.67  E-value=1.1  Score=24.99  Aligned_cols=19  Identities=47%  Similarity=0.803  Sum_probs=15.9

Q ss_pred             hhHHHHHHHHHHHHHHHHH
Q psy17635          9 AGWFIGMLLAIAFLILVLI   27 (103)
Q Consensus         9 ~gWfIg~~~ai~lLllill   27 (103)
                      ..|++|+..|+++-++.++
T Consensus         4 faWilG~~lA~~~~i~~a~   22 (28)
T PF08173_consen    4 FAWILGVLLACAFGILNAM   22 (28)
T ss_pred             HHHHHHHHHHHHHHHHHHH
Confidence            4699999999998887764


No 22 
>PHA02867 C-type lectin protein; Provisional
Probab=86.48  E-value=0.59  Score=35.17  Aligned_cols=25  Identities=16%  Similarity=0.384  Sum_probs=21.2

Q ss_pred             hhHHHHHHHHHHHHHHHHHHHhhee
Q psy17635          9 AGWFIGMLLAIAFLILVLILVCLIK   33 (103)
Q Consensus         9 ~gWfIg~~~ai~lLllilli~C~i~   33 (103)
                      .-.||-++.||+|+||++|+||-++
T Consensus        13 ~~~~~~~~~~~~~~~~~~~~v~~~~   37 (167)
T PHA02867         13 SCYAITVLGILCLILFTILVVVTCK   37 (167)
T ss_pred             eeehhHHHHHHHHHHHHHhhheeEE
Confidence            3578999999999999999887665


No 23 
>KOG3956|consensus
Probab=85.88  E-value=0.66  Score=38.56  Aligned_cols=26  Identities=23%  Similarity=0.418  Sum_probs=20.7

Q ss_pred             HHHHHHHHHHHHHHhheeecCCCccc
Q psy17635         16 LLAIAFLILVLILVCLIKRNRGGKYA   41 (103)
Q Consensus        16 ~~ai~lLllilli~C~i~R~rGgKY~   41 (103)
                      ..+++||||+++|--.-....|+||+
T Consensus         8 r~~~allllll~~G~~~~kkqskKyS   33 (359)
T KOG3956|consen    8 RNHFALLLLLLVIGSAHNKKQSKKYS   33 (359)
T ss_pred             HhhHHHHHHHHHhCcccchhhccccc
Confidence            34588888888887767788899997


No 24 
>PF10577 UPF0560:  Uncharacterised protein family UPF0560;  InterPro: IPR018890  This family of proteins has no known function. 
Probab=85.09  E-value=0.94  Score=41.57  Aligned_cols=30  Identities=20%  Similarity=0.329  Sum_probs=15.5

Q ss_pred             hHHHHHHHHHHHHHHHH-HHHhheeecCCCc
Q psy17635         10 GWFIGMLLAIAFLILVL-ILVCLIKRNRGGK   39 (103)
Q Consensus        10 gWfIg~~~ai~lLllil-li~C~i~R~rGgK   39 (103)
                      ..+.++|.+++||+|++ |+++|.||.|-+|
T Consensus       273 ~fLl~ILG~~~livl~lL~vLl~yCrrkc~~  303 (807)
T PF10577_consen  273 VFLLAILGGTALIVLILLCVLLCYCRRKCLK  303 (807)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHhhhcccCC
Confidence            34466666666666553 3333334544444


No 25 
>PHA02650 hypothetical protein; Provisional
Probab=84.37  E-value=2.2  Score=29.13  Aligned_cols=34  Identities=12%  Similarity=0.201  Sum_probs=17.0

Q ss_pred             hhhhhHHHHHHHHHHHHHHHHHHHhheeecCCCc
Q psy17635          6 VATAGWFIGMLLAIAFLILVLILVCLIKRNRGGK   39 (103)
Q Consensus         6 ~at~gWfIg~~~ai~lLllilli~C~i~R~rGgK   39 (103)
                      .....|+|.++.++.+++|+.+-+=.++|+-.|.
T Consensus        47 ~~~~~~ii~i~~v~i~~l~~flYLK~~~r~~~~~   80 (81)
T PHA02650         47 FNGQNFIFLIFSLIIVALFSFFVFKGYTRNLNGR   80 (81)
T ss_pred             chHHHHHHHHHHHHHHHHHHHHHHHHhccccCCC
Confidence            3334444444444444444444455677876653


No 26 
>TIGR02106 cyd_oper_ybgT cyd operon protein YbgT. This model describes a very small (as short as 33 amino acids) protein of unknown function, essentially always found in an operon with CydAB, subunits of the cytochrome d terminal oxidase. It begins with an aromatic motif MWYFXW and appears to contain a membrane-spanning helix. This protein appears to be restricted to the Proteobacteria and exist in a single copy only. We suggest it may be a membrane subunit of the terminal oxidase. The family is named after the E. coli member YbgT. This model excludes the apparently related protein YccB.
Probab=84.11  E-value=1.6  Score=24.63  Aligned_cols=19  Identities=42%  Similarity=0.749  Sum_probs=15.6

Q ss_pred             hhHHHHHHHHHHHHHHHHH
Q psy17635          9 AGWFIGMLLAIAFLILVLI   27 (103)
Q Consensus         9 ~gWfIg~~~ai~lLllill   27 (103)
                      ..|++|+..|+++.++..+
T Consensus         4 faWilG~~lA~~~~v~~a~   22 (30)
T TIGR02106         4 FAWILGTLLACAFGVLNAM   22 (30)
T ss_pred             HHHHHHHHHHHHHHHHHHH
Confidence            3699999999988887664


No 27 
>PF12297 EVC2_like:  Ellis van Creveld protein 2 like protein;  InterPro: IPR022076  This family of proteins is found in eukaryotes. Proteins in this family are typically between 571 and 1310 amino acids in length. There are two conserved sequence motifs: LPA and ELH. EVC2 is implicated in Ellis van Creveld chondrodysplastic dwarfism in humans. Mutations in this protein can give rise to this congenital condition. LIMBIN is a protein which shares around 80% sequence homology with EVC2 and it is implicated in a similar condition in bovine chondrodysplastic dwarfism. 
Probab=84.04  E-value=1.4  Score=37.81  Aligned_cols=38  Identities=21%  Similarity=0.504  Sum_probs=28.1

Q ss_pred             CCcchhhhhHHHHHHHHHHHHHHHHHHHhheeecCCCc
Q psy17635          2 PSDTVATAGWFIGMLLAIAFLILVLILVCLIKRNRGGK   39 (103)
Q Consensus         2 ~~~~~at~gWfIg~~~ai~lLllilli~C~i~R~rGgK   39 (103)
                      |.-+....|.|||++++|||..+.++++--++.-+|+-
T Consensus        61 pNHGlhaagFfvaflvslVL~~l~~f~l~r~~~l~~~~   98 (429)
T PF12297_consen   61 PNHGLHAAGFFVAFLVSLVLTWLCFFLLARTRCLQGRP   98 (429)
T ss_pred             cCcchHHHHHHHHHHHHHHHHHHHHHHHHHHccccccc
Confidence            45577889999999999999988887775444444433


No 28 
>PF04478 Mid2:  Mid2 like cell wall stress sensor;  InterPro: IPR007567 This family represents a region near the C terminus of Mid2, which contains a transmembrane region. The remainder of the protein sequence is serine-rich and of low complexity, and is therefore impossible to align accurately. Mid2 is thought to act as a mechanosensor of cell wall stress. The C-terminal cytoplasmic region of Mid2 is known to interact with Rom2, a guanine nucleotide exchange factor (GEF) for Rho1, which is part of the cell wall integrity signalling pathway [].
Probab=83.41  E-value=0.44  Score=35.84  Aligned_cols=16  Identities=6%  Similarity=0.241  Sum_probs=7.8

Q ss_pred             HHHHhheeecCCCccc
Q psy17635         26 LILVCLIKRNRGGKYA   41 (103)
Q Consensus        26 lli~C~i~R~rGgKY~   41 (103)
                      +|+.|||.|+|--.|-
T Consensus        68 ~lvf~~c~r~kktdfi   83 (154)
T PF04478_consen   68 ALVFIFCIRRKKTDFI   83 (154)
T ss_pred             HhheeEEEecccCccc
Confidence            3334555555544554


No 29 
>TIGR01167 LPXTG_anchor LPXTG-motif cell wall anchor domain. A common feature of this proteins containing this domain appears to be a high proportion of charged and zwitterionic residues immediatedly upstream of the LPXTG motif. This model differs from other descriptions of the LPXTG region by including a portion of that upstream charged region.
Probab=83.06  E-value=2.4  Score=22.96  Aligned_cols=16  Identities=13%  Similarity=0.146  Sum_probs=9.2

Q ss_pred             CCcchhhhhHHHHHHH
Q psy17635          2 PSDTVATAGWFIGMLL   17 (103)
Q Consensus         2 ~~~~~at~gWfIg~~~   17 (103)
                      |+++-..+.|++.+..
T Consensus         3 P~TG~~~~~~~~~~G~   18 (34)
T TIGR01167         3 PKTGESGNSLLLLLGL   18 (34)
T ss_pred             CCCCCcccHHHHHHHH
Confidence            5666555566655543


No 30 
>PHA02947 S-S bond formation pathway protein; Provisional
Probab=83.01  E-value=0.78  Score=36.17  Aligned_cols=34  Identities=35%  Similarity=0.636  Sum_probs=22.7

Q ss_pred             chhhhhHHHHHHHHHHHHHHHHHHHhheeecCCCcc
Q psy17635          5 TVATAGWFIGMLLAIAFLILVLILVCLIKRNRGGKY   40 (103)
Q Consensus         5 ~~at~gWfIg~~~ai~lLllilli~C~i~R~rGgKY   40 (103)
                      +++..-|||.  ++++.+++.++.+|.+||.=+=||
T Consensus       174 ~~~~~~W~i~--~~~~i~~i~~i~i~~irR~i~lky  207 (215)
T PHA02947        174 PYSNKPWFIV--GVVIILIIFVIAICSIKRKINLKY  207 (215)
T ss_pred             CcCCCchHHH--HHHHHHHHHHHHHHHHHHHheeeE
Confidence            4556689998  443444444555899999877776


No 31 
>COG4818 Predicted membrane protein [Function unknown]
Probab=82.60  E-value=1.1  Score=31.73  Aligned_cols=32  Identities=31%  Similarity=0.765  Sum_probs=23.2

Q ss_pred             hhhHHHHHHHHHHHHHHHHHHHhheeecCCCccc
Q psy17635          8 TAGWFIGMLLAIAFLILVLILVCLIKRNRGGKYA   41 (103)
Q Consensus         8 t~gWfIg~~~ai~lLllilli~C~i~R~rGgKY~   41 (103)
                      +.||++.-+..  +..+|+-++|.+|-.||.+|-
T Consensus        57 ~Ig~lls~~v~--l~a~iLwlv~mykAyrGe~fK   88 (105)
T COG4818          57 YIGWLLSGLVG--LAAFILWLVCMYKAYRGERFK   88 (105)
T ss_pred             hhHHHHHhHHH--HHHHHHHHHHHHHHHcCCeec
Confidence            34777766554  444566678999999999994


No 32 
>PHA02844 putative transmembrane protein; Provisional
Probab=82.52  E-value=2.6  Score=28.40  Aligned_cols=29  Identities=14%  Similarity=0.392  Sum_probs=11.2

Q ss_pred             hhhhhHHHHHHHHHHHHHHHHHHHhheee
Q psy17635          6 VATAGWFIGMLLAIAFLILVLILVCLIKR   34 (103)
Q Consensus         6 ~at~gWfIg~~~ai~lLllilli~C~i~R   34 (103)
                      .....|+|.++.++.+++++.+-+=.++|
T Consensus        46 ~~~~~~ii~i~~v~~~~~~~flYLK~~~r   74 (75)
T PHA02844         46 SSTKIWILTIIFVVFATFLTFLYLKAVPR   74 (75)
T ss_pred             hhHHHHHHHHHHHHHHHHHHHHHHheecC
Confidence            33444444433333333333333333444


No 33 
>PF10717 ODV-E18:  Occlusion-derived virus envelope protein ODV-E18;  InterPro: IPR019655  Baculovirus occlusion-derived virus (ODV) derives its envelope from an intranuclear membrane source. Occlusion-derived viral envelope proteins that are detected in viral-induced intranuclear microvesicles, but not detected in the plasma membrane, cytoplasmic membranes, or the nuclear envelope. This entry represents ODV-E18 protein which is encoded by baculovirus late genes with transcription initiating from a TAAG motif. ODV-E18 exists as a dimer in the ODV envelope, which contains a hydrophobic domain that putatively acts as a target or retention signal for intranuclear microvesicles []. ; GO: 0019031 viral envelope
Probab=82.33  E-value=1.5  Score=30.20  Aligned_cols=25  Identities=16%  Similarity=0.603  Sum_probs=12.6

Q ss_pred             HHHHHHHHHHHHHHHhheeecCCCc
Q psy17635         15 MLLAIAFLILVLILVCLIKRNRGGK   39 (103)
Q Consensus        15 ~~~ai~lLllilli~C~i~R~rGgK   39 (103)
                      +|..++.|++|+|++-++.-+.+|.
T Consensus        27 lMtILivLVIIiLlImlfqsSS~~~   51 (85)
T PF10717_consen   27 LMTILIVLVIIILLIMLFQSSSNGN   51 (85)
T ss_pred             HHHHHHHHHHHHHHHHHHhccCCCC
Confidence            4555555555555555555444443


No 34 
>PF02009 Rifin_STEVOR:  Rifin/stevor family;  InterPro: IPR002858 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see []. Several multicopy gene families have been described in Plasmodium falciparum, including the stevor family of subtelomeric open reading frames and the rif interspersed repetitive elements. Both families contain three predicted transmembrane segments. It has been proposed that stevor and rif are members of a larger superfamily that code for variant surface antigens [].
Probab=82.28  E-value=0.78  Score=37.37  Aligned_cols=26  Identities=27%  Similarity=0.599  Sum_probs=19.0

Q ss_pred             hhHHHHHHHHHHHHHHHHHHHhheee
Q psy17635          9 AGWFIGMLLAIAFLILVLILVCLIKR   34 (103)
Q Consensus         9 ~gWfIg~~~ai~lLllilli~C~i~R   34 (103)
                      +.=+|+-++||++++||++|+.+|-|
T Consensus       255 ~t~I~aSiiaIliIVLIMvIIYLILR  280 (299)
T PF02009_consen  255 TTAIIASIIAILIIVLIMVIIYLILR  280 (299)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            34467777788888888888877754


No 35 
>PF05545 FixQ:  Cbb3-type cytochrome oxidase component FixQ;  InterPro: IPR008621 This family consists of several Cbb3-type cytochrome oxidase components (FixQ/CcoQ). FixQ is found in nitrogen fixing bacteria. Since nitrogen fixation is an energy-consuming process, effective symbioses depend on operation of a respiratory chain with a high affinity for O2, closely coupled to ATP production. This requirement is fulfilled by a special three-subunit terminal oxidase (cytochrome terminal oxidase cbb3), which was first identified in Bradyrhizobium japonicum as the product of the fixNOQP operon [].
Probab=81.72  E-value=3.9  Score=24.32  Aligned_cols=32  Identities=9%  Similarity=0.289  Sum_probs=16.6

Q ss_pred             hhHHHHHHHHHHHHHHHHHHHhheeecCCCcc
Q psy17635          9 AGWFIGMLLAIAFLILVLILVCLIKRNRGGKY   40 (103)
Q Consensus         9 ~gWfIg~~~ai~lLllilli~C~i~R~rGgKY   40 (103)
                      +++.-.+...+.+++++.++++..++++..+|
T Consensus         7 ~~~~~~~~~v~~~~~F~gi~~w~~~~~~k~~~   38 (49)
T PF05545_consen    7 QGFARSIGTVLFFVFFIGIVIWAYRPRNKKRF   38 (49)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHcccchhhH
Confidence            34444444455555566555665555553333


No 36 
>PF00957 Synaptobrevin:  Synaptobrevin;  InterPro: IPR001388 Synaptobrevin is an intrinsic membrane protein of small synaptic vesicles [], specialised secretory organelles of neurons that actively accumulate neurotransmitters and participate in their calcium-dependent release by exocytosis. Vesicle function is mediated by proteins in their membranes, although the precise nature of the protein-protein interactions underlying this are still uncertain []. Synaptobrevin may play a role in the molecular events underlying neurotransmitter release and vesicle recycling and may be involved in the regulation of membrane flow in the nerve terminal, a process mediated by interaction with low molecular weight GTP-binding proteins []. Synaptic vesicle-associated membrane proteins (VAMPs) from Torpedo californica (Pacific electric ray) and SNC1 from yeast are related to synaptobrevin.; GO: 0016192 vesicle-mediated transport, 0016021 integral to membrane; PDB: 3EGX_C 2NUP_C 3EGD_C 2NUT_C 1IOU_A 1H8M_A 3B5N_A 3ZYM_A 2NPS_A 1SFC_E ....
Probab=80.71  E-value=1.9  Score=28.10  Aligned_cols=6  Identities=33%  Similarity=1.458  Sum_probs=1.8

Q ss_pred             HHHHHh
Q psy17635         25 VLILVC   30 (103)
Q Consensus        25 illi~C   30 (103)
                      |++++|
T Consensus        83 i~~~~~   88 (89)
T PF00957_consen   83 IIIVIC   88 (89)
T ss_dssp             HHHTT-
T ss_pred             HHHHHh
Confidence            333334


No 37 
>PF07172 GRP:  Glycine rich protein family;  InterPro: IPR010800 This family consists of glycine rich proteins. Some of them may be involved in resistance to environmental stress [].
Probab=78.76  E-value=2.9  Score=28.71  Aligned_cols=14  Identities=14%  Similarity=0.349  Sum_probs=5.6

Q ss_pred             hhhhhcCCCCCCCC
Q psy17635         44 EREAAHGRHDYPEE   57 (103)
Q Consensus        44 eKE~~~g~~~~~dd   57 (103)
                      ++|..+...+.+|+
T Consensus        30 ~~~~~~~~~~v~~~   43 (95)
T PF07172_consen   30 ETEKEEEENEVQDD   43 (95)
T ss_pred             hccccccCCCCCcc
Confidence            33333333344443


No 38 
>PF14991 MLANA:  Protein melan-A; PDB: 2GTZ_F 2GT9_F 3MRO_P 2GUO_C 3MRQ_P 2GTW_C 3L6F_C 3MRP_P.
Probab=77.97  E-value=0.34  Score=35.03  Aligned_cols=31  Identities=19%  Similarity=0.446  Sum_probs=0.0

Q ss_pred             HHHHHHHHHhheeecCCCcccchhhhhhcCC
Q psy17635         21 FLILVLILVCLIKRNRGGKYAVHEREAAHGR   51 (103)
Q Consensus        21 lLllilli~C~i~R~rGgKY~VkeKE~~~g~   51 (103)
                      +|-++|||-|+.+|+|.|==.+++|-...+-
T Consensus        35 ILgiLLliGCWYckRRSGYk~L~~k~~~~gt   65 (118)
T PF14991_consen   35 ILGILLLIGCWYCKRRSGYKTLRDKSLHAGT   65 (118)
T ss_dssp             -------------------------------
T ss_pred             HHHHHHHHhheeeeecchhhhhhhccccccC
Confidence            3334455567777777776678777654443


No 39 
>PF09451 ATG27:  Autophagy-related protein 27;  InterPro: IPR018939 Autophagy is a degradative transport pathway that delivers cytosolic proteins to the lysosome (vacuole) [] and is induced by starvation []. Cytosolic proteins appear inside the vacuole enclosed in autophagic vesicles. Autophagy significantly differs from other transport pathways by using double membrane layered transport intermediates, called autophagosomes [, ]. The breakdown of vesicular transport intermediates is a unique feature of autophagy []. Autophagy can also function in the elimination of invading bacteria and antigens []. There are more than 25 AuTophaGy-related (ATG) genes that are essential for autophagy, although it is still not known how the autophagosome is made. Atg9 is a potential membrane carrier to deliver lipids that are used to form the vesicle. Atg27 is another transmembrane protein, and is a cycling protein []. It acts as an effector of VPS34 phosphatidylinositol 3-phosphate kinase signalling and regulates the cytoplasm to vacuole transport (Cvt) vesicle formation. It is also required for autophagy-dependent cycling of ATG9. 
Probab=77.35  E-value=3.2  Score=32.57  Aligned_cols=34  Identities=21%  Similarity=0.338  Sum_probs=19.8

Q ss_pred             hhhhhHHHHHHHHHHHHH-HHHHHHhheeecCCCc
Q psy17635          6 VATAGWFIGMLLAIAFLI-LVLILVCLIKRNRGGK   39 (103)
Q Consensus         6 ~at~gWfIg~~~ai~lLl-lilli~C~i~R~rGgK   39 (103)
                      ...-|||..+++.++|.+ +.+++--++..+++|.
T Consensus       197 ~~~~g~f~wl~i~~~l~~~~Y~i~g~~~n~~~~g~  231 (268)
T PF09451_consen  197 SGGWGFFTWLFIILFLFLAAYLIFGSWYNYNRYGA  231 (268)
T ss_pred             cccccHHHHHHHHHHHHHHHHhhhhhheeeccCCC
Confidence            345578866665544443 3355556677777665


No 40 
>PRK14740 kdbF potassium-transporting ATPase subunit F; Provisional
Probab=77.13  E-value=5  Score=22.55  Aligned_cols=26  Identities=27%  Similarity=0.337  Sum_probs=20.5

Q ss_pred             hHHHHHHHHHHHHHHHHHHHhheeec
Q psy17635         10 GWFIGMLLAIAFLILVLILVCLIKRN   35 (103)
Q Consensus        10 gWfIg~~~ai~lLllilli~C~i~R~   35 (103)
                      .|+.++-.|.++++++-|++.+.|-.
T Consensus         2 ~~~~wls~a~a~~Lf~YLv~ALlRae   27 (29)
T PRK14740          2 TVLDWLSLALATGLFVYLLVALLRAD   27 (29)
T ss_pred             cHHHHHHHHHHHHHHHHHHHHHhccc
Confidence            48899989988888887777766543


No 41 
>PF15069 FAM163:  FAM163 family
Probab=76.92  E-value=1.8  Score=32.23  Aligned_cols=20  Identities=25%  Similarity=0.610  Sum_probs=11.8

Q ss_pred             HHHHHHHHHHHHHHHHHhhe
Q psy17635         13 IGMLLAIAFLILVLILVCLI   32 (103)
Q Consensus        13 Ig~~~ai~lLllilli~C~i   32 (103)
                      .|-++|.+.||.|+.++|.+
T Consensus         9 tGgILAtVILLcIIaVLCYC   28 (143)
T PF15069_consen    9 TGGILATVILLCIIAVLCYC   28 (143)
T ss_pred             echHHHHHHHHHHHHHHHHH
Confidence            35566666666666666544


No 42 
>PF07271 Cytadhesin_P30:  Cytadhesin P30/P32;  InterPro: IPR009896 This family consists of several Mycoplasma species specific Cytadhesin P32 and P30 proteins. P30 has been found to be membrane associated and localised on the tip organelle. It is thought that it is important in cytadherence and virulence [].; GO: 0007157 heterophilic cell-cell adhesion, 0009405 pathogenesis, 0016021 integral to membrane
Probab=76.81  E-value=0.63  Score=37.95  Aligned_cols=42  Identities=26%  Similarity=0.445  Sum_probs=25.6

Q ss_pred             chhhhhHHHHHHHHHHHHHHHHHHH------hheeecCCCcccchhhhhh
Q psy17635          5 TVATAGWFIGMLLAIAFLILVLILV------CLIKRNRGGKYAVHEREAA   48 (103)
Q Consensus         5 ~~at~gWfIg~~~ai~lLllilli~------C~i~R~rGgKY~VkeKE~~   48 (103)
                      .+..+.|||-+..+++-|++++||+      =++|  |-.|-.+.++|+.
T Consensus        62 ~i~~~~W~~P~v~~~~G~~~v~liLgl~ig~p~~k--rkek~~iee~e~~  109 (279)
T PF07271_consen   62 PITEQSWFIPVVGGSAGLLAVALILGLAIGIPIYK--RKEKRMIEEKEEH  109 (279)
T ss_pred             ecccccceeeeccchhhHHHHHHHHHHhhcchhhh--hhHHHHHHHHHHH
Confidence            3567889997765544444333332      3334  4488888888864


No 43 
>MTH00036 ATP8 ATP synthase F0 subunit 8; Validated
Probab=76.56  E-value=4.7  Score=25.25  Aligned_cols=23  Identities=30%  Similarity=0.648  Sum_probs=15.8

Q ss_pred             CCCcchhhhhHHHHHHHHHHHHHHH
Q psy17635          1 MPSDTVATAGWFIGMLLAIAFLILV   25 (103)
Q Consensus         1 ~~~~~~at~gWfIg~~~ai~lLlli   25 (103)
                      |||-.+  .-||+-++++.++++++
T Consensus         1 MPQLd~--s~Wf~~~~~~W~~l~~~   23 (54)
T MTH00036          1 MPQLDF--TWWLFNFLLAWSLVFIV   23 (54)
T ss_pred             CCCCcc--hHHHHHHHHHHHHHHHH
Confidence            777665  47999998885444433


No 44 
>PF12768 Rax2:  Cortical protein marker for cell polarity
Probab=75.95  E-value=1.2  Score=35.67  Aligned_cols=26  Identities=35%  Similarity=0.736  Sum_probs=12.4

Q ss_pred             HHHHHHHH---HHHHHHHHHhheeecCCC
Q psy17635         13 IGMLLAIA---FLILVLILVCLIKRNRGG   38 (103)
Q Consensus        13 Ig~~~ai~---lLllilli~C~i~R~rGg   38 (103)
                      |++-||+.   ||.|+-+|+.+++|+|-|
T Consensus       232 IslAiALG~v~ll~l~Gii~~~~~r~~~~  260 (281)
T PF12768_consen  232 ISLAIALGTVFLLVLIGIILAYIRRRRQG  260 (281)
T ss_pred             EehHHHHHHHHHHHHHHHHHHHHHhhhcc
Confidence            45555533   333444445666555433


No 45 
>PF13209 DUF4017:  Protein of unknown function (DUF4017)
Probab=75.80  E-value=1.9  Score=27.72  Aligned_cols=33  Identities=30%  Similarity=0.711  Sum_probs=25.6

Q ss_pred             cchhhhhH--HHHHHHHHHHHHHHHHHHhheeecC
Q psy17635          4 DTVATAGW--FIGMLLAIAFLILVLILVCLIKRNR   36 (103)
Q Consensus         4 ~~~at~gW--fIg~~~ai~lLllilli~C~i~R~r   36 (103)
                      ++-.+.||  |+|-.-||-.+++..+|-.++.|+|
T Consensus        25 egYN~vgWKlfvGQ~YAiPif~i~aiitFyinkk~   59 (60)
T PF13209_consen   25 EGYNTVGWKLFVGQAYAIPIFIITAIITFYINKKK   59 (60)
T ss_pred             cCccccchhheecchhHhHHHHHHHHHhheecccc
Confidence            45667778  6777888888888888888888775


No 46 
>PF14584 DUF4446:  Protein of unknown function (DUF4446)
Probab=75.43  E-value=1.8  Score=32.02  Aligned_cols=22  Identities=32%  Similarity=0.594  Sum_probs=15.5

Q ss_pred             HHHHHHHHHHHHHHHHHHHhhe
Q psy17635         11 WFIGMLLAIAFLILVLILVCLI   32 (103)
Q Consensus        11 WfIg~~~ai~lLllilli~C~i   32 (103)
                      |++..++++++++||+++++.+
T Consensus         2 ~i~i~l~~l~iilli~~~~~~~   23 (151)
T PF14584_consen    2 YIIIGLLVLVIILLILIIILNI   23 (151)
T ss_pred             cHHHHHHHHHHHHHHHHHHHHH
Confidence            6777777777777777776555


No 47 
>PF00974 Rhabdo_glycop:  Rhabdovirus spike glycoprotein;  InterPro: IPR001903 Different families of ssRNA negative-strand viruses contain glycoproteins responsible for forming spikes on the surface of the virion. The glycoprotein spike is made up of a trimer of glycoproteins. These proteins are frequently abbreviated to G protein. Channel formed by glycoprotein spike is thought to function in a similar manner to Influenza virus M2 protein channel, thus allowing a signal to pass across the viral membrane to signal for viral uncoating [, ].; GO: 0019031 viral envelope; PDB: 2CMZ_C 2J6J_A 3EGD_D.
Probab=75.35  E-value=0.91  Score=39.05  Aligned_cols=22  Identities=32%  Similarity=0.772  Sum_probs=0.0

Q ss_pred             HHHHHHHHHHHHHHH-HHHHhhe
Q psy17635         11 WFIGMLLAIAFLILV-LILVCLI   32 (103)
Q Consensus        11 WfIg~~~ai~lLlli-lli~C~i   32 (103)
                      |+.....+|++++|+ +++.||.
T Consensus       454 ~~~~~~~~vi~~illi~l~~cc~  476 (501)
T PF00974_consen  454 WLSIIAIAVILLILLILLIRCCC  476 (501)
T ss_dssp             -----------------------
T ss_pred             HHHHHHHHHHHHHHHHHHHHHhh
Confidence            444444444444444 3344443


No 48 
>PF14914 LRRC37AB_C:  LRRC37A/B like protein 1 C-terminal domain
Probab=74.37  E-value=3.2  Score=31.32  Aligned_cols=23  Identities=22%  Similarity=0.527  Sum_probs=15.3

Q ss_pred             hHHHHHHHHHHHHHHHHHHHhhe
Q psy17635         10 GWFIGMLLAIAFLILVLILVCLI   32 (103)
Q Consensus        10 gWfIg~~~ai~lLllilli~C~i   32 (103)
                      -|+|..+++.+.+.++++|+|+|
T Consensus       119 nklilaisvtvv~~iliii~CLi  141 (154)
T PF14914_consen  119 NKLILAISVTVVVMILIIIFCLI  141 (154)
T ss_pred             chhHHHHHHHHHHHHHHHHHHHH
Confidence            47777777766666666666654


No 49 
>PF12606 RELT:  Tumour necrosis factor receptor superfamily member 19;  InterPro: IPR022248 The members of tumor necrosis factor receptor (TNFR) superfamily have been designated as the "guardians of the immune system" due to their roles in immune cell proliferation, differentiation, activation, and death (apoptosis).  RELT (receptor expressed in lymphoid tissues) is a member of the TNFR superfamily. The messenger RNA of RELT is especially abundant in hematologic tissues such as spleen, lymph node, and peripheral blood leukocytes as well as in leukemias and lymphomas. RELT is able to activate the NF-kappaB pathway and selectively binds tumor necrosis factor receptor-associated factor 1 []. RELT like proteins 1 and 2 (RELL1 and RELL2) are two RELT homologues that bind to RELT. The expression of RELL1 at the mRNA level is ubiquitous, whereas expression of RELL2 mRNA is more restricted to particular tissues [].
Probab=74.27  E-value=4.1  Score=25.29  Aligned_cols=35  Identities=26%  Similarity=0.273  Sum_probs=18.9

Q ss_pred             HHHHHHHHHHHHHHHHHHHhheeecCCCcccchhhh
Q psy17635         11 WFIGMLLAIAFLILVLILVCLIKRNRGGKYAVHERE   46 (103)
Q Consensus        11 WfIg~~~ai~lLllilli~C~i~R~rGgKY~VkeKE   46 (103)
                      ||+.+...+++.+|.+ .+|=+-+.+|=|..|...|
T Consensus         3 ~~~iV~i~iv~~lLg~-~I~~~~K~ygYkht~d~~e   37 (50)
T PF12606_consen    3 AFLIVSIFIVMGLLGL-SICTTLKAYGYKHTVDPLE   37 (50)
T ss_pred             ehHHHHHHHHHHHHHH-HHHHHhhccccccccCCCC
Confidence            5555555544444443 3455566667676665533


No 50 
>KOG3637|consensus
Probab=73.38  E-value=2.6  Score=39.49  Aligned_cols=29  Identities=31%  Similarity=0.552  Sum_probs=17.3

Q ss_pred             hhHHHHHHHHHHHHHHHHHHHh-----heeecCC
Q psy17635          9 AGWFIGMLLAIAFLILVLILVC-----LIKRNRG   37 (103)
Q Consensus         9 ~gWfIg~~~ai~lLllilli~C-----~i~R~rG   37 (103)
                      -.|+|.+=+...||||+||++|     |.||+|=
T Consensus       977 p~wiIi~svl~GLLlL~llv~~LwK~GFFKR~r~ 1010 (1030)
T KOG3637|consen  977 PLWIIILSVLGGLLLLALLVLLLWKCGFFKRNRK 1010 (1030)
T ss_pred             ceeeehHHHHHHHHHHHHHHHHHHhcCccccCCC
Confidence            4577766555555555555554     4477764


No 51 
>PHA02662 ORF131 putative membrane protein; Provisional
Probab=72.73  E-value=2.2  Score=33.94  Aligned_cols=31  Identities=23%  Similarity=0.433  Sum_probs=20.8

Q ss_pred             hHHHHHHHHHHHHHHHHHHHhheeecCCCccc
Q psy17635         10 GWFIGMLLAIAFLILVLILVCLIKRNRGGKYA   41 (103)
Q Consensus        10 gWfIg~~~ai~lLllilli~C~i~R~rGgKY~   41 (103)
                      -|||.+..+ +.+++.++.+|.++|.=+=||-
T Consensus       186 ~W~i~~~v~-~i~~i~vv~i~~irR~i~lkYr  216 (226)
T PHA02662        186 PWTLLLAVA-AVTVLGVVAVSLLRRALRIRFR  216 (226)
T ss_pred             cchhHHHHH-HHHHHHHHHHHHHHHHhheeee
Confidence            699876655 2333444457999998777773


No 52 
>PF11694 DUF3290:  Protein of unknown function (DUF3290);  InterPro: IPR021707  This family of proteins with unknown function appears to be restricted to Firmicutes. 
Probab=72.47  E-value=4.7  Score=29.74  Aligned_cols=33  Identities=18%  Similarity=0.461  Sum_probs=18.3

Q ss_pred             hhHHHHHHHHHHHHHHHHHHHhheeecCCCccc
Q psy17635          9 AGWFIGMLLAIAFLILVLILVCLIKRNRGGKYA   41 (103)
Q Consensus         9 ~gWfIg~~~ai~lLllilli~C~i~R~rGgKY~   41 (103)
                      .-|+-.++.+++++++++..+-..|.++.-||-
T Consensus        15 ~~~~~~~~i~~ll~~l~~~~~~Y~r~r~~tKyR   47 (149)
T PF11694_consen   15 NDYLRYILIIILLLVLIFFFIKYLRNRLDTKYR   47 (149)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHhcCcchhh
Confidence            344444444444455554445556677788884


No 53 
>PHA03240 envelope glycoprotein M; Provisional
Probab=72.30  E-value=4.4  Score=32.59  Aligned_cols=15  Identities=20%  Similarity=0.465  Sum_probs=9.3

Q ss_pred             hhhhhHHHHHHHHHH
Q psy17635          6 VATAGWFIGMLLAIA   20 (103)
Q Consensus         6 ~at~gWfIg~~~ai~   20 (103)
                      .+-..|+|.+++.|+
T Consensus       210 aaH~~WIiilIIiIi  224 (258)
T PHA03240        210 AAHIAWIFIAIIIII  224 (258)
T ss_pred             cchHhHHHHHHHHHH
Confidence            455678876655433


No 54 
>PRK02251 putative septation inhibitor protein; Reviewed
Probab=72.19  E-value=5  Score=27.55  Aligned_cols=33  Identities=18%  Similarity=0.617  Sum_probs=24.8

Q ss_pred             hhhhHHHHHHHHHHHHHHHHHHHhheeecCCCcccc
Q psy17635          7 ATAGWFIGMLLAIAFLILVLILVCLIKRNRGGKYAV   42 (103)
Q Consensus         7 at~gWfIg~~~ai~lLllilli~C~i~R~rGgKY~V   42 (103)
                      .+-.||..+|+++.++=|+=+++..+   -|+.||+
T Consensus        29 ~sP~W~~~~m~~lm~~Gl~WlvvyYl---~~~~~P~   61 (87)
T PRK02251         29 SNPRWFVPLFVALMIIGLIWLVVYYL---SNGSLPI   61 (87)
T ss_pred             CCCchHHHHHHHHHHHHHHHHHHHhh---hCCCcCc
Confidence            45689999999988888887777666   2566654


No 55 
>PTZ00046 rifin; Provisional
Probab=72.14  E-value=2.8  Score=35.22  Aligned_cols=28  Identities=29%  Similarity=0.579  Sum_probs=21.1

Q ss_pred             hhHHHHHHHHHHHHHHHHHHHhheeecC
Q psy17635          9 AGWFIGMLLAIAFLILVLILVCLIKRNR   36 (103)
Q Consensus         9 ~gWfIg~~~ai~lLllilli~C~i~R~r   36 (103)
                      +.=+|+-++||++++||.+|+.+|-|=|
T Consensus       314 ~taIiaSiiAIvVIVLIMvIIYLILRYR  341 (358)
T PTZ00046        314 QTAIIASIVAIVVIVLIMVIIYLILRYR  341 (358)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence            3346777788888888889999886644


No 56 
>PF11807 DUF3328:  Domain of unknown function (DUF3328);  InterPro: IPR021765  This family of proteins are functionally uncharacterised. This family is only found in eukaryotes. 
Probab=71.86  E-value=5.5  Score=28.29  Aligned_cols=25  Identities=32%  Similarity=0.438  Sum_probs=14.7

Q ss_pred             cchhhhhHHHHHHHHHHHHHHHHHH
Q psy17635          4 DTVATAGWFIGMLLAIAFLILVLIL   28 (103)
Q Consensus         4 ~~~at~gWfIg~~~ai~lLllilli   28 (103)
                      +.-.+..|++.++.+++++++.+++
T Consensus         3 p~~r~~~w~~~~~~~~~~l~~~l~~   27 (217)
T PF11807_consen    3 PRRRRRRWRRLLLLFLLLLILSLLL   27 (217)
T ss_pred             CCCCCccHHHHHHHHHHHHHHHHHH
Confidence            3455677888866665555444333


No 57 
>PF14575 EphA2_TM:  Ephrin type-A receptor 2 transmembrane domain; PDB: 3KUL_A 2XVD_A 2VX1_A 2VWV_A 2VX0_A 2VWY_A 2VWZ_A 2VWW_A 2VWU_A 2VWX_A ....
Probab=71.78  E-value=3.3  Score=27.10  Aligned_cols=24  Identities=25%  Similarity=0.506  Sum_probs=7.6

Q ss_pred             HHHHHHHHHHHHHHHHhheeecCC
Q psy17635         14 GMLLAIAFLILVLILVCLIKRNRG   37 (103)
Q Consensus        14 g~~~ai~lLllilli~C~i~R~rG   37 (103)
                      |++..++++++++++.--.+++++
T Consensus         9 g~~~ll~~v~~~~~~~rr~~~~~~   32 (75)
T PF14575_consen    9 GVLLLLVLVIIVIVCFRRCKYSKA   32 (75)
T ss_dssp             HHHHHHHHHHHHHCCCTT------
T ss_pred             HHHHHHHhheeEEEEEeeEcCCCC
Confidence            344444444444444333334444


No 58 
>PHA02955 hypothetical protein; Provisional
Probab=71.73  E-value=2  Score=33.78  Aligned_cols=28  Identities=36%  Similarity=0.629  Sum_probs=19.1

Q ss_pred             hHHHHHHHHHHHHHHHHHHHhheeecCCCcc
Q psy17635         10 GWFIGMLLAIAFLILVLILVCLIKRNRGGKY   40 (103)
Q Consensus        10 gWfIg~~~ai~lLllilli~C~i~R~rGgKY   40 (103)
                      -|||....++   +++++++|.+||.=+=||
T Consensus       180 ~w~ii~~v~i---i~~~v~l~yikR~i~~ky  207 (213)
T PHA02955        180 KWFIIYIVLC---LLILIILGYIYRTVRIKY  207 (213)
T ss_pred             cchhHHHHHH---HHHHHHHHHHHHHheeeE
Confidence            6888554443   455555899999877666


No 59 
>COG4890 Predicted outer membrane lipoprotein [Function unknown]
Probab=71.47  E-value=6.2  Score=23.19  Aligned_cols=21  Identities=38%  Similarity=0.749  Sum_probs=17.6

Q ss_pred             hhHHHHHHHHHHHHHHHHHHH
Q psy17635          9 AGWFIGMLLAIAFLILVLILV   29 (103)
Q Consensus         9 ~gWfIg~~~ai~lLllilli~   29 (103)
                      .+|+.|++.|-++-+|-.+-+
T Consensus         4 FaWiLG~lLAcAFgiinAlwl   24 (37)
T COG4890           4 FAWILGLLLACAFGIINALWL   24 (37)
T ss_pred             HHHHHHHHHHHHHHHHHHHHH
Confidence            469999999999998877654


No 60 
>PRK13275 mtrF tetrahydromethanopterin S-methyltransferase subunit F; Provisional
Probab=71.31  E-value=6.8  Score=25.76  Aligned_cols=25  Identities=28%  Similarity=0.527  Sum_probs=18.1

Q ss_pred             hhhhhHHHHHHHHHHHHHHHHHHHh
Q psy17635          6 VATAGWFIGMLLAIAFLILVLILVC   30 (103)
Q Consensus         6 ~at~gWfIg~~~ai~lLllilli~C   30 (103)
                      ....|..||++.|++|.++.+++..
T Consensus        40 ~~~~G~aiG~~~AlvLv~ip~~l~~   64 (67)
T PRK13275         40 TGIIGFAIGFLLALLLVVVPPLLYG   64 (67)
T ss_pred             hhHHHHHHHHHHHHHHHHHHHHHHH
Confidence            4567888999999887766655544


No 61 
>PF02480 Herpes_gE:  Alphaherpesvirus glycoprotein E;  InterPro: IPR003404 Glycoprotein E (gE) of Alphaherpesvirus forms a complex with glycoprotein I (gI), functioning as an immunoglobulin G (IgG) Fc binding protein. gE is involved in virus spread but is not essential for propagation [].; GO: 0016020 membrane; PDB: 2GJ7_F 2GIY_B.
Probab=71.01  E-value=1.3  Score=37.58  Aligned_cols=32  Identities=22%  Similarity=0.314  Sum_probs=0.0

Q ss_pred             hHHHHHHHHHHHHHHHHHH--HhheeecCCCccc
Q psy17635         10 GWFIGMLLAIAFLILVLIL--VCLIKRNRGGKYA   41 (103)
Q Consensus        10 gWfIg~~~ai~lLllilli--~C~i~R~rGgKY~   41 (103)
                      .++.+++++++++++++++  +|+.+|.|--+|.
T Consensus       353 ~~l~vVlgvavlivVv~viv~vc~~~rrrR~~~~  386 (439)
T PF02480_consen  353 ALLGVVLGVAVLIVVVGVIVWVCLRCRRRRRQRD  386 (439)
T ss_dssp             ----------------------------------
T ss_pred             chHHHHHHHHHHHHHHHHHhheeeeehhcccccc
Confidence            3444444544444444333  3444455444444


No 62 
>PRK14749 hypothetical protein; Provisional
Probab=70.81  E-value=7.1  Score=22.10  Aligned_cols=19  Identities=42%  Similarity=0.857  Sum_probs=15.0

Q ss_pred             hhHHHHHHHHHHHHHHHHH
Q psy17635          9 AGWFIGMLLAIAFLILVLI   27 (103)
Q Consensus         9 ~gWfIg~~~ai~lLllill   27 (103)
                      ..|+.|+..|..+-+|-++
T Consensus         4 faWiLG~~lAc~f~ilna~   22 (30)
T PRK14749          4 LLWFVGILLMCSLSTLVLV   22 (30)
T ss_pred             HHHHHHHHHHHHHHHHHHH
Confidence            3699999999888777654


No 63 
>PF06103 DUF948:  Bacterial protein of unknown function (DUF948);  InterPro: IPR009293 This family consists of bacterial sequences several of which are thought to be general stress proteins.
Probab=69.69  E-value=3.6  Score=26.89  Aligned_cols=21  Identities=43%  Similarity=0.868  Sum_probs=14.9

Q ss_pred             HHHHHHHHHHHHHHHHhheee
Q psy17635         14 GMLLAIAFLILVLILVCLIKR   34 (103)
Q Consensus        14 g~~~ai~lLllilli~C~i~R   34 (103)
                      +++.||+|++|+++++.++++
T Consensus         3 ~lI~Aiaf~vLvi~l~~~l~~   23 (90)
T PF06103_consen    3 GLIAAIAFAVLVIFLIKVLKK   23 (90)
T ss_pred             HHHHHHHHHHHHHHHHHHHHH
Confidence            566778888888777666554


No 64 
>PRK11901 hypothetical protein; Reviewed
Probab=69.50  E-value=4.1  Score=33.96  Aligned_cols=16  Identities=31%  Similarity=0.874  Sum_probs=8.2

Q ss_pred             HHHHHHHHHHHHHHHh
Q psy17635         15 MLLAIAFLILVLILVC   30 (103)
Q Consensus        15 ~~~ai~lLllilli~C   30 (103)
                      +|++|.+|+|+||||-
T Consensus        38 ~MiGiGilVLlLLIi~   53 (327)
T PRK11901         38 MMIGIGILVLLLLIIA   53 (327)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            4455555555555554


No 65 
>TIGR01477 RIFIN variant surface antigen, rifin family. This model represents the rifin branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of rifin sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 20 bits.
Probab=69.30  E-value=3.6  Score=34.58  Aligned_cols=27  Identities=30%  Similarity=0.593  Sum_probs=20.7

Q ss_pred             hHHHHHHHHHHHHHHHHHHHhheeecC
Q psy17635         10 GWFIGMLLAIAFLILVLILVCLIKRNR   36 (103)
Q Consensus        10 gWfIg~~~ai~lLllilli~C~i~R~r   36 (103)
                      .=+|+-++||++++||.+|+.+|-|=|
T Consensus       310 t~IiaSiIAIvvIVLIMvIIYLILRYR  336 (353)
T TIGR01477       310 TPIIASIIAILIIVLIMVIIYLILRYR  336 (353)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence            346777788888888889998886633


No 66 
>PRK11677 hypothetical protein; Provisional
Probab=69.19  E-value=6.2  Score=28.80  Aligned_cols=25  Identities=12%  Similarity=0.246  Sum_probs=17.1

Q ss_pred             HHHHHHHHHHHHHHHHHHHhheeec
Q psy17635         11 WFIGMLLAIAFLILVLILVCLIKRN   35 (103)
Q Consensus        11 WfIg~~~ai~lLllilli~C~i~R~   35 (103)
                      |++++++.|+-+++-+++.-+..++
T Consensus         3 W~~a~i~livG~iiG~~~~R~~~~~   27 (134)
T PRK11677          3 WEYALIGLVVGIIIGAVAMRFGNRK   27 (134)
T ss_pred             HHHHHHHHHHHHHHHHHHHhhccch
Confidence            9888877777776666666555443


No 67 
>TIGR01478 STEVOR variant surface antigen, stevor family. This model represents the stevor branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of stevor sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 8 bits.
Probab=69.10  E-value=4  Score=33.59  Aligned_cols=13  Identities=23%  Similarity=0.731  Sum_probs=7.3

Q ss_pred             HHHHHHhheeecC
Q psy17635         24 LVLILVCLIKRNR   36 (103)
Q Consensus        24 lilli~C~i~R~r   36 (103)
                      ||+|-+|+.||+|
T Consensus       275 liiLYiWlyrrRK  287 (295)
T TIGR01478       275 LIILYIWLYRRRK  287 (295)
T ss_pred             HHHHHHHHHHhhc
Confidence            3344457777665


No 68 
>PF00895 ATP-synt_8:  ATP synthase protein 8;  InterPro: IPR001421 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include:   F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP.   F-ATPases (also known as F1F0-ATPase, or H(+)-transporting two-sector ATPase) (3.6.3.14 from EC) are composed of two linked complexes: the F1 ATPase complex is the catalytic core and is composed of 5 subunits (alpha, beta, gamma, delta, epsilon), while the F0 ATPase complex is the membrane-embedded proton channel that is composed of at least 3 subunits (A-C), nine in mitochondria (A-G, F6, F8). Both the F1 and F0 complexes are rotary motors that are coupled back-to-back. In the F1 complex, the central gamma subunit forms the rotor inside the cylinder made of the alpha(3)beta(3) subunits, while in the F0 complex, the ring-shaped C subunits forms the rotor. The two rotors rotate in opposite directions, but the F0 rotor is usually stronger, using the force from the proton gradient to push the F1 rotor in reverse in order to drive ATP synthesis []. These ATPases can also work in reverse to hydrolyse ATP to create a proton gradient. This entry represents subunit 8 found in the F0 complex of mitochondrial F-ATPases from Metazoa. This subunit appears to be an integral component of the stator stalk in yeast mitochondrial F-ATPases []. The stator stalk is anchored in the membrane, and acts to prevent futile rotation of the ATPase subunits relative to the rotor during coupled ATP synthesis/hydrolysis. This subunit may have an analogous function in Metazoa. Subunit 8 differs in sequence between Metazoa, plants (IPR003319 from INTERPRO) and fungi (IPR009230 from INTERPRO). More information about this protein can be found at Protein of the Month: ATP Synthases [].; GO: 0015078 hydrogen ion transmembrane transporter activity, 0015986 ATP synthesis coupled proton transport, 0000276 mitochondrial proton-transporting ATP synthase complex, coupling factor F(o)
Probab=69.10  E-value=12  Score=21.82  Aligned_cols=30  Identities=27%  Similarity=0.620  Sum_probs=19.1

Q ss_pred             CCCcchhhhhHHHHHHHHHHHHHHHHHHHhhe
Q psy17635          1 MPSDTVATAGWFIGMLLAIAFLILVLILVCLI   32 (103)
Q Consensus         1 ~~~~~~at~gWfIg~~~ai~lLllilli~C~i   32 (103)
                      |||-  +..-||+-.+++...|++++.+..+.
T Consensus         1 MPQL--~P~~W~~~f~~~~~~l~~~~~~~~~~   30 (54)
T PF00895_consen    1 MPQL--NPMPWFFLFLFFWLILLIIMILIYFF   30 (54)
T ss_pred             CCCC--ChHHHHHHHHHHHHHHHHHHHHHHhc
Confidence            5553  34469999999876666555544433


No 69 
>PF12575 DUF3753:  Protein of unknown function (DUF3753);  InterPro: IPR009175 This group represents an uncharacterised conserved protein belonging to poxvirus family I2.
Probab=68.51  E-value=8  Score=25.79  Aligned_cols=18  Identities=39%  Similarity=0.748  Sum_probs=7.2

Q ss_pred             hHHHHHHHHHHHHHHHHH
Q psy17635         10 GWFIGMLLAIAFLILVLI   27 (103)
Q Consensus        10 gWfIg~~~ai~lLllill   27 (103)
                      +|.+.++..|+.++++++
T Consensus        47 ~~~~~ii~ii~v~ii~~l   64 (72)
T PF12575_consen   47 NWIILIISIIFVLIIVLL   64 (72)
T ss_pred             hHHHHHHHHHHHHHHHHH
Confidence            344444444443333333


No 70 
>PF11395 DUF2873:  Protein of unknown function (DUF2873);  InterPro: IPR021532 This entry is represented by the human SARS coronavirus, Orf7b; it is a family of uncharacterised viral proteins.
Probab=68.48  E-value=9.2  Score=22.97  Aligned_cols=19  Identities=16%  Similarity=0.501  Sum_probs=9.8

Q ss_pred             HHHHHHHHHHHHHHH-Hhhe
Q psy17635         14 GMLLAIAFLILVLIL-VCLI   32 (103)
Q Consensus        14 g~~~ai~lLllilli-~C~i   32 (103)
                      -++..+++|++|.+| .||+
T Consensus        12 c~l~~llflv~imliif~f~   31 (43)
T PF11395_consen   12 CFLSFLLFLVIIMLIIFWFS   31 (43)
T ss_pred             HHHHHHHHHHHHHHHHHHHH
Confidence            344445555555444 4664


No 71 
>PTZ00370 STEVOR; Provisional
Probab=68.38  E-value=4.3  Score=33.45  Aligned_cols=14  Identities=21%  Similarity=0.688  Sum_probs=7.6

Q ss_pred             HHHHHHhheeecCC
Q psy17635         24 LVLILVCLIKRNRG   37 (103)
Q Consensus        24 lilli~C~i~R~rG   37 (103)
                      ||+|-+|+.||+|.
T Consensus       271 liilYiwlyrrRK~  284 (296)
T PTZ00370        271 LIILYIWLYRRRKN  284 (296)
T ss_pred             HHHHHHHHHHhhcc
Confidence            33444577776653


No 72 
>PF11669 WBP-1:  WW domain-binding protein 1;  InterPro: IPR021684  This family of proteins represents WBP-1, a ligand of the WW domain of Yes-associated protein. This protein has a proline-rich domain. WBP-1 does not bind to the SH3 domain []. 
Probab=67.69  E-value=4.9  Score=27.76  Aligned_cols=11  Identities=9%  Similarity=0.567  Sum_probs=5.2

Q ss_pred             hhHHHHHHHHH
Q psy17635          9 AGWFIGMLLAI   19 (103)
Q Consensus         9 ~gWfIg~~~ai   19 (103)
                      +.|++.++.+|
T Consensus        20 ~~w~FWlv~~l   30 (102)
T PF11669_consen   20 ELWYFWLVWVL   30 (102)
T ss_pred             HHHHHHHHHHH
Confidence            44655554333


No 73 
>PF05478 Prominin:  Prominin;  InterPro: IPR008795 The prominins are an emerging family of proteins that, among the multispan membrane proteins, display a novel topology. Mouse and Homo sapiens prominin and (Mus musculus) prominin-like 1 (PROML1) are predicted to contain five membrane spanning domains, with an N-terminal domain exposed to the extracellular space followed by four, alternating small cytoplasmic and large extracellular, loops and a cytoplasmic C-terminal domain []. The exact function of prominin is unknown although in humans defects in PROM1, the gene coding for prominin, cause retinal degeneration [].; GO: 0016021 integral to membrane
Probab=67.50  E-value=6  Score=35.68  Aligned_cols=12  Identities=33%  Similarity=0.625  Sum_probs=6.5

Q ss_pred             Hhhe--eecCCCcc
Q psy17635         29 VCLI--KRNRGGKY   40 (103)
Q Consensus        29 ~C~i--~R~rGgKY   40 (103)
                      .|||  +++=||+.
T Consensus       114 fCcCRCc~~CGg~~  127 (806)
T PF05478_consen  114 FCCCRCCGNCGGRM  127 (806)
T ss_pred             HhccccCCCcCCcc
Confidence            4555  24557775


No 74 
>PRK04778 septation ring formation regulator EzrA; Provisional
Probab=66.89  E-value=2.9  Score=36.12  Aligned_cols=23  Identities=35%  Similarity=0.582  Sum_probs=11.8

Q ss_pred             HHHHHHHHHHHHHHHHHHhheee
Q psy17635         12 FIGMLLAIAFLILVLILVCLIKR   34 (103)
Q Consensus        12 fIg~~~ai~lLllilli~C~i~R   34 (103)
                      ++.++++|+++++|++++.++.|
T Consensus         2 ~~~~ii~i~ii~i~~~~~~~~~r   24 (569)
T PRK04778          2 MIYLIIAIVVIIIIAYLAGLILR   24 (569)
T ss_pred             hhhHHHHHHHHHHHHHHHHHHHH
Confidence            34455555555555554445544


No 75 
>PF02480 Herpes_gE:  Alphaherpesvirus glycoprotein E;  InterPro: IPR003404 Glycoprotein E (gE) of Alphaherpesvirus forms a complex with glycoprotein I (gI), functioning as an immunoglobulin G (IgG) Fc binding protein. gE is involved in virus spread but is not essential for propagation [].; GO: 0016020 membrane; PDB: 2GJ7_F 2GIY_B.
Probab=66.29  E-value=1.9  Score=36.64  Aligned_cols=30  Identities=23%  Similarity=0.403  Sum_probs=0.0

Q ss_pred             hhhhHHHHHHHH--HHHHHHHHHHHhheeecC
Q psy17635          7 ATAGWFIGMLLA--IAFLILVLILVCLIKRNR   36 (103)
Q Consensus         7 at~gWfIg~~~a--i~lLllilli~C~i~R~r   36 (103)
                      .-.+.++|+..+  |+++++++++.|..+|.+
T Consensus       353 ~~l~vVlgvavlivVv~viv~vc~~~rrrR~~  384 (439)
T PF02480_consen  353 ALLGVVLGVAVLIVVVGVIVWVCLRCRRRRRQ  384 (439)
T ss_dssp             --------------------------------
T ss_pred             chHHHHHHHHHHHHHHHHHhheeeeehhcccc
Confidence            344455444444  455556688888887543


No 76 
>PF13623 SurA_N_2:  SurA N-terminal domain
Probab=65.87  E-value=6.2  Score=28.71  Aligned_cols=22  Identities=9%  Similarity=0.422  Sum_probs=17.2

Q ss_pred             chhhhhHHHHHHHHHHHHHHHH
Q psy17635          5 TVATAGWFIGMLLAIAFLILVL   26 (103)
Q Consensus         5 ~~at~gWfIg~~~ai~lLllil   26 (103)
                      .++..+|++.++++++|+.+|+
T Consensus         3 kIR~r~~lLi~vIglAL~aFIv   24 (145)
T PF13623_consen    3 KIRQRGGLLIIVIGLALFAFIV   24 (145)
T ss_pred             hHhhcchHHHHHHHHHHHHHHH
Confidence            3567799988888888887777


No 77 
>PRK14762 membrane protein; Provisional
Probab=65.59  E-value=9.1  Score=21.04  Aligned_cols=17  Identities=18%  Similarity=0.528  Sum_probs=13.3

Q ss_pred             hhHHHHHHHHHHHHHHH
Q psy17635          9 AGWFIGMLLAIAFLILV   25 (103)
Q Consensus         9 ~gWfIg~~~ai~lLlli   25 (103)
                      ..|.|+++-.|.||++.
T Consensus         4 ~lw~i~iifligllvvt   20 (27)
T PRK14762          4 ILWAVLIIFLIGLLVVT   20 (27)
T ss_pred             HHHHHHHHHHHHHHHHH
Confidence            47999888888877765


No 78 
>PF06143 Baculo_11_kDa:  Baculovirus 11 kDa family;  InterPro: IPR009313 This is a family of uncharacterised Baculovirus proteins that are all about 11 kDa in size.
Probab=64.44  E-value=8  Score=26.43  Aligned_cols=14  Identities=7%  Similarity=0.477  Sum_probs=8.7

Q ss_pred             HHHHHHHHHHHHHH
Q psy17635         12 FIGMLLAIAFLILV   25 (103)
Q Consensus        12 fIg~~~ai~lLlli   25 (103)
                      ||-++|++++++++
T Consensus        36 FvLVic~~lVfVii   49 (84)
T PF06143_consen   36 FVLVICCFLVFVII   49 (84)
T ss_pred             HHHHHHHHHHHHHH
Confidence            67778875555444


No 79 
>PF09801 SYS1:  Integral membrane protein S linking to the trans Golgi network;  InterPro: IPR019185  Members of this family are integral membrane proteins involved in protein trafficking between the late Golgi and endosome. They may also serve as a receptor for ADP-ribosylation factor-related protein 1 (ARFRP1) []. Sys1p is a small integral membrane protein with four predicted transmembrane domains that localises to the Trans Golgi network TGN in yeast and human cells []. 
Probab=64.37  E-value=7.7  Score=28.32  Aligned_cols=31  Identities=23%  Similarity=0.558  Sum_probs=24.4

Q ss_pred             hhhhhHHHHHHHHHHHHHHHHHHHhheeecC
Q psy17635          6 VATAGWFIGMLLAIAFLILVLILVCLIKRNR   36 (103)
Q Consensus         6 ~at~gWfIg~~~ai~lLllilli~C~i~R~r   36 (103)
                      ..++||.++++-.+.-++-.+.+.++|+|.|
T Consensus        55 ~~~~g~~~~~~~ll~sl~~~~~L~~iV~R~K   85 (144)
T PF09801_consen   55 DTTSGWLVILAWLLNSLVCALFLAFIVQRSK   85 (144)
T ss_pred             CCcchHHHHHHHHHHHHHHHHHheeeEeccc
Confidence            3467999998887777777777788888876


No 80 
>PF02932 Neur_chan_memb:  Neurotransmitter-gated ion-channel transmembrane region ion channel family signature gamma-aminobutyric acid (GABA) receptor signature nicotinic acetylcholine receptor signature;  InterPro: IPR006029 Neurotransmitter ligand-gated ion channels are transmembrane receptor-ion channel complexes that open transiently upon binding of specific ligands, allowing rapid transmission of signals at chemical synapses [, ]. Five of these ion channel receptor families have been shown to form a sequence-related superfamily:   Nicotinic acetylcholine receptor (AchR), an excitatory cation channel in vertebrates and invertebrates; in vertebrate motor endplates it is composed of alpha, beta, gamma and delta/epsilon subunits; in neurons it is composed of alpha and non-alpha (or beta) subunits []. Glycine receptor, an inhibitory chloride ion channel composed of alpha and beta subunits []. Gamma-aminobutyric acid (GABA) receptor, an inhibitory chloride ion channel; at least four types of subunits (alpha, beta, gamma and delta) are known []. Serotonin 5HT3 receptor, of which there are seven major types (5HT3-5HT7) []. Glutamate receptor, an excitatory cation channel of which at least three types have been described (kainate, N-methyl-D-aspartate (NMDA) and quisqualate) [].   These receptors possess a pentameric structure (made up of varying subunits), surrounding a central pore. All known sequences of subunits from neurotransmitter-gated ion-channels are structurally related. They are composed of a large extracellular glycosylated N-terminal ligand-binding domain, followed by three hydrophobic transmembrane regions which form the ionic channel, followed by an intracellular region of variable length. A fourth hydrophobic region is found at the C-terminal of the sequence [, ]. This domain represents four transmembrane helices of a variety of neurotransmitter-gated ion-channels.; GO: 0006811 ion transport, 0016020 membrane; PDB: 1DXZ_A 3MRA_A 1EQ8_C 1OED_C 2PR9_P 1A11_A 1CEK_A 2BG9_E 2KSR_A 2K59_B ....
Probab=64.26  E-value=5.6  Score=26.64  Aligned_cols=33  Identities=18%  Similarity=0.282  Sum_probs=19.7

Q ss_pred             hhhhhHHHHHHHHHHHHHHHHHHHhheeecCCC
Q psy17635          6 VATAGWFIGMLLAIAFLILVLILVCLIKRNRGG   38 (103)
Q Consensus         6 ~at~gWfIg~~~ai~lLllilli~C~i~R~rGg   38 (103)
                      .+-.+|+++.++++..+++.+.++++..+.|+.
T Consensus        55 ~pl~~~~~~~~~~v~~~~~~~avv~~~~~~~~~   87 (237)
T PF02932_consen   55 KPLDGWYFICTMFVFSASLEFAVVVYNIHHRGP   87 (237)
T ss_dssp             SHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHST
T ss_pred             cchhhhhccccchhhhHHHHhhhhhhhhccccc
Confidence            466789988877766655555444444433333


No 81 
>PF05297 Herpes_LMP1:  Herpesvirus latent membrane protein 1 (LMP1);  InterPro: IPR007961 This family consists of several latent membrane protein 1 or LMP1s mostly from Epstein-Barr virus (strain GD1) (HHV-4) (Human herpesvirus 4). LMP1 of HHV-4 is a 62-65 kDa plasma membrane protein possessing six membrane spanning regions, a short cytoplasmic N terminus and a long cytoplasmic carboxy tail of 200 amino acids. HHV-4 virus latent membrane protein 1 (LMP1) is essential for HHV-4 mediated transformation and has been associated with several cases of malignancies. HHV-4-like viruses in Macaca fascicularis (Cynomolgus monkeys) have been associated with high lymphoma rates in immunosuppressed monkeys [].; GO: 0019087 transformation of host cell by virus, 0016021 integral to membrane; PDB: 1CZY_E 1ZMS_B.
Probab=63.81  E-value=2.3  Score=35.68  Aligned_cols=11  Identities=27%  Similarity=0.552  Sum_probs=0.0

Q ss_pred             hhhhHHHHHHH
Q psy17635          7 ATAGWFIGMLL   17 (103)
Q Consensus         7 at~gWfIg~~~   17 (103)
                      |-.-||..+|+
T Consensus        35 ail~w~~iims   45 (381)
T PF05297_consen   35 AILVWFFIIMS   45 (381)
T ss_dssp             -----------
T ss_pred             HHHHHHHHHHh
Confidence            44567776664


No 82 
>PF08113 CoxIIa:  Cytochrome c oxidase subunit IIa family;  InterPro: IPR012538 This family consists of the cytochrome c oxidase subunit IIa family. The bax-type cytochrome c oxidase from Thermus thermophilus is known as a two subunit enzyme. From its crystal structure, it was discovered that an additional transmembrane helix, subunit IIa, spans the membrane. This subunit consists of 34 residues forming one helix across the membrane. The presence of this subunit seems to be important for the function of cytochrome c oxidases [].; PDB: 2QPD_C 3QJR_C 3EH5_C 3BVD_C 3S39_C 3QJU_C 3QJS_C 4EV3_C 3QJT_C 4FA7_C ....
Probab=63.39  E-value=12  Score=21.69  Aligned_cols=19  Identities=16%  Similarity=0.312  Sum_probs=12.8

Q ss_pred             HHHHHHHHHHHHHHHHHhh
Q psy17635         13 IGMLLAIAFLILVLILVCL   31 (103)
Q Consensus        13 Ig~~~ai~lLllilli~C~   31 (103)
                      +|.|..|++|.+++|+.+|
T Consensus         6 ~Gal~vv~iLt~~ILvFWf   24 (34)
T PF08113_consen    6 KGALGVVMILTAFILVFWF   24 (34)
T ss_dssp             HHHHHHHHHHHHHHHHHHH
T ss_pred             cceeeeHHHHHHHHHHHHH
Confidence            4566667777777777664


No 83 
>PF06365 CD34_antigen:  CD34/Podocalyxin family;  InterPro: IPR013836 This family consists of several mammalian CD34 antigen proteins. The CD34 antigen is a human leukocyte membrane protein expressed specifically by lymphohematopoietic progenitor cells. CD34 is a phosphoprotein. Activation of protein kinase C (PKC) has been found to enhance CD34 phosphorylation [, ]. This family contains several eukaryotic podocalyxin proteins. Podocalyxin is a major membrane protein of the glomerular epithelium and is thought to be involved in maintenance of the architecture of the foot processes and filtration slits characteristic of this unique epithelium by virtue of its high negative charge. Podocalyxin functions as an anti-adhesin that maintains an open filtration pathway between neighbouring foot processes in the glomerular epithelium by charge repulsion [].
Probab=62.88  E-value=3  Score=32.45  Aligned_cols=37  Identities=19%  Similarity=0.160  Sum_probs=16.1

Q ss_pred             HHHHHHHHH--HHHHHHHHHH-hheeecCCCcccchhhhh
Q psy17635         11 WFIGMLLAI--AFLILVLILV-CLIKRNRGGKYAVHEREA   47 (103)
Q Consensus        11 WfIg~~~ai--~lLllilli~-C~i~R~rGgKY~VkeKE~   47 (103)
                      =||++++..  +||+++++.+ ||.+|+.-.+=..+--|+
T Consensus       101 ~lI~lv~~g~~lLla~~~~~~Y~~~~Rrs~~~~~~rl~Ee  140 (202)
T PF06365_consen  101 TLIALVTSGSFLLLAILLGAGYCCHQRRSWSKKGQRLGEE  140 (202)
T ss_pred             EEEehHHhhHHHHHHHHHHHHHHhhhhccCCcchhhhccC
Confidence            456666554  3333333333 444454434433444443


No 84 
>PTZ00382 Variant-specific surface protein (VSP); Provisional
Probab=62.19  E-value=7.1  Score=26.68  Aligned_cols=11  Identities=9%  Similarity=0.298  Sum_probs=6.6

Q ss_pred             HHHHhheeecC
Q psy17635         26 LILVCLIKRNR   36 (103)
Q Consensus        26 lli~C~i~R~r   36 (103)
                      +|..+|+.|.|
T Consensus        85 ~l~w~f~~r~k   95 (96)
T PTZ00382         85 FLCWWFVCRGK   95 (96)
T ss_pred             HHhheeEEeec
Confidence            34456677765


No 85 
>PF14914 LRRC37AB_C:  LRRC37A/B like protein 1 C-terminal domain
Probab=62.04  E-value=10  Score=28.63  Aligned_cols=27  Identities=19%  Similarity=0.309  Sum_probs=15.5

Q ss_pred             HHHHHHHHHHHHHHHHHH-HhheeecCC
Q psy17635         11 WFIGMLLAIAFLILVLIL-VCLIKRNRG   37 (103)
Q Consensus        11 WfIg~~~ai~lLllilli-~C~i~R~rG   37 (103)
                      --|.++.++++|++|+|+ -++..|+..
T Consensus       123 laisvtvv~~iliii~CLiei~shr~a~  150 (154)
T PF14914_consen  123 LAISVTVVVMILIIIFCLIEICSHRRAS  150 (154)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHhccccc
Confidence            346777776666666544 444444433


No 86 
>PF00517 GP41:  Retroviral envelope protein;  InterPro: IPR000328 This entry represents envelope proteins from a variety of retroviruses. It includes the GP41 subunit of the envelope protein complex from Human immunodeficiency virus (HIV) and Simian-Human immunodeficiency virus (SIV), which mediate membrane fusion during viral entry []. It has a core composed of a six-helix bundle and is folded by its trimeric N- and C-terminal heptad-repeats (NHR and CHR) []. Derivatives of this protein prevent HIV-1 from entering cell lines and primary human CD4+ cells in vitro [], making it an attractive subject of gene therapy studies against HIV and related retroviruses. The entry also represents envelop proteins from Bovine immunodeficiency virus, Feline immunodeficiency virus and Equine infectious anemia virus (EIAV) [, ], as well as the Gp36 protein from Mouse mammary tumor virus (MMTV) and Human endogenous retrovirus (HERV).; GO: 0005198 structural molecule activity, 0019031 viral envelope; PDB: 2EZO_B 2EZQ_B 2EZR_A 2JNR_B 1F23_D 2EZP_A 1JEK_A 2Q7C_A 2Q5U_A 2Q3I_A ....
Probab=61.91  E-value=2.9  Score=31.94  Aligned_cols=18  Identities=17%  Similarity=0.394  Sum_probs=0.0

Q ss_pred             HHHHHhheeecCCCcccc
Q psy17635         25 VLILVCLIKRNRGGKYAV   42 (103)
Q Consensus        25 illi~C~i~R~rGgKY~V   42 (103)
                      +.+++++|+|-|-|-=|+
T Consensus       173 ~~~i~~~~~~~~qgy~~l  190 (204)
T PF00517_consen  173 LMCILRICRRVRQGYSPL  190 (204)
T ss_dssp             ------------------
T ss_pred             HHHHHHHHHHHhCCCCCc
Confidence            344445556656665455


No 87 
>PRK11486 flagellar biosynthesis protein FliO; Provisional
Probab=61.86  E-value=7.5  Score=28.14  Aligned_cols=31  Identities=19%  Similarity=0.180  Sum_probs=17.8

Q ss_pred             chhhhhHHHHHHHHHHH-HHHHHHHHhheeec
Q psy17635          5 TVATAGWFIGMLLAIAF-LILVLILVCLIKRN   35 (103)
Q Consensus         5 ~~at~gWfIg~~~ai~l-Lllilli~C~i~R~   35 (103)
                      ...+.+=++-++.++++ +.+|+++.|++||-
T Consensus        11 ~~~~~~~l~qv~~~L~lVl~lI~~~aWLlkR~   42 (124)
T PRK11486         11 APVSGSPLLQVSGALIGIIALILAAAWLVKRL   42 (124)
T ss_pred             CCCCcchHHHHHHHHHHHHHHHHHHHHHHHHc
Confidence            34444455556666443 33556667888883


No 88 
>PRK10788 periplasmic folding chaperone; Provisional
Probab=61.81  E-value=7.9  Score=33.36  Aligned_cols=35  Identities=23%  Similarity=0.252  Sum_probs=22.5

Q ss_pred             hhhhHHHHHHHHHHHHHHHHHHHhheeecCCCccc
Q psy17635          7 ATAGWFIGMLLAIAFLILVLILVCLIKRNRGGKYA   41 (103)
Q Consensus         7 at~gWfIg~~~ai~lLllilli~C~i~R~rGgKY~   41 (103)
                      .+++|++.++.++++|.|++.-+.-+....|+++.
T Consensus         8 ~~~~~~~~~i~~li~l~F~~~G~~~~~~~~~~~~v   42 (623)
T PRK10788          8 AANSVVLKIILALIILSFILTGVGGYLIGGSNNYA   42 (623)
T ss_pred             hccChHHHHHHHHHHHHHHHHcchhhccCCCCCee
Confidence            36799999888877777776555433333355553


No 89 
>PHA03286 envelope glycoprotein E; Provisional
Probab=61.00  E-value=6.1  Score=34.57  Aligned_cols=18  Identities=28%  Similarity=0.582  Sum_probs=11.9

Q ss_pred             HHHHHHHHHHHhheeecC
Q psy17635         19 IAFLILVLILVCLIKRNR   36 (103)
Q Consensus        19 i~lLllilli~C~i~R~r   36 (103)
                      ++++++-+||+|++||+|
T Consensus       402 ~~~~~~~~~~~~~~~r~~  419 (492)
T PHA03286        402 LVVLLFALCIAGLYRRRR  419 (492)
T ss_pred             HHHHHHHHHhHhHhhhhh
Confidence            444455588888887654


No 90 
>TIGR02866 CoxB cytochrome c oxidase, subunit II. Cytochrome c oxidase is the terminal electron acceptor of mitochondria (and one of several possible acceptors in prokaryotes) in the electron transport chain of aerobic respiration. The enzyme couples the oxidation of reduced cytochrome c with the reduction of molecular oxygen to water. This process results in the pumping of four protons across the membrane which are used in the proton gradient powered synthesis of ATP. The oxidase contains two heme a cofactors and three copper atoms as well as other bound ions.
Probab=60.88  E-value=11  Score=28.17  Aligned_cols=18  Identities=28%  Similarity=0.420  Sum_probs=8.1

Q ss_pred             HHHHHHHHHHHhheeecC
Q psy17635         19 IAFLILVLILVCLIKRNR   36 (103)
Q Consensus        19 i~lLllilli~C~i~R~r   36 (103)
                      |.+++..+++.+++|.++
T Consensus        23 I~v~V~~~l~~~~~k~r~   40 (201)
T TIGR02866        23 ISLLVAALLAYVVWKFRR   40 (201)
T ss_pred             HHHHHHHHHHHhhhhhhc
Confidence            333444445455554433


No 91 
>PRK10927 essential cell division protein FtsN; Provisional
Probab=60.86  E-value=4.4  Score=33.66  Aligned_cols=56  Identities=18%  Similarity=0.332  Sum_probs=26.4

Q ss_pred             HHHHHHHHHHHHHH--HhheeecCCCcccchhhhhhcCCC-CCCCCCCCcccCCCCCCCC
Q psy17635         15 MLLAIAFLILVLIL--VCLIKRNRGGKYAVHEREAAHGRH-DYPEEPGFNEYSQPLDGKP   71 (103)
Q Consensus        15 ~~~ai~lLllilli--~C~i~R~rGgKY~VkeKE~~~g~~-~~~dd~~F~EYsd~dek~~   71 (103)
                      .|+||+++||+++|  ++||..||.+-=.+-..-...++- |-|.|+-+ .|-.+.|++-
T Consensus        35 ~m~alAvavlv~fiGGLyFith~k~~e~~~~~~~~~~~ngLPPKPEErW-rYIKELEnRq   93 (319)
T PRK10927         35 AMVAIAAAVLVTFIGGLYFITHHKKEESETLQSQKVTGNGLPPKPEERW-RYIKELESRQ   93 (319)
T ss_pred             HHHHHHHHHHHHHhhheEEEecCCCCCCcccccCCCCCCCCCCCcchhh-HHHHHHhccC
Confidence            34555555555555  688888875422211100111221 44555555 5655555543


No 92 
>PRK14758 hypothetical protein; Provisional
Probab=60.66  E-value=6.2  Score=21.79  Aligned_cols=8  Identities=38%  Similarity=1.364  Sum_probs=3.8

Q ss_pred             HHHHHHHh
Q psy17635         23 ILVLILVC   30 (103)
Q Consensus        23 llilli~C   30 (103)
                      +|+++|+|
T Consensus        10 iLivlIlC   17 (27)
T PRK14758         10 ILIILILC   17 (27)
T ss_pred             HHHHHHHH
Confidence            34444555


No 93 
>PF04277 OAD_gamma:  Oxaloacetate decarboxylase, gamma chain ;  InterPro: IPR005899  This family comprises distantly related, low complexity, hydrophobic small subunits of several related sodium ion-pumping decarboxylases. These include oxaloacetate decarboxylase gamma subunit and methylmalonyl-CoA decarboxylase delta subunit [].; GO: 0008948 oxaloacetate decarboxylase activity, 0015081 sodium ion transmembrane transporter activity, 0071436 sodium ion export, 0016020 membrane
Probab=60.38  E-value=15  Score=23.30  Aligned_cols=18  Identities=39%  Similarity=0.669  Sum_probs=7.3

Q ss_pred             hHHHHHHHH-HHHHHHHHH
Q psy17635         10 GWFIGMLLA-IAFLILVLI   27 (103)
Q Consensus        10 gWfIg~~~a-i~lLllill   27 (103)
                      ||-+-+++. ++|+.|++|
T Consensus         4 gl~i~i~Gm~iVF~~L~lL   22 (79)
T PF04277_consen    4 GLQIMIIGMGIVFLVLILL   22 (79)
T ss_pred             HHHHHHHHHHHHHHHHHHH
Confidence            444444333 344433333


No 94 
>PF01788 PsbJ:  PsbJ;  InterPro: IPR002682 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection [].  This family represents the low molecular weight transmembrane protein PsbJ found in PSII. PsbJ is one of the most hydrophobic proteins in the thylakoid membrane, and is located in a gene cluster with PsbE, PsbF and PsbL (PsbEFJL). Both PsbJ and PsbL (IPR003372 from INTERPRO) are essential for proper assembly of the OEC. Mutations in PsbJ cause the light-harvesting antenna to remain detached from the PSII dimers []. In addition, both PsbJ and PsbL are involved in the unidirectional flow of electrons, where PsbJ regulates the forward electron flow from D2 (Qa) to the plastoquinone pool, and PsbL prevents the reduction of PSII by back electron flow from plastoquinol protecting PSII from photo-inactivation [].; GO: 0015979 photosynthesis, 0009523 photosystem II, 0009539 photosystem II reaction center, 0016020 membrane; PDB: 3A0H_J 3ARC_J 3A0B_J 3KZI_J 2AXT_J 3PRQ_J 4FBY_b 3BZ2_J 1S5L_j 3PRR_J ....
Probab=60.31  E-value=17  Score=21.80  Aligned_cols=21  Identities=29%  Similarity=0.545  Sum_probs=14.5

Q ss_pred             hhhHHHHHHHHHHHHHHHHHH
Q psy17635          8 TAGWFIGMLLAIAFLILVLIL   28 (103)
Q Consensus         8 t~gWfIg~~~ai~lLllilli   28 (103)
                      ---|+|+.+..++.+-++-+.
T Consensus         8 IPLWlVgtv~G~~vi~lvglF   28 (40)
T PF01788_consen    8 IPLWLVGTVAGIAVIGLVGLF   28 (40)
T ss_dssp             S-HHHHHHHHHHHHHHHHHHH
T ss_pred             ccchHHHHHHHHHHHHHHHHh
Confidence            346999999987766655443


No 95 
>COG3190 FliO Flagellar biogenesis protein [Cell motility and secretion]
Probab=59.90  E-value=9.8  Score=28.12  Aligned_cols=28  Identities=32%  Similarity=0.284  Sum_probs=14.5

Q ss_pred             hhhhHHHHHHHHHHHHHHHHHHHhheee
Q psy17635          7 ATAGWFIGMLLAIAFLILVLILVCLIKR   34 (103)
Q Consensus         7 at~gWfIg~~~ai~lLllilli~C~i~R   34 (103)
                      +..-|+--+...|++|.||++.+|++||
T Consensus        20 ~~~~~~~~~gsL~~iL~lil~~~wl~kr   47 (137)
T COG3190          20 AALELAQMFGSLILILALILFLAWLVKR   47 (137)
T ss_pred             chhHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3333443333445555566666666664


No 96 
>PF02009 Rifin_STEVOR:  Rifin/stevor family;  InterPro: IPR002858 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see []. Several multicopy gene families have been described in Plasmodium falciparum, including the stevor family of subtelomeric open reading frames and the rif interspersed repetitive elements. Both families contain three predicted transmembrane segments. It has been proposed that stevor and rif are members of a larger superfamily that code for variant surface antigens [].
Probab=59.83  E-value=6.2  Score=32.18  Aligned_cols=24  Identities=21%  Similarity=0.512  Sum_probs=13.7

Q ss_pred             HHHHHHHHHHHHHHHHHhheeecC
Q psy17635         13 IGMLLAIAFLILVLILVCLIKRNR   36 (103)
Q Consensus        13 Ig~~~ai~lLllilli~C~i~R~r   36 (103)
                      |.+++.|||.+||++++.-.||.|
T Consensus       262 iiaIliIVLIMvIIYLILRYRRKK  285 (299)
T PF02009_consen  262 IIAILIIVLIMVIIYLILRYRRKK  285 (299)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHh
Confidence            444455555555555556677743


No 97 
>PF12669 P12:  Virus attachment protein p12 family
Probab=59.38  E-value=5.4  Score=25.04  Aligned_cols=11  Identities=27%  Similarity=0.483  Sum_probs=6.3

Q ss_pred             HHhheeecCCC
Q psy17635         28 LVCLIKRNRGG   38 (103)
Q Consensus        28 i~C~i~R~rGg   38 (103)
                      +.-++|+.|.|
T Consensus        17 ~r~~~k~~K~G   27 (58)
T PF12669_consen   17 IRKFIKDKKKG   27 (58)
T ss_pred             HHHHHHHhhcC
Confidence            35666665555


No 98 
>PF09472 MtrF:  Tetrahydromethanopterin S-methyltransferase, F subunit (MtrF);  InterPro: IPR013347  Many archaea have evolved energy-yielding pathways marked by one-carbon biochemistry featuring novel cofactors and enzymes. This domain is mostly found in MtrF, where it covers the entire length of the protein. This polypeptide is one of eight subunits of the N5-methyltetrahydromethanopterin: coenzyme M methyltransferase complex found in methanogenic archaea. This is a membrane-associated enzyme complex that uses methyl-transfer reactions to drive a sodium-ion pump []. MtrF itself is involved in the transfer of the methyl group from N5-methyltetrahydromethanopterin to coenzyme M. Subsequently, methane is produced by two-electron reduction of the methyl moiety in methyl-coenzyme M by another enzyme, methyl-coenzyme M reductase. In some organisms this domain is found at the C-terminal region of what appears to be a fusion of the MtrA and MtrF proteins [, ]. The function of these proteins is unknown, though it is likely that they are involved in C1 metabolism.; GO: 0030269 tetrahydromethanopterin S-methyltransferase activity, 0015948 methanogenesis, 0016020 membrane
Probab=59.15  E-value=15  Score=23.87  Aligned_cols=23  Identities=26%  Similarity=0.615  Sum_probs=15.6

Q ss_pred             hhhhHHHHHHHHHHHHHHHHHHH
Q psy17635          7 ATAGWFIGMLLAIAFLILVLILV   29 (103)
Q Consensus         7 at~gWfIg~~~ai~lLllilli~   29 (103)
                      ...|..||++.|++|.++.+++.
T Consensus        41 ~~~GfaiG~~~AlvLv~ip~~l~   63 (64)
T PF09472_consen   41 GIKGFAIGFLFALVLVGIPILLM   63 (64)
T ss_pred             hhHHHHHHHHHHHHHHHHHHHHh
Confidence            44577788888877776665543


No 99 
>MTH00133 ATP8 ATP synthase F0 subunit 8; Provisional
Probab=57.69  E-value=16  Score=22.31  Aligned_cols=24  Identities=21%  Similarity=0.539  Sum_probs=15.5

Q ss_pred             CCCcchhhhhHHHHHHHHHHHHHHHH
Q psy17635          1 MPSDTVATAGWFIGMLLAIAFLILVL   26 (103)
Q Consensus         1 ~~~~~~at~gWfIg~~~ai~lLllil   26 (103)
                      |||-  ...-||..++++-+.|++++
T Consensus         1 MPQL--np~PWf~i~l~tW~~l~~~~   24 (55)
T MTH00133          1 MPQL--NPAPWFAILVFSWLVFLTII   24 (55)
T ss_pred             CCCC--CCccHHHHHHHHHHHHHHHH
Confidence            5553  34479999999855544443


No 100
>PF15176 LRR19-TM:  Leucine-rich repeat family 19 TM domain
Probab=57.65  E-value=11  Score=26.80  Aligned_cols=15  Identities=7%  Similarity=0.038  Sum_probs=7.4

Q ss_pred             cchhhhhHHHHHHHH
Q psy17635          4 DTVATAGWFIGMLLA   18 (103)
Q Consensus         4 ~~~at~gWfIg~~~a   18 (103)
                      .+.....|-..+-++
T Consensus        10 ~~~~g~sW~~LVGVv   24 (102)
T PF15176_consen   10 PGEGGRSWPFLVGVV   24 (102)
T ss_pred             CCCCCcccHhHHHHH
Confidence            344466775444333


No 101
>PHA03283 envelope glycoprotein E; Provisional
Probab=56.35  E-value=12  Score=33.18  Aligned_cols=25  Identities=16%  Similarity=0.232  Sum_probs=12.9

Q ss_pred             hhhHH--HHHHHHHHHHHHHHHHHhhe
Q psy17635          8 TAGWF--IGMLLAIAFLILVLILVCLI   32 (103)
Q Consensus         8 t~gWf--Ig~~~ai~lLllilli~C~i   32 (103)
                      +..|+  ++++++++.++++.|++++|
T Consensus       396 ~~~~l~~~~~~~~~~~~~~~~l~vw~c  422 (542)
T PHA03283        396 TRHYLAFLLAIICTCAALLVALVVWGC  422 (542)
T ss_pred             ccccchhHHHHHHHHHHHHHHHhhhhe
Confidence            35564  44444555555665555433


No 102
>PF05434 Tmemb_9:  TMEM9;  InterPro: IPR008853 This family contains several eukaryotic transmembrane proteins which are homologous to Homo sapiens transmembrane protein 9 Q9P0T7 from SWISSPROT. The TMEM9 gene encodes a 183 amino-acid protein that contains an N-terminal signal peptide, a single transmembrane region, three potential N-glycosylation sites and three conserved cys-rich domains in the N terminus, but no known functional domains. The protein is highly conserved between species from Caenorhabditis elegans to H. sapiens and belongs to a novel family of transmembrane proteins. The exact function of TMEM9 is unknown although it has been found to be widely expressed and localised to the late endosomes and lysosomes []. Members of this family contain CXCXC repeats IPR004153 from INTERPRO in their N-terminal region.; GO: 0016021 integral to membrane
Probab=56.11  E-value=6.8  Score=29.35  Aligned_cols=30  Identities=30%  Similarity=0.776  Sum_probs=15.0

Q ss_pred             HHHHHHHHHHHHHHHH-h---heeecCCCcccch
Q psy17635         14 GMLLAIAFLILVLILV-C---LIKRNRGGKYAVH   43 (103)
Q Consensus        14 g~~~ai~lLllilli~-C---~i~R~rGgKY~Vk   43 (103)
                      .+|+.|.+|+|..+.+ |   +++|..+.+|..+
T Consensus        60 ivl~Vi~lLvlYM~fL~~ldPllrk~~~~~~~~~   93 (149)
T PF05434_consen   60 IVLWVIGLLVLYMLFLMCLDPLLRKRFGHNYQIH   93 (149)
T ss_pred             EeHHHHHHHHHHHHHHHHHhHHHhcccccccccc
Confidence            3444444444432222 2   2467777777643


No 103
>PF10779 XhlA:  Haemolysin XhlA;  InterPro: IPR019715 Haemolysin XhlA is a cell-surface associated haemolysin that lyses the two most prevalent types of insect immune cells (granulocytes and plasmatocytes) as well as rabbit and horse erythrocytes []. 
Probab=55.68  E-value=20  Score=22.83  Aligned_cols=21  Identities=5%  Similarity=0.398  Sum_probs=14.6

Q ss_pred             hhhhHHHHHHHHHHHHHHHHH
Q psy17635          7 ATAGWFIGMLLAIAFLILVLI   27 (103)
Q Consensus         7 at~gWfIg~~~ai~lLllill   27 (103)
                      +++-|+++++...++..++.+
T Consensus        48 ~n~kW~~r~iiGaiI~~i~~~   68 (71)
T PF10779_consen   48 SNTKWIWRTIIGAIITAIIYL   68 (71)
T ss_pred             HHHHHHHHHHHHHHHHHHHHH
Confidence            577899998887555544433


No 104
>PF12575 DUF3753:  Protein of unknown function (DUF3753);  InterPro: IPR009175 This group represents an uncharacterised conserved protein belonging to poxvirus family I2.
Probab=55.51  E-value=20  Score=23.90  Aligned_cols=23  Identities=9%  Similarity=0.291  Sum_probs=15.9

Q ss_pred             chhhhhHHHHHHHHHHHHHHHHH
Q psy17635          5 TVATAGWFIGMLLAIAFLILVLI   27 (103)
Q Consensus         5 ~~at~gWfIg~~~ai~lLllill   27 (103)
                      ++....|+|+++.++.+++|+.+
T Consensus        45 ~~~~~~~ii~ii~v~ii~~l~fl   67 (72)
T PF12575_consen   45 NFNWIILIISIIFVLIIVLLTFL   67 (72)
T ss_pred             cchHHHHHHHHHHHHHHHHHHHH
Confidence            46778888888887666665543


No 105
>PF02038 ATP1G1_PLM_MAT8:  ATP1G1/PLM/MAT8 family;  InterPro: IPR000272  The FXYD protein family contains at least seven members in mammals []. Two other family members that are not obvious orthologs of any identified mammalian FXYD protein exist in zebrafish. All these proteins share a signature sequence of six conserved amino acids comprising the FXYD motif in the NH2-terminus, and two glycines and one serine residue in the transmembrane domain. FXYD proteins are widely distributed in mammalian tissues with prominent expression in tissues that perform fluid and solute transport or that are electrically excitable.   Initial functional characterisation suggested that FXYD proteins act as channels or as modulators of ion channels however studies have revealed that most FXYD proteins have another specific function and act as tissue-specific regulatory subunits of the Na,K-ATPase. Each of these auxiliary subunits produces a distinct functional effect on the transport characteristics of the Na,K-ATPase that is adjusted to the specific functional demands of the tissue in which the FXYD protein is expressed. FXYD proteins appear to preferentially associate with Na,K-ATPase alpha1-beta isozymes, and affect their function in a way that render them operationally complementary or supplementary to coexisting isozymes.; GO: 0005216 ion channel activity, 0006811 ion transport, 0016020 membrane; PDB: 2JO1_A 2JP3_A 2ZXE_G 3A3Y_G 3N23_E 3B8E_H 3KDP_G 3N2F_E.
Probab=54.88  E-value=12  Score=23.38  Aligned_cols=16  Identities=25%  Similarity=0.517  Sum_probs=8.4

Q ss_pred             HHHHHHHHHHHHHHHH
Q psy17635         13 IGMLLAIAFLILVLIL   28 (103)
Q Consensus        13 Ig~~~ai~lLllilli   28 (103)
                      -|+++|.+|.++-++|
T Consensus        17 gGLi~A~vlfi~Gi~i   32 (50)
T PF02038_consen   17 GGLIFAGVLFILGILI   32 (50)
T ss_dssp             HHHHHHHHHHHHHHHH
T ss_pred             cchHHHHHHHHHHHHH
Confidence            4677775444444333


No 106
>PF07213 DAP10:  DAP10 membrane protein;  InterPro: IPR009861 This family consists of several mammalian DAP10 membrane proteins. In activated mouse natural killer (NK) cells, the NKG2D receptor associates with two intracellular adaptors, DAP10 and DAP12, which trigger phosphatidyl inositol 3 kinase (PI3K) and Syk family protein tyrosine kinases, respectively. It has been suggested that the DAP10-PI3K pathway is sufficient to initiate NKG2D-mediated killing of target cells [].
Probab=54.66  E-value=22  Score=24.06  Aligned_cols=27  Identities=22%  Similarity=0.536  Sum_probs=19.6

Q ss_pred             hhhHHHHHHHH-HHHHHHHHHHHhheee
Q psy17635          8 TAGWFIGMLLA-IAFLILVLILVCLIKR   34 (103)
Q Consensus         8 t~gWfIg~~~a-i~lLllilli~C~i~R   34 (103)
                      +.+-+.|+++| +++-|||++.+..+-|
T Consensus        32 s~g~LaGiV~~D~vlTLLIv~~vy~car   59 (79)
T PF07213_consen   32 SPGLLAGIVAADAVLTLLIVLVVYYCAR   59 (79)
T ss_pred             CHHHHHHHHHHHHHHHHHHHHHHHhhcc
Confidence            34667888888 7777788777766654


No 107
>PF07423 DUF1510:  Protein of unknown function (DUF1510);  InterPro: IPR009988 This family consists of several hypothetical bacterial proteins of around 200 residues in length. The function of this family is unknown.
Probab=54.05  E-value=16  Score=28.69  Aligned_cols=23  Identities=9%  Similarity=0.144  Sum_probs=10.5

Q ss_pred             hHHHHHHHHHHHHHHHHHHHhhe
Q psy17635         10 GWFIGMLLAIAFLILVLILVCLI   32 (103)
Q Consensus        10 gWfIg~~~ai~lLllilli~C~i   32 (103)
                      .||=.+|.+++|||+|++...|+
T Consensus        14 ~iLNiaI~IV~lLIiiva~~lf~   36 (217)
T PF07423_consen   14 KILNIAIGIVSLLIIIVAYQLFF   36 (217)
T ss_pred             hhHHHHHHHHHHHHHHHhhhhee
Confidence            34444444444444444444554


No 108
>PF06936 Selenoprotein_S:  Selenoprotein S (SelS);  InterPro: IPR009703 This family consists of several mammalian selenoprotein S (SelS) sequences. SelS is a plasma membrane protein and is present in a variety of tissues and cell types. These proteins are involved in the degradation process of misfolded endoplasmic reticulum (ER) luminal proteins which participate in the transfer of misfolded proteins from the ER to the cytosol, where they are destroyed by the proteasome in a ubiquitin-dependent manner []. They probably serve as a linker between DER1, which mediates the retro-translocation of misfolded proteins into the cytosol, and the ATPase complex VCP, which mediates the translocation and ubiquitination.; GO: 0008430 selenium binding, 0006886 intracellular protein transport, 0030176 integral to endoplasmic reticulum membrane; PDB: 2Q2F_A.
Probab=53.88  E-value=4.3  Score=31.18  Aligned_cols=21  Identities=19%  Similarity=0.506  Sum_probs=0.0

Q ss_pred             hhhhhHHHHHHHHHHHHHHHH
Q psy17635          6 VATAGWFIGMLLAIAFLILVL   26 (103)
Q Consensus         6 ~at~gWfIg~~~ai~lLllil   26 (103)
                      +++-||+|.++|+++.+|+--
T Consensus        33 L~~yGWyil~~~I~ly~l~qk   53 (190)
T PF06936_consen   33 LSSYGWYILFGCILLYLLWQK   53 (190)
T ss_dssp             ---------------------
T ss_pred             HHHhCHHHHHHHHHHHHHHHH
Confidence            677899998777655554443


No 109
>PRK00159 putative septation inhibitor protein; Reviewed
Probab=53.71  E-value=17  Score=24.96  Aligned_cols=32  Identities=16%  Similarity=0.495  Sum_probs=23.8

Q ss_pred             hhhhHHHHHHHHHHHHHHHHHHHhheeecCCCccc
Q psy17635          7 ATAGWFIGMLLAIAFLILVLILVCLIKRNRGGKYA   41 (103)
Q Consensus         7 at~gWfIg~~~ai~lLllilli~C~i~R~rGgKY~   41 (103)
                      .+-.||..+|+++.++=|+=+++..+-   |+.++
T Consensus        28 ~sp~W~~~~m~glm~~GllWlvvyYl~---~~~~P   59 (87)
T PRK00159         28 PSSVWYVVLMLGLMLIGLAWLVVNYLA---GPAIP   59 (87)
T ss_pred             CCCccHHHHHHHHHHHHHHHHHHHhhc---cCCCC
Confidence            356899999999888878777776662   45555


No 110
>COG3671 Predicted membrane protein [Function unknown]
Probab=53.21  E-value=16  Score=26.73  Aligned_cols=37  Identities=41%  Similarity=0.565  Sum_probs=22.0

Q ss_pred             hhHHHHHHHH---HHHHHHHHHHHhheee-cCCCcccchhh
Q psy17635          9 AGWFIGMLLA---IAFLILVLILVCLIKR-NRGGKYAVHER   45 (103)
Q Consensus         9 ~gWfIg~~~a---i~lLllilli~C~i~R-~rGgKY~VkeK   45 (103)
                      .+|+|+++.+   |.+++++++-++++-| -.|=||.++.|
T Consensus        75 l~~iIg~Llt~lgiGv~i~~AlgvW~i~Riv~G~~yl~~g~  115 (125)
T COG3671          75 LWWIIGLLLTFLGIGVVILVALGVWYIYRIVIGFKYLNEGK  115 (125)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC
Confidence            4677887765   4455566666666655 44555555443


No 111
>KOG4331|consensus
Probab=53.05  E-value=12  Score=34.83  Aligned_cols=16  Identities=25%  Similarity=0.441  Sum_probs=9.2

Q ss_pred             hheeecCCCcccchhh
Q psy17635         30 CLIKRNRGGKYAVHER   45 (103)
Q Consensus        30 C~i~R~rGgKY~VkeK   45 (103)
                      |+|.|++||+-.=..|
T Consensus       130 c~C~~rc~~r~~~~~~  145 (865)
T KOG4331|consen  130 CRCCRRCGGRIKSALK  145 (865)
T ss_pred             eeeHhhhcCCCCchhc
Confidence            3345777777654444


No 112
>TIGR01433 CyoA cytochrome o ubiquinol oxidase subunit II. This enzyme catalyzes the oxidation of ubiquinol with the concomitant reduction of molecular oxygen to water. This acts as the terminal electron acceptor in the respiratory chain. Subunit II is responsible for binding and oxidation of the ubiquinone substrate. This sequence is closely related to QoxA, which oxidizes quinol in gram positive bacteria but which is in complex with subunits which utilize cytochromes a in the reduction of molecular oxygen. Slightly more distantly related is subunit II of cytochrome c oxidase which uses cyt. c as the oxidant.
Probab=53.04  E-value=17  Score=28.17  Aligned_cols=9  Identities=22%  Similarity=0.279  Sum_probs=4.1

Q ss_pred             HHHHHhhee
Q psy17635         25 VLILVCLIK   33 (103)
Q Consensus        25 illi~C~i~   33 (103)
                      ++++.+++|
T Consensus        48 ~~~~~~~~r   56 (226)
T TIGR01433        48 LMTLFFAWK   56 (226)
T ss_pred             HHHheeeEE
Confidence            333445554


No 113
>COG3296 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=52.95  E-value=15  Score=27.39  Aligned_cols=30  Identities=40%  Similarity=0.657  Sum_probs=17.7

Q ss_pred             HHHHHHHHHHHH--HHHHHHhheeecCCCccc
Q psy17635         12 FIGMLLAIAFLI--LVLILVCLIKRNRGGKYA   41 (103)
Q Consensus        12 fIg~~~ai~lLl--lilli~C~i~R~rGgKY~   41 (103)
                      ||+++...+|.+  |++-|+.++|-+.|.-|.
T Consensus       103 li~il~~~alsi~~lv~ti~a~~Ka~eGq~Yr  134 (143)
T COG3296         103 LIGILSIFALSILSLVLTIIAAIKAYEGQEYR  134 (143)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHhhCCceee
Confidence            444444333333  455556778888888773


No 114
>PRK14750 kdpF potassium-transporting ATPase subunit F; Provisional
Probab=52.76  E-value=23  Score=19.84  Aligned_cols=11  Identities=0%  Similarity=0.283  Sum_probs=5.5

Q ss_pred             HHHHHHHHHHH
Q psy17635         13 IGMLLAIAFLI   23 (103)
Q Consensus        13 Ig~~~ai~lLl   23 (103)
                      |++|+.++|.+
T Consensus         3 ~~vi~g~llv~   13 (29)
T PRK14750          3 FSIVCGALLVL   13 (29)
T ss_pred             HHHHHHHHHHH
Confidence            45666644433


No 115
>PTZ00208 65 kDa invariant surface glycoprotein; Provisional
Probab=52.43  E-value=16  Score=31.69  Aligned_cols=15  Identities=27%  Similarity=0.470  Sum_probs=11.3

Q ss_pred             HHhheeecCCCcccc
Q psy17635         28 LVCLIKRNRGGKYAV   42 (103)
Q Consensus        28 i~C~i~R~rGgKY~V   42 (103)
                      .+.|++|+||.--.|
T Consensus       406 ~~~~v~rrr~~~~dv  420 (436)
T PTZ00208        406 FFIMVKRRRNSSEDV  420 (436)
T ss_pred             hheeeeeccCCchhc
Confidence            456779999887666


No 116
>PF15176 LRR19-TM:  Leucine-rich repeat family 19 TM domain
Probab=52.34  E-value=13  Score=26.30  Aligned_cols=29  Identities=14%  Similarity=0.183  Sum_probs=18.1

Q ss_pred             HHHHHHHHHHHHHHHHHH-hheeecCCCcc
Q psy17635         12 FIGMLLAIAFLILVLILV-CLIKRNRGGKY   40 (103)
Q Consensus        12 fIg~~~ai~lLllilli~-C~i~R~rGgKY   40 (103)
                      .=.++.||++-|||+|.+ |=+.|+==-.|
T Consensus        21 VGVv~~al~~SlLIalaaKC~~~~k~~~SY   50 (102)
T PF15176_consen   21 VGVVVTALVTSLLIALAAKCPVWYKYLASY   50 (102)
T ss_pred             HHHHHHHHHHHHHHHHHHHhHHHHHHHhcc
Confidence            345778888777776665 65655433444


No 117
>PF13584 BatD:  Oxygen tolerance
Probab=51.84  E-value=17  Score=30.15  Aligned_cols=23  Identities=35%  Similarity=0.657  Sum_probs=9.2

Q ss_pred             hHHHHHHHHHHHHHHHHHHHhhee
Q psy17635         10 GWFIGMLLAIAFLILVLILVCLIK   33 (103)
Q Consensus        10 gWfIg~~~ai~lLllilli~C~i~   33 (103)
                      .||+.++ ++.++++++++++..+
T Consensus       426 ~~~~~l~-~~~~l~~~~~~~~~~~  448 (484)
T PF13584_consen  426 WWFWLLL-LLPLLLLLLLLILRRK  448 (484)
T ss_pred             HHHHHHH-HHHHHHHHHHHHHHHH
Confidence            3444443 3333333333444443


No 118
>PF15234 LAT:  Linker for activation of T-cells
Probab=51.71  E-value=57  Score=25.85  Aligned_cols=16  Identities=25%  Similarity=0.478  Sum_probs=11.4

Q ss_pred             HHhheeecCCCcccch
Q psy17635         28 LVCLIKRNRGGKYAVH   43 (103)
Q Consensus        28 i~C~i~R~rGgKY~Vk   43 (103)
                      .+|+.+|.=-|.|---
T Consensus        25 ALCvrCReLpgSydsa   40 (230)
T PF15234_consen   25 ALCVRCRELPGSYDSA   40 (230)
T ss_pred             HHHHHHhhCCCccccC
Confidence            4577778888888644


No 119
>PHA03097 C-type lectin-like protein; Provisional
Probab=51.54  E-value=18  Score=26.33  Aligned_cols=18  Identities=11%  Similarity=0.288  Sum_probs=11.5

Q ss_pred             HHHHHHHHHHHHHHHHHH
Q psy17635         12 FIGMLLAIAFLILVLILV   29 (103)
Q Consensus        12 fIg~~~ai~lLllilli~   29 (103)
                      .|+++|+...+.||++++
T Consensus        13 ~~~~~~~~~~~~~~~~~~   30 (157)
T PHA03097         13 WILVPCGSIIIALIALVI   30 (157)
T ss_pred             HHHHHHHHHHHHHHHHHH
Confidence            377888866666555554


No 120
>TIGR03063 srtB_target sortase B cell surface sorting signal. Two different classes of sorting signal, both analogous to the sortase A signal LPXTG, may be recognized by the sortase SrtB. These are given as NXZTN and NPKXZ. Proteins sorted by this class of sortase are less common than the sortase A and LPXTG system. This model describes a number of cell surface protein C-terminal regions from Gram-positive bacteria that appear to be sortase B (SrtB) sorting signals.
Probab=51.53  E-value=26  Score=19.55  Aligned_cols=17  Identities=18%  Similarity=0.421  Sum_probs=8.6

Q ss_pred             CCcchhhh-hHHHHHHHH
Q psy17635          2 PSDTVATA-GWFIGMLLA   18 (103)
Q Consensus         2 ~~~~~at~-gWfIg~~~a   18 (103)
                      |+++-.++ +|++.++.+
T Consensus         2 PkT~D~a~i~ly~~l~~~   19 (29)
T TIGR03063         2 PKTGDSAQIGLYAVLFLG   19 (29)
T ss_pred             CCCccchhHHHHHHHHHH
Confidence            55555554 454444433


No 121
>PF06781 UPF0233:  Uncharacterised protein family (UPF0233);  InterPro: IPR009619 This is a group of proteins of unknown function.
Probab=51.49  E-value=19  Score=24.61  Aligned_cols=30  Identities=23%  Similarity=0.526  Sum_probs=23.8

Q ss_pred             hhhhhHHHHHHHHHHHHHHHHHHHhheeec
Q psy17635          6 VATAGWFIGMLLAIAFLILVLILVCLIKRN   35 (103)
Q Consensus         6 ~at~gWfIg~~~ai~lLllilli~C~i~R~   35 (103)
                      -.+-.||..+|+++.++=|+=+++..+..+
T Consensus        27 ~~sp~W~~p~m~~lmllGL~WiVvyYi~~~   56 (87)
T PF06781_consen   27 KPSPRWYAPLMLGLMLLGLLWIVVYYISGG   56 (87)
T ss_pred             CCCCccHHHHHHHHHHHHHHHHhhhhcccC
Confidence            456789999999988888887777776544


No 122
>PF04906 Tweety:  Tweety;  InterPro: IPR006990 None of the members of the tweety (tty) family have been functionally characterised. However, they are considered to be transmembrane proteins with five potential membrane-spanning regions. A number of potential functions have been suggested on the basis of homology to the yeast FTR1 and FTH1 iron transporter proteins and the mammalian neurotensin receptors 1 and 2 in that they have a similar hydrophobicity profiles although there is no detectable sequence homology to the tweety-related proteins. It has been proposed that the tweety-related proteins could be involved in transport of iron or other divalent cations or alternatively that they may be membrane-bound receptors [].
Probab=51.20  E-value=16  Score=30.54  Aligned_cols=7  Identities=43%  Similarity=0.956  Sum_probs=3.2

Q ss_pred             hheeecC
Q psy17635         30 CLIKRNR   36 (103)
Q Consensus        30 C~i~R~r   36 (103)
                      ||+||.+
T Consensus        45 CC~r~~~   51 (406)
T PF04906_consen   45 CCCRRPR   51 (406)
T ss_pred             hhCCCCC
Confidence            5554433


No 123
>PRK10525 cytochrome o ubiquinol oxidase subunit II; Provisional
Probab=50.96  E-value=17  Score=29.87  Aligned_cols=10  Identities=10%  Similarity=-0.071  Sum_probs=3.8

Q ss_pred             hhHHHHHHHH
Q psy17635          9 AGWFIGMLLA   18 (103)
Q Consensus         9 ~gWfIg~~~a   18 (103)
                      +.++|.++.+
T Consensus        40 ~~~Li~~~~~   49 (315)
T PRK10525         40 QRSLILTAFG   49 (315)
T ss_pred             HHHHHHHHHH
Confidence            3343433333


No 124
>PF06396 AGTRAP:  Angiotensin II, type I receptor-associated protein (AGTRAP);  InterPro: IPR009436 This family consists of several angiotensin II, type I receptor-associated protein (AGTRAP) sequences. AGTRAP is known to interact specifically with the C-terminal cytoplasmic region of the angiotensin II type 1 (AT(1)) receptor to regulate different aspects of AT(1) receptor physiology. The function of this family is unclear.
Probab=50.94  E-value=71  Score=24.09  Aligned_cols=37  Identities=24%  Similarity=0.367  Sum_probs=21.1

Q ss_pred             hhhhHHHHHHHHHHHHHH-HHHHHhheeecCCCcccch
Q psy17635          7 ATAGWFIGMLLAIAFLIL-VLILVCLIKRNRGGKYAVH   43 (103)
Q Consensus         7 at~gWfIg~~~ai~lLll-illi~C~i~R~rGgKY~Vk   43 (103)
                      .+..|-.++.+.-++|-- .++.+.-..|.|||.|.|+
T Consensus        85 ~~~~Fs~~maIinLllKP~s~~~l~r~~reRGG~l~~~  122 (162)
T PF06396_consen   85 DTDRFSAGMAIINLLLKPISCFFLYRMYRERGGELSVN  122 (162)
T ss_pred             ccchhhHHHHHHHHHHHHHHHHHHHHHHHhcCCceecc
Confidence            344555555444333332 2334444568999999876


No 125
>PF13273 DUF4064:  Protein of unknown function (DUF4064)
Probab=50.74  E-value=25  Score=23.23  Aligned_cols=28  Identities=36%  Similarity=0.477  Sum_probs=14.2

Q ss_pred             hhhhHHHHHHHHHHHHHHHHHHHhheeec
Q psy17635          7 ATAGWFIGMLLAIAFLILVLILVCLIKRN   35 (103)
Q Consensus         7 at~gWfIg~~~ai~lLllilli~C~i~R~   35 (103)
                      .+..|+.+++.. +.+++.++..+++|+|
T Consensus        55 ~~~~~~~~i~~i-i~~il~iia~i~ikk~   82 (100)
T PF13273_consen   55 STFGIILGIIAI-ISSILGIIASILIKKN   82 (100)
T ss_pred             HHHHHHHHHHHH-HHHHHHHHHHHHHcCC
Confidence            344555555543 3444444445667663


No 126
>PF14851 FAM176:  FAM176 family
Probab=50.33  E-value=47  Score=24.81  Aligned_cols=24  Identities=21%  Similarity=0.290  Sum_probs=12.5

Q ss_pred             hHHHHHHHHHHHHHHHHHHHhhee
Q psy17635         10 GWFIGMLLAIAFLILVLILVCLIK   33 (103)
Q Consensus        10 gWfIg~~~ai~lLllilli~C~i~   33 (103)
                      -.|+.=+|+=+||+|.+|++.+.+
T Consensus        24 LYFv~gVC~GLlLtLcllV~risc   47 (153)
T PF14851_consen   24 LYFVSGVCAGLLLTLCLLVIRISC   47 (153)
T ss_pred             HHHHHHHHHHHHHHHHHHHhhhee
Confidence            456666666444445444443333


No 127
>KOG4482|consensus
Probab=50.29  E-value=17  Score=31.46  Aligned_cols=27  Identities=22%  Similarity=0.517  Sum_probs=14.3

Q ss_pred             HHHHHHHHHHHHHHHHHHHhheeecCCCc
Q psy17635         11 WFIGMLLAIAFLILVLILVCLIKRNRGGK   39 (103)
Q Consensus        11 WfIg~~~ai~lLllilli~C~i~R~rGgK   39 (103)
                      ..|-+++|++|.+++.+|+||  |+.|-|
T Consensus       301 faIpl~Valll~~~La~imc~--rrEg~~  327 (449)
T KOG4482|consen  301 FAIPLGVALLLVLALAYIMCC--RREGQK  327 (449)
T ss_pred             HHHHHHHHHHHHHHHHHHHhh--hhhccc
Confidence            334555555555555556664  555544


No 128
>PF14316 DUF4381:  Domain of unknown function (DUF4381)
Probab=50.18  E-value=18  Score=25.72  Aligned_cols=11  Identities=27%  Similarity=0.851  Sum_probs=5.3

Q ss_pred             hhhHHHHHHHH
Q psy17635          8 TAGWFIGMLLA   18 (103)
Q Consensus         8 t~gWfIg~~~a   18 (103)
                      +-||-+.++.+
T Consensus        20 a~GWwll~~ll   30 (146)
T PF14316_consen   20 APGWWLLLALL   30 (146)
T ss_pred             cHHHHHHHHHH
Confidence            44565544443


No 129
>PF07466 DUF1517:  Protein of unknown function (DUF1517);  InterPro: IPR010903 This family consists of several hypothetical glycine rich plant and bacterial proteins of around 300 residues in length. The function of this family is unknown.
Probab=50.10  E-value=10  Score=30.59  Aligned_cols=30  Identities=20%  Similarity=0.403  Sum_probs=18.6

Q ss_pred             hhhHHHHHHHHHHHHHHHHHHHhheeecCC
Q psy17635          8 TAGWFIGMLLAIAFLILVLILVCLIKRNRG   37 (103)
Q Consensus         8 t~gWfIg~~~ai~lLllilli~C~i~R~rG   37 (103)
                      .-++|++++.++++..++.+++-+++|.+.
T Consensus        60 ~Ggg~~gl~~iLIl~~Ia~~vv~~~r~~~~   89 (289)
T PF07466_consen   60 FGGGFGGLFDILILFGIAFFVVRFFRRRRS   89 (289)
T ss_pred             cCcccchHHHHHHHHHHHHHHHHHHHhhcc
Confidence            345567777776666666666666665443


No 130
>PF00558 Vpu:  Vpu protein;  InterPro: IPR008187 The Human immunodeficiency virus 1 (HIV-1) Vpu protein acts in the degradation of CD4 in the endoplasmic reticulum and in the enhancement of virion release from the plasma membrane of infected cells [].; GO: 0019076 release of virus from host; PDB: 2JPX_A 1PI8_A 2GOH_A 2GOF_A 1PI7_A 1PJE_A 1VPU_A 2K7Y_A.
Probab=50.00  E-value=16  Score=24.79  Aligned_cols=17  Identities=24%  Similarity=0.481  Sum_probs=6.4

Q ss_pred             HHHHHHHHHHHHHHHHH
Q psy17635         13 IGMLLAIAFLILVLILV   29 (103)
Q Consensus        13 Ig~~~ai~lLllilli~   29 (103)
                      |..+.|++++++|.+++
T Consensus         6 i~~iialiv~~iiaIvv   22 (81)
T PF00558_consen    6 ILAIIALIVALIIAIVV   22 (81)
T ss_dssp             --HHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHHH
Confidence            34444444444444443


No 131
>PF06341 DUF1056:  Protein of unknown function (DUF1056);  InterPro: IPR009406 This entry is represented by Bacteriophage bIL286, Orf42. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several putative head-tail joining bacteriophage proteins.
Probab=49.76  E-value=24  Score=23.01  Aligned_cols=26  Identities=23%  Similarity=0.420  Sum_probs=15.8

Q ss_pred             HHHHHHHHHHHHHHHHHhheeecCCC
Q psy17635         13 IGMLLAIAFLILVLILVCLIKRNRGG   38 (103)
Q Consensus        13 Ig~~~ai~lLllilli~C~i~R~rGg   38 (103)
                      +|++..-+-|++..++.-++.++|||
T Consensus        37 ~g~i~i~I~l~l~G~isE~i~~~Kgg   62 (63)
T PF06341_consen   37 AGLISIGITLFLAGLISEFISKQKGG   62 (63)
T ss_pred             HHHHHHHHHHHHHHHHHHHHhhccCC
Confidence            34444444455556666777778887


No 132
>PF13349 DUF4097:  Domain of unknown function (DUF4097)
Probab=49.76  E-value=18  Score=25.01  Aligned_cols=18  Identities=33%  Similarity=0.480  Sum_probs=10.4

Q ss_pred             HHHHHHHHHHHHHHHHHH
Q psy17635         11 WFIGMLLAIAFLILVLIL   28 (103)
Q Consensus        11 WfIg~~~ai~lLllilli   28 (103)
                      |+|+++|+++.++++++.
T Consensus         4 ~~i~~~l~viG~il~~~g   21 (166)
T PF13349_consen    4 LIIGLILIVIGIILIGIG   21 (166)
T ss_pred             ehHHHHHHHHHHHHHHHH
Confidence            566666665555554443


No 133
>PF15431 TMEM190:  Transmembrane protein 190
Probab=49.74  E-value=17  Score=26.66  Aligned_cols=29  Identities=24%  Similarity=0.390  Sum_probs=17.6

Q ss_pred             hhhhHHHHHHHH-HHHHHHHHHHHhheeec
Q psy17635          7 ATAGWFIGMLLA-IAFLILVLILVCLIKRN   35 (103)
Q Consensus         7 at~gWfIg~~~a-i~lLllilli~C~i~R~   35 (103)
                      ...-|-++.-|+ +++|+..+++.+..+|+
T Consensus        58 rKHmWaL~wtC~gll~Li~~iclFWWAkRr   87 (134)
T PF15431_consen   58 RKHMWALGWTCGGLLLLICSICLFWWAKRR   87 (134)
T ss_pred             HHHHHHHHHHHHhHHHHHHHHHHHHHHHHh
Confidence            456799999998 33333444555555554


No 134
>PF14610 DUF4448:  Protein of unknown function (DUF4448)
Probab=49.65  E-value=9  Score=28.36  Aligned_cols=16  Identities=19%  Similarity=0.341  Sum_probs=6.5

Q ss_pred             HHHHHHHHHHHhheee
Q psy17635         19 IAFLILVLILVCLIKR   34 (103)
Q Consensus        19 i~lLllilli~C~i~R   34 (103)
                      +++++|++++.|+.+|
T Consensus       168 v~~~~~~~~~~~~~~R  183 (189)
T PF14610_consen  168 VVVLALIMYGFFFWNR  183 (189)
T ss_pred             HHHHHHHHHhhheeec
Confidence            3333333444444443


No 135
>COG1622 CyoA Heme/copper-type cytochrome/quinol oxidases, subunit 2 [Energy production and conversion]
Probab=49.61  E-value=21  Score=28.26  Aligned_cols=33  Identities=21%  Similarity=0.275  Sum_probs=15.4

Q ss_pred             hhhHHHHH----HHHHHHHHHHHHHHhhee-ecCCCcc
Q psy17635          8 TAGWFIGM----LLAIAFLILVLILVCLIK-RNRGGKY   40 (103)
Q Consensus         8 t~gWfIg~----~~ai~lLllilli~C~i~-R~rGgKY   40 (103)
                      .+-|+|.+    |..|+++++.+++.+++| |.+.+.-
T Consensus        29 ~~~~~~~~~~~~~~~I~~~V~~~~~~~~~k~R~~~~~~   66 (247)
T COG1622          29 EQRDLIILSTLLMLVIVLPVIVLLVYFAWKYRASNNAA   66 (247)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCC
Confidence            34455433    333445555555555554 4444443


No 136
>PF00957 Synaptobrevin:  Synaptobrevin;  InterPro: IPR001388 Synaptobrevin is an intrinsic membrane protein of small synaptic vesicles [], specialised secretory organelles of neurons that actively accumulate neurotransmitters and participate in their calcium-dependent release by exocytosis. Vesicle function is mediated by proteins in their membranes, although the precise nature of the protein-protein interactions underlying this are still uncertain []. Synaptobrevin may play a role in the molecular events underlying neurotransmitter release and vesicle recycling and may be involved in the regulation of membrane flow in the nerve terminal, a process mediated by interaction with low molecular weight GTP-binding proteins []. Synaptic vesicle-associated membrane proteins (VAMPs) from Torpedo californica (Pacific electric ray) and SNC1 from yeast are related to synaptobrevin.; GO: 0016192 vesicle-mediated transport, 0016021 integral to membrane; PDB: 3EGX_C 2NUP_C 3EGD_C 2NUT_C 1IOU_A 1H8M_A 3B5N_A 3ZYM_A 2NPS_A 1SFC_E ....
Probab=49.58  E-value=27  Score=22.57  Aligned_cols=21  Identities=24%  Similarity=0.594  Sum_probs=11.2

Q ss_pred             HHHHHHHHHHHHHHHHHHhhe
Q psy17635         12 FIGMLLAIAFLILVLILVCLI   32 (103)
Q Consensus        12 fIg~~~ai~lLllilli~C~i   32 (103)
                      -+.+++++++++++++|+..+
T Consensus        67 k~~~i~~~iv~~~~~~i~~~~   87 (89)
T PF00957_consen   67 KLYIIIIIIVIIIILIIIIVI   87 (89)
T ss_dssp             HHHHHHHHHHHHHHHHHHHTT
T ss_pred             HHHHhHHhhhhhhhhHHHHHH
Confidence            344555555555566655544


No 137
>PRK12757 cell division protein FtsN; Provisional
Probab=49.36  E-value=11  Score=30.30  Aligned_cols=22  Identities=27%  Similarity=0.397  Sum_probs=12.9

Q ss_pred             HHHHHHHHHH--HHhheeecCCCc
Q psy17635         18 AIAFLILVLI--LVCLIKRNRGGK   39 (103)
Q Consensus        18 ai~lLllill--i~C~i~R~rGgK   39 (103)
                      |++++||+++  -++||..||.+-
T Consensus         3 alava~lv~f~gGLyfi~h~k~~~   26 (256)
T PRK12757          3 ALAVAVLVTFIGGLYFITHNKKEE   26 (256)
T ss_pred             HHHHHHHHHHhhheEEEecCCCCC
Confidence            4444444444  478888888653


No 138
>PF06900 DUF1270:  Protein of unknown function (DUF1270);  InterPro: IPR009682 This entry is represented by Bacteriophage 92, Orf86. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several hypothetical Staphylococcus aureus and phage proteins of 53 residues in length. The function of this family is unknown.
Probab=48.77  E-value=31  Score=21.80  Aligned_cols=21  Identities=14%  Similarity=0.265  Sum_probs=14.6

Q ss_pred             hhHHHHHHHHHHHHHHHHHHH
Q psy17635          9 AGWFIGMLLAIAFLILVLILV   29 (103)
Q Consensus         9 ~gWfIg~~~ai~lLllilli~   29 (103)
                      ...+|++||.+++-+.+..++
T Consensus         6 kSylIA~l~ftvlsi~L~~~l   26 (53)
T PF06900_consen    6 KSYLIANLCFTVLSIALMPFL   26 (53)
T ss_pred             HHHHHHHHHHHHHHHHHHHHH
Confidence            467899999877766554443


No 139
>PF03845 Spore_permease:  Spore germination protein;  InterPro: IPR004761 Amino acid permeases are integral membrane proteins involved in the transport of amino acids into the cell. A number of such proteins have been found to be evolutionary related [, , ]. These proteins seem to contain up to 12 transmembrane segments. The best conserved region in this family is located in the second transmembrane segment. Spore germination protein (amino acid permease) is involved in the response to the germinative mixture of L-asparagine, glucose, fructose and potassium ions (AFFK). These proteins could be amino acid transporters.; GO: 0009847 spore germination, 0016021 integral to membrane
Probab=48.71  E-value=20  Score=27.95  Aligned_cols=31  Identities=29%  Similarity=0.487  Sum_probs=25.2

Q ss_pred             hhHHHHHHHHHHHHHHHHHHHhheeecCCCc
Q psy17635          9 AGWFIGMLLAIAFLILVLILVCLIKRNRGGK   39 (103)
Q Consensus         9 ~gWfIg~~~ai~lLllilli~C~i~R~rGgK   39 (103)
                      .+|+..++..+..++++.++..+.+|.+|..
T Consensus        33 d~Wi~~ll~~~~~l~~~~l~~~l~~~~p~~~   63 (320)
T PF03845_consen   33 DAWISVLLGGLIGLLLALLIYYLLKRFPGKT   63 (320)
T ss_pred             CcHHHHHHHHHHHHHHHHHHHHHHHHCCCCC
Confidence            5899988888888888888888888876643


No 140
>PF06400 Alpha-2-MRAP_N:  Alpha-2-macroglobulin RAP, N-terminal domain;  InterPro: IPR009066 The alpha-2-macroglobulin receptor-associated protein (RAP) is a glycoprotein that binds to the alpha-2-macroglobulin receptor, as well as to other members of the low density lipoprotein receptor family (IPR002172 from INTERPRO). RAP acts to inhibit the binding of all know ligands for these receptors, and may prevent receptor aggregation and degradation in the endoplasmic reticulum, thereby acting as a molecular chaperone []. RAP may be under the regulatory control of calmodulin, since it is able to bind calmodulin and be phosphorylated by calmodulin-dependent kinase II (IPR002048 from INTERPRO). RAP is comprised of three domains. Both domains 1 and 3 are involved in binding to the alpha-2-macroglobulin receptor, while domain 1 is also involved in inhibiting the binding of activated alpha-2-macroglobulin (IPR001599 from INTERPRO). Structural studies have revealed the RAP domain 1 to be comprised of a partly opened bundle of three helices, the first one being shorter than the other two.; PDB: 1NRE_A 2P03_A 1OV2_A 1LRE_A 2FYL_A 1OP1_A 2P01_A.
Probab=48.67  E-value=5.8  Score=28.82  Aligned_cols=24  Identities=29%  Similarity=0.530  Sum_probs=0.0

Q ss_pred             HHHHHHHHhheeecCCCcccchhh
Q psy17635         22 LILVLILVCLIKRNRGGKYAVHER   45 (103)
Q Consensus        22 Lllilli~C~i~R~rGgKY~VkeK   45 (103)
                      +++++++.++.....++||+-.-.
T Consensus         3 ~~~~ll~~~~~~~~~~~KYSre~N   26 (120)
T PF06400_consen    3 LLLLLLLFPIAAVKAAGKYSREAN   26 (120)
T ss_dssp             ------------------------
T ss_pred             HHHHHHHHHhHHHHhccccCHhhC
Confidence            334444445555778999985444


No 141
>MTH00186 ATP8 ATP synthase F0 subunit 8; Provisional
Probab=48.53  E-value=38  Score=20.19  Aligned_cols=31  Identities=13%  Similarity=0.339  Sum_probs=19.7

Q ss_pred             CCCcchhhhhHHHHHHHHHHHHHHHHHHHhhee
Q psy17635          1 MPSDTVATAGWFIGMLLAIAFLILVLILVCLIK   33 (103)
Q Consensus         1 ~~~~~~at~gWfIg~~~ai~lLllilli~C~i~   33 (103)
                      |||=+.  ..|++..+..++.++++...+.+..
T Consensus         1 MPQm~P--~~W~~l~~~~~~~~~~~~~~~~~~~   31 (52)
T MTH00186          1 MPQLSP--LFWALLFIMIWLLIFLAMSMTWWLQ   31 (52)
T ss_pred             CCCCCc--hHHHHHHHHHHHHHHHHHHHHHHhc
Confidence            555443  4799887777666666665666554


No 142
>PF03908 Sec20:  Sec20;  InterPro: IPR005606 Sec20 is a membrane glycoprotein associated with secretory pathway.
Probab=48.47  E-value=17  Score=24.02  Aligned_cols=18  Identities=28%  Similarity=0.680  Sum_probs=9.6

Q ss_pred             HHHHHHHHHHHHHhheee
Q psy17635         17 LAIAFLILVLILVCLIKR   34 (103)
Q Consensus        17 ~ai~lLllilli~C~i~R   34 (103)
                      ...+|++|++++++++.|
T Consensus        73 i~~~~~~f~~~v~yI~~r   90 (92)
T PF03908_consen   73 IFFAFLFFLLVVLYILWR   90 (92)
T ss_pred             HHHHHHHHHHHHHHHhhh
Confidence            334555555555555554


No 143
>PF00517 GP41:  Retroviral envelope protein;  InterPro: IPR000328 This entry represents envelope proteins from a variety of retroviruses. It includes the GP41 subunit of the envelope protein complex from Human immunodeficiency virus (HIV) and Simian-Human immunodeficiency virus (SIV), which mediate membrane fusion during viral entry []. It has a core composed of a six-helix bundle and is folded by its trimeric N- and C-terminal heptad-repeats (NHR and CHR) []. Derivatives of this protein prevent HIV-1 from entering cell lines and primary human CD4+ cells in vitro [], making it an attractive subject of gene therapy studies against HIV and related retroviruses. The entry also represents envelop proteins from Bovine immunodeficiency virus, Feline immunodeficiency virus and Equine infectious anemia virus (EIAV) [, ], as well as the Gp36 protein from Mouse mammary tumor virus (MMTV) and Human endogenous retrovirus (HERV).; GO: 0005198 structural molecule activity, 0019031 viral envelope; PDB: 2EZO_B 2EZQ_B 2EZR_A 2JNR_B 1F23_D 2EZP_A 1JEK_A 2Q7C_A 2Q5U_A 2Q3I_A ....
Probab=48.45  E-value=8.4  Score=29.38  Aligned_cols=25  Identities=28%  Similarity=0.542  Sum_probs=3.0

Q ss_pred             HHHHHHHHHHHHHHHHHHHhheeec
Q psy17635         11 WFIGMLLAIAFLILVLILVCLIKRN   35 (103)
Q Consensus        11 WfIg~~~ai~lLllilli~C~i~R~   35 (103)
                      |+|.+...|++.++.+.|++++-|.
T Consensus       155 ~~Iki~i~iv~~iI~lri~~~i~~~  179 (204)
T PF00517_consen  155 WYIKIFIMIVIGIIGLRILMCILRI  179 (204)
T ss_dssp             HHHHH--------------------
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            4566666666666667777777777


No 144
>PHA02692 hypothetical protein; Provisional
Probab=48.33  E-value=30  Score=23.03  Aligned_cols=8  Identities=25%  Similarity=0.621  Sum_probs=3.3

Q ss_pred             hHHHHHHH
Q psy17635         10 GWFIGMLL   17 (103)
Q Consensus        10 gWfIg~~~   17 (103)
                      +|.+.++.
T Consensus        44 ~~~~~ii~   51 (70)
T PHA02692         44 PWTTVFLI   51 (70)
T ss_pred             chHHHHHH
Confidence            34444433


No 145
>PF10661 EssA:  WXG100 protein secretion system (Wss), protein EssA;  InterPro: IPR018920  The Wss (WXG100 protein secretion system) in Staphylococcus aureus seems to be encoded by a locus of eight ORFs, called ess (eSAT-6 secretion system) []. This locus encodes, amongst several other proteins, EssA, a protein predicted to possess one transmembrane domain. Due to its predicted membrane location and its absolute requirement for WXG100 protein secretion, it has been speculated that EssA could form a secretion apparatus in conjunction with YukC and YukAB. Proteins homologous to EssA, YukC, EsaA and YukD were absent from mycobacteria [].   Members of this family are associated with type VII secretion of WXG100 family targets in the Firmicutes, but not in the Actinobacteria. This highly divergent protein family consists largely of a central region of highly polar low-complexity sequence containing occasional LF motifs in weak repeats about 17 residues in length, flanked by hydrophobic N- and C-terminal regions. 
Probab=48.15  E-value=17  Score=26.62  Aligned_cols=11  Identities=9%  Similarity=-0.115  Sum_probs=5.4

Q ss_pred             hhhHHHHHHHH
Q psy17635          8 TAGWFIGMLLA   18 (103)
Q Consensus         8 t~gWfIg~~~a   18 (103)
                      +..|.|+++++
T Consensus       116 ~~~~~i~~~i~  126 (145)
T PF10661_consen  116 PISPTILLSIG  126 (145)
T ss_pred             chhHHHHHHHH
Confidence            34455555444


No 146
>COG2268 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=47.27  E-value=28  Score=30.94  Aligned_cols=19  Identities=5%  Similarity=0.468  Sum_probs=12.1

Q ss_pred             HHHHHHHHHHHHHHHHHHH
Q psy17635         11 WFIGMLLAIAFLILVLILV   29 (103)
Q Consensus        11 WfIg~~~ai~lLllilli~   29 (103)
                      ||++.++++++++++++++
T Consensus        11 ~~~~~ivvv~i~~ilv~if   29 (548)
T COG2268          11 LMLIAIVVVVILVILVLIF   29 (548)
T ss_pred             HHHHHHHHHHHHHHHHHHH
Confidence            8887777766555554443


No 147
>PF12670 DUF3792:  Protein of unknown function (DUF3792);  InterPro: IPR023804  Members of this family of strongly hydrophobic putative transmembrane protein average about 125 amino acids in length and occur mostly, but not exclusively, in the Firmicutes. Members are quite diverse in sequence. Their function is unknown. 
Probab=47.20  E-value=25  Score=24.25  Aligned_cols=28  Identities=21%  Similarity=0.783  Sum_probs=21.0

Q ss_pred             hhhhhHHHHHHHHHHHHHHHHHHHhhee
Q psy17635          6 VATAGWFIGMLLAIAFLILVLILVCLIK   33 (103)
Q Consensus         6 ~at~gWfIg~~~ai~lLllilli~C~i~   33 (103)
                      ..+.+|+.|++.++...++++++-.+..
T Consensus        62 ~~~kG~l~G~~~Gl~y~~il~lis~~~~   89 (116)
T PF12670_consen   62 AGSKGWLHGLLVGLLYFLILLLISFLFG   89 (116)
T ss_pred             HccchHHHHHHHHHHHHHHHHHHHHHHc
Confidence            4568999999999887777766655443


No 148
>PRK10299 PhoPQ regulatory protein; Provisional
Probab=47.11  E-value=25  Score=21.73  Aligned_cols=21  Identities=19%  Similarity=0.420  Sum_probs=15.0

Q ss_pred             hhHHHHHHHHHHHHHHHHHHH
Q psy17635          9 AGWFIGMLLAIAFLILVLILV   29 (103)
Q Consensus         9 ~gWfIg~~~ai~lLllilli~   29 (103)
                      ..|.|.++.+++-++|.++.+
T Consensus         4 ~rwiili~iv~~Cl~lyl~al   24 (47)
T PRK10299          4 FRWVVLVVVVLACLLLWAQVF   24 (47)
T ss_pred             eeehHHHHHHHHHHHHHHHHH
Confidence            469988877777666666654


No 149
>PF03954 Lectin_N:  Hepatic lectin, N-terminal domain;  InterPro: IPR005640 Animal lectins display a wide variety of architectures. They are classified according to the carbohydrate-recognition domain (CRD) of which there are two main types, S-type and C-type. C-type lectins display a wide range of specificities. They require Ca2+ for their activity They are found predominantly but not exclusively in vertebrates. This entry presents N-terminal domain, which is found in C-type lectins.; GO: 0005529 sugar binding, 0016020 membrane
Probab=46.92  E-value=19  Score=26.70  Aligned_cols=23  Identities=26%  Similarity=0.541  Sum_probs=13.9

Q ss_pred             HHHHHHHHHHHHHHHHHHHhhee
Q psy17635         11 WFIGMLLAIAFLILVLILVCLIK   33 (103)
Q Consensus        11 WfIg~~~ai~lLllilli~C~i~   33 (103)
                      |+--++.++.|-+|.+++||+|-
T Consensus        32 ~~~l~LlsLgl~~LLLV~IcVig   54 (138)
T PF03954_consen   32 GPRLLLLSLGLSLLLLVVICVIG   54 (138)
T ss_pred             cchHHHHHHHHHHHHHHHHHhhc
Confidence            33445555555566666788874


No 150
>COG4218 MtrF Tetrahydromethanopterin S-methyltransferase, subunit F [Coenzyme metabolism]
Probab=46.86  E-value=30  Score=23.20  Aligned_cols=22  Identities=27%  Similarity=0.448  Sum_probs=14.6

Q ss_pred             hhhhhHHHHHHHHHHHHHHHHH
Q psy17635          6 VATAGWFIGMLLAIAFLILVLI   27 (103)
Q Consensus         6 ~at~gWfIg~~~ai~lLllill   27 (103)
                      ....|..||++.|++|+.+.++
T Consensus        49 t~i~GlaiGfvfA~vLv~illl   70 (73)
T COG4218          49 TRIAGLAIGFVFAGVLVGILLL   70 (73)
T ss_pred             hhhHHHHHHHHHHHHHHHHHHH
Confidence            3456777888888766655544


No 151
>PF15339 Afaf:  Acrosome formation-associated factor
Probab=46.65  E-value=24  Score=27.57  Aligned_cols=20  Identities=25%  Similarity=0.718  Sum_probs=10.5

Q ss_pred             HHHHHHHHH----HHHHHHHHhhe
Q psy17635         13 IGMLLAIAF----LILVLILVCLI   32 (103)
Q Consensus        13 Ig~~~ai~l----Lllilli~C~i   32 (103)
                      +-+|+.|.|    |+++||++|+.
T Consensus       130 lkLmLGIsLmTl~lfv~Ll~~c~a  153 (200)
T PF15339_consen  130 LKLMLGISLMTLFLFVILLAFCSA  153 (200)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHH
Confidence            445555443    33446666765


No 152
>PF13908 Shisa:  Wnt and FGF inhibitory regulator
Probab=46.54  E-value=14  Score=26.97  Aligned_cols=8  Identities=13%  Similarity=0.737  Sum_probs=3.4

Q ss_pred             HHHHHHHH
Q psy17635         12 FIGMLLAI   19 (103)
Q Consensus        12 fIg~~~ai   19 (103)
                      .++++++|
T Consensus        77 ~~~iivgv   84 (179)
T PF13908_consen   77 ITGIIVGV   84 (179)
T ss_pred             eeeeeeeh
Confidence            33444444


No 153
>KOG3970|consensus
Probab=46.27  E-value=25  Score=28.71  Aligned_cols=34  Identities=21%  Similarity=0.445  Sum_probs=22.0

Q ss_pred             hhhhHHHHHHHHHHHHHHHHHHHhheeecCCCcccch
Q psy17635          7 ATAGWFIGMLLAIAFLILVLILVCLIKRNRGGKYAVH   43 (103)
Q Consensus         7 at~gWfIg~~~ai~lLllilli~C~i~R~rGgKY~Vk   43 (103)
                      -..+||+.++..++|++||.|+-.+   .|+|.-+|.
T Consensus       250 ~~ra~fli~lgvLafi~~i~lM~rl---Gr~g~ds~D  283 (299)
T KOG3970|consen  250 KKRALFLIFLGVLAFITIIMLMKRL---GRSGEDSSD  283 (299)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHh---cccccccCC
Confidence            4567888888777777777665543   556654443


No 154
>PRK13415 flagella biosynthesis protein FliZ; Provisional
Probab=45.85  E-value=28  Score=27.56  Aligned_cols=28  Identities=36%  Similarity=0.376  Sum_probs=14.0

Q ss_pred             HHHHHHHHHHHHHHH-HHH-HhheeecCCC
Q psy17635         11 WFIGMLLAIAFLILV-LIL-VCLIKRNRGG   38 (103)
Q Consensus        11 WfIg~~~ai~lLlli-lli-~C~i~R~rGg   38 (103)
                      =++-++.+++++|++ ..+ -++.+|++++
T Consensus        67 ~l~qmi~aL~~VI~Liy~l~rwL~rR~~~~   96 (219)
T PRK13415         67 DFVKLIGATLFVIFLIYALVKWLNKRNRLL   96 (219)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHhcccc
Confidence            355566554444322 222 2666776654


No 155
>PF13179 DUF4006:  Family of unknown function (DUF4006)
Probab=45.74  E-value=26  Score=22.99  Aligned_cols=15  Identities=40%  Similarity=0.833  Sum_probs=9.0

Q ss_pred             HHHHHHHHHHHHHHH
Q psy17635         12 FIGMLLAIAFLILVL   26 (103)
Q Consensus        12 fIg~~~ai~lLllil   26 (103)
                      +.|+++|++|||-|+
T Consensus        13 i~G~LIAvvLLLsIl   27 (66)
T PF13179_consen   13 ITGMLIAVVLLLSIL   27 (66)
T ss_pred             hHhHHHHHHHHHHHH
Confidence            356777766666443


No 156
>PF02397 Bac_transf:  Bacterial sugar transferase;  InterPro: IPR003362 This entry represents a conserved region from a number of different bacterial sugar transferases, involved in diverse biosynthesis pathways. Examples include galactosyl-P-P-undecaprenol synthetase (2.7.8.6 from EC), which transfers galatose-1-phosphate to the lipid precursor undecaprenol phosphate in the first steps of O-polysaccharide biosynthesis; UDP-galactose-lipid carrier transferase, which is involved in the biosynthesis of amylovoran; and galactosyl transferase CpsD, which is essential for assembly of the group B Streptococci (GBS) type III capsular polysaccharide.
Probab=45.66  E-value=22  Score=26.91  Aligned_cols=24  Identities=38%  Similarity=0.476  Sum_probs=12.6

Q ss_pred             HHHHHHHHHHHHHHHHHHHhheee
Q psy17635         11 WFIGMLLAIAFLILVLILVCLIKR   34 (103)
Q Consensus        11 WfIg~~~ai~lLllilli~C~i~R   34 (103)
                      ++++++..|+++-+++++.++++-
T Consensus         6 i~~a~~~li~~~Pl~l~iai~i~l   29 (187)
T PF02397_consen    6 IVLALLLLILLSPLFLIIAILIKL   29 (187)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHH
Confidence            344555545555555555555555


No 157
>PLN00168 Cytochrome P450; Provisional
Probab=45.58  E-value=24  Score=29.26  Aligned_cols=26  Identities=38%  Similarity=0.524  Sum_probs=15.9

Q ss_pred             HHHHHHHHHHHHHHHHhheeecCCCc
Q psy17635         14 GMLLAIAFLILVLILVCLIKRNRGGK   39 (103)
Q Consensus        14 g~~~ai~lLllilli~C~i~R~rGgK   39 (103)
                      .+.|+.+||++++++.|...+.++++
T Consensus         6 ~~~~~~~~~~~~~~~~~~~~~~~~~~   31 (519)
T PLN00168          6 LLLLAALLLLPLLLLLLGKHGGRGGK   31 (519)
T ss_pred             HHHHHHHHHHHHHHHhhhhhhccCCC
Confidence            34455666666666677666655664


No 158
>KOG2927|consensus
Probab=45.56  E-value=23  Score=30.07  Aligned_cols=31  Identities=23%  Similarity=0.490  Sum_probs=16.9

Q ss_pred             hhHHHHHHHHHHHHHHH-HHHHhheeecCCCc
Q psy17635          9 AGWFIGMLLAIAFLILV-LILVCLIKRNRGGK   39 (103)
Q Consensus         9 ~gWfIg~~~ai~lLlli-lli~C~i~R~rGgK   39 (103)
                      .|=||+++.++|++=+| ++|++++.+.++|-
T Consensus       225 ~~gfl~~IlvLaIvRlILF~I~~il~~g~~g~  256 (372)
T KOG2927|consen  225 AGGFLAFILVLAIVRLILFGITWILTGGKHGF  256 (372)
T ss_pred             hhHHHHHHHHHHHHHHHHHHHHHHHhCCCCce
Confidence            44467766665555444 34456666655553


No 159
>PHA03273 envelope glycoprotein C; Provisional
Probab=45.29  E-value=14  Score=32.41  Aligned_cols=24  Identities=25%  Similarity=0.216  Sum_probs=12.9

Q ss_pred             HHHHHHHHHHHH-HHHHHHhheeec
Q psy17635         12 FIGMLLAIAFLI-LVLILVCLIKRN   35 (103)
Q Consensus        12 fIg~~~ai~lLl-lilli~C~i~R~   35 (103)
                      +||++|++++++ +|++.++|+.++
T Consensus       450 ivaV~~g~~a~g~~ilitalC~y~s  474 (486)
T PHA03273        450 IIAVTCGAAALALVVLITAVCFYCS  474 (486)
T ss_pred             ehHHHHHHHHHHHHHheEEEEEEec
Confidence            368888866664 434433334333


No 160
>PRK10905 cell division protein DamX; Validated
Probab=45.26  E-value=11  Score=31.52  Aligned_cols=12  Identities=25%  Similarity=0.852  Sum_probs=5.4

Q ss_pred             HHHHHHHHHHHH
Q psy17635         18 AIAFLILVLILV   29 (103)
Q Consensus        18 ai~lLllilli~   29 (103)
                      +|.+|+|+||||
T Consensus         3 GiGilVLlLLIi   14 (328)
T PRK10905          3 GVGILVLLLLII   14 (328)
T ss_pred             chhHHHHHHHHH
Confidence            344444444444


No 161
>PHA03240 envelope glycoprotein M; Provisional
Probab=44.99  E-value=22  Score=28.64  Aligned_cols=16  Identities=25%  Similarity=0.594  Sum_probs=7.2

Q ss_pred             HHHHHHHHHHHHHHhh
Q psy17635         16 LLAIAFLILVLILVCL   31 (103)
Q Consensus        16 ~~ai~lLllilli~C~   31 (103)
                      |+.|+++|+|++++||
T Consensus       216 IiilIIiIiIIIL~cf  231 (258)
T PHA03240        216 IFIAIIIIIVIILFFF  231 (258)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            3333444444444555


No 162
>CHL00177 ccs1 c-type cytochrome biogenensis protein; Validated
Probab=44.92  E-value=18  Score=30.79  Aligned_cols=26  Identities=23%  Similarity=0.677  Sum_probs=17.4

Q ss_pred             hhhhhHHHHHHHHHHHHHHHHHHHhheeec
Q psy17635          6 VATAGWFIGMLLAIAFLILVLILVCLIKRN   35 (103)
Q Consensus         6 ~at~gWfIg~~~ai~lLllilli~C~i~R~   35 (103)
                      ..+..||+++|    +|+.+-|++|.++|.
T Consensus        70 vY~s~WF~~ll----~ll~isL~~Cs~~R~   95 (426)
T CHL00177         70 IYKTWWFLSLL----LLFGLSLLLCTLLQQ   95 (426)
T ss_pred             cccCHHHHHHH----HHHHHHHhheeeccc
Confidence            45667998765    445556777888773


No 163
>PF03302 VSP:  Giardia variant-specific surface protein;  InterPro: IPR005127 During infection, the intestinal protozoan parasite Giardia lamblia virus undergoes continuous antigenic variation which is determined by diversification of the parasite's major surface antigen, named VSP (variant surface protein).
Probab=44.77  E-value=14  Score=30.83  Aligned_cols=12  Identities=17%  Similarity=0.207  Sum_probs=9.5

Q ss_pred             HHHHHhheeecC
Q psy17635         25 VLILVCLIKRNR   36 (103)
Q Consensus        25 illi~C~i~R~r   36 (103)
                      -+|..|||.|-|
T Consensus       385 GfLcWwf~crgk  396 (397)
T PF03302_consen  385 GFLCWWFICRGK  396 (397)
T ss_pred             HHHhhheeeccc
Confidence            377789999976


No 164
>PF11359 gpUL132:  Glycoprotein UL132;  InterPro: IPR021023  Glycoprotein UL132 is a low-abundance structural component of Human herpesvirus 5 []. The function of this protein is not fully understood. 
Probab=44.60  E-value=74  Score=25.53  Aligned_cols=13  Identities=23%  Similarity=0.299  Sum_probs=6.9

Q ss_pred             ecCCCcccchhhh
Q psy17635         34 RNRGGKYAVHERE   46 (103)
Q Consensus        34 R~rGgKY~VkeKE   46 (103)
                      |..|-||.-.+.|
T Consensus        80 ~~pg~~~~f~~de   92 (235)
T PF11359_consen   80 RRPGRLTRFDDDE   92 (235)
T ss_pred             hCCCcccccChhh
Confidence            5556655544444


No 165
>PF00974 Rhabdo_glycop:  Rhabdovirus spike glycoprotein;  InterPro: IPR001903 Different families of ssRNA negative-strand viruses contain glycoproteins responsible for forming spikes on the surface of the virion. The glycoprotein spike is made up of a trimer of glycoproteins. These proteins are frequently abbreviated to G protein. Channel formed by glycoprotein spike is thought to function in a similar manner to Influenza virus M2 protein channel, thus allowing a signal to pass across the viral membrane to signal for viral uncoating [, ].; GO: 0019031 viral envelope; PDB: 2CMZ_C 2J6J_A 3EGD_D.
Probab=44.23  E-value=7.5  Score=33.54  Aligned_cols=13  Identities=8%  Similarity=0.069  Sum_probs=0.0

Q ss_pred             HHHHHHHHHHhhe
Q psy17635         20 AFLILVLILVCLI   32 (103)
Q Consensus        20 ~lLllilli~C~i   32 (103)
                      ++++|+.|+..|.
T Consensus       467 lli~l~~cc~~~~  479 (501)
T PF00974_consen  467 LLILLIRCCCRCR  479 (501)
T ss_dssp             -------------
T ss_pred             HHHHHHHHhhhhc
Confidence            3334443333334


No 166
>KOG1311|consensus
Probab=44.19  E-value=25  Score=27.49  Aligned_cols=28  Identities=14%  Similarity=0.250  Sum_probs=20.9

Q ss_pred             chhhhhHHHHHHHHHHHHHHHHHHHhhe
Q psy17635          5 TVATAGWFIGMLLAIAFLILVLILVCLI   32 (103)
Q Consensus         5 ~~at~gWfIg~~~ai~lLllilli~C~i   32 (103)
                      +-.++.||+.++..++++.+..+.+|++
T Consensus       152 G~rNyr~F~~f~~~~~l~~i~~~~~~~~  179 (299)
T KOG1311|consen  152 GERNYRYFVLFLFYLALGVLLALAFLFY  179 (299)
T ss_pred             CCCchHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3568889998888777777777766554


No 167
>PF05083 LST1:  LST-1 protein;  InterPro: IPR007775 B144/LST1 is a gene encoded in the human major histocompatibility complex that produces multiple forms of alternatively spliced mRNA and encodes peptides fewer than 100 amino acids in length. B144/LST1 is strongly expressed in dendritic cells. Transfection of B144/LST1 into a variety of cells induces morphologic changes including the production of long, thin filopodia []. A possible role in modulating immune responses. Induces morphological changes including production of filopodia and microspikes when overexpressed in a variety of cell types and may be involved in dendritic cell maturation. Isoform 1 and isoform 2 have an inhibitory effect on lymphocyte proliferation [, ]. ; GO: 0000902 cell morphogenesis, 0006955 immune response, 0016020 membrane
Probab=43.74  E-value=6.8  Score=26.26  Aligned_cols=19  Identities=32%  Similarity=0.637  Sum_probs=11.4

Q ss_pred             HHHHHHHHHHHhheeecCC
Q psy17635         19 IAFLILVLILVCLIKRNRG   37 (103)
Q Consensus        19 i~lLllilli~C~i~R~rG   37 (103)
                      +.||.++++.+|+|.-+|+
T Consensus         4 llll~vvll~~clC~lsrR   22 (74)
T PF05083_consen    4 LLLLAVVLLSACLCRLSRR   22 (74)
T ss_pred             hhhHHHHHHHHHHHHHHhh
Confidence            4556666677787654443


No 168
>MTH00102 ATP8 ATP synthase F0 subunit 8; Validated
Probab=43.74  E-value=42  Score=21.51  Aligned_cols=20  Identities=40%  Similarity=0.697  Sum_probs=13.1

Q ss_pred             CCCcchhhhhHHHHHHHHHHHH
Q psy17635          1 MPSDTVATAGWFIGMLLAIAFL   22 (103)
Q Consensus         1 ~~~~~~at~gWfIg~~~ai~lL   22 (103)
                      |||-+  ..-||..++++-+.+
T Consensus         1 MPQLn--P~PWf~il~~tWl~l   20 (67)
T MTH00102          1 MPQLD--TSTWFITILSMLLTL   20 (67)
T ss_pred             CCCCC--CcHHHHHHHHHHHHH
Confidence            55543  447999998884433


No 169
>PF06842 DUF1242:  Protein of unknown function (DUF1242);  InterPro: IPR009653 This family consists of a number of eukaryotic proteins of around 72 residues in length. The function of this family is unknown.
Probab=43.72  E-value=10  Score=22.09  Aligned_cols=11  Identities=36%  Similarity=0.948  Sum_probs=5.4

Q ss_pred             HHHHHHHHhhe
Q psy17635         22 LILVLILVCLI   32 (103)
Q Consensus        22 Lllilli~C~i   32 (103)
                      |++|||++|-+
T Consensus         2 L~ViLL~ICTC   12 (36)
T PF06842_consen    2 LTVILLLICTC   12 (36)
T ss_pred             hhhHHHHHHHh
Confidence            44455555543


No 170
>PF06809 NPDC1:  Neural proliferation differentiation control-1 protein (NPDC1);  InterPro: IPR009635 This family consists of several neural proliferation differentiation control-1 (NPDC1) proteins. NPDC1 plays a role in the control of neural cell proliferation and differentiation. It has been suggested that NPDC1 may be involved in the development of several secretion glands. This family also contains the C-terminal region of the Caenorhabditis elegans protein CAB-1 (Q93249 from SWISSPROT) which is known to interact with AEX-3 [].; GO: 0016021 integral to membrane
Probab=43.48  E-value=25  Score=29.55  Aligned_cols=28  Identities=14%  Similarity=0.154  Sum_probs=20.2

Q ss_pred             HHHHHHHHHHHHHHHHHHhheeecCCCc
Q psy17635         12 FIGMLLAIAFLILVLILVCLIKRNRGGK   39 (103)
Q Consensus        12 fIg~~~ai~lLllilli~C~i~R~rGgK   39 (103)
                      +|+++|+....-+|+..+|++|-+|.-+
T Consensus       202 ~Iv~~cvaG~aAliva~~cW~Rlqr~~r  229 (341)
T PF06809_consen  202 LIVVCCVAGAAALIVAGYCWYRLQREIR  229 (341)
T ss_pred             hhHHHHHHHHHHHHHhhheEEEeccccc
Confidence            5666666555667777789998877766


No 171
>PF04971 Lysis_S:  Lysis protein S ;  InterPro: IPR007054 The lysis S protein is a cytotoxic protein forming holes in membranes causing cell lysis. The action of Lysis S is independent of the proportion of acidic phospholipids in the membrane [].
Probab=43.41  E-value=26  Score=23.14  Aligned_cols=23  Identities=26%  Similarity=0.284  Sum_probs=17.2

Q ss_pred             HHHHHHHHHHHHHHHHHHhheee
Q psy17635         12 FIGMLLAIAFLILVLILVCLIKR   34 (103)
Q Consensus        12 fIg~~~ai~lLllilli~C~i~R   34 (103)
                      .||++..|++.+|..|+=...|+
T Consensus        35 aIGvi~gi~~~~lt~ltN~YFK~   57 (68)
T PF04971_consen   35 AIGVIGGIFFGLLTYLTNLYFKI   57 (68)
T ss_pred             hHHHHHHHHHHHHHHHhHhhhhh
Confidence            58888888888888887555544


No 172
>TIGR03521 GldG gliding-associated putative ABC transporter substrate-binding component GldG. Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldG is a protein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae and Cytophaga hutchinsonii. Knockouts of GldG abolish the gliding phenotype. GldG, along with GldA and GldF are believed to compose an ABC transporter and are observed as an operon. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility.
Probab=43.16  E-value=23  Score=30.59  Aligned_cols=30  Identities=13%  Similarity=0.238  Sum_probs=19.0

Q ss_pred             hhhhHHHHHHHHH-HHHHHHHHHHhheeecC
Q psy17635          7 ATAGWFIGMLLAI-AFLILVLILVCLIKRNR   36 (103)
Q Consensus         7 at~gWfIg~~~ai-~lLllilli~C~i~R~r   36 (103)
                      ....|-+..+..+ ++++++-+++|+.||+|
T Consensus       520 ~~~~~~~~~i~~pp~~~l~~G~~~~~~Rrr~  550 (552)
T TIGR03521       520 DRTTWQLINIGLPILLLLLFGLSFTYIRKRK  550 (552)
T ss_pred             hHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence            4456776666665 55555666677777665


No 173
>KOG3514|consensus
Probab=42.55  E-value=24  Score=34.44  Aligned_cols=41  Identities=37%  Similarity=0.625  Sum_probs=30.2

Q ss_pred             cchhhhhHHHHHHHHHHHHHHHHHHHhheeecC-CCcccchh
Q psy17635          4 DTVATAGWFIGMLLAIAFLILVLILVCLIKRNR-GGKYAVHE   44 (103)
Q Consensus         4 ~~~at~gWfIg~~~ai~lLllilli~C~i~R~r-GgKY~Vke   44 (103)
                      ++..+.|-..|++.|.+|-|||+|-.-.--||| -|+|.|.+
T Consensus      1510 esssttGmVvGIvaAaaLcILilL~am~kyRnrdEGSYhvd~ 1551 (1591)
T KOG3514|consen 1510 ESSSTTGMVVGIVAAAALCILILLYAMYKYRNRDEGSYHVDE 1551 (1591)
T ss_pred             cCCCccchhhHHHHHHHHHHHHHHhhcccccccccCcccccc
Confidence            456788999999999888888876654333433 58998876


No 174
>PHA02844 putative transmembrane protein; Provisional
Probab=42.52  E-value=37  Score=22.84  Aligned_cols=25  Identities=16%  Similarity=0.135  Sum_probs=14.4

Q ss_pred             hhhhhHHHHHHHHHHHHHHHHHHHh
Q psy17635          6 VATAGWFIGMLLAIAFLILVLILVC   30 (103)
Q Consensus         6 ~at~gWfIg~~~ai~lLllilli~C   30 (103)
                      ....+|.+.++..++.+++++++.-
T Consensus        43 ~~~~~~~~~ii~i~~v~~~~~~~fl   67 (75)
T PHA02844         43 VCSSSTKIWILTIIFVVFATFLTFL   67 (75)
T ss_pred             cCChhHHHHHHHHHHHHHHHHHHHH
Confidence            4457888888664444444444433


No 175
>MTH00147 ATP8 ATP synthase F0 subunit 8; Provisional
Probab=42.35  E-value=58  Score=19.77  Aligned_cols=32  Identities=19%  Similarity=0.434  Sum_probs=22.0

Q ss_pred             CCCcchhhhhHHHHHHHHHHHHHHHHHHHhheee
Q psy17635          1 MPSDTVATAGWFIGMLLAIAFLILVLILVCLIKR   34 (103)
Q Consensus         1 ~~~~~~at~gWfIg~~~ai~lLllilli~C~i~R   34 (103)
                      |||-  +---|++..+..++.++++..++.+...
T Consensus         1 MPQm--sPm~W~~l~~~~~~~~~~~~~~~~~~~~   32 (51)
T MTH00147          1 MPQL--SPLNWIFLFILFWSAVIIVSVIIWWSTK   32 (51)
T ss_pred             CCCC--cchHHHHHHHHHHHHHHHHHHHHHHhcC
Confidence            4553  3357999988887777777777766543


No 176
>COG3115 ZipA Cell division protein [Cell division and chromosome partitioning]
Probab=42.07  E-value=36  Score=28.45  Aligned_cols=23  Identities=26%  Similarity=0.349  Sum_probs=11.5

Q ss_pred             HHHHHHHHHHHHhheeecCCCccc
Q psy17635         18 AIAFLILVLILVCLIKRNRGGKYA   41 (103)
Q Consensus        18 ai~lLllilli~C~i~R~rGgKY~   41 (103)
                      |||++.|++--+| ..|.++++|=
T Consensus        13 ~IAIiaLLvhGlW-tsRkE~s~~F   35 (324)
T COG3115          13 AIAIIALLVHGLW-TSRKERSSYF   35 (324)
T ss_pred             HHHHHHHHHhhhh-hcchhhcccc
Confidence            3443333333334 5566777774


No 177
>PHA02819 hypothetical protein; Provisional
Probab=41.70  E-value=36  Score=22.68  Aligned_cols=22  Identities=14%  Similarity=0.315  Sum_probs=13.9

Q ss_pred             cchhhhhHHHHHHHHHHHHHHH
Q psy17635          4 DTVATAGWFIGMLLAIAFLILV   25 (103)
Q Consensus         4 ~~~at~gWfIg~~~ai~lLlli   25 (103)
                      .+.....|+|.++.++.+++++
T Consensus        42 ~~~~~~~~ii~l~~~~~~~~~~   63 (71)
T PHA02819         42 KSFLRYYLIIGLVTIVFVIIFI   63 (71)
T ss_pred             CChhHHHHHHHHHHHHHHHHHH
Confidence            3577888998866554444433


No 178
>TIGR02588 conserved hypothetical protein TIGR02588. The function of this protein is unknown. It is always found as part of a two-gene operon with TIGR02587, a protein that appears to span the membrane seven times. It is found in Nostoc sp. PCC 7120, Agrobacterium tumefaciens, Sinorhizobium meliloti, and Gloeobacter violaceus, so far, all of which are bacterial.
Probab=41.31  E-value=36  Score=24.72  Aligned_cols=33  Identities=12%  Similarity=0.176  Sum_probs=21.3

Q ss_pred             hhHHHHHHHHHHHHHHHHHHH--hheeecCCCccc
Q psy17635          9 AGWFIGMLLAIAFLILVLILV--CLIKRNRGGKYA   41 (103)
Q Consensus         9 ~gWfIg~~~ai~lLllilli~--C~i~R~rGgKY~   41 (103)
                      .=|+-+.+++++||.++.|++  ++..+++--...
T Consensus         3 ~EWvt~~Is~~ill~viglv~y~~l~~~~~pp~l~   37 (122)
T TIGR02588         3 AEWVTFGISTLILAAMFGLVAYDWLRYSNKAAVLE   37 (122)
T ss_pred             HHhhhHHHHHHHHHHHHHHHHHHhhccCCCCCeEE
Confidence            459988888888888887775  333444333333


No 179
>PHA02692 hypothetical protein; Provisional
Probab=41.30  E-value=34  Score=22.75  Aligned_cols=15  Identities=20%  Similarity=0.193  Sum_probs=10.0

Q ss_pred             cchhhhhHHHH-HHHH
Q psy17635          4 DTVATAGWFIG-MLLA   18 (103)
Q Consensus         4 ~~~at~gWfIg-~~~a   18 (103)
                      .+.....|+|. ++.+
T Consensus        41 ~~~~~~~~ii~~~~~~   56 (70)
T PHA02692         41 KGVPWTTVFLIGLIAA   56 (70)
T ss_pred             CCcchHHHHHHHHHHH
Confidence            35677888888 4444


No 180
>PF14798 Ca_hom_mod:  Calcium homeostasis modulator
Probab=40.99  E-value=34  Score=27.17  Aligned_cols=27  Identities=19%  Similarity=0.469  Sum_probs=18.6

Q ss_pred             hhhhhHHHHHHHHHHHHHHHHHHHhhe
Q psy17635          6 VATAGWFIGMLLAIAFLILVLILVCLI   32 (103)
Q Consensus         6 ~at~gWfIg~~~ai~lLllilli~C~i   32 (103)
                      ....||++.++.+++.+|...+.-|+-
T Consensus       178 SQ~lGW~LI~~~~i~a~l~~c~~rC~S  204 (251)
T PF14798_consen  178 SQVLGWILIALVIILAFLVTCLRRCFS  204 (251)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHhC
Confidence            345699888777777766666666654


No 181
>PF03918 CcmH:  Cytochrome C biogenesis protein;  InterPro: IPR005616 Members of this family include NrfF, CcmH, CycL, Ccl2.; PDB: 2KW0_A 2HL7_A.
Probab=40.83  E-value=9.5  Score=27.96  Aligned_cols=36  Identities=14%  Similarity=0.153  Sum_probs=0.5

Q ss_pred             cchhhhhHHHHHHHHHHHHHHHHHHHhheeecCCCc
Q psy17635          4 DTVATAGWFIGMLLAIAFLILVLILVCLIKRNRGGK   39 (103)
Q Consensus         4 ~~~at~gWfIg~~~ai~lLllilli~C~i~R~rGgK   39 (103)
                      +++...+|+..++=++++++.+++++-.++|.+...
T Consensus        95 Pp~~~~~~~lW~~P~~~l~~g~~~~~~~~rr~~~~~  130 (148)
T PF03918_consen   95 PPFKGFTWLLWLGPFLLLLLGGALLFRRLRRWRRRA  130 (148)
T ss_dssp             --S---------------------------------
T ss_pred             CCCCccHHHHHHHHHHHHHHHHHHHHHHHHhcccCC
Confidence            345566788888888777777777777777665544


No 182
>PRK03427 cell division protein ZipA; Provisional
Probab=40.65  E-value=42  Score=28.09  Aligned_cols=30  Identities=17%  Similarity=0.255  Sum_probs=20.4

Q ss_pred             HHHHHHHHHHHHHHHHHHHhheeecCCCcc
Q psy17635         11 WFIGMLLAIAFLILVLILVCLIKRNRGGKY   40 (103)
Q Consensus        11 WfIg~~~ai~lLllilli~C~i~R~rGgKY   40 (103)
                      -++.++.|||++-|++=-+|=+|+.|.--|
T Consensus         7 LiLivvGAIAIiAlL~HGlWtsRKers~~f   36 (333)
T PRK03427          7 LILIIVGAIAIIALLVHGFWTSRKERSSMF   36 (333)
T ss_pred             hHHHHHHHHHHHHHHHHhhhhccccccchh
Confidence            345566677766666666788888876555


No 183
>TIGR01195 oadG_fam sodium pump decarboxylases, gamma subunit. Most sequences scoring between the noise and trusted cutoffs are eukaryotic sodium channel proteins.
Probab=40.62  E-value=50  Score=21.86  Aligned_cols=12  Identities=42%  Similarity=0.780  Sum_probs=5.7

Q ss_pred             HHHHHHHHHHHh
Q psy17635         19 IAFLILVLILVC   30 (103)
Q Consensus        19 i~lLllilli~C   30 (103)
                      ++|+.|++||+|
T Consensus        17 ~VF~fL~lLi~~   28 (82)
T TIGR01195        17 IVFLFLSLLIYA   28 (82)
T ss_pred             HHHHHHHHHHHH
Confidence            444444544443


No 184
>PF02124 Marek_A:  Marek's disease glycoprotein A;  InterPro: IPR001038  Equid herpesvirus 1 (Equine herpesvirus 1, EHV-1) glycoprotein 13 (EHV-1 gp13) has the characteristic features of a membrane-spanning protein: an N-terminal signal sequence; a hydrophobic membrane anchor region; a charged C-terminal cytoplasmic tail; and an exterior domain with nine potential N-glycosylation sites []. EHV-1 gp13 is the structural homologue of the gC-like glycoproteins of the Human herpesvirus 1 (HHV-1) and Human herpesvirus 2 (HHV-2) (gC-1 and gC-2 respectively), Pseudorabies virus (strain Indiana-Funkhauser/Becker) (PRV) (gIII) and Human herpesvirus 3 (HHV-3) (gp66).  Secretory glycoprotein GP57-65 precursor (glycoprotein A - GA) is similar to Herpesvirus glycoprotein C, and belongs to the immunoglobulin gene superfamily [, ]. GA is thought to play an immunoevasive role in the pathogenesis of Marek's disease. It is a candidate for causing the early-stage immunosuppression that occurs after MDHV infection.
Probab=40.14  E-value=26  Score=27.51  Aligned_cols=18  Identities=22%  Similarity=0.467  Sum_probs=11.2

Q ss_pred             HHHHHHHHHHHHHHHHHH
Q psy17635         12 FIGMLLAIAFLILVLILV   29 (103)
Q Consensus        12 fIg~~~ai~lLllilli~   29 (103)
                      +|+++++++++.++++|.
T Consensus       188 i~~Vi~g~~~~g~~~~i~  205 (211)
T PF02124_consen  188 IIGVICGAGALGLLVLIA  205 (211)
T ss_pred             hHHHHHHHHHHHHHHhhE
Confidence            467777766666555543


No 185
>PF07782 DC_STAMP:  DC-STAMP-like protein;  InterPro: IPR012858 This group of sequences is similar to a region of the dendritic cell-specific transmembrane protein (DC-STAMP, Q9H295 from SWISSPROT). This is thought to be a novel receptor protein that shares no identity with other multimembrane-spanning proteins []. It is thought to have seven putative transmembrane regions [], two of which are found in the region featured in this family. DC-STAMP is also described as having potential N-linked glycosylation sites and a potential phosphorylation site for PKC [], but these are not conserved. ; GO: 0016021 integral to membrane
Probab=39.76  E-value=39  Score=25.05  Aligned_cols=25  Identities=28%  Similarity=0.442  Sum_probs=16.0

Q ss_pred             CCcchhhhhHHHHHHHHHHHHHHHH
Q psy17635          2 PSDTVATAGWFIGMLLAIAFLILVL   26 (103)
Q Consensus         2 ~~~~~at~gWfIg~~~ai~lLllil   26 (103)
                      |........|-|+++++++++++++
T Consensus       139 P~~p~~~~~~~i~~l~~l~~ll~~l  163 (191)
T PF07782_consen  139 PSPPDYSVYIQIGLLYLLLWLLVLL  163 (191)
T ss_pred             CcCCChHHHHHHHHHHHHHHHHHHH
Confidence            3344566677788887766665553


No 186
>MTH00072 ATP8 ATP synthase F0 subunit 8; Provisional
Probab=39.58  E-value=51  Score=19.55  Aligned_cols=24  Identities=29%  Similarity=0.641  Sum_probs=15.7

Q ss_pred             CCCcchhhhhHHHHHHHHHHHHHHHH
Q psy17635          1 MPSDTVATAGWFIGMLLAIAFLILVL   26 (103)
Q Consensus         1 ~~~~~~at~gWfIg~~~ai~lLllil   26 (103)
                      |||-  +..-||..+++..+.+++++
T Consensus         1 MPQL--~P~pW~~~~~~~W~~ll~~~   24 (54)
T MTH00072          1 MPQL--NPGPWFAIFLMSWLIFLLIL   24 (54)
T ss_pred             CCCC--CchHHHHHHHHHHHHHHHHH
Confidence            4553  34579999998866655444


No 187
>TIGR03503 conserved hypothetical protein TIGR03503. This set of conserved hypothetical protein has a phylogenetic range that closely matches that of TIGR03501, a putative C-terminal protein targeting signal.
Probab=39.49  E-value=14  Score=31.28  Aligned_cols=28  Identities=18%  Similarity=0.258  Sum_probs=12.9

Q ss_pred             hhhhHHHHHHHHHHHHHHHHHHHhheee
Q psy17635          7 ATAGWFIGMLLAIAFLILVLILVCLIKR   34 (103)
Q Consensus         7 at~gWfIg~~~ai~lLllilli~C~i~R   34 (103)
                      ....|+|.++.-+++|||.++.+.+.+|
T Consensus       344 ~~~~~~~ii~~N~v~lllg~~~~~~~rk  371 (374)
T TIGR03503       344 DSKFWIIIIVGNVVILLLGGIGFFVWRK  371 (374)
T ss_pred             hhhhhHhhHhhhhhhhhhheeeEEEEEE
Confidence            3446644544445555554433333333


No 188
>PF00429 TLV_coat:  ENV polyprotein (coat polyprotein);  InterPro: IPR018154 Enveloped viruses such as Human immunodeficiency virus 1, influenza virus, and Ebola virus sp. express a surface glycoprotein that mediates both cell attachment and fusion of viral and cellular membranes. The ENV polyprotein (coat polyprotein) usually contains two coat proteins which differ depending on the source. The structure of a number of the ENV polyprotein domains have been determined:    The crystal structure of an extraviral segment of the Moloney murine leukemia virus (MoMuLV) transmembrane (TM) subunit has been determined to 1.7-A resolution. This segment contains a trimeric coiled coil, with a hydrophobic cluster at its base and a strand that packs in an antiparallel orientation against the coiled coil. This structure serves as a model for a wide range of viral fusion proteins; key residues in this structure are conserved among C- and D-type retroviruses and the filovirus ebola [].   An essential step in retrovirus infection is the binding of the virus to its receptor on a target cell. The structure of the receptor-binding domain of the envelope glycoprotein from Friend murine leukemia virus (F-MuLV) has been determined determined to 2.0-A resolution. The core of the domain is an antiparallel beta sandwich, with two interstrand loops forming a helical subdomain atop the sandwich. The residues in the helical region, but not in the beta sandwich, are highly variable among mammalian C-type retroviruses with distinct tropisms, indicating that the helical subdomain determines the receptor specificity of the virus []. ; PDB: 1LCS_B 1MOF_A 1XNL_A 2XZ3_A 1AOL_A 1Y4M_C.
Probab=38.79  E-value=10  Score=33.22  Aligned_cols=10  Identities=20%  Similarity=0.528  Sum_probs=0.0

Q ss_pred             hhhhHHHHHH
Q psy17635          7 ATAGWFIGML   16 (103)
Q Consensus         7 at~gWfIg~~   16 (103)
                      ..-.||..+|
T Consensus       523 ~~~~w~~~ll  532 (561)
T PF00429_consen  523 GWSPWLKPLL  532 (561)
T ss_dssp             ----------
T ss_pred             ccchHHHHHH
Confidence            3446887777


No 189
>PF11346 DUF3149:  Protein of unknown function (DUF3149);  InterPro: IPR021494  This bacterial family of proteins has no known function. 
Probab=38.64  E-value=33  Score=20.49  Aligned_cols=17  Identities=18%  Similarity=0.370  Sum_probs=9.0

Q ss_pred             HHHHHHHHHHHHHHHHH
Q psy17635         13 IGMLLAIAFLILVLILV   29 (103)
Q Consensus        13 Ig~~~ai~lLllilli~   29 (103)
                      +|+|+.++.+..+.+++
T Consensus        11 vGL~Sl~vI~~~igm~~   27 (42)
T PF11346_consen   11 VGLMSLIVIVFTIGMGV   27 (42)
T ss_pred             HHHHHHHHHHHHHHHHH
Confidence            56666655554444443


No 190
>PHA03269 envelope glycoprotein C; Provisional
Probab=38.47  E-value=24  Score=31.43  Aligned_cols=18  Identities=28%  Similarity=0.567  Sum_probs=12.1

Q ss_pred             HHHHHHHHHHHHHHHHHH
Q psy17635         12 FIGMLLAIAFLILVLILV   29 (103)
Q Consensus        12 fIg~~~ai~lLllilli~   29 (103)
                      +||++|+++++.+++||.
T Consensus       541 IigV~~~~~algliilIt  558 (566)
T PHA03269        541 IIGIISIIGILGLIALIA  558 (566)
T ss_pred             ehHHHHHHHHHHHHHhhe
Confidence            468888877766655553


No 191
>PF14927 Neurensin:  Neurensin
Probab=38.46  E-value=49  Score=24.41  Aligned_cols=24  Identities=17%  Similarity=0.544  Sum_probs=17.6

Q ss_pred             hhhhhHHHHHHHHHHHHHHHHHHH
Q psy17635          6 VATAGWFIGMLLAIAFLILVLILV   29 (103)
Q Consensus         6 ~at~gWfIg~~~ai~lLllilli~   29 (103)
                      .....|=++++++++||++-+..+
T Consensus        39 w~s~~wkV~~i~g~l~Ll~Gi~~l   62 (140)
T PF14927_consen   39 WSSVCWKVGFISGLLLLLLGIVAL   62 (140)
T ss_pred             CcchhHHHHHHHHHHHHHHHHHHH
Confidence            456678899999988888654443


No 192
>PF14884 EFF-AFF:  Type I membrane glycoproteins cell-cell fusogen
Probab=38.45  E-value=25  Score=31.52  Aligned_cols=24  Identities=25%  Similarity=0.571  Sum_probs=15.3

Q ss_pred             hHHHHHHHH--HHHHHHHHHHHhhee
Q psy17635         10 GWFIGMLLA--IAFLILVLILVCLIK   33 (103)
Q Consensus        10 gWfIg~~~a--i~lLllilli~C~i~   33 (103)
                      +|+=+++++  ||+.+++++|+|++.
T Consensus       557 s~~e~v~~~~dIv~~i~~~~il~~l~  582 (589)
T PF14884_consen  557 SWTEAVMMALDIVLYIGLLFILYLLI  582 (589)
T ss_pred             cHHHHHHHHHHHHHHhhhhhhhhhhh
Confidence            566666776  666666666666653


No 193
>PRK06569 F0F1 ATP synthase subunit B'; Validated
Probab=38.14  E-value=49  Score=24.67  Aligned_cols=31  Identities=26%  Similarity=0.346  Sum_probs=15.4

Q ss_pred             CCCcchhhhhHHHHHHHHHHHHHHHHHHHhhe
Q psy17635          1 MPSDTVATAGWFIGMLLAIAFLILVLILVCLI   32 (103)
Q Consensus         1 ~~~~~~at~gWfIg~~~ai~lLllilli~C~i   32 (103)
                      |||-...+- |--.+..+|.|++|++++-+|+
T Consensus         1 mPQfd~~~~-~sqifw~iI~FlILy~ll~kf~   31 (155)
T PRK06569          1 MPQFDIATY-YSQIFWLIVTFGLLYIFVYKFI   31 (155)
T ss_pred             CCCCchhhh-hHHHHHHHHHHHHHHHHHHHHh
Confidence            677665532 2222334455666665555544


No 194
>PF04156 IncA:  IncA protein;  InterPro: IPR007285 Chlamydia trachomatis is an obligate intracellular bacterium that develops within a parasitophorous vacuole termed an inclusion. The inclusion is nonfusogenic with lysosomes but intercepts lipids from a host cell exocytic pathway. Initiation of chlamydial development is concurrent with modification of the inclusion membrane by a set of C. trachomatis-encoded proteins collectively designated Incs. One of these Incs, IncA (Inclusion membrane protein A), is functionally associated with the homotypic fusion of inclusions [].
Probab=38.14  E-value=40  Score=24.43  Aligned_cols=26  Identities=31%  Similarity=0.529  Sum_probs=12.9

Q ss_pred             HHHHHHHHHHHHHHHhheeecCCCcc
Q psy17635         15 MLLAIAFLILVLILVCLIKRNRGGKY   40 (103)
Q Consensus        15 ~~~ai~lLllilli~C~i~R~rGgKY   40 (103)
                      .++|+.++++++-+.||+++.+-..+
T Consensus        44 ~~lAlg~vL~~~g~~~~~~~~~~~~~   69 (191)
T PF04156_consen   44 ALLALGVVLLSLGLLCLLSKRPVQSV   69 (191)
T ss_pred             HHHHHHHHHHHHHHHHHHHccccccc
Confidence            34444455555555566655544443


No 195
>KOG1226|consensus
Probab=38.12  E-value=32  Score=31.90  Aligned_cols=37  Identities=22%  Similarity=0.415  Sum_probs=21.0

Q ss_pred             HHHHHHHHHHHHHHHHHHhhe--eecCCCcccchhhhhh
Q psy17635         12 FIGMLLAIAFLILVLILVCLI--KRNRGGKYAVHEREAA   48 (103)
Q Consensus        12 fIg~~~ai~lLllilli~C~i--~R~rGgKY~VkeKE~~   48 (103)
                      .++++.+|+|+.|+||++|-+  ..+--.-|+==+||+.
T Consensus       717 ~lgvv~~ivligl~llliwkll~~~~DrrE~akFe~er~  755 (783)
T KOG1226|consen  717 VLGVVAGIVLIGLALLLIWKLLTTIHDRREFAKFEKERL  755 (783)
T ss_pred             hHHHHHHHHHHHHHHHHHHHHhheecccHHhhhhhHHHH
Confidence            456677787777777777532  3222334554455543


No 196
>PF10164 DUF2367:  Uncharacterized conserved protein (DUF2367);  InterPro: IPR019317  This is a highly conserved set of proteins which contains three pairs of cysteine residues within a length of 42 amino acids and is rich in proline residues towards the N terminus. It includes a membrane protein that has been found to be highly expressed in the mouse brain and consequently, several members have been assigned as brain protein i3 (Bri3). Their function is unknown.
Probab=37.94  E-value=40  Score=23.74  Aligned_cols=22  Identities=36%  Similarity=0.682  Sum_probs=10.6

Q ss_pred             HHHHHHHHHHHHHHHHHhheeec
Q psy17635         13 IGMLLAIAFLILVLILVCLIKRN   35 (103)
Q Consensus        13 Ig~~~ai~lLllilli~C~i~R~   35 (103)
                      .|+++||.+.=|-+| .||..|.
T Consensus        67 ~Gi~~AI~fFPiGil-cc~~~r~   88 (98)
T PF10164_consen   67 CGILCAIFFFPIGIL-CCLAMRE   88 (98)
T ss_pred             HHHHHHHHHHhhHHH-HhhhcCc
Confidence            355556555444332 4555443


No 197
>PHA03054 IMV membrane protein; Provisional
Probab=37.94  E-value=46  Score=22.24  Aligned_cols=20  Identities=15%  Similarity=0.149  Sum_probs=12.9

Q ss_pred             chhhhhHHHHHHHHHHHHHH
Q psy17635          5 TVATAGWFIGMLLAIAFLIL   24 (103)
Q Consensus         5 ~~at~gWfIg~~~ai~lLll   24 (103)
                      +.....|+|.++.++.++++
T Consensus        45 ~~~~~~~ii~l~~v~~~~l~   64 (72)
T PHA03054         45 CWGWYWLIIIFFIVLILLLL   64 (72)
T ss_pred             CchHHHHHHHHHHHHHHHHH
Confidence            56788899886555444433


No 198
>PHA02650 hypothetical protein; Provisional
Probab=37.49  E-value=40  Score=23.01  Aligned_cols=24  Identities=17%  Similarity=0.113  Sum_probs=13.2

Q ss_pred             hhhhhHHHHHHHHHHHHHHHHHHH
Q psy17635          6 VATAGWFIGMLLAIAFLILVLILV   29 (103)
Q Consensus         6 ~at~gWfIg~~~ai~lLllilli~   29 (103)
                      ....+|.+.++..++++++++++.
T Consensus        44 ~~~~~~~~~ii~i~~v~i~~l~~f   67 (81)
T PHA02650         44 VSWFNGQNFIFLIFSLIIVALFSF   67 (81)
T ss_pred             cCCchHHHHHHHHHHHHHHHHHHH
Confidence            345677777766444444444443


No 199
>PRK13453 F0F1 ATP synthase subunit B; Provisional
Probab=37.44  E-value=40  Score=24.64  Aligned_cols=22  Identities=18%  Similarity=0.495  Sum_probs=13.2

Q ss_pred             HHHHHHHHHHHHHHHHHHHhhe
Q psy17635         11 WFIGMLLAIAFLILVLILVCLI   32 (103)
Q Consensus        11 WfIg~~~ai~lLllilli~C~i   32 (103)
                      |...++-+|.|++|++++-.|+
T Consensus        18 ~~t~~~~iInFliL~~lL~~~l   39 (173)
T PRK13453         18 WGTVIVTVLTFIVLLALLKKFA   39 (173)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHH
Confidence            6555556666666666655444


No 200
>PF03597 CcoS:  Cytochrome oxidase maturation protein cbb3-type;  InterPro: IPR004714 Cytochrome cbb3 oxidases are found almost exclusively in Proteobacteria, and represent a distinctive class of proton-pumping respiratory haem-copper oxidases (HCO) that lack many of the key structural features that contribute to the reaction cycle of the intensely studied mitochondrial cytochrome c oxidase (CcO). Expression of cytochrome cbb3 oxidase allows human pathogens to colonise anoxic tissues and agronomically important diazotrophs to sustain nitrogen fixation []. Genes encoding a cytochrome cbb3 oxidase were initially designated fixNOQP (ccoNOQP), the ccoNOQP operon is always found close to a second gene cluster, known as fixGHIS (ccoGHIS) whose expression is necessary for the assembly of a functional cbb3 oxidase. On the basis of their derived amino acid sequences each of the four proteins encoded by the ccoGHIS operon are thought to be membrane-bound. It has been suggested that they may function in concert as a multi-subunit complex, possibly playing a role in the uptake and metabolism of copper required for the assembly of the binuclear centre of cytochrome cbb3 oxidase. 
Probab=37.41  E-value=32  Score=20.65  Aligned_cols=28  Identities=11%  Similarity=0.465  Sum_probs=12.9

Q ss_pred             HHHHHHHHHHHHHHHHHhheeecCCCcc
Q psy17635         13 IGMLLAIAFLILVLILVCLIKRNRGGKY   40 (103)
Q Consensus        13 Ig~~~ai~lLllilli~C~i~R~rGgKY   40 (103)
                      |.+|..+.+++.++.++.|+---|.|.|
T Consensus         3 l~~lip~sl~l~~~~l~~f~Wavk~GQf   30 (45)
T PF03597_consen    3 LYILIPVSLILGLIALAAFLWAVKSGQF   30 (45)
T ss_pred             hhHHHHHHHHHHHHHHHHHHHHHccCCC
Confidence            3444444444444444444433445666


No 201
>PF03733 DUF307:  Domain of unknown function (DUF307);  InterPro: IPR005185 This proteins contain a domain which occurs as one or more copies in a small family of putative membrane proteins.
Probab=37.34  E-value=56  Score=20.05  Aligned_cols=20  Identities=15%  Similarity=0.637  Sum_probs=10.6

Q ss_pred             hhHHHHHHHHHHHHHHHHHH
Q psy17635          9 AGWFIGMLLAIAFLILVLIL   28 (103)
Q Consensus         9 ~gWfIg~~~ai~lLllilli   28 (103)
                      .||.+++++.++-+++.+.|
T Consensus        11 ~G~~lal~~~~~~~~~~iti   30 (53)
T PF03733_consen   11 FGWWLALIWLLAGILCCITI   30 (53)
T ss_pred             HHHHHHHHHHHHHHHHHHHH
Confidence            46666666665544444333


No 202
>PF09435 DUF2015:  Fungal protein of unknown function (DUF2015);  InterPro: IPR018559  This entry represents uncharacterised proteins found in fungi. 
Probab=37.09  E-value=18  Score=26.49  Aligned_cols=26  Identities=19%  Similarity=0.418  Sum_probs=16.2

Q ss_pred             HHHHHHHHHHHHHHHhheeecCCCcc
Q psy17635         15 MLLAIAFLILVLILVCLIKRNRGGKY   40 (103)
Q Consensus        15 ~~~ai~lLllilli~C~i~R~rGgKY   40 (103)
                      ++-.++++++|+.+++|..|+|=-.+
T Consensus         4 lly~~~~~~~i~~t~lf~~R~r~~~~   29 (128)
T PF09435_consen    4 LLYSLTFFVLIIGTLLFFTRHRWLPL   29 (128)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence            44455666666666777777765444


No 203
>PF04226 Transgly_assoc:  Transglycosylase associated protein;  InterPro: IPR007341 This bacterial protein is predicted to be an integral membrane protein. Some family members have been annotated as transglycosylase-associated proteins, but no experimental evidence is provided. This family was annotated based on the information in P76011 from SWISSPROT.; GO: 0016021 integral to membrane
Probab=37.04  E-value=65  Score=19.23  Aligned_cols=19  Identities=16%  Similarity=0.370  Sum_probs=11.7

Q ss_pred             hhhhHHHHHHHHHHHHHHH
Q psy17635          7 ATAGWFIGMLLAIAFLILV   25 (103)
Q Consensus         7 at~gWfIg~~~ai~lLlli   25 (103)
                      .-.+|+.+++.|+++|.+.
T Consensus        25 ~~~~~i~aviGAiill~i~   43 (48)
T PF04226_consen   25 SWGSFIVAVIGAIILLFIY   43 (48)
T ss_pred             hHHHHHHHHHHHHHHHHHH
Confidence            4456677777776665544


No 204
>KOG3054|consensus
Probab=37.04  E-value=45  Score=27.45  Aligned_cols=35  Identities=26%  Similarity=0.421  Sum_probs=17.5

Q ss_pred             HHHHHHHHHHHHHHHHHHhheeecCCCcccchhhh
Q psy17635         12 FIGMLLAIAFLILVLILVCLIKRNRGGKYAVHERE   46 (103)
Q Consensus        12 fIg~~~ai~lLllilli~C~i~R~rGgKY~VkeKE   46 (103)
                      +++++++.+||+.++++...-+|+-.-.=+|.-.|
T Consensus         4 ~v~vlVaa~llV~~i~l~l~~r~raA~~~~l~a~~   38 (299)
T KOG3054|consen    4 IVAVLVAAALLVAVILLFLWKRRRAARTDMLLARE   38 (299)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHhhcccccchhhhhh
Confidence            35666666666655444443344433344444444


No 205
>COG1314 SecG Preprotein translocase subunit SecG [Intracellular trafficking and secretion]
Probab=36.99  E-value=57  Score=22.07  Aligned_cols=13  Identities=38%  Similarity=0.917  Sum_probs=7.8

Q ss_pred             HHHHhheeecCCC
Q psy17635         26 LILVCLIKRNRGG   38 (103)
Q Consensus        26 lli~C~i~R~rGg   38 (103)
                      ++++.++.|.||+
T Consensus        16 LI~~VLlQ~~kg~   28 (86)
T COG1314          16 LIILVLLQRGKGA   28 (86)
T ss_pred             HHHheeeecCCCC
Confidence            3344566788754


No 206
>PF14584 DUF4446:  Protein of unknown function (DUF4446)
Probab=36.92  E-value=30  Score=25.47  Aligned_cols=15  Identities=13%  Similarity=0.470  Sum_probs=6.5

Q ss_pred             HHHHHHHHHHHHHHH
Q psy17635         14 GMLLAIAFLILVLIL   28 (103)
Q Consensus        14 g~~~ai~lLllilli   28 (103)
                      +++++|+|++++.+.
T Consensus         8 ~~l~iilli~~~~~~   22 (151)
T PF14584_consen    8 LVLVIILLILIIILN   22 (151)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            344444444444444


No 207
>PF13106 DUF3961:  Domain of unknown function (DUF3961)
Probab=36.70  E-value=16  Score=21.87  Aligned_cols=23  Identities=26%  Similarity=0.473  Sum_probs=16.3

Q ss_pred             chhhhhHHHHHHHHHHHHHHHHH
Q psy17635          5 TVATAGWFIGMLLAIAFLILVLI   27 (103)
Q Consensus         5 ~~at~gWfIg~~~ai~lLllill   27 (103)
                      ..+.|-||.|.....++++++..
T Consensus        12 ~~sdqIWFYG~~~ls~~m~~~ty   34 (40)
T PF13106_consen   12 CKSDQIWFYGFFGLSIFMILFTY   34 (40)
T ss_pred             cccccEEEeeHHHHHHHHHHHHH
Confidence            45678999999887666555543


No 208
>KOG3653|consensus
Probab=36.43  E-value=32  Score=30.56  Aligned_cols=31  Identities=23%  Similarity=0.221  Sum_probs=12.7

Q ss_pred             hHHHHHHHH-HHHHHHHHHHHhheeecCCCcc
Q psy17635         10 GWFIGMLLA-IAFLILVLILVCLIKRNRGGKY   40 (103)
Q Consensus        10 gWfIg~~~a-i~lLllilli~C~i~R~rGgKY   40 (103)
                      .|.+.++.. +.+|++|++.+|+.-|.+-.+|
T Consensus       154 ~~~al~~~~~v~~l~~lvi~~~~~~r~~k~~~  185 (534)
T KOG3653|consen  154 LIYALIPLLLVSLLAALVILAFLGYRQRKNAR  185 (534)
T ss_pred             ehhhHHHHHHHHHHHHHHHHHHHHHHHhhccc
Confidence            344433333 3333333333444445555544


No 209
>PRK04989 psbM photosystem II reaction center protein M; Provisional
Probab=36.10  E-value=77  Score=18.47  Aligned_cols=19  Identities=26%  Similarity=0.492  Sum_probs=9.0

Q ss_pred             HHHHHHHHHHHHHHHHHHh
Q psy17635         12 FIGMLLAIAFLILVLILVC   30 (103)
Q Consensus        12 fIg~~~ai~lLllilli~C   30 (103)
                      ||+.+..|.+=...|+|+.
T Consensus         8 fiAt~Lfi~iPt~FLlilY   26 (35)
T PRK04989          8 FVASLLFVLVPTVFLIILY   26 (35)
T ss_pred             HHHHHHHHHHHHHHHHHHh
Confidence            4555554444444444444


No 210
>PF09930 DUF2162:  Predicted transporter (DUF2162);  InterPro: IPR017199 This group represents a predicted membrane transporter, MTH672 type.
Probab=36.07  E-value=32  Score=26.97  Aligned_cols=36  Identities=28%  Similarity=0.722  Sum_probs=26.7

Q ss_pred             hhhhhHHHHHHHHHHHHHHHHHHHhheeecCCCcccc
Q psy17635          6 VATAGWFIGMLLAIAFLILVLILVCLIKRNRGGKYAV   42 (103)
Q Consensus         6 ~at~gWfIg~~~ai~lLllilli~C~i~R~rGgKY~V   42 (103)
                      +.-.+|-+|+..++++.+++++.-.+.| ..+.+||+
T Consensus       126 ~~~s~~~ig~~~g~if~i~il~ss~i~r-~~~~~~p~  161 (224)
T PF09930_consen  126 IGLSGWEIGLVLGLIFFILILLSSFIFR-RLKKPYPI  161 (224)
T ss_pred             cCchHHHHHHHHHHHHHHHHHHHHHHHH-HhcCCCch
Confidence            3446899999999999999966655554 55667763


No 211
>PF13624 SurA_N_3:  SurA N-terminal domain; PDB: 3NRK_A.
Probab=35.86  E-value=12  Score=25.80  Aligned_cols=21  Identities=24%  Similarity=0.434  Sum_probs=0.0

Q ss_pred             hhhhhHHHHHHHHHHHHHHHH
Q psy17635          6 VATAGWFIGMLLAIAFLILVL   26 (103)
Q Consensus         6 ~at~gWfIg~~~ai~lLllil   26 (103)
                      +.+..|.++++.+++++++++
T Consensus         5 ~r~~~~~~~~~~~iv~~~~~l   25 (154)
T PF13624_consen    5 IRKNSKVFKILIGIVLAIFVL   25 (154)
T ss_dssp             ---------------------
T ss_pred             cccchhhhhhhHHHHHHHHHH
Confidence            344457666555555444443


No 212
>PF12259 DUF3609:  Protein of unknown function (DUF3609);  InterPro: IPR022048  This domain family is found in eukaryotes and viruses, and is typically between 348 and 360 amino acids in length. 
Probab=35.70  E-value=19  Score=29.97  Aligned_cols=22  Identities=27%  Similarity=0.440  Sum_probs=13.0

Q ss_pred             HHHHHHHHHHHHHHHHHhheee
Q psy17635         13 IGMLLAIAFLILVLILVCLIKR   34 (103)
Q Consensus        13 Ig~~~ai~lLllilli~C~i~R   34 (103)
                      |++..+++|.++++.++|++++
T Consensus       301 i~v~~~~vli~vl~~~~~~~~~  322 (361)
T PF12259_consen  301 IAVCGAIVLIIVLISLAWLYRT  322 (361)
T ss_pred             EehhHHHHHHHHHHHHHhheee
Confidence            4455555555555566677765


No 213
>MTH00171 ATP8 ATP synthase F0 subunit 8; Provisional
Probab=35.64  E-value=74  Score=18.75  Aligned_cols=23  Identities=22%  Similarity=0.595  Sum_probs=15.1

Q ss_pred             CCCcchhhhhHHHHHHHHHHHHHHH
Q psy17635          1 MPSDTVATAGWFIGMLLAIAFLILV   25 (103)
Q Consensus         1 ~~~~~~at~gWfIg~~~ai~lLlli   25 (103)
                      |||  .+..-||+.++.+...|+++
T Consensus         1 MPQ--L~p~pWf~~~~~~W~~l~~~   23 (54)
T MTH00171          1 MPQ--LNPIPWFFIFLFSWLIFLTL   23 (54)
T ss_pred             CCC--CCchHHHHHHHHHHHHHHHH
Confidence            555  34557999999985555533


No 214
>PRK10144 formate-dependent nitrite reductase complex subunit NrfF; Provisional
Probab=35.61  E-value=43  Score=24.31  Aligned_cols=19  Identities=11%  Similarity=0.179  Sum_probs=10.3

Q ss_pred             chhhhhHHHHHHHHHHHHH
Q psy17635          5 TVATAGWFIGMLLAIAFLI   23 (103)
Q Consensus         5 ~~at~gWfIg~~~ai~lLl   23 (103)
                      ++...+|+..++=+++|++
T Consensus        96 p~~~~t~~LW~~P~~lll~  114 (126)
T PRK10144         96 PLTGQTLVLWALPVVLLLL  114 (126)
T ss_pred             CCCcchHHHHHHHHHHHHH
Confidence            3455566666665544443


No 215
>PRK14094 psbM photosystem II reaction center protein M; Provisional
Probab=35.45  E-value=60  Score=20.26  Aligned_cols=27  Identities=26%  Similarity=0.444  Sum_probs=11.7

Q ss_pred             HHHHHHHHHHHHHHHHHHhheeecCCCc
Q psy17635         12 FIGMLLAIAFLILVLILVCLIKRNRGGK   39 (103)
Q Consensus        12 fIg~~~ai~lLllilli~C~i~R~rGgK   39 (103)
                      ||+.+..|++=...| ++-+|+-..++|
T Consensus         8 fiAtaLFi~iPT~FL-lilYVkT~s~~k   34 (50)
T PRK14094          8 FVASLLFVGVPTIFL-IGLFISTQDGEK   34 (50)
T ss_pred             HHHHHHHHHHHHHHh-hheeEEecccCc
Confidence            444444333333333 344466554444


No 216
>PF09685 Tic20:  Tic20-like protein;  InterPro: IPR019109  This entry represents a group of uncharacterised conserved proteins including a chloroplast protein import component called Tic20.  Chloroplast function requires the import of nuclear encoded proteins from the cytoplasm across the chloroplast double membrane. This is accomplished by two protein complexes, the Toc complex located at the outer membrane and the Tic complex located at the inner membrane. The Toc complex recognises specific proteins by a cleavable N-terminal sequence and is primarily responsible for translocation through the outer membrane, while the Tic complex translocates the protein through the inner membrane. Tic20 is a core member of the Tic complex and is deeply embedded in the inner envelope membrane. It is thought to function as a protein conducting component of the Tic complex []. 
Probab=35.03  E-value=46  Score=21.63  Aligned_cols=16  Identities=38%  Similarity=0.634  Sum_probs=8.5

Q ss_pred             HHHHhheeecCCCccc
Q psy17635         26 LILVCLIKRNRGGKYA   41 (103)
Q Consensus        26 lli~C~i~R~rGgKY~   41 (103)
                      +.|.-.+|-++|..|.
T Consensus        86 ~~I~~~~~a~~g~~~~  101 (109)
T PF09685_consen   86 LSIIGAIKANKGEPYR  101 (109)
T ss_pred             HHHHHHHHHHCCCeee
Confidence            3333445667776654


No 217
>PHA03281 envelope glycoprotein E; Provisional
Probab=34.86  E-value=26  Score=31.60  Aligned_cols=26  Identities=31%  Similarity=0.483  Sum_probs=18.5

Q ss_pred             HHHHHHHHHHHHHHHHHHHhheeecC
Q psy17635         11 WFIGMLLAIAFLILVLILVCLIKRNR   36 (103)
Q Consensus        11 WfIg~~~ai~lLllilli~C~i~R~r   36 (103)
                      =+++-|.|.|||+|++..+|..++-|
T Consensus       558 ~l~~~~a~~~ll~l~~~~~c~~~~~~  583 (642)
T PHA03281        558 AITGGFAALALLCLAIALICTAKKFG  583 (642)
T ss_pred             hhhhhhHHHHHHHHHHHHHHHHHHhh
Confidence            34566778888888888889665433


No 218
>COG3889 Predicted solute binding protein [General function prediction only]
Probab=34.86  E-value=23  Score=33.07  Aligned_cols=29  Identities=24%  Similarity=0.240  Sum_probs=17.3

Q ss_pred             hhHHHHHHHHHHHHH-HHHHHHhheeecCC
Q psy17635          9 AGWFIGMLLAIAFLI-LVLILVCLIKRNRG   37 (103)
Q Consensus         9 ~gWfIg~~~ai~lLl-lilli~C~i~R~rG   37 (103)
                      ++=.++.+|+.+.|| .++.+..+.+|+||
T Consensus       843 ~ttt~~~i~g~i~iiv~LaAla~lLrRRrg  872 (872)
T COG3889         843 QTTTGGGICGPIVIIVGLAALALLLRRRRG  872 (872)
T ss_pred             ccccccccchHHHHHHHHHHHHHHHHhhcC
Confidence            444578888866444 44444455667665


No 219
>PF05915 DUF872:  Eukaryotic protein of unknown function (DUF872);  InterPro: IPR008590 This entry represents several uncharacterised eukaryotic transmembrane proteins. The function of this currently unknown.
Probab=34.81  E-value=38  Score=23.92  Aligned_cols=8  Identities=13%  Similarity=0.256  Sum_probs=3.7

Q ss_pred             hheeecCC
Q psy17635         30 CLIKRNRG   37 (103)
Q Consensus        30 C~i~R~rG   37 (103)
                      |..|+.||
T Consensus        99 ~a~rg~~G  106 (115)
T PF05915_consen   99 YAWRGYKG  106 (115)
T ss_pred             HHHcCCCC
Confidence            44444444


No 220
>smart00805 AGTRAP Angiotensin II, type I receptor-associated protein. This family consists of several angiotensin II, type I receptor-associated protein (AGTRAP) sequences. AGTRAP is known to interact specifically with the C-terminal cytoplasmic region of the angiotensin II type 1 (AT(1)) receptor to regulate different aspects of AT(1) receptor physiology. The function of this family is unclear.
Probab=34.69  E-value=34  Score=25.98  Aligned_cols=12  Identities=42%  Similarity=0.761  Sum_probs=9.6

Q ss_pred             eecCCCcccchh
Q psy17635         33 KRNRGGKYAVHE   44 (103)
Q Consensus        33 ~R~rGgKY~Vke   44 (103)
                      -|.|||.|+|.-
T Consensus       112 ~~eRGG~l~v~~  123 (159)
T smart00805      112 YRERGGELPVNT  123 (159)
T ss_pred             HHHccCcccccc
Confidence            489999998653


No 221
>KOG0196|consensus
Probab=34.68  E-value=43  Score=31.72  Aligned_cols=17  Identities=29%  Similarity=0.401  Sum_probs=10.5

Q ss_pred             hhhHHHHHHHHHHHHHH
Q psy17635          8 TAGWFIGMLLAIAFLIL   24 (103)
Q Consensus         8 t~gWfIg~~~ai~lLll   24 (103)
                      ...|+||.+++.+++||
T Consensus       546 ~l~~i~g~~~~~v~~ll  562 (996)
T KOG0196|consen  546 QLPLIIGSILAGVVFLL  562 (996)
T ss_pred             chhhHHHHHHHHHHHHH
Confidence            34588888888444333


No 222
>TIGR01294 P_lamban phospholamban. This model represents the short (52 residue) transmembrane phosphoprotein phospholamban. Phospholamban, in its unphosphorylated form, inhibits SERCA2, the cardiac sarcoplasmic reticulum Ca-ATPase.
Probab=34.64  E-value=52  Score=20.50  Aligned_cols=22  Identities=32%  Similarity=0.557  Sum_probs=10.9

Q ss_pred             hhhHHHHHHHHHHHHHHHHHHH
Q psy17635          8 TAGWFIGMLLAIAFLILVLILV   29 (103)
Q Consensus         8 t~gWfIg~~~ai~lLllilli~   29 (103)
                      -|--|+-+...+.-|+||.+|+
T Consensus        28 lq~lfvnf~lilicllli~iiv   49 (52)
T TIGR01294        28 LQNLFINFCLILICLLLICIIV   49 (52)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHH
Confidence            3455665554444444444443


No 223
>MTH00123 ATP8 ATP synthase F0 subunit 8; Provisional
Probab=34.63  E-value=67  Score=19.35  Aligned_cols=23  Identities=48%  Similarity=0.739  Sum_probs=14.6

Q ss_pred             CCCcchhhhhHHHHHHHHHHHHHHH
Q psy17635          1 MPSDTVATAGWFIGMLLAIAFLILV   25 (103)
Q Consensus         1 ~~~~~~at~gWfIg~~~ai~lLlli   25 (103)
                      |||-  ...-||.-++++-+.|+++
T Consensus         1 MPQL--np~pWf~~~l~tWl~l~~~   23 (54)
T MTH00123          1 MPQL--NPAPWFLIMLLTWLTLILI   23 (54)
T ss_pred             CCCC--CchHHHHHHHHHHHHHHHH
Confidence            5553  3457999999885444443


No 224
>PF13633 N_methyl_3:  Prokaryotic N-terminal methylation site
Probab=34.63  E-value=68  Score=16.85  Aligned_cols=17  Identities=35%  Similarity=0.485  Sum_probs=10.7

Q ss_pred             HHHHHHHHHHHHHHHHH
Q psy17635         12 FIGMLLAIAFLILVLIL   28 (103)
Q Consensus        12 fIg~~~ai~lLllilli   28 (103)
                      +|=+|+|++++-+.+++
T Consensus         3 LIEvlIa~~i~~i~~~g   19 (22)
T PF13633_consen    3 LIEVLIAIAILGILALG   19 (22)
T ss_pred             HHHHHHHHHHHHHHHHh
Confidence            46677777766655554


No 225
>PF01529 zf-DHHC:  DHHC palmitoyltransferase;  InterPro: IPR001594 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This entry represents the DHHC-type zinc finger domain, which is also known as NEW1 []. The DHHC Zn-finger was first isolated in the Drosophila putative transcription factor DNZ1 and was named after a conserved sequence motif []. This domain has palmitoyltransferase activity; this post-translational modification attaches the C16 saturated fatty acid palmitate via a thioester linkage, predominantly to cysteine residues []. This domain is found in the DHHC proteins which are palmitoyl transferases []; the DHHC motif is found within a cysteine-rich domain which is thought to contain the catalytic site. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding
Probab=34.55  E-value=60  Score=22.70  Aligned_cols=24  Identities=17%  Similarity=0.346  Sum_probs=15.2

Q ss_pred             chhhhhHHHHHHHHHHHHHHHHHH
Q psy17635          5 TVATAGWFIGMLLAIAFLILVLIL   28 (103)
Q Consensus         5 ~~at~gWfIg~~~ai~lLllilli   28 (103)
                      +..+..||+..+....+++++.++
T Consensus        87 G~~N~~~F~~fl~~~~~~~~~~~~  110 (174)
T PF01529_consen   87 GRRNHRYFLLFLLYLCLYCLYFFI  110 (174)
T ss_pred             ccccHHHHHHHHHHHHHHHHHHHH
Confidence            456788898777665555544444


No 226
>PF03264 Cytochrom_NNT:  NapC/NirT cytochrome c family, N-terminal region;  InterPro: IPR005126 Within the NapC/NirT family of cytochrome c proteins, some members, such as NapC P33932 from SWISSPROT and NirT P24038 from SWISSPROT, bind four haem groups, while others, such as TorC P33226 from SWISSPROT, bind five haems. This family aligns the common N-terminal region that contains four haem-binding C-X(2)-CH motifs.; PDB: 2VR0_F 2J7A_C.
Probab=34.51  E-value=12  Score=27.16  Aligned_cols=14  Identities=21%  Similarity=0.634  Sum_probs=3.7

Q ss_pred             hHHHHHHHHHHHHH
Q psy17635         10 GWFIGMLLAIAFLI   23 (103)
Q Consensus        10 gWfIg~~~ai~lLl   23 (103)
                      -|++++++++++++
T Consensus         4 ~~~~~~~~~~~~~~   17 (173)
T PF03264_consen    4 KWSLLLLLLVGVVL   17 (173)
T ss_dssp             ------HSTTCHHH
T ss_pred             hhHHHHHHHHHHHH
Confidence            46665555544433


No 227
>PHA02975 hypothetical protein; Provisional
Probab=34.28  E-value=54  Score=21.75  Aligned_cols=21  Identities=10%  Similarity=0.123  Sum_probs=12.6

Q ss_pred             hhhhHHHHHHHHHHHHHHHHH
Q psy17635          7 ATAGWFIGMLLAIAFLILVLI   27 (103)
Q Consensus         7 at~gWfIg~~~ai~lLllill   27 (103)
                      ...+|.+.++..++.++++++
T Consensus        40 ~~~~~~~~ii~i~~v~~~~~~   60 (69)
T PHA02975         40 KSSLSIILIIFIIFITCIAVF   60 (69)
T ss_pred             CCchHHHHHHHHHHHHHHHHH
Confidence            556788877775444444443


No 228
>PF06011 TRP:  Transient receptor potential (TRP) ion channel;  InterPro: IPR010308 This family consists of hypothetical proteins of unknown function found in fungi.
Probab=34.11  E-value=55  Score=27.14  Aligned_cols=35  Identities=26%  Similarity=0.629  Sum_probs=21.0

Q ss_pred             hhhhHHHHHHHHHHHHHHHHHH-----HhheeecCCCccc
Q psy17635          7 ATAGWFIGMLLAIAFLILVLIL-----VCLIKRNRGGKYA   41 (103)
Q Consensus         7 at~gWfIg~~~ai~lLllilli-----~C~i~R~rGgKY~   41 (103)
                      .--|+.|.++-+++++++++++     +.++++++..||.
T Consensus       385 ~~vg~vi~~i~~~v~~~~~i~~~~~~~~~~~~~~~~~~~~  424 (438)
T PF06011_consen  385 TVVGYVIIIINAIVLLILFILIIVSTIISLFRKRPDSRYD  424 (438)
T ss_pred             chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCcCcCCC
Confidence            3456777777776665544433     2445677777764


No 229
>PF10907 DUF2749:  Protein of unknown function (DUF2749);  InterPro: IPR024475 This bacterial family of proteins represent the TrbJ and TrbK genes of the Ti plasmid conjugative transfer operon [].
Probab=33.97  E-value=35  Score=22.45  Aligned_cols=38  Identities=26%  Similarity=0.302  Sum_probs=21.2

Q ss_pred             hHHHHHHHHHHHHHHHHHHHhheeecC-CCcccchhhhhhc
Q psy17635         10 GWFIGMLLAIAFLILVLILVCLIKRNR-GGKYAVHEREAAH   49 (103)
Q Consensus        10 gWfIg~~~ai~lLllilli~C~i~R~r-GgKY~VkeKE~~~   49 (103)
                      .|+|+|++|++.+  ...+.++|-.++ .++=+..+..++|
T Consensus         4 ~viIaL~~avaa~--a~~atwviVq~~~a~~p~s~eeQr~~   42 (66)
T PF10907_consen    4 RVIIALVVAVAAA--AGAATWVIVQPRPAGNPASSEEQRAH   42 (66)
T ss_pred             chhHHHHHHHHhh--hceeEEEEECCCCCCCCCChHHHHHH
Confidence            5788888875554  333456566666 4444444443444


No 230
>PF04790 Sarcoglycan_1:  Sarcoglycan complex subunit protein;  InterPro: IPR006875 The dystrophin glycoprotein complex (DGC) is a membrane-spanning complex that links the interior cytoskeleton to the extracellular matrix in muscle. The sarcoglycan complex is a subcomplex within the DGC and is composed of several muscle-specific, transmembrane proteins (alpha-, beta-, gamma-, delta- and zeta-sarcoglycan). The sarcoglycans are asparagine-linked glycosylated proteins with single transmembrane domains. This family contains beta, gamma and delta members [, ].; GO: 0007010 cytoskeleton organization, 0016012 sarcoglycan complex, 0016021 integral to membrane
Probab=33.80  E-value=49  Score=26.47  Aligned_cols=24  Identities=21%  Similarity=0.351  Sum_probs=11.8

Q ss_pred             hHHHHHHHHHHHHHHHHHHHhhee
Q psy17635         10 GWFIGMLLAIAFLILVLILVCLIK   33 (103)
Q Consensus        10 gWfIg~~~ai~lLllilli~C~i~   33 (103)
                      .|++.++++|+.++=++|.+++++
T Consensus        14 ly~~vllL~il~iiNL~LTiwIl~   37 (264)
T PF04790_consen   14 LYLFVLLLFILAIINLALTIWILK   37 (264)
T ss_pred             HHHHHHHHHHHHHHHHHHHhhhhe
Confidence            355555555444444444455554


No 231
>PF13967 RSN1_TM:  Late exocytosis, associated with Golgi transport 
Probab=33.73  E-value=52  Score=23.38  Aligned_cols=28  Identities=14%  Similarity=0.497  Sum_probs=15.8

Q ss_pred             HHHHHHHHHHHHhheeecCCCcccchhh
Q psy17635         18 AIAFLILVLILVCLIKRNRGGKYAVHER   45 (103)
Q Consensus        18 ai~lLllilli~C~i~R~rGgKY~VkeK   45 (103)
                      .+++.++.+++.|++||+-..-|.-+..
T Consensus         9 ~~~i~~~~~~~F~~lR~~~~~iY~pR~~   36 (157)
T PF13967_consen    9 NLIIFLVLLLLFCILRKRFPRIYQPRSY   36 (157)
T ss_pred             HHHHHHHHHHHHHHHHhccHHhcccccc
Confidence            3444444455567666665666765544


No 232
>KOG4812|consensus
Probab=33.66  E-value=36  Score=27.70  Aligned_cols=28  Identities=18%  Similarity=0.396  Sum_probs=21.9

Q ss_pred             hhhhhHHHHHHHHHHHHHHHHHHHhhee
Q psy17635          6 VATAGWFIGMLLAIAFLILVLILVCLIK   33 (103)
Q Consensus         6 ~at~gWfIg~~~ai~lLllilli~C~i~   33 (103)
                      +..|.|+-.++|++.|||++--++=+||
T Consensus       219 ~n~q~wLwwi~~vlG~ll~lr~~i~Yik  246 (262)
T KOG4812|consen  219 FNGQYWLWWIFLVLGLLLFLRGFINYIK  246 (262)
T ss_pred             cccchHHHHHHHHHHHHHHHHHHHhHHH
Confidence            4458899999999999998876666665


No 233
>TIGR03147 cyt_nit_nrfF cytochrome c nitrite reductase, accessory protein NrfF.
Probab=33.66  E-value=51  Score=23.91  Aligned_cols=30  Identities=13%  Similarity=0.219  Sum_probs=16.4

Q ss_pred             chhhhhHHHHHHHHHHHHHHHHHHHhheee
Q psy17635          5 TVATAGWFIGMLLAIAFLILVLILVCLIKR   34 (103)
Q Consensus         5 ~~at~gWfIg~~~ai~lLllilli~C~i~R   34 (103)
                      ++...+|+..++=+++|++-+++++-.+||
T Consensus        96 p~~~~t~~LW~~P~lll~~G~~~~~~~~rr  125 (126)
T TIGR03147        96 PFKWQTLLLWLLPVLLLLLAFVLLWRVRRR  125 (126)
T ss_pred             CCCcchHHHHHHHHHHHHHHHHHHHHHHhc
Confidence            445566777666665555544444444444


No 234
>COG4768 Uncharacterized protein containing a divergent version of the methyl-accepting chemotaxis-like domain [General function prediction only]
Probab=33.63  E-value=54  Score=24.40  Aligned_cols=19  Identities=42%  Similarity=0.759  Sum_probs=11.1

Q ss_pred             HHHHHHHHHHHHHHHHHHH
Q psy17635         11 WFIGMLLAIAFLILVLILV   29 (103)
Q Consensus        11 WfIg~~~ai~lLllilli~   29 (103)
                      .+-+++.||+|++|.+.++
T Consensus         5 yIs~~iiAiAf~vL~I~li   23 (139)
T COG4768           5 YISLAIIAIAFLVLVIYLI   23 (139)
T ss_pred             HHHHHHHHHHHHHHHHHHH
Confidence            3445666677776655443


No 235
>PF08078 PsaX:  PsaX family;  InterPro: IPR012986 This family consists of the PsaX family of photosystem I (PSI) protein subunits. PSI is a large multi-subunit pigment protein complex embedded in the thylakoid membranes of green plants and cyanobacteria. PsaX is one of the 12 protein subunits found in PSI and these subunits are arranged as monomers or trimers within the membrane as shown by the structure of the trimeric complex from Synechococcus elongatus [].; PDB: 3PCQ_X 1JB0_X.
Probab=33.43  E-value=51  Score=19.38  Aligned_cols=15  Identities=53%  Similarity=1.170  Sum_probs=9.9

Q ss_pred             hhhHHHHHHHHHHHHH
Q psy17635          8 TAGWFIGMLLAIAFLI   23 (103)
Q Consensus         8 t~gWfIg~~~ai~lLl   23 (103)
                      ..+|.| ++.||-+|+
T Consensus        15 rt~Wa~-llLaINflV   29 (37)
T PF08078_consen   15 RTGWAL-LLLAINFLV   29 (37)
T ss_dssp             HHHHHH-HHHHHHHHH
T ss_pred             hHHHHH-HHHHHHHHH
Confidence            457988 666666654


No 236
>KOG1485|consensus
Probab=33.42  E-value=26  Score=30.17  Aligned_cols=29  Identities=34%  Similarity=0.594  Sum_probs=23.8

Q ss_pred             hhhhHHHHHHHHHHHHHHHHHHHhheeec
Q psy17635          7 ATAGWFIGMLLAIAFLILVLILVCLIKRN   35 (103)
Q Consensus         7 at~gWfIg~~~ai~lLllilli~C~i~R~   35 (103)
                      -.+.|.|++|+.+...+|.+...|-+++|
T Consensus       227 ~~a~~~i~i~is~~~vk~~l~~~c~~~~n  255 (412)
T KOG1485|consen  227 INALWLIAIMISAKEVKLRLTLYCAIKTN  255 (412)
T ss_pred             cchhhhheehhhHHHHHHHHHHHHHHhcC
Confidence            34569999999988888888888877666


No 237
>PRK06870 secG preprotein translocase subunit SecG; Reviewed
Probab=33.38  E-value=75  Score=20.40  Aligned_cols=14  Identities=29%  Similarity=0.778  Sum_probs=7.2

Q ss_pred             HHHHHhheeecCCC
Q psy17635         25 VLILVCLIKRNRGG   38 (103)
Q Consensus        25 illi~C~i~R~rGg   38 (103)
                      ++.++.+...+||+
T Consensus        15 ~LI~~vLlQ~~k~~   28 (76)
T PRK06870         15 LLIILVLLQSGKGA   28 (76)
T ss_pred             HHHHHheeeCCCCC
Confidence            33334555666654


No 238
>PHA02957 hypothetical protein; Provisional
Probab=33.07  E-value=45  Score=25.76  Aligned_cols=24  Identities=29%  Similarity=0.701  Sum_probs=11.7

Q ss_pred             HHHHHHHHHHHHHHHHHHhheeecC
Q psy17635         12 FIGMLLAIAFLILVLILVCLIKRNR   36 (103)
Q Consensus        12 fIg~~~ai~lLllilli~C~i~R~r   36 (103)
                      |.+++..|-+++|++++.| +.|+|
T Consensus        19 f~ai~~~~~~~~lli~~t~-~~~sk   42 (206)
T PHA02957         19 FLAILLIIPLELLLICHTC-ITASK   42 (206)
T ss_pred             HHHHHHHHHHHHHHhhhee-eeccc
Confidence            3344444555555555555 34443


No 239
>PF08507 COPI_assoc:  COPI associated protein;  InterPro: IPR013714 Proteins in this family co-localise with COPI vesicle coat proteins []. In yeast it is a Golgi membrane protein involved in vesicular trafficking, interacting with TVP18 []. 
Probab=32.97  E-value=73  Score=22.34  Aligned_cols=9  Identities=44%  Similarity=0.693  Sum_probs=4.0

Q ss_pred             hHHHHHHHH
Q psy17635         10 GWFIGMLLA   18 (103)
Q Consensus        10 gWfIg~~~a   18 (103)
                      .|++|++..
T Consensus        88 ~~i~g~~~~   96 (136)
T PF08507_consen   88 SIIIGLLLF   96 (136)
T ss_pred             HHHHHHHHH
Confidence            344444433


No 240
>PF08374 Protocadherin:  Protocadherin;  InterPro: IPR013585 The structure of protocadherins is similar to that of classic cadherins (IPR002126 from INTERPRO), but they also have some unique features associated with the cytoplasmic domains. They are expressed in a variety of organisms and are found in high concentrations in the brain where they seem to be localised mainly at cell-cell contact sites. Their expression seems to be developmentally regulated []. 
Probab=32.83  E-value=20  Score=28.45  Aligned_cols=17  Identities=29%  Similarity=0.723  Sum_probs=7.2

Q ss_pred             HHHHHHHHHHHHHHHHHh
Q psy17635         13 IGMLLAIAFLILVLILVC   30 (103)
Q Consensus        13 Ig~~~ai~lLllilli~C   30 (103)
                      .|+|.+| |+|||+.+++
T Consensus        45 AG~~tVI-LVI~i~v~vR   61 (221)
T PF08374_consen   45 AGIMTVI-LVIFIVVLVR   61 (221)
T ss_pred             cchhhhH-HHHHHHHHHH
Confidence            3444443 4444444444


No 241
>PF08001 CMV_US:  CMV US;  InterPro: IPR012536 This is a family of unique short (US) cytoplasmic glycoproteins which are expressed in cytomegalovirus [].; GO: 0030683 evasion by virus of host immune response, 0030176 integral to endoplasmic reticulum membrane
Probab=32.71  E-value=54  Score=26.00  Aligned_cols=27  Identities=22%  Similarity=0.378  Sum_probs=20.7

Q ss_pred             hhhhHHHHHHHHHHHHHHHHHHHhhee
Q psy17635          7 ATAGWFIGMLLAIAFLILVLILVCLIK   33 (103)
Q Consensus         7 at~gWfIg~~~ai~lLllilli~C~i~   33 (103)
                      ....|...++.+-.+|++++||.|++.
T Consensus       176 l~~~wv~~i~~~r~tl~~~vla~~~~~  202 (231)
T PF08001_consen  176 LTAFWVYLIIFSRVTLMGVVLAQCVYG  202 (231)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHhhh
Confidence            456788888888777777888887764


No 242
>PF12955 DUF3844:  Domain of unknown function (DUF3844);  InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=32.71  E-value=66  Score=22.63  Aligned_cols=27  Identities=7%  Similarity=0.145  Sum_probs=17.6

Q ss_pred             cchhhhhHHHHHHHHHHHHHHHHHHHh
Q psy17635          4 DTVATAGWFIGMLLAIAFLILVLILVC   30 (103)
Q Consensus         4 ~~~at~gWfIg~~~ai~lLllilli~C   30 (103)
                      .+++.+.|+|+-...+++++++.-|-.
T Consensus        62 kDvS~~F~L~~~~ti~lv~~~~~~I~l   88 (103)
T PF12955_consen   62 KDVSVPFWLFAGFTIALVVLVAGAIGL   88 (103)
T ss_pred             ccccchhhHHHHHHHHHHHHHHHHHHH
Confidence            356778888887766666555555543


No 243
>PRK06007 fliF flagellar MS-ring protein; Reviewed
Probab=32.68  E-value=44  Score=29.13  Aligned_cols=15  Identities=27%  Similarity=0.693  Sum_probs=6.7

Q ss_pred             HHHHHHHHHHHHHHH
Q psy17635         11 WFIGMLLAIAFLILV   25 (103)
Q Consensus        11 WfIg~~~ai~lLlli   25 (103)
                      ||..++-.++++|++
T Consensus       436 ~~~~~~~~~~~~l~~  450 (542)
T PRK06007        436 WFMDLIKLAAGALLI  450 (542)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            555554443333333


No 244
>KOG4298|consensus
Probab=32.66  E-value=63  Score=25.79  Aligned_cols=26  Identities=15%  Similarity=0.616  Sum_probs=18.7

Q ss_pred             hhHHHHHHHHHHHHHHHHHHHhheee
Q psy17635          9 AGWFIGMLLAIAFLILVLILVCLIKR   34 (103)
Q Consensus         9 ~gWfIg~~~ai~lLllilli~C~i~R   34 (103)
                      .+|....++.++|.++=+.++|++|=
T Consensus       148 laW~FST~iGllLFL~Ei~llcwvKF  173 (245)
T KOG4298|consen  148 LAWAFSTVIGLLLFLAEIVLLCWVKF  173 (245)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHheeee
Confidence            46877777777666666777888874


No 245
>PF12158 DUF3592:  Protein of unknown function (DUF3592);  InterPro: IPR021994  This family of proteins is functionally uncharacterised.This family of proteins is found in bacteria, archaea, eukaryotes and viruses. Proteins in this family are typically between 150 and 242 amino acids in length. 
Probab=32.09  E-value=36  Score=22.99  Aligned_cols=14  Identities=36%  Similarity=0.493  Sum_probs=7.7

Q ss_pred             hHHHHHHHHHHHHH
Q psy17635         10 GWFIGMLLAIAFLI   23 (103)
Q Consensus        10 gWfIg~~~ai~lLl   23 (103)
                      +|+..++.++.+++
T Consensus         1 ~~~~~~~~~i~l~~   14 (148)
T PF12158_consen    1 GVFLLLFGIIFLLI   14 (148)
T ss_pred             CeEhHHHHHHHHHH
Confidence            46666655554444


No 246
>TIGR02227 sigpep_I_bact signal peptidase I, bacterial type. A related model finds a simlar protein in many archaea and a few bacteria, as well as a microsomal (endoplasmic reticulum) protein in eukaryotes.
Probab=32.07  E-value=40  Score=24.26  Aligned_cols=15  Identities=27%  Similarity=0.645  Sum_probs=9.4

Q ss_pred             HHHHHHHHHHHHHHH
Q psy17635         11 WFIGMLLAIAFLILV   25 (103)
Q Consensus        11 WfIg~~~ai~lLlli   25 (103)
                      |++.+++||++.+++
T Consensus         1 ~~~~~~~~~~~~~~i   15 (163)
T TIGR02227         1 LILSLLIAILLALLI   15 (163)
T ss_pred             CHHHHHHHHHHHHHH
Confidence            666666666555554


No 247
>PF15106 TMEM156:  TMEM156 protein family
Probab=32.06  E-value=53  Score=26.20  Aligned_cols=14  Identities=21%  Similarity=0.605  Sum_probs=8.1

Q ss_pred             hhhhHHHHHHHHHH
Q psy17635          7 ATAGWFIGMLLAIA   20 (103)
Q Consensus         7 at~gWfIg~~~ai~   20 (103)
                      -...|.+.+++.++
T Consensus       174 mKITWYvLVllVfi  187 (226)
T PF15106_consen  174 MKITWYVLVLLVFI  187 (226)
T ss_pred             hhhHHHHHHHHHHH
Confidence            35678876555433


No 248
>MTH00260 ATP8 ATP synthase F0 subunit 8; Provisional
Probab=31.92  E-value=1.1e+02  Score=18.78  Aligned_cols=32  Identities=16%  Similarity=0.347  Sum_probs=21.9

Q ss_pred             CCCcchhhhhHHHHHHHHHHHHHHHHHHHhheee
Q psy17635          1 MPSDTVATAGWFIGMLLAIAFLILVLILVCLIKR   34 (103)
Q Consensus         1 ~~~~~~at~gWfIg~~~ai~lLllilli~C~i~R   34 (103)
                      |||-  +--.|++..+..++.++++..++.+.+-
T Consensus         1 MPQm--sPm~W~~l~~~f~~~~~~~~~~~~~~~~   32 (53)
T MTH00260          1 MPHL--SPMSWLTAMIIFWFILLIFASSMWWSQS   32 (53)
T ss_pred             CCCc--ccHHHHHHHHHHHHHHHHHHHHHHHhcC
Confidence            4543  3357999888777777777777777653


No 249
>PRK08476 F0F1 ATP synthase subunit B'; Validated
Probab=31.89  E-value=58  Score=23.14  Aligned_cols=21  Identities=24%  Similarity=0.453  Sum_probs=10.9

Q ss_pred             HHHHHHHHHHHHHHHHHHHhh
Q psy17635         11 WFIGMLLAIAFLILVLILVCL   31 (103)
Q Consensus        11 WfIg~~~ai~lLllilli~C~   31 (103)
                      |...++-+|.||+|++++-.|
T Consensus         7 ~~~~~~qli~Flil~~~l~kf   27 (141)
T PRK08476          7 PYLMLATFVVFLLLIVILNSW   27 (141)
T ss_pred             HHHHHHHHHHHHHHHHHHHHH
Confidence            544445555566655555433


No 250
>PF10853 DUF2650:  Protein of unknown function (DUF2650);  InterPro: IPR022559  This region is found in proteins with unknown function in metazoa. 
Probab=31.85  E-value=40  Score=19.79  Aligned_cols=14  Identities=21%  Similarity=0.406  Sum_probs=9.1

Q ss_pred             hhhhhHHHHHHHHH
Q psy17635          6 VATAGWFIGMLLAI   19 (103)
Q Consensus         6 ~at~gWfIg~~~ai   19 (103)
                      ++-|.|++.+|..+
T Consensus        22 f~lq~Wv~v~l~v~   35 (38)
T PF10853_consen   22 FRLQIWVIVLLAVL   35 (38)
T ss_pred             HHHHHHHHHHHHHH
Confidence            56688887665443


No 251
>TIGR02507 MtrF tetrahydromethanopterin S-methyltransferase, F subunit. coenzyme M methyltransferase in methanogenic archaea. This methyltranferase is membrane-associated enzyme complex that uses methy-transfer reaction to drive sodium-ion pump. Archaea domain, have evolved energy-yielding pathways marked by one-carbon biochemistry featuring novel cofactors and enzymes. This transferase is involved in the transfer of 'methyl' group from N5-methyltetrahydromethanopterin to coenzyme M. In an accompanying reaction, methane is produced by two-electron reduction of the methyl moiety in methyl-coenzyme M by another enzyme methyl-coenzyme M reductase.
Probab=31.69  E-value=85  Score=20.53  Aligned_cols=17  Identities=24%  Similarity=0.507  Sum_probs=12.6

Q ss_pred             hhhhhHHHHHHHHHHHH
Q psy17635          6 VATAGWFIGMLLAIAFL   22 (103)
Q Consensus         6 ~at~gWfIg~~~ai~lL   22 (103)
                      ....|..||++.|++|.
T Consensus        40 ~~~~G~~iG~~~Al~lV   56 (65)
T TIGR02507        40 TTITGLAYGFLFAVLLV   56 (65)
T ss_pred             hhhHHHHHHHHHHHHHH
Confidence            45678888888887664


No 252
>PF04639 Baculo_E56:  Baculoviral E56 protein, specific to ODV envelope;  InterPro: IPR006733 This family represents the E56 protein, which is localized to the occlusion derived virus (ODV) envelope, but not to the budded virus (BV) envelope []. Signals necessary for transport and/or retention into this structure are believed to be found within the C-terminal portion of ODV-E56.; GO: 0019031 viral envelope
Probab=31.59  E-value=56  Score=27.15  Aligned_cols=9  Identities=11%  Similarity=0.006  Sum_probs=3.3

Q ss_pred             hhHHHHHHH
Q psy17635          9 AGWFIGMLL   17 (103)
Q Consensus         9 ~gWfIg~~~   17 (103)
                      ..+-|.+|+
T Consensus       276 ~l~piil~I  284 (305)
T PF04639_consen  276 SLLPIILII  284 (305)
T ss_pred             hhhHHHHHH
Confidence            333333333


No 253
>PF06422 PDR_CDR:  CDR ABC transporter;  InterPro: IPR010929 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energise diverse biological systems. ABC transporters minimally consist of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain [].  The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyse ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarise the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site [, , ]. The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis [, , , , , ]. The ATP-Binding Cassette (ABC) superfamily forms one of the largest of all protein families with a diversity of physiological functions []. Several studies have shown that there is a correlation between the functional characterisation and the phylogenetic classification of the ABC cassette [, ]. More than 50 subfamilies have been described based on a phylogenetic and functional classification [, , ]; (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). In yeast, the PDR and CDR ABC transporters display extensive sequence homology, and confer resistance to several anti-fungal compounds by actively transporting their substrates out of the cell. These transporters have two homologous halves, each with an N-terminal intracellular hydrophilic region that contains an ATP-binding site, followed by a C-terminal membrane-associated region containing six transmembrane segments []. This entry represents a domain of the PDR/CDR ABC transporter comprising extracellular loop 3, transmembrane segment 6 and a linker region.; GO: 0005524 ATP binding, 0042626 ATPase activity, coupled to transmembrane movement of substances, 0006810 transport, 0016021 integral to membrane
Probab=31.44  E-value=82  Score=21.36  Aligned_cols=27  Identities=33%  Similarity=0.567  Sum_probs=14.3

Q ss_pred             HHHHHHHHHHHhheeecCCC-cccchhh
Q psy17635         19 IAFLILVLILVCLIKRNRGG-KYAVHER   45 (103)
Q Consensus        19 i~lLllilli~C~i~R~rGg-KY~VkeK   45 (103)
                      |.++++.++.+-+++..+++ .-.|..|
T Consensus        60 i~f~~~~~~~~e~~~~~~~~~~~l~f~r   87 (103)
T PF06422_consen   60 IFFIVLTLLATEFIKFEKSGGEVLVFKR   87 (103)
T ss_pred             HHHHHHHHHHHHHhcccCCCccEEEEeC
Confidence            55555666666666655444 3333333


No 254
>PF11166 DUF2951:  Protein of unknown function (DUF2951);  InterPro: IPR021337  This family of proteins has no known function. It has a highly conserved sequence. 
Probab=31.38  E-value=63  Score=22.77  Aligned_cols=21  Identities=19%  Similarity=0.471  Sum_probs=16.1

Q ss_pred             hhhHHHHHHHHHHHHHHHHHH
Q psy17635          8 TAGWFIGMLLAIAFLILVLIL   28 (103)
Q Consensus         8 t~gWfIg~~~ai~lLllilli   28 (103)
                      -..|+.|++.-|+-.++|+++
T Consensus        72 ~KmwilGlvgTi~gsliia~l   92 (98)
T PF11166_consen   72 IKMWILGLVGTIFGSLIIALL   92 (98)
T ss_pred             HHHHHHHHHHHHHHHHHHHHH
Confidence            357999999988877777654


No 255
>COG3763 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=31.30  E-value=50  Score=22.03  Aligned_cols=6  Identities=17%  Similarity=0.030  Sum_probs=2.8

Q ss_pred             CCCCCC
Q psy17635         51 RHDYPE   56 (103)
Q Consensus        51 ~~~~~d   56 (103)
                      +||+.+
T Consensus        36 NPpine   41 (71)
T COG3763          36 NPPINE   41 (71)
T ss_pred             CCCCCH
Confidence            445543


No 256
>PF10003 DUF2244:  Integral membrane protein (DUF2244);  InterPro: IPR019253  This entry consists of various bacterial putative membrane proteins with no known function. 
Probab=31.25  E-value=84  Score=22.33  Aligned_cols=20  Identities=25%  Similarity=0.369  Sum_probs=11.4

Q ss_pred             hhhhhHHHHHHHHHHHHHHH
Q psy17635          6 VATAGWFIGMLLAIAFLILV   25 (103)
Q Consensus         6 ~at~gWfIg~~~ai~lLlli   25 (103)
                      .....||+++++++.+++.+
T Consensus         9 ~~g~~~~~~~~~~~~~~~a~   28 (140)
T PF10003_consen    9 PRGFLIFIAILAAVSLIIAI   28 (140)
T ss_pred             HHHHHHHHHHHHHHHHHHHH
Confidence            34445777777665555433


No 257
>MTH00047 COX2 cytochrome c oxidase subunit II; Provisional
Probab=31.10  E-value=69  Score=24.35  Aligned_cols=9  Identities=11%  Similarity=0.202  Sum_probs=4.0

Q ss_pred             eecCCCccc
Q psy17635         33 KRNRGGKYA   41 (103)
Q Consensus        33 ~R~rGgKY~   41 (103)
                      +|+...+|.
T Consensus        35 ~~~~~~~~~   43 (194)
T MTH00047         35 SGNGSVNFG   43 (194)
T ss_pred             ccccccccc
Confidence            334444554


No 258
>PF10617 DUF2474:  Protein of unknown function (DUF2474);  InterPro: IPR018895  This family of short proteins has no known function. 
Probab=31.10  E-value=89  Score=18.58  Aligned_cols=19  Identities=21%  Similarity=0.398  Sum_probs=14.0

Q ss_pred             hhhhhHHHHHHHHHHHHHH
Q psy17635          6 VATAGWFIGMLLAIAFLIL   24 (103)
Q Consensus         6 ~at~gWfIg~~~ai~lLll   24 (103)
                      ...-+||+++=++-++.+.
T Consensus         5 wkRl~W~v~iW~~SV~aL~   23 (40)
T PF10617_consen    5 WKRLGWFVLIWAASVLALG   23 (40)
T ss_pred             HHHHHHHHHHHHHHHHHHH
Confidence            4567999999888555553


No 259
>PF15202 Adipogenin:  Adipogenin
Probab=30.93  E-value=90  Score=21.05  Aligned_cols=25  Identities=28%  Similarity=0.300  Sum_probs=12.8

Q ss_pred             hhhhHHHHHHHHHHHHHHHHHHHhh
Q psy17635          7 ATAGWFIGMLLAIAFLILVLILVCL   31 (103)
Q Consensus         7 at~gWfIg~~~ai~lLllilli~C~   31 (103)
                      -|...+.--+|.-+-|+|.++|+++
T Consensus        12 ltfsflvfwlclpv~lllfl~ivwl   36 (81)
T PF15202_consen   12 LTFSFLVFWLCLPVGLLLFLLIVWL   36 (81)
T ss_pred             hhHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3444444455664445555555543


No 260
>PF09788 Tmemb_55A:  Transmembrane protein 55A;  InterPro: IPR019178  Members of this family catalyse the hydrolysis of the 4-position phosphate of phosphatidylinositol 4,5-bisphosphate, in the reaction:  1-phosphatidyl-myo-inositol 4,5-bisphosphate + H(2)O = 1-phosphatidyl-1D-myo-inositol 5-phosphate + phosphate.  
Probab=30.86  E-value=50  Score=26.80  Aligned_cols=37  Identities=30%  Similarity=0.406  Sum_probs=17.8

Q ss_pred             hhhhhHHHHHHHHHHHHHHHHHHH---hheeecCCCcccc
Q psy17635          6 VATAGWFIGMLLAIAFLILVLILV---CLIKRNRGGKYAV   42 (103)
Q Consensus         6 ~at~gWfIg~~~ai~lLllilli~---C~i~R~rGgKY~V   42 (103)
                      ++...+++-+++++++|++.+.+.   +-..++.||.|.+
T Consensus       193 faRkR~i~f~llgllfliiaigltvGT~~~A~~~~giY~~  232 (256)
T PF09788_consen  193 FARKRAIIFFLLGLLFLIIAIGLTVGTWTYAKTYGGIYVS  232 (256)
T ss_pred             HhhhHHHHHHHHHHHHHHHHHHHhhhhHHHHhhcCcEeHH
Confidence            444444444444444433332221   3234777888864


No 261
>PF08496 Peptidase_S49_N:  Peptidase family S49 N-terminal;  InterPro: IPR013703 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain is found to the N terminus of bacterial signal peptidases that belong to the MEROPS peptidase family S49 (protease IV family, clan SK) (see also IPR002142 from INTERPRO) [, ]. ; GO: 0004252 serine-type endopeptidase activity, 0005886 plasma membrane
Probab=30.81  E-value=62  Score=23.98  Aligned_cols=18  Identities=22%  Similarity=0.399  Sum_probs=9.0

Q ss_pred             HHHHHHHHHHhheeecCC
Q psy17635         20 AFLILVLILVCLIKRNRG   37 (103)
Q Consensus        20 ~lLllilli~C~i~R~rG   37 (103)
                      ++++++.+|+-...|+++
T Consensus        20 aI~~vv~~I~~~~~k~k~   37 (155)
T PF08496_consen   20 AILAVVGLIVAAAQKKKK   37 (155)
T ss_pred             HHHHHHHHHHHHhccCCC
Confidence            333344445555566664


No 262
>KOG2678|consensus
Probab=30.55  E-value=63  Score=26.06  Aligned_cols=19  Identities=21%  Similarity=0.481  Sum_probs=14.0

Q ss_pred             hhHHHHHHHHHHHHHHHHH
Q psy17635          9 AGWFIGMLLAIAFLILVLI   27 (103)
Q Consensus         9 ~gWfIg~~~ai~lLllill   27 (103)
                      --||...|++++++.||.-
T Consensus       215 s~wf~~~miI~v~~sFVsM  233 (244)
T KOG2678|consen  215 SYWFYITMIIFVILSFVSM  233 (244)
T ss_pred             hHHHHHHHHHHHHHHHHHH
Confidence            4699888888777776643


No 263
>PHA02657 hypothetical protein; Provisional
Probab=30.49  E-value=58  Score=22.71  Aligned_cols=19  Identities=21%  Similarity=0.429  Sum_probs=11.6

Q ss_pred             hhHHHHHHHHHHHHHHHHH
Q psy17635          9 AGWFIGMLLAIAFLILVLI   27 (103)
Q Consensus         9 ~gWfIg~~~ai~lLllill   27 (103)
                      ..-|+.++|.++++||.+.
T Consensus        30 itvfv~vI~il~flLLYLv   48 (95)
T PHA02657         30 FTIFIFVVCILIYLLIYLV   48 (95)
T ss_pred             HHHHHHHHHHHHHHHHHHH
Confidence            3456666666666666553


No 264
>PF13172 PepSY_TM_1:  PepSY-associated TM helix
Probab=30.43  E-value=75  Score=17.35  Aligned_cols=14  Identities=36%  Similarity=0.819  Sum_probs=7.6

Q ss_pred             HHHHHHHHHHHHHH
Q psy17635         12 FIGMLLAIAFLILV   25 (103)
Q Consensus        12 fIg~~~ai~lLlli   25 (103)
                      .+|+++++.++++.
T Consensus        11 ~~g~~~~~~ll~~~   24 (34)
T PF13172_consen   11 WLGLIAAIFLLLLA   24 (34)
T ss_pred             HHHHHHHHHHHHHH
Confidence            45666665555443


No 265
>KOG1024|consensus
Probab=30.38  E-value=22  Score=31.38  Aligned_cols=30  Identities=17%  Similarity=0.412  Sum_probs=20.4

Q ss_pred             hhhhhHHHHHHHHHHHHHHHHHHHhhe-eec
Q psy17635          6 VATAGWFIGMLLAIAFLILVLILVCLI-KRN   35 (103)
Q Consensus         6 ~at~gWfIg~~~ai~lLllilli~C~i-~R~   35 (103)
                      ......|+++.+|.+++.|+++|+.|. .++
T Consensus       182 s~~~~f~V~v~va~a~v~lva~il~c~~~~s  212 (563)
T KOG1024|consen  182 SGLVTFIVGVIVALALVSLVALILYCAKGPS  212 (563)
T ss_pred             cccEEEeeeHHHHHHHHHHHHHHHHHHhccc
Confidence            345567888888888887777776444 344


No 266
>TIGR00892 2A0113 monocarboxylate transporter 1.
Probab=30.24  E-value=40  Score=27.32  Aligned_cols=26  Identities=15%  Similarity=0.011  Sum_probs=11.5

Q ss_pred             HHHHHHHHHHHHHHHHHHHhheeecC
Q psy17635         11 WFIGMLLAIAFLILVLILVCLIKRNR   36 (103)
Q Consensus        11 WfIg~~~ai~lLllilli~C~i~R~r   36 (103)
                      |+.++++.++.+.+++.+..++++.|
T Consensus       404 ~~~~~~~li~~~~~~~~~~~~~~~~~  429 (455)
T TIGR00892       404 YASGSIVVSAGLFLAIGNYINYRLLA  429 (455)
T ss_pred             HHhhHHHHHHHHHHHHHHHHHHHHHH
Confidence            44444444444444444444444443


No 267
>TIGR02205 septum_zipA cell division protein ZipA. This model represents the full length of bacterial cell division protein ZipA. The N-terminal hydrophobic stretch is an uncleaved signal-anchor sequence. This is followed by an unconserved, variable length, low complexity region, and then a conserved C-terminal region of about 140 amino acids (see pfam04354) that interacts with the tubulin-like cell division protein FtsZ.
Probab=30.19  E-value=43  Score=27.19  Aligned_cols=10  Identities=0%  Similarity=0.092  Sum_probs=4.2

Q ss_pred             HHhheeecCC
Q psy17635         28 LVCLIKRNRG   37 (103)
Q Consensus        28 i~C~i~R~rG   37 (103)
                      -+|=.||.|+
T Consensus        21 Glwt~Rke~s   30 (284)
T TIGR02205        21 GLWTSRKEKS   30 (284)
T ss_pred             cccccccccc
Confidence            3454444444


No 268
>COG5416 Uncharacterized integral membrane protein [Function unknown]
Probab=30.14  E-value=48  Score=23.33  Aligned_cols=7  Identities=14%  Similarity=0.662  Sum_probs=3.4

Q ss_pred             hHHHHHH
Q psy17635         10 GWFIGML   16 (103)
Q Consensus        10 gWfIg~~   16 (103)
                      -|++.+.
T Consensus        23 ~w~vi~~   29 (98)
T COG5416          23 QWTVIIV   29 (98)
T ss_pred             eeeHHHH
Confidence            4555444


No 269
>PF05140 ResB:  ResB-like family ;  InterPro: IPR007816 This domain is found in a number of known and suspected cytochrome c biogenesis proteins, including ResB []. Mutations in ResB indicate that they are essential for growth []. ResB is predicted to be a transmembrane protein.
Probab=30.11  E-value=49  Score=27.68  Aligned_cols=25  Identities=28%  Similarity=0.699  Sum_probs=15.0

Q ss_pred             hhhhhHHHHHHHHHHHHHHHHHHHhheee
Q psy17635          6 VATAGWFIGMLLAIAFLILVLILVCLIKR   34 (103)
Q Consensus         6 ~at~gWfIg~~~ai~lLllilli~C~i~R   34 (103)
                      ..+..||+++|..+    .+=|++|.++|
T Consensus        50 vy~S~wF~~ll~ll----~~sL~~Cs~~R   74 (464)
T PF05140_consen   50 VYSSWWFLLLLVLL----ALSLIACSIDR   74 (464)
T ss_pred             eEecHHHHHHHHHH----HHHHHHHHHHH
Confidence            45678998876543    34445555554


No 270
>PF04202 Mfp-3:  Foot protein 3;  InterPro: IPR007328 Mytilus foot protein-3 (Mfp-3) is a highly polymorphic protein family located in the byssal adhesive plaques of blue mussels.
Probab=29.97  E-value=65  Score=21.44  Aligned_cols=17  Identities=35%  Similarity=0.438  Sum_probs=8.5

Q ss_pred             HHHHHHhheeecCCCcc
Q psy17635         24 LVLILVCLIKRNRGGKY   40 (103)
Q Consensus        24 lilli~C~i~R~rGgKY   40 (103)
                      |+|+-.|-|.-..|.-|
T Consensus        12 LvLIg~fAVqSdag~~y   28 (71)
T PF04202_consen   12 LVLIGSFAVQSDAGYYY   28 (71)
T ss_pred             HHHHhhheeeecCcccc
Confidence            33444455555555544


No 271
>PF07856 Orai-1:  Mediator of CRAC channel activity;  InterPro: IPR012446 This entry includes Drosophila Orai and human Orai1, Orai2 and Orai3. ORAI-1 GFP reporters are co-expressed with STIM-1 (ER CA(2+) sensors) in the gonad and intestine. The protein has four predicted transmembrane domains with a highly conserved region between TM2 ad TM3. This conserved domain is thought to function in channel regulation. ORAI1-related proteins are required for the production of the calcium channel, CRAC, along with STIM1-related proteins [].
Probab=29.97  E-value=79  Score=23.84  Aligned_cols=26  Identities=27%  Similarity=0.459  Sum_probs=19.8

Q ss_pred             hhHHHHHHHHHHHHHHHHHHHhheee
Q psy17635          9 AGWFIGMLLAIAFLILVLILVCLIKR   34 (103)
Q Consensus         9 ~gWfIg~~~ai~lLllilli~C~i~R   34 (103)
                      ..|..+....|.|.+.-+.++|+||=
T Consensus       108 ~~W~~s~~lGi~lFL~~l~l~~WIKF  133 (175)
T PF07856_consen  108 LAWRFSTVLGIPLFLAELALLGWIKF  133 (175)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHheee
Confidence            45888888887777777777888873


No 272
>PF09889 DUF2116:  Uncharacterized protein containing a Zn-ribbon (DUF2116);  InterPro: IPR019216 This entry contains various hypothetical prokaryotic proteins whose functions are unknown. They contain a conserved zinc ribbon motif in the N-terminal part and a predicted transmembrane segment in the C-terminal part.
Probab=29.76  E-value=78  Score=20.09  Aligned_cols=9  Identities=0%  Similarity=0.069  Sum_probs=3.7

Q ss_pred             hhHHHHHHH
Q psy17635          9 AGWFIGMLL   17 (103)
Q Consensus         9 ~gWfIg~~~   17 (103)
                      .-|+.-++.
T Consensus        38 ~~~i~~~~~   46 (59)
T PF09889_consen   38 TQYIFFGIF   46 (59)
T ss_pred             HHHHHHHHH
Confidence            344443333


No 273
>PF07787 DUF1625:  Protein of unknown function (DUF1625);  InterPro: IPR012430 Sequences making up this family are derived from hypothetical proteins expressed by both prokaryotic and eukaryotic species. The region in question is approximately 250 residues long. 
Probab=29.76  E-value=80  Score=24.28  Aligned_cols=21  Identities=14%  Similarity=0.521  Sum_probs=10.0

Q ss_pred             HHHHHHHHHHHHHHHHHHhhe
Q psy17635         12 FIGMLLAIAFLILVLILVCLI   32 (103)
Q Consensus        12 fIg~~~ai~lLllilli~C~i   32 (103)
                      ++++++++.|-++++.+.++.
T Consensus       225 ~~~~~~s~~lsl~~Ia~aW~~  245 (248)
T PF07787_consen  225 LVAFIISFSLSLLTIALAWLF  245 (248)
T ss_pred             HHHHHHHHHHHHHHHHHhhee
Confidence            344444555555544444443


No 274
>PF10821 DUF2567:  Protein of unknown function (DUF2567);  InterPro: IPR021213  This is a bacterial family of proteins with unknown function. 
Probab=29.64  E-value=59  Score=24.67  Aligned_cols=30  Identities=23%  Similarity=0.420  Sum_probs=17.5

Q ss_pred             hhhhhHHHHHHHHHHHHHHHHHHHhheeecCC
Q psy17635          6 VATAGWFIGMLLAIAFLILVLILVCLIKRNRG   37 (103)
Q Consensus         6 ~at~gWfIg~~~ai~lLllilli~C~i~R~rG   37 (103)
                      |..-+||.+++++..+  +..+.+++.||+||
T Consensus        45 F~a~a~f~~l~lv~Gv--vaav~~W~~R~~RG   74 (167)
T PF10821_consen   45 FDADALFVLLGLVLGV--VAAVAVWLWRRRRG   74 (167)
T ss_pred             hhHHHHHHHHHHHHHH--HHHHHHHHHHhhcC
Confidence            4555677666655443  34455677666665


No 275
>PRK09174 F0F1 ATP synthase subunit B'; Validated
Probab=29.61  E-value=58  Score=24.85  Aligned_cols=26  Identities=23%  Similarity=0.229  Sum_probs=10.1

Q ss_pred             CCcchhhhhHHHHHHHHHHHHHHHHHH
Q psy17635          2 PSDTVATAGWFIGMLLAIAFLILVLIL   28 (103)
Q Consensus         2 ~~~~~at~gWfIg~~~ai~lLllilli   28 (103)
                      |+-.+.+..|-+.. .+|.|+||++++
T Consensus        45 p~~~~~~~~~~l~w-~~I~FliL~~lL   70 (204)
T PRK09174         45 PPFDSTHYASQLLW-LAITFGLFYLFM   70 (204)
T ss_pred             CCCcchhccHHHHH-HHHHHHHHHHHH
Confidence            44444433333322 233444444443


No 276
>PF05808 Podoplanin:  Podoplanin;  InterPro: IPR008783 This family consists of several mammalian podoplanin-like proteins which are thought to control specifically the unique shape of podocytes [].; GO: 0016021 integral to membrane; PDB: 3IET_X.
Probab=29.61  E-value=18  Score=27.50  Aligned_cols=29  Identities=34%  Similarity=0.726  Sum_probs=0.0

Q ss_pred             hHHHHHHHHHHHHHHHHHHHhheeecCCCccc
Q psy17635         10 GWFIGMLLAIAFLILVLILVCLIKRNRGGKYA   41 (103)
Q Consensus        10 gWfIg~~~ai~lLllilli~C~i~R~rGgKY~   41 (103)
                      |=++|+|+||.||-   =|+.++.|.=.|.|+
T Consensus       133 GIIVGVLlaIG~ig---GIIivvvRKmSGRys  161 (162)
T PF05808_consen  133 GIIVGVLLAIGFIG---GIIIVVVRKMSGRYS  161 (162)
T ss_dssp             --------------------------------
T ss_pred             eehhhHHHHHHHHh---heeeEEeehhccccC
Confidence            33455555544433   223334455567775


No 277
>PF15192 TMEM213:  TMEM213 family
Probab=29.60  E-value=84  Score=21.38  Aligned_cols=25  Identities=32%  Similarity=0.668  Sum_probs=14.2

Q ss_pred             chhhhhHHHHHHHH-HHHHHHHHHHHhh
Q psy17635          5 TVATAGWFIGMLLA-IAFLILVLILVCL   31 (103)
Q Consensus         5 ~~at~gWfIg~~~a-i~lLllilli~C~   31 (103)
                      +.---||+.+.+.= +-||.|||  +|+
T Consensus        41 gvDeyGWIAAAVGWSLwFLTLIL--LCv   66 (82)
T PF15192_consen   41 GVDEYGWIAAAVGWSLWFLTLIL--LCV   66 (82)
T ss_pred             CCchhhHHHHHHhHHHHHHHHHH--HHH
Confidence            34456898776665 44444544  454


No 278
>PF06160 EzrA:  Septation ring formation regulator, EzrA ;  InterPro: IPR010379 During the bacterial cell cycle, the tubulin-like cell-division protein FtsZ polymerises into a ring structure that establishes the location of the nascent division site. EzrA modulates the frequency and position of FtsZ ring formation [].; GO: 0000921 septin ring assembly, 0005940 septin ring, 0016021 integral to membrane
Probab=29.59  E-value=15  Score=31.83  Aligned_cols=11  Identities=27%  Similarity=0.597  Sum_probs=4.0

Q ss_pred             HHHHHHhheee
Q psy17635         24 LVLILVCLIKR   34 (103)
Q Consensus        24 lilli~C~i~R   34 (103)
                      |+++++.++.|
T Consensus        10 l~~~~~~~~~r   20 (560)
T PF06160_consen   10 LIIYIIGYIYR   20 (560)
T ss_pred             HHHHHHHHHHH
Confidence            33333333333


No 279
>PF12301 CD99L2:  CD99 antigen like protein 2;  InterPro: IPR022078  This family of proteins is found in eukaryotes. Proteins in this family are typically between 165 and 237 amino acids in length. CD99L2 and CD99 are involved in trans-endothelial migration of neutrophils in vitro and in the recruitment of neutrophils into inflamed peritoneum. 
Probab=29.56  E-value=66  Score=24.30  Aligned_cols=23  Identities=22%  Similarity=0.299  Sum_probs=18.3

Q ss_pred             hhhhhHHHHHHHHHHHHHHHHHH
Q psy17635          6 VATAGWFIGMLLAIAFLILVLIL   28 (103)
Q Consensus         6 ~at~gWfIg~~~ai~lLllilli   28 (103)
                      -+.++=+.++++||+++|+-++.
T Consensus       111 ~~~~g~IaGIvsav~valvGAvs  133 (169)
T PF12301_consen  111 EAEAGTIAGIVSAVVVALVGAVS  133 (169)
T ss_pred             CcccchhhhHHHHHHHHHHHHHH
Confidence            45567788999999999888766


No 280
>PF08475 Baculo_VP91_N:  Viral capsid protein 91 N-terminal;  InterPro: IPR013682 This domain is found in Baculoviridae including the nucleopolyhedrovirus at the N terminus of the viral capsid protein 91 (VP91) []. 
Probab=29.41  E-value=39  Score=26.02  Aligned_cols=6  Identities=33%  Similarity=0.656  Sum_probs=2.2

Q ss_pred             HHHHHH
Q psy17635         17 LAIAFL   22 (103)
Q Consensus        17 ~ai~lL   22 (103)
                      +||+|+
T Consensus         4 vai~l~    9 (183)
T PF08475_consen    4 VAILLI    9 (183)
T ss_pred             hHHHHH
Confidence            333333


No 281
>PF01098 FTSW_RODA_SPOVE:  Cell cycle protein;  InterPro: IPR001182 A number of prokaryotic integral membrane proteins involved in cell cycle processes have been found to be structurally related [, ]. These proteins include, the Escherichia coli and related bacteria cell division protein ftsW and the rod shape-determining protein rodA (or mrdB), the Bacillus subtilis stage V sporulation protein E (spoVE), the B. subtilis hypothetical proteins ywcF and ylaO and the Cyanophora paradoxa cyanelle ftsW homolog.; GO: 0007049 cell cycle, 0016021 integral to membrane
Probab=29.26  E-value=54  Score=26.29  Aligned_cols=30  Identities=30%  Similarity=0.716  Sum_probs=18.0

Q ss_pred             hhhhHHHHHHHHHHHHHHHHHHHhheeecC
Q psy17635          7 ATAGWFIGMLLAIAFLILVLILVCLIKRNR   36 (103)
Q Consensus         7 at~gWfIg~~~ai~lLllilli~C~i~R~r   36 (103)
                      ..-||+.++++.++++++++.++...+|.|
T Consensus       263 ee~G~ig~i~v~~l~~~l~~~~~~~a~~~~  292 (358)
T PF01098_consen  263 EEFGWIGGILVLLLFLLLIFRLFRIARRAK  292 (358)
T ss_pred             HHHhHHHHHHHHHHHHHHHHHHHHHHhhcC
Confidence            345777666666666666666655555533


No 282
>PF04272 Phospholamban:  Phospholamban;  InterPro: IPR005984  Phospholamban (PLB) is a small protein (52 amino acids) that regulates the affinity of the cardiac sarcoplasmic reticulum Ca2+-ATPase (SERCA2a) for calcium. PLB is present in cardiac myocytes, in slow-twitch and smooth muscle and is expressed also in aorta endothelial cells in which it could play a role in tissue relaxation. The phosphorylation/dephosphorylation of phospholamban removes and restores, respectively, its inhibitory activity on SERCA2a. It has in fact been shown that phospholamban, in its non-phosphorylated form, binds to SERCA2a and inhibits this pump by lowering its affinity for Ca2+, whereas the phosphorylated form does not exert the inhibition. PLB is phosphorylated at two sites, namely at Ser-16 for a cAMP-dependent phosphokinase and at Thr-17 for a Ca2+/calmodulin-dependent phosphokinase, phosphorylation at Ser-16 being a prerequisite for the phosphorylation at Thr-17.   The structure of a 36-amino-acid-long N-terminal fragment of human phospholamban phosphorylated at Ser-16 and Thr-17 and Cys36Ser mutated was determined from nuclear magnetic resonance data. The peptide assumes a conformation characterised by two alpha-helices connected by an irregular strand, which comprises the amino acids from Arg-13 to Pro-21. The proline is in a trans conformation. The two phosphate groups on Ser-16 and Thr-17 are shown to interact preferably with the side chains of Arg-14 and Arg-13, respectively [].; GO: 0005246 calcium channel regulator activity, 0042030 ATPase inhibitor activity, 0006816 calcium ion transport, 0016020 membrane; PDB: 1N7L_A 1FJP_A 1FJK_A 2HYN_C 1ZLL_D 1PLP_A 3O7L_I.
Probab=29.13  E-value=73  Score=19.84  Aligned_cols=22  Identities=27%  Similarity=0.525  Sum_probs=10.6

Q ss_pred             hhhHHHHHHHHHHHHHHHHHHH
Q psy17635          8 TAGWFIGMLLAIAFLILVLILV   29 (103)
Q Consensus         8 t~gWfIg~~~ai~lLllilli~   29 (103)
                      -|--|+-+...+.-|+||.+|+
T Consensus        28 lqelfvnfclilicllli~iiv   49 (52)
T PF04272_consen   28 LQELFVNFCLILICLLLICIIV   49 (52)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHHHHHHHH
Confidence            3455665544444444444443


No 283
>PF10183 ESSS:  ESSS subunit of NADH:ubiquinone oxidoreductase (complex I) ;  InterPro: IPR019329  NADH:ubiquinone oxidoreductase (complex I) (1.6.5.3 from EC) is a respiratory-chain enzyme that catalyses the transfer of two electrons from NADH to ubiquinone in a reaction that is associated with proton translocation across the membrane (NADH + ubiquinone = NAD+ + ubiquinol) []. Complex I is a major source of reactive oxygen species (ROS) that are predominantly formed by electron transfer from FMNH(2). Complex I is found in bacteria, cyanobacteria (as a NADH-plastoquinone oxidoreductase), archaea [], mitochondira, and in the hydrogenosome, a mitochondria-derived organelle. In general, the bacterial complex consists of 14 different subunits, while the mitochondrial complex contains homologues to these subunits in addition to approximately 31 additional proteins []. Mitochondrial complex I, which is located in the inner mitochondrial membrane, is the largest multimeric respiratory enzyme in the mitochondria, consisting of more than 40 subunits, one FMN co-factor and eight FeS clusters []. The assembly of mitochondrial complex I is an intricate process that requires the cooperation of the nuclear and mitochondrial genomes [, ]. Mitochondrial complex I can cycle between active and deactive forms that can be distinguished by the reactivity towards divalent cations and thiol-reactive agents. All redox prosthetic groups reside in the peripheral arm of the L-shaped structure. The NADH oxidation domain harbouring the FMN cofactor is connected via a chain of iron-sulphur clusters to the ubiquinone reduction site that is located in a large pocket formed by the PSST and 49kDa subunits of complex I [].  This entry represents the ESSS subunit from mitochondrial NADH:ubiquinone oxidoreductase (complex I). It carries mitochondrial import sequences []. 
Probab=29.09  E-value=72  Score=21.87  Aligned_cols=18  Identities=22%  Similarity=0.344  Sum_probs=10.3

Q ss_pred             hhHHHHHHHHHHHHHHHH
Q psy17635          9 AGWFIGMLLAIAFLILVL   26 (103)
Q Consensus         9 ~gWfIg~~~ai~lLllil   26 (103)
                      ..||+++...++++.+++
T Consensus        61 ~~~f~~~~~~~v~~~~~~   78 (105)
T PF10183_consen   61 LPFFFGFSGSLVFGGVFL   78 (105)
T ss_pred             HHHHHHHHHHHHHHHHHH
Confidence            356777666655554443


No 284
>COG4499 Predicted membrane protein [Function unknown]
Probab=29.04  E-value=58  Score=28.25  Aligned_cols=31  Identities=13%  Similarity=0.405  Sum_probs=15.5

Q ss_pred             CCcchhhhhHHH-HHHHHHHHHHHHHHHHhhe
Q psy17635          2 PSDTVATAGWFI-GMLLAIAFLILVLILVCLI   32 (103)
Q Consensus         2 ~~~~~at~gWfI-g~~~ai~lLllilli~C~i   32 (103)
                      |......+-||- |++.++|++++.+.-+.|.
T Consensus       213 pK~k~~ifk~~giGliillvl~li~~~Y~~f~  244 (434)
T COG4499         213 PKKKYTIFKYFGIGLIILLVLLLIYFTYYYFS  244 (434)
T ss_pred             ccccceehhhHHHhHHHHHHHHHHHHHHHHHH
Confidence            344456677874 4555554444444334433


No 285
>PF13124 DUF3963:  Protein of unknown function (DUF3963)
Probab=29.02  E-value=1.2e+02  Score=18.08  Aligned_cols=23  Identities=22%  Similarity=0.558  Sum_probs=16.3

Q ss_pred             hhhHHHHHHHHHHHHHHHHHHHh
Q psy17635          8 TAGWFIGMLLAIAFLILVLILVC   30 (103)
Q Consensus         8 t~gWfIg~~~ai~lLllilli~C   30 (103)
                      -|-|+=-+-+..+|+++++..++
T Consensus        16 iqkwirnit~cfal~vv~lvslw   38 (40)
T PF13124_consen   16 IQKWIRNITFCFALLVVVLVSLW   38 (40)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHH
Confidence            36788777777777777776655


No 286
>PF09988 DUF2227:  Uncharacterized metal-binding protein (DUF2227);  InterPro: IPR019250  This entry represents hypothetical bacterial proteins that possess metal binding properties; however, their exact function has not yet been determined. 
Probab=28.87  E-value=51  Score=24.82  Aligned_cols=29  Identities=31%  Similarity=0.520  Sum_probs=19.2

Q ss_pred             CCCcchhhhhHHHHHHHHHHHHHHHHHHH
Q psy17635          1 MPSDTVATAGWFIGMLLAIAFLILVLILV   29 (103)
Q Consensus         1 ~~~~~~at~gWfIg~~~ai~lLllilli~   29 (103)
                      ++|-++=+.+|+||.+.=|+-|.++++++
T Consensus        73 ~~HRs~lSH~piiGt~~RllYL~~~~~~l  101 (169)
T PF09988_consen   73 FRHRSFLSHGPIIGTLLRLLYLALWLLLL  101 (169)
T ss_pred             cCCCCcccccchhhHHHHHHHHHHHHHHH
Confidence            35667778899999877655555444443


No 287
>PRK00269 zipA cell division protein ZipA; Reviewed
Probab=28.85  E-value=32  Score=28.31  Aligned_cols=18  Identities=39%  Similarity=0.368  Sum_probs=10.5

Q ss_pred             HHhheeecCCCcccchhh
Q psy17635         28 LVCLIKRNRGGKYAVHER   45 (103)
Q Consensus        28 i~C~i~R~rGgKY~VkeK   45 (103)
                      ++--.+|-||||=-+|.|
T Consensus        21 l~~~~~r~r~~~gk~~~~   38 (293)
T PRK00269         21 LFDGWRRMRGGKGKLKFR   38 (293)
T ss_pred             HHHHHHHHhccccccccc
Confidence            333446777877655544


No 288
>PRK14471 F0F1 ATP synthase subunit B; Provisional
Probab=28.60  E-value=72  Score=22.83  Aligned_cols=22  Identities=32%  Similarity=0.437  Sum_probs=11.5

Q ss_pred             HHHHHHHHHHHHHHHHHHHhhe
Q psy17635         11 WFIGMLLAIAFLILVLILVCLI   32 (103)
Q Consensus        11 WfIg~~~ai~lLllilli~C~i   32 (103)
                      |...+.-+|.|+||++++-.|+
T Consensus         8 ~~~~~~~~i~Flil~~ll~~~l   29 (164)
T PRK14471          8 FGLFFWQTILFLILLLLLAKFA   29 (164)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHh
Confidence            5444445566666665554443


No 289
>TIGR03068 srtB_sig_NPQTN sortase B signal domain, NPQTN class. This model represents one of the boutique (rare) sortase signals, recognized by sortase B (SrtB) rather than by the housekeeping-type SrtA class sortase. This sequence, beginning NPQTN, shows little similarity to several other SrtB substrates.
Probab=28.33  E-value=97  Score=17.83  Aligned_cols=18  Identities=17%  Similarity=0.366  Sum_probs=10.2

Q ss_pred             CCcchhhhhHHHHHHHHH
Q psy17635          2 PSDTVATAGWFIGMLLAI   19 (103)
Q Consensus         2 ~~~~~at~gWfIg~~~ai   19 (103)
                      ||++.++-.+..++..+-
T Consensus         2 PQTs~gtp~y~y~Ip~v~   19 (33)
T TIGR03068         2 PQTNAGTPAYIYAIPVAS   19 (33)
T ss_pred             CCCCCCCcchhhHHHHHH
Confidence            566666665555554443


No 290
>PF08135 EPV_E5:  Major transforming protein E5 family;  InterPro: IPR012555 This family consists of the major transforming proteins (E5) of the bovine papilloma virus (BPV). The equine sarcoid is one of the most common dermatological lesion in equids. It is a benign, locally invasive dermal fibroblastic lesion and studies have shown an association of the lesions with BPV. E5 is a short hydrophobic membrane protein localising to the Golgi apparatus and other intracellular membranes. It binds to and constitutively activates the platelet-derived growth factor-beta in transformed cells. This stimulation activates a receptor signalling cascade which results in an intracellular growth stimulatory signal [].
Probab=28.22  E-value=98  Score=18.82  Aligned_cols=13  Identities=38%  Similarity=0.874  Sum_probs=5.2

Q ss_pred             HHHHHHHHHHHHH
Q psy17635         13 IGMLLAIAFLILV   25 (103)
Q Consensus        13 Ig~~~ai~lLlli   25 (103)
                      .+++-|+-|++|+
T Consensus        10 lgl~~~lQL~LL~   22 (44)
T PF08135_consen   10 LGLTFALQLLLLV   22 (44)
T ss_pred             HHHHHHHHHHHHH
Confidence            3444444344433


No 291
>PF07760 DUF1616:  Protein of unknown function (DUF1616);  InterPro: IPR011674 This is a group of sequences from hypothetical archaeal proteins. The region in question is approximately 330 amino acid residues long.
Probab=28.22  E-value=47  Score=26.23  Aligned_cols=24  Identities=33%  Similarity=0.715  Sum_probs=18.1

Q ss_pred             HHHHHHHHHHHHHHHHHhheeecC
Q psy17635         13 IGMLLAIAFLILVLILVCLIKRNR   36 (103)
Q Consensus        13 Ig~~~ai~lLllilli~C~i~R~r   36 (103)
                      .-++.++.++++++++++++||.|
T Consensus        87 ~~i~~~l~~~t~~~~~~a~~rr~~  110 (287)
T PF07760_consen   87 IPILISLSIFTLVLSIIAYIRRRR  110 (287)
T ss_pred             hHHHHHHHHHHHHHHHHHHHhccc
Confidence            445666778888888888888876


No 292
>PF07219 HemY_N:  HemY protein N-terminus;  InterPro: IPR010817 This entry represents the N terminus (approximately 150 residues) of bacterial HemY porphyrin biosynthesis proteins. These are membrane protein involved in a late step of protoheme IX synthesis [].
Probab=28.17  E-value=87  Score=21.08  Aligned_cols=22  Identities=36%  Similarity=0.679  Sum_probs=11.0

Q ss_pred             hhhhHHHHHHHHHHHHHHHHHH
Q psy17635          7 ATAGWFIGMLLAIAFLILVLIL   28 (103)
Q Consensus         7 at~gWfIg~~~ai~lLllilli   28 (103)
                      -+.-|+..++.+++++++.+++
T Consensus        13 e~sl~~~~~~l~~~~~~l~ll~   34 (108)
T PF07219_consen   13 ETSLWVALILLLLLFVVLYLLL   34 (108)
T ss_pred             EeeHHHHHHHHHHHHHHHHHHH
Confidence            3445666555554444444433


No 293
>TIGR00847 ccoS cytochrome oxidase maturation protein, cbb3-type. CcoS from Rhodobacter capsulatus has been shown essential for incorporation of redox-active prosthetic groups (heme, Cu) into cytochrome cbb(3) oxidase. FixS of Bradyrhizobium japonicum appears to have the same function. Members of this family are found so far in organisms with a cbb3-type cytochrome oxidase, including Neisseria meningitidis, Helicobacter pylori, Campylobacter jejuni, Caulobacter crescentus, Bradyrhizobium japonicum, and Rhodobacter capsulatus.
Probab=28.15  E-value=63  Score=20.00  Aligned_cols=27  Identities=26%  Similarity=0.458  Sum_probs=11.2

Q ss_pred             HHHHHHHHHHHHHHHHhheeecCCCcc
Q psy17635         14 GMLLAIAFLILVLILVCLIKRNRGGKY   40 (103)
Q Consensus        14 g~~~ai~lLllilli~C~i~R~rGgKY   40 (103)
                      .+|+.+.+++.++.++.|+.-=|.|.|
T Consensus         5 ~~LIpiSl~l~~~~l~~f~Wavk~GQf   31 (51)
T TIGR00847         5 TILIPISLLLGGVGLVAFLWSLKSGQY   31 (51)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHccCCC
Confidence            344444444433333333333345566


No 294
>PRK14474 F0F1 ATP synthase subunit B; Provisional
Probab=28.13  E-value=53  Score=25.70  Aligned_cols=22  Identities=41%  Similarity=0.739  Sum_probs=15.2

Q ss_pred             HHHHHHHHHHHHHHHHHHHhhe
Q psy17635         11 WFIGMLLAIAFLILVLILVCLI   32 (103)
Q Consensus        11 WfIg~~~ai~lLllilli~C~i   32 (103)
                      |+..++-+|.|+||++++--|.
T Consensus         5 ~~t~~~qiInFlILv~lL~~fl   26 (250)
T PRK14474          5 WFTVVAQIINFLILVYLLRRFL   26 (250)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHH
Confidence            7777777777777776665544


No 295
>MTH00152 ND6 NADH dehydrogenase subunit 6; Provisional
Probab=27.94  E-value=85  Score=22.86  Aligned_cols=29  Identities=31%  Similarity=0.590  Sum_probs=13.4

Q ss_pred             hhHHHHHHHHHHHHHHHHHHHhheeecCC
Q psy17635          9 AGWFIGMLLAIAFLILVLILVCLIKRNRG   37 (103)
Q Consensus         9 ~gWfIg~~~ai~lLllilli~C~i~R~rG   37 (103)
                      ..|.+-++.++.|++-.+.++-.+++++|
T Consensus       132 ~~~~~~i~l~~~Lll~Li~Vvki~~~~~g  160 (163)
T MTH00152        132 SGWSIFVFLAVLLLIVMVSVVKICYYEEG  160 (163)
T ss_pred             CcHHHHHHHHHHHHHHHHHHHhhcCCCCC
Confidence            34555555554444333333444455544


No 296
>PRK13461 F0F1 ATP synthase subunit B; Provisional
Probab=27.89  E-value=76  Score=22.57  Aligned_cols=22  Identities=32%  Similarity=0.362  Sum_probs=9.4

Q ss_pred             HHHHHHHHHHHHHHHHHHHhhe
Q psy17635         11 WFIGMLLAIAFLILVLILVCLI   32 (103)
Q Consensus        11 WfIg~~~ai~lLllilli~C~i   32 (103)
                      |...+.-+|.|++|++++-.|+
T Consensus         5 ~~~~~~~~inF~il~~iL~~f~   26 (159)
T PRK13461          5 IPTIIATIINFIILLLILKHFF   26 (159)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHh
Confidence            4333333444555554444333


No 297
>PF01456 Mucin:  Mucin-like glycoprotein;  InterPro: IPR000458 This family of trypanosomal proteins resemble vertebrate mucins. The protein consists of three regions. The N and C terminii are conserved between all members of the family, whereas the central region is not well conserved and contains a large number of threonine residues which can be glycosylated []. Indirect evidence suggested that these genes might encode the core protein of parasite mucins, glycoproteins that were proposed to be involved in the interaction with, and invasion of, mammalian host cells.
Probab=27.83  E-value=43  Score=23.26  Aligned_cols=10  Identities=40%  Similarity=0.554  Sum_probs=5.7

Q ss_pred             HHHHHHHHHH
Q psy17635         15 MLLAIAFLIL   24 (103)
Q Consensus        15 ~~~ai~lLll   24 (103)
                      ||||+++|-|
T Consensus         6 LLCalLvlaL   15 (143)
T PF01456_consen    6 LLCALLVLAL   15 (143)
T ss_pred             HHHHHHHHHH
Confidence            5777554444


No 298
>KOG4433|consensus
Probab=27.57  E-value=70  Score=28.43  Aligned_cols=6  Identities=33%  Similarity=0.955  Sum_probs=2.8

Q ss_pred             hheeec
Q psy17635         30 CLIKRN   35 (103)
Q Consensus        30 C~i~R~   35 (103)
                      ||.||.
T Consensus        69 cC~Rr~   74 (526)
T KOG4433|consen   69 CCCRRE   74 (526)
T ss_pred             HHcCCC
Confidence            544444


No 299
>PF09777 OSTMP1:  Osteopetrosis-associated transmembrane protein 1 precursor;  InterPro: IPR019172 Osteopetrosis-associated transmembrane protein 1 (OSTM1) is required for osteoclast and melanocyte maturation and function. Mutations in OSTM1 give rise to autosomal recessive osteopetrosis, also called autosomal recessive Albers-Schonberg disease [, ]. 
Probab=27.53  E-value=90  Score=24.43  Aligned_cols=24  Identities=17%  Similarity=0.279  Sum_probs=16.7

Q ss_pred             hhHHHHHHHHHHHHHHHHHHHhhe
Q psy17635          9 AGWFIGMLLAIAFLILVLILVCLI   32 (103)
Q Consensus         9 ~gWfIg~~~ai~lLllilli~C~i   32 (103)
                      .+|||++...+++|.+++.+.-++
T Consensus       189 ~~~~i~v~~~vl~lpv~FY~~s~~  212 (237)
T PF09777_consen  189 ETAVIAVSVFVLFLPVLFYLSSYL  212 (237)
T ss_pred             hhHHHHHHHHHHHHHHHHHHhhee
Confidence            467788877777777777665444


No 300
>COG3105 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=27.51  E-value=97  Score=23.03  Aligned_cols=28  Identities=14%  Similarity=0.266  Sum_probs=19.8

Q ss_pred             hhhhHHHHHHHHHHHHHHHHHHHhheee
Q psy17635          7 ATAGWFIGMLLAIAFLILVLILVCLIKR   34 (103)
Q Consensus         7 at~gWfIg~~~ai~lLllilli~C~i~R   34 (103)
                      .+..|..+++..||-++|-.||+-+.+|
T Consensus         4 t~~~W~~a~igLvvGi~IG~li~Rlt~~   31 (138)
T COG3105           4 TFMTWEYALIGLVVGIIIGALIARLTNR   31 (138)
T ss_pred             hHHHHHHHHHHHHHHHHHHHHHHHHcch
Confidence            4567888877777777777777766654


No 301
>MTH00045 ND6 NADH dehydrogenase subunit 6; Validated
Probab=27.39  E-value=80  Score=23.22  Aligned_cols=27  Identities=11%  Similarity=0.209  Sum_probs=13.5

Q ss_pred             hhHHHHHHHHHHHHHHHHHHHhheeec
Q psy17635          9 AGWFIGMLLAIAFLILVLILVCLIKRN   35 (103)
Q Consensus         9 ~gWfIg~~~ai~lLllilli~C~i~R~   35 (103)
                      .++.+-++|+..|++-.+.++=++|.+
T Consensus       128 ~g~~~li~~~~~LLltL~vVl~i~~~~  154 (162)
T MTH00045        128 SGGLYLLLGGFILLVALVVALVISFGH  154 (162)
T ss_pred             CCHHHHHHHHHHHHHHhhhhhheecCC
Confidence            345555666655554444444444444


No 302
>PRK03814 oxaloacetate decarboxylase subunit gamma; Provisional
Probab=27.36  E-value=1.1e+02  Score=20.57  Aligned_cols=12  Identities=25%  Similarity=0.667  Sum_probs=5.2

Q ss_pred             HHHHHHHHHHHh
Q psy17635         19 IAFLILVLILVC   30 (103)
Q Consensus        19 i~lLllilli~C   30 (103)
                      .+++.|++||.|
T Consensus        21 ~VF~fL~lLi~~   32 (85)
T PRK03814         21 VVFIFLTLLVYL   32 (85)
T ss_pred             HHHHHHHHHHHH
Confidence            444444444443


No 303
>PHA03289 envelope glycoprotein I; Provisional
Probab=27.19  E-value=61  Score=27.43  Aligned_cols=16  Identities=38%  Similarity=0.661  Sum_probs=6.6

Q ss_pred             HHHHHHHHHHHHHHHh
Q psy17635         15 MLLAIAFLILVLILVC   30 (103)
Q Consensus        15 ~~~ai~lLllilli~C   30 (103)
                      ++|.|+|||..++..|
T Consensus       277 ~~~~~~l~i~~~~~~~  292 (352)
T PHA03289        277 VLCIIALLIVTVCSAC  292 (352)
T ss_pred             HHHHHHHHHHHHHHHc
Confidence            3444444444433333


No 304
>PRK11901 hypothetical protein; Reviewed
Probab=27.05  E-value=79  Score=26.50  Aligned_cols=30  Identities=20%  Similarity=0.365  Sum_probs=17.7

Q ss_pred             CcchhhhhHHHHHHHHHHHHHHHHHHHhhee
Q psy17635          3 SDTVATAGWFIGMLLAIAFLILVLILVCLIK   33 (103)
Q Consensus         3 ~~~~at~gWfIg~~~ai~lLllilli~C~i~   33 (103)
                      .-..+.|--+||+=+. ||||||+-|---+|
T Consensus        30 k~~vSRQh~MiGiGil-VLlLLIi~IgSALk   59 (327)
T PRK11901         30 KLAVSRQHMMIGIGIL-VLLLLIIAIGSALK   59 (327)
T ss_pred             CchHHHHHHHHHHHHH-HHHHHHHHHhhhcc
Confidence            3446677778887655 55555555544344


No 305
>KOG3249|consensus
Probab=27.01  E-value=94  Score=24.04  Aligned_cols=10  Identities=60%  Similarity=1.002  Sum_probs=4.9

Q ss_pred             eecCCCc--ccc
Q psy17635         33 KRNRGGK--YAV   42 (103)
Q Consensus        33 ~R~rGgK--Y~V   42 (103)
                      +|.||.+  |+|
T Consensus       144 ~rk~gEmSAYSV  155 (181)
T KOG3249|consen  144 KRKRGEMSAYSV  155 (181)
T ss_pred             CCCCCccchhhh
Confidence            4555544  444


No 306
>PF11857 DUF3377:  Domain of unknown function (DUF3377);  InterPro: IPR021805  This domain is functionally uncharacterised and found at the C terminus of peptidases belonging to MEROPS peptidase family M10A, membrane-type matrix metallopeptidases (clan MA). ; GO: 0004222 metalloendopeptidase activity
Probab=26.96  E-value=61  Score=21.73  Aligned_cols=6  Identities=17%  Similarity=0.440  Sum_probs=3.4

Q ss_pred             eecCCC
Q psy17635         33 KRNRGG   38 (103)
Q Consensus        33 ~R~rGg   38 (103)
                      -|++|.
T Consensus        54 fkrkGt   59 (74)
T PF11857_consen   54 FKRKGT   59 (74)
T ss_pred             eeecCC
Confidence            466664


No 307
>PF10327 7TM_GPCR_Sri:  Serpentine type 7TM GPCR chemoreceptor Sri;  InterPro: IPR019429 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents Sri, which is part of the Str superfamily of chemoreceptors.
Probab=26.92  E-value=37  Score=26.87  Aligned_cols=22  Identities=5%  Similarity=0.528  Sum_probs=13.7

Q ss_pred             HHHHHHHHHHHHHHHhheeecC
Q psy17635         15 MLLAIAFLILVLILVCLIKRNR   36 (103)
Q Consensus        15 ~~~ai~lLllilli~C~i~R~r   36 (103)
                      +..+++..-+-.|++||++|++
T Consensus        96 i~~~l~~~q~~~L~~CF~~KHQ  117 (303)
T PF10327_consen   96 IWIFLFIFQIESLVICFLRKHQ  117 (303)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHH
Confidence            3344444445677889998753


No 308
>PHA00476 hypothetical protein
Probab=26.89  E-value=2.4e+02  Score=20.18  Aligned_cols=43  Identities=21%  Similarity=0.261  Sum_probs=22.9

Q ss_pred             hHHHHHHHHHHHHHHH-HHHHhheeecCCCcc-----cchhhhhhcCCC
Q psy17635         10 GWFIGMLLAIAFLILV-LILVCLIKRNRGGKY-----AVHEREAAHGRH   52 (103)
Q Consensus        10 gWfIg~~~ai~lLlli-lli~C~i~R~rGgKY-----~VkeKE~~~g~~   52 (103)
                      .||--+.+-++++++. +|-.+=..|..+|--     +-||||-++|.+
T Consensus        38 ~lssfvfsSvallvil~~L~TW~TTkkpdgLNRla~l~~kekeI~AG~~   86 (110)
T PHA00476         38 TLSSFVFSSVALLVILVLLGTWSTTRKPDGLNRLATLAEKEKEILAGKA   86 (110)
T ss_pred             HHHHHHHHHHHHHHHHHHHhccccccCCCcchHHHHHHHHHHHHHcCCC
Confidence            3444444444444433 333455566656643     366777777766


No 309
>PF14030 DUF4245:  Protein of unknown function (DUF4245)
Probab=26.75  E-value=73  Score=23.55  Aligned_cols=16  Identities=13%  Similarity=0.256  Sum_probs=7.1

Q ss_pred             HHHHHHHHHHHHHhhe
Q psy17635         17 LAIAFLILVLILVCLI   32 (103)
Q Consensus        17 ~ai~lLllilli~C~i   32 (103)
                      +.+++++++++|+.++
T Consensus        12 SL~vl~~~~~~i~~~~   27 (169)
T PF14030_consen   12 SLAVLVAIVALIVAGV   27 (169)
T ss_pred             HHHHHHHHHHHHHHhh
Confidence            3344444444553444


No 310
>PF07672 MFS_Mycoplasma:  Mycoplasma MFS transporter;  InterPro: IPR011699 These proteins share some similarity with members of the Major Facilitator Superfamily (MFS).
Probab=26.61  E-value=82  Score=25.58  Aligned_cols=31  Identities=19%  Similarity=0.344  Sum_probs=21.1

Q ss_pred             hhhhHHHHHHHHHHHHHHHHHHHhheeecCC
Q psy17635          7 ATAGWFIGMLLAIAFLILVLILVCLIKRNRG   37 (103)
Q Consensus         7 at~gWfIg~~~ai~lLllilli~C~i~R~rG   37 (103)
                      ..+.|||+.|..+.++++++.++|......+
T Consensus       169 ~kRk~fi~~~~~~gi~~~~l~~~~~~~~g~~  199 (267)
T PF07672_consen  169 YKRKPFIHFIISLGIVFFVLSIVVVYFVGPG  199 (267)
T ss_pred             hhhhhHHHHHHHHHHHHHHHHHHHHHHhCcc
Confidence            5678999988777777777677773344333


No 311
>TIGR00810 secG protein translocase, SecG subunit. This family of proteins forms a complex with SecY and SecE. SecA then recruits the SecYEG complex to form an active protein translocation channel.
Probab=26.55  E-value=94  Score=19.80  Aligned_cols=14  Identities=36%  Similarity=0.752  Sum_probs=7.6

Q ss_pred             HHHHHhheeecCCC
Q psy17635         25 VLILVCLIKRNRGG   38 (103)
Q Consensus        25 illi~C~i~R~rGg   38 (103)
                      +++++.+....||+
T Consensus        14 ~LI~~vLlQ~~k~~   27 (73)
T TIGR00810        14 LLIGLVLLQSGKGG   27 (73)
T ss_pred             HHHHHheeecCCCC
Confidence            33344556666665


No 312
>PLN03150 hypothetical protein; Provisional
Probab=26.48  E-value=39  Score=29.48  Aligned_cols=10  Identities=30%  Similarity=0.644  Sum_probs=4.6

Q ss_pred             HHHHhheeec
Q psy17635         26 LILVCLIKRN   35 (103)
Q Consensus        26 lli~C~i~R~   35 (103)
                      ++++|+.+|.
T Consensus       562 ~~~~~~~~~r  571 (623)
T PLN03150        562 ICAMCWWKRR  571 (623)
T ss_pred             HHHhhheeeh
Confidence            3334555554


No 313
>PF11696 DUF3292:  Protein of unknown function (DUF3292);  InterPro: IPR021709  This eukaryotic family of proteins has no known function. 
Probab=26.46  E-value=70  Score=29.07  Aligned_cols=24  Identities=21%  Similarity=0.607  Sum_probs=17.5

Q ss_pred             hhhhhHHHHHHHHHHHHHHHHHHH
Q psy17635          6 VATAGWFIGMLLAIAFLILVLILV   29 (103)
Q Consensus         6 ~at~gWfIg~~~ai~lLllilli~   29 (103)
                      .....|+.-++...++.+||+||+
T Consensus       133 vYf~aW~~dll~p~~~~~L~~li~  156 (642)
T PF11696_consen  133 VYFIAWLLDLLVPAFFAFLIALIL  156 (642)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHhc
Confidence            345678888888877777777775


No 314
>PF11466 Doppel:  Prion-like protein Doppel;  InterPro: IPR021566  Dpl is a homologue related to the prion protein (PrP). Dpl is toxic to neurons and is expressed in the brains of mice that do not express PrP. In DHPC and SDS micelles, Dpl shoes about 40% alpha-helical structure however in aqueous solution it consists of a random coil. The alpha helical segment can adopt a transmembrane localisation also in a membrane. The unprocessed Dpl protein is thought to posses a possible channel formation mechanism which may be related to toxicity through direct interaction with cell membranes and damage to the cell membrane. ; PDB: 1Z65_A.
Probab=26.41  E-value=62  Score=18.27  Aligned_cols=12  Identities=25%  Similarity=1.088  Sum_probs=8.0

Q ss_pred             hHHHHHHHHHHH
Q psy17635         10 GWFIGMLLAIAF   21 (103)
Q Consensus        10 gWfIg~~~ai~l   21 (103)
                      +|-++++|.+++
T Consensus         7 ~~~lAi~c~LL~   18 (30)
T PF11466_consen    7 GWWLAIVCVLLF   18 (30)
T ss_dssp             SHHHHHHHHHHH
T ss_pred             hHHHHHHHHHHH
Confidence            677777776443


No 315
>PF02430 AMA-1:  Apical membrane antigen 1;  InterPro: IPR003298 A novel antigen of Plasmodium falciparum has been cloned that contains a hydrophobic domain typical of an integral membrane protein. The antigen is designated apical membrane antigen 1 (AMA-1) by virtue of appearing to be located in the apical complex []. AMA-1 appears to be transported to the merozoite surface close to the time of schizont rupture. The 66kDa merozoite surface antigen (PK66) of Plasmodium knowlesi, a simian malaria, possesses vaccine-related properties believed to originate from a receptor-like role in parasite invasion of erythrocytes []. The sequence of PK66 is conserved throughout plasmodium, and shows high similarity to P. falciparum AMA-1. Following schizont rupture, the distribution of PK66 changes in a coordinate manner associated with merozoite invasion. Prior to rupture, the protein is concentrated at the apical end, following which it distributes itself entirely across the surface of the free merozoite. Immunofluorescence studies suggest that, during invasion, PK66 is excluded from the erythrocyte at, and behind, the invasion interface []. ; GO: 0009405 pathogenesis, 0016020 membrane; PDB: 2Y8R_D 2X2Z_A 2Y8T_A 2Y8S_D 1HN6_A 2Z8V_B 2Z8W_A 1Z40_E 2Q8B_A 2Q8A_A ....
Probab=26.38  E-value=22  Score=31.11  Aligned_cols=25  Identities=24%  Similarity=0.265  Sum_probs=0.0

Q ss_pred             hHHHHHHHHHHHHHHHHHHHhheee
Q psy17635         10 GWFIGMLLAIAFLILVLILVCLIKR   34 (103)
Q Consensus        10 gWfIg~~~ai~lLllilli~C~i~R   34 (103)
                      -=.|.+++++++.||+|+++++.||
T Consensus       429 ~~~~~~~~~~~~~~~~l~~~~~~k~  453 (471)
T PF02430_consen  429 RIVIIISIATGAVLLALAIYWYYKR  453 (471)
T ss_dssp             -------------------------
T ss_pred             eeehhhHHHHHHHHHHHHHHHHHhh
Confidence            3456677778888888888887774


No 316
>PF06678 DUF1179:  Protein of unknown function (DUF1179);  InterPro: IPR009564 This family consists of several hypothetical Caenorhabditis elegans proteins of around 106 residues in length. The function of the family is unknown.
Probab=26.36  E-value=93  Score=22.08  Aligned_cols=10  Identities=30%  Similarity=0.514  Sum_probs=5.4

Q ss_pred             hHHHHHHHHH
Q psy17635         10 GWFIGMLLAI   19 (103)
Q Consensus        10 gWfIg~~~ai   19 (103)
                      -|||.-.+.+
T Consensus         4 f~~i~~~i~l   13 (103)
T PF06678_consen    4 FWFILKSIFL   13 (103)
T ss_pred             hHHHHHHHHH
Confidence            4776554444


No 317
>COG3736 VirB8 Type IV secretory pathway, component VirB8 [Intracellular trafficking and secretion]
Probab=26.32  E-value=1e+02  Score=24.65  Aligned_cols=24  Identities=17%  Similarity=0.502  Sum_probs=11.7

Q ss_pred             hhHHHHHHHHHHHHHHHHHHHhhe
Q psy17635          9 AGWFIGMLLAIAFLILVLILVCLI   32 (103)
Q Consensus         9 ~gWfIg~~~ai~lLllilli~C~i   32 (103)
                      ..|.++.+..++.+++++.|+|++
T Consensus        42 ~~~~va~~~~~l~v~~~~~Ia~ll   65 (239)
T COG3736          42 LAWRVAILFTLLAVAAVIAIAILL   65 (239)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHhhc
Confidence            345555544444444445555544


No 318
>PF01561 Hanta_G2:  Hantavirus glycoprotein G2;  InterPro: IPR002532 The medium (M) genome segment of Hantaviruses (family Bunyaviridae) encodes the two virion glycoproteins [], G1 and G2, as a polyprotein precursor. This entry represents the polyprotein region which forms the G2 glycoprotein.; GO: 0030683 evasion by virus of host immune response, 0044423 virion part
Probab=26.27  E-value=34  Score=29.96  Aligned_cols=16  Identities=19%  Similarity=0.268  Sum_probs=6.7

Q ss_pred             HHHHHHHHHHHhheee
Q psy17635         19 IAFLILVLILVCLIKR   34 (103)
Q Consensus        19 i~lLllilli~C~i~R   34 (103)
                      |.|+-|||+++||=.|
T Consensus       465 lll~siil~s~~cP~r  480 (485)
T PF01561_consen  465 LLLLSIILFSFFCPVR  480 (485)
T ss_pred             HHHHHHHHHheeCcch
Confidence            3333344444444333


No 319
>TIGR01478 STEVOR variant surface antigen, stevor family. This model represents the stevor branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of stevor sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 8 bits.
Probab=26.25  E-value=51  Score=27.25  Aligned_cols=12  Identities=33%  Similarity=0.642  Sum_probs=4.8

Q ss_pred             HHHHHHHHHHHH
Q psy17635         17 LAIAFLILVLIL   28 (103)
Q Consensus        17 ~ai~lLllilli   28 (103)
                      |.|+.|+||+|+
T Consensus       261 cgiaalvllil~  272 (295)
T TIGR01478       261 YGIAALVLIILT  272 (295)
T ss_pred             cHHHHHHHHHHH
Confidence            444444443333


No 320
>PF01601 Corona_S2:  Coronavirus S2 glycoprotein;  InterPro: IPR002552 The type I glycoprotein S of Coronavirus, trimers of which constitute the typical viral spikes, is assembled into virions through noncovalent interactions with the M protein. The spike glycoprotein is translated as a large polypeptide that is subsequently cleaved to S1 IPR002551 from INTERPRO and S2 []. Both chimeric S proteins appeared to cause cell fusion when expressed individually, suggesting that they were biologically fully active []. The spike is a type I membrane glycoprotein that possesses a conserved transmembrane anchor and an unusual cysteine-rich (cys) domain that bridges the putative junction of the anchor and the cytoplasmic tail [].; GO: 0006944 cellular membrane fusion, 0046813 virion attachment, binding of host cell surface receptor, 0016021 integral to membrane, 0019031 viral envelope; PDB: 2BEQ_B 2FXP_A 1ZVB_A 1WNC_D 1ZV8_H 1ZV7_B 1WYY_B 1ZVA_A 2BEZ_F 1WDG_A ....
Probab=26.25  E-value=22  Score=32.01  Aligned_cols=15  Identities=13%  Similarity=0.740  Sum_probs=0.0

Q ss_pred             HHHHHHHHHHHHHHH
Q psy17635         11 WFIGMLLAIAFLILV   25 (103)
Q Consensus        11 WfIg~~~ai~lLlli   25 (103)
                      |-+.|+.++++++++
T Consensus       548 WyVWL~i~~~li~~~  562 (610)
T PF01601_consen  548 WYVWLAIILALIAFA  562 (610)
T ss_dssp             ---------------
T ss_pred             HHHHHHHHHHHHHHH
Confidence            666666555555444


No 321
>PF14316 DUF4381:  Domain of unknown function (DUF4381)
Probab=26.25  E-value=53  Score=23.28  Aligned_cols=28  Identities=32%  Similarity=0.592  Sum_probs=11.1

Q ss_pred             HHHHHHHHH-HHHHHHHHhh-eeecCCCcc
Q psy17635         13 IGMLLAIAF-LILVLILVCL-IKRNRGGKY   40 (103)
Q Consensus        13 Ig~~~ai~l-Lllilli~C~-i~R~rGgKY   40 (103)
                      +|--+.+++ ++++++++.+ .+|.+--+|
T Consensus        21 ~GWwll~~lll~~~~~~~~~~~r~~~~~~y   50 (146)
T PF14316_consen   21 PGWWLLLALLLLLLILLLWRLWRRWRRNRY   50 (146)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHccHH
Confidence            444444343 3333333333 233333445


No 322
>PF05454 DAG1:  Dystroglycan (Dystrophin-associated glycoprotein 1);  InterPro: IPR008465 Dystroglycan is one of the dystrophin-associated glycoproteins, which is encoded by a 5.5 kb transcript in Homo sapiens. The protein product is cleaved into two non-covalently associated subunits, [alpha] (N-terminal) and [beta] (C-terminal). In skeletal muscle the dystroglycan complex works as a transmembrane linkage between the extracellular matrix and the cytoskeleton [alpha]-dystroglycan is extracellular and binds to merosin ([alpha]-2 laminin) in the basement membrane, while [beta]-dystroglycan is a transmembrane protein and binds to dystrophin, which is a large rod-like cytoskeletal protein, absent in Duchenne muscular dystrophy patients. Dystrophin binds to intracellular actin cables. In this way, the dystroglycan complex, which links the extracellular matrix to the intracellular actin cables, is thought to provide structural integrity in muscle tissues. The dystroglycan complex is also known to serve as an agrin receptor in muscle, where it may regulate agrin-induced acetylcholine receptor clustering at the neuromuscular junction. There is also evidence which suggests the function of dystroglycan as a part of the signal transduction pathway because it is shown that Grb2, a mediator of the Ras-related signal pathway, can interact with the cytoplasmic domain of dystroglycan. In general, aberrant expression of dystrophin-associated protein complex underlies the pathogenesis of Duchenne muscular dystrophy, Becker muscular dystrophy and severe childhood autosomal recessive muscular dystrophy. Interestingly, no genetic disease has been described for either [alpha]- or [beta]-dystroglycan. Dystroglycan is widely distributed in non-muscle tissues as well as in muscle tissues. During epithelial morphogenesis of kidney, the dystroglycan complex is shown to act as a receptor for the basement membrane. Dystroglycan expression in Mus musculus brain and neural retina has also been reported. However, the physiological role of dystroglycan in non-muscle tissues has remained unclear [].; PDB: 1EG4_P.
Probab=26.24  E-value=22  Score=29.00  Aligned_cols=7  Identities=14%  Similarity=-0.206  Sum_probs=0.0

Q ss_pred             hheeecC
Q psy17635         30 CLIKRNR   36 (103)
Q Consensus        30 C~i~R~r   36 (103)
                      +.+.-=|
T Consensus       165 Ia~icyr  171 (290)
T PF05454_consen  165 IACICYR  171 (290)
T ss_dssp             -------
T ss_pred             HHHHhhh
Confidence            4444444


No 323
>CHL00080 psbM photosystem II protein M
Probab=26.19  E-value=1.4e+02  Score=17.26  Aligned_cols=19  Identities=26%  Similarity=0.396  Sum_probs=7.5

Q ss_pred             HHHHHHHHHHHHHHHHHHh
Q psy17635         12 FIGMLLAIAFLILVLILVC   30 (103)
Q Consensus        12 fIg~~~ai~lLllilli~C   30 (103)
                      ||+.+..|.+=...++|+.
T Consensus         8 fiAt~LFi~iPt~FLlily   26 (34)
T CHL00080          8 FIATALFILVPTAFLLIIY   26 (34)
T ss_pred             HHHHHHHHHHHHHHHHHhh
Confidence            4444433333333344443


No 324
>PRK14473 F0F1 ATP synthase subunit B; Provisional
Probab=26.17  E-value=81  Score=22.55  Aligned_cols=20  Identities=25%  Similarity=0.366  Sum_probs=9.9

Q ss_pred             HHHHHHHHHHHHHHHHHHHh
Q psy17635         11 WFIGMLLAIAFLILVLILVC   30 (103)
Q Consensus        11 WfIg~~~ai~lLllilli~C   30 (103)
                      |.+.+.-+|.|++|++++--
T Consensus         8 ~~~~~~~~inflil~~lL~~   27 (164)
T PRK14473          8 LGLLIAQLINFLLLIFLLRT   27 (164)
T ss_pred             HHHHHHHHHHHHHHHHHHHH
Confidence            44444445555555544433


No 325
>TIGR01655 yxeA_fam conserved hypothetical protein TIGR01655. This model represents a family of small (about 115 amino acids) uncharacterized proteins with N-terminal signal sequences, found exclusively in Gram-positive organisms. Most genomes that have any members of this family have at least two members.
Probab=26.12  E-value=73  Score=22.09  Aligned_cols=8  Identities=13%  Similarity=0.459  Sum_probs=4.2

Q ss_pred             cccchhhh
Q psy17635         39 KYAVHERE   46 (103)
Q Consensus        39 KY~VkeKE   46 (103)
                      .|=|+.+.
T Consensus        32 nyyvki~~   39 (114)
T TIGR01655        32 NYYIKIKN   39 (114)
T ss_pred             ccEEEEee
Confidence            45566554


No 326
>PRK00888 ftsB cell division protein FtsB; Reviewed
Probab=26.07  E-value=82  Score=21.66  Aligned_cols=12  Identities=25%  Similarity=0.293  Sum_probs=5.9

Q ss_pred             HHHHHHHHHHHH
Q psy17635         11 WFIGMLLAIAFL   22 (103)
Q Consensus        11 WfIg~~~ai~lL   22 (103)
                      ||+.++++++++
T Consensus         3 ~~~~vll~ll~~   14 (105)
T PRK00888          3 LLTLLLLALLVW   14 (105)
T ss_pred             HHHHHHHHHHHH
Confidence            555555444333


No 327
>PF10112 Halogen_Hydrol:  5-bromo-4-chloroindolyl phosphate hydrolysis protein;  InterPro: IPR018770 This entry consists of prokaryotic proteins that mediate the hydrolysis of 5-bromo-4-chloroindolyl phosphate bonds. 
Probab=26.01  E-value=86  Score=23.15  Aligned_cols=9  Identities=44%  Similarity=1.272  Sum_probs=4.8

Q ss_pred             hHHHHHHHH
Q psy17635         10 GWFIGMLLA   18 (103)
Q Consensus        10 gWfIg~~~a   18 (103)
                      +||+|++++
T Consensus        10 ~~~~~~~~~   18 (199)
T PF10112_consen   10 RWILGVLIA   18 (199)
T ss_pred             HHHHHHHHH
Confidence            456665444


No 328
>PF01167 Tub:  Tub family;  InterPro: IPR000007  Tubby, an autosomal recessive mutation, mapping to mouse chromosome 7, was recently found to be the result of a splicing defect in a novel gene with unknown function. This mutation maps to the tub gene [, ]. The mouse tubby mutation is the cause of maturity-onset obesity, insulin resistance and sensory deficits. By contrast with the rapid juvenile-onset weight gain seen in diabetes (db) and obese (ob) mice, obesity in tubby mice develops gradually, and strongly resembles the late-onset obesity observed in the human population. Excessive deposition of adipose tissue culminates in a two-fold increase of body weight. Tubby mice also suffer retinal degeneration and neurosensory hearing loss. The tripartite character of the tubby phenotype is highly similar to human obesity syndromes, such as Alstrom and Bardet-Biedl. Although these phenotypes indicate a vital role for tubby proteins, no biochemical function has yet been ascribed to any family member [], although it has been suggested that the phenotypic features of tubby mice may be the result of cellular apoptosis triggered by expression of the mutated tub gene. TUB is the founding-member of the tubby-like proteins, the TULPs. TULPs are found in multicellular organisms from both the plant and animal kingdoms. Ablation of members of this protein family cause disease phenotypes that are indicative of their importance in nervous-system function and development []. Mammalian TUB is a hydrophilic protein of ~500 residues. The N-terminal (IPR005398 from INTERPRO) portion of the protein is conserved neither in length nor sequence, but, in TUB, contains the nuclear localisation signal and may have transcriptional-activation activity. The C-terminal 250 residues are highly conserved. The C-terminal extremity contains a cysteine residue that might play an important role in the normal functioning of these proteins. The crystal structure of the C-terminal core domain from mouse tubby has been determined to 1.9A resolution. This domain is arranged as a 12-stranded, all anti-parallel, closed beta-barrel that surrounds a central alpha helix, (which is at the extreme carboxyl terminus of the protein) that forms most of the hydrophobic core. Structural analyses suggest that TULPs constitute a unique family of bipartite transcription factors [].; PDB: 3C5N_B 2FIM_A 1I7E_A 1C8Z_A 1S31_A.
Probab=25.93  E-value=21  Score=28.01  Aligned_cols=12  Identities=33%  Similarity=0.958  Sum_probs=9.7

Q ss_pred             HHHhheeecCCC
Q psy17635         27 ILVCLIKRNRGG   38 (103)
Q Consensus        27 li~C~i~R~rGg   38 (103)
                      .|.|+|+|+|+|
T Consensus         7 ~vqC~I~R~k~g   18 (246)
T PF01167_consen    7 PVQCFIRRDKSG   18 (246)
T ss_dssp             EEEEEEEEESTT
T ss_pred             EEEEEEEEECCC
Confidence            356999999876


No 329
>PRK10905 cell division protein DamX; Validated
Probab=25.78  E-value=50  Score=27.68  Aligned_cols=17  Identities=18%  Similarity=0.321  Sum_probs=9.6

Q ss_pred             HHHHHHHHHHHHHhhee
Q psy17635         17 LAIAFLILVLILVCLIK   33 (103)
Q Consensus        17 ~ai~lLllilli~C~i~   33 (103)
                      .++||||||+-|---+|
T Consensus         5 GilVLlLLIigIgSALk   21 (328)
T PRK10905          5 GILVLLLLIIGIGSALK   21 (328)
T ss_pred             hHHHHHHHHHHHhHhhc
Confidence            34566666666654444


No 330
>MTH00166 ND6 NADH dehydrogenase subunit 6; Provisional
Probab=25.73  E-value=95  Score=22.41  Aligned_cols=25  Identities=20%  Similarity=0.479  Sum_probs=12.6

Q ss_pred             HHHHHHHHHHHHHHHHHhheeecCC
Q psy17635         13 IGMLLAIAFLILVLILVCLIKRNRG   37 (103)
Q Consensus        13 Ig~~~ai~lLllilli~C~i~R~rG   37 (103)
                      +-++.++-|++..+.++..+++++|
T Consensus       130 ~~i~l~~yLll~Li~vvki~~~~~g  154 (160)
T MTH00166        130 ITLFMIIYLLLTLIVVVKITNIFKG  154 (160)
T ss_pred             HHHHHHHHHHHHHHHHHHHHccCCC
Confidence            3344444444444445566666665


No 331
>PLN00028 nitrate transmembrane transporter; Provisional
Probab=25.64  E-value=93  Score=25.42  Aligned_cols=24  Identities=21%  Similarity=0.213  Sum_probs=13.8

Q ss_pred             hHHHHHHHHHHHHHHHHHHHhhee
Q psy17635         10 GWFIGMLLAIAFLILVLILVCLIK   33 (103)
Q Consensus        10 gWfIg~~~ai~lLllilli~C~i~   33 (103)
                      +|-.+++.+.+++++-+++.++++
T Consensus       410 ~y~~~f~~~~~~~~i~~~~~~~~~  433 (476)
T PLN00028        410 STETGISLMGVMIIACTLPVAFIH  433 (476)
T ss_pred             cHhhHHHHHHHHHHHHHHHHHhee
Confidence            466666666555555555556564


No 332
>PTZ00370 STEVOR; Provisional
Probab=25.58  E-value=55  Score=27.09  Aligned_cols=18  Identities=28%  Similarity=0.137  Sum_probs=6.7

Q ss_pred             hhhHHHHHHHHHHHHHHH
Q psy17635          8 TAGWFIGMLLAIAFLILV   25 (103)
Q Consensus         8 t~gWfIg~~~ai~lLlli   25 (103)
                      +.++-|-.=|.|+.|+|+
T Consensus       248 taAsaaF~Pygiaalvll  265 (296)
T PTZ00370        248 SAASSAFYPYGIAALVLL  265 (296)
T ss_pred             HHHHHhhcccHHHHHHHH
Confidence            333333333334433333


No 333
>TIGR03644 marine_trans_1 probable ammonium transporter, marine subtype. Members of this protein family are well conserved subclass of putative ammonimum transporters, belonging to the much broader set of ammonium/methylammonium transporter described by TIGR00836. Species with this transporter tend to be marine bacteria. Partial phylogenetic profiling (PPP) picks a member of this protein family as the single best-scoring protein vs. a reference profile for the marine environment Genome Property for a large number of different query genomes. This finding by PPP suggests that this transporter family represents an important adaptation to the marine environment.
Probab=25.50  E-value=94  Score=26.27  Aligned_cols=38  Identities=24%  Similarity=0.368  Sum_probs=19.1

Q ss_pred             HHHHHHHHHHHHHHHHhheeecCCCcccchhhhhhcCCC
Q psy17635         14 GMLLAIAFLILVLILVCLIKRNRGGKYAVHEREAAHGRH   52 (103)
Q Consensus        14 g~~~ai~lLllilli~C~i~R~rGgKY~VkeKE~~~g~~   52 (103)
                      ++++.++.-.++-++++++-|.-++ ..|+++|+..|-+
T Consensus       355 g~~~~~~~~~~~s~~~~~il~~~~~-lRv~~e~E~~GlD  392 (404)
T TIGR03644       355 GAATIFAWVFVTSLIVWFILKATMG-IRVSEEEEYEGLD  392 (404)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHhccc-ccCCHHHHHcCCC
Confidence            3344444433444444444343333 6788777777643


No 334
>TIGR01432 QOXA cytochrome aa3 quinol oxidase, subunit II. This enzyme catalyzes the oxidation of quinol with the concomitant reduction of molecular oxygen to water. This acts as the terminal electron acceptor in the respiratory chain. This subunit contains two transmembrane helices and a large external domain responsible for the binding and oxidation of quinol. QuoX is (presently) only found in gram positive bacteria of the Bacillus/Staphylococcus group. Like CyoA, the ubiquinol oxidase found in proteobacteria, the residues responsible for the ligation of Cu(a) and cytochrome c (found in the related cyt. c oxidases) are absent. Unlike CyoA, QoxA is in complex with a subunit I which contains cytochromes a similar to the cyt. c oxidases (as opposed to cytochromes b).
Probab=25.43  E-value=55  Score=24.88  Aligned_cols=16  Identities=38%  Similarity=0.804  Sum_probs=6.6

Q ss_pred             HHHHHHHHHHHHHhhe
Q psy17635         17 LAIAFLILVLILVCLI   32 (103)
Q Consensus        17 ~ai~lLllilli~C~i   32 (103)
                      ++|++++.++++.++.
T Consensus        31 ~iI~lvv~~lli~~~~   46 (217)
T TIGR01432        31 LVIVFVVFVLFTIFLV   46 (217)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            3344333444444444


No 335
>PF09911 DUF2140:  Uncharacterized protein conserved in bacteria (DUF2140);  InterPro: IPR018672  This family of conserved hypothetical proteins has no known function. 
Probab=25.42  E-value=65  Score=24.26  Aligned_cols=19  Identities=21%  Similarity=0.462  Sum_probs=11.4

Q ss_pred             hHHHHHHHHHHHHHHHHHH
Q psy17635         10 GWFIGMLLAIAFLILVLIL   28 (103)
Q Consensus        10 gWfIg~~~ai~lLllilli   28 (103)
                      .|-.+.++.+++++++++.
T Consensus         2 ~WK~aF~~Lla~~l~~~~~   20 (187)
T PF09911_consen    2 WWKWAFLILLALNLAFVIV   20 (187)
T ss_pred             hHHHHHHHHHHHHHHHHhh
Confidence            4777777766655544433


No 336
>PF05399 EVI2A:  Ectropic viral integration site 2A protein (EVI2A);  InterPro: IPR008608 This family contains several mammalian ectropic viral integration site 2A (EVI2A) proteins. The function of this protein is unknown although it is thought to be a membrane protein and may function as an oncogene in retrovirus induced myeloid tumours [, ].; GO: 0016021 integral to membrane
Probab=25.35  E-value=68  Score=25.65  Aligned_cols=8  Identities=50%  Similarity=0.962  Sum_probs=4.3

Q ss_pred             eeecCC-Cc
Q psy17635         32 IKRNRG-GK   39 (103)
Q Consensus        32 i~R~rG-gK   39 (103)
                      .||++. ||
T Consensus       161 LKrskQ~gK  169 (227)
T PF05399_consen  161 LKRSKQVGK  169 (227)
T ss_pred             HHHHHHhhc
Confidence            356655 54


No 337
>PF13038 DUF3899:  Domain of unknown function (DUF3899)
Probab=25.30  E-value=1.1e+02  Score=19.93  Aligned_cols=25  Identities=28%  Similarity=0.506  Sum_probs=12.7

Q ss_pred             hhhhHHHHHHHHHHHHHHHHHHHhh
Q psy17635          7 ATAGWFIGMLLAIAFLILVLILVCL   31 (103)
Q Consensus         7 at~gWfIg~~~ai~lLllilli~C~   31 (103)
                      ....|...++.+.++++++.+++++
T Consensus        67 ~~~~~~~~~ll~~~ll~l~~iil~f   91 (92)
T PF13038_consen   67 RVSRWTYPLLLIGLLLILLSIILSF   91 (92)
T ss_pred             HhHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3345766666555444444444443


No 338
>PF00003 7tm_3:  7 transmembrane sweet-taste receptor of 3 GCPR;  InterPro: IPR017978 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  GPCR family 3 receptors (also known as family C) are structurally similar to other GPCRs, but do not show any significant sequence similarity and thus represent a distinct group. Structurally they are composed of four elements; an N-terminal signal sequence; a large hydrophilic extracellular agonist-binding region containing several conserved cysteine residues which could be involved in disulphide bonds; a shorter region containing seven transmembrane domains; and a C-terminal cytoplasmic domain of variable length []. Family 3 members include the metabotropic glutamate receptors, the extracellular calcium-sensing receptors, the gamma-amino-butyric acid (GABA) type B receptors, and the vomeronasal type-2 receptors [, , , ]. As these receptors regulate many important physiological processes they are potentially promising targets for drug development. This entry represents the C-terminal region of family 3 GPCR receptor proteins, which contains the seven transmembrane region. The seven TM regions assemble in such a way as to produce a docking pocket into which such molecules as cyclamate and lactisole have been found to bind and consequently confer the taste of sweetness []. ; GO: 0004930 G-protein coupled receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane
Probab=25.27  E-value=88  Score=22.88  Aligned_cols=34  Identities=21%  Similarity=0.334  Sum_probs=25.5

Q ss_pred             hhhhHHHHHHHHHHHHHHHHHHHhheeecCCCcc
Q psy17635          7 ATAGWFIGMLLAIAFLILVLILVCLIKRNRGGKY   40 (103)
Q Consensus         7 at~gWfIg~~~ai~lLllilli~C~i~R~rGgKY   40 (103)
                      .+..|++..+.--++|+++.+.+++-.|+--..|
T Consensus       146 ~~~~~~~~~~~y~~~Ll~~~~~la~~~R~~~~~~  179 (238)
T PF00003_consen  146 NSNIWLILSLGYNGLLLLIGFFLAFKTRNVPSNF  179 (238)
T ss_pred             CccchHHHHHHHHHHHHHHHHHHHHhhCCCCccc
Confidence            5667998888888888888888887767655554


No 339
>TIGR00889 2A0110 nucleoside transporter. This family of proteins transports nucleosides at a high affinity. The transport mechanism is driven by proton motive force. This family includes nucleoside permease NupG and xanthosine permease from E.Coli.
Probab=25.12  E-value=79  Score=25.35  Aligned_cols=9  Identities=33%  Similarity=0.567  Sum_probs=4.0

Q ss_pred             ccchhhhhh
Q psy17635         40 YAVHEREAA   48 (103)
Q Consensus        40 Y~VkeKE~~   48 (103)
                      -.|.|++.+
T Consensus       409 ~~~~~~~~~  417 (418)
T TIGR00889       409 IKVDDRDVA  417 (418)
T ss_pred             CCCcccccc
Confidence            345554433


No 340
>PRK07718 fliL flagellar basal body-associated protein FliL; Reviewed
Probab=25.06  E-value=1.2e+02  Score=21.71  Aligned_cols=26  Identities=12%  Similarity=0.114  Sum_probs=12.0

Q ss_pred             HHHHHHHHHHHHHHHHHHHhheeecC
Q psy17635         11 WFIGMLLAIAFLILVLILVCLIKRNR   36 (103)
Q Consensus        11 WfIg~~~ai~lLllilli~C~i~R~r   36 (103)
                      |+|.++..|+++++......|..+..
T Consensus         6 ~~i~~i~l~~l~~~g~~~~~~~~~~~   31 (142)
T PRK07718          6 IKIMLIILIVIALIGTAALVLVMGFS   31 (142)
T ss_pred             HHHHHHHHHHHHHHHHHHHhhhcccC
Confidence            44544444444444444445555543


No 341
>PF14715 FixP_N:  N-terminal domain of cytochrome oxidase-cbb3, FixP 
Probab=24.99  E-value=1.2e+02  Score=18.58  Aligned_cols=20  Identities=25%  Similarity=0.364  Sum_probs=15.2

Q ss_pred             hhHHHHHHHHHHHHHHHHHH
Q psy17635          9 AGWFIGMLLAIAFLILVLIL   28 (103)
Q Consensus         9 ~gWfIg~~~ai~lLllilli   28 (103)
                      ..|++.+.+.|++.+..++.
T Consensus        22 ~ww~~~f~~tivfa~~Y~~~   41 (51)
T PF14715_consen   22 RWWLWLFYGTIVFAVGYLVL   41 (51)
T ss_pred             HHHHHHHHHHHHHHHHHHHH
Confidence            47888888888888776654


No 342
>TIGR03778 VPDSG_CTERM VPDSG-CTERM exosortase interaction domain. Through in silico analysis, we previously described the PEP-CTERM/exosortase system (PubMed:16930487). This model describes a PEP-CTERM-like variant C-terminal protein sorting signal, as found at the C-terminus of twenty otherwise unrelated proteins in Verrucomicrobiae bacterium DG1235. The variant motif, VPDSG, seems an intermediate between the VPEP motif (TIGR02595) of typical exosortase systems and the classical LPXTG of sortase in Gram-positive bacteria.
Probab=24.92  E-value=64  Score=17.62  Aligned_cols=16  Identities=31%  Similarity=0.281  Sum_probs=7.6

Q ss_pred             HHHHHHHHHHhheeec
Q psy17635         20 AFLILVLILVCLIKRN   35 (103)
Q Consensus        20 ~lLllilli~C~i~R~   35 (103)
                      +||.+-++++..++|.
T Consensus         9 ~Ll~~~l~~l~~~rRr   24 (26)
T TIGR03778         9 ALLGLGLLGLLGLRRR   24 (26)
T ss_pred             HHHHHHHHHHHHHhhc
Confidence            4444444444455553


No 343
>PF11862 DUF3382:  Domain of unknown function (DUF3382);  InterPro: IPR021807  This entry represents the N-terminal domain of the LivHM type high-affinity branched-chain amino acid transport system permease proteins. The domain is about 100 amino acids in length, and is found associated with PF02653 from PFAM. 
Probab=24.79  E-value=1.1e+02  Score=20.59  Aligned_cols=36  Identities=25%  Similarity=0.305  Sum_probs=25.7

Q ss_pred             hHHHHHHHHHHHHHHHHHHHhheeecCCCcccchhh
Q psy17635         10 GWFIGMLLAIAFLILVLILVCLIKRNRGGKYAVHER   45 (103)
Q Consensus        10 gWfIg~~~ai~lLllilli~C~i~R~rGgKY~VkeK   45 (103)
                      .+-=++++|++.++|...++.+-..+.|++..|.-+
T Consensus         4 ~Lk~Al~aall~lvl~~pi~Gl~l~~~g~~L~~~~r   39 (101)
T PF11862_consen    4 NLKDALFAALLALVLFGPIVGLKLDNQGGQLVLEPR   39 (101)
T ss_pred             HHHHHHHHHHHHHHHHHHheEEEEecCCcEEEEEec
Confidence            445567777777777777777777777887776544


No 344
>PF15048 OSTbeta:  Organic solute transporter subunit beta protein
Probab=24.71  E-value=76  Score=23.22  Aligned_cols=28  Identities=25%  Similarity=0.396  Sum_probs=13.7

Q ss_pred             HHHHHHHHHHHhheeecCCCcccchhhh
Q psy17635         19 IAFLILVLILVCLIKRNRGGKYAVHERE   46 (103)
Q Consensus        19 i~lLllilli~C~i~R~rGgKY~VkeKE   46 (103)
                      ++++|=++|+.-=|..||.-|-.-.+|+
T Consensus        45 vvlvi~~~LLgrsi~ANRnrK~~~~~k~   72 (125)
T PF15048_consen   45 VVLVISFFLLGRSIQANRNRKMQPQEKQ   72 (125)
T ss_pred             HHHHHHHHHHHHHhHhcccccccccccc
Confidence            3333333333334566776675544454


No 345
>PF02790 COX2_TM:  Cytochrome C oxidase subunit II, transmembrane domain This family corresponds to chains b and o.;  InterPro: IPR011759 Cytochrome c oxidase (1.9.3.1 from EC) [, ] is an oligomeric enzymatic complex which is a component of the respiratory chain and is involved in the transfer of electrons from cytochrome c to oxygen. In eukaryotes this enzyme complex is located in the mitochondrial inner membrane; in aerobic prokaryotes it is found in the plasma membrane. The enzyme complex consists of 3-4 subunits (prokaryotes) to up to 13 polypeptides (mammals). Subunit 2 (CO II) transfers the electrons from cytochrome c to the catalytic subunit 1. It contains two adjacent transmembrane regions in its N terminus and the major part of the protein is exposed to the periplasmic or to the mitochondrial intermembrane space, respectively. CO II provides the substrate-binding site and contains a copper centre called Cu(A) (see IPR001505 from INTERPRO), probably the primary acceptor in cytochrome c oxidase. An exception is the corresponding subunit of the cbb3-type oxidase which lacks the copper A redox-centre. Several bacterial CO II have a C-terminal extension that contains a covalently bound haem c.  The N-terminal domain of cytochrome C oxidase contains two transmembrane alpha-helices. ; GO: 0004129 cytochrome-c oxidase activity, 0005507 copper ion binding, 0009055 electron carrier activity, 0022900 electron transport chain, 0016021 integral to membrane; PDB: 3VRJ_C 2EIN_B 3AG3_B 2DYR_O 3AG1_B 2EIK_O 3ASN_B 1OCR_B 2EIJ_B 1OCC_O ....
Probab=24.70  E-value=1e+02  Score=19.17  Aligned_cols=15  Identities=13%  Similarity=0.430  Sum_probs=6.2

Q ss_pred             HHHHHHHHHHHhhee
Q psy17635         19 IAFLILVLILVCLIK   33 (103)
Q Consensus        19 i~lLllilli~C~i~   33 (103)
                      |...++.+++.++.+
T Consensus        34 I~~~V~~~l~~~~~~   48 (84)
T PF02790_consen   34 IFVFVFYFLIYFLFN   48 (84)
T ss_dssp             HHHHHHHHHHHHHHH
T ss_pred             eeeeEeeeccccccc
Confidence            333333344444444


No 346
>TIGR02794 tolA_full TolA protein. TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cutoffs are based largely conserved operon struction. //The Tol-Pal complex is required for maintaining outer membrane integrity. Also involved in transport (uptake) of colicins and filamentous DNA, and implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with outer membrane porins ompC, phoE and lamB.
Probab=24.63  E-value=74  Score=26.29  Aligned_cols=13  Identities=31%  Similarity=0.634  Sum_probs=6.4

Q ss_pred             HHHHHHHHHHHHH
Q psy17635         17 LAIAFLILVLILV   29 (103)
Q Consensus        17 ~ai~lLllilli~   29 (103)
                      +++.+|||++||+
T Consensus         8 v~lHvlLi~lL~~   20 (346)
T TIGR02794         8 LLLHILLLGLLIL   20 (346)
T ss_pred             HHHHHHHHHHHHH
Confidence            3345555555554


No 347
>PF06697 DUF1191:  Protein of unknown function (DUF1191);  InterPro: IPR010605 This family contains hypothetical plant proteins of unknown function.
Probab=24.58  E-value=15  Score=29.91  Aligned_cols=23  Identities=26%  Similarity=0.615  Sum_probs=11.6

Q ss_pred             HHHHHHHHH-HHHHHHHhheeecC
Q psy17635         14 GMLLAIAFL-ILVLILVCLIKRNR   36 (103)
Q Consensus        14 g~~~ai~lL-llilli~C~i~R~r   36 (103)
                      |++..+++| +|.++++|++|-.|
T Consensus       218 g~~~G~~~L~ll~~lv~~~vr~kr  241 (278)
T PF06697_consen  218 GVVGGVVLLGLLSLLVAMLVRYKR  241 (278)
T ss_pred             EehHHHHHHHHHHHHHHhhhhhhH
Confidence            434444444 44456667765443


No 348
>PF09583 Phageshock_PspG:  Phage shock protein G (Phageshock_PspG);  InterPro: IPR014318 This protein previously was designated yjbO in Escherichia coli. It is found only in genomes that have the phage shock operon (psp), but it is only rarely encoded near other psp genes. The psp regulon is upregulated in response to a number of stress conditions, including ethanol, expression of the filamentous phage secretin protein IV and other secretins and heat shock.
Probab=24.50  E-value=1.3e+02  Score=19.74  Aligned_cols=28  Identities=29%  Similarity=0.501  Sum_probs=13.5

Q ss_pred             hhhhHHHHHHHHHHH-----HHHHHHHHhheee
Q psy17635          7 ATAGWFIGMLLAIAF-----LILVLILVCLIKR   34 (103)
Q Consensus         7 at~gWfIg~~~ai~l-----Lllilli~C~i~R   34 (103)
                      ++.-++++=|.++++     |+++++.++++|+
T Consensus        31 a~~vm~l~Gm~~lviKLLPWLil~~~~vW~~r~   63 (65)
T PF09583_consen   31 AFAVMFLGGMFGLVIKLLPWLILAAVVVWFYRA   63 (65)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Confidence            444455555555444     2334444555554


No 349
>PF01998 DUF131:  Protein of unknown function DUF131;  InterPro: IPR002849 This archaebacterial protein family has no known function. The proteins are predicted to contain two transmembrane helices.
Probab=24.49  E-value=1e+02  Score=19.77  Aligned_cols=16  Identities=50%  Similarity=0.783  Sum_probs=7.3

Q ss_pred             HHHHHHHHHHHHHHHH
Q psy17635         12 FIGMLLAIAFLILVLI   27 (103)
Q Consensus        12 fIg~~~ai~lLllill   27 (103)
                      .++++.||++.++.++
T Consensus        46 ~~~~ilaiil~i~~~l   61 (64)
T PF01998_consen   46 KIAMILAIILMILALL   61 (64)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            3444555444444433


No 350
>TIGR03141 cytochro_ccmD heme exporter protein CcmD. The model for this protein family describes a small, hydrophobic, and only moderately well-conserved protein, tricky to identify accurately for all of these reasons. However, members are found as part of large operons involved in heme export across the inner membrane for assembly of c-type cytochromes in a large number of bacteria. The gray zone between the trusted cutoff (13.0) and noise cutoff (4.75) includes both low-scoring examples and false-positive matches to hydrophobic domains of longer proteins.
Probab=24.29  E-value=42  Score=19.75  Aligned_cols=21  Identities=19%  Similarity=0.292  Sum_probs=9.7

Q ss_pred             HHHHHHHHHHHHHhheeecCC
Q psy17635         17 LAIAFLILVLILVCLIKRNRG   37 (103)
Q Consensus        17 ~ai~lLllilli~C~i~R~rG   37 (103)
                      -++.+++|+.+++....++|.
T Consensus        12 Yg~t~l~l~~li~~~~~~~r~   32 (45)
T TIGR03141        12 YGITALVLAGLILWSLLDRRR   32 (45)
T ss_pred             HHHHHHHHHHHHHHHHHHHHH
Confidence            344445554444444444443


No 351
>PF15339 Afaf:  Acrosome formation-associated factor
Probab=24.14  E-value=76  Score=24.82  Aligned_cols=21  Identities=19%  Similarity=0.362  Sum_probs=12.3

Q ss_pred             HHHHHH-HHHHHHHHHHHhhee
Q psy17635         13 IGMLLA-IAFLILVLILVCLIK   33 (103)
Q Consensus        13 Ig~~~a-i~lLllilli~C~i~   33 (103)
                      |.+|.. |.+.||++|.+-+.|
T Consensus       136 IsLmTl~lfv~Ll~~c~atlyk  157 (200)
T PF15339_consen  136 ISLMTLFLFVILLAFCSATLYK  157 (200)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHH
Confidence            444443 666677777665553


No 352
>KOG3832|consensus
Probab=24.07  E-value=84  Score=25.73  Aligned_cols=13  Identities=38%  Similarity=0.915  Sum_probs=9.7

Q ss_pred             cchhhhhHHHHHH
Q psy17635          4 DTVATAGWFIGML   16 (103)
Q Consensus         4 ~~~at~gWfIg~~   16 (103)
                      ..|+|+||+..+.
T Consensus        35 kafatagwllsi~   47 (319)
T KOG3832|consen   35 KAFATAGWLLSIT   47 (319)
T ss_pred             HhHhhccHHHHHH
Confidence            3699999986543


No 353
>PF13858 DUF4199:  Protein of unknown function (DUF4199)
Probab=24.04  E-value=98  Score=21.58  Aligned_cols=28  Identities=14%  Similarity=0.422  Sum_probs=18.0

Q ss_pred             hhHHH-HHHHHHHHHHHHHHHHhheeecC
Q psy17635          9 AGWFI-GMLLAIAFLILVLILVCLIKRNR   36 (103)
Q Consensus         9 ~gWfI-g~~~ai~lLllilli~C~i~R~r   36 (103)
                      ..+++ +++.-+.+-+++-||++.+.|+|
T Consensus       135 ~~~~~~~~~~~l~~G~i~sli~a~i~kkk  163 (163)
T PF13858_consen  135 FSLAFSGFISNLIFGFIISLIIALILKKK  163 (163)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHhcCC
Confidence            45666 55666667777777777666654


No 354
>PF05510 Sarcoglycan_2:  Sarcoglycan alpha/epsilon;  InterPro: IPR008908 Sarcoglycans are a subcomplex of transmembrane proteins which are part of the dystrophin-glycoprotein complex. They are expressed in the skeletal, cardiac and smooth muscle. Although numerous studies have been conducted on the sarcoglycan subcomplex in skeletal and cardiac muscle, the manner of the distribution and localisation of these proteins along the nonjunctional sarcolemma is not clear []. This family contains alpha and epsilon members.; GO: 0016012 sarcoglycan complex
Probab=23.96  E-value=65  Score=27.45  Aligned_cols=17  Identities=24%  Similarity=0.757  Sum_probs=7.4

Q ss_pred             HHHHHHHHHHHHHHHhh
Q psy17635         15 MLLAIAFLILVLILVCL   31 (103)
Q Consensus        15 ~~~ai~lLllilli~C~   31 (103)
                      ++++++|++|+.+|+||
T Consensus       293 l~i~llL~llLs~Imc~  309 (386)
T PF05510_consen  293 LIIALLLLLLLSYIMCC  309 (386)
T ss_pred             HHHHHHHHHHHHHHhee
Confidence            33334444444445554


No 355
>PF05297 Herpes_LMP1:  Herpesvirus latent membrane protein 1 (LMP1);  InterPro: IPR007961 This family consists of several latent membrane protein 1 or LMP1s mostly from Epstein-Barr virus (strain GD1) (HHV-4) (Human herpesvirus 4). LMP1 of HHV-4 is a 62-65 kDa plasma membrane protein possessing six membrane spanning regions, a short cytoplasmic N terminus and a long cytoplasmic carboxy tail of 200 amino acids. HHV-4 virus latent membrane protein 1 (LMP1) is essential for HHV-4 mediated transformation and has been associated with several cases of malignancies. HHV-4-like viruses in Macaca fascicularis (Cynomolgus monkeys) have been associated with high lymphoma rates in immunosuppressed monkeys [].; GO: 0019087 transformation of host cell by virus, 0016021 integral to membrane; PDB: 1CZY_E 1ZMS_B.
Probab=23.93  E-value=26  Score=29.56  Aligned_cols=25  Identities=24%  Similarity=0.481  Sum_probs=0.0

Q ss_pred             hhhHHHHHHHHHHHHHHHHHHHhhe
Q psy17635          8 TAGWFIGMLLAIAFLILVLILVCLI   32 (103)
Q Consensus         8 t~gWfIg~~~ai~lLllilli~C~i   32 (103)
                      .|.||..+|=..-|++++.+++++.
T Consensus       160 ~qaWfT~L~dL~WL~LFlaiLIWlY  184 (381)
T PF05297_consen  160 HQAWFTILVDLYWLLLFLAILIWLY  184 (381)
T ss_dssp             -------------------------
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3567776666655666665555554


No 356
>PF13400 Tad:  Putative Flp pilus-assembly TadE/G-like
Probab=23.89  E-value=44  Score=19.41  Aligned_cols=15  Identities=20%  Similarity=0.487  Sum_probs=5.8

Q ss_pred             HHHHHHHHHHHHHHh
Q psy17635         16 LLAIAFLILVLILVC   30 (103)
Q Consensus        16 ~~ai~lLllilli~C   30 (103)
                      +.+++++.+++++.+
T Consensus         6 ~~~~~~~~~l~~~~~   20 (48)
T PF13400_consen    6 IFALVLVPLLLLIGL   20 (48)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            334343333333333


No 357
>PRK01844 hypothetical protein; Provisional
Probab=23.77  E-value=82  Score=20.99  Aligned_cols=24  Identities=21%  Similarity=0.246  Sum_probs=9.6

Q ss_pred             hHHHHHHHHHHHHHHHHHHHhheee
Q psy17635         10 GWFIGMLLAIAFLILVLILVCLIKR   34 (103)
Q Consensus        10 gWfIg~~~ai~lLllilli~C~i~R   34 (103)
                      .|+..++..+ .|++-+++-.|+-|
T Consensus         3 ~~~~I~l~I~-~li~G~~~Gff~ar   26 (72)
T PRK01844          3 IWLGILVGVV-ALVAGVALGFFIAR   26 (72)
T ss_pred             HHHHHHHHHH-HHHHHHHHHHHHHH
Confidence            3543333333 33333344455544


No 358
>PRK05696 fliL flagellar basal body-associated protein FliL; Reviewed
Probab=23.73  E-value=58  Score=23.81  Aligned_cols=8  Identities=13%  Similarity=-0.047  Sum_probs=3.5

Q ss_pred             HHHHHHHH
Q psy17635         11 WFIGMLLA   18 (103)
Q Consensus        11 WfIg~~~a   18 (103)
                      |+|+++.+
T Consensus        21 iiii~~~~   28 (170)
T PRK05696         21 IIIIVIGV   28 (170)
T ss_pred             EeeHHHHH
Confidence            55444333


No 359
>PRK13872 conjugal transfer protein TrbF; Provisional
Probab=23.64  E-value=1.2e+02  Score=23.11  Aligned_cols=9  Identities=22%  Similarity=0.453  Sum_probs=4.8

Q ss_pred             hheeecCCC
Q psy17635         30 CLIKRNRGG   38 (103)
Q Consensus        30 C~i~R~rGg   38 (103)
                      .+|.-++-|
T Consensus        71 yvv~VD~tG   79 (228)
T PRK13872         71 WVVEVDRLG   79 (228)
T ss_pred             EEEEEcCCC
Confidence            445556645


No 360
>COG4477 EzrA Negative regulator of septation ring formation [Cell division and chromosome partitioning]
Probab=23.59  E-value=32  Score=30.74  Aligned_cols=24  Identities=17%  Similarity=0.365  Sum_probs=15.0

Q ss_pred             HHHHHHHHHHHHHHHHHhheeecC
Q psy17635         13 IGMLLAIAFLILVLILVCLIKRNR   36 (103)
Q Consensus        13 Ig~~~ai~lLllilli~C~i~R~r   36 (103)
                      |-+|+||++++++++.+-++.|.|
T Consensus         2 ~~liv~llVilv~~~~~g~~lRkk   25 (570)
T COG4477           2 IYLIVALLVILVAAYAVGYLLRKK   25 (570)
T ss_pred             eehHHHHHHHHHHHHHHHHHHHHh
Confidence            346667777777766666665544


No 361
>PF14257 DUF4349:  Domain of unknown function (DUF4349)
Probab=23.59  E-value=1.1e+02  Score=23.55  Aligned_cols=9  Identities=11%  Similarity=0.228  Sum_probs=3.8

Q ss_pred             hHHHHHHHH
Q psy17635         10 GWFIGMLLA   18 (103)
Q Consensus        10 gWfIg~~~a   18 (103)
                      .|+|.++++
T Consensus       234 ~~lv~~l~~  242 (262)
T PF14257_consen  234 SGLVVFLVG  242 (262)
T ss_pred             HHHHHHHHH
Confidence            344444444


No 362
>PF02158 Neuregulin:  Neuregulin family;  InterPro: IPR002154 Neuregulins are a sub-family of EGF-like molecules that have been shown to play multiple essential roles in vertebrate embryogenesis including: cardiac development, Schwann cell and oligodendrocyte differentiation, some aspects of neuronal development, as well as the formation of neuromuscular synapses [, ]. Included in the family are heregulin; neu differentiation factor; acetylcholine receptor synthesis stimulator; glial growth factor; and sensory and motor-neuron derived factor []. Multiple family members are generated by alternate splicing or by use of several cell type-specific transcription initiation sites. In general, they bind to and activate the erbB family of receptor tyrosine kinases (erbB2 (HER2), erbB3 (HER3), and erbB4 (HER4)), functioning both as heterodimers and homodimers.  The transmembrane forms of neuregulin 1 (NRG1) are present within synaptic vesicles, including those containing glutamate []. After exocytosis, NRG1 is in the presynaptic membrane, where the ectodomain of NRG1 may be cleaved off. The ectodomain then migrates across the synaptic cleft and binds to and activates a member of the EGF-receptor family on the postsynaptic membrane. This has been shown to increase the expression of certain glutamate-receptor subunits. NRG1 appears to signal for glutamate-receptor subunit expression, localisation, and /or phosphorylation facilitating subsequent glutamate transmission.   The NRG1 gene has been identified as a potential gene determining susceptibility to schizophrenia by a combination of genetic linkage and association approaches []. ; GO: 0005102 receptor binding, 0009790 embryo development; PDB: 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=23.57  E-value=27  Score=30.05  Aligned_cols=16  Identities=31%  Similarity=0.681  Sum_probs=0.0

Q ss_pred             HHHHHHHHHHHHHhhe
Q psy17635         17 LAIAFLILVLILVCLI   32 (103)
Q Consensus        17 ~ai~lLllilli~C~i   32 (103)
                      ..|++.||++=|+|.+
T Consensus        13 TgIcvaLlVVGi~Cvv   28 (404)
T PF02158_consen   13 TGICVALLVVGIVCVV   28 (404)
T ss_dssp             ----------------
T ss_pred             hhhhHHHHHHHHHHHH
Confidence            3355555555555555


No 363
>KOG3359|consensus
Probab=23.21  E-value=88  Score=28.84  Aligned_cols=39  Identities=23%  Similarity=0.348  Sum_probs=22.2

Q ss_pred             hhHHHHHHHHHHHHHHHHHHHhheeecCCCcccchhhhhhc
Q psy17635          9 AGWFIGMLLAIAFLILVLILVCLIKRNRGGKYAVHEREAAH   49 (103)
Q Consensus         9 ~gWfIg~~~ai~lLllilli~C~i~R~rGgKY~VkeKE~~~   49 (103)
                      ..|-+.+++.+++.+++  +.+.++|+||--+.+.+.+..+
T Consensus       576 ~~Ww~~~~~v~if~~v~--~~~l~~wqr~~~~~~~~~~~~~  614 (723)
T KOG3359|consen  576 VIWWSSTLSLAIFALVV--LFRLLRWQRGTILSDPQIWRVH  614 (723)
T ss_pred             HHHHHHHHHHHHHHHHH--HHHHHHhccCCccCchhhhhHH
Confidence            34555555554444333  3566788888655555555444


No 364
>KOG1094|consensus
Probab=23.17  E-value=37  Score=31.35  Aligned_cols=26  Identities=19%  Similarity=0.401  Sum_probs=15.7

Q ss_pred             hhhHHHHHHHHHHHHHHHHHHHhhee
Q psy17635          8 TAGWFIGMLLAIAFLILVLILVCLIK   33 (103)
Q Consensus         8 t~gWfIg~~~ai~lLllilli~C~i~   33 (103)
                      +..-.|+++.||.|++++++.+|+-+
T Consensus       389 ~t~~~~~~f~~if~iva~ii~~~L~R  414 (807)
T KOG1094|consen  389 PTAILIIIFVAIFLIVALIIALMLWR  414 (807)
T ss_pred             CceehHHHHHHHHHHHHHHHHHHHHH
Confidence            34455666677666666655566554


No 365
>cd01811 OASL_repeat1 2'-5' oligoadenylate synthetase-like protein, repeat 1 of 2. OASL_repeat1   (2'-5' oligoadenylate synthetase-like protein) belongs to a family of interferon-induced 2'-5' oligoadenylate synthetases which are important for the antiviral activity of interferons. While each member of this famliy has a conserved N-terminal OAS catalytic domain, only OASL has two tandem ubiquitin-like repeats located at the C-terminus and this CD represents one of those repeats.
Probab=23.07  E-value=44  Score=22.76  Aligned_cols=13  Identities=38%  Similarity=0.583  Sum_probs=11.0

Q ss_pred             CCCCcchhhhhCC
Q psy17635         84 VESDTDSMAEYGD   96 (103)
Q Consensus        84 ~~~s~DSl~dYgd   96 (103)
                      ..+|.+|||+||-
T Consensus        53 lL~s~~sLA~yGi   65 (80)
T cd01811          53 LLSSRKSLADYGI   65 (80)
T ss_pred             cccccccHhhhcc
Confidence            4788999999984


No 366
>PF10873 DUF2668:  Protein of unknown function (DUF2668);  InterPro: IPR022640  Members in this family of proteins are annotated as cysteine and tyrosine-rich protein 1, however currently no function is known []. 
Probab=22.99  E-value=84  Score=23.78  Aligned_cols=10  Identities=30%  Similarity=0.690  Sum_probs=5.5

Q ss_pred             eeecCCCccc
Q psy17635         32 IKRNRGGKYA   41 (103)
Q Consensus        32 i~R~rGgKY~   41 (103)
                      .+.|||-.=.
T Consensus        88 ~kn~rgsRvg   97 (155)
T PF10873_consen   88 MKNSRGSRVG   97 (155)
T ss_pred             hhcCCCcccc
Confidence            4667665433


No 367
>PF04906 Tweety:  Tweety;  InterPro: IPR006990 None of the members of the tweety (tty) family have been functionally characterised. However, they are considered to be transmembrane proteins with five potential membrane-spanning regions. A number of potential functions have been suggested on the basis of homology to the yeast FTR1 and FTH1 iron transporter proteins and the mammalian neurotensin receptors 1 and 2 in that they have a similar hydrophobicity profiles although there is no detectable sequence homology to the tweety-related proteins. It has been proposed that the tweety-related proteins could be involved in transport of iron or other divalent cations or alternatively that they may be membrane-bound receptors [].
Probab=22.88  E-value=1e+02  Score=25.89  Aligned_cols=12  Identities=8%  Similarity=0.047  Sum_probs=7.9

Q ss_pred             HhheeecCCCcc
Q psy17635         29 VCLIKRNRGGKY   40 (103)
Q Consensus        29 ~C~i~R~rGgKY   40 (103)
                      +-++||+|..--
T Consensus       205 ~glar~Sk~~li  216 (406)
T PF04906_consen  205 LGLARQSKCLLI  216 (406)
T ss_pred             HHHHhcCcceEE
Confidence            346688887653


No 368
>PRK03001 M48 family peptidase; Provisional
Probab=22.86  E-value=63  Score=25.31  Aligned_cols=35  Identities=17%  Similarity=0.307  Sum_probs=16.6

Q ss_pred             hHHHHHHHHHHHHHHHHHHH-hheeecCCCcccchhh
Q psy17635         10 GWFIGMLLAIAFLILVLILV-CLIKRNRGGKYAVHER   45 (103)
Q Consensus        10 gWfIg~~~ai~lLllilli~-C~i~R~rGgKY~VkeK   45 (103)
                      +|+++++.++++.++...+. .++.+.=| +.+|.+.
T Consensus        29 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~-~~~l~~~   64 (283)
T PRK03001         29 GMLIALLFALGMNFFSYWFSDKMVLKMYN-AQEVDEN   64 (283)
T ss_pred             hHHHHHHHHHHHHHHHHHHhHHHHHHHcC-CEECCcc
Confidence            46666555555444443333 33444323 3556654


No 369
>TIGR03142 cytochro_ccmI cytochrome c-type biogenesis protein CcmI. This TPR repeat-containing protein is the CcmI protein (also called CycH) of c-type cytochrome biogenesis. CcmI is thought to act as an apo-cytochrome c chaperone. This model describes the N-terminal region of the protein, Members of this protein family
Probab=22.56  E-value=1e+02  Score=21.12  Aligned_cols=7  Identities=29%  Similarity=0.805  Sum_probs=3.4

Q ss_pred             hHHHHHH
Q psy17635         10 GWFIGML   16 (103)
Q Consensus        10 gWfIg~~   16 (103)
                      -|++..+
T Consensus         2 fWl~~a~    8 (117)
T TIGR03142         2 FWIVAAL    8 (117)
T ss_pred             HHHHHHH
Confidence            3755433


No 370
>PF05568 ASFV_J13L:  African swine fever virus J13L protein;  InterPro: IPR008385 This family consists of several African swine fever virus (ASFV) j13L proteins [, , ].
Probab=22.53  E-value=93  Score=23.89  Aligned_cols=26  Identities=23%  Similarity=0.535  Sum_probs=18.1

Q ss_pred             hhhhHHHHHHHHHHHHHHHHHHHhhe
Q psy17635          7 ATAGWFIGMLLAIAFLILVLILVCLI   32 (103)
Q Consensus         7 at~gWfIg~~~ai~lLllilli~C~i   32 (103)
                      ..-.+|-.-||.|++.+.|++|+.++
T Consensus        22 ~~psffsthm~tILiaIvVliiiiiv   47 (189)
T PF05568_consen   22 TPPSFFSTHMYTILIAIVVLIIIIIV   47 (189)
T ss_pred             CCccHHHHHHHHHHHHHHHHHHHHHH
Confidence            44578888899888777666555443


No 371
>PRK00523 hypothetical protein; Provisional
Probab=22.50  E-value=82  Score=20.98  Aligned_cols=13  Identities=23%  Similarity=0.335  Sum_probs=5.9

Q ss_pred             HHHHHHHHhheee
Q psy17635         22 LILVLILVCLIKR   34 (103)
Q Consensus        22 Lllilli~C~i~R   34 (103)
                      |++-+++-.|+-|
T Consensus        15 li~G~~~Gffiar   27 (72)
T PRK00523         15 LIVGGIIGYFVSK   27 (72)
T ss_pred             HHHHHHHHHHHHH
Confidence            3333444455544


No 372
>PF04507 DUF576:  Protein of unknown function, DUF576;  InterPro: IPR007595 This family contains several uncharacterised staphylococcal proteins. Members of this family are mostly predicted lipoproteins, found in Staphylococcus aureus but are also found clustered in Staphylococcus epidermidis.; PDB: 4EG9_A 4EGD_B.
Probab=22.23  E-value=29  Score=28.14  Aligned_cols=14  Identities=29%  Similarity=0.418  Sum_probs=0.0

Q ss_pred             HHHHHHHHHHHhhe
Q psy17635         19 IAFLILVLILVCLI   32 (103)
Q Consensus        19 i~lLllilli~C~i   32 (103)
                      |.+||||++|++|.
T Consensus        11 Is~liLii~I~GCg   24 (257)
T PF04507_consen   11 ISLLILIIFIGGCG   24 (257)
T ss_dssp             --------------
T ss_pred             HHHHhHheeeeecc
Confidence            44455555555443


No 373
>PHA02681 ORF089 virion membrane protein; Provisional
Probab=22.19  E-value=83  Score=21.85  Aligned_cols=6  Identities=17%  Similarity=0.528  Sum_probs=2.5

Q ss_pred             hheeec
Q psy17635         30 CLIKRN   35 (103)
Q Consensus        30 C~i~R~   35 (103)
                      ...||.
T Consensus        22 a~YRR~   27 (92)
T PHA02681         22 MMYRRS   27 (92)
T ss_pred             HHHHhc
Confidence            334444


No 374
>PF12751 Vac7:  Vacuolar segregation subunit 7;  InterPro: IPR024260 Vac7 is localised at the vacuole membrane, a location which is consistent with its involvement in vacuole morphology and inheritance []. Vac7 has been shown to function as an upstream regulator of the Fab1 lipid kinase pathway []. The Fab1 lipid pathway is important for correct regulation of membrane trafficking events.
Probab=22.10  E-value=91  Score=26.71  Aligned_cols=22  Identities=14%  Similarity=0.353  Sum_probs=0.0

Q ss_pred             HHHHHHHHHHHHHHHHHHHhhe
Q psy17635         11 WFIGMLLAIAFLILVLILVCLI   32 (103)
Q Consensus        11 WfIg~~~ai~lLllilli~C~i   32 (103)
                      ..+-+++.|++||+|..+++|+
T Consensus       302 ~~~c~~~~i~~lL~ig~~~gFv  323 (387)
T PF12751_consen  302 FASCIYLSILLLLVIGFAIGFV  323 (387)
T ss_pred             hhHHHHHHHHHHHHHHHHHHhh


No 375
>PF13074 DUF3938:  Protein of unknown function (DUF3938)
Probab=22.06  E-value=88  Score=21.63  Aligned_cols=14  Identities=43%  Similarity=0.814  Sum_probs=10.5

Q ss_pred             hHHHHHHHHHHHHH
Q psy17635         10 GWFIGMLLAIAFLI   23 (103)
Q Consensus        10 gWfIg~~~ai~lLl   23 (103)
                      .|||+...|=++-+
T Consensus        75 twfiafcladvfnl   88 (102)
T PF13074_consen   75 TWFIAFCLADVFNL   88 (102)
T ss_pred             HHHHHHHHHHHHHH
Confidence            69999888866544


No 376
>PRK13836 conjugal transfer protein TrbF; Provisional
Probab=22.02  E-value=1.3e+02  Score=22.76  Aligned_cols=15  Identities=27%  Similarity=0.507  Sum_probs=9.4

Q ss_pred             hhhHHHHHHHHHHHH
Q psy17635          8 TAGWFIGMLLAIAFL   22 (103)
Q Consensus         8 t~gWfIg~~~ai~lL   22 (103)
                      ...|+++.+.++++.
T Consensus        29 ~~~W~~~a~~~l~~a   43 (220)
T PRK13836         29 AAAWRIVGILGLTMA   43 (220)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            367987776554443


No 377
>PF13937 DUF4212:  Domain of unknown function (DUF4212)
Probab=21.76  E-value=43  Score=22.42  Aligned_cols=6  Identities=50%  Similarity=0.778  Sum_probs=2.3

Q ss_pred             CCcccc
Q psy17635         37 GGKYAV   42 (103)
Q Consensus        37 GgKY~V   42 (103)
                      -.||-|
T Consensus        74 D~~~gv   79 (81)
T PF13937_consen   74 DRKYGV   79 (81)
T ss_pred             HHHHCC
Confidence            334433


No 378
>PRK10856 cytoskeletal protein RodZ; Provisional
Probab=21.75  E-value=95  Score=25.42  Aligned_cols=26  Identities=15%  Similarity=0.418  Sum_probs=14.0

Q ss_pred             hhhHHHHHHHHHHHHHHHHHHHhhee
Q psy17635          8 TAGWFIGMLLAIAFLILVLILVCLIK   33 (103)
Q Consensus         8 t~gWfIg~~~ai~lLllilli~C~i~   33 (103)
                      ..+|+..+...|++++|.+..+++..
T Consensus       109 ~~~~~~~~~~lv~~vvl~l~~~wwwq  134 (331)
T PRK10856        109 RDGWLMTFTWLVLFVVIGLTGAWWWQ  134 (331)
T ss_pred             cCCchHHHHHHHHHHHHHHHHHHHhh
Confidence            44577655555555555554455553


No 379
>TIGR00383 corA magnesium Mg(2+) and cobalt Co(2+) transport protein (corA). The article in Microb Comp Genomics 1998;3(3):151-69 (Medline:98448512) discusses this family and suggests that some members may have functions other than Mg2+ transport.
Probab=21.70  E-value=70  Score=24.81  Aligned_cols=13  Identities=15%  Similarity=0.445  Sum_probs=4.9

Q ss_pred             HHHHHHHHHHHHH
Q psy17635         11 WFIGMLLAIAFLI   23 (103)
Q Consensus        11 WfIg~~~ai~lLl   23 (103)
                      |-.|..+++++.+
T Consensus       290 ~~~gy~~~l~~m~  302 (318)
T TIGR00383       290 WKYGYPAVLIVMA  302 (318)
T ss_pred             chhHHHHHHHHHH
Confidence            4334433333333


No 380
>PF07695 7TMR-DISM_7TM:  7TM diverse intracellular signalling;  InterPro: IPR011623 This entry represents the transmembrane region of the 7TM-DISM (7TM Receptors with Diverse Intracellular Signalling Modules) [].
Probab=21.63  E-value=1.1e+02  Score=21.07  Aligned_cols=11  Identities=36%  Similarity=0.531  Sum_probs=5.4

Q ss_pred             hheeecCCCcc
Q psy17635         30 CLIKRNRGGKY   40 (103)
Q Consensus        30 C~i~R~rGgKY   40 (103)
                      ...||+++.+|
T Consensus       144 ~~~~~~~~a~~  154 (205)
T PF07695_consen  144 AWRKGNRPARY  154 (205)
T ss_pred             HHHcCCccHHH
Confidence            43455555544


No 381
>PTZ00046 rifin; Provisional
Probab=21.60  E-value=88  Score=26.48  Aligned_cols=23  Identities=13%  Similarity=0.475  Sum_probs=10.8

Q ss_pred             HHHHHHHHHHHHHHHHhheeecC
Q psy17635         14 GMLLAIAFLILVLILVCLIKRNR   36 (103)
Q Consensus        14 g~~~ai~lLllilli~C~i~R~r   36 (103)
                      ..++.|||..+|+.++-=.||.|
T Consensus       322 iAIvVIVLIMvIIYLILRYRRKK  344 (358)
T PTZ00046        322 VAIVVIVLIMVIIYLILRYRRKK  344 (358)
T ss_pred             HHHHHHHHHHHHHHHHHHhhhcc
Confidence            33344444444444444456654


No 382
>PF03988 DUF347:  Repeat of Unknown Function (DUF347) ;  InterPro: IPR007136 This repeat is found as four tandem repeats in a family of bacterial membrane proteins. Each repeat contains two transmembrane regions and a conserved tryptophan.
Probab=21.54  E-value=1.5e+02  Score=18.07  Aligned_cols=17  Identities=41%  Similarity=0.366  Sum_probs=7.0

Q ss_pred             HHHHHHHHHHHhheeec
Q psy17635         19 IAFLILVLILVCLIKRN   35 (103)
Q Consensus        19 i~lLllilli~C~i~R~   35 (103)
                      ++++++++.+....+|+
T Consensus        37 ~~~l~~~~~~~~~~~~~   53 (55)
T PF03988_consen   37 AALLAVVLALWYRSKRY   53 (55)
T ss_pred             HHHHHHHHHHHHHHhcc
Confidence            33333334444444444


No 383
>PF10731 Anophelin:  Thrombin inhibitor from mosquito;  InterPro: IPR018932  Members of this family are all inhibitors of thrombin, the peptidase that is at the end of the blood coagulation cascade and which creates the clot by cleaving fibrinogen. The interaction between thrombin and fibrinogen involves two different areas of contact - via the thrombin active site and via a second substrate-binding site known as an exosite. The inhibitor acts by blocking the exosite, rather than by interacting with the active site. The inhibitors are from mosquitoes that feed on human blood and which, by inhibiting thrombin, prevent the blood from clotting and keep it flowing. 
Probab=21.54  E-value=1.8e+02  Score=19.00  Aligned_cols=10  Identities=30%  Similarity=0.311  Sum_probs=5.2

Q ss_pred             hcCCCCCCCC
Q psy17635         48 AHGRHDYPEE   57 (103)
Q Consensus        48 ~~g~~~~~dd   57 (103)
                      ++|+.|--||
T Consensus        26 a~GeeP~YDE   35 (65)
T PF10731_consen   26 APGEEPSYDE   35 (65)
T ss_pred             CCCCCCCcCc
Confidence            5666654333


No 384
>PF15159 PIG-Y:  Phosphatidylinositol N-acetylglucosaminyltransferase subunit Y
Probab=21.44  E-value=1.7e+02  Score=19.21  Aligned_cols=21  Identities=19%  Similarity=0.490  Sum_probs=13.8

Q ss_pred             hhHHHHHHHHHHHHHHHHHHH
Q psy17635          9 AGWFIGMLLAIAFLILVLILV   29 (103)
Q Consensus         9 ~gWfIg~~~ai~lLllilli~   29 (103)
                      .||++.+++++.++.+.-.++
T Consensus         2 ~G~~~il~~~v~fv~~~y~~v   22 (72)
T PF15159_consen    2 LGWLLILFTLVFFVGFFYAAV   22 (72)
T ss_pred             cchHHHHHHHHHHHHHHHHHH
Confidence            378888877766666554443


No 385
>PF15145 DUF4577:  Domain of unknown function (DUF4577)
Probab=21.27  E-value=1.5e+02  Score=21.74  Aligned_cols=35  Identities=31%  Similarity=0.584  Sum_probs=16.7

Q ss_pred             hHHHHHHHHHH-HHHHHHHHHhheeecCCCcc-cchhh
Q psy17635         10 GWFIGMLLAIA-FLILVLILVCLIKRNRGGKY-AVHER   45 (103)
Q Consensus        10 gWfIg~~~ai~-lLllilli~C~i~R~rGgKY-~VkeK   45 (103)
                      -+|+|+++++. -|.|+.+++++| -+-|.|- .|..+
T Consensus        62 lffvglii~LivSLaLVsFvIFLi-iQTgnkMddvSrR   98 (128)
T PF15145_consen   62 LFFVGLIIVLIVSLALVSFVIFLI-IQTGNKMDDVSRR   98 (128)
T ss_pred             ehHHHHHHHHHHHHHHHHHHHHhe-eeccchHHHHHHH
Confidence            46788777733 333443333333 2334443 24444


No 386
>PF04531 Phage_holin_1:  Bacteriophage holin;  InterPro: IPR006485 Phage proteins for bacterial lysis typically include a membrane-disrupting protein, or holin, and one or more cell wall degrading enzymes that reach the cell wall because of holin action. Holins are found in a large number of mutually non-homologous families.  This entry is represented by the Bacteriophage phi-LC3, holin. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.
Probab=21.20  E-value=1.5e+02  Score=19.60  Aligned_cols=23  Identities=13%  Similarity=0.273  Sum_probs=15.0

Q ss_pred             hhhhhHHHHHHHHHHHHHHHHHH
Q psy17635          6 VATAGWFIGMLLAIAFLILVLIL   28 (103)
Q Consensus         6 ~at~gWfIg~~~ai~lLllilli   28 (103)
                      +.+-.|.++++.+|++++..++.
T Consensus         8 ~kN~~~w~ali~~i~l~vq~~~~   30 (84)
T PF04531_consen    8 FKNKAFWVALISAILLLVQQVGG   30 (84)
T ss_pred             ccCHHHHHHHHHHHHHHHHHHHH
Confidence            44556667777777777665544


No 387
>COG4698 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=20.91  E-value=1.5e+02  Score=23.26  Aligned_cols=38  Identities=11%  Similarity=0.217  Sum_probs=21.3

Q ss_pred             hHHHHHHHHHHHHHHHH-HHHhheeecCCCcccchhhhh
Q psy17635         10 GWFIGMLLAIAFLILVL-ILVCLIKRNRGGKYAVHEREA   47 (103)
Q Consensus        10 gWfIg~~~ai~lLllil-li~C~i~R~rGgKY~VkeKE~   47 (103)
                      .|=-+.+|.++|+++++ +|+.++.--|-.-|.+.+++.
T Consensus        10 ~WKw~f~iLLAln~l~~~~i~~~vlsp~ee~t~~~~a~~   48 (197)
T COG4698          10 YWKWLFFILLALNTLLAVLIALFVLSPREEPTHLEDASE   48 (197)
T ss_pred             HHHHHHHHHHHHHHHHHHHhheeeccCCCCCchhhccCc
Confidence            46666666666666554 333444334444666666654


No 388
>PF13131 DUF3951:  Protein of unknown function (DUF3951)
Probab=20.91  E-value=1.5e+02  Score=18.71  Aligned_cols=11  Identities=18%  Similarity=0.365  Sum_probs=4.8

Q ss_pred             HHHHHHHHHHH
Q psy17635         17 LAIAFLILVLI   27 (103)
Q Consensus        17 ~ai~lLllill   27 (103)
                      .++++++||.+
T Consensus        11 ~~~~I~~lIgf   21 (53)
T PF13131_consen   11 FTIFIFFLIGF   21 (53)
T ss_pred             HHHHHHHHHHH
Confidence            33444444444


No 389
>COG3462 Predicted membrane protein [Function unknown]
Probab=20.91  E-value=1.2e+02  Score=21.95  Aligned_cols=18  Identities=22%  Similarity=0.622  Sum_probs=9.5

Q ss_pred             HHHHHHHHHHHHHHHHHH
Q psy17635         11 WFIGMLLAIAFLILVLIL   28 (103)
Q Consensus        11 WfIg~~~ai~lLllilli   28 (103)
                      |+|-.+.+++.++++++.
T Consensus        50 ~lImpI~~~vvli~lvvf   67 (117)
T COG3462          50 WLIMPIFWAVVLIFLVVF   67 (117)
T ss_pred             HHHHHHHHHHHHHHHHHH
Confidence            677666664444444333


No 390
>TIGR03746 conj_TIGR03746 integrating conjugative element protein, PFL_4703 family. Members of this protein family are found occasionally on plasmids such as the Pseudomonas putida TOL plasmid pWWO_p085. Usually, however, they are found on the bacterial main chromosome in regions flanked by markers of conjugative transfer and/or transposition. The function is unknown.
Probab=20.87  E-value=1.3e+02  Score=23.62  Aligned_cols=23  Identities=22%  Similarity=0.154  Sum_probs=12.9

Q ss_pred             hhhhhHHHHHHHHHHHHHHHHHH
Q psy17635          6 VATAGWFIGMLLAIAFLILVLIL   28 (103)
Q Consensus         6 ~at~gWfIg~~~ai~lLllilli   28 (103)
                      +.+..++|++|++|++++.+.+.
T Consensus        12 i~tLr~~~~~L~~i~~~l~~gw~   34 (202)
T TIGR03746        12 ILTLRIAIGLLALILLALWFGWR   34 (202)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHH
Confidence            44556666666665555544433


No 391
>TIGR01477 RIFIN variant surface antigen, rifin family. This model represents the rifin branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of rifin sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 20 bits.
Probab=20.85  E-value=92  Score=26.32  Aligned_cols=24  Identities=21%  Similarity=0.650  Sum_probs=11.2

Q ss_pred             HHHHHHHHHHHHHHHHHHhheeecC
Q psy17635         12 FIGMLLAIAFLILVLILVCLIKRNR   36 (103)
Q Consensus        12 fIg~~~ai~lLllilli~C~i~R~r   36 (103)
                      +|++++ |||..+|+.++-=.||.|
T Consensus       316 iIAIvv-IVLIMvIIYLILRYRRKK  339 (353)
T TIGR01477       316 IIAILI-IVLIMVIIYLILRYRRKK  339 (353)
T ss_pred             HHHHHH-HHHHHHHHHHHHHhhhcc
Confidence            344444 444444444444466654


No 392
>PF07009 DUF1312:  Protein of unknown function (DUF1312);  InterPro: IPR010739 This family consists of several bacterial proteins of around 120 residues in length. The function of this family is unknown.; PDB: 4ESN_B 1NPP_B 1M1G_D 1NPR_A 1M1H_A 2KPP_A 3LD7_C.
Probab=20.84  E-value=31  Score=23.62  Aligned_cols=9  Identities=67%  Similarity=1.088  Sum_probs=3.2

Q ss_pred             eecCCCccc
Q psy17635         33 KRNRGGKYA   41 (103)
Q Consensus        33 ~R~rGgKY~   41 (103)
                      ..+.+++|.
T Consensus        21 ~~~~~~~~~   29 (113)
T PF07009_consen   21 GNNSGGKYA   29 (113)
T ss_dssp             ----SEEEE
T ss_pred             ccCCCCeEE
Confidence            456666665


No 393
>COG5522 Predicted integral membrane protein [Function unknown]
Probab=20.79  E-value=60  Score=26.03  Aligned_cols=33  Identities=24%  Similarity=0.471  Sum_probs=22.0

Q ss_pred             HHHHHHHHHHHHHHHHhheeecCCCcccchhhh
Q psy17635         14 GMLLAIAFLILVLILVCLIKRNRGGKYAVHERE   46 (103)
Q Consensus        14 g~~~ai~lLllilli~C~i~R~rGgKY~VkeKE   46 (103)
                      |+.|+++++++...++.++.|.=|.+|.-=.||
T Consensus       155 gl~~svl~~~~lg~~~lfinrrLGtNYlylsk~  187 (236)
T COG5522         155 GLVMSVLVAISLGIMCLFINRRLGTNYLYLSKE  187 (236)
T ss_pred             chhHHHHHHHHHHHHHHHHHHHhcCceeEeecC
Confidence            355665555555666667889999999754443


No 394
>PF14029 DUF4244:  Protein of unknown function (DUF4244)
Probab=20.79  E-value=2e+02  Score=18.10  Aligned_cols=26  Identities=27%  Similarity=0.363  Sum_probs=20.2

Q ss_pred             CcchhhhhHHHHHHHHHHHHHHHHHH
Q psy17635          3 SDTVATAGWFIGMLLAIAFLILVLIL   28 (103)
Q Consensus         3 ~~~~at~gWfIg~~~ai~lLllilli   28 (103)
                      ..+.+|.=|-||.+.|.+|.-++..+
T Consensus        12 d~GmsTaEYAvgtlAA~afA~vL~~v   37 (56)
T PF14029_consen   12 DAGMSTAEYAVGTLAAAAFAGVLYKV   37 (56)
T ss_pred             cCchhHHHHHHHHHHHHHHHHHHHHH
Confidence            35688899999999998887665544


No 395
>PF10601 zf-LITAF-like:  LITAF-like zinc ribbon domain;  InterPro: IPR006629 Members of this family display a conserved zinc ribbon structure [] with the motif C-XX-C- separated from the more C-terminal HX-C(P)X-C-X4-G-R motif by a variable region of usually 25-30 (hydrophobic) residues. Although it belongs to one of the zinc finger's fold groups (zinc ribbon), this particular domain was first identified in LPS-induced tumour necrosis alpha factor (LITAF) which is produced in mammalian cells after being challenged with lipopolysaccharide (LPS). The hydrophobic region probably inserts into the membrane rather than traversing it. Such an insertion brings together the N- and C-terminal C-XX-C motifs to form a compact Zn2+-binding structure []. 
Probab=20.72  E-value=1.6e+02  Score=18.38  Aligned_cols=8  Identities=13%  Similarity=0.252  Sum_probs=3.4

Q ss_pred             hhHHHHHH
Q psy17635          9 AGWFIGML   16 (103)
Q Consensus         9 ~gWfIg~~   16 (103)
                      .+|+++++
T Consensus        28 ~t~~~~~~   35 (73)
T PF10601_consen   28 MTYICAAL   35 (73)
T ss_pred             HHHHHHHH
Confidence            34544443


No 396
>PHA02911 C-type lectin-like protein; Provisional
Probab=20.63  E-value=1.1e+02  Score=24.36  Aligned_cols=19  Identities=21%  Similarity=0.392  Sum_probs=10.0

Q ss_pred             hHHHHHHHHHHHHHHHHHH
Q psy17635         10 GWFIGMLLAIAFLILVLIL   28 (103)
Q Consensus        10 gWfIg~~~ai~lLllilli   28 (103)
                      .-+|+.||.|..+++|.|.
T Consensus        30 ~~~~~~~~~~~~~~~~~~~   48 (213)
T PHA02911         30 ALIIACLCLILTIIIICLF   48 (213)
T ss_pred             HHHHHHHHHHHHHHHHHHH
Confidence            3456666665555544443


No 397
>PF06667 PspB:  Phage shock protein B;  InterPro: IPR009554 This family consists of several bacterial phage shock protein B (PspB) sequences. The phage shock protein (psp) operon is induced in response to heat, ethanol, osmotic shock and infection by filamentous bacteriophages []. Expression of the operon requires the alternative sigma factor sigma54 and the transcriptional activator PspF. In addition, PspA plays a negative regulatory role, and the integral-membrane proteins PspB and PspC play a positive one [].; GO: 0006355 regulation of transcription, DNA-dependent, 0009271 phage shock
Probab=20.61  E-value=1.3e+02  Score=19.90  Aligned_cols=9  Identities=11%  Similarity=0.062  Sum_probs=4.0

Q ss_pred             eecCCCccc
Q psy17635         33 KRNRGGKYA   41 (103)
Q Consensus        33 ~R~rGgKY~   41 (103)
                      |++.++.-+
T Consensus        29 k~~~~~gLs   37 (75)
T PF06667_consen   29 KWKSSQGLS   37 (75)
T ss_pred             hcccCCCCC
Confidence            444444444


No 398
>TIGR02595 PEP_exosort PEP-CTERM putative exosortase interaction domain. This model describes a 25-residue domain that includes a near-invariant Pro-Glu-Pro (PEP) motif, a thirteen residue strongly hydrophobic sequence likely to span the membrane, and a five-residue strongly basic motif that often contains four Arg residues. In nearly every case, this motif is found within nine residues, and usually within five residues, of the extreme C-terminus of the protein. Proteins with this motif typically have signal sequences at the N-terminus. This region appears many times per genome or not at all, and co-occurs in genomes with a proposed protein-sorting integral membrane protein we designate exosortase (see TIGR02602). PEP-CTERM proteins frequently are poorly conserved, Ser/Thr-rich proteins and may become extensively modified proteinaceous constituents of extracellular material in bacterial biofilms.
Probab=20.58  E-value=80  Score=16.64  Aligned_cols=9  Identities=33%  Similarity=0.497  Sum_probs=3.8

Q ss_pred             HHhheeecC
Q psy17635         28 LVCLIKRNR   36 (103)
Q Consensus        28 i~C~i~R~r   36 (103)
                      .....+|.|
T Consensus        15 ~~~~~rrrk   23 (26)
T TIGR02595        15 GFLLLRRRR   23 (26)
T ss_pred             HHHHHhhcc
Confidence            334444443


No 399
>PF07790 DUF1628:  Protein of unknown function (DUF1628);  InterPro: IPR012859 The sequences making up this family are derived from hypothetical proteins of unknown function expressed by various archaeal species. The region in question is approximately 160 residues long. 
Probab=20.54  E-value=1.4e+02  Score=18.85  Aligned_cols=13  Identities=38%  Similarity=0.810  Sum_probs=6.1

Q ss_pred             HHHHHHHHHHHHH
Q psy17635         13 IGMLLAIAFLILV   25 (103)
Q Consensus        13 Ig~~~ai~lLlli   25 (103)
                      ||+++.|++.+++
T Consensus         7 iGviLliaitVil   19 (80)
T PF07790_consen    7 IGVILLIAITVIL   19 (80)
T ss_pred             HHHHHHHHHHHHH
Confidence            4555544444433


No 400
>TIGR02975 phageshock_pspG phage shock protein G. This protein previously was designated yjbO in E. coli. It is found only in genomes that have the phage shock operon (psp), but only rarely is encoded near other psp genes. The psp regulon is upregulated in response to a number of stress conditions, including ethanol, expression of the filamentous phage secretin protein IV and other secretins, and heat shock.
Probab=20.54  E-value=1.8e+02  Score=18.97  Aligned_cols=28  Identities=32%  Similarity=0.466  Sum_probs=13.6

Q ss_pred             hhhhHHHHHHHHHHH-----HHHHHHHHhheee
Q psy17635          7 ATAGWFIGMLLAIAF-----LILVLILVCLIKR   34 (103)
Q Consensus         7 at~gWfIg~~~ai~l-----Lllilli~C~i~R   34 (103)
                      ++.-++++=|.++++     |++++..++++|+
T Consensus        30 a~~vm~l~Gm~~lviKLLPWLil~~v~vW~~r~   62 (64)
T TIGR02975        30 AVLFMALGGMFALMIKLLPWLILAVVVVWFIKS   62 (64)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Confidence            444555555555443     2334444555554


No 401
>COG4059 MtrE Tetrahydromethanopterin S-methyltransferase, subunit E [Coenzyme metabolism]
Probab=20.45  E-value=95  Score=25.42  Aligned_cols=33  Identities=33%  Similarity=0.451  Sum_probs=15.5

Q ss_pred             hhhHHHHHHHHHHHHHHHHHH--HhheeecCCCcc
Q psy17635          8 TAGWFIGMLLAIAFLILVLIL--VCLIKRNRGGKY   40 (103)
Q Consensus         8 t~gWfIg~~~ai~lLllilli--~C~i~R~rGgKY   40 (103)
                      +.||+-.++..|.+++||++=  +=.-.||+=|.|
T Consensus       260 ~~G~isiv~Gliiv~iLii~N~kvEv~ARn~YGpY  294 (304)
T COG4059         260 QGGWISIVAGLIIVLILIIWNRKVEVKARNAYGPY  294 (304)
T ss_pred             ccchhhhHHHHHHHHHHHHhcchhhhhhhhccCCc
Confidence            345665555544444443321  112236666666


No 402
>PF14985 TM140:  TM140 protein family
Probab=20.44  E-value=1.3e+02  Score=23.16  Aligned_cols=26  Identities=31%  Similarity=0.411  Sum_probs=15.0

Q ss_pred             HHHHHHHHHHHHHH-HHHHhheeecCC
Q psy17635         12 FIGMLLAIAFLILV-LILVCLIKRNRG   37 (103)
Q Consensus        12 fIg~~~ai~lLlli-lli~C~i~R~rG   37 (103)
                      |.++..|=++|+|. +.++.|-.|.+-
T Consensus       147 FlAL~~AqalliLlL~~~v~fp~r~~~  173 (181)
T PF14985_consen  147 FLALGGAQALLILLLMATVVFPPRAKR  173 (181)
T ss_pred             HHHHHHHHHHHHHHHHHHHhcCCcccc
Confidence            67788885555544 444555555443


No 403
>PF04246 RseC_MucC:  Positive regulator of sigma(E), RseC/MucC;  InterPro: IPR007359 This bacterial family of integral membrane proteins represents a positive regulator of the sigma(E) transcription factor, namely RseC/MucC. The sigma(E) transcription factor is up-regulated by cell envelope protein misfolding, and regulates the expression of genes that are collectively termed ECF (devoted to Extra-Cellular Functions) []. In Pseudomonas aeruginosa, derepression of sigma(E) is associated with the alginate-overproducing phenotype characteristic of chronic respiratory tract colonization in cystic fibrosis patients. The mechanism by which RseC/MucC positively regulates the sigma(E) transcription factor is unknown. RseC is also thought to have a role in thiamine biosynthesis in Salmonella typhimurium []. In addition, this family also includes an N-terminal part of RnfF, a Rhodobacter capsulatus protein, of unknown function, that is essential for nitrogen fixation. This protein also contains a domain found in ApbE protein IPR003374 from INTERPRO, which is itself involved in thiamine biosynthesis.
Probab=20.44  E-value=97  Score=21.46  Aligned_cols=24  Identities=21%  Similarity=0.227  Sum_probs=13.0

Q ss_pred             HHHHHHHHHHHHHHHhheeecCCC
Q psy17635         15 MLLAIAFLILVLILVCLIKRNRGG   38 (103)
Q Consensus        15 ~~~ai~lLllilli~C~i~R~rGg   38 (103)
                      ++++++.+++.++++-+..|+...
T Consensus       100 ~l~~l~~l~~~~~~~~~~~~~~~~  123 (135)
T PF04246_consen  100 ILGGLLGLALGFLILRLFDRRLKK  123 (135)
T ss_pred             HHHHHHHHHHHHHHHHHHHhhhcc
Confidence            445555555556666555554443


No 404
>PRK10884 SH3 domain-containing protein; Provisional
Probab=20.42  E-value=1e+02  Score=23.72  Aligned_cols=26  Identities=31%  Similarity=0.188  Sum_probs=16.6

Q ss_pred             hhhhHHHHHHHHHHHHHHHHHHHhhe
Q psy17635          7 ATAGWFIGMLLAIAFLILVLILVCLI   32 (103)
Q Consensus         7 at~gWfIg~~~ai~lLllilli~C~i   32 (103)
                      ..+.|+.|-++|.+-|||-+++--++
T Consensus       170 ~~~wf~~Gg~v~~~GlllGlilp~l~  195 (206)
T PRK10884        170 IMQWFMYGGGVAGIGLLLGLLLPHLI  195 (206)
T ss_pred             HHHHHHHchHHHHHHHHHHHHhcccc
Confidence            34566677777766666666665656


No 405
>PHA02851 EEV glycoprotein; Provisional
Probab=20.41  E-value=81  Score=24.53  Aligned_cols=15  Identities=27%  Similarity=0.509  Sum_probs=6.0

Q ss_pred             HHHHHHHHHHHHHHH
Q psy17635         15 MLLAIAFLILVLILV   29 (103)
Q Consensus        15 ~~~ai~lLllilli~   29 (103)
                      +-|.+++|++.++.+
T Consensus         8 vscSliILIlY~llI   22 (223)
T PHA02851          8 VSCSLIILIVYLFLI   22 (223)
T ss_pred             HHhHHHHHHHHHHHH
Confidence            334444444443333


No 406
>PRK11466 hybrid sensory histidine kinase TorS; Provisional
Probab=20.33  E-value=87  Score=27.54  Aligned_cols=24  Identities=21%  Similarity=0.335  Sum_probs=11.0

Q ss_pred             hhHHHHHHHHHHHHHHHHHHHhhe
Q psy17635          9 AGWFIGMLLAIAFLILVLILVCLI   32 (103)
Q Consensus         9 ~gWfIg~~~ai~lLllilli~C~i   32 (103)
                      ..|++.++.+++++++++++..++
T Consensus       329 ~~~~~~~~~~~~~l~~~~~~~~~~  352 (914)
T PRK11466        329 GQYSLLLLGMVSLCALILILWRVV  352 (914)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHH
Confidence            345555555544444444443333


No 407
>PRK06287 cobalt transport protein CbiN; Validated
Probab=20.24  E-value=1.3e+02  Score=20.89  Aligned_cols=9  Identities=11%  Similarity=0.224  Sum_probs=3.9

Q ss_pred             HHhheeecC
Q psy17635         28 LVCLIKRNR   36 (103)
Q Consensus        28 i~C~i~R~r   36 (103)
                      +-.+++|++
T Consensus        97 ~~~~l~r~~  105 (107)
T PRK06287         97 VGKIFKKKS  105 (107)
T ss_pred             HHHHHhccc
Confidence            334445443


No 408
>PRK14759 potassium-transporting ATPase subunit F; Provisional
Probab=20.24  E-value=1.8e+02  Score=16.25  Aligned_cols=19  Identities=16%  Similarity=0.342  Sum_probs=8.0

Q ss_pred             HHHHHHHHHHHHHHHHhhe
Q psy17635         14 GMLLAIAFLILVLILVCLI   32 (103)
Q Consensus        14 g~~~ai~lLllilli~C~i   32 (103)
                      .+-.++++.+++=|+..++
T Consensus         6 ~l~~~va~~L~vYL~~ALl   24 (29)
T PRK14759          6 SLAGAVSLGLLIYLTYALL   24 (29)
T ss_pred             HHHHHHHHHHHHHHHHHHh
Confidence            3334444444444444433


No 409
>PF11084 DUF2621:  Protein of unknown function (DUF2621);  InterPro: IPR020203 This entry represents a group of uncharacterised proteins.
Probab=20.22  E-value=1.4e+02  Score=22.29  Aligned_cols=20  Identities=25%  Similarity=0.665  Sum_probs=12.6

Q ss_pred             hhHHHHHHHHHHHHHHHHHH
Q psy17635          9 AGWFIGMLLAIAFLILVLIL   28 (103)
Q Consensus         9 ~gWfIg~~~ai~lLllilli   28 (103)
                      ++||..+|..-+++++.+..
T Consensus         3 ~~wFm~fI~~W~~vli~l~~   22 (141)
T PF11084_consen    3 NGWFMWFILFWVVVLIGLMA   22 (141)
T ss_pred             hhHHHHHHHHHHHHHHHHHH
Confidence            57998888775554444433


No 410
>COG1704 LemA Uncharacterized conserved protein [Function unknown]
Probab=20.18  E-value=1.2e+02  Score=23.52  Aligned_cols=14  Identities=14%  Similarity=0.425  Sum_probs=6.2

Q ss_pred             hHHHHHHHHHHHHH
Q psy17635         10 GWFIGMLLAIAFLI   23 (103)
Q Consensus        10 gWfIg~~~ai~lLl   23 (103)
                      .|+|++.+.+++++
T Consensus         3 ~~~i~l~vi~il~l   16 (185)
T COG1704           3 RFLIILAVIVILLL   16 (185)
T ss_pred             hhHHHHHHHHHHHH
Confidence            35555544433333


No 411
>PRK13865 type IV secretion system protein VirB8; Provisional
Probab=20.09  E-value=1.5e+02  Score=23.16  Aligned_cols=12  Identities=0%  Similarity=-0.205  Sum_probs=7.2

Q ss_pred             hhhhHHHHHHHH
Q psy17635          7 ATAGWFIGMLLA   18 (103)
Q Consensus         7 at~gWfIg~~~a   18 (103)
                      ...+|+++.+++
T Consensus        37 ~k~aw~va~~~~   48 (229)
T PRK13865         37 SRVLAAVAGIAV   48 (229)
T ss_pred             HHHHHHHHHHHH
Confidence            345787776544


No 412
>KOG2052|consensus
Probab=20.07  E-value=73  Score=28.23  Aligned_cols=20  Identities=30%  Similarity=0.446  Sum_probs=10.4

Q ss_pred             hhHHHHHHHHHHHHHHHHHH
Q psy17635          9 AGWFIGMLLAIAFLILVLIL   28 (103)
Q Consensus         9 ~gWfIg~~~ai~lLllilli   28 (103)
                      ..|++.++..++++++++++
T Consensus       131 ~~~~~ii~~~~~~~~~~~~~  150 (513)
T KOG2052|consen  131 VELVAIILGPVVFLLLIAIL  150 (513)
T ss_pred             HHHHHHHHHHHHHHHHHHHH
Confidence            34555555555555544444


No 413
>KOG4221|consensus
Probab=20.07  E-value=37  Score=33.24  Aligned_cols=13  Identities=46%  Similarity=0.779  Sum_probs=5.8

Q ss_pred             HHHHHHHHhheee
Q psy17635         22 LILVLILVCLIKR   34 (103)
Q Consensus        22 Lllilli~C~i~R   34 (103)
                      ++++++++||-+|
T Consensus      1073 ~~~~~v~v~C~~r 1085 (1381)
T KOG4221|consen 1073 LVLLLVLVTCRRR 1085 (1381)
T ss_pred             HHHhhheeEEecc
Confidence            3344444554444


No 414
>TIGR03321 alt_F1F0_F0_B alternate F1F0 ATPase, F0 subunit B. CC and in principle may run in either direction. This model represents the F0 subunit B of this apparent second ATP synthase.
Probab=20.07  E-value=1e+02  Score=23.66  Aligned_cols=22  Identities=41%  Similarity=0.741  Sum_probs=11.8

Q ss_pred             HHHHHHHHHHHHHHHHHHHhhe
Q psy17635         11 WFIGMLLAIAFLILVLILVCLI   32 (103)
Q Consensus        11 WfIg~~~ai~lLllilli~C~i   32 (103)
                      |+..+.-+|.|++|++++..|.
T Consensus         5 ~~t~~~qiInFlil~~lL~kfl   26 (246)
T TIGR03321         5 WFTVIAQLINFLILVWLLKRFL   26 (246)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHh
Confidence            5444444566666665554444


No 415
>PRK09848 glucuronide transporter; Provisional
Probab=20.03  E-value=1.4e+02  Score=23.75  Aligned_cols=9  Identities=11%  Similarity=0.394  Sum_probs=5.6

Q ss_pred             cccchhhhh
Q psy17635         39 KYAVHEREA   47 (103)
Q Consensus        39 KY~VkeKE~   47 (103)
                      +|+..+|+.
T Consensus       426 ~y~l~~~~~  434 (448)
T PRK09848        426 FYPLTDKKF  434 (448)
T ss_pred             HeeCCHHHH
Confidence            577766653


Done!