Query psy17635
Match_columns 103
No_of_seqs 99 out of 124
Neff 4.5
Searched_HMMs 46136
Date Fri Aug 16 19:04:53 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy17635.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/17635hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF13882 Bravo_FIGEY: Bravo-li 99.9 4.2E-26 9.1E-31 156.3 4.6 70 33-103 1-75 (90)
2 KOG3513|consensus 98.9 3.5E-11 7.7E-16 109.2 -4.9 99 5-103 922-1027(1051)
3 PF12273 RCR: Chitin synthesis 96.6 0.0008 1.7E-08 47.5 1.0 24 11-34 2-25 (130)
4 PF05454 DAG1: Dystroglycan (D 96.2 0.0011 2.5E-08 53.6 0.0 35 8-42 142-180 (290)
5 PF01299 Lamp: Lysosome-associ 95.3 0.02 4.4E-07 45.4 3.5 27 13-39 275-301 (306)
6 PF07204 Orthoreo_P10: Orthore 94.2 0.027 5.9E-07 39.5 1.6 43 5-47 36-81 (98)
7 PF15102 TMEM154: TMEM154 prot 94.2 0.025 5.5E-07 42.1 1.5 15 19-33 66-80 (146)
8 PF12877 DUF3827: Domain of un 94.1 0.023 4.9E-07 50.8 1.3 32 6-37 265-299 (684)
9 PF01102 Glycophorin_A: Glycop 93.7 0.047 1E-06 39.3 2.2 28 7-36 65-92 (122)
10 PF05393 Hum_adeno_E3A: Human 93.0 0.082 1.8E-06 36.8 2.3 24 13-36 37-60 (94)
11 PF05568 ASFV_J13L: African sw 92.9 0.34 7.4E-06 37.0 5.7 56 6-63 27-82 (189)
12 PF14979 TMEM52: Transmembrane 92.2 0.17 3.6E-06 38.1 3.2 22 7-28 16-37 (154)
13 PHA03164 hypothetical protein; 91.8 0.24 5.3E-06 33.9 3.4 28 12-39 61-88 (88)
14 PF14575 EphA2_TM: Ephrin type 91.6 0.26 5.6E-06 32.4 3.3 25 11-35 1-26 (75)
15 PF15330 SIT: SHP2-interacting 91.0 0.15 3.2E-06 35.9 1.7 30 11-40 1-34 (107)
16 PF12273 RCR: Chitin synthesis 89.7 0.12 2.5E-06 36.5 0.4 20 10-29 4-23 (130)
17 PF01034 Syndecan: Syndecan do 89.0 0.11 2.4E-06 33.9 -0.1 14 33-46 37-50 (64)
18 PF15050 SCIMP: SCIMP protein 89.0 0.38 8.2E-06 35.3 2.6 35 8-42 6-43 (133)
19 PF06024 DUF912: Nucleopolyhed 88.6 0.48 1E-05 32.5 2.8 28 9-37 63-91 (101)
20 PF02439 Adeno_E3_CR2: Adenovi 86.8 1.2 2.6E-05 26.4 3.4 25 12-36 9-34 (38)
21 PF08173 YbgT_YccB: Membrane b 86.7 1.1 2.3E-05 25.0 2.9 19 9-27 4-22 (28)
22 PHA02867 C-type lectin protein 86.5 0.59 1.3E-05 35.2 2.4 25 9-33 13-37 (167)
23 KOG3956|consensus 85.9 0.66 1.4E-05 38.6 2.6 26 16-41 8-33 (359)
24 PF10577 UPF0560: Uncharacteri 85.1 0.94 2E-05 41.6 3.4 30 10-39 273-303 (807)
25 PHA02650 hypothetical protein; 84.4 2.2 4.7E-05 29.1 4.1 34 6-39 47-80 (81)
26 TIGR02106 cyd_oper_ybgT cyd op 84.1 1.6 3.5E-05 24.6 2.9 19 9-27 4-22 (30)
27 PF12297 EVC2_like: Ellis van 84.0 1.4 3.1E-05 37.8 3.9 38 2-39 61-98 (429)
28 PF04478 Mid2: Mid2 like cell 83.4 0.44 9.5E-06 35.8 0.5 16 26-41 68-83 (154)
29 TIGR01167 LPXTG_anchor LPXTG-m 83.1 2.4 5.1E-05 23.0 3.3 16 2-17 3-18 (34)
30 PHA02947 S-S bond formation pa 83.0 0.78 1.7E-05 36.2 1.8 34 5-40 174-207 (215)
31 COG4818 Predicted membrane pro 82.6 1.1 2.4E-05 31.7 2.3 32 8-41 57-88 (105)
32 PHA02844 putative transmembran 82.5 2.6 5.6E-05 28.4 3.9 29 6-34 46-74 (75)
33 PF10717 ODV-E18: Occlusion-de 82.3 1.5 3.2E-05 30.2 2.7 25 15-39 27-51 (85)
34 PF02009 Rifin_STEVOR: Rifin/s 82.3 0.78 1.7E-05 37.4 1.6 26 9-34 255-280 (299)
35 PF05545 FixQ: Cbb3-type cytoc 81.7 3.9 8.5E-05 24.3 4.2 32 9-40 7-38 (49)
36 PF00957 Synaptobrevin: Synapt 80.7 1.9 4.1E-05 28.1 2.8 6 25-30 83-88 (89)
37 PF07172 GRP: Glycine rich pro 78.8 2.9 6.3E-05 28.7 3.3 14 44-57 30-43 (95)
38 PF14991 MLANA: Protein melan- 78.0 0.34 7.4E-06 35.0 -1.6 31 21-51 35-65 (118)
39 PF09451 ATG27: Autophagy-rela 77.3 3.2 6.9E-05 32.6 3.5 34 6-39 197-231 (268)
40 PRK14740 kdbF potassium-transp 77.1 5 0.00011 22.5 3.3 26 10-35 2-27 (29)
41 PF15069 FAM163: FAM163 family 76.9 1.8 3.9E-05 32.2 1.9 20 13-32 9-28 (143)
42 PF07271 Cytadhesin_P30: Cytad 76.8 0.63 1.4E-05 38.0 -0.6 42 5-48 62-109 (279)
43 MTH00036 ATP8 ATP synthase F0 76.6 4.7 0.0001 25.2 3.5 23 1-25 1-23 (54)
44 PF12768 Rax2: Cortical protei 76.0 1.2 2.7E-05 35.7 0.9 26 13-38 232-260 (281)
45 PF13209 DUF4017: Protein of u 75.8 1.9 4.2E-05 27.7 1.6 33 4-36 25-59 (60)
46 PF14584 DUF4446: Protein of u 75.4 1.8 3.8E-05 32.0 1.5 22 11-32 2-23 (151)
47 PF00974 Rhabdo_glycop: Rhabdo 75.4 0.91 2E-05 39.0 0.0 22 11-32 454-476 (501)
48 PF14914 LRRC37AB_C: LRRC37A/B 74.4 3.2 6.9E-05 31.3 2.7 23 10-32 119-141 (154)
49 PF12606 RELT: Tumour necrosis 74.3 4.1 8.8E-05 25.3 2.7 35 11-46 3-37 (50)
50 KOG3637|consensus 73.4 2.6 5.6E-05 39.5 2.4 29 9-37 977-1010(1030)
51 PHA02662 ORF131 putative membr 72.7 2.2 4.7E-05 33.9 1.5 31 10-41 186-216 (226)
52 PF11694 DUF3290: Protein of u 72.5 4.7 0.0001 29.7 3.2 33 9-41 15-47 (149)
53 PHA03240 envelope glycoprotein 72.3 4.4 9.6E-05 32.6 3.2 15 6-20 210-224 (258)
54 PRK02251 putative septation in 72.2 5 0.00011 27.5 3.0 33 7-42 29-61 (87)
55 PTZ00046 rifin; Provisional 72.1 2.8 6.1E-05 35.2 2.2 28 9-36 314-341 (358)
56 PF11807 DUF3328: Domain of un 71.9 5.5 0.00012 28.3 3.3 25 4-28 3-27 (217)
57 PF14575 EphA2_TM: Ephrin type 71.8 3.3 7.2E-05 27.1 2.0 24 14-37 9-32 (75)
58 PHA02955 hypothetical protein; 71.7 2 4.4E-05 33.8 1.2 28 10-40 180-207 (213)
59 COG4890 Predicted outer membra 71.5 6.2 0.00013 23.2 2.9 21 9-29 4-24 (37)
60 PRK13275 mtrF tetrahydromethan 71.3 6.8 0.00015 25.8 3.4 25 6-30 40-64 (67)
61 PF02480 Herpes_gE: Alphaherpe 71.0 1.3 2.9E-05 37.6 0.0 32 10-41 353-386 (439)
62 PRK14749 hypothetical protein; 70.8 7.1 0.00015 22.1 2.9 19 9-27 4-22 (30)
63 PF06103 DUF948: Bacterial pro 69.7 3.6 7.8E-05 26.9 1.9 21 14-34 3-23 (90)
64 PRK11901 hypothetical protein; 69.5 4.1 8.9E-05 34.0 2.5 16 15-30 38-53 (327)
65 TIGR01477 RIFIN variant surfac 69.3 3.6 7.7E-05 34.6 2.2 27 10-36 310-336 (353)
66 PRK11677 hypothetical protein; 69.2 6.2 0.00013 28.8 3.2 25 11-35 3-27 (134)
67 TIGR01478 STEVOR variant surfa 69.1 4 8.7E-05 33.6 2.4 13 24-36 275-287 (295)
68 PF00895 ATP-synt_8: ATP synth 69.1 12 0.00026 21.8 3.9 30 1-32 1-30 (54)
69 PF12575 DUF3753: Protein of u 68.5 8 0.00017 25.8 3.3 18 10-27 47-64 (72)
70 PF11395 DUF2873: Protein of u 68.5 9.2 0.0002 23.0 3.2 19 14-32 12-31 (43)
71 PTZ00370 STEVOR; Provisional 68.4 4.3 9.3E-05 33.5 2.4 14 24-37 271-284 (296)
72 PF11669 WBP-1: WW domain-bind 67.7 4.9 0.00011 27.8 2.3 11 9-19 20-30 (102)
73 PF05478 Prominin: Prominin; 67.5 6 0.00013 35.7 3.3 12 29-40 114-127 (806)
74 PRK04778 septation ring format 66.9 2.9 6.2E-05 36.1 1.2 23 12-34 2-24 (569)
75 PF02480 Herpes_gE: Alphaherpe 66.3 1.9 4.2E-05 36.6 0.0 30 7-36 353-384 (439)
76 PF13623 SurA_N_2: SurA N-term 65.9 6.2 0.00013 28.7 2.6 22 5-26 3-24 (145)
77 PRK14762 membrane protein; Pro 65.6 9.1 0.0002 21.0 2.6 17 9-25 4-20 (27)
78 PF06143 Baculo_11_kDa: Baculo 64.4 8 0.00017 26.4 2.7 14 12-25 36-49 (84)
79 PF09801 SYS1: Integral membra 64.4 7.7 0.00017 28.3 2.9 31 6-36 55-85 (144)
80 PF02932 Neur_chan_memb: Neuro 64.3 5.6 0.00012 26.6 2.0 33 6-38 55-87 (237)
81 PF05297 Herpes_LMP1: Herpesvi 63.8 2.3 5E-05 35.7 0.0 11 7-17 35-45 (381)
82 PF08113 CoxIIa: Cytochrome c 63.4 12 0.00026 21.7 3.0 19 13-31 6-24 (34)
83 PF06365 CD34_antigen: CD34/Po 62.9 3 6.5E-05 32.4 0.5 37 11-47 101-140 (202)
84 PTZ00382 Variant-specific surf 62.2 7.1 0.00015 26.7 2.2 11 26-36 85-95 (96)
85 PF14914 LRRC37AB_C: LRRC37A/B 62.0 10 0.00022 28.6 3.2 27 11-37 123-150 (154)
86 PF00517 GP41: Retroviral enve 61.9 2.9 6.2E-05 31.9 0.2 18 25-42 173-190 (204)
87 PRK11486 flagellar biosynthesi 61.9 7.5 0.00016 28.1 2.4 31 5-35 11-42 (124)
88 PRK10788 periplasmic folding c 61.8 7.9 0.00017 33.4 2.9 35 7-41 8-42 (623)
89 PHA03286 envelope glycoprotein 61.0 6.1 0.00013 34.6 2.1 18 19-36 402-419 (492)
90 TIGR02866 CoxB cytochrome c ox 60.9 11 0.00024 28.2 3.3 18 19-36 23-40 (201)
91 PRK10927 essential cell divisi 60.9 4.4 9.6E-05 33.7 1.2 56 15-71 35-93 (319)
92 PRK14758 hypothetical protein; 60.7 6.2 0.00013 21.8 1.4 8 23-30 10-17 (27)
93 PF04277 OAD_gamma: Oxaloaceta 60.4 15 0.00032 23.3 3.4 18 10-27 4-22 (79)
94 PF01788 PsbJ: PsbJ; InterPro 60.3 17 0.00037 21.8 3.3 21 8-28 8-28 (40)
95 COG3190 FliO Flagellar biogene 59.9 9.8 0.00021 28.1 2.7 28 7-34 20-47 (137)
96 PF02009 Rifin_STEVOR: Rifin/s 59.8 6.2 0.00014 32.2 1.9 24 13-36 262-285 (299)
97 PF12669 P12: Virus attachment 59.4 5.4 0.00012 25.0 1.1 11 28-38 17-27 (58)
98 PF09472 MtrF: Tetrahydrometha 59.2 15 0.00032 23.9 3.2 23 7-29 41-63 (64)
99 MTH00133 ATP8 ATP synthase F0 57.7 16 0.00035 22.3 3.1 24 1-26 1-24 (55)
100 PF15176 LRR19-TM: Leucine-ric 57.7 11 0.00023 26.8 2.5 15 4-18 10-24 (102)
101 PHA03283 envelope glycoprotein 56.4 12 0.00026 33.2 3.1 25 8-32 396-422 (542)
102 PF05434 Tmemb_9: TMEM9; Inte 56.1 6.8 0.00015 29.4 1.4 30 14-43 60-93 (149)
103 PF10779 XhlA: Haemolysin XhlA 55.7 20 0.00043 22.8 3.3 21 7-27 48-68 (71)
104 PF12575 DUF3753: Protein of u 55.5 20 0.00043 23.9 3.4 23 5-27 45-67 (72)
105 PF02038 ATP1G1_PLM_MAT8: ATP1 54.9 12 0.00026 23.4 2.1 16 13-28 17-32 (50)
106 PF07213 DAP10: DAP10 membrane 54.7 22 0.00048 24.1 3.6 27 8-34 32-59 (79)
107 PF07423 DUF1510: Protein of u 54.1 16 0.00034 28.7 3.1 23 10-32 14-36 (217)
108 PF06936 Selenoprotein_S: Sele 53.9 4.3 9.3E-05 31.2 0.0 21 6-26 33-53 (190)
109 PRK00159 putative septation in 53.7 17 0.00037 25.0 2.9 32 7-41 28-59 (87)
110 COG3671 Predicted membrane pro 53.2 16 0.00035 26.7 2.9 37 9-45 75-115 (125)
111 KOG4331|consensus 53.1 12 0.00026 34.8 2.7 16 30-45 130-145 (865)
112 TIGR01433 CyoA cytochrome o ub 53.0 17 0.00037 28.2 3.2 9 25-33 48-56 (226)
113 COG3296 Uncharacterized protei 53.0 15 0.00032 27.4 2.7 30 12-41 103-134 (143)
114 PRK14750 kdpF potassium-transp 52.8 23 0.00051 19.8 2.9 11 13-23 3-13 (29)
115 PTZ00208 65 kDa invariant surf 52.4 16 0.00034 31.7 3.1 15 28-42 406-420 (436)
116 PF15176 LRR19-TM: Leucine-ric 52.3 13 0.00029 26.3 2.3 29 12-40 21-50 (102)
117 PF13584 BatD: Oxygen toleranc 51.8 17 0.00036 30.1 3.2 23 10-33 426-448 (484)
118 PF15234 LAT: Linker for activ 51.7 57 0.0012 25.9 5.9 16 28-43 25-40 (230)
119 PHA03097 C-type lectin-like pr 51.5 18 0.00039 26.3 3.0 18 12-29 13-30 (157)
120 TIGR03063 srtB_target sortase 51.5 26 0.00056 19.5 2.9 17 2-18 2-19 (29)
121 PF06781 UPF0233: Uncharacteri 51.5 19 0.00041 24.6 2.9 30 6-35 27-56 (87)
122 PF04906 Tweety: Tweety; Inte 51.2 16 0.00035 30.5 3.0 7 30-36 45-51 (406)
123 PRK10525 cytochrome o ubiquino 51.0 17 0.00037 29.9 3.1 10 9-18 40-49 (315)
124 PF06396 AGTRAP: Angiotensin I 50.9 71 0.0015 24.1 6.1 37 7-43 85-122 (162)
125 PF13273 DUF4064: Protein of u 50.7 25 0.00054 23.2 3.4 28 7-35 55-82 (100)
126 PF14851 FAM176: FAM176 family 50.3 47 0.001 24.8 5.1 24 10-33 24-47 (153)
127 KOG4482|consensus 50.3 17 0.00036 31.5 3.0 27 11-39 301-327 (449)
128 PF14316 DUF4381: Domain of un 50.2 18 0.00039 25.7 2.7 11 8-18 20-30 (146)
129 PF07466 DUF1517: Protein of u 50.1 10 0.00022 30.6 1.6 30 8-37 60-89 (289)
130 PF00558 Vpu: Vpu protein; In 50.0 16 0.00035 24.8 2.3 17 13-29 6-22 (81)
131 PF06341 DUF1056: Protein of u 49.8 24 0.00051 23.0 3.0 26 13-38 37-62 (63)
132 PF13349 DUF4097: Domain of un 49.8 18 0.00039 25.0 2.6 18 11-28 4-21 (166)
133 PF15431 TMEM190: Transmembran 49.7 17 0.00036 26.7 2.5 29 7-35 58-87 (134)
134 PF14610 DUF4448: Protein of u 49.6 9 0.0002 28.4 1.2 16 19-34 168-183 (189)
135 COG1622 CyoA Heme/copper-type 49.6 21 0.00046 28.3 3.3 33 8-40 29-66 (247)
136 PF00957 Synaptobrevin: Synapt 49.6 27 0.00058 22.6 3.3 21 12-32 67-87 (89)
137 PRK12757 cell division protein 49.4 11 0.00024 30.3 1.7 22 18-39 3-26 (256)
138 PF06900 DUF1270: Protein of u 48.8 31 0.00067 21.8 3.3 21 9-29 6-26 (53)
139 PF03845 Spore_permease: Spore 48.7 20 0.00042 27.9 3.0 31 9-39 33-63 (320)
140 PF06400 Alpha-2-MRAP_N: Alpha 48.7 5.8 0.00013 28.8 0.0 24 22-45 3-26 (120)
141 MTH00186 ATP8 ATP synthase F0 48.5 38 0.00083 20.2 3.7 31 1-33 1-31 (52)
142 PF03908 Sec20: Sec20; InterP 48.5 17 0.00037 24.0 2.2 18 17-34 73-90 (92)
143 PF00517 GP41: Retroviral enve 48.4 8.4 0.00018 29.4 0.9 25 11-35 155-179 (204)
144 PHA02692 hypothetical protein; 48.3 30 0.00064 23.0 3.3 8 10-17 44-51 (70)
145 PF10661 EssA: WXG100 protein 48.1 17 0.00037 26.6 2.4 11 8-18 116-126 (145)
146 COG2268 Uncharacterized protei 47.3 28 0.00061 30.9 4.0 19 11-29 11-29 (548)
147 PF12670 DUF3792: Protein of u 47.2 25 0.00055 24.2 3.1 28 6-33 62-89 (116)
148 PRK10299 PhoPQ regulatory prot 47.1 25 0.00055 21.7 2.6 21 9-29 4-24 (47)
149 PF03954 Lectin_N: Hepatic lec 46.9 19 0.00041 26.7 2.5 23 11-33 32-54 (138)
150 COG4218 MtrF Tetrahydromethano 46.9 30 0.00064 23.2 3.1 22 6-27 49-70 (73)
151 PF15339 Afaf: Acrosome format 46.7 24 0.00051 27.6 3.0 20 13-32 130-153 (200)
152 PF13908 Shisa: Wnt and FGF in 46.5 14 0.0003 27.0 1.8 8 12-19 77-84 (179)
153 KOG3970|consensus 46.3 25 0.00054 28.7 3.2 34 7-43 250-283 (299)
154 PRK13415 flagella biosynthesis 45.9 28 0.0006 27.6 3.4 28 11-38 67-96 (219)
155 PF13179 DUF4006: Family of un 45.7 26 0.00057 23.0 2.7 15 12-26 13-27 (66)
156 PF02397 Bac_transf: Bacterial 45.7 22 0.00049 26.9 2.8 24 11-34 6-29 (187)
157 PLN00168 Cytochrome P450; Prov 45.6 24 0.00052 29.3 3.2 26 14-39 6-31 (519)
158 KOG2927|consensus 45.6 23 0.00051 30.1 3.1 31 9-39 225-256 (372)
159 PHA03273 envelope glycoprotein 45.3 14 0.0003 32.4 1.8 24 12-35 450-474 (486)
160 PRK10905 cell division protein 45.3 11 0.00024 31.5 1.1 12 18-29 3-14 (328)
161 PHA03240 envelope glycoprotein 45.0 22 0.00049 28.6 2.8 16 16-31 216-231 (258)
162 CHL00177 ccs1 c-type cytochrom 44.9 18 0.00038 30.8 2.3 26 6-35 70-95 (426)
163 PF03302 VSP: Giardia variant- 44.8 14 0.0003 30.8 1.6 12 25-36 385-396 (397)
164 PF11359 gpUL132: Glycoprotein 44.6 74 0.0016 25.5 5.6 13 34-46 80-92 (235)
165 PF00974 Rhabdo_glycop: Rhabdo 44.2 7.5 0.00016 33.5 0.0 13 20-32 467-479 (501)
166 KOG1311|consensus 44.2 25 0.00054 27.5 3.0 28 5-32 152-179 (299)
167 PF05083 LST1: LST-1 protein; 43.7 6.8 0.00015 26.3 -0.2 19 19-37 4-22 (74)
168 MTH00102 ATP8 ATP synthase F0 43.7 42 0.0009 21.5 3.4 20 1-22 1-20 (67)
169 PF06842 DUF1242: Protein of u 43.7 10 0.00022 22.1 0.5 11 22-32 2-12 (36)
170 PF06809 NPDC1: Neural prolife 43.5 25 0.00055 29.5 3.0 28 12-39 202-229 (341)
171 PF04971 Lysis_S: Lysis protei 43.4 26 0.00056 23.1 2.5 23 12-34 35-57 (68)
172 TIGR03521 GldG gliding-associa 43.2 23 0.00049 30.6 2.8 30 7-36 520-550 (552)
173 KOG3514|consensus 42.5 24 0.00052 34.4 3.0 41 4-44 1510-1551(1591)
174 PHA02844 putative transmembran 42.5 37 0.00081 22.8 3.1 25 6-30 43-67 (75)
175 MTH00147 ATP8 ATP synthase F0 42.4 58 0.0013 19.8 3.8 32 1-34 1-32 (51)
176 COG3115 ZipA Cell division pro 42.1 36 0.00079 28.4 3.7 23 18-41 13-35 (324)
177 PHA02819 hypothetical protein; 41.7 36 0.00079 22.7 3.0 22 4-25 42-63 (71)
178 TIGR02588 conserved hypothetic 41.3 36 0.00078 24.7 3.1 33 9-41 3-37 (122)
179 PHA02692 hypothetical protein; 41.3 34 0.00073 22.7 2.8 15 4-18 41-56 (70)
180 PF14798 Ca_hom_mod: Calcium h 41.0 34 0.00075 27.2 3.3 27 6-32 178-204 (251)
181 PF03918 CcmH: Cytochrome C bi 40.8 9.5 0.00021 28.0 0.1 36 4-39 95-130 (148)
182 PRK03427 cell division protein 40.7 42 0.00091 28.1 3.8 30 11-40 7-36 (333)
183 TIGR01195 oadG_fam sodium pump 40.6 50 0.0011 21.9 3.6 12 19-30 17-28 (82)
184 PF02124 Marek_A: Marek's dise 40.1 26 0.00056 27.5 2.4 18 12-29 188-205 (211)
185 PF07782 DC_STAMP: DC-STAMP-li 39.8 39 0.00084 25.1 3.2 25 2-26 139-163 (191)
186 MTH00072 ATP8 ATP synthase F0 39.6 51 0.0011 19.5 3.2 24 1-26 1-24 (54)
187 TIGR03503 conserved hypothetic 39.5 14 0.0003 31.3 0.9 28 7-34 344-371 (374)
188 PF00429 TLV_coat: ENV polypro 38.8 10 0.00022 33.2 0.0 10 7-16 523-532 (561)
189 PF11346 DUF3149: Protein of u 38.6 33 0.00072 20.5 2.2 17 13-29 11-27 (42)
190 PHA03269 envelope glycoprotein 38.5 24 0.00052 31.4 2.2 18 12-29 541-558 (566)
191 PF14927 Neurensin: Neurensin 38.5 49 0.0011 24.4 3.5 24 6-29 39-62 (140)
192 PF14884 EFF-AFF: Type I membr 38.5 25 0.00055 31.5 2.3 24 10-33 557-582 (589)
193 PRK06569 F0F1 ATP synthase sub 38.1 49 0.0011 24.7 3.5 31 1-32 1-31 (155)
194 PF04156 IncA: IncA protein; 38.1 40 0.00086 24.4 3.0 26 15-40 44-69 (191)
195 KOG1226|consensus 38.1 32 0.00069 31.9 3.0 37 12-48 717-755 (783)
196 PF10164 DUF2367: Uncharacteri 37.9 40 0.00086 23.7 2.8 22 13-35 67-88 (98)
197 PHA03054 IMV membrane protein; 37.9 46 0.001 22.2 3.0 20 5-24 45-64 (72)
198 PHA02650 hypothetical protein; 37.5 40 0.00086 23.0 2.7 24 6-29 44-67 (81)
199 PRK13453 F0F1 ATP synthase sub 37.4 40 0.00087 24.6 3.0 22 11-32 18-39 (173)
200 PF03597 CcoS: Cytochrome oxid 37.4 32 0.0007 20.7 2.1 28 13-40 3-30 (45)
201 PF03733 DUF307: Domain of unk 37.3 56 0.0012 20.1 3.2 20 9-28 11-30 (53)
202 PF09435 DUF2015: Fungal prote 37.1 18 0.00039 26.5 1.1 26 15-40 4-29 (128)
203 PF04226 Transgly_assoc: Trans 37.0 65 0.0014 19.2 3.4 19 7-25 25-43 (48)
204 KOG3054|consensus 37.0 45 0.00098 27.4 3.4 35 12-46 4-38 (299)
205 COG1314 SecG Preprotein transl 37.0 57 0.0012 22.1 3.4 13 26-38 16-28 (86)
206 PF14584 DUF4446: Protein of u 36.9 30 0.00066 25.5 2.2 15 14-28 8-22 (151)
207 PF13106 DUF3961: Domain of un 36.7 16 0.00035 21.9 0.6 23 5-27 12-34 (40)
208 KOG3653|consensus 36.4 32 0.00069 30.6 2.6 31 10-40 154-185 (534)
209 PRK04989 psbM photosystem II r 36.1 77 0.0017 18.5 3.4 19 12-30 8-26 (35)
210 PF09930 DUF2162: Predicted tr 36.1 32 0.0007 27.0 2.4 36 6-42 126-161 (224)
211 PF13624 SurA_N_3: SurA N-term 35.9 12 0.00026 25.8 0.0 21 6-26 5-25 (154)
212 PF12259 DUF3609: Protein of u 35.7 19 0.00042 30.0 1.2 22 13-34 301-322 (361)
213 MTH00171 ATP8 ATP synthase F0 35.6 74 0.0016 18.8 3.5 23 1-25 1-23 (54)
214 PRK10144 formate-dependent nit 35.6 43 0.00093 24.3 2.8 19 5-23 96-114 (126)
215 PRK14094 psbM photosystem II r 35.4 60 0.0013 20.3 3.0 27 12-39 8-34 (50)
216 PF09685 Tic20: Tic20-like pro 35.0 46 0.00099 21.6 2.7 16 26-41 86-101 (109)
217 PHA03281 envelope glycoprotein 34.9 26 0.00057 31.6 1.9 26 11-36 558-583 (642)
218 COG3889 Predicted solute bindi 34.9 23 0.0005 33.1 1.6 29 9-37 843-872 (872)
219 PF05915 DUF872: Eukaryotic pr 34.8 38 0.00083 23.9 2.4 8 30-37 99-106 (115)
220 smart00805 AGTRAP Angiotensin 34.7 34 0.00074 26.0 2.2 12 33-44 112-123 (159)
221 KOG0196|consensus 34.7 43 0.00094 31.7 3.3 17 8-24 546-562 (996)
222 TIGR01294 P_lamban phospholamb 34.6 52 0.0011 20.5 2.7 22 8-29 28-49 (52)
223 MTH00123 ATP8 ATP synthase F0 34.6 67 0.0014 19.3 3.2 23 1-25 1-23 (54)
224 PF13633 N_methyl_3: Prokaryot 34.6 68 0.0015 16.9 2.8 17 12-28 3-19 (22)
225 PF01529 zf-DHHC: DHHC palmito 34.5 60 0.0013 22.7 3.4 24 5-28 87-110 (174)
226 PF03264 Cytochrom_NNT: NapC/N 34.5 12 0.00026 27.2 -0.2 14 10-23 4-17 (173)
227 PHA02975 hypothetical protein; 34.3 54 0.0012 21.7 2.9 21 7-27 40-60 (69)
228 PF06011 TRP: Transient recept 34.1 55 0.0012 27.1 3.6 35 7-41 385-424 (438)
229 PF10907 DUF2749: Protein of u 34.0 35 0.00076 22.5 1.9 38 10-49 4-42 (66)
230 PF04790 Sarcoglycan_1: Sarcog 33.8 49 0.0011 26.5 3.1 24 10-33 14-37 (264)
231 PF13967 RSN1_TM: Late exocyto 33.7 52 0.0011 23.4 3.0 28 18-45 9-36 (157)
232 KOG4812|consensus 33.7 36 0.00078 27.7 2.3 28 6-33 219-246 (262)
233 TIGR03147 cyt_nit_nrfF cytochr 33.7 51 0.0011 23.9 2.9 30 5-34 96-125 (126)
234 COG4768 Uncharacterized protei 33.6 54 0.0012 24.4 3.1 19 11-29 5-23 (139)
235 PF08078 PsaX: PsaX family; I 33.4 51 0.0011 19.4 2.4 15 8-23 15-29 (37)
236 KOG1485|consensus 33.4 26 0.00056 30.2 1.6 29 7-35 227-255 (412)
237 PRK06870 secG preprotein trans 33.4 75 0.0016 20.4 3.5 14 25-38 15-28 (76)
238 PHA02957 hypothetical protein; 33.1 45 0.00097 25.8 2.7 24 12-36 19-42 (206)
239 PF08507 COPI_assoc: COPI asso 33.0 73 0.0016 22.3 3.6 9 10-18 88-96 (136)
240 PF08374 Protocadherin: Protoc 32.8 20 0.00044 28.5 0.8 17 13-30 45-61 (221)
241 PF08001 CMV_US: CMV US; Inte 32.7 54 0.0012 26.0 3.2 27 7-33 176-202 (231)
242 PF12955 DUF3844: Domain of un 32.7 66 0.0014 22.6 3.3 27 4-30 62-88 (103)
243 PRK06007 fliF flagellar MS-rin 32.7 44 0.00094 29.1 2.9 15 11-25 436-450 (542)
244 KOG4298|consensus 32.7 63 0.0014 25.8 3.5 26 9-34 148-173 (245)
245 PF12158 DUF3592: Protein of u 32.1 36 0.00078 23.0 1.9 14 10-23 1-14 (148)
246 TIGR02227 sigpep_I_bact signal 32.1 40 0.00086 24.3 2.2 15 11-25 1-15 (163)
247 PF15106 TMEM156: TMEM156 prot 32.1 53 0.0012 26.2 3.0 14 7-20 174-187 (226)
248 MTH00260 ATP8 ATP synthase F0 31.9 1.1E+02 0.0024 18.8 3.9 32 1-34 1-32 (53)
249 PRK08476 F0F1 ATP synthase sub 31.9 58 0.0013 23.1 3.0 21 11-31 7-27 (141)
250 PF10853 DUF2650: Protein of u 31.9 40 0.00087 19.8 1.8 14 6-19 22-35 (38)
251 TIGR02507 MtrF tetrahydrometha 31.7 85 0.0018 20.5 3.4 17 6-22 40-56 (65)
252 PF04639 Baculo_E56: Baculovir 31.6 56 0.0012 27.2 3.2 9 9-17 276-284 (305)
253 PF06422 PDR_CDR: CDR ABC tran 31.4 82 0.0018 21.4 3.6 27 19-45 60-87 (103)
254 PF11166 DUF2951: Protein of u 31.4 63 0.0014 22.8 3.0 21 8-28 72-92 (98)
255 COG3763 Uncharacterized protei 31.3 50 0.0011 22.0 2.3 6 51-56 36-41 (71)
256 PF10003 DUF2244: Integral mem 31.3 84 0.0018 22.3 3.7 20 6-25 9-28 (140)
257 MTH00047 COX2 cytochrome c oxi 31.1 69 0.0015 24.3 3.5 9 33-41 35-43 (194)
258 PF10617 DUF2474: Protein of u 31.1 89 0.0019 18.6 3.2 19 6-24 5-23 (40)
259 PF15202 Adipogenin: Adipogeni 30.9 90 0.0019 21.0 3.5 25 7-31 12-36 (81)
260 PF09788 Tmemb_55A: Transmembr 30.9 50 0.0011 26.8 2.7 37 6-42 193-232 (256)
261 PF08496 Peptidase_S49_N: Pept 30.8 62 0.0013 24.0 3.0 18 20-37 20-37 (155)
262 KOG2678|consensus 30.6 63 0.0014 26.1 3.2 19 9-27 215-233 (244)
263 PHA02657 hypothetical protein; 30.5 58 0.0013 22.7 2.7 19 9-27 30-48 (95)
264 PF13172 PepSY_TM_1: PepSY-ass 30.4 75 0.0016 17.4 2.7 14 12-25 11-24 (34)
265 KOG1024|consensus 30.4 22 0.00048 31.4 0.7 30 6-35 182-212 (563)
266 TIGR00892 2A0113 monocarboxyla 30.2 40 0.00086 27.3 2.1 26 11-36 404-429 (455)
267 TIGR02205 septum_zipA cell div 30.2 43 0.00094 27.2 2.3 10 28-37 21-30 (284)
268 COG5416 Uncharacterized integr 30.1 48 0.001 23.3 2.2 7 10-16 23-29 (98)
269 PF05140 ResB: ResB-like famil 30.1 49 0.0011 27.7 2.7 25 6-34 50-74 (464)
270 PF04202 Mfp-3: Foot protein 3 30.0 65 0.0014 21.4 2.7 17 24-40 12-28 (71)
271 PF07856 Orai-1: Mediator of C 30.0 79 0.0017 23.8 3.6 26 9-34 108-133 (175)
272 PF09889 DUF2116: Uncharacteri 29.8 78 0.0017 20.1 3.0 9 9-17 38-46 (59)
273 PF07787 DUF1625: Protein of u 29.8 80 0.0017 24.3 3.7 21 12-32 225-245 (248)
274 PF10821 DUF2567: Protein of u 29.6 59 0.0013 24.7 2.8 30 6-37 45-74 (167)
275 PRK09174 F0F1 ATP synthase sub 29.6 58 0.0013 24.8 2.8 26 2-28 45-70 (204)
276 PF05808 Podoplanin: Podoplani 29.6 18 0.00038 27.5 0.0 29 10-41 133-161 (162)
277 PF15192 TMEM213: TMEM213 fami 29.6 84 0.0018 21.4 3.3 25 5-31 41-66 (82)
278 PF06160 EzrA: Septation ring 29.6 15 0.00033 31.8 -0.4 11 24-34 10-20 (560)
279 PF12301 CD99L2: CD99 antigen 29.6 66 0.0014 24.3 3.1 23 6-28 111-133 (169)
280 PF08475 Baculo_VP91_N: Viral 29.4 39 0.00085 26.0 1.9 6 17-22 4-9 (183)
281 PF01098 FTSW_RODA_SPOVE: Cell 29.3 54 0.0012 26.3 2.7 30 7-36 263-292 (358)
282 PF04272 Phospholamban: Phosph 29.1 73 0.0016 19.8 2.7 22 8-29 28-49 (52)
283 PF10183 ESSS: ESSS subunit of 29.1 72 0.0016 21.9 3.0 18 9-26 61-78 (105)
284 COG4499 Predicted membrane pro 29.0 58 0.0013 28.2 3.0 31 2-32 213-244 (434)
285 PF13124 DUF3963: Protein of u 29.0 1.2E+02 0.0025 18.1 3.4 23 8-30 16-38 (40)
286 PF09988 DUF2227: Uncharacteri 28.9 51 0.0011 24.8 2.4 29 1-29 73-101 (169)
287 PRK00269 zipA cell division pr 28.9 32 0.0007 28.3 1.4 18 28-45 21-38 (293)
288 PRK14471 F0F1 ATP synthase sub 28.6 72 0.0016 22.8 3.0 22 11-32 8-29 (164)
289 TIGR03068 srtB_sig_NPQTN sorta 28.3 97 0.0021 17.8 2.9 18 2-19 2-19 (33)
290 PF08135 EPV_E5: Major transfo 28.2 98 0.0021 18.8 3.1 13 13-25 10-22 (44)
291 PF07760 DUF1616: Protein of u 28.2 47 0.001 26.2 2.2 24 13-36 87-110 (287)
292 PF07219 HemY_N: HemY protein 28.2 87 0.0019 21.1 3.2 22 7-28 13-34 (108)
293 TIGR00847 ccoS cytochrome oxid 28.1 63 0.0014 20.0 2.3 27 14-40 5-31 (51)
294 PRK14474 F0F1 ATP synthase sub 28.1 53 0.0011 25.7 2.4 22 11-32 5-26 (250)
295 MTH00152 ND6 NADH dehydrogenas 27.9 85 0.0019 22.9 3.4 29 9-37 132-160 (163)
296 PRK13461 F0F1 ATP synthase sub 27.9 76 0.0016 22.6 3.1 22 11-32 5-26 (159)
297 PF01456 Mucin: Mucin-like gly 27.8 43 0.00094 23.3 1.7 10 15-24 6-15 (143)
298 KOG4433|consensus 27.6 70 0.0015 28.4 3.3 6 30-35 69-74 (526)
299 PF09777 OSTMP1: Osteopetrosis 27.5 90 0.0019 24.4 3.6 24 9-32 189-212 (237)
300 COG3105 Uncharacterized protei 27.5 97 0.0021 23.0 3.6 28 7-34 4-31 (138)
301 MTH00045 ND6 NADH dehydrogenas 27.4 80 0.0017 23.2 3.1 27 9-35 128-154 (162)
302 PRK03814 oxaloacetate decarbox 27.4 1.1E+02 0.0024 20.6 3.6 12 19-30 21-32 (85)
303 PHA03289 envelope glycoprotein 27.2 61 0.0013 27.4 2.7 16 15-30 277-292 (352)
304 PRK11901 hypothetical protein; 27.0 79 0.0017 26.5 3.3 30 3-33 30-59 (327)
305 KOG3249|consensus 27.0 94 0.002 24.0 3.5 10 33-42 144-155 (181)
306 PF11857 DUF3377: Domain of un 27.0 61 0.0013 21.7 2.2 6 33-38 54-59 (74)
307 PF10327 7TM_GPCR_Sri: Serpent 26.9 37 0.0008 26.9 1.4 22 15-36 96-117 (303)
308 PHA00476 hypothetical protein 26.9 2.4E+02 0.0052 20.2 5.4 43 10-52 38-86 (110)
309 PF14030 DUF4245: Protein of u 26.8 73 0.0016 23.5 2.8 16 17-32 12-27 (169)
310 PF07672 MFS_Mycoplasma: Mycop 26.6 82 0.0018 25.6 3.3 31 7-37 169-199 (267)
311 TIGR00810 secG protein translo 26.5 94 0.002 19.8 3.0 14 25-38 14-27 (73)
312 PLN03150 hypothetical protein; 26.5 39 0.00085 29.5 1.5 10 26-35 562-571 (623)
313 PF11696 DUF3292: Protein of u 26.5 70 0.0015 29.1 3.1 24 6-29 133-156 (642)
314 PF11466 Doppel: Prion-like pr 26.4 62 0.0013 18.3 1.8 12 10-21 7-18 (30)
315 PF02430 AMA-1: Apical membran 26.4 22 0.00047 31.1 0.0 25 10-34 429-453 (471)
316 PF06678 DUF1179: Protein of u 26.4 93 0.002 22.1 3.2 10 10-19 4-13 (103)
317 COG3736 VirB8 Type IV secretor 26.3 1E+02 0.0022 24.7 3.7 24 9-32 42-65 (239)
318 PF01561 Hanta_G2: Hantavirus 26.3 34 0.00074 30.0 1.1 16 19-34 465-480 (485)
319 TIGR01478 STEVOR variant surfa 26.3 51 0.0011 27.3 2.1 12 17-28 261-272 (295)
320 PF01601 Corona_S2: Coronaviru 26.3 22 0.00048 32.0 0.0 15 11-25 548-562 (610)
321 PF14316 DUF4381: Domain of un 26.2 53 0.0011 23.3 2.0 28 13-40 21-50 (146)
322 PF05454 DAG1: Dystroglycan (D 26.2 22 0.00048 29.0 0.0 7 30-36 165-171 (290)
323 CHL00080 psbM photosystem II p 26.2 1.4E+02 0.0031 17.3 3.4 19 12-30 8-26 (34)
324 PRK14473 F0F1 ATP synthase sub 26.2 81 0.0018 22.6 2.9 20 11-30 8-27 (164)
325 TIGR01655 yxeA_fam conserved h 26.1 73 0.0016 22.1 2.6 8 39-46 32-39 (114)
326 PRK00888 ftsB cell division pr 26.1 82 0.0018 21.7 2.8 12 11-22 3-14 (105)
327 PF10112 Halogen_Hydrol: 5-bro 26.0 86 0.0019 23.1 3.1 9 10-18 10-18 (199)
328 PF01167 Tub: Tub family; Int 25.9 21 0.00045 28.0 -0.2 12 27-38 7-18 (246)
329 PRK10905 cell division protein 25.8 50 0.0011 27.7 2.0 17 17-33 5-21 (328)
330 MTH00166 ND6 NADH dehydrogenas 25.7 95 0.0021 22.4 3.3 25 13-37 130-154 (160)
331 PLN00028 nitrate transmembrane 25.6 93 0.002 25.4 3.5 24 10-33 410-433 (476)
332 PTZ00370 STEVOR; Provisional 25.6 55 0.0012 27.1 2.2 18 8-25 248-265 (296)
333 TIGR03644 marine_trans_1 proba 25.5 94 0.002 26.3 3.6 38 14-52 355-392 (404)
334 TIGR01432 QOXA cytochrome aa3 25.4 55 0.0012 24.9 2.0 16 17-32 31-46 (217)
335 PF09911 DUF2140: Uncharacteri 25.4 65 0.0014 24.3 2.4 19 10-28 2-20 (187)
336 PF05399 EVI2A: Ectropic viral 25.3 68 0.0015 25.6 2.6 8 32-39 161-169 (227)
337 PF13038 DUF3899: Domain of un 25.3 1.1E+02 0.0023 19.9 3.2 25 7-31 67-91 (92)
338 PF00003 7tm_3: 7 transmembran 25.3 88 0.0019 22.9 3.1 34 7-40 146-179 (238)
339 TIGR00889 2A0110 nucleoside tr 25.1 79 0.0017 25.3 3.0 9 40-48 409-417 (418)
340 PRK07718 fliL flagellar basal 25.1 1.2E+02 0.0025 21.7 3.6 26 11-36 6-31 (142)
341 PF14715 FixP_N: N-terminal do 25.0 1.2E+02 0.0026 18.6 3.1 20 9-28 22-41 (51)
342 TIGR03778 VPDSG_CTERM VPDSG-CT 24.9 64 0.0014 17.6 1.7 16 20-35 9-24 (26)
343 PF11862 DUF3382: Domain of un 24.8 1.1E+02 0.0024 20.6 3.3 36 10-45 4-39 (101)
344 PF15048 OSTbeta: Organic solu 24.7 76 0.0016 23.2 2.5 28 19-46 45-72 (125)
345 PF02790 COX2_TM: Cytochrome C 24.7 1E+02 0.0022 19.2 2.9 15 19-33 34-48 (84)
346 TIGR02794 tolA_full TolA prote 24.6 74 0.0016 26.3 2.8 13 17-29 8-20 (346)
347 PF06697 DUF1191: Protein of u 24.6 15 0.00033 29.9 -1.2 23 14-36 218-241 (278)
348 PF09583 Phageshock_PspG: Phag 24.5 1.3E+02 0.0028 19.7 3.3 28 7-34 31-63 (65)
349 PF01998 DUF131: Protein of un 24.5 1E+02 0.0023 19.8 2.9 16 12-27 46-61 (64)
350 TIGR03141 cytochro_ccmD heme e 24.3 42 0.0009 19.8 0.9 21 17-37 12-32 (45)
351 PF15339 Afaf: Acrosome format 24.1 76 0.0016 24.8 2.6 21 13-33 136-157 (200)
352 KOG3832|consensus 24.1 84 0.0018 25.7 2.9 13 4-16 35-47 (319)
353 PF13858 DUF4199: Protein of u 24.0 98 0.0021 21.6 3.0 28 9-36 135-163 (163)
354 PF05510 Sarcoglycan_2: Sarcog 24.0 65 0.0014 27.4 2.4 17 15-31 293-309 (386)
355 PF05297 Herpes_LMP1: Herpesvi 23.9 26 0.00056 29.6 0.0 25 8-32 160-184 (381)
356 PF13400 Tad: Putative Flp pil 23.9 44 0.00094 19.4 1.0 15 16-30 6-20 (48)
357 PRK01844 hypothetical protein; 23.8 82 0.0018 21.0 2.4 24 10-34 3-26 (72)
358 PRK05696 fliL flagellar basal 23.7 58 0.0013 23.8 1.8 8 11-18 21-28 (170)
359 PRK13872 conjugal transfer pro 23.6 1.2E+02 0.0025 23.1 3.6 9 30-38 71-79 (228)
360 COG4477 EzrA Negative regulato 23.6 32 0.0007 30.7 0.5 24 13-36 2-25 (570)
361 PF14257 DUF4349: Domain of un 23.6 1.1E+02 0.0023 23.6 3.3 9 10-18 234-242 (262)
362 PF02158 Neuregulin: Neureguli 23.6 27 0.00058 30.0 0.0 16 17-32 13-28 (404)
363 KOG3359|consensus 23.2 88 0.0019 28.8 3.2 39 9-49 576-614 (723)
364 KOG1094|consensus 23.2 37 0.0008 31.4 0.8 26 8-33 389-414 (807)
365 cd01811 OASL_repeat1 2'-5' oli 23.1 44 0.00094 22.8 1.0 13 84-96 53-65 (80)
366 PF10873 DUF2668: Protein of u 23.0 84 0.0018 23.8 2.6 10 32-41 88-97 (155)
367 PF04906 Tweety: Tweety; Inte 22.9 1E+02 0.0022 25.9 3.3 12 29-40 205-216 (406)
368 PRK03001 M48 family peptidase; 22.9 63 0.0014 25.3 2.0 35 10-45 29-64 (283)
369 TIGR03142 cytochro_ccmI cytoch 22.6 1E+02 0.0023 21.1 2.8 7 10-16 2-8 (117)
370 PF05568 ASFV_J13L: African sw 22.5 93 0.002 23.9 2.7 26 7-32 22-47 (189)
371 PRK00523 hypothetical protein; 22.5 82 0.0018 21.0 2.2 13 22-34 15-27 (72)
372 PF04507 DUF576: Protein of un 22.2 29 0.00064 28.1 0.0 14 19-32 11-24 (257)
373 PHA02681 ORF089 virion membran 22.2 83 0.0018 21.8 2.2 6 30-35 22-27 (92)
374 PF12751 Vac7: Vacuolar segreg 22.1 91 0.002 26.7 2.9 22 11-32 302-323 (387)
375 PF13074 DUF3938: Protein of u 22.1 88 0.0019 21.6 2.3 14 10-23 75-88 (102)
376 PRK13836 conjugal transfer pro 22.0 1.3E+02 0.0028 22.8 3.5 15 8-22 29-43 (220)
377 PF13937 DUF4212: Domain of un 21.8 43 0.00094 22.4 0.7 6 37-42 74-79 (81)
378 PRK10856 cytoskeletal protein 21.8 95 0.0021 25.4 2.9 26 8-33 109-134 (331)
379 TIGR00383 corA magnesium Mg(2+ 21.7 70 0.0015 24.8 2.0 13 11-23 290-302 (318)
380 PF07695 7TMR-DISM_7TM: 7TM di 21.6 1.1E+02 0.0025 21.1 2.9 11 30-40 144-154 (205)
381 PTZ00046 rifin; Provisional 21.6 88 0.0019 26.5 2.7 23 14-36 322-344 (358)
382 PF03988 DUF347: Repeat of Unk 21.5 1.5E+02 0.0031 18.1 3.1 17 19-35 37-53 (55)
383 PF10731 Anophelin: Thrombin i 21.5 1.8E+02 0.0039 19.0 3.6 10 48-57 26-35 (65)
384 PF15159 PIG-Y: Phosphatidylin 21.4 1.7E+02 0.0037 19.2 3.5 21 9-29 2-22 (72)
385 PF15145 DUF4577: Domain of un 21.3 1.5E+02 0.0032 21.7 3.4 35 10-45 62-98 (128)
386 PF04531 Phage_holin_1: Bacter 21.2 1.5E+02 0.0032 19.6 3.3 23 6-28 8-30 (84)
387 COG4698 Uncharacterized protei 20.9 1.5E+02 0.0032 23.3 3.6 38 10-47 10-48 (197)
388 PF13131 DUF3951: Protein of u 20.9 1.5E+02 0.0033 18.7 3.0 11 17-27 11-21 (53)
389 COG3462 Predicted membrane pro 20.9 1.2E+02 0.0026 21.9 2.9 18 11-28 50-67 (117)
390 TIGR03746 conj_TIGR03746 integ 20.9 1.3E+02 0.0028 23.6 3.3 23 6-28 12-34 (202)
391 TIGR01477 RIFIN variant surfac 20.8 92 0.002 26.3 2.6 24 12-36 316-339 (353)
392 PF07009 DUF1312: Protein of u 20.8 31 0.00068 23.6 -0.1 9 33-41 21-29 (113)
393 COG5522 Predicted integral mem 20.8 60 0.0013 26.0 1.5 33 14-46 155-187 (236)
394 PF14029 DUF4244: Protein of u 20.8 2E+02 0.0044 18.1 3.6 26 3-28 12-37 (56)
395 PF10601 zf-LITAF-like: LITAF- 20.7 1.6E+02 0.0034 18.4 3.2 8 9-16 28-35 (73)
396 PHA02911 C-type lectin-like pr 20.6 1.1E+02 0.0023 24.4 2.8 19 10-28 30-48 (213)
397 PF06667 PspB: Phage shock pro 20.6 1.3E+02 0.0028 19.9 2.9 9 33-41 29-37 (75)
398 TIGR02595 PEP_exosort PEP-CTER 20.6 80 0.0017 16.6 1.5 9 28-36 15-23 (26)
399 PF07790 DUF1628: Protein of u 20.5 1.4E+02 0.003 18.8 2.9 13 13-25 7-19 (80)
400 TIGR02975 phageshock_pspG phag 20.5 1.8E+02 0.004 19.0 3.4 28 7-34 30-62 (64)
401 COG4059 MtrE Tetrahydromethano 20.4 95 0.0021 25.4 2.5 33 8-40 260-294 (304)
402 PF14985 TM140: TM140 protein 20.4 1.3E+02 0.0028 23.2 3.2 26 12-37 147-173 (181)
403 PF04246 RseC_MucC: Positive r 20.4 97 0.0021 21.5 2.4 24 15-38 100-123 (135)
404 PRK10884 SH3 domain-containing 20.4 1E+02 0.0022 23.7 2.7 26 7-32 170-195 (206)
405 PHA02851 EEV glycoprotein; Pro 20.4 81 0.0018 24.5 2.1 15 15-29 8-22 (223)
406 PRK11466 hybrid sensory histid 20.3 87 0.0019 27.5 2.5 24 9-32 329-352 (914)
407 PRK06287 cobalt transport prot 20.2 1.3E+02 0.0028 20.9 2.9 9 28-36 97-105 (107)
408 PRK14759 potassium-transportin 20.2 1.8E+02 0.0039 16.3 3.2 19 14-32 6-24 (29)
409 PF11084 DUF2621: Protein of u 20.2 1.4E+02 0.003 22.3 3.2 20 9-28 3-22 (141)
410 COG1704 LemA Uncharacterized c 20.2 1.2E+02 0.0026 23.5 2.9 14 10-23 3-16 (185)
411 PRK13865 type IV secretion sys 20.1 1.5E+02 0.0032 23.2 3.5 12 7-18 37-48 (229)
412 KOG2052|consensus 20.1 73 0.0016 28.2 2.0 20 9-28 131-150 (513)
413 KOG4221|consensus 20.1 37 0.00081 33.2 0.2 13 22-34 1073-1085(1381)
414 TIGR03321 alt_F1F0_F0_B altern 20.1 1E+02 0.0023 23.7 2.7 22 11-32 5-26 (246)
415 PRK09848 glucuronide transport 20.0 1.4E+02 0.003 23.7 3.5 9 39-47 426-434 (448)
No 1
>PF13882 Bravo_FIGEY: Bravo-like intracellular region
Probab=99.92 E-value=4.2e-26 Score=156.29 Aligned_cols=70 Identities=46% Similarity=0.717 Sum_probs=60.6
Q ss_pred eecCCCcccchhhhhhcCCC---CCCCCCCCcccCCCCCCC-CC-CCCCCCCCCCCCCCCcchhhhhCCCCCCCCC
Q psy17635 33 KRNRGGKYAVHEREAAHGRH---DYPEEPGFNEYSQPLDGK-PR-TRSSMSSEKHPVESDTDSMAEYGDGDTGTKN 103 (103)
Q Consensus 33 ~R~rGgKY~VkeKE~~~g~~---~~~dd~~F~EYsd~dek~-~~-~s~~~~~~~~~~~~s~DSl~dYgd~~~~~Fn 103 (103)
||||||||+|++||++|+++ ||+| ++|+||+++++++ ++ +|++++++..+.++|+|||++||+++++|||
T Consensus 1 kR~kGGKY~V~eKE~~~~~~~~~~~kd-~~f~ey~~~e~~~~~~~~s~~s~~~~~~~~~s~DSL~~YGd~~~~~FN 75 (90)
T PF13882_consen 1 KRNKGGKYSVKEKEDAHGDPEDQPMKD-ETFGEYSDPEDPKPLKRGSSQSLDGSRKKSSSDDSLDDYGDGDTGQFN 75 (90)
T ss_pred CCCCCCCCCCccccccCCCCccccccC-CCcccccCcccccccccccccccccCcCcCcCccCHhHhccCCCcccc
Confidence 69999999999999999998 4566 7999999875544 43 4667777888899999999999999999998
No 2
>KOG3513|consensus
Probab=98.91 E-value=3.5e-11 Score=109.23 Aligned_cols=99 Identities=25% Similarity=0.254 Sum_probs=84.8
Q ss_pred chhhhhHHHHHHHHHHHHHHHHHHHhheeecCCCcccchhhhhhcCCC----CCCCCCCCcccCCC-CCCCCC--CCCCC
Q psy17635 5 TVATAGWFIGMLLAIAFLILVLILVCLIKRNRGGKYAVHEREAAHGRH----DYPEEPGFNEYSQP-LDGKPR--TRSSM 77 (103)
Q Consensus 5 ~~at~gWfIg~~~ai~lLllilli~C~i~R~rGgKY~VkeKE~~~g~~----~~~dd~~F~EYsd~-dek~~~--~s~~~ 77 (103)
.++..+|||+.++++.++|+..++.|+.+|+++++|.|+.+|+.|.+. .|+.+.+|.+|++. ++...+ +.+..
T Consensus 922 ~~~p~g~~~~~~~~~~~~l~w~~v~~~~nes~v~gYkV~~~~~~~~~~~~~~t~~~~~~~~~p~~~~~~y~i~v~~~~~g 1001 (1051)
T KOG3513|consen 922 DIAPPGNFIWKFSASILLLLWLLVSAFENESEVGGYKVLYREDLQNDIEIILTMKLDAEFPEPSDLDGKYDIKVRGYSPG 1001 (1051)
T ss_pred ccCCCcceEEeeeeeEEEEEEeeEEEEeecccCcceEEEEeecccCCceeEecccccccccCccccCCcceeEeccccCC
Confidence 457899999999999999999999999999999999999999999887 45677899999875 333333 44556
Q ss_pred CCCCCCCCCCcchhhhhCCCCCCCCC
Q psy17635 78 SSEKHPVESDTDSMAEYGDGDTGTKN 103 (103)
Q Consensus 78 ~~~~~~~~~s~DSl~dYgd~~~~~Fn 103 (103)
+.++++++.+.|||++||..++++|+
T Consensus 1002 gdg~~s~~~~~~s~~~~~~~~~~~~~ 1027 (1051)
T KOG3513|consen 1002 GDGTISEEISIDSLGSYGTMDAGTFS 1027 (1051)
T ss_pred CCCCcCCccccccccCcCCCCCcccc
Confidence 66888889999999999999999986
No 3
>PF12273 RCR: Chitin synthesis regulation, resistance to Congo red; InterPro: IPR020999 RCR proteins are ER membrane proteins that regulate chitin deposition in fungal cell walls. Although chitin, a linear polymer of beta-1,4-linked N-acetylglucosamine, constitutes only 2% of the cell wall it plays a vital role in the overall protection of the cell wall against stress, noxious chemicals and osmotic pressure changes. Congo red is a cell wall-disrupting benzidine-type dye extensively used in many cell wall mutant studies that specifically targets chitin in yeast cells and inhibits growth. RCR proteins render the yeasts resistant to Congo red by diminishing the content of chitin in the cell wall []. RCR proteins are probably regulating chitin synthase III interact directly with ubiquitin ligase Rsp5, and the VPEY motif is necessary for this, via interaction with the WW domains of Rsp5 [].
Probab=96.61 E-value=0.0008 Score=47.53 Aligned_cols=24 Identities=21% Similarity=0.736 Sum_probs=16.9
Q ss_pred HHHHHHHHHHHHHHHHHHHhheee
Q psy17635 11 WFIGMLLAIAFLILVLILVCLIKR 34 (103)
Q Consensus 11 WfIg~~~ai~lLllilli~C~i~R 34 (103)
|.+.+++++++||+|++++|+-+|
T Consensus 2 W~l~~iii~~i~l~~~~~~~~~rR 25 (130)
T PF12273_consen 2 WVLFAIIIVAILLFLFLFYCHNRR 25 (130)
T ss_pred eeeHHHHHHHHHHHHHHHHHHHHH
Confidence 877777777777777777676543
No 4
>PF05454 DAG1: Dystroglycan (Dystrophin-associated glycoprotein 1); InterPro: IPR008465 Dystroglycan is one of the dystrophin-associated glycoproteins, which is encoded by a 5.5 kb transcript in Homo sapiens. The protein product is cleaved into two non-covalently associated subunits, [alpha] (N-terminal) and [beta] (C-terminal). In skeletal muscle the dystroglycan complex works as a transmembrane linkage between the extracellular matrix and the cytoskeleton [alpha]-dystroglycan is extracellular and binds to merosin ([alpha]-2 laminin) in the basement membrane, while [beta]-dystroglycan is a transmembrane protein and binds to dystrophin, which is a large rod-like cytoskeletal protein, absent in Duchenne muscular dystrophy patients. Dystrophin binds to intracellular actin cables. In this way, the dystroglycan complex, which links the extracellular matrix to the intracellular actin cables, is thought to provide structural integrity in muscle tissues. The dystroglycan complex is also known to serve as an agrin receptor in muscle, where it may regulate agrin-induced acetylcholine receptor clustering at the neuromuscular junction. There is also evidence which suggests the function of dystroglycan as a part of the signal transduction pathway because it is shown that Grb2, a mediator of the Ras-related signal pathway, can interact with the cytoplasmic domain of dystroglycan. In general, aberrant expression of dystrophin-associated protein complex underlies the pathogenesis of Duchenne muscular dystrophy, Becker muscular dystrophy and severe childhood autosomal recessive muscular dystrophy. Interestingly, no genetic disease has been described for either [alpha]- or [beta]-dystroglycan. Dystroglycan is widely distributed in non-muscle tissues as well as in muscle tissues. During epithelial morphogenesis of kidney, the dystroglycan complex is shown to act as a receptor for the basement membrane. Dystroglycan expression in Mus musculus brain and neural retina has also been reported. However, the physiological role of dystroglycan in non-muscle tissues has remained unclear [].; PDB: 1EG4_P.
Probab=96.24 E-value=0.0011 Score=53.61 Aligned_cols=35 Identities=23% Similarity=0.413 Sum_probs=0.0
Q ss_pred hhhHHHHH----HHHHHHHHHHHHHHhheeecCCCcccc
Q psy17635 8 TAGWFIGM----LLAIAFLILVLILVCLIKRNRGGKYAV 42 (103)
Q Consensus 8 t~gWfIg~----~~ai~lLllilli~C~i~R~rGgKY~V 42 (103)
...|+.-+ ++|++|||+.++++||.+|+|.||-.+
T Consensus 142 ~d~yL~T~IpaVVI~~iLLIA~iIa~icyrrkR~GK~~~ 180 (290)
T PF05454_consen 142 SDDYLHTFIPAVVIAAILLIAGIIACICYRRKRKGKMSL 180 (290)
T ss_dssp ---------------------------------------
T ss_pred ccchHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcccccc
Confidence 33455443 333344444455567779999999887
No 5
>PF01299 Lamp: Lysosome-associated membrane glycoprotein (Lamp); InterPro: IPR002000 Lysosome-associated membrane glycoproteins (lamp) [] are integral membrane proteins, specific to lysosomes, and whose exact biological function is not yet clear. Structurally, the lamp proteins consist of two internally homologous lysosome-luminal domains separated by a proline-rich hinge region; at the C-terminal extremity there is a transmembrane region (TM) followed by a very short cytoplasmic tail (C). In each of the duplicated domains, there are two conserved disulphide bonds. This structure is schematically represented in the figure below. +-----+ +-----+ +-----+ +-----+ | | | | | | | | xCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxxxCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxx +--------------------------++Hinge++--------------------------++TM++C+ In mammals, there are two closely related types of lamp: lamp-1 and lamp-2, which form major components of the lysosome membrane. In chicken lamp-1 is known as LEP100. Also included in this entry is the macrophage protein CD68 (or macrosialin) [] is a heavily glycosylated integral membrane protein whose structure consists of a mucin-like domain followed by a proline-rich hinge; a single lamp-like domain; a transmembrane region and a short cytoplasmic tail. Similar to CD68, mammalian lamp-3, which is expressed in lymphoid organs, dendritic cells and in lung, contains all the C-terminal regions but lacks the N-terminal lamp-like region []. In a lamp-family protein from nematodes [] only the part C-terminal to the hinge is conserved. ; GO: 0016020 membrane
Probab=95.25 E-value=0.02 Score=45.40 Aligned_cols=27 Identities=33% Similarity=0.491 Sum_probs=20.6
Q ss_pred HHHHHHHHHHHHHHHHHhheeecCCCc
Q psy17635 13 IGMLLAIAFLILVLILVCLIKRNRGGK 39 (103)
Q Consensus 13 Ig~~~ai~lLllilli~C~i~R~rGgK 39 (103)
|+|-+||+.|+||+||+++|.|+|--.
T Consensus 275 IaVG~~La~lvlivLiaYli~Rrr~~~ 301 (306)
T PF01299_consen 275 IAVGAALAGLVLIVLIAYLIGRRRSRA 301 (306)
T ss_pred HHHHHHHHHHHHHHHHhheeEeccccc
Confidence 566677888889999999997666443
No 6
>PF07204 Orthoreo_P10: Orthoreovirus membrane fusion protein p10; InterPro: IPR009854 This family consists of several Orthoreovirus membrane fusion protein p10 sequences. p10 is thought to be a multifunctional protein that plays a key role in virus-host interaction [].
Probab=94.20 E-value=0.027 Score=39.49 Aligned_cols=43 Identities=12% Similarity=0.270 Sum_probs=24.6
Q ss_pred chhhhhHHHHHHHH-HHHHHHH-HHHHhheeecC-CCcccchhhhh
Q psy17635 5 TVATAGWFIGMLLA-IAFLILV-LILVCLIKRNR-GGKYAVHEREA 47 (103)
Q Consensus 5 ~~at~gWfIg~~~a-i~lLlli-lli~C~i~R~r-GgKY~VkeKE~ 47 (103)
++-++.|++....+ ++|+||| +|+.||..|.| -.+=.|-+||.
T Consensus 36 S~~~ayWpyLA~GGG~iLilIii~Lv~CC~~K~K~~~~r~~~~reL 81 (98)
T PF07204_consen 36 SSFVAYWPYLAAGGGLILILIIIALVCCCRAKHKTSAARNTFHREL 81 (98)
T ss_pred ehHHhhhHHhhccchhhhHHHHHHHHHHhhhhhhhHhhhhHHHHHH
Confidence 35567898876655 4444433 44445444444 55666777774
No 7
>PF15102 TMEM154: TMEM154 protein family
Probab=94.15 E-value=0.025 Score=42.07 Aligned_cols=15 Identities=33% Similarity=0.848 Sum_probs=9.1
Q ss_pred HHHHHHHHHHHhhee
Q psy17635 19 IAFLILVLILVCLIK 33 (103)
Q Consensus 19 i~lLllilli~C~i~ 33 (103)
|+|+||+|+++|+|.
T Consensus 66 VLLvlLLl~vV~lv~ 80 (146)
T PF15102_consen 66 VLLVLLLLSVVCLVI 80 (146)
T ss_pred HHHHHHHHHHHHhee
Confidence 455556666677773
No 8
>PF12877 DUF3827: Domain of unknown function (DUF3827); InterPro: IPR024606 The function of the proteins in this entry is not currently known, but one of the human proteins (Q9HCM3 from SWISSPROT) has been implicated in pilocytic astrocytomas [, , ]. In the majority of cases of pilocytic astrocytomas a tandem duplication produces an in-frame fusion of the gene encoding this protein and the BRAF oncogene. The resulting fusion protein has constitutive BRAF kinase activity and is capable of transforming cells.
Probab=94.09 E-value=0.023 Score=50.80 Aligned_cols=32 Identities=22% Similarity=0.581 Sum_probs=19.6
Q ss_pred hhhhhHH-HHHHHHHHHHHHHHHHH--hheeecCC
Q psy17635 6 VATAGWF-IGMLLAIAFLILVLILV--CLIKRNRG 37 (103)
Q Consensus 6 ~at~gWf-Ig~~~ai~lLllilli~--C~i~R~rG 37 (103)
-.+..|+ ||++..|++++||++|+ |||+++|.
T Consensus 265 ~~~NlWII~gVlvPv~vV~~Iiiil~~~LCRk~K~ 299 (684)
T PF12877_consen 265 PPNNLWIIAGVLVPVLVVLLIIIILYWKLCRKNKL 299 (684)
T ss_pred CCCCeEEEehHhHHHHHHHHHHHHHHHHHhccccc
Confidence 3456788 67766666666665554 55555554
No 9
>PF01102 Glycophorin_A: Glycophorin A; InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=93.73 E-value=0.047 Score=39.33 Aligned_cols=28 Identities=25% Similarity=0.487 Sum_probs=15.5
Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHhheeecC
Q psy17635 7 ATAGWFIGMLLAIAFLILVLILVCLIKRNR 36 (103)
Q Consensus 7 at~gWfIg~~~ai~lLllilli~C~i~R~r 36 (103)
+..+=++|+|++|+++ ||||.+||+|.|
T Consensus 65 ~i~~Ii~gv~aGvIg~--Illi~y~irR~~ 92 (122)
T PF01102_consen 65 AIIGIIFGVMAGVIGI--ILLISYCIRRLR 92 (122)
T ss_dssp CHHHHHHHHHHHHHHH--HHHHHHHHHHHS
T ss_pred ceeehhHHHHHHHHHH--HHHHHHHHHHHh
Confidence 3445556777776553 345555555543
No 10
>PF05393 Hum_adeno_E3A: Human adenovirus early E3A glycoprotein; InterPro: IPR008652 This family consists of several early glycoproteins (E3A), from human adenovirus type 2.; GO: 0016021 integral to membrane
Probab=92.98 E-value=0.082 Score=36.81 Aligned_cols=24 Identities=33% Similarity=0.602 Sum_probs=16.9
Q ss_pred HHHHHHHHHHHHHHHHHhheeecC
Q psy17635 13 IGMLLAIAFLILVLILVCLIKRNR 36 (103)
Q Consensus 13 Ig~~~ai~lLllilli~C~i~R~r 36 (103)
+.++|.|-++++|+..+||.+|.|
T Consensus 37 ~lvI~~iFil~VilwfvCC~kRkr 60 (94)
T PF05393_consen 37 FLVICGIFILLVILWFVCCKKRKR 60 (94)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhhh
Confidence 456677767777778888887644
No 11
>PF05568 ASFV_J13L: African swine fever virus J13L protein; InterPro: IPR008385 This family consists of several African swine fever virus (ASFV) j13L proteins [, , ].
Probab=92.90 E-value=0.34 Score=36.96 Aligned_cols=56 Identities=14% Similarity=0.256 Sum_probs=32.3
Q ss_pred hhhhhHHHHHHHHHHHHHHHHHHHhheeecCCCcccchhhhhhcCCCCCCCCCCCccc
Q psy17635 6 VATAGWFIGMLLAIAFLILVLILVCLIKRNRGGKYAVHEREAAHGRHDYPEEPGFNEY 63 (103)
Q Consensus 6 ~at~gWfIg~~~ai~lLllilli~C~i~R~rGgKY~VkeKE~~~g~~~~~dd~~F~EY 63 (103)
|++.-.+|.+-++|..+++++||.+|..|.|-.- +--++|+.|-..|..|. ..+|-
T Consensus 27 fsthm~tILiaIvVliiiiivli~lcssRKkKaa-AAi~eediQfinpyqDq-qw~~v 82 (189)
T PF05568_consen 27 FSTHMYTILIAIVVLIIIIIVLIYLCSSRKKKAA-AAIEEEDIQFINPYQDQ-QWAGV 82 (189)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHH-hhhhhhcccccCcccch-hhccC
Confidence 4444445544444445556677777776655433 45567788877776653 33343
No 12
>PF14979 TMEM52: Transmembrane 52
Probab=92.20 E-value=0.17 Score=38.11 Aligned_cols=22 Identities=36% Similarity=0.673 Sum_probs=16.1
Q ss_pred hhhhHHHHHHHHHHHHHHHHHH
Q psy17635 7 ATAGWFIGMLLAIAFLILVLIL 28 (103)
Q Consensus 7 at~gWfIg~~~ai~lLllilli 28 (103)
-+..|.|++++.+++|+|+.=+
T Consensus 16 W~~LWyIwLill~~~llLLCG~ 37 (154)
T PF14979_consen 16 WSSLWYIWLILLIGFLLLLCGL 37 (154)
T ss_pred eehhhHHHHHHHHHHHHHHHHH
Confidence 3467999999888777776433
No 13
>PHA03164 hypothetical protein; Provisional
Probab=91.83 E-value=0.24 Score=33.90 Aligned_cols=28 Identities=29% Similarity=0.609 Sum_probs=21.7
Q ss_pred HHHHHHHHHHHHHHHHHHhheeecCCCc
Q psy17635 12 FIGMLLAIAFLILVLILVCLIKRNRGGK 39 (103)
Q Consensus 12 fIg~~~ai~lLllilli~C~i~R~rGgK 39 (103)
+|..-.||+.+|+|++++.+...||||+
T Consensus 61 lvLtgLaIamILfiifvlyvFnVnr~~~ 88 (88)
T PHA03164 61 LVLTGLAIAMILFIIFVLYVFNVNRGGT 88 (88)
T ss_pred HHHHHHHHHHHHHHHHHHHheeeccCCC
Confidence 4444556888888888888888888885
No 14
>PF14575 EphA2_TM: Ephrin type-A receptor 2 transmembrane domain; PDB: 3KUL_A 2XVD_A 2VX1_A 2VWV_A 2VX0_A 2VWY_A 2VWZ_A 2VWW_A 2VWU_A 2VWX_A ....
Probab=91.63 E-value=0.26 Score=32.41 Aligned_cols=25 Identities=20% Similarity=0.490 Sum_probs=10.2
Q ss_pred HHHHHHHH-HHHHHHHHHHHhheeec
Q psy17635 11 WFIGMLLA-IAFLILVLILVCLIKRN 35 (103)
Q Consensus 11 WfIg~~~a-i~lLllilli~C~i~R~ 35 (103)
|+|+.+++ ++++|++++++.+++|+
T Consensus 1 ~ii~~~~~g~~~ll~~v~~~~~~~rr 26 (75)
T PF14575_consen 1 LIIASIIVGVLLLLVLVIIVIVCFRR 26 (75)
T ss_dssp -HHHHHHHHHHHHHHHHHHHHCCCTT
T ss_pred CEEehHHHHHHHHHHhheeEEEEEee
Confidence 45554444 44444344333334443
No 15
>PF15330 SIT: SHP2-interacting transmembrane adaptor protein, SIT
Probab=90.98 E-value=0.15 Score=35.89 Aligned_cols=30 Identities=27% Similarity=0.611 Sum_probs=16.6
Q ss_pred HHHHHHHHHHHHH-HHHHHHhhee---ecCCCcc
Q psy17635 11 WFIGMLLAIAFLI-LVLILVCLIK---RNRGGKY 40 (103)
Q Consensus 11 WfIg~~~ai~lLl-lilli~C~i~---R~rGgKY 40 (103)
|.|-.+++++||| +++-|+|+.. |.|++.|
T Consensus 1 w~Ll~il~llLll~l~asl~~wr~~~rq~k~~~~ 34 (107)
T PF15330_consen 1 WLLLGILALLLLLSLAASLLAWRMKQRQKKAGQY 34 (107)
T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHhhhccccCC
Confidence 6666666665544 4455556553 3445555
No 16
>PF12273 RCR: Chitin synthesis regulation, resistance to Congo red; InterPro: IPR020999 RCR proteins are ER membrane proteins that regulate chitin deposition in fungal cell walls. Although chitin, a linear polymer of beta-1,4-linked N-acetylglucosamine, constitutes only 2% of the cell wall it plays a vital role in the overall protection of the cell wall against stress, noxious chemicals and osmotic pressure changes. Congo red is a cell wall-disrupting benzidine-type dye extensively used in many cell wall mutant studies that specifically targets chitin in yeast cells and inhibits growth. RCR proteins render the yeasts resistant to Congo red by diminishing the content of chitin in the cell wall []. RCR proteins are probably regulating chitin synthase III interact directly with ubiquitin ligase Rsp5, and the VPEY motif is necessary for this, via interaction with the WW domains of Rsp5 [].
Probab=89.71 E-value=0.12 Score=36.47 Aligned_cols=20 Identities=20% Similarity=0.188 Sum_probs=14.0
Q ss_pred hHHHHHHHHHHHHHHHHHHH
Q psy17635 10 GWFIGMLLAIAFLILVLILV 29 (103)
Q Consensus 10 gWfIg~~~ai~lLllilli~ 29 (103)
.|+|.|+++|++|++++++-
T Consensus 4 l~~iii~~i~l~~~~~~~~~ 23 (130)
T PF12273_consen 4 LFAIIIVAILLFLFLFYCHN 23 (130)
T ss_pred eHHHHHHHHHHHHHHHHHHH
Confidence 46777777777777777664
No 17
>PF01034 Syndecan: Syndecan domain; InterPro: IPR001050 The syndecans are transmembrane proteoglycans which are involved in the organisation of cytoskeleton and/or actin microfilaments, and have important roles as cell surface receptors during cell-cell and/or cell-matrix interactions [, ]. Structurally, these proteins consist of four separate domains: A signal sequence; An extracellular domain (ectodomain) of variable length whose sequence is not evolutionary conserved in the various forms of syndecans. The ectodomain contains the sites of attachment of the heparan sulphate glycosaminoglycan side chains; A transmembrane region; A highly conserved cytoplasmic domain of about 30 to 35 residues, which could interact with cytoskeletal proteins. The proteins known to belong to this family are: Syndecan 1. Syndecan 2 or fibroglycan. Syndecan 3 or neuroglycan or N-syndecan. Syndecan 4 or amphiglycan or ryudocan. Drosophila syndecan. Caenorhabditis elegans probable syndecan (F57C7.3). Syndecan-4, a transmembrane heparan sulphate proteoglycan, is a coreceptor with integrins in cell adhesion. It has been suggested to form a ternary signalling complex with protein kinase Calpha and phosphatidylinositol 4,5-bisphosphate (PIP2). Structural studies have demonstrated that the cytoplasmic domain undergoes a conformational transition and forms a symmetric dimer in the presence of phospholipid activator PIP2, and whose overall structure in solution exhibits a twisted clamp shape having a cavity in the centre of dimeric interface. In addition, it has been observed that the syndecan-4 variable domain interacts, strongly, not only with fatty acyl groups but also the anionic head group of PIP2. These findings indicate that PIP2 promotes oligomerisation of the syndecan-4 cytoplasmic domain for transmembrane signalling and cell-matrix adhesion [, ].; GO: 0008092 cytoskeletal protein binding, 0016020 membrane; PDB: 1EJQ_B 1EJP_B 1YBO_C 1OBY_Q.
Probab=88.99 E-value=0.11 Score=33.94 Aligned_cols=14 Identities=21% Similarity=0.605 Sum_probs=3.0
Q ss_pred eecCCCcccchhhh
Q psy17635 33 KRNRGGKYAVHERE 46 (103)
Q Consensus 33 ~R~rGgKY~VkeKE 46 (103)
|+.--|.|.|.|..
T Consensus 37 rkkdEGSY~l~e~K 50 (64)
T PF01034_consen 37 RKKDEGSYDLDEPK 50 (64)
T ss_dssp S------SS--S--
T ss_pred HhcCCCCccCCCCC
Confidence 44446789888765
No 18
>PF15050 SCIMP: SCIMP protein
Probab=88.96 E-value=0.38 Score=35.33 Aligned_cols=35 Identities=17% Similarity=0.470 Sum_probs=20.6
Q ss_pred hhhHHHHHHHHHHHHHHHHHHH-hhee--ecCCCcccc
Q psy17635 8 TAGWFIGMLLAIAFLILVLILV-CLIK--RNRGGKYAV 42 (103)
Q Consensus 8 t~gWfIg~~~ai~lLllilli~-C~i~--R~rGgKY~V 42 (103)
...|+|..+.+|++-+.+-||+ |+|+ +++|.|..+
T Consensus 6 ~nFWiiLAVaII~vS~~lglIlyCvcR~~lRqGkkwei 43 (133)
T PF15050_consen 6 DNFWIILAVAIILVSVVLGLILYCVCRWQLRQGKKWEI 43 (133)
T ss_pred hchHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccee
Confidence 4568876665554444444444 5444 377888864
No 19
>PF06024 DUF912: Nucleopolyhedrovirus protein of unknown function (DUF912); InterPro: IPR009261 This entry is represented by Autographa californica nuclear polyhedrosis virus (AcMNPV), Orf78; it is a family of uncharacterised viral proteins.
Probab=88.58 E-value=0.48 Score=32.54 Aligned_cols=28 Identities=14% Similarity=0.199 Sum_probs=12.0
Q ss_pred hhHHHHHHHHHHHHHHHHHHH-hheeecCC
Q psy17635 9 AGWFIGMLLAIAFLILVLILV-CLIKRNRG 37 (103)
Q Consensus 9 ~gWfIg~~~ai~lLllilli~-C~i~R~rG 37 (103)
.-|++.+ +.+++++|+.+|. .+|.|.|.
T Consensus 63 iili~ll-s~v~IlVily~IyYFVILRer~ 91 (101)
T PF06024_consen 63 IILISLL-SFVCILVILYAIYYFVILRERQ 91 (101)
T ss_pred chHHHHH-HHHHHHHHHhhheEEEEEeccc
Confidence 3444333 3333444444444 44456544
No 20
>PF02439 Adeno_E3_CR2: Adenovirus E3 region protein CR2; InterPro: IPR003470 Early region 3 (E3) of human adenoviruses (Ads) codes for proteins that appear to control viral interactions with the host []. This region called CR1 (conserved region 1) [] is found three times in Human adenovirus 19 (a subgroup D adenovirus) 49 kDa protein in the E3 region. CR1 is also found in the 20.1 Kd protein of subgroup B adenoviruses. The function of this 80 amino acid region is unknown. This region is probably a divergent immunoglobulin domain.
Probab=86.84 E-value=1.2 Score=26.42 Aligned_cols=25 Identities=20% Similarity=0.677 Sum_probs=10.6
Q ss_pred HHHHHHHHHHHHHHHHH-HhheeecC
Q psy17635 12 FIGMLLAIAFLILVLIL-VCLIKRNR 36 (103)
Q Consensus 12 fIg~~~ai~lLllilli-~C~i~R~r 36 (103)
+.|++..+++++|..++ +|+.||.+
T Consensus 9 Iv~V~vg~~iiii~~~~YaCcykk~~ 34 (38)
T PF02439_consen 9 IVAVVVGMAIIIICMFYYACCYKKHR 34 (38)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHcccc
Confidence 33444444443333222 35556554
No 21
>PF08173 YbgT_YccB: Membrane bound YbgT-like protein; InterPro: IPR012994 This family contains a set of membrane proteins, typically 33 amino acids long. The family has no known function, but the protein is found in the operon CydAB in Escherichia coli. Members have a consensus motif (MWYFXW), which is rich in aromatic residues. The protein forms a single membrane-spanning helix. This family seems to be restricted to proteobacteria [].
Probab=86.67 E-value=1.1 Score=24.99 Aligned_cols=19 Identities=47% Similarity=0.803 Sum_probs=15.9
Q ss_pred hhHHHHHHHHHHHHHHHHH
Q psy17635 9 AGWFIGMLLAIAFLILVLI 27 (103)
Q Consensus 9 ~gWfIg~~~ai~lLllill 27 (103)
..|++|+..|+++-++.++
T Consensus 4 faWilG~~lA~~~~i~~a~ 22 (28)
T PF08173_consen 4 FAWILGVLLACAFGILNAM 22 (28)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 4699999999998887764
No 22
>PHA02867 C-type lectin protein; Provisional
Probab=86.48 E-value=0.59 Score=35.17 Aligned_cols=25 Identities=16% Similarity=0.384 Sum_probs=21.2
Q ss_pred hhHHHHHHHHHHHHHHHHHHHhhee
Q psy17635 9 AGWFIGMLLAIAFLILVLILVCLIK 33 (103)
Q Consensus 9 ~gWfIg~~~ai~lLllilli~C~i~ 33 (103)
.-.||-++.||+|+||++|+||-++
T Consensus 13 ~~~~~~~~~~~~~~~~~~~~v~~~~ 37 (167)
T PHA02867 13 SCYAITVLGILCLILFTILVVVTCK 37 (167)
T ss_pred eeehhHHHHHHHHHHHHHhhheeEE
Confidence 3578999999999999999887665
No 23
>KOG3956|consensus
Probab=85.88 E-value=0.66 Score=38.56 Aligned_cols=26 Identities=23% Similarity=0.418 Sum_probs=20.7
Q ss_pred HHHHHHHHHHHHHHhheeecCCCccc
Q psy17635 16 LLAIAFLILVLILVCLIKRNRGGKYA 41 (103)
Q Consensus 16 ~~ai~lLllilli~C~i~R~rGgKY~ 41 (103)
..+++||||+++|--.-....|+||+
T Consensus 8 r~~~allllll~~G~~~~kkqskKyS 33 (359)
T KOG3956|consen 8 RNHFALLLLLLVIGSAHNKKQSKKYS 33 (359)
T ss_pred HhhHHHHHHHHHhCcccchhhccccc
Confidence 34588888888887767788899997
No 24
>PF10577 UPF0560: Uncharacterised protein family UPF0560; InterPro: IPR018890 This family of proteins has no known function.
Probab=85.09 E-value=0.94 Score=41.57 Aligned_cols=30 Identities=20% Similarity=0.329 Sum_probs=15.5
Q ss_pred hHHHHHHHHHHHHHHHH-HHHhheeecCCCc
Q psy17635 10 GWFIGMLLAIAFLILVL-ILVCLIKRNRGGK 39 (103)
Q Consensus 10 gWfIg~~~ai~lLllil-li~C~i~R~rGgK 39 (103)
..+.++|.+++||+|++ |+++|.||.|-+|
T Consensus 273 ~fLl~ILG~~~livl~lL~vLl~yCrrkc~~ 303 (807)
T PF10577_consen 273 VFLLAILGGTALIVLILLCVLLCYCRRKCLK 303 (807)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhcccCC
Confidence 34466666666666553 3333334544444
No 25
>PHA02650 hypothetical protein; Provisional
Probab=84.37 E-value=2.2 Score=29.13 Aligned_cols=34 Identities=12% Similarity=0.201 Sum_probs=17.0
Q ss_pred hhhhhHHHHHHHHHHHHHHHHHHHhheeecCCCc
Q psy17635 6 VATAGWFIGMLLAIAFLILVLILVCLIKRNRGGK 39 (103)
Q Consensus 6 ~at~gWfIg~~~ai~lLllilli~C~i~R~rGgK 39 (103)
.....|+|.++.++.+++|+.+-+=.++|+-.|.
T Consensus 47 ~~~~~~ii~i~~v~i~~l~~flYLK~~~r~~~~~ 80 (81)
T PHA02650 47 FNGQNFIFLIFSLIIVALFSFFVFKGYTRNLNGR 80 (81)
T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHhccccCCC
Confidence 3334444444444444444444455677876653
No 26
>TIGR02106 cyd_oper_ybgT cyd operon protein YbgT. This model describes a very small (as short as 33 amino acids) protein of unknown function, essentially always found in an operon with CydAB, subunits of the cytochrome d terminal oxidase. It begins with an aromatic motif MWYFXW and appears to contain a membrane-spanning helix. This protein appears to be restricted to the Proteobacteria and exist in a single copy only. We suggest it may be a membrane subunit of the terminal oxidase. The family is named after the E. coli member YbgT. This model excludes the apparently related protein YccB.
Probab=84.11 E-value=1.6 Score=24.63 Aligned_cols=19 Identities=42% Similarity=0.749 Sum_probs=15.6
Q ss_pred hhHHHHHHHHHHHHHHHHH
Q psy17635 9 AGWFIGMLLAIAFLILVLI 27 (103)
Q Consensus 9 ~gWfIg~~~ai~lLllill 27 (103)
..|++|+..|+++.++..+
T Consensus 4 faWilG~~lA~~~~v~~a~ 22 (30)
T TIGR02106 4 FAWILGTLLACAFGVLNAM 22 (30)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 3699999999988887664
No 27
>PF12297 EVC2_like: Ellis van Creveld protein 2 like protein; InterPro: IPR022076 This family of proteins is found in eukaryotes. Proteins in this family are typically between 571 and 1310 amino acids in length. There are two conserved sequence motifs: LPA and ELH. EVC2 is implicated in Ellis van Creveld chondrodysplastic dwarfism in humans. Mutations in this protein can give rise to this congenital condition. LIMBIN is a protein which shares around 80% sequence homology with EVC2 and it is implicated in a similar condition in bovine chondrodysplastic dwarfism.
Probab=84.04 E-value=1.4 Score=37.81 Aligned_cols=38 Identities=21% Similarity=0.504 Sum_probs=28.1
Q ss_pred CCcchhhhhHHHHHHHHHHHHHHHHHHHhheeecCCCc
Q psy17635 2 PSDTVATAGWFIGMLLAIAFLILVLILVCLIKRNRGGK 39 (103)
Q Consensus 2 ~~~~~at~gWfIg~~~ai~lLllilli~C~i~R~rGgK 39 (103)
|.-+....|.|||++++|||..+.++++--++.-+|+-
T Consensus 61 pNHGlhaagFfvaflvslVL~~l~~f~l~r~~~l~~~~ 98 (429)
T PF12297_consen 61 PNHGLHAAGFFVAFLVSLVLTWLCFFLLARTRCLQGRP 98 (429)
T ss_pred cCcchHHHHHHHHHHHHHHHHHHHHHHHHHHccccccc
Confidence 45577889999999999999988887775444444433
No 28
>PF04478 Mid2: Mid2 like cell wall stress sensor; InterPro: IPR007567 This family represents a region near the C terminus of Mid2, which contains a transmembrane region. The remainder of the protein sequence is serine-rich and of low complexity, and is therefore impossible to align accurately. Mid2 is thought to act as a mechanosensor of cell wall stress. The C-terminal cytoplasmic region of Mid2 is known to interact with Rom2, a guanine nucleotide exchange factor (GEF) for Rho1, which is part of the cell wall integrity signalling pathway [].
Probab=83.41 E-value=0.44 Score=35.84 Aligned_cols=16 Identities=6% Similarity=0.241 Sum_probs=7.8
Q ss_pred HHHHhheeecCCCccc
Q psy17635 26 LILVCLIKRNRGGKYA 41 (103)
Q Consensus 26 lli~C~i~R~rGgKY~ 41 (103)
+|+.|||.|+|--.|-
T Consensus 68 ~lvf~~c~r~kktdfi 83 (154)
T PF04478_consen 68 ALVFIFCIRRKKTDFI 83 (154)
T ss_pred HhheeEEEecccCccc
Confidence 3334555555544554
No 29
>TIGR01167 LPXTG_anchor LPXTG-motif cell wall anchor domain. A common feature of this proteins containing this domain appears to be a high proportion of charged and zwitterionic residues immediatedly upstream of the LPXTG motif. This model differs from other descriptions of the LPXTG region by including a portion of that upstream charged region.
Probab=83.06 E-value=2.4 Score=22.96 Aligned_cols=16 Identities=13% Similarity=0.146 Sum_probs=9.2
Q ss_pred CCcchhhhhHHHHHHH
Q psy17635 2 PSDTVATAGWFIGMLL 17 (103)
Q Consensus 2 ~~~~~at~gWfIg~~~ 17 (103)
|+++-..+.|++.+..
T Consensus 3 P~TG~~~~~~~~~~G~ 18 (34)
T TIGR01167 3 PKTGESGNSLLLLLGL 18 (34)
T ss_pred CCCCCcccHHHHHHHH
Confidence 5666555566655543
No 30
>PHA02947 S-S bond formation pathway protein; Provisional
Probab=83.01 E-value=0.78 Score=36.17 Aligned_cols=34 Identities=35% Similarity=0.636 Sum_probs=22.7
Q ss_pred chhhhhHHHHHHHHHHHHHHHHHHHhheeecCCCcc
Q psy17635 5 TVATAGWFIGMLLAIAFLILVLILVCLIKRNRGGKY 40 (103)
Q Consensus 5 ~~at~gWfIg~~~ai~lLllilli~C~i~R~rGgKY 40 (103)
+++..-|||. ++++.+++.++.+|.+||.=+=||
T Consensus 174 ~~~~~~W~i~--~~~~i~~i~~i~i~~irR~i~lky 207 (215)
T PHA02947 174 PYSNKPWFIV--GVVIILIIFVIAICSIKRKINLKY 207 (215)
T ss_pred CcCCCchHHH--HHHHHHHHHHHHHHHHHHHheeeE
Confidence 4556689998 443444444555899999877776
No 31
>COG4818 Predicted membrane protein [Function unknown]
Probab=82.60 E-value=1.1 Score=31.73 Aligned_cols=32 Identities=31% Similarity=0.765 Sum_probs=23.2
Q ss_pred hhhHHHHHHHHHHHHHHHHHHHhheeecCCCccc
Q psy17635 8 TAGWFIGMLLAIAFLILVLILVCLIKRNRGGKYA 41 (103)
Q Consensus 8 t~gWfIg~~~ai~lLllilli~C~i~R~rGgKY~ 41 (103)
+.||++.-+.. +..+|+-++|.+|-.||.+|-
T Consensus 57 ~Ig~lls~~v~--l~a~iLwlv~mykAyrGe~fK 88 (105)
T COG4818 57 YIGWLLSGLVG--LAAFILWLVCMYKAYRGERFK 88 (105)
T ss_pred hhHHHHHhHHH--HHHHHHHHHHHHHHHcCCeec
Confidence 34777766554 444566678999999999994
No 32
>PHA02844 putative transmembrane protein; Provisional
Probab=82.52 E-value=2.6 Score=28.40 Aligned_cols=29 Identities=14% Similarity=0.392 Sum_probs=11.2
Q ss_pred hhhhhHHHHHHHHHHHHHHHHHHHhheee
Q psy17635 6 VATAGWFIGMLLAIAFLILVLILVCLIKR 34 (103)
Q Consensus 6 ~at~gWfIg~~~ai~lLllilli~C~i~R 34 (103)
.....|+|.++.++.+++++.+-+=.++|
T Consensus 46 ~~~~~~ii~i~~v~~~~~~~flYLK~~~r 74 (75)
T PHA02844 46 SSTKIWILTIIFVVFATFLTFLYLKAVPR 74 (75)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHheecC
Confidence 33444444433333333333333333444
No 33
>PF10717 ODV-E18: Occlusion-derived virus envelope protein ODV-E18; InterPro: IPR019655 Baculovirus occlusion-derived virus (ODV) derives its envelope from an intranuclear membrane source. Occlusion-derived viral envelope proteins that are detected in viral-induced intranuclear microvesicles, but not detected in the plasma membrane, cytoplasmic membranes, or the nuclear envelope. This entry represents ODV-E18 protein which is encoded by baculovirus late genes with transcription initiating from a TAAG motif. ODV-E18 exists as a dimer in the ODV envelope, which contains a hydrophobic domain that putatively acts as a target or retention signal for intranuclear microvesicles []. ; GO: 0019031 viral envelope
Probab=82.33 E-value=1.5 Score=30.20 Aligned_cols=25 Identities=16% Similarity=0.603 Sum_probs=12.6
Q ss_pred HHHHHHHHHHHHHHHhheeecCCCc
Q psy17635 15 MLLAIAFLILVLILVCLIKRNRGGK 39 (103)
Q Consensus 15 ~~~ai~lLllilli~C~i~R~rGgK 39 (103)
+|..++.|++|+|++-++.-+.+|.
T Consensus 27 lMtILivLVIIiLlImlfqsSS~~~ 51 (85)
T PF10717_consen 27 LMTILIVLVIIILLIMLFQSSSNGN 51 (85)
T ss_pred HHHHHHHHHHHHHHHHHHhccCCCC
Confidence 4555555555555555555444443
No 34
>PF02009 Rifin_STEVOR: Rifin/stevor family; InterPro: IPR002858 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see []. Several multicopy gene families have been described in Plasmodium falciparum, including the stevor family of subtelomeric open reading frames and the rif interspersed repetitive elements. Both families contain three predicted transmembrane segments. It has been proposed that stevor and rif are members of a larger superfamily that code for variant surface antigens [].
Probab=82.28 E-value=0.78 Score=37.37 Aligned_cols=26 Identities=27% Similarity=0.599 Sum_probs=19.0
Q ss_pred hhHHHHHHHHHHHHHHHHHHHhheee
Q psy17635 9 AGWFIGMLLAIAFLILVLILVCLIKR 34 (103)
Q Consensus 9 ~gWfIg~~~ai~lLllilli~C~i~R 34 (103)
+.=+|+-++||++++||++|+.+|-|
T Consensus 255 ~t~I~aSiiaIliIVLIMvIIYLILR 280 (299)
T PF02009_consen 255 TTAIIASIIAILIIVLIMVIIYLILR 280 (299)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34467777788888888888877754
No 35
>PF05545 FixQ: Cbb3-type cytochrome oxidase component FixQ; InterPro: IPR008621 This family consists of several Cbb3-type cytochrome oxidase components (FixQ/CcoQ). FixQ is found in nitrogen fixing bacteria. Since nitrogen fixation is an energy-consuming process, effective symbioses depend on operation of a respiratory chain with a high affinity for O2, closely coupled to ATP production. This requirement is fulfilled by a special three-subunit terminal oxidase (cytochrome terminal oxidase cbb3), which was first identified in Bradyrhizobium japonicum as the product of the fixNOQP operon [].
Probab=81.72 E-value=3.9 Score=24.32 Aligned_cols=32 Identities=9% Similarity=0.289 Sum_probs=16.6
Q ss_pred hhHHHHHHHHHHHHHHHHHHHhheeecCCCcc
Q psy17635 9 AGWFIGMLLAIAFLILVLILVCLIKRNRGGKY 40 (103)
Q Consensus 9 ~gWfIg~~~ai~lLllilli~C~i~R~rGgKY 40 (103)
+++.-.+...+.+++++.++++..++++..+|
T Consensus 7 ~~~~~~~~~v~~~~~F~gi~~w~~~~~~k~~~ 38 (49)
T PF05545_consen 7 QGFARSIGTVLFFVFFIGIVIWAYRPRNKKRF 38 (49)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcccchhhH
Confidence 34444444455555566555665555553333
No 36
>PF00957 Synaptobrevin: Synaptobrevin; InterPro: IPR001388 Synaptobrevin is an intrinsic membrane protein of small synaptic vesicles [], specialised secretory organelles of neurons that actively accumulate neurotransmitters and participate in their calcium-dependent release by exocytosis. Vesicle function is mediated by proteins in their membranes, although the precise nature of the protein-protein interactions underlying this are still uncertain []. Synaptobrevin may play a role in the molecular events underlying neurotransmitter release and vesicle recycling and may be involved in the regulation of membrane flow in the nerve terminal, a process mediated by interaction with low molecular weight GTP-binding proteins []. Synaptic vesicle-associated membrane proteins (VAMPs) from Torpedo californica (Pacific electric ray) and SNC1 from yeast are related to synaptobrevin.; GO: 0016192 vesicle-mediated transport, 0016021 integral to membrane; PDB: 3EGX_C 2NUP_C 3EGD_C 2NUT_C 1IOU_A 1H8M_A 3B5N_A 3ZYM_A 2NPS_A 1SFC_E ....
Probab=80.71 E-value=1.9 Score=28.10 Aligned_cols=6 Identities=33% Similarity=1.458 Sum_probs=1.8
Q ss_pred HHHHHh
Q psy17635 25 VLILVC 30 (103)
Q Consensus 25 illi~C 30 (103)
|++++|
T Consensus 83 i~~~~~ 88 (89)
T PF00957_consen 83 IIIVIC 88 (89)
T ss_dssp HHHTT-
T ss_pred HHHHHh
Confidence 333334
No 37
>PF07172 GRP: Glycine rich protein family; InterPro: IPR010800 This family consists of glycine rich proteins. Some of them may be involved in resistance to environmental stress [].
Probab=78.76 E-value=2.9 Score=28.71 Aligned_cols=14 Identities=14% Similarity=0.349 Sum_probs=5.6
Q ss_pred hhhhhcCCCCCCCC
Q psy17635 44 EREAAHGRHDYPEE 57 (103)
Q Consensus 44 eKE~~~g~~~~~dd 57 (103)
++|..+...+.+|+
T Consensus 30 ~~~~~~~~~~v~~~ 43 (95)
T PF07172_consen 30 ETEKEEEENEVQDD 43 (95)
T ss_pred hccccccCCCCCcc
Confidence 33333333344443
No 38
>PF14991 MLANA: Protein melan-A; PDB: 2GTZ_F 2GT9_F 3MRO_P 2GUO_C 3MRQ_P 2GTW_C 3L6F_C 3MRP_P.
Probab=77.97 E-value=0.34 Score=35.03 Aligned_cols=31 Identities=19% Similarity=0.446 Sum_probs=0.0
Q ss_pred HHHHHHHHHhheeecCCCcccchhhhhhcCC
Q psy17635 21 FLILVLILVCLIKRNRGGKYAVHEREAAHGR 51 (103)
Q Consensus 21 lLllilli~C~i~R~rGgKY~VkeKE~~~g~ 51 (103)
+|-++|||-|+.+|+|.|==.+++|-...+-
T Consensus 35 ILgiLLliGCWYckRRSGYk~L~~k~~~~gt 65 (118)
T PF14991_consen 35 ILGILLLIGCWYCKRRSGYKTLRDKSLHAGT 65 (118)
T ss_dssp -------------------------------
T ss_pred HHHHHHHHhheeeeecchhhhhhhccccccC
Confidence 3334455567777777776678777654443
No 39
>PF09451 ATG27: Autophagy-related protein 27; InterPro: IPR018939 Autophagy is a degradative transport pathway that delivers cytosolic proteins to the lysosome (vacuole) [] and is induced by starvation []. Cytosolic proteins appear inside the vacuole enclosed in autophagic vesicles. Autophagy significantly differs from other transport pathways by using double membrane layered transport intermediates, called autophagosomes [, ]. The breakdown of vesicular transport intermediates is a unique feature of autophagy []. Autophagy can also function in the elimination of invading bacteria and antigens []. There are more than 25 AuTophaGy-related (ATG) genes that are essential for autophagy, although it is still not known how the autophagosome is made. Atg9 is a potential membrane carrier to deliver lipids that are used to form the vesicle. Atg27 is another transmembrane protein, and is a cycling protein []. It acts as an effector of VPS34 phosphatidylinositol 3-phosphate kinase signalling and regulates the cytoplasm to vacuole transport (Cvt) vesicle formation. It is also required for autophagy-dependent cycling of ATG9.
Probab=77.35 E-value=3.2 Score=32.57 Aligned_cols=34 Identities=21% Similarity=0.338 Sum_probs=19.8
Q ss_pred hhhhhHHHHHHHHHHHHH-HHHHHHhheeecCCCc
Q psy17635 6 VATAGWFIGMLLAIAFLI-LVLILVCLIKRNRGGK 39 (103)
Q Consensus 6 ~at~gWfIg~~~ai~lLl-lilli~C~i~R~rGgK 39 (103)
...-|||..+++.++|.+ +.+++--++..+++|.
T Consensus 197 ~~~~g~f~wl~i~~~l~~~~Y~i~g~~~n~~~~g~ 231 (268)
T PF09451_consen 197 SGGWGFFTWLFIILFLFLAAYLIFGSWYNYNRYGA 231 (268)
T ss_pred cccccHHHHHHHHHHHHHHHHhhhhhheeeccCCC
Confidence 345578866665544443 3355556677777665
No 40
>PRK14740 kdbF potassium-transporting ATPase subunit F; Provisional
Probab=77.13 E-value=5 Score=22.55 Aligned_cols=26 Identities=27% Similarity=0.337 Sum_probs=20.5
Q ss_pred hHHHHHHHHHHHHHHHHHHHhheeec
Q psy17635 10 GWFIGMLLAIAFLILVLILVCLIKRN 35 (103)
Q Consensus 10 gWfIg~~~ai~lLllilli~C~i~R~ 35 (103)
.|+.++-.|.++++++-|++.+.|-.
T Consensus 2 ~~~~wls~a~a~~Lf~YLv~ALlRae 27 (29)
T PRK14740 2 TVLDWLSLALATGLFVYLLVALLRAD 27 (29)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHhccc
Confidence 48899989988888887777766543
No 41
>PF15069 FAM163: FAM163 family
Probab=76.92 E-value=1.8 Score=32.23 Aligned_cols=20 Identities=25% Similarity=0.610 Sum_probs=11.8
Q ss_pred HHHHHHHHHHHHHHHHHhhe
Q psy17635 13 IGMLLAIAFLILVLILVCLI 32 (103)
Q Consensus 13 Ig~~~ai~lLllilli~C~i 32 (103)
.|-++|.+.||.|+.++|.+
T Consensus 9 tGgILAtVILLcIIaVLCYC 28 (143)
T PF15069_consen 9 TGGILATVILLCIIAVLCYC 28 (143)
T ss_pred echHHHHHHHHHHHHHHHHH
Confidence 35566666666666666544
No 42
>PF07271 Cytadhesin_P30: Cytadhesin P30/P32; InterPro: IPR009896 This family consists of several Mycoplasma species specific Cytadhesin P32 and P30 proteins. P30 has been found to be membrane associated and localised on the tip organelle. It is thought that it is important in cytadherence and virulence [].; GO: 0007157 heterophilic cell-cell adhesion, 0009405 pathogenesis, 0016021 integral to membrane
Probab=76.81 E-value=0.63 Score=37.95 Aligned_cols=42 Identities=26% Similarity=0.445 Sum_probs=25.6
Q ss_pred chhhhhHHHHHHHHHHHHHHHHHHH------hheeecCCCcccchhhhhh
Q psy17635 5 TVATAGWFIGMLLAIAFLILVLILV------CLIKRNRGGKYAVHEREAA 48 (103)
Q Consensus 5 ~~at~gWfIg~~~ai~lLllilli~------C~i~R~rGgKY~VkeKE~~ 48 (103)
.+..+.|||-+..+++-|++++||+ =++| |-.|-.+.++|+.
T Consensus 62 ~i~~~~W~~P~v~~~~G~~~v~liLgl~ig~p~~k--rkek~~iee~e~~ 109 (279)
T PF07271_consen 62 PITEQSWFIPVVGGSAGLLAVALILGLAIGIPIYK--RKEKRMIEEKEEH 109 (279)
T ss_pred ecccccceeeeccchhhHHHHHHHHHHhhcchhhh--hhHHHHHHHHHHH
Confidence 3567889997765544444333332 3334 4488888888864
No 43
>MTH00036 ATP8 ATP synthase F0 subunit 8; Validated
Probab=76.56 E-value=4.7 Score=25.25 Aligned_cols=23 Identities=30% Similarity=0.648 Sum_probs=15.8
Q ss_pred CCCcchhhhhHHHHHHHHHHHHHHH
Q psy17635 1 MPSDTVATAGWFIGMLLAIAFLILV 25 (103)
Q Consensus 1 ~~~~~~at~gWfIg~~~ai~lLlli 25 (103)
|||-.+ .-||+-++++.++++++
T Consensus 1 MPQLd~--s~Wf~~~~~~W~~l~~~ 23 (54)
T MTH00036 1 MPQLDF--TWWLFNFLLAWSLVFIV 23 (54)
T ss_pred CCCCcc--hHHHHHHHHHHHHHHHH
Confidence 777665 47999998885444433
No 44
>PF12768 Rax2: Cortical protein marker for cell polarity
Probab=75.95 E-value=1.2 Score=35.67 Aligned_cols=26 Identities=35% Similarity=0.736 Sum_probs=12.4
Q ss_pred HHHHHHHH---HHHHHHHHHhheeecCCC
Q psy17635 13 IGMLLAIA---FLILVLILVCLIKRNRGG 38 (103)
Q Consensus 13 Ig~~~ai~---lLllilli~C~i~R~rGg 38 (103)
|++-||+. ||.|+-+|+.+++|+|-|
T Consensus 232 IslAiALG~v~ll~l~Gii~~~~~r~~~~ 260 (281)
T PF12768_consen 232 ISLAIALGTVFLLVLIGIILAYIRRRRQG 260 (281)
T ss_pred EehHHHHHHHHHHHHHHHHHHHHHhhhcc
Confidence 45555533 333444445666555433
No 45
>PF13209 DUF4017: Protein of unknown function (DUF4017)
Probab=75.80 E-value=1.9 Score=27.72 Aligned_cols=33 Identities=30% Similarity=0.711 Sum_probs=25.6
Q ss_pred cchhhhhH--HHHHHHHHHHHHHHHHHHhheeecC
Q psy17635 4 DTVATAGW--FIGMLLAIAFLILVLILVCLIKRNR 36 (103)
Q Consensus 4 ~~~at~gW--fIg~~~ai~lLllilli~C~i~R~r 36 (103)
++-.+.|| |+|-.-||-.+++..+|-.++.|+|
T Consensus 25 egYN~vgWKlfvGQ~YAiPif~i~aiitFyinkk~ 59 (60)
T PF13209_consen 25 EGYNTVGWKLFVGQAYAIPIFIITAIITFYINKKK 59 (60)
T ss_pred cCccccchhheecchhHhHHHHHHHHHhheecccc
Confidence 45667778 6777888888888888888888775
No 46
>PF14584 DUF4446: Protein of unknown function (DUF4446)
Probab=75.43 E-value=1.8 Score=32.02 Aligned_cols=22 Identities=32% Similarity=0.594 Sum_probs=15.5
Q ss_pred HHHHHHHHHHHHHHHHHHHhhe
Q psy17635 11 WFIGMLLAIAFLILVLILVCLI 32 (103)
Q Consensus 11 WfIg~~~ai~lLllilli~C~i 32 (103)
|++..++++++++||+++++.+
T Consensus 2 ~i~i~l~~l~iilli~~~~~~~ 23 (151)
T PF14584_consen 2 YIIIGLLVLVIILLILIIILNI 23 (151)
T ss_pred cHHHHHHHHHHHHHHHHHHHHH
Confidence 6777777777777777776555
No 47
>PF00974 Rhabdo_glycop: Rhabdovirus spike glycoprotein; InterPro: IPR001903 Different families of ssRNA negative-strand viruses contain glycoproteins responsible for forming spikes on the surface of the virion. The glycoprotein spike is made up of a trimer of glycoproteins. These proteins are frequently abbreviated to G protein. Channel formed by glycoprotein spike is thought to function in a similar manner to Influenza virus M2 protein channel, thus allowing a signal to pass across the viral membrane to signal for viral uncoating [, ].; GO: 0019031 viral envelope; PDB: 2CMZ_C 2J6J_A 3EGD_D.
Probab=75.35 E-value=0.91 Score=39.05 Aligned_cols=22 Identities=32% Similarity=0.772 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHH-HHHHhhe
Q psy17635 11 WFIGMLLAIAFLILV-LILVCLI 32 (103)
Q Consensus 11 WfIg~~~ai~lLlli-lli~C~i 32 (103)
|+.....+|++++|+ +++.||.
T Consensus 454 ~~~~~~~~vi~~illi~l~~cc~ 476 (501)
T PF00974_consen 454 WLSIIAIAVILLILLILLIRCCC 476 (501)
T ss_dssp -----------------------
T ss_pred HHHHHHHHHHHHHHHHHHHHHhh
Confidence 444444444444444 3344443
No 48
>PF14914 LRRC37AB_C: LRRC37A/B like protein 1 C-terminal domain
Probab=74.37 E-value=3.2 Score=31.32 Aligned_cols=23 Identities=22% Similarity=0.527 Sum_probs=15.3
Q ss_pred hHHHHHHHHHHHHHHHHHHHhhe
Q psy17635 10 GWFIGMLLAIAFLILVLILVCLI 32 (103)
Q Consensus 10 gWfIg~~~ai~lLllilli~C~i 32 (103)
-|+|..+++.+.+.++++|+|+|
T Consensus 119 nklilaisvtvv~~iliii~CLi 141 (154)
T PF14914_consen 119 NKLILAISVTVVVMILIIIFCLI 141 (154)
T ss_pred chhHHHHHHHHHHHHHHHHHHHH
Confidence 47777777766666666666654
No 49
>PF12606 RELT: Tumour necrosis factor receptor superfamily member 19; InterPro: IPR022248 The members of tumor necrosis factor receptor (TNFR) superfamily have been designated as the "guardians of the immune system" due to their roles in immune cell proliferation, differentiation, activation, and death (apoptosis). RELT (receptor expressed in lymphoid tissues) is a member of the TNFR superfamily. The messenger RNA of RELT is especially abundant in hematologic tissues such as spleen, lymph node, and peripheral blood leukocytes as well as in leukemias and lymphomas. RELT is able to activate the NF-kappaB pathway and selectively binds tumor necrosis factor receptor-associated factor 1 []. RELT like proteins 1 and 2 (RELL1 and RELL2) are two RELT homologues that bind to RELT. The expression of RELL1 at the mRNA level is ubiquitous, whereas expression of RELL2 mRNA is more restricted to particular tissues [].
Probab=74.27 E-value=4.1 Score=25.29 Aligned_cols=35 Identities=26% Similarity=0.273 Sum_probs=18.9
Q ss_pred HHHHHHHHHHHHHHHHHHHhheeecCCCcccchhhh
Q psy17635 11 WFIGMLLAIAFLILVLILVCLIKRNRGGKYAVHERE 46 (103)
Q Consensus 11 WfIg~~~ai~lLllilli~C~i~R~rGgKY~VkeKE 46 (103)
||+.+...+++.+|.+ .+|=+-+.+|=|..|...|
T Consensus 3 ~~~iV~i~iv~~lLg~-~I~~~~K~ygYkht~d~~e 37 (50)
T PF12606_consen 3 AFLIVSIFIVMGLLGL-SICTTLKAYGYKHTVDPLE 37 (50)
T ss_pred ehHHHHHHHHHHHHHH-HHHHHhhccccccccCCCC
Confidence 5555555544444443 3455566667676665533
No 50
>KOG3637|consensus
Probab=73.38 E-value=2.6 Score=39.49 Aligned_cols=29 Identities=31% Similarity=0.552 Sum_probs=17.3
Q ss_pred hhHHHHHHHHHHHHHHHHHHHh-----heeecCC
Q psy17635 9 AGWFIGMLLAIAFLILVLILVC-----LIKRNRG 37 (103)
Q Consensus 9 ~gWfIg~~~ai~lLllilli~C-----~i~R~rG 37 (103)
-.|+|.+=+...||||+||++| |.||+|=
T Consensus 977 p~wiIi~svl~GLLlL~llv~~LwK~GFFKR~r~ 1010 (1030)
T KOG3637|consen 977 PLWIIILSVLGGLLLLALLVLLLWKCGFFKRNRK 1010 (1030)
T ss_pred ceeeehHHHHHHHHHHHHHHHHHHhcCccccCCC
Confidence 4577766555555555555554 4477764
No 51
>PHA02662 ORF131 putative membrane protein; Provisional
Probab=72.73 E-value=2.2 Score=33.94 Aligned_cols=31 Identities=23% Similarity=0.433 Sum_probs=20.8
Q ss_pred hHHHHHHHHHHHHHHHHHHHhheeecCCCccc
Q psy17635 10 GWFIGMLLAIAFLILVLILVCLIKRNRGGKYA 41 (103)
Q Consensus 10 gWfIg~~~ai~lLllilli~C~i~R~rGgKY~ 41 (103)
-|||.+..+ +.+++.++.+|.++|.=+=||-
T Consensus 186 ~W~i~~~v~-~i~~i~vv~i~~irR~i~lkYr 216 (226)
T PHA02662 186 PWTLLLAVA-AVTVLGVVAVSLLRRALRIRFR 216 (226)
T ss_pred cchhHHHHH-HHHHHHHHHHHHHHHHhheeee
Confidence 699876655 2333444457999998777773
No 52
>PF11694 DUF3290: Protein of unknown function (DUF3290); InterPro: IPR021707 This family of proteins with unknown function appears to be restricted to Firmicutes.
Probab=72.47 E-value=4.7 Score=29.74 Aligned_cols=33 Identities=18% Similarity=0.461 Sum_probs=18.3
Q ss_pred hhHHHHHHHHHHHHHHHHHHHhheeecCCCccc
Q psy17635 9 AGWFIGMLLAIAFLILVLILVCLIKRNRGGKYA 41 (103)
Q Consensus 9 ~gWfIg~~~ai~lLllilli~C~i~R~rGgKY~ 41 (103)
.-|+-.++.+++++++++..+-..|.++.-||-
T Consensus 15 ~~~~~~~~i~~ll~~l~~~~~~Y~r~r~~tKyR 47 (149)
T PF11694_consen 15 NDYLRYILIIILLLVLIFFFIKYLRNRLDTKYR 47 (149)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCcchhh
Confidence 344444444444455554445556677788884
No 53
>PHA03240 envelope glycoprotein M; Provisional
Probab=72.30 E-value=4.4 Score=32.59 Aligned_cols=15 Identities=20% Similarity=0.465 Sum_probs=9.3
Q ss_pred hhhhhHHHHHHHHHH
Q psy17635 6 VATAGWFIGMLLAIA 20 (103)
Q Consensus 6 ~at~gWfIg~~~ai~ 20 (103)
.+-..|+|.+++.|+
T Consensus 210 aaH~~WIiilIIiIi 224 (258)
T PHA03240 210 AAHIAWIFIAIIIII 224 (258)
T ss_pred cchHhHHHHHHHHHH
Confidence 455678876655433
No 54
>PRK02251 putative septation inhibitor protein; Reviewed
Probab=72.19 E-value=5 Score=27.55 Aligned_cols=33 Identities=18% Similarity=0.617 Sum_probs=24.8
Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHhheeecCCCcccc
Q psy17635 7 ATAGWFIGMLLAIAFLILVLILVCLIKRNRGGKYAV 42 (103)
Q Consensus 7 at~gWfIg~~~ai~lLllilli~C~i~R~rGgKY~V 42 (103)
.+-.||..+|+++.++=|+=+++..+ -|+.||+
T Consensus 29 ~sP~W~~~~m~~lm~~Gl~WlvvyYl---~~~~~P~ 61 (87)
T PRK02251 29 SNPRWFVPLFVALMIIGLIWLVVYYL---SNGSLPI 61 (87)
T ss_pred CCCchHHHHHHHHHHHHHHHHHHHhh---hCCCcCc
Confidence 45689999999988888887777666 2566654
No 55
>PTZ00046 rifin; Provisional
Probab=72.14 E-value=2.8 Score=35.22 Aligned_cols=28 Identities=29% Similarity=0.579 Sum_probs=21.1
Q ss_pred hhHHHHHHHHHHHHHHHHHHHhheeecC
Q psy17635 9 AGWFIGMLLAIAFLILVLILVCLIKRNR 36 (103)
Q Consensus 9 ~gWfIg~~~ai~lLllilli~C~i~R~r 36 (103)
+.=+|+-++||++++||.+|+.+|-|=|
T Consensus 314 ~taIiaSiiAIvVIVLIMvIIYLILRYR 341 (358)
T PTZ00046 314 QTAIIASIVAIVVIVLIMVIIYLILRYR 341 (358)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 3346777788888888889999886644
No 56
>PF11807 DUF3328: Domain of unknown function (DUF3328); InterPro: IPR021765 This family of proteins are functionally uncharacterised. This family is only found in eukaryotes.
Probab=71.86 E-value=5.5 Score=28.29 Aligned_cols=25 Identities=32% Similarity=0.438 Sum_probs=14.7
Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHH
Q psy17635 4 DTVATAGWFIGMLLAIAFLILVLIL 28 (103)
Q Consensus 4 ~~~at~gWfIg~~~ai~lLllilli 28 (103)
+.-.+..|++.++.+++++++.+++
T Consensus 3 p~~r~~~w~~~~~~~~~~l~~~l~~ 27 (217)
T PF11807_consen 3 PRRRRRRWRRLLLLFLLLLILSLLL 27 (217)
T ss_pred CCCCCccHHHHHHHHHHHHHHHHHH
Confidence 3455677888866665555444333
No 57
>PF14575 EphA2_TM: Ephrin type-A receptor 2 transmembrane domain; PDB: 3KUL_A 2XVD_A 2VX1_A 2VWV_A 2VX0_A 2VWY_A 2VWZ_A 2VWW_A 2VWU_A 2VWX_A ....
Probab=71.78 E-value=3.3 Score=27.10 Aligned_cols=24 Identities=25% Similarity=0.506 Sum_probs=7.6
Q ss_pred HHHHHHHHHHHHHHHHhheeecCC
Q psy17635 14 GMLLAIAFLILVLILVCLIKRNRG 37 (103)
Q Consensus 14 g~~~ai~lLllilli~C~i~R~rG 37 (103)
|++..++++++++++.--.+++++
T Consensus 9 g~~~ll~~v~~~~~~~rr~~~~~~ 32 (75)
T PF14575_consen 9 GVLLLLVLVIIVIVCFRRCKYSKA 32 (75)
T ss_dssp HHHHHHHHHHHHHCCCTT------
T ss_pred HHHHHHHhheeEEEEEeeEcCCCC
Confidence 344444444444444333334444
No 58
>PHA02955 hypothetical protein; Provisional
Probab=71.73 E-value=2 Score=33.78 Aligned_cols=28 Identities=36% Similarity=0.629 Sum_probs=19.1
Q ss_pred hHHHHHHHHHHHHHHHHHHHhheeecCCCcc
Q psy17635 10 GWFIGMLLAIAFLILVLILVCLIKRNRGGKY 40 (103)
Q Consensus 10 gWfIg~~~ai~lLllilli~C~i~R~rGgKY 40 (103)
-|||....++ +++++++|.+||.=+=||
T Consensus 180 ~w~ii~~v~i---i~~~v~l~yikR~i~~ky 207 (213)
T PHA02955 180 KWFIIYIVLC---LLILIILGYIYRTVRIKY 207 (213)
T ss_pred cchhHHHHHH---HHHHHHHHHHHHHheeeE
Confidence 6888554443 455555899999877666
No 59
>COG4890 Predicted outer membrane lipoprotein [Function unknown]
Probab=71.47 E-value=6.2 Score=23.19 Aligned_cols=21 Identities=38% Similarity=0.749 Sum_probs=17.6
Q ss_pred hhHHHHHHHHHHHHHHHHHHH
Q psy17635 9 AGWFIGMLLAIAFLILVLILV 29 (103)
Q Consensus 9 ~gWfIg~~~ai~lLllilli~ 29 (103)
.+|+.|++.|-++-+|-.+-+
T Consensus 4 FaWiLG~lLAcAFgiinAlwl 24 (37)
T COG4890 4 FAWILGLLLACAFGIINALWL 24 (37)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 469999999999998877654
No 60
>PRK13275 mtrF tetrahydromethanopterin S-methyltransferase subunit F; Provisional
Probab=71.31 E-value=6.8 Score=25.76 Aligned_cols=25 Identities=28% Similarity=0.527 Sum_probs=18.1
Q ss_pred hhhhhHHHHHHHHHHHHHHHHHHHh
Q psy17635 6 VATAGWFIGMLLAIAFLILVLILVC 30 (103)
Q Consensus 6 ~at~gWfIg~~~ai~lLllilli~C 30 (103)
....|..||++.|++|.++.+++..
T Consensus 40 ~~~~G~aiG~~~AlvLv~ip~~l~~ 64 (67)
T PRK13275 40 TGIIGFAIGFLLALLLVVVPPLLYG 64 (67)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4567888999999887766655544
No 61
>PF02480 Herpes_gE: Alphaherpesvirus glycoprotein E; InterPro: IPR003404 Glycoprotein E (gE) of Alphaherpesvirus forms a complex with glycoprotein I (gI), functioning as an immunoglobulin G (IgG) Fc binding protein. gE is involved in virus spread but is not essential for propagation [].; GO: 0016020 membrane; PDB: 2GJ7_F 2GIY_B.
Probab=71.01 E-value=1.3 Score=37.58 Aligned_cols=32 Identities=22% Similarity=0.314 Sum_probs=0.0
Q ss_pred hHHHHHHHHHHHHHHHHHH--HhheeecCCCccc
Q psy17635 10 GWFIGMLLAIAFLILVLIL--VCLIKRNRGGKYA 41 (103)
Q Consensus 10 gWfIg~~~ai~lLllilli--~C~i~R~rGgKY~ 41 (103)
.++.+++++++++++++++ +|+.+|.|--+|.
T Consensus 353 ~~l~vVlgvavlivVv~viv~vc~~~rrrR~~~~ 386 (439)
T PF02480_consen 353 ALLGVVLGVAVLIVVVGVIVWVCLRCRRRRRQRD 386 (439)
T ss_dssp ----------------------------------
T ss_pred chHHHHHHHHHHHHHHHHHhheeeeehhcccccc
Confidence 3444444544444444333 3444455444444
No 62
>PRK14749 hypothetical protein; Provisional
Probab=70.81 E-value=7.1 Score=22.10 Aligned_cols=19 Identities=42% Similarity=0.857 Sum_probs=15.0
Q ss_pred hhHHHHHHHHHHHHHHHHH
Q psy17635 9 AGWFIGMLLAIAFLILVLI 27 (103)
Q Consensus 9 ~gWfIg~~~ai~lLllill 27 (103)
..|+.|+..|..+-+|-++
T Consensus 4 faWiLG~~lAc~f~ilna~ 22 (30)
T PRK14749 4 LLWFVGILLMCSLSTLVLV 22 (30)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 3699999999888777654
No 63
>PF06103 DUF948: Bacterial protein of unknown function (DUF948); InterPro: IPR009293 This family consists of bacterial sequences several of which are thought to be general stress proteins.
Probab=69.69 E-value=3.6 Score=26.89 Aligned_cols=21 Identities=43% Similarity=0.868 Sum_probs=14.9
Q ss_pred HHHHHHHHHHHHHHHHhheee
Q psy17635 14 GMLLAIAFLILVLILVCLIKR 34 (103)
Q Consensus 14 g~~~ai~lLllilli~C~i~R 34 (103)
+++.||+|++|+++++.++++
T Consensus 3 ~lI~Aiaf~vLvi~l~~~l~~ 23 (90)
T PF06103_consen 3 GLIAAIAFAVLVIFLIKVLKK 23 (90)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 566778888888777666554
No 64
>PRK11901 hypothetical protein; Reviewed
Probab=69.50 E-value=4.1 Score=33.96 Aligned_cols=16 Identities=31% Similarity=0.874 Sum_probs=8.2
Q ss_pred HHHHHHHHHHHHHHHh
Q psy17635 15 MLLAIAFLILVLILVC 30 (103)
Q Consensus 15 ~~~ai~lLllilli~C 30 (103)
+|++|.+|+|+||||-
T Consensus 38 ~MiGiGilVLlLLIi~ 53 (327)
T PRK11901 38 MMIGIGILVLLLLIIA 53 (327)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 4455555555555554
No 65
>TIGR01477 RIFIN variant surface antigen, rifin family. This model represents the rifin branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of rifin sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 20 bits.
Probab=69.30 E-value=3.6 Score=34.58 Aligned_cols=27 Identities=30% Similarity=0.593 Sum_probs=20.7
Q ss_pred hHHHHHHHHHHHHHHHHHHHhheeecC
Q psy17635 10 GWFIGMLLAIAFLILVLILVCLIKRNR 36 (103)
Q Consensus 10 gWfIg~~~ai~lLllilli~C~i~R~r 36 (103)
.=+|+-++||++++||.+|+.+|-|=|
T Consensus 310 t~IiaSiIAIvvIVLIMvIIYLILRYR 336 (353)
T TIGR01477 310 TPIIASIIAILIIVLIMVIIYLILRYR 336 (353)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 346777788888888889998886633
No 66
>PRK11677 hypothetical protein; Provisional
Probab=69.19 E-value=6.2 Score=28.80 Aligned_cols=25 Identities=12% Similarity=0.246 Sum_probs=17.1
Q ss_pred HHHHHHHHHHHHHHHHHHHhheeec
Q psy17635 11 WFIGMLLAIAFLILVLILVCLIKRN 35 (103)
Q Consensus 11 WfIg~~~ai~lLllilli~C~i~R~ 35 (103)
|++++++.|+-+++-+++.-+..++
T Consensus 3 W~~a~i~livG~iiG~~~~R~~~~~ 27 (134)
T PRK11677 3 WEYALIGLVVGIIIGAVAMRFGNRK 27 (134)
T ss_pred HHHHHHHHHHHHHHHHHHHhhccch
Confidence 9888877777776666666555443
No 67
>TIGR01478 STEVOR variant surface antigen, stevor family. This model represents the stevor branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of stevor sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 8 bits.
Probab=69.10 E-value=4 Score=33.59 Aligned_cols=13 Identities=23% Similarity=0.731 Sum_probs=7.3
Q ss_pred HHHHHHhheeecC
Q psy17635 24 LVLILVCLIKRNR 36 (103)
Q Consensus 24 lilli~C~i~R~r 36 (103)
||+|-+|+.||+|
T Consensus 275 liiLYiWlyrrRK 287 (295)
T TIGR01478 275 LIILYIWLYRRRK 287 (295)
T ss_pred HHHHHHHHHHhhc
Confidence 3344457777665
No 68
>PF00895 ATP-synt_8: ATP synthase protein 8; InterPro: IPR001421 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include: F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. F-ATPases (also known as F1F0-ATPase, or H(+)-transporting two-sector ATPase) (3.6.3.14 from EC) are composed of two linked complexes: the F1 ATPase complex is the catalytic core and is composed of 5 subunits (alpha, beta, gamma, delta, epsilon), while the F0 ATPase complex is the membrane-embedded proton channel that is composed of at least 3 subunits (A-C), nine in mitochondria (A-G, F6, F8). Both the F1 and F0 complexes are rotary motors that are coupled back-to-back. In the F1 complex, the central gamma subunit forms the rotor inside the cylinder made of the alpha(3)beta(3) subunits, while in the F0 complex, the ring-shaped C subunits forms the rotor. The two rotors rotate in opposite directions, but the F0 rotor is usually stronger, using the force from the proton gradient to push the F1 rotor in reverse in order to drive ATP synthesis []. These ATPases can also work in reverse to hydrolyse ATP to create a proton gradient. This entry represents subunit 8 found in the F0 complex of mitochondrial F-ATPases from Metazoa. This subunit appears to be an integral component of the stator stalk in yeast mitochondrial F-ATPases []. The stator stalk is anchored in the membrane, and acts to prevent futile rotation of the ATPase subunits relative to the rotor during coupled ATP synthesis/hydrolysis. This subunit may have an analogous function in Metazoa. Subunit 8 differs in sequence between Metazoa, plants (IPR003319 from INTERPRO) and fungi (IPR009230 from INTERPRO). More information about this protein can be found at Protein of the Month: ATP Synthases [].; GO: 0015078 hydrogen ion transmembrane transporter activity, 0015986 ATP synthesis coupled proton transport, 0000276 mitochondrial proton-transporting ATP synthase complex, coupling factor F(o)
Probab=69.10 E-value=12 Score=21.82 Aligned_cols=30 Identities=27% Similarity=0.620 Sum_probs=19.1
Q ss_pred CCCcchhhhhHHHHHHHHHHHHHHHHHHHhhe
Q psy17635 1 MPSDTVATAGWFIGMLLAIAFLILVLILVCLI 32 (103)
Q Consensus 1 ~~~~~~at~gWfIg~~~ai~lLllilli~C~i 32 (103)
|||- +..-||+-.+++...|++++.+..+.
T Consensus 1 MPQL--~P~~W~~~f~~~~~~l~~~~~~~~~~ 30 (54)
T PF00895_consen 1 MPQL--NPMPWFFLFLFFWLILLIIMILIYFF 30 (54)
T ss_pred CCCC--ChHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 5553 34469999999876666555544433
No 69
>PF12575 DUF3753: Protein of unknown function (DUF3753); InterPro: IPR009175 This group represents an uncharacterised conserved protein belonging to poxvirus family I2.
Probab=68.51 E-value=8 Score=25.79 Aligned_cols=18 Identities=39% Similarity=0.748 Sum_probs=7.2
Q ss_pred hHHHHHHHHHHHHHHHHH
Q psy17635 10 GWFIGMLLAIAFLILVLI 27 (103)
Q Consensus 10 gWfIg~~~ai~lLllill 27 (103)
+|.+.++..|+.++++++
T Consensus 47 ~~~~~ii~ii~v~ii~~l 64 (72)
T PF12575_consen 47 NWIILIISIIFVLIIVLL 64 (72)
T ss_pred hHHHHHHHHHHHHHHHHH
Confidence 344444444443333333
No 70
>PF11395 DUF2873: Protein of unknown function (DUF2873); InterPro: IPR021532 This entry is represented by the human SARS coronavirus, Orf7b; it is a family of uncharacterised viral proteins.
Probab=68.48 E-value=9.2 Score=22.97 Aligned_cols=19 Identities=16% Similarity=0.501 Sum_probs=9.8
Q ss_pred HHHHHHHHHHHHHHH-Hhhe
Q psy17635 14 GMLLAIAFLILVLIL-VCLI 32 (103)
Q Consensus 14 g~~~ai~lLllilli-~C~i 32 (103)
-++..+++|++|.+| .||+
T Consensus 12 c~l~~llflv~imliif~f~ 31 (43)
T PF11395_consen 12 CFLSFLLFLVIIMLIIFWFS 31 (43)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 344445555555444 4664
No 71
>PTZ00370 STEVOR; Provisional
Probab=68.38 E-value=4.3 Score=33.45 Aligned_cols=14 Identities=21% Similarity=0.688 Sum_probs=7.6
Q ss_pred HHHHHHhheeecCC
Q psy17635 24 LVLILVCLIKRNRG 37 (103)
Q Consensus 24 lilli~C~i~R~rG 37 (103)
||+|-+|+.||+|.
T Consensus 271 liilYiwlyrrRK~ 284 (296)
T PTZ00370 271 LIILYIWLYRRRKN 284 (296)
T ss_pred HHHHHHHHHHhhcc
Confidence 33444577776653
No 72
>PF11669 WBP-1: WW domain-binding protein 1; InterPro: IPR021684 This family of proteins represents WBP-1, a ligand of the WW domain of Yes-associated protein. This protein has a proline-rich domain. WBP-1 does not bind to the SH3 domain [].
Probab=67.69 E-value=4.9 Score=27.76 Aligned_cols=11 Identities=9% Similarity=0.567 Sum_probs=5.2
Q ss_pred hhHHHHHHHHH
Q psy17635 9 AGWFIGMLLAI 19 (103)
Q Consensus 9 ~gWfIg~~~ai 19 (103)
+.|++.++.+|
T Consensus 20 ~~w~FWlv~~l 30 (102)
T PF11669_consen 20 ELWYFWLVWVL 30 (102)
T ss_pred HHHHHHHHHHH
Confidence 44655554333
No 73
>PF05478 Prominin: Prominin; InterPro: IPR008795 The prominins are an emerging family of proteins that, among the multispan membrane proteins, display a novel topology. Mouse and Homo sapiens prominin and (Mus musculus) prominin-like 1 (PROML1) are predicted to contain five membrane spanning domains, with an N-terminal domain exposed to the extracellular space followed by four, alternating small cytoplasmic and large extracellular, loops and a cytoplasmic C-terminal domain []. The exact function of prominin is unknown although in humans defects in PROM1, the gene coding for prominin, cause retinal degeneration [].; GO: 0016021 integral to membrane
Probab=67.50 E-value=6 Score=35.68 Aligned_cols=12 Identities=33% Similarity=0.625 Sum_probs=6.5
Q ss_pred Hhhe--eecCCCcc
Q psy17635 29 VCLI--KRNRGGKY 40 (103)
Q Consensus 29 ~C~i--~R~rGgKY 40 (103)
.||| +++=||+.
T Consensus 114 fCcCRCc~~CGg~~ 127 (806)
T PF05478_consen 114 FCCCRCCGNCGGRM 127 (806)
T ss_pred HhccccCCCcCCcc
Confidence 4555 24557775
No 74
>PRK04778 septation ring formation regulator EzrA; Provisional
Probab=66.89 E-value=2.9 Score=36.12 Aligned_cols=23 Identities=35% Similarity=0.582 Sum_probs=11.8
Q ss_pred HHHHHHHHHHHHHHHHHHhheee
Q psy17635 12 FIGMLLAIAFLILVLILVCLIKR 34 (103)
Q Consensus 12 fIg~~~ai~lLllilli~C~i~R 34 (103)
++.++++|+++++|++++.++.|
T Consensus 2 ~~~~ii~i~ii~i~~~~~~~~~r 24 (569)
T PRK04778 2 MIYLIIAIVVIIIIAYLAGLILR 24 (569)
T ss_pred hhhHHHHHHHHHHHHHHHHHHHH
Confidence 34455555555555554445544
No 75
>PF02480 Herpes_gE: Alphaherpesvirus glycoprotein E; InterPro: IPR003404 Glycoprotein E (gE) of Alphaherpesvirus forms a complex with glycoprotein I (gI), functioning as an immunoglobulin G (IgG) Fc binding protein. gE is involved in virus spread but is not essential for propagation [].; GO: 0016020 membrane; PDB: 2GJ7_F 2GIY_B.
Probab=66.29 E-value=1.9 Score=36.64 Aligned_cols=30 Identities=23% Similarity=0.403 Sum_probs=0.0
Q ss_pred hhhhHHHHHHHH--HHHHHHHHHHHhheeecC
Q psy17635 7 ATAGWFIGMLLA--IAFLILVLILVCLIKRNR 36 (103)
Q Consensus 7 at~gWfIg~~~a--i~lLllilli~C~i~R~r 36 (103)
.-.+.++|+..+ |+++++++++.|..+|.+
T Consensus 353 ~~l~vVlgvavlivVv~viv~vc~~~rrrR~~ 384 (439)
T PF02480_consen 353 ALLGVVLGVAVLIVVVGVIVWVCLRCRRRRRQ 384 (439)
T ss_dssp --------------------------------
T ss_pred chHHHHHHHHHHHHHHHHHhheeeeehhcccc
Confidence 344455444444 455556688888887543
No 76
>PF13623 SurA_N_2: SurA N-terminal domain
Probab=65.87 E-value=6.2 Score=28.71 Aligned_cols=22 Identities=9% Similarity=0.422 Sum_probs=17.2
Q ss_pred chhhhhHHHHHHHHHHHHHHHH
Q psy17635 5 TVATAGWFIGMLLAIAFLILVL 26 (103)
Q Consensus 5 ~~at~gWfIg~~~ai~lLllil 26 (103)
.++..+|++.++++++|+.+|+
T Consensus 3 kIR~r~~lLi~vIglAL~aFIv 24 (145)
T PF13623_consen 3 KIRQRGGLLIIVIGLALFAFIV 24 (145)
T ss_pred hHhhcchHHHHHHHHHHHHHHH
Confidence 3567799988888888887777
No 77
>PRK14762 membrane protein; Provisional
Probab=65.59 E-value=9.1 Score=21.04 Aligned_cols=17 Identities=18% Similarity=0.528 Sum_probs=13.3
Q ss_pred hhHHHHHHHHHHHHHHH
Q psy17635 9 AGWFIGMLLAIAFLILV 25 (103)
Q Consensus 9 ~gWfIg~~~ai~lLlli 25 (103)
..|.|+++-.|.||++.
T Consensus 4 ~lw~i~iifligllvvt 20 (27)
T PRK14762 4 ILWAVLIIFLIGLLVVT 20 (27)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 47999888888877765
No 78
>PF06143 Baculo_11_kDa: Baculovirus 11 kDa family; InterPro: IPR009313 This is a family of uncharacterised Baculovirus proteins that are all about 11 kDa in size.
Probab=64.44 E-value=8 Score=26.43 Aligned_cols=14 Identities=7% Similarity=0.477 Sum_probs=8.7
Q ss_pred HHHHHHHHHHHHHH
Q psy17635 12 FIGMLLAIAFLILV 25 (103)
Q Consensus 12 fIg~~~ai~lLlli 25 (103)
||-++|++++++++
T Consensus 36 FvLVic~~lVfVii 49 (84)
T PF06143_consen 36 FVLVICCFLVFVII 49 (84)
T ss_pred HHHHHHHHHHHHHH
Confidence 67778875555444
No 79
>PF09801 SYS1: Integral membrane protein S linking to the trans Golgi network; InterPro: IPR019185 Members of this family are integral membrane proteins involved in protein trafficking between the late Golgi and endosome. They may also serve as a receptor for ADP-ribosylation factor-related protein 1 (ARFRP1) []. Sys1p is a small integral membrane protein with four predicted transmembrane domains that localises to the Trans Golgi network TGN in yeast and human cells [].
Probab=64.37 E-value=7.7 Score=28.32 Aligned_cols=31 Identities=23% Similarity=0.558 Sum_probs=24.4
Q ss_pred hhhhhHHHHHHHHHHHHHHHHHHHhheeecC
Q psy17635 6 VATAGWFIGMLLAIAFLILVLILVCLIKRNR 36 (103)
Q Consensus 6 ~at~gWfIg~~~ai~lLllilli~C~i~R~r 36 (103)
..++||.++++-.+.-++-.+.+.++|+|.|
T Consensus 55 ~~~~g~~~~~~~ll~sl~~~~~L~~iV~R~K 85 (144)
T PF09801_consen 55 DTTSGWLVILAWLLNSLVCALFLAFIVQRSK 85 (144)
T ss_pred CCcchHHHHHHHHHHHHHHHHHheeeEeccc
Confidence 3467999998887777777777788888876
No 80
>PF02932 Neur_chan_memb: Neurotransmitter-gated ion-channel transmembrane region ion channel family signature gamma-aminobutyric acid (GABA) receptor signature nicotinic acetylcholine receptor signature; InterPro: IPR006029 Neurotransmitter ligand-gated ion channels are transmembrane receptor-ion channel complexes that open transiently upon binding of specific ligands, allowing rapid transmission of signals at chemical synapses [, ]. Five of these ion channel receptor families have been shown to form a sequence-related superfamily: Nicotinic acetylcholine receptor (AchR), an excitatory cation channel in vertebrates and invertebrates; in vertebrate motor endplates it is composed of alpha, beta, gamma and delta/epsilon subunits; in neurons it is composed of alpha and non-alpha (or beta) subunits []. Glycine receptor, an inhibitory chloride ion channel composed of alpha and beta subunits []. Gamma-aminobutyric acid (GABA) receptor, an inhibitory chloride ion channel; at least four types of subunits (alpha, beta, gamma and delta) are known []. Serotonin 5HT3 receptor, of which there are seven major types (5HT3-5HT7) []. Glutamate receptor, an excitatory cation channel of which at least three types have been described (kainate, N-methyl-D-aspartate (NMDA) and quisqualate) []. These receptors possess a pentameric structure (made up of varying subunits), surrounding a central pore. All known sequences of subunits from neurotransmitter-gated ion-channels are structurally related. They are composed of a large extracellular glycosylated N-terminal ligand-binding domain, followed by three hydrophobic transmembrane regions which form the ionic channel, followed by an intracellular region of variable length. A fourth hydrophobic region is found at the C-terminal of the sequence [, ]. This domain represents four transmembrane helices of a variety of neurotransmitter-gated ion-channels.; GO: 0006811 ion transport, 0016020 membrane; PDB: 1DXZ_A 3MRA_A 1EQ8_C 1OED_C 2PR9_P 1A11_A 1CEK_A 2BG9_E 2KSR_A 2K59_B ....
Probab=64.26 E-value=5.6 Score=26.64 Aligned_cols=33 Identities=18% Similarity=0.282 Sum_probs=19.7
Q ss_pred hhhhhHHHHHHHHHHHHHHHHHHHhheeecCCC
Q psy17635 6 VATAGWFIGMLLAIAFLILVLILVCLIKRNRGG 38 (103)
Q Consensus 6 ~at~gWfIg~~~ai~lLllilli~C~i~R~rGg 38 (103)
.+-.+|+++.++++..+++.+.++++..+.|+.
T Consensus 55 ~pl~~~~~~~~~~v~~~~~~~avv~~~~~~~~~ 87 (237)
T PF02932_consen 55 KPLDGWYFICTMFVFSASLEFAVVVYNIHHRGP 87 (237)
T ss_dssp SHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHST
T ss_pred cchhhhhccccchhhhHHHHhhhhhhhhccccc
Confidence 466789988877766655555444444433333
No 81
>PF05297 Herpes_LMP1: Herpesvirus latent membrane protein 1 (LMP1); InterPro: IPR007961 This family consists of several latent membrane protein 1 or LMP1s mostly from Epstein-Barr virus (strain GD1) (HHV-4) (Human herpesvirus 4). LMP1 of HHV-4 is a 62-65 kDa plasma membrane protein possessing six membrane spanning regions, a short cytoplasmic N terminus and a long cytoplasmic carboxy tail of 200 amino acids. HHV-4 virus latent membrane protein 1 (LMP1) is essential for HHV-4 mediated transformation and has been associated with several cases of malignancies. HHV-4-like viruses in Macaca fascicularis (Cynomolgus monkeys) have been associated with high lymphoma rates in immunosuppressed monkeys [].; GO: 0019087 transformation of host cell by virus, 0016021 integral to membrane; PDB: 1CZY_E 1ZMS_B.
Probab=63.81 E-value=2.3 Score=35.68 Aligned_cols=11 Identities=27% Similarity=0.552 Sum_probs=0.0
Q ss_pred hhhhHHHHHHH
Q psy17635 7 ATAGWFIGMLL 17 (103)
Q Consensus 7 at~gWfIg~~~ 17 (103)
|-.-||..+|+
T Consensus 35 ail~w~~iims 45 (381)
T PF05297_consen 35 AILVWFFIIMS 45 (381)
T ss_dssp -----------
T ss_pred HHHHHHHHHHh
Confidence 44567776664
No 82
>PF08113 CoxIIa: Cytochrome c oxidase subunit IIa family; InterPro: IPR012538 This family consists of the cytochrome c oxidase subunit IIa family. The bax-type cytochrome c oxidase from Thermus thermophilus is known as a two subunit enzyme. From its crystal structure, it was discovered that an additional transmembrane helix, subunit IIa, spans the membrane. This subunit consists of 34 residues forming one helix across the membrane. The presence of this subunit seems to be important for the function of cytochrome c oxidases [].; PDB: 2QPD_C 3QJR_C 3EH5_C 3BVD_C 3S39_C 3QJU_C 3QJS_C 4EV3_C 3QJT_C 4FA7_C ....
Probab=63.39 E-value=12 Score=21.69 Aligned_cols=19 Identities=16% Similarity=0.312 Sum_probs=12.8
Q ss_pred HHHHHHHHHHHHHHHHHhh
Q psy17635 13 IGMLLAIAFLILVLILVCL 31 (103)
Q Consensus 13 Ig~~~ai~lLllilli~C~ 31 (103)
+|.|..|++|.+++|+.+|
T Consensus 6 ~Gal~vv~iLt~~ILvFWf 24 (34)
T PF08113_consen 6 KGALGVVMILTAFILVFWF 24 (34)
T ss_dssp HHHHHHHHHHHHHHHHHHH
T ss_pred cceeeeHHHHHHHHHHHHH
Confidence 4566667777777777664
No 83
>PF06365 CD34_antigen: CD34/Podocalyxin family; InterPro: IPR013836 This family consists of several mammalian CD34 antigen proteins. The CD34 antigen is a human leukocyte membrane protein expressed specifically by lymphohematopoietic progenitor cells. CD34 is a phosphoprotein. Activation of protein kinase C (PKC) has been found to enhance CD34 phosphorylation [, ]. This family contains several eukaryotic podocalyxin proteins. Podocalyxin is a major membrane protein of the glomerular epithelium and is thought to be involved in maintenance of the architecture of the foot processes and filtration slits characteristic of this unique epithelium by virtue of its high negative charge. Podocalyxin functions as an anti-adhesin that maintains an open filtration pathway between neighbouring foot processes in the glomerular epithelium by charge repulsion [].
Probab=62.88 E-value=3 Score=32.45 Aligned_cols=37 Identities=19% Similarity=0.160 Sum_probs=16.1
Q ss_pred HHHHHHHHH--HHHHHHHHHH-hheeecCCCcccchhhhh
Q psy17635 11 WFIGMLLAI--AFLILVLILV-CLIKRNRGGKYAVHEREA 47 (103)
Q Consensus 11 WfIg~~~ai--~lLllilli~-C~i~R~rGgKY~VkeKE~ 47 (103)
=||++++.. +||+++++.+ ||.+|+.-.+=..+--|+
T Consensus 101 ~lI~lv~~g~~lLla~~~~~~Y~~~~Rrs~~~~~~rl~Ee 140 (202)
T PF06365_consen 101 TLIALVTSGSFLLLAILLGAGYCCHQRRSWSKKGQRLGEE 140 (202)
T ss_pred EEEehHHhhHHHHHHHHHHHHHHhhhhccCCcchhhhccC
Confidence 456666554 3333333333 444454434433444443
No 84
>PTZ00382 Variant-specific surface protein (VSP); Provisional
Probab=62.19 E-value=7.1 Score=26.68 Aligned_cols=11 Identities=9% Similarity=0.298 Sum_probs=6.6
Q ss_pred HHHHhheeecC
Q psy17635 26 LILVCLIKRNR 36 (103)
Q Consensus 26 lli~C~i~R~r 36 (103)
+|..+|+.|.|
T Consensus 85 ~l~w~f~~r~k 95 (96)
T PTZ00382 85 FLCWWFVCRGK 95 (96)
T ss_pred HHhheeEEeec
Confidence 34456677765
No 85
>PF14914 LRRC37AB_C: LRRC37A/B like protein 1 C-terminal domain
Probab=62.04 E-value=10 Score=28.63 Aligned_cols=27 Identities=19% Similarity=0.309 Sum_probs=15.5
Q ss_pred HHHHHHHHHHHHHHHHHH-HhheeecCC
Q psy17635 11 WFIGMLLAIAFLILVLIL-VCLIKRNRG 37 (103)
Q Consensus 11 WfIg~~~ai~lLllilli-~C~i~R~rG 37 (103)
--|.++.++++|++|+|+ -++..|+..
T Consensus 123 laisvtvv~~iliii~CLiei~shr~a~ 150 (154)
T PF14914_consen 123 LAISVTVVVMILIIIFCLIEICSHRRAS 150 (154)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccccc
Confidence 346777776666666544 444444433
No 86
>PF00517 GP41: Retroviral envelope protein; InterPro: IPR000328 This entry represents envelope proteins from a variety of retroviruses. It includes the GP41 subunit of the envelope protein complex from Human immunodeficiency virus (HIV) and Simian-Human immunodeficiency virus (SIV), which mediate membrane fusion during viral entry []. It has a core composed of a six-helix bundle and is folded by its trimeric N- and C-terminal heptad-repeats (NHR and CHR) []. Derivatives of this protein prevent HIV-1 from entering cell lines and primary human CD4+ cells in vitro [], making it an attractive subject of gene therapy studies against HIV and related retroviruses. The entry also represents envelop proteins from Bovine immunodeficiency virus, Feline immunodeficiency virus and Equine infectious anemia virus (EIAV) [, ], as well as the Gp36 protein from Mouse mammary tumor virus (MMTV) and Human endogenous retrovirus (HERV).; GO: 0005198 structural molecule activity, 0019031 viral envelope; PDB: 2EZO_B 2EZQ_B 2EZR_A 2JNR_B 1F23_D 2EZP_A 1JEK_A 2Q7C_A 2Q5U_A 2Q3I_A ....
Probab=61.91 E-value=2.9 Score=31.94 Aligned_cols=18 Identities=17% Similarity=0.394 Sum_probs=0.0
Q ss_pred HHHHHhheeecCCCcccc
Q psy17635 25 VLILVCLIKRNRGGKYAV 42 (103)
Q Consensus 25 illi~C~i~R~rGgKY~V 42 (103)
+.+++++|+|-|-|-=|+
T Consensus 173 ~~~i~~~~~~~~qgy~~l 190 (204)
T PF00517_consen 173 LMCILRICRRVRQGYSPL 190 (204)
T ss_dssp ------------------
T ss_pred HHHHHHHHHHHhCCCCCc
Confidence 344445556656665455
No 87
>PRK11486 flagellar biosynthesis protein FliO; Provisional
Probab=61.86 E-value=7.5 Score=28.14 Aligned_cols=31 Identities=19% Similarity=0.180 Sum_probs=17.8
Q ss_pred chhhhhHHHHHHHHHHH-HHHHHHHHhheeec
Q psy17635 5 TVATAGWFIGMLLAIAF-LILVLILVCLIKRN 35 (103)
Q Consensus 5 ~~at~gWfIg~~~ai~l-Lllilli~C~i~R~ 35 (103)
...+.+=++-++.++++ +.+|+++.|++||-
T Consensus 11 ~~~~~~~l~qv~~~L~lVl~lI~~~aWLlkR~ 42 (124)
T PRK11486 11 APVSGSPLLQVSGALIGIIALILAAAWLVKRL 42 (124)
T ss_pred CCCCcchHHHHHHHHHHHHHHHHHHHHHHHHc
Confidence 34444455556666443 33556667888883
No 88
>PRK10788 periplasmic folding chaperone; Provisional
Probab=61.81 E-value=7.9 Score=33.36 Aligned_cols=35 Identities=23% Similarity=0.252 Sum_probs=22.5
Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHhheeecCCCccc
Q psy17635 7 ATAGWFIGMLLAIAFLILVLILVCLIKRNRGGKYA 41 (103)
Q Consensus 7 at~gWfIg~~~ai~lLllilli~C~i~R~rGgKY~ 41 (103)
.+++|++.++.++++|.|++.-+.-+....|+++.
T Consensus 8 ~~~~~~~~~i~~li~l~F~~~G~~~~~~~~~~~~v 42 (623)
T PRK10788 8 AANSVVLKIILALIILSFILTGVGGYLIGGSNNYA 42 (623)
T ss_pred hccChHHHHHHHHHHHHHHHHcchhhccCCCCCee
Confidence 36799999888877777776555433333355553
No 89
>PHA03286 envelope glycoprotein E; Provisional
Probab=61.00 E-value=6.1 Score=34.57 Aligned_cols=18 Identities=28% Similarity=0.582 Sum_probs=11.9
Q ss_pred HHHHHHHHHHHhheeecC
Q psy17635 19 IAFLILVLILVCLIKRNR 36 (103)
Q Consensus 19 i~lLllilli~C~i~R~r 36 (103)
++++++-+||+|++||+|
T Consensus 402 ~~~~~~~~~~~~~~~r~~ 419 (492)
T PHA03286 402 LVVLLFALCIAGLYRRRR 419 (492)
T ss_pred HHHHHHHHHhHhHhhhhh
Confidence 444455588888887654
No 90
>TIGR02866 CoxB cytochrome c oxidase, subunit II. Cytochrome c oxidase is the terminal electron acceptor of mitochondria (and one of several possible acceptors in prokaryotes) in the electron transport chain of aerobic respiration. The enzyme couples the oxidation of reduced cytochrome c with the reduction of molecular oxygen to water. This process results in the pumping of four protons across the membrane which are used in the proton gradient powered synthesis of ATP. The oxidase contains two heme a cofactors and three copper atoms as well as other bound ions.
Probab=60.88 E-value=11 Score=28.17 Aligned_cols=18 Identities=28% Similarity=0.420 Sum_probs=8.1
Q ss_pred HHHHHHHHHHHhheeecC
Q psy17635 19 IAFLILVLILVCLIKRNR 36 (103)
Q Consensus 19 i~lLllilli~C~i~R~r 36 (103)
|.+++..+++.+++|.++
T Consensus 23 I~v~V~~~l~~~~~k~r~ 40 (201)
T TIGR02866 23 ISLLVAALLAYVVWKFRR 40 (201)
T ss_pred HHHHHHHHHHHhhhhhhc
Confidence 333444445455554433
No 91
>PRK10927 essential cell division protein FtsN; Provisional
Probab=60.86 E-value=4.4 Score=33.66 Aligned_cols=56 Identities=18% Similarity=0.332 Sum_probs=26.4
Q ss_pred HHHHHHHHHHHHHH--HhheeecCCCcccchhhhhhcCCC-CCCCCCCCcccCCCCCCCC
Q psy17635 15 MLLAIAFLILVLIL--VCLIKRNRGGKYAVHEREAAHGRH-DYPEEPGFNEYSQPLDGKP 71 (103)
Q Consensus 15 ~~~ai~lLllilli--~C~i~R~rGgKY~VkeKE~~~g~~-~~~dd~~F~EYsd~dek~~ 71 (103)
.|+||+++||+++| ++||..||.+-=.+-..-...++- |-|.|+-+ .|-.+.|++-
T Consensus 35 ~m~alAvavlv~fiGGLyFith~k~~e~~~~~~~~~~~ngLPPKPEErW-rYIKELEnRq 93 (319)
T PRK10927 35 AMVAIAAAVLVTFIGGLYFITHHKKEESETLQSQKVTGNGLPPKPEERW-RYIKELESRQ 93 (319)
T ss_pred HHHHHHHHHHHHHhhheEEEecCCCCCCcccccCCCCCCCCCCCcchhh-HHHHHHhccC
Confidence 34555555555555 688888875422211100111221 44555555 5655555543
No 92
>PRK14758 hypothetical protein; Provisional
Probab=60.66 E-value=6.2 Score=21.79 Aligned_cols=8 Identities=38% Similarity=1.364 Sum_probs=3.8
Q ss_pred HHHHHHHh
Q psy17635 23 ILVLILVC 30 (103)
Q Consensus 23 llilli~C 30 (103)
+|+++|+|
T Consensus 10 iLivlIlC 17 (27)
T PRK14758 10 ILIILILC 17 (27)
T ss_pred HHHHHHHH
Confidence 34444555
No 93
>PF04277 OAD_gamma: Oxaloacetate decarboxylase, gamma chain ; InterPro: IPR005899 This family comprises distantly related, low complexity, hydrophobic small subunits of several related sodium ion-pumping decarboxylases. These include oxaloacetate decarboxylase gamma subunit and methylmalonyl-CoA decarboxylase delta subunit [].; GO: 0008948 oxaloacetate decarboxylase activity, 0015081 sodium ion transmembrane transporter activity, 0071436 sodium ion export, 0016020 membrane
Probab=60.38 E-value=15 Score=23.30 Aligned_cols=18 Identities=39% Similarity=0.669 Sum_probs=7.3
Q ss_pred hHHHHHHHH-HHHHHHHHH
Q psy17635 10 GWFIGMLLA-IAFLILVLI 27 (103)
Q Consensus 10 gWfIg~~~a-i~lLllill 27 (103)
||-+-+++. ++|+.|++|
T Consensus 4 gl~i~i~Gm~iVF~~L~lL 22 (79)
T PF04277_consen 4 GLQIMIIGMGIVFLVLILL 22 (79)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 444444333 344433333
No 94
>PF01788 PsbJ: PsbJ; InterPro: IPR002682 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection []. This family represents the low molecular weight transmembrane protein PsbJ found in PSII. PsbJ is one of the most hydrophobic proteins in the thylakoid membrane, and is located in a gene cluster with PsbE, PsbF and PsbL (PsbEFJL). Both PsbJ and PsbL (IPR003372 from INTERPRO) are essential for proper assembly of the OEC. Mutations in PsbJ cause the light-harvesting antenna to remain detached from the PSII dimers []. In addition, both PsbJ and PsbL are involved in the unidirectional flow of electrons, where PsbJ regulates the forward electron flow from D2 (Qa) to the plastoquinone pool, and PsbL prevents the reduction of PSII by back electron flow from plastoquinol protecting PSII from photo-inactivation [].; GO: 0015979 photosynthesis, 0009523 photosystem II, 0009539 photosystem II reaction center, 0016020 membrane; PDB: 3A0H_J 3ARC_J 3A0B_J 3KZI_J 2AXT_J 3PRQ_J 4FBY_b 3BZ2_J 1S5L_j 3PRR_J ....
Probab=60.31 E-value=17 Score=21.80 Aligned_cols=21 Identities=29% Similarity=0.545 Sum_probs=14.5
Q ss_pred hhhHHHHHHHHHHHHHHHHHH
Q psy17635 8 TAGWFIGMLLAIAFLILVLIL 28 (103)
Q Consensus 8 t~gWfIg~~~ai~lLllilli 28 (103)
---|+|+.+..++.+-++-+.
T Consensus 8 IPLWlVgtv~G~~vi~lvglF 28 (40)
T PF01788_consen 8 IPLWLVGTVAGIAVIGLVGLF 28 (40)
T ss_dssp S-HHHHHHHHHHHHHHHHHHH
T ss_pred ccchHHHHHHHHHHHHHHHHh
Confidence 346999999987766655443
No 95
>COG3190 FliO Flagellar biogenesis protein [Cell motility and secretion]
Probab=59.90 E-value=9.8 Score=28.12 Aligned_cols=28 Identities=32% Similarity=0.284 Sum_probs=14.5
Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHhheee
Q psy17635 7 ATAGWFIGMLLAIAFLILVLILVCLIKR 34 (103)
Q Consensus 7 at~gWfIg~~~ai~lLllilli~C~i~R 34 (103)
+..-|+--+...|++|.||++.+|++||
T Consensus 20 ~~~~~~~~~gsL~~iL~lil~~~wl~kr 47 (137)
T COG3190 20 AALELAQMFGSLILILALILFLAWLVKR 47 (137)
T ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3333443333445555566666666664
No 96
>PF02009 Rifin_STEVOR: Rifin/stevor family; InterPro: IPR002858 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see []. Several multicopy gene families have been described in Plasmodium falciparum, including the stevor family of subtelomeric open reading frames and the rif interspersed repetitive elements. Both families contain three predicted transmembrane segments. It has been proposed that stevor and rif are members of a larger superfamily that code for variant surface antigens [].
Probab=59.83 E-value=6.2 Score=32.18 Aligned_cols=24 Identities=21% Similarity=0.512 Sum_probs=13.7
Q ss_pred HHHHHHHHHHHHHHHHHhheeecC
Q psy17635 13 IGMLLAIAFLILVLILVCLIKRNR 36 (103)
Q Consensus 13 Ig~~~ai~lLllilli~C~i~R~r 36 (103)
|.+++.|||.+||++++.-.||.|
T Consensus 262 iiaIliIVLIMvIIYLILRYRRKK 285 (299)
T PF02009_consen 262 IIAILIIVLIMVIIYLILRYRRKK 285 (299)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHh
Confidence 444455555555555556677743
No 97
>PF12669 P12: Virus attachment protein p12 family
Probab=59.38 E-value=5.4 Score=25.04 Aligned_cols=11 Identities=27% Similarity=0.483 Sum_probs=6.3
Q ss_pred HHhheeecCCC
Q psy17635 28 LVCLIKRNRGG 38 (103)
Q Consensus 28 i~C~i~R~rGg 38 (103)
+.-++|+.|.|
T Consensus 17 ~r~~~k~~K~G 27 (58)
T PF12669_consen 17 IRKFIKDKKKG 27 (58)
T ss_pred HHHHHHHhhcC
Confidence 35666665555
No 98
>PF09472 MtrF: Tetrahydromethanopterin S-methyltransferase, F subunit (MtrF); InterPro: IPR013347 Many archaea have evolved energy-yielding pathways marked by one-carbon biochemistry featuring novel cofactors and enzymes. This domain is mostly found in MtrF, where it covers the entire length of the protein. This polypeptide is one of eight subunits of the N5-methyltetrahydromethanopterin: coenzyme M methyltransferase complex found in methanogenic archaea. This is a membrane-associated enzyme complex that uses methyl-transfer reactions to drive a sodium-ion pump []. MtrF itself is involved in the transfer of the methyl group from N5-methyltetrahydromethanopterin to coenzyme M. Subsequently, methane is produced by two-electron reduction of the methyl moiety in methyl-coenzyme M by another enzyme, methyl-coenzyme M reductase. In some organisms this domain is found at the C-terminal region of what appears to be a fusion of the MtrA and MtrF proteins [, ]. The function of these proteins is unknown, though it is likely that they are involved in C1 metabolism.; GO: 0030269 tetrahydromethanopterin S-methyltransferase activity, 0015948 methanogenesis, 0016020 membrane
Probab=59.15 E-value=15 Score=23.87 Aligned_cols=23 Identities=26% Similarity=0.615 Sum_probs=15.6
Q ss_pred hhhhHHHHHHHHHHHHHHHHHHH
Q psy17635 7 ATAGWFIGMLLAIAFLILVLILV 29 (103)
Q Consensus 7 at~gWfIg~~~ai~lLllilli~ 29 (103)
...|..||++.|++|.++.+++.
T Consensus 41 ~~~GfaiG~~~AlvLv~ip~~l~ 63 (64)
T PF09472_consen 41 GIKGFAIGFLFALVLVGIPILLM 63 (64)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHh
Confidence 44577788888877776665543
No 99
>MTH00133 ATP8 ATP synthase F0 subunit 8; Provisional
Probab=57.69 E-value=16 Score=22.31 Aligned_cols=24 Identities=21% Similarity=0.539 Sum_probs=15.5
Q ss_pred CCCcchhhhhHHHHHHHHHHHHHHHH
Q psy17635 1 MPSDTVATAGWFIGMLLAIAFLILVL 26 (103)
Q Consensus 1 ~~~~~~at~gWfIg~~~ai~lLllil 26 (103)
|||- ...-||..++++-+.|++++
T Consensus 1 MPQL--np~PWf~i~l~tW~~l~~~~ 24 (55)
T MTH00133 1 MPQL--NPAPWFAILVFSWLVFLTII 24 (55)
T ss_pred CCCC--CCccHHHHHHHHHHHHHHHH
Confidence 5553 34479999999855544443
No 100
>PF15176 LRR19-TM: Leucine-rich repeat family 19 TM domain
Probab=57.65 E-value=11 Score=26.80 Aligned_cols=15 Identities=7% Similarity=0.038 Sum_probs=7.4
Q ss_pred cchhhhhHHHHHHHH
Q psy17635 4 DTVATAGWFIGMLLA 18 (103)
Q Consensus 4 ~~~at~gWfIg~~~a 18 (103)
.+.....|-..+-++
T Consensus 10 ~~~~g~sW~~LVGVv 24 (102)
T PF15176_consen 10 PGEGGRSWPFLVGVV 24 (102)
T ss_pred CCCCCcccHhHHHHH
Confidence 344466775444333
No 101
>PHA03283 envelope glycoprotein E; Provisional
Probab=56.35 E-value=12 Score=33.18 Aligned_cols=25 Identities=16% Similarity=0.232 Sum_probs=12.9
Q ss_pred hhhHH--HHHHHHHHHHHHHHHHHhhe
Q psy17635 8 TAGWF--IGMLLAIAFLILVLILVCLI 32 (103)
Q Consensus 8 t~gWf--Ig~~~ai~lLllilli~C~i 32 (103)
+..|+ ++++++++.++++.|++++|
T Consensus 396 ~~~~l~~~~~~~~~~~~~~~~l~vw~c 422 (542)
T PHA03283 396 TRHYLAFLLAIICTCAALLVALVVWGC 422 (542)
T ss_pred ccccchhHHHHHHHHHHHHHHHhhhhe
Confidence 35564 44444555555665555433
No 102
>PF05434 Tmemb_9: TMEM9; InterPro: IPR008853 This family contains several eukaryotic transmembrane proteins which are homologous to Homo sapiens transmembrane protein 9 Q9P0T7 from SWISSPROT. The TMEM9 gene encodes a 183 amino-acid protein that contains an N-terminal signal peptide, a single transmembrane region, three potential N-glycosylation sites and three conserved cys-rich domains in the N terminus, but no known functional domains. The protein is highly conserved between species from Caenorhabditis elegans to H. sapiens and belongs to a novel family of transmembrane proteins. The exact function of TMEM9 is unknown although it has been found to be widely expressed and localised to the late endosomes and lysosomes []. Members of this family contain CXCXC repeats IPR004153 from INTERPRO in their N-terminal region.; GO: 0016021 integral to membrane
Probab=56.11 E-value=6.8 Score=29.35 Aligned_cols=30 Identities=30% Similarity=0.776 Sum_probs=15.0
Q ss_pred HHHHHHHHHHHHHHHH-h---heeecCCCcccch
Q psy17635 14 GMLLAIAFLILVLILV-C---LIKRNRGGKYAVH 43 (103)
Q Consensus 14 g~~~ai~lLllilli~-C---~i~R~rGgKY~Vk 43 (103)
.+|+.|.+|+|..+.+ | +++|..+.+|..+
T Consensus 60 ivl~Vi~lLvlYM~fL~~ldPllrk~~~~~~~~~ 93 (149)
T PF05434_consen 60 IVLWVIGLLVLYMLFLMCLDPLLRKRFGHNYQIH 93 (149)
T ss_pred EeHHHHHHHHHHHHHHHHHhHHHhcccccccccc
Confidence 3444444444432222 2 2467777777643
No 103
>PF10779 XhlA: Haemolysin XhlA; InterPro: IPR019715 Haemolysin XhlA is a cell-surface associated haemolysin that lyses the two most prevalent types of insect immune cells (granulocytes and plasmatocytes) as well as rabbit and horse erythrocytes [].
Probab=55.68 E-value=20 Score=22.83 Aligned_cols=21 Identities=5% Similarity=0.398 Sum_probs=14.6
Q ss_pred hhhhHHHHHHHHHHHHHHHHH
Q psy17635 7 ATAGWFIGMLLAIAFLILVLI 27 (103)
Q Consensus 7 at~gWfIg~~~ai~lLllill 27 (103)
+++-|+++++...++..++.+
T Consensus 48 ~n~kW~~r~iiGaiI~~i~~~ 68 (71)
T PF10779_consen 48 SNTKWIWRTIIGAIITAIIYL 68 (71)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 577899998887555544433
No 104
>PF12575 DUF3753: Protein of unknown function (DUF3753); InterPro: IPR009175 This group represents an uncharacterised conserved protein belonging to poxvirus family I2.
Probab=55.51 E-value=20 Score=23.90 Aligned_cols=23 Identities=9% Similarity=0.291 Sum_probs=15.9
Q ss_pred chhhhhHHHHHHHHHHHHHHHHH
Q psy17635 5 TVATAGWFIGMLLAIAFLILVLI 27 (103)
Q Consensus 5 ~~at~gWfIg~~~ai~lLllill 27 (103)
++....|+|+++.++.+++|+.+
T Consensus 45 ~~~~~~~ii~ii~v~ii~~l~fl 67 (72)
T PF12575_consen 45 NFNWIILIISIIFVLIIVLLTFL 67 (72)
T ss_pred cchHHHHHHHHHHHHHHHHHHHH
Confidence 46778888888887666665543
No 105
>PF02038 ATP1G1_PLM_MAT8: ATP1G1/PLM/MAT8 family; InterPro: IPR000272 The FXYD protein family contains at least seven members in mammals []. Two other family members that are not obvious orthologs of any identified mammalian FXYD protein exist in zebrafish. All these proteins share a signature sequence of six conserved amino acids comprising the FXYD motif in the NH2-terminus, and two glycines and one serine residue in the transmembrane domain. FXYD proteins are widely distributed in mammalian tissues with prominent expression in tissues that perform fluid and solute transport or that are electrically excitable. Initial functional characterisation suggested that FXYD proteins act as channels or as modulators of ion channels however studies have revealed that most FXYD proteins have another specific function and act as tissue-specific regulatory subunits of the Na,K-ATPase. Each of these auxiliary subunits produces a distinct functional effect on the transport characteristics of the Na,K-ATPase that is adjusted to the specific functional demands of the tissue in which the FXYD protein is expressed. FXYD proteins appear to preferentially associate with Na,K-ATPase alpha1-beta isozymes, and affect their function in a way that render them operationally complementary or supplementary to coexisting isozymes.; GO: 0005216 ion channel activity, 0006811 ion transport, 0016020 membrane; PDB: 2JO1_A 2JP3_A 2ZXE_G 3A3Y_G 3N23_E 3B8E_H 3KDP_G 3N2F_E.
Probab=54.88 E-value=12 Score=23.38 Aligned_cols=16 Identities=25% Similarity=0.517 Sum_probs=8.4
Q ss_pred HHHHHHHHHHHHHHHH
Q psy17635 13 IGMLLAIAFLILVLIL 28 (103)
Q Consensus 13 Ig~~~ai~lLllilli 28 (103)
-|+++|.+|.++-++|
T Consensus 17 gGLi~A~vlfi~Gi~i 32 (50)
T PF02038_consen 17 GGLIFAGVLFILGILI 32 (50)
T ss_dssp HHHHHHHHHHHHHHHH
T ss_pred cchHHHHHHHHHHHHH
Confidence 4677775444444333
No 106
>PF07213 DAP10: DAP10 membrane protein; InterPro: IPR009861 This family consists of several mammalian DAP10 membrane proteins. In activated mouse natural killer (NK) cells, the NKG2D receptor associates with two intracellular adaptors, DAP10 and DAP12, which trigger phosphatidyl inositol 3 kinase (PI3K) and Syk family protein tyrosine kinases, respectively. It has been suggested that the DAP10-PI3K pathway is sufficient to initiate NKG2D-mediated killing of target cells [].
Probab=54.66 E-value=22 Score=24.06 Aligned_cols=27 Identities=22% Similarity=0.536 Sum_probs=19.6
Q ss_pred hhhHHHHHHHH-HHHHHHHHHHHhheee
Q psy17635 8 TAGWFIGMLLA-IAFLILVLILVCLIKR 34 (103)
Q Consensus 8 t~gWfIg~~~a-i~lLllilli~C~i~R 34 (103)
+.+-+.|+++| +++-|||++.+..+-|
T Consensus 32 s~g~LaGiV~~D~vlTLLIv~~vy~car 59 (79)
T PF07213_consen 32 SPGLLAGIVAADAVLTLLIVLVVYYCAR 59 (79)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHhhcc
Confidence 34667888888 7777788777766654
No 107
>PF07423 DUF1510: Protein of unknown function (DUF1510); InterPro: IPR009988 This family consists of several hypothetical bacterial proteins of around 200 residues in length. The function of this family is unknown.
Probab=54.05 E-value=16 Score=28.69 Aligned_cols=23 Identities=9% Similarity=0.144 Sum_probs=10.5
Q ss_pred hHHHHHHHHHHHHHHHHHHHhhe
Q psy17635 10 GWFIGMLLAIAFLILVLILVCLI 32 (103)
Q Consensus 10 gWfIg~~~ai~lLllilli~C~i 32 (103)
.||=.+|.+++|||+|++...|+
T Consensus 14 ~iLNiaI~IV~lLIiiva~~lf~ 36 (217)
T PF07423_consen 14 KILNIAIGIVSLLIIIVAYQLFF 36 (217)
T ss_pred hhHHHHHHHHHHHHHHHhhhhee
Confidence 34444444444444444444554
No 108
>PF06936 Selenoprotein_S: Selenoprotein S (SelS); InterPro: IPR009703 This family consists of several mammalian selenoprotein S (SelS) sequences. SelS is a plasma membrane protein and is present in a variety of tissues and cell types. These proteins are involved in the degradation process of misfolded endoplasmic reticulum (ER) luminal proteins which participate in the transfer of misfolded proteins from the ER to the cytosol, where they are destroyed by the proteasome in a ubiquitin-dependent manner []. They probably serve as a linker between DER1, which mediates the retro-translocation of misfolded proteins into the cytosol, and the ATPase complex VCP, which mediates the translocation and ubiquitination.; GO: 0008430 selenium binding, 0006886 intracellular protein transport, 0030176 integral to endoplasmic reticulum membrane; PDB: 2Q2F_A.
Probab=53.88 E-value=4.3 Score=31.18 Aligned_cols=21 Identities=19% Similarity=0.506 Sum_probs=0.0
Q ss_pred hhhhhHHHHHHHHHHHHHHHH
Q psy17635 6 VATAGWFIGMLLAIAFLILVL 26 (103)
Q Consensus 6 ~at~gWfIg~~~ai~lLllil 26 (103)
+++-||+|.++|+++.+|+--
T Consensus 33 L~~yGWyil~~~I~ly~l~qk 53 (190)
T PF06936_consen 33 LSSYGWYILFGCILLYLLWQK 53 (190)
T ss_dssp ---------------------
T ss_pred HHHhCHHHHHHHHHHHHHHHH
Confidence 677899998777655554443
No 109
>PRK00159 putative septation inhibitor protein; Reviewed
Probab=53.71 E-value=17 Score=24.96 Aligned_cols=32 Identities=16% Similarity=0.495 Sum_probs=23.8
Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHhheeecCCCccc
Q psy17635 7 ATAGWFIGMLLAIAFLILVLILVCLIKRNRGGKYA 41 (103)
Q Consensus 7 at~gWfIg~~~ai~lLllilli~C~i~R~rGgKY~ 41 (103)
.+-.||..+|+++.++=|+=+++..+- |+.++
T Consensus 28 ~sp~W~~~~m~glm~~GllWlvvyYl~---~~~~P 59 (87)
T PRK00159 28 PSSVWYVVLMLGLMLIGLAWLVVNYLA---GPAIP 59 (87)
T ss_pred CCCccHHHHHHHHHHHHHHHHHHHhhc---cCCCC
Confidence 356899999999888878777776662 45555
No 110
>COG3671 Predicted membrane protein [Function unknown]
Probab=53.21 E-value=16 Score=26.73 Aligned_cols=37 Identities=41% Similarity=0.565 Sum_probs=22.0
Q ss_pred hhHHHHHHHH---HHHHHHHHHHHhheee-cCCCcccchhh
Q psy17635 9 AGWFIGMLLA---IAFLILVLILVCLIKR-NRGGKYAVHER 45 (103)
Q Consensus 9 ~gWfIg~~~a---i~lLllilli~C~i~R-~rGgKY~VkeK 45 (103)
.+|+|+++.+ |.+++++++-++++-| -.|=||.++.|
T Consensus 75 l~~iIg~Llt~lgiGv~i~~AlgvW~i~Riv~G~~yl~~g~ 115 (125)
T COG3671 75 LWWIIGLLLTFLGIGVVILVALGVWYIYRIVIGFKYLNEGK 115 (125)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC
Confidence 4677887765 4455566666666655 44555555443
No 111
>KOG4331|consensus
Probab=53.05 E-value=12 Score=34.83 Aligned_cols=16 Identities=25% Similarity=0.441 Sum_probs=9.2
Q ss_pred hheeecCCCcccchhh
Q psy17635 30 CLIKRNRGGKYAVHER 45 (103)
Q Consensus 30 C~i~R~rGgKY~VkeK 45 (103)
|+|.|++||+-.=..|
T Consensus 130 c~C~~rc~~r~~~~~~ 145 (865)
T KOG4331|consen 130 CRCCRRCGGRIKSALK 145 (865)
T ss_pred eeeHhhhcCCCCchhc
Confidence 3345777777654444
No 112
>TIGR01433 CyoA cytochrome o ubiquinol oxidase subunit II. This enzyme catalyzes the oxidation of ubiquinol with the concomitant reduction of molecular oxygen to water. This acts as the terminal electron acceptor in the respiratory chain. Subunit II is responsible for binding and oxidation of the ubiquinone substrate. This sequence is closely related to QoxA, which oxidizes quinol in gram positive bacteria but which is in complex with subunits which utilize cytochromes a in the reduction of molecular oxygen. Slightly more distantly related is subunit II of cytochrome c oxidase which uses cyt. c as the oxidant.
Probab=53.04 E-value=17 Score=28.17 Aligned_cols=9 Identities=22% Similarity=0.279 Sum_probs=4.1
Q ss_pred HHHHHhhee
Q psy17635 25 VLILVCLIK 33 (103)
Q Consensus 25 illi~C~i~ 33 (103)
++++.+++|
T Consensus 48 ~~~~~~~~r 56 (226)
T TIGR01433 48 LMTLFFAWK 56 (226)
T ss_pred HHHheeeEE
Confidence 333445554
No 113
>COG3296 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=52.95 E-value=15 Score=27.39 Aligned_cols=30 Identities=40% Similarity=0.657 Sum_probs=17.7
Q ss_pred HHHHHHHHHHHH--HHHHHHhheeecCCCccc
Q psy17635 12 FIGMLLAIAFLI--LVLILVCLIKRNRGGKYA 41 (103)
Q Consensus 12 fIg~~~ai~lLl--lilli~C~i~R~rGgKY~ 41 (103)
||+++...+|.+ |++-|+.++|-+.|.-|.
T Consensus 103 li~il~~~alsi~~lv~ti~a~~Ka~eGq~Yr 134 (143)
T COG3296 103 LIGILSIFALSILSLVLTIIAAIKAYEGQEYR 134 (143)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhCCceee
Confidence 444444333333 455556778888888773
No 114
>PRK14750 kdpF potassium-transporting ATPase subunit F; Provisional
Probab=52.76 E-value=23 Score=19.84 Aligned_cols=11 Identities=0% Similarity=0.283 Sum_probs=5.5
Q ss_pred HHHHHHHHHHH
Q psy17635 13 IGMLLAIAFLI 23 (103)
Q Consensus 13 Ig~~~ai~lLl 23 (103)
|++|+.++|.+
T Consensus 3 ~~vi~g~llv~ 13 (29)
T PRK14750 3 FSIVCGALLVL 13 (29)
T ss_pred HHHHHHHHHHH
Confidence 45666644433
No 115
>PTZ00208 65 kDa invariant surface glycoprotein; Provisional
Probab=52.43 E-value=16 Score=31.69 Aligned_cols=15 Identities=27% Similarity=0.470 Sum_probs=11.3
Q ss_pred HHhheeecCCCcccc
Q psy17635 28 LVCLIKRNRGGKYAV 42 (103)
Q Consensus 28 i~C~i~R~rGgKY~V 42 (103)
.+.|++|+||.--.|
T Consensus 406 ~~~~v~rrr~~~~dv 420 (436)
T PTZ00208 406 FFIMVKRRRNSSEDV 420 (436)
T ss_pred hheeeeeccCCchhc
Confidence 456779999887666
No 116
>PF15176 LRR19-TM: Leucine-rich repeat family 19 TM domain
Probab=52.34 E-value=13 Score=26.30 Aligned_cols=29 Identities=14% Similarity=0.183 Sum_probs=18.1
Q ss_pred HHHHHHHHHHHHHHHHHH-hheeecCCCcc
Q psy17635 12 FIGMLLAIAFLILVLILV-CLIKRNRGGKY 40 (103)
Q Consensus 12 fIg~~~ai~lLllilli~-C~i~R~rGgKY 40 (103)
.=.++.||++-|||+|.+ |=+.|+==-.|
T Consensus 21 VGVv~~al~~SlLIalaaKC~~~~k~~~SY 50 (102)
T PF15176_consen 21 VGVVVTALVTSLLIALAAKCPVWYKYLASY 50 (102)
T ss_pred HHHHHHHHHHHHHHHHHHHhHHHHHHHhcc
Confidence 345778888777776665 65655433444
No 117
>PF13584 BatD: Oxygen tolerance
Probab=51.84 E-value=17 Score=30.15 Aligned_cols=23 Identities=35% Similarity=0.657 Sum_probs=9.2
Q ss_pred hHHHHHHHHHHHHHHHHHHHhhee
Q psy17635 10 GWFIGMLLAIAFLILVLILVCLIK 33 (103)
Q Consensus 10 gWfIg~~~ai~lLllilli~C~i~ 33 (103)
.||+.++ ++.++++++++++..+
T Consensus 426 ~~~~~l~-~~~~l~~~~~~~~~~~ 448 (484)
T PF13584_consen 426 WWFWLLL-LLPLLLLLLLLILRRK 448 (484)
T ss_pred HHHHHHH-HHHHHHHHHHHHHHHH
Confidence 3444443 3333333333444443
No 118
>PF15234 LAT: Linker for activation of T-cells
Probab=51.71 E-value=57 Score=25.85 Aligned_cols=16 Identities=25% Similarity=0.478 Sum_probs=11.4
Q ss_pred HHhheeecCCCcccch
Q psy17635 28 LVCLIKRNRGGKYAVH 43 (103)
Q Consensus 28 i~C~i~R~rGgKY~Vk 43 (103)
.+|+.+|.=-|.|---
T Consensus 25 ALCvrCReLpgSydsa 40 (230)
T PF15234_consen 25 ALCVRCRELPGSYDSA 40 (230)
T ss_pred HHHHHHhhCCCccccC
Confidence 4577778888888644
No 119
>PHA03097 C-type lectin-like protein; Provisional
Probab=51.54 E-value=18 Score=26.33 Aligned_cols=18 Identities=11% Similarity=0.288 Sum_probs=11.5
Q ss_pred HHHHHHHHHHHHHHHHHH
Q psy17635 12 FIGMLLAIAFLILVLILV 29 (103)
Q Consensus 12 fIg~~~ai~lLllilli~ 29 (103)
.|+++|+...+.||++++
T Consensus 13 ~~~~~~~~~~~~~~~~~~ 30 (157)
T PHA03097 13 WILVPCGSIIIALIALVI 30 (157)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 377888866666555554
No 120
>TIGR03063 srtB_target sortase B cell surface sorting signal. Two different classes of sorting signal, both analogous to the sortase A signal LPXTG, may be recognized by the sortase SrtB. These are given as NXZTN and NPKXZ. Proteins sorted by this class of sortase are less common than the sortase A and LPXTG system. This model describes a number of cell surface protein C-terminal regions from Gram-positive bacteria that appear to be sortase B (SrtB) sorting signals.
Probab=51.53 E-value=26 Score=19.55 Aligned_cols=17 Identities=18% Similarity=0.421 Sum_probs=8.6
Q ss_pred CCcchhhh-hHHHHHHHH
Q psy17635 2 PSDTVATA-GWFIGMLLA 18 (103)
Q Consensus 2 ~~~~~at~-gWfIg~~~a 18 (103)
|+++-.++ +|++.++.+
T Consensus 2 PkT~D~a~i~ly~~l~~~ 19 (29)
T TIGR03063 2 PKTGDSAQIGLYAVLFLG 19 (29)
T ss_pred CCCccchhHHHHHHHHHH
Confidence 55555554 454444433
No 121
>PF06781 UPF0233: Uncharacterised protein family (UPF0233); InterPro: IPR009619 This is a group of proteins of unknown function.
Probab=51.49 E-value=19 Score=24.61 Aligned_cols=30 Identities=23% Similarity=0.526 Sum_probs=23.8
Q ss_pred hhhhhHHHHHHHHHHHHHHHHHHHhheeec
Q psy17635 6 VATAGWFIGMLLAIAFLILVLILVCLIKRN 35 (103)
Q Consensus 6 ~at~gWfIg~~~ai~lLllilli~C~i~R~ 35 (103)
-.+-.||..+|+++.++=|+=+++..+..+
T Consensus 27 ~~sp~W~~p~m~~lmllGL~WiVvyYi~~~ 56 (87)
T PF06781_consen 27 KPSPRWYAPLMLGLMLLGLLWIVVYYISGG 56 (87)
T ss_pred CCCCccHHHHHHHHHHHHHHHHhhhhcccC
Confidence 456789999999988888887777776544
No 122
>PF04906 Tweety: Tweety; InterPro: IPR006990 None of the members of the tweety (tty) family have been functionally characterised. However, they are considered to be transmembrane proteins with five potential membrane-spanning regions. A number of potential functions have been suggested on the basis of homology to the yeast FTR1 and FTH1 iron transporter proteins and the mammalian neurotensin receptors 1 and 2 in that they have a similar hydrophobicity profiles although there is no detectable sequence homology to the tweety-related proteins. It has been proposed that the tweety-related proteins could be involved in transport of iron or other divalent cations or alternatively that they may be membrane-bound receptors [].
Probab=51.20 E-value=16 Score=30.54 Aligned_cols=7 Identities=43% Similarity=0.956 Sum_probs=3.2
Q ss_pred hheeecC
Q psy17635 30 CLIKRNR 36 (103)
Q Consensus 30 C~i~R~r 36 (103)
||+||.+
T Consensus 45 CC~r~~~ 51 (406)
T PF04906_consen 45 CCCRRPR 51 (406)
T ss_pred hhCCCCC
Confidence 5554433
No 123
>PRK10525 cytochrome o ubiquinol oxidase subunit II; Provisional
Probab=50.96 E-value=17 Score=29.87 Aligned_cols=10 Identities=10% Similarity=-0.071 Sum_probs=3.8
Q ss_pred hhHHHHHHHH
Q psy17635 9 AGWFIGMLLA 18 (103)
Q Consensus 9 ~gWfIg~~~a 18 (103)
+.++|.++.+
T Consensus 40 ~~~Li~~~~~ 49 (315)
T PRK10525 40 QRSLILTAFG 49 (315)
T ss_pred HHHHHHHHHH
Confidence 3343433333
No 124
>PF06396 AGTRAP: Angiotensin II, type I receptor-associated protein (AGTRAP); InterPro: IPR009436 This family consists of several angiotensin II, type I receptor-associated protein (AGTRAP) sequences. AGTRAP is known to interact specifically with the C-terminal cytoplasmic region of the angiotensin II type 1 (AT(1)) receptor to regulate different aspects of AT(1) receptor physiology. The function of this family is unclear.
Probab=50.94 E-value=71 Score=24.09 Aligned_cols=37 Identities=24% Similarity=0.367 Sum_probs=21.1
Q ss_pred hhhhHHHHHHHHHHHHHH-HHHHHhheeecCCCcccch
Q psy17635 7 ATAGWFIGMLLAIAFLIL-VLILVCLIKRNRGGKYAVH 43 (103)
Q Consensus 7 at~gWfIg~~~ai~lLll-illi~C~i~R~rGgKY~Vk 43 (103)
.+..|-.++.+.-++|-- .++.+.-..|.|||.|.|+
T Consensus 85 ~~~~Fs~~maIinLllKP~s~~~l~r~~reRGG~l~~~ 122 (162)
T PF06396_consen 85 DTDRFSAGMAIINLLLKPISCFFLYRMYRERGGELSVN 122 (162)
T ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHHHHhcCCceecc
Confidence 344555555444333332 2334444568999999876
No 125
>PF13273 DUF4064: Protein of unknown function (DUF4064)
Probab=50.74 E-value=25 Score=23.23 Aligned_cols=28 Identities=36% Similarity=0.477 Sum_probs=14.2
Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHhheeec
Q psy17635 7 ATAGWFIGMLLAIAFLILVLILVCLIKRN 35 (103)
Q Consensus 7 at~gWfIg~~~ai~lLllilli~C~i~R~ 35 (103)
.+..|+.+++.. +.+++.++..+++|+|
T Consensus 55 ~~~~~~~~i~~i-i~~il~iia~i~ikk~ 82 (100)
T PF13273_consen 55 STFGIILGIIAI-ISSILGIIASILIKKN 82 (100)
T ss_pred HHHHHHHHHHHH-HHHHHHHHHHHHHcCC
Confidence 344555555543 3444444445667663
No 126
>PF14851 FAM176: FAM176 family
Probab=50.33 E-value=47 Score=24.81 Aligned_cols=24 Identities=21% Similarity=0.290 Sum_probs=12.5
Q ss_pred hHHHHHHHHHHHHHHHHHHHhhee
Q psy17635 10 GWFIGMLLAIAFLILVLILVCLIK 33 (103)
Q Consensus 10 gWfIg~~~ai~lLllilli~C~i~ 33 (103)
-.|+.=+|+=+||+|.+|++.+.+
T Consensus 24 LYFv~gVC~GLlLtLcllV~risc 47 (153)
T PF14851_consen 24 LYFVSGVCAGLLLTLCLLVIRISC 47 (153)
T ss_pred HHHHHHHHHHHHHHHHHHHhhhee
Confidence 456666666444445444443333
No 127
>KOG4482|consensus
Probab=50.29 E-value=17 Score=31.46 Aligned_cols=27 Identities=22% Similarity=0.517 Sum_probs=14.3
Q ss_pred HHHHHHHHHHHHHHHHHHHhheeecCCCc
Q psy17635 11 WFIGMLLAIAFLILVLILVCLIKRNRGGK 39 (103)
Q Consensus 11 WfIg~~~ai~lLllilli~C~i~R~rGgK 39 (103)
..|-+++|++|.+++.+|+|| |+.|-|
T Consensus 301 faIpl~Valll~~~La~imc~--rrEg~~ 327 (449)
T KOG4482|consen 301 FAIPLGVALLLVLALAYIMCC--RREGQK 327 (449)
T ss_pred HHHHHHHHHHHHHHHHHHHhh--hhhccc
Confidence 334555555555555556664 555544
No 128
>PF14316 DUF4381: Domain of unknown function (DUF4381)
Probab=50.18 E-value=18 Score=25.72 Aligned_cols=11 Identities=27% Similarity=0.851 Sum_probs=5.3
Q ss_pred hhhHHHHHHHH
Q psy17635 8 TAGWFIGMLLA 18 (103)
Q Consensus 8 t~gWfIg~~~a 18 (103)
+-||-+.++.+
T Consensus 20 a~GWwll~~ll 30 (146)
T PF14316_consen 20 APGWWLLLALL 30 (146)
T ss_pred cHHHHHHHHHH
Confidence 44565544443
No 129
>PF07466 DUF1517: Protein of unknown function (DUF1517); InterPro: IPR010903 This family consists of several hypothetical glycine rich plant and bacterial proteins of around 300 residues in length. The function of this family is unknown.
Probab=50.10 E-value=10 Score=30.59 Aligned_cols=30 Identities=20% Similarity=0.403 Sum_probs=18.6
Q ss_pred hhhHHHHHHHHHHHHHHHHHHHhheeecCC
Q psy17635 8 TAGWFIGMLLAIAFLILVLILVCLIKRNRG 37 (103)
Q Consensus 8 t~gWfIg~~~ai~lLllilli~C~i~R~rG 37 (103)
.-++|++++.++++..++.+++-+++|.+.
T Consensus 60 ~Ggg~~gl~~iLIl~~Ia~~vv~~~r~~~~ 89 (289)
T PF07466_consen 60 FGGGFGGLFDILILFGIAFFVVRFFRRRRS 89 (289)
T ss_pred cCcccchHHHHHHHHHHHHHHHHHHHhhcc
Confidence 345567777776666666666666665443
No 130
>PF00558 Vpu: Vpu protein; InterPro: IPR008187 The Human immunodeficiency virus 1 (HIV-1) Vpu protein acts in the degradation of CD4 in the endoplasmic reticulum and in the enhancement of virion release from the plasma membrane of infected cells [].; GO: 0019076 release of virus from host; PDB: 2JPX_A 1PI8_A 2GOH_A 2GOF_A 1PI7_A 1PJE_A 1VPU_A 2K7Y_A.
Probab=50.00 E-value=16 Score=24.79 Aligned_cols=17 Identities=24% Similarity=0.481 Sum_probs=6.4
Q ss_pred HHHHHHHHHHHHHHHHH
Q psy17635 13 IGMLLAIAFLILVLILV 29 (103)
Q Consensus 13 Ig~~~ai~lLllilli~ 29 (103)
|..+.|++++++|.+++
T Consensus 6 i~~iialiv~~iiaIvv 22 (81)
T PF00558_consen 6 ILAIIALIVALIIAIVV 22 (81)
T ss_dssp --HHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 34444444444444443
No 131
>PF06341 DUF1056: Protein of unknown function (DUF1056); InterPro: IPR009406 This entry is represented by Bacteriophage bIL286, Orf42. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several putative head-tail joining bacteriophage proteins.
Probab=49.76 E-value=24 Score=23.01 Aligned_cols=26 Identities=23% Similarity=0.420 Sum_probs=15.8
Q ss_pred HHHHHHHHHHHHHHHHHhheeecCCC
Q psy17635 13 IGMLLAIAFLILVLILVCLIKRNRGG 38 (103)
Q Consensus 13 Ig~~~ai~lLllilli~C~i~R~rGg 38 (103)
+|++..-+-|++..++.-++.++|||
T Consensus 37 ~g~i~i~I~l~l~G~isE~i~~~Kgg 62 (63)
T PF06341_consen 37 AGLISIGITLFLAGLISEFISKQKGG 62 (63)
T ss_pred HHHHHHHHHHHHHHHHHHHHhhccCC
Confidence 34444444455556666777778887
No 132
>PF13349 DUF4097: Domain of unknown function (DUF4097)
Probab=49.76 E-value=18 Score=25.01 Aligned_cols=18 Identities=33% Similarity=0.480 Sum_probs=10.4
Q ss_pred HHHHHHHHHHHHHHHHHH
Q psy17635 11 WFIGMLLAIAFLILVLIL 28 (103)
Q Consensus 11 WfIg~~~ai~lLllilli 28 (103)
|+|+++|+++.++++++.
T Consensus 4 ~~i~~~l~viG~il~~~g 21 (166)
T PF13349_consen 4 LIIGLILIVIGIILIGIG 21 (166)
T ss_pred ehHHHHHHHHHHHHHHHH
Confidence 566666665555554443
No 133
>PF15431 TMEM190: Transmembrane protein 190
Probab=49.74 E-value=17 Score=26.66 Aligned_cols=29 Identities=24% Similarity=0.390 Sum_probs=17.6
Q ss_pred hhhhHHHHHHHH-HHHHHHHHHHHhheeec
Q psy17635 7 ATAGWFIGMLLA-IAFLILVLILVCLIKRN 35 (103)
Q Consensus 7 at~gWfIg~~~a-i~lLllilli~C~i~R~ 35 (103)
...-|-++.-|+ +++|+..+++.+..+|+
T Consensus 58 rKHmWaL~wtC~gll~Li~~iclFWWAkRr 87 (134)
T PF15431_consen 58 RKHMWALGWTCGGLLLLICSICLFWWAKRR 87 (134)
T ss_pred HHHHHHHHHHHHhHHHHHHHHHHHHHHHHh
Confidence 456799999998 33333444555555554
No 134
>PF14610 DUF4448: Protein of unknown function (DUF4448)
Probab=49.65 E-value=9 Score=28.36 Aligned_cols=16 Identities=19% Similarity=0.341 Sum_probs=6.5
Q ss_pred HHHHHHHHHHHhheee
Q psy17635 19 IAFLILVLILVCLIKR 34 (103)
Q Consensus 19 i~lLllilli~C~i~R 34 (103)
+++++|++++.|+.+|
T Consensus 168 v~~~~~~~~~~~~~~R 183 (189)
T PF14610_consen 168 VVVLALIMYGFFFWNR 183 (189)
T ss_pred HHHHHHHHHhhheeec
Confidence 3333333444444443
No 135
>COG1622 CyoA Heme/copper-type cytochrome/quinol oxidases, subunit 2 [Energy production and conversion]
Probab=49.61 E-value=21 Score=28.26 Aligned_cols=33 Identities=21% Similarity=0.275 Sum_probs=15.4
Q ss_pred hhhHHHHH----HHHHHHHHHHHHHHhhee-ecCCCcc
Q psy17635 8 TAGWFIGM----LLAIAFLILVLILVCLIK-RNRGGKY 40 (103)
Q Consensus 8 t~gWfIg~----~~ai~lLllilli~C~i~-R~rGgKY 40 (103)
.+-|+|.+ |..|+++++.+++.+++| |.+.+.-
T Consensus 29 ~~~~~~~~~~~~~~~I~~~V~~~~~~~~~k~R~~~~~~ 66 (247)
T COG1622 29 EQRDLIILSTLLMLVIVLPVIVLLVYFAWKYRASNNAA 66 (247)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCC
Confidence 34455433 333445555555555554 4444443
No 136
>PF00957 Synaptobrevin: Synaptobrevin; InterPro: IPR001388 Synaptobrevin is an intrinsic membrane protein of small synaptic vesicles [], specialised secretory organelles of neurons that actively accumulate neurotransmitters and participate in their calcium-dependent release by exocytosis. Vesicle function is mediated by proteins in their membranes, although the precise nature of the protein-protein interactions underlying this are still uncertain []. Synaptobrevin may play a role in the molecular events underlying neurotransmitter release and vesicle recycling and may be involved in the regulation of membrane flow in the nerve terminal, a process mediated by interaction with low molecular weight GTP-binding proteins []. Synaptic vesicle-associated membrane proteins (VAMPs) from Torpedo californica (Pacific electric ray) and SNC1 from yeast are related to synaptobrevin.; GO: 0016192 vesicle-mediated transport, 0016021 integral to membrane; PDB: 3EGX_C 2NUP_C 3EGD_C 2NUT_C 1IOU_A 1H8M_A 3B5N_A 3ZYM_A 2NPS_A 1SFC_E ....
Probab=49.58 E-value=27 Score=22.57 Aligned_cols=21 Identities=24% Similarity=0.594 Sum_probs=11.2
Q ss_pred HHHHHHHHHHHHHHHHHHhhe
Q psy17635 12 FIGMLLAIAFLILVLILVCLI 32 (103)
Q Consensus 12 fIg~~~ai~lLllilli~C~i 32 (103)
-+.+++++++++++++|+..+
T Consensus 67 k~~~i~~~iv~~~~~~i~~~~ 87 (89)
T PF00957_consen 67 KLYIIIIIIVIIIILIIIIVI 87 (89)
T ss_dssp HHHHHHHHHHHHHHHHHHHTT
T ss_pred HHHHhHHhhhhhhhhHHHHHH
Confidence 344555555555566655544
No 137
>PRK12757 cell division protein FtsN; Provisional
Probab=49.36 E-value=11 Score=30.30 Aligned_cols=22 Identities=27% Similarity=0.397 Sum_probs=12.9
Q ss_pred HHHHHHHHHH--HHhheeecCCCc
Q psy17635 18 AIAFLILVLI--LVCLIKRNRGGK 39 (103)
Q Consensus 18 ai~lLllill--i~C~i~R~rGgK 39 (103)
|++++||+++ -++||..||.+-
T Consensus 3 alava~lv~f~gGLyfi~h~k~~~ 26 (256)
T PRK12757 3 ALAVAVLVTFIGGLYFITHNKKEE 26 (256)
T ss_pred HHHHHHHHHHhhheEEEecCCCCC
Confidence 4444444444 478888888653
No 138
>PF06900 DUF1270: Protein of unknown function (DUF1270); InterPro: IPR009682 This entry is represented by Bacteriophage 92, Orf86. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several hypothetical Staphylococcus aureus and phage proteins of 53 residues in length. The function of this family is unknown.
Probab=48.77 E-value=31 Score=21.80 Aligned_cols=21 Identities=14% Similarity=0.265 Sum_probs=14.6
Q ss_pred hhHHHHHHHHHHHHHHHHHHH
Q psy17635 9 AGWFIGMLLAIAFLILVLILV 29 (103)
Q Consensus 9 ~gWfIg~~~ai~lLllilli~ 29 (103)
...+|++||.+++-+.+..++
T Consensus 6 kSylIA~l~ftvlsi~L~~~l 26 (53)
T PF06900_consen 6 KSYLIANLCFTVLSIALMPFL 26 (53)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 467899999877766554443
No 139
>PF03845 Spore_permease: Spore germination protein; InterPro: IPR004761 Amino acid permeases are integral membrane proteins involved in the transport of amino acids into the cell. A number of such proteins have been found to be evolutionary related [, , ]. These proteins seem to contain up to 12 transmembrane segments. The best conserved region in this family is located in the second transmembrane segment. Spore germination protein (amino acid permease) is involved in the response to the germinative mixture of L-asparagine, glucose, fructose and potassium ions (AFFK). These proteins could be amino acid transporters.; GO: 0009847 spore germination, 0016021 integral to membrane
Probab=48.71 E-value=20 Score=27.95 Aligned_cols=31 Identities=29% Similarity=0.487 Sum_probs=25.2
Q ss_pred hhHHHHHHHHHHHHHHHHHHHhheeecCCCc
Q psy17635 9 AGWFIGMLLAIAFLILVLILVCLIKRNRGGK 39 (103)
Q Consensus 9 ~gWfIg~~~ai~lLllilli~C~i~R~rGgK 39 (103)
.+|+..++..+..++++.++..+.+|.+|..
T Consensus 33 d~Wi~~ll~~~~~l~~~~l~~~l~~~~p~~~ 63 (320)
T PF03845_consen 33 DAWISVLLGGLIGLLLALLIYYLLKRFPGKT 63 (320)
T ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHCCCCC
Confidence 5899988888888888888888888876643
No 140
>PF06400 Alpha-2-MRAP_N: Alpha-2-macroglobulin RAP, N-terminal domain; InterPro: IPR009066 The alpha-2-macroglobulin receptor-associated protein (RAP) is a glycoprotein that binds to the alpha-2-macroglobulin receptor, as well as to other members of the low density lipoprotein receptor family (IPR002172 from INTERPRO). RAP acts to inhibit the binding of all know ligands for these receptors, and may prevent receptor aggregation and degradation in the endoplasmic reticulum, thereby acting as a molecular chaperone []. RAP may be under the regulatory control of calmodulin, since it is able to bind calmodulin and be phosphorylated by calmodulin-dependent kinase II (IPR002048 from INTERPRO). RAP is comprised of three domains. Both domains 1 and 3 are involved in binding to the alpha-2-macroglobulin receptor, while domain 1 is also involved in inhibiting the binding of activated alpha-2-macroglobulin (IPR001599 from INTERPRO). Structural studies have revealed the RAP domain 1 to be comprised of a partly opened bundle of three helices, the first one being shorter than the other two.; PDB: 1NRE_A 2P03_A 1OV2_A 1LRE_A 2FYL_A 1OP1_A 2P01_A.
Probab=48.67 E-value=5.8 Score=28.82 Aligned_cols=24 Identities=29% Similarity=0.530 Sum_probs=0.0
Q ss_pred HHHHHHHHhheeecCCCcccchhh
Q psy17635 22 LILVLILVCLIKRNRGGKYAVHER 45 (103)
Q Consensus 22 Lllilli~C~i~R~rGgKY~VkeK 45 (103)
+++++++.++.....++||+-.-.
T Consensus 3 ~~~~ll~~~~~~~~~~~KYSre~N 26 (120)
T PF06400_consen 3 LLLLLLLFPIAAVKAAGKYSREAN 26 (120)
T ss_dssp ------------------------
T ss_pred HHHHHHHHHhHHHHhccccCHhhC
Confidence 334444445555778999985444
No 141
>MTH00186 ATP8 ATP synthase F0 subunit 8; Provisional
Probab=48.53 E-value=38 Score=20.19 Aligned_cols=31 Identities=13% Similarity=0.339 Sum_probs=19.7
Q ss_pred CCCcchhhhhHHHHHHHHHHHHHHHHHHHhhee
Q psy17635 1 MPSDTVATAGWFIGMLLAIAFLILVLILVCLIK 33 (103)
Q Consensus 1 ~~~~~~at~gWfIg~~~ai~lLllilli~C~i~ 33 (103)
|||=+. ..|++..+..++.++++...+.+..
T Consensus 1 MPQm~P--~~W~~l~~~~~~~~~~~~~~~~~~~ 31 (52)
T MTH00186 1 MPQLSP--LFWALLFIMIWLLIFLAMSMTWWLQ 31 (52)
T ss_pred CCCCCc--hHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 555443 4799887777666666665666554
No 142
>PF03908 Sec20: Sec20; InterPro: IPR005606 Sec20 is a membrane glycoprotein associated with secretory pathway.
Probab=48.47 E-value=17 Score=24.02 Aligned_cols=18 Identities=28% Similarity=0.680 Sum_probs=9.6
Q ss_pred HHHHHHHHHHHHHhheee
Q psy17635 17 LAIAFLILVLILVCLIKR 34 (103)
Q Consensus 17 ~ai~lLllilli~C~i~R 34 (103)
...+|++|++++++++.|
T Consensus 73 i~~~~~~f~~~v~yI~~r 90 (92)
T PF03908_consen 73 IFFAFLFFLLVVLYILWR 90 (92)
T ss_pred HHHHHHHHHHHHHHHhhh
Confidence 334555555555555554
No 143
>PF00517 GP41: Retroviral envelope protein; InterPro: IPR000328 This entry represents envelope proteins from a variety of retroviruses. It includes the GP41 subunit of the envelope protein complex from Human immunodeficiency virus (HIV) and Simian-Human immunodeficiency virus (SIV), which mediate membrane fusion during viral entry []. It has a core composed of a six-helix bundle and is folded by its trimeric N- and C-terminal heptad-repeats (NHR and CHR) []. Derivatives of this protein prevent HIV-1 from entering cell lines and primary human CD4+ cells in vitro [], making it an attractive subject of gene therapy studies against HIV and related retroviruses. The entry also represents envelop proteins from Bovine immunodeficiency virus, Feline immunodeficiency virus and Equine infectious anemia virus (EIAV) [, ], as well as the Gp36 protein from Mouse mammary tumor virus (MMTV) and Human endogenous retrovirus (HERV).; GO: 0005198 structural molecule activity, 0019031 viral envelope; PDB: 2EZO_B 2EZQ_B 2EZR_A 2JNR_B 1F23_D 2EZP_A 1JEK_A 2Q7C_A 2Q5U_A 2Q3I_A ....
Probab=48.45 E-value=8.4 Score=29.38 Aligned_cols=25 Identities=28% Similarity=0.542 Sum_probs=3.0
Q ss_pred HHHHHHHHHHHHHHHHHHHhheeec
Q psy17635 11 WFIGMLLAIAFLILVLILVCLIKRN 35 (103)
Q Consensus 11 WfIg~~~ai~lLllilli~C~i~R~ 35 (103)
|+|.+...|++.++.+.|++++-|.
T Consensus 155 ~~Iki~i~iv~~iI~lri~~~i~~~ 179 (204)
T PF00517_consen 155 WYIKIFIMIVIGIIGLRILMCILRI 179 (204)
T ss_dssp HHHHH--------------------
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4566666666666667777777777
No 144
>PHA02692 hypothetical protein; Provisional
Probab=48.33 E-value=30 Score=23.03 Aligned_cols=8 Identities=25% Similarity=0.621 Sum_probs=3.3
Q ss_pred hHHHHHHH
Q psy17635 10 GWFIGMLL 17 (103)
Q Consensus 10 gWfIg~~~ 17 (103)
+|.+.++.
T Consensus 44 ~~~~~ii~ 51 (70)
T PHA02692 44 PWTTVFLI 51 (70)
T ss_pred chHHHHHH
Confidence 34444433
No 145
>PF10661 EssA: WXG100 protein secretion system (Wss), protein EssA; InterPro: IPR018920 The Wss (WXG100 protein secretion system) in Staphylococcus aureus seems to be encoded by a locus of eight ORFs, called ess (eSAT-6 secretion system) []. This locus encodes, amongst several other proteins, EssA, a protein predicted to possess one transmembrane domain. Due to its predicted membrane location and its absolute requirement for WXG100 protein secretion, it has been speculated that EssA could form a secretion apparatus in conjunction with YukC and YukAB. Proteins homologous to EssA, YukC, EsaA and YukD were absent from mycobacteria []. Members of this family are associated with type VII secretion of WXG100 family targets in the Firmicutes, but not in the Actinobacteria. This highly divergent protein family consists largely of a central region of highly polar low-complexity sequence containing occasional LF motifs in weak repeats about 17 residues in length, flanked by hydrophobic N- and C-terminal regions.
Probab=48.15 E-value=17 Score=26.62 Aligned_cols=11 Identities=9% Similarity=-0.115 Sum_probs=5.4
Q ss_pred hhhHHHHHHHH
Q psy17635 8 TAGWFIGMLLA 18 (103)
Q Consensus 8 t~gWfIg~~~a 18 (103)
+..|.|+++++
T Consensus 116 ~~~~~i~~~i~ 126 (145)
T PF10661_consen 116 PISPTILLSIG 126 (145)
T ss_pred chhHHHHHHHH
Confidence 34455555444
No 146
>COG2268 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=47.27 E-value=28 Score=30.94 Aligned_cols=19 Identities=5% Similarity=0.468 Sum_probs=12.1
Q ss_pred HHHHHHHHHHHHHHHHHHH
Q psy17635 11 WFIGMLLAIAFLILVLILV 29 (103)
Q Consensus 11 WfIg~~~ai~lLllilli~ 29 (103)
||++.++++++++++++++
T Consensus 11 ~~~~~ivvv~i~~ilv~if 29 (548)
T COG2268 11 LMLIAIVVVVILVILVLIF 29 (548)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 8887777766555554443
No 147
>PF12670 DUF3792: Protein of unknown function (DUF3792); InterPro: IPR023804 Members of this family of strongly hydrophobic putative transmembrane protein average about 125 amino acids in length and occur mostly, but not exclusively, in the Firmicutes. Members are quite diverse in sequence. Their function is unknown.
Probab=47.20 E-value=25 Score=24.25 Aligned_cols=28 Identities=21% Similarity=0.783 Sum_probs=21.0
Q ss_pred hhhhhHHHHHHHHHHHHHHHHHHHhhee
Q psy17635 6 VATAGWFIGMLLAIAFLILVLILVCLIK 33 (103)
Q Consensus 6 ~at~gWfIg~~~ai~lLllilli~C~i~ 33 (103)
..+.+|+.|++.++...++++++-.+..
T Consensus 62 ~~~kG~l~G~~~Gl~y~~il~lis~~~~ 89 (116)
T PF12670_consen 62 AGSKGWLHGLLVGLLYFLILLLISFLFG 89 (116)
T ss_pred HccchHHHHHHHHHHHHHHHHHHHHHHc
Confidence 4568999999999887777766655443
No 148
>PRK10299 PhoPQ regulatory protein; Provisional
Probab=47.11 E-value=25 Score=21.73 Aligned_cols=21 Identities=19% Similarity=0.420 Sum_probs=15.0
Q ss_pred hhHHHHHHHHHHHHHHHHHHH
Q psy17635 9 AGWFIGMLLAIAFLILVLILV 29 (103)
Q Consensus 9 ~gWfIg~~~ai~lLllilli~ 29 (103)
..|.|.++.+++-++|.++.+
T Consensus 4 ~rwiili~iv~~Cl~lyl~al 24 (47)
T PRK10299 4 FRWVVLVVVVLACLLLWAQVF 24 (47)
T ss_pred eeehHHHHHHHHHHHHHHHHH
Confidence 469988877777666666654
No 149
>PF03954 Lectin_N: Hepatic lectin, N-terminal domain; InterPro: IPR005640 Animal lectins display a wide variety of architectures. They are classified according to the carbohydrate-recognition domain (CRD) of which there are two main types, S-type and C-type. C-type lectins display a wide range of specificities. They require Ca2+ for their activity They are found predominantly but not exclusively in vertebrates. This entry presents N-terminal domain, which is found in C-type lectins.; GO: 0005529 sugar binding, 0016020 membrane
Probab=46.92 E-value=19 Score=26.70 Aligned_cols=23 Identities=26% Similarity=0.541 Sum_probs=13.9
Q ss_pred HHHHHHHHHHHHHHHHHHHhhee
Q psy17635 11 WFIGMLLAIAFLILVLILVCLIK 33 (103)
Q Consensus 11 WfIg~~~ai~lLllilli~C~i~ 33 (103)
|+--++.++.|-+|.+++||+|-
T Consensus 32 ~~~l~LlsLgl~~LLLV~IcVig 54 (138)
T PF03954_consen 32 GPRLLLLSLGLSLLLLVVICVIG 54 (138)
T ss_pred cchHHHHHHHHHHHHHHHHHhhc
Confidence 33445555555566666788874
No 150
>COG4218 MtrF Tetrahydromethanopterin S-methyltransferase, subunit F [Coenzyme metabolism]
Probab=46.86 E-value=30 Score=23.20 Aligned_cols=22 Identities=27% Similarity=0.448 Sum_probs=14.6
Q ss_pred hhhhhHHHHHHHHHHHHHHHHH
Q psy17635 6 VATAGWFIGMLLAIAFLILVLI 27 (103)
Q Consensus 6 ~at~gWfIg~~~ai~lLllill 27 (103)
....|..||++.|++|+.+.++
T Consensus 49 t~i~GlaiGfvfA~vLv~illl 70 (73)
T COG4218 49 TRIAGLAIGFVFAGVLVGILLL 70 (73)
T ss_pred hhhHHHHHHHHHHHHHHHHHHH
Confidence 3456777888888766655544
No 151
>PF15339 Afaf: Acrosome formation-associated factor
Probab=46.65 E-value=24 Score=27.57 Aligned_cols=20 Identities=25% Similarity=0.718 Sum_probs=10.5
Q ss_pred HHHHHHHHH----HHHHHHHHhhe
Q psy17635 13 IGMLLAIAF----LILVLILVCLI 32 (103)
Q Consensus 13 Ig~~~ai~l----Lllilli~C~i 32 (103)
+-+|+.|.| |+++||++|+.
T Consensus 130 lkLmLGIsLmTl~lfv~Ll~~c~a 153 (200)
T PF15339_consen 130 LKLMLGISLMTLFLFVILLAFCSA 153 (200)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 445555443 33446666765
No 152
>PF13908 Shisa: Wnt and FGF inhibitory regulator
Probab=46.54 E-value=14 Score=26.97 Aligned_cols=8 Identities=13% Similarity=0.737 Sum_probs=3.4
Q ss_pred HHHHHHHH
Q psy17635 12 FIGMLLAI 19 (103)
Q Consensus 12 fIg~~~ai 19 (103)
.++++++|
T Consensus 77 ~~~iivgv 84 (179)
T PF13908_consen 77 ITGIIVGV 84 (179)
T ss_pred eeeeeeeh
Confidence 33444444
No 153
>KOG3970|consensus
Probab=46.27 E-value=25 Score=28.71 Aligned_cols=34 Identities=21% Similarity=0.445 Sum_probs=22.0
Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHhheeecCCCcccch
Q psy17635 7 ATAGWFIGMLLAIAFLILVLILVCLIKRNRGGKYAVH 43 (103)
Q Consensus 7 at~gWfIg~~~ai~lLllilli~C~i~R~rGgKY~Vk 43 (103)
-..+||+.++..++|++||.|+-.+ .|+|.-+|.
T Consensus 250 ~~ra~fli~lgvLafi~~i~lM~rl---Gr~g~ds~D 283 (299)
T KOG3970|consen 250 KKRALFLIFLGVLAFITIIMLMKRL---GRSGEDSSD 283 (299)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHh---cccccccCC
Confidence 4567888888777777777665543 556654443
No 154
>PRK13415 flagella biosynthesis protein FliZ; Provisional
Probab=45.85 E-value=28 Score=27.56 Aligned_cols=28 Identities=36% Similarity=0.376 Sum_probs=14.0
Q ss_pred HHHHHHHHHHHHHHH-HHH-HhheeecCCC
Q psy17635 11 WFIGMLLAIAFLILV-LIL-VCLIKRNRGG 38 (103)
Q Consensus 11 WfIg~~~ai~lLlli-lli-~C~i~R~rGg 38 (103)
=++-++.+++++|++ ..+ -++.+|++++
T Consensus 67 ~l~qmi~aL~~VI~Liy~l~rwL~rR~~~~ 96 (219)
T PRK13415 67 DFVKLIGATLFVIFLIYALVKWLNKRNRLL 96 (219)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcccc
Confidence 355566554444322 222 2666776654
No 155
>PF13179 DUF4006: Family of unknown function (DUF4006)
Probab=45.74 E-value=26 Score=22.99 Aligned_cols=15 Identities=40% Similarity=0.833 Sum_probs=9.0
Q ss_pred HHHHHHHHHHHHHHH
Q psy17635 12 FIGMLLAIAFLILVL 26 (103)
Q Consensus 12 fIg~~~ai~lLllil 26 (103)
+.|+++|++|||-|+
T Consensus 13 i~G~LIAvvLLLsIl 27 (66)
T PF13179_consen 13 ITGMLIAVVLLLSIL 27 (66)
T ss_pred hHhHHHHHHHHHHHH
Confidence 356777766666443
No 156
>PF02397 Bac_transf: Bacterial sugar transferase; InterPro: IPR003362 This entry represents a conserved region from a number of different bacterial sugar transferases, involved in diverse biosynthesis pathways. Examples include galactosyl-P-P-undecaprenol synthetase (2.7.8.6 from EC), which transfers galatose-1-phosphate to the lipid precursor undecaprenol phosphate in the first steps of O-polysaccharide biosynthesis; UDP-galactose-lipid carrier transferase, which is involved in the biosynthesis of amylovoran; and galactosyl transferase CpsD, which is essential for assembly of the group B Streptococci (GBS) type III capsular polysaccharide.
Probab=45.66 E-value=22 Score=26.91 Aligned_cols=24 Identities=38% Similarity=0.476 Sum_probs=12.6
Q ss_pred HHHHHHHHHHHHHHHHHHHhheee
Q psy17635 11 WFIGMLLAIAFLILVLILVCLIKR 34 (103)
Q Consensus 11 WfIg~~~ai~lLllilli~C~i~R 34 (103)
++++++..|+++-+++++.++++-
T Consensus 6 i~~a~~~li~~~Pl~l~iai~i~l 29 (187)
T PF02397_consen 6 IVLALLLLILLSPLFLIIAILIKL 29 (187)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344555545555555555555555
No 157
>PLN00168 Cytochrome P450; Provisional
Probab=45.58 E-value=24 Score=29.26 Aligned_cols=26 Identities=38% Similarity=0.524 Sum_probs=15.9
Q ss_pred HHHHHHHHHHHHHHHHhheeecCCCc
Q psy17635 14 GMLLAIAFLILVLILVCLIKRNRGGK 39 (103)
Q Consensus 14 g~~~ai~lLllilli~C~i~R~rGgK 39 (103)
.+.|+.+||++++++.|...+.++++
T Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 31 (519)
T PLN00168 6 LLLLAALLLLPLLLLLLGKHGGRGGK 31 (519)
T ss_pred HHHHHHHHHHHHHHHhhhhhhccCCC
Confidence 34455666666666677666655664
No 158
>KOG2927|consensus
Probab=45.56 E-value=23 Score=30.07 Aligned_cols=31 Identities=23% Similarity=0.490 Sum_probs=16.9
Q ss_pred hhHHHHHHHHHHHHHHH-HHHHhheeecCCCc
Q psy17635 9 AGWFIGMLLAIAFLILV-LILVCLIKRNRGGK 39 (103)
Q Consensus 9 ~gWfIg~~~ai~lLlli-lli~C~i~R~rGgK 39 (103)
.|=||+++.++|++=+| ++|++++.+.++|-
T Consensus 225 ~~gfl~~IlvLaIvRlILF~I~~il~~g~~g~ 256 (372)
T KOG2927|consen 225 AGGFLAFILVLAIVRLILFGITWILTGGKHGF 256 (372)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHhCCCCce
Confidence 44467766665555444 34456666655553
No 159
>PHA03273 envelope glycoprotein C; Provisional
Probab=45.29 E-value=14 Score=32.41 Aligned_cols=24 Identities=25% Similarity=0.216 Sum_probs=12.9
Q ss_pred HHHHHHHHHHHH-HHHHHHhheeec
Q psy17635 12 FIGMLLAIAFLI-LVLILVCLIKRN 35 (103)
Q Consensus 12 fIg~~~ai~lLl-lilli~C~i~R~ 35 (103)
+||++|++++++ +|++.++|+.++
T Consensus 450 ivaV~~g~~a~g~~ilitalC~y~s 474 (486)
T PHA03273 450 IIAVTCGAAALALVVLITAVCFYCS 474 (486)
T ss_pred ehHHHHHHHHHHHHHheEEEEEEec
Confidence 368888866664 434433334333
No 160
>PRK10905 cell division protein DamX; Validated
Probab=45.26 E-value=11 Score=31.52 Aligned_cols=12 Identities=25% Similarity=0.852 Sum_probs=5.4
Q ss_pred HHHHHHHHHHHH
Q psy17635 18 AIAFLILVLILV 29 (103)
Q Consensus 18 ai~lLllilli~ 29 (103)
+|.+|+|+||||
T Consensus 3 GiGilVLlLLIi 14 (328)
T PRK10905 3 GVGILVLLLLII 14 (328)
T ss_pred chhHHHHHHHHH
Confidence 344444444444
No 161
>PHA03240 envelope glycoprotein M; Provisional
Probab=44.99 E-value=22 Score=28.64 Aligned_cols=16 Identities=25% Similarity=0.594 Sum_probs=7.2
Q ss_pred HHHHHHHHHHHHHHhh
Q psy17635 16 LLAIAFLILVLILVCL 31 (103)
Q Consensus 16 ~~ai~lLllilli~C~ 31 (103)
|+.|+++|+|++++||
T Consensus 216 IiilIIiIiIIIL~cf 231 (258)
T PHA03240 216 IFIAIIIIIVIILFFF 231 (258)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3333444444444555
No 162
>CHL00177 ccs1 c-type cytochrome biogenensis protein; Validated
Probab=44.92 E-value=18 Score=30.79 Aligned_cols=26 Identities=23% Similarity=0.677 Sum_probs=17.4
Q ss_pred hhhhhHHHHHHHHHHHHHHHHHHHhheeec
Q psy17635 6 VATAGWFIGMLLAIAFLILVLILVCLIKRN 35 (103)
Q Consensus 6 ~at~gWfIg~~~ai~lLllilli~C~i~R~ 35 (103)
..+..||+++| +|+.+-|++|.++|.
T Consensus 70 vY~s~WF~~ll----~ll~isL~~Cs~~R~ 95 (426)
T CHL00177 70 IYKTWWFLSLL----LLFGLSLLLCTLLQQ 95 (426)
T ss_pred cccCHHHHHHH----HHHHHHHhheeeccc
Confidence 45667998765 445556777888773
No 163
>PF03302 VSP: Giardia variant-specific surface protein; InterPro: IPR005127 During infection, the intestinal protozoan parasite Giardia lamblia virus undergoes continuous antigenic variation which is determined by diversification of the parasite's major surface antigen, named VSP (variant surface protein).
Probab=44.77 E-value=14 Score=30.83 Aligned_cols=12 Identities=17% Similarity=0.207 Sum_probs=9.5
Q ss_pred HHHHHhheeecC
Q psy17635 25 VLILVCLIKRNR 36 (103)
Q Consensus 25 illi~C~i~R~r 36 (103)
-+|..|||.|-|
T Consensus 385 GfLcWwf~crgk 396 (397)
T PF03302_consen 385 GFLCWWFICRGK 396 (397)
T ss_pred HHHhhheeeccc
Confidence 377789999976
No 164
>PF11359 gpUL132: Glycoprotein UL132; InterPro: IPR021023 Glycoprotein UL132 is a low-abundance structural component of Human herpesvirus 5 []. The function of this protein is not fully understood.
Probab=44.60 E-value=74 Score=25.53 Aligned_cols=13 Identities=23% Similarity=0.299 Sum_probs=6.9
Q ss_pred ecCCCcccchhhh
Q psy17635 34 RNRGGKYAVHERE 46 (103)
Q Consensus 34 R~rGgKY~VkeKE 46 (103)
|..|-||.-.+.|
T Consensus 80 ~~pg~~~~f~~de 92 (235)
T PF11359_consen 80 RRPGRLTRFDDDE 92 (235)
T ss_pred hCCCcccccChhh
Confidence 5556655544444
No 165
>PF00974 Rhabdo_glycop: Rhabdovirus spike glycoprotein; InterPro: IPR001903 Different families of ssRNA negative-strand viruses contain glycoproteins responsible for forming spikes on the surface of the virion. The glycoprotein spike is made up of a trimer of glycoproteins. These proteins are frequently abbreviated to G protein. Channel formed by glycoprotein spike is thought to function in a similar manner to Influenza virus M2 protein channel, thus allowing a signal to pass across the viral membrane to signal for viral uncoating [, ].; GO: 0019031 viral envelope; PDB: 2CMZ_C 2J6J_A 3EGD_D.
Probab=44.23 E-value=7.5 Score=33.54 Aligned_cols=13 Identities=8% Similarity=0.069 Sum_probs=0.0
Q ss_pred HHHHHHHHHHhhe
Q psy17635 20 AFLILVLILVCLI 32 (103)
Q Consensus 20 ~lLllilli~C~i 32 (103)
++++|+.|+..|.
T Consensus 467 lli~l~~cc~~~~ 479 (501)
T PF00974_consen 467 LLILLIRCCCRCR 479 (501)
T ss_dssp -------------
T ss_pred HHHHHHHHhhhhc
Confidence 3334443333334
No 166
>KOG1311|consensus
Probab=44.19 E-value=25 Score=27.49 Aligned_cols=28 Identities=14% Similarity=0.250 Sum_probs=20.9
Q ss_pred chhhhhHHHHHHHHHHHHHHHHHHHhhe
Q psy17635 5 TVATAGWFIGMLLAIAFLILVLILVCLI 32 (103)
Q Consensus 5 ~~at~gWfIg~~~ai~lLllilli~C~i 32 (103)
+-.++.||+.++..++++.+..+.+|++
T Consensus 152 G~rNyr~F~~f~~~~~l~~i~~~~~~~~ 179 (299)
T KOG1311|consen 152 GERNYRYFVLFLFYLALGVLLALAFLFY 179 (299)
T ss_pred CCCchHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3568889998888777777777766554
No 167
>PF05083 LST1: LST-1 protein; InterPro: IPR007775 B144/LST1 is a gene encoded in the human major histocompatibility complex that produces multiple forms of alternatively spliced mRNA and encodes peptides fewer than 100 amino acids in length. B144/LST1 is strongly expressed in dendritic cells. Transfection of B144/LST1 into a variety of cells induces morphologic changes including the production of long, thin filopodia []. A possible role in modulating immune responses. Induces morphological changes including production of filopodia and microspikes when overexpressed in a variety of cell types and may be involved in dendritic cell maturation. Isoform 1 and isoform 2 have an inhibitory effect on lymphocyte proliferation [, ]. ; GO: 0000902 cell morphogenesis, 0006955 immune response, 0016020 membrane
Probab=43.74 E-value=6.8 Score=26.26 Aligned_cols=19 Identities=32% Similarity=0.637 Sum_probs=11.4
Q ss_pred HHHHHHHHHHHhheeecCC
Q psy17635 19 IAFLILVLILVCLIKRNRG 37 (103)
Q Consensus 19 i~lLllilli~C~i~R~rG 37 (103)
+.||.++++.+|+|.-+|+
T Consensus 4 llll~vvll~~clC~lsrR 22 (74)
T PF05083_consen 4 LLLLAVVLLSACLCRLSRR 22 (74)
T ss_pred hhhHHHHHHHHHHHHHHhh
Confidence 4556666677787654443
No 168
>MTH00102 ATP8 ATP synthase F0 subunit 8; Validated
Probab=43.74 E-value=42 Score=21.51 Aligned_cols=20 Identities=40% Similarity=0.697 Sum_probs=13.1
Q ss_pred CCCcchhhhhHHHHHHHHHHHH
Q psy17635 1 MPSDTVATAGWFIGMLLAIAFL 22 (103)
Q Consensus 1 ~~~~~~at~gWfIg~~~ai~lL 22 (103)
|||-+ ..-||..++++-+.+
T Consensus 1 MPQLn--P~PWf~il~~tWl~l 20 (67)
T MTH00102 1 MPQLD--TSTWFITILSMLLTL 20 (67)
T ss_pred CCCCC--CcHHHHHHHHHHHHH
Confidence 55543 447999998884433
No 169
>PF06842 DUF1242: Protein of unknown function (DUF1242); InterPro: IPR009653 This family consists of a number of eukaryotic proteins of around 72 residues in length. The function of this family is unknown.
Probab=43.72 E-value=10 Score=22.09 Aligned_cols=11 Identities=36% Similarity=0.948 Sum_probs=5.4
Q ss_pred HHHHHHHHhhe
Q psy17635 22 LILVLILVCLI 32 (103)
Q Consensus 22 Lllilli~C~i 32 (103)
|++|||++|-+
T Consensus 2 L~ViLL~ICTC 12 (36)
T PF06842_consen 2 LTVILLLICTC 12 (36)
T ss_pred hhhHHHHHHHh
Confidence 44455555543
No 170
>PF06809 NPDC1: Neural proliferation differentiation control-1 protein (NPDC1); InterPro: IPR009635 This family consists of several neural proliferation differentiation control-1 (NPDC1) proteins. NPDC1 plays a role in the control of neural cell proliferation and differentiation. It has been suggested that NPDC1 may be involved in the development of several secretion glands. This family also contains the C-terminal region of the Caenorhabditis elegans protein CAB-1 (Q93249 from SWISSPROT) which is known to interact with AEX-3 [].; GO: 0016021 integral to membrane
Probab=43.48 E-value=25 Score=29.55 Aligned_cols=28 Identities=14% Similarity=0.154 Sum_probs=20.2
Q ss_pred HHHHHHHHHHHHHHHHHHhheeecCCCc
Q psy17635 12 FIGMLLAIAFLILVLILVCLIKRNRGGK 39 (103)
Q Consensus 12 fIg~~~ai~lLllilli~C~i~R~rGgK 39 (103)
+|+++|+....-+|+..+|++|-+|.-+
T Consensus 202 ~Iv~~cvaG~aAliva~~cW~Rlqr~~r 229 (341)
T PF06809_consen 202 LIVVCCVAGAAALIVAGYCWYRLQREIR 229 (341)
T ss_pred hhHHHHHHHHHHHHHhhheEEEeccccc
Confidence 5666666555667777789998877766
No 171
>PF04971 Lysis_S: Lysis protein S ; InterPro: IPR007054 The lysis S protein is a cytotoxic protein forming holes in membranes causing cell lysis. The action of Lysis S is independent of the proportion of acidic phospholipids in the membrane [].
Probab=43.41 E-value=26 Score=23.14 Aligned_cols=23 Identities=26% Similarity=0.284 Sum_probs=17.2
Q ss_pred HHHHHHHHHHHHHHHHHHhheee
Q psy17635 12 FIGMLLAIAFLILVLILVCLIKR 34 (103)
Q Consensus 12 fIg~~~ai~lLllilli~C~i~R 34 (103)
.||++..|++.+|..|+=...|+
T Consensus 35 aIGvi~gi~~~~lt~ltN~YFK~ 57 (68)
T PF04971_consen 35 AIGVIGGIFFGLLTYLTNLYFKI 57 (68)
T ss_pred hHHHHHHHHHHHHHHHhHhhhhh
Confidence 58888888888888887555544
No 172
>TIGR03521 GldG gliding-associated putative ABC transporter substrate-binding component GldG. Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldG is a protein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae and Cytophaga hutchinsonii. Knockouts of GldG abolish the gliding phenotype. GldG, along with GldA and GldF are believed to compose an ABC transporter and are observed as an operon. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility.
Probab=43.16 E-value=23 Score=30.59 Aligned_cols=30 Identities=13% Similarity=0.238 Sum_probs=19.0
Q ss_pred hhhhHHHHHHHHH-HHHHHHHHHHhheeecC
Q psy17635 7 ATAGWFIGMLLAI-AFLILVLILVCLIKRNR 36 (103)
Q Consensus 7 at~gWfIg~~~ai-~lLllilli~C~i~R~r 36 (103)
....|-+..+..+ ++++++-+++|+.||+|
T Consensus 520 ~~~~~~~~~i~~pp~~~l~~G~~~~~~Rrr~ 550 (552)
T TIGR03521 520 DRTTWQLINIGLPILLLLLFGLSFTYIRKRK 550 (552)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 4456776666665 55555666677777665
No 173
>KOG3514|consensus
Probab=42.55 E-value=24 Score=34.44 Aligned_cols=41 Identities=37% Similarity=0.625 Sum_probs=30.2
Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHhheeecC-CCcccchh
Q psy17635 4 DTVATAGWFIGMLLAIAFLILVLILVCLIKRNR-GGKYAVHE 44 (103)
Q Consensus 4 ~~~at~gWfIg~~~ai~lLllilli~C~i~R~r-GgKY~Vke 44 (103)
++..+.|-..|++.|.+|-|||+|-.-.--||| -|+|.|.+
T Consensus 1510 esssttGmVvGIvaAaaLcILilL~am~kyRnrdEGSYhvd~ 1551 (1591)
T KOG3514|consen 1510 ESSSTTGMVVGIVAAAALCILILLYAMYKYRNRDEGSYHVDE 1551 (1591)
T ss_pred cCCCccchhhHHHHHHHHHHHHHHhhcccccccccCcccccc
Confidence 456788999999999888888876654333433 58998876
No 174
>PHA02844 putative transmembrane protein; Provisional
Probab=42.52 E-value=37 Score=22.84 Aligned_cols=25 Identities=16% Similarity=0.135 Sum_probs=14.4
Q ss_pred hhhhhHHHHHHHHHHHHHHHHHHHh
Q psy17635 6 VATAGWFIGMLLAIAFLILVLILVC 30 (103)
Q Consensus 6 ~at~gWfIg~~~ai~lLllilli~C 30 (103)
....+|.+.++..++.+++++++.-
T Consensus 43 ~~~~~~~~~ii~i~~v~~~~~~~fl 67 (75)
T PHA02844 43 VCSSSTKIWILTIIFVVFATFLTFL 67 (75)
T ss_pred cCChhHHHHHHHHHHHHHHHHHHHH
Confidence 4457888888664444444444433
No 175
>MTH00147 ATP8 ATP synthase F0 subunit 8; Provisional
Probab=42.35 E-value=58 Score=19.77 Aligned_cols=32 Identities=19% Similarity=0.434 Sum_probs=22.0
Q ss_pred CCCcchhhhhHHHHHHHHHHHHHHHHHHHhheee
Q psy17635 1 MPSDTVATAGWFIGMLLAIAFLILVLILVCLIKR 34 (103)
Q Consensus 1 ~~~~~~at~gWfIg~~~ai~lLllilli~C~i~R 34 (103)
|||- +---|++..+..++.++++..++.+...
T Consensus 1 MPQm--sPm~W~~l~~~~~~~~~~~~~~~~~~~~ 32 (51)
T MTH00147 1 MPQL--SPLNWIFLFILFWSAVIIVSVIIWWSTK 32 (51)
T ss_pred CCCC--cchHHHHHHHHHHHHHHHHHHHHHHhcC
Confidence 4553 3357999988887777777777766543
No 176
>COG3115 ZipA Cell division protein [Cell division and chromosome partitioning]
Probab=42.07 E-value=36 Score=28.45 Aligned_cols=23 Identities=26% Similarity=0.349 Sum_probs=11.5
Q ss_pred HHHHHHHHHHHHhheeecCCCccc
Q psy17635 18 AIAFLILVLILVCLIKRNRGGKYA 41 (103)
Q Consensus 18 ai~lLllilli~C~i~R~rGgKY~ 41 (103)
|||++.|++--+| ..|.++++|=
T Consensus 13 ~IAIiaLLvhGlW-tsRkE~s~~F 35 (324)
T COG3115 13 AIAIIALLVHGLW-TSRKERSSYF 35 (324)
T ss_pred HHHHHHHHHhhhh-hcchhhcccc
Confidence 3443333333334 5566777774
No 177
>PHA02819 hypothetical protein; Provisional
Probab=41.70 E-value=36 Score=22.68 Aligned_cols=22 Identities=14% Similarity=0.315 Sum_probs=13.9
Q ss_pred cchhhhhHHHHHHHHHHHHHHH
Q psy17635 4 DTVATAGWFIGMLLAIAFLILV 25 (103)
Q Consensus 4 ~~~at~gWfIg~~~ai~lLlli 25 (103)
.+.....|+|.++.++.+++++
T Consensus 42 ~~~~~~~~ii~l~~~~~~~~~~ 63 (71)
T PHA02819 42 KSFLRYYLIIGLVTIVFVIIFI 63 (71)
T ss_pred CChhHHHHHHHHHHHHHHHHHH
Confidence 3577888998866554444433
No 178
>TIGR02588 conserved hypothetical protein TIGR02588. The function of this protein is unknown. It is always found as part of a two-gene operon with TIGR02587, a protein that appears to span the membrane seven times. It is found in Nostoc sp. PCC 7120, Agrobacterium tumefaciens, Sinorhizobium meliloti, and Gloeobacter violaceus, so far, all of which are bacterial.
Probab=41.31 E-value=36 Score=24.72 Aligned_cols=33 Identities=12% Similarity=0.176 Sum_probs=21.3
Q ss_pred hhHHHHHHHHHHHHHHHHHHH--hheeecCCCccc
Q psy17635 9 AGWFIGMLLAIAFLILVLILV--CLIKRNRGGKYA 41 (103)
Q Consensus 9 ~gWfIg~~~ai~lLllilli~--C~i~R~rGgKY~ 41 (103)
.=|+-+.+++++||.++.|++ ++..+++--...
T Consensus 3 ~EWvt~~Is~~ill~viglv~y~~l~~~~~pp~l~ 37 (122)
T TIGR02588 3 AEWVTFGISTLILAAMFGLVAYDWLRYSNKAAVLE 37 (122)
T ss_pred HHhhhHHHHHHHHHHHHHHHHHHhhccCCCCCeEE
Confidence 459988888888888887775 333444333333
No 179
>PHA02692 hypothetical protein; Provisional
Probab=41.30 E-value=34 Score=22.75 Aligned_cols=15 Identities=20% Similarity=0.193 Sum_probs=10.0
Q ss_pred cchhhhhHHHH-HHHH
Q psy17635 4 DTVATAGWFIG-MLLA 18 (103)
Q Consensus 4 ~~~at~gWfIg-~~~a 18 (103)
.+.....|+|. ++.+
T Consensus 41 ~~~~~~~~ii~~~~~~ 56 (70)
T PHA02692 41 KGVPWTTVFLIGLIAA 56 (70)
T ss_pred CCcchHHHHHHHHHHH
Confidence 35677888888 4444
No 180
>PF14798 Ca_hom_mod: Calcium homeostasis modulator
Probab=40.99 E-value=34 Score=27.17 Aligned_cols=27 Identities=19% Similarity=0.469 Sum_probs=18.6
Q ss_pred hhhhhHHHHHHHHHHHHHHHHHHHhhe
Q psy17635 6 VATAGWFIGMLLAIAFLILVLILVCLI 32 (103)
Q Consensus 6 ~at~gWfIg~~~ai~lLllilli~C~i 32 (103)
....||++.++.+++.+|...+.-|+-
T Consensus 178 SQ~lGW~LI~~~~i~a~l~~c~~rC~S 204 (251)
T PF14798_consen 178 SQVLGWILIALVIILAFLVTCLRRCFS 204 (251)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhC
Confidence 345699888777777766666666654
No 181
>PF03918 CcmH: Cytochrome C biogenesis protein; InterPro: IPR005616 Members of this family include NrfF, CcmH, CycL, Ccl2.; PDB: 2KW0_A 2HL7_A.
Probab=40.83 E-value=9.5 Score=27.96 Aligned_cols=36 Identities=14% Similarity=0.153 Sum_probs=0.5
Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHhheeecCCCc
Q psy17635 4 DTVATAGWFIGMLLAIAFLILVLILVCLIKRNRGGK 39 (103)
Q Consensus 4 ~~~at~gWfIg~~~ai~lLllilli~C~i~R~rGgK 39 (103)
+++...+|+..++=++++++.+++++-.++|.+...
T Consensus 95 Pp~~~~~~~lW~~P~~~l~~g~~~~~~~~rr~~~~~ 130 (148)
T PF03918_consen 95 PPFKGFTWLLWLGPFLLLLLGGALLFRRLRRWRRRA 130 (148)
T ss_dssp --S---------------------------------
T ss_pred CCCCccHHHHHHHHHHHHHHHHHHHHHHHHhcccCC
Confidence 345566788888888777777777777777665544
No 182
>PRK03427 cell division protein ZipA; Provisional
Probab=40.65 E-value=42 Score=28.09 Aligned_cols=30 Identities=17% Similarity=0.255 Sum_probs=20.4
Q ss_pred HHHHHHHHHHHHHHHHHHHhheeecCCCcc
Q psy17635 11 WFIGMLLAIAFLILVLILVCLIKRNRGGKY 40 (103)
Q Consensus 11 WfIg~~~ai~lLllilli~C~i~R~rGgKY 40 (103)
-++.++.|||++-|++=-+|=+|+.|.--|
T Consensus 7 LiLivvGAIAIiAlL~HGlWtsRKers~~f 36 (333)
T PRK03427 7 LILIIVGAIAIIALLVHGFWTSRKERSSMF 36 (333)
T ss_pred hHHHHHHHHHHHHHHHHhhhhccccccchh
Confidence 345566677766666666788888876555
No 183
>TIGR01195 oadG_fam sodium pump decarboxylases, gamma subunit. Most sequences scoring between the noise and trusted cutoffs are eukaryotic sodium channel proteins.
Probab=40.62 E-value=50 Score=21.86 Aligned_cols=12 Identities=42% Similarity=0.780 Sum_probs=5.7
Q ss_pred HHHHHHHHHHHh
Q psy17635 19 IAFLILVLILVC 30 (103)
Q Consensus 19 i~lLllilli~C 30 (103)
++|+.|++||+|
T Consensus 17 ~VF~fL~lLi~~ 28 (82)
T TIGR01195 17 IVFLFLSLLIYA 28 (82)
T ss_pred HHHHHHHHHHHH
Confidence 444444544443
No 184
>PF02124 Marek_A: Marek's disease glycoprotein A; InterPro: IPR001038 Equid herpesvirus 1 (Equine herpesvirus 1, EHV-1) glycoprotein 13 (EHV-1 gp13) has the characteristic features of a membrane-spanning protein: an N-terminal signal sequence; a hydrophobic membrane anchor region; a charged C-terminal cytoplasmic tail; and an exterior domain with nine potential N-glycosylation sites []. EHV-1 gp13 is the structural homologue of the gC-like glycoproteins of the Human herpesvirus 1 (HHV-1) and Human herpesvirus 2 (HHV-2) (gC-1 and gC-2 respectively), Pseudorabies virus (strain Indiana-Funkhauser/Becker) (PRV) (gIII) and Human herpesvirus 3 (HHV-3) (gp66). Secretory glycoprotein GP57-65 precursor (glycoprotein A - GA) is similar to Herpesvirus glycoprotein C, and belongs to the immunoglobulin gene superfamily [, ]. GA is thought to play an immunoevasive role in the pathogenesis of Marek's disease. It is a candidate for causing the early-stage immunosuppression that occurs after MDHV infection.
Probab=40.14 E-value=26 Score=27.51 Aligned_cols=18 Identities=22% Similarity=0.467 Sum_probs=11.2
Q ss_pred HHHHHHHHHHHHHHHHHH
Q psy17635 12 FIGMLLAIAFLILVLILV 29 (103)
Q Consensus 12 fIg~~~ai~lLllilli~ 29 (103)
+|+++++++++.++++|.
T Consensus 188 i~~Vi~g~~~~g~~~~i~ 205 (211)
T PF02124_consen 188 IIGVICGAGALGLLVLIA 205 (211)
T ss_pred hHHHHHHHHHHHHHHhhE
Confidence 467777766666555543
No 185
>PF07782 DC_STAMP: DC-STAMP-like protein; InterPro: IPR012858 This group of sequences is similar to a region of the dendritic cell-specific transmembrane protein (DC-STAMP, Q9H295 from SWISSPROT). This is thought to be a novel receptor protein that shares no identity with other multimembrane-spanning proteins []. It is thought to have seven putative transmembrane regions [], two of which are found in the region featured in this family. DC-STAMP is also described as having potential N-linked glycosylation sites and a potential phosphorylation site for PKC [], but these are not conserved. ; GO: 0016021 integral to membrane
Probab=39.76 E-value=39 Score=25.05 Aligned_cols=25 Identities=28% Similarity=0.442 Sum_probs=16.0
Q ss_pred CCcchhhhhHHHHHHHHHHHHHHHH
Q psy17635 2 PSDTVATAGWFIGMLLAIAFLILVL 26 (103)
Q Consensus 2 ~~~~~at~gWfIg~~~ai~lLllil 26 (103)
|........|-|+++++++++++++
T Consensus 139 P~~p~~~~~~~i~~l~~l~~ll~~l 163 (191)
T PF07782_consen 139 PSPPDYSVYIQIGLLYLLLWLLVLL 163 (191)
T ss_pred CcCCChHHHHHHHHHHHHHHHHHHH
Confidence 3344566677788887766665553
No 186
>MTH00072 ATP8 ATP synthase F0 subunit 8; Provisional
Probab=39.58 E-value=51 Score=19.55 Aligned_cols=24 Identities=29% Similarity=0.641 Sum_probs=15.7
Q ss_pred CCCcchhhhhHHHHHHHHHHHHHHHH
Q psy17635 1 MPSDTVATAGWFIGMLLAIAFLILVL 26 (103)
Q Consensus 1 ~~~~~~at~gWfIg~~~ai~lLllil 26 (103)
|||- +..-||..+++..+.+++++
T Consensus 1 MPQL--~P~pW~~~~~~~W~~ll~~~ 24 (54)
T MTH00072 1 MPQL--NPGPWFAIFLMSWLIFLLIL 24 (54)
T ss_pred CCCC--CchHHHHHHHHHHHHHHHHH
Confidence 4553 34579999998866655444
No 187
>TIGR03503 conserved hypothetical protein TIGR03503. This set of conserved hypothetical protein has a phylogenetic range that closely matches that of TIGR03501, a putative C-terminal protein targeting signal.
Probab=39.49 E-value=14 Score=31.28 Aligned_cols=28 Identities=18% Similarity=0.258 Sum_probs=12.9
Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHhheee
Q psy17635 7 ATAGWFIGMLLAIAFLILVLILVCLIKR 34 (103)
Q Consensus 7 at~gWfIg~~~ai~lLllilli~C~i~R 34 (103)
....|+|.++.-+++|||.++.+.+.+|
T Consensus 344 ~~~~~~~ii~~N~v~lllg~~~~~~~rk 371 (374)
T TIGR03503 344 DSKFWIIIIVGNVVILLLGGIGFFVWRK 371 (374)
T ss_pred hhhhhHhhHhhhhhhhhhheeeEEEEEE
Confidence 3446644544445555554433333333
No 188
>PF00429 TLV_coat: ENV polyprotein (coat polyprotein); InterPro: IPR018154 Enveloped viruses such as Human immunodeficiency virus 1, influenza virus, and Ebola virus sp. express a surface glycoprotein that mediates both cell attachment and fusion of viral and cellular membranes. The ENV polyprotein (coat polyprotein) usually contains two coat proteins which differ depending on the source. The structure of a number of the ENV polyprotein domains have been determined: The crystal structure of an extraviral segment of the Moloney murine leukemia virus (MoMuLV) transmembrane (TM) subunit has been determined to 1.7-A resolution. This segment contains a trimeric coiled coil, with a hydrophobic cluster at its base and a strand that packs in an antiparallel orientation against the coiled coil. This structure serves as a model for a wide range of viral fusion proteins; key residues in this structure are conserved among C- and D-type retroviruses and the filovirus ebola []. An essential step in retrovirus infection is the binding of the virus to its receptor on a target cell. The structure of the receptor-binding domain of the envelope glycoprotein from Friend murine leukemia virus (F-MuLV) has been determined determined to 2.0-A resolution. The core of the domain is an antiparallel beta sandwich, with two interstrand loops forming a helical subdomain atop the sandwich. The residues in the helical region, but not in the beta sandwich, are highly variable among mammalian C-type retroviruses with distinct tropisms, indicating that the helical subdomain determines the receptor specificity of the virus []. ; PDB: 1LCS_B 1MOF_A 1XNL_A 2XZ3_A 1AOL_A 1Y4M_C.
Probab=38.79 E-value=10 Score=33.22 Aligned_cols=10 Identities=20% Similarity=0.528 Sum_probs=0.0
Q ss_pred hhhhHHHHHH
Q psy17635 7 ATAGWFIGML 16 (103)
Q Consensus 7 at~gWfIg~~ 16 (103)
..-.||..+|
T Consensus 523 ~~~~w~~~ll 532 (561)
T PF00429_consen 523 GWSPWLKPLL 532 (561)
T ss_dssp ----------
T ss_pred ccchHHHHHH
Confidence 3446887777
No 189
>PF11346 DUF3149: Protein of unknown function (DUF3149); InterPro: IPR021494 This bacterial family of proteins has no known function.
Probab=38.64 E-value=33 Score=20.49 Aligned_cols=17 Identities=18% Similarity=0.370 Sum_probs=9.0
Q ss_pred HHHHHHHHHHHHHHHHH
Q psy17635 13 IGMLLAIAFLILVLILV 29 (103)
Q Consensus 13 Ig~~~ai~lLllilli~ 29 (103)
+|+|+.++.+..+.+++
T Consensus 11 vGL~Sl~vI~~~igm~~ 27 (42)
T PF11346_consen 11 VGLMSLIVIVFTIGMGV 27 (42)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 56666655554444443
No 190
>PHA03269 envelope glycoprotein C; Provisional
Probab=38.47 E-value=24 Score=31.43 Aligned_cols=18 Identities=28% Similarity=0.567 Sum_probs=12.1
Q ss_pred HHHHHHHHHHHHHHHHHH
Q psy17635 12 FIGMLLAIAFLILVLILV 29 (103)
Q Consensus 12 fIg~~~ai~lLllilli~ 29 (103)
+||++|+++++.+++||.
T Consensus 541 IigV~~~~~algliilIt 558 (566)
T PHA03269 541 IIGIISIIGILGLIALIA 558 (566)
T ss_pred ehHHHHHHHHHHHHHhhe
Confidence 468888877766655553
No 191
>PF14927 Neurensin: Neurensin
Probab=38.46 E-value=49 Score=24.41 Aligned_cols=24 Identities=17% Similarity=0.544 Sum_probs=17.6
Q ss_pred hhhhhHHHHHHHHHHHHHHHHHHH
Q psy17635 6 VATAGWFIGMLLAIAFLILVLILV 29 (103)
Q Consensus 6 ~at~gWfIg~~~ai~lLllilli~ 29 (103)
.....|=++++++++||++-+..+
T Consensus 39 w~s~~wkV~~i~g~l~Ll~Gi~~l 62 (140)
T PF14927_consen 39 WSSVCWKVGFISGLLLLLLGIVAL 62 (140)
T ss_pred CcchhHHHHHHHHHHHHHHHHHHH
Confidence 456678899999988888654443
No 192
>PF14884 EFF-AFF: Type I membrane glycoproteins cell-cell fusogen
Probab=38.45 E-value=25 Score=31.52 Aligned_cols=24 Identities=25% Similarity=0.571 Sum_probs=15.3
Q ss_pred hHHHHHHHH--HHHHHHHHHHHhhee
Q psy17635 10 GWFIGMLLA--IAFLILVLILVCLIK 33 (103)
Q Consensus 10 gWfIg~~~a--i~lLllilli~C~i~ 33 (103)
+|+=+++++ ||+.+++++|+|++.
T Consensus 557 s~~e~v~~~~dIv~~i~~~~il~~l~ 582 (589)
T PF14884_consen 557 SWTEAVMMALDIVLYIGLLFILYLLI 582 (589)
T ss_pred cHHHHHHHHHHHHHHhhhhhhhhhhh
Confidence 566666776 666666666666653
No 193
>PRK06569 F0F1 ATP synthase subunit B'; Validated
Probab=38.14 E-value=49 Score=24.67 Aligned_cols=31 Identities=26% Similarity=0.346 Sum_probs=15.4
Q ss_pred CCCcchhhhhHHHHHHHHHHHHHHHHHHHhhe
Q psy17635 1 MPSDTVATAGWFIGMLLAIAFLILVLILVCLI 32 (103)
Q Consensus 1 ~~~~~~at~gWfIg~~~ai~lLllilli~C~i 32 (103)
|||-...+- |--.+..+|.|++|++++-+|+
T Consensus 1 mPQfd~~~~-~sqifw~iI~FlILy~ll~kf~ 31 (155)
T PRK06569 1 MPQFDIATY-YSQIFWLIVTFGLLYIFVYKFI 31 (155)
T ss_pred CCCCchhhh-hHHHHHHHHHHHHHHHHHHHHh
Confidence 677665532 2222334455666665555544
No 194
>PF04156 IncA: IncA protein; InterPro: IPR007285 Chlamydia trachomatis is an obligate intracellular bacterium that develops within a parasitophorous vacuole termed an inclusion. The inclusion is nonfusogenic with lysosomes but intercepts lipids from a host cell exocytic pathway. Initiation of chlamydial development is concurrent with modification of the inclusion membrane by a set of C. trachomatis-encoded proteins collectively designated Incs. One of these Incs, IncA (Inclusion membrane protein A), is functionally associated with the homotypic fusion of inclusions [].
Probab=38.14 E-value=40 Score=24.43 Aligned_cols=26 Identities=31% Similarity=0.529 Sum_probs=12.9
Q ss_pred HHHHHHHHHHHHHHHhheeecCCCcc
Q psy17635 15 MLLAIAFLILVLILVCLIKRNRGGKY 40 (103)
Q Consensus 15 ~~~ai~lLllilli~C~i~R~rGgKY 40 (103)
.++|+.++++++-+.||+++.+-..+
T Consensus 44 ~~lAlg~vL~~~g~~~~~~~~~~~~~ 69 (191)
T PF04156_consen 44 ALLALGVVLLSLGLLCLLSKRPVQSV 69 (191)
T ss_pred HHHHHHHHHHHHHHHHHHHccccccc
Confidence 34444455555555566655544443
No 195
>KOG1226|consensus
Probab=38.12 E-value=32 Score=31.90 Aligned_cols=37 Identities=22% Similarity=0.415 Sum_probs=21.0
Q ss_pred HHHHHHHHHHHHHHHHHHhhe--eecCCCcccchhhhhh
Q psy17635 12 FIGMLLAIAFLILVLILVCLI--KRNRGGKYAVHEREAA 48 (103)
Q Consensus 12 fIg~~~ai~lLllilli~C~i--~R~rGgKY~VkeKE~~ 48 (103)
.++++.+|+|+.|+||++|-+ ..+--.-|+==+||+.
T Consensus 717 ~lgvv~~ivligl~llliwkll~~~~DrrE~akFe~er~ 755 (783)
T KOG1226|consen 717 VLGVVAGIVLIGLALLLIWKLLTTIHDRREFAKFEKERL 755 (783)
T ss_pred hHHHHHHHHHHHHHHHHHHHHhheecccHHhhhhhHHHH
Confidence 456677787777777777532 3222334554455543
No 196
>PF10164 DUF2367: Uncharacterized conserved protein (DUF2367); InterPro: IPR019317 This is a highly conserved set of proteins which contains three pairs of cysteine residues within a length of 42 amino acids and is rich in proline residues towards the N terminus. It includes a membrane protein that has been found to be highly expressed in the mouse brain and consequently, several members have been assigned as brain protein i3 (Bri3). Their function is unknown.
Probab=37.94 E-value=40 Score=23.74 Aligned_cols=22 Identities=36% Similarity=0.682 Sum_probs=10.6
Q ss_pred HHHHHHHHHHHHHHHHHhheeec
Q psy17635 13 IGMLLAIAFLILVLILVCLIKRN 35 (103)
Q Consensus 13 Ig~~~ai~lLllilli~C~i~R~ 35 (103)
.|+++||.+.=|-+| .||..|.
T Consensus 67 ~Gi~~AI~fFPiGil-cc~~~r~ 88 (98)
T PF10164_consen 67 CGILCAIFFFPIGIL-CCLAMRE 88 (98)
T ss_pred HHHHHHHHHHhhHHH-HhhhcCc
Confidence 355556555444332 4555443
No 197
>PHA03054 IMV membrane protein; Provisional
Probab=37.94 E-value=46 Score=22.24 Aligned_cols=20 Identities=15% Similarity=0.149 Sum_probs=12.9
Q ss_pred chhhhhHHHHHHHHHHHHHH
Q psy17635 5 TVATAGWFIGMLLAIAFLIL 24 (103)
Q Consensus 5 ~~at~gWfIg~~~ai~lLll 24 (103)
+.....|+|.++.++.++++
T Consensus 45 ~~~~~~~ii~l~~v~~~~l~ 64 (72)
T PHA03054 45 CWGWYWLIIIFFIVLILLLL 64 (72)
T ss_pred CchHHHHHHHHHHHHHHHHH
Confidence 56788899886555444433
No 198
>PHA02650 hypothetical protein; Provisional
Probab=37.49 E-value=40 Score=23.01 Aligned_cols=24 Identities=17% Similarity=0.113 Sum_probs=13.2
Q ss_pred hhhhhHHHHHHHHHHHHHHHHHHH
Q psy17635 6 VATAGWFIGMLLAIAFLILVLILV 29 (103)
Q Consensus 6 ~at~gWfIg~~~ai~lLllilli~ 29 (103)
....+|.+.++..++++++++++.
T Consensus 44 ~~~~~~~~~ii~i~~v~i~~l~~f 67 (81)
T PHA02650 44 VSWFNGQNFIFLIFSLIIVALFSF 67 (81)
T ss_pred cCCchHHHHHHHHHHHHHHHHHHH
Confidence 345677777766444444444443
No 199
>PRK13453 F0F1 ATP synthase subunit B; Provisional
Probab=37.44 E-value=40 Score=24.64 Aligned_cols=22 Identities=18% Similarity=0.495 Sum_probs=13.2
Q ss_pred HHHHHHHHHHHHHHHHHHHhhe
Q psy17635 11 WFIGMLLAIAFLILVLILVCLI 32 (103)
Q Consensus 11 WfIg~~~ai~lLllilli~C~i 32 (103)
|...++-+|.|++|++++-.|+
T Consensus 18 ~~t~~~~iInFliL~~lL~~~l 39 (173)
T PRK13453 18 WGTVIVTVLTFIVLLALLKKFA 39 (173)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 6555556666666666655444
No 200
>PF03597 CcoS: Cytochrome oxidase maturation protein cbb3-type; InterPro: IPR004714 Cytochrome cbb3 oxidases are found almost exclusively in Proteobacteria, and represent a distinctive class of proton-pumping respiratory haem-copper oxidases (HCO) that lack many of the key structural features that contribute to the reaction cycle of the intensely studied mitochondrial cytochrome c oxidase (CcO). Expression of cytochrome cbb3 oxidase allows human pathogens to colonise anoxic tissues and agronomically important diazotrophs to sustain nitrogen fixation []. Genes encoding a cytochrome cbb3 oxidase were initially designated fixNOQP (ccoNOQP), the ccoNOQP operon is always found close to a second gene cluster, known as fixGHIS (ccoGHIS) whose expression is necessary for the assembly of a functional cbb3 oxidase. On the basis of their derived amino acid sequences each of the four proteins encoded by the ccoGHIS operon are thought to be membrane-bound. It has been suggested that they may function in concert as a multi-subunit complex, possibly playing a role in the uptake and metabolism of copper required for the assembly of the binuclear centre of cytochrome cbb3 oxidase.
Probab=37.41 E-value=32 Score=20.65 Aligned_cols=28 Identities=11% Similarity=0.465 Sum_probs=12.9
Q ss_pred HHHHHHHHHHHHHHHHHhheeecCCCcc
Q psy17635 13 IGMLLAIAFLILVLILVCLIKRNRGGKY 40 (103)
Q Consensus 13 Ig~~~ai~lLllilli~C~i~R~rGgKY 40 (103)
|.+|..+.+++.++.++.|+---|.|.|
T Consensus 3 l~~lip~sl~l~~~~l~~f~Wavk~GQf 30 (45)
T PF03597_consen 3 LYILIPVSLILGLIALAAFLWAVKSGQF 30 (45)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHccCCC
Confidence 3444444444444444444433445666
No 201
>PF03733 DUF307: Domain of unknown function (DUF307); InterPro: IPR005185 This proteins contain a domain which occurs as one or more copies in a small family of putative membrane proteins.
Probab=37.34 E-value=56 Score=20.05 Aligned_cols=20 Identities=15% Similarity=0.637 Sum_probs=10.6
Q ss_pred hhHHHHHHHHHHHHHHHHHH
Q psy17635 9 AGWFIGMLLAIAFLILVLIL 28 (103)
Q Consensus 9 ~gWfIg~~~ai~lLllilli 28 (103)
.||.+++++.++-+++.+.|
T Consensus 11 ~G~~lal~~~~~~~~~~iti 30 (53)
T PF03733_consen 11 FGWWLALIWLLAGILCCITI 30 (53)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 46666666665544444333
No 202
>PF09435 DUF2015: Fungal protein of unknown function (DUF2015); InterPro: IPR018559 This entry represents uncharacterised proteins found in fungi.
Probab=37.09 E-value=18 Score=26.49 Aligned_cols=26 Identities=19% Similarity=0.418 Sum_probs=16.2
Q ss_pred HHHHHHHHHHHHHHHhheeecCCCcc
Q psy17635 15 MLLAIAFLILVLILVCLIKRNRGGKY 40 (103)
Q Consensus 15 ~~~ai~lLllilli~C~i~R~rGgKY 40 (103)
++-.++++++|+.+++|..|+|=-.+
T Consensus 4 lly~~~~~~~i~~t~lf~~R~r~~~~ 29 (128)
T PF09435_consen 4 LLYSLTFFVLIIGTLLFFTRHRWLPL 29 (128)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 44455666666666777777765444
No 203
>PF04226 Transgly_assoc: Transglycosylase associated protein; InterPro: IPR007341 This bacterial protein is predicted to be an integral membrane protein. Some family members have been annotated as transglycosylase-associated proteins, but no experimental evidence is provided. This family was annotated based on the information in P76011 from SWISSPROT.; GO: 0016021 integral to membrane
Probab=37.04 E-value=65 Score=19.23 Aligned_cols=19 Identities=16% Similarity=0.370 Sum_probs=11.7
Q ss_pred hhhhHHHHHHHHHHHHHHH
Q psy17635 7 ATAGWFIGMLLAIAFLILV 25 (103)
Q Consensus 7 at~gWfIg~~~ai~lLlli 25 (103)
.-.+|+.+++.|+++|.+.
T Consensus 25 ~~~~~i~aviGAiill~i~ 43 (48)
T PF04226_consen 25 SWGSFIVAVIGAIILLFIY 43 (48)
T ss_pred hHHHHHHHHHHHHHHHHHH
Confidence 4456677777776665544
No 204
>KOG3054|consensus
Probab=37.04 E-value=45 Score=27.45 Aligned_cols=35 Identities=26% Similarity=0.421 Sum_probs=17.5
Q ss_pred HHHHHHHHHHHHHHHHHHhheeecCCCcccchhhh
Q psy17635 12 FIGMLLAIAFLILVLILVCLIKRNRGGKYAVHERE 46 (103)
Q Consensus 12 fIg~~~ai~lLllilli~C~i~R~rGgKY~VkeKE 46 (103)
+++++++.+||+.++++...-+|+-.-.=+|.-.|
T Consensus 4 ~v~vlVaa~llV~~i~l~l~~r~raA~~~~l~a~~ 38 (299)
T KOG3054|consen 4 IVAVLVAAALLVAVILLFLWKRRRAARTDMLLARE 38 (299)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcccccchhhhhh
Confidence 35666666666655444443344433344444444
No 205
>COG1314 SecG Preprotein translocase subunit SecG [Intracellular trafficking and secretion]
Probab=36.99 E-value=57 Score=22.07 Aligned_cols=13 Identities=38% Similarity=0.917 Sum_probs=7.8
Q ss_pred HHHHhheeecCCC
Q psy17635 26 LILVCLIKRNRGG 38 (103)
Q Consensus 26 lli~C~i~R~rGg 38 (103)
++++.++.|.||+
T Consensus 16 LI~~VLlQ~~kg~ 28 (86)
T COG1314 16 LIILVLLQRGKGA 28 (86)
T ss_pred HHHheeeecCCCC
Confidence 3344566788754
No 206
>PF14584 DUF4446: Protein of unknown function (DUF4446)
Probab=36.92 E-value=30 Score=25.47 Aligned_cols=15 Identities=13% Similarity=0.470 Sum_probs=6.5
Q ss_pred HHHHHHHHHHHHHHH
Q psy17635 14 GMLLAIAFLILVLIL 28 (103)
Q Consensus 14 g~~~ai~lLllilli 28 (103)
+++++|+|++++.+.
T Consensus 8 ~~l~iilli~~~~~~ 22 (151)
T PF14584_consen 8 LVLVIILLILIIILN 22 (151)
T ss_pred HHHHHHHHHHHHHHH
Confidence 344444444444444
No 207
>PF13106 DUF3961: Domain of unknown function (DUF3961)
Probab=36.70 E-value=16 Score=21.87 Aligned_cols=23 Identities=26% Similarity=0.473 Sum_probs=16.3
Q ss_pred chhhhhHHHHHHHHHHHHHHHHH
Q psy17635 5 TVATAGWFIGMLLAIAFLILVLI 27 (103)
Q Consensus 5 ~~at~gWfIg~~~ai~lLllill 27 (103)
..+.|-||.|.....++++++..
T Consensus 12 ~~sdqIWFYG~~~ls~~m~~~ty 34 (40)
T PF13106_consen 12 CKSDQIWFYGFFGLSIFMILFTY 34 (40)
T ss_pred cccccEEEeeHHHHHHHHHHHHH
Confidence 45678999999887666555543
No 208
>KOG3653|consensus
Probab=36.43 E-value=32 Score=30.56 Aligned_cols=31 Identities=23% Similarity=0.221 Sum_probs=12.7
Q ss_pred hHHHHHHHH-HHHHHHHHHHHhheeecCCCcc
Q psy17635 10 GWFIGMLLA-IAFLILVLILVCLIKRNRGGKY 40 (103)
Q Consensus 10 gWfIg~~~a-i~lLllilli~C~i~R~rGgKY 40 (103)
.|.+.++.. +.+|++|++.+|+.-|.+-.+|
T Consensus 154 ~~~al~~~~~v~~l~~lvi~~~~~~r~~k~~~ 185 (534)
T KOG3653|consen 154 LIYALIPLLLVSLLAALVILAFLGYRQRKNAR 185 (534)
T ss_pred ehhhHHHHHHHHHHHHHHHHHHHHHHHhhccc
Confidence 344433333 3333333333444445555544
No 209
>PRK04989 psbM photosystem II reaction center protein M; Provisional
Probab=36.10 E-value=77 Score=18.47 Aligned_cols=19 Identities=26% Similarity=0.492 Sum_probs=9.0
Q ss_pred HHHHHHHHHHHHHHHHHHh
Q psy17635 12 FIGMLLAIAFLILVLILVC 30 (103)
Q Consensus 12 fIg~~~ai~lLllilli~C 30 (103)
||+.+..|.+=...|+|+.
T Consensus 8 fiAt~Lfi~iPt~FLlilY 26 (35)
T PRK04989 8 FVASLLFVLVPTVFLIILY 26 (35)
T ss_pred HHHHHHHHHHHHHHHHHHh
Confidence 4555554444444444444
No 210
>PF09930 DUF2162: Predicted transporter (DUF2162); InterPro: IPR017199 This group represents a predicted membrane transporter, MTH672 type.
Probab=36.07 E-value=32 Score=26.97 Aligned_cols=36 Identities=28% Similarity=0.722 Sum_probs=26.7
Q ss_pred hhhhhHHHHHHHHHHHHHHHHHHHhheeecCCCcccc
Q psy17635 6 VATAGWFIGMLLAIAFLILVLILVCLIKRNRGGKYAV 42 (103)
Q Consensus 6 ~at~gWfIg~~~ai~lLllilli~C~i~R~rGgKY~V 42 (103)
+.-.+|-+|+..++++.+++++.-.+.| ..+.+||+
T Consensus 126 ~~~s~~~ig~~~g~if~i~il~ss~i~r-~~~~~~p~ 161 (224)
T PF09930_consen 126 IGLSGWEIGLVLGLIFFILILLSSFIFR-RLKKPYPI 161 (224)
T ss_pred cCchHHHHHHHHHHHHHHHHHHHHHHHH-HhcCCCch
Confidence 3446899999999999999966655554 55667763
No 211
>PF13624 SurA_N_3: SurA N-terminal domain; PDB: 3NRK_A.
Probab=35.86 E-value=12 Score=25.80 Aligned_cols=21 Identities=24% Similarity=0.434 Sum_probs=0.0
Q ss_pred hhhhhHHHHHHHHHHHHHHHH
Q psy17635 6 VATAGWFIGMLLAIAFLILVL 26 (103)
Q Consensus 6 ~at~gWfIg~~~ai~lLllil 26 (103)
+.+..|.++++.+++++++++
T Consensus 5 ~r~~~~~~~~~~~iv~~~~~l 25 (154)
T PF13624_consen 5 IRKNSKVFKILIGIVLAIFVL 25 (154)
T ss_dssp ---------------------
T ss_pred cccchhhhhhhHHHHHHHHHH
Confidence 344457666555555444443
No 212
>PF12259 DUF3609: Protein of unknown function (DUF3609); InterPro: IPR022048 This domain family is found in eukaryotes and viruses, and is typically between 348 and 360 amino acids in length.
Probab=35.70 E-value=19 Score=29.97 Aligned_cols=22 Identities=27% Similarity=0.440 Sum_probs=13.0
Q ss_pred HHHHHHHHHHHHHHHHHhheee
Q psy17635 13 IGMLLAIAFLILVLILVCLIKR 34 (103)
Q Consensus 13 Ig~~~ai~lLllilli~C~i~R 34 (103)
|++..+++|.++++.++|++++
T Consensus 301 i~v~~~~vli~vl~~~~~~~~~ 322 (361)
T PF12259_consen 301 IAVCGAIVLIIVLISLAWLYRT 322 (361)
T ss_pred EehhHHHHHHHHHHHHHhheee
Confidence 4455555555555566677765
No 213
>MTH00171 ATP8 ATP synthase F0 subunit 8; Provisional
Probab=35.64 E-value=74 Score=18.75 Aligned_cols=23 Identities=22% Similarity=0.595 Sum_probs=15.1
Q ss_pred CCCcchhhhhHHHHHHHHHHHHHHH
Q psy17635 1 MPSDTVATAGWFIGMLLAIAFLILV 25 (103)
Q Consensus 1 ~~~~~~at~gWfIg~~~ai~lLlli 25 (103)
||| .+..-||+.++.+...|+++
T Consensus 1 MPQ--L~p~pWf~~~~~~W~~l~~~ 23 (54)
T MTH00171 1 MPQ--LNPIPWFFIFLFSWLIFLTL 23 (54)
T ss_pred CCC--CCchHHHHHHHHHHHHHHHH
Confidence 555 34557999999985555533
No 214
>PRK10144 formate-dependent nitrite reductase complex subunit NrfF; Provisional
Probab=35.61 E-value=43 Score=24.31 Aligned_cols=19 Identities=11% Similarity=0.179 Sum_probs=10.3
Q ss_pred chhhhhHHHHHHHHHHHHH
Q psy17635 5 TVATAGWFIGMLLAIAFLI 23 (103)
Q Consensus 5 ~~at~gWfIg~~~ai~lLl 23 (103)
++...+|+..++=+++|++
T Consensus 96 p~~~~t~~LW~~P~~lll~ 114 (126)
T PRK10144 96 PLTGQTLVLWALPVVLLLL 114 (126)
T ss_pred CCCcchHHHHHHHHHHHHH
Confidence 3455566666665544443
No 215
>PRK14094 psbM photosystem II reaction center protein M; Provisional
Probab=35.45 E-value=60 Score=20.26 Aligned_cols=27 Identities=26% Similarity=0.444 Sum_probs=11.7
Q ss_pred HHHHHHHHHHHHHHHHHHhheeecCCCc
Q psy17635 12 FIGMLLAIAFLILVLILVCLIKRNRGGK 39 (103)
Q Consensus 12 fIg~~~ai~lLllilli~C~i~R~rGgK 39 (103)
||+.+..|++=...| ++-+|+-..++|
T Consensus 8 fiAtaLFi~iPT~FL-lilYVkT~s~~k 34 (50)
T PRK14094 8 FVASLLFVGVPTIFL-IGLFISTQDGEK 34 (50)
T ss_pred HHHHHHHHHHHHHHh-hheeEEecccCc
Confidence 444444333333333 344466554444
No 216
>PF09685 Tic20: Tic20-like protein; InterPro: IPR019109 This entry represents a group of uncharacterised conserved proteins including a chloroplast protein import component called Tic20. Chloroplast function requires the import of nuclear encoded proteins from the cytoplasm across the chloroplast double membrane. This is accomplished by two protein complexes, the Toc complex located at the outer membrane and the Tic complex located at the inner membrane. The Toc complex recognises specific proteins by a cleavable N-terminal sequence and is primarily responsible for translocation through the outer membrane, while the Tic complex translocates the protein through the inner membrane. Tic20 is a core member of the Tic complex and is deeply embedded in the inner envelope membrane. It is thought to function as a protein conducting component of the Tic complex [].
Probab=35.03 E-value=46 Score=21.63 Aligned_cols=16 Identities=38% Similarity=0.634 Sum_probs=8.5
Q ss_pred HHHHhheeecCCCccc
Q psy17635 26 LILVCLIKRNRGGKYA 41 (103)
Q Consensus 26 lli~C~i~R~rGgKY~ 41 (103)
+.|.-.+|-++|..|.
T Consensus 86 ~~I~~~~~a~~g~~~~ 101 (109)
T PF09685_consen 86 LSIIGAIKANKGEPYR 101 (109)
T ss_pred HHHHHHHHHHCCCeee
Confidence 3333445667776654
No 217
>PHA03281 envelope glycoprotein E; Provisional
Probab=34.86 E-value=26 Score=31.60 Aligned_cols=26 Identities=31% Similarity=0.483 Sum_probs=18.5
Q ss_pred HHHHHHHHHHHHHHHHHHHhheeecC
Q psy17635 11 WFIGMLLAIAFLILVLILVCLIKRNR 36 (103)
Q Consensus 11 WfIg~~~ai~lLllilli~C~i~R~r 36 (103)
=+++-|.|.|||+|++..+|..++-|
T Consensus 558 ~l~~~~a~~~ll~l~~~~~c~~~~~~ 583 (642)
T PHA03281 558 AITGGFAALALLCLAIALICTAKKFG 583 (642)
T ss_pred hhhhhhHHHHHHHHHHHHHHHHHHhh
Confidence 34566778888888888889665433
No 218
>COG3889 Predicted solute binding protein [General function prediction only]
Probab=34.86 E-value=23 Score=33.07 Aligned_cols=29 Identities=24% Similarity=0.240 Sum_probs=17.3
Q ss_pred hhHHHHHHHHHHHHH-HHHHHHhheeecCC
Q psy17635 9 AGWFIGMLLAIAFLI-LVLILVCLIKRNRG 37 (103)
Q Consensus 9 ~gWfIg~~~ai~lLl-lilli~C~i~R~rG 37 (103)
++=.++.+|+.+.|| .++.+..+.+|+||
T Consensus 843 ~ttt~~~i~g~i~iiv~LaAla~lLrRRrg 872 (872)
T COG3889 843 QTTTGGGICGPIVIIVGLAALALLLRRRRG 872 (872)
T ss_pred ccccccccchHHHHHHHHHHHHHHHHhhcC
Confidence 444578888866444 44444455667665
No 219
>PF05915 DUF872: Eukaryotic protein of unknown function (DUF872); InterPro: IPR008590 This entry represents several uncharacterised eukaryotic transmembrane proteins. The function of this currently unknown.
Probab=34.81 E-value=38 Score=23.92 Aligned_cols=8 Identities=13% Similarity=0.256 Sum_probs=3.7
Q ss_pred hheeecCC
Q psy17635 30 CLIKRNRG 37 (103)
Q Consensus 30 C~i~R~rG 37 (103)
|..|+.||
T Consensus 99 ~a~rg~~G 106 (115)
T PF05915_consen 99 YAWRGYKG 106 (115)
T ss_pred HHHcCCCC
Confidence 44444444
No 220
>smart00805 AGTRAP Angiotensin II, type I receptor-associated protein. This family consists of several angiotensin II, type I receptor-associated protein (AGTRAP) sequences. AGTRAP is known to interact specifically with the C-terminal cytoplasmic region of the angiotensin II type 1 (AT(1)) receptor to regulate different aspects of AT(1) receptor physiology. The function of this family is unclear.
Probab=34.69 E-value=34 Score=25.98 Aligned_cols=12 Identities=42% Similarity=0.761 Sum_probs=9.6
Q ss_pred eecCCCcccchh
Q psy17635 33 KRNRGGKYAVHE 44 (103)
Q Consensus 33 ~R~rGgKY~Vke 44 (103)
-|.|||.|+|.-
T Consensus 112 ~~eRGG~l~v~~ 123 (159)
T smart00805 112 YRERGGELPVNT 123 (159)
T ss_pred HHHccCcccccc
Confidence 489999998653
No 221
>KOG0196|consensus
Probab=34.68 E-value=43 Score=31.72 Aligned_cols=17 Identities=29% Similarity=0.401 Sum_probs=10.5
Q ss_pred hhhHHHHHHHHHHHHHH
Q psy17635 8 TAGWFIGMLLAIAFLIL 24 (103)
Q Consensus 8 t~gWfIg~~~ai~lLll 24 (103)
...|+||.+++.+++||
T Consensus 546 ~l~~i~g~~~~~v~~ll 562 (996)
T KOG0196|consen 546 QLPLIIGSILAGVVFLL 562 (996)
T ss_pred chhhHHHHHHHHHHHHH
Confidence 34588888888444333
No 222
>TIGR01294 P_lamban phospholamban. This model represents the short (52 residue) transmembrane phosphoprotein phospholamban. Phospholamban, in its unphosphorylated form, inhibits SERCA2, the cardiac sarcoplasmic reticulum Ca-ATPase.
Probab=34.64 E-value=52 Score=20.50 Aligned_cols=22 Identities=32% Similarity=0.557 Sum_probs=10.9
Q ss_pred hhhHHHHHHHHHHHHHHHHHHH
Q psy17635 8 TAGWFIGMLLAIAFLILVLILV 29 (103)
Q Consensus 8 t~gWfIg~~~ai~lLllilli~ 29 (103)
-|--|+-+...+.-|+||.+|+
T Consensus 28 lq~lfvnf~lilicllli~iiv 49 (52)
T TIGR01294 28 LQNLFINFCLILICLLLICIIV 49 (52)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 3455665554444444444443
No 223
>MTH00123 ATP8 ATP synthase F0 subunit 8; Provisional
Probab=34.63 E-value=67 Score=19.35 Aligned_cols=23 Identities=48% Similarity=0.739 Sum_probs=14.6
Q ss_pred CCCcchhhhhHHHHHHHHHHHHHHH
Q psy17635 1 MPSDTVATAGWFIGMLLAIAFLILV 25 (103)
Q Consensus 1 ~~~~~~at~gWfIg~~~ai~lLlli 25 (103)
|||- ...-||.-++++-+.|+++
T Consensus 1 MPQL--np~pWf~~~l~tWl~l~~~ 23 (54)
T MTH00123 1 MPQL--NPAPWFLIMLLTWLTLILI 23 (54)
T ss_pred CCCC--CchHHHHHHHHHHHHHHHH
Confidence 5553 3457999999885444443
No 224
>PF13633 N_methyl_3: Prokaryotic N-terminal methylation site
Probab=34.63 E-value=68 Score=16.85 Aligned_cols=17 Identities=35% Similarity=0.485 Sum_probs=10.7
Q ss_pred HHHHHHHHHHHHHHHHH
Q psy17635 12 FIGMLLAIAFLILVLIL 28 (103)
Q Consensus 12 fIg~~~ai~lLllilli 28 (103)
+|=+|+|++++-+.+++
T Consensus 3 LIEvlIa~~i~~i~~~g 19 (22)
T PF13633_consen 3 LIEVLIAIAILGILALG 19 (22)
T ss_pred HHHHHHHHHHHHHHHHh
Confidence 46677777766655554
No 225
>PF01529 zf-DHHC: DHHC palmitoyltransferase; InterPro: IPR001594 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents the DHHC-type zinc finger domain, which is also known as NEW1 []. The DHHC Zn-finger was first isolated in the Drosophila putative transcription factor DNZ1 and was named after a conserved sequence motif []. This domain has palmitoyltransferase activity; this post-translational modification attaches the C16 saturated fatty acid palmitate via a thioester linkage, predominantly to cysteine residues []. This domain is found in the DHHC proteins which are palmitoyl transferases []; the DHHC motif is found within a cysteine-rich domain which is thought to contain the catalytic site. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding
Probab=34.55 E-value=60 Score=22.70 Aligned_cols=24 Identities=17% Similarity=0.346 Sum_probs=15.2
Q ss_pred chhhhhHHHHHHHHHHHHHHHHHH
Q psy17635 5 TVATAGWFIGMLLAIAFLILVLIL 28 (103)
Q Consensus 5 ~~at~gWfIg~~~ai~lLllilli 28 (103)
+..+..||+..+....+++++.++
T Consensus 87 G~~N~~~F~~fl~~~~~~~~~~~~ 110 (174)
T PF01529_consen 87 GRRNHRYFLLFLLYLCLYCLYFFI 110 (174)
T ss_pred ccccHHHHHHHHHHHHHHHHHHHH
Confidence 456788898777665555544444
No 226
>PF03264 Cytochrom_NNT: NapC/NirT cytochrome c family, N-terminal region; InterPro: IPR005126 Within the NapC/NirT family of cytochrome c proteins, some members, such as NapC P33932 from SWISSPROT and NirT P24038 from SWISSPROT, bind four haem groups, while others, such as TorC P33226 from SWISSPROT, bind five haems. This family aligns the common N-terminal region that contains four haem-binding C-X(2)-CH motifs.; PDB: 2VR0_F 2J7A_C.
Probab=34.51 E-value=12 Score=27.16 Aligned_cols=14 Identities=21% Similarity=0.634 Sum_probs=3.7
Q ss_pred hHHHHHHHHHHHHH
Q psy17635 10 GWFIGMLLAIAFLI 23 (103)
Q Consensus 10 gWfIg~~~ai~lLl 23 (103)
-|++++++++++++
T Consensus 4 ~~~~~~~~~~~~~~ 17 (173)
T PF03264_consen 4 KWSLLLLLLVGVVL 17 (173)
T ss_dssp ------HSTTCHHH
T ss_pred hhHHHHHHHHHHHH
Confidence 46665555544433
No 227
>PHA02975 hypothetical protein; Provisional
Probab=34.28 E-value=54 Score=21.75 Aligned_cols=21 Identities=10% Similarity=0.123 Sum_probs=12.6
Q ss_pred hhhhHHHHHHHHHHHHHHHHH
Q psy17635 7 ATAGWFIGMLLAIAFLILVLI 27 (103)
Q Consensus 7 at~gWfIg~~~ai~lLllill 27 (103)
...+|.+.++..++.++++++
T Consensus 40 ~~~~~~~~ii~i~~v~~~~~~ 60 (69)
T PHA02975 40 KSSLSIILIIFIIFITCIAVF 60 (69)
T ss_pred CCchHHHHHHHHHHHHHHHHH
Confidence 556788877775444444443
No 228
>PF06011 TRP: Transient receptor potential (TRP) ion channel; InterPro: IPR010308 This family consists of hypothetical proteins of unknown function found in fungi.
Probab=34.11 E-value=55 Score=27.14 Aligned_cols=35 Identities=26% Similarity=0.629 Sum_probs=21.0
Q ss_pred hhhhHHHHHHHHHHHHHHHHHH-----HhheeecCCCccc
Q psy17635 7 ATAGWFIGMLLAIAFLILVLIL-----VCLIKRNRGGKYA 41 (103)
Q Consensus 7 at~gWfIg~~~ai~lLllilli-----~C~i~R~rGgKY~ 41 (103)
.--|+.|.++-+++++++++++ +.++++++..||.
T Consensus 385 ~~vg~vi~~i~~~v~~~~~i~~~~~~~~~~~~~~~~~~~~ 424 (438)
T PF06011_consen 385 TVVGYVIIIINAIVLLILFILIIVSTIISLFRKRPDSRYD 424 (438)
T ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCcCcCCC
Confidence 3456777777776665544433 2445677777764
No 229
>PF10907 DUF2749: Protein of unknown function (DUF2749); InterPro: IPR024475 This bacterial family of proteins represent the TrbJ and TrbK genes of the Ti plasmid conjugative transfer operon [].
Probab=33.97 E-value=35 Score=22.45 Aligned_cols=38 Identities=26% Similarity=0.302 Sum_probs=21.2
Q ss_pred hHHHHHHHHHHHHHHHHHHHhheeecC-CCcccchhhhhhc
Q psy17635 10 GWFIGMLLAIAFLILVLILVCLIKRNR-GGKYAVHEREAAH 49 (103)
Q Consensus 10 gWfIg~~~ai~lLllilli~C~i~R~r-GgKY~VkeKE~~~ 49 (103)
.|+|+|++|++.+ ...+.++|-.++ .++=+..+..++|
T Consensus 4 ~viIaL~~avaa~--a~~atwviVq~~~a~~p~s~eeQr~~ 42 (66)
T PF10907_consen 4 RVIIALVVAVAAA--AGAATWVIVQPRPAGNPASSEEQRAH 42 (66)
T ss_pred chhHHHHHHHHhh--hceeEEEEECCCCCCCCCChHHHHHH
Confidence 5788888875554 333456566666 4444444443444
No 230
>PF04790 Sarcoglycan_1: Sarcoglycan complex subunit protein; InterPro: IPR006875 The dystrophin glycoprotein complex (DGC) is a membrane-spanning complex that links the interior cytoskeleton to the extracellular matrix in muscle. The sarcoglycan complex is a subcomplex within the DGC and is composed of several muscle-specific, transmembrane proteins (alpha-, beta-, gamma-, delta- and zeta-sarcoglycan). The sarcoglycans are asparagine-linked glycosylated proteins with single transmembrane domains. This family contains beta, gamma and delta members [, ].; GO: 0007010 cytoskeleton organization, 0016012 sarcoglycan complex, 0016021 integral to membrane
Probab=33.80 E-value=49 Score=26.47 Aligned_cols=24 Identities=21% Similarity=0.351 Sum_probs=11.8
Q ss_pred hHHHHHHHHHHHHHHHHHHHhhee
Q psy17635 10 GWFIGMLLAIAFLILVLILVCLIK 33 (103)
Q Consensus 10 gWfIg~~~ai~lLllilli~C~i~ 33 (103)
.|++.++++|+.++=++|.+++++
T Consensus 14 ly~~vllL~il~iiNL~LTiwIl~ 37 (264)
T PF04790_consen 14 LYLFVLLLFILAIINLALTIWILK 37 (264)
T ss_pred HHHHHHHHHHHHHHHHHHHhhhhe
Confidence 355555555444444444455554
No 231
>PF13967 RSN1_TM: Late exocytosis, associated with Golgi transport
Probab=33.73 E-value=52 Score=23.38 Aligned_cols=28 Identities=14% Similarity=0.497 Sum_probs=15.8
Q ss_pred HHHHHHHHHHHHhheeecCCCcccchhh
Q psy17635 18 AIAFLILVLILVCLIKRNRGGKYAVHER 45 (103)
Q Consensus 18 ai~lLllilli~C~i~R~rGgKY~VkeK 45 (103)
.+++.++.+++.|++||+-..-|.-+..
T Consensus 9 ~~~i~~~~~~~F~~lR~~~~~iY~pR~~ 36 (157)
T PF13967_consen 9 NLIIFLVLLLLFCILRKRFPRIYQPRSY 36 (157)
T ss_pred HHHHHHHHHHHHHHHHhccHHhcccccc
Confidence 3444444455567666665666765544
No 232
>KOG4812|consensus
Probab=33.66 E-value=36 Score=27.70 Aligned_cols=28 Identities=18% Similarity=0.396 Sum_probs=21.9
Q ss_pred hhhhhHHHHHHHHHHHHHHHHHHHhhee
Q psy17635 6 VATAGWFIGMLLAIAFLILVLILVCLIK 33 (103)
Q Consensus 6 ~at~gWfIg~~~ai~lLllilli~C~i~ 33 (103)
+..|.|+-.++|++.|||++--++=+||
T Consensus 219 ~n~q~wLwwi~~vlG~ll~lr~~i~Yik 246 (262)
T KOG4812|consen 219 FNGQYWLWWIFLVLGLLLFLRGFINYIK 246 (262)
T ss_pred cccchHHHHHHHHHHHHHHHHHHHhHHH
Confidence 4458899999999999998876666665
No 233
>TIGR03147 cyt_nit_nrfF cytochrome c nitrite reductase, accessory protein NrfF.
Probab=33.66 E-value=51 Score=23.91 Aligned_cols=30 Identities=13% Similarity=0.219 Sum_probs=16.4
Q ss_pred chhhhhHHHHHHHHHHHHHHHHHHHhheee
Q psy17635 5 TVATAGWFIGMLLAIAFLILVLILVCLIKR 34 (103)
Q Consensus 5 ~~at~gWfIg~~~ai~lLllilli~C~i~R 34 (103)
++...+|+..++=+++|++-+++++-.+||
T Consensus 96 p~~~~t~~LW~~P~lll~~G~~~~~~~~rr 125 (126)
T TIGR03147 96 PFKWQTLLLWLLPVLLLLLAFVLLWRVRRR 125 (126)
T ss_pred CCCcchHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 445566777666665555544444444444
No 234
>COG4768 Uncharacterized protein containing a divergent version of the methyl-accepting chemotaxis-like domain [General function prediction only]
Probab=33.63 E-value=54 Score=24.40 Aligned_cols=19 Identities=42% Similarity=0.759 Sum_probs=11.1
Q ss_pred HHHHHHHHHHHHHHHHHHH
Q psy17635 11 WFIGMLLAIAFLILVLILV 29 (103)
Q Consensus 11 WfIg~~~ai~lLllilli~ 29 (103)
.+-+++.||+|++|.+.++
T Consensus 5 yIs~~iiAiAf~vL~I~li 23 (139)
T COG4768 5 YISLAIIAIAFLVLVIYLI 23 (139)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 3445666677776655443
No 235
>PF08078 PsaX: PsaX family; InterPro: IPR012986 This family consists of the PsaX family of photosystem I (PSI) protein subunits. PSI is a large multi-subunit pigment protein complex embedded in the thylakoid membranes of green plants and cyanobacteria. PsaX is one of the 12 protein subunits found in PSI and these subunits are arranged as monomers or trimers within the membrane as shown by the structure of the trimeric complex from Synechococcus elongatus [].; PDB: 3PCQ_X 1JB0_X.
Probab=33.43 E-value=51 Score=19.38 Aligned_cols=15 Identities=53% Similarity=1.170 Sum_probs=9.9
Q ss_pred hhhHHHHHHHHHHHHH
Q psy17635 8 TAGWFIGMLLAIAFLI 23 (103)
Q Consensus 8 t~gWfIg~~~ai~lLl 23 (103)
..+|.| ++.||-+|+
T Consensus 15 rt~Wa~-llLaINflV 29 (37)
T PF08078_consen 15 RTGWAL-LLLAINFLV 29 (37)
T ss_dssp HHHHHH-HHHHHHHHH
T ss_pred hHHHHH-HHHHHHHHH
Confidence 457988 666666654
No 236
>KOG1485|consensus
Probab=33.42 E-value=26 Score=30.17 Aligned_cols=29 Identities=34% Similarity=0.594 Sum_probs=23.8
Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHhheeec
Q psy17635 7 ATAGWFIGMLLAIAFLILVLILVCLIKRN 35 (103)
Q Consensus 7 at~gWfIg~~~ai~lLllilli~C~i~R~ 35 (103)
-.+.|.|++|+.+...+|.+...|-+++|
T Consensus 227 ~~a~~~i~i~is~~~vk~~l~~~c~~~~n 255 (412)
T KOG1485|consen 227 INALWLIAIMISAKEVKLRLTLYCAIKTN 255 (412)
T ss_pred cchhhhheehhhHHHHHHHHHHHHHHhcC
Confidence 34569999999988888888888877666
No 237
>PRK06870 secG preprotein translocase subunit SecG; Reviewed
Probab=33.38 E-value=75 Score=20.40 Aligned_cols=14 Identities=29% Similarity=0.778 Sum_probs=7.2
Q ss_pred HHHHHhheeecCCC
Q psy17635 25 VLILVCLIKRNRGG 38 (103)
Q Consensus 25 illi~C~i~R~rGg 38 (103)
++.++.+...+||+
T Consensus 15 ~LI~~vLlQ~~k~~ 28 (76)
T PRK06870 15 LLIILVLLQSGKGA 28 (76)
T ss_pred HHHHHheeeCCCCC
Confidence 33334555666654
No 238
>PHA02957 hypothetical protein; Provisional
Probab=33.07 E-value=45 Score=25.76 Aligned_cols=24 Identities=29% Similarity=0.701 Sum_probs=11.7
Q ss_pred HHHHHHHHHHHHHHHHHHhheeecC
Q psy17635 12 FIGMLLAIAFLILVLILVCLIKRNR 36 (103)
Q Consensus 12 fIg~~~ai~lLllilli~C~i~R~r 36 (103)
|.+++..|-+++|++++.| +.|+|
T Consensus 19 f~ai~~~~~~~~lli~~t~-~~~sk 42 (206)
T PHA02957 19 FLAILLIIPLELLLICHTC-ITASK 42 (206)
T ss_pred HHHHHHHHHHHHHHhhhee-eeccc
Confidence 3344444555555555555 34443
No 239
>PF08507 COPI_assoc: COPI associated protein; InterPro: IPR013714 Proteins in this family co-localise with COPI vesicle coat proteins []. In yeast it is a Golgi membrane protein involved in vesicular trafficking, interacting with TVP18 [].
Probab=32.97 E-value=73 Score=22.34 Aligned_cols=9 Identities=44% Similarity=0.693 Sum_probs=4.0
Q ss_pred hHHHHHHHH
Q psy17635 10 GWFIGMLLA 18 (103)
Q Consensus 10 gWfIg~~~a 18 (103)
.|++|++..
T Consensus 88 ~~i~g~~~~ 96 (136)
T PF08507_consen 88 SIIIGLLLF 96 (136)
T ss_pred HHHHHHHHH
Confidence 344444433
No 240
>PF08374 Protocadherin: Protocadherin; InterPro: IPR013585 The structure of protocadherins is similar to that of classic cadherins (IPR002126 from INTERPRO), but they also have some unique features associated with the cytoplasmic domains. They are expressed in a variety of organisms and are found in high concentrations in the brain where they seem to be localised mainly at cell-cell contact sites. Their expression seems to be developmentally regulated [].
Probab=32.83 E-value=20 Score=28.45 Aligned_cols=17 Identities=29% Similarity=0.723 Sum_probs=7.2
Q ss_pred HHHHHHHHHHHHHHHHHh
Q psy17635 13 IGMLLAIAFLILVLILVC 30 (103)
Q Consensus 13 Ig~~~ai~lLllilli~C 30 (103)
.|+|.+| |+|||+.+++
T Consensus 45 AG~~tVI-LVI~i~v~vR 61 (221)
T PF08374_consen 45 AGIMTVI-LVIFIVVLVR 61 (221)
T ss_pred cchhhhH-HHHHHHHHHH
Confidence 3444443 4444444444
No 241
>PF08001 CMV_US: CMV US; InterPro: IPR012536 This is a family of unique short (US) cytoplasmic glycoproteins which are expressed in cytomegalovirus [].; GO: 0030683 evasion by virus of host immune response, 0030176 integral to endoplasmic reticulum membrane
Probab=32.71 E-value=54 Score=26.00 Aligned_cols=27 Identities=22% Similarity=0.378 Sum_probs=20.7
Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHhhee
Q psy17635 7 ATAGWFIGMLLAIAFLILVLILVCLIK 33 (103)
Q Consensus 7 at~gWfIg~~~ai~lLllilli~C~i~ 33 (103)
....|...++.+-.+|++++||.|++.
T Consensus 176 l~~~wv~~i~~~r~tl~~~vla~~~~~ 202 (231)
T PF08001_consen 176 LTAFWVYLIIFSRVTLMGVVLAQCVYG 202 (231)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhh
Confidence 456788888888777777888887764
No 242
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=32.71 E-value=66 Score=22.63 Aligned_cols=27 Identities=7% Similarity=0.145 Sum_probs=17.6
Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHh
Q psy17635 4 DTVATAGWFIGMLLAIAFLILVLILVC 30 (103)
Q Consensus 4 ~~~at~gWfIg~~~ai~lLllilli~C 30 (103)
.+++.+.|+|+-...+++++++.-|-.
T Consensus 62 kDvS~~F~L~~~~ti~lv~~~~~~I~l 88 (103)
T PF12955_consen 62 KDVSVPFWLFAGFTIALVVLVAGAIGL 88 (103)
T ss_pred ccccchhhHHHHHHHHHHHHHHHHHHH
Confidence 356778888887766666555555543
No 243
>PRK06007 fliF flagellar MS-ring protein; Reviewed
Probab=32.68 E-value=44 Score=29.13 Aligned_cols=15 Identities=27% Similarity=0.693 Sum_probs=6.7
Q ss_pred HHHHHHHHHHHHHHH
Q psy17635 11 WFIGMLLAIAFLILV 25 (103)
Q Consensus 11 WfIg~~~ai~lLlli 25 (103)
||..++-.++++|++
T Consensus 436 ~~~~~~~~~~~~l~~ 450 (542)
T PRK06007 436 WFMDLIKLAAGALLI 450 (542)
T ss_pred HHHHHHHHHHHHHHH
Confidence 555554443333333
No 244
>KOG4298|consensus
Probab=32.66 E-value=63 Score=25.79 Aligned_cols=26 Identities=15% Similarity=0.616 Sum_probs=18.7
Q ss_pred hhHHHHHHHHHHHHHHHHHHHhheee
Q psy17635 9 AGWFIGMLLAIAFLILVLILVCLIKR 34 (103)
Q Consensus 9 ~gWfIg~~~ai~lLllilli~C~i~R 34 (103)
.+|....++.++|.++=+.++|++|=
T Consensus 148 laW~FST~iGllLFL~Ei~llcwvKF 173 (245)
T KOG4298|consen 148 LAWAFSTVIGLLLFLAEIVLLCWVKF 173 (245)
T ss_pred HHHHHHHHHHHHHHHHHHHHHheeee
Confidence 46877777777666666777888874
No 245
>PF12158 DUF3592: Protein of unknown function (DUF3592); InterPro: IPR021994 This family of proteins is functionally uncharacterised.This family of proteins is found in bacteria, archaea, eukaryotes and viruses. Proteins in this family are typically between 150 and 242 amino acids in length.
Probab=32.09 E-value=36 Score=22.99 Aligned_cols=14 Identities=36% Similarity=0.493 Sum_probs=7.7
Q ss_pred hHHHHHHHHHHHHH
Q psy17635 10 GWFIGMLLAIAFLI 23 (103)
Q Consensus 10 gWfIg~~~ai~lLl 23 (103)
+|+..++.++.+++
T Consensus 1 ~~~~~~~~~i~l~~ 14 (148)
T PF12158_consen 1 GVFLLLFGIIFLLI 14 (148)
T ss_pred CeEhHHHHHHHHHH
Confidence 46666655554444
No 246
>TIGR02227 sigpep_I_bact signal peptidase I, bacterial type. A related model finds a simlar protein in many archaea and a few bacteria, as well as a microsomal (endoplasmic reticulum) protein in eukaryotes.
Probab=32.07 E-value=40 Score=24.26 Aligned_cols=15 Identities=27% Similarity=0.645 Sum_probs=9.4
Q ss_pred HHHHHHHHHHHHHHH
Q psy17635 11 WFIGMLLAIAFLILV 25 (103)
Q Consensus 11 WfIg~~~ai~lLlli 25 (103)
|++.+++||++.+++
T Consensus 1 ~~~~~~~~~~~~~~i 15 (163)
T TIGR02227 1 LILSLLIAILLALLI 15 (163)
T ss_pred CHHHHHHHHHHHHHH
Confidence 666666666555554
No 247
>PF15106 TMEM156: TMEM156 protein family
Probab=32.06 E-value=53 Score=26.20 Aligned_cols=14 Identities=21% Similarity=0.605 Sum_probs=8.1
Q ss_pred hhhhHHHHHHHHHH
Q psy17635 7 ATAGWFIGMLLAIA 20 (103)
Q Consensus 7 at~gWfIg~~~ai~ 20 (103)
-...|.+.+++.++
T Consensus 174 mKITWYvLVllVfi 187 (226)
T PF15106_consen 174 MKITWYVLVLLVFI 187 (226)
T ss_pred hhhHHHHHHHHHHH
Confidence 35678876555433
No 248
>MTH00260 ATP8 ATP synthase F0 subunit 8; Provisional
Probab=31.92 E-value=1.1e+02 Score=18.78 Aligned_cols=32 Identities=16% Similarity=0.347 Sum_probs=21.9
Q ss_pred CCCcchhhhhHHHHHHHHHHHHHHHHHHHhheee
Q psy17635 1 MPSDTVATAGWFIGMLLAIAFLILVLILVCLIKR 34 (103)
Q Consensus 1 ~~~~~~at~gWfIg~~~ai~lLllilli~C~i~R 34 (103)
|||- +--.|++..+..++.++++..++.+.+-
T Consensus 1 MPQm--sPm~W~~l~~~f~~~~~~~~~~~~~~~~ 32 (53)
T MTH00260 1 MPHL--SPMSWLTAMIIFWFILLIFASSMWWSQS 32 (53)
T ss_pred CCCc--ccHHHHHHHHHHHHHHHHHHHHHHHhcC
Confidence 4543 3357999888777777777777777653
No 249
>PRK08476 F0F1 ATP synthase subunit B'; Validated
Probab=31.89 E-value=58 Score=23.14 Aligned_cols=21 Identities=24% Similarity=0.453 Sum_probs=10.9
Q ss_pred HHHHHHHHHHHHHHHHHHHhh
Q psy17635 11 WFIGMLLAIAFLILVLILVCL 31 (103)
Q Consensus 11 WfIg~~~ai~lLllilli~C~ 31 (103)
|...++-+|.||+|++++-.|
T Consensus 7 ~~~~~~qli~Flil~~~l~kf 27 (141)
T PRK08476 7 PYLMLATFVVFLLLIVILNSW 27 (141)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 544445555566655555433
No 250
>PF10853 DUF2650: Protein of unknown function (DUF2650); InterPro: IPR022559 This region is found in proteins with unknown function in metazoa.
Probab=31.85 E-value=40 Score=19.79 Aligned_cols=14 Identities=21% Similarity=0.406 Sum_probs=9.1
Q ss_pred hhhhhHHHHHHHHH
Q psy17635 6 VATAGWFIGMLLAI 19 (103)
Q Consensus 6 ~at~gWfIg~~~ai 19 (103)
++-|.|++.+|..+
T Consensus 22 f~lq~Wv~v~l~v~ 35 (38)
T PF10853_consen 22 FRLQIWVIVLLAVL 35 (38)
T ss_pred HHHHHHHHHHHHHH
Confidence 56688887665443
No 251
>TIGR02507 MtrF tetrahydromethanopterin S-methyltransferase, F subunit. coenzyme M methyltransferase in methanogenic archaea. This methyltranferase is membrane-associated enzyme complex that uses methy-transfer reaction to drive sodium-ion pump. Archaea domain, have evolved energy-yielding pathways marked by one-carbon biochemistry featuring novel cofactors and enzymes. This transferase is involved in the transfer of 'methyl' group from N5-methyltetrahydromethanopterin to coenzyme M. In an accompanying reaction, methane is produced by two-electron reduction of the methyl moiety in methyl-coenzyme M by another enzyme methyl-coenzyme M reductase.
Probab=31.69 E-value=85 Score=20.53 Aligned_cols=17 Identities=24% Similarity=0.507 Sum_probs=12.6
Q ss_pred hhhhhHHHHHHHHHHHH
Q psy17635 6 VATAGWFIGMLLAIAFL 22 (103)
Q Consensus 6 ~at~gWfIg~~~ai~lL 22 (103)
....|..||++.|++|.
T Consensus 40 ~~~~G~~iG~~~Al~lV 56 (65)
T TIGR02507 40 TTITGLAYGFLFAVLLV 56 (65)
T ss_pred hhhHHHHHHHHHHHHHH
Confidence 45678888888887664
No 252
>PF04639 Baculo_E56: Baculoviral E56 protein, specific to ODV envelope; InterPro: IPR006733 This family represents the E56 protein, which is localized to the occlusion derived virus (ODV) envelope, but not to the budded virus (BV) envelope []. Signals necessary for transport and/or retention into this structure are believed to be found within the C-terminal portion of ODV-E56.; GO: 0019031 viral envelope
Probab=31.59 E-value=56 Score=27.15 Aligned_cols=9 Identities=11% Similarity=0.006 Sum_probs=3.3
Q ss_pred hhHHHHHHH
Q psy17635 9 AGWFIGMLL 17 (103)
Q Consensus 9 ~gWfIg~~~ 17 (103)
..+-|.+|+
T Consensus 276 ~l~piil~I 284 (305)
T PF04639_consen 276 SLLPIILII 284 (305)
T ss_pred hhhHHHHHH
Confidence 333333333
No 253
>PF06422 PDR_CDR: CDR ABC transporter; InterPro: IPR010929 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energise diverse biological systems. ABC transporters minimally consist of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain []. The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyse ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarise the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site [, , ]. The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis [, , , , , ]. The ATP-Binding Cassette (ABC) superfamily forms one of the largest of all protein families with a diversity of physiological functions []. Several studies have shown that there is a correlation between the functional characterisation and the phylogenetic classification of the ABC cassette [, ]. More than 50 subfamilies have been described based on a phylogenetic and functional classification [, , ]; (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). In yeast, the PDR and CDR ABC transporters display extensive sequence homology, and confer resistance to several anti-fungal compounds by actively transporting their substrates out of the cell. These transporters have two homologous halves, each with an N-terminal intracellular hydrophilic region that contains an ATP-binding site, followed by a C-terminal membrane-associated region containing six transmembrane segments []. This entry represents a domain of the PDR/CDR ABC transporter comprising extracellular loop 3, transmembrane segment 6 and a linker region.; GO: 0005524 ATP binding, 0042626 ATPase activity, coupled to transmembrane movement of substances, 0006810 transport, 0016021 integral to membrane
Probab=31.44 E-value=82 Score=21.36 Aligned_cols=27 Identities=33% Similarity=0.567 Sum_probs=14.3
Q ss_pred HHHHHHHHHHHhheeecCCC-cccchhh
Q psy17635 19 IAFLILVLILVCLIKRNRGG-KYAVHER 45 (103)
Q Consensus 19 i~lLllilli~C~i~R~rGg-KY~VkeK 45 (103)
|.++++.++.+-+++..+++ .-.|..|
T Consensus 60 i~f~~~~~~~~e~~~~~~~~~~~l~f~r 87 (103)
T PF06422_consen 60 IFFIVLTLLATEFIKFEKSGGEVLVFKR 87 (103)
T ss_pred HHHHHHHHHHHHHhcccCCCccEEEEeC
Confidence 55555666666666655444 3333333
No 254
>PF11166 DUF2951: Protein of unknown function (DUF2951); InterPro: IPR021337 This family of proteins has no known function. It has a highly conserved sequence.
Probab=31.38 E-value=63 Score=22.77 Aligned_cols=21 Identities=19% Similarity=0.471 Sum_probs=16.1
Q ss_pred hhhHHHHHHHHHHHHHHHHHH
Q psy17635 8 TAGWFIGMLLAIAFLILVLIL 28 (103)
Q Consensus 8 t~gWfIg~~~ai~lLllilli 28 (103)
-..|+.|++.-|+-.++|+++
T Consensus 72 ~KmwilGlvgTi~gsliia~l 92 (98)
T PF11166_consen 72 IKMWILGLVGTIFGSLIIALL 92 (98)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 357999999988877777654
No 255
>COG3763 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=31.30 E-value=50 Score=22.03 Aligned_cols=6 Identities=17% Similarity=0.030 Sum_probs=2.8
Q ss_pred CCCCCC
Q psy17635 51 RHDYPE 56 (103)
Q Consensus 51 ~~~~~d 56 (103)
+||+.+
T Consensus 36 NPpine 41 (71)
T COG3763 36 NPPINE 41 (71)
T ss_pred CCCCCH
Confidence 445543
No 256
>PF10003 DUF2244: Integral membrane protein (DUF2244); InterPro: IPR019253 This entry consists of various bacterial putative membrane proteins with no known function.
Probab=31.25 E-value=84 Score=22.33 Aligned_cols=20 Identities=25% Similarity=0.369 Sum_probs=11.4
Q ss_pred hhhhhHHHHHHHHHHHHHHH
Q psy17635 6 VATAGWFIGMLLAIAFLILV 25 (103)
Q Consensus 6 ~at~gWfIg~~~ai~lLlli 25 (103)
.....||+++++++.+++.+
T Consensus 9 ~~g~~~~~~~~~~~~~~~a~ 28 (140)
T PF10003_consen 9 PRGFLIFIAILAAVSLIIAI 28 (140)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 34445777777665555433
No 257
>MTH00047 COX2 cytochrome c oxidase subunit II; Provisional
Probab=31.10 E-value=69 Score=24.35 Aligned_cols=9 Identities=11% Similarity=0.202 Sum_probs=4.0
Q ss_pred eecCCCccc
Q psy17635 33 KRNRGGKYA 41 (103)
Q Consensus 33 ~R~rGgKY~ 41 (103)
+|+...+|.
T Consensus 35 ~~~~~~~~~ 43 (194)
T MTH00047 35 SGNGSVNFG 43 (194)
T ss_pred ccccccccc
Confidence 334444554
No 258
>PF10617 DUF2474: Protein of unknown function (DUF2474); InterPro: IPR018895 This family of short proteins has no known function.
Probab=31.10 E-value=89 Score=18.58 Aligned_cols=19 Identities=21% Similarity=0.398 Sum_probs=14.0
Q ss_pred hhhhhHHHHHHHHHHHHHH
Q psy17635 6 VATAGWFIGMLLAIAFLIL 24 (103)
Q Consensus 6 ~at~gWfIg~~~ai~lLll 24 (103)
...-+||+++=++-++.+.
T Consensus 5 wkRl~W~v~iW~~SV~aL~ 23 (40)
T PF10617_consen 5 WKRLGWFVLIWAASVLALG 23 (40)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 4567999999888555553
No 259
>PF15202 Adipogenin: Adipogenin
Probab=30.93 E-value=90 Score=21.05 Aligned_cols=25 Identities=28% Similarity=0.300 Sum_probs=12.8
Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHhh
Q psy17635 7 ATAGWFIGMLLAIAFLILVLILVCL 31 (103)
Q Consensus 7 at~gWfIg~~~ai~lLllilli~C~ 31 (103)
-|...+.--+|.-+-|+|.++|+++
T Consensus 12 ltfsflvfwlclpv~lllfl~ivwl 36 (81)
T PF15202_consen 12 LTFSFLVFWLCLPVGLLLFLLIVWL 36 (81)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444444455664445555555543
No 260
>PF09788 Tmemb_55A: Transmembrane protein 55A; InterPro: IPR019178 Members of this family catalyse the hydrolysis of the 4-position phosphate of phosphatidylinositol 4,5-bisphosphate, in the reaction: 1-phosphatidyl-myo-inositol 4,5-bisphosphate + H(2)O = 1-phosphatidyl-1D-myo-inositol 5-phosphate + phosphate.
Probab=30.86 E-value=50 Score=26.80 Aligned_cols=37 Identities=30% Similarity=0.406 Sum_probs=17.8
Q ss_pred hhhhhHHHHHHHHHHHHHHHHHHH---hheeecCCCcccc
Q psy17635 6 VATAGWFIGMLLAIAFLILVLILV---CLIKRNRGGKYAV 42 (103)
Q Consensus 6 ~at~gWfIg~~~ai~lLllilli~---C~i~R~rGgKY~V 42 (103)
++...+++-+++++++|++.+.+. +-..++.||.|.+
T Consensus 193 faRkR~i~f~llgllfliiaigltvGT~~~A~~~~giY~~ 232 (256)
T PF09788_consen 193 FARKRAIIFFLLGLLFLIIAIGLTVGTWTYAKTYGGIYVS 232 (256)
T ss_pred HhhhHHHHHHHHHHHHHHHHHHHhhhhHHHHhhcCcEeHH
Confidence 444444444444444433332221 3234777888864
No 261
>PF08496 Peptidase_S49_N: Peptidase family S49 N-terminal; InterPro: IPR013703 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain is found to the N terminus of bacterial signal peptidases that belong to the MEROPS peptidase family S49 (protease IV family, clan SK) (see also IPR002142 from INTERPRO) [, ]. ; GO: 0004252 serine-type endopeptidase activity, 0005886 plasma membrane
Probab=30.81 E-value=62 Score=23.98 Aligned_cols=18 Identities=22% Similarity=0.399 Sum_probs=9.0
Q ss_pred HHHHHHHHHHhheeecCC
Q psy17635 20 AFLILVLILVCLIKRNRG 37 (103)
Q Consensus 20 ~lLllilli~C~i~R~rG 37 (103)
++++++.+|+-...|+++
T Consensus 20 aI~~vv~~I~~~~~k~k~ 37 (155)
T PF08496_consen 20 AILAVVGLIVAAAQKKKK 37 (155)
T ss_pred HHHHHHHHHHHHhccCCC
Confidence 333344445555566664
No 262
>KOG2678|consensus
Probab=30.55 E-value=63 Score=26.06 Aligned_cols=19 Identities=21% Similarity=0.481 Sum_probs=14.0
Q ss_pred hhHHHHHHHHHHHHHHHHH
Q psy17635 9 AGWFIGMLLAIAFLILVLI 27 (103)
Q Consensus 9 ~gWfIg~~~ai~lLllill 27 (103)
--||...|++++++.||.-
T Consensus 215 s~wf~~~miI~v~~sFVsM 233 (244)
T KOG2678|consen 215 SYWFYITMIIFVILSFVSM 233 (244)
T ss_pred hHHHHHHHHHHHHHHHHHH
Confidence 4699888888777776643
No 263
>PHA02657 hypothetical protein; Provisional
Probab=30.49 E-value=58 Score=22.71 Aligned_cols=19 Identities=21% Similarity=0.429 Sum_probs=11.6
Q ss_pred hhHHHHHHHHHHHHHHHHH
Q psy17635 9 AGWFIGMLLAIAFLILVLI 27 (103)
Q Consensus 9 ~gWfIg~~~ai~lLllill 27 (103)
..-|+.++|.++++||.+.
T Consensus 30 itvfv~vI~il~flLLYLv 48 (95)
T PHA02657 30 FTIFIFVVCILIYLLIYLV 48 (95)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 3456666666666666553
No 264
>PF13172 PepSY_TM_1: PepSY-associated TM helix
Probab=30.43 E-value=75 Score=17.35 Aligned_cols=14 Identities=36% Similarity=0.819 Sum_probs=7.6
Q ss_pred HHHHHHHHHHHHHH
Q psy17635 12 FIGMLLAIAFLILV 25 (103)
Q Consensus 12 fIg~~~ai~lLlli 25 (103)
.+|+++++.++++.
T Consensus 11 ~~g~~~~~~ll~~~ 24 (34)
T PF13172_consen 11 WLGLIAAIFLLLLA 24 (34)
T ss_pred HHHHHHHHHHHHHH
Confidence 45666665555443
No 265
>KOG1024|consensus
Probab=30.38 E-value=22 Score=31.38 Aligned_cols=30 Identities=17% Similarity=0.412 Sum_probs=20.4
Q ss_pred hhhhhHHHHHHHHHHHHHHHHHHHhhe-eec
Q psy17635 6 VATAGWFIGMLLAIAFLILVLILVCLI-KRN 35 (103)
Q Consensus 6 ~at~gWfIg~~~ai~lLllilli~C~i-~R~ 35 (103)
......|+++.+|.+++.|+++|+.|. .++
T Consensus 182 s~~~~f~V~v~va~a~v~lva~il~c~~~~s 212 (563)
T KOG1024|consen 182 SGLVTFIVGVIVALALVSLVALILYCAKGPS 212 (563)
T ss_pred cccEEEeeeHHHHHHHHHHHHHHHHHHhccc
Confidence 345567888888888887777776444 344
No 266
>TIGR00892 2A0113 monocarboxylate transporter 1.
Probab=30.24 E-value=40 Score=27.32 Aligned_cols=26 Identities=15% Similarity=0.011 Sum_probs=11.5
Q ss_pred HHHHHHHHHHHHHHHHHHHhheeecC
Q psy17635 11 WFIGMLLAIAFLILVLILVCLIKRNR 36 (103)
Q Consensus 11 WfIg~~~ai~lLllilli~C~i~R~r 36 (103)
|+.++++.++.+.+++.+..++++.|
T Consensus 404 ~~~~~~~li~~~~~~~~~~~~~~~~~ 429 (455)
T TIGR00892 404 YASGSIVVSAGLFLAIGNYINYRLLA 429 (455)
T ss_pred HHhhHHHHHHHHHHHHHHHHHHHHHH
Confidence 44444444444444444444444443
No 267
>TIGR02205 septum_zipA cell division protein ZipA. This model represents the full length of bacterial cell division protein ZipA. The N-terminal hydrophobic stretch is an uncleaved signal-anchor sequence. This is followed by an unconserved, variable length, low complexity region, and then a conserved C-terminal region of about 140 amino acids (see pfam04354) that interacts with the tubulin-like cell division protein FtsZ.
Probab=30.19 E-value=43 Score=27.19 Aligned_cols=10 Identities=0% Similarity=0.092 Sum_probs=4.2
Q ss_pred HHhheeecCC
Q psy17635 28 LVCLIKRNRG 37 (103)
Q Consensus 28 i~C~i~R~rG 37 (103)
-+|=.||.|+
T Consensus 21 Glwt~Rke~s 30 (284)
T TIGR02205 21 GLWTSRKEKS 30 (284)
T ss_pred cccccccccc
Confidence 3454444444
No 268
>COG5416 Uncharacterized integral membrane protein [Function unknown]
Probab=30.14 E-value=48 Score=23.33 Aligned_cols=7 Identities=14% Similarity=0.662 Sum_probs=3.4
Q ss_pred hHHHHHH
Q psy17635 10 GWFIGML 16 (103)
Q Consensus 10 gWfIg~~ 16 (103)
-|++.+.
T Consensus 23 ~w~vi~~ 29 (98)
T COG5416 23 QWTVIIV 29 (98)
T ss_pred eeeHHHH
Confidence 4555444
No 269
>PF05140 ResB: ResB-like family ; InterPro: IPR007816 This domain is found in a number of known and suspected cytochrome c biogenesis proteins, including ResB []. Mutations in ResB indicate that they are essential for growth []. ResB is predicted to be a transmembrane protein.
Probab=30.11 E-value=49 Score=27.68 Aligned_cols=25 Identities=28% Similarity=0.699 Sum_probs=15.0
Q ss_pred hhhhhHHHHHHHHHHHHHHHHHHHhheee
Q psy17635 6 VATAGWFIGMLLAIAFLILVLILVCLIKR 34 (103)
Q Consensus 6 ~at~gWfIg~~~ai~lLllilli~C~i~R 34 (103)
..+..||+++|..+ .+=|++|.++|
T Consensus 50 vy~S~wF~~ll~ll----~~sL~~Cs~~R 74 (464)
T PF05140_consen 50 VYSSWWFLLLLVLL----ALSLIACSIDR 74 (464)
T ss_pred eEecHHHHHHHHHH----HHHHHHHHHHH
Confidence 45678998876543 34445555554
No 270
>PF04202 Mfp-3: Foot protein 3; InterPro: IPR007328 Mytilus foot protein-3 (Mfp-3) is a highly polymorphic protein family located in the byssal adhesive plaques of blue mussels.
Probab=29.97 E-value=65 Score=21.44 Aligned_cols=17 Identities=35% Similarity=0.438 Sum_probs=8.5
Q ss_pred HHHHHHhheeecCCCcc
Q psy17635 24 LVLILVCLIKRNRGGKY 40 (103)
Q Consensus 24 lilli~C~i~R~rGgKY 40 (103)
|+|+-.|-|.-..|.-|
T Consensus 12 LvLIg~fAVqSdag~~y 28 (71)
T PF04202_consen 12 LVLIGSFAVQSDAGYYY 28 (71)
T ss_pred HHHHhhheeeecCcccc
Confidence 33444455555555544
No 271
>PF07856 Orai-1: Mediator of CRAC channel activity; InterPro: IPR012446 This entry includes Drosophila Orai and human Orai1, Orai2 and Orai3. ORAI-1 GFP reporters are co-expressed with STIM-1 (ER CA(2+) sensors) in the gonad and intestine. The protein has four predicted transmembrane domains with a highly conserved region between TM2 ad TM3. This conserved domain is thought to function in channel regulation. ORAI1-related proteins are required for the production of the calcium channel, CRAC, along with STIM1-related proteins [].
Probab=29.97 E-value=79 Score=23.84 Aligned_cols=26 Identities=27% Similarity=0.459 Sum_probs=19.8
Q ss_pred hhHHHHHHHHHHHHHHHHHHHhheee
Q psy17635 9 AGWFIGMLLAIAFLILVLILVCLIKR 34 (103)
Q Consensus 9 ~gWfIg~~~ai~lLllilli~C~i~R 34 (103)
..|..+....|.|.+.-+.++|+||=
T Consensus 108 ~~W~~s~~lGi~lFL~~l~l~~WIKF 133 (175)
T PF07856_consen 108 LAWRFSTVLGIPLFLAELALLGWIKF 133 (175)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHheee
Confidence 45888888887777777777888873
No 272
>PF09889 DUF2116: Uncharacterized protein containing a Zn-ribbon (DUF2116); InterPro: IPR019216 This entry contains various hypothetical prokaryotic proteins whose functions are unknown. They contain a conserved zinc ribbon motif in the N-terminal part and a predicted transmembrane segment in the C-terminal part.
Probab=29.76 E-value=78 Score=20.09 Aligned_cols=9 Identities=0% Similarity=0.069 Sum_probs=3.7
Q ss_pred hhHHHHHHH
Q psy17635 9 AGWFIGMLL 17 (103)
Q Consensus 9 ~gWfIg~~~ 17 (103)
.-|+.-++.
T Consensus 38 ~~~i~~~~~ 46 (59)
T PF09889_consen 38 TQYIFFGIF 46 (59)
T ss_pred HHHHHHHHH
Confidence 344443333
No 273
>PF07787 DUF1625: Protein of unknown function (DUF1625); InterPro: IPR012430 Sequences making up this family are derived from hypothetical proteins expressed by both prokaryotic and eukaryotic species. The region in question is approximately 250 residues long.
Probab=29.76 E-value=80 Score=24.28 Aligned_cols=21 Identities=14% Similarity=0.521 Sum_probs=10.0
Q ss_pred HHHHHHHHHHHHHHHHHHhhe
Q psy17635 12 FIGMLLAIAFLILVLILVCLI 32 (103)
Q Consensus 12 fIg~~~ai~lLllilli~C~i 32 (103)
++++++++.|-++++.+.++.
T Consensus 225 ~~~~~~s~~lsl~~Ia~aW~~ 245 (248)
T PF07787_consen 225 LVAFIISFSLSLLTIALAWLF 245 (248)
T ss_pred HHHHHHHHHHHHHHHHHhhee
Confidence 344444555555544444443
No 274
>PF10821 DUF2567: Protein of unknown function (DUF2567); InterPro: IPR021213 This is a bacterial family of proteins with unknown function.
Probab=29.64 E-value=59 Score=24.67 Aligned_cols=30 Identities=23% Similarity=0.420 Sum_probs=17.5
Q ss_pred hhhhhHHHHHHHHHHHHHHHHHHHhheeecCC
Q psy17635 6 VATAGWFIGMLLAIAFLILVLILVCLIKRNRG 37 (103)
Q Consensus 6 ~at~gWfIg~~~ai~lLllilli~C~i~R~rG 37 (103)
|..-+||.+++++..+ +..+.+++.||+||
T Consensus 45 F~a~a~f~~l~lv~Gv--vaav~~W~~R~~RG 74 (167)
T PF10821_consen 45 FDADALFVLLGLVLGV--VAAVAVWLWRRRRG 74 (167)
T ss_pred hhHHHHHHHHHHHHHH--HHHHHHHHHHhhcC
Confidence 4555677666655443 34455677666665
No 275
>PRK09174 F0F1 ATP synthase subunit B'; Validated
Probab=29.61 E-value=58 Score=24.85 Aligned_cols=26 Identities=23% Similarity=0.229 Sum_probs=10.1
Q ss_pred CCcchhhhhHHHHHHHHHHHHHHHHHH
Q psy17635 2 PSDTVATAGWFIGMLLAIAFLILVLIL 28 (103)
Q Consensus 2 ~~~~~at~gWfIg~~~ai~lLllilli 28 (103)
|+-.+.+..|-+.. .+|.|+||++++
T Consensus 45 p~~~~~~~~~~l~w-~~I~FliL~~lL 70 (204)
T PRK09174 45 PPFDSTHYASQLLW-LAITFGLFYLFM 70 (204)
T ss_pred CCCcchhccHHHHH-HHHHHHHHHHHH
Confidence 44444433333322 233444444443
No 276
>PF05808 Podoplanin: Podoplanin; InterPro: IPR008783 This family consists of several mammalian podoplanin-like proteins which are thought to control specifically the unique shape of podocytes [].; GO: 0016021 integral to membrane; PDB: 3IET_X.
Probab=29.61 E-value=18 Score=27.50 Aligned_cols=29 Identities=34% Similarity=0.726 Sum_probs=0.0
Q ss_pred hHHHHHHHHHHHHHHHHHHHhheeecCCCccc
Q psy17635 10 GWFIGMLLAIAFLILVLILVCLIKRNRGGKYA 41 (103)
Q Consensus 10 gWfIg~~~ai~lLllilli~C~i~R~rGgKY~ 41 (103)
|=++|+|+||.||- =|+.++.|.=.|.|+
T Consensus 133 GIIVGVLlaIG~ig---GIIivvvRKmSGRys 161 (162)
T PF05808_consen 133 GIIVGVLLAIGFIG---GIIIVVVRKMSGRYS 161 (162)
T ss_dssp --------------------------------
T ss_pred eehhhHHHHHHHHh---heeeEEeehhccccC
Confidence 33455555544433 223334455567775
No 277
>PF15192 TMEM213: TMEM213 family
Probab=29.60 E-value=84 Score=21.38 Aligned_cols=25 Identities=32% Similarity=0.668 Sum_probs=14.2
Q ss_pred chhhhhHHHHHHHH-HHHHHHHHHHHhh
Q psy17635 5 TVATAGWFIGMLLA-IAFLILVLILVCL 31 (103)
Q Consensus 5 ~~at~gWfIg~~~a-i~lLllilli~C~ 31 (103)
+.---||+.+.+.= +-||.||| +|+
T Consensus 41 gvDeyGWIAAAVGWSLwFLTLIL--LCv 66 (82)
T PF15192_consen 41 GVDEYGWIAAAVGWSLWFLTLIL--LCV 66 (82)
T ss_pred CCchhhHHHHHHhHHHHHHHHHH--HHH
Confidence 34456898776665 44444544 454
No 278
>PF06160 EzrA: Septation ring formation regulator, EzrA ; InterPro: IPR010379 During the bacterial cell cycle, the tubulin-like cell-division protein FtsZ polymerises into a ring structure that establishes the location of the nascent division site. EzrA modulates the frequency and position of FtsZ ring formation [].; GO: 0000921 septin ring assembly, 0005940 septin ring, 0016021 integral to membrane
Probab=29.59 E-value=15 Score=31.83 Aligned_cols=11 Identities=27% Similarity=0.597 Sum_probs=4.0
Q ss_pred HHHHHHhheee
Q psy17635 24 LVLILVCLIKR 34 (103)
Q Consensus 24 lilli~C~i~R 34 (103)
|+++++.++.|
T Consensus 10 l~~~~~~~~~r 20 (560)
T PF06160_consen 10 LIIYIIGYIYR 20 (560)
T ss_pred HHHHHHHHHHH
Confidence 33333333333
No 279
>PF12301 CD99L2: CD99 antigen like protein 2; InterPro: IPR022078 This family of proteins is found in eukaryotes. Proteins in this family are typically between 165 and 237 amino acids in length. CD99L2 and CD99 are involved in trans-endothelial migration of neutrophils in vitro and in the recruitment of neutrophils into inflamed peritoneum.
Probab=29.56 E-value=66 Score=24.30 Aligned_cols=23 Identities=22% Similarity=0.299 Sum_probs=18.3
Q ss_pred hhhhhHHHHHHHHHHHHHHHHHH
Q psy17635 6 VATAGWFIGMLLAIAFLILVLIL 28 (103)
Q Consensus 6 ~at~gWfIg~~~ai~lLllilli 28 (103)
-+.++=+.++++||+++|+-++.
T Consensus 111 ~~~~g~IaGIvsav~valvGAvs 133 (169)
T PF12301_consen 111 EAEAGTIAGIVSAVVVALVGAVS 133 (169)
T ss_pred CcccchhhhHHHHHHHHHHHHHH
Confidence 45567788999999999888766
No 280
>PF08475 Baculo_VP91_N: Viral capsid protein 91 N-terminal; InterPro: IPR013682 This domain is found in Baculoviridae including the nucleopolyhedrovirus at the N terminus of the viral capsid protein 91 (VP91) [].
Probab=29.41 E-value=39 Score=26.02 Aligned_cols=6 Identities=33% Similarity=0.656 Sum_probs=2.2
Q ss_pred HHHHHH
Q psy17635 17 LAIAFL 22 (103)
Q Consensus 17 ~ai~lL 22 (103)
+||+|+
T Consensus 4 vai~l~ 9 (183)
T PF08475_consen 4 VAILLI 9 (183)
T ss_pred hHHHHH
Confidence 333333
No 281
>PF01098 FTSW_RODA_SPOVE: Cell cycle protein; InterPro: IPR001182 A number of prokaryotic integral membrane proteins involved in cell cycle processes have been found to be structurally related [, ]. These proteins include, the Escherichia coli and related bacteria cell division protein ftsW and the rod shape-determining protein rodA (or mrdB), the Bacillus subtilis stage V sporulation protein E (spoVE), the B. subtilis hypothetical proteins ywcF and ylaO and the Cyanophora paradoxa cyanelle ftsW homolog.; GO: 0007049 cell cycle, 0016021 integral to membrane
Probab=29.26 E-value=54 Score=26.29 Aligned_cols=30 Identities=30% Similarity=0.716 Sum_probs=18.0
Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHhheeecC
Q psy17635 7 ATAGWFIGMLLAIAFLILVLILVCLIKRNR 36 (103)
Q Consensus 7 at~gWfIg~~~ai~lLllilli~C~i~R~r 36 (103)
..-||+.++++.++++++++.++...+|.|
T Consensus 263 ee~G~ig~i~v~~l~~~l~~~~~~~a~~~~ 292 (358)
T PF01098_consen 263 EEFGWIGGILVLLLFLLLIFRLFRIARRAK 292 (358)
T ss_pred HHHhHHHHHHHHHHHHHHHHHHHHHHhhcC
Confidence 345777666666666666666655555533
No 282
>PF04272 Phospholamban: Phospholamban; InterPro: IPR005984 Phospholamban (PLB) is a small protein (52 amino acids) that regulates the affinity of the cardiac sarcoplasmic reticulum Ca2+-ATPase (SERCA2a) for calcium. PLB is present in cardiac myocytes, in slow-twitch and smooth muscle and is expressed also in aorta endothelial cells in which it could play a role in tissue relaxation. The phosphorylation/dephosphorylation of phospholamban removes and restores, respectively, its inhibitory activity on SERCA2a. It has in fact been shown that phospholamban, in its non-phosphorylated form, binds to SERCA2a and inhibits this pump by lowering its affinity for Ca2+, whereas the phosphorylated form does not exert the inhibition. PLB is phosphorylated at two sites, namely at Ser-16 for a cAMP-dependent phosphokinase and at Thr-17 for a Ca2+/calmodulin-dependent phosphokinase, phosphorylation at Ser-16 being a prerequisite for the phosphorylation at Thr-17. The structure of a 36-amino-acid-long N-terminal fragment of human phospholamban phosphorylated at Ser-16 and Thr-17 and Cys36Ser mutated was determined from nuclear magnetic resonance data. The peptide assumes a conformation characterised by two alpha-helices connected by an irregular strand, which comprises the amino acids from Arg-13 to Pro-21. The proline is in a trans conformation. The two phosphate groups on Ser-16 and Thr-17 are shown to interact preferably with the side chains of Arg-14 and Arg-13, respectively [].; GO: 0005246 calcium channel regulator activity, 0042030 ATPase inhibitor activity, 0006816 calcium ion transport, 0016020 membrane; PDB: 1N7L_A 1FJP_A 1FJK_A 2HYN_C 1ZLL_D 1PLP_A 3O7L_I.
Probab=29.13 E-value=73 Score=19.84 Aligned_cols=22 Identities=27% Similarity=0.525 Sum_probs=10.6
Q ss_pred hhhHHHHHHHHHHHHHHHHHHH
Q psy17635 8 TAGWFIGMLLAIAFLILVLILV 29 (103)
Q Consensus 8 t~gWfIg~~~ai~lLllilli~ 29 (103)
-|--|+-+...+.-|+||.+|+
T Consensus 28 lqelfvnfclilicllli~iiv 49 (52)
T PF04272_consen 28 LQELFVNFCLILICLLLICIIV 49 (52)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 3455665544444444444443
No 283
>PF10183 ESSS: ESSS subunit of NADH:ubiquinone oxidoreductase (complex I) ; InterPro: IPR019329 NADH:ubiquinone oxidoreductase (complex I) (1.6.5.3 from EC) is a respiratory-chain enzyme that catalyses the transfer of two electrons from NADH to ubiquinone in a reaction that is associated with proton translocation across the membrane (NADH + ubiquinone = NAD+ + ubiquinol) []. Complex I is a major source of reactive oxygen species (ROS) that are predominantly formed by electron transfer from FMNH(2). Complex I is found in bacteria, cyanobacteria (as a NADH-plastoquinone oxidoreductase), archaea [], mitochondira, and in the hydrogenosome, a mitochondria-derived organelle. In general, the bacterial complex consists of 14 different subunits, while the mitochondrial complex contains homologues to these subunits in addition to approximately 31 additional proteins []. Mitochondrial complex I, which is located in the inner mitochondrial membrane, is the largest multimeric respiratory enzyme in the mitochondria, consisting of more than 40 subunits, one FMN co-factor and eight FeS clusters []. The assembly of mitochondrial complex I is an intricate process that requires the cooperation of the nuclear and mitochondrial genomes [, ]. Mitochondrial complex I can cycle between active and deactive forms that can be distinguished by the reactivity towards divalent cations and thiol-reactive agents. All redox prosthetic groups reside in the peripheral arm of the L-shaped structure. The NADH oxidation domain harbouring the FMN cofactor is connected via a chain of iron-sulphur clusters to the ubiquinone reduction site that is located in a large pocket formed by the PSST and 49kDa subunits of complex I []. This entry represents the ESSS subunit from mitochondrial NADH:ubiquinone oxidoreductase (complex I). It carries mitochondrial import sequences [].
Probab=29.09 E-value=72 Score=21.87 Aligned_cols=18 Identities=22% Similarity=0.344 Sum_probs=10.3
Q ss_pred hhHHHHHHHHHHHHHHHH
Q psy17635 9 AGWFIGMLLAIAFLILVL 26 (103)
Q Consensus 9 ~gWfIg~~~ai~lLllil 26 (103)
..||+++...++++.+++
T Consensus 61 ~~~f~~~~~~~v~~~~~~ 78 (105)
T PF10183_consen 61 LPFFFGFSGSLVFGGVFL 78 (105)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 356777666655554443
No 284
>COG4499 Predicted membrane protein [Function unknown]
Probab=29.04 E-value=58 Score=28.25 Aligned_cols=31 Identities=13% Similarity=0.405 Sum_probs=15.5
Q ss_pred CCcchhhhhHHH-HHHHHHHHHHHHHHHHhhe
Q psy17635 2 PSDTVATAGWFI-GMLLAIAFLILVLILVCLI 32 (103)
Q Consensus 2 ~~~~~at~gWfI-g~~~ai~lLllilli~C~i 32 (103)
|......+-||- |++.++|++++.+.-+.|.
T Consensus 213 pK~k~~ifk~~giGliillvl~li~~~Y~~f~ 244 (434)
T COG4499 213 PKKKYTIFKYFGIGLIILLVLLLIYFTYYYFS 244 (434)
T ss_pred ccccceehhhHHHhHHHHHHHHHHHHHHHHHH
Confidence 344456677874 4555554444444334433
No 285
>PF13124 DUF3963: Protein of unknown function (DUF3963)
Probab=29.02 E-value=1.2e+02 Score=18.08 Aligned_cols=23 Identities=22% Similarity=0.558 Sum_probs=16.3
Q ss_pred hhhHHHHHHHHHHHHHHHHHHHh
Q psy17635 8 TAGWFIGMLLAIAFLILVLILVC 30 (103)
Q Consensus 8 t~gWfIg~~~ai~lLllilli~C 30 (103)
-|-|+=-+-+..+|+++++..++
T Consensus 16 iqkwirnit~cfal~vv~lvslw 38 (40)
T PF13124_consen 16 IQKWIRNITFCFALLVVVLVSLW 38 (40)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 36788777777777777776655
No 286
>PF09988 DUF2227: Uncharacterized metal-binding protein (DUF2227); InterPro: IPR019250 This entry represents hypothetical bacterial proteins that possess metal binding properties; however, their exact function has not yet been determined.
Probab=28.87 E-value=51 Score=24.82 Aligned_cols=29 Identities=31% Similarity=0.520 Sum_probs=19.2
Q ss_pred CCCcchhhhhHHHHHHHHHHHHHHHHHHH
Q psy17635 1 MPSDTVATAGWFIGMLLAIAFLILVLILV 29 (103)
Q Consensus 1 ~~~~~~at~gWfIg~~~ai~lLllilli~ 29 (103)
++|-++=+.+|+||.+.=|+-|.++++++
T Consensus 73 ~~HRs~lSH~piiGt~~RllYL~~~~~~l 101 (169)
T PF09988_consen 73 FRHRSFLSHGPIIGTLLRLLYLALWLLLL 101 (169)
T ss_pred cCCCCcccccchhhHHHHHHHHHHHHHHH
Confidence 35667778899999877655555444443
No 287
>PRK00269 zipA cell division protein ZipA; Reviewed
Probab=28.85 E-value=32 Score=28.31 Aligned_cols=18 Identities=39% Similarity=0.368 Sum_probs=10.5
Q ss_pred HHhheeecCCCcccchhh
Q psy17635 28 LVCLIKRNRGGKYAVHER 45 (103)
Q Consensus 28 i~C~i~R~rGgKY~VkeK 45 (103)
++--.+|-||||=-+|.|
T Consensus 21 l~~~~~r~r~~~gk~~~~ 38 (293)
T PRK00269 21 LFDGWRRMRGGKGKLKFR 38 (293)
T ss_pred HHHHHHHHhccccccccc
Confidence 333446777877655544
No 288
>PRK14471 F0F1 ATP synthase subunit B; Provisional
Probab=28.60 E-value=72 Score=22.83 Aligned_cols=22 Identities=32% Similarity=0.437 Sum_probs=11.5
Q ss_pred HHHHHHHHHHHHHHHHHHHhhe
Q psy17635 11 WFIGMLLAIAFLILVLILVCLI 32 (103)
Q Consensus 11 WfIg~~~ai~lLllilli~C~i 32 (103)
|...+.-+|.|+||++++-.|+
T Consensus 8 ~~~~~~~~i~Flil~~ll~~~l 29 (164)
T PRK14471 8 FGLFFWQTILFLILLLLLAKFA 29 (164)
T ss_pred HHHHHHHHHHHHHHHHHHHHHh
Confidence 5444445566666665554443
No 289
>TIGR03068 srtB_sig_NPQTN sortase B signal domain, NPQTN class. This model represents one of the boutique (rare) sortase signals, recognized by sortase B (SrtB) rather than by the housekeeping-type SrtA class sortase. This sequence, beginning NPQTN, shows little similarity to several other SrtB substrates.
Probab=28.33 E-value=97 Score=17.83 Aligned_cols=18 Identities=17% Similarity=0.366 Sum_probs=10.2
Q ss_pred CCcchhhhhHHHHHHHHH
Q psy17635 2 PSDTVATAGWFIGMLLAI 19 (103)
Q Consensus 2 ~~~~~at~gWfIg~~~ai 19 (103)
||++.++-.+..++..+-
T Consensus 2 PQTs~gtp~y~y~Ip~v~ 19 (33)
T TIGR03068 2 PQTNAGTPAYIYAIPVAS 19 (33)
T ss_pred CCCCCCCcchhhHHHHHH
Confidence 566666665555554443
No 290
>PF08135 EPV_E5: Major transforming protein E5 family; InterPro: IPR012555 This family consists of the major transforming proteins (E5) of the bovine papilloma virus (BPV). The equine sarcoid is one of the most common dermatological lesion in equids. It is a benign, locally invasive dermal fibroblastic lesion and studies have shown an association of the lesions with BPV. E5 is a short hydrophobic membrane protein localising to the Golgi apparatus and other intracellular membranes. It binds to and constitutively activates the platelet-derived growth factor-beta in transformed cells. This stimulation activates a receptor signalling cascade which results in an intracellular growth stimulatory signal [].
Probab=28.22 E-value=98 Score=18.82 Aligned_cols=13 Identities=38% Similarity=0.874 Sum_probs=5.2
Q ss_pred HHHHHHHHHHHHH
Q psy17635 13 IGMLLAIAFLILV 25 (103)
Q Consensus 13 Ig~~~ai~lLlli 25 (103)
.+++-|+-|++|+
T Consensus 10 lgl~~~lQL~LL~ 22 (44)
T PF08135_consen 10 LGLTFALQLLLLV 22 (44)
T ss_pred HHHHHHHHHHHHH
Confidence 3444444344433
No 291
>PF07760 DUF1616: Protein of unknown function (DUF1616); InterPro: IPR011674 This is a group of sequences from hypothetical archaeal proteins. The region in question is approximately 330 amino acid residues long.
Probab=28.22 E-value=47 Score=26.23 Aligned_cols=24 Identities=33% Similarity=0.715 Sum_probs=18.1
Q ss_pred HHHHHHHHHHHHHHHHHhheeecC
Q psy17635 13 IGMLLAIAFLILVLILVCLIKRNR 36 (103)
Q Consensus 13 Ig~~~ai~lLllilli~C~i~R~r 36 (103)
.-++.++.++++++++++++||.|
T Consensus 87 ~~i~~~l~~~t~~~~~~a~~rr~~ 110 (287)
T PF07760_consen 87 IPILISLSIFTLVLSIIAYIRRRR 110 (287)
T ss_pred hHHHHHHHHHHHHHHHHHHHhccc
Confidence 445666778888888888888876
No 292
>PF07219 HemY_N: HemY protein N-terminus; InterPro: IPR010817 This entry represents the N terminus (approximately 150 residues) of bacterial HemY porphyrin biosynthesis proteins. These are membrane protein involved in a late step of protoheme IX synthesis [].
Probab=28.17 E-value=87 Score=21.08 Aligned_cols=22 Identities=36% Similarity=0.679 Sum_probs=11.0
Q ss_pred hhhhHHHHHHHHHHHHHHHHHH
Q psy17635 7 ATAGWFIGMLLAIAFLILVLIL 28 (103)
Q Consensus 7 at~gWfIg~~~ai~lLllilli 28 (103)
-+.-|+..++.+++++++.+++
T Consensus 13 e~sl~~~~~~l~~~~~~l~ll~ 34 (108)
T PF07219_consen 13 ETSLWVALILLLLLFVVLYLLL 34 (108)
T ss_pred EeeHHHHHHHHHHHHHHHHHHH
Confidence 3445666555554444444433
No 293
>TIGR00847 ccoS cytochrome oxidase maturation protein, cbb3-type. CcoS from Rhodobacter capsulatus has been shown essential for incorporation of redox-active prosthetic groups (heme, Cu) into cytochrome cbb(3) oxidase. FixS of Bradyrhizobium japonicum appears to have the same function. Members of this family are found so far in organisms with a cbb3-type cytochrome oxidase, including Neisseria meningitidis, Helicobacter pylori, Campylobacter jejuni, Caulobacter crescentus, Bradyrhizobium japonicum, and Rhodobacter capsulatus.
Probab=28.15 E-value=63 Score=20.00 Aligned_cols=27 Identities=26% Similarity=0.458 Sum_probs=11.2
Q ss_pred HHHHHHHHHHHHHHHHhheeecCCCcc
Q psy17635 14 GMLLAIAFLILVLILVCLIKRNRGGKY 40 (103)
Q Consensus 14 g~~~ai~lLllilli~C~i~R~rGgKY 40 (103)
.+|+.+.+++.++.++.|+.-=|.|.|
T Consensus 5 ~~LIpiSl~l~~~~l~~f~Wavk~GQf 31 (51)
T TIGR00847 5 TILIPISLLLGGVGLVAFLWSLKSGQY 31 (51)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHccCCC
Confidence 344444444433333333333345566
No 294
>PRK14474 F0F1 ATP synthase subunit B; Provisional
Probab=28.13 E-value=53 Score=25.70 Aligned_cols=22 Identities=41% Similarity=0.739 Sum_probs=15.2
Q ss_pred HHHHHHHHHHHHHHHHHHHhhe
Q psy17635 11 WFIGMLLAIAFLILVLILVCLI 32 (103)
Q Consensus 11 WfIg~~~ai~lLllilli~C~i 32 (103)
|+..++-+|.|+||++++--|.
T Consensus 5 ~~t~~~qiInFlILv~lL~~fl 26 (250)
T PRK14474 5 WFTVVAQIINFLILVYLLRRFL 26 (250)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 7777777777777776665544
No 295
>MTH00152 ND6 NADH dehydrogenase subunit 6; Provisional
Probab=27.94 E-value=85 Score=22.86 Aligned_cols=29 Identities=31% Similarity=0.590 Sum_probs=13.4
Q ss_pred hhHHHHHHHHHHHHHHHHHHHhheeecCC
Q psy17635 9 AGWFIGMLLAIAFLILVLILVCLIKRNRG 37 (103)
Q Consensus 9 ~gWfIg~~~ai~lLllilli~C~i~R~rG 37 (103)
..|.+-++.++.|++-.+.++-.+++++|
T Consensus 132 ~~~~~~i~l~~~Lll~Li~Vvki~~~~~g 160 (163)
T MTH00152 132 SGWSIFVFLAVLLLIVMVSVVKICYYEEG 160 (163)
T ss_pred CcHHHHHHHHHHHHHHHHHHHhhcCCCCC
Confidence 34555555554444333333444455544
No 296
>PRK13461 F0F1 ATP synthase subunit B; Provisional
Probab=27.89 E-value=76 Score=22.57 Aligned_cols=22 Identities=32% Similarity=0.362 Sum_probs=9.4
Q ss_pred HHHHHHHHHHHHHHHHHHHhhe
Q psy17635 11 WFIGMLLAIAFLILVLILVCLI 32 (103)
Q Consensus 11 WfIg~~~ai~lLllilli~C~i 32 (103)
|...+.-+|.|++|++++-.|+
T Consensus 5 ~~~~~~~~inF~il~~iL~~f~ 26 (159)
T PRK13461 5 IPTIIATIINFIILLLILKHFF 26 (159)
T ss_pred HHHHHHHHHHHHHHHHHHHHHh
Confidence 4333333444555554444333
No 297
>PF01456 Mucin: Mucin-like glycoprotein; InterPro: IPR000458 This family of trypanosomal proteins resemble vertebrate mucins. The protein consists of three regions. The N and C terminii are conserved between all members of the family, whereas the central region is not well conserved and contains a large number of threonine residues which can be glycosylated []. Indirect evidence suggested that these genes might encode the core protein of parasite mucins, glycoproteins that were proposed to be involved in the interaction with, and invasion of, mammalian host cells.
Probab=27.83 E-value=43 Score=23.26 Aligned_cols=10 Identities=40% Similarity=0.554 Sum_probs=5.7
Q ss_pred HHHHHHHHHH
Q psy17635 15 MLLAIAFLIL 24 (103)
Q Consensus 15 ~~~ai~lLll 24 (103)
||||+++|-|
T Consensus 6 LLCalLvlaL 15 (143)
T PF01456_consen 6 LLCALLVLAL 15 (143)
T ss_pred HHHHHHHHHH
Confidence 5777554444
No 298
>KOG4433|consensus
Probab=27.57 E-value=70 Score=28.43 Aligned_cols=6 Identities=33% Similarity=0.955 Sum_probs=2.8
Q ss_pred hheeec
Q psy17635 30 CLIKRN 35 (103)
Q Consensus 30 C~i~R~ 35 (103)
||.||.
T Consensus 69 cC~Rr~ 74 (526)
T KOG4433|consen 69 CCCRRE 74 (526)
T ss_pred HHcCCC
Confidence 544444
No 299
>PF09777 OSTMP1: Osteopetrosis-associated transmembrane protein 1 precursor; InterPro: IPR019172 Osteopetrosis-associated transmembrane protein 1 (OSTM1) is required for osteoclast and melanocyte maturation and function. Mutations in OSTM1 give rise to autosomal recessive osteopetrosis, also called autosomal recessive Albers-Schonberg disease [, ].
Probab=27.53 E-value=90 Score=24.43 Aligned_cols=24 Identities=17% Similarity=0.279 Sum_probs=16.7
Q ss_pred hhHHHHHHHHHHHHHHHHHHHhhe
Q psy17635 9 AGWFIGMLLAIAFLILVLILVCLI 32 (103)
Q Consensus 9 ~gWfIg~~~ai~lLllilli~C~i 32 (103)
.+|||++...+++|.+++.+.-++
T Consensus 189 ~~~~i~v~~~vl~lpv~FY~~s~~ 212 (237)
T PF09777_consen 189 ETAVIAVSVFVLFLPVLFYLSSYL 212 (237)
T ss_pred hhHHHHHHHHHHHHHHHHHHhhee
Confidence 467788877777777777665444
No 300
>COG3105 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=27.51 E-value=97 Score=23.03 Aligned_cols=28 Identities=14% Similarity=0.266 Sum_probs=19.8
Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHhheee
Q psy17635 7 ATAGWFIGMLLAIAFLILVLILVCLIKR 34 (103)
Q Consensus 7 at~gWfIg~~~ai~lLllilli~C~i~R 34 (103)
.+..|..+++..||-++|-.||+-+.+|
T Consensus 4 t~~~W~~a~igLvvGi~IG~li~Rlt~~ 31 (138)
T COG3105 4 TFMTWEYALIGLVVGIIIGALIARLTNR 31 (138)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHcch
Confidence 4567888877777777777777766654
No 301
>MTH00045 ND6 NADH dehydrogenase subunit 6; Validated
Probab=27.39 E-value=80 Score=23.22 Aligned_cols=27 Identities=11% Similarity=0.209 Sum_probs=13.5
Q ss_pred hhHHHHHHHHHHHHHHHHHHHhheeec
Q psy17635 9 AGWFIGMLLAIAFLILVLILVCLIKRN 35 (103)
Q Consensus 9 ~gWfIg~~~ai~lLllilli~C~i~R~ 35 (103)
.++.+-++|+..|++-.+.++=++|.+
T Consensus 128 ~g~~~li~~~~~LLltL~vVl~i~~~~ 154 (162)
T MTH00045 128 SGGLYLLLGGFILLVALVVALVISFGH 154 (162)
T ss_pred CCHHHHHHHHHHHHHHhhhhhheecCC
Confidence 345555666655554444444444444
No 302
>PRK03814 oxaloacetate decarboxylase subunit gamma; Provisional
Probab=27.36 E-value=1.1e+02 Score=20.57 Aligned_cols=12 Identities=25% Similarity=0.667 Sum_probs=5.2
Q ss_pred HHHHHHHHHHHh
Q psy17635 19 IAFLILVLILVC 30 (103)
Q Consensus 19 i~lLllilli~C 30 (103)
.+++.|++||.|
T Consensus 21 ~VF~fL~lLi~~ 32 (85)
T PRK03814 21 VVFIFLTLLVYL 32 (85)
T ss_pred HHHHHHHHHHHH
Confidence 444444444443
No 303
>PHA03289 envelope glycoprotein I; Provisional
Probab=27.19 E-value=61 Score=27.43 Aligned_cols=16 Identities=38% Similarity=0.661 Sum_probs=6.6
Q ss_pred HHHHHHHHHHHHHHHh
Q psy17635 15 MLLAIAFLILVLILVC 30 (103)
Q Consensus 15 ~~~ai~lLllilli~C 30 (103)
++|.|+|||..++..|
T Consensus 277 ~~~~~~l~i~~~~~~~ 292 (352)
T PHA03289 277 VLCIIALLIVTVCSAC 292 (352)
T ss_pred HHHHHHHHHHHHHHHc
Confidence 3444444444433333
No 304
>PRK11901 hypothetical protein; Reviewed
Probab=27.05 E-value=79 Score=26.50 Aligned_cols=30 Identities=20% Similarity=0.365 Sum_probs=17.7
Q ss_pred CcchhhhhHHHHHHHHHHHHHHHHHHHhhee
Q psy17635 3 SDTVATAGWFIGMLLAIAFLILVLILVCLIK 33 (103)
Q Consensus 3 ~~~~at~gWfIg~~~ai~lLllilli~C~i~ 33 (103)
.-..+.|--+||+=+. ||||||+-|---+|
T Consensus 30 k~~vSRQh~MiGiGil-VLlLLIi~IgSALk 59 (327)
T PRK11901 30 KLAVSRQHMMIGIGIL-VLLLLIIAIGSALK 59 (327)
T ss_pred CchHHHHHHHHHHHHH-HHHHHHHHHhhhcc
Confidence 3446677778887655 55555555544344
No 305
>KOG3249|consensus
Probab=27.01 E-value=94 Score=24.04 Aligned_cols=10 Identities=60% Similarity=1.002 Sum_probs=4.9
Q ss_pred eecCCCc--ccc
Q psy17635 33 KRNRGGK--YAV 42 (103)
Q Consensus 33 ~R~rGgK--Y~V 42 (103)
+|.||.+ |+|
T Consensus 144 ~rk~gEmSAYSV 155 (181)
T KOG3249|consen 144 KRKRGEMSAYSV 155 (181)
T ss_pred CCCCCccchhhh
Confidence 4555544 444
No 306
>PF11857 DUF3377: Domain of unknown function (DUF3377); InterPro: IPR021805 This domain is functionally uncharacterised and found at the C terminus of peptidases belonging to MEROPS peptidase family M10A, membrane-type matrix metallopeptidases (clan MA). ; GO: 0004222 metalloendopeptidase activity
Probab=26.96 E-value=61 Score=21.73 Aligned_cols=6 Identities=17% Similarity=0.440 Sum_probs=3.4
Q ss_pred eecCCC
Q psy17635 33 KRNRGG 38 (103)
Q Consensus 33 ~R~rGg 38 (103)
-|++|.
T Consensus 54 fkrkGt 59 (74)
T PF11857_consen 54 FKRKGT 59 (74)
T ss_pred eeecCC
Confidence 466664
No 307
>PF10327 7TM_GPCR_Sri: Serpentine type 7TM GPCR chemoreceptor Sri; InterPro: IPR019429 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The nematode Caenorhabditis elegans has only 14 types of chemosensory neuron, yet is able to sense and respond to several hundred different chemicals because each neuron detects several stimuli []. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf' []. Chemoreception in C. elegans is mediated by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs). More than 1300 potential chemoreceptor genes have been identified in C. elegans, which are generally prefixed sr for serpentine receptor. The receptor superfamilies include Sra (Sra, Srb, Srab, Sre), Str (Srh, Str, Sri, Srd, Srj, Srm, Srn) and Srg (Srx, Srt, Srg, Sru, Srv, Srxa), as well as the families Srw, Srz, Srbc, Srsx and Srr [, , ]. Many of these proteins have homologues in Caenorhabditis briggsae. This entry represents Sri, which is part of the Str superfamily of chemoreceptors.
Probab=26.92 E-value=37 Score=26.87 Aligned_cols=22 Identities=5% Similarity=0.528 Sum_probs=13.7
Q ss_pred HHHHHHHHHHHHHHHhheeecC
Q psy17635 15 MLLAIAFLILVLILVCLIKRNR 36 (103)
Q Consensus 15 ~~~ai~lLllilli~C~i~R~r 36 (103)
+..+++..-+-.|++||++|++
T Consensus 96 i~~~l~~~q~~~L~~CF~~KHQ 117 (303)
T PF10327_consen 96 IWIFLFIFQIESLVICFLRKHQ 117 (303)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 3344444445677889998753
No 308
>PHA00476 hypothetical protein
Probab=26.89 E-value=2.4e+02 Score=20.18 Aligned_cols=43 Identities=21% Similarity=0.261 Sum_probs=22.9
Q ss_pred hHHHHHHHHHHHHHHH-HHHHhheeecCCCcc-----cchhhhhhcCCC
Q psy17635 10 GWFIGMLLAIAFLILV-LILVCLIKRNRGGKY-----AVHEREAAHGRH 52 (103)
Q Consensus 10 gWfIg~~~ai~lLlli-lli~C~i~R~rGgKY-----~VkeKE~~~g~~ 52 (103)
.||--+.+-++++++. +|-.+=..|..+|-- +-||||-++|.+
T Consensus 38 ~lssfvfsSvallvil~~L~TW~TTkkpdgLNRla~l~~kekeI~AG~~ 86 (110)
T PHA00476 38 TLSSFVFSSVALLVILVLLGTWSTTRKPDGLNRLATLAEKEKEILAGKA 86 (110)
T ss_pred HHHHHHHHHHHHHHHHHHHhccccccCCCcchHHHHHHHHHHHHHcCCC
Confidence 3444444444444433 333455566656643 366777777766
No 309
>PF14030 DUF4245: Protein of unknown function (DUF4245)
Probab=26.75 E-value=73 Score=23.55 Aligned_cols=16 Identities=13% Similarity=0.256 Sum_probs=7.1
Q ss_pred HHHHHHHHHHHHHhhe
Q psy17635 17 LAIAFLILVLILVCLI 32 (103)
Q Consensus 17 ~ai~lLllilli~C~i 32 (103)
+.+++++++++|+.++
T Consensus 12 SL~vl~~~~~~i~~~~ 27 (169)
T PF14030_consen 12 SLAVLVAIVALIVAGV 27 (169)
T ss_pred HHHHHHHHHHHHHHhh
Confidence 3344444444553444
No 310
>PF07672 MFS_Mycoplasma: Mycoplasma MFS transporter; InterPro: IPR011699 These proteins share some similarity with members of the Major Facilitator Superfamily (MFS).
Probab=26.61 E-value=82 Score=25.58 Aligned_cols=31 Identities=19% Similarity=0.344 Sum_probs=21.1
Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHhheeecCC
Q psy17635 7 ATAGWFIGMLLAIAFLILVLILVCLIKRNRG 37 (103)
Q Consensus 7 at~gWfIg~~~ai~lLllilli~C~i~R~rG 37 (103)
..+.|||+.|..+.++++++.++|......+
T Consensus 169 ~kRk~fi~~~~~~gi~~~~l~~~~~~~~g~~ 199 (267)
T PF07672_consen 169 YKRKPFIHFIISLGIVFFVLSIVVVYFVGPG 199 (267)
T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHHHhCcc
Confidence 5678999988777777777677773344333
No 311
>TIGR00810 secG protein translocase, SecG subunit. This family of proteins forms a complex with SecY and SecE. SecA then recruits the SecYEG complex to form an active protein translocation channel.
Probab=26.55 E-value=94 Score=19.80 Aligned_cols=14 Identities=36% Similarity=0.752 Sum_probs=7.6
Q ss_pred HHHHHhheeecCCC
Q psy17635 25 VLILVCLIKRNRGG 38 (103)
Q Consensus 25 illi~C~i~R~rGg 38 (103)
+++++.+....||+
T Consensus 14 ~LI~~vLlQ~~k~~ 27 (73)
T TIGR00810 14 LLIGLVLLQSGKGG 27 (73)
T ss_pred HHHHHheeecCCCC
Confidence 33344556666665
No 312
>PLN03150 hypothetical protein; Provisional
Probab=26.48 E-value=39 Score=29.48 Aligned_cols=10 Identities=30% Similarity=0.644 Sum_probs=4.6
Q ss_pred HHHHhheeec
Q psy17635 26 LILVCLIKRN 35 (103)
Q Consensus 26 lli~C~i~R~ 35 (103)
++++|+.+|.
T Consensus 562 ~~~~~~~~~r 571 (623)
T PLN03150 562 ICAMCWWKRR 571 (623)
T ss_pred HHHhhheeeh
Confidence 3334555554
No 313
>PF11696 DUF3292: Protein of unknown function (DUF3292); InterPro: IPR021709 This eukaryotic family of proteins has no known function.
Probab=26.46 E-value=70 Score=29.07 Aligned_cols=24 Identities=21% Similarity=0.607 Sum_probs=17.5
Q ss_pred hhhhhHHHHHHHHHHHHHHHHHHH
Q psy17635 6 VATAGWFIGMLLAIAFLILVLILV 29 (103)
Q Consensus 6 ~at~gWfIg~~~ai~lLllilli~ 29 (103)
.....|+.-++...++.+||+||+
T Consensus 133 vYf~aW~~dll~p~~~~~L~~li~ 156 (642)
T PF11696_consen 133 VYFIAWLLDLLVPAFFAFLIALIL 156 (642)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhc
Confidence 345678888888877777777775
No 314
>PF11466 Doppel: Prion-like protein Doppel; InterPro: IPR021566 Dpl is a homologue related to the prion protein (PrP). Dpl is toxic to neurons and is expressed in the brains of mice that do not express PrP. In DHPC and SDS micelles, Dpl shoes about 40% alpha-helical structure however in aqueous solution it consists of a random coil. The alpha helical segment can adopt a transmembrane localisation also in a membrane. The unprocessed Dpl protein is thought to posses a possible channel formation mechanism which may be related to toxicity through direct interaction with cell membranes and damage to the cell membrane. ; PDB: 1Z65_A.
Probab=26.41 E-value=62 Score=18.27 Aligned_cols=12 Identities=25% Similarity=1.088 Sum_probs=8.0
Q ss_pred hHHHHHHHHHHH
Q psy17635 10 GWFIGMLLAIAF 21 (103)
Q Consensus 10 gWfIg~~~ai~l 21 (103)
+|-++++|.+++
T Consensus 7 ~~~lAi~c~LL~ 18 (30)
T PF11466_consen 7 GWWLAIVCVLLF 18 (30)
T ss_dssp SHHHHHHHHHHH
T ss_pred hHHHHHHHHHHH
Confidence 677777776443
No 315
>PF02430 AMA-1: Apical membrane antigen 1; InterPro: IPR003298 A novel antigen of Plasmodium falciparum has been cloned that contains a hydrophobic domain typical of an integral membrane protein. The antigen is designated apical membrane antigen 1 (AMA-1) by virtue of appearing to be located in the apical complex []. AMA-1 appears to be transported to the merozoite surface close to the time of schizont rupture. The 66kDa merozoite surface antigen (PK66) of Plasmodium knowlesi, a simian malaria, possesses vaccine-related properties believed to originate from a receptor-like role in parasite invasion of erythrocytes []. The sequence of PK66 is conserved throughout plasmodium, and shows high similarity to P. falciparum AMA-1. Following schizont rupture, the distribution of PK66 changes in a coordinate manner associated with merozoite invasion. Prior to rupture, the protein is concentrated at the apical end, following which it distributes itself entirely across the surface of the free merozoite. Immunofluorescence studies suggest that, during invasion, PK66 is excluded from the erythrocyte at, and behind, the invasion interface []. ; GO: 0009405 pathogenesis, 0016020 membrane; PDB: 2Y8R_D 2X2Z_A 2Y8T_A 2Y8S_D 1HN6_A 2Z8V_B 2Z8W_A 1Z40_E 2Q8B_A 2Q8A_A ....
Probab=26.38 E-value=22 Score=31.11 Aligned_cols=25 Identities=24% Similarity=0.265 Sum_probs=0.0
Q ss_pred hHHHHHHHHHHHHHHHHHHHhheee
Q psy17635 10 GWFIGMLLAIAFLILVLILVCLIKR 34 (103)
Q Consensus 10 gWfIg~~~ai~lLllilli~C~i~R 34 (103)
-=.|.+++++++.||+|+++++.||
T Consensus 429 ~~~~~~~~~~~~~~~~l~~~~~~k~ 453 (471)
T PF02430_consen 429 RIVIIISIATGAVLLALAIYWYYKR 453 (471)
T ss_dssp -------------------------
T ss_pred eeehhhHHHHHHHHHHHHHHHHHhh
Confidence 3456677778888888888887774
No 316
>PF06678 DUF1179: Protein of unknown function (DUF1179); InterPro: IPR009564 This family consists of several hypothetical Caenorhabditis elegans proteins of around 106 residues in length. The function of the family is unknown.
Probab=26.36 E-value=93 Score=22.08 Aligned_cols=10 Identities=30% Similarity=0.514 Sum_probs=5.4
Q ss_pred hHHHHHHHHH
Q psy17635 10 GWFIGMLLAI 19 (103)
Q Consensus 10 gWfIg~~~ai 19 (103)
-|||.-.+.+
T Consensus 4 f~~i~~~i~l 13 (103)
T PF06678_consen 4 FWFILKSIFL 13 (103)
T ss_pred hHHHHHHHHH
Confidence 4776554444
No 317
>COG3736 VirB8 Type IV secretory pathway, component VirB8 [Intracellular trafficking and secretion]
Probab=26.32 E-value=1e+02 Score=24.65 Aligned_cols=24 Identities=17% Similarity=0.502 Sum_probs=11.7
Q ss_pred hhHHHHHHHHHHHHHHHHHHHhhe
Q psy17635 9 AGWFIGMLLAIAFLILVLILVCLI 32 (103)
Q Consensus 9 ~gWfIg~~~ai~lLllilli~C~i 32 (103)
..|.++.+..++.+++++.|+|++
T Consensus 42 ~~~~va~~~~~l~v~~~~~Ia~ll 65 (239)
T COG3736 42 LAWRVAILFTLLAVAAVIAIAILL 65 (239)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhhc
Confidence 345555544444444445555544
No 318
>PF01561 Hanta_G2: Hantavirus glycoprotein G2; InterPro: IPR002532 The medium (M) genome segment of Hantaviruses (family Bunyaviridae) encodes the two virion glycoproteins [], G1 and G2, as a polyprotein precursor. This entry represents the polyprotein region which forms the G2 glycoprotein.; GO: 0030683 evasion by virus of host immune response, 0044423 virion part
Probab=26.27 E-value=34 Score=29.96 Aligned_cols=16 Identities=19% Similarity=0.268 Sum_probs=6.7
Q ss_pred HHHHHHHHHHHhheee
Q psy17635 19 IAFLILVLILVCLIKR 34 (103)
Q Consensus 19 i~lLllilli~C~i~R 34 (103)
|.|+-|||+++||=.|
T Consensus 465 lll~siil~s~~cP~r 480 (485)
T PF01561_consen 465 LLLLSIILFSFFCPVR 480 (485)
T ss_pred HHHHHHHHHheeCcch
Confidence 3333344444444333
No 319
>TIGR01478 STEVOR variant surface antigen, stevor family. This model represents the stevor branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of stevor sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 8 bits.
Probab=26.25 E-value=51 Score=27.25 Aligned_cols=12 Identities=33% Similarity=0.642 Sum_probs=4.8
Q ss_pred HHHHHHHHHHHH
Q psy17635 17 LAIAFLILVLIL 28 (103)
Q Consensus 17 ~ai~lLllilli 28 (103)
|.|+.|+||+|+
T Consensus 261 cgiaalvllil~ 272 (295)
T TIGR01478 261 YGIAALVLIILT 272 (295)
T ss_pred cHHHHHHHHHHH
Confidence 444444443333
No 320
>PF01601 Corona_S2: Coronavirus S2 glycoprotein; InterPro: IPR002552 The type I glycoprotein S of Coronavirus, trimers of which constitute the typical viral spikes, is assembled into virions through noncovalent interactions with the M protein. The spike glycoprotein is translated as a large polypeptide that is subsequently cleaved to S1 IPR002551 from INTERPRO and S2 []. Both chimeric S proteins appeared to cause cell fusion when expressed individually, suggesting that they were biologically fully active []. The spike is a type I membrane glycoprotein that possesses a conserved transmembrane anchor and an unusual cysteine-rich (cys) domain that bridges the putative junction of the anchor and the cytoplasmic tail [].; GO: 0006944 cellular membrane fusion, 0046813 virion attachment, binding of host cell surface receptor, 0016021 integral to membrane, 0019031 viral envelope; PDB: 2BEQ_B 2FXP_A 1ZVB_A 1WNC_D 1ZV8_H 1ZV7_B 1WYY_B 1ZVA_A 2BEZ_F 1WDG_A ....
Probab=26.25 E-value=22 Score=32.01 Aligned_cols=15 Identities=13% Similarity=0.740 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHH
Q psy17635 11 WFIGMLLAIAFLILV 25 (103)
Q Consensus 11 WfIg~~~ai~lLlli 25 (103)
|-+.|+.++++++++
T Consensus 548 WyVWL~i~~~li~~~ 562 (610)
T PF01601_consen 548 WYVWLAIILALIAFA 562 (610)
T ss_dssp ---------------
T ss_pred HHHHHHHHHHHHHHH
Confidence 666666555555444
No 321
>PF14316 DUF4381: Domain of unknown function (DUF4381)
Probab=26.25 E-value=53 Score=23.28 Aligned_cols=28 Identities=32% Similarity=0.592 Sum_probs=11.1
Q ss_pred HHHHHHHHH-HHHHHHHHhh-eeecCCCcc
Q psy17635 13 IGMLLAIAF-LILVLILVCL-IKRNRGGKY 40 (103)
Q Consensus 13 Ig~~~ai~l-Lllilli~C~-i~R~rGgKY 40 (103)
+|--+.+++ ++++++++.+ .+|.+--+|
T Consensus 21 ~GWwll~~lll~~~~~~~~~~~r~~~~~~y 50 (146)
T PF14316_consen 21 PGWWLLLALLLLLLILLLWRLWRRWRRNRY 50 (146)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHccHH
Confidence 444444343 3333333333 233333445
No 322
>PF05454 DAG1: Dystroglycan (Dystrophin-associated glycoprotein 1); InterPro: IPR008465 Dystroglycan is one of the dystrophin-associated glycoproteins, which is encoded by a 5.5 kb transcript in Homo sapiens. The protein product is cleaved into two non-covalently associated subunits, [alpha] (N-terminal) and [beta] (C-terminal). In skeletal muscle the dystroglycan complex works as a transmembrane linkage between the extracellular matrix and the cytoskeleton [alpha]-dystroglycan is extracellular and binds to merosin ([alpha]-2 laminin) in the basement membrane, while [beta]-dystroglycan is a transmembrane protein and binds to dystrophin, which is a large rod-like cytoskeletal protein, absent in Duchenne muscular dystrophy patients. Dystrophin binds to intracellular actin cables. In this way, the dystroglycan complex, which links the extracellular matrix to the intracellular actin cables, is thought to provide structural integrity in muscle tissues. The dystroglycan complex is also known to serve as an agrin receptor in muscle, where it may regulate agrin-induced acetylcholine receptor clustering at the neuromuscular junction. There is also evidence which suggests the function of dystroglycan as a part of the signal transduction pathway because it is shown that Grb2, a mediator of the Ras-related signal pathway, can interact with the cytoplasmic domain of dystroglycan. In general, aberrant expression of dystrophin-associated protein complex underlies the pathogenesis of Duchenne muscular dystrophy, Becker muscular dystrophy and severe childhood autosomal recessive muscular dystrophy. Interestingly, no genetic disease has been described for either [alpha]- or [beta]-dystroglycan. Dystroglycan is widely distributed in non-muscle tissues as well as in muscle tissues. During epithelial morphogenesis of kidney, the dystroglycan complex is shown to act as a receptor for the basement membrane. Dystroglycan expression in Mus musculus brain and neural retina has also been reported. However, the physiological role of dystroglycan in non-muscle tissues has remained unclear [].; PDB: 1EG4_P.
Probab=26.24 E-value=22 Score=29.00 Aligned_cols=7 Identities=14% Similarity=-0.206 Sum_probs=0.0
Q ss_pred hheeecC
Q psy17635 30 CLIKRNR 36 (103)
Q Consensus 30 C~i~R~r 36 (103)
+.+.-=|
T Consensus 165 Ia~icyr 171 (290)
T PF05454_consen 165 IACICYR 171 (290)
T ss_dssp -------
T ss_pred HHHHhhh
Confidence 4444444
No 323
>CHL00080 psbM photosystem II protein M
Probab=26.19 E-value=1.4e+02 Score=17.26 Aligned_cols=19 Identities=26% Similarity=0.396 Sum_probs=7.5
Q ss_pred HHHHHHHHHHHHHHHHHHh
Q psy17635 12 FIGMLLAIAFLILVLILVC 30 (103)
Q Consensus 12 fIg~~~ai~lLllilli~C 30 (103)
||+.+..|.+=...++|+.
T Consensus 8 fiAt~LFi~iPt~FLlily 26 (34)
T CHL00080 8 FIATALFILVPTAFLLIIY 26 (34)
T ss_pred HHHHHHHHHHHHHHHHHhh
Confidence 4444433333333344443
No 324
>PRK14473 F0F1 ATP synthase subunit B; Provisional
Probab=26.17 E-value=81 Score=22.55 Aligned_cols=20 Identities=25% Similarity=0.366 Sum_probs=9.9
Q ss_pred HHHHHHHHHHHHHHHHHHHh
Q psy17635 11 WFIGMLLAIAFLILVLILVC 30 (103)
Q Consensus 11 WfIg~~~ai~lLllilli~C 30 (103)
|.+.+.-+|.|++|++++--
T Consensus 8 ~~~~~~~~inflil~~lL~~ 27 (164)
T PRK14473 8 LGLLIAQLINFLLLIFLLRT 27 (164)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 44444445555555544433
No 325
>TIGR01655 yxeA_fam conserved hypothetical protein TIGR01655. This model represents a family of small (about 115 amino acids) uncharacterized proteins with N-terminal signal sequences, found exclusively in Gram-positive organisms. Most genomes that have any members of this family have at least two members.
Probab=26.12 E-value=73 Score=22.09 Aligned_cols=8 Identities=13% Similarity=0.459 Sum_probs=4.2
Q ss_pred cccchhhh
Q psy17635 39 KYAVHERE 46 (103)
Q Consensus 39 KY~VkeKE 46 (103)
.|=|+.+.
T Consensus 32 nyyvki~~ 39 (114)
T TIGR01655 32 NYYIKIKN 39 (114)
T ss_pred ccEEEEee
Confidence 45566554
No 326
>PRK00888 ftsB cell division protein FtsB; Reviewed
Probab=26.07 E-value=82 Score=21.66 Aligned_cols=12 Identities=25% Similarity=0.293 Sum_probs=5.9
Q ss_pred HHHHHHHHHHHH
Q psy17635 11 WFIGMLLAIAFL 22 (103)
Q Consensus 11 WfIg~~~ai~lL 22 (103)
||+.++++++++
T Consensus 3 ~~~~vll~ll~~ 14 (105)
T PRK00888 3 LLTLLLLALLVW 14 (105)
T ss_pred HHHHHHHHHHHH
Confidence 555555444333
No 327
>PF10112 Halogen_Hydrol: 5-bromo-4-chloroindolyl phosphate hydrolysis protein; InterPro: IPR018770 This entry consists of prokaryotic proteins that mediate the hydrolysis of 5-bromo-4-chloroindolyl phosphate bonds.
Probab=26.01 E-value=86 Score=23.15 Aligned_cols=9 Identities=44% Similarity=1.272 Sum_probs=4.8
Q ss_pred hHHHHHHHH
Q psy17635 10 GWFIGMLLA 18 (103)
Q Consensus 10 gWfIg~~~a 18 (103)
+||+|++++
T Consensus 10 ~~~~~~~~~ 18 (199)
T PF10112_consen 10 RWILGVLIA 18 (199)
T ss_pred HHHHHHHHH
Confidence 456665444
No 328
>PF01167 Tub: Tub family; InterPro: IPR000007 Tubby, an autosomal recessive mutation, mapping to mouse chromosome 7, was recently found to be the result of a splicing defect in a novel gene with unknown function. This mutation maps to the tub gene [, ]. The mouse tubby mutation is the cause of maturity-onset obesity, insulin resistance and sensory deficits. By contrast with the rapid juvenile-onset weight gain seen in diabetes (db) and obese (ob) mice, obesity in tubby mice develops gradually, and strongly resembles the late-onset obesity observed in the human population. Excessive deposition of adipose tissue culminates in a two-fold increase of body weight. Tubby mice also suffer retinal degeneration and neurosensory hearing loss. The tripartite character of the tubby phenotype is highly similar to human obesity syndromes, such as Alstrom and Bardet-Biedl. Although these phenotypes indicate a vital role for tubby proteins, no biochemical function has yet been ascribed to any family member [], although it has been suggested that the phenotypic features of tubby mice may be the result of cellular apoptosis triggered by expression of the mutated tub gene. TUB is the founding-member of the tubby-like proteins, the TULPs. TULPs are found in multicellular organisms from both the plant and animal kingdoms. Ablation of members of this protein family cause disease phenotypes that are indicative of their importance in nervous-system function and development []. Mammalian TUB is a hydrophilic protein of ~500 residues. The N-terminal (IPR005398 from INTERPRO) portion of the protein is conserved neither in length nor sequence, but, in TUB, contains the nuclear localisation signal and may have transcriptional-activation activity. The C-terminal 250 residues are highly conserved. The C-terminal extremity contains a cysteine residue that might play an important role in the normal functioning of these proteins. The crystal structure of the C-terminal core domain from mouse tubby has been determined to 1.9A resolution. This domain is arranged as a 12-stranded, all anti-parallel, closed beta-barrel that surrounds a central alpha helix, (which is at the extreme carboxyl terminus of the protein) that forms most of the hydrophobic core. Structural analyses suggest that TULPs constitute a unique family of bipartite transcription factors [].; PDB: 3C5N_B 2FIM_A 1I7E_A 1C8Z_A 1S31_A.
Probab=25.93 E-value=21 Score=28.01 Aligned_cols=12 Identities=33% Similarity=0.958 Sum_probs=9.7
Q ss_pred HHHhheeecCCC
Q psy17635 27 ILVCLIKRNRGG 38 (103)
Q Consensus 27 li~C~i~R~rGg 38 (103)
.|.|+|+|+|+|
T Consensus 7 ~vqC~I~R~k~g 18 (246)
T PF01167_consen 7 PVQCFIRRDKSG 18 (246)
T ss_dssp EEEEEEEEESTT
T ss_pred EEEEEEEEECCC
Confidence 356999999876
No 329
>PRK10905 cell division protein DamX; Validated
Probab=25.78 E-value=50 Score=27.68 Aligned_cols=17 Identities=18% Similarity=0.321 Sum_probs=9.6
Q ss_pred HHHHHHHHHHHHHhhee
Q psy17635 17 LAIAFLILVLILVCLIK 33 (103)
Q Consensus 17 ~ai~lLllilli~C~i~ 33 (103)
.++||||||+-|---+|
T Consensus 5 GilVLlLLIigIgSALk 21 (328)
T PRK10905 5 GILVLLLLIIGIGSALK 21 (328)
T ss_pred hHHHHHHHHHHHhHhhc
Confidence 34566666666654444
No 330
>MTH00166 ND6 NADH dehydrogenase subunit 6; Provisional
Probab=25.73 E-value=95 Score=22.41 Aligned_cols=25 Identities=20% Similarity=0.479 Sum_probs=12.6
Q ss_pred HHHHHHHHHHHHHHHHHhheeecCC
Q psy17635 13 IGMLLAIAFLILVLILVCLIKRNRG 37 (103)
Q Consensus 13 Ig~~~ai~lLllilli~C~i~R~rG 37 (103)
+-++.++-|++..+.++..+++++|
T Consensus 130 ~~i~l~~yLll~Li~vvki~~~~~g 154 (160)
T MTH00166 130 ITLFMIIYLLLTLIVVVKITNIFKG 154 (160)
T ss_pred HHHHHHHHHHHHHHHHHHHHccCCC
Confidence 3344444444444445566666665
No 331
>PLN00028 nitrate transmembrane transporter; Provisional
Probab=25.64 E-value=93 Score=25.42 Aligned_cols=24 Identities=21% Similarity=0.213 Sum_probs=13.8
Q ss_pred hHHHHHHHHHHHHHHHHHHHhhee
Q psy17635 10 GWFIGMLLAIAFLILVLILVCLIK 33 (103)
Q Consensus 10 gWfIg~~~ai~lLllilli~C~i~ 33 (103)
+|-.+++.+.+++++-+++.++++
T Consensus 410 ~y~~~f~~~~~~~~i~~~~~~~~~ 433 (476)
T PLN00028 410 STETGISLMGVMIIACTLPVAFIH 433 (476)
T ss_pred cHhhHHHHHHHHHHHHHHHHHhee
Confidence 466666666555555555556564
No 332
>PTZ00370 STEVOR; Provisional
Probab=25.58 E-value=55 Score=27.09 Aligned_cols=18 Identities=28% Similarity=0.137 Sum_probs=6.7
Q ss_pred hhhHHHHHHHHHHHHHHH
Q psy17635 8 TAGWFIGMLLAIAFLILV 25 (103)
Q Consensus 8 t~gWfIg~~~ai~lLlli 25 (103)
+.++-|-.=|.|+.|+|+
T Consensus 248 taAsaaF~Pygiaalvll 265 (296)
T PTZ00370 248 SAASSAFYPYGIAALVLL 265 (296)
T ss_pred HHHHHhhcccHHHHHHHH
Confidence 333333333334433333
No 333
>TIGR03644 marine_trans_1 probable ammonium transporter, marine subtype. Members of this protein family are well conserved subclass of putative ammonimum transporters, belonging to the much broader set of ammonium/methylammonium transporter described by TIGR00836. Species with this transporter tend to be marine bacteria. Partial phylogenetic profiling (PPP) picks a member of this protein family as the single best-scoring protein vs. a reference profile for the marine environment Genome Property for a large number of different query genomes. This finding by PPP suggests that this transporter family represents an important adaptation to the marine environment.
Probab=25.50 E-value=94 Score=26.27 Aligned_cols=38 Identities=24% Similarity=0.368 Sum_probs=19.1
Q ss_pred HHHHHHHHHHHHHHHHhheeecCCCcccchhhhhhcCCC
Q psy17635 14 GMLLAIAFLILVLILVCLIKRNRGGKYAVHEREAAHGRH 52 (103)
Q Consensus 14 g~~~ai~lLllilli~C~i~R~rGgKY~VkeKE~~~g~~ 52 (103)
++++.++.-.++-++++++-|.-++ ..|+++|+..|-+
T Consensus 355 g~~~~~~~~~~~s~~~~~il~~~~~-lRv~~e~E~~GlD 392 (404)
T TIGR03644 355 GAATIFAWVFVTSLIVWFILKATMG-IRVSEEEEYEGLD 392 (404)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhccc-ccCCHHHHHcCCC
Confidence 3344444433444444444343333 6788777777643
No 334
>TIGR01432 QOXA cytochrome aa3 quinol oxidase, subunit II. This enzyme catalyzes the oxidation of quinol with the concomitant reduction of molecular oxygen to water. This acts as the terminal electron acceptor in the respiratory chain. This subunit contains two transmembrane helices and a large external domain responsible for the binding and oxidation of quinol. QuoX is (presently) only found in gram positive bacteria of the Bacillus/Staphylococcus group. Like CyoA, the ubiquinol oxidase found in proteobacteria, the residues responsible for the ligation of Cu(a) and cytochrome c (found in the related cyt. c oxidases) are absent. Unlike CyoA, QoxA is in complex with a subunit I which contains cytochromes a similar to the cyt. c oxidases (as opposed to cytochromes b).
Probab=25.43 E-value=55 Score=24.88 Aligned_cols=16 Identities=38% Similarity=0.804 Sum_probs=6.6
Q ss_pred HHHHHHHHHHHHHhhe
Q psy17635 17 LAIAFLILVLILVCLI 32 (103)
Q Consensus 17 ~ai~lLllilli~C~i 32 (103)
++|++++.++++.++.
T Consensus 31 ~iI~lvv~~lli~~~~ 46 (217)
T TIGR01432 31 LVIVFVVFVLFTIFLV 46 (217)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3344333444444444
No 335
>PF09911 DUF2140: Uncharacterized protein conserved in bacteria (DUF2140); InterPro: IPR018672 This family of conserved hypothetical proteins has no known function.
Probab=25.42 E-value=65 Score=24.26 Aligned_cols=19 Identities=21% Similarity=0.462 Sum_probs=11.4
Q ss_pred hHHHHHHHHHHHHHHHHHH
Q psy17635 10 GWFIGMLLAIAFLILVLIL 28 (103)
Q Consensus 10 gWfIg~~~ai~lLllilli 28 (103)
.|-.+.++.+++++++++.
T Consensus 2 ~WK~aF~~Lla~~l~~~~~ 20 (187)
T PF09911_consen 2 WWKWAFLILLALNLAFVIV 20 (187)
T ss_pred hHHHHHHHHHHHHHHHHhh
Confidence 4777777766655544433
No 336
>PF05399 EVI2A: Ectropic viral integration site 2A protein (EVI2A); InterPro: IPR008608 This family contains several mammalian ectropic viral integration site 2A (EVI2A) proteins. The function of this protein is unknown although it is thought to be a membrane protein and may function as an oncogene in retrovirus induced myeloid tumours [, ].; GO: 0016021 integral to membrane
Probab=25.35 E-value=68 Score=25.65 Aligned_cols=8 Identities=50% Similarity=0.962 Sum_probs=4.3
Q ss_pred eeecCC-Cc
Q psy17635 32 IKRNRG-GK 39 (103)
Q Consensus 32 i~R~rG-gK 39 (103)
.||++. ||
T Consensus 161 LKrskQ~gK 169 (227)
T PF05399_consen 161 LKRSKQVGK 169 (227)
T ss_pred HHHHHHhhc
Confidence 356655 54
No 337
>PF13038 DUF3899: Domain of unknown function (DUF3899)
Probab=25.30 E-value=1.1e+02 Score=19.93 Aligned_cols=25 Identities=28% Similarity=0.506 Sum_probs=12.7
Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHhh
Q psy17635 7 ATAGWFIGMLLAIAFLILVLILVCL 31 (103)
Q Consensus 7 at~gWfIg~~~ai~lLllilli~C~ 31 (103)
....|...++.+.++++++.+++++
T Consensus 67 ~~~~~~~~~ll~~~ll~l~~iil~f 91 (92)
T PF13038_consen 67 RVSRWTYPLLLIGLLLILLSIILSF 91 (92)
T ss_pred HhHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3345766666555444444444443
No 338
>PF00003 7tm_3: 7 transmembrane sweet-taste receptor of 3 GCPR; InterPro: IPR017978 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). GPCR family 3 receptors (also known as family C) are structurally similar to other GPCRs, but do not show any significant sequence similarity and thus represent a distinct group. Structurally they are composed of four elements; an N-terminal signal sequence; a large hydrophilic extracellular agonist-binding region containing several conserved cysteine residues which could be involved in disulphide bonds; a shorter region containing seven transmembrane domains; and a C-terminal cytoplasmic domain of variable length []. Family 3 members include the metabotropic glutamate receptors, the extracellular calcium-sensing receptors, the gamma-amino-butyric acid (GABA) type B receptors, and the vomeronasal type-2 receptors [, , , ]. As these receptors regulate many important physiological processes they are potentially promising targets for drug development. This entry represents the C-terminal region of family 3 GPCR receptor proteins, which contains the seven transmembrane region. The seven TM regions assemble in such a way as to produce a docking pocket into which such molecules as cyclamate and lactisole have been found to bind and consequently confer the taste of sweetness []. ; GO: 0004930 G-protein coupled receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane
Probab=25.27 E-value=88 Score=22.88 Aligned_cols=34 Identities=21% Similarity=0.334 Sum_probs=25.5
Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHhheeecCCCcc
Q psy17635 7 ATAGWFIGMLLAIAFLILVLILVCLIKRNRGGKY 40 (103)
Q Consensus 7 at~gWfIg~~~ai~lLllilli~C~i~R~rGgKY 40 (103)
.+..|++..+.--++|+++.+.+++-.|+--..|
T Consensus 146 ~~~~~~~~~~~y~~~Ll~~~~~la~~~R~~~~~~ 179 (238)
T PF00003_consen 146 NSNIWLILSLGYNGLLLLIGFFLAFKTRNVPSNF 179 (238)
T ss_pred CccchHHHHHHHHHHHHHHHHHHHHhhCCCCccc
Confidence 5667998888888888888888887767655554
No 339
>TIGR00889 2A0110 nucleoside transporter. This family of proteins transports nucleosides at a high affinity. The transport mechanism is driven by proton motive force. This family includes nucleoside permease NupG and xanthosine permease from E.Coli.
Probab=25.12 E-value=79 Score=25.35 Aligned_cols=9 Identities=33% Similarity=0.567 Sum_probs=4.0
Q ss_pred ccchhhhhh
Q psy17635 40 YAVHEREAA 48 (103)
Q Consensus 40 Y~VkeKE~~ 48 (103)
-.|.|++.+
T Consensus 409 ~~~~~~~~~ 417 (418)
T TIGR00889 409 IKVDDRDVA 417 (418)
T ss_pred CCCcccccc
Confidence 345554433
No 340
>PRK07718 fliL flagellar basal body-associated protein FliL; Reviewed
Probab=25.06 E-value=1.2e+02 Score=21.71 Aligned_cols=26 Identities=12% Similarity=0.114 Sum_probs=12.0
Q ss_pred HHHHHHHHHHHHHHHHHHHhheeecC
Q psy17635 11 WFIGMLLAIAFLILVLILVCLIKRNR 36 (103)
Q Consensus 11 WfIg~~~ai~lLllilli~C~i~R~r 36 (103)
|+|.++..|+++++......|..+..
T Consensus 6 ~~i~~i~l~~l~~~g~~~~~~~~~~~ 31 (142)
T PRK07718 6 IKIMLIILIVIALIGTAALVLVMGFS 31 (142)
T ss_pred HHHHHHHHHHHHHHHHHHHhhhcccC
Confidence 44544444444444444445555543
No 341
>PF14715 FixP_N: N-terminal domain of cytochrome oxidase-cbb3, FixP
Probab=24.99 E-value=1.2e+02 Score=18.58 Aligned_cols=20 Identities=25% Similarity=0.364 Sum_probs=15.2
Q ss_pred hhHHHHHHHHHHHHHHHHHH
Q psy17635 9 AGWFIGMLLAIAFLILVLIL 28 (103)
Q Consensus 9 ~gWfIg~~~ai~lLllilli 28 (103)
..|++.+.+.|++.+..++.
T Consensus 22 ~ww~~~f~~tivfa~~Y~~~ 41 (51)
T PF14715_consen 22 RWWLWLFYGTIVFAVGYLVL 41 (51)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 47888888888888776654
No 342
>TIGR03778 VPDSG_CTERM VPDSG-CTERM exosortase interaction domain. Through in silico analysis, we previously described the PEP-CTERM/exosortase system (PubMed:16930487). This model describes a PEP-CTERM-like variant C-terminal protein sorting signal, as found at the C-terminus of twenty otherwise unrelated proteins in Verrucomicrobiae bacterium DG1235. The variant motif, VPDSG, seems an intermediate between the VPEP motif (TIGR02595) of typical exosortase systems and the classical LPXTG of sortase in Gram-positive bacteria.
Probab=24.92 E-value=64 Score=17.62 Aligned_cols=16 Identities=31% Similarity=0.281 Sum_probs=7.6
Q ss_pred HHHHHHHHHHhheeec
Q psy17635 20 AFLILVLILVCLIKRN 35 (103)
Q Consensus 20 ~lLllilli~C~i~R~ 35 (103)
+||.+-++++..++|.
T Consensus 9 ~Ll~~~l~~l~~~rRr 24 (26)
T TIGR03778 9 ALLGLGLLGLLGLRRR 24 (26)
T ss_pred HHHHHHHHHHHHHhhc
Confidence 4444444444455553
No 343
>PF11862 DUF3382: Domain of unknown function (DUF3382); InterPro: IPR021807 This entry represents the N-terminal domain of the LivHM type high-affinity branched-chain amino acid transport system permease proteins. The domain is about 100 amino acids in length, and is found associated with PF02653 from PFAM.
Probab=24.79 E-value=1.1e+02 Score=20.59 Aligned_cols=36 Identities=25% Similarity=0.305 Sum_probs=25.7
Q ss_pred hHHHHHHHHHHHHHHHHHHHhheeecCCCcccchhh
Q psy17635 10 GWFIGMLLAIAFLILVLILVCLIKRNRGGKYAVHER 45 (103)
Q Consensus 10 gWfIg~~~ai~lLllilli~C~i~R~rGgKY~VkeK 45 (103)
.+-=++++|++.++|...++.+-..+.|++..|.-+
T Consensus 4 ~Lk~Al~aall~lvl~~pi~Gl~l~~~g~~L~~~~r 39 (101)
T PF11862_consen 4 NLKDALFAALLALVLFGPIVGLKLDNQGGQLVLEPR 39 (101)
T ss_pred HHHHHHHHHHHHHHHHHHheEEEEecCCcEEEEEec
Confidence 445567777777777777777777777887776544
No 344
>PF15048 OSTbeta: Organic solute transporter subunit beta protein
Probab=24.71 E-value=76 Score=23.22 Aligned_cols=28 Identities=25% Similarity=0.396 Sum_probs=13.7
Q ss_pred HHHHHHHHHHHhheeecCCCcccchhhh
Q psy17635 19 IAFLILVLILVCLIKRNRGGKYAVHERE 46 (103)
Q Consensus 19 i~lLllilli~C~i~R~rGgKY~VkeKE 46 (103)
++++|=++|+.-=|..||.-|-.-.+|+
T Consensus 45 vvlvi~~~LLgrsi~ANRnrK~~~~~k~ 72 (125)
T PF15048_consen 45 VVLVISFFLLGRSIQANRNRKMQPQEKQ 72 (125)
T ss_pred HHHHHHHHHHHHHhHhcccccccccccc
Confidence 3333333333334566776675544454
No 345
>PF02790 COX2_TM: Cytochrome C oxidase subunit II, transmembrane domain This family corresponds to chains b and o.; InterPro: IPR011759 Cytochrome c oxidase (1.9.3.1 from EC) [, ] is an oligomeric enzymatic complex which is a component of the respiratory chain and is involved in the transfer of electrons from cytochrome c to oxygen. In eukaryotes this enzyme complex is located in the mitochondrial inner membrane; in aerobic prokaryotes it is found in the plasma membrane. The enzyme complex consists of 3-4 subunits (prokaryotes) to up to 13 polypeptides (mammals). Subunit 2 (CO II) transfers the electrons from cytochrome c to the catalytic subunit 1. It contains two adjacent transmembrane regions in its N terminus and the major part of the protein is exposed to the periplasmic or to the mitochondrial intermembrane space, respectively. CO II provides the substrate-binding site and contains a copper centre called Cu(A) (see IPR001505 from INTERPRO), probably the primary acceptor in cytochrome c oxidase. An exception is the corresponding subunit of the cbb3-type oxidase which lacks the copper A redox-centre. Several bacterial CO II have a C-terminal extension that contains a covalently bound haem c. The N-terminal domain of cytochrome C oxidase contains two transmembrane alpha-helices. ; GO: 0004129 cytochrome-c oxidase activity, 0005507 copper ion binding, 0009055 electron carrier activity, 0022900 electron transport chain, 0016021 integral to membrane; PDB: 3VRJ_C 2EIN_B 3AG3_B 2DYR_O 3AG1_B 2EIK_O 3ASN_B 1OCR_B 2EIJ_B 1OCC_O ....
Probab=24.70 E-value=1e+02 Score=19.17 Aligned_cols=15 Identities=13% Similarity=0.430 Sum_probs=6.2
Q ss_pred HHHHHHHHHHHhhee
Q psy17635 19 IAFLILVLILVCLIK 33 (103)
Q Consensus 19 i~lLllilli~C~i~ 33 (103)
|...++.+++.++.+
T Consensus 34 I~~~V~~~l~~~~~~ 48 (84)
T PF02790_consen 34 IFVFVFYFLIYFLFN 48 (84)
T ss_dssp HHHHHHHHHHHHHHH
T ss_pred eeeeEeeeccccccc
Confidence 333333344444444
No 346
>TIGR02794 tolA_full TolA protein. TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cutoffs are based largely conserved operon struction. //The Tol-Pal complex is required for maintaining outer membrane integrity. Also involved in transport (uptake) of colicins and filamentous DNA, and implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with outer membrane porins ompC, phoE and lamB.
Probab=24.63 E-value=74 Score=26.29 Aligned_cols=13 Identities=31% Similarity=0.634 Sum_probs=6.4
Q ss_pred HHHHHHHHHHHHH
Q psy17635 17 LAIAFLILVLILV 29 (103)
Q Consensus 17 ~ai~lLllilli~ 29 (103)
+++.+|||++||+
T Consensus 8 v~lHvlLi~lL~~ 20 (346)
T TIGR02794 8 LLLHILLLGLLIL 20 (346)
T ss_pred HHHHHHHHHHHHH
Confidence 3345555555554
No 347
>PF06697 DUF1191: Protein of unknown function (DUF1191); InterPro: IPR010605 This family contains hypothetical plant proteins of unknown function.
Probab=24.58 E-value=15 Score=29.91 Aligned_cols=23 Identities=26% Similarity=0.615 Sum_probs=11.6
Q ss_pred HHHHHHHHH-HHHHHHHhheeecC
Q psy17635 14 GMLLAIAFL-ILVLILVCLIKRNR 36 (103)
Q Consensus 14 g~~~ai~lL-llilli~C~i~R~r 36 (103)
|++..+++| +|.++++|++|-.|
T Consensus 218 g~~~G~~~L~ll~~lv~~~vr~kr 241 (278)
T PF06697_consen 218 GVVGGVVLLGLLSLLVAMLVRYKR 241 (278)
T ss_pred EehHHHHHHHHHHHHHHhhhhhhH
Confidence 434444444 44456667765443
No 348
>PF09583 Phageshock_PspG: Phage shock protein G (Phageshock_PspG); InterPro: IPR014318 This protein previously was designated yjbO in Escherichia coli. It is found only in genomes that have the phage shock operon (psp), but it is only rarely encoded near other psp genes. The psp regulon is upregulated in response to a number of stress conditions, including ethanol, expression of the filamentous phage secretin protein IV and other secretins and heat shock.
Probab=24.50 E-value=1.3e+02 Score=19.74 Aligned_cols=28 Identities=29% Similarity=0.501 Sum_probs=13.5
Q ss_pred hhhhHHHHHHHHHHH-----HHHHHHHHhheee
Q psy17635 7 ATAGWFIGMLLAIAF-----LILVLILVCLIKR 34 (103)
Q Consensus 7 at~gWfIg~~~ai~l-----Lllilli~C~i~R 34 (103)
++.-++++=|.++++ |+++++.++++|+
T Consensus 31 a~~vm~l~Gm~~lviKLLPWLil~~~~vW~~r~ 63 (65)
T PF09583_consen 31 AFAVMFLGGMFGLVIKLLPWLILAAVVVWFYRA 63 (65)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Confidence 444455555555444 2334444555554
No 349
>PF01998 DUF131: Protein of unknown function DUF131; InterPro: IPR002849 This archaebacterial protein family has no known function. The proteins are predicted to contain two transmembrane helices.
Probab=24.49 E-value=1e+02 Score=19.77 Aligned_cols=16 Identities=50% Similarity=0.783 Sum_probs=7.3
Q ss_pred HHHHHHHHHHHHHHHH
Q psy17635 12 FIGMLLAIAFLILVLI 27 (103)
Q Consensus 12 fIg~~~ai~lLllill 27 (103)
.++++.||++.++.++
T Consensus 46 ~~~~ilaiil~i~~~l 61 (64)
T PF01998_consen 46 KIAMILAIILMILALL 61 (64)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3444555444444433
No 350
>TIGR03141 cytochro_ccmD heme exporter protein CcmD. The model for this protein family describes a small, hydrophobic, and only moderately well-conserved protein, tricky to identify accurately for all of these reasons. However, members are found as part of large operons involved in heme export across the inner membrane for assembly of c-type cytochromes in a large number of bacteria. The gray zone between the trusted cutoff (13.0) and noise cutoff (4.75) includes both low-scoring examples and false-positive matches to hydrophobic domains of longer proteins.
Probab=24.29 E-value=42 Score=19.75 Aligned_cols=21 Identities=19% Similarity=0.292 Sum_probs=9.7
Q ss_pred HHHHHHHHHHHHHhheeecCC
Q psy17635 17 LAIAFLILVLILVCLIKRNRG 37 (103)
Q Consensus 17 ~ai~lLllilli~C~i~R~rG 37 (103)
-++.+++|+.+++....++|.
T Consensus 12 Yg~t~l~l~~li~~~~~~~r~ 32 (45)
T TIGR03141 12 YGITALVLAGLILWSLLDRRR 32 (45)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 344445554444444444443
No 351
>PF15339 Afaf: Acrosome formation-associated factor
Probab=24.14 E-value=76 Score=24.82 Aligned_cols=21 Identities=19% Similarity=0.362 Sum_probs=12.3
Q ss_pred HHHHHH-HHHHHHHHHHHhhee
Q psy17635 13 IGMLLA-IAFLILVLILVCLIK 33 (103)
Q Consensus 13 Ig~~~a-i~lLllilli~C~i~ 33 (103)
|.+|.. |.+.||++|.+-+.|
T Consensus 136 IsLmTl~lfv~Ll~~c~atlyk 157 (200)
T PF15339_consen 136 ISLMTLFLFVILLAFCSATLYK 157 (200)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 444443 666677777665553
No 352
>KOG3832|consensus
Probab=24.07 E-value=84 Score=25.73 Aligned_cols=13 Identities=38% Similarity=0.915 Sum_probs=9.7
Q ss_pred cchhhhhHHHHHH
Q psy17635 4 DTVATAGWFIGML 16 (103)
Q Consensus 4 ~~~at~gWfIg~~ 16 (103)
..|+|+||+..+.
T Consensus 35 kafatagwllsi~ 47 (319)
T KOG3832|consen 35 KAFATAGWLLSIT 47 (319)
T ss_pred HhHhhccHHHHHH
Confidence 3699999986543
No 353
>PF13858 DUF4199: Protein of unknown function (DUF4199)
Probab=24.04 E-value=98 Score=21.58 Aligned_cols=28 Identities=14% Similarity=0.422 Sum_probs=18.0
Q ss_pred hhHHH-HHHHHHHHHHHHHHHHhheeecC
Q psy17635 9 AGWFI-GMLLAIAFLILVLILVCLIKRNR 36 (103)
Q Consensus 9 ~gWfI-g~~~ai~lLllilli~C~i~R~r 36 (103)
..+++ +++.-+.+-+++-||++.+.|+|
T Consensus 135 ~~~~~~~~~~~l~~G~i~sli~a~i~kkk 163 (163)
T PF13858_consen 135 FSLAFSGFISNLIFGFIISLIIALILKKK 163 (163)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCC
Confidence 45666 55666667777777777666654
No 354
>PF05510 Sarcoglycan_2: Sarcoglycan alpha/epsilon; InterPro: IPR008908 Sarcoglycans are a subcomplex of transmembrane proteins which are part of the dystrophin-glycoprotein complex. They are expressed in the skeletal, cardiac and smooth muscle. Although numerous studies have been conducted on the sarcoglycan subcomplex in skeletal and cardiac muscle, the manner of the distribution and localisation of these proteins along the nonjunctional sarcolemma is not clear []. This family contains alpha and epsilon members.; GO: 0016012 sarcoglycan complex
Probab=23.96 E-value=65 Score=27.45 Aligned_cols=17 Identities=24% Similarity=0.757 Sum_probs=7.4
Q ss_pred HHHHHHHHHHHHHHHhh
Q psy17635 15 MLLAIAFLILVLILVCL 31 (103)
Q Consensus 15 ~~~ai~lLllilli~C~ 31 (103)
++++++|++|+.+|+||
T Consensus 293 l~i~llL~llLs~Imc~ 309 (386)
T PF05510_consen 293 LIIALLLLLLLSYIMCC 309 (386)
T ss_pred HHHHHHHHHHHHHHhee
Confidence 33334444444445554
No 355
>PF05297 Herpes_LMP1: Herpesvirus latent membrane protein 1 (LMP1); InterPro: IPR007961 This family consists of several latent membrane protein 1 or LMP1s mostly from Epstein-Barr virus (strain GD1) (HHV-4) (Human herpesvirus 4). LMP1 of HHV-4 is a 62-65 kDa plasma membrane protein possessing six membrane spanning regions, a short cytoplasmic N terminus and a long cytoplasmic carboxy tail of 200 amino acids. HHV-4 virus latent membrane protein 1 (LMP1) is essential for HHV-4 mediated transformation and has been associated with several cases of malignancies. HHV-4-like viruses in Macaca fascicularis (Cynomolgus monkeys) have been associated with high lymphoma rates in immunosuppressed monkeys [].; GO: 0019087 transformation of host cell by virus, 0016021 integral to membrane; PDB: 1CZY_E 1ZMS_B.
Probab=23.93 E-value=26 Score=29.56 Aligned_cols=25 Identities=24% Similarity=0.481 Sum_probs=0.0
Q ss_pred hhhHHHHHHHHHHHHHHHHHHHhhe
Q psy17635 8 TAGWFIGMLLAIAFLILVLILVCLI 32 (103)
Q Consensus 8 t~gWfIg~~~ai~lLllilli~C~i 32 (103)
.|.||..+|=..-|++++.+++++.
T Consensus 160 ~qaWfT~L~dL~WL~LFlaiLIWlY 184 (381)
T PF05297_consen 160 HQAWFTILVDLYWLLLFLAILIWLY 184 (381)
T ss_dssp -------------------------
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3567776666655666665555554
No 356
>PF13400 Tad: Putative Flp pilus-assembly TadE/G-like
Probab=23.89 E-value=44 Score=19.41 Aligned_cols=15 Identities=20% Similarity=0.487 Sum_probs=5.8
Q ss_pred HHHHHHHHHHHHHHh
Q psy17635 16 LLAIAFLILVLILVC 30 (103)
Q Consensus 16 ~~ai~lLllilli~C 30 (103)
+.+++++.+++++.+
T Consensus 6 ~~~~~~~~~l~~~~~ 20 (48)
T PF13400_consen 6 IFALVLVPLLLLIGL 20 (48)
T ss_pred HHHHHHHHHHHHHHH
Confidence 334343333333333
No 357
>PRK01844 hypothetical protein; Provisional
Probab=23.77 E-value=82 Score=20.99 Aligned_cols=24 Identities=21% Similarity=0.246 Sum_probs=9.6
Q ss_pred hHHHHHHHHHHHHHHHHHHHhheee
Q psy17635 10 GWFIGMLLAIAFLILVLILVCLIKR 34 (103)
Q Consensus 10 gWfIg~~~ai~lLllilli~C~i~R 34 (103)
.|+..++..+ .|++-+++-.|+-|
T Consensus 3 ~~~~I~l~I~-~li~G~~~Gff~ar 26 (72)
T PRK01844 3 IWLGILVGVV-ALVAGVALGFFIAR 26 (72)
T ss_pred HHHHHHHHHH-HHHHHHHHHHHHHH
Confidence 3543333333 33333344455544
No 358
>PRK05696 fliL flagellar basal body-associated protein FliL; Reviewed
Probab=23.73 E-value=58 Score=23.81 Aligned_cols=8 Identities=13% Similarity=-0.047 Sum_probs=3.5
Q ss_pred HHHHHHHH
Q psy17635 11 WFIGMLLA 18 (103)
Q Consensus 11 WfIg~~~a 18 (103)
|+|+++.+
T Consensus 21 iiii~~~~ 28 (170)
T PRK05696 21 IIIIVIGV 28 (170)
T ss_pred EeeHHHHH
Confidence 55444333
No 359
>PRK13872 conjugal transfer protein TrbF; Provisional
Probab=23.64 E-value=1.2e+02 Score=23.11 Aligned_cols=9 Identities=22% Similarity=0.453 Sum_probs=4.8
Q ss_pred hheeecCCC
Q psy17635 30 CLIKRNRGG 38 (103)
Q Consensus 30 C~i~R~rGg 38 (103)
.+|.-++-|
T Consensus 71 yvv~VD~tG 79 (228)
T PRK13872 71 WVVEVDRLG 79 (228)
T ss_pred EEEEEcCCC
Confidence 445556645
No 360
>COG4477 EzrA Negative regulator of septation ring formation [Cell division and chromosome partitioning]
Probab=23.59 E-value=32 Score=30.74 Aligned_cols=24 Identities=17% Similarity=0.365 Sum_probs=15.0
Q ss_pred HHHHHHHHHHHHHHHHHhheeecC
Q psy17635 13 IGMLLAIAFLILVLILVCLIKRNR 36 (103)
Q Consensus 13 Ig~~~ai~lLllilli~C~i~R~r 36 (103)
|-+|+||++++++++.+-++.|.|
T Consensus 2 ~~liv~llVilv~~~~~g~~lRkk 25 (570)
T COG4477 2 IYLIVALLVILVAAYAVGYLLRKK 25 (570)
T ss_pred eehHHHHHHHHHHHHHHHHHHHHh
Confidence 346667777777766666665544
No 361
>PF14257 DUF4349: Domain of unknown function (DUF4349)
Probab=23.59 E-value=1.1e+02 Score=23.55 Aligned_cols=9 Identities=11% Similarity=0.228 Sum_probs=3.8
Q ss_pred hHHHHHHHH
Q psy17635 10 GWFIGMLLA 18 (103)
Q Consensus 10 gWfIg~~~a 18 (103)
.|+|.++++
T Consensus 234 ~~lv~~l~~ 242 (262)
T PF14257_consen 234 SGLVVFLVG 242 (262)
T ss_pred HHHHHHHHH
Confidence 344444444
No 362
>PF02158 Neuregulin: Neuregulin family; InterPro: IPR002154 Neuregulins are a sub-family of EGF-like molecules that have been shown to play multiple essential roles in vertebrate embryogenesis including: cardiac development, Schwann cell and oligodendrocyte differentiation, some aspects of neuronal development, as well as the formation of neuromuscular synapses [, ]. Included in the family are heregulin; neu differentiation factor; acetylcholine receptor synthesis stimulator; glial growth factor; and sensory and motor-neuron derived factor []. Multiple family members are generated by alternate splicing or by use of several cell type-specific transcription initiation sites. In general, they bind to and activate the erbB family of receptor tyrosine kinases (erbB2 (HER2), erbB3 (HER3), and erbB4 (HER4)), functioning both as heterodimers and homodimers. The transmembrane forms of neuregulin 1 (NRG1) are present within synaptic vesicles, including those containing glutamate []. After exocytosis, NRG1 is in the presynaptic membrane, where the ectodomain of NRG1 may be cleaved off. The ectodomain then migrates across the synaptic cleft and binds to and activates a member of the EGF-receptor family on the postsynaptic membrane. This has been shown to increase the expression of certain glutamate-receptor subunits. NRG1 appears to signal for glutamate-receptor subunit expression, localisation, and /or phosphorylation facilitating subsequent glutamate transmission. The NRG1 gene has been identified as a potential gene determining susceptibility to schizophrenia by a combination of genetic linkage and association approaches []. ; GO: 0005102 receptor binding, 0009790 embryo development; PDB: 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=23.57 E-value=27 Score=30.05 Aligned_cols=16 Identities=31% Similarity=0.681 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHhhe
Q psy17635 17 LAIAFLILVLILVCLI 32 (103)
Q Consensus 17 ~ai~lLllilli~C~i 32 (103)
..|++.||++=|+|.+
T Consensus 13 TgIcvaLlVVGi~Cvv 28 (404)
T PF02158_consen 13 TGICVALLVVGIVCVV 28 (404)
T ss_dssp ----------------
T ss_pred hhhhHHHHHHHHHHHH
Confidence 3355555555555555
No 363
>KOG3359|consensus
Probab=23.21 E-value=88 Score=28.84 Aligned_cols=39 Identities=23% Similarity=0.348 Sum_probs=22.2
Q ss_pred hhHHHHHHHHHHHHHHHHHHHhheeecCCCcccchhhhhhc
Q psy17635 9 AGWFIGMLLAIAFLILVLILVCLIKRNRGGKYAVHEREAAH 49 (103)
Q Consensus 9 ~gWfIg~~~ai~lLllilli~C~i~R~rGgKY~VkeKE~~~ 49 (103)
..|-+.+++.+++.+++ +.+.++|+||--+.+.+.+..+
T Consensus 576 ~~Ww~~~~~v~if~~v~--~~~l~~wqr~~~~~~~~~~~~~ 614 (723)
T KOG3359|consen 576 VIWWSSTLSLAIFALVV--LFRLLRWQRGTILSDPQIWRVH 614 (723)
T ss_pred HHHHHHHHHHHHHHHHH--HHHHHHhccCCccCchhhhhHH
Confidence 34555555554444333 3566788888655555555444
No 364
>KOG1094|consensus
Probab=23.17 E-value=37 Score=31.35 Aligned_cols=26 Identities=19% Similarity=0.401 Sum_probs=15.7
Q ss_pred hhhHHHHHHHHHHHHHHHHHHHhhee
Q psy17635 8 TAGWFIGMLLAIAFLILVLILVCLIK 33 (103)
Q Consensus 8 t~gWfIg~~~ai~lLllilli~C~i~ 33 (103)
+..-.|+++.||.|++++++.+|+-+
T Consensus 389 ~t~~~~~~f~~if~iva~ii~~~L~R 414 (807)
T KOG1094|consen 389 PTAILIIIFVAIFLIVALIIALMLWR 414 (807)
T ss_pred CceehHHHHHHHHHHHHHHHHHHHHH
Confidence 34455666677666666655566554
No 365
>cd01811 OASL_repeat1 2'-5' oligoadenylate synthetase-like protein, repeat 1 of 2. OASL_repeat1 (2'-5' oligoadenylate synthetase-like protein) belongs to a family of interferon-induced 2'-5' oligoadenylate synthetases which are important for the antiviral activity of interferons. While each member of this famliy has a conserved N-terminal OAS catalytic domain, only OASL has two tandem ubiquitin-like repeats located at the C-terminus and this CD represents one of those repeats.
Probab=23.07 E-value=44 Score=22.76 Aligned_cols=13 Identities=38% Similarity=0.583 Sum_probs=11.0
Q ss_pred CCCCcchhhhhCC
Q psy17635 84 VESDTDSMAEYGD 96 (103)
Q Consensus 84 ~~~s~DSl~dYgd 96 (103)
..+|.+|||+||-
T Consensus 53 lL~s~~sLA~yGi 65 (80)
T cd01811 53 LLSSRKSLADYGI 65 (80)
T ss_pred cccccccHhhhcc
Confidence 4788999999984
No 366
>PF10873 DUF2668: Protein of unknown function (DUF2668); InterPro: IPR022640 Members in this family of proteins are annotated as cysteine and tyrosine-rich protein 1, however currently no function is known [].
Probab=22.99 E-value=84 Score=23.78 Aligned_cols=10 Identities=30% Similarity=0.690 Sum_probs=5.5
Q ss_pred eeecCCCccc
Q psy17635 32 IKRNRGGKYA 41 (103)
Q Consensus 32 i~R~rGgKY~ 41 (103)
.+.|||-.=.
T Consensus 88 ~kn~rgsRvg 97 (155)
T PF10873_consen 88 MKNSRGSRVG 97 (155)
T ss_pred hhcCCCcccc
Confidence 4667665433
No 367
>PF04906 Tweety: Tweety; InterPro: IPR006990 None of the members of the tweety (tty) family have been functionally characterised. However, they are considered to be transmembrane proteins with five potential membrane-spanning regions. A number of potential functions have been suggested on the basis of homology to the yeast FTR1 and FTH1 iron transporter proteins and the mammalian neurotensin receptors 1 and 2 in that they have a similar hydrophobicity profiles although there is no detectable sequence homology to the tweety-related proteins. It has been proposed that the tweety-related proteins could be involved in transport of iron or other divalent cations or alternatively that they may be membrane-bound receptors [].
Probab=22.88 E-value=1e+02 Score=25.89 Aligned_cols=12 Identities=8% Similarity=0.047 Sum_probs=7.9
Q ss_pred HhheeecCCCcc
Q psy17635 29 VCLIKRNRGGKY 40 (103)
Q Consensus 29 ~C~i~R~rGgKY 40 (103)
+-++||+|..--
T Consensus 205 ~glar~Sk~~li 216 (406)
T PF04906_consen 205 LGLARQSKCLLI 216 (406)
T ss_pred HHHHhcCcceEE
Confidence 346688887653
No 368
>PRK03001 M48 family peptidase; Provisional
Probab=22.86 E-value=63 Score=25.31 Aligned_cols=35 Identities=17% Similarity=0.307 Sum_probs=16.6
Q ss_pred hHHHHHHHHHHHHHHHHHHH-hheeecCCCcccchhh
Q psy17635 10 GWFIGMLLAIAFLILVLILV-CLIKRNRGGKYAVHER 45 (103)
Q Consensus 10 gWfIg~~~ai~lLllilli~-C~i~R~rGgKY~VkeK 45 (103)
+|+++++.++++.++...+. .++.+.=| +.+|.+.
T Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~-~~~l~~~ 64 (283)
T PRK03001 29 GMLIALLFALGMNFFSYWFSDKMVLKMYN-AQEVDEN 64 (283)
T ss_pred hHHHHHHHHHHHHHHHHHHhHHHHHHHcC-CEECCcc
Confidence 46666555555444443333 33444323 3556654
No 369
>TIGR03142 cytochro_ccmI cytochrome c-type biogenesis protein CcmI. This TPR repeat-containing protein is the CcmI protein (also called CycH) of c-type cytochrome biogenesis. CcmI is thought to act as an apo-cytochrome c chaperone. This model describes the N-terminal region of the protein, Members of this protein family
Probab=22.56 E-value=1e+02 Score=21.12 Aligned_cols=7 Identities=29% Similarity=0.805 Sum_probs=3.4
Q ss_pred hHHHHHH
Q psy17635 10 GWFIGML 16 (103)
Q Consensus 10 gWfIg~~ 16 (103)
-|++..+
T Consensus 2 fWl~~a~ 8 (117)
T TIGR03142 2 FWIVAAL 8 (117)
T ss_pred HHHHHHH
Confidence 3755433
No 370
>PF05568 ASFV_J13L: African swine fever virus J13L protein; InterPro: IPR008385 This family consists of several African swine fever virus (ASFV) j13L proteins [, , ].
Probab=22.53 E-value=93 Score=23.89 Aligned_cols=26 Identities=23% Similarity=0.535 Sum_probs=18.1
Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHhhe
Q psy17635 7 ATAGWFIGMLLAIAFLILVLILVCLI 32 (103)
Q Consensus 7 at~gWfIg~~~ai~lLllilli~C~i 32 (103)
..-.+|-.-||.|++.+.|++|+.++
T Consensus 22 ~~psffsthm~tILiaIvVliiiiiv 47 (189)
T PF05568_consen 22 TPPSFFSTHMYTILIAIVVLIIIIIV 47 (189)
T ss_pred CCccHHHHHHHHHHHHHHHHHHHHHH
Confidence 44578888899888777666555443
No 371
>PRK00523 hypothetical protein; Provisional
Probab=22.50 E-value=82 Score=20.98 Aligned_cols=13 Identities=23% Similarity=0.335 Sum_probs=5.9
Q ss_pred HHHHHHHHhheee
Q psy17635 22 LILVLILVCLIKR 34 (103)
Q Consensus 22 Lllilli~C~i~R 34 (103)
|++-+++-.|+-|
T Consensus 15 li~G~~~Gffiar 27 (72)
T PRK00523 15 LIVGGIIGYFVSK 27 (72)
T ss_pred HHHHHHHHHHHHH
Confidence 3333444455544
No 372
>PF04507 DUF576: Protein of unknown function, DUF576; InterPro: IPR007595 This family contains several uncharacterised staphylococcal proteins. Members of this family are mostly predicted lipoproteins, found in Staphylococcus aureus but are also found clustered in Staphylococcus epidermidis.; PDB: 4EG9_A 4EGD_B.
Probab=22.23 E-value=29 Score=28.14 Aligned_cols=14 Identities=29% Similarity=0.418 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHhhe
Q psy17635 19 IAFLILVLILVCLI 32 (103)
Q Consensus 19 i~lLllilli~C~i 32 (103)
|.+||||++|++|.
T Consensus 11 Is~liLii~I~GCg 24 (257)
T PF04507_consen 11 ISLLILIIFIGGCG 24 (257)
T ss_dssp --------------
T ss_pred HHHHhHheeeeecc
Confidence 44455555555443
No 373
>PHA02681 ORF089 virion membrane protein; Provisional
Probab=22.19 E-value=83 Score=21.85 Aligned_cols=6 Identities=17% Similarity=0.528 Sum_probs=2.5
Q ss_pred hheeec
Q psy17635 30 CLIKRN 35 (103)
Q Consensus 30 C~i~R~ 35 (103)
...||.
T Consensus 22 a~YRR~ 27 (92)
T PHA02681 22 MMYRRS 27 (92)
T ss_pred HHHHhc
Confidence 334444
No 374
>PF12751 Vac7: Vacuolar segregation subunit 7; InterPro: IPR024260 Vac7 is localised at the vacuole membrane, a location which is consistent with its involvement in vacuole morphology and inheritance []. Vac7 has been shown to function as an upstream regulator of the Fab1 lipid kinase pathway []. The Fab1 lipid pathway is important for correct regulation of membrane trafficking events.
Probab=22.10 E-value=91 Score=26.71 Aligned_cols=22 Identities=14% Similarity=0.353 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHhhe
Q psy17635 11 WFIGMLLAIAFLILVLILVCLI 32 (103)
Q Consensus 11 WfIg~~~ai~lLllilli~C~i 32 (103)
..+-+++.|++||+|..+++|+
T Consensus 302 ~~~c~~~~i~~lL~ig~~~gFv 323 (387)
T PF12751_consen 302 FASCIYLSILLLLVIGFAIGFV 323 (387)
T ss_pred hhHHHHHHHHHHHHHHHHHHhh
No 375
>PF13074 DUF3938: Protein of unknown function (DUF3938)
Probab=22.06 E-value=88 Score=21.63 Aligned_cols=14 Identities=43% Similarity=0.814 Sum_probs=10.5
Q ss_pred hHHHHHHHHHHHHH
Q psy17635 10 GWFIGMLLAIAFLI 23 (103)
Q Consensus 10 gWfIg~~~ai~lLl 23 (103)
.|||+...|=++-+
T Consensus 75 twfiafcladvfnl 88 (102)
T PF13074_consen 75 TWFIAFCLADVFNL 88 (102)
T ss_pred HHHHHHHHHHHHHH
Confidence 69999888866544
No 376
>PRK13836 conjugal transfer protein TrbF; Provisional
Probab=22.02 E-value=1.3e+02 Score=22.76 Aligned_cols=15 Identities=27% Similarity=0.507 Sum_probs=9.4
Q ss_pred hhhHHHHHHHHHHHH
Q psy17635 8 TAGWFIGMLLAIAFL 22 (103)
Q Consensus 8 t~gWfIg~~~ai~lL 22 (103)
...|+++.+.++++.
T Consensus 29 ~~~W~~~a~~~l~~a 43 (220)
T PRK13836 29 AAAWRIVGILGLTMA 43 (220)
T ss_pred HHHHHHHHHHHHHHH
Confidence 367987776554443
No 377
>PF13937 DUF4212: Domain of unknown function (DUF4212)
Probab=21.76 E-value=43 Score=22.42 Aligned_cols=6 Identities=50% Similarity=0.778 Sum_probs=2.3
Q ss_pred CCcccc
Q psy17635 37 GGKYAV 42 (103)
Q Consensus 37 GgKY~V 42 (103)
-.||-|
T Consensus 74 D~~~gv 79 (81)
T PF13937_consen 74 DRKYGV 79 (81)
T ss_pred HHHHCC
Confidence 334433
No 378
>PRK10856 cytoskeletal protein RodZ; Provisional
Probab=21.75 E-value=95 Score=25.42 Aligned_cols=26 Identities=15% Similarity=0.418 Sum_probs=14.0
Q ss_pred hhhHHHHHHHHHHHHHHHHHHHhhee
Q psy17635 8 TAGWFIGMLLAIAFLILVLILVCLIK 33 (103)
Q Consensus 8 t~gWfIg~~~ai~lLllilli~C~i~ 33 (103)
..+|+..+...|++++|.+..+++..
T Consensus 109 ~~~~~~~~~~lv~~vvl~l~~~wwwq 134 (331)
T PRK10856 109 RDGWLMTFTWLVLFVVIGLTGAWWWQ 134 (331)
T ss_pred cCCchHHHHHHHHHHHHHHHHHHHhh
Confidence 44577655555555555554455553
No 379
>TIGR00383 corA magnesium Mg(2+) and cobalt Co(2+) transport protein (corA). The article in Microb Comp Genomics 1998;3(3):151-69 (Medline:98448512) discusses this family and suggests that some members may have functions other than Mg2+ transport.
Probab=21.70 E-value=70 Score=24.81 Aligned_cols=13 Identities=15% Similarity=0.445 Sum_probs=4.9
Q ss_pred HHHHHHHHHHHHH
Q psy17635 11 WFIGMLLAIAFLI 23 (103)
Q Consensus 11 WfIg~~~ai~lLl 23 (103)
|-.|..+++++.+
T Consensus 290 ~~~gy~~~l~~m~ 302 (318)
T TIGR00383 290 WKYGYPAVLIVMA 302 (318)
T ss_pred chhHHHHHHHHHH
Confidence 4334433333333
No 380
>PF07695 7TMR-DISM_7TM: 7TM diverse intracellular signalling; InterPro: IPR011623 This entry represents the transmembrane region of the 7TM-DISM (7TM Receptors with Diverse Intracellular Signalling Modules) [].
Probab=21.63 E-value=1.1e+02 Score=21.07 Aligned_cols=11 Identities=36% Similarity=0.531 Sum_probs=5.4
Q ss_pred hheeecCCCcc
Q psy17635 30 CLIKRNRGGKY 40 (103)
Q Consensus 30 C~i~R~rGgKY 40 (103)
...||+++.+|
T Consensus 144 ~~~~~~~~a~~ 154 (205)
T PF07695_consen 144 AWRKGNRPARY 154 (205)
T ss_pred HHHcCCccHHH
Confidence 43455555544
No 381
>PTZ00046 rifin; Provisional
Probab=21.60 E-value=88 Score=26.48 Aligned_cols=23 Identities=13% Similarity=0.475 Sum_probs=10.8
Q ss_pred HHHHHHHHHHHHHHHHhheeecC
Q psy17635 14 GMLLAIAFLILVLILVCLIKRNR 36 (103)
Q Consensus 14 g~~~ai~lLllilli~C~i~R~r 36 (103)
..++.|||..+|+.++-=.||.|
T Consensus 322 iAIvVIVLIMvIIYLILRYRRKK 344 (358)
T PTZ00046 322 VAIVVIVLIMVIIYLILRYRRKK 344 (358)
T ss_pred HHHHHHHHHHHHHHHHHHhhhcc
Confidence 33344444444444444456654
No 382
>PF03988 DUF347: Repeat of Unknown Function (DUF347) ; InterPro: IPR007136 This repeat is found as four tandem repeats in a family of bacterial membrane proteins. Each repeat contains two transmembrane regions and a conserved tryptophan.
Probab=21.54 E-value=1.5e+02 Score=18.07 Aligned_cols=17 Identities=41% Similarity=0.366 Sum_probs=7.0
Q ss_pred HHHHHHHHHHHhheeec
Q psy17635 19 IAFLILVLILVCLIKRN 35 (103)
Q Consensus 19 i~lLllilli~C~i~R~ 35 (103)
++++++++.+....+|+
T Consensus 37 ~~~l~~~~~~~~~~~~~ 53 (55)
T PF03988_consen 37 AALLAVVLALWYRSKRY 53 (55)
T ss_pred HHHHHHHHHHHHHHhcc
Confidence 33333334444444444
No 383
>PF10731 Anophelin: Thrombin inhibitor from mosquito; InterPro: IPR018932 Members of this family are all inhibitors of thrombin, the peptidase that is at the end of the blood coagulation cascade and which creates the clot by cleaving fibrinogen. The interaction between thrombin and fibrinogen involves two different areas of contact - via the thrombin active site and via a second substrate-binding site known as an exosite. The inhibitor acts by blocking the exosite, rather than by interacting with the active site. The inhibitors are from mosquitoes that feed on human blood and which, by inhibiting thrombin, prevent the blood from clotting and keep it flowing.
Probab=21.54 E-value=1.8e+02 Score=19.00 Aligned_cols=10 Identities=30% Similarity=0.311 Sum_probs=5.2
Q ss_pred hcCCCCCCCC
Q psy17635 48 AHGRHDYPEE 57 (103)
Q Consensus 48 ~~g~~~~~dd 57 (103)
++|+.|--||
T Consensus 26 a~GeeP~YDE 35 (65)
T PF10731_consen 26 APGEEPSYDE 35 (65)
T ss_pred CCCCCCCcCc
Confidence 5666654333
No 384
>PF15159 PIG-Y: Phosphatidylinositol N-acetylglucosaminyltransferase subunit Y
Probab=21.44 E-value=1.7e+02 Score=19.21 Aligned_cols=21 Identities=19% Similarity=0.490 Sum_probs=13.8
Q ss_pred hhHHHHHHHHHHHHHHHHHHH
Q psy17635 9 AGWFIGMLLAIAFLILVLILV 29 (103)
Q Consensus 9 ~gWfIg~~~ai~lLllilli~ 29 (103)
.||++.+++++.++.+.-.++
T Consensus 2 ~G~~~il~~~v~fv~~~y~~v 22 (72)
T PF15159_consen 2 LGWLLILFTLVFFVGFFYAAV 22 (72)
T ss_pred cchHHHHHHHHHHHHHHHHHH
Confidence 378888877766666554443
No 385
>PF15145 DUF4577: Domain of unknown function (DUF4577)
Probab=21.27 E-value=1.5e+02 Score=21.74 Aligned_cols=35 Identities=31% Similarity=0.584 Sum_probs=16.7
Q ss_pred hHHHHHHHHHH-HHHHHHHHHhheeecCCCcc-cchhh
Q psy17635 10 GWFIGMLLAIA-FLILVLILVCLIKRNRGGKY-AVHER 45 (103)
Q Consensus 10 gWfIg~~~ai~-lLllilli~C~i~R~rGgKY-~VkeK 45 (103)
-+|+|+++++. -|.|+.+++++| -+-|.|- .|..+
T Consensus 62 lffvglii~LivSLaLVsFvIFLi-iQTgnkMddvSrR 98 (128)
T PF15145_consen 62 LFFVGLIIVLIVSLALVSFVIFLI-IQTGNKMDDVSRR 98 (128)
T ss_pred ehHHHHHHHHHHHHHHHHHHHHhe-eeccchHHHHHHH
Confidence 46788777733 333443333333 2334443 24444
No 386
>PF04531 Phage_holin_1: Bacteriophage holin; InterPro: IPR006485 Phage proteins for bacterial lysis typically include a membrane-disrupting protein, or holin, and one or more cell wall degrading enzymes that reach the cell wall because of holin action. Holins are found in a large number of mutually non-homologous families. This entry is represented by the Bacteriophage phi-LC3, holin. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.
Probab=21.20 E-value=1.5e+02 Score=19.60 Aligned_cols=23 Identities=13% Similarity=0.273 Sum_probs=15.0
Q ss_pred hhhhhHHHHHHHHHHHHHHHHHH
Q psy17635 6 VATAGWFIGMLLAIAFLILVLIL 28 (103)
Q Consensus 6 ~at~gWfIg~~~ai~lLllilli 28 (103)
+.+-.|.++++.+|++++..++.
T Consensus 8 ~kN~~~w~ali~~i~l~vq~~~~ 30 (84)
T PF04531_consen 8 FKNKAFWVALISAILLLVQQVGG 30 (84)
T ss_pred ccCHHHHHHHHHHHHHHHHHHHH
Confidence 44556667777777777665544
No 387
>COG4698 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=20.91 E-value=1.5e+02 Score=23.26 Aligned_cols=38 Identities=11% Similarity=0.217 Sum_probs=21.3
Q ss_pred hHHHHHHHHHHHHHHHH-HHHhheeecCCCcccchhhhh
Q psy17635 10 GWFIGMLLAIAFLILVL-ILVCLIKRNRGGKYAVHEREA 47 (103)
Q Consensus 10 gWfIg~~~ai~lLllil-li~C~i~R~rGgKY~VkeKE~ 47 (103)
.|=-+.+|.++|+++++ +|+.++.--|-.-|.+.+++.
T Consensus 10 ~WKw~f~iLLAln~l~~~~i~~~vlsp~ee~t~~~~a~~ 48 (197)
T COG4698 10 YWKWLFFILLALNTLLAVLIALFVLSPREEPTHLEDASE 48 (197)
T ss_pred HHHHHHHHHHHHHHHHHHHhheeeccCCCCCchhhccCc
Confidence 46666666666666554 333444334444666666654
No 388
>PF13131 DUF3951: Protein of unknown function (DUF3951)
Probab=20.91 E-value=1.5e+02 Score=18.71 Aligned_cols=11 Identities=18% Similarity=0.365 Sum_probs=4.8
Q ss_pred HHHHHHHHHHH
Q psy17635 17 LAIAFLILVLI 27 (103)
Q Consensus 17 ~ai~lLllill 27 (103)
.++++++||.+
T Consensus 11 ~~~~I~~lIgf 21 (53)
T PF13131_consen 11 FTIFIFFLIGF 21 (53)
T ss_pred HHHHHHHHHHH
Confidence 33444444444
No 389
>COG3462 Predicted membrane protein [Function unknown]
Probab=20.91 E-value=1.2e+02 Score=21.95 Aligned_cols=18 Identities=22% Similarity=0.622 Sum_probs=9.5
Q ss_pred HHHHHHHHHHHHHHHHHH
Q psy17635 11 WFIGMLLAIAFLILVLIL 28 (103)
Q Consensus 11 WfIg~~~ai~lLllilli 28 (103)
|+|-.+.+++.++++++.
T Consensus 50 ~lImpI~~~vvli~lvvf 67 (117)
T COG3462 50 WLIMPIFWAVVLIFLVVF 67 (117)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 677666664444444333
No 390
>TIGR03746 conj_TIGR03746 integrating conjugative element protein, PFL_4703 family. Members of this protein family are found occasionally on plasmids such as the Pseudomonas putida TOL plasmid pWWO_p085. Usually, however, they are found on the bacterial main chromosome in regions flanked by markers of conjugative transfer and/or transposition. The function is unknown.
Probab=20.87 E-value=1.3e+02 Score=23.62 Aligned_cols=23 Identities=22% Similarity=0.154 Sum_probs=12.9
Q ss_pred hhhhhHHHHHHHHHHHHHHHHHH
Q psy17635 6 VATAGWFIGMLLAIAFLILVLIL 28 (103)
Q Consensus 6 ~at~gWfIg~~~ai~lLllilli 28 (103)
+.+..++|++|++|++++.+.+.
T Consensus 12 i~tLr~~~~~L~~i~~~l~~gw~ 34 (202)
T TIGR03746 12 ILTLRIAIGLLALILLALWFGWR 34 (202)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 44556666666665555544433
No 391
>TIGR01477 RIFIN variant surface antigen, rifin family. This model represents the rifin branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of rifin sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 20 bits.
Probab=20.85 E-value=92 Score=26.32 Aligned_cols=24 Identities=21% Similarity=0.650 Sum_probs=11.2
Q ss_pred HHHHHHHHHHHHHHHHHHhheeecC
Q psy17635 12 FIGMLLAIAFLILVLILVCLIKRNR 36 (103)
Q Consensus 12 fIg~~~ai~lLllilli~C~i~R~r 36 (103)
+|++++ |||..+|+.++-=.||.|
T Consensus 316 iIAIvv-IVLIMvIIYLILRYRRKK 339 (353)
T TIGR01477 316 IIAILI-IVLIMVIIYLILRYRRKK 339 (353)
T ss_pred HHHHHH-HHHHHHHHHHHHHhhhcc
Confidence 344444 444444444444466654
No 392
>PF07009 DUF1312: Protein of unknown function (DUF1312); InterPro: IPR010739 This family consists of several bacterial proteins of around 120 residues in length. The function of this family is unknown.; PDB: 4ESN_B 1NPP_B 1M1G_D 1NPR_A 1M1H_A 2KPP_A 3LD7_C.
Probab=20.84 E-value=31 Score=23.62 Aligned_cols=9 Identities=67% Similarity=1.088 Sum_probs=3.2
Q ss_pred eecCCCccc
Q psy17635 33 KRNRGGKYA 41 (103)
Q Consensus 33 ~R~rGgKY~ 41 (103)
..+.+++|.
T Consensus 21 ~~~~~~~~~ 29 (113)
T PF07009_consen 21 GNNSGGKYA 29 (113)
T ss_dssp ----SEEEE
T ss_pred ccCCCCeEE
Confidence 456666665
No 393
>COG5522 Predicted integral membrane protein [Function unknown]
Probab=20.79 E-value=60 Score=26.03 Aligned_cols=33 Identities=24% Similarity=0.471 Sum_probs=22.0
Q ss_pred HHHHHHHHHHHHHHHHhheeecCCCcccchhhh
Q psy17635 14 GMLLAIAFLILVLILVCLIKRNRGGKYAVHERE 46 (103)
Q Consensus 14 g~~~ai~lLllilli~C~i~R~rGgKY~VkeKE 46 (103)
|+.|+++++++...++.++.|.=|.+|.-=.||
T Consensus 155 gl~~svl~~~~lg~~~lfinrrLGtNYlylsk~ 187 (236)
T COG5522 155 GLVMSVLVAISLGIMCLFINRRLGTNYLYLSKE 187 (236)
T ss_pred chhHHHHHHHHHHHHHHHHHHHhcCceeEeecC
Confidence 355665555555666667889999999754443
No 394
>PF14029 DUF4244: Protein of unknown function (DUF4244)
Probab=20.79 E-value=2e+02 Score=18.10 Aligned_cols=26 Identities=27% Similarity=0.363 Sum_probs=20.2
Q ss_pred CcchhhhhHHHHHHHHHHHHHHHHHH
Q psy17635 3 SDTVATAGWFIGMLLAIAFLILVLIL 28 (103)
Q Consensus 3 ~~~~at~gWfIg~~~ai~lLllilli 28 (103)
..+.+|.=|-||.+.|.+|.-++..+
T Consensus 12 d~GmsTaEYAvgtlAA~afA~vL~~v 37 (56)
T PF14029_consen 12 DAGMSTAEYAVGTLAAAAFAGVLYKV 37 (56)
T ss_pred cCchhHHHHHHHHHHHHHHHHHHHHH
Confidence 35688899999999998887665544
No 395
>PF10601 zf-LITAF-like: LITAF-like zinc ribbon domain; InterPro: IPR006629 Members of this family display a conserved zinc ribbon structure [] with the motif C-XX-C- separated from the more C-terminal HX-C(P)X-C-X4-G-R motif by a variable region of usually 25-30 (hydrophobic) residues. Although it belongs to one of the zinc finger's fold groups (zinc ribbon), this particular domain was first identified in LPS-induced tumour necrosis alpha factor (LITAF) which is produced in mammalian cells after being challenged with lipopolysaccharide (LPS). The hydrophobic region probably inserts into the membrane rather than traversing it. Such an insertion brings together the N- and C-terminal C-XX-C motifs to form a compact Zn2+-binding structure [].
Probab=20.72 E-value=1.6e+02 Score=18.38 Aligned_cols=8 Identities=13% Similarity=0.252 Sum_probs=3.4
Q ss_pred hhHHHHHH
Q psy17635 9 AGWFIGML 16 (103)
Q Consensus 9 ~gWfIg~~ 16 (103)
.+|+++++
T Consensus 28 ~t~~~~~~ 35 (73)
T PF10601_consen 28 MTYICAAL 35 (73)
T ss_pred HHHHHHHH
Confidence 34544443
No 396
>PHA02911 C-type lectin-like protein; Provisional
Probab=20.63 E-value=1.1e+02 Score=24.36 Aligned_cols=19 Identities=21% Similarity=0.392 Sum_probs=10.0
Q ss_pred hHHHHHHHHHHHHHHHHHH
Q psy17635 10 GWFIGMLLAIAFLILVLIL 28 (103)
Q Consensus 10 gWfIg~~~ai~lLllilli 28 (103)
.-+|+.||.|..+++|.|.
T Consensus 30 ~~~~~~~~~~~~~~~~~~~ 48 (213)
T PHA02911 30 ALIIACLCLILTIIIICLF 48 (213)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 3456666665555544443
No 397
>PF06667 PspB: Phage shock protein B; InterPro: IPR009554 This family consists of several bacterial phage shock protein B (PspB) sequences. The phage shock protein (psp) operon is induced in response to heat, ethanol, osmotic shock and infection by filamentous bacteriophages []. Expression of the operon requires the alternative sigma factor sigma54 and the transcriptional activator PspF. In addition, PspA plays a negative regulatory role, and the integral-membrane proteins PspB and PspC play a positive one [].; GO: 0006355 regulation of transcription, DNA-dependent, 0009271 phage shock
Probab=20.61 E-value=1.3e+02 Score=19.90 Aligned_cols=9 Identities=11% Similarity=0.062 Sum_probs=4.0
Q ss_pred eecCCCccc
Q psy17635 33 KRNRGGKYA 41 (103)
Q Consensus 33 ~R~rGgKY~ 41 (103)
|++.++.-+
T Consensus 29 k~~~~~gLs 37 (75)
T PF06667_consen 29 KWKSSQGLS 37 (75)
T ss_pred hcccCCCCC
Confidence 444444444
No 398
>TIGR02595 PEP_exosort PEP-CTERM putative exosortase interaction domain. This model describes a 25-residue domain that includes a near-invariant Pro-Glu-Pro (PEP) motif, a thirteen residue strongly hydrophobic sequence likely to span the membrane, and a five-residue strongly basic motif that often contains four Arg residues. In nearly every case, this motif is found within nine residues, and usually within five residues, of the extreme C-terminus of the protein. Proteins with this motif typically have signal sequences at the N-terminus. This region appears many times per genome or not at all, and co-occurs in genomes with a proposed protein-sorting integral membrane protein we designate exosortase (see TIGR02602). PEP-CTERM proteins frequently are poorly conserved, Ser/Thr-rich proteins and may become extensively modified proteinaceous constituents of extracellular material in bacterial biofilms.
Probab=20.58 E-value=80 Score=16.64 Aligned_cols=9 Identities=33% Similarity=0.497 Sum_probs=3.8
Q ss_pred HHhheeecC
Q psy17635 28 LVCLIKRNR 36 (103)
Q Consensus 28 i~C~i~R~r 36 (103)
.....+|.|
T Consensus 15 ~~~~~rrrk 23 (26)
T TIGR02595 15 GFLLLRRRR 23 (26)
T ss_pred HHHHHhhcc
Confidence 334444443
No 399
>PF07790 DUF1628: Protein of unknown function (DUF1628); InterPro: IPR012859 The sequences making up this family are derived from hypothetical proteins of unknown function expressed by various archaeal species. The region in question is approximately 160 residues long.
Probab=20.54 E-value=1.4e+02 Score=18.85 Aligned_cols=13 Identities=38% Similarity=0.810 Sum_probs=6.1
Q ss_pred HHHHHHHHHHHHH
Q psy17635 13 IGMLLAIAFLILV 25 (103)
Q Consensus 13 Ig~~~ai~lLlli 25 (103)
||+++.|++.+++
T Consensus 7 iGviLliaitVil 19 (80)
T PF07790_consen 7 IGVILLIAITVIL 19 (80)
T ss_pred HHHHHHHHHHHHH
Confidence 4555544444433
No 400
>TIGR02975 phageshock_pspG phage shock protein G. This protein previously was designated yjbO in E. coli. It is found only in genomes that have the phage shock operon (psp), but only rarely is encoded near other psp genes. The psp regulon is upregulated in response to a number of stress conditions, including ethanol, expression of the filamentous phage secretin protein IV and other secretins, and heat shock.
Probab=20.54 E-value=1.8e+02 Score=18.97 Aligned_cols=28 Identities=32% Similarity=0.466 Sum_probs=13.6
Q ss_pred hhhhHHHHHHHHHHH-----HHHHHHHHhheee
Q psy17635 7 ATAGWFIGMLLAIAF-----LILVLILVCLIKR 34 (103)
Q Consensus 7 at~gWfIg~~~ai~l-----Lllilli~C~i~R 34 (103)
++.-++++=|.++++ |++++..++++|+
T Consensus 30 a~~vm~l~Gm~~lviKLLPWLil~~v~vW~~r~ 62 (64)
T TIGR02975 30 AVLFMALGGMFALMIKLLPWLILAVVVVWFIKS 62 (64)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Confidence 444555555555443 2334444555554
No 401
>COG4059 MtrE Tetrahydromethanopterin S-methyltransferase, subunit E [Coenzyme metabolism]
Probab=20.45 E-value=95 Score=25.42 Aligned_cols=33 Identities=33% Similarity=0.451 Sum_probs=15.5
Q ss_pred hhhHHHHHHHHHHHHHHHHHH--HhheeecCCCcc
Q psy17635 8 TAGWFIGMLLAIAFLILVLIL--VCLIKRNRGGKY 40 (103)
Q Consensus 8 t~gWfIg~~~ai~lLllilli--~C~i~R~rGgKY 40 (103)
+.||+-.++..|.+++||++= +=.-.||+=|.|
T Consensus 260 ~~G~isiv~Gliiv~iLii~N~kvEv~ARn~YGpY 294 (304)
T COG4059 260 QGGWISIVAGLIIVLILIIWNRKVEVKARNAYGPY 294 (304)
T ss_pred ccchhhhHHHHHHHHHHHHhcchhhhhhhhccCCc
Confidence 345665555544444443321 112236666666
No 402
>PF14985 TM140: TM140 protein family
Probab=20.44 E-value=1.3e+02 Score=23.16 Aligned_cols=26 Identities=31% Similarity=0.411 Sum_probs=15.0
Q ss_pred HHHHHHHHHHHHHH-HHHHhheeecCC
Q psy17635 12 FIGMLLAIAFLILV-LILVCLIKRNRG 37 (103)
Q Consensus 12 fIg~~~ai~lLlli-lli~C~i~R~rG 37 (103)
|.++..|=++|+|. +.++.|-.|.+-
T Consensus 147 FlAL~~AqalliLlL~~~v~fp~r~~~ 173 (181)
T PF14985_consen 147 FLALGGAQALLILLLMATVVFPPRAKR 173 (181)
T ss_pred HHHHHHHHHHHHHHHHHHHhcCCcccc
Confidence 67788885555544 444555555443
No 403
>PF04246 RseC_MucC: Positive regulator of sigma(E), RseC/MucC; InterPro: IPR007359 This bacterial family of integral membrane proteins represents a positive regulator of the sigma(E) transcription factor, namely RseC/MucC. The sigma(E) transcription factor is up-regulated by cell envelope protein misfolding, and regulates the expression of genes that are collectively termed ECF (devoted to Extra-Cellular Functions) []. In Pseudomonas aeruginosa, derepression of sigma(E) is associated with the alginate-overproducing phenotype characteristic of chronic respiratory tract colonization in cystic fibrosis patients. The mechanism by which RseC/MucC positively regulates the sigma(E) transcription factor is unknown. RseC is also thought to have a role in thiamine biosynthesis in Salmonella typhimurium []. In addition, this family also includes an N-terminal part of RnfF, a Rhodobacter capsulatus protein, of unknown function, that is essential for nitrogen fixation. This protein also contains a domain found in ApbE protein IPR003374 from INTERPRO, which is itself involved in thiamine biosynthesis.
Probab=20.44 E-value=97 Score=21.46 Aligned_cols=24 Identities=21% Similarity=0.227 Sum_probs=13.0
Q ss_pred HHHHHHHHHHHHHHHhheeecCCC
Q psy17635 15 MLLAIAFLILVLILVCLIKRNRGG 38 (103)
Q Consensus 15 ~~~ai~lLllilli~C~i~R~rGg 38 (103)
++++++.+++.++++-+..|+...
T Consensus 100 ~l~~l~~l~~~~~~~~~~~~~~~~ 123 (135)
T PF04246_consen 100 ILGGLLGLALGFLILRLFDRRLKK 123 (135)
T ss_pred HHHHHHHHHHHHHHHHHHHhhhcc
Confidence 445555555556666555554443
No 404
>PRK10884 SH3 domain-containing protein; Provisional
Probab=20.42 E-value=1e+02 Score=23.72 Aligned_cols=26 Identities=31% Similarity=0.188 Sum_probs=16.6
Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHhhe
Q psy17635 7 ATAGWFIGMLLAIAFLILVLILVCLI 32 (103)
Q Consensus 7 at~gWfIg~~~ai~lLllilli~C~i 32 (103)
..+.|+.|-++|.+-|||-+++--++
T Consensus 170 ~~~wf~~Gg~v~~~GlllGlilp~l~ 195 (206)
T PRK10884 170 IMQWFMYGGGVAGIGLLLGLLLPHLI 195 (206)
T ss_pred HHHHHHHchHHHHHHHHHHHHhcccc
Confidence 34566677777766666666665656
No 405
>PHA02851 EEV glycoprotein; Provisional
Probab=20.41 E-value=81 Score=24.53 Aligned_cols=15 Identities=27% Similarity=0.509 Sum_probs=6.0
Q ss_pred HHHHHHHHHHHHHHH
Q psy17635 15 MLLAIAFLILVLILV 29 (103)
Q Consensus 15 ~~~ai~lLllilli~ 29 (103)
+-|.+++|++.++.+
T Consensus 8 vscSliILIlY~llI 22 (223)
T PHA02851 8 VSCSLIILIVYLFLI 22 (223)
T ss_pred HHhHHHHHHHHHHHH
Confidence 334444444443333
No 406
>PRK11466 hybrid sensory histidine kinase TorS; Provisional
Probab=20.33 E-value=87 Score=27.54 Aligned_cols=24 Identities=21% Similarity=0.335 Sum_probs=11.0
Q ss_pred hhHHHHHHHHHHHHHHHHHHHhhe
Q psy17635 9 AGWFIGMLLAIAFLILVLILVCLI 32 (103)
Q Consensus 9 ~gWfIg~~~ai~lLllilli~C~i 32 (103)
..|++.++.+++++++++++..++
T Consensus 329 ~~~~~~~~~~~~~l~~~~~~~~~~ 352 (914)
T PRK11466 329 GQYSLLLLGMVSLCALILILWRVV 352 (914)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345555555544444444443333
No 407
>PRK06287 cobalt transport protein CbiN; Validated
Probab=20.24 E-value=1.3e+02 Score=20.89 Aligned_cols=9 Identities=11% Similarity=0.224 Sum_probs=3.9
Q ss_pred HHhheeecC
Q psy17635 28 LVCLIKRNR 36 (103)
Q Consensus 28 i~C~i~R~r 36 (103)
+-.+++|++
T Consensus 97 ~~~~l~r~~ 105 (107)
T PRK06287 97 VGKIFKKKS 105 (107)
T ss_pred HHHHHhccc
Confidence 334445443
No 408
>PRK14759 potassium-transporting ATPase subunit F; Provisional
Probab=20.24 E-value=1.8e+02 Score=16.25 Aligned_cols=19 Identities=16% Similarity=0.342 Sum_probs=8.0
Q ss_pred HHHHHHHHHHHHHHHHhhe
Q psy17635 14 GMLLAIAFLILVLILVCLI 32 (103)
Q Consensus 14 g~~~ai~lLllilli~C~i 32 (103)
.+-.++++.+++=|+..++
T Consensus 6 ~l~~~va~~L~vYL~~ALl 24 (29)
T PRK14759 6 SLAGAVSLGLLIYLTYALL 24 (29)
T ss_pred HHHHHHHHHHHHHHHHHHh
Confidence 3334444444444444433
No 409
>PF11084 DUF2621: Protein of unknown function (DUF2621); InterPro: IPR020203 This entry represents a group of uncharacterised proteins.
Probab=20.22 E-value=1.4e+02 Score=22.29 Aligned_cols=20 Identities=25% Similarity=0.665 Sum_probs=12.6
Q ss_pred hhHHHHHHHHHHHHHHHHHH
Q psy17635 9 AGWFIGMLLAIAFLILVLIL 28 (103)
Q Consensus 9 ~gWfIg~~~ai~lLllilli 28 (103)
++||..+|..-+++++.+..
T Consensus 3 ~~wFm~fI~~W~~vli~l~~ 22 (141)
T PF11084_consen 3 NGWFMWFILFWVVVLIGLMA 22 (141)
T ss_pred hhHHHHHHHHHHHHHHHHHH
Confidence 57998888775554444433
No 410
>COG1704 LemA Uncharacterized conserved protein [Function unknown]
Probab=20.18 E-value=1.2e+02 Score=23.52 Aligned_cols=14 Identities=14% Similarity=0.425 Sum_probs=6.2
Q ss_pred hHHHHHHHHHHHHH
Q psy17635 10 GWFIGMLLAIAFLI 23 (103)
Q Consensus 10 gWfIg~~~ai~lLl 23 (103)
.|+|++.+.+++++
T Consensus 3 ~~~i~l~vi~il~l 16 (185)
T COG1704 3 RFLIILAVIVILLL 16 (185)
T ss_pred hhHHHHHHHHHHHH
Confidence 35555544433333
No 411
>PRK13865 type IV secretion system protein VirB8; Provisional
Probab=20.09 E-value=1.5e+02 Score=23.16 Aligned_cols=12 Identities=0% Similarity=-0.205 Sum_probs=7.2
Q ss_pred hhhhHHHHHHHH
Q psy17635 7 ATAGWFIGMLLA 18 (103)
Q Consensus 7 at~gWfIg~~~a 18 (103)
...+|+++.+++
T Consensus 37 ~k~aw~va~~~~ 48 (229)
T PRK13865 37 SRVLAAVAGIAV 48 (229)
T ss_pred HHHHHHHHHHHH
Confidence 345787776544
No 412
>KOG2052|consensus
Probab=20.07 E-value=73 Score=28.23 Aligned_cols=20 Identities=30% Similarity=0.446 Sum_probs=10.4
Q ss_pred hhHHHHHHHHHHHHHHHHHH
Q psy17635 9 AGWFIGMLLAIAFLILVLIL 28 (103)
Q Consensus 9 ~gWfIg~~~ai~lLllilli 28 (103)
..|++.++..++++++++++
T Consensus 131 ~~~~~ii~~~~~~~~~~~~~ 150 (513)
T KOG2052|consen 131 VELVAIILGPVVFLLLIAIL 150 (513)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 34555555555555544444
No 413
>KOG4221|consensus
Probab=20.07 E-value=37 Score=33.24 Aligned_cols=13 Identities=46% Similarity=0.779 Sum_probs=5.8
Q ss_pred HHHHHHHHhheee
Q psy17635 22 LILVLILVCLIKR 34 (103)
Q Consensus 22 Lllilli~C~i~R 34 (103)
++++++++||-+|
T Consensus 1073 ~~~~~v~v~C~~r 1085 (1381)
T KOG4221|consen 1073 LVLLLVLVTCRRR 1085 (1381)
T ss_pred HHHhhheeEEecc
Confidence 3344444554444
No 414
>TIGR03321 alt_F1F0_F0_B alternate F1F0 ATPase, F0 subunit B. CC and in principle may run in either direction. This model represents the F0 subunit B of this apparent second ATP synthase.
Probab=20.07 E-value=1e+02 Score=23.66 Aligned_cols=22 Identities=41% Similarity=0.741 Sum_probs=11.8
Q ss_pred HHHHHHHHHHHHHHHHHHHhhe
Q psy17635 11 WFIGMLLAIAFLILVLILVCLI 32 (103)
Q Consensus 11 WfIg~~~ai~lLllilli~C~i 32 (103)
|+..+.-+|.|++|++++..|.
T Consensus 5 ~~t~~~qiInFlil~~lL~kfl 26 (246)
T TIGR03321 5 WFTVIAQLINFLILVWLLKRFL 26 (246)
T ss_pred HHHHHHHHHHHHHHHHHHHHHh
Confidence 5444444566666665554444
No 415
>PRK09848 glucuronide transporter; Provisional
Probab=20.03 E-value=1.4e+02 Score=23.75 Aligned_cols=9 Identities=11% Similarity=0.394 Sum_probs=5.6
Q ss_pred cccchhhhh
Q psy17635 39 KYAVHEREA 47 (103)
Q Consensus 39 KY~VkeKE~ 47 (103)
+|+..+|+.
T Consensus 426 ~y~l~~~~~ 434 (448)
T PRK09848 426 FYPLTDKKF 434 (448)
T ss_pred HeeCCHHHH
Confidence 577766653
Done!