Query psy6549
Match_columns 109
No_of_seqs 102 out of 477
Neff 7.2
Searched_HMMs 46136
Date Fri Aug 16 20:49:27 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy6549.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/6549hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1666|consensus 100.0 1.6E-32 3.5E-37 198.0 14.5 105 2-106 116-220 (220)
2 PF12352 V-SNARE_C: Snare regi 99.7 2.3E-16 4.9E-21 95.6 9.5 66 14-79 1-66 (66)
3 PF03908 Sec20: Sec20; InterP 99.5 4.1E-13 9E-18 86.4 12.9 88 17-104 4-91 (92)
4 KOG3251|consensus 99.5 2.1E-13 4.4E-18 99.2 11.6 87 18-105 126-212 (213)
5 KOG3208|consensus 98.8 1E-07 2.3E-12 69.8 11.4 77 18-94 146-222 (231)
6 KOG0812|consensus 98.5 7.3E-06 1.6E-10 62.3 13.8 91 3-93 208-299 (311)
7 KOG3065|consensus 98.4 4.2E-06 9.1E-11 63.5 10.1 73 5-77 62-135 (273)
8 PF00957 Synaptobrevin: Synapt 98.3 7.3E-05 1.6E-09 47.3 13.1 85 21-105 3-87 (89)
9 smart00397 t_SNARE Helical reg 98.1 7.2E-05 1.6E-09 43.8 8.6 62 13-74 4-65 (66)
10 KOG3202|consensus 97.9 0.00073 1.6E-08 50.4 12.7 89 14-104 145-233 (235)
11 KOG0860|consensus 97.8 0.0012 2.6E-08 44.2 11.7 84 19-102 27-114 (116)
12 COG5074 t-SNARE complex subuni 97.5 0.0014 3E-08 49.0 9.7 80 21-101 185-268 (280)
13 KOG0810|consensus 97.4 0.0046 1E-07 47.6 11.2 81 18-98 203-286 (297)
14 PF05739 SNARE: SNARE domain; 97.4 0.0066 1.4E-07 35.6 9.6 59 20-78 3-61 (63)
15 PF09753 Use1: Membrane fusion 97.3 0.0099 2.1E-07 44.3 12.0 74 31-106 177-251 (251)
16 KOG0811|consensus 97.3 0.015 3.3E-07 44.2 13.0 89 16-104 175-266 (269)
17 COG5325 t-SNARE complex subuni 97.2 0.018 4E-07 43.8 12.9 86 21-106 195-281 (283)
18 PRK10884 SH3 domain-containing 97.2 0.036 7.9E-07 40.5 13.7 67 31-101 124-190 (206)
19 KOG3385|consensus 97.1 0.0052 1.1E-07 41.2 7.5 64 30-94 45-108 (118)
20 KOG2678|consensus 97.0 0.065 1.4E-06 39.8 13.7 75 34-108 168-242 (244)
21 KOG3894|consensus 97.0 0.035 7.5E-07 43.0 12.7 87 17-103 228-314 (316)
22 cd00193 t_SNARE Soluble NSF (N 97.0 0.012 2.7E-07 33.5 8.2 55 19-73 4-58 (60)
23 KOG0809|consensus 95.4 0.29 6.2E-06 37.8 10.0 70 24-93 221-290 (305)
24 PF00957 Synaptobrevin: Synapt 95.2 0.43 9.3E-06 29.8 12.2 82 14-101 3-87 (89)
25 PRK01026 tetrahydromethanopter 95.1 0.44 9.6E-06 29.7 8.5 57 50-106 16-73 (77)
26 PF04210 MtrG: Tetrahydrometha 95.0 0.44 9.5E-06 29.2 8.0 56 50-105 13-69 (70)
27 TIGR01149 mtrG N5-methyltetrah 94.0 0.81 1.8E-05 28.0 8.3 55 51-105 14-69 (70)
28 PF12911 OppC_N: N-terminal TM 93.1 0.16 3.4E-06 29.0 3.3 32 71-102 4-35 (56)
29 COG4064 MtrG Tetrahydromethano 92.9 1.3 2.9E-05 27.1 8.2 54 54-107 20-74 (75)
30 KOG3065|consensus 92.9 1.1 2.4E-05 34.2 8.5 60 15-74 212-271 (273)
31 PF09889 DUF2116: Uncharacteri 92.3 0.58 1.3E-05 27.7 5.0 31 70-100 26-56 (59)
32 KOG0859|consensus 90.5 1.3 2.8E-05 32.5 6.3 63 20-82 124-186 (217)
33 PF07423 DUF1510: Protein of u 89.7 0.48 1E-05 35.0 3.6 30 77-107 8-37 (217)
34 PRK10132 hypothetical protein; 89.5 4.5 9.7E-05 26.7 12.3 59 42-100 44-103 (108)
35 PF12669 P12: Virus attachment 88.8 0.32 6.9E-06 28.6 1.8 12 97-108 12-23 (58)
36 PHA03240 envelope glycoprotein 87.6 0.64 1.4E-05 34.6 3.0 22 85-106 213-234 (258)
37 PF08114 PMP1_2: ATPase proteo 87.5 0.51 1.1E-05 26.0 1.9 18 92-109 20-37 (43)
38 PF01102 Glycophorin_A: Glycop 87.2 1.1 2.3E-05 30.3 3.8 21 85-105 69-89 (122)
39 PF04678 DUF607: Protein of un 87.1 8.7 0.00019 27.2 9.9 53 54-106 62-115 (180)
40 KOG0862|consensus 87.1 10 0.00022 28.1 11.0 67 19-85 132-198 (216)
41 PTZ00382 Variant-specific surf 86.2 0.16 3.4E-06 32.9 -0.7 20 90-109 76-95 (96)
42 PF12777 MT: Microtubule-bindi 85.9 14 0.0003 28.7 9.9 101 6-106 220-327 (344)
43 PF05283 MGC-24: Multi-glycosy 85.7 0.92 2E-05 32.8 3.0 23 85-107 163-185 (186)
44 PF02439 Adeno_E3_CR2: Adenovi 85.6 2 4.4E-05 23.2 3.6 23 86-108 9-31 (38)
45 PHA02844 putative transmembran 84.8 1.7 3.8E-05 26.9 3.5 22 87-108 53-74 (75)
46 PF08693 SKG6: Transmembrane a 84.7 0.53 1.1E-05 25.8 1.0 14 96-109 27-40 (40)
47 PHA02650 hypothetical protein; 84.4 2.3 5E-05 26.6 3.9 21 88-108 55-75 (81)
48 PF00523 Fusion_gly: Fusion gl 84.3 0.68 1.5E-05 38.1 2.0 41 50-90 439-479 (490)
49 PF09889 DUF2116: Uncharacteri 83.7 5.6 0.00012 23.5 5.2 32 75-106 28-59 (59)
50 PTZ00464 SNF-7-like protein; P 83.6 15 0.00033 26.9 9.1 73 4-77 75-149 (211)
51 PF10151 DUF2359: Uncharacteri 83.0 22 0.00048 29.2 10.1 45 62-106 242-286 (469)
52 PF14283 DUF4366: Domain of un 81.2 0.24 5.2E-06 36.6 -1.6 15 95-109 172-186 (218)
53 PF12352 V-SNARE_C: Snare regi 80.8 8.9 0.00019 22.3 9.2 62 7-68 1-62 (66)
54 PRK07021 fliL flagellar basal 79.5 2.8 6.1E-05 29.2 3.4 27 82-108 16-42 (162)
55 PF12575 DUF3753: Protein of u 79.2 3.4 7.4E-05 25.4 3.3 22 84-105 50-71 (72)
56 PF10779 XhlA: Haemolysin XhlA 79.2 11 0.00025 22.6 9.2 21 79-99 47-67 (71)
57 PF05568 ASFV_J13L: African sw 78.9 3.6 7.7E-05 29.0 3.7 22 84-106 32-53 (189)
58 PHA02975 hypothetical protein; 78.7 6.7 0.00014 23.9 4.4 23 84-106 46-68 (69)
59 PF14937 DUF4500: Domain of un 78.2 3.5 7.5E-05 26.2 3.2 37 71-107 23-61 (86)
60 KOG3433|consensus 78.1 24 0.00052 25.7 11.2 71 8-78 89-166 (203)
61 PF11694 DUF3290: Protein of u 77.9 6.5 0.00014 27.4 4.8 32 77-108 10-41 (149)
62 PF15102 TMEM154: TMEM154 prot 77.5 1.1 2.4E-05 31.2 0.9 7 97-103 76-82 (146)
63 KOG1666|consensus 77.3 27 0.00058 25.9 12.6 61 11-72 118-179 (220)
64 PF09753 Use1: Membrane fusion 76.9 27 0.0006 25.8 11.2 61 43-103 178-244 (251)
65 PHA02819 hypothetical protein; 76.4 5.9 0.00013 24.3 3.7 19 89-107 53-71 (71)
66 PF03302 VSP: Giardia variant- 76.2 1.4 3E-05 35.2 1.2 23 87-109 374-396 (397)
67 PF10661 EssA: WXG100 protein 76.0 4.6 9.9E-05 28.0 3.6 21 86-106 121-141 (145)
68 PF06024 DUF912: Nucleopolyhed 75.6 1.2 2.6E-05 28.9 0.6 23 84-106 64-86 (101)
69 PF05961 Chordopox_A13L: Chord 74.6 5.3 0.00011 24.3 3.2 21 84-104 5-25 (68)
70 COG4499 Predicted membrane pro 74.4 4.5 9.8E-05 32.6 3.6 30 78-107 216-245 (434)
71 KOG1656|consensus 74.0 6.9 0.00015 28.9 4.2 60 23-84 67-126 (221)
72 PF05478 Prominin: Prominin; 73.8 61 0.0013 28.3 13.3 87 17-103 338-433 (806)
73 PF03904 DUF334: Domain of unk 73.6 35 0.00077 25.5 11.5 23 32-54 110-132 (230)
74 PF02009 Rifin_STEVOR: Rifin/s 72.8 5.1 0.00011 31.0 3.5 9 97-105 271-279 (299)
75 cd07912 Tweety_N N-terminal do 70.5 56 0.0012 26.5 13.0 28 80-107 205-232 (418)
76 PRK14762 membrane protein; Pro 70.4 5.8 0.00013 19.5 2.2 16 83-98 2-17 (27)
77 PHA02692 hypothetical protein; 70.3 8.8 0.00019 23.5 3.5 17 89-105 53-69 (70)
78 KOG0811|consensus 70.2 20 0.00043 27.4 6.2 17 57-73 209-225 (269)
79 PF12877 DUF3827: Domain of un 70.1 2.2 4.8E-05 36.3 1.1 29 81-109 267-297 (684)
80 KOG0860|consensus 68.7 32 0.0007 23.1 11.8 80 27-106 28-114 (116)
81 PF11395 DUF2873: Protein of u 68.7 14 0.00031 19.9 3.7 20 86-105 11-30 (43)
82 PF02009 Rifin_STEVOR: Rifin/s 68.5 6.6 0.00014 30.4 3.3 22 84-105 262-283 (299)
83 PHA03164 hypothetical protein; 68.1 8.7 0.00019 24.1 3.2 22 83-104 60-81 (88)
84 PRK10787 DNA-binding ATP-depen 67.9 38 0.00083 29.5 8.1 72 1-72 175-255 (784)
85 PF05961 Chordopox_A13L: Chord 67.7 9.1 0.0002 23.3 3.1 20 88-107 5-24 (68)
86 PRK08455 fliL flagellar basal 67.5 5.9 0.00013 28.3 2.7 19 87-106 24-42 (182)
87 PF10883 DUF2681: Protein of u 67.2 11 0.00023 24.1 3.5 18 89-106 9-26 (87)
88 TIGR02161 napC_nirT periplasmi 66.8 17 0.00036 26.1 4.9 8 96-103 26-33 (185)
89 PHA03049 IMV membrane protein; 65.3 11 0.00024 22.8 3.2 19 84-102 5-23 (68)
90 PF10717 ODV-E18: Occlusion-de 65.1 10 0.00022 24.0 3.1 18 87-104 28-45 (85)
91 PHA03054 IMV membrane protein; 65.0 14 0.0003 22.7 3.6 17 89-105 55-71 (72)
92 COG5074 t-SNARE complex subuni 64.7 60 0.0013 24.7 10.1 51 16-69 194-244 (280)
93 PF04999 FtsL: Cell division p 64.4 26 0.00057 21.9 5.1 19 75-93 5-24 (97)
94 PF13131 DUF3951: Protein of u 64.3 13 0.00029 21.4 3.2 15 95-109 17-31 (53)
95 PRK09546 zntB zinc transporter 63.8 63 0.0014 24.6 12.7 17 93-109 306-322 (324)
96 PF15106 TMEM156: TMEM156 prot 63.4 12 0.00026 27.7 3.7 27 80-106 172-198 (226)
97 PF08651 DASH_Duo1: DASH compl 62.8 34 0.00074 21.2 6.6 36 43-78 2-37 (78)
98 PF08802 CytB6-F_Fe-S: Cytochr 62.7 23 0.0005 19.2 4.8 20 74-93 5-24 (39)
99 PRK10884 SH3 domain-containing 62.5 58 0.0013 23.8 14.2 52 54-105 137-191 (206)
100 COG4942 Membrane-bound metallo 61.9 87 0.0019 25.6 9.9 65 14-78 38-102 (420)
101 COG0598 CorA Mg2+ and Co2+ tra 61.8 70 0.0015 24.5 12.7 23 87-109 298-320 (322)
102 PHA03049 IMV membrane protein; 61.5 14 0.0003 22.4 3.1 21 88-108 5-25 (68)
103 PF04639 Baculo_E56: Baculovir 60.8 4.1 8.8E-05 31.5 0.9 17 86-102 282-298 (305)
104 PF00558 Vpu: Vpu protein; In 60.6 15 0.00032 23.2 3.2 9 95-103 16-24 (81)
105 PF11812 DUF3333: Domain of un 60.4 29 0.00063 24.2 5.1 27 74-100 4-30 (155)
106 TIGR00763 lon ATP-dependent pr 60.2 98 0.0021 26.8 9.2 61 3-63 175-244 (775)
107 PF08649 DASH_Dad1: DASH compl 60.1 33 0.00072 20.2 5.4 37 3-39 4-41 (58)
108 PF05545 FixQ: Cbb3-type cytoc 59.6 23 0.00051 19.5 3.7 15 90-104 16-30 (49)
109 PTZ00046 rifin; Provisional 59.5 13 0.00028 29.6 3.5 7 87-93 319-325 (358)
110 PF11346 DUF3149: Protein of u 59.5 16 0.00034 20.1 2.9 21 86-106 13-33 (42)
111 TIGR01478 STEVOR variant surfa 59.5 12 0.00026 29.0 3.2 8 100-107 277-284 (295)
112 PRK10404 hypothetical protein; 59.0 47 0.001 21.5 12.1 64 37-100 32-97 (101)
113 PRK11466 hybrid sensory histid 58.8 1.2E+02 0.0026 26.1 12.4 6 97-102 344-349 (914)
114 PF11669 WBP-1: WW domain-bind 58.6 16 0.00034 23.8 3.3 23 85-107 23-45 (102)
115 PF11031 Phage_holin_T: Bacter 58.5 8.9 0.00019 28.2 2.3 36 70-105 14-49 (216)
116 PTZ00370 STEVOR; Provisional 58.4 13 0.00027 28.8 3.2 8 100-107 273-280 (296)
117 PF01519 DUF16: Protein of unk 58.1 51 0.0011 21.6 6.8 38 42-79 60-97 (102)
118 PF06008 Laminin_I: Laminin Do 57.4 77 0.0017 23.5 10.5 31 9-39 187-217 (264)
119 TIGR01294 P_lamban phospholamb 57.3 33 0.00072 19.3 5.3 26 74-99 22-47 (52)
120 PRK15136 multidrug efflux syst 56.8 13 0.00029 29.2 3.2 22 84-105 23-44 (390)
121 PRK10856 cytoskeletal protein 56.8 11 0.00023 29.5 2.6 27 80-106 108-134 (331)
122 PF13253 DUF4044: Protein of u 56.5 28 0.00061 18.4 3.4 24 80-103 8-31 (35)
123 PF15102 TMEM154: TMEM154 prot 56.4 4.3 9.3E-05 28.3 0.3 15 92-106 67-81 (146)
124 PTZ00446 vacuolar sorting prot 56.2 75 0.0016 23.0 9.7 73 3-76 80-154 (191)
125 PF11932 DUF3450: Protein of u 56.0 80 0.0017 23.3 9.4 57 4-60 49-105 (251)
126 PF01299 Lamp: Lysosome-associ 55.7 9.2 0.0002 29.2 2.1 21 87-108 277-297 (306)
127 PF04478 Mid2: Mid2 like cell 55.6 2.9 6.3E-05 29.4 -0.6 12 95-106 64-75 (154)
128 COG5052 YOP1 Protein involved 55.6 77 0.0017 23.0 6.6 50 49-99 3-52 (186)
129 PF15018 InaF-motif: TRP-inter 55.5 21 0.00047 19.2 2.9 13 94-106 19-31 (38)
130 PF05399 EVI2A: Ectropic viral 54.4 18 0.0004 26.8 3.3 18 86-103 132-149 (227)
131 PHA03099 epidermal growth fact 54.1 12 0.00027 25.6 2.3 10 99-108 119-128 (139)
132 PF15179 Myc_target_1: Myc tar 53.9 23 0.00051 25.7 3.7 26 84-109 25-50 (197)
133 PF05335 DUF745: Protein of un 53.6 83 0.0018 22.7 10.8 61 15-75 110-170 (188)
134 TIGR01195 oadG_fam sodium pump 53.3 25 0.00055 21.9 3.5 18 88-105 14-31 (82)
135 KOG2911|consensus 53.0 1.3E+02 0.0028 24.7 8.6 71 4-74 287-359 (439)
136 PRK11637 AmiB activator; Provi 52.8 1.2E+02 0.0025 24.2 11.9 44 40-83 94-137 (428)
137 PRK05696 fliL flagellar basal 51.8 12 0.00025 26.3 2.0 22 84-105 21-42 (170)
138 COG5415 Predicted integral mem 51.7 1E+02 0.0022 23.1 8.3 32 48-79 14-45 (251)
139 KOG1556|consensus 51.2 45 0.00098 25.6 5.0 38 30-67 165-215 (309)
140 PF13800 Sigma_reg_N: Sigma fa 51.1 46 0.001 20.8 4.5 13 70-82 3-15 (96)
141 PTZ00046 rifin; Provisional 50.6 21 0.00045 28.4 3.4 20 84-103 321-340 (358)
142 TIGR01477 RIFIN variant surfac 50.4 21 0.00046 28.3 3.3 24 84-107 316-339 (353)
143 PF11657 Activator-TraM: Trans 50.3 83 0.0018 21.8 15.1 10 61-70 104-113 (144)
144 COG4068 Uncharacterized protei 50.2 31 0.00066 20.5 3.2 14 80-93 40-53 (64)
145 PF06072 Herpes_US9: Alphaherp 50.1 53 0.0011 19.5 4.4 14 66-79 9-22 (60)
146 PHA03240 envelope glycoprotein 49.9 28 0.0006 26.1 3.7 23 84-106 215-242 (258)
147 PF12495 Vip3A_N: Vegetative i 49.8 83 0.0018 21.7 7.3 85 7-91 38-129 (177)
148 PF04272 Phospholamban: Phosph 49.5 47 0.001 18.7 4.1 16 76-91 24-39 (52)
149 PRK10617 cytochrome c-type pro 48.9 68 0.0015 23.4 5.6 9 95-103 34-42 (200)
150 PF07798 DUF1640: Protein of u 48.7 92 0.002 21.8 14.9 37 32-68 99-139 (177)
151 PF14960 ATP_synth_reg: ATP sy 48.0 29 0.00063 19.7 2.8 11 99-109 39-49 (49)
152 PF13396 PLDc_N: Phospholipase 48.0 11 0.00024 20.4 1.1 24 84-107 22-45 (46)
153 PTZ00208 65 kDa invariant surf 47.8 47 0.001 27.1 4.9 34 75-108 378-412 (436)
154 PF01708 Gemini_mov: Geminivir 47.5 7.1 0.00015 25.1 0.3 22 86-107 39-60 (91)
155 PRK05529 cell division protein 47.4 18 0.00038 27.1 2.4 22 72-93 24-45 (255)
156 PF07835 COX4_pro_2: Bacterial 47.4 28 0.00061 19.2 2.7 15 84-98 24-38 (44)
157 cd01324 cbb3_Oxidase_CcoQ Cyto 47.0 35 0.00076 19.1 3.1 14 94-107 21-34 (48)
158 TIGR03142 cytochro_ccmI cytoch 47.0 79 0.0017 20.6 7.0 16 90-105 99-114 (117)
159 KOG1693|consensus 46.9 1.2E+02 0.0025 22.5 7.9 23 84-106 178-200 (209)
160 COG4736 CcoQ Cbb3-type cytochr 46.2 52 0.0011 19.5 3.8 19 87-106 13-31 (60)
161 PF06459 RR_TM4-6: Ryanodine R 46.0 65 0.0014 24.6 5.3 44 60-106 150-194 (274)
162 PF02419 PsbL: PsbL protein; 45.5 40 0.00088 18.0 2.9 19 85-103 17-35 (37)
163 PF01102 Glycophorin_A: Glycop 45.4 39 0.00085 22.8 3.6 22 85-107 73-94 (122)
164 PF09451 ATG27: Autophagy-rela 45.2 26 0.00056 26.3 3.0 19 84-102 202-220 (268)
165 PF14362 DUF4407: Domain of un 45.1 1.3E+02 0.0029 22.6 12.1 25 83-107 262-286 (301)
166 PHA02947 S-S bond formation pa 44.7 34 0.00074 25.3 3.5 10 70-79 154-163 (215)
167 PF02706 Wzz: Chain length det 44.5 7.4 0.00016 25.6 0.0 22 72-93 5-26 (152)
168 KOG3287|consensus 44.3 1.3E+02 0.0029 22.5 7.1 35 50-84 154-188 (236)
169 PF06008 Laminin_I: Laminin Do 43.6 1.3E+02 0.0029 22.2 10.8 57 14-70 185-241 (264)
170 PHA03395 p10 fibrous body prot 43.4 86 0.0019 20.0 6.1 27 39-65 39-65 (87)
171 PF07889 DUF1664: Protein of u 43.3 1E+02 0.0022 20.8 7.9 36 42-77 75-110 (126)
172 PRK11901 hypothetical protein; 43.2 30 0.00065 27.2 3.1 14 86-99 40-53 (327)
173 PF10140 YukC: WXG100 protein 43.2 12 0.00026 29.7 1.0 38 70-107 181-219 (359)
174 PHA02662 ORF131 putative membr 43.2 42 0.00091 25.0 3.8 31 70-100 163-204 (226)
175 TIGR03521 GldG gliding-associa 43.2 52 0.0011 27.3 4.7 26 80-105 522-547 (552)
176 PF08372 PRT_C: Plant phosphor 43.1 1.1E+02 0.0025 21.4 8.9 33 42-74 59-91 (156)
177 PLN03160 uncharacterized prote 43.1 14 0.00031 27.0 1.3 32 77-108 32-64 (219)
178 PF13800 Sigma_reg_N: Sigma fa 43.0 65 0.0014 20.1 4.3 12 75-86 4-15 (96)
179 KOG4433|consensus 42.9 2E+02 0.0044 24.1 11.8 24 82-105 208-231 (526)
180 PF05337 CSF-1: Macrophage col 42.3 8.4 0.00018 29.6 0.0 23 86-108 231-253 (285)
181 PF09815 XK-related: XK-relate 42.0 40 0.00086 26.0 3.7 21 89-109 312-332 (332)
182 PF15168 TRIQK: Triple QxxK/R 41.8 67 0.0015 20.0 3.9 21 87-107 55-75 (79)
183 PRK06287 cobalt transport prot 41.6 49 0.0011 21.6 3.6 25 84-108 79-103 (107)
184 PF14914 LRRC37AB_C: LRRC37A/B 40.8 42 0.00092 23.5 3.3 18 76-93 113-130 (154)
185 PF03408 Foamy_virus_ENV: Foam 40.7 1.7E+02 0.0036 26.3 7.4 68 25-105 907-974 (981)
186 PF13997 YqjK: YqjK-like prote 40.1 86 0.0019 19.0 7.0 37 41-77 2-38 (73)
187 PF00558 Vpu: Vpu protein; In 39.9 36 0.00077 21.4 2.6 19 89-107 17-35 (81)
188 PF12768 Rax2: Cortical protei 39.9 38 0.00082 25.9 3.2 14 95-108 244-257 (281)
189 COG1422 Predicted membrane pro 39.8 1.5E+02 0.0033 21.8 9.5 35 36-70 59-93 (201)
190 PTZ00087 thrombosponding-relat 39.8 14 0.00031 28.6 0.9 26 84-109 298-323 (340)
191 CHL00038 psbL photosystem II p 39.8 60 0.0013 17.4 3.0 19 85-103 18-36 (38)
192 PF15468 DUF4636: Domain of un 39.6 46 0.001 24.9 3.5 29 78-107 36-64 (243)
193 COG4839 FtsL Protein required 39.4 1E+02 0.0022 20.8 4.8 23 82-104 37-59 (120)
194 TIGR00383 corA magnesium Mg(2+ 39.4 1.6E+02 0.0035 22.0 12.8 17 93-109 300-316 (318)
195 PRK15111 antimicrobial peptide 39.4 65 0.0014 24.4 4.5 30 72-101 16-45 (296)
196 PF01105 EMP24_GP25L: emp24/gp 39.1 8.4 0.00018 26.0 -0.4 23 48-70 111-133 (183)
197 PRK15348 type III secretion sy 39.0 86 0.0019 23.6 5.0 40 66-108 209-248 (249)
198 TIGR03503 conserved hypothetic 39.0 22 0.00049 28.4 1.9 18 92-109 356-373 (374)
199 PF06365 CD34_antigen: CD34/Po 39.0 41 0.00088 24.7 3.1 17 92-108 112-128 (202)
200 TIGR01006 polys_exp_MPA1 polys 38.8 72 0.0016 22.8 4.5 17 72-88 10-26 (226)
201 PRK15041 methyl-accepting chem 38.8 2.2E+02 0.0048 23.4 11.0 67 9-75 428-494 (554)
202 COG4192 Signal transduction hi 38.7 2E+02 0.0043 24.4 7.3 56 23-78 258-327 (673)
203 PF11877 DUF3397: Protein of u 38.6 1.1E+02 0.0024 19.9 5.3 36 73-108 48-83 (116)
204 PF09788 Tmemb_55A: Transmembr 38.4 30 0.00065 26.3 2.4 28 73-100 187-214 (256)
205 PF06716 DUF1201: Protein of u 38.4 77 0.0017 18.0 4.0 7 100-106 25-31 (54)
206 PRK12800 fliF flagellar MS-rin 38.4 1.1E+02 0.0024 25.8 6.0 23 81-104 30-52 (574)
207 PF12409 P5-ATPase: P5-type AT 38.3 53 0.0011 21.5 3.4 23 83-105 16-38 (119)
208 PF06009 Laminin_II: Laminin D 37.8 11 0.00023 25.5 0.0 51 24-74 6-56 (138)
209 PF09716 ETRAMP: Malarial earl 37.6 1E+02 0.0022 19.2 5.3 21 71-91 42-62 (84)
210 KOG3202|consensus 37.6 1.7E+02 0.0038 21.9 11.2 63 4-70 141-208 (235)
211 PF07889 DUF1664: Protein of u 37.5 1.3E+02 0.0028 20.3 8.7 45 21-65 68-112 (126)
212 PF11947 DUF3464: Protein of u 37.1 1.1E+02 0.0024 21.5 4.9 32 76-107 58-89 (153)
213 TIGR02796 tolQ TolQ protein. T 36.9 61 0.0013 23.6 3.8 22 82-103 13-34 (215)
214 PF00015 MCPsignal: Methyl-acc 36.9 1.4E+02 0.003 20.5 12.1 55 14-68 128-182 (213)
215 PF13163 DUF3999: Protein of u 36.9 30 0.00064 27.9 2.4 22 81-102 406-427 (429)
216 PF05478 Prominin: Prominin; 36.8 2.9E+02 0.0063 24.2 12.3 56 18-75 354-409 (806)
217 cd07597 BAR_SNX8 The Bin/Amphi 36.3 1.8E+02 0.0038 21.5 7.1 64 16-79 93-161 (246)
218 KOG2736|consensus 36.3 63 0.0014 26.1 4.0 28 80-107 70-97 (406)
219 KOG4684|consensus 36.3 68 0.0015 24.1 3.9 27 70-96 196-222 (275)
220 PRK00753 psbL photosystem II r 36.1 62 0.0013 17.4 2.8 19 85-103 19-37 (39)
221 PRK10144 formate-dependent nit 36.1 72 0.0016 21.7 3.8 24 83-106 101-124 (126)
222 PRK07193 fliF flagellar MS-rin 36.1 98 0.0021 26.0 5.3 23 81-104 25-47 (552)
223 PF14316 DUF4381: Domain of un 35.7 61 0.0013 21.9 3.5 25 84-108 24-48 (146)
224 PHA02902 putative IMV membrane 35.7 84 0.0018 19.0 3.6 21 84-104 6-26 (70)
225 COG1033 Predicted exporters of 35.4 61 0.0013 28.2 4.1 70 30-107 525-594 (727)
226 PRK10414 biopolymer transport 35.3 66 0.0014 24.1 3.8 25 80-104 21-45 (244)
227 PF14899 DUF4492: Domain of un 35.1 75 0.0016 19.1 3.3 23 85-107 20-42 (64)
228 KOG3637|consensus 34.9 39 0.00084 30.5 2.9 23 86-108 983-1007(1030)
229 PF07010 Endomucin: Endomucin; 34.8 64 0.0014 24.4 3.6 8 99-106 207-214 (259)
230 PF05957 DUF883: Bacterial pro 34.6 1.1E+02 0.0025 18.9 13.1 71 22-92 10-82 (94)
231 PF05454 DAG1: Dystroglycan (D 34.6 13 0.00028 28.7 0.0 9 96-104 163-171 (290)
232 PF04790 Sarcoglycan_1: Sarcog 34.5 62 0.0013 24.6 3.6 17 82-98 10-26 (264)
233 PF10389 CoatB: Bacteriophage 34.5 54 0.0012 18.4 2.5 13 96-108 32-44 (46)
234 PRK06007 fliF flagellar MS-rin 34.5 62 0.0013 27.0 3.9 25 84-108 440-464 (542)
235 PF06679 DUF1180: Protein of u 34.3 66 0.0014 22.7 3.5 16 87-102 99-114 (163)
236 PF06103 DUF948: Bacterial pro 34.2 1.1E+02 0.0024 18.7 10.1 37 32-68 41-77 (90)
237 KOG1326|consensus 34.0 73 0.0016 28.9 4.4 33 71-103 1061-1093(1105)
238 PF06570 DUF1129: Protein of u 33.9 1.5E+02 0.0033 21.2 5.5 27 17-43 11-37 (206)
239 PF05393 Hum_adeno_E3A: Human 33.9 92 0.002 20.0 3.8 21 84-104 37-57 (94)
240 TIGR02797 exbB tonB-system ene 33.9 52 0.0011 23.8 3.1 22 82-103 13-34 (211)
241 KOG1094|consensus 33.8 55 0.0012 28.4 3.5 20 84-103 395-414 (807)
242 PF10732 DUF2524: Protein of u 33.8 1.3E+02 0.0027 19.1 5.5 19 45-63 62-80 (84)
243 PRK15048 methyl-accepting chem 33.7 2.6E+02 0.0056 22.7 10.9 65 11-75 428-492 (553)
244 COG4640 Predicted membrane pro 33.7 55 0.0012 26.7 3.3 21 80-100 48-68 (465)
245 PRK11638 lipopolysaccharide bi 33.7 1.1E+02 0.0023 24.1 4.9 31 72-104 12-42 (342)
246 PF06009 Laminin_II: Laminin D 33.5 14 0.0003 25.0 0.0 40 30-69 5-44 (138)
247 TIGR01432 QOXA cytochrome aa3 33.5 73 0.0016 23.1 3.8 8 100-107 43-50 (217)
248 TIGR01478 STEVOR variant surfa 33.3 64 0.0014 25.0 3.5 22 86-107 266-287 (295)
249 PRK10801 colicin uptake protei 33.0 76 0.0017 23.4 3.8 22 82-103 14-35 (227)
250 PF10183 ESSS: ESSS subunit of 33.0 62 0.0013 20.9 3.0 19 86-104 63-81 (105)
251 PF15145 DUF4577: Domain of un 32.9 68 0.0015 21.6 3.2 22 84-105 62-83 (128)
252 PF09577 Spore_YpjB: Sporulati 32.8 2.1E+02 0.0045 21.4 6.7 23 84-106 201-223 (232)
253 TIGR02866 CoxB cytochrome c ox 32.8 67 0.0015 22.9 3.5 21 85-105 16-36 (201)
254 PF00517 GP41: Retroviral enve 32.7 1.9E+02 0.0042 20.9 8.6 60 26-93 106-165 (204)
255 PF05781 MRVI1: MRVI1 protein; 32.6 1.9E+02 0.004 24.5 6.3 27 64-90 461-487 (538)
256 PF10661 EssA: WXG100 protein 32.6 81 0.0017 21.7 3.7 22 84-105 122-143 (145)
257 PF15361 RIC3: Resistance to i 32.4 60 0.0013 22.5 3.0 22 85-109 87-108 (152)
258 TIGR00206 fliF flagellar basal 32.1 1.5E+02 0.0033 24.9 5.8 22 81-102 22-43 (555)
259 PF10032 Pho88: Phosphate tran 32.1 78 0.0017 22.9 3.7 24 86-109 35-58 (192)
260 TIGR03147 cyt_nit_nrfF cytochr 31.7 95 0.0021 21.1 3.8 25 83-108 101-125 (126)
261 PTZ00370 STEVOR; Provisional 31.6 71 0.0015 24.8 3.5 22 86-107 262-283 (296)
262 PRK11875 psbT photosystem II r 31.5 71 0.0015 16.4 2.4 11 96-106 13-23 (31)
263 PF03381 CDC50: LEM3 (ligand-e 31.3 80 0.0017 23.9 3.8 7 96-102 260-266 (278)
264 PF11353 DUF3153: Protein of u 31.3 71 0.0015 23.0 3.4 14 95-108 194-207 (209)
265 PRK11637 AmiB activator; Provi 31.2 2.7E+02 0.0059 22.1 13.0 67 22-88 69-135 (428)
266 PF10577 UPF0560: Uncharacteri 31.0 59 0.0013 28.7 3.3 21 84-104 277-297 (807)
267 PRK09458 pspB phage shock prot 31.0 85 0.0018 19.4 3.2 18 89-106 8-25 (75)
268 PF04639 Baculo_E56: Baculovir 30.8 45 0.00098 25.9 2.4 27 80-106 273-299 (305)
269 TIGR02185 Trep_Strep conserved 30.8 72 0.0016 22.7 3.3 29 81-109 157-186 (189)
270 PLN03094 Substrate binding sub 30.5 62 0.0013 25.8 3.2 30 75-104 78-107 (370)
271 PHA02955 hypothetical protein; 30.4 52 0.0011 24.3 2.5 9 82-90 179-187 (213)
272 KOG2067|consensus 30.4 1.3E+02 0.0028 24.8 4.9 45 14-60 369-418 (472)
273 PF06363 Picorna_P3A: Picornav 30.2 1.6E+02 0.0034 19.1 5.2 40 69-108 55-94 (100)
274 PF15339 Afaf: Acrosome format 30.2 2E+02 0.0044 20.8 5.4 36 72-107 125-160 (200)
275 PF12259 DUF3609: Protein of u 30.0 33 0.00071 27.2 1.6 21 88-108 305-325 (361)
276 PF02936 COX4: Cytochrome c ox 30.0 1.1E+02 0.0023 21.0 4.0 24 81-104 73-96 (142)
277 COG3167 PilO Tfp pilus assembl 29.9 77 0.0017 23.3 3.3 22 82-103 23-44 (211)
278 PRK09793 methyl-accepting prot 29.7 3.1E+02 0.0067 22.3 10.9 66 11-76 426-491 (533)
279 PRK15053 dpiB sensor histidine 29.7 1.3E+02 0.0027 24.1 4.9 22 72-93 10-31 (545)
280 PF01034 Syndecan: Syndecan do 29.6 18 0.0004 21.7 0.1 6 102-107 33-38 (64)
281 PF13179 DUF4006: Family of un 29.6 87 0.0019 18.9 3.0 15 87-101 16-30 (66)
282 PRK06654 fliL flagellar basal 29.5 70 0.0015 23.1 3.0 13 83-95 32-44 (181)
283 PRK05287 hypothetical protein; 29.4 2.5E+02 0.0053 21.2 6.1 53 35-87 58-120 (250)
284 PRK05585 yajC preprotein trans 29.2 68 0.0015 20.9 2.7 9 97-105 27-35 (106)
285 PF06789 UPF0258: Uncharacteri 28.9 67 0.0015 22.7 2.8 27 69-95 117-143 (159)
286 PF07072 DUF1342: Protein of u 28.9 2.3E+02 0.0051 20.7 6.7 52 35-86 40-102 (211)
287 PRK12785 fliL flagellar basal 28.8 7.2 0.00016 27.4 -2.1 22 84-105 26-47 (166)
288 PRK09821 putative transporter; 28.7 88 0.0019 25.6 3.8 27 83-109 177-203 (454)
289 COG2443 Sss1 Preprotein transl 28.6 1.4E+02 0.003 18.0 6.3 49 55-103 5-56 (65)
290 PF10808 DUF2542: Protein of u 28.5 36 0.00077 21.2 1.2 22 84-107 58-79 (79)
291 PF11887 DUF3407: Protein of u 28.1 2.6E+02 0.0056 21.0 9.9 54 7-60 52-105 (267)
292 PF06682 DUF1183: Protein of u 28.0 65 0.0014 25.2 2.9 10 96-105 167-176 (318)
293 PF11026 DUF2721: Protein of u 27.7 1.9E+02 0.0041 19.2 7.0 23 69-91 49-72 (130)
294 PRK09400 secE preprotein trans 27.7 1.4E+02 0.003 17.6 6.4 48 57-104 6-56 (61)
295 CHL00031 psbT photosystem II p 27.6 73 0.0016 16.6 2.1 11 96-106 13-23 (33)
296 PRK06007 fliF flagellar MS-rin 27.6 1.9E+02 0.004 24.2 5.6 22 80-101 21-42 (542)
297 PF03597 CcoS: Cytochrome oxid 27.2 1.2E+02 0.0026 16.8 3.7 6 88-93 8-13 (45)
298 PF03896 TRAP_alpha: Transloco 27.1 93 0.002 23.9 3.5 21 84-104 208-228 (285)
299 PHA03010 hypothetical protein; 27.1 88 0.0019 25.5 3.5 19 90-108 522-540 (546)
300 PRK15471 chain length determin 26.7 1.8E+02 0.0038 22.7 5.0 33 72-106 20-52 (325)
301 PF10112 Halogen_Hydrol: 5-bro 26.4 2.4E+02 0.0051 19.9 5.8 57 9-67 141-197 (199)
302 KOG0771|consensus 26.2 88 0.0019 25.3 3.3 22 82-103 368-389 (398)
303 cd00922 Cyt_c_Oxidase_IV Cytoc 26.2 1.3E+02 0.0028 20.5 3.8 22 83-104 75-96 (136)
304 PF02790 COX2_TM: Cytochrome C 26.1 1.5E+02 0.0032 17.5 4.1 22 85-106 27-48 (84)
305 COG0342 SecD Preprotein transl 26.0 90 0.0019 25.9 3.5 24 83-106 344-367 (506)
306 PF11174 DUF2970: Protein of u 25.8 1.4E+02 0.0031 17.2 3.6 20 84-103 33-52 (56)
307 PF08656 DASH_Dad3: DASH compl 25.6 1.7E+02 0.0038 18.1 4.1 23 8-30 7-29 (78)
308 PF11044 TMEMspv1-c74-12: Plec 25.5 1.3E+02 0.0029 16.8 3.8 10 97-106 21-30 (49)
309 TIGR00996 Mtu_fam_mce virulenc 25.4 2.8E+02 0.0061 20.5 9.8 25 30-54 211-235 (291)
310 PF15136 UPF0449: Uncharacteri 25.2 2E+02 0.0043 18.7 6.1 19 42-60 74-92 (97)
311 PF12958 DUF3847: Protein of u 25.2 1.9E+02 0.004 18.3 6.8 48 44-91 3-53 (86)
312 PF13198 DUF4014: Protein of u 25.1 1.7E+02 0.0038 17.9 3.8 10 95-104 29-38 (72)
313 PF04834 Adeno_E3_14_5: Early 25.0 87 0.0019 20.4 2.6 11 95-105 37-47 (97)
314 PF07253 Gypsy: Gypsy protein; 25.0 1E+02 0.0022 25.6 3.5 24 85-108 420-443 (472)
315 PRK10913 dipeptide transporter 25.0 2E+02 0.0044 21.8 5.1 28 73-100 20-47 (300)
316 TIGR00327 secE_euk_arch protei 24.9 1.6E+02 0.0034 17.4 4.7 46 60-105 5-53 (61)
317 PHA03386 P10 fibrous body prot 24.8 2E+02 0.0043 18.6 5.4 26 41-66 35-60 (94)
318 PF05531 NPV_P10: Nucleopolyhe 24.8 1.8E+02 0.0039 18.0 7.1 27 40-66 40-66 (75)
319 PF14387 DUF4418: Domain of un 24.7 1.1E+02 0.0023 20.6 3.1 17 75-91 94-110 (124)
320 COG1766 fliF Flagellar basal b 24.7 2E+02 0.0044 24.2 5.3 14 80-93 21-34 (545)
321 PRK11486 flagellar biosynthesi 24.6 1.4E+02 0.0029 20.2 3.6 11 92-102 26-36 (124)
322 PF06738 DUF1212: Protein of u 24.6 2.4E+02 0.0053 19.5 5.2 41 52-92 71-112 (193)
323 PF07584 BatA: Aerotolerance r 24.4 1.3E+02 0.0028 17.9 3.3 7 100-106 21-27 (77)
324 PHA02689 ORF051 putative membr 24.4 1E+02 0.0022 21.0 2.9 11 96-106 39-49 (128)
325 PF11241 DUF3043: Protein of u 24.3 1.1E+02 0.0025 21.8 3.3 7 98-104 119-125 (170)
326 PRK10772 cell division protein 24.3 2E+02 0.0044 18.9 4.3 32 72-104 12-44 (108)
327 PRK10249 phenylalanine transpo 24.1 94 0.002 24.8 3.2 14 96-109 445-458 (458)
328 PRK14750 kdpF potassium-transp 24.0 1.1E+02 0.0025 15.4 3.1 18 86-104 6-23 (29)
329 PF04888 SseC: Secretion syste 24.0 3.2E+02 0.0069 20.6 8.9 16 80-95 57-72 (306)
330 PRK10381 LPS O-antigen length 23.8 2.1E+02 0.0046 22.7 5.1 20 72-91 29-48 (377)
331 PF08655 DASH_Ask1: DASH compl 23.7 1.2E+02 0.0026 18.2 2.9 20 54-73 8-27 (66)
332 PRK13831 conjugal transfer pro 23.7 55 0.0012 26.8 1.8 22 82-104 23-44 (432)
333 PF11023 DUF2614: Protein of u 23.6 1.5E+02 0.0033 19.8 3.6 14 92-105 47-60 (114)
334 PF01405 PsbT: Photosystem II 23.6 1.2E+02 0.0025 15.4 2.7 11 96-106 13-23 (29)
335 PF14991 MLANA: Protein melan- 23.5 16 0.00035 24.5 -1.1 16 92-107 33-48 (118)
336 PRK15082 glutathione ABC trans 23.5 2.3E+02 0.005 21.4 5.1 25 73-97 26-50 (301)
337 PF02411 MerT: MerT mercuric t 23.3 1.9E+02 0.004 19.2 4.0 26 81-106 90-115 (116)
338 TIGR01000 bacteriocin_acc bact 23.1 76 0.0016 25.5 2.5 21 87-107 22-42 (457)
339 COG3087 FtsN Cell division pro 23.1 1.3E+02 0.0028 23.0 3.6 23 86-108 18-40 (264)
340 PRK02935 hypothetical protein; 23.0 1.7E+02 0.0036 19.5 3.6 14 92-105 48-61 (110)
341 PF06422 PDR_CDR: CDR ABC tran 23.0 1.4E+02 0.0029 19.1 3.3 18 79-96 48-65 (103)
342 COG3765 WzzB Chain length dete 22.9 2.4E+02 0.0051 22.5 5.1 33 72-106 21-53 (347)
343 PF07664 FeoB_C: Ferrous iron 22.9 1.5E+02 0.0033 16.5 3.5 18 85-102 4-21 (54)
344 KOG0994|consensus 22.8 6.8E+02 0.015 23.9 10.0 9 37-45 1565-1573(1758)
345 PF13572 DUF4134: Domain of un 22.8 1.4E+02 0.0031 19.2 3.3 22 85-106 46-67 (98)
346 PF10875 DUF2670: Protein of u 22.7 1.4E+02 0.0031 20.3 3.4 13 96-108 32-44 (139)
347 PF04415 DUF515: Protein of un 22.7 66 0.0014 26.2 2.0 29 78-106 26-55 (416)
348 PF05371 Phage_Coat_Gp8: Phage 22.6 1.4E+02 0.003 17.2 2.9 10 97-106 40-49 (52)
349 PF03994 DUF350: Domain of Unk 22.5 1.1E+02 0.0024 16.9 2.5 21 87-107 1-21 (54)
350 PHA02681 ORF089 virion membran 22.5 1.5E+02 0.0033 18.8 3.2 8 97-104 19-26 (92)
351 PRK10927 essential cell divisi 22.5 41 0.00088 26.4 0.8 9 100-108 50-58 (319)
352 KOG3633|consensus 22.5 3.2E+02 0.0069 20.0 5.5 46 1-46 91-152 (219)
353 COG2991 Uncharacterized protei 22.3 1.3E+02 0.0029 18.5 2.9 14 84-97 4-17 (77)
354 TIGR02848 spore_III_AC stage I 22.2 1.9E+02 0.0041 17.3 3.9 33 67-99 18-50 (64)
355 COG4575 ElaB Uncharacterized c 22.0 2.4E+02 0.0053 18.5 11.0 78 22-99 20-99 (104)
356 PF07010 Endomucin: Endomucin; 21.9 1.2E+02 0.0027 22.9 3.2 24 84-107 189-212 (259)
357 PRK00409 recombination and DNA 21.9 5.5E+02 0.012 22.5 8.8 42 12-53 514-555 (782)
358 PHA02955 hypothetical protein; 21.9 1.3E+02 0.0028 22.3 3.3 23 84-106 178-200 (213)
359 PF03554 Herpes_UL73: UL73 vir 21.6 1.5E+02 0.0033 18.5 3.2 19 86-104 51-69 (82)
360 PRK14780 lipoprotein signal pe 21.5 1.5E+02 0.0033 22.6 3.7 28 81-109 203-230 (263)
361 PF09605 Trep_Strep: Hypotheti 21.5 1.4E+02 0.003 21.2 3.3 32 78-109 151-183 (186)
362 PRK15406 oligopeptide ABC tran 21.4 2.8E+02 0.006 21.2 5.2 26 73-98 27-52 (302)
363 PF07111 HCR: Alpha helical co 21.4 5.7E+02 0.012 22.5 11.5 75 9-83 530-623 (739)
364 PF09125 COX2-transmemb: Cytoc 21.2 1.5E+02 0.0033 15.8 3.8 20 82-102 14-33 (38)
365 PF03918 CcmH: Cytochrome C bi 21.1 32 0.00069 23.8 0.0 26 83-108 101-126 (148)
366 PF11712 Vma12: Endoplasmic re 21.0 1.8E+02 0.0039 19.6 3.7 25 82-106 111-135 (142)
367 TIGR02209 ftsL_broad cell divi 20.9 1.4E+02 0.0031 17.8 2.9 15 86-100 6-20 (85)
368 PF09972 DUF2207: Predicted me 20.9 2E+02 0.0043 22.6 4.5 7 100-106 247-253 (511)
369 KOG0810|consensus 20.8 4E+02 0.0088 20.6 10.6 55 49-103 226-288 (297)
370 PRK06870 secG preprotein trans 20.7 1.8E+02 0.0038 17.5 3.3 23 79-101 50-72 (76)
371 MTH00147 ATP8 ATP synthase F0 20.6 1.8E+02 0.0038 16.3 3.7 23 82-104 8-30 (51)
372 MTH00158 ATP8 ATP synthase F0 20.6 1.4E+02 0.003 15.1 3.5 21 83-103 9-29 (32)
373 PHA02967 hypothetical protein; 20.5 1.2E+02 0.0026 20.7 2.6 10 97-106 37-46 (128)
374 PF07234 DUF1426: Protein of u 20.4 2.1E+02 0.0046 18.8 3.7 21 79-100 10-30 (117)
375 PF05454 DAG1: Dystroglycan (D 20.4 34 0.00073 26.4 0.0 23 87-109 151-173 (290)
376 PF05008 V-SNARE: Vesicle tran 20.3 1.8E+02 0.0038 17.2 3.2 21 56-76 25-45 (79)
377 PRK15347 two component system 20.3 1.2E+02 0.0026 26.0 3.2 16 79-94 8-23 (921)
378 PF13125 DUF3958: Protein of u 20.2 2.6E+02 0.0057 18.2 6.4 41 29-69 53-93 (99)
379 PRK14585 pgaD putative PGA bio 20.2 2E+02 0.0043 19.9 3.7 25 80-105 50-74 (137)
No 1
>KOG1666|consensus
Probab=100.00 E-value=1.6e-32 Score=198.00 Aligned_cols=105 Identities=33% Similarity=0.539 Sum_probs=103.6
Q ss_pred chHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHHHHHHHHh
Q psy6549 2 EKIHRETLRETNQALVRTGESIVRSEQAALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSRRMIQAIRRRVFY 81 (109)
Q Consensus 2 ~~~~R~~ll~~~~~l~~t~~~L~~s~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~m~rr~~~ 81 (109)
+.+||.+||+|++++++++++|.+|+|++.|||+||.+|+++|+.|||+|+++++.+.++|+++++|+++|+.|.||+++
T Consensus 116 ~~dQR~rLl~nTerLeRst~rl~ds~Ria~ETEqIG~~IL~dL~~QRe~L~rar~rL~~td~~lgkS~kiL~tM~RR~~~ 195 (220)
T KOG1666|consen 116 SADQRARLLQNTERLERSTDRLKDSQRIALETEQIGSEILEDLHGQREQLERARERLRETDANLGKSRKILTTMTRRLIR 195 (220)
T ss_pred chhHHHHHHhhhHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhhhHHHHHHHHHHHHHHH
Confidence 67999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHh
Q psy6549 82 NKLFLIMIIVIESMTLSVMLYLKIF 106 (109)
Q Consensus 82 ~K~il~~ii~~l~l~i~~viy~k~~ 106 (109)
|||++++||++++++|++++|+||+
T Consensus 196 nk~~~~aii~~l~~~il~ilY~kf~ 220 (220)
T KOG1666|consen 196 NKFTLTAIIALLVLAILLILYSKFT 220 (220)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcC
Confidence 9999999999999999999999995
No 2
>PF12352 V-SNARE_C: Snare region anchored in the vesicle membrane C-terminus; PDB: 1GL2_C 2NPS_C.
Probab=99.70 E-value=2.3e-16 Score=95.57 Aligned_cols=66 Identities=38% Similarity=0.582 Sum_probs=63.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHHHHHHH
Q psy6549 14 QALVRTGESIVRSEQAALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSRRMIQAIRRRV 79 (109)
Q Consensus 14 ~~l~~t~~~L~~s~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~m~rr~ 79 (109)
+.+.+++++|++|.++++||+++|.+|+.+|..|||+|.++++++++++++++.|+++++.|.||.
T Consensus 1 d~l~~e~~~L~~s~~~~~e~~~~g~~~l~~L~~Qre~L~~~~~kl~~i~~~l~~s~~~l~~I~rR~ 66 (66)
T PF12352_consen 1 DRLLRESDSLQRSHRMADETEEIGAATLEDLRSQREQLKRVRDKLDDIDSNLPKSNSLLKRISRRK 66 (66)
T ss_dssp HHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-
T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHccC
Confidence 467899999999999999999999999999999999999999999999999999999999999984
No 3
>PF03908 Sec20: Sec20; InterPro: IPR005606 Sec20 is a membrane glycoprotein associated with secretory pathway.
Probab=99.54 E-value=4.1e-13 Score=86.36 Aligned_cols=88 Identities=14% Similarity=0.231 Sum_probs=82.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHH
Q psy6549 17 VRTGESIVRSEQAALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSRRMIQAIRRRVFYNKLFLIMIIVIESMT 96 (109)
Q Consensus 17 ~~t~~~L~~s~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~m~rr~~~~K~il~~ii~~l~l~ 96 (109)
.+-++.|.++++++.+..+-|..+++.|.+|.++|..+++..+++++.++.|+++++.+.|+..+||+++|+.++++++.
T Consensus 4 ~~vT~~L~rt~~~m~~ev~~s~~t~~~L~~Ss~~L~~~~~e~~~~~~~l~~s~~ll~~l~r~~~~D~~li~~~~~~f~~~ 83 (92)
T PF03908_consen 4 SDVTESLRRTRQMMAQEVERSELTLQTLEESSATLRSTNDEYDGQSSLLKKSRKLLKKLERRDKTDRILIFFAFLFFLLV 83 (92)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHH
Confidence 46789999999999999999999999999999999999999999999999999999999999999999999999888777
Q ss_pred HHHHHHHH
Q psy6549 97 LSVMLYLK 104 (109)
Q Consensus 97 i~~viy~k 104 (109)
+++|+|..
T Consensus 84 v~yI~~rR 91 (92)
T PF03908_consen 84 VLYILWRR 91 (92)
T ss_pred HHHHhhhc
Confidence 77777643
No 4
>KOG3251|consensus
Probab=99.53 E-value=2.1e-13 Score=99.17 Aligned_cols=87 Identities=20% Similarity=0.286 Sum_probs=78.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHH
Q psy6549 18 RTGESIVRSEQAALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSRRMIQAIRRRVFYNKLFLIMIIVIESMTL 97 (109)
Q Consensus 18 ~t~~~L~~s~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~m~rr~~~~K~il~~ii~~l~l~i 97 (109)
+-.+++.+|++.++|.-+.|.+|+++|..||-.|.++++++.++-..+|-|+.+|+.|.||...||+|+|+.++++++++
T Consensus 126 ~~~d~l~~s~~~lDd~l~~G~~ile~l~~Q~~~L~~~~~ki~~~~ntLGlSn~ti~lIeRR~~~Dk~iF~~G~i~~~v~~ 205 (213)
T KOG3251|consen 126 QENDSLKRSHNMLDDLLESGSAILENLVEQRLTLKGTQKKILDILNTLGLSNQTIRLIERRVREDKIIFYGGVILTLVIM 205 (213)
T ss_pred HhhhHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcHHHHHHHHHHHHhhHHHHHHHHHHHHHHH
Confidence 45588999999999999999999999999999999999999999999999999999999999999999999998886644
Q ss_pred HHHHHHHH
Q psy6549 98 SVMLYLKI 105 (109)
Q Consensus 98 ~~viy~k~ 105 (109)
. .+|+||
T Consensus 206 y-l~~~wl 212 (213)
T KOG3251|consen 206 Y-LFYRWL 212 (213)
T ss_pred H-HHHHHh
Confidence 4 444443
No 5
>KOG3208|consensus
Probab=98.82 E-value=1e-07 Score=69.80 Aligned_cols=77 Identities=16% Similarity=0.166 Sum_probs=73.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHH
Q psy6549 18 RTGESIVRSEQAALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSRRMIQAIRRRVFYNKLFLIMIIVIES 94 (109)
Q Consensus 18 ~t~~~L~~s~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~m~rr~~~~K~il~~ii~~l~ 94 (109)
+-.++|++|.+.++|..+++.++-++|+.||..+.+++.++.++..-.---+.++..|.+|--+|-+|+.++|.+++
T Consensus 146 kE~~~in~s~~~vde~Is~A~aTre~l~~Qrs~l~~i~~k~~~~a~r~P~IN~Ll~kIk~kkrrdslILa~Vis~C~ 222 (231)
T KOG3208|consen 146 KEHDHINNSIRLVDELISQAQATRENLHSQRSVLGGINNKVNNIANRFPAINQLLQKIKIKKRRDSLILAAVISVCT 222 (231)
T ss_pred HHhccccchHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHhcchHHHHHHHHHHHhhhhhHHHHHHHHHHH
Confidence 45689999999999999999999999999999999999999999999999999999999999999999999997773
No 6
>KOG0812|consensus
Probab=98.51 E-value=7.3e-06 Score=62.32 Aligned_cols=91 Identities=19% Similarity=0.225 Sum_probs=75.4
Q ss_pred hHHHHHHHhhhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHHHHHHHHh
Q psy6549 3 KIHRETLRETNQ-ALVRTGESIVRSEQAALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSRRMIQAIRRRVFY 81 (109)
Q Consensus 3 ~~~R~~ll~~~~-~l~~t~~~L~~s~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~m~rr~~~ 81 (109)
+++...++..++ -..+=...+++.+....|.-+|=.+...-...|+|.+.|+++++++++.+++-|...|....-|+-.
T Consensus 208 qqqQm~ll~es~~Y~Q~R~~~~q~IEstIsElG~IF~QLA~mVseQ~E~i~RID~nv~ds~lnI~gA~~ellKy~e~vSS 287 (311)
T KOG0812|consen 208 QQQQMALLDESDEYVQERAKTMQNIESTISELGGIFQQLASMVSEQEETIQRIDDNVDDSDLNIEGAHSELLKYFERVSS 287 (311)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhhhHHHHHHHHHHHHHhcc
Confidence 345566666653 3344456677778888888888888888889999999999999999999999999999999999999
Q ss_pred hHHHHHHHHHHH
Q psy6549 82 NKLFLIMIIVIE 93 (109)
Q Consensus 82 ~K~il~~ii~~l 93 (109)
|+|.++=|++++
T Consensus 288 NRwLmvkiF~i~ 299 (311)
T KOG0812|consen 288 NRWLMVKIFGIL 299 (311)
T ss_pred chHHHHHHHHHH
Confidence 999998776655
No 7
>KOG3065|consensus
Probab=98.39 E-value=4.2e-06 Score=63.47 Aligned_cols=73 Identities=21% Similarity=0.236 Sum_probs=63.2
Q ss_pred HHHHHHhhh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHHHHH
Q psy6549 5 HRETLRETN-QALVRTGESIVRSEQAALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSRRMIQAIRR 77 (109)
Q Consensus 5 ~R~~ll~~~-~~l~~t~~~L~~s~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~m~r 77 (109)
..+.|.+.- .....+.++-.++.+++.|+++.|..|+..|..|+|||.++.+.+++++.++.+|.+.+..+..
T Consensus 62 ~~~~l~~e~~~~~~eSl~St~~~L~~~~e~~~~g~~Tl~~L~~Q~eQL~rte~~lD~i~~d~~~~er~l~~l~~ 135 (273)
T KOG3065|consen 62 EVDELEQEIESTAQESLKSTRRMLKLAEESREDGSRTLVMLSEQGEQLERTEKNLDDIKVDLKRAERNLTELKG 135 (273)
T ss_pred HHHHHHHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHhHHhhhhhhHHHHHHHHHHHHHHHH
Confidence 444444433 3346888999999999999999999999999999999999999999999999999999998764
No 8
>PF00957 Synaptobrevin: Synaptobrevin; InterPro: IPR001388 Synaptobrevin is an intrinsic membrane protein of small synaptic vesicles [], specialised secretory organelles of neurons that actively accumulate neurotransmitters and participate in their calcium-dependent release by exocytosis. Vesicle function is mediated by proteins in their membranes, although the precise nature of the protein-protein interactions underlying this are still uncertain []. Synaptobrevin may play a role in the molecular events underlying neurotransmitter release and vesicle recycling and may be involved in the regulation of membrane flow in the nerve terminal, a process mediated by interaction with low molecular weight GTP-binding proteins []. Synaptic vesicle-associated membrane proteins (VAMPs) from Torpedo californica (Pacific electric ray) and SNC1 from yeast are related to synaptobrevin.; GO: 0016192 vesicle-mediated transport, 0016021 integral to membrane; PDB: 3EGX_C 2NUP_C 3EGD_C 2NUT_C 1IOU_A 1H8M_A 3B5N_A 3ZYM_A 2NPS_A 1SFC_E ....
Probab=98.32 E-value=7.3e-05 Score=47.30 Aligned_cols=85 Identities=15% Similarity=0.284 Sum_probs=70.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHH
Q psy6549 21 ESIVRSEQAALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSRRMIQAIRRRVFYNKLFLIMIIVIESMTLSVM 100 (109)
Q Consensus 21 ~~L~~s~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~m~rr~~~~K~il~~ii~~l~l~i~~v 100 (109)
+.+...+.-++|+.++-.+-+..+-+..|.|....++.++...+...=.+.-+.+.|+++-.++-+++++++++++++++
T Consensus 3 dkl~~i~~~v~~v~~im~~Ni~~ll~Rge~L~~L~~kt~~L~~~a~~F~k~a~~l~r~~~~~~~k~~~i~~~iv~~~~~~ 82 (89)
T PF00957_consen 3 DKLEQIQEQVEEVKNIMRENIDKLLERGEKLEELEDKTEELSDNAKQFKKNAKKLKRKMWWRNYKLYIIIIIIVIIIILI 82 (89)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHcCchHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhHHhhhhhhhhH
Confidence 56677788888888888888999999999999999999999998888888888899998888887777777776666666
Q ss_pred HHHHH
Q psy6549 101 LYLKI 105 (109)
Q Consensus 101 iy~k~ 105 (109)
+++-+
T Consensus 83 i~~~~ 87 (89)
T PF00957_consen 83 IIIVI 87 (89)
T ss_dssp HHHTT
T ss_pred HHHHH
Confidence 66544
No 9
>smart00397 t_SNARE Helical region found in SNAREs. All alpha-helical motifs that form twisted and parallel four-helix bundles in target soluble N-ethylmaleimide-sensitive factor (NSF) attachment protein (SNAP) receptor proteins. This motif found in "Q-SNAREs".
Probab=98.07 E-value=7.2e-05 Score=43.80 Aligned_cols=62 Identities=23% Similarity=0.258 Sum_probs=56.9
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHH
Q psy6549 13 NQALVRTGESIVRSEQAALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSRRMIQA 74 (109)
Q Consensus 13 ~~~l~~t~~~L~~s~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~ 74 (109)
...+++..+.|.+..+.+.+..++|.++...+..|.+.|.++..+++.++..+..|.+-++.
T Consensus 4 ~~~~~~~~~~l~~l~~~i~~l~~l~~~i~~~v~~Q~~~ld~i~~~~d~~~~~~~~~~~~l~~ 65 (66)
T smart00397 4 DQMEEERDEELEQLEKSIGELKQIFLDMGTELEEQGEQLDRIEDNVDDADVNLKKANKRLKK 65 (66)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHhc
Confidence 45677888999999999999999999999999999999999999999999999999877654
No 10
>KOG3202|consensus
Probab=97.87 E-value=0.00073 Score=50.38 Aligned_cols=89 Identities=13% Similarity=0.115 Sum_probs=69.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHH
Q psy6549 14 QALVRTGESIVRSEQAALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSRRMIQAIRRRVFYNKLFLIMIIVIE 93 (109)
Q Consensus 14 ~~l~~t~~~L~~s~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~m~rr~~~~K~il~~ii~~l 93 (109)
+-++++...|+.-...+.-..++|..+-++|..|..-|..-...++.+++-+.+..+-+..|.+ ....|=.|++++++
T Consensus 145 qm~~eQDe~Ld~ls~ti~rlk~~a~~~g~EL~~Q~~llDdl~~e~d~t~srl~~~~~~l~~v~~--~~s~~~~~~~il~l 222 (235)
T KOG3202|consen 145 QMLQEQDEGLDGLSATVQRLKGMALAMGEELEEQGRLLDDLDNEMDRTESRLDRVMKRLAKVNR--MASQCSQWCAILLL 222 (235)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HhccccchhHHHHH
Confidence 3445666777777788888899999999999999999999999999999999999999999999 34444444455555
Q ss_pred HHHHHHHHHHH
Q psy6549 94 SMTLSVMLYLK 104 (109)
Q Consensus 94 ~l~i~~viy~k 104 (109)
++++++++..+
T Consensus 223 ~~~~~lvv~i~ 233 (235)
T KOG3202|consen 223 VGLLLLVVIIF 233 (235)
T ss_pred HHHHHHHHHHh
Confidence 45555554433
No 11
>KOG0860|consensus
Probab=97.81 E-value=0.0012 Score=44.19 Aligned_cols=84 Identities=12% Similarity=0.173 Sum_probs=58.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHH----HHHHHHHHHHHHHHhhHHHHHHHHHHHH
Q psy6549 19 TGESIVRSEQAALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVS----RSRRMIQAIRRRVFYNKLFLIMIIVIES 94 (109)
Q Consensus 19 t~~~L~~s~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~----~a~~~l~~m~rr~~~~K~il~~ii~~l~ 94 (109)
+++.+.+++.-++|+.+|-.+=.+..-+--|.|....++.+.....-+ .|.++=+.|-.+.++-++++++++++++
T Consensus 27 ~~~k~~~tq~QvdeVv~IMr~NV~KVlER~ekL~~L~drad~L~~~as~F~~~A~klkrk~wWkn~Km~~il~~v~~i~l 106 (116)
T KOG0860|consen 27 ANDKLQQTQAQVDEVVDIMRENVEKVLERGEKLDELDDRADQLQAGASQFEKTAVKLKRKMWWKNCKMRIILGLVIIILL 106 (116)
T ss_pred hhHHHHHHHHHHHHHHHHHHHhHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 356777777778888887777666666666777666666665544433 3667777777788888888888887776
Q ss_pred HHHHHHHH
Q psy6549 95 MTLSVMLY 102 (109)
Q Consensus 95 l~i~~viy 102 (109)
++|++.+|
T Consensus 107 ~iiii~~~ 114 (116)
T KOG0860|consen 107 VVIIIYIF 114 (116)
T ss_pred HHHHHHHh
Confidence 66655554
No 12
>COG5074 t-SNARE complex subunit, syntaxin [Intracellular trafficking and secretion]
Probab=97.53 E-value=0.0014 Score=49.02 Aligned_cols=80 Identities=15% Similarity=0.129 Sum_probs=64.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHH----HHHHHHHHHHHhhHHHHHHHHHHHHHH
Q psy6549 21 ESIVRSEQAALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSR----RMIQAIRRRVFYNKLFLIMIIVIESMT 96 (109)
Q Consensus 21 ~~L~~s~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~----~~l~~m~rr~~~~K~il~~ii~~l~l~ 96 (109)
+.|...++++.|.-++=...-+....|.|....+.+.+.+...+++.+. +-++. +|.+..+||+-|+|+++.+++
T Consensus 185 ~~ikkiEkt~ael~qLfndm~~~V~eq~e~Vd~I~~~~~~~~~n~~~g~~h~d~Avks-aRaaRkkki~c~gI~~iii~v 263 (280)
T COG5074 185 QEIKKIEKTMAELTQLFNDMEELVIEQQENVDVIDKNVEDAQENVEQGVGHTDKAVKS-ARAARKKKIRCYGICFIIIIV 263 (280)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhcchHHHHHhhHhhHHhhHHHhhhhHHHHHHH-HHHHHhcceehhhhHHHHHHH
Confidence 5677788888888888888888888999999999999999988888865 45566 788899999999888766544
Q ss_pred HHHHH
Q psy6549 97 LSVML 101 (109)
Q Consensus 97 i~~vi 101 (109)
|++|+
T Consensus 264 iv~vv 268 (280)
T COG5074 264 IVVVV 268 (280)
T ss_pred HHHHH
Confidence 44443
No 13
>KOG0810|consensus
Probab=97.37 E-value=0.0046 Score=47.57 Aligned_cols=81 Identities=15% Similarity=0.172 Sum_probs=61.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHH---HHHHHHHhhHHHHHHHHHHHH
Q psy6549 18 RTGESIVRSEQAALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSRRMIQ---AIRRRVFYNKLFLIMIIVIES 94 (109)
Q Consensus 18 ~t~~~L~~s~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~---~m~rr~~~~K~il~~ii~~l~ 94 (109)
.=.+.+.+-++.+.|..++=.+...--..|.|.+.++..+|....+.+++|..-++ .-.+++.+.|||.++++++++
T Consensus 203 ~Rh~~ik~LEksi~ELhqlFlDMa~LVe~QgEmvd~IE~nV~~A~~~V~~g~~~~~kAv~~qkkaRK~k~i~ii~~iii~ 282 (297)
T KOG0810|consen 203 ERHDEIKKLEKSIRELHQLFLDMAVLVESQGEMVDRIENNVENAVDYVEQGVDHLKKAVKYQKKARKWKIIIIIILIIII 282 (297)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhceeeeehHHHHHH
Confidence 33456666777788888888888888889999999999999999999999998888 566666666666655555443
Q ss_pred HHHH
Q psy6549 95 MTLS 98 (109)
Q Consensus 95 l~i~ 98 (109)
++++
T Consensus 283 ~v~v 286 (297)
T KOG0810|consen 283 VVLV 286 (297)
T ss_pred HHHh
Confidence 3333
No 14
>PF05739 SNARE: SNARE domain; InterPro: IPR000727 The process of vesicular fusion with target membranes depends on a set of SNAREs (SNAP-Receptors), which are associated with the fusing membranes [, ]. Target SNAREs (t-SNAREs) are localised on the target membrane and belong to two different families, the syntaxin-like family and the SNAP-25 like family. One member of each family, together with a v-SNARE localised on the vesicular membrane, are required for fusion. The Syntaxins are type-I transmembrane proteins that contain several regions with coiled-coil propensity in their cytosolic part, the SNARE motif. SNAP-25 (IPR000928 from INTERPRO) is a protein consisting of two coiled-coil regions, which is associated with the membrane by lipid anchors. SNARE motifs assemble into parallel four helix bundles stabilised by the burial of these hydrophobic helix faces in the bundle core. Monomeric SNARE motifs are disordered so this assembly reaction is accompanied by a dramatic increase in alpha-helical secondary structure []. The parallel arrangement of SNARE motifs within complexes bring the transmembrane anchors, and the two membranes, into close proximity. Recently, it was shown that the two coiled-coil regions of SNAP-25 and one of the coiled-coil regions of the syntaxins are related []. This domain is found in both Syntaxin and SNAP-25 families as well as in other proteins.; GO: 0005515 protein binding; PDB: 1URQ_B 3RL0_R 1HVV_B 1SFC_B 1N7S_B 3IPD_B 3C98_B 3HD7_F 3RK2_B 1KIL_B ....
Probab=97.37 E-value=0.0066 Score=35.56 Aligned_cols=59 Identities=17% Similarity=0.193 Sum_probs=53.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHHHHHH
Q psy6549 20 GESIVRSEQAALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSRRMIQAIRRR 78 (109)
Q Consensus 20 ~~~L~~s~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~m~rr 78 (109)
.+.|..-...+.+..+++.++-.++..|.+.|.++..+++.+...+..+.+-|....+.
T Consensus 3 d~~l~~l~~~i~~l~~~~~~i~~ev~~Q~~~ld~i~~~vd~~~~~l~~~~~~l~ka~~~ 61 (63)
T PF05739_consen 3 DEELDELEQSIQELKQMFQDIGEEVEEQNEMLDRIEDNVDRANENLKKGNKKLKKALKY 61 (63)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 45677788888999999999999999999999999999999999999999988887765
No 15
>PF09753 Use1: Membrane fusion protein Use1; InterPro: IPR019150 This entry represents a family of proteins, approximately 300 residues in length, involved in vesicle transport. They have a single C-terminal transmembrane domain and a SNARE [soluble NSF (N-ethylmaleimide-sensitive fusion protein) attachment protein receptor] domain of approximately 60 residues. The SNARE domains are essential for membrane fusion and are conserved from yeasts to humans. Use1 is one of the three protein subunits that make up the SNARE complex and it is specifically required for Golgi-endoplasmic reticulum retrograde transport [].
Probab=97.29 E-value=0.0099 Score=44.28 Aligned_cols=74 Identities=12% Similarity=0.210 Sum_probs=51.2
Q ss_pred HHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHHHHHHHHhhHHHHHHHHH-HHHHHHHHHHHHHHh
Q psy6549 31 LEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSRRMIQAIRRRVFYNKLFLIMIIV-IESMTLSVMLYLKIF 106 (109)
Q Consensus 31 ~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~m~rr~~~~K~il~~ii~-~l~l~i~~viy~k~~ 106 (109)
....+-....-..|....+.|.++...++.--+.++.+..-++.+.++-+. |+.|++++ ++++.|+.|+|.|+|
T Consensus 177 ~~LK~~s~~~~~~l~~D~~~L~~~~~~~d~n~~~l~~~~~rl~~~~~~~~~--~~~~~~i~~v~~~Fi~mvl~iri~ 251 (251)
T PF09753_consen 177 RQLKENSLAFSQILKEDNKVLDRTEEGLDRNLSSLKRESKRLKEHSSKSWG--CWTWLMIFVVIIVFIMMVLFIRIF 251 (251)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc--HHHHHHHHHHHHHHHHHHHHheeC
Confidence 333444444556788899999999999999999999999889998776555 66665444 333444444555543
No 16
>KOG0811|consensus
Probab=97.29 E-value=0.015 Score=44.18 Aligned_cols=89 Identities=10% Similarity=0.100 Sum_probs=64.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHHHHHH---HHhhHHHHHHHHHH
Q psy6549 16 LVRTGESIVRSEQAALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSRRMIQAIRRR---VFYNKLFLIMIIVI 92 (109)
Q Consensus 16 l~~t~~~L~~s~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~m~rr---~~~~K~il~~ii~~ 92 (109)
++.-.+.+.+-++-+.|..+|=.+-..--+.|.+.+..+.++++....++..|..-|+.=++- ...-+|++.+++++
T Consensus 175 ieeR~q~I~~lE~dI~dvN~IFkdL~~lV~eQG~~VDsIe~nve~a~~nveqg~~~L~kA~~yq~~~~k~~~~ll~v~~~ 254 (269)
T KOG0811|consen 175 IEEREQAIEQLEADIIDVNEIFKDLGSLVHEQGELVDSIEANVENASVNVEQGTENLRKAAKYQRKARKKKCILLLVGGP 254 (269)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCchhhhhHHHHH
Confidence 344556667777777777777777777788999999999999999999999999888775443 33333555555555
Q ss_pred HHHHHHHHHHHH
Q psy6549 93 ESMTLSVMLYLK 104 (109)
Q Consensus 93 l~l~i~~viy~k 104 (109)
.++++++++|+.
T Consensus 255 v~lii~l~i~~~ 266 (269)
T KOG0811|consen 255 VGLIIGLIIAGI 266 (269)
T ss_pred HHHHHHHHHHHh
Confidence 555566666654
No 17
>COG5325 t-SNARE complex subunit, syntaxin [Intracellular trafficking and secretion]
Probab=97.24 E-value=0.018 Score=43.80 Aligned_cols=86 Identities=17% Similarity=0.138 Sum_probs=65.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHH-HHHHHHhhHHHHHHHHHHHHHHHHH
Q psy6549 21 ESIVRSEQAALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSRRMIQA-IRRRVFYNKLFLIMIIVIESMTLSV 99 (109)
Q Consensus 21 ~~L~~s~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~-m~rr~~~~K~il~~ii~~l~l~i~~ 99 (109)
+.+.+-.+-..|..++=.+.-.=..+|.+...+++-+++.+..++.-|++-|.. +..+-.++||-+|+.+++++++.++
T Consensus 195 ~eI~~l~~gI~Eln~IF~dL~~lV~eQG~lVdrID~Ni~~t~~n~k~A~kEL~kA~~hqrrt~k~~~~~Llil~vv~lfv 274 (283)
T COG5325 195 EEIKNLARGIYELNEIFRDLGSLVGEQGELVDRIDFNIENTSDNLKNANKELEKAPAHQRRTKKCRFYLLLILLVVLLFV 274 (283)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHhhhhhhhhHHHHhhHHHHHHhHHHHhhhccchhhHHHHHHHHHHHH
Confidence 455555666666666666655556799999999999999999999999876654 5666788999999888888777766
Q ss_pred HHHHHHh
Q psy6549 100 MLYLKIF 106 (109)
Q Consensus 100 viy~k~~ 106 (109)
.+..+..
T Consensus 275 ~l~~kl~ 281 (283)
T COG5325 275 SLIKKLR 281 (283)
T ss_pred HHHHHhc
Confidence 6665554
No 18
>PRK10884 SH3 domain-containing protein; Provisional
Probab=97.19 E-value=0.036 Score=40.53 Aligned_cols=67 Identities=18% Similarity=0.156 Sum_probs=37.3
Q ss_pred HHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHH
Q psy6549 31 LEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSRRMIQAIRRRVFYNKLFLIMIIVIESMTLSVML 101 (109)
Q Consensus 31 ~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~m~rr~~~~K~il~~ii~~l~l~i~~vi 101 (109)
.+..+--.+...+|..+.++|. ..+......++..+.-+....+....+ ||+||..++++.+++.+|
T Consensus 124 ~~~~~~~~~~~~~L~~~n~~L~---~~l~~~~~~~~~l~~~~~~~~~~~~~~-wf~~Gg~v~~~GlllGli 190 (206)
T PRK10884 124 QQKVAQSDSVINGLKEENQKLK---NQLIVAQKKVDAANLQLDDKQRTIIMQ-WFMYGGGVAGIGLLLGLL 190 (206)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHchHHHHHHHHHHHH
Confidence 3333334455555666666663 345556666666666666666666555 555555555545454444
No 19
>KOG3385|consensus
Probab=97.07 E-value=0.0052 Score=41.15 Aligned_cols=64 Identities=16% Similarity=0.105 Sum_probs=49.1
Q ss_pred HHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHH
Q psy6549 30 ALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSRRMIQAIRRRVFYNKLFLIMIIVIES 94 (109)
Q Consensus 30 ~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~m~rr~~~~K~il~~ii~~l~ 94 (109)
+.-...+.-+|-.+...|..-|.+.++..+.+.+-|+.+-.-++.|+|+ =.-+...|.++++++
T Consensus 45 V~aLKsLs~dIg~Ev~~qnklld~mdddfdsts~~L~gtm~r~~~~ar~-sg~~l~~~m~~f~lV 108 (118)
T KOG3385|consen 45 VKALKSLSLDIGDEVRTQNKLLDGMDDDFDSTSGFLSGTMGRLKTMARR-SGISLLCWMAVFSLV 108 (118)
T ss_pred HHHHHHHHHHhccccchHHHHHHHhccchhhhHHHHHHHHHHHHHHHhc-CCcchHHHHHHHHHH
Confidence 3334556667777889999999999999999999999999999999999 333444445554443
No 20
>KOG2678|consensus
Probab=97.04 E-value=0.065 Score=39.81 Aligned_cols=75 Identities=17% Similarity=0.211 Sum_probs=60.6
Q ss_pred HHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcc
Q psy6549 34 EAIGTGVISELGVQREALVRARDRLVDTDLHVSRSRRMIQAIRRRVFYNKLFLIMIIVIESMTLSVMLYLKIFKK 108 (109)
Q Consensus 34 e~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~m~rr~~~~K~il~~ii~~l~l~i~~viy~k~~~~ 108 (109)
.+.+...-.-|....+.+.++...+|.-...|+..+.-+..-.+.-..+++-...+|++++..|..|++.+||||
T Consensus 168 KtnalAfqsalkeDnQvl~~~~k~~D~N~~~L~~~Serve~y~ksk~s~wf~~~miI~v~~sFVsMiliiqifkk 242 (244)
T KOG2678|consen 168 KTNALAFQSALKEDNQVLGAAEKGIDVNSQGLMDVSERVEKYDKSKLSYWFYITMIIFVILSFVSMILIIQIFKK 242 (244)
T ss_pred HHhHHHHHHHHHhhHHHHHHHHHHHhHHHHHHHhhhHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 334444456788899999999999999888899888888888888777777777777777788888899999986
No 21
>KOG3894|consensus
Probab=97.02 E-value=0.035 Score=42.99 Aligned_cols=87 Identities=13% Similarity=0.135 Sum_probs=73.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHH
Q psy6549 17 VRTGESIVRSEQAALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSRRMIQAIRRRVFYNKLFLIMIIVIESMT 96 (109)
Q Consensus 17 ~~t~~~L~~s~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~m~rr~~~~K~il~~ii~~l~l~ 96 (109)
+...+-....++.+-|...+-.-..+.+..|-+.|..+++.+.++..++..++.-++...++....+.++.+.++++-.+
T Consensus 228 n~~~devrqie~~lvEI~~Lq~ifsehvl~Q~~~Id~I~d~~~~~teNIk~gNe~irka~~~~~~~r~~~lf~llvlsf~ 307 (316)
T KOG3894|consen 228 NELLDEVRQIEKRLVEISALQDIFSEHVLQQDQNIDLIHDLQSGATENIKDGNEEIRKAKRNNGGLRVFLLFFLLVLSFS 307 (316)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhhhhHHHHHHHHHhcccchhHHHHHHHHHHHH
Confidence 34456666777888899999999999999999999999999999999999999999999999887777777777777666
Q ss_pred HHHHHHH
Q psy6549 97 LSVMLYL 103 (109)
Q Consensus 97 i~~viy~ 103 (109)
+.++-||
T Consensus 308 lLFldwy 314 (316)
T KOG3894|consen 308 LLFLDWY 314 (316)
T ss_pred HHHHhhc
Confidence 6666554
No 22
>cd00193 t_SNARE Soluble NSF (N-ethylmaleimide-sensitive fusion protein)-Attachment protein (SNAP) REceptor domain; these alpha-helical motifs form twisted and parallel heterotetrameric helix bundles; the core complex contains one helix from a protein that is anchored in the vesicle membrane (synaptobrevin), one helix from a protein of the target membrane (syntaxin), and two helices from another protein anchored in the target membrane (SNAP-25); their interaction forms a core which is composed of a polar zero layer, a flanking leucine-zipper layer acts as a water tight shield to isolate ionic interactions in the zero layer from the surrounding solvent
Probab=97.02 E-value=0.012 Score=33.53 Aligned_cols=55 Identities=24% Similarity=0.272 Sum_probs=49.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHH
Q psy6549 19 TGESIVRSEQAALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSRRMIQ 73 (109)
Q Consensus 19 t~~~L~~s~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~ 73 (109)
..+.+......+.+..+++.++...+..|.+.|.++..+++.+...++.+.+-++
T Consensus 4 ~~~~l~~l~~~i~~l~~l~~~i~~~v~~Q~~~ld~i~~~~~~~~~~~~~~~~~l~ 58 (60)
T cd00193 4 RDEELEQLEASIGELKQIFLDLGTEVEEQGELLDRIEDNVDNADVNVKRANKRLK 58 (60)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 3456777888888999999999999999999999999999999999999987665
No 23
>KOG0809|consensus
Probab=95.43 E-value=0.29 Score=37.77 Aligned_cols=70 Identities=11% Similarity=0.093 Sum_probs=50.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHH
Q psy6549 24 VRSEQAALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSRRMIQAIRRRVFYNKLFLIMIIVIE 93 (109)
Q Consensus 24 ~~s~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~m~rr~~~~K~il~~ii~~l 93 (109)
..-.+-..|..++=.+--.-..+|.-.+.|++-+++.+...++.|.+-|....+--..+|-..++.++++
T Consensus 221 ~ql~~sI~dL~~if~DL~~lVvdQGtvvDRIDyNvEqt~~~v~~a~keL~KAe~yQk~~~k~~~i~~L~l 290 (305)
T KOG0809|consen 221 TQLVESIYDLNQIFKDLSALVVDQGTVVDRIDYNVEQTQVRVEDALKELHKAERYQKRNKKMKVILMLTL 290 (305)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccchhheecchhhhhhhHHhHHHHHHHHHHHHhcCCceEehHHHHH
Confidence 3333344444444444444456999999999999999999999999999998888777765554444433
No 24
>PF00957 Synaptobrevin: Synaptobrevin; InterPro: IPR001388 Synaptobrevin is an intrinsic membrane protein of small synaptic vesicles [], specialised secretory organelles of neurons that actively accumulate neurotransmitters and participate in their calcium-dependent release by exocytosis. Vesicle function is mediated by proteins in their membranes, although the precise nature of the protein-protein interactions underlying this are still uncertain []. Synaptobrevin may play a role in the molecular events underlying neurotransmitter release and vesicle recycling and may be involved in the regulation of membrane flow in the nerve terminal, a process mediated by interaction with low molecular weight GTP-binding proteins []. Synaptic vesicle-associated membrane proteins (VAMPs) from Torpedo californica (Pacific electric ray) and SNC1 from yeast are related to synaptobrevin.; GO: 0016192 vesicle-mediated transport, 0016021 integral to membrane; PDB: 3EGX_C 2NUP_C 3EGD_C 2NUT_C 1IOU_A 1H8M_A 3B5N_A 3ZYM_A 2NPS_A 1SFC_E ....
Probab=95.17 E-value=0.43 Score=29.82 Aligned_cols=82 Identities=13% Similarity=0.217 Sum_probs=54.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHHHHHHHHhhHHHHHHHH
Q psy6549 14 QALVRTGESIVRSEQAALEAEAIGT---GVISELGVQREALVRARDRLVDTDLHVSRSRRMIQAIRRRVFYNKLFLIMII 90 (109)
Q Consensus 14 ~~l~~t~~~L~~s~~~~~ete~iG~---~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~m~rr~~~~K~il~~ii 90 (109)
+++.+..+.+++.+..+.+.-+--. +-+++|..+.|.|........ ..|+++=+.|..+-+.-++++.+++
T Consensus 3 dkl~~i~~~v~~v~~im~~Ni~~ll~Rge~L~~L~~kt~~L~~~a~~F~------k~a~~l~r~~~~~~~k~~~i~~~iv 76 (89)
T PF00957_consen 3 DKLEQIQEQVEEVKNIMRENIDKLLERGEKLEELEDKTEELSDNAKQFK------KNAKKLKRKMWWRNYKLYIIIIIIV 76 (89)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHcCchHHHHHHHHHHHHHHhHHHH------HHHHHHHHHHHHHHHHHHHhHHhhh
Confidence 4555666666666666666544322 335667777777665544444 4677777788889999988888888
Q ss_pred HHHHHHHHHHH
Q psy6549 91 VIESMTLSVML 101 (109)
Q Consensus 91 ~~l~l~i~~vi 101 (109)
++++++|++++
T Consensus 77 ~~~~~~i~~~~ 87 (89)
T PF00957_consen 77 IIIILIIIIVI 87 (89)
T ss_dssp HHHHHHHHHTT
T ss_pred hhhhhHHHHHH
Confidence 88877766654
No 25
>PRK01026 tetrahydromethanopterin S-methyltransferase subunit G; Provisional
Probab=95.11 E-value=0.44 Score=29.70 Aligned_cols=57 Identities=18% Similarity=0.219 Sum_probs=41.0
Q ss_pred HHHHHhhhHhhhHHHHHHHH-HHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh
Q psy6549 50 ALVRARDRLVDTDLHVSRSR-RMIQAIRRRVFYNKLFLIMIIVIESMTLSVMLYLKIF 106 (109)
Q Consensus 50 ~L~~~~~~~~~i~~~l~~a~-~~l~~m~rr~~~~K~il~~ii~~l~l~i~~viy~k~~ 106 (109)
......+++|+++.-+.-++ .+.+...++.-+|==|+|++++-+++.++++.+..++
T Consensus 16 d~~~i~~rLD~iEeKVEftn~Ei~Qr~GkkvGRDiGIlYG~viGlli~~i~~~~~~~~ 73 (77)
T PRK01026 16 DFKEIQKRLDEIEEKVEFTNAEIFQRIGKKVGRDIGILYGLVIGLLIVLVYIILSPIF 73 (77)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34556677777777777655 4566778888889899999988776766666665554
No 26
>PF04210 MtrG: Tetrahydromethanopterin S-methyltransferase, subunit G ; InterPro: IPR005866 This model describes the N5-methyltetrahydromethanopterin: coenzyme M methyltransferase subunit G in methanogenic archaea. This methyltranferase is a membrane-associated enzyme complex that uses methyl-transfer reaction to drive a sodium-ion pump. Archaea have evolved energy-yielding pathways marked by one-carbon biochemistry featuring novel cofactors and enzymes. This transferase is involved in the transfer of a methyl group from N5-methyltetrahydromethanopterin to coenzyme M. In an accompanying reaction, methane is produced by two-electron reduction of the methyl moiety in methyl-coenzyme M by another enzyme methyl-coenzyme M reductase.; GO: 0030269 tetrahydromethanopterin S-methyltransferase activity, 0015948 methanogenesis, 0016021 integral to membrane
Probab=95.02 E-value=0.44 Score=29.17 Aligned_cols=56 Identities=9% Similarity=0.084 Sum_probs=37.9
Q ss_pred HHHHHhhhHhhhHHHHHHHH-HHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHH
Q psy6549 50 ALVRARDRLVDTDLHVSRSR-RMIQAIRRRVFYNKLFLIMIIVIESMTLSVMLYLKI 105 (109)
Q Consensus 50 ~L~~~~~~~~~i~~~l~~a~-~~l~~m~rr~~~~K~il~~ii~~l~l~i~~viy~k~ 105 (109)
+...+.+++++++.-+.-++ .+-+...++.-+|-=|+|++++-+++.+++++..++
T Consensus 13 ~~~~i~~rLd~iEeKvEf~~~Ei~Qr~GkkiGRDiGIlYG~v~Glii~~~~~~l~~~ 69 (70)
T PF04210_consen 13 DFNEIMKRLDEIEEKVEFTNAEIAQRAGKKIGRDIGILYGLVIGLIIFIIYIVLSSM 69 (70)
T ss_pred HHHHHHHHHHHHHHHHHhHHHHHHHHHhHHhhhHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 34455666666666666544 456678888889999999998776655555554443
No 27
>TIGR01149 mtrG N5-methyltetrahydromethanopterin:coenzyme M methyltransferase subunit G. coenzyme M methyltransferase subunit G in methanogenic archaea. This methyltranfersae is membrane-associated enzyme complex that uses methyl-transfer reaction to drive sodium-ion pump. Archaea have evolved energy-yielding pathways marked by one-carbon biochemistry featuring novel cofactors and enzymes. This transferase is involved in the transfer of 'methyl' group from N5-methyltetrahydromethanopterin to coenzyme M. In an accompanying reaction, methane is produced by two-electron reduction of the methyl moiety in methyl-coenzyme M by another enzyme methyl-coenzyme M reductase.
Probab=94.02 E-value=0.81 Score=27.98 Aligned_cols=55 Identities=16% Similarity=0.126 Sum_probs=36.8
Q ss_pred HHHHhhhHhhhHHHHHHHH-HHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHH
Q psy6549 51 LVRARDRLVDTDLHVSRSR-RMIQAIRRRVFYNKLFLIMIIVIESMTLSVMLYLKI 105 (109)
Q Consensus 51 L~~~~~~~~~i~~~l~~a~-~~l~~m~rr~~~~K~il~~ii~~l~l~i~~viy~k~ 105 (109)
.....+++++++.-+.-.+ .+.+...++.-+|-=|+|++++-+++.+++.+.+.+
T Consensus 14 ~~~i~~rLd~iEeKVEf~~~E~~Qr~Gkk~GRDiGIlYG~viGlli~~~~~~l~~~ 69 (70)
T TIGR01149 14 FNEVMKRLDEIEEKVEFVNGEVAQRIGKKVGRDIGILYGLVIGLILFLIYILLSSM 69 (70)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhHHhhhHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 3455666666666666544 456677888888888999988776555555554443
No 28
>PF12911 OppC_N: N-terminal TM domain of oligopeptide transport permease C
Probab=93.10 E-value=0.16 Score=29.01 Aligned_cols=32 Identities=25% Similarity=0.265 Sum_probs=25.2
Q ss_pred HHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHH
Q psy6549 71 MIQAIRRRVFYNKLFLIMIIVIESMTLSVMLY 102 (109)
Q Consensus 71 ~l~~m~rr~~~~K~il~~ii~~l~l~i~~viy 102 (109)
..+.+-+|..+||..+++.+++++++++.++.
T Consensus 4 ~~~~~~~~f~~nk~a~~gl~il~~~vl~ai~~ 35 (56)
T PF12911_consen 4 PWKDAWRRFRRNKLAVIGLIILLILVLLAIFA 35 (56)
T ss_pred HHHHHHHHHHhCchHHHHHHHHHHHHHHHHHH
Confidence 46788899999999999988887666655543
No 29
>COG4064 MtrG Tetrahydromethanopterin S-methyltransferase, subunit G [Coenzyme metabolism]
Probab=92.91 E-value=1.3 Score=27.12 Aligned_cols=54 Identities=15% Similarity=0.151 Sum_probs=34.0
Q ss_pred HhhhHhhhHHHHHHHH-HHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy6549 54 ARDRLVDTDLHVSRSR-RMIQAIRRRVFYNKLFLIMIIVIESMTLSVMLYLKIFK 107 (109)
Q Consensus 54 ~~~~~~~i~~~l~~a~-~~l~~m~rr~~~~K~il~~ii~~l~l~i~~viy~k~~~ 107 (109)
+++.+++++.-++=.+ .+-+.+.++.-+|-=|+|++++-|++..++++..+.|+
T Consensus 20 ~~kRLdeieekvef~~~Ev~Qr~GkkiGRDIGILYGlVIGlil~~i~~~l~~~~~ 74 (75)
T COG4064 20 IHKRLDEIEEKVEFVNGEVYQRIGKKIGRDIGILYGLVIGLILCMIYILLGVAFR 74 (75)
T ss_pred HHHHHHHHHHHHHhhHHHHHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 3444444444444333 23567788888898999998887766666666555543
No 30
>KOG3065|consensus
Probab=92.88 E-value=1.1 Score=34.23 Aligned_cols=60 Identities=20% Similarity=0.194 Sum_probs=52.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHH
Q psy6549 15 ALVRTGESIVRSEQAALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSRRMIQA 74 (109)
Q Consensus 15 ~l~~t~~~L~~s~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~ 74 (109)
.-++.-..|.+...++.....+|.+.=.++..|.+.|.+++++++..+..+..+++-++.
T Consensus 212 ~edeiD~NL~qis~~lg~LK~mA~dmg~Eie~Qn~~Ld~I~~k~d~~d~~v~~~n~R~~k 271 (273)
T KOG3065|consen 212 AEDEIDENLDQLSAILGRLKNMALDMGSEIESQNERLDRIEDKVDRLDLRVDKANKRAKK 271 (273)
T ss_pred hhhHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhHHHhHHHHHHHhhhhHHHHHHHHHHh
Confidence 334666788888899999999999999999999999999999999999999998876654
No 31
>PF09889 DUF2116: Uncharacterized protein containing a Zn-ribbon (DUF2116); InterPro: IPR019216 This entry contains various hypothetical prokaryotic proteins whose functions are unknown. They contain a conserved zinc ribbon motif in the N-terminal part and a predicted transmembrane segment in the C-terminal part.
Probab=92.28 E-value=0.58 Score=27.73 Aligned_cols=31 Identities=10% Similarity=0.112 Sum_probs=19.2
Q ss_pred HHHHHHHHHHHhhHHHHHHHHHHHHHHHHHH
Q psy6549 70 RMIQAIRRRVFYNKLFLIMIIVIESMTLSVM 100 (109)
Q Consensus 70 ~~l~~m~rr~~~~K~il~~ii~~l~l~i~~v 100 (109)
.....-.+|+.+.++++++++++++++.+++
T Consensus 26 ~~~~k~qk~~~~~~~i~~~~~i~~l~v~~~~ 56 (59)
T PF09889_consen 26 EEYRKRQKRMRKTQYIFFGIFILFLAVWIFM 56 (59)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3445555667777788887777664444433
No 32
>KOG0859|consensus
Probab=90.54 E-value=1.3 Score=32.50 Aligned_cols=63 Identities=11% Similarity=0.123 Sum_probs=47.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHHHHHHHHhh
Q psy6549 20 GESIVRSEQAALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSRRMIQAIRRRVFYN 82 (109)
Q Consensus 20 ~~~L~~s~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~m~rr~~~~ 82 (109)
.+.|.+.+..+.|..++-.+=.+..-+-.|.|+=.-++..+..++-..=++.-+.+.|.++-.
T Consensus 124 id~lskvkaqv~evk~vM~eNIekvldRGekiELLVdKTenl~~~s~~fr~q~r~~~r~mw~~ 186 (217)
T KOG0859|consen 124 ISKLAKVKAQVTEVKGVMMENIEKVLDRGEKIELLVDKTENLRSKSFDFRTQGRKLRRKMWFQ 186 (217)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHhccCeEEeeechhhhhhhhhHHHHHHHHHHHHHHHHh
Confidence 577888888888888888888888888888888877777777777666666666666665433
No 33
>PF07423 DUF1510: Protein of unknown function (DUF1510); InterPro: IPR009988 This family consists of several hypothetical bacterial proteins of around 200 residues in length. The function of this family is unknown.
Probab=89.65 E-value=0.48 Score=35.03 Aligned_cols=30 Identities=23% Similarity=0.392 Sum_probs=21.1
Q ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy6549 77 RRVFYNKLFLIMIIVIESMTLSVMLYLKIFK 107 (109)
Q Consensus 77 rr~~~~K~il~~ii~~l~l~i~~viy~k~~~ 107 (109)
+|-..|+++=++|++++ |+|++|.|+.||.
T Consensus 8 KrRK~N~iLNiaI~IV~-lLIiiva~~lf~~ 37 (217)
T PF07423_consen 8 KRRKTNKILNIAIGIVS-LLIIIVAYQLFFG 37 (217)
T ss_pred HhhhhhhhHHHHHHHHH-HHHHHHhhhheec
Confidence 45567777767776666 6677788888873
No 34
>PRK10132 hypothetical protein; Provisional
Probab=89.47 E-value=4.5 Score=26.70 Aligned_cols=59 Identities=8% Similarity=0.023 Sum_probs=27.5
Q ss_pred HHHHHh-hHHHHHHhhhHhhhHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHH
Q psy6549 42 SELGVQ-REALVRARDRLVDTDLHVSRSRRMIQAIRRRVFYNKLFLIMIIVIESMTLSVM 100 (109)
Q Consensus 42 ~~L~~Q-re~L~~~~~~~~~i~~~l~~a~~~l~~m~rr~~~~K~il~~ii~~l~l~i~~v 100 (109)
.+|+.+ ...|..+++.+.+.......++........-+-.|-|--++|...+.++++++
T Consensus 44 ~~lR~r~~~~L~~ar~~l~~~~~~~~~~~~a~~~~~~~V~~~Pw~svgiaagvG~llG~L 103 (108)
T PRK10132 44 EAARRKAQALLKETRARMHGRTRVQQAARDAVGCADTFVRERPWCSVGTAAAVGIFIGAL 103 (108)
T ss_pred HHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhCcHHHHHHHHHHHHHHHHH
Confidence 444433 23455555566655554333443333333333445555566555544444433
No 35
>PF12669 P12: Virus attachment protein p12 family
Probab=88.83 E-value=0.32 Score=28.63 Aligned_cols=12 Identities=17% Similarity=0.229 Sum_probs=5.9
Q ss_pred HHHHHHHHHhcc
Q psy6549 97 LSVMLYLKIFKK 108 (109)
Q Consensus 97 i~~viy~k~~~~ 108 (109)
++++++++|+|+
T Consensus 12 ~~~v~~r~~~k~ 23 (58)
T PF12669_consen 12 VAYVAIRKFIKD 23 (58)
T ss_pred HHHHHHHHHHHH
Confidence 334455565543
No 36
>PHA03240 envelope glycoprotein M; Provisional
Probab=87.58 E-value=0.64 Score=34.59 Aligned_cols=22 Identities=18% Similarity=0.386 Sum_probs=14.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHh
Q psy6549 85 FLIMIIVIESMTLSVMLYLKIF 106 (109)
Q Consensus 85 il~~ii~~l~l~i~~viy~k~~ 106 (109)
..|+++++++++|++++++||=
T Consensus 213 ~~WIiilIIiIiIIIL~cfKiP 234 (258)
T PHA03240 213 IAWIFIAIIIIIVIILFFFKIP 234 (258)
T ss_pred HhHHHHHHHHHHHHHHHHHhcc
Confidence 4566666666667777777763
No 37
>PF08114 PMP1_2: ATPase proteolipid family; InterPro: IPR012589 This family consists of small proteolipids associated with the plasma membrane H+ ATPase. Two proteolipids (PMP1 and PMP2) are associated with the ATPase and both genes are similarly expressed in the wild-type strain of yeast. No modification of the level of transcription of one PMP gene is detected in a strain deleted of the other. Though both proteolipids show similarity with other small proteolipids associated with other cation -transporting ATPases, their functions remain unclear [].
Probab=87.53 E-value=0.51 Score=26.02 Aligned_cols=18 Identities=11% Similarity=0.278 Sum_probs=12.1
Q ss_pred HHHHHHHHHHHHHHhccC
Q psy6549 92 IESMTLSVMLYLKIFKKR 109 (109)
Q Consensus 92 ~l~l~i~~viy~k~~~~~ 109 (109)
+-+.+++.++|.||..+|
T Consensus 20 v~i~iva~~iYRKw~aRk 37 (43)
T PF08114_consen 20 VGIGIVALFIYRKWQARK 37 (43)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 444566777999997543
No 38
>PF01102 Glycophorin_A: Glycophorin A; InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=87.24 E-value=1.1 Score=30.34 Aligned_cols=21 Identities=5% Similarity=-0.007 Sum_probs=8.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q psy6549 85 FLIMIIVIESMTLSVMLYLKI 105 (109)
Q Consensus 85 il~~ii~~l~l~i~~viy~k~ 105 (109)
|.+|+++-++++|++++|+-.
T Consensus 69 Ii~gv~aGvIg~Illi~y~ir 89 (122)
T PF01102_consen 69 IIFGVMAGVIGIILLISYCIR 89 (122)
T ss_dssp HHHHHHHHHHHHHHHHHHHHH
T ss_pred hhHHHHHHHHHHHHHHHHHHH
Confidence 333444433344444444433
No 39
>PF04678 DUF607: Protein of unknown function, DUF607; InterPro: IPR006769 This entry represents the C-terminal domain of coiled-coil domain containing protein 109.
Probab=87.13 E-value=8.7 Score=27.24 Aligned_cols=53 Identities=13% Similarity=0.204 Sum_probs=34.3
Q ss_pred HhhhHhhhHHHHHHHHHHHHHHHHHHHhh-HHHHHHHHHHHHHHHHHHHHHHHh
Q psy6549 54 ARDRLVDTDLHVSRSRRMIQAIRRRVFYN-KLFLIMIIVIESMTLSVMLYLKIF 106 (109)
Q Consensus 54 ~~~~~~~i~~~l~~a~~~l~~m~rr~~~~-K~il~~ii~~l~l~i~~viy~k~~ 106 (109)
....++....++..-...-..|..++... +.++|++++++++-.+++.|+.|.
T Consensus 62 l~~~l~~~~~el~~le~~k~~id~~A~~~~~~~~w~gl~~l~~q~~~l~rLTf~ 115 (180)
T PF04678_consen 62 LRKRLEELRQELAPLEKIKQEIDEKAEKRARRLLWGGLALLVVQFGILARLTFW 115 (180)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 34455566666666555556666665333 677788888887777777777664
No 40
>KOG0862|consensus
Probab=87.08 E-value=10 Score=28.06 Aligned_cols=67 Identities=10% Similarity=0.135 Sum_probs=50.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHHHHHHHHhhHHH
Q psy6549 19 TGESIVRSEQAALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSRRMIQAIRRRVFYNKLF 85 (109)
Q Consensus 19 t~~~L~~s~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~m~rr~~~~K~i 85 (109)
+...+..-.....++..+-..-++++-.-.|.|........+...+-..-.+..+.|.++..-+++-
T Consensus 132 ~~~n~~~~n~el~~v~~im~~niedvl~rg~~l~~l~~~~s~l~~~s~~y~~~a~~in~~sl~~~~a 198 (216)
T KOG0862|consen 132 SQRNLLKLNQELQDVQRIMVENLEDVLQRGEVLNALSSMASELSSESRKYPKTAKGINRKSLIRKYA 198 (216)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhHHHHHhhchHHHhhhhhhhcccHHHHhhHHHHHHHHHHHHHHHHH
Confidence 3566677777777888888888899888888888877777776666656667777888887777665
No 41
>PTZ00382 Variant-specific surface protein (VSP); Provisional
Probab=86.22 E-value=0.16 Score=32.88 Aligned_cols=20 Identities=0% Similarity=0.174 Sum_probs=10.1
Q ss_pred HHHHHHHHHHHHHHHHhccC
Q psy6549 90 IVIESMTLSVMLYLKIFKKR 109 (109)
Q Consensus 90 i~~l~l~i~~viy~k~~~~~ 109 (109)
++++.+++++++|+.++|+|
T Consensus 76 ~~~v~~lv~~l~w~f~~r~k 95 (96)
T PTZ00382 76 VAVVGGLVGFLCWWFVCRGK 95 (96)
T ss_pred hhHHHHHHHHHhheeEEeec
Confidence 33333444555566555654
No 42
>PF12777 MT: Microtubule-binding stalk of dynein motor; InterPro: IPR024743 The 380 kDa motor unit of dynein belongs to the AAA class of chaperone-like ATPases. The core of the 380 kDa motor unit contains a concatenated chain of six AAA modules (D1-6), of which four correspond to the ATP binding sites with P-loop signatures, and two are modules in which the P loop has been lost in evolution. This domain occurs between D4 and D5 and includes the two predicted alpha-helical coiled coil segments that form the stalk supporting the ATP-sensitive microtubule binding component [].; PDB: 3VKH_A 3VKG_A 3ERR_A 4AKI_A 4AI6_B 4AKH_A 4AKG_A 3QMZ_A 2RR7_A.
Probab=85.94 E-value=14 Score=28.73 Aligned_cols=101 Identities=15% Similarity=0.187 Sum_probs=62.7
Q ss_pred HHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHHHHHHHHh----
Q psy6549 6 RETLRETNQALVRTGESIVRSEQAALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSRRMIQAIRRRVFY---- 81 (109)
Q Consensus 6 R~~ll~~~~~l~~t~~~L~~s~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~m~rr~~~---- 81 (109)
|.++-...+.++.+...|...+..+.+.++-=...-.++..--.........+..+..-+.+|.+++...+.-..+
T Consensus 220 ~~~l~~a~~~l~~~~~~L~~~~~~l~~l~~~l~~l~~~~~~~~~e~~~l~~~~~~~~~kl~rA~~Li~~L~~E~~RW~~~ 299 (344)
T PF12777_consen 220 RQKLEEAEAELEEAEEQLAEKQAELAELEEKLAALQKEYEEAQKEKQELEEEIEETERKLERAEKLISGLSGEKERWSEQ 299 (344)
T ss_dssp HHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHCCHCH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccHHHHHhhhcchhhhHHHH
Confidence 5566666677777778888777777776655444444444444445566666788888899999999886654211
Q ss_pred ---hHHHHHHHHHHHHHHHHHHHHHHHh
Q psy6549 82 ---NKLFLIMIIVIESMTLSVMLYLKIF 106 (109)
Q Consensus 82 ---~K~il~~ii~~l~l~i~~viy~k~~ 106 (109)
-+--+..+++-.+++-+++.|+..|
T Consensus 300 ~~~l~~~~~~l~GD~llaaa~isY~G~f 327 (344)
T PF12777_consen 300 IEELEEQLKNLVGDSLLAAAFISYLGPF 327 (344)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHCCCCT
T ss_pred HHHHHHHhcccHHHHHHHHHHHHHcCCC
Confidence 1111233444445566667776544
No 43
>PF05283 MGC-24: Multi-glycosylated core protein 24 (MGC-24); InterPro: IPR007947 CD164 is a mucin-like receptor, or sialomucin, with specificity in receptor/ ligand interactions that depends on the structural characteristics of the mucin-like receptor. Its functions include mediating, or regulating, haematopoietic progenitor cell adhesion and the negative regulation of their growth and/or-differentiation. It exists in the native state as a disulphide- linked homodimer of two 80-85kDa subunits. It is usually expressed by CD34+ and CD341o/- haematopoietic stem cells and associated microenvironmental cells. It contains, in its extracellular region, two mucin domains (I and II) linked by a non-mucin domain, which has been predicted to contain intra- disulphide bridges. This receptor may play a key role in haematopoiesis by facilitating the adhesion of human CD34+ cells to bone marrow stroma and by negatively regulating CD34+ CD341o/- haematopoietic progenitor cell proliferation. These effects involve the CD164 class I and/or II epitopes recognised by the monoclonal antibodies (mAbs) 105A5 and 103B2/9E10. These epitopes are carbohydrate-dependent and are located on the N-terminal mucin domain I [, ]. It has been found that murine MGC-24v and rat endolyn share significant sequence similarities with human CD164. However, CD164 lacks the consensus glycosaminoglycan (GAG)-attachment site found in MGC-24; it is possible that GAG-association is responsible for the high molecular weight of the epithelial-derived MGC-24 glycoprotein []. Genomic structure studies have placed CD164 within the mucin-subgroup that comprises multiple exons, and demonstrate the diverse chromosomal distribution of this family of molecules. Molecules with such multiple exons may have sophisticated regulatory mechanisms that involve not only post-translational modifications of the oligosaccharide side chains, but also differential exon usage. Although differences in the intron and exon sizes are seen between the mouse and human genes, the predicted proteins are similar in size and structure, maintaining functionally important motifs that regulate cell proliferation or subcellular distribution []. CD164 is a gene whose expression depends on differential usage of poly- adenylation sites within the 3'-UTR. The conserved distribution of the 3.2- and 1.2-kb CD164 transcripts between mouse and human suggests that (i) a mechanism may exist to regulate tissue-specific polyadenylation, and (ii) differences in polyadenylation are important for the expression and function of CD164 in different tissues. Two other aspects of the structure of CD164 are of particular interest. First, it shares one of several conserved features of a cytokine-binding pocket - in this respect, it is notable that evidence exists for a class of cell-surface sialomucin modulators that directly interact with growth factor receptors to regulate their response to physiological ligands. Second, its cytoplasmic tail contains a C-terminal YHTL motif found in many endocytic membrane proteins or receptors. These Tyr-based motifs bind to adaptor proteins, which mediate the sorting of membrane proteins into transport vesicles from the plasma membrane to the endosomes, and between intracellular compartments.
Probab=85.71 E-value=0.92 Score=32.80 Aligned_cols=23 Identities=17% Similarity=0.267 Sum_probs=15.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhc
Q psy6549 85 FLIMIIVIESMTLSVMLYLKIFK 107 (109)
Q Consensus 85 il~~ii~~l~l~i~~viy~k~~~ 107 (109)
|+.||++.|.+.-++++.|||.|
T Consensus 163 FiGGIVL~LGv~aI~ff~~KF~k 185 (186)
T PF05283_consen 163 FIGGIVLTLGVLAIIFFLYKFCK 185 (186)
T ss_pred hhhHHHHHHHHHHHHHHHhhhcc
Confidence 45566666666556667778876
No 44
>PF02439 Adeno_E3_CR2: Adenovirus E3 region protein CR2; InterPro: IPR003470 Early region 3 (E3) of human adenoviruses (Ads) codes for proteins that appear to control viral interactions with the host []. This region called CR1 (conserved region 1) [] is found three times in Human adenovirus 19 (a subgroup D adenovirus) 49 kDa protein in the E3 region. CR1 is also found in the 20.1 Kd protein of subgroup B adenoviruses. The function of this 80 amino acid region is unknown. This region is probably a divergent immunoglobulin domain.
Probab=85.59 E-value=2 Score=23.22 Aligned_cols=23 Identities=17% Similarity=0.366 Sum_probs=11.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHhcc
Q psy6549 86 LIMIIVIESMTLSVMLYLKIFKK 108 (109)
Q Consensus 86 l~~ii~~l~l~i~~viy~k~~~~ 108 (109)
..++++.++++++.++||-+-.|
T Consensus 9 Iv~V~vg~~iiii~~~~YaCcyk 31 (38)
T PF02439_consen 9 IVAVVVGMAIIIICMFYYACCYK 31 (38)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHc
Confidence 34444445555555666555444
No 45
>PHA02844 putative transmembrane protein; Provisional
Probab=84.82 E-value=1.7 Score=26.87 Aligned_cols=22 Identities=32% Similarity=0.492 Sum_probs=15.0
Q ss_pred HHHHHHHHHHHHHHHHHHHhcc
Q psy6549 87 IMIIVIESMTLSVMLYLKIFKK 108 (109)
Q Consensus 87 ~~ii~~l~l~i~~viy~k~~~~ 108 (109)
++++.++++++...+|+|..++
T Consensus 53 i~i~~v~~~~~~~flYLK~~~r 74 (75)
T PHA02844 53 LTIIFVVFATFLTFLYLKAVPR 74 (75)
T ss_pred HHHHHHHHHHHHHHHHHheecC
Confidence 4455556666777789888765
No 46
>PF08693 SKG6: Transmembrane alpha-helix domain; InterPro: IPR014805 SKG6 and AXL2 are membrane proteins that show polarised intracellular localisation [, ]. This entry represents the highly conserved transmembrane alpha-helical domain found in these proteins [, ]. The full-length AXL2 protein has a negative regulatory function in cytokinesis [].
Probab=84.67 E-value=0.53 Score=25.81 Aligned_cols=14 Identities=21% Similarity=0.439 Sum_probs=7.3
Q ss_pred HHHHHHHHHHhccC
Q psy6549 96 TLSVMLYLKIFKKR 109 (109)
Q Consensus 96 ~i~~viy~k~~~~~ 109 (109)
++++++|++++|+|
T Consensus 27 vl~~~l~~~~rR~k 40 (40)
T PF08693_consen 27 VLGAFLFFWYRRKK 40 (40)
T ss_pred HHHHHhheEEeccC
Confidence 34445555566654
No 47
>PHA02650 hypothetical protein; Provisional
Probab=84.36 E-value=2.3 Score=26.64 Aligned_cols=21 Identities=14% Similarity=0.297 Sum_probs=14.0
Q ss_pred HHHHHHHHHHHHHHHHHHhcc
Q psy6549 88 MIIVIESMTLSVMLYLKIFKK 108 (109)
Q Consensus 88 ~ii~~l~l~i~~viy~k~~~~ 108 (109)
.+++++++++.+.+|+|..++
T Consensus 55 ~i~~v~i~~l~~flYLK~~~r 75 (81)
T PHA02650 55 LIFSLIIVALFSFFVFKGYTR 75 (81)
T ss_pred HHHHHHHHHHHHHHHHHHhcc
Confidence 344455566677789988765
No 48
>PF00523 Fusion_gly: Fusion glycoprotein F0; InterPro: IPR000776 The fusion glycoproteins from this family are found in ssRNA negative-strand viruses. This protein directs fusion of viral and cellular membranes, resulting in viral penetration, and can direct fusion of infected cells with adjoining cells, resulting in the formation of syncytia. The mature form is a dimer of polypeptides F1 and F2 linked by a disulphide bond [].; GO: 0006948 induction by virus of host cell-cell fusion; PDB: 2FYZ_D 3MAW_B 4DAG_A 1G5G_D 1SVF_A 2B9B_A 1G2C_M 3RRT_A 3RRR_D 3RKI_A ....
Probab=84.26 E-value=0.68 Score=38.07 Aligned_cols=41 Identities=15% Similarity=0.362 Sum_probs=23.7
Q ss_pred HHHHHhhhHhhhHHHHHHHHHHHHHHHHHHHhhHHHHHHHH
Q psy6549 50 ALVRARDRLVDTDLHVSRSRRMIQAIRRRVFYNKLFLIMII 90 (109)
Q Consensus 50 ~L~~~~~~~~~i~~~l~~a~~~l~~m~rr~~~~K~il~~ii 90 (109)
.|..+...+++..+.+++|+++|+.+......+..+...++
T Consensus 439 eL~~vn~sL~~A~~~L~~Sn~iL~~v~~~~~~~~~iI~liV 479 (490)
T PF00523_consen 439 ELGQVNNSLNNAKDLLDKSNQILDSVNPGISSNSIIIILIV 479 (490)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHTTTT--------------
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcccccchHHHHHHHH
Confidence 46778888889999999999999999987777766554443
No 49
>PF09889 DUF2116: Uncharacterized protein containing a Zn-ribbon (DUF2116); InterPro: IPR019216 This entry contains various hypothetical prokaryotic proteins whose functions are unknown. They contain a conserved zinc ribbon motif in the N-terminal part and a predicted transmembrane segment in the C-terminal part.
Probab=83.66 E-value=5.6 Score=23.52 Aligned_cols=32 Identities=19% Similarity=0.263 Sum_probs=22.5
Q ss_pred HHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh
Q psy6549 75 IRRRVFYNKLFLIMIIVIESMTLSVMLYLKIF 106 (109)
Q Consensus 75 m~rr~~~~K~il~~ii~~l~l~i~~viy~k~~ 106 (109)
..++.-+.+-..++..+++++++++++|..|+
T Consensus 28 ~~k~qk~~~~~~~i~~~~~i~~l~v~~~~~~~ 59 (59)
T PF09889_consen 28 YRKRQKRMRKTQYIFFGIFILFLAVWIFMTFF 59 (59)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC
Confidence 44455556667788888887777777877764
No 50
>PTZ00464 SNF-7-like protein; Provisional
Probab=83.61 E-value=15 Score=26.94 Aligned_cols=73 Identities=4% Similarity=0.005 Sum_probs=45.4
Q ss_pred HHHHHHHhhhHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHHHHH
Q psy6549 4 IHRETLRETNQALVRTGESIVRSEQA--ALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSRRMIQAIRR 77 (109)
Q Consensus 4 ~~R~~ll~~~~~l~~t~~~L~~s~~~--~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~m~r 77 (109)
.+..++......+++....|+.++-. +.++-..|..+|..++.+= .+..+.+-++++.+.+..++.+=..+++
T Consensus 75 ~ql~~l~~q~~nleq~~~~ie~a~~~~~vv~amk~g~kaLK~~~k~i-~id~Vd~l~Dei~E~~e~~~EI~e~Ls~ 149 (211)
T PTZ00464 75 NQQDMMMQQQFNMDQLQFTTESVKDTKVQVDAMKQAAKTLKKQFKKL-NVDKVEDLQDELADLYEDTQEIQEIMGR 149 (211)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-CHHHHHHHHHHHHHHHHHHHHHHHHHhC
Confidence 44555555566666666666665543 3356667777777777765 5667777777777776666655544443
No 51
>PF10151 DUF2359: Uncharacterised conserved protein (DUF2359); InterPro: IPR019308 This is a 450 amino acid region of a family of proteins conserved from insects to humans. The function is not known.
Probab=82.95 E-value=22 Score=29.23 Aligned_cols=45 Identities=7% Similarity=0.056 Sum_probs=32.4
Q ss_pred HHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh
Q psy6549 62 DLHVSRSRRMIQAIRRRVFYNKLFLIMIIVIESMTLSVMLYLKIF 106 (109)
Q Consensus 62 ~~~l~~a~~~l~~m~rr~~~~K~il~~ii~~l~l~i~~viy~k~~ 106 (109)
+..+..+++..+.+..|+-..+|.-...+++++++++.+++|-+.
T Consensus 242 ~~~lk~~dk~Ck~il~K~~~~~c~w~~l~llllvliaG~l~yDv~ 286 (469)
T PF10151_consen 242 DESLKECDKACKVILGKMSGSSCPWTRLLLLLLVLIAGFLAYDVR 286 (469)
T ss_pred hHHHHHHHHHHHHHHHhhcCCCCchHHHHHHHHHHHHHHHHHhhh
Confidence 346677889999999998877766555555555666677777653
No 52
>PF14283 DUF4366: Domain of unknown function (DUF4366)
Probab=81.22 E-value=0.24 Score=36.60 Aligned_cols=15 Identities=20% Similarity=0.483 Sum_probs=10.2
Q ss_pred HHHHHHHHHHHhccC
Q psy6549 95 MTLSVMLYLKIFKKR 109 (109)
Q Consensus 95 l~i~~viy~k~~~~~ 109 (109)
++.++..||||.|+|
T Consensus 172 ~gGGa~yYfK~~K~K 186 (218)
T PF14283_consen 172 IGGGAYYYFKFYKPK 186 (218)
T ss_pred hhcceEEEEEEeccc
Confidence 344555888888876
No 53
>PF12352 V-SNARE_C: Snare region anchored in the vesicle membrane C-terminus; PDB: 1GL2_C 2NPS_C.
Probab=80.76 E-value=8.9 Score=22.31 Aligned_cols=62 Identities=15% Similarity=0.097 Sum_probs=50.2
Q ss_pred HHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHH
Q psy6549 7 ETLRETNQALVRTGESIVRSEQAALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRS 68 (109)
Q Consensus 7 ~~ll~~~~~l~~t~~~L~~s~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a 68 (109)
+.|++.++.++++...++++..++.++-+-=..=-+.|..=+..+..+...+...+.-+..-
T Consensus 1 d~l~~e~~~L~~s~~~~~e~~~~g~~~l~~L~~Qre~L~~~~~kl~~i~~~l~~s~~~l~~I 62 (66)
T PF12352_consen 1 DRLLRESDSLQRSHRMADETEEIGAATLEDLRSQREQLKRVRDKLDDIDSNLPKSNSLLKRI 62 (66)
T ss_dssp HHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHH
Confidence 46788889999999999999999988865555556778888899999999988888877653
No 54
>PRK07021 fliL flagellar basal body-associated protein FliL; Reviewed
Probab=79.49 E-value=2.8 Score=29.20 Aligned_cols=27 Identities=26% Similarity=0.313 Sum_probs=15.7
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhcc
Q psy6549 82 NKLFLIMIIVIESMTLSVMLYLKIFKK 108 (109)
Q Consensus 82 ~K~il~~ii~~l~l~i~~viy~k~~~~ 108 (109)
.|+++++++++++++.++..||+|.++
T Consensus 16 kkl~ii~l~~l~l~~~g~gg~~~~~~~ 42 (162)
T PRK07021 16 RKLWLIILILLLLAAAAGAGYSWWLSK 42 (162)
T ss_pred cchhHHHHHHHHHHHHHHHHHHHHhhc
Confidence 455666655555555556666666543
No 55
>PF12575 DUF3753: Protein of unknown function (DUF3753); InterPro: IPR009175 This group represents an uncharacterised conserved protein belonging to poxvirus family I2.
Probab=79.23 E-value=3.4 Score=25.44 Aligned_cols=22 Identities=36% Similarity=0.676 Sum_probs=12.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH
Q psy6549 84 LFLIMIIVIESMTLSVMLYLKI 105 (109)
Q Consensus 84 ~il~~ii~~l~l~i~~viy~k~ 105 (109)
++++++++++++++.+.+|+|.
T Consensus 50 ~~ii~ii~v~ii~~l~flYLK~ 71 (72)
T PF12575_consen 50 ILIISIIFVLIIVLLTFLYLKL 71 (72)
T ss_pred HHHHHHHHHHHHHHHHHHHhcc
Confidence 3444555555555555677663
No 56
>PF10779 XhlA: Haemolysin XhlA; InterPro: IPR019715 Haemolysin XhlA is a cell-surface associated haemolysin that lyses the two most prevalent types of insect immune cells (granulocytes and plasmatocytes) as well as rabbit and horse erythrocytes [].
Probab=79.17 E-value=11 Score=22.58 Aligned_cols=21 Identities=14% Similarity=-0.097 Sum_probs=12.9
Q ss_pred HHhhHHHHHHHHHHHHHHHHH
Q psy6549 79 VFYNKLFLIMIIVIESMTLSV 99 (109)
Q Consensus 79 ~~~~K~il~~ii~~l~l~i~~ 99 (109)
.-..||+...++..++.++++
T Consensus 47 ~~n~kW~~r~iiGaiI~~i~~ 67 (71)
T PF10779_consen 47 KSNTKWIWRTIIGAIITAIIY 67 (71)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 455678877777666544433
No 57
>PF05568 ASFV_J13L: African swine fever virus J13L protein; InterPro: IPR008385 This family consists of several African swine fever virus (ASFV) j13L proteins [, , ].
Probab=78.93 E-value=3.6 Score=29.00 Aligned_cols=22 Identities=14% Similarity=0.196 Sum_probs=8.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHh
Q psy6549 84 LFLIMIIVIESMTLSVMLYLKIF 106 (109)
Q Consensus 84 ~il~~ii~~l~l~i~~viy~k~~ 106 (109)
+++.+.|++++++|+ |+.++|-
T Consensus 32 ~tILiaIvVliiiii-vli~lcs 53 (189)
T PF05568_consen 32 YTILIAIVVLIIIII-VLIYLCS 53 (189)
T ss_pred HHHHHHHHHHHHHHH-HHHHHHh
Confidence 333344444433333 3333443
No 58
>PHA02975 hypothetical protein; Provisional
Probab=78.72 E-value=6.7 Score=23.92 Aligned_cols=23 Identities=35% Similarity=0.759 Sum_probs=14.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHh
Q psy6549 84 LFLIMIIVIESMTLSVMLYLKIF 106 (109)
Q Consensus 84 ~il~~ii~~l~l~i~~viy~k~~ 106 (109)
++.++++.++++++...+|+|..
T Consensus 46 ~~ii~i~~v~~~~~~~flYLK~~ 68 (69)
T PHA02975 46 ILIIFIIFITCIAVFTFLYLKLM 68 (69)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhh
Confidence 33444555666666777888764
No 59
>PF14937 DUF4500: Domain of unknown function (DUF4500)
Probab=78.24 E-value=3.5 Score=26.24 Aligned_cols=37 Identities=11% Similarity=0.211 Sum_probs=27.2
Q ss_pred HHHHHHHHH--HhhHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy6549 71 MIQAIRRRV--FYNKLFLIMIIVIESMTLSVMLYLKIFK 107 (109)
Q Consensus 71 ~l~~m~rr~--~~~K~il~~ii~~l~l~i~~viy~k~~~ 107 (109)
+.+.++-.. .-||.||.+.++.+.++++++.|+...+
T Consensus 23 lFRavNfELy~kPNk~iM~~Gl~a~~~c~gYi~Ym~~~~ 61 (86)
T PF14937_consen 23 LFRAVNFELYVKPNKPIMAFGLIAITLCVGYIAYMHATY 61 (86)
T ss_pred hhhhhCHHHhccCCchhhHHHHHHHHHHHHHHHHHHHHH
Confidence 344444443 2389999999999999999999987653
No 60
>KOG3433|consensus
Probab=78.13 E-value=24 Score=25.73 Aligned_cols=71 Identities=17% Similarity=0.121 Sum_probs=42.4
Q ss_pred HHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HhhHHHHHHhhhHhhhHHHHHHHHHHHHHHHHH
Q psy6549 8 TLRETNQALVRTGESIVRSEQAALEAEAIGTGVISELG-------VQREALVRARDRLVDTDLHVSRSRRMIQAIRRR 78 (109)
Q Consensus 8 ~ll~~~~~l~~t~~~L~~s~~~~~ete~iG~~il~~L~-------~Qre~L~~~~~~~~~i~~~l~~a~~~l~~m~rr 78 (109)
+|..++++...-.++++++++.-.+||+...+.-..|. .+|..+...+..=..+-......++.+..-+.|
T Consensus 89 ~L~~~~qk~~tl~e~~en~K~~~e~tEer~~el~kklnslkk~~e~lr~el~k~~e~dpqv~~k~~~~~K~~~eaanr 166 (203)
T KOG3433|consen 89 QLATGSQKKATLGESIENRKAGREETEERTDELTKKLNSLKKILESLRWELAKIQETDPQVFEKKVHLEKTMAEAANR 166 (203)
T ss_pred HHHHhhhhHhHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHHHHHHhh
Confidence 45556666667778999999999999999966654443 444444444333333333444444554444433
No 61
>PF11694 DUF3290: Protein of unknown function (DUF3290); InterPro: IPR021707 This family of proteins with unknown function appears to be restricted to Firmicutes.
Probab=77.91 E-value=6.5 Score=27.35 Aligned_cols=32 Identities=13% Similarity=0.180 Sum_probs=18.9
Q ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcc
Q psy6549 77 RRVFYNKLFLIMIIVIESMTLSVMLYLKIFKK 108 (109)
Q Consensus 77 rr~~~~K~il~~ii~~l~l~i~~viy~k~~~~ 108 (109)
.+.-.+-++.+++++++++++++.++.-+.+|
T Consensus 10 ~q~~~~~~~~~~~i~~ll~~l~~~~~~Y~r~r 41 (149)
T PF11694_consen 10 SQQSQNDYLRYILIIILLLVLIFFFIKYLRNR 41 (149)
T ss_pred HhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 34445667777777777665555555444443
No 62
>PF15102 TMEM154: TMEM154 protein family
Probab=77.49 E-value=1.1 Score=31.22 Aligned_cols=7 Identities=14% Similarity=0.757 Sum_probs=2.6
Q ss_pred HHHHHHH
Q psy6549 97 LSVMLYL 103 (109)
Q Consensus 97 i~~viy~ 103 (109)
+++|+|+
T Consensus 76 V~lv~~~ 82 (146)
T PF15102_consen 76 VCLVIYY 82 (146)
T ss_pred HHheeEE
Confidence 3333333
No 63
>KOG1666|consensus
Probab=77.34 E-value=27 Score=25.93 Aligned_cols=61 Identities=16% Similarity=0.275 Sum_probs=33.2
Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-HHHHHHhhhHhhhHHHHHHHHHHH
Q psy6549 11 ETNQALVRTGESIVRSEQAALEAEAIGTGVISELGVQR-EALVRARDRLVDTDLHVSRSRRMI 72 (109)
Q Consensus 11 ~~~~~l~~t~~~L~~s~~~~~ete~iG~~il~~L~~Qr-e~L~~~~~~~~~i~~~l~~a~~~l 72 (109)
+...+|-+++++|++|-..+.++-.++.++ +.++.+= +-|.+=++.+....+.|..++..+
T Consensus 118 dQR~rLl~nTerLeRst~rl~ds~Ria~ET-EqIG~~IL~dL~~QRe~L~rar~rL~~td~~l 179 (220)
T KOG1666|consen 118 DQRARLLQNTERLERSTDRLKDSQRIALET-EQIGSEILEDLHGQREQLERARERLRETDANL 179 (220)
T ss_pred hHHHHHHhhhHHHHHhHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhchhhh
Confidence 344566778888888888888877777763 2222221 333333444444444444444333
No 64
>PF09753 Use1: Membrane fusion protein Use1; InterPro: IPR019150 This entry represents a family of proteins, approximately 300 residues in length, involved in vesicle transport. They have a single C-terminal transmembrane domain and a SNARE [soluble NSF (N-ethylmaleimide-sensitive fusion protein) attachment protein receptor] domain of approximately 60 residues. The SNARE domains are essential for membrane fusion and are conserved from yeasts to humans. Use1 is one of the three protein subunits that make up the SNARE complex and it is specifically required for Golgi-endoplasmic reticulum retrograde transport [].
Probab=76.94 E-value=27 Score=25.81 Aligned_cols=61 Identities=13% Similarity=0.153 Sum_probs=39.1
Q ss_pred HHHHhh----HHHHHHhhhHhhhHHHHHHHHHHHHHHHHHH--HhhHHHHHHHHHHHHHHHHHHHHH
Q psy6549 43 ELGVQR----EALVRARDRLVDTDLHVSRSRRMIQAIRRRV--FYNKLFLIMIIVIESMTLSVMLYL 103 (109)
Q Consensus 43 ~L~~Qr----e~L~~~~~~~~~i~~~l~~a~~~l~~m~rr~--~~~K~il~~ii~~l~l~i~~viy~ 103 (109)
.|.++- ..|..-..-++.+...++..-.-++.-..|+ ...|..-|+..+++++++++.++.
T Consensus 178 ~LK~~s~~~~~~l~~D~~~L~~~~~~~d~n~~~l~~~~~rl~~~~~~~~~~~~~~~i~~v~~~Fi~m 244 (251)
T PF09753_consen 178 QLKENSLAFSQILKEDNKVLDRTEEGLDRNLSSLKRESKRLKEHSSKSWGCWTWLMIFVVIIVFIMM 244 (251)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHHHHHHHHHHHHHH
Confidence 455554 5666677777788888888887888877775 445566666666555444433333
No 65
>PHA02819 hypothetical protein; Provisional
Probab=76.36 E-value=5.9 Score=24.28 Aligned_cols=19 Identities=26% Similarity=0.634 Sum_probs=11.1
Q ss_pred HHHHHHHHHHHHHHHHHhc
Q psy6549 89 IIVIESMTLSVMLYLKIFK 107 (109)
Q Consensus 89 ii~~l~l~i~~viy~k~~~ 107 (109)
++.++++++...+|+|..|
T Consensus 53 l~~~~~~~~~~flYLK~~~ 71 (71)
T PHA02819 53 LVTIVFVIIFIIFYLKVIK 71 (71)
T ss_pred HHHHHHHHHHHHHHHHhcC
Confidence 4444555566667777643
No 66
>PF03302 VSP: Giardia variant-specific surface protein; InterPro: IPR005127 During infection, the intestinal protozoan parasite Giardia lamblia virus undergoes continuous antigenic variation which is determined by diversification of the parasite's major surface antigen, named VSP (variant surface protein).
Probab=76.19 E-value=1.4 Score=35.17 Aligned_cols=23 Identities=9% Similarity=0.291 Sum_probs=16.5
Q ss_pred HHHHHHHHHHHHHHHHHHHhccC
Q psy6549 87 IMIIVIESMTLSVMLYLKIFKKR 109 (109)
Q Consensus 87 ~~ii~~l~l~i~~viy~k~~~~~ 109 (109)
+++|+|+..+++|+.||.+.|.|
T Consensus 374 vavvvvVgglvGfLcWwf~crgk 396 (397)
T PF03302_consen 374 VAVVVVVGGLVGFLCWWFICRGK 396 (397)
T ss_pred ehhHHHHHHHHHHHhhheeeccc
Confidence 45566666677888888887876
No 67
>PF10661 EssA: WXG100 protein secretion system (Wss), protein EssA; InterPro: IPR018920 The Wss (WXG100 protein secretion system) in Staphylococcus aureus seems to be encoded by a locus of eight ORFs, called ess (eSAT-6 secretion system) []. This locus encodes, amongst several other proteins, EssA, a protein predicted to possess one transmembrane domain. Due to its predicted membrane location and its absolute requirement for WXG100 protein secretion, it has been speculated that EssA could form a secretion apparatus in conjunction with YukC and YukAB. Proteins homologous to EssA, YukC, EsaA and YukD were absent from mycobacteria []. Members of this family are associated with type VII secretion of WXG100 family targets in the Firmicutes, but not in the Actinobacteria. This highly divergent protein family consists largely of a central region of highly polar low-complexity sequence containing occasional LF motifs in weak repeats about 17 residues in length, flanked by hydrophobic N- and C-terminal regions.
Probab=75.97 E-value=4.6 Score=27.97 Aligned_cols=21 Identities=10% Similarity=0.126 Sum_probs=11.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHh
Q psy6549 86 LIMIIVIESMTLSVMLYLKIF 106 (109)
Q Consensus 86 l~~ii~~l~l~i~~viy~k~~ 106 (109)
++++|+..+++|++++|..+.
T Consensus 121 i~~~i~g~ll~i~~giy~~~r 141 (145)
T PF10661_consen 121 ILLSIGGILLAICGGIYVVLR 141 (145)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 333344444667777776654
No 68
>PF06024 DUF912: Nucleopolyhedrovirus protein of unknown function (DUF912); InterPro: IPR009261 This entry is represented by Autographa californica nuclear polyhedrosis virus (AcMNPV), Orf78; it is a family of uncharacterised viral proteins.
Probab=75.58 E-value=1.2 Score=28.89 Aligned_cols=23 Identities=4% Similarity=0.307 Sum_probs=10.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHh
Q psy6549 84 LFLIMIIVIESMTLSVMLYLKIF 106 (109)
Q Consensus 84 ~il~~ii~~l~l~i~~viy~k~~ 106 (109)
++++++.++.++++++++||.+.
T Consensus 64 ili~lls~v~IlVily~IyYFVI 86 (101)
T PF06024_consen 64 ILISLLSFVCILVILYAIYYFVI 86 (101)
T ss_pred hHHHHHHHHHHHHHHhhheEEEE
Confidence 33333334444444555555443
No 69
>PF05961 Chordopox_A13L: Chordopoxvirus A13L protein; InterPro: IPR009236 This family consists of A13L proteins from the Chordopoxviruses. A13L or p8 is one of the three most abundant membrane proteins of the intracellular mature Vaccinia virus [].
Probab=74.57 E-value=5.3 Score=24.29 Aligned_cols=21 Identities=24% Similarity=0.257 Sum_probs=10.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q psy6549 84 LFLIMIIVIESMTLSVMLYLK 104 (109)
Q Consensus 84 ~il~~ii~~l~l~i~~viy~k 104 (109)
+++.+|+++.+++|++-+|.+
T Consensus 5 ~iLi~ICVaii~lIlY~iYnr 25 (68)
T PF05961_consen 5 FILIIICVAIIGLILYGIYNR 25 (68)
T ss_pred HHHHHHHHHHHHHHHHHHHhc
Confidence 455555555544444445443
No 70
>COG4499 Predicted membrane protein [Function unknown]
Probab=74.40 E-value=4.5 Score=32.58 Aligned_cols=30 Identities=20% Similarity=0.206 Sum_probs=24.1
Q ss_pred HHHhhHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy6549 78 RVFYNKLFLIMIIVIESMTLSVMLYLKIFK 107 (109)
Q Consensus 78 r~~~~K~il~~ii~~l~l~i~~viy~k~~~ 107 (109)
+--..||+-+|.|++++++++++.|+-|+.
T Consensus 216 k~~ifk~~giGliillvl~li~~~Y~~f~~ 245 (434)
T COG4499 216 KYTIFKYFGIGLIILLVLLLIYFTYYYFSN 245 (434)
T ss_pred cceehhhHHHhHHHHHHHHHHHHHHHHHHc
Confidence 345678888888899999999998888763
No 71
>KOG1656|consensus
Probab=74.00 E-value=6.9 Score=28.88 Aligned_cols=60 Identities=20% Similarity=0.258 Sum_probs=42.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHHHHHHHHhhHH
Q psy6549 23 IVRSEQAALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSRRMIQAIRRRVFYNKL 84 (109)
Q Consensus 23 L~~s~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~m~rr~~~~K~ 84 (109)
|.+=++.=.+..+|+ +++.+++.||+.|++++-+.. +-..++.+-+-++.+.+.+=.||+
T Consensus 67 LkrKK~~E~qL~qid-G~l~tie~Qr~alEnA~~n~E-vl~~m~~~A~AmK~~h~~mDiDkV 126 (221)
T KOG1656|consen 67 LKRKKRYEKQLAQID-GTLSTIEFQREALENANTNTE-VLDAMGSAAKAMKAAHKNMDIDKV 126 (221)
T ss_pred HHHHHHHHHHHHHHh-hHHHHHHHHHHHHHcccccHH-HHHHHHHHHHHHHHHHhccChhHH
Confidence 444444444555554 578999999999999877654 555677777788888777766665
No 72
>PF05478 Prominin: Prominin; InterPro: IPR008795 The prominins are an emerging family of proteins that, among the multispan membrane proteins, display a novel topology. Mouse and Homo sapiens prominin and (Mus musculus) prominin-like 1 (PROML1) are predicted to contain five membrane spanning domains, with an N-terminal domain exposed to the extracellular space followed by four, alternating small cytoplasmic and large extracellular, loops and a cytoplasmic C-terminal domain []. The exact function of prominin is unknown although in humans defects in PROM1, the gene coding for prominin, cause retinal degeneration [].; GO: 0016021 integral to membrane
Probab=73.79 E-value=61 Score=28.26 Aligned_cols=87 Identities=9% Similarity=0.081 Sum_probs=46.4
Q ss_pred HHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhh--------HHHHHHHHHHHHHHHHHHHhhHHHHH
Q psy6549 17 VRTGESIVRS-EQAALEAEAIGTGVISELGVQREALVRARDRLVDT--------DLHVSRSRRMIQAIRRRVFYNKLFLI 87 (109)
Q Consensus 17 ~~t~~~L~~s-~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i--------~~~l~~a~~~l~~m~rr~~~~K~il~ 87 (109)
.++.+.+++. ..+.++|.++-..+-..|...++++...-+.+... ..+..++-+-...-..+.-..+|+..
T Consensus 338 ~~~~~~~~~ip~~v~~qt~~~v~~ik~~l~~~~~~i~~~a~~i~~~~~~~~s~~~~~~~~~~~~~~~~~~~y~~yR~~~~ 417 (806)
T PF05478_consen 338 QEGNSRFNDIPEKVQNQTSDVVPPIKRDLDSIGKQIRSQAKQIPNQIDSNISDILNNTERSSRSFEDEYEKYDSYRWIVG 417 (806)
T ss_pred HHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHHHH
Confidence 3444444443 33445666666677777777777776655554442 22222222222234456667777776
Q ss_pred HHHHHHHHHHHHHHHH
Q psy6549 88 MIIVIESMTLSVMLYL 103 (109)
Q Consensus 88 ~ii~~l~l~i~~viy~ 103 (109)
.++..++++|++++++
T Consensus 418 lil~~~llLIv~~~~l 433 (806)
T PF05478_consen 418 LILCCVLLLIVLCLLL 433 (806)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 6666555555555444
No 73
>PF03904 DUF334: Domain of unknown function (DUF334); InterPro: IPR005602 This is a family of proteins found in Staphylococcus aureus plasmid with no characterised function.
Probab=73.64 E-value=35 Score=25.50 Aligned_cols=23 Identities=13% Similarity=0.112 Sum_probs=13.6
Q ss_pred HHHHHHHHHHHHHHHhhHHHHHH
Q psy6549 32 EAEAIGTGVISELGVQREALVRA 54 (109)
Q Consensus 32 ete~iG~~il~~L~~Qre~L~~~ 54 (109)
+|+++-..+..+...+|+-+.+.
T Consensus 110 ~tde~k~~~~~ei~k~r~e~~~m 132 (230)
T PF03904_consen 110 DTDELKNIAQNEIKKVREENKSM 132 (230)
T ss_pred chHHHHHHHHHHHHHHHHHHHHH
Confidence 45556666666666666555555
No 74
>PF02009 Rifin_STEVOR: Rifin/stevor family; InterPro: IPR002858 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see []. Several multicopy gene families have been described in Plasmodium falciparum, including the stevor family of subtelomeric open reading frames and the rif interspersed repetitive elements. Both families contain three predicted transmembrane segments. It has been proposed that stevor and rif are members of a larger superfamily that code for variant surface antigens [].
Probab=72.80 E-value=5.1 Score=31.00 Aligned_cols=9 Identities=33% Similarity=0.837 Sum_probs=3.5
Q ss_pred HHHHHHHHH
Q psy6549 97 LSVMLYLKI 105 (109)
Q Consensus 97 i~~viy~k~ 105 (109)
|.+|||+-|
T Consensus 271 IMvIIYLIL 279 (299)
T PF02009_consen 271 IMVIIYLIL 279 (299)
T ss_pred HHHHHHHHH
Confidence 333344433
No 75
>cd07912 Tweety_N N-terminal domain of the protein encoded by the Drosophila tweety gene and related proteins, a family of chloride ion channels. The protein product of the Drosophila tweety (tty) gene is thought to form a trans-membrane protein with five membrane-spanning regions and a cytoplasmic C-terminus. This N-terminal domain contains the putative transmembrane spanning regions. Tweety has been suggested as a candidate for a large conductance chloride channel, both in vertebrate and insect cells. Three human homologs have been identified and designated TTYH1-3. TTYH2 has been associated with the progression of cancer, and Drosophila melanogaster tweety has been assumed to play a role in development. TTYH2, and TTYH3 bind to and are ubiquinated by Nedd4-2, a HECT type E3 ubiquitin ligase, which most likely plays a role in controlling the cellular levels of tweety family proteins.
Probab=70.54 E-value=56 Score=26.49 Aligned_cols=28 Identities=4% Similarity=0.098 Sum_probs=16.6
Q ss_pred HhhHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy6549 80 FYNKLFLIMIIVIESMTLSVMLYLKIFK 107 (109)
Q Consensus 80 ~~~K~il~~ii~~l~l~i~~viy~k~~~ 107 (109)
-.-||..+.+++++.++++++..+.+-|
T Consensus 205 E~~Rw~~~l~lL~~~lviC~~~l~gl~r 232 (418)
T cd07912 205 ESYRWLAYLGLLSLLLVICLVLLVGLAR 232 (418)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3456666666666666666665555443
No 76
>PRK14762 membrane protein; Provisional
Probab=70.35 E-value=5.8 Score=19.54 Aligned_cols=16 Identities=25% Similarity=0.393 Sum_probs=8.8
Q ss_pred HHHHHHHHHHHHHHHH
Q psy6549 83 KLFLIMIIVIESMTLS 98 (109)
Q Consensus 83 K~il~~ii~~l~l~i~ 98 (109)
|+++|.+.+++++...
T Consensus 2 ki~lw~i~iifligll 17 (27)
T PRK14762 2 KIILWAVLIIFLIGLL 17 (27)
T ss_pred eeHHHHHHHHHHHHHH
Confidence 4567766665544333
No 77
>PHA02692 hypothetical protein; Provisional
Probab=70.26 E-value=8.8 Score=23.49 Aligned_cols=17 Identities=29% Similarity=0.397 Sum_probs=10.4
Q ss_pred HHHHHHHHHHHHHHHHH
Q psy6549 89 IIVIESMTLSVMLYLKI 105 (109)
Q Consensus 89 ii~~l~l~i~~viy~k~ 105 (109)
+++++++++...+|+|.
T Consensus 53 ~~~~~~~vll~flYLK~ 69 (70)
T PHA02692 53 LIAAAIGVLLCFHYLKL 69 (70)
T ss_pred HHHHHHHHHHHHHHHhc
Confidence 44455566666677774
No 78
>KOG0811|consensus
Probab=70.21 E-value=20 Score=27.37 Aligned_cols=17 Identities=6% Similarity=0.094 Sum_probs=8.9
Q ss_pred hHhhhHHHHHHHHHHHH
Q psy6549 57 RLVDTDLHVSRSRRMIQ 73 (109)
Q Consensus 57 ~~~~i~~~l~~a~~~l~ 73 (109)
.||.|+++++.|.--+.
T Consensus 209 ~VDsIe~nve~a~~nve 225 (269)
T KOG0811|consen 209 LVDSIEANVENASVNVE 225 (269)
T ss_pred HHhHHHHHHHHHHHHHH
Confidence 45555555555554443
No 79
>PF12877 DUF3827: Domain of unknown function (DUF3827); InterPro: IPR024606 The function of the proteins in this entry is not currently known, but one of the human proteins (Q9HCM3 from SWISSPROT) has been implicated in pilocytic astrocytomas [, , ]. In the majority of cases of pilocytic astrocytomas a tandem duplication produces an in-frame fusion of the gene encoding this protein and the BRAF oncogene. The resulting fusion protein has constitutive BRAF kinase activity and is capable of transforming cells.
Probab=70.12 E-value=2.2 Score=36.27 Aligned_cols=29 Identities=21% Similarity=0.448 Sum_probs=18.2
Q ss_pred hhHHHHHHHHH--HHHHHHHHHHHHHHhccC
Q psy6549 81 YNKLFLIMIIV--IESMTLSVMLYLKIFKKR 109 (109)
Q Consensus 81 ~~K~il~~ii~--~l~l~i~~viy~k~~~~~ 109 (109)
.|=||++++++ +++++|++++|+++-||+
T Consensus 267 ~NlWII~gVlvPv~vV~~Iiiil~~~LCRk~ 297 (684)
T PF12877_consen 267 NNLWIIAGVLVPVLVVLLIIIILYWKLCRKN 297 (684)
T ss_pred CCeEEEehHhHHHHHHHHHHHHHHHHHhccc
Confidence 45566665543 445666677888887653
No 80
>KOG0860|consensus
Probab=68.74 E-value=32 Score=23.05 Aligned_cols=80 Identities=14% Similarity=0.279 Sum_probs=37.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHH---HHHHHHHHHHH----HHHHHHhhHHHHHHHHHHHHHHHHH
Q psy6549 27 EQAALEAEAIGTGVISELGVQREALVRARDRLVDTDL---HVSRSRRMIQA----IRRRVFYNKLFLIMIIVIESMTLSV 99 (109)
Q Consensus 27 ~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~---~l~~a~~~l~~----m~rr~~~~K~il~~ii~~l~l~i~~ 99 (109)
.+-+..|.+--.++..=++..=++...-..++++.++ .|..+....+. +.|.++=..+=+..++++.++++++
T Consensus 28 ~~k~~~tq~QvdeVv~IMr~NV~KVlER~ekL~~L~drad~L~~~as~F~~~A~klkrk~wWkn~Km~~il~~v~~i~l~ 107 (116)
T KOG0860|consen 28 NDKLQQTQAQVDEVVDIMRENVEKVLERGEKLDELDDRADQLQAGASQFEKTAVKLKRKMWWKNCKMRIILGLVIIILLV 107 (116)
T ss_pred hHHHHHHHHHHHHHHHHHHHhHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444555555555555555555555555556655544 33334333332 3344433333344444444444444
Q ss_pred HHHHHHh
Q psy6549 100 MLYLKIF 106 (109)
Q Consensus 100 viy~k~~ 106 (109)
+++..|+
T Consensus 108 iiii~~~ 114 (116)
T KOG0860|consen 108 VIIIYIF 114 (116)
T ss_pred HHHHHHh
Confidence 4444444
No 81
>PF11395 DUF2873: Protein of unknown function (DUF2873); InterPro: IPR021532 This entry is represented by the human SARS coronavirus, Orf7b; it is a family of uncharacterised viral proteins.
Probab=68.67 E-value=14 Score=19.93 Aligned_cols=20 Identities=5% Similarity=0.212 Sum_probs=7.7
Q ss_pred HHHHHHHHHHHHHHHHHHHH
Q psy6549 86 LIMIIVIESMTLSVMLYLKI 105 (109)
Q Consensus 86 l~~ii~~l~l~i~~viy~k~ 105 (109)
+++..+++.+.++..++++|
T Consensus 11 lc~l~~llflv~imliif~f 30 (43)
T PF11395_consen 11 LCFLSFLLFLVIIMLIIFWF 30 (43)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 34333333333333333333
No 82
>PF02009 Rifin_STEVOR: Rifin/stevor family; InterPro: IPR002858 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see []. Several multicopy gene families have been described in Plasmodium falciparum, including the stevor family of subtelomeric open reading frames and the rif interspersed repetitive elements. Both families contain three predicted transmembrane segments. It has been proposed that stevor and rif are members of a larger superfamily that code for variant surface antigens [].
Probab=68.53 E-value=6.6 Score=30.38 Aligned_cols=22 Identities=23% Similarity=0.428 Sum_probs=10.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH
Q psy6549 84 LFLIMIIVIESMTLSVMLYLKI 105 (109)
Q Consensus 84 ~il~~ii~~l~l~i~~viy~k~ 105 (109)
++.+.+|++.+++|.+|+-|+-
T Consensus 262 iiaIliIVLIMvIIYLILRYRR 283 (299)
T PF02009_consen 262 IIAILIIVLIMVIIYLILRYRR 283 (299)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 3444444444444555554544
No 83
>PHA03164 hypothetical protein; Provisional
Probab=68.15 E-value=8.7 Score=24.07 Aligned_cols=22 Identities=18% Similarity=0.226 Sum_probs=14.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH
Q psy6549 83 KLFLIMIIVIESMTLSVMLYLK 104 (109)
Q Consensus 83 K~il~~ii~~l~l~i~~viy~k 104 (109)
-+++.+.++..++.|++|+|..
T Consensus 60 FlvLtgLaIamILfiifvlyvF 81 (88)
T PHA03164 60 FLVLTGLAIAMILFIIFVLYVF 81 (88)
T ss_pred hHHHHHHHHHHHHHHHHHHHhe
Confidence 3556666666666777777653
No 84
>PRK10787 DNA-binding ATP-dependent protease La; Provisional
Probab=67.86 E-value=38 Score=29.52 Aligned_cols=72 Identities=10% Similarity=0.070 Sum_probs=53.6
Q ss_pred CchHHHHHHHhhh---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh------hHHHHHHhhhHhhhHHHHHHHHHH
Q psy6549 1 MEKIHRETLRETN---QALVRTGESIVRSEQAALEAEAIGTGVISELGVQ------REALVRARDRLVDTDLHVSRSRRM 71 (109)
Q Consensus 1 ~~~~~R~~ll~~~---~~l~~t~~~L~~s~~~~~ete~iG~~il~~L~~Q------re~L~~~~~~~~~i~~~l~~a~~~ 71 (109)
++.++|++||... ++++.-...|.+-...+.=-.+|...+-+++.+| ||||+-+++.+.+.++.-++..++
T Consensus 175 l~~~eKQ~LLE~~d~~eRLe~Ll~lL~~Eleil~l~~~I~~~v~~~~~k~q~e~~lreq~~~i~~elg~~~~~~~~~~~~ 254 (784)
T PRK10787 175 LKLADKQSVLEMSDVNERLEYLMAMMESEIDLLQVEKRIRNRVKKQMEKSQREYYLNEQMKAIQKELGEMDDAPDENEAL 254 (784)
T ss_pred CCHHHHHHHhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhcchhhhhhhcccccCCCcchhHHHHH
Confidence 4678999999876 5677777777777777777778888888888876 689999999998765543344433
Q ss_pred H
Q psy6549 72 I 72 (109)
Q Consensus 72 l 72 (109)
-
T Consensus 255 ~ 255 (784)
T PRK10787 255 K 255 (784)
T ss_pred H
Confidence 3
No 85
>PF05961 Chordopox_A13L: Chordopoxvirus A13L protein; InterPro: IPR009236 This family consists of A13L proteins from the Chordopoxviruses. A13L or p8 is one of the three most abundant membrane proteins of the intracellular mature Vaccinia virus [].
Probab=67.73 E-value=9.1 Score=23.26 Aligned_cols=20 Identities=15% Similarity=0.454 Sum_probs=9.4
Q ss_pred HHHHHHHHHHHHHHHHHHhc
Q psy6549 88 MIIVIESMTLSVMLYLKIFK 107 (109)
Q Consensus 88 ~ii~~l~l~i~~viy~k~~~ 107 (109)
+++++.|++|+.++.|-+..
T Consensus 5 ~iLi~ICVaii~lIlY~iYn 24 (68)
T PF05961_consen 5 FILIIICVAIIGLILYGIYN 24 (68)
T ss_pred HHHHHHHHHHHHHHHHHHHh
Confidence 44444445554444444443
No 86
>PRK08455 fliL flagellar basal body-associated protein FliL; Reviewed
Probab=67.55 E-value=5.9 Score=28.33 Aligned_cols=19 Identities=11% Similarity=0.347 Sum_probs=7.8
Q ss_pred HHHHHHHHHHHHHHHHHHHh
Q psy6549 87 IMIIVIESMTLSVMLYLKIF 106 (109)
Q Consensus 87 ~~ii~~l~l~i~~viy~k~~ 106 (109)
++++++++++.+ +++|.|+
T Consensus 24 i~~~llll~~~G-~~~~~~~ 42 (182)
T PRK08455 24 IGVVVLLLLIVG-VIAMLLM 42 (182)
T ss_pred HHHHHHHHHHHH-HHHHHHh
Confidence 333444433334 4444443
No 87
>PF10883 DUF2681: Protein of unknown function (DUF2681); InterPro: IPR020274 This entry contains membrane proteins with no known function.
Probab=67.23 E-value=11 Score=24.09 Aligned_cols=18 Identities=11% Similarity=0.014 Sum_probs=7.9
Q ss_pred HHHHHHHHHHHHHHHHHh
Q psy6549 89 IIVIESMTLSVMLYLKIF 106 (109)
Q Consensus 89 ii~~l~l~i~~viy~k~~ 106 (109)
+++.+++++++.+||+..
T Consensus 9 ~~~~v~~~i~~y~~~k~~ 26 (87)
T PF10883_consen 9 GVGAVVALILAYLWWKVK 26 (87)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 333333444445555543
No 88
>TIGR02161 napC_nirT periplasmic nitrate (or nitrite) reductase c-type cytochrome, NapC/NirT family. Nearly every member of this subfamily is NapC, a predicted membrane-anchored four-heme c-type cytochrome that forms one component of the periplasmic nitrate reductase along with NapA, NapB, NapD, NapE, and NapF subunits. A single known exception at this time is NirT, which is instead a component of a nitrite reductase. This family excludes TorC subunits of trimethylamine N-oxide (TMAO) reductases.
Probab=66.75 E-value=17 Score=26.15 Aligned_cols=8 Identities=0% Similarity=0.102 Sum_probs=3.6
Q ss_pred HHHHHHHH
Q psy6549 96 TLSVMLYL 103 (109)
Q Consensus 96 ~i~~viy~ 103 (109)
++++++|.
T Consensus 26 ~~G~~~~~ 33 (185)
T TIGR02161 26 VGGIVFWG 33 (185)
T ss_pred HHHHHHHH
Confidence 44444444
No 89
>PHA03049 IMV membrane protein; Provisional
Probab=65.32 E-value=11 Score=22.83 Aligned_cols=19 Identities=21% Similarity=0.375 Sum_probs=7.8
Q ss_pred HHHHHHHHHHHHHHHHHHH
Q psy6549 84 LFLIMIIVIESMTLSVMLY 102 (109)
Q Consensus 84 ~il~~ii~~l~l~i~~viy 102 (109)
+++.+|+++.+++|++-+|
T Consensus 5 ~~l~iICVaIi~lIvYgiY 23 (68)
T PHA03049 5 IILVIICVVIIGLIVYGIY 23 (68)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 3444444444333433343
No 90
>PF10717 ODV-E18: Occlusion-derived virus envelope protein ODV-E18; InterPro: IPR019655 Baculovirus occlusion-derived virus (ODV) derives its envelope from an intranuclear membrane source. Occlusion-derived viral envelope proteins that are detected in viral-induced intranuclear microvesicles, but not detected in the plasma membrane, cytoplasmic membranes, or the nuclear envelope. This entry represents ODV-E18 protein which is encoded by baculovirus late genes with transcription initiating from a TAAG motif. ODV-E18 exists as a dimer in the ODV envelope, which contains a hydrophobic domain that putatively acts as a target or retention signal for intranuclear microvesicles []. ; GO: 0019031 viral envelope
Probab=65.13 E-value=10 Score=24.02 Aligned_cols=18 Identities=6% Similarity=0.451 Sum_probs=9.0
Q ss_pred HHHHHHHHHHHHHHHHHH
Q psy6549 87 IMIIVIESMTLSVMLYLK 104 (109)
Q Consensus 87 ~~ii~~l~l~i~~viy~k 104 (109)
.-|.++|+|+|.+|+++.
T Consensus 28 MtILivLVIIiLlImlfq 45 (85)
T PF10717_consen 28 MTILIVLVIIILLIMLFQ 45 (85)
T ss_pred HHHHHHHHHHHHHHHHHh
Confidence 334444555555555543
No 91
>PHA03054 IMV membrane protein; Provisional
Probab=65.02 E-value=14 Score=22.68 Aligned_cols=17 Identities=29% Similarity=0.628 Sum_probs=8.6
Q ss_pred HHHHHHHHHHHHHHHHH
Q psy6549 89 IIVIESMTLSVMLYLKI 105 (109)
Q Consensus 89 ii~~l~l~i~~viy~k~ 105 (109)
++.++++++...+|+|.
T Consensus 55 l~~v~~~~l~~flYLK~ 71 (72)
T PHA03054 55 FFIVLILLLLIYLYLKV 71 (72)
T ss_pred HHHHHHHHHHHHHHHhc
Confidence 33344445555566663
No 92
>COG5074 t-SNARE complex subunit, syntaxin [Intracellular trafficking and secretion]
Probab=64.75 E-value=60 Score=24.68 Aligned_cols=51 Identities=14% Similarity=0.118 Sum_probs=25.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHH
Q psy6549 16 LVRTGESIVRSEQAALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSR 69 (109)
Q Consensus 16 l~~t~~~L~~s~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~ 69 (109)
+.+-++-+++...++.|-++.-..|-.++..=++.. ...+.++|..+..|+
T Consensus 194 ~ael~qLfndm~~~V~eq~e~Vd~I~~~~~~~~~n~---~~g~~h~d~AvksaR 244 (280)
T COG5074 194 MAELTQLFNDMEELVIEQQENVDVIDKNVEDAQENV---EQGVGHTDKAVKSAR 244 (280)
T ss_pred HHHHHHHHHHHHHHHHhhcchHHHHHhhHhhHHhhH---HHhhhhHHHHHHHHH
Confidence 334455556666666666666666555555443332 233444444444433
No 93
>PF04999 FtsL: Cell division protein FtsL; InterPro: IPR007082 In Escherichia coli, nine gene products are known to be essential for assembly of the division septum. One of these, FtsL, is a bitopic membrane protein whose precise function is not understood. It has been proposed that FtsL interacts with the DivIC protein IPR007060 from INTERPRO [], however this interaction may be indirect [].; GO: 0007049 cell cycle, 0016021 integral to membrane
Probab=64.36 E-value=26 Score=21.91 Aligned_cols=19 Identities=32% Similarity=0.564 Sum_probs=7.7
Q ss_pred HHHHHHhh-HHHHHHHHHHH
Q psy6549 75 IRRRVFYN-KLFLIMIIVIE 93 (109)
Q Consensus 75 m~rr~~~~-K~il~~ii~~l 93 (109)
+.+..... ++.++++++++
T Consensus 5 ~~~~~~~~~~l~i~l~~~v~ 24 (97)
T PF04999_consen 5 IIRDIKRQKKLIILLVIVVL 24 (97)
T ss_pred HHHHHHHhhHHHHHHHHHHH
Confidence 34444444 33333333333
No 94
>PF13131 DUF3951: Protein of unknown function (DUF3951)
Probab=64.26 E-value=13 Score=21.37 Aligned_cols=15 Identities=20% Similarity=0.357 Sum_probs=8.3
Q ss_pred HHHHHHHHHHHhccC
Q psy6549 95 MTLSVMLYLKIFKKR 109 (109)
Q Consensus 95 l~i~~viy~k~~~~~ 109 (109)
++|+++-|.-|.+|+
T Consensus 17 ~lIgfity~mfV~K~ 31 (53)
T PF13131_consen 17 FLIGFITYKMFVKKA 31 (53)
T ss_pred HHHHHHHHHhheecC
Confidence 345555566666654
No 95
>PRK09546 zntB zinc transporter; Reviewed
Probab=63.84 E-value=63 Score=24.62 Aligned_cols=17 Identities=6% Similarity=-0.144 Sum_probs=8.5
Q ss_pred HHHHHHHHHHHHHhccC
Q psy6549 93 ESMTLSVMLYLKIFKKR 109 (109)
Q Consensus 93 l~l~i~~viy~k~~~~~ 109 (109)
+.+++++++|+.|+|||
T Consensus 306 im~~i~~~~~~~fkrk~ 322 (324)
T PRK09546 306 LLVVLIGGVAWWLKRSK 322 (324)
T ss_pred HHHHHHHHHHHHHHhcc
Confidence 33444445555555554
No 96
>PF15106 TMEM156: TMEM156 protein family
Probab=63.38 E-value=12 Score=27.74 Aligned_cols=27 Identities=15% Similarity=0.305 Sum_probs=15.9
Q ss_pred HhhHHHHHHHHHHHHHHHHHHHHHHHh
Q psy6549 80 FYNKLFLIMIIVIESMTLSVMLYLKIF 106 (109)
Q Consensus 80 ~~~K~il~~ii~~l~l~i~~viy~k~~ 106 (109)
+.=|+.-|..++++++..++++++|++
T Consensus 172 CsmKITWYvLVllVfiflii~iI~KIl 198 (226)
T PF15106_consen 172 CSMKITWYVLVLLVFIFLIILIIYKIL 198 (226)
T ss_pred eehhhHHHHHHHHHHHHHHHHHHHHHH
Confidence 444566665555555555556666765
No 97
>PF08651 DASH_Duo1: DASH complex subunit Duo1; InterPro: IPR013960 The DASH complex is a ~10 subunit microtubule-binding complex that is transferred to the kinetochore prior to mitosis []. In Saccharomyces cerevisiae (Baker's yeast) DASH forms both rings and spiral structures on microtubules in vitro [, ].
Probab=62.76 E-value=34 Score=21.15 Aligned_cols=36 Identities=8% Similarity=0.085 Sum_probs=26.8
Q ss_pred HHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHHHHHH
Q psy6549 43 ELGVQREALVRARDRLVDTDLHVSRSRRMIQAIRRR 78 (109)
Q Consensus 43 ~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~m~rr 78 (109)
.|....++|.++...+..+.+.+..|..-+..+.+.
T Consensus 2 aL~kEL~~Lr~IN~~ie~~~~~L~~a~~~~~~v~~~ 37 (78)
T PF08651_consen 2 ALEKELEQLRKINPVIEGLIETLRSAKSNMNRVQET 37 (78)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 466677777777777777777777777777777665
No 98
>PF08802 CytB6-F_Fe-S: Cytochrome B6-F complex Fe-S subunit ; InterPro: IPR014909 The cytochrome b6-f complex mediates electron transfer between photosystem II (PSII) and photosystem I (PSI), cyclic electron flow around PSI, and state transitions. The cytochrome b6-f complex has 4 large subunits, these are: cytochrome b6, subunit IV (17 kDa polypeptide, PetD), cytochrome f and the Rieske protein, while the 4 small subunits are: PetG, PetL, PetM and PetN. The complex functions as a dimer. This protein corresponds to the alpha helical transmembrane domain of the cytochrome b6-f complex Rieske iron-sulphur subunit. ; GO: 0009496 plastoquinol-plastocyanin reductase activity, 0051537 2 iron, 2 sulfur cluster binding, 0055114 oxidation-reduction process, 0042651 thylakoid membrane; PDB: 1Q90_R 1VF5_D 2E75_D 2E74_D 2E76_D 2D2C_Q 2ZT9_D.
Probab=62.69 E-value=23 Score=19.19 Aligned_cols=20 Identities=25% Similarity=0.217 Sum_probs=12.9
Q ss_pred HHHHHHHhhHHHHHHHHHHH
Q psy6549 74 AIRRRVFYNKLFLIMIIVIE 93 (109)
Q Consensus 74 ~m~rr~~~~K~il~~ii~~l 93 (109)
+|.||.+-|.+....+.+..
T Consensus 5 dm~RR~lmN~ll~Gava~~a 24 (39)
T PF08802_consen 5 DMSRRQLMNLLLGGAVAVPA 24 (39)
T ss_dssp -HHHHHHHHHHHHHHHHHHH
T ss_pred ChhHHHHHHHHHHhhHHHHH
Confidence 69999999975554444333
No 99
>PRK10884 SH3 domain-containing protein; Provisional
Probab=62.50 E-value=58 Score=23.75 Aligned_cols=52 Identities=4% Similarity=-0.050 Sum_probs=30.5
Q ss_pred HhhhHhhhHHHHHHHHHH---HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHH
Q psy6549 54 ARDRLVDTDLHVSRSRRM---IQAIRRRVFYNKLFLIMIIVIESMTLSVMLYLKI 105 (109)
Q Consensus 54 ~~~~~~~i~~~l~~a~~~---l~~m~rr~~~~K~il~~ii~~l~l~i~~viy~k~ 105 (109)
......+....+..+..- +..-....-.+...-||+..-.++++++++-+-+
T Consensus 137 L~~~n~~L~~~l~~~~~~~~~l~~~~~~~~~~~~~~wf~~Gg~v~~~GlllGlil 191 (206)
T PRK10884 137 LKEENQKLKNQLIVAQKKVDAANLQLDDKQRTIIMQWFMYGGGVAGIGLLLGLLL 191 (206)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHchHHHHHHHHHHHHh
Confidence 333344444444444433 4445555666666778888888777777765554
No 100
>COG4942 Membrane-bound metallopeptidase [Cell division and chromosome partitioning]
Probab=61.94 E-value=87 Score=25.55 Aligned_cols=65 Identities=15% Similarity=0.217 Sum_probs=45.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHHHHHH
Q psy6549 14 QALVRTGESIVRSEQAALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSRRMIQAIRRR 78 (109)
Q Consensus 14 ~~l~~t~~~L~~s~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~m~rr 78 (109)
++++++-..|..-++...+..+.-...-.+|..+.+.+..+...+.++..++...++-+.++..+
T Consensus 38 ~~l~q~q~ei~~~~~~i~~~~~~~~kL~~~lk~~e~~i~~~~~ql~~s~~~l~~~~~~I~~~~~~ 102 (420)
T COG4942 38 KQLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKLRKQIADLNAR 102 (420)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHhhHHHHHHH
Confidence 56666666677777777777777777777788887777777777777777666666666665554
No 101
>COG0598 CorA Mg2+ and Co2+ transporters [Inorganic ion transport and metabolism]
Probab=61.82 E-value=70 Score=24.48 Aligned_cols=23 Identities=30% Similarity=0.426 Sum_probs=13.2
Q ss_pred HHHHHHHHHHHHHHHHHHHhccC
Q psy6549 87 IMIIVIESMTLSVMLYLKIFKKR 109 (109)
Q Consensus 87 ~~ii~~l~l~i~~viy~k~~~~~ 109 (109)
|-+++++.++++++.|+.|+||+
T Consensus 298 y~~~l~~m~~~~~~~~~~frrk~ 320 (322)
T COG0598 298 YPIALILMLLLALLLYLYFRRKG 320 (322)
T ss_pred HHHHHHHHHHHHHHHHHHHHhcC
Confidence 34444555556666666666654
No 102
>PHA03049 IMV membrane protein; Provisional
Probab=61.50 E-value=14 Score=22.45 Aligned_cols=21 Identities=24% Similarity=0.475 Sum_probs=11.1
Q ss_pred HHHHHHHHHHHHHHHHHHhcc
Q psy6549 88 MIIVIESMTLSVMLYLKIFKK 108 (109)
Q Consensus 88 ~ii~~l~l~i~~viy~k~~~~ 108 (109)
+++++.|++|+..+.|-+..|
T Consensus 5 ~~l~iICVaIi~lIvYgiYnk 25 (68)
T PHA03049 5 IILVIICVVIIGLIVYGIYNK 25 (68)
T ss_pred HHHHHHHHHHHHHHHHHHHhc
Confidence 444455555555555555544
No 103
>PF04639 Baculo_E56: Baculoviral E56 protein, specific to ODV envelope; InterPro: IPR006733 This family represents the E56 protein, which is localized to the occlusion derived virus (ODV) envelope, but not to the budded virus (BV) envelope []. Signals necessary for transport and/or retention into this structure are believed to be found within the C-terminal portion of ODV-E56.; GO: 0019031 viral envelope
Probab=60.76 E-value=4.1 Score=31.49 Aligned_cols=17 Identities=18% Similarity=0.278 Sum_probs=6.5
Q ss_pred HHHHHHHHHHHHHHHHH
Q psy6549 86 LIMIIVIESMTLSVMLY 102 (109)
Q Consensus 86 l~~ii~~l~l~i~~viy 102 (109)
+++.+++++++|++++|
T Consensus 282 l~IG~vl~i~~Ig~~if 298 (305)
T PF04639_consen 282 LIIGGVLLIVFIGYFIF 298 (305)
T ss_pred HHHHHHHHHHHhhheee
Confidence 33333333333444333
No 104
>PF00558 Vpu: Vpu protein; InterPro: IPR008187 The Human immunodeficiency virus 1 (HIV-1) Vpu protein acts in the degradation of CD4 in the endoplasmic reticulum and in the enhancement of virion release from the plasma membrane of infected cells [].; GO: 0019076 release of virus from host; PDB: 2JPX_A 1PI8_A 2GOH_A 2GOF_A 1PI7_A 1PJE_A 1VPU_A 2K7Y_A.
Probab=60.60 E-value=15 Score=23.16 Aligned_cols=9 Identities=0% Similarity=0.571 Sum_probs=3.5
Q ss_pred HHHHHHHHH
Q psy6549 95 MTLSVMLYL 103 (109)
Q Consensus 95 l~i~~viy~ 103 (109)
+++.+|+|.
T Consensus 16 ~iiaIvvW~ 24 (81)
T PF00558_consen 16 LIIAIVVWT 24 (81)
T ss_dssp HHHHHHHHH
T ss_pred HHHHHHHHH
Confidence 333334443
No 105
>PF11812 DUF3333: Domain of unknown function (DUF3333); InterPro: IPR024573 This N-terminal domain is functionally uncharacterised and it is found in proteins annotated as putative phosphate ABC transporter permease proteins. This presumed domain is typically between 116 to 159 amino acids in length.
Probab=60.37 E-value=29 Score=24.24 Aligned_cols=27 Identities=11% Similarity=0.225 Sum_probs=16.6
Q ss_pred HHHHHHHhhHHHHHHHHHHHHHHHHHH
Q psy6549 74 AIRRRVFYNKLFLIMIIVIESMTLSVM 100 (109)
Q Consensus 74 ~m~rr~~~~K~il~~ii~~l~l~i~~v 100 (109)
...||...+|.+-+..+..++++++++
T Consensus 4 ~lkkR~~~e~rFr~~g~~Ai~~~l~fL 30 (155)
T PF11812_consen 4 RLKKRYRAERRFRAYGLAAIAIALAFL 30 (155)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 467788888888655554444444444
No 106
>TIGR00763 lon ATP-dependent protease La. This protein is induced by heat shock and other stresses in E. coli, B. subtilis, and other species. The yeast member, designated PIM1, is located in the mitochondrial matrix, required for mitochondrial function, and also induced by heat shock.
Probab=60.19 E-value=98 Score=26.82 Aligned_cols=61 Identities=18% Similarity=0.070 Sum_probs=46.6
Q ss_pred hHHHHHHHhhh---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh------hHHHHHHhhhHhhhHH
Q psy6549 3 KIHRETLRETN---QALVRTGESIVRSEQAALEAEAIGTGVISELGVQ------REALVRARDRLVDTDL 63 (109)
Q Consensus 3 ~~~R~~ll~~~---~~l~~t~~~L~~s~~~~~ete~iG~~il~~L~~Q------re~L~~~~~~~~~i~~ 63 (109)
.++|++||... ++++.-...|.+-...+.=..+|...+-+++.+| ||||+.+++.+.+.++
T Consensus 175 ~~eKQ~LLE~~d~~~RL~~l~~lL~~ele~l~l~~~I~~~v~~~~~~~qr~~~Lreqlk~i~~eLg~~~~ 244 (775)
T TIGR00763 175 KDELQEVLETVNIEKRLKKALELLKKELELLKLQNKITKKVEEKMEKTQREYYLREQLKAIKKELGIEKD 244 (775)
T ss_pred HHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCC
Confidence 78999999876 4666777777777777777778888888888754 5799999999877543
No 107
>PF08649 DASH_Dad1: DASH complex subunit Dad1; InterPro: IPR013958 The DASH complex is a ~10 subunit microtubule-binding complex that is transferred to the kinetochore prior to mitosis []. In Saccharomyces cerevisiae (Baker's yeast) DASH forms both rings and spiral structures on microtubules in vitro [, ]. Components of the DASH complex, including Dam1, Duo1, Spc34, Dad1 and Ask1, are essential and connect the centromere to the plus end of spindle microtubules []. Throughout the cell cycle Dad1 remains bound to kinetochores and its association is dependent on the Mis6 and Mal2 [].
Probab=60.08 E-value=33 Score=20.17 Aligned_cols=37 Identities=8% Similarity=0.153 Sum_probs=26.3
Q ss_pred hHHHHHHHh-hhHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy6549 3 KIHRETLRE-TNQALVRTGESIVRSEQAALEAEAIGTG 39 (109)
Q Consensus 3 ~~~R~~ll~-~~~~l~~t~~~L~~s~~~~~ete~iG~~ 39 (109)
+.||++|++ =++.+++--..++.-.|.++++.++|.+
T Consensus 4 e~qR~~Li~eI~~~~e~vl~nlN~LNRsLE~~i~VGkE 41 (58)
T PF08649_consen 4 ERQRDRLIQEISESMESVLNNLNALNRSLESVISVGKE 41 (58)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhh
Confidence 468888887 3456677777777777777777777765
No 108
>PF05545 FixQ: Cbb3-type cytochrome oxidase component FixQ; InterPro: IPR008621 This family consists of several Cbb3-type cytochrome oxidase components (FixQ/CcoQ). FixQ is found in nitrogen fixing bacteria. Since nitrogen fixation is an energy-consuming process, effective symbioses depend on operation of a respiratory chain with a high affinity for O2, closely coupled to ATP production. This requirement is fulfilled by a special three-subunit terminal oxidase (cytochrome terminal oxidase cbb3), which was first identified in Bradyrhizobium japonicum as the product of the fixNOQP operon [].
Probab=59.58 E-value=23 Score=19.54 Aligned_cols=15 Identities=0% Similarity=0.266 Sum_probs=5.6
Q ss_pred HHHHHHHHHHHHHHH
Q psy6549 90 IVIESMTLSVMLYLK 104 (109)
Q Consensus 90 i~~l~l~i~~viy~k 104 (109)
++++++.+++++|.-
T Consensus 16 v~~~~~F~gi~~w~~ 30 (49)
T PF05545_consen 16 VLFFVFFIGIVIWAY 30 (49)
T ss_pred HHHHHHHHHHHHHHH
Confidence 333333333333333
No 109
>PTZ00046 rifin; Provisional
Probab=59.52 E-value=13 Score=29.60 Aligned_cols=7 Identities=29% Similarity=0.340 Sum_probs=2.5
Q ss_pred HHHHHHH
Q psy6549 87 IMIIVIE 93 (109)
Q Consensus 87 ~~ii~~l 93 (109)
+.+++++
T Consensus 319 aSiiAIv 325 (358)
T PTZ00046 319 ASIVAIV 325 (358)
T ss_pred HHHHHHH
Confidence 3333333
No 110
>PF11346 DUF3149: Protein of unknown function (DUF3149); InterPro: IPR021494 This bacterial family of proteins has no known function.
Probab=59.48 E-value=16 Score=20.12 Aligned_cols=21 Identities=24% Similarity=0.395 Sum_probs=8.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHh
Q psy6549 86 LIMIIVIESMTLSVMLYLKIF 106 (109)
Q Consensus 86 l~~ii~~l~l~i~~viy~k~~ 106 (109)
+...++++..+.+.+.|.+++
T Consensus 13 L~Sl~vI~~~igm~~~~~~~F 33 (42)
T PF11346_consen 13 LMSLIVIVFTIGMGVFFIRYF 33 (42)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 333333333333334444444
No 111
>TIGR01478 STEVOR variant surface antigen, stevor family. This model represents the stevor branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of stevor sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 8 bits.
Probab=59.45 E-value=12 Score=28.97 Aligned_cols=8 Identities=25% Similarity=1.007 Sum_probs=3.5
Q ss_pred HHHHHHhc
Q psy6549 100 MLYLKIFK 107 (109)
Q Consensus 100 viy~k~~~ 107 (109)
++|.|+.|
T Consensus 277 iLYiWlyr 284 (295)
T TIGR01478 277 ILYIWLYR 284 (295)
T ss_pred HHHHHHHH
Confidence 34444443
No 112
>PRK10404 hypothetical protein; Provisional
Probab=58.99 E-value=47 Score=21.52 Aligned_cols=64 Identities=13% Similarity=0.074 Sum_probs=28.4
Q ss_pred HHHHHHHHHHhh-HHHHHHhhhHhhhHHHHHH-HHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHH
Q psy6549 37 GTGVISELGVQR-EALVRARDRLVDTDLHVSR-SRRMIQAIRRRVFYNKLFLIMIIVIESMTLSVM 100 (109)
Q Consensus 37 G~~il~~L~~Qr-e~L~~~~~~~~~i~~~l~~-a~~~l~~m~rr~~~~K~il~~ii~~l~l~i~~v 100 (109)
+.+-..+|+.+= +.|..+++.+.+....... ++........-+-.|-|--++|.+.+.++++++
T Consensus 32 a~e~~~~lR~r~~~~L~~ar~~l~~~~~~~~~~~k~aa~~td~yV~e~Pw~avGiaagvGlllG~L 97 (101)
T PRK10404 32 ADQKYVELKARAEKALDDVKKRVSQASDSYYYRAKQAVYRADDYVHEKPWQGIGVGAAVGLVLGLL 97 (101)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHhCcHHHHHHHHHHHHHHHHH
Confidence 333344444332 3444445455554443322 333333333333445555566655554444443
No 113
>PRK11466 hybrid sensory histidine kinase TorS; Provisional
Probab=58.78 E-value=1.2e+02 Score=26.09 Aligned_cols=6 Identities=33% Similarity=0.811 Sum_probs=2.4
Q ss_pred HHHHHH
Q psy6549 97 LSVMLY 102 (109)
Q Consensus 97 i~~viy 102 (109)
+++++|
T Consensus 344 ~~~~~~ 349 (914)
T PRK11466 344 LILILW 349 (914)
T ss_pred HHHHHH
Confidence 333433
No 114
>PF11669 WBP-1: WW domain-binding protein 1; InterPro: IPR021684 This family of proteins represents WBP-1, a ligand of the WW domain of Yes-associated protein. This protein has a proline-rich domain. WBP-1 does not bind to the SH3 domain [].
Probab=58.59 E-value=16 Score=23.76 Aligned_cols=23 Identities=4% Similarity=0.053 Sum_probs=10.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhc
Q psy6549 85 FLIMIIVIESMTLSVMLYLKIFK 107 (109)
Q Consensus 85 il~~ii~~l~l~i~~viy~k~~~ 107 (109)
.+|.++++++++.++-+|..+.+
T Consensus 23 ~FWlv~~liill~c~c~~~~~r~ 45 (102)
T PF11669_consen 23 YFWLVWVLIILLSCCCACRHRRR 45 (102)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444444433344445555443
No 115
>PF11031 Phage_holin_T: Bacteriophage T holin; InterPro: IPR020982 One mechanism by which bacteriophages effect host lysis begins with the accumulation of a holin in the host membrane and an endolysin in the host cytoplasm during late gene expression. At an allele-specific time, the holin disrupts the membrane, thus allowing the endolysin to enter the periplasm and degrade the cell wall peptidoglycan. This entry represents a specific holin, known as T, which has an unusual C-terminal periplasmic domain thought to be involved in the transduction of environmental information for the real-time control of lysis timing [].
Probab=58.46 E-value=8.9 Score=28.24 Aligned_cols=36 Identities=14% Similarity=0.249 Sum_probs=27.3
Q ss_pred HHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHH
Q psy6549 70 RMIQAIRRRVFYNKLFLIMIIVIESMTLSVMLYLKI 105 (109)
Q Consensus 70 ~~l~~m~rr~~~~K~il~~ii~~l~l~i~~viy~k~ 105 (109)
.+|..+.+-..+-|+++.=+++++++.+.+++|++.
T Consensus 14 ~lLdrlfkd~~tgk~L~~Rv~~iivlfim~l~wyk~ 49 (216)
T PF11031_consen 14 GLLDRLFKDNKTGKVLFSRVIVIIVLFIMALIWYKG 49 (216)
T ss_pred HHHHHHHhccCcHHHHHHHHHHHHHHHHHhheeecc
Confidence 455666666778888888888888888888888764
No 116
>PTZ00370 STEVOR; Provisional
Probab=58.44 E-value=13 Score=28.84 Aligned_cols=8 Identities=25% Similarity=1.007 Sum_probs=3.5
Q ss_pred HHHHHHhc
Q psy6549 100 MLYLKIFK 107 (109)
Q Consensus 100 viy~k~~~ 107 (109)
++|.|+.|
T Consensus 273 ilYiwlyr 280 (296)
T PTZ00370 273 ILYIWLYR 280 (296)
T ss_pred HHHHHHHH
Confidence 34444443
No 117
>PF01519 DUF16: Protein of unknown function DUF16; InterPro: IPR002862 Proteins that contain this domain are of unknown function. It appears to be confined to proteins from Mycoplasma pneumoniae [].; PDB: 2BA2_C.
Probab=58.06 E-value=51 Score=21.64 Aligned_cols=38 Identities=13% Similarity=0.161 Sum_probs=27.8
Q ss_pred HHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHHHHHHH
Q psy6549 42 SELGVQREALVRARDRLVDTDLHVSRSRRMIQAIRRRV 79 (109)
Q Consensus 42 ~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~m~rr~ 79 (109)
+....|.|+|......+.-....+..-...|..|..|.
T Consensus 60 e~V~~QGEqIkel~~e~k~qgktL~~I~~~L~~inkRL 97 (102)
T PF01519_consen 60 EKVDKQGEQIKELQVEQKAQGKTLQLILKTLQSINKRL 97 (102)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44556777777777777777777777777788887774
No 118
>PF06008 Laminin_I: Laminin Domain I; InterPro: IPR009254 Laminins are glycoproteins that are major constituents of the basement membrane of cells. Laminins are trimeric molecules; laminin-1 is an alpha1 beta1 gamma1 trimer. It has been suggested that the domains I and II from laminin A, B1 and B2 may come together to form a triple helical coiled-coil structure []. Binding to cells via a high affinity receptor, laminin is thought to mediate the attachment, migration and organisation of cells into tissues during embryonic development by interacting with other extracellular matrix components.; GO: 0005102 receptor binding, 0030155 regulation of cell adhesion, 0030334 regulation of cell migration, 0045995 regulation of embryonic development, 0005606 laminin-1 complex
Probab=57.42 E-value=77 Score=23.51 Aligned_cols=31 Identities=19% Similarity=0.261 Sum_probs=14.0
Q ss_pred HHhhhHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy6549 9 LRETNQALVRTGESIVRSEQAALEAEAIGTG 39 (109)
Q Consensus 9 ll~~~~~l~~t~~~L~~s~~~~~ete~iG~~ 39 (109)
|-.-..++++.-+.|++|.....+++.+=..
T Consensus 187 L~~~~~kL~Dl~~~l~eA~~~~~ea~~ln~~ 217 (264)
T PF06008_consen 187 LNDYNAKLQDLRDLLNEAQNKTREAEDLNRA 217 (264)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3333444444444444444444444444333
No 119
>TIGR01294 P_lamban phospholamban. This model represents the short (52 residue) transmembrane phosphoprotein phospholamban. Phospholamban, in its unphosphorylated form, inhibits SERCA2, the cardiac sarcoplasmic reticulum Ca-ATPase.
Probab=57.30 E-value=33 Score=19.30 Aligned_cols=26 Identities=15% Similarity=0.237 Sum_probs=12.5
Q ss_pred HHHHHHHhhHHHHHHHHHHHHHHHHH
Q psy6549 74 AIRRRVFYNKLFLIMIIVIESMTLSV 99 (109)
Q Consensus 74 ~m~rr~~~~K~il~~ii~~l~l~i~~ 99 (109)
.=+|+-.++-++=+..|++.+++|++
T Consensus 22 ~qar~~lq~lfvnf~lilicllli~i 47 (52)
T TIGR01294 22 QQARQNLQNLFINFCLILICLLLICI 47 (52)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34556666655544444443333433
No 120
>PRK15136 multidrug efflux system protein EmrA; Provisional
Probab=56.81 E-value=13 Score=29.22 Aligned_cols=22 Identities=9% Similarity=0.304 Sum_probs=8.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH
Q psy6549 84 LFLIMIIVIESMTLSVMLYLKI 105 (109)
Q Consensus 84 ~il~~ii~~l~l~i~~viy~k~ 105 (109)
|+++++++++++++++.+||.+
T Consensus 23 ~~~~~~~~~~~~~~~~~~~~~~ 44 (390)
T PRK15136 23 ALLLLTLLFIIIGVAYGIYWFL 44 (390)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 4444433333333333344433
No 121
>PRK10856 cytoskeletal protein RodZ; Provisional
Probab=56.77 E-value=11 Score=29.50 Aligned_cols=27 Identities=0% Similarity=-0.135 Sum_probs=13.7
Q ss_pred HhhHHHHHHHHHHHHHHHHHHHHHHHh
Q psy6549 80 FYNKLFLIMIIVIESMTLSVMLYLKIF 106 (109)
Q Consensus 80 ~~~K~il~~ii~~l~l~i~~viy~k~~ 106 (109)
..+.+++.+..++++++++++.+|+|.
T Consensus 108 ~~~~~~~~~~~lv~~vvl~l~~~wwwq 134 (331)
T PRK10856 108 KRDGWLMTFTWLVLFVVIGLTGAWWWQ 134 (331)
T ss_pred ccCCchHHHHHHHHHHHHHHHHHHHhh
Confidence 344455555555555555555444443
No 122
>PF13253 DUF4044: Protein of unknown function (DUF4044)
Probab=56.49 E-value=28 Score=18.40 Aligned_cols=24 Identities=8% Similarity=0.303 Sum_probs=15.7
Q ss_pred HhhHHHHHHHHHHHHHHHHHHHHH
Q psy6549 80 FYNKLFLIMIIVIESMTLSVMLYL 103 (109)
Q Consensus 80 ~~~K~il~~ii~~l~l~i~~viy~ 103 (109)
..+|+.++.+++.+++.++.+++-
T Consensus 8 ~fekiT~v~v~lM~i~tvg~v~~~ 31 (35)
T PF13253_consen 8 TFEKITMVVVWLMLILTVGSVVAS 31 (35)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 456777777777776666666553
No 123
>PF15102 TMEM154: TMEM154 protein family
Probab=56.44 E-value=4.3 Score=28.30 Aligned_cols=15 Identities=20% Similarity=0.350 Sum_probs=5.9
Q ss_pred HHHHHHHHHHHHHHh
Q psy6549 92 IESMTLSVMLYLKIF 106 (109)
Q Consensus 92 ~l~l~i~~viy~k~~ 106 (109)
+|++++++++++-++
T Consensus 67 LLvlLLl~vV~lv~~ 81 (146)
T PF15102_consen 67 LLVLLLLSVVCLVIY 81 (146)
T ss_pred HHHHHHHHHHHheeE
Confidence 333334444444333
No 124
>PTZ00446 vacuolar sorting protein SNF7-like; Provisional
Probab=56.25 E-value=75 Score=23.03 Aligned_cols=73 Identities=8% Similarity=0.018 Sum_probs=45.6
Q ss_pred hHHHHHHHhhhHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHHHH
Q psy6549 3 KIHRETLRETNQALVRTGESIVRSEQAA--LEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSRRMIQAIR 76 (109)
Q Consensus 3 ~~~R~~ll~~~~~l~~t~~~L~~s~~~~--~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~m~ 76 (109)
+.+..++......+++....|+.|+-.. .++-..|..+|..++..= .+.++.+-+++++.....++.+=..++
T Consensus 80 E~ql~q~~~ql~nLEq~~~~iE~a~~~~ev~~aLk~g~~aLK~~~k~~-~idkVd~lmDei~E~~e~~~EIseaLs 154 (191)
T PTZ00446 80 EQEIENILNNRLTLEDNMINLENMHLHKIAVNALSYAANTHKKLNNEI-NTQKVEKIIDTIQENKDIQEEINQALS 154 (191)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-CHHHHHHHHHHHHHHHHHHHHHHHHHc
Confidence 3455666666677777777777765433 356666777777777554 466666666666666666655444443
No 125
>PF11932 DUF3450: Protein of unknown function (DUF3450); InterPro: IPR016866 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function. However, they are found in an operon along with components of a TonB transport system (typified by Vibrio cholerae TonB2 [], and are predicted to be localized to the periplasmic space. Caution: the low-complexity nature of these sequences produces spurious BLAST hits to chromosome segregation ATPases (which are much longer in length and contain canonical Walker motifs). Accordingly, some members are misidentified as such.
Probab=56.05 E-value=80 Score=23.27 Aligned_cols=57 Identities=19% Similarity=0.140 Sum_probs=32.1
Q ss_pred HHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhh
Q psy6549 4 IHRETLRETNQALVRTGESIVRSEQAALEAEAIGTGVISELGVQREALVRARDRLVD 60 (109)
Q Consensus 4 ~~R~~ll~~~~~l~~t~~~L~~s~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~ 60 (109)
++++.|+...+.+++..+.|..-.+......+--..-+.+|.+|-+.+..+...+..
T Consensus 49 ~e~~~L~~e~~~l~~e~e~L~~~~~~l~~~v~~q~~el~~L~~qi~~~~~~~~~l~p 105 (251)
T PF11932_consen 49 DEKQELLAEYRQLEREIENLEVYNEQLERQVASQEQELASLEQQIEQIEETRQELVP 105 (251)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 455556666666666666666655555555555555555555555555555554444
No 126
>PF01299 Lamp: Lysosome-associated membrane glycoprotein (Lamp); InterPro: IPR002000 Lysosome-associated membrane glycoproteins (lamp) [] are integral membrane proteins, specific to lysosomes, and whose exact biological function is not yet clear. Structurally, the lamp proteins consist of two internally homologous lysosome-luminal domains separated by a proline-rich hinge region; at the C-terminal extremity there is a transmembrane region (TM) followed by a very short cytoplasmic tail (C). In each of the duplicated domains, there are two conserved disulphide bonds. This structure is schematically represented in the figure below. +-----+ +-----+ +-----+ +-----+ | | | | | | | | xCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxxxCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxx +--------------------------++Hinge++--------------------------++TM++C+ In mammals, there are two closely related types of lamp: lamp-1 and lamp-2, which form major components of the lysosome membrane. In chicken lamp-1 is known as LEP100. Also included in this entry is the macrophage protein CD68 (or macrosialin) [] is a heavily glycosylated integral membrane protein whose structure consists of a mucin-like domain followed by a proline-rich hinge; a single lamp-like domain; a transmembrane region and a short cytoplasmic tail. Similar to CD68, mammalian lamp-3, which is expressed in lymphoid organs, dendritic cells and in lung, contains all the C-terminal regions but lacks the N-terminal lamp-like region []. In a lamp-family protein from nematodes [] only the part C-terminal to the hinge is conserved. ; GO: 0016020 membrane
Probab=55.67 E-value=9.2 Score=29.16 Aligned_cols=21 Identities=5% Similarity=0.056 Sum_probs=8.6
Q ss_pred HHHHHHHHHHHHHHHHHHHhcc
Q psy6549 87 IMIIVIESMTLSVMLYLKIFKK 108 (109)
Q Consensus 87 ~~ii~~l~l~i~~viy~k~~~~ 108 (109)
+|++++.+++|++|. |.+.||
T Consensus 277 VG~~La~lvlivLia-Yli~Rr 297 (306)
T PF01299_consen 277 VGAALAGLVLIVLIA-YLIGRR 297 (306)
T ss_pred HHHHHHHHHHHHHHh-heeEec
Confidence 344444434444344 444443
No 127
>PF04478 Mid2: Mid2 like cell wall stress sensor; InterPro: IPR007567 This family represents a region near the C terminus of Mid2, which contains a transmembrane region. The remainder of the protein sequence is serine-rich and of low complexity, and is therefore impossible to align accurately. Mid2 is thought to act as a mechanosensor of cell wall stress. The C-terminal cytoplasmic region of Mid2 is known to interact with Rom2, a guanine nucleotide exchange factor (GEF) for Rho1, which is part of the cell wall integrity signalling pathway [].
Probab=55.62 E-value=2.9 Score=29.37 Aligned_cols=12 Identities=0% Similarity=0.324 Sum_probs=4.9
Q ss_pred HHHHHHHHHHHh
Q psy6549 95 MTLSVMLYLKIF 106 (109)
Q Consensus 95 l~i~~viy~k~~ 106 (109)
+++++++|+...
T Consensus 64 l~il~lvf~~c~ 75 (154)
T PF04478_consen 64 LGILALVFIFCI 75 (154)
T ss_pred HHHHHhheeEEE
Confidence 344444443333
No 128
>COG5052 YOP1 Protein involved in membrane traffic [Intracellular trafficking and secretion]
Probab=55.61 E-value=77 Score=22.96 Aligned_cols=50 Identities=10% Similarity=0.020 Sum_probs=31.6
Q ss_pred HHHHHHhhhHhhhHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHH
Q psy6549 49 EALVRARDRLVDTDLHVSRSRRMIQAIRRRVFYNKLFLIMIIVIESMTLSV 99 (109)
Q Consensus 49 e~L~~~~~~~~~i~~~l~~a~~~l~~m~rr~~~~K~il~~ii~~l~l~i~~ 99 (109)
+++.+++......+..+ .+.++++.+.++.-..|..+..++..+++.+++
T Consensus 3 ~~l~~is~aM~~l~~t~-~~~piL~~ie~~~~~~k~Y~~~~asf~~l~~lf 52 (186)
T COG5052 3 GQLVNISVAMLVLDNTL-QAFPILREIENLYNRYKKYFMAGASFLYLLNLF 52 (186)
T ss_pred hHHHHHHHHHHHHHHHH-HhhHHHHHHHHHhCcchhhHHHHHHHHHHHHHH
Confidence 56677777777666655 466789999999866555544444444343433
No 129
>PF15018 InaF-motif: TRP-interacting helix
Probab=55.47 E-value=21 Score=19.22 Aligned_cols=13 Identities=15% Similarity=0.424 Sum_probs=9.1
Q ss_pred HHHHHHHHHHHHh
Q psy6549 94 SMTLSVMLYLKIF 106 (109)
Q Consensus 94 ~l~i~~viy~k~~ 106 (109)
+.++.+.+||-|+
T Consensus 19 l~Ai~LsiYY~f~ 31 (38)
T PF15018_consen 19 LAAIVLSIYYIFF 31 (38)
T ss_pred HHHHHHHHHHhee
Confidence 3567777888765
No 130
>PF05399 EVI2A: Ectropic viral integration site 2A protein (EVI2A); InterPro: IPR008608 This family contains several mammalian ectropic viral integration site 2A (EVI2A) proteins. The function of this protein is unknown although it is thought to be a membrane protein and may function as an oncogene in retrovirus induced myeloid tumours [, ].; GO: 0016021 integral to membrane
Probab=54.39 E-value=18 Score=26.82 Aligned_cols=18 Identities=22% Similarity=0.554 Sum_probs=8.8
Q ss_pred HHHHHHHHHHHHHHHHHH
Q psy6549 86 LIMIIVIESMTLSVMLYL 103 (109)
Q Consensus 86 l~~ii~~l~l~i~~viy~ 103 (109)
.|+||+.++++||.++|+
T Consensus 132 IClIIIAVLfLICT~LfL 149 (227)
T PF05399_consen 132 ICLIIIAVLFLICTLLFL 149 (227)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 344444444555555544
No 131
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=54.13 E-value=12 Score=25.64 Aligned_cols=10 Identities=0% Similarity=0.278 Sum_probs=5.0
Q ss_pred HHHHHHHhcc
Q psy6549 99 VMLYLKIFKK 108 (109)
Q Consensus 99 ~viy~k~~~~ 108 (109)
+.++++|.||
T Consensus 119 ~~~~yr~~r~ 128 (139)
T PHA03099 119 LLSVYRFTRR 128 (139)
T ss_pred HHhhheeeec
Confidence 3345555554
No 132
>PF15179 Myc_target_1: Myc target protein 1
Probab=53.86 E-value=23 Score=25.71 Aligned_cols=26 Identities=19% Similarity=0.128 Sum_probs=14.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccC
Q psy6549 84 LFLIMIIVIESMTLSVMLYLKIFKKR 109 (109)
Q Consensus 84 ~il~~ii~~l~l~i~~viy~k~~~~~ 109 (109)
|.+.+.|++++.+++.+++.+.-|+|
T Consensus 25 F~vSm~iGLviG~li~~LltwlSRRR 50 (197)
T PF15179_consen 25 FCVSMAIGLVIGALIWALLTWLSRRR 50 (197)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcc
Confidence 34445555565666666665555543
No 133
>PF05335 DUF745: Protein of unknown function (DUF745); InterPro: IPR007999 This family consists of several uncharacterised Drosophila melanogaster proteins of unknown function.
Probab=53.56 E-value=83 Score=22.72 Aligned_cols=61 Identities=16% Similarity=0.219 Sum_probs=48.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHHH
Q psy6549 15 ALVRTGESIVRSEQAALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSRRMIQAI 75 (109)
Q Consensus 15 ~l~~t~~~L~~s~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~m 75 (109)
.++.-...|+.++..+...+.+..+.-.+|..++.=|...+..++.+...|..++.-+..-
T Consensus 110 q~~~L~~~l~~a~~nl~~a~~~a~~AQ~el~eK~qLLeaAk~Rve~L~~QL~~Ar~D~~~t 170 (188)
T PF05335_consen 110 QLETLKAALKAAQANLANAEQVAEGAQQELAEKTQLLEAAKRRVEELQRQLQAARADYEKT 170 (188)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3445556777777778888888888888999999999999999999988888888655443
No 134
>TIGR01195 oadG_fam sodium pump decarboxylases, gamma subunit. Most sequences scoring between the noise and trusted cutoffs are eukaryotic sodium channel proteins.
Probab=53.35 E-value=25 Score=21.86 Aligned_cols=18 Identities=0% Similarity=-0.129 Sum_probs=6.9
Q ss_pred HHHHHHHHHHHHHHHHHH
Q psy6549 88 MIIVIESMTLSVMLYLKI 105 (109)
Q Consensus 88 ~ii~~l~l~i~~viy~k~ 105 (109)
|+.++++.+++++++.++
T Consensus 14 GM~~VF~fL~lLi~~i~~ 31 (82)
T TIGR01195 14 GMGIVFLFLSLLIYAVRG 31 (82)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 333333333444443333
No 135
>KOG2911|consensus
Probab=53.05 E-value=1.3e+02 Score=24.73 Aligned_cols=71 Identities=17% Similarity=0.135 Sum_probs=52.3
Q ss_pred HHHHHHHhhhHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHH
Q psy6549 4 IHRETLRETNQALVRTGESIVRS--EQAALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSRRMIQA 74 (109)
Q Consensus 4 ~~R~~ll~~~~~l~~t~~~L~~s--~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~ 74 (109)
..+++.....++++.--.++++| .+++.++-..|.+++...-.|--.-.++++-++++.+.+++.+.+=..
T Consensus 287 K~~er~~~~l~~l~~vl~~Id~s~~nkvvl~AyksGs~alK~il~~~~s~ekVed~Ldev~et~d~~~EV~~~ 359 (439)
T KOG2911|consen 287 KDLERKVSSLNNLETVLSQIDNSQTNKVVLQAYKSGSEALKAILAQGGSTEKVEDVLDEVNETLDRQEEVEDA 359 (439)
T ss_pred hhHHHHHHHHHHHHHHHHHHHhhcccHHHHHHHHHhHHHHHHHHhccCChhhHHHHHHHHHHHHhhHHHHHHH
Confidence 44556666666666666777766 467889999999999998888777777778888888877776654333
No 136
>PRK11637 AmiB activator; Provisional
Probab=52.76 E-value=1.2e+02 Score=24.20 Aligned_cols=44 Identities=9% Similarity=0.129 Sum_probs=23.5
Q ss_pred HHHHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHHHHHHHHhhH
Q psy6549 40 VISELGVQREALVRARDRLVDTDLHVSRSRRMIQAIRRRVFYNK 83 (109)
Q Consensus 40 il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~m~rr~~~~K 83 (109)
+-.++..-...|......+.+....+......+....+.++.+.
T Consensus 94 ~~~~i~~~~~ei~~l~~eI~~~q~~l~~~~~~l~~rlra~Y~~g 137 (428)
T PRK11637 94 TQNTLNQLNKQIDELNASIAKLEQQQAAQERLLAAQLDAAFRQG 137 (428)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcC
Confidence 33344444444555555555555555555556666666666543
No 137
>PRK05696 fliL flagellar basal body-associated protein FliL; Reviewed
Probab=51.76 E-value=12 Score=26.26 Aligned_cols=22 Identities=14% Similarity=0.214 Sum_probs=10.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH
Q psy6549 84 LFLIMIIVIESMTLSVMLYLKI 105 (109)
Q Consensus 84 ~il~~ii~~l~l~i~~viy~k~ 105 (109)
+|+++++++++++.++..||.+
T Consensus 21 iiii~~~~lll~~~g~~~~f~l 42 (170)
T PRK05696 21 IIIIVIGVLLALGGGGAAWFFM 42 (170)
T ss_pred EeeHHHHHHHHHHHHHHHHhhh
Confidence 3444444444444454555544
No 138
>COG5415 Predicted integral membrane metal-binding protein [General function prediction only]
Probab=51.73 E-value=1e+02 Score=23.12 Aligned_cols=32 Identities=16% Similarity=0.354 Sum_probs=21.4
Q ss_pred hHHHHHHhhhHhhhHHHHHHHHHHHHHHHHHH
Q psy6549 48 REALVRARDRLVDTDLHVSRSRRMIQAIRRRV 79 (109)
Q Consensus 48 re~L~~~~~~~~~i~~~l~~a~~~l~~m~rr~ 79 (109)
++.|.+...++++.+.+++.+..+++.+.-|.
T Consensus 14 ~~~L~rle~qi~q~~~~~~~~qs~l~~~~~r~ 45 (251)
T COG5415 14 TADLSRLESQIHQLDVALKKSQSILSQWQSRL 45 (251)
T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34566666667777777777777777766654
No 139
>KOG1556|consensus
Probab=51.16 E-value=45 Score=25.55 Aligned_cols=38 Identities=24% Similarity=0.275 Sum_probs=27.7
Q ss_pred HHHHHHHHHHHH-------------HHHHHhhHHHHHHhhhHhhhHHHHHH
Q psy6549 30 ALEAEAIGTGVI-------------SELGVQREALVRARDRLVDTDLHVSR 67 (109)
Q Consensus 30 ~~ete~iG~~il-------------~~L~~Qre~L~~~~~~~~~i~~~l~~ 67 (109)
++|.|++|.+=+ ..+..|-+.|.+-+..+.+|.+.++.
T Consensus 165 AeEAEEvGVEHLlRDikd~t~gtla~rit~ql~sLkgl~~~L~eI~~YL~~ 215 (309)
T KOG1556|consen 165 AEEAEEVGVEHLLRDIKDTTVGTLATRITNQLMSLKGLQSRLREIRSYLDK 215 (309)
T ss_pred hhHHHHhhHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 566666665533 56778888888888888888887765
No 140
>PF13800 Sigma_reg_N: Sigma factor regulator N-terminal
Probab=51.15 E-value=46 Score=20.80 Aligned_cols=13 Identities=8% Similarity=0.373 Sum_probs=6.2
Q ss_pred HHHHHHHHHHHhh
Q psy6549 70 RMIQAIRRRVFYN 82 (109)
Q Consensus 70 ~~l~~m~rr~~~~ 82 (109)
++++.-.|+....
T Consensus 3 ~i~kK~K~k~~l~ 15 (96)
T PF13800_consen 3 KILKKAKRKSRLR 15 (96)
T ss_pred hHHHHHHHHHHHH
Confidence 3455555554433
No 141
>PTZ00046 rifin; Provisional
Probab=50.64 E-value=21 Score=28.42 Aligned_cols=20 Identities=20% Similarity=0.416 Sum_probs=7.8
Q ss_pred HHHHHHHHHHHHHHHHHHHH
Q psy6549 84 LFLIMIIVIESMTLSVMLYL 103 (109)
Q Consensus 84 ~il~~ii~~l~l~i~~viy~ 103 (109)
++.+.+|++..++|-+|+-|
T Consensus 321 iiAIvVIVLIMvIIYLILRY 340 (358)
T PTZ00046 321 IVAIVVIVLIMVIIYLILRY 340 (358)
T ss_pred HHHHHHHHHHHHHHHHHHHh
Confidence 33344444333333333333
No 142
>TIGR01477 RIFIN variant surface antigen, rifin family. This model represents the rifin branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of rifin sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 20 bits.
Probab=50.42 E-value=21 Score=28.34 Aligned_cols=24 Identities=25% Similarity=0.391 Sum_probs=13.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhc
Q psy6549 84 LFLIMIIVIESMTLSVMLYLKIFK 107 (109)
Q Consensus 84 ~il~~ii~~l~l~i~~viy~k~~~ 107 (109)
+|.+.+|++..++|-+++-|+-+|
T Consensus 316 iIAIvvIVLIMvIIYLILRYRRKK 339 (353)
T TIGR01477 316 IIAILIIVLIMVIIYLILRYRRKK 339 (353)
T ss_pred HHHHHHHHHHHHHHHHHHHhhhcc
Confidence 344455555555666666665543
No 143
>PF11657 Activator-TraM: Transcriptional activator TraM
Probab=50.27 E-value=83 Score=21.80 Aligned_cols=10 Identities=10% Similarity=0.109 Sum_probs=4.2
Q ss_pred hHHHHHHHHH
Q psy6549 61 TDLHVSRSRR 70 (109)
Q Consensus 61 i~~~l~~a~~ 70 (109)
++..+.+++.
T Consensus 104 i~~~~~~~~~ 113 (144)
T PF11657_consen 104 IDNSLAEVND 113 (144)
T ss_pred HHHHHHHHHH
Confidence 3444444443
No 144
>COG4068 Uncharacterized protein containing a Zn-ribbon [Function unknown]
Probab=50.21 E-value=31 Score=20.54 Aligned_cols=14 Identities=21% Similarity=0.311 Sum_probs=6.7
Q ss_pred HhhHHHHHHHHHHH
Q psy6549 80 FYNKLFLIMIIVIE 93 (109)
Q Consensus 80 ~~~K~il~~ii~~l 93 (109)
.++-.++++.+++|
T Consensus 40 ~r~~~~~~~li~aL 53 (64)
T COG4068 40 QRNFMILMFLILAL 53 (64)
T ss_pred HHHHHHHHHHHHHH
Confidence 34444555555544
No 145
>PF06072 Herpes_US9: Alphaherpesvirus tegument protein US9; InterPro: IPR009278 This family consists of several US9 and related proteins from the Alphaherpesviruses. The function of the US9 protein is unknown although in Bovine herpesvirus 5 Us9 is essential for the anterograde spread of the virus from the olfactory mucosa to the bulb [].; GO: 0019033 viral tegument
Probab=50.13 E-value=53 Score=19.48 Aligned_cols=14 Identities=7% Similarity=0.321 Sum_probs=9.0
Q ss_pred HHHHHHHHHHHHHH
Q psy6549 66 SRSRRMIQAIRRRV 79 (109)
Q Consensus 66 ~~a~~~l~~m~rr~ 79 (109)
+.|..-|+.|.|+.
T Consensus 9 ETA~~FL~RvGr~q 22 (60)
T PF06072_consen 9 ETATEFLRRVGRQQ 22 (60)
T ss_pred ccHHHHHHHHhHHH
Confidence 44666677777665
No 146
>PHA03240 envelope glycoprotein M; Provisional
Probab=49.94 E-value=28 Score=26.10 Aligned_cols=23 Identities=30% Similarity=0.346 Sum_probs=10.6
Q ss_pred HHHHHHHHHHHHHHHHH-----HHHHHh
Q psy6549 84 LFLIMIIVIESMTLSVM-----LYLKIF 106 (109)
Q Consensus 84 ~il~~ii~~l~l~i~~v-----iy~k~~ 106 (109)
||++++|++.++++++. +|=||.
T Consensus 215 WIiilIIiIiIIIL~cfKiPQKl~dKw~ 242 (258)
T PHA03240 215 WIFIAIIIIIVIILFFFKIPQKLFDKWD 242 (258)
T ss_pred HHHHHHHHHHHHHHHHHhccHHHHHHHh
Confidence 44444444444444444 555554
No 147
>PF12495 Vip3A_N: Vegetative insecticide protein 3A N terminal ; InterPro: IPR022180 This family of proteins is found in bacteria. Proteins in this family are typically between 170 and 789 amino acids in length. The family is found in association with PF02018 from PFAM. Vip3A represents a novel class of proteins insecticidal to lepidopteran insect larvae.
Probab=49.81 E-value=83 Score=21.65 Aligned_cols=85 Identities=11% Similarity=0.132 Sum_probs=60.8
Q ss_pred HHHHhhhHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHHHHHHH
Q psy6549 7 ETLRETNQALVRTGESIVRSEQAALEA-------EAIGTGVISELGVQREALVRARDRLVDTDLHVSRSRRMIQAIRRRV 79 (109)
Q Consensus 7 ~~ll~~~~~l~~t~~~L~~s~~~~~et-------e~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~m~rr~ 79 (109)
+..+.+.+-+++-++.|+-..--+.+. .+...+++.--.+|...|..+..+++.+.+.+..--.-+.+|-...
T Consensus 38 deilknq~lln~is~kldging~l~dliaqgnln~el~ke~lkianeqn~~ln~vn~~l~~in~~l~~ylpkitsmls~v 117 (177)
T PF12495_consen 38 DEILKNQQLLNQISDKLDGINGSLNDLIAQGNLNSELSKEILKIANEQNQMLNNVNNQLNSINSMLNTYLPKITSMLSDV 117 (177)
T ss_pred HHHHHhHHHHHHhcccccccCCcHHHHHHcCCccHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHH
Confidence 345666676776666665544444443 3445566666678999999999999999999998888888888888
Q ss_pred HhhHHHHHHHHH
Q psy6549 80 FYNKLFLIMIIV 91 (109)
Q Consensus 80 ~~~K~il~~ii~ 91 (109)
+++-+.+..-|-
T Consensus 118 mkqny~lslqie 129 (177)
T PF12495_consen 118 MKQNYVLSLQIE 129 (177)
T ss_pred HHhhhhhhhhHH
Confidence 777776655443
No 148
>PF04272 Phospholamban: Phospholamban; InterPro: IPR005984 Phospholamban (PLB) is a small protein (52 amino acids) that regulates the affinity of the cardiac sarcoplasmic reticulum Ca2+-ATPase (SERCA2a) for calcium. PLB is present in cardiac myocytes, in slow-twitch and smooth muscle and is expressed also in aorta endothelial cells in which it could play a role in tissue relaxation. The phosphorylation/dephosphorylation of phospholamban removes and restores, respectively, its inhibitory activity on SERCA2a. It has in fact been shown that phospholamban, in its non-phosphorylated form, binds to SERCA2a and inhibits this pump by lowering its affinity for Ca2+, whereas the phosphorylated form does not exert the inhibition. PLB is phosphorylated at two sites, namely at Ser-16 for a cAMP-dependent phosphokinase and at Thr-17 for a Ca2+/calmodulin-dependent phosphokinase, phosphorylation at Ser-16 being a prerequisite for the phosphorylation at Thr-17. The structure of a 36-amino-acid-long N-terminal fragment of human phospholamban phosphorylated at Ser-16 and Thr-17 and Cys36Ser mutated was determined from nuclear magnetic resonance data. The peptide assumes a conformation characterised by two alpha-helices connected by an irregular strand, which comprises the amino acids from Arg-13 to Pro-21. The proline is in a trans conformation. The two phosphate groups on Ser-16 and Thr-17 are shown to interact preferably with the side chains of Arg-14 and Arg-13, respectively [].; GO: 0005246 calcium channel regulator activity, 0042030 ATPase inhibitor activity, 0006816 calcium ion transport, 0016020 membrane; PDB: 1N7L_A 1FJP_A 1FJK_A 2HYN_C 1ZLL_D 1PLP_A 3O7L_I.
Probab=49.51 E-value=47 Score=18.70 Aligned_cols=16 Identities=13% Similarity=0.216 Sum_probs=7.0
Q ss_pred HHHHHhhHHHHHHHHH
Q psy6549 76 RRRVFYNKLFLIMIIV 91 (109)
Q Consensus 76 ~rr~~~~K~il~~ii~ 91 (109)
+|+-.++-++=+..|+
T Consensus 24 a~qnlqelfvnfclil 39 (52)
T PF04272_consen 24 ARQNLQELFVNFCLIL 39 (52)
T ss_dssp HHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3444555444333333
No 149
>PRK10617 cytochrome c-type protein NapC; Provisional
Probab=48.94 E-value=68 Score=23.36 Aligned_cols=9 Identities=0% Similarity=0.183 Sum_probs=4.3
Q ss_pred HHHHHHHHH
Q psy6549 95 MTLSVMLYL 103 (109)
Q Consensus 95 l~i~~viy~ 103 (109)
+++++++|.
T Consensus 34 ~~~G~~~~~ 42 (200)
T PRK10617 34 FVGGIIFWG 42 (200)
T ss_pred HHHHHHHHH
Confidence 344555554
No 150
>PF07798 DUF1640: Protein of unknown function (DUF1640); InterPro: IPR024461 This family consists of uncharacterised proteins.
Probab=48.66 E-value=92 Score=21.81 Aligned_cols=37 Identities=14% Similarity=0.198 Sum_probs=17.0
Q ss_pred HHHHHHHHHHHHHH----HhhHHHHHHhhhHhhhHHHHHHH
Q psy6549 32 EAEAIGTGVISELG----VQREALVRARDRLVDTDLHVSRS 68 (109)
Q Consensus 32 ete~iG~~il~~L~----~Qre~L~~~~~~~~~i~~~l~~a 68 (109)
|...+=.++--++. +=|+-......++.+++..++.-
T Consensus 99 ei~~l~a~~klD~n~eK~~~r~e~~~~~~ki~e~~~ki~~e 139 (177)
T PF07798_consen 99 EINKLRAEVKLDLNLEKGRIREEQAKQELKIQELNNKIDTE 139 (177)
T ss_pred HHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44444444433333 23344455555555555554443
No 151
>PF14960 ATP_synth_reg: ATP synthase regulation
Probab=48.04 E-value=29 Score=19.75 Aligned_cols=11 Identities=36% Similarity=0.932 Sum_probs=8.1
Q ss_pred HHHHHHHhccC
Q psy6549 99 VMLYLKIFKKR 109 (109)
Q Consensus 99 ~viy~k~~~~~ 109 (109)
+++|+|+.+||
T Consensus 39 i~~~~k~~~kk 49 (49)
T PF14960_consen 39 IILYFKLRRKK 49 (49)
T ss_pred HHHHHhcccCC
Confidence 35777888876
No 152
>PF13396 PLDc_N: Phospholipase_D-nuclease N-terminal
Probab=48.02 E-value=11 Score=20.38 Aligned_cols=24 Identities=21% Similarity=0.325 Sum_probs=15.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhc
Q psy6549 84 LFLIMIIVIESMTLSVMLYLKIFK 107 (109)
Q Consensus 84 ~il~~ii~~l~l~i~~viy~k~~~ 107 (109)
-+.|.++++++=.++.++|+-+-|
T Consensus 22 k~~W~~~i~~~P~iG~i~Yl~~gr 45 (46)
T PF13396_consen 22 KILWLIVILFFPIIGPILYLIFGR 45 (46)
T ss_pred hhHHHHHHHHHHHHHHhheEEEeC
Confidence 356666666667778777765544
No 153
>PTZ00208 65 kDa invariant surface glycoprotein; Provisional
Probab=47.78 E-value=47 Score=27.05 Aligned_cols=34 Identities=9% Similarity=0.317 Sum_probs=18.7
Q ss_pred HHHHHHhhHHHHHHHHH-HHHHHHHHHHHHHHhcc
Q psy6549 75 IRRRVFYNKLFLIMIIV-IESMTLSVMLYLKIFKK 108 (109)
Q Consensus 75 m~rr~~~~K~il~~ii~-~l~l~i~~viy~k~~~~ 108 (109)
..+|.-.-++|+.++.+ +.+++|+.+.||-|+|+
T Consensus 378 ~~~~~~~~~~i~~avl~p~~il~~~~~~~~~~v~r 412 (436)
T PTZ00208 378 VSSRHQRTAMIILAVLVPAIILAIIAVAFFIMVKR 412 (436)
T ss_pred HHHhhhhhHHHHHHHHHHHHHHHHHHHHhheeeee
Confidence 44555556666655544 33344555556666654
No 154
>PF01708 Gemini_mov: Geminivirus putative movement protein ; InterPro: IPR002621 This family consists of putative movement proteins from Maize streak virus and Wheat dwarf virus [].; GO: 0046740 spread of virus in host, cell to cell, 0016021 integral to membrane
Probab=47.50 E-value=7.1 Score=25.08 Aligned_cols=22 Identities=14% Similarity=0.202 Sum_probs=14.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHhc
Q psy6549 86 LIMIIVIESMTLSVMLYLKIFK 107 (109)
Q Consensus 86 l~~ii~~l~l~i~~viy~k~~~ 107 (109)
.++|++++.+++++..|.+|.|
T Consensus 39 ~v~i~~lvaVg~~YL~y~~fLk 60 (91)
T PF01708_consen 39 EVAIFTLVAVGCLYLAYTWFLK 60 (91)
T ss_pred eeeehHHHHHHHHHHHHHHHHH
Confidence 3455555566677778877765
No 155
>PRK05529 cell division protein FtsQ; Provisional
Probab=47.38 E-value=18 Score=27.08 Aligned_cols=22 Identities=9% Similarity=0.120 Sum_probs=14.1
Q ss_pred HHHHHHHHHhhHHHHHHHHHHH
Q psy6549 72 IQAIRRRVFYNKLFLIMIIVIE 93 (109)
Q Consensus 72 l~~m~rr~~~~K~il~~ii~~l 93 (109)
.+.+.+|..+.+.++.+.+.++
T Consensus 24 ~~~~~~~~~~r~~~~~~~~~~~ 45 (255)
T PRK05529 24 VRRFTTRIRRRFILLACAVGAV 45 (255)
T ss_pred hhchhhhccchhhhHHHHHHHH
Confidence 6777777777666666555433
No 156
>PF07835 COX4_pro_2: Bacterial aa3 type cytochrome c oxidase subunit IV; InterPro: IPR012422 Bacterial cytochrome c oxidase is found bound to the to the cell membrane, where it is involved in the generation of the transmembrane proton electrochemical gradient. It is composed of four subunits. Subunit IV consists of one transmembrane helix that does not interact directly with the other subunits, but maintains its position by indirect contacts via phospholipid molecules found in the structure. The function of subunit IV is as yet unknown []. ; PDB: 1QLE_D 1M57_J 1M56_J.
Probab=47.37 E-value=28 Score=19.21 Aligned_cols=15 Identities=20% Similarity=0.095 Sum_probs=7.1
Q ss_pred HHHHHHHHHHHHHHH
Q psy6549 84 LFLIMIIVIESMTLS 98 (109)
Q Consensus 84 ~il~~ii~~l~l~i~ 98 (109)
+..|+.+++.+++++
T Consensus 24 ~~k~~~~~~~~~li~ 38 (44)
T PF07835_consen 24 LTKWGTIAIAAILIF 38 (44)
T ss_dssp HHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHH
Confidence 445555555444333
No 157
>cd01324 cbb3_Oxidase_CcoQ Cytochrome cbb oxidase CcoQ. Cytochrome cbb3 oxidase, the terminal oxidase in the respiratory chains of proteobacteria, is a multi-chain transmembrane protein located in the cell membrane. Like other cytochrome oxidases, it catalyzes the reduction of O2 and simultaneously pumps protons across the membrane. Found exclusively in proteobacteria, cbb3 is believed to be a modern enzyme that has evolved independently to perform a specialized function in microaerobic energy metabolism. The cbb3 operon contains four genes (ccoNOQP or fixNOQP), with ccoN coding for subunit I. Instead of a CuA-containing subunit II analogous to other cytochrome oxidases, cbb3 utilizes subunits ccoO and ccoP, which contain one and two hemes, respectively, to transfer electrons to the binuclear center. ccoQ, the fourth subunit, is a single transmembrane helix protein. It has been shown to protect the core complex from proteolytic degradation by serine proteases. See cd00919, cd01322
Probab=47.03 E-value=35 Score=19.08 Aligned_cols=14 Identities=14% Similarity=0.021 Sum_probs=6.2
Q ss_pred HHHHHHHHHHHHhc
Q psy6549 94 SMTLSVMLYLKIFK 107 (109)
Q Consensus 94 ~l~i~~viy~k~~~ 107 (109)
++.+++++|.-+.+
T Consensus 21 ~~Figiv~wa~~p~ 34 (48)
T cd01324 21 LFFLGVVVWAFRPG 34 (48)
T ss_pred HHHHHHHHHHhCCC
Confidence 34444455544433
No 158
>TIGR03142 cytochro_ccmI cytochrome c-type biogenesis protein CcmI. This TPR repeat-containing protein is the CcmI protein (also called CycH) of c-type cytochrome biogenesis. CcmI is thought to act as an apo-cytochrome c chaperone. This model describes the N-terminal region of the protein, Members of this protein family
Probab=46.99 E-value=79 Score=20.61 Aligned_cols=16 Identities=38% Similarity=0.567 Sum_probs=9.1
Q ss_pred HHHHHHHHHHHHHHHH
Q psy6549 90 IVIESMTLSVMLYLKI 105 (109)
Q Consensus 90 i~~l~l~i~~viy~k~ 105 (109)
+++++.++.+.+|+++
T Consensus 99 ~~~~lp~~a~~lY~~l 114 (117)
T TIGR03142 99 VVLLLPVLALGLYLKL 114 (117)
T ss_pred HHHHHHHHHHHHHHHc
Confidence 3344455666677764
No 159
>KOG1693|consensus
Probab=46.87 E-value=1.2e+02 Score=22.46 Aligned_cols=23 Identities=26% Similarity=0.495 Sum_probs=13.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHh
Q psy6549 84 LFLIMIIVIESMTLSVMLYLKIF 106 (109)
Q Consensus 84 ~il~~ii~~l~l~i~~viy~k~~ 106 (109)
|-++.+++++++.++=|+.+|+|
T Consensus 178 ~Sl~e~~~vv~iSi~Qv~ilk~f 200 (209)
T KOG1693|consen 178 WSLLEIIAVVVISIAQVFILKFF 200 (209)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 33445555555666666666655
No 160
>COG4736 CcoQ Cbb3-type cytochrome oxidase, subunit 3 [Posttranslational modification, protein turnover, chaperones]
Probab=46.17 E-value=52 Score=19.46 Aligned_cols=19 Identities=11% Similarity=0.181 Sum_probs=7.4
Q ss_pred HHHHHHHHHHHHHHHHHHHh
Q psy6549 87 IMIIVIESMTLSVMLYLKIF 106 (109)
Q Consensus 87 ~~ii~~l~l~i~~viy~k~~ 106 (109)
++.+.+.++.++ ++|+-+.
T Consensus 13 ~~t~~~~l~fia-vi~~ayr 31 (60)
T COG4736 13 WGTIAFTLFFIA-VIYFAYR 31 (60)
T ss_pred HHHHHHHHHHHH-HHHHHhc
Confidence 344444433333 4444433
No 161
>PF06459 RR_TM4-6: Ryanodine Receptor TM 4-6; InterPro: IPR009460 The release of Ca2+ ions from intracellular stores is a key step in a wide variety of cellular functions. In striated muscle, the release of Ca2+ from the sarcoplasmic reticulum (SR) leads to muscle contraction. Ca2+ release occurs through large, high-conductance Ca2+ release channels, also known as ryanodine receptors (RyRs) because they bind the plant alkaloid ryanodine with high affinity and specificity []. This region covers TM regions 4-6 of the ryanodine receptor 1 family.; GO: 0005219 ryanodine-sensitive calcium-release channel activity, 0006874 cellular calcium ion homeostasis, 0016021 integral to membrane
Probab=46.01 E-value=65 Score=24.62 Aligned_cols=44 Identities=7% Similarity=0.182 Sum_probs=28.2
Q ss_pred hhHHHHHHH-HHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh
Q psy6549 60 DTDLHVSRS-RRMIQAIRRRVFYNKLFLIMIIVIESMTLSVMLYLKIF 106 (109)
Q Consensus 60 ~i~~~l~~a-~~~l~~m~rr~~~~K~il~~ii~~l~l~i~~viy~k~~ 106 (109)
.+...+..- .++|+=++|.-+.=|++-.++. +.++++++|||..
T Consensus 150 ~~~~~l~~y~~k~lnylARNFYNlr~lALflA---FaINFILLFYKVs 194 (274)
T PF06459_consen 150 QFWKELEVYRTKFLNYLARNFYNLRFLALFLA---FAINFILLFYKVS 194 (274)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHhc
Confidence 344445553 3677777777777666554444 4568888999864
No 162
>PF02419 PsbL: PsbL protein; InterPro: IPR003372 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection []. This family represents the low molecular weight transmembrane protein PsbL found in PSII. PsbL is located in a gene cluster with PsbE, PsbF and PsbJ (PsbEFJL). Both PsbL and PsbJ (IPR002682 from INTERPRO) are essential for proper assembly of the OEC. Mutations in PsbL prevent the formation of both PSII core dimers and PSII-light harvesting complex []. In addition, both PsbL and PsbJ are involved in the unidirectional flow of electrons, where PsbJ regulates the forward electron flow from D2 (Qa) to the plastoquinone pool, and PsbL prevents the reduction of PSII by back electron flow from plastoquinol protecting PSII from photo-inactivation [].; GO: 0015979 photosynthesis, 0009523 photosystem II, 0009539 photosystem II reaction center, 0016020 membrane; PDB: 3A0H_L 3A0B_l 3ARC_l 1S5L_l 2AXT_l 3BZ2_L 4FBY_L 3PRQ_L 3PRR_L 3KZI_L ....
Probab=45.49 E-value=40 Score=17.99 Aligned_cols=19 Identities=11% Similarity=0.224 Sum_probs=11.3
Q ss_pred HHHHHHHHHHHHHHHHHHH
Q psy6549 85 FLIMIIVIESMTLSVMLYL 103 (109)
Q Consensus 85 il~~ii~~l~l~i~~viy~ 103 (109)
.-|+.++++++++.+--|+
T Consensus 17 LY~GLllifvl~vLFssyf 35 (37)
T PF02419_consen 17 LYWGLLLIFVLAVLFSSYF 35 (37)
T ss_dssp HHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHhhhhh
Confidence 3456666666666655554
No 163
>PF01102 Glycophorin_A: Glycophorin A; InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=45.36 E-value=39 Score=22.76 Aligned_cols=22 Identities=18% Similarity=0.039 Sum_probs=9.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhc
Q psy6549 85 FLIMIIVIESMTLSVMLYLKIFK 107 (109)
Q Consensus 85 il~~ii~~l~l~i~~viy~k~~~ 107 (109)
+++|+|++. ++|.|++....+|
T Consensus 73 v~aGvIg~I-lli~y~irR~~Kk 94 (122)
T PF01102_consen 73 VMAGVIGII-LLISYCIRRLRKK 94 (122)
T ss_dssp HHHHHHHHH-HHHHHHHHHHS--
T ss_pred HHHHHHHHH-HHHHHHHHHHhcc
Confidence 344444333 3455555555443
No 164
>PF09451 ATG27: Autophagy-related protein 27; InterPro: IPR018939 Autophagy is a degradative transport pathway that delivers cytosolic proteins to the lysosome (vacuole) [] and is induced by starvation []. Cytosolic proteins appear inside the vacuole enclosed in autophagic vesicles. Autophagy significantly differs from other transport pathways by using double membrane layered transport intermediates, called autophagosomes [, ]. The breakdown of vesicular transport intermediates is a unique feature of autophagy []. Autophagy can also function in the elimination of invading bacteria and antigens []. There are more than 25 AuTophaGy-related (ATG) genes that are essential for autophagy, although it is still not known how the autophagosome is made. Atg9 is a potential membrane carrier to deliver lipids that are used to form the vesicle. Atg27 is another transmembrane protein, and is a cycling protein []. It acts as an effector of VPS34 phosphatidylinositol 3-phosphate kinase signalling and regulates the cytoplasm to vacuole transport (Cvt) vesicle formation. It is also required for autophagy-dependent cycling of ATG9.
Probab=45.19 E-value=26 Score=26.33 Aligned_cols=19 Identities=11% Similarity=0.411 Sum_probs=9.0
Q ss_pred HHHHHHHHHHHHHHHHHHH
Q psy6549 84 LFLIMIIVIESMTLSVMLY 102 (109)
Q Consensus 84 ~il~~ii~~l~l~i~~viy 102 (109)
|+.|++|++++.+++++++
T Consensus 202 ~f~wl~i~~~l~~~~Y~i~ 220 (268)
T PF09451_consen 202 FFTWLFIILFLFLAAYLIF 220 (268)
T ss_pred HHHHHHHHHHHHHHHHhhh
Confidence 4455555554444444443
No 165
>PF14362 DUF4407: Domain of unknown function (DUF4407)
Probab=45.12 E-value=1.3e+02 Score=22.60 Aligned_cols=25 Identities=12% Similarity=0.147 Sum_probs=17.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhc
Q psy6549 83 KLFLIMIIVIESMTLSVMLYLKIFK 107 (109)
Q Consensus 83 K~il~~ii~~l~l~i~~viy~k~~~ 107 (109)
.++-++++++++.+=++.+++|++-
T Consensus 262 ~~~~~~i~llfi~iel~Pv~~Kl~~ 286 (301)
T PF14362_consen 262 LLASLFIFLLFIAIELLPVLFKLLS 286 (301)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 3444666666667777889988764
No 166
>PHA02947 S-S bond formation pathway protein; Provisional
Probab=44.71 E-value=34 Score=25.33 Aligned_cols=10 Identities=10% Similarity=0.580 Sum_probs=6.8
Q ss_pred HHHHHHHHHH
Q psy6549 70 RMIQAIRRRV 79 (109)
Q Consensus 70 ~~l~~m~rr~ 79 (109)
.+++.+.+|.
T Consensus 154 ~I~~aL~kr~ 163 (215)
T PHA02947 154 TIVKSLNKRY 163 (215)
T ss_pred HHHHHHhccc
Confidence 6677777773
No 167
>PF02706 Wzz: Chain length determinant protein; InterPro: IPR003856 A number of related proteins are involved in the synthesis of lipopolysaccharide, O-antigen polysaccharide, capsule polysaccharide and exopolysaccharides. Chain length determinant protein (or wzz protein) is involved in lipopolysaccharide (lps) biosynthesis, conferring a modal distribution of chain length on the O-antigen component of lps []. It gives rise to a reduced number of short chain molecules and increases in numbers of longer molecules, with a modal value of 20. The MPA/MPA2 proteins function in CPS and EPS polymerisation and export [].; GO: 0009103 lipopolysaccharide biosynthetic process, 0016020 membrane; PDB: 4E2H_C 3B8P_A 4E2C_B 4E29_A 3B8O_G 4E2L_I 3B8N_D 3B8M_C.
Probab=44.49 E-value=7.4 Score=25.64 Aligned_cols=22 Identities=14% Similarity=0.422 Sum_probs=0.0
Q ss_pred HHHHHHHHHhhHHHHHHHHHHH
Q psy6549 72 IQAIRRRVFYNKLFLIMIIVIE 93 (109)
Q Consensus 72 l~~m~rr~~~~K~il~~ii~~l 93 (109)
|+++.+-..++|++++++++++
T Consensus 5 L~~l~~~l~r~~~~i~~~~~l~ 26 (152)
T PF02706_consen 5 LRDLLRILWRRKWLIIIVTLLF 26 (152)
T ss_dssp ----------------------
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 4555566666766655555444
No 168
>KOG3287|consensus
Probab=44.34 E-value=1.3e+02 Score=22.48 Aligned_cols=35 Identities=11% Similarity=0.097 Sum_probs=18.6
Q ss_pred HHHHHhhhHhhhHHHHHHHHHHHHHHHHHHHhhHH
Q psy6549 50 ALVRARDRLVDTDLHVSRSRRMIQAIRRRVFYNKL 84 (109)
Q Consensus 50 ~L~~~~~~~~~i~~~l~~a~~~l~~m~rr~~~~K~ 84 (109)
+++.+.+.+..+..++..+.+.-.-..-|.++|+.
T Consensus 154 kl~di~~~i~~i~~nl~k~~~~q~~lRa~EaRDr~ 188 (236)
T KOG3287|consen 154 KLDDIEDSIGTIKNNLNKMWQYQALLRAREARDRN 188 (236)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHH
Confidence 45555556666666666666544443344444443
No 169
>PF06008 Laminin_I: Laminin Domain I; InterPro: IPR009254 Laminins are glycoproteins that are major constituents of the basement membrane of cells. Laminins are trimeric molecules; laminin-1 is an alpha1 beta1 gamma1 trimer. It has been suggested that the domains I and II from laminin A, B1 and B2 may come together to form a triple helical coiled-coil structure []. Binding to cells via a high affinity receptor, laminin is thought to mediate the attachment, migration and organisation of cells into tissues during embryonic development by interacting with other extracellular matrix components.; GO: 0005102 receptor binding, 0030155 regulation of cell adhesion, 0030334 regulation of cell migration, 0045995 regulation of embryonic development, 0005606 laminin-1 complex
Probab=43.59 E-value=1.3e+02 Score=22.22 Aligned_cols=57 Identities=7% Similarity=-0.039 Sum_probs=28.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHH
Q psy6549 14 QALVRTGESIVRSEQAALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSRR 70 (109)
Q Consensus 14 ~~l~~t~~~L~~s~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~ 70 (109)
+.+++-+..|.+....+.++.....++-.-...-+..+....++..++...-..+..
T Consensus 185 ~~L~~~~~kL~Dl~~~l~eA~~~~~ea~~ln~~n~~~l~~~~~k~~~l~~~~~~~~~ 241 (264)
T PF06008_consen 185 DDLNDYNAKLQDLRDLLNEAQNKTREAEDLNRANQKNLEDLEKKKQELSEQQNEVSE 241 (264)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344455555555555555555555554444444444455444444444444443333
No 170
>PHA03395 p10 fibrous body protein; Provisional
Probab=43.41 E-value=86 Score=19.98 Aligned_cols=27 Identities=11% Similarity=0.059 Sum_probs=21.2
Q ss_pred HHHHHHHHhhHHHHHHhhhHhhhHHHH
Q psy6549 39 GVISELGVQREALVRARDRLVDTDLHV 65 (109)
Q Consensus 39 ~il~~L~~Qre~L~~~~~~~~~i~~~l 65 (109)
++-+.|..|.++|.....++++|.+-+
T Consensus 39 ~l~~kLdaq~~~Ltti~tkv~~I~diL 65 (87)
T PHA03395 39 EINEKLDAQSASLDTISSAVDNITDIL 65 (87)
T ss_pred HHHHHHHhHHHHHHHHHHHHHHHHHcc
Confidence 344568899999998888888887654
No 171
>PF07889 DUF1664: Protein of unknown function (DUF1664); InterPro: IPR012458 The members of this family are hypothetical plant proteins of unknown function. The region featured in this family is approximately 100 amino acids long.
Probab=43.27 E-value=1e+02 Score=20.84 Aligned_cols=36 Identities=17% Similarity=0.287 Sum_probs=20.4
Q ss_pred HHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHHHHH
Q psy6549 42 SELGVQREALVRARDRLVDTDLHVSRSRRMIQAIRR 77 (109)
Q Consensus 42 ~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~m~r 77 (109)
..|.+|.|....+.+.|.++..+++..+.-+..+..
T Consensus 75 ~klDe~~ei~~~i~~eV~~v~~dv~~i~~dv~~v~~ 110 (126)
T PF07889_consen 75 DKLDEQKEISKQIKDEVTEVREDVSQIGDDVDSVQQ 110 (126)
T ss_pred hhHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHH
Confidence 345566666666666666666665555555544443
No 172
>PRK11901 hypothetical protein; Reviewed
Probab=43.23 E-value=30 Score=27.22 Aligned_cols=14 Identities=7% Similarity=0.071 Sum_probs=5.7
Q ss_pred HHHHHHHHHHHHHH
Q psy6549 86 LIMIIVIESMTLSV 99 (109)
Q Consensus 86 l~~ii~~l~l~i~~ 99 (109)
+.+.|+||+++|+.
T Consensus 40 iGiGilVLlLLIi~ 53 (327)
T PRK11901 40 IGIGILVLLLLIIA 53 (327)
T ss_pred HHHHHHHHHHHHHH
Confidence 33334444444443
No 173
>PF10140 YukC: WXG100 protein secretion system (Wss), protein YukC; InterPro: IPR018778 Members of this family are associated with type VII secretion of WXG100 family targets in the Firmicutes, but not in the Actinobacteria. This protein is designated YukC in Bacillus subtilis and EssB is Staphylococcus aureus. ; PDB: 4ANN_A.
Probab=43.22 E-value=12 Score=29.71 Aligned_cols=38 Identities=18% Similarity=0.175 Sum_probs=0.0
Q ss_pred HHHHHHHHH-HHhhHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy6549 70 RMIQAIRRR-VFYNKLFLIMIIVIESMTLSVMLYLKIFK 107 (109)
Q Consensus 70 ~~l~~m~rr-~~~~K~il~~ii~~l~l~i~~viy~k~~~ 107 (109)
+.+....++ -..-|++.++.++++++++++++|+-|+.
T Consensus 181 ~~~~~V~Kk~~k~~K~~~i~l~~l~v~l~~~~~Y~~f~~ 219 (359)
T PF10140_consen 181 KTYILVPKKKWKIFKYASIGLSILLVLLLIPLGYLYFFK 219 (359)
T ss_dssp ---------------------------------------
T ss_pred hhheEecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc
Confidence 333344443 56677888888888888888888887774
No 174
>PHA02662 ORF131 putative membrane protein; Provisional
Probab=43.17 E-value=42 Score=25.04 Aligned_cols=31 Identities=19% Similarity=0.417 Sum_probs=16.2
Q ss_pred HHHHHHHHHH------Hhh----HHHHHHHHH-HHHHHHHHH
Q psy6549 70 RMIQAIRRRV------FYN----KLFLIMIIV-IESMTLSVM 100 (109)
Q Consensus 70 ~~l~~m~rr~------~~~----K~il~~ii~-~l~l~i~~v 100 (109)
.+++.+.+|. +.+ +|++.++++ +++++++.+
T Consensus 163 ~I~~~ltkr~~~nr~~~~~~~~~~W~i~~~v~~i~~i~vv~i 204 (226)
T PHA02662 163 RVATALTRRVPASRHGLAEGGTPPWTLLLAVAAVTVLGVVAV 204 (226)
T ss_pred HHHHHHhhhccccccccccCCCCcchhHHHHHHHHHHHHHHH
Confidence 5677777775 233 444444433 554555444
No 175
>TIGR03521 GldG gliding-associated putative ABC transporter substrate-binding component GldG. Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldG is a protein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae and Cytophaga hutchinsonii. Knockouts of GldG abolish the gliding phenotype. GldG, along with GldA and GldF are believed to compose an ABC transporter and are observed as an operon. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility.
Probab=43.17 E-value=52 Score=27.33 Aligned_cols=26 Identities=8% Similarity=-0.031 Sum_probs=13.3
Q ss_pred HhhHHHHHHHHHHHHHHHHHHHHHHH
Q psy6549 80 FYNKLFLIMIIVIESMTLSVMLYLKI 105 (109)
Q Consensus 80 ~~~K~il~~ii~~l~l~i~~viy~k~ 105 (109)
..-+++.++++.++++++++++|++-
T Consensus 522 ~~~~~~~i~~pp~~~l~~G~~~~~~R 547 (552)
T TIGR03521 522 TTWQLINIGLPILLLLLFGLSFTYIR 547 (552)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33344445555455555665555443
No 176
>PF08372 PRT_C: Plant phosphoribosyltransferase C-terminal; InterPro: IPR013583 This domain is found at the C terminus of phosphoribosyltransferases and phosphoribosyltransferase-like proteins. It contains putative transmembrane regions. It often appears together with calcium-ion dependent C2 domains (IPR000008 from INTERPRO).
Probab=43.14 E-value=1.1e+02 Score=21.36 Aligned_cols=33 Identities=15% Similarity=0.104 Sum_probs=22.5
Q ss_pred HHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHH
Q psy6549 42 SELGVQREALVRARDRLVDTDLHVSRSRRMIQA 74 (109)
Q Consensus 42 ~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~ 74 (109)
....+=|+-..+++.-++++.+..++.+.++++
T Consensus 59 ~Rydrlr~va~rvQ~vlgd~At~gERl~allsW 91 (156)
T PF08372_consen 59 MRYDRLRSVAGRVQNVLGDVATQGERLQALLSW 91 (156)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc
Confidence 445555666667777777777777777777766
No 177
>PLN03160 uncharacterized protein; Provisional
Probab=43.07 E-value=14 Score=27.04 Aligned_cols=32 Identities=19% Similarity=0.147 Sum_probs=15.2
Q ss_pred HHHHhhHHHHHHHHHHHH-HHHHHHHHHHHhcc
Q psy6549 77 RRVFYNKLFLIMIIVIES-MTLSVMLYLKIFKK 108 (109)
Q Consensus 77 rr~~~~K~il~~ii~~l~-l~i~~viy~k~~~~ 108 (109)
||-...+|+-|++.++++ ++++++++|-++|.
T Consensus 32 ~r~~~~~c~~~~~a~~l~l~~v~~~l~~~vfrP 64 (219)
T PLN03160 32 RRRNCIKCCGCITATLLILATTILVLVFTVFRV 64 (219)
T ss_pred ccccceEEHHHHHHHHHHHHHHHHheeeEEEEc
Confidence 344455565555444443 33444445545544
No 178
>PF13800 Sigma_reg_N: Sigma factor regulator N-terminal
Probab=42.99 E-value=65 Score=20.10 Aligned_cols=12 Identities=8% Similarity=0.022 Sum_probs=5.0
Q ss_pred HHHHHHhhHHHH
Q psy6549 75 IRRRVFYNKLFL 86 (109)
Q Consensus 75 m~rr~~~~K~il 86 (109)
+.||.-..-.+-
T Consensus 4 i~kK~K~k~~l~ 15 (96)
T PF13800_consen 4 ILKKAKRKSRLR 15 (96)
T ss_pred HHHHHHHHHHHH
Confidence 444443333333
No 179
>KOG4433|consensus
Probab=42.92 E-value=2e+02 Score=24.10 Aligned_cols=24 Identities=8% Similarity=0.292 Sum_probs=12.0
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHH
Q psy6549 82 NKLFLIMIIVIESMTLSVMLYLKI 105 (109)
Q Consensus 82 ~K~il~~ii~~l~l~i~~viy~k~ 105 (109)
-||..+.+.+.+.+.++++...-+
T Consensus 208 ~RW~~~v~lL~l~LvvC~v~vlgl 231 (526)
T KOG4433|consen 208 YRWLAYVLLLTLLLVVCLVLVLGL 231 (526)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 355555555555455555444433
No 180
>PF05337 CSF-1: Macrophage colony stimulating factor-1 (CSF-1); InterPro: IPR008001 Colony stimulating factor 1 (CSF-1) is a homodimeric polypeptide growth factor whose primary function is to regulate the survival, proliferation, differentiation, and function of cells of the mononuclear phagocytic lineage. This lineage includes mononuclear phagocytic precursors, blood monocytes, tissue macrophages, osteoclasts, and microglia of the brain, all of which possess cell surface receptors for CSF-1. The protein has also been linked with male fertility [] and mutations in the Csf-1 gene have been found to cause osteopetrosis and failure of tooth eruption [].; GO: 0005125 cytokine activity, 0008083 growth factor activity, 0016021 integral to membrane; PDB: 3EJJ_A.
Probab=42.27 E-value=8.4 Score=29.63 Aligned_cols=23 Identities=13% Similarity=0.371 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHhcc
Q psy6549 86 LIMIIVIESMTLSVMLYLKIFKK 108 (109)
Q Consensus 86 l~~ii~~l~l~i~~viy~k~~~~ 108 (109)
++-.|++++++++.++||++.++
T Consensus 231 LVPSiILVLLaVGGLLfYr~rrR 253 (285)
T PF05337_consen 231 LVPSIILVLLAVGGLLFYRRRRR 253 (285)
T ss_dssp -----------------------
T ss_pred cccchhhhhhhccceeeeccccc
Confidence 34444445578889999998875
No 181
>PF09815 XK-related: XK-related protein; InterPro: IPR018629 This entry is represented by the multipass membrane protein XK, which may be involved in sodium-dependent transport of neutral amino acids or oligopeptides. It forms a heterodimer with Kell. In humans, Kell is an 93kDa type II membrane glycoprotein with endothelin-3-converting enzyme activity that is linked by a single disulphide bond to XK, that spans the membrane ten times. An absence of XK leads to clinical symptoms termed the McLeod syndrome [MIM:314850], an X-linked multi-system disorder characterised by late onset abnormalities in the neuromuscular and hematopoietic systems [, ].
Probab=41.96 E-value=40 Score=25.96 Aligned_cols=21 Identities=14% Similarity=0.164 Sum_probs=14.2
Q ss_pred HHHHHHHHHHHHHHHHHhccC
Q psy6549 89 IIVIESMTLSVMLYLKIFKKR 109 (109)
Q Consensus 89 ii~~l~l~i~~viy~k~~~~~ 109 (109)
++..++.+...++||+++|++
T Consensus 312 ~~~~~lGi~~m~~YY~~~HP~ 332 (332)
T PF09815_consen 312 LGGFLLGIAFMLLYYRFFHPN 332 (332)
T ss_pred HHHHHHHHHHHHHHHhhcCCC
Confidence 333444556667999999875
No 182
>PF15168 TRIQK: Triple QxxK/R motif-containing protein family
Probab=41.76 E-value=67 Score=20.02 Aligned_cols=21 Identities=19% Similarity=0.450 Sum_probs=11.4
Q ss_pred HHHHHHHHHHHHHHHHHHHhc
Q psy6549 87 IMIIVIESMTLSVMLYLKIFK 107 (109)
Q Consensus 87 ~~ii~~l~l~i~~viy~k~~~ 107 (109)
.+.|+++++++...+|+.|.+
T Consensus 55 l~ail~lL~a~Ya~fyl~ls~ 75 (79)
T PF15168_consen 55 LAAILVLLLAFYAFFYLNLSK 75 (79)
T ss_pred HHHHHHHHHHHHHHHHHhhcc
Confidence 344444445566666766643
No 183
>PRK06287 cobalt transport protein CbiN; Validated
Probab=41.62 E-value=49 Score=21.61 Aligned_cols=25 Identities=28% Similarity=0.141 Sum_probs=12.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcc
Q psy6549 84 LFLIMIIVIESMTLSVMLYLKIFKK 108 (109)
Q Consensus 84 ~il~~ii~~l~l~i~~viy~k~~~~ 108 (109)
.++.+++.++++++++....++.+|
T Consensus 79 ~ilsgiiGv~i~l~l~~~~~~~l~r 103 (107)
T PRK06287 79 EIIAMVIGTLLVLALAYGVGKIFKK 103 (107)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 4555566555444444444444443
No 184
>PF14914 LRRC37AB_C: LRRC37A/B like protein 1 C-terminal domain
Probab=40.77 E-value=42 Score=23.54 Aligned_cols=18 Identities=22% Similarity=0.314 Sum_probs=10.8
Q ss_pred HHHHHhhHHHHHHHHHHH
Q psy6549 76 RRRVFYNKLFLIMIIVIE 93 (109)
Q Consensus 76 ~rr~~~~K~il~~ii~~l 93 (109)
-.--+.||+|+...+.+.
T Consensus 113 p~~gY~nklilaisvtvv 130 (154)
T PF14914_consen 113 PGYGYNNKLILAISVTVV 130 (154)
T ss_pred ccccccchhHHHHHHHHH
Confidence 334567888876555444
No 185
>PF03408 Foamy_virus_ENV: Foamy virus envelope protein ; InterPro: IPR005070 Expression of the envelope (Env) glycoprotein is essential for viral particle egress. This feature is unique to the Spumavirinae, a subclass of the Retroviridae. ; GO: 0019031 viral envelope
Probab=40.73 E-value=1.7e+02 Score=26.26 Aligned_cols=68 Identities=19% Similarity=0.143 Sum_probs=39.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHH
Q psy6549 25 RSEQAALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSRRMIQAIRRRVFYNKLFLIMIIVIESMTLSVMLYLK 104 (109)
Q Consensus 25 ~s~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~m~rr~~~~K~il~~ii~~l~l~i~~viy~k 104 (109)
=.+.++..|+++.-.+.+.+..=.+=|.++-+.+-++.- .+| .-=|=|++|+++++++++++=|..+
T Consensus 907 WikqL~~At~DvWPaaA~~~~~iGnfL~~ta~giFGtaf------s~L-------gY~KPiliGi~ii~liili~kIisW 973 (981)
T PF03408_consen 907 WIKQLASATKDVWPAAASFLSGIGNFLSGTAGGIFGTAF------SIL-------GYAKPILIGIVIILLIILIFKIISW 973 (981)
T ss_pred HHHHHHHHhhhhhHHHHHHHHHHHHHhhhccccccchHH------HHH-------hhhhhHHHHHHHHHHHHHHHHHHHh
Confidence 356788889999999888887766666666554443321 111 1224556666665555444444443
Q ss_pred H
Q psy6549 105 I 105 (109)
Q Consensus 105 ~ 105 (109)
+
T Consensus 974 l 974 (981)
T PF03408_consen 974 L 974 (981)
T ss_pred c
Confidence 3
No 186
>PF13997 YqjK: YqjK-like protein
Probab=40.08 E-value=86 Score=19.01 Aligned_cols=37 Identities=16% Similarity=0.333 Sum_probs=31.0
Q ss_pred HHHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHHHHH
Q psy6549 41 ISELGVQREALVRARDRLVDTDLHVSRSRRMIQAIRR 77 (109)
Q Consensus 41 l~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~m~r 77 (109)
+.+..+||..|-...+...+.-+-.|++-..+..+.+
T Consensus 2 l~qi~qQR~~La~~~~~w~~~ta~~Dr~w~~l~~lr~ 38 (73)
T PF13997_consen 2 LRQIQQQRLDLAANAEPWLEATAPYDRGWQTLRSLRR 38 (73)
T ss_pred hHHHHHHHHHHHHHHHHHHHHhHHHhhHHHHHHHHHH
Confidence 4567899999999999999999999998888875544
No 187
>PF00558 Vpu: Vpu protein; InterPro: IPR008187 The Human immunodeficiency virus 1 (HIV-1) Vpu protein acts in the degradation of CD4 in the endoplasmic reticulum and in the enhancement of virion release from the plasma membrane of infected cells [].; GO: 0019076 release of virus from host; PDB: 2JPX_A 1PI8_A 2GOH_A 2GOF_A 1PI7_A 1PJE_A 1VPU_A 2K7Y_A.
Probab=39.95 E-value=36 Score=21.41 Aligned_cols=19 Identities=37% Similarity=0.410 Sum_probs=4.4
Q ss_pred HHHHHHHHHHHHHHHHHhc
Q psy6549 89 IIVIESMTLSVMLYLKIFK 107 (109)
Q Consensus 89 ii~~l~l~i~~viy~k~~~ 107 (109)
++.+++-.++++-|.|..+
T Consensus 17 iiaIvvW~iv~ieYrk~~r 35 (81)
T PF00558_consen 17 IIAIVVWTIVYIEYRKIKR 35 (81)
T ss_dssp HHHHHHHHHH---------
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 3334444455555666554
No 188
>PF12768 Rax2: Cortical protein marker for cell polarity
Probab=39.91 E-value=38 Score=25.87 Aligned_cols=14 Identities=14% Similarity=0.237 Sum_probs=6.3
Q ss_pred HHHHHHHHHHHhcc
Q psy6549 95 MTLSVMLYLKIFKK 108 (109)
Q Consensus 95 l~i~~viy~k~~~~ 108 (109)
+.++.+++.+++||
T Consensus 244 l~l~Gii~~~~~r~ 257 (281)
T PF12768_consen 244 LVLIGIILAYIRRR 257 (281)
T ss_pred HHHHHHHHHHHHhh
Confidence 33444444444444
No 189
>COG1422 Predicted membrane protein [Function unknown]
Probab=39.85 E-value=1.5e+02 Score=21.77 Aligned_cols=35 Identities=6% Similarity=0.168 Sum_probs=24.4
Q ss_pred HHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHH
Q psy6549 36 IGTGVISELGVQREALVRARDRLVDTDLHVSRSRR 70 (109)
Q Consensus 36 iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~ 70 (109)
+=.++++.+--.+|.+++.++..++......+|++
T Consensus 59 l~~~i~~~~liD~ekm~~~qk~m~efq~e~~eA~~ 93 (201)
T COG1422 59 LYITILQKLLIDQEKMKELQKMMKEFQKEFREAQE 93 (201)
T ss_pred HHHHHHHHHhccHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33455667777777777777777777777776664
No 190
>PTZ00087 thrombosponding-related protein; Provisional
Probab=39.77 E-value=14 Score=28.56 Aligned_cols=26 Identities=19% Similarity=0.524 Sum_probs=16.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccC
Q psy6549 84 LFLIMIIVIESMTLSVMLYLKIFKKR 109 (109)
Q Consensus 84 ~il~~ii~~l~l~i~~viy~k~~~~~ 109 (109)
+.+++.=+++++.++.++|.-|.+||
T Consensus 298 i~~i~~piv~vi~v~~ily~ify~~k 323 (340)
T PTZ00087 298 ILIILLPIVLIICVMGILYHIFYKKK 323 (340)
T ss_pred EeeeehhHHHHHHHHHHHHHHhhhcc
Confidence 33444445556666777787777765
No 191
>CHL00038 psbL photosystem II protein L
Probab=39.76 E-value=60 Score=17.41 Aligned_cols=19 Identities=11% Similarity=0.209 Sum_probs=10.9
Q ss_pred HHHHHHHHHHHHHHHHHHH
Q psy6549 85 FLIMIIVIESMTLSVMLYL 103 (109)
Q Consensus 85 il~~ii~~l~l~i~~viy~ 103 (109)
+-||..+++++++.+--|+
T Consensus 18 Ly~GLLlifvl~vlfssyf 36 (38)
T CHL00038 18 LYWGLLLIFVLAVLFSNYF 36 (38)
T ss_pred HHHHHHHHHHHHHHHHHHh
Confidence 3456666666666555553
No 192
>PF15468 DUF4636: Domain of unknown function (DUF4636)
Probab=39.61 E-value=46 Score=24.87 Aligned_cols=29 Identities=7% Similarity=0.062 Sum_probs=13.8
Q ss_pred HHHhhHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy6549 78 RVFYNKLFLIMIIVIESMTLSVMLYLKIFK 107 (109)
Q Consensus 78 r~~~~K~il~~ii~~l~l~i~~viy~k~~~ 107 (109)
++..+ +++|..+++|++.++.-.++++..
T Consensus 36 ~~iG~-fLlWyfviilvLm~~~ras~Wmse 64 (243)
T PF15468_consen 36 GAIGS-FLLWYFVIILVLMFFSRASVWMSE 64 (243)
T ss_pred chhhh-HHHHHHHHHHHHHHHHHHHHHHhh
Confidence 44444 445544444545444444444443
No 193
>COG4839 FtsL Protein required for the initiation of cell division [Cell division and chromosome partitioning]
Probab=39.44 E-value=1e+02 Score=20.83 Aligned_cols=23 Identities=9% Similarity=0.197 Sum_probs=14.7
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHH
Q psy6549 82 NKLFLIMIIVIESMTLSVMLYLK 104 (109)
Q Consensus 82 ~K~il~~ii~~l~l~i~~viy~k 104 (109)
.|++...+++..++..++++|.+
T Consensus 37 EKvly~~~~va~L~vai~ii~~q 59 (120)
T COG4839 37 EKVLYTTLAVAALVVAISIISVQ 59 (120)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 77777666665555566666654
No 194
>TIGR00383 corA magnesium Mg(2+) and cobalt Co(2+) transport protein (corA). The article in Microb Comp Genomics 1998;3(3):151-69 (Medline:98448512) discusses this family and suggests that some members may have functions other than Mg2+ transport.
Probab=39.38 E-value=1.6e+02 Score=22.03 Aligned_cols=17 Identities=6% Similarity=-0.011 Sum_probs=8.5
Q ss_pred HHHHHHHHHHHHHhccC
Q psy6549 93 ESMTLSVMLYLKIFKKR 109 (109)
Q Consensus 93 l~l~i~~viy~k~~~~~ 109 (109)
++++++++.|+.|+||+
T Consensus 300 ~m~~i~~~~~~~fkrk~ 316 (318)
T TIGR00383 300 VMAVIALGPLIYFRRKG 316 (318)
T ss_pred HHHHHHHHHHHHHHHcC
Confidence 33444445555555554
No 195
>PRK15111 antimicrobial peptide ABC transporter permease SapC; Provisional
Probab=39.37 E-value=65 Score=24.37 Aligned_cols=30 Identities=10% Similarity=0.305 Sum_probs=17.8
Q ss_pred HHHHHHHHHhhHHHHHHHHHHHHHHHHHHH
Q psy6549 72 IQAIRRRVFYNKLFLIMIIVIESMTLSVML 101 (109)
Q Consensus 72 l~~m~rr~~~~K~il~~ii~~l~l~i~~vi 101 (109)
.+.+-|+..+||..+++.+++++++++.++
T Consensus 16 ~~~~~~~~~~~~~~~~~~~il~~~~~~a~~ 45 (296)
T PRK15111 16 LRTAWRKFYSDALAMVGLYGCAGLALLCLF 45 (296)
T ss_pred HHHHHHHHhhChHHHHHHHHHHHHHHHHHH
Confidence 344556777888777666555544444443
No 196
>PF01105 EMP24_GP25L: emp24/gp25L/p24 family/GOLD; InterPro: IPR009038 The GOLD (for Golgi dynamics) domain is a protein module found in several eukaryotic Golgi and lipid-traffic proteins. It is typically between 90 and 150 amino acids long. Most of the size difference observed in the GOLD-domain superfamily is traceable to a single large low-complexity insert that is seen in some versions of the domain. With the exception of the p24 proteins, which have a simple architecture with the GOLD domain as their only globular domain, all other GOLD-domain proteins contain additional conserved globular domains. In these proteins, the GOLD domain co-occurs with lipid-, sterol- or fatty acid-binding domains such as PH, CRAL-TRIO, FYVE oxysterol binding- and acyl CoA-binding domains, suggesting that these proteins may interact with membranes. The GOLD domain can also be found associated with a RUN domain, which may have a role in the interaction of various proteins with cytoskeletal filaments. The GOLD domain is predicted to mediate diverse protein-protein interactions []. A secondary structure prediction for the GOLD domain reveals that it is likely to adopt a compact all-beta-fold structure with six to seven strands. Most of the sequence conservation is centred on the hydrophobic cores that support these predicted strands. The predicted secondary-structure elements and the size of the conserved core of the domain suggests that it may form a beta- sandwich fold with the strands arranged in two beta sheets stacked on each other []. Some proteins known to contain a GOLD domain are listed below: Eukaryotic proteins of the p24 family. Animal Sec14-like proteins. They are involved in secretion. Human Golgi resident protein GCP60. It interacts with the Golgi integral membrane protein Giantin. Yeast oxysterol-binding protein homologue 3 (OSH3). ; GO: 0006810 transport, 0016021 integral to membrane; PDB: 1P23_A 1M23_A.
Probab=39.15 E-value=8.4 Score=25.96 Aligned_cols=23 Identities=13% Similarity=0.196 Sum_probs=0.0
Q ss_pred hHHHHHHhhhHhhhHHHHHHHHH
Q psy6549 48 REALVRARDRLVDTDLHVSRSRR 70 (109)
Q Consensus 48 re~L~~~~~~~~~i~~~l~~a~~ 70 (109)
.+.+..+...+..++..+.....
T Consensus 111 ~~~~~~~~~~l~~l~~~l~~i~~ 133 (183)
T PF01105_consen 111 KEHLDPLEESLEKLESNLKEIKD 133 (183)
T ss_dssp -----------------------
T ss_pred hhhhhhhHHHHHHHHHHHHHHHH
Confidence 44444455555555554444443
No 197
>PRK15348 type III secretion system lipoprotein SsaJ; Provisional
Probab=39.04 E-value=86 Score=23.65 Aligned_cols=40 Identities=3% Similarity=0.065 Sum_probs=21.8
Q ss_pred HHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcc
Q psy6549 66 SRSRRMIQAIRRRVFYNKLFLIMIIVIESMTLSVMLYLKIFKK 108 (109)
Q Consensus 66 ~~a~~~l~~m~rr~~~~K~il~~ii~~l~l~i~~viy~k~~~~ 108 (109)
....+.+++..+.. .+++.+..++++....++.|++..||
T Consensus 209 ~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~ 248 (249)
T PRK15348 209 ANKGKVVKWLMKYP---YQLMLSLTGLLLGVGILIGYFCLRRR 248 (249)
T ss_pred cchhhHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 33445567766666 34444444444455555666666555
No 198
>TIGR03503 conserved hypothetical protein TIGR03503. This set of conserved hypothetical protein has a phylogenetic range that closely matches that of TIGR03501, a putative C-terminal protein targeting signal.
Probab=39.04 E-value=22 Score=28.40 Aligned_cols=18 Identities=17% Similarity=0.180 Sum_probs=9.7
Q ss_pred HHHHHHHHHHHHHHhccC
Q psy6549 92 IESMTLSVMLYLKIFKKR 109 (109)
Q Consensus 92 ~l~l~i~~viy~k~~~~~ 109 (109)
+++++++++.|+.|.|+|
T Consensus 356 ~v~lllg~~~~~~~rk~k 373 (374)
T TIGR03503 356 VVILLLGGIGFFVWRKKK 373 (374)
T ss_pred hhhhhhheeeEEEEEEee
Confidence 344445555666666654
No 199
>PF06365 CD34_antigen: CD34/Podocalyxin family; InterPro: IPR013836 This family consists of several mammalian CD34 antigen proteins. The CD34 antigen is a human leukocyte membrane protein expressed specifically by lymphohematopoietic progenitor cells. CD34 is a phosphoprotein. Activation of protein kinase C (PKC) has been found to enhance CD34 phosphorylation [, ]. This family contains several eukaryotic podocalyxin proteins. Podocalyxin is a major membrane protein of the glomerular epithelium and is thought to be involved in maintenance of the architecture of the foot processes and filtration slits characteristic of this unique epithelium by virtue of its high negative charge. Podocalyxin functions as an anti-adhesin that maintains an open filtration pathway between neighbouring foot processes in the glomerular epithelium by charge repulsion [].
Probab=39.03 E-value=41 Score=24.68 Aligned_cols=17 Identities=12% Similarity=-0.111 Sum_probs=10.4
Q ss_pred HHHHHHHHHHHHHHhcc
Q psy6549 92 IESMTLSVMLYLKIFKK 108 (109)
Q Consensus 92 ~l~l~i~~viy~k~~~~ 108 (109)
+|+++.++.+|+.+.|+
T Consensus 112 lLla~~~~~~Y~~~~Rr 128 (202)
T PF06365_consen 112 LLLAILLGAGYCCHQRR 128 (202)
T ss_pred HHHHHHHHHHHHhhhhc
Confidence 55555555677777654
No 200
>TIGR01006 polys_exp_MPA1 polysaccharide export protein, MPA1 family, Gram-positive type. This family contains members from Low GC Gram-positive bacteria; they are proposed to have a function in the export of complex polysaccharides.
Probab=38.85 E-value=72 Score=22.83 Aligned_cols=17 Identities=12% Similarity=0.548 Sum_probs=8.9
Q ss_pred HHHHHHHHHhhHHHHHH
Q psy6549 72 IQAIRRRVFYNKLFLIM 88 (109)
Q Consensus 72 l~~m~rr~~~~K~il~~ 88 (109)
++++.+...++|++.++
T Consensus 10 l~~l~~~l~r~~~~ill 26 (226)
T TIGR01006 10 LLQLLKKLWKRKLLILI 26 (226)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 44555555555555443
No 201
>PRK15041 methyl-accepting chemotaxis protein I; Provisional
Probab=38.83 E-value=2.2e+02 Score=23.39 Aligned_cols=67 Identities=10% Similarity=0.108 Sum_probs=47.9
Q ss_pred HHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHHH
Q psy6549 9 LRETNQALVRTGESIVRSEQAALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSRRMIQAI 75 (109)
Q Consensus 9 ll~~~~~l~~t~~~L~~s~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~m 75 (109)
.-.+.+..+.+.+.+++......++.+.-.++......|......+...+.+++............+
T Consensus 428 v~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~i~~~~~eq~~~~~~i~~~i~~i~~~~~~~~~~~~~~ 494 (554)
T PRK15041 428 VDVGSTLVESAGETMAEIVSAVTRVTDIMGEIASASDEQSRGIDQVGLAVAEMDRVTQQNAALVEES 494 (554)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344555666777777777777777888888888888888888888888888877666644444433
No 202
>COG4192 Signal transduction histidine kinase regulating phosphoglycerate transport system [Signal transduction mechanisms]
Probab=38.66 E-value=2e+02 Score=24.39 Aligned_cols=56 Identities=21% Similarity=0.264 Sum_probs=38.0
Q ss_pred HHHHHHHHHHHHHHHHH---------HHHHHHHhhHHHHHHhhhHhhh-----HHHHHHHHHHHHHHHHH
Q psy6549 23 IVRSEQAALEAEAIGTG---------VISELGVQREALVRARDRLVDT-----DLHVSRSRRMIQAIRRR 78 (109)
Q Consensus 23 L~~s~~~~~ete~iG~~---------il~~L~~Qre~L~~~~~~~~~i-----~~~l~~a~~~l~~m~rr 78 (109)
.---+++.+|..++|.+ .=.++..|+|+|....+++.+. .+.++-+..-+.....+
T Consensus 258 tvtl~q~~deL~~ig~n~~~~P~~~~~rdei~aq~eqLi~~s~kv~EafRt~~~sqigns~qql~t~~~~ 327 (673)
T COG4192 258 TVTLHQLSDELDAIGHNDLSHPITVDGRDEIGAQSEQLILYSKKVEEAFRTNALSQIGNSQQQLITCDGQ 327 (673)
T ss_pred hhHHHHHHHHHHHhcCCcccCCchhhHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHhCchHHHHHHHhhH
Confidence 33456777788888763 3468999999999999998885 33444455555544433
No 203
>PF11877 DUF3397: Protein of unknown function (DUF3397); InterPro: IPR024515 This family of bacterial proteins is currently functionally uncharacterised.
Probab=38.59 E-value=1.1e+02 Score=19.92 Aligned_cols=36 Identities=11% Similarity=0.296 Sum_probs=24.8
Q ss_pred HHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcc
Q psy6549 73 QAIRRRVFYNKLFLIMIIVIESMTLSVMLYLKIFKK 108 (109)
Q Consensus 73 ~~m~rr~~~~K~il~~ii~~l~l~i~~viy~k~~~~ 108 (109)
..+.......-++-|.+++++++++++.++....++
T Consensus 48 ~~ls~~~~~~s~lpy~~l~~~ll~i~l~~~~~~~~~ 83 (116)
T PF11877_consen 48 HLLSNNIFGHSFLPYLLLVLLLLAIILAIYQARKKG 83 (116)
T ss_pred HHHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHcC
Confidence 345566677777888888888888877765555443
No 204
>PF09788 Tmemb_55A: Transmembrane protein 55A; InterPro: IPR019178 Members of this family catalyse the hydrolysis of the 4-position phosphate of phosphatidylinositol 4,5-bisphosphate, in the reaction: 1-phosphatidyl-myo-inositol 4,5-bisphosphate + H(2)O = 1-phosphatidyl-1D-myo-inositol 5-phosphate + phosphate.
Probab=38.44 E-value=30 Score=26.28 Aligned_cols=28 Identities=4% Similarity=0.205 Sum_probs=18.6
Q ss_pred HHHHHHHHhhHHHHHHHHHHHHHHHHHH
Q psy6549 73 QAIRRRVFYNKLFLIMIIVIESMTLSVM 100 (109)
Q Consensus 73 ~~m~rr~~~~K~il~~ii~~l~l~i~~v 100 (109)
.++..+..+.|+|+++++.++++++++.
T Consensus 187 SSVG~~faRkR~i~f~llgllfliiaig 214 (256)
T PF09788_consen 187 SSVGPRFARKRAIIFFLLGLLFLIIAIG 214 (256)
T ss_pred ccccchHhhhHHHHHHHHHHHHHHHHHH
Confidence 3455667777888777777666655544
No 205
>PF06716 DUF1201: Protein of unknown function (DUF1201); InterPro: IPR009591 This entry consists of several Beet yellows virus (BYV) putative membrane-binding proteins of around 54 residues in length. The function of this currently unknown.
Probab=38.41 E-value=77 Score=17.97 Aligned_cols=7 Identities=0% Similarity=0.425 Sum_probs=2.8
Q ss_pred HHHHHHh
Q psy6549 100 MLYLKIF 106 (109)
Q Consensus 100 viy~k~~ 106 (109)
+.|++|+
T Consensus 25 ~~F~~F~ 31 (54)
T PF06716_consen 25 VVFIWFV 31 (54)
T ss_pred HHHHHHH
Confidence 3344443
No 206
>PRK12800 fliF flagellar MS-ring protein; Reviewed
Probab=38.40 E-value=1.1e+02 Score=25.84 Aligned_cols=23 Identities=13% Similarity=0.330 Sum_probs=9.6
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHH
Q psy6549 81 YNKLFLIMIIVIESMTLSVMLYLK 104 (109)
Q Consensus 81 ~~K~il~~ii~~l~l~i~~viy~k 104 (109)
..|+++.+++++++ ++++++|++
T Consensus 30 ~r~i~l~~~~~~~v-~~~~~l~~~ 52 (574)
T PRK12800 30 TRKLTMMAMIALAV-AAGLAVFFW 52 (574)
T ss_pred hHHHHHHHHHHHHH-HHHHHHHHh
Confidence 33555544444333 333334443
No 207
>PF12409 P5-ATPase: P5-type ATPase cation transporter
Probab=38.33 E-value=53 Score=21.47 Aligned_cols=23 Identities=13% Similarity=0.284 Sum_probs=12.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Q psy6549 83 KLFLIMIIVIESMTLSVMLYLKI 105 (109)
Q Consensus 83 K~il~~ii~~l~l~i~~viy~k~ 105 (109)
|.+++.++.++.++++.++++++
T Consensus 16 r~~l~~~l~ilT~Gll~L~~~W~ 38 (119)
T PF12409_consen 16 RTILYYFLCILTLGLLYLVFRWF 38 (119)
T ss_pred HHHHHHHHHHHHHHHHHHHHhhH
Confidence 45565555555566665555444
No 208
>PF06009 Laminin_II: Laminin Domain II; InterPro: IPR010307 It has been suggested that the domains I and II from laminin A, B1 and B2 may come together to form a triple helical coiled-coil structure [].; GO: 0007155 cell adhesion, 0005604 basement membrane; PDB: 2WJS_A.
Probab=37.79 E-value=11 Score=25.55 Aligned_cols=51 Identities=12% Similarity=0.100 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHH
Q psy6549 24 VRSEQAALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSRRMIQA 74 (109)
Q Consensus 24 ~~s~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~ 74 (109)
.+|...+.++.+.-..+..+|..-.+.+..+...++++...+..+.+.+..
T Consensus 6 ~~A~~~a~~v~~~~~~i~~~l~~~~~~~~~~~~~v~~t~~~~~~~~~~l~~ 56 (138)
T PF06009_consen 6 DEANETAANVLDRLDPISENLENWSENLGEINSDVEETNQDISDANKALDD 56 (138)
T ss_dssp ---------------------------------------------------
T ss_pred HHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344445555555555555566666666666666655554444444444433
No 209
>PF09716 ETRAMP: Malarial early transcribed membrane protein (ETRAMP); InterPro: IPR006389 These sequences represent a family of proteins from the malaria parasite Plasmodium falciparum, several of which have been shown to be expressed specifically in the ring stage as well as the rodent parasite Plasmodium yoelii []. A homologue from Plasmodium chabaudi was localized to the parasitophorous vacuole membrane []. Members have an initial hydrophobic, Phe/Tyr-rich stretch long enough to span the membrane, a highly charged region rich in Lys, a second putative transmembrane region, and a second highly charged, low complexity sequence region. Some members have up to 100 residues of additional C-terminal sequence. These genes have been shown to be found in the sub-telomeric regions of both Plasmodium falciparum and P. yoelii chromosomes.
Probab=37.61 E-value=1e+02 Score=19.15 Aligned_cols=21 Identities=19% Similarity=0.230 Sum_probs=12.9
Q ss_pred HHHHHHHHHHhhHHHHHHHHH
Q psy6549 71 MIQAIRRRVFYNKLFLIMIIV 91 (109)
Q Consensus 71 ~l~~m~rr~~~~K~il~~ii~ 91 (109)
.-..+.+..-.+|.+++..+.
T Consensus 42 ~d~~i~kK~k~kK~iiiS~i~ 62 (84)
T PF09716_consen 42 IDDKIEKKKKNKKKIIISTIA 62 (84)
T ss_pred hhHHHHHHHhccchhhHHHHH
Confidence 335667776777777664443
No 210
>KOG3202|consensus
Probab=37.60 E-value=1.7e+02 Score=21.86 Aligned_cols=63 Identities=14% Similarity=0.163 Sum_probs=35.1
Q ss_pred HHHHHHHhhhH----HHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHH
Q psy6549 4 IHRETLRETNQ----ALVRTGESIVR-SEQAALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSRR 70 (109)
Q Consensus 4 ~~R~~ll~~~~----~l~~t~~~L~~-s~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~ 70 (109)
..+++|++..+ .+.++-.++.. |..+.+|+++.|. ++. .+-.-+.+++.+++.....+..-++
T Consensus 141 ~~qqqm~~eQDe~Ld~ls~ti~rlk~~a~~~g~EL~~Q~~-llD---dl~~e~d~t~srl~~~~~~l~~v~~ 208 (235)
T KOG3202|consen 141 QLQQQMLQEQDEGLDGLSATVQRLKGMALAMGEELEEQGR-LLD---DLDNEMDRTESRLDRVMKRLAKVNR 208 (235)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHH-HHH---HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45667776554 34455544443 3344444444443 333 4556677777777777666666555
No 211
>PF07889 DUF1664: Protein of unknown function (DUF1664); InterPro: IPR012458 The members of this family are hypothetical plant proteins of unknown function. The region featured in this family is approximately 100 amino acids long.
Probab=37.51 E-value=1.3e+02 Score=20.35 Aligned_cols=45 Identities=20% Similarity=0.213 Sum_probs=23.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHH
Q psy6549 21 ESIVRSEQAALEAEAIGTGVISELGVQREALVRARDRLVDTDLHV 65 (109)
Q Consensus 21 ~~L~~s~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l 65 (109)
++|++--.-++|..++-..|-++...=++.+.+++..++.+...+
T Consensus 68 qRId~vd~klDe~~ei~~~i~~eV~~v~~dv~~i~~dv~~v~~~V 112 (126)
T PF07889_consen 68 QRIDRVDDKLDEQKEISKQIKDEVTEVREDVSQIGDDVDSVQQMV 112 (126)
T ss_pred HHHHHHHhhHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHH
Confidence 444444445555566666655555554555555554444444433
No 212
>PF11947 DUF3464: Protein of unknown function (DUF3464); InterPro: IPR021855 This family of proteins are functionally uncharacterised. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 137 to 196 amino acids in length.
Probab=37.06 E-value=1.1e+02 Score=21.46 Aligned_cols=32 Identities=13% Similarity=0.167 Sum_probs=19.2
Q ss_pred HHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy6549 76 RRRVFYNKLFLIMIIVIESMTLSVMLYLKIFK 107 (109)
Q Consensus 76 ~rr~~~~K~il~~ii~~l~l~i~~viy~k~~~ 107 (109)
..||++.=.+.+++=.++.++.+.++|+-..+
T Consensus 58 s~RM~rRm~~~~GiP~~lG~~~f~~~y~l~~~ 89 (153)
T PF11947_consen 58 SNRMLRRMAVFVGIPTALGVAVFVVFYYLKSR 89 (153)
T ss_pred HHHHHHHHHHHhchHHHHHHHHHHHHHHHHhc
Confidence 33444444567777777767666666665443
No 213
>TIGR02796 tolQ TolQ protein. TolQ is one of the essential components of the Tol-Pal system. Together with TolR, it harnesses protonmotive force to energize TolA, which spans the periplasm to reach the complex of TolB and Pal at the outer member. The tol-pal system proves to be important for maintaining outer membrane integrity. Gene pairs similar to the TolQ and TolR gene pair often number several per genome, but this model describes specificially TolQ per se, as found in tol-pal operons. A close homolog, excluded from this model, is ExbB of the ExbB/ExbD/TonB protein complex, which powers transport of siderophores and vitamin B12 across the bacterial outer membrane. The Tol-Pal system is exploited by colicin and filamentous phage DNA to enter the cell. It is also implicated in pathogenesis in several bacterial species
Probab=36.93 E-value=61 Score=23.55 Aligned_cols=22 Identities=9% Similarity=0.401 Sum_probs=14.9
Q ss_pred hHHHHHHHHHHHHHHHHHHHHH
Q psy6549 82 NKLFLIMIIVIESMTLSVMLYL 103 (109)
Q Consensus 82 ~K~il~~ii~~l~l~i~~viy~ 103 (109)
.|+++|.++++-++++.++++.
T Consensus 13 ~~~vm~~Ll~~Sii~~aviieR 34 (215)
T TIGR02796 13 VKLVMLILLLASIISWAIIFQK 34 (215)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 3578888777776666666543
No 214
>PF00015 MCPsignal: Methyl-accepting chemotaxis protein (MCP) signalling domain; InterPro: IPR004089 Methyl-accepting chemotaxis proteins (MCPs) are a family of bacterial receptors that mediate chemotaxis to diverse signals, responding to changes in the concentration of attractants and repellents in the environment by altering swimming behaviour []. Environmental diversity gives rise to diversity in bacterial signalling receptors, and consequently there are many genes encoding MCPs []. For example, there are four well-characterised MCPs found in Escherichia coli: Tar (taxis towards aspartate and maltose, away from nickel and cobalt), Tsr (taxis towards serine, away from leucine, indole and weak acids), Trg (taxis towards galactose and ribose) and Tap (taxis towards dipeptides). MCPs share similar topology and signalling mechanisms. MCPs either bind ligands directly or interact with ligand-binding proteins, transducing the signal to downstream signalling proteins in the cytoplasm. MCPs undergo two covalent modifications: deamidation and reversible methylation at a number of glutamate residues. Attractants increase the level of methylation, while repellents decrease it. The methyl groups are added by the methyl-transferase cheR and are removed by the methylesterase cheB. Most MCPs are homodimers that contain the following organisation: an N-terminal signal sequence that acts as a transmembrane domain in the mature protein; a poorly-conserved periplasmic receptor (ligand-binding) domain; a second transmembrane domain; and a highly-conserved C-terminal cytoplasmic domain that interacts with downstream signalling components. The C-terminal domain contains the glycosylated glutamate residues. This entry represents the signalling domain found in several methyl-accepting chemotaxis proteins. This domain is thought to transduce the signal to CheA since it is highly conserved in very diverse MCPs.; GO: 0004871 signal transducer activity, 0007165 signal transduction, 0016020 membrane; PDB: 2CH7_A 3ZX6_B 1QU7_A 3G6B_B 3UR1_C 3G67_B.
Probab=36.92 E-value=1.4e+02 Score=20.52 Aligned_cols=55 Identities=11% Similarity=0.085 Sum_probs=23.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHH
Q psy6549 14 QALVRTGESIVRSEQAALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRS 68 (109)
Q Consensus 14 ~~l~~t~~~L~~s~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a 68 (109)
+..+.+.+.+.+......+..+.-.++......|-+.+..+...+.++.......
T Consensus 128 ~~~~~~~~~l~~i~~~~~~i~~~i~~i~~~~~~~~~~~~~i~~~i~~i~~~~~~~ 182 (213)
T PF00015_consen 128 ESVEETSESLEEIAESVEEISDSIEEISESAEEQSESIEQINESIEEISEISEQI 182 (213)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred ccchhcchhhhhhhhhhhHHhhhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3333444444444444444444444444444444444444444444444433333
No 215
>PF13163 DUF3999: Protein of unknown function (DUF3999)
Probab=36.91 E-value=30 Score=27.90 Aligned_cols=22 Identities=18% Similarity=0.329 Sum_probs=17.6
Q ss_pred hhHHHHHHHHHHHHHHHHHHHH
Q psy6549 81 YNKLFLIMIIVIESMTLSVMLY 102 (109)
Q Consensus 81 ~~K~il~~ii~~l~l~i~~viy 102 (109)
.+|+++|+++++-+++++++.|
T Consensus 406 ~~~~~LW~~Lv~gV~vL~~mA~ 427 (429)
T PF13163_consen 406 WKRWLLWGALVLGVAVLGGMAW 427 (429)
T ss_pred hhhhHHHHHHHHHHHHHHHHhe
Confidence 6889999999888777776655
No 216
>PF05478 Prominin: Prominin; InterPro: IPR008795 The prominins are an emerging family of proteins that, among the multispan membrane proteins, display a novel topology. Mouse and Homo sapiens prominin and (Mus musculus) prominin-like 1 (PROML1) are predicted to contain five membrane spanning domains, with an N-terminal domain exposed to the extracellular space followed by four, alternating small cytoplasmic and large extracellular, loops and a cytoplasmic C-terminal domain []. The exact function of prominin is unknown although in humans defects in PROM1, the gene coding for prominin, cause retinal degeneration [].; GO: 0016021 integral to membrane
Probab=36.84 E-value=2.9e+02 Score=24.16 Aligned_cols=56 Identities=9% Similarity=-0.048 Sum_probs=26.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHHH
Q psy6549 18 RTGESIVRSEQAALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSRRMIQAI 75 (109)
Q Consensus 18 ~t~~~L~~s~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~m 75 (109)
++.+.+.+.++.+++..+.=.+...++..+ ......+-..++....+.....+...
T Consensus 354 qt~~~v~~ik~~l~~~~~~i~~~a~~i~~~--~~~~~s~~~~~~~~~~~~~~~~~~~y 409 (806)
T PF05478_consen 354 QTSDVVPPIKRDLDSIGKQIRSQAKQIPNQ--IDSNISDILNNTERSSRSFEDEYEKY 409 (806)
T ss_pred HHHhhhHHHHHHHHHHHHHHHHHHHHhHHH--HHHHHHHHHHHHHHHhhcchhHHHHH
Confidence 444445555555544444334444444444 22333444555555555555555555
No 217
>cd07597 BAR_SNX8 The Bin/Amphiphysin/Rvs (BAR) domain of Sorting Nexin 8. BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions. Sorting nexins (SNXs) are Phox homology (PX) domain containing proteins that are involved in regulating membrane traffic and protein sorting in the endosomal system. SNXs differ from each other in their lipid-binding specificity, subcellular localization and specific function in the endocytic pathway. A subset of SNXs also contain BAR domains. The PX-BAR structural unit determines the specific membrane targeting of SNXs. SNX8 and the yeast counterpart Mvp1p are involved in sorting and delivery of late-Golgi proteins, such as carboxypeptidase Y, to vacuoles. BAR domains form dimers that bind to membranes, induce membrane bending and curvature, and may also be involved in protein-protein interactions.
Probab=36.34 E-value=1.8e+02 Score=21.53 Aligned_cols=64 Identities=22% Similarity=0.262 Sum_probs=41.7
Q ss_pred HHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhH----HHHHHHHHHHHHHHHHH
Q psy6549 16 LVRTGESIVRSE-QAALEAEAIGTGVISELGVQREALVRARDRLVDTD----LHVSRSRRMIQAIRRRV 79 (109)
Q Consensus 16 l~~t~~~L~~s~-~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~----~~l~~a~~~l~~m~rr~ 79 (109)
+...+.++..+. ....++...-..+++.|..|++-|...++-.+..+ ++..++.+-+..+..+.
T Consensus 93 l~~~s~~~~~~s~~~~~~a~~~~~~vlE~Lk~~~d~l~S~r~lf~R~~k~~~~~i~~l~~ri~~~~~kl 161 (246)
T cd07597 93 LSSLSKHFQLLSDLSEDEARAEEDGVLEKLKLQLDLLVSLRDLFERHEKLSLNNIQRLLKRIELNKKKL 161 (246)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcccccHHHHHHHHHHHHHHH
Confidence 334444443333 34456777778899999999999999999888433 45555555555555443
No 218
>KOG2736|consensus
Probab=36.27 E-value=63 Score=26.08 Aligned_cols=28 Identities=29% Similarity=0.575 Sum_probs=19.6
Q ss_pred HhhHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy6549 80 FYNKLFLIMIIVIESMTLSVMLYLKIFK 107 (109)
Q Consensus 80 ~~~K~il~~ii~~l~l~i~~viy~k~~~ 107 (109)
..|-+++++++++...+.++..+|+|.|
T Consensus 70 l~N~li~i~viv~~Tfllv~ly~~rfyk 97 (406)
T KOG2736|consen 70 LLNALIMISVIVVMTFLLVVLYKYRFYK 97 (406)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4567788888887766666666666654
No 219
>KOG4684|consensus
Probab=36.27 E-value=68 Score=24.11 Aligned_cols=27 Identities=15% Similarity=0.354 Sum_probs=19.8
Q ss_pred HHHHHHHHHHHhhHHHHHHHHHHHHHH
Q psy6549 70 RMIQAIRRRVFYNKLFLIMIIVIESMT 96 (109)
Q Consensus 70 ~~l~~m~rr~~~~K~il~~ii~~l~l~ 96 (109)
+-+.++.+|....+.++++|+.+++.+
T Consensus 196 rKvSsvGsrfar~Ra~~ffilal~~av 222 (275)
T KOG4684|consen 196 RKVSSVGSRFARRRALLFFILALTVAV 222 (275)
T ss_pred cchhhhhhHHhhhhhHHHHHHHHHHHH
Confidence 446678888888888888877766443
No 220
>PRK00753 psbL photosystem II reaction center L; Provisional
Probab=36.14 E-value=62 Score=17.41 Aligned_cols=19 Identities=5% Similarity=0.277 Sum_probs=10.7
Q ss_pred HHHHHHHHHHHHHHHHHHH
Q psy6549 85 FLIMIIVIESMTLSVMLYL 103 (109)
Q Consensus 85 il~~ii~~l~l~i~~viy~ 103 (109)
+-||..+++++++.+--|+
T Consensus 19 Ly~GlLlifvl~vLFssYf 37 (39)
T PRK00753 19 LYLGLLLVFVLGILFSSYF 37 (39)
T ss_pred HHHHHHHHHHHHHHHHhhc
Confidence 3456666666666555543
No 221
>PRK10144 formate-dependent nitrite reductase complex subunit NrfF; Provisional
Probab=36.11 E-value=72 Score=21.66 Aligned_cols=24 Identities=17% Similarity=0.234 Sum_probs=13.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHh
Q psy6549 83 KLFLIMIIVIESMTLSVMLYLKIF 106 (109)
Q Consensus 83 K~il~~ii~~l~l~i~~viy~k~~ 106 (109)
-+++|++=++++++.++++|..+.
T Consensus 101 t~~LW~~P~~lll~g~~~~~~~~r 124 (126)
T PRK10144 101 TLVLWALPVVLLLLMALILWRVRA 124 (126)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHH
Confidence 477888766555544445555443
No 222
>PRK07193 fliF flagellar MS-ring protein; Reviewed
Probab=36.08 E-value=98 Score=26.01 Aligned_cols=23 Identities=22% Similarity=0.411 Sum_probs=9.0
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHH
Q psy6549 81 YNKLFLIMIIVIESMTLSVMLYLK 104 (109)
Q Consensus 81 ~~K~il~~ii~~l~l~i~~viy~k 104 (109)
..++++.++++++ ++.++++|++
T Consensus 25 ~r~~~l~~~~~~~-va~~~~~~~~ 47 (552)
T PRK07193 25 NRKLILLALLALL-VAAAIVLSLW 47 (552)
T ss_pred hhhHHHHHHHHHH-HHHHHHHHHh
Confidence 3344444433333 3333333433
No 223
>PF14316 DUF4381: Domain of unknown function (DUF4381)
Probab=35.70 E-value=61 Score=21.92 Aligned_cols=25 Identities=12% Similarity=0.250 Sum_probs=15.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcc
Q psy6549 84 LFLIMIIVIESMTLSVMLYLKIFKK 108 (109)
Q Consensus 84 ~il~~ii~~l~l~i~~viy~k~~~~ 108 (109)
+++.++++++++..++..|..+.++
T Consensus 24 wll~~lll~~~~~~~~~~~r~~~~~ 48 (146)
T PF14316_consen 24 WLLLALLLLLLILLLWRLWRRWRRN 48 (146)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHcc
Confidence 4555555566666677766666543
No 224
>PHA02902 putative IMV membrane protein; Provisional
Probab=35.67 E-value=84 Score=19.01 Aligned_cols=21 Identities=24% Similarity=0.341 Sum_probs=7.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q psy6549 84 LFLIMIIVIESMTLSVMLYLK 104 (109)
Q Consensus 84 ~il~~ii~~l~l~i~~viy~k 104 (109)
++..+++++.+..+++..|.+
T Consensus 6 fvi~~v~v~Ivclliya~YrR 26 (70)
T PHA02902 6 FVILAVIVIIFCLLIYAAYKR 26 (70)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 333333333333333344443
No 225
>COG1033 Predicted exporters of the RND superfamily [General function prediction only]
Probab=35.38 E-value=61 Score=28.21 Aligned_cols=70 Identities=11% Similarity=0.189 Sum_probs=42.4
Q ss_pred HHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy6549 30 ALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSRRMIQAIRRRVFYNKLFLIMIIVIESMTLSVMLYLKIFK 107 (109)
Q Consensus 30 ~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~m~rr~~~~K~il~~ii~~l~l~i~~viy~k~~~ 107 (109)
-.|.+..+.+++.++..+-.+.. +.....-..-+...|..-+.....+.....+++++++++++|..+.+
T Consensus 525 ~~~~~~~~~~~~~~i~~~~~~~g--------v~~~vtG~~vi~~~m~~~i~~sq~~~t~l~~~~V~~ll~i~fRs~~~ 594 (727)
T COG1033 525 QGELEDVGREILRDIEKENIPTG--------VKVYVTGESVIYVEMNELLTSSQLISTVLGIILVFALLLIIFRSPLK 594 (727)
T ss_pred hhHHHHHHHHHHHHHHhhcCCCC--------cEEEEcCchhHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhchHH
Confidence 34555566666555555443333 11222223345666777788888888777777777788888877654
No 226
>PRK10414 biopolymer transport protein ExbB; Provisional
Probab=35.25 E-value=66 Score=24.06 Aligned_cols=25 Identities=16% Similarity=0.230 Sum_probs=16.0
Q ss_pred HhhHHHHHHHHHHHHHHHHHHHHHH
Q psy6549 80 FYNKLFLIMIIVIESMTLSVMLYLK 104 (109)
Q Consensus 80 ~~~K~il~~ii~~l~l~i~~viy~k 104 (109)
.-=|+++|.++++-++...++++..
T Consensus 21 ~~~k~Vm~~Ll~~Si~swaiIieR~ 45 (244)
T PRK10414 21 IVVKCVMIGLILASVVTWAIFFSKS 45 (244)
T ss_pred HhHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3446888887777766666554433
No 227
>PF14899 DUF4492: Domain of unknown function (DUF4492)
Probab=35.11 E-value=75 Score=19.10 Aligned_cols=23 Identities=30% Similarity=0.413 Sum_probs=15.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhc
Q psy6549 85 FLIMIIVIESMTLSVMLYLKIFK 107 (109)
Q Consensus 85 il~~ii~~l~l~i~~viy~k~~~ 107 (109)
.+|.||++=+.++.+|+=.-|++
T Consensus 20 tLW~IIliKLfImF~vLK~FfFp 42 (64)
T PF14899_consen 20 TLWLIILIKLFIMFAVLKLFFFP 42 (64)
T ss_pred HHHHHHHHHHHHHHHHHHHHHCc
Confidence 47888887767677666655554
No 228
>KOG3637|consensus
Probab=34.92 E-value=39 Score=30.52 Aligned_cols=23 Identities=30% Similarity=0.416 Sum_probs=10.9
Q ss_pred HHHHHHHHHHHHHHHHHHH--Hhcc
Q psy6549 86 LIMIIVIESMTLSVMLYLK--IFKK 108 (109)
Q Consensus 86 l~~ii~~l~l~i~~viy~k--~~~~ 108 (109)
+..++.+|++++++++-|| |||+
T Consensus 983 ~svl~GLLlL~llv~~LwK~GFFKR 1007 (1030)
T KOG3637|consen 983 LSVLGGLLLLALLVLLLWKCGFFKR 1007 (1030)
T ss_pred HHHHHHHHHHHHHHHHHHhcCcccc
Confidence 3344445555554444444 5553
No 229
>PF07010 Endomucin: Endomucin; InterPro: IPR010740 This family consists of several mammalian endomucin proteins. Endomucin is an early endothelial-specific antigen that is also expressed on putative hematopoietic progenitor cells.
Probab=34.76 E-value=64 Score=24.36 Aligned_cols=8 Identities=0% Similarity=0.143 Sum_probs=3.1
Q ss_pred HHHHHHHh
Q psy6549 99 VMLYLKIF 106 (109)
Q Consensus 99 ~viy~k~~ 106 (109)
++-+|++-
T Consensus 207 LvgLyr~C 214 (259)
T PF07010_consen 207 LVGLYRMC 214 (259)
T ss_pred HHHHHHHh
Confidence 33344433
No 230
>PF05957 DUF883: Bacterial protein of unknown function (DUF883); InterPro: IPR010279 This family consists of several bacterial proteins of unknown function that include the Escherichia coli genes for ElaB, YgaM and YqjD.
Probab=34.60 E-value=1.1e+02 Score=18.87 Aligned_cols=71 Identities=23% Similarity=0.092 Sum_probs=35.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHh-hHHHHHHhhhHhhhHHHHHH-HHHHHHHHHHHHHhhHHHHHHHHHH
Q psy6549 22 SIVRSEQAALEAEAIGTGVISELGVQ-REALVRARDRLVDTDLHVSR-SRRMIQAIRRRVFYNKLFLIMIIVI 92 (109)
Q Consensus 22 ~L~~s~~~~~ete~iG~~il~~L~~Q-re~L~~~~~~~~~i~~~l~~-a~~~l~~m~rr~~~~K~il~~ii~~ 92 (109)
-..+...++.++-+.+.+...++..+ .+.+..+.+.+.+....+.. ++........-+-.|-|--+++.+.
T Consensus 10 l~~d~~~l~~~~~~~~~~~~~~~r~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~V~e~P~~svgiAag 82 (94)
T PF05957_consen 10 LRADLEDLARSAADLAGEKADEARDRAEEALDDARDRAEDAADQAREQAREAAEQTEDYVRENPWQSVGIAAG 82 (94)
T ss_pred HHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHChHHHHHHHHH
Confidence 33444445555555555555555433 35566666666666554443 3444444444444444444444433
No 231
>PF05454 DAG1: Dystroglycan (Dystrophin-associated glycoprotein 1); InterPro: IPR008465 Dystroglycan is one of the dystrophin-associated glycoproteins, which is encoded by a 5.5 kb transcript in Homo sapiens. The protein product is cleaved into two non-covalently associated subunits, [alpha] (N-terminal) and [beta] (C-terminal). In skeletal muscle the dystroglycan complex works as a transmembrane linkage between the extracellular matrix and the cytoskeleton [alpha]-dystroglycan is extracellular and binds to merosin ([alpha]-2 laminin) in the basement membrane, while [beta]-dystroglycan is a transmembrane protein and binds to dystrophin, which is a large rod-like cytoskeletal protein, absent in Duchenne muscular dystrophy patients. Dystrophin binds to intracellular actin cables. In this way, the dystroglycan complex, which links the extracellular matrix to the intracellular actin cables, is thought to provide structural integrity in muscle tissues. The dystroglycan complex is also known to serve as an agrin receptor in muscle, where it may regulate agrin-induced acetylcholine receptor clustering at the neuromuscular junction. There is also evidence which suggests the function of dystroglycan as a part of the signal transduction pathway because it is shown that Grb2, a mediator of the Ras-related signal pathway, can interact with the cytoplasmic domain of dystroglycan. In general, aberrant expression of dystrophin-associated protein complex underlies the pathogenesis of Duchenne muscular dystrophy, Becker muscular dystrophy and severe childhood autosomal recessive muscular dystrophy. Interestingly, no genetic disease has been described for either [alpha]- or [beta]-dystroglycan. Dystroglycan is widely distributed in non-muscle tissues as well as in muscle tissues. During epithelial morphogenesis of kidney, the dystroglycan complex is shown to act as a receptor for the basement membrane. Dystroglycan expression in Mus musculus brain and neural retina has also been reported. However, the physiological role of dystroglycan in non-muscle tissues has remained unclear [].; PDB: 1EG4_P.
Probab=34.59 E-value=13 Score=28.67 Aligned_cols=9 Identities=0% Similarity=0.154 Sum_probs=0.0
Q ss_pred HHHHHHHHH
Q psy6549 96 TLSVMLYLK 104 (109)
Q Consensus 96 ~i~~viy~k 104 (109)
+++++++|+
T Consensus 163 ~iIa~icyr 171 (290)
T PF05454_consen 163 GIIACICYR 171 (290)
T ss_dssp ---------
T ss_pred HHHHHHhhh
Confidence 344445555
No 232
>PF04790 Sarcoglycan_1: Sarcoglycan complex subunit protein; InterPro: IPR006875 The dystrophin glycoprotein complex (DGC) is a membrane-spanning complex that links the interior cytoskeleton to the extracellular matrix in muscle. The sarcoglycan complex is a subcomplex within the DGC and is composed of several muscle-specific, transmembrane proteins (alpha-, beta-, gamma-, delta- and zeta-sarcoglycan). The sarcoglycans are asparagine-linked glycosylated proteins with single transmembrane domains. This family contains beta, gamma and delta members [, ].; GO: 0007010 cytoskeleton organization, 0016012 sarcoglycan complex, 0016021 integral to membrane
Probab=34.54 E-value=62 Score=24.56 Aligned_cols=17 Identities=12% Similarity=0.157 Sum_probs=10.0
Q ss_pred hHHHHHHHHHHHHHHHH
Q psy6549 82 NKLFLIMIIVIESMTLS 98 (109)
Q Consensus 82 ~K~il~~ii~~l~l~i~ 98 (109)
.|+++|+++++|+++.+
T Consensus 10 rk~cly~~vllL~il~i 26 (264)
T PF04790_consen 10 RKRCLYLFVLLLFILAI 26 (264)
T ss_pred hhhhHHHHHHHHHHHHH
Confidence 46677766666544433
No 233
>PF10389 CoatB: Bacteriophage coat protein B ; InterPro: IPR008020 The major coat protein in the capsid of filamentous bacteriophage forms a helical assembly of about 7000 identical protomers, with each protomer comprised of 46 amino acids, after the cleavage of the signal peptide. Each protomer forms a slightly curved helix that combines to form a tubular structure that encapsulates the viral DNA [].; PDB: 2IFO_A.
Probab=34.49 E-value=54 Score=18.39 Aligned_cols=13 Identities=8% Similarity=0.169 Sum_probs=7.4
Q ss_pred HHHHHHHHHHhcc
Q psy6549 96 TLSVMLYLKIFKK 108 (109)
Q Consensus 96 ~i~~viy~k~~~~ 108 (109)
+++.+--|||.||
T Consensus 32 v~V~i~v~kwiRr 44 (46)
T PF10389_consen 32 VIVGIAVYKWIRR 44 (46)
T ss_dssp HHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHh
Confidence 3445556666654
No 234
>PRK06007 fliF flagellar MS-ring protein; Reviewed
Probab=34.47 E-value=62 Score=26.99 Aligned_cols=25 Identities=8% Similarity=0.066 Sum_probs=14.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcc
Q psy6549 84 LFLIMIIVIESMTLSVMLYLKIFKK 108 (109)
Q Consensus 84 ~il~~ii~~l~l~i~~viy~k~~~~ 108 (109)
++.+++++++++++++++|..+.++
T Consensus 440 ~~~~~~~~l~~l~l~~~v~rp~~~~ 464 (542)
T PRK06007 440 LIKLAAGALLILILIFFVLRPRLRP 464 (542)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcc
Confidence 3345555555566666666666553
No 235
>PF06679 DUF1180: Protein of unknown function (DUF1180); InterPro: IPR009565 This entry consists of several hypothetical eukaryotic proteins thought to be membrane proteins. Their function is unknown.
Probab=34.31 E-value=66 Score=22.74 Aligned_cols=16 Identities=0% Similarity=0.279 Sum_probs=6.0
Q ss_pred HHHHHHHHHHHHHHHH
Q psy6549 87 IMIIVIESMTLSVMLY 102 (109)
Q Consensus 87 ~~ii~~l~l~i~~viy 102 (109)
|.++++..++++++++
T Consensus 99 ~Vl~g~s~l~i~yfvi 114 (163)
T PF06679_consen 99 YVLVGLSALAILYFVI 114 (163)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3333333333443333
No 236
>PF06103 DUF948: Bacterial protein of unknown function (DUF948); InterPro: IPR009293 This family consists of bacterial sequences several of which are thought to be general stress proteins.
Probab=34.20 E-value=1.1e+02 Score=18.67 Aligned_cols=37 Identities=16% Similarity=0.159 Sum_probs=20.9
Q ss_pred HHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHH
Q psy6549 32 EAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRS 68 (109)
Q Consensus 32 ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a 68 (109)
+..++-.++-.-++...+.+..++.+++.++.-.+..
T Consensus 41 ~~~~i~~e~~~ll~~~n~l~~dv~~k~~~v~~~~~~v 77 (90)
T PF06103_consen 41 QVDPITKEINDLLHNTNELLEDVNEKLEKVDPVFEAV 77 (90)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHH
Confidence 3344555555556666666666666666655544443
No 237
>KOG1326|consensus
Probab=34.00 E-value=73 Score=28.91 Aligned_cols=33 Identities=21% Similarity=0.375 Sum_probs=21.8
Q ss_pred HHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHH
Q psy6549 71 MIQAIRRRVFYNKLFLIMIIVIESMTLSVMLYL 103 (109)
Q Consensus 71 ~l~~m~rr~~~~K~il~~ii~~l~l~i~~viy~ 103 (109)
.++-+..+.++-.++++++|+++++..++++|.
T Consensus 1061 ~~~~i~W~~yr~~il~~l~ililll~l~~fly~ 1093 (1105)
T KOG1326|consen 1061 SFKFILWHRYRWYILLLLLILILLLLLALFLYS 1093 (1105)
T ss_pred chhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 344566667777777777777776666666664
No 238
>PF06570 DUF1129: Protein of unknown function (DUF1129); InterPro: IPR009214 There are currently no experimental data for members of this group or their homologues. However, these proteins contain predicted integral membrane proteins (with several transmembrane segments).
Probab=33.93 E-value=1.5e+02 Score=21.15 Aligned_cols=27 Identities=11% Similarity=0.079 Sum_probs=10.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy6549 17 VRTGESIVRSEQAALEAEAIGTGVISE 43 (109)
Q Consensus 17 ~~t~~~L~~s~~~~~ete~iG~~il~~ 43 (109)
.+....|..+...-+|.|++=.+++.+
T Consensus 11 ~~l~~~L~~~~~~e~~~e~~L~eil~~ 37 (206)
T PF06570_consen 11 FDLRKYLRSSGVSEEEIEELLEEILPH 37 (206)
T ss_pred HHHHHHHHHcCCCHHHHHHHHHHHHHH
Confidence 333344433333333444444444433
No 239
>PF05393 Hum_adeno_E3A: Human adenovirus early E3A glycoprotein; InterPro: IPR008652 This family consists of several early glycoproteins (E3A), from human adenovirus type 2.; GO: 0016021 integral to membrane
Probab=33.92 E-value=92 Score=20.01 Aligned_cols=21 Identities=24% Similarity=0.162 Sum_probs=8.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q psy6549 84 LFLIMIIVIESMTLSVMLYLK 104 (109)
Q Consensus 84 ~il~~ii~~l~l~i~~viy~k 104 (109)
|+.++.+++++++.-++.+.|
T Consensus 37 ~lvI~~iFil~VilwfvCC~k 57 (94)
T PF05393_consen 37 FLVICGIFILLVILWFVCCKK 57 (94)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 333444444444444444443
No 240
>TIGR02797 exbB tonB-system energizer ExbB. This model describes ExbB proteins, part of the MotA/TolQ/ExbB protein family. The paired proteins MotA and MotB, TolQ and TolR, and ExbB and ExbD harness the proton-motive force to drive the flagellar motor, energize the Tol-Pal system, or energize TonB, respectively. Tol-Pal and TonB are both active at the outer membrane. Genomes may have many different TonB-dependent receptors, of which many of those characterized are involved in siderophore transport across the outer membrane.
Probab=33.88 E-value=52 Score=23.82 Aligned_cols=22 Identities=14% Similarity=0.167 Sum_probs=14.6
Q ss_pred hHHHHHHHHHHHHHHHHHHHHH
Q psy6549 82 NKLFLIMIIVIESMTLSVMLYL 103 (109)
Q Consensus 82 ~K~il~~ii~~l~l~i~~viy~ 103 (109)
-|+++|.++++-++...++++.
T Consensus 13 ~k~vm~~Ll~~Si~s~aiiieR 34 (211)
T TIGR02797 13 VKAVMIGLALASVVTWTIWIAK 34 (211)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 4678888877776666555433
No 241
>KOG1094|consensus
Probab=33.81 E-value=55 Score=28.37 Aligned_cols=20 Identities=20% Similarity=0.605 Sum_probs=12.4
Q ss_pred HHHHHHHHHHHHHHHHHHHH
Q psy6549 84 LFLIMIIVIESMTLSVMLYL 103 (109)
Q Consensus 84 ~il~~ii~~l~l~i~~viy~ 103 (109)
.|+.+|+++++++|.+++|.
T Consensus 395 ~~f~~if~iva~ii~~~L~R 414 (807)
T KOG1094|consen 395 IIFVAIFLIVALIIALMLWR 414 (807)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 45556666666666666666
No 242
>PF10732 DUF2524: Protein of unknown function (DUF2524); InterPro: IPR019668 This entry represents proteins with unknown function, and appear to be restricted to the Bacillaceae.
Probab=33.79 E-value=1.3e+02 Score=19.09 Aligned_cols=19 Identities=37% Similarity=0.281 Sum_probs=14.1
Q ss_pred HHhhHHHHHHhhhHhhhHH
Q psy6549 45 GVQREALVRARDRLVDTDL 63 (109)
Q Consensus 45 ~~Qre~L~~~~~~~~~i~~ 63 (109)
.+|||+|.|.+-.+....+
T Consensus 62 ~qQREqL~R~rlQlqqLQN 80 (84)
T PF10732_consen 62 PQQREQLHRMRLQLQQLQN 80 (84)
T ss_pred HHHHHHHHHHHHHHHHHHh
Confidence 4899999988777665544
No 243
>PRK15048 methyl-accepting chemotaxis protein II; Provisional
Probab=33.73 E-value=2.6e+02 Score=22.72 Aligned_cols=65 Identities=12% Similarity=0.099 Sum_probs=42.1
Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHHH
Q psy6549 11 ETNQALVRTGESIVRSEQAALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSRRMIQAI 75 (109)
Q Consensus 11 ~~~~~l~~t~~~L~~s~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~m 75 (109)
.+.+..+.+.+.+.+....+.++.+.-.++.....+|.+....+...+.++.............+
T Consensus 428 ~~~~~~~~~~~~~~~i~~~~~~~~~~~~~i~~~~~~~~~~~~~i~~~~~~i~~~~~~~~~~~~~~ 492 (553)
T PRK15048 428 TGSVLVESAGETMNNIVNAVTRVTDIMGEIASASDEQSRGIDQVALAVSEMDRVTQQNASLVQES 492 (553)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34455556666666666666666666667777777777777777777777776665544444443
No 244
>COG4640 Predicted membrane protein [Function unknown]
Probab=33.68 E-value=55 Score=26.70 Aligned_cols=21 Identities=5% Similarity=-0.058 Sum_probs=11.3
Q ss_pred HhhHHHHHHHHHHHHHHHHHH
Q psy6549 80 FYNKLFLIMIIVIESMTLSVM 100 (109)
Q Consensus 80 ~~~K~il~~ii~~l~l~i~~v 100 (109)
-..++|.|.++++.+++|++.
T Consensus 48 skK~ii~was~a~~lIlii~~ 68 (465)
T COG4640 48 SKKKIIPWASGAFILILIIIL 68 (465)
T ss_pred ccceeehhHHHHHHHHHHHHH
Confidence 344566776666554444433
No 245
>PRK11638 lipopolysaccharide biosynthesis protein WzzE; Provisional
Probab=33.66 E-value=1.1e+02 Score=24.11 Aligned_cols=31 Identities=23% Similarity=0.376 Sum_probs=19.0
Q ss_pred HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHH
Q psy6549 72 IQAIRRRVFYNKLFLIMIIVIESMTLSVMLYLK 104 (109)
Q Consensus 72 l~~m~rr~~~~K~il~~ii~~l~l~i~~viy~k 104 (109)
+..+.+-..+.|++.+++.++. +++.++|..
T Consensus 12 l~~L~~~Lw~~k~~Ii~~t~~~--~~~~~~~s~ 42 (342)
T PRK11638 12 IRGLCRTLWAGKLWIIGMALLF--ALIALGYSF 42 (342)
T ss_pred HHHHHHHHHHhhHHHHHHHHHH--HHHHHHHHh
Confidence 5666777777777766665554 344455553
No 246
>PF06009 Laminin_II: Laminin Domain II; InterPro: IPR010307 It has been suggested that the domains I and II from laminin A, B1 and B2 may come together to form a triple helical coiled-coil structure [].; GO: 0007155 cell adhesion, 0005604 basement membrane; PDB: 2WJS_A.
Probab=33.51 E-value=14 Score=25.00 Aligned_cols=40 Identities=23% Similarity=0.231 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHH
Q psy6549 30 ALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSR 69 (109)
Q Consensus 30 ~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~ 69 (109)
+.++.++..+++..+..=.+.+.+....+.++...++...
T Consensus 5 a~~A~~~a~~v~~~~~~i~~~l~~~~~~~~~~~~~v~~t~ 44 (138)
T PF06009_consen 5 ADEANETAANVLDRLDPISENLENWSENLGEINSDVEETN 44 (138)
T ss_dssp ----------------------------------------
T ss_pred HHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHH
Confidence 3333334444444333333333334444443333333333
No 247
>TIGR01432 QOXA cytochrome aa3 quinol oxidase, subunit II. This enzyme catalyzes the oxidation of quinol with the concomitant reduction of molecular oxygen to water. This acts as the terminal electron acceptor in the respiratory chain. This subunit contains two transmembrane helices and a large external domain responsible for the binding and oxidation of quinol. QuoX is (presently) only found in gram positive bacteria of the Bacillus/Staphylococcus group. Like CyoA, the ubiquinol oxidase found in proteobacteria, the residues responsible for the ligation of Cu(a) and cytochrome c (found in the related cyt. c oxidases) are absent. Unlike CyoA, QoxA is in complex with a subunit I which contains cytochromes a similar to the cyt. c oxidases (as opposed to cytochromes b).
Probab=33.47 E-value=73 Score=23.13 Aligned_cols=8 Identities=13% Similarity=0.148 Sum_probs=3.5
Q ss_pred HHHHHHhc
Q psy6549 100 MLYLKIFK 107 (109)
Q Consensus 100 viy~k~~~ 107 (109)
.+++|+.+
T Consensus 43 ~~~~kyr~ 50 (217)
T TIGR01432 43 IFLVKYRY 50 (217)
T ss_pred HHHHHHHh
Confidence 34445443
No 248
>TIGR01478 STEVOR variant surface antigen, stevor family. This model represents the stevor branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of stevor sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 8 bits.
Probab=33.31 E-value=64 Score=25.02 Aligned_cols=22 Identities=18% Similarity=0.306 Sum_probs=12.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHhc
Q psy6549 86 LIMIIVIESMTLSVMLYLKIFK 107 (109)
Q Consensus 86 l~~ii~~l~l~i~~viy~k~~~ 107 (109)
++.+|+.++++|++|-.|+-.|
T Consensus 266 lvllil~vvliiLYiWlyrrRK 287 (295)
T TIGR01478 266 LVLIILTVVLIILYIWLYRRRK 287 (295)
T ss_pred HHHHHHHHHHHHHHHHHHHhhc
Confidence 4455555566677775555444
No 249
>PRK10801 colicin uptake protein TolQ; Provisional
Probab=33.02 E-value=76 Score=23.36 Aligned_cols=22 Identities=14% Similarity=0.345 Sum_probs=14.5
Q ss_pred hHHHHHHHHHHHHHHHHHHHHH
Q psy6549 82 NKLFLIMIIVIESMTLSVMLYL 103 (109)
Q Consensus 82 ~K~il~~ii~~l~l~i~~viy~ 103 (109)
-|+++|.++++.++...+++..
T Consensus 14 ~k~vm~~Ll~~Si~s~aIiieR 35 (227)
T PRK10801 14 VKLIMLILIGFSIASWAIIIQR 35 (227)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 3578888777776666655533
No 250
>PF10183 ESSS: ESSS subunit of NADH:ubiquinone oxidoreductase (complex I) ; InterPro: IPR019329 NADH:ubiquinone oxidoreductase (complex I) (1.6.5.3 from EC) is a respiratory-chain enzyme that catalyses the transfer of two electrons from NADH to ubiquinone in a reaction that is associated with proton translocation across the membrane (NADH + ubiquinone = NAD+ + ubiquinol) []. Complex I is a major source of reactive oxygen species (ROS) that are predominantly formed by electron transfer from FMNH(2). Complex I is found in bacteria, cyanobacteria (as a NADH-plastoquinone oxidoreductase), archaea [], mitochondira, and in the hydrogenosome, a mitochondria-derived organelle. In general, the bacterial complex consists of 14 different subunits, while the mitochondrial complex contains homologues to these subunits in addition to approximately 31 additional proteins []. Mitochondrial complex I, which is located in the inner mitochondrial membrane, is the largest multimeric respiratory enzyme in the mitochondria, consisting of more than 40 subunits, one FMN co-factor and eight FeS clusters []. The assembly of mitochondrial complex I is an intricate process that requires the cooperation of the nuclear and mitochondrial genomes [, ]. Mitochondrial complex I can cycle between active and deactive forms that can be distinguished by the reactivity towards divalent cations and thiol-reactive agents. All redox prosthetic groups reside in the peripheral arm of the L-shaped structure. The NADH oxidation domain harbouring the FMN cofactor is connected via a chain of iron-sulphur clusters to the ubiquinone reduction site that is located in a large pocket formed by the PSST and 49kDa subunits of complex I []. This entry represents the ESSS subunit from mitochondrial NADH:ubiquinone oxidoreductase (complex I). It carries mitochondrial import sequences [].
Probab=33.02 E-value=62 Score=20.94 Aligned_cols=19 Identities=16% Similarity=-0.013 Sum_probs=7.8
Q ss_pred HHHHHHHHHHHHHHHHHHH
Q psy6549 86 LIMIIVIESMTLSVMLYLK 104 (109)
Q Consensus 86 l~~ii~~l~l~i~~viy~k 104 (109)
+|+.+.+.+++.+++++|+
T Consensus 63 ~f~~~~~~~v~~~~~~~y~ 81 (105)
T PF10183_consen 63 FFFGFSGSLVFGGVFLAYK 81 (105)
T ss_pred HHHHHHHHHHHHHHHHHcC
Confidence 3333333334444444443
No 251
>PF15145 DUF4577: Domain of unknown function (DUF4577)
Probab=32.86 E-value=68 Score=21.57 Aligned_cols=22 Identities=18% Similarity=0.550 Sum_probs=12.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH
Q psy6549 84 LFLIMIIVIESMTLSVMLYLKI 105 (109)
Q Consensus 84 ~il~~ii~~l~l~i~~viy~k~ 105 (109)
.++++.|++|++.+.+|.|.-|
T Consensus 62 lffvglii~LivSLaLVsFvIF 83 (128)
T PF15145_consen 62 LFFVGLIIVLIVSLALVSFVIF 83 (128)
T ss_pred ehHHHHHHHHHHHHHHHHHHHH
Confidence 4555666666555555544443
No 252
>PF09577 Spore_YpjB: Sporulation protein YpjB (SpoYpjB); InterPro: IPR014231 Proteins in thie entry, typified by YpjB, are restricted to a subset of the endospore-forming bacteria which includes Bacillus species, but not species. In Bacillus subtilis, ypjB was found to be part of the sigma-E regulon []. Sigma-E is a sporulation sigma factor that regulates expression in the mother cell compartment. Null mutants of ypjB show a sporulation defect, but this gene is not, however, a part of the endospore formation minimal gene set.
Probab=32.82 E-value=2.1e+02 Score=21.36 Aligned_cols=23 Identities=22% Similarity=0.059 Sum_probs=14.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHh
Q psy6549 84 LFLIMIIVIESMTLSVMLYLKIF 106 (109)
Q Consensus 84 ~il~~ii~~l~l~i~~viy~k~~ 106 (109)
|+++.+-.++++++.+|.|.|+.
T Consensus 201 Wv~l~iG~iIi~tLtYvGwRKYr 223 (232)
T PF09577_consen 201 WVMLSIGGIIIATLTYVGWRKYR 223 (232)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 33333334455677888888876
No 253
>TIGR02866 CoxB cytochrome c oxidase, subunit II. Cytochrome c oxidase is the terminal electron acceptor of mitochondria (and one of several possible acceptors in prokaryotes) in the electron transport chain of aerobic respiration. The enzyme couples the oxidation of reduced cytochrome c with the reduction of molecular oxygen to water. This process results in the pumping of four protons across the membrane which are used in the proton gradient powered synthesis of ATP. The oxidase contains two heme a cofactors and three copper atoms as well as other bound ions.
Probab=32.77 E-value=67 Score=22.92 Aligned_cols=21 Identities=14% Similarity=0.314 Sum_probs=8.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q psy6549 85 FLIMIIVIESMTLSVMLYLKI 105 (109)
Q Consensus 85 il~~ii~~l~l~i~~viy~k~ 105 (109)
+++++++++++..++++|+-+
T Consensus 16 ~~~i~~iI~v~V~~~l~~~~~ 36 (201)
T TIGR02866 16 LLAVATTISLLVAALLAYVVW 36 (201)
T ss_pred HHHHHHHHHHHHHHHHHHhhh
Confidence 344444444444444444433
No 254
>PF00517 GP41: Retroviral envelope protein; InterPro: IPR000328 This entry represents envelope proteins from a variety of retroviruses. It includes the GP41 subunit of the envelope protein complex from Human immunodeficiency virus (HIV) and Simian-Human immunodeficiency virus (SIV), which mediate membrane fusion during viral entry []. It has a core composed of a six-helix bundle and is folded by its trimeric N- and C-terminal heptad-repeats (NHR and CHR) []. Derivatives of this protein prevent HIV-1 from entering cell lines and primary human CD4+ cells in vitro [], making it an attractive subject of gene therapy studies against HIV and related retroviruses. The entry also represents envelop proteins from Bovine immunodeficiency virus, Feline immunodeficiency virus and Equine infectious anemia virus (EIAV) [, ], as well as the Gp36 protein from Mouse mammary tumor virus (MMTV) and Human endogenous retrovirus (HERV).; GO: 0005198 structural molecule activity, 0019031 viral envelope; PDB: 2EZO_B 2EZQ_B 2EZR_A 2JNR_B 1F23_D 2EZP_A 1JEK_A 2Q7C_A 2Q5U_A 2Q3I_A ....
Probab=32.68 E-value=1.9e+02 Score=20.86 Aligned_cols=60 Identities=12% Similarity=-0.010 Sum_probs=27.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHH
Q psy6549 26 SEQAALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSRRMIQAIRRRVFYNKLFLIMIIVIE 93 (109)
Q Consensus 26 s~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~m~rr~~~~K~il~~ii~~l 93 (109)
=.+-....++.=.+.+++-..|.|.-+..-.++++-+ ++...-.+.+.+ ++|.+++++++
T Consensus 106 W~~~i~~~~~~i~~ll~~a~~qqe~n~~~l~~Ld~w~-~l~~wfdit~W~-------~~Iki~i~iv~ 165 (204)
T PF00517_consen 106 WEKEISNYTGNIYNLLEEAQNQQEKNEQDLLKLDSWT-NLWSWFDITKWL-------WYIKIFIMIVI 165 (204)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHTTTH-HHHHHHCHHHHH-------HHHHH------
T ss_pred HHHHhcccHHHHHHHHHHHHhchhhhhhhhcCCcHHh-hhhhHHhHHHHH-------HHHHHHHHHHH
Confidence 3344444444445556666667766666555555522 222233333333 55656555554
No 255
>PF05781 MRVI1: MRVI1 protein; InterPro: IPR008677 This family consists of mammalian MRVI1 proteins which are related to the lymphoid-restricted membrane protein (JAW1) and the IP3 receptor associated cGMP kinase substrates A and B (IRAGA and IRAGB). The function of MRVI1 is unknown although mutations in the Mrvi1 gene induces myeloid leukaemia by altering the expression of a gene important for myeloid cell growth and/or differentiation so it has been speculated that Mrvi1 is a tumour suppressor gene []. IRAG is very similar in sequence to MRVI1 and is an essential NO/cGKI-dependent regulator of IP3-induced calcium release. Activation of cGKI decreases IP3-stimulated elevations in intracellular calcium, induces smooth muscle relaxation and contributes to the antiproliferative and pro-apoptotic effects of NO/cGMP []. Jaw1 is a member of a class of proteins with COOH-terminal hydrophobic membrane anchors and is structurally similar to proteins involved in vesicle targeting and fusion. This suggests that the function and/or the structure of the ER in lymphocytes may be modified by lymphoid-restricted resident ER proteins [].
Probab=32.64 E-value=1.9e+02 Score=24.46 Aligned_cols=27 Identities=7% Similarity=-0.005 Sum_probs=14.5
Q ss_pred HHHHHHHHHHHHHHHHHhhHHHHHHHH
Q psy6549 64 HVSRSRRMIQAIRRRVFYNKLFLIMII 90 (109)
Q Consensus 64 ~l~~a~~~l~~m~rr~~~~K~il~~ii 90 (109)
-+...-..|+.+..+.++-=||..++|
T Consensus 461 rves~~~~Lk~s~pKanK~LWIsvAli 487 (538)
T PF05781_consen 461 RVESWASYLKTSFPKANKVLWISVALI 487 (538)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344444557777887773333333333
No 256
>PF10661 EssA: WXG100 protein secretion system (Wss), protein EssA; InterPro: IPR018920 The Wss (WXG100 protein secretion system) in Staphylococcus aureus seems to be encoded by a locus of eight ORFs, called ess (eSAT-6 secretion system) []. This locus encodes, amongst several other proteins, EssA, a protein predicted to possess one transmembrane domain. Due to its predicted membrane location and its absolute requirement for WXG100 protein secretion, it has been speculated that EssA could form a secretion apparatus in conjunction with YukC and YukAB. Proteins homologous to EssA, YukC, EsaA and YukD were absent from mycobacteria []. Members of this family are associated with type VII secretion of WXG100 family targets in the Firmicutes, but not in the Actinobacteria. This highly divergent protein family consists largely of a central region of highly polar low-complexity sequence containing occasional LF motifs in weak repeats about 17 residues in length, flanked by hydrophobic N- and C-terminal regions.
Probab=32.58 E-value=81 Score=21.75 Aligned_cols=22 Identities=18% Similarity=0.076 Sum_probs=9.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH
Q psy6549 84 LFLIMIIVIESMTLSVMLYLKI 105 (109)
Q Consensus 84 ~il~~ii~~l~l~i~~viy~k~ 105 (109)
+..++++++++.+.++++..++
T Consensus 122 ~~~i~g~ll~i~~giy~~~r~~ 143 (145)
T PF10661_consen 122 LLSIGGILLAICGGIYVVLRKV 143 (145)
T ss_pred HHHHHHHHHHHHHHHHHHHHHh
Confidence 4444433333344444444444
No 257
>PF15361 RIC3: Resistance to inhibitors of cholinesterase homologue 3
Probab=32.37 E-value=60 Score=22.54 Aligned_cols=22 Identities=27% Similarity=0.264 Sum_probs=9.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccC
Q psy6549 85 FLIMIIVIESMTLSVMLYLKIFKKR 109 (109)
Q Consensus 85 il~~ii~~l~l~i~~viy~k~~~~~ 109 (109)
-+|.|.+++ .++..++|++++|
T Consensus 87 PlYtiGI~~---f~lY~l~Ki~~~k 108 (152)
T PF15361_consen 87 PLYTIGIVL---FILYTLFKIKKKK 108 (152)
T ss_pred HHHHHHHHH---HHHHHHHHHHhcC
Confidence 345555433 2233444544443
No 258
>TIGR00206 fliF flagellar basal-body M-ring protein/flagellar hook-basal body protein (fliF). Component of the M (cytoplasmic associated) ring, one of four rings (L,P,S,M) which make up the flagellar hook-basal body which is a major portion of the flagellar organelle. Although the basic structure of the flagella appears to be similar for all bacteria, additional rings and structures surrounding the basal body have been observed for some bacteria (eg Vibrio cholerae and Treponema pallidum).
Probab=32.13 E-value=1.5e+02 Score=24.85 Aligned_cols=22 Identities=18% Similarity=0.187 Sum_probs=11.4
Q ss_pred hhHHHHHHHHHHHHHHHHHHHH
Q psy6549 81 YNKLFLIMIIVIESMTLSVMLY 102 (109)
Q Consensus 81 ~~K~il~~ii~~l~l~i~~viy 102 (109)
..|++++++++++++++++++|
T Consensus 22 ~qk~~l~~~~~~vv~~~~~~~~ 43 (555)
T TIGR00206 22 VQKIALILAGFAIIAALVFLIL 43 (555)
T ss_pred hhHHHHHHHHHHHHHHHHHHHH
Confidence 4566665555555444443333
No 259
>PF10032 Pho88: Phosphate transport (Pho88); InterPro: IPR019263 This entry represents proteins involved in inorganic phosphate transport, as well as telomere length regulation and maintenance [, , , ].
Probab=32.12 E-value=78 Score=22.94 Aligned_cols=24 Identities=17% Similarity=0.189 Sum_probs=14.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHhccC
Q psy6549 86 LIMIIVIESMTLSVMLYLKIFKKR 109 (109)
Q Consensus 86 l~~ii~~l~l~i~~viy~k~~~~~ 109 (109)
.|++.-++++++.+.+|+++.+|+
T Consensus 35 ~y~~s~~i~~~~y~yi~~~I~~kn 58 (192)
T PF10032_consen 35 AYVASQLIILGVYLYIFSKIKKKN 58 (192)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhcC
Confidence 344444555666667777777653
No 260
>TIGR03147 cyt_nit_nrfF cytochrome c nitrite reductase, accessory protein NrfF.
Probab=31.68 E-value=95 Score=21.06 Aligned_cols=25 Identities=16% Similarity=0.377 Sum_probs=13.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcc
Q psy6549 83 KLFLIMIIVIESMTLSVMLYLKIFKK 108 (109)
Q Consensus 83 K~il~~ii~~l~l~i~~viy~k~~~~ 108 (109)
-+++|+.=++++++ ++++++...|+
T Consensus 101 t~~LW~~P~lll~~-G~~~~~~~~rr 125 (126)
T TIGR03147 101 TLLLWLLPVLLLLL-AFVLLWRVRRR 125 (126)
T ss_pred hHHHHHHHHHHHHH-HHHHHHHHHhc
Confidence 46788776655444 33344444444
No 261
>PTZ00370 STEVOR; Provisional
Probab=31.56 E-value=71 Score=24.81 Aligned_cols=22 Identities=14% Similarity=0.285 Sum_probs=12.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHhc
Q psy6549 86 LIMIIVIESMTLSVMLYLKIFK 107 (109)
Q Consensus 86 l~~ii~~l~l~i~~viy~k~~~ 107 (109)
++.+|+.++++|.+|-.|+-.|
T Consensus 262 lvllil~vvliilYiwlyrrRK 283 (296)
T PTZ00370 262 LVLLILAVVLIILYIWLYRRRK 283 (296)
T ss_pred HHHHHHHHHHHHHHHHHHHhhc
Confidence 4445555556677775555443
No 262
>PRK11875 psbT photosystem II reaction center protein T; Reviewed
Probab=31.48 E-value=71 Score=16.41 Aligned_cols=11 Identities=18% Similarity=0.498 Sum_probs=4.6
Q ss_pred HHHHHHHHHHh
Q psy6549 96 TLSVMLYLKIF 106 (109)
Q Consensus 96 ~i~~viy~k~~ 106 (109)
.++++++--||
T Consensus 13 tlgiiFFAIfF 23 (31)
T PRK11875 13 ALVTLFFAIAF 23 (31)
T ss_pred HHHHHHHhhhc
Confidence 34444444444
No 263
>PF03381 CDC50: LEM3 (ligand-effect modulator 3) family / CDC50 family; InterPro: IPR005045 Members of this family have no known function. They have predicted transmembrane helices.; GO: 0016020 membrane
Probab=31.33 E-value=80 Score=23.93 Aligned_cols=7 Identities=0% Similarity=0.316 Sum_probs=2.6
Q ss_pred HHHHHHH
Q psy6549 96 TLSVMLY 102 (109)
Q Consensus 96 ~i~~viy 102 (109)
+++++++
T Consensus 260 ~i~~~~~ 266 (278)
T PF03381_consen 260 AIIFLII 266 (278)
T ss_pred HHHHHHH
Confidence 3333333
No 264
>PF11353 DUF3153: Protein of unknown function (DUF3153); InterPro: IPR021499 This family of proteins with unknown function appear to be restricted to Cyanobacteria. Some members are annotated as membrane proteins however this cannot be confirmed.
Probab=31.27 E-value=71 Score=22.97 Aligned_cols=14 Identities=7% Similarity=0.102 Sum_probs=6.9
Q ss_pred HHHHHHHHHHHhcc
Q psy6549 95 MTLSVMLYLKIFKK 108 (109)
Q Consensus 95 l~i~~viy~k~~~~ 108 (109)
+++++.+|.+..|.
T Consensus 194 ~l~~~~~~l~~~r~ 207 (209)
T PF11353_consen 194 LLILLGFLLRRRRL 207 (209)
T ss_pred HHHHHHHHHHHhhc
Confidence 33444456655543
No 265
>PRK11637 AmiB activator; Provisional
Probab=31.22 E-value=2.7e+02 Score=22.13 Aligned_cols=67 Identities=7% Similarity=-0.030 Sum_probs=32.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHHHHHHHHhhHHHHHH
Q psy6549 22 SIVRSEQAALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSRRMIQAIRRRVFYNKLFLIM 88 (109)
Q Consensus 22 ~L~~s~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~m~rr~~~~K~il~~ 88 (109)
.+.+...-+.+.+.-=..+-.++....+.|..+...+......+.....-+..-...+.....-+|-
T Consensus 69 ~~~~~~~~l~~l~~qi~~~~~~i~~~~~~i~~~~~ei~~l~~eI~~~q~~l~~~~~~l~~rlra~Y~ 135 (428)
T PRK11637 69 QRASLLAQLKKQEEAISQASRKLRETQNTLNQLNKQIDELNASIAKLEQQQAAQERLLAAQLDAAFR 135 (428)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3333333333333333333444445555555555555555555555555555555555555455554
No 266
>PF10577 UPF0560: Uncharacterised protein family UPF0560; InterPro: IPR018890 This family of proteins has no known function.
Probab=31.03 E-value=59 Score=28.65 Aligned_cols=21 Identities=24% Similarity=0.135 Sum_probs=12.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q psy6549 84 LFLIMIIVIESMTLSVMLYLK 104 (109)
Q Consensus 84 ~il~~ii~~l~l~i~~viy~k 104 (109)
.|+.++++++++++++++||=
T Consensus 277 ~ILG~~~livl~lL~vLl~yC 297 (807)
T PF10577_consen 277 AILGGTALIVLILLCVLLCYC 297 (807)
T ss_pred HHHHHHHHHHHHHHHHHHHhh
Confidence 355556666666666666653
No 267
>PRK09458 pspB phage shock protein B; Provisional
Probab=31.03 E-value=85 Score=19.43 Aligned_cols=18 Identities=11% Similarity=0.117 Sum_probs=9.4
Q ss_pred HHHHHHHHHHHHHHHHHh
Q psy6549 89 IIVIESMTLSVMLYLKIF 106 (109)
Q Consensus 89 ii~~l~l~i~~viy~k~~ 106 (109)
+-+++++++++.+|+.++
T Consensus 8 ~PliiF~ifVaPiWL~LH 25 (75)
T PRK09458 8 IPLTIFVLFVAPIWLWLH 25 (75)
T ss_pred HhHHHHHHHHHHHHHHHh
Confidence 333333455556676665
No 268
>PF04639 Baculo_E56: Baculoviral E56 protein, specific to ODV envelope; InterPro: IPR006733 This family represents the E56 protein, which is localized to the occlusion derived virus (ODV) envelope, but not to the budded virus (BV) envelope []. Signals necessary for transport and/or retention into this structure are believed to be found within the C-terminal portion of ODV-E56.; GO: 0019031 viral envelope
Probab=30.84 E-value=45 Score=25.88 Aligned_cols=27 Identities=15% Similarity=0.263 Sum_probs=19.5
Q ss_pred HhhHHHHHHHHHHHHHHHHHHHHHHHh
Q psy6549 80 FYNKLFLIMIIVIESMTLSVMLYLKIF 106 (109)
Q Consensus 80 ~~~K~il~~ii~~l~l~i~~viy~k~~ 106 (109)
..++++-++.++-.++++++|.|+-|+
T Consensus 273 ~s~~l~piil~IG~vl~i~~Ig~~ifK 299 (305)
T PF04639_consen 273 VSDSLLPIILIIGGVLLIVFIGYFIFK 299 (305)
T ss_pred hhhhhhHHHHHHHHHHHHHHhhheeeE
Confidence 456777788887777777777776554
No 269
>TIGR02185 Trep_Strep conserved hypothetical integral membrane protein TIGR02185. This family consists of strongly hydrophobic proteins about 190 amino acids in length with a strongly basic motif near the C-terminus. If is found in rather few species, but in paralogous families of 12 members in the oral pathogenic spirochaete Treponema denticola and 2 in Streptococcus pneumoniae R6.
Probab=30.84 E-value=72 Score=22.70 Aligned_cols=29 Identities=14% Similarity=0.206 Sum_probs=19.0
Q ss_pred hhHHH-HHHHHHHHHHHHHHHHHHHHhccC
Q psy6549 81 YNKLF-LIMIIVIESMTLSVMLYLKIFKKR 109 (109)
Q Consensus 81 ~~K~i-l~~ii~~l~l~i~~viy~k~~~~~ 109 (109)
...++ ...++.+++.+++..+-.|+.|||
T Consensus 157 ~~~~~~~~~~~t~v~~~iG~~iG~kllkKH 186 (189)
T TIGR02185 157 SAIWAVIMIVLTAVAGIAGVLIGKKLLKKH 186 (189)
T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 44433 344455666677778889998886
No 270
>PLN03094 Substrate binding subunit of ER-derived-lipid transporter; Provisional
Probab=30.54 E-value=62 Score=25.82 Aligned_cols=30 Identities=7% Similarity=0.201 Sum_probs=19.3
Q ss_pred HHHHHHhhHHHHHHHHHHHHHHHHHHHHHH
Q psy6549 75 IRRRVFYNKLFLIMIIVIESMTLSVMLYLK 104 (109)
Q Consensus 75 m~rr~~~~K~il~~ii~~l~l~i~~viy~k 104 (109)
|.||..+...+=.++++.++++.+++.|++
T Consensus 78 ~~rrsvrEg~VGlfvL~gi~ll~~~~~~L~ 107 (370)
T PLN03094 78 FGKRSVWEGGVGLFLLSGAALLALTLAWLR 107 (370)
T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHc
Confidence 778888887776666655555555555544
No 271
>PHA02955 hypothetical protein; Provisional
Probab=30.38 E-value=52 Score=24.35 Aligned_cols=9 Identities=44% Similarity=0.755 Sum_probs=3.5
Q ss_pred hHHHHHHHH
Q psy6549 82 NKLFLIMII 90 (109)
Q Consensus 82 ~K~il~~ii 90 (109)
.+|++++++
T Consensus 179 ~~w~ii~~v 187 (213)
T PHA02955 179 IKWFIIYIV 187 (213)
T ss_pred CcchhHHHH
Confidence 344443333
No 272
>KOG2067|consensus
Probab=30.38 E-value=1.3e+02 Score=24.76 Aligned_cols=45 Identities=16% Similarity=0.175 Sum_probs=30.0
Q ss_pred HHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHhhHHHHHHhhhHhh
Q psy6549 14 QALVRTGESIVRSEQAAL-----EAEAIGTGVISELGVQREALVRARDRLVD 60 (109)
Q Consensus 14 ~~l~~t~~~L~~s~~~~~-----ete~iG~~il~~L~~Qre~L~~~~~~~~~ 60 (109)
+.++++-.+|..+.-+-. .-|++|.+|+.. ++|.+....-+++++
T Consensus 369 ~el~RAK~qlkS~LlMNLESR~V~~EDvGRQVL~~--g~rk~p~e~~~~Ie~ 418 (472)
T KOG2067|consen 369 EELERAKTQLKSMLLMNLESRPVAFEDVGRQVLTT--GERKPPDEFIKKIEQ 418 (472)
T ss_pred HHHHHHHHHHHHHHHhcccccchhHHHHhHHHHhc--cCcCCHHHHHHHHHh
Confidence 445555555554444433 459999999987 888777777666665
No 273
>PF06363 Picorna_P3A: Picornaviridae P3A protein; InterPro: IPR009419 The viral polyprotein of parechoviruses contains: coat protein VP0 (P1AB); coat protein VP3 (P1C); coat protein VP1 (P1D); picornain 2A (3.4.22.29 from EC, core protein P2A); core protein P2B; core protein P2C; core protein P3A; genome-linked protein VPg (P3B); picornain 3C (3.4.22.28 from EC, MEROPS peptidase subfamily 3CF: parechovirus picornain 3C (P3C))[]. This entry consists of the parechovirus P3A protein. P3A has been identified as a genome-linked protein (VPg), which is involved in replication [].; GO: 0019012 virion
Probab=30.24 E-value=1.6e+02 Score=19.11 Aligned_cols=40 Identities=25% Similarity=0.369 Sum_probs=20.6
Q ss_pred HHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcc
Q psy6549 69 RRMIQAIRRRVFYNKLFLIMIIVIESMTLSVMLYLKIFKK 108 (109)
Q Consensus 69 ~~~l~~m~rr~~~~K~il~~ii~~l~l~i~~viy~k~~~~ 108 (109)
...+++|..=..+||--+..+-++...+-++.+.++++||
T Consensus 55 ~~k~k~~~~FV~RNk~W~T~~S~~tS~isIL~LV~~~~KK 94 (100)
T PF06363_consen 55 KNKMKSMLSFVERNKAWFTVVSAVTSFISILLLVTKIFKK 94 (100)
T ss_pred HHHHHHHHHHHHHcchHhhHHHHHHHHHHHHHHHHHHHhh
Confidence 3455666666677776554444444333333444455443
No 274
>PF15339 Afaf: Acrosome formation-associated factor
Probab=30.23 E-value=2e+02 Score=20.83 Aligned_cols=36 Identities=3% Similarity=0.234 Sum_probs=21.9
Q ss_pred HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy6549 72 IQAIRRRVFYNKLFLIMIIVIESMTLSVMLYLKIFK 107 (109)
Q Consensus 72 l~~m~rr~~~~K~il~~ii~~l~l~i~~viy~k~~~ 107 (109)
+..+.-+.+.--.+|.+++++.++++++...||+++
T Consensus 125 le~~KlkLmLGIsLmTl~lfv~Ll~~c~atlyklk~ 160 (200)
T PF15339_consen 125 LEELKLKLMLGISLMTLFLFVILLAFCSATLYKLKH 160 (200)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 334444444444556666666677777777777753
No 275
>PF12259 DUF3609: Protein of unknown function (DUF3609); InterPro: IPR022048 This domain family is found in eukaryotes and viruses, and is typically between 348 and 360 amino acids in length.
Probab=30.02 E-value=33 Score=27.22 Aligned_cols=21 Identities=19% Similarity=0.344 Sum_probs=9.8
Q ss_pred HHHHHHHHHHHHHHHHHHhcc
Q psy6549 88 MIIVIESMTLSVMLYLKIFKK 108 (109)
Q Consensus 88 ~ii~~l~l~i~~viy~k~~~~ 108 (109)
+.+++.++++.++.||.++++
T Consensus 305 ~~~vli~vl~~~~~~~~~~~~ 325 (361)
T PF12259_consen 305 GAIVLIIVLISLAWLYRTFRR 325 (361)
T ss_pred HHHHHHHHHHHHHhheeehHH
Confidence 333333344555555555544
No 276
>PF02936 COX4: Cytochrome c oxidase subunit IV; InterPro: IPR004203 Cytochrome c oxidase, a 13 sub-unit complex (1.9.3.1 from EC) is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit IV. The Dictyostelium discoideum (Slime mould) member of this family is called COX VI. The Saccharomyces cerevisiae protein YGX6_YEAST appears to be the yeast COX IV subunit.; GO: 0004129 cytochrome-c oxidase activity; PDB: 3ABK_Q 3AG1_Q 3ASN_Q 1OCZ_D 2EIN_Q 2OCC_D 2YBB_O 3AG3_D 1OCO_Q 1V55_Q ....
Probab=29.97 E-value=1.1e+02 Score=21.02 Aligned_cols=24 Identities=4% Similarity=0.127 Sum_probs=14.0
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHH
Q psy6549 81 YNKLFLIMIIVIESMTLSVMLYLK 104 (109)
Q Consensus 81 ~~K~il~~ii~~l~l~i~~viy~k 104 (109)
-.|.++.++++++.+..+++++.+
T Consensus 73 ewk~v~~~~~~~i~~s~~l~~~~r 96 (142)
T PF02936_consen 73 EWKKVFGGVFIFIGFSVLLFIWQR 96 (142)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred chHHHHHHHHHHHHHHHHHHHHHH
Confidence 345666666666655555555554
No 277
>COG3167 PilO Tfp pilus assembly protein PilO [Cell motility and secretion / Intracellular trafficking and secretion]
Probab=29.88 E-value=77 Score=23.30 Aligned_cols=22 Identities=23% Similarity=0.475 Sum_probs=12.6
Q ss_pred hHHHHHHHHHHHHHHHHHHHHH
Q psy6549 82 NKLFLIMIIVIESMTLSVMLYL 103 (109)
Q Consensus 82 ~K~il~~ii~~l~l~i~~viy~ 103 (109)
-|+++|..+.+++++.++-+|+
T Consensus 23 ~rlv~~lL~~~~V~~lGy~f~~ 44 (211)
T COG3167 23 PRLVFCLLAVAAVLGLGYAFYL 44 (211)
T ss_pred HHHHHHHHHHHHHHHHHHHHHh
Confidence 4566666666665555555554
No 278
>PRK09793 methyl-accepting protein IV; Provisional
Probab=29.71 E-value=3.1e+02 Score=22.34 Aligned_cols=66 Identities=8% Similarity=0.118 Sum_probs=39.9
Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHHHHHHHHH
Q psy6549 11 ETNQALVRTGESIVRSEQAALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSRRMIQAIR 76 (109)
Q Consensus 11 ~~~~~l~~t~~~L~~s~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~m~ 76 (109)
.+.+..+.+...+.+....+.++.+.-.++......|.+....+...+.+++............+.
T Consensus 426 ~~~~~~~~~~~~~~~i~~~~~~~~~~~~~i~~~~~e~~~~~~~i~~~i~~i~~~~~~~~~~~~~~~ 491 (533)
T PRK09793 426 QGSKLVNNAAATMTDIVSSVTRVNDIMGEIASASEEQRRGIEQVAQAVSQMDQVTQQNASLVEEAA 491 (533)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344444555555565556666666666666677777777777777777776665555444444433
No 279
>PRK15053 dpiB sensor histidine kinase DpiB; Provisional
Probab=29.65 E-value=1.3e+02 Score=24.08 Aligned_cols=22 Identities=18% Similarity=0.422 Sum_probs=12.4
Q ss_pred HHHHHHHHHhhHHHHHHHHHHH
Q psy6549 72 IQAIRRRVFYNKLFLIMIIVIE 93 (109)
Q Consensus 72 l~~m~rr~~~~K~il~~ii~~l 93 (109)
+..|.+.-++.|++++.+++++
T Consensus 10 ~~~~~~~~i~~kl~~~~~~~i~ 31 (545)
T PRK15053 10 FAFFQRLAFPLRIFLLILVFSI 31 (545)
T ss_pred HHHHhcccHHHHHHHHHHHHHH
Confidence 4445555577787774444433
No 280
>PF01034 Syndecan: Syndecan domain; InterPro: IPR001050 The syndecans are transmembrane proteoglycans which are involved in the organisation of cytoskeleton and/or actin microfilaments, and have important roles as cell surface receptors during cell-cell and/or cell-matrix interactions [, ]. Structurally, these proteins consist of four separate domains: A signal sequence; An extracellular domain (ectodomain) of variable length whose sequence is not evolutionary conserved in the various forms of syndecans. The ectodomain contains the sites of attachment of the heparan sulphate glycosaminoglycan side chains; A transmembrane region; A highly conserved cytoplasmic domain of about 30 to 35 residues, which could interact with cytoskeletal proteins. The proteins known to belong to this family are: Syndecan 1. Syndecan 2 or fibroglycan. Syndecan 3 or neuroglycan or N-syndecan. Syndecan 4 or amphiglycan or ryudocan. Drosophila syndecan. Caenorhabditis elegans probable syndecan (F57C7.3). Syndecan-4, a transmembrane heparan sulphate proteoglycan, is a coreceptor with integrins in cell adhesion. It has been suggested to form a ternary signalling complex with protein kinase Calpha and phosphatidylinositol 4,5-bisphosphate (PIP2). Structural studies have demonstrated that the cytoplasmic domain undergoes a conformational transition and forms a symmetric dimer in the presence of phospholipid activator PIP2, and whose overall structure in solution exhibits a twisted clamp shape having a cavity in the centre of dimeric interface. In addition, it has been observed that the syndecan-4 variable domain interacts, strongly, not only with fatty acyl groups but also the anionic head group of PIP2. These findings indicate that PIP2 promotes oligomerisation of the syndecan-4 cytoplasmic domain for transmembrane signalling and cell-matrix adhesion [, ].; GO: 0008092 cytoskeletal protein binding, 0016020 membrane; PDB: 1EJQ_B 1EJP_B 1YBO_C 1OBY_Q.
Probab=29.57 E-value=18 Score=21.74 Aligned_cols=6 Identities=17% Similarity=0.251 Sum_probs=0.4
Q ss_pred HHHHhc
Q psy6549 102 YLKIFK 107 (109)
Q Consensus 102 y~k~~~ 107 (109)
.|++.|
T Consensus 33 iyR~rk 38 (64)
T PF01034_consen 33 IYRMRK 38 (64)
T ss_dssp ----S-
T ss_pred HHHHHh
Confidence 334433
No 281
>PF13179 DUF4006: Family of unknown function (DUF4006)
Probab=29.55 E-value=87 Score=18.93 Aligned_cols=15 Identities=20% Similarity=0.463 Sum_probs=6.7
Q ss_pred HHHHHHHHHHHHHHH
Q psy6549 87 IMIIVIESMTLSVML 101 (109)
Q Consensus 87 ~~ii~~l~l~i~~vi 101 (109)
++|+++|+++|++++
T Consensus 16 ~LIAvvLLLsIl~~l 30 (66)
T PF13179_consen 16 MLIAVVLLLSILAFL 30 (66)
T ss_pred HHHHHHHHHHHHHHH
Confidence 344444444444443
No 282
>PRK06654 fliL flagellar basal body-associated protein FliL; Reviewed
Probab=29.54 E-value=70 Score=23.06 Aligned_cols=13 Identities=15% Similarity=0.199 Sum_probs=6.1
Q ss_pred HHHHHHHHHHHHH
Q psy6549 83 KLFLIMIIVIESM 95 (109)
Q Consensus 83 K~il~~ii~~l~l 95 (109)
|++-|..|++..+
T Consensus 32 k~l~~~~i~~~a~ 44 (181)
T PRK06654 32 KILQWVAIGLFAV 44 (181)
T ss_pred HHHHHHHHHHHHH
Confidence 4455555544433
No 283
>PRK05287 hypothetical protein; Provisional
Probab=29.35 E-value=2.5e+02 Score=21.23 Aligned_cols=53 Identities=13% Similarity=0.198 Sum_probs=37.0
Q ss_pred HHHHHHHHHHHHhhHHHHHH-------hhhHhhhHHHHHHHHHHHHHHH---HHHHhhHHHHH
Q psy6549 35 AIGTGVISELGVQREALVRA-------RDRLVDTDLHVSRSRRMIQAIR---RRVFYNKLFLI 87 (109)
Q Consensus 35 ~iG~~il~~L~~Qre~L~~~-------~~~~~~i~~~l~~a~~~l~~m~---rr~~~~K~il~ 87 (109)
++=.+++.+|.+|+.+|..- ...++.+-++++.+..-|..+. .....|+|+..
T Consensus 58 DlKsdLlKeLerq~~~L~~~~~~p~vd~~~l~~~l~~l~~~~~~L~~~~r~Gq~Lrede~L~s 120 (250)
T PRK05287 58 DLKSDLLKELERQRQKLQKWRGNPGVDQEALEALLQELEQASAALNAAPRIGQSLREDRFLSS 120 (250)
T ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCcCHHHHHHHHHHHHHHHHHHHhcccccchhhhhHHHHH
Confidence 44568999999999999863 3445666677777777776655 23566777654
No 284
>PRK05585 yajC preprotein translocase subunit YajC; Validated
Probab=29.20 E-value=68 Score=20.90 Aligned_cols=9 Identities=22% Similarity=0.574 Sum_probs=3.6
Q ss_pred HHHHHHHHH
Q psy6549 97 LSVMLYLKI 105 (109)
Q Consensus 97 i~~viy~k~ 105 (109)
+++++|+..
T Consensus 27 i~~i~yf~~ 35 (106)
T PRK05585 27 FFAIFYFLI 35 (106)
T ss_pred HHHHHHHHh
Confidence 333444433
No 285
>PF06789 UPF0258: Uncharacterised protein family (UPF0258); InterPro: IPR009626 This is a group of proteins of unknown function.
Probab=28.94 E-value=67 Score=22.67 Aligned_cols=27 Identities=7% Similarity=0.220 Sum_probs=15.8
Q ss_pred HHHHHHHHHHHHhhHHHHHHHHHHHHH
Q psy6549 69 RRMIQAIRRRVFYNKLFLIMIIVIESM 95 (109)
Q Consensus 69 ~~~l~~m~rr~~~~K~il~~ii~~l~l 95 (109)
+.+|+.=.....+.|++.++.++++++
T Consensus 117 DsLLKkKEae~kr~K~Cki~~Li~~~v 143 (159)
T PF06789_consen 117 DSLLKKKEAELKRSKVCKIFALIVLAV 143 (159)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 355666666667777776644444433
No 286
>PF07072 DUF1342: Protein of unknown function (DUF1342); InterPro: IPR009777 This family consists of several hypothetical bacterial proteins of around 250 residues in length. Members of this family are often known as YacF after the Escherichia coli protein P36680 from SWISSPROT. The function of this family is unknown.; PDB: 2OEZ_A.
Probab=28.89 E-value=2.3e+02 Score=20.69 Aligned_cols=52 Identities=15% Similarity=0.225 Sum_probs=34.0
Q ss_pred HHHHHHHHHHHHhhHHHHHH-------hhhHhhhHHHHHHHHHHHHHHH----HHHHhhHHHH
Q psy6549 35 AIGTGVISELGVQREALVRA-------RDRLVDTDLHVSRSRRMIQAIR----RRVFYNKLFL 86 (109)
Q Consensus 35 ~iG~~il~~L~~Qre~L~~~-------~~~~~~i~~~l~~a~~~l~~m~----rr~~~~K~il 86 (109)
++=.+++.+|.+|+.+|.+- .+.++.+-+.++.+..-|..+. ....-|+|+.
T Consensus 40 DlK~eLlKELeRq~~~L~~~~~~p~vd~~~l~~~l~~l~~~~~~L~~~~gr~gq~Lrene~L~ 102 (211)
T PF07072_consen 40 DLKSELLKELERQRQKLNQWRDNPGVDQEALDALLQELDQALQALQQAPGRIGQHLRENEFLM 102 (211)
T ss_dssp -HHHHHHHHHHHHHHHHHCTTT-TTS-HHHHHHHHHHHHHHHHHHHH---STTHHHHC-HHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHHHHHhCcchhhhhhhHhHHHH
Confidence 35578999999999999963 3345556667777777777665 3455666654
No 287
>PRK12785 fliL flagellar basal body-associated protein FliL; Reviewed
Probab=28.83 E-value=7.2 Score=27.37 Aligned_cols=22 Identities=9% Similarity=-0.011 Sum_probs=9.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH
Q psy6549 84 LFLIMIIVIESMTLSVMLYLKI 105 (109)
Q Consensus 84 ~il~~ii~~l~l~i~~viy~k~ 105 (109)
+|+++++++++++.++..||.|
T Consensus 26 liii~~~~lll~~~g~g~~f~~ 47 (166)
T PRK12785 26 LIIIAAAAVLLLGGGGGGFFFF 47 (166)
T ss_pred EeeHHHHHHHHHhcchheEEEE
Confidence 4444444444444444455433
No 288
>PRK09821 putative transporter; Provisional
Probab=28.67 E-value=88 Score=25.63 Aligned_cols=27 Identities=22% Similarity=0.219 Sum_probs=18.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccC
Q psy6549 83 KLFLIMIIVIESMTLSVMLYLKIFKKR 109 (109)
Q Consensus 83 K~il~~ii~~l~l~i~~viy~k~~~~~ 109 (109)
..++|++++.+...++..+|.+|..||
T Consensus 177 ~~~l~Giivaip~~i~g~~~~~~~~~~ 203 (454)
T PRK09821 177 WLTIIGIAISIPVGIVGYFAAKIINRR 203 (454)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 466677776666666777888887664
No 289
>COG2443 Sss1 Preprotein translocase subunit Sss1 [Intracellular trafficking and secretion]
Probab=28.61 E-value=1.4e+02 Score=17.97 Aligned_cols=49 Identities=12% Similarity=0.223 Sum_probs=23.4
Q ss_pred hhhHhhhHHHHHHHHHHHHHHHH---HHHhhHHHHHHHHHHHHHHHHHHHHH
Q psy6549 55 RDRLVDTDLHVSRSRRMIQAIRR---RVFYNKLFLIMIIVIESMTLSVMLYL 103 (109)
Q Consensus 55 ~~~~~~i~~~l~~a~~~l~~m~r---r~~~~K~il~~ii~~l~l~i~~viy~ 103 (109)
.+..+++.+.+.+.++.++.-.+ ..+..-.=..++-++++.++++++|.
T Consensus 5 ~~~~e~~~~~lke~~rvl~~arKP~~eEy~~~aKi~~~Gi~liG~IGfiI~l 56 (65)
T COG2443 5 MDKPEELREFLKEYRRVLKVARKPDWEEYSKIAKITGLGILLIGIIGFIIYL 56 (65)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34455566666666665543221 12222222234444555666666664
No 290
>PF10808 DUF2542: Protein of unknown function (DUF2542) ; InterPro: IPR020155 This entry represents transmembrane proteins with no known function.; GO: 0016021 integral to membrane
Probab=28.51 E-value=36 Score=21.20 Aligned_cols=22 Identities=23% Similarity=0.502 Sum_probs=13.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhc
Q psy6549 84 LFLIMIIVIESMTLSVMLYLKIFK 107 (109)
Q Consensus 84 ~il~~ii~~l~l~i~~viy~k~~~ 107 (109)
+++|+.+.++ .++.++|+-|.|
T Consensus 58 i~~Y~G~gil--~~gm~IyllFyR 79 (79)
T PF10808_consen 58 IFAYFGCGIL--SLGMIIYLLFYR 79 (79)
T ss_pred HHHHHHHHHH--HHHHhheeEEeC
Confidence 4566666665 566667776654
No 291
>PF11887 DUF3407: Protein of unknown function (DUF3407); InterPro: IPR024516 This entry represents a domain of unknown function found at the C terminus of many proteins in the mammalian cell entry family.
Probab=28.14 E-value=2.6e+02 Score=20.96 Aligned_cols=54 Identities=20% Similarity=0.086 Sum_probs=36.2
Q ss_pred HHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhh
Q psy6549 7 ETLRETNQALVRTGESIVRSEQAALEAEAIGTGVISELGVQREALVRARDRLVD 60 (109)
Q Consensus 7 ~~ll~~~~~l~~t~~~L~~s~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~ 60 (109)
..+-..-+.+.+-++.+.++.--+..+-+-...+..+|.+||++|...=..+..
T Consensus 52 ~~l~~~l~~l~~v~~~~a~aapdL~~~l~~~~~~s~tL~~~~~~L~~lL~~~~~ 105 (267)
T PF11887_consen 52 PQLREDLRNLADVADTYADAAPDLLDALDNLTTTSRTLVDQRQQLDALLLSATG 105 (267)
T ss_pred hHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHH
Confidence 334444455566666677777777777777777888888888888776544443
No 292
>PF06682 DUF1183: Protein of unknown function (DUF1183); InterPro: IPR009567 This family consists of several eukaryotic proteins of around 360 residues in length. The function of this family is unknown.
Probab=28.04 E-value=65 Score=25.23 Aligned_cols=10 Identities=20% Similarity=0.471 Sum_probs=4.5
Q ss_pred HHHHHHHHHH
Q psy6549 96 TLSVMLYLKI 105 (109)
Q Consensus 96 ~i~~viy~k~ 105 (109)
++++|+|..|
T Consensus 167 vla~ivY~~~ 176 (318)
T PF06682_consen 167 VLAFIVYSLF 176 (318)
T ss_pred HHHHHHHHHH
Confidence 3444455444
No 293
>PF11026 DUF2721: Protein of unknown function (DUF2721); InterPro: IPR021279 This family is conserved in bacteria. The function is not known.
Probab=27.74 E-value=1.9e+02 Score=19.25 Aligned_cols=23 Identities=22% Similarity=0.494 Sum_probs=13.8
Q ss_pred HHHHHHHHHHH-HhhHHHHHHHHH
Q psy6549 69 RRMIQAIRRRV-FYNKLFLIMIIV 91 (109)
Q Consensus 69 ~~~l~~m~rr~-~~~K~il~~ii~ 91 (109)
.+.++...||+ ..++.+..+++.
T Consensus 49 ~~el~~L~rR~~li~~ai~~~~~s 72 (130)
T PF11026_consen 49 RRELRILRRRARLIRRAITLATLS 72 (130)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 56677777775 555555544443
No 294
>PRK09400 secE preprotein translocase subunit SecE; Reviewed
Probab=27.70 E-value=1.4e+02 Score=17.59 Aligned_cols=48 Identities=13% Similarity=0.160 Sum_probs=27.2
Q ss_pred hHhhhHHHHHHHHHHHHHHH---HHHHhhHHHHHHHHHHHHHHHHHHHHHH
Q psy6549 57 RLVDTDLHVSRSRRMIQAIR---RRVFYNKLFLIMIIVIESMTLSVMLYLK 104 (109)
Q Consensus 57 ~~~~i~~~l~~a~~~l~~m~---rr~~~~K~il~~ii~~l~l~i~~viy~k 104 (109)
..+.+.+-+..+.++++.-. +..+..=.-..++-++++.+|++++++-
T Consensus 6 ~~e~~~~f~~d~~rvl~~~~KPd~~Ef~~ia~~~~iG~~i~G~iGf~Ikli 56 (61)
T PRK09400 6 LQENVKNFLEDYKRVLKVARKPTREEFLLVAKVTGLGILLIGLIGFIIYLI 56 (61)
T ss_pred HHHhHHHHHHHHHHHHHHhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34455566666666665533 3344444445555556666777776653
No 295
>CHL00031 psbT photosystem II protein T
Probab=27.58 E-value=73 Score=16.60 Aligned_cols=11 Identities=27% Similarity=0.736 Sum_probs=4.4
Q ss_pred HHHHHHHHHHh
Q psy6549 96 TLSVMLYLKIF 106 (109)
Q Consensus 96 ~i~~viy~k~~ 106 (109)
.++++++--||
T Consensus 13 tlgilFFAI~F 23 (33)
T CHL00031 13 TLGIIFFAIFF 23 (33)
T ss_pred HHHHHHHhhee
Confidence 34434444343
No 296
>PRK06007 fliF flagellar MS-ring protein; Reviewed
Probab=27.57 E-value=1.9e+02 Score=24.20 Aligned_cols=22 Identities=14% Similarity=0.012 Sum_probs=11.7
Q ss_pred HhhHHHHHHHHHHHHHHHHHHH
Q psy6549 80 FYNKLFLIMIIVIESMTLSVML 101 (109)
Q Consensus 80 ~~~K~il~~ii~~l~l~i~~vi 101 (109)
...|+++.++++++++++++++
T Consensus 21 ~~qk~~l~~~~~~~v~~~~~l~ 42 (542)
T PRK06007 21 KKRKIALIGAGAAVVAAIVALV 42 (542)
T ss_pred hhhHHHHHHHHHHHHHHHHHHH
Confidence 3556666665555544444433
No 297
>PF03597 CcoS: Cytochrome oxidase maturation protein cbb3-type; InterPro: IPR004714 Cytochrome cbb3 oxidases are found almost exclusively in Proteobacteria, and represent a distinctive class of proton-pumping respiratory haem-copper oxidases (HCO) that lack many of the key structural features that contribute to the reaction cycle of the intensely studied mitochondrial cytochrome c oxidase (CcO). Expression of cytochrome cbb3 oxidase allows human pathogens to colonise anoxic tissues and agronomically important diazotrophs to sustain nitrogen fixation []. Genes encoding a cytochrome cbb3 oxidase were initially designated fixNOQP (ccoNOQP), the ccoNOQP operon is always found close to a second gene cluster, known as fixGHIS (ccoGHIS) whose expression is necessary for the assembly of a functional cbb3 oxidase. On the basis of their derived amino acid sequences each of the four proteins encoded by the ccoGHIS operon are thought to be membrane-bound. It has been suggested that they may function in concert as a multi-subunit complex, possibly playing a role in the uptake and metabolism of copper required for the assembly of the binuclear centre of cytochrome cbb3 oxidase.
Probab=27.25 E-value=1.2e+02 Score=16.75 Aligned_cols=6 Identities=17% Similarity=0.107 Sum_probs=2.2
Q ss_pred HHHHHH
Q psy6549 88 MIIVIE 93 (109)
Q Consensus 88 ~ii~~l 93 (109)
.+-+++
T Consensus 8 p~sl~l 13 (45)
T PF03597_consen 8 PVSLIL 13 (45)
T ss_pred HHHHHH
Confidence 333333
No 298
>PF03896 TRAP_alpha: Translocon-associated protein (TRAP), alpha subunit; InterPro: IPR005595 The alpha-subunit of the TRAP complex (TRAP alpha) is a single-spanning membrane protein of the endoplasmic reticulum (ER) which is found in proximity of nascent polypeptide chains translocating across the membrane [].; GO: 0005783 endoplasmic reticulum
Probab=27.10 E-value=93 Score=23.92 Aligned_cols=21 Identities=14% Similarity=0.249 Sum_probs=10.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q psy6549 84 LFLIMIIVIESMTLSVMLYLK 104 (109)
Q Consensus 84 ~il~~ii~~l~l~i~~viy~k 104 (109)
+++|++++.++++.++..|..
T Consensus 208 iFLY~~l~a~~~l~l~~~~~~ 228 (285)
T PF03896_consen 208 IFLYLFLAALGVLGLYFVYQF 228 (285)
T ss_pred hhHHHHHHHHHHHHHHHHHHH
Confidence 567766555544444333333
No 299
>PHA03010 hypothetical protein; Provisional
Probab=27.09 E-value=88 Score=25.45 Aligned_cols=19 Identities=16% Similarity=0.410 Sum_probs=13.2
Q ss_pred HHHHHHHHHHHHHHHHhcc
Q psy6549 90 IVIESMTLSVMLYLKIFKK 108 (109)
Q Consensus 90 i~~l~l~i~~viy~k~~~~ 108 (109)
++++.+++++++..||.|+
T Consensus 522 mfvfsi~~i~i~~ikfl~~ 540 (546)
T PHA03010 522 MFVFSIAIIAIFAIKFLHK 540 (546)
T ss_pred HHHHHHHHHHHHHHHHHHh
Confidence 3455667777788888765
No 300
>PRK15471 chain length determinant protein WzzB; Provisional
Probab=26.69 E-value=1.8e+02 Score=22.71 Aligned_cols=33 Identities=12% Similarity=0.313 Sum_probs=15.6
Q ss_pred HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh
Q psy6549 72 IQAIRRRVFYNKLFLIMIIVIESMTLSVMLYLKIF 106 (109)
Q Consensus 72 l~~m~rr~~~~K~il~~ii~~l~l~i~~viy~k~~ 106 (109)
+..+.+-....|+..+++.++. +++.++|..+.
T Consensus 20 l~~l~~~L~r~k~~Ii~~~~~~--~~lg~~Ya~~a 52 (325)
T PRK15471 20 LIDLLVQLWRGKMTIIISVIVA--IALAVGYLAVA 52 (325)
T ss_pred HHHHHHHHHHhhHHHHHHHHHH--HHHHHHHHHhC
Confidence 3445555566666544333332 33444554443
No 301
>PF10112 Halogen_Hydrol: 5-bromo-4-chloroindolyl phosphate hydrolysis protein; InterPro: IPR018770 This entry consists of prokaryotic proteins that mediate the hydrolysis of 5-bromo-4-chloroindolyl phosphate bonds.
Probab=26.37 E-value=2.4e+02 Score=19.90 Aligned_cols=57 Identities=19% Similarity=0.141 Sum_probs=29.4
Q ss_pred HHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHH
Q psy6549 9 LRETNQALVRTGESIVRSEQAALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSR 67 (109)
Q Consensus 9 ll~~~~~l~~t~~~L~~s~~~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~ 67 (109)
|+.....++.+...-.+......|++++=.++-.....|-+.+.. +.+.|++.+++.
T Consensus 141 l~~kY~~l~~~~~~~~~~~~~l~e~~~~L~~l~~~f~~~~~~l~~--~d~~dl~~ei~V 197 (199)
T PF10112_consen 141 LLEKYAELESQPVKSEEIKQSLEEIEETLDTLNQAFEKDLDKLLE--DDIMDLDTEISV 197 (199)
T ss_pred HHHHHHHHHhccCCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHH--hHHHHHHHHHHh
Confidence 334444444444334445555556666655666666666655543 345555555543
No 302
>KOG0771|consensus
Probab=26.22 E-value=88 Score=25.32 Aligned_cols=22 Identities=14% Similarity=0.221 Sum_probs=12.4
Q ss_pred hHHHHHHHHHHHHHHHHHHHHH
Q psy6549 82 NKLFLIMIIVIESMTLSVMLYL 103 (109)
Q Consensus 82 ~K~il~~ii~~l~l~i~~viy~ 103 (109)
.++..|+++++++..|+.++|+
T Consensus 368 ~~~~~~~~lL~~l~~I~~~~y~ 389 (398)
T KOG0771|consen 368 RKIPAWLALLFVLSYIIAYFYL 389 (398)
T ss_pred hhhhHHHHHHHHHHHHHHHHHH
Confidence 4566776666655555544443
No 303
>cd00922 Cyt_c_Oxidase_IV Cytochrome c oxidase subunit IV. Cytochrome c oxidase (CcO), the terminal oxidase in the respiratory chains of eukaryotes and most bacteria, is a multi-chain transmembrane protein located in the inner membrane of mitochondria and the cell membrane of prokaryotes. It catalyzes the reduction of O2 and simultaneously pumps protons across the membrane. The number of subunits varies from three to five in bacteria and up to 13 in mammalian mitochondria. Subunits I, II, and III of mammalian CcO are encoded within the mitochondrial genome and the remaining 10 subunits are encoded within the nuclear genome. Found only in eukaryotes, subunit IV is the largest of the nuclear-encoded subunits. It binds ATP at the matrix side, leading to an allosteric inhibition of enzyme activity at high intramitochondrial ATP/ADP ratios. In mammals, subunit IV has a lung-specific isoform and a ubiquitously expressed isoform.
Probab=26.16 E-value=1.3e+02 Score=20.46 Aligned_cols=22 Identities=9% Similarity=0.028 Sum_probs=10.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH
Q psy6549 83 KLFLIMIIVIESMTLSVMLYLK 104 (109)
Q Consensus 83 K~il~~ii~~l~l~i~~viy~k 104 (109)
|.++.++++++.+.++++++.+
T Consensus 75 k~v~~~~~~~i~~s~~~~~~~r 96 (136)
T cd00922 75 KTVFGGVLAFIGITGVIFGLQR 96 (136)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 3444445555444444444443
No 304
>PF02790 COX2_TM: Cytochrome C oxidase subunit II, transmembrane domain This family corresponds to chains b and o.; InterPro: IPR011759 Cytochrome c oxidase (1.9.3.1 from EC) [, ] is an oligomeric enzymatic complex which is a component of the respiratory chain and is involved in the transfer of electrons from cytochrome c to oxygen. In eukaryotes this enzyme complex is located in the mitochondrial inner membrane; in aerobic prokaryotes it is found in the plasma membrane. The enzyme complex consists of 3-4 subunits (prokaryotes) to up to 13 polypeptides (mammals). Subunit 2 (CO II) transfers the electrons from cytochrome c to the catalytic subunit 1. It contains two adjacent transmembrane regions in its N terminus and the major part of the protein is exposed to the periplasmic or to the mitochondrial intermembrane space, respectively. CO II provides the substrate-binding site and contains a copper centre called Cu(A) (see IPR001505 from INTERPRO), probably the primary acceptor in cytochrome c oxidase. An exception is the corresponding subunit of the cbb3-type oxidase which lacks the copper A redox-centre. Several bacterial CO II have a C-terminal extension that contains a covalently bound haem c. The N-terminal domain of cytochrome C oxidase contains two transmembrane alpha-helices. ; GO: 0004129 cytochrome-c oxidase activity, 0005507 copper ion binding, 0009055 electron carrier activity, 0022900 electron transport chain, 0016021 integral to membrane; PDB: 3VRJ_C 2EIN_B 3AG3_B 2DYR_O 3AG1_B 2EIK_O 3ASN_B 1OCR_B 2EIJ_B 1OCC_O ....
Probab=26.13 E-value=1.5e+02 Score=17.53 Aligned_cols=22 Identities=18% Similarity=0.427 Sum_probs=9.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHh
Q psy6549 85 FLIMIIVIESMTLSVMLYLKIF 106 (109)
Q Consensus 85 il~~ii~~l~l~i~~viy~k~~ 106 (109)
+++.++++++++.++++|..+.
T Consensus 27 ~~~i~~~I~~~V~~~l~~~~~~ 48 (84)
T PF02790_consen 27 VMIIMIFIFVFVFYFLIYFLFN 48 (84)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHhheeeeEeeeccccccc
Confidence 3444444444444444444443
No 305
>COG0342 SecD Preprotein translocase subunit SecD [Intracellular trafficking and secretion]
Probab=26.04 E-value=90 Score=25.94 Aligned_cols=24 Identities=13% Similarity=0.286 Sum_probs=16.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHh
Q psy6549 83 KLFLIMIIVIESMTLSVMLYLKIF 106 (109)
Q Consensus 83 K~il~~ii~~l~l~i~~viy~k~~ 106 (109)
.-+..+++.+++++++.++||+++
T Consensus 344 ~gi~Agl~g~~~V~vfm~~~Yr~~ 367 (506)
T COG0342 344 AGLIAGLIGLALVAVFMLLYYRLA 367 (506)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHh
Confidence 345566666666777778888754
No 306
>PF11174 DUF2970: Protein of unknown function (DUF2970); InterPro: IPR021344 This short family is conserved in Proteobacteria. The function is not known.
Probab=25.80 E-value=1.4e+02 Score=17.21 Aligned_cols=20 Identities=15% Similarity=0.373 Sum_probs=8.9
Q ss_pred HHHHHHHHHHHHHHHHHHHH
Q psy6549 84 LFLIMIIVIESMTLSVMLYL 103 (109)
Q Consensus 84 ~il~~ii~~l~l~i~~viy~ 103 (109)
+|..+++..++.+.+++...
T Consensus 33 ~Ii~gii~~~~fV~~Lv~lV 52 (56)
T PF11174_consen 33 FIIVGIILAALFVAGLVLLV 52 (56)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 44445554444444444433
No 307
>PF08656 DASH_Dad3: DASH complex subunit Dad3; InterPro: IPR013965 The DASH complex is a ~10 subunit microtubule-binding complex that is transferred to the kinetochore prior to mitosis []. In Saccharomyces cerevisiae (Baker's yeast) DASH forms both rings and spiral structures on microtubules in vitro [, ].
Probab=25.56 E-value=1.7e+02 Score=18.12 Aligned_cols=23 Identities=9% Similarity=0.083 Sum_probs=12.2
Q ss_pred HHHhhhHHHHHHHHHHHHHHHHH
Q psy6549 8 TLRETNQALVRTGESIVRSEQAA 30 (109)
Q Consensus 8 ~ll~~~~~l~~t~~~L~~s~~~~ 30 (109)
..|...+++...-..|.+..+.+
T Consensus 7 ~VL~eY~~La~~L~~L~~~l~~L 29 (78)
T PF08656_consen 7 EVLDEYQRLADNLKTLSDTLKDL 29 (78)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 34555555555555555554444
No 308
>PF11044 TMEMspv1-c74-12: Plectrovirus spv1-c74 ORF 12 transmembrane protein; InterPro: IPR022743 This is a group of proteins expressed by Plectroviruses. The Plectroviruses are single-stranded DNA viruses belonging to the Inoviridae. This entry represents putative transmembrane proteins of unknown function.
Probab=25.50 E-value=1.3e+02 Score=16.79 Aligned_cols=10 Identities=30% Similarity=0.551 Sum_probs=6.5
Q ss_pred HHHHHHHHHh
Q psy6549 97 LSVMLYLKIF 106 (109)
Q Consensus 97 i~~viy~k~~ 106 (109)
+++.+|-|+.
T Consensus 21 iGl~IyQkik 30 (49)
T PF11044_consen 21 IGLSIYQKIK 30 (49)
T ss_pred HHHHHHHHHH
Confidence 5666777664
No 309
>TIGR00996 Mtu_fam_mce virulence factor Mce family protein. Members of this paralogous family are found as six tandem homologous proteins in the same orientation per cassette, in four separate cassettes in Mycobacterium tuberculosis. The six members of each cassette represent six subfamilies. One subfamily includes the protein mce (mycobacterial cell entry), a virulence protein required for invasion of non-phagocytic cells.
Probab=25.42 E-value=2.8e+02 Score=20.49 Aligned_cols=25 Identities=16% Similarity=-0.020 Sum_probs=11.1
Q ss_pred HHHHHHHHHHHHHHHHHhhHHHHHH
Q psy6549 30 ALEAEAIGTGVISELGVQREALVRA 54 (109)
Q Consensus 30 ~~ete~iG~~il~~L~~Qre~L~~~ 54 (109)
+.++-+-.......|.++++++.++
T Consensus 211 l~~~v~~l~~~~~~l~~~~~~l~~~ 235 (291)
T TIGR00996 211 LDRLLDNLATLTAQLADRDDALDDA 235 (291)
T ss_pred HHHHHHHHHHHHHHHHhChHHHHHH
Confidence 3333333444444455555555444
No 310
>PF15136 UPF0449: Uncharacterised protein family UPF0449
Probab=25.22 E-value=2e+02 Score=18.66 Aligned_cols=19 Identities=26% Similarity=0.284 Sum_probs=13.3
Q ss_pred HHHHHhhHHHHHHhhhHhh
Q psy6549 42 SELGVQREALVRARDRLVD 60 (109)
Q Consensus 42 ~~L~~Qre~L~~~~~~~~~ 60 (109)
..|.+.+|.|..+.+.++.
T Consensus 74 ~~Lkkk~e~L~~age~Le~ 92 (97)
T PF15136_consen 74 DQLKKKCEELRQAGEELER 92 (97)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 5677777877777766653
No 311
>PF12958 DUF3847: Protein of unknown function (DUF3847); InterPro: IPR024215 This entry represents a family of uncharacterised proteins that were found by clustering human gut metagenomic sequences [].
Probab=25.22 E-value=1.9e+02 Score=18.31 Aligned_cols=48 Identities=13% Similarity=0.177 Sum_probs=33.9
Q ss_pred HHHhhHHHHHHhhhHhhhHHHHHHHHHHHHHHH---HHHHhhHHHHHHHHH
Q psy6549 44 LGVQREALVRARDRLVDTDLHVSRSRRMIQAIR---RRVFYNKLFLIMIIV 91 (109)
Q Consensus 44 L~~Qre~L~~~~~~~~~i~~~l~~a~~~l~~m~---rr~~~~K~il~~ii~ 91 (109)
|..-++.+..+..++............-.+... |+..++++|--|.++
T Consensus 3 Le~l~~e~e~~~~kl~q~e~~~k~L~nr~k~l~k~eRK~RtHRLi~rGa~l 53 (86)
T PF12958_consen 3 LEELQAEIEKAEKKLEQAEHKIKQLENRKKKLEKKERKERTHRLIERGAIL 53 (86)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHH
Confidence 444455566666777777777777776666665 888888888877765
No 312
>PF13198 DUF4014: Protein of unknown function (DUF4014)
Probab=25.06 E-value=1.7e+02 Score=17.91 Aligned_cols=10 Identities=20% Similarity=0.534 Sum_probs=4.5
Q ss_pred HHHHHHHHHH
Q psy6549 95 MTLSVMLYLK 104 (109)
Q Consensus 95 l~i~~viy~k 104 (109)
+.|.-+++..
T Consensus 29 ipI~pll~~~ 38 (72)
T PF13198_consen 29 IPISPLLFVW 38 (72)
T ss_pred HHHHHHHHHH
Confidence 4444444443
No 313
>PF04834 Adeno_E3_14_5: Early E3 14.5 kDa protein; InterPro: IPR008131 The E3B 14.5 kDa was first identified in human adenovirus type 5. It is an integral membrane protein oriented with its C terminus in the cytoplasm. It functions to down-regulate the epidermal growth factor receptor and prevent tumour necrosis factor cytolysis. It achieves this through the interaction with E3 10.4 kDa protein [, ]. ; GO: 0009966 regulation of signal transduction, 0016021 integral to membrane
Probab=24.99 E-value=87 Score=20.36 Aligned_cols=11 Identities=18% Similarity=0.274 Sum_probs=5.9
Q ss_pred HHHHHHHHHHH
Q psy6549 95 MTLSVMLYLKI 105 (109)
Q Consensus 95 l~i~~viy~k~ 105 (109)
.++++.||.+|
T Consensus 37 t~~~l~iYp~f 47 (97)
T PF04834_consen 37 TFFSLAIYPCF 47 (97)
T ss_pred HHHHHhhhhee
Confidence 34445566655
No 314
>PF07253 Gypsy: Gypsy protein; InterPro: IPR009882 This family consists of several Gypsy/Env proteins from Drosophila and Ceratitis fruit fly species. Gypsy is an endogenous retrovirus of Drosophila melanogaster. Phylogenetic studies suggest that occasional horizontal transfer events of gypsy occur between Drosophila species. Gypsy possesses infective properties associated with the products of the envelope gene that might be at the origin of these interspecies transfers [].
Probab=24.99 E-value=1e+02 Score=25.56 Aligned_cols=24 Identities=17% Similarity=0.283 Sum_probs=10.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcc
Q psy6549 85 FLIMIIVIESMTLSVMLYLKIFKK 108 (109)
Q Consensus 85 il~~ii~~l~l~i~~viy~k~~~~ 108 (109)
+..++++.+.+++++++|....+|
T Consensus 420 l~~gv~l~~~Ii~~i~~~~~~r~~ 443 (472)
T PF07253_consen 420 LVFGVLLSIMIIIIIALILMLRKK 443 (472)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhh
Confidence 334444444444555555444433
No 315
>PRK10913 dipeptide transporter; Provisional
Probab=24.98 E-value=2e+02 Score=21.76 Aligned_cols=28 Identities=14% Similarity=0.093 Sum_probs=15.5
Q ss_pred HHHHHHHHhhHHHHHHHHHHHHHHHHHH
Q psy6549 73 QAIRRRVFYNKLFLIMIIVIESMTLSVM 100 (109)
Q Consensus 73 ~~m~rr~~~~K~il~~ii~~l~l~i~~v 100 (109)
+.+.++..+||..+++.+++++++++.+
T Consensus 20 ~~~~~~~~~~~~~~~~~~il~~~~~~~~ 47 (300)
T PRK10913 20 QEFWHYFKRNKGAVVGLVYVVIMLFIAI 47 (300)
T ss_pred HHHHHHHhhChHHHHHHHHHHHHHHHHH
Confidence 4445667778876665555444433333
No 316
>TIGR00327 secE_euk_arch protein translocase SEC61 complex gamma subunit, archaeal and eukaryotic. This model describes archaeal SEC61-like and eukaryotic SEC61 but not bacterial secE proteins, for which a Pfam pfam00584 (SecE) has been created.
Probab=24.91 E-value=1.6e+02 Score=17.39 Aligned_cols=46 Identities=7% Similarity=0.025 Sum_probs=26.4
Q ss_pred hhHHHHHHHHHHHHHH---HHHHHhhHHHHHHHHHHHHHHHHHHHHHHH
Q psy6549 60 DTDLHVSRSRRMIQAI---RRRVFYNKLFLIMIIVIESMTLSVMLYLKI 105 (109)
Q Consensus 60 ~i~~~l~~a~~~l~~m---~rr~~~~K~il~~ii~~l~l~i~~viy~k~ 105 (109)
+..+-+.++.++++.= .+..+..=.-..++-++++.++++++++-+
T Consensus 5 ~~~~f~k~~~r~lk~~~KPd~~Ef~~iak~t~iG~~i~G~IGf~Ikli~ 53 (61)
T TIGR00327 5 APVEFIKEGTRVLAVCKKPDLEEYLKVAKVTGIGIIIVGIIGYIIKIIA 53 (61)
T ss_pred hHHHHHHHHHHHHHHhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3445555666655543 234555545555666666677777776644
No 317
>PHA03386 P10 fibrous body protein; Provisional
Probab=24.84 E-value=2e+02 Score=18.56 Aligned_cols=26 Identities=12% Similarity=0.084 Sum_probs=21.8
Q ss_pred HHHHHHhhHHHHHHhhhHhhhHHHHH
Q psy6549 41 ISELGVQREALVRARDRLVDTDLHVS 66 (109)
Q Consensus 41 l~~L~~Qre~L~~~~~~~~~i~~~l~ 66 (109)
...|..|.++|.....++.+|.+.|+
T Consensus 35 ~~~LDa~~~qL~~l~tkV~~Iq~iLn 60 (94)
T PHA03386 35 SQPLDGLPAQLTELDTKVSDIQSILT 60 (94)
T ss_pred chhhhhHHHHHHHHHHHHHHHHHhcC
Confidence 56789999999999999998887664
No 318
>PF05531 NPV_P10: Nucleopolyhedrovirus P10 protein; InterPro: IPR008702 This family consists of several nucleopolyhedrovirus P10 proteins which are thought to be involved in the morphogenesis of the polyhedra [].; GO: 0019028 viral capsid
Probab=24.84 E-value=1.8e+02 Score=17.97 Aligned_cols=27 Identities=11% Similarity=0.144 Sum_probs=21.1
Q ss_pred HHHHHHHhhHHHHHHhhhHhhhHHHHH
Q psy6549 40 VISELGVQREALVRARDRLVDTDLHVS 66 (109)
Q Consensus 40 il~~L~~Qre~L~~~~~~~~~i~~~l~ 66 (109)
+-..|..|..+|.....++.++.+-|.
T Consensus 40 l~~klDa~~~~l~~l~~~V~~I~~iL~ 66 (75)
T PF05531_consen 40 LNKKLDAQSAQLTTLNTKVNEIQDILN 66 (75)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhC
Confidence 456688888888888888888877654
No 319
>PF14387 DUF4418: Domain of unknown function (DUF4418)
Probab=24.67 E-value=1.1e+02 Score=20.62 Aligned_cols=17 Identities=0% Similarity=-0.089 Sum_probs=9.8
Q ss_pred HHHHHHhhHHHHHHHHH
Q psy6549 75 IRRRVFYNKLFLIMIIV 91 (109)
Q Consensus 75 m~rr~~~~K~il~~ii~ 91 (109)
|.++..+..++.+..++
T Consensus 94 M~Ch~~T~p~v~v~~~l 110 (124)
T PF14387_consen 94 MHCHTVTKPAVRVLGGL 110 (124)
T ss_pred CChhhhHHHHHHHHHHH
Confidence 77777655555444443
No 320
>COG1766 fliF Flagellar basal body M-ring protein [Cell motility and secretion]
Probab=24.65 E-value=2e+02 Score=24.18 Aligned_cols=14 Identities=14% Similarity=-0.095 Sum_probs=6.6
Q ss_pred HhhHHHHHHHHHHH
Q psy6549 80 FYNKLFLIMIIVIE 93 (109)
Q Consensus 80 ~~~K~il~~ii~~l 93 (109)
...|+.+.++++++
T Consensus 21 ~~~ki~l~~~~~~~ 34 (545)
T COG1766 21 KKQKIVLLGAGAAL 34 (545)
T ss_pred hhHHHHHHHHHHHH
Confidence 34455555444444
No 321
>PRK11486 flagellar biosynthesis protein FliO; Provisional
Probab=24.63 E-value=1.4e+02 Score=20.25 Aligned_cols=11 Identities=18% Similarity=0.135 Sum_probs=4.0
Q ss_pred HHHHHHHHHHH
Q psy6549 92 IESMTLSVMLY 102 (109)
Q Consensus 92 ~l~l~i~~viy 102 (109)
+++++.+++++
T Consensus 26 ~lVl~lI~~~a 36 (124)
T PRK11486 26 IGIIALILAAA 36 (124)
T ss_pred HHHHHHHHHHH
Confidence 33333333333
No 322
>PF06738 DUF1212: Protein of unknown function (DUF1212); InterPro: IPR010619 This entry represents a predicted domain found within a number of hypothetical proteins of unknown function found in eukaryotes, bacteria and archaea. Some of these sequences are predicted to be membrane proteins.
Probab=24.59 E-value=2.4e+02 Score=19.46 Aligned_cols=41 Identities=20% Similarity=0.297 Sum_probs=23.7
Q ss_pred HHHhhhHhhhHHHHHHHHHHHHHHHHHH-HhhHHHHHHHHHH
Q psy6549 52 VRARDRLVDTDLHVSRSRRMIQAIRRRV-FYNKLFLIMIIVI 92 (109)
Q Consensus 52 ~~~~~~~~~i~~~l~~a~~~l~~m~rr~-~~~K~il~~ii~~ 92 (109)
.+....+.+=+-.++++...|+.+.+.- ..++|+.++...+
T Consensus 71 ~~l~~~~~~~~~~~~ea~~~L~~I~~~~~~y~~~~~~l~~~l 112 (193)
T PF06738_consen 71 NRLSRRIVAGQLSLEEAIERLDEIDREPPRYPPWLVILAAGL 112 (193)
T ss_pred HHHHHHHhcCCCCHHHHHHHHHHHhhCCCCCCHHHHHHHHHH
Confidence 3333344333446777888888888776 5555554444333
No 323
>PF07584 BatA: Aerotolerance regulator N-terminal; InterPro: IPR024163 The Batl operon appears to be important in pathogenicity and aerotolerance. BatA ensures bacterial survival in the early stages of the infection process, when the infected sites are aerobic, and is produced under conditions of oxidative stress []. Proteins produced by the Batl operon share a highly-conserved sequence at their N terminus and the full length proteins carry multiple membrane-spanning domains []. This entry represents the conserved N-terminal domain, which is also found in some uncharacterised proteins.
Probab=24.43 E-value=1.3e+02 Score=17.91 Aligned_cols=7 Identities=0% Similarity=-0.026 Sum_probs=2.7
Q ss_pred HHHHHHh
Q psy6549 100 MLYLKIF 106 (109)
Q Consensus 100 viy~k~~ 106 (109)
++||...
T Consensus 21 ~~~~~~~ 27 (77)
T PF07584_consen 21 IIHYFLR 27 (77)
T ss_pred HHHHHhc
Confidence 3444333
No 324
>PHA02689 ORF051 putative membrane protein; Provisional
Probab=24.37 E-value=1e+02 Score=21.00 Aligned_cols=11 Identities=36% Similarity=0.516 Sum_probs=4.5
Q ss_pred HHHHHHHHHHh
Q psy6549 96 TLSVMLYLKIF 106 (109)
Q Consensus 96 ~i~~viy~k~~ 106 (109)
++++++.+.||
T Consensus 39 ~va~~L~~~FF 49 (128)
T PHA02689 39 LLALALALVFF 49 (128)
T ss_pred HHHHHHHHHHH
Confidence 33334444444
No 325
>PF11241 DUF3043: Protein of unknown function (DUF3043); InterPro: IPR021403 Some members in this family of proteins with unknown function are annotated as membrane proteins. This cannot be confirmed.
Probab=24.32 E-value=1.1e+02 Score=21.76 Aligned_cols=7 Identities=29% Similarity=0.325 Sum_probs=2.7
Q ss_pred HHHHHHH
Q psy6549 98 SVMLYLK 104 (109)
Q Consensus 98 ~~viy~k 104 (109)
+++++.+
T Consensus 119 ~~~l~r~ 125 (170)
T PF11241_consen 119 GVILGRR 125 (170)
T ss_pred HHHHHHH
Confidence 3334433
No 326
>PRK10772 cell division protein FtsL; Provisional
Probab=24.28 E-value=2e+02 Score=18.90 Aligned_cols=32 Identities=16% Similarity=0.181 Sum_probs=17.4
Q ss_pred HHHHHHHHHh-hHHHHHHHHHHHHHHHHHHHHHH
Q psy6549 72 IQAIRRRVFY-NKLFLIMIIVIESMTLSVMLYLK 104 (109)
Q Consensus 72 l~~m~rr~~~-~K~il~~ii~~l~l~i~~viy~k 104 (109)
.+-|..-.+. +|+.+...+++++-+++ |+|..
T Consensus 12 ~~iI~~Dl~~~~kl~l~Ll~~vv~SAl~-VV~~~ 44 (108)
T PRK10772 12 PGVIGDDLLRNGKLPLCLFIAVIVSAVT-VVTTA 44 (108)
T ss_pred HHHHHHHHHHcChHHHHHHHHHHHHHHH-HHHHH
Confidence 3344444555 77777666666544444 44443
No 327
>PRK10249 phenylalanine transporter; Provisional
Probab=24.08 E-value=94 Score=24.77 Aligned_cols=14 Identities=7% Similarity=0.384 Sum_probs=8.2
Q ss_pred HHHHHHHHHHhccC
Q psy6549 96 TLSVMLYLKIFKKR 109 (109)
Q Consensus 96 ~i~~viy~k~~~~~ 109 (109)
.+++.++|++++||
T Consensus 445 ~~~~~~~~~~~~~~ 458 (458)
T PRK10249 445 IVFLFVAFKTLRRK 458 (458)
T ss_pred HHHHHHHHHheecC
Confidence 34455566676665
No 328
>PRK14750 kdpF potassium-transporting ATPase subunit F; Provisional
Probab=24.02 E-value=1.1e+02 Score=15.40 Aligned_cols=18 Identities=11% Similarity=0.283 Sum_probs=7.7
Q ss_pred HHHHHHHHHHHHHHHHHHH
Q psy6549 86 LIMIIVIESMTLSVMLYLK 104 (109)
Q Consensus 86 l~~ii~~l~l~i~~viy~k 104 (109)
++++++++ +..++.+|--
T Consensus 6 i~g~llv~-lLl~YLvYAL 23 (29)
T PRK14750 6 VCGALLVL-LLLGYLVYAL 23 (29)
T ss_pred HHHHHHHH-HHHHHHHHHH
Confidence 33444333 3344455543
No 329
>PF04888 SseC: Secretion system effector C (SseC) like family ; InterPro: IPR006972 SseC is a secreted protein that forms a complex together with SecB and SecD on the surface of Salmonella typhimurium. All these proteins are secreted by the type III secretion system []. Many mucosal pathogens use type III secretion systems for the injection of effector proteins into target cells. SecB, SseC and SecD are inserted into the target cell membrane. where they form a small pore or translocon [, ]. In addition to SseC, this family includes the bacterial secreted proteins PopB, PepB, YopB and EspD which are thought to be directly involved in pore formation, and type III secretion system translocon.; GO: 0009405 pathogenesis
Probab=24.02 E-value=3.2e+02 Score=20.56 Aligned_cols=16 Identities=13% Similarity=0.318 Sum_probs=9.0
Q ss_pred HhhHHHHHHHHHHHHH
Q psy6549 80 FYNKLFLIMIIVIESM 95 (109)
Q Consensus 80 ~~~K~il~~ii~~l~l 95 (109)
|..|++-|++.++-++
T Consensus 57 i~~kIf~wi~~avsvv 72 (306)
T PF04888_consen 57 IFSKIFGWIGTAVSVV 72 (306)
T ss_pred hHHHHHHHHHHHHHHH
Confidence 4566666665555443
No 330
>PRK10381 LPS O-antigen length regulator; Provisional
Probab=23.79 E-value=2.1e+02 Score=22.67 Aligned_cols=20 Identities=15% Similarity=0.167 Sum_probs=10.1
Q ss_pred HHHHHHHHHhhHHHHHHHHH
Q psy6549 72 IQAIRRRVFYNKLFLIMIIV 91 (109)
Q Consensus 72 l~~m~rr~~~~K~il~~ii~ 91 (109)
+..+.+-....|+.++++.+
T Consensus 29 l~~ll~~L~r~k~~Il~~~~ 48 (377)
T PRK10381 29 LFELISVLWKAKKTIIAITF 48 (377)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 34455555566655544443
No 331
>PF08655 DASH_Ask1: DASH complex subunit Ask1; InterPro: IPR013964 The DASH complex is a ~10 subunit microtubule-binding complex that is transferred to the kinetochore prior to mitosis []. In Saccharomyces cerevisiae (Baker's yeast) DASH forms both rings and spiral structures on microtubules in vitro [, ]. Components of the DASH complex, including Dam1, Duo1, Spc34, Dad1 and Ask1, are essential and connect the centromere to the plus end of spindle microtubules [].
Probab=23.75 E-value=1.2e+02 Score=18.23 Aligned_cols=20 Identities=30% Similarity=0.336 Sum_probs=15.8
Q ss_pred HhhhHhhhHHHHHHHHHHHH
Q psy6549 54 ARDRLVDTDLHVSRSRRMIQ 73 (109)
Q Consensus 54 ~~~~~~~i~~~l~~a~~~l~ 73 (109)
+--.+.+||.|++++.+++.
T Consensus 8 iTl~LQeID~N~s~~~~iit 27 (66)
T PF08655_consen 8 ITLLLQEIDSNFSRCHRIIT 27 (66)
T ss_pred HHHHHHHHHHHHHHHHHHHh
Confidence 34468899999999997765
No 332
>PRK13831 conjugal transfer protein TrbI; Provisional
Probab=23.74 E-value=55 Score=26.76 Aligned_cols=22 Identities=23% Similarity=0.446 Sum_probs=10.2
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHH
Q psy6549 82 NKLFLIMIIVIESMTLSVMLYLK 104 (109)
Q Consensus 82 ~K~il~~ii~~l~l~i~~viy~k 104 (109)
||+-++++|+++++ +++|++|-
T Consensus 23 n~~p~~~~~~~~~~-~~~~i~~g 44 (432)
T PRK13831 23 NRLPIIVAIVLVVL-FLGVIFYG 44 (432)
T ss_pred ccCChhHHHHHHHH-HHHHHHHH
Confidence 55555555444433 33444443
No 333
>PF11023 DUF2614: Protein of unknown function (DUF2614); InterPro: IPR020912 This entry describes proteins of unknown function, which are thought to be membrane proteins.; GO: 0005887 integral to plasma membrane
Probab=23.63 E-value=1.5e+02 Score=19.82 Aligned_cols=14 Identities=21% Similarity=0.351 Sum_probs=7.7
Q ss_pred HHHHHHHHHHHHHH
Q psy6549 92 IESMTLSVMLYLKI 105 (109)
Q Consensus 92 ~l~l~i~~viy~k~ 105 (109)
++.++...++|+++
T Consensus 47 ~L~~l~S~~VYfwI 60 (114)
T PF11023_consen 47 LLAILASTAVYFWI 60 (114)
T ss_pred HHHHHHHHHHHHHh
Confidence 44444455667664
No 334
>PF01405 PsbT: Photosystem II reaction centre T protein; InterPro: IPR001743 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection []. This family represents the low molecular weight transmembrane protein PsbT found in PSII, which is thought to be associated with the D1 (PsbA) - D2 (PsbD) heterodimer. PsbT may be involved in the formation and/or stabilisation of dimeric PSII complexes, because in the absence of this protein dimeric PSII complexes were found to be less abundant. Furthermore, although PsbT does not confer photo-protection, it is required for the efficient recovery of photo-damaged PSII [].; GO: 0015979 photosynthesis, 0009523 photosystem II, 0009539 photosystem II reaction center, 0016020 membrane; PDB: 3BZ1_T 1S5L_t 2AXT_t 3KZI_T 3PRQ_T 3BZ2_T 3PRR_T 4FBY_g 3A0H_t 3A0B_T ....
Probab=23.57 E-value=1.2e+02 Score=15.39 Aligned_cols=11 Identities=27% Similarity=0.745 Sum_probs=4.4
Q ss_pred HHHHHHHHHHh
Q psy6549 96 TLSVMLYLKIF 106 (109)
Q Consensus 96 ~i~~viy~k~~ 106 (109)
.++++++.-++
T Consensus 13 tlgilffAI~F 23 (29)
T PF01405_consen 13 TLGILFFAIFF 23 (29)
T ss_dssp HHHHHHHHHHS
T ss_pred HHHHHHhhhhc
Confidence 33434444444
No 335
>PF14991 MLANA: Protein melan-A; PDB: 2GTZ_F 2GT9_F 3MRO_P 2GUO_C 3MRQ_P 2GTW_C 3L6F_C 3MRP_P.
Probab=23.53 E-value=16 Score=24.52 Aligned_cols=16 Identities=13% Similarity=0.061 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHhc
Q psy6549 92 IESMTLSVMLYLKIFK 107 (109)
Q Consensus 92 ~l~l~i~~viy~k~~~ 107 (109)
+++++|++++=-|+.|
T Consensus 33 ~VILgiLLliGCWYck 48 (118)
T PF14991_consen 33 IVILGILLLIGCWYCK 48 (118)
T ss_dssp ----------------
T ss_pred HHHHHHHHHHhheeee
Confidence 3334444444333433
No 336
>PRK15082 glutathione ABC transporter permease GsiD; Provisional
Probab=23.49 E-value=2.3e+02 Score=21.45 Aligned_cols=25 Identities=12% Similarity=0.174 Sum_probs=13.3
Q ss_pred HHHHHHHHhhHHHHHHHHHHHHHHH
Q psy6549 73 QAIRRRVFYNKLFLIMIIVIESMTL 97 (109)
Q Consensus 73 ~~m~rr~~~~K~il~~ii~~l~l~i 97 (109)
+.+-++..+++..+++.++++++++
T Consensus 26 ~~~~~~~~~~~~~~~~~~il~~~~~ 50 (301)
T PRK15082 26 HEFWRRFRRQHVALVAGLFVLLLIV 50 (301)
T ss_pred HHHHHHHhhChHHHHHHHHHHHHHH
Confidence 3345566677766655554443333
No 337
>PF02411 MerT: MerT mercuric transport protein; InterPro: IPR003457 MerT is an mercuric transport integral membrane protein and is responsible for transport of the Hg2+ iron from periplasmic MerP (also part of the transport system) to mercuric reductase (MerA).; GO: 0015097 mercury ion transmembrane transporter activity, 0015694 mercury ion transport, 0016020 membrane
Probab=23.27 E-value=1.9e+02 Score=19.22 Aligned_cols=26 Identities=23% Similarity=0.466 Sum_probs=16.8
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHh
Q psy6549 81 YNKLFLIMIIVIESMTLSVMLYLKIF 106 (109)
Q Consensus 81 ~~K~il~~ii~~l~l~i~~viy~k~~ 106 (109)
..|.++|++.++.++++.+-.|.-+|
T Consensus 90 ~~~~~lwi~t~~vl~~l~~py~~p~f 115 (116)
T PF02411_consen 90 QTKILLWIVTVLVLLLLAFPYYAPLF 115 (116)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 34677777766666666666666554
No 338
>TIGR01000 bacteriocin_acc bacteriocin secretion accessory protein. This family represents an accessory protein that works with the bacteriocin maturation and ABC transport secretion protein described by TIGR01193.
Probab=23.15 E-value=76 Score=25.47 Aligned_cols=21 Identities=19% Similarity=0.293 Sum_probs=10.8
Q ss_pred HHHHHHHHHHHHHHHHHHHhc
Q psy6549 87 IMIIVIESMTLSVMLYLKIFK 107 (109)
Q Consensus 87 ~~ii~~l~l~i~~viy~k~~~ 107 (109)
+++.+++++++++++|..|-+
T Consensus 22 ~~~~~~~~~~~~~~~WA~~~~ 42 (457)
T TIGR01000 22 LVIVPIFLLLVFLVLFSLFAK 42 (457)
T ss_pred hHHHHHHHHHHHHHHHHHhEe
Confidence 333344444455666666543
No 339
>COG3087 FtsN Cell division protein [Cell division and chromosome partitioning]
Probab=23.08 E-value=1.3e+02 Score=23.02 Aligned_cols=23 Identities=22% Similarity=0.226 Sum_probs=9.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHhcc
Q psy6549 86 LIMIIVIESMTLSVMLYLKIFKK 108 (109)
Q Consensus 86 l~~ii~~l~l~i~~viy~k~~~~ 108 (109)
+.++++.+++++++.+|+...++
T Consensus 18 ~~~i~~~viv~~~~~~~~~t~~~ 40 (264)
T COG3087 18 FVAIAAAVIVTFIGGLYFITHHK 40 (264)
T ss_pred HHHHHHHHHHHHHHHHHHHHhhc
Confidence 33333333344444444444433
No 340
>PRK02935 hypothetical protein; Provisional
Probab=23.00 E-value=1.7e+02 Score=19.46 Aligned_cols=14 Identities=21% Similarity=0.359 Sum_probs=7.3
Q ss_pred HHHHHHHHHHHHHH
Q psy6549 92 IESMTLSVMLYLKI 105 (109)
Q Consensus 92 ~l~l~i~~viy~k~ 105 (109)
++.++...++|+++
T Consensus 48 ~l~~l~S~vvYFwi 61 (110)
T PRK02935 48 FLAVIASTVVYFWI 61 (110)
T ss_pred HHHHHHHHHHHHHH
Confidence 34444455666664
No 341
>PF06422 PDR_CDR: CDR ABC transporter; InterPro: IPR010929 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energise diverse biological systems. ABC transporters minimally consist of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain []. The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyse ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarise the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site [, , ]. The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis [, , , , , ]. The ATP-Binding Cassette (ABC) superfamily forms one of the largest of all protein families with a diversity of physiological functions []. Several studies have shown that there is a correlation between the functional characterisation and the phylogenetic classification of the ABC cassette [, ]. More than 50 subfamilies have been described based on a phylogenetic and functional classification [, , ]; (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). In yeast, the PDR and CDR ABC transporters display extensive sequence homology, and confer resistance to several anti-fungal compounds by actively transporting their substrates out of the cell. These transporters have two homologous halves, each with an N-terminal intracellular hydrophilic region that contains an ATP-binding site, followed by a C-terminal membrane-associated region containing six transmembrane segments []. This entry represents a domain of the PDR/CDR ABC transporter comprising extracellular loop 3, transmembrane segment 6 and a linker region.; GO: 0005524 ATP binding, 0042626 ATPase activity, coupled to transmembrane movement of substances, 0006810 transport, 0016021 integral to membrane
Probab=22.96 E-value=1.4e+02 Score=19.14 Aligned_cols=18 Identities=17% Similarity=0.041 Sum_probs=9.4
Q ss_pred HHhhHHHHHHHHHHHHHH
Q psy6549 79 VFYNKLFLIMIIVIESMT 96 (109)
Q Consensus 79 ~~~~K~il~~ii~~l~l~ 96 (109)
..+|--|+++.+++.+++
T Consensus 48 ~WRN~GIli~f~i~f~~~ 65 (103)
T PF06422_consen 48 RWRNFGILIAFWIFFIVL 65 (103)
T ss_pred hhhhHHHHHHHHHHHHHH
Confidence 445666666555544333
No 342
>COG3765 WzzB Chain length determinant protein [Cell envelope biogenesis, outer membrane]
Probab=22.94 E-value=2.4e+02 Score=22.51 Aligned_cols=33 Identities=15% Similarity=0.261 Sum_probs=20.1
Q ss_pred HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh
Q psy6549 72 IQAIRRRVFYNKLFLIMIIVIESMTLSVMLYLKIF 106 (109)
Q Consensus 72 l~~m~rr~~~~K~il~~ii~~l~l~i~~viy~k~~ 106 (109)
+....+-...-|++.+++.++. +++++.|.-+.
T Consensus 21 L~~L~~~LW~~K~~Ii~~~l~~--~~~al~ys~l~ 53 (347)
T COG3765 21 LRELFRTLWRGKWTIIGIVLLF--AAAALGYSFLA 53 (347)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHhc
Confidence 5667777888888776665554 33444444443
No 343
>PF07664 FeoB_C: Ferrous iron transport protein B C terminus; InterPro: IPR011640 Escherichia coli has an iron(II) transport system (feo) which may make an important contribution to the iron supply of the cell under anaerobic conditions []. FeoB has been identified as part of this transport system. FeoB is a large 700-800 amino acid integral membrane protein. The N terminus has been previously erroneously described as being ATP-binding []. Recent work shows that it is similar to eukaryotic G-proteins and that it is a GTPase [].; GO: 0015093 ferrous iron transmembrane transporter activity, 0015684 ferrous iron transport, 0016021 integral to membrane
Probab=22.93 E-value=1.5e+02 Score=16.52 Aligned_cols=18 Identities=11% Similarity=0.309 Sum_probs=7.8
Q ss_pred HHHHHHHHHHHHHHHHHH
Q psy6549 85 FLIMIIVIESMTLSVMLY 102 (109)
Q Consensus 85 il~~ii~~l~l~i~~viy 102 (109)
.+|++.+++.++.+.++-
T Consensus 4 ~~y~~~~~~~l~~~~il~ 21 (54)
T PF07664_consen 4 SLYLLGILVALLVGLILK 21 (54)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 345444444444443333
No 344
>KOG0994|consensus
Probab=22.78 E-value=6.8e+02 Score=23.93 Aligned_cols=9 Identities=22% Similarity=0.387 Sum_probs=3.4
Q ss_pred HHHHHHHHH
Q psy6549 37 GTGVISELG 45 (109)
Q Consensus 37 G~~il~~L~ 45 (109)
+.+|.+.|.
T Consensus 1565 ae~V~eaL~ 1573 (1758)
T KOG0994|consen 1565 AEDVVEALE 1573 (1758)
T ss_pred HHHHHHHHH
Confidence 333333333
No 345
>PF13572 DUF4134: Domain of unknown function (DUF4134)
Probab=22.75 E-value=1.4e+02 Score=19.23 Aligned_cols=22 Identities=14% Similarity=0.238 Sum_probs=13.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHh
Q psy6549 85 FLIMIIVIESMTLSVMLYLKIF 106 (109)
Q Consensus 85 il~~ii~~l~l~i~~viy~k~~ 106 (109)
++|.+..++.++-.+-+|.||-
T Consensus 46 l~yaI~aVvglIGai~VY~k~~ 67 (98)
T PF13572_consen 46 LMYAIGAVVGLIGAIRVYIKWN 67 (98)
T ss_pred HHHHHHHHHHHHHHHHHHHhhc
Confidence 3455555554444556898885
No 346
>PF10875 DUF2670: Protein of unknown function (DUF2670); InterPro: IPR022714 This bacterial family of proteins has no known function and appears to be restricted to Rickettsiaceae.
Probab=22.67 E-value=1.4e+02 Score=20.29 Aligned_cols=13 Identities=15% Similarity=0.327 Sum_probs=6.8
Q ss_pred HHHHHHHHHHhcc
Q psy6549 96 TLSVMLYLKIFKK 108 (109)
Q Consensus 96 ~i~~viy~k~~~~ 108 (109)
+-++++||-+..+
T Consensus 32 aSlI~lyy~v~gl 44 (139)
T PF10875_consen 32 ASLITLYYTVLGL 44 (139)
T ss_pred HHHHHHHHHHhhh
Confidence 3344566666543
No 347
>PF04415 DUF515: Protein of unknown function (DUF515) ; InterPro: IPR007509 This is a family of hypothetical archaeal proteins.
Probab=22.65 E-value=66 Score=26.20 Aligned_cols=29 Identities=7% Similarity=0.180 Sum_probs=11.9
Q ss_pred HHHhhHHHHHH-HHHHHHHHHHHHHHHHHh
Q psy6549 78 RVFYNKLFLIM-IIVIESMTLSVMLYLKIF 106 (109)
Q Consensus 78 r~~~~K~il~~-ii~~l~l~i~~viy~k~~ 106 (109)
|....+++..+ ++++.++++++.+|+.+.
T Consensus 26 ke~~~riiiGa~Vl~iIii~~~~~~Y~~~~ 55 (416)
T PF04415_consen 26 KERRKRIIIGAAVLIIIIIFIVYNIYYFLQ 55 (416)
T ss_pred hhhhhhhhhhhhhHhHHHHHHHHHHHHHhh
Confidence 34444444423 222333334444555544
No 348
>PF05371 Phage_Coat_Gp8: Phage major coat protein, Gp8; InterPro: IPR008020 The major coat protein in the capsid of filamentous bacteriophage forms a helical assembly of about 7000 identical protomers, with each protomer comprised of 46 amino acids, after the cleavage of the signal peptide. Each protomer forms a slightly curved helix that combines to form a tubular structure that encapsulates the viral DNA [].; PDB: 1IFK_A 2C0W_A 2HI5_A 1FDM_A 1IFJ_A 2C0X_A 1IFI_A 1IFD_A 1MZT_A 1IFL_A ....
Probab=22.59 E-value=1.4e+02 Score=17.17 Aligned_cols=10 Identities=20% Similarity=0.451 Sum_probs=4.7
Q ss_pred HHHHHHHHHh
Q psy6549 97 LSVMLYLKIF 106 (109)
Q Consensus 97 i~~viy~k~~ 106 (109)
|++=+|-||.
T Consensus 40 igirlFKKf~ 49 (52)
T PF05371_consen 40 IGIRLFKKFA 49 (52)
T ss_dssp HHHHHHHHHH
T ss_pred HHHHHHHHHh
Confidence 3333555554
No 349
>PF03994 DUF350: Domain of Unknown Function (DUF350) ; InterPro: IPR007140 This motif occurs in a small set of bacterial proteins. It has two transmembrane regions, and often occurs as tandem repeats. The are no conserved catalytic residues.
Probab=22.54 E-value=1.1e+02 Score=16.92 Aligned_cols=21 Identities=10% Similarity=0.110 Sum_probs=11.5
Q ss_pred HHHHHHHHHHHHHHHHHHHhc
Q psy6549 87 IMIIVIESMTLSVMLYLKIFK 107 (109)
Q Consensus 87 ~~ii~~l~l~i~~viy~k~~~ 107 (109)
|+++.+++..++..+|-++.|
T Consensus 1 y~~~g~~l~~~~~~~~~~~tp 21 (54)
T PF03994_consen 1 YGLVGIVLLLLGFFVFDLLTP 21 (54)
T ss_pred CHHHHHHHHHHHHHHHHHhcC
Confidence 345555555566666655544
No 350
>PHA02681 ORF089 virion membrane protein; Provisional
Probab=22.53 E-value=1.5e+02 Score=18.84 Aligned_cols=8 Identities=25% Similarity=0.845 Sum_probs=3.2
Q ss_pred HHHHHHHH
Q psy6549 97 LSVMLYLK 104 (109)
Q Consensus 97 i~~viy~k 104 (109)
+++..|.+
T Consensus 19 liya~YRR 26 (92)
T PHA02681 19 IVIMMYRR 26 (92)
T ss_pred HHHHHHHh
Confidence 33344443
No 351
>PRK10927 essential cell division protein FtsN; Provisional
Probab=22.51 E-value=41 Score=26.39 Aligned_cols=9 Identities=33% Similarity=0.419 Sum_probs=4.0
Q ss_pred HHHHHHhcc
Q psy6549 100 MLYLKIFKK 108 (109)
Q Consensus 100 viy~k~~~~ 108 (109)
.+||--++|
T Consensus 50 GLyFith~k 58 (319)
T PRK10927 50 GLYFITHHK 58 (319)
T ss_pred heEEEecCC
Confidence 355444443
No 352
>KOG3633|consensus
Probab=22.47 E-value=3.2e+02 Score=20.02 Aligned_cols=46 Identities=22% Similarity=0.190 Sum_probs=29.0
Q ss_pred CchHHHHHHHhhhHHHH----------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy6549 1 MEKIHRETLRETNQALV----------------RTGESIVRSEQAALEAEAIGTGVISELGV 46 (109)
Q Consensus 1 ~~~~~R~~ll~~~~~l~----------------~t~~~L~~s~~~~~ete~iG~~il~~L~~ 46 (109)
||+.+|+.+.-..+++. .+.++|..+..+.+|...+|...+..-.-
T Consensus 91 ~se~eREEi~l~~~Rl~~Rt~TVev~VeT~Rn~~Q~~sL~~at~lIDev~~m~~~~ig~ak~ 152 (219)
T KOG3633|consen 91 MSEAEREEIILYLQRLSSRTATVEVRVETVRNNSQEDSLSQATVLIDEVIKMGDPVIGRAKC 152 (219)
T ss_pred HhhhhHHHHHHHHHHHhcccceEEEEEeccCChHHHHHHHHHHHHHHHHHHhccchHHHHHH
Confidence 45666666666666553 33477777777777777777765544333
No 353
>COG2991 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=22.29 E-value=1.3e+02 Score=18.54 Aligned_cols=14 Identities=14% Similarity=0.152 Sum_probs=7.9
Q ss_pred HHHHHHHHHHHHHH
Q psy6549 84 LFLIMIIVIESMTL 97 (109)
Q Consensus 84 ~il~~ii~~l~l~i 97 (109)
+++.+++++++++-
T Consensus 4 ~lltFg~Fllvi~g 17 (77)
T COG2991 4 FLLTFGIFLLVIAG 17 (77)
T ss_pred HHHHHHHHHHHHHH
Confidence 55666666664433
No 354
>TIGR02848 spore_III_AC stage III sporulation protein AC. Members of this protein family are designated SpoIIIAC, part of the spoIIIA operon of sporulation genes whose mutant phenotype is linked to sporulation stage III. Members of this family are encoded by the genome of a species if and only if that species is capable of endospore formation, as in Bacillus subtilis. The molecular function of this small, probable integral membrane protein is unknown.
Probab=22.15 E-value=1.9e+02 Score=17.34 Aligned_cols=33 Identities=12% Similarity=0.141 Sum_probs=22.9
Q ss_pred HHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHH
Q psy6549 67 RSRRMIQAIRRRVFYNKLFLIMIIVIESMTLSV 99 (109)
Q Consensus 67 ~a~~~l~~m~rr~~~~K~il~~ii~~l~l~i~~ 99 (109)
-=+.+++...|.......-+.|++++|...+-+
T Consensus 18 vl~~vLk~sGkee~A~~~tLaG~iiVL~~Vi~~ 50 (64)
T TIGR02848 18 VIHTILKQSGKEEQAQMVTLAGIVVVLFMVITL 50 (64)
T ss_pred HHHHHHHHcCcHHHHHHHHHHHHHHHHHHHHHH
Confidence 345677777888888877788887777554443
No 355
>COG4575 ElaB Uncharacterized conserved protein [Function unknown]
Probab=22.01 E-value=2.4e+02 Score=18.51 Aligned_cols=78 Identities=27% Similarity=0.224 Sum_probs=34.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhH-HHHHHhhhHhhh-HHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHH
Q psy6549 22 SIVRSEQAALEAEAIGTGVISELGVQRE-ALVRARDRLVDT-DLHVSRSRRMIQAIRRRVFYNKLFLIMIIVIESMTLSV 99 (109)
Q Consensus 22 ~L~~s~~~~~ete~iG~~il~~L~~Qre-~L~~~~~~~~~i-~~~l~~a~~~l~~m~rr~~~~K~il~~ii~~l~l~i~~ 99 (109)
-+++.+.++.++-..+.+=..+|+.+=+ -|.++++.+.+. +.-..+++..+..=.-=.--|-|.-++|....-+++++
T Consensus 20 L~d~lEevL~ssg~~a~~e~~~lR~r~~~~Lk~~r~rl~~~~d~v~~~sk~a~~~tD~yV~e~PWq~VGvaAaVGlllGl 99 (104)
T COG4575 20 LLDTLEEVLKSSGSLAGDEAEELRSKAESALKEARDRLGDTGDAVVQRSKAAADATDDYVRENPWQGVGVAAAVGLLLGL 99 (104)
T ss_pred HHHHHHHHHHhcccchhhHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHcCCchHHHHHHHHHHHHHH
Confidence 3344444444444444444555554443 244555555555 33333343333222222233445555555444444443
No 356
>PF07010 Endomucin: Endomucin; InterPro: IPR010740 This family consists of several mammalian endomucin proteins. Endomucin is an early endothelial-specific antigen that is also expressed on putative hematopoietic progenitor cells.
Probab=21.93 E-value=1.2e+02 Score=22.89 Aligned_cols=24 Identities=25% Similarity=0.533 Sum_probs=14.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhc
Q psy6549 84 LFLIMIIVIESMTLSVMLYLKIFK 107 (109)
Q Consensus 84 ~il~~ii~~l~l~i~~viy~k~~~ 107 (109)
+|+-++|+++++.+.++..+-|+|
T Consensus 189 vilpvvIaliVitl~vf~LvgLyr 212 (259)
T PF07010_consen 189 VILPVVIALIVITLSVFTLVGLYR 212 (259)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHH
Confidence 555566666666666666665554
No 357
>PRK00409 recombination and DNA strand exchange inhibitor protein; Reviewed
Probab=21.92 E-value=5.5e+02 Score=22.54 Aligned_cols=42 Identities=14% Similarity=0.100 Sum_probs=18.0
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHH
Q psy6549 12 TNQALVRTGESIVRSEQAALEAEAIGTGVISELGVQREALVR 53 (109)
Q Consensus 12 ~~~~l~~t~~~L~~s~~~~~ete~iG~~il~~L~~Qre~L~~ 53 (109)
....+++--..|++-++.+.+-.+-......++...++.++.
T Consensus 514 ~~~~~~~li~~l~~~~~~~e~~~~~~~~~~~e~~~~~~~l~~ 555 (782)
T PRK00409 514 DKEKLNELIASLEELERELEQKAEEAEALLKEAEKLKEELEE 555 (782)
T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334444444444444444444444444444444433333333
No 358
>PHA02955 hypothetical protein; Provisional
Probab=21.89 E-value=1.3e+02 Score=22.33 Aligned_cols=23 Identities=22% Similarity=0.193 Sum_probs=16.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHh
Q psy6549 84 LFLIMIIVIESMTLSVMLYLKIF 106 (109)
Q Consensus 84 ~il~~ii~~l~l~i~~viy~k~~ 106 (109)
-..|++++++++++++++.+-++
T Consensus 178 ~~~w~ii~~v~ii~~~v~l~yik 200 (213)
T PHA02955 178 SIKWFIIYIVLCLLILIILGYIY 200 (213)
T ss_pred CCcchhHHHHHHHHHHHHHHHHH
Confidence 45688999998877777644333
No 359
>PF03554 Herpes_UL73: UL73 viral envelope glycoprotein ; InterPro: IPR005211 This entry represents a conserved region found in a number of viral proteins: BLRF1, U46, 53, and UL73, collectively known as glycoprotein N. These UL73-like envelope glycoproteins, which associate in a high molecular mass complex with their counterpart protein gM, induce neutralizing antibody responses in the host. These glycoproteins are highly polymorphic, particularly in the N-terminal region [].; GO: 0019031 viral envelope
Probab=21.62 E-value=1.5e+02 Score=18.54 Aligned_cols=19 Identities=21% Similarity=0.389 Sum_probs=9.7
Q ss_pred HHHHHHHHHHHHHHHHHHH
Q psy6549 86 LIMIIVIESMTLSVMLYLK 104 (109)
Q Consensus 86 l~~ii~~l~l~i~~viy~k 104 (109)
.|.++-++++++..++|++
T Consensus 51 IW~iiN~~il~~A~~vyLr 69 (82)
T PF03554_consen 51 IWAIINVVILLCAFCVYLR 69 (82)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 3445545555555555554
No 360
>PRK14780 lipoprotein signal peptidase; Provisional
Probab=21.53 E-value=1.5e+02 Score=22.64 Aligned_cols=28 Identities=11% Similarity=0.277 Sum_probs=14.5
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHhccC
Q psy6549 81 YNKLFLIMIIVIESMTLSVMLYLKIFKKR 109 (109)
Q Consensus 81 ~~K~il~~ii~~l~l~i~~viy~k~~~~~ 109 (109)
-|-+|.++++++++.+++- ++.+|.++|
T Consensus 203 ADi~I~~G~~llv~~~li~-~~~~~~~~~ 230 (263)
T PRK14780 203 FDVYIVVGVCVLVVILIIS-FIIKWKKDK 230 (263)
T ss_pred HHHHHHHHHHHHHHHHHHH-HHHhhcccc
Confidence 3556666665555443332 455665543
No 361
>PF09605 Trep_Strep: Hypothetical bacterial integral membrane protein (Trep_Strep); InterPro: IPR011733 This family consists of strongly hydrophobic proteins about 190 amino acids in length with a strongly basic motif near the C terminus. If is found in rather few species, but in paralogous families of 12 members in the oral pathogenic spirochaete Treponema denticola and 2 in Streptococcus pneumoniae (strain ATCC BAA-255 / R6).
Probab=21.49 E-value=1.4e+02 Score=21.17 Aligned_cols=32 Identities=28% Similarity=0.422 Sum_probs=18.4
Q ss_pred HHHhhHHHHH-HHHHHHHHHHHHHHHHHHhccC
Q psy6549 78 RVFYNKLFLI-MIIVIESMTLSVMLYLKIFKKR 109 (109)
Q Consensus 78 r~~~~K~il~-~ii~~l~l~i~~viy~k~~~~~ 109 (109)
...+++++.. .++.+.+.+++..+=.|+.|||
T Consensus 151 ~~~~~~~~~~~~~~~~v~a~lG~~lG~kllkKH 183 (186)
T PF09605_consen 151 SFFTPWMLIIIIIITFVGALLGALLGKKLLKKH 183 (186)
T ss_pred HHcchHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 3445555544 3344555556666777777775
No 362
>PRK15406 oligopeptide ABC transporter permease OppC; Provisional
Probab=21.42 E-value=2.8e+02 Score=21.16 Aligned_cols=26 Identities=31% Similarity=0.400 Sum_probs=14.3
Q ss_pred HHHHHHHHhhHHHHHHHHHHHHHHHH
Q psy6549 73 QAIRRRVFYNKLFLIMIIVIESMTLS 98 (109)
Q Consensus 73 ~~m~rr~~~~K~il~~ii~~l~l~i~ 98 (109)
+.+-++..++|..+++.+++++++++
T Consensus 27 ~~~~~~~~~~~~~~~g~~il~~~~~~ 52 (302)
T PRK15406 27 QDARRRFMHNRAAVASLIVLVLIALF 52 (302)
T ss_pred HHHHHHHhhChHHHHHHHHHHHHHHH
Confidence 34455666777766655554444333
No 363
>PF07111 HCR: Alpha helical coiled-coil rod protein (HCR); InterPro: IPR009800 This family consists of several mammalian alpha helical coiled-coil rod HCR proteins. The function of HCR is unknown but it has been implicated in psoriasis in humans and is thought to affect keratinocyte proliferation [].; GO: 0030154 cell differentiation, 0005634 nucleus, 0005737 cytoplasm
Probab=21.40 E-value=5.7e+02 Score=22.54 Aligned_cols=75 Identities=20% Similarity=0.302 Sum_probs=47.0
Q ss_pred HHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhh-hHhhhH-----------HHHHHHH-------
Q psy6549 9 LRETNQALVRTGESIVRSEQAALEAEAIGTGVISELGVQREALVRARD-RLVDTD-----------LHVSRSR------- 69 (109)
Q Consensus 9 ll~~~~~l~~t~~~L~~s~~~~~ete~iG~~il~~L~~Qre~L~~~~~-~~~~i~-----------~~l~~a~------- 69 (109)
|....+.+......|.-+.+-..++.+-+...-.+|.+|++.-.+.-. +|.++. .-++.|+
T Consensus 530 Lq~~qe~la~l~~QL~~Ar~~lqes~eea~~lR~EL~~QQ~~y~~alqekvsevEsrl~E~L~~~E~rLNeARREHtKaV 609 (739)
T PF07111_consen 530 LQEKQESLAELEEQLEAARKSLQESTEEAAELRRELTQQQEVYERALQEKVSEVESRLREQLSEMEKRLNEARREHTKAV 609 (739)
T ss_pred HHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334445555666777778888888888888888999999876664332 444433 3333333
Q ss_pred HHHHHHHHHHHhhH
Q psy6549 70 RMIQAIRRRVFYNK 83 (109)
Q Consensus 70 ~~l~~m~rr~~~~K 83 (109)
--|+.+.|++..+|
T Consensus 610 VsLRQ~qrqa~reK 623 (739)
T PF07111_consen 610 VSLRQIQRQAAREK 623 (739)
T ss_pred HHHHHHHHHHHHhh
Confidence 34566666666554
No 364
>PF09125 COX2-transmemb: Cytochrome C oxidase subunit II, transmembrane; InterPro: IPR015209 This N-terminal domain forms the transmembrane region in subunit II of cytochrome c oxidase from Thermus thermophilus. This domain adopts a tertiary structure consisting of two antiparallel transmembrane helices, in a transmembrane helix hairpin fold []. ; PDB: 1EHK_B 2QPE_B 3S8F_B 4EV3_B 3BVD_B 3S8G_B 3EH3_B 3S3C_B 3S39_B 3QJQ_B ....
Probab=21.23 E-value=1.5e+02 Score=15.84 Aligned_cols=20 Identities=10% Similarity=0.242 Sum_probs=8.2
Q ss_pred hHHHHHHHHHHHHHHHHHHHH
Q psy6549 82 NKLFLIMIIVIESMTLSVMLY 102 (109)
Q Consensus 82 ~K~il~~ii~~l~l~i~~viy 102 (109)
.+|+..++..++ +.++++-|
T Consensus 14 r~Wi~F~l~mi~-vFi~li~y 33 (38)
T PF09125_consen 14 RGWIAFALAMIL-VFIALIGY 33 (38)
T ss_dssp HHHHHHHHHHHH-HHHHHHHH
T ss_pred HhHHHHHHHHHH-HHHHHHHH
Confidence 345544433333 33444444
No 365
>PF03918 CcmH: Cytochrome C biogenesis protein; InterPro: IPR005616 Members of this family include NrfF, CcmH, CycL, Ccl2.; PDB: 2KW0_A 2HL7_A.
Probab=21.08 E-value=32 Score=23.79 Aligned_cols=26 Identities=8% Similarity=0.080 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcc
Q psy6549 83 KLFLIMIIVIESMTLSVMLYLKIFKK 108 (109)
Q Consensus 83 K~il~~ii~~l~l~i~~viy~k~~~~ 108 (109)
-+++|+.=++++++.++++|+.+.|+
T Consensus 101 ~~~lW~~P~~~l~~g~~~~~~~~rr~ 126 (148)
T PF03918_consen 101 TWLLWLGPFLLLLLGGALLFRRLRRW 126 (148)
T ss_dssp --------------------------
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 46677777776666666666655544
No 366
>PF11712 Vma12: Endoplasmic reticulum-based factor for assembly of V-ATPase; InterPro: IPR021013 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include: F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. V-ATPases (also known as V1V0-ATPase or vacuolar ATPase) (3.6.3.14 from EC) are found in the eukaryotic endomembrane system, and in the plasma membrane of prokaryotes and certain specialised eukaryotic cells. V-ATPases hydrolyse ATP to drive a proton pump, and are involved in a variety of vital intra- and inter-cellular processes such as receptor mediated endocytosis, protein trafficking, active transport of metabolites, homeostasis and neurotransmitter release []. V-ATPases are composed of two linked complexes: the V1 complex (subunits A-H) contains the catalytic core that hydrolyses ATP, while the V0 complex (subunits a, c, c', c'', d) forms the membrane-spanning pore. V-ATPases may have an additional role in membrane fusion through binding to t-SNARE proteins []. The yeast vacuolar proton-translocating ATPase (V-ATPase) is the best characterised member of the V-ATPase family. A total of thirteen genes are required for encoding the subunits of the enzyme complex itself and an additional three for providing factors necessary for the assembly of the whole. Vma12 is one of these latter, all three of which are localised to the endoplasmic reticulum [].
Probab=20.98 E-value=1.8e+02 Score=19.55 Aligned_cols=25 Identities=16% Similarity=0.380 Sum_probs=11.5
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHh
Q psy6549 82 NKLFLIMIIVIESMTLSVMLYLKIF 106 (109)
Q Consensus 82 ~K~il~~ii~~l~l~i~~viy~k~~ 106 (109)
-|+++++++.+++++.=+++|.+..
T Consensus 111 ~Rvllgl~~al~vlvAEv~l~~~y~ 135 (142)
T PF11712_consen 111 YRVLLGLFGALLVLVAEVVLYIRYL 135 (142)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444444444444444444554443
No 367
>TIGR02209 ftsL_broad cell division protein FtsL. This model represents FtsL, both forms similar to that in E. coli and similar to that in B. subtilis. FtsL is one of the later proteins active in cell division septum formation. FtsL is small, low in complexity, and highly divergent. The scope of this model is broader than that of the Pfam model pfam04999.3 for FtsL, as this one includes FtsL from Bacillus subtilis and related species.
Probab=20.86 E-value=1.4e+02 Score=17.81 Aligned_cols=15 Identities=20% Similarity=0.501 Sum_probs=6.0
Q ss_pred HHHHHHHHHHHHHHH
Q psy6549 86 LIMIIVIESMTLSVM 100 (109)
Q Consensus 86 l~~ii~~l~l~i~~v 100 (109)
+.+++++++++++++
T Consensus 6 ~~l~~~v~~~~~~~v 20 (85)
T TIGR02209 6 VLLLLAILVSAISVV 20 (85)
T ss_pred HHHHHHHHHHHHHHH
Confidence 333444443444444
No 368
>PF09972 DUF2207: Predicted membrane protein (DUF2207); InterPro: IPR018702 This domain has no known function.
Probab=20.86 E-value=2e+02 Score=22.63 Aligned_cols=7 Identities=14% Similarity=0.216 Sum_probs=3.0
Q ss_pred HHHHHHh
Q psy6549 100 MLYLKIF 106 (109)
Q Consensus 100 viy~k~~ 106 (109)
++|+++.
T Consensus 247 ~~~~~~~ 253 (511)
T PF09972_consen 247 IIWRKYG 253 (511)
T ss_pred HHhhhcc
Confidence 3444443
No 369
>KOG0810|consensus
Probab=20.81 E-value=4e+02 Score=20.59 Aligned_cols=55 Identities=15% Similarity=0.218 Sum_probs=29.5
Q ss_pred HHHHHHh-hhHhhhHHHHHHHHHHHHHHHHH-------HHhhHHHHHHHHHHHHHHHHHHHHH
Q psy6549 49 EALVRAR-DRLVDTDLHVSRSRRMIQAIRRR-------VFYNKLFLIMIIVIESMTLSVMLYL 103 (109)
Q Consensus 49 e~L~~~~-~~~~~i~~~l~~a~~~l~~m~rr-------~~~~K~il~~ii~~l~l~i~~viy~ 103 (109)
..|.-.. +.+++|+.++..|...+..=.-. ....+...|++|+++++++++++++
T Consensus 226 a~LVe~QgEmvd~IE~nV~~A~~~V~~g~~~~~kAv~~qkkaRK~k~i~ii~~iii~~v~v~~ 288 (297)
T KOG0810|consen 226 AVLVESQGEMVDRIENNVENAVDYVEQGVDHLKKAVKYQKKARKWKIIIIIILIIIIVVLVVV 288 (297)
T ss_pred HHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhceeeeehHHHHHHHHHhhh
Confidence 3443333 45666788888887666552221 1233444556666665555555544
No 370
>PRK06870 secG preprotein translocase subunit SecG; Reviewed
Probab=20.72 E-value=1.8e+02 Score=17.52 Aligned_cols=23 Identities=13% Similarity=0.332 Sum_probs=12.5
Q ss_pred HHhhHHHHHHHHHHHHHHHHHHH
Q psy6549 79 VFYNKLFLIMIIVIESMTLSVML 101 (109)
Q Consensus 79 ~~~~K~il~~ii~~l~l~i~~vi 101 (109)
..-+|+..++.++.+++++++.+
T Consensus 50 ~~L~k~T~il~~~F~i~~l~l~~ 72 (76)
T PRK06870 50 NFLSRLTAVLAVLFFVLSLALGY 72 (76)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 34456666666665555554443
No 371
>MTH00147 ATP8 ATP synthase F0 subunit 8; Provisional
Probab=20.65 E-value=1.8e+02 Score=16.35 Aligned_cols=23 Identities=22% Similarity=0.495 Sum_probs=10.6
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHH
Q psy6549 82 NKLFLIMIIVIESMTLSVMLYLK 104 (109)
Q Consensus 82 ~K~il~~ii~~l~l~i~~viy~k 104 (109)
+.+++++...+.++.+...+|+.
T Consensus 8 ~W~~l~~~~~~~~~~~~~~~~~~ 30 (51)
T MTH00147 8 NWIFLFILFWSAVIIVSVIIWWS 30 (51)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHh
Confidence 34444444444444444455553
No 372
>MTH00158 ATP8 ATP synthase F0 subunit 8; Provisional
Probab=20.59 E-value=1.4e+02 Score=15.08 Aligned_cols=21 Identities=19% Similarity=0.517 Sum_probs=9.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q psy6549 83 KLFLIMIIVIESMTLSVMLYL 103 (109)
Q Consensus 83 K~il~~ii~~l~l~i~~viy~ 103 (109)
+.+++....+....+....|+
T Consensus 9 W~~l~~~f~~~~~~~~~~~~f 29 (32)
T MTH00158 9 WLILFILFLITFILFNILNYF 29 (32)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 344444444444444444444
No 373
>PHA02967 hypothetical protein; Provisional
Probab=20.51 E-value=1.2e+02 Score=20.70 Aligned_cols=10 Identities=10% Similarity=0.368 Sum_probs=3.9
Q ss_pred HHHHHHHHHh
Q psy6549 97 LSVMLYLKIF 106 (109)
Q Consensus 97 i~~viy~k~~ 106 (109)
+++++.+.||
T Consensus 37 val~L~~~FF 46 (128)
T PHA02967 37 VALIIINFFF 46 (128)
T ss_pred HHHHHHHHHH
Confidence 3333334444
No 374
>PF07234 DUF1426: Protein of unknown function (DUF1426); InterPro: IPR009871 This family consists of several Banana bunchy top virus proteins of around 120 residues in length. Q9IGU4 from SWISSPROT is annotated a movement protein whereas most other family members are hypothetical. The function of this family is unknown.
Probab=20.42 E-value=2.1e+02 Score=18.82 Aligned_cols=21 Identities=19% Similarity=0.464 Sum_probs=10.6
Q ss_pred HHhhHHHHHHHHHHHHHHHHHH
Q psy6549 79 VFYNKLFLIMIIVIESMTLSVM 100 (109)
Q Consensus 79 ~~~~K~il~~ii~~l~l~i~~v 100 (109)
.+..++++++.|++. +.|+++
T Consensus 10 lfFEwFLF~~AIFiA-ItIlYI 30 (117)
T PF07234_consen 10 LFFEWFLFFGAIFIA-ITILYI 30 (117)
T ss_pred HHHHHHHHHHHHHHH-HHHHHH
Confidence 455666655555444 344444
No 375
>PF05454 DAG1: Dystroglycan (Dystrophin-associated glycoprotein 1); InterPro: IPR008465 Dystroglycan is one of the dystrophin-associated glycoproteins, which is encoded by a 5.5 kb transcript in Homo sapiens. The protein product is cleaved into two non-covalently associated subunits, [alpha] (N-terminal) and [beta] (C-terminal). In skeletal muscle the dystroglycan complex works as a transmembrane linkage between the extracellular matrix and the cytoskeleton [alpha]-dystroglycan is extracellular and binds to merosin ([alpha]-2 laminin) in the basement membrane, while [beta]-dystroglycan is a transmembrane protein and binds to dystrophin, which is a large rod-like cytoskeletal protein, absent in Duchenne muscular dystrophy patients. Dystrophin binds to intracellular actin cables. In this way, the dystroglycan complex, which links the extracellular matrix to the intracellular actin cables, is thought to provide structural integrity in muscle tissues. The dystroglycan complex is also known to serve as an agrin receptor in muscle, where it may regulate agrin-induced acetylcholine receptor clustering at the neuromuscular junction. There is also evidence which suggests the function of dystroglycan as a part of the signal transduction pathway because it is shown that Grb2, a mediator of the Ras-related signal pathway, can interact with the cytoplasmic domain of dystroglycan. In general, aberrant expression of dystrophin-associated protein complex underlies the pathogenesis of Duchenne muscular dystrophy, Becker muscular dystrophy and severe childhood autosomal recessive muscular dystrophy. Interestingly, no genetic disease has been described for either [alpha]- or [beta]-dystroglycan. Dystroglycan is widely distributed in non-muscle tissues as well as in muscle tissues. During epithelial morphogenesis of kidney, the dystroglycan complex is shown to act as a receptor for the basement membrane. Dystroglycan expression in Mus musculus brain and neural retina has also been reported. However, the physiological role of dystroglycan in non-muscle tissues has remained unclear [].; PDB: 1EG4_P.
Probab=20.39 E-value=34 Score=26.43 Aligned_cols=23 Identities=0% Similarity=0.249 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHhccC
Q psy6549 87 IMIIVIESMTLSVMLYLKIFKKR 109 (109)
Q Consensus 87 ~~ii~~l~l~i~~viy~k~~~~~ 109 (109)
.++|++++++|+++|..-..|||
T Consensus 151 paVVI~~iLLIA~iIa~icyrrk 173 (290)
T PF05454_consen 151 PAVVIAAILLIAGIIACICYRRK 173 (290)
T ss_dssp -----------------------
T ss_pred HHHHHHHHHHHHHHHHHHhhhhh
Confidence 45555555667777777777765
No 376
>PF05008 V-SNARE: Vesicle transport v-SNARE protein N-terminus; InterPro: IPR007705 V-SNARE proteins are required for protein traffic between eukaryotic organelles. The v-SNAREs on transport vesicles interact with t-SNAREs on target membranes in order to facilitate this []. This domain is the N-terminal half of the V-Snare proteins. ; GO: 0006886 intracellular protein transport, 0016020 membrane; PDB: 2V8S_V 1VCS_A 3ONL_C 3ONJ_A 2QYW_A.
Probab=20.26 E-value=1.8e+02 Score=17.18 Aligned_cols=21 Identities=0% Similarity=0.264 Sum_probs=12.5
Q ss_pred hhHhhhHHHHHHHHHHHHHHH
Q psy6549 56 DRLVDTDLHVSRSRRMIQAIR 76 (109)
Q Consensus 56 ~~~~~i~~~l~~a~~~l~~m~ 76 (109)
..+.++...+++|..+|+.|.
T Consensus 25 ~~i~~~e~~l~ea~~~l~qMe 45 (79)
T PF05008_consen 25 SLIREIERDLDEAEELLKQME 45 (79)
T ss_dssp HHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 345556666666666666664
No 377
>PRK15347 two component system sensor kinase SsrA; Provisional
Probab=20.25 E-value=1.2e+02 Score=25.98 Aligned_cols=16 Identities=13% Similarity=0.164 Sum_probs=9.9
Q ss_pred HHhhHHHHHHHHHHHH
Q psy6549 79 VFYNKLFLIMIIVIES 94 (109)
Q Consensus 79 ~~~~K~il~~ii~~l~ 94 (109)
-+..|+++++++++++
T Consensus 8 si~~Kl~l~~~~~~~~ 23 (921)
T PRK15347 8 SLVIRLTLLLGLTLII 23 (921)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 4667777776555443
No 378
>PF13125 DUF3958: Protein of unknown function (DUF3958)
Probab=20.23 E-value=2.6e+02 Score=18.19 Aligned_cols=41 Identities=27% Similarity=0.320 Sum_probs=34.0
Q ss_pred HHHHHHHHHHHHHHHHHHhhHHHHHHhhhHhhhHHHHHHHH
Q psy6549 29 AALEAEAIGTGVISELGVQREALVRARDRLVDTDLHVSRSR 69 (109)
Q Consensus 29 ~~~ete~iG~~il~~L~~Qre~L~~~~~~~~~i~~~l~~a~ 69 (109)
...|.-.+...+.-+|..|.|+|.+-+..+.+-.+++.--.
T Consensus 53 ~~qe~~~ieRk~t~eLE~qkE~LlkEkR~L~d~E~~l~yq~ 93 (99)
T PF13125_consen 53 MLQEAQHIERKLTFELEEQKETLLKEKRDLSDLENDLSYQQ 93 (99)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34566778888889999999999999999999888876544
No 379
>PRK14585 pgaD putative PGA biosynthesis protein; Provisional
Probab=20.22 E-value=2e+02 Score=19.88 Aligned_cols=25 Identities=4% Similarity=0.242 Sum_probs=18.3
Q ss_pred HhhHHHHHHHHHHHHHHHHHHHHHHH
Q psy6549 80 FYNKLFLIMIIVIESMTLSVMLYLKI 105 (109)
Q Consensus 80 ~~~K~il~~ii~~l~l~i~~viy~k~ 105 (109)
..+.+.+|+.|+++ -++++|+|.++
T Consensus 50 ~l~tl~~Y~~iAv~-nAvvLI~WA~Y 74 (137)
T PRK14585 50 ARSRLQFYFLLAVA-NAVVLIVWALY 74 (137)
T ss_pred HHHHHHHHHHHHHH-HHHHHHHHHHH
Confidence 56778888888776 56677887765
Done!