Query psy510
Match_columns 380
No_of_seqs 261 out of 1111
Neff 7.1
Searched_HMMs 46136
Date Fri Aug 16 17:18:17 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy510.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/510hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2189|consensus 100.0 1.6E-79 3.4E-84 629.4 28.5 309 2-316 186-494 (829)
2 PF01496 V_ATPase_I: V-type AT 100.0 2.8E-63 6.1E-68 535.9 13.2 287 2-294 156-443 (759)
3 PRK05771 V-type ATP synthase s 100.0 1.3E-52 2.9E-57 445.8 27.2 263 35-312 174-447 (646)
4 COG1269 NtpI Archaeal/vacuolar 100.0 3.9E-45 8.5E-50 387.4 26.9 243 34-291 186-431 (660)
5 KOG2189|consensus 99.8 2.4E-19 5.3E-24 186.0 3.3 67 313-379 331-397 (829)
6 PF01496 V_ATPase_I: V-type AT 98.6 4.4E-09 9.5E-14 114.7 -0.5 67 313-379 298-364 (759)
7 PRK05771 V-type ATP synthase s 98.0 3.8E-06 8.3E-11 90.2 4.0 62 313-379 283-347 (646)
8 COG1269 NtpI Archaeal/vacuolar 97.8 1.7E-05 3.7E-10 85.3 3.6 62 313-379 297-361 (660)
9 PF05377 FlaC_arch: Flagella a 80.2 15 0.00032 26.7 7.3 47 79-129 2-48 (55)
10 PF03962 Mnd1: Mnd1 family; I 64.0 7 0.00015 35.6 3.2 17 33-49 47-63 (188)
11 PF07666 MpPF26: M penetrans p 63.4 31 0.00068 29.6 6.8 80 201-285 40-120 (130)
12 COG2383 Uncharacterized conser 57.8 12 0.00026 30.5 3.1 25 224-248 15-39 (109)
13 PF01102 Glycophorin_A: Glycop 56.5 15 0.00033 31.1 3.7 24 220-246 67-90 (122)
14 PF08946 Osmo_CC: Osmosensory 54.2 30 0.00065 24.1 4.1 31 76-106 11-41 (46)
15 PF12725 DUF3810: Protein of u 53.3 24 0.00053 34.8 5.2 57 226-287 23-81 (318)
16 PF04977 DivIC: Septum formati 51.8 95 0.002 23.2 7.3 28 79-106 19-46 (80)
17 PRK15028 cytochrome bd-II oxid 51.3 48 0.001 33.6 7.0 73 219-293 74-147 (378)
18 PF06103 DUF948: Bacterial pro 50.7 1.2E+02 0.0026 23.7 8.4 32 77-108 26-57 (90)
19 PF12325 TMF_TATA_bd: TATA ele 50.4 1.3E+02 0.0028 25.4 8.4 13 114-126 73-85 (120)
20 KOG4800|consensus 49.1 56 0.0012 30.6 6.4 76 213-288 70-154 (248)
21 KOG2264|consensus 47.2 32 0.0007 36.5 5.0 54 52-105 82-135 (907)
22 PF06570 DUF1129: Protein of u 46.5 1.8E+02 0.004 26.5 9.6 16 213-228 86-101 (206)
23 COG1730 GIM5 Predicted prefold 45.7 1.4E+02 0.0031 26.1 8.2 36 73-108 90-125 (145)
24 PF06103 DUF948: Bacterial pro 45.5 1.5E+02 0.0031 23.2 8.7 39 75-113 31-69 (90)
25 PF07106 TBPIP: Tat binding pr 44.0 2E+02 0.0044 25.2 9.2 15 146-160 142-156 (169)
26 PRK10884 SH3 domain-containing 43.8 1.7E+02 0.0037 27.0 8.9 50 74-123 90-139 (206)
27 COG3301 NrfD Formate-dependent 43.6 64 0.0014 31.5 6.1 77 214-292 79-160 (305)
28 PF15290 Syntaphilin: Golgi-lo 42.9 1.6E+02 0.0035 28.6 8.6 51 79-129 77-137 (305)
29 PF11381 DUF3185: Protein of u 41.3 45 0.00097 24.6 3.7 49 234-284 4-57 (59)
30 PF05377 FlaC_arch: Flagella a 41.3 1E+02 0.0022 22.4 5.5 28 78-105 15-42 (55)
31 PF08112 ATP-synt_E_2: ATP syn 40.7 1.4E+02 0.003 21.5 6.6 41 74-114 8-49 (56)
32 PF04123 DUF373: Domain of unk 40.6 73 0.0016 31.9 6.3 86 199-287 150-244 (344)
33 PF11348 DUF3150: Protein of u 40.5 1.7E+02 0.0037 28.0 8.7 64 48-117 55-118 (257)
34 PF07178 TraL: TraL protein; 40.4 39 0.00085 27.1 3.7 41 210-250 18-58 (95)
35 PF02388 FemAB: FemAB family; 37.8 2.5E+02 0.0053 28.6 9.9 49 76-124 241-295 (406)
36 PF11068 YlqD: YlqD protein; 37.7 2.6E+02 0.0057 23.9 8.8 36 73-108 16-51 (131)
37 PRK00888 ftsB cell division pr 37.7 2.3E+02 0.0049 23.2 8.3 29 79-107 29-57 (105)
38 PRK14158 heat shock protein Gr 37.6 1.6E+02 0.0034 27.1 7.5 34 73-106 36-69 (194)
39 PF11833 DUF3353: Protein of u 36.9 2.3E+02 0.005 25.9 8.6 55 224-294 107-165 (194)
40 PRK15035 cytochrome bd-II oxid 35.6 1.4E+02 0.0031 31.5 7.8 66 214-288 381-446 (514)
41 PRK15003 cytochrome d ubiquino 35.2 1.3E+02 0.0029 30.5 7.3 71 218-293 73-147 (379)
42 PRK14151 heat shock protein Gr 33.9 1.2E+02 0.0027 27.2 6.2 23 78-100 21-43 (176)
43 PF06810 Phage_GP20: Phage min 33.0 1.5E+02 0.0032 26.1 6.4 45 53-97 3-47 (155)
44 COG3352 FlaC Putative archaeal 32.9 3.5E+02 0.0075 23.9 8.5 78 50-127 49-133 (157)
45 PF15079 DUF4546: Domain of un 32.5 2E+02 0.0043 25.8 6.9 23 81-103 51-73 (205)
46 cd00632 Prefoldin_beta Prefold 31.9 2.5E+02 0.0054 22.6 7.2 34 73-106 59-92 (105)
47 PRK11677 hypothetical protein; 31.7 2.5E+02 0.0054 24.2 7.4 46 80-128 32-77 (134)
48 PF02994 Transposase_22: L1 tr 31.6 4.6E+02 0.0099 26.4 10.5 27 80-106 140-166 (370)
49 PRK13729 conjugal transfer pil 31.0 1.9E+02 0.0041 30.3 7.6 45 78-122 70-117 (475)
50 COG1030 NfeD Membrane-bound se 30.6 1.3E+02 0.0028 31.1 6.3 67 205-285 253-321 (436)
51 PF14389 Lzipper-MIP1: Leucine 30.3 2.2E+02 0.0047 22.6 6.4 52 52-103 28-80 (88)
52 PRK14154 heat shock protein Gr 30.0 1.4E+02 0.0031 27.6 6.0 25 79-103 54-78 (208)
53 TIGR01069 mutS2 MutS2 family p 29.8 1E+02 0.0022 34.4 5.8 9 76-84 514-522 (771)
54 PF10947 DUF2628: Protein of u 29.5 2.9E+02 0.0063 22.2 7.3 31 215-245 40-70 (108)
55 PF06295 DUF1043: Protein of u 29.5 2.8E+02 0.0061 23.4 7.4 47 80-129 28-74 (128)
56 PHA03029 hypothetical protein; 29.4 56 0.0012 25.3 2.6 17 269-285 16-32 (92)
57 PF06698 DUF1192: Protein of u 29.1 1.1E+02 0.0023 22.6 4.0 25 79-103 23-47 (59)
58 PRK14143 heat shock protein Gr 29.0 2.5E+02 0.0053 26.7 7.5 30 75-104 65-94 (238)
59 PRK14140 heat shock protein Gr 28.8 2.4E+02 0.0052 25.8 7.2 32 75-106 35-66 (191)
60 KOG2264|consensus 28.8 2.6E+02 0.0057 30.0 8.2 50 76-125 92-144 (907)
61 PF10315 DUF2416: Protein of u 28.0 1.2E+02 0.0025 25.3 4.5 73 208-289 28-107 (108)
62 PF04111 APG6: Autophagy prote 27.9 3.9E+02 0.0084 26.3 9.1 15 82-96 55-69 (314)
63 TIGR02338 gimC_beta prefoldin, 27.5 3.3E+02 0.0071 22.2 7.3 35 73-107 63-97 (110)
64 PRK15097 cytochrome d terminal 27.5 2.6E+02 0.0056 29.7 8.0 66 214-288 381-446 (522)
65 PF06156 DUF972: Protein of un 27.2 3.6E+02 0.0077 22.2 8.1 35 74-108 5-39 (107)
66 KOG4348|consensus 27.2 2E+02 0.0044 29.7 6.9 47 76-129 568-614 (627)
67 PRK14162 heat shock protein Gr 27.0 2.5E+02 0.0053 25.8 7.0 31 76-106 38-68 (194)
68 KOG2391|consensus 27.0 1E+02 0.0022 30.7 4.7 8 184-191 335-342 (365)
69 PRK14011 prefoldin subunit alp 26.8 4.3E+02 0.0093 23.0 8.2 35 73-107 84-118 (144)
70 PRK14139 heat shock protein Gr 26.1 2.5E+02 0.0055 25.5 6.8 23 79-101 34-56 (185)
71 PF07698 7TM-7TMR_HD: 7TM rece 25.9 3.5E+02 0.0077 23.9 7.9 71 214-290 86-157 (194)
72 KOG0994|consensus 25.8 2E+02 0.0044 33.4 7.1 57 49-106 1205-1261(1758)
73 COG1294 AppB Cytochrome bd-typ 25.3 2E+02 0.0042 28.9 6.4 72 219-293 76-149 (346)
74 PF10146 zf-C4H2: Zinc finger- 25.1 4.5E+02 0.0098 24.7 8.5 18 344-365 188-205 (230)
75 KOG4302|consensus 24.8 3.5E+02 0.0075 29.6 8.5 56 49-104 72-130 (660)
76 cd02433 Nodulin-21_like_2 Nodu 24.6 3E+02 0.0065 25.9 7.3 20 271-290 215-234 (234)
77 PRK14155 heat shock protein Gr 24.5 1.8E+02 0.0039 27.0 5.6 23 80-102 16-38 (208)
78 PRK13729 conjugal transfer pil 24.4 2.4E+02 0.0052 29.5 7.0 44 76-119 75-121 (475)
79 PF05082 Rop-like: Rop-like; 24.3 1.9E+02 0.0042 21.8 4.7 30 78-107 3-32 (66)
80 PF11471 Sugarporin_N: Maltopo 24.3 1.9E+02 0.0042 21.3 4.7 26 74-99 29-54 (60)
81 PF04977 DivIC: Septum formati 24.2 1.8E+02 0.0038 21.7 4.8 29 75-103 22-50 (80)
82 PF07544 Med9: RNA polymerase 24.1 2.2E+02 0.0048 22.2 5.3 52 71-123 22-73 (83)
83 PF15290 Syntaphilin: Golgi-lo 24.0 2.6E+02 0.0057 27.2 6.7 17 79-95 84-100 (305)
84 PF02060 ISK_Channel: Slow vol 23.9 74 0.0016 27.1 2.7 50 269-322 46-97 (129)
85 PRK14148 heat shock protein Gr 23.8 3.4E+02 0.0075 24.9 7.3 29 77-105 40-68 (195)
86 PRK13169 DNA replication intia 23.8 4.3E+02 0.0093 22.0 8.1 35 74-108 5-39 (110)
87 COG1448 TyrB Aspartate/tyrosin 23.2 3.1E+02 0.0066 28.0 7.3 72 33-106 252-331 (396)
88 COG1983 PspC Putative stress-r 23.1 61 0.0013 24.8 1.9 41 203-245 20-60 (70)
89 PRK00293 dipZ thiol:disulfide 22.9 2.7E+02 0.0059 29.8 7.4 41 232-277 337-377 (571)
90 PRK14150 heat shock protein Gr 22.8 3.5E+02 0.0076 24.7 7.2 13 84-96 41-53 (193)
91 PF03188 Cytochrom_B561: Eukar 22.7 1.6E+02 0.0034 24.6 4.7 22 268-289 115-136 (137)
92 KOG0948|consensus 22.5 5.3E+02 0.011 29.0 9.3 36 189-224 925-966 (1041)
93 PF14817 HAUS5: HAUS augmin-li 22.3 1.5E+02 0.0032 32.3 5.2 17 358-374 502-518 (632)
94 PRK14160 heat shock protein Gr 22.2 4.1E+02 0.0088 24.7 7.5 45 79-123 56-103 (211)
95 PF07439 DUF1515: Protein of u 22.1 4.7E+02 0.01 21.8 8.2 24 76-99 7-30 (112)
96 PRK00409 recombination and DNA 22.1 1.6E+02 0.0035 32.8 5.7 15 74-88 517-531 (782)
97 PF11945 WASH_WAHD: WAHD domai 22.1 7.3E+02 0.016 24.3 9.6 55 75-129 16-70 (297)
98 PF07889 DUF1664: Protein of u 21.8 5E+02 0.011 22.1 8.1 33 75-107 41-73 (126)
99 PRK14145 heat shock protein Gr 21.8 3.7E+02 0.008 24.7 7.0 26 78-103 46-71 (196)
100 PF10267 Tmemb_cc2: Predicted 21.6 4.2E+02 0.0091 27.1 8.1 36 76-111 211-246 (395)
101 cd00890 Prefoldin Prefoldin is 21.4 4.3E+02 0.0092 21.6 7.0 34 74-107 84-117 (129)
102 PF02996 Prefoldin: Prefoldin 21.4 3.6E+02 0.0078 21.8 6.5 49 55-107 59-107 (120)
103 PRK09343 prefoldin subunit bet 21.4 4.9E+02 0.011 21.7 7.6 34 73-106 67-100 (121)
104 PF11471 Sugarporin_N: Maltopo 21.3 2.4E+02 0.0052 20.7 4.7 23 81-103 29-51 (60)
105 TIGR02209 ftsL_broad cell divi 21.3 3.7E+02 0.0081 20.4 7.5 29 79-107 26-54 (85)
106 PHA01750 hypothetical protein 21.2 3.1E+02 0.0068 20.7 5.2 15 83-97 41-55 (75)
107 PRK14161 heat shock protein Gr 21.2 3.6E+02 0.0078 24.3 6.8 16 82-97 24-39 (178)
108 KOG3850|consensus 21.2 5.3E+02 0.012 26.3 8.4 31 79-109 262-292 (455)
109 PF01024 Colicin: Colicin pore 21.1 1.2E+02 0.0026 27.7 3.7 32 213-244 144-175 (187)
110 PF00509 Hemagglutinin: Haemag 21.1 8.2E+02 0.018 26.1 10.1 57 49-105 375-431 (550)
111 PF06645 SPC12: Microsomal sig 21.0 68 0.0015 24.7 1.8 22 268-289 14-35 (76)
112 PF09325 Vps5: Vps5 C terminal 21.0 4E+02 0.0087 24.2 7.4 43 74-116 160-202 (236)
113 PF08181 DegQ: DegQ (SacQ) fam 20.9 2.8E+02 0.0061 18.8 5.4 29 78-106 5-33 (46)
114 PF00038 Filament: Intermediat 20.7 6E+02 0.013 24.4 8.9 6 120-125 297-302 (312)
115 cd00584 Prefoldin_alpha Prefol 20.6 4.5E+02 0.0097 21.7 7.0 49 55-107 69-117 (129)
116 PF11221 Med21: Subunit 21 of 20.5 3.8E+02 0.0083 23.0 6.7 36 67-106 91-126 (144)
117 PF07895 DUF1673: Protein of u 20.5 3.3E+02 0.0072 25.0 6.6 16 143-158 11-26 (205)
118 PF04508 Pox_A_type_inc: Viral 20.5 1.5E+02 0.0033 17.6 2.8 15 80-94 4-18 (23)
119 PF11119 DUF2633: Protein of u 20.4 80 0.0017 23.3 1.9 26 214-248 12-38 (59)
120 TIGR02005 PTS-IIBC-alpha PTS s 20.4 1.4E+02 0.003 31.7 4.5 49 233-285 141-189 (524)
121 PRK14144 heat shock protein Gr 20.2 3.6E+02 0.0077 24.9 6.6 23 79-101 47-69 (199)
122 PRK11677 hypothetical protein; 20.0 5.5E+02 0.012 22.1 7.4 47 74-120 33-79 (134)
No 1
>KOG2189|consensus
Probab=100.00 E-value=1.6e-79 Score=629.43 Aligned_cols=309 Identities=66% Similarity=1.106 Sum_probs=297.2
Q ss_pred hhhHHHHHHhcCCceEEEeeecCCcccCCCCCCccccEEEEEEEeChhhHHHHHHHHcccCceeecCCCCcchHHHHHHH
Q psy510 2 PAFERMLWRACRGNVFLRQAEIETPLEDINSGDPVYKSVFIIFFQGDQLKTRVMKICEGFRATLYPCPEAPTDRREMSMG 81 (380)
Q Consensus 2 ~~f~r~l~R~trGn~~~~~~~i~~~~~d~~~ge~~~k~v~ii~~~~~~~~~~v~ki~~~~~~~~~~~p~~~~~~~~~l~~ 81 (380)
..|||+|||+||||+|++++|+++|+.||++|+.++|+||||+++|++++.||+|||++|+++.||+|++++++.+.+.+
T Consensus 186 ~~fER~LWRa~Rgn~f~r~~~ie~~l~dp~Tge~~~K~vFivF~~Geql~~kIkKIcd~f~a~~yp~p~~~~er~~~~~~ 265 (829)
T KOG2189|consen 186 FAFERMLWRACRGNLFIRQSDIEEPLEDPKTGEPVEKNVFIVFFQGEQLKQKIKKICDGFGATLYPCPESPEERKEMLLE 265 (829)
T ss_pred HHHHHHHHHHhccceEEEeecccccccCCccCCcceeEEEEEEeecHHHHHHHHHHHhccCcEeecCCCChHHHHHHHHH
Confidence 57999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCcEEEEEEeccccCHHHHHHHHHhc
Q psy510 82 VTTRIEDLNTVIGETQDHRHRVLVAAAKNIKNWFIKVRKIKAIYHTLNFFNLDVTQKCLIAECWVPVLDIDAIQLALRRG 161 (380)
Q Consensus 82 l~~~i~~l~~~i~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~l~~~~~~~t~~~~~~~GWvP~~~~~~l~~~l~~~ 161 (380)
++.+|++++.++.++++++.+.+..+++.+..|...++|+|++|++||++++|.|++|+++|||||..+++.++++|++.
T Consensus 266 v~~ri~DL~~Vl~~t~~~r~~vL~~~~~~l~~W~~~v~K~KaIyhtLN~fn~Dvt~KCLIaE~W~P~~dl~~vq~aL~~~ 345 (829)
T KOG2189|consen 266 VNTRISDLQTVLDQTEDHRSRVLQAAAKNLPSWLIKVRKEKAIYHTLNMFNFDVTQKCLIAEGWCPVADLPDLQRALERG 345 (829)
T ss_pred HHHHHHHHHHHHhhhHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHhccCccccCceEEEEeecchhhHHHHHHHHHHh
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred cccCCCcccceecccCCCCCCCccccCcccchHHHHHHHHhcCCCCCccCCCchhhhHHHHHHHHHhhhhhHHHHHHHHH
Q psy510 162 TDRSGSSVPPILNRMDTFEDPPTYNRTNKFTKAFQALIDAYGVSSYREMNPTPYTIITFPFLFAVMFGDLGHGAIMFLFA 241 (380)
Q Consensus 162 ~~~~~~~v~~~~~~~~~~~~pPT~l~n~~~~~pFE~iv~~Yg~P~Y~EiDPT~~~aitFp~fFG~MfGD~GyGlll~l~~ 241 (380)
++.+|+.++.+.+..++++.|||+.|+|||+++||+|||.||+++|+|+||+|+++|||||+|++||||+|||++|+|++
T Consensus 346 ~~~sgS~v~~i~nv~~T~e~PPTy~RTNKFT~~FQ~IvDaYGVa~YrEvNPa~yTiITFPFLFAVMFGD~GHG~imlL~a 425 (829)
T KOG2189|consen 346 SEESGSQVPSILNVMETNEMPPTYFRTNKFTAGFQNIVDAYGVASYREVNPAPYTIITFPFLFAVMFGDLGHGLIMLLAA 425 (829)
T ss_pred hhhcCCcchhhHhheecCCCCCcchhcchhhHHHHHHHHhcccccccccCCCceeEeehHHHHHHHhcccchHHHHHHHH
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHhhccccccccchhHHHhhhhhHHHHHHHhHHHHHHHHHhcCCCCCCCCccccccCCCCCCCCcccccccc
Q psy510 242 AWMVMKEKPLMTLKTDNEIWKIFFGGRYIILLMGLFSMYTGLIYNDDRSGSSVPPILNRMDTFEDPPTYNRTNKF 316 (380)
Q Consensus 242 ~~l~~~~~k~~~~~~~~~~~~~~f~~r~il~~~Gi~si~~G~lyg~fFG~~l~~~~l~~~~~~~~~~~~~~t~~~ 316 (380)
+|+++++||+..++..+++|+|+|+||||+++||++|||+|+||||+|+.+++. +++ .|...++.++..
T Consensus 426 l~~Vl~Ekkl~~~k~~~EI~~mfF~GRYIIlLMGlFSiYTGliYND~FSks~ni-----FgS-~W~~~~~~~~~~ 494 (829)
T KOG2189|consen 426 LWMVLNEKKLASQKIGDEIFNMFFGGRYIILLMGLFSIYTGLIYNDFFSKSMNI-----FGS-SWSNPYNVTAVL 494 (829)
T ss_pred HHHHHhcchhhhhhhHHHHHHHHhcchHHHHHHHHHHHHHhhhhhhhccccccc-----ccC-cccCccccchhc
Confidence 999999999987777789999999999999999999999999999999999873 344 677777776443
No 2
>PF01496 V_ATPase_I: V-type ATPase 116kDa subunit family ; InterPro: IPR002490 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include: F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. The V-ATPases (or V1V0-ATPase) and A-ATPases (or A1A0-ATPase) are each composed of two linked complexes: the V1 or A1 complex contains the catalytic core that hydrolyses/synthesizes ATP, and the V0 or A0 complex that forms the membrane-spanning pore. The V- and A-ATPases both contain rotary motors, one that drives proton translocation across the membrane and one that drives ATP synthesis/hydrolysis [, , ]. The V- and A-ATPases more closely resemble one another in subunit structure than they do the F-ATPases, although the function of A-ATPases is closer to that of F-ATPases. This entry represents the 116kDa subunit (or subunit a) and subunit I found in the V0 or A0 complex of V- or A-ATPases, respectively. The 116kDa subunit is a transmembrane glycoprotein required for the assembly and proton transport activity of the ATPase complex. Several isoforms of the 116kDa subunit exist, providing a potential role in the differential targeting and regulation of the V-ATPase for specific organelles []. More information about this protein can be found at Protein of the Month: ATP Synthases [].; GO: 0015078 hydrogen ion transmembrane transporter activity, 0015991 ATP hydrolysis coupled proton transport, 0033177 proton-transporting two-sector ATPase complex, proton-transporting domain; PDB: 2RPW_X 2NVJ_A 2JTW_A 3RRK_A.
Probab=100.00 E-value=2.8e-63 Score=535.88 Aligned_cols=287 Identities=41% Similarity=0.713 Sum_probs=125.5
Q ss_pred hhhHHHHHHhcCCceEEEeeecCCcccCCCCCCccccEEEEEEEeChhhHHHHHHHHcccCceeecCCCCcchHHHHHHH
Q psy510 2 PAFERMLWRACRGNVFLRQAEIETPLEDINSGDPVYKSVFIIFFQGDQLKTRVMKICEGFRATLYPCPEAPTDRREMSMG 81 (380)
Q Consensus 2 ~~f~r~l~R~trGn~~~~~~~i~~~~~d~~~ge~~~k~v~ii~~~~~~~~~~v~ki~~~~~~~~~~~p~~~~~~~~~l~~ 81 (380)
.+|+|.+||++|||+|+++.+++.+..|+. +..+++++++++++++.++++++||+++|++++++|+.++.+.+.+++
T Consensus 156 ~~f~~~l~r~~~~N~fi~~~~Ie~~~~d~~--e~~~k~v~vv~~~~~~~~~kv~~il~~~~f~~~~~p~~~~~p~e~~~~ 233 (759)
T PF01496_consen 156 ESFERILWRATRGNIFIRFSEIEEILEDPK--EEVEKEVFVVFFSGKELEEKVKKILRSFGFERYDLPEDEGTPEEAIKE 233 (759)
T ss_dssp HHHHHHHHHHHTT-----S------EEEE---EE-SSSEEEEEEEEGGGHHHHHHHHHTTT--B----GGGGG-HHHHHH
T ss_pred HHHHHHHHHhccCCeEEEEEeeeccccccc--ceeeeeeEEEEEEchhhHHHHHHHhhccCceecCCCCccccHHHHHHH
Confidence 579999999999999999999999888875 778899999999999999999999999999999999888899999999
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccC-cEEEEEEeccccCHHHHHHHHHh
Q psy510 82 VTTRIEDLNTVIGETQDHRHRVLVAAAKNIKNWFIKVRKIKAIYHTLNFFNLDVTQ-KCLIAECWVPVLDIDAIQLALRR 160 (380)
Q Consensus 82 l~~~i~~l~~~i~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~l~~~~~~~t~-~~~~~~GWvP~~~~~~l~~~l~~ 160 (380)
++++++++++++++++++++++++++.+.+..|..++.+++++|++++++. .++ ++++++||||+++++++++.|++
T Consensus 234 l~~~i~~l~~~~~~~~~~l~~~~~~~~~~l~~~~~~l~~~~~~~~~~~~~~--~~~~~~~~~~GWvP~~~~~~l~~~l~~ 311 (759)
T PF01496_consen 234 LEEEIEELEKELEELEEELKKLLEKYAEELEAWYEYLRKEKEIYEALNKFA--STETNVFILEGWVPEKDVEELKKALEE 311 (759)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--TT---SEEEEEEE-TTTHHHHHHT--S
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc--cccccEEEEEEeccHHHHHHHHHHHHh
Confidence 999999999999999999999999999999999999999999999999766 455 89999999999999999999998
Q ss_pred ccccCCCcccceecccCCCCCCCccccCcccchHHHHHHHHhcCCCCCccCCCchhhhHHHHHHHHHhhhhhHHHHHHHH
Q psy510 161 GTDRSGSSVPPILNRMDTFEDPPTYNRTNKFTKAFQALIDAYGVSSYREMNPTPYTIITFPFLFAVMFGDLGHGAIMFLF 240 (380)
Q Consensus 161 ~~~~~~~~v~~~~~~~~~~~~pPT~l~n~~~~~pFE~iv~~Yg~P~Y~EiDPT~~~aitFp~fFG~MfGD~GyGlll~l~ 240 (380)
........+....++++++++|||+++||+|++|||.||+|||+|+|+|||||+++|+||||||||||||+|||++|+++
T Consensus 312 ~~~~~~~~v~~~~~~~~~~~~pPt~lknn~~~~pFe~iv~~Yg~P~Y~EiDPt~~~ai~fp~fFG~MfGD~GyGlll~l~ 391 (759)
T PF01496_consen 312 ATDGSEYSVPSIEEEPEEEEEPPTKLKNNKFTKPFEMIVDMYGLPKYREIDPTPFMAITFPFFFGMMFGDAGYGLLLLLF 391 (759)
T ss_dssp S-EEEE--------------------------------------------------------------------------
T ss_pred hccccccccccccccccccCCCCeeecCchhhhHHHHHHHhcCCCCCCccccchHHHHHHHHHHHHHHhhHHHHHHHHHH
Confidence 76543322333445556778999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHhhccccccccchhHHHhhhhhHHHHHHHhHHHHHHHHHhcCCCCCCCC
Q psy510 241 AAWMVMKEKPLMTLKTDNEIWKIFFGGRYIILLMGLFSMYTGLIYNDDRSGSSV 294 (380)
Q Consensus 241 ~~~l~~~~~k~~~~~~~~~~~~~~f~~r~il~~~Gi~si~~G~lyg~fFG~~l~ 294 (380)
|++++++.++... ..++++++++++|+++++||++|||||+|||||||.++.
T Consensus 392 ~l~l~~~~~~~~~--~~~e~~~~~~~~~~il~~~gi~si~~G~iyg~~FG~~~~ 443 (759)
T PF01496_consen 392 GLLLIKKFKKLKK--MKNEIFNMLFKLRYILLLMGISSIIFGFIYGSFFGDSLN 443 (759)
T ss_dssp ------------------------------------------------------
T ss_pred HHHHHHhccccch--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHhcCcch
Confidence 9999988777653 237899999999999999999999999999999997653
No 3
>PRK05771 V-type ATP synthase subunit I; Validated
Probab=100.00 E-value=1.3e-52 Score=445.76 Aligned_cols=263 Identities=20% Similarity=0.250 Sum_probs=224.0
Q ss_pred ccccEEEEEEEeChhhHHHHHHHHcccCceeecCCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 35 PVYKSVFIIFFQGDQLKTRVMKICEGFRATLYPCPEAPTDRREMSMGVTTRIEDLNTVIGETQDHRHRVLVAAAKNIKNW 114 (380)
Q Consensus 35 ~~~k~v~ii~~~~~~~~~~v~ki~~~~~~~~~~~p~~~~~~~~~l~~l~~~i~~l~~~i~~~~~~~~~~~~~~~~~l~~~ 114 (380)
...+.++++++.+++..+++.++|++++++.+++|. ++.+.+.+++++++++++++++++++++++++...+...+..|
T Consensus 174 ~~~~~~~~vvv~~~~~~~~~~~~l~~~~f~~~~~p~-~~~p~~~l~~l~~~l~~l~~~~~~~~~~l~~~~~~~~~~~~~~ 252 (646)
T PRK05771 174 TDKGYVYVVVVVLKELSDEVEEELKKLGFERLELEE-EGTPSELIREIKEELEEIEKERESLLEELKELAKKYLEELLAL 252 (646)
T ss_pred ecCCcEEEEEEEEhhhHHHHHHHHHHCCCEEecCCC-CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 456667777777777788999999999999999998 8999999999999999999999999999999998888888889
Q ss_pred HHHHHHHHHHHHHhccccccccCcEEEEEEeccccCHHHHHHHHHhccccCCCcccceeccc-CCCCCCCccccCcccch
Q psy510 115 FIKVRKIKAIYHTLNFFNLDVTQKCLIAECWVPVLDIDAIQLALRRGTDRSGSSVPPILNRM-DTFEDPPTYNRTNKFTK 193 (380)
Q Consensus 115 ~~~~~~~~~~~~~l~~~~~~~t~~~~~~~GWvP~~~~~~l~~~l~~~~~~~~~~v~~~~~~~-~~~~~pPT~l~n~~~~~ 193 (380)
..++..+++.+++++++ ..|+++++++||||+++++++++.+++.... .+.....++ +++++|||+++||+|++
T Consensus 253 ~~~l~~~~~~~~~~~~~--~~t~~~~~l~GWvP~~~~~~l~~~l~~~~~~---~~~v~~~~~~~~~~~~Pt~l~N~~~~~ 327 (646)
T PRK05771 253 YEYLEIELERAEALSKF--LKTDKTFAIEGWVPEDRVKKLKELIDKATGG---SAYVEFVEPDEEEEEVPTKLKNPKFIK 327 (646)
T ss_pred HHHHHHHHHHHHHHHhh--hcCCcEEEEEEEeehhHHHHHHHHHHHhcCC---cEEEEEeCCCCcCCCCCEEeeCCchhh
Confidence 99999999999999855 4789999999999999999999999986422 233345556 46689999999999999
Q ss_pred HHHHHHHHhcCCCCCccCCCchhhhHHHHHHHHHhhhhhHHHHHHHHHHHHHHhhccccccccchhHHHhhhhhHHHHHH
Q psy510 194 AFQALIDAYGVSSYREMNPTPYTIITFPFLFAVMFGDLGHGAIMFLFAAWMVMKEKPLMTLKTDNEIWKIFFGGRYIILL 273 (380)
Q Consensus 194 pFE~iv~~Yg~P~Y~EiDPT~~~aitFp~fFG~MfGD~GyGlll~l~~~~l~~~~~k~~~~~~~~~~~~~~f~~r~il~~ 273 (380)
|||.+|+|||+|+|+|+|||+++|+|||+||||||||+|||++|+++|++++++.+|. ++.++ +.++++++
T Consensus 328 pFE~lv~mYg~P~Y~EiDPT~~~ai~f~lfFGmM~gD~GyGLil~l~~~~l~~~~~k~------~~~~~---~~~~il~~ 398 (646)
T PRK05771 328 PFESLTEMYSLPKYNEIDPTPFLAIFFPLFFGMMLGDAGYGLLLLLIGLLLSFKLKKK------SEGLK---RLLKILIY 398 (646)
T ss_pred hHHHHHHHcCCCCCCCcCCccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHhcccc------cHHHH---HHHHHHHH
Confidence 9999999999999999999999999999999999999999999999999988654332 12222 34689999
Q ss_pred HhHHHHHHHHHhcCCCCCCCCcccc----------ccCCCCCCCCcccc
Q psy510 274 MGLFSMYTGLIYNDDRSGSSVPPIL----------NRMDTFEDPPTYNR 312 (380)
Q Consensus 274 ~Gi~si~~G~lyg~fFG~~l~~~~l----------~~~~~~~~~~~~~~ 312 (380)
||++||+||++||||||.++..+.+ ..++|++||+.++.
T Consensus 399 ~gi~sii~G~lyG~fFG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ 447 (646)
T PRK05771 399 LGISTIIWGLLTGSFFGFSLPIFLPGGYLELPEGYPSLSTENDVMTILI 447 (646)
T ss_pred HHHHHHHHHHHHHhHhcCccccccccccccccCCccccCCCccHHHHHH
Confidence 9999999999999999987753221 01467777776655
No 4
>COG1269 NtpI Archaeal/vacuolar-type H+-ATPase subunit I [Energy production and conversion]
Probab=100.00 E-value=3.9e-45 Score=387.41 Aligned_cols=243 Identities=26% Similarity=0.409 Sum_probs=214.2
Q ss_pred CccccEEEEEEEeChhhHHHHHHHHcccCceeecCCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 34 DPVYKSVFIIFFQGDQLKTRVMKICEGFRATLYPCPEAPTDRREMSMGVTTRIEDLNTVIGETQDHRHRVLVAAAKNIKN 113 (380)
Q Consensus 34 e~~~k~v~ii~~~~~~~~~~v~ki~~~~~~~~~~~p~~~~~~~~~l~~l~~~i~~l~~~i~~~~~~~~~~~~~~~~~l~~ 113 (380)
+...+.+++++.+.++..+++.+++++.+++.++.|+.++.+.+.+.++++++++.+.++++++++++++.+.+.+++..
T Consensus 186 ~~~~~~~~~v~~~~~~~~~~v~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~l~~~~~~~~~~l~~ 265 (660)
T COG1269 186 ENVEASVVIVVAHGAEDLDKVSKILNELGFELYEVPEFDGGPSELISELEEVIAEIQDELESLRSELEALAEKIAEELLA 265 (660)
T ss_pred hccccceEEEEEecccchHHHHHHHHhCCcEEeeccccCCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44567788888998999999999999999999999988777999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHHHHHhccccccccCcEEEEEEeccccCHHHHHHHHHhccccCCCcccceecccCC---CCCCCccccCcc
Q psy510 114 WFIKVRKIKAIYHTLNFFNLDVTQKCLIAECWVPVLDIDAIQLALRRGTDRSGSSVPPILNRMDT---FEDPPTYNRTNK 190 (380)
Q Consensus 114 ~~~~~~~~~~~~~~l~~~~~~~t~~~~~~~GWvP~~~~~~l~~~l~~~~~~~~~~v~~~~~~~~~---~~~pPT~l~n~~ 190 (380)
|+..+..++..++.++.+. .|++++.++||||+++++++++.+++.++. .. .++..+. +++|||+++||+
T Consensus 266 ~~~~l~~e~~~~~~~~~~~--~t~~~~~~eGWvP~~~~~~~~~~i~~~~~~---~~--~~~~~~~~~~~e~~Pt~l~n~~ 338 (660)
T COG1269 266 VREILEIEKALGDVLSKLA--RTEYTLAIEGWVPADEVEKLKKIINRATGG---AA--YFEVSETDEDKEEVPTKLRNPK 338 (660)
T ss_pred HHHHHHHHHHHHHHHHHhh--ccceEEEEEEeccHHHHHHHHHHHHHhcCC---ce--EEEeecCCCccCCCCEeecCCc
Confidence 9999999999999998554 677899999999999999999999997531 22 2333333 489999999999
Q ss_pred cchHHHHHHHHhcCCCCCccCCCchhhhHHHHHHHHHhhhhhHHHHHHHHHHHHHHhhccccccccchhHHHhhhhhHHH
Q psy510 191 FTKAFQALIDAYGVSSYREMNPTPYTIITFPFLFAVMFGDLGHGAIMFLFAAWMVMKEKPLMTLKTDNEIWKIFFGGRYI 270 (380)
Q Consensus 191 ~~~pFE~iv~~Yg~P~Y~EiDPT~~~aitFp~fFG~MfGD~GyGlll~l~~~~l~~~~~k~~~~~~~~~~~~~~f~~r~i 270 (380)
|++|||.+|+|||+|+|+|||||+++|+|||+||||||||+|||++++++|++++++.+|. ..+.++.+ ..+
T Consensus 339 ~i~~Fe~l~emY~iPkY~EidPt~~~a~~Fp~fFG~M~gD~gyGlll~l~sl~l~~~~~~~-----~~~~~~~l---~~~ 410 (660)
T COG1269 339 FISPFESLTEMYGIPKYGEIDPTPFLALFFPLFFGIMFGDLGYGLLLFLISLLLLRYFKKR-----LPEGLKKL---GKI 410 (660)
T ss_pred ccchHHHHHHHhcCCCCCCcCCcchHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHhcccc-----cchhHHHH---HHH
Confidence 9999999999999999999999999999999999999999999999999999999887641 12233333 248
Q ss_pred HHHHhHHHHHHHHHhcCCCCC
Q psy510 271 ILLMGLFSMYTGLIYNDDRSG 291 (380)
Q Consensus 271 l~~~Gi~si~~G~lyg~fFG~ 291 (380)
++++|++|++||++||+|||.
T Consensus 411 ~~~~~i~t~i~G~l~g~~fG~ 431 (660)
T COG1269 411 LLYLGISTIIWGFLYGEFFGP 431 (660)
T ss_pred HHHHHHHHHHHHHHhccccCC
Confidence 999999999999999999996
No 5
>KOG2189|consensus
Probab=99.75 E-value=2.4e-19 Score=186.00 Aligned_cols=67 Identities=52% Similarity=0.783 Sum_probs=60.8
Q ss_pred ccccchhHHHHHHhhcccCCCCCCCeeecCCCCCCCCCccccchhhHhHhhhhhhhccCCccccCCC
Q psy510 313 TNKFTKAFQALIDAYGDRSGSSVPPILNRMDTFEDPPTYNRTNKFTKAFQALIDAYGVSSYREMNPS 379 (380)
Q Consensus 313 t~~~~~~~q~ll~~~~~~~~~~~~~~~~~~~~~~~ppt~~~~n~ft~~fq~~~~~yg~~~y~evnp~ 379 (380)
|+..-..+|..|+++++.+|++|++++|.++++++||||+||||||++||+|||||||++|||||||
T Consensus 331 P~~dl~~vq~aL~~~~~~sgS~v~~i~nv~~T~e~PPTy~RTNKFT~~FQ~IvDaYGVa~YrEvNPa 397 (829)
T KOG2189|consen 331 PVADLPDLQRALERGSEESGSQVPSILNVMETNEMPPTYFRTNKFTAGFQNIVDAYGVASYREVNPA 397 (829)
T ss_pred chhhHHHHHHHHHHhhhhcCCcchhhHhheecCCCCCcchhcchhhHHHHHHHHhcccccccccCCC
Confidence 3334456677788889999999999999999999999999999999999999999999999999998
No 6
>PF01496 V_ATPase_I: V-type ATPase 116kDa subunit family ; InterPro: IPR002490 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include: F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. The V-ATPases (or V1V0-ATPase) and A-ATPases (or A1A0-ATPase) are each composed of two linked complexes: the V1 or A1 complex contains the catalytic core that hydrolyses/synthesizes ATP, and the V0 or A0 complex that forms the membrane-spanning pore. The V- and A-ATPases both contain rotary motors, one that drives proton translocation across the membrane and one that drives ATP synthesis/hydrolysis [, , ]. The V- and A-ATPases more closely resemble one another in subunit structure than they do the F-ATPases, although the function of A-ATPases is closer to that of F-ATPases. This entry represents the 116kDa subunit (or subunit a) and subunit I found in the V0 or A0 complex of V- or A-ATPases, respectively. The 116kDa subunit is a transmembrane glycoprotein required for the assembly and proton transport activity of the ATPase complex. Several isoforms of the 116kDa subunit exist, providing a potential role in the differential targeting and regulation of the V-ATPase for specific organelles []. More information about this protein can be found at Protein of the Month: ATP Synthases [].; GO: 0015078 hydrogen ion transmembrane transporter activity, 0015991 ATP hydrolysis coupled proton transport, 0033177 proton-transporting two-sector ATPase complex, proton-transporting domain; PDB: 2RPW_X 2NVJ_A 2JTW_A 3RRK_A.
Probab=98.64 E-value=4.4e-09 Score=114.70 Aligned_cols=67 Identities=36% Similarity=0.633 Sum_probs=10.1
Q ss_pred ccccchhHHHHHHhhcccCCCCCCCeeecCCCCCCCCCccccchhhHhHhhhhhhhccCCccccCCC
Q psy510 313 TNKFTKAFQALIDAYGDRSGSSVPPILNRMDTFEDPPTYNRTNKFTKAFQALIDAYGVSSYREMNPS 379 (380)
Q Consensus 313 t~~~~~~~q~ll~~~~~~~~~~~~~~~~~~~~~~~ppt~~~~n~ft~~fq~~~~~yg~~~y~evnp~ 379 (380)
+++..+.++..+++.+......++...........|||++++|+|+.+||.|||+||+|+|+|+||+
T Consensus 298 P~~~~~~l~~~l~~~~~~~~~~v~~~~~~~~~~~~pPt~lknn~~~~pFe~iv~~Yg~P~Y~EiDPt 364 (759)
T PF01496_consen 298 PEKDVEELKKALEEATDGSEYSVPSIEEEPEEEEEPPTKLKNNKFTKPFEMIVDMYGLPKYREIDPT 364 (759)
T ss_dssp -TTTHHHHHHT--SS-EEEE-----------------------------------------------
T ss_pred cHHHHHHHHHHHHhhccccccccccccccccccCCCCeeecCchhhhHHHHHHHhcCCCCCCccccc
Confidence 5556677777777666544444566666666788999999999999999999999999999999996
No 7
>PRK05771 V-type ATP synthase subunit I; Validated
Probab=98.01 E-value=3.8e-06 Score=90.22 Aligned_cols=62 Identities=29% Similarity=0.528 Sum_probs=46.1
Q ss_pred ccccchhHHHHHHhhcccCCCCCCCeeecC---CCCCCCCCccccchhhHhHhhhhhhhccCCccccCCC
Q psy510 313 TNKFTKAFQALIDAYGDRSGSSVPPILNRM---DTFEDPPTYNRTNKFTKAFQALIDAYGVSSYREMNPS 379 (380)
Q Consensus 313 t~~~~~~~q~ll~~~~~~~~~~~~~~~~~~---~~~~~ppt~~~~n~ft~~fq~~~~~yg~~~y~evnp~ 379 (380)
|++.-+.++..+++.+. + . ..+... ...+.|||+.++|+|+..||.++|.||+|+|+|+||+
T Consensus 283 P~~~~~~l~~~l~~~~~--~-~--~~v~~~~~~~~~~~~Pt~l~N~~~~~pFE~lv~mYg~P~Y~EiDPT 347 (646)
T PRK05771 283 PEDRVKKLKELIDKATG--G-S--AYVEFVEPDEEEEEVPTKLKNPKFIKPFESLTEMYSLPKYNEIDPT 347 (646)
T ss_pred ehhHHHHHHHHHHHhcC--C-c--EEEEEeCCCCcCCCCCEEeeCCchhhhHHHHHHHcCCCCCCCcCCc
Confidence 55556667777775331 1 1 223222 2346899999999999999999999999999999996
No 8
>COG1269 NtpI Archaeal/vacuolar-type H+-ATPase subunit I [Energy production and conversion]
Probab=97.76 E-value=1.7e-05 Score=85.34 Aligned_cols=62 Identities=24% Similarity=0.450 Sum_probs=50.1
Q ss_pred ccccchhHHHHHHhhcccCCCCCCCeeecCCCC---CCCCCccccchhhHhHhhhhhhhccCCccccCCC
Q psy510 313 TNKFTKAFQALIDAYGDRSGSSVPPILNRMDTF---EDPPTYNRTNKFTKAFQALIDAYGVSSYREMNPS 379 (380)
Q Consensus 313 t~~~~~~~q~ll~~~~~~~~~~~~~~~~~~~~~---~~ppt~~~~n~ft~~fq~~~~~yg~~~y~evnp~ 379 (380)
+++--..++..+++++. ....++..+++ +.|||++++|+|++.||.+++.||+|.|.|+||+
T Consensus 297 P~~~~~~~~~~i~~~~~-----~~~~~~~~~~~~~~e~~Pt~l~n~~~i~~Fe~l~emY~iPkY~EidPt 361 (660)
T COG1269 297 PADEVEKLKKIINRATG-----GAAYFEVSETDEDKEEVPTKLRNPKFISPFESLTEMYGIPKYGEIDPT 361 (660)
T ss_pred cHHHHHHHHHHHHHhcC-----CceEEEeecCCCccCCCCEeecCCcccchHHHHHHHhcCCCCCCcCCc
Confidence 44445577777776542 33566666666 7999999999999999999999999999999996
No 9
>PF05377 FlaC_arch: Flagella accessory protein C (FlaC); InterPro: IPR008039 Although archaeal flagella appear superficially similar to those of bacteria, they are quite distinct []. In several archaea, the flagellin genes are followed immediately by the flagellar accessory genes flaCDEFGHIJ. The gene products may have a role in translocation, secretion, or assembly of the flagellum. FlaC is a protein whose exact role is unknown but it has been shown to be membrane-associated (by immuno-blotting fractionated cells) [].
Probab=80.18 E-value=15 Score=26.75 Aligned_cols=47 Identities=6% Similarity=0.219 Sum_probs=30.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy510 79 SMGVTTRIEDLNTVIGETQDHRHRVLVAAAKNIKNWFIKVRKIKAIYHTLN 129 (380)
Q Consensus 79 l~~l~~~i~~l~~~i~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~l~ 129 (380)
+.+++.++..++..+...+++..++.+ .++.....+++..++|+..+
T Consensus 2 i~elEn~~~~~~~~i~tvk~en~~i~~----~ve~i~envk~ll~lYE~Vs 48 (55)
T PF05377_consen 2 IDELENELPRIESSINTVKKENEEISE----SVEKIEENVKDLLSLYEVVS 48 (55)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHH
Confidence 345666777777766666666555433 34556667788888888654
No 10
>PF03962 Mnd1: Mnd1 family; InterPro: IPR005647 This family of proteins includes meiotic nuclear division protein 1 (MND1) from Saccharomyces cerevisiae (Baker's yeast). The mnd1 protein forms a complex with hop2 to promote homologous chromosome pairing and meiotic double-strand break repair [].
Probab=63.96 E-value=7 Score=35.63 Aligned_cols=17 Identities=12% Similarity=0.114 Sum_probs=11.1
Q ss_pred CCccccEEEEEEEeChh
Q psy510 33 GDPVYKSVFIIFFQGDQ 49 (380)
Q Consensus 33 ge~~~k~v~ii~~~~~~ 49 (380)
.+..+...+..+|++..
T Consensus 47 ~EKiGssn~YWsFps~~ 63 (188)
T PF03962_consen 47 VEKIGSSNYYWSFPSQA 63 (188)
T ss_pred hhhccCeeEEEecChHH
Confidence 35666777777777643
No 11
>PF07666 MpPF26: M penetrans paralogue family 26; InterPro: IPR011655 These proteins include those ascribed to M penetrans paralogue family 26 in [].
Probab=63.42 E-value=31 Score=29.57 Aligned_cols=80 Identities=14% Similarity=0.175 Sum_probs=48.9
Q ss_pred HhcCCCCCccCCCchhhhHHHHHHHHHhhhhhHHHHHHHHHHHHHHhhccccccccchhHHHhhhhhHHHHHHHhHHH-H
Q psy510 201 AYGVSSYREMNPTPYTIITFPFLFAVMFGDLGHGAIMFLFAAWMVMKEKPLMTLKTDNEIWKIFFGGRYIILLMGLFS-M 279 (380)
Q Consensus 201 ~Yg~P~Y~EiDPT~~~aitFp~fFG~MfGD~GyGlll~l~~~~l~~~~~k~~~~~~~~~~~~~~f~~r~il~~~Gi~s-i 279 (380)
....+++.|.+++...+..-...+-+|+-|. ++++.++++..+..|.++++.+...-+-.. ...+++++|++= -
T Consensus 40 ~~~~~~~~~~~~~~~~~~l~igil~i~~~~i-~~i~~~Il~Ivl~iKis~~k~~~~~~~k~~----~~~iL~IIGi~i~~ 114 (130)
T PF07666_consen 40 SNNTSNNYEEESSMSIGNLVIGILLIIFSGI-FYIVNFILGIVLIIKISSLKNKHPEFKKVT----VHKILLIIGIFISP 114 (130)
T ss_pred hhccccccccccchhhHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhccCcccccch----hhhhhhhhhhHHhh
Confidence 4445666667777777666566666777777 999999999999887665432111111101 134667777654 4
Q ss_pred HHHHHh
Q psy510 280 YTGLIY 285 (380)
Q Consensus 280 ~~G~ly 285 (380)
++++++
T Consensus 115 i~~ii~ 120 (130)
T PF07666_consen 115 ICSIID 120 (130)
T ss_pred HHHHHH
Confidence 555543
No 12
>COG2383 Uncharacterized conserved protein [Function unknown]
Probab=57.75 E-value=12 Score=30.54 Aligned_cols=25 Identities=32% Similarity=0.515 Sum_probs=21.5
Q ss_pred HHHHhhhhhHHHHHHHHHHHHHHhh
Q psy510 224 FAVMFGDLGHGAIMFLFAAWMVMKE 248 (380)
Q Consensus 224 FG~MfGD~GyGlll~l~~~~l~~~~ 248 (380)
|+-|.+|+|-|.++-++.-|..+|.
T Consensus 15 f~g~l~dlG~G~ilGf~~GyAlkk~ 39 (109)
T COG2383 15 FTGILPDLGIGGILGFAAGYALKKA 39 (109)
T ss_pred hhhccccccchhHHHHHHHHHHHHH
Confidence 7889999999999988888887543
No 13
>PF01102 Glycophorin_A: Glycophorin A; InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=56.48 E-value=15 Score=31.13 Aligned_cols=24 Identities=29% Similarity=0.548 Sum_probs=14.9
Q ss_pred HHHHHHHHhhhhhHHHHHHHHHHHHHH
Q psy510 220 FPFLFAVMFGDLGHGAIMFLFAAWMVM 246 (380)
Q Consensus 220 Fp~fFG~MfGD~GyGlll~l~~~~l~~ 246 (380)
..+.||+|.|=+| +++|+.+++.|
T Consensus 67 ~~Ii~gv~aGvIg---~Illi~y~irR 90 (122)
T PF01102_consen 67 IGIIFGVMAGVIG---IILLISYCIRR 90 (122)
T ss_dssp HHHHHHHHHHHHH---HHHHHHHHHHH
T ss_pred eehhHHHHHHHHH---HHHHHHHHHHH
Confidence 4688899988776 33344444443
No 14
>PF08946 Osmo_CC: Osmosensory transporter coiled coil; InterPro: IPR015041 The osmosensory transporter coiled coil is a C-terminal domain found in various bacterial osmoprotective transporters, such as ProP, Proline/betaine transporter, Proline permease 2 and the citrate proton symporters. It adopts an antiparallel coiled-coil structure, and is essential for osmosensory and osmoprotectant transporter function []. ; PDB: 1R48_B.
Probab=54.15 E-value=30 Score=24.05 Aligned_cols=31 Identities=26% Similarity=0.438 Sum_probs=18.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 76 REMSMGVTTRIEDLNTVIGETQDHRHRVLVA 106 (380)
Q Consensus 76 ~~~l~~l~~~i~~l~~~i~~~~~~~~~~~~~ 106 (380)
.+....++++++++..+|.++++++..+...
T Consensus 11 qe~~d~IEqkiedid~qIaeLe~KR~~Lv~q 41 (46)
T PF08946_consen 11 QEHYDNIEQKIEDIDEQIAELEAKRQRLVDQ 41 (46)
T ss_dssp ----THHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHhHHHhHHHHHHHHHHHHHHHHHHHHh
Confidence 4445567777777777777777666665544
No 15
>PF12725 DUF3810: Protein of unknown function (DUF3810); InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=53.34 E-value=24 Score=34.81 Aligned_cols=57 Identities=18% Similarity=0.333 Sum_probs=33.8
Q ss_pred HHhhhhhHHHHHHHHHHHHHHhhccccccccchhH--HHhhhhhHHHHHHHhHHHHHHHHHhcC
Q psy510 226 VMFGDLGHGAIMFLFAAWMVMKEKPLMTLKTDNEI--WKIFFGGRYIILLMGLFSMYTGLIYND 287 (380)
Q Consensus 226 ~MfGD~GyGlll~l~~~~l~~~~~k~~~~~~~~~~--~~~~f~~r~il~~~Gi~si~~G~lyg~ 287 (380)
+-+||+.+-++++++..++++..++.. +.+... .+.+ ..+..+.-++.+.||+-|.-
T Consensus 23 FSvgdi~~~~~il~ll~~~~~~~~~~~--k~~~~~~l~~~~---~~~~~~y~~F~~~WGlNY~R 81 (318)
T PF12725_consen 23 FSVGDILYYLLILFLLYYLIRLIRKIF--KKKKRFKLLNIL---FFLSVLYFLFYLLWGLNYYR 81 (318)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHH--hhhhHHHHHHHH---HHHHHHHHHHHHHhhhhcCC
Confidence 458999999877766666665444332 111111 2221 23556667888889988753
No 16
>PF04977 DivIC: Septum formation initiator; InterPro: IPR007060 DivIC, from the spore-forming, Gram-positive bacterium Bacillus subtilis, is necessary for both vegetative and sporulation septum formation []. These proteins are mainly composed of an N-terminal coiled-coil. DivIB, DivIC and FtsL inter-depend on each other for stabilisation and localisation. The latter two form a heterodimer. DivIC is always centre cell but the other two associate with it during septation [].; GO: 0007049 cell cycle
Probab=51.79 E-value=95 Score=23.24 Aligned_cols=28 Identities=14% Similarity=0.192 Sum_probs=16.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 79 SMGVTTRIEDLNTVIGETQDHRHRVLVA 106 (380)
Q Consensus 79 l~~l~~~i~~l~~~i~~~~~~~~~~~~~ 106 (380)
+.+++++++++++++++++++..++-.+
T Consensus 19 ~~~~~~ei~~l~~~i~~l~~e~~~L~~e 46 (80)
T PF04977_consen 19 YYQLNQEIAELQKEIEELKKENEELKEE 46 (80)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3456666666666666666655554333
No 17
>PRK15028 cytochrome bd-II oxidase subunit 2; Provisional
Probab=51.31 E-value=48 Score=33.60 Aligned_cols=73 Identities=11% Similarity=0.185 Sum_probs=44.9
Q ss_pred HHHHHHHHHhhhhhHHHHHHHHHHHHHHhhccccccccchhHHHhhhhhHHHHHHHhHHHHHHHHHhcCCC-CCCC
Q psy510 219 TFPFLFAVMFGDLGHGAIMFLFAAWMVMKEKPLMTLKTDNEIWKIFFGGRYIILLMGLFSMYTGLIYNDDR-SGSS 293 (380)
Q Consensus 219 tFp~fFG~MfGD~GyGlll~l~~~~l~~~~~k~~~~~~~~~~~~~~f~~r~il~~~Gi~si~~G~lyg~fF-G~~l 293 (380)
-||..++.||+-.==-+++.|+|+.+.--....+. |..++.||...+ +-+..-.-+.++.+|.+.|.+. |.++
T Consensus 74 AFP~~Ya~lfS~lYlpl~l~L~~LIlRgvafEfR~-k~~~~~wr~~Wd-~~f~vgS~l~~f~~Gv~~g~~v~G~p~ 147 (378)
T PRK15028 74 AWPRVYAAAFSGFYVAMILVLCSLFFRPLAFDYRG-KIADARWRKMWD-AGLVIGSLVPPVVFGIAFGNLLLGVPF 147 (378)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHhhhhheecc-cCCChHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHcCcee
Confidence 69999999999887778888888887543222322 111223333333 1122333467778888888886 6544
No 18
>PF06103 DUF948: Bacterial protein of unknown function (DUF948); InterPro: IPR009293 This family consists of bacterial sequences several of which are thought to be general stress proteins.
Probab=50.67 E-value=1.2e+02 Score=23.70 Aligned_cols=32 Identities=13% Similarity=0.144 Sum_probs=16.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 77 EMSMGVTTRIEDLNTVIGETQDHRHRVLVAAA 108 (380)
Q Consensus 77 ~~l~~l~~~i~~l~~~i~~~~~~~~~~~~~~~ 108 (380)
+.++++++.++++++++..+..+..+++.+..
T Consensus 26 ~~l~~~~~ti~~l~~~~~~i~~e~~~ll~~~n 57 (90)
T PF06103_consen 26 KTLDEVNKTIDTLQEQVDPITKEINDLLHNTN 57 (90)
T ss_pred HHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHH
Confidence 34445555555555555555555555444443
No 19
>PF12325 TMF_TATA_bd: TATA element modulatory factor 1 TATA binding; InterPro: IPR022091 This is the C-terminal conserved coiled coil region of a family of TATA element modulatory factor 1 proteins conserved in eukaryotes []. The proteins bind to the TATA element of some RNA polymerase II promoters and repress their activity. by competing with the binding of TATA binding protein. TMF1_TATA_bd is the most conserved part of the TMFs []. TMFs are evolutionarily conserved golgins that bind Rab6, a ubiquitous ras-like GTP-binding Golgi protein, and contribute to Golgi organisation in animal [] and plant cells. The Rab6-binding domain appears to be the same region as this C-terminal family [].
Probab=50.40 E-value=1.3e+02 Score=25.41 Aligned_cols=13 Identities=8% Similarity=0.227 Sum_probs=5.3
Q ss_pred HHHHHHHHHHHHH
Q psy510 114 WFIKVRKIKAIYH 126 (380)
Q Consensus 114 ~~~~~~~~~~~~~ 126 (380)
+...+......|+
T Consensus 73 L~~el~~l~~ry~ 85 (120)
T PF12325_consen 73 LEQELEELQQRYQ 85 (120)
T ss_pred HHHHHHHHHHHHH
Confidence 3334444444443
No 20
>KOG4800|consensus
Probab=49.11 E-value=56 Score=30.64 Aligned_cols=76 Identities=12% Similarity=0.155 Sum_probs=48.1
Q ss_pred CchhhhHHHHHHHHHhhhhhHHHHHHHHHHHHHHhhcccc---ccccchh-HHHhhhhhHHH-----HHHHhHHHHHHHH
Q psy510 213 TPYTIITFPFLFAVMFGDLGHGAIMFLFAAWMVMKEKPLM---TLKTDNE-IWKIFFGGRYI-----ILLMGLFSMYTGL 283 (380)
Q Consensus 213 T~~~aitFp~fFG~MfGD~GyGlll~l~~~~l~~~~~k~~---~~~~~~~-~~~~~f~~r~i-----l~~~Gi~si~~G~ 283 (380)
-..+-++..+|.|+.-+=.+||+++++-|+|-.-..+++- +.+.-++ +-.+|.+.-|+ +++.|.+++..|+
T Consensus 70 ~~wI~ifqyvf~~iaa~f~~yG~~il~egF~ttgA~r~~~g~~k~r~cGr~Vs~~f~~lTy~l~f~W~~I~~f~~v~v~i 149 (248)
T KOG4800|consen 70 VSWICIFQYVFYGIAAFFFLYGILILAEGFYTTGAVRKLYGDFKTRMCGRCVSGVFVGLTYLLAFVWLLIFGFSAVPVFI 149 (248)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhHHHHhhhhhhhhhHHHHHhhhhceecCcchhhhhhHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4466778899999999999999999999998765433211 0011122 22224444454 2335777888888
Q ss_pred HhcCC
Q psy510 284 IYNDD 288 (380)
Q Consensus 284 lyg~f 288 (380)
.++=+
T Consensus 150 y~~fw 154 (248)
T KOG4800|consen 150 YFNFW 154 (248)
T ss_pred HHHHH
Confidence 77644
No 21
>KOG2264|consensus
Probab=47.19 E-value=32 Score=36.46 Aligned_cols=54 Identities=15% Similarity=0.092 Sum_probs=25.2
Q ss_pred HHHHHHHcccCceeecCCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 52 TRVMKICEGFRATLYPCPEAPTDRREMSMGVTTRIEDLNTVIGETQDHRHRVLV 105 (380)
Q Consensus 52 ~~v~ki~~~~~~~~~~~p~~~~~~~~~l~~l~~~i~~l~~~i~~~~~~~~~~~~ 105 (380)
.+..+|+.+.+.+.-++....++....+++++.+|+++++.|.+.+.++.++..
T Consensus 82 ~e~~RI~~sVs~EL~ele~krqel~seI~~~n~kiEelk~~i~~~q~eL~~Lk~ 135 (907)
T KOG2264|consen 82 REQKRILASVSLELTELEVKRQELNSEIEEINTKIEELKRLIPQKQLELSALKG 135 (907)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHhHHHHHHHHh
Confidence 345555555544443332222333333555555555555555555554444433
No 22
>PF06570 DUF1129: Protein of unknown function (DUF1129); InterPro: IPR009214 There are currently no experimental data for members of this group or their homologues. However, these proteins contain predicted integral membrane proteins (with several transmembrane segments).
Probab=46.46 E-value=1.8e+02 Score=26.50 Aligned_cols=16 Identities=25% Similarity=0.334 Sum_probs=10.2
Q ss_pred CchhhhHHHHHHHHHh
Q psy510 213 TPYTIITFPFLFAVMF 228 (380)
Q Consensus 213 T~~~aitFp~fFG~Mf 228 (380)
+.++...|.+++|+|.
T Consensus 86 ~L~~~~if~~~~gi~~ 101 (206)
T PF06570_consen 86 SLLFFGIFSLLFGIMG 101 (206)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 4445556777777775
No 23
>COG1730 GIM5 Predicted prefoldin, molecular chaperone implicated in de novo protein folding [Posttranslational modification, protein turnover, chaperones]
Probab=45.69 E-value=1.4e+02 Score=26.07 Aligned_cols=36 Identities=19% Similarity=0.157 Sum_probs=29.5
Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 73 TDRREMSMGVTTRIEDLNTVIGETQDHRHRVLVAAA 108 (380)
Q Consensus 73 ~~~~~~l~~l~~~i~~l~~~i~~~~~~~~~~~~~~~ 108 (380)
.+..++++-++++++++++.++.++..+.++...+.
T Consensus 90 ~~~~eAie~l~k~~~~l~~~~~~l~~~l~~l~~~~~ 125 (145)
T COG1730 90 KSADEAIEFLKKRIEELEKAIEKLQQALAELAQRIE 125 (145)
T ss_pred ecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 467888999999999999999998888877766653
No 24
>PF06103 DUF948: Bacterial protein of unknown function (DUF948); InterPro: IPR009293 This family consists of bacterial sequences several of which are thought to be general stress proteins.
Probab=45.46 E-value=1.5e+02 Score=23.20 Aligned_cols=39 Identities=5% Similarity=0.175 Sum_probs=20.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 75 RREMSMGVTTRIEDLNTVIGETQDHRHRVLVAAAKNIKN 113 (380)
Q Consensus 75 ~~~~l~~l~~~i~~l~~~i~~~~~~~~~~~~~~~~~l~~ 113 (380)
..+.++.++++++++.++..++-.+..++.+.....++.
T Consensus 31 ~~~ti~~l~~~~~~i~~e~~~ll~~~n~l~~dv~~k~~~ 69 (90)
T PF06103_consen 31 VNKTIDTLQEQVDPITKEINDLLHNTNELLEDVNEKLEK 69 (90)
T ss_pred HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 344455555555555555555555555555554443333
No 25
>PF07106 TBPIP: Tat binding protein 1(TBP-1)-interacting protein (TBPIP); InterPro: IPR010776 This family consists of several eukaryotic TBP-1 interacting protein (TBPIP) sequences. TBP-1 has been demonstrated to interact with the human immunodeficiency virus type 1 (HIV-1) viral protein Tat, then modulate the essential replication process of HIV. In addition, TBP-1 has been shown to be a component of the 26S proteasome, a basic multiprotein complex that degrades ubiquitinated proteins in an ATP-dependent fashion. Human TBPIP interacts with human TBP-1 then modulates the inhibitory action of human TBP-1 on HIV-Tat-mediated transactivation [].
Probab=44.03 E-value=2e+02 Score=25.25 Aligned_cols=15 Identities=7% Similarity=0.222 Sum_probs=9.4
Q ss_pred ccccCHHHHHHHHHh
Q psy510 146 VPVLDIDAIQLALRR 160 (380)
Q Consensus 146 vP~~~~~~l~~~l~~ 160 (380)
|+.++.+++.+....
T Consensus 142 vs~ee~~~~~~~~~~ 156 (169)
T PF07106_consen 142 VSPEEKEKLEKEYKK 156 (169)
T ss_pred CCHHHHHHHHHHHHH
Confidence 566667766665544
No 26
>PRK10884 SH3 domain-containing protein; Provisional
Probab=43.80 E-value=1.7e+02 Score=27.01 Aligned_cols=50 Identities=6% Similarity=0.100 Sum_probs=31.2
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 74 DRREMSMGVTTRIEDLNTVIGETQDHRHRVLVAAAKNIKNWFIKVRKIKA 123 (380)
Q Consensus 74 ~~~~~l~~l~~~i~~l~~~i~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~ 123 (380)
...+.+.++++++++++.++.++..+..+...+..+.+......+.+++.
T Consensus 90 ~~~~rlp~le~el~~l~~~l~~~~~~~~~~~~~l~~~~~~~~~~~~~L~~ 139 (206)
T PRK10884 90 SLRTRVPDLENQVKTLTDKLNNIDNTWNQRTAEMQQKVAQSDSVINGLKE 139 (206)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44566778888888888888888777666555555544443333333333
No 27
>COG3301 NrfD Formate-dependent nitrite reductase, membrane component [Inorganic ion transport and metabolism]
Probab=43.60 E-value=64 Score=31.45 Aligned_cols=77 Identities=14% Similarity=0.167 Sum_probs=45.9
Q ss_pred chhhhHHHHHHHHHhhhhhHHHHHHHHHHHHHHhhccc--cccccc--hhHHHhhhhhHHHHHHHhH-HHHHHHHHhcCC
Q psy510 214 PYTIITFPFLFAVMFGDLGHGAIMFLFAAWMVMKEKPL--MTLKTD--NEIWKIFFGGRYIILLMGL-FSMYTGLIYNDD 288 (380)
Q Consensus 214 ~~~aitFp~fFG~MfGD~GyGlll~l~~~~l~~~~~k~--~~~~~~--~~~~~~~f~~r~il~~~Gi-~si~~G~lyg~f 288 (380)
+.+.++|--.+.|-.|=++-|+.++...+|+..-.+++ ++.+.. .+..-..++ .++...|+ .+.|+||+-+.-
T Consensus 79 ~mf~~~~~~sS~ms~Gvml~~~~mLfl~L~la~i~~~l~~r~v~~~~~~~~~~~~~~--iL~~i~gi~~~aYTGflLaa~ 156 (305)
T COG3301 79 LMFNYSFNASSWMSMGVMLFQLYMLFLVLWLAPIFKDLQNRWVPKWGLSEAVHRALE--ILMAILGVLLGAYTGFLLAAL 156 (305)
T ss_pred HHhchhcCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccchhhhhhhHHHHHH--HHHHHHHHHHHHHHHHHHHHh
Confidence 34444555567888899999999988888876644421 111101 111111111 34445553 578999999999
Q ss_pred CCCC
Q psy510 289 RSGS 292 (380)
Q Consensus 289 FG~~ 292 (380)
+|.+
T Consensus 157 ~giP 160 (305)
T COG3301 157 KGIP 160 (305)
T ss_pred cCCC
Confidence 9865
No 28
>PF15290 Syntaphilin: Golgi-localised syntaxin-1-binding clamp
Probab=42.95 E-value=1.6e+02 Score=28.57 Aligned_cols=51 Identities=18% Similarity=0.216 Sum_probs=27.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----------HHHHHHHHHHHHHHHHHHhc
Q psy510 79 SMGVTTRIEDLNTVIGETQDHRHRVLVAAAK----------NIKNWFIKVRKIKAIYHTLN 129 (380)
Q Consensus 79 l~~l~~~i~~l~~~i~~~~~~~~~~~~~~~~----------~l~~~~~~~~~~~~~~~~l~ 129 (380)
+++-++++++-+.+|.+++.++.+..+.|-+ .|++-+..++.++.+.+++.
T Consensus 77 Lkes~~~l~dRetEI~eLksQL~RMrEDWIEEECHRVEAQLALKEARkEIkQLkQvieTmr 137 (305)
T PF15290_consen 77 LKESENRLHDRETEIDELKSQLARMREDWIEEECHRVEAQLALKEARKEIKQLKQVIETMR 137 (305)
T ss_pred HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4444444444455555555555554444432 24455667777777777665
No 29
>PF11381 DUF3185: Protein of unknown function (DUF3185); InterPro: IPR021521 Some members in this bacterial family of proteins are annotated as membrane proteins however this cannot be confirmed. Currently no function is known.
Probab=41.33 E-value=45 Score=24.62 Aligned_cols=49 Identities=14% Similarity=0.137 Sum_probs=26.5
Q ss_pred HHHHHHHHHHHHHhhccccccccchhHHHhhhhh-----HHHHHHHhHHHHHHHHH
Q psy510 234 GAIMFLFAAWMVMKEKPLMTLKTDNEIWKIFFGG-----RYIILLMGLFSMYTGLI 284 (380)
Q Consensus 234 Glll~l~~~~l~~~~~k~~~~~~~~~~~~~~f~~-----r~il~~~Gi~si~~G~l 284 (380)
|++|++.|..+.....+.. .+..+....+|.| ...++..|..+.+.|++
T Consensus 4 gi~Llv~GivLl~~G~~~~--~S~~s~~s~~~TG~~t~~t~~~ligG~va~ivGl~ 57 (59)
T PF11381_consen 4 GIALLVGGIVLLYFGYQAS--DSLGSQVSRAFTGSPTDKTIWYLIGGAVAVIVGLF 57 (59)
T ss_pred eehHHHHHHHHHHhhhhhh--hhHHHHHHHHhcCCCCchhHHHHHhHHHHHHHHHh
Confidence 5667777777665433221 1212222222222 24567789999999975
No 30
>PF05377 FlaC_arch: Flagella accessory protein C (FlaC); InterPro: IPR008039 Although archaeal flagella appear superficially similar to those of bacteria, they are quite distinct []. In several archaea, the flagellin genes are followed immediately by the flagellar accessory genes flaCDEFGHIJ. The gene products may have a role in translocation, secretion, or assembly of the flagellum. FlaC is a protein whose exact role is unknown but it has been shown to be membrane-associated (by immuno-blotting fractionated cells) [].
Probab=41.32 E-value=1e+02 Score=22.42 Aligned_cols=28 Identities=11% Similarity=0.183 Sum_probs=15.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 78 MSMGVTTRIEDLNTVIGETQDHRHRVLV 105 (380)
Q Consensus 78 ~l~~l~~~i~~l~~~i~~~~~~~~~~~~ 105 (380)
.+..++++++++++.++.+++..++++.
T Consensus 15 ~i~tvk~en~~i~~~ve~i~envk~ll~ 42 (55)
T PF05377_consen 15 SINTVKKENEEISESVEKIEENVKDLLS 42 (55)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3555566666666666655555544443
No 31
>PF08112 ATP-synt_E_2: ATP synthase epsilon subunit; InterPro: IPR012508 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include: F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. A-ATPases (or A1A0-ATPase) (3.6.3.14 from EC) are found exclusively in Archaea and display a close resemblance in structure and subunit composition with V-ATPases, although their function in both ATP synthesis and ATP hydrolysis is closer to that of F-ATPases []. A-ATPases are composed of two linked complexes: the A1 complex consisting of seven subunits contains the catalytic core that synthesizes/hydrolyses ATP, while the A0 complex consisting of at least two subunits forms the membrane-spanning pore []. The rotary motor in A-ATPases is composed of only two subunits, the stator subunit I and the rotor subunit C []. A-ATPases may have arisen as an adaptation to the different cellular needs and the more extreme environmental conditions faced by Archaeal species. The epsilon subunit is the smallest (7 kDa) of those found in the A1 complex. Unlike the A, B and C subunits, the epsilon subunit does not have a homologous counterpart in F- or V-ATPases []. More information about this protein can be found at Protein of the Month: ATP Synthases [].; GO: 0016820 hydrolase activity, acting on acid anhydrides, catalyzing transmembrane movement of substances, 0042626 ATPase activity, coupled to transmembrane movement of substances, 0015986 ATP synthesis coupled proton transport, 0033178 proton-transporting two-sector ATPase complex, catalytic domain
Probab=40.67 E-value=1.4e+02 Score=21.52 Aligned_cols=41 Identities=7% Similarity=0.101 Sum_probs=21.9
Q ss_pred hHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHH
Q psy510 74 DRREMSMGVTTRIEDLNTVIG-ETQDHRHRVLVAAAKNIKNW 114 (380)
Q Consensus 74 ~~~~~l~~l~~~i~~l~~~i~-~~~~~~~~~~~~~~~~l~~~ 114 (380)
....-++.++..+++-+++|. .+..+...++..-.+.++..
T Consensus 8 ~~d~yI~~Lk~kLd~Kk~Eil~~ln~EY~kiLk~r~~~lEev 49 (56)
T PF08112_consen 8 TIDKYISILKSKLDEKKSEILSNLNMEYEKILKQRRKELEEV 49 (56)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345556677777776554443 33445555555544444443
No 32
>PF04123 DUF373: Domain of unknown function (DUF373); InterPro: IPR007254 This archaeal family of unknown function is predicted to be an integral membrane protein with six transmembrane regions.
Probab=40.65 E-value=73 Score=31.91 Aligned_cols=86 Identities=22% Similarity=0.412 Sum_probs=48.5
Q ss_pred HHHhcCCCCCcc---CCCchhhhHHHHHHHHHhhhhhHHHHHHHHHHHHHHhhccccccccchhHHHhh----hhhH--H
Q psy510 199 IDAYGVSSYREM---NPTPYTIITFPFLFAVMFGDLGHGAIMFLFAAWMVMKEKPLMTLKTDNEIWKIF----FGGR--Y 269 (380)
Q Consensus 199 v~~Yg~P~Y~Ei---DPT~~~aitFp~fFG~MfGD~GyGlll~l~~~~l~~~~~k~~~~~~~~~~~~~~----f~~r--~ 269 (380)
-+...=|+|... =| -++...|+++--+-.+-.+-+++++++|++++.|...+. +...+.++.+ ..|+ .
T Consensus 150 Kk~l~Dp~~~~~~lGvP-G~~lLiy~i~~l~~~~~~a~~~i~~~iG~yll~kGfgld--~~~~~~~~~~~~~l~~g~it~ 226 (344)
T PF04123_consen 150 KKALSDPEYRRTFLGVP-GLILLIYAILALLGYPAYALGIILLLIGLYLLYKGFGLD--DYLREWLERFRESLYEGRITF 226 (344)
T ss_pred HHhhcChhhhceeecch-HHHHHHHHHHHHHcchHHHHHHHHHHHHHHHHHHhcCcH--HHHHHHHHHhccccccceeeh
Confidence 455556666653 34 345556666655555666669999999999998766553 1223333332 1111 2
Q ss_pred HHHHHhHHHHHHHHHhcC
Q psy510 270 IILLMGLFSMYTGLIYND 287 (380)
Q Consensus 270 il~~~Gi~si~~G~lyg~ 287 (380)
+.-..++..++.|+++|-
T Consensus 227 ityvva~~l~iig~i~g~ 244 (344)
T PF04123_consen 227 ITYVVALLLIIIGIIYGY 244 (344)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 333445555666666553
No 33
>PF11348 DUF3150: Protein of unknown function (DUF3150); InterPro: IPR021496 This bacterial family of proteins with unknown function appears to be restricted to Proteobacteria.
Probab=40.52 E-value=1.7e+02 Score=27.96 Aligned_cols=64 Identities=16% Similarity=0.356 Sum_probs=38.1
Q ss_pred hhhHHHHHHHHcccCceeecCCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 48 DQLKTRVMKICEGFRATLYPCPEAPTDRREMSMGVTTRIEDLNTVIGETQDHRHRVLVAAAKNIKNWFIK 117 (380)
Q Consensus 48 ~~~~~~v~ki~~~~~~~~~~~p~~~~~~~~~l~~l~~~i~~l~~~i~~~~~~~~~~~~~~~~~l~~~~~~ 117 (380)
..++.+.++.|...|..-. +.-.-+.+...++.+++++++.+- .++..+++..+.+.++.|...
T Consensus 55 ~~lk~~A~r~~~~~G~rFl---gG~aVP~~~~~~l~~~L~~i~~eF---~~~k~~Fl~~Yd~~i~~w~~~ 118 (257)
T PF11348_consen 55 SKLKKRAERLCLKVGVRFL---GGYAVPEDKAEELAEELEDIKTEF---EQEKQDFLANYDQAIEEWIDR 118 (257)
T ss_pred HHHHHHHHHHHHHcCCccc---ceeEcCHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHH
Confidence 3478899999988777652 233344555666776666666533 333444555555555555544
No 34
>PF07178 TraL: TraL protein; InterPro: IPR009838 This entry represents bacterial TraL proteins. TraL is a predicted peripheral membrane protein involved in bacterial sex pilus assembly []. TraL is part of the type IV secretion system for conjugative plasmid transfer []. The exact function of TraL is unknown.; GO: 0000746 conjugation, 0019867 outer membrane
Probab=40.39 E-value=39 Score=27.09 Aligned_cols=41 Identities=17% Similarity=0.200 Sum_probs=32.0
Q ss_pred cCCCchhhhHHHHHHHHHhhhhhHHHHHHHHHHHHHHhhcc
Q psy510 210 MNPTPYTIITFPFLFAVMFGDLGHGAIMFLFAAWMVMKEKP 250 (380)
Q Consensus 210 iDPT~~~aitFp~fFG~MfGD~GyGlll~l~~~~l~~~~~k 250 (380)
.+..-++...-++..|++.|=...|+++.++..+.++|.|+
T Consensus 18 ~~~De~~~~~~~~~~gi~~~~~~~g~i~g~~~~~~~~k~K~ 58 (95)
T PF07178_consen 18 WPMDEFIPALILFVIGILSGHFLIGLILGIVLWWGYRKFKK 58 (95)
T ss_pred ecHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHc
Confidence 34444566677889999999999999999988888876654
No 35
>PF02388 FemAB: FemAB family; InterPro: IPR003447 The femAB operon codes for two nearly identical approximately 50kDa proteins involved in the formation of the Staphylococcal pentaglycine interpeptide bridge in peptidoglycan []. These proteins are also considered as a factor influencing the level of methicillin resistance [].; GO: 0016755 transferase activity, transferring amino-acyl groups; PDB: 1XE4_A 1NE9_A 3GKR_A 1XIX_A 1P4N_A 1XF8_A 1LRZ_A.
Probab=37.77 E-value=2.5e+02 Score=28.61 Aligned_cols=49 Identities=10% Similarity=0.253 Sum_probs=26.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 76 REMSMGVTTRIEDLNTVIGETQDH------RHRVLVAAAKNIKNWFIKVRKIKAI 124 (380)
Q Consensus 76 ~~~l~~l~~~i~~l~~~i~~~~~~------~~~~~~~~~~~l~~~~~~~~~~~~~ 124 (380)
.+.++.++++++++++++++++++ ....+.+..+.+..+..++.+.++.
T Consensus 241 ~~~~~~l~~~~~~~~~~i~~l~~~l~~~~k~~~k~~~~~~q~~~~~k~~~~~~~~ 295 (406)
T PF02388_consen 241 KEYLESLQEKLEKLEKEIEKLEEKLEKNPKKKNKLKELEEQLASLEKRIEEAEEL 295 (406)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHH-THHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCcchhhHHHHHHHHHHHHHHHHHHHHHH
Confidence 455666677777776666666554 2333344444455555555544443
No 36
>PF11068 YlqD: YlqD protein; InterPro: IPR021297 This bacterial family of proteins has no known function. ; PDB: 4DCI_C.
Probab=37.70 E-value=2.6e+02 Score=23.94 Aligned_cols=36 Identities=11% Similarity=0.196 Sum_probs=27.4
Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 73 TDRREMSMGVTTRIEDLNTVIGETQDHRHRVLVAAA 108 (380)
Q Consensus 73 ~~~~~~l~~l~~~i~~l~~~i~~~~~~~~~~~~~~~ 108 (380)
....+..+++++++..++.++.+++-+.++...+..
T Consensus 16 ~~K~~l~~~l~~~i~~~d~el~QLefq~kr~~~e~~ 51 (131)
T PF11068_consen 16 KWKEELLQELQEQIQQLDQELQQLEFQGKRMIKEIK 51 (131)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 445667778888888888888888877777777754
No 37
>PRK00888 ftsB cell division protein FtsB; Reviewed
Probab=37.69 E-value=2.3e+02 Score=23.21 Aligned_cols=29 Identities=3% Similarity=0.095 Sum_probs=19.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 79 SMGVTTRIEDLNTVIGETQDHRHRVLVAA 107 (380)
Q Consensus 79 l~~l~~~i~~l~~~i~~~~~~~~~~~~~~ 107 (380)
+.+++++++++++++++.+++..++..++
T Consensus 29 ~~~l~~q~~~~~~e~~~l~~~n~~L~~eI 57 (105)
T PRK00888 29 YWRVNDQVAAQQQTNAKLKARNDQLFAEI 57 (105)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45677777777777777777766665554
No 38
>PRK14158 heat shock protein GrpE; Provisional
Probab=37.62 E-value=1.6e+02 Score=27.07 Aligned_cols=34 Identities=6% Similarity=-0.036 Sum_probs=22.7
Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 73 TDRREMSMGVTTRIEDLNTVIGETQDHRHRVLVA 106 (380)
Q Consensus 73 ~~~~~~l~~l~~~i~~l~~~i~~~~~~~~~~~~~ 106 (380)
....+.+..++++++++++++.+.++++..+..+
T Consensus 36 ~~~~~~~~~le~~l~~le~e~~el~d~~lR~~Ae 69 (194)
T PRK14158 36 VAAADRIKELEEALAAKEAEAAANWDKYLRERAD 69 (194)
T ss_pred CCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4445567778888888888877776665444333
No 39
>PF11833 DUF3353: Protein of unknown function (DUF3353); InterPro: IPR021788 This family of proteins are functionally uncharacterised. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 205 to 258 amino acids in length.
Probab=36.87 E-value=2.3e+02 Score=25.91 Aligned_cols=55 Identities=16% Similarity=0.263 Sum_probs=31.1
Q ss_pred HHHHh-hhhhHHHHHHHH---HHHHHHhhccccccccchhHHHhhhhhHHHHHHHhHHHHHHHHHhcCCCCCCCC
Q psy510 224 FAVMF-GDLGHGAIMFLF---AAWMVMKEKPLMTLKTDNEIWKIFFGGRYIILLMGLFSMYTGLIYNDDRSGSSV 294 (380)
Q Consensus 224 FG~Mf-GD~GyGlll~l~---~~~l~~~~~k~~~~~~~~~~~~~~f~~r~il~~~Gi~si~~G~lyg~fFG~~l~ 294 (380)
..+++ +|-|-++.|.+. ++|++.+ | .++.++. ++.|+.+.+.|++.|+.....+.
T Consensus 107 ~~~~~~~~~~~~l~Lal~~~~~iyfl~~--K------~~~~~rA--------~~~~~~~L~~G~~lGs~l~~~l~ 165 (194)
T PF11833_consen 107 WSLLFPAASGPGLQLALGLGACIYFLNR--K------ERKLGRA--------FLWTLGGLVVGLILGSLLASWLP 165 (194)
T ss_pred HHHHHcCCCCcchHHHHHHHHHHHHHHH--h------cchHHHH--------HHHHHHHHHHHHHHHHHHHhhcc
Confidence 34445 677777776554 2333322 1 1334442 35566666888888888865544
No 40
>PRK15035 cytochrome bd-II oxidase subunit 1; Provisional
Probab=35.62 E-value=1.4e+02 Score=31.48 Aligned_cols=66 Identities=15% Similarity=0.192 Sum_probs=36.9
Q ss_pred chhhhHHHHHHHHHhhhhhHHHHHHHHHHHHHHhhccccccccchhHHHhhhhhHHHHHHHhHHHHHHHHHhcCC
Q psy510 214 PYTIITFPFLFAVMFGDLGHGAIMFLFAAWMVMKEKPLMTLKTDNEIWKIFFGGRYIILLMGLFSMYTGLIYNDD 288 (380)
Q Consensus 214 ~~~aitFp~fFG~MfGD~GyGlll~l~~~~l~~~~~k~~~~~~~~~~~~~~f~~r~il~~~Gi~si~~G~lyg~f 288 (380)
|-+++.| .-|=+|.|=-.--+++.++|+++.+| +++. ..+..++.. -+...++.-++.+|-+.-|.
T Consensus 381 PpV~~~f-~sFriMVglG~l~~ll~~~~l~~~~r-~~l~---~~rw~L~~~----~~~~Plp~iA~~~GWi~tE~ 446 (514)
T PRK15035 381 PQVAPVF-WSFRIMVGCGSLLLLVMLIALVQTLR-GKID---QHRWVLKMA----LWSLPLPWIAIEAGWFMTEF 446 (514)
T ss_pred CCCchhH-HHHHHHHHHHHHHHHHHHHHHHHHHc-CCcc---cCcHHHHHH----HHHHHHHHHHHHHhHHHccc
Confidence 4455666 56778876655556777777777654 3332 112222211 13344566677777777664
No 41
>PRK15003 cytochrome d ubiquinol oxidase subunit 2; Provisional
Probab=35.22 E-value=1.3e+02 Score=30.51 Aligned_cols=71 Identities=10% Similarity=0.135 Sum_probs=42.7
Q ss_pred hHHHHHHHHHhhhhhHHHHHHHHHHHHHHhhcccccc---ccchhHHHhhhhhHHHHHHHhHHHHHHHHHhcCCC-CCCC
Q psy510 218 ITFPFLFAVMFGDLGHGAIMFLFAAWMVMKEKPLMTL---KTDNEIWKIFFGGRYIILLMGLFSMYTGLIYNDDR-SGSS 293 (380)
Q Consensus 218 itFp~fFG~MfGD~GyGlll~l~~~~l~~~~~k~~~~---~~~~~~~~~~f~~r~il~~~Gi~si~~G~lyg~fF-G~~l 293 (380)
--||..+|.+|+-.==-+++.|+|+.+.--....+.+ +.+++.|+..| ..-.=+.++.+|.+.|.+. |.++
T Consensus 73 aAFP~~Ya~lfs~lYlpl~l~L~gLI~RgvafEfR~k~~~~~wr~~Wd~~f-----~igSll~~f~~Gv~lg~~v~G~p~ 147 (379)
T PRK15003 73 AAWPMVYAAAFSGFYVAMILVLASLFFRPVGFDYRSKIEETRWRNMWDWGI-----FIGSFVPPLVIGVAFGNLLQGVPF 147 (379)
T ss_pred HhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhccCCChhhHHHHHHHH-----HHHHHHHHHHHHHHHHHHhcCccc
Confidence 3699999999998877778888888765322222221 12344555422 2223356677788777776 5444
No 42
>PRK14151 heat shock protein GrpE; Provisional
Probab=33.88 E-value=1.2e+02 Score=27.24 Aligned_cols=23 Identities=17% Similarity=0.411 Sum_probs=12.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Q psy510 78 MSMGVTTRIEDLNTVIGETQDHR 100 (380)
Q Consensus 78 ~l~~l~~~i~~l~~~i~~~~~~~ 100 (380)
....++++++++++++.+.++++
T Consensus 21 ~~~~l~~~i~~le~e~~el~d~~ 43 (176)
T PRK14151 21 AGDDLTARVQELEEQLAAAKDQS 43 (176)
T ss_pred chhhHHHHHHHHHHHHHHHHHHH
Confidence 34455556666666665555443
No 43
>PF06810 Phage_GP20: Phage minor structural protein GP20; InterPro: IPR009636 This family consists of several phage minor structural protein Gp20 sequences and prophage sequences of around 180 residues in length. The function of this family is unknown.; GO: 0005198 structural molecule activity
Probab=33.02 E-value=1.5e+02 Score=26.09 Aligned_cols=45 Identities=11% Similarity=0.114 Sum_probs=19.6
Q ss_pred HHHHHHcccCceeecCCCCcchHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 53 RVMKICEGFRATLYPCPEAPTDRREMSMGVTTRIEDLNTVIGETQ 97 (380)
Q Consensus 53 ~v~ki~~~~~~~~~~~p~~~~~~~~~l~~l~~~i~~l~~~i~~~~ 97 (380)
.+.+|+...|...-..........+.+..++.++.+..+.|+.++
T Consensus 3 qi~~Im~~~gk~i~~~K~~~~~~~~e~~~~k~ql~~~d~~i~~Lk 47 (155)
T PF06810_consen 3 QIDKIMAENGKDIEAPKAKVDKVKEERDNLKTQLKEADKQIKDLK 47 (155)
T ss_pred HHHHHHHHccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 456666555554432211122223334444444444444444443
No 44
>COG3352 FlaC Putative archaeal flagellar protein C [Cell motility and secretion]
Probab=32.94 E-value=3.5e+02 Score=23.92 Aligned_cols=78 Identities=13% Similarity=0.117 Sum_probs=38.2
Q ss_pred hHHHHHHHHcccCceeecCCCC---cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHH
Q psy510 50 LKTRVMKICEGFRATLYPCPEA---PTDRREMSMGVTTRIEDLNTVIGETQDHRHRVLVAAAK----NIKNWFIKVRKIK 122 (380)
Q Consensus 50 ~~~~v~ki~~~~~~~~~~~p~~---~~~~~~~l~~l~~~i~~l~~~i~~~~~~~~~~~~~~~~----~l~~~~~~~~~~~ 122 (380)
..++..+|=+.++--..++.+. .+...+.+++++.+++++....+...+++.-+..+..+ .++.+..++.+++
T Consensus 49 imer~~~ieNdlg~~~~~~~g~kk~~~~~~eelerLe~~iKdl~~lye~Vs~d~Npf~s~~~qes~~~veel~eqV~el~ 128 (157)
T COG3352 49 IMERMTDIENDLGKVKIEIEGQKKQLQDIKEELERLEENIKDLVSLYELVSRDFNPFMSKTPQESRGIVEELEEQVNELK 128 (157)
T ss_pred HHHHHHHHHhhcccccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHhhhHHHHHHHHHHHHHHHHHHH
Confidence 3444444444444433333322 23344445555556655555555554444333333322 2455666777777
Q ss_pred HHHHH
Q psy510 123 AIYHT 127 (380)
Q Consensus 123 ~~~~~ 127 (380)
.+++.
T Consensus 129 ~i~em 133 (157)
T COG3352 129 MIVEM 133 (157)
T ss_pred HHHHH
Confidence 77663
No 45
>PF15079 DUF4546: Domain of unknown function (DUF4546)
Probab=32.50 E-value=2e+02 Score=25.85 Aligned_cols=23 Identities=4% Similarity=0.267 Sum_probs=9.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Q psy510 81 GVTTRIEDLNTVIGETQDHRHRV 103 (380)
Q Consensus 81 ~l~~~i~~l~~~i~~~~~~~~~~ 103 (380)
+++.++.|.++++++--+++.++
T Consensus 51 eLkNeLREVREELkEKmeEIKQI 73 (205)
T PF15079_consen 51 ELKNELREVREELKEKMEEIKQI 73 (205)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444444444444433333333
No 46
>cd00632 Prefoldin_beta Prefoldin beta; Prefoldin is a hexameric molecular chaperone complex, composed of two evolutionarily related subunits (alpha and beta), which are found in both eukaryotes and archaea. Prefoldin binds and stabilizes newly synthesized polypeptides allowing them to fold correctly. The hexameric structure consists of a double beta barrel assembly with six protruding coiled-coils. The alpha prefoldin subunits have two beta hairpin structures while the beta prefoldin subunits (this CD) have only one hairpin that is most similar to the second hairpin of the alpha subunit. The prefoldin hexamer consists of two alpha and four beta subunits and is assembled from the beta hairpins of all six subunits. The alpha subunits initially dimerize providing a structural nucleus for the assembly of the beta subunits. In archaea, there is usually only one gene for each subunit while in eukaryotes there two or more paralogous genes encoding each subunit adding heterogeneity to the st
Probab=31.87 E-value=2.5e+02 Score=22.63 Aligned_cols=34 Identities=12% Similarity=0.231 Sum_probs=24.5
Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 73 TDRREMSMGVTTRIEDLNTVIGETQDHRHRVLVA 106 (380)
Q Consensus 73 ~~~~~~l~~l~~~i~~l~~~i~~~~~~~~~~~~~ 106 (380)
....++...++++++.++..++.+..++..+-..
T Consensus 59 ~~~~ea~~~Le~~~e~le~~i~~l~~~~~~l~~~ 92 (105)
T cd00632 59 QEKEEARTELKERLETIELRIKRLERQEEDLQEK 92 (105)
T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3457777888888888888888877776555444
No 47
>PRK11677 hypothetical protein; Provisional
Probab=31.65 E-value=2.5e+02 Score=24.21 Aligned_cols=46 Identities=9% Similarity=0.192 Sum_probs=26.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy510 80 MGVTTRIEDLNTVIGETQDHRHRVLVAAAKNIKNWFIKVRKIKAIYHTL 128 (380)
Q Consensus 80 ~~l~~~i~~l~~~i~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~l 128 (380)
.++++++++.+.++++.++++.+-...-.+.+..+. +..+.+|+.|
T Consensus 32 ~~le~eLe~~k~ele~YkqeV~~HFa~TA~Ll~~L~---~~Y~~Ly~Hl 77 (134)
T PRK11677 32 QALQYELEKNKAELEEYRQELVSHFARSAELLDTMA---KDYRQLYQHM 77 (134)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHH
Confidence 467777777777777777666665555444333322 3334444444
No 48
>PF02994 Transposase_22: L1 transposable element; InterPro: IPR004244 Many human L1 elements are capable of retrotransposition. Some of these have been shown to exhibit reverse transcriptase (RT) activity [] although the function of many are, as yet, unknown. More information about these proteins can be found at Protein of the Month: Transposase [].; PDB: 2LDY_A 3SOO_A 2YKQ_A 2YKO_C 2YKP_B 2W7A_B 2JRB_A.
Probab=31.58 E-value=4.6e+02 Score=26.43 Aligned_cols=27 Identities=26% Similarity=0.401 Sum_probs=14.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 80 MGVTTRIEDLNTVIGETQDHRHRVLVA 106 (380)
Q Consensus 80 ~~l~~~i~~l~~~i~~~~~~~~~~~~~ 106 (380)
..+..+++++++.+.++++.+.++...
T Consensus 140 ~~l~~Ri~e~Eeris~lEd~~~~i~~~ 166 (370)
T PF02994_consen 140 ESLNSRIDELEERISELEDRIEEIEQA 166 (370)
T ss_dssp ----HHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHhHHHHHHHHHHHHhhH
Confidence 345667777777777766666555443
No 49
>PRK13729 conjugal transfer pilus assembly protein TraB; Provisional
Probab=31.01 E-value=1.9e+02 Score=30.25 Aligned_cols=45 Identities=4% Similarity=0.003 Sum_probs=20.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHH
Q psy510 78 MSMGVTTRIEDLNTVIGETQDHRH---RVLVAAAKNIKNWFIKVRKIK 122 (380)
Q Consensus 78 ~l~~l~~~i~~l~~~i~~~~~~~~---~~~~~~~~~l~~~~~~~~~~~ 122 (380)
++.+.+.+.+++++++++++.++. +......+.|+.+...+..++
T Consensus 70 ALteqQ~kasELEKqLaaLrqElq~~saq~~dle~KIkeLEaE~~~Lk 117 (475)
T PRK13729 70 ATTEMQVTAAQMQKQYEEIRRELDVLNKQRGDDQRRIEKLGQDNAALA 117 (475)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHH
Confidence 444555555555555555554443 223333333444444444333
No 50
>COG1030 NfeD Membrane-bound serine protease (ClpP class) [Posttranslational modification, protein turnover, chaperones]
Probab=30.59 E-value=1.3e+02 Score=31.09 Aligned_cols=67 Identities=19% Similarity=0.277 Sum_probs=41.5
Q ss_pred CCCCccCCCchhhhH-HHHHHHHHhhhhh-HHHHHHHHHHHHHHhhccccccccchhHHHhhhhhHHHHHHHhHHHHHHH
Q psy510 205 SSYREMNPTPYTIIT-FPFLFAVMFGDLG-HGAIMFLFAAWMVMKEKPLMTLKTDNEIWKIFFGGRYIILLMGLFSMYTG 282 (380)
Q Consensus 205 P~Y~EiDPT~~~ait-Fp~fFG~MfGD~G-yGlll~l~~~~l~~~~~k~~~~~~~~~~~~~~f~~r~il~~~Gi~si~~G 282 (380)
=+.+-.-|+.+-++. -..|+|.-+.+.. +|++|+++|+.++.-+-... -| .++-+.|+.+++.|
T Consensus 253 ~spG~~vpg~iG~i~LlL~f~g~~~~~~~~~gllLiilG~iLiv~E~~~p-----------~f---Gvigl~Gii~~iiG 318 (436)
T COG1030 253 LSPGFGVPGIIGAILLLLGFYGLLFLGINWAGLLLIILGAILIVAEAFVP-----------GF---GVIGLLGIILFIIG 318 (436)
T ss_pred hccCccchHHHHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHhcc-----------cc---hHHHHHHHHHHHHh
Confidence 334445555554333 3345666666555 88999999998876542221 01 26778899999988
Q ss_pred HHh
Q psy510 283 LIY 285 (380)
Q Consensus 283 ~ly 285 (380)
.+.
T Consensus 319 ~~~ 321 (436)
T COG1030 319 LLL 321 (436)
T ss_pred hhh
Confidence 753
No 51
>PF14389 Lzipper-MIP1: Leucine-zipper of ternary complex factor MIP1
Probab=30.34 E-value=2.2e+02 Score=22.57 Aligned_cols=52 Identities=10% Similarity=0.142 Sum_probs=29.9
Q ss_pred HHHHHHHc-ccCceeecCCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 52 TRVMKICE-GFRATLYPCPEAPTDRREMSMGVTTRIEDLNTVIGETQDHRHRV 103 (380)
Q Consensus 52 ~~v~ki~~-~~~~~~~~~p~~~~~~~~~l~~l~~~i~~l~~~i~~~~~~~~~~ 103 (380)
..++.++. +++....+.+..+.......+++-.+|.-++.++..++.++..+
T Consensus 28 ~~~r~aLe~al~~~~~~~~~~~~~lp~~~keLL~EIA~lE~eV~~LE~~v~~L 80 (88)
T PF14389_consen 28 QDLRRALEKALGRSSGSLPSSPSSLPKKAKELLEEIALLEAEVAKLEQKVLSL 80 (88)
T ss_pred HHHHHHHHHHhCCCCcccCCccccCChHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45666664 55555555544444444555666666666777666666554444
No 52
>PRK14154 heat shock protein GrpE; Provisional
Probab=29.98 E-value=1.4e+02 Score=27.64 Aligned_cols=25 Identities=8% Similarity=0.158 Sum_probs=12.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 79 SMGVTTRIEDLNTVIGETQDHRHRV 103 (380)
Q Consensus 79 l~~l~~~i~~l~~~i~~~~~~~~~~ 103 (380)
+..++++++++++++.+.++++..+
T Consensus 54 ~~~l~~el~~le~e~~elkd~~lRl 78 (208)
T PRK14154 54 REKLEGQLTRMERKVDEYKTQYLRA 78 (208)
T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHH
Confidence 3445555555555555554444333
No 53
>TIGR01069 mutS2 MutS2 family protein. Function of MutS2 is unknown. It should not be considered a DNA mismatch repair protein. It is likely a DNA mismatch binding protein of unknown cellular function.
Probab=29.83 E-value=1e+02 Score=34.40 Aligned_cols=9 Identities=0% Similarity=0.010 Sum_probs=3.5
Q ss_pred HHHHHHHHH
Q psy510 76 REMSMGVTT 84 (380)
Q Consensus 76 ~~~l~~l~~ 84 (380)
.+.++++++
T Consensus 514 ~~li~~L~~ 522 (771)
T TIGR01069 514 NVLIEKLSA 522 (771)
T ss_pred HHHHHHHHH
Confidence 333344433
No 54
>PF10947 DUF2628: Protein of unknown function (DUF2628) ; InterPro: IPR024399 Some members in this family of proteins have been annotated as YigF. Their function is currently unknown.
Probab=29.50 E-value=2.9e+02 Score=22.23 Aligned_cols=31 Identities=10% Similarity=0.001 Sum_probs=23.0
Q ss_pred hhhhHHHHHHHHHhhhhhHHHHHHHHHHHHH
Q psy510 215 YTIITFPFLFAVMFGDLGHGAIMFLFAAWMV 245 (380)
Q Consensus 215 ~~aitFp~fFG~MfGD~GyGlll~l~~~~l~ 245 (380)
+.|.+|+.+....=+...+|++++++.+.+.
T Consensus 40 w~Af~f~~~w~l~r~mw~~~~~~~~~~~~~~ 70 (108)
T PF10947_consen 40 WWAFFFGPLWLLYRKMWLYAIIFLALLVALA 70 (108)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5677778888888888899977666655553
No 55
>PF06295 DUF1043: Protein of unknown function (DUF1043); InterPro: IPR009386 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=29.49 E-value=2.8e+02 Score=23.43 Aligned_cols=47 Identities=11% Similarity=0.242 Sum_probs=26.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy510 80 MGVTTRIEDLNTVIGETQDHRHRVLVAAAKNIKNWFIKVRKIKAIYHTLN 129 (380)
Q Consensus 80 ~~l~~~i~~l~~~i~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~l~ 129 (380)
.++++++++.+.++++.+.++.+-.....+.+..+ ....+.+|+.|.
T Consensus 28 ~~l~~eL~~~k~el~~yk~~V~~HF~~ta~Ll~~l---~~~Y~~l~~Hla 74 (128)
T PF06295_consen 28 AKLEQELEQAKQELEQYKQEVNDHFAQTAELLDNL---TQDYQKLYQHLA 74 (128)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHH
Confidence 46777777777777777766666555544333322 233445555443
No 56
>PHA03029 hypothetical protein; Provisional
Probab=29.43 E-value=56 Score=25.26 Aligned_cols=17 Identities=29% Similarity=0.780 Sum_probs=14.4
Q ss_pred HHHHHHhHHHHHHHHHh
Q psy510 269 YIILLMGLFSMYTGLIY 285 (380)
Q Consensus 269 ~il~~~Gi~si~~G~ly 285 (380)
||++++++-.++||++-
T Consensus 16 yiilila~igiiwg~ll 32 (92)
T PHA03029 16 YIILILAIIGIIWGFLL 32 (92)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 67888888889999874
No 57
>PF06698 DUF1192: Protein of unknown function (DUF1192); InterPro: IPR009579 This family consists of several short, hypothetical, bacterial proteins of around 60 residues in length. The function of this family is unknown.
Probab=29.07 E-value=1.1e+02 Score=22.63 Aligned_cols=25 Identities=16% Similarity=0.110 Sum_probs=18.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 79 SMGVTTRIEDLNTVIGETQDHRHRV 103 (380)
Q Consensus 79 l~~l~~~i~~l~~~i~~~~~~~~~~ 103 (380)
+.+|+++|..|+.+|..++.++.+.
T Consensus 23 v~EL~~RIa~L~aEI~R~~~~~~~K 47 (59)
T PF06698_consen 23 VEELEERIALLEAEIARLEAAIAKK 47 (59)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4578888888888887777665543
No 58
>PRK14143 heat shock protein GrpE; Provisional
Probab=29.02 E-value=2.5e+02 Score=26.68 Aligned_cols=30 Identities=13% Similarity=0.183 Sum_probs=19.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 75 RREMSMGVTTRIEDLNTVIGETQDHRHRVL 104 (380)
Q Consensus 75 ~~~~l~~l~~~i~~l~~~i~~~~~~~~~~~ 104 (380)
..+.+..++.+++++++++++.++++.++.
T Consensus 65 ~~~~~~~l~~el~~l~~e~~elkd~~lR~~ 94 (238)
T PRK14143 65 NAARLAQLEQELESLKQELEELNSQYMRIA 94 (238)
T ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 444566777777777777777665554443
No 59
>PRK14140 heat shock protein GrpE; Provisional
Probab=28.85 E-value=2.4e+02 Score=25.79 Aligned_cols=32 Identities=16% Similarity=0.246 Sum_probs=20.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 75 RREMSMGVTTRIEDLNTVIGETQDHRHRVLVA 106 (380)
Q Consensus 75 ~~~~l~~l~~~i~~l~~~i~~~~~~~~~~~~~ 106 (380)
..+.+++++++++++++++.+.++.+.....+
T Consensus 35 ~~~~~~~l~~~i~~l~~ei~elkd~~lR~~Ae 66 (191)
T PRK14140 35 EAELLDEEQAKIAELEAKLDELEERYLRLQAD 66 (191)
T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34456667777777777777766655444443
No 60
>KOG2264|consensus
Probab=28.81 E-value=2.6e+02 Score=29.97 Aligned_cols=50 Identities=8% Similarity=0.032 Sum_probs=24.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHH
Q psy510 76 REMSMGVTTRIEDLNTVIGETQDHRHRVLVA---AAKNIKNWFIKVRKIKAIY 125 (380)
Q Consensus 76 ~~~l~~l~~~i~~l~~~i~~~~~~~~~~~~~---~~~~l~~~~~~~~~~~~~~ 125 (380)
...+.+++.+.++++.+|+++..+++++.+. .+.+|..+...++.....+
T Consensus 92 s~EL~ele~krqel~seI~~~n~kiEelk~~i~~~q~eL~~Lk~~ieqaq~~~ 144 (907)
T KOG2264|consen 92 SLELTELEVKRQELNSEIEEINTKIEELKRLIPQKQLELSALKGEIEQAQRQL 144 (907)
T ss_pred HHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHhHHHHHHHHhHHHHHHHHH
Confidence 3344455555555555555554444433333 2345566665554444333
No 61
>PF10315 DUF2416: Protein of unknown function (DUF2416); InterPro: IPR019419 This entry represents conserved proteins with unknown function and is restricted to fungi.
Probab=27.97 E-value=1.2e+02 Score=25.27 Aligned_cols=73 Identities=21% Similarity=0.272 Sum_probs=49.1
Q ss_pred CccCCCchhhhHHHHHHHH---Hhhh----hhHHHHHHHHHHHHHHhhccccccccchhHHHhhhhhHHHHHHHhHHHHH
Q psy510 208 REMNPTPYTIITFPFLFAV---MFGD----LGHGAIMFLFAAWMVMKEKPLMTLKTDNEIWKIFFGGRYIILLMGLFSMY 280 (380)
Q Consensus 208 ~EiDPT~~~aitFp~fFG~---MfGD----~GyGlll~l~~~~l~~~~~k~~~~~~~~~~~~~~f~~r~il~~~Gi~si~ 280 (380)
.-+-||.-.+.+|...|++ |.-| =|-|++-.=-.+|++...|+ .++.++.||.--+.++..+..
T Consensus 28 ~~~~Pt~~~~llFg~a~~lg~ymi~dGD~~NGsGf~~AWS~lYLivngr~---------siksl~~gr~~PL~Ls~~a~~ 98 (108)
T PF10315_consen 28 RYIGPTRKSCLLFGAAFALGGYMIYDGDLENGSGFVTAWSTLYLIVNGRK---------SIKSLRRGRVWPLALSGLALG 98 (108)
T ss_pred CccCCchhHhHHHHHHHHhhhheeecCCccccchHHHHHHHHHHHHcChh---------hhHHhccCCchHHHHHHHHHh
Confidence 5577888777777776664 5544 36788888888888765432 233334455556677777888
Q ss_pred HHHHhcCCC
Q psy510 281 TGLIYNDDR 289 (380)
Q Consensus 281 ~G~lyg~fF 289 (380)
.+.+||.-|
T Consensus 99 na~lYG~~f 107 (108)
T PF10315_consen 99 NAGLYGRRF 107 (108)
T ss_pred chhheeeec
Confidence 889998654
No 62
>PF04111 APG6: Autophagy protein Apg6; InterPro: IPR007243 Macroautophagy is a bulk degradation process induced by starvation in eukaryotic cells. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. No molecule involved in autophagy has yet been identified in higher eukaryotes []. The pre-autophagosomal structure contains at least five Apg proteins: Apg1p, Apg2p, Apg5p, Aut7p/Apg8p and Apg16p. It is found in the vacuole []. The C-terminal glycine of Apg12p is conjugated to a lysine residue of Apg5p via an isopeptide bond. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. Auotphagy protein 16 (Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate []. Autophagy protein 5 (Apg5) is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway []. Apg6/Vps30p has two distinct functions in the autophagic process, either associated with the membrane or in a retrieval step of the carboxypeptidase Y sorting pathway [].; GO: 0006914 autophagy; PDB: 3Q8T_A 3VP7_A 4DDP_A.
Probab=27.88 E-value=3.9e+02 Score=26.29 Aligned_cols=15 Identities=20% Similarity=0.204 Sum_probs=5.5
Q ss_pred HHHHHHHHHHHHHHH
Q psy510 82 VTTRIEDLNTVIGET 96 (380)
Q Consensus 82 l~~~i~~l~~~i~~~ 96 (380)
++++.+++.++++++
T Consensus 55 le~Ee~~l~~eL~~L 69 (314)
T PF04111_consen 55 LEQEEEELLQELEEL 69 (314)
T ss_dssp HHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHH
Confidence 333333333333333
No 63
>TIGR02338 gimC_beta prefoldin, beta subunit, archaeal. Chaperonins are cytosolic, ATP-dependent molecular chaperones, with a conserved toroidal architecture, that assist in the folding of nascent and/or denatured polypeptide chains. The group I chaperonin system consists of GroEL and GroES, and is found (usually) in bacteria and organelles of bacterial origin. The group II chaperonin system, called the thermosome in Archaea and TRiC or CCT in the Eukaryota, is structurally similar but only distantly related. Prefoldin, also called GimC, is a complex in Archaea and Eukaryota, that works with group II chaperonins. Members of this protein family are the archaeal clade of the beta class of prefoldin subunit. Closely related, but outside the scope of this family are the eukaryotic beta-class prefoldin subunits, Gim-1,3,4 and 6. The alpha class prefoldin subunits are more distantly related.
Probab=27.52 E-value=3.3e+02 Score=22.19 Aligned_cols=35 Identities=20% Similarity=0.223 Sum_probs=26.1
Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 73 TDRREMSMGVTTRIEDLNTVIGETQDHRHRVLVAA 107 (380)
Q Consensus 73 ~~~~~~l~~l~~~i~~l~~~i~~~~~~~~~~~~~~ 107 (380)
....+++.+++.+++.++..++.++++...+-+.+
T Consensus 63 ~~~~e~~~~l~~r~e~ie~~i~~lek~~~~l~~~l 97 (110)
T TIGR02338 63 TDKEEAIQELKEKKETLELRVKTLQRQEERLREQL 97 (110)
T ss_pred ecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45677888899999988888888877765554443
No 64
>PRK15097 cytochrome d terminal oxidase subunit 1; Provisional
Probab=27.51 E-value=2.6e+02 Score=29.73 Aligned_cols=66 Identities=11% Similarity=0.160 Sum_probs=36.0
Q ss_pred chhhhHHHHHHHHHhhhhhHHHHHHHHHHHHHHhhccccccccchhHHHhhhhhHHHHHHHhHHHHHHHHHhcCC
Q psy510 214 PYTIITFPFLFAVMFGDLGHGAIMFLFAAWMVMKEKPLMTLKTDNEIWKIFFGGRYIILLMGLFSMYTGLIYNDD 288 (380)
Q Consensus 214 ~~~aitFp~fFG~MfGD~GyGlll~l~~~~l~~~~~k~~~~~~~~~~~~~~f~~r~il~~~Gi~si~~G~lyg~f 288 (380)
|-+++.| .-|=+|.|=-.--+++.++++++.+| +++. ..+..++.. -+...++.-++.+|-+.-|.
T Consensus 381 PpV~~vf-~sFriMVg~G~l~~~l~~~~l~l~~r-~~l~---~~rw~L~~~----~~~~plp~iA~~~GWi~tEv 446 (522)
T PRK15097 381 PRVAPLY-FAFRIMVACGFLMLAIIALSFWSVIR-NRIG---EKKWLLRAA----LYGIPLPWIAVEAGWFVAEY 446 (522)
T ss_pred CCCchhH-HHHHHHHHHHHHHHHHHHHHHHHHHc-Cccc---cCcHHHHHH----HHHHHHHHHHHHhhhhheec
Confidence 4455555 46778876554555677777777654 3332 112222211 13344567777777777665
No 65
>PF06156 DUF972: Protein of unknown function (DUF972); InterPro: IPR010377 FUNCTION: Involved in initiation control of chromosome replication. SUBUNIT: Interacts with both DnaA and DnaN, acting as a bridge between these two proteins. SIMILARITY: Belongs to the YabA family.
Probab=27.22 E-value=3.6e+02 Score=22.25 Aligned_cols=35 Identities=11% Similarity=0.099 Sum_probs=26.3
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 74 DRREMSMGVTTRIEDLNTVIGETQDHRHRVLVAAA 108 (380)
Q Consensus 74 ~~~~~l~~l~~~i~~l~~~i~~~~~~~~~~~~~~~ 108 (380)
..-+.+.++++++.++-..+.++++.+.+++++-.
T Consensus 5 ~l~~~l~~le~~l~~l~~~~~~LK~~~~~l~EEN~ 39 (107)
T PF06156_consen 5 ELFDRLDQLEQQLGQLLEELEELKKQLQELLEENA 39 (107)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34566778888888888888888888877777653
No 66
>KOG4348|consensus
Probab=27.15 E-value=2e+02 Score=29.72 Aligned_cols=47 Identities=11% Similarity=0.171 Sum_probs=27.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy510 76 REMSMGVTTRIEDLNTVIGETQDHRHRVLVAAAKNIKNWFIKVRKIKAIYHTLN 129 (380)
Q Consensus 76 ~~~l~~l~~~i~~l~~~i~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~l~ 129 (380)
...+.++..|+.|+.-.++.+++ +..++|++++..++.++.+...|.
T Consensus 568 k~s~delr~qi~el~~ive~lk~-------~~~kel~kl~~dleeek~mr~~le 614 (627)
T KOG4348|consen 568 KNSLDELRAQIIELLCIVEALKK-------DHGKELEKLRKDLEEEKTMRSNLE 614 (627)
T ss_pred hhhHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHhhhH
Confidence 33455666666666655555443 344556666666777776655443
No 67
>PRK14162 heat shock protein GrpE; Provisional
Probab=27.04 E-value=2.5e+02 Score=25.80 Aligned_cols=31 Identities=16% Similarity=0.160 Sum_probs=18.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 76 REMSMGVTTRIEDLNTVIGETQDHRHRVLVA 106 (380)
Q Consensus 76 ~~~l~~l~~~i~~l~~~i~~~~~~~~~~~~~ 106 (380)
...+..++.+++++++++.+.++++..+..+
T Consensus 38 ~~e~~~l~~~l~~l~~e~~elkd~~lR~~AE 68 (194)
T PRK14162 38 QNPVEDLEKEIADLKAKNKDLEDKYLRSQAE 68 (194)
T ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3445566667777777766665555444333
No 68
>KOG2391|consensus
Probab=26.99 E-value=1e+02 Score=30.69 Aligned_cols=8 Identities=25% Similarity=0.488 Sum_probs=3.2
Q ss_pred ccccCccc
Q psy510 184 TYNRTNKF 191 (380)
Q Consensus 184 T~l~n~~~ 191 (380)
++|||=+.
T Consensus 335 ~yLr~VR~ 342 (365)
T KOG2391|consen 335 QYLRHVRL 342 (365)
T ss_pred HHHHHHHH
Confidence 34444333
No 69
>PRK14011 prefoldin subunit alpha; Provisional
Probab=26.77 E-value=4.3e+02 Score=23.00 Aligned_cols=35 Identities=14% Similarity=0.152 Sum_probs=23.0
Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 73 TDRREMSMGVTTRIEDLNTVIGETQDHRHRVLVAA 107 (380)
Q Consensus 73 ~~~~~~l~~l~~~i~~l~~~i~~~~~~~~~~~~~~ 107 (380)
.+..++++.++++++++++.++.+...+.++...+
T Consensus 84 k~~~eA~~~~~~ri~~l~~~~~~l~~~i~~~~~~~ 118 (144)
T PRK14011 84 KDVSEVIEDFKKSVEELDKTKKEGNKKIEELNKEI 118 (144)
T ss_pred ecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45667777778777777776666666555554443
No 70
>PRK14139 heat shock protein GrpE; Provisional
Probab=26.05 E-value=2.5e+02 Score=25.51 Aligned_cols=23 Identities=13% Similarity=0.148 Sum_probs=11.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Q psy510 79 SMGVTTRIEDLNTVIGETQDHRH 101 (380)
Q Consensus 79 l~~l~~~i~~l~~~i~~~~~~~~ 101 (380)
+..++++++++++++.+.++++.
T Consensus 34 ~~~l~~~l~~le~e~~elkd~~l 56 (185)
T PRK14139 34 APALEAELAEAEAKAAELQDSFL 56 (185)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 34455555555555555544443
No 71
>PF07698 7TM-7TMR_HD: 7TM receptor with intracellular HD hydrolase; InterPro: IPR011621 These bacterial 7TM receptor proteins have an intracellular domain IPR006674 from INTERPRO. This entry corresponds to the 7 helix transmembrane domain. These proteins also contain an N-terminal extracellular domain.
Probab=25.90 E-value=3.5e+02 Score=23.95 Aligned_cols=71 Identities=18% Similarity=0.269 Sum_probs=41.2
Q ss_pred chhhhHHHHHHHHHhh-hhhHHHHHHHHHHHHHHhhccccccccchhHHHhhhhhHHHHHHHhHHHHHHHHHhcCCCC
Q psy510 214 PYTIITFPFLFAVMFG-DLGHGAIMFLFAAWMVMKEKPLMTLKTDNEIWKIFFGGRYIILLMGLFSMYTGLIYNDDRS 290 (380)
Q Consensus 214 ~~~aitFp~fFG~MfG-D~GyGlll~l~~~~l~~~~~k~~~~~~~~~~~~~~f~~r~il~~~Gi~si~~G~lyg~fFG 290 (380)
.+++.+..++-++|++ |....+..++.|.....-.+|.+ .+.+.++ .|-++-..-.+....+|++.|+-++
T Consensus 86 i~~~~~~sl~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~---~R~~~i~---ag~~v~l~~~~~~l~~~l~~~~~~~ 157 (194)
T PF07698_consen 86 ILASLFLSLLASLLFGFDFEFFLYSLVSGIVAIFSVRRIR---SRSDIIK---AGLLVGLVNALMILALGLIQGSSFS 157 (194)
T ss_pred HHHHHHHHHHHHHHhcccHHHHHHHHHHHHHHHHHHHHHH---HHHHHHH---HHHHHHHHHHHHHHHHHHHhcCCch
Confidence 3455566667777763 77777777666666554444432 2233332 3444555555666777887777664
No 72
>KOG0994|consensus
Probab=25.76 E-value=2e+02 Score=33.44 Aligned_cols=57 Identities=5% Similarity=0.096 Sum_probs=32.3
Q ss_pred hhHHHHHHHHcccCceeecCCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 49 QLKTRVMKICEGFRATLYPCPEAPTDRREMSMGVTTRIEDLNTVIGETQDHRHRVLVA 106 (380)
Q Consensus 49 ~~~~~v~ki~~~~~~~~~~~p~~~~~~~~~l~~l~~~i~~l~~~i~~~~~~~~~~~~~ 106 (380)
++++|+..|-.-+++..+... +.+....+.+.++++|.+..+.+.+++..+.++..+
T Consensus 1205 ~me~kl~~ir~il~~~svs~~-~i~~l~~~~~~lr~~l~~~~e~L~~~E~~Lsdi~~~ 1261 (1758)
T KOG0994|consen 1205 DMEEKLEEIRAILSAPSVSAE-DIAQLASATESLRRQLQALTEDLPQEEETLSDITNS 1261 (1758)
T ss_pred HHHHHHHHHHHHhcCCCccHH-HHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhhhc
Confidence 355566555444444333221 224455666777777777777777776666655443
No 73
>COG1294 AppB Cytochrome bd-type quinol oxidase, subunit 2 [Energy production and conversion]
Probab=25.31 E-value=2e+02 Score=28.94 Aligned_cols=72 Identities=19% Similarity=0.352 Sum_probs=42.7
Q ss_pred HHHHHHHHHhhhhhHHHHHHHHHHHHHHhhccccccccchhHHHhhhhhHHHHHHHh-HHHHHHHHHhcCCC-CCCC
Q psy510 219 TFPFLFAVMFGDLGHGAIMFLFAAWMVMKEKPLMTLKTDNEIWKIFFGGRYIILLMG-LFSMYTGLIYNDDR-SGSS 293 (380)
Q Consensus 219 tFp~fFG~MfGD~GyGlll~l~~~~l~~~~~k~~~~~~~~~~~~~~f~~r~il~~~G-i~si~~G~lyg~fF-G~~l 293 (380)
.||.-++.+|+..==-+++.|+++++.--....+. |..++.|+.+.+ +.+...| +.+..+|++.|.++ |.++
T Consensus 76 aFP~~ya~~~s~lYip~~~~L~~Li~R~v~fefR~-k~~~~~~k~~wd--~~~~igs~~~~~~~Gvalg~~~~G~pi 149 (346)
T COG1294 76 AFPLAYATLFSGLYLPMILVLFGLIFRGVAFEFRS-KIEDPRWKKFWD--WAFFIGSFLPPLLLGVALGNLLQGVPI 149 (346)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhcc-cccChhhHhHHH--HHHHhhhHHHHHHHHHHHHHHhcCcee
Confidence 78999999999888888888888887532222221 112333333322 1233333 34567777777776 5444
No 74
>PF10146 zf-C4H2: Zinc finger-containing protein ; InterPro: IPR018482 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a family of proteins which appears to have a highly conserved zinc finger domain at the C-terminal end, described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is predicted to contain a coiled coil. Members of this family are annotated as being tumour-associated antigen HCA127 in humans, but this could not be confirmed.
Probab=25.08 E-value=4.5e+02 Score=24.73 Aligned_cols=18 Identities=22% Similarity=0.339 Sum_probs=10.8
Q ss_pred CCCCCCCccccchhhHhHhhhh
Q psy510 344 TFEDPPTYNRTNKFTKAFQALI 365 (380)
Q Consensus 344 ~~~~ppt~~~~n~ft~~fq~~~ 365 (380)
-+.+|| .||. ..++|.|+
T Consensus 188 ~~Q~PP--MK~C--~sC~qqIH 205 (230)
T PF10146_consen 188 RQQPPP--MKTC--QSCHQQIH 205 (230)
T ss_pred cccCCC--cchh--HhHHHHHh
Confidence 355666 3433 46788876
No 75
>KOG4302|consensus
Probab=24.77 E-value=3.5e+02 Score=29.64 Aligned_cols=56 Identities=9% Similarity=0.134 Sum_probs=33.8
Q ss_pred hhHHHHHHHHcccCceeecCC---CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 49 QLKTRVMKICEGFRATLYPCP---EAPTDRREMSMGVTTRIEDLNTVIGETQDHRHRVL 104 (380)
Q Consensus 49 ~~~~~v~ki~~~~~~~~~~~p---~~~~~~~~~l~~l~~~i~~l~~~i~~~~~~~~~~~ 104 (380)
..+.++..+|..+|....... ...+...+.+..+...++++.+...+-..+..++.
T Consensus 72 ~~eael~~l~s~l~~~~~~~~~~~k~e~tLke~l~~l~~~le~lr~qk~eR~~ef~el~ 130 (660)
T KOG4302|consen 72 VIEAELNDLCSALGEPSIIGEISDKIEGTLKEQLESLKPYLEGLRKQKDERRAEFKELY 130 (660)
T ss_pred HHHHHHHHHHHHhCCcccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 356788899998888876544 33456666666666666655554444444433333
No 76
>cd02433 Nodulin-21_like_2 Nodulin-21 and CCC1-related protein family. Nodulin-21_like_2: This is a family of proteins closely related to nodulin-21, a plant nodule-specific protein that may be involved in symbiotic nitrogen fixation. This family is also related to CCC1, a yeast vacuole transmembrane protein that functions as an iron and manganese transporter.
Probab=24.59 E-value=3e+02 Score=25.92 Aligned_cols=20 Identities=10% Similarity=-0.165 Sum_probs=12.4
Q ss_pred HHHHhHHHHHHHHHhcCCCC
Q psy510 271 ILLMGLFSMYTGLIYNDDRS 290 (380)
Q Consensus 271 l~~~Gi~si~~G~lyg~fFG 290 (380)
.+.+|+.++...++.|.+||
T Consensus 215 ~~~~G~~aa~vsy~iG~l~~ 234 (234)
T cd02433 215 QLAIGGGAAAVTYLLGLLFG 234 (234)
T ss_pred HHHHHHHHHHHHHHHHHHcC
Confidence 34556666666666666665
No 77
>PRK14155 heat shock protein GrpE; Provisional
Probab=24.46 E-value=1.8e+02 Score=27.00 Aligned_cols=23 Identities=22% Similarity=0.304 Sum_probs=12.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Q psy510 80 MGVTTRIEDLNTVIGETQDHRHR 102 (380)
Q Consensus 80 ~~l~~~i~~l~~~i~~~~~~~~~ 102 (380)
++++++++++++++.+.++++..
T Consensus 16 ~~l~~~l~~le~e~~elkd~~lR 38 (208)
T PRK14155 16 DDAAQEIEALKAEVAALKDQALR 38 (208)
T ss_pred cchHHHHHHHHHHHHHHHHHHHH
Confidence 34555556666655555444433
No 78
>PRK13729 conjugal transfer pilus assembly protein TraB; Provisional
Probab=24.37 E-value=2.4e+02 Score=29.54 Aligned_cols=44 Identities=7% Similarity=0.152 Sum_probs=26.6
Q ss_pred HHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 76 REMSMGVTTRIEDLNTVIG---ETQDHRHRVLVAAAKNIKNWFIKVR 119 (380)
Q Consensus 76 ~~~l~~l~~~i~~l~~~i~---~~~~~~~~~~~~~~~~l~~~~~~~~ 119 (380)
.+..+++++++++++.+++ ...+++++.+++...++..+..+++
T Consensus 75 Q~kasELEKqLaaLrqElq~~saq~~dle~KIkeLEaE~~~Lk~Ql~ 121 (475)
T PRK13729 75 QVTAAQMQKQYEEIRRELDVLNKQRGDDQRRIEKLGQDNAALAEQVK 121 (475)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHH
Confidence 4456677777777776666 3344445555555556666666663
No 79
>PF05082 Rop-like: Rop-like; InterPro: IPR007774 This family contains several uncharacterised bacterial proteins. These proteins are found in nitrogen fixation operons, so are likely to play a role in this process.; PDB: 3CSX_A 2JS5_B.
Probab=24.29 E-value=1.9e+02 Score=21.82 Aligned_cols=30 Identities=10% Similarity=0.063 Sum_probs=22.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 78 MSMGVTTRIEDLNTVIGETQDHRHRVLVAA 107 (380)
Q Consensus 78 ~l~~l~~~i~~l~~~i~~~~~~~~~~~~~~ 107 (380)
.+++++.++..++..-.+.+-++.++.+..
T Consensus 3 d~~eLk~evkKL~~~A~~~kmdLHDLaEdL 32 (66)
T PF05082_consen 3 DIEELKKEVKKLNRKATQAKMDLHDLAEDL 32 (66)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHCT
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHcc
Confidence 466788888888888888887877776653
No 80
>PF11471 Sugarporin_N: Maltoporin periplasmic N-terminal extension; InterPro: IPR021570 This N-terminal domain is found in members of the sugar porin family 1.B.3 from TC, They are related to LamB - the well characterised maltoporin of Escherichia coli for which the three-dimensional structures with and without its substrate have been obtained by X-ray diffraction. The protein consists of an 18 beta-stranded beta-barrel in contrast to proteins of the general bacterial porin family (GBP) and the Rhodobacter PorCa Porin (RPP) family which consist of 16 beta-stranded beta-barrels. Although maltoporin contains a wider beta-barrel than the porins of the GBP and RPP families (1.B.1 from TC and 1.B.7 from TC), it exhibits a narrower channel, showing only 5% of the ionic conductance of the latter porins.
Probab=24.26 E-value=1.9e+02 Score=21.27 Aligned_cols=26 Identities=4% Similarity=0.064 Sum_probs=13.3
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 74 DRREMSMGVTTRIEDLNTVIGETQDH 99 (380)
Q Consensus 74 ~~~~~l~~l~~~i~~l~~~i~~~~~~ 99 (380)
+.++-+..+++++++.+++.+..+.+
T Consensus 29 tiEqRLa~LE~rL~~ae~ra~~ae~~ 54 (60)
T PF11471_consen 29 TIEQRLAALEQRLQAAEQRAQAAEAR 54 (60)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444555555555555555554444
No 81
>PF04977 DivIC: Septum formation initiator; InterPro: IPR007060 DivIC, from the spore-forming, Gram-positive bacterium Bacillus subtilis, is necessary for both vegetative and sporulation septum formation []. These proteins are mainly composed of an N-terminal coiled-coil. DivIB, DivIC and FtsL inter-depend on each other for stabilisation and localisation. The latter two form a heterodimer. DivIC is always centre cell but the other two associate with it during septation [].; GO: 0007049 cell cycle
Probab=24.18 E-value=1.8e+02 Score=21.68 Aligned_cols=29 Identities=17% Similarity=0.208 Sum_probs=21.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 75 RREMSMGVTTRIEDLNTVIGETQDHRHRV 103 (380)
Q Consensus 75 ~~~~l~~l~~~i~~l~~~i~~~~~~~~~~ 103 (380)
....++++++++++++++.++++.++..+
T Consensus 22 ~~~ei~~l~~~i~~l~~e~~~L~~ei~~l 50 (80)
T PF04977_consen 22 LNQEIAELQKEIEELKKENEELKEEIERL 50 (80)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 44557778888888888888877776666
No 82
>PF07544 Med9: RNA polymerase II transcription mediator complex subunit 9; InterPro: IPR011425 The Mediator complex is a coactivator involved in the regulated transcription of nearly all RNA polymerase II-dependent genes. Mediator functions as a bridge to convey information from gene-specific regulatory proteins to the basal RNA polymerase II transcription machinery. The Mediator complex, having a compact conformation in its free form, is recruited to promoters by direct interactions with regulatory proteins and serves for the assembly of a functional preinitiation complex with RNA polymerase II and the general transcription factors. On recruitment the Mediator complex unfolds to an extended conformation and partially surrounds RNA polymerase II, specifically interacting with the unphosphorylated form of the C-terminal domain (CTD) of RNA polymerase II. The Mediator complex dissociates from the RNA polymerase II holoenzyme and stays at the promoter when transcriptional elongation begins. The Mediator complex is composed of at least 31 subunits: MED1, MED4, MED6, MED7, MED8, MED9, MED10, MED11, MED12, MED13, MED13L, MED14, MED15, MED16, MED17, MED18, MED19, MED20, MED21, MED22, MED23, MED24, MED25, MED26, MED27, MED29, MED30, MED31, CCNC, CDK8 and CDC2L6/CDK11. The subunits form at least three structurally distinct submodules. The head and the middle modules interact directly with RNA polymerase II, whereas the elongated tail module interacts with gene-specific regulatory proteins. Mediator containing the CDK8 module is less active than Mediator lacking this module in supporting transcriptional activation. The head module contains: MED6, MED8, MED11, SRB4/MED17, SRB5/MED18, ROX3/MED19, SRB2/MED20 and SRB6/MED22. The middle module contains: MED1, MED4, NUT1/MED5, MED7, CSE2/MED9, NUT2/MED10, SRB7/MED21 and SOH1/MED31. CSE2/MED9 interacts directly with MED4. The tail module contains: MED2, PGD1/MED3, RGR1/MED14, GAL11/MED15 and SIN4/MED16. The CDK8 module contains: MED12, MED13, CCNC and CDK8. Individual preparations of the Mediator complex lacking one or more distinct subunits have been variously termed ARC, CRSP, DRIP, PC2, SMCC and TRAP. This entry represents subunit Med9 of the Mediator complex. Subunit Med9 is part of the middle module of the Mediator complex []; this associates with the core polymerase subunits to form the RNA polymerase II holoenzyme. Med9 alternatively known as the chromosome segregation protein, CSE2 (P33308 from SWISSPROT) is required, along with CSE1 (P33307 from SWISSPROT) for accurate mitotic chromosome segregation in Saccharomyces cerevisiae (Baker's yeast) [].; GO: 0001104 RNA polymerase II transcription cofactor activity, 0006357 regulation of transcription from RNA polymerase II promoter, 0016592 mediator complex
Probab=24.12 E-value=2.2e+02 Score=22.17 Aligned_cols=52 Identities=13% Similarity=0.201 Sum_probs=26.5
Q ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 71 APTDRREMSMGVTTRIEDLNTVIGETQDHRHRVLVAAAKNIKNWFIKVRKIKA 123 (380)
Q Consensus 71 ~~~~~~~~l~~l~~~i~~l~~~i~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~ 123 (380)
++.........++.+++.++..++++.. +..-.++....++.+..++++.++
T Consensus 22 ~~kd~~~~~~~lk~Klq~ar~~i~~lpg-i~~s~eeq~~~i~~Le~~i~~k~~ 73 (83)
T PF07544_consen 22 SSKDLDTATGSLKHKLQKARAAIRELPG-IDRSVEEQEEEIEELEEQIRKKRE 73 (83)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHhCCC-ccCCHHHHHHHHHHHHHHHHHHHH
Confidence 3455566667777777777777666432 122222333334444444444443
No 83
>PF15290 Syntaphilin: Golgi-localised syntaxin-1-binding clamp
Probab=23.97 E-value=2.6e+02 Score=27.18 Aligned_cols=17 Identities=18% Similarity=0.304 Sum_probs=7.3
Q ss_pred HHHHHHHHHHHHHHHHH
Q psy510 79 SMGVTTRIEDLNTVIGE 95 (380)
Q Consensus 79 l~~l~~~i~~l~~~i~~ 95 (380)
+++-+.+|++|+..+..
T Consensus 84 l~dRetEI~eLksQL~R 100 (305)
T PF15290_consen 84 LHDRETEIDELKSQLAR 100 (305)
T ss_pred HHhhHHHHHHHHHHHHH
Confidence 33444444444444433
No 84
>PF02060 ISK_Channel: Slow voltage-gated potassium channel; InterPro: IPR000369 Potassium channels are the most diverse group of the ion channel family [, ]. They are important in shaping the action potential, and in neuronal excitability and plasticity []. The potassium channel family is composed of several functionally distinct isoforms, which can be broadly separated into 2 groups []: the practically non-inactivating 'delayed' group and the rapidly inactivating 'transient' group. These are all highly similar proteins, with only small amino acid changes causing the diversity of the voltage-dependent gating mechanism, channel conductance and toxin binding properties. Each type of K+ channel is activated by different signals and conditions depending on their type of regulation: some open in response to depolarisation of the plasma membrane; others in response to hyperpolarisation or an increase in intracellular calcium concentration; some can be regulated by binding of a transmitter, together with intracellular kinases; while others are regulated by GTP-binding proteins or other second messengers []. In eukaryotic cells, K+ channels are involved in neural signalling and generation of the cardiac rhythm, act as effectors in signal transduction pathways involving G protein-coupled receptors (GPCRs) and may have a role in target cell lysis by cytotoxic T-lymphocytes []. In prokaryotic cells, they play a role in the maintenance of ionic homeostasis []. All K+ channels discovered so far possess a core of alpha subunits, each comprising either one or two copies of a highly conserved pore loop domain (P-domain). The P-domain contains the sequence (T/SxxTxGxG), which has been termed the K+ selectivity sequence. In families that contain one P-domain, four subunits assemble to form a selective pathway for K+ across the membrane. However, it remains unclear how the 2 P-domain subunits assemble to form a selective pore. The functional diversity of these families can arise through homo- or hetero-associations of alpha subunits or association with auxiliary cytoplasmic beta subunits. K+ channel subunits containing one pore domain can be assigned into one of two superfamilies: those that possess six transmembrane (TM) domains and those that possess only two TM domains. The six TM domain superfamily can be further subdivided into conserved gene families: the voltage-gated (Kv) channels; the KCNQ channels (originally known as KvLQT channels); the EAG-like K+ channels; and three types of calcium (Ca)-activated K+ channels (BK, IK and SK) []. The 2TM domain family comprises inward-rectifying K+ channels. In addition, there are K+ channel alpha-subunits that possess two P-domains. These are usually highly regulated K+ selective leak channels. Two types of beta subunit (KCNE and KCNAB) are presently known to associate with voltage-gated alpha subunits (Kv, KCNQ and eag-like). However, not all combinations of alpha and beta subunits are possible. The KCNE family of K+ channel subunits are membrane glycoproteins that possess a single transmembrane (TM) domain. They share no structural relationship with the alpha subunit proteins, which possess pore forming domains. The subunits appear to have a regulatory function, modulating the kinetics and voltage dependence of the alpha subunits of voltage-dependent K+ channels. KCNE subunits are formed from short polypeptides of ~130 amino acids, and are divided into five subfamilies: KCNE1 (MinK/IsK), KCNE2 (MiRP1), KCNE3 (MiRP2), KCNE4 (MiRP3) and KCNE1L (AMMECR2). ; GO: 0005249 voltage-gated potassium channel activity, 0006811 ion transport, 0016020 membrane; PDB: 2K21_A.
Probab=23.95 E-value=74 Score=27.12 Aligned_cols=50 Identities=28% Similarity=0.466 Sum_probs=25.3
Q ss_pred HHHHHHhHHHHHH-HHHhcCCCCCCCCccccccCCCCCCCCcccc-ccccchhHHH
Q psy510 269 YIILLMGLFSMYT-GLIYNDDRSGSSVPPILNRMDTFEDPPTYNR-TNKFTKAFQA 322 (380)
Q Consensus 269 ~il~~~Gi~si~~-G~lyg~fFG~~l~~~~l~~~~~~~~~~~~~~-t~~~~~~~q~ 322 (380)
||+++||+++++. |++.|-..+.-.-. +-||+...+.-.. ..+..+-+|+
T Consensus 46 YIL~vmgfFgff~~gImlsyvRSKK~E~----s~DPyh~YIe~d~W~eK~K~~~qa 97 (129)
T PF02060_consen 46 YILVVMGFFGFFTVGIMLSYVRSKKREH----SHDPYHQYIEEDDWQEKDKAYFQA 97 (129)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHH---------TTTTTS-TTT--S--HHHHHH
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHhhhcc----ccChHHHHhHHHHHHHHHHHHHHH
Confidence 8999999998765 55555555533321 1266665554332 4444444554
No 85
>PRK14148 heat shock protein GrpE; Provisional
Probab=23.82 E-value=3.4e+02 Score=24.88 Aligned_cols=29 Identities=17% Similarity=0.171 Sum_probs=16.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 77 EMSMGVTTRIEDLNTVIGETQDHRHRVLV 105 (380)
Q Consensus 77 ~~l~~l~~~i~~l~~~i~~~~~~~~~~~~ 105 (380)
+.+..++++++++++++++.++++.....
T Consensus 40 ~e~~~l~~~l~~l~~e~~elkd~~lR~~A 68 (195)
T PRK14148 40 EQLERAKDTIKELEDSCDQFKDEALRAKA 68 (195)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33556666677677666666555444433
No 86
>PRK13169 DNA replication intiation control protein YabA; Reviewed
Probab=23.75 E-value=4.3e+02 Score=21.97 Aligned_cols=35 Identities=9% Similarity=0.072 Sum_probs=27.3
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 74 DRREMSMGVTTRIEDLNTVIGETQDHRHRVLVAAA 108 (380)
Q Consensus 74 ~~~~~l~~l~~~i~~l~~~i~~~~~~~~~~~~~~~ 108 (380)
..-+.+.++++++.++-+.+.++++.+.+++++..
T Consensus 5 elfd~l~~le~~l~~l~~el~~LK~~~~el~EEN~ 39 (110)
T PRK13169 5 EIFDALDDLEQNLGVLLKELGALKKQLAELLEENT 39 (110)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45566778888888888888888888888777753
No 87
>COG1448 TyrB Aspartate/tyrosine/aromatic aminotransferase [Amino acid transport and metabolism]
Probab=23.24 E-value=3.1e+02 Score=28.00 Aligned_cols=72 Identities=19% Similarity=0.207 Sum_probs=42.6
Q ss_pred CCccccEEEEEEEeChhhHHHHHHHHcccCceeecCCCCcchH--------HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 33 GDPVYKSVFIIFFQGDQLKTRVMKICEGFRATLYPCPEAPTDR--------REMSMGVTTRIEDLNTVIGETQDHRHRVL 104 (380)
Q Consensus 33 ge~~~k~v~ii~~~~~~~~~~v~ki~~~~~~~~~~~p~~~~~~--------~~~l~~l~~~i~~l~~~i~~~~~~~~~~~ 104 (380)
|+++.- +.++....+..+++..-+...=-..|.-|..++.. .+...++++++++++.+|.+.+..+.+.+
T Consensus 252 gERVGa--~~vva~~~~~a~~v~sqlk~~iR~~ySnPP~~Ga~vva~IL~~p~Lra~W~~El~~Mr~Ri~~mR~~lv~~L 329 (396)
T COG1448 252 GERVGA--LSVVAEDAEEADRVLSQLKAIIRTNYSNPPAHGAAVVATILNNPELRAEWEQELEEMRQRILEMRQALVDAL 329 (396)
T ss_pred hhccce--eEEEeCCHHHHHHHHHHHHHHHHhccCCCchhhHHHHHHHhCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 444443 45556655544444444333333456555544322 44567788999999988888877766666
Q ss_pred HH
Q psy510 105 VA 106 (380)
Q Consensus 105 ~~ 106 (380)
++
T Consensus 330 ~~ 331 (396)
T COG1448 330 KA 331 (396)
T ss_pred Hh
Confidence 55
No 88
>COG1983 PspC Putative stress-responsive transcriptional regulator [Transcription / Signal transduction mechanisms]
Probab=23.06 E-value=61 Score=24.77 Aligned_cols=41 Identities=24% Similarity=0.494 Sum_probs=27.0
Q ss_pred cCCCCCccCCCchhhhHHHHHHHHHhhhhhHHHHHHHHHHHHH
Q psy510 203 GVSSYREMNPTPYTIITFPFLFAVMFGDLGHGAIMFLFAAWMV 245 (380)
Q Consensus 203 g~P~Y~EiDPT~~~aitFp~fFG~MfGD~GyGlll~l~~~~l~ 245 (380)
|+..|-.+|||..=.++-. +.++.+=.|-|+++-++..+++
T Consensus 20 Gla~yf~id~tlVRll~vl--~~~~~~~~~~~ii~Yiia~~im 60 (70)
T COG1983 20 GLAEYFGIDPTLVRLLFVL--LTLFGGLTGFGIIAYIIAALIM 60 (70)
T ss_pred hHHHHhCCChHHHHHHHHH--HHHHhcchhHHHHHHHHHHHHC
Confidence 5677999999987665433 3333335677777777776664
No 89
>PRK00293 dipZ thiol:disulfide interchange protein precursor; Provisional
Probab=22.87 E-value=2.7e+02 Score=29.77 Aligned_cols=41 Identities=20% Similarity=0.263 Sum_probs=18.8
Q ss_pred hHHHHHHHHHHHHHHhhccccccccchhHHHhhhhhHHHHHHHhHH
Q psy510 232 GHGAIMFLFAAWMVMKEKPLMTLKTDNEIWKIFFGGRYIILLMGLF 277 (380)
Q Consensus 232 GyGlll~l~~~~l~~~~~k~~~~~~~~~~~~~~f~~r~il~~~Gi~ 277 (380)
|.|+-++++|.+..+..+|. +.+.+.++.+++ .+++.+|++
T Consensus 337 G~~~Plll~~~~~~~~lpk~---g~wm~~~k~~~G--~~ll~~~~~ 377 (571)
T PRK00293 337 GMGLPLILITTFGNKLLPKS---GPWMNQVKTAFG--FVLLALPVF 377 (571)
T ss_pred HHHHHHHHHHHHHHhhcccC---ccHHHHHHHHHH--HHHHHHHHH
Confidence 45555566665543221221 234444554443 355555544
No 90
>PRK14150 heat shock protein GrpE; Provisional
Probab=22.84 E-value=3.5e+02 Score=24.71 Aligned_cols=13 Identities=31% Similarity=0.603 Sum_probs=5.4
Q ss_pred HHHHHHHHHHHHH
Q psy510 84 TRIEDLNTVIGET 96 (380)
Q Consensus 84 ~~i~~l~~~i~~~ 96 (380)
.+++++++.+.+.
T Consensus 41 ~~i~~l~~~l~~~ 53 (193)
T PRK14150 41 ARIAELEAQLAEA 53 (193)
T ss_pred HHHHHHHHHHHHH
Confidence 3444444444433
No 91
>PF03188 Cytochrom_B561: Eukaryotic cytochrome b561; InterPro: IPR004877 Cytochrome b561 is a secretory vesicle-specific electron transport protein []. It is an integral membrane protein, that binds two haem groups non-covalently. This entry represents the eukaryotic family. Members of the 'bacterial cytochrome b561' family can be found in IPR011577 from INTERPRO.; GO: 0016021 integral to membrane
Probab=22.66 E-value=1.6e+02 Score=24.55 Aligned_cols=22 Identities=18% Similarity=0.401 Sum_probs=20.0
Q ss_pred HHHHHHHhHHHHHHHHHhcCCC
Q psy510 268 RYIILLMGLFSMYTGLIYNDDR 289 (380)
Q Consensus 268 r~il~~~Gi~si~~G~lyg~fF 289 (380)
++++..+|+.+++.|+.+.++|
T Consensus 115 G~~~~~l~~~~i~~G~~~~~~f 136 (137)
T PF03188_consen 115 GYLIYVLAIATIFLGLTEKAWF 136 (137)
T ss_pred HHHHHHHHHHHHHHHHHHHHHh
Confidence 4788999999999999999887
No 92
>KOG0948|consensus
Probab=22.46 E-value=5.3e+02 Score=28.98 Aligned_cols=36 Identities=14% Similarity=0.194 Sum_probs=23.0
Q ss_pred cccchHHHHHHHH------hcCCCCCccCCCchhhhHHHHHH
Q psy510 189 NKFTKAFQALIDA------YGVSSYREMNPTPYTIITFPFLF 224 (380)
Q Consensus 189 ~~~~~pFE~iv~~------Yg~P~Y~EiDPT~~~aitFp~fF 224 (380)
+.+..|+..+-++ -+.-.--|||+.-++--|=|.+-
T Consensus 925 ~el~~~l~~lqe~ArrIAkVs~ecKlEide~~Yv~sFkp~LM 966 (1041)
T KOG0948|consen 925 EELAGPLRQLQESARRIAKVSKECKLEIDEEDYVESFKPELM 966 (1041)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccccCHHHHHHhcChHHH
Confidence 3455666666554 34455678998888877666543
No 93
>PF14817 HAUS5: HAUS augmin-like complex subunit 5
Probab=22.30 E-value=1.5e+02 Score=32.31 Aligned_cols=17 Identities=18% Similarity=0.528 Sum_probs=10.6
Q ss_pred hHhHhhhhhhhccCCcc
Q psy510 358 TKAFQALIDAYGVSSYR 374 (380)
Q Consensus 358 t~~fq~~~~~yg~~~y~ 374 (380)
..+|+.++++.|.|-|+
T Consensus 502 ~~~l~~l~~~L~fp~~k 518 (632)
T PF14817_consen 502 GASLISLLESLGFPLYK 518 (632)
T ss_pred CchHHHHHHhcCCCCCC
Confidence 45666666666666654
No 94
>PRK14160 heat shock protein GrpE; Provisional
Probab=22.24 E-value=4.1e+02 Score=24.75 Aligned_cols=45 Identities=9% Similarity=-0.001 Sum_probs=18.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHH
Q psy510 79 SMGVTTRIEDLNTVIGETQDHRHRVLVAAA---KNIKNWFIKVRKIKA 123 (380)
Q Consensus 79 l~~l~~~i~~l~~~i~~~~~~~~~~~~~~~---~~l~~~~~~~~~~~~ 123 (380)
+..++.+++.++..+...+.++.++...+. .+++.++.+..+++.
T Consensus 56 ~~~l~~e~~~l~~~l~~l~~e~~elkd~~lR~~AefeN~RKR~~kE~e 103 (211)
T PRK14160 56 IEELKDENNKLKEENKKLENELEALKDRLLRTVAEYDNYRKRTAKEKE 103 (211)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334444444444444444444333333322 223444545444443
No 95
>PF07439 DUF1515: Protein of unknown function (DUF1515); InterPro: IPR010889 This family consists of several hypothetical bacterial proteins of around 130 residues in length. Members of this family seem to be found exclusively in Rhizobium species. The function of this family is unknown.
Probab=22.14 E-value=4.7e+02 Score=21.80 Aligned_cols=24 Identities=13% Similarity=0.253 Sum_probs=13.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 76 REMSMGVTTRIEDLNTVIGETQDH 99 (380)
Q Consensus 76 ~~~l~~l~~~i~~l~~~i~~~~~~ 99 (380)
.+.+..++++++++++.+++.+++
T Consensus 7 ~~q~~~l~~~v~~lRed~r~SEdr 30 (112)
T PF07439_consen 7 HQQLGTLNAEVKELREDIRRSEDR 30 (112)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344556666666666666555444
No 96
>PRK00409 recombination and DNA strand exchange inhibitor protein; Reviewed
Probab=22.14 E-value=1.6e+02 Score=32.80 Aligned_cols=15 Identities=7% Similarity=0.062 Sum_probs=6.3
Q ss_pred hHHHHHHHHHHHHHH
Q psy510 74 DRREMSMGVTTRIED 88 (380)
Q Consensus 74 ~~~~~l~~l~~~i~~ 88 (380)
...+.+++++++..+
T Consensus 517 ~~~~li~~l~~~~~~ 531 (782)
T PRK00409 517 KLNELIASLEELERE 531 (782)
T ss_pred HHHHHHHHHHHHHHH
Confidence 334444444443333
No 97
>PF11945 WASH_WAHD: WAHD domain of WASH complex; InterPro: IPR021854 This entry represents a component of the WASH complex. The WASH complex is present at the surface of endosomes and recruits and activates the Arp2/3 complex to induce actin polymerisation. The WASH complex plays a key role in the fission of tubules that serve as transport intermediates during endosome sorting []. The WASH complex's subunit structure: F-actin-capping protein subunit alpha (CAPZA1, CAPZA2 or CAPZA3), F-actin-capping protein subunit beta (CAPZB), WASH (WASH1, WASH2P, WASH3P, WASH4P, WASH5P or WASH6P), FAM21 (FAM21A, FAM21B or FAM21C), KIAA1033, KIAA0196 (strumpellin) and CCDC53. This entry represents the WASH subunit of the WASH complex. WASH genes duplicated to multiple chromosomal ends during primate evolution, with highest copy number reached in humans, whose WASH repertoires probably vary extensively among individuals []. It is therefore difficult to determine which gene is functional or not. The telomeric region of chromosome 9p is paralogous to the pericentromeric regions of chromosome 9 as well as to 2q. Paralogous regions contain 7 transcriptional units. Duplicated WASH genes are also present in the Xq/Yq pseudoautosomal region, as well as on chromosome 1 and 15. The chromosome 16 copy seems to be a pseudogene.
Probab=22.12 E-value=7.3e+02 Score=24.32 Aligned_cols=55 Identities=13% Similarity=0.160 Sum_probs=35.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy510 75 RREMSMGVTTRIEDLNTVIGETQDHRHRVLVAAAKNIKNWFIKVRKIKAIYHTLN 129 (380)
Q Consensus 75 ~~~~l~~l~~~i~~l~~~i~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~l~ 129 (380)
..|.+.++-+.++.|++....+-.++..-++.....++.+..++.+.++..+.|.
T Consensus 16 ~eEti~qi~~aL~~L~~v~~diF~rI~~Rv~~~~~~l~~i~~Ri~~~qaKi~~l~ 70 (297)
T PF11945_consen 16 REETILQIADALEYLDKVSNDIFSRISARVERNRERLQAIQQRIEVAQAKIEKLQ 70 (297)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 4455666667777777777777666666666666666666666666666555443
No 98
>PF07889 DUF1664: Protein of unknown function (DUF1664); InterPro: IPR012458 The members of this family are hypothetical plant proteins of unknown function. The region featured in this family is approximately 100 amino acids long.
Probab=21.83 E-value=5e+02 Score=22.10 Aligned_cols=33 Identities=12% Similarity=0.250 Sum_probs=16.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 75 RREMSMGVTTRIEDLNTVIGETQDHRHRVLVAA 107 (380)
Q Consensus 75 ~~~~l~~l~~~i~~l~~~i~~~~~~~~~~~~~~ 107 (380)
..++...+.++++++.+.+..+++|+.+-+..+
T Consensus 41 m~~A~~~v~kql~~vs~~l~~tKkhLsqRId~v 73 (126)
T PF07889_consen 41 MSDAVASVSKQLEQVSESLSSTKKHLSQRIDRV 73 (126)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344445555555555555555555544444333
No 99
>PRK14145 heat shock protein GrpE; Provisional
Probab=21.80 E-value=3.7e+02 Score=24.74 Aligned_cols=26 Identities=12% Similarity=0.107 Sum_probs=14.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 78 MSMGVTTRIEDLNTVIGETQDHRHRV 103 (380)
Q Consensus 78 ~l~~l~~~i~~l~~~i~~~~~~~~~~ 103 (380)
.+..++.+++++++++.+.++.+..+
T Consensus 46 e~~~l~~~l~~le~e~~el~d~~lR~ 71 (196)
T PRK14145 46 EIEELKQKLQQKEVEAQEYLDIAQRL 71 (196)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34556666666666665555444333
No 100
>PF10267 Tmemb_cc2: Predicted transmembrane and coiled-coil 2 protein; InterPro: IPR019394 This family of transmembrane coiled-coil containing proteins is conserved from worms to humans. Its function is unknown.
Probab=21.61 E-value=4.2e+02 Score=27.11 Aligned_cols=36 Identities=0% Similarity=0.011 Sum_probs=19.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 76 REMSMGVTTRIEDLNTVIGETQDHRHRVLVAAAKNI 111 (380)
Q Consensus 76 ~~~l~~l~~~i~~l~~~i~~~~~~~~~~~~~~~~~l 111 (380)
...++.+.+++.++++....++.++.++-..+..++
T Consensus 211 ~~~l~~~~~el~eik~~~~~L~~~~e~Lk~~~~~e~ 246 (395)
T PF10267_consen 211 NLGLQKILEELREIKESQSRLEESIEKLKEQYQREY 246 (395)
T ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334555556666666555555555555544444333
No 101
>cd00890 Prefoldin Prefoldin is a hexameric molecular chaperone complex, found in both eukaryotes and archaea, that binds and stabilizes newly synthesized polypeptides allowing them to fold correctly. The complex contains two alpha and four beta subunits, the two subunits being evolutionarily related. In archaea, there is usually only one gene for each subunit while in eukaryotes there two or more paralogous genes encoding each subunit adding heterogeneity to the structure of the hexamer. The structure of the complex consists of a double beta barrel assembly with six protruding coiled-coils.
Probab=21.42 E-value=4.3e+02 Score=21.57 Aligned_cols=34 Identities=15% Similarity=0.129 Sum_probs=22.0
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 74 DRREMSMGVTTRIEDLNTVIGETQDHRHRVLVAA 107 (380)
Q Consensus 74 ~~~~~l~~l~~~i~~l~~~i~~~~~~~~~~~~~~ 107 (380)
+..++.+.++++++.+++.++++++.+.++...+
T Consensus 84 ~~~eA~~~l~~r~~~l~~~~~~l~~~~~~~~~~~ 117 (129)
T cd00890 84 SLEEAIEFLKKRLETLEKQIEKLEKQLEKLQDQI 117 (129)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4556667777777777777777666655554443
No 102
>PF02996 Prefoldin: Prefoldin subunit; InterPro: IPR004127 This entry comprises of several prefoldin subunits. Prefoldin (PFD) is a chaperone that interacts exclusively with type II chaperonins, hetero-oligomers lacking an obligate co-chaperonin that are found only in eukaryotes (chaperonin-containing T-complex polypeptide-1 (CCT)) and archaea. Eukaryotic PFD is a multi-subunit complex containing six polypeptides in the molecular mass range of 14-23 kDa. In archaea, on the other hand, PFD is composed of two types of subunits, two alpha and four beta. The six subunits associate to form two back-to-back up-and-down eight-stranded barrels, from which hang six coiled coils. Each subunit contributes one (beta subunits) or two (alpha subunits) beta hairpin turns to the barrels. The coiled coils are formed by the N and C termini of an individual subunit. Overall, this unique arrangement resembles a jellyfish. The eukaryotic PFD hexamer is composed of six different subunits; however, these can be grouped into two alpha-like (PFD3 and -5) and four beta-like (PFD1, -2, -4, and -6) subunits based on amino acid sequence similarity with their archaeal counterparts. Eukaryotic PFD has a six-legged structure similar to that seen in the archaeal homologue [, ]. This family contains the archaeal alpha subunit, eukaryotic prefoldin subunits 3 and 5 and the UXT (ubiquitously expressed transcript) family. Eukaryotic PFD has been shown to bind both actin and tubulin co-translationally. The chaperone then delivers the target protein to CCT, interacting with the chaperonin through the tips of the coiled coils. No authentic target proteins of any archaeal PFD have been identified, to date.; GO: 0051082 unfolded protein binding, 0006457 protein folding, 0016272 prefoldin complex; PDB: 1FXK_C 2ZDI_C.
Probab=21.41 E-value=3.6e+02 Score=21.82 Aligned_cols=49 Identities=14% Similarity=0.133 Sum_probs=30.3
Q ss_pred HHHHcccCceeecCCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 55 MKICEGFRATLYPCPEAPTDRREMSMGVTTRIEDLNTVIGETQDHRHRVLVAA 107 (380)
Q Consensus 55 ~ki~~~~~~~~~~~p~~~~~~~~~l~~l~~~i~~l~~~i~~~~~~~~~~~~~~ 107 (380)
.+++-++|+..|- ..+..++++-++++++.+++.++...+++.++...+
T Consensus 59 ~~vlV~lG~~~~v----E~s~~eA~~~l~~r~~~l~~~~~~l~~~~~~~~~~~ 107 (120)
T PF02996_consen 59 DKVLVSLGAGYYV----EMSLEEAIEFLKKRIKELEEQLEKLEKELAELQAQI 107 (120)
T ss_dssp TEEEEEEETTEEE----EEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred CEEEEEeeCCeEE----EecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444445544431 234677777888888888887777776666654443
No 103
>PRK09343 prefoldin subunit beta; Provisional
Probab=21.39 E-value=4.9e+02 Score=21.74 Aligned_cols=34 Identities=12% Similarity=0.137 Sum_probs=24.5
Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 73 TDRREMSMGVTTRIEDLNTVIGETQDHRHRVLVA 106 (380)
Q Consensus 73 ~~~~~~l~~l~~~i~~l~~~i~~~~~~~~~~~~~ 106 (380)
....++.++++.+++-++..|+.++++...+-+.
T Consensus 67 qd~~e~~~~l~~r~E~ie~~ik~lekq~~~l~~~ 100 (121)
T PRK09343 67 VDKTKVEKELKERKELLELRSRTLEKQEKKLREK 100 (121)
T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4567778888888888888888877665554433
No 104
>PF11471 Sugarporin_N: Maltoporin periplasmic N-terminal extension; InterPro: IPR021570 This N-terminal domain is found in members of the sugar porin family 1.B.3 from TC, They are related to LamB - the well characterised maltoporin of Escherichia coli for which the three-dimensional structures with and without its substrate have been obtained by X-ray diffraction. The protein consists of an 18 beta-stranded beta-barrel in contrast to proteins of the general bacterial porin family (GBP) and the Rhodobacter PorCa Porin (RPP) family which consist of 16 beta-stranded beta-barrels. Although maltoporin contains a wider beta-barrel than the porins of the GBP and RPP families (1.B.1 from TC and 1.B.7 from TC), it exhibits a narrower channel, showing only 5% of the ionic conductance of the latter porins.
Probab=21.28 E-value=2.4e+02 Score=20.75 Aligned_cols=23 Identities=9% Similarity=0.258 Sum_probs=18.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Q psy510 81 GVTTRIEDLNTVIGETQDHRHRV 103 (380)
Q Consensus 81 ~l~~~i~~l~~~i~~~~~~~~~~ 103 (380)
.+++|+..+|+++.+.+++..+.
T Consensus 29 tiEqRLa~LE~rL~~ae~ra~~a 51 (60)
T PF11471_consen 29 TIEQRLAALEQRLQAAEQRAQAA 51 (60)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHH
Confidence 58999999999999888775554
No 105
>TIGR02209 ftsL_broad cell division protein FtsL. This model represents FtsL, both forms similar to that in E. coli and similar to that in B. subtilis. FtsL is one of the later proteins active in cell division septum formation. FtsL is small, low in complexity, and highly divergent. The scope of this model is broader than that of the Pfam model pfam04999.3 for FtsL, as this one includes FtsL from Bacillus subtilis and related species.
Probab=21.27 E-value=3.7e+02 Score=20.36 Aligned_cols=29 Identities=10% Similarity=0.196 Sum_probs=16.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 79 SMGVTTRIEDLNTVIGETQDHRHRVLVAA 107 (380)
Q Consensus 79 l~~l~~~i~~l~~~i~~~~~~~~~~~~~~ 107 (380)
+.....+++++++++++.+.+..++..++
T Consensus 26 ~~~~~~~~~~~~~~~~~l~~en~~L~~ei 54 (85)
T TIGR02209 26 TRQLNNELQKLQLEIDKLQKEWRDLQLEV 54 (85)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34556666666666666666655554443
No 106
>PHA01750 hypothetical protein
Probab=21.23 E-value=3.1e+02 Score=20.70 Aligned_cols=15 Identities=27% Similarity=0.501 Sum_probs=6.5
Q ss_pred HHHHHHHHHHHHHHH
Q psy510 83 TTRIEDLNTVIGETQ 97 (380)
Q Consensus 83 ~~~i~~l~~~i~~~~ 97 (380)
++++..+..++++++
T Consensus 41 ~~ELdNL~~ei~~~k 55 (75)
T PHA01750 41 NSELDNLKTEIEELK 55 (75)
T ss_pred HHHHHHHHHHHHHHH
Confidence 334444444444443
No 107
>PRK14161 heat shock protein GrpE; Provisional
Probab=21.20 E-value=3.6e+02 Score=24.32 Aligned_cols=16 Identities=25% Similarity=0.318 Sum_probs=6.8
Q ss_pred HHHHHHHHHHHHHHHH
Q psy510 82 VTTRIEDLNTVIGETQ 97 (380)
Q Consensus 82 l~~~i~~l~~~i~~~~ 97 (380)
.+++++++++++++.+
T Consensus 24 ~~~ei~~l~~e~~elk 39 (178)
T PRK14161 24 ANPEITALKAEIEELK 39 (178)
T ss_pred hHHHHHHHHHHHHHHH
Confidence 3344444444444433
No 108
>KOG3850|consensus
Probab=21.15 E-value=5.3e+02 Score=26.31 Aligned_cols=31 Identities=3% Similarity=0.088 Sum_probs=14.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 79 SMGVTTRIEDLNTVIGETQDHRHRVLVAAAK 109 (380)
Q Consensus 79 l~~l~~~i~~l~~~i~~~~~~~~~~~~~~~~ 109 (380)
+..+.++++++++....+++.++.+...+.+
T Consensus 262 l~aileeL~eIk~~q~~Leesye~Lke~~kr 292 (455)
T KOG3850|consen 262 LDAILEELREIKETQALLEESYERLKEQIKR 292 (455)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4445555555554444444444444444333
No 109
>PF01024 Colicin: Colicin pore forming domain; InterPro: IPR000293 Colicins are plasmid-encoded polypeptide toxins produced by and active against Escherichia coli and closely related bacteria. Colicins are released into the environment to reduce competition from other bacterial strains. Colicins bind to outer membrane receptors, using them to translocate to the cytoplasm or cytoplasmic membrane, where they exert their cytotoxic effect, including depolarisation of the cytoplasmic membrane, DNase activity, RNase activity, or inhibition of murein synthesis. Channel-forming colicins (colicins A, B, E1, Ia, Ib, and N) are transmembrane proteins that depolarize the cytoplasmic membrane, leading to dissipation of cellular energy []. These colicins contain at least three domains: an N-terminal translocation domain responsible for movement across the outer membrane and periplasmic space; a central domain responsible for receptor recognition; and a C-terminal cytotoxic domain responsible for channel formation in the cytoplasmic membrane []. This entry represents the C-terminal cytotoxic domain, which has a globin-like fold with additional helices at either end.; GO: 0019835 cytolysis, 0050829 defense response to Gram-negative bacterium, 0016021 integral to membrane; PDB: 2I88_A 1CII_A 1RH1_A 1COL_B 1A87_A 3FEW_X.
Probab=21.11 E-value=1.2e+02 Score=27.70 Aligned_cols=32 Identities=19% Similarity=0.432 Sum_probs=30.0
Q ss_pred CchhhhHHHHHHHHHhhhhhHHHHHHHHHHHH
Q psy510 213 TPYTIITFPFLFAVMFGDLGHGAIMFLFAAWM 244 (380)
Q Consensus 213 T~~~aitFp~fFG~MfGD~GyGlll~l~~~~l 244 (380)
|.+++.+|.+.-|.-+|=+|+|+++.++|.++
T Consensus 144 s~v~a~~fs~~~~t~igi~g~ail~a~v~s~I 175 (187)
T PF01024_consen 144 SAVAAFAFSFILGTPIGILGIAILMAVVGSLI 175 (187)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHh
Confidence 88899999999999999999999999999987
No 110
>PF00509 Hemagglutinin: Haemagglutinin; InterPro: IPR001364 Haemagglutinin (HA) is one of two main surface fusion glycoproteins embedded in the envelope of influenza viruses, the other being neuraminidase (NA). There are sixteen known HA subtypes (H1-H16) and nine NA subtypes (N1-N9), which together are used to classify influenza viruses (e.g. H5N1). The antigenic variations in HA and NA enable the virus to evade host antibodies made to previous influenza strains, accounting for recurrent influenza epidemics []. The HA glycoprotein is present in the viral membrane as a single polypeptide (HA0), which must be cleaved by the host's trypsin-like proteases to produce two peptides (HA1 and HA2) in order for the virus to be infectious. Once HA0 is cleaved, the newly exposed N-terminal of the HA2 peptide then acts to fuse the viral envelope to the cellular membrane of the host cell, which allows the viral negative-stranded RNA to infect the host cell. The type of host protease can influence the infectivity and pathogenicity of the virus. The haemagglutinin glycoprotein is a trimer containing three structurally distinct regions: a globular head consisting of anti-parallel beta-sheets that form a beta-sandwich with a jelly-roll fold (contains the receptor binding site and the HA1/HA2 cleavage site); a triple-stranded, coiled-coil, alpha-helical stalk; and a globular foot composed of anti-parallel beta-sheets [, ]. Each monomer consists of an intact HA0 polypeptide with the HA1 and HA2 regions linked by disulphide bonds. The N terminus of HA1 provides the central strand in the 5-stranded globular foot, while the rest of the HA1 chain makes its way to the 8-stranded globular head. HA2 provides two alpha helices, which form part of the triple-stranded coiled-coil that stabilises the trimer, its C terminus providing the remaining strands of the 5-stranded globular foot. This entry represents the entire haemagglutinin protein (HA0) consisting of both the HA1 and HA2 regions, as found in influenza A and B viruses.; GO: 0046789 host cell surface receptor binding, 0019064 viral envelope fusion with host membrane, 0019031 viral envelope; PDB: 2WR5_A 2IBX_A 2WR0_B 2WR1_C 2XN9_F 2WRF_I 3S11_E 3BT6_A 3SM5_E 2FK0_H ....
Probab=21.05 E-value=8.2e+02 Score=26.12 Aligned_cols=57 Identities=11% Similarity=0.071 Sum_probs=31.6
Q ss_pred hhHHHHHHHHcccCceeecCCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 49 QLKTRVMKICEGFRATLYPCPEAPTDRREMSMGVTTRIEDLNTVIGETQDHRHRVLV 105 (380)
Q Consensus 49 ~~~~~v~ki~~~~~~~~~~~p~~~~~~~~~l~~l~~~i~~l~~~i~~~~~~~~~~~~ 105 (380)
++..+|.+|.+.++-.--.+.....+....++++++++++....+-..+.|+--+++
T Consensus 375 ~it~kvN~iiek~n~~fe~i~~ef~~ve~Ri~~l~~~v~d~~~d~wsynaELlVlle 431 (550)
T PF00509_consen 375 QITKKVNSIIEKMNKQFEQIDKEFNEVEKRIDNLEKKVDDKIADVWSYNAELLVLLE 431 (550)
T ss_dssp HHHHHHHHHHHTTTCEEEECSCSSSTTGHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHhccchhhHHHHHHHHHHHHHHHHHhhhccchhhhcccHHHHHHhc
Confidence 355566666666653322344444555555666777776666655555555544443
No 111
>PF06645 SPC12: Microsomal signal peptidase 12 kDa subunit (SPC12); InterPro: IPR009542 This family consists of several microsomal signal peptidase 12 kDa subunit proteins. Translocation of polypeptide chains across the endoplasmic reticulum (ER) membrane is triggered by signal sequences. Subsequently, signal recognition particle interacts with its membrane receptor and the ribosome-bound nascent chain is targeted to the ER where it is transferred into a protein-conducting channel. At some point, a second signal sequence recognition event takes place in the membrane and translocation of the nascent chain through the membrane occurs. The signal sequence of most secretory and membrane proteins is cleaved off at this stage. Cleavage occurs by the signal peptidase complex (SPC) as soon as the lumenal domain of the translocating polypeptide is large enough to expose its cleavage site to the enzyme. The signal peptidase complex is possibly also involved in proteolytic events in the ER membrane other than the processing of the signal sequence, for example the further digestion of the cleaved signal peptide or the degradation of membrane proteins. Mammalian signal peptidase is as a complex of five different polypeptide chains. This family represents the 12 kDa subunit (SPC12).; GO: 0008233 peptidase activity, 0006465 signal peptide processing, 0005787 signal peptidase complex, 0016021 integral to membrane
Probab=21.04 E-value=68 Score=24.74 Aligned_cols=22 Identities=23% Similarity=0.499 Sum_probs=19.2
Q ss_pred HHHHHHHhHHHHHHHHHhcCCC
Q psy510 268 RYIILLMGLFSMYTGLIYNDDR 289 (380)
Q Consensus 268 r~il~~~Gi~si~~G~lyg~fF 289 (380)
++++.+.++.|.++|+++.++.
T Consensus 14 ~~il~~~~iisfi~Gy~~q~~~ 35 (76)
T PF06645_consen 14 QYILIISAIISFIVGYITQSFS 35 (76)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 3688899999999999999984
No 112
>PF09325 Vps5: Vps5 C terminal like; InterPro: IPR015404 Vps5 is a sorting nexin that functions in membrane trafficking. This is the C-terminal dimerisation domain [].
Probab=20.97 E-value=4e+02 Score=24.23 Aligned_cols=43 Identities=12% Similarity=0.210 Sum_probs=33.0
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 74 DRREMSMGVTTRIEDLNTVIGETQDHRHRVLVAAAKNIKNWFI 116 (380)
Q Consensus 74 ~~~~~l~~l~~~i~~l~~~i~~~~~~~~~~~~~~~~~l~~~~~ 116 (380)
...+.+..++.++++++...+..+++...+.+.+..+++.|..
T Consensus 160 ~~~~k~~~~~~ei~~~~~~~~~~~~~~~~is~~~k~E~~rf~~ 202 (236)
T PF09325_consen 160 NRQDKVEQAENEIEEAERRVEQAKDEFEEISENIKKELERFEK 202 (236)
T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3456677888888888888888888888887777777777653
No 113
>PF08181 DegQ: DegQ (SacQ) family; InterPro: IPR012554 This family consists of the DegQ (formerly sacQ) regulatory peptides. The DegQ family of peptides control the rates of synthesis of a class of both secreted and intracellular degradative enzymes in Bacillus subtilis. DegQ is 46 amino acids long and activates the synthesis of degradative enzymes. The expression of this peptide was shown to be subjected both to catabolite repression and DegS-DegU-mediated control. Thus allowing an increase in the rate of synthesis of degQ under conditions of nitrogen starvation [].
Probab=20.92 E-value=2.8e+02 Score=18.83 Aligned_cols=29 Identities=17% Similarity=0.085 Sum_probs=18.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 78 MSMGVTTRIEDLNTVIGETQDHRHRVLVA 106 (380)
Q Consensus 78 ~l~~l~~~i~~l~~~i~~~~~~~~~~~~~ 106 (380)
.++++++-+=.++.+|.++.+.++.+-..
T Consensus 5 ~ieelkqll~rle~eirett~sl~ninks 33 (46)
T PF08181_consen 5 KIEELKQLLWRLENEIRETTDSLRNINKS 33 (46)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 35566666666777777776666555333
No 114
>PF00038 Filament: Intermediate filament protein; InterPro: IPR016044 Intermediate filaments (IF) [, , ] are proteins which are primordial components of the cytoskeleton and the nuclear envelope. They generally form filamentous structures 8 to 14 nm wide. IF proteins are members of a very large multigene family of proteins which has been subdivided in five major subgroups: Type I: Acidic cytokeratins. Type II: Basic cytokeratins. Type III: Vimentin, desmin, glial fibrillary acidic protein (GFAP), peripherin, and plasticin. Type IV: Neurofilaments L, H and M, alpha-internexin and nestin. Type V: Nuclear lamins A, B1, B2 and C. All IF proteins are structurally similar in that they consist of: a central rod domain comprising some 300 to 350 residues which is arranged in coiled-coiled alpha-helices, with at least two short characteristic interruptions; a N-terminal non-helical domain (head) of variable length; and a C-terminal domain (tail) which is also non-helical, and which shows extreme length variation between different IF proteins. While IF proteins are evolutionary and structurally related, they have limited sequence homologies except in several regions of the rod domain. This entry represents the central rod domain found in IF proteins.; PDB: 3TNU_B 3KLT_D 1GK4_F 3TRT_A 3G1E_A 3UF1_C 1GK6_B 1GK7_A 3TYY_B 3V4W_A ....
Probab=20.71 E-value=6e+02 Score=24.37 Aligned_cols=6 Identities=33% Similarity=0.246 Sum_probs=2.1
Q ss_pred HHHHHH
Q psy510 120 KIKAIY 125 (380)
Q Consensus 120 ~~~~~~ 125 (380)
.+-+.|
T Consensus 297 ~EIatY 302 (312)
T PF00038_consen 297 AEIATY 302 (312)
T ss_dssp HHHHHH
T ss_pred HHHHHH
Confidence 333333
No 115
>cd00584 Prefoldin_alpha Prefoldin alpha subunit; Prefoldin is a hexameric molecular chaperone complex, found in both eukaryotes and archaea, that binds and stabilizes newly synthesized polypeptides allowing them to fold correctly. The complex contains two alpha and four beta subunits, the two subunits being evolutionarily related. In archaea, there is usually only one gene for each subunit while in eukaryotes there two or more paralogous genes encoding each subunit adding heterogeneity to the structure of the hexamer. The structure of the complex consists of a double beta barrel assembly with six protruding coiled-coils.
Probab=20.57 E-value=4.5e+02 Score=21.75 Aligned_cols=49 Identities=18% Similarity=0.159 Sum_probs=28.6
Q ss_pred HHHHcccCceeecCCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 55 MKICEGFRATLYPCPEAPTDRREMSMGVTTRIEDLNTVIGETQDHRHRVLVAA 107 (380)
Q Consensus 55 ~ki~~~~~~~~~~~p~~~~~~~~~l~~l~~~i~~l~~~i~~~~~~~~~~~~~~ 107 (380)
.++|-.+|+..+- ..+..++.+-++++++.+++.++++++.+.++.+.+
T Consensus 69 ~~v~v~iG~g~~v----E~~~~eA~~~l~~r~~~l~~~~~~l~~~l~~l~~~~ 117 (129)
T cd00584 69 DKVLVDLGTGYYV----EKDLEEAIEFLDKKIEELTKQIEKLQKELAKLKDQI 117 (129)
T ss_pred CEEEEEcCCCEEE----EecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4455455554431 234566667777777777777777666665554443
No 116
>PF11221 Med21: Subunit 21 of Mediator complex; InterPro: IPR021384 The Mediator complex is a coactivator involved in the regulated transcription of nearly all RNA polymerase II-dependent genes. Mediator functions as a bridge to convey information from gene-specific regulatory proteins to the basal RNA polymerase II transcription machinery. The Mediator complex, having a compact conformation in its free form, is recruited to promoters by direct interactions with regulatory proteins and serves for the assembly of a functional preinitiation complex with RNA polymerase II and the general transcription factors. On recruitment the Mediator complex unfolds to an extended conformation and partially surrounds RNA polymerase II, specifically interacting with the unphosphorylated form of the C-terminal domain (CTD) of RNA polymerase II. The Mediator complex dissociates from the RNA polymerase II holoenzyme and stays at the promoter when transcriptional elongation begins. The Mediator complex is composed of at least 31 subunits: MED1, MED4, MED6, MED7, MED8, MED9, MED10, MED11, MED12, MED13, MED13L, MED14, MED15, MED16, MED17, MED18, MED19, MED20, MED21, MED22, MED23, MED24, MED25, MED26, MED27, MED29, MED30, MED31, CCNC, CDK8 and CDC2L6/CDK11. The subunits form at least three structurally distinct submodules. The head and the middle modules interact directly with RNA polymerase II, whereas the elongated tail module interacts with gene-specific regulatory proteins. Mediator containing the CDK8 module is less active than Mediator lacking this module in supporting transcriptional activation. The head module contains: MED6, MED8, MED11, SRB4/MED17, SRB5/MED18, ROX3/MED19, SRB2/MED20 and SRB6/MED22. The middle module contains: MED1, MED4, NUT1/MED5, MED7, CSE2/MED9, NUT2/MED10, SRB7/MED21 and SOH1/MED31. CSE2/MED9 interacts directly with MED4. The tail module contains: MED2, PGD1/MED3, RGR1/MED14, GAL11/MED15 and SIN4/MED16. The CDK8 module contains: MED12, MED13, CCNC and CDK8. Individual preparations of the Mediator complex lacking one or more distinct subunits have been variously termed ARC, CRSP, DRIP, PC2, SMCC and TRAP. Med21 has been known as Srb7 in yeasts, hSrb7 in humans and Trap 19 in Drosophila. The heterodimer of the two subunits Med7 and Med21 appears to act as a hinge between the middle and the tail regions of Mediator []. ; PDB: 1YKE_B 1YKH_B.
Probab=20.53 E-value=3.8e+02 Score=23.04 Aligned_cols=36 Identities=17% Similarity=0.253 Sum_probs=15.2
Q ss_pred cCCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 67 PCPEAPTDRREMSMGVTTRIEDLNTVIGETQDHRHRVLVA 106 (380)
Q Consensus 67 ~~p~~~~~~~~~l~~l~~~i~~l~~~i~~~~~~~~~~~~~ 106 (380)
.+|+...+..+ -.++|.+|+++.++..+++.+.+++
T Consensus 91 sLPg~~~see~----Q~~~i~~L~~E~~~~~~el~~~v~e 126 (144)
T PF11221_consen 91 SLPGIEVSEEE----QLKRIKELEEENEEAEEELQEAVKE 126 (144)
T ss_dssp HSTTSSS-HHH----HHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred hCCCCCCCHHH----HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 46764433322 2234444444444444444444333
No 117
>PF07895 DUF1673: Protein of unknown function (DUF1673); InterPro: IPR012874 This family contains hypothetical proteins of unknown function found in Methanosarcina acetivorans and Methanosarcina mazei.
Probab=20.48 E-value=3.3e+02 Score=25.04 Aligned_cols=16 Identities=13% Similarity=0.231 Sum_probs=9.7
Q ss_pred EEeccccCHHHHHHHH
Q psy510 143 ECWVPVLDIDAIQLAL 158 (380)
Q Consensus 143 ~GWvP~~~~~~l~~~l 158 (380)
-||||-...-+-++.+
T Consensus 11 MGWCPnaka~e~r~~~ 26 (205)
T PF07895_consen 11 MGWCPNAKALETRRRI 26 (205)
T ss_pred hCcCcCcCcccccccc
Confidence 5999977443334433
No 118
>PF04508 Pox_A_type_inc: Viral A-type inclusion protein repeat ; InterPro: IPR007596 The repeat is found in the A-type inclusion protein of the Poxvirus family [].; GO: 0016032 viral reproduction
Probab=20.45 E-value=1.5e+02 Score=17.63 Aligned_cols=15 Identities=27% Similarity=0.421 Sum_probs=7.5
Q ss_pred HHHHHHHHHHHHHHH
Q psy510 80 MGVTTRIEDLNTVIG 94 (380)
Q Consensus 80 ~~l~~~i~~l~~~i~ 94 (380)
+.++.+|.+|+.++.
T Consensus 4 ~rlr~rI~dLer~L~ 18 (23)
T PF04508_consen 4 NRLRNRISDLERQLS 18 (23)
T ss_pred HHHHHHHHHHHHHHH
Confidence 445555555555443
No 119
>PF11119 DUF2633: Protein of unknown function (DUF2633); InterPro: IPR022576 This family is conserved largely in Proteobacteria. Several members are named as YfgG. The function is not known.
Probab=20.37 E-value=80 Score=23.30 Aligned_cols=26 Identities=23% Similarity=0.525 Sum_probs=17.1
Q ss_pred chhhhHHHHHHH-HHhhhhhHHHHHHHHHHHHHHhh
Q psy510 214 PYTIITFPFLFA-VMFGDLGHGAIMFLFAAWMVMKE 248 (380)
Q Consensus 214 ~~~aitFp~fFG-~MfGD~GyGlll~l~~~~l~~~~ 248 (380)
.++.|+|.+||| ++ ...+|.|-....
T Consensus 12 iVLLISfiIlfgRl~---------Y~~I~a~~hHq~ 38 (59)
T PF11119_consen 12 IVLLISFIILFGRLI---------YSAIGAWVHHQD 38 (59)
T ss_pred HHHHHHHHHHHHHHH---------HHHHhHHHHHHH
Confidence 467789999998 44 445566655443
No 120
>TIGR02005 PTS-IIBC-alpha PTS system, alpha-glucoside-specific IIBC component. This model represents a family of fused PTS enzyme II B and C domains. A gene from Clostridium has been partially characterized as a maltose transporter, while genes from Fusobacterium and Klebsiella have been proposed to transport the five non-standard isomers of sucrose.
Probab=20.36 E-value=1.4e+02 Score=31.71 Aligned_cols=49 Identities=10% Similarity=0.044 Sum_probs=31.4
Q ss_pred HHHHHHHHHHHHHHhhccccccccchhHHHhhhhhHHHHHHHhHHHHHHHHHh
Q psy510 233 HGAIMFLFAAWMVMKEKPLMTLKTDNEIWKIFFGGRYIILLMGLFSMYTGLIY 285 (380)
Q Consensus 233 yGlll~l~~~~l~~~~~k~~~~~~~~~~~~~~f~~r~il~~~Gi~si~~G~ly 285 (380)
=|+++-+++.|+..|..+.+ ..+.+..|-+.|.+-+++.+.+++.|+++
T Consensus 141 gGIi~G~i~a~l~Nkf~~ik----LP~~L~FF~G~RfVpIi~~~~~~~l~~~~ 189 (524)
T TIGR02005 141 GAIIISGIITYIHNRFFDKR----LPVFLGIFQGTTFVVTIAFFVMLPCAAIT 189 (524)
T ss_pred HHHHHHHHHHHHHHHHhccc----cchhhhhcCCCcchHhHHHHHHHHHHHHH
Confidence 37888899999988876652 44556644355556666666666555544
No 121
>PRK14144 heat shock protein GrpE; Provisional
Probab=20.24 E-value=3.6e+02 Score=24.89 Aligned_cols=23 Identities=4% Similarity=-0.025 Sum_probs=12.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Q psy510 79 SMGVTTRIEDLNTVIGETQDHRH 101 (380)
Q Consensus 79 l~~l~~~i~~l~~~i~~~~~~~~ 101 (380)
...++++++++++++.+.++++.
T Consensus 47 ~~~l~~~i~~le~e~~elkdk~l 69 (199)
T PRK14144 47 YTALEEQLTLAEQKAHENWEKSV 69 (199)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 34555566666666655544443
No 122
>PRK11677 hypothetical protein; Provisional
Probab=20.03 E-value=5.5e+02 Score=22.10 Aligned_cols=47 Identities=11% Similarity=0.092 Sum_probs=24.7
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy510 74 DRREMSMGVTTRIEDLNTVIGETQDHRHRVLVAAAKNIKNWFIKVRK 120 (380)
Q Consensus 74 ~~~~~l~~l~~~i~~l~~~i~~~~~~~~~~~~~~~~~l~~~~~~~~~ 120 (380)
...+.+++.+.++++.+.++.+==.+-.+++....+....+...+.+
T Consensus 33 ~le~eLe~~k~ele~YkqeV~~HFa~TA~Ll~~L~~~Y~~Ly~HlA~ 79 (134)
T PRK11677 33 ALQYELEKNKAELEEYRQELVSHFARSAELLDTMAKDYRQLYQHMAK 79 (134)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34445555555555555544444344455555555555555555544
Done!