Query 048021
Match_columns 143
No_of_seqs 182 out of 1120
Neff 5.0
Searched_HMMs 46136
Date Fri Mar 29 05:32:19 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/048021.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/048021hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1603 Copper chaperone [Inor 99.7 1.8E-16 3.9E-21 108.2 8.8 68 8-75 2-72 (73)
2 PF00403 HMA: Heavy-metal-asso 99.6 1E-14 2.2E-19 94.8 7.0 56 14-69 1-62 (62)
3 COG2608 CopZ Copper chaperone 99.5 1.1E-13 2.5E-18 94.1 7.3 63 11-73 2-70 (71)
4 KOG4656 Copper chaperone for s 99.4 3.6E-13 7.7E-18 110.1 8.5 71 8-78 4-76 (247)
5 PLN02957 copper, zinc superoxi 98.9 1.7E-08 3.8E-13 82.5 9.8 71 9-79 4-76 (238)
6 PRK10671 copA copper exporting 98.7 5.8E-08 1.3E-12 90.9 7.4 62 11-74 3-67 (834)
7 TIGR00003 copper ion binding p 98.0 6.7E-05 1.5E-09 44.0 8.0 60 11-70 2-67 (68)
8 COG2217 ZntA Cation transport 97.7 0.00011 2.4E-09 69.0 7.0 61 11-72 2-69 (713)
9 KOG0207 Cation transport ATPas 97.6 0.00015 3.2E-09 69.7 7.4 76 6-81 64-145 (951)
10 PRK10671 copA copper exporting 97.4 0.00051 1.1E-08 64.7 7.5 62 12-73 100-164 (834)
11 KOG0207 Cation transport ATPas 97.2 0.00057 1.2E-08 65.7 6.0 61 18-78 2-66 (951)
12 PRK11033 zntA zinc/cadmium/mer 96.7 0.005 1.1E-07 57.8 7.3 64 10-73 52-119 (741)
13 TIGR02052 MerP mercuric transp 94.8 0.38 8.2E-06 31.0 8.1 62 11-72 23-90 (92)
14 PF05046 Img2: Mitochondrial l 92.5 0.67 1.5E-05 32.7 6.5 60 8-68 25-87 (87)
15 PF14492 EFG_II: Elongation Fa 92.3 0.73 1.6E-05 31.2 6.4 60 12-71 5-70 (75)
16 PF02680 DUF211: Uncharacteriz 91.9 0.99 2.1E-05 33.0 6.9 62 9-70 3-74 (95)
17 PRK13748 putative mercuric red 90.8 1.3 2.8E-05 39.5 8.0 67 14-80 3-74 (561)
18 COG1888 Uncharacterized protei 90.1 0.56 1.2E-05 34.3 4.2 61 10-70 5-76 (97)
19 PF04972 BON: BON domain; Int 89.7 0.26 5.7E-06 31.5 2.0 31 26-56 2-35 (64)
20 cd00371 HMA Heavy-metal-associ 89.4 1.8 4E-05 22.2 6.4 49 18-66 6-57 (63)
21 PF01883 DUF59: Domain of unkn 88.9 0.79 1.7E-05 30.2 3.9 32 11-42 34-72 (72)
22 PF03927 NapD: NapD protein; 87.7 2.4 5.2E-05 29.3 5.9 44 23-66 15-59 (79)
23 PF13732 DUF4162: Domain of un 84.5 3.6 7.8E-05 27.4 5.4 45 32-77 26-73 (84)
24 PRK11023 outer membrane lipopr 83.9 3.1 6.6E-05 33.0 5.5 41 24-64 128-171 (191)
25 cd04888 ACT_PheB-BS C-terminal 83.3 3.1 6.8E-05 26.7 4.6 33 10-42 40-74 (76)
26 COG1094 Predicted RNA-binding 83.1 2.7 5.8E-05 34.3 4.9 39 25-63 26-67 (194)
27 cd04908 ACT_Bt0572_1 N-termina 81.7 11 0.00024 24.1 7.1 56 15-72 4-66 (66)
28 PRK10553 assembly protein for 81.3 5.8 0.00013 28.2 5.6 44 23-66 17-65 (87)
29 PRK11198 LysM domain/BON super 80.4 3.3 7.2E-05 31.5 4.4 44 22-66 25-71 (147)
30 KOG4034 Uncharacterized conser 79.0 3.7 7.9E-05 32.8 4.3 58 10-68 109-169 (169)
31 PF14437 MafB19-deam: MafB19-l 77.7 5.3 0.00011 31.2 4.8 41 10-51 99-142 (146)
32 cd04883 ACT_AcuB C-terminal AC 74.3 19 0.00042 22.7 7.3 57 14-71 3-69 (72)
33 PF13192 Thioredoxin_3: Thiore 72.0 14 0.00029 24.5 5.2 14 12-26 2-15 (76)
34 smart00838 EFG_C Elongation fa 71.8 20 0.00044 24.2 6.1 45 16-60 7-52 (85)
35 cd04097 mtEFG1_C mtEFG1_C: C-t 70.3 23 0.0005 23.6 6.0 45 16-60 5-50 (78)
36 COG3643 Glutamate formiminotra 70.1 4.5 9.8E-05 34.6 3.0 58 20-77 15-77 (302)
37 PRK10568 periplasmic protein; 69.5 9.4 0.0002 30.5 4.6 37 20-56 57-96 (203)
38 cd03711 Tet_C Tet_C: C-terminu 69.2 25 0.00054 23.4 6.0 45 16-60 5-50 (78)
39 PF08002 DUF1697: Protein of u 69.0 21 0.00046 26.9 6.2 50 25-75 21-77 (137)
40 cd03713 EFG_mtEFG_C EFG_mtEFG_ 68.8 27 0.00058 23.1 6.1 46 15-60 4-50 (78)
41 cd02966 TlpA_like_family TlpA- 68.1 30 0.00065 22.4 6.6 54 12-74 22-83 (116)
42 PF09580 Spore_YhcN_YlaJ: Spor 67.4 22 0.00047 27.1 6.1 45 22-66 74-128 (177)
43 PRK11023 outer membrane lipopr 66.7 12 0.00025 29.7 4.5 46 21-66 47-96 (191)
44 cd03710 BipA_TypA_C BipA_TypA_ 66.2 29 0.00062 23.2 5.8 59 11-77 4-63 (79)
45 PF04468 PSP1: PSP1 C-terminal 65.8 21 0.00045 25.0 5.2 53 22-74 29-87 (88)
46 COG2151 PaaD Predicted metal-s 65.8 12 0.00026 27.8 4.2 32 12-43 50-88 (111)
47 TIGR03406 FeS_long_SufT probab 65.1 10 0.00023 30.0 4.0 33 12-44 114-153 (174)
48 PF07837 FTCD_N: Formiminotran 64.3 34 0.00073 27.5 6.7 64 13-76 5-74 (178)
49 cd04098 eEF2_C_snRNP eEF2_C_sn 64.2 32 0.0007 23.2 5.8 59 15-78 4-65 (80)
50 smart00749 BON bacterial OsmY 63.6 21 0.00047 21.4 4.5 34 26-59 2-38 (62)
51 PF01206 TusA: Sulfurtransfera 63.5 28 0.0006 22.5 5.2 47 19-73 8-57 (70)
52 COG3062 NapD Uncharacterized p 62.6 23 0.00049 25.9 5.0 45 22-66 17-62 (94)
53 PF15643 Tox-PL-2: Papain fold 61.7 7.8 0.00017 28.6 2.5 19 19-38 19-37 (100)
54 PF00013 KH_1: KH domain syndr 61.6 18 0.00038 22.6 3.9 37 26-65 20-58 (60)
55 PRK13763 putative RNA-processi 60.7 35 0.00075 26.8 6.2 39 26-64 22-62 (180)
56 KOG3411 40S ribosomal protein 60.2 9.8 0.00021 29.6 2.9 42 22-66 97-140 (143)
57 PF03958 Secretin_N: Bacterial 60.2 20 0.00043 23.5 4.1 28 38-65 46-74 (82)
58 PRK11200 grxA glutaredoxin 1; 60.1 23 0.00049 23.6 4.4 32 14-46 4-40 (85)
59 cd03419 GRX_GRXh_1_2_like Glut 59.8 43 0.00094 21.4 6.6 18 18-36 7-24 (82)
60 TIGR03665 arCOG04150 arCOG0415 59.8 20 0.00044 27.9 4.7 37 27-63 18-55 (172)
61 cd02410 archeal_CPSF_KH The ar 59.8 29 0.00063 27.1 5.5 61 22-82 52-121 (145)
62 cd06167 LabA_like LabA_like pr 59.7 13 0.00028 27.0 3.4 32 46-77 101-133 (149)
63 COG2823 OsmY Predicted peripla 59.4 22 0.00048 28.8 5.0 42 24-65 132-177 (196)
64 PF00679 EFG_C: Elongation fac 59.3 37 0.00079 23.2 5.5 46 11-60 7-54 (89)
65 TIGR02945 SUF_assoc FeS assemb 58.5 16 0.00035 25.4 3.6 34 12-45 38-78 (99)
66 PRK05412 putative nucleotide-b 57.8 50 0.0011 26.3 6.6 60 12-72 91-154 (161)
67 PF13291 ACT_4: ACT domain; PD 57.5 28 0.00061 22.8 4.5 32 10-41 47-79 (80)
68 PRK10555 aminoglycoside/multid 57.4 19 0.0004 35.5 5.0 43 24-66 158-209 (1037)
69 PF01106 NifU: NifU-like domai 56.9 22 0.00049 23.7 3.9 27 12-38 28-62 (68)
70 cd04909 ACT_PDH-BS C-terminal 56.8 47 0.001 20.9 5.6 56 14-70 3-69 (69)
71 cd01514 Elongation_Factor_C El 55.9 56 0.0012 21.5 6.5 58 16-78 5-64 (79)
72 cd02393 PNPase_KH Polynucleoti 55.9 36 0.00078 21.9 4.7 37 27-65 22-58 (61)
73 TIGR02190 GlrX-dom Glutaredoxi 55.6 27 0.00059 23.1 4.2 33 11-45 8-41 (79)
74 PLN03194 putative disease resi 54.6 43 0.00092 27.2 5.8 57 21-81 39-96 (187)
75 PRK09577 multidrug efflux prot 54.4 22 0.00048 34.9 4.9 45 25-69 158-212 (1032)
76 PF06998 DUF1307: Protein of u 54.2 9.1 0.0002 28.3 1.8 38 23-60 49-91 (123)
77 TIGR00288 conserved hypothetic 54.1 18 0.00039 28.5 3.6 35 46-80 107-142 (160)
78 PRK10568 periplasmic protein; 53.5 38 0.00081 27.1 5.4 32 25-56 141-175 (203)
79 TIGR00484 EF-G translation elo 53.4 46 0.001 31.1 6.7 61 12-72 406-474 (689)
80 PF03927 NapD: NapD protein; 53.2 39 0.00085 23.2 4.8 32 11-42 39-71 (79)
81 cd02394 vigilin_like_KH K homo 52.6 28 0.00061 21.8 3.7 28 36-65 29-59 (62)
82 TIGR00915 2A0602 The (Largely 51.8 26 0.00055 34.6 4.9 42 25-66 159-209 (1044)
83 PF04312 DUF460: Protein of un 51.2 51 0.0011 25.6 5.6 40 36-75 39-84 (138)
84 cd04096 eEF2_snRNP_like_C eEF2 51.2 62 0.0013 21.4 5.4 39 22-60 12-52 (80)
85 TIGR01709 typeII_sec_gspL gene 50.7 50 0.0011 28.4 6.0 52 26-77 313-371 (384)
86 PLN02925 4-hydroxy-3-methylbut 50.1 28 0.0006 33.6 4.7 26 15-40 630-666 (733)
87 PRK15127 multidrug efflux syst 50.0 29 0.00063 34.3 5.0 42 25-66 159-209 (1049)
88 PRK11670 antiporter inner memb 49.9 84 0.0018 27.3 7.3 67 12-78 48-147 (369)
89 TIGR02830 spore_III_AG stage I 49.1 23 0.00049 28.6 3.5 30 22-51 60-92 (186)
90 PRK06418 transcription elongat 48.5 98 0.0021 24.5 6.9 66 12-77 5-100 (166)
91 COG0066 LeuD 3-isopropylmalate 48.2 26 0.00057 28.6 3.7 47 24-70 117-168 (191)
92 PF08210 APOBEC_N: APOBEC-like 48.1 29 0.00063 27.6 3.9 65 10-76 73-148 (188)
93 COG1432 Uncharacterized conser 48.0 26 0.00057 27.4 3.6 38 44-81 110-148 (181)
94 PHA01634 hypothetical protein 47.4 20 0.00043 28.2 2.8 11 19-29 98-108 (156)
95 TIGR01305 GMP_reduct_1 guanosi 47.0 94 0.002 27.5 7.2 62 20-81 104-185 (343)
96 PF02662 FlpD: Methyl-viologen 46.9 64 0.0014 23.9 5.4 58 20-81 6-67 (124)
97 cd06482 ACD_HspB10 Alpha cryst 46.3 17 0.00037 25.5 2.1 23 34-56 15-40 (87)
98 KOG2874 rRNA processing protei 46.2 16 0.00035 32.0 2.3 31 22-53 72-102 (356)
99 smart00653 eIF2B_5 domain pres 46.0 44 0.00095 24.6 4.3 38 28-66 38-75 (110)
100 COG0602 NrdG Organic radical a 45.9 24 0.00052 28.5 3.2 30 45-74 71-106 (212)
101 PF10262 Rdx: Rdx family; Int 45.3 67 0.0015 21.4 4.9 12 55-66 64-75 (76)
102 cd03709 lepA_C lepA_C: This fa 45.3 92 0.002 20.9 6.1 48 11-61 4-52 (80)
103 PF02983 Pro_Al_protease: Alph 45.3 68 0.0015 21.0 4.8 19 36-54 23-42 (62)
104 PRK12596 putative monovalent c 45.3 1.5E+02 0.0032 23.2 7.6 56 10-65 90-148 (171)
105 COG1666 Uncharacterized protei 45.2 45 0.00097 26.6 4.5 45 21-66 105-151 (165)
106 PF14535 AMP-binding_C_2: AMP- 45.1 84 0.0018 21.7 5.5 54 26-79 7-81 (96)
107 TIGR01676 GLDHase galactonolac 45.0 29 0.00063 32.2 3.9 32 40-71 117-149 (541)
108 PF08777 RRM_3: RNA binding mo 44.6 89 0.0019 22.5 5.7 54 13-66 2-59 (105)
109 PRK14812 hypothetical protein; 44.6 59 0.0013 24.4 4.9 31 23-53 44-74 (119)
110 PRK07334 threonine dehydratase 44.6 80 0.0017 27.5 6.5 62 12-73 326-402 (403)
111 PF00712 DNA_pol3_beta: DNA po 44.1 38 0.00083 24.2 3.8 29 25-53 72-101 (120)
112 PHA02757 hypothetical protein; 43.9 1.1E+02 0.0023 21.3 6.7 47 5-52 9-62 (75)
113 PF01963 TraB: TraB family; I 43.6 51 0.0011 26.0 4.7 39 24-72 216-258 (259)
114 PF03646 FlaG: FlaG protein; 43.6 74 0.0016 22.4 5.1 42 25-66 43-95 (107)
115 cd04886 ACT_ThrD-II-like C-ter 43.6 56 0.0012 19.8 4.1 50 22-71 9-72 (73)
116 cd06475 ACD_HspB1_like Alpha c 43.6 22 0.00049 24.5 2.4 22 34-55 17-41 (86)
117 PRK10560 hofQ outer membrane p 43.3 78 0.0017 27.5 6.2 46 26-71 8-57 (386)
118 PRK00007 elongation factor G; 43.3 93 0.002 29.3 7.0 58 15-72 411-476 (693)
119 cd04901 ACT_3PGDH C-terminal A 43.2 50 0.0011 20.5 3.8 47 19-66 7-59 (69)
120 PRK06027 purU formyltetrahydro 43.2 1.3E+02 0.0027 25.4 7.2 65 11-75 48-122 (286)
121 TIGR00914 2A0601 heavy metal e 43.1 37 0.00079 33.4 4.5 60 12-71 683-760 (1051)
122 PF04461 DUF520: Protein of un 42.9 53 0.0011 26.1 4.6 52 19-71 99-153 (160)
123 TIGR00503 prfC peptide chain r 42.4 1E+02 0.0022 28.3 7.0 62 12-73 394-462 (527)
124 PF01936 NYN: NYN domain; Int 42.4 23 0.00049 25.2 2.3 30 46-75 97-127 (146)
125 PF00462 Glutaredoxin: Glutare 42.3 81 0.0018 19.3 4.6 27 19-47 7-34 (60)
126 PRK10743 heat shock protein Ib 41.8 21 0.00045 27.1 2.1 22 34-55 52-76 (137)
127 cd06477 ACD_HspB3_Like Alpha c 41.6 23 0.0005 24.6 2.2 24 34-57 14-40 (83)
128 KOG0125 Ataxin 2-binding prote 41.3 72 0.0016 28.5 5.6 50 27-76 112-172 (376)
129 PF01565 FAD_binding_4: FAD bi 41.2 32 0.0007 24.6 3.0 32 40-71 57-89 (139)
130 cd06497 ACD_alphaA-crystallin_ 41.1 25 0.00053 24.3 2.2 22 34-55 17-41 (86)
131 PF01849 NAC: NAC domain; Int 41.1 32 0.0007 22.1 2.7 28 26-53 2-35 (58)
132 PRK11597 heat shock chaperone 41.0 22 0.00048 27.2 2.2 23 33-55 49-74 (142)
133 PRK00741 prfC peptide chain re 40.9 1.1E+02 0.0024 28.0 7.0 63 12-74 393-462 (526)
134 PF00873 ACR_tran: AcrB/AcrD/A 40.4 57 0.0012 31.8 5.3 54 20-73 684-751 (1021)
135 cd06471 ACD_LpsHSP_like Group 40.3 26 0.00057 23.8 2.3 23 34-56 17-42 (93)
136 cd06476 ACD_HspB2_like Alpha c 40.3 25 0.00054 24.2 2.2 23 34-56 14-39 (83)
137 TIGR02024 FtcD glutamate formi 40.2 31 0.00067 29.9 3.1 65 12-76 6-76 (298)
138 COG0694 Thioredoxin-like prote 40.1 58 0.0013 23.5 4.1 33 12-44 44-85 (93)
139 PRK12740 elongation factor G; 39.9 88 0.0019 29.0 6.2 59 13-71 388-454 (668)
140 PRK12739 elongation factor G; 39.7 98 0.0021 29.1 6.6 57 15-71 408-472 (691)
141 cd06478 ACD_HspB4-5-6 Alpha-cr 39.4 29 0.00063 23.7 2.4 22 34-55 14-38 (83)
142 PF01193 RNA_pol_L: RNA polyme 39.4 60 0.0013 21.1 3.8 44 22-65 10-59 (66)
143 PHA03075 glutaredoxin-like pro 39.4 90 0.0019 23.8 5.1 55 20-75 12-75 (123)
144 TIGR00490 aEF-2 translation el 39.1 1.8E+02 0.004 27.5 8.3 62 11-72 389-458 (720)
145 cd00105 KH-I K homology RNA-bi 39.1 78 0.0017 19.4 4.2 35 29-65 22-61 (64)
146 cd06481 ACD_HspB9_like Alpha c 38.9 27 0.00059 24.1 2.2 22 34-55 14-38 (87)
147 KOG4730 D-arabinono-1, 4-lacto 38.6 28 0.0006 32.3 2.7 35 40-74 105-140 (518)
148 cd06479 ACD_HspB7_like Alpha c 38.5 28 0.00061 24.0 2.2 22 34-55 15-39 (81)
149 PF09941 DUF2173: Uncharacteri 38.5 62 0.0014 24.0 4.1 32 30-61 4-37 (108)
150 TIGR03675 arCOG00543 arCOG0054 38.4 78 0.0017 29.8 5.7 71 12-82 55-138 (630)
151 cd06470 ACD_IbpA-B_like Alpha- 38.3 29 0.00063 23.9 2.2 24 34-57 18-44 (90)
152 PF08712 Nfu_N: Scaffold prote 38.2 56 0.0012 22.8 3.7 39 26-66 37-78 (87)
153 PRK05301 pyrroloquinoline quin 37.8 63 0.0014 27.5 4.7 23 4-26 9-35 (378)
154 PRK13010 purU formyltetrahydro 37.8 1.7E+02 0.0038 24.7 7.3 55 21-75 63-126 (289)
155 PRK13351 elongation factor G; 37.5 1.2E+02 0.0025 28.4 6.7 58 14-71 406-471 (687)
156 TIGR00411 redox_disulf_1 small 37.4 67 0.0015 20.4 3.8 8 19-26 9-16 (82)
157 TIGR03028 EpsE polysaccharide 37.4 71 0.0015 25.9 4.7 56 14-69 39-112 (239)
158 cd06498 ACD_alphaB-crystallin_ 37.3 31 0.00067 23.8 2.2 22 34-55 14-38 (84)
159 PF14424 Toxin-deaminase: The 37.2 62 0.0013 24.5 4.1 32 11-42 97-129 (133)
160 PRK01641 leuD isopropylmalate 37.1 57 0.0012 26.5 4.1 44 28-71 129-176 (200)
161 PF00578 AhpC-TSA: AhpC/TSA fa 37.0 1.1E+02 0.0024 20.8 5.1 55 12-75 28-89 (124)
162 COG0071 IbpA Molecular chapero 36.9 29 0.00063 26.0 2.2 25 33-57 56-83 (146)
163 TIGR02189 GlrX-like_plant Glut 36.9 83 0.0018 22.0 4.5 48 13-69 10-60 (99)
164 PRK08868 flagellar protein Fla 36.6 78 0.0017 24.7 4.6 29 38-66 91-129 (144)
165 PRK10509 bacterioferritin-asso 36.3 18 0.00038 24.1 0.9 22 14-35 32-53 (64)
166 PF00698 Acyl_transf_1: Acyl t 36.3 96 0.0021 25.7 5.4 53 26-78 136-190 (318)
167 COG2177 FtsX Cell division pro 36.0 54 0.0012 28.1 4.0 31 11-42 61-92 (297)
168 cd06526 metazoan_ACD Alpha-cry 36.0 30 0.00065 23.3 2.0 22 34-55 14-38 (83)
169 PRK10553 assembly protein for 35.9 89 0.0019 22.1 4.5 32 10-41 41-73 (87)
170 TIGR03027 pepcterm_export puta 35.8 79 0.0017 24.0 4.5 56 14-69 38-112 (165)
171 PRK11282 glcE glycolate oxidas 35.7 48 0.001 28.9 3.6 36 34-69 42-80 (352)
172 PF07867 DUF1654: Protein of u 35.7 1.4E+02 0.0031 20.7 5.3 39 12-52 27-66 (73)
173 COG0695 GrxC Glutaredoxin and 35.5 67 0.0014 21.7 3.7 38 19-66 9-47 (80)
174 cd01790 Herp_N Homocysteine-re 35.4 1.5E+02 0.0032 20.6 5.4 57 8-66 9-67 (79)
175 PF12685 SpoIIIAH: SpoIIIAH-li 35.4 73 0.0016 25.1 4.4 48 26-74 138-194 (196)
176 PLN02432 putative pectinestera 35.1 90 0.0019 26.8 5.2 32 5-36 3-34 (293)
177 KOG1653 Single-stranded DNA-bi 35.0 49 0.0011 26.7 3.3 38 42-79 51-88 (175)
178 TIGR01617 arsC_related transcr 34.7 68 0.0015 22.9 3.8 45 19-71 7-53 (117)
179 PRK10329 glutaredoxin-like pro 34.7 98 0.0021 20.9 4.4 26 19-46 9-35 (81)
180 PF04459 DUF512: Protein of un 34.7 1.2E+02 0.0026 24.6 5.6 37 30-66 117-158 (204)
181 TIGR02886 spore_II_AA anti-sig 34.6 1.5E+02 0.0032 20.1 5.9 66 11-77 8-81 (106)
182 PF12229 PG_binding_4: Putativ 34.4 1.3E+02 0.0029 20.2 5.1 31 36-66 69-104 (114)
183 PRK14023 homoaconitate hydrata 34.3 57 0.0012 25.7 3.6 33 36-68 112-150 (166)
184 cd04885 ACT_ThrD-I Tandem C-te 34.2 1.3E+02 0.0027 19.3 5.4 54 16-71 2-67 (68)
185 PF02120 Flg_hook: Flagellar h 34.0 1.4E+02 0.003 19.6 5.3 44 37-80 26-81 (85)
186 COG2092 EFB1 Translation elong 33.8 76 0.0017 22.9 3.8 30 14-43 52-83 (88)
187 cd06472 ACD_ScHsp26_like Alpha 33.7 43 0.00093 22.9 2.5 23 34-56 16-42 (92)
188 PRK07560 elongation factor EF- 33.5 2E+02 0.0044 27.2 7.7 61 12-72 390-458 (731)
189 PF14611 SLS: Mitochondrial in 33.5 87 0.0019 24.5 4.5 41 36-76 55-99 (210)
190 TIGR02898 spore_YhcN_YlaJ spor 33.5 1.3E+02 0.0028 23.5 5.5 30 23-52 54-84 (158)
191 PRK10614 multidrug efflux syst 33.1 64 0.0014 31.8 4.4 60 12-71 660-737 (1025)
192 COG1908 FrhD Coenzyme F420-red 32.9 57 0.0012 25.1 3.2 40 40-79 24-66 (132)
193 cd03421 SirA_like_N SirA_like_ 32.8 1.3E+02 0.0029 19.1 6.0 47 19-74 7-56 (67)
194 COG1334 FlaG Uncharacterized f 32.6 1.2E+02 0.0026 23.0 4.9 43 24-66 54-107 (120)
195 TIGR00377 ant_ant_sig anti-ant 32.5 1.6E+02 0.0034 19.8 6.3 64 12-77 13-85 (108)
196 cd04877 ACT_TyrR N-terminal AC 32.4 1E+02 0.0022 20.0 4.1 29 13-41 39-68 (74)
197 PF07338 DUF1471: Protein of u 32.0 54 0.0012 21.2 2.6 20 47-66 6-26 (56)
198 TIGR03341 YhgI_GntY IscR-regul 31.9 80 0.0017 25.2 4.1 27 12-38 139-173 (190)
199 TIGR03131 malonate_mdcH malona 31.7 1.5E+02 0.0032 24.1 5.8 52 24-76 130-184 (295)
200 PRK09579 multidrug efflux prot 31.6 92 0.002 30.7 5.2 42 25-66 158-209 (1017)
201 TIGR02194 GlrX_NrdH Glutaredox 31.6 70 0.0015 20.5 3.2 25 19-45 7-32 (72)
202 TIGR02816 pfaB_fam PfaB family 31.6 89 0.0019 29.0 4.9 50 25-75 337-388 (538)
203 PF02863 Arg_repressor_C: Argi 31.3 1.1E+02 0.0024 20.2 4.2 44 21-66 21-69 (70)
204 PF00352 TBP: Transcription fa 31.3 72 0.0016 21.8 3.3 22 45-66 56-77 (86)
205 PF08534 Redoxin: Redoxin; In 31.3 1.6E+02 0.0034 20.9 5.3 53 12-74 31-91 (146)
206 cd04902 ACT_3PGDH-xct C-termin 31.2 97 0.0021 19.2 3.7 10 57-66 52-61 (73)
207 TIGR02159 PA_CoA_Oxy4 phenylac 31.2 75 0.0016 24.4 3.7 31 12-43 26-63 (146)
208 COG0841 AcrB Cation/multidrug 31.0 88 0.0019 31.2 5.0 48 23-70 155-213 (1009)
209 cd06464 ACD_sHsps-like Alpha-c 30.6 46 0.001 21.5 2.2 23 34-56 14-39 (88)
210 PF00873 ACR_tran: AcrB/AcrD/A 30.4 53 0.0011 32.0 3.3 51 21-71 154-215 (1021)
211 PF01058 Oxidored_q6: NADH ubi 30.3 10 0.00022 28.2 -1.2 32 45-76 45-80 (131)
212 TIGR00668 apaH bis(5'-nucleosy 30.2 1.1E+02 0.0025 26.0 5.0 44 22-70 10-59 (279)
213 COG0277 GlcD FAD/FMN-containin 30.1 44 0.00096 28.6 2.5 50 21-71 68-120 (459)
214 cd03035 ArsC_Yffb Arsenate Red 30.1 1.3E+02 0.0029 21.3 4.6 42 18-67 6-49 (105)
215 PRK15078 polysaccharide export 30.0 96 0.0021 27.2 4.6 56 14-69 133-205 (379)
216 PRK07451 translation initiatio 29.9 98 0.0021 23.2 4.0 28 45-73 85-113 (115)
217 PRK10614 multidrug efflux syst 29.9 98 0.0021 30.5 5.1 42 25-66 159-210 (1025)
218 KOG2193 IGF-II mRNA-binding pr 29.8 43 0.00093 31.0 2.4 31 45-75 326-361 (584)
219 TIGR02515 IV_pilus_PilQ type I 29.8 1.5E+02 0.0032 26.0 5.7 74 11-87 98-189 (418)
220 KOG2872 Uroporphyrinogen decar 29.6 58 0.0013 28.8 3.1 40 39-78 279-336 (359)
221 COG5583 Uncharacterized small 29.6 80 0.0017 20.9 3.1 30 22-51 8-37 (54)
222 COG2761 FrnE Predicted dithiol 29.2 58 0.0013 27.1 3.0 35 34-71 181-218 (225)
223 cd03028 GRX_PICOT_like Glutare 29.2 1.8E+02 0.004 19.6 5.3 39 20-70 22-63 (90)
224 PRK08343 secD preprotein trans 29.2 78 0.0017 28.3 4.0 40 25-66 69-112 (417)
225 COG1941 FrhG Coenzyme F420-red 29.1 82 0.0018 26.7 3.8 57 20-76 13-84 (247)
226 PRK07738 flagellar protein Fla 29.0 1.4E+02 0.003 22.4 4.7 28 39-66 67-104 (117)
227 COG2008 GLY1 Threonine aldolas 28.9 99 0.0021 27.3 4.5 47 20-69 256-303 (342)
228 cd03029 GRX_hybridPRX5 Glutare 28.9 1.4E+02 0.003 18.9 4.2 27 18-46 8-35 (72)
229 PF13383 Methyltransf_22: Meth 28.8 1E+02 0.0022 25.5 4.4 47 19-76 173-226 (242)
230 cd04884 ACT_CBS C-terminal ACT 28.5 1.6E+02 0.0035 18.7 6.6 57 15-71 2-70 (72)
231 PF08502 LeuA_dimer: LeuA allo 28.4 75 0.0016 23.3 3.2 38 39-76 40-82 (133)
232 PRK14719 bifunctional RNAse/5- 27.9 1.2E+02 0.0025 26.7 4.8 51 14-71 47-99 (360)
233 PRK10503 multidrug efflux syst 27.8 94 0.002 30.8 4.6 55 12-66 674-743 (1040)
234 cd02977 ArsC_family Arsenate R 27.8 1.5E+02 0.0032 20.5 4.5 42 19-68 7-50 (105)
235 PHA00019 IV phage assembly pro 27.8 1.7E+02 0.0037 25.8 5.9 50 22-71 30-86 (428)
236 PF01873 eIF-5_eIF-2B: Domain 27.7 79 0.0017 23.7 3.2 35 31-66 54-88 (125)
237 PF02107 FlgH: Flagellar L-rin 27.7 41 0.00089 26.5 1.8 29 33-61 109-138 (179)
238 PRK10555 aminoglycoside/multid 27.7 94 0.002 30.7 4.6 52 21-72 694-759 (1037)
239 PRK08965 putative monovalent c 27.6 2.9E+02 0.0062 21.2 8.5 53 12-64 94-149 (162)
240 PRK03988 translation initiatio 27.5 1.2E+02 0.0027 23.1 4.3 28 37-66 70-97 (138)
241 COG0480 FusA Translation elong 27.5 2.8E+02 0.0061 26.7 7.5 63 10-72 404-474 (697)
242 PRK11152 ilvM acetolactate syn 27.5 1.5E+02 0.0032 20.4 4.4 30 12-43 46-76 (76)
243 PF13098 Thioredoxin_2: Thiore 27.5 73 0.0016 21.6 2.9 30 35-64 80-112 (112)
244 cd04887 ACT_MalLac-Enz ACT_Mal 27.5 1.2E+02 0.0026 19.1 3.7 26 13-38 42-67 (74)
245 PRK14054 methionine sulfoxide 27.4 2.2E+02 0.0047 22.6 5.8 30 11-45 4-34 (172)
246 TIGR03143 AhpF_homolog putativ 27.2 3.1E+02 0.0066 24.9 7.5 52 13-64 478-554 (555)
247 TIGR02109 PQQ_syn_pqqE coenzym 27.2 1.4E+02 0.003 25.1 5.0 19 8-26 4-26 (358)
248 COG0106 HisA Phosphoribosylfor 27.1 1.6E+02 0.0036 24.7 5.3 49 26-74 112-167 (241)
249 TIGR01158 SUI1_rel translation 27.1 1.6E+02 0.0035 21.2 4.7 44 27-73 55-99 (101)
250 PRK09019 translation initiatio 27.1 1.5E+02 0.0033 21.9 4.6 43 26-73 61-106 (108)
251 PRK05412 putative nucleotide-b 27.0 1.6E+02 0.0035 23.4 5.0 69 14-82 7-86 (161)
252 cd03418 GRX_GRXb_1_3_like Glut 27.0 1.7E+02 0.0036 18.3 5.5 25 18-44 7-32 (75)
253 COG1774 Uncharacterized homolo 26.8 50 0.0011 28.1 2.2 57 22-78 86-148 (265)
254 cd03027 GRX_DEP Glutaredoxin ( 26.8 1.7E+02 0.0037 18.6 4.4 25 19-45 9-34 (73)
255 cd01570 NAPRTase_A Nicotinate 26.8 1.9E+02 0.0042 24.8 5.9 51 23-80 262-312 (327)
256 TIGR02024 FtcD glutamate formi 26.6 2.4E+02 0.0052 24.5 6.3 55 12-66 182-250 (298)
257 cd07422 MPP_ApaH Escherichia c 26.6 1.4E+02 0.0031 24.7 4.9 44 22-70 8-57 (257)
258 TIGR02520 pilus_B_mal_scr type 26.6 2.4E+02 0.0052 25.6 6.7 61 26-86 209-281 (497)
259 TIGR02515 IV_pilus_PilQ type I 26.6 2E+02 0.0044 25.1 6.1 50 22-71 10-63 (418)
260 KOG1368 Threonine aldolase [Am 26.4 1.2E+02 0.0026 27.1 4.5 47 24-70 283-334 (384)
261 COG1596 Wza Periplasmic protei 26.4 1.6E+02 0.0034 24.1 5.1 53 19-71 93-161 (239)
262 PRK02769 histidine decarboxyla 26.4 1.1E+02 0.0023 26.7 4.2 30 52-81 142-172 (380)
263 smart00827 PKS_AT Acyl transfe 26.2 2.1E+02 0.0045 23.0 5.7 52 25-76 135-189 (298)
264 PRK06824 translation initiatio 26.2 1.3E+02 0.0029 22.6 4.2 45 26-73 71-116 (118)
265 PRK06382 threonine dehydratase 26.2 4E+02 0.0088 23.2 7.8 62 11-72 329-405 (406)
266 COG3133 SlyB Outer membrane li 26.1 86 0.0019 24.7 3.2 45 26-75 100-146 (154)
267 PLN02962 hydroxyacylglutathion 26.0 3.1E+02 0.0067 22.6 6.7 37 40-76 27-68 (251)
268 cd02396 PCBP_like_KH K homolog 25.8 1.7E+02 0.0037 18.6 4.3 39 24-65 18-62 (65)
269 COG3870 Uncharacterized protei 25.7 1.4E+02 0.0029 22.4 4.1 27 50-76 6-33 (109)
270 PRK00166 apaH diadenosine tetr 25.6 1.5E+02 0.0033 24.8 4.9 44 22-70 10-59 (275)
271 TIGR03279 cyano_FeS_chp putati 25.6 4E+02 0.0086 24.3 7.8 41 26-66 335-380 (433)
272 PRK11145 pflA pyruvate formate 25.5 63 0.0014 25.7 2.5 29 47-75 72-107 (246)
273 TIGR02196 GlrX_YruB Glutaredox 25.4 1.6E+02 0.0034 17.7 3.9 19 19-39 8-26 (74)
274 PF06504 RepC: Replication pro 25.4 61 0.0013 27.9 2.5 28 25-52 248-276 (281)
275 cd02066 GRX_family Glutaredoxi 25.4 1.6E+02 0.0034 17.5 4.3 25 18-44 7-32 (72)
276 PRK05096 guanosine 5'-monophos 25.3 2.2E+02 0.0049 25.2 6.0 59 21-79 106-184 (346)
277 cd03036 ArsC_like Arsenate Red 25.2 1.9E+02 0.0041 20.5 4.8 42 19-68 7-50 (111)
278 TIGR00915 2A0602 The (Largely 25.2 1.1E+02 0.0024 30.2 4.6 46 21-66 695-751 (1044)
279 PRK06719 precorrin-2 dehydroge 25.1 1.5E+02 0.0033 22.5 4.5 36 40-75 7-44 (157)
280 TIGR01679 bact_FAD_ox FAD-link 25.1 87 0.0019 27.5 3.5 32 40-71 64-96 (419)
281 cd04874 ACT_Af1403 N-terminal 25.0 1.7E+02 0.0036 17.7 4.2 42 25-66 14-62 (72)
282 cd04906 ACT_ThrD-I_1 First of 25.0 2.2E+02 0.0048 19.1 7.7 60 14-74 3-73 (85)
283 TIGR01437 selA_rel uncharacter 24.7 2.5E+02 0.0055 23.8 6.2 48 27-74 274-332 (363)
284 PF03960 ArsC: ArsC family; I 24.7 1.4E+02 0.0031 20.9 4.0 42 20-69 5-48 (110)
285 PRK10503 multidrug efflux syst 24.5 1.4E+02 0.0029 29.7 5.0 41 26-66 169-219 (1040)
286 TIGR00735 hisF imidazoleglycer 24.5 1.6E+02 0.0034 23.9 4.7 30 40-70 8-46 (254)
287 PRK03745 signal recognition pa 24.5 1.6E+02 0.0034 21.6 4.2 41 25-66 34-90 (100)
288 cd04905 ACT_CM-PDT C-terminal 24.4 80 0.0017 20.7 2.5 27 11-37 42-68 (80)
289 cd02973 TRX_GRX_like Thioredox 24.4 1.6E+02 0.0034 18.1 3.8 15 20-35 10-24 (67)
290 TIGR01127 ilvA_1Cterm threonin 24.4 3.4E+02 0.0074 23.2 6.9 62 11-72 304-380 (380)
291 TIGR01514 NAPRTase nicotinate 24.4 1.4E+02 0.003 26.7 4.7 38 41-78 302-341 (394)
292 PF13241 NAD_binding_7: Putati 24.1 1.9E+02 0.0041 20.1 4.5 37 40-76 1-39 (103)
293 PRK10638 glutaredoxin 3; Provi 24.1 1.7E+02 0.0036 19.2 4.1 30 14-45 5-35 (83)
294 PF08478 POTRA_1: POTRA domain 24.0 85 0.0018 19.8 2.5 28 25-52 36-65 (69)
295 PF00011 HSP20: Hsp20/alpha cr 24.0 71 0.0015 21.7 2.2 23 34-56 14-39 (102)
296 PF10934 DUF2634: Protein of u 24.0 1.4E+02 0.0031 21.6 4.0 32 22-53 68-103 (112)
297 PRK08452 flagellar protein Fla 23.9 1.8E+02 0.0038 22.0 4.5 28 39-66 74-111 (124)
298 PF00241 Cofilin_ADF: Cofilin/ 23.9 1.3E+02 0.0029 21.3 3.8 44 23-66 2-51 (127)
299 COG3286 Uncharacterized protei 23.6 66 0.0014 26.5 2.3 23 54-76 88-111 (204)
300 PRK14426 acylphosphatase; Prov 23.6 2.6E+02 0.0057 19.5 7.9 64 11-75 4-75 (92)
301 COG5494 Predicted thioredoxin/ 23.5 2.1E+02 0.0046 24.3 5.3 48 18-66 18-86 (265)
302 COG1084 Predicted GTPase [Gene 23.4 1.2E+02 0.0025 27.0 3.9 47 25-76 151-199 (346)
303 PF00438 S-AdoMet_synt_N: S-ad 23.4 1.6E+02 0.0034 21.5 4.0 50 21-70 19-79 (100)
304 PLN00072 3-isopropylmalate iso 23.3 2.1E+02 0.0046 24.1 5.3 32 36-68 194-233 (246)
305 TIGR00311 aIF-2beta translatio 23.1 1.5E+02 0.0033 22.5 4.1 36 37-74 65-101 (133)
306 TIGR00158 L9 ribosomal protein 23.0 1.7E+02 0.0037 22.4 4.4 41 32-72 69-111 (148)
307 PLN03210 Resistant to P. syrin 22.9 1.7E+02 0.0036 29.2 5.3 76 1-81 1-81 (1153)
308 cd07042 STAS_SulP_like_sulfate 22.8 1.4E+02 0.003 19.7 3.5 68 10-77 8-83 (107)
309 PF14334 DUF4390: Domain of un 22.7 1.8E+02 0.0039 22.2 4.5 33 39-71 10-45 (165)
310 KOG1752 Glutaredoxin and relat 22.7 2.9E+02 0.0063 20.0 5.3 43 18-69 21-66 (104)
311 PRK09579 multidrug efflux prot 22.7 1.3E+02 0.0028 29.7 4.5 53 22-74 671-737 (1017)
312 PF01625 PMSR: Peptide methion 22.6 1.5E+02 0.0032 23.0 4.0 26 12-42 2-28 (155)
313 KOG1372 GDP-mannose 4,6 dehydr 22.6 1.6E+02 0.0035 25.8 4.6 70 14-84 84-165 (376)
314 PF12242 Eno-Rase_NADH_b: NAD( 22.6 1.6E+02 0.0035 20.8 3.8 55 20-76 15-71 (78)
315 PF04359 DUF493: Protein of un 22.5 2.3E+02 0.005 19.2 4.6 35 5-39 48-82 (85)
316 TIGR00489 aEF-1_beta translati 22.5 1.8E+02 0.0039 20.6 4.1 21 23-43 62-83 (88)
317 PRK05528 methionine sulfoxide 22.3 1.1E+02 0.0023 24.0 3.1 26 12-42 3-29 (156)
318 cd04879 ACT_3PGDH-like ACT_3PG 22.3 1E+02 0.0022 18.5 2.5 29 10-41 40-69 (71)
319 PRK12484 nicotinate phosphorib 22.3 2.3E+02 0.0049 25.6 5.6 35 45-79 278-312 (443)
320 TIGR00007 phosphoribosylformim 22.1 3.1E+02 0.0067 21.4 5.8 48 25-72 108-163 (230)
321 PRK13585 1-(5-phosphoribosyl)- 22.0 3.2E+02 0.007 21.5 6.0 48 26-73 113-168 (241)
322 cd03033 ArsC_15kD Arsenate Red 21.9 2.6E+02 0.0057 20.2 5.0 45 16-68 5-51 (113)
323 PRK11439 pphA serine/threonine 21.8 1.5E+02 0.0033 23.4 4.0 41 21-66 25-70 (218)
324 cd08630 PI-PLCc_delta3 Catalyt 21.8 78 0.0017 26.8 2.4 52 24-77 31-97 (258)
325 COG4747 ACT domain-containing 21.8 3.9E+02 0.0084 20.7 6.7 59 11-70 68-133 (142)
326 TIGR02200 GlrX_actino Glutared 21.8 2.1E+02 0.0045 17.7 4.0 18 19-38 8-25 (77)
327 COG1782 Predicted metal-depend 21.6 2.5E+02 0.0053 26.8 5.8 59 24-82 77-144 (637)
328 PRK13763 putative RNA-processi 21.6 1.9E+02 0.0042 22.6 4.5 27 27-55 115-141 (180)
329 TIGR00655 PurU formyltetrahydr 21.6 4.8E+02 0.01 21.9 7.2 53 23-75 55-117 (280)
330 cd01733 LSm10 The eukaryotic S 21.6 1.4E+02 0.0029 20.5 3.2 29 26-55 8-36 (78)
331 PRK11018 hypothetical protein; 21.5 2.6E+02 0.0057 18.7 6.4 52 13-72 9-64 (78)
332 PRK04021 hypothetical protein; 21.4 2.2E+02 0.0047 20.3 4.4 45 20-65 43-91 (92)
333 PRK15127 multidrug efflux syst 21.4 1.4E+02 0.003 29.6 4.4 49 22-70 696-759 (1049)
334 PHA02125 thioredoxin-like prot 21.4 1E+02 0.0022 20.0 2.6 15 20-35 9-23 (75)
335 PF11823 DUF3343: Protein of u 21.3 97 0.0021 20.4 2.4 28 50-77 7-35 (73)
336 PRK10677 modA molybdate transp 21.3 4.5E+02 0.0097 21.3 6.8 58 8-75 24-81 (257)
337 cd03012 TlpA_like_DipZ_like Tl 21.3 3E+02 0.0064 19.2 5.4 53 18-74 32-91 (126)
338 PF04273 DUF442: Putative phos 21.2 1.4E+02 0.0031 21.7 3.5 34 47-80 7-40 (110)
339 COG4771 FepA Outer membrane re 21.1 94 0.002 30.0 3.0 29 25-53 67-96 (699)
340 TIGR00914 2A0601 heavy metal e 21.0 1.7E+02 0.0037 28.9 4.9 40 27-66 170-219 (1051)
341 TIGR01394 TypA_BipA GTP-bindin 20.9 3.3E+02 0.0072 25.4 6.6 67 9-75 301-381 (594)
342 PF04724 Glyco_transf_17: Glyc 20.7 55 0.0012 28.7 1.3 43 20-68 259-302 (356)
343 cd06404 PB1_aPKC PB1 domain is 20.6 2.4E+02 0.0052 20.1 4.4 57 13-72 14-76 (83)
344 TIGR01952 nusA_arch NusA famil 20.6 60 0.0013 24.9 1.4 19 55-73 106-125 (141)
345 PRK13562 acetolactate synthase 20.5 3.2E+02 0.007 19.3 5.4 36 12-47 45-82 (84)
346 PF05193 Peptidase_M16_C: Pept 20.5 96 0.0021 21.8 2.4 21 46-66 19-39 (184)
347 cd08591 PI-PLCc_beta Catalytic 20.4 86 0.0019 26.6 2.4 51 25-77 32-99 (257)
348 cd08593 PI-PLCc_delta Catalyti 20.4 1.6E+02 0.0035 24.9 4.1 54 22-77 29-97 (257)
349 PRK13586 1-(5-phosphoribosyl)- 20.3 3.5E+02 0.0077 21.9 6.0 50 27-76 111-169 (232)
350 PLN02626 malate synthase 20.3 1.3E+02 0.0027 28.4 3.6 43 43-85 87-129 (551)
351 PRK01565 thiamine biosynthesis 20.3 2E+02 0.0042 25.2 4.7 11 28-38 61-71 (394)
352 COG3260 Ni,Fe-hydrogenase III 20.2 90 0.002 24.5 2.3 65 12-76 9-84 (148)
353 PF09186 DUF1949: Domain of un 20.2 2.1E+02 0.0046 17.1 5.7 42 24-66 7-51 (56)
354 PRK12698 flgH flagellar basal 20.1 72 0.0016 26.3 1.9 28 34-61 153-181 (224)
355 COG2761 FrnE Predicted dithiol 20.1 2E+02 0.0044 23.9 4.5 37 9-45 2-44 (225)
No 1
>KOG1603 consensus Copper chaperone [Inorganic ion transport and metabolism]
Probab=99.69 E-value=1.8e-16 Score=108.17 Aligned_cols=68 Identities=35% Similarity=0.464 Sum_probs=62.7
Q ss_pred CCceEEEEEEeecChhhHHHHHHHHhccCCe-eEEEeCCCCEEEEEeecCHHHHHHHHhc-C-CceEEeec
Q 048021 8 SAKMTCGLKVDTNDPAWFSSMTKVLRKIKGA-SYTIDAEEGMAYITGRANPRKLLRKLQS-G-KYANLCWV 75 (143)
Q Consensus 8 s~~~tvvlkV~m~C~gC~~kIkkaL~kI~GV-sV~VDle~~~VtV~G~vdp~~Ll~aI~k-G-k~Ael~~~ 75 (143)
...++++++|+|+|+||+++|+++|..++|| ++.+|.++++|+|.|.+||..|+++|++ | ++++++..
T Consensus 2 ~~~~~~v~kv~~~C~gc~~kV~~~l~~~~GV~~v~id~~~~kvtV~g~~~p~~vl~~l~k~~~k~~~~~~~ 72 (73)
T KOG1603|consen 2 PPIKTVVLKVNMHCEGCARKVKRVLQKLKGVESVDIDIKKQKVTVKGNVDPVKLLKKLKKTGGKRAELWKV 72 (73)
T ss_pred CCccEEEEEECcccccHHHHHHHHhhccCCeEEEEecCCCCEEEEEEecCHHHHHHHHHhcCCCceEEecC
Confidence 3567899999999999999999999999999 7999999999999999999999999999 6 88877654
No 2
>PF00403 HMA: Heavy-metal-associated domain; InterPro: IPR006121 Proteins that transport heavy metals in micro-organisms and mammals share similarities in their sequences and structures. These proteins provide an important focus for research, some being involved in bacterial resistance to toxic metals, such as lead and cadmium, while others are involved in inherited human syndromes, such as Wilson's and Menke's diseases []. A conserved domain has been found in a number of these heavy metal transport or detoxification proteins []. The domain, which has been termed Heavy-Metal-Associated (HMA), contains two conserved cysteines that are probably involved in metal binding. Structure solution of the fourth HMA domain of the Menke's copper transporting ATPase shows a well-defined structure comprising a four-stranded antiparallel beta-sheet and two alpha helices packed in an alpha-beta sandwich fold []. This fold is common to other domains and is classified as "ferredoxin-like".; GO: 0046872 metal ion binding, 0030001 metal ion transport; PDB: 2VOY_A 1P6T_A 1KQK_A 2RML_A 1JWW_A 3K7R_F 1FES_A 1CC8_A 1FD8_A 2GGP_A ....
Probab=99.56 E-value=1e-14 Score=94.82 Aligned_cols=56 Identities=21% Similarity=0.271 Sum_probs=51.9
Q ss_pred EEEE-eecChhhHHHHHHHHhccCCe-eEEEeCCCCEEEEEeec---CHHHHHHHHhc-CCc
Q 048021 14 GLKV-DTNDPAWFSSMTKVLRKIKGA-SYTIDAEEGMAYITGRA---NPRKLLRKLQS-GKY 69 (143)
Q Consensus 14 vlkV-~m~C~gC~~kIkkaL~kI~GV-sV~VDle~~~VtV~G~v---dp~~Ll~aI~k-Gk~ 69 (143)
+|+| +|+|++|+++|+++|.+++|| ++.+|+++++|+|.+.. ++++|+++|++ ||+
T Consensus 1 t~~v~~m~C~~C~~~v~~~l~~~~GV~~v~vd~~~~~v~v~~~~~~~~~~~i~~~i~~~Gy~ 62 (62)
T PF00403_consen 1 TFKVPGMTCEGCAKKVEKALSKLPGVKSVKVDLETKTVTVTYDPDKTSIEKIIEAIEKAGYE 62 (62)
T ss_dssp EEEEESTTSHHHHHHHHHHHHTSTTEEEEEEETTTTEEEEEESTTTSCHHHHHHHHHHTTSE
T ss_pred CEEECCcccHHHHHHHHHHHhcCCCCcEEEEECCCCEEEEEEecCCCCHHHHHHHHHHhCcC
Confidence 6889 699999999999999999999 79999999999999874 56999999999 985
No 3
>COG2608 CopZ Copper chaperone [Inorganic ion transport and metabolism]
Probab=99.49 E-value=1.1e-13 Score=94.12 Aligned_cols=63 Identities=21% Similarity=0.355 Sum_probs=55.4
Q ss_pred eEEEEEEe-ecChhhHHHHHHHHhccCCe-eEEEeCCCCEEEE--Ee-ecCHHHHHHHHhc-CCceEEe
Q 048021 11 MTCGLKVD-TNDPAWFSSMTKVLRKIKGA-SYTIDAEEGMAYI--TG-RANPRKLLRKLQS-GKYANLC 73 (143)
Q Consensus 11 ~tvvlkV~-m~C~gC~~kIkkaL~kI~GV-sV~VDle~~~VtV--~G-~vdp~~Ll~aI~k-Gk~Ael~ 73 (143)
++.+|+|+ |+|++|+++|+++|++++|| +|+||++.++++| ++ .++.++|+++|++ ||+++.+
T Consensus 2 ~~~~l~v~~MtC~~C~~~V~~al~~v~gv~~v~v~l~~~~~~V~~d~~~~~~~~i~~ai~~aGy~~~~~ 70 (71)
T COG2608 2 MKTTLKVEGMTCGHCVKTVEKALEEVDGVASVDVDLEKGTATVTFDSNKVDIEAIIEAIEDAGYKVEEI 70 (71)
T ss_pred ceEEEEECCcCcHHHHHHHHHHHhcCCCeeEEEEEcccCeEEEEEcCCcCCHHHHHHHHHHcCCCeeec
Confidence 56889995 89999999999999999999 7999999955555 45 5899999999999 9998754
No 4
>KOG4656 consensus Copper chaperone for superoxide dismutase [Inorganic ion transport and metabolism]
Probab=99.44 E-value=3.6e-13 Score=110.09 Aligned_cols=71 Identities=15% Similarity=0.248 Sum_probs=67.3
Q ss_pred CCceEEEEEEeecChhhHHHHHHHHhccCCe-eEEEeCCCCEEEEEeecCHHHHHHHHhc-CCceEEeecCCC
Q 048021 8 SAKMTCGLKVDTNDPAWFSSMTKVLRKIKGA-SYTIDAEEGMAYITGRANPRKLLRKLQS-GKYANLCWVSAG 78 (143)
Q Consensus 8 s~~~tvvlkV~m~C~gC~~kIkkaL~kI~GV-sV~VDle~~~VtV~G~vdp~~Ll~aI~k-Gk~Ael~~~~~~ 78 (143)
...-+++|.|.|+|++|++.|++.|+.++|| +|+||++++.|.|.+.+++.+|..+|+. |++|.|.+.+.+
T Consensus 4 d~~~~~efaV~M~cescvnavk~~L~~V~Gi~~vevdle~q~v~v~ts~p~s~i~~~le~tGr~Avl~G~G~p 76 (247)
T KOG4656|consen 4 DDTYEAEFAVQMTCESCVNAVKACLKGVPGINSVEVDLEQQIVSVETSVPPSEIQNTLENTGRDAVLRGAGKP 76 (247)
T ss_pred CCceeEEEEEechhHHHHHHHHHHhccCCCcceEEEEhhhcEEEEEccCChHHHHHHHHhhChheEEecCCch
Confidence 3667889999999999999999999999999 7999999999999999999999999999 999999999765
No 5
>PLN02957 copper, zinc superoxide dismutase
Probab=98.86 E-value=1.7e-08 Score=82.50 Aligned_cols=71 Identities=14% Similarity=0.251 Sum_probs=65.0
Q ss_pred CceEEEEEEeecChhhHHHHHHHHhccCCe-eEEEeCCCCEEEEEeecCHHHHHHHHhc-CCceEEeecCCCC
Q 048021 9 AKMTCGLKVDTNDPAWFSSMTKVLRKIKGA-SYTIDAEEGMAYITGRANPRKLLRKLQS-GKYANLCWVSAGN 79 (143)
Q Consensus 9 ~~~tvvlkV~m~C~gC~~kIkkaL~kI~GV-sV~VDle~~~VtV~G~vdp~~Ll~aI~k-Gk~Ael~~~~~~~ 79 (143)
+.++++|.|.|+|+.|+++|+++|.+++|| .+.+++..++++|.+..++..|+++|++ |+.+++++....+
T Consensus 4 ~~~~~~~~VgMsC~~Ca~~Iek~L~~~~GV~~v~vn~~~~~v~V~~~~~~~~I~~aIe~~Gy~a~~~~~~~~~ 76 (238)
T PLN02957 4 PELLTEFMVDMKCEGCVAAVKNKLETLEGVKAVEVDLSNQVVRVLGSSPVKAMTAALEQTGRKARLIGQGDPE 76 (238)
T ss_pred CcEEEEEEECccCHHHHHHHHHHHhcCCCeEEEEEEcCCCEEEEEecCCHHHHHHHHHHcCCcEEEecCCCcc
Confidence 356778999999999999999999999999 7999999999999988899999999999 9999999987665
No 6
>PRK10671 copA copper exporting ATPase; Provisional
Probab=98.65 E-value=5.8e-08 Score=90.95 Aligned_cols=62 Identities=18% Similarity=0.225 Sum_probs=56.0
Q ss_pred eEEEEEEe-ecChhhHHHHHHHHhccCCe-eEEEeCCCCEEEEEeecCHHHHHHHHhc-CCceEEee
Q 048021 11 MTCGLKVD-TNDPAWFSSMTKVLRKIKGA-SYTIDAEEGMAYITGRANPRKLLRKLQS-GKYANLCW 74 (143)
Q Consensus 11 ~tvvlkV~-m~C~gC~~kIkkaL~kI~GV-sV~VDle~~~VtV~G~vdp~~Ll~aI~k-Gk~Ael~~ 74 (143)
++++|+|+ |+|++|+++|+++|++++|| ++++|++ +++|++..+++.|.++|++ ||++++..
T Consensus 3 ~~~~l~V~gmtC~~C~~~i~~al~~~~gv~~v~v~~~--~~~v~~~~~~~~i~~~i~~~Gy~~~~~~ 67 (834)
T PRK10671 3 QTIDLTLDGLSCGHCVKRVKESLEQRPDVEQADVSIT--EAHVTGTASAEALIETIKQAGYDASVSH 67 (834)
T ss_pred eEEEEEECCcccHHHHHHHHHHHhcCCCcceEEEeee--EEEEEecCCHHHHHHHHHhcCCcccccc
Confidence 67899997 89999999999999999999 7999994 5667778899999999999 99999864
No 7
>TIGR00003 copper ion binding protein. This model describes an apparently copper-specific subfamily of the metal-binding domain HMA (Pfam family pfam00403). Closely related sequences outside this model include mercury resistance proteins and repeated domains of eukaryotic eukaryotic copper transport proteins. Members of this family are strictly prokaryotic. The model identifies both small proteins consisting of just this domain and N-terminal regions of cation (probably copper) transporting ATPases.
Probab=98.02 E-value=6.7e-05 Score=44.04 Aligned_cols=60 Identities=10% Similarity=0.112 Sum_probs=50.0
Q ss_pred eEEEEEEe-ecChhhHHHHHHHHhccCCe-eEEEeCCCCEEEEEe---ecCHHHHHHHHhc-CCce
Q 048021 11 MTCGLKVD-TNDPAWFSSMTKVLRKIKGA-SYTIDAEEGMAYITG---RANPRKLLRKLQS-GKYA 70 (143)
Q Consensus 11 ~tvvlkV~-m~C~gC~~kIkkaL~kI~GV-sV~VDle~~~VtV~G---~vdp~~Ll~aI~k-Gk~A 70 (143)
|+..+.|+ ++|..|+..|++.+...+++ ...+++..+++.|.. ..+...+...+.. |+.+
T Consensus 2 ~~~~~~v~~~~~~~c~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 67 (68)
T TIGR00003 2 QKFTVQVMSMTCQHCVDKIEKFVGELEGVSKVQVKLEKASVKVEFDAPQATEICIAEAILDAGYEV 67 (68)
T ss_pred cEEEEEECCeEcHHHHHHHHHHHhcCCCEEEEEEEcCCCEEEEEeCCCCCCHHHHHHHHHHcCCCc
Confidence 56778896 69999999999999999998 699999999988874 3577888888877 8754
No 8
>COG2217 ZntA Cation transport ATPase [Inorganic ion transport and metabolism]
Probab=97.66 E-value=0.00011 Score=68.96 Aligned_cols=61 Identities=20% Similarity=0.276 Sum_probs=54.5
Q ss_pred eEEEEEEe-ecChhhHHHHHHHHhccCCe-eEEEeCCCCEEEEEee---cC-HHHHHHHHhc-CCceEE
Q 048021 11 MTCGLKVD-TNDPAWFSSMTKVLRKIKGA-SYTIDAEEGMAYITGR---AN-PRKLLRKLQS-GKYANL 72 (143)
Q Consensus 11 ~tvvlkV~-m~C~gC~~kIkkaL~kI~GV-sV~VDle~~~VtV~G~---vd-p~~Ll~aI~k-Gk~Ael 72 (143)
++..|.|. |+|..|+.+|+ +|++++|| ++.+++...++.|... .+ ++.+.+++++ ||.+..
T Consensus 2 ~~~~l~v~Gm~Ca~C~~~ie-~l~~~~gV~~~~vn~~t~~~~v~~~~~~~~~~~~~~~~v~~~gy~~~~ 69 (713)
T COG2217 2 RETSLSVEGMTCAACASRIE-ALNKLPGVEEARVNLATERATVVYDPEEVDLPADIVAAVEKAGYSARL 69 (713)
T ss_pred ceeEEeecCcCcHHHHHHHH-HHhcCCCeeEEEeecccceEEEEecccccccHHHHHHHHHhcCccccc
Confidence 46788997 79999999999 99999999 7999999999999853 45 8999999999 999876
No 9
>KOG0207 consensus Cation transport ATPase [Inorganic ion transport and metabolism]
Probab=97.64 E-value=0.00015 Score=69.66 Aligned_cols=76 Identities=22% Similarity=0.297 Sum_probs=62.3
Q ss_pred CCCCceEEEEEEe-ecChhhHHHHHHHHhccCCe-eEEEeCCCCEEEEE--e-ecCHHHHHHHHhc-CCceEEeecCCCC
Q 048021 6 SSSAKMTCGLKVD-TNDPAWFSSMTKVLRKIKGA-SYTIDAEEGMAYIT--G-RANPRKLLRKLQS-GKYANLCWVSAGN 79 (143)
Q Consensus 6 ~~s~~~tvvlkV~-m~C~gC~~kIkkaL~kI~GV-sV~VDle~~~VtV~--G-~vdp~~Ll~aI~k-Gk~Ael~~~~~~~ 79 (143)
+.....+|.|.|+ |+|..|++.|++.|++++|| ++.+.+-..+..|. + ...++.+.+.|++ |+.++++.-.+++
T Consensus 64 ~~~~~~~~~l~v~GmtC~scv~~i~~~l~~~~gv~~~~val~~~~~~v~~dp~v~s~~~~~e~ie~~gf~a~~i~~~~~~ 143 (951)
T KOG0207|consen 64 SEITASKCYLSVNGMTCASCVATIERNLRKIEGVESAVVALSASKAEVIYDPAVTSPDSIAESIEDLGFSAELIESVNGN 143 (951)
T ss_pred CccccceeEEEecCceeHHHHHHHHHHhhccCCcceEEEEeeccceeEEECCcccCchhHHHHHHhcCccceehhcccCC
Confidence 3344449999998 79999999999999999999 77777766666665 4 3589999999999 9999998887766
Q ss_pred CC
Q 048021 80 QI 81 (143)
Q Consensus 80 s~ 81 (143)
+.
T Consensus 144 ~~ 145 (951)
T KOG0207|consen 144 SN 145 (951)
T ss_pred CC
Confidence 54
No 10
>PRK10671 copA copper exporting ATPase; Provisional
Probab=97.38 E-value=0.00051 Score=64.71 Aligned_cols=62 Identities=19% Similarity=0.320 Sum_probs=55.4
Q ss_pred EEEEEEe-ecChhhHHHHHHHHhccCCe-eEEEeCCCCEEEEEeecCHHHHHHHHhc-CCceEEe
Q 048021 12 TCGLKVD-TNDPAWFSSMTKVLRKIKGA-SYTIDAEEGMAYITGRANPRKLLRKLQS-GKYANLC 73 (143)
Q Consensus 12 tvvlkV~-m~C~gC~~kIkkaL~kI~GV-sV~VDle~~~VtV~G~vdp~~Ll~aI~k-Gk~Ael~ 73 (143)
++.|.|. |+|.+|+..|++.|.+++|| ++.++...+++.|.+..++..+.+.+++ |+.+.++
T Consensus 100 ~~~l~V~Gm~Ca~Ca~~Ie~~L~~~~GV~~a~vnl~t~~~~V~~~~s~~~I~~~I~~~Gy~a~~~ 164 (834)
T PRK10671 100 SQQLLLSGMSCASCVSRVQNALQSVPGVTQARVNLAERTALVMGSASPQDLVQAVEKAGYGAEAI 164 (834)
T ss_pred eEEEEeCCcCcHHHHHHHHHHHhcCCCceeeeeecCCCeEEEEccCCHHHHHHHHHhcCCCcccc
Confidence 5678885 79999999999999999999 7999999999999877889999999999 9988654
No 11
>KOG0207 consensus Cation transport ATPase [Inorganic ion transport and metabolism]
Probab=97.23 E-value=0.00057 Score=65.74 Aligned_cols=61 Identities=16% Similarity=0.173 Sum_probs=53.6
Q ss_pred eecChhhHHHHHHHHhccCCe-eEEEeCCCCEEEEE--eecCHHHHHHHHhc-CCceEEeecCCC
Q 048021 18 DTNDPAWFSSMTKVLRKIKGA-SYTIDAEEGMAYIT--GRANPRKLLRKLQS-GKYANLCWVSAG 78 (143)
Q Consensus 18 ~m~C~gC~~kIkkaL~kI~GV-sV~VDle~~~VtV~--G~vdp~~Ll~aI~k-Gk~Ael~~~~~~ 78 (143)
.|+|..|.+.|++++.+.+|| +++|++.++..+|. ..++++.|+++|+. |+++++..-...
T Consensus 2 gmtc~ac~~si~~~~~~~~g~~~i~vsl~~~~~~v~~~~~~~~~~i~~~ied~gf~~~~~~~~~~ 66 (951)
T KOG0207|consen 2 GMTCSACSNSIEKAISRKPGVQKIEVSLAQKRANVSYDNIVSPESIKETIEDMGFEASLLSDSEI 66 (951)
T ss_pred CccHHHHhhhHHHHHhcCCCceeEEEEeccccceEEEeeccCHHHHHHHhhcccceeeecccCcc
Confidence 489999999999999999999 89999987776665 56899999999999 999998876543
No 12
>PRK11033 zntA zinc/cadmium/mercury/lead-transporting ATPase; Provisional
Probab=96.71 E-value=0.005 Score=57.76 Aligned_cols=64 Identities=11% Similarity=0.083 Sum_probs=51.7
Q ss_pred ceEEEEEEe-ecChhhHHHHHHHHhccCCe-eEEEeCCCCEEEEEeec-CHHHHHHHHhc-CCceEEe
Q 048021 10 KMTCGLKVD-TNDPAWFSSMTKVLRKIKGA-SYTIDAEEGMAYITGRA-NPRKLLRKLQS-GKYANLC 73 (143)
Q Consensus 10 ~~tvvlkV~-m~C~gC~~kIkkaL~kI~GV-sV~VDle~~~VtV~G~v-dp~~Ll~aI~k-Gk~Ael~ 73 (143)
..++.++|. |+|.+|+.+|+++|.+++|| ++.++...+++.|.... ..+.+.+++++ |+.+...
T Consensus 52 ~~r~~l~V~Gm~C~sCa~~Ie~aL~~~~GV~~v~Vn~at~k~~V~~d~~~~~~I~~aI~~~Gy~a~~~ 119 (741)
T PRK11033 52 GTRYSWKVSGMDCPSCARKVENAVRQLAGVNQVQVLFATEKLVVDADNDIRAQVESAVQKAGFSLRDE 119 (741)
T ss_pred CceEEEEECCCCcHHHHHHHHHHHhcCCCeeeEEEEcCCCeEEEEecccchHHHHHHHHhcccccccc
Confidence 446778886 79999999999999999999 79999998888886321 12677888888 9987643
No 13
>TIGR02052 MerP mercuric transport protein periplasmic component. This model represents the periplasmic mercury (II) binding protein of the bacterial mercury detoxification system which passes mercuric ion to the MerT transporter for subsequent reduction to Hg(0) by the mercuric reductase MerA. MerP contains a distinctive GMTCXXC motif associated with metal binding. MerP is related to a larger family of metal binding proteins (pfam00403).
Probab=94.78 E-value=0.38 Score=30.98 Aligned_cols=62 Identities=21% Similarity=0.223 Sum_probs=45.1
Q ss_pred eEEEEEEe-ecChhhHHHHHHHHhccCCe-eEEEeCCCCEEEEEe---ecCHHHHHHHHhc-CCceEE
Q 048021 11 MTCGLKVD-TNDPAWFSSMTKVLRKIKGA-SYTIDAEEGMAYITG---RANPRKLLRKLQS-GKYANL 72 (143)
Q Consensus 11 ~tvvlkV~-m~C~gC~~kIkkaL~kI~GV-sV~VDle~~~VtV~G---~vdp~~Ll~aI~k-Gk~Ael 72 (143)
.+..+.++ +.|..|...+++.+...+|+ ...++.....+.+.- ..+...+...+++ |+.+++
T Consensus 23 ~~~~~~~~~~~c~~c~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 90 (92)
T TIGR02052 23 QTVTLEVPGMTCVACPITVETALQKVDGVSKAEVTFKTKLAVVTFDDEKTNVKALTEATTDAGYPSSL 90 (92)
T ss_pred eEEEEEECCeEcHHHHHHHHHHHhcCCCEEEEEEEecCCEEEEEECCCCCCHHHHHHHHHhcCCCeEe
Confidence 34566675 68999999999999999998 577777776665541 2466666667777 877654
No 14
>PF05046 Img2: Mitochondrial large subunit ribosomal protein (Img2); InterPro: IPR007740 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. This family of proteins has been identified as part of the mitochondrial large ribosomal subunit in Saccharomyces cerevisiae [].; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome
Probab=92.48 E-value=0.67 Score=32.71 Aligned_cols=60 Identities=18% Similarity=0.142 Sum_probs=49.6
Q ss_pred CCceEEEEEEeecChhhHHHHHHHHhccCC--eeEEEeCCCCEEEEEeecCHHHHHHHHhc-CC
Q 048021 8 SAKMTCGLKVDTNDPAWFSSMTKVLRKIKG--ASYTIDAEEGMAYITGRANPRKLLRKLQS-GK 68 (143)
Q Consensus 8 s~~~tvvlkV~m~C~gC~~kIkkaL~kI~G--VsV~VDle~~~VtV~G~vdp~~Ll~aI~k-Gk 68 (143)
+...|++=||++.=..+++.++++|....+ ..+.|+...+.|.|.|.. ...|.+.|.+ ||
T Consensus 25 ~~~~T~IrkI~GD~~aL~~dL~~~l~~~~~~~~~~~V~~~~g~i~IkG~~-~~~Vk~wL~~~GF 87 (87)
T PF05046_consen 25 NRKITVIRKIEGDIWALKKDLRKFLGEKPKKKIDVRVNELTGHIEIKGDH-VEEVKKWLLEKGF 87 (87)
T ss_pred cEeEEEEEeecCCHHHHHHHHHHHhhhhcCCCcceEEeecCCEEEEcCcc-HHHHHHHHHHCcC
Confidence 345566668999999999999999987766 468999999999999986 6778888877 75
No 15
>PF14492 EFG_II: Elongation Factor G, domain II; PDB: 1WDT_A 2DY1_A 2XEX_A 1ELO_A 2XSY_Y 2WRK_Y 1DAR_A 2WRI_Y 2XUY_Y 3J0E_H ....
Probab=92.34 E-value=0.73 Score=31.24 Aligned_cols=60 Identities=15% Similarity=0.222 Sum_probs=44.5
Q ss_pred EEEEEEeecChhhHHHHHHHHhcc----CCeeEEEeCCCCEEEEEe--ecCHHHHHHHHhcCCceE
Q 048021 12 TCGLKVDTNDPAWFSSMTKVLRKI----KGASYTIDAEEGMAYITG--RANPRKLLRKLQSGKYAN 71 (143)
Q Consensus 12 tvvlkV~m~C~gC~~kIkkaL~kI----~GVsV~VDle~~~VtV~G--~vdp~~Ll~aI~kGk~Ae 71 (143)
.+.+.|.-.-.+=..++.++|++| +++.+..|.+++.+.|.| .+--+.++++|++.|.++
T Consensus 5 v~~~~i~p~~~~d~~kl~~aL~~l~~eDP~l~~~~d~et~e~~l~g~Gelhlev~~~~L~~~~~v~ 70 (75)
T PF14492_consen 5 VLSVAIEPKNKEDEPKLSEALQKLSEEDPSLRVERDEETGELILSGMGELHLEVLLERLKRRFGVE 70 (75)
T ss_dssp SEEEEEEESSHHHHHHHHHHHHHHHHH-TTSEEEEETTTSEEEEEESSHHHHHHHHHHHHHTTCEB
T ss_pred eEEEEEEECCHhHHHHHHHHHHHHHhcCCeEEEEEcchhceEEEEECCHHHHHHHHHHHHHHHCCe
Confidence 445555555566677888888776 455799999999999996 567899999998844443
No 16
>PF02680 DUF211: Uncharacterized ArCR, COG1888; InterPro: IPR003831 This entry describes proteins of unknown function.; PDB: 3BPD_I 2RAQ_F 2X3D_E.
Probab=91.90 E-value=0.99 Score=32.99 Aligned_cols=62 Identities=16% Similarity=0.185 Sum_probs=39.0
Q ss_pred CceEEEEEEeecChhhHHHHHHHHhccCCe-e-----EEEeCCC--CEEEEEee-cCHHHHHHHHhc-CCce
Q 048021 9 AKMTCGLKVDTNDPAWFSSMTKVLRKIKGA-S-----YTIDAEE--GMAYITGR-ANPRKLLRKLQS-GKYA 70 (143)
Q Consensus 9 ~~~tvvlkV~m~C~gC~~kIkkaL~kI~GV-s-----V~VDle~--~~VtV~G~-vdp~~Ll~aI~k-Gk~A 70 (143)
+.+.++|.|--.-+--.-.+.++|.+++|| . .++|.+. =+++|+|. +|-++|.++|++ |--+
T Consensus 3 ~irRlVLDVlKP~~p~i~e~A~~l~~~~gV~gVnitv~EvD~ete~lkitiEG~~id~d~i~~~Ie~~Gg~I 74 (95)
T PF02680_consen 3 GIRRLVLDVLKPHEPSIVELAKALSELEGVDGVNITVVEVDVETENLKITIEGDDIDFDEIKEAIEELGGVI 74 (95)
T ss_dssp SEEEEEEEEEEESSS-HHHHHHHHHTSTTEEEEEEEEEEE-SSEEEEEEEEEESSE-HHHHHHHHHHTT-EE
T ss_pred ceeEEEEEeecCCCCCHHHHHHHHHhCCCcceEEEEEEEeeccccEEEEEEEeCCCCHHHHHHHHHHcCCeE
Confidence 345677777532233334677888899887 3 3455553 35666785 899999999999 7543
No 17
>PRK13748 putative mercuric reductase; Provisional
Probab=90.81 E-value=1.3 Score=39.50 Aligned_cols=67 Identities=19% Similarity=0.243 Sum_probs=51.3
Q ss_pred EEEEe-ecChhhHHHHHHHHhccCCe-eEEEeCCCCEEEEEe--ecCHHHHHHHHhc-CCceEEeecCCCCC
Q 048021 14 GLKVD-TNDPAWFSSMTKVLRKIKGA-SYTIDAEEGMAYITG--RANPRKLLRKLQS-GKYANLCWVSAGNQ 80 (143)
Q Consensus 14 vlkV~-m~C~gC~~kIkkaL~kI~GV-sV~VDle~~~VtV~G--~vdp~~Ll~aI~k-Gk~Ael~~~~~~~s 80 (143)
.+.+. ++|..|..++++.+...+|+ ...++...+.+.+.. ..+...+...++. |+.+++...++.+.
T Consensus 3 ~i~i~g~~C~~c~~~ie~~l~~~~gv~~a~~~~~~~~~~v~~~~~~~~~~i~~~i~~~g~~~~~~~~~~~~~ 74 (561)
T PRK13748 3 TLKITGMTCDSCAAHVKDALEKVPGVQSADVSYPKGSAQLAIEVGTSPDALTAAVAGLGYRATLADAPPTDN 74 (561)
T ss_pred EEEECCeecHHHHHHHHHHHhcCCCeeEEEEEcCCCEEEEEECCCCCHHHHHHHHHHcCCeeeccCcccccc
Confidence 35564 68999999999999999998 688888888877763 2466777777778 99888777754443
No 18
>COG1888 Uncharacterized protein conserved in archaea [Function unknown]
Probab=90.11 E-value=0.56 Score=34.27 Aligned_cols=61 Identities=18% Similarity=0.154 Sum_probs=36.6
Q ss_pred ceEEEEEEee-cChhhHHHHHHHHhccCCe---e---EEEeCC--CCEEEEEe-ecCHHHHHHHHhc-CCce
Q 048021 10 KMTCGLKVDT-NDPAWFSSMTKVLRKIKGA---S---YTIDAE--EGMAYITG-RANPRKLLRKLQS-GKYA 70 (143)
Q Consensus 10 ~~tvvlkV~m-~C~gC~~kIkkaL~kI~GV---s---V~VDle--~~~VtV~G-~vdp~~Ll~aI~k-Gk~A 70 (143)
.+.++|.|-- +-+--.-.+.+.|.+++|| + .++|.+ +=++||.| .+|-+.|.+.|++ |-.+
T Consensus 5 iRRlVLDvlKP~~~p~ive~A~~lskl~gVegVNItv~eiD~et~~~~itIeG~~ldydei~~~iE~~Gg~I 76 (97)
T COG1888 5 IRRLVLDVLKPHRGPTIVELALELSKLEGVEGVNITVTEIDVETENLKITIEGTNLDYDEIEEVIEELGGAI 76 (97)
T ss_pred ceeeeeeecCCcCCCcHHHHHHHHhhcCCcceEEEEEEEeeehhcceEEEEEcCCCCHHHHHHHHHHcCCee
Confidence 3455666532 2222222344455555554 2 344444 56788888 4899999999999 8644
No 19
>PF04972 BON: BON domain; InterPro: IPR007055 The BON domain is typically ~60 residues long and has an alpha/beta predicted fold. There is a conserved glycine residue and several hydrophobic regions. This pattern of conservation is more suggestive of a binding or structural function rather than a catalytic function. Most proteobacteria seem to possess one or two BON-containing proteins, typically of the OsmY-type proteins; outside of this group the distribution is more disparate. The OsmY protein is an Escherichia coli 20 kDa outer membrane or periplasmic protein that is expressed in response to a variety of stress conditions, in particular, helping to provide protection against osmotic shock. One hypothesis is that OsmY prevents shrinkage of the cytoplasmic compartment by contacting the phospholipid interfaces surrounding the periplasmic space. The domain architecture of two BON domains alone suggests that these domains contact the surfaces of phospholipids, with each domain contacting a membrane [].; PDB: 2L26_A 2KGS_A 2KSM_A.
Probab=89.66 E-value=0.26 Score=31.53 Aligned_cols=31 Identities=16% Similarity=0.305 Sum_probs=16.9
Q ss_pred HHHHHHHhccCCe---eEEEeCCCCEEEEEeecC
Q 048021 26 SSMTKVLRKIKGA---SYTIDAEEGMAYITGRAN 56 (143)
Q Consensus 26 ~kIkkaL~kI~GV---sV~VDle~~~VtV~G~vd 56 (143)
.+|+.+|.+-+++ ++.|...++.|++.|.++
T Consensus 2 ~~v~~~L~~~~~~~~~~i~v~v~~g~v~L~G~v~ 35 (64)
T PF04972_consen 2 TKVRAALRADPWLPDSNISVSVENGVVTLSGEVP 35 (64)
T ss_dssp -----------CTT-TTEEEEEECTEEEEEEEES
T ss_pred cccccccccccccCCCeEEEEEECCEEEEEeeCc
Confidence 4678888874444 488999999999999984
No 20
>cd00371 HMA Heavy-metal-associated domain (HMA) is a conserved domain of approximately 30 amino acid residues found in a number of proteins that transport or detoxify heavy metals, for example, the CPx-type heavy metal ATPases and copper chaperones. HMA domain contains two cysteine residues that are important in binding and transfer of metal ions, such as copper, cadmium, cobalt and zinc. In the case of copper, stoichiometry of binding is one Cu+ ion per binding domain. Repeats of the HMA domain in copper chaperone has been associated with Menkes/Wilson disease due to binding of multiple copper ions.
Probab=89.43 E-value=1.8 Score=22.20 Aligned_cols=49 Identities=27% Similarity=0.409 Sum_probs=33.0
Q ss_pred eecChhhHHHHHHHHhccCCe-eEEEeCCCCEEEEEee--cCHHHHHHHHhc
Q 048021 18 DTNDPAWFSSMTKVLRKIKGA-SYTIDAEEGMAYITGR--ANPRKLLRKLQS 66 (143)
Q Consensus 18 ~m~C~gC~~kIkkaL~kI~GV-sV~VDle~~~VtV~G~--vdp~~Ll~aI~k 66 (143)
++.|..|...++..+...+|+ ...+......+.+... .+...+...+..
T Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 57 (63)
T cd00371 6 GMTCAGCVSKIEKALEKLPGVESVEVDLETGKATVEYDPEVSPEELLEAIED 57 (63)
T ss_pred CeEcHHHHHHHHHHHhcCCCEeEEEEEccCCEEEEEECCCCCHHHHHHHHHH
Confidence 468999999999999888887 5666666555555532 255554444444
No 21
>PF01883 DUF59: Domain of unknown function DUF59; InterPro: IPR002744 This family includes prokaryotic proteins of unknown function. The family also includes PhaH (O84984 from SWISSPROT) from Pseudomonas putida. PhaH forms a complex with PhaF (O84982 from SWISSPROT), PhaG (O84983 from SWISSPROT) and PhaI (O84985 from SWISSPROT), which hydroxylates phenylacetic acid to 2-hydroxyphenylacetic acid []. So members of this family may all be components of ring hydroxylating complexes.; PDB: 3LNO_C 3CQ3_A 3CQ2_D 2CU6_B 3CQ1_A 3UX3_B 3UX2_A 1WCJ_A 1UWD_A.
Probab=88.86 E-value=0.79 Score=30.21 Aligned_cols=32 Identities=16% Similarity=0.313 Sum_probs=22.1
Q ss_pred eEEEEEEeecChhhH------HHHHHHHhccCCe-eEEE
Q 048021 11 MTCGLKVDTNDPAWF------SSMTKVLRKIKGA-SYTI 42 (143)
Q Consensus 11 ~tvvlkV~m~C~gC~------~kIkkaL~kI~GV-sV~V 42 (143)
-++.|.+.+...+|. +.|+++|+.++|| +|+|
T Consensus 34 ~~V~v~l~l~~~~~~~~~~l~~~i~~~l~~l~gv~~V~V 72 (72)
T PF01883_consen 34 GKVSVSLELPTPACPAAEPLREEIREALKALPGVKSVKV 72 (72)
T ss_dssp CEEEEEE--SSTTHTTHHHHHHHHHHHHHTSTT-SEEEE
T ss_pred CEEEEEEEECCCCchHHHHHHHHHHHHHHhCCCCceEeC
Confidence 356677777666665 7889999999999 7775
No 22
>PF03927 NapD: NapD protein; InterPro: IPR005623 This entry represents NapD, the twin-arginine signal-peptide-binding chaperone for NapA, functioning as an assembly protein for the periplasmic nitrate reductase NapABC. The periplasmic NapABC enzyme likely functions during growth in nitrate-limited environments [].; PDB: 2JSX_A 2PQ4_A.
Probab=87.74 E-value=2.4 Score=29.31 Aligned_cols=44 Identities=14% Similarity=0.211 Sum_probs=30.4
Q ss_pred hhHHHHHHHHhccCCeeEEEeCCCCEEEEEee-cCHHHHHHHHhc
Q 048021 23 AWFSSMTKVLRKIKGASYTIDAEEGMAYITGR-ANPRKLLRKLQS 66 (143)
Q Consensus 23 gC~~kIkkaL~kI~GVsV~VDle~~~VtV~G~-vdp~~Ll~aI~k 66 (143)
+=...|+++|.+++|++|....++|++.|.=. .+...+.+.|.+
T Consensus 15 ~~~~~v~~~l~~~~gvEVh~~~~~GKiVVtiE~~~~~~~~~~~~~ 59 (79)
T PF03927_consen 15 ERLEEVAEALAAIPGVEVHAVDEDGKIVVTIEAESSEEEVDLIDA 59 (79)
T ss_dssp CCHHHHHHHHCCSTTEEEEEEETTTEEEEEEEESSHHHHHHHHHH
T ss_pred hhHHHHHHHHHcCCCcEEEeeCCCCeEEEEEEeCChHHHHHHHHH
Confidence 44678999999999998755444477777643 455666655554
No 23
>PF13732 DUF4162: Domain of unknown function (DUF4162)
Probab=84.51 E-value=3.6 Score=27.42 Aligned_cols=45 Identities=18% Similarity=0.189 Sum_probs=32.8
Q ss_pred HhccCCe-eEEEeCCC-CEEEEEeecCHHHHHHHHhc-CCceEEeecCC
Q 048021 32 LRKIKGA-SYTIDAEE-GMAYITGRANPRKLLRKLQS-GKYANLCWVSA 77 (143)
Q Consensus 32 L~kI~GV-sV~VDle~-~~VtV~G~vdp~~Ll~aI~k-Gk~Ael~~~~~ 77 (143)
|+.++|| ++..+... -++.|....++.+|++.|.+ |. +.-+....
T Consensus 26 l~~~~~v~~v~~~~~~~~~i~l~~~~~~~~ll~~l~~~g~-I~~f~~~~ 73 (84)
T PF13732_consen 26 LEELPGVESVEQDGDGKLRIKLEDEETANELLQELIEKGI-IRSFEEEE 73 (84)
T ss_pred HhhCCCeEEEEEeCCcEEEEEECCcccHHHHHHHHHhCCC-eeEEEEcC
Confidence 8888999 67765433 35555666788999999999 99 76555543
No 24
>PRK11023 outer membrane lipoprotein; Provisional
Probab=83.86 E-value=3.1 Score=33.02 Aligned_cols=41 Identities=12% Similarity=0.191 Sum_probs=32.6
Q ss_pred hHHHHHHHHhccCCe---eEEEeCCCCEEEEEeecCHHHHHHHH
Q 048021 24 WFSSMTKVLRKIKGA---SYTIDAEEGMAYITGRANPRKLLRKL 64 (143)
Q Consensus 24 C~~kIkkaL~kI~GV---sV~VDle~~~VtV~G~vdp~~Ll~aI 64 (143)
=..+|+.+|..-+++ .|+|..++|.|++.|.++.++..+++
T Consensus 128 It~kik~~L~~~~~v~~~~I~V~t~~G~V~L~G~v~~~e~~~a~ 171 (191)
T PRK11023 128 ITTKVRSQLLTSDSVKSSNVKVTTENGEVFLLGLVTQREAKAAA 171 (191)
T ss_pred HHHHHHHHHhcCCCCCcceEEEEEECcEEEEEEEeCHHHHHHHH
Confidence 456899999877777 38999999999999999876554444
No 25
>cd04888 ACT_PheB-BS C-terminal ACT domain of a small (~147 a.a.) putative phenylalanine biosynthetic pathway protein described in Bacillus subtilis (BS) PheB (PheB-BS) and related domains. This CD includes the C-terminal ACT domain of a small (~147 a.a.) putative phenylalanine biosynthetic pathway protein described in Bacillus subtilis (BS) PheB (PheB-BS) and other related ACT domains. In B. subtilis, the upstream gene of pheB, pheA encodes prephenate dehydratase (PDT). The presumed product of the pheB gene is chorismate mutase (CM). The deduced product of the B. subtilis pheB gene, however, has no significant homology to the CM portion of the bifunctional CM-PDT of Escherichia coli. The presence of an ACT domain lends support to the prediction that these proteins function as a phenylalanine-binding regulatory protein. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=83.26 E-value=3.1 Score=26.75 Aligned_cols=33 Identities=18% Similarity=0.334 Sum_probs=24.1
Q ss_pred ceEEEEEEeecChh-hHHHHHHHHhccCCe-eEEE
Q 048021 10 KMTCGLKVDTNDPA-WFSSMTKVLRKIKGA-SYTI 42 (143)
Q Consensus 10 ~~tvvlkV~m~C~g-C~~kIkkaL~kI~GV-sV~V 42 (143)
...+.|.|+..-.. --++|.++|++++|| +|.+
T Consensus 40 ~~~i~~~v~v~~~~~~l~~l~~~L~~i~~V~~v~~ 74 (76)
T cd04888 40 RANVTISIDTSTMNGDIDELLEELREIDGVEKVEL 74 (76)
T ss_pred eEEEEEEEEcCchHHHHHHHHHHHhcCCCeEEEEE
Confidence 34566666655444 778999999999999 7765
No 26
>COG1094 Predicted RNA-binding protein (contains KH domains) [General function prediction only]
Probab=83.05 E-value=2.7 Score=34.28 Aligned_cols=39 Identities=23% Similarity=0.450 Sum_probs=31.6
Q ss_pred HHHHHHHHhccCCeeEEEeCCCCEEEEEee---cCHHHHHHH
Q 048021 25 FSSMTKVLRKIKGASYTIDAEEGMAYITGR---ANPRKLLRK 63 (143)
Q Consensus 25 ~~kIkkaL~kI~GVsV~VDle~~~VtV~G~---vdp~~Ll~a 63 (143)
...|+++|.+..|+.+++|.+++.|+|... .||..++++
T Consensus 26 ~g~v~k~ie~~~~~~~~iD~~~~~V~i~~~~~t~Dp~~~~ka 67 (194)
T COG1094 26 WGEVKKAIEEKTGVKLRIDSKTGSVTIRTTRKTEDPLALLKA 67 (194)
T ss_pred cccchHHHHhhcCeEEEEECCCCeEEEEecCCCCChHHHHHH
Confidence 457888998888899999999999999965 476665544
No 27
>cd04908 ACT_Bt0572_1 N-terminal ACT domain of a novel protein composed almost entirely of two tandem ACT domains. Included in this CD is the N-terminal ACT domain of a novel protein composed almost entirely of two tandem ACT domains as seen in the uncharacterized structure (pdb 2F06) of the Bt0572 protein from Bacteroides thetaiotaomicron and related ACT domains. These tandem ACT domain proteins belong to the superfamily of ACT regulatory domains.
Probab=81.71 E-value=11 Score=24.06 Aligned_cols=56 Identities=29% Similarity=0.443 Sum_probs=35.7
Q ss_pred EEEee-cChhhHHHHHHHHhccCCe---eEEE-eC-CCCEEEEEeecCHHHHHHHHhc-CCceEE
Q 048021 15 LKVDT-NDPAWFSSMTKVLRKIKGA---SYTI-DA-EEGMAYITGRANPRKLLRKLQS-GKYANL 72 (143)
Q Consensus 15 lkV~m-~C~gC~~kIkkaL~kI~GV---sV~V-Dl-e~~~VtV~G~vdp~~Ll~aI~k-Gk~Ael 72 (143)
|.|.+ ...|=-.+|.++|.+ .|+ ++.+ .. +...+.+.. -|++++++.|++ |+++.+
T Consensus 4 i~v~v~d~pG~La~v~~~l~~-~~inI~~i~~~~~~~~~~~rl~~-~~~~~~~~~L~~~G~~v~~ 66 (66)
T cd04908 4 LSVFLENKPGRLAAVTEILSE-AGINIRALSIADTSEFGILRLIV-SDPDKAKEALKEAGFAVKL 66 (66)
T ss_pred EEEEEcCCCChHHHHHHHHHH-CCCCEEEEEEEecCCCCEEEEEE-CCHHHHHHHHHHCCCEEEC
Confidence 34444 456777788888877 455 3322 22 234444444 578899999999 998753
No 28
>PRK10553 assembly protein for periplasmic nitrate reductase; Provisional
Probab=81.32 E-value=5.8 Score=28.21 Aligned_cols=44 Identities=16% Similarity=0.299 Sum_probs=30.2
Q ss_pred hhHHHHHHHHhccCCeeEE-EeCCCCEEEEEee-cCHHHH---HHHHhc
Q 048021 23 AWFSSMTKVLRKIKGASYT-IDAEEGMAYITGR-ANPRKL---LRKLQS 66 (143)
Q Consensus 23 gC~~kIkkaL~kI~GVsV~-VDle~~~VtV~G~-vdp~~L---l~aI~k 66 (143)
.=...|+++|.+++|++|- .|.++||+.|.=. .+...+ +++|+.
T Consensus 17 e~~~~V~~~l~~ipg~Evh~~d~~~GKiVVtiE~~~~~~~~~~i~~I~~ 65 (87)
T PRK10553 17 ERISDISTQLNAFPGCEVAVSDAPSGQLIVVVEAEDSETLLQTIESVRN 65 (87)
T ss_pred HHHHHHHHHHHcCCCcEEEeecCCCCeEEEEEEeCChHHHHHHHHHHHc
Confidence 3367899999999999866 4677788777633 344444 445554
No 29
>PRK11198 LysM domain/BON superfamily protein; Provisional
Probab=80.35 E-value=3.3 Score=31.54 Aligned_cols=44 Identities=18% Similarity=0.314 Sum_probs=30.9
Q ss_pred hhhHHHHHHHHhccCCe---eEEEeCCCCEEEEEeecCHHHHHHHHhc
Q 048021 22 PAWFSSMTKVLRKIKGA---SYTIDAEEGMAYITGRANPRKLLRKLQS 66 (143)
Q Consensus 22 ~gC~~kIkkaL~kI~GV---sV~VDle~~~VtV~G~vdp~~Ll~aI~k 66 (143)
+.=+++|+++|.+ .|+ .+.|..++|.|++.|.++....+.++..
T Consensus 25 ~~~~~~i~~~i~~-~~~~~~~i~V~v~~G~v~l~G~v~s~~~~~~~~~ 71 (147)
T PRK11198 25 EDAADALKEHISK-QGLGDADVNVQVEDGKATVSGDAASQEAKEKILL 71 (147)
T ss_pred HHHHHHHHHHHHh-cCCCcCCceEEEeCCEEEEEEEeCCHHHHHHHHH
Confidence 3445677888876 454 3788888999999999865555555544
No 30
>KOG4034 consensus Uncharacterized conserved protein NOF (Neighbor of FAU) [Function unknown]
Probab=79.04 E-value=3.7 Score=32.79 Aligned_cols=58 Identities=21% Similarity=0.169 Sum_probs=46.7
Q ss_pred ceEEEEEEeecChhhHHHHHHHHhccCCeeEE--EeCCCCEEEEEeecCHHHHHHHHhc-CC
Q 048021 10 KMTCGLKVDTNDPAWFSSMTKVLRKIKGASYT--IDAEEGMAYITGRANPRKLLRKLQS-GK 68 (143)
Q Consensus 10 ~~tvvlkV~m~C~gC~~kIkkaL~kI~GVsV~--VDle~~~VtV~G~vdp~~Ll~aI~k-Gk 68 (143)
.+|++=||++.--.|++.++..|+-+.|-++. |+.-+|++.+.|..-. .|.+.|.+ ||
T Consensus 109 ~lT~IRkVeGDi~aLe~DL~s~L~~~~~~s~~t~VnelsgqI~~~g~~v~-~vr~~L~eKGF 169 (169)
T KOG4034|consen 109 ILTVIRKVEGDIWALENDLRSTLEMSPKKSYATHVNELSGQIVLKGNHVD-TVREWLQEKGF 169 (169)
T ss_pred EEEEEEeecccHHHHHHHHHHHHhhccCCChhhhhhhhcceEEEeCChHH-HHHHHHHHccC
Confidence 45666689999999999999999999887654 8888999998875322 88888887 75
No 31
>PF14437 MafB19-deam: MafB19-like deaminase
Probab=77.75 E-value=5.3 Score=31.23 Aligned_cols=41 Identities=22% Similarity=0.220 Sum_probs=32.2
Q ss_pred ceEEEEEEee-cChhhHHHHHHHHhccCCe-eEEEeCC-CCEEEE
Q 048021 10 KMTCGLKVDT-NDPAWFSSMTKVLRKIKGA-SYTIDAE-EGMAYI 51 (143)
Q Consensus 10 ~~tvvlkV~m-~C~gC~~kIkkaL~kI~GV-sV~VDle-~~~VtV 51 (143)
...+++.|+- -|+.|...|....+++ |+ +++|-.. .|++.+
T Consensus 99 g~~~tm~Vdr~vC~~C~~~i~~~a~~l-Gl~~L~I~~~~sG~~~~ 142 (146)
T PF14437_consen 99 GRSMTMYVDRDVCGYCGGDIPSMAEKL-GLKSLTIHEPDSGKVYY 142 (146)
T ss_pred CCeEEEEECcccchHHHHHHHHHHHHc-CCCeEEEEecCCCcEEE
Confidence 4467888874 7999999999988885 88 6887666 776655
No 32
>cd04883 ACT_AcuB C-terminal ACT domain of the Bacillus subtilis acetoin utilization protein, AcuB. This CD includes the C-terminal ACT domain of the Bacillus subtilis acetoin utilization protein, AcuB. AcuB is putatively involved in the anaerobic catabolism of acetoin, and related proteins. Studies report the induction of AcuB by nitrate respiration and also by fermentation. Since acetoin can be secreted and later serve as a source of carbon, it has been proposed that, during anaerobic growth when other carbon sources are exhausted, the induction of the AcuB protein results in acetoin catabolism. AcuB-like proteins have two N-terminal tandem CBS domains and a single C-terminal ACT domain. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=74.29 E-value=19 Score=22.75 Aligned_cols=57 Identities=16% Similarity=0.337 Sum_probs=36.0
Q ss_pred EEEEee-cChhhHHHHHHHHhccCCe---eEEEeCC----CCEEEEEee-cCHHHHHHHHhc-CCceE
Q 048021 14 GLKVDT-NDPAWFSSMTKVLRKIKGA---SYTIDAE----EGMAYITGR-ANPRKLLRKLQS-GKYAN 71 (143)
Q Consensus 14 vlkV~m-~C~gC~~kIkkaL~kI~GV---sV~VDle----~~~VtV~G~-vdp~~Ll~aI~k-Gk~Ae 71 (143)
.|.|.+ ...|.-.+|.+.|.+. |+ ++..... ...++|.-. .+++.++++|++ |+++.
T Consensus 3 ~~~v~~~d~pG~l~~i~~~l~~~-~inI~~i~~~~~~~~~~~~v~i~v~~~~~~~~~~~L~~~G~~v~ 69 (72)
T cd04883 3 QIEVRVPDRPGQLADIAAIFKDR-GVNIVSVLVYPSKEEDNKILVFRVQTMNPRPIIEDLRRAGYEVL 69 (72)
T ss_pred EEEEEECCCCCHHHHHHHHHHHc-CCCEEEEEEeccCCCCeEEEEEEEecCCHHHHHHHHHHCCCeee
Confidence 455555 4567888888888774 44 3333222 223455432 578899999999 99653
No 33
>PF13192 Thioredoxin_3: Thioredoxin domain; PDB: 1ZYP_B 1ZYN_A 1HYU_A 1ILO_A 1J08_F 2YWM_B 2AYT_B 2HLS_B 1A8L_A 2K8S_B ....
Probab=72.01 E-value=14 Score=24.50 Aligned_cols=14 Identities=7% Similarity=-0.036 Sum_probs=9.3
Q ss_pred EEEEEEeecChhhHH
Q 048021 12 TCGLKVDTNDPAWFS 26 (143)
Q Consensus 12 tvvlkV~m~C~gC~~ 26 (143)
++.+ +.-.|..|..
T Consensus 2 ~I~v-~~~~C~~C~~ 15 (76)
T PF13192_consen 2 KIKV-FSPGCPYCPE 15 (76)
T ss_dssp EEEE-ECSSCTTHHH
T ss_pred EEEE-eCCCCCCcHH
Confidence 3444 5557999984
No 34
>smart00838 EFG_C Elongation factor G C-terminus. This domain includes the carboxyl terminal regions of Elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopt a ferredoxin-like fold.
Probab=71.79 E-value=20 Score=24.15 Aligned_cols=45 Identities=13% Similarity=0.243 Sum_probs=30.6
Q ss_pred EEeecC-hhhHHHHHHHHhccCCeeEEEeCCCCEEEEEeecCHHHH
Q 048021 16 KVDTND-PAWFSSMTKVLRKIKGASYTIDAEEGMAYITGRANPRKL 60 (143)
Q Consensus 16 kV~m~C-~gC~~kIkkaL~kI~GVsV~VDle~~~VtV~G~vdp~~L 60 (143)
+|++.| +.+..+|...|.+-.|.-++.+...+.+.|++.++...+
T Consensus 7 ~~~I~~p~~~~g~v~~~l~~rrG~i~~~~~~~~~~~i~~~iP~~~~ 52 (85)
T smart00838 7 KVEVTVPEEYMGDVIGDLNSRRGKIEGMEQRGGAQVIKAKVPLSEM 52 (85)
T ss_pred EEEEEeCHHHHHHHHHHHHHcCCEEECeeccCCcEEEEEECCHHHH
Confidence 344444 457778899998888863344444568999999885554
No 35
>cd04097 mtEFG1_C mtEFG1_C: C-terminus of mitochondrial Elongation factor G1 (mtEFG1)-like proteins found in eukaryotes. Eukaryotic cells harbor 2 protein synthesis systems: one localized in the cytoplasm, the other in the mitochondria. Most factors regulating mitochondrial protein synthesis are encoded by nuclear genes, translated in the cytoplasm, and then transported to the mitochondria. The eukaryotic system of elongation factor (EF) components is more complex than that in prokaryotes, with both cytoplasmic and mitochondrial elongation factors and multiple isoforms being expressed in certain species. Eukaryotic EF-2 operates in the cytosolic protein synthesis machinery of eukaryotes, EF-Gs in protein synthesis in bacteria. Eukaryotic mtEFG1 proteins show significant homology to bacterial EF-Gs. Mutants in yeast mtEFG1 have impaired mitochondrial protein synthesis, respiratory defects and a tendency to lose mitochondrial DNA. There are two forms of mtEFG present in mammals (desig
Probab=70.32 E-value=23 Score=23.56 Aligned_cols=45 Identities=18% Similarity=0.237 Sum_probs=29.6
Q ss_pred EEeecC-hhhHHHHHHHHhccCCeeEEEeCCCCEEEEEeecCHHHH
Q 048021 16 KVDTND-PAWFSSMTKVLRKIKGASYTIDAEEGMAYITGRANPRKL 60 (143)
Q Consensus 16 kV~m~C-~gC~~kIkkaL~kI~GVsV~VDle~~~VtV~G~vdp~~L 60 (143)
+|++.| +.+..+|...|.+-.|.-+..+..++.+.|++.++...+
T Consensus 5 ~~~I~~p~~~~g~v~~~l~~rrg~i~~~~~~~~~~~i~~~~P~~e~ 50 (78)
T cd04097 5 KVEVTAPTEFQGNVIGLLNKRKGTIVDTDTGEDEFTLEAEVPLNDM 50 (78)
T ss_pred EEEEEecHHHHHHHHHHHHHCCCEEeceEecCCeEEEEEEECHHHh
Confidence 344444 356668888888888863344444578999999875544
No 36
>COG3643 Glutamate formiminotransferase [Amino acid transport and metabolism]
Probab=70.13 E-value=4.5 Score=34.64 Aligned_cols=58 Identities=19% Similarity=0.185 Sum_probs=44.1
Q ss_pred cChhhHHHHHHHHhccCCe---eEEEeCCCCEEEEEeecCHHHHHHHHhc--CCceEEeecCC
Q 048021 20 NDPAWFSSMTKVLRKIKGA---SYTIDAEEGMAYITGRANPRKLLRKLQS--GKYANLCWVSA 77 (143)
Q Consensus 20 ~C~gC~~kIkkaL~kI~GV---sV~VDle~~~VtV~G~vdp~~Ll~aI~k--Gk~Ael~~~~~ 77 (143)
.-..-..+|..+..++++| +|+.|..-++-.++-..||+++++|.-+ ++.++++-...
T Consensus 15 ~~~~~ie~i~a~~~~~~~v~ildve~danhNRsViT~vgdp~~~~~A~f~~ik~AaelIDM~~ 77 (302)
T COG3643 15 RDLEKIEKIVAAAKSIPTVKILDVEMDANHNRSVITLVGDPSKVVNAAFALIKKAAELIDMRN 77 (302)
T ss_pred ccHHHHHHHHHHHhcCCceEEEEeccCCCCCceEEEEecChHHHHHHHHHHHHHHHHhhchhc
Confidence 3444566777888888998 3666666677777777899999999888 88899988753
No 37
>PRK10568 periplasmic protein; Provisional
Probab=69.55 E-value=9.4 Score=30.52 Aligned_cols=37 Identities=11% Similarity=0.215 Sum_probs=29.4
Q ss_pred cChhhHHHHHHHHhccCCe---eEEEeCCCCEEEEEeecC
Q 048021 20 NDPAWFSSMTKVLRKIKGA---SYTIDAEEGMAYITGRAN 56 (143)
Q Consensus 20 ~C~gC~~kIkkaL~kI~GV---sV~VDle~~~VtV~G~vd 56 (143)
.-..=..+|+.+|..-+++ .+.|...+|.|++.|.++
T Consensus 57 ~D~~I~~~v~~~L~~~~~i~~~~I~V~v~~G~V~L~G~V~ 96 (203)
T PRK10568 57 DDSAITAKVKAALVDHDNIKSTDISVKTHQKVVTLSGFVE 96 (203)
T ss_pred cHHHHHHHHHHHHHhCCCCCCCceEEEEECCEEEEEEEeC
Confidence 3444567888888876666 489999999999999986
No 38
>cd03711 Tet_C Tet_C: C-terminus of ribosomal protection proteins Tet(M) and Tet(O). This domain has homology to the C terminal domains of the elongation factors EF-G and EF-2. Tet(M) and Tet(O) catalyze the release of tetracycline (Tc) from the ribosome in a GTP-dependent manner thereby mediating Tc resistance. Tcs are broad-spectrum antibiotics. Typical Tcs bind to the ribosome and inhibit the elongation phase of protein synthesis, by inhibiting the occupation of site A by aminoacyl-tRNA.
Probab=69.18 E-value=25 Score=23.37 Aligned_cols=45 Identities=7% Similarity=-0.020 Sum_probs=31.1
Q ss_pred EEeecC-hhhHHHHHHHHhccCCeeEEEeCCCCEEEEEeecCHHHH
Q 048021 16 KVDTND-PAWFSSMTKVLRKIKGASYTIDAEEGMAYITGRANPRKL 60 (143)
Q Consensus 16 kV~m~C-~gC~~kIkkaL~kI~GVsV~VDle~~~VtV~G~vdp~~L 60 (143)
++++.| ..+..+|...|.+-.|.-+..+..++.+.|++.++...+
T Consensus 5 ~~~i~~p~~~~g~v~~~l~~rrg~i~~~~~~~~~~~i~~~~P~~~~ 50 (78)
T cd03711 5 RFELEVPQDALGRAMSDLAKMGATFEDPQIKGDEVTLEGTIPVATS 50 (78)
T ss_pred EEEEEcCHHHHHHHHHHHHHcCCEeeCcEecCCEEEEEEEECHHHH
Confidence 344444 457778999999888863344445578999999885554
No 39
>PF08002 DUF1697: Protein of unknown function (DUF1697); InterPro: IPR012545 This family contains many hypothetical bacterial proteins.; PDB: 2HIY_B.
Probab=68.98 E-value=21 Score=26.87 Aligned_cols=50 Identities=14% Similarity=0.242 Sum_probs=35.0
Q ss_pred HHHHHHHHhccCCe-eEEEeCCCCEEEEEeecCHHHHHHHHhc------CCceEEeec
Q 048021 25 FSSMTKVLRKIKGA-SYTIDAEEGMAYITGRANPRKLLRKLQS------GKYANLCWV 75 (143)
Q Consensus 25 ~~kIkkaL~kI~GV-sV~VDle~~~VtV~G~vdp~~Ll~aI~k------Gk~Ael~~~ 75 (143)
-+.++.+|.++ |. +|..=+.+|-|.++...+++.|..+|++ |+++.++--
T Consensus 21 MaeLr~~l~~~-Gf~~V~Tyi~SGNvvf~~~~~~~~l~~~ie~~l~~~fG~~v~v~vr 77 (137)
T PF08002_consen 21 MAELREALEDL-GFTNVRTYIQSGNVVFESDRDPAELAAKIEKALEERFGFDVPVIVR 77 (137)
T ss_dssp HHHHHHHHHHC-T-EEEEEETTTTEEEEEESS-HHHHHHHHHHHHHHH-TT---EEEE
T ss_pred HHHHHHHHHHc-CCCCceEEEeeCCEEEecCCChHHHHHHHHHHHHHhcCCCeEEEEe
Confidence 35678888886 77 7999999999999977887777766654 998765544
No 40
>cd03713 EFG_mtEFG_C EFG_mtEFG_C: domains similar to the C-terminal domain of the bacterial translational elongation factor (EF) EF-G. Included in this group is the C-terminus of mitochondrial Elongation factor G1 (mtEFG1) and G2 (mtEFG2) proteins. Eukaryotic cells harbor 2 protein synthesis systems: one localized in the cytoplasm, the other in the mitochondria. Most factors regulating mitochondrial protein synthesis are encoded by nuclear genes, translated in the cytoplasm, and then transported to the mitochondria. The eukaryotic system of elongation factor (EF) components is more complex than that in prokaryotes, with both cytoplasmic and mitochondrial elongation factors and multiple isoforms being expressed in certain species. During the process of peptide synthesis and tRNA site changes, the ribosome is moved along the mRNA a distance equal to one codon with the addition of each amino acid. In bacteria this translocation step is catalyzed by EF-G_GTP, which is hydrolyzed to provide
Probab=68.79 E-value=27 Score=23.06 Aligned_cols=46 Identities=13% Similarity=0.239 Sum_probs=30.8
Q ss_pred EEEeecC-hhhHHHHHHHHhccCCeeEEEeCCCCEEEEEeecCHHHH
Q 048021 15 LKVDTND-PAWFSSMTKVLRKIKGASYTIDAEEGMAYITGRANPRKL 60 (143)
Q Consensus 15 lkV~m~C-~gC~~kIkkaL~kI~GVsV~VDle~~~VtV~G~vdp~~L 60 (143)
++|++.| +.+..+|...|.+-.|.-+..+...+.+.|.+.++...+
T Consensus 4 ~~~~I~~p~~~~g~v~~~l~~rrg~i~~~~~~~~~~~i~~~iP~~e~ 50 (78)
T cd03713 4 MKVEVTVPEEYMGDVIGDLSSRRGQILGTESRGGWKVIKAEVPLAEM 50 (78)
T ss_pred EEEEEEcCHHHHHHHHHHHHHcCCceEceeccCCcEEEEEEcCHHHH
Confidence 3455555 457778888898888863333344568899999875543
No 41
>cd02966 TlpA_like_family TlpA-like family; composed of TlpA, ResA, DsbE and similar proteins. TlpA, ResA and DsbE are bacterial protein disulfide reductases with important roles in cytochrome maturation. They are membrane-anchored proteins with a soluble TRX domain containing a CXXC motif located in the periplasm. The TRX domains of this family contain an insert, approximately 25 residues in length, which correspond to an extra alpha helix and a beta strand when compared with TRX. TlpA catalyzes an essential reaction in the biogenesis of cytochrome aa3, while ResA and DsbE are essential proteins in cytochrome c maturation. Also included in this family are proteins containing a TlpA-like TRX domain with domain architectures similar to E. coli DipZ protein, and the N-terminal TRX domain of PilB protein from Neisseria which acts as a disulfide reductase that can recylce methionine sulfoxide reductases.
Probab=68.08 E-value=30 Score=22.42 Aligned_cols=54 Identities=13% Similarity=0.146 Sum_probs=29.7
Q ss_pred EEEEEEeecChhhHHHHHHHHhcc------CCee-EEEeCCCCEEEEEeecCHHHHHHHHhc-CCceEEee
Q 048021 12 TCGLKVDTNDPAWFSSMTKVLRKI------KGAS-YTIDAEEGMAYITGRANPRKLLRKLQS-GKYANLCW 74 (143)
Q Consensus 12 tvvlkV~m~C~gC~~kIkkaL~kI------~GVs-V~VDle~~~VtV~G~vdp~~Ll~aI~k-Gk~Ael~~ 74 (143)
++++-....|+.|.+.+.. |.++ +++. +.|+.+.. +++.+.+.+++ +....++.
T Consensus 22 ~ll~f~~~~C~~C~~~~~~-l~~~~~~~~~~~~~~~~v~~d~~--------~~~~~~~~~~~~~~~~~~~~ 83 (116)
T cd02966 22 VLVNFWASWCPPCRAEMPE-LEALAKEYKDDGVEVVGVNVDDD--------DPAAVKAFLKKYGITFPVLL 83 (116)
T ss_pred EEEEeecccChhHHHHhHH-HHHHHHHhCCCCeEEEEEECCCC--------CHHHHHHHHHHcCCCcceEE
Confidence 3333345689999865433 3332 2332 22222221 48889999888 86555544
No 42
>PF09580 Spore_YhcN_YlaJ: Sporulation lipoprotein YhcN/YlaJ (Spore_YhcN_YlaJ); InterPro: IPR019076 This entry contains YhcN and YlaJ, which are predicted lipoproteins that have been detected as spore proteins but not vegetative proteins in Bacillus subtilis. Both appear to be expressed under control of the RNA polymerase sigma-G factor. The YlaJ-like members of this family have a low-complexity, strongly acidic, 40-residue C-terminal domain.
Probab=67.39 E-value=22 Score=27.12 Aligned_cols=45 Identities=16% Similarity=0.303 Sum_probs=30.5
Q ss_pred hhhHHHHHHHHhccCCe-eEEEeCCCCEEEEEeecC---------HHHHHHHHhc
Q 048021 22 PAWFSSMTKVLRKIKGA-SYTIDAEEGMAYITGRAN---------PRKLLRKLQS 66 (143)
Q Consensus 22 ~gC~~kIkkaL~kI~GV-sV~VDle~~~VtV~G~vd---------p~~Ll~aI~k 66 (143)
..=+.+|.+.+.+++|| ++.|=..++.+.|--.++ ..+|.++|++
T Consensus 74 ~~~a~~i~~~v~~~~~V~~A~vvv~~~~a~Vav~~~~~~~~~~~i~~~V~~~v~~ 128 (177)
T PF09580_consen 74 QQLADRIANRVKKVPGVEDATVVVTDDNAYVAVDLDFNRFNTKKIKKKVEKAVKS 128 (177)
T ss_pred HHHHHHHHHHHhcCCCceEEEEEEECCEEEEEEEecccccchhHHHHHHHHHHHH
Confidence 45678999999999999 666666666666653332 3455555555
No 43
>PRK11023 outer membrane lipoprotein; Provisional
Probab=66.65 E-value=12 Score=29.69 Aligned_cols=46 Identities=17% Similarity=0.224 Sum_probs=33.2
Q ss_pred ChhhHHHHHHHHhccCCe----eEEEeCCCCEEEEEeecCHHHHHHHHhc
Q 048021 21 DPAWFSSMTKVLRKIKGA----SYTIDAEEGMAYITGRANPRKLLRKLQS 66 (143)
Q Consensus 21 C~gC~~kIkkaL~kI~GV----sV~VDle~~~VtV~G~vdp~~Ll~aI~k 66 (143)
-..=+.+|+.+|..-+++ .+.|...+|.|++.|.++.+......++
T Consensus 47 D~~i~~~V~~aL~~~~~l~~~~~I~V~v~~G~V~L~G~V~~~~~k~~A~~ 96 (191)
T PRK11023 47 DGTLELRVNNALSKDEQIKKEARINVTAYQGKVLLTGQSPNAELSERAKQ 96 (191)
T ss_pred hHHHHHHHHHHHhhCcccCcCceEEEEEECCEEEEEEEeCCHHHHHHHHH
Confidence 334467888888765544 4888889999999999876555555444
No 44
>cd03710 BipA_TypA_C BipA_TypA_C: a C-terminal portion of BipA or TypA having homology to the C terminal domains of the elongation factors EF-G and EF-2. A member of the ribosome binding GTPase superfamily, BipA is widely distributed in bacteria and plants. BipA is a highly conserved protein with global regulatory properties in Escherichia coli. BipA is phosphorylated on a tyrosine residue under some cellular conditions. Mutants show altered regulation of some pathways. BipA functions as a translation factor that is required specifically for the expression of the transcriptional modulator Fis. BipA binds to ribosomes at a site that coincides with that of EF-G and has a GTPase activity that is sensitive to high GDP:GTP ratios and, is stimulated by 70S ribosomes programmed with mRNA and aminoacylated tRNAs. The growth rate-dependent induction of BipA allows the efficient expression of Fis, thereby modulating a range of downstream processes, including DNA metabolism and type III secreti
Probab=66.23 E-value=29 Score=23.25 Aligned_cols=59 Identities=15% Similarity=0.204 Sum_probs=35.6
Q ss_pred eEEEEEEeecChhhHHHHHHHHhccCCeeEEEeCC-CCEEEEEeecCHHHHHHHHhcCCceEEeecCC
Q 048021 11 MTCGLKVDTNDPAWFSSMTKVLRKIKGASYTIDAE-EGMAYITGRANPRKLLRKLQSGKYANLCWVSA 77 (143)
Q Consensus 11 ~tvvlkV~m~C~gC~~kIkkaL~kI~GVsV~VDle-~~~VtV~G~vdp~~Ll~aI~kGk~Ael~~~~~ 77 (143)
+.+++.|+ +.|..+|...|.+..|.-..++.. ++.+.|++.++...+. |+..+|.+...
T Consensus 4 ~~v~I~~P---~~~~g~V~~~l~~rrg~i~~~~~~~~~~~~i~~~~P~~~~~-----~~~~~Lrs~T~ 63 (79)
T cd03710 4 EELTIDVP---EEYSGAVIEKLGKRKGEMVDMEPDGNGRTRLEFKIPSRGLI-----GFRSEFLTDTR 63 (79)
T ss_pred EEEEEEeC---chhhHHHHHHHHhCCCEEEccEECCCCEEEEEEEECHHHHc-----CcHHHHHhhCC
Confidence 34444444 456668889898888863233333 3689999998855442 44444444433
No 45
>PF04468 PSP1: PSP1 C-terminal conserved region; InterPro: IPR007557 The yeast polymerase suppressor 1 (PSP1) protein partially suppresses mutations in DNA polymerases alpha and delta []. The C-terminal half of PSP1 contains a domain, which is also found in several hypothetical proteins from both eukaryotic and prokaryotic sources: Crithidia fasciculata RBP45 and RBP33, subunits of the cycling sequence binding protein (CSBP) II. RBP45 and RBP33 proteins bind specifically to the cycling sequences present in several mRNAs that accumulate periodically during the cell cycle. RBP45 and RBP33 are phosphoproteins, which are phosphorylated differentially during progression through the cell cycle. Hypothetical proteins with high sequence similarity have been identified in other kinetoplastid organisms []. Bacillus subtilis yaaT protein, which plays a significant role in phosphorelay during initiation of sporulation. It is possible that the yaaT protein is also related to DNA replication. The sequence of the yaaT protein is widely conserved in prokaryotes (bacteria and archaea), but the functions of the protein are unknown []. The actual biological significance of the PSP1 C-terminal domain has not yet been clearly established.
Probab=65.82 E-value=21 Score=24.99 Aligned_cols=53 Identities=11% Similarity=0.023 Sum_probs=38.7
Q ss_pred hhhHHHHHHHHhccCCeeEEEeCCCCEEEEE----eecCHHHHHHHHhc--CCceEEee
Q 048021 22 PAWFSSMTKVLRKIKGASYTIDAEEGMAYIT----GRANPRKLLRKLQS--GKYANLCW 74 (143)
Q Consensus 22 ~gC~~kIkkaL~kI~GVsV~VDle~~~VtV~----G~vdp~~Ll~aI~k--Gk~Ael~~ 74 (143)
.-|.+.|+..-..++=|+++...+.+++++- +.+|=..|++.|.+ +-++++..
T Consensus 29 ~~c~~~~~~~~L~m~lvd~e~~~D~~k~~fyy~a~~rvDFR~Lvr~L~~~f~~RIem~q 87 (88)
T PF04468_consen 29 KFCRELVKELGLPMKLVDVEYQFDGSKLTFYYTAESRVDFRELVRDLAREFKTRIEMRQ 87 (88)
T ss_pred HHHHHHHHHcCCCeEEEEEEEEcCCCEEEEEEEeCCcCcHHHHHHHHHHHhCceEEEEc
Confidence 4466666664434433368888899999985 46899999999998 77777753
No 46
>COG2151 PaaD Predicted metal-sulfur cluster biosynthetic enzyme [General function prediction only]
Probab=65.77 E-value=12 Score=27.84 Aligned_cols=32 Identities=16% Similarity=0.328 Sum_probs=24.1
Q ss_pred EEEEEEeecChhh------HHHHHHHHhccCCe-eEEEe
Q 048021 12 TCGLKVDTNDPAW------FSSMTKVLRKIKGA-SYTID 43 (143)
Q Consensus 12 tvvlkV~m~C~gC------~~kIkkaL~kI~GV-sV~VD 43 (143)
.+.+++.++-.|| ...|+.+|+.++|| +++|+
T Consensus 50 ~v~v~mtlT~~gCP~~~~i~~~v~~al~~~~~v~~v~V~ 88 (111)
T COG2151 50 LVKVKMTLTSPGCPLAEVIADQVEAALEEIPGVEDVEVE 88 (111)
T ss_pred eEEEEEecCCCCCCccHHHHHHHHHHHHhcCCcceEEEE
Confidence 4556666677888 57899999999998 55554
No 47
>TIGR03406 FeS_long_SufT probable FeS assembly SUF system protein SufT. The function is unknown for this protein family, but members are found almost always in operons for the the SUF system of iron-sulfur cluster biosynthesis. The SUF system is present elsewhere on the chromosome for those few species where SUF genes are not adjacent. This family shares this property of association with the SUF system with a related family, TIGR02945. TIGR02945 consists largely of a DUF59 domain (see Pfam family pfam01883), while this protein is about double the length, with a unique N-terminal domain and DUF59 C-terminal domain. A location immediately downstream of the cysteine desulfurase gene sufS in many contexts suggests the gene symbol sufT. Note that some other homologs of this family and of TIGR02945, but no actual members of this family, are found in operons associated with phenylacetic acid (or other ring-hydroxylating) degradation pathways.
Probab=65.14 E-value=10 Score=30.03 Aligned_cols=33 Identities=3% Similarity=0.134 Sum_probs=23.5
Q ss_pred EEEEEEeecChhhH------HHHHHHHhccCCe-eEEEeC
Q 048021 12 TCGLKVDTNDPAWF------SSMTKVLRKIKGA-SYTIDA 44 (143)
Q Consensus 12 tvvlkV~m~C~gC~------~kIkkaL~kI~GV-sV~VDl 44 (143)
++.+++.++..+|. +.|+.+|.+++|| +|+|++
T Consensus 114 ~V~I~mtLt~p~c~~~~~L~~dV~~aL~~l~gV~~V~V~l 153 (174)
T TIGR03406 114 RVDIEMTLTAPGCGMGPVLVEDVEDKVLAVPNVDEVEVEL 153 (174)
T ss_pred EEEEEEEeCCCCCcHHHHHHHHHHHHHHhCCCceeEEEEE
Confidence 46677777666654 4589999999998 566654
No 48
>PF07837 FTCD_N: Formiminotransferase domain, N-terminal subdomain; InterPro: IPR012886 The formiminotransferase (FT) domain of formiminotransferase-cyclodeaminase (FTCD) forms a homodimer, with each protomer being comprised of two subdomains. The formiminotransferase domain has an N-terminal subdomain that is made up of a six-stranded mixed beta-pleated sheet and five alpha helices, which are arranged on the external surface of the beta sheet. This, in turn, faces the beta-sheet of the C-terminal subdomain to form a double beta-sheet layer. The two subdomains are separated by a short linker sequence, which is not thought to be any more flexible than the remainder of the molecule. The substrate is predicted to form a number of contacts with residues found in both the N-terminal and C-terminal subdomains []. This entry represents the N-terminal subdomain of the formiminotransferase domain.; GO: 0005542 folic acid binding, 0016740 transferase activity, 0008152 metabolic process; PDB: 2PFD_C 1QD1_B.
Probab=64.26 E-value=34 Score=27.54 Aligned_cols=64 Identities=19% Similarity=0.207 Sum_probs=43.8
Q ss_pred EEEEE-eecChhhHHHHHHHHhccCCe---eEEEeCCCCEEEEEeecCHHHHHHHHhc--CCceEEeecC
Q 048021 13 CGLKV-DTNDPAWFSSMTKVLRKIKGA---SYTIDAEEGMAYITGRANPRKLLRKLQS--GKYANLCWVS 76 (143)
Q Consensus 13 vvlkV-~m~C~gC~~kIkkaL~kI~GV---sV~VDle~~~VtV~G~vdp~~Ll~aI~k--Gk~Ael~~~~ 76 (143)
|+..+ ++.-..=..+|.+++++.+|| ++.-|.+-++..++=..+|+.|.+++-. ...++++-..
T Consensus 5 CvpN~SEgR~~~~ie~I~~a~~~~~gv~ll~~~~D~~~NRsv~T~vG~p~~v~~a~~~~~~~A~~~IDm~ 74 (178)
T PF07837_consen 5 CVPNFSEGRDKEVIEAIAKAARNVPGVKLLDVFSDADYNRSVITLVGEPEAVAEAAFAAIRKALELIDMR 74 (178)
T ss_dssp E--EES-SSSHHHHHHHHHHCCTSTTEEEEEEEEETTTTEEEEEEEE-HHHHHHHHHHHHHHHHCC--GG
T ss_pred EeeeeccCCCHHHHHHHHHHHHcCCCCEEEecCCCCCCCCCeEEEeeChHHHHHHHHHHHHHHHHhcchh
Confidence 45555 345666778899999999998 4777888888888866678888888766 6666666554
No 49
>cd04098 eEF2_C_snRNP eEF2_C_snRNP: This family includes a C-terminal portion of the spliceosomal human 116kD U5 small nuclear ribonucleoprotein (snRNP) protein (U5-116 kD) and, its yeast counterpart Snu114p. This domain is homologous to the C-terminal domain of the eukaryotic translational elongation factor EF-2. Yeast Snu114p is essential for cell viability and for splicing in vivo. U5-116 kD binds GTP. Experiments suggest that GTP binding and probably GTP hydrolysis is important for the function of the U5-116 kD/Snu114p. In complex with GTP, EF-2 promotes the translocation step of translation. During translocation the peptidyl-tRNA is moved from the A site to the P site, the uncharged tRNA from the P site to the E-site and, the mRNA is shifted one codon relative to the ribosome.
Probab=64.18 E-value=32 Score=23.19 Aligned_cols=59 Identities=15% Similarity=0.091 Sum_probs=35.0
Q ss_pred EEEeecC-hhhHHHHHHHHhccCCeeEEEeCCCCE--EEEEeecCHHHHHHHHhcCCceEEeecCCC
Q 048021 15 LKVDTND-PAWFSSMTKVLRKIKGASYTIDAEEGM--AYITGRANPRKLLRKLQSGKYANLCWVSAG 78 (143)
Q Consensus 15 lkV~m~C-~gC~~kIkkaL~kI~GVsV~VDle~~~--VtV~G~vdp~~Ll~aI~kGk~Ael~~~~~~ 78 (143)
++|++.| ..|..+|...|.+-.|.-+..+..++. ..|+|.++..++. |+..+|.+...|
T Consensus 4 ~~~ei~~p~~~~g~v~~~L~~rrg~i~~~~~~~~~~~~~i~a~vP~~e~~-----~~~~~Lrs~T~G 65 (80)
T cd04098 4 YEVEITCPADAVSAVYEVLSRRRGHVIYDTPIPGTPLYEVKAFIPVIESF-----GFETDLRVHTQG 65 (80)
T ss_pred EEEEEEECHHHHhHHHHHHhhCCcEEeeeeccCCCCcEEEEEECCHHHHh-----ChHHHHHhhCCC
Confidence 3444443 466778888898888863334444444 8899998744332 455555544433
No 50
>smart00749 BON bacterial OsmY and nodulation domain.
Probab=63.59 E-value=21 Score=21.41 Aligned_cols=34 Identities=15% Similarity=0.162 Sum_probs=25.6
Q ss_pred HHHHHHHhccCCe-e--EEEeCCCCEEEEEeecCHHH
Q 048021 26 SSMTKVLRKIKGA-S--YTIDAEEGMAYITGRANPRK 59 (143)
Q Consensus 26 ~kIkkaL~kI~GV-s--V~VDle~~~VtV~G~vdp~~ 59 (143)
.+|+++|...+.+ . +.+....+.+++.|.+.+..
T Consensus 2 ~~v~~~l~~~~~~~~~~~~v~~~~~~vvL~g~~~~~~ 38 (62)
T smart00749 2 EKVKKALAKDGLIKADSIVVVTDGGVVVLLGGVVDNA 38 (62)
T ss_pred hhHHHHHhhCCCCCcCceEEEEECCEEEEeeecCCHH
Confidence 5788888887766 2 88888899999988764333
No 51
>PF01206 TusA: Sulfurtransferase TusA; InterPro: IPR001455 SirA functions as a response regulator as part of a two-component system, where BarA is the sensor kinase. This system increases the expression of virulence genes and decreases the expression of motility genes []. BarA phosphorylates SirA, thereby activating the protein. Phosphorylated SirA directly activates virulence expression by interacting with hilA and hilC promoters, while repressing the flagellar regulon indirectly by binding to the csrB promoter, which in turn affects flagellar gene expression. Orthologues of SirA from Salmonella spp. can be found throughout proteobacteria, such as GacA in Psuedomonas spp., VarA in Vibrio cholerae, ExpA in Erwinia carotovora, LetA in Legionella pneumophila, and UvrY in Escherichia coli []. A sensor kinase for SirA is present in each of these organisms as well; the sensor kinase is known as BarA in E. coli and Salmonella spp., but has different names in other genera. In different species, SirA/BarA orthologues are required for virulence gene expression, exoenzyme and antibiotic production, motility, and biofilm formation. The structure of SirA consists of an alpha/beta sandwich with a beta-alpha-beta-alpha-beta(2) fold, comprising a mixed four-stranded beta-sheet stacked against two alpha-helices, both of which are nearly parallel to the strands of the beta-sheet []. Several uncharacterised bacterial proteins (73 to 81 amino-acid residues in length) that contain a well-conserved region in their N-terminal region show structural similarity to the SirA protein, including the E. coli protein YedF (P0AA31 from SWISSPROT), and other members of the UPF0033 family.; GO: 0016783 sulfurtransferase activity, 0008033 tRNA processing, 0005737 cytoplasm; PDB: 3LVJ_D 3LVK_B 1DCJ_A 3HZ7_A 1JDQ_A 1JE3_A 1PAV_A.
Probab=63.53 E-value=28 Score=22.47 Aligned_cols=47 Identities=13% Similarity=0.031 Sum_probs=32.8
Q ss_pred ecChhhHHHHHHHHhccCCeeEEEeCCCCEEEEEee--cCHHHHHHHHhc-CCceEEe
Q 048021 19 TNDPAWFSSMTKVLRKIKGASYTIDAEEGMAYITGR--ANPRKLLRKLQS-GKYANLC 73 (143)
Q Consensus 19 m~C~gC~~kIkkaL~kI~GVsV~VDle~~~VtV~G~--vdp~~Ll~aI~k-Gk~Ael~ 73 (143)
..|+...-+++++|++++. ...+.|..+ .....|.+.+++ |+...-+
T Consensus 8 ~~CP~Pll~~~~~l~~l~~--------G~~l~v~~d~~~~~~di~~~~~~~g~~~~~~ 57 (70)
T PF01206_consen 8 LSCPMPLLKAKKALKELPP--------GEVLEVLVDDPAAVEDIPRWCEENGYEVVEV 57 (70)
T ss_dssp -STTHHHHHHHHHHHTSGT--------T-EEEEEESSTTHHHHHHHHHHHHTEEEEEE
T ss_pred CCCCHHHHHHHHHHHhcCC--------CCEEEEEECCccHHHHHHHHHHHCCCEEEEE
Confidence 4899999999999999852 234444432 346778888888 9964433
No 52
>COG3062 NapD Uncharacterized protein involved in formation of periplasmic nitrate reductase [Inorganic ion transport and metabolism]
Probab=62.65 E-value=23 Score=25.92 Aligned_cols=45 Identities=22% Similarity=0.347 Sum_probs=32.1
Q ss_pred hhhHHHHHHHHhccCCeeEEEeCCCCEEEEE-eecCHHHHHHHHhc
Q 048021 22 PAWFSSMTKVLRKIKGASYTIDAEEGMAYIT-GRANPRKLLRKLQS 66 (143)
Q Consensus 22 ~gC~~kIkkaL~kI~GVsV~VDle~~~VtV~-G~vdp~~Ll~aI~k 66 (143)
+.=..+|+.+|..|+|++|-..-.+|++.|. -.-+...|++.|+.
T Consensus 17 pe~l~av~~~L~~ip~~EV~~~d~~GKlVVVie~~~~~~l~~tie~ 62 (94)
T COG3062 17 PERLSAVKTALLAIPGCEVYGEDAEGKLVVVIEAEDSETLLETIES 62 (94)
T ss_pred HHHHHHHHHHHhcCCCcEeeccCCCceEEEEEEcCchHHHHHHHHH
Confidence 4456789999999999986554443555554 44578888888876
No 53
>PF15643 Tox-PL-2: Papain fold toxin 2
Probab=61.67 E-value=7.8 Score=28.59 Aligned_cols=19 Identities=16% Similarity=0.205 Sum_probs=16.2
Q ss_pred ecChhhHHHHHHHHhccCCe
Q 048021 19 TNDPAWFSSMTKVLRKIKGA 38 (143)
Q Consensus 19 m~C~gC~~kIkkaL~kI~GV 38 (143)
..|..|+.+|++.|.+ .|+
T Consensus 19 ~qC~~cA~Al~~~L~~-~gI 37 (100)
T PF15643_consen 19 FQCVECASALKQFLKQ-AGI 37 (100)
T ss_pred eehHHHHHHHHHHHHH-CCC
Confidence 5899999999999976 455
No 54
>PF00013 KH_1: KH domain syndrome, contains KH motifs.; InterPro: IPR018111 The K homology (KH) domain was first identified in the human heterogeneous nuclear ribonucleoprotein (hnRNP) K. It is a domain of around 70 amino acids that is present in a wide variety of quite diverse nucleic acid-binding proteins []. It has been shown to bind RNA [, ]. Like many other RNA-binding motifs, KH motifs are found in one or multiple copies (14 copies in chicken vigilin) and, at least for hnRNP K (three copies) and FMR-1 (two copies), each motif is necessary for in vitro RNA binding activity, suggesting that they may function cooperatively or, in the case of single KH motif proteins (for example, Mer1p), independently []. According to structural [, , ] analysis the KH domain can be separated in two groups. The first group or type-1 contain a beta-alpha-alpha-beta-beta-alpha structure, whereas in the type-2 the two last beta-sheet are located in the N-terminal part of the domain (alpha-beta-beta-alpha-alpha-beta). Sequence similarity between these two folds are limited to a short region (VIGXXGXXI) in the RNA binding motif. This motif is located between helice 1 and 2 in type-1 and between helice 2 and 3 in type-2. Proteins known to contain a type-1 KH domain include bacterial polyribonucleotide nucleotidyltransferases (2.7.7.8 from EC); vertebrate fragile X mental retardation protein 1 (FMR1); eukaryotic heterogeneous nuclear ribonucleoprotein K (hnRNP K), one of at least 20 major proteins that are part of hnRNP particles in mammalian cells; mammalian poly(rC) binding proteins; Artemia salina glycine-rich protein GRP33; yeast PAB1-binding protein 2 (PBP2); vertebrate vigilin; and human high-density lipoprotein binding protein (HDL-binding protein). More information about these proteins can be found at Protein of the Month: RNA Exosomes [].; GO: 0003723 RNA binding; PDB: 1TUA_A 2Z0S_A 1WE8_A 4AM3_B 4AIM_A 4AID_A 2HH3_A 2JVZ_A 1J4W_A 2HH2_A ....
Probab=61.56 E-value=18 Score=22.63 Aligned_cols=37 Identities=16% Similarity=0.406 Sum_probs=25.4
Q ss_pred HHHHHHHhccCCeeEEEeCC--CCEEEEEeecCHHHHHHHHh
Q 048021 26 SSMTKVLRKIKGASYTIDAE--EGMAYITGRANPRKLLRKLQ 65 (143)
Q Consensus 26 ~kIkkaL~kI~GVsV~VDle--~~~VtV~G~vdp~~Ll~aI~ 65 (143)
..|++ |++.-|+.+.++.+ ...|+|.| +++.+.+|++
T Consensus 20 ~~i~~-I~~~t~~~I~i~~~~~~~~v~I~G--~~~~v~~A~~ 58 (60)
T PF00013_consen 20 SNIKE-IEEETGVKIQIPDDDERDIVTISG--SPEQVEKAKK 58 (60)
T ss_dssp HHHHH-HHHHHTSEEEEESTTEEEEEEEEE--SHHHHHHHHH
T ss_pred CcHHH-hhhhcCeEEEEcCCCCcEEEEEEe--CHHHHHHHHh
Confidence 34444 55555888888765 45888888 8888887764
No 55
>PRK13763 putative RNA-processing protein; Provisional
Probab=60.67 E-value=35 Score=26.81 Aligned_cols=39 Identities=23% Similarity=0.365 Sum_probs=27.3
Q ss_pred HHHHHHHhccCCeeEEEeCCCCEEEEEe-e-cCHHHHHHHH
Q 048021 26 SSMTKVLRKIKGASYTIDAEEGMAYITG-R-ANPRKLLRKL 64 (143)
Q Consensus 26 ~kIkkaL~kI~GVsV~VDle~~~VtV~G-~-vdp~~Ll~aI 64 (143)
.++-+.|.+.-|+.+++|.++++|.|.. . .|++.+.+|.
T Consensus 22 Gk~Ik~I~e~tg~~I~i~~~~g~V~I~~~~~~d~~~i~kA~ 62 (180)
T PRK13763 22 GETKKEIEERTGVKLEIDSETGEVIIEPTDGEDPLAVLKAR 62 (180)
T ss_pred hhHHHHHHHHHCcEEEEECCCCeEEEEeCCCCCHHHHHHHH
Confidence 4556667776789999999989999972 1 4555554443
No 56
>KOG3411 consensus 40S ribosomal protein S19 [Translation, ribosomal structure and biogenesis]
Probab=60.19 E-value=9.8 Score=29.61 Aligned_cols=42 Identities=10% Similarity=0.154 Sum_probs=29.4
Q ss_pred hhhHHHHHHHHhccCCee-EEEeCCCC-EEEEEeecCHHHHHHHHhc
Q 048021 22 PAWFSSMTKVLRKIKGAS-YTIDAEEG-MAYITGRANPRKLLRKLQS 66 (143)
Q Consensus 22 ~gC~~kIkkaL~kI~GVs-V~VDle~~-~VtV~G~vdp~~Ll~aI~k 66 (143)
++|.++ +|++|+.+. |+.+.+.+ +++=.|.-|.+.|...|..
T Consensus 97 ~~i~rk---vlQ~Le~~~~ve~hp~gGR~lt~~GqrdldrIa~~i~~ 140 (143)
T KOG3411|consen 97 GGIARK---VLQALEKMGIVEKHPKGGRRLTEQGQRDLDRIAGQIRE 140 (143)
T ss_pred cHHHHH---HHHHHHhCCceeeCCCCcceeCcccchhHHHHHHHHHh
Confidence 556654 555555553 78888765 5666699999999998865
No 57
>PF03958 Secretin_N: Bacterial type II/III secretion system short domain; InterPro: IPR005644 This is a group of NolW-like proteins, which are closely related to bacterial type II and III secretion system protein (IPR004846 from INTERPRO).; PDB: 3EZJ_C 2Y3M_A 3OSS_D.
Probab=60.16 E-value=20 Score=23.54 Aligned_cols=28 Identities=7% Similarity=0.186 Sum_probs=20.6
Q ss_pred eeEEEeCCCCEEEEEeec-CHHHHHHHHh
Q 048021 38 ASYTIDAEEGMAYITGRA-NPRKLLRKLQ 65 (143)
Q Consensus 38 VsV~VDle~~~VtV~G~v-dp~~Ll~aI~ 65 (143)
+.|.+|...+.+.|.|.. +-+.|.+.|+
T Consensus 46 ~~i~~d~~tNsliv~g~~~~~~~i~~li~ 74 (82)
T PF03958_consen 46 GRIVADERTNSLIVRGTPEDLEQIRELIK 74 (82)
T ss_dssp TEEEEECTTTEEEEEEEHHHHHHHHHHHH
T ss_pred eEEEEECCCCEEEEEeCHHHHHHHHHHHH
Confidence 469999999999999974 4444444444
No 58
>PRK11200 grxA glutaredoxin 1; Provisional
Probab=60.08 E-value=23 Score=23.63 Aligned_cols=32 Identities=9% Similarity=0.047 Sum_probs=20.8
Q ss_pred EEEEeecChhhHHHHHHHHhcc----CCee-EEEeCCC
Q 048021 14 GLKVDTNDPAWFSSMTKVLRKI----KGAS-YTIDAEE 46 (143)
Q Consensus 14 vlkV~m~C~gC~~kIkkaL~kI----~GVs-V~VDle~ 46 (143)
++=.--.|..|. ++++.|..+ .|+. ..+|..+
T Consensus 4 ~iy~~~~C~~C~-~a~~~L~~l~~~~~~i~~~~idi~~ 40 (85)
T PRK11200 4 VIFGRPGCPYCV-RAKELAEKLSEERDDFDYRYVDIHA 40 (85)
T ss_pred EEEeCCCChhHH-HHHHHHHhhcccccCCcEEEEECCC
Confidence 333334799998 667778886 6773 5555554
No 59
>cd03419 GRX_GRXh_1_2_like Glutaredoxin (GRX) family, GRX human class 1 and 2 (h_1_2)-like subfamily; composed of proteins similar to human GRXs, approximately 10 kDa in size, and proteins containing a GRX or GRX-like domain. GRX is a glutathione (GSH) dependent reductase, catalyzing the disulfide reduction of target proteins such as ribonucleotide reductase. It contains a redox active CXXC motif in a TRX fold and uses a similar dithiol mechanism employed by TRXs for intramolecular disulfide bond reduction of protein substrates. Unlike TRX, GRX has preference for mixed GSH disulfide substrates, in which it uses a monothiol mechanism where only the N-terminal cysteine is required. The flow of reducing equivalents in the GRX system goes from NADPH - GSH reductase - GSH - GRX - protein substrates. By altering the redox state of target proteins, GRX is involved in many cellular functions including DNA synthesis, signal transduction and the defense against oxidative stress. Different classes
Probab=59.82 E-value=43 Score=21.42 Aligned_cols=18 Identities=11% Similarity=0.172 Sum_probs=13.8
Q ss_pred eecChhhHHHHHHHHhccC
Q 048021 18 DTNDPAWFSSMTKVLRKIK 36 (143)
Q Consensus 18 ~m~C~gC~~kIkkaL~kI~ 36 (143)
...|+.|. ++++.|.++.
T Consensus 7 ~~~Cp~C~-~~~~~l~~~~ 24 (82)
T cd03419 7 KSYCPYCK-RAKSLLKELG 24 (82)
T ss_pred cCCCHHHH-HHHHHHHHcC
Confidence 45799998 6777788863
No 60
>TIGR03665 arCOG04150 arCOG04150 universal archaeal KH domain protein. This family of proteins is universal among the 41 archaeal genomes analyzed in and is not observed outside of the archaea. The proteins contain a single KH domain (pfam00013) which is likely to confer the ability to bind RNA.
Probab=59.82 E-value=20 Score=27.88 Aligned_cols=37 Identities=22% Similarity=0.386 Sum_probs=26.1
Q ss_pred HHHHHHhccCCeeEEEeCCCCEEEEE-eecCHHHHHHH
Q 048021 27 SMTKVLRKIKGASYTIDAEEGMAYIT-GRANPRKLLRK 63 (143)
Q Consensus 27 kIkkaL~kI~GVsV~VDle~~~VtV~-G~vdp~~Ll~a 63 (143)
+.-+.|.+.-|+.+++|.++++|.|. ...||..+.+|
T Consensus 18 ~~Ik~I~~~tgv~I~Id~~~g~V~I~~~t~d~~~i~kA 55 (172)
T TIGR03665 18 ETKKEIEERTGVKLDIDSETGEVKIEEEDEDPLAVMKA 55 (172)
T ss_pred hHHHHHHHHhCcEEEEEcCCceEEEecCCCCHHHHHHH
Confidence 34445666668899999998999994 23566666655
No 61
>cd02410 archeal_CPSF_KH The archaeal cleavage and polyadenylation specificity factor (CPSF) contains an N-terminal K homology RNA-binding domain (KH). The archeal CPSFs are predicted to be metal-dependent RNases belonging to the beta-CASP family, a subgroup enzymes within the metallo-beta-lactamase fold. The KH motif is a beta-alpha-alpha-beta-beta unit that folds into an alpha-beta structure with a three stranded beta-sheet interupted by two contiguous helices. In general, KH domains are known to bind single-stranded RNA or DNA and are found in a wide variety of proteins including ribosomal proteins, transcription factors and post-transcriptional modifiers of mRNA.
Probab=59.81 E-value=29 Score=27.10 Aligned_cols=61 Identities=13% Similarity=0.162 Sum_probs=41.7
Q ss_pred hhhHHHHHHHHhccCCe-eEEEeCCCCEEEEEee----c--CHHHHHHHHhc--CCceEEeecCCCCCCc
Q 048021 22 PAWFSSMTKVLRKIKGA-SYTIDAEEGMAYITGR----A--NPRKLLRKLQS--GKYANLCWVSAGNQIT 82 (143)
Q Consensus 22 ~gC~~kIkkaL~kI~GV-sV~VDle~~~VtV~G~----v--dp~~Ll~aI~k--Gk~Ael~~~~~~~s~~ 82 (143)
+.-.+.|++.+-+-.|| ++.+|.+.|.|+|... + -....++.|.+ |...+++..++-.|.+
T Consensus 52 e~A~~~I~~ivP~ea~i~di~Fd~~tGEV~IeaeKPG~ViGk~g~~~reI~~~tgW~p~vvRtpPi~S~t 121 (145)
T cd02410 52 EEAIKIILEIVPEEAGITDIYFDDDTGEVIIEAEKPGLVIGKGGSTLREITRETGWAPKVVRTPPIQSRT 121 (145)
T ss_pred HHHHHHHHHhCCCccCceeeEecCCCcEEEEEEcCCeEEEecCchhHHHHHHHhCCeeEEEecCCCCcHH
Confidence 33344555555333578 7999999999999853 1 34555455554 9999999988876654
No 62
>cd06167 LabA_like LabA_like proteins. A well conserved group of bacterial proteins with no defined function. LabA, a member from Synechococcus elongatus PCC 7942, has been shown to play a role in cyanobacterial circadian timing. It is required for negative feedback regulation of the autokinase/autophosphatase KaiC, a central component of the circadian clock system. In particular, LabA seems necessary for KaiC-dependent repression of gene expression.
Probab=59.69 E-value=13 Score=27.03 Aligned_cols=32 Identities=19% Similarity=0.339 Sum_probs=27.2
Q ss_pred CCEEEEEeecCHHHHHHHHhc-CCceEEeecCC
Q 048021 46 EGMAYITGRANPRKLLRKLQS-GKYANLCWVSA 77 (143)
Q Consensus 46 ~~~VtV~G~vdp~~Ll~aI~k-Gk~Ael~~~~~ 77 (143)
+.-|.|+|..|=.-++++|++ |+++++++.+.
T Consensus 101 d~ivLvSgD~Df~~~i~~lr~~G~~V~v~~~~~ 133 (149)
T cd06167 101 DTIVLVSGDSDFVPLVERLRELGKRVIVVGFEA 133 (149)
T ss_pred CEEEEEECCccHHHHHHHHHHcCCEEEEEccCc
Confidence 345666788999999999999 99999999973
No 63
>COG2823 OsmY Predicted periplasmic or secreted lipoprotein [General function prediction only]
Probab=59.44 E-value=22 Score=28.84 Aligned_cols=42 Identities=14% Similarity=0.247 Sum_probs=33.1
Q ss_pred hHHHHHHHHhccCCe---eEEEeCCCCEEEEEeec-CHHHHHHHHh
Q 048021 24 WFSSMTKVLRKIKGA---SYTIDAEEGMAYITGRA-NPRKLLRKLQ 65 (143)
Q Consensus 24 C~~kIkkaL~kI~GV---sV~VDle~~~VtV~G~v-dp~~Ll~aI~ 65 (143)
=..+|+.+|..-+.+ +|+|..+++.|++.|.+ ++++-.+|.+
T Consensus 132 iTt~Vka~Ll~~~~v~s~~IkV~t~~g~V~L~G~V~s~~e~~~A~~ 177 (196)
T COG2823 132 ITTKVKAALLGDDNVKSSNIKVETENGEVYLSGLVDSQEEAERAEE 177 (196)
T ss_pred HHHHHHHHhccCCCCCcceEEEEEECCEEEEEEEeCCHHHHHHHHH
Confidence 346899999988877 39999999999999999 5555555543
No 64
>PF00679 EFG_C: Elongation factor G C-terminus; InterPro: IPR000640 Translation elongation factors are responsible for two main processes during protein synthesis on the ribosome [, , ]. EF1A (or EF-Tu) is responsible for the selection and binding of the cognate aminoacyl-tRNA to the A-site (acceptor site) of the ribosome. EF2 (or EF-G) is responsible for the translocation of the peptidyl-tRNA from the A-site to the P-site (peptidyl-tRNA site) of the ribosome, thereby freeing the A-site for the next aminoacyl-tRNA to bind. Elongation factors are responsible for achieving accuracy of translation and both EF1A and EF2 are remarkably conserved throughout evolution. Elongation factor EF2 (EF-G) is a G-protein. It brings about the translocation of peptidyl-tRNA and mRNA through a ratchet-like mechanism: the binding of GTP-EF2 to the ribosome causes a counter-clockwise rotation in the small ribosomal subunit; the hydrolysis of GTP to GDP by EF2 and the subsequent release of EF2 causes a clockwise rotation of the small subunit back to the starting position [, ]. This twisting action destabilises tRNA-ribosome interactions, freeing the tRNA to translocate along the ribosome upon GTP-hydrolysis by EF2. EF2 binding also affects the entry and exit channel openings for the mRNA, widening it when bound to enable the mRNA to translocate along the ribosome. This entry represents the C-terminal domain found in EF2 (or EF-G) of both prokaryotes and eukaryotes (also known as eEF2), as well as in some tetracycline-resistance proteins. This domain adopts a ferredoxin-like fold consisting of an alpha/beta sandwich with anti-parallel beta-sheets. It resembles the topology of domain III found in these elongation factors, with which it forms the C-terminal block, but these two domains cannot be superimposed []. This domain is often found associated with (IPR000795 from INTERPRO), which contains the signatures for the N terminus of the proteins. More information about these proteins can be found at Protein of the Month: Elongation Factors [].; GO: 0005525 GTP binding; PDB: 1WDT_A 2DY1_A 3CB4_F 3DEG_C 2EFG_A 1ELO_A 2XSY_Y 2WRK_Y 1DAR_A 2WRI_Y ....
Probab=59.30 E-value=37 Score=23.18 Aligned_cols=46 Identities=20% Similarity=0.229 Sum_probs=31.9
Q ss_pred eEEEEEEeecChhhHHHHHHHHhccCCe--eEEEeCCCCEEEEEeecCHHHH
Q 048021 11 MTCGLKVDTNDPAWFSSMTKVLRKIKGA--SYTIDAEEGMAYITGRANPRKL 60 (143)
Q Consensus 11 ~tvvlkV~m~C~gC~~kIkkaL~kI~GV--sV~VDle~~~VtV~G~vdp~~L 60 (143)
+.+++.|+ ..+..+|...|.+-.|. +.+.+ .++.+.|.|.++...+
T Consensus 7 ~~~~I~~p---~~~~g~v~~~l~~r~g~i~~~~~~-~~~~~~i~~~iP~~~~ 54 (89)
T PF00679_consen 7 MSVEISVP---EEYLGKVISDLSKRRGEILSMDPI-GGDRVVIEAEIPVREL 54 (89)
T ss_dssp EEEEEEEE---GGGHHHHHHHHHHTT-EEEEEEEE-STTEEEEEEEEEGGGH
T ss_pred EEEEEEEC---HHHHHHHHHHhcccccEEEechhh-hhhheeEEEEEChhhh
Confidence 34444444 78899999999998886 34444 5889999998754433
No 65
>TIGR02945 SUF_assoc FeS assembly SUF system protein. Members of this family belong to the broader Pfam family pfam01883, or Domain of Unknown Function DUF59. Many members of DUF59 are candidate ring hydroxylating complex subunits. However, members of the narrower family defined here all are found in genomes that carry the FeS assembly SUF system. For 70 % of these species, the member of this protein family is found as part of the SUF locus, usually immediately downstream of the sufS gene.
Probab=58.52 E-value=16 Score=25.39 Aligned_cols=34 Identities=15% Similarity=0.312 Sum_probs=21.4
Q ss_pred EEEEEEeecChhhH------HHHHHHHhccCCe-eEEEeCC
Q 048021 12 TCGLKVDTNDPAWF------SSMTKVLRKIKGA-SYTIDAE 45 (143)
Q Consensus 12 tvvlkV~m~C~gC~------~kIkkaL~kI~GV-sV~VDle 45 (143)
++.+.+.+...+|. ++|+.+|.+++|+ +|+|++.
T Consensus 38 ~v~i~l~l~~p~~~~~~~l~~~i~~al~~l~gv~~v~v~i~ 78 (99)
T TIGR02945 38 HVDIQMTLTAPNCPVAGSMPGEVENAVRAVPGVGSVTVELV 78 (99)
T ss_pred eEEEEEEECCCCCChHHHHHHHHHHHHHhCCCCceEEEEEE
Confidence 45555555433333 4588889899998 5766543
No 66
>PRK05412 putative nucleotide-binding protein; Reviewed
Probab=57.82 E-value=50 Score=26.26 Aligned_cols=60 Identities=12% Similarity=0.170 Sum_probs=41.2
Q ss_pred EEEEEE-eecChhhHHHHHHHHhccCCeeEEEeCCCCEEEEEee--cCHHHHHHHHhc-CCceEE
Q 048021 12 TCGLKV-DTNDPAWFSSMTKVLRKIKGASYTIDAEEGMAYITGR--ANPRKLLRKLQS-GKYANL 72 (143)
Q Consensus 12 tvvlkV-~m~C~gC~~kIkkaL~kI~GVsV~VDle~~~VtV~G~--vdp~~Ll~aI~k-Gk~Ael 72 (143)
+.++++ ++--...+++|.|+++. .+..|.......+|.|+|. -|-..++..|++ .+...|
T Consensus 91 rq~i~lk~GI~~e~AKkIvK~IKd-~klKVqa~IQGd~vRVtgKkrDDLQ~viallk~~d~~~pL 154 (161)
T PRK05412 91 KQEVKLKQGIDQELAKKIVKLIKD-SKLKVQAQIQGDQVRVTGKKRDDLQAVIALLRKADLGQPL 154 (161)
T ss_pred EEEEehhhccCHHHHHHHHHHHHh-cCCceeEEecCcEEEEecCCHhHHHHHHHHHHhccCCCCc
Confidence 344444 24456678888888876 3456888889999999995 367777777776 554433
No 67
>PF13291 ACT_4: ACT domain; PDB: 2KO1_B 3IBW_A.
Probab=57.54 E-value=28 Score=22.82 Aligned_cols=32 Identities=25% Similarity=0.386 Sum_probs=22.8
Q ss_pred ceEEEEEEeecChhhHHHHHHHHhccCCe-eEE
Q 048021 10 KMTCGLKVDTNDPAWFSSMTKVLRKIKGA-SYT 41 (143)
Q Consensus 10 ~~tvvlkV~m~C~gC~~kIkkaL~kI~GV-sV~ 41 (143)
...+.|.|+..--.=-..|.+.|++++|| +|.
T Consensus 47 ~~~~~l~v~V~d~~~L~~ii~~L~~i~~V~~V~ 79 (80)
T PF13291_consen 47 TARITLTVEVKDLEHLNQIIRKLRQIPGVISVE 79 (80)
T ss_dssp EEEEEEEEEESSHHHHHHHHHHHCTSTTEEEEE
T ss_pred EEEEEEEEEECCHHHHHHHHHHHHCCCCeeEEE
Confidence 44566777766555556888889999999 664
No 68
>PRK10555 aminoglycoside/multidrug efflux system; Provisional
Probab=57.41 E-value=19 Score=35.48 Aligned_cols=43 Identities=12% Similarity=0.114 Sum_probs=33.4
Q ss_pred hHHHHHHHHhccCCe-eEEEeCCCCEEEEEe--------ecCHHHHHHHHhc
Q 048021 24 WFSSMTKVLRKIKGA-SYTIDAEEGMAYITG--------RANPRKLLRKLQS 66 (143)
Q Consensus 24 C~~kIkkaL~kI~GV-sV~VDle~~~VtV~G--------~vdp~~Ll~aI~k 66 (143)
=++.|++.|++++|| +|++.-...++.|.= .+++.+|.++|+.
T Consensus 158 ~~~~l~~~L~~v~GV~~V~~~G~~~ei~V~vD~~kl~~~gls~~~v~~al~~ 209 (1037)
T PRK10555 158 VASNIQDPLSRVNGVGDIDAYGSQYSMRIWLDPAKLNSFQMTTKDVTDAIES 209 (1037)
T ss_pred HHHHHHHHhhcCCCeEEEEEcCCceEEEEEECHHHHHHcCCCHHHHHHHHHH
Confidence 347799999999999 799987666676662 2577888889986
No 69
>PF01106 NifU: NifU-like domain; InterPro: IPR001075 Iron-sulphur (FeS) clusters are important cofactors for numerous proteins involved in electron transfer, in redox and non-redox catalysis, in gene regulation, and as sensors of oxygen and iron. These functions depend on the various FeS cluster prosthetic groups, the most common being [2Fe-2S] and [4Fe-4S] []. FeS cluster assembly is a complex process involving the mobilisation of Fe and S atoms from storage sources, their assembly into [Fe-S] form, their transport to specific cellular locations, and their transfer to recipient apoproteins. So far, three FeS assembly machineries have been identified, which are capable of synthesising all types of [Fe-S] clusters: ISC (iron-sulphur cluster), SUF (sulphur assimilation), and NIF (nitrogen fixation) systems. The ISC system is conserved in eubacteria and eukaryotes (mitochondria), and has broad specificity, targeting general FeS proteins [, ]. It is encoded by the isc operon (iscRSUA-hscBA-fdx-iscX). IscS is a cysteine desulphurase, which obtains S from cysteine (converting it to alanine) and serves as a S donor for FeS cluster assembly. IscU and IscA act as scaffolds to accept S and Fe atoms, assembling clusters and transfering them to recipient apoproteins. HscA is a molecular chaperone and HscB is a co-chaperone. Fdx is a [2Fe-2S]-type ferredoxin. IscR is a transcription factor that regulates expression of the isc operon. IscX (also known as YfhJ) appears to interact with IscS and may function as an Fe donor during cluster assembly []. The SUF system is an alternative pathway to the ISC system that operates under iron starvation and oxidative stress. It is found in eubacteria, archaea and eukaryotes (plastids). The SUF system is encoded by the suf operon (sufABCDSE), and the six encoded proteins are arranged into two complexes (SufSE and SufBCD) and one protein (SufA). SufS is a pyridoxal-phosphate (PLP) protein displaying cysteine desulphurase activity. SufE acts as a scaffold protein that accepts S from SufS and donates it to SufA []. SufC is an ATPase with an unorthodox ATP-binding cassette (ABC)-like component. No specific functions have been assigned to SufB and SufD. SufA is homologous to IscA [], acting as a scaffold protein in which Fe and S atoms are assembled into [FeS] cluster forms, which can then easily be transferred to apoproteins targets. In the NIF system, NifS and NifU are required for the formation of metalloclusters of nitrogenase in Azotobacter vinelandii, and other organisms, as well as in the maturation of other FeS proteins. Nitrogenase catalyses the fixation of nitrogen. It contains a complex cluster, the FeMo cofactor, which contains molybdenum, Fe and S. NifS is a cysteine desulphurase. NifU binds one Fe atom at its N-terminal, assembling an FeS cluster that is transferred to nitrogenase apoproteins []. Nif proteins involved in the formation of FeS clusters can also be found in organisms that do not fix nitrogen []. This entry represents the C-terminal of NifU and homologous proteins. NifU contains two domains: an N-terminal (IPR002871 from INTERPRO) and a C-terminal domain []. These domains exist either together or on different polypeptides, both domains being found in organisms that do not fix nitrogen (e.g. yeast), so they have a broader significance in the cell than nitrogen fixation. ; GO: 0005506 iron ion binding, 0051536 iron-sulfur cluster binding, 0016226 iron-sulfur cluster assembly; PDB: 2JNV_A 2Z51_A 1TH5_A 1VEH_A 1XHJ_A.
Probab=56.88 E-value=22 Score=23.71 Aligned_cols=27 Identities=7% Similarity=0.113 Sum_probs=17.5
Q ss_pred EEEEEEeecChhh-------HHHHHHHHh-ccCCe
Q 048021 12 TCGLKVDTNDPAW-------FSSMTKVLR-KIKGA 38 (143)
Q Consensus 12 tvvlkV~m~C~gC-------~~kIkkaL~-kI~GV 38 (143)
++.++..+.|.+| ...|+++|+ .++.+
T Consensus 28 ~V~V~l~GaC~gC~~s~~Tl~~~Ie~~L~~~~~~v 62 (68)
T PF01106_consen 28 VVYVRLTGACSGCPSSDMTLKQGIEQALREAVPEV 62 (68)
T ss_dssp EEEEEEESSCCSSCCHHHHHHHHHHHHHHHHSTT-
T ss_pred EEEEEEEeCCCCCCCHHHHHHHHHHHHHHHHCCCC
Confidence 5566667899999 355666665 45655
No 70
>cd04909 ACT_PDH-BS C-terminal ACT domain of the monofunctional, NAD dependent, prephenate dehydrogenase (PDH). The C-terminal ACT domain of the monofunctional, NAD dependent, prephenate dehydrogenase (PDH) enzyme that catalyzes the formation of 4-hydroxyphenylpyruvate from prephenate, found in Bacillus subtilis (BS) and other Firmicutes, Deinococci, and Bacteroidetes. PDH is the first enzyme in the aromatic amino acid pathway specific for the biosynthesis of tyrosine. This enzyme is feedback-inhibited by tyrosine in B. subtilis and other microorganisms. Both phenylalanine and tryptophan have been shown to be inhibitors of this activity in B. subtilis. Bifunctional chorismate mutase-PDH (TyrA) enzymes such as those seen in Escherichia coli do not contain an ACT domain. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=56.77 E-value=47 Score=20.88 Aligned_cols=56 Identities=21% Similarity=0.319 Sum_probs=35.3
Q ss_pred EEEEee-cChhhHHHHHHHHhccCCe---eEEEeC---C-CCEEEEEe--ecCHHHHHHHHhc-CCce
Q 048021 14 GLKVDT-NDPAWFSSMTKVLRKIKGA---SYTIDA---E-EGMAYITG--RANPRKLLRKLQS-GKYA 70 (143)
Q Consensus 14 vlkV~m-~C~gC~~kIkkaL~kI~GV---sV~VDl---e-~~~VtV~G--~vdp~~Ll~aI~k-Gk~A 70 (143)
.|.|.+ .=.|.-.+|.+.|.+. |+ ++.... . .+.+.+.- ..+.+++++.|++ |+++
T Consensus 3 ~~~v~~~d~~G~L~~l~~~l~~~-~i~i~~~~~~~~~~~~~~~~~i~v~~~~~~~~~~~~L~~~G~~v 69 (69)
T cd04909 3 DLYVDVPDEPGVIAEVTQILGDA-GISIKNIEILEIREGIGGILRISFKTQEDRERAKEILKEAGYEV 69 (69)
T ss_pred EEEEEcCCCCCHHHHHHHHHHHc-CCCceeeEeEEeecCCcEEEEEEECCHHHHHHHHHHHHHcCCcC
Confidence 355555 4467888899988774 33 333322 1 34444443 3478999999999 9864
No 71
>cd01514 Elongation_Factor_C Elongation factor G C-terminus. This domain includes the carboxyl terminal regions of elongation factors (EFs) bacterial EF-G, eukaryotic and archeal EF-2 and eukaryotic mitochondrial mtEFG1s and mtEFG2s. This group also includes proteins similar to the ribosomal protection proteins Tet(M) and Tet(O), BipA, LepA and, spliceosomal proteins: human 116kD U5 small nuclear ribonucleoprotein (snRNP) protein (U5-116 kD) and yeast counterpart Snu114p. This domain adopts a ferredoxin-like fold consisting of an alpha-beta sandwich with anti-parallel beta-sheets, resembling the topology of domain III found in the elongation factors EF-G and eukaryotic EF-2, with which it forms the C-terminal block. The two domains however are not superimposable and domain III lacks some of the characteristics of this domain. EF-2/EF-G in complex with GTP, promotes the translocation step of translation. During translocation the peptidyl-tRNA is moved from the A site to the P site, the
Probab=55.94 E-value=56 Score=21.46 Aligned_cols=58 Identities=17% Similarity=0.280 Sum_probs=36.4
Q ss_pred EEeecC-hhhHHHHHHHHhccCCeeEEEeCCC-CEEEEEeecCHHHHHHHHhcCCceEEeecCCC
Q 048021 16 KVDTND-PAWFSSMTKVLRKIKGASYTIDAEE-GMAYITGRANPRKLLRKLQSGKYANLCWVSAG 78 (143)
Q Consensus 16 kV~m~C-~gC~~kIkkaL~kI~GVsV~VDle~-~~VtV~G~vdp~~Ll~aI~kGk~Ael~~~~~~ 78 (143)
+|++.| ..+..+|...|.+-.|.-+..+..+ +.+.|++.++...+. |+..+|.+...|
T Consensus 5 ~~~I~~p~~~~g~v~~~l~~rrg~v~~~~~~~~~~~~i~~~iP~~e~~-----g~~~~lr~~T~G 64 (79)
T cd01514 5 KVEITVPEEYLGAVIGDLSKRRGEILGMEPRGTGRVVIKAELPLAEMF-----GFATDLRSLTQG 64 (79)
T ss_pred EEEEEcCHHHHHHHHHHHHhcCCeeEeeEecCCCeEEEEEECCHHHHc-----CcHHHhhhhcCC
Confidence 444444 4577789999988888633333444 789999998844331 555555555443
No 72
>cd02393 PNPase_KH Polynucleotide phosphorylase (PNPase) K homology RNA-binding domain (KH). PNPase is a polyribonucleotide nucleotidyl transferase that degrades mRNA in prokaryotes and plant chloroplasts. The C-terminal region of PNPase contains domains homologous to those in other RNA binding proteins: a KH domain and an S1 domain. KH domains bind single-stranded RNA and are found in a wide variety of proteins including ribosomal proteins, transcription factors and post-transcriptional modifiers of mRNA.
Probab=55.86 E-value=36 Score=21.94 Aligned_cols=37 Identities=16% Similarity=0.308 Sum_probs=25.1
Q ss_pred HHHHHHhccCCeeEEEeCCCCEEEEEeecCHHHHHHHHh
Q 048021 27 SMTKVLRKIKGASYTIDAEEGMAYITGRANPRKLLRKLQ 65 (143)
Q Consensus 27 kIkkaL~kI~GVsV~VDle~~~VtV~G~vdp~~Ll~aI~ 65 (143)
+.-+.|+.--|+++.|+- +++|+|.| .+++.+.+|.+
T Consensus 22 ~~ik~I~~~tg~~I~i~~-~g~v~I~G-~~~~~v~~A~~ 58 (61)
T cd02393 22 KTIKKIIEETGVKIDIED-DGTVYIAA-SDKEAAEKAKK 58 (61)
T ss_pred hHHHHHHHHHCCEEEeCC-CCEEEEEe-CCHHHHHHHHH
Confidence 334456666788888876 68999999 44566666543
No 73
>TIGR02190 GlrX-dom Glutaredoxin-family domain. This C-terminal domain with homology to glutaredoxin is fused to an N-terminal peroxiredoxin-like domain.
Probab=55.56 E-value=27 Score=23.08 Aligned_cols=33 Identities=15% Similarity=0.084 Sum_probs=22.1
Q ss_pred eEEEEEEeecChhhHHHHHHHHhccCCee-EEEeCC
Q 048021 11 MTCGLKVDTNDPAWFSSMTKVLRKIKGAS-YTIDAE 45 (143)
Q Consensus 11 ~tvvlkV~m~C~gC~~kIkkaL~kI~GVs-V~VDle 45 (143)
..++|=..-.|..|. ++++.|.+ .||. ..+|++
T Consensus 8 ~~V~ly~~~~Cp~C~-~ak~~L~~-~gi~y~~idi~ 41 (79)
T TIGR02190 8 ESVVVFTKPGCPFCA-KAKATLKE-KGYDFEEIPLG 41 (79)
T ss_pred CCEEEEECCCCHhHH-HHHHHHHH-cCCCcEEEECC
Confidence 345555556899998 67888877 4784 445544
No 74
>PLN03194 putative disease resistance protein; Provisional
Probab=54.65 E-value=43 Score=27.16 Aligned_cols=57 Identities=7% Similarity=-0.063 Sum_probs=39.7
Q ss_pred ChhhHHHHHHHHhccCCeeEEEeCCCCEEEEEeecCHHHHHHHHhc-CCceEEeecCCCCCC
Q 048021 21 DPAWFSSMTKVLRKIKGASYTIDAEEGMAYITGRANPRKLLRKLQS-GKYANLCWVSAGNQI 81 (143)
Q Consensus 21 C~gC~~kIkkaL~kI~GVsV~VDle~~~VtV~G~vdp~~Ll~aI~k-Gk~Ael~~~~~~~s~ 81 (143)
..+-+.-+.++|.. .|+.+-+|.++-+ .|.--...|.++|+. ..-+.+++....+|.
T Consensus 39 R~~FvshL~~aL~~-~GI~vF~D~~el~---~G~~i~~~L~~AIeeSri~IvVfS~~Ya~S~ 96 (187)
T PLN03194 39 KRTIATLLYDHLSR-LNLRPFLDNKNMK---PGDKLFDKINSAIRNCKVGVAVFSPRYCESY 96 (187)
T ss_pred cccHHHHHHHHHHH-CCCEEEEcCcccc---CCCcHHHHHHHHHHhCeEEEEEECCCcccch
Confidence 45778889999988 6887666654211 244445688889998 888888877666554
No 75
>PRK09577 multidrug efflux protein; Reviewed
Probab=54.38 E-value=22 Score=34.94 Aligned_cols=45 Identities=11% Similarity=0.104 Sum_probs=34.6
Q ss_pred HHHHHHHHhccCCe-eEEEeCCCCEEEEE--------eecCHHHHHHHHhc-CCc
Q 048021 25 FSSMTKVLRKIKGA-SYTIDAEEGMAYIT--------GRANPRKLLRKLQS-GKY 69 (143)
Q Consensus 25 ~~kIkkaL~kI~GV-sV~VDle~~~VtV~--------G~vdp~~Ll~aI~k-Gk~ 69 (143)
++.|+..|++++|| +|+++-.+.+|.|. -.+++.+|.++|++ +..
T Consensus 158 ~~~l~~~L~~v~GV~~V~~~G~e~~v~V~vD~~kl~~~Gls~~~V~~~l~~~n~~ 212 (1032)
T PRK09577 158 SANVLQALRRVEGVGKVQFWGAEYAMRIWPDPVKLAALGLTASDIASAVRAHNAR 212 (1032)
T ss_pred HHHHHHHHhcCCCcEEEEecCCceEEEEEeCHHHHHHcCCCHHHHHHHHHHhCCc
Confidence 47899999999999 79998876677764 12567778889987 544
No 76
>PF06998 DUF1307: Protein of unknown function (DUF1307); InterPro: IPR009736 This entry is represented by Bacteriophage VT1-Sakai, YehR. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several hypothetical bacterial proteins of around 150 residues in length. Some family members are described as putative lipoproteins but the function of the family is unknown.; PDB: 2JOE_A.
Probab=54.21 E-value=9.1 Score=28.30 Aligned_cols=38 Identities=24% Similarity=0.299 Sum_probs=25.3
Q ss_pred hhHHHHHHHHhccCCeeEEEeCCCCEEEEEee-----cCHHHH
Q 048021 23 AWFSSMTKVLRKIKGASYTIDAEEGMAYITGR-----ANPRKL 60 (143)
Q Consensus 23 gC~~kIkkaL~kI~GVsV~VDle~~~VtV~G~-----vdp~~L 60 (143)
.+...+.+.++.|+||+..++.+++.++..-. +|.++|
T Consensus 49 ~~~~~~~~~yk~ikGv~~~~e~kd~k~ve~i~IDy~k~D~~~l 91 (123)
T PF06998_consen 49 KQIKPQSKKYKGIKGVTYKVEYKDDKVVETIEIDYTKADFDKL 91 (123)
T ss_dssp HHHHHHHTTTTTSSS-EEEEEE-SSEEEEEEEEETTS--HHHH
T ss_pred HHHHHHHHhhcCCCCeEEEEEEeCCEEEEEEEEecccCCHHHH
Confidence 36666667778899998888888888877643 566666
No 77
>TIGR00288 conserved hypothetical protein TIGR00288. This family of orthologs is restricted to but universal among the completed archaeal genomes so far. Eubacterial proteins showing at least local homology include slr1870 from Synechocystis PCC6803 and two proteins from Aquifex aeolicusr, none of which is characterized.
Probab=54.12 E-value=18 Score=28.49 Aligned_cols=35 Identities=20% Similarity=0.230 Sum_probs=29.3
Q ss_pred CCEEEEEeecCHHHHHHHHhc-CCceEEeecCCCCC
Q 048021 46 EGMAYITGRANPRKLLRKLQS-GKYANLCWVSAGNQ 80 (143)
Q Consensus 46 ~~~VtV~G~vdp~~Ll~aI~k-Gk~Ael~~~~~~~s 80 (143)
+.-|.|+|..|=..|+++|++ |++|++++++..-+
T Consensus 107 D~~vLvSgD~DF~~Lv~~lre~G~~V~v~g~~~~ts 142 (160)
T TIGR00288 107 DAVALVTRDADFLPVINKAKENGKETIVIGAEPGFS 142 (160)
T ss_pred CEEEEEeccHhHHHHHHHHHHCCCEEEEEeCCCCCh
Confidence 456777899999999999999 99999999865444
No 78
>PRK10568 periplasmic protein; Provisional
Probab=53.47 E-value=38 Score=27.05 Aligned_cols=32 Identities=13% Similarity=0.232 Sum_probs=25.9
Q ss_pred HHHHHHHHhccCCe---eEEEeCCCCEEEEEeecC
Q 048021 25 FSSMTKVLRKIKGA---SYTIDAEEGMAYITGRAN 56 (143)
Q Consensus 25 ~~kIkkaL~kI~GV---sV~VDle~~~VtV~G~vd 56 (143)
..+|+.+|..-+.+ .++|+.++|.|++.|.++
T Consensus 141 t~~vk~~L~~~~~v~~~~I~V~v~~G~V~L~G~V~ 175 (203)
T PRK10568 141 TSEIKAKLLADDIVPSRKVKVETTDGVVQLSGTVD 175 (203)
T ss_pred HHHHHHHHhhCCCCCcceeEEEEeCcEEEEEEEEC
Confidence 66788888765555 489999999999999984
No 79
>TIGR00484 EF-G translation elongation factor EF-G. After peptide bond formation, this elongation factor of bacteria and organelles catalyzes the translocation of the tRNA-mRNA complex, with its attached nascent polypeptide chain, from the A-site to the P-site of the ribosome. Every completed bacterial genome has at least one copy, but some species have additional EF-G-like proteins. The closest homolog to canonical (e.g. E. coli) EF-G in the spirochetes clusters as if it is derived from mitochondrial forms, while a more distant second copy is also present. Synechocystis PCC6803 has a few proteins more closely related to EF-G than to any other characterized protein. Two of these resemble E. coli EF-G more closely than does the best match from the spirochetes; it may be that both function as authentic EF-G.
Probab=53.44 E-value=46 Score=31.14 Aligned_cols=61 Identities=21% Similarity=0.222 Sum_probs=42.7
Q ss_pred EEEEEEeecChhhHHHHHHHHhccC----CeeEEEeCCCCEEEEEe--ecCHHHHHHHHhc--CCceEE
Q 048021 12 TCGLKVDTNDPAWFSSMTKVLRKIK----GASYTIDAEEGMAYITG--RANPRKLLRKLQS--GKYANL 72 (143)
Q Consensus 12 tvvlkV~m~C~gC~~kIkkaL~kI~----GVsV~VDle~~~VtV~G--~vdp~~Ll~aI~k--Gk~Ael 72 (143)
.+...|.-.=.....++.++|++|. ++.|.+|.+.++..|.| ++--+.++++|++ |.++++
T Consensus 406 vl~~~i~p~~~~d~~kL~~aL~~L~~eDpsl~v~~~~etge~il~g~GelHLei~~~~L~~~~~vev~~ 474 (689)
T TIGR00484 406 VISLAVEPKTKADQEKMGIALGKLAEEDPTFRTFTDPETGQTIIAGMGELHLDIIVDRMKREFKVEANV 474 (689)
T ss_pred eEEEEEEECCcccHHHHHHHHHHHHHhCCEEEEEECCCCCCEEEEEeeHHHHHHHHHHHHHHhCCeeEe
Confidence 3344444455566777888887763 33577888899888885 5678899999988 655553
No 80
>PF03927 NapD: NapD protein; InterPro: IPR005623 This entry represents NapD, the twin-arginine signal-peptide-binding chaperone for NapA, functioning as an assembly protein for the periplasmic nitrate reductase NapABC. The periplasmic NapABC enzyme likely functions during growth in nitrate-limited environments [].; PDB: 2JSX_A 2PQ4_A.
Probab=53.16 E-value=39 Score=23.17 Aligned_cols=32 Identities=6% Similarity=0.130 Sum_probs=26.2
Q ss_pred eEEEEEEeecChhhHHHHHHHHhccCCe-eEEE
Q 048021 11 MTCGLKVDTNDPAWFSSMTKVLRKIKGA-SYTI 42 (143)
Q Consensus 11 ~tvvlkV~m~C~gC~~kIkkaL~kI~GV-sV~V 42 (143)
-++++-++....+-..+.-++|+.|+|| ++..
T Consensus 39 GKiVVtiE~~~~~~~~~~~~~i~~l~GVlsa~l 71 (79)
T PF03927_consen 39 GKIVVTIEAESSEEEVDLIDAINALPGVLSASL 71 (79)
T ss_dssp TEEEEEEEESSHHHHHHHHHHHCCSTTEEEEEE
T ss_pred CeEEEEEEeCChHHHHHHHHHHHcCCCceEEEE
Confidence 4677777888888888888999999999 7543
No 81
>cd02394 vigilin_like_KH K homology RNA-binding domain_vigilin_like. The vigilin family is a large and extended family of multiple KH-domain proteins, including vigilin, also called high density lipoprotein binding protien (HBP), fungal Scp160 and bicaudal-C. Yeast Scp160p has been shown to bind RNA and to associate with both soluble and membrane-bound polyribosomes as a mRNP component. Bicaudal-C is a RNA-binding molecule believed to function in embryonic development at the post-transcriptional level. In general, KH binds single-stranded RNA or DNA. It is found in a wide variety of proteins including ribosomal proteins, transcription factors and post-transcriptional modifiers of mRNA.
Probab=52.64 E-value=28 Score=21.84 Aligned_cols=28 Identities=14% Similarity=0.195 Sum_probs=20.5
Q ss_pred CCeeEEEeCC---CCEEEEEeecCHHHHHHHHh
Q 048021 36 KGASYTIDAE---EGMAYITGRANPRKLLRKLQ 65 (143)
Q Consensus 36 ~GVsV~VDle---~~~VtV~G~vdp~~Ll~aI~ 65 (143)
-|+.+.+... ++.|+|.|. ++.+.+|++
T Consensus 29 ~g~~I~i~~~~~~~~~v~I~G~--~~~v~~A~~ 59 (62)
T cd02394 29 TGVKIRFPDPGSKSDTITITGP--KENVEKAKE 59 (62)
T ss_pred hCCEEEcCCCCCCCCEEEEEcC--HHHHHHHHH
Confidence 3777777764 799999998 566666653
No 82
>TIGR00915 2A0602 The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family. This family is one of several subfamilies within the scope of pfam model pfam00873.
Probab=51.76 E-value=26 Score=34.58 Aligned_cols=42 Identities=14% Similarity=0.222 Sum_probs=32.9
Q ss_pred HHHHHHHHhccCCe-eEEEeCCCCEEEEEe--------ecCHHHHHHHHhc
Q 048021 25 FSSMTKVLRKIKGA-SYTIDAEEGMAYITG--------RANPRKLLRKLQS 66 (143)
Q Consensus 25 ~~kIkkaL~kI~GV-sV~VDle~~~VtV~G--------~vdp~~Ll~aI~k 66 (143)
++.|+..|++++|| +|++.-.+.++.|.= .+++.+|.++|++
T Consensus 159 ~~~l~~~L~~v~GV~~V~~~G~~~ei~V~vD~~kl~~~gls~~dV~~~i~~ 209 (1044)
T TIGR00915 159 ASNMVDPISRLEGVGDVQLFGSQYAMRIWLDPAKLNSYQLTPADVISAIQA 209 (1044)
T ss_pred HHHHHHHHhCCCCceEEEecCCceEEEEEECHHHHHHcCCCHHHHHHHHHH
Confidence 36799999999999 799987766677762 2567788888875
No 83
>PF04312 DUF460: Protein of unknown function (DUF460); InterPro: IPR007408 This is an archaeal protein of unknown function.
Probab=51.24 E-value=51 Score=25.58 Aligned_cols=40 Identities=10% Similarity=0.165 Sum_probs=32.9
Q ss_pred CCe-e--EEEeCCCCEEEEEee--cCHHHHHHHHhc-CCceEEeec
Q 048021 36 KGA-S--YTIDAEEGMAYITGR--ANPRKLLRKLQS-GKYANLCWV 75 (143)
Q Consensus 36 ~GV-s--V~VDle~~~VtV~G~--vdp~~Ll~aI~k-Gk~Ael~~~ 75 (143)
+|. . .-+|++.+.+.+.+. .+...|++.|.+ |+.+.+..=
T Consensus 39 PG~ttgiAildL~G~~l~l~S~R~~~~~evi~~I~~~G~PviVAtD 84 (138)
T PF04312_consen 39 PGTTTGIAILDLDGELLDLKSSRNMSRSEVIEWISEYGKPVIVATD 84 (138)
T ss_pred CCceeEEEEEecCCcEEEEEeecCCCHHHHHHHHHHcCCEEEEEec
Confidence 565 2 567999999999874 799999999999 999887654
No 84
>cd04096 eEF2_snRNP_like_C eEF2_snRNP_like_C: this family represents a C-terminal domain of eukaryotic elongation factor 2 (eEF-2) and a homologous domain of the spliceosomal human 116kD U5 small nuclear ribonucleoprotein (snRNP) protein (U5-116 kD) and, its yeast counterpart Snu114p. Yeast Snu114p is essential for cell viability and for splicing in vivo. U5-116 kD binds GTP. Experiments suggest that GTP binding and probably GTP hydrolysis is important for the function of the U5-116 kD/Snu114p. In complex with GTP, EF-2 promotes the translocation step of translation. During translocation the peptidyl-tRNA is moved from the A site to the P site, the uncharged tRNA from the P site to the E-site and, the mRNA is shifted one codon relative to the ribosome.
Probab=51.22 E-value=62 Score=21.40 Aligned_cols=39 Identities=13% Similarity=0.072 Sum_probs=26.3
Q ss_pred hhhHHHHHHHHhccCCeeEEEeCC--CCEEEEEeecCHHHH
Q 048021 22 PAWFSSMTKVLRKIKGASYTIDAE--EGMAYITGRANPRKL 60 (143)
Q Consensus 22 ~gC~~kIkkaL~kI~GVsV~VDle--~~~VtV~G~vdp~~L 60 (143)
+.+..+|...|.+-.|.-+..+.. .+.+.|++.++...+
T Consensus 12 ~~~~g~V~~~l~~rrg~i~~~~~~~~~~~~~i~~~iP~~e~ 52 (80)
T cd04096 12 EDALGKVYSVLSKRRGHVLSEEPKEGTPLFEIKAYLPVIES 52 (80)
T ss_pred HHHhhHHHHhhhhCeeEEeEEeecCCCccEEEEEEEeHHHH
Confidence 456778899998888863333333 455999998875443
No 85
>TIGR01709 typeII_sec_gspL general secretion pathway protein L. This model represents GspL, protein L of the main terminal branch of the general secretion pathway, also called type II secretion. It transports folded proteins across the bacterial outer membrane and is widely distributed in Gram-negative pathogens.
Probab=50.70 E-value=50 Score=28.38 Aligned_cols=52 Identities=21% Similarity=0.277 Sum_probs=37.8
Q ss_pred HHHHHHHhccCCe---eEEEeCCCCEEEEEeec----CHHHHHHHHhcCCceEEeecCC
Q 048021 26 SSMTKVLRKIKGA---SYTIDAEEGMAYITGRA----NPRKLLRKLQSGKYANLCWVSA 77 (143)
Q Consensus 26 ~kIkkaL~kI~GV---sV~VDle~~~VtV~G~v----dp~~Ll~aI~kGk~Ael~~~~~ 77 (143)
..+..+|..++|+ ++..|.+.++++++... +-+++..+|+.|+.|++-+...
T Consensus 313 ~~l~~al~~~~~~~l~sL~y~~~~~~L~l~l~a~~~~~le~l~~~l~~g~~v~~~~~~~ 371 (384)
T TIGR01709 313 AALATALGQLPGLQLQSLDFDGARGELRLKLEAPSDADLEQLRSRLARGFQVALGQAGA 371 (384)
T ss_pred HHHHHHhccCCCCceeEEeEcCCCCEEEEEEecCChHHHHHHHHHHhhhceeccccccc
Confidence 3456677777776 38889999999999764 4556666666699998877643
No 86
>PLN02925 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase
Probab=50.14 E-value=28 Score=33.65 Aligned_cols=26 Identities=15% Similarity=0.225 Sum_probs=19.8
Q ss_pred EEEe-ecChhhH----------HHHHHHHhccCCeeE
Q 048021 15 LKVD-TNDPAWF----------SSMTKVLRKIKGASY 40 (143)
Q Consensus 15 lkV~-m~C~gC~----------~kIkkaL~kI~GVsV 40 (143)
++.+ +.|+||- ++|++....++||.|
T Consensus 630 ~kte~isCPgCGRT~~dlq~~~~~I~~~~~hl~Gvki 666 (733)
T PLN02925 630 TKTEYVSCPSCGRTLFDLQEVSAEIREKTSHLPGVSI 666 (733)
T ss_pred cCCeEEECCCCCCccccHHHHHHHHHHHhhcCCCceE
Confidence 4444 5899994 578888889999854
No 87
>PRK15127 multidrug efflux system protein AcrB; Provisional
Probab=49.98 E-value=29 Score=34.25 Aligned_cols=42 Identities=10% Similarity=0.169 Sum_probs=31.6
Q ss_pred HHHHHHHHhccCCe-eEEEeCCCCEEEEEe--------ecCHHHHHHHHhc
Q 048021 25 FSSMTKVLRKIKGA-SYTIDAEEGMAYITG--------RANPRKLLRKLQS 66 (143)
Q Consensus 25 ~~kIkkaL~kI~GV-sV~VDle~~~VtV~G--------~vdp~~Ll~aI~k 66 (143)
++.|+..|++++|| +|++.-...++.|.= .+++..|.++|+.
T Consensus 159 ~~~l~~~L~~v~GV~~V~~~G~~~ei~V~vDp~kl~~~gls~~~V~~~l~~ 209 (1049)
T PRK15127 159 AANMKDPISRTSGVGDVQLFGSQYAMRIWMNPNELNKFQLTPVDVINAIKA 209 (1049)
T ss_pred HHHHHHHHhcCCCceEEEEcCCceEEEEEeCHHHHHHcCCCHHHHHHHHHH
Confidence 35799999999999 799876656666662 2467777888884
No 88
>PRK11670 antiporter inner membrane protein; Provisional
Probab=49.85 E-value=84 Score=27.34 Aligned_cols=67 Identities=13% Similarity=0.077 Sum_probs=42.5
Q ss_pred EEEEEEeecChhh------HHHHHHHHhccCCe-eEEEeCCC------------------CEEEEE---e---e-cCHHH
Q 048021 12 TCGLKVDTNDPAW------FSSMTKVLRKIKGA-SYTIDAEE------------------GMAYIT---G---R-ANPRK 59 (143)
Q Consensus 12 tvvlkV~m~C~gC------~~kIkkaL~kI~GV-sV~VDle~------------------~~VtV~---G---~-vdp~~ 59 (143)
++.|.+.+.-.+| .+.++++|+.++|| +|+|.+.. +.+.|. | + .-..-
T Consensus 48 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vIaV~S~KGGVGKTT~avN 127 (369)
T PRK11670 48 TLHIELVMPFVWNSAFEELKEQCSAELLRITGAKAIDWKLSHNIATLKRVNNQPGVNGVKNIIAVSSGKGGVGKSSTAVN 127 (369)
T ss_pred EEEEEEEECCCCchHHHHHHHHHHHHHHhcCCCceEEEEEeeehhhhccccccccCCCCCEEEEEeCCCCCCCHHHHHHH
Confidence 4556666543344 45789999999998 55543322 233443 1 1 23566
Q ss_pred HHHHHhc-CCceEEeecCCC
Q 048021 60 LLRKLQS-GKYANLCWVSAG 78 (143)
Q Consensus 60 Ll~aI~k-Gk~Ael~~~~~~ 78 (143)
|..+|.+ |+++-|+-....
T Consensus 128 LA~aLA~~G~rVlLID~D~q 147 (369)
T PRK11670 128 LALALAAEGAKVGILDADIY 147 (369)
T ss_pred HHHHHHHCCCcEEEEeCCCC
Confidence 7777888 999999988753
No 89
>TIGR02830 spore_III_AG stage III sporulation protein AG. CC A comparative genome analysis of all sequenced genomes of shows a number of proteins conserved strictly among the endospore-forming subset of the Firmicutes. This protein, a member of this panel, is found in a spore formation operon and is designated stage III sporulation protein AG.
Probab=49.15 E-value=23 Score=28.59 Aligned_cols=30 Identities=13% Similarity=0.361 Sum_probs=23.1
Q ss_pred hhhHHHHHHHHhccCCe-e--EEEeCCCCEEEE
Q 048021 22 PAWFSSMTKVLRKIKGA-S--YTIDAEEGMAYI 51 (143)
Q Consensus 22 ~gC~~kIkkaL~kI~GV-s--V~VDle~~~VtV 51 (143)
..=+++++.+|++|+|| + |.|.++.+.-.|
T Consensus 60 ~~lE~~L~~iL~~I~GvG~V~VmItl~s~~e~v 92 (186)
T TIGR02830 60 KQYENELKEILEKIEGVGDVTVMVNLDSSEEKV 92 (186)
T ss_pred HHHHHHHHHHHHhccCcceeEEEEEECCCceEE
Confidence 34678899999999999 4 777777665555
No 90
>PRK06418 transcription elongation factor NusA-like protein; Validated
Probab=48.50 E-value=98 Score=24.54 Aligned_cols=66 Identities=20% Similarity=0.253 Sum_probs=40.4
Q ss_pred EEEEEE--eecChhhHHHH------------HHHHhcc------CCee--EEEeCCCCEEEEEeec------CHHHHHHH
Q 048021 12 TCGLKV--DTNDPAWFSSM------------TKVLRKI------KGAS--YTIDAEEGMAYITGRA------NPRKLLRK 63 (143)
Q Consensus 12 tvvlkV--~m~C~gC~~kI------------kkaL~kI------~GVs--V~VDle~~~VtV~G~v------dp~~Ll~a 63 (143)
.|++-| .+-|++|.++| .++|-+| .++. =.+..++..+.|.... .--+.+++
T Consensus 5 ~~~~c~kt~ilC~~c~~~~~~G~v~~~dv~i~~~l~~l~~~~~l~~~~~~k~~~~ddrvIfvV~~gdg~aIGk~G~~ik~ 84 (166)
T PRK06418 5 ICEVCVKTGLLCPRCQSLLDSGEVTELDVEVSKVLLKLEEDKELKDVEYKKAYEVDDLVILLVTSGPRIPIGKGGKIAKA 84 (166)
T ss_pred eeeEEeccCccChhHHhHhhcCceEEeehHHHHHHHHhhccccccCceEEEEEEeCCEEEEEEeCCCcccccccchHHHH
Confidence 345544 46899999975 5777776 3331 1223334444444222 24467888
Q ss_pred Hhc--CCceEEeecCC
Q 048021 64 LQS--GKYANLCWVSA 77 (143)
Q Consensus 64 I~k--Gk~Ael~~~~~ 77 (143)
|++ |++++++...+
T Consensus 85 l~~~lgk~VevVE~s~ 100 (166)
T PRK06418 85 LSRKLGKKVRVVEKTN 100 (166)
T ss_pred HHHHhCCcEEEEEcCC
Confidence 887 99999998643
No 91
>COG0066 LeuD 3-isopropylmalate dehydratase small subunit [Amino acid transport and metabolism]
Probab=48.18 E-value=26 Score=28.58 Aligned_cols=47 Identities=19% Similarity=0.237 Sum_probs=32.2
Q ss_pred hHHHHHHHHhccCCeeEEEeCCCCEEEEE-e-e--cCHHHHHHHHhc-CCce
Q 048021 24 WFSSMTKVLRKIKGASYTIDAEEGMAYIT-G-R--ANPRKLLRKLQS-GKYA 70 (143)
Q Consensus 24 C~~kIkkaL~kI~GVsV~VDle~~~VtV~-G-~--vdp~~Ll~aI~k-Gk~A 70 (143)
|...|++.++.++|..++||+++++|+.. | . .+.....+.+-. |.+.
T Consensus 117 ~~~~~~~l~~~~~g~~i~VDL~~~~v~~~~~~~~~f~i~~f~~~~L~~GlD~ 168 (191)
T COG0066 117 AEEDVTELFEDVPGDEVTVDLETGEVTNPTGGEEPFEIDDFRREILLNGLDD 168 (191)
T ss_pred ChHHHHHHHHhCCCCEEEEEcCCCeEEeCCCCEEEccCCHHHHHHHHcCCcc
Confidence 66667777777788899999999999954 3 2 344444444444 7543
No 92
>PF08210 APOBEC_N: APOBEC-like N-terminal domain; InterPro: IPR013158 This domain is found at the N terminus of the Apolipoprotein B mRNA editing enzyme. Apobec-1 catalyzes C to U editing of apolipoprotein B (apoB) mRNA in the mammalian intestine. The N-terminal domain of APOBEC-1 like proteins is the catalytic domain, while the C-terminal domain is a pseudocatalyitc domain. More specifically, the catalytic domain is a zinc dependent deaminases domain and is essential for cytidine deamination. APOBEC-3 like members contain two copies of this domain. This family also includes the functionally homologous activation induced deaminase, which is essential for the development of antibody diversity in B lymphocytes. RNA editing by APOBEC-1 requires homodimerisation and this complex interacts with RNA binding proteins to from the editosome [] (and references therein).; GO: 0008270 zinc ion binding, 0016814 hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds, in cyclic amidines; PDB: 3IQS_A 3IR2_A 3V4J_B 2KEM_A 2KBO_A 3V4K_A 3E1U_A 2JYW_A 2RPZ_A.
Probab=48.11 E-value=29 Score=27.59 Aligned_cols=65 Identities=18% Similarity=0.143 Sum_probs=38.4
Q ss_pred ceEEEEEEee-cChh----hHHHHHHHHhcc--CCeeEEEeCCCCEEEEEeecC---HHHHHHHHhc-CCceEEeecC
Q 048021 10 KMTCGLKVDT-NDPA----WFSSMTKVLRKI--KGASYTIDAEEGMAYITGRAN---PRKLLRKLQS-GKYANLCWVS 76 (143)
Q Consensus 10 ~~tvvlkV~m-~C~g----C~~kIkkaL~kI--~GVsV~VDle~~~VtV~G~vd---p~~Ll~aI~k-Gk~Ael~~~~ 76 (143)
.-+++.-|.. -|.. |+++|.+.|++. ++|++.|=.+ ++.-...-. -.+=++.|.+ |-++++...+
T Consensus 73 ~y~ITwy~SwSPC~~~~~~Ca~~i~~FL~~~~~~~v~L~I~~a--rLY~~~~~~~~~~~eGLr~L~~aGv~v~iM~~~ 148 (188)
T PF08210_consen 73 IYRITWYLSWSPCPESDHCCAEKIAEFLKKHLKPNVSLSIFAA--RLYYHWEPEPLWNQEGLRRLASAGVQVEIMSYK 148 (188)
T ss_dssp EEEEEEEESSS--CC----HHHHHHHHHCCC--TTEEEEEEES--S--STTSTT---HHHHHHHHHHCTEEEEE-SHH
T ss_pred eEEEEEEEecCCCcchhhHHHHHHHHHHHHhCCCCCeEEEEEE--eeeeecCCcchhHHHHHHHHHHcCCEEEEcCHH
Confidence 3456667775 5999 999999999999 9986554333 121111222 2344445555 8888877553
No 93
>COG1432 Uncharacterized conserved protein [Function unknown]
Probab=47.95 E-value=26 Score=27.44 Aligned_cols=38 Identities=11% Similarity=0.174 Sum_probs=31.3
Q ss_pred CCCCEEEEEeecCHHHHHHHHhc-CCceEEeecCCCCCC
Q 048021 44 AEEGMAYITGRANPRKLLRKLQS-GKYANLCWVSAGNQI 81 (143)
Q Consensus 44 le~~~VtV~G~vdp~~Ll~aI~k-Gk~Ael~~~~~~~s~ 81 (143)
.-+..|.++|..|=.-++++++. |+++++++..+..+.
T Consensus 110 ~~D~ivl~SgD~DF~p~v~~~~~~G~rv~v~~~~~~~s~ 148 (181)
T COG1432 110 NVDTIVLFSGDGDFIPLVEAARDKGKRVEVAGIEPMTSS 148 (181)
T ss_pred CCCEEEEEcCCccHHHHHHHHHHcCCEEEEEecCCcCHH
Confidence 44667788899998888999988 999999999885543
No 94
>PHA01634 hypothetical protein
Probab=47.42 E-value=20 Score=28.20 Aligned_cols=11 Identities=0% Similarity=-0.085 Sum_probs=9.5
Q ss_pred ecChhhHHHHH
Q 048021 19 TNDPAWFSSMT 29 (143)
Q Consensus 19 m~C~gC~~kIk 29 (143)
|.|+||+++|.
T Consensus 98 iDCeGCE~~l~ 108 (156)
T PHA01634 98 MDCEGCEEKLN 108 (156)
T ss_pred EEccchHHhcC
Confidence 78999998875
No 95
>TIGR01305 GMP_reduct_1 guanosine monophosphate reductase, eukaryotic. A deep split separates two families of GMP reductase. This family includes both eukaryotic and some proteobacterial sequences, while the other family contains other bacterial sequences.
Probab=47.02 E-value=94 Score=27.51 Aligned_cols=62 Identities=13% Similarity=0.156 Sum_probs=47.5
Q ss_pred cChhhHHHHHHHHhccCCee-EEEeCCC------------------CEEEEEee-cCHHHHHHHHhcCCceEEeecCCCC
Q 048021 20 NDPAWFSSMTKVLRKIKGAS-YTIDAEE------------------GMAYITGR-ANPRKLLRKLQSGKYANLCWVSAGN 79 (143)
Q Consensus 20 ~C~gC~~kIkkaL~kI~GVs-V~VDle~------------------~~VtV~G~-vdp~~Ll~aI~kGk~Ael~~~~~~~ 79 (143)
.-+.+..++++.++.-+|++ +.||... +...|.|+ ++++...++|+.|-++..+++++|.
T Consensus 104 ~~~~d~er~~~L~~a~~~~d~iviD~AhGhs~~~i~~ik~ir~~~p~~~viaGNV~T~e~a~~Li~aGAD~ikVgiGpGS 183 (343)
T TIGR01305 104 SSDNDLEKMTSILEAVPQLKFICLDVANGYSEHFVEFVKLVREAFPEHTIMAGNVVTGEMVEELILSGADIVKVGIGPGS 183 (343)
T ss_pred cCHHHHHHHHHHHhcCCCCCEEEEECCCCcHHHHHHHHHHHHhhCCCCeEEEecccCHHHHHHHHHcCCCEEEEcccCCC
Confidence 34677788888887767785 7888764 46777884 6888888888779999999998875
Q ss_pred CC
Q 048021 80 QI 81 (143)
Q Consensus 80 s~ 81 (143)
..
T Consensus 184 ic 185 (343)
T TIGR01305 184 VC 185 (343)
T ss_pred cc
Confidence 54
No 96
>PF02662 FlpD: Methyl-viologen-reducing hydrogenase, delta subunit; InterPro: IPR003813 Methyl-viologen-reducing hydrogenase (MVH) is one of the enzymes involved in methanogenesis and coded in the mth-flp-mvh-mrt cluster of methane genes in Methanothermobacter thermautotrophicus (Methanobacterium thermoformicicum) []. No specific functions have been assigned to the delta subunit.; GO: 0015948 methanogenesis, 0055114 oxidation-reduction process
Probab=46.89 E-value=64 Score=23.88 Aligned_cols=58 Identities=16% Similarity=0.205 Sum_probs=36.2
Q ss_pred cChhhHHHHHHHHhccCCe-eEEEeCCCCEEEE--EeecCHHHHHHHHhcCCceEEe-ecCCCCCC
Q 048021 20 NDPAWFSSMTKVLRKIKGA-SYTIDAEEGMAYI--TGRANPRKLLRKLQSGKYANLC-WVSAGNQI 81 (143)
Q Consensus 20 ~C~gC~~kIkkaL~kI~GV-sV~VDle~~~VtV--~G~vdp~~Ll~aI~kGk~Ael~-~~~~~~s~ 81 (143)
.|..|.-.--.. -|+ .......-..|.| .|.+||.-|++++++|-+..++ +-.+++-+
T Consensus 6 ~C~~~ay~aad~----ag~~~~~~p~~vriIrvpC~Grv~~~~il~Af~~GADGV~V~gC~~g~Ch 67 (124)
T PF02662_consen 6 CCNWCAYAAADL----AGVSRLQYPPNVRIIRVPCSGRVDPEFILRAFEKGADGVLVAGCHPGDCH 67 (124)
T ss_pred EeCCCcHHHHHH----HhhccCCCCCCeEEEEccCCCccCHHHHHHHHHcCCCEEEEeCCCCCCCC
Confidence 577777544332 233 2233333333444 3899999999999999887766 66666643
No 97
>cd06482 ACD_HspB10 Alpha crystallin domain (ACD) found in mammalian small heat shock protein (sHsp) HspB10, also known as sperm outer dense fiber protein (ODFP), and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. Human (h) HspB10 occurs exclusively in the axoneme of sperm cells and may have a cytoskeletal role.
Probab=46.32 E-value=17 Score=25.54 Aligned_cols=23 Identities=9% Similarity=0.310 Sum_probs=19.6
Q ss_pred ccCCe---eEEEeCCCCEEEEEeecC
Q 048021 34 KIKGA---SYTIDAEEGMAYITGRAN 56 (143)
Q Consensus 34 kI~GV---sV~VDle~~~VtV~G~vd 56 (143)
.|+|+ +++|.++++.++|+|+..
T Consensus 15 dlPG~~kedI~V~v~~~~L~I~ger~ 40 (87)
T cd06482 15 DVCGFEPDQVKVKVKDGKVQVSAERE 40 (87)
T ss_pred ECCCCCHHHeEEEEECCEEEEEEEEe
Confidence 57898 299999999999999754
No 98
>KOG2874 consensus rRNA processing protein [Translation, ribosomal structure and biogenesis; Cell cycle control, cell division, chromosome partitioning]
Probab=46.19 E-value=16 Score=31.96 Aligned_cols=31 Identities=13% Similarity=0.219 Sum_probs=27.1
Q ss_pred hhhHHHHHHHHhccCCeeEEEeCCCCEEEEEe
Q 048021 22 PAWFSSMTKVLRKIKGASYTIDAEEGMAYITG 53 (143)
Q Consensus 22 ~gC~~kIkkaL~kI~GVsV~VDle~~~VtV~G 53 (143)
..|-.-|++||.+ .+|...+|+.+|.++|..
T Consensus 72 ke~wp~v~raL~e-~~i~c~lDL~egsm~V~T 102 (356)
T KOG2874|consen 72 KECWPLVERALDE-HHIKCVLDLVEGSMTVKT 102 (356)
T ss_pred HHHHHHHHHHHHh-cCcceEEeeccceeEEee
Confidence 4578889999987 578999999999999985
No 99
>smart00653 eIF2B_5 domain present in translation initiation factor eIF2B and eIF5.
Probab=45.99 E-value=44 Score=24.63 Aligned_cols=38 Identities=32% Similarity=0.443 Sum_probs=28.6
Q ss_pred HHHHHhccCCeeEEEeCCCCEEEEEeecCHHHHHHHHhc
Q 048021 28 MTKVLRKIKGASYTIDAEEGMAYITGRANPRKLLRKLQS 66 (143)
Q Consensus 28 IkkaL~kI~GVsV~VDle~~~VtV~G~vdp~~Ll~aI~k 66 (143)
|.+-|.+-=|....+|.+ ++++|.|..++..|.+.|++
T Consensus 38 v~kyl~~ELgt~g~id~~-~rlii~G~~~~~~i~~~l~~ 75 (110)
T smart00653 38 VLKFLLAELGTQGSIDGK-GRLIVNGRFTPKKLQDLLRR 75 (110)
T ss_pred HHHHHHHHhCCceeECCC-CeEEEEEeeCHHHHHHHHHH
Confidence 334444433555677766 99999999999999999988
No 100
>COG0602 NrdG Organic radical activating enzymes [Posttranslational modification, protein turnover, chaperones]
Probab=45.93 E-value=24 Score=28.46 Aligned_cols=30 Identities=27% Similarity=0.181 Sum_probs=22.4
Q ss_pred CCCEEEEEee-c----CHHHHHHHHhc-CCceEEee
Q 048021 45 EEGMAYITGR-A----NPRKLLRKLQS-GKYANLCW 74 (143)
Q Consensus 45 e~~~VtV~G~-v----dp~~Ll~aI~k-Gk~Ael~~ 74 (143)
...-|+++|- - +-..|+++|.+ |+++.+=.
T Consensus 71 ~~~~V~lTGGEP~~~~~l~~Ll~~l~~~g~~~~lET 106 (212)
T COG0602 71 KARGVSLTGGEPLLQPNLLELLELLKRLGFRIALET 106 (212)
T ss_pred CcceEEEeCCcCCCcccHHHHHHHHHhCCceEEecC
Confidence 4457888873 1 57889999998 99988644
No 101
>PF10262 Rdx: Rdx family; InterPro: IPR011893 This entry represents the Rdx family of selenoproteins, which includes mammalian selenoproteins SelW, SelV, SelT and SelH, bacterial SelW-like proteins and cysteine-containing proteins of unknown function in all three domains of life. Mammalian Rdx12 and its fish selenoprotein orthologues are also members of this family []. These proteins possess a thioredoxin-like fold and a conserved CXXC or CxxU (U is selenocysteine) motif near the N terminus, suggesting a redox function. Rdx proteins can use catalytic cysteine (or selenocysteine) to form transient mixed disulphides with substrate proteins. Selenium (Se) plays an essential role in cell survival and most of the effects of Se are probably mediated by selenoproteins. Selenoprotein W (SelW) plays an important role in protection of neurons from oxidative stress during neuronal development [], []. Selenoprotein T (SelT) is conserved from plants to humans. SelT is localized to the endoplasmic reticulum through a hydrophobic domain. The protein binds to UDP-glucose:glycoprotein glucosyltransferase (UGTR), the endoplasmic reticulum (ER)-resident protein, which is known to be involved in the quality control of protein folding [, ]. The function of SelT is unknown, although it may have a role in PACAP signaling during PC12 cell differentiation [, ]. Selenoprotein H (SelH) protects neurons against UVB-induced damage by inhibiting apoptotic cell death pathways, by preventing mitochondrial depolarization, and by promoting cell survival pathways [].; GO: 0008430 selenium binding, 0045454 cell redox homeostasis; PDB: 2OJL_B 2FA8_A 2P0G_C 2NPB_A 3DEX_C 2OKA_A 2OBK_G.
Probab=45.33 E-value=67 Score=21.43 Aligned_cols=12 Identities=8% Similarity=0.551 Sum_probs=8.3
Q ss_pred cCHHHHHHHHhc
Q 048021 55 ANPRKLLRKLQS 66 (143)
Q Consensus 55 vdp~~Ll~aI~k 66 (143)
.+++.|+++|++
T Consensus 64 P~~~~i~~~I~~ 75 (76)
T PF10262_consen 64 PDPDEIVQLIRD 75 (76)
T ss_dssp S-HHHHHHHHHH
T ss_pred CCHHHHHHHHhc
Confidence 378888888863
No 102
>cd03709 lepA_C lepA_C: This family represents the C-terminal region of LepA, a GTP-binding protein localized in the cytoplasmic membrane. LepA is ubiquitous in Bacteria and Eukaryota (e.g. Saccharomyces cerevisiae GUF1p), but is missing from Archaea. LepA exhibits significant homology to elongation factors (EFs) Tu and G. The function(s) of the proteins in this family are unknown. The N-terminal domain of LepA is homologous to a domain of similar size found in initiation factor 2 (IF2), and in EF-Tu and EF-G (factors required for translation in Escherichia coli). Two types of phylogenetic tree, rooted by other GTP-binding proteins, suggest that eukaryotic homologs (including S. cerevisiae GUF1) originated within the bacterial LepA family. LepA has never been observed in archaea, and eukaryl LepA is organellar. LepA is therefore a true bacterial GTPase, found only in the bacterial lineage.
Probab=45.31 E-value=92 Score=20.86 Aligned_cols=48 Identities=4% Similarity=0.190 Sum_probs=31.7
Q ss_pred eEEEEEEeecChhhHHHHHHHHhccCCeeEEEeC-CCCEEEEEeecCHHHHH
Q 048021 11 MTCGLKVDTNDPAWFSSMTKVLRKIKGASYTIDA-EEGMAYITGRANPRKLL 61 (143)
Q Consensus 11 ~tvvlkV~m~C~gC~~kIkkaL~kI~GVsV~VDl-e~~~VtV~G~vdp~~Ll 61 (143)
+.+++.|+ +.+..+|...|.+-.|.-+.++. .++.+.|++.++...++
T Consensus 4 ~~v~i~vP---~e~~G~V~~~l~~rrG~i~~~~~~~~~~~~i~~~~P~~~~~ 52 (80)
T cd03709 4 VKATIITP---SEYLGAIMELCQERRGVQKDMEYLDANRVMLTYELPLAEIV 52 (80)
T ss_pred EEEEEEeC---HHhhHHHHHHHHHhCCEEeccEecCCCeEEEEEECCHHHHh
Confidence 34444444 45666888888888886233333 33589999999877774
No 103
>PF02983 Pro_Al_protease: Alpha-lytic protease prodomain; InterPro: IPR004236 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. The alpha-lytic protease prodomain is associated with serine peptidases, specifically the alpha-lytic endopeptidases and streptogrisin A, B, C, D and E, which are bacterial enzymes and which belong to MEROPS peptidase subfamily S1A (IPR001316 from INTERPRO). The protease precursor in Gram-negative bacterial proteases may be a general property of extracellular bacterial proteases []. The proteases are encoded with a large (166 amino acid) N-terminal pro region that is required transiently both in vivo and in vitro for the correct folding of the protease domain [, ]. The pro region also acts as a potent inhibitor of the mature enzyme []. ; GO: 0008236 serine-type peptidase activity, 0006508 proteolysis, 0005576 extracellular region; PDB: 3PRO_C 2PRO_B 4PRO_C.
Probab=45.30 E-value=68 Score=20.99 Aligned_cols=19 Identities=16% Similarity=0.274 Sum_probs=15.8
Q ss_pred CCe-eEEEeCCCCEEEEEee
Q 048021 36 KGA-SYTIDAEEGMAYITGR 54 (143)
Q Consensus 36 ~GV-sV~VDle~~~VtV~G~ 54 (143)
.++ .+-||++.++|.|...
T Consensus 23 ~~~~~WyvD~~tn~VVV~a~ 42 (62)
T PF02983_consen 23 VAVTSWYVDPRTNKVVVTAD 42 (62)
T ss_dssp GCEEEEEEECCCTEEEEEEE
T ss_pred CCcceEEEeCCCCeEEEEEC
Confidence 345 7999999999999854
No 104
>PRK12596 putative monovalent cation/H+ antiporter subunit E; Reviewed
Probab=45.25 E-value=1.5e+02 Score=23.19 Aligned_cols=56 Identities=7% Similarity=0.080 Sum_probs=40.3
Q ss_pred ceEEEEEEeecChhhHHHHHHHHhccCCe-eEEEeCCCCEEEEEe--ecCHHHHHHHHh
Q 048021 10 KMTCGLKVDTNDPAWFSSMTKVLRKIKGA-SYTIDAEEGMAYITG--RANPRKLLRKLQ 65 (143)
Q Consensus 10 ~~tvvlkV~m~C~gC~~kIkkaL~kI~GV-sV~VDle~~~VtV~G--~vdp~~Ll~aI~ 65 (143)
+..+.+..+++-+.-.--+-..+.-.||- .+++|.+.++..|.. .-|++++++.|+
T Consensus 90 PgiV~vpl~l~sd~~~t~lAn~ITLTPGTltvdv~~d~~~L~VH~Ld~~~~e~~~~~i~ 148 (171)
T PRK12596 90 PGFVVIPLELQNRTALAVLACMITASPGTAWVDYNAARGILTIHVLDLEDAETWRHLIK 148 (171)
T ss_pred CeEEEEeccCCCHHHHHHHHHHHccCCCEEEEEEECCCCEEEEEEeeCCCHHHHHHHHH
Confidence 34455555567776666677777778996 899999999999985 346776666655
No 105
>COG1666 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=45.24 E-value=45 Score=26.61 Aligned_cols=45 Identities=7% Similarity=0.197 Sum_probs=34.4
Q ss_pred ChhhHHHHHHHHhccCCeeEEEeCCCCEEEEEeec--CHHHHHHHHhc
Q 048021 21 DPAWFSSMTKVLRKIKGASYTIDAEEGMAYITGRA--NPRKLLRKLQS 66 (143)
Q Consensus 21 C~gC~~kIkkaL~kI~GVsV~VDle~~~VtV~G~v--dp~~Ll~aI~k 66 (143)
-..-+++|.|.++. .+..|........|.|+|.- |-.+++..++.
T Consensus 105 ~~e~AKKI~KlIkD-sklKVqaqIQGdqVRVtgK~rDDLQaVialvr~ 151 (165)
T COG1666 105 ESENAKKIVKLIKD-SKLKVQAQIQGDQVRVTGKKRDDLQAVIALVRG 151 (165)
T ss_pred hHHHHHHHHHHHHh-cccceeeeeccceEEEeccchhhHHHHHHHHHh
Confidence 45567888887765 44567888889999999963 78888888876
No 106
>PF14535 AMP-binding_C_2: AMP-binding enzyme C-terminal domain; PDB: 2Y27_A 2Y4N_A 3QOV_B 3S89_D 3LAX_A 2Y4O_B.
Probab=45.12 E-value=84 Score=21.70 Aligned_cols=54 Identities=17% Similarity=0.287 Sum_probs=33.0
Q ss_pred HHHHHHHhccCCe-e---EEEeCCC--CEEEEEee----c-C--------HHHHHHHHhc--CCceEEeecCCCC
Q 048021 26 SSMTKVLRKIKGA-S---YTIDAEE--GMAYITGR----A-N--------PRKLLRKLQS--GKYANLCWVSAGN 79 (143)
Q Consensus 26 ~kIkkaL~kI~GV-s---V~VDle~--~~VtV~G~----v-d--------p~~Ll~aI~k--Gk~Ael~~~~~~~ 79 (143)
..|+.+|.+++|+ . +.++.++ ..++|.-. . + .++|.++|++ |..+.+..++++.
T Consensus 7 ~~Ie~vl~~~~~~~~~y~i~v~~~~~~D~l~v~vE~~~~~~~~~~~~~~l~~~i~~~lk~~lgv~~~V~lv~~gt 81 (96)
T PF14535_consen 7 SQIEEVLREFPEVSPEYQIVVTREGGLDELTVRVELRPGFSDDAEDLEALAERIAERLKERLGVRPEVELVPPGT 81 (96)
T ss_dssp HHHHHHHCTSTTEEEEEEEEEEEETTEEEEEEEEEESTTCCTTHHHHHHHHHHHHHHHHHHHSS-EEEEEE-TT-
T ss_pred HHHHHHHHhCcCCCCcEEEEEEcCCCCcEEEEEEEECCccCcchHHHHHHHHHHHHHHHhhcCceEEEEEECCCC
Confidence 4789999999998 2 5555433 34444422 1 1 3466667766 9998887777654
No 107
>TIGR01676 GLDHase galactonolactone dehydrogenase. This model represents L-Galactono-gamma-lactone dehydrogenase (EC 1.3.2.3). This enzyme catalyzes the final step in ascorbic acid biosynthesis in higher plants. This protein is homologous to ascorbic acid biosynthesis enzymes of other species: L-gulono-gamma-lactone oxidase in rat and L-galactono-gamma-lactone oxidase in yeast. All three covalently bind the cofactor FAD.
Probab=44.96 E-value=29 Score=32.24 Aligned_cols=32 Identities=13% Similarity=0.241 Sum_probs=28.8
Q ss_pred EEEeCCCCEEEEEeecCHHHHHHHHhc-CCceE
Q 048021 40 YTIDAEEGMAYITGRANPRKLLRKLQS-GKYAN 71 (143)
Q Consensus 40 V~VDle~~~VtV~G~vdp~~Ll~aI~k-Gk~Ae 71 (143)
++||.++++|+|.+-+.-.+|.++|.+ |....
T Consensus 117 l~vD~~~~tVtV~AG~~l~~L~~~L~~~Glal~ 149 (541)
T TIGR01676 117 LEVDEEKKRVRVQAGIRVQQLVDAIKEYGITLQ 149 (541)
T ss_pred EEEcCCCCEEEEcCCCCHHHHHHHHHHcCCEec
Confidence 588999999999999999999999999 97653
No 108
>PF08777 RRM_3: RNA binding motif; InterPro: IPR014886 This domain is found in protein La which functions as an RNA chaperone during RNA polymerase III transcription, and can also stimulate translation initiation. It contains a five stranded beta sheet which forms an atypical RNA recognition motif []. ; PDB: 1OWX_A.
Probab=44.56 E-value=89 Score=22.46 Aligned_cols=54 Identities=17% Similarity=0.317 Sum_probs=29.3
Q ss_pred EEEEEe-ecChhhHHHHHHHHhccCCee-EEEeCCCCEEEEE--eecCHHHHHHHHhc
Q 048021 13 CGLKVD-TNDPAWFSSMTKVLRKIKGAS-YTIDAEEGMAYIT--GRANPRKLLRKLQS 66 (143)
Q Consensus 13 vvlkV~-m~C~gC~~kIkkaL~kI~GVs-V~VDle~~~VtV~--G~vdp~~Ll~aI~k 66 (143)
|+|++. +.=+-+...|+.+|+....|. |++..-+.+-.|. ..-.+.++++++..
T Consensus 2 ~il~~~g~~~~~~re~iK~~f~~~g~V~yVD~~~G~~~g~VRf~~~~~A~~a~~~~~~ 59 (105)
T PF08777_consen 2 CILKFSGLGEPTSREDIKEAFSQFGEVAYVDFSRGDTEGYVRFKTPEAAQKALEKLKE 59 (105)
T ss_dssp -EEEEEE--SS--HHHHHHHT-SS--EEEEE--TT-SEEEEEESS---HHHHHHHHHH
T ss_pred eEEEEecCCCCcCHHHHHHHHHhcCCcceEEecCCCCEEEEEECCcchHHHHHHHHHh
Confidence 567776 444445899999999988884 7777665555555 45567788887765
No 109
>PRK14812 hypothetical protein; Provisional
Probab=44.56 E-value=59 Score=24.43 Aligned_cols=31 Identities=13% Similarity=-0.076 Sum_probs=21.1
Q ss_pred hhHHHHHHHHhccCCeeEEEeCCCCEEEEEe
Q 048021 23 AWFSSMTKVLRKIKGASYTIDAEEGMAYITG 53 (143)
Q Consensus 23 gC~~kIkkaL~kI~GVsV~VDle~~~VtV~G 53 (143)
-|...+.+....-+|..++||+++++|+..+
T Consensus 44 ~~~~~~~~l~~~~~g~~i~vDL~~~~v~~~~ 74 (119)
T PRK14812 44 QPREVREKLAQLKPTDQVTVDLEQQKIISPV 74 (119)
T ss_pred CCHHHHHHHhccCCCCEEEEEcCCCEEEcCC
Confidence 4445555443334566899999999998754
No 110
>PRK07334 threonine dehydratase; Provisional
Probab=44.55 E-value=80 Score=27.48 Aligned_cols=62 Identities=16% Similarity=0.220 Sum_probs=40.6
Q ss_pred EEEEEEee-cChhhHHHHHHHHhccCC-e-eEEEeC-----CCCEEEEE--eec-C---HHHHHHHHhc-CCceEEe
Q 048021 12 TCGLKVDT-NDPAWFSSMTKVLRKIKG-A-SYTIDA-----EEGMAYIT--GRA-N---PRKLLRKLQS-GKYANLC 73 (143)
Q Consensus 12 tvvlkV~m-~C~gC~~kIkkaL~kI~G-V-sV~VDl-----e~~~VtV~--G~v-d---p~~Ll~aI~k-Gk~Ael~ 73 (143)
++.|.|.. .=.|=-.+|.++|.+.+. | ++.... +++++.+. =.+ | -+.|+++|++ |+.++|+
T Consensus 326 ~v~l~I~~~dr~GlL~dI~~~is~~~~nI~~v~~~~~~~~~~~~~~~i~l~i~V~d~~~L~~vi~~Lr~~g~~~~~~ 402 (403)
T PRK07334 326 LARLRVDIRDRPGALARVTALIGEAGANIIEVSHQRLFTDLPAKGAELELVIETRDAAHLQEVIAALRAAGFEARLV 402 (403)
T ss_pred EEEEEEEeCCCCCHHHHHHHHHhhCCCceEEEEEEecccCCCCCeEEEEEEEEeCCHHHHHHHHHHHHHcCCeeEeC
Confidence 46777775 455668888888876532 2 455543 24565544 222 4 4588999999 9999985
No 111
>PF00712 DNA_pol3_beta: DNA polymerase III beta subunit, N-terminal domain; InterPro: IPR022634 This entry describes the N-terminal domain of the beta chain of DNA polymerase III. This is a complex, multichain enzyme responsible for most of the replicative synthesis in bacteria. The beta chain is required for initiation of replication from an RNA primer, nucleotide triphosphate (dNTP) residues being added to the 5'-end of the growing DNA chain.; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0008408 3'-5' exonuclease activity, 0006260 DNA replication, 0009360 DNA polymerase III complex; PDB: 3P16_A 3RB9_B 2AWA_C 1VPK_A 2XUR_B 3Q4K_A 3BEP_A 3D1G_A 1UNN_B 3Q4J_D ....
Probab=44.10 E-value=38 Score=24.22 Aligned_cols=29 Identities=21% Similarity=0.329 Sum_probs=20.7
Q ss_pred HHHHHHHHhccCC-eeEEEeCCCCEEEEEe
Q 048021 25 FSSMTKVLRKIKG-ASYTIDAEEGMAYITG 53 (143)
Q Consensus 25 ~~kIkkaL~kI~G-VsV~VDle~~~VtV~G 53 (143)
++++...++++++ ..|++..+++.++|.+
T Consensus 72 ak~l~~ivk~lp~~~~I~i~~~~~~l~I~~ 101 (120)
T PF00712_consen 72 AKKLFDIVKKLPDDEEITIEVEENKLTIKS 101 (120)
T ss_dssp HHHHHHHHHHSSSTSEEEEEEETTEEEEEE
T ss_pred hHHHHHHHHhCCCCCeEEEEEeCCEEEEEE
Confidence 6777777788777 4666666667777763
No 112
>PHA02757 hypothetical protein; Provisional
Probab=43.86 E-value=1.1e+02 Score=21.26 Aligned_cols=47 Identities=21% Similarity=0.291 Sum_probs=34.8
Q ss_pred CCCCCceEEEEE------EeecChhhHHHHHHHHhccCCe-eEEEeCCCCEEEEE
Q 048021 5 SSSSAKMTCGLK------VDTNDPAWFSSMTKVLRKIKGA-SYTIDAEEGMAYIT 52 (143)
Q Consensus 5 s~~s~~~tvvlk------V~m~C~gC~~kIkkaL~kI~GV-sV~VDle~~~VtV~ 52 (143)
||+.+...|++. .++.|+--..++.+.|.+ +-+ -|++|.+.|...+.
T Consensus 9 SskPprnvCViTPsgpfDFni~C~v~Qe~~~eildk-dK~CiiE~de~sGml~s~ 62 (75)
T PHA02757 9 SSKPPRNVCVITPSGPFDFNIACGVDQEKANEILDK-DKACIIEIDEDSGMLFSQ 62 (75)
T ss_pred CCCCCCCEEEEeCCCCccEEEEcCccHHHHHHHhcc-cceEEEEEcCCCCcEEEe
Confidence 444445556543 567899889999998876 445 69999999988876
No 113
>PF01963 TraB: TraB family; InterPro: IPR002816 In prokaryotes, for example Enterococcus faecalis (Streptococcus faecalis), the conjugative transfer of certain plasmids is controlled by peptide pheromones []. Plasmid free recipient cells secret plasmid specific oligopeptides, termed sex pheromones. They induce bacterial clumping and specifically activate the conjugative transfer of the corresponding plasmid. Once recipient cells acquire the plasmid they start to produce a pheromone inhibitor to block the activity of the pheromone and to prevent plasmid containing cells from clumping; they also become donor cells able to transfer the plasmid to plasmid free recipient cells. Examples of such plasmid-pheromone systems are bacteriocin plasmid pPD1 [], haemolysin/bacteriocin plasmid, pAD1 [], tetracycline-resistance plasmid, pCF10 [], and the haemolysin/bacteriocin plasmid, pOB1 []. TraB in combination with another factor contributes to pheromone shutdown in cells that have acquired a plasmid. It exact function has not yet been determined [, ]. This entry also contains plant and mammalian proteins, suggesting that these Trab-related proteins may have a somewhat wider or different function in eukaryotes.
Probab=43.65 E-value=51 Score=25.98 Aligned_cols=39 Identities=15% Similarity=0.336 Sum_probs=27.9
Q ss_pred hHHHHHHHHhccCCeeEEEeCCCCEEEEEee---cCHHHHHHHHhc-CCceEE
Q 048021 24 WFSSMTKVLRKIKGASYTIDAEEGMAYITGR---ANPRKLLRKLQS-GKYANL 72 (143)
Q Consensus 24 C~~kIkkaL~kI~GVsV~VDle~~~VtV~G~---vdp~~Ll~aI~k-Gk~Ael 72 (143)
++++|.++|++ ....+.|.|. .-++-|++.|++ ||+|+-
T Consensus 216 ~~~~i~~~l~~----------~~~~fvvVGa~HL~G~~gvl~lLr~~Gy~V~~ 258 (259)
T PF01963_consen 216 WAEKIEELLKE----------GGTVFVVVGAGHLPGEDGVLDLLRKKGYTVEP 258 (259)
T ss_pred HHHHHHHHHhc----------CCCEEEEEcchhccchhhHHHHHHhCCceeec
Confidence 55666665544 2346777775 478899999999 999874
No 114
>PF03646 FlaG: FlaG protein; InterPro: IPR005186 Although these proteins are known to be important for flagellar their exact function is unknown.; PDB: 2HC5_A.
Probab=43.60 E-value=74 Score=22.45 Aligned_cols=42 Identities=19% Similarity=0.472 Sum_probs=26.9
Q ss_pred HHHHHHHHhccC-CeeEEEeCCCCEEEEE------e----ecCHHHHHHHHhc
Q 048021 25 FSSMTKVLRKIK-GASYTIDAEEGMAYIT------G----RANPRKLLRKLQS 66 (143)
Q Consensus 25 ~~kIkkaL~kI~-GVsV~VDle~~~VtV~------G----~vdp~~Ll~aI~k 66 (143)
..+|.+.|..+. .+.+.+|.+.+.+.|+ | .++|+++++..++
T Consensus 43 v~~ln~~~~~~~~~l~F~vde~~~~~vVkViD~~T~eVIRqIP~Ee~l~l~~~ 95 (107)
T PF03646_consen 43 VEKLNEFLQALNTSLRFSVDEESGRVVVKVIDKETGEVIRQIPPEELLDLAKR 95 (107)
T ss_dssp HHHHHHHHTTSS--EEEEEEEETTEEEEEEEETTT-SEEEEE-HHHHHHHHHH
T ss_pred HHHHHHHHHhcCCceEEEEecCCCcEEEEEEECCCCcEEEeCCcHHHHHHHHH
Confidence 344555554432 2268999999999887 3 3589998887765
No 115
>cd04886 ACT_ThrD-II-like C-terminal ACT domain of biodegradative (catabolic) threonine dehydratase II (ThrD-II) and other related ACT domains. This CD includes the C-terminal ACT domain of biodegradative (catabolic) threonine dehydratase II (ThrD-II) and other related ACT domains. The Escherichia coli tdcB gene product, ThrD-II, anaerobically catalyzes the pyridoxal phosphate-dependent dehydration of L-threonine and L-serine to ammonia and to alpha-ketobutyrate and pyruvate, respectively. Tetrameric ThrD-II is subject to allosteric activation by AMP, inhibition by alpha-keto acids, and catabolite inactivation by several metabolites of glycolysis and the citric acid cycle. Also included in this CD are N-terminal ACT domains present in smaller (~170 a.a.) archaeal proteins of unknown function. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=43.56 E-value=56 Score=19.83 Aligned_cols=50 Identities=14% Similarity=0.236 Sum_probs=28.2
Q ss_pred hhhHHHHHHHHhccCC-e-eEEEeCC-----CCEEEE----Eee--cCHHHHHHHHhc-CCceE
Q 048021 22 PAWFSSMTKVLRKIKG-A-SYTIDAE-----EGMAYI----TGR--ANPRKLLRKLQS-GKYAN 71 (143)
Q Consensus 22 ~gC~~kIkkaL~kI~G-V-sV~VDle-----~~~VtV----~G~--vdp~~Ll~aI~k-Gk~Ae 71 (143)
.|=-.+|.++|.+..+ | ++..... .+...+ ... .+.+.|+++|++ |+++.
T Consensus 9 ~G~L~~i~~~i~~~~~nI~~i~~~~~~~~~~~~~~~~~i~v~~~~~~~l~~l~~~l~~~g~~~~ 72 (73)
T cd04886 9 PGQLAKLLAVIAEAGANIIEVSHDRAFKTLPLGEVEVELTLETRGAEHIEEIIAALREAGYDVR 72 (73)
T ss_pred CChHHHHHHHHHHcCCCEEEEEEEeccCCCCCceEEEEEEEEeCCHHHHHHHHHHHHHcCCEEe
Confidence 4445677787876433 2 3444332 344332 231 233589999999 99764
No 116
>cd06475 ACD_HspB1_like Alpha crystallin domain (ACD) found in mammalian small (s)heat shock protein (Hsp)-27 (also denoted HspB1 in human) and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. Hsp27 shows enhanced synthesis in response to stress. It is a molecular chaperone which interacts with a large number of different proteins. It is found in many types of human cells including breast, uterus, cervix, platelets and cancer cells. Hsp27 has diverse cellular functions including, chaperoning, regulation of actin polymerization, keratinocyte differentiation, regulation of inflammatory pathways in keratinocytes, and protection from oxidative stress through modulating glutathione levels. It is also a subunit of AUF1-containing protein complexes. It has been linked to several transduction pathways regulating cellular functions including differentiat
Probab=43.56 E-value=22 Score=24.51 Aligned_cols=22 Identities=18% Similarity=0.450 Sum_probs=18.8
Q ss_pred ccCCe---eEEEeCCCCEEEEEeec
Q 048021 34 KIKGA---SYTIDAEEGMAYITGRA 55 (143)
Q Consensus 34 kI~GV---sV~VDle~~~VtV~G~v 55 (143)
.++|+ +++|+.++++++|+|+.
T Consensus 17 dlPG~~~edi~V~v~~~~L~I~g~~ 41 (86)
T cd06475 17 DVNHFAPEELVVKTKDGVVEITGKH 41 (86)
T ss_pred ECCCCCHHHEEEEEECCEEEEEEEE
Confidence 46888 39999999999999974
No 117
>PRK10560 hofQ outer membrane porin HofQ; Provisional
Probab=43.32 E-value=78 Score=27.52 Aligned_cols=46 Identities=9% Similarity=0.151 Sum_probs=36.9
Q ss_pred HHHHHHHhccCCeeEEEeCC-CCEEEEEe-ecCHHHHHHHHhc--CCceE
Q 048021 26 SSMTKVLRKIKGASYTIDAE-EGMAYITG-RANPRKLLRKLQS--GKYAN 71 (143)
Q Consensus 26 ~kIkkaL~kI~GVsV~VDle-~~~VtV~G-~vdp~~Ll~aI~k--Gk~Ae 71 (143)
+.|-+.|.++-|.++-+|.+ +|+|++.- .++.+++++.|.+ |+..+
T Consensus 8 ~~vl~~la~~~g~NiVidp~V~G~VTl~~~~V~~~qal~~iL~~~gl~~~ 57 (386)
T PRK10560 8 AQVLQALAEQEKLNLVVSPDVSGTVSLHLTDVPWKQALQTVVKSAGLILR 57 (386)
T ss_pred HHHHHHHHHhcCceEEECCCCcceEEEEEeCCCHHHHHHHHHHhCCCcEE
Confidence 45666788888888888887 88999985 5788999988887 87665
No 118
>PRK00007 elongation factor G; Reviewed
Probab=43.30 E-value=93 Score=29.26 Aligned_cols=58 Identities=21% Similarity=0.264 Sum_probs=41.8
Q ss_pred EEEeecChhhHHHHHHHHhccCC----eeEEEeCCCCEEEEEe--ecCHHHHHHHHhc--CCceEE
Q 048021 15 LKVDTNDPAWFSSMTKVLRKIKG----ASYTIDAEEGMAYITG--RANPRKLLRKLQS--GKYANL 72 (143)
Q Consensus 15 lkV~m~C~gC~~kIkkaL~kI~G----VsV~VDle~~~VtV~G--~vdp~~Ll~aI~k--Gk~Ael 72 (143)
..|.-.-.+...++.++|++|.- +.|.+|.+.++..|.| ++--+.+++.|++ |..+++
T Consensus 411 ~aIep~~~~d~~kL~~aL~~L~~eDpsl~v~~~~etge~~l~g~GelHLei~~~rL~~~~~vev~~ 476 (693)
T PRK00007 411 VAVEPKTKADQEKMGIALQKLAEEDPSFRVSTDEETGQTIIAGMGELHLDIIVDRMKREFKVEANV 476 (693)
T ss_pred EEEEECCcccHHHHHHHHHHHHHhCCeEEEEEcCCCCCEEEEEecHHhHHHHHHHHHHHhCCeeEe
Confidence 33433445678999999998732 3577788888888885 5678999999988 655543
No 119
>cd04901 ACT_3PGDH C-terminal ACT (regulatory) domain of D-3-Phosphoglycerate Dehydrogenase (3PGDH) found in fungi and bacteria. The C-terminal ACT (regulatory) domain of D-3-Phosphoglycerate Dehydrogenase (3PGDH) found in fungi and bacteria. 3PGDH is an enzyme that belongs to the D-isomer specific, 2-hydroxyacid dehydrogenase family and catalyzes the oxidation of D-3-phosphoglycerate to 3- phosphohydroxypyruvate, which is the first step in the biosynthesis of L-serine, using NAD+ as the oxidizing agent. In Escherichia coli, the SerA 3PGDH is feedback-controlled by the end product L-serine in an allosteric manner. In the homotetrameric enzyme, the interface at adjacent ACT (regulatory) domains couples to create an extended beta-sheet. Each regulatory interface forms two serine-binding sites. The mechanism by which serine transmits inhibition to the active site is postulated to involve the tethering of the regulatory domains together to create a rigid quaternary structure with a solvent-
Probab=43.24 E-value=50 Score=20.49 Aligned_cols=47 Identities=15% Similarity=0.289 Sum_probs=23.4
Q ss_pred ecChhhHHHHHHHHhccCCe---eEEEeCCCCEEEEEeec---CHHHHHHHHhc
Q 048021 19 TNDPAWFSSMTKVLRKIKGA---SYTIDAEEGMAYITGRA---NPRKLLRKLQS 66 (143)
Q Consensus 19 m~C~gC~~kIkkaL~kI~GV---sV~VDle~~~VtV~G~v---dp~~Ll~aI~k 66 (143)
..-.|.-.+|...|.+. |+ .+......+...+.=.+ ..+.++++|++
T Consensus 7 ~d~~g~l~~i~~~l~~~-~~nI~~~~~~~~~~~a~~~~~~~~~~l~~li~~l~~ 59 (69)
T cd04901 7 KNVPGVLGQINTILAEH-NINIAAQYLQTRGEIGYVVIDIDSEVSEELLEALRA 59 (69)
T ss_pred cCCCcHHHHHHHHHHHc-CCCHHHHhccCCCCEEEEEEEcCCCCCHHHHHHHHc
Confidence 34455666666666553 43 24443333443332111 56677777766
No 120
>PRK06027 purU formyltetrahydrofolate deformylase; Reviewed
Probab=43.23 E-value=1.3e+02 Score=25.40 Aligned_cols=65 Identities=8% Similarity=0.010 Sum_probs=41.1
Q ss_pred eEEEEEEee-cChhhHHHHHHHHhccC---CeeEEEeCC--CCEEEEE--ee-cCHHHHHHHHhc-CCceEEeec
Q 048021 11 MTCGLKVDT-NDPAWFSSMTKVLRKIK---GASYTIDAE--EGMAYIT--GR-ANPRKLLRKLQS-GKYANLCWV 75 (143)
Q Consensus 11 ~tvvlkV~m-~C~gC~~kIkkaL~kI~---GVsV~VDle--~~~VtV~--G~-vdp~~Ll~aI~k-Gk~Ael~~~ 75 (143)
-.+.+.|+. .++.....++.+|.++. |+++.++.. ..++.|- |. .+-+.|+++++. ...+++..+
T Consensus 48 F~m~i~v~~~~~~~~~~~L~~~L~~l~~~l~l~i~l~~~~~~~ri~vl~Sg~gsnl~al~~~~~~~~~~~~i~~v 122 (286)
T PRK06027 48 FFMRVEFEGDGLIFNLETLRADFAALAEEFEMDWRLLDSAERKRVVILVSKEDHCLGDLLWRWRSGELPVEIAAV 122 (286)
T ss_pred EEEEEEEEeCCCCCCHHHHHHHHHHHHHHhCCEEEEcccccCcEEEEEEcCCCCCHHHHHHHHHcCCCCcEEEEE
Confidence 345555666 45556889999998886 445555433 3444443 43 478999999888 555665554
No 121
>TIGR00914 2A0601 heavy metal efflux pump (cobalt-zinc-cadmium). This model represents a family of H+/heavy metal cation antiporters. This family is one of several subfamilies within the scope of pfam model pfam00873.
Probab=43.05 E-value=37 Score=33.43 Aligned_cols=60 Identities=13% Similarity=0.169 Sum_probs=41.4
Q ss_pred EEEEEEeec----ChhhHHHHHHHHhccCCe-eEEEeCCC--CEEEEEe--------ecCHHHHHHHHhc---CCceE
Q 048021 12 TCGLKVDTN----DPAWFSSMTKVLRKIKGA-SYTIDAEE--GMAYITG--------RANPRKLLRKLQS---GKYAN 71 (143)
Q Consensus 12 tvvlkV~m~----C~gC~~kIkkaL~kI~GV-sV~VDle~--~~VtV~G--------~vdp~~Ll~aI~k---Gk~Ae 71 (143)
.+++++.+. =..-+++|++.|++++|| +|..|.+. .++.|+= -++++.|.+.|+. |..+.
T Consensus 683 ~i~i~l~G~d~~~L~~~a~~v~~~l~~~pgv~~v~~~~~~~~~e~~i~id~~~~~~~Glt~~~v~~~l~~~~~g~~~g 760 (1051)
T TIGR00914 683 DVAVKVFGDDLDDLDATAEKISAVLKGVPGAADVKVEQTTGLPYLTVEIDREKAARYGLTVGDVQDTVATAVGGRMSG 760 (1051)
T ss_pred CeEEEEECCCHHHHHHHHHHHHHHHhcCCCceeeeccccCCCceEEEEECHHHHHHcCCCHHHHHHHHHHHhCCceee
Confidence 466666542 346678899999999999 79887754 3444441 1577888888876 65543
No 122
>PF04461 DUF520: Protein of unknown function (DUF520); InterPro: IPR007551 This entry represents the UPF0234 family of uncharacterised proteins.; PDB: 1IN0_A.
Probab=42.87 E-value=53 Score=26.10 Aligned_cols=52 Identities=10% Similarity=0.116 Sum_probs=34.6
Q ss_pred ecChhhHHHHHHHHhccCCeeEEEeCCCCEEEEEee--cCHHHHHHHHhc-CCceE
Q 048021 19 TNDPAWFSSMTKVLRKIKGASYTIDAEEGMAYITGR--ANPRKLLRKLQS-GKYAN 71 (143)
Q Consensus 19 m~C~gC~~kIkkaL~kI~GVsV~VDle~~~VtV~G~--vdp~~Ll~aI~k-Gk~Ae 71 (143)
+--...+++|.+.++.. +..|.......+|.|+|. -|-.++++.|++ .+...
T Consensus 99 GI~~d~AKkIvK~IKd~-klKVqa~IQgd~vRVtgKkrDDLQ~viallk~~d~~~p 153 (160)
T PF04461_consen 99 GIDQDTAKKIVKLIKDS-KLKVQAQIQGDQVRVTGKKRDDLQEVIALLKEQDLGIP 153 (160)
T ss_dssp S--HHHHHHHHHHHHHH---SEEEEEETTEEEEEES-HHHHHHHHHHHHHS--SS-
T ss_pred ccCHHHHHHHHHHHHhc-CCceeEEecCcEEEEecCCHHHHHHHHHHHHcccCCCC
Confidence 34566788888888763 446788888999999995 367778888877 44443
No 123
>TIGR00503 prfC peptide chain release factor 3. This translation releasing factor, RF-3 (prfC) was originally described as stop codon-independent, in contrast to peptide chain release factor 1 (RF-1, prfA) and RF-2 (prfB). RF-1 and RF-2 are closely related to each other, while RF-3 is similar to elongation factors EF-Tu and EF-G; RF-1 is active at UAA and UAG and RF-2 is active at UAA and UGA. More recently, RF-3 was shown to be active primarily at UGA stop codons in E. coli. All bacteria and organelles have RF-1. The Mycoplasmas and organelles, which translate UGA as Trp rather than as a stop codon, lack RF-2. RF-3, in contrast, seems to be rare among bacteria and is found so far only in Escherichia coli and some other gamma subdivision Proteobacteria, in Synechocystis PCC6803, and in Staphylococcus aureus.
Probab=42.40 E-value=1e+02 Score=28.29 Aligned_cols=62 Identities=15% Similarity=0.094 Sum_probs=47.1
Q ss_pred EEEEEEeecChhhHHHHHHHHhcc--CCe-eEEEeCCCCEEEEE--eecCHHHHHHHHhc--CCceEEe
Q 048021 12 TCGLKVDTNDPAWFSSMTKVLRKI--KGA-SYTIDAEEGMAYIT--GRANPRKLLRKLQS--GKYANLC 73 (143)
Q Consensus 12 tvvlkV~m~C~gC~~kIkkaL~kI--~GV-sV~VDle~~~VtV~--G~vdp~~Ll~aI~k--Gk~Ael~ 73 (143)
.+...|...-.+-.+++.++|.+| +|. .|..|.+.+...|. |.+--+.++++|++ |-.+++-
T Consensus 394 ~~~~~v~~~~~~d~~kl~~aL~~L~eED~l~v~~~~~t~e~il~g~GelhleV~~~RL~~ey~v~v~~~ 462 (527)
T TIGR00503 394 ELFRRIRLKDPLKQKQLLKGLVQLSEEGAVQVFRPLDNNDLIVGAVGVLQFDVVVYRLKEEYNVEARYE 462 (527)
T ss_pred ceEEEEEECChhhHHHHHHHHHHHHhhCCeEEEEcCCCCCEEEEEEeHHHHHHHHHHHHHHhCCeEEEe
Confidence 445566677788899999999988 444 57778888887777 46778999999988 6665543
No 124
>PF01936 NYN: NYN domain; InterPro: IPR021139 This highly conserved domain has no known function. However it contains many conserved aspartates, suggesting an enzymatic function such as an endonuclease or glycosyl hydrolase.; PDB: 2QIP_A.
Probab=42.37 E-value=23 Score=25.24 Aligned_cols=30 Identities=23% Similarity=0.401 Sum_probs=21.1
Q ss_pred CCEEEEEeecCHHHHHHHHhc-CCceEEeec
Q 048021 46 EGMAYITGRANPRKLLRKLQS-GKYANLCWV 75 (143)
Q Consensus 46 ~~~VtV~G~vdp~~Ll~aI~k-Gk~Ael~~~ 75 (143)
+.-|.|+|..|-.-+++.|++ |+++.+++.
T Consensus 97 d~ivLvSgD~Df~~~v~~l~~~g~~V~v~~~ 127 (146)
T PF01936_consen 97 DTIVLVSGDSDFAPLVRKLRERGKRVIVVGA 127 (146)
T ss_dssp SEEEEE---GGGHHHHHHHHHH--EEEEEE-
T ss_pred CEEEEEECcHHHHHHHHHHHHcCCEEEEEEe
Confidence 556777899999999999999 999999995
No 125
>PF00462 Glutaredoxin: Glutaredoxin; InterPro: IPR002109 Glutaredoxins [, , ], also known as thioltransferases (disulphide reductases, are small proteins of approximately one hundred amino-acid residues which utilise glutathione and NADPH as cofactors. Oxidized glutathione is regenerated by glutathione reductase. Together these components compose the glutathione system []. Glutaredoxin functions as an electron carrier in the glutathione-dependent synthesis of deoxyribonucleotides by the enzyme ribonucleotide reductase. Like thioredoxin, which functions in a similar way, glutaredoxin possesses an active centre disulphide bond []. It exists in either a reduced or an oxidized form where the two cysteine residues are linked in an intramolecular disulphide bond. Glutaredoxin has been sequenced in a variety of species. On the basis of extensive sequence similarity, it has been proposed [] that Vaccinia virus protein O2L is most probably a glutaredoxin. Finally, it must be noted that Bacteriophage T4 thioredoxin seems also to be evolutionary related. In position 5 of the pattern T4 thioredoxin has Val instead of Pro. This entry represents Glutaredoxin.; GO: 0009055 electron carrier activity, 0015035 protein disulfide oxidoreductase activity, 0045454 cell redox homeostasis; PDB: 1QFN_A 1GRX_A 1EGO_A 1EGR_A 3RHC_A 3RHB_A 3IPZ_A 1NHO_A 3GX8_A 3D5J_A ....
Probab=42.26 E-value=81 Score=19.34 Aligned_cols=27 Identities=22% Similarity=0.310 Sum_probs=17.7
Q ss_pred ecChhhHHHHHHHHhccCCee-EEEeCCCC
Q 048021 19 TNDPAWFSSMTKVLRKIKGAS-YTIDAEEG 47 (143)
Q Consensus 19 m~C~gC~~kIkkaL~kI~GVs-V~VDle~~ 47 (143)
-.|+.|. ++++.|++ .|+. -.+|+++.
T Consensus 7 ~~C~~C~-~~~~~L~~-~~i~y~~~dv~~~ 34 (60)
T PF00462_consen 7 PGCPYCK-KAKEFLDE-KGIPYEEVDVDED 34 (60)
T ss_dssp TTSHHHH-HHHHHHHH-TTBEEEEEEGGGS
T ss_pred CCCcCHH-HHHHHHHH-cCCeeeEcccccc
Confidence 4799998 56777766 5674 44455443
No 126
>PRK10743 heat shock protein IbpA; Provisional
Probab=41.81 E-value=21 Score=27.07 Aligned_cols=22 Identities=18% Similarity=0.407 Sum_probs=19.0
Q ss_pred ccCCe---eEEEeCCCCEEEEEeec
Q 048021 34 KIKGA---SYTIDAEEGMAYITGRA 55 (143)
Q Consensus 34 kI~GV---sV~VDle~~~VtV~G~v 55 (143)
.++|| +++|+++++.++|+|+.
T Consensus 52 elPGv~kedi~V~v~~~~LtI~ge~ 76 (137)
T PRK10743 52 AVAGFAESELEITAQDNLLVVKGAH 76 (137)
T ss_pred ECCCCCHHHeEEEEECCEEEEEEEE
Confidence 57999 39999999999999974
No 127
>cd06477 ACD_HspB3_Like Alpha crystallin domain (ACD) found in mammalian HspB3, also known as heat-shock protein 27-like protein (HSPL27, 17-kDa) and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. HspB3 is expressed in adult skeletal muscle, smooth muscle, and heart, and in several other fetal tissues. In muscle cells HspB3 forms an oligomeric 150 kDa complex with myotonic dystrophy protein kinase-binding protein (MKBP/ HspB2), this complex may comprise one of two independent muscle-cell specific chaperone systems. The expression of HspB3 is induced during muscle differentiation controlled by the myogenic factor MyoD. HspB3 may also interact with Hsp22 (HspB8).
Probab=41.58 E-value=23 Score=24.60 Aligned_cols=24 Identities=21% Similarity=0.236 Sum_probs=20.1
Q ss_pred ccCCe---eEEEeCCCCEEEEEeecCH
Q 048021 34 KIKGA---SYTIDAEEGMAYITGRANP 57 (143)
Q Consensus 34 kI~GV---sV~VDle~~~VtV~G~vdp 57 (143)
.++|+ +++|+++.+.++|+|....
T Consensus 14 dlpG~~~edI~V~v~~~~L~I~ge~~~ 40 (83)
T cd06477 14 DVVQFRPEDIIIQVFEGWLLIKGQHGV 40 (83)
T ss_pred EcCCCCHHHeEEEEECCEEEEEEEEcc
Confidence 57898 3999999999999997533
No 128
>KOG0125 consensus Ataxin 2-binding protein (RRM superfamily) [General function prediction only]
Probab=41.29 E-value=72 Score=28.46 Aligned_cols=50 Identities=14% Similarity=0.230 Sum_probs=34.6
Q ss_pred HHHHHHhccCCe-eEEEeCCC------CEEEEEeecCHHHHHHHHhc----CCceEEeecC
Q 048021 27 SMTKVLRKIKGA-SYTIDAEE------GMAYITGRANPRKLLRKLQS----GKYANLCWVS 76 (143)
Q Consensus 27 kIkkaL~kI~GV-sV~VDle~------~~VtV~G~vdp~~Ll~aI~k----Gk~Ael~~~~ 76 (143)
.++.-..+.--| +|+|-..+ |=||.+...|.++-+++|-. |+++|+--..
T Consensus 112 DL~aMF~kfG~VldVEIIfNERGSKGFGFVTmen~~dadRARa~LHgt~VEGRkIEVn~AT 172 (376)
T KOG0125|consen 112 DLRAMFEKFGKVLDVEIIFNERGSKGFGFVTMENPADADRARAELHGTVVEGRKIEVNNAT 172 (376)
T ss_pred cHHHHHHhhCceeeEEEEeccCCCCccceEEecChhhHHHHHHHhhcceeeceEEEEeccc
Confidence 344444554334 56664322 56888888999999999976 9999986654
No 129
>PF01565 FAD_binding_4: FAD binding domain This is only a subset of the Pfam family; InterPro: IPR006094 Various enzymes use FAD as a co-factor, most of these enzymes are oxygen-dependent oxidoreductases, containing a covalently bound FAD group which is attached to a histidine via an 8-alpha-(N3-histidyl)-riboflavin linkage. One of the enzymes Vanillyl-alcohol oxidase (VAO, 1.1.3.38 from EC) has a solved structure, the alignment includes the FAD binding site, called the PP-loop, between residues 99-110 []. The FAD molecule is covalently bound in the known structure, however the residue that links to the FAD is not in the alignment. VAO catalyses the oxidation of a wide variety of substrates, ranging from aromatic amines to 4-alkylphenols. ; GO: 0008762 UDP-N-acetylmuramate dehydrogenase activity, 0016491 oxidoreductase activity, 0050660 flavin adenine dinucleotide binding, 0055114 oxidation-reduction process; PDB: 1ZR6_A 3HSU_A 2AXR_A 3D2J_A 3D2H_A 3FW9_A 3FW8_A 3FW7_A 3GSY_A 3FWA_A ....
Probab=41.19 E-value=32 Score=24.60 Aligned_cols=32 Identities=22% Similarity=0.251 Sum_probs=28.4
Q ss_pred EEEeCCCCEEEEEeecCHHHHHHHHhc-CCceE
Q 048021 40 YTIDAEEGMAYITGRANPRKLLRKLQS-GKYAN 71 (143)
Q Consensus 40 V~VDle~~~VtV~G~vdp~~Ll~aI~k-Gk~Ae 71 (143)
+++|.+.++|+|.+-+...+|.+.|.+ |....
T Consensus 57 ~~id~~~~~v~v~aG~~~~~l~~~l~~~g~~~~ 89 (139)
T PF01565_consen 57 IEIDPENGTVTVGAGVTWGDLYEALAPRGLMLP 89 (139)
T ss_dssp EEEETTTTEEEEETTSBHHHHHHHHHHHTEEES
T ss_pred ccccccceeEEEeccccchhccccccccccccc
Confidence 689999999999998999999999988 87554
No 130
>cd06497 ACD_alphaA-crystallin_HspB4 Alpha-crystallin domain found in the small heat shock protein (sHsp) alphaA-crystallin (HspB4, 20kDa). sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. Alpha crystallin, an abundant protein in the mammalian lens, is a large (700 kDa) heteropolymer composed of HspB4 and HspB5, generally in a molar ratio of HspB4:HspB5 of 3:1. Only trace amounts of HspB4 are found in tissues other than the lens. HspB5 does not belong to this group. Mutations inHspB4 have been associated with Autosomal Dominant Congenital Cataract (ADCC). The chaperone-like functions of HspB4 are considered important for maintaining lens transparency and preventing cataract.
Probab=41.11 E-value=25 Score=24.31 Aligned_cols=22 Identities=18% Similarity=0.292 Sum_probs=18.7
Q ss_pred ccCCe---eEEEeCCCCEEEEEeec
Q 048021 34 KIKGA---SYTIDAEEGMAYITGRA 55 (143)
Q Consensus 34 kI~GV---sV~VDle~~~VtV~G~v 55 (143)
.++|+ +++|+++++.++|+|+.
T Consensus 17 dlpG~~~edi~V~v~~~~L~I~g~~ 41 (86)
T cd06497 17 DVKHFSPEDLTVKVLDDYVEIHGKH 41 (86)
T ss_pred ECCCCCHHHeEEEEECCEEEEEEEE
Confidence 46888 39999999999999974
No 131
>PF01849 NAC: NAC domain; InterPro: IPR002715 Nascent polypeptide-associated complex (NAC) is among the first ribosome-associated entities to bind the nascent polypeptide after peptide bond formation. The nascent polypeptide-associated complex (NAC) of yeast functions in the targeting process of ribosomes to the ER membrane []. NAC may prevent binding of ribosome nascent chains (RNCs) without a signal sequence to yeast membranes.; PDB: 3MCE_D 3MCB_A 3LKX_B 1TR8_B.
Probab=41.10 E-value=32 Score=22.07 Aligned_cols=28 Identities=29% Similarity=0.592 Sum_probs=16.8
Q ss_pred HHHHHHHhc-----cCCe-eEEEeCCCCEEEEEe
Q 048021 26 SSMTKVLRK-----IKGA-SYTIDAEEGMAYITG 53 (143)
Q Consensus 26 ~kIkkaL~k-----I~GV-sV~VDle~~~VtV~G 53 (143)
++++++|++ |+|| .|.+-..++++.+-.
T Consensus 2 kk~~~~l~klgl~~i~~i~eV~i~~~dg~~~~~~ 35 (58)
T PF01849_consen 2 KKLQKMLKKLGLKEIPGIEEVTIRKDDGTVFVFN 35 (58)
T ss_dssp ------GHHCT-EEETTEEEEEEEETTTEEEEEE
T ss_pred HHHHHHHHHcCCcccCCcEEEEEEECCceEEEEc
Confidence 455666554 5788 699999998887763
No 132
>PRK11597 heat shock chaperone IbpB; Provisional
Probab=40.95 E-value=22 Score=27.24 Aligned_cols=23 Identities=17% Similarity=0.221 Sum_probs=19.4
Q ss_pred hccCCe---eEEEeCCCCEEEEEeec
Q 048021 33 RKIKGA---SYTIDAEEGMAYITGRA 55 (143)
Q Consensus 33 ~kI~GV---sV~VDle~~~VtV~G~v 55 (143)
..++|| +++|+++++.++|+|+.
T Consensus 49 adlPGv~kedi~V~v~~~~LtI~ge~ 74 (142)
T PRK11597 49 LALAGFRQEDLDIQLEGTRLTVKGTP 74 (142)
T ss_pred EEeCCCCHHHeEEEEECCEEEEEEEE
Confidence 357898 39999999999999974
No 133
>PRK00741 prfC peptide chain release factor 3; Provisional
Probab=40.87 E-value=1.1e+02 Score=27.99 Aligned_cols=63 Identities=14% Similarity=0.066 Sum_probs=47.1
Q ss_pred EEEEEEeecChhhHHHHHHHHhcc--CCe-eEEEeCCCCEEEEE--eecCHHHHHHHHhc--CCceEEee
Q 048021 12 TCGLKVDTNDPAWFSSMTKVLRKI--KGA-SYTIDAEEGMAYIT--GRANPRKLLRKLQS--GKYANLCW 74 (143)
Q Consensus 12 tvvlkV~m~C~gC~~kIkkaL~kI--~GV-sV~VDle~~~VtV~--G~vdp~~Ll~aI~k--Gk~Ael~~ 74 (143)
.+-..|.-.-.+-.+++.++|.+| +|. .+..|.+.+...|. |.+--+.++++|++ |-++++-.
T Consensus 393 ~~~~~v~p~~~~d~~kl~~aL~~L~eED~l~~~~~~~t~e~il~g~G~lhleV~~~RL~~ey~v~v~~~~ 462 (526)
T PRK00741 393 ELFRRVRLKNPLKQKQLQKGLVQLSEEGAVQVFRPLDNNDLILGAVGQLQFEVVAHRLKNEYNVEAIYEP 462 (526)
T ss_pred cEEEEEEECCchhHHHHHHHHHHHhhcCCeEEEECCCCCCEEEEEEeHHHHHHHHHHHHHHhCCEEEEec
Confidence 344556666778889999999988 444 57778888888777 46778999999998 66666543
No 134
>PF00873 ACR_tran: AcrB/AcrD/AcrF family; InterPro: IPR001036 The Escherichia coli acrA and acrB genes encode a multi-drug efflux system that is believed to protect the bacterium against hydrophobic inhibitors []. The E. coli AcrB protein is a transporter that is energized by proton-motive force and that shows the widest substrate specificity among all known multidrug pumps, ranging from most of the currently used antibiotics, disinfectants, dyes, and detergents to simple solvents. The structure of ligand-free AcrB shows that it is a homotrimer of 110kDa per subunit. Each subunit contains 12 transmembrane helices and two large periplasmic domains (each exceeding 300 residues) between helices 1 and 2, and helices 7 and 8. X-ray analysis of the overexpressed AcrB protein demonstrated that the three periplasmic domains form, in the centre, a funnel-like structure and a connected narrow (or closed) pore. The pore is opened to the periplasm through three vestibules located at subunit interfaces. These vestibules were proposed to allow direct access of drugs from the periplasm as well as the outer leaflet of the cytoplasmic membrane. The three transmembrane domains of AcrB protomers form a large, 30A-wide central cavity that spans the cytoplasmic membrane and extends to the cytoplasm X-ray crystallographic structures of the trimeric AcrB pump from E. coli with four structurally diverse ligands demonstrated that three molecules of ligand bind simultaneously to the extremely large central cavity of 5000 cubic angstroms, primarily by hydrophobic, aromatic stacking and van der Waals interactions. Each ligand uses a slightly different subset of AcrB residues for binding. The bound ligand molecules often interact with each other, stabilising the binding. ; GO: 0005215 transporter activity, 0006810 transport, 0016020 membrane; PDB: 2V50_B 1T9U_A 2HRT_B 3NOC_A 3NOG_A 4DX7_A 1OYD_A 3AOB_A 1T9V_A 4DX6_B ....
Probab=40.40 E-value=57 Score=31.81 Aligned_cols=54 Identities=19% Similarity=0.198 Sum_probs=40.9
Q ss_pred cChhhHHHHHHHHhccCCe-eEEEeCCCCEEEEEee----------cCHHHHHHHHhc---CCceEEe
Q 048021 20 NDPAWFSSMTKVLRKIKGA-SYTIDAEEGMAYITGR----------ANPRKLLRKLQS---GKYANLC 73 (143)
Q Consensus 20 ~C~gC~~kIkkaL~kI~GV-sV~VDle~~~VtV~G~----------vdp~~Ll~aI~k---Gk~Ael~ 73 (143)
....-+++|++.|++++|+ +|..|.+.++-.+.=. ++++.|.+.|+. |..+.-+
T Consensus 684 ~L~~~a~~v~~~l~~~pgv~dv~~~~~~~~~el~i~~dreka~~~Gls~~~va~~l~~a~~G~~v~~~ 751 (1021)
T PF00873_consen 684 ELRKAAEKVKAKLAEIPGVTDVRDDWEDGQPELRIDPDREKAARLGLSPADVARTLRTAFSGSVVGTF 751 (1021)
T ss_dssp HHHHHHHHHHHHHHHSTTEEEEEESSSSBEEEEEEEE-HHHHHHTTB-HHHHHHHHHHHHS-EEEEEE
T ss_pred HHHHHHHHHHHHHHhCCCcccccccccccCcceEEEecHHHHHHcCCCHHHHHHHHHHHhccccccee
Confidence 4678899999999999999 8999999887777643 456778888876 7655433
No 135
>cd06471 ACD_LpsHSP_like Group of bacterial proteins containing an alpha crystallin domain (ACD) similar to Lactobacillus plantarum (Lp) small heat shock proteins (sHsp) HSP 18.5, HSP 18.55 and HSP 19.3. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. Transcription of the genes encoding Lp HSP 18.5, 18.55 and 19.3 is regulated by a variety of stresses including heat, cold and ethanol. Early growing L. plantarum cells contain elevated levels of these mRNAs which rapidly fall of as the cells enter stationary phase. Also belonging to this group is Bifidobacterium breve (Bb) HSP20 and Oenococcus oenis (syn. Leuconostoc oenos) (Oo) HSP18. Transcription of the gene encoding BbHSP20 is strongly induced following heat or osmotic shock, and that of the gene encoding OoHSP18 following heat, ethanol or acid shock. OoHSP18 is peripherally associated with the cytoplasmic me
Probab=40.26 E-value=26 Score=23.85 Aligned_cols=23 Identities=17% Similarity=0.431 Sum_probs=19.1
Q ss_pred ccCCe---eEEEeCCCCEEEEEeecC
Q 048021 34 KIKGA---SYTIDAEEGMAYITGRAN 56 (143)
Q Consensus 34 kI~GV---sV~VDle~~~VtV~G~vd 56 (143)
.|+|| +++|+.+++.++|+|...
T Consensus 17 ~lPGv~~edi~v~~~~~~L~I~g~~~ 42 (93)
T cd06471 17 DLPGFKKEDIKLDYKDGYLTISAKRD 42 (93)
T ss_pred ECCCCCHHHeEEEEECCEEEEEEEEc
Confidence 57888 299999999999999754
No 136
>cd06476 ACD_HspB2_like Alpha crystallin domain (ACD) found in mammalian small heat shock protein (sHsp) HspB2/heat shock 27kDa protein 2 and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. HspB2 is preferentially and constitutively expressed in skeletal muscle and heart. HspB2 shows homooligomeric activity and forms aggregates in muscle cytosol. Although its expression is not induced by heat shock, it redistributes to the insoluble fraction in response to heat shock. In the mouse heart, HspB2 plays a role in maintaining energetic balance, by protecting cardiac energetics during ischemia/reperfusion, and allowing for increased work during acute inotropic challenge. hHspB2 [previously also known as myotonic dystrophy protein kinase (DMPK) binding protein (MKBP)] is selectively up-regulated in skeletal muscles from myotonic dystrophy patients.
Probab=40.25 E-value=25 Score=24.25 Aligned_cols=23 Identities=9% Similarity=0.157 Sum_probs=19.2
Q ss_pred ccCCe---eEEEeCCCCEEEEEeecC
Q 048021 34 KIKGA---SYTIDAEEGMAYITGRAN 56 (143)
Q Consensus 34 kI~GV---sV~VDle~~~VtV~G~vd 56 (143)
.++|+ +++|++++++++|+|+..
T Consensus 14 dlpG~~~edi~V~v~~~~L~I~g~~~ 39 (83)
T cd06476 14 DVCHFTPDEITVRTVDNLLEVSARHP 39 (83)
T ss_pred EcCCCCHHHeEEEEECCEEEEEEEEc
Confidence 46888 399999999999999753
No 137
>TIGR02024 FtcD glutamate formiminotransferase. This model covers enzymes from metazoa as well as gram-positive bacteria and archaea. In humans, deficiency of this enzyme results in a disease phenotype. The crystal structure of the enzyme has been studied in the context of the catalytic mechanism.
Probab=40.16 E-value=31 Score=29.90 Aligned_cols=65 Identities=14% Similarity=0.175 Sum_probs=46.1
Q ss_pred EEEEEE-eecChhhHHHHHHHHhccCCe---eEEEeCCCCEEEEEeecCHHHHHHHHhc--CCceEEeecC
Q 048021 12 TCGLKV-DTNDPAWFSSMTKVLRKIKGA---SYTIDAEEGMAYITGRANPRKLLRKLQS--GKYANLCWVS 76 (143)
Q Consensus 12 tvvlkV-~m~C~gC~~kIkkaL~kI~GV---sV~VDle~~~VtV~G~vdp~~Ll~aI~k--Gk~Ael~~~~ 76 (143)
-|+..+ ++.-..=..+|.+++++.+|| ++.-|.+-++..|+=..+|+.|.+++-+ ++.++++-..
T Consensus 6 eCvpnfSEGR~~~vie~I~~a~~~~~gv~lld~~~D~d~NRsv~Tlvg~pe~v~eaa~~~~~~A~elIDm~ 76 (298)
T TIGR02024 6 ECVPNFSEGRNKEVIEKIVDAIIKTDNVKLLDVDMDPDHNRSVITFVGEPECVVNAALKLAKKAAELIDMR 76 (298)
T ss_pred EEeeeeccCCCHHHHHHHHHHHhcCCCcEEEeCCCCCCCCceEEEEecChHHHHHHHHHHHHHHHHhcCcc
Confidence 345555 345666778889999999998 3555666677777755668888888877 6666777664
No 138
>COG0694 Thioredoxin-like proteins and domains [Posttranslational modification, protein turnover, chaperones]
Probab=40.09 E-value=58 Score=23.53 Aligned_cols=33 Identities=9% Similarity=0.067 Sum_probs=21.2
Q ss_pred EEEEEEeecChhhH-------HHHHHHHhc-cCCe-eEEEeC
Q 048021 12 TCGLKVDTNDPAWF-------SSMTKVLRK-IKGA-SYTIDA 44 (143)
Q Consensus 12 tvvlkV~m~C~gC~-------~kIkkaL~k-I~GV-sV~VDl 44 (143)
++.++.-+.|.||- ..|++.|+. +++| .|....
T Consensus 44 ~V~l~l~GaC~gC~sS~~TLk~gIE~~L~~~i~ev~~V~~v~ 85 (93)
T COG0694 44 VVYLRLGGACSGCPSSTVTLKNGIERQLKEEIPEVKEVEQVT 85 (93)
T ss_pred eEEEEeCCcCCCCcccHHHHHHHHHHHHHHhCCccceEEEcc
Confidence 45566667999995 347777764 5666 554433
No 139
>PRK12740 elongation factor G; Reviewed
Probab=39.89 E-value=88 Score=28.99 Aligned_cols=59 Identities=19% Similarity=0.276 Sum_probs=41.4
Q ss_pred EEEEEeecChhhHHHHHHHHhccC----CeeEEEeCCCCEEEEEe--ecCHHHHHHHHhc--CCceE
Q 048021 13 CGLKVDTNDPAWFSSMTKVLRKIK----GASYTIDAEEGMAYITG--RANPRKLLRKLQS--GKYAN 71 (143)
Q Consensus 13 vvlkV~m~C~gC~~kIkkaL~kI~----GVsV~VDle~~~VtV~G--~vdp~~Ll~aI~k--Gk~Ae 71 (143)
+.+.|.-.-.+-..++.++|++|. .+.|..|.+.+...|.| ++--+.++++|++ |.++.
T Consensus 388 ~~~~i~p~~~~d~~~L~~aL~~l~~~Dpsl~v~~~~~~ge~~l~g~GelhLei~~~~L~~~~~~~v~ 454 (668)
T PRK12740 388 ISLAIEPKDKGDEEKLSEALGKLAEEDPTLRVERDEETGQTILSGMGELHLDVALERLKREYGVEVE 454 (668)
T ss_pred eEEEEEECCcchHHHHHHHHHHHHHhCCeEEEEECCCCCCEEEEEecHHHHHHHHHHHHHHhCceeE
Confidence 333444344566888889888873 33577788888888885 5678899999988 54444
No 140
>PRK12739 elongation factor G; Reviewed
Probab=39.70 E-value=98 Score=29.08 Aligned_cols=57 Identities=23% Similarity=0.266 Sum_probs=40.7
Q ss_pred EEEeecChhhHHHHHHHHhcc----CCeeEEEeCCCCEEEEEe--ecCHHHHHHHHhc--CCceE
Q 048021 15 LKVDTNDPAWFSSMTKVLRKI----KGASYTIDAEEGMAYITG--RANPRKLLRKLQS--GKYAN 71 (143)
Q Consensus 15 lkV~m~C~gC~~kIkkaL~kI----~GVsV~VDle~~~VtV~G--~vdp~~Ll~aI~k--Gk~Ae 71 (143)
+.|.-.-.....++.++|++| +++.|.+|.+.++..|.| ++--+.+++.|++ |-+++
T Consensus 408 ~aiep~~~~d~~kL~~aL~~L~~eDpsl~v~~~~etge~il~g~GelHLei~~~rL~~~f~vev~ 472 (691)
T PRK12739 408 LAVEPKTKADQDKMGLALQKLAEEDPTFRVETDEETGQTIISGMGELHLDIIVDRMKREFKVEAN 472 (691)
T ss_pred EEEEECCcccHHHHHHHHHHHHHhCCeEEEEEcCCCCCEEEEEecHHHHHHHHHHHHHHhCCeeE
Confidence 344444556777888888776 334577788899999986 5678889999988 65444
No 141
>cd06478 ACD_HspB4-5-6 Alpha-crystallin domain found in alphaA-crystallin (HspB4), alphaB-crystallin (HspB5), and the small heat shock protein (sHsp) HspB6, also known as Hsp20. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. Alpha crystallin, an abundant protein in the mammalian lens, is a large (700 kDa) heteropolymer composed of HspB4 and HspB5, generally in a molar ratio of HspB4:HspB5 of 3:1. Only trace amounts of HspB4 are found in tissues other than the lens. HspB5 on the other hand is also expressed constitutively in other tissues including brain, heart, and type I and type IIa skeletal muscle fibers, and in several cancers including gliomas, renal cell carcinomas, basal-like and metaplastic breast carcinomas, and head and neck cancer. HspB5's functions include effects on the apoptotic pathway and on metastasis. Phosphorylation of HspB5 reduces its ol
Probab=39.40 E-value=29 Score=23.70 Aligned_cols=22 Identities=14% Similarity=0.247 Sum_probs=18.3
Q ss_pred ccCCe---eEEEeCCCCEEEEEeec
Q 048021 34 KIKGA---SYTIDAEEGMAYITGRA 55 (143)
Q Consensus 34 kI~GV---sV~VDle~~~VtV~G~v 55 (143)
.++|+ +++|+++++.++|+|..
T Consensus 14 dlpG~~~edI~V~v~~~~L~I~g~~ 38 (83)
T cd06478 14 DVKHFSPEELSVKVLGDFVEIHGKH 38 (83)
T ss_pred ECCCCCHHHeEEEEECCEEEEEEEE
Confidence 46888 38999999999999964
No 142
>PF01193 RNA_pol_L: RNA polymerase Rpb3/Rpb11 dimerisation domain; InterPro: IPR011261 DNA-directed RNA polymerases 2.7.7.6 from EC (also known as DNA-dependent RNA polymerases) are responsible for the polymerisation of ribonucleotides into a sequence complementary to the template DNA. In eukaryotes, there are three different forms of DNA-directed RNA polymerases transcribing different sets of genes. Most RNA polymerases are multimeric enzymes and are composed of a variable number of subunits. The core RNA polymerase complex consists of five subunits (two alpha, one beta, one beta-prime and one omega) and is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme []. The core RNA polymerase complex forms a "crab claw"-like structure with an internal channel running along the full length []. The key functional sites of the enzyme, as defined by mutational and cross-linking analysis, are located on the inner wall of this channel. RNA synthesis follows after the attachment of RNA polymerase to a specific site, the promoter, on the template DNA strand. The RNA synthesis process continues until a termination sequence is reached. The RNA product, which is synthesised in the 5' to 3'direction, is known as the primary transcript. Eukaryotic nuclei contain three distinct types of RNA polymerases that differ in the RNA they synthesise: RNA polymerase I: located in the nucleoli, synthesises precursors of most ribosomal RNAs. RNA polymerase II: occurs in the nucleoplasm, synthesises mRNA precursors. RNA polymerase III: also occurs in the nucleoplasm, synthesises the precursors of 5S ribosomal RNA, the tRNAs, and a variety of other small nuclear and cytosolic RNAs. Eukaryotic cells are also known to contain separate mitochondrial and chloroplast RNA polymerases. Eukaryotic RNA polymerases, whose molecular masses vary in size from 500 to 700 kDa, contain two non-identical large (>100 kDa) subunits and an array of up to 12 different small (less than 50 kDa) subunits. RNA polymerase (RNAP) II, which is responsible for all mRNA synthesis in eukaryotes, consists of 12 subunits. Subunits Rpb3 and Rpb11 form a heterodimer that is functionally analogous to the archaeal RNAP D/L heterodimer, and to the prokaryotic RNAP alpha subunit (RpoA) homodimer. In each case, they play a key role in RNAP assembly by forming a platform on which the catalytic subunits (eukaryotic Rpb1/Rpb2, and prokaryotic beta/beta') can interact []. These different subunits share regions of homology required for dimerisation. In eukaryotic Rpb11 and archaeal L subunits, the dimerisation domain consists of a contiguous Rpb11-like domain, whereas in eukaryotic Rpb3, archaeal D and bacterial RpoA subunits (IPR011263 from INTERPRO), the dimerisation domain consists of the Rpb11-like domain interrupted by an insert domain. In the prokaryotic alpha subunit, this dimerisation domain is the N-terminal domain [].; GO: 0003899 DNA-directed RNA polymerase activity, 0046983 protein dimerization activity, 0006351 transcription, DNA-dependent; PDB: 1HQM_B 1YNJ_A 1YNN_A 1I6V_A 2GHO_A 3HKZ_V 2PMZ_X 2PA8_L 3GTK_C 1TWH_C ....
Probab=39.40 E-value=60 Score=21.08 Aligned_cols=44 Identities=14% Similarity=0.252 Sum_probs=24.9
Q ss_pred hhhHHHHHHHHhcc-CCeeEEEeCCCCEEEEE----eecCHHHHH-HHHh
Q 048021 22 PAWFSSMTKVLRKI-KGASYTIDAEEGMAYIT----GRANPRKLL-RKLQ 65 (143)
Q Consensus 22 ~gC~~kIkkaL~kI-~GVsV~VDle~~~VtV~----G~vdp~~Ll-~aI~ 65 (143)
--=.+.++++|.+- +|+.+.-....+++.++ |..+|.+++ +|++
T Consensus 10 ~tl~N~LRr~ll~~vp~~ai~~~~~~~~~~~~IeT~g~~~p~~~l~~A~~ 59 (66)
T PF01193_consen 10 HTLGNALRRILLSEVPGVAIDGHPNEDKFVFRIETDGSLTPKEALLKAIK 59 (66)
T ss_dssp HHHHHHHHHHHHSSSEEEEEEESSEEEEEEEEEEEBSSS-HHHHHHHHHH
T ss_pred hHHHHHHHHHHHhcCCCceEEecCCCCEEEEEEEECCCCCHHHHHHHHHH
Confidence 33467888888764 77755554445555554 556665554 4443
No 143
>PHA03075 glutaredoxin-like protein; Provisional
Probab=39.36 E-value=90 Score=23.85 Aligned_cols=55 Identities=11% Similarity=0.159 Sum_probs=41.1
Q ss_pred cChhhHHHHHHHHhccCCe-e------EEEeCCCCEEEEEeecCHHHHHHHHhc--CCceEEeec
Q 048021 20 NDPAWFSSMTKVLRKIKGA-S------YTIDAEEGMAYITGRANPRKLLRKLQS--GKYANLCWV 75 (143)
Q Consensus 20 ~C~gC~~kIkkaL~kI~GV-s------V~VDle~~~VtV~G~vdp~~Ll~aI~k--Gk~Ael~~~ 75 (143)
-|+=|+ .+..+|++|++- + ..+=.++|.|.+-|..-.-.++..+-+ |....++.-
T Consensus 12 ~C~vCe-~~s~~l~~ledeY~ilrVNIlSfFsK~g~v~~lg~d~~y~lInn~~~~lgne~v~lfK 75 (123)
T PHA03075 12 LCSVCE-SISEALKELEDEYDILRVNILSFFSKDGQVKVLGMDKGYTLINNFFKHLGNEYVSLFK 75 (123)
T ss_pred ccHHHH-HHHHHHHHhhccccEEEEEeeeeeccCCceEEEecccceehHHHHHHhhcccEEEEEE
Confidence 599998 778888888874 3 334558999999997777788888877 766554443
No 144
>TIGR00490 aEF-2 translation elongation factor aEF-2. This model represents archaeal elongation factor 2, a protein more similar to eukaryotic EF-2 than to bacterial EF-G, both in sequence similarity and in sharing with eukaryotes the property of having a diphthamide (modified His) residue at a conserved position. The diphthamide can be ADP-ribosylated by diphtheria toxin in the presence of NAD.
Probab=39.14 E-value=1.8e+02 Score=27.52 Aligned_cols=62 Identities=16% Similarity=0.312 Sum_probs=45.4
Q ss_pred eEEEEEEeecChhhHHHHHHHHhccC----CeeEEEeCCCCEEEEEe--ecCHHHHHHHHhc--CCceEE
Q 048021 11 MTCGLKVDTNDPAWFSSMTKVLRKIK----GASYTIDAEEGMAYITG--RANPRKLLRKLQS--GKYANL 72 (143)
Q Consensus 11 ~tvvlkV~m~C~gC~~kIkkaL~kI~----GVsV~VDle~~~VtV~G--~vdp~~Ll~aI~k--Gk~Ael 72 (143)
..+.+.|.-.-.+=..++.++|.+|. .+.|..|.+.+...|.| ++--+.++++|++ |-.+++
T Consensus 389 Pv~~~~i~p~~~~d~~kL~~aL~~L~~eDPsl~v~~d~etge~il~g~GElHLei~~~rL~~~~~vev~~ 458 (720)
T TIGR00490 389 PVVTVAIEAKNTKDLPKLIEVLRQVAKEDPTVHVEINEETGEHLISGMGELHLEIIVEKIREDYGLDVET 458 (720)
T ss_pred ceEEEEEEECCHHHHHHHHHHHHHHHhhCCeEEEEECCCCCCeEEEEccceeHHHHHHHHHHHhCCceee
Confidence 44555565566677889999998873 33577788888888885 6788999999988 555543
No 145
>cd00105 KH-I K homology RNA-binding domain, type I. KH binds single-stranded RNA or DNA. It is found in a wide variety of proteins including ribosomal proteins, transcription factors and post-transcriptional modifiers of mRNA. There are two different KH domains that belong to different protein folds, but they share a single KH motif. The KH motif is folded into a beta alpha alpha beta unit. In addition to the core, type II KH domains (e.g. ribosomal protein S3) include N-terminal extension and type I KH domains (e.g. hnRNP K) contain C-terminal extension.
Probab=39.06 E-value=78 Score=19.39 Aligned_cols=35 Identities=29% Similarity=0.413 Sum_probs=22.5
Q ss_pred HHHHhccCCeeEEEeCC-----CCEEEEEeecCHHHHHHHHh
Q 048021 29 TKVLRKIKGASYTIDAE-----EGMAYITGRANPRKLLRKLQ 65 (143)
Q Consensus 29 kkaL~kI~GVsV~VDle-----~~~VtV~G~vdp~~Ll~aI~ 65 (143)
-+.|+..-|+.+.|+.. ...|+|.|. ++.+.+|+.
T Consensus 22 i~~I~~~s~~~I~i~~~~~~~~~~~v~i~G~--~~~v~~a~~ 61 (64)
T cd00105 22 IKEIREETGAKIKIPDSGSGSEERIVTITGT--PEAVEKAKE 61 (64)
T ss_pred HHHHHHHHCCEEEEcCCCCCCCceEEEEEcC--HHHHHHHHH
Confidence 34455555887777764 577888887 555655543
No 146
>cd06481 ACD_HspB9_like Alpha crystallin domain (ACD) found in mammalian small heat shock protein (sHsp) HspB9 and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. Human (h) HspB9 is expressed exclusively in the normal testis and in various tumor samples and is a cancer/testis antigen. hHspB9 interacts with TCTEL1 (T-complex testis expressed protein -1), a subunit of dynein. hHspB9 and TCTEL1 are co-expressed in similar cells within the testis and in tumor cells. Included in this group is Xenopus Hsp30, a developmentally-regulated heat-inducible molecular chaperone.
Probab=38.89 E-value=27 Score=24.15 Aligned_cols=22 Identities=14% Similarity=0.410 Sum_probs=18.7
Q ss_pred ccCCe---eEEEeCCCCEEEEEeec
Q 048021 34 KIKGA---SYTIDAEEGMAYITGRA 55 (143)
Q Consensus 34 kI~GV---sV~VDle~~~VtV~G~v 55 (143)
.++|+ +++|.+++++++|+|+.
T Consensus 14 dlpG~~~edI~V~v~~~~L~I~g~~ 38 (87)
T cd06481 14 DVRGFSPEDLSVRVDGRKLVVTGKR 38 (87)
T ss_pred ECCCCChHHeEEEEECCEEEEEEEE
Confidence 46888 39999999999999964
No 147
>KOG4730 consensus D-arabinono-1, 4-lactone oxidase [Defense mechanisms]
Probab=38.59 E-value=28 Score=32.27 Aligned_cols=35 Identities=17% Similarity=0.130 Sum_probs=30.6
Q ss_pred EEEeCCCCEEEEEeecCHHHHHHHHhc-CCceEEee
Q 048021 40 YTIDAEEGMAYITGRANPRKLLRKLQS-GKYANLCW 74 (143)
Q Consensus 40 V~VDle~~~VtV~G~vdp~~Ll~aI~k-Gk~Ael~~ 74 (143)
|++|.+..+|||+.-+.-.+|++.|++ |+....++
T Consensus 105 v~~dpe~~tvTV~aGirlrQLie~~~~~GlsL~~~~ 140 (518)
T KOG4730|consen 105 VEFDPELKTVTVQAGIRLRQLIEELAKLGLSLPNAP 140 (518)
T ss_pred eeeCchhceEEeccCcCHHHHHHHHHhcCccccCCC
Confidence 788999999999999999999999999 99765433
No 148
>cd06479 ACD_HspB7_like Alpha crystallin domain (ACD) found in mammalian small heat shock protein (sHsp) HspB7, also known as cardiovascular small heat shock protein (cvHsp), and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. HspB7 is a 25-kDa protein, preferentially expressed in heart and skeletal muscle. It binds the cytoskeleton protein alpha-filamin (also known as actin-binding protein 280). The expression of HspB7 is increased during rat muscle aging. Its expression is also modulated in obesity implicating this protein in this and related metabolic disorders. As the human gene encoding HspB7 is mapped to chromosome 1p36.23-p34.3 it is a positional candidate for several dystrophies and myopathies.
Probab=38.53 E-value=28 Score=24.00 Aligned_cols=22 Identities=0% Similarity=0.074 Sum_probs=18.7
Q ss_pred ccCCe---eEEEeCCCCEEEEEeec
Q 048021 34 KIKGA---SYTIDAEEGMAYITGRA 55 (143)
Q Consensus 34 kI~GV---sV~VDle~~~VtV~G~v 55 (143)
.++|+ +++|.+++++++|+|+.
T Consensus 15 dlpG~~pedi~V~v~~~~L~I~ger 39 (81)
T cd06479 15 DVSDFSPEDIIVTTSNNQIEVHAEK 39 (81)
T ss_pred ECCCCCHHHeEEEEECCEEEEEEEE
Confidence 46888 39999999999999964
No 149
>PF09941 DUF2173: Uncharacterized conserved protein (DUF2173); InterPro: IPR018685 This family of various hypothetical prokaryotic proteins has no known function.
Probab=38.48 E-value=62 Score=24.02 Aligned_cols=32 Identities=13% Similarity=0.151 Sum_probs=25.7
Q ss_pred HHHhccCCe-e-EEEeCCCCEEEEEeecCHHHHH
Q 048021 30 KVLRKIKGA-S-YTIDAEEGMAYITGRANPRKLL 61 (143)
Q Consensus 30 kaL~kI~GV-s-V~VDle~~~VtV~G~vdp~~Ll 61 (143)
+.|.+|||| - .++..+.+.+...|+++++.+.
T Consensus 4 ~~Lm~lpGv~AAg~Fs~~G~l~e~~G~l~~~~a~ 37 (108)
T PF09941_consen 4 DKLMKLPGVVAAGEFSDDGKLVEYKGELDEEMAE 37 (108)
T ss_pred HHhhcCCCeEEEEEECCCCeEEeeecCCCHHHHH
Confidence 347899999 4 8889999999999988766543
No 150
>TIGR03675 arCOG00543 arCOG00543 universal archaeal KH-domain/beta-lactamase-domain protein. This family of proteins is universal in the archaea and consistsof an N-terminal type-1 KH-domain (pfam00013) a central beta-lactamase-domain (pfam00753) with a C-terminal motif associated with RNA metabolism (pfam07521). KH-domains are associated with RNA-binding, so taken together, this protein is a likely metal-dependent RNAase. This family was defined in as arCOG01782.
Probab=38.44 E-value=78 Score=29.78 Aligned_cols=71 Identities=11% Similarity=0.085 Sum_probs=47.6
Q ss_pred EEEEEEe----ecChhhHHHHHHHHhccCCe-eEEEeCCCCEEEEEee----c---CHHHHHHHHhc-CCceEEeecCCC
Q 048021 12 TCGLKVD----TNDPAWFSSMTKVLRKIKGA-SYTIDAEEGMAYITGR----A---NPRKLLRKLQS-GKYANLCWVSAG 78 (143)
Q Consensus 12 tvvlkV~----m~C~gC~~kIkkaL~kI~GV-sV~VDle~~~VtV~G~----v---dp~~Ll~aI~k-Gk~Ael~~~~~~ 78 (143)
.++++.+ +.=+.-.+.|++.+-+-.|| ++.+|.+.|.|.|+.+ + .-..+.+-.++ |...+++..++-
T Consensus 55 r~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~f~~~~~~v~i~~~~p~~~~~~~~~~~~~i~~~~~w~~~~~~~~~~ 134 (630)
T TIGR03675 55 RIVIRPDPSVLLPPEEAIEKIKEIVPEEAGITDIYFDDVTGEVIIEAEKPGLVIGKGGSTLREITAETGWTPKVVRTPPI 134 (630)
T ss_pred eEEEecChhhcCCHHHHHHHHHHhCCCcCCceeEEecCCCceEEEEEcCCeEEEecCcchHHHHHHHhCCeeeEEecCCC
Confidence 3555544 22344455666666554588 7999999999999853 2 34445555556 999999998886
Q ss_pred CCCc
Q 048021 79 NQIT 82 (143)
Q Consensus 79 ~s~~ 82 (143)
.|.+
T Consensus 135 ~~~~ 138 (630)
T TIGR03675 135 ESKT 138 (630)
T ss_pred CcHH
Confidence 6655
No 151
>cd06470 ACD_IbpA-B_like Alpha-crystallin domain (ACD) found in Escherichia coli inclusion body-associated proteins IbpA and IbpB, and similar proteins. IbpA and IbpB are 16 kDa small heat shock proteins (sHsps). sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. IbpA and IbpB are produced during high-level production of various heterologous proteins, specifically human prorenin, renin and bovine insulin-like growth factor 2 (bIGF-2), and are strongly associated with inclusion bodies containing these heterologous proteins. IbpA and IbpB work as an integrated system to stabilize thermally aggregated proteins in a disaggregation competent state. The chaperone activity of IbpB is also significantly elevated as the temperature increases from normal to heat shock. The high temperature results in the disassociation of 2-3-MDa IbpB oligomers into smaller approximately 6
Probab=38.32 E-value=29 Score=23.87 Aligned_cols=24 Identities=21% Similarity=0.392 Sum_probs=19.5
Q ss_pred ccCCe---eEEEeCCCCEEEEEeecCH
Q 048021 34 KIKGA---SYTIDAEEGMAYITGRANP 57 (143)
Q Consensus 34 kI~GV---sV~VDle~~~VtV~G~vdp 57 (143)
.+||+ +++|..+++.++|+|....
T Consensus 18 ~lPG~~kedi~v~~~~~~L~I~g~~~~ 44 (90)
T cd06470 18 AVAGFSEDDLEIEVENNQLTVTGKKAD 44 (90)
T ss_pred ECCCCCHHHeEEEEECCEEEEEEEEcc
Confidence 57888 2999999999999997543
No 152
>PF08712 Nfu_N: Scaffold protein Nfu/NifU N terminal; InterPro: IPR014824 Iron-sulphur (FeS) clusters are important cofactors for numerous proteins involved in electron transfer, in redox and non-redox catalysis, in gene regulation, and as sensors of oxygen and iron. These functions depend on the various FeS cluster prosthetic groups, the most common being [2Fe-2S] and [4Fe-4S] []. FeS cluster assembly is a complex process involving the mobilisation of Fe and S atoms from storage sources, their assembly into [Fe-S] form, their transport to specific cellular locations, and their transfer to recipient apoproteins. So far, three FeS assembly machineries have been identified, which are capable of synthesising all types of [Fe-S] clusters: ISC (iron-sulphur cluster), SUF (sulphur assimilation), and NIF (nitrogen fixation) systems. The ISC system is conserved in eubacteria and eukaryotes (mitochondria), and has broad specificity, targeting general FeS proteins [, ]. It is encoded by the isc operon (iscRSUA-hscBA-fdx-iscX). IscS is a cysteine desulphurase, which obtains S from cysteine (converting it to alanine) and serves as a S donor for FeS cluster assembly. IscU and IscA act as scaffolds to accept S and Fe atoms, assembling clusters and transfering them to recipient apoproteins. HscA is a molecular chaperone and HscB is a co-chaperone. Fdx is a [2Fe-2S]-type ferredoxin. IscR is a transcription factor that regulates expression of the isc operon. IscX (also known as YfhJ) appears to interact with IscS and may function as an Fe donor during cluster assembly []. The SUF system is an alternative pathway to the ISC system that operates under iron starvation and oxidative stress. It is found in eubacteria, archaea and eukaryotes (plastids). The SUF system is encoded by the suf operon (sufABCDSE), and the six encoded proteins are arranged into two complexes (SufSE and SufBCD) and one protein (SufA). SufS is a pyridoxal-phosphate (PLP) protein displaying cysteine desulphurase activity. SufE acts as a scaffold protein that accepts S from SufS and donates it to SufA []. SufC is an ATPase with an unorthodox ATP-binding cassette (ABC)-like component. No specific functions have been assigned to SufB and SufD. SufA is homologous to IscA [], acting as a scaffold protein in which Fe and S atoms are assembled into [FeS] cluster forms, which can then easily be transferred to apoproteins targets. In the NIF system, NifS and NifU are required for the formation of metalloclusters of nitrogenase in Azotobacter vinelandii, and other organisms, as well as in the maturation of other FeS proteins. Nitrogenase catalyses the fixation of nitrogen. It contains a complex cluster, the FeMo cofactor, which contains molybdenum, Fe and S. NifS is a cysteine desulphurase. NifU binds one Fe atom at its N-terminal, assembling an FeS cluster that is transferred to nitrogenase apoproteins []. Nif proteins involved in the formation of FeS clusters can also be found in organisms that do not fix nitrogen []. This domain is found at the N terminus of NifU (from NIF system) and NifU related proteins, and in the human Nfu protein. Both of these proteins are thought to be involved in the assembly of iron-sulphur clusters, functioning as scaffolds [, ]. ; GO: 0005506 iron ion binding; PDB: 2FFM_A 1PQX_A 2K1H_A.
Probab=38.19 E-value=56 Score=22.79 Aligned_cols=39 Identities=23% Similarity=0.286 Sum_probs=27.8
Q ss_pred HHHHHHHhccCCe-eEEEeCCCCEEEEE--eecCHHHHHHHHhc
Q 048021 26 SSMTKVLRKIKGA-SYTIDAEEGMAYIT--GRANPRKLLRKLQS 66 (143)
Q Consensus 26 ~kIkkaL~kI~GV-sV~VDle~~~VtV~--G~vdp~~Ll~aI~k 66 (143)
.-+-++|-+|+|| +|-+..+ =|+|+ ..+|.+.|+..|..
T Consensus 37 spLA~~Lf~i~gV~~Vf~~~d--fItVtK~~~~~W~~l~~~I~~ 78 (87)
T PF08712_consen 37 SPLAQALFAIPGVKSVFIGDD--FITVTKNPDADWEDLKPEIRE 78 (87)
T ss_dssp -HHHHHHHTSTTEEEEEEETT--EEEEEE-TTS-HHHHHHHHHH
T ss_pred CHHHHHhcCCCCEeEEEEECC--EEEEeeCCCCCHHHHHHHHHH
Confidence 4566677799999 6777655 56665 56899999888865
No 153
>PRK05301 pyrroloquinoline quinone biosynthesis protein PqqE; Provisional
Probab=37.82 E-value=63 Score=27.53 Aligned_cols=23 Identities=4% Similarity=-0.071 Sum_probs=16.2
Q ss_pred CCCCCCceEEEEEE----eecChhhHH
Q 048021 4 SSSSSAKMTCGLKV----DTNDPAWFS 26 (143)
Q Consensus 4 ss~~s~~~tvvlkV----~m~C~gC~~ 26 (143)
||....+..+.|.| |+.|..|..
T Consensus 9 ~~~~~~P~~l~i~iT~~CNl~C~~C~~ 35 (378)
T PRK05301 9 APAVGPPLWLLAELTYRCPLQCPYCSN 35 (378)
T ss_pred CCCCCCCeEEEEEecCccCcCCCCCCC
Confidence 55666667777776 478999953
No 154
>PRK13010 purU formyltetrahydrofolate deformylase; Reviewed
Probab=37.80 E-value=1.7e+02 Score=24.75 Aligned_cols=55 Identities=9% Similarity=0.118 Sum_probs=35.2
Q ss_pred ChhhHHHHHHHHhcc---CCeeEEEeCC--CCEEEE--Eee-cCHHHHHHHHhc-CCceEEeec
Q 048021 21 DPAWFSSMTKVLRKI---KGASYTIDAE--EGMAYI--TGR-ANPRKLLRKLQS-GKYANLCWV 75 (143)
Q Consensus 21 C~gC~~kIkkaL~kI---~GVsV~VDle--~~~VtV--~G~-vdp~~Ll~aI~k-Gk~Ael~~~ 75 (143)
++-....++++|+.+ -|+++.+... ..++.| +|. -+-+.|+++++. ..++++..+
T Consensus 63 ~~~~~~~l~~~l~~l~~~l~l~~~i~~~~~~~kiavl~Sg~g~nl~al~~~~~~~~l~~~i~~v 126 (289)
T PRK13010 63 EAASVDTFRQEFQPVAEKFDMQWAIHPDGQRPKVVIMVSKFDHCLNDLLYRWRMGELDMDIVGI 126 (289)
T ss_pred CCCCHHHHHHHHHHHHHHhCCeEEEecCCCCeEEEEEEeCCCccHHHHHHHHHCCCCCcEEEEE
Confidence 444567788777665 3555555543 334444 344 479999999998 667776665
No 155
>PRK13351 elongation factor G; Reviewed
Probab=37.48 E-value=1.2e+02 Score=28.38 Aligned_cols=58 Identities=19% Similarity=0.242 Sum_probs=40.6
Q ss_pred EEEEeecChhhHHHHHHHHhcc----CCeeEEEeCCCCEEEEEe--ecCHHHHHHHHhc--CCceE
Q 048021 14 GLKVDTNDPAWFSSMTKVLRKI----KGASYTIDAEEGMAYITG--RANPRKLLRKLQS--GKYAN 71 (143)
Q Consensus 14 vlkV~m~C~gC~~kIkkaL~kI----~GVsV~VDle~~~VtV~G--~vdp~~Ll~aI~k--Gk~Ae 71 (143)
.+.|+-.-..=..++.++|++| +++.|..|.+.++..+.| ++--+.++++|++ |.+++
T Consensus 406 ~~~Iep~~~~d~~kL~~aL~~L~~eDpsl~v~~~~etge~ii~g~GelHLei~~~rL~~~~~vev~ 471 (687)
T PRK13351 406 SLAVEPERRGDEQKLAEALEKLVWEDPSLRVEEDEETGQTILSGMGELHLEVALERLRREFKLEVN 471 (687)
T ss_pred EEEEEECCcccHHHHHHHHHHHHHhCCeEEEEECCCCCCEEEEEecHHHHHHHHHHHHHHhCCceE
Confidence 3344434444567888888877 344577788899998886 5678899999988 65544
No 156
>TIGR00411 redox_disulf_1 small redox-active disulfide protein 1. This protein is homologous to a family of proteins that includes thioredoxins, glutaredoxins, protein-disulfide isomerases, and others, some of which have several such domains. The sequence of this protein at the redox-active disufide site, CPYC, matches glutaredoxins rather than thioredoxins, although its overall sequence seems closer to thioredoxins. It is suggested to be a ribonucleotide-reducing system component distinct from thioredoxin or glutaredoxin.
Probab=37.40 E-value=67 Score=20.36 Aligned_cols=8 Identities=13% Similarity=-0.035 Sum_probs=6.1
Q ss_pred ecChhhHH
Q 048021 19 TNDPAWFS 26 (143)
Q Consensus 19 m~C~gC~~ 26 (143)
-.|..|..
T Consensus 9 ~~C~~C~~ 16 (82)
T TIGR00411 9 PTCPYCPA 16 (82)
T ss_pred CCCcchHH
Confidence 47999983
No 157
>TIGR03028 EpsE polysaccharide export protein EpsE. Sequences in this family of proteins are members of a polysaccharide export protein family (pfam02563) which includes the wza protein from E.coli. This family of proteins are homologous to the EpsE protein of the methanolan biosynthesis operon of Methylobacillus species strain 12S. The distribution of this protein appears to be restricted to a subset of exopolysaccharide operons containing a syntenic grouping of genes including a variant of the EpsH exosortase protein. Exosortase has been proposed to be involved in the targetting and processing of proteins containing the PEP-CTERM domain to the exopolysaccharide layer.
Probab=37.36 E-value=71 Score=25.85 Aligned_cols=56 Identities=20% Similarity=0.220 Sum_probs=39.1
Q ss_pred EEEEe-ecChhhHHHHHHHHhc---cCCeeEEEeC---CCCEEEEEeecC---------HHHHHHHHhc--CCc
Q 048021 14 GLKVD-TNDPAWFSSMTKVLRK---IKGASYTIDA---EEGMAYITGRAN---------PRKLLRKLQS--GKY 69 (143)
Q Consensus 14 vlkV~-m~C~gC~~kIkkaL~k---I~GVsV~VDl---e~~~VtV~G~vd---------p~~Ll~aI~k--Gk~ 69 (143)
.++|. ++.+.=++.|++.|.+ +....|+|.+ ...+|+|.|++. ...|+++|.. |..
T Consensus 39 ~v~v~G~T~~e~~~~I~~~l~~~~~~~~p~V~V~v~~~~~~~V~V~GeV~~PG~~~l~~~~tl~~ai~~AGG~~ 112 (239)
T TIGR03028 39 EVKLGGETPAAAERKIASRLSKGGFVKQPQVTINVLQYRGQQVSVLGQVNRPGRYPLETAGRVSDVLALAGGVT 112 (239)
T ss_pred eEEECCCCHHHHHHHHHHHHhhcCcccCCEEEEEEEeccceEEEEEEEecCCceEEcCCCCcHHHHHHHcCCCC
Confidence 35564 5788888999999975 3332344443 367999999752 4789999988 555
No 158
>cd06498 ACD_alphaB-crystallin_HspB5 Alpha-crystallin domain found in the small heat shock protein (sHsp) alphaB-crystallin (HspB5, 20kDa). sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. Alpha crystallin, an abundant protein in the mammalian lens, is a large (700 kDa) heteropolymer composed of HspB4 and HspB5, generally in a molar ratio of HspB4:HspB5 of 3:1. HspB4 does not belong to this group. HspB5 shows increased synthesis in response to stress. HspB5 is also expressed constitutively in other tissues including brain, heart, and type I and type IIa skeletal muscle fibers, and in several cancers including gliomas, renal cell carcinomas, basal-like and metaplastic breast carcinomas, and head and neck cancer. Its functions include effects on the apoptotic pathway and on metastasis. Phosphorylation of HspB5 reduces its oligomerization and anti-apoptotic activ
Probab=37.33 E-value=31 Score=23.76 Aligned_cols=22 Identities=14% Similarity=0.212 Sum_probs=18.5
Q ss_pred ccCCe---eEEEeCCCCEEEEEeec
Q 048021 34 KIKGA---SYTIDAEEGMAYITGRA 55 (143)
Q Consensus 34 kI~GV---sV~VDle~~~VtV~G~v 55 (143)
.++|+ +++|+..++.++|+|+.
T Consensus 14 dlpG~~~edi~V~v~~~~L~I~g~~ 38 (84)
T cd06498 14 DVKHFSPEELKVKVLGDFIEIHGKH 38 (84)
T ss_pred ECCCCCHHHeEEEEECCEEEEEEEE
Confidence 46888 39999999999999974
No 159
>PF14424 Toxin-deaminase: The BURPS668_1122 family of deaminases
Probab=37.24 E-value=62 Score=24.46 Aligned_cols=32 Identities=6% Similarity=0.007 Sum_probs=25.3
Q ss_pred eEEEEEEee-cChhhHHHHHHHHhccCCeeEEE
Q 048021 11 MTCGLKVDT-NDPAWFSSMTKVLRKIKGASYTI 42 (143)
Q Consensus 11 ~tvvlkV~m-~C~gC~~kIkkaL~kI~GVsV~V 42 (143)
-++.|-.++ .|..|..-|+.-....+.+.|.|
T Consensus 97 G~i~l~te~~pC~SC~~vi~qF~~~~pni~~~v 129 (133)
T PF14424_consen 97 GTIDLFTELPPCESCSNVIEQFKKDFPNIKVNV 129 (133)
T ss_pred ceEEEEecCCcChhHHHHHHHHHHHCCCcEEEE
Confidence 456666676 79999999999999999885544
No 160
>PRK01641 leuD isopropylmalate isomerase small subunit; Provisional
Probab=37.05 E-value=57 Score=26.52 Aligned_cols=44 Identities=18% Similarity=0.095 Sum_probs=27.0
Q ss_pred HHHHHhccCCeeEEEeCCCCEEEEEee---cCHH-HHHHHHhcCCceE
Q 048021 28 MTKVLRKIKGASYTIDAEEGMAYITGR---ANPR-KLLRKLQSGKYAN 71 (143)
Q Consensus 28 IkkaL~kI~GVsV~VDle~~~VtV~G~---vdp~-~Ll~aI~kGk~Ae 71 (143)
+.+++.+-+|..++||+++++|++.|. .... -.++.|..|.+..
T Consensus 129 l~~~~~~~~g~~i~vDl~~~~v~~~~~~~~f~i~~~~~~~l~~G~D~i 176 (200)
T PRK01641 129 LFKLVEANPGAELTVDLEAQTVTAPDKTFPFEIDPFRRHCLLNGLDDI 176 (200)
T ss_pred HHHHHhcCCCCEEEEEcCCCEEecCCeEEEEEcCHHHHHHHHcCCCHH
Confidence 333343346778999999999998663 2333 3444444486643
No 161
>PF00578 AhpC-TSA: AhpC/TSA family; InterPro: IPR000866 Peroxiredoxins (Prxs) are a ubiquitous family of antioxidant enzymes that also control cytokine-induced peroxide levels which mediate signal transduction in mammalian cells. Prxs can be regulated by changes to phosphorylation, redox and possibly oligomerisation states. Prxs are divided into three classes: typical 2-Cys Prxs; atypical 2-Cys Prxs; and 1-Cys Prxs. All Prxs share the same basic catalytic mechanism, in which an active-site cysteine (the peroxidatic cysteine) is oxidised to a sulphenic acid by the peroxide substrate. The recycling of the sulphenic acid back to a thiol is what distinguishes the three enzyme classes. Using crystal structures, a detailed catalytic cycle has been derived for typical 2-Cys Prxs, including a model for the redox-regulated oligomeric state proposed to control enzyme activity []. Alkyl hydroperoxide reductase (AhpC) is responsible for directly reducing organic hyperoxides in its reduced dithiol form. Thiol specific antioxidant (TSA) is a physiologically important antioxidant which constitutes an enzymatic defence against sulphur-containing radicals. This family contains AhpC and TSA, as well as related proteins.; GO: 0016209 antioxidant activity, 0016491 oxidoreductase activity, 0055114 oxidation-reduction process; PDB: 1QMV_A 1PRX_B 3HJP_C 3HA9_A 2V41_G 2V32_C 2V2G_C 3LWA_A 3IA1_B 1ZYE_G ....
Probab=36.98 E-value=1.1e+02 Score=20.79 Aligned_cols=55 Identities=15% Similarity=0.077 Sum_probs=31.1
Q ss_pred EEEEEEee-cChhhHHHHHH---HHhccC--CeeEEEeCCCCEEEEEeecCHHHHHHHHhc-CCceEEeec
Q 048021 12 TCGLKVDT-NDPAWFSSMTK---VLRKIK--GASYTIDAEEGMAYITGRANPRKLLRKLQS-GKYANLCWV 75 (143)
Q Consensus 12 tvvlkV~m-~C~gC~~kIkk---aL~kI~--GVsV~VDle~~~VtV~G~vdp~~Ll~aI~k-Gk~Ael~~~ 75 (143)
++++-... .|..|...+.. .+.+++ |+ .|..-..-+++.+.+.+++ +..-.++.-
T Consensus 28 ~vl~f~~~~~c~~c~~~l~~l~~~~~~~~~~~~---------~vi~is~d~~~~~~~~~~~~~~~~~~~~D 89 (124)
T PF00578_consen 28 VVLFFWPTAWCPFCQAELPELNELYKKYKDKGV---------QVIGISTDDPEEIKQFLEEYGLPFPVLSD 89 (124)
T ss_dssp EEEEEESTTTSHHHHHHHHHHHHHHHHHHTTTE---------EEEEEESSSHHHHHHHHHHHTCSSEEEEE
T ss_pred EEEEEeCccCccccccchhHHHHHhhhhccceE---------EeeecccccccchhhhhhhhccccccccC
Confidence 33333456 89999977644 333333 34 1111122355578888887 877666654
No 162
>COG0071 IbpA Molecular chaperone (small heat shock protein) [Posttranslational modification, protein turnover, chaperones]
Probab=36.93 E-value=29 Score=25.95 Aligned_cols=25 Identities=20% Similarity=0.314 Sum_probs=20.8
Q ss_pred hccCCe---eEEEeCCCCEEEEEeecCH
Q 048021 33 RKIKGA---SYTIDAEEGMAYITGRANP 57 (143)
Q Consensus 33 ~kI~GV---sV~VDle~~~VtV~G~vdp 57 (143)
..|+|+ +++|.+..+.++|+|+...
T Consensus 56 ~elPG~~kedI~I~~~~~~l~I~g~~~~ 83 (146)
T COG0071 56 AELPGVDKEDIEITVEGNTLTIRGEREE 83 (146)
T ss_pred EEcCCCChHHeEEEEECCEEEEEEEecc
Confidence 467999 3999999999999997643
No 163
>TIGR02189 GlrX-like_plant Glutaredoxin-like family. This family of glutaredoxin-like proteins is aparrently limited to plants. Multiple isoforms are found in A. thaliana and O.sativa.
Probab=36.87 E-value=83 Score=22.01 Aligned_cols=48 Identities=6% Similarity=0.065 Sum_probs=28.1
Q ss_pred EEEEEeecChhhHHHHHHHHhccCCee-EEEeCCCCEEEEEeecCHHHHHHHHhc--CCc
Q 048021 13 CGLKVDTNDPAWFSSMTKVLRKIKGAS-YTIDAEEGMAYITGRANPRKLLRKLQS--GKY 69 (143)
Q Consensus 13 vvlkV~m~C~gC~~kIkkaL~kI~GVs-V~VDle~~~VtV~G~vdp~~Ll~aI~k--Gk~ 69 (143)
+++=.--.|+-|. ++++.|.++ |+. ..+|+++ ..+..++++++.+ |.+
T Consensus 10 Vvvysk~~Cp~C~-~ak~~L~~~-~i~~~~vdid~-------~~~~~~~~~~l~~~tg~~ 60 (99)
T TIGR02189 10 VVIFSRSSCCMCH-VVKRLLLTL-GVNPAVHEIDK-------EPAGKDIENALSRLGCSP 60 (99)
T ss_pred EEEEECCCCHHHH-HHHHHHHHc-CCCCEEEEcCC-------CccHHHHHHHHHHhcCCC
Confidence 3333345899999 677788875 673 3333332 2333566677765 554
No 164
>PRK08868 flagellar protein FlaG; Provisional
Probab=36.61 E-value=78 Score=24.71 Aligned_cols=29 Identities=28% Similarity=0.515 Sum_probs=23.1
Q ss_pred eeEEEeCCCCEEEEE------e----ecCHHHHHHHHhc
Q 048021 38 ASYTIDAEEGMAYIT------G----RANPRKLLRKLQS 66 (143)
Q Consensus 38 VsV~VDle~~~VtV~------G----~vdp~~Ll~aI~k 66 (143)
+.|.||.+.+++.|+ | .++++++++.+++
T Consensus 91 L~F~vdeetgr~VVkViD~~T~EVIRQIP~Ee~L~la~~ 129 (144)
T PRK08868 91 LSFRVDEESGRDVVTIYEASTGDIIRQIPDEEMLEVLRR 129 (144)
T ss_pred eEEEEecCCCCEEEEEEECCCCceeeeCCCHHHHHHHHH
Confidence 369999999998887 3 3689998888876
No 165
>PRK10509 bacterioferritin-associated ferredoxin; Provisional
Probab=36.35 E-value=18 Score=24.10 Aligned_cols=22 Identities=9% Similarity=0.036 Sum_probs=17.8
Q ss_pred EEEEeecChhhHHHHHHHHhcc
Q 048021 14 GLKVDTNDPAWFSSMTKVLRKI 35 (143)
Q Consensus 14 vlkV~m~C~gC~~kIkkaL~kI 35 (143)
.+++-..|+.|...|++.|...
T Consensus 32 ~~~~g~~CG~C~~~i~~il~~~ 53 (64)
T PRK10509 32 FVPVGNQCGKCIRAAREVMQDE 53 (64)
T ss_pred hcCCCCCccchHHHHHHHHHHH
Confidence 3566678999999999998654
No 166
>PF00698 Acyl_transf_1: Acyl transferase domain; InterPro: IPR014043 Enzymes like bacterial malonyl CoA-acly carrier protein transacylase (2.3.1.39 from EC) and eukaryotic fatty acid synthase (2.3.1.85 from EC) that are involved in fatty acid biosynthesis belong to this group. Also included are the polyketide synthases 6-methylsalicylic acid synthase (2.3.1 from EC), a multifunctional enzyme that involved in the biosynthesis of patulin and conidial green pigment synthase (2.3.1 from EC).; PDB: 3HHD_C 2JFD_D 2JFK_A 3G87_A 3IM9_A 2QO3_B 3IM8_A 3EZO_A 2QJ3_A 2QC3_A ....
Probab=36.29 E-value=96 Score=25.67 Aligned_cols=53 Identities=17% Similarity=0.075 Sum_probs=39.2
Q ss_pred HHHHHHHhccCCeeEEEeCCCCEEEEEeec-CHHHHHHHHhc-CCceEEeecCCC
Q 048021 26 SSMTKVLRKIKGASYTIDAEEGMAYITGRA-NPRKLLRKLQS-GKYANLCWVSAG 78 (143)
Q Consensus 26 ~kIkkaL~kI~GVsV~VDle~~~VtV~G~v-dp~~Ll~aI~k-Gk~Ael~~~~~~ 78 (143)
..++..+..-++|.+.+....+.++|.|.. +-+++++.+++ |..+.++.+..+
T Consensus 136 ~~~~~~~~~~~~v~ia~~Ns~~q~visG~~~~l~~~~~~l~~~~~~~~~l~v~~a 190 (318)
T PF00698_consen 136 EEEEEKLALPPDVEIANINSPRQVVISGEREALEALVERLKAEGIKAKRLPVSYA 190 (318)
T ss_dssp HHHHHHHHTTTTEEEEEEEETTEEEEEEEHHHHHHHHHHHHHTTSEEEEESSSSE
T ss_pred hHHhhhccccccceeeeeccccccccCCCHHHHHHHHHHhhccceeEEEeeeecc
Confidence 344444455566777777778899999975 67778888888 988888877543
No 167
>COG2177 FtsX Cell division protein [Cell division and chromosome partitioning]
Probab=36.01 E-value=54 Score=28.08 Aligned_cols=31 Identities=19% Similarity=0.250 Sum_probs=24.1
Q ss_pred eEEEEEEeecChhhHHHHHHHHhccCCe-eEEE
Q 048021 11 MTCGLKVDTNDPAWFSSMTKVLRKIKGA-SYTI 42 (143)
Q Consensus 11 ~tvvlkV~m~C~gC~~kIkkaL~kI~GV-sV~V 42 (143)
.++.|+.+.. +.|.+.|++.|++++|| ++++
T Consensus 61 i~vyL~~~~~-~~~~~~v~~~i~~~~gV~~v~~ 92 (297)
T COG2177 61 ITVYLQIDAD-QDDAALVREKIEGIPGVKSVRF 92 (297)
T ss_pred EEEEEecCCC-hHHHHHHHHHHhcCCCcceEEE
Confidence 3455555556 99999999999999999 5655
No 168
>cd06526 metazoan_ACD Alpha-crystallin domain (ACD) of metazoan alpha-crystallin-type small(s) heat shock proteins (Hsps). sHsps are small stress induced proteins with monomeric masses between 12 -43 kDa, whose common feature is the Alpha-crystallin domain (ACD). sHsps are generally active as large oligomers consisting of multiple subunits, and are believed to be ATP-independent chaperones that prevent aggregation and are important in refolding in combination with other Hsps.
Probab=35.96 E-value=30 Score=23.26 Aligned_cols=22 Identities=14% Similarity=0.398 Sum_probs=18.1
Q ss_pred ccCCe---eEEEeCCCCEEEEEeec
Q 048021 34 KIKGA---SYTIDAEEGMAYITGRA 55 (143)
Q Consensus 34 kI~GV---sV~VDle~~~VtV~G~v 55 (143)
.++|+ +++|.++++.++|+|..
T Consensus 14 dlpG~~~edI~v~v~~~~L~I~g~~ 38 (83)
T cd06526 14 DVKGFKPEELKVKVSDNKLVVEGKH 38 (83)
T ss_pred ECCCCCHHHcEEEEECCEEEEEEEE
Confidence 46888 38888889999999974
No 169
>PRK10553 assembly protein for periplasmic nitrate reductase; Provisional
Probab=35.92 E-value=89 Score=22.11 Aligned_cols=32 Identities=13% Similarity=0.198 Sum_probs=24.9
Q ss_pred ceEEEEEEeecChhhHHHHHHHHhccCCe-eEE
Q 048021 10 KMTCGLKVDTNDPAWFSSMTKVLRKIKGA-SYT 41 (143)
Q Consensus 10 ~~tvvlkV~m~C~gC~~kIkkaL~kI~GV-sV~ 41 (143)
.-++++-++..+.+=.-..-+.|+.|+|| ++.
T Consensus 41 ~GKiVVtiE~~~~~~~~~~i~~I~~l~GVlsa~ 73 (87)
T PRK10553 41 SGQLIVVVEAEDSETLLQTIESVRNVEGVLAVS 73 (87)
T ss_pred CCeEEEEEEeCChHHHHHHHHHHHcCCCceEEE
Confidence 34677777888888777878889999999 643
No 170
>TIGR03027 pepcterm_export putative polysaccharide export protein, PEP-CTERM sytem-associated. This protein family belongs to the larger set of polysaccharide biosynthesis/export proteins described by Pfam model pfam02563. Members of this family are variable in either containing of lacking a 78-residue insert, but appear to fall within a single clade, nevertheless, where the regions in which the gene is found encode components of the PEP-CTERM/EpsH proposed exosortase protein sorting system.
Probab=35.84 E-value=79 Score=24.02 Aligned_cols=56 Identities=14% Similarity=0.158 Sum_probs=39.2
Q ss_pred EEEEe-ecChhhHHHHHHHHhcc-CC--eeEEEeC----CCCEEEEEeec---------CHHHHHHHHhc--CCc
Q 048021 14 GLKVD-TNDPAWFSSMTKVLRKI-KG--ASYTIDA----EEGMAYITGRA---------NPRKLLRKLQS--GKY 69 (143)
Q Consensus 14 vlkV~-m~C~gC~~kIkkaL~kI-~G--VsV~VDl----e~~~VtV~G~v---------dp~~Ll~aI~k--Gk~ 69 (143)
.++|. ++.+.=++.|+++|+++ .. |.|.+.. ...+|+|.|++ +...|+++|.. |..
T Consensus 38 ~v~v~G~T~~e~~~~I~~~l~~~~~~p~v~V~v~~~~~~~~~~V~V~GeV~~PG~y~~~~~~tl~~ai~~AGG~~ 112 (165)
T TIGR03027 38 DLVASGKTPTQLARDIEEKLAKYVRNPVVTVIVTGFIGPFSEQIRVVGAAANPQALPYREGMTLLDVMIAVGGLT 112 (165)
T ss_pred eEEECCCCHHHHHHHHHHHHHHhccCCeEEEEEeecccCCCcEEEEEEEeCCCceeeeCCCCcHHHHHHHcCCCC
Confidence 35564 58888999999999874 22 3444432 35799999975 36789999987 554
No 171
>PRK11282 glcE glycolate oxidase FAD binding subunit; Provisional
Probab=35.68 E-value=48 Score=28.95 Aligned_cols=36 Identities=14% Similarity=0.132 Sum_probs=29.9
Q ss_pred ccCCe-e-EEEeCCCCEEEEEeecCHHHHHHHHhc-CCc
Q 048021 34 KIKGA-S-YTIDAEEGMAYITGRANPRKLLRKLQS-GKY 69 (143)
Q Consensus 34 kI~GV-s-V~VDle~~~VtV~G~vdp~~Ll~aI~k-Gk~ 69 (143)
.+.+. . +++|.++.+|+|.+-+.-.+|.++|.+ |..
T Consensus 42 dl~~ln~Ile~d~~~~~vtV~AG~~l~el~~~L~~~G~~ 80 (352)
T PRK11282 42 DTRAHRGIVSYDPTELVITARAGTPLAELEAALAEAGQM 80 (352)
T ss_pred EcccCCCcEEEcCCCCEEEECCCCCHHHHHHHHHHcCCe
Confidence 44444 3 488999999999999999999999999 864
No 172
>PF07867 DUF1654: Protein of unknown function (DUF1654); InterPro: IPR012449 This entry is represented by Bacteriophage F116 (Pseudomonas phage F116), Orf28. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of proteins from the Pseudomonadaceae.
Probab=35.66 E-value=1.4e+02 Score=20.70 Aligned_cols=39 Identities=18% Similarity=0.389 Sum_probs=27.6
Q ss_pred EEEEEEee-cChhhHHHHHHHHhccCCeeEEEeCCCCEEEEE
Q 048021 12 TCGLKVDT-NDPAWFSSMTKVLRKIKGASYTIDAEEGMAYIT 52 (143)
Q Consensus 12 tvvlkV~m-~C~gC~~kIkkaL~kI~GVsV~VDle~~~VtV~ 52 (143)
.++.+.+. .-+.+. .|...|...+||.++.. ++|.|+|.
T Consensus 27 v~I~R~~~e~~~~W~-~vl~~i~Etegv~v~~~-dDGsv~i~ 66 (73)
T PF07867_consen 27 VTIHRLDDESDEDWE-QVLEEIAETEGVEVTFN-DDGSVRIR 66 (73)
T ss_pred EEEEECCCCCHHHHH-HHHHHHhcCCCeEEEEc-CCCeEEEE
Confidence 45567666 445554 66777888889888877 77788876
No 173
>COG0695 GrxC Glutaredoxin and related proteins [Posttranslational modification, protein turnover, chaperones]
Probab=35.46 E-value=67 Score=21.70 Aligned_cols=38 Identities=18% Similarity=0.342 Sum_probs=23.5
Q ss_pred ecChhhHHHHHHHHhccCCee-EEEeCCCCEEEEEeecCHHHHHHHHhc
Q 048021 19 TNDPAWFSSMTKVLRKIKGAS-YTIDAEEGMAYITGRANPRKLLRKLQS 66 (143)
Q Consensus 19 m~C~gC~~kIkkaL~kI~GVs-V~VDle~~~VtV~G~vdp~~Ll~aI~k 66 (143)
-.|+-|+ +.+++|.. .|++ .++++.+ .+++...+++++
T Consensus 9 ~~CPyC~-~ak~~L~~-~g~~~~~i~~~~--------~~~~~~~~~~~~ 47 (80)
T COG0695 9 PGCPYCK-RAKRLLDR-KGVDYEEIDVDD--------DEPEEAREMVKR 47 (80)
T ss_pred CCCchHH-HHHHHHHH-cCCCcEEEEecC--------CcHHHHHHHHHH
Confidence 3699999 66777876 6774 4444443 223455566666
No 174
>cd01790 Herp_N Homocysteine-responsive endoplasmic reticulum-resident ubiquitin-like domain protein. Herp (Homocysteine-responsive endoplasmic reticulum-resident ubiquitin-like domain protein) , is an integral membrane protein that is induced by the endoplasmic reticulum (ER) stress response pathway and is involved in improving the balance of folding capacity and protein loads in the ER. Herp has an N-terminal ubiquitin-like domain that is involved in Herp degradation, but is not necessary for its enhancement of amyloid beta-protein generation.
Probab=35.39 E-value=1.5e+02 Score=20.60 Aligned_cols=57 Identities=11% Similarity=0.006 Sum_probs=37.4
Q ss_pred CCceEEEEEEeecChhhHHHHHHHHhccCCeeEEEeCCCCEEEEEeec--CHHHHHHHHhc
Q 048021 8 SAKMTCGLKVDTNDPAWFSSMTKVLRKIKGASYTIDAEEGMAYITGRA--NPRKLLRKLQS 66 (143)
Q Consensus 8 s~~~tvvlkV~m~C~gC~~kIkkaL~kI~GVsV~VDle~~~VtV~G~v--dp~~Ll~aI~k 66 (143)
+...+..|.|+..=.-.+..+|+.|.+..|. ..+.+.++++..|++ |...|.+.+++
T Consensus 9 ~~~~~~~~~ve~~~~~TV~~lK~~i~~~~~~--~~~~~~QrLIy~GKiLkD~~tL~~~~~~ 67 (79)
T cd01790 9 PNQKYEDQTVSCFLNWTVGELKTHLSRVYPS--KPLEQDQRLIYSGKLLPDHLKLRDVLRK 67 (79)
T ss_pred CCCCeEEEEEecCCcChHHHHHHHHHHhcCC--CCChhHeEEEEcCeeccchhhHHHHhhc
Confidence 3444656666544445788888888764331 244578899999986 77777777644
No 175
>PF12685 SpoIIIAH: SpoIIIAH-like protein; InterPro: IPR024232 Stage III sporulation protein AH (SpoIIIAH) is a protein that is involved in forespore engulfment. It forms a channel with SpoIIIAH that is open on the forespore end and closed (or gated) on the mother cell end. This allows sigma-E-directed gene expression in the mother-cell compartment of the sporangium to trigger the activation of sigma-G forespore-specific gene expression by a pathway of intercellular signaling. This family of proteins is found in bacteria, archaea and eukaryotes and so must have a wider function than in sporulation. Proteins in this family are typically between 174 and 223 amino acids in length.; PDB: 3UZ0_A 3TUF_A.
Probab=35.37 E-value=73 Score=25.14 Aligned_cols=48 Identities=15% Similarity=0.172 Sum_probs=30.9
Q ss_pred HHHHHHHhccCCe-eEEEeCCCCEEEEEee------cCHHHHHHHHhc--CCceEEee
Q 048021 26 SSMTKVLRKIKGA-SYTIDAEEGMAYITGR------ANPRKLLRKLQS--GKYANLCW 74 (143)
Q Consensus 26 ~kIkkaL~kI~GV-sV~VDle~~~VtV~G~------vdp~~Ll~aI~k--Gk~Ael~~ 74 (143)
..|+..|++ +|- ++-|.+++++|.|.-. .+..+|.+.+.+ |..++-+.
T Consensus 138 ~~iE~llka-kGf~davv~~~~~~v~VvV~~~~L~~~~~~~I~diV~~~~~v~~~~I~ 194 (196)
T PF12685_consen 138 MEIENLLKA-KGFEDAVVFIEDDSVDVVVKADKLSDAEAAQIIDIVMRETGVPAENIS 194 (196)
T ss_dssp HHHHHHHHT-TS-SEEEEE-SSSEEEEEEE-S---HHHHHHHHHHHHHHHC-STSEEE
T ss_pred HHHHHHHHh-CCCCceEEEeeCCEEEEEEeCCCCCHHHHHHHHHHHHHHhCCCcCeEE
Confidence 466676665 798 7888888888877632 367788888877 76655443
No 176
>PLN02432 putative pectinesterase
Probab=35.14 E-value=90 Score=26.77 Aligned_cols=32 Identities=16% Similarity=0.214 Sum_probs=24.1
Q ss_pred CCCCCceEEEEEEeecChhhHHHHHHHHhccC
Q 048021 5 SSSSAKMTCGLKVDTNDPAWFSSMTKVLRKIK 36 (143)
Q Consensus 5 s~~s~~~tvvlkV~m~C~gC~~kIkkaL~kI~ 36 (143)
|++-..++.++.|.....|.-++|..||..++
T Consensus 3 ~~~~~~~~~~~~Va~~Gsg~f~TIq~Aida~p 34 (293)
T PLN02432 3 SSIDLSTAILIRVDQSGKGDFRKIQDAIDAVP 34 (293)
T ss_pred ccccccceEEEEECCCCCCCccCHHHHHhhcc
Confidence 33444456777787777788999999998875
No 177
>KOG1653 consensus Single-stranded DNA-binding protein [Replication, recombination and repair]
Probab=34.99 E-value=49 Score=26.66 Aligned_cols=38 Identities=18% Similarity=0.356 Sum_probs=31.0
Q ss_pred EeCCCCEEEEEeecCHHHHHHHHhcCCceEEeecCCCC
Q 048021 42 IDAEEGMAYITGRANPRKLLRKLQSGKYANLCWVSAGN 79 (143)
Q Consensus 42 VDle~~~VtV~G~vdp~~Ll~aI~kGk~Ael~~~~~~~ 79 (143)
.+..=++|++.|.|-.+=|++.++.|+.+.++++..+.
T Consensus 51 ~~~~vnkv~lvG~VGqdPl~k~~rngrpVtiFsv~T~~ 88 (175)
T KOG1653|consen 51 LERGVNKVILVGRVGQDPLQKILRNGRPVTIFSVGTGG 88 (175)
T ss_pred hhcccceEEEEcccccchHHHhhcCCCeEEEEEeecCc
Confidence 33445789999999888888888889999999997654
No 178
>TIGR01617 arsC_related transcriptional regulator, Spx/MgsR family. This model represents a portion of the proteins within the larger set covered by Pfam model pfam03960. That larger family includes a glutaredoxin-dependent arsenate reductase (TIGR00014). Characterized members of this family include Spx and MgsR from Bacillus subtili. Spx is a global regulator for response to thiol-specific oxidative stress. It interacts with RNA polymerase. MgsR (modulator of the general stress response, also called YqgZ) provides a second level of regulation for more than a third of the proteins in the B. subtilis general stress regulon controlled by Sigma-B.
Probab=34.73 E-value=68 Score=22.92 Aligned_cols=45 Identities=20% Similarity=0.159 Sum_probs=29.9
Q ss_pred ecChhhHHHHHHHHhccCCee-EEEeCCCCEEEEEeecCHHHHHHHHhc-CCceE
Q 048021 19 TNDPAWFSSMTKVLRKIKGAS-YTIDAEEGMAYITGRANPRKLLRKLQS-GKYAN 71 (143)
Q Consensus 19 m~C~gC~~kIkkaL~kI~GVs-V~VDle~~~VtV~G~vdp~~Ll~aI~k-Gk~Ae 71 (143)
-.|..|++. ++.|.+ .|+. ..+|..+ ...+.++|.+.+++ |..++
T Consensus 7 ~~C~~c~ka-~~~L~~-~~i~~~~idi~~------~~~~~~el~~l~~~~~~~~~ 53 (117)
T TIGR01617 7 PNCTTCKKA-RRWLEA-NGIEYQFIDIGE------DGPTREELLDILSLLEDGID 53 (117)
T ss_pred CCCHHHHHH-HHHHHH-cCCceEEEecCC------ChhhHHHHHHHHHHcCCCHH
Confidence 479999954 566766 5773 4555543 23457788888888 75443
No 179
>PRK10329 glutaredoxin-like protein; Provisional
Probab=34.71 E-value=98 Score=20.86 Aligned_cols=26 Identities=8% Similarity=0.216 Sum_probs=18.5
Q ss_pred ecChhhHHHHHHHHhccCCee-EEEeCCC
Q 048021 19 TNDPAWFSSMTKVLRKIKGAS-YTIDAEE 46 (143)
Q Consensus 19 m~C~gC~~kIkkaL~kI~GVs-V~VDle~ 46 (143)
-.|.-|.+ +++.|.+ .||. -++|.++
T Consensus 9 ~~Cp~C~~-ak~~L~~-~gI~~~~idi~~ 35 (81)
T PRK10329 9 NDCVQCHA-TKRAMES-RGFDFEMINVDR 35 (81)
T ss_pred CCCHhHHH-HHHHHHH-CCCceEEEECCC
Confidence 47999995 8888877 6884 4555543
No 180
>PF04459 DUF512: Protein of unknown function (DUF512); InterPro: IPR007549 This is a domain of uncharacterised prokaryotic proteins. It is often found C-terminal to the radical SAM domain (IPR007197 from INTERPRO).
Probab=34.65 E-value=1.2e+02 Score=24.61 Aligned_cols=37 Identities=16% Similarity=0.377 Sum_probs=27.0
Q ss_pred HHHhccCCeeEE---EeCC--CCEEEEEeecCHHHHHHHHhc
Q 048021 30 KVLRKIKGASYT---IDAE--EGMAYITGRANPRKLLRKLQS 66 (143)
Q Consensus 30 kaL~kI~GVsV~---VDle--~~~VtV~G~vdp~~Ll~aI~k 66 (143)
+.|++++|.+|+ |.-+ .++|||.|-+....|+++|+.
T Consensus 117 ~~l~~~~~~~v~V~~V~N~fFG~~ItVaGLLTg~Dii~~L~~ 158 (204)
T PF04459_consen 117 EKLNRIPGLEVEVVPVKNRFFGGTITVAGLLTGQDIIEQLKG 158 (204)
T ss_pred HHHhccCCCeEEEEEeecCCCCCCeEEeeCccHHHHHHHhCc
Confidence 334455776433 3333 689999999999999999987
No 181
>TIGR02886 spore_II_AA anti-sigma F factor antagonist. The anti-sigma F factor antagonist, also called stage II sporulation protein AA, is a protein universal among endospore-forming bacteria, all of which belong to the Firmcutes
Probab=34.65 E-value=1.5e+02 Score=20.14 Aligned_cols=66 Identities=9% Similarity=0.104 Sum_probs=39.6
Q ss_pred eEEEEEEee-----cChhhHHHHHHHHhccCCe-eEEEeCCCC-EEEEEeecCHHHHHHHHhc-CCceEEeecCC
Q 048021 11 MTCGLKVDT-----NDPAWFSSMTKVLRKIKGA-SYTIDAEEG-MAYITGRANPRKLLRKLQS-GKYANLCWVSA 77 (143)
Q Consensus 11 ~tvvlkV~m-----~C~gC~~kIkkaL~kI~GV-sV~VDle~~-~VtV~G~vdp~~Ll~aI~k-Gk~Ael~~~~~ 77 (143)
..+++++.+ .++...+.+.+++.. ++. .+.+|+.+= -+-.+|-.--..+.+.+++ |.+..++++.+
T Consensus 8 ~~~vi~l~G~L~f~~~~~~~~~l~~~~~~-~~~~~vilDls~v~~iDssgi~~L~~~~~~~~~~g~~l~l~~~~~ 81 (106)
T TIGR02886 8 DVLIVRLSGELDHHTAERVRRKIDDAIER-RPIKHLILNLKNVTFMDSSGLGVILGRYKKIKNEGGEVIVCNVSP 81 (106)
T ss_pred CEEEEEEecccchhhHHHHHHHHHHHHHh-CCCCEEEEECCCCcEecchHHHHHHHHHHHHHHcCCEEEEEeCCH
Confidence 467778765 345555555544432 355 588888752 2222233344566777777 99999988854
No 182
>PF12229 PG_binding_4: Putative peptidoglycan binding domain; InterPro: IPR022029 This domain is found associated with the L,D-transpeptidase domain PF03734 from PFAM. The structure of this domain has been solved and shows a mixed alpha-beta fold composed of nine beta strands and four alpha helices. This domain is usually found to be duplicated. Therefore, it seems likely that this domain acts to bind the two unlinked peptidoglycan chains and bring them into close association so they can be cross linked by the transpeptidase domain. ; PDB: 2HKL_C 1ZAT_A.
Probab=34.36 E-value=1.3e+02 Score=20.17 Aligned_cols=31 Identities=23% Similarity=0.335 Sum_probs=20.0
Q ss_pred CCeeEEEeCCCCEEEEE----e-ecCHHHHHHHHhc
Q 048021 36 KGASYTIDAEEGMAYIT----G-RANPRKLLRKLQS 66 (143)
Q Consensus 36 ~GVsV~VDle~~~VtV~----G-~vdp~~Ll~aI~k 66 (143)
++++..+..++++++|. | .+|.+++.++|.+
T Consensus 69 ~~~na~~~~~~~~~~i~~~~~G~~id~~~~~~~i~~ 104 (114)
T PF12229_consen 69 PPVNASISIDGGTFTITPGQYGWKIDVDKLAEAIKK 104 (114)
T ss_dssp TTS-EEEE-SSCEEEE---SS-EEE-HHHHHHHHHH
T ss_pred CCcceEEEEeCCEEEEcCCcCeEEEcHHHHHHHHHH
Confidence 56755565699999997 4 3788888888876
No 183
>PRK14023 homoaconitate hydratase small subunit; Provisional
Probab=34.30 E-value=57 Score=25.68 Aligned_cols=33 Identities=27% Similarity=0.408 Sum_probs=24.7
Q ss_pred CCeeEEEeCCCCEEEEEee-----cCHHHHHHHHhc-CC
Q 048021 36 KGASYTIDAEEGMAYITGR-----ANPRKLLRKLQS-GK 68 (143)
Q Consensus 36 ~GVsV~VDle~~~VtV~G~-----vdp~~Ll~aI~k-Gk 68 (143)
+|..++||+++++|+..|. ..++.+++.|+. |.
T Consensus 112 ~gd~i~vDl~~~~v~~~g~~~~~~~~~~~~~~il~aGGl 150 (166)
T PRK14023 112 DGDEVELDLETGVLTRGGETFQLRPPPEFLLEALKEGSI 150 (166)
T ss_pred CCCEEEEECCCCEEEECCEEEEEeeCCHHHHHHHHcCCH
Confidence 5668999999999987653 356667777776 53
No 184
>cd04885 ACT_ThrD-I Tandem C-terminal ACT domains of threonine dehydratase I (ThrD-I; L-threonine hydrolyase). This CD includes each of two tandem C-terminal ACT domains of threonine dehydratase I (ThrD-I; L-threonine hydrolyase) which catalyzes the committed step in branched chain amino acid biosynthesis in plants and microorganisms, the pyridoxal 5'-phosphate (PLP)-dependent dehydration/deamination of L-threonine (or L-serine) to 2-ketobutyrate (or pyruvate). ThrD-I is a cooperative, feedback-regulated (isoleucine and valine) allosteric enzyme that forms a tetramer and contains four pyridoxal phosphate moieties. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=34.24 E-value=1.3e+02 Score=19.26 Aligned_cols=54 Identities=15% Similarity=0.199 Sum_probs=31.2
Q ss_pred EEee-cChhhHHHHHHHHhccCCe---eEEEeC---CCCEEEEEe----ecCHHHHHHHHhc-CCceE
Q 048021 16 KVDT-NDPAWFSSMTKVLRKIKGA---SYTIDA---EEGMAYITG----RANPRKLLRKLQS-GKYAN 71 (143)
Q Consensus 16 kV~m-~C~gC~~kIkkaL~kI~GV---sV~VDl---e~~~VtV~G----~vdp~~Ll~aI~k-Gk~Ae 71 (143)
+|.+ ..+|=-.++-+.|.. |. ++.-+. +...|.|.= ....++|+++|++ |+.++
T Consensus 2 ~v~ipdkPG~l~~~~~~i~~--~~nI~~~~~~~~~~~~~~v~v~ie~~~~~~~~~i~~~L~~~G~~~~ 67 (68)
T cd04885 2 AVTFPERPGALKKFLELLGP--PRNITEFHYRNQGGDEARVLVGIQVPDREDLAELKERLEALGYPYV 67 (68)
T ss_pred EEECCCCCCHHHHHHHHhCC--CCcEEEEEEEcCCCCceEEEEEEEeCCHHHHHHHHHHHHHcCCCcc
Confidence 4444 355555666666655 43 344443 334444432 2356788899998 99753
No 185
>PF02120 Flg_hook: Flagellar hook-length control protein FliK; InterPro: IPR021136 This entry represents the C-terminal domain of the flagellar hook-length control protein FliK. This entry also includes YscP of the Yersinia type III secretion system, and equivalent proteins in other pathogenic bacterial type III secretion systems. During flagellar morphogenesis in Salmonella typhimurium and Escherichia coli, flagellar hook-length control protein (FliK) controls the length of the hook by directly measuring the hook length [, ]. It is considered unlikely that FliK functions as a molecular ruler for determining hook length, but that it is more likely to be employing a novel mechanism. The deduced amino acid sequences of FliK proteins from S. typhimurium and E. coli have molecular masses of 41,748 and 39,246 Da, respectively, and are fairly hydrophilic []. Sequence comparison reveals around 50% identity, with greatest conservation in the C-terminal region, with 71% identity in the last 154 amino acids - mutagenesis of this conserved region completely abolishes motility. The central and C-terminal regions are rich in proline and glutamine respectively; it is thought that they may constitute distinct domains [].; PDB: 2RRL_A.
Probab=33.98 E-value=1.4e+02 Score=19.63 Aligned_cols=44 Identities=23% Similarity=0.175 Sum_probs=30.0
Q ss_pred CeeEEEeCCCCEEEEEeec-----------CHHHHHHHHhc-CCceEEeecCCCCC
Q 048021 37 GASYTIDAEEGMAYITGRA-----------NPRKLLRKLQS-GKYANLCWVSAGNQ 80 (143)
Q Consensus 37 GVsV~VDle~~~VtV~G~v-----------dp~~Ll~aI~k-Gk~Ael~~~~~~~s 80 (143)
.|.|.+.+.++.++|.=.+ ....|.++|++ |+.+.-+.+..+++
T Consensus 26 ~v~v~l~~~~~~l~v~~~~~~~~~~~~L~~~~~~L~~~L~~~G~~~~~~~v~~~~~ 81 (85)
T PF02120_consen 26 SVEVKLRLQGGNLSVQFTAENPETKELLRQNLPELKERLQAQGLEVVNLSVSQGSS 81 (85)
T ss_dssp -EEEEEEEETTEEEEEEE--SSHHHHHHHHTHHHHHHHHHTTT-EEEEEEEESS--
T ss_pred cEEEEEEEeCCEEEEEEEECCHHHHHHHHHHHHHHHHHHHHCCCCeEEEEEEECCC
Confidence 3567888888888887322 46788999999 99998777765544
No 186
>COG2092 EFB1 Translation elongation factor EF-1beta [Translation, ribosomal structure and biogenesis]
Probab=33.84 E-value=76 Score=22.87 Aligned_cols=30 Identities=17% Similarity=0.299 Sum_probs=22.1
Q ss_pred EEEEee-cChhhHHHHHHHHhccCCe-eEEEe
Q 048021 14 GLKVDT-NDPAWFSSMTKVLRKIKGA-SYTID 43 (143)
Q Consensus 14 vlkV~m-~C~gC~~kIkkaL~kI~GV-sV~VD 43 (143)
.|.|-| .-.|--..+.++|++++|| ++++.
T Consensus 52 ~l~vvv~D~Eg~td~~ee~l~~vegV~sveve 83 (88)
T COG2092 52 KLYVVVEDKEGGTDALEEALEEVEGVESVEVE 83 (88)
T ss_pred EEEEEEcccccCcHHHHHHHhhccCcceEEEE
Confidence 344444 4567788999999999999 67663
No 187
>cd06472 ACD_ScHsp26_like Alpha crystallin domain (ACD) found in Saccharomyces cerevisiae (Sc) small heat shock protein (Hsp)26 and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. ScHsp26 is temperature-regulated, it switches from an inactive to a chaperone-active form upon elevation in temperature. It associates into large 24-mers storage forms which upon heat shock disassociate into dimers. These dimers initiate the interaction with non-native substrate proteins and re-assemble into large globular assemblies having one monomer of substrate bound per dimer. This group also contains Arabidopsis thaliana (Ath) Hsp15.7, a peroxisomal matrix protein which can complement the morphological phenotype of S. cerevisiae mutants deficient in Hsps26. AthHsp15.7 is minimally expressed under normal conditions and is strongly induced by heat and oxidative st
Probab=33.66 E-value=43 Score=22.87 Aligned_cols=23 Identities=22% Similarity=0.570 Sum_probs=17.7
Q ss_pred ccCCe---eEEEeCCCC-EEEEEeecC
Q 048021 34 KIKGA---SYTIDAEEG-MAYITGRAN 56 (143)
Q Consensus 34 kI~GV---sV~VDle~~-~VtV~G~vd 56 (143)
.|+|| +++|+++++ .++|+|...
T Consensus 16 ~lPGv~~edi~i~v~~~~~L~I~g~~~ 42 (92)
T cd06472 16 DVPGVKKEDVKVEVEDGRVLRISGERK 42 (92)
T ss_pred ECCCCChHhEEEEEeCCCEEEEEEEec
Confidence 57898 288888764 899999753
No 188
>PRK07560 elongation factor EF-2; Reviewed
Probab=33.51 E-value=2e+02 Score=27.25 Aligned_cols=61 Identities=15% Similarity=0.304 Sum_probs=43.3
Q ss_pred EEEEEEeecChhhHHHHHHHHhccC----CeeEEEeCCCCEEEEEe--ecCHHHHHHHHhc--CCceEE
Q 048021 12 TCGLKVDTNDPAWFSSMTKVLRKIK----GASYTIDAEEGMAYITG--RANPRKLLRKLQS--GKYANL 72 (143)
Q Consensus 12 tvvlkV~m~C~gC~~kIkkaL~kI~----GVsV~VDle~~~VtV~G--~vdp~~Ll~aI~k--Gk~Ael 72 (143)
.+.+.|.-.-.+=..++.++|.+|. ++.|..|.+.+...|.| ++--+.++++|++ |..+++
T Consensus 390 v~~~aI~p~~~~d~~kL~~aL~~L~~eDPsl~v~~~~etge~~l~g~GElHLei~~~rL~~~~~vev~~ 458 (731)
T PRK07560 390 VVTVAIEAKNPKDLPKLIEVLRQLAKEDPTLVVKINEETGEHLLSGMGELHLEVITYRIKRDYGIEVVT 458 (731)
T ss_pred eEEEEEEECCHHHHHHHHHHHHHHHhhCCcEEEEEcCCCCCeEEEcCCHHHHHHHHHHHHHHhCCceEe
Confidence 4444454445556788999998873 33577788889988885 5678899999977 655543
No 189
>PF14611 SLS: Mitochondrial inner-membrane-bound regulator
Probab=33.49 E-value=87 Score=24.51 Aligned_cols=41 Identities=17% Similarity=0.174 Sum_probs=27.5
Q ss_pred CCeeEEEeCCCCEEEEEee-cCHHHHHHHHhc---CCceEEeecC
Q 048021 36 KGASYTIDAEEGMAYITGR-ANPRKLLRKLQS---GKYANLCWVS 76 (143)
Q Consensus 36 ~GVsV~VDle~~~VtV~G~-vdp~~Ll~aI~k---Gk~Ael~~~~ 76 (143)
-|+.|++..+.+++.|+|. ...+.|...|.+ .-..+-+...
T Consensus 55 ~~~~I~~~~~~~~i~I~g~k~~~~~i~~~i~~~l~~i~~~~i~l~ 99 (210)
T PF14611_consen 55 NGAKIEVSRSENRIRITGTKSTAEYIEASINEILSNIRTEEIDLS 99 (210)
T ss_pred cCceEEEecCCcEEEEEccHHHHHHHHHHHHHHHhhcEEEEEecc
Confidence 5667888889999999985 456666666665 4444444444
No 190
>TIGR02898 spore_YhcN_YlaJ sporulation lipoprotein, YhcN/YlaJ family. YhcN and YlaJ are predicted lipoproteins that have been detected as spore proteins but not vegetative proteins in Bacillus subtilis. Both appear to be expressed under control of the RNA polymerase sigma-G factor. The YlaJ-like members of this family have a low-complexity, strongly acidic 40-residue C-terminal domain that is not included in the seed alignment for this model. A portion of the low-complexity region between the lipoprotein signal sequence and the main conserved region of the protein family was also excised from the seed alignment.
Probab=33.48 E-value=1.3e+02 Score=23.51 Aligned_cols=30 Identities=20% Similarity=0.256 Sum_probs=23.4
Q ss_pred hhHHHHHHHHhccCCe-eEEEeCCCCEEEEE
Q 048021 23 AWFSSMTKVLRKIKGA-SYTIDAEEGMAYIT 52 (143)
Q Consensus 23 gC~~kIkkaL~kI~GV-sV~VDle~~~VtV~ 52 (143)
.=+.+|.+.+.+++|| ++.+=...+.+.|-
T Consensus 54 ~~A~~Ia~~v~~v~~V~dA~vvVtg~~A~Vg 84 (158)
T TIGR02898 54 DVADEIASEAAKVKGVKDATVVITGNYAYVG 84 (158)
T ss_pred HHHHHHHHHHhcCCCCceEEEEEECCEEEEE
Confidence 6788999999999999 66666666666664
No 191
>PRK10614 multidrug efflux system subunit MdtC; Provisional
Probab=33.09 E-value=64 Score=31.77 Aligned_cols=60 Identities=10% Similarity=0.086 Sum_probs=42.2
Q ss_pred EEEEEEee----cChhhHHHHHHHHhccCCe-eEEEeCCCC--EEEEEe--------ecCHHHHHHHHhc---CCceE
Q 048021 12 TCGLKVDT----NDPAWFSSMTKVLRKIKGA-SYTIDAEEG--MAYITG--------RANPRKLLRKLQS---GKYAN 71 (143)
Q Consensus 12 tvvlkV~m----~C~gC~~kIkkaL~kI~GV-sV~VDle~~--~VtV~G--------~vdp~~Ll~aI~k---Gk~Ae 71 (143)
.+.++|-+ .=..-+++|++.|++++|| +|..|...+ ++.|+= -+++..|.++|+. |..+.
T Consensus 660 ~i~i~l~G~d~~~L~~~a~~i~~~L~~~pgv~~v~~~~~~~~~el~i~id~~ka~~~Gls~~~v~~~l~~~~~G~~~~ 737 (1025)
T PRK10614 660 SYQYTLLSDDLAALREWEPKIRKALAALPELADVNSDQQDKGAEMALTYDRDTMARLGIDVQAANSLLNNAFGQRQIS 737 (1025)
T ss_pred CEEEEEECCCHHHHHHHHHHHHHHHhcCCCeEEeecCCCCCCceEEEEECHHHHHHcCCCHHHHHHHHHHHhCCceee
Confidence 36666643 3456788999999999999 788876654 555541 2577888888887 66553
No 192
>COG1908 FrhD Coenzyme F420-reducing hydrogenase, delta subunit [Energy production and conversion]
Probab=32.90 E-value=57 Score=25.15 Aligned_cols=40 Identities=23% Similarity=0.366 Sum_probs=27.1
Q ss_pred EEEeCCCCEEEE--EeecCHHHHHHHHhcCCceEE-eecCCCC
Q 048021 40 YTIDAEEGMAYI--TGRANPRKLLRKLQSGKYANL-CWVSAGN 79 (143)
Q Consensus 40 V~VDle~~~VtV--~G~vdp~~Ll~aI~kGk~Ael-~~~~~~~ 79 (143)
+.....-..+.| +|+++|+-|++|+++|-+..+ .+..-++
T Consensus 24 mqyp~~vRiIrv~CsGrvn~~fvl~Al~~GaDGV~v~GC~~ge 66 (132)
T COG1908 24 MQYPPNVRIIRVMCSGRVNPEFVLKALRKGADGVLVAGCKIGE 66 (132)
T ss_pred ccCCCceEEEEeeccCccCHHHHHHHHHcCCCeEEEecccccc
Confidence 344444444444 499999999999999888664 4454444
No 193
>cd03421 SirA_like_N SirA_like_N, a protein of unknown function with an N-terminal SirA-like domain. The SirA, YedF, YeeD protein family is present in bacteria as well as archaea. SirA (also known as UvrY, and YhhP) belongs to a family of a two-component response regulators that controls secondary metabolism and virulence. The other member of this two-component system is a sensor kinase called BarA which phosphorylates SirA. A variety of microorganisms have similar proteins, all of which contain a common CPxP sequence motif in the N-terminal region. YhhP is suggested to be important for normal cell division and growth in rich nutrient medium. Moreover, despite a low primary sequence similarity, the YccP structure closely resembles the non-homologous C-terminal RNA-binding domain of E. coli translation initiation factor IF3. The signature CPxP motif serves to stabilize the N-terminal helix as part of the N-capping box and might be important in mRNA-binding.
Probab=32.76 E-value=1.3e+02 Score=19.12 Aligned_cols=47 Identities=15% Similarity=0.068 Sum_probs=32.4
Q ss_pred ecChhhHHHHHHHHhccCCeeEEEeCCCCEEEEEe--ecCHHHHHHHHhc-CCceEEee
Q 048021 19 TNDPAWFSSMTKVLRKIKGASYTIDAEEGMAYITG--RANPRKLLRKLQS-GKYANLCW 74 (143)
Q Consensus 19 m~C~gC~~kIkkaL~kI~GVsV~VDle~~~VtV~G--~vdp~~Ll~aI~k-Gk~Ael~~ 74 (143)
+.|+.-.-+++++| +++. .+.+.|.. ......|.+.+++ |+.+++..
T Consensus 7 ~~CP~P~l~~k~al-~~~~--------g~~l~v~~d~~~s~~~i~~~~~~~G~~~~~~~ 56 (67)
T cd03421 7 LACPQPVIKTKKAL-ELEA--------GGEIEVLVDNEVAKENVSRFAESRGYEVSVEE 56 (67)
T ss_pred CCCCHHHHHHHHHH-hcCC--------CCEEEEEEcChhHHHHHHHHHHHcCCEEEEEe
Confidence 47999999999999 6542 22344433 3445788888888 99986443
No 194
>COG1334 FlaG Uncharacterized flagellar protein FlaG [Cell motility and secretion]
Probab=32.58 E-value=1.2e+02 Score=22.97 Aligned_cols=43 Identities=19% Similarity=0.363 Sum_probs=31.8
Q ss_pred hHHHHHHHHhccCC-eeEEEeCCCCEEEEE------ee----cCHHHHHHHHhc
Q 048021 24 WFSSMTKVLRKIKG-ASYTIDAEEGMAYIT------GR----ANPRKLLRKLQS 66 (143)
Q Consensus 24 C~~kIkkaL~kI~G-VsV~VDle~~~VtV~------G~----vdp~~Ll~aI~k 66 (143)
-.+.|.+.|+.+.. +++.+|-+-|.+.|+ |+ ++|+++++..++
T Consensus 54 ~v~~ink~~k~~nt~l~F~~dd~lg~~vVkI~d~~TgeVIRqIPpee~L~l~~r 107 (120)
T COG1334 54 IVEDINKLLKSLNTHLNFSYDDELGELVVKIIDKDTGEVIRQIPPEEALELAAR 107 (120)
T ss_pred HHHHHHHHHHhhcCceEEEEecccCcEEEEEEECCCCcchhhCChHHHHHHHHH
Confidence 34567777777754 489999999998887 32 588888877665
No 195
>TIGR00377 ant_ant_sig anti-anti-sigma factor. This superfamily includes small (105-125 residue) proteins related to SpoIIAA of Bacillus subtilis, an anti-anti-sigma factor. SpoIIAA can bind to and inhibit the anti-sigma F factor SpoIIAB. Also, it can be phosphorylated by SpoIIAB on a Ser residue at position 59 of the seed alignment. A similar arrangement is inferred for RsbV, an anti-anti-sigma factor for sigma B. This Ser is fairly well conserved within a motif resembling MXS[STA]G[VIL]X[VIL][VILF] among homologous known or predicted anti-anti-sigma factors. Regions similar to SpoIIAA and apparently homologous, but differing considerably near the phosphorlated Ser of SpoIIAA, appear in a single copy in several longer proteins.
Probab=32.48 E-value=1.6e+02 Score=19.83 Aligned_cols=64 Identities=16% Similarity=0.255 Sum_probs=38.4
Q ss_pred EEEEEEee-----cChhhHHHHHHHHhccCCe-eEEEeCCCCEEEEE--eecCHHHHHHHHhc-CCceEEeecCC
Q 048021 12 TCGLKVDT-----NDPAWFSSMTKVLRKIKGA-SYTIDAEEGMAYIT--GRANPRKLLRKLQS-GKYANLCWVSA 77 (143)
Q Consensus 12 tvvlkV~m-----~C~gC~~kIkkaL~kI~GV-sV~VDle~~~VtV~--G~vdp~~Ll~aI~k-Gk~Ael~~~~~ 77 (143)
.+++++.+ .+....+.+..++.+ ++. .+.+|+.+ .-.+. |-.--..+.+.+++ |.+..++++.+
T Consensus 13 ~~vi~~~G~l~~~~~~~~~~~l~~~~~~-~~~~~vvidls~-v~~iDssgl~~L~~~~~~~~~~~~~~~l~~~~~ 85 (108)
T TIGR00377 13 VVIVRLSGELDAHTAPLLREKVTPAAER-TGPRPIVLDLED-LEFMDSSGLGVLLGRYKQVRRVGGQLVLVSVSP 85 (108)
T ss_pred EEEEEEecccccccHHHHHHHHHHHHHh-cCCCeEEEECCC-CeEEccccHHHHHHHHHHHHhcCCEEEEEeCCH
Confidence 55666643 466666666666654 466 58899874 33333 22234444555566 88888888754
No 196
>cd04877 ACT_TyrR N-terminal ACT domain of the TyrR protein. ACT_TyrR: N-terminal ACT domain of the TyrR protein. The TyrR protein of Escherichia coli controls the expression of a group of transcription units (TyrR regulon) whose gene products are involved in the biosynthesis or transport of the aromatic amino acids. Binding to specific DNA sequences known as TyrR boxes, the TyrR protein can either activate or repress transcription at different sigma70 promoters. Its regulatory activity occurs in response to intracellular levels of tyrosine, phenylalanine and tryptophan. The TyrR protein consists of an N-terminal region important for transcription activation with an ATP-independent aromatic amino acid binding site (contained within the ACT domain) and is involved in dimerization; a central region with an ATP binding site, an ATP-dependent aromatic amino acid binding site and is involved in hexamerization; and a helix turn helix DNA binding C-terminal region. In solution, in the absence
Probab=32.42 E-value=1e+02 Score=20.00 Aligned_cols=29 Identities=17% Similarity=0.214 Sum_probs=17.7
Q ss_pred EEEEEeecChhhHHHHHHHHhccCCe-eEE
Q 048021 13 CGLKVDTNDPAWFSSMTKVLRKIKGA-SYT 41 (143)
Q Consensus 13 vvlkV~m~C~gC~~kIkkaL~kI~GV-sV~ 41 (143)
+.|.++..--.=-..+.+.|++++|| +|+
T Consensus 39 i~l~i~v~~~~~L~~li~~L~~i~gV~~V~ 68 (74)
T cd04877 39 IYLNFPTIEFEKLQTLMPEIRRIDGVEDVK 68 (74)
T ss_pred EEEEeEecCHHHHHHHHHHHhCCCCceEEE
Confidence 44555543222246778888999998 653
No 197
>PF07338 DUF1471: Protein of unknown function (DUF1471); InterPro: IPR010854 This entry consists of several hypothetical Enterobacterial proteins of around 90 residues in length. Some of the proteins are annotated as ydgH precursors and contain two copies of this region, one at the N terminus and the other at the C terminus. The function of this family is unknown.; PDB: 2NOC_A 2JNA_B 4EVU_B.
Probab=32.01 E-value=54 Score=21.15 Aligned_cols=20 Identities=20% Similarity=0.400 Sum_probs=16.3
Q ss_pred CEEEEEee-cCHHHHHHHHhc
Q 048021 47 GMAYITGR-ANPRKLLRKLQS 66 (143)
Q Consensus 47 ~~VtV~G~-vdp~~Ll~aI~k 66 (143)
++|+|+|. .+|+.+.++|.+
T Consensus 6 G~Isvs~~~~s~~d~~~~la~ 26 (56)
T PF07338_consen 6 GTISVSGNFGSPDDAEEALAK 26 (56)
T ss_dssp EEEEEEEECSSHHHHHHHHHH
T ss_pred EEEEEccccCCHHHHHHHHHH
Confidence 68999998 788888888865
No 198
>TIGR03341 YhgI_GntY IscR-regulated protein YhgI. IscR (TIGR02010) is an iron-sulfur cluster-binding transcriptional regulator (see Genome Property GenProp0138). Members of this protein family include YhgI, whose expression is under control of IscR, and show sequence similarity to IscA, a known protein of iron-sulfur cluster biosynthesis. These two lines of evidence strongly suggest a role as an iron-sulfur cluster biosynthesis protein. An older study designated this protein GntY and suggested a role for it and for the product of an adjacent gene, based on complementation studies, in gluconate utilization.
Probab=31.86 E-value=80 Score=25.21 Aligned_cols=27 Identities=11% Similarity=0.025 Sum_probs=18.8
Q ss_pred EEEEEEeecChhhH-------HHHHHHHhc-cCCe
Q 048021 12 TCGLKVDTNDPAWF-------SSMTKVLRK-IKGA 38 (143)
Q Consensus 12 tvvlkV~m~C~gC~-------~kIkkaL~k-I~GV 38 (143)
++.++..+.|.||. ..|++.|+. ++.|
T Consensus 139 ~v~v~l~GaC~gC~~s~~Tl~~~ie~~l~~~~p~v 173 (190)
T TIGR03341 139 VAVLQFGGGCNGCSMVDVTLKDGVEKTLLERFPEL 173 (190)
T ss_pred EEEEEEeecCCCCcchHHHHHHHHHHHHHHhCCCc
Confidence 55667778999996 356666643 6666
No 199
>TIGR03131 malonate_mdcH malonate decarboxylase, epsilon subunit. Members of this protein family are the epsilon subunit of malonate decarboxylase. This subunit has malonyl-CoA/dephospho-CoA acyltransferase activity. Malonate decarboxylase may be a soluble enzyme, or linked to membrane subunits and active as a sodium pump. The epsilon subunit is closely related to the malonyl CoA-acyl carrier protein (ACP) transacylase family described by TIGR00128, but acts on an ACP subunit of malonate decarboxylase that has an unusual coenzyme A derivative as its prothetic group.
Probab=31.73 E-value=1.5e+02 Score=24.12 Aligned_cols=52 Identities=17% Similarity=0.155 Sum_probs=38.3
Q ss_pred hHHHHHHHHhccCCeeEEEeCCCCEEEEEeec-CHHHHHHHHhc-CC-ceEEeecC
Q 048021 24 WFSSMTKVLRKIKGASYTIDAEEGMAYITGRA-NPRKLLRKLQS-GK-YANLCWVS 76 (143)
Q Consensus 24 C~~kIkkaL~kI~GVsV~VDle~~~VtV~G~v-dp~~Ll~aI~k-Gk-~Ael~~~~ 76 (143)
....|++.|.+. +|.|......+.++|.|.. +-+.+.+.+++ |. ++..+.+.
T Consensus 130 ~~~~~~~~l~~~-~v~ia~~Nsp~~~visG~~~~l~~l~~~l~~~g~~~~~~l~v~ 184 (295)
T TIGR03131 130 DLAAVEALIAKH-GVYLAIINAPDQVVIAGSRAALRAVAELARAAGASRAKRLAVR 184 (295)
T ss_pred CHHHHHHHHHHc-CEEEEEEcCCCCEEEECCHHHHHHHHHHHHhcCCceEEECCCC
Confidence 356788888765 4777777778899999964 67788888887 76 77776654
No 200
>PRK09579 multidrug efflux protein; Reviewed
Probab=31.64 E-value=92 Score=30.75 Aligned_cols=42 Identities=7% Similarity=-0.038 Sum_probs=30.6
Q ss_pred HHHHHHHHhccCCe-eEEEeCC-CCEEEEEe--------ecCHHHHHHHHhc
Q 048021 25 FSSMTKVLRKIKGA-SYTIDAE-EGMAYITG--------RANPRKLLRKLQS 66 (143)
Q Consensus 25 ~~kIkkaL~kI~GV-sV~VDle-~~~VtV~G--------~vdp~~Ll~aI~k 66 (143)
++.|+..|++++|| +|++.-. +.++.|.= .+++++|.++|+.
T Consensus 158 ~~~i~~~L~~i~GV~~V~~~G~~~~ei~V~vd~~kl~~~gls~~dV~~al~~ 209 (1017)
T PRK09579 158 SRVIQPKLATLPGMAEAEILGNQVFAMRLWLDPVKLAGFGLSAGDVTQAVRR 209 (1017)
T ss_pred HHHHHHHhhcCCCceEEEecCCCceEEEEEeCHHHHHHcCCCHHHHHHHHHH
Confidence 46799999999999 7887554 44555541 2567888888876
No 201
>TIGR02194 GlrX_NrdH Glutaredoxin-like protein NrdH. NrdH-redoxin is a representative of a class of small redox proteins that contain a conserved CXXC motif and are characterized by a glutaredoxin-like amino acid sequence and thioredoxin-like activity profile. Unlike other the glutaredoxins to which it is most closely related, NrdH aparrently does not interact with glutathione/glutathione reductase, but rather with thioredoxin reductase to catalyze the reduction of ribonucleotide reductase.
Probab=31.57 E-value=70 Score=20.53 Aligned_cols=25 Identities=20% Similarity=0.197 Sum_probs=17.1
Q ss_pred ecChhhHHHHHHHHhccCCee-EEEeCC
Q 048021 19 TNDPAWFSSMTKVLRKIKGAS-YTIDAE 45 (143)
Q Consensus 19 m~C~gC~~kIkkaL~kI~GVs-V~VDle 45 (143)
-.|..|. ++++.|.+ .||. -.+|++
T Consensus 7 ~~Cp~C~-~ak~~L~~-~~i~~~~~di~ 32 (72)
T TIGR02194 7 NNCVQCK-MTKKALEE-HGIAFEEINID 32 (72)
T ss_pred CCCHHHH-HHHHHHHH-CCCceEEEECC
Confidence 3799998 67788877 4773 344444
No 202
>TIGR02816 pfaB_fam PfaB family protein. The protein PfaB is part of four gene locus, similar to polyketide biosynthesis systems, responsible for omega-3 polyunsaturated fatty acid biosynthesis in several high pressure and/or cold-adapted bacteria. The fairly permissive trusted cutoff set for this model allows detection of homologs encoded near homologs to other proteins of the locus: PfaA, PfaC, and/or PfaD. The likely role in every case is either polyunsaturated fatty acid or polyketide biosynthesis.
Probab=31.55 E-value=89 Score=29.00 Aligned_cols=50 Identities=22% Similarity=0.204 Sum_probs=38.1
Q ss_pred HHHHHHHHhccCCeeEEEeCCCCEEEEEeec-CHHHHHHHHhc-CCceEEeec
Q 048021 25 FSSMTKVLRKIKGASYTIDAEEGMAYITGRA-NPRKLLRKLQS-GKYANLCWV 75 (143)
Q Consensus 25 ~~kIkkaL~kI~GVsV~VDle~~~VtV~G~v-dp~~Ll~aI~k-Gk~Ael~~~ 75 (143)
...|+..|...++|.+.+.-. +.++|.|.. .-++|++++++ |..+..+.+
T Consensus 337 ~~~V~~~L~~~~~V~IAaiN~-~q~VISG~~~Ai~~l~~~L~~~Gi~~r~L~a 388 (538)
T TIGR02816 337 AAPIEALLKDFPHAYLAIIQG-DTCVIAGCEAQCKALLAALGKRGIAANRVTA 388 (538)
T ss_pred HHHHHHHhccCCCeEEEEeCC-CCeEeeCCHHHHHHHHHHHHhCCeeeeeccc
Confidence 577888888877787555444 688899864 67888888888 988877644
No 203
>PF02863 Arg_repressor_C: Arginine repressor, C-terminal domain; InterPro: IPR020899 The arginine dihydrolase (AD) pathway is found in many prokaryotes and some primitive eukaryotes, an example of the latter being Giardia lamblia (Giardia intestinalis) []. The three-enzyme anaerobic pathway breaks down L-arginine to form 1 mol of ATP, carbon dioxide and ammonia. In simpler bacteria, the first enzyme, arginine deiminase, can account for up to 10% of total cell protein []. Most prokaryotic arginine deiminase pathways are under the control of a repressor gene, termed ArgR []. This is a negative regulator, and will only release the arginine deiminase operon for expression in the presence of arginine []. The crystal structure of apo-ArgR from Bacillus stearothermophilus has been determined to 2.5A by means of X-ray crystallography []. The protein exists as a hexamer of identical subunits, and is shown to have six DNA-binding domains, clustered around a central oligomeric core when bound to arginine. It predominantly interacts with A.T residues in ARG boxes. This hexameric protein binds DNA at its N terminus to repress arginine biosyntheis or activate arginine catabolism. Some species have several ArgR paralogs. In a neighbour-joining tree, some of these paralogous sequences show long branches and differ significantly from the well-conserved C-terminal region. ; GO: 0006355 regulation of transcription, DNA-dependent; PDB: 1B4B_B 1B4A_A 3V4G_A 1F9N_F 2P5M_A 1XXA_E 1XXC_C 1XXB_F 3LAJ_D 3BUE_D ....
Probab=31.29 E-value=1.1e+02 Score=20.25 Aligned_cols=44 Identities=9% Similarity=0.154 Sum_probs=29.8
Q ss_pred ChhhHHHHHHHHhc--cCCe-eEEEeCCCCEEEEEe--ecCHHHHHHHHhc
Q 048021 21 DPAWFSSMTKVLRK--IKGA-SYTIDAEEGMAYITG--RANPRKLLRKLQS 66 (143)
Q Consensus 21 C~gC~~kIkkaL~k--I~GV-sV~VDle~~~VtV~G--~vdp~~Ll~aI~k 66 (143)
=+|+++.|..+|.+ +++| ..-.+ +++|-|-. ..+.+.|.+.|++
T Consensus 21 ~pG~A~~va~~iD~~~~~~I~GtIAg--dDTilvi~~~~~~a~~l~~~l~~ 69 (70)
T PF02863_consen 21 LPGNAQAVAAAIDQLNLPEIFGTIAG--DDTILVICRSEEDAEELEEKLKE 69 (70)
T ss_dssp STTCHHHHHHHHHHHCGTTEEEEEEE--SSEEEEEESTTSHHHHHHHHHHT
T ss_pred CCCcHHHHHHHHHhcCCcccEEEEeC--CCEEEEEeCCHHHHHHHHHHHHh
Confidence 38899999999875 4666 33333 45666654 4578888888764
No 204
>PF00352 TBP: Transcription factor TFIID (or TATA-binding protein, TBP); InterPro: IPR000814 The TATA-box binding protein (TBP) is required for the initiation of transcription by RNA polymerases I, II and III, from promoters with or without a TATA box [, ]. TBP associates with a host of factors, including the general transcription factors TFIIA, -B, -D, -E, and -H, to form huge multi-subunit pre-initiation complexes on the core promoter. Through its association with different transcription factors, TBP can initiate transcription from different RNA polymerases. There are several related TBPs, including TBP-like (TBPL) proteins []. The C-terminal core of TBP (~180 residues) is highly conserved and contains two 77-amino acid repeats that produce a saddle-shaped structure that straddles the DNA; this region binds to the TATA box and interacts with transcription factors and regulatory proteins []. By contrast, the N-terminal region varies in both length and sequence.; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent, 0006367 transcription initiation from RNA polymerase II promoter; PDB: 1D3U_A 1PCZ_B 1AIS_A 1NGM_A 1TBP_A 1TBA_B 1YTB_A 1RM1_A 1YTF_A 1NH2_A ....
Probab=31.28 E-value=72 Score=21.78 Aligned_cols=22 Identities=18% Similarity=0.326 Sum_probs=17.9
Q ss_pred CCCEEEEEeecCHHHHHHHHhc
Q 048021 45 EEGMAYITGRANPRKLLRKLQS 66 (143)
Q Consensus 45 e~~~VtV~G~vdp~~Ll~aI~k 66 (143)
.+|+|+|.|.-+++.+.+++++
T Consensus 56 ~sGki~itGaks~~~~~~a~~~ 77 (86)
T PF00352_consen 56 SSGKIVITGAKSEEEAKKAIEK 77 (86)
T ss_dssp TTSEEEEEEESSHHHHHHHHHH
T ss_pred cCCEEEEEecCCHHHHHHHHHH
Confidence 4799999998888888877764
No 205
>PF08534 Redoxin: Redoxin; InterPro: IPR013740 This redoxin domain is found in peroxiredoxin, thioredoxin and glutaredoxin proteins. Peroxiredoxins (Prxs) constitute a family of thiol peroxidases that reduce hydrogen peroxide, peroxinitrite, and hydroperoxides using a strictly conserved cysteine []. Chloroplast thioredoxin systems in plants regulate the enzymes involved in photosynthetic carbon assimilation []. It is thought that redoxins have a large role to play in anti-oxidant defence. Cadmium-sensitive proteins are also regulated via thioredoxin and glutaredoxin thiol redox systems [].; GO: 0016491 oxidoreductase activity; PDB: 2H30_A 1TP9_A 1Y25_A 1XVQ_A 2B1K_A 2G0F_A 2B1L_B 3K8N_A 1Z5Y_E 3OR5_A ....
Probab=31.25 E-value=1.6e+02 Score=20.94 Aligned_cols=53 Identities=13% Similarity=0.108 Sum_probs=30.7
Q ss_pred EEEEEEee-cChhhHHHHHHHHhcc------CCeeEEEeCCCCEEEEEeecCHHHHHHHHhc-CCceEEee
Q 048021 12 TCGLKVDT-NDPAWFSSMTKVLRKI------KGASYTIDAEEGMAYITGRANPRKLLRKLQS-GKYANLCW 74 (143)
Q Consensus 12 tvvlkV~m-~C~gC~~kIkkaL~kI------~GVsV~VDle~~~VtV~G~vdp~~Ll~aI~k-Gk~Ael~~ 74 (143)
+++.-... .|+-|...+. .|.++ +||. .|.|...-++. +.+.+++ |....++.
T Consensus 31 ~vv~f~~~~~Cp~C~~~~p-~l~~l~~~~~~~~v~--------~v~v~~~~~~~-~~~~~~~~~~~~~~~~ 91 (146)
T PF08534_consen 31 VVVNFWASAWCPPCRKELP-YLNELQEKYKDKGVD--------VVGVSSDDDPP-VREFLKKYGINFPVLS 91 (146)
T ss_dssp EEEEEESTTTSHHHHHHHH-HHHHHHHHHHTTTCE--------EEEEEESSSHH-HHHHHHHTTTTSEEEE
T ss_pred EEEEEEccCCCCcchhhhh-hHHhhhhhhccCceE--------EEEecccCCHH-HHHHHHhhCCCceEEe
Confidence 33334455 8999998874 34433 3442 22333334444 8888888 77766544
No 206
>cd04902 ACT_3PGDH-xct C-terminal ACT (regulatory) domain of D-3-phosphoglycerate dehydrogenase (3PGDH). The C-terminal ACT (regulatory) domain of D-3-phosphoglycerate dehydrogenase (3PGDH), with an extended C-terminal (xct) region from bacteria, archaea, fungi, and plants. 3PGDH is an enzyme that belongs to the D-isomer specific, 2-hydroxyacid dehydrogenase family and catalyzes the oxidation of D-3-phosphoglycerate to 3- phosphohydroxypyruvate, which is the first step in the biosynthesis of L-serine, using NAD+ as the oxidizing agent. In bacteria, 3PGDH is feedback-controlled by the end product L-serine in an allosteric manner. Some 3PGDH enzymes have an additional domain formed by an extended C-terminal region. This additional domain introduces significant asymmetry to the homotetramer. Adjacent ACT (regulatory) domains interact, creating two serine-binding sites, however, this asymmetric arrangement results in the formation of two different and distinct domain interfaces between iden
Probab=31.20 E-value=97 Score=19.24 Aligned_cols=10 Identities=30% Similarity=0.750 Sum_probs=6.3
Q ss_pred HHHHHHHHhc
Q 048021 57 PRKLLRKLQS 66 (143)
Q Consensus 57 p~~Ll~aI~k 66 (143)
+..++++|++
T Consensus 52 ~~~~~~~l~~ 61 (73)
T cd04902 52 PDEVLEELRA 61 (73)
T ss_pred CHHHHHHHHc
Confidence 4466666666
No 207
>TIGR02159 PA_CoA_Oxy4 phenylacetate-CoA oxygenase, PaaJ subunit. Phenylacetate-CoA oxygenase is comprised of a five gene complex responsible for the hydroxylation of phenylacetate-CoA (PA-CoA) as the second catabolic step in phenylacetic acid (PA) degradation. Although the exact function of this enzyme has not been determined, it has been shown to be required for phenylacetic acid degradation and has been proposed to function in a multicomponent oxygenase acting on phenylacetate-CoA.
Probab=31.19 E-value=75 Score=24.43 Aligned_cols=31 Identities=10% Similarity=0.165 Sum_probs=20.0
Q ss_pred EEEEEEeecChhhH------HHHHHHHhccCCe-eEEEe
Q 048021 12 TCGLKVDTNDPAWF------SSMTKVLRKIKGA-SYTID 43 (143)
Q Consensus 12 tvvlkV~m~C~gC~------~kIkkaL~kI~GV-sV~VD 43 (143)
++.+.|.++-.+|. +.|+++|+.+ |+ +|+|+
T Consensus 26 ~V~VtIt~Ty~gcpa~e~L~~~I~~aL~~~-Gv~~V~V~ 63 (146)
T TIGR02159 26 GVVVKFTPTYSGCPALEVIRQDIRDAVRAL-GVEVVEVS 63 (146)
T ss_pred EEEEEEEeCCCCCchHHHHHHHHHHHHHhc-CCCeEEEe
Confidence 45666666666665 4488888876 77 45443
No 208
>COG0841 AcrB Cation/multidrug efflux pump [Defense mechanisms]
Probab=31.02 E-value=88 Score=31.22 Aligned_cols=48 Identities=15% Similarity=0.081 Sum_probs=36.7
Q ss_pred hhHHHHHHHHhccCCe-eEEEeCC-CCEEEEEe--------ecCHHHHHHHHhc-CCce
Q 048021 23 AWFSSMTKVLRKIKGA-SYTIDAE-EGMAYITG--------RANPRKLLRKLQS-GKYA 70 (143)
Q Consensus 23 gC~~kIkkaL~kI~GV-sV~VDle-~~~VtV~G--------~vdp~~Ll~aI~k-Gk~A 70 (143)
--++.|+..|++++|| +|++--. +..|.|.= .++++.|..+|++ ...+
T Consensus 155 ~~~~~l~~~L~~v~GV~~V~~~G~~~~~~rI~ldp~kLa~~gLt~~dV~~ai~~qN~~~ 213 (1009)
T COG0841 155 YAASNVRDELSRVPGVGSVQLFGAQEYAMRIWLDPAKLAAYGLTPSDVQSAIRAQNVQV 213 (1009)
T ss_pred HHHHHHHHHHhcCCCceEEEEcCCCceeEEEEeCHHHHHHcCCCHHHHHHHHHHhCccc
Confidence 3567799999999999 8998776 67777762 2578888899987 4443
No 209
>cd06464 ACD_sHsps-like Alpha-crystallin domain (ACD) of alpha-crystallin-type small(s) heat shock proteins (Hsps). sHsps are small stress induced proteins with monomeric masses between 12 -43 kDa, whose common feature is the Alpha-crystallin domain (ACD). sHsps are generally active as large oligomers consisting of multiple subunits, and are believed to be ATP-independent chaperones that prevent aggregation and are important in refolding in combination with other Hsps.
Probab=30.56 E-value=46 Score=21.47 Aligned_cols=23 Identities=22% Similarity=0.537 Sum_probs=18.9
Q ss_pred ccCCe---eEEEeCCCCEEEEEeecC
Q 048021 34 KIKGA---SYTIDAEEGMAYITGRAN 56 (143)
Q Consensus 34 kI~GV---sV~VDle~~~VtV~G~vd 56 (143)
.++|+ +++|...++.+.|+|...
T Consensus 14 ~lpg~~~~~i~V~v~~~~l~I~g~~~ 39 (88)
T cd06464 14 DLPGFKKEDIKVEVEDGVLTISGERE 39 (88)
T ss_pred ECCCCCHHHeEEEEECCEEEEEEEEe
Confidence 56888 389999999999999754
No 210
>PF00873 ACR_tran: AcrB/AcrD/AcrF family; InterPro: IPR001036 The Escherichia coli acrA and acrB genes encode a multi-drug efflux system that is believed to protect the bacterium against hydrophobic inhibitors []. The E. coli AcrB protein is a transporter that is energized by proton-motive force and that shows the widest substrate specificity among all known multidrug pumps, ranging from most of the currently used antibiotics, disinfectants, dyes, and detergents to simple solvents. The structure of ligand-free AcrB shows that it is a homotrimer of 110kDa per subunit. Each subunit contains 12 transmembrane helices and two large periplasmic domains (each exceeding 300 residues) between helices 1 and 2, and helices 7 and 8. X-ray analysis of the overexpressed AcrB protein demonstrated that the three periplasmic domains form, in the centre, a funnel-like structure and a connected narrow (or closed) pore. The pore is opened to the periplasm through three vestibules located at subunit interfaces. These vestibules were proposed to allow direct access of drugs from the periplasm as well as the outer leaflet of the cytoplasmic membrane. The three transmembrane domains of AcrB protomers form a large, 30A-wide central cavity that spans the cytoplasmic membrane and extends to the cytoplasm X-ray crystallographic structures of the trimeric AcrB pump from E. coli with four structurally diverse ligands demonstrated that three molecules of ligand bind simultaneously to the extremely large central cavity of 5000 cubic angstroms, primarily by hydrophobic, aromatic stacking and van der Waals interactions. Each ligand uses a slightly different subset of AcrB residues for binding. The bound ligand molecules often interact with each other, stabilising the binding. ; GO: 0005215 transporter activity, 0006810 transport, 0016020 membrane; PDB: 2V50_B 1T9U_A 2HRT_B 3NOC_A 3NOG_A 4DX7_A 1OYD_A 3AOB_A 1T9V_A 4DX6_B ....
Probab=30.37 E-value=53 Score=32.01 Aligned_cols=51 Identities=16% Similarity=0.068 Sum_probs=35.9
Q ss_pred ChhhHHHHHHHHhccCCe-eEEEeCC-CCEEEEEe--------ecCHHHHHHHHhc-CCceE
Q 048021 21 DPAWFSSMTKVLRKIKGA-SYTIDAE-EGMAYITG--------RANPRKLLRKLQS-GKYAN 71 (143)
Q Consensus 21 C~gC~~kIkkaL~kI~GV-sV~VDle-~~~VtV~G--------~vdp~~Ll~aI~k-Gk~Ae 71 (143)
..-..+.|+..|++++|| +|++.-. +.++.|.= .+++++|.++|++ .....
T Consensus 154 ~~~~~~~l~~~L~~i~gV~~v~~~G~~~~ei~i~~d~~kl~~~gls~~~v~~~l~~~n~~~~ 215 (1021)
T PF00873_consen 154 RDYAEEQLKPRLERIPGVARVDISGGREREIQIELDPEKLAAYGLSLSDVAQALQANNVNQP 215 (1021)
T ss_dssp HHHHHHCTHHHHHTSTTEEEEEESSS--EEEEEEE-HHHHHHTT--HHHHHHHHHHHSCEEE
T ss_pred HHHHHHHHHHhccceeEEEEEEeccchhhhhhheechhhhhhhCCCHHHHHHHHHHhhhhcc
Confidence 344667899999999999 7998763 56666662 2578888899988 55543
No 211
>PF01058 Oxidored_q6: NADH ubiquinone oxidoreductase, 20 Kd subunit; InterPro: IPR006137 NADH:ubiquinone oxidoreductase (complex I) (1.6.5.3 from EC) is a respiratory-chain enzyme that catalyses the transfer of two electrons from NADH to ubiquinone in a reaction that is associated with proton translocation across the membrane (NADH + ubiquinone = NAD+ + ubiquinol) []. Complex I is a major source of reactive oxygen species (ROS) that are predominantly formed by electron transfer from FMNH(2). Complex I is found in bacteria, cyanobacteria (as a NADH-plastoquinone oxidoreductase), archaea [], mitochondira, and in the hydrogenosome, a mitochondria-derived organelle. In general, the bacterial complex consists of 14 different subunits, while the mitochondrial complex contains homologues to these subunits in addition to approximately 31 additional proteins []. Mitochondrial complex I, which is located in the inner mitochondrial membrane, is the largest multimeric respiratory enzyme in the mitochondria, consisting of more than 40 subunits, one FMN co-factor and eight FeS clusters []. The assembly of mitochondrial complex I is an intricate process that requires the cooperation of the nuclear and mitochondrial genomes [, ]. Mitochondrial complex I can cycle between active and deactive forms that can be distinguished by the reactivity towards divalent cations and thiol-reactive agents. All redox prosthetic groups reside in the peripheral arm of the L-shaped structure. The NADH oxidation domain harbouring the FMN cofactor is connected via a chain of iron-sulphur clusters to the ubiquinone reduction site that is located in a large pocket formed by the PSST and 49kDa subunits of complex I []. Among the many polypeptide subunits that make up complex I, there is one with a molecular weight of 20 kDa (in mammals) [], which is a component of the iron-sulphur (IP) fragment of the enzyme. It seems to bind a 4Fe-4S iron-sulphur cluster. The 20 kDa subunit has been found to be nuclear encoded, as a precursor form with a transit peptide in mammals, and in Neurospora crassa. It is and chloroplast encoded in various higher plants (gene ndhK or psbG).; GO: 0008137 NADH dehydrogenase (ubiquinone) activity, 0048038 quinone binding, 0051539 4 iron, 4 sulfur cluster binding, 0055114 oxidation-reduction process; PDB: 3MYR_E 3RGW_S 2FUG_F 3IAS_6 3I9V_F 3IAM_F 2YBB_6 3M9S_F 2FRV_G 1YQ9_B ....
Probab=30.35 E-value=10 Score=28.15 Aligned_cols=32 Identities=22% Similarity=0.229 Sum_probs=19.6
Q ss_pred CCCEEEEEeec--CHHHHHHHHhc--CCceEEeecC
Q 048021 45 EEGMAYITGRA--NPRKLLRKLQS--GKYANLCWVS 76 (143)
Q Consensus 45 e~~~VtV~G~v--dp~~Ll~aI~k--Gk~Ael~~~~ 76 (143)
+...+.|+|.+ .....++.|++ .+.--++.++
T Consensus 45 ~~diliVeG~v~~~~~~~~e~~~~~~~~a~~vIAvG 80 (131)
T PF01058_consen 45 EADILIVEGSVPRNMEEALEWLKELRPKAKAVIAVG 80 (131)
T ss_dssp TTEEEEEESBEETGGEEHHHHHHHHHGCSSEEEEEH
T ss_pred CceEEEEEeeccCCchHHHHHHHHHccCCceeEcCC
Confidence 55678889987 55666666666 3333455553
No 212
>TIGR00668 apaH bis(5'-nucleosyl)-tetraphosphatase (symmetrical). Alternate names include diadenosine-tetraphosphatase and Ap4A hydrolase.
Probab=30.21 E-value=1.1e+02 Score=25.99 Aligned_cols=44 Identities=18% Similarity=0.442 Sum_probs=34.2
Q ss_pred hhhHHHHHHHHhccCCeeEEEeCCCCEEEEEeec-----CHHHHHHHHhc-CCce
Q 048021 22 PAWFSSMTKVLRKIKGASYTIDAEEGMAYITGRA-----NPRKLLRKLQS-GKYA 70 (143)
Q Consensus 22 ~gC~~kIkkaL~kI~GVsV~VDle~~~VtV~G~v-----dp~~Ll~aI~k-Gk~A 70 (143)
.||....++.|++|. ++.+..++.+.|.+ ++.++++.|.+ +-.+
T Consensus 10 HGc~daL~~LL~~i~-----f~~~~D~l~~lGDlVdRGP~slevL~~l~~l~~~~ 59 (279)
T TIGR00668 10 HGCYDELQALLERVE-----FDPGQDTLWLTGDLVARGPGSLEVLRYVKSLGDAV 59 (279)
T ss_pred cCCHHHHHHHHHHhC-----cCCCCCEEEEeCCccCCCCCHHHHHHHHHhcCCCe
Confidence 689999999998873 44556678888863 68889999988 7654
No 213
>COG0277 GlcD FAD/FMN-containing dehydrogenases [Energy production and conversion]
Probab=30.12 E-value=44 Score=28.61 Aligned_cols=50 Identities=18% Similarity=0.246 Sum_probs=35.0
Q ss_pred ChhhHHHHHHHHhccCCe-eE-EEeCCCCEEEEEeecCHHHHHHHHhc-CCceE
Q 048021 21 DPAWFSSMTKVLRKIKGA-SY-TIDAEEGMAYITGRANPRKLLRKLQS-GKYAN 71 (143)
Q Consensus 21 C~gC~~kIkkaL~kI~GV-sV-~VDle~~~VtV~G~vdp~~Ll~aI~k-Gk~Ae 71 (143)
+++|... ...|-.+... .| +||.++++|+|..-+.-.+|.+++++ |+..-
T Consensus 68 ~g~~~~~-~gvvl~l~~mn~i~~id~~~~~~~v~aGv~l~~l~~~l~~~G~~~p 120 (459)
T COG0277 68 SGGAVPD-GGVVLDLSRLNRILEIDPEDGTATVQAGVTLEDLEKALAPHGLFLP 120 (459)
T ss_pred cccccCC-CcEEEEchhhcchhccCcCCCEEEEcCCccHHHHHHHHHHcCCccC
Confidence 4444433 2333344444 23 79999999999988999999999998 87644
No 214
>cd03035 ArsC_Yffb Arsenate Reductase (ArsC) family, Yffb subfamily; Yffb is an uncharacterized bacterial protein encoded by the yffb gene, related to the thioredoxin-fold arsenic reductases, ArsC. The structure of Yffb and the conservation of the catalytic cysteine suggest that it is likely to function as a glutathione (GSH)-dependent thiol reductase. ArsC catalyzes the reduction of arsenate [As(V)] to arsenite [As(III)], using reducing equivalents derived from GSH via glutaredoxin, through a single catalytic cysteine.
Probab=30.08 E-value=1.3e+02 Score=21.34 Aligned_cols=42 Identities=21% Similarity=0.128 Sum_probs=27.1
Q ss_pred eecChhhHHHHHHHHhccCCee-EEEeCCCCEEEEEeecCHHHHHHHHhc-C
Q 048021 18 DTNDPAWFSSMTKVLRKIKGAS-YTIDAEEGMAYITGRANPRKLLRKLQS-G 67 (143)
Q Consensus 18 ~m~C~gC~~kIkkaL~kI~GVs-V~VDle~~~VtV~G~vdp~~Ll~aI~k-G 67 (143)
.-.|..|.+ .++.|.. .|+. -.+|..+ -..+.++|.+.+++ |
T Consensus 6 ~~~C~~crk-a~~~L~~-~~i~~~~~di~~------~p~s~~eL~~~l~~~g 49 (105)
T cd03035 6 IKNCDTVKK-ARKWLEA-RGVAYTFHDYRK------DGLDAATLERWLAKVG 49 (105)
T ss_pred CCCCHHHHH-HHHHHHH-cCCCeEEEeccc------CCCCHHHHHHHHHHhC
Confidence 348999984 4555655 4774 3344433 23677888888887 8
No 215
>PRK15078 polysaccharide export protein Wza; Provisional
Probab=30.01 E-value=96 Score=27.23 Aligned_cols=56 Identities=23% Similarity=0.338 Sum_probs=39.6
Q ss_pred EEEEe-ecChhhHHHHHHHHhcc-CCeeEEEeCC---CCEEEEEeec---------C-HHHHHHHHhc--CCc
Q 048021 14 GLKVD-TNDPAWFSSMTKVLRKI-KGASYTIDAE---EGMAYITGRA---------N-PRKLLRKLQS--GKY 69 (143)
Q Consensus 14 vlkV~-m~C~gC~~kIkkaL~kI-~GVsV~VDle---~~~VtV~G~v---------d-p~~Ll~aI~k--Gk~ 69 (143)
.++|. ++-+.=++.|++.|++. ..-.|.|.+. .++|+|.|++ + +..|+++|.. |..
T Consensus 133 ~V~vaG~T~~e~~~~I~~~L~~~~~~PqV~V~v~~~~s~~V~V~GeV~~PG~~~l~~~~~tlldaIa~AGG~~ 205 (379)
T PRK15078 133 KVHVAGKTVTEIRSDITGRLAKYIESPQVDVNIAAFRSQKAYVTGEVNKSGQQAITNVPLTILDAINAAGGLT 205 (379)
T ss_pred eEEECCCCHHHHHHHHHHHHHHhccCCeEEEEEccCCceEEEEEceecCCeEEEecCCCccHHHHHHHccCCC
Confidence 45665 48899999999999874 2213444443 5689999864 2 5799999987 654
No 216
>PRK07451 translation initiation factor Sui1; Validated
Probab=29.88 E-value=98 Score=23.21 Aligned_cols=28 Identities=25% Similarity=0.374 Sum_probs=17.0
Q ss_pred CCCEEEEEeecCHHHHHHHHhc-CCceEEe
Q 048021 45 EEGMAYITGRANPRKLLRKLQS-GKYANLC 73 (143)
Q Consensus 45 e~~~VtV~G~vdp~~Ll~aI~k-Gk~Ael~ 73 (143)
+++.+.|.|+ -.++|.+.|.+ ||++.++
T Consensus 85 kd~~IelQGD-~r~~v~~~L~~~Gf~~k~~ 113 (115)
T PRK07451 85 KDNTIEIQGD-HRQKILEILIKLGYKAKIS 113 (115)
T ss_pred cCCEEEEcCc-HHHHHHHHHHHCCCeEeec
Confidence 4555666663 24567777777 7776654
No 217
>PRK10614 multidrug efflux system subunit MdtC; Provisional
Probab=29.85 E-value=98 Score=30.52 Aligned_cols=42 Identities=5% Similarity=0.127 Sum_probs=30.6
Q ss_pred HHHHHHHHhccCCe-eEEEeCC-CCEEEEEe--------ecCHHHHHHHHhc
Q 048021 25 FSSMTKVLRKIKGA-SYTIDAE-EGMAYITG--------RANPRKLLRKLQS 66 (143)
Q Consensus 25 ~~kIkkaL~kI~GV-sV~VDle-~~~VtV~G--------~vdp~~Ll~aI~k 66 (143)
++.|+..|++++|| +|++.-. +.++.|.= .+++++|.++|+.
T Consensus 159 ~~~l~~~L~~i~GV~~V~~~G~~~~ei~V~vd~~kl~~~gls~~dV~~al~~ 210 (1025)
T PRK10614 159 STQLAQTISQIDGVGDVDVGGSSLPAVRVGLNPQALFNQGVSLDDVRQAISN 210 (1025)
T ss_pred HHHHHHHhcCCCCceEEEecCCCceEEEEEeCHHHHHHcCCCHHHHHHHHHH
Confidence 36799999999999 7888654 34566651 2567777888876
No 218
>KOG2193 consensus IGF-II mRNA-binding protein IMP, contains RRM and KH domains [RNA processing and modification; General function prediction only]
Probab=29.79 E-value=43 Score=31.00 Aligned_cols=31 Identities=16% Similarity=0.253 Sum_probs=19.6
Q ss_pred CCCEEEEEeecC-----HHHHHHHHhcCCceEEeec
Q 048021 45 EEGMAYITGRAN-----PRKLLRKLQSGKYANLCWV 75 (143)
Q Consensus 45 e~~~VtV~G~vd-----p~~Ll~aI~kGk~Ael~~~ 75 (143)
.+.+++|+|.++ ...|.++|++-|.-.|+..
T Consensus 326 pERTItVkGsiEac~~AE~eImkKlre~yEnDl~a~ 361 (584)
T KOG2193|consen 326 PERTITVKGSIEACVQAEAEIMKKLRECYENDLAAM 361 (584)
T ss_pred ccceEEecccHHHHHHHHHHHHHHHHHHHhhhHHHh
Confidence 478999999653 4566677776444444443
No 219
>TIGR02515 IV_pilus_PilQ type IV pilus secretin (or competence protein) PilQ. A number of proteins homologous to PilQ are involved in type IV pilus formation, competence for transformation, type III secretion, and type II secretion (also called the main terminal branch of the general secretion pathway). Members of this family include PilQ itself, which is a component of the type IV pilus structure, from a number of species. In Haemophilus influenzae, the member of this family is associated with competence for transformation with exogenous DNA rather than with formation of a type IV pilus; the surface structure required for competence may be considered an unusual, incomplete type IV pilus structure.
Probab=29.77 E-value=1.5e+02 Score=26.01 Aligned_cols=74 Identities=23% Similarity=0.185 Sum_probs=44.2
Q ss_pred eEEEEEEeecChhhHHHHHHHHhcc---------CCeeEEEeCCCCEEEEEeecC----HHHHHHHHhc-CCc----eEE
Q 048021 11 MTCGLKVDTNDPAWFSSMTKVLRKI---------KGASYTIDAEEGMAYITGRAN----PRKLLRKLQS-GKY----ANL 72 (143)
Q Consensus 11 ~tvvlkV~m~C~gC~~kIkkaL~kI---------~GVsV~VDle~~~VtV~G~vd----p~~Ll~aI~k-Gk~----Ael 72 (143)
.+-+|+++- .-++.+.++|+.+ ++-.|.+|...+.++|.|..+ .+++++.|.+ -+. |.+
T Consensus 98 ~t~~f~l~y---~~a~~v~~~l~~~~~~~~~~l~~~~~v~~d~~~n~lvv~~~~~~~~~i~~~i~~lD~~~~QV~Iea~I 174 (418)
T TIGR02515 98 QTELIQLNY---AKASDIAKVLTGDDGGKGSLLSPRGSVTVDPRTNTLIVTDIPENLARIRKLIAELDIPVKQVLIEARI 174 (418)
T ss_pred EEEEEEEeC---CcHHHHHHHHhhcccccccccCCCeeEEEECCcCeEEEEeCHHHHHHHHHHHHHhCCCccEEEEEEEE
Confidence 344555542 2366777777764 222589999999999999743 3444444444 333 446
Q ss_pred eecCCCCCCcccccc
Q 048021 73 CWVSAGNQITYGNAY 87 (143)
Q Consensus 73 ~~~~~~~s~~~~~~~ 87 (143)
+.+...+....|-++
T Consensus 175 veV~~~~~~~lGv~w 189 (418)
T TIGR02515 175 VEATDSFARELGVRW 189 (418)
T ss_pred EEEecCchHhcCeee
Confidence 666665555555544
No 220
>KOG2872 consensus Uroporphyrinogen decarboxylase [Coenzyme transport and metabolism]
Probab=29.65 E-value=58 Score=28.76 Aligned_cols=40 Identities=18% Similarity=0.366 Sum_probs=28.2
Q ss_pred eEEEeCC------CCEEEEEeecCHH-----------HHHHHHhc-CCceEEeecCCC
Q 048021 39 SYTIDAE------EGMAYITGRANPR-----------KLLRKLQS-GKYANLCWVSAG 78 (143)
Q Consensus 39 sV~VDle------~~~VtV~G~vdp~-----------~Ll~aI~k-Gk~Ael~~~~~~ 78 (143)
+++||++ .++||+.|++||. .+.+.|++ |++--++-.+.|
T Consensus 279 DWTvdp~ear~~~g~~VtlQGNlDP~~ly~s~e~it~~v~~mv~~fG~~ryI~NLGHG 336 (359)
T KOG2872|consen 279 DWTVDPAEARRRVGNRVTLQGNLDPGVLYGSKEEITQLVKQMVKDFGKSRYIANLGHG 336 (359)
T ss_pred cccccHHHHHHhhCCceEEecCCChHHhcCCHHHHHHHHHHHHHHhCccceEEecCCC
Confidence 4788876 5789999999864 34556666 877666665544
No 221
>COG5583 Uncharacterized small protein [Function unknown]
Probab=29.57 E-value=80 Score=20.86 Aligned_cols=30 Identities=27% Similarity=0.356 Sum_probs=20.5
Q ss_pred hhhHHHHHHHHhccCCeeEEEeCCCCEEEE
Q 048021 22 PAWFSSMTKVLRKIKGASYTIDAEEGMAYI 51 (143)
Q Consensus 22 ~gC~~kIkkaL~kI~GVsV~VDle~~~VtV 51 (143)
+.-..+|+++|++|+=-+|+|-..++.|+=
T Consensus 8 ~~~~ekI~~~Le~lkyGsV~ItVhdgqViQ 37 (54)
T COG5583 8 PEVIEKIKKALEGLKYGSVTITVHDGQVIQ 37 (54)
T ss_pred hHHHHHHHHHHhhcccceEEEEEECCEEEE
Confidence 344568888887775336888777777653
No 222
>COG2761 FrnE Predicted dithiol-disulfide isomerase involved in polyketide biosynthesis [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=29.23 E-value=58 Score=27.08 Aligned_cols=35 Identities=20% Similarity=0.353 Sum_probs=29.0
Q ss_pred ccCCe-eEEEeCCCCEEEEEeecCHHHHHHHHhc--CCceE
Q 048021 34 KIKGA-SYTIDAEEGMAYITGRANPRKLLRKLQS--GKYAN 71 (143)
Q Consensus 34 kI~GV-sV~VDle~~~VtV~G~vdp~~Ll~aI~k--Gk~Ae 71 (143)
.|.|| .+.+ +++..|.|..+++.+..+|++ +..++
T Consensus 181 gI~gVP~fv~---d~~~~V~Gaq~~~v~~~al~~~~~~~~~ 218 (225)
T COG2761 181 GIRGVPTFVF---DGKYAVSGAQPYDVLEDALRQLLAEKAE 218 (225)
T ss_pred CCccCceEEE---cCcEeecCCCCHHHHHHHHHHHHhcccc
Confidence 46788 6666 789999999999999999998 66553
No 223
>cd03028 GRX_PICOT_like Glutaredoxin (GRX) family, PKC-interacting cousin of TRX (PICOT)-like subfamily; composed of PICOT and GRX-PICOT-like proteins. The non-PICOT members of this family contain only the GRX-like domain, whereas PICOT contains an N-terminal TRX-like domain followed by one to three GRX-like domains. It is interesting to note that PICOT from plants contain three repeats of the GRX-like domain, metazoan proteins (except for insect) have two repeats, while fungal sequences contain only one copy of the domain. PICOT is a protein that interacts with protein kinase C (PKC) theta, a calcium independent PKC isoform selectively expressed in skeletal muscle and T lymphocytes. PICOT inhibits the activation of c-Jun N-terminal kinase and the transcription factors, AP-1 and NF-kB, induced by PKC theta or T-cell activating stimuli. Both GRX and TRX domains of PICOT are required for its activity. Characterized non-PICOT members of this family include CXIP1, a CAX-interacting protein
Probab=29.21 E-value=1.8e+02 Score=19.60 Aligned_cols=39 Identities=15% Similarity=0.146 Sum_probs=23.9
Q ss_pred cChhhHHHHHHHHhccCCee-EEEeCCCCEEEEEeecCHHHHHHHHhc--CCce
Q 048021 20 NDPAWFSSMTKVLRKIKGAS-YTIDAEEGMAYITGRANPRKLLRKLQS--GKYA 70 (143)
Q Consensus 20 ~C~gC~~kIkkaL~kI~GVs-V~VDle~~~VtV~G~vdp~~Ll~aI~k--Gk~A 70 (143)
.|+-|. ++++.|.+. |+. ..+|..+ + ..+.+.|.+ |...
T Consensus 22 ~Cp~C~-~ak~~L~~~-~i~y~~idv~~---------~-~~~~~~l~~~~g~~t 63 (90)
T cd03028 22 RCGFSR-KVVQILNQL-GVDFGTFDILE---------D-EEVRQGLKEYSNWPT 63 (90)
T ss_pred CCcHHH-HHHHHHHHc-CCCeEEEEcCC---------C-HHHHHHHHHHhCCCC
Confidence 577776 778888774 673 4555443 2 346666666 6543
No 224
>PRK08343 secD preprotein translocase subunit SecD; Reviewed
Probab=29.15 E-value=78 Score=28.32 Aligned_cols=40 Identities=8% Similarity=0.147 Sum_probs=26.3
Q ss_pred HHHHHHHHhccCCe-eEEEeC-CCCEEEEE--eecCHHHHHHHHhc
Q 048021 25 FSSMTKVLRKIKGA-SYTIDA-EEGMAYIT--GRANPRKLLRKLQS 66 (143)
Q Consensus 25 ~~kIkkaL~kI~GV-sV~VDl-e~~~VtV~--G~vdp~~Ll~aI~k 66 (143)
...|++.|.. -|+ +..|.. .+++|.|+ | .|++++++.|.+
T Consensus 69 ~~vi~~Ri~~-~G~~e~~v~~~g~~~I~Ve~Pg-~d~~~~~~~i~~ 112 (417)
T PRK08343 69 VEILESRLNA-LGLKDIKVYPVGDQYIVVEVPG-VDADTAKEIIEK 112 (417)
T ss_pred HHHHHHHHhh-cCCCCCEEEEEcCceEEEEcCC-CCHHHHHHHhcC
Confidence 3444555544 377 444433 35688888 7 899998888887
No 225
>COG1941 FrhG Coenzyme F420-reducing hydrogenase, gamma subunit [Energy production and conversion]
Probab=29.07 E-value=82 Score=26.71 Aligned_cols=57 Identities=12% Similarity=0.150 Sum_probs=34.1
Q ss_pred cChhhHHHHHHH---HhccCC----e-e-EEEeCC----CCEEEEEeecCHHHHHHHHhc-CCceE-EeecC
Q 048021 20 NDPAWFSSMTKV---LRKIKG----A-S-YTIDAE----EGMAYITGRANPRKLLRKLQS-GKYAN-LCWVS 76 (143)
Q Consensus 20 ~C~gC~~kIkka---L~kI~G----V-s-V~VDle----~~~VtV~G~vdp~~Ll~aI~k-Gk~Ae-l~~~~ 76 (143)
.|.||..++... |..+-. | . +.+|.+ -..+.|+|.+.-+.=++.+++ ..+++ +++.+
T Consensus 13 gC~GC~~slldl~E~L~dll~~~div~~~~l~D~keiPEvDValVEGsV~~ee~lE~v~ElRekakivVA~G 84 (247)
T COG1941 13 GCSGCHMSLLDLYEKLLDLLEDADIVYCPTLVDEKEIPEVDVALVEGSVCDEEELELVKELREKAKIVVALG 84 (247)
T ss_pred cccchHHHHHhHHHHHHHhhhhhcEEEeecccccccCCcccEEEEecccCcHHHHHHHHHHHHhCcEEEEEe
Confidence 799999966543 333222 2 2 444554 367888998866666666666 55543 55554
No 226
>PRK07738 flagellar protein FlaG; Provisional
Probab=28.96 E-value=1.4e+02 Score=22.38 Aligned_cols=28 Identities=18% Similarity=0.401 Sum_probs=22.6
Q ss_pred eEEEeCCCCEEEEE------e----ecCHHHHHHHHhc
Q 048021 39 SYTIDAEEGMAYIT------G----RANPRKLLRKLQS 66 (143)
Q Consensus 39 sV~VDle~~~VtV~------G----~vdp~~Ll~aI~k 66 (143)
.+.+|-+.+++.|+ | .++|+++++.+++
T Consensus 67 ~F~vdeet~~~vVkVvD~~T~EVIRQIPpEe~L~l~~~ 104 (117)
T PRK07738 67 KFELHEKLNEYYVQVVDERTNEVIREIPPKKLLDMYAA 104 (117)
T ss_pred EEEEecCCCcEEEEEEECCCCeeeeeCCCHHHHHHHHH
Confidence 58999999998887 3 3689999888876
No 227
>COG2008 GLY1 Threonine aldolase [Amino acid transport and metabolism]
Probab=28.92 E-value=99 Score=27.32 Aligned_cols=47 Identities=15% Similarity=0.256 Sum_probs=35.9
Q ss_pred cChhhHHHHHHHHhccCCeeEEEeCCCCEEEEEeecCHHHHHHHHhc-CCc
Q 048021 20 NDPAWFSSMTKVLRKIKGASYTIDAEEGMAYITGRANPRKLLRKLQS-GKY 69 (143)
Q Consensus 20 ~C~gC~~kIkkaL~kI~GVsV~VDle~~~VtV~G~vdp~~Ll~aI~k-Gk~ 69 (143)
|-..|++++...|.+.+|+++....+.+.|.|. ++...+ +++++ |..
T Consensus 256 Han~mA~~La~~~~~~~G~~~~~~~~tN~vf~~--l~~~~i-~~l~~~~~~ 303 (342)
T COG2008 256 HANAMAARLAEGLEAKPGVKLAFPVETNMVFVR--LPESAI-EALRLAGAL 303 (342)
T ss_pred HHHHHHHHHHHhhhhcCCceeccCCcccEEEEE--CChHHH-HHHHhhchh
Confidence 556799999999999999998888999999987 443334 55555 443
No 228
>cd03029 GRX_hybridPRX5 Glutaredoxin (GRX) family, PRX5 hybrid subfamily; composed of hybrid proteins containing peroxiredoxin (PRX) and GRX domains, which is found in some pathogenic bacteria and cyanobacteria. PRXs are thiol-specific antioxidant (TSA) proteins that confer a protective antioxidant role in cells through their peroxidase activity in which hydrogen peroxide, peroxynitrate, and organic hydroperoxides are reduced and detoxified using reducing equivalents derived from either thioredoxin, glutathione, trypanothione and AhpF. GRX is a glutathione (GSH) dependent reductase, catalyzing the disulfide reduction of target proteins. PRX-GRX hybrid proteins from Haemophilus influenza and Neisseria meningitis exhibit GSH-dependent peroxidase activity. The flow of reducing equivalents in the catalytic cycle of the hybrid protein goes from NADPH - GSH reductase - GSH - GRX domain of hybrid - PRX domain of hybrid - peroxide substrate.
Probab=28.88 E-value=1.4e+02 Score=18.90 Aligned_cols=27 Identities=19% Similarity=0.173 Sum_probs=18.2
Q ss_pred eecChhhHHHHHHHHhccCCee-EEEeCCC
Q 048021 18 DTNDPAWFSSMTKVLRKIKGAS-YTIDAEE 46 (143)
Q Consensus 18 ~m~C~gC~~kIkkaL~kI~GVs-V~VDle~ 46 (143)
--.|..|. +++++|.. .||. ..+|+++
T Consensus 8 ~~~Cp~C~-~ak~~L~~-~~i~~~~~~v~~ 35 (72)
T cd03029 8 KPGCPFCA-RAKAALQE-NGISYEEIPLGK 35 (72)
T ss_pred CCCCHHHH-HHHHHHHH-cCCCcEEEECCC
Confidence 34899999 55888887 4774 4555543
No 229
>PF13383 Methyltransf_22: Methyltransferase domain
Probab=28.82 E-value=1e+02 Score=25.45 Aligned_cols=47 Identities=23% Similarity=0.157 Sum_probs=29.2
Q ss_pred ecChhhHHHHHHHHhccCCeeEEEeCCCCEEEEE--eec----CHHHHHHHHhc-CCceEEeecC
Q 048021 19 TNDPAWFSSMTKVLRKIKGASYTIDAEEGMAYIT--GRA----NPRKLLRKLQS-GKYANLCWVS 76 (143)
Q Consensus 19 m~C~gC~~kIkkaL~kI~GVsV~VDle~~~VtV~--G~v----dp~~Ll~aI~k-Gk~Ael~~~~ 76 (143)
|.|+||+=.+-+-|.+ .|+ +++.|+ +.. ...+++++|++ |++ +..++
T Consensus 173 iDIEG~Ew~~L~~~l~-~~~--------~Qi~iEiH~~~~~~~~~~~~l~~l~~~gfr--~F~~e 226 (242)
T PF13383_consen 173 IDIEGAEWTVLEPLLE-SGV--------CQILIEIHGWPSEHREWYKLLQELEKAGFR--LFNVE 226 (242)
T ss_pred EEcCccHHHHHHHHHh-cCC--------cEEEEEEEeCccchhHHHHHHHHHHHCCcE--EEEec
Confidence 6889999877766643 344 333333 211 14578999999 984 44443
No 230
>cd04884 ACT_CBS C-terminal ACT domain of the cystathionine beta-synthase (CBS) domain protein found in Thermotoga maritima, Tm0935, and delta proteobacteria. This CD includes the C-terminal ACT domain of the cystathionine beta-synthase (CBS) domain protein found in Thermotoga maritima, Tm0935, and delta proteobacteria. This protein has two N-terminal tandem CBS domains and a single C-terminal ACT domain. The CBS domain is found in a wide range of proteins, often in tandem arrangements and together with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=28.51 E-value=1.6e+02 Score=18.70 Aligned_cols=57 Identities=12% Similarity=0.134 Sum_probs=33.4
Q ss_pred EEEee-cChhhHHHHHHHHhccCC-e-eEEEeC-----CCCEEEEEeecC-H---HHHHHHHhcCCceE
Q 048021 15 LKVDT-NDPAWFSSMTKVLRKIKG-A-SYTIDA-----EEGMAYITGRAN-P---RKLLRKLQSGKYAN 71 (143)
Q Consensus 15 lkV~m-~C~gC~~kIkkaL~kI~G-V-sV~VDl-----e~~~VtV~G~vd-p---~~Ll~aI~kGk~Ae 71 (143)
|.|.+ ...|=-.+|...|.+..+ + ++..+. ..+.+.|.-..+ + +.|+++|+++|++.
T Consensus 2 l~v~~~d~pG~L~~l~~~i~~~g~nI~~i~~~~~~~~~~~~~~~v~v~~e~~~~~~~i~~~L~~~~~~~ 70 (72)
T cd04884 2 FTFLLEDKPGTLKPVVDTLREFNARIISILTAFEDAPDGMRRVFIRVTPMDRSKENELIEELKAKFTVV 70 (72)
T ss_pred EEEEecCCCccHHHHHHHHHHCCCeEEEEEeccccCCCCccEEEEEEEEecchHHHHHHHHHhCcccEE
Confidence 44555 578888888898888544 2 333322 234455542222 3 48888886676554
No 231
>PF08502 LeuA_dimer: LeuA allosteric (dimerisation) domain; InterPro: IPR013709 This is the C-terminal regulatory (R) domain of alpha-isopropylmalate synthase, which catalyses the first committed step in the leucine biosynthetic pathway []. This domain, is an internally duplicated structure with a novel fold []. It comprises two similar units that are arranged such that the two -helices pack together in the centre, crossing at an angle of 34 degrees, sandwiched between the two three-stranded, antiparallel beta-sheets. The overall domain is thus constructed as a beta-alpha-beta three-layer sandwich []. ; GO: 0003852 2-isopropylmalate synthase activity, 0009098 leucine biosynthetic process; PDB: 3HQ1_A 3HPZ_B 1SR9_A 3FIG_B 3HPS_A 3F6G_A 3F6H_A.
Probab=28.38 E-value=75 Score=23.32 Aligned_cols=38 Identities=18% Similarity=0.210 Sum_probs=27.6
Q ss_pred eEEEeC-CCCEEEEE--eecCHHHHHHHHhc--CCceEEeecC
Q 048021 39 SYTIDA-EEGMAYIT--GRANPRKLLRKLQS--GKYANLCWVS 76 (143)
Q Consensus 39 sV~VDl-e~~~VtV~--G~vdp~~Ll~aI~k--Gk~Ael~~~~ 76 (143)
.|++.. +....++. |.-+-+.+.++|.+ |.+++|....
T Consensus 40 ~v~l~~~~g~~~~~~~~GnGpv~A~~~Al~~~~g~~i~l~dy~ 82 (133)
T PF08502_consen 40 TVTLEVTDGEEHTISGTGNGPVDAFFNALNKALGIDIELIDYS 82 (133)
T ss_dssp EEEEEEETTEEEEEEEEESSHHHHHHHHHHCHCCCEEEEEEEE
T ss_pred EEEEEEcCCeEEEEEEECCChHHHHHHHHHHHcCCceEEEEEE
Confidence 355555 55555555 56789999999999 9999887653
No 232
>PRK14719 bifunctional RNAse/5-amino-6-(5-phosphoribosylamino)uracil reductase; Provisional
Probab=27.93 E-value=1.2e+02 Score=26.67 Aligned_cols=51 Identities=18% Similarity=0.210 Sum_probs=34.2
Q ss_pred EEEEee-cChhhHHHHHHHHhccCCeeEEEeCCCCEEEEEeecCHHHHHHHHhc-CCceE
Q 048021 14 GLKVDT-NDPAWFSSMTKVLRKIKGASYTIDAEEGMAYITGRANPRKLLRKLQS-GKYAN 71 (143)
Q Consensus 14 vlkV~m-~C~gC~~kIkkaL~kI~GVsV~VDle~~~VtV~G~vdp~~Ll~aI~k-Gk~Ae 71 (143)
++.|.. .|..|+.+|-++ ++.-|=+-.|.+. .|+.-..++++.|+. |.+++
T Consensus 47 ~i~~s~~p~~~cad~ii~~--gi~rVVi~~D~d~-----~G~~~~~~~~~~L~~aGi~V~ 99 (360)
T PRK14719 47 FITVSNTPVFQIADDLIAE--NISEVILLTDFDR-----AGRVYAKNIMEEFQSRGIKVN 99 (360)
T ss_pred EEEEeCCchHHHHHHHHHc--CCCEEEEEECCCC-----CCCccchHHHHHHHHCCCEEE
Confidence 455654 799999888763 4444422225553 676667788899999 99884
No 233
>PRK10503 multidrug efflux system subunit MdtB; Provisional
Probab=27.84 E-value=94 Score=30.77 Aligned_cols=55 Identities=11% Similarity=0.207 Sum_probs=38.6
Q ss_pred EEEEEEee----cChhhHHHHHHHHhccCCe-eEEEeCCC--CEEEEEe--------ecCHHHHHHHHhc
Q 048021 12 TCGLKVDT----NDPAWFSSMTKVLRKIKGA-SYTIDAEE--GMAYITG--------RANPRKLLRKLQS 66 (143)
Q Consensus 12 tvvlkV~m----~C~gC~~kIkkaL~kI~GV-sV~VDle~--~~VtV~G--------~vdp~~Ll~aI~k 66 (143)
.+.+++.+ .=..-+++|++.|++++|| +|..+... .++.|+= -++++.|.++|+.
T Consensus 674 ~v~i~l~G~d~~~L~~~a~~l~~~l~~~pgv~~v~~~~~~~~~e~~v~id~~k~~~~Gls~~~v~~~l~~ 743 (1040)
T PRK10503 674 QYQFTLQATSLDALSTWVPKLMEKLQQLPQLSDVSSDWQDKGLVAYVNVDRDSASRLGISMADVDNALYN 743 (1040)
T ss_pred CeEEEEECCCHHHHHHHHHHHHHHHhcCCCeEEEEccccCCCceEEEEECHHHHHHcCCCHHHHHHHHHH
Confidence 35666654 3456778899999999999 78866554 4555541 2578888888887
No 234
>cd02977 ArsC_family Arsenate Reductase (ArsC) family; composed of TRX-fold arsenic reductases and similar proteins including the transcriptional regulator, Spx. ArsC catalyzes the reduction of arsenate [As(V)] to arsenite [As(III)], using reducing equivalents derived from glutathione (GSH) via glutaredoxin (GRX), through a single catalytic cysteine. This family of predominantly bacterial enzymes is unrelated to two other families of arsenate reductases which show similarity to low-molecular-weight acid phosphatases and phosphotyrosyl phosphatases. Spx is a general regulator that exerts negative and positive control over transcription initiation by binding to the C-terminal domain of the alpha subunit of RNA polymerase.
Probab=27.82 E-value=1.5e+02 Score=20.51 Aligned_cols=42 Identities=19% Similarity=0.105 Sum_probs=27.7
Q ss_pred ecChhhHHHHHHHHhccCCee-EEEeCCCCEEEEEeecCHHHHHHHHhc-CC
Q 048021 19 TNDPAWFSSMTKVLRKIKGAS-YTIDAEEGMAYITGRANPRKLLRKLQS-GK 68 (143)
Q Consensus 19 m~C~gC~~kIkkaL~kI~GVs-V~VDle~~~VtV~G~vdp~~Ll~aI~k-Gk 68 (143)
-.|..|.+. ++.|++ .|+. ..+|..+ ...+.+.|.+.+.+ |.
T Consensus 7 ~~C~~c~ka-~~~L~~-~~i~~~~idi~~------~~~~~~~l~~~~~~~~~ 50 (105)
T cd02977 7 PNCSTSRKA-LAWLEE-HGIEYEFIDYLK------EPPTKEELKELLAKLGL 50 (105)
T ss_pred CCCHHHHHH-HHHHHH-cCCCcEEEeecc------CCCCHHHHHHHHHhcCC
Confidence 479999955 566766 5774 4455543 33567788888877 74
No 235
>PHA00019 IV phage assembly protein
Probab=27.80 E-value=1.7e+02 Score=25.79 Aligned_cols=50 Identities=12% Similarity=0.121 Sum_probs=38.2
Q ss_pred hhhHHHHHHHHhccCCeeEEEeCC-CCEEEEE----eecCHHHHHHHHhc--CCceE
Q 048021 22 PAWFSSMTKVLRKIKGASYTIDAE-EGMAYIT----GRANPRKLLRKLQS--GKYAN 71 (143)
Q Consensus 22 ~gC~~kIkkaL~kI~GVsV~VDle-~~~VtV~----G~vdp~~Ll~aI~k--Gk~Ae 71 (143)
+-=.+.+-+.|..+-|+++.+|.+ .|+|++. -.++.+++++.|.+ |....
T Consensus 30 ~~dI~~vl~~la~~~g~NiVidp~V~G~vTl~~~~l~~v~~~qaLd~iL~~~gl~~~ 86 (428)
T PHA00019 30 NSPIREFVSWYSQQTGKSVVLGPDVKGNVTVYSADVNPANLPQFFDSVLRANGFDLV 86 (428)
T ss_pred CCCHHHHHHHHHHhcCceEEECCCcceEEEEecccccCCCHHHHHHHHHHhcCceEE
Confidence 455666667888888889999887 8999993 34788888888877 77653
No 236
>PF01873 eIF-5_eIF-2B: Domain found in IF2B/IF5; InterPro: IPR002735 The beta subunit of archaeal and eukaryotic translation initiation factor 2 (IF2beta) and the N-terminal domain of translation initiation factor 5 (IF5) show significant sequence homology []. Archaeal IF2beta contains two independent structural domains: an N-terminal mixed alpha/beta core domain (topological similarity to the common core of ribosomal proteins L23 and L15e), and a C-terminal domain consisting of a zinc-binding C4 finger []. Archaeal IF2beta is a ribosome-dependent GTPase that stimulates the binding of initiator Met-tRNA(i)(Met) to the ribosomes, even in the absence of other factors []. The C-terminal domain of eukaryotic IF5 is involved in the formation of the multi-factor complex (MFC), an important intermediate for the 43S pre-initiation complex assembly []. IF5 interacts directly with IF1, IF2beta and IF3c, which together with IF2-bound Met-tRNA(i)(Met) form the MFC. This entry represents both the N-terminal and zinc-binding domains of IF2, as well as a domain in IF5.; GO: 0003743 translation initiation factor activity, 0006413 translational initiation; PDB: 2DCU_B 2D74_B 2E9H_A 2G2K_A 1NEE_A 3CW2_L 2QMU_C 3V11_C 2NXU_A 2QN6_C ....
Probab=27.73 E-value=79 Score=23.73 Aligned_cols=35 Identities=26% Similarity=0.377 Sum_probs=27.3
Q ss_pred HHhccCCeeEEEeCCCCEEEEEeecCHHHHHHHHhc
Q 048021 31 VLRKIKGASYTIDAEEGMAYITGRANPRKLLRKLQS 66 (143)
Q Consensus 31 aL~kI~GVsV~VDle~~~VtV~G~vdp~~Ll~aI~k 66 (143)
-|..-=|....+|.+ ++++|.|..++++|.+.|.+
T Consensus 54 y~~~ELgt~~~id~~-~~lii~G~~~~~~i~~~L~~ 88 (125)
T PF01873_consen 54 YFGKELGTQGSIDGK-GRLIINGRFSSKQIQDLLDK 88 (125)
T ss_dssp HHHHHSSSEEEEETT-TEEEEESSSSCCHHHHHHHH
T ss_pred HHHHHHCCceEECCC-CEEEEEEecCHHHHHHHHHH
Confidence 333333566778887 99999999999999999876
No 237
>PF02107 FlgH: Flagellar L-ring protein; InterPro: IPR000527 The flgH, flgI and fliF genes of Salmonella typhimurium encode the major proteins for the L, P and M rings of the flagellar basal body []. In fact, the basal body consists of four rings (L,P,S and M) surrounding the flagellar rod, which is believed to transmit motor rotation to the filament []. The M ring is integral to the inner membrane of the cell, and may be connected to the rod via the S (supramembrane) ring, which lies just distal to it. The L and P rings reside in the outer membrane and periplasmic space, respectively. FlgH and FlgI, which are exported across the cell membrane to their destinations in the outer membrane and periplasmic space, have typical N-terminal cleaved signal-peptide sequences. FlgH is predicted to have an extensive beta-sheet structure, in keeping with other outer membrane proteins [].; GO: 0003774 motor activity, 0001539 ciliary or flagellar motility, 0009427 bacterial-type flagellum basal body, distal rod, L ring
Probab=27.71 E-value=41 Score=26.50 Aligned_cols=29 Identities=21% Similarity=0.376 Sum_probs=24.9
Q ss_pred hccCCe-eEEEeCCCCEEEEEeecCHHHHH
Q 048021 33 RKIKGA-SYTIDAEEGMAYITGRANPRKLL 61 (143)
Q Consensus 33 ~kI~GV-sV~VDle~~~VtV~G~vdp~~Ll 61 (143)
..|+|- .+.|+-+...++|+|.+.|+.|-
T Consensus 109 L~I~G~k~i~vn~e~~~i~lsGiVRp~DI~ 138 (179)
T PF02107_consen 109 LVIEGEKQIRVNGEEQYIRLSGIVRPEDID 138 (179)
T ss_pred EEEEEEEEEEECCCEEEEEEEEEECHHHCC
Confidence 356777 69999999999999999998875
No 238
>PRK10555 aminoglycoside/multidrug efflux system; Provisional
Probab=27.69 E-value=94 Score=30.70 Aligned_cols=52 Identities=13% Similarity=-0.025 Sum_probs=37.4
Q ss_pred ChhhHHHHHHHHhccCCe-eEEEeCCC--CEEEEEe--------ecCHHHHHHHHhc---CCceEE
Q 048021 21 DPAWFSSMTKVLRKIKGA-SYTIDAEE--GMAYITG--------RANPRKLLRKLQS---GKYANL 72 (143)
Q Consensus 21 C~gC~~kIkkaL~kI~GV-sV~VDle~--~~VtV~G--------~vdp~~Ll~aI~k---Gk~Ael 72 (143)
=..-++.+++.|++++|+ +|..+... .++.|+= -++++.|.++|+. |..+..
T Consensus 694 L~~~a~~l~~~l~~~~gv~~v~~~~~~~~~~l~i~id~~k~~~~Gls~~~v~~~l~~~~~G~~~~~ 759 (1037)
T PRK10555 694 LMAARNQLLALAAKNPELTRVRHNGLDDSPQLQIDIDQRKAQALGVSIDDINDTLQTAWGSSYVND 759 (1037)
T ss_pred HHHHHHHHHHHHhcCCCeEEEecCCcCCCceEEEEECHHHHHHcCCCHHHHHHHHHHHhcCceeeE
Confidence 456778899999999999 78887765 4555542 2577888888886 655443
No 239
>PRK08965 putative monovalent cation/H+ antiporter subunit E; Reviewed
Probab=27.55 E-value=2.9e+02 Score=21.24 Aligned_cols=53 Identities=11% Similarity=0.268 Sum_probs=36.7
Q ss_pred EEEEEEeecChhhHHHHHHHHhccCCe-eEEEeCCCCEEEEEe--ecCHHHHHHHH
Q 048021 12 TCGLKVDTNDPAWFSSMTKVLRKIKGA-SYTIDAEEGMAYITG--RANPRKLLRKL 64 (143)
Q Consensus 12 tvvlkV~m~C~gC~~kIkkaL~kI~GV-sV~VDle~~~VtV~G--~vdp~~Ll~aI 64 (143)
.+++......+.-.--....+.=.||- .+++|.+++++.|.. .-++++..+.+
T Consensus 94 iv~vp~~~~~~~~~~~lan~ITLtPGTl~vdv~~~~~~L~VH~Ld~~~~~~~~~~i 149 (162)
T PRK08965 94 FVAVPLRLRSDLALTLLANIITLTPGTVVVEISRDRRTLYVHVLDLDDPEALIREI 149 (162)
T ss_pred EEEEeecCCChHHHHHHHHHHccCCCeEEEEEeCCCCEEEEEEeeCCCHHHHHHHH
Confidence 344444456777777777777778997 899999999999986 23555444443
No 240
>PRK03988 translation initiation factor IF-2 subunit beta; Validated
Probab=27.52 E-value=1.2e+02 Score=23.12 Aligned_cols=28 Identities=29% Similarity=0.510 Sum_probs=23.5
Q ss_pred CeeEEEeCCCCEEEEEeecCHHHHHHHHhc
Q 048021 37 GASYTIDAEEGMAYITGRANPRKLLRKLQS 66 (143)
Q Consensus 37 GVsV~VDle~~~VtV~G~vdp~~Ll~aI~k 66 (143)
|....+ ++++++|.|..++..|.+.|++
T Consensus 70 gt~g~i--~~~~lii~G~~~~~~i~~~L~~ 97 (138)
T PRK03988 70 GTAGNI--EGGRLILQGKFSPRVINEKIDR 97 (138)
T ss_pred CCceee--cCCEEEEEEeeCHHHHHHHHHH
Confidence 554455 5799999999999999999998
No 241
>COG0480 FusA Translation elongation factors (GTPases) [Translation, ribosomal structure and biogenesis]
Probab=27.51 E-value=2.8e+02 Score=26.70 Aligned_cols=63 Identities=17% Similarity=0.265 Sum_probs=48.6
Q ss_pred ceEEEEEEeecChhhHHHHHHHHhccCC----eeEEEeCCCCEEEEEe--ecCHHHHHHHHhc--CCceEE
Q 048021 10 KMTCGLKVDTNDPAWFSSMTKVLRKIKG----ASYTIDAEEGMAYITG--RANPRKLLRKLQS--GKYANL 72 (143)
Q Consensus 10 ~~tvvlkV~m~C~gC~~kIkkaL~kI~G----VsV~VDle~~~VtV~G--~vdp~~Ll~aI~k--Gk~Ael 72 (143)
...+.+.|+-.-..=..|+..+|++|.- +.|.+|.+.+...|.| .+--+.++++|++ |-.+++
T Consensus 404 ePVi~vavepk~~~d~~Kl~~aL~~l~~eDPt~~v~~d~Etge~iIsGmGELHLei~~drl~~~~~Vev~~ 474 (697)
T COG0480 404 EPVISVAVEPKTKADQEKLSEALNKLAEEDPTFRVETDEETGETIISGMGELHLEIIVDRLKREFGVEVEV 474 (697)
T ss_pred CceEEEEEeECChhhHHHHHHHHHHHHhhCCceEEEEcCCcccEEEEecchhhHHHHHHHHHhhcCceEEe
Confidence 3456677777778888899999998843 3599999999999995 6778889988887 555543
No 242
>PRK11152 ilvM acetolactate synthase 2 regulatory subunit; Provisional
Probab=27.50 E-value=1.5e+02 Score=20.36 Aligned_cols=30 Identities=17% Similarity=0.112 Sum_probs=21.6
Q ss_pred EEEEEEeecChhhHHHHHHHHhccCCe-eEEEe
Q 048021 12 TCGLKVDTNDPAWFSSMTKVLRKIKGA-SYTID 43 (143)
Q Consensus 12 tvvlkV~m~C~gC~~kIkkaL~kI~GV-sV~VD 43 (143)
.++|-|. -+.....|.|-|.|+.+| .|+++
T Consensus 46 riti~v~--~~~~i~ql~kQL~KL~dV~~V~~~ 76 (76)
T PRK11152 46 NIELTVA--SERPIDLLSSQLNKLVDVAHVEIL 76 (76)
T ss_pred EEEEEEC--CCchHHHHHHHHhcCcCeEEEEEC
Confidence 4455553 588888888888888888 67653
No 243
>PF13098 Thioredoxin_2: Thioredoxin-like domain; PDB: 1T3B_A 2L57_A 1EEJ_B 1TJD_A 1JZD_B 1JZO_A 1G0T_B 3GV1_A 1V58_A 2H0H_A ....
Probab=27.47 E-value=73 Score=21.55 Aligned_cols=30 Identities=27% Similarity=0.546 Sum_probs=17.0
Q ss_pred cCCe-e-EEEeCCCCEE-EEEeecCHHHHHHHH
Q 048021 35 IKGA-S-YTIDAEEGMA-YITGRANPRKLLRKL 64 (143)
Q Consensus 35 I~GV-s-V~VDle~~~V-tV~G~vdp~~Ll~aI 64 (143)
+.|. . +-+|.+.+.| .+.|-.++++|++.|
T Consensus 80 v~gtPt~~~~d~~G~~v~~~~G~~~~~~l~~~L 112 (112)
T PF13098_consen 80 VNGTPTIVFLDKDGKIVYRIPGYLSPEELLKML 112 (112)
T ss_dssp --SSSEEEECTTTSCEEEEEESS--HHHHHHHH
T ss_pred CCccCEEEEEcCCCCEEEEecCCCCHHHHHhhC
Confidence 4455 4 4455455545 467899999998876
No 244
>cd04887 ACT_MalLac-Enz ACT_MalLac-Enz CD includes the N-terminal ACT domain of putative NAD-dependent malic enzyme 1, Bacillus subtilis YqkI and related domains. The ACT_MalLac-Enz CD includes the N-terminal ACT domain of putative NAD-dependent malic enzyme 1, Bacillus subtilis YqkI, a malolactic enzyme (MalLac-Enz) which converts malate to lactate, and other related ACT domains. The yqkJ product is predicted to convert malate directly to lactate, as opposed to related malic enzymes that convert malate to pyruvate. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=27.45 E-value=1.2e+02 Score=19.13 Aligned_cols=26 Identities=12% Similarity=0.220 Sum_probs=13.9
Q ss_pred EEEEEeecChhhHHHHHHHHhccCCe
Q 048021 13 CGLKVDTNDPAWFSSMTKVLRKIKGA 38 (143)
Q Consensus 13 vvlkV~m~C~gC~~kIkkaL~kI~GV 38 (143)
+.|.|+..-..=.+.|.+.|++++||
T Consensus 42 ~~~~vev~~~~~l~~i~~~L~~i~gV 67 (74)
T cd04887 42 RDITVDAPSEEHAETIVAAVRALPEV 67 (74)
T ss_pred EEEEEEcCCHHHHHHHHHHHhcCCCe
Confidence 33444443333334566677777776
No 245
>PRK14054 methionine sulfoxide reductase A; Provisional
Probab=27.38 E-value=2.2e+02 Score=22.56 Aligned_cols=30 Identities=23% Similarity=0.265 Sum_probs=22.4
Q ss_pred eEEEEEEeecChhhHHHHHHHHhccCCe-eEEEeCC
Q 048021 11 MTCGLKVDTNDPAWFSSMTKVLRKIKGA-SYTIDAE 45 (143)
Q Consensus 11 ~tvvlkV~m~C~gC~~kIkkaL~kI~GV-sV~VDle 45 (143)
.+++|. +||-=.+++.+.+++|| ++++=-.
T Consensus 4 ~~a~fa-----gGCFWg~E~~f~~~~GV~~t~vGYa 34 (172)
T PRK14054 4 ETAVLA-----GGCFWGMEAPFDRVKGVISTRVGYT 34 (172)
T ss_pred eEEEEE-----cCChhhhHHHHccCCCEEEEEeeec
Confidence 345565 78888889999999999 7776433
No 246
>TIGR03143 AhpF_homolog putative alkyl hydroperoxide reductase F subunit. This family of thioredoxin reductase homologs is found adjacent to alkylhydroperoxide reductase C subunit predominantly in cases where there is only one C subunit in the genome and that genome is lacking the F subunit partner (also a thioredcxin reductase homolog) that is usually found (TIGR03140).
Probab=27.23 E-value=3.1e+02 Score=24.91 Aligned_cols=52 Identities=10% Similarity=0.127 Sum_probs=30.2
Q ss_pred EEEEE--eecChhhHHHHH---HHHhccCCee-EEEeCC-------------------CCEEEEEeecCHHHHHHHH
Q 048021 13 CGLKV--DTNDPAWFSSMT---KVLRKIKGAS-YTIDAE-------------------EGMAYITGRANPRKLLRKL 64 (143)
Q Consensus 13 vvlkV--~m~C~gC~~kIk---kaL~kI~GVs-V~VDle-------------------~~~VtV~G~vdp~~Ll~aI 64 (143)
+.|+| ..+|..|.+.++ ++....++|+ -.||.. ++++...|.++.++|++.|
T Consensus 478 ~~i~v~~~~~C~~Cp~~~~~~~~~~~~~~~i~~~~i~~~~~~~~~~~~~v~~vP~~~i~~~~~~~G~~~~~~~~~~~ 554 (555)
T TIGR03143 478 VNIKIGVSLSCTLCPDVVLAAQRIASLNPNVEAEMIDVSHFPDLKDEYGIMSVPAIVVDDQQVYFGKKTIEEMLELI 554 (555)
T ss_pred eEEEEEECCCCCCcHHHHHHHHHHHHhCCCceEEEEECcccHHHHHhCCceecCEEEECCEEEEeeCCCHHHHHHhh
Confidence 45665 458999986333 3333335663 222221 3556667777888888766
No 247
>TIGR02109 PQQ_syn_pqqE coenzyme PQQ biosynthesis protein E. This model describes coenzyme PQQ biosynthesis protein E, a gene required for the biosynthesis of pyrrolo-quinoline-quinone (coenzyme PQQ). PQQ is required for some glucose dehydrogenases and alcohol dehydrogenases.
Probab=27.21 E-value=1.4e+02 Score=25.10 Aligned_cols=19 Identities=5% Similarity=-0.115 Sum_probs=12.6
Q ss_pred CCceEEEEEE----eecChhhHH
Q 048021 8 SAKMTCGLKV----DTNDPAWFS 26 (143)
Q Consensus 8 s~~~tvvlkV----~m~C~gC~~ 26 (143)
+.+..+.|.+ |+.|..|..
T Consensus 4 ~~P~~l~ieiT~~CNl~C~~C~~ 26 (358)
T TIGR02109 4 GPPLWLLAELTHRCPLQCPYCSN 26 (358)
T ss_pred CCCcEEEEeeccccCcCCCCCCC
Confidence 4555666665 578999953
No 248
>COG0106 HisA Phosphoribosylformimino-5-aminoimidazole carboxamide ribonucleotide (ProFAR) isomerase [Amino acid transport and metabolism]
Probab=27.12 E-value=1.6e+02 Score=24.72 Aligned_cols=49 Identities=16% Similarity=0.260 Sum_probs=36.7
Q ss_pred HHHHHHHhccCC-eeEEEeCCCCEEEEEee-----cCHHHHHHHHhc-CCceEEee
Q 048021 26 SSMTKVLRKIKG-ASYTIDAEEGMAYITGR-----ANPRKLLRKLQS-GKYANLCW 74 (143)
Q Consensus 26 ~kIkkaL~kI~G-VsV~VDle~~~VtV~G~-----vdp~~Ll~aI~k-Gk~Ael~~ 74 (143)
.-+++++++..+ |-|.+|...++|.|.|. +++..+++.++. |-...|+-
T Consensus 112 ~~v~~~~~~~g~rivv~lD~r~g~vav~GW~e~s~~~~~~l~~~~~~~g~~~ii~T 167 (241)
T COG0106 112 DLVKELCEEYGDRIVVALDARDGKVAVSGWQEDSGVELEELAKRLEEVGLAHILYT 167 (241)
T ss_pred HHHHHHHHHcCCcEEEEEEccCCccccccccccccCCHHHHHHHHHhcCCCeEEEE
Confidence 345666666653 35889999999999873 588999999999 88776553
No 249
>TIGR01158 SUI1_rel translation initation factor SUI1, putative, prokaryotic. This family of archaeal and bacterial proteins is homologous to the eukaryotic translation intiation factor SUI1 involved in directing the ribosome to the proper start site of translation by functioning in concert with eIF-2 and the initiator tRNA-Met.
Probab=27.08 E-value=1.6e+02 Score=21.18 Aligned_cols=44 Identities=23% Similarity=0.332 Sum_probs=25.3
Q ss_pred HHHHHHhccCCeeEEEeCCCCEEEEEeecCHHHHHHHHhc-CCceEEe
Q 048021 27 SMTKVLRKIKGASYTIDAEEGMAYITGRANPRKLLRKLQS-GKYANLC 73 (143)
Q Consensus 27 kIkkaL~kI~GVsV~VDle~~~VtV~G~vdp~~Ll~aI~k-Gk~Ael~ 73 (143)
++-+.|++.=|...+| +++.+.|.|+. .++|.+.|.+ |+++..+
T Consensus 55 ~l~k~LKk~~gcGgtv--k~~~IeiQGD~-~~~v~~~L~~~G~~~k~~ 99 (101)
T TIGR01158 55 ELAKELKSKCGCGGTV--KDGVIEIQGDH-RDRVKDLLEKKGFKVKLI 99 (101)
T ss_pred HHHHHHHHHhcCCeeE--eCCEEEEeCcH-HHHHHHHHHHcCCCeeec
Confidence 4444455443442222 35677777753 4556677777 8887754
No 250
>PRK09019 translation initiation factor Sui1; Validated
Probab=27.06 E-value=1.5e+02 Score=21.91 Aligned_cols=43 Identities=23% Similarity=0.339 Sum_probs=27.5
Q ss_pred HHHHHHHhccCCe--eEEEeCCCCEEEEEeecCHHHHHHHHhc-CCceEEe
Q 048021 26 SSMTKVLRKIKGA--SYTIDAEEGMAYITGRANPRKLLRKLQS-GKYANLC 73 (143)
Q Consensus 26 ~kIkkaL~kI~GV--sV~VDle~~~VtV~G~vdp~~Ll~aI~k-Gk~Ael~ 73 (143)
+++-+.|++-=|. +|. ++.+.|.|+ -.++|.+.|.+ ||++.+.
T Consensus 61 k~l~K~lKkk~gcGGtvk----~~~IelQGD-~r~~v~~~L~~~Gf~~k~~ 106 (108)
T PRK09019 61 KKLAAELKKKCGCGGAVK----DGVIEIQGD-KRDLLKSLLEAKGMKVKLA 106 (108)
T ss_pred HHHHHHHHHHhcCCCeEE----cCEEEEcCc-HHHHHHHHHHHCCCeEEec
Confidence 4444444443344 343 567777774 35778888888 9998865
No 251
>PRK05412 putative nucleotide-binding protein; Reviewed
Probab=27.02 E-value=1.6e+02 Score=23.42 Aligned_cols=69 Identities=17% Similarity=0.110 Sum_probs=39.4
Q ss_pred EEEEee-cChhhHHHHHHHHhc---cCCeeEEEeCCCCEEEEEeecC------HHHHHHHHhc-CCceEEeecCCCCCCc
Q 048021 14 GLKVDT-NDPAWFSSMTKVLRK---IKGASYTIDAEEGMAYITGRAN------PRKLLRKLQS-GKYANLCWVSAGNQIT 82 (143)
Q Consensus 14 vlkV~m-~C~gC~~kIkkaL~k---I~GVsV~VDle~~~VtV~G~vd------p~~Ll~aI~k-Gk~Ael~~~~~~~s~~ 82 (143)
+=+|+| --....+...|.|.. +.|+..+|++++..+++.+.-| -+.|..++.+ |-....+..+..+...
T Consensus 7 VSevD~qEv~NAv~qa~kEi~~RyDFKgs~a~iel~~~~i~l~a~~d~kl~~v~diL~~kl~KR~i~~k~ld~~~~e~~s 86 (161)
T PRK05412 7 VSEVDLQEVDNAVDQANKEISTRFDFKGSKASIELKEEEITLTAESDFQLKQVKDILRSKLIKRGIDLKALDYGKVEKAS 86 (161)
T ss_pred eeeCcHHHHHHHHHHHHHHHhcccccCCCccEEEEcCCEEEEEeCCHHHHHHHHHHHHHHHHHcCCCHHHcCCCCccccC
Confidence 334454 234444555555543 5688778888888999987543 3344445555 6665555554433433
No 252
>cd03418 GRX_GRXb_1_3_like Glutaredoxin (GRX) family, GRX bacterial class 1 and 3 (b_1_3)-like subfamily; composed of bacterial GRXs, approximately 10 kDa in size, and proteins containing a GRX or GRX-like domain. GRX is a glutathione (GSH) dependent reductase, catalyzing the disulfide reduction of target proteins such as ribonucleotide reductase. It contains a redox active CXXC motif in a TRX fold and uses a similar dithiol mechanism employed by TRXs for intramolecular disulfide bond reduction of protein substrates. Unlike TRX, GRX has preference for mixed GSH disulfide substrates, in which it uses a monothiol mechanism where only the N-terminal cysteine is required. The flow of reducing equivalents in the GRX system goes from NADPH - GSH reductase - GSH - GRX - protein substrates. By altering the redox state of target proteins, GRX is involved in many cellular functions including DNA synthesis, signal transduction and the defense against oxidative stress. Different classes are known i
Probab=26.98 E-value=1.7e+02 Score=18.34 Aligned_cols=25 Identities=28% Similarity=0.325 Sum_probs=16.6
Q ss_pred eecChhhHHHHHHHHhccCCee-EEEeC
Q 048021 18 DTNDPAWFSSMTKVLRKIKGAS-YTIDA 44 (143)
Q Consensus 18 ~m~C~gC~~kIkkaL~kI~GVs-V~VDl 44 (143)
.-.|..|. ++++.|++. |+. -.+|.
T Consensus 7 ~~~Cp~C~-~ak~~L~~~-~i~~~~i~i 32 (75)
T cd03418 7 KPNCPYCV-RAKALLDKK-GVDYEEIDV 32 (75)
T ss_pred CCCChHHH-HHHHHHHHC-CCcEEEEEC
Confidence 34799998 677778774 773 33444
No 253
>COG1774 Uncharacterized homolog of PSP1 [Function unknown]
Probab=26.83 E-value=50 Score=28.11 Aligned_cols=57 Identities=14% Similarity=0.128 Sum_probs=43.8
Q ss_pred hhhHHHHHHHHhccCCeeEEEeCCCCEEEEE----eecCHHHHHHHHhc--CCceEEeecCCC
Q 048021 22 PAWFSSMTKVLRKIKGASYTIDAEEGMAYIT----GRANPRKLLRKLQS--GKYANLCWVSAG 78 (143)
Q Consensus 22 ~gC~~kIkkaL~kI~GVsV~VDle~~~VtV~----G~vdp~~Ll~aI~k--Gk~Ael~~~~~~ 78 (143)
.-|..+|++-...+.=|+++++++.+++++. |++|=..|++-|.+ +-+.|++.++.-
T Consensus 86 ~~~~~k~~e~~Ldm~lv~~~~~~d~n~~v~~~ta~grvDfr~lv~dLa~~fk~rIElrqig~r 148 (265)
T COG1774 86 TICIVKFEEHRLDMKLVDVEITLDNNKYVFLFTARGRVDFRELVGDLAKIFKTRIELRQIGVR 148 (265)
T ss_pred HHHHHHHHhCCcceEEEEEEeeccCccEEEEeccccchhHHHHHHHHHHHHHhhhHHhhcCch
Confidence 4588888887777665678889988888885 67888888888887 777777777543
No 254
>cd03027 GRX_DEP Glutaredoxin (GRX) family, Dishevelled, Egl-10, and Pleckstrin (DEP) subfamily; composed of uncharacterized proteins containing a GRX domain and additional domains DEP and DUF547, both of which have unknown functions. GRX is a glutathione (GSH) dependent reductase containing a redox active CXXC motif in a TRX fold. It has preference for mixed GSH disulfide substrates, in which it uses a monothiol mechanism where only the N-terminal cysteine is required. By altering the redox state of target proteins, GRX is involved in many cellular functions.
Probab=26.80 E-value=1.7e+02 Score=18.58 Aligned_cols=25 Identities=20% Similarity=0.244 Sum_probs=17.3
Q ss_pred ecChhhHHHHHHHHhccCCee-EEEeCC
Q 048021 19 TNDPAWFSSMTKVLRKIKGAS-YTIDAE 45 (143)
Q Consensus 19 m~C~gC~~kIkkaL~kI~GVs-V~VDle 45 (143)
-.|..|. ++++.|.. .||. ..+|++
T Consensus 9 ~~C~~C~-ka~~~L~~-~gi~~~~~di~ 34 (73)
T cd03027 9 LGCEDCT-AVRLFLRE-KGLPYVEINID 34 (73)
T ss_pred CCChhHH-HHHHHHHH-CCCceEEEECC
Confidence 4799998 67777887 4774 444554
No 255
>cd01570 NAPRTase_A Nicotinate phosphoribosyltransferase (NAPRTase), subgroup A. Nicotinate phosphoribosyltransferase catalyses the formation of NAMN and PPi from 5-phosphoribosy -1-pyrophosphate (PRPP) and nicotinic acid, this is the first, and also rate limiting, reaction in the NAD salvage synthesis. This salvage pathway serves to recycle NAD degradation products. This subgroup is present in bacteria and eukaryota (except funghi).
Probab=26.79 E-value=1.9e+02 Score=24.76 Aligned_cols=51 Identities=10% Similarity=-0.027 Sum_probs=34.3
Q ss_pred hhHHHHHHHHhccCCeeEEEeCCCCEEEEEeecCHHHHHHHHhcCCceEEeecCCCCC
Q 048021 23 AWFSSMTKVLRKIKGASYTIDAEEGMAYITGRANPRKLLRKLQSGKYANLCWVSAGNQ 80 (143)
Q Consensus 23 gC~~kIkkaL~kI~GVsV~VDle~~~VtV~G~vdp~~Ll~aI~kGk~Ael~~~~~~~s 80 (143)
....++++.|.++ |. +.-+|.+++.+|+++|++..++|..+..++++..-.
T Consensus 262 ~~~~~~r~~l~~~-G~------~~~~Iv~Sdgld~~~i~~l~~~g~~~d~fGvGt~L~ 312 (327)
T cd01570 262 YLSKEARKMLDEA-GL------TKVKIVASNDLDEYTIAALNAQGAPIDAFGVGTRLV 312 (327)
T ss_pred HHHHHHHHHHHHC-CC------CCcEEEEeCCCCHHHHHHHHHCCCeeEEEecCcccc
Confidence 3445555555553 33 223777778899999998654499999999976543
No 256
>TIGR02024 FtcD glutamate formiminotransferase. This model covers enzymes from metazoa as well as gram-positive bacteria and archaea. In humans, deficiency of this enzyme results in a disease phenotype. The crystal structure of the enzyme has been studied in the context of the catalytic mechanism.
Probab=26.64 E-value=2.4e+02 Score=24.50 Aligned_cols=55 Identities=7% Similarity=0.163 Sum_probs=37.8
Q ss_pred EEEEEEeecChh--hHHHHHHHHh----ccCCe-eEEEeCC-CCEEEEEe------ecCHHHHHHHHhc
Q 048021 12 TCGLKVDTNDPA--WFSSMTKVLR----KIKGA-SYTIDAE-EGMAYITG------RANPRKLLRKLQS 66 (143)
Q Consensus 12 tvvlkV~m~C~g--C~~kIkkaL~----kI~GV-sV~VDle-~~~VtV~G------~vdp~~Ll~aI~k 66 (143)
-+-++|++.|.. =+++|.++++ .+++| .+-+.++ .+.|.|.= ...+..+.+.|++
T Consensus 182 liayNV~L~t~d~~~a~~IAk~vR~~gGGL~~VqAmgl~~~~~~~vqVsmNl~d~~~t~~~~v~e~v~~ 250 (298)
T TIGR02024 182 LIAFNVNLGTSNLEIAKKIAKAIRFQGGGLRFVKAIGLYLEEKNLVQVSMNLTNYEKTPLYRVFELIKM 250 (298)
T ss_pred eEEEEEeecCCCHHHHHHHHHHhhhcCCCccceEEeeEeecCCCeEEEEeeccCCccCCHHHHHHHHHH
Confidence 356888884433 3789999998 66777 4666665 68888873 2456677777665
No 257
>cd07422 MPP_ApaH Escherichia coli ApaH and related proteins, metallophosphatase domain. ApaH (also known as symmetrically cleaving Ap4A hydrolase and bis(5'nucleosyl)-tetraphosphatase) is a bacterial member of the PPP (phosphoprotein phosphatase) family of serine/threonine phosphatases that hydrolyzes the nucleotide-signaling molecule diadenosine tetraphosphate (Ap(4)A) into two ADP and also hydrolyzes Ap(5)A, Gp(4)G, and other extending compounds. Null mutations in apaH result in high intracellular levels of Ap(4)A which correlate with multiple phenotypes, including a decreased expression of catabolite-repressible genes, a reduction in the expression of flagellar operons, and an increased sensitivity to UV and heat. Ap4A hydrolase is important in responding to heat shock and oxidative stress via regulating the concentration of Ap4A in bacteria. Ap4A hydrolase is also thought to play a role in siderophore production, but the mechanism by which ApaH interacts with siderophore pathwa
Probab=26.62 E-value=1.4e+02 Score=24.73 Aligned_cols=44 Identities=23% Similarity=0.456 Sum_probs=32.2
Q ss_pred hhhHHHHHHHHhccCCeeEEEeCCCCEEEEEeec-----CHHHHHHHHhc-CCce
Q 048021 22 PAWFSSMTKVLRKIKGASYTIDAEEGMAYITGRA-----NPRKLLRKLQS-GKYA 70 (143)
Q Consensus 22 ~gC~~kIkkaL~kI~GVsV~VDle~~~VtV~G~v-----dp~~Ll~aI~k-Gk~A 70 (143)
.||...++++|.++. .+.+..++.+.|.+ ++.++++.|.+ +-.+
T Consensus 8 HG~~~~L~~LL~~i~-----~~~~~D~Li~lGDlVdRGp~s~evl~~l~~l~~~v 57 (257)
T cd07422 8 QGCYDELQRLLEKIN-----FDPAKDRLWLVGDLVNRGPDSLETLRFVKSLGDSA 57 (257)
T ss_pred CCCHHHHHHHHHhcC-----CCCCCCEEEEecCcCCCCcCHHHHHHHHHhcCCCe
Confidence 489999999998862 33455677777763 58889999888 6333
No 258
>TIGR02520 pilus_B_mal_scr type IVB pilus formation outer membrane protein, R64 PilN family. Several related protein families encode outer membrane pore proteins for type II secretion, type III secretion, and type IV pilus formation. This protein family appears to encode a secretin for pilus formation, although it is quite different from PilQ. Members include the PilN lipoprotein of the plasmid R64 thin pilus, a type IV pilus. Scoring between the trusted and noise cutoffs are examples of bundle-forming pilus B (bfpB).
Probab=26.61 E-value=2.4e+02 Score=25.61 Aligned_cols=61 Identities=8% Similarity=0.119 Sum_probs=39.1
Q ss_pred HHHHHHHhccCC-e-eEEEeCCCCEEEEEeecC----HHHHHHHHhc--C----CceEEeecCCCCCCccccc
Q 048021 26 SSMTKVLRKIKG-A-SYTIDAEEGMAYITGRAN----PRKLLRKLQS--G----KYANLCWVSAGNQITYGNA 86 (143)
Q Consensus 26 ~kIkkaL~kI~G-V-sV~VDle~~~VtV~G~vd----p~~Ll~aI~k--G----k~Ael~~~~~~~s~~~~~~ 86 (143)
+.|++.|+.+=+ . .|.+|...+.|+|++.-+ .+++++.+.+ . .+|.++.+...+...+|-.
T Consensus 209 ~~l~~~i~~~Ls~~G~v~~~~~tg~l~Vt~tp~~l~~V~~~i~~l~~~l~rQV~Iea~IveV~~~~~~q~GId 281 (497)
T TIGR02520 209 NDIQQSIKSMLSSSGSWHLSGSTGSLVVTDVPEVLDRVASYIDSQNRRLTRQVLLNVKVLSVQFKGSDQTGVD 281 (497)
T ss_pred HHHHHHHHHHhCCCCcEEEcCCCCEEEEEeCHHHHHHHHHHHHHHHhhhcceEEEEEEEEEEEeccchhcCcc
Confidence 467888877633 3 699999999999998743 3333443332 2 2456777766655555554
No 259
>TIGR02515 IV_pilus_PilQ type IV pilus secretin (or competence protein) PilQ. A number of proteins homologous to PilQ are involved in type IV pilus formation, competence for transformation, type III secretion, and type II secretion (also called the main terminal branch of the general secretion pathway). Members of this family include PilQ itself, which is a component of the type IV pilus structure, from a number of species. In Haemophilus influenzae, the member of this family is associated with competence for transformation with exogenous DNA rather than with formation of a type IV pilus; the surface structure required for competence may be considered an unusual, incomplete type IV pilus structure.
Probab=26.55 E-value=2e+02 Score=25.10 Aligned_cols=50 Identities=8% Similarity=0.072 Sum_probs=38.1
Q ss_pred hhhHHHHHHHHhccCCeeEEEeCC-CCEEEEEee-cCHHHHHHHHhc--CCceE
Q 048021 22 PAWFSSMTKVLRKIKGASYTIDAE-EGMAYITGR-ANPRKLLRKLQS--GKYAN 71 (143)
Q Consensus 22 ~gC~~kIkkaL~kI~GVsV~VDle-~~~VtV~G~-vdp~~Ll~aI~k--Gk~Ae 71 (143)
+.=.+.|-+.|.+.-|+++.++.+ +++|++.-+ ++++++++.|.+ |+..+
T Consensus 10 ~~~l~dvL~~la~~~g~NiVi~~~V~g~Vtl~~~~v~~~~al~~Il~~~gl~~~ 63 (418)
T TIGR02515 10 DIPVRTVLQVIAEFTNLNIVVSDSVQGNITLRLKNVPWDQALDIILKSKGLDKR 63 (418)
T ss_pred CCCHHHHHHHHHHHhCCeEEECCCCcceEEEEEeCCCHHHHHHHHHHhCCCCEE
Confidence 334566777788888888777766 788888754 789999999988 87765
No 260
>KOG1368 consensus Threonine aldolase [Amino acid transport and metabolism]
Probab=26.43 E-value=1.2e+02 Score=27.10 Aligned_cols=47 Identities=11% Similarity=0.188 Sum_probs=35.8
Q ss_pred hHHHHHHHHhccCCeeEEEe-CCCCEEEEE---eecCHHHHHHHHhc-CCce
Q 048021 24 WFSSMTKVLRKIKGASYTID-AEEGMAYIT---GRANPRKLLRKLQS-GKYA 70 (143)
Q Consensus 24 C~~kIkkaL~kI~GVsV~VD-le~~~VtV~---G~vdp~~Ll~aI~k-Gk~A 70 (143)
=++.+-+.+.+++++.++|- .+++.|.+. .++++++|.+.++| |-.+
T Consensus 283 ~A~~lAe~~~~~~~i~v~v~a~etNiv~~~l~q~~~~~~~l~~~~~k~gi~l 334 (384)
T KOG1368|consen 283 RAKELAEYINTPEEIRVEVPAVETNIVNMVLCQARLTAEELCKFLEKNGILL 334 (384)
T ss_pred HHHHHHHHhccccceeeecchhhcceeeeecccccCCHHHHHHHHHHCCeEE
Confidence 35667777888777877777 677777764 56899999999999 7543
No 261
>COG1596 Wza Periplasmic protein involved in polysaccharide export, contains SLBB domain of b-grasp fold [Cell wall/membrane/envelope biogenesis]
Probab=26.42 E-value=1.6e+02 Score=24.07 Aligned_cols=53 Identities=11% Similarity=0.133 Sum_probs=39.0
Q ss_pred ecChhhHHHHHHHHhcc---CCeeEEEeC-CCCEEEEEeecC----------HHHHHHHHhc--CCceE
Q 048021 19 TNDPAWFSSMTKVLRKI---KGASYTIDA-EEGMAYITGRAN----------PRKLLRKLQS--GKYAN 71 (143)
Q Consensus 19 m~C~gC~~kIkkaL~kI---~GVsV~VDl-e~~~VtV~G~vd----------p~~Ll~aI~k--Gk~Ae 71 (143)
.+=+.-...|+..|.+. +.|+|++.. ...+|.|.|++. ...|+++|.. |....
T Consensus 93 ~T~~el~~~I~~~L~~~~~~P~v~V~v~~~rp~~v~V~GeV~~PG~y~~~~~~~tv~~ai~~AGG~~~~ 161 (239)
T COG1596 93 KTLEELQSEIADRLAGYLVNPQVSVEVAQYRPQKVFVSGEVKTPGQYPLTPRGLTVLDAIALAGGLTPR 161 (239)
T ss_pred CCHHHHHHHHHHHHHhhccCCCeEEEEcccCCcEEEEeeecCCCceEEecCCccHHHHHHHHcCCCCcc
Confidence 36677888888888874 334555544 356899998763 6899999998 87764
No 262
>PRK02769 histidine decarboxylase; Provisional
Probab=26.38 E-value=1.1e+02 Score=26.71 Aligned_cols=30 Identities=13% Similarity=0.366 Sum_probs=21.6
Q ss_pred EeecCHHHHHHHHhc-CCceEEeecCCCCCC
Q 048021 52 TGRANPRKLLRKLQS-GKYANLCWVSAGNQI 81 (143)
Q Consensus 52 ~G~vdp~~Ll~aI~k-Gk~Ael~~~~~~~s~ 81 (143)
.|.+|+++|.++|++ |.+..++...-++..
T Consensus 142 ~g~id~~~L~~~i~~~~~~t~lvv~t~gtt~ 172 (380)
T PRK02769 142 NGEIDYDDLISKIKENKNQPPIIFANIGTTM 172 (380)
T ss_pred CCcCcHHHHHHHHHhCCCCcEEEEEEeCCCC
Confidence 477899999999998 777666555444433
No 263
>smart00827 PKS_AT Acyl transferase domain in polyketide synthase (PKS) enzymes.
Probab=26.22 E-value=2.1e+02 Score=23.01 Aligned_cols=52 Identities=17% Similarity=0.148 Sum_probs=36.8
Q ss_pred HHHHHHHHhccCC-eeEEEeCCCCEEEEEeec-CHHHHHHHHhc-CCceEEeecC
Q 048021 25 FSSMTKVLRKIKG-ASYTIDAEEGMAYITGRA-NPRKLLRKLQS-GKYANLCWVS 76 (143)
Q Consensus 25 ~~kIkkaL~kI~G-VsV~VDle~~~VtV~G~v-dp~~Ll~aI~k-Gk~Ael~~~~ 76 (143)
...+++.|.+... +.+......+.++|.|.. .-+.+.+.+++ |.++..+.+.
T Consensus 135 ~~~~~~~l~~~~~~~~ia~~ns~~~~visG~~~~l~~l~~~l~~~~~~~~~L~v~ 189 (298)
T smart00827 135 EEEVEELLAGYGGRVSVAAVNGPSSVVLSGDEDAVDELAAALEARGIRARRLKVD 189 (298)
T ss_pred HHHHHHHHHhcCCcEEEEEEcCCCCEEEECCHHHHHHHHHHHHHCCceEEECCCC
Confidence 4567777776643 466666677889999864 56667777777 8888777664
No 264
>PRK06824 translation initiation factor Sui1; Validated
Probab=26.19 E-value=1.3e+02 Score=22.57 Aligned_cols=45 Identities=27% Similarity=0.361 Sum_probs=25.4
Q ss_pred HHHHHHHhccCCeeEEEeCCCCEEEEEeecCHHHHHHHHhc-CCceEEe
Q 048021 26 SSMTKVLRKIKGASYTIDAEEGMAYITGRANPRKLLRKLQS-GKYANLC 73 (143)
Q Consensus 26 ~kIkkaL~kI~GVsV~VDle~~~VtV~G~vdp~~Ll~aI~k-Gk~Ael~ 73 (143)
+++-+.|++-=|..-+| +++.+.|.|+ -.++|.+.|.+ ||++.+.
T Consensus 71 k~l~K~LKkk~gcGGtv--kd~~IeiQGD-~r~~v~~~L~~~G~~~k~~ 116 (118)
T PRK06824 71 KELAKELKRRCGTGGTL--KDGVIEIQGD-HVELLLAELLKRGFKAKKA 116 (118)
T ss_pred HHHHHHHHHHhcCCceE--ecCEEEEcCc-HHHHHHHHHHHCCCeEeec
Confidence 34444444433442122 3467777764 24677888888 8888754
No 265
>PRK06382 threonine dehydratase; Provisional
Probab=26.16 E-value=4e+02 Score=23.19 Aligned_cols=62 Identities=16% Similarity=0.287 Sum_probs=37.8
Q ss_pred eEEEEEEee-cChhhHHHHHHHHhccCC-e-eEEEe-----CCCCE--EEEE--ee--cCHHHHHHHHhc-CCceEE
Q 048021 11 MTCGLKVDT-NDPAWFSSMTKVLRKIKG-A-SYTID-----AEEGM--AYIT--GR--ANPRKLLRKLQS-GKYANL 72 (143)
Q Consensus 11 ~tvvlkV~m-~C~gC~~kIkkaL~kI~G-V-sV~VD-----le~~~--VtV~--G~--vdp~~Ll~aI~k-Gk~Ael 72 (143)
+.+.|.|.+ .=.|=-.+|.+.|.+... | ++..+ .+.++ |+|. +. -+.++|+++|++ ||+.+.
T Consensus 329 ~~~rl~v~v~D~pG~L~~l~~ii~~~~~nI~~v~~~~~~~~~~~~~~~v~i~vet~~~~~~~~v~~~L~~~Gy~~~~ 405 (406)
T PRK06382 329 QLVRIECNIPDRPGNLYRIANAIASNGGNIYHAEVDNLRKETPPGFQSVTFTVNVRGQDHLDRILNALREMGYKFNI 405 (406)
T ss_pred CEEEEEEEcCCCCCHHHHHHHHHhcCCCcEEEEEEeeccccCCCCcEEEEEEEEeCCHHHHHHHHHHHHHCCCCeee
Confidence 345566665 446667788888877433 2 45553 22333 4443 22 345599999999 999864
No 266
>COG3133 SlyB Outer membrane lipoprotein [Cell envelope biogenesis, outer membrane]
Probab=26.14 E-value=86 Score=24.73 Aligned_cols=45 Identities=16% Similarity=0.389 Sum_probs=29.3
Q ss_pred HHHHHHHhccCCeeEEEeCCCCEEEE-EeecCHHHHHHHHhc-CCceEEeec
Q 048021 26 SSMTKVLRKIKGASYTIDAEEGMAYI-TGRANPRKLLRKLQS-GKYANLCWV 75 (143)
Q Consensus 26 ~kIkkaL~kI~GVsV~VDle~~~VtV-~G~vdp~~Ll~aI~k-Gk~Ael~~~ 75 (143)
.+|+.+|.+.+||+++|-.+++.-.+ .-+.|+..+ . |-++.|.+.
T Consensus 100 ~~ie~~~n~~~Gvel~IrkDdG~~i~vvQk~d~~~~-----~vGeRV~l~~~ 146 (154)
T COG3133 100 QGIEEAMNKTQGVELEIRKDDGQTIVVVQKQDQTRF-----SVGERVRLASS 146 (154)
T ss_pred hhhHhhhccCCceEEEEEecCCcEEEEEEecCCccc-----CCccEEEEecC
Confidence 57899999999998888877654433 334554332 2 555555544
No 267
>PLN02962 hydroxyacylglutathione hydrolase
Probab=25.98 E-value=3.1e+02 Score=22.55 Aligned_cols=37 Identities=11% Similarity=-0.000 Sum_probs=21.8
Q ss_pred EEEeC--CCCE-EEEE-eecCHHHHHHHHhc-CCceEEeecC
Q 048021 40 YTIDA--EEGM-AYIT-GRANPRKLLRKLQS-GKYANLCWVS 76 (143)
Q Consensus 40 V~VDl--e~~~-VtV~-G~vdp~~Ll~aI~k-Gk~Ael~~~~ 76 (143)
+-+|. .+++ |.|. |..+.+.+++.|++ |.+++.+-..
T Consensus 27 ll~d~~~~~~~avlIDP~~~~~~~~l~~l~~~g~~i~~Il~T 68 (251)
T PLN02962 27 LLADVSHPDKPALLIDPVDKTVDRDLSLVKELGLKLIYAMNT 68 (251)
T ss_pred EEEeCCCCCCEEEEECCCCCcHHHHHHHHHHCCCeeEEEEcC
Confidence 34454 2344 4444 22367888888888 8877655444
No 268
>cd02396 PCBP_like_KH K homology RNA-binding domain, PCBP_like. Members of this group possess KH domains in a tandem arrangement. Most members, similar to the poly(C) binding proteins (PCBPs) and Nova, containing three KH domains, with the first and second domains, which are represented here, in tandem arrangement, followed by a large spacer region, with the third domain near the C-terminal end of the protein. The poly(C) binding proteins (PCBPs) can be divided into two groups, hnRNPs K/J and the alphaCPs, which share a triple KH domain configuration and poly(C) binding specificity. They play roles in mRNA stabilization, translational activation, and translational silencing. Nova-1 and Nova-2 are nuclear RNA-binding proteins that regulate splicing. This group also contains plant proteins that seem to have two tandem repeat arrrangements, like Hen4, a protein that plays a role in AGAMOUS (AG) pre-mRNA processing and important step in plant development. In general, KH binds single-stran
Probab=25.83 E-value=1.7e+02 Score=18.58 Aligned_cols=39 Identities=21% Similarity=0.340 Sum_probs=23.3
Q ss_pred hHHHHHHHHhccCCeeEEEeCCC------CEEEEEeecCHHHHHHHHh
Q 048021 24 WFSSMTKVLRKIKGASYTIDAEE------GMAYITGRANPRKLLRKLQ 65 (143)
Q Consensus 24 C~~kIkkaL~kI~GVsV~VDle~------~~VtV~G~vdp~~Ll~aI~ 65 (143)
.-..|++ |+..-|+.|.+.... ..|+|.|. ++.+.+|+.
T Consensus 18 ~G~~i~~-i~~~tga~I~i~~~~~~~~~~r~v~I~G~--~~~v~~A~~ 62 (65)
T cd02396 18 GGSTIKE-IREETGAKIRVSKSVLPGSTERVVTISGK--PSAVQKALL 62 (65)
T ss_pred CcHHHHH-HHHHHCCEEEEcCCCCCCCCceEEEEEeC--HHHHHHHHH
Confidence 3345555 455577776665433 56777776 566666654
No 269
>COG3870 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=25.74 E-value=1.4e+02 Score=22.39 Aligned_cols=27 Identities=15% Similarity=0.012 Sum_probs=20.8
Q ss_pred EEEeecCHHHHHHHHhc-CCceEEeecC
Q 048021 50 YITGRANPRKLLRKLQS-GKYANLCWVS 76 (143)
Q Consensus 50 tV~G~vdp~~Ll~aI~k-Gk~Ael~~~~ 76 (143)
.|.-..|.++|.++|.+ |++|+-+...
T Consensus 6 aivqd~da~~l~~~L~d~~fraTkLAsT 33 (109)
T COG3870 6 AIVQDQDANELEDALTDKNFRATKLAST 33 (109)
T ss_pred EEEecccHHHHHHHHHhCCceeEEeecc
Confidence 34456799999999999 9998755554
No 270
>PRK00166 apaH diadenosine tetraphosphatase; Reviewed
Probab=25.58 E-value=1.5e+02 Score=24.82 Aligned_cols=44 Identities=23% Similarity=0.485 Sum_probs=31.5
Q ss_pred hhhHHHHHHHHhccCCeeEEEeCCCCEEEEEeec-----CHHHHHHHHhc-CCce
Q 048021 22 PAWFSSMTKVLRKIKGASYTIDAEEGMAYITGRA-----NPRKLLRKLQS-GKYA 70 (143)
Q Consensus 22 ~gC~~kIkkaL~kI~GVsV~VDle~~~VtV~G~v-----dp~~Ll~aI~k-Gk~A 70 (143)
.||...++++|.++. ++.+..++.+.|.+ ++.++++.|.+ +..+
T Consensus 10 HG~~~~l~~ll~~~~-----~~~~~D~li~lGDlVdrGp~s~~vl~~l~~l~~~~ 59 (275)
T PRK00166 10 QGCYDELQRLLEKID-----FDPAKDTLWLVGDLVNRGPDSLEVLRFVKSLGDSA 59 (275)
T ss_pred CCCHHHHHHHHHhcC-----CCCCCCEEEEeCCccCCCcCHHHHHHHHHhcCCCe
Confidence 489999999998862 22345577777763 58888888888 6544
No 271
>TIGR03279 cyano_FeS_chp putative FeS-containing Cyanobacterial-specific oxidoreductase. Members of this protein family are predicted FeS-containing oxidoreductases of unknown function, apparently restricted to and universal across the Cyanobacteria. The high trusted cutoff score for this model, 700 bits, excludes homologs from other lineages. This exclusion seems justified because a significant number of sequence positions are simultaneously unique to and invariant across the Cyanobacteria, suggesting a specialized, conserved function, perhaps related to photosynthesis. A distantly related protein family, TIGR03278, in universal in and restricted to archaeal methanogens, and may be linked to methanogenesis.
Probab=25.57 E-value=4e+02 Score=24.32 Aligned_cols=41 Identities=17% Similarity=0.259 Sum_probs=29.8
Q ss_pred HHHHHHHhccCCeeEEE-----eCCCCEEEEEeecCHHHHHHHHhc
Q 048021 26 SSMTKVLRKIKGASYTI-----DAEEGMAYITGRANPRKLLRKLQS 66 (143)
Q Consensus 26 ~kIkkaL~kI~GVsV~V-----Dle~~~VtV~G~vdp~~Ll~aI~k 66 (143)
+.+-+.|.+++|+++.+ +.=.++|||+|-+.-..|++.|+.
T Consensus 335 ~~~~~~l~~~~~~~v~v~~i~N~ffG~~vtV~GLltg~Dii~~l~~ 380 (433)
T TIGR03279 335 EPLVERLNAVEGLELDLHGLASDYWGQEITVTGLLTGQDLIAGLKG 380 (433)
T ss_pred HHHHHHHhcCCCcEEEEEEeccCCCCCCeeEeecccHHHHHHHhCC
Confidence 34444455568875433 333799999999999999999986
No 272
>PRK11145 pflA pyruvate formate lyase-activating enzyme 1; Provisional
Probab=25.51 E-value=63 Score=25.66 Aligned_cols=29 Identities=17% Similarity=0.119 Sum_probs=18.5
Q ss_pred CEEEEEee---cCHH---HHHHHHhc-CCceEEeec
Q 048021 47 GMAYITGR---ANPR---KLLRKLQS-GKYANLCWV 75 (143)
Q Consensus 47 ~~VtV~G~---vdp~---~Ll~aI~k-Gk~Ael~~~ 75 (143)
+-|+++|- +.++ +|++.+++ |.++.+...
T Consensus 72 ~~V~~sGGEPll~~~~~~~l~~~~k~~g~~i~l~TN 107 (246)
T PRK11145 72 GGVTASGGEAILQAEFVRDWFRACKKEGIHTCLDTN 107 (246)
T ss_pred CeEEEeCccHhcCHHHHHHHHHHHHHcCCCEEEECC
Confidence 46888862 2333 56777777 887665543
No 273
>TIGR02196 GlrX_YruB Glutaredoxin-like protein, YruB-family. This glutaredoxin-like protein family contains the conserved CxxC motif and includes the Clostridium pasteurianum protein YruB which has been cloned from a rubredoxin operon. Somewhat related to NrdH, it is unknown whether this protein actually interacts with glutathione/glutathione reducatase, or, like NrdH, some other reductant system.
Probab=25.44 E-value=1.6e+02 Score=17.70 Aligned_cols=19 Identities=21% Similarity=0.186 Sum_probs=13.0
Q ss_pred ecChhhHHHHHHHHhccCCee
Q 048021 19 TNDPAWFSSMTKVLRKIKGAS 39 (143)
Q Consensus 19 m~C~gC~~kIkkaL~kI~GVs 39 (143)
-.|+.|. +++++|+. .|+.
T Consensus 8 ~~C~~C~-~~~~~l~~-~~i~ 26 (74)
T TIGR02196 8 PWCPPCK-KAKEYLTS-KGIA 26 (74)
T ss_pred CCChhHH-HHHHHHHH-CCCe
Confidence 3699998 46677776 4563
No 274
>PF06504 RepC: Replication protein C (RepC); InterPro: IPR010522 This family consists of several bacterial replication protein C (RepC) sequences.
Probab=25.39 E-value=61 Score=27.93 Aligned_cols=28 Identities=21% Similarity=0.294 Sum_probs=23.7
Q ss_pred HHHHHHHHhccCCeeEEEe-CCCCEEEEE
Q 048021 25 FSSMTKVLRKIKGASYTID-AEEGMAYIT 52 (143)
Q Consensus 25 ~~kIkkaL~kI~GVsV~VD-le~~~VtV~ 52 (143)
..+|++||..|.++.++|+ ..+++++|.
T Consensus 248 R~~lR~AL~El~~~GW~V~e~~~g~~~I~ 276 (281)
T PF06504_consen 248 RQRLRKALAELAALGWTVDEYAKGKWEIG 276 (281)
T ss_pred HHHHHHHHHHHHHcCeEEeeccCccEEEe
Confidence 3578999999999889998 778888885
No 275
>cd02066 GRX_family Glutaredoxin (GRX) family; composed of GRX, approximately 10 kDa in size, and proteins containing a GRX or GRX-like domain. GRX is a glutathione (GSH) dependent reductase, catalyzing the disulfide reduction of target proteins such as ribonucleotide reductase. It contains a redox active CXXC motif in a TRX fold and uses a similar dithiol mechanism employed by TRXs for intramolecular disulfide bond reduction of protein substrates. Unlike TRX, GRX has preference for mixed GSH disulfide substrates, in which it uses a monothiol mechanism where only the N-terminal cysteine is required. The flow of reducing equivalents in the GRX system goes from NADPH - GSH reductase - GSH - GRX - protein substrates. By altering the redox state of target proteins, GRX is involved in many cellular functions including DNA synthesis, signal transduction and the defense against oxidative stress. Different classes are known including human GRX1 and GRX2, as well as E. coli GRX1 and GRX3, which
Probab=25.36 E-value=1.6e+02 Score=17.53 Aligned_cols=25 Identities=20% Similarity=0.301 Sum_probs=16.4
Q ss_pred eecChhhHHHHHHHHhccCCee-EEEeC
Q 048021 18 DTNDPAWFSSMTKVLRKIKGAS-YTIDA 44 (143)
Q Consensus 18 ~m~C~gC~~kIkkaL~kI~GVs-V~VDl 44 (143)
.-.|..|. +++..|.+. |+. ..+|.
T Consensus 7 ~~~Cp~C~-~~~~~L~~~-~i~~~~~di 32 (72)
T cd02066 7 KSTCPYCK-RAKRLLESL-GIEFEEIDI 32 (72)
T ss_pred CCCCHHHH-HHHHHHHHc-CCcEEEEEC
Confidence 34699998 677778875 353 34444
No 276
>PRK05096 guanosine 5'-monophosphate oxidoreductase; Provisional
Probab=25.28 E-value=2.2e+02 Score=25.22 Aligned_cols=59 Identities=17% Similarity=0.182 Sum_probs=42.7
Q ss_pred ChhhHHHHHHHHhccCCee-EEEeCCCC------------------EEEEEeec-CHHHHHHHHhcCCceEEeecCCCC
Q 048021 21 DPAWFSSMTKVLRKIKGAS-YTIDAEEG------------------MAYITGRA-NPRKLLRKLQSGKYANLCWVSAGN 79 (143)
Q Consensus 21 C~gC~~kIkkaL~kI~GVs-V~VDle~~------------------~VtV~G~v-dp~~Ll~aI~kGk~Ael~~~~~~~ 79 (143)
-+.+..++++.++...|++ +.||...+ .-.|-|.+ +++...+.|..|-++..+++++|.
T Consensus 106 ~~~d~er~~~L~~~~~g~D~iviD~AhGhs~~~i~~ik~ik~~~P~~~vIaGNV~T~e~a~~Li~aGAD~vKVGIGpGS 184 (346)
T PRK05096 106 SDADFEKTKQILALSPALNFICIDVANGYSEHFVQFVAKAREAWPDKTICAGNVVTGEMVEELILSGADIVKVGIGPGS 184 (346)
T ss_pred CHHHHHHHHHHHhcCCCCCEEEEECCCCcHHHHHHHHHHHHHhCCCCcEEEecccCHHHHHHHHHcCCCEEEEcccCCc
Confidence 4577888888777677885 88887653 34455765 677666655559999999998863
No 277
>cd03036 ArsC_like Arsenate Reductase (ArsC) family, unknown subfamily; uncharacterized proteins containing a CXXC motif with similarity to thioredoxin (TRX)-fold arsenic reductases, ArsC. Proteins containing a redox active CXXC motif like TRX and glutaredoxin (GRX) function as protein disulfide oxidoreductases, altering the redox state of target proteins via the reversible oxidation of the active site dithiol. ArsC catalyzes the reduction of arsenate [As(V)] to arsenite [As(III)], using reducing equivalents derived from glutathione via GRX, through a single catalytic cysteine.
Probab=25.20 E-value=1.9e+02 Score=20.49 Aligned_cols=42 Identities=21% Similarity=0.222 Sum_probs=25.9
Q ss_pred ecChhhHHHHHHHHhccCCee-EEEeCCCCEEEEEeecCHHHHHHHHhc-CC
Q 048021 19 TNDPAWFSSMTKVLRKIKGAS-YTIDAEEGMAYITGRANPRKLLRKLQS-GK 68 (143)
Q Consensus 19 m~C~gC~~kIkkaL~kI~GVs-V~VDle~~~VtV~G~vdp~~Ll~aI~k-Gk 68 (143)
-.|+.|.+. ++.|.+ .|+. ..+|..+.. .+.+.|.+.+.+ |-
T Consensus 7 ~~C~~c~ka-~~~L~~-~~i~~~~idi~~~~------~~~~el~~~~~~~~~ 50 (111)
T cd03036 7 PKCSTCRKA-KKWLDE-HGVDYTAIDIVEEP------PSKEELKKWLEKSGL 50 (111)
T ss_pred CCCHHHHHH-HHHHHH-cCCceEEecccCCc------ccHHHHHHHHHHcCC
Confidence 479999955 566766 5774 455655432 345566666666 53
No 278
>TIGR00915 2A0602 The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family. This family is one of several subfamilies within the scope of pfam model pfam00873.
Probab=25.15 E-value=1.1e+02 Score=30.22 Aligned_cols=46 Identities=4% Similarity=-0.146 Sum_probs=34.4
Q ss_pred ChhhHHHHHHHHhccCCe-eEEEeCCC--CEEEEEe--------ecCHHHHHHHHhc
Q 048021 21 DPAWFSSMTKVLRKIKGA-SYTIDAEE--GMAYITG--------RANPRKLLRKLQS 66 (143)
Q Consensus 21 C~gC~~kIkkaL~kI~GV-sV~VDle~--~~VtV~G--------~vdp~~Ll~aI~k 66 (143)
=..-+++|++.|++++|+ +|..+... .++.|.= -++++.|.++|+.
T Consensus 695 L~~~a~~l~~~l~~~pg~~~v~~~~~~~~~~~~i~~d~~k~~~~Gls~~~v~~~l~~ 751 (1044)
T TIGR00915 695 LMQARNQLLGLAAQNPALTRVRPNGLEDEPQFKLNIDDEKAQALGVSIADINTTLST 751 (1044)
T ss_pred HHHHHHHHHHHHhcCCCeEEeeccccCCCceEEEEECHHHHHHcCCCHHHHHHHHHH
Confidence 356778899999999999 78877654 4566652 2578888888876
No 279
>PRK06719 precorrin-2 dehydrogenase; Validated
Probab=25.12 E-value=1.5e+02 Score=22.49 Aligned_cols=36 Identities=11% Similarity=0.221 Sum_probs=26.6
Q ss_pred EEEeCCCCEEEEEeecC-HHHHHHHHhc-CCceEEeec
Q 048021 40 YTIDAEEGMAYITGRAN-PRKLLRKLQS-GKYANLCWV 75 (143)
Q Consensus 40 V~VDle~~~VtV~G~vd-p~~Ll~aI~k-Gk~Ael~~~ 75 (143)
+-+|++.++|.|-|-.. ....++.|.+ |.++.+++.
T Consensus 7 ~~l~l~~~~vlVvGGG~va~rka~~Ll~~ga~V~VIsp 44 (157)
T PRK06719 7 LMFNLHNKVVVIIGGGKIAYRKASGLKDTGAFVTVVSP 44 (157)
T ss_pred eEEEcCCCEEEEECCCHHHHHHHHHHHhCCCEEEEEcC
Confidence 67899999999997543 3445566666 999998853
No 280
>TIGR01679 bact_FAD_ox FAD-linked oxidoreductase. This model represents a family of bacterial oxidoreductases with covalently linked FAD, closely related to two different eukaryotic oxidases, L-gulonolactone oxidase (EC 1.1.3.8) from rat and D-arabinono-1,4-lactone oxidase (EC 1.1.3.37) from Saccharomyces cerevisiae.
Probab=25.06 E-value=87 Score=27.54 Aligned_cols=32 Identities=19% Similarity=0.209 Sum_probs=28.2
Q ss_pred EEEeCCCCEEEEEeecCHHHHHHHHhc-CCceE
Q 048021 40 YTIDAEEGMAYITGRANPRKLLRKLQS-GKYAN 71 (143)
Q Consensus 40 V~VDle~~~VtV~G~vdp~~Ll~aI~k-Gk~Ae 71 (143)
+++|.++++|+|.+-+.-.+|.+.|.+ |....
T Consensus 64 ~~~d~~~~~v~v~aG~~l~~l~~~L~~~G~~l~ 96 (419)
T TIGR01679 64 VDVDQPTGLATVEAGTRLGALGPQLAQRGLGLE 96 (419)
T ss_pred eeecCCCCEEEEcCCCCHHHHHHHHHHcCCccc
Confidence 478999999999998999999999999 98544
No 281
>cd04874 ACT_Af1403 N-terminal ACT domain of the yet uncharacterized, small (~133 a.a.), putative amino acid binding protein, Af1403, and related domains. This CD includes the N-terminal ACT domain of the yet uncharacterized, small (~133 a.a.), putative amino acid binding protein, Af1403, from Archaeoglobus fulgidus and other related archeal ACT domains. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=25.01 E-value=1.7e+02 Score=17.68 Aligned_cols=42 Identities=26% Similarity=0.382 Sum_probs=21.4
Q ss_pred HHHHHHHHhccCC-e-eEEEeCC-CCE--EEEE--eecCHHHHHHHHhc
Q 048021 25 FSSMTKVLRKIKG-A-SYTIDAE-EGM--AYIT--GRANPRKLLRKLQS 66 (143)
Q Consensus 25 ~~kIkkaL~kI~G-V-sV~VDle-~~~--VtV~--G~vdp~~Ll~aI~k 66 (143)
-.+|...|.+-.. + ++..... ++. ++|. +......++++|++
T Consensus 14 l~~i~~~l~~~~~~i~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~L~~ 62 (72)
T cd04874 14 LRDLTGVIAEHGGNITYTQQFIEREGKARIYMELEGVGDIEELVEELRS 62 (72)
T ss_pred HHHHHHHHHhCCCCEEEEEEeccCCCeEEEEEEEeccccHHHHHHHHhC
Confidence 3466666665432 2 3444333 222 3332 33356677888877
No 282
>cd04906 ACT_ThrD-I_1 First of two tandem C-terminal ACT domains of threonine dehydratase I (ThrD-I; L-threonine hydrolyase). This CD includes the first of two tandem C-terminal ACT domains of threonine dehydratase I (ThrD-I; L-threonine hydrolyase) which catalyzes the committed step in branched chain amino acid biosynthesis in plants and microorganisms, the pyridoxal 5'-phosphate (PLP)-dependent dehydration/deamination of L-threonine (or L-serine) to 2-ketobutyrate (or pyruvate). ThrD-I is a cooperative, feedback-regulated (isoleucine and valine) allosteric enzyme that forms a tetramer and contains four pyridoxal phosphate moieties. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=24.96 E-value=2.2e+02 Score=19.08 Aligned_cols=60 Identities=15% Similarity=0.207 Sum_probs=36.7
Q ss_pred EEEEee-cChhhHHHHHHHHhccCCe-eEEEeC---CCCEEEEEe-----ecCHHHHHHHHhc-CCceEEee
Q 048021 14 GLKVDT-NDPAWFSSMTKVLRKIKGA-SYTIDA---EEGMAYITG-----RANPRKLLRKLQS-GKYANLCW 74 (143)
Q Consensus 14 vlkV~m-~C~gC~~kIkkaL~kI~GV-sV~VDl---e~~~VtV~G-----~vdp~~Ll~aI~k-Gk~Ael~~ 74 (143)
+|+|.+ .=+|=-.++.++|.+ .+| ++.-+. ....|.|.= .-+.++++++|++ |++++...
T Consensus 3 vl~v~ipD~PG~L~~ll~~l~~-anI~~~~y~~~~~~~~~v~i~ie~~~~~~~~~~i~~~L~~~G~~~~~~~ 73 (85)
T cd04906 3 LLAVTIPERPGSFKKFCELIGP-RNITEFNYRYADEKDAHIFVGVSVANGAEELAELLEDLKSAGYEVVDLS 73 (85)
T ss_pred EEEEecCCCCcHHHHHHHHhCC-CceeEEEEEccCCCeeEEEEEEEeCCcHHHHHHHHHHHHHCCCCeEECC
Confidence 567776 345555666666662 234 455544 344455432 2347899999999 99987643
No 283
>TIGR01437 selA_rel uncharacterized pyridoxal phosphate-dependent enzyme. This model describes a protein related to a number of pyridoxal phosphate-dependent enzymes, and in particular to selenocysteine synthase (SelA), which converts Ser to selenocysteine on its tRNA. While resembling SelA, this protein is found only in species that have a better candidate SelA or else lack the other genes (selB, selC, and selD) required for selenocysteine incorporation.
Probab=24.71 E-value=2.5e+02 Score=23.79 Aligned_cols=48 Identities=13% Similarity=0.105 Sum_probs=32.0
Q ss_pred HHHHHHhccCCeeEEEeC-------CCCEEEEEe---ecCHHHHHHHHhc-CCceEEee
Q 048021 27 SMTKVLRKIKGASYTIDA-------EEGMAYITG---RANPRKLLRKLQS-GKYANLCW 74 (143)
Q Consensus 27 kIkkaL~kI~GVsV~VDl-------e~~~VtV~G---~vdp~~Ll~aI~k-Gk~Ael~~ 74 (143)
.+.+.|.+++|+.+.... ..-.|++.. .+++.+|.++|++ |..+..+.
T Consensus 274 ~l~~~L~~i~g~~~~~~~~~~~~~~~~~~v~~~~~~~g~~~~~l~~~L~~~~~~I~~r~ 332 (363)
T TIGR01437 274 PFIEALNTLKGVSASIVQDEAGRDIARAEIRFDESELGMTAADVVQALRQGEPAIYTRG 332 (363)
T ss_pred HHHHHHhcCCCeEEEEecCCCCCcCceEEEEEeccCCCCCHHHHHHHHhcCCCCEEEee
Confidence 466777788888543221 123455553 3689999999988 87777764
No 284
>PF03960 ArsC: ArsC family; InterPro: IPR006660 Several bacterial taxon have a chromosomal resistance system, encoded by the ars operon, for the detoxification of arsenate, arsenite, and antimonite []. This system transports arsenite and antimonite out of the cell. The pump is composed of two polypeptides, the products of the arsA and arsB genes. This two-subunit enzyme produces resistance to arsenite and antimonite. Arsenate, however, must first be reduced to arsenite before it is extruded. A third gene, arsC, expands the substrate specificity to allow for arsenate pumping and resistance. ArsC is an approximately 150-residue arsenate reductase that uses reduced glutathione (GSH) to convert arsenate to arsenite with a redox active cysteine residue in the active site. ArsC forms an active quaternary complex with GSH, arsenate, and glutaredoxin 1 (Grx1). The three ligands must be present simultaneously for reduction to occur []. The arsC family also comprises the Spx proteins which are GRAM-positive bacterial transcription factors that regulate the transcription of multiple genes in response to disulphide stress []. The arsC protein structure has been solved []. It belongs to the thioredoxin superfamily fold which is defined by a beta-sheet core surrounded by alpha-helices. The active cysteine residue of ArsC is located in the loop between the first beta-strand and the first helix, which is also conserved in the Spx protein and its homologues.; PDB: 2KOK_A 1SK1_A 1SK2_A 1JZW_A 1J9B_A 1S3C_A 1SD8_A 1SD9_A 1I9D_A 1SK0_A ....
Probab=24.67 E-value=1.4e+02 Score=20.91 Aligned_cols=42 Identities=21% Similarity=0.155 Sum_probs=25.2
Q ss_pred cChhhHHHHHHHHhccCCee-EEEeCCCCEEEEEeecCHHHHHHHHhc-CCc
Q 048021 20 NDPAWFSSMTKVLRKIKGAS-YTIDAEEGMAYITGRANPRKLLRKLQS-GKY 69 (143)
Q Consensus 20 ~C~gC~~kIkkaL~kI~GVs-V~VDle~~~VtV~G~vdp~~Ll~aI~k-Gk~ 69 (143)
.|..|.++ .+.|++ .|+. ..+|..+.. .+.++|.+.++. |-.
T Consensus 5 ~C~t~rka-~~~L~~-~gi~~~~~d~~k~p------~s~~el~~~l~~~~~~ 48 (110)
T PF03960_consen 5 NCSTCRKA-LKWLEE-NGIEYEFIDYKKEP------LSREELRELLSKLGNG 48 (110)
T ss_dssp T-HHHHHH-HHHHHH-TT--EEEEETTTS---------HHHHHHHHHHHTSS
T ss_pred CCHHHHHH-HHHHHH-cCCCeEeehhhhCC------CCHHHHHHHHHHhccc
Confidence 69999854 455666 5885 456776543 677888888888 743
No 285
>PRK10503 multidrug efflux system subunit MdtB; Provisional
Probab=24.54 E-value=1.4e+02 Score=29.67 Aligned_cols=41 Identities=5% Similarity=0.134 Sum_probs=31.2
Q ss_pred HHHHHHHhccCCe-eEEEeCC-CCEEEEEe--------ecCHHHHHHHHhc
Q 048021 26 SSMTKVLRKIKGA-SYTIDAE-EGMAYITG--------RANPRKLLRKLQS 66 (143)
Q Consensus 26 ~kIkkaL~kI~GV-sV~VDle-~~~VtV~G--------~vdp~~Ll~aI~k 66 (143)
+.|+..|++++|| +|++.-. +.++.|.= .+++.+|.++|+.
T Consensus 169 ~~l~~~L~~i~gV~~V~~~G~~~~ei~V~vd~~kl~~~gls~~~v~~ai~~ 219 (1040)
T PRK10503 169 TRVAQKISQVSGVGLVTLSGGQRPAVRVKLNAQAIAALGLTSETVRTAITG 219 (1040)
T ss_pred HHHHHHhcCCCCceEEEecCCCceEEEEEECHHHHHHcCCCHHHHHHHHHH
Confidence 5799999999999 7888764 45666662 2567788888876
No 286
>TIGR00735 hisF imidazoleglycerol phosphate synthase, cyclase subunit.
Probab=24.52 E-value=1.6e+02 Score=23.90 Aligned_cols=30 Identities=20% Similarity=0.456 Sum_probs=22.8
Q ss_pred EEEeCCCCEEEEEee--------cCHHHHHHHHhc-CCce
Q 048021 40 YTIDAEEGMAYITGR--------ANPRKLLRKLQS-GKYA 70 (143)
Q Consensus 40 V~VDle~~~VtV~G~--------vdp~~Ll~aI~k-Gk~A 70 (143)
..+|+.+++| |+|. .||.++++++++ |.+.
T Consensus 8 p~iD~~~G~~-V~~~~~~~~~~~~dp~~~a~~~~~~G~~~ 46 (254)
T TIGR00735 8 PCLDVRDGRV-VKGVQFLNLRDAGDPVELAQRYDEEGADE 46 (254)
T ss_pred EEEEeECCEE-EEeEeecCceECCCHHHHHHHHHHcCCCE
Confidence 5778888888 7653 488888888888 7654
No 287
>PRK03745 signal recognition particle protein Srp19; Provisional
Probab=24.48 E-value=1.6e+02 Score=21.59 Aligned_cols=41 Identities=15% Similarity=0.252 Sum_probs=27.8
Q ss_pred HHHHHHHHhccCCeeE-EEeC---------------CCCEEEEEeecCHHHHHHHHhc
Q 048021 25 FSSMTKVLRKIKGASY-TIDA---------------EEGMAYITGRANPRKLLRKLQS 66 (143)
Q Consensus 25 ~~kIkkaL~kI~GVsV-~VDl---------------e~~~VtV~G~vdp~~Ll~aI~k 66 (143)
.+.|..|++++ |+.+ .++. ..|.|.|.....-.+|+++|.+
T Consensus 34 ~~EI~~a~~~l-gl~~~~~E~~K~yPr~~g~~~~~~~~Grv~V~~~~~K~~ll~~Ia~ 90 (100)
T PRK03745 34 LEEIVDAAEAL-GFKVIEVDREKLNPRLSGIDEELRTRGRIVIESPYGKSKSLKMIAQ 90 (100)
T ss_pred HHHHHHHHHHc-CCCcEeeccCCCCCCccCCccchhcCCEEEecCCCCHHHHHHHHHH
Confidence 34677777764 5543 3442 2688888777788888888865
No 288
>cd04905 ACT_CM-PDT C-terminal ACT domain of the bifunctional chorismate mutase-prephenate dehydratase (CM-PDT) enzyme and the prephenate dehydratase (PDT) enzyme. The C-terminal ACT domain of the bifunctional chorismate mutase-prephenate dehydratase (CM-PDT) enzyme and the prephenate dehydratase (PDT) enzyme, found in plants, fungi, bacteria, and archaea. The P-protein of E. coli (CM-PDT, PheA) catalyzes the conversion of chorismate to prephenate and then the decarboxylation and dehydration to form phenylpyruvate. These are the first two steps in the biosynthesis of L-Phe and L-Tyr via the shikimate pathway in microorganisms and plants. The E. coli P-protein (CM-PDT) has three domains with an N-terminal domain with chorismate mutase activity, a middle domain with prephenate dehydratase activity, and an ACT regulatory C-terminal domain. The prephenate dehydratase enzyme has a PDT and ACT domain. The ACT domain is essential to bring about the negative allosteric regulation by L-Phe bindi
Probab=24.44 E-value=80 Score=20.74 Aligned_cols=27 Identities=11% Similarity=0.065 Sum_probs=15.4
Q ss_pred eEEEEEEeecChhhHHHHHHHHhccCC
Q 048021 11 MTCGLKVDTNDPAWFSSMTKVLRKIKG 37 (143)
Q Consensus 11 ~tvvlkV~m~C~gC~~kIkkaL~kI~G 37 (143)
..++|.|+..|.-+...++++|..|+.
T Consensus 42 ~~~~f~vd~~~~~~~~~~~~~l~~l~~ 68 (80)
T cd04905 42 WEYVFFIDFEGHIEDPNVAEALEELKR 68 (80)
T ss_pred ceEEEEEEEECCCCCHHHHHHHHHHHH
Confidence 346666666554445566666665544
No 289
>cd02973 TRX_GRX_like Thioredoxin (TRX)-Glutaredoxin (GRX)-like family; composed of archaeal and bacterial proteins that show similarity to both TRX and GRX, including the C-terminal TRX-fold subdomain of Pyrococcus furiosus protein disulfide oxidoreductase (PfPDO). All members contain a redox-active CXXC motif and may function as PDOs. The archaeal proteins Mj0307 and Mt807 show structures more similar to GRX, but activities more similar to TRX. Some members of the family are similar to PfPDO in that they contain a second CXXC motif located in a second TRX-fold subdomain at the N-terminus; the superimposable N- and C-terminal TRX subdomains form a compact structure. PfPDO is postulated to be the archaeal counterpart of bacterial DsbA and eukaryotic protein disulfide isomerase (PDI). The C-terminal CXXC motif of PfPDO is required for its oxidase, reductase and isomerase activities. Also included in the family is the C-terminal TRX-fold subdomain of the N-terminal domain (NTD) of bacteri
Probab=24.44 E-value=1.6e+02 Score=18.11 Aligned_cols=15 Identities=13% Similarity=0.115 Sum_probs=10.7
Q ss_pred cChhhHHHHHHHHhcc
Q 048021 20 NDPAWFSSMTKVLRKI 35 (143)
Q Consensus 20 ~C~gC~~kIkkaL~kI 35 (143)
.|+-|. .+++.|+++
T Consensus 10 ~C~~C~-~~~~~l~~l 24 (67)
T cd02973 10 TCPYCP-DAVQAANRI 24 (67)
T ss_pred CCCCcH-HHHHHHHHH
Confidence 799995 556667655
No 290
>TIGR01127 ilvA_1Cterm threonine dehydratase, medium form. A form of threonine dehydratase with two copies of the C-terminal domain Pfam:PF00585 is described by TIGR01124. This model describes a phylogenetically distinct form with a single copy of pfam00585. This form branches with the catabolic threonine dehydratase of E. coli; many members are designated as catabolic for this reason. However, the catabolic form lacks any pfam00585 domain. Many members of this model are found in species with other Ile biosynthetic enzymes.
Probab=24.38 E-value=3.4e+02 Score=23.15 Aligned_cols=62 Identities=13% Similarity=0.180 Sum_probs=37.8
Q ss_pred eEEEEEEee-cChhhHHHHHHHHhccCC-e-eEEEe---C--CCCE--EEEEee----cCHHHHHHHHhc-CCceEE
Q 048021 11 MTCGLKVDT-NDPAWFSSMTKVLRKIKG-A-SYTID---A--EEGM--AYITGR----ANPRKLLRKLQS-GKYANL 72 (143)
Q Consensus 11 ~tvvlkV~m-~C~gC~~kIkkaL~kI~G-V-sV~VD---l--e~~~--VtV~G~----vdp~~Ll~aI~k-Gk~Ael 72 (143)
+.++|.|.+ .=.|=-.+|-..|.+..+ | +|..+ . ..++ |+|.-+ .+.++|+++|++ |+++++
T Consensus 304 r~~~l~v~l~D~pG~L~~v~~~i~~~~~NI~~i~~~r~~~~~~~~~~~v~v~vet~~~~~~~~i~~~L~~~G~~v~~ 380 (380)
T TIGR01127 304 RKVRIETVLPDRPGALYHLLESIAEARANIVKIDHDRLSKEIPPGFAMVEITLETRGKEHLDEILKILRDMGYNFYV 380 (380)
T ss_pred CEEEEEEEeCCCCCHHHHHHHHHhcCCCcEEEEEeeccccCCCCceEEEEEEEEeCCHHHHHHHHHHHHHcCCcccC
Confidence 455677776 457767777777776544 2 44333 1 1233 444322 356689999999 998763
No 291
>TIGR01514 NAPRTase nicotinate phosphoribosyltransferase. This model represents nicotinate phosphoribosyltransferase, the first enzyme in the salvage pathway of NAD biosynthesis from nicontinate (niacin). Members are primary proteobacterial but also include yeasts and Methanosarcina acetivorans. A related family, apparently non-overlapping in species distribution, is TIGR01513. Members of that family differ in substantially in sequence and have a long C-terminal extension missing from this family, but are proposed also to act as nicotinate phosphoribosyltransferase (see model TIGR01513).
Probab=24.36 E-value=1.4e+02 Score=26.73 Aligned_cols=38 Identities=11% Similarity=0.165 Sum_probs=30.7
Q ss_pred EEeCCCCEEEEEeecCHHHHHHHHh--cCCceEEeecCCC
Q 048021 41 TIDAEEGMAYITGRANPRKLLRKLQ--SGKYANLCWVSAG 78 (143)
Q Consensus 41 ~VDle~~~VtV~G~vdp~~Ll~aI~--kGk~Ael~~~~~~ 78 (143)
-+|.+.+++.+++.+|.+++++..+ +|.....++++..
T Consensus 302 gidp~~K~iv~Sd~Lde~~i~~L~~~~~g~~~d~FGVGT~ 341 (394)
T TIGR01514 302 GIDPKSKIIIFSDSLDVEKAIELSHYFKGRVKASFGIGTN 341 (394)
T ss_pred CCCCCCcEEEEcCCCCHHHHHHHHHHhcCCCceeEecCcc
Confidence 3477777788899999999998887 4877788888764
No 292
>PF13241 NAD_binding_7: Putative NAD(P)-binding; PDB: 3DFZ_B 1PJT_A 1PJS_A 1PJQ_A 1KYQ_B.
Probab=24.12 E-value=1.9e+02 Score=20.07 Aligned_cols=37 Identities=14% Similarity=0.136 Sum_probs=22.8
Q ss_pred EEEeCCCCEEEEEeecC-HHHHHHHHhc-CCceEEeecC
Q 048021 40 YTIDAEEGMAYITGRAN-PRKLLRKLQS-GKYANLCWVS 76 (143)
Q Consensus 40 V~VDle~~~VtV~G~vd-p~~Ll~aI~k-Gk~Ael~~~~ 76 (143)
+-+|+++.+|.|.|.-+ ...-++.|.+ |-++.+++..
T Consensus 1 l~l~l~~~~vlVvGgG~va~~k~~~Ll~~gA~v~vis~~ 39 (103)
T PF13241_consen 1 LFLDLKGKRVLVVGGGPVAARKARLLLEAGAKVTVISPE 39 (103)
T ss_dssp EEE--TT-EEEEEEESHHHHHHHHHHCCCTBEEEEEESS
T ss_pred CEEEcCCCEEEEECCCHHHHHHHHHHHhCCCEEEEECCc
Confidence 35789999999997643 3333444455 9888888876
No 293
>PRK10638 glutaredoxin 3; Provisional
Probab=24.08 E-value=1.7e+02 Score=19.25 Aligned_cols=30 Identities=20% Similarity=0.257 Sum_probs=18.8
Q ss_pred EEEEeecChhhHHHHHHHHhccCCee-EEEeCC
Q 048021 14 GLKVDTNDPAWFSSMTKVLRKIKGAS-YTIDAE 45 (143)
Q Consensus 14 vlkV~m~C~gC~~kIkkaL~kI~GVs-V~VDle 45 (143)
+|=..-.|.-|. +++++|.. .|+. ..+|++
T Consensus 5 ~ly~~~~Cp~C~-~a~~~L~~-~gi~y~~~dv~ 35 (83)
T PRK10638 5 EIYTKATCPFCH-RAKALLNS-KGVSFQEIPID 35 (83)
T ss_pred EEEECCCChhHH-HHHHHHHH-cCCCcEEEECC
Confidence 333344799998 67777877 4773 344443
No 294
>PF08478 POTRA_1: POTRA domain, FtsQ-type; InterPro: IPR013685 FtsQ/DivIB bacterial division proteins (IPR005548 from INTERPRO) contain an N-terminal POTRA domain (for polypeptide-transport-associated domain). This is found in different types of proteins, usually associated with a transmembrane beta-barrel. FtsQ/DivIB may have chaperone-like roles, which has also been postulated for the POTRA domain in other contexts []. ; PDB: 2ALJ_A 2VH1_B 3J00_Z 2VH2_B.
Probab=23.98 E-value=85 Score=19.77 Aligned_cols=28 Identities=14% Similarity=0.186 Sum_probs=20.3
Q ss_pred HHHHHHHHhccCCe-eEEEeCC-CCEEEEE
Q 048021 25 FSSMTKVLRKIKGA-SYTIDAE-EGMAYIT 52 (143)
Q Consensus 25 ~~kIkkaL~kI~GV-sV~VDle-~~~VtV~ 52 (143)
.+++++.|.+++.| +++|... .+++.|.
T Consensus 36 ~~~~~~~l~~~p~V~~v~V~r~~P~~l~I~ 65 (69)
T PF08478_consen 36 LKKIEQRLEKLPWVKSVSVSRRFPNTLEIK 65 (69)
T ss_dssp HHHHHHCCCCTTTEEEEEEEEETTTEEEEE
T ss_pred HHHHHHHHHcCCCEEEEEEEEeCCCEEEEE
Confidence 46788888889999 6777654 5566664
No 295
>PF00011 HSP20: Hsp20/alpha crystallin family This prints entry is a subset of the Pfam entry.; InterPro: IPR002068 Prokaryotic and eukaryotic organisms respond to heat shock or other environmental stress by inducing the synthesis of proteins collectively known as heat-shock proteins (hsp) []. Amongst them is a family of proteins with an average molecular weight of 20 Kd, known as the hsp20 proteins []. These seem to act as chaperones that can protect other proteins against heat-induced denaturation and aggregation. Hsp20 proteins seem to form large heterooligomeric aggregates. Structurally, this family is characterised by the presence of a conserved C-terminal domain of about 100 residues.; PDB: 2BOL_B 3N3E_B 2H50_P 2H53_F 2BYU_L 1GME_D 3VQM_J 3VQK_E 3VQL_A 3AAC_A ....
Probab=23.97 E-value=71 Score=21.74 Aligned_cols=23 Identities=17% Similarity=0.416 Sum_probs=17.9
Q ss_pred ccCCe---eEEEeCCCCEEEEEeecC
Q 048021 34 KIKGA---SYTIDAEEGMAYITGRAN 56 (143)
Q Consensus 34 kI~GV---sV~VDle~~~VtV~G~vd 56 (143)
.++|+ ++.|..+++.++|+|...
T Consensus 14 ~lpG~~~edi~I~~~~~~L~I~g~~~ 39 (102)
T PF00011_consen 14 DLPGFDKEDIKIKVDDNKLVISGKRK 39 (102)
T ss_dssp E-TTS-GGGEEEEEETTEEEEEEEEE
T ss_pred ECCCCChHHEEEEEecCccceeceee
Confidence 45777 388888999999999755
No 296
>PF10934 DUF2634: Protein of unknown function (DUF2634); InterPro: IPR020288 This entry is represented by the Bacteriophage EJ-1, Orf60. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. Bacteriophage EJ-1, Orf60 function has not been characterised. It has been shown to be simialr to XkdS (P54331 from SWISSPROT), which is encoded on a phage-like element (prophage) of PSBX found in Bacillus subtilis.
Probab=23.95 E-value=1.4e+02 Score=21.63 Aligned_cols=32 Identities=13% Similarity=0.237 Sum_probs=24.3
Q ss_pred hhhHHHHHHHHh---ccCCe-eEEEeCCCCEEEEEe
Q 048021 22 PAWFSSMTKVLR---KIKGA-SYTIDAEEGMAYITG 53 (143)
Q Consensus 22 ~gC~~kIkkaL~---kI~GV-sV~VDle~~~VtV~G 53 (143)
.--.+.|+.||. .|.+| ++++..+++++.|+=
T Consensus 68 sEi~r~I~EaL~~d~rI~~V~~f~f~~~~~~l~v~f 103 (112)
T PF10934_consen 68 SEIEREIEEALLQDPRITSVENFSFEWEGDSLYVSF 103 (112)
T ss_pred HHHHHHHHHHHhcCCCcceEEEEEEEEECCEEEEEE
Confidence 345678999995 45677 688888888888874
No 297
>PRK08452 flagellar protein FlaG; Provisional
Probab=23.93 E-value=1.8e+02 Score=22.02 Aligned_cols=28 Identities=4% Similarity=0.160 Sum_probs=21.8
Q ss_pred eEEEeCCCCEEEEE------e----ecCHHHHHHHHhc
Q 048021 39 SYTIDAEEGMAYIT------G----RANPRKLLRKLQS 66 (143)
Q Consensus 39 sV~VDle~~~VtV~------G----~vdp~~Ll~aI~k 66 (143)
.+.+|-+.+.+.|+ | .++|+++++.+++
T Consensus 74 ~F~~de~~~~~vVkVvD~~T~eVIRqIP~Ee~L~l~~~ 111 (124)
T PRK08452 74 RFGYNDKIKGLVVSVKEANGGKVIREIPSKEAIELMEY 111 (124)
T ss_pred EEEEcCCCCcEEEEEEECCCCceeeeCCCHHHHHHHHH
Confidence 58888888888886 2 2588988888776
No 298
>PF00241 Cofilin_ADF: Cofilin/tropomyosin-type actin-binding protein; InterPro: IPR002108 The actin-depolymerising factor homology (ADF-H) domain is an ~150-amino acid motif that is present in three phylogenetically distinct classes of eukaryotic actin-binding proteins [, , ]: ADF/cofilins, which include ADF, cofilin, destrin, actophorin, coactosin, depactin and glia maturation factors (GMFs) beta and gamma. ADF/cofilins are small actin-binding proteins composed of a single ADF-H domain. They bind both actin-monomers and filaments and promote rapid filament turnover in cells by depolymerising/fragmenting actin filaments. ADF/cofilins bind ADP-actin with higher affinity than ATP-actin and inhibit the spontaneous nucleotide exchange on actin monomers Twinfilins, which are actin monomer-binding proteins that are composed of two ADF-H domains Abp1/Drebrins, which are relatively large proteins composed of an N-terminal ADF-H domain followed by a variable region and a C-terminal SH3 domain. Abp1/Drebrins interact only with actin filaments and do not promote filament depolymerisation or fragmentation Although these proteins are biochemically distinct and play different roles in actin dynamics, they all appear to use the ADF-H domain for their interactions with actin. The ADF-H domain consists of a six-stranded mixed beta-sheet in which the four central strands (beta2-beta5) are anti-parallel and the two edge strands (beta1 and beta6) run parallel with the neighbouring strands. The sheet is surrounded by two alpha-helices on each side [, , ].; GO: 0003779 actin binding, 0005622 intracellular; PDB: 1AK6_A 1AK7_A 1V6F_A 2L72_A 1CFY_A 1QPV_A 1COF_A 1TVJ_A 1X67_A 3L50_A ....
Probab=23.86 E-value=1.3e+02 Score=21.25 Aligned_cols=44 Identities=16% Similarity=0.189 Sum_probs=31.9
Q ss_pred hhHHHHHHHHhc--cCCeeEEEeCCCCEEEEEe----ecCHHHHHHHHhc
Q 048021 23 AWFSSMTKVLRK--IKGASYTIDAEEGMAYITG----RANPRKLLRKLQS 66 (143)
Q Consensus 23 gC~~kIkkaL~k--I~GVsV~VDle~~~VtV~G----~vdp~~Ll~aI~k 66 (143)
.|..++++...+ ...+-+.|+.++.+|+|.. ..+.+.|++.|.+
T Consensus 2 e~~~~~~~~~~~~~~~~i~~~i~~~~~~i~v~~~g~~~~~~~el~~~l~~ 51 (127)
T PF00241_consen 2 ECKAAFQELKSKKSTRWIIFKIDDEKEEIVVEKSGSEGGSFDELLSHLPD 51 (127)
T ss_dssp HHHHHHHHHHTTTSCSEEEEEEETTSTEEEEEEEEEESSHHHHHHHCSCT
T ss_pred HHHHHHHHHHcCCCceEEEEEEeCCCcEEEEEeccCCCCCHHHHHHhccc
Confidence 577888886655 2223488888888999974 3578888888875
No 299
>COG3286 Uncharacterized protein conserved in archaea [Function unknown]
Probab=23.62 E-value=66 Score=26.52 Aligned_cols=23 Identities=17% Similarity=0.158 Sum_probs=20.3
Q ss_pred ecCHHHHHHHHhc-CCceEEeecC
Q 048021 54 RANPRKLLRKLQS-GKYANLCWVS 76 (143)
Q Consensus 54 ~vdp~~Ll~aI~k-Gk~Ael~~~~ 76 (143)
.++|+.|+++|+- |+++++.+-.
T Consensus 88 ~Vp~~vl~daLk~~GyrVevr~~~ 111 (204)
T COG3286 88 NVPPDVLIDALKLLGYRVEVRGGE 111 (204)
T ss_pred CCCHHHHHHHHHhCCceEEeeCce
Confidence 3899999999999 9999988754
No 300
>PRK14426 acylphosphatase; Provisional
Probab=23.59 E-value=2.6e+02 Score=19.45 Aligned_cols=64 Identities=16% Similarity=0.177 Sum_probs=41.3
Q ss_pred eEEEEEEeec--ChhhHHHHHHHHhccCCe-eEEEeCCCCEEEEEeecCHHH---HHHHHhcC--CceEEeec
Q 048021 11 MTCGLKVDTN--DPAWFSSMTKVLRKIKGA-SYTIDAEEGMAYITGRANPRK---LLRKLQSG--KYANLCWV 75 (143)
Q Consensus 11 ~tvvlkV~m~--C~gC~~kIkkaL~kI~GV-sV~VDle~~~VtV~G~vdp~~---Ll~aI~kG--k~Ael~~~ 75 (143)
.++.+.|.+. .-|...-+.+.-.++ |+ -+.-++.+++|.|....++++ +++.|++| ..|.+-.+
T Consensus 4 ~~~~~~v~G~VQGVGFR~~v~~~A~~~-gl~G~V~N~~dG~Vei~~~G~~~~i~~f~~~l~~g~P~~a~V~~i 75 (92)
T PRK14426 4 VCIIAWVYGRVQGVGFRYHTQHEALKL-GLTGYAKNLDDGSVEVVACGEEEQVEKLMEWLKEGGPRSARVDRV 75 (92)
T ss_pred EEEEEEEEEeeCCcCchHHHHHHHHHh-CCEEEEEECCCCcEEEEEEeCHHHHHHHHHHHhcCCCCCeEEEEE
Confidence 4567777764 556777777766664 77 477888999888876656555 44555445 33544444
No 301
>COG5494 Predicted thioredoxin/glutaredoxin [Posttranslational modification, protein turnover, chaperones]
Probab=23.48 E-value=2.1e+02 Score=24.27 Aligned_cols=48 Identities=8% Similarity=0.137 Sum_probs=30.7
Q ss_pred eecChhhHHHHHHHHhc---cCCee-EEEeC-----------------CCCEEEEEeecCHHHHHHHHhc
Q 048021 18 DTNDPAWFSSMTKVLRK---IKGAS-YTIDA-----------------EEGMAYITGRANPRKLLRKLQS 66 (143)
Q Consensus 18 ~m~C~gC~~kIkkaL~k---I~GVs-V~VDl-----------------e~~~VtV~G~vdp~~Ll~aI~k 66 (143)
.-+|..|. .+-+.|++ +++|. ++..+ .+|++..-+.+||+.|..+|+-
T Consensus 18 HktC~ssy-~Lf~~L~nkgll~~Vkii~a~~p~f~~~~~~V~SvP~Vf~DGel~~~dpVdp~~ies~~~G 86 (265)
T COG5494 18 HKTCVSSY-MLFEYLENKGLLGKVKIIDAELPPFLAFEKGVISVPSVFIDGELVYADPVDPEEIESILSG 86 (265)
T ss_pred ecchHHHH-HHHHHHHhcCCCCCceEEEcCCChHHHhhcceeecceEEEcCeEEEcCCCCHHHHHHHHcC
Confidence 34799998 66777887 55553 33221 2344444556899999888764
No 302
>COG1084 Predicted GTPase [General function prediction only]
Probab=23.41 E-value=1.2e+02 Score=27.00 Aligned_cols=47 Identities=19% Similarity=0.299 Sum_probs=34.2
Q ss_pred HHHHHHHHhccCCeeEEEeCCCCEEEEEe--ecCHHHHHHHHhcCCceEEeecC
Q 048021 25 FSSMTKVLRKIKGASYTIDAEEGMAYITG--RANPRKLLRKLQSGKYANLCWVS 76 (143)
Q Consensus 25 ~~kIkkaL~kI~GVsV~VDle~~~VtV~G--~vdp~~Ll~aI~kGk~Ael~~~~ 76 (143)
.++++..|++|+ .||++.-||+|.| ++--..|+++|.. -+.|+.+.+
T Consensus 151 L~~~r~~l~~LP----~Idp~~pTivVaG~PNVGKSSlv~~lT~-AkpEvA~YP 199 (346)
T COG1084 151 LRKARDHLKKLP----AIDPDLPTIVVAGYPNVGKSSLVRKLTT-AKPEVAPYP 199 (346)
T ss_pred HHHHHHHHhcCC----CCCCCCCeEEEecCCCCcHHHHHHHHhc-CCCccCCCC
Confidence 356677778888 5688999999999 5778899988875 334444444
No 303
>PF00438 S-AdoMet_synt_N: S-adenosylmethionine synthetase, N-terminal domain; InterPro: IPR022628 The three domains of S-adenosylmethionine synthetase have the same alpha+beta fold. This entry represents the N-terminal domain of S-adenosylmethionine synthetase and is found in association with PF02772 from PFAM and PF02773 from PFAM. S-adenosylmethionine synthetase (MAT, 2.5.1.6 from EC) is the enzyme that catalyzes the formation of S-adenosylmethionine (AdoMet) from methionine and ATP []. AdoMet is an important methyl donor for transmethylation and is also the propylamino donor in polyamine biosynthesis. In bacteria there is a single isoform of AdoMet synthetase (gene metK), there are two in budding yeast (genes SAM1 and SAM2) and in mammals while in plants there is generally a multigene family. The sequence of AdoMet synthetase is highly conserved throughout isozymes and species. The active sites of both the Escherichia coli and rat liver MAT reside between two subunits, with contributions from side chains of residues from both subunits, resulting in a dimer as the minimal catalytic entity. The side chains that contribute to the ligand binding sites are conserved between the two proteins. In the structures of complexes with the E. coli enzyme, the phosphate groups have the same positions in the (PPi plus Pi) complex and the (ADP plus Pi) complex, and are located at the bottom of a deep cavity with the adenosyl group nearer the entrance []; GO: 0004478 methionine adenosyltransferase activity; PDB: 3RV2_A 3TDE_B 3S82_B 3IML_B 2P02_A 2OBV_A 1QM4_B 1O9T_B 1O93_A 1O92_B ....
Probab=23.37 E-value=1.6e+02 Score=21.53 Aligned_cols=50 Identities=16% Similarity=0.172 Sum_probs=29.6
Q ss_pred ChhhHHHHHHHHhccCC---eeEEEeCCCCEEEEEeec------CHHHHHH-HHhc-CCce
Q 048021 21 DPAWFSSMTKVLRKIKG---ASYTIDAEEGMAYITGRA------NPRKLLR-KLQS-GKYA 70 (143)
Q Consensus 21 C~gC~~kIkkaL~kI~G---VsV~VDle~~~VtV~G~v------dp~~Ll~-aI~k-Gk~A 70 (143)
|+-=..+|--++.+.+. |.+++....+.|.|.|++ |-++|++ .|++ ||.-
T Consensus 19 cDqISDailD~~l~~dp~arVA~E~~~~~~~v~i~GEi~~~~~vd~~~ivR~~i~~IGY~~ 79 (100)
T PF00438_consen 19 CDQISDAILDACLKQDPNARVACETLVSTGMVIIAGEITSRAYVDIEKIVREVIKDIGYDD 79 (100)
T ss_dssp HHHHHHHHHHHHHHH-TT-EEEEEEEEETTEEEEEEEEESSHHHTHHHHHHHHHHHHT-EE
T ss_pred hceeeeccchHHHhcCCCCeEEEEEEeeccEEEEEEEeccchhhhHHHHHHHHHHHhCCCC
Confidence 33334443333334443 348889999999999974 5555554 4566 9975
No 304
>PLN00072 3-isopropylmalate isomerase/dehydratase small subunit; Provisional
Probab=23.34 E-value=2.1e+02 Score=24.14 Aligned_cols=32 Identities=13% Similarity=0.153 Sum_probs=21.5
Q ss_pred CCeeEEEeCCCCEEEEE--ee-----cCHHHHHHHHhc-CC
Q 048021 36 KGASYTIDAEEGMAYIT--GR-----ANPRKLLRKLQS-GK 68 (143)
Q Consensus 36 ~GVsV~VDle~~~VtV~--G~-----vdp~~Ll~aI~k-Gk 68 (143)
+|-.++||+++++|+-. |. ..++ +++.|+. |.
T Consensus 194 ~Gd~i~VDl~~~~v~n~t~g~~~~~~p~~~-~~~Il~aGGl 233 (246)
T PLN00072 194 TGDVVTVELGNSVLINHTTGKEYKLKPIGD-AGPVIDAGGI 233 (246)
T ss_pred CCCEEEEECCCCEEEECCCCeEEEecCCHH-HHHHHHcCCH
Confidence 45579999999998643 32 3454 7777776 53
No 305
>TIGR00311 aIF-2beta translation initiation factor aIF-2, beta subunit, putative.
Probab=23.09 E-value=1.5e+02 Score=22.50 Aligned_cols=36 Identities=14% Similarity=0.290 Sum_probs=26.2
Q ss_pred CeeEEEeCCCCEEEEEeecCHHHHHHHHhc-CCceEEee
Q 048021 37 GASYTIDAEEGMAYITGRANPRKLLRKLQS-GKYANLCW 74 (143)
Q Consensus 37 GVsV~VDle~~~VtV~G~vdp~~Ll~aI~k-Gk~Ael~~ 74 (143)
|....+| +++++|.|..++..|.+.|++ =..=+++.
T Consensus 65 gt~g~i~--~~rlii~G~~~~~~i~~~L~~yI~~yVlC~ 101 (133)
T TIGR00311 65 GTAGNLE--GGRLILQGKFTHFLLNERIEDYVRKYVICR 101 (133)
T ss_pred CCCceec--CCEEEEEeecCHHHHHHHHHHHHhheEECC
Confidence 4434454 689999999999999999988 43334443
No 306
>TIGR00158 L9 ribosomal protein L9. Ribosomal protein L9 appears to be universal in, but restricted to, eubacteria and chloroplast.
Probab=23.02 E-value=1.7e+02 Score=22.44 Aligned_cols=41 Identities=15% Similarity=0.215 Sum_probs=27.2
Q ss_pred HhccCCeeEEEeCCCC-EEEEEeecCHHHHHHHHhc-CCceEE
Q 048021 32 LRKIKGASYTIDAEEG-MAYITGRANPRKLLRKLQS-GKYANL 72 (143)
Q Consensus 32 L~kI~GVsV~VDle~~-~VtV~G~vdp~~Ll~aI~k-Gk~Ael 72 (143)
..+|+++.++|..+.+ .=.+-|.+.+..|.++|.+ |..++-
T Consensus 69 ~~~l~~~~~~i~~k~ge~gklfGSVt~~~I~~~l~~~g~~idk 111 (148)
T TIGR00158 69 KEVLELGTLTISKKVGDEGKLFGSITTKQIADALKAAGLDLDK 111 (148)
T ss_pred HHHhcCcEEEEEEEeCCCCeEEEeECHHHHHHHHHHcCCcccH
Confidence 3446666555543322 2234588999999999988 998875
No 307
>PLN03210 Resistant to P. syringae 6; Provisional
Probab=22.88 E-value=1.7e+02 Score=29.19 Aligned_cols=76 Identities=18% Similarity=0.045 Sum_probs=44.7
Q ss_pred CCCCCCCCCceEEEEEEe--e--cChhhHHHHHHHHhccCCeeEEEeCCCCEEEEEeecCHHHHHHHHhc-CCceEEeec
Q 048021 1 MFSSSSSSAKMTCGLKVD--T--NDPAWFSSMTKVLRKIKGASYTIDAEEGMAYITGRANPRKLLRKLQS-GKYANLCWV 75 (143)
Q Consensus 1 ~~~ss~~s~~~tvvlkV~--m--~C~gC~~kIkkaL~kI~GVsV~VDle~~~VtV~G~vdp~~Ll~aI~k-Gk~Ael~~~ 75 (143)
|-||||+++....-.-++ + +-.+=..-+.++|.. +|+.+=+|-+ .-.|+.-...|.+||+. ...+.+++.
T Consensus 1 ~~~~~~~~~~~~~~vf~sfrg~d~r~~f~~hl~~~l~~-~~i~~f~d~~----~~~g~~~~~~l~~~i~~s~~~ivv~s~ 75 (1153)
T PLN03210 1 MASSSSSSRNWVYDVFPSFSGEDVRITFLSHFLKELDR-KLIIAFKDNE----IERSQSLDPELKQAIRDSRIAVVVFSK 75 (1153)
T ss_pred CCCCCCCCCCCCCcEEeeCCCcccccCHHHHHHHHHHH-CCCeEEccCC----ccCCCcccHHHHHHHHhCeEEEEEecC
Confidence 445555555443222222 2 346677788889877 7885334532 22455445568888888 777777776
Q ss_pred CCCCCC
Q 048021 76 SAGNQI 81 (143)
Q Consensus 76 ~~~~s~ 81 (143)
...+|.
T Consensus 76 ~ya~s~ 81 (1153)
T PLN03210 76 NYASSS 81 (1153)
T ss_pred Ccccch
Confidence 666553
No 308
>cd07042 STAS_SulP_like_sulfate_transporter Sulphate Transporter and Anti-Sigma factor antagonist domain of SulP-like sulfate transporters, plays a role in the function and regulation of the transport activity, proposed general NTP binding function. The SulP family is a large and diverse family of anion transporters, with members from eubacteria, plants, fungi, and mammals. They contain 10 to 14 transmembrane helices which form the catalytic core of the protein and a C-terminal extension, the STAS (Sulphate Transporter and AntiSigma factor antagonist) domain which plays a role in the function and regulation of the transport activity. The STAS domain is found in the C-terminal region of sulphate transporters and bacterial anti-sigma factor antagonists. It has been suggested that this domain may have a general NTP binding function.
Probab=22.78 E-value=1.4e+02 Score=19.73 Aligned_cols=68 Identities=12% Similarity=0.090 Sum_probs=36.1
Q ss_pred ceEEEEEEee-----cChhhHHHHHHHHhccCCe-eEEEeCCCC-EEEEEeecCHHHHHHHHhc-CCceEEeecCC
Q 048021 10 KMTCGLKVDT-----NDPAWFSSMTKVLRKIKGA-SYTIDAEEG-MAYITGRANPRKLLRKLQS-GKYANLCWVSA 77 (143)
Q Consensus 10 ~~tvvlkV~m-----~C~gC~~kIkkaL~kI~GV-sV~VDle~~-~VtV~G~vdp~~Ll~aI~k-Gk~Ael~~~~~ 77 (143)
..++++++.+ .++.-.+.+.+++..-+.. .+.+|+..- .+--.|-.--..+.+.+++ |.++.++++.+
T Consensus 8 ~~~~v~~l~G~l~~~~~~~l~~~~~~~~~~~~~~~~lilD~~~v~~iDss~~~~L~~~~~~~~~~~~~~~l~~~~~ 83 (107)
T cd07042 8 PGVLIYRIDGPLFFGNAEYFKDRLLRLVDEDPPLKVVILDLSAVNFIDSTAAEALEELVKDLRKRGVELYLAGLNP 83 (107)
T ss_pred CCEEEEEecCceEeehHHHHHHHHHHHhccCCCceEEEEECCCCchhhHHHHHHHHHHHHHHHHCCCEEEEecCCH
Confidence 3456677653 3555556665555432213 355666542 1111122223555666666 98888888765
No 309
>PF14334 DUF4390: Domain of unknown function (DUF4390)
Probab=22.71 E-value=1.8e+02 Score=22.18 Aligned_cols=33 Identities=18% Similarity=0.261 Sum_probs=27.0
Q ss_pred eEEEeCCCCEEEEEeec---CHHHHHHHHhcCCceE
Q 048021 39 SYTIDAEEGMAYITGRA---NPRKLLRKLQSGKYAN 71 (143)
Q Consensus 39 sV~VDle~~~VtV~G~v---dp~~Ll~aI~kGk~Ae 71 (143)
++.++..++.+.|...+ -++.|.+||.+|-...
T Consensus 10 ~~~~~~~~~~l~l~a~~~~~l~~~l~~AL~~Gipl~ 45 (165)
T PF14334_consen 10 SAELENSDGGLYLSADVDFELSPELEDALKNGIPLY 45 (165)
T ss_pred EEEEEEeCCEEEEEEEEeccCCHHHHHHHHcCCeEE
Confidence 48888899999999765 4899999999976654
No 310
>KOG1752 consensus Glutaredoxin and related proteins [Posttranslational modification, protein turnover, chaperones]
Probab=22.71 E-value=2.9e+02 Score=20.01 Aligned_cols=43 Identities=14% Similarity=0.322 Sum_probs=28.6
Q ss_pred eecChhhHHHHHHHHhccCCe-eEEEeCCCCEEEEEeecCHHHHHHHHhc--CCc
Q 048021 18 DTNDPAWFSSMTKVLRKIKGA-SYTIDAEEGMAYITGRANPRKLLRKLQS--GKY 69 (143)
Q Consensus 18 ~m~C~gC~~kIkkaL~kI~GV-sV~VDle~~~VtV~G~vdp~~Ll~aI~k--Gk~ 69 (143)
--.|.-|.+ +++.|.+ .|| -..+++++. -+..+|.++|.+ |.+
T Consensus 21 Ks~C~~c~~-~k~ll~~-~~v~~~vvELD~~-------~~g~eiq~~l~~~tg~~ 66 (104)
T KOG1752|consen 21 KSSCPYCHR-AKELLSD-LGVNPKVVELDED-------EDGSEIQKALKKLTGQR 66 (104)
T ss_pred CCcCchHHH-HHHHHHh-CCCCCEEEEccCC-------CCcHHHHHHHHHhcCCC
Confidence 368999997 8888888 566 233333322 344588888887 554
No 311
>PRK09579 multidrug efflux protein; Reviewed
Probab=22.67 E-value=1.3e+02 Score=29.73 Aligned_cols=53 Identities=9% Similarity=-0.117 Sum_probs=37.2
Q ss_pred hhhHHHHHHHHhccCCe-eEEEeCCCC--EEEEEe--------ecCHHHHHHHHhc---CCceEEee
Q 048021 22 PAWFSSMTKVLRKIKGA-SYTIDAEEG--MAYITG--------RANPRKLLRKLQS---GKYANLCW 74 (143)
Q Consensus 22 ~gC~~kIkkaL~kI~GV-sV~VDle~~--~VtV~G--------~vdp~~Ll~aI~k---Gk~Ael~~ 74 (143)
..-+++|++.|++++|+ ++..|...+ ++.|.= -+++..|.++|+. |..+.-+.
T Consensus 671 ~~~a~~l~~~l~~~~g~~~v~~~~~~~~~el~i~vd~~ka~~~Gls~~~i~~~l~~~~~G~~~~~~~ 737 (1017)
T PRK09579 671 LQVAQRVKQRAQESGKFAFLDIDLAFDKPEVVVDIDRAKAAQMGVSMQDLGGTLATLLGEGEINRFT 737 (1017)
T ss_pred HHHHHHHHHHHHcCCCcEEeeccccCCCceEEEEECHHHHHHcCCCHHHHHHHHHHHhCCceeeEEE
Confidence 56788999999999999 788777544 444441 1567788888876 65554433
No 312
>PF01625 PMSR: Peptide methionine sulfoxide reductase; InterPro: IPR002569 Peptide methionine sulphoxide reductase (Msr) reverses the inactivation of many proteins due to the oxidation of critical methionine residues by reducing methionine sulphoxide, Met(O), to methionine []. It is present in most living organisms, and the cognate structural gene belongs to the so-called minimum gene set [, ]. The domains: MsrA and MsrB, reduce different epimeric forms of methionine sulphoxide. This group represent MsrA, the crystal structure of which has been determined in a number of organisms. In Mycobacterium tuberculosis, the MsrA structure has been determined to 1.5 Angstrom resolution []. In contrast to the three catalytic cysteine residues found in previously characterised MsrA structures, M. tuberculosis MsrA represents a class containing only two functional cysteine residues. The overall structure shows no resemblance to the structures of MsrB (IPR002579 from INTERPRO) from other organisms; though the active sites show approximate mirror symmetry. In each case, conserved amino acid motifs mediate the stereo-specific recognition and reduction of the substrate. In a number of pathogenic bacteria including Neisseria gonorrhoeae, the MsrA and MsrB domains are fused; the MsrA being N-terminal to MsrB. This arrangement is reversed in Treponema pallidum. In N. gonorrhoeae and Neisseria meningitidis a thioredoxin domain is fused to the N terminus. This may function to reduce the active sites of the downstream MsrA and MsrB domains. ; GO: 0016671 oxidoreductase activity, acting on a sulfur group of donors, disulfide as acceptor, 0019538 protein metabolic process, 0055114 oxidation-reduction process; PDB: 2GT3_A 1FF3_B 2IEM_A 3E0M_D 2J89_A 3PIN_B 3PIM_B 3PIL_B 2L90_A 3BQF_A ....
Probab=22.63 E-value=1.5e+02 Score=23.00 Aligned_cols=26 Identities=15% Similarity=0.295 Sum_probs=20.3
Q ss_pred EEEEEEeecChhhHHHHHHHHhccCCe-eEEE
Q 048021 12 TCGLKVDTNDPAWFSSMTKVLRKIKGA-SYTI 42 (143)
Q Consensus 12 tvvlkV~m~C~gC~~kIkkaL~kI~GV-sV~V 42 (143)
+++|. +||-=.++..+.+++|| ++.+
T Consensus 2 ~a~fa-----~GCFW~~e~~f~~~~GV~~t~v 28 (155)
T PF01625_consen 2 KAYFA-----GGCFWGVEAAFRRLPGVISTRV 28 (155)
T ss_dssp EEEEE-----ESSHHHHHHHHHTSTTEEEEEE
T ss_pred EEEEe-----cCCCeEhHHHHhhCCCEEEEEe
Confidence 45555 68888889999999999 6665
No 313
>KOG1372 consensus GDP-mannose 4,6 dehydratase [Carbohydrate transport and metabolism]
Probab=22.63 E-value=1.6e+02 Score=25.81 Aligned_cols=70 Identities=16% Similarity=0.275 Sum_probs=50.3
Q ss_pred EEEE---eecChhhHHHHHHHHhccC--------CeeEEEeCCCCEEEEEeecCHHHHHHHHhc-CCceEEeecCCCCCC
Q 048021 14 GLKV---DTNDPAWFSSMTKVLRKIK--------GASYTIDAEEGMAYITGRANPRKLLRKLQS-GKYANLCWVSAGNQI 81 (143)
Q Consensus 14 vlkV---~m~C~gC~~kIkkaL~kI~--------GVsV~VDle~~~VtV~G~vdp~~Ll~aI~k-Gk~Ael~~~~~~~s~ 81 (143)
.+++ +|+-..|-.|+-..++-.+ -|.|.+|+.+=+..|.+ +-.-.|++||+. |-.-.+.....+.|.
T Consensus 84 ~mkLHYgDmTDss~L~k~I~~ikPtEiYnLaAQSHVkvSFdlpeYTAeVda-vGtLRlLdAi~~c~l~~~VrfYQAstSE 162 (376)
T KOG1372|consen 84 SMKLHYGDMTDSSCLIKLISTIKPTEVYNLAAQSHVKVSFDLPEYTAEVDA-VGTLRLLDAIRACRLTEKVRFYQASTSE 162 (376)
T ss_pred eeEEeeccccchHHHHHHHhccCchhhhhhhhhcceEEEeecccceeeccc-hhhhhHHHHHHhcCcccceeEEecccHh
Confidence 4555 4788999887766654322 23478888888888875 445789999999 877777777777777
Q ss_pred ccc
Q 048021 82 TYG 84 (143)
Q Consensus 82 ~~~ 84 (143)
.||
T Consensus 163 lyG 165 (376)
T KOG1372|consen 163 LYG 165 (376)
T ss_pred hcc
Confidence 666
No 314
>PF12242 Eno-Rase_NADH_b: NAD(P)H binding domain of trans-2-enoyl-CoA reductase; PDB: 3ZU5_A 3ZU3_A 3ZU4_A 3ZU2_A 3S8M_A.
Probab=22.58 E-value=1.6e+02 Score=20.80 Aligned_cols=55 Identities=15% Similarity=0.134 Sum_probs=34.0
Q ss_pred cChhhHHHHHHHHhccCCeeEEEeCCCCEEEEEeecCHHHHHHHHhc--CCceEEeecC
Q 048021 20 NDPAWFSSMTKVLRKIKGASYTIDAEEGMAYITGRANPRKLLRKLQS--GKYANLCWVS 76 (143)
Q Consensus 20 ~C~gC~~kIkkaL~kI~GVsV~VDle~~~VtV~G~vdp~~Ll~aI~k--Gk~Ael~~~~ 76 (143)
|=.||++.|++-+.-++-- -.+ .-.++|.|.|.-.---|..+|.. |..|.-+++.
T Consensus 15 HP~GC~~~V~~qI~yvk~~-~~~-~GpK~VLViGaStGyGLAsRIa~aFg~gA~TiGV~ 71 (78)
T PF12242_consen 15 HPVGCARNVENQIEYVKSQ-GKI-NGPKKVLVIGASTGYGLASRIAAAFGAGADTIGVS 71 (78)
T ss_dssp -HHHHHHHHHHHHHHHHHC-----TS-SEEEEES-SSHHHHHHHHHHHHCC--EEEEEE
T ss_pred CcHHHHHHHHHHHHHHHhc-CCC-CCCceEEEEecCCcccHHHHHHHHhcCCCCEEEEe
Confidence 4479999999888765321 233 33578999987776667777766 7777777664
No 315
>PF04359 DUF493: Protein of unknown function (DUF493); InterPro: IPR007454 This family includes several proteins of uncharacterised function.; PDB: 1RWU_A 2JOQ_A 2H9Z_A.
Probab=22.54 E-value=2.3e+02 Score=19.17 Aligned_cols=35 Identities=14% Similarity=0.212 Sum_probs=28.3
Q ss_pred CCCCCceEEEEEEeecChhhHHHHHHHHhccCCee
Q 048021 5 SSSSAKMTCGLKVDTNDPAWFSSMTKVLRKIKGAS 39 (143)
Q Consensus 5 s~~s~~~tvvlkV~m~C~gC~~kIkkaL~kI~GVs 39 (143)
|+...-..+++.|.+....=...|=++|++++||.
T Consensus 48 S~~GkY~Svtv~v~v~s~eq~~~iy~~L~~~~~Vk 82 (85)
T PF04359_consen 48 SSKGKYVSVTVSVTVESAEQVDAIYRELKAHPGVK 82 (85)
T ss_dssp STTSSEEEEEEEEEESSHHHHHHHHHHHTTSSSEE
T ss_pred CCCCeEEEEEEEEEECCHHHHHHHHHHhccCCCEE
Confidence 45555667778888888888999999999999984
No 316
>TIGR00489 aEF-1_beta translation elongation factor aEF-1 beta. This model describes the archaeal translation elongation factor aEF-1 beta. The member from Sulfolobus solfataricus was demonstrated experimentally. It is a dimer that catalyzes the exchange of GDP for GTP on aEF-1 alpha.
Probab=22.50 E-value=1.8e+02 Score=20.64 Aligned_cols=21 Identities=19% Similarity=0.219 Sum_probs=17.7
Q ss_pred hhHHHHHHHHhccCCe-eEEEe
Q 048021 23 AWFSSMTKVLRKIKGA-SYTID 43 (143)
Q Consensus 23 gC~~kIkkaL~kI~GV-sV~VD 43 (143)
|-...|+.+|++++|| +++|.
T Consensus 62 g~td~lee~i~~ve~V~svev~ 83 (88)
T TIGR00489 62 GGTEAAEESLSGIEGVESVEVT 83 (88)
T ss_pred cChHHHHHHHhcCCCccEEEEE
Confidence 5568999999999999 77764
No 317
>PRK05528 methionine sulfoxide reductase A; Provisional
Probab=22.34 E-value=1.1e+02 Score=23.98 Aligned_cols=26 Identities=8% Similarity=0.119 Sum_probs=20.2
Q ss_pred EEEEEEeecChhhHHHHHHHHhccCCe-eEEE
Q 048021 12 TCGLKVDTNDPAWFSSMTKVLRKIKGA-SYTI 42 (143)
Q Consensus 12 tvvlkV~m~C~gC~~kIkkaL~kI~GV-sV~V 42 (143)
+++|. +||-=.++..+.+++|| ++++
T Consensus 3 ~a~fa-----gGCFWg~E~~f~~l~GV~~t~v 29 (156)
T PRK05528 3 TVYFA-----GGCLWGVQAFFKTLPGVIHTEA 29 (156)
T ss_pred EEEEe-----cCCchhhHHHHhcCCCEEEEEE
Confidence 45555 77777788899999999 7766
No 318
>cd04879 ACT_3PGDH-like ACT_3PGDH-like CD includes the C-terminal ACT (regulatory) domain of D-3-phosphoglycerate dehydrogenase (3PGDH). ACT_3PGDH-like: The ACT_3PGDH-like CD includes the C-terminal ACT (regulatory) domain of D-3-phosphoglycerate dehydrogenase (3PGDH), with or without an extended C-terminal (xct) region found in various bacteria, archaea, fungi, and plants. 3PGDH is an enzyme that belongs to the D-isomer specific, 2-hydroxyacid dehydrogenase family and catalyzes the oxidation of D-3-phosphoglycerate to 3- phosphohydroxypyruvate, which is the first step in the biosynthesis of L-serine, using NAD+ as the oxidizing agent. In bacteria, 3PGDH is feedback controlled by the end product L-serine in an allosteric manner. In the Escherichia coli homotetrameric enzyme, the interface at adjacent ACT (regulatory) domains couples to create an extended beta-sheet. Each regulatory interface forms two serine-binding sites. The mechanism by which serine transmits inhibition to the active
Probab=22.25 E-value=1e+02 Score=18.46 Aligned_cols=29 Identities=17% Similarity=0.156 Sum_probs=19.4
Q ss_pred ceEEEEEEeecChhhHHHHHHHHhccCCe-eEE
Q 048021 10 KMTCGLKVDTNDPAWFSSMTKVLRKIKGA-SYT 41 (143)
Q Consensus 10 ~~tvvlkV~m~C~gC~~kIkkaL~kI~GV-sV~ 41 (143)
...+.|.|+-. . ...+.+.|++++|| +|.
T Consensus 40 ~~~~~~~v~~~--~-~~~l~~~l~~~~~V~~v~ 69 (71)
T cd04879 40 IAYMVLDVDSP--V-PEEVLEELKALPGIIRVR 69 (71)
T ss_pred EEEEEEEcCCC--C-CHHHHHHHHcCCCeEEEE
Confidence 33455666432 2 45888999999999 665
No 319
>PRK12484 nicotinate phosphoribosyltransferase; Provisional
Probab=22.25 E-value=2.3e+02 Score=25.65 Aligned_cols=35 Identities=14% Similarity=0.058 Sum_probs=27.8
Q ss_pred CCCEEEEEeecCHHHHHHHHhcCCceEEeecCCCC
Q 048021 45 EEGMAYITGRANPRKLLRKLQSGKYANLCWVSAGN 79 (143)
Q Consensus 45 e~~~VtV~G~vdp~~Ll~aI~kGk~Ael~~~~~~~ 79 (143)
+..+|.+++.+|.++|++..++|..+..++++..-
T Consensus 278 ~~~kIi~S~gLde~~i~~l~~~g~~id~fGvGt~L 312 (443)
T PRK12484 278 EQVKIVASGGLDEYRIAALLAAGAPIDGFGVGTRL 312 (443)
T ss_pred CCcEEEEeCCCCHHHHHHHHHCCCcCeEEeeCccc
Confidence 34578888889999999876559999999998643
No 320
>TIGR00007 phosphoribosylformimino-5-aminoimidazole carboxamide ribotide isomerase. Examples of this enzyme in Actinobacteria have been found to be bifunctional, also possessing phosphoribosylanthranilate isomerase activity ; the trusted cutoff here has now been raised to 275.0 to exclude the bifunctional group, now represented by model TIGR01919. HisA from Lactococcus lactis was reported to be inactive (MEDLINE:93322317).
Probab=22.08 E-value=3.1e+02 Score=21.43 Aligned_cols=48 Identities=17% Similarity=0.279 Sum_probs=33.5
Q ss_pred HHHHHHHHhccC--CeeEEEeCCCCEEEEEee-----cCHHHHHHHHhc-CCceEE
Q 048021 25 FSSMTKVLRKIK--GASYTIDAEEGMAYITGR-----ANPRKLLRKLQS-GKYANL 72 (143)
Q Consensus 25 ~~kIkkaL~kI~--GVsV~VDle~~~VtV~G~-----vdp~~Ll~aI~k-Gk~Ael 72 (143)
...++++++.+. -+-+.+|.+.++|.+.|. .++..+++.+.+ |-+..+
T Consensus 108 ~~~~~~~~~~~g~~~i~~sid~~~~~v~~~g~~~~~~~~~~~~~~~~~~~g~~~ii 163 (230)
T TIGR00007 108 PDLVKELLKEYGPERIVVSLDARGGEVAVKGWLEKSEVSLEELAKRLEELGLEGII 163 (230)
T ss_pred HHHHHHHHHHhCCCcEEEEEEEECCEEEEcCCcccCCCCHHHHHHHHHhCCCCEEE
Confidence 345666665552 245778888888887762 577889999988 988544
No 321
>PRK13585 1-(5-phosphoribosyl)-5-[(5-phosphoribosylamino)methylideneamino] imidazole-4-carboxamide isomerase; Provisional
Probab=21.97 E-value=3.2e+02 Score=21.49 Aligned_cols=48 Identities=17% Similarity=0.306 Sum_probs=34.0
Q ss_pred HHHHHHHhccCC--eeEEEeCCCCEEEEEee-----cCHHHHHHHHhc-CCceEEe
Q 048021 26 SSMTKVLRKIKG--ASYTIDAEEGMAYITGR-----ANPRKLLRKLQS-GKYANLC 73 (143)
Q Consensus 26 ~kIkkaL~kI~G--VsV~VDle~~~VtV~G~-----vdp~~Ll~aI~k-Gk~Ael~ 73 (143)
..++++.+.+.. +-+.+|..+++|.+.|. .++..+.+++.+ |....++
T Consensus 113 ~~~~~i~~~~g~~~i~~sid~~~~~v~~~g~~~~~~~~~~~~~~~~~~~G~~~i~~ 168 (241)
T PRK13585 113 EIVRELSEEFGSERVMVSLDAKDGEVVIKGWTEKTGYTPVEAAKRFEELGAGSILF 168 (241)
T ss_pred HHHHHHHHHhCCCcEEEEEEeeCCEEEECCCcccCCCCHHHHHHHHHHcCCCEEEE
Confidence 345665555532 24678888899998873 378899999988 9876554
No 322
>cd03033 ArsC_15kD Arsenate Reductase (ArsC) family, 15kD protein subfamily; composed of proteins of unknown function with similarity to thioredoxin-fold arsenic reductases, ArsC. It is encoded by an ORF present in a gene cluster associated with nitrogen fixation that also encodes dinitrogenase reductase ADP-ribosyltransferase (DRAT) and dinitrogenase reductase activating glycohydrolase (DRAG). ArsC catalyzes the reduction of arsenate [As(V)] to arsenite [As(III)], using reducing equivalents derived from glutathione via glutaredoxin, through a single catalytic cysteine.
Probab=21.92 E-value=2.6e+02 Score=20.17 Aligned_cols=45 Identities=9% Similarity=-0.107 Sum_probs=28.5
Q ss_pred EEeecChhhHHHHHHHHhccCCee-EEEeCCCCEEEEEeecCHHHHHHHHhc-CC
Q 048021 16 KVDTNDPAWFSSMTKVLRKIKGAS-YTIDAEEGMAYITGRANPRKLLRKLQS-GK 68 (143)
Q Consensus 16 kV~m~C~gC~~kIkkaL~kI~GVs-V~VDle~~~VtV~G~vdp~~Ll~aI~k-Gk 68 (143)
=.+-.|..|.+ .++.|+. .|+. -.+|..+ -..+.+.|.+.+++ |.
T Consensus 5 y~~p~C~~crk-A~~~L~~-~gi~~~~~d~~~------~p~s~~eL~~~l~~~g~ 51 (113)
T cd03033 5 YEKPGCANNAR-QKALLEA-AGHEVEVRDLLT------EPWTAETLRPFFGDLPV 51 (113)
T ss_pred EECCCCHHHHH-HHHHHHH-cCCCcEEeehhc------CCCCHHHHHHHHHHcCH
Confidence 33458999984 4566665 5774 4445543 23567778877777 74
No 323
>PRK11439 pphA serine/threonine protein phosphatase 1; Provisional
Probab=21.81 E-value=1.5e+02 Score=23.36 Aligned_cols=41 Identities=17% Similarity=0.261 Sum_probs=29.5
Q ss_pred ChhhHHHHHHHHhccCCeeEEEeCCCCEEEEEeec-----CHHHHHHHHhc
Q 048021 21 DPAWFSSMTKVLRKIKGASYTIDAEEGMAYITGRA-----NPRKLLRKLQS 66 (143)
Q Consensus 21 C~gC~~kIkkaL~kI~GVsV~VDle~~~VtV~G~v-----dp~~Ll~aI~k 66 (143)
=-||...++++|++++ .+.+..++.+-|.+ ++.++++.|.+
T Consensus 25 IHG~~~~L~~lL~~i~-----~~~~~D~li~lGDlvDrGp~s~~vl~~l~~ 70 (218)
T PRK11439 25 IHGCFEQLMRKLRHCR-----FDPWRDLLISVGDLIDRGPQSLRCLQLLEE 70 (218)
T ss_pred ccCCHHHHHHHHHhcC-----CCcccCEEEEcCcccCCCcCHHHHHHHHHc
Confidence 3589999999998873 22345566666752 68888888877
No 324
>cd08630 PI-PLCc_delta3 Catalytic domain of metazoan phosphoinositide-specific phospholipase C-delta3. This subfamily corresponds to the catalytic domain present in metazoan phosphoinositide-specific phospholipase C (PI-PLC, EC 3.1.4.11)-delta3 isozymes. PI-PLC is a signaling enzyme that hydrolyzes the membrane phospholipids phosphatidylinositol-4,5-bisphosphate (PIP2) to generate two important second messengers in eukaryotic signal transduction cascades, Inositol 1,4,5-trisphosphate (InsP3) and diacylglycerol (DAG). InsP3 triggers inflow of calcium from intracellular stores, while DAG, together with calcium, activates protein kinase C, which then phosphorylates other molecules, leading to altered cellular activity. Calcium is required for the catalysis. PLC-delta represents a class of mammalian PI-PLC that has an N-terminal pleckstrin homology (PH) domain, an array of EF hands, a PLC catalytic core domain, and a C-terminal C2 domain. This family corresponds to the catalytic domain wh
Probab=21.80 E-value=78 Score=26.82 Aligned_cols=52 Identities=15% Similarity=0.223 Sum_probs=36.1
Q ss_pred hHHHHHHHHhccCCe-eEEEeCCCCE---EEEE-e-----ecCHHHHHHHHhc-CCceE----EeecCC
Q 048021 24 WFSSMTKVLRKIKGA-SYTIDAEEGM---AYIT-G-----RANPRKLLRKLQS-GKYAN----LCWVSA 77 (143)
Q Consensus 24 C~~kIkkaL~kI~GV-sV~VDle~~~---VtV~-G-----~vdp~~Ll~aI~k-Gk~Ae----l~~~~~ 77 (143)
=+..+.+||. .|+ -|++|..+|. -.|. | .+.-.+++++|++ ++.+. |++.+.
T Consensus 31 s~~~y~~aL~--~GcRcvElD~wdg~~~eP~V~HG~tlts~i~f~~v~~~I~~~AF~~s~yPvIlslE~ 97 (258)
T cd08630 31 STEAYVRAFA--QGCRCVELDCWEGPGGEPVIYHGHTLTSKILFRDVIQAVRQHAFTASPYPVILSLEN 97 (258)
T ss_pred cHHHHHHHHH--cCCcEEEEEeecCCCCCcEEeeCCccccceEHHHHHHHHHHHhccCCCCCEEEEeec
Confidence 3568888885 699 5999876543 3333 4 4788999999999 88753 555543
No 325
>COG4747 ACT domain-containing protein [General function prediction only]
Probab=21.80 E-value=3.9e+02 Score=20.75 Aligned_cols=59 Identities=20% Similarity=0.260 Sum_probs=40.4
Q ss_pred eEEEEEEee-cChhhHHHHHHHHhccCCe--e--EEEeCCCCEEEE-EeecCHHHHHHHHhc-CCce
Q 048021 11 MTCGLKVDT-NDPAWFSSMTKVLRKIKGA--S--YTIDAEEGMAYI-TGRANPRKLLRKLQS-GKYA 70 (143)
Q Consensus 11 ~tvvlkV~m-~C~gC~~kIkkaL~kI~GV--s--V~VDle~~~VtV-~G~vdp~~Ll~aI~k-Gk~A 70 (143)
.+=++.|+| .-+|--.+|-++|..- ++ + ..+-.++++..+ --.-|.++.+++|++ |-+.
T Consensus 68 ~~dVlaVEmeD~PG~l~~I~~vl~d~-diNldYiYAFv~ek~KAlli~r~ed~d~~~~aLed~gi~~ 133 (142)
T COG4747 68 ETDVLAVEMEDVPGGLSRIAEVLGDA-DINLDYIYAFVTEKQKALLIVRVEDIDRAIKALEDAGIKL 133 (142)
T ss_pred eeeEEEEEecCCCCcHHHHHHHHhhc-CcCceeeeeeeecCceEEEEEEhhHHHHHHHHHHHcCCee
Confidence 355788998 6789999999998763 34 2 222233344444 445689999999999 8653
No 326
>TIGR02200 GlrX_actino Glutaredoxin-like protein. This family of glutaredoxin-like proteins is limited to the Actinobacteria and contains the conserved CxxC motif.
Probab=21.79 E-value=2.1e+02 Score=17.67 Aligned_cols=18 Identities=28% Similarity=0.231 Sum_probs=13.5
Q ss_pred ecChhhHHHHHHHHhccCCe
Q 048021 19 TNDPAWFSSMTKVLRKIKGA 38 (143)
Q Consensus 19 m~C~gC~~kIkkaL~kI~GV 38 (143)
..|+.|. +++..|.++ |+
T Consensus 8 ~~C~~C~-~~~~~L~~~-~~ 25 (77)
T TIGR02200 8 TWCGYCA-QLMRTLDKL-GA 25 (77)
T ss_pred CCChhHH-HHHHHHHHc-CC
Confidence 5799998 477778775 55
No 327
>COG1782 Predicted metal-dependent RNase, consists of a metallo-beta-lactamase domain and an RNA-binding KH domain [General function prediction only]
Probab=21.65 E-value=2.5e+02 Score=26.81 Aligned_cols=59 Identities=12% Similarity=0.094 Sum_probs=40.7
Q ss_pred hHHHHHHHHhccCCe-eEEEeCCCCEEEEEeec-------CHHHHHHHHhc-CCceEEeecCCCCCCc
Q 048021 24 WFSSMTKVLRKIKGA-SYTIDAEEGMAYITGRA-------NPRKLLRKLQS-GKYANLCWVSAGNQIT 82 (143)
Q Consensus 24 C~~kIkkaL~kI~GV-sV~VDle~~~VtV~G~v-------dp~~Ll~aI~k-Gk~Ael~~~~~~~s~~ 82 (143)
-.+.|.+.+-+-.|| ++.+|.+.|.|.|..+- .-..+.+..++ |...+++..++-.|.+
T Consensus 77 A~~~I~eivP~ea~i~~i~Fd~~tGEViIea~KPGlvigk~g~~~reI~~~tgW~p~ivR~PPi~S~t 144 (637)
T COG1782 77 ARKIILEIVPEEAGITDIYFDDDTGEVIIEAKKPGLVIGKGGSTLREITAETGWAPKIVRTPPIQSRT 144 (637)
T ss_pred HHHHHHHhCccccCceeEEecCCCceEEEEecCCceEEecCchHHHHHHHHhCCcceeeecCCCchhh
Confidence 334444444444678 79999999999998542 23444555555 9999999998876665
No 328
>PRK13763 putative RNA-processing protein; Provisional
Probab=21.63 E-value=1.9e+02 Score=22.59 Aligned_cols=27 Identities=11% Similarity=0.324 Sum_probs=18.5
Q ss_pred HHHHHHhccCCeeEEEeCCCCEEEEEeec
Q 048021 27 SMTKVLRKIKGASYTIDAEEGMAYITGRA 55 (143)
Q Consensus 27 kIkkaL~kI~GVsV~VDle~~~VtV~G~v 55 (143)
+..++|+.+-|+.+.|+ +++|.|.|..
T Consensus 115 ~~~k~ie~~t~~~i~i~--~~~v~i~G~~ 141 (180)
T PRK13763 115 KTRRIIEELTGVDISVY--GKTVAIIGDP 141 (180)
T ss_pred HHHHHHHHHHCcEEEEc--CCEEEEEeCH
Confidence 45667777777755554 6679999853
No 329
>TIGR00655 PurU formyltetrahydrofolate deformylase. This model describes formyltetrahydrofolate deformylases. The enzyme is a homohexamer. Sequences from a related enzyme formyl tetrahydrofolate-specific enzyme, phosphoribosylglycinamide formyltransferase, serve as an outgroup for phylogenetic analysis. Putative members of this family, scoring below the trusted cutoff, include a sequence from Rhodobacter capsulatus that lacks an otherwise conserved C-terminal region.
Probab=21.62 E-value=4.8e+02 Score=21.93 Aligned_cols=53 Identities=15% Similarity=0.078 Sum_probs=33.4
Q ss_pred hhHHHHHHHHhc-c---CCeeEEEeCCCC--EEEE--Eee-cCHHHHHHHHhc-CCceEEeec
Q 048021 23 AWFSSMTKVLRK-I---KGASYTIDAEEG--MAYI--TGR-ANPRKLLRKLQS-GKYANLCWV 75 (143)
Q Consensus 23 gC~~kIkkaL~k-I---~GVsV~VDle~~--~VtV--~G~-vdp~~Ll~aI~k-Gk~Ael~~~ 75 (143)
.....++.+|+. + -|+++.++..+. ++.| +|. -+-+.|+++++. ..++++..+
T Consensus 55 ~~~~~l~~~l~~~~~~~~~l~i~l~~~~~~~ki~vl~Sg~g~nl~~l~~~~~~g~l~~~i~~v 117 (280)
T TIGR00655 55 LEESSLLAAFKSALAEKFEMTWELILADKLKRVAILVSKEDHCLGDLLWRWYSGELDAEIALV 117 (280)
T ss_pred CCHHHHHHHHHHHHHHHhCCEEEEecCCCCcEEEEEEcCCChhHHHHHHHHHcCCCCcEEEEE
Confidence 456777777666 4 355666655433 3333 344 478999999988 556776555
No 330
>cd01733 LSm10 The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form the core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. LSm10 is an SmD1-like protein which is thought to bind U7 snRNA along with LSm11 and five other Sm subunits to form a 7-member ring structure. LSm10 and the U7 snRNP of which it is a part are thought to play an important role in histone mRNA 3' processing.
Probab=21.62 E-value=1.4e+02 Score=20.49 Aligned_cols=29 Identities=21% Similarity=0.441 Sum_probs=22.1
Q ss_pred HHHHHHHhccCCeeEEEeCCCCEEEEEeec
Q 048021 26 SSMTKVLRKIKGASYTIDAEEGMAYITGRA 55 (143)
Q Consensus 26 ~kIkkaL~kI~GVsV~VDle~~~VtV~G~v 55 (143)
+.+-..|+++.|-.|.|.++++. ++.|.+
T Consensus 8 ~tl~~~L~~l~g~~V~VeLKng~-~~~G~L 36 (78)
T cd01733 8 NTLIILLQGLQGKVVTVELRNET-TVTGRI 36 (78)
T ss_pred chHHHHHHHCCCCEEEEEECCCC-EEEEEE
Confidence 34556778888878999999884 888863
No 331
>PRK11018 hypothetical protein; Provisional
Probab=21.51 E-value=2.6e+02 Score=18.73 Aligned_cols=52 Identities=10% Similarity=-0.019 Sum_probs=35.5
Q ss_pred EEEEEe-ecChhhHHHHHHHHhccCCeeEEEeCCCCEEEEE--eecCHHHHHHHHhc-CCceEE
Q 048021 13 CGLKVD-TNDPAWFSSMTKVLRKIKGASYTIDAEEGMAYIT--GRANPRKLLRKLQS-GKYANL 72 (143)
Q Consensus 13 vvlkV~-m~C~gC~~kIkkaL~kI~GVsV~VDle~~~VtV~--G~vdp~~Ll~aI~k-Gk~Ael 72 (143)
.+|.+- ..|+.-.=+.+++|++++. ...+.|. .......|.+.+++ |+.+..
T Consensus 9 ~~lD~rG~~CP~Pvl~~kk~l~~l~~--------G~~L~V~~d~~~a~~di~~~~~~~G~~v~~ 64 (78)
T PRK11018 9 YRLDMVGEPCPYPAVATLEALPQLKK--------GEILEVVSDCPQSINNIPLDARNHGYTVLD 64 (78)
T ss_pred eeEECCCCcCCHHHHHHHHHHHhCCC--------CCEEEEEeCCccHHHHHHHHHHHcCCEEEE
Confidence 345553 4899999999999999862 2233333 33456778888888 998753
No 332
>PRK04021 hypothetical protein; Reviewed
Probab=21.37 E-value=2.2e+02 Score=20.32 Aligned_cols=45 Identities=20% Similarity=0.224 Sum_probs=31.5
Q ss_pred cChhhHHHHHHHHhccCCeeEEEe----CCCCEEEEEeecCHHHHHHHHh
Q 048021 20 NDPAWFSSMTKVLRKIKGASYTID----AEEGMAYITGRANPRKLLRKLQ 65 (143)
Q Consensus 20 ~C~gC~~kIkkaL~kI~GVsV~VD----le~~~VtV~G~vdp~~Ll~aI~ 65 (143)
--+.+=+.+-+.|.+.=|++|+|. -..++|.|.| ++++.+.++|.
T Consensus 43 ~~GkAN~ali~~LAk~l~~~I~I~~G~~sr~K~v~i~g-~~~e~l~~~L~ 91 (92)
T PRK04021 43 VKGKANKELVKFFSKLLGAEVEIIRGETSREKDLLVKG-ISLEEVKKKLK 91 (92)
T ss_pred CCChHHHHHHHHHHHHhCCCEEEEecCCcCceEEEEec-CCHHHHHHHhc
Confidence 356666777777777767766663 2356777776 68888888774
No 333
>PRK15127 multidrug efflux system protein AcrB; Provisional
Probab=21.36 E-value=1.4e+02 Score=29.65 Aligned_cols=49 Identities=12% Similarity=-0.006 Sum_probs=35.2
Q ss_pred hhhHHHHHHHHhccCC-e-eEEEeCCC--CEEEEEe--------ecCHHHHHHHHhc---CCce
Q 048021 22 PAWFSSMTKVLRKIKG-A-SYTIDAEE--GMAYITG--------RANPRKLLRKLQS---GKYA 70 (143)
Q Consensus 22 ~gC~~kIkkaL~kI~G-V-sV~VDle~--~~VtV~G--------~vdp~~Ll~aI~k---Gk~A 70 (143)
..-+++|++.|++++| + +|..|.+. .++.|+= .+++..|.++|+. |..+
T Consensus 696 ~~~a~~l~~~l~~~~g~~~~v~~~~~~~~~~i~i~vd~~~~~~~Gls~~~v~~~l~~~~~G~~~ 759 (1049)
T PRK15127 696 TQARNQLLGEAAKHPDMLVGVRPNGLEDTPQFKIDIDQEKAQALGVSISDINTTLGAAWGGSYV 759 (1049)
T ss_pred HHHHHHHHHHHHhCCCceeeeecCccCCCceEEEEECHHHHHHcCCCHHHHHHHHHHHhCCcee
Confidence 4678999999999999 7 67777654 4444441 2578888888876 6543
No 334
>PHA02125 thioredoxin-like protein
Probab=21.36 E-value=1e+02 Score=19.98 Aligned_cols=15 Identities=7% Similarity=-0.053 Sum_probs=11.1
Q ss_pred cChhhHHHHHHHHhcc
Q 048021 20 NDPAWFSSMTKVLRKI 35 (143)
Q Consensus 20 ~C~gC~~kIkkaL~kI 35 (143)
.|+-|. .++..|.++
T Consensus 9 wC~~Ck-~~~~~l~~~ 23 (75)
T PHA02125 9 WCANCK-MVKPMLANV 23 (75)
T ss_pred CCHhHH-HHHHHHHHH
Confidence 699998 566777665
No 335
>PF11823 DUF3343: Protein of unknown function (DUF3343); InterPro: IPR021778 This family of proteins are functionally uncharacterised. This protein is found in bacteria and archaea. Proteins in this family are typically between 78 to 102 amino acids in length.
Probab=21.29 E-value=97 Score=20.42 Aligned_cols=28 Identities=14% Similarity=0.054 Sum_probs=14.2
Q ss_pred EEEeecCHHHHHHHHhc-CCceEEeecCC
Q 048021 50 YITGRANPRKLLRKLQS-GKYANLCWVSA 77 (143)
Q Consensus 50 tV~G~vdp~~Ll~aI~k-Gk~Ael~~~~~ 77 (143)
+..+..+.-+..+.|++ |.++.|++++.
T Consensus 7 ~F~st~~a~~~ek~lk~~gi~~~liP~P~ 35 (73)
T PF11823_consen 7 TFPSTHDAMKAEKLLKKNGIPVRLIPTPR 35 (73)
T ss_pred EECCHHHHHHHHHHHHHCCCcEEEeCCCh
Confidence 33344445555555555 55555555544
No 336
>PRK10677 modA molybdate transporter periplasmic protein; Provisional
Probab=21.27 E-value=4.5e+02 Score=21.27 Aligned_cols=58 Identities=14% Similarity=0.145 Sum_probs=29.8
Q ss_pred CCceEEEEEEeecChhhHHHHHHHHhccCCeeEEEeCCCCEEEEEeecCHHHHHHHHhcCCceEEeec
Q 048021 8 SAKMTCGLKVDTNDPAWFSSMTKVLRKIKGASYTIDAEEGMAYITGRANPRKLLRKLQSGKYANLCWV 75 (143)
Q Consensus 8 s~~~tvvlkV~m~C~gC~~kIkkaL~kI~GVsV~VDle~~~VtV~G~vdp~~Ll~aI~kGk~Ael~~~ 75 (143)
+...++++-...+...+.+.|-++.++-.|+.|.+... .. ..|.+.|++|.++.|+..
T Consensus 24 ~~~~~l~v~~a~~~~~~~~~l~~~Fe~~~g~~v~~~~~---------~S-g~l~~qi~~g~~~Dv~~~ 81 (257)
T PRK10677 24 ADEGKITVFAAASLTNALQDIAAQYKKEKGVDVVSSFA---------SS-STLARQIEQGAPADLFIS 81 (257)
T ss_pred ccCCcEEEEEecChHHHHHHHHHHHHhhhCCeEEEEec---------cc-HHHHHHHHcCCCCCEEEE
Confidence 33344544444455566777777776655654333322 12 255566655554554444
No 337
>cd03012 TlpA_like_DipZ_like TlpA-like family, DipZ-like subfamily; composed uncharacterized proteins containing a TlpA-like TRX domain. Some members show domain architectures similar to that of E. coli DipZ protein (also known as DsbD). The only eukaryotic members of the TlpA family belong to this subfamily. TlpA is a disulfide reductase known to have a crucial role in the biogenesis of cytochrome aa3.
Probab=21.26 E-value=3e+02 Score=19.22 Aligned_cols=53 Identities=8% Similarity=0.007 Sum_probs=29.0
Q ss_pred eecChhhHHHHHH---HHhcc--CCee-EEEeCCCCEEEEEeecCHHHHHHHHhc-CCceEEee
Q 048021 18 DTNDPAWFSSMTK---VLRKI--KGAS-YTIDAEEGMAYITGRANPRKLLRKLQS-GKYANLCW 74 (143)
Q Consensus 18 ~m~C~gC~~kIkk---aL~kI--~GVs-V~VDle~~~VtV~G~vdp~~Ll~aI~k-Gk~Ael~~ 74 (143)
...|..|.+.+.. ..+++ .|+. +.|...+ ...+.+++++.+.+++ |..-.++.
T Consensus 32 a~~C~~C~~~~p~l~~l~~~~~~~~~~vi~i~~~~----~~~~~~~~~~~~~~~~~~~~~p~~~ 91 (126)
T cd03012 32 TYCCINCLHTLPYLTDLEQKYKDDGLVVIGVHSPE----FAFERDLANVKSAVLRYGITYPVAN 91 (126)
T ss_pred CCCCccHHHHHHHHHHHHHHcCcCCeEEEEeccCc----cccccCHHHHHHHHHHcCCCCCEEE
Confidence 4689999866433 22222 2342 3332211 0112468888888888 77665554
No 338
>PF04273 DUF442: Putative phosphatase (DUF442); InterPro: IPR005939 Although this domain is uncharacterised it seems likely that it performs a phosphatase function.; GO: 0016787 hydrolase activity; PDB: 2F46_A 3GXH_B 3GXG_B.
Probab=21.22 E-value=1.4e+02 Score=21.66 Aligned_cols=34 Identities=15% Similarity=0.243 Sum_probs=20.2
Q ss_pred CEEEEEeecCHHHHHHHHhcCCceEEeecCCCCC
Q 048021 47 GMAYITGRANPRKLLRKLQSGKYANLCWVSAGNQ 80 (143)
Q Consensus 47 ~~VtV~G~vdp~~Ll~aI~kGk~Ael~~~~~~~s 80 (143)
..+.|.|..+++.|.+.-+.|++..|.--+.++.
T Consensus 7 ~~~~vs~Q~~~~d~~~la~~GfktVInlRpd~E~ 40 (110)
T PF04273_consen 7 DDLSVSGQPSPEDLAQLAAQGFKTVINLRPDGEE 40 (110)
T ss_dssp TTEEEECS--HHHHHHHHHCT--EEEE-S-TTST
T ss_pred CCeEECCCCCHHHHHHHHHCCCcEEEECCCCCCC
Confidence 4688899999999985544499988776655544
No 339
>COG4771 FepA Outer membrane receptor for ferrienterochelin and colicins [Inorganic ion transport and metabolism]
Probab=21.13 E-value=94 Score=30.01 Aligned_cols=29 Identities=21% Similarity=0.429 Sum_probs=21.9
Q ss_pred HHHHHHHHhccCCeeEEEe-CCCCEEEEEe
Q 048021 25 FSSMTKVLRKIKGASYTID-AEEGMAYITG 53 (143)
Q Consensus 25 ~~kIkkaL~kI~GVsV~VD-le~~~VtV~G 53 (143)
++.|..||+.+|||+++=. -.++.|.|.|
T Consensus 67 ~rDl~ealr~vpGV~ltg~~G~nr~IsIRG 96 (699)
T COG4771 67 YRDLAEALRTVPGVNLTGNRGNNRQISIRG 96 (699)
T ss_pred hhhHHHHHhcCCceEEeccCCCccceEEec
Confidence 8899999999999987744 4455566654
No 340
>TIGR00914 2A0601 heavy metal efflux pump (cobalt-zinc-cadmium). This model represents a family of H+/heavy metal cation antiporters. This family is one of several subfamilies within the scope of pfam model pfam00873.
Probab=20.97 E-value=1.7e+02 Score=28.87 Aligned_cols=40 Identities=10% Similarity=0.096 Sum_probs=28.6
Q ss_pred HHHHHHhccCCe-eEEEeCC-CCEEEEEe--------ecCHHHHHHHHhc
Q 048021 27 SMTKVLRKIKGA-SYTIDAE-EGMAYITG--------RANPRKLLRKLQS 66 (143)
Q Consensus 27 kIkkaL~kI~GV-sV~VDle-~~~VtV~G--------~vdp~~Ll~aI~k 66 (143)
.|+..|++++|| +|++.-. +.++.|.= .+++.+|.++|+.
T Consensus 170 ~l~~~L~~i~GV~~V~~~G~~~~ei~V~iD~~kl~~~glt~~dV~~~l~~ 219 (1051)
T TIGR00914 170 IIRPQLRTVPGVAEVNSIGGYVKQFLVAPDPEKLAAYGLSLADVVNALER 219 (1051)
T ss_pred HHHHHHhcCCCceEeeecCCceEEEEEEECHHHHHHcCCCHHHHHHHHHH
Confidence 478899999999 7888655 34555551 2467777788877
No 341
>TIGR01394 TypA_BipA GTP-binding protein TypA/BipA. This bacterial (and Arabidopsis) protein, termed TypA or BipA, a GTP-binding protein, is phosphorylated on a tyrosine residue under some cellular conditions. Mutants show altered regulation of some pathways, but the precise function is unknown.
Probab=20.86 E-value=3.3e+02 Score=25.39 Aligned_cols=67 Identities=7% Similarity=0.025 Sum_probs=44.1
Q ss_pred CceEEEEEEeec--ChhhHH-----HHHHHHhccC--Ce--eEEEeCCCCEEEEEe--ecCHHHHHHHHhc-CCceEEee
Q 048021 9 AKMTCGLKVDTN--DPAWFS-----SMTKVLRKIK--GA--SYTIDAEEGMAYITG--RANPRKLLRKLQS-GKYANLCW 74 (143)
Q Consensus 9 ~~~tvvlkV~m~--C~gC~~-----kIkkaL~kI~--GV--sV~VDle~~~VtV~G--~vdp~~Ll~aI~k-Gk~Ael~~ 74 (143)
+...+.|.++-. .+.=.+ +|..+|.++- .+ .|+-+.+.+...|.| .+--+.|+++|++ |+.+++-.
T Consensus 301 P~~~~~~~~~~~p~~~~e~k~~t~~~l~~~L~k~~~~d~sl~v~~~~~~~~~~v~g~GelHL~il~e~lrreg~e~~~~~ 380 (594)
T TIGR01394 301 PTLSMTFSVNDSPLAGKEGKKVTSRHIRDRLMRELETNVALRVEDTESADKFEVSGRGELHLSILIETMRREGFELQVGR 380 (594)
T ss_pred CeEEEEEEecCCCcccccchhhhHHHHHHHHHHhhccCCeEEEEEecCCCeEEEEEECHHHHHHHHHHHhccCceEEEeC
Confidence 444566776643 222122 3888888762 23 466677778888876 4678999999999 99887544
Q ss_pred c
Q 048021 75 V 75 (143)
Q Consensus 75 ~ 75 (143)
+
T Consensus 381 P 381 (594)
T TIGR01394 381 P 381 (594)
T ss_pred C
Confidence 3
No 342
>PF04724 Glyco_transf_17: Glycosyltransferase family 17; InterPro: IPR006813 This family represents beta-1,4-mannosyl-glycoprotein beta-1,4-N-acetylglucosaminyltransferase (2.4.1.144 from EC). This enzyme transfers the bisecting GlcNAc to the core mannose of complex N-glycans. The addition of this residue is regulated during development and has functional consequences for receptor signalling, cell adhesion, and tumour progression [, ].; GO: 0003830 beta-1,4-mannosylglycoprotein 4-beta-N-acetylglucosaminyltransferase activity, 0006487 protein N-linked glycosylation, 0016020 membrane
Probab=20.69 E-value=55 Score=28.73 Aligned_cols=43 Identities=19% Similarity=0.308 Sum_probs=31.4
Q ss_pred cChhhHHHHHHHHhccCCee-EEEeCCCCEEEEEeecCHHHHHHHHhcCC
Q 048021 20 NDPAWFSSMTKVLRKIKGAS-YTIDAEEGMAYITGRANPRKLLRKLQSGK 68 (143)
Q Consensus 20 ~C~gC~~kIkkaL~kI~GVs-V~VDle~~~VtV~G~vdp~~Ll~aI~kGk 68 (143)
||.-|-..+++.+.|+...+ .+... -..-.|++.|.+.|.+|+
T Consensus 259 HCS~Cf~~ie~i~~Km~SfSH~e~~r------~~~~~D~~~I~~~I~~G~ 302 (356)
T PF04724_consen 259 HCSSCFSTIEEIINKMQSFSHQERNR------WPEFLDPDYIQDLIRSGK 302 (356)
T ss_pred eccCcCCCHHHHHHHHHhhccccccc------chhhcCHHHHHHHHhcCc
Confidence 99999998999999987652 32211 111289999999999866
No 343
>cd06404 PB1_aPKC PB1 domain is an essential modular domain of the atypical protein kinase C (aPKC) which in complex with Par6 and Par3 proteins is crucial for establishment of apical-basal polarity of animal cells. PB1 domain is a modular domain mediating specific protein-protein interaction which play roles in many critical cell processes. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domains, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. Interactions of PB1 domains with other protein domains have been described as noncanonical PB1-interactions. The PB1 domain module is conserved in amoebas, fungi,
Probab=20.65 E-value=2.4e+02 Score=20.07 Aligned_cols=57 Identities=14% Similarity=0.115 Sum_probs=40.5
Q ss_pred EEEEEeecChhhHHHHHHHHhccCCe--e-EEEeCCCCEEEEEeecCHHHHHHHHhc---CCceEE
Q 048021 13 CGLKVDTNDPAWFSSMTKVLRKIKGA--S-YTIDAEEGMAYITGRANPRKLLRKLQS---GKYANL 72 (143)
Q Consensus 13 vvlkV~m~C~gC~~kIkkaL~kI~GV--s-V~VDle~~~VtV~G~vdp~~Ll~aI~k---Gk~Ael 72 (143)
+.+.+.++|+.=.++|+...+-.++. . --+|.+..-++|+++ +.|.+|++- .++.+|
T Consensus 14 t~~d~~~s~e~L~~~v~~~c~~~~~q~ft~kw~DEEGDp~tiSS~---~EL~EA~rl~~~n~~~~l 76 (83)
T cd06404 14 TSIDPSISLEELCNEVRDMCRFHNDQPFTLKWIDEEGDPCTISSQ---MELEEAFRLYELNKDSEL 76 (83)
T ss_pred EEcCCCcCHHHHHHHHHHHhCCCCCCcEEEEEECCCCCceeecCH---HHHHHHHHHHHhcCcccE
Confidence 33444578888888999988877764 4 567999999999876 555555553 666554
No 344
>TIGR01952 nusA_arch NusA family KH domain protein, archaeal. This model represents a family of archaeal proteins found in a single copy per genome. It contains two KH domains (pfam00013) and is most closely related to the central region bacterial NusA, a transcription termination factor named for its iteraction with phage lambda protein N in E. coli. The proteins required for antitermination by N include NusA, NusB, nusE (ribosomal protein S10), and nusG. This system, on the whole, appears not to be present in the Archaea.
Probab=20.64 E-value=60 Score=24.91 Aligned_cols=19 Identities=21% Similarity=0.239 Sum_probs=11.3
Q ss_pred cCHHHHHHHHhc-CCceEEe
Q 048021 55 ANPRKLLRKLQS-GKYANLC 73 (143)
Q Consensus 55 vdp~~Ll~aI~k-Gk~Ael~ 73 (143)
++++.+-.+|-| |..+.++
T Consensus 106 V~~~d~~~AIGk~G~Ni~la 125 (141)
T TIGR01952 106 VHPRDKGIAIGKGGKNIERA 125 (141)
T ss_pred EChhhhhhhhCCCchhHHHH
Confidence 556666666666 6655543
No 345
>PRK13562 acetolactate synthase 1 regulatory subunit; Provisional
Probab=20.50 E-value=3.2e+02 Score=19.33 Aligned_cols=36 Identities=17% Similarity=0.217 Sum_probs=23.8
Q ss_pred EEEEEEeecChhhHHHHHHHHhccCCe-e-EEEeCCCC
Q 048021 12 TCGLKVDTNDPAWFSSMTKVLRKIKGA-S-YTIDAEEG 47 (143)
Q Consensus 12 tvvlkV~m~C~gC~~kIkkaL~kI~GV-s-V~VDle~~ 47 (143)
.+++-|+...+.=...|.|-|+|+..| . .++|+.++
T Consensus 45 Rmtivv~~~d~~~ieqI~kQL~KlidVikV~~~~~~~~ 82 (84)
T PRK13562 45 NMEIQVDIQDDTSLHILIKKLKQQINVLTVECYDLVDN 82 (84)
T ss_pred EEEEEEeCCCHHHHHHHHHHHhCCccEEEEEEeecccc
Confidence 455555545666777788888888777 5 45666554
No 346
>PF05193 Peptidase_M16_C: Peptidase M16 inactive domain; InterPro: IPR007863 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. These metallopeptidases belong to MEROPS peptidase family M16 (clan ME). They include proteins, which are classified as non-peptidase homologues either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity. The peptidases in this group of sequences include: Insulinase, insulin-degrading enzyme (3.4.24.56 from EC) Mitochondrial processing peptidase alpha subunit, (Alpha-MPP, 3.4.24.64 from EC) Pitrlysin, Protease III precursor (3.4.24.55 from EC) Nardilysin, (3.4.24.61 from EC) Ubiquinol-cytochrome C reductase complex core protein I,mitochondrial precursor (1.10.2.2 from EC) Coenzyme PQQ synthesis protein F (3.4.99 from EC) These proteins do not share many regions of sequence similarity; the most noticeable is in the N-terminal section. This region includes a conserved histidine followed, two residues later by a glutamate and another histidine. In pitrilysin, it has been shown [] that this H-x-x-E-H motif is involved in enzymatic activity; the two histidines bind zinc and the glutamate is necessary for catalytic activity. The mitochondrial processing peptidase consists of two structurally related domains. One is the active peptidase whereas the other, the C-terminal region, is inactive. The two domains hold the substrate like a clamp [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis; PDB: 1BE3_B 1PP9_B 2A06_B 1SQB_B 1SQP_B 1L0N_B 1SQX_B 1NU1_B 1L0L_B 2FYU_B ....
Probab=20.47 E-value=96 Score=21.81 Aligned_cols=21 Identities=19% Similarity=0.336 Sum_probs=18.1
Q ss_pred CCEEEEEeecCHHHHHHHHhc
Q 048021 46 EGMAYITGRANPRKLLRKLQS 66 (143)
Q Consensus 46 ~~~VtV~G~vdp~~Ll~aI~k 66 (143)
+-++.|.|.++++++++.|++
T Consensus 19 n~~l~i~Gd~~~~~~~~~i~~ 39 (184)
T PF05193_consen 19 NMTLVIVGDIDPDELEKLIEK 39 (184)
T ss_dssp GEEEEEEESSGHHHHHHHHHH
T ss_pred ceEEEEEcCccHHHHHHHHHh
Confidence 446788899999999999988
No 347
>cd08591 PI-PLCc_beta Catalytic domain of metazoan phosphoinositide-specific phospholipase C-beta. This subfamily corresponds to the catalytic domain present in metazoan phosphoinositide-specific phospholipase C (PI-PLC, EC 3.1.4.11)-beta isozymes. PI-PLC is a signaling enzyme that hydrolyzes the membrane phospholipids phosphatidylinositol-4,5-bisphosphate (PIP2) to generate two important second messengers in eukaryotic signal transduction cascades, Inositol 1,4,5-trisphosphate (InsP3) and diacylglycerol (DAG). InsP3 triggers inflow of calcium from intracellular stores, while DAG, together with calcium, activates protein kinase C, which goes on to phosphorylate other molecules, leading to altered cellular activity. Calcium is required for the catalysis. PLC-beta represents a class of mammalian PI-PLC that has an N-terminal pleckstrin homology (PH) domain, an array of EF hands, a PLC catalytic core domain, a C2 domain, and a unique C-terminal coiled-coil (CT) domain necessary for homod
Probab=20.40 E-value=86 Score=26.58 Aligned_cols=51 Identities=14% Similarity=0.174 Sum_probs=35.8
Q ss_pred HHHHHHHHhccCCe-eEEEeCCCCE-----EEEE------eecCHHHHHHHHhc-CCceE----EeecCC
Q 048021 25 FSSMTKVLRKIKGA-SYTIDAEEGM-----AYIT------GRANPRKLLRKLQS-GKYAN----LCWVSA 77 (143)
Q Consensus 25 ~~kIkkaL~kI~GV-sV~VDle~~~-----VtV~------G~vdp~~Ll~aI~k-Gk~Ae----l~~~~~ 77 (143)
+..+.+||. .|. -|++|..+|. -.|. +.+.-..++++|++ ++.+. |++.+.
T Consensus 32 ~e~y~~aL~--~GcRcvElD~Wdg~~~~~eP~V~HG~tlts~i~f~~v~~aIk~~AF~~s~yPvIlslE~ 99 (257)
T cd08591 32 VEMYRQVLL--SGCRCIELDCWDGKGEDEEPIITHGKTMCTEILFKDVIEAIAETAFKTSEYPVILSFEN 99 (257)
T ss_pred HHHHHHHHH--hCCcEEEEEeecCCCCCCCCEEeeCCCCccCeEHHHHHHHHHHHhccCCCCCEEEEEec
Confidence 567888886 588 5999876544 3332 45788999999999 88753 555543
No 348
>cd08593 PI-PLCc_delta Catalytic domain of metazoan phosphoinositide-specific phospholipase C-delta. This subfamily corresponds to the catalytic domain present in metazoan phosphoinositide-specific phospholipase C (PI-PLC, EC 3.1.4.11)-delta isozymes. PI-PLC is a signaling enzyme that hydrolyzes the membrane phospholipids phosphatidylinositol-4,5-bisphosphate (PIP2) to generate two important second messengers in eukaryotic signal transduction cascades, Inositol 1,4,5-trisphosphate (InsP3) and diacylglycerol (DAG). InsP3 triggers inflow of calcium from intracellular stores, while DAG, together with calcium, activates protein kinase C, which then phosphorylates other molecules, leading to altered cellular activity. Calcium is required for the catalysis. PLC-delta represents a class of mammalian PI-PLC that has an N-terminal pleckstrin homology (PH) domain, an array of EF hands, a PLC catalytic core domain, and a C-terminal C2 domain. This CD corresponds to the catalytic domain which is
Probab=20.37 E-value=1.6e+02 Score=24.86 Aligned_cols=54 Identities=15% Similarity=0.293 Sum_probs=38.2
Q ss_pred hhhHHHHHHHHhccCCe-eEEEeCCCCE---EEE-Ee-----ecCHHHHHHHHhc-CCceE----EeecCC
Q 048021 22 PAWFSSMTKVLRKIKGA-SYTIDAEEGM---AYI-TG-----RANPRKLLRKLQS-GKYAN----LCWVSA 77 (143)
Q Consensus 22 ~gC~~kIkkaL~kI~GV-sV~VDle~~~---VtV-~G-----~vdp~~Ll~aI~k-Gk~Ae----l~~~~~ 77 (143)
...+..+++||+ .|+ -|++|..+|. -.| .| .+.-..++++|++ ++.+. |++.+.
T Consensus 29 ~ss~~~y~~aL~--~GcRcvElD~wdg~~~eP~v~HG~t~t~~i~f~~v~~~I~~~aF~~s~yPvIlslE~ 97 (257)
T cd08593 29 PSSTEAYIRALK--KGCRCVELDCWDGPDGEPIIYHGHTLTSKILFKDVIQAIREYAFKVSPYPVILSLEN 97 (257)
T ss_pred CccHHHHHHHHH--hCCcEEEEEeecCCCCCcEEeeCCccccCcCHHHHHHHHHHHhccCCCCCEEEEeec
Confidence 346778899987 588 5999987552 233 34 4788999999999 88753 555543
No 349
>PRK13586 1-(5-phosphoribosyl)-5-[(5-phosphoribosylamino)methylideneamino] imidazole-4-carboxamide isomerase; Provisional
Probab=20.33 E-value=3.5e+02 Score=21.94 Aligned_cols=50 Identities=16% Similarity=0.234 Sum_probs=33.6
Q ss_pred HHHHHHhccCC--eeEEEeC-CCCEEEEEe----ecCHHHHHHHHhc-CCc-eEEeecC
Q 048021 27 SMTKVLRKIKG--ASYTIDA-EEGMAYITG----RANPRKLLRKLQS-GKY-ANLCWVS 76 (143)
Q Consensus 27 kIkkaL~kI~G--VsV~VDl-e~~~VtV~G----~vdp~~Ll~aI~k-Gk~-Ael~~~~ 76 (143)
.++++.++..+ |-+.+|. ++++|.+.| ..++..+++.+++ |.. ..+..+.
T Consensus 111 ~~~~~~~~~g~~~ivvslD~~~~~~v~~~gw~~~~~~~~e~~~~l~~~g~~~ii~tdI~ 169 (232)
T PRK13586 111 LFHDIVREIGSNRVLVSIDYDNTKRVLIRGWKEKSMEVIDGIKKVNELELLGIIFTYIS 169 (232)
T ss_pred HHHHHHHHhCCCCEEEEEEcCCCCEEEccCCeeCCCCHHHHHHHHHhcCCCEEEEeccc
Confidence 45555555522 3577888 688898876 2578899999999 864 3444443
No 350
>PLN02626 malate synthase
Probab=20.27 E-value=1.3e+02 Score=28.41 Aligned_cols=43 Identities=19% Similarity=0.204 Sum_probs=36.8
Q ss_pred eCCCCEEEEEeecCHHHHHHHHhcCCceEEeecCCCCCCcccc
Q 048021 43 DAEEGMAYITGRANPRKLLRKLQSGKYANLCWVSAGNQITYGN 85 (143)
Q Consensus 43 Dle~~~VtV~G~vdp~~Ll~aI~kGk~Ael~~~~~~~s~~~~~ 85 (143)
|+.+.+|.|+|-+|...++.+|.+|-++-+.-.+.+++++..|
T Consensus 87 ~L~dRrvEItgP~drkm~inalNSga~~~maDfEDs~sPtW~n 129 (551)
T PLN02626 87 AVADRRVEITGPVERKMVINALNSGAKVFMADFEDSLSPTWEN 129 (551)
T ss_pred hhccceeeecCCCcHHHHHHHHcCCCCEEEecCCccCCCcchh
Confidence 5778999999999999999999999999988888887776433
No 351
>PRK01565 thiamine biosynthesis protein ThiI; Provisional
Probab=20.25 E-value=2e+02 Score=25.22 Aligned_cols=11 Identities=27% Similarity=0.477 Sum_probs=7.2
Q ss_pred HHHHHhccCCe
Q 048021 28 MTKVLRKIKGA 38 (143)
Q Consensus 28 IkkaL~kI~GV 38 (143)
+.++|+++.||
T Consensus 61 ~~~~l~~vfGI 71 (394)
T PRK01565 61 VIERLKKVFGI 71 (394)
T ss_pred HHHHHhhCCCc
Confidence 55566777776
No 352
>COG3260 Ni,Fe-hydrogenase III small subunit [Energy production and conversion]
Probab=20.19 E-value=90 Score=24.52 Aligned_cols=65 Identities=12% Similarity=0.043 Sum_probs=41.7
Q ss_pred EEEEEEe-ecChhhHHHHHHHHhccC-----CeeEEEeC-CCCEEEEEeecC---HHHHHHHHhc-CCceEEeecC
Q 048021 12 TCGLKVD-TNDPAWFSSMTKVLRKIK-----GASYTIDA-EEGMAYITGRAN---PRKLLRKLQS-GKYANLCWVS 76 (143)
Q Consensus 12 tvvlkV~-m~C~gC~~kIkkaL~kI~-----GVsV~VDl-e~~~VtV~G~vd---p~~Ll~aI~k-Gk~Ael~~~~ 76 (143)
.-++.|+ +.|+||.=.|..+|.-.= |+.+.-+. .+..+.|+|.+. .+.|+++.++ =..-.++.++
T Consensus 9 ~~v~~VdsG~CNgCdIEi~a~~sP~YDaErfgI~~v~sPRhADiLlVTG~vT~~~~e~lkk~Yea~PePKiViA~G 84 (148)
T COG3260 9 LHVYHVDSGGCNGCDIEILAALSPRYDAERFGIKVVNSPRHADILLVTGAVTRQMREPLKKAYEAMPEPKIVIAVG 84 (148)
T ss_pred eEEEEeccCCcCceeEEEeeccCcccchHHheeEEeCCCccccEEEEeccccHHHHHHHHHHHHhCCCCcEEEEEc
Confidence 4567888 599999977776655332 33333333 478889999764 6677777777 4444445553
No 353
>PF09186 DUF1949: Domain of unknown function (DUF1949); InterPro: IPR015269 Members of this entry are a set of functionally uncharacterised hypothetical bacterial proteins. They adopt a ferredoxin-like fold, with a beta-alpha-beta-beta-alpha-beta arrangement []. This entry contains the protein Impact, which is a translational regulator that ensures constant high levels of translation under amino acid starvation. It acts by interacting with Gcn1/Gcn1L1, thereby preventing activation of Gcn2 protein kinases (EIF2AK1 to 4) and subsequent down-regulation of protein synthesis. It is evolutionary conserved from eukaryotes to archaea []. ; PDB: 2CVE_A 1VI7_A.
Probab=20.15 E-value=2.1e+02 Score=17.06 Aligned_cols=42 Identities=10% Similarity=0.095 Sum_probs=28.7
Q ss_pred hHHHHHHHHhccCCeeEEEeCCCCEEEEEeec---CHHHHHHHHhc
Q 048021 24 WFSSMTKVLRKIKGASYTIDAEEGMAYITGRA---NPRKLLRKLQS 66 (143)
Q Consensus 24 C~~kIkkaL~kI~GVsV~VDle~~~VtV~G~v---dp~~Ll~aI~k 66 (143)
...+|++.|....+.-.+.+-.+. |++...+ +.+.+.++|..
T Consensus 7 ~~~~v~~~l~~~~~~i~~~~y~~~-V~~~v~v~~~~~~~f~~~l~~ 51 (56)
T PF09186_consen 7 QYGKVERLLEQNGIEIVDEDYTDD-VTLTVAVPEEEVEEFKAQLTD 51 (56)
T ss_dssp CHHHHHHHHHHTTTEEEEEEECTT-EEEEEEEECCCHHHHHHHHHH
T ss_pred hHHHHHHHHHHCCCEEEcceecce-EEEEEEECHHHHHHHHHHHHH
Confidence 356889999886554456666655 7777654 56778887765
No 354
>PRK12698 flgH flagellar basal body L-ring protein; Reviewed
Probab=20.08 E-value=72 Score=26.31 Aligned_cols=28 Identities=18% Similarity=0.345 Sum_probs=23.2
Q ss_pred ccCCe-eEEEeCCCCEEEEEeecCHHHHH
Q 048021 34 KIKGA-SYTIDAEEGMAYITGRANPRKLL 61 (143)
Q Consensus 34 kI~GV-sV~VDle~~~VtV~G~vdp~~Ll 61 (143)
.|+|- .+.|+-+++.++|+|.+.|+.|-
T Consensus 153 ~I~GeK~i~vN~~~e~I~lsGvVRP~DI~ 181 (224)
T PRK12698 153 VIRGEKWISINNGDEFIRLTGIVRSQDIT 181 (224)
T ss_pred EEEEEEEEEECCCEEEEEEEEEECHHHCC
Confidence 45676 59999999999999998888764
No 355
>COG2761 FrnE Predicted dithiol-disulfide isomerase involved in polyketide biosynthesis [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=20.05 E-value=2e+02 Score=23.91 Aligned_cols=37 Identities=19% Similarity=0.131 Sum_probs=25.1
Q ss_pred CceEEEEEE--eecChhh---HHHHHHHHhccCCe-eEEEeCC
Q 048021 9 AKMTCGLKV--DTNDPAW---FSSMTKVLRKIKGA-SYTIDAE 45 (143)
Q Consensus 9 ~~~tvvlkV--~m~C~gC---~~kIkkaL~kI~GV-sV~VDle 45 (143)
+.+++++.| +.-|+-| .++++++|.+.++- .|+|...
T Consensus 2 ~~~~i~I~v~sD~vCPwC~ig~~rL~ka~~~~~~~~~v~i~w~ 44 (225)
T COG2761 2 NPMKIEIDVFSDVVCPWCYIGKRRLEKALAEYPQEVRVEIRWR 44 (225)
T ss_pred CCceEEEEEEeCCcCchhhcCHHHHHHHHHhcCcceeEEEEec
Confidence 345555555 6789999 57888888888753 5555443
Done!